Vendor | Language model | Release date | examples=3 | examples=10 |
---|---|---|---|---|
Anthropic | claude-3-5-haiku-20241022 | 2024-10-22 | anthropic | anthropic10 |
Anthropic | claude-3-7-sonnet-20250219 | 2025-02-19 | anthropic37 | anthropic3710 |
Anthropic | claude-opus-4-20250514 | 2025-05-14 | opus40 | opus4010 |
Anthropic | claude-sonnet-4-20250514 | 2025-05-14 | sonnet40 | sonnet4010 |
Deep Cogito | cogito:70b | 2025-04-15 | cogito | |
DeepSeek AI | deepseek-r1:70b | 2025-01-20 | deepseek | |
Google DeepMind | gemini-2.0-flash | 2025-01-30 | gemini | gemini10 |
Google DeepMind | gemini-2.0-pro-exp | 2025-02-05 | geminipro | geminipro10 |
Google DeepMind | gemini-2.5-pro-exp-03-25 | 2025-03-25 | gemini25 | gemini2510 |
Google DeepMind | gemma3:27b | 2025-03-12 | gemma gemma3 | |
Meta | llama3.3:latest | 2024-12-06 | llama | |
Microsoft | phi4:latest | 2024-12-12 | phi | |
OpenAI | gpt-3.5-turbo-0125 | 2024-01-25 | openai35 | |
OpenAI | gpt-4.1 | 2025-04-14 | openai41 | openai4110 |
OpenAI | gpt-4.5-preview | 2025-02-27 | openai45 | openai4510 |
OpenAI | gpt-4o-2024-11-20 | 2024-11-20 | openai | openai10 |
OpenAI | gpt-4o-2025-01-29 | 2025-01-29 | gpt-4o-legacy | gpt-4o-legacy10 |
OpenAI | gpt-4o-mini | 2024-07-18 | openailong | |
OpenAI | o1 | 2024-12-05 | openai10o1 openaio1 | |
OpenAI | o3 | 2025-04-16 | openaio3 | openaio310 |
Qwen | qwq:32b | 2025-06-25 | qwq | |
Random | random | 2024-01-01 | random | |
TII (UAE) | falcon3:10b | 2024-12-17 | falcon | falcon10 |

Wilcoxon statistic: 605.00, p-value: 0.75372
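
For readers unfamiliar with the test, the following is a minimal sketch of how a two-sided Wilcoxon signed-rank statistic and p-value can be computed for paired measurements. The source does not state which values were paired or which library produced the numbers above, so the function name `wilcoxon_signed_rank` and the sample data are hypothetical. The p-value here uses the plain normal approximation with no continuity correction, so it will not necessarily reproduce the reported figure. In practice, `scipy.stats.wilcoxon` is a common choice for this test.

```python
import math
from statistics import NormalDist


def wilcoxon_signed_rank(x, y):
    """Two-sided Wilcoxon signed-rank test (normal approximation).

    Returns (statistic, p_value), where statistic is min(W+, W-).
    Zero differences are dropped; tied |differences| share an average rank.
    """
    diffs = [a - b for a, b in zip(x, y) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("all paired differences are zero")

    # Rank the absolute differences, averaging ranks within tie groups.
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        t = j - i + 1
        tie_term += t**3 - t
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    statistic = min(w_plus, w_minus)

    # Null distribution of the rank sum: mean and tie-corrected variance.
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (statistic - mean) / math.sqrt(var)
    # statistic is the smaller rank sum, so z <= 0 and 2 * cdf(z) <= 1.
    p_value = min(1.0, 2 * NormalDist().cdf(z))
    return statistic, p_value


# Hypothetical paired scores (e.g. one pair per model), for illustration only.
scores_a = [10, 12, 15, 9, 20]
scores_b = [8, 13, 11, 9, 14]
stat, p = wilcoxon_signed_rank(scores_a, scores_b)
print(f"Wilcoxon statistic: {stat:.2f}, p-value: {p:.5f}")
```

A large p-value, such as the one reported above, means the test found no evidence of a systematic difference between the two paired conditions.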