Skip to main content

AIME — AI model leaderboard

AI models ranked by AIME, an aggregated third-party benchmark from artificial_analysis. Higher is better. Cross-referenced against our first-party meo scores and Effective Value (𝕍).

Ranking 25 models across the full field · as of 2026-06-07.

#ModelLabAIME
1OpenAI: GPT-5openai95.7%
2OpenAI: o4 Miniopenai94.0%
3OpenAI: o3openai90.3%
4Google: Gemini 2.5 Progoogle88.7%
5OpenAI: o3 Mini Highopenai86.0%
6OpenAI: o3 Miniopenai77.0%
7OpenAI: o1openai72.3%
8Reka Flash 3rekaai51.0%
9Google: Gemini 2.5 Flashgoogle50.0%
10OpenAI: GPT-4.1openai43.7%
11OpenAI: GPT-4.1 Miniopenai43.0%
12Meta: Llama 4 Maverickmeta-llama39.0%
13Meta: Llama 4 Scoutmeta-llama28.3%
14Google: Gemma 3 27Bgoogle25.3%
15OpenAI: GPT-4.1 Nanoopenai23.7%
16Google: Gemma 3 12Bgoogle22.0%
17OpenAI: GPT-4oopenai15.0%
18OpenAI: GPT-4 Turboopenai15.0%
19Microsoft: Phi 4microsoft14.3%
20OpenAI: GPT-4o (2024-08-06)openai11.7%
21OpenAI: GPT-4o-miniopenai11.7%
22OpenAI: GPT-4o (2024-05-13)openai11.0%
23Cohere: Command Acohere9.7%
24Google: Gemma 3 4Bgoogle6.3%
25Microsoft: Phi 4 Mini Instructmicrosoft3.0%

Artificial Analysis (artificialanalysis.ai). Redistribution requires an AA commercial license.

← All rankingsMethodology & 𝕍 →