
MATH-500 — AI model leaderboard

AI models ranked by MATH-500, an aggregated third-party benchmark from Artificial Analysis. Higher is better. Scores are cross-referenced against our first-party meo scores and Effective Value (𝕍).

Ranking 25 models across the full field · as of 2026-06-07.

| #  | Model                        | Lab        | MATH-500 |
|----|------------------------------|------------|----------|
| 1  | OpenAI: GPT-5                | openai     | 99.4%    |
| 2  | OpenAI: o3                   | openai     | 99.2%    |
| 3  | OpenAI: o4 Mini              | openai     | 98.9%    |
| 4  | OpenAI: o3 Mini High         | openai     | 98.5%    |
| 5  | OpenAI: o3 Mini              | openai     | 97.3%    |
| 6  | Google: Gemini 2.5 Pro       | google     | 96.7%    |
| 7  | Google: Gemini 2.5 Flash     | google     | 93.2%    |
| 8  | OpenAI: GPT-4.1 Mini         | openai     | 92.5%    |
| 9  | OpenAI: o1                   | openai     | 92.4%    |
| 10 | OpenAI: GPT-4.1              | openai     | 91.3%    |
| 11 | Reka Flash 3                 | rekaai     | 89.3%    |
| 12 | Meta: Llama 4 Maverick       | meta-llama | 88.9%    |
| 13 | Google: Gemma 3 27B          | google     | 88.3%    |
| 14 | Google: Gemma 3 12B          | google     | 85.3%    |
| 15 | OpenAI: GPT-4.1 Nano         | openai     | 84.8%    |
| 16 | Meta: Llama 4 Scout          | meta-llama | 84.4%    |
| 17 | Cohere: Command A            | cohere     | 81.9%    |
| 18 | Microsoft: Phi 4             | microsoft  | 81.0%    |
| 19 | OpenAI: GPT-4o (2024-08-06)  | openai     | 79.5%    |
| 20 | OpenAI: GPT-4o (2024-05-13)  | openai     | 79.1%    |
| 21 | OpenAI: GPT-4o-mini          | openai     | 78.9%    |
| 22 | Google: Gemma 3 4B           | google     | 76.6%    |
| 23 | OpenAI: GPT-4o               | openai     | 75.9%    |
| 24 | OpenAI: GPT-4 Turbo          | openai     | 73.7%    |
| 25 | Microsoft: Phi 4 Mini Instruct | microsoft | 69.6%   |

Source: Artificial Analysis (artificialanalysis.ai). Redistribution requires an AA commercial license.
