Skip to main content

SciCode — AI model leaderboard

AI models ranked by SciCode, an aggregated third-party benchmark from artificial_analysis. Higher is better. Cross-referenced against our first-party meo scores and Effective Value (𝕍).

Ranking 82 models across the full field · as of 2026-06-07.

#ModelLabSciCode
1Google: Gemini 3.1 Pro Previewgoogle58.9%
2OpenAI: GPT-5.4openai56.6%
3OpenAI: GPT-5.5openai56.1%
4OpenAI: GPT-5.2-Codexopenai54.6%
5Anthropic: Claude Opus 4.7anthropic54.5%
6Anthropic: Claude Opus 4.8anthropic53.5%
7OpenAI: GPT-5.3-Codexopenai53.2%
8Google: Gemini 3.5 Flashgoogle53.1%
9OpenAI: GPT-5.2 Chatopenai52.1%
10Xiaomi: MiMo-V2.5-Proxiaomi50.2%
11DeepSeek: DeepSeek V4 Prodeepseek50.0%
12OpenAI: GPT-5.4 Miniopenai49.9%
13Google: Gemini 3 Flash Previewgoogle49.9%
14Qwen: Qwen3.7 Maxqwen48.8%
15xAI: Grok 4.3x-ai47.3%
16MiniMax: MiniMax M2.7minimax47.0%
17Anthropic: Claude Sonnet 4.6anthropic46.9%
18OpenAI: GPT-5.4 Nanoopenai46.9%
19OpenAI: o4 Miniopenai46.5%
20Qwen: Qwen3.7 Plusqwen45.5%
21MiniMax: MiniMax M3minimax45.4%
22DeepSeek: DeepSeek V4 Flashdeepseek44.9%
23Z.ai: GLM 5.1z-ai43.8%
24Z.ai: GLM 5 Turboz-ai43.6%
25Z.ai: GLM 5V Turboz-ai43.5%
26Google: Gemma 4 31Bgoogle43.4%
27OpenAI: GPT-5.1openai43.3%
28OpenAI: GPT-5openai42.9%
29Google: Gemini 2.5 Progoogle42.8%
30OpenAI: GPT-5.1-Codex-Miniopenai42.6%
31inclusionAI: Ring-2.6-1Tinclusionai42.4%
32Qwen: Qwen3.5 397B A17Bqwen42.0%
33Qwen: Qwen3.5-122B-A10Bqwen42.0%
34Google: Gemini 3.1 Flash Litegoogle41.9%
35Tencent: Hy3 previewtencent41.2%
36OpenAI: o3openai41.0%
37OpenAI: GPT-5 Codexopenai40.9%
38Qwen: Qwen3.6 Plusqwen40.7%
39OpenAI: GPT-4.1 Miniopenai40.4%
40OpenAI: GPT-5.1-Codexopenai40.2%
41StepFun: Step 3.7 Flashstepfun40.0%
42Google: Gemma 4 26B A4B (free)google40.0%
43OpenAI: o3 Miniopenai39.9%
44Qwen: Qwen3.6 27Bqwen39.8%
45OpenAI: o3 Mini Highopenai39.8%
46Mistral: Mistral Medium 3.5mistralai39.6%
47OpenAI: GPT-5 Miniopenai39.2%
48Prime Intellect: INTELLECT-3prime-intellect39.1%
49OpenAI: gpt-oss-120bopenai38.9%
50Inception: Mercury 2inception38.7%
51StepFun: Step 3.5 Flashstepfun38.5%
52Kwaipilot: KAT-Coder-Pro V2kwaipilot38.3%
53OpenAI: GPT-4.1openai38.1%
54inclusionAI: Ling-2.6-1Tinclusionai37.0%
55OpenAI: GPT-5 Nanoopenai36.6%
56Arcee AI: Trinity Large Thinkingarcee-ai36.1%
57Qwen: Qwen3.6 35B A3Bqwen35.8%
58OpenAI: o1openai35.8%
59OpenAI: gpt-oss-20bopenai34.4%
60OpenAI: GPT-4oopenai33.3%
61OpenAI: GPT-4o (2024-08-06)openai33.1%
62Meta: Llama 4 Maverickmeta-llama33.1%
63Qwen: Qwen3 Coder Nextqwen32.3%
64OpenAI: GPT-4 Turboopenai31.9%
65OpenAI: GPT-4o (2024-05-13)openai30.9%
66Google: Gemini 2.5 Flashgoogle29.1%
67Google: Gemini 2.5 Flash Lite Preview 09-2025google28.5%
68Cohere: Command Acohere28.1%
69Qwen: Qwen3.5-9Bqwen27.5%
70inclusionAI: Ling-2.6-flashinclusionai27.1%
71Reka Flash 3rekaai26.7%
72Microsoft: Phi 4microsoft26.0%
73Xiaomi: MiMo-V2-Flashxiaomi25.9%
74OpenAI: GPT-4.1 Nanoopenai25.9%
75Upstage: Solar Pro 3upstage24.7%
76OpenAI: GPT-4o-miniopenai22.9%
77IBM: Granite 4.1 8Bibm-granite21.8%
78Google: Gemma 3 27Bgoogle21.2%
79Google: Gemma 3 12Bgoogle17.4%
80Meta: Llama 4 Scoutmeta-llama17.0%
81Microsoft: Phi 4 Mini Instructmicrosoft10.8%
82Google: Gemma 3 4Bgoogle7.3%

Artificial Analysis (artificialanalysis.ai). Redistribution requires an AA commercial license.

← All rankingsMethodology & 𝕍 →