Fastest AI models (seconds per correct answer)

Ranked by wall-seconds per correct answer. For interactive and high-throughput workloads, latency per useful result — not raw tokens/sec — is what users feel. Models that think a lot to get there pay for it here.

Ranking 22 models across the meo-tested roster · as of 2026-06-08.

#	Model	Lab	sec/correct
1	Perceptron: Perceptron Mk1	perceptron	24
2	Anthropic: Claude Opus 4.8	anthropic	27
3	xAI: Grok 4.3	x-ai	27
4	OpenAI: GPT-5.5	openai	29
5	Google: Gemini 3.5 Flash	google	45
6	inclusionAI: Ring-2.6-1T	inclusionai	59
7	Google: Gemini 3.1 Pro Preview	google	69
8	Google: Gemini 3.1 Flash Lite	google	72
9	Xiaomi: MiMo-V2.5	xiaomi	94
10	MiniMax: MiniMax M3	minimax	129
11	Owl Alpha	openrouter	146
12	NVIDIA: Nemotron 3 Ultra	nvidia	147
13	StepFun: Step 3.7 Flash	stepfun	191
14	Qwen: Qwen3.7 Max	qwen	201
15	Qwen: Qwen3.7 Plus	qwen	213
16	MoonshotAI: Kimi K2.6	moonshotai	251
17	Xiaomi: MiMo-V2.5-Pro	xiaomi	288
18	Arcee AI: Trinity Large Thinking	arcee-ai	289
19	DeepSeek: DeepSeek V4 Pro	deepseek	327
20	DeepSeek: DeepSeek V4 Flash	deepseek	370
21	Z.ai: GLM 5.1	z-ai	434
22	Tencent: Hy3 preview	tencent	579

← All rankings Methodology & 𝕍 →