Fastest AI models (seconds per correct answer)
Ranked by wall-clock seconds per correct answer (lower is better). For interactive and high-throughput workloads, latency per useful result, not raw tokens/sec, is what users feel. Models that reason at length before answering pay for it here.
Ranking 22 models across the meo-tested roster · as of 2026-06-08.
| # | Model | Lab | sec/correct |
|---|---|---|---|
| 1 | Perceptron: Perceptron Mk1 | perceptron | 24 |
| 2 | Anthropic: Claude Opus 4.8 | anthropic | 27 |
| 3 | xAI: Grok 4.3 | x-ai | 27 |
| 4 | OpenAI: GPT-5.5 | openai | 29 |
| 5 | Google: Gemini 3.5 Flash | google | 45 |
| 6 | inclusionAI: Ring-2.6-1T | inclusionai | 59 |
| 7 | Google: Gemini 3.1 Pro Preview | google | 69 |
| 8 | Google: Gemini 3.1 Flash Lite | google | 72 |
| 9 | Xiaomi: MiMo-V2.5 | xiaomi | 94 |
| 10 | MiniMax: MiniMax M3 | minimax | 129 |
| 11 | Owl Alpha | openrouter | 146 |
| 12 | NVIDIA: Nemotron 3 Ultra | nvidia | 147 |
| 13 | StepFun: Step 3.7 Flash | stepfun | 191 |
| 14 | Qwen: Qwen3.7 Max | qwen | 201 |
| 15 | Qwen: Qwen3.7 Plus | qwen | 213 |
| 16 | MoonshotAI: Kimi K2.6 | moonshotai | 251 |
| 17 | Xiaomi: MiMo-V2.5-Pro | xiaomi | 288 |
| 18 | Arcee AI: Trinity Large Thinking | arcee-ai | 289 |
| 19 | DeepSeek: DeepSeek V4 Pro | deepseek | 327 |
| 20 | DeepSeek: DeepSeek V4 Flash | deepseek | 370 |
| 21 | Z.ai: GLM 5.1 | z-ai | 434 |
| 22 | Tencent: Hy3 preview | tencent | 579 |
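
The page does not spell out how sec/correct is computed. A minimal sketch, assuming it is total wall-clock time across all tasks divided by the number of correct answers, might look like the following. The `Run` type, the function names, and the sample figures are hypothetical and for illustration only; they are not taken from the benchmark.

```python
from dataclasses import dataclass


@dataclass
class Run:
    """One model's aggregate result (hypothetical structure)."""
    model: str
    wall_seconds: float  # total wall-clock time spent across all tasks
    correct: int         # number of tasks answered correctly


def sec_per_correct(run: Run) -> float:
    """Wall-clock seconds per correct answer (assumed definition)."""
    if run.correct == 0:
        # No correct answers: treat the cost as unbounded so the model ranks last.
        return float("inf")
    return run.wall_seconds / run.correct


def rank(runs: list[Run]) -> list[Run]:
    """Order runs from lowest (best) to highest sec/correct."""
    return sorted(runs, key=sec_per_correct)


if __name__ == "__main__":
    # Made-up numbers, not benchmark data.
    runs = [
        Run("model-b", wall_seconds=600.0, correct=10),
        Run("model-a", wall_seconds=1200.0, correct=40),
        Run("model-c", wall_seconds=900.0, correct=0),
    ]
    for position, run in enumerate(rank(runs), start=1):
        print(f"{position}. {run.model}: {sec_per_correct(run):.0f} sec/correct")
```

In this made-up data, `model-b` finishes in half the total time of `model-a` but still ranks below it, because it gets far fewer answers right. That is the sense in which the metric rewards latency per useful result over raw speed.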