HF Hub Benchmark Dashboard
Last updated: 2026-06-02T10:17:05 UTC · auto-refreshes every 6h
30 benchmarks
305 models
614 entries
27 active
3 empty
Knowledge · 3 benchmarks · 88 models
GPQA
50 entries
| Rank | Model | Score | Input $/1M tokens | Output $/1M tokens | Context | TTFT | Throughput | License | Params | Providers |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | moonshotai/Kimi-K2.6 | 90.5 | $0.75 | $3.40 | 262.1K | 232 ms | 78 t/s | other | 1059B | novita, together, fireworks-ai +2 |
| 2 | deepseek-ai/DeepSeek-V4-Pro | 90.1 | $1.60 | $3.38 | 1.0M | 653 ms | 59 t/s | mit | 862B | novita, together, fireworks-ai +2 |
| 3 | FINAL-Bench/Darwin-28B-REASON | 89.39 | – | – | – | – | – | apache-2.0 | 27B | – |
| 4 | OrionLLM/GRM-2.6-Opus | 89.2 | – | – | – | – | – | apache-2.0 | 28B | – |
| 5 | FINAL-Bench/Darwin-28B-Opus | 88.89 | – | – | – | – | – | apache-2.0 | 28B | – |
| 6 | Qwen/Qwen3.5-397B-A17B | 88.4 | $0.49 | $3.60 | 262.1K | 400 ms | 189 t/s | apache-2.0 | 403B | novita, together, featherless-ai +3 |
| 7 | FINAL-Bench/Darwin-36B-Opus | 88.4 | – | – | – | – | – | apache-2.0 | 35B | – |
| 8 | FINAL-Bench/Darwin-60B-DUO | 88.38 | – | – | – | – | – | gemma | – | – |
| 9 | OrionLLM/GRM-2.6-Plus | 88.3 | – | – | – | – | – | apache-2.0 | 28B | – |
| 10 | inclusionAI/Ring-2.6-1T | 88.27 | – | – | – | – | – | mit | 1026B | – |
| 11 | deepseek-ai/DeepSeek-V4-Flash | 88.1 | $0.14 | $0.28 | 1.0M | 636 ms | 113 t/s | mit | 158B | novita, fireworks-ai, featherless-ai +1 |
| 12 | Qwen/Qwen3.6-27B | 87.8 | – | – | – | – | – | apache-2.0 | 28B | – |
| 13 | moonshotai/Kimi-K2.5 | 87.6 | $0.60 | $3.00 | 262.1K | 387 ms | 40 t/s | other | 1059B | novita, fireworks-ai, featherless-ai |
| 14 | tencent/Hy3-preview | 87.2 | – | – | – | – | – | other | 299B | – |
| 15 | FINAL-Bench/Darwin-27B-Opus | 86.9 | – | – | – | – | – | apache-2.0 | 28B | – |
| 16 | Qwen/Qwen3.5-122B-A10B | 86.6 | $0.29 | $2.40 | 262.1K | 419 ms | 80 t/s | apache-2.0 | 125B | novita, deepinfra |
| 17 | zai-org/GLM-5.1 | 86.2 | $1.05 | $3.50 | 202.8K | 528 ms | 63 t/s | mit | 754B | together, fireworks-ai, featherless-ai +2 |
| 18 | zai-org/GLM-5 | 86 | $1.00 | $3.20 | 202.8K | 579 ms | 88 t/s | mit | 754B | novita, together, featherless-ai +1 |
| 19 | Qwen/Qwen3.6-35B-A3B | 86 | $0.15 | $0.95 | 262.1K | 379 ms | 83 t/s | apache-2.0 | 36B | featherless-ai, deepinfra |
| 20 | FINAL-Bench/Darwin-31B-Opus | 85.9 | – | – | – | – | – | apache-2.0 | 33B | – |
| 21 | zai-org/GLM-4.7 | 85.7 | $0.60 | $2.20 | 204.8K | 320 ms | 418 t/s | mit | 358B | novita, cerebras, featherless-ai +1 |
| 22 | Qwen/Qwen3.5-27B | 85.5 | $0.30 | $2.40 | 262.1K | 1,108 ms | 53 t/s | apache-2.0 | 28B | novita, featherless-ai |
| 23 | MiniMaxAI/MiniMax-M2.5 | 85.2 | $0.30 | $1.20 | 204.8K | 1,095 ms | 80 t/s | other | 229B | novita, fireworks-ai, featherless-ai |
| 24 | moonshotai/Kimi-K2-Thinking | 84.5 | $0.60 | $2.50 | 262.1K | 967 ms | 59 t/s | other | 1058B | novita, featherless-ai |
| 25 | FINAL-Bench/Darwin-9B-NEG | 84.34 | – | – | – | – | – | apache-2.0 | 10B | – |
| 26 | google/gemma-4-31B-it | 84.3 | $0.13 | $0.38 | 262.1K | 396 ms | 60 t/s | apache-2.0 | 33B | novita, together, featherless-ai +1 |
| 27 | Qwen/Qwen3.5-35B-A3B | 84.2 | $0.25 | $2.00 | 262.1K | 907 ms | 96 t/s | apache-2.0 | 36B | novita |
| 28 | Nanbeige/Nanbeige4.1-3B | 83.8 | – | – | – | – | – | apache-2.0 | 4B | – |
| 29 | stepfun-ai/Step-3.5-Flash | 83.5 | $0.10 | $0.30 | 262.1K | 266 ms | 45 t/s | apache-2.0 | 199B | featherless-ai, deepinfra |
| 30 | nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | 82.7 | – | – | – | – | – | other | 124B | – |
| 31 | RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | 82.7 | – | – | – | – | – | other | 124B | – |
| 32 | deepseek-ai/DeepSeek-V3.2 | 82.4 | $0.27 | $0.40 | 163.8K | 1,130 ms | 34 t/s | mit | 685B | novita, featherless-ai |
| 33 | google/gemma-4-26B-A4B-it | 82.3 | $0.07 | $0.34 | 262.1K | 393 ms | 41 t/s | apache-2.0 | 27B | novita, featherless-ai, deepinfra |
| 34 | Qwen/Qwen3.5-9B | 81.7 | $0.12 | $0.18 | 262.1K | 347 ms | 39 t/s | apache-2.0 | 10B | featherless-ai, ovhcloud |
| 35 | openai/gpt-oss-120b | 80.9 | $0.05 | $0.25 | 131.1K | 190 ms | 913 t/s | apache-2.0 | 120B | groq, novita, cerebras +7 |
| 36 | meituan-longcat/LongCat-Flash-Thinking-2601 | 80.5 | – | – | – | – | – | mit | 562B | – |
| 37 | LGAI-EXAONE/EXAONE-4.5-33B | 80.5 | – | – | – | – | – | other | 34B | – |
| 38 | openai/gpt-oss-120b | 80.1 | $0.05 | $0.25 | 131.1K | 190 ms | 913 t/s | apache-2.0 | 120B | groq, novita, cerebras +7 |
| 39 | nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | 79.23 | – | – | – | – | – | other | 124B | – |
| 40 | RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 | 79.23 | – | – | – | – | – | other | 124B | – |
| 41 | LGAI-EXAONE/K-EXAONE-236B-A23B | 79.1 | – | – | – | – | – | other | 237B | – |
| 42 | OrionLLM/GRM-2.5 | 76.7 | – | – | – | – | – | apache-2.0 | 5B | – |
| 43 | arcee-ai/Trinity-Large-Thinking | 76.3 | – | – | – | – | – | other | 399B | – |
| 44 | Qwen/Qwen3.5-4B | 76.2 | – | – | – | – | – | apache-2.0 | 5B | – |
| 45 | nvidia/Nemotron-Cascade-2-30B-A3B | 76.1 | – | – | – | – | – | other | 32B | – |
| 46 | zai-org/GLM-4.7-Flash | 75.2 | $0.07 | $0.40 | 200K | 1,035 ms | 53 t/s | mit | 31B | novita, featherless-ai, zai-org |
| 47 | jdopensource/JoyAI-LLM-Flash | 74.43 | – | – | – | – | – | – | 49B | – |
| 48 | openai/gpt-oss-20b | 74.2 | $0.04 | $0.15 | 131.1K | 232 ms | 499 t/s | apache-2.0 | 22B | groq, novita, nscale +4 |
| 49 | openai/gpt-oss-120b | 73.5 | $0.05 | $0.25 | 131.1K | 190 ms | 913 t/s | apache-2.0 | 120B | groq, novita, cerebras +7 |
| 50 | openai/gpt-oss-120b | 73.1 | $0.05 | $0.25 | 131.1K | 190 ms | 913 t/s | apache-2.0 | 120B | groq, novita, cerebras +7 |