๐Ÿ† HF Hub Benchmark Dashboard
Last updated: 2026-06-02T10:17:05 UTC ยท auto-refreshes every 6h
30 benchmarks
305 models
614 entries
27 active
3 empty

๐Ÿง  Knowledge ย ยทย  3 benchmarks ยท 88 models

Radio
GPQA
ModelScoreIn $/1MOut $/1MContextTTFTThroughputLicenseParamsProviders
๐Ÿฅ‡moonshotai/Kimi-K2.690.5$0.75$3.40262.1K232 ms78 t/sother1059B
novitatogetherfireworks-ai +2
๐Ÿฅˆdeepseek-ai/DeepSeek-V4-Pro90.1$1.60$3.381.0M653 ms59 t/smit862B
novitatogetherfireworks-ai +2
๐Ÿฅ‰FINAL-Bench/Darwin-28B-REASON89.39โ€”โ€”โ€”โ€”โ€”apache-2.027Bโ€”
4OrionLLM/GRM-2.6-Opus89.2โ€”โ€”โ€”โ€”โ€”apache-2.028Bโ€”
5FINAL-Bench/Darwin-28B-Opus88.89โ€”โ€”โ€”โ€”โ€”apache-2.028Bโ€”
6Qwen/Qwen3.5-397B-A17B88.4$0.49$3.60262.1K400 ms189 t/sapache-2.0403B
novitatogetherfeatherless-ai +3
7FINAL-Bench/Darwin-36B-Opus88.4โ€”โ€”โ€”โ€”โ€”apache-2.035Bโ€”
8FINAL-Bench/Darwin-60B-DUO88.38โ€”โ€”โ€”โ€”โ€”gemmaโ€”โ€”
9OrionLLM/GRM-2.6-Plus88.3โ€”โ€”โ€”โ€”โ€”apache-2.028Bโ€”
10inclusionAI/Ring-2.6-1T88.27โ€”โ€”โ€”โ€”โ€”mit1026Bโ€”
11deepseek-ai/DeepSeek-V4-Flash88.1$0.14$0.281.0M636 ms113 t/smit158B
novitafireworks-aifeatherless-ai +1
12Qwen/Qwen3.6-27B87.8โ€”โ€”โ€”โ€”โ€”apache-2.028Bโ€”
13moonshotai/Kimi-K2.587.6$0.60$3.00262.1K387 ms40 t/sother1059B
novitafireworks-aifeatherless-ai
14tencent/Hy3-preview87.2โ€”โ€”โ€”โ€”โ€”other299Bโ€”
15FINAL-Bench/Darwin-27B-Opus86.9โ€”โ€”โ€”โ€”โ€”apache-2.028Bโ€”
16Qwen/Qwen3.5-122B-A10B86.6$0.29$2.40262.1K419 ms80 t/sapache-2.0125B
novitadeepinfra
17zai-org/GLM-5.186.2$1.05$3.50202.8K528 ms63 t/smit754B
togetherfireworks-aifeatherless-ai +2
18zai-org/GLM-586$1.00$3.20202.8K579 ms88 t/smit754B
novitatogetherfeatherless-ai +1
19Qwen/Qwen3.6-35B-A3B86$0.15$0.95262.1K379 ms83 t/sapache-2.036B
featherless-aideepinfra
20FINAL-Bench/Darwin-31B-Opus85.9โ€”โ€”โ€”โ€”โ€”apache-2.033Bโ€”
21zai-org/GLM-4.785.7$0.60$2.20204.8K320 ms418 t/smit358B
novitacerebrasfeatherless-ai +1
22Qwen/Qwen3.5-27B85.5$0.30$2.40262.1K1,108 ms53 t/sapache-2.028B
novitafeatherless-ai
23MiniMaxAI/MiniMax-M2.585.2$0.30$1.20204.8K1,095 ms80 t/sother229B
novitafireworks-aifeatherless-ai
24moonshotai/Kimi-K2-Thinking84.5$0.60$2.50262.1K967 ms59 t/sother1058B
novitafeatherless-ai
25FINAL-Bench/Darwin-9B-NEG84.34โ€”โ€”โ€”โ€”โ€”apache-2.010Bโ€”
26google/gemma-4-31B-it84.3$0.13$0.38262.1K396 ms60 t/sapache-2.033B
novitatogetherfeatherless-ai +1
27Qwen/Qwen3.5-35B-A3B84.2$0.25$2.00262.1K907 ms96 t/sapache-2.036B
novita
28Nanbeige/Nanbeige4.1-3B83.8โ€”โ€”โ€”โ€”โ€”apache-2.04Bโ€”
29stepfun-ai/Step-3.5-Flash83.5$0.10$0.30262.1K266 ms45 t/sapache-2.0199B
featherless-aideepinfra
30nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF1682.7โ€”โ€”โ€”โ€”โ€”other124Bโ€”
31RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF1682.7โ€”โ€”โ€”โ€”โ€”other124Bโ€”
32deepseek-ai/DeepSeek-V3.282.4$0.27$0.40163.8K1,130 ms34 t/smit685B
novitafeatherless-ai
33google/gemma-4-26B-A4B-it82.3$0.07$0.34262.1K393 ms41 t/sapache-2.027B
novitafeatherless-aideepinfra
34Qwen/Qwen3.5-9B81.7$0.12$0.18262.1K347 ms39 t/sapache-2.010B
featherless-aiovhcloud
35openai/gpt-oss-120b80.9$0.05$0.25131.1K190 ms913 t/sapache-2.0120B
groqnovitacerebras +7
36meituan-longcat/LongCat-Flash-Thinking-260180.5โ€”โ€”โ€”โ€”โ€”mit562Bโ€”
37LGAI-EXAONE/EXAONE-4.5-33B80.5โ€”โ€”โ€”โ€”โ€”other34Bโ€”
38openai/gpt-oss-120b80.1$0.05$0.25131.1K190 ms913 t/sapache-2.0120B
groqnovitacerebras +7
39nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF1679.23โ€”โ€”โ€”โ€”โ€”other124Bโ€”
40RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-BF1679.23โ€”โ€”โ€”โ€”โ€”other124Bโ€”
41LGAI-EXAONE/K-EXAONE-236B-A23B79.1โ€”โ€”โ€”โ€”โ€”other237Bโ€”
42OrionLLM/GRM-2.576.7โ€”โ€”โ€”โ€”โ€”apache-2.05Bโ€”
43arcee-ai/Trinity-Large-Thinking76.3โ€”โ€”โ€”โ€”โ€”other399Bโ€”
44Qwen/Qwen3.5-4B76.2โ€”โ€”โ€”โ€”โ€”apache-2.05Bโ€”
45nvidia/Nemotron-Cascade-2-30B-A3B76.1โ€”โ€”โ€”โ€”โ€”other32Bโ€”
46zai-org/GLM-4.7-Flash75.2$0.07$0.40200K1,035 ms53 t/smit31B
novitafeatherless-aizai-org
47jdopensource/JoyAI-LLM-Flash74.43โ€”โ€”โ€”โ€”โ€”โ€”49Bโ€”
48openai/gpt-oss-20b74.2$0.04$0.15131.1K232 ms499 t/sapache-2.022B
groqnovitanscale +4
49openai/gpt-oss-120b73.5$0.05$0.25131.1K190 ms913 t/sapache-2.0120B
groqnovitacerebras +7
50openai/gpt-oss-120b73.1$0.05$0.25131.1K190 ms913 t/sapache-2.0120B
groqnovitacerebras +7