STEM Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 92.8k • 472 lmms-lab/HLE-Verified Preview • Updated Feb 28 • 5.43k • 5 skylenage-ai/HLE-Verified Viewer • Updated Feb 27 • 2.5k • 22k • 18
STEM Idavidrein/gpqa Benchmark • Updated Mar 5 • 1.25k • 92.8k • 472 lmms-lab/HLE-Verified Preview • Updated Feb 28 • 5.43k • 5 skylenage-ai/HLE-Verified Viewer • Updated Feb 27 • 2.5k • 22k • 18