yangzhang33/culture-eval-benchmark-cs-filtered-lite Viewer • Updated 10 days ago • 63.2k • 4.18k • 1
yangzhang33/culture-eval-benchmark-cs-filtered-lite-human-filtered Viewer • Updated 15 days ago • 1.72k • 113
GreekMMLU Leaderboard (Space, build error) • Explore GreekMMLU benchmark leaderboards for language models