stephen-flood 's Collections Benchmarks
updated
Viewer
• Updated • 2.09k • 117
• 4
Viewer
• Updated • 5.82M • 7.57k
• 43
Viewer
• Updated • 231k • 534k
• 737
Benchmark
• Updated • 17.6k • 922k
• 1.31k
Viewer
• Updated • 19.6k • 22
lighteval/legal_summarization
Viewer
• Updated • 26.9k • 527
• 27
Viewer
• Updated • 1.6k • 136
• 2
lighteval/synthetic_reasoning
Viewer
• Updated • 33k • 635
• 8
lighteval/synthetic_reasoning_natural
Viewer
• Updated • 22k • 477
• 15
Viewer
• Updated • 90.3k • 494
• 3
lighteval/GPT3_unscramble
Viewer
• Updated • 50k • 8
• 1
lighteval/aimo_progress_prize_1
Viewer
• Updated • 10 • 16
Viewer
• Updated • 1.7k • 18
Viewer
• Updated • 72.5k • 4.08k
• 150
Viewer
• Updated • 860k • 56.1k
• 577
Text Classification
• Updated • 53.5k
• 83
Jofthomas/hermes-function-calling-thinking-V1
Viewer
• Updated • 3.57k • 694
• 78
NousResearch/hermes-function-calling-v1
Viewer
• Updated • 11.6k • 16.9k
• 407
Viewer
• Updated • 15.7k • 185
• 7
Viewer
• Updated • 621M • 25.3k
• 88
open-web-math/open-web-math
Viewer
• Updated • 6.32M • 43.7k
• 341