LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models Paper • 2504.10415 • Published Apr 14, 2025
Why Do Reasoning Models Lose Coverage? The Role of Data and Forks in the Road Paper • 2605.17026 • Published 6 days ago