TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents Paper • 2605.16909 • Published May 16 • 9
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 315
SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus Paper • 2510.03160 • Published Oct 3, 2025 • 4