Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning Paper • 2605.06326 • Published 13 days ago • 24
Plan Then Action:High-Level Planning Guidance Reinforcement Learning for LLM Reasoning Paper • 2510.01833 • Published Oct 2, 2025
QCBench: Evaluating Large Language Models on Domain-Specific Quantitative Chemistry Paper • 2508.01670 • Published Aug 3, 2025
$δ$-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 8 days ago • 117
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 8 days ago • 117
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published Mar 29 • 29
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published Mar 26 • 132
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published Mar 22 • 77
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data Paper • 2603.15594 • Published Mar 16 • 149
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published Mar 4 • 89
Cognitive Foundations for Reasoning and Their Manifestation in LLMs Paper • 2511.16660 • Published Nov 20, 2025 • 11