Learning to Predict Future-Aligned Research Proposals with Language Models Paper • 2603.27146 • Published Apr 6 • 5
Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation Paper • 2605.12975 • Published 8 days ago • 9
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Paper • 2602.01058 • Published Feb 1 • 44