From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval Paper • 2505.23059 • Published May 29, 2025 • 13
Retrieval Sources Collection Retrieval sources for retrieval-augmented code generation. • 6 items • Updated Jun 2, 2024 • 7
Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
Article Formatting Datasets for Chat Template Compatibility nroggendorff • Jun 28, 2024 • 9
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs Paper • 2402.14740 • Published Feb 22, 2024 • 18
HARP: Hesitation-Aware Reframing in Transformer Inference Pass Paper • 2412.07282 • Published Dec 10, 2024 • 4
Arctic-SnowCoder: Demystifying High-Quality Data in Code Pretraining Paper • 2409.02326 • Published Sep 3, 2024 • 19