Running on CPU Upgrade Featured 3.21k The Smol Training Playbook 📚 3.21k The secrets to building world-class LLMs
Running 188 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 188 Building and scaling RL environments for LLM training
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning Paper • 2512.24265 • Published Dec 30, 2025 • 4
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 11 items • Updated 22 days ago • 98
Does your data spark joy? Performance gains from domain upsampling at the end of training Paper • 2406.03476 • Published Jun 5, 2024 • 4