The Synthetic Data Playbook: Generating Trillions of the Finest Tokens (235 likes). Explore synthetic data experiments on a virtual bookshelf.
Porting nanochat to Transformers: an AI modeling history lesson (49 likes). Learn about ML and Transformers through nanochat.
Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks (93 likes). Evaluate multilingual models using FineTasks.