FrontiersMind/Nandi-Mini-600M-Early-Checkpoint • Text Generation • 0.6B • Updated 1 day ago • 18.2k • 93
FrontiersMind/Nandi-Mini-150M-Tool-Calling • Text Generation • 0.2B • Updated about 18 hours ago • 30.1k • 55
Article • How I contributed a new model to the Transformers library using Codex • nielsr • Mar 30 • 51
FrontiersMind/Nandi-Mini-150M-Instruct • Text Generation • 0.2B • Updated about 18 hours ago • 29.9k • 54
Running on CPU Upgrade • The Synthetic Data Playbook: Generating Trillions of the Finest Tokens • 234 • Explore synthetic data experiments on a virtual bookshelf
Article • The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix • codelion • Nov 3, 2025 • 65
Running • Featured • FineWeb: decanting the web for the finest text data at scale • 1.34k • Explore and download the FineWeb web-text dataset
Running • The Ultra-Scale Playbook • 3.85k • The ultimate guide to training LLM on large GPU Clusters
The Instruction Gap: LLMs get lost in Following Instruction • Paper • 2601.03269 • Published Dec 19, 2025 • 8
Running on CPU Upgrade • Featured • The Smol Training Playbook • 3.18k • The secrets to building world-class LLMs
Reply • You don't really have to clone the repo. The FastAPI code is just there for demonstration, and you can code the way you like. The main takeaway is the Dockerfile.
Article • How to generate text: using different decoding methods for language generation with Transformers • patrickvonplaten • Mar 1, 2020 • 297