Article • Unlocking asynchronicity in continuous batching • ror, pcuenq, ariG23498, +1 • 11 days ago • 54
Article • Continuous batching from first principles • ror, ArthurZ, mcpotato, +1 • Nov 25, 2025 • 395
Article • Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation • exploding-gradients • Sep 16, 2025 • 20
Space • The Smol Training Playbook 📚: The secrets to building world-class LLMs • Featured • 3.18k
Article • Open-source DeepResearch – Freeing our search agents • m-ric, albertvillanova, merve, thomwolf, clefourrier, +3 • Feb 4, 2025 • 1.32k
Article • MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era • MiniMax-AI • Jan 15, 2025 • 48
Article • Low Latency CPU Based Educational Value Classifier With Generic Educational Value • kenhktsui • Jun 12, 2024 • 9