Article • Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 153
Article • Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation • exploding-gradients • Sep 16, 2025 • 20
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey • Paper • 2509.02547 • Published Sep 2, 2025 • 238
The Ultra-Scale Playbook • Running • 3.85k • The ultimate guide to training LLM on large GPU Clusters
OFA-Sys/chinese-clip-vit-base-patch16 • Zero-Shot Image Classification • Updated Dec 9, 2022 • 46.4k • 127