The Ultra-Scale Playbook • Running • 3.88k • The ultimate guide to training LLMs on large GPU clusters
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 6.6M • 6.08k