Running on CPU Upgrade Featured 3.21k The Smol Training Playbook 📚 3.21k The secrets to building world-class LLMs
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 255k • • 1.03k
deepseek-ai/DeepSeek-R1-0528 Text Generation • 685B • Updated May 29, 2025 • 7.07M • • 2.45k
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4 Image-Text-to-Text • 74B • Updated Sep 24, 2024 • 270 • 30
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1 Text Generation • 33B • Updated Dec 18, 2024 • 12 • 51