MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation • 24B • Updated 5 days ago • 1.49k • 5
The Ultra-Scale Playbook • Running • 3.79k • The ultimate guide to training LLM on large GPU Clusters
openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation • 7B • Updated Dec 11, 2024 • 1.25k • 74
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 163