view article Article Welcome Falcon Mamba: The first strong attention-free 7B model +4 JingweiZuo, yellowvm, DhiyaEddine, IChahed, ybelkada, Gkunsch • Aug 12, 2024 • 113
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 134
view article Article License to Call: Introducing Transformers Agents 2.0 +1 m-ric, lysandre, pcuenq • May 13, 2024 • 137