HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published 5 days ago • 41
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 4 days ago • 79
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 8 days ago • 21
FineWeb: decanting the web for the finest text data at scale 🍷 • Explore and download the FineWeb web-text dataset • 1.34k
GigaWorld-Policy: An Efficient Action-Centered World-Action Model Paper • 2603.17240 • Published Mar 18 • 26
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 15 days ago • 116
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 15 days ago • 69
ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning Paper • 2604.24300 • Published 15 days ago • 65
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 18 days ago • 226
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications Paper • 2503.07137 • Published Mar 10, 2025 • 2