SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 25 days ago • 191
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 10 days ago • 72
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 10 days ago • 72
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 25 days ago • 191
NEO1_5 Collection From Pixels to Words -- Towards Native One-Vision Models at Scale • 3 items • Updated 9 days ago • 6
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 164
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition Paper • 2602.08439 • Published Feb 9 • 28
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 68
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published Dec 22, 2025 • 68
Runtime error Agents Featured 1.45k EasyControl Ghibli 🦀 1.45k New Ghibli EasyControl model is now released!!
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness Paper • 2503.21755 • Published Mar 27, 2025 • 33