Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 6 days ago • 147
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping Paper • 2604.08364 • Published 12 days ago • 96
Cross-Modal Emotion Transfer for Emotion Editing in Talking Face Video Paper • 2604.07786 • Published 12 days ago • 6
INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling Paper • 2604.07209 • Published 13 days ago • 35
Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision Paper • 2604.04934 • Published 15 days ago • 45
Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models Paper • 2603.22782 • Published 28 days ago • 19
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics Paper • 2603.14375 • Published Mar 15 • 19
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published 26 days ago • 56
One View Is Enough! Monocular Training for In-the-Wild Novel View Generation Paper • 2603.23488 • Published 27 days ago • 5
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
Versatile Editing of Video Content, Actions, and Dynamics without Training Paper • 2603.17989 • Published Mar 18 • 17