KVAE-Audio Collection KVAE-Audio is a continuous full-band audio waveform autoencoder • 1 item • Updated 6 days ago • 7
OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data Paper • 2606.13432 • Published 24 days ago • 113
LooseControlVideo: Directorial Video Control using Spatial Blocking Paper • 2606.19495 • Published 18 days ago • 9
SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer Paper • 2605.30409 • Published May 28 • 42
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation Paper • 2605.15141 • Published May 14 • 96
SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning Paper • 2606.10804 • Published 26 days ago • 53
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing Paper • 2606.06042 • Published Jun 4 • 24
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration Paper • 2605.31039 • Published May 29 • 46
Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching Paper • 2606.03911 • Published Jun 2 • 22
LVSA: Training-Free Sparse Attention for Long Video Diffusion Paper • 2605.31057 • Published May 29 • 14
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published May 18 • 93
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published May 1 • 86
ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control Paper • 2604.20816 • Published Apr 22 • 15
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published Apr 24 • 64
DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution Paper • 2507.01012 • Published Jul 1, 2025 • 2
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion Paper • 2601.22143 • Published Jan 29 • 12
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 116