Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning Paper • 2605.21487 • Published 4 days ago • 20
TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation Paper • 2503.07050 • Published Mar 10, 2025 • 1
MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving Paper • 2605.12624 • Published 12 days ago • 5
Driving Intents Amplify Planning-Oriented Reinforcement Learning Paper • 2605.12625 • Published 12 days ago • 3
Driving Intents Amplify Planning-Oriented Reinforcement Learning Paper • 2605.12625 • Published 12 days ago • 3
MindVLA-U1: VLA Beats VA with Unified Streaming Architecture for Autonomous Driving Paper • 2605.12624 • Published 12 days ago • 5
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published Mar 17 • 109
The Side Effects of Being Smart: Safety Risks in MLLMs' Multi-Image Reasoning Paper • 2601.14127 • Published Jan 20 • 5
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction Paper • 2410.21169 • Published Oct 28, 2024 • 30