CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models Paper • 2605.08735 • Published 9 days ago • 67
R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding Paper • 2507.05673 • Published Jul 8, 2025 • 1
Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models Paper • 2505.17225 • Published May 22, 2025 • 64