arxiv:2507.07984
Chenming Zhu
ChaimZhu
AI & ML interests
Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI
Recent Activity
upvoted a paper 25 days ago
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data upvoted a paper about 1 month ago
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens upvoted a paper 4 months ago
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence