arxiv:2605.04045
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Models, Multimodal Learning, Scene Graph Generation
Recent Activity
liked a dataset 4 days ago
yanlinli/UniM
authored a paper 27 days ago
Audio-Visual Intelligence in Large Foundation Models
upvoted a paper about 1 month ago
Audio-Visual Intelligence in Large Foundation Models