arxiv:2510.10396
Yu Zhang
AaronZ345
AI & ML interests
Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).
Recent Activity
new activity 6 days ago
GTSinger/GTSinger: Annotation quality is very low, not usable for training
new activity about 1 month ago
GTSinger/GTSinger: Annotation quality is very low, not usable for training
authored a paper 7 months ago
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations