Yu Zhang's picture

Yu Zhang

AaronZ345

·

https://aaronz345.github.io

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

Recent Activity

new activity 6 days ago

GTSinger/GTSinger:Annotation quality is very low, not usable for training

new activity about 1 month ago

GTSinger/GTSinger:Annotation quality is very low, not usable for training

authored a paper 7 months ago

MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations

View all activity

Organizations

Papers 10

arxiv:2510.10396

arxiv:2508.10924

arxiv:2507.14534

arxiv:2507.06670

models 2

AaronZ345/StyleSinger

Updated May 5, 2025 • 1

AaronZ345/TCSinger

Updated Apr 7, 2025 • 1

datasets 2

AaronZ345/MRSDrama

Preview • Updated Aug 10, 2025 • 1.99k • 1

AaronZ345/GTSinger

Viewer • Updated Jul 24, 2025 • 28.6k • 5k • 15