Zhenghua Bao

KingZ23

AI & ML interests

Multimodal AI Agent Engineer @ FlashIntel | Dual M.Sc. in Computer Science and Internet- and Web-Based Systems @ TU Darmstadt

Recent Activity

upvoted a paper 12 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 13 days ago • 46

liked a dataset 12 days ago

withmartian/routerbench

Updated Mar 27, 2024 • 829 • 27

upvoted a paper 13 days ago

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published 15 days ago • 131

upvoted a paper 4 months ago

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published Jan 16 • 23

New activity in FlashLabs/Chroma-4B 4 months ago

Finetuning for other languages?

#13 opened 4 months ago by

updated a model 4 months ago

FlashLabs/Chroma-4B

Any-to-Any • 6B • Updated Jan 28 • 466 • 382

authored a paper 4 months ago

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published Jan 16 • 23

liked a model 4 months ago

FlashLabs/Chroma-4B

Any-to-Any • 6B • Updated Jan 28 • 466 • 382

upvoted a paper 8 months ago

Lattica: A Decentralized Cross-NAT Communication Framework for Scalable AI Inference and Training

Paper • 2510.00183 • Published Sep 30, 2025 • 8