Transcribe Warsh Quran recitations into Arabic text
Extreme Super-Resolution via Scale Autoregression
Generate speech from text using a reference voice