Running on Zero 37 37 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate speech from text using reference audio
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published 12 days ago • 41