Running on Zero 42 42 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate speech from text using reference audio
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 11 days ago • 634k • 1.31k