A unified multimodal understanding and generation model.
Chat with a multimodal chatbot
Efficient, fast, and natural text-to-speech with StyleTTS 2!
Upgraded to v1.0!
Generate realistic voice audio from text and audio prompts