unlimited Audio generation with a few added features
automated video and sound synthesis from images
Real-time in-browser speech recognition
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)