Excited to launch two new SOTA text-to-speech models on the TTS Arena:
- OpenVoice V2
- Play.HT 2.0
About the TTS Arena
The TTS Arena is an open-source arena where you enter a prompt, two models generate speech from it, and you vote on which one sounds better.
We compile the votes into an automatically updated leaderboard to help developers select the best model.
We've already included models such as ElevenLabs, XTTS, StyleTTS 2, and MetaVoice. The more votes we collect, the sooner we'll be able to show these new models on the leaderboard and compare them!
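As a rough illustration of how pairwise votes can be turned into a ranking, here is a minimal sketch using Elo-style updates. This is an assumption for illustration only: the post does not say which rating method the Arena uses, and the function names, the K-factor, the starting rating, and the placeholder model names are all hypothetical.

```python
from collections import defaultdict

K = 32.0          # hypothetical step size for each rating update
START = 1000.0    # hypothetical rating assigned to a model with no votes


def expected_score(r_a: float, r_b: float) -> float:
    """Elo-predicted probability that the model rated r_a beats the one rated r_b."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))


def build_leaderboard(votes):
    """Turn (winner, loser) vote pairs into a list of (model, rating), best first."""
    ratings = defaultdict(lambda: START)
    for winner, loser in votes:
        exp_win = expected_score(ratings[winner], ratings[loser])
        delta = K * (1.0 - exp_win)   # an unexpected win moves ratings more
        ratings[winner] += delta
        ratings[loser] -= delta
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


# Placeholder votes, not real Arena data.
votes = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
leaderboard = build_leaderboard(votes)
for name, rating in leaderboard:
    print(f"{name}: {rating:.1f}")
```

With a scheme like this, a newly added model starts at the default rating and only settles into a meaningful position after enough votes, which is why more votes help new models appear on the leaderboard sooner.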
OpenVoice V2
OpenVoice V2 is an open-source speech synthesis model created by MyShell AI that supports instant zero-shot voice cloning. It's the next generation of OpenVoice and is released under the MIT license.
https://github.com/myshell-ai/OpenVoice
Play.HT 2.0
Play.HT 2.0 is a high-quality proprietary text-to-speech engine. Accessible through their API, this model supports zero-shot voice cloning.
Compare the models on the TTS Arena:
TTS-AGI/TTS-Arena