|
--- |
|
license: mit |
|
tags: |
|
- audio |
|
- text-to-speech |
|
- instant-voice-cloning |
|
language: |
|
- en |
|
- zh |
|
inference: false |
|
--- |
|
|
|
# OpenVoice V2 |
|
|
|
In April 2024, we release OpenVoice V2, which includes all features in V1 and has: |
|
|
|
1. Better Audio Quality. OpenVoice V2 adopts a different training strategy that delivers better audio quality. |
|
|
|
2. Native Multi-lingual Support. English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2. |
|
|
|
3. Free Commercial Use. Starting from April 2024, both V2 and V1 are released under MIT License. Free for commercial use. |
|
|
|
|
|
<video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/641de0213239b631552713e4/uCHTHD9OUotgOflqDu3QK.mp4"></video> |
|
|
|
### Features |
|
- **Accurate Tone Color Cloning.** OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents. |
|
- **Flexible Voice Style Control.** OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation. |
|
- **Zero-shot Cross-lingual Voice Cloning.** Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset. |
|
|
|
### How to Use |
|
Please see [usage](https://github.com/myshell-ai/OpenVoice/blob/main/docs/USAGE.md) for detailed instructions. |
|
|
|
### Links |
|
- [Github](https://github.com/myshell-ai/OpenVoice) |
|
- [HFDemo](https://huggingface.co/spaces/myshell-ai/OpenVoiceV2) |
|
- [Discord](https://discord.gg/myshell) |
|
|
|
|