metadata

license: mit
language:
  - ko
pipeline_tag: text-to-speech

MeloTTS

MeloTTS is a high-quality multi-lingual text-to-speech library by MyShell.ai. Supported languages include:

Model card	Example
English (American)	Link
English (British)	Link
English (Indian)	Link
English (Australian)	Link
English (Default)	Link
Spanish	Link
French	Link
Chinese (mix EN)	Link
Japanese	Link
Korean	Link

Some other features include:

The Chinese speaker supports mixed Chinese and English.
Fast enough for CPU real-time inference.
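One common way to check a real-time claim like the one above is the real-time factor (RTF): synthesis time divided by the duration of the audio produced, where a value below 1.0 means faster than real time. The sketch below is not part of MeloTTS. The names `wav_duration_seconds`, `real_time_factor`, and the `synthesize` callable are hypothetical, and it assumes the output is a standard PCM WAV file that Python's `wave` module can read.

```python
import time
import wave


def wav_duration_seconds(path):
    """Return the length of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def real_time_factor(synthesize, output_path):
    """Time one synthesis call and divide by the resulting audio duration.

    `synthesize` is a hypothetical callable that takes an output path and
    writes a WAV file there (for example, a small wrapper around a TTS call).
    """
    start = time.perf_counter()
    synthesize(output_path)
    elapsed = time.perf_counter() - start
    return elapsed / wav_duration_seconds(output_path)
```

The first call to a model often includes one-time loading costs, so timing a second or third call is likely to give a more representative number.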

Usage

Without Installation

An unofficial live demo is hosted on Hugging Face Spaces.

Use it on MyShell

There are hundreds of TTS models on MyShell, many more than MeloTTS offers. See examples here. More can be found at the widget center of MyShell.ai.

Install and Use Locally

Follow the installation steps here before using the following snippet:

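from melo.api import TTS

# Speed is adjustable
speed = 1.0
device = 'cpu'  # or 'cuda:0' to run on the first GPU

# Korean for: "Hello! The weather is really nice today."
text = "안녕하세요! 오늘은 날씨가 정말 좋네요."
# Load the Korean model and look up its speaker-name-to-ID mapping
model = TTS(language='KR', device=device)
speaker_ids = model.hps.data.spk2id

# Synthesize the text and write the audio to a WAV file
output_path = 'kr.wav'
model.tts_to_file(text, speaker_ids['KR'], output_path, speed=speed)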
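Since the speed is adjustable, it can be handy to render the same sentence at several speeds and compare them. The following is a minimal sketch built only on the `tts_to_file` call shown in the snippet. The helper `synthesize_at_speeds`, the `demo` function, the `kr_speed_` file-name prefix, and the `speed_samples` directory are all hypothetical names chosen for illustration.

```python
from pathlib import Path


def synthesize_at_speeds(model, text, speaker_id, speeds, out_dir="."):
    """Write one WAV file per speed via model.tts_to_file; return the paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for speed in speeds:
        # Hypothetical naming scheme, e.g. kr_speed_0.80.wav
        path = out_dir / f"kr_speed_{speed:.2f}.wav"
        model.tts_to_file(text, speaker_id, str(path), speed=speed)
        paths.append(path)
    return paths


def demo():
    # Requires MeloTTS to be installed; imported lazily so the helper above
    # can be used without it.
    from melo.api import TTS

    model = TTS(language='KR', device='cpu')
    speaker_id = model.hps.data.spk2id['KR']
    text = "안녕하세요! 오늘은 날씨가 정말 좋네요."
    return synthesize_at_speeds(
        model, text, speaker_id, [0.8, 1.0, 1.2], out_dir="speed_samples"
    )
```

Calling `demo()` after installation should leave three files in `speed_samples`, one per speed. Passing the model in as an argument means it is loaded once and reused for every speed.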

Join the Community

Open Source AI Grant

We are actively sponsoring open-source AI projects. The sponsorship includes GPU resources, funding, and intellectual support (collaboration with top research labs). We welcome both research and engineering projects, as long as the open-source community needs them. Please contact Zengyi Qin if you are interested.

Contributing

If you find this work useful, please consider contributing to the GitHub repo.

Many thanks to @fakerybakery for adding the Web UI and CLI part.

License

This library is released under the MIT License, which means it is free for both commercial and non-commercial use.

Acknowledgements

This implementation is based on TTS, VITS, VITS2 and Bert-VITS2. We appreciate their awesome work.