Update README.md
README.md
CHANGED
@@ -23,4 +23,19 @@ language:
The checkpoint open-sourced here was trained by Papercup using the open-source PL-BERT model found here: https://github.com/yl4579/PL-BERT. It is intended for use with StyleTTS2, which can be found here: https://github.com/yl4579/StyleTTS2. See below for the languages it was trained on (they correspond to the crowdsourced dataset found here: https://huggingface.co/datasets/styletts2-community/multilingual-phonemes-10k-alpha).
Notable differences compared to the default PL-BERT checkpoint and config available [here](https://github.com/yl4579/StyleTTS2/tree/main/Utils/PLBERT):
* Because the model covers many languages, it uses a different tokenizer: `bert-base-multilingual-cased`.
* The PL-BERT model was trained on the data from `styletts2-community/multilingual-phonemes-10k-alpha` for 1.1M iterations.
* The `token_maps.pkl` file has changed (the new version is also open-sourced here).
* The `util.py` file has been changed to deal with an error when loading `new_state_dict["embeddings.position_ids"]`.
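
The exact change made to `util.py` is not spelled out here. As a rough sketch only, one common way to make the loader tolerant of the `embeddings.position_ids` entry is to drop it only when it is present instead of deleting it unconditionally. The helper name `clean_state_dict` is hypothetical, and the `module.encoder.` key prefix is an assumption based on the upstream StyleTTS2 loader, not a copy of the modified file.

```python
from collections import OrderedDict


def clean_state_dict(state_dict):
    """Return a copy of a checkpoint state dict ready for `load_state_dict`.

    Assumption: keys look like `module.encoder.<name>`, as in the upstream
    StyleTTS2 PL-BERT loader. Keys outside the encoder are skipped.
    """
    prefix = "module.encoder."
    new_state_dict = OrderedDict()
    for key, value in state_dict.items():
        if key.startswith(prefix):
            new_state_dict[key[len(prefix):]] = value
    # Drop the buffer only if it exists, so a checkpoint that lacks it
    # does not raise KeyError.
    new_state_dict.pop("embeddings.position_ids", None)
    return new_state_dict
```

The result can then be passed to the model's `load_state_dict(..., strict=False)`, as the upstream loader does.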
## How do I train StyleTTS2 with this new PL-BERT checkpoint?
* Create a new folder under `Utils` in your StyleTTS2 repository. Call it, for example, `PLBERT-all-languages`.
* Copy `config.yml`, `step_1100000.t7` and `util.py` into it.
* Then, in your StyleTTS2 config file, change `PLBERT_dir` to `Utils/PLBERT-all-languages`.
* Now create the train and validation files. Use `espeak` to create files in the same format as the ones in the `Data` folder of the StyleTTS2 repository. Careful! If your text is not in English, you will need to change the `language` argument used to phonemise it. You can find the correct language codes [here](https://github.com/espeak-ng/espeak-ng/blob/master/docs/languages.md). For example, Latin American Spanish is `es-419`.
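
The last step might look like the sketch below. It assumes the `phonemizer` Python package with an `espeak-ng` backend installed, and that each line of the list files follows a `path|phonemes|speaker_id` layout like the files in the StyleTTS2 `Data` folder (check this against your copy of the repository). The helper names are made up for illustration.

```python
def format_line(wav_path, phonemes, speaker_id=0):
    """Build one `path|phonemes|speaker_id` line (assumed StyleTTS2 layout)."""
    return f"{wav_path}|{phonemes.strip()}|{speaker_id}"


def phonemize_texts(texts, language="es-419"):
    """Phonemise a list of strings with espeak via the `phonemizer` package."""
    # Imported lazily so the formatting helper works without espeak installed.
    from phonemizer.backend import EspeakBackend

    backend = EspeakBackend(
        language=language, preserve_punctuation=True, with_stress=True
    )
    return backend.phonemize(texts, strip=True)


def write_list(path, rows, language="es-419"):
    """Write a list file; `rows` holds (wav_path, text, speaker_id) tuples."""
    rows = list(rows)
    phonemes = phonemize_texts([text for _, text, _ in rows], language)
    with open(path, "w", encoding="utf-8") as f:
        for (wav, _, speaker), ph in zip(rows, phonemes):
            f.write(format_line(wav, ph, speaker) + "\n")
```

For example, `write_list("Data/train_list.txt", [("clip_0001.wav", "Hola, ¿cómo estás?", 0)])` would write a one-line training list for Latin American Spanish; the output file name and the clip name are placeholders.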
Voilà, you can now train a multilingual StyleTTS2 model!
Thank you to Aaron (Yinghao) Li for these contributions.