# Finetuned TorToiSe Models

The `./finetunes/` folder contains a collection of my finetuned models. Each model folder contains:
* the `pickle`'d finetuned model for tortoise-tts
* the LJSpeech-formatted dataset it was trained on, also containing:
  - the generated YAML for training, stored in `train.yaml`
  - the openai/whisper output, stored in `whisper.json`
* pre-computed voice latents (auto-suggested by parsing each chunk at 10 seconds, which seems to be decent)

Most of these were quickly trained on either my dedicated system (2x6800XTs) or my personal system (1x2060) with a learning rate of `1e-4` for about 200 epochs each. That was enough for acceptable results and to provide some examples. In the future, I'll retrain these at lower learning rates to compare.
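
For a quick sanity check of the per-model layout described above, something like the following rough sketch could work. It is not a script shipped with this repo: the helper names `parse_ljspeech_line` and `summarize_model_dir` are made up for illustration, and since the exact folder depth and the dataset's metadata filename aren't spelled out here, it just searches each model folder recursively for the two filenames that are (`train.yaml` and `whisper.json`). The line parser assumes pipe-delimited metadata in the LJSpeech style, where the first field identifies the clip and the last field is the transcript.

```python
from pathlib import Path


def parse_ljspeech_line(line: str) -> tuple[str, str]:
    """Split one pipe-delimited metadata line into (clip id or path, transcript).

    Assumption: the first field names the audio clip and the last field is the
    text to use. LJSpeech proper has three fields (id|text|normalized text);
    two-field variants (path|text) are handled the same way.
    """
    fields = line.rstrip("\n").split("|")
    if len(fields) < 2:
        raise ValueError(f"expected at least 2 pipe-separated fields: {line!r}")
    return fields[0], fields[-1]


def summarize_model_dir(model_dir: Path) -> dict[str, bool]:
    """Report which of the files named in this README exist under model_dir."""
    return {
        # rglob searches recursively, since the exact nesting is an assumption
        name: next(model_dir.rglob(name), None) is not None
        for name in ("train.yaml", "whisper.json")
    }


if __name__ == "__main__":
    root = Path("./finetunes")
    if root.is_dir():
        for model_dir in sorted(p for p in root.iterdir() if p.is_dir()):
            print(model_dir.name, summarize_model_dir(model_dir))
```

Run from the repository root, this prints one line per model folder showing whether each of the two files was found.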

## Model List

* Harry Mason (Silent Hill)
* James Sunderland (Silent Hill 2)
* Mitsuru Kirijo (Persona 3)

### Planned

* a Japanese-focused one
  - trained on some voice lines from a gacha I haven't touched in years, but whose ripped assets I still have
  - admittedly, I had a decent one that I mistakenly deleted
* Patrick Bateman (American Psycho)
* Shadow, Rouge, and Knuckles (Sonic Adventure 2)
* Melina (Elden Ring)