UnidentifiedPerson
commited on
Commit
•
50fb15f
1
Parent(s):
e385ae0
Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,18 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
Introduction:
|
2 |
+
This repository contains high quality voice models that aim to replicate the voices of celebrities, influencers, and other famous people. These models can be freely used within Text To Speech (TTS) software, voice changers, or Audio to Audio software.
|
3 |
+
|
4 |
+
Datasets:
|
5 |
+
All of the datasets used to train these models are:
|
6 |
+
Of at least 20-25 minutes long and are collected from online videos, audios of interviews, blogs, and more of said speaker.
|
7 |
+
These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
|
8 |
+
The sample rate of all of these datasets are 44100 hz with the training using the 48k hz.
|
9 |
+
Training:
|
10 |
+
All of the Voice Models are trained using:
|
11 |
+
The algorithm: RVMPE_GPU.
|
12 |
+
RVC-V2 Framework.
|
13 |
+
With pitch guidance.
|
14 |
+
250-300 Total Epochs with the minimum steps reaching 1000 to max steps hitting around 6000.
|
15 |
+
Software used: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI.
|
16 |
+
All of this was done using my one Nvidia RTX 3090 GPU and 12th Gen Intel Core i7-12700K CPU.
|
17 |
+
Remember:
|
18 |
+
I don't own any of the content used for the dataset creation as well as the voice model training. As such I am not responsible for any misuse or abuse of any of this content. All of this was produced for educational purposes as well as for personal usage not malicious intent. Use these voice models at your risk and enjoy!
|