UnidentifiedPerson
commited on
Commit
•
dd64268
1
Parent(s):
c7d45d3
Update README.md
Browse files
README.md
CHANGED
@@ -17,11 +17,11 @@ Datasets:
|
|
17 |
|
18 |
All of the datasets used to train these models are:
|
19 |
|
20 |
-
1. Of at least 20-25 minutes long and are collected from online videos, audios of interviews, blogs, and more of said speaker.
|
21 |
|
22 |
2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
|
23 |
|
24 |
-
3.The sample rate of all of these datasets are
|
25 |
|
26 |
4. For the dataset recording and extraction process sometimes it may not be 100% perfect due to background noise or music interfering and in some cases
|
27 |
I may not even reach to the 20-25 minute mark since there may be very little or no data available, as such I also reduce the number of epochs to 200 to
|
|
|
17 |
|
18 |
All of the datasets used to train these models are:
|
19 |
|
20 |
+
1. Of at least 20-25 minutes long and are collected from online videos, audios of interviews, blogs, and more of said speaker (Mostly interviews as its way easier to collect, edit, and polish).
|
21 |
|
22 |
2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
|
23 |
|
24 |
+
3.The sample rate of all of these datasets are 48k hz with the training using the 48k hz.
|
25 |
|
26 |
4. For the dataset recording and extraction process sometimes it may not be 100% perfect due to background noise or music interfering and in some cases
|
27 |
I may not even reach to the 20-25 minute mark since there may be very little or no data available, as such I also reduce the number of epochs to 200 to
|