UnidentifiedPerson
/

VoiceModels

Model card Files Files and versions Community

UnidentifiedPerson commited on Aug 14

Commit

dd64268

•

1 Parent(s): c7d45d3

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -17,11 +17,11 @@ Datasets:
 All of the datasets used to train these models are:
-    1. Of at least 20-25 minutes long and are collected from online videos, audios of interviews, blogs, and more of said speaker.
     2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
-    3.The sample rate of all of these datasets are 44100 hz with the training using the 48k hz.
     4. For the dataset recording and extraction process sometimes it may not be 100% perfect due to background noise or music interfering and in some cases
        I may not even reach to the 20-25 minute mark since there may be very little or no data available, as such I also reduce the number of epochs to 200 to

 All of the datasets used to train these models are:
+    1. Of at least 20-25 minutes long and are collected from online videos, audios of interviews, blogs, and more of said speaker (Mostly interviews as its way easier to collect, edit, and polish).
     2. These datasets are edited to contain the best high-quality audio of the speaker's voice with no background noise, music, silence, or any artifacts.
+    3.The sample rate of all of these datasets are 48k hz with the training using the 48k hz.
     4. For the dataset recording and extraction process sometimes it may not be 100% perfect due to background noise or music interfering and in some cases
        I may not even reach to the 20-25 minute mark since there may be very little or no data available, as such I also reduce the number of epochs to 200 to