aandrusenko committed
Commit · 5868990
1 Parent(s): ca6bd4b
Update README.md
README.md CHANGED
@@ -113,7 +113,7 @@ More training details can be found at the [Esperanto ASR example](https://github
 
 ### Datasets
 
-All the models were trained on
+All the models were trained on Mozilla Common Voice Esperanto 11.0 dataset comprising of about 1400 validated hours of Esperanto speech. However, training set consists of a much smaller amount of data, because when forming the train.tsv, dev.tsv and test.tsv, repetitions of texts in train were removed by Mozilla developers.
 
 - Train set: ~250 hours.
 - Dev set: ~25 hours.