This model is a version of openai/whisper-large-v3 fine-tuned on a curated collection of Welsh and English speech data (see techiaith/commonvoice_18_0_cy_en), originally collected from Mozilla's Common Voice project.
It achieves the following results on language-specific test sets:
The following hyperparameters were used during training:

Training results:

| Training Loss | Epoch | Step | Validation Loss | WER |
|---|---|---|---|---|
| 0.2097 | 0.2497 | 1000 | 0.2169 | 14.2221 |
| 0.1621 | 0.4993 | 2000 | 0.1816 | 11.6845 |
| 0.1406 | 0.7490 | 3000 | 0.1609 | 10.2445 |
| 0.1242 | 0.9987 | 4000 | 0.1505 | 9.5594 |
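
For readers unfamiliar with the WER column, the sketch below shows one common way to compute word error rate: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a model transcript, divided by the number of reference words. It assumes the values in the table are percentages, which this card does not state explicitly. The function name `word_error_rate` and the plain whitespace tokenization are illustrative assumptions; the evaluation behind this card may apply different text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage (hypothetical helper, not from this card).

    Assumption: words are separated by whitespace and compared exactly;
    no lowercasing or punctuation stripping is applied.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("a b c d", "a x c d"))  # 25.0
```

Under this definition, a score such as 9.5594 would correspond to roughly one word error per ten reference words.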