Whisper model finetuned using audio data from CommonVoice Ukrainian v10 train and dev set with additional data via semi-supervised data.
There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with ""
rather than Whisper's " "
). This mismatch leads to a slight degradation on CommonVoice.
- Downloads last month
- 58
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.
Evaluation results
- WER on mozilla-foundation/common_voice_11_0test set self-reported13.010