
whisper-large-v3-ft-cv-cy

This model is a version of openai/whisper-large-v3 fine-tuned on the train_all and other_with_excluded custom splits of techiaith/commonvoice_18_0_cy.

It achieves the following results on the Common Voice for Welsh release 18's standard test set:

WER: 18.50%
CER: 5.32%
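For readers unfamiliar with these metrics, the following is a minimal sketch of how word error rate (WER) and character error rate (CER) are conventionally computed: the edit distance between reference and hypothesis, divided by the reference length. It is not the evaluation script used for the figures above. The function names and the sample sentences are illustrative assumptions, and real evaluations usually apply text normalization (casing, punctuation) first.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length."""
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


# Illustrative sentences (assumed, not taken from the test set):
# the hypothesis misspells "annwyl" as "anwyl".
reference = "mae hen wlad fy nhadau yn annwyl i mi"
hypothesis = "mae hen wlad fy nhadau yn anwyl i mi"

print(f"WER: {100 * wer(reference, hypothesis):.2f}")
print(f"CER: {100 * cer(reference, hypothesis):.2f}")
```

A single misspelled word counts as one full word error but only one character error, which is why CER is typically much lower than WER for the same output.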

N.B. This model performs considerably worse on English-language speech than a bilingual model, but better on Welsh.

Usage

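from transformers import pipeline

# Load the fine-tuned model as a speech recognition pipeline
transcriber = pipeline("automatic-speech-recognition", model="techiaith/whisper-large-v3-ft-cv-cy")

# Replace the placeholder with a local path or URL to your sound file
result = transcriber("<path or url to soundfile>")
print(result)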

{'text': 'Mae hen wlad fy nhadau yn annwyl i mi.'}
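Recordings longer than Whisper's 30-second window need to be chunked. The sketch below shows one possible way to do that with the pipeline's `chunk_length_s` argument. The helper names `build_transcriber` and `transcribe_file`, the 30-second chunk size, and the file name in the usage comment are assumptions for illustration, not part of the original model card.

```python
MODEL_ID = "techiaith/whisper-large-v3-ft-cv-cy"  # model id from the usage example


def build_transcriber(model_id=MODEL_ID, chunk_length_s=30):
    """Create an ASR pipeline that splits long audio into fixed-length chunks."""
    # Imported lazily so transcribe_file can be exercised without transformers.
    from transformers import pipeline

    return pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=chunk_length_s,  # assumed chunk size, in seconds
    )


def transcribe_file(audio_path, transcriber):
    """Run the transcriber on one file and return the cleaned-up text."""
    result = transcriber(audio_path)
    return result["text"].strip()


# Usage (downloads the model on first call; "recording.wav" is a hypothetical path):
#   transcriber = build_transcriber()
#   print(transcribe_file("recording.wav", transcriber))
```

Keeping the pipeline construction separate from the per-file call means the model is loaded once and reused across many files.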


Model size: 1.54B params (Safetensors)
Tensor type: F32
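As a rough guide to what the parameter count and tensor type mean in practice, the arithmetic below estimates the size of the weights alone. The 1.54B figure and F32 type come from the listing above. The float16 row is an assumption about loading the model in half precision, and the estimate ignores activations and other runtime overhead.

```python
NUM_PARAMS = 1.54e9  # parameter count as listed for this model

# Bytes per parameter for each tensor type; float16 is an assumed alternative.
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def weights_gb(num_params, dtype):
    """Approximate size of the weights in gigabytes (10^9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: ~{weights_gb(NUM_PARAMS, dtype):.2f} GB")
```

In F32 the weights come to roughly 6.16 GB, and half precision would halve that.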


Model tree for techiaith/whisper-large-v3-ft-commonvoice-cy

Base model: openai/whisper-large-v3 (this model is one of 435 fine-tunes of it)
Fine-tunes of this model: 1 model

Dataset used to train techiaith/whisper-large-v3-ft-commonvoice-cy: techiaith/commonvoice_18_0_cy (train_all and other_with_excluded splits)

Collection including techiaith/whisper-large-v3-ft-commonvoice-cy: Speech Recognition Models (models for Welsh language and bilingual speech recognition, 13 items)

Evaluation results

WER on DewiBrynJones/commonvoice_18_0_cy (default), self-reported: 0.185