xezpeleta commited on
Commit
9b5d0ad
·
verified ·
1 Parent(s): b6bd817

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -14
README.md CHANGED
@@ -16,35 +16,51 @@ model-index:
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
- name: asierhv/composite_corpus_eu_v2.1
20
- type: asierhv/composite_corpus_eu_v2.1
21
  metrics:
22
  - name: Wer
23
  type: wer
24
- value: 6.544273760459599
 
 
25
  ---
26
 
27
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
28
- should probably proofread and complete it, then remove this comment. -->
29
 
30
- # Whisper Large Basque
31
 
32
- This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the asierhv/composite_corpus_eu_v2.1 dataset.
33
- It achieves the following results on the evaluation set:
34
- - Loss: 0.1549
35
- - Wer: 6.5443
36
 
37
  ## Model description
38
 
39
- More information needed
40
 
41
  ## Intended uses & limitations
42
 
43
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
44
 
45
  ## Training and evaluation data
46
 
47
- More information needed
 
 
48
 
49
  ## Training procedure
50
 
@@ -112,4 +128,4 @@ The following hyperparameters were used during training:
112
  - Transformers 4.49.0.dev0
113
  - Pytorch 2.6.0+cu124
114
  - Datasets 3.3.1.dev0
115
- - Tokenizers 0.21.0
 
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
+ name: Mozilla Common Voice 18.0
20
+ type: mozilla-foundation/common_voice_18_0
21
  metrics:
22
  - name: Wer
23
  type: wer
24
+ value: 4.84
25
+ language:
26
+ - eu
27
  ---
28
 
29
+ # Whisper Large v3 Basque
 
30
 
31
+ This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) specifically for Basque (eu) language Automatic Speech Recognition (ASR). It was trained on the [asierhv/composite_corpus_eu_v2.1](https://huggingface.co/datasets/asierhv/composite_corpus_eu_v2.1) dataset, which is a composite corpus designed to improve Basque ASR performance.
32
 
33
+ **Key improvements and results compared to the base model:**
34
+
35
+ * **Significant WER reduction:** The fine-tuned model achieves a Word Error Rate (WER) of 6.5443 on the validation set of the `asierhv/composite_corpus_eu_v2.1` dataset, demonstrating a substantial improvement in accuracy for Basque speech.
36
+ * **Exceptional performance on Common Voice:** When evaluated on the Mozilla Common Voice 18.0 dataset, the model achieved a WER of 4.84. This showcases the model's outstanding ability to generalize to diverse Basque speech datasets, and highlights the high accuracy achievable with the large-v3 model.
37
 
38
  ## Model description
39
 
40
+ This model leverages the `whisper-large-v3` architecture, the most powerful variant of the Whisper models, known for its exceptional accuracy in multilingual speech recognition. By fine-tuning this model on a dedicated Basque speech corpus, it achieves state-of-the-art performance in Basque ASR. The `whisper-large-v3` model offers the highest capacity and therefore the highest accuracy, but requires significantly more computational resources.
41
 
42
  ## Intended uses & limitations
43
 
44
+ **Intended uses:**
45
+
46
+ * Ultra-high-accuracy automatic transcription of Basque speech for critical applications.
47
+ * Development of cutting-edge Basque speech-based applications demanding the highest possible precision.
48
+ * Research in Basque speech processing requiring the most accurate transcriptions.
49
+ * Professional transcription services and applications where accuracy is paramount and computational resources are available.
50
+ * Use in scenarios where the highest possible accuracy is required, and the computational cost is justifiable.
51
+
52
+ **Limitations:**
53
+
54
+ * Performance is still influenced by audio quality, with challenges arising from background noise and poor recording conditions.
55
+ * Accuracy may be affected by highly dialectal or informal Basque speech, although the large model mitigates this to a great degree.
56
+ * Despite its high performance, the model may still produce errors, particularly with complex linguistic structures or rare words.
57
+ * The large-v3 model demands substantial computational resources, making it less suitable for real-time or resource-constrained applications.
58
 
59
  ## Training and evaluation data
60
 
61
+ * **Training dataset:** [asierhv/composite_corpus_eu_v2.1](https://huggingface.co/datasets/asierhv/composite_corpus_eu_v2.1). This dataset is a comprehensive and meticulously curated collection of Basque speech data, designed to maximize the performance of Basque ASR systems.
62
+ * **Evaluation Dataset:** The `test` split of `asierhv/composite_corpus_eu_v2.1`.
63
+
64
 
65
  ## Training procedure
66
 
 
128
  - Transformers 4.49.0.dev0
129
  - Pytorch 2.6.0+cu124
130
  - Datasets 3.3.1.dev0
131
+ - Tokenizers 0.21.0