radiogroup-crits commited on
Commit
e990eac
1 Parent(s): 40e0ae3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +94 -0
README.md CHANGED
@@ -1,3 +1,97 @@
1
  ---
 
 
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - it
4
  license: apache-2.0
5
+ datasets:
6
+ - mozilla-foundation/common_voice_11_0
7
+ metrics:
8
+ - wer
9
+ - cer
10
+ tags:
11
+ - audio
12
+ - automatic-speech-recognition
13
+ - hf-asr-leaderboard
14
+ - it
15
+ - mozilla-foundation/common_voice_11_0
16
+ - speech
17
+ - wav2vec2
18
+ model-index:
19
+ - name: XLS-R Wav2Vec2 CV11Ita by radiogroup crits
20
+ results:
21
+ - task:
22
+ name: Speech Recognition
23
+ type: automatic-speech-recognition
24
+ dataset:
25
+ name: Common Voice 11.0 italian
26
+ type: mozilla-foundation/common_voice_11_0
27
+ args: it
28
+ metrics:
29
+ - name: Test WER
30
+ type: wer
31
+ value: 7.12
32
+ - name: Test CER
33
+ type: cer
34
+ value: 1.75
35
+ - name: Test WER (+LM)
36
+ type: wer
37
+ value: 5.77
38
+ - name: Test CER (+LM)
39
+ type: cer
40
+ value: 1.51
41
  ---
42
+ # XLS-R-1B-CV11ITA-LMWIKI500
43
+
44
+ ## Fine-tuned XLS-R 1B model for speech recognition in Italian
45
+
46
+ Fine-tuned [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on Italian using the train and validation splits of [Common Voice 11.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0).
47
+
48
+ When using this model, make sure that your speech input is sampled at 16kHz.
49
+
50
+
51
+ ## Language model information
52
+
53
+ Our language model was generated using a 500-characters data set for each Italian Wikipedia article.
54
+
55
+
56
+ ## Download CommonVoice11.0 dataset for italian language
57
+ ```python
58
+ from datasets import load_dataset
59
+
60
+ dataset = load_dataset("mozilla-foundation/common_voice_11_0", "it", use_auth_token=True)
61
+ ```
62
+
63
+ ## Evaluation Commands
64
+
65
+ To evaluate on `mozilla-foundation/common_voice_11_0` with split `test`:
66
+
67
+ ```bash
68
+ python eval.py --model_id radiogroup-crits/wav2vec2-xls-r-1b-cv11ita-lmwiki500 --dataset mozilla-foundation/common_voice_11_0 --config it --split test --log_outputs --greedy
69
+
70
+ mv log_mozilla-foundation_common_voice_11_0_it_test_predictions.txt log_mozilla-foundation_common_voice_11_0_it_test_predictions_greedy.txt
71
+
72
+ mv log_mozilla-foundation_common_voice_11_0_it_test_targets.txt log_mozilla-foundation_common_voice_11_0_it_test_targets_greedy.txt
73
+
74
+ mv mozilla-foundation_common_voice_11_0_it_test_eval_results.txt mozilla-foundation_common_voice_11_0_it_test_eval_results_greedy.txt
75
+
76
+ python eval.py --model_id radiogroup-crits/wav2vec2-xls-r-1b-cv11ita-lmwiki500 --dataset mozilla-foundation/common_voice_11_0 --config it --split test --log_outputs
77
+
78
+ mv log_mozilla-foundation_common_voice_11_0_it_test_predictions.txt log_mozilla-foundation_common_voice_11_0_it_test_predictions_lm.txt
79
+
80
+ mv log_mozilla-foundation_common_voice_11_0_it_test_targets.txt log_mozilla-foundation_common_voice_11_0_it_test_targets_lm.txt
81
+
82
+ mv mozilla-foundation_common_voice_11_0_it_test_eval_results.txt mozilla-foundation_common_voice_11_0_it_test_eval_results_lm.txt
83
+ ```
84
+
85
+ ## Citation
86
+ If you want to cite this model you can use this:
87
+
88
+ ```bibtex
89
+ @misc{crits2023wav2vec2-xls-r-1b-cv11ita-lmwiki500,
90
+ title={XLS-R Wav2Vec2 CV11Ita by radiogroup crits},
91
+ author={Teraoni Prioletti Raffaele, Casagranda Paolo and Russo Francesco},
92
+ publisher={Hugging Face},
93
+ journal={Hugging Face Hub},
94
+ howpublished={\url{https://huggingface.co/radiogroup-crits/wav2vec2-xls-r-1b-cv11ita-lmwiki500}},
95
+ year={2023}
96
+ }
97
+ ```