--- language: ary metrics: - wer tags: - audio - automatic-speech-recognition - speech - xlsr-fine-tuning-week license: apache-2.0 model-index: - name: XLSR Wav2Vec2 Moroccan Arabic dialect by Boumehdi results: - task: name: Speech Recognition type: automatic-speech-recognition metrics: - name: Test WER type: wer value: 0.244673 --- # Wav2Vec2-Large-XLSR-53-Moroccan-Darija **wav2vec2-large-xlsr-53** fine-tuned on 27 hours (27 people) of labeled Darija Audios. # Old model vs new model Old Model: - The model contains numerous incorrect transcriptions as input - Multiple transcribers. - The audio database is not organized (by gender, age, regions ..). - Wrong wer rate New Model: - Transcriptions are now performed by a single individual. - Each hour of audio is pronounced by a different person. - Fine-tuning is ongoing 24/7 to enhance accuracy, and we are consistently adding more data to the model every day. - Audio database is more organized - True Wer rate
Training Loss | Validation | Loss Wer |
---|---|---|
0.031600 | 0.316006 | 0.217313 |