chrisjay/afrospeech-wav2vec-gax

Audio Classification

afro-digits-speech

chrisjay committed on Oct 10, 2022

Commit 6bc56da • 1 Parent(s): 9347956

added updates

Files changed (1)

README.md +13 -12

@@ -25,15 +25,7 @@ model-index:
 # afrospeech-wav2vec-gax
-This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa), which was a crowd-sourced dataset collected using the [afro-speech Space](https://huggingface.co/spaces/chrisjay/afro-speech). It achieves the following results on the [validation set](VALID_oromo_gax_audio_data.csv):
-- F1: 1.0
-- Accuracy: 1.0
-The confusion matrix below helps to give a better look at the model's performance across the digits. Through it, we can see the precision and recall of the model as well as other important insights.
-![confusion matrix](afrospeech-wav2vec-gax_confusion_matrix_VALID.png)
 ## Training and evaluation data
@@ -46,8 +38,17 @@ Below is a distribution of the dataset (training and valdation)
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-gax.png)
-### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
@@ -56,7 +57,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - num_epochs: 150
-### Training results
 | Training Loss | Epoch |  Validation Accuracy |
 |:-------------:|:-----:|:--------:|
@@ -67,7 +68,7 @@ The following hyperparameters were used during training:
-### Framework versions
 - Transformers 4.21.3
 - Pytorch 1.12.0

 # afrospeech-wav2vec-gax
+This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa), which was a crowd-sourced dataset collected using the [afro-speech Space](https://huggingface.co/spaces/chrisjay/afro-speech).
 ## Training and evaluation data
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-gax.png)
+## Evaluation performance
+It achieves the following results on the [validation set](VALID_oromo_gax_audio_data.csv):
+- F1: 1.0
+- Accuracy: 1.0
+The confusion matrix below helps to give a better look at the model's performance across the digits. Through it, we can see the precision and recall of the model as well as other important insights.
+![confusion matrix](afrospeech-wav2vec-gax_confusion_matrix_VALID.png)
+## Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - num_epochs: 150
+## Training results
 | Training Loss | Epoch |  Validation Accuracy |
 |:-------------:|:-----:|:--------:|
+## Framework versions
 - Transformers 4.21.3
 - Pytorch 1.12.0
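
The "Evaluation performance" section added in this commit says the confusion matrix exposes the model's precision and recall. As a rough illustration of how accuracy, per-class precision and recall, and F1 can be read off such a matrix, here is a minimal sketch. The function name `metrics_from_confusion` and the toy matrix are hypothetical and are not taken from the repository, and macro-averaging is an assumption, since the card does not say which F1 average it reports.

```python
def metrics_from_confusion(matrix):
    """Return (accuracy, macro_f1, per_class) for a square confusion matrix.

    matrix[i][j] is the number of samples whose true class is i and whose
    predicted class is j. per_class is a list of (precision, recall, f1).
    Macro-averaged F1 is an assumption; the model card does not specify it.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))

    per_class = []
    for k in range(n):
        tp = matrix[k][k]
        predicted_k = sum(matrix[i][k] for i in range(n))  # column sum
        actual_k = sum(matrix[k])                          # row sum
        precision = tp / predicted_k if predicted_k else 0.0
        recall = tp / actual_k if actual_k else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        per_class.append((precision, recall, f1))

    accuracy = correct / total if total else 0.0
    macro_f1 = sum(f1 for _, _, f1 in per_class) / n if n else 0.0
    return accuracy, macro_f1, per_class


if __name__ == "__main__":
    # Toy 3-class matrix with made-up counts (NOT the model's actual results).
    toy = [
        [5, 0, 0],
        [0, 4, 1],
        [0, 0, 5],
    ]
    accuracy, macro_f1, per_class = metrics_from_confusion(toy)
    print(f"accuracy={accuracy:.3f} macro_f1={macro_f1:.3f}")
    for label, (p, r, f1) in enumerate(per_class):
        print(f"class {label}: precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```

A purely diagonal matrix (no off-diagonal counts) gives accuracy and F1 of exactly 1.0 under this computation, which is the pattern one would expect behind the reported validation scores.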