kingabzpro commited on
Commit
f188709
·
1 Parent(s): b33b130

End of training

Browse files
Files changed (1) hide show
  1. README.md +16 -16
README.md CHANGED
@@ -17,8 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.3263
21
- - Wer: 1.0593
 
22
 
23
  ## Model description
24
 
@@ -37,27 +38,26 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 0.0003
41
  - train_batch_size: 32
42
- - eval_batch_size: 16
43
  - seed: 42
44
- - gradient_accumulation_steps: 2
45
- - total_train_batch_size: 64
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
- - lr_scheduler_warmup_steps: 50
49
- - num_epochs: 40
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss | Wer |
54
- |:-------------:|:-----:|:----:|:---------------:|:------:|
55
- | 6.0662 | 6.25 | 100 | 3.4034 | 1.0 |
56
- | 3.348 | 12.5 | 200 | 3.1795 | 1.0 |
57
- | 2.4085 | 18.75 | 300 | 1.5518 | 1.0432 |
58
- | 1.0433 | 25.0 | 400 | 1.2322 | 1.0514 |
59
- | 0.5163 | 31.25 | 500 | 1.2808 | 1.0495 |
60
- | 0.3625 | 37.5 | 600 | 1.3263 | 1.0593 |
61
 
62
 
63
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.3611
21
+ - Wer: 0.2992
22
+ - Cer: 0.0786
23
 
24
  ## Model description
25
 
 
38
  ### Training hyperparameters
39
 
40
  The following hyperparameters were used during training:
41
+ - learning_rate: 0.0001
42
  - train_batch_size: 32
43
+ - eval_batch_size: 8
44
  - seed: 42
45
+ - gradient_accumulation_steps: 4
46
+ - total_train_batch_size: 128
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - lr_scheduler_warmup_steps: 300
50
+ - num_epochs: 100
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
55
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
56
+ | 7.0431 | 19.05 | 300 | 3.4423 | 1.0 | 1.0 |
57
+ | 2.3233 | 38.1 | 600 | 0.5965 | 0.4757 | 0.1329 |
58
+ | 0.5676 | 57.14 | 900 | 0.3962 | 0.3584 | 0.0954 |
59
+ | 0.3611 | 76.19 | 1200 | 0.3651 | 0.3190 | 0.0820 |
60
+ | 0.2996 | 95.24 | 1500 | 0.3611 | 0.2992 | 0.0786 |
 
61
 
62
 
63
  ### Framework versions