csikasote
/

xls-r-1b-bigcgen-combined-15hrs-model

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

csikasote commited on Dec 29, 2024

Commit

649c08c

·

verified ·

1 Parent(s): 065175e

Model save

Files changed (2) hide show

README.md +79 -0
model.safetensors +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,79 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: facebook/wav2vec2-xls-r-1b
+tags:
+- generated_from_trainer
+metrics:
+- wer
+model-index:
+- name: xls-r-1b-bigcgen-combined-15hrs-model
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# xls-r-1b-bigcgen-combined-15hrs-model
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.6606
+- Wer: 0.7041
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 3e-05
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 8
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
+- num_epochs: 30.0
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer    |
+|:-------------:|:------:|:----:|:---------------:|:------:|
+| No log        | 0.1410 | 100  | 3.8893          | 1.0    |
+| No log        | 0.2821 | 200  | 2.6866          | 1.0    |
+| No log        | 0.4231 | 300  | 1.4204          | 1.0    |
+| No log        | 0.5642 | 400  | 0.8780          | 0.8847 |
+| 5.5784        | 0.7052 | 500  | 0.8821          | 0.9583 |
+| 5.5784        | 0.8463 | 600  | 0.6898          | 0.7509 |
+| 5.5784        | 0.9873 | 700  | 0.6910          | 0.8690 |
+| 5.5784        | 1.1283 | 800  | 0.6632          | 0.6810 |
+| 5.5784        | 1.2694 | 900  | 0.6165          | 0.6048 |
+| 1.2954        | 1.4104 | 1000 | 0.6006          | 0.6134 |
+| 1.2954        | 1.5515 | 1100 | 0.6859          | 0.7684 |
+| 1.2954        | 1.6925 | 1200 | 0.5857          | 0.6273 |
+| 1.2954        | 1.8336 | 1300 | 0.6305          | 0.6155 |
+| 1.2954        | 1.9746 | 1400 | 0.6213          | 0.5856 |
+| 1.1582        | 2.1157 | 1500 | 0.5891          | 0.5984 |
+| 1.1582        | 2.2567 | 1600 | 0.6606          | 0.7041 |
+### Framework versions
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b141d3f8d6942d285e804eda122bf547c35305e1f836c35fcd89f5bf182a39b8
 size 3850244720

 version https://git-lfs.github.com/spec/v1
+oid sha256:cb92a0847638d471b4130bee519f809d13335485347a2042d1cf329c3d338897
 size 3850244720