UmarRamzan
/

w2v2-bert-urdu

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

UmarRamzan commited on May 15

Commit

5e3ddd2

•

1 Parent(s): 56b6925

Update README.md

Files changed (1) hide show

README.md +17 -7

README.md CHANGED Viewed

@@ -15,24 +15,34 @@ language:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# w2v2-bert-urdu
-This model is a fine-tuned version of [UmarRamzan/w2v2-bert-urdu](https://huggingface.co/UmarRamzan/w2v2-bert-urdu) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.3681
 - Wer: 0.2929
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Wav2Vec-Bert-2.0-Urdu
+This model is a fine-tuned version of [facebook/w2v-bert-2.0](https://huggingface.co/facebook/w2v-bert-2.0) on the Urdu split of the [Common Voice 17](https://huggingface.co/datasets/mozilla-foundation/common_voice_17_0) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.3681
 - Wer: 0.2929
 ## Model description
+## Usage Instructions
+```python
+from transformers import AutoFeatureExtractor, Wav2Vec2BertModel
+import torch
+from datasets import load_dataset
+dataset = load_dataset("hf-internal-testing/librispeech_asr_demo", "clean", split="validation")
+dataset = dataset.sort("id")
+sampling_rate = dataset.features["audio"].sampling_rate
+processor = AutoProcessor.from_pretrained("UmarRamzan/w2v2-bert-urdu")
+model = Wav2Vec2BertModel.from_pretrained("UmarRamzan/w2v2-bert-urdu")
+# audio file is decoded on the fly
+inputs = processor(dataset[0]["audio"]["array"], sampling_rate=sampling_rate, return_tensors="pt")
+with torch.no_grad():
+    outputs = model(**inputs)
+```
 ## Training procedure