Update README.md
README.md
CHANGED
@@ -16,14 +16,27 @@ pip install nemo_toolkit['all']
 
 The model is available for use in the NeMo toolkit [3], and can be used as a pre-trained checkpoint for inference or for fine-tuning on another dataset.
 
-### Automatically
+### Automatically instantiate the model
 
 ```python
 import nemo.collections.asr as nemo_asr
-
+from huggingface_hub import hf_hub_download
+
+path = hf_hub_download(repo_id="nvidia/stt_en_conformer_ctc_large",filename="stt_en_conformer_large.nemo")
+asr_model = nemo_asr.models.EncDecCTCModelBPE.restore_from(path)
+```
+
+### Transcribing using Python
+First, let's get a sample
+```
+wget https://dldata-public.s3.us-east-2.amazonaws.com/2086-149220-0033.wav
+```
+Then simply do:
+```
+asr_model.transcribe(['2086-149220-0033.wav'])
 ```
 
-### Transcribing
+### Transcribing many audio files
 
 ```shell
 python [NEMO_GIT_FOLDER]/examples/asr/transcribe_speech.py \
@@ -105,11 +118,4 @@ Since this model was trained on publically available speech datasets, the perfor
 
 [2] [Google Sentencepiece Tokenizer](https://github.com/google/sentencepiece)
 
-[3] [NVIDIA NeMo Toolkit](https://github.com/NVIDIA/NeMo)
-
-
-## Licence
-
-License to use this model is covered by the NGC [TERMS OF USE](https://ngc.nvidia.com/legal/terms) unless another License/Terms Of Use/EULA is clearly specified. By downloading the public and release version of the model, you accept the terms and conditions of the NGC [TERMS OF USE](https://ngc.nvidia.com/legal/terms).
-
-
+[3] [NVIDIA NeMo Toolkit](https://github.com/NVIDIA/NeMo)
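The README snippets added in this change (download the checkpoint, restore the model, call `transcribe`) could be combined into one small script. The following is a minimal sketch, not part of the commit: `load_model` and `transcribe_files` are hypothetical helper names, and the handling of the return value assumes `transcribe` yields either plain strings or objects with a `.text` attribute, which may vary between NeMo versions.

```python
from pathlib import Path
from typing import Dict, Iterable


def load_model(repo_id: str = "nvidia/stt_en_conformer_ctc_large",
               filename: str = "stt_en_conformer_large.nemo"):
    """Download the .nemo checkpoint and restore it, as in the README snippet."""
    # Imports are deferred so transcribe_files() can be used or tested
    # without NeMo and huggingface_hub installed.
    import nemo.collections.asr as nemo_asr
    from huggingface_hub import hf_hub_download

    path = hf_hub_download(repo_id=repo_id, filename=filename)
    return nemo_asr.models.EncDecCTCModelBPE.restore_from(path)


def transcribe_files(asr_model, audio_paths: Iterable[str]) -> Dict[str, str]:
    """Hypothetical helper: map each audio path to its transcript."""
    paths = [str(p) for p in audio_paths]

    # Fail early with a clear message instead of an error deep inside the model.
    missing = [p for p in paths if not Path(p).is_file()]
    if missing:
        raise FileNotFoundError(f"Audio files not found: {missing}")

    results = asr_model.transcribe(paths)

    # Assumption: each result is a string, or an object exposing `.text`.
    return {p: getattr(r, "text", r) for p, r in zip(paths, results)}
```

With the sample file from the README downloaded, usage would look like `transcribe_files(load_model(), ['2086-149220-0033.wav'])`. Keeping the model as a parameter means the checkpoint is loaded once and reused across calls.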