nvidia/stt_it_fastconformer_hybrid_large_pc

Automatic Speech Recognition

hf-asr-leaderboard

Add links to SDP configs

#5

by igitman - opened Jun 22, 2023

base: refs/heads/main ← from: refs/pr/5

Files changed (1)

README.md +3 -3

@@ -191,9 +191,9 @@ The tokenizers for these models were built using the text transcripts of the tra
 The model in this collection are trained on a composite dataset (NeMo PnC IT ASRSET) comprising of 487 hours of Italian speech:
-- Mozilla Common Voice 12.0 (Italian) - 220 hours after data cleaning
-- Multilingual LibriSpeech (Italian) - 214 hours after data cleaning
-- VoxPopuli transcribed subset (Italian) - 53 hours after data cleaning
+- Mozilla Common Voice 12.0 (Italian) - 220 hours after data cleaning. [Speech Data Processor](https://github.com/NVIDIA/NeMo-speech-data-processor) config used to prepare this data is [here](https://github.com/NVIDIA/NeMo-speech-data-processor/blob/main/dataset_configs/italian/mcv/config.yaml).
+- Multilingual LibriSpeech (Italian) - 214 hours after data cleaning. [Speech Data Processor](https://github.com/NVIDIA/NeMo-speech-data-processor) config used to prepare this data is [here](https://github.com/NVIDIA/NeMo-speech-data-processor/blob/main/dataset_configs/italian/mls/config.yaml).
+- VoxPopuli transcribed subset (Italian) - 53 hours after data cleaning. [Speech Data Processor](https://github.com/NVIDIA/NeMo-speech-data-processor) config used to prepare this data is [here](https://github.com/NVIDIA/NeMo-speech-data-processor/blob/main/dataset_configs/italian/voxpopuli/config.yaml).
 ## Performance