ksingla025 committed on
Commit
0e4530c
1 Parent(s): 063c2d7

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ # Bengali Speech Tagger - Conformer CTC Model
+
+ This speech tagger transcribes Bengali speech, annotates key entities, and predicts speaker age, dialect, and intent.
+
+ ## Model Details
+
+ - **Model Type**: NeMo ASR
+ - **Architecture**: Conformer CTC
+ - **Language**: Bengali
+ - **Training Data**: AI4Bharat IndicVoices Bengali V1 and V2 dataset
+ - **Task**: Speech Recognition with Entity Tagging
+
+ ## Usage
+
+ ```python
+ import nemo.collections.asr as nemo_asr
+
+ # Load the pretrained model from the Hugging Face Hub
+ asr_model = nemo_asr.models.EncDecCTCModel.from_pretrained('WhissleAI/speech-tagger_be_ctc_meta')
+
+ # Transcribe a list of audio file paths; one result is returned per file
+ transcription = asr_model.transcribe(['path/to/audio.wav'])
+ print(transcription[0])
+ ```
+
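Depending on the installed NeMo version, `transcribe` may return plain strings or hypothesis objects that carry the transcript on a `.text` attribute. The sketch below is one way to normalize either form to a string before printing or post-processing. The helper names `to_text` and `to_texts` are hypothetical and not part of NeMo, and the `.text` attribute is an assumption to verify against your NeMo version.

```python
from typing import Any, List


def to_text(result: Any) -> str:
    """Return the transcript string from a single transcribe() result.

    Hypothetical helper (not part of NeMo). Assumes a result is either a
    plain string or an object exposing the transcript as `.text`.
    """
    if isinstance(result, str):
        return result
    # Hypothesis-style objects are assumed to expose the transcript on `.text`.
    text = getattr(result, "text", None)
    if isinstance(text, str):
        return text
    raise TypeError(f"Unsupported result type: {type(result).__name__}")


def to_texts(results: List[Any]) -> List[str]:
    """Normalize a list of transcribe() results to a list of strings."""
    return [to_text(r) for r in results]
```

With the snippet above, `print(to_texts(transcription)[0])` would print the transcript in either case.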
+ ## Model Training
+
+ - Base model: Conformer CTC
+ - Fine-tuned on AI4Bharat IndicVoices Bengali dataset
+ - Optimized for real-time transcription
+
+ ## License & Attribution
+
+ Please cite AI4Bharat when using this model:
+ https://indicvoices.ai4bharat.org/