balaramas commited on
Commit
ba063fa
1 Parent(s): ab7482c

Upload 9 files

Browse files
.gitattributes CHANGED
@@ -37,3 +37,4 @@ MUSTC_ROOT_french/en-fr/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs mer
37
  MUSTC_ROOT_hindi/en-hi/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
38
  MUSTC_ROOT_german/en-de/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
39
  MUSTC_ROOT_bengali/en-bn/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
 
 
37
  MUSTC_ROOT_hindi/en-hi/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
38
  MUSTC_ROOT_german/en-de/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
39
  MUSTC_ROOT_bengali/en-bn/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
40
+ MUSTC_ROOT_nepali/en-ne/data/tst-COMMON/wav/ted_1096.wav filter=lfs diff=lfs merge=lfs -text
MUSTC_ROOT_nepali/en-ne/config_st.yaml ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ bpe_tokenizer:
2
+ bpe: sentencepiece
3
+ sentencepiece_model: /home/deepakprasad/nlp_code/Nepali_MUSTC/en-ne/spm_unigram8000_st.model
4
+ input_channels: 1
5
+ input_feat_per_channel: 80
6
+ specaugment:
7
+ freq_mask_F: 27
8
+ freq_mask_N: 1
9
+ time_mask_N: 1
10
+ time_mask_T: 100
11
+ time_mask_p: 1.0
12
+ time_wrap_W: 0
13
+ transforms:
14
+ '*':
15
+ - utterance_cmvn
16
+ _train:
17
+ - utterance_cmvn
18
+ - specaugment
19
+ vocab_filename: spm_unigram8000_st.txt
MUSTC_ROOT_nepali/en-ne/data/tst-COMMON/txt/tst-COMMON.en ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ Back in New York, I am the head of development for a non-profit called Robin Hood.
2
+ When I'm not fighting poverty, I'm fighting fires as the assistant captain of a volunteer fire company.
3
+ Now in our town, where the volunteers supplement a highly skilled career staff, you have to get to the fire scene pretty early to get in on any action.
4
+ I remember my first fire.
5
+ I was the second volunteer on the scene, so there was a pretty good chance I was going to get in.
6
+ But still it was a real footrace against the other volunteers to get to the captain in charge to find out what our assignments would be.
7
+ When I found the captain, he was having a very engaging conversation with the homeowner, who was surely having one of the worst days of her life.
8
+ Here it was, the middle of the night, she was standing outside in the pouring rain, under an umbrella, in her pajamas, barefoot, while her house was in flames.
MUSTC_ROOT_nepali/en-ne/data/tst-COMMON/txt/tst-COMMON.ne ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ न्यूयोर्कमा फर्केर, म रोबिन हुड नामक गैर-लाभकारी संस्थाको विकासको प्रमुख हुँ।
2
+ जब म गरिबीसँग लडिरहेको छैन, म स्वयंसेवक फायर कम्पनीको सहायक कप्तानको रूपमा आगोसँग लडिरहेको छु।
3
+ अब हाम्रो सहरमा, जहाँ स्वयम्सेवकहरूले उच्च दक्ष क्यारियर कर्मचारीहरूलाई पूरक गर्छन्, तपाईंले कुनै पनि कार्यमा जानको लागि धेरै चाँडो आगोको दृश्यमा पुग्नु पर्छ।
4
+ मलाई मेरो पहिलो आगो याद छ।
5
+ म दृश्यमा दोस्रो स्वयंसेवक थिएँ, त्यसैले म भित्र पस्ने एउटा राम्रो मौका थियो।
6
+ तर अझै पनि हाम्रो असाइनमेन्टहरू के हुनेछ भनेर पत्ता लगाउन प्रभारी कप्तानमा पुग्नु अन्य स्वयंसेवकहरूको बिरूद्ध वास्तविक फुटरेस थियो।
7
+ जब मैले कप्तानलाई भेट्टाएँ, उसले घरमालिकसँग धेरै आकर्षक कुराकानी गरिरहेको थियो, जो पक्कै पनि उनको जीवनको सबैभन्दा खराब दिनहरू मध्ये एक थियो।
8
+ यहाँ थियो, मध्यरात, उनी बाहिर मुसलधारे पानीमा, छातामुनि, पजामामा, खाली खुट्टामा उभिरहेकी थिइन्, जबकि उनको घरमा आगो लागेको थियो।
MUSTC_ROOT_nepali/en-ne/data/tst-COMMON/txt/tst-COMMON.yaml ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ - {duration: 5.0, offset: 0.0, rW: 17, uW: 0, speaker_id: spk.1096, wav: test.wav}
2
+ - {duration: 5.160000, offset: 20.290000, rW: 17, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
3
+ - {duration: 8.110000, offset: 25.930000, rW: 29, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
4
+ - {duration: 1.560000, offset: 34.920000, rW: 5, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
5
+ - {duration: 4.180000, offset: 36.730000, rW: 21, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
6
+ - {duration: 5.580000, offset: 41.880000, rW: 26, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
7
+ - {duration: 8.610001, offset: 48.309999, rW: 27, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
8
+ - {duration: 9.680000, offset: 57.510000, rW: 29, uW: 0, speaker_id: spk.1096, wav: ted_1096.wav}
MUSTC_ROOT_nepali/en-ne/data/tst-COMMON/wav/ted_1096.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69a122c3ad89320ec24cad84b622a01f26c3138b3e5869dc033e65bd0ab73fe1
3
+ size 8990102
MUSTC_ROOT_nepali/en-ne/fbank80.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0bef03a45d7514d5018c4de30d352c736359248e6e8d70d586796aa32b30f4e2
3
+ size 5242360
MUSTC_ROOT_nepali/en-ne/spm_unigram8000_st.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9b431e41320a5738c0af5368d23c5071a71899c897887f06a22f2efc087dd80
3
+ size 459775
MUSTC_ROOT_nepali/en-ne/spm_unigram8000_st.txt ADDED
The diff for this file is too large to render. See raw diff
 
MUSTC_ROOT_nepali/en-ne/tst-COMMON_st.tsv ADDED
The diff for this file is too large to render. See raw diff