Update README.md
Browse files
README.md
CHANGED
@@ -126,6 +126,13 @@ language:
|
|
126 |
|
127 |
**MODEL DETAILS:** 3rd iteration, K=1000, HuBERT base architecture (95M parameters), 147 languages.
|
128 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
129 |
# Table of Contents:
|
130 |
|
131 |
1. [Summary](https://huggingface.co/utter-project/mHuBERT-147#mhubert-147-models)
|
@@ -135,11 +142,6 @@ language:
|
|
135 |
5. [Intermediate Checkpoints](https://huggingface.co/utter-project/mHuBERT-147#intermediate-checkpoints)
|
136 |
6. [Citing and Funding Information](https://huggingface.co/utter-project/mHuBERT-147#citing-and-funding-information)
|
137 |
|
138 |
-
# mHuBERT-147 models
|
139 |
-
|
140 |
-
mHuBERT-147 are compact and competitive multilingual HuBERT models trained on 90K hours of open-license data in 147 languages.
|
141 |
-
Different from *traditional* HuBERTs, mHuBERT-147 models are trained using faiss IVF discrete speech units.
|
142 |
-
Training employs a two-level language, data source up-sampling during training. See more information in [our paper](https://arxiv.org/pdf/2406.06371).
|
143 |
|
144 |
**This repository contains:**
|
145 |
* Fairseq checkpoint (original);
|
|
|
126 |
|
127 |
**MODEL DETAILS:** 3rd iteration, K=1000, HuBERT base architecture (95M parameters), 147 languages.
|
128 |
|
129 |
+
# mHuBERT-147 models
|
130 |
+
|
131 |
+
mHuBERT-147 are compact and competitive multilingual HuBERT models trained on 90K hours of open-license data in 147 languages.
|
132 |
+
Different from *traditional* HuBERTs, mHuBERT-147 models are trained using faiss IVF discrete speech units.
|
133 |
+
Training employs a two-level language, data source up-sampling during training. See more information in [our paper](https://arxiv.org/pdf/2406.06371).
|
134 |
+
|
135 |
+
|
136 |
# Table of Contents:
|
137 |
|
138 |
1. [Summary](https://huggingface.co/utter-project/mHuBERT-147#mhubert-147-models)
|
|
|
142 |
5. [Intermediate Checkpoints](https://huggingface.co/utter-project/mHuBERT-147#intermediate-checkpoints)
|
143 |
6. [Citing and Funding Information](https://huggingface.co/utter-project/mHuBERT-147#citing-and-funding-information)
|
144 |
|
|
|
|
|
|
|
|
|
|
|
145 |
|
146 |
**This repository contains:**
|
147 |
* Fairseq checkpoint (original);
|