JorgeSarry commited on
Commit
c6cb2ca
·
1 Parent(s): 9d860d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -1,9 +1,10 @@
 
 
 
1
  This is a smaller version of the google/mt5-base model with only Spanish and some English embeddings left following the procedure outlined here https://towardsdatascience.com/how-to-adapt-a-multilingual-t5-model-for-a-single-language-b9f94f3d9c90
2
 
3
 
4
  The original model has 582M parameters, with 384M of them being input and output embeddings.
5
  After shrinking the sentencepiece vocabulary from 250K to 30K (top 10K English and top 20K Spanish tokens) the number of model parameters reduced to 244M parameters, resulting on a model size reduced from 2.2GB to 0.9GB - 42% of the original one.
6
 
7
- ---
8
- language: es
9
- ---
 
1
+ ---
2
+ language: es
3
+ ---
4
  This is a smaller version of the google/mt5-base model with only Spanish and some English embeddings left following the procedure outlined here https://towardsdatascience.com/how-to-adapt-a-multilingual-t5-model-for-a-single-language-b9f94f3d9c90
5
 
6
 
7
  The original model has 582M parameters, with 384M of them being input and output embeddings.
8
  After shrinking the sentencepiece vocabulary from 250K to 30K (top 10K English and top 20K Spanish tokens) the number of model parameters reduced to 244M parameters, resulting on a model size reduced from 2.2GB to 0.9GB - 42% of the original one.
9
 
10
+