dumitrescustefan
committed on
Commit
·
96b9562
1
Parent(s):
948d8f4
Update README.md
README.md
CHANGED
@@ -6,7 +6,7 @@ license: apache-2.0
 
 # MT5x-base-romanian
 
-This is a pretrained [
+This is a pretrained [MT5](https://github.com/google-research/multilingual-t5) base model (390M parameters).
 
 Training was performed with the span corruption task on a clean 80GB Romanian text corpus for 4M total steps with these [scripts](https://github.com/dumitrescustefan/t5x_models), starting from the 1M public mt5x-base checkpoint. The model was trained with an encoder sequence length of 512 and a decoder sequence length of 256; it has the same mt5x vocabulary as the 1M multilingual checkpoint.
 
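
The README text in this diff mentions the span corruption task without showing what it does to a training example. The sketch below is a rough illustration only, not the linked training scripts. The `span_corrupt` helper is hypothetical, the masked spans are picked by hand rather than sampled, and the `<extra_id_N>` sentinel names are assumed from the T5 convention.

```python
from typing import List, Tuple


def span_corrupt(
    tokens: List[str], spans: List[Tuple[int, int]]
) -> Tuple[List[str], List[str]]:
    """Hypothetical helper: mask each half-open (start, end) span.

    Spans must be sorted and non-overlapping. Each span is replaced by one
    sentinel in the encoder input; the decoder target lists every sentinel
    followed by the tokens it hides, then a closing sentinel.
    """
    inputs: List[str] = []
    targets: List[str] = []
    cursor = 0
    for i, (start, end) in enumerate(spans):
        sentinel = f"<extra_id_{i}>"  # assumed T5-style sentinel name
        inputs.extend(tokens[cursor:start])  # keep the text before the span
        inputs.append(sentinel)              # hide the span in the input
        targets.append(sentinel)
        targets.extend(tokens[start:end])    # the hidden text goes to the target
        cursor = end
    inputs.extend(tokens[cursor:])           # keep the text after the last span
    targets.append(f"<extra_id_{len(spans)}>")  # closing sentinel
    return inputs, targets


# Whitespace "tokens" stand in for real subword pieces in this toy example.
tokens = "Ana are un caiet nou pe birou".split()
inputs, targets = span_corrupt(tokens, [(1, 2), (4, 6)])
print(" ".join(inputs))   # Ana <extra_id_0> un caiet <extra_id_1> birou
print(" ".join(targets))  # <extra_id_0> are <extra_id_1> nou pe <extra_id_2>
```

In actual pretraining the spans are sampled at random over subword tokens, and the resulting inputs and targets are fitted to the stated sequence lengths (512 for the encoder, 256 for the decoder).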