dumitrescustefan committed
Commit 46d638b (1 parent: 96b9562)
Update README.md
README.md CHANGED
@@ -4,9 +4,7 @@ inference: false
 license: apache-2.0
 ---
 
-
-
-This is a pretrained [MT5](https://github.com/google-research/multilingual-t5) base model (390M parameters).
+This is a pretrained [MT5](https://github.com/google-research/multilingual-t5) base model (**390M** parameters).
 
 Training was performed with the span corruption task on a clean 80GB Romanian text corpus for 4M total steps with these [scripts](https://github.com/dumitrescustefan/t5x_models), starting from the 1M public mt5x-base checkpoint. The model was trained with an encoder sequence length of 512 and a decoder sequence length of 256; it has the same mt5x vocabulary as the 1M multilingual checkpoint.
 
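The span corruption task named in the README can be illustrated with a toy sketch. This is not the training code from the linked scripts: the function name `span_corrupt`, the hand-picked spans, and the word-level tokens are assumptions made for illustration. The real objective samples spans at random and works on subword token IDs. The `<extra_id_N>` sentinel names follow the convention used by the Hugging Face T5 tokenizers.

```python
def span_corrupt(tokens, spans):
    """Toy span corruption (illustrative only).

    tokens: list of string tokens.
    spans:  sorted, non-overlapping (start, length) pairs to mask.
            Hypothetical input: real pipelines sample these randomly.
    Returns (encoder_inputs, decoder_targets) as token lists.
    """
    inputs, targets = [], []
    cursor = 0
    for i, (start, length) in enumerate(spans):
        sentinel = f"<extra_id_{i}>"
        # Keep the untouched text before the span, then mark the gap.
        inputs.extend(tokens[cursor:start])
        inputs.append(sentinel)
        # The target lists each sentinel followed by the tokens it hides.
        targets.append(sentinel)
        targets.extend(tokens[start:start + length])
        cursor = start + length
    # Copy whatever follows the last masked span.
    inputs.extend(tokens[cursor:])
    # A final sentinel closes the target sequence.
    targets.append(f"<extra_id_{len(spans)}>")
    return inputs, targets


tokens = "Thank you for inviting me to your party last week".split()
inputs, targets = span_corrupt(tokens, [(2, 2), (8, 1)])
print(" ".join(inputs))
print(" ".join(targets))
```

The encoder sees the text with gaps and the decoder learns to produce the missing spans, which is why the decoder sequence length (256) can be shorter than the encoder sequence length (512).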