patrickvonplaten
commited on
Commit
·
31b5780
1
Parent(s):
0826f48
Update README.md
Browse files
README.md
CHANGED
@@ -2,4 +2,4 @@ This model trains [T5-V1_1-base](https://huggingface.co/google/t5-v1_1-base) on
|
|
2 |
|
3 |
The official [run_t5_mlm_flax.py](https://github.com/huggingface/transformers/blob/master/examples/flax/language-modeling/run_t5_mlm_flax.py) is copied into the repository and is run using the hyperparameters as defined in *run_t5.sh*.
|
4 |
|
5 |
-
Training loss can be seen directly on the model card.
|
|
|
2 |
|
3 |
The official [run_t5_mlm_flax.py](https://github.com/huggingface/transformers/blob/master/examples/flax/language-modeling/run_t5_mlm_flax.py) is copied into the repository and is run using the hyperparameters as defined in *run_t5.sh*.
|
4 |
|
5 |
+
Training loss can be seen directly on the model card. The full training runs in finished in ca. 4 hours and 30 minutes.
|