Update README
Browse files
README.md
CHANGED
@@ -30,7 +30,7 @@ Tokenizer:
|
|
30 |
Training details:
|
31 |
|
32 |
* Trained for 70K steps (batch size 64) to ppl 27 on mc4 nl tiny 1 epoch
|
33 |
-
* Trained for
|
34 |
* Training continuing
|
35 |
* Block size: 512
|
36 |
* Optimizer: adafactor
|
|
|
30 |
Training details:
|
31 |
|
32 |
* Trained for 70K steps (batch size 64) to ppl 27 on mc4 nl tiny 1 epoch
|
33 |
+
* Trained for 620K steps (batch size 16) to ppl 17.5 on mc4 nl full
|
34 |
* Training continuing
|
35 |
* Block size: 512
|
36 |
* Optimizer: adafactor
|