Saving weights and log at step 420000
Browse files
README.md
CHANGED
@@ -30,7 +30,7 @@ Tokenizer:
|
|
30 |
Training details:
|
31 |
|
32 |
* Trained for 70K steps (batch size 64) to ppl 27 on mc4 nl tiny 1 epoch
|
33 |
-
* Trained for
|
34 |
* Training continuing
|
35 |
* Block size: 512
|
36 |
* Optimizer: adafactor
|
|
|
30 |
Training details:
|
31 |
|
32 |
* Trained for 70K steps (batch size 64) to ppl 27 on mc4 nl tiny 1 epoch
|
33 |
+
* Trained for 420K steps (batch size 16) to ppl 19.0 on mc4 nl full
|
34 |
* Training continuing
|
35 |
* Block size: 512
|
36 |
* Optimizer: adafactor
|
flax_model.msgpack
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5262314590
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:748e3d61a600defeb512048593d4f85161ba408f30ecb0bf5d21a7b4dc9cc9df
|
3 |
size 5262314590
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5363100545
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:34af1f8a14a2b1db2c796487ede034c347be73b67c7a0069667f284284014d12
|
3 |
size 5363100545
|
runs/events.out.tfevents.1641156371.t1v-n-2f64d7c8-w-0.13342.0.v2
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:87e080048d898a981df47a0956825f5be43a98e3e22d799e89c3160cbeba1f39
|
3 |
+
size 63363863
|