weqweasdas
commited on
Commit
•
102b5ac
1
Parent(s):
b3d02d9
Update README.md
Browse files
README.md
CHANGED
@@ -35,7 +35,8 @@ Difference between this mixture and that of
|
|
35 |
|
36 |
### Training
|
37 |
|
38 |
-
We train the model for one epoch with a learning rate of 5e-6, batch size 512, cosine learning rate decay with a warmup ratio 0.03.
|
|
|
39 |
|
40 |
|
41 |
|
|
|
35 |
|
36 |
### Training
|
37 |
|
38 |
+
We train the model for one epoch with a learning rate of 5e-6, batch size 512, cosine learning rate decay with a warmup ratio 0.03.
|
39 |
+
|
40 |
|
41 |
|
42 |
|