AuriAetherwiing
committed on
Commit ac61a6e
1 Parent(s): ecaa474
Update README.md
README.md
CHANGED
@@ -23,8 +23,9 @@ base_model:
 </p>
 
 <p>
-<b>Version 0.1 notes:</b>
-
+<b>Version 0.1 notes:</b><br> Dataset was deduped and cleaned from version 0.0, and learning rate was adjusted. Resulting model seems to be stabler, and 0.0 problems with handling short inputs and min_p sampling seem to be mostly gone.<br>
+Will be retrained once more, because this run crashed around e1.2 (out of 3) (thanks, DeepSpeed, really appreciate it), and it's still somewhat undertrained as a result.
+</p>
 
 
 <p>