eunyounglee
committed on
Commit 028ef7e · 1 Parent(s): 4439967
Update README.md
README.md CHANGED
@@ -11,7 +11,7 @@ Data: English News Dataset 2GB
 
 <!-- Provide a quick summary of what the model is/does. -->
 
-Pretrained GPT-NeoX model with 2.06GB English news dataset. Took about 10 hours to reach 20,000 iterations. Trained on
+Pretrained GPT-NeoX model with 2.06GB English news dataset. Took about 10 hours to reach 20,000 iterations. Trained on p3.16xlarge.
 Different hyperparameter: gradient_accumulation_step 4
 
 ## Model Details