Commit
·
4439967
1
Parent(s):
d729496
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,27 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language:
|
3 |
+
- eng
|
4 |
+
pipeline_tag: text-generation
|
5 |
+
|
6 |
+
Trained: Pretrain
|
7 |
+
Config file: 1.3B
|
8 |
+
Data: English News Dataset 2GB
|
9 |
+
---
|
10 |
+
# Model Card for Model ID
|
11 |
+
|
12 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
+
|
14 |
+
Pretrained GPT-NeoX model with 2.06GB English news dataset. Took about 10 hours to reach 20,000 iterations. Trained on p3dn.24xlarge.
|
15 |
+
Different hyperparameter: gradient_accumulation_step 4
|
16 |
+
|
17 |
+
## Model Details
|
18 |
+
|
19 |
+
### Model Description
|
20 |
+
|
21 |
+
<!-- Provide a longer summary of what this model is. -->
|
22 |
+
|
23 |
+
|
24 |
+
|
25 |
+
- **Developed by:** Eunyoung Lee
|
26 |
+
- **Model type:** GPT-NeoX
|
27 |
+
- **Language(s) (NLP):** English
|