tingyuansen
commited on
Commit
•
177f477
1
Parent(s):
8885f39
Update README.md
Browse files
README.md
CHANGED
@@ -33,7 +33,7 @@ AstroLLaMA-3-8B-Base_Summary is a specialized base language model for astronomy,
|
|
33 |
- No gradient accumulation
|
34 |
- BF16 format
|
35 |
- Cosine decay schedule for learning rate reduction
|
36 |
-
- Training duration: 1 epoch
|
37 |
- **Primary Use**: Next token prediction for astronomy-related text generation and analysis
|
38 |
- **Reference**: Pan et al. 2024 [Link to be added]
|
39 |
|
|
|
33 |
- No gradient accumulation
|
34 |
- BF16 format
|
35 |
- Cosine decay schedule for learning rate reduction
|
36 |
+
- Training duration: 1 epoch
|
37 |
- **Primary Use**: Next token prediction for astronomy-related text generation and analysis
|
38 |
- **Reference**: Pan et al. 2024 [Link to be added]
|
39 |
|