Commit a8860fd, committed by tingyuansen
Parent(s): 366ad76
Update README.md
README.md
CHANGED
@@ -96,7 +96,10 @@ Notably, the instruct version of this model shows even more significant improvem
 
 While AstroLLaMA-3-8B performs competitively among models in its class, it does not surpass the performance of the base LLaMA-3-8B model. This underscores the challenges in developing specialized models and the need for more diverse and comprehensive training data.
 
-
+This model is released primarily for reproducibility purposes, allowing researchers to track the development process and compare different iterations of AstroLLaMA models.
+
+For optimal performance and the most up-to-date capabilities in astronomy-related tasks, we recommend using AstroLLaMA-3.1-8B-Plus, where these limitations have been addressed. The newer model incorporates expanded training data beyond astro-ph and features a greatly expanded fine-tuning process, resulting in significantly improved performance.
+
 
 ## Ethical Considerations
 