AstroMLab
/

astrollama-3-8b-base_summary

Text Generation

text-generation-inference

Model card Files Files and versions Community

tingyuansen commited on Sep 29

Commit

a8860fd

•

1 Parent(s): 366ad76

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -96,7 +96,10 @@ Notably, the instruct version of this model shows even more significant improvem
 While AstroLLaMA-3-8B performs competitively among models in its class, it does not surpass the performance of the base LLaMA-3-8B model. This underscores the challenges in developing specialized models and the need for more diverse and comprehensive training data.
-It's worth noting that the AstroLLaMA-3-8B-Plus which we will release in the next model release addresses these limitations by expanding beyond astro-ph data.
 ## Ethical Considerations

 While AstroLLaMA-3-8B performs competitively among models in its class, it does not surpass the performance of the base LLaMA-3-8B model. This underscores the challenges in developing specialized models and the need for more diverse and comprehensive training data.
+This model is released primarily for reproducibility purposes, allowing researchers to track the development process and compare different iterations of AstroLLaMA models.
+For optimal performance and the most up-to-date capabilities in astronomy-related tasks, we recommend using AstroLLaMA-3.1-8B-Plus, where these limitations have been addressed. The newer model incorporates expanded training data beyond astro-ph and features a greatly expanded fine-tuning process, resulting in significantly improved performance.
 ## Ethical Considerations