Baseline models
#1, opened by esualp
Hello, thank you for your contribution! Are you also planning to share the baseline models (trained from scratch) with 1.1B and 3B parameters?
Hi, thanks for your interest! Due to storage limits, we kept only the complete 7B checkpoints after finishing the experiments. For 1B baselines, however, you can refer to the original TinyLlama models: https://github.com/jzhang38/TinyLlama