Baseline models

#1
by esualp - opened

Hello, thank you for your contribution! Are you also planning to share the baseline models (trained from scratch) with 1.1B and 3B parameters?

llm-stacking org

Hi, thanks for your interest! Due to storage limits, we kept only the complete 7B checkpoints after finishing the experiments. For the 1B baselines, however, you can refer to the original TinyLlama models: https://github.com/jzhang38/TinyLlama
