PatrickHaller/hgrn2_pile_100m_distill_babylm
Text Generation
This collection contains the models relevant to the BabyLM 2024 submission. The 100m model is for the strict track, and the 10m model is for the strict-small track.