Establishing Task Scaling Laws via Compute-Efficient Model Ladders Paper • 2412.04403 • Published 20 days ago • 2
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens Paper • 2401.17377 • Published Jan 30 • 35