Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published 4 days ago • 23
MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Paper • 2502.04235 • Published 9 days ago • 18