2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 11 days ago • 92
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published 28 days ago • 41
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 23 days ago • 38
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices Paper • 2411.10640 • Published Nov 16, 2024 • 44
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution Paper • 2401.00935 • Published Jan 1, 2024 • 17
LRM: Large Reconstruction Model for Single Image to 3D Paper • 2311.04400 • Published Nov 8, 2023 • 47