Adaptive Length Image Tokenization via Recurrent Allocation Paper • 2411.02393 • Published Nov 4 • 12 • 1
Inference Optimal VLMs Need Only One Visual Token but Larger Models Paper • 2411.03312 • Published Nov 5 • 6 • 1
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration Paper • 2410.18076 • Published Oct 23 • 4 • 2
Autoregressive Large Language Models are Computationally Universal Paper • 2410.03170 • Published Oct 4 • 1 • 1
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control Paper • 2409.12192 • Published Sep 18 • 4 • 3
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control Paper • 2409.12192 • Published Sep 18 • 4 • 3
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization Paper • 2409.12903 • Published Sep 19 • 21 • 5
The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26 • 78 • 14