BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published 20 days ago • 10
The BrowserGym Ecosystem for Web Agent Research Paper • 2412.05467 • Published 19 days ago • 18
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22 • 56
The Impact of Positional Encoding on Length Generalization in Transformers Paper • 2305.19466 • Published May 31, 2023 • 2
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Paper • 2410.01679 • Published Oct 2 • 24
Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models Paper • 2305.14775 • Published May 24, 2023
BM25S: Orders of magnitude faster lexical search via eager sparse scoring Paper • 2407.03618 • Published Jul 4 • 11
Improving Automatic VQA Evaluation Using Large Language Models Paper • 2310.02567 • Published Oct 4, 2023 • 3
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations Paper • 2407.03471 • Published Jul 3 • 28
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17 • 50
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published Jun 7 • 27
Are NLP Models really able to Solve Simple Math Word Problems? Paper • 2103.07191 • Published Mar 12, 2021 • 1
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions Paper • 2310.03016 • Published Oct 4, 2023 • 2