If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs • Paper • 2412.04144 • Published 29 days ago • 4
On Leakage of Code Generation Evaluation Datasets • Paper • 2407.07565 • Published Jul 10, 2024 • 5
Source-Aware Training Enables Knowledge Attribution in Language Models • Paper • 2404.01019 • Published Apr 1, 2024 • 1
Discriminator-Guided Multi-step Reasoning with Language Models • Paper • 2305.14934 • Published May 24, 2023 • 1