LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27, 2024 • 22
Large Language Models Must Be Taught to Know What They Don't Know Paper • 2406.08391 • Published Jun 12, 2024 • 1
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27, 2024 • 22
Giraffe: Adventures in Expanding Context Lengths in LLMs Paper • 2308.10882 • Published Aug 21, 2023 • 1
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive Paper • 2402.13228 • Published Feb 20, 2024 • 3