Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift Paper • 2311.15961 • Published Nov 27, 2023
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Paper • 2502.07640 • Published Feb 11 • 8
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations Paper • 2502.06453 • Published Feb 10