Autonomous Evaluation and Refinement of Digital Agents Paper • 2404.06474 • Published Apr 9, 2024 • 2
Grounding Language in Multi-Perspective Referential Communication Paper • 2410.03959 • Published Oct 4, 2024 • 4
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published 10 days ago • 20