From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging • Paper 2410.01215 • Published Oct 2, 2024
DebugBench: Evaluating Debugging Capability of Large Language Models • Paper 2401.04621 • Published Jan 9, 2024