Collection of Papers on Code Evaluation (from code generation language models)
-
A Survey on Language Models for Code
Paper • 2311.07989 • Published • 21 -
Evaluating Large Language Models Trained on Code
Paper • 2107.03374 • Published • 7 -
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
Paper • 2310.06770 • Published • 4 -
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
Paper • 2102.04664 • Published • 1