language models for computer graphics Collection List of papers (and models?) that are specifically used in the domain of computer graphics. Reading list for my thesis • 6 items • Updated Aug 29 • 1
What's the Meaning of Superhuman Performance in Today's NLU? Paper • 2305.08414 • Published May 15, 2023 • 1
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Paper • 2310.06770 • Published Oct 10, 2023 • 4
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence Paper • 2401.14196 • Published Jan 25 • 46
models to evaluate Collection Models I want to evaluate on shadereval-task2 (https://github.com/bigcode-project/bigcode-evaluation-harness/pull/173) at fp16 • 37 items • Updated 16 days ago • 2
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution Paper • 2401.03065 • Published Jan 5 • 10
Interpretability Collection Select papers on language model interpretability with notes • 5 items • Updated Nov 27, 2023 • 4
CodeFusion: A Pre-trained Diffusion Model for Code Generation Paper • 2310.17680 • Published Oct 26, 2023 • 69
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper • 2210.01970 • Published Sep 30, 2022 • 11
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation Paper • 2102.04664 • Published Feb 9, 2021 • 1
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation Paper • 2208.08227 • Published Aug 17, 2022 • 1
Code Evaluation Collection Collection of Papers on Code Evaluation (from code generation language models) • 44 items • Updated 30 days ago • 12
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion Paper • 2310.11248 • Published Oct 17, 2023 • 3
Efficient Streaming Language Models with Attention Sinks Paper • 2309.17453 • Published Sep 29, 2023 • 13
IntelliCode Compose: Code Generation Using Transformer Paper • 2005.08025 • Published May 16, 2020 • 3
CodePlan: Repository-level Coding using LLMs and Planning Paper • 2309.12499 • Published Sep 21, 2023 • 73
OctoPack: Instruction Tuning Code Large Language Models Paper • 2308.07124 • Published Aug 14, 2023 • 28
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code Paper • 2302.05527 • Published Feb 10, 2023 • 1
Textbooks Are All You Need II: phi-1.5 technical report Paper • 2309.05463 • Published Sep 11, 2023 • 86
ReCode: Robustness Evaluation of Code Generation Models Paper • 2212.10264 • Published Dec 20, 2022 • 1
WizardCoder: Empowering Code Large Language Models with Evol-Instruct Paper • 2306.08568 • Published Jun 14, 2023 • 28
The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code Paper • 2305.19213 • Published May 30, 2023 • 1
A Static Evaluation of Code Completion by Large Language Models Paper • 2306.03203 • Published Jun 5, 2023 • 3
Procedural Image Programs for Representation Learning Paper • 2211.16412 • Published Nov 29, 2022 • 1
Out of the BLEU: how should we assess quality of the Code Generation models? Paper • 2208.03133 • Published Aug 5, 2022 • 2
Large Language Models Are State-of-the-Art Evaluators of Code Generation Paper • 2304.14317 • Published Apr 27, 2023 • 2
CodeT5+: Open Code Large Language Models for Code Understanding and Generation Paper • 2305.07922 • Published May 13, 2023 • 4
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages Paper • 2305.02309 • Published May 3, 2023 • 1
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors Paper • 2306.17156 • Published Jun 29, 2023 • 21