Jan

Vipitis

AI & ML interests

code generation, model evaluation

Recent Activity

updated a collection about 1 month ago

Interpretability

updated a dataset 6 months ago

Vipitis/Shadereval-inputs

updated a Space 7 months ago

Vipitis/shadermatch

upvoted a collection over 1 year ago

language models for computer graphics

List of papers (and models?) that are specifically used in the domain of computer graphics. Reading list for my thesis • 6 items • Updated Aug 29, 2024 • 1

upvoted a paper over 1 year ago

Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code

Paper • 2311.07989 • Published Nov 14, 2023 • 26

upvoted 4 papers almost 2 years ago

What's the Meaning of Superhuman Performance in Today's NLU?

Paper • 2305.08414 • Published May 15, 2023 • 1

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 152

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Paper • 2310.06770 • Published Oct 10, 2023 • 9

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 70

upvoted a collection about 2 years ago

models to evaluate

collecting models I want to evaluate on shadereval-task2: https://github.com/bigcode-project/bigcode-evaluation-harness/pull/173 at fp16!! • 39 items • Updated Nov 17, 2024 • 2

upvoted 2 papers about 2 years ago

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Paper • 2401.03065 • Published Jan 5, 2024 • 11

TACO: Topics in Algorithmic COde generation dataset

Paper • 2312.14852 • Published Dec 22, 2023 • 4

upvoted a collection about 2 years ago

Interpretability

Select papers on language model interpretability with notes • 7 items • Updated Dec 18, 2025 • 4

upvoted 2 papers about 2 years ago

Analyzing Transformers in Embedding Space

Paper • 2209.02535 • Published Sep 6, 2022 • 3

Code Execution with Pre-trained Language Models

Paper • 2305.05383 • Published May 8, 2023 • 2

upvoted 6 papers over 2 years ago

CodeFusion: A Pre-trained Diffusion Model for Code Generation

Paper • 2310.17680 • Published Oct 26, 2023 • 73

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

Paper • 2210.01970 • Published Sep 30, 2022 • 13

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

Paper • 2102.04664 • Published Feb 9, 2021 • 2

Program Synthesis with Large Language Models

Paper • 2108.07732 • Published Aug 16, 2021 • 4

MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

Paper • 2208.08227 • Published Aug 17, 2022 • 1

Measuring Coding Challenge Competence With APPS

Paper • 2105.09938 • Published May 20, 2021 • 1

upvoted a collection over 2 years ago

Code Evaluation

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 16

upvoted a paper over 2 years ago

Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8