cruxeval

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

minimario authored a paper 9 months ago

SantaCoder: don't reach for the stars!

minimario authored a paper 9 months ago

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

minimario authored a paper 9 months ago

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

View all activity

cruxeval-org's activity

minimario

authored 3 papers 9 months ago

SantaCoder: don't reach for the stars!

Paper • 2301.03988 • Published Jan 9, 2023 • 7

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Paper • 2310.15164 • Published Oct 23, 2023 • 1

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

Paper • 2403.07974 • Published Mar 12 • 1

minimario

authored a paper 10 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

minimario

updated a dataset 11 months ago

cruxeval-org/cruxeval

Viewer • Updated Jan 23 • 800 • 1.43k • 13

minimario

authored a paper 12 months ago

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Paper • 2401.03065 • Published Jan 5 • 11

sidaw

authored a paper 12 months ago

CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

Paper • 2401.03065 • Published Jan 5 • 11

minimario

authored 2 papers over 1 year ago

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Paper • 2306.15626 • Published Jun 27, 2023 • 17

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 29