Yale NLP Lab

university

https://nlp.cs.yale.edu/

Activity Feed Request to join this org

AI & ML interests

Natural Language Processing at Yale

Recent Activity

mikeweii updated a collection 44 minutes ago

mikeweii updated a collection 44 minutes ago

mikeweii updated a collection 44 minutes ago

View all activity

Papers

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

References Improve LLM Alignment in Non-Verifiable Domains

View all Papers

Collections 6

View 6 collections

spaces 3

InstruSumEval

AbGen

Generate an ablation study design based on research details

LimitGen Demo

demo

models 94

yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert

0.1B • Updated about 1 hour ago

yale-nlp/gpt-oss-20b_webarena-verified_stuck-bert

0.1B • Updated about 1 hour ago

yale-nlp/AgentTrek-1.0-32B_webarena-verified_stuck-bert

0.1B • Updated about 1 hour ago

yale-nlp/gpt-oss-20b_webarena-verified_milestone-bert

0.1B • Updated about 1 hour ago

yale-nlp/modernbert-evocua-milestone-detector

0.1B • Updated 3 days ago • 12

yale-nlp/modernbert-evocua-stuck-detector

0.1B • Updated 3 days ago • 11

yale-nlp/modernbert-qwen-milestone-detector

0.1B • Updated 3 days ago • 13

yale-nlp/modernbert-qwen-stuck-detector

0.1B • Updated 3 days ago • 12

yale-nlp/Qwen3-VL-8B-Anchor-Windows

770k • Updated Feb 4

yale-nlp/Qwen2.5-VL-7B-Anchor-Windows

849k • Updated Feb 4

datasets 28

yale-nlp/Anchor

Viewer • Updated Jan 27 • 30.6k • 28

yale-nlp/MedTutor

Updated Nov 5, 2025 • 295 • 2

yale-nlp/SciArena

Viewer • Updated Oct 15, 2025 • 13.2k • 64 • 25

yale-nlp/SciReas-Pro

Viewer • Updated Sep 18, 2025 • 1.36k • 18 • 1

yale-nlp/MSRS

Viewer • Updated Sep 1, 2025 • 2.44k • 77 • 2

yale-nlp/SciArena-Eval

Viewer • Updated Aug 25, 2025 • 2k • 6

yale-nlp/SciArena-with-paperbank

Viewer • Updated Aug 25, 2025 • 15.2k • 17

yale-nlp/SciDQA

Viewer • Updated Aug 3, 2025 • 2.94k • 125 • 2

yale-nlp/AbGen

Viewer • Updated Jul 14, 2025 • 3.3k • 27 • 3

yale-nlp/LimitGen

Updated Jul 3, 2025 • 72

View 28 datasets