StepWise
AI & ML interests
Natural Language Processing at Yale
Papers
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
References Improve LLM Alignment in Non-Verifiable Domains
Models (94)
yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert
0.1B • Updated
yale-nlp/gpt-oss-20b_webarena-verified_stuck-bert
0.1B • Updated
yale-nlp/AgentTrek-1.0-32B_webarena-verified_stuck-bert
0.1B • Updated
yale-nlp/gpt-oss-20b_webarena-verified_milestone-bert
0.1B • Updated
yale-nlp/modernbert-evocua-milestone-detector
0.1B • Updated • 12
yale-nlp/modernbert-evocua-stuck-detector
0.1B • Updated • 11
yale-nlp/modernbert-qwen-milestone-detector
0.1B • Updated • 13
yale-nlp/modernbert-qwen-stuck-detector
0.1B • Updated • 12
yale-nlp/Qwen3-VL-8B-Anchor-Windows
770k • Updated
yale-nlp/Qwen2.5-VL-7B-Anchor-Windows
849k • Updated
Datasets (28)
yale-nlp/Anchor
Viewer • Updated • 30.6k • 28
yale-nlp/MedTutor
Updated • 295 • 2
yale-nlp/SciArena
Viewer • Updated • 13.2k • 64 • 25
yale-nlp/SciReas-Pro
Viewer • Updated • 1.36k • 18 • 1
yale-nlp/MSRS
Viewer • Updated • 2.44k • 77 • 2
yale-nlp/SciArena-Eval
Viewer • Updated • 2k • 6
yale-nlp/SciArena-with-paperbank
Viewer • Updated • 15.2k • 17
yale-nlp/SciDQA
Viewer • Updated • 2.94k • 125 • 2
yale-nlp/AbGen
Viewer • Updated • 3.3k • 27 • 3
yale-nlp/LimitGen
Updated • 72