Jeremy Young's picture

1

Jeremy Young

NewbieYoung

·

AI & ML interests

None yet

Recent Activity

commented on a paper 9 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

commented on a paper 17 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

View all activity

Organizations

commented a paper 9 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 216 •

commented a paper 17 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 261 •