Dian Yu's picture

1 10 1

Dian Yu

yudian

·

https://scholar.google.com/citations?user=ERdzqyYAAAAJ&hl=en

AI & ML interests

NLP

Recent Activity

authored a paper 1 day ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

authored a paper 1 day ago

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

authored a paper 1 day ago

Improving LLM General Preference Alignment via Optimistic Online Mirror Descent

View all activity

Organizations

None yet

yudian's activity

upvoted a paper about 1 month ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted a paper 2 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

upvoted a paper 5 months ago

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search

Paper • 2410.03864 • Published Oct 4, 2024 • 12

upvoted a collection 8 months ago

Reinforcement Learning (RL / RLHF)

19 items • Updated Oct 22, 2024 • 1

upvoted 4 papers 9 months ago

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 39

Iterative Nash Policy Optimization: Aligning LLMs with General Preferences via No-Regret Learning

Paper • 2407.00617 • Published Jun 30, 2024 • 7

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 97

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Paper • 2406.12050 • Published Jun 17, 2024 • 19

upvoted a paper 11 months ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55

upvoted a paper about 1 year ago

Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models

Paper • 2308.00304 • Published Aug 1, 2023 • 23