2 6

Jongwon Lim

Jongwondd

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

RobotValues: Evaluating Household Robots When Human Values Conflict

upvoted a paper 3 days ago

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

upvoted a paper 21 days ago

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

View all activity

Organizations

upvoted 2 papers 3 days ago

RobotValues: Evaluating Household Robots When Human Values Conflict

Paper • 2606.03312 • Published 6 days ago • 24

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

Paper • 2606.05553 • Published 4 days ago • 45

upvoted a paper 21 days ago

Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

Paper • 2605.14368 • Published 25 days ago • 16

submitted a paper to Daily Papers 24 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published about 1 month ago • 22

commented a paper 24 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published about 1 month ago • 22 •

submitted a paper to Daily Papers 26 days ago

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published about 1 month ago • 18

authored 3 papers 26 days ago

DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine

Paper • 2411.09255 • Published Nov 14, 2024

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction

Paper • 2601.05654 • Published Apr 19

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published about 1 month ago • 18

upvoted 2 papers 28 days ago

KL for a KL: On-Policy Distillation with Control Variate Baseline

Paper • 2605.07865 • Published about 1 month ago • 22

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States

Paper • 2605.07579 • Published about 1 month ago • 18

updated a model about 1 month ago

Jongwondd/GRESO_step_90

4B • Updated May 1 • 5

published 2 models about 1 month ago

Jongwondd/GRESO_step_90

4B • Updated May 1 • 5

Jongwondd/Qwen3-4B_GRESO_batch_256

Updated May 1

updated a model about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25

published a model about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25

updated a dataset about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25 • 10

published a dataset about 1 month ago

Jongwondd/convai_hw1

Updated Apr 25 • 10

upvoted a paper about 2 months ago

ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided Decoding

Paper • 2510.00546 • Published Apr 20 • 14

Jongwon Lim

AI & ML interests

Recent Activity

Organizations

Jongwondd's activity