LinkedIn

company

Verified

https://www.linkedin.com

AI & ML interests

None defined yet.

Recent Activity

pb09204048 authored a paper about 2 months ago

TIP: Token Importance in On-Policy Distillation

pb09204048 submitted a paper 3 months ago

On-Policy Self-Distillation for Reasoning Compression

pb09204048 authored a paper 3 months ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

View all activity

Papers

Support Tokens, Stability Margins, and a New Foundation for Robust LLMs

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

View all Papers

Articles

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

models 0

None public yet

datasets 0

None public yet