arxiv:2604.14142
mz.w
iiiiwis
AI & ML interests
None yet
Recent Activity
authored a paper 3 days ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
upvoted a paper 3 days ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
upvoted a paper about 1 month ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
Organizations
None yet