mz.w's picture

mz.w

iiiiwis

·

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

upvoted a paper 3 days ago

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

upvoted a paper about 1 month ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

View all activity

Organizations

None yet

Papers 9

arxiv:2604.14142

arxiv:2512.19673

arxiv:2505.02156

arxiv:2412.04905

models 1

iiiiwis/DEMO_Agent

Text Generation • Updated Dec 10, 2024 • 2

datasets 2

iiiiwis/AMPO

Preview • Updated May 15, 2025 • 59 • 1

iiiiwis/DEMO

Viewer • Updated Dec 16, 2024 • 7.98k • 17 • 1