arxiv:2512.14614
wenq
wenqsun
AI & ML interests
Machine learning, computer vision
Recent Activity
upvoted a paper about 2 months ago
WorldCompass: Reinforcement Learning for Long-Horizon World Models
upvoted a paper about 2 months ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
Organizations
None yet