Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jiaxing
huangjiaxing
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
about 24 hours ago
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
authored
a paper
3 months ago
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
View all activity
Organizations
None yet
Papers
2
arxiv:
2503.12937
arxiv:
2412.18319
models
None public yet
datasets
None public yet