zhiyuanyou
AI & ML interests
None yet
Recent Activity
authored a paper 4 days ago
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
updated a dataset 14 days ago
zhiyuanyou/Data-DeQA-Score
Organizations
None yet
zhiyuanyou's activity
Is vicuna1.5 tuned from Llama-2 with or without reinforcement learning?
2
#6 opened over 1 year ago by zhiyuanyou