arxiv:2412.03704
Xiyao Wang
russwang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 21 hours ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
updated
a model
14 days ago
russwang/VisVM-LLaVA-Next-Mistral-7B
Organizations
Papers
10
models
10
russwang/VisVM-LLaVA-Next-Mistral-7B
Updated
•
4
russwang/llava-ov-sft-v1-e3
Updated
russwang/video-ckpt-llm-only-v1-1e-5-e3
Updated
•
4
russwang/video-ckpt-v1-2e-5-e1
Updated
•
3
russwang/video-ckpt-v1-e1
Updated
•
3
russwang/video-ckpt-v1-e3
Updated
•
2
russwang/video-ckpt-v2-e3
Updated
•
2
russwang/video-ckpt-v2-e1
Updated
•
5
russwang/vila_v1_2e-5_ckpt
Updated
•
4
russwang/MCTS_DPO
Updated