
Luo

ramiroluo

LuoXiaoHeics

AI & ML interests

None yet

Recent Activity

upvoted a paper 35 minutes ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 2 days ago • 56

upvoted a paper 1 day ago

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published 8 days ago • 24

upvoted a paper 2 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published 4 days ago • 20

upvoted a paper 23 days ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published 24 days ago • 34

upvoted a paper about 1 month ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 85

upvoted a paper 3 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

submitted a paper to Daily Papers 3 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

New activity in PRIME-RL/P1-VL-30B-A3B 3 months ago

Add metadata and link to paper/code

#1 opened 3 months ago by nielsr

New activity in PRIME-RL/P1-VL-235B-A22B 3 months ago

Add metadata and links to paper and code

#1 opened 3 months ago by nielsr

authored 2 papers 3 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 32

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

upvoted a paper 3 months ago

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

updated a model 3 months ago

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 7 • 3

published 2 models 3 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 19 • 3

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 7 • 3

updated a model 3 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 19 • 3

upvoted a paper 7 months ago

Spotlight on Token Perception for Multimodal Reinforcement Learning

Paper • 2510.09285 • Published Oct 10, 2025 • 37

updated a Space over 2 years ago

HalluChecker

😻

Display leaderboard for LLM hallucination checks
