QRQ's picture

1 35 2

QRQ

RichardQRQ

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

upvoted a paper about 7 hours ago

ToolRL: Reward is All Tool Learning Needs

upvoted a paper about 7 hours ago

Learning to Reason under Off-Policy Guidance

View all activity

Organizations

None yet

RichardQRQ's activity

upvoted 3 papers about 7 hours ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published about 19 hours ago • 29

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published 6 days ago • 21

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published 1 day ago • 46

upvoted 2 papers 7 days ago

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Paper • 2504.09130 • Published 10 days ago • 11

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 8 days ago • 237

upvoted a paper 26 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 27 days ago • 139

upvoted 7 papers about 1 month ago

STEVE: AStep Verification Pipeline for Computer-use Agent Training

Paper • 2503.12532 • Published Mar 16 • 15

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 47

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills

Paper • 2503.12533 • Published Mar 16 • 64

VisualPRM: An Effective Process Reward Model for Multimodal Reasoning

Paper • 2503.10291 • Published Mar 13 • 35

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published Mar 13 • 16

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 230

SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models

Paper • 2503.07605 • Published Mar 10 • 67

upvoted 2 papers about 2 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 109

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published Feb 14 • 35

liked a dataset 3 months ago

We-Math/We-Math

Viewer • Updated Sep 6, 2024 • 1.74k • 273 • 18

upvoted 3 papers 3 months ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 27

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54

liked a dataset 3 months ago

terryoo/TableVQA-Bench

Viewer • Updated Apr 25, 2024 • 1.5k • 1.78k • 23