Xiaoye Qu

Xiaoye08

Xiaoye08's activity

upvoted a paper 2 days ago

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Paper • 2504.13055 • Published 4 days ago • 16

upvoted a paper 5 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 7 days ago • 232

upvoted 2 papers 17 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 18 days ago • 52

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published 18 days ago • 30

upvoted 3 papers 20 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 23 days ago • 45

What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published 21 days ago • 52

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published 21 days ago • 18

authored a paper 21 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 25 days ago • 39

commented on a paper 21 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 25 days ago • 39

upvoted a paper 21 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 25 days ago • 39

commented on a paper 21 days ago

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published 25 days ago • 39

upvoted a paper 28 days ago

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Paper • 2503.12821 • Published Mar 17 • 9

authored a paper 28 days ago

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Paper • 2503.12821 • Published Mar 17 • 9

upvoted 2 papers about 1 month ago

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 21

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 118

authored a paper about 1 month ago

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7 • 7

upvoted a paper about 1 month ago

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7 • 7

upvoted 3 papers about 2 months ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published Mar 3 • 76

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Paper • 2503.01496 • Published Mar 3 • 16

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 28