E Sanchez's picture

E Sanchez

esanchez43

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

arshan-ritual/ritual-agent-configs

upvoted a paper 14 days ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

upvoted a paper 21 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

View all activity

Organizations

None yet

upvoted a paper 14 days ago

UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards

Paper • 2604.14967 • Published 21 days ago • 15

upvoted a paper 21 days ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published 24 days ago • 101

upvoted a paper 24 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 28 days ago • 289

upvoted a paper 25 days ago

Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

Paper • 2604.08476 • Published 27 days ago • 8

upvoted a paper 29 days ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 627

upvoted a paper about 1 month ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 341

upvoted 2 papers about 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

upvoted 3 papers 2 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 523