16 25 3

Taiwei Shi

MaksimSTW

https://taiweis.com

AI & ML interests

reinforcement learning, alignment, human-AI collaboration, and computational social science

Recent Activity

authored a paper 1 day ago

Video-Based Reward Modeling for Computer-Use Agents

upvoted a paper 12 days ago

Video-Based Reward Modeling for Computer-Use Agents

authored a paper 21 days ago

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning

View all activity

Organizations

authored a paper 1 day ago

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published 14 days ago • 42

upvoted a paper 12 days ago

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published 14 days ago • 42

authored a paper 21 days ago

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning

Paper • 2602.18633 • Published Feb 20 • 2

upvoted a paper 22 days ago

DP-RFT: Learning to Generate Synthetic Text via Differentially Private Reinforcement Fine-Tuning

Paper • 2602.18633 • Published Feb 20 • 2

authored a paper about 1 month ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 71

updated a collection about 1 month ago

Papers from LIME Lab

Collection

Papers from LIME Lab • 9 items • Updated Feb 17 • 2

upvoted a paper about 1 month ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 71

submitted a paper to Daily Papers about 1 month ago

Experiential Reinforcement Learning

Paper • 2602.13949 • Published Feb 15 • 71

upvoted 2 papers 5 months ago

Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning

Paper • 2510.23473 • Published Oct 27, 2025 • 86

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 138

authored a paper 7 months ago

CoAct-1: Computer-using Agents with Coding as Actions

Paper • 2508.03923 • Published Aug 5, 2025 • 13

upvoted a paper 7 months ago

CoAct-1: Computer-using Agents with Coding as Actions

Paper • 2508.03923 • Published Aug 5, 2025 • 13

upvoted a paper 9 months ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Paper • 2506.11930 • Published Jun 13, 2025 • 53

updated 7 models 10 months ago

Taiwei Shi

AI & ML interests

Recent Activity

Organizations

MaksimSTW's activity