Tianqi Liu's picture

3 11

Tianqi Liu

TianqiLiuAI

·

AI & ML interests

None yet

Organizations

TianqiLiuAI's activity

commented a paper 6 months ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5 •

commented a paper 7 months ago

Building Math Agents with Multi-Turn Iterative Preference Learning

Paper • 2409.02392 • Published Sep 4, 2024 • 15 •

commented a paper about 1 year ago

LiPO: Listwise Preference Optimization through Learning-to-Rank

Paper • 2402.01878 • Published Feb 2, 2024 • 20 •