arxiv:2501.12895
Xiaoye Qu
Xiaoye08
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative
Textual Feedback
upvoted
a
paper
15 days ago
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language
Models
Organizations
Papers
10
models
1
datasets
None public yet