Dhrupad Bhardwaj

dhrupadb

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

liked a model 3 months ago

meta-llama/Llama-3.2-3B-Instruct

Organizations

None yet

dhrupadb's activity

upvoted a paper 3 days ago

PILAF: Optimal Human Preference Sampling for Reward Modeling

Paper • 2502.04270 • Published 7 days ago • 10