liuhuanbin

huanbin11

AI & ML interests

None yet

Organizations

None yet

Collections 3

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

Paper • 2410.05193 • Published 5 days ago • 10

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published 8 days ago • 5

Self-Boosting Large Language Models with Synthetic Preference Data

Paper • 2410.06961 • Published 3 days ago • 13

Models

None public yet

Datasets

None public yet
