wzhouad/Llama3-Instruct-8B-WPO-HB-v2
Text Generation
Transformers
Safetensors
wzhouad/llama3-ultrafeedback-hybrid-v2
llama
alignment-handbook
conversational
text-generation-inference
Inference Endpoints
arxiv:2406.11827
arxiv:2310.01377
1 contributor
History: 1 commit
wzhouad, initial commit, d73868a (verified), 6 months ago
.gitattributes, Safe, 1.52 kB, initial commit, 6 months ago