Hugging Face
wzhouad/Llama3-Instruct-8B-WPO-HB-v2
Text Generation
Transformers
Safetensors
wzhouad/llama3-ultrafeedback-hybrid-v2
llama
alignment-handbook
conversational
text-generation-inference
Inference Endpoints
arxiv:2406.11827
arxiv:2310.01377
main
Llama3-Instruct-8B-WPO-HB-v2/special_tokens_map.json
Commit History
Upload tokenizer
424a865
verified
wzhouad committed on Jul 24, 2024