Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Magpie-Align
/
Llama-3.1-8B-Magpie-Align-v0.2
like
2
Safetensors
Magpie-Align/Llama-3.1-70B-PO-100K-armorm
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
arxiv:
2406.08464
arxiv:
2406.12845
License:
llama3.1
Model card
Files
Files and versions
Community
Train
main
Llama-3.1-8B-Magpie-Align-v0.2
/
model-00002-of-00004.safetensors
Commit History
Training in progress, step 765
cf8884e
verified
flydust
commited on
Aug 3
Training in progress, step 700
037bbde
verified
flydust
commited on
Aug 3
Training in progress, step 600
0fa5b18
verified
flydust
commited on
Aug 3
Training in progress, step 500
0a183b3
verified
flydust
commited on
Aug 3
Training in progress, step 400
b8c4d60
verified
flydust
commited on
Aug 3
Training in progress, step 300
fa08f18
verified
flydust
commited on
Aug 3
Training in progress, step 200
bf37a10
verified
flydust
commited on
Aug 2
Training in progress, step 100
f92c0a2
verified
flydust
commited on
Aug 2