Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Magpie-Align
/
Llama-3.1-8B-Magpie-Align-v0.2
like
2
Safetensors
Magpie-Align/Llama-3.1-70B-PO-100K-armorm
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
arxiv:
2406.08464
arxiv:
2406.12845
License:
llama3.1
Model card
Files
Files and versions
Community
Train
main
Llama-3.1-8B-Magpie-Align-v0.2
Commit History
Update README.md
30e6682
verified
flydust
commited on
Aug 19
Update README.md
9ebe5f7
verified
flydust
commited on
Aug 19
Update README.md
88d17cd
verified
flydust
commited on
Aug 19
End of training
f98f101
verified
flydust
commited on
Aug 3
Model save
d8eec6f
verified
flydust
commited on
Aug 3
Training in progress, step 765
cf8884e
verified
flydust
commited on
Aug 3
Training in progress, step 700
037bbde
verified
flydust
commited on
Aug 3
Training in progress, step 600
0fa5b18
verified
flydust
commited on
Aug 3
Training in progress, step 500
0a183b3
verified
flydust
commited on
Aug 3
Training in progress, step 400
b8c4d60
verified
flydust
commited on
Aug 3
Training in progress, step 300
fa08f18
verified
flydust
commited on
Aug 3
Training in progress, step 200
bf37a10
verified
flydust
commited on
Aug 2
Training in progress, step 100
f92c0a2
verified
flydust
commited on
Aug 2
initial commit
ba80c4f
verified
flydust
commited on
Aug 2