YYYYYYibo/zephyr-7b-dpo-qlora-min-pi-part-0
PEFT
Safetensors
updated
original
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License: apache-2.0
580edf6
zephyr-7b-dpo-qlora-min-pi-part-0/trainer_state.json
Commit History
25285c8 (verified): Model save, YYYYYYibo committed on Apr 26
4738e8b (verified): Model save, YYYYYYibo committed on Apr 26
bc59731 (verified): Model save, YYYYYYibo committed on Apr 26
8f2b315 (verified): Model save, YYYYYYibo committed on Apr 26