plaguss/zephyr-7b-lora-adapter-dpo-dibt-v0
PEFT
Safetensors
mistral
choo-choo
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
License: apache-2.0
72788aa
zephyr-7b-lora-adapter-dpo-dibt-v0/README.md
Commit History
72788aa (verified): End of training. plaguss (HF staff) committed on Mar 11
3e6253c (verified): Model save. plaguss (HF staff) committed on Mar 11
f285af2 (verified): End of training. plaguss (HF staff) committed on Mar 6
847c3d7 (verified): Model save. plaguss (HF staff) committed on Mar 6