Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
LaoRay
/
zephyr-7b-dpo-lora
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
trl
dpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-dpo-lora
/
adapter_model.safetensors
Commit History
Training in progress, epoch 19
795da39
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 18
394ac80
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 18
c98f091
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 16
1e38432
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 16
fbf738e
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 14
e4d1c35
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 14
bb59df3
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 12
2cd86ed
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 12
8cdbb1a
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 10
087f83a
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 10
a9a69b6
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 8
d00088e
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 8
2ea3fca
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 6
d85761e
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 6
7c1cd62
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 4
0cb4254
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 4
ef411f8
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 2
7c9506b
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 2
a245f1e
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 0
90178ce
verified
LaoRay
commited on
Aug 14
Training in progress, epoch 0
fbc2401
verified
LaoRay
commited on
Aug 13