Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lole25
/
zephyr-7b-gpo-gen-i1
like
0
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
1138a30
zephyr-7b-gpo-gen-i1
/
runs
Commit History
Model save
79d4aca
verified
lole25
commited on
Apr 25
Training in progress, step 1200
6d5d5dc
verified
lole25
commited on
Apr 25
Training in progress, step 1100
59f720d
verified
lole25
commited on
Apr 25
Training in progress, step 900
0c5ad0d
verified
lole25
commited on
Apr 25
Training in progress, step 800
6bb9c17
verified
lole25
commited on
Apr 25
Training in progress, step 700
b4249e0
verified
lole25
commited on
Apr 25
Training in progress, step 600
0739e7a
verified
lole25
commited on
Apr 25
Training in progress, step 500
70316de
verified
lole25
commited on
Apr 25
Training in progress, step 400
2c79c39
verified
lole25
commited on
Apr 25
Training in progress, step 300
bd6547d
verified
lole25
commited on
Apr 25
Training in progress, step 200
17e1049
verified
lole25
commited on
Apr 25
Training in progress, step 100
a9424af
verified
lole25
commited on
Apr 25