Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO-2
/
zephyr-7b-gpo-v6-i1
like
0
Follow
DUAL-GPO-2
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
3b81c80
zephyr-7b-gpo-v6-i1
/
runs
Commit History
Model save
3b81c80
verified
lole25
commited on
May 7
Training in progress, step 2500
f8eb0a8
verified
lole25
commited on
May 7
Training in progress, step 2400
ae2583a
verified
lole25
commited on
May 7
Training in progress, step 2200
e641e44
verified
lole25
commited on
May 7
Training in progress, step 2100
6b1c93a
verified
lole25
commited on
May 7
Training in progress, step 2000
bf6e1c4
verified
lole25
commited on
May 7
Training in progress, step 1900
56895c7
verified
lole25
commited on
May 7
Training in progress, step 1800
5cea94b
verified
lole25
commited on
May 7
Training in progress, step 1700
bd51477
verified
lole25
commited on
May 7
Training in progress, step 1600
58ccaac
verified
lole25
commited on
May 7
Training in progress, step 1500
4cc523f
verified
lole25
commited on
May 7
Training in progress, step 1400
57a452a
verified
lole25
commited on
May 7
Training in progress, step 1300
9bcc354
verified
lole25
commited on
May 7
Training in progress, step 1100
1625d7f
verified
lole25
commited on
May 7
Training in progress, step 900
d93a0d7
verified
lole25
commited on
May 7
Training in progress, step 800
c37aa6b
verified
lole25
commited on
May 7
Training in progress, step 700
ac96892
verified
lole25
commited on
May 7
Training in progress, step 600
ab09ae3
verified
lole25
commited on
May 7
Training in progress, step 500
0d51dca
verified
lole25
commited on
May 7
Training in progress, step 400
ef593bb
verified
lole25
commited on
May 7
Training in progress, step 300
fa5c7e1
verified
lole25
commited on
May 6
Training in progress, step 200
a26e01c
verified
lole25
commited on
May 6
Training in progress, step 100
72460d3
verified
lole25
commited on
May 6