Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
glider
/
zephyr-7b-dpo-qlora
like
0
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
zephyr-7b-dpo-qlora
/
runs
Ctrl+K
Ctrl+K
1 contributor
History:
12 commits
glider
Training in progress, step 100
9fd5963
verified
4 months ago
Dec09_09-45-47_node118
Training in progress, step 100
4 months ago
Dec09_12-08-18_node118
Training in progress, step 100
4 months ago
Dec09_12-48-24_node118
Training in progress, step 100
4 months ago
Dec09_12-56-25_node118
Training in progress, step 100
4 months ago
Dec10_10-29-49_node006
Training in progress, step 955
4 months ago
Dec10_13-28-56_node006
Model save
4 months ago
Dec14_01-37-07_c263-gz3-server-iv-002
Training in progress, step 100
4 months ago