Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
martimfasantos
/
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_rmsprop_2epochs_new
like
0
Text Generation
Transformers
TensorBoard
Safetensors
haoranxu/ALMA-R-Preference
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_rmsprop_2epochs_new
Commit History
End of training
ebb39e4
verified
martimfasantos
commited on
Jul 14
Model save
78dc60f
verified
martimfasantos
commited on
Jul 14
Training in progress, step 2700
19a535d
verified
martimfasantos
commited on
Jul 14
Training in progress, step 2500
372cf40
verified
martimfasantos
commited on
Jul 14
Training in progress, step 2300
6248c21
verified
martimfasantos
commited on
Jul 14
Training in progress, step 2100
6dcfd52
verified
martimfasantos
commited on
Jul 14
Training in progress, step 1900
9b20565
verified
martimfasantos
commited on
Jul 14
Training in progress, step 1700
d4f0e4b
verified
martimfasantos
commited on
Jul 14
Training in progress, step 1500
1d40cda
verified
martimfasantos
commited on
Jul 14
Training in progress, step 1300
9380bcb
verified
martimfasantos
commited on
Jul 14
Training in progress, step 1100
31de0dd
verified
martimfasantos
commited on
Jul 14
Training in progress, step 900
6b491be
verified
martimfasantos
commited on
Jul 14
Training in progress, step 800
47b49d3
verified
martimfasantos
commited on
Jul 14
Training in progress, step 600
3c1e1c2
verified
martimfasantos
commited on
Jul 13
Training in progress, step 500
1a94fd9
verified
martimfasantos
commited on
Jul 13
Training in progress, step 300
bf19df0
verified
martimfasantos
commited on
Jul 13
Training in progress, step 100
a3adfc8
verified
martimfasantos
commited on
Jul 13
initial commit
57f86b6
verified
martimfasantos
commited on
Jul 13