Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
martimfasantos
/
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_adamw_3epochs
like
0
Text Generation
Transformers
TensorBoard
Safetensors
haoranxu/ALMA-R-Preference
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
tinyllama-1.1b-mt-dpo-full_LR5e-8_BS16_adamw_3epochs
/
runs
Commit History
Model save
f2e2a3a
verified
martimfasantos
commited on
Jul 9
Training in progress, step 4100
44811ff
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3900
5b4d978
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3700
7e0c399
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3500
5fbd3f3
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3300
381dcb3
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3100
deb5892
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2500
e8403aa
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2300
f945c01
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2100
fa8b048
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1900
94b2125
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1500
046dbaf
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1300
5d02707
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1100
fffb030
verified
martimfasantos
commited on
Jul 9
Training in progress, step 900
4f7132f
verified
martimfasantos
commited on
Jul 9
Training in progress, step 700
8218979
verified
martimfasantos
commited on
Jul 9
Training in progress, step 500
5b5d061
verified
martimfasantos
commited on
Jul 9
Training in progress, step 300
4682e14
verified
martimfasantos
commited on
Jul 9
Model save
9d61ede
verified
martimfasantos
commited on
Jul 9
Training in progress, step 4100
a859f06
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3900
27df8da
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3500
445a2f0
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3300
704a0fe
verified
martimfasantos
commited on
Jul 9
Training in progress, step 3100
6232f38
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2900
8c1fd0e
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2700
4e6bbb0
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2500
5661d2f
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2300
1957155
verified
martimfasantos
commited on
Jul 9
Training in progress, step 2100
8a6db7f
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1900
0ca43a1
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1700
9f12f2f
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1500
df49b95
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1300
22c3153
verified
martimfasantos
commited on
Jul 9
Training in progress, step 1100
5fe30ca
verified
martimfasantos
commited on
Jul 9
Training in progress, step 900
d1df991
verified
martimfasantos
commited on
Jul 9
Training in progress, step 700
8ae95b2
verified
martimfasantos
commited on
Jul 9
Training in progress, step 500
f88a2b8
verified
martimfasantos
commited on
Jul 9
Training in progress, step 300
52ba6f5
verified
martimfasantos
commited on
Jul 9