|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- ambrosfitz/ps_data_v2.2 |
|
--- |
|
Run history: |
|
|
|
train/epoch βββββββββββ
β
β
ββββββββ<br> |
|
train/global_step βββββββββββ
β
β
ββββββββ<br> |
|
train/grad_norm ββββ
ββ
ββββββββββββββ<br> |
|
train/learning_rate βββββ
β
ββββββββ
β
βββββ<br> |
|
train/loss βββββ
β
ββββββββββββββ<br> |
|
train/total_flos β<br> |
|
train/train_loss β<br> |
|
train/train_runtime β<br> |
|
train/train_samples_per_second β<br> |
|
train/train_steps_per_second β<br> |
|
|
|
Run summary: |
|
|
|
train/epoch 2.0<br> |
|
train/global_step 20<br> |
|
train/grad_norm 0.13779<br> |
|
train/learning_rate 0.0<br> |
|
train/loss 1.1365<br> |
|
train/total_flos 4.579249185376512e+16<br> |
|
train/train_loss 1.29891<br> |
|
train/train_runtime 1552.5749<br> |
|
train/train_samples_per_second 1.649<br> |
|
train/train_steps_per_second 0.013<br> |