flintlock_3B_v0.1 / README.md
ambrosfitz's picture
Update README.md
243f70a verified
|
raw
history blame
883 Bytes
metadata
license: apache-2.0
datasets:
  - ambrosfitz/ps_data_v2.2

Run history:

train/epoch β–β–β–‚β–‚β–‚β–ƒβ–ƒβ–„β–„β–„β–…β–…β–…β–†β–†β–‡β–‡β–‡β–ˆβ–ˆβ–ˆ train/global_step β–β–β–‚β–‚β–‚β–ƒβ–ƒβ–„β–„β–„β–…β–…β–…β–†β–†β–‡β–‡β–‡β–ˆβ–ˆβ–ˆ train/grad_norm β–ˆβ–ˆβ–„β–…β–„β–…β–ƒβ–‚β–‚β–„β–‚β–‚β–‚β–‚β–β–β–β–β–β– train/learning_rate β–‚β–‚β–ƒβ–„β–…β–…β–†β–‡β–‡β–ˆβ–‡β–‡β–†β–…β–…β–„β–ƒβ–‚β–‚β– train/loss β–‡β–ˆβ–‡β–†β–…β–…β–„β–„β–ƒβ–ƒβ–‚β–‚β–‚β–β–‚β–β–β–β–β– train/total_flos ▁ train/train_loss ▁ train/train_runtime ▁ train/train_samples_per_second ▁ train/train_steps_per_second ▁

Run summary:

train/epoch 2.0 train/global_step 20 train/grad_norm 0.13779 train/learning_rate 0.0 train/loss 1.1365 train/total_flos 4.579249185376512e+16 train/train_loss 1.29891 train/train_runtime 1552.5749 train/train_samples_per_second 1.649 train/train_steps_per_second 0.013