eng-jagoy-t5-001 / README.md
tarsssss's picture
Training in progress epoch 40
5940fac
|
raw
history blame
2.95 kB
metadata
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_keras_callback
model-index:
  - name: tarsssss/eng-jagoy-t5-001
    results: []

tarsssss/eng-jagoy-t5-001

This model is a fine-tuned version of t5-small on an unknown dataset. It achieves the following results on the evaluation set:

  • Train Loss: 5.6945
  • Validation Loss: 5.6383
  • Epoch: 40

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
  • training_precision: float32

Training results

Train Loss Validation Loss Epoch
7.8603 7.4105 0
7.3775 7.1273 1
7.1632 6.9598 2
7.0228 6.8372 3
6.9085 6.7335 4
6.8226 6.6458 5
6.7451 6.5671 6
6.6785 6.5022 7
6.6254 6.4409 8
6.5606 6.3842 9
6.5163 6.3361 10
6.4682 6.2908 11
6.4250 6.2436 12
6.3749 6.1907 13
6.3293 6.1494 14
6.2822 6.1098 15
6.2560 6.0750 16
6.2078 6.0508 17
6.1839 6.0229 18
6.1561 5.9944 19
6.1146 5.9732 20
6.0885 5.9490 21
6.0587 5.9243 22
6.0366 5.9064 23
6.0135 5.8857 24
5.9904 5.8675 25
5.9681 5.8482 26
5.9473 5.8262 27
5.9263 5.8127 28
5.9031 5.7896 29
5.8827 5.7721 30
5.8566 5.7482 31
5.8406 5.7355 32
5.8285 5.7231 33
5.7944 5.7049 34
5.7822 5.6968 35
5.7567 5.6813 36
5.7526 5.6650 37
5.7363 5.6614 38
5.7132 5.6398 39
5.6945 5.6383 40

Framework versions

  • Transformers 4.33.2
  • TensorFlow 2.10.0
  • Datasets 2.15.0
  • Tokenizers 0.13.3