rizvi-rahil786's picture
End of training
9d36701 verified
|
raw
history blame
1.95 kB
metadata
license: apache-2.0
base_model: t5-small
tags:
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-hardaDerailKP
    results: []

t5-small-hardaDerailKP

This model is a fine-tuned version of t5-small on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 1.1782
  • Rouge1: 51.8379
  • Rouge2: 41.3714
  • Rougel: 51.9665
  • Rougelsum: 51.9518
  • Gen Len: 6.8067

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 5

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
1.2491 1.0 3079 1.2203 51.123 40.4019 51.1098 51.1009 6.8389
1.0688 2.0 6158 1.1890 50.7818 40.0174 50.827 50.8336 6.7285
1.0126 3.0 9237 1.1782 51.8379 41.3714 51.9665 51.9518 6.8067
0.9778 4.0 12316 1.1825 51.1751 40.5841 51.231 51.254 6.8103
0.9071 5.0 15395 1.1858 51.0503 40.212 51.1279 51.1203 6.7655

Framework versions

  • Transformers 4.39.1
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2