flan-log-sage / README.md
IrwinD's picture
End of training
467d2d3 verified
|
raw
history blame
21.8 kB
metadata
license: apache-2.0
base_model: google/flan-t5-small
tags:
  - generated_from_trainer
datasets:
  - hdfs_log_summary_dataset
metrics:
  - rouge
model-index:
  - name: flan-log-sage
    results:
      - task:
          name: Sequence-to-sequence Language Modeling
          type: text2text-generation
        dataset:
          name: hdfs_log_summary_dataset
          type: hdfs_log_summary_dataset
          config: default
          split: test
          args: default
        metrics:
          - name: Rouge1
            type: rouge
            value: 0.3738

flan-log-sage

This model is a fine-tuned version of google/flan-t5-small on the hdfs_log_summary_dataset dataset. It achieves the following results on the evaluation set:

  • Loss: 2.0636
  • Rouge1: 0.3738
  • Rouge2: 0.1028
  • Rougel: 0.2953
  • Rougelsum: 0.2962
  • Gen Len: 19.0

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 6
  • eval_batch_size: 6
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 200
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Rouge1 Rouge2 Rougel Rougelsum Gen Len
No log 1.0 7 3.9522 0.1033 0.034 0.1025 0.1023 19.0
No log 2.0 14 3.7383 0.0895 0.0152 0.0895 0.0903 19.0
No log 3.0 21 3.5753 0.184 0.0043 0.1365 0.1364 19.0
No log 4.0 28 3.4379 0.1978 0.0043 0.1442 0.1437 19.0
No log 5.0 35 3.3177 0.1967 0.0043 0.1479 0.1473 19.0
No log 6.0 42 3.2163 0.2099 0.0157 0.1487 0.1482 19.0
No log 7.0 49 3.1227 0.2061 0.0115 0.1533 0.153 18.6
No log 8.0 56 3.0415 0.1742 0.0165 0.1209 0.1187 16.8
No log 9.0 63 2.9693 0.2544 0.0452 0.2011 0.2013 18.1
No log 10.0 70 2.8964 0.2847 0.0431 0.2125 0.2127 18.4
No log 11.0 77 2.8268 0.292 0.041 0.2144 0.2139 18.9
No log 12.0 84 2.7622 0.316 0.0413 0.2289 0.2286 19.0
No log 13.0 91 2.7027 0.316 0.0413 0.2289 0.2286 19.0
No log 14.0 98 2.6504 0.3384 0.0535 0.2504 0.2498 19.0
No log 15.0 105 2.5991 0.332 0.0657 0.2351 0.2345 17.8
No log 16.0 112 2.5532 0.323 0.0774 0.2514 0.2507 17.9
No log 17.0 119 2.5090 0.3752 0.0989 0.3083 0.3067 18.8
No log 18.0 126 2.4714 0.4073 0.1829 0.363 0.3602 19.0
No log 19.0 133 2.4451 0.4031 0.1918 0.3644 0.3628 19.0
No log 20.0 140 2.4147 0.3988 0.1825 0.3444 0.3428 19.0
No log 21.0 147 2.3924 0.4112 0.1871 0.3496 0.3479 19.0
No log 22.0 154 2.3742 0.4413 0.1989 0.3648 0.3621 19.0
No log 23.0 161 2.3566 0.469 0.2249 0.3902 0.386 19.0
No log 24.0 168 2.3394 0.4573 0.2049 0.3761 0.371 19.0
No log 25.0 175 2.3192 0.4612 0.2045 0.3751 0.3695 19.0
No log 26.0 182 2.3011 0.4606 0.2153 0.3791 0.3741 19.0
No log 27.0 189 2.2876 0.4568 0.1974 0.3697 0.3657 19.0
No log 28.0 196 2.2773 0.452 0.1944 0.3694 0.3655 19.0
No log 29.0 203 2.2647 0.4392 0.1819 0.3649 0.3619 19.0
No log 30.0 210 2.2535 0.4234 0.1683 0.359 0.3567 19.0
No log 31.0 217 2.2380 0.4234 0.1614 0.359 0.3567 19.0
No log 32.0 224 2.2316 0.4189 0.1645 0.3464 0.3425 19.0
No log 33.0 231 2.2243 0.4239 0.1696 0.352 0.348 19.0
No log 34.0 238 2.2169 0.4227 0.1653 0.359 0.3567 19.0
No log 35.0 245 2.2017 0.4227 0.1653 0.359 0.3567 19.0
No log 36.0 252 2.1875 0.4176 0.1653 0.3542 0.3511 19.0
No log 37.0 259 2.1828 0.4209 0.169 0.3542 0.3511 19.0
No log 38.0 266 2.1778 0.431 0.1746 0.3555 0.3503 19.0
No log 39.0 273 2.1683 0.4408 0.1847 0.3555 0.3503 19.0
No log 40.0 280 2.1626 0.4387 0.1836 0.3589 0.3537 19.0
No log 41.0 287 2.1544 0.4297 0.1732 0.3537 0.3495 19.0
No log 42.0 294 2.1496 0.4393 0.1771 0.3566 0.3525 19.0
No log 43.0 301 2.1433 0.4345 0.1853 0.3656 0.3626 19.0
No log 44.0 308 2.1347 0.426 0.1718 0.3613 0.3581 19.0
No log 45.0 315 2.1235 0.426 0.1718 0.3576 0.3546 19.0
No log 46.0 322 2.1172 0.4188 0.1682 0.3621 0.3571 19.0
No log 47.0 329 2.1149 0.4188 0.1682 0.3621 0.3571 19.0
No log 48.0 336 2.1124 0.4172 0.1643 0.3539 0.3484 19.0
No log 49.0 343 2.1091 0.4465 0.19 0.3659 0.3609 19.0
No log 50.0 350 2.1041 0.449 0.2075 0.3769 0.3742 19.0
No log 51.0 357 2.0955 0.449 0.2075 0.3769 0.3742 19.0
No log 52.0 364 2.0906 0.4449 0.2077 0.3819 0.3791 19.0
No log 53.0 371 2.0858 0.4332 0.1597 0.3566 0.3535 19.0
No log 54.0 378 2.0800 0.4373 0.1878 0.375 0.3711 19.0
No log 55.0 385 2.0839 0.4216 0.1719 0.3652 0.3612 19.0
No log 56.0 392 2.0877 0.4216 0.1719 0.3652 0.3612 19.0
No log 57.0 399 2.0862 0.4216 0.1719 0.3652 0.3612 19.0
No log 58.0 406 2.0817 0.4472 0.1833 0.3689 0.3675 19.0
No log 59.0 413 2.0783 0.4564 0.1967 0.3732 0.3679 19.0
No log 60.0 420 2.0704 0.4564 0.1934 0.3732 0.3679 19.0
No log 61.0 427 2.0649 0.4566 0.2001 0.381 0.3769 19.0
No log 62.0 434 2.0618 0.4446 0.1931 0.3691 0.3643 19.0
No log 63.0 441 2.0566 0.4418 0.1931 0.3662 0.3611 19.0
No log 64.0 448 2.0469 0.4494 0.2075 0.3772 0.3742 19.0
No log 65.0 455 2.0500 0.4494 0.2075 0.3772 0.3742 19.0
No log 66.0 462 2.0504 0.4494 0.2075 0.3772 0.3742 19.0
No log 67.0 469 2.0531 0.4494 0.2075 0.3772 0.3742 19.0
No log 68.0 476 2.0540 0.4418 0.19 0.3662 0.3611 19.0
No log 69.0 483 2.0494 0.4418 0.1931 0.3662 0.3611 19.0
No log 70.0 490 2.0578 0.4258 0.1769 0.3653 0.3611 19.0
No log 71.0 497 2.0632 0.4293 0.1769 0.3685 0.3645 19.0
1.9892 72.0 504 2.0639 0.4293 0.1769 0.3685 0.3645 19.0
1.9892 73.0 511 2.0627 0.4258 0.1682 0.3646 0.361 19.0
1.9892 74.0 518 2.0551 0.421 0.1643 0.3539 0.3483 19.0
1.9892 75.0 525 2.0444 0.421 0.1643 0.3539 0.3483 19.0
1.9892 76.0 532 2.0428 0.4258 0.1731 0.3653 0.3611 19.0
1.9892 77.0 539 2.0509 0.4255 0.1731 0.3652 0.3609 19.0
1.9892 78.0 546 2.0566 0.4207 0.1643 0.3537 0.3481 19.0
1.9892 79.0 553 2.0575 0.4359 0.1876 0.37 0.3654 19.0
1.9892 80.0 560 2.0479 0.4331 0.1566 0.3432 0.3394 19.0
1.9892 81.0 567 2.0430 0.4334 0.1566 0.3434 0.3396 19.0
1.9892 82.0 574 2.0377 0.4334 0.1566 0.3434 0.3396 19.0
1.9892 83.0 581 2.0349 0.4331 0.1566 0.3432 0.3394 19.0
1.9892 84.0 588 2.0338 0.4331 0.1566 0.3432 0.3394 19.0
1.9892 85.0 595 2.0345 0.4331 0.1566 0.3432 0.3394 19.0
1.9892 86.0 602 2.0359 0.4535 0.1962 0.3792 0.3735 19.0
1.9892 87.0 609 2.0338 0.4535 0.1962 0.3792 0.3735 19.0
1.9892 88.0 616 2.0456 0.4249 0.1728 0.3563 0.3512 19.0
1.9892 89.0 623 2.0556 0.4255 0.1731 0.3652 0.3609 19.0
1.9892 90.0 630 2.0532 0.4249 0.1728 0.3563 0.3512 19.0
1.9892 91.0 637 2.0461 0.4535 0.1962 0.3792 0.3735 19.0
1.9892 92.0 644 2.0395 0.4507 0.1962 0.3794 0.3736 19.0
1.9892 93.0 651 2.0314 0.4507 0.1962 0.3794 0.3736 19.0
1.9892 94.0 658 2.0308 0.4212 0.1728 0.3565 0.3513 19.0
1.9892 95.0 665 2.0359 0.4264 0.1826 0.3691 0.3647 19.0
1.9892 96.0 672 2.0418 0.4223 0.1731 0.3653 0.3611 19.0
1.9892 97.0 679 2.0400 0.4155 0.1682 0.3626 0.3576 19.0
1.9892 98.0 686 2.0303 0.4155 0.1682 0.3626 0.3576 19.0
1.9892 99.0 693 2.0311 0.4155 0.1682 0.3626 0.3576 19.0
1.9892 100.0 700 2.0339 0.4155 0.1682 0.3626 0.3576 19.0
1.9892 101.0 707 2.0359 0.4058 0.1585 0.3385 0.3378 19.0
1.9892 102.0 714 2.0352 0.4085 0.1583 0.3488 0.3475 19.0
1.9892 103.0 721 2.0339 0.4264 0.1643 0.36 0.3578 19.0
1.9892 104.0 728 2.0419 0.4204 0.1603 0.3529 0.3516 19.0
1.9892 105.0 735 2.0433 0.4031 0.1568 0.3427 0.3377 19.0
1.9892 106.0 742 2.0400 0.4137 0.1643 0.3578 0.3524 19.0
1.9892 107.0 749 2.0400 0.4226 0.1692 0.3641 0.361 19.0
1.9892 108.0 756 2.0401 0.4226 0.1692 0.3641 0.361 19.0
1.9892 109.0 763 2.0416 0.4054 0.1564 0.3491 0.3437 19.0
1.9892 110.0 770 2.0372 0.4137 0.1643 0.3578 0.3524 19.0
1.9892 111.0 777 2.0403 0.4211 0.1692 0.3658 0.3597 19.0
1.9892 112.0 784 2.0379 0.4156 0.1612 0.3586 0.3549 19.0
1.9892 113.0 791 2.0369 0.4266 0.1739 0.3683 0.3655 19.0
1.9892 114.0 798 2.0342 0.4296 0.1373 0.3372 0.3367 19.0
1.9892 115.0 805 2.0294 0.4278 0.1376 0.3316 0.3311 19.0
1.9892 116.0 812 2.0272 0.4495 0.1845 0.3704 0.369 19.0
1.9892 117.0 819 2.0245 0.4042 0.1545 0.3464 0.3446 19.0
1.9892 118.0 826 2.0214 0.4149 0.1689 0.3566 0.3539 19.0
1.9892 119.0 833 2.0177 0.4181 0.1373 0.3326 0.3318 19.0
1.9892 120.0 840 2.0206 0.4403 0.1798 0.3627 0.3644 19.0
1.9892 121.0 847 2.0258 0.4042 0.1545 0.3464 0.3446 19.0
1.9892 122.0 854 2.0264 0.4165 0.1373 0.3265 0.3256 19.0
1.9892 123.0 861 2.0250 0.4104 0.1231 0.3161 0.3158 19.0
1.9892 124.0 868 2.0268 0.389 0.1097 0.3055 0.3064 19.0
1.9892 125.0 875 2.0286 0.3945 0.1099 0.3067 0.307 19.0
1.9892 126.0 882 2.0300 0.3893 0.1105 0.3011 0.3011 19.0
1.9892 127.0 889 2.0358 0.4174 0.1562 0.3385 0.3401 19.0
1.9892 128.0 896 2.0387 0.3873 0.1473 0.3284 0.3286 19.0
1.9892 129.0 903 2.0426 0.3869 0.1493 0.3263 0.3267 19.0
1.9892 130.0 910 2.0440 0.4052 0.1616 0.3422 0.3444 19.0
1.9892 131.0 917 2.0471 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 132.0 924 2.0479 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 133.0 931 2.0469 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 134.0 938 2.0428 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 135.0 945 2.0383 0.4052 0.1616 0.3422 0.3444 19.0
1.9892 136.0 952 2.0322 0.4052 0.1616 0.3422 0.3444 19.0
1.9892 137.0 959 2.0371 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 138.0 966 2.0406 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 139.0 973 2.0466 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 140.0 980 2.0515 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 141.0 987 2.0546 0.4255 0.1719 0.3685 0.3652 19.0
1.9892 142.0 994 2.0581 0.4255 0.1719 0.3685 0.3652 19.0
1.0228 143.0 1001 2.0591 0.4255 0.1719 0.3685 0.3652 19.0
1.0228 144.0 1008 2.0599 0.4255 0.1719 0.3685 0.3652 19.0
1.0228 145.0 1015 2.0608 0.4016 0.1573 0.3387 0.3408 19.0
1.0228 146.0 1022 2.0583 0.4016 0.1573 0.3387 0.3408 19.0
1.0228 147.0 1029 2.0512 0.3814 0.1426 0.3263 0.328 19.0
1.0228 148.0 1036 2.0477 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 149.0 1043 2.0428 0.3944 0.1441 0.3266 0.3287 19.0
1.0228 150.0 1050 2.0393 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 151.0 1057 2.0412 0.3944 0.1441 0.3266 0.3287 19.0
1.0228 152.0 1064 2.0437 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 153.0 1071 2.0433 0.3814 0.1426 0.3263 0.328 19.0
1.0228 154.0 1078 2.0468 0.3829 0.1448 0.3232 0.3223 19.0
1.0228 155.0 1085 2.0512 0.3814 0.1426 0.3263 0.328 19.0
1.0228 156.0 1092 2.0545 0.3814 0.1426 0.3263 0.328 19.0
1.0228 157.0 1099 2.0582 0.3814 0.1426 0.3263 0.328 19.0
1.0228 158.0 1106 2.0624 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 159.0 1113 2.0659 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 160.0 1120 2.0690 0.3814 0.1426 0.3263 0.328 19.0
1.0228 161.0 1127 2.0691 0.3814 0.1426 0.3263 0.328 19.0
1.0228 162.0 1134 2.0675 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 163.0 1141 2.0653 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 164.0 1148 2.0641 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 165.0 1155 2.0649 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 166.0 1162 2.0648 0.3869 0.1493 0.3263 0.3267 19.0
1.0228 167.0 1169 2.0627 0.3869 0.1493 0.3263 0.3267 19.0
1.0228 168.0 1176 2.0610 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 169.0 1183 2.0617 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 170.0 1190 2.0617 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 171.0 1197 2.0594 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 172.0 1204 2.0579 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 173.0 1211 2.0578 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 174.0 1218 2.0584 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 175.0 1225 2.0588 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 176.0 1232 2.0585 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 177.0 1239 2.0579 0.3847 0.1464 0.3303 0.3315 19.0
1.0228 178.0 1246 2.0568 0.3844 0.1426 0.3225 0.3238 19.0
1.0228 179.0 1253 2.0549 0.3944 0.1441 0.3266 0.3287 19.0
1.0228 180.0 1260 2.0553 0.3944 0.1441 0.3266 0.3287 19.0
1.0228 181.0 1267 2.0553 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 182.0 1274 2.0557 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 183.0 1281 2.0565 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 184.0 1288 2.0565 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 185.0 1295 2.0569 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 186.0 1302 2.0585 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 187.0 1309 2.0590 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 188.0 1316 2.0597 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 189.0 1323 2.0610 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 190.0 1330 2.0615 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 191.0 1337 2.0621 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 192.0 1344 2.0625 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 193.0 1351 2.0628 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 194.0 1358 2.0632 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 195.0 1365 2.0634 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 196.0 1372 2.0634 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 197.0 1379 2.0633 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 198.0 1386 2.0634 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 199.0 1393 2.0635 0.3738 0.1028 0.2953 0.2962 19.0
1.0228 200.0 1400 2.0636 0.3738 0.1028 0.2953 0.2962 19.0

Framework versions

  • Transformers 4.39.0
  • Pytorch 2.2.1+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2