Xmm/led-large-16384-cnn_dailymail

Text2Text Generation

Generated from Trainer

Inference Endpoints

Xmm committed on Jul 18, 2023

Commit 3b540ec • 1 Parent(s): 937e5af

End of training

Files changed (2)

README.md +11 -7
pytorch_model.bin +1 -1

README.md CHANGED

@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 0.38275620598885174
+      value: 0.38289524455734836
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,11 +31,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [allenai/led-base-16384](https://huggingface.co/allenai/led-base-16384) on the cnn_dailymail dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6093
-- Rouge1: 0.3828
-- Rouge2: 0.1701
-- Rougel: 0.2561
-- Rougelsum: 0.3613
+- Loss: 1.5981
+- Rouge1: 0.3829
+- Rouge2: 0.1704
+- Rougel: 0.2569
+- Rougelsum: 0.3614
 ## Model description
@@ -62,7 +62,7 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
+- num_epochs: 6
 ### Training results
@@ -90,6 +90,10 @@ The following hyperparameters were used during training:
 | 1.6491        | 4.46  | 10000 | 1.6172          | 0.3799 | 0.1681 | 0.2540 | 0.3586    |
 | 1.5994        | 4.68  | 10500 | 1.6132          | 0.3825 | 0.1702 | 0.2560 | 0.3610    |
 | 1.6493        | 4.9   | 11000 | 1.6093          | 0.3828 | 0.1701 | 0.2561 | 0.3613    |
+| 1.6769        | 5.13  | 11500 | 1.6074          | 0.3831 | 0.1706 | 0.2569 | 0.3619    |
+| 1.6554        | 5.35  | 12000 | 1.6044          | 0.3817 | 0.1695 | 0.2559 | 0.3605    |
+| 1.6155        | 5.57  | 12500 | 1.6010          | 0.3825 | 0.1700 | 0.2561 | 0.3608    |
+| 1.5863        | 5.8   | 13000 | 1.5981          | 0.3829 | 0.1704 | 0.2569 | 0.3614    |
 ### Framework versions

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ae4845433170456550ed7e818cbf1880a0b667fab0833a669be14059445b433d
+oid sha256:fe9b5f5e838c57d76d1d698e58e03a4612047d9ad96424c7a2473db751065603
 size 647680813
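
To see how much the extra epoch moved the evaluation numbers, the before and after values from the README.md diff can be subtracted directly. The snippet below is only an illustrative sketch: the dictionary keys and the `metric_deltas` helper are hypothetical names chosen for this example, and the values are copied from the step 11000 and step 13000 rows of the training results table.

```python
# Evaluation metrics copied from the README.md diff in this commit.
# Key names are illustrative; they are not produced by the Trainer in this form.
before = {  # step 11000 (num_epochs: 5)
    "loss": 1.6093, "rouge1": 0.3828, "rouge2": 0.1701,
    "rougeL": 0.2561, "rougeLsum": 0.3613,
}
after = {  # step 13000 (num_epochs: 6)
    "loss": 1.5981, "rouge1": 0.3829, "rouge2": 0.1704,
    "rougeL": 0.2569, "rougeLsum": 0.3614,
}


def metric_deltas(old, new):
    """Return new - old for every metric, rounded to 4 decimal places."""
    return {name: round(new[name] - old[name], 4) for name in old}


deltas = metric_deltas(before, after)
for name, delta in deltas.items():
    print(f"{name:10s} {before[name]:.4f} -> {after[name]:.4f} ({delta:+.4f})")
```

Under these numbers the validation loss drops by about 0.011, while every ROUGE score moves by less than 0.001, so the additional epoch appears to change the reported summarization quality only marginally.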