Xmm
/

led-large-16384-govreport

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Xmm commited on Jul 5, 2023

Commit

5fde500

•

1 Parent(s): 873e7ba

End of training

Files changed (2) hide show

README.md +31 -11
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -4,9 +4,24 @@ tags:
 - generated_from_trainer
 datasets:
 - govreport-summarization
 model-index:
 - name: led-large-16384-govreport
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,15 +31,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [allenai/led-base-16384](https://huggingface.co/allenai/led-base-16384) on the govreport-summarization dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 3.0537
-- eval_rouge1: 0.3850
-- eval_rouge2: 0.1267
-- eval_rougeL: 0.1629
-- eval_rougeLsum: 0.1629
-- eval_runtime: 5561.9865
-- eval_samples_per_second: 0.175
-- eval_steps_per_second: 0.175
-- step: 0
 ## Model description
@@ -51,7 +62,16 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Framework versions

 - generated_from_trainer
 datasets:
 - govreport-summarization
+metrics:
+- rouge
 model-index:
 - name: led-large-16384-govreport
+  results:
+  - task:
+      name: Sequence-to-sequence Language Modeling
+      type: text2text-generation
+    dataset:
+      name: govreport-summarization
+      type: govreport-summarization
+      config: document
+      split: validation
+      args: document
+    metrics:
+    - name: Rouge1
+      type: rouge
+      value: 0.5292696934712731
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [allenai/led-base-16384](https://huggingface.co/allenai/led-base-16384) on the govreport-summarization dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.7702
+- Rouge1: 0.5293
+- Rouge2: 0.2192
+- Rougel: 0.2504
+- Rougelsum: 0.2505
 ## Model description
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 6
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
+| 1.8152        | 3.65  | 500  | 1.7956          | 0.5095 | 0.2040 | 0.2382 | 0.2381    |
+| 1.6981        | 3.66  | 1000 | 1.7624          | 0.5194 | 0.2107 | 0.2437 | 0.2437    |
+| 1.7048        | 5.49  | 1500 | 1.7448          | 0.5253 | 0.2149 | 0.2467 | 0.2467    |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ee1cccb9d34e3afa98366a7d172789d7b83df121778db25eab44227b131f1877
 size 647678513

 version https://git-lfs.github.com/spec/v1
+oid sha256:41574ce7cf69b0ae43d5dc42f803155620320fe0e94accc9aee9d2f32ec7303c
 size 647678513