LA1512
/

led-epoch-1

@@ -4,9 +4,24 @@ tags:
 - generated_from_trainer
 datasets:
 - pubmed-summarization
 model-index:
 - name: results
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,6 +30,13 @@ should probably proofread and complete it, then remove this comment. -->
 # results
 This model is a fine-tuned version of [LA1512/PubMed-fine-tune](https://huggingface.co/LA1512/PubMed-fine-tune) on the pubmed-summarization dataset.
 ## Model description
@@ -33,7 +55,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -43,6 +65,15 @@ The following hyperparameters were used during training:
 - num_epochs: 3
 - label_smoothing_factor: 0.1
 ### Framework versions
 - Transformers 4.39.3

 - generated_from_trainer
 datasets:
 - pubmed-summarization
+metrics:
+- rouge
 model-index:
 - name: results
+  results:
+  - task:
+      name: Sequence-to-sequence Language Modeling
+      type: text2text-generation
+    dataset:
+      name: pubmed-summarization
+      type: pubmed-summarization
+      config: section
+      split: validation
+      args: section
+    metrics:
+    - name: Rouge1
+      type: rouge
+      value: 40.7402
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # results
 This model is a fine-tuned version of [LA1512/PubMed-fine-tune](https://huggingface.co/LA1512/PubMed-fine-tune) on the pubmed-summarization dataset.
+It achieves the following results on the evaluation set:
+- Loss: 3.6196
+- Rouge1: 40.7402
+- Rouge2: 16.1978
+- Rougel: 24.4278
+- Rougelsum: 36.5282
+- Gen Len: 179.6185
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - num_epochs: 3
 - label_smoothing_factor: 0.1
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len  |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
+| 3.6132        | 1.0   | 2500 | 3.6766          | 40.5092 | 15.7678 | 24.1228 | 36.3318   | 183.7205 |
+| 3.5939        | 2.0   | 5000 | 3.6276          | 40.7583 | 16.1779 | 24.4375 | 36.5537   | 181.4365 |
+| 3.5419        | 3.0   | 7500 | 3.6196          | 40.7402 | 16.1978 | 24.4278 | 36.5282   | 179.6185 |
 ### Framework versions
 - Transformers 4.39.3

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3c7d91bd434f5ae1765aaa0ca8412882fa09316abfdd28127d94aa0dc0c644bf
 size 1020714768

 version https://git-lfs.github.com/spec/v1
+oid sha256:9f34a4a5a93856e5d6532db0bb0e56e053b236b0106766ea844b9d61906ffd50
 size 1020714768

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d38298726dfd7e007c60b5b330e4b25d706d22c5182372b5875727789c9ceec
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:8996fa5924b2adc468689d11dc75794fbe5d22d47de3739d11c93031ebbedd55
 size 5048