LA1512 commited on
Commit
34e1c2d
·
verified ·
1 Parent(s): e257a98

LA1512/fine-tuned-distilbart-xsum-12-3-FindSum-2

Browse files
Files changed (3) hide show
  1. README.md +13 -15
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -15,14 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # results
17
 
18
- This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-3](https://huggingface.co/sshleifer/distilbart-xsum-12-3) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 3.0029
21
- - Rouge1: 38.0446
22
- - Rouge2: 9.7772
23
- - Rougel: 16.8698
24
- - Rougelsum: 33.5627
25
- - Gen Len: 360.0
26
 
27
  ## Model description
28
 
@@ -48,18 +48,16 @@ The following hyperparameters were used during training:
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
  - lr_scheduler_warmup_steps: 500
51
- - num_epochs: 5
52
  - label_smoothing_factor: 0.04
53
 
54
  ### Training results
55
 
56
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
- |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
58
- | 2.8683 | 1.0 | 2103 | 2.9785 | 38.4791 | 10.8563 | 18.9729 | 33.7913 | 398.0 |
59
- | 2.5296 | 2.0 | 4206 | 2.9247 | 33.8821 | 10.3228 | 18.7384 | 30.1673 | 179.0 |
60
- | 2.3212 | 3.0 | 6309 | 2.9111 | 38.1082 | 10.4958 | 19.915 | 33.3763 | 303.0 |
61
- | 2.1286 | 4.0 | 8412 | 2.9644 | 36.3381 | 9.3725 | 17.9411 | 32.1316 | 289.0 |
62
- | 1.9651 | 5.0 | 10515 | 3.0029 | 38.0446 | 9.7772 | 16.8698 | 33.5627 | 360.0 |
63
 
64
 
65
  ### Framework versions
 
15
 
16
  # results
17
 
18
+ This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-3](https://huggingface.co/sshleifer/distilbart-xsum-12-3) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.6835
21
+ - Rouge1: 38.8257
22
+ - Rouge2: 10.9645
23
+ - Rougel: 19.5312
24
+ - Rougelsum: 33.4613
25
+ - Gen Len: 275.0
26
 
27
  ## Model description
28
 
 
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
  - lr_scheduler_warmup_steps: 500
51
+ - num_epochs: 3
52
  - label_smoothing_factor: 0.04
53
 
54
  ### Training results
55
 
56
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
58
+ | 3.2378 | 1.0 | 500 | 3.1436 | 25.8277 | 5.2374 | 13.1124 | 24.06 | 299.0 |
59
+ | 2.804 | 2.0 | 1000 | 2.7802 | 32.9123 | 6.3884 | 16.0251 | 29.3143 | 241.0 |
60
+ | 2.5568 | 3.0 | 1500 | 2.6835 | 38.8257 | 10.9645 | 19.5312 | 33.4613 | 275.0 |
 
 
61
 
62
 
63
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b35ca3bee585851100895d4c5e072165a3735d5cebeb11369f7b96e16c525233
3
  size 1020714768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fa8aa5938df79073f82ab360c7e21c655a86d8fb311ae5b6d0ca9992874515bd
3
  size 1020714768
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4df94969b40afd71f486c03ca70cdca0bf6ead106e6bb8d21dcac5133c3577cc
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01c828abb405074a3bc68281e335e97400d625bca626a66a7f9a574326c30379
3
  size 5048