autoevaluator HF staff commited on
Commit
c170f3b
1 Parent(s): 6908696

Add evaluation results on the default config and test split of xsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config and test split of the [xsum](https://huggingface.co/datasets/xsum) dataset by

@pszemraj
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-xsum-default-79b8e9-36674145028).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=xsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=xsum).

Files changed (1) hide show
  1. README.md +44 -5
README.md CHANGED
@@ -1,18 +1,57 @@
1
  ---
 
 
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
  - stacked summaries
6
  - xsum
7
  datasets:
8
  - stacked-summaries/stacked-xsum-1024
 
9
  model-index:
10
  - name: flan-t5-large-stacked-XSUM-1024-WIP-2p8-850-stacked-xsum-1024-evaluated
11
- results: []
12
- language:
13
- - en
14
- library_name: transformers
15
- pipeline_tag: summarization
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
  ---
17
 
18
 
 
1
  ---
2
+ language:
3
+ - en
4
  license: apache-2.0
5
+ library_name: transformers
6
  tags:
7
  - generated_from_trainer
8
  - stacked summaries
9
  - xsum
10
  datasets:
11
  - stacked-summaries/stacked-xsum-1024
12
+ pipeline_tag: summarization
13
  model-index:
14
  - name: flan-t5-large-stacked-XSUM-1024-WIP-2p8-850-stacked-xsum-1024-evaluated
15
+ results:
16
+ - task:
17
+ type: summarization
18
+ name: Summarization
19
+ dataset:
20
+ name: xsum
21
+ type: xsum
22
+ config: default
23
+ split: test
24
+ metrics:
25
+ - type: rouge
26
+ value: 39.3722
27
+ name: ROUGE-1
28
+ verified: true
29
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2YzZmMyZjk4OTdlMmJiMWY4NzZmNTMwMTFkZmFiNjE3ZjE2MWY2ODY3MGZkYjdiNzBkNDAyMmE2ODRmNzA3YyIsInZlcnNpb24iOjF9.6OEAAa9Hy9xxkAsXyyxylTPoerL8OMg2Cc_jP0KGA25ZQCF3LLZFMrSyi93AMoqBfRCcdaJ7ypE2b1m2YVNcAw
30
+ - type: rouge
31
+ value: 17.5791
32
+ name: ROUGE-2
33
+ verified: true
34
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2IwYjBiYTUzNjZlMjdjNjI5NmVjNGU3N2E0MzdhYmIxODI1NTM5N2I4MTcwMGIyNWRkZjhmM2UzZDY2ZWJkYSIsInZlcnNpb24iOjF9.p30-YNYekLdiVwNFGmm3-FJP0iBQgOpkmyxjM8xlNGRGc2iihhNAwe4ewRFYmkRSjnbrHjoGKERBq_-3DzE1Cw
35
+ - type: rouge
36
+ value: 32.6528
37
+ name: ROUGE-L
38
+ verified: true
39
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjg0ZDIyMDUwYzFiZjMwYzUxOTg4MDlmODcwMTdiNzFlNDQ4NDViYTgzNmMwYmRlODgzODM0MjdlMzU3NGU3MCIsInZlcnNpb24iOjF9.kIYuS58Tws-oEyuneymCC4by4yfI0zsq2uHOUYNfEHmGYdetsgFzzBuVmTHa37KRD30MuDk-RPCy-kam25CVCg
40
+ - type: rouge
41
+ value: 32.6438
42
+ name: ROUGE-LSUM
43
+ verified: true
44
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWJlMDlhOTQ4ZjI5ZWZkZTNmNzZmNGI0NzhkYmYwZGUxNDZkYzFiYTFmOGIxZTgxZWNiMGQ3YzY2NTUwYzhkYyIsInZlcnNpb24iOjF9.S4ioChbfEA1hDf4jnRBCTUk9oLX9bX82We6W4SzsLmT3BAYe1wBPgfpDvSUQ1PQhZk5IaDwfKtKF_z1coqBVDQ
45
+ - type: loss
46
+ value: 1.4964560270309448
47
+ name: loss
48
+ verified: true
49
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTViZTkwMzQ3MGNlZDJhNTk3NDE5NzBkMDZjMGEyNzNkZTI4ZmJhMWRlYTMwNmRmN2JhNzdkNTQ3N2FlODBmNyIsInZlcnNpb24iOjF9.lNWUw12R20SwZMZEuUnxYsWrkFBNoU9_5ZOiuFF5aT9QsHJC-FSmZ8DXTdVudv6J-BoeA-l5KYowr7GJfbzlDQ
50
+ - type: gen_len
51
+ value: 18.7302
52
+ name: gen_len
53
+ verified: true
54
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZWM2MWQzN2YyY2U0NWZhZGFkMjk0MzFlMTA1YTgxYzAzYjhhZmVmZDI5Mzk4ODgzOGU1NjVhNTk3NmYyNzhkMSIsInZlcnNpb24iOjF9.bL56u1G46OIwdIqZJ-6og_T2yCKFTXrlPQeguZps3ixXokfKqlfCDxz3641yKA3AdMlLe5lDcN3UQReHtiWwBg
55
  ---
56
 
57