Marian Krotil committed
Commit 6ff56c1
1 parent: 03af426

Update README.md

Files changed (1): README.md +2 −2
README.md CHANGED
@@ -20,7 +20,7 @@ This model is a fine-tuned checkpoint of [facebook/mbart-large-cc25](https://hug
 The model deals with the task ``Headline + Text to Abstract`` (HT2A) which consists in generating a multi-sentence summary considered as an abstract from a Czech news text.
 
 ## Dataset
-The model has been trained on the private CNC dataset provided by Czech News Center. The dataset includes 3/4M Czech news-based documents consisting of a Headline, Abstract, and Full-text sections. Truncation and padding were set to 512 tokens.
+The model has been trained on the private CNC dataset provided by Czech News Center. The dataset includes 3/4M Czech news-based documents consisting of a Headline, Abstract, and Full-text sections. Truncation and padding were set to 512 tokens for the encoder and 128 for the decoder.
 
 ## Training
 The model has been trained on 1x NVIDIA Tesla A100 40GB for 60 hours. During training, the model has seen 3712K documents corresponding to roughly 5.5 epochs.
@@ -41,7 +41,7 @@ def summ_config():
 ("repetition_penalty", 1.2),
 ("no_repeat_ngram_size", None),
 ("early_stopping", True),
-("max_length", 96),
+("max_length", 128),
 ("min_length", 10),
 ])),
 #texts to summarize
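The second hunk raises the generation `max_length` from 96 to 128, matching the 128-token decoder limit introduced in the Dataset section. A minimal sketch of how these generation parameters could be wired into a Hugging Face `generate()` call is below; the `summarize` helper and its tokenizer/model arguments are assumptions for illustration, not part of the commit, and only the parameter values themselves come from the diff.

```python
from collections import OrderedDict

# Generation parameters mirroring the values shown in the diff's summ_config().
GEN_KWARGS = OrderedDict([
    ("repetition_penalty", 1.2),
    ("no_repeat_ngram_size", None),
    ("early_stopping", True),
    ("max_length", 128),   # raised from 96 in this commit
    ("min_length", 10),
])


def summarize(text, tokenizer, model):
    """Hypothetical helper: summarize `text` with the fine-tuned mBART model."""
    # Encoder inputs are truncated/padded to 512 tokens, per the README.
    inputs = tokenizer(
        text, max_length=512, truncation=True,
        padding="max_length", return_tensors="pt",
    )
    # Drop None-valued options before passing them to generate().
    gen_kwargs = {k: v for k, v in GEN_KWARGS.items() if v is not None}
    output_ids = model.generate(**inputs, **gen_kwargs)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Filtering out `None` values keeps `no_repeat_ngram_size` at the library default rather than passing an explicit null, which mirrors how the config lists the option without setting it.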