fabiochiu/t5-base-tag-generation


Update README.md

Commit c5f2a16 (1 parent: 0c68513), committed by fabiochiu on May 23, 2022

Files changed (1)

README.md +15 -25

@@ -10,40 +10,30 @@ widget:
  example_title: "Programming"
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# t5-base-tag-generation
-This model is a fine-tuned version of [fabiochiu/t5-base-tag-generation](https://huggingface.co/fabiochiu/t5-base-tag-generation) on the None dataset.
-It achieves the following results on the evaluation set:
-- eval_loss: 0.8474
-- eval_rouge1: 38.6033
-- eval_rouge2: 20.5952
-- eval_rougeL: 36.4458
-- eval_rougeLsum: 36.3202
-- eval_gen_len: 15.257
-- eval_runtime: 343.6547
-- eval_samples_per_second: 2.91
-- eval_steps_per_second: 0.364
-- epoch: 0.31
-- step: 2000
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 4e-05

  example_title: "Programming"
 ---
+# Model description
+This model is [t5-base](https://huggingface.co/t5-base) fine-tuned on the [190k Medium Articles](https://www.kaggle.com/datasets/fabiochiusano/medium-articles) dataset to predict article tags from the article's text.
+## Data cleaning
+The dataset is composed of Medium articles and their tags. However, each Medium article can have at most five tags, so the author has to choose the tags they believe are best (mainly for SEO-related purposes). This means that an article with the "Python" tag may not have the "Programming Languages" tag, even though the former implies the latter.
+To clean the dataset with this problem in mind, a hand-made taxonomy of about 1000 tags was built. Using the taxonomy, the tags of each article were augmented (e.g. an article with the "Python" tag also gets the "Programming Languages" tag, because the taxonomy says that "Python" is part of "Programming Languages"). The taxonomy is not public; if you are interested in it, please send an email to [email protected].
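
Reviewer note: the tag augmentation described in the Data cleaning section could look roughly like the sketch below. This is only an illustration, not the author's code. The real taxonomy is not public, so `TAXONOMY` here is a hypothetical two-entry stand-in (only the "Python" to "Programming Languages" link comes from the README), `augment_tags` is a made-up helper name, and following parent links transitively is an assumption.

```python
from typing import Dict, Iterable, List

# Hypothetical child -> parents mapping. Only the "Python" entry is taken
# from the README; the second entry is invented to show transitivity.
TAXONOMY: Dict[str, List[str]] = {
    "Python": ["Programming Languages"],
    "Programming Languages": ["Programming"],
}


def augment_tags(tags: Iterable[str], taxonomy: Dict[str, List[str]]) -> List[str]:
    """Return the original tags plus every ancestor tag implied by the taxonomy."""
    result = list(dict.fromkeys(tags))  # keep original order, drop duplicates
    seen = set(result)
    stack = list(result)
    while stack:
        tag = stack.pop()
        for parent in taxonomy.get(tag, []):
            if parent not in seen:  # also guards against cycles in the taxonomy
                seen.add(parent)
                result.append(parent)
                stack.append(parent)
    return result


augmented = augment_tags(["Python", "Data Science"], TAXONOMY)
print(augmented)
```

Tags that do not appear in the taxonomy (such as "Data Science" in this toy mapping) pass through unchanged, and implied tags are appended after the author's own tags.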
 ## Training and evaluation data
+The model was trained for a single epoch spanning about 50000 articles and evaluated on 1000 random articles not used during training.
+## Evaluation results
+- eval_loss: 0.8474
+- eval_rouge1: 38.6033
+- eval_rouge2: 20.5952
+- eval_rougeL: 36.4458
+- eval_rougeLsum: 36.3202
+- eval_gen_len: 15.257 # average number of generated tokens
+## Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 4e-05