Commit b691e98 by beyond
1 Parent(s): b3b4c3e

Update README.md

Files changed (1)
  1. README.md +17 -21
README.md CHANGED
@@ -68,27 +68,23 @@ print(generated_text)
  | [`sega-small-chinese`]() | xM | Chinese |
 
 
- ## Intended uses & limitations
-
-
-
-
-
-
- ### Limitations and bias
-
-
- ## Training data
-
-
- ## Training procedure
-
- ### Preprocessing
-
-
- ### Pretraining
-
- ## Evaluation results
+ ## Data Augmentation for Text Classification Tasks:
+ - Setting: Low-resource setting, where only n={50,100,200,500,1000} labeled samples are available for training. The below results are the average of all training sizes.
+ - Datasets: [HuffPost](https://huggingface.co/datasets/khalidalt/HuffPost), [BBC](https://huggingface.co/datasets/SetFit/bbc-news), [SST2](https://huggingface.co/datasets/glue), [IMDB](https://huggingface.co/datasets/imdb), [Yahoo](https://huggingface.co/datasets/yahoo_answers_topics), [20NG](https://huggingface.co/datasets/newsgroup).
+ - Base classifier: [DistilBERT](https://huggingface.co/distilbert-base-cased)
+
+ | Method | HuffPost | BBC | SST2 | IMDB | Yahoo | 20NG | avg. |
+ |---------|:------------------:|:------------------:|:----------------------:|:----------------------:|:----------:|:----------:|:----------:|
+ | | ID / OOD (BBC) | ID / OOD (Huff) | ID / OOD (IMDB) | ID / OOD (SST2) | | | |
+ | none | 79.17 / 62.32 | **96.16** / 62.00 | 76.67 / 73.16 | 77.87 / 74.43 | 45.77 | 46.67 | 69.42 |
+ | EDA | 79.63 / 67.48 | 95.11 / 58.92 | 75.52 / 69.46 | 77.88 / 75.88 | 45.10 | 46.15 | 69.11 |
+ | STA | 80.74 / 69.31 | 95.64 / 64.82 | 77.80 / 73.66 | 77.88 / 74.77 | 46.96 | 47.27 | 70.88 |
+ | Back | 80.48 / 67.75 | 95.28 / 63.10 | 76.96 / 72.23 | 78.35 / 75.96 | 46.10 | 46.61 | 70.28 |
+ | MLM | 80.04 / 66.80 | 96.07 / 65.39 | 76.61/ 73.11 | 75.73 / 73.70 | 45.35 | 46.53 | 69.93 |
+ | C-MLM | 79.96 / 65.10 | 96.13 / **67.80** | 76.91 / 71.83 | 77.31 / 75.02 | 45.29 | 46.36 | 70.17 |
+ | LAMBADA | 81.03 / 68.89 | 93.75 / 52.79 | 77.87 / 74.54 | 77.49 / 74.33 | 50.66 | 47.72 | 69.91 |
+ | **SEGA (Ours)** | 81.43 / 74.87 | 95.61 / 67.79 | 77.87 / 72.94 | **79.51** / **76.75** | 49.43 | 50.47 | 72.67 |
+ | **SEGA-f (Ours)** | **81.82** / **76.18** | 95.78 / 67.79 | **80.59** / **80.32** | 79.37 / 76.61 | **50.12** | **50.81** | **73.94** |
 
 
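
The added README text does not state how the `avg.` column is computed. One reading that is consistent with the reported numbers is an unweighted mean over all ten scores in a row (four ID / OOD pairs plus Yahoo and 20NG). The sketch below checks that assumption for three rows copied from the table; the names `scores`, `reported_avg`, and `mean_score` are illustrative and do not come from the repository.

```python
# Scores copied from the results table in the diff.
# Assumed column order: HuffPost ID, HuffPost OOD, BBC ID, BBC OOD,
# SST2 ID, SST2 OOD, IMDB ID, IMDB OOD, Yahoo, 20NG.
scores = {
    "none": [79.17, 62.32, 96.16, 62.00, 76.67, 73.16, 77.87, 74.43, 45.77, 46.67],
    "SEGA (Ours)": [81.43, 74.87, 95.61, 67.79, 77.87, 72.94, 79.51, 76.75, 49.43, 50.47],
    "SEGA-f (Ours)": [81.82, 76.18, 95.78, 67.79, 80.59, 80.32, 79.37, 76.61, 50.12, 50.81],
}

# Values of the "avg." column for the same rows.
reported_avg = {"none": 69.42, "SEGA (Ours)": 72.67, "SEGA-f (Ours)": 73.94}


def mean_score(values):
    """Unweighted mean of a row's scores (an assumption about how avg. is defined)."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for method, values in scores.items():
        computed = mean_score(values)
        print(f"{method}: computed {computed:.2f}, reported {reported_avg[method]:.2f}")
```

For these three rows the computed means agree with the reported `avg.` values to within rounding, which supports the unweighted-mean reading without confirming it as the author's procedure.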