Update README.md
README.md
CHANGED
@@ -29,11 +29,11 @@ Here are scores on the devlopment set of six Chinese tasks:
 
 |Model|Score|douban|chnsenticorp|lcqmc|tnews(CLUE)|iflytek(CLUE)|ocnli(CLUE)|
 |---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
-|BERT-Tiny|
-|BERT-Mini|
-|BERT-Small|0.0|86.5|93.4
-|BERT-Medium|
-|BERT-Base|
+|BERT-Tiny|72.3|83.0|91.4|81.8|62.0|55.0|60.3|
+|BERT-Mini|75.7|84.8|93.7|86.1|63.9|58.3|67.4|
+|BERT-Small|0.0|86.5|93.4|0.0|65.1|59.4|69.7|
+|BERT-Medium|77.8|87.6|94.8|88.1|65.6|59.5|71.2|
+|BERT-Base|79.5|89.1|95.2|89.2|67.0|60.9|75.5|
 
 For each task, we selected the best fine-tuning hyperparameters from the lists below:
 - epochs: 3, 5, 8
@@ -94,8 +94,6 @@ encoded_input = tokenizer(text, return_tensors='tf')
 output = model(encoded_input)
 ```
 
-
-
 ## Training data
 
 CLUECorpusSmall is used as training data. We found that models pre-trained on CLUECorpusSmall outperform those pre-trained on CLUECorpus2020, although CLUECorpus2020 is much larger than CLUECorpusSmall.
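Review note: the diff does not say how the `Score` column is derived. The four fully populated rows are consistent with it being the unweighted mean of the six task scores, rounded half-up to one decimal. The sketch below checks that assumption against the values on the `+` side. The names `ROWS`, `mean_score`, and `check_rows` are hypothetical helpers introduced for this check and are not part of the repository. BERT-Small is left out because its row still contains `0.0` placeholders.

```python
from decimal import Decimal, ROUND_HALF_UP

# Values copied from the "+" rows of the diff: (reported Score, six task scores).
# Task order: douban, chnsenticorp, lcqmc, tnews, iflytek, ocnli.
# BERT-Small is omitted because its row still holds 0.0 placeholders.
ROWS = {
    "BERT-Tiny": ("72.3", ["83.0", "91.4", "81.8", "62.0", "55.0", "60.3"]),
    "BERT-Mini": ("75.7", ["84.8", "93.7", "86.1", "63.9", "58.3", "67.4"]),
    "BERT-Medium": ("77.8", ["87.6", "94.8", "88.1", "65.6", "59.5", "71.2"]),
    "BERT-Base": ("79.5", ["89.1", "95.2", "89.2", "67.0", "60.9", "75.5"]),
}


def mean_score(task_scores):
    """Unweighted mean of the task scores, rounded half-up to one decimal.

    Decimal is used instead of float so that a value such as 72.25
    rounds to 72.3 rather than being affected by float rounding rules.
    """
    total = sum(Decimal(s) for s in task_scores)
    mean = total / len(task_scores)
    return mean.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def check_rows(rows):
    """Map each model name to True if its reported Score matches the mean."""
    return {
        name: mean_score(tasks) == Decimal(reported)
        for name, (reported, tasks) in rows.items()
    }


if __name__ == "__main__":
    for name, ok in check_rows(ROWS).items():
        print(name, "consistent" if ok else "mismatch")
```

If the averaging assumption holds, the same helper could be used to fill in the two `0.0` cells for BERT-Small once its lcqmc result is available; until then those cells should be read as placeholders rather than measured scores.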