uer commited on
Commit
070a727
·
1 Parent(s): de2f109

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -7
README.md CHANGED
@@ -29,11 +29,11 @@ Here are scores on the devlopment set of six Chinese tasks:
29
 
30
  |Model|Score|douban|chnsenticorp|lcqmc|tnews(CLUE)|iflytek(CLUE)|ocnli(CLUE)|
31
  |---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
32
- |BERT-Tiny|0.0|83.0|91.4|81.8|62.0|55.0|60.3|
33
- |BERT-Mini|0.0|84.8|93.7|86.1|63.9|58.3|67.4|
34
- |BERT-Small|0.0|86.5|93.4||65.1|59.4|69.7|
35
- |BERT-Medium|0.0|87.6|94.8|88.1|65.6|59.5|71.2|
36
- |BERT-Base|0.0|89.1|95.2|89.2|67.0|60.9|75.5|
37
 
38
  For each task, we selected the best fine-tuning hyperparameters from the lists below:
39
  - epochs: 3, 5, 8
@@ -94,8 +94,6 @@ encoded_input = tokenizer(text, return_tensors='tf')
94
  output = model(encoded_input)
95
  ```
96
 
97
-
98
-
99
  ## Training data
100
 
101
  CLUECorpusSmall is used as training data. We found that models pre-trained on CLUECorpusSmall outperform those pre-trained on CLUECorpus2020, although CLUECorpus2020 is much larger than CLUECorpusSmall.
 
29
 
30
  |Model|Score|douban|chnsenticorp|lcqmc|tnews(CLUE)|iflytek(CLUE)|ocnli(CLUE)|
31
  |---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
32
+ |BERT-Tiny|72.3|83.0|91.4|81.8|62.0|55.0|60.3|
33
+ |BERT-Mini|75.7|84.8|93.7|86.1|63.9|58.3|67.4|
34
+ |BERT-Small|0.0|86.5|93.4|0.0|65.1|59.4|69.7|
35
+ |BERT-Medium|77.8|87.6|94.8|88.1|65.6|59.5|71.2|
36
+ |BERT-Base|79.5|89.1|95.2|89.2|67.0|60.9|75.5|
37
 
38
  For each task, we selected the best fine-tuning hyperparameters from the lists below:
39
  - epochs: 3, 5, 8
 
94
  output = model(encoded_input)
95
  ```
96
 
 
 
97
  ## Training data
98
 
99
  CLUECorpusSmall is used as training data. We found that models pre-trained on CLUECorpusSmall outperform those pre-trained on CLUECorpus2020, although CLUECorpus2020 is much larger than CLUECorpusSmall.