Update README.md
README.md
CHANGED
@@ -14,7 +14,7 @@ metrics:
 
 ## Model description
 
-This model replicates the "DistilBERT (D)" model from Table 2 of the [DistilBERT paper](https://arxiv.org/pdf/1910.01108.pdf). In this approach, a DistilBERT student is fine-tuned on SQuAD v1.1,
+This model replicates the "DistilBERT (D)" model from Table 2 of the [DistilBERT paper](https://arxiv.org/pdf/1910.01108.pdf). In this approach, a DistilBERT student is fine-tuned on SQuAD v1.1, but with a BERT model (also fine-tuned on SQuAD v1.1) acting as a teacher for a second step of task-specific distillation.
 
 In this version, the following pre-trained models were used:
 
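
The README line in this change describes task-specific distillation: a student is trained on SQuAD labels while also matching a fine-tuned teacher. As a rough illustration only, the sketch below shows one common way to combine the two signals for extractive QA, applied separately to start-position and end-position logits. The function names, the `temperature` and `alpha` values, and the exact loss weighting are assumptions for illustration; the README does not state which formulation or hyperparameters were used.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target_index):
    """Hard-label loss against the gold token position."""
    return -math.log(softmax(logits)[target_index])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def distillation_loss(student_logits, teacher_logits, target_index,
                      temperature=2.0, alpha=0.5):
    """Weighted sum of hard-label loss and soft-label (teacher) loss.

    temperature and alpha are hypothetical defaults, not values from the README.
    """
    hard = cross_entropy(student_logits, target_index)
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    # Scaling by temperature**2 keeps the soft term's gradient magnitude comparable.
    soft = kl_divergence(teacher_probs, student_probs) * temperature ** 2
    return alpha * hard + (1 - alpha) * soft


def qa_distillation_loss(student_start, student_end, teacher_start, teacher_end,
                         gold_start, gold_end, temperature=2.0, alpha=0.5):
    """Average the distillation loss over the start and end position heads."""
    start = distillation_loss(student_start, teacher_start, gold_start, temperature, alpha)
    end = distillation_loss(student_end, teacher_end, gold_end, temperature, alpha)
    return (start + end) / 2
```

When the student's logits equal the teacher's, the soft term is zero and only the hard-label term remains. A real training setup would compute this over batched tensors in a deep learning framework; the pure-Python version here is only meant to make the arithmetic explicit.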