Commit a8dc020 by Teja-Gollapudi
1 Parent(s): a3dae86

Update README.md

README.md CHANGED
@@ -121,9 +121,9 @@ tokenizer = AutoTokenizer.from_pretrained(model_name)
 | MiniLMv2-L6-H768-from-RoBERTa-Large | 81,529,346 | 4:39:02 | 9:34 | 1:06 | 65.80 | 77.17 | 51.72 | 63.27 |
 | RoBERTa-Base | 124,056,578 | 8:50:29 | 18:59 | 2:11 | 69.06 | 80.08 | 55.53 | 66.49 |
 | RoBERTa-Large | 354,312,194 | 29:16:06 | 1:01:10 | 7:04 | 74.08 | 84.38 | 62.20 | 72.88 |
-|TinyRoBERTa | 81,529.346 | 4:27:06
-
+|TinyRoBERTa | 81,529.346 | 4:27:06 *| 9:54 | 1:04 | 69.38 | 80.07| 53.29| 64.16|
 
+\*: Training times aren't perfectly comparable as TinyRoBERTa was distilled from [VMware/roberta-large-mrqa](https://huggingface.co/VMware/roberta-large-mrqa) that was already trained on MRQA
 
 # Limitations and Bias
 
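
As a rough way to read the table rows in this hunk, the sketch below converts the training-time column to seconds and expresses each model relative to RoBERTa-Large. It assumes the second column is the parameter count and the third is training time (the footnote's asterisk sits on that column), and it reads TinyRoBERTa's `81,529.346` as 81,529,346. The names `to_seconds`, `ROWS`, and `relative_to` are illustrative and not part of the README. Per the footnote, the TinyRoBERTa training-time ratio is only indicative, since its teacher had already been trained on MRQA.

```python
def to_seconds(duration: str) -> int:
    """Convert an H:MM:SS or M:SS string into a number of seconds."""
    seconds = 0
    for part in duration.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


# (parameter count, training time) copied from the rows in the hunk.
# Column meanings are an assumption; the hunk does not show the table header.
ROWS = {
    "MiniLMv2-L6-H768-from-RoBERTa-Large": (81_529_346, "4:39:02"),
    "RoBERTa-Base": (124_056_578, "8:50:29"),
    "RoBERTa-Large": (354_312_194, "29:16:06"),
    # Source writes "81,529.346"; read here as 81,529,346 (assumption).
    "TinyRoBERTa": (81_529_346, "4:27:06"),
}


def relative_to(baseline: str, rows: dict) -> dict:
    """Return (parameter ratio, training-time ratio) of each row vs. the baseline."""
    base_params, base_time = rows[baseline]
    base_seconds = to_seconds(base_time)
    return {
        name: (params / base_params, to_seconds(time) / base_seconds)
        for name, (params, time) in rows.items()
    }


if __name__ == "__main__":
    for name, (param_ratio, time_ratio) in relative_to("RoBERTa-Large", ROWS).items():
        print(f"{name}: {param_ratio:.1%} of parameters, {time_ratio:.1%} of training time")
```

Ratios against a single baseline make the size and cost gap easier to see than raw `H:MM:SS` strings, but they say nothing about the score columns, which should be compared directly from the table.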