Update README.md
Browse files
README.md
CHANGED
@@ -15,4 +15,6 @@ The paper titled "Increasing The Performance of Cognitively Inspired Data-Effici
|
|
15 |
|
16 |
<strong>omarmomen/transformer_base_final_2</strong> is a baseline vanilla transformer encoder.
|
17 |
|
18 |
-
The model is pretrained on the BabyLM 10M dataset using a custom pretrained RobertaTokenizer (https://huggingface.co/omarmomen/babylm_tokenizer_32k).
|
|
|
|
|
|
15 |
|
16 |
<strong>omarmomen/transformer_base_final_2</strong> is a baseline vanilla transformer encoder.
|
17 |
|
18 |
+
The model is pretrained on the BabyLM 10M dataset using a custom pretrained RobertaTokenizer (https://huggingface.co/omarmomen/babylm_tokenizer_32k).
|
19 |
+
|
20 |
+
https://arxiv.org/abs/2310.20589
|