Update README.md
Browse files
README.md
CHANGED
@@ -54,10 +54,10 @@ print(tokenizer.decode(sample[0]))
|
|
54 |
|
55 |
The CLEX-Phi-2-2.7B and CLEX-Mixtral-8x7B are trained on [LongCorpus-2.5B](https://huggingface.co/datasets/DAMO-NLP-SG/LongCorpus-2.5B), where the eval results on test set are listed below.
|
56 |
|
57 |
-
| | Train Length | Eval.(32k) | Eval.(64k) | Eval.(128k) |
|
58 |
-
| ----------------- | ------------ | ---------- | ---------- | ----------- |
|
59 |
-
| Phi-2 | 2k | >100 | >100 | >100 |
|
60 |
-
| CLEX-Phi-2 | 32k | 5.
|
61 |
|
62 |
|
63 |
|
|
|
54 |
|
55 |
The CLEX-Phi-2-2.7B and CLEX-Mixtral-8x7B are trained on [LongCorpus-2.5B](https://huggingface.co/datasets/DAMO-NLP-SG/LongCorpus-2.5B), where the eval results on test set are listed below.
|
56 |
|
57 |
+
| | Train Length | Eval.(16k) | | Eval.(32k) | Eval.(64k) | Eval.(128k) |
|
58 |
+
| ----------------- | ------------ | ---------- | ---------- | ----------- |---------- |
|
59 |
+
| Phi-2 | 2k | >100 | >100 | >100 | >100 |
|
60 |
+
| CLEX-Phi-2 | 32k | 5.21 | 5.11 | 5.17 |6.55 |
|
61 |
|
62 |
|
63 |
|