Add multilingual to the language tag (#1)
Browse files- Add multilingual to the language tag (fc8b6a086b8970c3706bcc26a7c5eee54c32d811)
Co-authored-by: Loïck BOURDOIS <[email protected]>
README.md
CHANGED
@@ -1,20 +1,21 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
-
|
4 |
-
|
|
|
|
|
5 |
tags:
|
6 |
- circassian
|
7 |
- kabardian
|
8 |
-
license: unknown
|
9 |
datasets:
|
10 |
- anzorq/kbd_lat-835k_ru-3M
|
11 |
---
|
12 |
|
13 |
t5-v1_1-small pretrained with mlm task on
|
14 |
|
15 |
-
|
16 |
|
17 |
-
|
18 |
|
19 |
|
20 |
tokenizer: sentencepiece unigram, 8K, shared vocabulary
|
|
|
1 |
---
|
2 |
+
language:
|
3 |
+
- kbd
|
4 |
+
- ru
|
5 |
+
- multilingual
|
6 |
+
license: unknown
|
7 |
tags:
|
8 |
- circassian
|
9 |
- kabardian
|
|
|
10 |
datasets:
|
11 |
- anzorq/kbd_lat-835k_ru-3M
|
12 |
---
|
13 |
|
14 |
t5-v1_1-small pretrained with mlm task on
|
15 |
|
16 |
+
� kbd (custom latin script) 835K lines: a pile of scraped text from news sites, books etc.
|
17 |
|
18 |
+
� ru 3M lines: wiki corpus from OPUS
|
19 |
|
20 |
|
21 |
tokenizer: sentencepiece unigram, 8K, shared vocabulary
|