anzorq lbourdois commited on
Commit
ade82ad
1 Parent(s): 134aa6d

Add multilingual to the language tag (#1)

Browse files

- Add multilingual to the language tag (fc8b6a086b8970c3706bcc26a7c5eee54c32d811)


Co-authored-by: Loïck BOURDOIS <[email protected]>

Files changed (1) hide show
  1. README.md +7 -6
README.md CHANGED
@@ -1,20 +1,21 @@
1
  ---
2
- language:
3
- - kbd
4
- - ru
 
 
5
  tags:
6
  - circassian
7
  - kabardian
8
- license: unknown
9
  datasets:
10
  - anzorq/kbd_lat-835k_ru-3M
11
  ---
12
 
13
  t5-v1_1-small pretrained with mlm task on
14
 
15
- kbd (custom latin script) 835K lines: a pile of scraped text from news sites, books etc.
16
 
17
- ru 3M lines: wiki corpus from OPUS
18
 
19
 
20
  tokenizer: sentencepiece unigram, 8K, shared vocabulary
 
1
  ---
2
+ language:
3
+ - kbd
4
+ - ru
5
+ - multilingual
6
+ license: unknown
7
  tags:
8
  - circassian
9
  - kabardian
 
10
  datasets:
11
  - anzorq/kbd_lat-835k_ru-3M
12
  ---
13
 
14
  t5-v1_1-small pretrained with mlm task on
15
 
16
+ kbd (custom latin script) 835K lines: a pile of scraped text from news sites, books etc.
17
 
18
+ ru 3M lines: wiki corpus from OPUS
19
 
20
 
21
  tokenizer: sentencepiece unigram, 8K, shared vocabulary