ronig commited on
Commit
a71b5a2
·
1 Parent(s): d4b6a1a

Upload tokenizer

Browse files
Files changed (3) hide show
  1. special_tokens_map.json +3 -0
  2. tokenizer.json +2098 -0
  3. tokenizer_config.json +5 -0
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "pad_token": "<pad>"
3
+ }
tokenizer.json ADDED
@@ -0,0 +1,2098 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "1.0",
3
+ "truncation": null,
4
+ "padding": {
5
+ "strategy": "BatchLongest",
6
+ "direction": "Right",
7
+ "pad_to_multiple_of": null,
8
+ "pad_id": 1,
9
+ "pad_type_id": 0,
10
+ "pad_token": "<pad>"
11
+ },
12
+ "added_tokens": [
13
+ {
14
+ "id": 0,
15
+ "content": "<unk>",
16
+ "single_word": false,
17
+ "lstrip": false,
18
+ "rstrip": false,
19
+ "normalized": false,
20
+ "special": true
21
+ },
22
+ {
23
+ "id": 1,
24
+ "content": "<pad>",
25
+ "single_word": false,
26
+ "lstrip": false,
27
+ "rstrip": false,
28
+ "normalized": false,
29
+ "special": true
30
+ },
31
+ {
32
+ "id": 2,
33
+ "content": "<mask>",
34
+ "single_word": false,
35
+ "lstrip": false,
36
+ "rstrip": false,
37
+ "normalized": false,
38
+ "special": true
39
+ }
40
+ ],
41
+ "normalizer": {
42
+ "type": "NFKC"
43
+ },
44
+ "pre_tokenizer": {
45
+ "type": "ByteLevel",
46
+ "add_prefix_space": true,
47
+ "trim_offsets": true,
48
+ "use_regex": true
49
+ },
50
+ "post_processor": {
51
+ "type": "ByteLevel",
52
+ "add_prefix_space": true,
53
+ "trim_offsets": true,
54
+ "use_regex": true
55
+ },
56
+ "decoder": {
57
+ "type": "ByteLevel",
58
+ "add_prefix_space": true,
59
+ "trim_offsets": true,
60
+ "use_regex": true
61
+ },
62
+ "model": {
63
+ "type": "BPE",
64
+ "dropout": null,
65
+ "unk_token": "<unk>",
66
+ "continuing_subword_prefix": null,
67
+ "end_of_word_suffix": null,
68
+ "fuse_unk": false,
69
+ "vocab": {
70
+ "<unk>": 0,
71
+ "<pad>": 1,
72
+ "<mask>": 2,
73
+ "A": 3,
74
+ "C": 4,
75
+ "D": 5,
76
+ "E": 6,
77
+ "F": 7,
78
+ "G": 8,
79
+ "H": 9,
80
+ "I": 10,
81
+ "K": 11,
82
+ "L": 12,
83
+ "M": 13,
84
+ "N": 14,
85
+ "P": 15,
86
+ "Q": 16,
87
+ "R": 17,
88
+ "S": 18,
89
+ "T": 19,
90
+ "V": 20,
91
+ "W": 21,
92
+ "Y": 22,
93
+ "Ġ": 23,
94
+ "LL": 24,
95
+ "AA": 25,
96
+ "AL": 26,
97
+ "VL": 27,
98
+ "EL": 28,
99
+ "GL": 29,
100
+ "SL": 30,
101
+ "TL": 31,
102
+ "GV": 32,
103
+ "DL": 33,
104
+ "KL": 34,
105
+ "AV": 35,
106
+ "IL": 36,
107
+ "SG": 37,
108
+ "RL": 38,
109
+ "AG": 39,
110
+ "EE": 40,
111
+ "TV": 41,
112
+ "SV": 42,
113
+ "AE": 43,
114
+ "TG": 44,
115
+ "KV": 45,
116
+ "AI": 46,
117
+ "DV": 47,
118
+ "NL": 48,
119
+ "PL": 49,
120
+ "DG": 50,
121
+ "QL": 51,
122
+ "EV": 52,
123
+ "SS": 53,
124
+ "DI": 54,
125
+ "AR": 55,
126
+ "GG": 56,
127
+ "FL": 57,
128
+ "AK": 58,
129
+ "TI": 59,
130
+ "RV": 60,
131
+ "PV": 61,
132
+ "KI": 62,
133
+ "YL": 63,
134
+ "EI": 64,
135
+ "SI": 65,
136
+ "PG": 66,
137
+ "AD": 67,
138
+ "EK": 68,
139
+ "NV": 69,
140
+ "EG": 70,
141
+ "AS": 71,
142
+ "AT": 72,
143
+ "RI": 73,
144
+ "NG": 74,
145
+ "NI": 75,
146
+ "FV": 76,
147
+ "KG": 77,
148
+ "AQ": 78,
149
+ "FG": 79,
150
+ "RG": 80,
151
+ "PE": 81,
152
+ "QV": 82,
153
+ "ST": 83,
154
+ "KE": 84,
155
+ "KD": 85,
156
+ "YV": 86,
157
+ "AF": 87,
158
+ "PI": 88,
159
+ "TT": 89,
160
+ "PD": 90,
161
+ "RE": 91,
162
+ "YG": 92,
163
+ "SF": 93,
164
+ "HL": 94,
165
+ "SD": 95,
166
+ "QI": 96,
167
+ "RD": 97,
168
+ "KK": 98,
169
+ "AN": 99,
170
+ "TD": 100,
171
+ "QG": 101,
172
+ "SE": 102,
173
+ "AP": 103,
174
+ "FI": 104,
175
+ "YI": 105,
176
+ "ML": 106,
177
+ "FD": 107,
178
+ "SR": 108,
179
+ "SK": 109,
180
+ "SN": 110,
181
+ "TE": 111,
182
+ "AY": 112,
183
+ "SP": 113,
184
+ "TP": 114,
185
+ "AM": 115,
186
+ "TF": 116,
187
+ "RR": 117,
188
+ "YD": 118,
189
+ "SQ": 119,
190
+ "ED": 120,
191
+ "GI": 121,
192
+ "AH": 122,
193
+ "RK": 123,
194
+ "NP": 124,
195
+ "RF": 125,
196
+ "TK": 126,
197
+ "VV": 127,
198
+ "TQ": 128,
199
+ "SY": 129,
200
+ "ND": 130,
201
+ "NE": 131,
202
+ "NF": 132,
203
+ "RQ": 133,
204
+ "NK": 134,
205
+ "MV": 135,
206
+ "HV": 136,
207
+ "IV": 137,
208
+ "IG": 138,
209
+ "CL": 139,
210
+ "PP": 140,
211
+ "FE": 141,
212
+ "DE": 142,
213
+ "TY": 143,
214
+ "WL": 144,
215
+ "QE": 145,
216
+ "TR": 146,
217
+ "NY": 147,
218
+ "DK": 148,
219
+ "HG": 149,
220
+ "PK": 150,
221
+ "QK": 151,
222
+ "PF": 152,
223
+ "HI": 153,
224
+ "RY": 154,
225
+ "MG": 155,
226
+ "TS": 156,
227
+ "TN": 157,
228
+ "FK": 158,
229
+ "DD": 159,
230
+ "AC": 160,
231
+ "RP": 161,
232
+ "AW": 162,
233
+ "MI": 163,
234
+ "NN": 164,
235
+ "QD": 165,
236
+ "YE": 166,
237
+ "QQ": 167,
238
+ "YK": 168,
239
+ "TH": 169,
240
+ "DP": 170,
241
+ "VG": 171,
242
+ "ME": 172,
243
+ "SH": 173,
244
+ "DF": 174,
245
+ "YY": 175,
246
+ "RN": 176,
247
+ "CG": 177,
248
+ "MK": 178,
249
+ "YF": 179,
250
+ "SW": 180,
251
+ "IE": 181,
252
+ "VE": 182,
253
+ "YQ": 183,
254
+ "VK": 184,
255
+ "SC": 185,
256
+ "NQ": 186,
257
+ "II": 187,
258
+ "RT": 188,
259
+ "SM": 189,
260
+ "KP": 190,
261
+ "FP": 191,
262
+ "RH": 192,
263
+ "ALL": 193,
264
+ "ID": 194,
265
+ "FN": 195,
266
+ "IK": 196,
267
+ "WG": 197,
268
+ "AAL": 198,
269
+ "QP": 199,
270
+ "YP": 200,
271
+ "CV": 201,
272
+ "NT": 202,
273
+ "RS": 203,
274
+ "HP": 204,
275
+ "FF": 205,
276
+ "HE": 206,
277
+ "YN": 207,
278
+ "MD": 208,
279
+ "WV": 209,
280
+ "ER": 210,
281
+ "QR": 211,
282
+ "QF": 212,
283
+ "TM": 213,
284
+ "TW": 214,
285
+ "VD": 215,
286
+ "GK": 216,
287
+ "TC": 217,
288
+ "LLL": 218,
289
+ "IP": 219,
290
+ "YR": 220,
291
+ "GE": 221,
292
+ "SA": 222,
293
+ "NR": 223,
294
+ "FR": 224,
295
+ "VI": 225,
296
+ "AVL": 226,
297
+ "HH": 227,
298
+ "FS": 228,
299
+ "FQ": 229,
300
+ "GD": 230,
301
+ "EQ": 231,
302
+ "KN": 232,
303
+ "FT": 233,
304
+ "KR": 234,
305
+ "YT": 235,
306
+ "NS": 236,
307
+ "KT": 237,
308
+ "PQ": 238,
309
+ "PR": 239,
310
+ "YS": 240,
311
+ "DT": 241,
312
+ "AGL": 242,
313
+ "KQ": 243,
314
+ "KY": 244,
315
+ "DR": 245,
316
+ "WI": 246,
317
+ "PT": 247,
318
+ "WE": 248,
319
+ "NH": 249,
320
+ "FH": 250,
321
+ "FY": 251,
322
+ "DS": 252,
323
+ "NC": 253,
324
+ "ASL": 254,
325
+ "MP": 255,
326
+ "AEL": 256,
327
+ "QT": 257,
328
+ "AGV": 258,
329
+ "KS": 259,
330
+ "KF": 260,
331
+ "EN": 261,
332
+ "EF": 262,
333
+ "CI": 263,
334
+ "ATL": 264,
335
+ "HR": 265,
336
+ "HD": 266,
337
+ "MR": 267,
338
+ "MN": 268,
339
+ "EP": 269,
340
+ "ET": 270,
341
+ "QS": 271,
342
+ "ADL": 272,
343
+ "GT": 273,
344
+ "QN": 274,
345
+ "VR": 275,
346
+ "MT": 276,
347
+ "QY": 277,
348
+ "DN": 278,
349
+ "DY": 279,
350
+ "IR": 280,
351
+ "KH": 281,
352
+ "PY": 282,
353
+ "GF": 283,
354
+ "ES": 284,
355
+ "VP": 285,
356
+ "IN": 286,
357
+ "EY": 287,
358
+ "GR": 288,
359
+ "IT": 289,
360
+ "PS": 290,
361
+ "EH": 291,
362
+ "ASV": 292,
363
+ "AIL": 293,
364
+ "ASG": 294,
365
+ "WK": 295,
366
+ "WD": 296,
367
+ "GP": 297,
368
+ "MQ": 298,
369
+ "HK": 299,
370
+ "AAV": 300,
371
+ "ATG": 301,
372
+ "SLL": 302,
373
+ "PN": 303,
374
+ "ARL": 304,
375
+ "CD": 305,
376
+ "GQ": 306,
377
+ "AKL": 307,
378
+ "GS": 308,
379
+ "IS": 309,
380
+ "IQ": 310,
381
+ "HF": 311,
382
+ "RC": 312,
383
+ "ĠM": 313,
384
+ "VQ": 314,
385
+ "GY": 315,
386
+ "MF": 316,
387
+ "ATV": 317,
388
+ "HQ": 318,
389
+ "GN": 319,
390
+ "DQ": 320,
391
+ "ADV": 321,
392
+ "CP": 322,
393
+ "HY": 323,
394
+ "MY": 324,
395
+ "ALV": 325,
396
+ "AEE": 326,
397
+ "SSL": 327,
398
+ "HT": 328,
399
+ "IF": 329,
400
+ "MS": 330,
401
+ "APG": 331,
402
+ "CE": 332,
403
+ "RW": 333,
404
+ "VT": 334,
405
+ "ADI": 335,
406
+ "VS": 336,
407
+ "ALG": 337,
408
+ "IY": 338,
409
+ "ADG": 339,
410
+ "LLK": 340,
411
+ "NW": 341,
412
+ "NM": 342,
413
+ "ALE": 343,
414
+ "CK": 344,
415
+ "LLG": 345,
416
+ "LLE": 346,
417
+ "AKV": 347,
418
+ "GVV": 348,
419
+ "SGG": 349,
420
+ "AAG": 350,
421
+ "AFL": 351,
422
+ "QW": 352,
423
+ "QM": 353,
424
+ "DM": 354,
425
+ "RM": 355,
426
+ "SGL": 356,
427
+ "VN": 357,
428
+ "AQL": 358,
429
+ "CT": 359,
430
+ "PW": 360,
431
+ "APL": 361,
432
+ "PH": 362,
433
+ "CR": 363,
434
+ "EW": 364,
435
+ "ANL": 365,
436
+ "DVV": 366,
437
+ "YH": 367,
438
+ "VLD": 368,
439
+ "ASI": 369,
440
+ "ALK": 370,
441
+ "VLV": 371,
442
+ "IH": 372,
443
+ "KW": 373,
444
+ "AVV": 374,
445
+ "DH": 375,
446
+ "LLV": 376,
447
+ "GLV": 377,
448
+ "SLG": 378,
449
+ "EM": 379,
450
+ "CS": 380,
451
+ "TAA": 381,
452
+ "EEL": 382,
453
+ "CN": 383,
454
+ "AGI": 384,
455
+ "DLV": 385,
456
+ "AEV": 386,
457
+ "CF": 387,
458
+ "LLD": 388,
459
+ "QH": 389,
460
+ "AYL": 390,
461
+ "PM": 391,
462
+ "VLG": 392,
463
+ "EEV": 393,
464
+ "ELV": 394,
465
+ "YW": 395,
466
+ "ALD": 396,
467
+ "ALR": 397,
468
+ "CH": 398,
469
+ "VLE": 399,
470
+ "FW": 400,
471
+ "AAI": 401,
472
+ "GGG": 402,
473
+ "RLL": 403,
474
+ "CQ": 404,
475
+ "TLV": 405,
476
+ "KM": 406,
477
+ "ASS": 407,
478
+ "KLV": 408,
479
+ "YM": 409,
480
+ "ĠG": 410,
481
+ "AGG": 411,
482
+ "AVG": 412,
483
+ "ATI": 413,
484
+ "DW": 414,
485
+ "SSV": 415,
486
+ "VLK": 416,
487
+ "SAL": 417,
488
+ "HN": 418,
489
+ "ELK": 419,
490
+ "STL": 420,
491
+ "AIG": 421,
492
+ "SAA": 422,
493
+ "GVG": 423,
494
+ "ELG": 424,
495
+ "TVL": 425,
496
+ "ELI": 426,
497
+ "PLV": 427,
498
+ "RLV": 428,
499
+ "EC": 429,
500
+ "YSL": 430,
501
+ "LLI": 431,
502
+ "WR": 432,
503
+ "ANG": 433,
504
+ "EKL": 434,
505
+ "SGV": 435,
506
+ "AAK": 436,
507
+ "WT": 437,
508
+ "APV": 438,
509
+ "AFV": 439,
510
+ "EIL": 440,
511
+ "TGI": 441,
512
+ "RVV": 442,
513
+ "ANV": 443,
514
+ "DC": 444,
515
+ "AEI": 445,
516
+ "MM": 446,
517
+ "TLL": 447,
518
+ "YC": 448,
519
+ "SDL": 449,
520
+ "WN": 450,
521
+ "EVV": 451,
522
+ "ILG": 452,
523
+ "SLV": 453,
524
+ "TLG": 454,
525
+ "KLI": 455,
526
+ "KVV": 456,
527
+ "SIL": 457,
528
+ "WQ": 458,
529
+ "EAL": 459,
530
+ "ĠL": 460,
531
+ "LLQ": 461,
532
+ "GLK": 462,
533
+ "GLG": 463,
534
+ "DLG": 464,
535
+ "FC": 465,
536
+ "SRL": 466,
537
+ "SSG": 467,
538
+ "MH": 468,
539
+ "EEI": 469,
540
+ "PSV": 470,
541
+ "TKV": 471,
542
+ "AAE": 472,
543
+ "AVI": 473,
544
+ "REL": 474,
545
+ "GLE": 475,
546
+ "AFG": 476,
547
+ "AVK": 477,
548
+ "ALQ": 478,
549
+ "SVL": 479,
550
+ "DLI": 480,
551
+ "TKL": 481,
552
+ "FM": 482,
553
+ "AIV": 483,
554
+ "RLI": 484,
555
+ "PC": 485,
556
+ "WY": 486,
557
+ "ELL": 487,
558
+ "TNL": 488,
559
+ "ANI": 489,
560
+ "KC": 490,
561
+ "PVV": 491,
562
+ "AAR": 492,
563
+ "DLK": 493,
564
+ "ARV": 494,
565
+ "TTL": 495,
566
+ "GLD": 496,
567
+ "AHL": 497,
568
+ "FGG": 498,
569
+ "SLI": 499,
570
+ "ALI": 500,
571
+ "GLI": 501,
572
+ "RLG": 502,
573
+ "SKL": 503,
574
+ "LLR": 504,
575
+ "KEL": 505,
576
+ "EVI": 506,
577
+ "TVV": 507,
578
+ "HS": 508,
579
+ "TEL": 509,
580
+ "EIV": 510,
581
+ "TAV": 511,
582
+ "TLK": 512,
583
+ "RVL": 513,
584
+ "WF": 514,
585
+ "GVI": 515,
586
+ "TAL": 516,
587
+ "DIV": 517,
588
+ "QC": 518,
589
+ "SVV": 519,
590
+ "ILE": 520,
591
+ "DGK": 521,
592
+ "SLK": 522,
593
+ "ILK": 523,
594
+ "SIV": 524,
595
+ "APE": 525,
596
+ "TEE": 526,
597
+ "GLR": 527,
598
+ "ILV": 528,
599
+ "CY": 529,
600
+ "TSV": 530,
601
+ "LLN": 531,
602
+ "NLV": 532,
603
+ "KVI": 533,
604
+ "AML": 534,
605
+ "PEL": 535,
606
+ "EQL": 536,
607
+ "KII": 537,
608
+ "TGK": 538,
609
+ "KIV": 539,
610
+ "SKD": 540,
611
+ "AAD": 541,
612
+ "RTL": 542,
613
+ "VLP": 543,
614
+ "TGL": 544,
615
+ "RAL": 545,
616
+ "AII": 546,
617
+ "AEG": 547,
618
+ "HC": 548,
619
+ "AYV": 549,
620
+ "AVD": 550,
621
+ "AGK": 551,
622
+ "SDI": 552,
623
+ "KLK": 553,
624
+ "QLV": 554,
625
+ "AGE": 555,
626
+ "EII": 556,
627
+ "FLV": 557,
628
+ "HW": 558,
629
+ "EEG": 559,
630
+ "AVE": 560,
631
+ "ILD": 561,
632
+ "HM": 562,
633
+ "RKL": 563,
634
+ "PLL": 564,
635
+ "ELE": 565,
636
+ "SVG": 566,
637
+ "SLE": 567,
638
+ "TPL": 568,
639
+ "API": 569,
640
+ "GVP": 570,
641
+ "AKG": 571,
642
+ "NTL": 572,
643
+ "TGG": 573,
644
+ "AIK": 574,
645
+ "SEE": 575,
646
+ "VLI": 576,
647
+ "SGI": 577,
648
+ "ARG": 578,
649
+ "TSL": 579,
650
+ "GLP": 580,
651
+ "SNL": 581,
652
+ "AYG": 582,
653
+ "KLG": 583,
654
+ "VLN": 584,
655
+ "SGF": 585,
656
+ "SEL": 586,
657
+ "DLE": 587,
658
+ "SGT": 588,
659
+ "FLE": 589,
660
+ "TVG": 590,
661
+ "TLE": 591,
662
+ "AYI": 592,
663
+ "DGI": 593,
664
+ "SNI": 594,
665
+ "AEK": 595,
666
+ "LLP": 596,
667
+ "PGD": 597,
668
+ "PVG": 598,
669
+ "STG": 599,
670
+ "RVI": 600,
671
+ "RIL": 601,
672
+ "NSG": 602,
673
+ "QAL": 603,
674
+ "EKG": 604,
675
+ "WS": 605,
676
+ "TVP": 606,
677
+ "YYC": 607,
678
+ "RAA": 608,
679
+ "WP": 609,
680
+ "RDL": 610,
681
+ "PLG": 611,
682
+ "NLL": 612,
683
+ "SII": 613,
684
+ "TIG": 614,
685
+ "TAD": 615,
686
+ "KGI": 616,
687
+ "DAV": 617,
688
+ "TPV": 618,
689
+ "DVI": 619,
690
+ "RRL": 620,
691
+ "TLD": 621,
692
+ "NGI": 622,
693
+ "QGL": 623,
694
+ "FSG": 624,
695
+ "SAT": 625,
696
+ "RDI": 626,
697
+ "ARI": 627,
698
+ "QLL": 628,
699
+ "TVK": 629,
700
+ "SFL": 630,
701
+ "RSL": 631,
702
+ "RIV": 632,
703
+ "QLI": 633,
704
+ "TLI": 634,
705
+ "AGD": 635,
706
+ "QAA": 636,
707
+ "NGK": 637,
708
+ "EGK": 638,
709
+ "SPL": 639,
710
+ "SYL": 640,
711
+ "RAV": 641,
712
+ "DII": 642,
713
+ "DLP": 643,
714
+ "RGL": 644,
715
+ "TIL": 645,
716
+ "RII": 646,
717
+ "NVL": 647,
718
+ "AKI": 648,
719
+ "QEL": 649,
720
+ "AKE": 650,
721
+ "AGF": 651,
722
+ "MC": 652,
723
+ "YVG": 653,
724
+ "GGI": 654,
725
+ "AID": 655,
726
+ "NLK": 656,
727
+ "AST": 657,
728
+ "GVH": 658,
729
+ "TVI": 659,
730
+ "APD": 660,
731
+ "NVV": 661,
732
+ "NDL": 662,
733
+ "RLF": 663,
734
+ "KLL": 664,
735
+ "PDG": 665,
736
+ "RQL": 666,
737
+ "SAV": 667,
738
+ "SEV": 668,
739
+ "SKI": 669,
740
+ "QSL": 670,
741
+ "TEV": 671,
742
+ "KVG": 672,
743
+ "DAA": 673,
744
+ "CLV": 674,
745
+ "GVE": 675,
746
+ "STV": 676,
747
+ "EGI": 677,
748
+ "ARE": 678,
749
+ "MW": 679,
750
+ "GVK": 680,
751
+ "EIG": 681,
752
+ "RLK": 682,
753
+ "TAE": 683,
754
+ "FTL": 684,
755
+ "SSI": 685,
756
+ "NAA": 686,
757
+ "AIE": 687,
758
+ "AQV": 688,
759
+ "YLV": 689,
760
+ "KVE": 690,
761
+ "SAS": 691,
762
+ "CC": 692,
763
+ "FLG": 693,
764
+ "SVI": 694,
765
+ "PGI": 695,
766
+ "GVD": 696,
767
+ "TSG": 697,
768
+ "NKL": 698,
769
+ "ALP": 699,
770
+ "EVG": 700,
771
+ "SRV": 701,
772
+ "NGL": 702,
773
+ "PVI": 703,
774
+ "RAS": 704,
775
+ "RGG": 705,
776
+ "WM": 706,
777
+ "QVL": 707,
778
+ "TTV": 708,
779
+ "RLE": 709,
780
+ "ĠV": 710,
781
+ "DGG": 711,
782
+ "YLL": 712,
783
+ "DVD": 713,
784
+ "SAG": 714,
785
+ "FVV": 715,
786
+ "NIV": 716,
787
+ "TAG": 717,
788
+ "SQL": 718,
789
+ "NIL": 719,
790
+ "TDL": 720,
791
+ "EID": 721,
792
+ "RVG": 722,
793
+ "NSL": 723,
794
+ "TGE": 724,
795
+ "EGE": 725,
796
+ "RGI": 726,
797
+ "RIG": 727,
798
+ "ADY": 728,
799
+ "FSL": 729,
800
+ "QVV": 730,
801
+ "TPE": 731,
802
+ "EEK": 732,
803
+ "TVD": 733,
804
+ "WH": 734,
805
+ "TRL": 735,
806
+ "FLK": 736,
807
+ "KVK": 737,
808
+ "ILI": 738,
809
+ "KGE": 739,
810
+ "EIK": 740,
811
+ "KLE": 741,
812
+ "SSK": 742,
813
+ "YTL": 743,
814
+ "PPG": 744,
815
+ "RYL": 745,
816
+ "EEE": 746,
817
+ "EKD": 747,
818
+ "ELD": 748,
819
+ "KIK": 749,
820
+ "SAK": 750,
821
+ "QLG": 751,
822
+ "EGD": 752,
823
+ "PIG": 753,
824
+ "TTI": 754,
825
+ "FLL": 755,
826
+ "DLD": 756,
827
+ "QAV": 757,
828
+ "PVL": 758,
829
+ "NGV": 759,
830
+ "KIG": 760,
831
+ "FGL": 761,
832
+ "PGQ": 762,
833
+ "EKE": 763,
834
+ "MAA": 764,
835
+ "RAE": 765,
836
+ "DVG": 766,
837
+ "AGY": 767,
838
+ "NVI": 768,
839
+ "TGV": 769,
840
+ "NAK": 770,
841
+ "FVG": 771,
842
+ "SLD": 772,
843
+ "NSV": 773,
844
+ "ĠP": 774,
845
+ "QKL": 775,
846
+ "NII": 776,
847
+ "DIG": 777,
848
+ "EVK": 778,
849
+ "RNL": 779,
850
+ "DGE": 780,
851
+ "KGK": 781,
852
+ "TFP": 782,
853
+ "SRD": 783,
854
+ "WW": 784,
855
+ "DIK": 785,
856
+ "YLE": 786,
857
+ "STI": 787,
858
+ "FPL": 788,
859
+ "TVE": 789,
860
+ "SIG": 790,
861
+ "EVE": 791,
862
+ "REI": 792,
863
+ "RFL": 793,
864
+ "AAQ": 794,
865
+ "TLP": 795,
866
+ "DVK": 796,
867
+ "KLD": 797,
868
+ "AQG": 798,
869
+ "DID": 799,
870
+ "ARD": 800,
871
+ "SAE": 801,
872
+ "NLG": 802,
873
+ "ARK": 803,
874
+ "RLD": 804,
875
+ "SNG": 805,
876
+ "SKV": 806,
877
+ "AWL": 807,
878
+ "EVD": 808,
879
+ "PSG": 809,
880
+ "YLK": 810,
881
+ "HLL": 811,
882
+ "TAK": 812,
883
+ "EKK": 813,
884
+ "NAL": 814,
885
+ "SFV": 815,
886
+ "PGL": 816,
887
+ "TSN": 817,
888
+ "SDV": 818,
889
+ "SAI": 819,
890
+ "DIE": 820,
891
+ "AAP": 821,
892
+ "NLE": 822,
893
+ "RGV": 823,
894
+ "AKK": 824,
895
+ "RRV": 825,
896
+ "YLG": 826,
897
+ "PGK": 827,
898
+ "TEI": 828,
899
+ "QIG": 829,
900
+ "TDE": 830,
901
+ "DGD": 831,
902
+ "PEG": 832,
903
+ "SEG": 833,
904
+ "QLE": 834,
905
+ "TIK": 835,
906
+ "PLI": 836,
907
+ "RSG": 837,
908
+ "TIE": 838,
909
+ "RGD": 839,
910
+ "QLK": 840,
911
+ "KLQ": 841,
912
+ "RIE": 842,
913
+ "TAT": 843,
914
+ "YAL": 844,
915
+ "SKG": 845,
916
+ "ASE": 846,
917
+ "NLI": 847,
918
+ "YLQ": 848,
919
+ "YGL": 849,
920
+ "NRL": 850,
921
+ "ASK": 851,
922
+ "SVD": 852,
923
+ "YAD": 853,
924
+ "TAR": 854,
925
+ "PAL": 855,
926
+ "NVK": 856,
927
+ "ĠK": 857,
928
+ "FLD": 858,
929
+ "SGD": 859,
930
+ "RTG": 860,
931
+ "QRL": 861,
932
+ "QIV": 862,
933
+ "PTL": 863,
934
+ "QDL": 864,
935
+ "FPP": 865,
936
+ "ASD": 866,
937
+ "NSI": 867,
938
+ "ĠE": 868,
939
+ "YAA": 869,
940
+ "SGK": 870,
941
+ "QSS": 871,
942
+ "TDG": 872,
943
+ "AQI": 873,
944
+ "SPE": 874,
945
+ "ĠD": 875,
946
+ "SFG": 876,
947
+ "KDK": 877,
948
+ "SPI": 878,
949
+ "TAI": 879,
950
+ "FVD": 880,
951
+ "RGE": 881,
952
+ "SLP": 882,
953
+ "NVG": 883,
954
+ "QVI": 884,
955
+ "FID": 885,
956
+ "NTV": 886,
957
+ "TST": 887,
958
+ "TSI": 888,
959
+ "SED": 889,
960
+ "SAD": 890,
961
+ "RIK": 891,
962
+ "ACL": 892,
963
+ "SAR": 893,
964
+ "PAG": 894,
965
+ "NEL": 895,
966
+ "TIV": 896,
967
+ "TDV": 897,
968
+ "NAI": 898,
969
+ "RGK": 899,
970
+ "SLQ": 900,
971
+ "AQK": 901,
972
+ "TLQ": 902,
973
+ "AAF": 903,
974
+ "NLD": 904,
975
+ "FAK": 905,
976
+ "RAG": 906,
977
+ "KVY": 907,
978
+ "TSS": 908,
979
+ "REV": 909,
980
+ "TEK": 910,
981
+ "PED": 911,
982
+ "NDI": 912,
983
+ "TRV": 913,
984
+ "YVD": 914,
985
+ "VLF": 915,
986
+ "SEI": 916,
987
+ "ELY": 917,
988
+ "SGSG": 918,
989
+ "TTG": 919,
990
+ "YLD": 920,
991
+ "TKD": 921,
992
+ "YGI": 922,
993
+ "MAL": 923,
994
+ "AED": 924,
995
+ "RFG": 925,
996
+ "VLH": 926,
997
+ "HLV": 927,
998
+ "YVV": 928,
999
+ "GIV": 929,
1000
+ "YDL": 930,
1001
+ "SFI": 931,
1002
+ "PIV": 932,
1003
+ "RPG": 933,
1004
+ "GLF": 934,
1005
+ "SKE": 935,
1006
+ "TKE": 936,
1007
+ "RAR": 937,
1008
+ "NVE": 938,
1009
+ "SEK": 939,
1010
+ "PLK": 940,
1011
+ "REE": 941,
1012
+ "TFG": 942,
1013
+ "SPV": 943,
1014
+ "SDG": 944,
1015
+ "SGN": 945,
1016
+ "ADE": 946,
1017
+ "KLF": 947,
1018
+ "ELF": 948,
1019
+ "KGD": 949,
1020
+ "NGE": 950,
1021
+ "QKV": 951,
1022
+ "YDP": 952,
1023
+ "TVSS": 953,
1024
+ "REG": 954,
1025
+ "SRI": 955,
1026
+ "NAV": 956,
1027
+ "SDD": 957,
1028
+ "AKD": 958,
1029
+ "RSV": 959,
1030
+ "TQL": 960,
1031
+ "RVE": 961,
1032
+ "TII": 962,
1033
+ "PGV": 963,
1034
+ "EVF": 964,
1035
+ "WC": 965,
1036
+ "NEE": 966,
1037
+ "ASQ": 967,
1038
+ "TGD": 968,
1039
+ "PLE": 969,
1040
+ "NID": 970,
1041
+ "KVD": 971,
1042
+ "WGQG": 972,
1043
+ "SPD": 973,
1044
+ "RFD": 974,
1045
+ "NKI": 975,
1046
+ "ADD": 976,
1047
+ "ATK": 977,
1048
+ "HTL": 978,
1049
+ "ADK": 979,
1050
+ "PGE": 980,
1051
+ "FLQ": 981,
1052
+ "EEM": 982,
1053
+ "SLY": 983,
1054
+ "PLD": 984,
1055
+ "RVK": 985,
1056
+ "VLQ": 986,
1057
+ "RDV": 987,
1058
+ "QII": 988,
1059
+ "ALH": 989,
1060
+ "SNV": 990,
1061
+ "MEL": 991,
1062
+ "AFD": 992,
1063
+ "MLL": 993,
1064
+ "NSS": 994,
1065
+ "SIE": 995,
1066
+ "RAI": 996,
1067
+ "RKI": 997,
1068
+ "QGV": 998,
1069
+ "FAA": 999,
1070
+ "PGG": 1000,
1071
+ "RED": 1001,
1072
+ "PAV": 1002,
1073
+ "ATE": 1003,
1074
+ "NPD": 1004,
1075
+ "TEG": 1005,
1076
+ "TPI": 1006,
1077
+ "KGY": 1007,
1078
+ "FLI": 1008,
1079
+ "FGI": 1009,
1080
+ "TKI": 1010,
1081
+ "SDE": 1011,
1082
+ "QGI": 1012,
1083
+ "TAS": 1013,
1084
+ "FAL": 1014,
1085
+ "GGK": 1015,
1086
+ "SAP": 1016,
1087
+ "SDK": 1017,
1088
+ "NIG": 1018,
1089
+ "RQI": 1019,
1090
+ "SSD": 1020,
1091
+ "STD": 1021,
1092
+ "TFE": 1022,
1093
+ "CM": 1023
1094
+ },
1095
+ "merges": [
1096
+ "L L",
1097
+ "A A",
1098
+ "A L",
1099
+ "V L",
1100
+ "E L",
1101
+ "G L",
1102
+ "S L",
1103
+ "T L",
1104
+ "G V",
1105
+ "D L",
1106
+ "K L",
1107
+ "A V",
1108
+ "I L",
1109
+ "S G",
1110
+ "R L",
1111
+ "A G",
1112
+ "E E",
1113
+ "T V",
1114
+ "S V",
1115
+ "A E",
1116
+ "T G",
1117
+ "K V",
1118
+ "A I",
1119
+ "D V",
1120
+ "N L",
1121
+ "P L",
1122
+ "D G",
1123
+ "Q L",
1124
+ "E V",
1125
+ "S S",
1126
+ "D I",
1127
+ "A R",
1128
+ "G G",
1129
+ "F L",
1130
+ "A K",
1131
+ "T I",
1132
+ "R V",
1133
+ "P V",
1134
+ "K I",
1135
+ "Y L",
1136
+ "E I",
1137
+ "S I",
1138
+ "P G",
1139
+ "A D",
1140
+ "E K",
1141
+ "N V",
1142
+ "E G",
1143
+ "A S",
1144
+ "A T",
1145
+ "R I",
1146
+ "N G",
1147
+ "N I",
1148
+ "F V",
1149
+ "K G",
1150
+ "A Q",
1151
+ "F G",
1152
+ "R G",
1153
+ "P E",
1154
+ "Q V",
1155
+ "S T",
1156
+ "K E",
1157
+ "K D",
1158
+ "Y V",
1159
+ "A F",
1160
+ "P I",
1161
+ "T T",
1162
+ "P D",
1163
+ "R E",
1164
+ "Y G",
1165
+ "S F",
1166
+ "H L",
1167
+ "S D",
1168
+ "Q I",
1169
+ "R D",
1170
+ "K K",
1171
+ "A N",
1172
+ "T D",
1173
+ "Q G",
1174
+ "S E",
1175
+ "A P",
1176
+ "F I",
1177
+ "Y I",
1178
+ "M L",
1179
+ "F D",
1180
+ "S R",
1181
+ "S K",
1182
+ "S N",
1183
+ "T E",
1184
+ "A Y",
1185
+ "S P",
1186
+ "T P",
1187
+ "A M",
1188
+ "T F",
1189
+ "R R",
1190
+ "Y D",
1191
+ "S Q",
1192
+ "E D",
1193
+ "G I",
1194
+ "A H",
1195
+ "R K",
1196
+ "N P",
1197
+ "R F",
1198
+ "T K",
1199
+ "V V",
1200
+ "T Q",
1201
+ "S Y",
1202
+ "N D",
1203
+ "N E",
1204
+ "N F",
1205
+ "R Q",
1206
+ "N K",
1207
+ "M V",
1208
+ "H V",
1209
+ "I V",
1210
+ "I G",
1211
+ "C L",
1212
+ "P P",
1213
+ "F E",
1214
+ "D E",
1215
+ "T Y",
1216
+ "W L",
1217
+ "Q E",
1218
+ "T R",
1219
+ "N Y",
1220
+ "D K",
1221
+ "H G",
1222
+ "P K",
1223
+ "Q K",
1224
+ "P F",
1225
+ "H I",
1226
+ "R Y",
1227
+ "M G",
1228
+ "T S",
1229
+ "T N",
1230
+ "F K",
1231
+ "D D",
1232
+ "A C",
1233
+ "R P",
1234
+ "A W",
1235
+ "M I",
1236
+ "N N",
1237
+ "Q D",
1238
+ "Y E",
1239
+ "Q Q",
1240
+ "Y K",
1241
+ "T H",
1242
+ "D P",
1243
+ "V G",
1244
+ "M E",
1245
+ "S H",
1246
+ "D F",
1247
+ "Y Y",
1248
+ "R N",
1249
+ "C G",
1250
+ "M K",
1251
+ "Y F",
1252
+ "S W",
1253
+ "I E",
1254
+ "V E",
1255
+ "Y Q",
1256
+ "V K",
1257
+ "S C",
1258
+ "N Q",
1259
+ "I I",
1260
+ "R T",
1261
+ "S M",
1262
+ "K P",
1263
+ "F P",
1264
+ "R H",
1265
+ "A LL",
1266
+ "I D",
1267
+ "F N",
1268
+ "I K",
1269
+ "W G",
1270
+ "AA L",
1271
+ "Q P",
1272
+ "Y P",
1273
+ "C V",
1274
+ "N T",
1275
+ "R S",
1276
+ "H P",
1277
+ "F F",
1278
+ "H E",
1279
+ "Y N",
1280
+ "M D",
1281
+ "W V",
1282
+ "E R",
1283
+ "Q R",
1284
+ "Q F",
1285
+ "T M",
1286
+ "T W",
1287
+ "V D",
1288
+ "G K",
1289
+ "T C",
1290
+ "LL L",
1291
+ "I P",
1292
+ "Y R",
1293
+ "G E",
1294
+ "S A",
1295
+ "N R",
1296
+ "F R",
1297
+ "V I",
1298
+ "A VL",
1299
+ "H H",
1300
+ "F S",
1301
+ "F Q",
1302
+ "G D",
1303
+ "E Q",
1304
+ "K N",
1305
+ "F T",
1306
+ "K R",
1307
+ "Y T",
1308
+ "N S",
1309
+ "K T",
1310
+ "P Q",
1311
+ "P R",
1312
+ "Y S",
1313
+ "D T",
1314
+ "A GL",
1315
+ "K Q",
1316
+ "K Y",
1317
+ "D R",
1318
+ "W I",
1319
+ "P T",
1320
+ "W E",
1321
+ "N H",
1322
+ "F H",
1323
+ "F Y",
1324
+ "D S",
1325
+ "N C",
1326
+ "A SL",
1327
+ "M P",
1328
+ "A EL",
1329
+ "Q T",
1330
+ "A GV",
1331
+ "K S",
1332
+ "K F",
1333
+ "E N",
1334
+ "E F",
1335
+ "C I",
1336
+ "A TL",
1337
+ "H R",
1338
+ "H D",
1339
+ "M R",
1340
+ "M N",
1341
+ "E P",
1342
+ "E T",
1343
+ "Q S",
1344
+ "A DL",
1345
+ "G T",
1346
+ "Q N",
1347
+ "V R",
1348
+ "M T",
1349
+ "Q Y",
1350
+ "D N",
1351
+ "D Y",
1352
+ "I R",
1353
+ "K H",
1354
+ "P Y",
1355
+ "G F",
1356
+ "E S",
1357
+ "V P",
1358
+ "I N",
1359
+ "E Y",
1360
+ "G R",
1361
+ "I T",
1362
+ "P S",
1363
+ "E H",
1364
+ "A SV",
1365
+ "A IL",
1366
+ "A SG",
1367
+ "W K",
1368
+ "W D",
1369
+ "G P",
1370
+ "M Q",
1371
+ "H K",
1372
+ "AA V",
1373
+ "A TG",
1374
+ "S LL",
1375
+ "P N",
1376
+ "A RL",
1377
+ "C D",
1378
+ "G Q",
1379
+ "A KL",
1380
+ "G S",
1381
+ "I S",
1382
+ "I Q",
1383
+ "H F",
1384
+ "R C",
1385
+ "Ġ M",
1386
+ "V Q",
1387
+ "G Y",
1388
+ "M F",
1389
+ "A TV",
1390
+ "H Q",
1391
+ "G N",
1392
+ "D Q",
1393
+ "A DV",
1394
+ "C P",
1395
+ "H Y",
1396
+ "M Y",
1397
+ "AL V",
1398
+ "A EE",
1399
+ "S SL",
1400
+ "H T",
1401
+ "I F",
1402
+ "M S",
1403
+ "A PG",
1404
+ "C E",
1405
+ "R W",
1406
+ "V T",
1407
+ "A DI",
1408
+ "V S",
1409
+ "AL G",
1410
+ "I Y",
1411
+ "A DG",
1412
+ "LL K",
1413
+ "N W",
1414
+ "N M",
1415
+ "AL E",
1416
+ "C K",
1417
+ "LL G",
1418
+ "LL E",
1419
+ "A KV",
1420
+ "GV V",
1421
+ "SG G",
1422
+ "AA G",
1423
+ "A FL",
1424
+ "Q W",
1425
+ "Q M",
1426
+ "D M",
1427
+ "R M",
1428
+ "S GL",
1429
+ "V N",
1430
+ "A QL",
1431
+ "C T",
1432
+ "P W",
1433
+ "A PL",
1434
+ "P H",
1435
+ "C R",
1436
+ "E W",
1437
+ "A NL",
1438
+ "DV V",
1439
+ "Y H",
1440
+ "VL D",
1441
+ "A SI",
1442
+ "AL K",
1443
+ "VL V",
1444
+ "I H",
1445
+ "K W",
1446
+ "AV V",
1447
+ "D H",
1448
+ "LL V",
1449
+ "GL V",
1450
+ "SL G",
1451
+ "E M",
1452
+ "C S",
1453
+ "T AA",
1454
+ "E EL",
1455
+ "C N",
1456
+ "AG I",
1457
+ "DL V",
1458
+ "AE V",
1459
+ "C F",
1460
+ "LL D",
1461
+ "Q H",
1462
+ "A YL",
1463
+ "P M",
1464
+ "VL G",
1465
+ "EE V",
1466
+ "EL V",
1467
+ "Y W",
1468
+ "AL D",
1469
+ "AL R",
1470
+ "C H",
1471
+ "VL E",
1472
+ "F W",
1473
+ "AA I",
1474
+ "GG G",
1475
+ "R LL",
1476
+ "C Q",
1477
+ "TL V",
1478
+ "K M",
1479
+ "A SS",
1480
+ "KL V",
1481
+ "Y M",
1482
+ "Ġ G",
1483
+ "AG G",
1484
+ "AV G",
1485
+ "A TI",
1486
+ "D W",
1487
+ "S SV",
1488
+ "VL K",
1489
+ "S AL",
1490
+ "H N",
1491
+ "EL K",
1492
+ "S TL",
1493
+ "AI G",
1494
+ "S AA",
1495
+ "GV G",
1496
+ "EL G",
1497
+ "T VL",
1498
+ "EL I",
1499
+ "PL V",
1500
+ "RL V",
1501
+ "E C",
1502
+ "Y SL",
1503
+ "LL I",
1504
+ "W R",
1505
+ "A NG",
1506
+ "E KL",
1507
+ "S GV",
1508
+ "AA K",
1509
+ "W T",
1510
+ "A PV",
1511
+ "A FV",
1512
+ "E IL",
1513
+ "TG I",
1514
+ "RV V",
1515
+ "A NV",
1516
+ "D C",
1517
+ "AE I",
1518
+ "M M",
1519
+ "T LL",
1520
+ "Y C",
1521
+ "S DL",
1522
+ "W N",
1523
+ "EV V",
1524
+ "IL G",
1525
+ "SL V",
1526
+ "TL G",
1527
+ "KL I",
1528
+ "KV V",
1529
+ "S IL",
1530
+ "W Q",
1531
+ "E AL",
1532
+ "Ġ L",
1533
+ "LL Q",
1534
+ "GL K",
1535
+ "GL G",
1536
+ "DL G",
1537
+ "F C",
1538
+ "S RL",
1539
+ "S SG",
1540
+ "M H",
1541
+ "EE I",
1542
+ "P SV",
1543
+ "T KV",
1544
+ "AA E",
1545
+ "AV I",
1546
+ "R EL",
1547
+ "GL E",
1548
+ "A FG",
1549
+ "AV K",
1550
+ "AL Q",
1551
+ "S VL",
1552
+ "DL I",
1553
+ "T KL",
1554
+ "F M",
1555
+ "AI V",
1556
+ "RL I",
1557
+ "P C",
1558
+ "W Y",
1559
+ "E LL",
1560
+ "T NL",
1561
+ "A NI",
1562
+ "K C",
1563
+ "PV V",
1564
+ "AA R",
1565
+ "DL K",
1566
+ "AR V",
1567
+ "T TL",
1568
+ "GL D",
1569
+ "A HL",
1570
+ "F GG",
1571
+ "SL I",
1572
+ "AL I",
1573
+ "GL I",
1574
+ "RL G",
1575
+ "S KL",
1576
+ "LL R",
1577
+ "K EL",
1578
+ "EV I",
1579
+ "TV V",
1580
+ "H S",
1581
+ "T EL",
1582
+ "EI V",
1583
+ "T AV",
1584
+ "TL K",
1585
+ "R VL",
1586
+ "W F",
1587
+ "GV I",
1588
+ "T AL",
1589
+ "DI V",
1590
+ "Q C",
1591
+ "SV V",
1592
+ "IL E",
1593
+ "DG K",
1594
+ "SL K",
1595
+ "IL K",
1596
+ "SI V",
1597
+ "A PE",
1598
+ "T EE",
1599
+ "GL R",
1600
+ "IL V",
1601
+ "C Y",
1602
+ "T SV",
1603
+ "LL N",
1604
+ "NL V",
1605
+ "KV I",
1606
+ "A ML",
1607
+ "P EL",
1608
+ "E QL",
1609
+ "KI I",
1610
+ "TG K",
1611
+ "KI V",
1612
+ "S KD",
1613
+ "AA D",
1614
+ "R TL",
1615
+ "VL P",
1616
+ "T GL",
1617
+ "R AL",
1618
+ "AI I",
1619
+ "AE G",
1620
+ "H C",
1621
+ "A YV",
1622
+ "AV D",
1623
+ "AG K",
1624
+ "S DI",
1625
+ "KL K",
1626
+ "QL V",
1627
+ "AG E",
1628
+ "EI I",
1629
+ "FL V",
1630
+ "H W",
1631
+ "EE G",
1632
+ "AV E",
1633
+ "IL D",
1634
+ "H M",
1635
+ "R KL",
1636
+ "P LL",
1637
+ "EL E",
1638
+ "SV G",
1639
+ "SL E",
1640
+ "T PL",
1641
+ "A PI",
1642
+ "GV P",
1643
+ "AK G",
1644
+ "N TL",
1645
+ "TG G",
1646
+ "AI K",
1647
+ "S EE",
1648
+ "VL I",
1649
+ "SG I",
1650
+ "AR G",
1651
+ "T SL",
1652
+ "GL P",
1653
+ "S NL",
1654
+ "A YG",
1655
+ "KL G",
1656
+ "VL N",
1657
+ "SG F",
1658
+ "S EL",
1659
+ "DL E",
1660
+ "SG T",
1661
+ "FL E",
1662
+ "TV G",
1663
+ "TL E",
1664
+ "A YI",
1665
+ "DG I",
1666
+ "S NI",
1667
+ "AE K",
1668
+ "LL P",
1669
+ "PG D",
1670
+ "PV G",
1671
+ "S TG",
1672
+ "RV I",
1673
+ "R IL",
1674
+ "N SG",
1675
+ "Q AL",
1676
+ "EK G",
1677
+ "W S",
1678
+ "TV P",
1679
+ "YY C",
1680
+ "R AA",
1681
+ "W P",
1682
+ "R DL",
1683
+ "PL G",
1684
+ "N LL",
1685
+ "SI I",
1686
+ "TI G",
1687
+ "T AD",
1688
+ "KG I",
1689
+ "D AV",
1690
+ "T PV",
1691
+ "DV I",
1692
+ "R RL",
1693
+ "TL D",
1694
+ "NG I",
1695
+ "Q GL",
1696
+ "F SG",
1697
+ "S AT",
1698
+ "R DI",
1699
+ "AR I",
1700
+ "Q LL",
1701
+ "TV K",
1702
+ "S FL",
1703
+ "R SL",
1704
+ "RI V",
1705
+ "QL I",
1706
+ "TL I",
1707
+ "AG D",
1708
+ "Q AA",
1709
+ "NG K",
1710
+ "EG K",
1711
+ "S PL",
1712
+ "S YL",
1713
+ "R AV",
1714
+ "DI I",
1715
+ "DL P",
1716
+ "R GL",
1717
+ "T IL",
1718
+ "RI I",
1719
+ "N VL",
1720
+ "AK I",
1721
+ "Q EL",
1722
+ "AK E",
1723
+ "AG F",
1724
+ "M C",
1725
+ "YV G",
1726
+ "GG I",
1727
+ "AI D",
1728
+ "NL K",
1729
+ "AS T",
1730
+ "GV H",
1731
+ "TV I",
1732
+ "A PD",
1733
+ "NV V",
1734
+ "N DL",
1735
+ "RL F",
1736
+ "K LL",
1737
+ "P DG",
1738
+ "R QL",
1739
+ "S AV",
1740
+ "S EV",
1741
+ "S KI",
1742
+ "Q SL",
1743
+ "T EV",
1744
+ "KV G",
1745
+ "D AA",
1746
+ "CL V",
1747
+ "GV E",
1748
+ "S TV",
1749
+ "EG I",
1750
+ "AR E",
1751
+ "M W",
1752
+ "GV K",
1753
+ "EI G",
1754
+ "RL K",
1755
+ "T AE",
1756
+ "F TL",
1757
+ "SS I",
1758
+ "N AA",
1759
+ "AI E",
1760
+ "AQ V",
1761
+ "YL V",
1762
+ "KV E",
1763
+ "S AS",
1764
+ "C C",
1765
+ "FL G",
1766
+ "SV I",
1767
+ "PG I",
1768
+ "GV D",
1769
+ "T SG",
1770
+ "N KL",
1771
+ "AL P",
1772
+ "EV G",
1773
+ "S RV",
1774
+ "N GL",
1775
+ "PV I",
1776
+ "R AS",
1777
+ "R GG",
1778
+ "W M",
1779
+ "Q VL",
1780
+ "T TV",
1781
+ "RL E",
1782
+ "Ġ V",
1783
+ "DG G",
1784
+ "Y LL",
1785
+ "DV D",
1786
+ "S AG",
1787
+ "FV V",
1788
+ "NI V",
1789
+ "T AG",
1790
+ "S QL",
1791
+ "N IL",
1792
+ "T DL",
1793
+ "EI D",
1794
+ "RV G",
1795
+ "N SL",
1796
+ "TG E",
1797
+ "EG E",
1798
+ "RG I",
1799
+ "RI G",
1800
+ "AD Y",
1801
+ "F SL",
1802
+ "QV V",
1803
+ "T PE",
1804
+ "EE K",
1805
+ "TV D",
1806
+ "W H",
1807
+ "T RL",
1808
+ "FL K",
1809
+ "KV K",
1810
+ "IL I",
1811
+ "KG E",
1812
+ "EI K",
1813
+ "KL E",
1814
+ "SS K",
1815
+ "Y TL",
1816
+ "P PG",
1817
+ "R YL",
1818
+ "EE E",
1819
+ "EK D",
1820
+ "EL D",
1821
+ "KI K",
1822
+ "S AK",
1823
+ "QL G",
1824
+ "EG D",
1825
+ "PI G",
1826
+ "T TI",
1827
+ "F LL",
1828
+ "DL D",
1829
+ "Q AV",
1830
+ "P VL",
1831
+ "N GV",
1832
+ "KI G",
1833
+ "F GL",
1834
+ "PG Q",
1835
+ "EK E",
1836
+ "M AA",
1837
+ "R AE",
1838
+ "DV G",
1839
+ "AG Y",
1840
+ "NV I",
1841
+ "T GV",
1842
+ "N AK",
1843
+ "FV G",
1844
+ "SL D",
1845
+ "N SV",
1846
+ "Ġ P",
1847
+ "Q KL",
1848
+ "NI I",
1849
+ "DI G",
1850
+ "EV K",
1851
+ "R NL",
1852
+ "DG E",
1853
+ "KG K",
1854
+ "TF P",
1855
+ "S RD",
1856
+ "W W",
1857
+ "DI K",
1858
+ "YL E",
1859
+ "S TI",
1860
+ "F PL",
1861
+ "TV E",
1862
+ "SI G",
1863
+ "EV E",
1864
+ "R EI",
1865
+ "R FL",
1866
+ "AA Q",
1867
+ "TL P",
1868
+ "DV K",
1869
+ "KL D",
1870
+ "AQ G",
1871
+ "DI D",
1872
+ "AR D",
1873
+ "S AE",
1874
+ "NL G",
1875
+ "AR K",
1876
+ "RL D",
1877
+ "S NG",
1878
+ "S KV",
1879
+ "A WL",
1880
+ "EV D",
1881
+ "P SG",
1882
+ "YL K",
1883
+ "H LL",
1884
+ "T AK",
1885
+ "EK K",
1886
+ "N AL",
1887
+ "S FV",
1888
+ "P GL",
1889
+ "T SN",
1890
+ "S DV",
1891
+ "S AI",
1892
+ "DI E",
1893
+ "AA P",
1894
+ "NL E",
1895
+ "R GV",
1896
+ "AK K",
1897
+ "R RV",
1898
+ "YL G",
1899
+ "PG K",
1900
+ "T EI",
1901
+ "QI G",
1902
+ "TD E",
1903
+ "DG D",
1904
+ "P EG",
1905
+ "S EG",
1906
+ "QL E",
1907
+ "TI K",
1908
+ "PL I",
1909
+ "R SG",
1910
+ "TI E",
1911
+ "RG D",
1912
+ "QL K",
1913
+ "KL Q",
1914
+ "RI E",
1915
+ "T AT",
1916
+ "Y AL",
1917
+ "S KG",
1918
+ "AS E",
1919
+ "NL I",
1920
+ "YL Q",
1921
+ "Y GL",
1922
+ "N RL",
1923
+ "AS K",
1924
+ "SV D",
1925
+ "Y AD",
1926
+ "T AR",
1927
+ "P AL",
1928
+ "NV K",
1929
+ "Ġ K",
1930
+ "FL D",
1931
+ "SG D",
1932
+ "R TG",
1933
+ "Q RL",
1934
+ "QI V",
1935
+ "P TL",
1936
+ "Q DL",
1937
+ "F PP",
1938
+ "AS D",
1939
+ "N SI",
1940
+ "Ġ E",
1941
+ "Y AA",
1942
+ "SG K",
1943
+ "Q SS",
1944
+ "T DG",
1945
+ "AQ I",
1946
+ "S PE",
1947
+ "Ġ D",
1948
+ "S FG",
1949
+ "KD K",
1950
+ "S PI",
1951
+ "T AI",
1952
+ "FV D",
1953
+ "RG E",
1954
+ "SL P",
1955
+ "NV G",
1956
+ "QV I",
1957
+ "FI D",
1958
+ "N TV",
1959
+ "T ST",
1960
+ "T SI",
1961
+ "SE D",
1962
+ "S AD",
1963
+ "RI K",
1964
+ "A CL",
1965
+ "S AR",
1966
+ "P AG",
1967
+ "N EL",
1968
+ "TI V",
1969
+ "T DV",
1970
+ "N AI",
1971
+ "RG K",
1972
+ "SL Q",
1973
+ "AQ K",
1974
+ "TL Q",
1975
+ "AA F",
1976
+ "NL D",
1977
+ "F AK",
1978
+ "R AG",
1979
+ "KV Y",
1980
+ "T SS",
1981
+ "R EV",
1982
+ "T EK",
1983
+ "PE D",
1984
+ "N DI",
1985
+ "T RV",
1986
+ "YV D",
1987
+ "VL F",
1988
+ "S EI",
1989
+ "EL Y",
1990
+ "SG SG",
1991
+ "T TG",
1992
+ "YL D",
1993
+ "T KD",
1994
+ "YG I",
1995
+ "M AL",
1996
+ "AE D",
1997
+ "R FG",
1998
+ "VL H",
1999
+ "HL V",
2000
+ "YV V",
2001
+ "GI V",
2002
+ "Y DL",
2003
+ "SF I",
2004
+ "PI V",
2005
+ "R PG",
2006
+ "GL F",
2007
+ "S KE",
2008
+ "T KE",
2009
+ "R AR",
2010
+ "NV E",
2011
+ "S EK",
2012
+ "PL K",
2013
+ "R EE",
2014
+ "T FG",
2015
+ "S PV",
2016
+ "S DG",
2017
+ "SG N",
2018
+ "AD E",
2019
+ "KL F",
2020
+ "EL F",
2021
+ "KG D",
2022
+ "NG E",
2023
+ "Q KV",
2024
+ "YD P",
2025
+ "TV SS",
2026
+ "R EG",
2027
+ "S RI",
2028
+ "N AV",
2029
+ "SD D",
2030
+ "AK D",
2031
+ "R SV",
2032
+ "T QL",
2033
+ "RV E",
2034
+ "TI I",
2035
+ "P GV",
2036
+ "EV F",
2037
+ "W C",
2038
+ "N EE",
2039
+ "AS Q",
2040
+ "TG D",
2041
+ "PL E",
2042
+ "NI D",
2043
+ "KV D",
2044
+ "WG QG",
2045
+ "S PD",
2046
+ "R FD",
2047
+ "N KI",
2048
+ "AD D",
2049
+ "AT K",
2050
+ "H TL",
2051
+ "AD K",
2052
+ "PG E",
2053
+ "FL Q",
2054
+ "EE M",
2055
+ "SL Y",
2056
+ "PL D",
2057
+ "RV K",
2058
+ "VL Q",
2059
+ "R DV",
2060
+ "QI I",
2061
+ "AL H",
2062
+ "S NV",
2063
+ "M EL",
2064
+ "AF D",
2065
+ "M LL",
2066
+ "N SS",
2067
+ "SI E",
2068
+ "R AI",
2069
+ "R KI",
2070
+ "Q GV",
2071
+ "F AA",
2072
+ "P GG",
2073
+ "RE D",
2074
+ "P AV",
2075
+ "AT E",
2076
+ "N PD",
2077
+ "T EG",
2078
+ "T PI",
2079
+ "KG Y",
2080
+ "FL I",
2081
+ "FG I",
2082
+ "T KI",
2083
+ "SD E",
2084
+ "QG I",
2085
+ "T AS",
2086
+ "F AL",
2087
+ "GG K",
2088
+ "S AP",
2089
+ "SD K",
2090
+ "NI G",
2091
+ "R QI",
2092
+ "SS D",
2093
+ "ST D",
2094
+ "TF E",
2095
+ "C M"
2096
+ ]
2097
+ }
2098
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "model_max_length": 1000000000000000019884624838656,
3
+ "pad_token": "<pad>",
4
+ "tokenizer_class": "PreTrainedTokenizerFast"
5
+ }