Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
yhavinga
/
dutch-tokenizer-arena
like
7
Running
App
Files
Files
Community
1
bb1fd2b
dutch-tokenizer-arena
/
vocab
/
chinese_llama2
/
demo.py
xu-song
add more tokenizers
f4973d4
about 1 year ago
raw
Copy download link
history
blame
Safe
132 Bytes
from
vocab.chinese_llama2
import
tokenizer
encoding = tokenizer.encode(
"<s>开始</s>站位符<pad>试试<unk>"
)
print
(encoding)