tokenizer-arena / vocab /moss /test_arena.py
xu-song's picture
add more tokenizers
f4973d4
raw
history blame
125 Bytes
from vocab.moss import tokenizer
from utils.zh_util import iter_vocab
tokenizer.get_vocab()
iter_vocab(tokenizer, "test")