Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
xu-song
/
tokenizer-arena
like
58
Running
App
Files
Files
Community
1
a37f943
tokenizer-arena
/
vocab
/
glm_chinese
2 contributors
History:
1 commit
xu-song
update
751936e
over 1 year ago
chinese_sentencepiece
update
over 1 year ago
README.md
Safe
487 Bytes
update
over 1 year ago
__init__.py
Safe
827 Bytes
update
over 1 year ago
convert_vocab_to_txt.py
Safe
689 Bytes
update
over 1 year ago
file_utils.py
Safe
8.38 kB
update
over 1 year ago
glm_chinese.vocab.txt
Safe
659 kB
update
over 1 year ago
sp_tokenizer.py
Safe
4.67 kB
update
over 1 year ago
test.py
Safe
65 Bytes
update
over 1 year ago
test_glm.py
Safe
2.5 kB
update
over 1 year ago
tokenization.py
Safe
51.9 kB
update
over 1 year ago
tokenization_gpt2.py
Safe
13.5 kB
update
over 1 year ago
utils.py
Safe
213 Bytes
update
over 1 year ago
wordpiece.py
Safe
15.5 kB
update
over 1 year ago