tokenizer-arena / README.md
xu-song's picture
config python version
11379e2
|
raw
history blame
513 Bytes
---
title: Tokenizer Arena
emoji:
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 3.41.2
app_file: app.py
pinned: false
python_version: 3.9
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
## ss
## TODO
- 搜索栏
-
## 统计
## vocabsize
- 增大能提到压缩率,副作用是增大计算量和内存 (getting the most out of your tokenizer for pre-training and)
-
https://huggingface.co/spaces/yenniejun/tokenizers-languages