tokenizer-arena / README.md
xu-song's picture
config python version
11379e2
|
raw
history blame
513 Bytes
metadata
title: Tokenizer Arena
emoji: 
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 3.41.2
app_file: app.py
pinned: false
python_version: 3.9

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

ss

TODO

  • 搜索栏

统计

vocabsize

  • 增大能提到压缩率,副作用是增大计算量和内存 (getting the most out of your tokenizer for pre-training and)

https://huggingface.co/spaces/yenniejun/tokenizers-languages