title: Tokenizer Arena
emoji: 
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 4.38.1
app_file: app.py
pinned: false
datasets:
  - cc100
tags:
  - tokenizer
short_description: Compare different tokenizers at the char and byte level.

Please visit our GitHub repo for more information: https://github.com/xu-song/tokenizer-arena

Run the Gradio demo:

python app.py
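As a rough illustration of what a char-level and byte-level comparison can measure, here is a minimal sketch that reports characters per token and UTF-8 bytes per token for a given tokenizer. The function `compression_stats` and the two toy tokenizers are hypothetical helpers written for this example; they are not taken from `app.py` or the GitHub repo, and the metrics the app actually uses may differ.

```python
from typing import Callable, Dict, List


def compression_stats(text: str, tokenize: Callable[[str], List[str]]) -> Dict[str, float]:
    """Return token count plus chars-per-token and bytes-per-token for `text`.

    Hypothetical helper for illustration only; not part of the app.
    """
    tokens = tokenize(text)
    if not tokens:
        raise ValueError("tokenizer produced no tokens")
    n_tokens = len(tokens)
    n_chars = len(text)                  # Unicode code points
    n_bytes = len(text.encode("utf-8"))  # UTF-8 encoded size
    return {
        "n_tokens": n_tokens,
        "chars_per_token": n_chars / n_tokens,
        "bytes_per_token": n_bytes / n_tokens,
    }


# Two toy tokenizers standing in for real ones (assumptions for this sketch).
def whitespace_tokenize(text: str) -> List[str]:
    return text.split()


def char_tokenize(text: str) -> List[str]:
    return list(text)


sample = "Tokenizer Arena 分词器"

if __name__ == "__main__":
    for name, fn in [("whitespace", whitespace_tokenize), ("char", char_tokenize)]:
        print(name, compression_stats(sample, fn))
```

The two ratios diverge on non-ASCII text, because a single character can occupy several UTF-8 bytes. To try a real tokenizer, pass any callable that maps a string to a list of tokens.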
