zirui3
/

llm-multilingual-tokenizer

Upload README.md

3315142 over 1 year ago

143 Bytes

summary

multilingual tokenizer trained on multilingual data by using the SentencePiece library and the BPE algorithm.