ydl1y17's picture
Upload custom tokenizer
c7bfbaa verified
raw
history blame
366 Bytes
{"tokenizer_file": "custom_tokenizer.json", "model_type": "bpe", "normalizer": {"type": "Sequence", "normalizers": [{"type": "NFD"}, {"type": "Lowercase"}, {"type": "StripAccents"}]}, "pre_tokenizer": {"type": "Sequence", "pre_tokenizers": [{"type": "Whitespace"}, {"type": "Punctuation"}, {"type": "Digits", "individual_digits": "false"}]}, "do_lower_case": "true"}