Why is tokenizer.json 8 vocab entries off?

#22 by shx26

In tokenizer.json, token IDs 3-5 and 9-15 have no definitions.
This results in a discrepancy between the model's vocab size (config.json) and the tokenizer's vocab size.

```python
from transformers import AutoConfig, AutoTokenizer

model = "01-ai/Yi-34B"
config = AutoConfig.from_pretrained(model)
tokenizer = AutoTokenizer.from_pretrained(model)
print(config.vocab_size)  # 64000
print(len(tokenizer))     # 63992
```
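
To see exactly which IDs fall into the gap, you can diff the tokenizer's vocabulary against the full ID range from the config; a minimal sketch, reusing the `config` and `tokenizer` objects above:

```python
# IDs in [0, vocab_size) that the tokenizer never defines.
defined_ids = set(tokenizer.get_vocab().values())
missing_ids = sorted(i for i in range(config.vocab_size) if i not in defined_ids)
print(missing_ids)  # the undefined IDs behind the discrepancy
```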

This has caused decoding errors when Yi-34B is run with other frameworks such as vLLM (e.g. https://github.com/vllm-project/vllm/issues/340). If these skipped-over IDs have no definition, why not realign all IDs starting from 0 and eliminate the gap, or fill in some empty definitions to avoid the error?
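
Until the tokenizer itself is fixed, one downstream workaround is to drop undefined IDs before decoding. A rough sketch, assuming the `tokenizer` object from above; the `safe_decode` helper is hypothetical, not part of any framework:

```python
def safe_decode(tokenizer, ids):
    # Filter out IDs with no vocab entry so decode() never sees them.
    defined = set(tokenizer.get_vocab().values())
    return tokenizer.decode([i for i in ids if i in defined])

print(safe_decode(tokenizer, [1, 3, 42]))  # undefined ID 3 is silently skipped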

01-ai org

Thanks to @shx26 for the question! Those tokens are likely reserved for special tokens like <|im_start|>. Regarding the compatibility issue with the vLLM framework, we haven't encountered it in our previous usage of vLLM. I am currently trying to reproduce the issue and will get back to you with an update shortly!
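
One way to check the special-token theory is to ask the tokenizer what it knows about the gap IDs; a quick sketch, reusing `missing_ids` from above (on fast tokenizers, truly undefined IDs come back as None):

```python
# If the gap IDs were real special tokens, they would resolve to names here.
print(tokenizer.convert_ids_to_tokens(missing_ids))  # None entries = undefined
print(tokenizer.all_special_tokens)                  # the declared special tokens
```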
