A tokenizer with a vocab size of 20k for Intro to Deep Learning Homework 4 on Language Modelling and Automatic Speech Recognition.
The tokenizer was trained on LibriSpeech LM text.