BERT as a language model
Generating synthetic data via self-chatting
Compare character-level and byte-level tokenizers
GPT-4o-like bot
Knowledge-injected Pre-trained Language Model
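
For the tokenizer comparison item above, a minimal sketch of one possible starting point: it splits the same strings at the character level and at the UTF-8 byte level and prints the resulting sequence lengths. The helper names `char_tokens` and `byte_tokens` and the sample strings are illustrative assumptions, not taken from any particular tokenizer library.

```python
# Hypothetical helpers for illustration only; real tokenizers add vocabularies,
# special tokens, and merge rules on top of these basic splits.

def char_tokens(text: str) -> list[str]:
    """Character-level split: one token per Unicode code point."""
    return list(text)


def byte_tokens(text: str) -> list[int]:
    """Byte-level split: one token per UTF-8 byte (values 0..255)."""
    return list(text.encode("utf-8"))


if __name__ == "__main__":
    # Escapes keep the code points unambiguous: U+00EF, then U+4F60 U+597D.
    samples = ["hello", "na\u00efve", "\u4f60\u597d"]
    for s in samples:
        print(f"{s!r}: {len(char_tokens(s))} chars, {len(byte_tokens(s))} bytes")
```

ASCII text gives the same length under both schemes, while non-ASCII text produces longer byte-level sequences. In exchange, the byte-level vocabulary is fixed at 256 symbols and never needs an unknown-token fallback.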