janked together t5 base arch that uses silu instead of geglu and the q2.5 tokenizer