A collection of tokenisers I have trained (so you don't have to).
Thomas Bauwens
Bauwens
AI & ML interests: NLP
Recent Activity
Updated a model about 2 months ago: Bauwens/BPE-40k_OSCAR-en-30M
Published a model about 2 months ago: Bauwens/BPE-40k_OSCAR-en-30M
New activity 10 months ago: Muennighoff/flores200: error loading dataset