Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
BEE-spoke-data
's Collections
smol llama
finetuned smol 220M
Pretrained Encoders
Bee Models 🍯
book genre classifiers
tokenizers
FineWeb Concept Datasets
tokenizers
updated
Aug 7, 2024
trained and adapted tokenizers - various
Upvote
-
BEE-spoke-data/claude-tokenizer
Updated
Apr 20, 2024
BEE-spoke-data/claude-tokenizer-forT5
Updated
Jul 28, 2024
BEE-spoke-data/slimpajama_tok-48128-BPE-forT5
Updated
Aug 7, 2024
BEE-spoke-data/BeeTokenizer
Updated
Jul 20, 2024
•
1
BEE-spoke-data/MiniTokenizer-20480
Updated
Jul 21, 2024
sail/scaling-with-vocab-trained-tokenizers
Updated
Aug 2, 2024
•
2
pszemraj/claude-tokenizer-mlm
Updated
Mar 14, 2024
Upvote
-
Share collection
View history
Collection guide
Browse collections