Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
12.3
TFLOPS
27
Ari Jankelowitz
ajankelo
Follow
0 followers
·
2 following
ajankelo
ajankelo
AI & ML interests
None yet
Recent Activity
liked
a model
5 days ago
jxm/cde-small-v2
reacted
to
jxm
's
post
with ❤️
6 days ago
New state-of-the-art BERT-size retrieval model: *cde-small-v2* 🥳🍾 Hi everyone! We at Cornell are releasing a new retrieval model this week. It uses the contextual embeddings framework, is based on ModernBERT backbone, and gets state-of-the-art results on the MTEB benchmark for its model size (140M parameters). cde-small-v2 gets an average score of 65.6 across the 56 datasets and sees improvements from our previous model in *every* task domain (retrieval, classification, etc.). We made a lot of changes to make this model work. First of all, ModernBERT has a better tokenizer, which probably helped this work out-of-the-box. We also followed the principles from the CDE paper and used harder clusters and better hard-negative filtering, which showed a small performance improvement. And we made a few small changes that have been shown to work on the larger models: we disabled weight decay, masked out the prefix tokens during pooling, and added a residual connection from the first-stage to the second-stage for better gradient flow. We're still looking for a computer sponsor to help us scale CDE to larger models. Since it's now state-of-the-art at the 100M parameter scale, it seems to be a reasonable bet that we could train a state-of-the-art large model if we had the GPUs. If you're interested in helping with this, please reach out! Here's a link to the model: https://huggingface.co/jxm/cde-small-v2 And here's a link to the paper: https://huggingface.co/papers/2410.02525
liked
a dataset
about 1 month ago
zenml/llmops-database
View all activity
Organizations
spaces
2
Sort: Recently updated
Build error
🚙
Pklot Experiment Latest
Runtime error
🌖
Rio Favela or Torrox Spain
models
3
Sort: Recently updated
ajankelo/ppo-Huggy
Reinforcement Learning
•
Updated
Mar 17, 2023
•
4
ajankelo/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Mar 15, 2023
•
3
ajankelo/pklot_small_model
Updated
Oct 28, 2022
datasets
1
ajankelo/pklot_50
Updated
Oct 28, 2022
•
38
•
1