Yasas
ysenarath
1 follower · 5 following
http://www.ysenarath.com
wayasas
ysenarath
yasas.bsky.social
AI & ML interests
NLU, KB, Crisis Informatics, Social Media Mining
Recent Activity
reacted to tomaarsen's post with ❤️ 3 days ago
An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT! It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

🇪🇺 15 Languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
3️⃣ 3 model sizes: 210M, 610M, and 2.1B parameters - very very useful sizes in my opinion
➡️ Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
⚙️ Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
🔥 A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
Evaluated against mDeBERTa, mGTE, XLM-RoBERTa for Retrieval, Classification, and Regression (after finetuning for each task separately): EuroBERT punches way above its weight.
Detailed paper with all details, incl. data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.

Check out the release blogpost here: https://huggingface.co/blog/EuroBERT/release
* https://huggingface.co/EuroBERT/EuroBERT-210m
* https://huggingface.co/EuroBERT/EuroBERT-610m
* https://huggingface.co/EuroBERT/EuroBERT-2.1B

The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
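The post says the architecture is based on Llama but uses bi-directional (non-causal) attention. As a rough illustration of that distinction only, the sketch below builds both kinds of attention mask in plain Python. The helper name `attention_mask` is hypothetical and this is not EuroBERT's actual implementation; it just shows that a causal mask lets each position see itself and earlier positions, while a bi-directional mask lets every position see the whole sequence.

```python
def attention_mask(seq_len: int, causal: bool) -> list[list[int]]:
    """Hypothetical helper: build a seq_len x seq_len attention mask.

    mask[i][j] == 1 means the token at position i may attend to position j.
    """
    return [
        # Causal: only positions j <= i are visible. Bi-directional: all are.
        [1 if (not causal or j <= i) else 0 for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Decoder-style (causal) mask: lower-triangular.
causal_mask = attention_mask(4, causal=True)

# Encoder-style (bi-directional) mask: all ones.
bidirectional_mask = attention_mask(4, causal=False)

for row in causal_mask:
    print(row)
```

An encoder can use the full mask because it never has to predict the next token from left context alone, which is presumably why the post describes the change as turning the model into an encoder.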
updated a model 3 days ago
ysenarath/roberta-base-hoeken2024hateful-augmented-v2
published a model 3 days ago
ysenarath/roberta-base-hoeken2024hateful-augmented-v2
ysenarath's activity
liked 3 models 3 months ago
tencent/HunyuanVideo • Text-to-Video • Updated 7 days ago • 5.61k downloads • 1.75k likes
Qwen/QwQ-32B-Preview • Text Generation • Updated Jan 12 • 249k downloads • 1.72k likes
Datou1111/shou_xin • Text-to-Image • Updated Dec 9, 2024 • 1.99k downloads • 866 likes
liked a model 4 months ago
jinaai/jina-embeddings-v3 • Feature Extraction • Updated 17 days ago • 2.17M downloads • 817 likes