Hugging Face
Mex Ivanov
MexIvanov
2 followers · 11 following
AI & ML interests
NLP, Coding, Quantum Computing and more.
Recent Activity
reacted to tomaarsen's post with ❤️ 3 days ago
An assembly of 18 European companies, labs, and universities have banded together to launch 🇪🇺 EuroBERT! It's a state-of-the-art multilingual encoder for 15 European languages, designed to be finetuned for retrieval, classification, etc.

15 Languages: English, French, German, Spanish, Chinese, Italian, Russian, Polish, Portuguese, Japanese, Vietnamese, Dutch, Arabic, Turkish, Hindi
3 model sizes: 210M, 610M, and 2.1B parameters - very very useful sizes in my opinion
Sequence length of 8192 tokens! Nice to see these higher sequence lengths for encoders becoming more common.
Architecture based on Llama, but with bi-directional (non-causal) attention to turn it into an encoder. Flash Attention 2 is supported.
A new Pareto frontier (stronger *and* smaller) for multilingual encoder models
Evaluated against mDeBERTa, mGTE, XLM-RoBERTa for Retrieval, Classification, and Regression (after finetuning for each task separately): EuroBERT punches way above its weight.
Detailed paper with all details, incl. data: FineWeb for English and CulturaX for multilingual data, The Stack v2 and Proof-Pile-2 for code.

Check out the release blogpost here: https://huggingface.co/blog/EuroBERT/release
* https://huggingface.co/EuroBERT/EuroBERT-210m
* https://huggingface.co/EuroBERT/EuroBERT-610m
* https://huggingface.co/EuroBERT/EuroBERT-2.1B

The next step is for researchers to build upon the 3 EuroBERT base models and publish strong retrieval, zero-shot classification, etc. models for all to use. I'm very much looking forward to it!
liked a dataset 13 days ago: TuringsSolutions/MemoryVaccine120
liked a model 14 days ago: coqui/XTTS-v2
Organizations
None yet
models: 6
MexIvanov/MistRAG-7B-ruen-v1-merged • Text Generation • Updated Nov 25, 2024 • 17
MexIvanov/MistRAG-7B-ruen-v1 • Text Generation • Updated Nov 25, 2024
MexIvanov/MistRAG-7B-ruen-v1-gguf • Text Generation • Updated Nov 25, 2024 • 85
MexIvanov/zephyr-python-ru • Text Generation • Updated Nov 11, 2024 • 2
MexIvanov/zephyr-python-ru-merged • Text Generation • Updated Nov 11, 2024 • 51
MexIvanov/zephyr-python-ru-gguf • Text Generation • Updated Nov 11, 2024 • 63 • 4
datasets: 4
MexIvanov/RAG-v1-ruen • Viewer • Updated Nov 11, 2024 • 51.4k • 91 • 1
MexIvanov/image-gen-vector-consistency • Viewer • Updated Aug 30, 2024 • 184 • 68
MexIvanov/CodeExercise-Python-27k-ru • Viewer • Updated Dec 19, 2023 • 27.2k • 91 • 2
MexIvanov/Vezora-Tested-22k-Python-Alpaca-ru • Viewer • Updated Dec 19, 2023 • 22.6k • 112 • 2