SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 139
On Domain-Specific Post-Training for Multimodal Large Language Models Paper • 2411.19930 • Published Nov 29, 2024 • 28
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated Feb 20 • 72
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated Mar 3 • 25
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 226
LLMs for Extremely Low-Resource Finno-Ugric Languages Paper • 2410.18902 • Published Oct 24, 2024 • 3
MaLA-LM Collection MaLA-LM: Massive Language Adaptation of Large Language Models • 7 items • Updated Oct 7, 2024 • 1
4M Models Collection Multimodal models from https://4m.epfl.ch/ • 17 items • Updated 29 days ago • 31
AIMv2 Collection A collection of AIMv2 vision encoders that support a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP models even more SOTA. • 11 items • Updated 24 days ago • 59
GLiClass Collection Generalist and Lightweight Models for Zero-shot Text Classification • 13 items • Updated Sep 17, 2024 • 14