LLaVA-Video (Collection) • Models focused on video understanding (previously known as LLaVA-NeXT-Video) • 8 items
Model2Vec: Distill a Small, Fast Model from any Sentence Transformer (Article) • By Pringled and 1 other • Oct 14, 2024
Qwen2.5-VL (Collection) • Vision-language model series based on Qwen2.5 • 8 items
Multimodal Models (Collection) • Multimodal models with leading performance • 17 items
Molmo (Collection) • Artifacts for open multimodal language models • 5 items
AIMv2 (Collection) • AIMv2 vision encoders supporting multiple resolutions, native resolution, and a distilled checkpoint • 19 items
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models (Paper) • arXiv:2409.17146 • Published Sep 25, 2024
Llama 3.2 (Collection) • Transformers-format and original repos of the Llama 3.2 and Llama Guard 3 releases • 15 items
LLaVa-1.5 (Collection) • A series of vision-language models (VLMs) trained on a variety of visual instruction datasets • 3 items
LLaVa-NeXT (Collection) • LLaVa-NeXT (also known as LLaVa-1.6) improves on the 1.5 series with higher image resolutions and more reasoning/OCR datasets • 8 items
Vision-Language Modeling (Collection) • Datasets and models for vision-language modeling • 5 items
CogVLM2 (Collection) • Repos for THUDM's CogVLM2 releases • 8 items
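None of the cards above prescribes a loading recipe, but the LLaVA-family collections ship checkpoints in transformers format. As a minimal sketch of how one of these models might be used (assuming the llava-hf/llava-v1.6-mistral-7b-hf checkpoint from the LLaVa-NeXT collection; substitute any other member of that collection):

```python
# Minimal sketch: loading a LLaVa-NeXT checkpoint with transformers.
# The checkpoint ID and image URL below are assumptions for illustration.
import requests
import torch
from PIL import Image
from transformers import LlavaNextProcessor, LlavaNextForConditionalGeneration

model_id = "llava-hf/llava-v1.6-mistral-7b-hf"  # assumed collection member
processor = LlavaNextProcessor.from_pretrained(model_id)
model = LlavaNextForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Fetch an example image and build a prompt in the Mistral chat format
# that this checkpoint expects.
url = "https://llava-vl.github.io/static/images/view.jpg"
image = Image.open(requests.get(url, stream=True).raw)
prompt = "[INST] <image>\nWhat is shown in this image? [/INST]"

inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(output[0], skip_special_tokens=True))
```

The other collections (Qwen2.5-VL, Molmo, CogVLM2, Llama 3.2) use their own processor and model classes, so this snippet does not transfer to them verbatim.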