SebastianRuff
's Collections
Vision Models 12/2024 Tryout
updated
jiuhai/florence-vl-8b-sft
Updated
•
120
•
18
OpenGVLab/InternVL2_5-26B-MPO
Image-Text-to-Text
•
Updated
•
4.31k
•
13
OpenGVLab/InternVL2_5-78B
Image-Text-to-Text
•
Updated
•
42k
•
169
OpenGVLab/InternVL2_5-38B
Image-Text-to-Text
•
Updated
•
10.4k
•
40
OpenGVLab/InternVL2_5-8B
Image-Text-to-Text
•
Updated
•
47k
•
74
OpenGVLab/InternVL2_5-8B-MPO
Image-Text-to-Text
•
Updated
•
14.5k
•
37
OpenGVLab/InternVL2_5-78B-MPO
Image-Text-to-Text
•
Updated
•
4.56k
•
34
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
•
169k
•
537
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
Updated
•
4.85k
•
178
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text
•
Updated
•
9.5k
•
66
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text
•
Updated
•
38.5k
•
94
ByteDance/Sa2VA-8B
Image-Text-to-Text
•
Updated
•
4.42k
•
44
ByteDance/Sa2VA-4B
Image-Text-to-Text
•
Updated
•
4.7k
•
61
ByteDance/Sa2VA-1B
Image-Text-to-Text
•
Updated
•
1.36k
•
16
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
•
146k
•
1.02k
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
•
1.81M
•
•
1.11k
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
•
137k
•
273
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
•
970k
•
389
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
•
2.29M
•
•
1.29k
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
Updated
•
338k
•
656
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
•
Updated
•
8.99k
•
118
OpenGVLab/InternVL2_5-4B-MPO
Image-Text-to-Text
•
Updated
•
6.16k
•
17
OpenGVLab/InternVL2_5-26B
Image-Text-to-Text
•
Updated
•
5.34k
•
33
OpenGVLab/InternVL2_5-38B-MPO
Image-Text-to-Text
•
Updated
•
4.25k
•
19
OpenGVLab/InternVL2_5-4B
Image-Text-to-Text
•
Updated
•
16.4k
•
42
mistralai/Pixtral-12B-2409
Image-Text-to-Text
•
Updated
•
594
mistralai/Pixtral-12B-Base-2409
mistralai/Pixtral-Large-Instruct-2411
Image-Text-to-Text
•
Updated
•
8
•
394
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
•
581k
•
502
allenai/Molmo-7B-O-0924
Image-Text-to-Text
•
Updated
•
6.98k
•
153
allenai/Molmo-72B-0924
Image-Text-to-Text
•
Updated
•
3.08k
•
280
allenai/MolmoE-1B-0924
Image-Text-to-Text
•
Updated
•
9.23k
•
135
Efficient-Large-Model/NVILA-15B
Text Generation
•
Updated
•
24k
•
11
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
•
164k
•
947
THUDM/cogvlm2-llama3-chat-19B
Text Generation
•
Updated
•
7.38k
•
208
THUDM/cogvlm2-llama3-chat-19B-int4
Text Generation
•
Updated
•
1.26k
•
28
lmms-lab/llava-onevision-qwen2-7b-ov
Text Generation
•
Updated
•
155k
•
44
lmms-lab/llava-onevision-qwen2-72b-ov-chat
Image-Text-to-Text
•
Updated
•
597
•
8
THUDM/glm-4v-9b
Updated
•
102k
•
251
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text
•
Updated
•
974
•
60
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
•
3.11k
•
268
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
•
135k
•
922
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
•
Updated
•
86.5k
•
366
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
•
228k
•
450
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
•
Updated
•
44.3k
•
•
318
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
•
47.3k
•
765
google/paligemma2-28b-pt-896
Image-Text-to-Text
•
Updated
•
2k
•
45
google/paligemma2-28b-pt-448
Image-Text-to-Text
•
Updated
•
224
•
9
google/paligemma2-10b-pt-896
Image-Text-to-Text
•
Updated
•
4.49k
•
29
google/paligemma2-10b-pt-448
Image-Text-to-Text
•
Updated
•
2.59k
•
12
google/paligemma2-3b-pt-896
Image-Text-to-Text
•
Updated
•
3.08k
•
23
google/paligemma2-3b-pt-448
Image-Text-to-Text
•
Updated
•
12.8k
•
41
xtuner/llava-llama-3-8b-v1_1-gguf
Image-to-Text
•
Updated
•
8.02k
•
206
Efficient-Large-Model/NVILA-8B
Text Generation
•
Updated
•
8.9k
•
3
Efficient-Large-Model/VILA1.5-3b
Text Generation
•
Updated
•
13.1k
•
23
ICTNLP/llava-mini-llama-3.1-8b
Image-Text-to-Text
•
Updated
•
7.07k
•
41
openbmb/MiniCPM-o-2_6
Any-to-Any
•
Updated
•
300k
•
914
omkarthawakar/LlamaV-o1
Question Answering
•
Updated
•
9.61k
•
87
ByteDance/Sa2VA-26B
Image-Text-to-Text
•
Updated
•
171
•
12
MiniMaxAI/MiniMax-VL-01
Image-Text-to-Text
•
Updated
•
2.1k
•
229
Qwen/Qwen2.5-VL-72B-Instruct
Image-Text-to-Text
•
Updated
•
30.1k
•
214
Qwen/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
•
231k
•
312
Qwen/Qwen2.5-VL-3B-Instruct
Image-Text-to-Text
•
Updated
•
79.9k
•
156
nvidia/Eagle2-9B
Image-Text-to-Text
•
Updated
•
2.4k
•
35