Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
AutoTrain Compatible
4-bit precision
8-bit precision
Eval Results
Merge
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
5,526
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
microsoft/OmniParser
Image-Text-to-Text
•
Updated
4 days ago
•
4.7k
•
1.01k
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
Sep 30
•
2.3M
•
•
856
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Sep 18
•
283k
•
1.15k
Qwen/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
1.07M
•
•
778
rhymes-ai/Aria
Image-Text-to-Text
•
Updated
15 days ago
•
26.7k
•
571
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
376k
•
254
jadechoghari/Ferret-UI-Gemma2b
Image-Text-to-Text
•
Updated
18 days ago
•
1.66k
•
35
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
Sep 27
•
98.6k
•
321
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
18 days ago
•
31.6k
•
729
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
Updated
18 days ago
•
583
•
32
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
Sep 21
•
58.3k
•
153
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
26 days ago
•
76.4k
•
419
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
about 18 hours ago
•
1.47M
•
1.19k
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
20 days ago
•
127k
•
794
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental
Image-Text-to-Text
•
Updated
2 days ago
•
66
•
9
meta-llama/Llama-3.2-90B-Vision-Instruct
Image-Text-to-Text
•
Updated
Sep 30
•
188k
•
244
Salesforce/blip-image-captioning-large
Image-to-Text
•
Updated
Dec 7, 2023
•
2.41M
•
•
1.14k
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Aug 26
•
179k
•
674
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
•
Updated
Sep 27
•
5.51k
•
93
OpenGVLab/InternVL2-2B
Image-Text-to-Text
•
Updated
Sep 24
•
151k
•
55
OpenGVLab/Mono-InternVL-2B
Image-Text-to-Text
•
Updated
2 days ago
•
16.5k
•
21
BAAI/Aquila-VL-2B-llava-qwen
Image-Text-to-Text
•
Updated
7 days ago
•
1.05k
•
32
mistral-community/pixtral-12b
Image-Text-to-Text
•
Updated
18 days ago
•
28.1k
•
62
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
14 days ago
•
7.64k
•
238
SeanScripts/pixtral-12b-nf4
Image-Text-to-Text
•
Updated
Sep 26
•
1.81k
•
16
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text
•
Updated
15 days ago
•
1.46k
•
34
microsoft/trocr-base-handwritten
Image-to-Text
•
Updated
May 27
•
639k
•
322
microsoft/trocr-large-handwritten
Image-to-Text
•
Updated
May 27
•
72.4k
•
94
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
2.18M
•
•
821
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Aug 1, 2023
•
1.95M
•
505
Previous
1
2
3
...
100
Next