Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
AutoTrain Compatible
4-bit precision
8-bit precision
Eval Results
Merge
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
6,464
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
Updated
Sep 26, 2024
•
262k
•
636
meta-llama/Llama-3.2-11B-Vision
Image-Text-to-Text
•
Updated
Sep 27, 2024
•
58.3k
•
427
OpenGVLab/InternVL2_5-8B
Image-Text-to-Text
•
Updated
25 days ago
•
34.5k
•
68
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
Updated
about 15 hours ago
•
4.31k
•
25
Alibaba-NLP/gme-Qwen2-VL-2B-Instruct
Sentence Similarity
•
Updated
6 days ago
•
1.4k
•
12
RUC-AIBOX/Virgo-72B
Image-Text-to-Text
•
Updated
1 day ago
•
45
•
5
microsoft/trocr-large-printed
Image-to-Text
•
Updated
May 27, 2024
•
335k
•
149
YipengZhang/LLaVA-UHD-v2
Image-Text-to-Text
•
Updated
about 7 hours ago
•
41
•
5
WueNLP/centurio_qwen
Image-Text-to-Text
•
Updated
1 day ago
•
40
•
4
THUDM/cogagent-9b-20241220
Image-Text-to-Text
•
Updated
18 days ago
•
2.4k
•
39
Alibaba-NLP/gme-Qwen2-VL-7B-Instruct
Sentence Similarity
•
Updated
5 days ago
•
601
•
13
osunlp/UGround-V1-2B
Image-Text-to-Text
•
Updated
5 days ago
•
376
•
5
llamaindex/vdr-2b-v1
Image-Text-to-Text
•
Updated
1 day ago
•
21
•
4
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
1.14M
•
•
859
naver-clova-ix/donut-base
Image-to-Text
•
Updated
Aug 13, 2022
•
46.8k
•
185
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
267k
•
328
google/matcha-chart2text-pew
Visual Question Answering
•
Updated
Jul 22, 2023
•
418
•
32
google/deplot
Visual Question Answering
•
Updated
Sep 6, 2023
•
8.72k
•
277
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
1.07M
•
399
xtuner/llava-llama-3-8b-v1_1-transformers
Image-Text-to-Text
•
Updated
Apr 28, 2024
•
352k
•
67
microsoft/Florence-2-base
Image-Text-to-Text
•
Updated
Nov 4, 2024
•
201k
•
198
gokaygokay/Florence-2-Flux-Large
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
14.6k
•
30
OpenGVLab/InternVL2_5-1B
Image-Text-to-Text
•
Updated
25 days ago
•
10.3k
•
41
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
41k
•
124
nvidia/NVLM-D-72B-mcore
Image-Text-to-Text
•
Updated
4 days ago
•
6
OpenGVLab/InternVL2_5-78B-MPO
Image-Text-to-Text
•
Updated
21 days ago
•
7.63k
•
26
OpenGVLab/InternVL2_5-8B-MPO
Image-Text-to-Text
•
Updated
21 days ago
•
11.9k
•
21
bartowski/QVQ-72B-Preview-GGUF
Image-Text-to-Text
•
Updated
3 days ago
•
187k
•
52
kha-white/manga-ocr-base
Image-to-Text
•
Updated
Jun 22, 2022
•
62.1k
•
133
microsoft/trocr-base-printed
Image-to-Text
•
Updated
May 27, 2024
•
130k
•
•
158
Previous
1
2
3
4
...
100
Next