Hugging Face
Models: 7,768
Sort: Trending
Active filters: image-text-to-text
xtuner/llava-phi-3-mini • Image-Text-to-Text • Updated Apr 25, 2024 • 181 • 25
xtuner/llava-phi-3-mini-hf • Image-to-Text • Updated Apr 25, 2024 • 2.73k • 49
cjpais/moondream2-llamafile • Image-Text-to-Text • Updated May 9, 2024 • 435 • 28
braintacles/brainblip • Image-to-Text • Updated Jun 27, 2024 • 71 • 3
Revrse/icon-captioning-model • Image-to-Text • Updated Apr 29, 2024 • 1.07k • 5
HuggingFaceM4/idefics2-8b-chatty • Image-Text-to-Text • Updated Jul 30, 2024 • 2.13k • 94
TIGER-Lab/Mantis-8B-siglip-llama3 • Image-Text-to-Text • Updated Nov 15, 2024 • 14.3k • 33
google/paligemma-3b-pt-448-jax • Image-Text-to-Text • Updated Jan 29 • 3 • 2
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 • Image-Text-to-Text • Updated Feb 3 • 1.17k • 185
Intel/llava-llama-3-8b • Image-Text-to-Text • Updated Jul 1, 2024 • 162 • 14
BUAADreamer/Chinese-LLaVA-Med-7B • Visual Question Answering • Updated May 22, 2024 • 146 • 4
Mit1208/Kosmos-2-PokemonCards-trl-merged • Image-to-Text • Updated May 12, 2024 • 127 • 1
google/paligemma-3b-ft-vqav2-448 • Image-Text-to-Text • Updated Jul 19, 2024 • 883 • 16
google/paligemma-3b-mix-224 • Image-Text-to-Text • Updated Jul 19, 2024 • 333k • 69
google/paligemma-3b-ft-docvqa-896 • Image-Text-to-Text • Updated Jul 19, 2024 • 338 • 9
google/paligemma-3b-mix-448 • Image-Text-to-Text • Updated Jul 19, 2024 • 26.9k • 106
google/paligemma-3b-pt-896 • Image-Text-to-Text • Updated Jul 19, 2024 • 128k • 117
google/paligemma-3b-pt-448 • Image-Text-to-Text • Updated Jul 19, 2024 • 7.96k • 29
OpenGVLab/Mini-InternVL-Chat-2B-V1-5 • Image-Text-to-Text • Updated Feb 5 • 2.5k • 73
tinyllava/TinyLLaVA-Phi-2-SigLIP-3.1B • Image-Text-to-Text • Updated May 18, 2024 • 5.42k • 15
gokaygokay/paligemma-rich-captions • Image-Text-to-Text • Updated Jun 15, 2024 • 17 • 10
rohit5895/OCR_NumInput_Base • Image-to-Text • Updated Jan 7 • 90 • 1
RichardLuo/Shotluck-Holmes-1.5 • Image-Text-to-Text • Updated May 18, 2024 • 58 • 3
openbmb/MiniCPM-Llama3-V-2_5 • Image-Text-to-Text • Updated Jan 15 • 25k • 1.39k
TIGER-Lab/Mantis-8B-Idefics2 • Image-Text-to-Text • Updated Nov 15, 2024 • 270 • 13
merve/paligemma_vqav2 • Image-Text-to-Text • Updated Dec 18, 2024 • 177 • 13
lamm-mit/Cephalo-Idefics-2-vision-8b-alpha • Image-Text-to-Text • Updated May 30, 2024 • 72 • 1
OpenGVLab/Mini-InternVL-Chat-4B-V1-5 • Image-Text-to-Text • Updated Feb 5 • 603 • 62
lamm-mit/Cephalo-Idefics-2-vision-10b-alpha • Image-Text-to-Text • Updated May 30, 2024 • 69 • 1
fhswf/TrOCR_german_handwritten • Image-to-Text • Updated Jun 18, 2024 • 583 • 7