Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
4-bit precision
AutoTrain Compatible
8-bit precision
Eval Results
Merge
Carbon Emissions
Mixture of Experts
Misc with no match
text-embeddings-inference
Apply filters
Models
6,671
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
xtuner/llava-llama-3-8b
Image-Text-to-Text
•
Updated
Apr 26, 2024
•
20
•
35
xtuner/llava-llama-3-8b-v1_1-hf
Image-Text-to-Text
•
Updated
Apr 28, 2024
•
32
•
25
xtuner/llava-llama-3-8b-v1_1-transformers
Image-Text-to-Text
•
Updated
Apr 28, 2024
•
654k
•
68
cjpais/moondream2-llamafile
Image-Text-to-Text
•
Updated
May 9, 2024
•
367
•
28
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
1.35k
•
186
Intel/llava-llama-3-8b
Image-Text-to-Text
•
Updated
Jul 1, 2024
•
74
•
13
Mit1208/Kosmos-2-PokemonCards-trl-merged
Image-to-Text
•
Updated
May 12, 2024
•
111
•
1
google/paligemma-3b-ft-vqav2-448
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
154
•
13
google/paligemma-3b-mix-224
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
341k
•
65
google/paligemma-3b-mix-448
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
5.28k
•
105
google/paligemma-3b-pt-448
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
163k
•
28
microsoft/llava-med-v1.5-mistral-7b
Image-Text-to-Text
•
Updated
May 14, 2024
•
15.4k
•
64
tinyllava/TinyLLaVA-Phi-2-SigLIP-3.1B
Image-Text-to-Text
•
Updated
May 18, 2024
•
4.26k
•
13
rohit5895/OCR_NumInput_Base
Image-to-Text
•
Updated
19 days ago
•
31
•
1
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
Updated
11 days ago
•
29k
•
1.39k
TIGER-Lab/Mantis-8B-Idefics2
Image-Text-to-Text
•
Updated
Nov 15, 2024
•
598
•
13
lamm-mit/Cephalo-Idefics-2-vision-8b-alpha
Image-Text-to-Text
•
Updated
May 30, 2024
•
54
•
1
OpenGVLab/Mini-InternVL-Chat-4B-V1-5
Image-Text-to-Text
•
Updated
Dec 18, 2024
•
424
•
61
lamm-mit/Cephalo-Idefics-2-vision-10b-alpha
Image-Text-to-Text
•
Updated
May 30, 2024
•
34
•
1
BhashaAI/ViLaH
Visual Question Answering
•
Updated
Jun 4, 2024
•
86
•
1
openvla/openvla-7b
Image-Text-to-Text
•
Updated
Sep 16, 2024
•
69.6k
•
89
cyberagent/llava-calm2-siglip
Image-to-Text
•
Updated
Jun 12, 2024
•
2.52k
•
24
microsoft/Florence-2-base
Image-Text-to-Text
•
Updated
Nov 4, 2024
•
242k
•
201
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
Updated
Jul 20, 2024
•
161k
•
99
gokaygokay/PaliGemma-PixelProse
Image-Text-to-Text
•
Updated
Jun 18, 2024
•
29
•
11
ahmed-masry/chartgemma
Image-Text-to-Text
•
Updated
Jul 27, 2024
•
2.29k
•
39
onnx-community/Florence-2-base-ft
Image-Text-to-Text
•
Updated
Oct 8, 2024
•
21.7k
•
24
onnx-community/Florence-2-base
Image-Text-to-Text
•
Updated
Oct 8, 2024
•
39
•
9
FreedomIntelligence/HuatuoGPT-Vision-34B
Image-Text-to-Text
•
Updated
Jul 3, 2024
•
551
•
17
OpenGVLab/InternVL2-8B
Image-Text-to-Text
•
Updated
Dec 18, 2024
•
22.4k
•
164
Previous
1
...
4
5
6
7
8
...
100
Next