Models: 5,286

Active filters: image-text-to-text, transformers

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated 1 day ago • 7.75k • 354

moondream/moondream3-preview

Image-Text-to-Text • 9B • Updated about 5 hours ago • 349 • 154

Hcompany/Holo1.5-7B

Image-Text-to-Text • 8B • Updated 5 days ago • 313 • 66

tencent/POINTS-Reader

Image-Text-to-Text • 4B • Updated 8 days ago • 1.77k • 90

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated 5 days ago • 66.8k • 958

Hcompany/Holo1.5-3B

Image-Text-to-Text • 4B • Updated 5 days ago • 193 • 28

ibm-granite/granite-docling-258M-mlx

Image-Text-to-Text • 0.3B • Updated 3 days ago • 407 • 24

Hcompany/Holo1.5-72B

Image-Text-to-Text • 73B • Updated 4 days ago • 69 • 21

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.23M • 1.25k

baidu/ERNIE-4.5-VL-28B-A3B-PT

Image-Text-to-Text • 29B • Updated 19 days ago • 197k • 82

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated 2 days ago • 298 • 15

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Aug 18 • 36.4k • 649

baidu/ERNIE-4.5-VL-424B-A47B-PT

Image-Text-to-Text • 424B • Updated 19 days ago • 162k • 96

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated 3 days ago • 136k • 1.58k

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 1.74M • 849

google/medgemma-4b-it

Image-Text-to-Text • 5B • Updated Jul 9 • 104k • 668

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated Jul 14 • 214k • 769

OpenGVLab/ScaleCUA-32B

Image-Text-to-Text • 33B • Updated 2 days ago • 12 • 11

prithivMLmods/Gliese-OCR-7B-Post1.0

Image-Text-to-Text • 8B • Updated 5 days ago • 327 • 10

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 904k • 1.61k

fancyfeast/llama-joycaption-beta-one-hf-llava

Image-Text-to-Text • 8B • Updated May 16 • 78k • 212

google/medgemma-27b-it

Image-Text-to-Text • 29B • Updated Jul 10 • 18.7k • 200

microsoft/Florence-2-large

Image-Text-to-Text • 0.8B • Updated Aug 4 • 678k • 1.67k

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 615k • 1.09k

nvidia/Cosmos-Reason1-7B

Image-Text-to-Text • 8B • Updated Aug 14 • 368k • 174

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20 • 275k • 1.51k

google/gemma-3n-E2B-it

Image-Text-to-Text • 5B • Updated Jul 14 • 196k • 209

NCSOFT/VARCO-VISION-2.0-14B

Image-Text-to-Text • 15B • Updated 5 days ago • 4.71k • 39

merve/smol-vision

Image-Text-to-Text • Updated 7 days ago • 137

OpenGVLab/InternVL3_5-8B

Image-Text-to-Text • 9B • Updated 21 days ago • 21.5k • 66