Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
carlizor
's Collections
Flux
Image restoration
3D Generation
LLM
Embedding
LLM - Small
Video vision
To Read
Video
Image Segmentation
Image Generation (Fast)
Image Depth
Image caption
Audio
Image Generation
Image that talks
Image Enhance
Image Vision
Image editing
Image upscaling
Face Recognition
Multimodal
LLM - Medium
Image Vision
updated
3 days ago
Upvote
-
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
1.49k
•
185
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
Updated
Nov 27, 2024
•
3.58k
•
265
nvidia/NVLM-D-72B
Image-Text-to-Text
•
Updated
5 days ago
•
6.9k
•
763
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
1.03k
•
1.52k
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
Nov 14, 2024
•
10.3k
•
501
deepseek-ai/JanusFlow-1.3B
Any-to-Any
•
Updated
Nov 18, 2024
•
2.94k
•
80
NexaAIDev/OmniVLM-968M
Updated
27 days ago
•
1.74k
•
494
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
3 days ago
•
108k
•
890
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Sep 18, 2024
•
763k
•
1.33k
jiuhai/florence-vl-8b-sft
Updated
Dec 3, 2024
•
127
•
18
AI-Safeguard/Ivy-VL-llava
Visual Question Answering
•
Updated
13 days ago
•
3.13k
•
56
OpenGVLab/InternVL2_5-78B
Image-Text-to-Text
•
Updated
26 days ago
•
6.5k
•
159
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
about 20 hours ago
•
118k
•
488
deepseek-ai/deepseek-vl2
Image-Text-to-Text
•
Updated
26 days ago
•
2.4k
•
131
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
537k
•
492
prithivMLmods/Qwen2-VL-OCR-2B-Instruct
Image-Text-to-Text
•
Updated
1 day ago
•
4.68k
•
25
ByteDance/Sa2VA-1B
Image-Text-to-Text
•
Updated
3 days ago
•
266
•
15
Upvote
-
Share collection
View history
Collection guide
Browse collections