LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 4 days ago • 40
laion/CLIP-ViT-H-14-laion2B-s32B-b79K Zero-Shot Image Classification • Updated Jan 16, 2024 • 1.26M • 346
alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta Image-to-Image • Updated Oct 12, 2024 • 13.7k • 249
ObjCtrl-2.5D: Training-free Object Control with Camera Poses Paper • 2412.07721 • Published Dec 10, 2024 • 8