LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper âĒ 2411.10440 âĒ Published Nov 15, 2024 âĒ 113
Running on T4 1.04k 1.04k Open NotebookLM ð Personalised Podcasts For All - Available in 13 Languages
Running on Zero 1.03k 1.03k PhotoMaker V2 ð· Generate customized photos of a person based on an image and prompt
Running on Zero 722 722 Florence 2 ð Analyze images to generate captions, detect objects, or perform OCR