Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

liked a Space about 3 hours ago

jamesliu1217/EasyControl_Ghibli

updated a Space 4 days ago

HuggingFaceTB/smolvlm-web-benchmarking-all

published a Space 4 days ago

HuggingFaceTB/smolvlm-web-benchmarking-all

Organizations

Posts 6

Post

2563

Extremely bullish on @CohereForAI's Aya Vision (8B & 32B) - new SOTA open-weight VLMs

- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages, excels in image captioning, VQA & more
- Integrated into transformers from day 0!

Efficient multimodal models are here to stay!!🔥
Check out their blog! https://huggingface.co/blog/aya-vision

Articles 8

Article

224

SmolVLM2: Bringing Video Understanding to Every Device

Collections 1

Papers 5

arxiv:2503.11576

arxiv:2502.02737

arxiv:2408.12637

arxiv:2005.05032

Spaces 6

Technical Interview Internship 2025 Download

Simple space to track downloads of the technical interview

SmolVLM

Ask questions about images or get captions

GGUF My Repo

Speech To Speech Demo

Omni Mini

Record audio and get voice responses

Running on Zero

Florence 2

Ask questions about images to get answers

Models 7

andito/moondream05

Updated Dec 6, 2024

andito/SmolVLM-Base-vqav2

Updated Nov 29, 2024 • 4

andito/mlx_summarization

Text Generation • Updated Oct 31, 2024 • 19

andito/SmolLM2-1.7B-Instruct-F16-GGUF

Updated Oct 31, 2024 • 59 • 1

andito/fast-unidic

Updated Sep 23, 2024 • 1

andito/s2s

Updated Sep 20, 2024 • 3

andito/Florence-2-large-ft

Image-to-Text • Updated Jun 22, 2024 • 389 • 4

Datasets 3

andito/mathwriting-google

Viewer • Updated Jan 4 • 656k • 478 • 4

andito/math-writing-dataset-google

Updated Jan 4 • 18

andito/chatbot_arena_completions

Viewer • Updated Jul 5, 2024 • 33k • 43 • 2