Apolinário from multimodal AI art's picture

Building on HF

Apolinário from multimodal AI art PRO

multimodalart

huggingface

·

https://multimodal.art

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

mistralai/Voxtral-Mini-Realtime

liked a model 1 day ago

internlm/Intern-S1-Pro

liked a Space 1 day ago

multimodalart/ltx2-audio-to-video

View all activity

Organizations

upvoted a paper 4 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 4 days ago • 199

upvoted an article 17 days ago

Article

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

+3

18 days ago

•

36

upvoted a paper 25 days ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 122

upvoted a paper about 2 months ago

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs

Paper • 2512.09742 • Published Dec 10, 2025 • 3

upvoted an article about 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

589

upvoted a paper 2 months ago

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published Dec 2, 2025 • 12

upvoted a changelog 2 months ago

Changelog

Duplicate Datasets

Dec 3, 2025

• 104

upvoted 3 collections 2 months ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 149

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 89

Z-Image

7 items • Updated 10 days ago • 137

upvoted an article 2 months ago

Article

Diffusers welcomes FLUX-2

+6

Nov 25, 2025

•

178

upvoted an article 3 months ago

Article

Introducing Cogito v2.1

Nov 19, 2025

•

17

upvoted a paper 3 months ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published Nov 17, 2025 • 69

upvoted a collection 3 months ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 82

upvoted 2 articles 3 months ago

Article

We’re open-sourcing our text-to-image model and the process behind it

Nov 12, 2025

•

83

Article

What makes good reasoning data

Oct 30, 2025

•

44

upvoted a paper 3 months ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 62

upvoted a collection 3 months ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 3 days ago • 74

upvoted an article 3 months ago

Article

Granite 4.0 Nano: Just how small can you go?

Oct 28, 2025

•

123

upvoted a paper 3 months ago

Group Relative Attention Guidance for Image Editing

Paper • 2510.24657 • Published Oct 28, 2025 • 26