Victor Mustar's picture

Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a model about 20 hours ago

ibm-granite/granite-speech-3.2-8b

upvoted a collection 3 days ago

updated a Space 3 days ago

victor/text-flow

View all activity

Organizations

victor's activity

upvoted a collection 3 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 3 days ago • 92

upvoted 2 papers 5 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published 7 days ago • 85

Transformers Use Causal World Models in Maze-Solving Tasks

Paper • 2412.11867 • Published Dec 16, 2024 • 1

upvoted 3 papers 9 days ago

Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Paper • 2503.19757 • Published 12 days ago • 47

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 11 days ago • 118

Cube: A Roblox View of 3D Intelligence

Paper • 2503.15475 • Published 17 days ago • 28

upvoted an article 12 days ago

Article

Introducing Gradio's new Dataframe!

13 days ago

• 22

upvoted 3 papers 12 days ago

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving

Paper • 2503.16905 • Published 16 days ago • 52

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published 16 days ago • 70

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 13 days ago • 110

upvoted 5 collections 12 days ago

Llama Nemotron

Open, Production-ready Enterprise Models • 3 items • Updated 2 days ago • 28

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 39 items • Updated 5 days ago • 95

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 19 days ago • 85

Orpheus TTS

TTS Towards Human-Sounding Speech • 2 items • Updated 18 days ago • 54

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated 16 days ago • 90

upvoted a collection 13 days ago

UIGEN-T1.5 REASONING MODEL

UIGEN'S Next Iteration. UIGEN-T1.5 is a midway model between 1 and 2, reflecting our new data collection pipeline changes. • 5 items • Updated 13 days ago • 6

upvoted a paper 15 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 17 days ago • 46

upvoted a paper 16 days ago

DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published 18 days ago • 44

upvoted an article 18 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

19 days ago

• 33