Running on Zero Agents Featured 136 Qwen3-ASR Demo 🎙 136 Transcribe audio to text with timestamps and visualization
Running on Zero Agents Featured 1.78k Dia 1.6B 👯 1.78k Generate realistic dialogue from a script, using Dia!
Running on Zero Agents Featured 2.09k PuLID-FLUX 🤗 2.09k Generate customized images from text and reference photos
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard 🌎 1.02k VLMEvalKit Evaluation Results Collection
Running Agents Featured 2.1k Wan2.1 💻 2.1k Wan: Open and Advanced Large-Scale Video Generative Models
MattyB95/AST-VoxCelebSpoof-Synthetic-Voice-Detection Audio Classification • 86.2M • Updated Jan 31, 2024 • 210 • 4
Running on Zero Agents Featured 5.07k FLUX.1 [Schnell] 🏎 5.07k Generate images from text prompts with FLUX.1-schnell
Running on Zero Agents Featured 731 StyleTTS 2 🗣 731 Efficient, fast, and natural text to speech with StyleTTS 2!