Running on Zero Featured 116 Qwen3-ASR Demo π 116 Transcribe audio to text with multi-language timestamps
Running on Zero Featured 1.76k Dia 1.6B π― 1.76k Generate realistic dialogue from a script, using Dia!
pyannote/speaker-diarization-3.1 Automatic Speech Recognition β’ Updated May 10, 2024 β’ 11.8M β’ 1.69k
Running on Zero Featured 2.07k PuLID-FLUX π€ 2.07k Generate custom images from text and a reference photo
MattyB95/AST-VoxCelebSpoof-Synthetic-Voice-Detection Audio Classification β’ 86.2M β’ Updated Jan 31, 2024 β’ 85 β’ 4
Running on Zero Featured 5.05k FLUX.1 [Schnell] π 5.05k Generate images from text prompts with FLUX.1 Schnell
Configuration error Featured 178 NaturalSpeech3 FACodec π 178 Convert and reconstruct speech files