Spaces for Audio / Voices
- Running on Zero354🚀
- Running on Zero10👅🎙️🥰
SBV2 Chupa Demo
- Running2😊🎙️📖
VisualNovel_sbv_demo
- Running on CPU Upgrade607😊🎙️
Moe TTS
- Running5🏺
Bert-VITS2 AI Abe&Suga&Kishida
- Running33🚀
AICoverGen
- Build error13:🎤
rvc-Blue-archives-hoyogames
- Running38▶️🎤
VTuber RVC Models
- Running336👀
RVC Inference HF
- Running on Zero213🏃
Audio🔹Separator
Vocal and background audio separator
- Running42📉
BlueArchiveTTS
- Running140😆🌖😀
Multi Voice TTS(English/Chinese/Japanese)
[Chinese/English/Japanese] multilingual text-to-speech
- Running on Zero375🔥
Stable Audio Open Zero
- Running137🍏
Applio
A simple, high-quality voice conversion tool
- Running on Zero1.55k🗣️
Voice Clone
- Running on Zero148⚡
RVC⚡ZERO
Voice conversion framework based on VITS
- Running6🎙🐴
Multilingual Anime TTS
- Runtime error1🎶
DiffSinger🎶 Diffusion for Singing Voice Synthesis
- Running123🎵
Ultimate Vocal Remover WebUI
- Running232🍏😺
Aesthetic RVC Inference HF
- Running61⚡
Advanced RVC Inference
- Running775🏃
Vits Models
- Running493🎙🐴
Multilingual Anime TTS
- Running32⚡
LoveLive-ShojoKageki VITS
- Running362🐨
vits-uma-genshin-honkai
- Running3🏺
おしゃべり晋さんメーカー(Style-Bert-VITS2)
- Running10😊▶️
Hololive Style-Bert-VITS2
- Running on Zero463🎼🎶
Midi Music Generator
- Running22🎼
Japanese Lyric Generator
- Running on A10G350🎙
VALL E X
- Running2🔥
AI晋さんメーカー
- Running6📉
BangDream-ShojoKageki Bert VITS2
- Running3📈
lovelive-ShojoKageki VITS JPZH
- Running17🌖
Lovelive-nijigasaki-MB-iSTFT-VITS-ZH&JP
- Running on T42.09k🐶
Bark
- Running999🤗
OpenVoice
- Running270🤗
OpenVoiceV2
- Runtime error59🐠
ChatTTS OpenVoice
- Running on T4178🌍🦜
MassivelyMultilingualTTS
- Running on T42.19k🐸
XTTS
- Running on A10G4.64k🎵
MusicGen
- Runtime error515📞
Seamless M4T v2
- Sleeping60📉
Mars5 Space
- Running on Zero9🎙️💾🔄🗣️
FAcodecV2
- Running on A10G228👋
TTS x Hallo Talking Portrait
Generate Talking avatars from Text-to-Speech
- Running on CPU Upgrade388🎤
RVC Genshin Impact
- Running on Zero87📚
FoleyCrafter
- Running192🏃
Voice Clone Multilingual
Languages ru,en,zh-cn,ja,de,fr,it,pt,pl,tr,ko,nl,cs,ar,es,hu
- Running on Zero14🐨
Talkalkai Cover
- Running on Zero459🎺
Image to Music v2
Get a music sample inspired by the mood of an image
- Running188🕒
Whisper Timestamped
In-browser speech recognition w/ word-level timestamps
- Running on CPU Upgrade544🏆
TTS Arena
Vote on the latest TTS models!
- Running19🥇
TTSDS Benchmark and Leaderboard
Text-To-Speech (TTS) Evaluation using objective metrics.
- Sleeping6🐨
LAKH MIDI Dataset Search
Search and explore LAKH MIDI dataset with MidiCaps
- Running on Zero23📈
PicoAudio
- Running13🏆
Advanced MIDI Search
Search and explore 179k+ MIDI titles
- Running on Zero77🐠
SenseVoice
- Running220🗣️
Whisper Speaker Diarization
- Running239🚀
Faster Whisper Webui
- Running on Zero31🎤
Vocal Separation SOTA
- Running82🐠
BangDream-ShojoKageki Bert VITS2
- Running2🐠
BangDream-ShojoKageki Api
- Running15🐠
BangDream-ShojoKageki Bert VITS2
- Running13🔊
Efficient Audio Captioning
- Running on Zero173🏃
NaturalSpeech3 FACodec
- Running242🌍
tts Text To Speech
- Sleeping4🌍
Edge Tts
- Runtime error14🏆
JA TTS Arena
Vote on the top Japanese TTS models!
- Running10⚡
MIKU TTS
- Running10🎹
Genshin music generation
- Sleeping3⚡
Advanced RVC Inference
- Sleeping🐠
Style Bert VITS2 MT
- Paused3🎙️
ZeroRVC
- Running11👁
Edge TTS w/ More Options
- Runtime error33⚡
EZ Voice Clone
- Runtime error3⚡
Training Helper Rvc
Easy training helper for RVC
- Running on Zero20🚀
Anitalker
- Running6:🎤
rvc-Blue-archives
- Sleeping73🌊
Fish Diffusion (HiFiSinger) Demo
- Running15🥰
Japanese Ero Voice Classifier
- Running29😊🎙️📖
Style Bert VITS2 Editor Demo
- Running on A10G395🏆
Fish Speech 1
- Building8🎹->🎵
Piano transcription
- Sleeping1⚡
Rvc Demo
A demo of RVC pip
- Running102🐶
Bark Voice Cloning
- Sleeping1🐸
NeonAI Coqui AI TTS Plugin
- Running105🐸
NeonAI Coqui AI TTS Plugin
- Running145🌍
Qwen2 Audio Instruct Demo
- Running8🗣️
StyleTTS 2
Efficient, fast, and natural text to speech with StyleTTS 2!
- Runtime error12🔥
AICoverGen
- Running11🔥
Harmonic Melody MIDI Mixer
Harmonize and mix any MIDI melody
- Running7🎻
MusicGen Riff
Music Generator | Song Maker Free | Lyrics Generator
- Runtime error30🎵
Ilaria Audio Analyzer
- Running on Zero696😻
Ilaria RVC
- Runtime error4🚀 🗿
MDX UVR
- Running on Zero96🤗
GPT SoVITS V2
- Running7🗣️
Read My Pdf Outloud
- Running6⚡
Vocal Remover
- Running on Zero769🥖
Parler-TTS
High-fidelity Text-To-Speech
- Runtime error3🥰
Japanese Ero Voice Classifier
- Running3🐠
GPT-SoVITS-ToneControl_test
- Running18📊
Umamusume Bert Vits2
- Sleeping1📈
Animalese Py
- Sleeping2🔶
Animalese RVC
- Build error4📊
AI Hanser
- Running on Zero156💻
Stable Audio Live Multiplayer
- Running445👁
Edge TTS Text To Speech
- Running15🐨
Youtube AI Summarizer
- Sleeping4🚀
AICoverGen
- Running1💻
Animalese Js
- Sleeping1💬
ASR Model Comparison
- Running4🔥
AICoverGenMod
- Configuration error1🔨
Ilaria Converter
- Sleeping1👁
RVC UI TES
- Running8🎤
RVC Genshin Impact
- Sleeping1🦀
Voice2VoiceChatbot
- Running🌖
RealTimeVoicetoVoiceChatbot
sp-uhh/speech-enhancement-sgmse
Audio-to-Audio • Updated • 14 • 9
- Sleeping2🏃
RVC UI
An easy-to-use voice conversion framework based on VITS.
- Sleeping🏃
RVC
- Sleeping🌍
AI Voice Assistance
- Running on Zero1🗣️
Voice Clone
- Running5🌍
Optimus
- Running38👀
Doc To Dialogue
Transform a report or document into an interview/discussion
- Running46⚡
Voicee
World's fastest Voice Assistant
- Running6🐟
Fish Audio API Demo
- Running on Zero58👁
Musicgen Songstarter Demo
- Running80▶️🐻💿
Hololive Rvc Models V2
- Running23🎹
Advanced MIDI Renderer
Transform and render any MIDI
- Sleeping3🚀
Imagen POP Music Medley Diffusion Transformer
Generate POP music medley with Imagen diffusion transformer
- Sleeping2🔥
Ultimate MIDI Classifier
Classify absolutely any MIDI by genre, song and artist
- Running on Zero4📚
Intelligent MIDI Comparator
Intelligently compare any pair of MIDIs
- Running91🌍
ChatTTS Speaker
- Sleeping2🌖
Bridge Music Transformer
Generate a seamless bridge between two composition parts
- Running57👀
vits-simple-api
- Running11🎙️
Bert VITS Umamusume Genshin HonkaiSR
- Running on Zero32🔊⏫
Audio SR
Fixed fork of the original audio sr!
- Running on Zero156🎤🔄
Seed Voice Conversion
- Running41⚡
Mini Omni
- Running4⚡
Monophonic MIDI Melody Harmonizer
Retrieval augmented harmonization of any MIDI melody
- Running10⚡
MIDI Melody
Add a unique melody to any MIDI file
- Running3🔥
MIDI Chords Mixer
Mix chords from one MIDI to another MIDI
- Sleeping3🏆
Morse To Audio
- Sleeping1🚀
RCV EASY GUI
- Sleeping1⚡
Advanced RVC Inference
- Sleeping2⚡
Lyricsgenius
Get Lyrics from Genius's Link
- Sleeping1👁
Groq Gradio Voice Assistant
- Running2🐠
Hex Separator
- Sleeping2🐠
Groq API Models
Groq API Playground
- Running16👁
GPT-SoVITS-V2-NIIMI SORA
- Paused2🎵
AI Tube Engine MusicGen
- Paused1🎵
AI Tube Engine MusicGen
- Paused1🎵
AI Tube Engine MusicGen
- Paused5🎵
AI Tube Engine MusicGen
- Build error17📚
GPT-SoVITS-V2-Gakuen Idolmaster
- Running on Zero8🌖
UTMOSv2
- Runtime error5⚡
Mini Omni
- Build error10👁
GPT-SoVITS-V2-misc_models
- Configuration error12📊
Bench.audio
LMSYS bench for audio agents
- Runtime error78🌟
Compressed Wav2Lip
- Running79👄
Gradio Lipsync Wav2lip
- Running on Zero6🐨
EchoMimic
- Running21🌍
Wav2lip Gpu
- Running1🏃
Matcha TTS Japanese
Description of Matcha TTS Japanese
- Running89💩
DeepFilterNet2
- Running on Zero12🇫🇷🥖
French Parler-TTS
High-fidelity Text-To-Speech
- Running on Zero255🟣
EzAudio
- Running on Zero13🔥
Kotoba Whisper Demo
- Running1🦀
Matcha Tts Onnx Benchmarks
Benchmark model load time and TTS time
- Runtime error7⚡
Mini Omni
- Running on Zero2🐠
AIChat-matcha-tts-onnx-en
Give your space a voice! (Demo)
- Running on Zero13🌍
GAMA
- Running on Zero4🏆
GAMA-IT
- Sleeping1🦀
Sbv2 Py
- Running on Zero216🎶
OpenMusic
- Running69🎙️
PodcastGen
Generate a 2-speaker podcast from text input or documents!
- Running3🐠
Mistral 7B Instruct v0.3 Matcha-TTS English
Enjoy TTS Chat
- Sleeping2💨
Moshi
- Running on Zero46🟣
EzAudio ControlNet
- Sleeping3🐟
Fish Audio API Demo
- Runtime error1🐠
Whisper En Tiny
- Running on Zero7🏃
Guided Rock Music Transformer
Controlled source augmented rock music transformer
- Running on Zero20🎷
Long-form MusicGen
Long-form Musicgen
- Running72💻
Multilingual TTS
- Running3🔥
AI岸田文雄メーカー
- Running1🔥
AI菅義偉メーカー
- Runtime error1😻
Audio Mouth
- Running387📚
Pdf2audio
- Running on CPU Upgrade571🏆
Open ASR Leaderboard
- Running on T41.01k🎙️
Open NotebookLM
Personalised Podcasts For All - Available in 13 Languages
- Running on Zero4🔥
Kotoba Whisper Bilingual Demo
- Running on T4404🗣️
MeloTTS
Fast, efficient, & multilingual text-to-speech
- Running on T4184🐤
Canary 1b
- Sleeping1😻
Style Bert VITS2 SW
- Runtime error21👁
Llama 3.2 3b Voice
- Runtime error1📚
Pdf2audio
- Running on Zero716🤯
Whisper Turbo
- Running on Zero278🤯
Realtime Whisper Turbo
Realtime implementation of Whisper large turbo
- Running138🚀
Whisper Large V3 Turbo WebGPU
ML-powered speech recognition directly in your browser
- Running on T4251🐢
Tortoise Tts
Expressive Text-to-Speech
- Running31💻
Russian Text To Speech
- Sleeping5📉
Yt-dlp Wav
- Running on T4278🎼
UnlimitedMusicGen
Unlimited audio generation with a few added features
- Runtime error84🎶
AudioCraft Plus v2.0.0a (MusicGen + AudioGen)
- Runtime error22🎼
MusicGen+ V1.2.7 (HuggingFace Version)
- Running on Zero60🏢
VoiceRestore
- Sleeping3⚡
Whisperturbo
whisper3 turbo
- Running34🎙️
GPT-SoVITS-3s-cloning-free-TTS
- Sleeping3🏺
おしゃべり石破茂メーカー(Style-Bert-VITS2)
- Sleeping1🏺
おしゃべり二階俊博メーカー
- Runtime error3🐠
Text To Meow
- Runtime error4🔥
Rvc Ui
- Running25🌍
Reverb ASR Demo
- Running1😻
Ilaria RVC Mod
- Running on T4301🚀
Resemble Enhance
- Running1💻
Openai Whisper Large V3 Turbo
- Running45💻
RVC PlayGround
- Running50🚀
Podcastfy.ai - An Open Source alternative to NotebookLM's podcast feature
- Running on Zero68🎞️🎺
Video to Music
Generate and apply matching background music to a video shot
- Running171👂🎞️
Video SoundFX
Generates a sound effect that matches a video shot
- Paused171👂
Image2SFX Comparison
Generates audio environment from an image
- Running on Zero175🍏
Applio
- Running on Zero1.52k🗣️
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running1💜
Heartbeat
- Running114🤗🏆
TTS Spaces Arena
Vote on the top HF TTS models!
- Running on CPU Upgrade64🧝♀️🧛♂️🧚♀️
xVASynth TTS
CPU powered, low RTF, emotional, multilingual TTS
- Running283🎶
— AI Jukebox —
Generate music powered by AI
- Running on L40S320🐠
TANGO
Co-Speech Gesture Video Generation
- Running on Zero14🥰🎤📝
Anime Whisper Demo
- Running on Zero59🏢
Ichigo Llama3.1 S Instruct
- Running6🚀
Whisper Japanese Phone Demo
Whisper model to transcribe Japanese audio to katakana.
- Running20🔥🚀
CoverGen
- Running on Zero100📈
ClearerVoice-Studio (Speech Enhancement, Separation and Extraction)
Better AI powered platform to purify your speech signal
- Running20♫🔒
Steganography
Text | Image | Audio | Video to Spectrogram || Steganography
- Running15🔥
AICoverGenMod
- Running11🚀
UVR5 UI
- Running on Zero16🗣️
Diva Realtime Chat
- Running on Zero2👁
Kotoba Whisper Diarization Demo
- Sleeping10📚
Synthio Stable Audio Open
Stable audio open model from Synthio paper.
- Sleeping1🚀
RYO EVC
- Runtime error1😻
UVR
- Running on Zero35🌒
Moonshine ASR
Fast & efficient ASR outperforming Whisper!
- Running21🔊
seewav-gui
- Running on Zero70🎵
RWKV Music
Generate MIDI music using RWKV v4!
- Running4💻
MP3 Transcribe
Whisper Transcribe MP3 files, use a GPU to convert faster!
- Running6🗣️0️⃣
StyleTTS 2 Zero
Efficient, fast, and natural text to speech with StyleTTS 2!
- Running on Zero244😻
MaskGCT TTS Demo
MaskGCT TTS Demo
- Running on Zero58🎵
MelodyFlow
- Running on Zero562🤫
Whisper Large V3
- Running on Zero6🚀
Ultimate Chords Progressions Transformer
Self-correcting multi-instrumental chords transformer
- Runtime error8🎶♫
Chords Progressions Transformer
Chords-conditioned music transformer
- Running on Zero25⚡
Fast Whisper Turbo
Ultra-fast Whisper Turbo inference ⚡
- Running on A10G290🔊
AudioLDM2 Text2Audio Text2Music Generation
- Running2🗣️👂
Hey Buddy!
In-Browser Audio Wake-Word Spotting
- Running3🎹
Streamlit Pianoroll
Streamlit pianoroll playback element
- Running7⚡
PolUVR
Audio-Separator by Politrees
- Running on Zero98🚀
Giant Music Transformer
Fast multi-instrumental music transformer
- Sleeping22🌖
Omni Mini (WebRTC)
- Running5🎹
Fortepyan Datasets
Streamlit browser for piano music datasets.
- Running4🎹
PIANO Dataset
Demo of masking tasks from the PIANO dataset
- Running on L40S131💬
Fish Agent
An end-to-end (e2e) Voice Language Model by Fish Audio.
- Running6🎵
Audio to Stems to MIDI Converter
- Running24🌍
Podcast Generation
Generate podcasts with AI avatars
- Running🐠
ChatTTS OpenVoice
- Running1📚
OpenVoice
- Running on Zero5🗣️
F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- Running314📊
Bark with Voice Cloning
- Running27📉
OuteTTS 0.1 350M Demo
- Running on Zero8🎼🎶
Midi Music Generator
- Running4🎵
Audio Lyrics Extractor
- Running10🤔
Did StyleTTS 2 Generate It?
Did StyleTTS 2 generate that audio?!?
- Paused35🌍
Hertz Dev
base model for mono-channel completion
- Running on Zero7⚡
Xtts
- Running on Zero221💬
ChatTTS Forge
- Running on Zero65❤️
Kokoro TTS
Now in 5 languages!
- Running6🌖
Pipertts
- Running49🎧
Nexa Omni Demo
- Running on Zero6😻
MaskGCT TTS Demo
MaskGCT TTS Demo
- Sleeping20📚
Video2music
- Sleeping784🔊
Audioldm Text To Audio Generation
- Running2🦀
So VITS SVC
- Running2👀
GPT SoVITS
- Running on Zero243🗣️
Spanish F5
Spanish finetune for the original F5 model.
- Sleeping1🎤⚡🎤
Dolce SVC
- Running2🎤🦊
Dolce TTS
- Running1⚡
Lipsync
- Sleeping5☕🐰🎤
Chino TTS
- Sleeping2🐨
Style Bert VITS2 NO
- Sleeping1📉
Style Bert VITS2 SU
I made an AI voice synthesis model of Shalltear.
- Running1🔥
Style Bert VITS2 MHY
I made an AI voice synthesis model of Ranma Saotome (female).
- Sleeping1🚀
Style Bert VITS2 SAR
I made an AI voice synthesis model of Beatrice.
- Running on L433⚡
Talk To Ultravox
Talk to Fixie.ai's Ultravox with WebRTC ⚡️
- Running2🏃
SoundOfWater
Estimate physical properties merely from pouring sound!
- Running9🐢
Llama Code Editor
Create interactive HTML web pages with your voice
- Running on CPU Upgrade26🐨
sutra-avatar-v2
- Running1🌍
Audio Transcriber
Record audio, then use AI to transcribe and translate it.
- Running on Zero13🖌️🎶
Inpaint Music Transformer
Large and fast music transformer for pitches inpainting
- Running45🐠
OuteTTS 0.2 500M Demo
- Running9🌖
Tsukasa 司 Speech
- Running7🎵
MusicGen Continuation
- Running4🚀
Semanticodec Ultra Low Bitrate Audio Codec
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a
- Running15📚
Audiosr Versatile Audio Super Resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR
- Running on Zero1🐠
OuteTTS 0.2 500M Demo GPU
- Running2💬
ChatTTS Forge English interface
TTS tool
- Running1📚
Style Bert VITS2 RU2
I made an AI voice synthesis model of Mayu Nekoyashiki.
- Running10🥰🎤🤔
Galgame Voice Finder
- Running1👁
Vad Go
- Running on Zero124👀
Indic Parler-TTS
A demo of Indic Parler-TTS
- Running1🐳
Voice Activity Detection
- Running5👀
Vikhr 4o
- Running1⚡
Audio Arena
audio-arena
- Running18🏢
Wespeaker Demo
- Running4💻
Wesep Tse 2speaker Demo
Target Speaker Extraction with WeSep
- Running13🐢
Wenet Demo
- Running4🏆
Open_ASR_Leaderboard
- Running26🗣️
Text-to-Speech WebGPU
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
- Running10📈
SpeechScore (Speech Quality Metrics and Evaluation)
A home for scoring speech quality
- Running2🐠
Fish Speech Benchmark
Unofficial benchmark by Fish Speech
- Running on Zero6👅🎙️🥰
Chupa Generator
- Running on Zero5🌖
Japanese Parler-TTS Mini Demo
- Running on Zero4🏢
Japanese Parler-TTS Large Demo
- Running3⚡
Make Anime Emotion Dataset
- Running6😊😱😠
Anime Speech Emotion Recognition
- Running on Zero246🔊
MMAudio — generating synchronized audio from video/text
- Running on Zero26🗣️
Voice Clone
- Running on Zero116🐠
Sound AI SFX
Text to Audio (Sound SFX) Generator
- Running5👁
Talk To Moshi
Talk to Kyutai's moshi - powered by Gradio WebRTC!
- Running on T4371⚡
HierSpeech++ (Zero-shot TTS)
- Running10🌍
Talk To Gradio Docs Rag
Talk to the Gradio docs! Powered by Pydantic and WebRTC ⚡️
- Running4📊
Melody Workshop
"One-minute creation by AI Coding Autonomous Agent MOUSE-I"
- Running on Zero7📉
Text2midi
- Running on Zero62🔊
Audio Llama
Generate sound from video/text and search
- Running2🐢
VM Sound Classification
- Running2🪷
Lotus
- Running97🌙
Moonshine Web
Real-time in-browser speech recognition
- Running8💻
Openai Realtime Voice
Talk with OpenAI's new Realtime Voice API
- Running on Zero7🏆
Fast GeCo
- Running on Zero4📉
SoloAudio
- Running21🎶
Music Genre Classifier
- Running2🪕🎵
Guzheng Playing Tech
GZ_IsoTech
- Running2🪕🎶
Chinese Instruments
CTIS
- Running2🪕🎼
Pentatonic Mode
CNPM
- Running on T416🚀
Kotoba-Speech Demo
- Running1🐨
Audio Edit
- Running on Zero1🔊
MMAudio
Video to Audio