pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 12M • 1.68k
tiantiaf/whisper-large-v3-voice-quality Audio Classification • 2B • Updated Aug 10, 2025 • 3.46k • 5
🤯 🤯 Released a high-quality fine-tuned LLM-based TTS model that can generate realistic and clear 48 kHz audio at over 100x real-time speed! 🤯 🤯
GitHub link: https://github.com/ysharma3501/MiraTTS
Model link: https://github.com/ysharma3501/MiraTTS
Blog explaining LLM TTS models: https://huggingface.co/blog/YatharthS/llm-tts-models
ViTPose Transformers ⚡ • Running on Zero • MCP • Featured • 222 • Detect and visualize human poses in images and videos