Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1, 2024 • 72
Article Train 400x faster Static Embedding Models with Sentence Transformers 21 days ago • 132
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 136
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 644
Proactive Detection of Voice Cloning with Localized Watermarking Paper • 2401.17264 • Published Jan 30, 2024 • 18
Masked Audio Generation using a Single Non-Autoregressive Transformer Paper • 2401.04577 • Published Jan 9, 2024 • 43
Pheme: Efficient and Conversational Speech Generation Paper • 2401.02839 • Published Jan 5, 2024 • 18
CoMoSVC: Consistency Model-based Singing Voice Conversion Paper • 2401.01792 • Published Jan 3, 2024 • 11
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 259
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 54