Scaling Transformers for Low-Bitrate High-Quality Speech Coding Paper • 2411.19842 • Published 26 days ago • 10
Efficient Audio Captioning with Encoder-Level Knowledge Distillation Paper • 2407.14329 • Published Jul 19 • 4
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound Paper • 2405.00233 • Published Apr 30 • 13
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18 • 38
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining Paper • 2308.05734 • Published Aug 10, 2023 • 37
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies Paper • 2308.01546 • Published Aug 3, 2023 • 17
WavJourney: Compositional Audio Creation with Large Language Models Paper • 2307.14335 • Published Jul 26, 2023 • 43
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models Paper • 2301.12503 • Published Jan 29, 2023