Audio-AGI

community

https://github.com/Audio-AGI

Audio-AGI

Activity Feed Request to join this org

AI & ML interests

Audio x AI

Recent Activity

Xubo-Liu authored a paper 22 days ago

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

haoheliu authored a paper 5 months ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

haoheliu authored a paper 8 months ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

View all activity

Audio-AGI's activity

Xubo-Liu

authored a paper 22 days ago

Scaling Transformers for Low-Bitrate High-Quality Speech Coding

Paper • 2411.19842 • Published 26 days ago • 10

haoheliu

authored a paper 5 months ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19 • 4

haoheliu

authored 2 papers 8 months ago

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30 • 13

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23 • 29

zzk1st

authored a paper 8 months ago

Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

Paper • 2404.12387 • Published Apr 18 • 38

Xubo-Liu

updated a Space about 1 year ago

Runtime error

221

🐠

AudioSep

Xubo-Liu

authored a paper over 1 year ago

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 6

haoheliu

authored 2 papers over 1 year ago

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 6

AudioSR: Versatile Audio Super-resolution at Scale

Paper • 2309.07314 • Published Sep 13, 2023 • 25

Xubo-Liu

updated 2 Spaces over 1 year ago

Sleeping

189

🔥

WavJourney

Running

😻

README

qiuqiangkong

authored a paper over 1 year ago

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

Xubo-Liu

authored a paper over 1 year ago

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

haoheliu

authored 2 papers over 1 year ago

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Paper • 2308.01546 • Published Aug 3, 2023 • 17

Xubo-Liu

authored a paper over 1 year ago

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 43

qiuqiangkong

authored a paper over 1 year ago

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 43

JinhuaL1ANG

authored a paper over 1 year ago

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 43

haoheliu

authored 2 papers over 1 year ago

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 43

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Paper • 2301.12503 • Published Jan 29, 2023

AI & ML interests

Recent Activity

Team members 10

Audio-AGI's activity

AudioSep

WavJourney

README