Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper โข 2503.03983 โข Published Mar 6 โข 22
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper โข 2503.03983 โข Published Mar 6 โข 22
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data Paper โข 2410.02056 โข Published Oct 2, 2024 โข 6
Do Audio-Language Models Understand Linguistic Variations? Paper โข 2410.16505 โข Published Oct 21, 2024 โข 1
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper โข 2410.19168 โข Published Oct 24, 2024 โข 20
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper โข 2410.19168 โข Published Oct 24, 2024 โข 20
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation Paper โข 2410.13198 โข Published Oct 17, 2024 โข 10
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds Paper โข 2409.09213 โข Published Sep 13, 2024 โข 13
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds Paper โข 2409.09213 โข Published Sep 13, 2024 โข 13
ASPIRE: Language-Guided Augmentation for Robust Image Classification Paper โข 2308.10103 โข Published Aug 19, 2023
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models Paper โข 2310.08753 โข Published Oct 12, 2023