Running
1
Post ASR N Best Text Speech Emotion Recognition
🥰
N-Best Text only for Speech Emotion Recognition
LLM for Text based Speech Processing
GenSEC: Text-based Generative Audio & Speech Recognition with Cascaded ASR-LLMs
Open Source Model
IEEE SLT 2024, References Paper. See below resources for baseline models and datasets.
@inproceedings{yang2024large,
title={Large language model based generative error correction: A challenge and baselines for speech recognition, speaker tagging, and emotion recognition},
author={Yang, Chao-Han Huck and Park, Taejin and Gong, Yuan and Li, Yuanchao and Chen, Zhehuai and Lin, Yen-Ting and Chen, Chen and Hu, Yuchen and Dhawan, Kunal and {\.Z}elasko, Piotr and others},
booktitle={2024 IEEE Spoken Language Technology Workshop (SLT)},
pages={371--378},
year={2024},
organization={IEEE}
}
N-Best Text only for Speech Emotion Recognition
Generative Error Correction (GER) Task Baseline, WER
Submit evaluations for speaker tagging and view leaderboard