MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers Paper • 2307.02321 • Published Jul 5, 2023
Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study Paper • 2304.10909 • Published Apr 21, 2023
Benchmarking Generative Latent Variable Models for Speech Paper • 2202.12707 • Published Feb 22, 2022
Do We Still Need Automatic Speech Recognition for Spoken Language Understanding? Paper • 2111.14842 • Published Nov 29, 2021
Do End-to-End Speech Recognition Models Care About Context? Paper • 2102.09928 • Published Feb 17, 2021
On Scaling Contrastive Representations for Low-Resource Speech Recognition Paper • 2102.00850 • Published Feb 1, 2021
MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech Paper • 2005.00812 • Published May 2, 2020