VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models Paper • 2412.01822 • Published Dec 2, 2024 • 14
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark Paper • 2305.10615 • Published May 18, 2023 • 1
Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning Paper • 2309.15317 • Published Sep 26, 2023
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data Paper • 2309.13876 • Published Sep 25, 2023 • 1
Improving Massively Multilingual ASR With Auxiliary CTC Objectives Paper • 2302.12829 • Published Feb 24, 2023
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024 • 13
YODAS: Youtube-Oriented Dataset for Audio and Speech Paper • 2406.00899 • Published Jun 2, 2024 • 2
Measuring Taiwanese Mandarin Language Understanding Paper • 2403.20180 • Published Mar 29, 2024 • 5
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model Paper • 2311.17487 • Published Nov 29, 2023 • 2
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models Paper • 2309.15701 • Published Sep 27, 2023 • 2