microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 16 hours ago • 1.01M • 1.28k
SEA-VL: Multicultural VL Dataset for Southeast Asia Collection Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia • 3 items • Updated 28 days ago • 16
CRAFT: Extracting and Tuning Cultural Instructions from the Wild Paper • 2405.03138 • Published May 6, 2024 • 1
CRAFT: Extracting and Tuning Cultural Instructions from the Wild Paper • 2405.03138 • Published May 6, 2024 • 1
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs Paper • 2412.11699 • Published Dec 16, 2024 • 1
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs Paper • 2412.11699 • Published Dec 16, 2024 • 1
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Paper • 2501.01034 • Published Jan 2
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 30 days ago • 97
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 30 days ago • 97