CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models Paper • 2405.13684 • Published May 22
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models Paper • 2409.10999 • Published Sep 17
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language Models Paper • 2412.13702 • Published 8 days ago
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation Paper • 2412.03304 • Published 22 days ago • 17
An Efficient Self-Supervised Cross-View Training For Sentence Embedding Paper • 2311.03228 • Published Nov 6, 2023 • 1
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper • 2406.10118 • Published Jun 14 • 30
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines Paper • 2410.12705 • Published Oct 16 • 29
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models Paper • 2307.07889 • Published Jul 15, 2023 • 1
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models Paper • 2303.08896 • Published Mar 15, 2023 • 4
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization Paper • 2301.12307 • Published Jan 28, 2023 • 3