NoLiMa: Long-Context Evaluation Beyond Literal Matching Paper • 2502.05167 • Published about 1 month ago • 15
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3