Dataset Condensation via Efficient Synthetic-Data Parameterization Paper • 2205.14959 • Published May 30, 2022
Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data Paper • 2301.12321 • Published Jan 29, 2023 • 1
Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup Paper • 2009.06962 • Published Sep 15, 2020
Large-Scale Targeted Cause Discovery with Data-Driven Learning Paper • 2408.16218 • Published Aug 29, 2024 • 1
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published Feb 27 • 28
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper • 2406.05967 • Published Jun 10, 2024 • 6
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12
ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding Paper • 2411.05049 • Published Nov 7, 2024 • 3
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published Jan 24 • 34
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published Jan 21 • 66
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published Jan 13 • 56
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 99
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published Dec 16, 2024 • 58