Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation Paper โข 2501.03225 โข Published 4 days ago โข 6 โข 2