- ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
  Paper • 2502.09696 • Published • 43
- MM-RLHF: The Next Step Forward in Multimodal LLM Alignment
  Paper • 2502.10391 • Published • 34
- Autellix: An Efficient Serving Engine for LLM Agents as General Programs
  Paper • 2502.13965 • Published • 18
- SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
  Paper • 2502.14739 • Published • 100
Sangyeon Cho
josang1204
AI & ML interests
None yet
Recent Activity
Updated a collection 20 days ago: llm
Liked a dataset 20 days ago: li-lab/MMLU-ProX
Updated a dataset 21 days ago: josang1204/preference-tuning-dataset