AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 267 items • Updated about 15 hours ago • 34
The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input Paper • 2501.03200 • Published 7 days ago • 1
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper • 2501.04575 • Published 5 days ago • 21
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 68
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 6 days ago • 56
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 7 days ago • 46
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Paper • 2501.01957 • Published 10 days ago • 35
MLLM-as-a-Judge for Image Safety without Human Labeling Paper • 2501.00192 • Published 13 days ago • 23
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 11 days ago • 46
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published 14 days ago • 20