ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads? • Paper • 2602.19594 • Published Feb 23
Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts • Paper • 2601.03315 • Published Jan 6