Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows Paper • 2604.20200 • Published Apr 22 • 5
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation Paper • 2604.21375 • Published Apr 23 • 19
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 3 days ago • 20