IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published 8 days ago • 83
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 22 days ago • 101
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 25 days ago • 165
DCAgent2/terminal_bench_2_Qwen3_30B_A3B_Instruct_2507_20260424_001236 Viewer • Updated Apr 24 • 259 • 60 • 1
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51