Macaron-A2UI: A Model for Generative UI in Personal Agents Paper • 2605.24830 • Published 12 days ago • 80
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 14 days ago • 221
OpenEnv India Hackathon top 100 Collection Top 100 Space submissions from the OpenEnv India Hackathon. • 98 items • Updated 23 days ago • 7
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 48
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 906
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper • 2603.25804 • Published Mar 26 • 30
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published Mar 25 • 98
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published Mar 20 • 37
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use Paper • 2603.08262 • Published Mar 9 • 42
view article Article Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline nvidia • Mar 13 • 40
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 11 items • Updated 1 day ago • 93
view article Article Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation nvidia • Mar 13 • 18