Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs Paper • 2504.04715 • Published 8 days ago • 12
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper • 2504.01943 • Published 12 days ago • 11
Efficient Model Selection for Time Series Forecasting via LLMs Paper • 2504.02119 • Published 12 days ago • 16
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 14 days ago • 238
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 8 days ago • 28
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition Paper • 2503.21248 • Published 19 days ago • 19
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Paper • 2503.21729 • Published 18 days ago • 27
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published 19 days ago • 73
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow Paper • 2503.18968 • Published 24 days ago • 6
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 20 days ago • 43
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12 • 27
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol Paper • 2503.05860 • Published Mar 7 • 9
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published Mar 7 • 35