ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition • Paper • 2503.21248 • Published 10 days ago • 19
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation • Paper • 2503.21729 • Published 9 days ago • 26
Large Language Model Agent: A Survey on Methodology, Applications and Challenges • Paper • 2503.21460 • Published 9 days ago • 68
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow • Paper • 2503.18968 • Published 15 days ago • 6
Open Deep Search: Democratizing Search with Open-source Reasoning Agents • Paper • 2503.20201 • Published 11 days ago • 41
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning • Paper • 2503.09516 • Published 24 days ago • 27
LocAgent: Graph-Guided LLM Agents for Code Localization • Paper • 2503.09089 • Published 25 days ago • 9
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol • Paper • 2503.05860 • Published 29 days ago • 9
Gemini Embedding: Generalizable Embeddings from Gemini • Paper • 2503.07891 • Published 26 days ago • 34
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice • Paper • 2503.05978 • Published 29 days ago • 34
Quantizing Large Language Models for Code Generation: A Differentiated Replication • Paper • 2503.07103 • Published 26 days ago • 7
New Trends for Modern Machine Translation with Large Reasoning Models • Paper • 2503.10351 • Published 23 days ago • 22
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • Paper • 2503.10460 • Published 23 days ago • 27
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially? • Paper • 2503.12349 • Published 21 days ago • 41
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey • Paper • 2503.12605 • Published 20 days ago • 31
Learning to Inference Adaptively for Multimodal Large Language Models • Paper • 2503.10905 • Published 23 days ago • 4