PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment Paper • 2410.13785 • Published Oct 17 • 18
Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena Paper • 2310.05746 • Published Oct 9, 2023
Distilling Script Knowledge from Large Language Models for Constrained Language Planning Paper • 2305.05252 • Published May 9, 2023
Adaptive Chameleon or Stubborn Sloth: Unraveling the Behavior of Large Language Models in Knowledge Clashes Paper • 2305.13300 • Published May 22, 2023 • 2
LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification Paper • 2012.13577 • Published Dec 25, 2020
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction Paper • 2401.06201 • Published Jan 11 • 2
E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning Paper • 2203.08480 • Published Mar 16, 2022
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models? Paper • 2404.03302 • Published Apr 4 • 2
Character is Destiny: Can Large Language Models Simulate Persona-Driven Decisions in Role-Playing? Paper • 2404.12138 • Published Apr 18
From Persona to Personalization: A Survey on Role-Playing Language Agents Paper • 2404.18231 • Published Apr 28
EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms Paper • 2406.14228 • Published Jun 20 • 1
SEGMENT+: Long Text Processing with Short-Context Language Models Paper • 2410.06519 • Published Oct 9
QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search Paper • 2306.06707 • Published Jun 11, 2023
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23 • 68
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs Paper • 2407.00653 • Published Jun 30 • 11
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs Paper • 2407.00653 • Published Jun 30 • 11