Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper • 2501.12895 • Published 5 days ago • 47
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15, 2024 • 12 • 4
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 250 items • Updated 2 days ago • 43
Mirror Collection Mirror: A Universal Framework for Various Information Extraction Tasks https://arxiv.org/abs/2311.05419 • 5 items • Updated Oct 11, 2024 • 1