Mind the Gap! Static and Interactive Evaluations of Large Audio Models Paper • 2502.15919 • Published 20 days ago • 3
EgoNormia: Benchmarking Physical Social Norm Understanding Paper • 2502.20490 • Published 14 days ago • 5
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published 17 days ago • 10
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published 17 days ago • 10
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published 17 days ago • 10
HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions Paper • 2409.16427 • Published Sep 24, 2024
What Are Tools Anyway? A Survey from the Language Model Perspective Paper • 2403.15452 • Published Mar 18, 2024
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023
ReCode: Robustness Evaluation of Code Generation Models Paper • 2212.10264 • Published Dec 20, 2022 • 1
Equipping Transformer with Random-Access Reading for Long-Context Understanding Paper • 2405.13216 • Published May 21, 2024 • 1
Efficient Shapley Values Estimation by Amortization for Text Classification Paper • 2305.19998 • Published May 31, 2023
Word-level Textual Adversarial Attacking as Combinatorial Optimization Paper • 1910.12196 • Published Oct 27, 2019
Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study Paper • 2106.03826 • Published Jun 7, 2021
Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains Paper • 2106.02792 • Published Jun 5, 2021
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024 • 55
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents Paper • 2403.08715 • Published Mar 13, 2024 • 21
WebArena: A Realistic Web Environment for Building Autonomous Agents Paper • 2307.13854 • Published Jul 25, 2023 • 25
Don't Copy the Teacher: Data and Model Challenges in Embodied Dialogue Paper • 2210.04443 • Published Oct 10, 2022
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements Paper • 2306.01985 • Published Jun 3, 2023 • 1