Submitted by alexchen4ai 47 Octo-planner: On-device Language Model for Planner-Action Agents · 4 authors 5
Submitted by BestWishYsh 40 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation · 10 authors 3
Submitted by zwcolin 25 CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs · 13 authors 2
Submitted by kamanphoebe 15 A Closer Look into Mixture-of-Experts in Large Language Models · 5 authors 2
Submitted by haoningwu 12 MatchTime: Towards Automatic Soccer Game Commentary Generation · 5 authors 4
Submitted by yuchenlin 12 WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs · 8 authors 1
Submitted by jiho283 11 EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records · 9 authors 7
Submitted by Zhiqiang007 10 Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models · 8 authors 1
Submitted by roeiherz 8 Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning · 6 authors 1
Submitted by liweijiang 8 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models · 11 authors 1
Submitted by lastweek 5 MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool · 11 authors 1