Improving Sequence-to-Sequence Learning via Optimal Transport Paper • 1901.06283 • Published Jan 18, 2019
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models Paper • 2310.15140 • Published Oct 23, 2023 • 1
LAFITE: Towards Language-Free Training for Text-to-Image Generation Paper • 2111.13792 • Published Nov 27, 2021
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation Paper • 2404.12386 • Published Apr 18, 2024
LLaVA-Read: Enhancing Reading Ability of Multimodal Language Models Paper • 2407.19185 • Published Jul 27, 2024 • 1
ARTIST: Improving the Generation of Text-rich Images by Disentanglement Paper • 2406.12044 • Published Jun 17, 2024
Automatic Pseudo-Harmful Prompt Generation for Evaluating False Refusals in Large Language Models Paper • 2409.00598 • Published Sep 1, 2024
TextLap: Customizing Language Models for Text-to-Layout Planning Paper • 2410.12844 • Published Oct 9, 2024
Taipan: Efficient and Expressive State Space Language Models with Selective Attention Paper • 2410.18572 • Published Oct 24, 2024 • 16
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use Paper • 2410.16400 • Published Oct 21, 2024
LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding Paper • 2411.01106 • Published Nov 2, 2024 • 4
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation Paper • 2406.09305 • Published Jun 13, 2024 • 4
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding Paper • 2306.04933 • Published Jun 8, 2023 • 1
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding Paper • 2306.17107 • Published Jun 29, 2023 • 11