Negative Token Merging: Image-based Adversarial Feature Guidance Paper β’ 2412.01339 β’ Published 22 days ago β’ 21
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published 20 days ago β’ 118
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper β’ 2412.00174 β’ Published 25 days ago β’ 22
Open-Sora Plan: Open-Source Large Video Generation Model Paper β’ 2412.00131 β’ Published 26 days ago β’ 32