MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published 7 days ago • 5
Vamos: Versatile Action Models for Video Understanding Paper • 2311.13627 • Published Nov 22, 2023 • 2
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? Paper • 2307.16368 • Published Jul 31, 2023 • 11