Boximator: Generating Rich and Controllable Motions for Video Synthesis Paper • 2402.01566 • Published Feb 2, 2024 • 27
Tarsier: Recipes for Training and Evaluating Large Video Description Models Paper • 2407.00634 • Published Jun 30, 2024
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Paper • 2501.07888 • Published 12 days ago • 13