VISTA Collection Video Augmentation for Synthetic Video Instruction-following Data Generation • 6 items • Updated 23 days ago
VISTA Collection Video Augmentation for Synthetic Video Instruction-following Data Generation • 6 items • Updated 23 days ago
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale Paper • 2412.05237 • Published Dec 6, 2024 • 47
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation Paper • 2412.01316 • Published Dec 2, 2024 • 8
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Paper • 2412.00927 • Published Dec 1, 2024 • 26
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Paper • 2412.00927 • Published Dec 1, 2024 • 26
VISTA Collection Video Augmentation for Synthetic Video Instruction-following Data Generation • 6 items • Updated 23 days ago
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation Paper • 2412.00927 • Published Dec 1, 2024 • 26 • 2
VISTA Collection Video Augmentation for Synthetic Video Instruction-following Data Generation • 6 items • Updated 23 days ago