[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Peng Jin
Chat-UniVi
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video
Understanding
liked
a model
about 1 month ago
Chat-UniVi/Chat-UniVi-7B-v1.5
Organizations
None yet
Collections
1
models
12
Chat-UniVi/MoH-LLaMA3-8B
Text Generation
•
Updated
•
15
•
3
Chat-UniVi/Chat-UniVi-13B
Video-Text-to-Text
•
Updated
•
675
•
9
Chat-UniVi/Chat-UniVi-7B-v1.5
Video-Text-to-Text
•
Updated
•
59
•
2
Chat-UniVi/MoE-Plus-Plus-7B
Text Generation
•
Updated
•
14
•
4
Chat-UniVi/Chat-UniVi
Video-Text-to-Text
•
Updated
•
69.4k
•
13
Chat-UniVi/MoH-ViT-S-75
Updated
Chat-UniVi/MoH-ViT-S-80
Updated
Chat-UniVi/MoH-ViT-B-50
Updated
Chat-UniVi/MoH-ViT-B-75
Updated
Chat-UniVi/MoH-DiT-XL-90
Updated
•
3