license: llama2
# Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

**Paper or resources for more information:**
[[Paper](https://huggingface.co/papers/2311.08046)] [[Code](https://github.com/PKU-YuanGroup/Chat-UniVi)]