cyyang822 committed on
Commit
3b68028
·
1 Parent(s): f764292

update README.md

Files changed (1)
  1. README.md +15 -0
README.md CHANGED
@@ -16,7 +16,9 @@ tags:
 
 # PVC-InternVL2-8B
 
+[\[📜 Paper\]](https://arxiv.org/abs/2412.09613)
 [\[📂 GitHub\]](https://github.com/OpenGVLab/PVC)
+[\[🚀 Quick Start\]](#quick-start)
 
 ## Introduction
 
@@ -227,6 +229,19 @@ response = model.chat(tokenizer, pixel_values, question, generation_config, data
 print(f'User: {question}\nAssistant: {response}')
 ```
 
+## 🖊️ Citation
+
+If you find this work helpful in your research, please consider citing:
+
+```bibtex
+@article{yang2024pvc,
+title={PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models},
+author={Yang, Chenyu and Dong, Xuan and Zhu, Xizhou and Su, Weijie and Wang, Jiahao and Tian, Hao and Chen, Zhe and Wang, Wenhai and Lu, Lewei and Dai, Jifeng},
+journal={arXiv preprint arXiv:2412.09613},
+year={2024}
+}
+```
+
 ## License
 
 This project is released under the MIT license. Parts of this project contain code and models from other sources, which are subject to their respective licenses.