deepseek-ai
/

deepseek-vl2

Image-Text-to-Text

Inference Endpoints

Model card Files Files and versions Community

XCLiu commited on 10 days ago

Commit

e6adb2b

•

1 Parent(s): 583bcba

Update README.md

Files changed (1) hide show

README.md +12 -3

README.md CHANGED Viewed

@@ -37,6 +37,11 @@ On the basis of `Python >= 3.8` environment, install the necessary dependencies
 pip install -e .
 ```
 ### Simple Inference Example
 ```python
@@ -121,10 +126,14 @@ This code repository is licensed under [MIT License](./LICENSE-CODE). The use of
 ## 5. Citation
 ```
-@misc{wu2024deepseekvl2,
-      title={DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding},
-      author={Wu, Zhiyu and Chen, Xiaokang and Pan, Zizheng and Liu, Xingchao and Liu, Wen and Dai, Damai and Gao, Huazuo and Ma, Yiyang and Wu, Chengyue and Wang, Bingxuan and Xie, Zhenda and Wu, Yu and Hu, Kai and Wang, Jiawei and Sun, Yaofeng and Li, Yukun and Piao, Yishi and Guan, Kang and Liu, Aixin and Xie, Xin and You, Yuxiang and Dong, Kai and Yu, Xingkai and Zhang, Haowei and Zhao, Liang and Wang, Yisong and Ruan, Chong},
       year={2024},
 }
 ```

 pip install -e .
 ```
+### Notifications
+1. We suggest to use a temperature T <= 0.7 when sampling. We observe a larger temperature decreases the generation quality.
+2. To keep the number of tokens managable in the context window, we apply dynamic tiling strategy to <=2 images. When there are >=3 images, we directly pad the images to 384*384 as inputs without tiling.
+3. The main difference between DeepSeek-VL2-Tiny, DeepSeek-VL2-Small and DeepSeek-VL2 is the base LLM.
 ### Simple Inference Example
 ```python
 ## 5. Citation
 ```
+@misc{wu2024deepseekvl2mixtureofexpertsvisionlanguagemodels,
+      title={DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding},
+      author={Zhiyu Wu and Xiaokang Chen and Zizheng Pan and Xingchao Liu and Wen Liu and Damai Dai and Huazuo Gao and Yiyang Ma and Chengyue Wu and Bingxuan Wang and Zhenda Xie and Yu Wu and Kai Hu and Jiawei Wang and Yaofeng Sun and Yukun Li and Yishi Piao and Kang Guan and Aixin Liu and Xin Xie and Yuxiang You and Kai Dong and Xingkai Yu and Haowei Zhang and Liang Zhao and Yisong Wang and Chong Ruan},
       year={2024},
+      eprint={2412.10302},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2412.10302},
 }
 ```