czczup commited on
Commit
06a956a
·
verified ·
1 Parent(s): 09e1621

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -12
README.md CHANGED
@@ -11,10 +11,6 @@ language:
11
  - multilingual
12
  tags:
13
  - internvl
14
- - vision
15
- - ocr
16
- - multi-image
17
- - video
18
  - custom_code
19
  ---
20
 
@@ -103,8 +99,6 @@ The hyperparameters used for fine-tuning are listed in the following table.
103
 
104
  We provide an example code to run InternVL-Chat-V1-2 using `transformers`.
105
 
106
- We also welcome you to experience the InternVL2 series models in our [online demo](https://internvl.opengvlab.com/).
107
-
108
  > Please use transformers>=4.37.2 to ensure the model works normally.
109
 
110
  ### Model Loading
@@ -504,16 +498,16 @@ If you find this project useful in your research, please consider citing:
504
  journal={arXiv preprint arXiv:2410.16261},
505
  year={2024}
506
  }
507
- @article{chen2023internvl,
508
- title={InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks},
509
- author={Chen, Zhe and Wu, Jiannan and Wang, Wenhai and Su, Weijie and Chen, Guo and Xing, Sen and Zhong, Muyan and Zhang, Qinglong and Zhu, Xizhou and Lu, Lewei and Li, Bin and Luo, Ping and Lu, Tong and Qiao, Yu and Dai, Jifeng},
510
- journal={arXiv preprint arXiv:2312.14238},
511
- year={2023}
512
- }
513
  @article{chen2024far,
514
  title={How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites},
515
  author={Chen, Zhe and Wang, Weiyun and Tian, Hao and Ye, Shenglong and Gao, Zhangwei and Cui, Erfei and Tong, Wenwen and Hu, Kongzhi and Luo, Jiapeng and Ma, Zheng and others},
516
  journal={arXiv preprint arXiv:2404.16821},
517
  year={2024}
518
  }
 
 
 
 
 
 
519
  ```
 
11
  - multilingual
12
  tags:
13
  - internvl
 
 
 
 
14
  - custom_code
15
  ---
16
 
 
99
 
100
  We provide an example code to run InternVL-Chat-V1-2 using `transformers`.
101
 
 
 
102
  > Please use transformers>=4.37.2 to ensure the model works normally.
103
 
104
  ### Model Loading
 
498
  journal={arXiv preprint arXiv:2410.16261},
499
  year={2024}
500
  }
 
 
 
 
 
 
501
  @article{chen2024far,
502
  title={How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites},
503
  author={Chen, Zhe and Wang, Weiyun and Tian, Hao and Ye, Shenglong and Gao, Zhangwei and Cui, Erfei and Tong, Wenwen and Hu, Kongzhi and Luo, Jiapeng and Ma, Zheng and others},
504
  journal={arXiv preprint arXiv:2404.16821},
505
  year={2024}
506
  }
507
+ @article{chen2023internvl,
508
+ title={InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks},
509
+ author={Chen, Zhe and Wu, Jiannan and Wang, Wenhai and Su, Weijie and Chen, Guo and Xing, Sen and Zhong, Muyan and Zhang, Qinglong and Zhu, Xizhou and Lu, Lewei and Li, Bin and Luo, Ping and Lu, Tong and Qiao, Yu and Dai, Jifeng},
510
+ journal={arXiv preprint arXiv:2312.14238},
511
+ year={2023}
512
+ }
513
  ```