update Citation
README.md
CHANGED
@@ -11,7 +11,7 @@ datasets:
 
 ## Introduction
 
-The Imp project aims to provide a family of
+The Imp project aims to provide a family of highly capable yet lightweight LMMs. Our `Imp-v1.5-2B-Qwen1.5` is a strong MSLM with only **2B** parameters, which is built upon a small yet powerful SLM [Qwen1.5-1.8B-Chat](https://huggingface.co/Qwen/Qwen1.5-1.8B-Chat) (1.8B) and a powerful visual encoder [SigLIP](https://huggingface.co/google/siglip-so400m-patch14-384) (0.4B), and trained on a 1M mixed dataset.
 
 As shown in the Table below, `Imp-v1.5-2B-Qwen1.5` significantly outperforms the counterparts of similar model sizes.
 
@@ -27,7 +27,7 @@ pip install transformers # latest version is ok, but we recommend v4.36.0
 pip install -q pillow accelerate einops
 ```
 
-You can use the following code for model inference. The format of text instruction is similar to [LLaVA](https://github.com/haotian-liu/LLaVA).
+You can use the following code for model inference. The format of text instruction is similar to [LLaVA](https://github.com/haotian-liu/LLaVA). Note that the example can currently only be run on GPUs.
 ```Python
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -78,7 +78,7 @@ If you use our model or refer our work in your studies, please cite:
 ```bibtex
 @article{imp2024,
 title={Imp: Highly Capable Large Multimodal Models for Mobile Devices},
-author={Shao, Zhenwei and Yu, Zhou and Yu, Jun and Ouyang, Xuecheng and
+author={Shao, Zhenwei and Yu, Zhou and Yu, Jun and Ouyang, Xuecheng and Zheng, Lihao and Gai, Zhenbiao and Wang, Mingyang and Ding, Jiajun},
 journal={arXiv preprint arXiv:2405.12107},
 year={2024}
 }