Update README.md
Browse files
README.md
CHANGED
@@ -104,7 +104,12 @@ print(scores)
|
|
104 |
|
105 |
# Limitations
|
106 |
|
107 |
-
Currently, please ensure that image sizes within the same knowledge base be similar. High variance of image size may cause the model performance degrade. We will augment data and fix this issue in our future version.
|
|
|
|
|
|
|
|
|
|
|
108 |
|
109 |
# Citation
|
110 |
|
|
|
104 |
|
105 |
# Limitations
|
106 |
|
107 |
+
- Currently, please ensure that image sizes within the same knowledge base be similar. High variance of image size may cause the model performance degrade. We will augment data and fix this issue in our future version.
|
108 |
+
|
109 |
+
- The modeling script `modeling_minicpmv` on `huggingface` is not standard yet, the inference code could be further improved.
|
110 |
+
|
111 |
+
- The inference speed is low, because vision encoder uses `timm`, which does not yet support `flash-attn`.
|
112 |
+
|
113 |
|
114 |
# Citation
|
115 |
|