Update example with infinity

#9
by michaelfeil - opened
Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -103,6 +103,17 @@ with torch.no_grad():
  scores = processor.score_multi_vector(querry_embeddings, image_embeddings)
  ```
 
+ ## Infinity
+
+ Usage with docker and [Infinity](https://github.com/michaelfeil/infinity).
+ Infinity only works with the `-merged` weight variants of ColPali and ColQwen.
+
+ ```bash
+ docker run --gpus all -v $PWD/data:/app/.cache -p "7997":"7997" \
+ michaelf34/infinity:0.0.69 \
+ v2 --model-id vidore/colpali-v1.2-merged --revision "cd80ee4200c591b788a9c4e21bb5d549d4a04637" --dtype bfloat16 --batch-size 8 --device cuda --engine torch --port 7997
+ ```
+
  ## Limitations
 
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.
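
Note on the context lines of this hunk: the unchanged `processor.score_multi_vector(...)` call scores multi-vector query embeddings against multi-vector page embeddings with late interaction (MaxSim). The sketch below illustrates that scoring idea in plain Python, assuming each embedding is a list of per-token (or per-patch) vectors. It is not the colpali-engine implementation, and the names `maxsim_score` and `score_matrix` are hypothetical helpers introduced only for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vectors: Sequence[Vector], page_vectors: Sequence[Vector]) -> float:
    """Late-interaction score: for every query vector, take its best-matching
    page vector (maximum dot product), then sum those maxima."""
    if not page_vectors:
        raise ValueError("page_vectors must contain at least one vector")
    return sum(max(dot(q, p) for p in page_vectors) for q in query_vectors)


def score_matrix(
    queries: Sequence[Sequence[Vector]], pages: Sequence[Sequence[Vector]]
) -> List[List[float]]:
    """Score every query against every page; rows are queries, columns are pages."""
    return [[maxsim_score(q, p) for p in pages] for q in queries]


# Toy data (made up for illustration): one query with two token vectors, two pages.
query = [[1.0, 0.0], [0.0, 1.0]]
page_a = [[1.0, 0.0], [0.0, 1.0]]  # matches both query vectors exactly
page_b = [[0.5, 0.5]]              # a single vector that partially matches both

scores = score_matrix([query], [page_a, page_b])
best_page = max(range(len(scores[0])), key=lambda i: scores[0][i])
```

In this toy case `page_a` ranks above `page_b` for the query. A real pipeline would run this over batched tensors on the GPU rather than Python lists.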