Update README.md
Browse files
README.md
CHANGED
@@ -19,7 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
19 |
|
20 |
This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the [davanstrien/ufo-ColPali](https://huggingface.co/datasets/davanstrien/ufo-ColPali) dataset.
|
21 |
|
22 |
-
|
|
|
|
|
|
|
|
|
23 |
|
24 |
The model achieves the following results on the evaluation set:
|
25 |
- Loss: 0.1064
|
|
|
19 |
|
20 |
This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the [davanstrien/ufo-ColPali](https://huggingface.co/datasets/davanstrien/ufo-ColPali) dataset.
|
21 |
|
22 |
+
The model was trained using the fine tuning [notebook](https://github.com/tonywu71/colpali-cookbooks/blob/main/examples/finetune_colpali.ipynb) from [tonywu71](https://huggingface.co/tonywu71). I changed almost nothing except the data processing steps.
|
23 |
+
|
24 |
+
The dataset used for training was created using synthetic data from [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct). The process for making this dataset is discussed more in the [blog post](https://danielvanstrien.xyz/posts/post-with-code/colpali/2024-09-23-generate_colpali_dataset.html).
|
25 |
+
|
26 |
+
|
27 |
|
28 |
The model achieves the following results on the evaluation set:
|
29 |
- Loss: 0.1064
|