davanstrien/finetune_colpali_v1_2-ufo-4bit


davanstrien (HF staff) committed on Sep 26, 2024

Commit 8dc8c2d · verified · 1 Parent(s): 7dc4378

Update README.md

Files changed (1)

README.md +5 -1


@@ -19,7 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the [davanstrien/ufo-ColPali](https://huggingface.co/datasets/davanstrien/ufo-ColPali) dataset.
-This dataset was created using [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct). The process for making this dataset is discussed more in the [blog post](https://danielvanstrien.xyz/posts/post-with-code/colpali/2024-09-23-generate_colpali_dataset.html).
+The model was trained using the fine tuning [notebook](https://github.com/tonywu71/colpali-cookbooks/blob/main/examples/finetune_colpali.ipynb) from [tonywu71](https://huggingface.co/tonywu71). I changed almost nothing except the data processing steps.
+The dataset used for training was created using synthetic data from [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct). The process for making this dataset is discussed more in the [blog post](https://danielvanstrien.xyz/posts/post-with-code/colpali/2024-09-23-generate_colpali_dataset.html).
 The model achieves the following results on the evaluation set:
 - Loss: 0.1064