Update README.md
Browse files
README.md
CHANGED
|
@@ -12,6 +12,7 @@ M3D-CLIP is one of the works in the [M3D](https://github.com/BAAI-DCAI/M3D) seri
|
|
| 12 |
It is a 3D medical CLIP model that aligns vision and language through contrastive loss on the [M3D-Cap](https://huggingface.co/datasets/GoodBaiBai88/M3D-Cap) dataset.
|
| 13 |
The vision encoder uses 3D ViT with 32\*256\*256 image size and 4\*16\*16 patch size.
|
| 14 |
The language encoder utilizes a pre-trained BERT as initialization.
|
|
|
|
| 15 |
The uses of M3D-CLIP:
|
| 16 |
1. 3D medical image and text retrieval task.
|
| 17 |
2. Aligned and powerful image and text features for downstream tasks.
|
|
|
|
| 12 |
It is a 3D medical CLIP model that aligns vision and language through contrastive loss on the [M3D-Cap](https://huggingface.co/datasets/GoodBaiBai88/M3D-Cap) dataset.
|
| 13 |
The vision encoder uses 3D ViT with 32\*256\*256 image size and 4\*16\*16 patch size.
|
| 14 |
The language encoder utilizes a pre-trained BERT as initialization.
|
| 15 |
+
|
| 16 |
The uses of M3D-CLIP:
|
| 17 |
1. 3D medical image and text retrieval task.
|
| 18 |
2. Aligned and powerful image and text features for downstream tasks.
|