flaviagiammarino committed
Commit: dc082c8
Parent(s): 880bb40
Update README.md

README.md CHANGED
@@ -16,23 +16,18 @@ PubMedCLIP is a fine-tuned version of [CLIP](https://huggingface.co/docs/transfo
 image–text pairs obtained from [PubMed](https://pubmed.ncbi.nlm.nih.gov/) articles.
 
 ## Model Details
-PubMedCLIP was trained on the [Radiology Objects in COntext (ROCO)](https://link.springer.com/chapter/10.1007/978-3-030-01364-6_20) dataset, which provides
-over 80,000 samples including diverse imaging modalities (such as ultrasound, X-Ray, MRI, etc.) from various human body regions (such as head, neck, spine, etc.)
-captured from PubMed articles. The texts used for training were taken from the short captions (average length of 20 words) associated with images in the articles.
 
 ### Model Description
-
--
--
-- **Language(s) (NLP):** {{ language | default("[More Information Needed]", true)}}
-- **License:** {{ license | default("[More Information Needed]", true)}}
-- **Finetuned from model [optional]:** {{ finetuned_from | default("[More Information Needed]", true)}}
+PubMedCLIP was trained on the [Radiology Objects in COntext (ROCO)](https://github.com/razorx89/roco-dataset) dataset, a large-scale multimodal medical imaging dataset.
+The ROCO dataset includes diverse imaging modalities (such as ultrasound, X-Ray, MRI, etc.) from various human body regions (such as head, neck, spine, etc.)
+captured from open-access PubMed articles. The texts used for training PubMedCLIP were taken from the short captions associated with the images in the dataset.
 
-
+The authors of PubMedCLIP have released three different pre-trained models at this [link](https://1drv.ms/u/s!ApXgPqe9kykTgwD4Np3-f7ODAot8?e=zLVlJ2) using
+ResNet-50, ResNet-50x4 and ViT32 as image encoders. This repository includes only the ViT32 variant.
 
-
+### Model Sources
 
-- **Repository:**
+- **Repository:** [Official GitHub Repository](https://github.com/sarahESL/PubMedCLIP)
 - **Paper [optional]:** {{ paper | default("[More Information Needed]", true)}}
 - **Demo [optional]:** {{ demo | default("[More Information Needed]", true)}}
 
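The updated card describes a CLIP-style model (ViT32 image encoder) scored against text captions. As a rough illustration of the image–text scoring interface such a checkpoint exposes, here is a minimal sketch using a tiny, randomly initialised CLIP from 🤗 Transformers — the config sizes and inputs are arbitrary stand-ins so the snippet runs without downloading anything, not the released PubMedCLIP weights (those would be loaded from the official repository linked in the diff):

```python
import torch
from transformers import CLIPConfig, CLIPModel, CLIPTextConfig, CLIPVisionConfig

# Tiny randomly initialised CLIP so the sketch runs offline; for the real
# PubMedCLIP ViT32 variant one would load the released checkpoint instead.
text_cfg = CLIPTextConfig(vocab_size=1000, hidden_size=64, intermediate_size=128,
                          num_hidden_layers=2, num_attention_heads=2)
vision_cfg = CLIPVisionConfig(hidden_size=64, intermediate_size=128,
                              num_hidden_layers=2, num_attention_heads=2,
                              image_size=32, patch_size=8)
model = CLIPModel(CLIPConfig.from_text_vision_configs(text_cfg, vision_cfg,
                                                      projection_dim=32))
model.eval()

# One image scored against two candidate captions, CLIP-style.
input_ids = torch.randint(0, 1000, (2, 7))       # two tokenised captions (dummy ids)
attention_mask = torch.ones_like(input_ids)
pixel_values = torch.randn(1, 3, 32, 32)         # one RGB image (dummy pixels)

with torch.no_grad():
    out = model(input_ids=input_ids, attention_mask=attention_mask,
                pixel_values=pixel_values)

# logits_per_image holds one similarity score per (image, caption) pair.
print(tuple(out.logits_per_image.shape))  # (1 image, 2 candidate texts)
```

With real weights, the row of `logits_per_image` ranks the candidate captions for the image, which is how CLIP-style models are used for zero-shot retrieval and classification.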