facebook
/

ijepa_vith14_22k

Image Feature Extraction

Inference Endpoints

Model card Files Files and versions Community

jmtzt commited on Nov 18

Commit

365fd12

•

1 Parent(s): 007cb2b

Update README.md

Files changed (1) hide show

README.md +20 -13

README.md CHANGED Viewed

@@ -31,27 +31,34 @@ I-JEPA can be used for image classification or feature extraction. This checkpoi
 ## How to use
-Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:
 ```python
 import requests
 from PIL import Image
-from transformers import AutoProcessor, IJepaForImageClassification
-url = "http://images.cocodataset.org/val2017/000000039769.jpg"
-image = Image.open(requests.get(url, stream=True).raw)
 model_id = "jmtzt/ijepa_vith14_22k"
 processor = AutoProcessor.from_pretrained(model_id)
-model = IJepaForImageClassification.from_pretrained(model_id)
-inputs = processor(images=image, return_tensors="pt")
-outputs = model(**inputs)
-logits = outputs.logits
-# model predicts one of the 1000 ImageNet classes
-predicted_class_idx = logits.argmax(-1).item()
-print("Predicted class:", model.config.id2label[predicted_class_idx])
 ```
 ### BibTeX entry and citation info

 ## How to use
+Here is how to use this model for image feature extraction:
 ```python
 import requests
 from PIL import Image
+from torch.nn.functional import cosine_similarity
+from transformers import AutoModel, AutoProcessor
+url_1 = "http://images.cocodataset.org/val2017/000000039769.jpg"
+url_2 = "http://images.cocodataset.org/val2017/000000219578.jpg"
+image_1 = Image.open(requests.get(url_1, stream=True).raw)
+image_2 = Image.open(requests.get(url_2, stream=True).raw)
 model_id = "jmtzt/ijepa_vith14_22k"
 processor = AutoProcessor.from_pretrained(model_id)
+model = AutoModel.from_pretrained(model_id)
+def infer(image):
+    inputs = processor(image, return_tensors="pt")
+    outputs = model(**inputs)
+    return outputs.pooler_output
+embed_1 = infer(image_1)
+embed_2 = infer(image_2)
+similarity = cosine_similarity(embed_1, embed_2)
+print(similarity)
 ```
 ### BibTeX entry and citation info