Update README.md
Browse files
README.md
CHANGED
@@ -29,6 +29,30 @@ The model correctly captures positional uncertainty and produces high-level obje
|
|
29 |
|
30 |
I-JEPA can be used for image classification or feature extraction. This checkpoint in specific is intended for **Feature Extraction**.
|
31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
|
33 |
### BibTeX entry and citation info
|
34 |
If you use I-JEPA or this code in your work, please cite:
|
|
|
29 |
|
30 |
I-JEPA can be used for image classification or feature extraction. This checkpoint in specific is intended for **Feature Extraction**.
|
31 |
|
32 |
+
## How to use
|
33 |
+
|
34 |
+
Here is how to use this model to classify an image of the COCO 2017 dataset into one of the 1,000 ImageNet classes:
|
35 |
+
|
36 |
+
```python
|
37 |
+
import requests
|
38 |
+
|
39 |
+
from PIL import Image
|
40 |
+
from transformers import AutoProcessor, IJepaForImageClassification
|
41 |
+
|
42 |
+
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
|
43 |
+
image = Image.open(requests.get(url, stream=True).raw)
|
44 |
+
|
45 |
+
model_id = "jmtzt/ijepa_vith16_1k"
|
46 |
+
processor = AutoProcessor.from_pretrained(model_id)
|
47 |
+
model = IJepaForImageClassification.from_pretrained(model_id)
|
48 |
+
|
49 |
+
inputs = processor(images=image, return_tensors="pt")
|
50 |
+
outputs = model(**inputs)
|
51 |
+
logits = outputs.logits
|
52 |
+
# model predicts one of the 1000 ImageNet classes
|
53 |
+
predicted_class_idx = logits.argmax(-1).item()
|
54 |
+
print("Predicted class:", model.config.id2label[predicted_class_idx])
|
55 |
+
```
|
56 |
|
57 |
### BibTeX entry and citation info
|
58 |
If you use I-JEPA or this code in your work, please cite:
|