erow
/

SiT

 ---
 license: apache-2.0
+tags:
+- self-supervised learning
+- vision
+- SiT
+inference: false
 ---
+# Model description
+SiT is a self-supervised learning model that combines masked image modeling and contrastive learning. The model is trained on ImageNet-1K.
+# Model Sources
+- https://github.com/Sara-Ahmed/SiT
+- https://arxiv.org/abs/2104.03602
+# Model Card Authors
+Sara Atito, Muhammad Awais, Josef Kittler
+# How to use
+```python
+from modeling_sit import ViTSiTForPreTraining
+# reload
+model = ViTSiTForPreTraining.from_pretrained("erow/SiT")
+```
+# BibTeX entry and citation info
+```
+@inproceedings{atito2023sit,
+  title={SiT is all you need},
+  author={Atito, Sara and Awais, Muhammed and Nandam, Srinivasa and Kittler, Josef},
+  booktitle={2023 IEEE International Conference on Image Processing (ICIP)},
+  pages={2125--2129},
+  year={2023},
+  organization={IEEE}
+}
+```