Transformers
PyTorch
Safetensors
swin
vision
simmim

Swin Transformer (base-sized model)

Swin Transformer model pre-trained on ImageNet-1k using the SimMIM objective at resolution 192x192. It was introduced in the paper SimMIM: A Simple Framework for Masked Image Modeling by Xie et al. and first released in this repository.

Intended use cases

This model is pre-trained only, it's meant to be fine-tuned on a downstream dataset.

Usage

Refer to the documentation.

Downloads last month
429
Safetensors
Model size
89.9M params
Tensor type
I64
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train microsoft/swin-base-simmim-window6-192