metadata
pipeline_tag: image-to-image
tags:
- HiT-SR
- image super-resolution
- transformer
- efficient transformer
HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
[Github] | [Paper] | [Supp] | [Video] | [Visual Results]
HiT-SR is a general strategy to improve transformer-based SR methods. We apply our HiT-SR approach to improve SwinIR-Light, SwinIR-NG and SRFormer-Light, corresponding to our HiT-SIR, HiT-SNG, and HiT-SRF. Compared with the original structure, our improved models achieve better SR performance while reducing computational burdens.
π οΈ Setup
Install the dependencies under the working directory:
git clone https://huggingface.co/XiangZ/hit-sr
cd hit-sr
pip install -r requirements.txt
π Usage
For each HiT-SR model, we provide 2x, 3x, 4x upscaling versions:
Repo Name | Model | Upscale | ||
---|---|---|---|---|
XiangZ/hit-sir-2x |
HiT-SIR | 2x | ||
XiangZ/hit-sir-3x |
HiT-SIR | 3x | ||
XiangZ/hit-sir-4x |
HiT-SIR | 4x | ||
XiangZ/hit-sng-2x |
HiT-SNG | 2x | ||
XiangZ/hit-sng-3x |
HiT-SNG | 3x | ||
XiangZ/hit-sng-4x |
HiT-SNG | 4x | ||
XiangZ/hit-srf-2x |
HiT-SRF | 2x | ||
XiangZ/hit-srf-3x |
HiT-SRF | 3x | ||
XiangZ/hit-srf-4x |
HiT-SRF | 4x |
To test the model (use hit-srf-4x as an example):
from hit_sir_arch import HiT_SIR
from hit_sng_arch import HiT_SNG
from hit_srf_arch import HiT_SRF
import cv2
import torch
# detect device
device = torch.device('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu')
# initialize model (change model and upscale according to your setting)
model = HiT_SRF(upscale=4)
# load model (change repo_name according to your setting)
repo_name = "XiangZ/hit-srf-4x"
model = model.from_pretrained(repo_name)
model.to(device)
# test and save results
sr_results = model.infer_image("path-to-input-image", device=device)
cv2.imwrite("path-to-output-location", sr_results)
π Citation
If you find the code helpful in your research or work, please consider citing the following paper.
@inproceedings{zhang2024hitsr,
title={HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution},
author={Zhang, Xiang and Zhang, Yulun and Yu, Fisher},
booktitle={ECCV},
year={2024}
}