---
title: Ov Seg
emoji: π
colorFrom: red
colorTo: pink
sdk: gradio
sdk_version: 3.8.2
app_file: app.py
pinned: false
license: cc-by-nc-4.0
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# [OVSeg] Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP

<img src="resources/pytorch-logo-dark.png" width="10%">

This is the official PyTorch implementation of our paper: <br>
**Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP**<br>
[Feng Liang](https://jeff-liangf.github.io/), [Bichen Wu](https://www.linkedin.com/in/bichenwu), [Xiaoliang Dai](https://sites.google.com/view/xiaoliangdai/), [Kunpeng Li](https://kunpengli1994.github.io/), [Yinan Zhao](https://yinan-zhao.github.io/), [Hang Zhang](https://hangzhang.org/), [Peizhao Zhang](https://www.linkedin.com/in/peizhao-zhang-14846042/), [Peter Vajda](https://sites.google.com/site/vajdap), [Diana Marculescu](https://www.ece.utexas.edu/people/faculty/diana-marculescu)

[[arXiv](https://arxiv.org/abs/2210.04150)] [[Project](https://jeff-liangf.github.io/projects/ovseg/)]
<p align="center">
<img src="resources/ovseg.gif" width="100%">
</p>

## Installation

Please see [installation guide](./INSTALL.md).

## Data Preparation

Please see [datasets preparation](./datasets/DATASETS.md).

## Getting started

Please see [getting started instruction](./GETTING_STARTED.md).
## LICENSE

Shield: [![CC BY-NC 4.0][cc-by-nc-shield]][cc-by-nc]

The majority of OVSeg is licensed under a
[Creative Commons Attribution-NonCommercial 4.0 International License](LICENSE).

[![CC BY-NC 4.0][cc-by-nc-image]][cc-by-nc]

[cc-by-nc]: http://creativecommons.org/licenses/by-nc/4.0/
[cc-by-nc-image]: https://licensebuttons.net/l/by-nc/4.0/88x31.png
[cc-by-nc-shield]: https://img.shields.io/badge/License-CC%20BY--NC%204.0-lightgrey.svg

However, portions of the project are under separate license terms: CLIP and ZSSEG are licensed under the [MIT license](https://github.com/openai/CLIP/blob/main/LICENSE); MaskFormer is licensed under [CC-BY-NC](https://github.com/facebookresearch/MaskFormer/blob/main/LICENSE); openclip is licensed under the license at [its repo](https://github.com/mlfoundations/open_clip/blob/main/LICENSE).

## Citing OVSeg :pray:

If you use OVSeg in your research or wish to refer to the baseline results published in the paper, please use the following BibTeX entry.

```BibTeX
@article{liang2022open, | |
title={Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP}, | |
author={Liang, Feng and Wu, Bichen and Dai, Xiaoliang and Li, Kunpeng and Zhao, Yinan and Zhang, Hang and Zhang, Peizhao and Vajda, Peter and Marculescu, Diana}, | |
journal={arXiv preprint arXiv:2210.04150}, | |
year={2022} | |
} | |
``` | |