|
--- |
|
license_name: server-side-public-license |
|
license_link: https://www.mongodb.com/licensing/server-side-public-license |
|
tags: |
|
- fashion |
|
- cloth-retrieval |
|
- e-commerce |
|
- segmentation |
|
datasets: |
|
- rizavelioglu/fashionfail |
|
- detection-datasets/fashionpedia |
|
pipeline_tag: object-detection |
|
--- |
|
|
|
## Facere* |
|
|
|
The models proposed in the paper _"FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation"_ |
|
[[paper]](https://arxiv.org/abs/2404.08582) [[project page]](https://rizavelioglu.github.io/fashionfail/): |
|
- `facere_base.onnx`: A pre-trained Mask R-CNN fine-tuned on `Fashionpedia-train`. |
|
- `facere_plus.onnx`: `facere_base` model further fine-tuned on `FashionFail-train`. |
|
|
|
_* Facere (fa:chere) is a Latin word for 'to make', from which the word fashion is derived.[[source]](https://en.wikipedia.org/wiki/Fashion#:~:text=The%20term,to%20make)_ |
|
|
|
## Usage |
|
|
|
```python |
|
from torchvision.io import read_image |
|
from torchvision.models.detection import MaskRCNN_ResNet50_FPN_Weights |
|
from huggingface_hub import hf_hub_download |
|
|
|
path_onnx = hf_hub_download( |
|
repo_id="rizavelioglu/fashionfail", |
|
filename="facere_base.onnx", # or "facere_plus.onnx" |
|
) |
|
|
|
# Load pre-trained model transformations. |
|
weights = MaskRCNN_ResNet50_FPN_Weights.DEFAULT |
|
transforms = weights.transforms() |
|
|
|
# Load image and apply original transformation to the image. |
|
img = read_image("path/to/image") |
|
img_transformed = transforms(img) |
|
|
|
# Create an inference session. |
|
ort_session = onnxruntime.InferenceSession( |
|
path_onnx, providers=["CUDAExecutionProvider", "CPUExecutionProvider"] |
|
) |
|
|
|
# Run inference on the input. |
|
ort_inputs = { |
|
ort_session.get_inputs()[0].name: img_transformed.unsqueeze(dim=0).numpy() |
|
} |
|
ort_outs = ort_session.run(None, ort_inputs) |
|
|
|
# Parse the model output. |
|
boxes, labels, scores, masks = ort_outs |
|
``` |
|
|
|
> Check out the demo code on [HuggingFace Spaces][ff-hf_spaces] for visualizing the output. |
|
|
|
> Also, check out [FashionFail's GitHub repository](https://github.com/rizavelioglu/fashionfail) to get more information on |
|
> training, inference, and evaluation. |
|
|
|
### License |
|
TL;DR: Not available for commercial use, unless the FULL source code is shared! \ |
|
This project is intended solely for academic research. No commercial benefits are derived from it. |
|
Models are licensed under [Server Side Public License (SSPL)](https://www.mongodb.com/legal/licensing/server-side-public-license) |
|
|
|
### Citation |
|
If you find this repository useful in your research, please consider giving a star ⭐ and a citation: |
|
``` |
|
@inproceedings{velioglu2024fashionfail, |
|
author = {Velioglu, Riza and Chan, Robin and Hammer, Barbara}, |
|
title = {FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation}, |
|
journal = {IJCNN}, |
|
eprint = {2404.08582}, |
|
year = {2024}, |
|
} |
|
``` |
|
|
|
[ff-hf_spaces]: https://huggingface.co/spaces/rizavelioglu/fashionfail |