rizavelioglu
/

fashionfail

Object Detection

cloth-retrieval

Model card Files Files and versions Community

fashionfail / README.md

rizavelioglu

update README.md

12d33e8 8 months ago

|

history blame contribute delete

2.88 kB

	---
	license_name: server-side-public-license
	license_link: https://www.mongodb.com/licensing/server-side-public-license
	tags:
	- fashion
	- cloth-retrieval
	- e-commerce
	- segmentation
	datasets:
	- rizavelioglu/fashionfail
	- detection-datasets/fashionpedia
	pipeline_tag: object-detection
	---

	## Facere*

	The models proposed in the paper _"FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation"_
	[[paper]](https://arxiv.org/abs/2404.08582) [[project page]](https://rizavelioglu.github.io/fashionfail/):
	- `facere_base.onnx`: A pre-trained Mask R-CNN fine-tuned on `Fashionpedia-train`.
	- `facere_plus.onnx`: `facere_base` model further fine-tuned on `FashionFail-train`.

	_* Facere (fa:chere) is a Latin word for 'to make', from which the word fashion is derived.[[source]](https://en.wikipedia.org/wiki/Fashion#:~:text=The%20term,to%20make)_

	## Usage

	```python
	from torchvision.io import read_image
	from torchvision.models.detection import MaskRCNN_ResNet50_FPN_Weights
	from huggingface_hub import hf_hub_download

	path_onnx = hf_hub_download(
	repo_id="rizavelioglu/fashionfail",
	filename="facere_base.onnx", # or "facere_plus.onnx"
	)

	# Load pre-trained model transformations.
	weights = MaskRCNN_ResNet50_FPN_Weights.DEFAULT
	transforms = weights.transforms()

	# Load image and apply original transformation to the image.
	img = read_image("path/to/image")
	img_transformed = transforms(img)

	# Create an inference session.
	ort_session = onnxruntime.InferenceSession(
	path_onnx, providers=["CUDAExecutionProvider", "CPUExecutionProvider"]
	)

	# Run inference on the input.
	ort_inputs = {
	ort_session.get_inputs()[0].name: img_transformed.unsqueeze(dim=0).numpy()
	}
	ort_outs = ort_session.run(None, ort_inputs)

	# Parse the model output.
	boxes, labels, scores, masks = ort_outs
	```

	> Check out the demo code on [HuggingFace Spaces][ff-hf_spaces] for visualizing the output.

	> Also, check out [FashionFail's GitHub repository](https://github.com/rizavelioglu/fashionfail) to get more information on
	> training, inference, and evaluation.

	### License
	TL;DR: Not available for commercial use, unless the FULL source code is shared! \
	This project is intended solely for academic research. No commercial benefits are derived from it.
	Models are licensed under [Server Side Public License (SSPL)](https://www.mongodb.com/legal/licensing/server-side-public-license)

	### Citation
	If you find this repository useful in your research, please consider giving a star ⭐ and a citation:
	```
	@inproceedings{velioglu2024fashionfail,
	author = {Velioglu, Riza and Chan, Robin and Hammer, Barbara},
	title = {FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation},
	journal = {IJCNN},
	eprint = {2404.08582},
	year = {2024},
	}
	```

	[ff-hf_spaces]: https://huggingface.co/spaces/rizavelioglu/fashionfail