Spaces:

clip-italian
/

clip-italian-demo

Running

App Files Files Community

vinid commited on Jul 25, 2021

Commit

5e8f750

1 Parent(s): 8e64654

introduction updates

Browse files

Files changed (1) hide show

introduction.md +4 -4

introduction.md CHANGED Viewed

@@ -21,23 +21,23 @@ Thank you for this amazing opportunity, we hope you will like the results! :hear
 In this demo, we present two tasks:
-+ *Text to Image*: This task is essentially an image retrieval task. The user is asked to input a string of text and CLIP is going to
 compute the similarity between this string of text with respect to a set of images. The webapp is going to display the images that
 have the highest similarity with the text query.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/text_to_image.png" alt="drawing" width="95%"/>
-+ *Image to Text*: This task is essentially a zero-shot image classification task. The user is asked for an image and for a set of captions/labels and CLIP
 is going to compute the similarity between the image and each label. The webapp is going to display a probability distribution over the captions.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/image_to_text.png" alt="drawing" width="95%"/>
-+ *Localization*: This is one of ours **very cool** features and at the best of our knowledge, it is a novel contribution. We can use CLIP
 to find where "something" (like a "cat") is an image. The location of the object is computed by masking different areas of the image and looking at how the similarity to the image description changes.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/gatto_cane.png" alt="drawing" width="95%"/>
-+ *Examples & Applications*: This page showcases some interesting results we got from the model, we believe that there are
 different applications that can start from here.
 # Novel Contributions

 In this demo, we present two tasks:
++ **Text to Image**: This task is essentially an image retrieval task. The user is asked to input a string of text and CLIP is going to
 compute the similarity between this string of text with respect to a set of images. The webapp is going to display the images that
 have the highest similarity with the text query.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/text_to_image.png" alt="drawing" width="95%"/>
++ **Image to Text**: This task is essentially a zero-shot image classification task. The user is asked for an image and for a set of captions/labels and CLIP
 is going to compute the similarity between the image and each label. The webapp is going to display a probability distribution over the captions.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/image_to_text.png" alt="drawing" width="95%"/>
++ **Localization**: This is one of ours **very cool** features and at the best of our knowledge, it is a novel contribution. We can use CLIP
 to find where "something" (like a "cat") is an image. The location of the object is computed by masking different areas of the image and looking at how the similarity to the image description changes.
 <img src="https://huggingface.co/spaces/clip-italian/clip-italian-demo/raw/main/static/img/gatto_cane.png" alt="drawing" width="95%"/>
++ **Examples & Applications**: This page showcases some interesting results we got from the model, we believe that there are
 different applications that can start from here.
 # Novel Contributions