HPAI-BSC
/

SuSy

Image Classification

Transformers

vision

synthetic image detection

Inference Endpoints

Model card Files Files and versions Community

pabberpe commited on Sep 20, 2024

Commit

8881e57

verified ·

1 Parent(s): c9628af

Update README.md

Browse files

Files changed (1) hide show

README.md +46 -9

README.md CHANGED Viewed

@@ -1,13 +1,13 @@
 ---
 license: apache-2.0
 datasets:
 - HuggingFaceM4/COCO
 - ehristoforu/dalle-3-images
 - poloclub/diffusiondb
 - ehristoforu/midjourney-images
 - nateraw/midjourney-texttoimage
 - duchaiten/duchaiten-realistic-sdxl
-- HPAI-BSC/SuSy-Dataset
 tags:
 - vision
 - image-classification
@@ -15,6 +15,21 @@ tags:
 pipeline_tag: image-classification
 metrics:
 - recall
 ---
 # SuSy - Synthetic Image Detector
@@ -24,8 +39,29 @@ metrics:
 - **Repository:** https://github.com/HPAI-BSC/SuSy
 - **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
 - **Paper:** TBD
 ## Model Details
 <!-- Provide a longer summary of what this model is. -->
@@ -87,6 +123,7 @@ The model may be biased in the following ways:
 The model has the following technical limitations:
 * The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
 * The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
 * The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
@@ -131,7 +168,7 @@ See `test_image.py` and `test_patch.py` for other examples on how to use the mod
 The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
 | Dataset           | Year | Train | Validation | Test  |
-|-------------------|------|-------|------------|-------|
 | COCO              | 2017 | 2,967 | 1,234      | 1,234 |
 | dalle-3-images    | 2023 | 987   | 330        | 330   |
 | diffusiondb       | 2022 | 2,967 | 1,234      | 1,234 |
@@ -170,7 +207,7 @@ To prepare the training data, we extract 240x240 patches from the images, minimi
 **Data Augmentation**
 | Technique                | Probability | Other Parameters                          |
-|--------------------------|:-----------:|-------------------------------------------|
 | HorizontalFlip           |     0.50    | -                                         |
 | RandomBrightnessContrast |     0.20    | brightness\_limit=0.2 contrast\_limit=0.2 |
 | RandomGamma              |     0.20    | gamma\_limit=(80, 120)                    |
@@ -223,16 +260,16 @@ The evaluation code is available at: https://github.com/HPAI-BSC/SuSy
 #### Authentic Sources
-| Dataset             | Model | Year | Recall |
-|---------------------|-------|------|--------|
-| Flickr30k           | -     | 2014 | 90.53  |
-| Google Landmarks v2 | -     | 2020 | 64.54  |
-| In-the-wild         | -     | 2024 | 33.06  |
 #### Synthetic Sources
 | Dataset     | Model                     | Year | Recall |
-|-------------|---------------------------|------|--------|
 | Synthbuster | Glide                     | 2021 | 53.50  |
 | Synthbuster | Stable Diffusion 1.3      | 2022 | 87.00  |
 | Synthbuster | Stable Diffusion 1.4      | 2022 | 87.10  |

 ---
 license: apache-2.0
 datasets:
+- HPAI-BSC/SuSy-Dataset
 - HuggingFaceM4/COCO
 - ehristoforu/dalle-3-images
 - poloclub/diffusiondb
 - ehristoforu/midjourney-images
 - nateraw/midjourney-texttoimage
 - duchaiten/duchaiten-realistic-sdxl
 tags:
 - vision
 - image-classification
 pipeline_tag: image-classification
 metrics:
 - recall
+widget:
+- src: midjourney-images-example-patch0.jpg
+  output:
+    - label: authentic
+      score: 0.000049
+    - label: dalle-3-images
+      score: 0.004659
+    - label: diffusiondb
+      score: 0.00011
+    - label: midjourney-images
+      score: 0.994384
+    - label: midjourney_tti
+      score: 0.000569
+    - label: realisticSDXL
+      score: 0.000229
 ---
 # SuSy - Synthetic Image Detector
 - **Repository:** https://github.com/HPAI-BSC/SuSy
 - **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
+- **Model Demo**: https://colab.research.google.com/drive/15nxo0FVd-snOnj9TcX737fFH0j3SmS05
 - **Paper:** TBD
+**Model Results**
+| Dataset             | Type      | Model                     | Year | Recall |
+|:-------------------:|:---------:|:-------------------------:|:----:|:------:|
+| Flickr30k           | Authentic | -                         | 2014 | 90.53  |
+| Google Landmarks v2 | Authentic | -                         | 2020 | 64.54  |
+| Synthbuster         | Synthetic | Glide                     | 2021 | 53.50  |
+| Synthbuster         | Synthetic | Stable Diffusion 1.3      | 2022 | 87.00  |
+| Synthbuster         | Synthetic | Stable Diffusion 1.4      | 2022 | 87.10  |
+| Synthbuster         | Synthetic | Stable Diffusion 2        | 2022 | 68.40  |
+| Synthbuster         | Synthetic | DALL-E 2                  | 2022 | 20.70  |
+| Synthbuster         | Synthetic | MidJourney V5             | 2023 | 73.10  |
+| Synthbuster         | Synthetic | Stable Diffusion XL       | 2023 | 79.50  |
+| Synthbuster         | Synthetic | Firefly                   | 2023 | 40.90  |
+| Synthbuster         | Synthetic | DALL-E 3                  | 2023 | 88.60  |
+| Authors             | Synthetic | Stable Diffusion 3 Medium | 2024 | 93.23  |
+| Authors             | Synthetic | Flux.1-dev                | 2024 | 96.46  |
+| In-the-wild         | Synthetic | Mixed/Unknown             | 2024 | 89.90  |
+| In-the-wild         | Authentic | -                         | 2024 | 33.06  |
 ## Model Details
 <!-- Provide a longer summary of what this model is. -->
 The model has the following technical limitations:
 * The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
+* The performance of the model might vary depending on the type and source of images
 * The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
 * The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
 The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
 | Dataset           | Year | Train | Validation | Test  |
+|:-----------------:|:----:|:-----:|:----------:|:-----:|
 | COCO              | 2017 | 2,967 | 1,234      | 1,234 |
 | dalle-3-images    | 2023 | 987   | 330        | 330   |
 | diffusiondb       | 2022 | 2,967 | 1,234      | 1,234 |
 **Data Augmentation**
 | Technique                | Probability | Other Parameters                          |
+|:------------------------:|:-----------:|:-----------------------------------------:|
 | HorizontalFlip           |     0.50    | -                                         |
 | RandomBrightnessContrast |     0.20    | brightness\_limit=0.2 contrast\_limit=0.2 |
 | RandomGamma              |     0.20    | gamma\_limit=(80, 120)                    |
 #### Authentic Sources
+| Dataset             | Year | Recall |
+|:-------------------:|:----:|:------:|
+| Flickr30k           | 2014 | 90.53  |
+| Google Landmarks v2 | 2020 | 64.54  |
+| In-the-wild         | 2024 | 33.06  |
 #### Synthetic Sources
 | Dataset     | Model                     | Year | Recall |
+|:-----------:|:-------------------------:|:----:|:------:|
 | Synthbuster | Glide                     | 2021 | 53.50  |
 | Synthbuster | Stable Diffusion 1.3      | 2022 | 87.00  |
 | Synthbuster | Stable Diffusion 1.4      | 2022 | 87.10  |