Update README.md
Browse files
README.md
CHANGED
@@ -1,13 +1,13 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
datasets:
|
|
|
4 |
- HuggingFaceM4/COCO
|
5 |
- ehristoforu/dalle-3-images
|
6 |
- poloclub/diffusiondb
|
7 |
- ehristoforu/midjourney-images
|
8 |
- nateraw/midjourney-texttoimage
|
9 |
- duchaiten/duchaiten-realistic-sdxl
|
10 |
-
- HPAI-BSC/SuSy-Dataset
|
11 |
tags:
|
12 |
- vision
|
13 |
- image-classification
|
@@ -15,6 +15,21 @@ tags:
|
|
15 |
pipeline_tag: image-classification
|
16 |
metrics:
|
17 |
- recall
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
---
|
19 |
|
20 |
# SuSy - Synthetic Image Detector
|
@@ -24,8 +39,29 @@ metrics:
|
|
24 |
|
25 |
- **Repository:** https://github.com/HPAI-BSC/SuSy
|
26 |
- **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
|
|
|
27 |
- **Paper:** TBD
|
28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
## Model Details
|
30 |
|
31 |
<!-- Provide a longer summary of what this model is. -->
|
@@ -87,6 +123,7 @@ The model may be biased in the following ways:
|
|
87 |
The model has the following technical limitations:
|
88 |
|
89 |
* The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
|
|
|
90 |
* The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
|
91 |
* The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
|
92 |
|
@@ -131,7 +168,7 @@ See `test_image.py` and `test_patch.py` for other examples on how to use the mod
|
|
131 |
The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
|
132 |
|
133 |
| Dataset | Year | Train | Validation | Test |
|
134 |
-
|
135 |
| COCO | 2017 | 2,967 | 1,234 | 1,234 |
|
136 |
| dalle-3-images | 2023 | 987 | 330 | 330 |
|
137 |
| diffusiondb | 2022 | 2,967 | 1,234 | 1,234 |
|
@@ -170,7 +207,7 @@ To prepare the training data, we extract 240x240 patches from the images, minimi
|
|
170 |
**Data Augmentation**
|
171 |
|
172 |
| Technique | Probability | Other Parameters |
|
173 |
-
|
174 |
| HorizontalFlip | 0.50 | - |
|
175 |
| RandomBrightnessContrast | 0.20 | brightness\_limit=0.2 contrast\_limit=0.2 |
|
176 |
| RandomGamma | 0.20 | gamma\_limit=(80, 120) |
|
@@ -223,16 +260,16 @@ The evaluation code is available at: https://github.com/HPAI-BSC/SuSy
|
|
223 |
|
224 |
#### Authentic Sources
|
225 |
|
226 |
-
| Dataset |
|
227 |
-
|
228 |
-
| Flickr30k |
|
229 |
-
| Google Landmarks v2 |
|
230 |
-
| In-the-wild |
|
231 |
|
232 |
#### Synthetic Sources
|
233 |
|
234 |
| Dataset | Model | Year | Recall |
|
235 |
-
|
236 |
| Synthbuster | Glide | 2021 | 53.50 |
|
237 |
| Synthbuster | Stable Diffusion 1.3 | 2022 | 87.00 |
|
238 |
| Synthbuster | Stable Diffusion 1.4 | 2022 | 87.10 |
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
datasets:
|
4 |
+
- HPAI-BSC/SuSy-Dataset
|
5 |
- HuggingFaceM4/COCO
|
6 |
- ehristoforu/dalle-3-images
|
7 |
- poloclub/diffusiondb
|
8 |
- ehristoforu/midjourney-images
|
9 |
- nateraw/midjourney-texttoimage
|
10 |
- duchaiten/duchaiten-realistic-sdxl
|
|
|
11 |
tags:
|
12 |
- vision
|
13 |
- image-classification
|
|
|
15 |
pipeline_tag: image-classification
|
16 |
metrics:
|
17 |
- recall
|
18 |
+
widget:
|
19 |
+
- src: midjourney-images-example-patch0.jpg
|
20 |
+
output:
|
21 |
+
- label: authentic
|
22 |
+
score: 0.000049
|
23 |
+
- label: dalle-3-images
|
24 |
+
score: 0.004659
|
25 |
+
- label: diffusiondb
|
26 |
+
score: 0.00011
|
27 |
+
- label: midjourney-images
|
28 |
+
score: 0.994384
|
29 |
+
- label: midjourney_tti
|
30 |
+
score: 0.000569
|
31 |
+
- label: realisticSDXL
|
32 |
+
score: 0.000229
|
33 |
---
|
34 |
|
35 |
# SuSy - Synthetic Image Detector
|
|
|
39 |
|
40 |
- **Repository:** https://github.com/HPAI-BSC/SuSy
|
41 |
- **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
|
42 |
+
- **Model Demo**: https://colab.research.google.com/drive/15nxo0FVd-snOnj9TcX737fFH0j3SmS05
|
43 |
- **Paper:** TBD
|
44 |
|
45 |
+
**Model Results**
|
46 |
+
|
47 |
+
| Dataset | Type | Model | Year | Recall |
|
48 |
+
|:-------------------:|:---------:|:-------------------------:|:----:|:------:|
|
49 |
+
| Flickr30k | Authentic | - | 2014 | 90.53 |
|
50 |
+
| Google Landmarks v2 | Authentic | - | 2020 | 64.54 |
|
51 |
+
| Synthbuster | Synthetic | Glide | 2021 | 53.50 |
|
52 |
+
| Synthbuster | Synthetic | Stable Diffusion 1.3 | 2022 | 87.00 |
|
53 |
+
| Synthbuster | Synthetic | Stable Diffusion 1.4 | 2022 | 87.10 |
|
54 |
+
| Synthbuster | Synthetic | Stable Diffusion 2 | 2022 | 68.40 |
|
55 |
+
| Synthbuster | Synthetic | DALL-E 2 | 2022 | 20.70 |
|
56 |
+
| Synthbuster | Synthetic | MidJourney V5 | 2023 | 73.10 |
|
57 |
+
| Synthbuster | Synthetic | Stable Diffusion XL | 2023 | 79.50 |
|
58 |
+
| Synthbuster | Synthetic | Firefly | 2023 | 40.90 |
|
59 |
+
| Synthbuster | Synthetic | DALL-E 3 | 2023 | 88.60 |
|
60 |
+
| Authors | Synthetic | Stable Diffusion 3 Medium | 2024 | 93.23 |
|
61 |
+
| Authors | Synthetic | Flux.1-dev | 2024 | 96.46 |
|
62 |
+
| In-the-wild | Synthetic | Mixed/Unknown | 2024 | 89.90 |
|
63 |
+
| In-the-wild | Authentic | - | 2024 | 33.06 |
|
64 |
+
|
65 |
## Model Details
|
66 |
|
67 |
<!-- Provide a longer summary of what this model is. -->
|
|
|
123 |
The model has the following technical limitations:
|
124 |
|
125 |
* The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
|
126 |
+
* The performance of the model might vary depending on the type and source of images
|
127 |
* The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
|
128 |
* The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
|
129 |
|
|
|
168 |
The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
|
169 |
|
170 |
| Dataset | Year | Train | Validation | Test |
|
171 |
+
|:-----------------:|:----:|:-----:|:----------:|:-----:|
|
172 |
| COCO | 2017 | 2,967 | 1,234 | 1,234 |
|
173 |
| dalle-3-images | 2023 | 987 | 330 | 330 |
|
174 |
| diffusiondb | 2022 | 2,967 | 1,234 | 1,234 |
|
|
|
207 |
**Data Augmentation**
|
208 |
|
209 |
| Technique | Probability | Other Parameters |
|
210 |
+
|:------------------------:|:-----------:|:-----------------------------------------:|
|
211 |
| HorizontalFlip | 0.50 | - |
|
212 |
| RandomBrightnessContrast | 0.20 | brightness\_limit=0.2 contrast\_limit=0.2 |
|
213 |
| RandomGamma | 0.20 | gamma\_limit=(80, 120) |
|
|
|
260 |
|
261 |
#### Authentic Sources
|
262 |
|
263 |
+
| Dataset | Year | Recall |
|
264 |
+
|:-------------------:|:----:|:------:|
|
265 |
+
| Flickr30k | 2014 | 90.53 |
|
266 |
+
| Google Landmarks v2 | 2020 | 64.54 |
|
267 |
+
| In-the-wild | 2024 | 33.06 |
|
268 |
|
269 |
#### Synthetic Sources
|
270 |
|
271 |
| Dataset | Model | Year | Recall |
|
272 |
+
|:-----------:|:-------------------------:|:----:|:------:|
|
273 |
| Synthbuster | Glide | 2021 | 53.50 |
|
274 |
| Synthbuster | Stable Diffusion 1.3 | 2022 | 87.00 |
|
275 |
| Synthbuster | Stable Diffusion 1.4 | 2022 | 87.10 |
|