pabberpe commited on
Commit
8881e57
·
verified ·
1 Parent(s): c9628af

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -9
README.md CHANGED
@@ -1,13 +1,13 @@
1
  ---
2
  license: apache-2.0
3
  datasets:
 
4
  - HuggingFaceM4/COCO
5
  - ehristoforu/dalle-3-images
6
  - poloclub/diffusiondb
7
  - ehristoforu/midjourney-images
8
  - nateraw/midjourney-texttoimage
9
  - duchaiten/duchaiten-realistic-sdxl
10
- - HPAI-BSC/SuSy-Dataset
11
  tags:
12
  - vision
13
  - image-classification
@@ -15,6 +15,21 @@ tags:
15
  pipeline_tag: image-classification
16
  metrics:
17
  - recall
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  ---
19
 
20
  # SuSy - Synthetic Image Detector
@@ -24,8 +39,29 @@ metrics:
24
 
25
  - **Repository:** https://github.com/HPAI-BSC/SuSy
26
  - **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
 
27
  - **Paper:** TBD
28
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  ## Model Details
30
 
31
  <!-- Provide a longer summary of what this model is. -->
@@ -87,6 +123,7 @@ The model may be biased in the following ways:
87
  The model has the following technical limitations:
88
 
89
  * The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
 
90
  * The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
91
  * The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
92
 
@@ -131,7 +168,7 @@ See `test_image.py` and `test_patch.py` for other examples on how to use the mod
131
  The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
132
 
133
  | Dataset | Year | Train | Validation | Test |
134
- |-------------------|------|-------|------------|-------|
135
  | COCO | 2017 | 2,967 | 1,234 | 1,234 |
136
  | dalle-3-images | 2023 | 987 | 330 | 330 |
137
  | diffusiondb | 2022 | 2,967 | 1,234 | 1,234 |
@@ -170,7 +207,7 @@ To prepare the training data, we extract 240x240 patches from the images, minimi
170
  **Data Augmentation**
171
 
172
  | Technique | Probability | Other Parameters |
173
- |--------------------------|:-----------:|-------------------------------------------|
174
  | HorizontalFlip | 0.50 | - |
175
  | RandomBrightnessContrast | 0.20 | brightness\_limit=0.2 contrast\_limit=0.2 |
176
  | RandomGamma | 0.20 | gamma\_limit=(80, 120) |
@@ -223,16 +260,16 @@ The evaluation code is available at: https://github.com/HPAI-BSC/SuSy
223
 
224
  #### Authentic Sources
225
 
226
- | Dataset | Model | Year | Recall |
227
- |---------------------|-------|------|--------|
228
- | Flickr30k | - | 2014 | 90.53 |
229
- | Google Landmarks v2 | - | 2020 | 64.54 |
230
- | In-the-wild | - | 2024 | 33.06 |
231
 
232
  #### Synthetic Sources
233
 
234
  | Dataset | Model | Year | Recall |
235
- |-------------|---------------------------|------|--------|
236
  | Synthbuster | Glide | 2021 | 53.50 |
237
  | Synthbuster | Stable Diffusion 1.3 | 2022 | 87.00 |
238
  | Synthbuster | Stable Diffusion 1.4 | 2022 | 87.10 |
 
1
  ---
2
  license: apache-2.0
3
  datasets:
4
+ - HPAI-BSC/SuSy-Dataset
5
  - HuggingFaceM4/COCO
6
  - ehristoforu/dalle-3-images
7
  - poloclub/diffusiondb
8
  - ehristoforu/midjourney-images
9
  - nateraw/midjourney-texttoimage
10
  - duchaiten/duchaiten-realistic-sdxl
 
11
  tags:
12
  - vision
13
  - image-classification
 
15
  pipeline_tag: image-classification
16
  metrics:
17
  - recall
18
+ widget:
19
+ - src: midjourney-images-example-patch0.jpg
20
+ output:
21
+ - label: authentic
22
+ score: 0.000049
23
+ - label: dalle-3-images
24
+ score: 0.004659
25
+ - label: diffusiondb
26
+ score: 0.00011
27
+ - label: midjourney-images
28
+ score: 0.994384
29
+ - label: midjourney_tti
30
+ score: 0.000569
31
+ - label: realisticSDXL
32
+ score: 0.000229
33
  ---
34
 
35
  # SuSy - Synthetic Image Detector
 
39
 
40
  - **Repository:** https://github.com/HPAI-BSC/SuSy
41
  - **Dataset:** https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
42
+ - **Model Demo**: https://colab.research.google.com/drive/15nxo0FVd-snOnj9TcX737fFH0j3SmS05
43
  - **Paper:** TBD
44
 
45
+ **Model Results**
46
+
47
+ | Dataset | Type | Model | Year | Recall |
48
+ |:-------------------:|:---------:|:-------------------------:|:----:|:------:|
49
+ | Flickr30k | Authentic | - | 2014 | 90.53 |
50
+ | Google Landmarks v2 | Authentic | - | 2020 | 64.54 |
51
+ | Synthbuster | Synthetic | Glide | 2021 | 53.50 |
52
+ | Synthbuster | Synthetic | Stable Diffusion 1.3 | 2022 | 87.00 |
53
+ | Synthbuster | Synthetic | Stable Diffusion 1.4 | 2022 | 87.10 |
54
+ | Synthbuster | Synthetic | Stable Diffusion 2 | 2022 | 68.40 |
55
+ | Synthbuster | Synthetic | DALL-E 2 | 2022 | 20.70 |
56
+ | Synthbuster | Synthetic | MidJourney V5 | 2023 | 73.10 |
57
+ | Synthbuster | Synthetic | Stable Diffusion XL | 2023 | 79.50 |
58
+ | Synthbuster | Synthetic | Firefly | 2023 | 40.90 |
59
+ | Synthbuster | Synthetic | DALL-E 3 | 2023 | 88.60 |
60
+ | Authors | Synthetic | Stable Diffusion 3 Medium | 2024 | 93.23 |
61
+ | Authors | Synthetic | Flux.1-dev | 2024 | 96.46 |
62
+ | In-the-wild | Synthetic | Mixed/Unknown | 2024 | 89.90 |
63
+ | In-the-wild | Authentic | - | 2024 | 33.06 |
64
+
65
  ## Model Details
66
 
67
  <!-- Provide a longer summary of what this model is. -->
 
123
  The model has the following technical limitations:
124
 
125
  * The performance of the model may be influenced by transformations and editions performed on the images. While the model was trained on some alterations (blur, brightness, compression and gamma) there are other alterations applicable to images that could reduce the model accuracy.
126
+ * The performance of the model might vary depending on the type and source of images
127
  * The model will not be able to attribute synthetic images to their generative model if the model was not included in the training data.
128
  * The model is trained on patches with high gray-level contrast. For images composed entirely by low contrast regions, the model may not work as expected.
129
 
 
168
  The dataset is available at: https://huggingface.co/datasets/HPAI-BSC/SuSy-Dataset
169
 
170
  | Dataset | Year | Train | Validation | Test |
171
+ |:-----------------:|:----:|:-----:|:----------:|:-----:|
172
  | COCO | 2017 | 2,967 | 1,234 | 1,234 |
173
  | dalle-3-images | 2023 | 987 | 330 | 330 |
174
  | diffusiondb | 2022 | 2,967 | 1,234 | 1,234 |
 
207
  **Data Augmentation**
208
 
209
  | Technique | Probability | Other Parameters |
210
+ |:------------------------:|:-----------:|:-----------------------------------------:|
211
  | HorizontalFlip | 0.50 | - |
212
  | RandomBrightnessContrast | 0.20 | brightness\_limit=0.2 contrast\_limit=0.2 |
213
  | RandomGamma | 0.20 | gamma\_limit=(80, 120) |
 
260
 
261
  #### Authentic Sources
262
 
263
+ | Dataset | Year | Recall |
264
+ |:-------------------:|:----:|:------:|
265
+ | Flickr30k | 2014 | 90.53 |
266
+ | Google Landmarks v2 | 2020 | 64.54 |
267
+ | In-the-wild | 2024 | 33.06 |
268
 
269
  #### Synthetic Sources
270
 
271
  | Dataset | Model | Year | Recall |
272
+ |:-----------:|:-------------------------:|:----:|:------:|
273
  | Synthbuster | Glide | 2021 | 53.50 |
274
  | Synthbuster | Stable Diffusion 1.3 | 2022 | 87.00 |
275
  | Synthbuster | Stable Diffusion 1.4 | 2022 | 87.10 |