tianzhuotao committed on
Commit
fef4159
·
1 Parent(s): 1f5e026
Files changed (1)
  1. README.md +21 -0
README.md CHANGED
@@ -74,6 +74,27 @@ After that, input the text prompt and then the image path. For example,
  The results should be like:
  <p align="center"> <img src="imgs/example1.jpg" width="22%"> <img src="vis_output/example1_masked_img_0.jpg" width="22%"> <img src="imgs/example2.jpg" width="25%"> <img src="vis_output/example2_masked_img_0.jpg" width="25%"> </p>
 
+ ## Dataset
+ We have collected 1218 images: 239 for training, 200 for validation, and 779 for testing. The training and validation sets can be downloaded from <a href="https://drive.google.com/drive/folders/125mewyg5Ao6tZ3ZdJ-1-E3n04LGVELqy?usp=sharing">**this link**</a>.
+
+ Each image comes with a JSON annotation file of the same name:
+ ```
+ image_1.jpg, image_1.json
+ image_2.jpg, image_2.json
+ ...
+ image_n.jpg, image_n.json
+ ```
+ The important keys in each JSON file are:
+ ```
+ - "text": text instructions.
+ - "is_sentence": whether the text instructions are long sentences.
+ - "shapes": annotated polygons.
+ ```
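As a rough illustration of how these files might be read, here is a minimal sketch rather than code from this repository. The helper names `load_annotation` and `pair_images_with_annotations` are hypothetical, and the sketch assumes that images use the `.jpg` extension and that each JSON file shares its image's file stem, as in the listing above.

```python
import json
from pathlib import Path


def load_annotation(json_path):
    """Read one annotation file and keep only the three documented keys."""
    with open(json_path, "r", encoding="utf-8") as f:
        ann = json.load(f)
    return {
        "text": ann["text"],                # text instruction(s)
        "is_sentence": ann["is_sentence"],  # long-sentence flag
        "shapes": ann["shapes"],            # annotated polygons
    }


def pair_images_with_annotations(data_dir):
    """Yield (image_path, json_path) pairs that share a file stem.

    Assumption: images are stored as *.jpg next to their *.json files.
    Images without a matching JSON file are skipped.
    """
    for img_path in sorted(Path(data_dir).glob("*.jpg")):
        json_path = img_path.with_suffix(".json")
        if json_path.exists():
            yield img_path, json_path
```

Any other keys present in a file are simply dropped by `load_annotation`; adjust the returned dictionary if more fields are needed.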
+
+ Each element of "shapes" belongs to one of two categories, **"target"** or **"ignore"**. Target polygons are required for evaluation, while ignore polygons mark ambiguous regions and are therefore disregarded during evaluation.
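The effect of the two categories on evaluation can be sketched as follows. This is an illustrative assumption, not the project's evaluation code: it supposes that each shape is a dictionary carrying its category under a `"label"` key, and that polygons have already been rasterized into flat masks in which a hypothetical value of 255 marks pixels covered by "ignore" polygons.

```python
IGNORE_VALUE = 255  # hypothetical marker for pixels inside "ignore" polygons


def split_shapes(shapes):
    """Separate polygons by category, assuming each shape has a "label" key."""
    targets = [s for s in shapes if s.get("label") == "target"]
    ignores = [s for s in shapes if s.get("label") == "ignore"]
    return targets, ignores


def iou_with_ignore(pred, gt, ignore_value=IGNORE_VALUE):
    """IoU over flat 0/1 masks, skipping pixels where gt == ignore_value."""
    inter = 0
    union = 0
    for p, g in zip(pred, gt):
        if g == ignore_value:
            continue  # ambiguous pixel: counts toward neither term
        if p == 1 and g == 1:
            inter += 1
        if p == 1 or g == 1:
            union += 1
    # Returning 0.0 for an empty union is a choice made for this sketch.
    return inter / union if union else 0.0
```

For example, `iou_with_ignore([1, 1, 0, 1], [1, 0, 0, 255])` skips the last pixel and yields an intersection of 1 over a union of 2.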
+
+ In addition, we used GPT-3.5 to rephrase the instructions, so images in the training set may have **more than one instruction (but fewer than six)** in the "text" field. Users can randomly select one instruction as the text query to obtain a better model.
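The random selection could look like the sketch below. It assumes that the "text" field holds either a single string or a list of strings; the function name `pick_instruction` is hypothetical and not part of this repository.

```python
import random


def pick_instruction(text, rng=random):
    """Return one instruction from the "text" field.

    Assumption: "text" is either a single string or a list of strings.
    """
    if isinstance(text, str):
        return text
    return rng.choice(list(text))
```

Calling this once per training sample, for instance inside a dataset's item loader, exposes the model to a different phrasing each time an image is revisited. Passing a seeded `random.Random` instance as `rng` makes the choice reproducible.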
+
 
  ## Citation
  If you find this project useful in your research, please consider citing: