carohiguera
commited on
Commit
β’
d7beed5
1
Parent(s):
d5cffc6
added force field decoder for DIGIT
Browse files- README.md +47 -0
- digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0014.pth +3 -0
- digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0021.pth +3 -0
- digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0031.pth +3 -0
- digit_t1_forcefield_dino_vitbase_bg/checkpoints/last.ckpt +3 -0
- digit_t1_forcefield_dino_vitbase_bg/config.yaml +218 -0
- digit_t1_forcefield_dino_vitbase_bg/config_tree.log +237 -0
README.md
ADDED
@@ -0,0 +1,47 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: cc-by-nc-4.0
|
3 |
+
tags:
|
4 |
+
- sparsh
|
5 |
+
- force field
|
6 |
+
- digit
|
7 |
+
---
|
8 |
+
|
9 |
+
# Sparsh (DINO) + force field decoder for DIGIT sensor
|
10 |
+
|
11 |
+
We decode the touch representations from Sparsh into normal and shear force fields. This allows us to gather an intuition about what the representations capture in terms of forces in a way that is interpretable for humans.
|
12 |
+
|
13 |
+
## How to Use
|
14 |
+
For testing Sparsh(DINO) + force field decoder live, you only need a DIGIT sensor. Follow these steps to run the demo:
|
15 |
+
|
16 |
+
1. Clone the [sparsh repo](https://github.com/facebookresearch/sparsh.git)
|
17 |
+
|
18 |
+
2. Create a folder for downloading the task checkpoints. For example, `${YOUR_PATH}/outputs_sparsh/checkpoints`.
|
19 |
+
3. Download the Sparsh (DINO) base [checkpoint](https://huggingface.co/facebook/sparsh-dino-base)
|
20 |
+
4. Download the decoder checkpoints from this repo.
|
21 |
+
5. Connect the sensor to your PC. In case of DIGIT, please make sure you have [digit-interface](https://github.com/facebookresearch/digit-interface) installed.
|
22 |
+
6. Make sure the device is recognized by the OS (you can use Cheese in Linux to see the video that the sensor is streaming).
|
23 |
+
|
24 |
+
7. Running the demo for DIGIT (please refer to the Sparsh repo README for more information about how to setup the path configs):
|
25 |
+
|
26 |
+
```bash
|
27 |
+
python demo_forcefield.py +experiment=digit/downstream_task/forcefield/digit_dino paths=${YOUR_PATH_CONFIG} paths.output_dir=${YOUR_PATH}/outputs_sparsh/checkpoints/ test.demo.digit_serial=${YOUR_DIGIT_SERIAL}`
|
28 |
+
```
|
29 |
+
The DIGIT serial number is printed on the back of the sensor and has the format `DXXXXX`.
|
30 |
+
|
31 |
+
8. Take the sensor and slide it across the edge of a table, or across objects with interesting textures! Look at the normal field to localize where you're making contact on the sensor's surface. Look at the shear field to gather an intuition about the direction of the shear force that you applied while sliding the sensor. For example, slide the sensor over an edge up and down to get translational shear or rotate the sensor in place to see torsional slip!
|
32 |
+
|
33 |
+
<!-- <p align="center">
|
34 |
+
<img src="assets/demo_digit.gif" alt="animated" />
|
35 |
+
</p> -->
|
36 |
+
|
37 |
+
### BibTeX entry and citation info
|
38 |
+
```bibtex
|
39 |
+
@inproceedings{
|
40 |
+
higuera2024sparsh,
|
41 |
+
title={Sparsh: Self-supervised touch representations for vision-based tactile sensing},
|
42 |
+
author={Carolina Higuera and Akash Sharma and Chaithanya Krishna Bodduluri and Taosha Fan and Patrick Lancaster and Mrinal Kalakrishnan and Michael Kaess and Byron Boots and Mike Lambeta and Tingfan Wu and Mustafa Mukadam},
|
43 |
+
booktitle={8th Annual Conference on Robot Learning},
|
44 |
+
year={2024},
|
45 |
+
url={https://openreview.net/forum?id=xYJn2e1uu8}
|
46 |
+
}
|
47 |
+
```
|
digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0014.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:de8da71138525cbf12383bf1b25c146cdba93626199a2be044f671f782fc4a10
|
3 |
+
size 14695019
|
digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0021.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:182ceba7f823d334c036df82900bc55d660820777530a2e611124d51d94246eb
|
3 |
+
size 14695019
|
digit_t1_forcefield_dino_vitbase_bg/checkpoints/epoch-0031.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2e41f3194445ea14d21d1ce0fc8ad86a7bdccdbcc64529e4fad1d774f50ad878
|
3 |
+
size 14695019
|
digit_t1_forcefield_dino_vitbase_bg/checkpoints/last.ckpt
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1d10d727b6e711615c265366fb532b540a1ed3f9fd0544ea5d46bc47cc70f888
|
3 |
+
size 541797495
|
digit_t1_forcefield_dino_vitbase_bg/config.yaml
ADDED
@@ -0,0 +1,218 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
paths:
|
2 |
+
data_root: /media/chiguera/GUM/
|
3 |
+
encoder_checkpoint_root: /media/chiguera/2TB/sparsh/encoders_460k/
|
4 |
+
log_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/${experiment_name}/logs/
|
5 |
+
output_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/${experiment_name}/
|
6 |
+
work_dir: ${hydra:runtime.cwd}
|
7 |
+
wandb:
|
8 |
+
project: ${task_name}_${sensor}
|
9 |
+
entity: chiguera
|
10 |
+
save_dir: ${paths.output_dir}
|
11 |
+
id: ${hydra:job.id}_${experiment_name}
|
12 |
+
tags:
|
13 |
+
- ${ssl_name}
|
14 |
+
group: null
|
15 |
+
notes: null
|
16 |
+
data:
|
17 |
+
train_val_split: null
|
18 |
+
train_data_budget: ${train_data_budget}
|
19 |
+
val_data_budget: ${val_data_budget}
|
20 |
+
max_train_data: null
|
21 |
+
sensor: digit
|
22 |
+
dataset:
|
23 |
+
_target_: tactile_ssl.data.vision_tactile_forcefield.VisionTactileBackboneDataset
|
24 |
+
config:
|
25 |
+
sensor: ${data.sensor}
|
26 |
+
remove_bg: true
|
27 |
+
out_format: concat_ch_img
|
28 |
+
num_frames: 2
|
29 |
+
frame_stride: 5
|
30 |
+
path_dataset: ${paths.data_root}/datasets/digitv1/Object-Slide/
|
31 |
+
path_bgs: ${paths.data_root}/datasets/digitv1/Object-Slide/bgs/
|
32 |
+
list_datasets:
|
33 |
+
- 004_sugar_box/dataset_0
|
34 |
+
- 025_mug/dataset_0
|
35 |
+
- 048_hammer/dataset_0
|
36 |
+
- 005_tomato_soup_can/dataset_0
|
37 |
+
- 035_power_drill/dataset_0
|
38 |
+
- 055_baseball/dataset_0
|
39 |
+
- 006_mustard_bottle/dataset_0
|
40 |
+
- 037_scissors/dataset_0
|
41 |
+
- 021_bleach_cleanser/dataset_0
|
42 |
+
- 042_adjustable_wrench/dataset_0
|
43 |
+
- 004_sugar_box/dataset_1
|
44 |
+
- 025_mug/dataset_1
|
45 |
+
- 048_hammer/dataset_1
|
46 |
+
- 005_tomato_soup_can/dataset_1
|
47 |
+
- 035_power_drill/dataset_1
|
48 |
+
- 055_baseball/dataset_1
|
49 |
+
- 006_mustard_bottle/dataset_1
|
50 |
+
- 037_scissors/dataset_1
|
51 |
+
- 021_bleach_cleanser/dataset_1
|
52 |
+
- 042_adjustable_wrench/dataset_1
|
53 |
+
- 004_sugar_box/dataset_2
|
54 |
+
- 025_mug/dataset_2
|
55 |
+
- 048_hammer/dataset_2
|
56 |
+
- 005_tomato_soup_can/dataset_2
|
57 |
+
- 035_power_drill/dataset_2
|
58 |
+
- 055_baseball/dataset_2
|
59 |
+
- 006_mustard_bottle/dataset_2
|
60 |
+
- 037_scissors/dataset_2
|
61 |
+
- 021_bleach_cleanser/dataset_2
|
62 |
+
- 042_adjustable_wrench/dataset_2
|
63 |
+
- 004_sugar_box/dataset_3
|
64 |
+
- 025_mug/dataset_3
|
65 |
+
- 048_hammer/dataset_3
|
66 |
+
- 005_tomato_soup_can/dataset_3
|
67 |
+
- 035_power_drill/dataset_3
|
68 |
+
- 055_baseball/dataset_3
|
69 |
+
- 006_mustard_bottle/dataset_3
|
70 |
+
- 037_scissors/dataset_3
|
71 |
+
- 021_bleach_cleanser/dataset_3
|
72 |
+
- 042_adjustable_wrench/dataset_3
|
73 |
+
- banana/dataset_0
|
74 |
+
- banana/dataset_1
|
75 |
+
- banana/dataset_2
|
76 |
+
- banana/dataset_3
|
77 |
+
- bread/dataset_0
|
78 |
+
- bread/dataset_1
|
79 |
+
- bread/dataset_2
|
80 |
+
- bread/dataset_3
|
81 |
+
- cheese/dataset_0
|
82 |
+
- cheese/dataset_1
|
83 |
+
- cheese/dataset_2
|
84 |
+
- cheese/dataset_3
|
85 |
+
- cookie/dataset_0
|
86 |
+
- cookie/dataset_1
|
87 |
+
- cookie/dataset_2
|
88 |
+
- cookie/dataset_3
|
89 |
+
- corn/dataset_0
|
90 |
+
- corn/dataset_1
|
91 |
+
- corn/dataset_2
|
92 |
+
- corn/dataset_3
|
93 |
+
- lettuce/dataset_0
|
94 |
+
- lettuce/dataset_1
|
95 |
+
- lettuce/dataset_2
|
96 |
+
- lettuce/dataset_3
|
97 |
+
- plum/dataset_0
|
98 |
+
- plum/dataset_1
|
99 |
+
- plum/dataset_2
|
100 |
+
- plum/dataset_3
|
101 |
+
- strawberry/dataset_0
|
102 |
+
- strawberry/dataset_1
|
103 |
+
- strawberry/dataset_2
|
104 |
+
- strawberry/dataset_3
|
105 |
+
- tomato/dataset_0
|
106 |
+
- tomato/dataset_1
|
107 |
+
- tomato/dataset_2
|
108 |
+
- tomato/dataset_3
|
109 |
+
- sphere/dataset_0
|
110 |
+
- sphere/dataset_1
|
111 |
+
- sphere/dataset_2
|
112 |
+
list_datasets_test:
|
113 |
+
- 004_sugar_box/dataset_4
|
114 |
+
- 025_mug/dataset_4
|
115 |
+
- 048_hammer/dataset_4
|
116 |
+
- 005_tomato_soup_can/dataset_4
|
117 |
+
- 035_power_drill/dataset_4
|
118 |
+
- 055_baseball/dataset_4
|
119 |
+
- 006_mustard_bottle/dataset_4
|
120 |
+
- 037_scissors/dataset_4
|
121 |
+
- 021_bleach_cleanser/dataset_4
|
122 |
+
- 042_adjustable_wrench/dataset_4
|
123 |
+
- sphere/dataset_1
|
124 |
+
- sphere/dataset_2
|
125 |
+
look_in_folder: false
|
126 |
+
transforms:
|
127 |
+
with_augmentation: false
|
128 |
+
resize:
|
129 |
+
- 224
|
130 |
+
- 224
|
131 |
+
p_flip: 0.0
|
132 |
+
p_crop: 0.0
|
133 |
+
p_rot: 0.0
|
134 |
+
train_dataloader:
|
135 |
+
batch_size: 20
|
136 |
+
num_workers: 2
|
137 |
+
drop_last: true
|
138 |
+
pin_memory: true
|
139 |
+
persistent_workers: true
|
140 |
+
shuffle: true
|
141 |
+
val_dataloader:
|
142 |
+
batch_size: 20
|
143 |
+
num_workers: 2
|
144 |
+
drop_last: true
|
145 |
+
pin_memory: true
|
146 |
+
persistent_workers: true
|
147 |
+
task:
|
148 |
+
_target_: tactile_ssl.downstream_task.ForceFieldModuleSL
|
149 |
+
model_encoder:
|
150 |
+
_target_: tactile_ssl.model.vit_${ssl_model_size}
|
151 |
+
img_size:
|
152 |
+
- 224
|
153 |
+
- 224
|
154 |
+
in_chans: 6
|
155 |
+
pos_embed_fn: sinusoidal
|
156 |
+
num_register_tokens: 1
|
157 |
+
model_task:
|
158 |
+
_target_: tactile_ssl.downstream_task.ForceFieldDecoderSL
|
159 |
+
embed_dim: ${ssl_model_size}
|
160 |
+
checkpoint_encoder: ${paths.encoder_checkpoint_root}/${ssl_name}_vit${ssl_model_size}.ckpt
|
161 |
+
checkpoint_task: null
|
162 |
+
train_encoder: false
|
163 |
+
optim_cfg:
|
164 |
+
_partial_: true
|
165 |
+
_target_: torch.optim.Adam
|
166 |
+
lr: 0.0001
|
167 |
+
scheduler_cfg: null
|
168 |
+
ssl_config:
|
169 |
+
img_sz:
|
170 |
+
- 224
|
171 |
+
- 224
|
172 |
+
pose_estimator:
|
173 |
+
num_encoder_layers: 18
|
174 |
+
loss:
|
175 |
+
with_mask_supervision: false
|
176 |
+
with_sl_supervision: false
|
177 |
+
with_ssim: true
|
178 |
+
disparity_smoothness: 0.001
|
179 |
+
min_depth: 0.1
|
180 |
+
max_depth: 100.0
|
181 |
+
encoder_type: ${ssl_name}
|
182 |
+
ssl_name: dino
|
183 |
+
sensor: digit
|
184 |
+
ckpt_path: null
|
185 |
+
task_name: t1_forcefield
|
186 |
+
ssl_model_size: base
|
187 |
+
train_data_budget: 1.0
|
188 |
+
val_data_budget: 1.0
|
189 |
+
experiment_name: ${sensor}_${task_name}_${ssl_name}_vit${ssl_model_size}_t0
|
190 |
+
seed: 42
|
191 |
+
data_out_format: concat_ch_img
|
192 |
+
num_frames: 2
|
193 |
+
frame_stride: 5
|
194 |
+
trainer:
|
195 |
+
max_epochs: 31
|
196 |
+
validation_frequency: 2
|
197 |
+
sanity_validate: false
|
198 |
+
save_checkpoint_dir: ${paths.output_dir}/checkpoints
|
199 |
+
checkpoint_interval_type: log
|
200 |
+
max_task_checkpoints: 10
|
201 |
+
save_probe_weights_only: true
|
202 |
+
limit_train_batches: 500
|
203 |
+
limit_val_batches: 150
|
204 |
+
use_distributed_sampler: false
|
205 |
+
devices:
|
206 |
+
- 0
|
207 |
+
test:
|
208 |
+
data:
|
209 |
+
dataset_name:
|
210 |
+
- 005_tomato_soup_can/dataset_0
|
211 |
+
batch_size: 1
|
212 |
+
tester:
|
213 |
+
_partial_: true
|
214 |
+
_target_: tactile_ssl.test.TestForceField
|
215 |
+
demo:
|
216 |
+
_partial_: true
|
217 |
+
_target_: tactile_ssl.test.DemoForceField
|
218 |
+
path_outputs: null
|
digit_t1_forcefield_dino_vitbase_bg/config_tree.log
ADDED
@@ -0,0 +1,237 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
CONFIG
|
2 |
+
βββ data
|
3 |
+
β βββ train_val_split: null
|
4 |
+
β train_data_budget: 1.0
|
5 |
+
β val_data_budget: 1.0
|
6 |
+
β max_train_data: null
|
7 |
+
β sensor: digit
|
8 |
+
β dataset:
|
9 |
+
β _target_: tactile_ssl.data.vision_tactile_forcefield.VisionTactileBackboneDataset
|
10 |
+
β config:
|
11 |
+
β sensor: digit
|
12 |
+
β remove_bg: true
|
13 |
+
β out_format: concat_ch_img
|
14 |
+
β num_frames: 2
|
15 |
+
β frame_stride: 5
|
16 |
+
β path_dataset: /media/chiguera/GUM//datasets/digitv1/Object-Slide/
|
17 |
+
β path_bgs: /media/chiguera/GUM//datasets/digitv1/Object-Slide/bgs/
|
18 |
+
β list_datasets:
|
19 |
+
β - 004_sugar_box/dataset_0
|
20 |
+
β - 025_mug/dataset_0
|
21 |
+
β - 048_hammer/dataset_0
|
22 |
+
β - 005_tomato_soup_can/dataset_0
|
23 |
+
β - 035_power_drill/dataset_0
|
24 |
+
β - 055_baseball/dataset_0
|
25 |
+
β - 006_mustard_bottle/dataset_0
|
26 |
+
β - 037_scissors/dataset_0
|
27 |
+
β - 021_bleach_cleanser/dataset_0
|
28 |
+
β - 042_adjustable_wrench/dataset_0
|
29 |
+
β - 004_sugar_box/dataset_1
|
30 |
+
β - 025_mug/dataset_1
|
31 |
+
β - 048_hammer/dataset_1
|
32 |
+
β - 005_tomato_soup_can/dataset_1
|
33 |
+
β - 035_power_drill/dataset_1
|
34 |
+
β - 055_baseball/dataset_1
|
35 |
+
β - 006_mustard_bottle/dataset_1
|
36 |
+
β - 037_scissors/dataset_1
|
37 |
+
β - 021_bleach_cleanser/dataset_1
|
38 |
+
β - 042_adjustable_wrench/dataset_1
|
39 |
+
β - 004_sugar_box/dataset_2
|
40 |
+
β - 025_mug/dataset_2
|
41 |
+
β - 048_hammer/dataset_2
|
42 |
+
β - 005_tomato_soup_can/dataset_2
|
43 |
+
β - 035_power_drill/dataset_2
|
44 |
+
β - 055_baseball/dataset_2
|
45 |
+
β - 006_mustard_bottle/dataset_2
|
46 |
+
β - 037_scissors/dataset_2
|
47 |
+
β - 021_bleach_cleanser/dataset_2
|
48 |
+
β - 042_adjustable_wrench/dataset_2
|
49 |
+
β - 004_sugar_box/dataset_3
|
50 |
+
β - 025_mug/dataset_3
|
51 |
+
β - 048_hammer/dataset_3
|
52 |
+
β - 005_tomato_soup_can/dataset_3
|
53 |
+
β - 035_power_drill/dataset_3
|
54 |
+
β - 055_baseball/dataset_3
|
55 |
+
β - 006_mustard_bottle/dataset_3
|
56 |
+
β - 037_scissors/dataset_3
|
57 |
+
β - 021_bleach_cleanser/dataset_3
|
58 |
+
β - 042_adjustable_wrench/dataset_3
|
59 |
+
β - banana/dataset_0
|
60 |
+
β - banana/dataset_1
|
61 |
+
β - banana/dataset_2
|
62 |
+
β - banana/dataset_3
|
63 |
+
β - bread/dataset_0
|
64 |
+
β - bread/dataset_1
|
65 |
+
β - bread/dataset_2
|
66 |
+
β - bread/dataset_3
|
67 |
+
β - cheese/dataset_0
|
68 |
+
β - cheese/dataset_1
|
69 |
+
β - cheese/dataset_2
|
70 |
+
β - cheese/dataset_3
|
71 |
+
β - cookie/dataset_0
|
72 |
+
β - cookie/dataset_1
|
73 |
+
β - cookie/dataset_2
|
74 |
+
β - cookie/dataset_3
|
75 |
+
β - corn/dataset_0
|
76 |
+
β - corn/dataset_1
|
77 |
+
β - corn/dataset_2
|
78 |
+
β - corn/dataset_3
|
79 |
+
β - lettuce/dataset_0
|
80 |
+
β - lettuce/dataset_1
|
81 |
+
β - lettuce/dataset_2
|
82 |
+
β - lettuce/dataset_3
|
83 |
+
β - plum/dataset_0
|
84 |
+
β - plum/dataset_1
|
85 |
+
β - plum/dataset_2
|
86 |
+
β - plum/dataset_3
|
87 |
+
β - strawberry/dataset_0
|
88 |
+
β - strawberry/dataset_1
|
89 |
+
β - strawberry/dataset_2
|
90 |
+
β - strawberry/dataset_3
|
91 |
+
β - tomato/dataset_0
|
92 |
+
β - tomato/dataset_1
|
93 |
+
β - tomato/dataset_2
|
94 |
+
β - tomato/dataset_3
|
95 |
+
β - sphere/dataset_0
|
96 |
+
β - sphere/dataset_1
|
97 |
+
β - sphere/dataset_2
|
98 |
+
β list_datasets_test:
|
99 |
+
β - 004_sugar_box/dataset_4
|
100 |
+
β - 025_mug/dataset_4
|
101 |
+
β - 048_hammer/dataset_4
|
102 |
+
β - 005_tomato_soup_can/dataset_4
|
103 |
+
β - 035_power_drill/dataset_4
|
104 |
+
β - 055_baseball/dataset_4
|
105 |
+
β - 006_mustard_bottle/dataset_4
|
106 |
+
β - 037_scissors/dataset_4
|
107 |
+
β - 021_bleach_cleanser/dataset_4
|
108 |
+
β - 042_adjustable_wrench/dataset_4
|
109 |
+
β - sphere/dataset_1
|
110 |
+
β - sphere/dataset_2
|
111 |
+
β look_in_folder: false
|
112 |
+
β transforms:
|
113 |
+
β with_augmentation: false
|
114 |
+
β resize:
|
115 |
+
β - 224
|
116 |
+
β - 224
|
117 |
+
β p_flip: 0.0
|
118 |
+
β p_crop: 0.0
|
119 |
+
β p_rot: 0.0
|
120 |
+
β train_dataloader:
|
121 |
+
β batch_size: 20
|
122 |
+
β num_workers: 2
|
123 |
+
β drop_last: true
|
124 |
+
β pin_memory: true
|
125 |
+
β persistent_workers: true
|
126 |
+
β shuffle: true
|
127 |
+
β val_dataloader:
|
128 |
+
β batch_size: 20
|
129 |
+
β num_workers: 2
|
130 |
+
β drop_last: true
|
131 |
+
β pin_memory: true
|
132 |
+
β persistent_workers: true
|
133 |
+
β
|
134 |
+
βββ trainer
|
135 |
+
β βββ max_epochs: 31
|
136 |
+
β validation_frequency: 2
|
137 |
+
β sanity_validate: false
|
138 |
+
β save_checkpoint_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/digit_t1_forcefield_dino_vitbase
|
139 |
+
β checkpoint_interval_type: log
|
140 |
+
β max_task_checkpoints: 10
|
141 |
+
β save_probe_weights_only: true
|
142 |
+
β limit_train_batches: 500
|
143 |
+
β limit_val_batches: 150
|
144 |
+
β use_distributed_sampler: false
|
145 |
+
β devices:
|
146 |
+
β - 0
|
147 |
+
β
|
148 |
+
βββ paths
|
149 |
+
β βββ data_root: /media/chiguera/GUM/
|
150 |
+
β encoder_checkpoint_root: /media/chiguera/2TB/sparsh/encoders_460k/
|
151 |
+
β log_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/digit_t1_forcefield_dino_vitbase_t0/logs/
|
152 |
+
β output_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/digit_t1_forcefield_dino_vitbase_t0/
|
153 |
+
β work_dir: /media/chiguera/2TB/sparsh/tactile-ssl
|
154 |
+
β
|
155 |
+
βββ wandb
|
156 |
+
β βββ project: t1_forcefield_digit
|
157 |
+
β entity: chiguera
|
158 |
+
β save_dir: /media/chiguera/GUM/tactile_ssl/outputs_sparsh/digit_t1_forcefield_dino_vitbase_t0/
|
159 |
+
β id: 2024.09.28_17-57_digit_t1_forcefield_dino_vitbase_t0
|
160 |
+
β tags:
|
161 |
+
β - dino
|
162 |
+
β group: null
|
163 |
+
β notes: null
|
164 |
+
β
|
165 |
+
βββ task
|
166 |
+
β βββ _target_: tactile_ssl.downstream_task.ForceFieldModuleSL
|
167 |
+
β model_encoder:
|
168 |
+
β _target_: tactile_ssl.model.vit_base
|
169 |
+
β img_size:
|
170 |
+
β - 224
|
171 |
+
β - 224
|
172 |
+
β in_chans: 6
|
173 |
+
β pos_embed_fn: sinusoidal
|
174 |
+
β num_register_tokens: 1
|
175 |
+
β model_task:
|
176 |
+
β _target_: tactile_ssl.downstream_task.ForceFieldDecoderSL
|
177 |
+
β embed_dim: base
|
178 |
+
β checkpoint_encoder: /media/chiguera/2TB/sparsh/encoders_460k//dino_vitbase.ckpt
|
179 |
+
β checkpoint_task: null
|
180 |
+
β train_encoder: false
|
181 |
+
β optim_cfg:
|
182 |
+
β _partial_: true
|
183 |
+
β _target_: torch.optim.Adam
|
184 |
+
β lr: 0.0001
|
185 |
+
β scheduler_cfg: null
|
186 |
+
β ssl_config:
|
187 |
+
β img_sz:
|
188 |
+
β - 224
|
189 |
+
β - 224
|
190 |
+
β pose_estimator:
|
191 |
+
β num_encoder_layers: 18
|
192 |
+
β loss:
|
193 |
+
β with_mask_supervision: false
|
194 |
+
β with_sl_supervision: false
|
195 |
+
β with_ssim: true
|
196 |
+
β disparity_smoothness: 0.001
|
197 |
+
β min_depth: 0.1
|
198 |
+
β max_depth: 100.0
|
199 |
+
β encoder_type: dino
|
200 |
+
β
|
201 |
+
βββ ssl_name
|
202 |
+
β βββ dino
|
203 |
+
βββ sensor
|
204 |
+
β βββ digit
|
205 |
+
βββ ckpt_path
|
206 |
+
β βββ None
|
207 |
+
βββ task_name
|
208 |
+
β βββ t1_forcefield
|
209 |
+
βββ ssl_model_size
|
210 |
+
β βββ base
|
211 |
+
βββ train_data_budget
|
212 |
+
β βββ 1.0
|
213 |
+
βββ val_data_budget
|
214 |
+
β βββ 1.0
|
215 |
+
βββ experiment_name
|
216 |
+
β βββ digit_t1_forcefield_dino_vitbase_t0
|
217 |
+
βββ seed
|
218 |
+
β βββ 42
|
219 |
+
βββ data_out_format
|
220 |
+
β βββ concat_ch_img
|
221 |
+
βββ num_frames
|
222 |
+
β βββ 2
|
223 |
+
βββ frame_stride
|
224 |
+
β βββ 5
|
225 |
+
βββ test
|
226 |
+
βββ data:
|
227 |
+
dataset_name:
|
228 |
+
- 005_tomato_soup_can/dataset_0
|
229 |
+
batch_size: 1
|
230 |
+
tester:
|
231 |
+
_partial_: true
|
232 |
+
_target_: tactile_ssl.test.TestForceField
|
233 |
+
demo:
|
234 |
+
_partial_: true
|
235 |
+
_target_: tactile_ssl.test.DemoForceField
|
236 |
+
path_outputs: null
|
237 |
+
|