Training details

by CaptainZZZ - opened Nov 13

Nov 13

•

Hi author, thanks for the amazing work! I have some questions about train details.
For 33 channels transformer, do we need train a transformer from scratch or fully fine-tune (all parameters from transformer) the SD3's transformer?
Also, I am curious about the size of the train dataset and the train batch size and learning rate.
Thanks!

George0667

Owner Nov 25

I just directly fine-tuned the sd3 medium. The hyper-parameters settings: image size :768; learning rate: 1e-4; batch size: 4 per GPU; 4 GPUs. I only trained for 18000 steps for GPU shortage.

CaptainZZZ

Nov 26

Thanks so much for the reply!
I have another queation, does inpainting need large scale dataset? May I ask your dataset size?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment