Question about fine-tuning recipe?

by phanhoang - opened Oct 9, 2024

Oct 9, 2024

Hi, thanks for your great model!

I have a question about the fine-tuning process.
What values did you use for the two parameters, lora_rank and lora_alpha, when fine-tuning this model?
And did you set freeze_vision_tower = false during fine-tuning?

phanhoang changed discussion status to closed Oct 9, 2024

phanhoang changed discussion status to open Oct 9, 2024

thusinh1969

EraX JS Company org Oct 9, 2024

edited Oct 9, 2024

Setting freeze_vision_tower = false is a bold choice. It takes some trial and error.

With a large dataset (around 2-3 million samples), you should unfreeze the vision tower; it gives better quality.

As for LoRA rank, 128 should be a good value; it improves reasoning.

Cheers,
Nguyên
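
For readers who want to see how these settings relate, here is a minimal sketch in plain Python. It only illustrates the standard LoRA arithmetic (the low-rank update is scaled by alpha / rank) and the rule of thumb from the reply above. The `LoraSketch` class, the `should_unfreeze_vision` helper, the alpha value of 256, and the 4096-wide projection are all hypothetical placeholders; the thread does not state the lora_alpha that was actually used.

```python
from dataclasses import dataclass


@dataclass
class LoraSketch:
    """Hypothetical container for the settings discussed in this thread."""

    rank: int                         # 128 is the value suggested in the reply
    alpha: int                        # NOT given in the thread; placeholder only
    freeze_vision_tower: bool = True  # assumption: frozen unless data is large

    @property
    def scaling(self) -> float:
        # Standard LoRA scales the low-rank update B @ A by alpha / rank.
        return self.alpha / self.rank

    def adapter_params(self, d_in: int, d_out: int) -> int:
        # A has shape (rank, d_in) and B has shape (d_out, rank),
        # so one adapted matrix adds rank * (d_in + d_out) trainable weights.
        return self.rank * (d_in + d_out)


def should_unfreeze_vision(num_samples: int, threshold: int = 2_000_000) -> bool:
    # Rule of thumb from the reply: consider unfreezing the vision tower
    # only for datasets in the 2-3 million range. The exact threshold
    # is an assumption made for this sketch.
    return num_samples >= threshold


cfg = LoraSketch(rank=128, alpha=256)
cfg.freeze_vision_tower = not should_unfreeze_vision(50_000)

print(cfg.scaling)                     # 2.0
print(cfg.adapter_params(4096, 4096))  # 1048576
print(cfg.freeze_vision_tower)         # True
```

A higher rank adds trainable weights linearly, so rank 128 costs twice as many adapter parameters per matrix as rank 64. Whether that pays off depends on the dataset, which is why the reply recommends trial and error.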

phanhoang

Oct 10, 2024

@thusinh1969 thank you for your response!

erax

EraX JS Company org Dec 29, 2024

The 2B V1.5 version is available now, and it is better.
