About QLoRA Implementation
Hi, thanks for sharing the great work! I was wondering which QLoRA implementation you use for fine-tuning. Do you use the official QLoRA code (https://github.com/artidoro/qlora/tree/main), or did you implement it yourself? Also, could you describe the hardware setup and the corresponding training speed?
Nope, I use https://github.com/hiyouga/LLaMA-Efficient-Tuning.
I use DeepSpeed ZeRO-2 + FlashAttention-2 + 4-bit QLoRA for training on 8× A100 (80GB) GPUs, which allows a batch size of around 16.
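For reference, here's a minimal sketch of what a 4-bit QLoRA model setup looks like with Hugging Face transformers + peft + bitsandbytes (the stack LLaMA-Efficient-Tuning is built on). The base model name and LoRA hyperparameters below are placeholders, not our exact configuration; DeepSpeed ZeRO-2 and FlashAttention-2 would be layered on top via the framework's launcher:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization with double quantization and bf16 compute,
# the standard QLoRA recipe from the original paper.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_name = "meta-llama/Llama-2-13b-hf"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map="auto",
)

# LoRA adapters on the attention projections; rank/alpha/dropout here
# are illustrative values, not our actual hyperparameters.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```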
As for training speed, it takes roughly 5-6 hours, though I'm not entirely sure of the exact figure.
By the way, our team plans to upload a better model, and details about the dataset mix-up strategy, hardware setup, training settings, and steps will also be provided, but it may take several days.
Thanks for your prompt reply. I'm looking forward to your future work!