30B 4-bit LoRA please
#1 opened by Gumibit
Hi, it would be great if we could have the 4-bit flavor of this model.
Thank you.
Hey, I'll work on it.
Currently I am benchmarking the models. The ones trained without LoRA perform much better, so maybe load them in 8-bit instead? medalpaca-7b quantized for inference should outperform the 30b model.
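If it helps, here is a minimal sketch of 8-bit loading with transformers and bitsandbytes; the model ID `medalpaca/medalpaca-7b` and the prompt are illustrative assumptions, not part of the thread:

```python
# Minimal sketch: load medalpaca-7b in 8-bit for inference.
# Assumes `transformers`, `accelerate`, and `bitsandbytes` are installed;
# the model ID and prompt below are placeholders for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "medalpaca/medalpaca-7b"  # assumed Hub ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    load_in_8bit=True,   # bitsandbytes int8 quantization
    device_map="auto",   # place layers on available GPUs automatically
)

prompt = "What are the symptoms of diabetes?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```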
In the meantime, please raise an issue on GitHub so I won't forget to do it.
Thank you, I will try your suggestions.
Gumibit changed discussion status to closed