Needs the projector file in GGUF as well...
#1 by igzbar - opened
I think for the model to be complete and runnable via the llava-cli tool of llama.cpp, you also need to convert the projector .safetensors file to GGUF (it's small, so Q8_0 would be totally usable).
I tried but the ProjectorModel architecture did not work properly for me. :/
If someone else successfully does it then feel free to share.
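For anyone who wants to attempt the conversion, a rough sketch of the usual llama.cpp workflow is below. This is an assumption-heavy example: script names, paths, and flags have changed across llama.cpp versions (older releases used hyphenated names like `convert-image-encoder-to-gguf.py`), and the paths to the model are placeholders you'd replace with your own.

```shell
# Sketch of converting a LLaVA-style projector to GGUF with llama.cpp's
# llava example scripts. Paths and flag names are assumptions; check the
# examples/llava/README.md in your llama.cpp checkout for the exact ones.

# 1. Split the projector weights out of the original model checkpoint.
python examples/llava/llava_surgery.py -m /path/to/original-model

# 2. Convert the extracted image encoder + projector to a GGUF mmproj file.
python examples/llava/convert_image_encoder_to_gguf.py \
    -m /path/to/clip-model \
    --llava-projector /path/to/original-model/llava.projector \
    --output-dir /path/to/out

# 3. Run the quantized main model together with the mmproj file.
./llava-cli -m /path/to/model-Q5_K_M.gguf \
    --mmproj /path/to/out/mmproj-model-f16.gguf \
    --image /path/to/test.png \
    -p "Describe this image."
```

If step 2 fails on the projector architecture (as reported above), comparing the tensor names in the .safetensors file against what the conversion script expects is usually the first thing to check.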
I've tried using an mmproj file from another repo, but it doesn't work (with LM Studio); it keeps saying the model is not capable of reading images.
I got it running in LM Studio using the vision adapter from koboldcpp for Llama3-8b.
My main model file is the Q5_K_M version.
But performance isn't great, probably because of mismatched quantization levels between the main model and the adapter (?).