GGUF
conversational

b16 is very slow. Is this model correct?

#1
by gopi87 - opened

Hi, I ran the b16 file but it's very slow. Please check it.

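Maybe check whether your GPU memory is actually being used. If the model does not fit in VRAM, it will run partly or fully on the CPU, which is much slower.
I also recommend running Q5_K_M instead, as most people online suggest.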
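
To make that check concrete, here is a rough sketch in Python. It is not specific to this model: the 7e9 parameter count is a hypothetical placeholder, and the 5.5 bits per weight used for Q5_K_M is an approximation (the real figure varies by model). It estimates the size of the weights alone, ignoring the KV cache and other overhead, and reads current VRAM usage through `nvidia-smi`, assuming an NVIDIA GPU with that tool on the PATH.

```python
import shutil
import subprocess


def weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GiB (no KV cache, no overhead)."""
    return n_params * bits_per_weight / 8 / 2**30


def parse_gpu_memory(csv_text: str) -> list[tuple[int, int]]:
    """Parse 'used, total' MiB rows as printed by nvidia-smi in CSV mode."""
    rows = []
    for line in csv_text.strip().splitlines():
        try:
            used, total = (int(field) for field in line.split(","))
        except ValueError:
            continue  # skip rows that are not two integers (e.g. "[N/A]")
        rows.append((used, total))
    return rows


def query_gpu_memory() -> list[tuple[int, int]]:
    """Return (used MiB, total MiB) per GPU, or [] if nvidia-smi is unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.used,memory.total",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_memory(result.stdout)


if __name__ == "__main__":
    n_params = 7e9  # hypothetical parameter count; replace with your model's
    print(f"16-bit weights: ~{weight_gib(n_params, 16):.1f} GiB")
    print(f"Q5_K_M weights: ~{weight_gib(n_params, 5.5):.1f} GiB (approximate)")
    for index, (used, total) in enumerate(query_gpu_memory()):
        print(f"GPU {index}: {used} MiB used of {total} MiB")
```

Run it once before loading the model and once while it is generating. If the used memory barely changes, the model is probably running on the CPU. If the 16-bit estimate is larger than your total VRAM, a smaller quantization such as Q5_K_M is the more realistic choice.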
