llama.cpp

by FrescoHF - opened 7 days ago

7 days ago

llama.cpp version b4970 has a performance improvement. How about updating llama.cpp more often? There are important changes coming out there all the time, especially lately

bartowski

Owner 7 days ago

the version i use to make usually has no relevance to performance, you gain the uplift by updating your own engine

I just include that information for reference

The only time it matters is if there are optimizations for how a quant is made, not how it's run

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment