Can you give an example of how to use the GGUF version, or does any inference GUI support it now?

#4
by DrNicefellow - opened
VIDraft org

As the title says.

For example:

  1. Open https://huggingface.co/openfree/Gemma-3-R1984-27B-Q8_0-GGUF
  2. Click the "Deploy" button
  3. Click "HF Inference Endpoints"
  4. In the "Inference Endpoints" menu, choose a GPU server
  5. Click "Deploy" and wait for the endpoint to start running (a query sketch follows this list)
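
Once the endpoint shows as running, it can be queried from Python. The sketch below assumes the endpoint serves a chat-style route; the endpoint URL and token are placeholders to copy from your own endpoint page, not values from this thread.

```python
# Minimal sketch: query a running HF Inference Endpoint from Python.
# ENDPOINT_URL and HF_TOKEN are placeholders -- copy the real values from
# your endpoint's overview page and your account settings.
from huggingface_hub import InferenceClient

ENDPOINT_URL = "https://<your-endpoint>.endpoints.huggingface.cloud"  # placeholder
HF_TOKEN = "hf_xxx"  # placeholder token with access to the endpoint

client = InferenceClient(model=ENDPOINT_URL, token=HF_TOKEN)

# chat_completion assumes the endpoint exposes a chat route; for a plain
# text-generation endpoint, client.text_generation("...") works instead.
response = client.chat_completion(
    messages=[{"role": "user", "content": "In one sentence, what is a GGUF file?"}],
    max_tokens=128,
)
print(response.choices[0].message.content)
```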

or

"git clone ..... model or space path...." <- your server

or

  1. Open the Space: https://huggingface.co/spaces/VIDraft/Gemma-3-R1984-27B
  2. Click "Duplicate"
  3. In Files, open app.py and edit the code, replacing the model path with your chosen GGUF model path (a hypothetical sketch of this edit follows below)
  4. Commit, and the Space will rebuild
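
The Space's actual app.py isn't shown in this thread, so the snippet below is only a hypothetical illustration of the kind of edit step 3 describes: pointing whatever GGUF loader the app uses at a different repo and file. The repo id and filename are placeholders.

```python
# Hypothetical illustration of the step-3 edit -- the real app.py in
# VIDraft/Gemma-3-R1984-27B may load the model differently. The point is
# simply to swap in your chosen GGUF repo id and filename.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

MODEL_REPO = "your-username/your-model-GGUF"  # placeholder: your GGUF repo
MODEL_FILE = "your-model.Q8_0.gguf"           # placeholder: the .gguf filename

model_path = hf_hub_download(repo_id=MODEL_REPO, filename=MODEL_FILE)
llm = Llama(model_path=model_path, n_ctx=4096)
```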

Thank you for the information! I will try them.

DrNicefellow changed discussion status to closed