Is it worth hosting a quantized DeepSeek V3? Cost & performance insights?
#15 opened 2 days ago by Techw
Can you add 1.58bit?
#14 opened 4 days ago by PSM24
Dynamic quants
2
#13 opened 13 days ago by XelotX
Core dumped when merging DeepSeek-V3-Q2_K_XS
1
#12 opened 21 days ago by MyJerry1996
First review: Q5_K_M requires 502 GB RAM, better than Meta's 405B
6
#11 opened 29 days ago by krustik
Issue with the --n-gpu-layers 5 parameter: model only running on CPU
12
#10 opened 29 days ago by vuk123
I’m new to GGUF quants
1
#9 opened 30 days ago by fsaudm
I loaded DeepSeek-V3-Q5_K_M on my 10-year-old Tesla M40 (Dell C4130)
3
#8 opened about 1 month ago by gng2info
Why use Q5 for the key cache?
1
#7 opened about 1 month ago by CHNtentes
What GPU memory is required to run this? Is a 4090 enough, and does it support Ollama?
10
#5 opened about 1 month ago by sminbb
I'm a newbie. How do I use this?
1
#4 opened about 1 month ago by huangkk
Getting an error with Q3_K_M
7
#2 opened about 1 month ago by alain401
Are these imatrix GGUF quants?
4
#1 opened about 1 month ago by Kearm