Getting very low tokens per second (under 1 t/s) on M2 Ultra 192GB
#6 opened about 1 month ago by j4ys0n
vLLM: Unknown quantization method
#5 opened about 2 months ago by yaronr
Update README.md
#4 opened 2 months ago by manitonga
Upload folder using huggingface_hub
#1 opened 2 months ago by schroneko