Original model: Poro-34B-chat

Description

GGUF-format model files quantized using llama.cpp

We have Q4_K_M and Q5_K_M quantized models available.

Downloads last month
55
GGUF
Model size
35.1B params
Architecture
bloom
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including LumiOpen/Poro-34B-chat-GGUF