https://github.com/spydaz # LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF This model was converted to GGUF format from [`LeroyDyer/Mixtral_AI_MultiToken`](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) using llama.cpp. Refer to the [original model card](https://huggingface.co/LeroyDyer/Mixtral_AI_MultiToken) for more details on the model. ## Use with llama.cpp
brew install ggerganov/ggerganov/llama.cpp
llama-cli --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -p "The meaning to life and the universe is "
llama-server --hf-repo LeroyDyer/Mixtral_AI_MultiToken-Q4_K_M-GGUF --model mixtral_ai_multitoken.Q4_K_M.gguf -c 2048
Downloads last month
160
GGUF
Model size
7.24B params
Architecture
llama

4-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.