iMatrix GGUFs for https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

iMat generated using Kalomaze's groups_merged.txt

Downloads last month
45
GGUF
Model size
70.6B params
Architecture
llama
Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF

Quantized
(95)
this model

Dataset used to train MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF