πŸ¦– T-Rex-mini β€” GGUF I1 (Quantized - Imatrix)

This is a quantized GGUF build of saturated-labs/T-Rex-mini, converted with llama.cpp and quantized using an importance matrix (imatrix).
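As a minimal usage sketch (the file name, prompt, and context size below are illustrative, not from this card), a downloaded quant can be run locally with llama.cpp's llama-cli:

```shell
# Chat with the Q4_K_M quant using llama.cpp (paths are placeholders)
./llama-cli -m T-Rex-mini-Q4_K_M.gguf \
  -p "Hello" \
  -n 128 \
  --ctx-size 4096
```

Smaller quants (IQ4_XS, Q4_K_S) trade some quality for lower memory use; Q6_K is closest to the f16 original.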

πŸ”§ Quantization Details

  • Original Model: saturated-labs/T-Rex-mini
  • Format: GGUF (.gguf)
  • Quantization Types: IQ4_XS, Q4_K_S, Q4_K_M, Q5_K_S, Q5_K_M, Q6_K
  • Tool Used: llama.cpp
  • Command:
    ./llama-quantize.exe --imatrix imatrix.dat t-rex-mini-f16.gguf t-rex-mini-QX_X_X.gguf QX_X_X
    
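As a rough sketch (not measured sizes from this repo), the on-disk size of each quant can be estimated from the parameter count and the nominal bits per weight; real GGUF files deviate somewhat because K-quants mix bit widths per tensor and the file carries metadata:

```python
def estimate_gguf_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Ballpark file-size estimate: params * bits / 8, in gigabytes.

    Actual GGUF files differ because K-quants use mixed precision
    per tensor and store metadata, so treat this as an approximation.
    """
    return n_params * bits_per_weight / 8 / 1e9

# 8.03B params at nominal bit widths (illustrative estimates only)
for name, bpw in [("Q4_K_M", 4.0), ("Q5_K_M", 5.0), ("Q6_K", 6.0)]:
    print(f"{name}: ~{estimate_gguf_size_gb(8.03e9, bpw):.1f} GB")
```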
πŸ“Š Model Details

  • Model Size: 8.03B params
  • Architecture: llama
