GGUF-IQ-Imatrix experimental quants for dreamgen/opus-v1.2-llama-3-8b.

This will have to be uploaded again later.

These quants were made with a different testing config, both to avoid some of the issues reported so far and to get through the imatrix data generation.
This is experimental. Proper support and fixes should arrive in the respective projects in due time.


Format: GGUF
Model size: 8.03B params
Architecture: llama
Quantizations: 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
