refactor readme.md
add gguf download links to provided files table
README.md
CHANGED
@@ -17,20 +17,20 @@ quantized_by: taronaeo
 This repository contains GGUF format model for [IBM Granite 3.0 1B Instruct](https://huggingface.co/ibm-granite/granite-3.0-1b-a400m-instruct). Every model has been verified to work on IBM z15 Mainframe.
 
 ### Provided Files
-| Name
-
-| granite-3.0-1b-a400m-instruct-be.Q2_K.gguf
-| granite-3.0-1b-a400m-instruct-be.Q3_K_S.gguf | Q3_K_S | 3 | 571M | very small, high quality loss |
-| granite-3.0-1b-a400m-instruct-be.Q3_K_M.gguf | Q3_K_M | 3 | 629M | very small, high quality loss |
-| granite-3.0-1b-a400m-instruct-be.Q3_K_L.gguf | Q3_K_L | 3 | 679M | small, substantial quality loss |
-| granite-3.0-1b-a400m-instruct-be.Q4_0.gguf
-| granite-3.0-1b-a400m-instruct-be.Q4_K_S.gguf | Q4_K_S | 4 | 739M | small, greater quality loss |
-| granite-3.0-1b-a400m-instruct-be.Q4_K_M.gguf | Q4_K_M | 4 | 784M | medium, balanced quality - recommended |
-| granite-3.0-1b-a400m-instruct-be.Q5_0.gguf
-| granite-3.0-1b-a400m-instruct-be.Q5_K_S.gguf | Q5_K_S | 5 | 886M | large, low quality loss - recommended |
-| granite-3.0-1b-a400m-instruct-be.Q5_K_M.gguf | Q5_K_M | 5 | 913M | large, very low quality loss - recommended |
-| granite-3.0-1b-a400m-instruct-be.Q6_K.gguf
-| granite-3.0-1b-a400m-instruct-be.Q8_0.gguf
+| Name | Quant Method | Bits | Size | Use Case |
+|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------|------|------|------------------------------------------------------------------------|
+| [granite-3.0-1b-a400m-instruct-be.Q2_K.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q2_K.gguf) | Q2_K | 2 | 489M | smallest, significant quality loss - not recommended for most purposes |
+| [granite-3.0-1b-a400m-instruct-be.Q3_K_S.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q3_K_S.gguf) | Q3_K_S | 3 | 571M | very small, high quality loss |
+| [granite-3.0-1b-a400m-instruct-be.Q3_K_M.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q3_K_M.gguf) | Q3_K_M | 3 | 629M | very small, high quality loss |
+| [granite-3.0-1b-a400m-instruct-be.Q3_K_L.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q3_K_L.gguf) | Q3_K_L | 3 | 679M | small, substantial quality loss |
+| [granite-3.0-1b-a400m-instruct-be.Q4_0.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q4_0.gguf) | Q4_0 | 4 | 733M | legacy; small, very high quality loss - prefer using Q3_K_M |
+| [granite-3.0-1b-a400m-instruct-be.Q4_K_S.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q4_K_S.gguf) | Q4_K_S | 4 | 739M | small, greater quality loss |
+| [granite-3.0-1b-a400m-instruct-be.Q4_K_M.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q4_K_M.gguf) | Q4_K_M | 4 | 784M | medium, balanced quality - recommended |
+| [granite-3.0-1b-a400m-instruct-be.Q5_0.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q5_0.gguf) | Q5_0 | 5 | 886M | legacy; medium, balanced quality - prefer using Q4_K_M |
+| [granite-3.0-1b-a400m-instruct-be.Q5_K_S.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q5_K_S.gguf) | Q5_K_S | 5 | 886M | large, low quality loss - recommended |
+| [granite-3.0-1b-a400m-instruct-be.Q5_K_M.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q5_K_M.gguf) | Q5_K_M | 5 | 913M | large, very low quality loss - recommended |
+| [granite-3.0-1b-a400m-instruct-be.Q6_K.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q6_K.gguf) | Q6_K | 6 | 1.1G | very large, extremely low quality loss |
+| [granite-3.0-1b-a400m-instruct-be.Q8_0.gguf](https://huggingface.co/taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF/blob/main/granite-3.0-1b-a400m-instruct-be.Q8_0.gguf) | Q8_0 | 8 | 1.4G | very large, extremely low quality loss - not recommended |
 
 # Original model card: Granite-3.0-1B-A400M-Instruct
 
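Note on the links added to the Provided Files table: they use the `/blob/main/` path, which on Hugging Face opens the file viewer page rather than returning the file itself. Direct downloads use `/resolve/<revision>/` instead. The sketch below is one possible way to derive a direct-download URL from a table entry. The helper names `resolve_url` and `blob_to_resolve` are hypothetical and not part of this repository; the repository id and filename are taken from the table.

```python
from urllib.parse import quote

# Repository id as it appears in the links of the Provided Files table.
REPO_ID = "taronaeo/Granite-3.0-1B-A400M-Instruct-BE-GGUF"


def resolve_url(filename: str, repo_id: str = REPO_ID, revision: str = "main") -> str:
    """Build a direct-download URL for a file in a Hugging Face model repo.

    Hypothetical helper: it only formats a string and does no network access.
    """
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def blob_to_resolve(url: str) -> str:
    """Turn a /blob/ file-viewer link into a /resolve/ direct-download link.

    Only the first "/blob/" occurrence is replaced, so a filename that
    happens to contain "/blob/" is left untouched.
    """
    return url.replace("/blob/", "/resolve/", 1)


if __name__ == "__main__":
    name = "granite-3.0-1b-a400m-instruct-be.Q4_K_M.gguf"
    print(resolve_url(name))
```

The resulting URL can be passed to any HTTP client. If the `huggingface_hub` package is installed, `hf_hub_download(repo_id=..., filename=...)` is an alternative that also handles local caching.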