Update README.md
Browse files
README.md
CHANGED
@@ -31,12 +31,12 @@ The following specifications:
|
|
31 |
- **Maximum Sequence Length (context):** 32,768 tokens
|
32 |
## Usage
|
33 |
|
34 |
-
| Model | Quantized | Size | Hardware Requirement |
|
35 |
-
|-------------|-----------|--------|--------------------------|
|
36 |
-
| APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 2x24G GPU memory |
|
37 |
-
| APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 2x24G GPU memory |
|
38 |
-
| Large | No | Large |
|
39 |
-
| APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G |
|
40 |
|
41 |
|
42 |
|
|
|
31 |
- **Maximum Sequence Length (context):** 32,768 tokens
|
32 |
## Usage
|
33 |
|
34 |
+
| Model | Quantized | Size | Context | Hardware Requirement |
|
35 |
+
|-------------|-----------|--------|--------------------------| --------------------------|
|
36 |
+
| APUS-xDAN4.0-MoE-0402.Q2_K.gguf | Q2_K | 39G | 32k | 2x24G GPU memory |
|
37 |
+
| APUS-xDAN4.0-MoE-0402.Q3_K_M_Matrix.gguf | Q3_K_M | 51G | 32k | 2x24G GPU memory |
|
38 |
+
| Large | No | Large | 32k | 2x24G GPU memory |
|
39 |
+
| APUS-xDAN4.0-MoE-0402.Q4_K_M.gguf | Q4_K_M | 64G | 32k | 2x24G GPU memory |
|
40 |
|
41 |
|
42 |
|