nisten committed on
Commit
daaec3a
1 Parent(s): 677dc77

Update README.md

  1. README.md +12 -3
README.md CHANGED
@@ -10,9 +10,9 @@ This repository contains CPU-optimized GGUF quantizations of the Meta-Llama-3.1-
10
  ## Available Quantizations
11
 
12
  1. Q4_0_4_8 (CPU FMA-Optimized): ~246 GB
13
- 2. BF16: ~820 GB
14
- 3. Q8_0: ~410 GB
15
- 4. more coming...
16
 
17
  ## Use Aria2 for parallelized downloads, links will download 9x faster
18
 
@@ -33,6 +33,15 @@ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00005-of-00006.gg
33
  aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf
34
  ```
35
 
 
 
 
 
 
 
 
 
 
36
  ### BF16 Version
37
 
38
  ```bash
 
10
  ## Available Quantizations
11
 
12
  1. Q4_0_4_8 (CPU FMA-Optimized): ~246 GB
13
+ 2. BF16: ~811 GB
14
+ 3. Q8_0: ~406 GB
15
+ 4. Q2-Q8 mix: ~165 GB
16
 
17
  ## Use Aria2 for parallelized downloads, links will download 9x faster
18
 
 
33
  aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-optimized-q4048-00006-of-00006.gguf
34
  ```
35
 
36
+ ### Q2K-Q8 mixed (2-bit/8-bit) quantization, which I wrote myself. This is the smallest coherent one I could make without using an imatrix yet
37
+
38
+ ```bash
39
+ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00001-of-00004.gguf "https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00001-of-00004.gguf?download=true"
40
+ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00002-of-00004.gguf "https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00002-of-00004.gguf?download=true"
41
+ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00003-of-00004.gguf "https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00003-of-00004.gguf?download=true"
42
+ aria2c -x 16 -s 16 -k 1M -o meta-405b-inst-cpu-2kmix8k-00004-of-00004.gguf "https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main/meta-405b-inst-cpu-2kmix8k-00004-of-00004.gguf?download=true"
43
+ ```
44
+
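The four shard commands above differ only in the part number, so they can be generated in a loop. The following is a minimal sketch, not part of this commit: the `part_name` helper and the `base` and `total` variables are hypothetical names introduced for illustration, and the URL is used without the `?download=true` suffix, as in the Q4_0_4_8 commands. It only prints the commands (a dry run), so nothing is downloaded until you choose to run them.

```bash
#!/usr/bin/env bash
# Sketch: print one aria2c command per shard of the 2kmix8k quantization.
set -euo pipefail

# Base URL and shard count taken from the commands in this README.
base="https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/resolve/main"
total=4

# part_name N -> file name of shard N (hypothetical helper, not part of the repo).
part_name() {
  printf 'meta-405b-inst-cpu-2kmix8k-%05d-of-%05d.gguf' "$1" "$total"
}

for ((i = 1; i <= total; i++)); do
  file="$(part_name "$i")"
  # Dry run: prints the command instead of executing it.
  echo aria2c -x 16 -s 16 -k 1M -o "$file" "$base/$file"
done
```

To actually download, remove the `echo` or pipe the script's output to `bash`. Running the shards one after another keeps the same per-file aria2c settings as the commands above.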
45
  ### BF16 Version
46
 
47
  ```bash