BootsofLagrangian
commited on
Commit
•
c671a3c
1
Parent(s):
c94416f
Update quantization weight link
Browse files
README.md
CHANGED
@@ -39,6 +39,13 @@ This is the model card of a 🤗 transformers model that has been pushed on the
|
|
39 |
|
40 |
This repo provides full model weight files for AkaLlama-70B-v0.1.
|
41 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
42 |
# Use with transformers
|
43 |
|
44 |
See the snippet below for usage with Transformers:
|
|
|
39 |
|
40 |
This repo provides full model weight files for AkaLlama-70B-v0.1.
|
41 |
|
42 |
+
### Quantized Weights
|
43 |
+
| Method | repo |
|
44 |
+
| :----: | :----: |
|
45 |
+
| [GGUF](https://github.com/ggerganov/ggml/blob/master/docs/gguf.md) | https://huggingface.co/mirlab/AkaLlama-llama3-70b-v0.1-GGUF |
|
46 |
+
| [ExLlamaV2](https://github.com/turboderp/exllamav2) | https://huggingface.co/mirlab/AkaLlama-llama3-70b-v0.1-exl2 |
|
47 |
+
|
48 |
+
|
49 |
# Use with transformers
|
50 |
|
51 |
See the snippet below for usage with Transformers:
|