mirlab
/

AkaLlama-llama3-70b-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

BootsofLagrangian commited on May 19

Commit

c671a3c

•

1 Parent(s): c94416f

Update quantization weight link

Files changed (1) hide show

README.md +7 -0

README.md CHANGED Viewed

@@ -39,6 +39,13 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 This repo provides full model weight files for AkaLlama-70B-v0.1.
 # Use with transformers
 See the snippet below for usage with Transformers:

 This repo provides full model weight files for AkaLlama-70B-v0.1.
+### Quantized Weights
+| Method | repo |
+| :----: | :----: |
+| [GGUF](https://github.com/ggerganov/ggml/blob/master/docs/gguf.md) | https://huggingface.co/mirlab/AkaLlama-llama3-70b-v0.1-GGUF |
+| [ExLlamaV2](https://github.com/turboderp/exllamav2) | https://huggingface.co/mirlab/AkaLlama-llama3-70b-v0.1-exl2 |
 # Use with transformers
 See the snippet below for usage with Transformers: