Quantised 4bit and 2bit GGMLs of [changsung's alpaca-lora-65B](https://huggingface.co/chansung/alpaca-lora-65b) for CPU inference with [llama.cpp](https://github.com/ggerganov/llama.cpp).
I also have 4bit GPTQ files for GPU inference available here: [TheBloke/alpaca-lora-65B-GPTQ-4bit](https://huggingface.co/TheBloke/alpaca-lora-65B-GPTQ-4bit).
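As a rough sense of what 4-bit and 2-bit quantisation buys for CPU inference of a 65B model, here is a back-of-envelope RAM estimate sketch. The fixed 2 GB overhead for context and scratch buffers is an illustrative assumption, not a figure taken from this repo's file table:

```python
def approx_ram_gb(n_params_billion: float, bits_per_weight: float,
                  overhead_gb: float = 2.0) -> float:
    """Rough RAM estimate for a quantised model: weights stored at the
    quantised width, plus a fixed overhead for context/KV cache.
    The 2 GB overhead default is an illustrative assumption."""
    # billions of params * (bits / 8) = gigabytes of weight data
    weight_gb = n_params_billion * bits_per_weight / 8
    return weight_gb + overhead_gb

print(approx_ram_gb(65, 4))  # 4-bit: 34.5 GB
print(approx_ram_gb(65, 2))  # 2-bit: 18.25 GB
```

Real GGML quantisation formats carry extra per-block scale data, so the figures in the table below will differ somewhat from this naive estimate.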
## Provided files
| Bits | Size | RAM required | Name |
| ---- | ---- | ---- | ---- |