Upload README.md
README.md (CHANGED)
@@ -5,6 +5,15 @@ license: llama2
 model_creator: TFLai
 model_name: ChatAYT Lora Assamble Marcoroni
 model_type: llama
+prompt_template: '### Instruction:
+
+
+{prompt}
+
+
+### Response:
+
+'
 quantized_by: TheBloke
 ---
 
@@ -40,6 +49,7 @@ Multiple GPTQ parameter permutations are provided; see Provided Files below for
 <!-- repositories-available start -->
 ## Repositories available
 
+* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/ChatAYT-Lora-Assamble-Marcoroni-AWQ)
 * [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/ChatAYT-Lora-Assamble-Marcoroni-GPTQ)
 * [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/ChatAYT-Lora-Assamble-Marcoroni-GGUF)
 * [TFLai's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TFLai/ChatAYT-Lora-Assamble-Marcoroni)
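The `prompt_template` added to the front matter in this commit is the Alpaca-style instruction format. A minimal sketch of filling it in with Python string formatting follows; the instruction text and variable names are illustrative, not taken from the README, and the whitespace is shown in the usual rendered form rather than the YAML-quoted form.

```python
# Alpaca-style template matching the prompt_template added in this commit
# (illustrative sketch; exact blank-line layout follows the rendered README form).
PROMPT_TEMPLATE = """### Instruction:

{prompt}

### Response:
"""

# Illustrative instruction; substitute your own prompt text.
prompt = "Tell me about AI"
formatted_prompt = PROMPT_TEMPLATE.format(prompt=prompt)
print(formatted_prompt)
```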