Initial GPTQ model commit.
---
inference: false
---

# WizardLM - uncensored: An Instruction-following LLM Using Evol-Instruct

These files are GPTQ 4bit model files for [Eric Hartford's 'uncensored' version of WizardLM](https://huggingface.co/ehartford/WizardLM-30B-Uncensored).

It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa).

Eric did a fresh 7B training using the WizardLM method, on [a dataset edited to remove all the "I'm sorry.." type ChatGPT responses](https://huggingface.co/datasets/ehartford/WizardLM_alpaca_evol_instruct_70k_unfiltered).

## Other repositories available

* [4bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/WizardLM-30B-uncensored-GPTQ)
* [4bit and 5bit GGML models for CPU inference](https://huggingface.co/TheBloke/WizardLM-30B-uncensored-GGML)
* [Eric's unquantised model in HF format](https://huggingface.co/ehartford/WizardLM-30B-Uncensored)

## How to easily download and use this model in text-generation-webui

It was created without the `--act-order` parameter. It may have slightly lower inference quality compared to the other file, but is guaranteed to work on all versions of GPTQ-for-LLaMa and text-generation-webui.

* `wizard-vicuna-13B-GPTQ-4bit.compat.no-act-order.safetensors`
* Works with all versions of GPTQ-for-LLaMa code, both Triton and CUDA branches
* Works with AutoGPTQ. Use `strict=False` to load.
* Works with text-generation-webui one-click-installers
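The compatibility notes above can be sketched as an AutoGPTQ call. This is a hedged illustration, not an exact recipe: the repo id and `strict=False` come from this card, while the helper name `gptq_load_kwargs`, the basename passed in, and the remaining keyword arguments are assumptions about the `auto-gptq` `from_quantized()` API. Loading a 30B model also requires the `auto-gptq` package and a CUDA GPU, so the actual load is wrapped in a function rather than executed here.

```python
def gptq_load_kwargs(model_basename: str) -> dict:
    """Build the keyword arguments this card implies for AutoGPTQ:
    a .safetensors file quantised without --act-order, loaded with
    strict=False for maximum compatibility. (Helper name and exact
    kwargs are assumptions, not from the card.)"""
    return {
        "model_basename": model_basename,  # file name without the .safetensors suffix
        "use_safetensors": True,           # the compat file is a .safetensors file
        "strict": False,                   # card: "Use strict=False to load" with AutoGPTQ
    }


def load_model(repo_id: str = "TheBloke/WizardLM-30B-uncensored-GPTQ"):
    """Illustrative load of this repo with AutoGPTQ; not run here."""
    from auto_gptq import AutoGPTQForCausalLM  # requires the auto-gptq package

    kwargs = gptq_load_kwargs("wizard-vicuna-13B-GPTQ-4bit.compat.no-act-order")
    return AutoGPTQForCausalLM.from_quantized(repo_id, device="cuda:0", **kwargs)
```

The same file should also load in text-generation-webui directly, since the no-act-order format works on both the Triton and CUDA branches of GPTQ-for-LLaMa.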