<!-- README_GPTQ.md-text-generation-webui start -->
## How to easily download and use this model in [text-generation-webui](https://github.com/oobabooga/text-generation-webui)

**NOTE**: I have not tested this model with Text Generation Webui. It *should* work through the Transformers loader. It will *not* work through the AutoGPTQ loader, due to the files being sharded.

Please make sure you're using the latest version of [text-generation-webui](https://github.com/oobabooga/text-generation-webui).

It is strongly recommended to use the text-generation-webui one-click-installers unless you're sure you know how to make a manual install.
- see Provided Files above for the list of branches for each option.
3. Click **Download**.
4. The model will start downloading. Once it's finished it will say "Done".
5. Choose Loader: **Transformers**.
6. In the top left, click the refresh icon next to **Model**.
7. In the **Model** dropdown, choose the model you just downloaded: `Falcon-180B-Chat-GPTQ`.
8. The model will automatically load, and is now ready for use!
9. If you want any custom settings, set them and then click **Save settings for this model** followed by **Reload the Model** in the top right.
   * Note that you do not need to and should not set manual GPTQ parameters any more. These are set automatically from the file `quantize_config.json`.
10. Once you're ready, click the **Text Generation tab** and enter a prompt to get started!
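As an alternative to the in-webui downloader in the steps above, a single quantisation branch can be cloned from the command line, following the `git clone --single-branch` pattern used earlier in this README. The repository path below is an assumption based on the model name shown here; confirm it on the Hugging Face model page, and make sure `git-lfs` is installed so the weight shards are fetched.

```shell
# Sketch only: clone one quantisation branch of the repo directly.
# ASSUMPTION: the repo id "TheBloke/Falcon-180B-Chat-GPTQ" is inferred from the
# model name above -- verify it on the model page before running.
git clone --single-branch --branch gptq-3bit--1g-actorder_True \
  https://huggingface.co/TheBloke/Falcon-180B-Chat-GPTQ Falcon-180B-Chat-GPTQ
```

If you run this from inside text-generation-webui's `models/` directory, the cloned folder should then appear in the **Model** dropdown after a refresh.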
<!-- README_GPTQ.md-text-generation-webui end -->

<!-- README_GPTQ.md-use-from-python start -->