Update README.md
README.md
CHANGED
@@ -6,15 +6,7 @@ duplicated_from: TheBloke/wizard-vicuna-13B-SuperHOT-8K-fp16
 
 <!-- header start -->
 <div style="width: 100%;">
-<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
-</div>
-<div style="display: flex; justify-content: space-between; width: 100%;">
-<div style="display: flex; flex-direction: column; align-items: flex-start;">
-<p><a href="https://discord.gg/theblokeai">Chat & support: my new Discord server</a></p>
-</div>
-<div style="display: flex; flex-direction: column; align-items: flex-end;">
-<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
-</div>
+Cloned from TheBloke repo
 </div>
 <!-- header end -->
 
@@ -49,7 +41,7 @@ The provided modelling code, activated with `trust_remote_code=True` will automa
 from transformers import AutoConfig, AutoTokenizer, AutoModelForCausalLM, pipeline
 import argparse
 
-model_name_or_path = "TheBloke/wizard-vicuna-13B-SuperHOT-8K-fp16"
+model_name_or_path = "gsaivinay/wizard-vicuna-13B-SuperHOT-8K-fp16"
 
 use_triton = False
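For context on the changed line above: the hunk header references the README's usage example, which loads the repo's custom modelling code via `trust_remote_code=True`. The sketch below is not the README's full example, only a minimal illustration of how the new `model_name_or_path` value would be used with the standard `transformers` API; the generation settings and the Vicuna-style prompt are assumptions.

```python
# Minimal sketch (not the README's full example) of loading the model with the
# updated repo ID from this diff. device_map="auto" assumes `accelerate` is
# installed; the USER/ASSISTANT prompt format is an assumption.
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name_or_path = "gsaivinay/wizard-vicuna-13B-SuperHOT-8K-fp16"

tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
model = AutoModelForCausalLM.from_pretrained(
    model_name_or_path,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True,  # enables the repo's custom scaled-RoPE modelling code
)

prompt = "USER: Tell me about AI.\nASSISTANT:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```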
@@ -97,33 +89,7 @@ Provided in the repo is `llama_rope_scaled_monkey_patch.py`, written by @kaioken
 
 It can theoretically be added to any Python UI or custom code to enable the same result as `trust_remote_code=True`. I have not tested this, and it should be superseded by using `trust_remote_code=True`, but I include it for completeness and for interest.
 
-<!-- footer start -->
-## Discord
-
-For further support, and discussions on these models and AI in general, join us at:
-
-[TheBloke AI's Discord server](https://discord.gg/theblokeai)
-
-## Thanks, and how to contribute.
-
-Thanks to the [chirper.ai](https://chirper.ai) team!
-
-I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.
-
-If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.
 
-Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.
-
-* Patreon: https://patreon.com/TheBlokeAI
-* Ko-Fi: https://ko-fi.com/TheBlokeAI
-
-**Special thanks to**: Luke from CarbonQuill, Aemon Algiz, Dmitriy Samsonov.
-
-**Patreon special mentions**: zynix, ya boyyy, Trenton Dambrowitz, Imad Khwaja, Alps Aficionado, chris gileta, John Detwiler, Willem Michiel, RoA, Mano Prime, Rainer Wilmers, Fred von Graf, Matthew Berman, Ghost, Nathan LeClaire, Iucharbius, Ai Maven, Illia Dulskyi, Joseph William Delisle, Space Cruiser, Lone Striker, Karl Bernard, Eugene Pentland, Greatston Gnanesh, Jonathan Leane, Randy H, Pierre Kircher, Willian Hasse, Stephen Murray, Alex, terasurfer, Edmond Seymore, Oscar Rangel, Luke Pendergrass, Asp the Wyvern, Junyu Yang, David Flickinger, Luke, Spiking Neurons AB, subjectnull, Pyrater, Nikolai Manek, senxiiz, Ajan Kanaga, Johann-Peter Hartmann, Artur Olbinski, Kevin Schuppel, Derek Yates, Kalila, K, Talal Aujan, Khalefa Al-Ahmad, Gabriel Puliatti, John Villwock, WelcomeToTheClub, Daniel P. Andersen, Preetika Verma, Deep Realms, Fen Risland, trip7s trip, webtim, Sean Connelly, Michael Levine, Chris McCloskey, biorpg, vamX, Viktor Bowallius, Cory Kujawski.
-
-Thank you to all my generous patrons and donaters!
-
-<!-- footer end -->
 
 # Original model card: Kaio Ken's SuperHOT 8K
 
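The context lines above mention `llama_rope_scaled_monkey_patch.py` as an alternative to `trust_remote_code=True`. As a hedged illustration only: the snippet below assumes the patch module is importable and exposes a helper such as `replace_llama_rope_with_scaled_rope()` (a hypothetical name; check the file for the actual entry point), which must run before the model is instantiated.

```python
# Hedged sketch of the monkey-patch route mentioned above. The module and
# function name are assumptions -- consult llama_rope_scaled_monkey_patch.py
# in the repo for the actual entry point. Prefer trust_remote_code=True.
from llama_rope_scaled_monkey_patch import replace_llama_rope_with_scaled_rope
from transformers import AutoModelForCausalLM, AutoTokenizer

# Patch LLaMA's rotary position embeddings *before* the model is built, so the
# scaled RoPE (extended 8K context) replaces the stock implementation.
replace_llama_rope_with_scaled_rope()

model_name_or_path = "gsaivinay/wizard-vicuna-13B-SuperHOT-8K-fp16"
tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
model = AutoModelForCausalLM.from_pretrained(model_name_or_path, device_map="auto")
```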
@@ -159,19 +125,6 @@ I trained the LoRA with the following configuration:
 
 # Original model card: June Lee's Wizard Vicuna 13B
 
-<!-- header start -->
-<div style="width: 100%;">
-<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
-</div>
-<div style="display: flex; justify-content: space-between; width: 100%;">
-<div style="display: flex; flex-direction: column; align-items: flex-start;">
-<p><a href="https://discord.gg/Jq4vkcDakD">Chat & support: my new Discord server</a></p>
-</div>
-<div style="display: flex; flex-direction: column; align-items: flex-end;">
-<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
-</div>
-</div>
-<!-- header end -->
 # Wizard-Vicuna-13B-HF
 
 This is a float16 HF format repo for [junelee's wizard-vicuna 13B](https://huggingface.co/junelee/wizard-vicuna-13b).
@@ -186,30 +139,7 @@ This model was converted to float16 to make it easier to load and manage.
 * [4bit and 5bit GGML models for CPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-GGML).
 * [float16 HF format model for GPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-HF).
 
-<!-- footer start -->
-## Discord
-
-For further support, and discussions on these models and AI in general, join us at:
-
-[TheBloke AI's Discord server](https://discord.gg/Jq4vkcDakD)
-
-## Thanks, and how to contribute.
-
-Thanks to the [chirper.ai](https://chirper.ai) team!
-
-I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.
-
-If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.
-
-Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.
-
-* Patreon: https://patreon.com/TheBlokeAI
-* Ko-Fi: https://ko-fi.com/TheBlokeAI
-
-**Patreon special mentions**: Aemon Algiz, Dmitriy Samsonov, Nathan LeClaire, Trenton Dambrowitz, Mano Prime, David Flickinger, vamX, Nikolai Manek, senxiiz, Khalefa Al-Ahmad, Illia Dulskyi, Jonathan Leane, Talal Aujan, V. Lukas, Joseph William Delisle, Pyrater, Oscar Rangel, Lone Striker, Luke Pendergrass, Eugene Pentland, Sebastain Graf, Johann-Peter Hartman.
 
-Thank you to all my generous patrons and donaters!
-<!-- footer end -->
 
 # Original WizardVicuna-13B model card
 