KaraKaraWitch
/

L3.1-70b-Inori

@@ -11,30 +11,47 @@ library_name: transformers
 tags:
 - mergekit
 - merge
 ---
-# merge
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Fizzarolli/L3.1-70b-glitz-v0.2](https://huggingface.co/Fizzarolli/L3.1-70b-glitz-v0.2) as a base.
-### Models Merged
-The following models were included in the merge:
 * [abacusai/Dracarys-Llama-3.1-70B-Instruct](https://huggingface.co/abacusai/Dracarys-Llama-3.1-70B-Instruct)
 * [Sao10K/L3-70B-Euryale-v2.1](https://huggingface.co/Sao10K/L3-70B-Euryale-v2.1)
 * [gbueno86/Cathallama-70B](https://huggingface.co/gbueno86/Cathallama-70B)
 * [sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1](https://huggingface.co/sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1)
 * [nothingiisreal/L3.1-70B-Celeste-V0.1-BF16](https://huggingface.co/nothingiisreal/L3.1-70B-Celeste-V0.1-BF16)
-* [cyberagent/Llama-3.1-70B-Japanese-Instruct-2407](https://huggingface.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407)
-### Configuration
-The following YAML configuration was used to produce this model:
 ```yaml
@@ -52,5 +69,29 @@ base_model: Fizzarolli/L3.1-70b-glitz-v0.2
 parameters:
   normalize: true
 dtype: bfloat16
 ```

 tags:
 - mergekit
 - merge
+- abacusai/Dracarys-Llama-3.1-70B-Instruct
+- Sao10K/L3-70B-Euryale-v2.1
+- gbueno86/Cathallama-70B
+- sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1
+- nothingiisreal/L3.1-70B-Celeste-V0.1-BF16
+- Fizzarolli/L3.1-70b-glitz-v0.2
+- cyberagent/Llama-3.1-70B-Japanese-Instruct-2407
 ---
+# KaraKaraWitch/L3.1-70b-Inori
+Inori is the second 70b for the weekend for me to play around.
+Learning from
+![](Inori.png)
+L3.1-70b-Inori is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [abacusai/Dracarys-Llama-3.1-70B-Instruct](https://huggingface.co/abacusai/Dracarys-Llama-3.1-70B-Instruct)
 * [Sao10K/L3-70B-Euryale-v2.1](https://huggingface.co/Sao10K/L3-70B-Euryale-v2.1)
 * [gbueno86/Cathallama-70B](https://huggingface.co/gbueno86/Cathallama-70B)
 * [sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1](https://huggingface.co/sophosympatheia/New-Dawn-Llama-3.1-70B-v1.1)
 * [nothingiisreal/L3.1-70B-Celeste-V0.1-BF16](https://huggingface.co/nothingiisreal/L3.1-70B-Celeste-V0.1-BF16)
+* [cyberagent/Llama-3.1-70B-Japanese-Instruct-2407](https://huggingface.co/cyberagent/Llama-3.1-70B-Japanese-Instruct-2407
+70b Inori takes a different approach by using Model Stock.
+- Dracarys (I just threw it in, but can be useful for code)
+- Euryale (You all know it!)
+- Cathallama (Athene + turboderp_cat)
+- New Dawn (I heard people like it so I added it in)
+- Celeste (RP)
+- Japanese-Instruct (Enhancement of Japanese Language for the weebs out there.)
+No Hermes was harmed in the making of this model stock merge.
+## Yap / Chat Format
+L3 Instruct.
+## 🧩 Configuration
 ```yaml
 parameters:
   normalize: true
 dtype: bfloat16
 ```
+## 💻 Usage
+```python
+!pip install -qU transformers accelerate
+from transformers import AutoTokenizer
+import transformers
+import torch
+model = "KaraKaraWitch/L3.1-70b-Inori"
+messages = [{"role": "user", "content": "What is a large language model?"}]
+tokenizer = AutoTokenizer.from_pretrained(model)
+prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+pipeline = transformers.pipeline(
+    "text-generation",
+    model=model,
+    torch_dtype=torch.float16,
+    device_map="auto",
+)
+outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
+print(outputs[0]["generated_text"])
+```