HiroseKoichi/Llama-Salad-4x8B-V2

Text Generation

nsfw

Not-For-All-Audiences

text-generation-inference

Mixture of Experts

Inference Endpoints

HiroseKoichi committed on May 29, 2024

Commit 2d08961 • 1 Parent(s): be96274

Create README.md

Files changed (1)

README.md +70 -0

README.md ADDED

@@ -0,0 +1,70 @@

+---
+license: llama3
+library_name: transformers
+tags:
+- nsfw
+- not-for-all-audiences
+- llama-3
+- text-generation-inference
+---
+# Llama-Salad-4x8B-V2
+# Details
+- **License**: [llama3](https://llama.meta.com/llama3/license/)
+- **Instruct Format**: [llama-3](https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/)
+- **Context Size**: 8K
+## Models Used
+- [Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct)
+- [Llama-3-8B-Synthia-v3.5](https://huggingface.co/migtissera/Llama-3-8B-Synthia-v3.5)
+- [Llama-3-Soliloquy-8B-v2](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v2)
+- [opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5](https://huggingface.co/dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5)
+## Merge Config
+```yaml
+base_model: NousResearch/Meta-Llama-3-8B-Instruct
+gate_mode: hidden
+dtype: bfloat16
+experts_per_token: 2
+experts:
+  - source_model: NousResearch/Meta-Llama-3-8B-Instruct
+    positive_prompts:
+    - "summarize"
+    - "paraphrase"
+    - "explain"
+    - "define"
+    - "translate"
+    - "multilingual"
+    - "chat"
+    - "conversation"
+  - source_model: migtissera/Llama-3-8B-Synthia-v3.5
+    positive_prompts:
+    - "programming language"
+    - "JavaScript"
+    - "Python programming language"
+    - "Rust programming language"
+    - "CSS markup styling language"
+    - "math"
+    - "code"
+    - "step-by-step"
+    - "logical reasoning"
+  - source_model: openlynn/Llama-3-Soliloquy-8B-v2
+    positive_prompts:
+    - "roleplay"
+    - "erotic roleplay"
+    - "characters"
+    - "scene"
+    - "opinion"
+  - source_model: dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5
+    positive_prompts:
+    - "creative writing"
+    - "storytelling"
+    - "narration"
+    - "narrative setting"
+    - "narrative plot"
+    - "narrative exposition"
+    - "narrative theme"
+    - "narrative climax"
+```
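
The config above sets `experts_per_token: 2` across four experts. As a rough illustration of what that setting implies at inference time, here is a minimal sketch of top-2 routing. It is not part of the committed README and rests on assumptions: that the merged model follows Mixtral-style routing (softmax over the selected experts' gate scores), and that the gate logits shown are made-up numbers. The `route` function and the `EXPERTS` list are hypothetical names introduced only for this example.

```python
import math

# Expert names taken from the merge config; the order matches the `experts` list.
EXPERTS = [
    "Meta-Llama-3-8B-Instruct",
    "Llama-3-8B-Synthia-v3.5",
    "Llama-3-Soliloquy-8B-v2",
    "opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5",
]
EXPERTS_PER_TOKEN = 2  # `experts_per_token` in the config


def route(gate_logits, k=EXPERTS_PER_TOKEN):
    """Pick the k highest-scoring experts and normalize their weights.

    Assumption: Mixtral-style routing, i.e. a softmax restricted to the
    top-k gate logits. Returns a list of (expert_index, weight) pairs.
    """
    if len(gate_logits) != len(EXPERTS):
        raise ValueError("expected one gate logit per expert")
    # Indices of the k largest logits (ties keep the original expert order).
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    # Numerically stable softmax over the selected logits only.
    peak = max(gate_logits[i] for i in top)
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


if __name__ == "__main__":
    # Hypothetical gate scores for a single token.
    for index, weight in route([0.3, 2.1, -0.5, 1.4]):
        print(f"{EXPERTS[index]}: {weight:.2f}")
```

Each token's output would then be the weighted sum of the two selected experts' outputs; the other two experts are skipped for that token.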