HiroseKoichi commited on
Commit
2d08961
1 Parent(s): be96274

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +70 -0
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ library_name: transformers
4
+ tags:
5
+ - nsfw
6
+ - not-for-all-audiences
7
+ - llama-3
8
+ - text-generation-inference
9
+ ---
10
+
11
+ # Llama-Salad-4x8B-V2
12
+
13
+
14
+ # Details
15
+ - **License**: [llama3](https://llama.meta.com/llama3/license/)
16
+ - **Instruct Format**: [llama-3](https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/)
17
+ - **Context Size**: 8K
18
+
19
+ ## Models Used
20
+ - [Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct)
21
+ - [Llama-3-8B-Synthia-v3.5](https://huggingface.co/migtissera/Llama-3-8B-Synthia-v3.5)
22
+ - [Llama-3-Soliloquy-8B-v2](https://huggingface.co/openlynn/Llama-3-Soliloquy-8B-v2)
23
+ - [opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5](https://huggingface.co/dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5)
24
+
25
+ ## Merge Config
26
+ ```yaml
27
+ base_model: NousResearch/Meta-Llama-3-8B-Instruct
28
+ gate_mode: hidden
29
+ dtype: bfloat16
30
+ experts_per_token: 2
31
+ experts:
32
+ - source_model: NousResearch/Meta-Llama-3-8B-Instruct
33
+ positive_prompts:
34
+ - "summarize"
35
+ - "paraphrase"
36
+ - "explain"
37
+ - "define"
38
+ - "translate"
39
+ - "multilingual"
40
+ - "chat"
41
+ - "conversation"
42
+ - source_model: migtissera/Llama-3-8B-Synthia-v3.5
43
+ positive_prompts:
44
+ - "programming language"
45
+ - "JavaScript"
46
+ - "Python programming language"
47
+ - "Rust programming language"
48
+ - "CSS markup styling language"
49
+ - "math"
50
+ - "code"
51
+ - "step-by-step"
52
+ - "logical reasoning"
53
+ - source_model: openlynn/Llama-3-Soliloquy-8B-v2
54
+ positive_prompts:
55
+ - "roleplay"
56
+ - "erotic roleplay"
57
+ - "characters"
58
+ - "scene"
59
+ - "opinion"
60
+ - source_model: dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5
61
+ positive_prompts:
62
+ - "creative writing"
63
+ - "storytelling"
64
+ - "narration"
65
+ - "narrative setting"
66
+ - "narrative plot"
67
+ - "narrative exposition"
68
+ - "narrative theme"
69
+ - "narrative climax"
70
+ ```