raincandy-u commited on
Commit
cd7dbcb
1 Parent(s): c5fdf15

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -0
README.md ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - facebook
7
+ - meta
8
+ - pytorch
9
+ - llama
10
+ - llama-3
11
+ - moe
12
+ - code
13
+ license: other
14
+ license_name: llama3
15
+ license_link: LICENSE
16
+ ---
17
+ # Aplite-Instruct-4x8B-Llama-3
18
+
19
+ Aplite-Instruct-4x8B-Llama-3 is a experimental MoE (Mixture of Experts) model based on the Llama-3 architecture using Mergekit.
20
+
21
+ ## Disclaimer
22
+
23
+ This model is a research experiment and may generate incorrect or harmful content. The model's outputs should not be taken as factual or representative of the views of the model's creator or any other individual.
24
+
25
+ The model's creator is not responsible for any harm or damage caused by the model's outputs.
26
+
27
+ ## Merge Details
28
+
29
+ ```
30
+ base_model: Meta-Llama-3-8B-Instruct
31
+ experts:
32
+ - source_model: Meta-Llama-3-8B-Instruct
33
+ positive_prompts:
34
+ - "explain"
35
+ - "chat"
36
+ - "assistant"
37
+ - source_model: Llama3-8B-OpenHermes-DPO
38
+ positive_prompts:
39
+ - "python"
40
+ - "math"
41
+ - "solve"
42
+ - "code"
43
+ - source_model: Llama-3-SLERP-8B
44
+ positive_prompts:
45
+ - "chat"
46
+ - "assistant"
47
+ - "AI"
48
+ - source_model: hf-llama3-8b-orpo-v0.0
49
+ positive_prompts:
50
+ - "think"
51
+ - "chat"
52
+ - "code"
53
+ - "roleplay"
54
+ gate_mode: hidden
55
+ dtype: float16
56
+ ```
57
+
58
+ ## Join out Discord
59
+
60
+ If you'd like to discuss potential collaborations or applications, feel free to reach out to me on Discord: [insert Discord link here]