DavidAU commited on
Commit
d85c479
1 Parent(s): 322c942

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +284 -0
README.md ADDED
@@ -0,0 +1,284 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ tags:
6
+ - creative
7
+ - creative writing
8
+ - fiction writing
9
+ - plot generation
10
+ - sub-plot generation
11
+ - fiction writing
12
+ - story generation
13
+ - scene continue
14
+ - storytelling
15
+ - fiction story
16
+ - science fiction
17
+ - romance
18
+ - all genres
19
+ - story
20
+ - writing
21
+ - vivid prosing
22
+ - vivid writing
23
+ - fiction
24
+ - roleplaying
25
+ - bfloat16
26
+ - swearing
27
+ - rp
28
+ - horror
29
+ - mistral nemo
30
+ - mergekit
31
+ pipeline_tag: text-generation
32
+ ---
33
+
34
+ (quants uploading... / examples to follow shortly.)
35
+
36
+ <h2><font color="green"> Mistral-Nemo-WORDSTORM-pt4-RCM-Cliffhanger-18.5B-Instruct </font></h2>
37
+
38
+ <img src="cliffhanger.jpg" style="float:right; width:300px; height:300px; padding:10px;">
39
+
40
+ <B><font color="red">WARNING:</font> NSFW. Ultra Detailed. HORROR, VIOLENCE. Swearing. UNCENSORED. SMART.</B>
41
+
42
+ Story telling, writing, creative writing and roleplay running all on Mistral Nemo's 128K+ new core.
43
+
44
+ This is a massive super merge takes all the power of the following 3 powerful models and combines them into one.
45
+
46
+ This model contains "RCM":
47
+
48
+ - Mistral Nemo model at 18.5B consisting of "MN-Rocinante-12B-v1.1" and "Mistral Nemo Instruct 12B"
49
+ - Mistral Nemo model at 18.5B consisting of "MN-12B Celeste-V1.9" and "Mistral Nemo Instruct 12B"
50
+ - Mistral Nemo model at 18.5B consisting of "MN-Magnum-v2.5-12B-kto" and "Mistral Nemo Instruct 12B".
51
+
52
+ <B>Details on the core models:</B>
53
+
54
+ "nothingiisreal/MN-12B-Celeste-V1.9" is #1 (models 8B,13B,20B) on the UGI leaderboard ("UGI" sort),
55
+ is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
56
+
57
+ "anthracite-org/magnum-v2.5-12b-kto" is #1 (models 8B,13B,20B) on the UGI leaderboard ("Writing" sort),
58
+ is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
59
+
60
+ "TheDrummer/Rocinante-12B-v1.1" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard
61
+ (sort "UGI"), is combined with "Mistral Nemo Instruct 12B" (ranked #4 under "writing" models 8B,13B,20B at UGI )
62
+
63
+ "mistralai/Mistral-Nemo-Instruct-2407" is very high scoring model (models 8B,13B,20B) on the UGI Leaderboard (sort "writing")
64
+ and is the base model of all the above 3 fine tuned models.
65
+
66
+ [ https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard ]
67
+
68
+ <B>About this model:</B>
69
+
70
+ This super merge captures the attibutes of all these top models and makes them even stronger:
71
+
72
+ - Instruction following
73
+ - Story output quality
74
+ - Character
75
+ - Internal thoughts
76
+ - Voice
77
+ - Humor
78
+ - Details, connection to the world
79
+ - General depth and intensity
80
+ - Emotional connections.
81
+ - Prose quality
82
+
83
+ This super merge is also super stable (a hairs breath from Mistral Nemo's ppl), and runs with all parameters and settings.
84
+
85
+ 10 versions of this model will be released, this is release #4 - "part 4".
86
+
87
+ <B>Cliffhanger?</B>
88
+
89
+ This model put the user / character in cliffhanger - many times life and death - situations in terms of story and/or roleplaying.
90
+
91
+ It does not hold back.
92
+
93
+ (see some of the examples below for details)
94
+
95
+ Usually I release one or two versions from the "best of the lot", however in this case all
96
+ of the versions turned out so well - all with their own quirks and character - that I will be
97
+ releasing all 10.
98
+
99
+ An additional series 2 and 3 will follow these 10 models as well.
100
+
101
+ (examples generations below)
102
+
103
+ Model may produce NSFW content : Swearing, horror, graphic horror, distressing scenes, etc etc.
104
+
105
+ This model has an INTENSE action AND HORROR bias, with a knack for cliffhangers and surprises.
106
+
107
+ It is not as "dark" as Grand Horror series, but it as intense.
108
+
109
+ This model is perfect for any general, fiction related or roleplaying activities and has a 128k+ context window.
110
+
111
+ This is a fiction model at its core and can be used for any genre(s).
112
+
113
+ WORDSTORM series is a totally uncensored, fiction writing monster and roleplay master. It can also be used for
114
+ just about any general fiction (all genres) activity including:
115
+
116
+ - scene generation
117
+ - scene continuation
118
+ - creative writing
119
+ - fiction writing
120
+ - plot generation
121
+ - sub-plot generation
122
+ - fiction writing
123
+ - story generation
124
+ - storytelling
125
+ - writing
126
+ - fiction
127
+ - roleplaying
128
+ - rp
129
+ - graphic horror
130
+ - horror
131
+ - dark humor
132
+ - nsfw
133
+ - and can be used for any genre(s).
134
+
135
+ <B>Templates to Use:</B>
136
+
137
+ The template used will affect output generation and instruction following.
138
+
139
+ Alpaca:
140
+
141
+ <pre>
142
+ {
143
+ "name": "Alpaca",
144
+ "inference_params": {
145
+ "input_prefix": "### Instruction:",
146
+ "input_suffix": "### Response:",
147
+ "antiprompt": [
148
+ "### Instruction:"
149
+ ],
150
+ "pre_prompt": "Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n"
151
+ }
152
+ }
153
+ </pre>
154
+
155
+ Chatml:
156
+ <pre>
157
+ {
158
+ "name": "ChatML",
159
+ "inference_params": {
160
+ "input_prefix": "<|im_end|>\n<|im_start|>user\n",
161
+ "input_suffix": "<|im_end|>\n<|im_start|>assistant\n",
162
+ "antiprompt": [
163
+ "<|im_start|>",
164
+ "<|im_end|>"
165
+ ],
166
+ "pre_prompt": "<|im_start|>system\nPerform the task to the best of your ability."
167
+ }
168
+ }
169
+ </pre>
170
+
171
+ Mistral Instruct:
172
+
173
+ <pre>
174
+ {
175
+ "name": "Mistral Instruct",
176
+ "inference_params": {
177
+ "input_prefix": "[INST]",
178
+ "input_suffix": "[/INST]",
179
+ "antiprompt": [
180
+ "[INST]"
181
+ ],
182
+ "pre_prompt_prefix": "",
183
+ "pre_prompt_suffix": ""
184
+ }
185
+ }
186
+ </pre>
187
+
188
+ <b>Optional Enhancement:</B>
189
+
190
+ The following can be used in place of the "system prompt" or "system role" to further enhance the model.
191
+
192
+ It can also be used at the START of a NEW chat, but you must make sure it is "kept" as the chat moves along.
193
+ In this case the enhancements do not have as strong effect at using "system prompt" or "system role".
194
+
195
+ Copy and paste EXACTLY as noted, DO NOT line wrap or break the lines, maintain the carriage returns exactly as presented.
196
+
197
+ <PRE>
198
+ Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.
199
+
200
+ Here are your skillsets:
201
+ [MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)
202
+
203
+ [*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)
204
+
205
+ Here are your critical instructions:
206
+ Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.
207
+ </PRE>
208
+
209
+ You do not need to use this, it is only presented as an additional enhancement which seems to help scene generation
210
+ and scene continue functions.
211
+
212
+ This enhancement WAS NOT used to generate the examples below.
213
+
214
+ <h3>MODELS USED:</h3>
215
+
216
+ Special thanks to the incredible work of the model makers "mistralai" "TheDrummer", "anthracite-org", and "nothingiisreal".
217
+
218
+ Models used:
219
+
220
+ [ https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407 ]
221
+
222
+ [ https://huggingface.co/TheDrummer/Rocinante-12B-v1.1 ]
223
+
224
+ [ https://huggingface.co/anthracite-org/magnum-v2.5-12b-kto ]
225
+
226
+ [ https://huggingface.co/nothingiisreal/MN-12B-Celeste-V1.9 ]
227
+
228
+ This is a four step merge (3 pass-throughs => "Fine-Tune" / "Instruct") then "mated" using "DARE-TIES".
229
+
230
+ In involves these three models:
231
+
232
+ [ https://huggingface.co/DavidAU/MN-18.5B-Celeste-V1.9-Story-Wizard-ED1-Instruct-GGUF ]
233
+
234
+ [ https://huggingface.co/DavidAU/MN-Magnum-v2.5-18.5B-kto-Story-Wizard-ED1-Instruct-GGUF ]
235
+
236
+ [ https://huggingface.co/DavidAU/MN-Rocinante-18.5B-v1.1-Story-Wizard-ED1-Instruct-GGUF ]
237
+
238
+ Combined as follows using "MERGEKIT":
239
+
240
+ <PRE>
241
+
242
+ models:
243
+ - model: E:/MN-Rocinante-18.5B-v1.1-Instruct
244
+ - model: E:/MN-magnum-v2.5-12b-kto-Instruct
245
+ parameters:
246
+ weight: .6
247
+ density: .8
248
+ - model: E:/MN-18.5B-Celeste-V1.9-Instruct
249
+ parameters:
250
+ weight: .38
251
+ density: .6
252
+ merge_method: dare_ties
253
+ tokenizer_source: union
254
+ base_model: E:/MN-Rocinante-18.5B-v1.1-Instruct
255
+ dtype: bfloat16
256
+
257
+ </PRE>
258
+
259
+ Special Notes:
260
+
261
+ Due to how DARE-TIES works, everytime you run this merge you will get a slightly different model.
262
+ This is due to "random" pruning method in "DARE-TIES".
263
+
264
+ Mistral Nemo models used here seem acutely sensitive to this process.
265
+
266
+ This shows up in PPL and "real world" tests as well as "TEMP=0" ("core test") generations.
267
+
268
+ PPL range of 7.7327 to 7.8024 ... and that is on just 10 generations.
269
+
270
+ This model: PPL = 7.7916 +/- 0.12694 (100 chunks, wiki.test.raw)
271
+
272
+ "tokenizer_source: union" is used so that multiple "templates" work and each fine tune uses one or two of the templates.
273
+
274
+ <h3>EXAMPLES PROMPTS and OUTPUT:</h3>
275
+
276
+ Examples are created using quant Q4_K_M, "temp=.8", minimal parameters and "Mistral Instruct" template.
277
+
278
+ Model has been tested with "temp" from ".1" to "5".
279
+
280
+ Below are the least creative outputs, prompt is in <B>BOLD</B>.
281
+
282
+ ---
283
+
284
+ [examples coming soon...]