MarsupialAI committed
Commit cb875f2
Parent(s): 69d7466
Update README.md

README.md CHANGED
Previous README.md (removed):

---
tags:
- merge
---

### Models Merged

* f:\raw\moist14
* f:\raw\solstice14
* f:\raw\fimbul14

### Configuration

```
models:
  - model: f:\raw\fimbul14
    parameters:
      weight: 1.0
  - model: f:\raw\moist14
    parameters:
      weight: 1.0
  - model: f:\raw\solstice14
    parameters:
      weight: 1.0
merge_method: linear
dtype: float16
```

Updated README.md:
---
license: cc-by-nc-4.0
language:
- en
tags:
- solar
---
# Skunk Ape 14b

![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/65a531bc7ec6af0f95c707b1/p9tbuezkb2qvf8kWEnO_2.jpeg)

This version performs *substantially* better than the 16b version.

This model is a merge of three self-merged Solar-based models in a 14b (64-layer) configuration. The result of this "frankenmerge" is a medium-sized model that contains what I consider to be the best of the Solar finetunes.

Mergefuel:
- Sao10K/Fimbulvetr-11B-v2
- Sao10K/Solstice-11B-v1
- TheDrummer/Moistral-11B-v1
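
The exact recipe lives in recipe.txt. Purely as an illustration of the self-merge step, a mergekit passthrough config that expands one 48-layer Solar finetune into a 64-layer stack might look like the sketch below; the layer ranges here are invented for illustration and are not taken from the actual recipe.

```
# Hypothetical self-merge sketch: one donor stacked with an overlapping copy of
# itself to reach 64 layers. The real slices used for Skunk Ape are in recipe.txt.
slices:
  - sources:
      - model: Sao10K/Fimbulvetr-11B-v2
        layer_range: [0, 32]
  - sources:
      - model: Sao10K/Fimbulvetr-11B-v2
        layer_range: [16, 48]   # overlapping slice brings the total to 64 layers
merge_method: passthrough
dtype: float16
```

The three resulting 64-layer stacks (presumably the f:\raw\*14 models) are then averaged with equal weights, as in the linear merge configuration shown above.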

This model is uncensored and capable of generating objectionable material. However, it is not an explicitly NSFW model, and in my experience it has never "gone rogue" and tried to insert NSFW content into SFW prompts. As with any LLM, no factual claims made by the model should be taken at face value. You know that boilerplate safety disclaimer that most professional models have? Assume this has it too. This model is for entertainment purposes only.

iMatrix GGUFs: https://huggingface.co/MarsupialAI/SkunkApe-14b_iMatrix_GGUF


# Sample output

```
{{[INPUT]}}
Write a detailed and humorous story about a cute and fluffy bunny that goes to a Gwar concert.
{{[OUTPUT]}}

<<<This goes on for a while. See sample.txt for full output>>>
```


# Prompt format
Prefers alpaca.
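
A standard Alpaca-style prompt looks roughly like this (the preamble line and spacing below are the common Alpaca defaults, not something specified by this card):

```
Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
Write a detailed and humorous story about a cute and fluffy bunny that goes to a Gwar concert.

### Response:
```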

# Weird merge fuckery
According to Toasty Pigeon, FM, Akai, and probably others on the KAI discord, this merge method works better than a normal stacked merge. I don't pretend to understand why, but the huge PPL improvement (5.96 for this model vs. 7.65 for the 16b at Q4_K_M) indicates that they're right. See recipe.txt for all the alchemy.
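
For context, a "normal" stacked merge interleaves slices of the different donor models directly in a single passthrough pass, roughly like the purely illustrative sketch below (invented layer ranges; this is not the 16b recipe). Skunk Ape instead self-merges each donor to 64 layers first and then averages the three stacks with the linear merge shown above.

```
# Illustrative only: a conventional stacked frankenmerge that interleaves the
# three donors directly, rather than linear-merging three 64-layer self-merges.
slices:
  - sources:
      - model: Sao10K/Fimbulvetr-11B-v2
        layer_range: [0, 24]
  - sources:
      - model: TheDrummer/Moistral-11B-v1
        layer_range: [8, 32]
  - sources:
      - model: Sao10K/Solstice-11B-v1
        layer_range: [24, 48]
merge_method: passthrough
dtype: float16
```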