nsfwthrowitaway69
/

Venus-103b-v1.0

Text Generation

Not-For-All-Audiences

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nsfwthrowitaway69 commited on Nov 30, 2023

Commit

e1cd85e

•

1 Parent(s): 4accbec

Update README.md

Files changed (1) hide show

README.md +25 -0

README.md CHANGED Viewed

@@ -1,3 +1,28 @@
 ---
 license: llama2
 ---

 ---
 license: llama2
+language:
+ - en
+tags:
+ - not-for-all-audiences
 ---
+# Venus 103b - version 1.0
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/655febd724e0d359c1f21096/BSKlxWQSbh-liU8kGz4fF.png)
+## Overview
+A smaller version of Venus-120b that uses the same base models.
+## Model Details
+- A result of interleaving layers of [Sao10K/Euryale-1.3-L2-70B](https://huggingface.co/Sao10K/Euryale-1.3-L2-70B), [NousResearch/Nous-Hermes-Llama2-70b](https://huggingface.co/NousResearch/Nous-Hermes-Llama2-70b), and [migtissera/SynthIA-70B-v1.5](https://huggingface.co/migtissera/SynthIA-70B-v1.5) using [mergekit](https://github.com/cg123/mergekit).
+- The resulting model has 120 layers and approximately 103 billion parameters.
+- See mergekit-config.yml for details on the merge method used.
+- See the `exl2-*` branches for exllama2 quantizations. The 5.65 bpw quant should fit in 80GB VRAM, and the 3.35 bpw quant should fit in 48GB VRAM.
+**Warning: This model will produce NSFW content!**
+## Results
+Seems to be a bit more coherent than Venus-120b, likely due to using SynthIA 1.2b instead of SynthIA 1.5.