Dampish
/

Retry-dante-2.8B-V2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Dampish commited on May 18, 2023

Commit

6687143

·

1 Parent(s): baafcc3

Create README.md

Files changed (1) hide show

README.md +48 -0

README.md ADDED Viewed

	@@ -0,0 +1,48 @@

+---
+license: cc-by-nc-sa-4.0
+---
+Model trained to be as helpful of an assistant as possible.
+### Data split percentage
+60% coding
+10% conversations
+20% instructions
+10% Roleplay
+Now obviously the data contains other elements but this is the biggest part
+### Examples
+```
+The prompt structure can be almost anything, this model has been trained on 0.6 million instruction, which is less than dante.
+ This model has however been trained on a much cleaner and organized dataset and has been retrained multiple times to reach as low
+of a training loss as possible.
+## Examples.
+"You are an AI assistant respond to human in a helpful manner.
+HM: What were the causes for world war 2?
+"
+"Act like a detective from the 1900s, respond to mike in a helpful manner.
+HM: What were the causes for world war 2?
+"
+The prompt also works with alpaca structure. I have purposefully trained it so that it should work like this.
+EOS token is <|end|>.
+Remember to tell it how it should act for best effect.
+```
+### More information
+The base model is GPT-NeoX taken pretrained by redpajama.
+We managed to reach 0.45 Validation loss with a 0.3 learning loss.
+You are not allowed to use this for commerical purposes unless reaching an agreement with the creator @Dampish, @Dampish#3607 on discord.
+The model can easily be further fine tuned to most languages.