afrideva commited on
Commit
df1c76b
1 Parent(s): b8d23e9

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +65 -0
README.md ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: cognitivecomputations/dolphin-2.9.3-qwen2-0.5b
3
+ datasets:
4
+ - cognitivecomputations/Dolphin-2.9
5
+ - teknium/OpenHermes-2.5
6
+ - cognitivecomputations/samantha-data
7
+ - microsoft/orca-math-word-problems-200k
8
+ inference: true
9
+ license: apache-2.0
10
+ model_creator: cognitivecomputations
11
+ model_name: dolphin-2.9.3-qwen2-0.5b
12
+ pipeline_tag: text-generation
13
+ quantized_by: afrideva
14
+ tags:
15
+ - generated_from_trainer
16
+ - axolotl
17
+ - gguf
18
+ - ggml
19
+ - quantized
20
+ ---
21
+
22
+ # dolphin-2.9.3-qwen2-0.5b-GGUF
23
+
24
+ Quantized GGUF model files for [dolphin-2.9.3-qwen2-0.5b](https://huggingface.co/cognitivecomputations/dolphin-2.9.3-qwen2-0.5b) from [cognitivecomputations](https://huggingface.co/cognitivecomputations)
25
+
26
+ ## Original Model Card:
27
+
28
+ # Dolphin 2.9.3 Qwen2 0.5B 🐬
29
+
30
+ Curated and trained by Eric Hartford, Lucas Atkins, and Fernando Fernandes, and Cognitive Computations
31
+
32
+ [![Discord](https://img.shields.io/discord/1156064224225808488?logo=Discord&logoColor=%23ffffff&label=Discord&link=https%3A%2F%2Fdiscord.gg%2FtCMkMDDHwm)](https://discord.gg/cognitivecomputations)
33
+ Discord: https://discord.gg/cognitivecomputations
34
+
35
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png" width="600" />
36
+
37
+ Our appreciation for the sponsors of Dolphin 2.9.3:
38
+ - [Crusoe Cloud](https://crusoe.ai/) - provided excellent on-demand 8xH100 node
39
+
40
+ This model is based on Qwen2-0.5b, and is governed by the Apache-2.0
41
+
42
+ The base model has 128k context, and the full-weight fine-tuning was with 16k sequence length.
43
+
44
+ Due to the complexities of fine tuning smaller models on datasets created by/for larger models - we removed coding, function calling and systemchat-multilingual datasets when tuning these models.
45
+
46
+
47
+ example:
48
+
49
+ ```
50
+ <|im_start|>system
51
+ You are Dolphin, a helpful AI assistant.<|im_end|>
52
+ <|im_start|>user
53
+ {prompt}<|im_end|>
54
+ <|im_start|>assistant
55
+
56
+ ```
57
+
58
+ Dolphin-2.9.3 has a variety of instruction, and conversational skills.
59
+
60
+ Dolphin is uncensored. We have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant with any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
61
+
62
+ Dolphin is licensed according to Apache-2.0 We grant permission for any use, including commercial, that falls within accordance with said license. Dolphin was trained on data generated from GPT4, among other models.
63
+
64
+ Evals:
65
+ <img src="https://i.ibb.co/Ct0McfK/7872e88d-e9bf-4640-a9af-294e3127f87a.png" width="600" />