aashish1904 committed on
Commit
9cedcfb
·
verified ·
1 Parent(s): 924707e

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +99 -0
README.md ADDED
@@ -0,0 +1,99 @@
1
+
2
+ ---
3
+
4
+ library_name: transformers
5
+ license: cc-by-2.0
6
+
7
+ ---
8
+
9
+ ![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
10
+
11
+ # QuantFactory/Duet_Minitron8b_v0.5-GGUF
12
+ This is a quantized version of [G-reen/Duet_Minitron8b_v0.5](https://huggingface.co/G-reen/Duet_Minitron8b_v0.5), created using llama.cpp.
13
+
14
+ # Original Model Card
15
+
16
+ **Warning: This is a proof of concept. Models trained using the Duet dataset may behave differently from other models, due to the uniqueness of the data generation pipeline. I am also unsure if this model will be good at RP, as it hasn't explicitly seen any multiturn roleplaying data in its training.**
17
+
18
+ **Lastly, this model has not yet undergone extensive testing. It is probably uncensored, and may output false, sexual, or other undesirable information. Use at your own risk. I am not responsible for any harm caused by this model or its outputs.**
19
+
20
+ ## Datasets used:
21
+ - https://huggingface.co/datasets/G-reen/Duet-v0.5
22
+ - https://huggingface.co/datasets/deepmind/code_contests
23
+ - https://huggingface.co/datasets/nothingiisreal/Reddit-Dirty-And-WritingPrompts
24
+ - https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts
25
+ - https://huggingface.co/datasets/Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
26
+ - https://huggingface.co/datasets/lighteval/MATH-Hard
27
+
28
+ ## Model used: https://huggingface.co/nvidia/Mistral-NeMo-Minitron-8B-Base
29
+
30
+ ## Prompt format: Llama3 Chat
31
+ ```
32
+ <|begin_of_text|><|start_header_id|>system<|end_header_id|>
33
+
34
+ You are a helpful AI assistant for travel tips and recommendations<|eot_id|><|start_header_id|>user<|end_header_id|>
35
+
36
+ What can you help me with?<|eot_id|><|start_header_id|>assistant<|end_header_id|>
37
+ ```
38
+
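The template above can be assembled programmatically. The following is a minimal sketch, not the author's tooling: the function name `build_llama3_prompt` is hypothetical, and the blank line emitted after the final assistant header is an assumption based on the usual Llama 3 chat convention. Passing an empty system prompt omits the system block, which is meant to correspond to the "(No system prompt)" case.

```python
def build_llama3_prompt(system: str, user: str) -> str:
    """Assemble a single-turn prompt in the Llama3 Chat layout shown above.

    `build_llama3_prompt` is a hypothetical helper, not part of any library.
    """
    parts = ["<|begin_of_text|>"]
    if system:
        # Optional system block; skipped entirely when `system` is empty.
        parts.append(
            f"<|start_header_id|>system<|end_header_id|>\n\n{system}<|eot_id|>"
        )
    parts.append(f"<|start_header_id|>user<|end_header_id|>\n\n{user}<|eot_id|>")
    # Assumption: the assistant header is followed by a blank line, after
    # which the model is expected to start generating its reply.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


if __name__ == "__main__":
    print(
        build_llama3_prompt(
            "You are a helpful AI assistant for travel tips and recommendations",
            "What can you help me with?",
        )
    )
```

Many llama.cpp front ends apply a chat template on their own; a helper like this is only useful when sending raw completion requests.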
39
+ ## System Prompts used during training:
40
+ - ```You are a creative writer who writes stories based on the given text.```
41
+ - ```You are an AI assistant that thinks step by step and solves problems.```
42
+ - ```Roleplay as the below character while also narrating and thinking step by step to solve problems:```
43
+ - (No system prompt)
44
+
45
+ To generate "Duet-style" responses, write narration for your character in addition to asking your question. For instance, instead of "What is 6+2234", write 'The (character name here) wondered to himself, "What is 6+2234?"'
46
+
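The narration wrapping described above can be sketched as a small helper. This is illustrative only: the names `narrate_question` and `build_duet_messages` are hypothetical, and placing the character description on the line after the roleplay system prompt is an assumption about how the two pieces are combined, not something the model card states.

```python
# System prompt copied from the "System Prompts used during training" list.
ROLEPLAY_SYSTEM_PROMPT = (
    "Roleplay as the below character while also narrating and "
    "thinking step by step to solve problems:"
)


def narrate_question(character_name: str, question: str,
                     pronoun: str = "himself") -> str:
    """Wrap a bare question in third-person narration (hypothetical helper)."""
    return f'{character_name} wondered to {pronoun}, "{question}"'


def build_duet_messages(character_description: str, character_name: str,
                        question: str) -> tuple[str, str]:
    """Return a (system, user) pair for a Duet-style request.

    Assumption: the character description goes on the line after the
    roleplay system prompt, and the narrated question is the user message.
    """
    system = f"{ROLEPLAY_SYSTEM_PROMPT}\n{character_description}"
    user = narrate_question(character_name, question)
    return system, user


if __name__ == "__main__":
    system, user = build_duet_messages(
        "James is a pirate from the Caribbean who is shy and angry.",
        "James",
        "What is 6+2234?",
    )
    print(system)
    print(user)
```

The resulting pair can then be placed into the system and user slots of the Llama3 Chat prompt format.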
47
+ ## Example:
48
+ ```
49
+ Roleplay as the below character while also narrating and thinking step by step to solve problems:
50
+ James is a pirate from the Caribbean who is shy and angry.
51
+ ---------------
52
+ *James thinks to himself* What is C# code to find the 6th fibonacchi number?
53
+ ```
54
+
55
+ ```
56
+ James took a long drag from his pipe, the sweet taste of the tobacco calming his nerves. He was sitting at his makeshift desk, a wooden crate on the beach, surrounded by scattered papers and half-finished projects. The sun was setting, casting a warm orange glow over the ocean, and James couldn't help but feel a sense of peace wash over him.
57
+
58
+ "Ah, yeah, the 6th Fibonacci number, that's the ticket," he muttered to himself, tapping his pipe against the crate. "Let's see... if I can just write a little C# code to find that..."
59
+
60
+ He rummaged through his papers, pulling out a crumpled sheet with some Python code scrawled on it. "Hmm, I think I can adapt this to C#... Ah, yeah, that's right, I can just... um..."
61
+
62
+ James paused, his eyes narrowing as he thought through the code. "Okay, so I need to define the sequence... that's just a list, right? Then I can use a loop to generate the numbers..."
63
+
64
+ He scribbled furiously, the words spilling out in a rapid-fire sequence.
65
+
66
+ ```csharp
67
+ using System;
68
+
69
+ class Program
70
+ {
71
+ static void Main()
72
+ {
73
+ // Define the sequence
74
+ List<long> fibonacciSequence = new List<long>();
75
+
76
+ // Seed the sequence with the first two numbers
77
+ fibonacciSequence.Add(0);
78
+ fibonacciSequence.Add(1);
79
+
80
+ // Generate the sequence up to the 6th number
81
+ for (int i = 2; i <= 6; i++)
82
+ {
83
+ long nextNumber = fibonacciSequence[i - 2] + fibonacciSequence[i - 1];
84
+ fibonacciSequence.Add(nextNumber);
85
+ }
86
+
87
+ // Output the 6th Fibonacci number
88
+ Console.WriteLine(fibonacciSequence[5]);
89
+ }
90
+ }```
91
+
92
+ As he finished writing, James leaned back, a satisfied grin spreading across his face. "That should do it. Now, let's just... uh... compile it and run it..."
93
+
94
+ He fumbled with his makeshift computer, a salvaged laptop from a sunken ship, before finally getting it to work. The code compiled without errors, and James held his breath as he ran the program.
95
+
96
+ The output flashed on the screen, and James let out a low whistle. "Fifty-five, huh? That's the 6th Fibonacci number. Not bad, not bad at all."
97
+ ```
98
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/65a5c0e82823ba72ed2cee7d/8PpaQ1krvuQAh7gKK_MgP.png)
99
+ *I'm currently experimenting with various training setups that utilize the free TPU on Kaggle, so the loss probably isn't as optimal as it could be (I ran out of compute and couldn't do any more experiments).*