munish0838 commited on
Commit
e1689d1
1 Parent(s): 7cf73f8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ base_model: HuggingFaceTB/SmolLM-135M
5
+
6
+ ---
7
+
8
+ ![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)
9
+
10
+ # QuantFactory/Biggie-SmoLlm-0.15B-Base-GGUF
11
+ This is quantized version of [nisten/Biggie-SmoLlm-0.15B-Base](https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base) created using llama.cpp
12
+
13
+ # Original Model Card
14
+
15
+
16
+ ###EVEN SMALLER Frankenstein of smolLm-0.13b upped to 0.15b
17
+ Use this frankenbase for training.
18
+
19
+ Done via semi-automated continuous merging to figure out the recipe.
20
+ Model is more coherent.
21
+
22
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/H6rv3ULQip4sYPpGGiZZe.png)
23
+
24
+ ```bash
25
+ wget https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base/resolve/main/Biggie_SmolLM_0.15B_Base_bf16.gguf
26
+ ```
27
+ ```verilog
28
+ llama-cli -ngl 99 -co --temp 0 -p "How to build a city on Mars via calculating Aldrin-Cycler orbits?" -m Biggie_SmolLM_0.15B
29
+ _Base_bf16.gguf
30
+ ```
31
+ The temperature settings and min p etc need to be adjusted but even at default temp0 it was coherent for first 100 tokens.
32
+ Amazing option for further training. And this is a merge of the base, not the instruct!
33
+
34
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/UK0_mQxy6GOHKxGKBbdhx.png)
35
+
36
+ I don't understand how the f a 150mb file can talk but it can
37
+