aashish1904 committed on
Commit f4ae738 · verified · 1 Parent(s): 23bf6de

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +113 -0
README.md ADDED
@@ -0,0 +1,113 @@
+
+ ---
+
+ library_name: transformers
+ tags:
+ - math
+ - lora
+ - science
+ - chemistry
+ - biology
+ - code
+ - text-generation-inference
+ - unsloth
+ - llama
+ license: apache-2.0
+ datasets:
+ - HuggingFaceTB/smoltalk
+ language:
+ - en
+ - de
+ - es
+ - fr
+ - it
+ - pt
+ - hi
+ - th
+ base_model:
+ - meta-llama/Llama-3.2-1B-Instruct
+
+ ---
+
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
+
+
+ # QuantFactory/FastLlama-3.2-1B-Instruct-GGUF
+ This is a quantized version of [suayptalha/FastLlama-3.2-1B-Instruct](https://huggingface.co/suayptalha/FastLlama-3.2-1B-Instruct), created using llama.cpp.
+
+ # Original Model Card
+
+
+ ![FastLlama-Logo](FastLlama.png)
+
+ You can use the ChatML and Alpaca prompt formats with this model.
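
The card names the two prompt formats but does not show them. The sketch below builds both from the commonly published ChatML and Alpaca layouts. The exact template strings used during fine-tuning are not stated in this card, so the templates, as well as the helper names `to_chatml` and `to_alpaca`, should be read as assumptions rather than the author's code.

```python
# Minimal sketch of the two prompt formats named above.
# The template strings follow the commonly used ChatML and Alpaca layouts;
# they are assumptions, not taken from the model's training code.

def to_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt."""
    parts = [
        f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n" for m in messages
    ]
    if add_generation_prompt:
        # Leave the assistant turn open so the model continues from here.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


ALPACA_HEADER = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
)


def to_alpaca(instruction):
    """Render a single instruction as an Alpaca-style prompt (no input field)."""
    return f"{ALPACA_HEADER}### Instruction:\n{instruction}\n\n### Response:\n"


if __name__ == "__main__":
    chat = [
        {"role": "system", "content": "You are a friendly assistant named FastLlama."},
        {"role": "user", "content": "What is 12 * 7?"},
    ]
    print(to_chatml(chat))
    print(to_alpaca("What is 12 * 7?"))
```

Either string can be passed as a plain text prompt; if the model's own tokenizer ships a chat template, that template may be the safer choice.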
+
+ You can chat with the model via this [space](https://huggingface.co/spaces/suayptalha/Chat-with-FastLlama).
+
+ **Overview:**
+
+ FastLlama is an optimized version of the Llama-3.2-1B-Instruct model, designed for constrained environments where speed, compactness, and accuracy all matter. This version has been fine-tuned on the MetaMathQA-50k subset of the HuggingFaceTB/smoltalk dataset to improve its mathematical reasoning and problem-solving abilities.
+
+ **Features:**
+
+ - Lightweight and Fast: Optimized to deliver Llama-class capabilities with reduced computational overhead.
+ - Fine-Tuned for Math Reasoning: Uses MetaMathQA-50k to better handle complex mathematical problems and logical reasoning tasks.
+ - Instruction-Tuned: Builds on an instruction-tuned base model, making it robust at understanding and executing detailed queries.
+ - Versatile Use Cases: Suitable for educational tools, tutoring systems, or any application requiring mathematical reasoning.
+
+ **Performance Highlights:**
+
+ - Smaller Footprint: The model delivers results comparable to larger counterparts while running efficiently on smaller hardware.
+ - Enhanced Accuracy: Demonstrates improved performance on mathematical QA benchmarks.
+ - Instruction Adherence: Retains high fidelity in understanding and following user instructions, even for complex queries.
+
+ **Loading the Model:**
+ ```py
+ from transformers import pipeline
+
+ model_id = "suayptalha/FastLlama-3.2-1B-Instruct"
+
+ # Build a chat-capable text-generation pipeline.
+ # device_map="auto" requires the `accelerate` package.
+ pipe = pipeline(
+     "text-generation",
+     model=model_id,
+     device_map="auto",
+ )
+
+ messages = [
+     {"role": "system", "content": "You are a friendly assistant named FastLlama."},
+     {"role": "user", "content": "Who are you?"},
+ ]
+
+ outputs = pipe(
+     messages,
+     max_new_tokens=256,
+ )
+
+ # generated_text holds the full conversation; the last entry is the assistant's reply.
+ print(outputs[0]["generated_text"][-1])
+ ```
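
When the pipeline is called with a list of chat messages, `generated_text` holds the whole conversation with the new assistant message last, which is why the snippet above indexes `[-1]`. A small helper can pull out just the reply text. The function name `last_assistant_reply` and the `fake_outputs` data below are hypothetical, used only so the sketch runs without downloading the model.

```python
def last_assistant_reply(outputs):
    """Return the text of the final assistant message from chat-pipeline output."""
    conversation = outputs[0]["generated_text"]
    # Walk backwards so the most recent assistant turn is found first.
    for message in reversed(conversation):
        if message["role"] == "assistant":
            return message["content"]
    raise ValueError("no assistant message found in pipeline output")


if __name__ == "__main__":
    # Hand-written stand-in for the pipeline's return value (not real model output).
    fake_outputs = [{
        "generated_text": [
            {"role": "system", "content": "You are a friendly assistant named FastLlama."},
            {"role": "user", "content": "Who are you?"},
            {"role": "assistant", "content": "(model reply would appear here)"},
        ]
    }]
    print(last_assistant_reply(fake_outputs))
```

With a real pipeline, `last_assistant_reply(outputs)` would replace the final `print` argument in the loading example.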
+
+ **Dataset:**
+
+ Dataset: MetaMathQA-50k
+
+ The MetaMathQA-50k subset of HuggingFaceTB/smoltalk was selected for fine-tuning because of its focus on mathematical reasoning, multi-step problem-solving, and logical inference. The dataset includes:
+
+ - Algebraic problems
+ - Geometric reasoning tasks
+ - Statistical and probabilistic questions
+ - Logical deduction problems
+
+ **Model Fine-Tuning:**
+
+ Fine-tuning was conducted using the following configuration:
+
+ Learning Rate: 2e-4
+
+ Epochs: 1
+
+ Optimizer: AdamW
+
+ Framework: Unsloth
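
As a rough aid for reading this configuration, the sketch below records the listed values in a plain dataclass and estimates how many optimizer steps one epoch implies. The dataset size of 50,000 is inferred from the subset name, and the batch size and gradient-accumulation values are hypothetical, since the card does not report them. This is not the author's training script and does not call Unsloth.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    # Values listed in the model card.
    learning_rate: float = 2e-4
    epochs: int = 1
    optimizer: str = "AdamW"
    framework: str = "Unsloth"
    # Assumptions: none of the following are stated in the card.
    num_examples: int = 50_000            # inferred from the name "MetaMathQA-50k"
    per_device_batch_size: int = 2        # hypothetical
    gradient_accumulation_steps: int = 4  # hypothetical

    def optimizer_steps(self) -> int:
        """Number of optimizer updates over the whole run on a single device."""
        effective_batch = self.per_device_batch_size * self.gradient_accumulation_steps
        return math.ceil(self.num_examples / effective_batch) * self.epochs


if __name__ == "__main__":
    cfg = FineTuneConfig()
    print(cfg.optimizer_steps())  # 6250 with the assumed batch settings
```

Changing the assumed batch settings changes the step count proportionally, which is useful when choosing warmup or logging intervals for a single-epoch run.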
+
+ **License:**
+
+ This model is licensed under the Apache 2.0 License. See the LICENSE file for details.
+
+ [☕ Buy Me a Coffee](https://www.buymeacoffee.com/suayptalha)