Quazim0t0 commited on
Commit
700d604
·
verified ·
1 Parent(s): c673978

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -8
README.md CHANGED
@@ -1,28 +1,36 @@
1
  ---
2
- base_model:
3
- - unsloth/phi-4-unsloth-bnb-4bit
4
- - microsoft/phi-4
5
  tags:
6
  - text-generation-inference
7
  - transformers
8
  - unsloth
9
  - llama
10
- - trl
11
- - sft
12
  license: apache-2.0
13
  language:
14
  - en
15
  datasets:
16
  - bespokelabs/Bespoke-Stratos-17k
 
 
 
 
17
  ---
18
 
19
- # Uploaded model
20
 
21
  - **Developed by:** Quazim0t0
22
- - **License:** apache-2.0
23
  - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
 
24
  - **Trained for 8 Hours on A800 with the Bespoke Stratos 17k Dataset.**
25
- - https://openwebui.com/f/quaz93/phi4_turn_r1_distill_thought_function_v1
 
 
 
 
 
 
 
26
 
27
  # Phi4 Turn R1Distill LoRA Adapters
28
 
 
1
  ---
2
+ base_model: unsloth/phi-4-unsloth-bnb-4bit
 
 
3
  tags:
4
  - text-generation-inference
5
  - transformers
6
  - unsloth
7
  - llama
8
+ - gguf
 
9
  license: apache-2.0
10
  language:
11
  - en
12
  datasets:
13
  - bespokelabs/Bespoke-Stratos-17k
14
+ - bespokelabs/Bespoke-Stratos-35k
15
+ - NovaSky-AI/Sky-T1_data_17k
16
+ - Quazim0t0/BenfordsLawReasoningJSON
17
+ - open-thoughts/OpenThoughts-114k
18
  ---
19
 
20
+ # TurnPhi Project
21
 
22
  - **Developed by:** Quazim0t0
 
23
  - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
24
+ - **GGUF**
25
  - **Trained for 8 Hours on A800 with the Bespoke Stratos 17k Dataset.**
26
+ - **Trained for 6 Hours on A800 with the Bespoke Stratos 35k Dataset.**
27
+ - **Trained for 2 Hours on A800 with the Benford's Law Reasoning Small 430 Row Dataset, ensuring no overfitting.**
28
+ - **Trained for 4 Hours on A800 with the Sky-T1_data_17k Dataset**
29
+ - **Trained for 6 Hours on A800 with the Openthoughts 114k Dataset.**
30
+ - **18$ Training...I'm actually amazed by the results.**
31
+
32
+ # OpenWeb UI Function
33
+ If using this model for Open WebUI here is a simple function to organize the models responses: https://openwebui.com/f/quaz93/phi4_turn_r1_distill_thought_function_v1
34
 
35
  # Phi4 Turn R1Distill LoRA Adapters
36