Quazim0t0
/

Phi4.Turn.R1Distill_v1.5.1-Tensors

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Quazim0t0 commited on 13 days ago

Commit

700d604

·

verified ·

1 Parent(s): c673978

Update README.md

Files changed (1) hide show

README.md +16 -8

README.md CHANGED Viewed

@@ -1,28 +1,36 @@
 ---
-base_model:
-- unsloth/phi-4-unsloth-bnb-4bit
-- microsoft/phi-4
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
-- trl
-- sft
 license: apache-2.0
 language:
 - en
 datasets:
 - bespokelabs/Bespoke-Stratos-17k
 ---
-# Uploaded  model
 - **Developed by:** Quazim0t0
-- **License:** apache-2.0
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
 - **Trained for 8 Hours on A800 with the Bespoke Stratos 17k Dataset.**
-- https://openwebui.com/f/quaz93/phi4_turn_r1_distill_thought_function_v1
 # Phi4 Turn R1Distill LoRA Adapters

 ---
+base_model: unsloth/phi-4-unsloth-bnb-4bit
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - llama
+- gguf
 license: apache-2.0
 language:
 - en
 datasets:
 - bespokelabs/Bespoke-Stratos-17k
+- bespokelabs/Bespoke-Stratos-35k
+- NovaSky-AI/Sky-T1_data_17k
+- Quazim0t0/BenfordsLawReasoningJSON
+- open-thoughts/OpenThoughts-114k
 ---
+# TurnPhi Project
 - **Developed by:** Quazim0t0
 - **Finetuned from model :** unsloth/phi-4-unsloth-bnb-4bit
+- **GGUF**
 - **Trained for 8 Hours on A800 with the Bespoke Stratos 17k Dataset.**
+- **Trained for 6 Hours on A800 with the Bespoke Stratos 35k Dataset.**
+- **Trained for 2 Hours on A800 with the Benford's Law Reasoning Small 430 Row Dataset, ensuring no overfitting.**
+- **Trained for 4 Hours on A800 with the Sky-T1_data_17k Dataset**
+- **Trained for 6 Hours on A800 with the Openthoughts 114k Dataset.**
+- **18$ Training...I'm actually amazed by the results.**
+# OpenWeb UI Function
+If using this model for Open WebUI here is a simple function to organize the models responses: https://openwebui.com/f/quaz93/phi4_turn_r1_distill_thought_function_v1
 # Phi4 Turn R1Distill LoRA Adapters