Update README.md
corrected Axolotl Version
README.md
CHANGED
@@ -30,7 +30,6 @@ The 25% layer selection ensures minimal computational overhead for fine-tuning.
 
 ## Training:
 - Trained on **2x A40s (48GB VRAM each)** for over 1 hour using the **Axolotl** framework.
-- Fine-tuning aimed to optimize the balance between model performance and resource efficiency, demonstrating how targeted spectrum fine-tuning can deliver substantial improvements without the need for full-scale model adjustments.
 
 
 ### Training hyperparameters
@@ -58,7 +57,7 @@ The following hyperparameters were used during training:
 
 ### Framework versions
 
-- Axolotl 0.4.
+- Axolotl 0.4.1
 - Transformers 4.44.2
 - Pytorch 2.4.0+cu121
 - Datasets 2.20.0
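The hunk context above references the model card's claim that selecting 25% of layers keeps fine-tuning overhead low. In Axolotl, Spectrum-style targeted fine-tuning is typically expressed via the `unfrozen_parameters` config list: all weights are frozen except the modules named there. The excerpt below is a minimal sketch only; the module patterns are hypothetical placeholders, not the ones actually used for this model.

```yaml
# Illustrative Axolotl config excerpt (not taken from this model's run).
# Spectrum-style fine-tuning: freeze everything, then unfreeze only the
# modules selected by a signal-to-noise (SNR) scan.
unfrozen_parameters:
  - ^lm_head.weight$
  - ^model.embed_tokens.weight$
  # hypothetical top-25% layers by SNR; a Spectrum scan produces this list
  - model.layers.0.self_attn.q_proj
  - model.layers.0.mlp.down_proj
  - model.layers.7.mlp.gate_proj
```

Training only this subset is what keeps the memory and compute footprint small enough for the roughly one-hour run on 2x A40s described above.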