Qurtana
/

SmolLM2-1.7B-Instruct-NuminaMath-TIR

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Qurtana commited on Nov 29, 2024

Commit

0997768

·

verified ·

1 Parent(s): 4b742c5

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -23,4 +23,8 @@ datasets:
 This model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 This model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+Trained using rank-stablized QLoRA with r = 64 and alpha = 5 for one epoch using the "ChatML" data prep.
+The following heads were targeted: "q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj", "embed_tokens", and "lm_head".
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)