IeBoytsov
/

llama-3-1-sft-qlora-test

alignment-handbook

Generated from Trainer

4-bit precision

Model card Files Files and versions Community

IeBoytsov commited on Nov 30, 2024

Commit

fa8f354

·

verified ·

1 Parent(s): a0760e3

End of training

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -1,13 +1,13 @@
 ---
 base_model: meta-llama/Llama-3.1-8B
 datasets:
-- generator
 library_name: peft
 license: llama3.1
 tags:
 - trl
 - sft
-- alignment-handbook
 - generated_from_trainer
 model-index:
 - name: llama-3-1-sft-qlora-test
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 # llama-3-1-sft-qlora-test
-This model is a fine-tuned version of [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) on the generator dataset.
 ## Model description

 ---
 base_model: meta-llama/Llama-3.1-8B
 datasets:
+- HuggingFaceH4/ultrachat_200k
 library_name: peft
 license: llama3.1
 tags:
+- alignment-handbook
 - trl
 - sft
 - generated_from_trainer
 model-index:
 - name: llama-3-1-sft-qlora-test
 # llama-3-1-sft-qlora-test
+This model is a fine-tuned version of [meta-llama/Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B) on the HuggingFaceH4/ultrachat_200k dataset.
 ## Model description