sengi
/

zephyr-7b-pl-qlora

alignment-handbook

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

sengi commited on Apr 23

Commit

f211441

•

1 Parent(s): 3073f9b

End of training

Files changed (2) hide show

README.md +6 -2
config.json +1 -1

README.md CHANGED Viewed

@@ -2,12 +2,16 @@
 license: apache-2.0
 library_name: peft
 tags:
 - trl
 - sft
 - alignment-handbook
 - generated_from_trainer
 datasets:
-- generator
 base_model: alignment-handbook/zephyr-7b-sft-full
 model-index:
 - name: zephyr-7b-pl-qlora
@@ -19,7 +23,7 @@ should probably proofread and complete it, then remove this comment. -->
 # zephyr-7b-pl-qlora
-This model is a fine-tuned version of [alignment-handbook/zephyr-7b-sft-full](https://huggingface.co/alignment-handbook/zephyr-7b-sft-full) on the generator dataset.
 ## Model description

 license: apache-2.0
 library_name: peft
 tags:
+- alignment-handbook
+- trl
+- sft
+- generated_from_trainer
 - trl
 - sft
 - alignment-handbook
 - generated_from_trainer
 datasets:
+- HuggingFaceH4/ultrachat_200k
 base_model: alignment-handbook/zephyr-7b-sft-full
 model-index:
 - name: zephyr-7b-pl-qlora
 # zephyr-7b-pl-qlora
+This model is a fine-tuned version of [alignment-handbook/zephyr-7b-sft-full](https://huggingface.co/alignment-handbook/zephyr-7b-sft-full) on the HuggingFaceH4/ultrachat_200k dataset.
 ## Model description

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "mistralai/Mistral-7B-v0.1",
   "architectures": [
     "MistralForCausalLM"
   ],

 {
+  "_name_or_path": "alignment-handbook/zephyr-7b-sft-full",
   "architectures": [
     "MistralForCausalLM"
   ],