yizhujiao/llama2-7b-sft-math

Generated from Trainer

yizhujiao committed on Jun 27, 2024

Commit 467c3a2 • 1 Parent(s): 3c492dc

Model save

Files changed (2)

README.md +3 -5
adapter_model.safetensors +1 -1

README.md CHANGED

@@ -5,7 +5,7 @@ tags:
 - trl
 - sft
 - generated_from_trainer
-base_model: meta-llama/Llama-2-7b-hf
+base_model: meta-llama/Llama-2-7b-chat-hf
 model-index:
 - name: llama2-7b-sft-math
   results: []
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # llama2-7b-sft-math
-This model is a fine-tuned version of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on an unknown dataset.
+This model is a fine-tuned version of [meta-llama/Llama-2-7b-chat-hf](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf) on an unknown dataset.
 ## Model description
@@ -36,11 +36,9 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1.41e-05
-- train_batch_size: 32
+- train_batch_size: 1
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 16
-- total_train_batch_size: 512
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3.0
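
The batch-size change in this diff can be read as arithmetic. The removed lines are consistent with 32 × 16 = 512 examples per optimizer step, while the new card lists only `train_batch_size: 1`, with no accumulation or total line. The sketch below is illustrative only: `effective_batch_size` is a hypothetical helper, and a single training device is assumed because the card does not state a device count.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int = 1,
                         num_devices: int = 1) -> int:
    """Return the number of examples that contribute to one optimizer step."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Old card: train_batch_size 32 with 16 accumulation steps (one device assumed).
old_total = effective_batch_size(32, gradient_accumulation_steps=16)

# New card: train_batch_size 1; accumulation and device count are assumed to be 1
# because the card no longer lists them.
new_total = effective_batch_size(1)

print(old_total, new_total)
```

If those assumptions hold, each optimizer step in the new run sees far fewer examples than before, while the learning rate of 1.41e-05 is unchanged in the card.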

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0b9364e251382a06c1764ca5c7ece075a1369cded087b4e9b5bafe2e77dc8123
+oid sha256:9e774cf7efb9a2b138b9e681a17f6065c443469e90e770442cba2ba43fdef1ec
 size 8405472
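
Only the `oid` line of the Git LFS pointer changes, so the adapter weights were replaced by a file of the same size (8405472 bytes) with different content. As a rough sketch of how such a pointer could be checked against a downloaded file, the snippet below parses the three pointer fields and compares size and SHA-256 digest. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Check that raw bytes have the size and SHA-256 digest the pointer records."""
    return (len(data) == pointer["size"]
            and hashlib.sha256(data).hexdigest() == pointer["digest"])


# Pointer content as committed in 467c3a2.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9e774cf7efb9a2b138b9e681a17f6065c443469e90e770442cba2ba43fdef1ec
size 8405472
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

To verify a local copy, the file's bytes would be read in binary mode and passed to `matches_pointer` together with the parsed pointer.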