Update README.md
README.md
CHANGED
@@ -1,42 +1,31 @@
 ---
 base_model:
--
+- meta-llama/Llama-3.2-1B
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
--
-- DavieLion/SPIN_iter1
+- HuggingFaceH4/ultrachat_200k
 model-index:
-- name:
+- name: Llama-3.2-1B-SPIN-iter3
   results: []
-license:
+license: llama3.2
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# iter1
+# Llama-3.2-1B-SPIN-iter1
 
-This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the
+This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) on the [HuggingFaceH4/ultrachat_200k](https://huggingface.co/datasets/HuggingFaceH4/ultrachat_200k) dataset.
 
 ## Model description
 
 - Model type: A 1B parameter GPT-like model fine-tuned on synthetic datasets.
 - Language(s) (NLP): Primarily English
-- License:
+- License: Llama 3.2 Community License Agreement
 - Finetuned from model: meta-llama/Llama-3.2-1B
 
-## Intended uses & limitations
-
-More information needed
-
-## Training and evaluation data
-
-More information needed
-
-## Training procedure
-
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
@@ -54,10 +43,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 6.0
 
-### Training results
-
-
-
-
 ### Framework versions
 
 - Transformers 4.37.0
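
The updated card still has no usage snippet, so here is a minimal inference sketch. The repo id `DavieLion/Llama-3.2-1B-SPIN-iter1` is an assumption inferred from the dataset owner (`DavieLion/SPIN_iter1`) and the model name in this diff; substitute the checkpoint's actual Hub path.

```python
# Minimal inference sketch for the fine-tuned checkpoint described above.
# NOTE: the repo id below is hypothetical -- replace it with the real
# Hub path where this SPIN checkpoint is published.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "DavieLion/Llama-3.2-1B-SPIN-iter1"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

prompt = "Explain self-play fine-tuning in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```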
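For readers mapping the card's hyperparameter list back to code, a sketch of the corresponding `transformers.TrainingArguments` follows. Only the two values visible in this hunk are taken from the card; every other hyperparameter is hidden by the hunk header and deliberately omitted, and the output path is hypothetical.

```python
# Sketch only: maps the two hyperparameters visible in this diff onto
# transformers.TrainingArguments. All other settings are elided by the
# hunk header (@@ -54,10 +43,6 @@) and intentionally left out.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="outputs/Llama-3.2-1B-SPIN-iter1",  # hypothetical path
    warmup_ratio=0.1,      # card: lr_scheduler_warmup_ratio: 0.1
    num_train_epochs=6.0,  # card: num_epochs: 6.0
)
```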