llama-duo/llama3.1-8b-classification-gpt4o-100k

@@ -1,11 +1,10 @@
 ---
 base_model: meta-llama/Meta-Llama-3.1-8B
 datasets:
-- llama-duo/synth_classification_dataset_dedup
 library_name: peft
 license: llama3.1
 tags:
-- alignment-handbook
 - trl
 - sft
 - generated_from_trainer
@@ -19,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 # llama3.1-8b-classification-gpt4o-100k
-This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on the llama-duo/synth_classification_dataset_dedup dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.8520
 ## Model description
@@ -56,18 +55,18 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 1.4961 | 0.9978 | 225 | 1.7708 |
-| 1.3952 | 2.0 | 451 | 1.7770 |
-| 1.3491 | 2.9978 | 676 | 1.7484 |
-| 1.3025 | 4.0 | 902 | 1.7902 |
-| 1.2904 | 4.9978 | 1127 | 1.7997 |
-| 1.2729 | 6.0 | 1353 | 1.8170 |
-| 1.2451 | 6.9978 | 1578 | 1.8180 |
-| 1.229 | 8.0 | 1804 | 1.8372 |
-| 1.2239 | 8.9978 | 2029 | 1.8482 |
-| 1.2051 | 9.9778 | 2250 | 1.8520 |
 ### Framework versions

 ---
 base_model: meta-llama/Meta-Llama-3.1-8B
 datasets:
+- generator
 library_name: peft
 license: llama3.1
 tags:
 - trl
 - sft
 - generated_from_trainer
 # llama3.1-8b-classification-gpt4o-100k
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.0330
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.2062 | 1.0 | 296 | 1.6781 |
+| 1.1339 | 2.0 | 592 | 1.6897 |
+| 1.0779 | 3.0 | 888 | 1.7536 |
+| 1.0043 | 4.0 | 1184 | 1.8225 |
+| 0.9288 | 5.0 | 1480 | 2.0044 |
+| 0.8437 | 6.0 | 1776 | 2.1710 |
+| 0.7654 | 7.0 | 2072 | 2.4080 |
+| 0.7117 | 8.0 | 2368 | 2.6554 |
+| 0.6916 | 9.0 | 2664 | 2.9172 |
+| 0.6652 | 10.0 | 2960 | 3.0330 |
 ### Framework versions
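
In the updated table, training loss falls every epoch while validation loss rises from 1.6781 to 3.0330, so the final-epoch loss reported in the card is the highest of the ten evaluations. A minimal sketch for locating the row with the lowest validation loss, using values copied from the updated table; the names `ROWS` and `best_row` are illustrative and not part of the repository:

```python
# Rows copied from the updated "Training results" table:
# (training loss, epoch, step, validation loss)
ROWS = [
    (1.2062, 1.0, 296, 1.6781),
    (1.1339, 2.0, 592, 1.6897),
    (1.0779, 3.0, 888, 1.7536),
    (1.0043, 4.0, 1184, 1.8225),
    (0.9288, 5.0, 1480, 2.0044),
    (0.8437, 6.0, 1776, 2.1710),
    (0.7654, 7.0, 2072, 2.4080),
    (0.7117, 8.0, 2368, 2.6554),
    (0.6916, 9.0, 2664, 2.9172),
    (0.6652, 10.0, 2960, 3.0330),
]


def best_row(rows):
    """Return the row with the lowest validation loss (last column)."""
    return min(rows, key=lambda r: r[3])


best = best_row(ROWS)
final = ROWS[-1]
print(f"lowest validation loss: {best[3]} at epoch {best[1]} (step {best[2]})")
print(f"final validation loss:  {final[3]} at epoch {final[1]} (step {final[2]})")
```

On these numbers the lowest validation loss is at epoch 1, which may indicate overfitting in later epochs. Whether an earlier checkpoint is actually preferable depends on the downstream classification metric, which this diff does not report.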

all_results.json

@@ -1,14 +1,9 @@
 {
- "epoch": 9.977827050997783,
- "eval_loss": 1.8520119190216064,
- "eval_runtime": 0.3553,
- "eval_samples": 16,
- "eval_samples_per_second": 2.814,
- "eval_steps_per_second": 2.814,
- "total_flos": 3.3259687719144e+18,
- "train_loss": 1.3362829395929972,
- "train_runtime": 6815.0283,
  "train_samples": 92634,
- "train_samples_per_second": 10.572,
- "train_steps_per_second": 0.33
 }

 {
+ "epoch": 10.0,
+ "total_flos": 4.416382035459834e+18,
+ "train_loss": 0.922980490487975,
+ "train_runtime": 12382.7598,
  "train_samples": 92634,
+ "train_samples_per_second": 7.645,
+ "train_steps_per_second": 0.239
 }
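
The throughput fields can be cross-checked against the step counts in the README tables: `train_steps_per_second` should be roughly the final step divided by `train_runtime`. A small sketch of that arithmetic for both runs, assuming the final steps in the tables (2250 before, 2960 after) are the total optimizer steps; the helper name `steps_per_second` is hypothetical:

```python
def steps_per_second(total_steps, runtime_seconds):
    """Optimizer steps completed per second of training."""
    return total_steps / runtime_seconds


# Final step from the README table, plus train_runtime and
# train_samples_per_second from all_results.json, for each side of the diff.
before = {"steps": 2250, "runtime": 6815.0283, "samples_per_second": 10.572}
after = {"steps": 2960, "runtime": 12382.7598, "samples_per_second": 7.645}

for name, run in (("before", before), ("after", after)):
    sps = steps_per_second(run["steps"], run["runtime"])
    # Samples processed per optimizer step, inferred from the two rates.
    per_step = run["samples_per_second"] / sps
    print(f"{name}: {sps:.3f} steps/s, ~{per_step:.1f} samples/step")
```

Both runs come out close to the reported 0.33 and 0.239 steps per second, and to roughly 32 samples per step. That would be consistent with an unchanged effective batch size, but the hyperparameter hunk is not shown in this diff, so it remains an inference.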

train_results.json

@@ -1,9 +1,9 @@
 {
- "epoch": 9.977827050997783,
- "total_flos": 3.3259687719144e+18,
- "train_loss": 1.3362829395929972,
- "train_runtime": 6815.0283,
  "train_samples": 92634,
- "train_samples_per_second": 10.572,
- "train_steps_per_second": 0.33
 }

 {
+ "epoch": 10.0,
+ "total_flos": 4.416382035459834e+18,
+ "train_loss": 0.922980490487975,
+ "train_runtime": 12382.7598,
  "train_samples": 92634,
+ "train_samples_per_second": 7.645,
+ "train_steps_per_second": 0.239
 }

trainer_state.json

The diff for this file is too large to render.