FemkeBakker's picture
Training in progress, epoch 2
ecacde4 verified
|
raw
history blame
1.94 kB
metadata
license: llama2
base_model: meta-llama/Llama-2-7b-chat-hf
tags:
  - trl
  - sft
  - generated_from_trainer
model-index:
  - name: AmsterdamDocClassificationLlama200T2Epochs
    results: []

AmsterdamDocClassificationLlama200T2Epochs

This model is a fine-tuned version of meta-llama/Llama-2-7b-chat-hf on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 0.8173

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 2
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 16
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 2

Training results

Training Loss Epoch Step Validation Loss
1.0345 0.1988 123 0.9800
0.8537 0.3976 246 0.8808
0.5807 0.5964 369 0.8503
0.7419 0.7952 492 0.8413
0.9967 0.9939 615 0.8406
0.7252 1.1939 738 0.8301
0.9605 1.3927 861 0.8214
0.7785 1.5915 984 0.8186
0.7233 1.7903 1107 0.8178
0.8389 1.9891 1230 0.8173

Framework versions

  • Transformers 4.41.1
  • Pytorch 2.3.0+cu121
  • Datasets 2.19.1
  • Tokenizers 0.19.1