sharkMeow's picture
End of training
ca70c15 verified
metadata
base_model: OFA-Sys/chinese-clip-vit-base-patch16
tags:
  - generated_from_trainer
metrics:
  - accuracy
model-index:
  - name: sentance_split_by_time_gpt_concate
    results: []

Visualize in Weights & Biases

sentance_split_by_time_gpt_concate

This model is a fine-tuned version of OFA-Sys/chinese-clip-vit-base-patch16 on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 3.8213
  • Accuracy: 0.0730

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-05
  • train_batch_size: 25
  • eval_batch_size: 20
  • seed: 42
  • gradient_accumulation_steps: 8
  • total_train_batch_size: 200
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 60.0
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Accuracy
2.0921 5.9928 1866 3.0241 0.0774
1.9094 11.9855 3732 3.0886 0.0816
1.7854 17.9783 5598 3.2711 0.0803
1.7194 23.9711 7464 3.4032 0.0787
1.6865 29.9639 9330 3.3919 0.0778
1.658 35.9566 11196 3.4449 0.0767
1.6521 41.9494 13062 3.6068 0.0753
1.6356 47.9422 14928 3.6376 0.0743
1.6355 53.9350 16794 3.7659 0.0739
1.6276 59.9277 18660 3.8213 0.0734

Framework versions

  • Transformers 4.42.3
  • Pytorch 2.3.1+cu121
  • Datasets 2.20.0
  • Tokenizers 0.19.1