# gemma_bn_instruct
This model is a fine-tuned version of [INSAIT-Institute/BgGPT-Gemma-2-2.6B-IT-v1.0](https://huggingface.co/INSAIT-Institute/BgGPT-Gemma-2-2.6B-IT-v1.0) on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 1.7552
## Model description
More information needed
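Pending fuller documentation, the sketch below shows one way to load and query the model. It assumes this repository hosts a PEFT adapter for the base model named above (the PEFT entry under framework versions suggests adapter-based fine-tuning, though the card does not confirm it); the prompt is a placeholder.

```python
# Minimal sketch, assuming this repo contains a PEFT adapter for
# INSAIT-Institute/BgGPT-Gemma-2-2.6B-IT-v1.0 (not confirmed by this card).
import torch
from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer

model_id = "thedeba/gemma_bn_instruct"
model = AutoPeftModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Placeholder prompt; the intended prompt format is not documented here.
inputs = tokenizer("Write a short greeting.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```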
## Intended uses & limitations
More information needed
## Training and evaluation data
More information needed
## Training procedure

### Training hyperparameters
The following hyperparameters were used during training (a configuration sketch follows the list):
- learning_rate: 0.0002
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 8
- total_train_batch_size: 64 (train_batch_size 8 × gradient_accumulation_steps 8)
- optimizer: adamw_torch (betas=(0.9, 0.999), epsilon=1e-08; no additional optimizer arguments)
- lr_scheduler_type: linear
- num_epochs: 3
- mixed_precision_training: Native AMP
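For reference, the settings above map onto Hugging Face `TrainingArguments` roughly as follows. This is a reconstruction from the list, not the original training script: `output_dir` is a placeholder, and the mixed-precision mode is assumed to be fp16.

```python
# Sketch of the hyperparameters above as transformers TrainingArguments
# (argument names valid for Transformers 4.51).
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="gemma_bn_instruct",  # placeholder; actual path unknown
    learning_rate=2e-4,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=8,   # effective train batch: 8 * 8 = 64
    num_train_epochs=3,
    lr_scheduler_type="linear",
    optim="adamw_torch",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    seed=42,
    fp16=True,  # "Native AMP"; bf16 is equally plausible -- assumption
)
```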
### Training results
| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 2.0629        | 0.2844 | 200  | 1.9151          |
| 1.8976        | 0.5689 | 400  | 1.8619          |
| 1.8624        | 0.8533 | 600  | 1.8252          |
| 1.7870        | 1.1365 | 800  | 1.8043          |
| 1.7492        | 1.4210 | 1000 | 1.7880          |
| 1.7227        | 1.7054 | 1200 | 1.7763          |
| 1.7327        | 1.9899 | 1400 | 1.7640          |
| 1.6352        | 2.2731 | 1600 | 1.7663          |
| 1.6366        | 2.5575 | 1800 | 1.7605          |
| 1.6381        | 2.8420 | 2000 | 1.7552          |
### Framework versions
- PEFT 0.17.0
- Transformers 4.51.3
- PyTorch 2.7.0+cu126
- Datasets 4.0.0
- Tokenizers 0.21.1
### Base model

- google/gemma-2-2b (via INSAIT-Institute/BgGPT-Gemma-2-2.6B-IT-v1.0)