alunapr
/

roberta-large-finetuned-lora-captures

+---
+license: mit
+library_name: peft
+tags:
+- generated_from_trainer
+base_model: roberta-large
+metrics:
+- accuracy
+model-index:
+- name: roberta-large-finetuned-lora-captures
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# roberta-large-finetuned-lora-captures
+This model is a fine-tuned version of [roberta-large](https://huggingface.co/roberta-large) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4657
+- Accuracy: 0.9264
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0003
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- distributed_type: multi-GPU
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 32
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 20
+### Training results
+| Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
+|:-------------:|:-------:|:-----:|:---------------:|:--------:|
+| 0.2608        | 0.9994  | 772   | 0.3877          | 0.8888   |
+| 0.3173        | 2.0     | 1545  | 0.3443          | 0.8932   |
+| 0.2885        | 2.9994  | 2317  | 0.2995          | 0.9161   |
+| 0.2566        | 4.0     | 3090  | 0.2884          | 0.9163   |
+| 0.1908        | 4.9994  | 3862  | 0.3115          | 0.9140   |
+| 0.1973        | 6.0     | 4635  | 0.2891          | 0.9186   |
+| 0.1071        | 6.9994  | 5407  | 0.2913          | 0.9218   |
+| 0.1177        | 8.0     | 6180  | 0.3057          | 0.9212   |
+| 0.1775        | 8.9994  | 6952  | 0.3390          | 0.9184   |
+| 0.0994        | 10.0    | 7725  | 0.3260          | 0.9218   |
+| 0.08          | 10.9994 | 8497  | 0.3303          | 0.9264   |
+| 0.1041        | 12.0    | 9270  | 0.3738          | 0.9209   |
+| 0.0633        | 12.9994 | 10042 | 0.3629          | 0.9271   |
+| 0.0253        | 14.0    | 10815 | 0.3967          | 0.9239   |
+| 0.0625        | 14.9994 | 11587 | 0.4285          | 0.9246   |
+| 0.0627        | 16.0    | 12360 | 0.4360          | 0.9244   |
+| 0.0551        | 16.9994 | 13132 | 0.4430          | 0.9267   |
+| 0.0545        | 18.0    | 13905 | 0.4695          | 0.9251   |
+| 0.0434        | 18.9994 | 14677 | 0.4622          | 0.9271   |
+| 0.021         | 19.9871 | 15440 | 0.4657          | 0.9264   |
+### Framework versions
+- PEFT 0.11.1
+- Transformers 4.40.2
+- Pytorch 2.3.0+cu121
+- Datasets 2.19.1
+- Tokenizers 0.19.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6e820303c482ad92ee4fd1c7d9e6af577094bb774553c00a7b7d753c86601ecb
 size 7407672

 version https://git-lfs.github.com/spec/v1
+oid sha256:174f289230069d890d204166fda68fc276acc501ddef3773fa59627fd3460d09
 size 7407672