varun-v-rao
/

gpt2-large-lora-2.95M-snli-model2

Text Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

varun-v-rao commited on Jun 24, 2024

Commit

45239b0

·

verified ·

1 Parent(s): b4f2824

End of training

Files changed (2) hide show

README.md +7 -7
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.879394432026011
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,8 +29,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai-community/gpt2-large](https://huggingface.co/openai-community/gpt2-large) on the snli dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3249
-- Accuracy: 0.8794
 ## Model description
@@ -52,7 +52,7 @@ The following hyperparameters were used during training:
 - learning_rate: 2e-05
 - train_batch_size: 128
 - eval_batch_size: 128
-- seed: 10
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
@@ -61,9 +61,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.4325        | 1.0   | 4292  | 0.3560          | 0.8632   |
-| 0.396         | 2.0   | 8584  | 0.3342          | 0.8755   |
-| 0.3894        | 3.0   | 12876 | 0.3249          | 0.8794   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.8766510871774029
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai-community/gpt2-large](https://huggingface.co/openai-community/gpt2-large) on the snli dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3280
+- Accuracy: 0.8767
 ## Model description
 - learning_rate: 2e-05
 - train_batch_size: 128
 - eval_batch_size: 128
+- seed: 96
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 3
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.4344        | 1.0   | 4292  | 0.3571          | 0.8650   |
+| 0.402         | 2.0   | 8584  | 0.3363          | 0.8744   |
+| 0.3958        | 3.0   | 12876 | 0.3280          | 0.8767   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0d59c1e15462aa630731b0b0d51006d4e05d8f70c162507ed1a8ffcee9f38873
 size 3096181368

 version https://git-lfs.github.com/spec/v1
+oid sha256:15ece3503c81f74665b640de407460ca1fc36833ebd6b4d428ef365eaf4f5d97
 size 3096181368