Model save
README.md
CHANGED
@@ -15,8 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
 # gpt2-xl-lora-multi-6
 
 This model is a fine-tuned version of [openai-community/gpt2-xl](https://huggingface.co/openai-community/gpt2-xl) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.0126
 
 ## Model description
 
@@ -46,7 +44,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
-- lr_scheduler_warmup_steps:
+- lr_scheduler_warmup_steps: 4643
 - num_epochs: 1
 - mixed_precision_training: Native AMP
 
@@ -56,8 +54,8 @@ The following hyperparameters were used during training:
 
 ### Framework versions
 
-- PEFT 0.13.
-- Transformers 4.45.
+- PEFT 0.13.1
+- Transformers 4.45.2
 - Pytorch 2.4.1+cu121
 - Datasets 3.0.1
 - Tokenizers 0.20.0
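
For anyone reproducing this run, the hyperparameter list in the diff above corresponds to standard `transformers.TrainingArguments` fields. Below is a minimal sketch, assuming the usual `Trainer` setup (the actual training script is not part of this commit); the `learning_rate` and batch sizes are placeholders, since the diff does not show them. When both are set, `transformers` lets an explicit `warmup_steps` take precedence over `warmup_ratio`, and 4643 steps is consistent with a 0.1 ratio over about 46,430 optimizer steps.

```python
# Sketch only: maps the card's listed hyperparameters onto TrainingArguments.
# learning_rate is hypothetical; this diff does not show it.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="gpt2-xl-lora-multi-6",
    num_train_epochs=1,          # num_epochs: 1
    lr_scheduler_type="cosine",  # lr_scheduler_type: cosine
    warmup_ratio=0.1,            # lr_scheduler_warmup_ratio: 0.1
    warmup_steps=4643,           # lr_scheduler_warmup_steps: 4643 (overrides the ratio)
    adam_beta1=0.9,              # optimizer: Adam with betas=(0.9,0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,           # epsilon=1e-08
    fp16=True,                   # mixed_precision_training: Native AMP
    learning_rate=2e-4,          # placeholder; not listed in this diff
)
```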
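
Because this repo stores a PEFT LoRA adapter on top of gpt2-xl (per the PEFT 0.13.1 entry above) rather than full model weights, loading it means attaching the adapter to the base model. A hedged usage sketch; `gpt2-xl-lora-multi-6` stands in for the adapter's full Hub repo id (`namespace/name`), which is not shown here.

```python
# Sketch: load the base model, then attach this LoRA adapter with PEFT.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("openai-community/gpt2-xl")
tokenizer = AutoTokenizer.from_pretrained("openai-community/gpt2-xl")

# "gpt2-xl-lora-multi-6" is a placeholder for the adapter's full Hub repo id.
model = PeftModel.from_pretrained(base, "gpt2-xl-lora-multi-6")

inputs = tokenizer("Hello, world", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```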