lsmille
/

lora_evo_ta_all_layers_3

Generated from Trainer

Model card Files Files and versions

lsmille commited on May 28, 2024

Commit

0307dcb

·

verified ·

1 Parent(s): 610d406

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -21,15 +21,24 @@ It achieves the following results on the evaluation set:
 ## Model description
 lora_alpha = 16 <--------
 lora_dropout = 0.05
 lora_r = 8 <--------
 epochs = 3
 learning rate = 3e-4
 warmup_steps=0.5
 gradient_accumulation_steps = 8
 train_batch = 1
 eval_batch = 1
 ## Intended uses & limitations
 More information needed

 ## Model description
 lora_alpha = 16 <--------
 lora_dropout = 0.05
 lora_r = 8 <--------
 epochs = 3
 learning rate = 3e-4
 warmup_steps=0.5
 gradient_accumulation_steps = 8
 train_batch = 1
 eval_batch = 1
 ## Intended uses & limitations
 More information needed