kil3r
/

gptj6b-lora-owca

Text Generation

Model card Files Files and versions Community

kil3r commited on Apr 23, 2023

Commit

bb8ef0a

·

1 Parent(s): 07a9d86

Create README.md

Files changed (1) hide show

README.md +14 -0

README.md ADDED Viewed

	@@ -0,0 +1,14 @@

+# This repo contains EleutherAI/gpt-j-6B fine tuned on OWCA (https://github.com/Emplocity/owca) using LoRa
+Training params:
+MICRO_BATCH_SIZE = 64
+BATCH_SIZE = 128
+GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE
+EPOCHS = 3
+LEARNING_RATE = 2e-5
+CUTOFF_LEN = 256
+LORA_R = 4
+LORA_ALPHA = 16
+LORA_DROPOUT = 0.05
+warmup_steps=100
+fp16=True