Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
This Model is 8bit Version of EleutherAI/gpt-j-6B. It is converted by Facebook's bitsandbytes library. The original GPT-J takes 22+ GB memory for float32 parameters alone, and that's before you account for gradients & optimizer. So for finetuning on single GPU This model is converted into 8bit.
|