Update README.md
README.md
@@ -67,6 +67,22 @@ code = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(code)
```

### Size Comparison

The table below compares the VRAM required to load and train the FP16 base model against the 4-bit GPTQ-quantized model trained with PEFT. The figures for the base model are taken from the Hugging Face [Model Memory Calculator](https://huggingface.co/docs/accelerate/main/en/usage_guides/model_size_estimator).

| Model                  | Total Size | Training Using Adam |
| ---------------------- | ---------- | ------------------- |
| Base Model             | 12.37 GB   | 49.48 GB            |
| 4-bit Quantized + PEFT | 3.90 GB    | 11 GB               |
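
The quantized row above corresponds to attaching a small trainable adapter to the GPTQ checkpoint via PEFT. The snippet below is a minimal sketch of that kind of setup, assuming LoRA as the PEFT method; the model id and LoRA hyperparameters are placeholders, not values taken from this repository.

```python
# Minimal sketch: load a 4-bit GPTQ checkpoint and attach a LoRA adapter with PEFT.
# The model id and LoRA settings are placeholders, not values from this repository.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_id = "your-org/your-model-GPTQ"  # hypothetical GPTQ checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
# transformers loads the GPTQ-quantized weights directly when the checkpoint
# ships a quantization config (requires the optimum / auto-gptq backend).
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Only the small LoRA adapter is trained, which is what keeps the training
# footprint close to the ~11 GB listed in the table above.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # assumed attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```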

## Training Details