JonahYixMAD committed
Commit 80b512e
1 Parent(s): ef2d27e
Update README.md
README.md CHANGED
@@ -51,11 +51,4 @@ outputs = model.generate(**inputs, do_sample=True, max_new_tokens=256)
 print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
 ```
 
-Model | GPU Memory Requirement
---- | ---
-Llama-3.2-3B-Instruct-xMADai-4bit | 6.5 GB → 3.5 GB
-Llama-3.2-1B-Instruct-xMADai-4bit | 2.5 → 2 GB
-Llama-3.1-405B-Instruct-xMADai-4bit | 800 GB (16 H100s) → 250 GB (8 V100)
-Llama-3.1-8B-Instruct-xMADai-4bit | 16 → 7 GB
-
 For additional xMADified models, access to fine-tuning, and general questions, please contact us at [email protected] and join our waiting list.