---
tags:
- fp8
---
This FP8 checkpoint was produced with the `quantize.py` script from AutoFP8 (https://github.com/neuralmagic/AutoFP8/blob/b0c1f789c51659bb023c06521ecbd04cea4a26f6/quantize.py), run as follows:

```bash
python quantize.py --model-id meta-llama/Meta-Llama-3-8B-Instruct --save-dir Meta-Llama-3-8B-Instruct-FP8
```
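
To reproduce the run against the exact script revision linked above, something like the following sketch may help. It clones the repository, checks out the pinned commit, and then invokes the same command. The `WORKDIR` variable, the `run` helper, and the `RUN=1` dry-run switch are hypothetical conveniences, not part of AutoFP8. The sketch also assumes the script's Python dependencies are already installed and that you have access to the source model.

```bash
#!/usr/bin/env bash
# Sketch: rerun quantize.py pinned to the commit referenced in this card.
set -euo pipefail

REPO_URL="https://github.com/neuralmagic/AutoFP8"
COMMIT="b0c1f789c51659bb023c06521ecbd04cea4a26f6"
MODEL_ID="meta-llama/Meta-Llama-3-8B-Instruct"
SAVE_DIR="Meta-Llama-3-8B-Instruct-FP8"
WORKDIR="${WORKDIR:-AutoFP8}"   # hypothetical local clone directory

# Hypothetical helper: print each step, and execute it only when RUN=1.
run() {
  echo "+ $*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

run git clone "$REPO_URL" "$WORKDIR"
run git -C "$WORKDIR" checkout "$COMMIT"
run python "$WORKDIR/quantize.py" --model-id "$MODEL_ID" --save-dir "$SAVE_DIR"
```

By default the script only prints the three commands. Set `RUN=1` in the environment to actually execute them.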