---
tags:
  - fp8
---

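Produced using the `quantize.py` script from AutoFP8, pinned to the commit in this URL: https://github.com/neuralmagic/AutoFP8/blob/b0c1f789c51659bb023c06521ecbd04cea4a26f6/quantize.py

The script was invoked as: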

python quantize.py --model-id meta-llama/Meta-Llama-3-8B-Instruct --save-dir Meta-Llama-3-8B-Instruct-FP8
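
To reproduce the checkpoint from a script, the same invocation can be assembled in Python. This is a minimal sketch, not part of AutoFP8: the helper names `blob_to_raw` and `build_quantize_command` are hypothetical, and it assumes `quantize.py` has already been downloaded from the pinned commit into the working directory. Only the two flags shown in the command above are used.

```python
import shlex
import sys

# Pinned script URL copied from the note above.
SCRIPT_BLOB_URL = (
    "https://github.com/neuralmagic/AutoFP8/blob/"
    "b0c1f789c51659bb023c06521ecbd04cea4a26f6/quantize.py"
)


def blob_to_raw(url: str) -> str:
    """Map a github.com 'blob' URL to its raw.githubusercontent.com form.

    Hypothetical helper: useful for fetching the exact pinned script file.
    """
    prefix = "https://github.com/"
    if not url.startswith(prefix) or "/blob/" not in url:
        raise ValueError(f"not a GitHub blob URL: {url}")
    owner_repo, rest = url[len(prefix):].split("/blob/", 1)
    return f"https://raw.githubusercontent.com/{owner_repo}/{rest}"


def build_quantize_command(model_id: str, save_dir: str,
                           script: str = "quantize.py") -> list[str]:
    """Build the argv list for the quantization run (hypothetical helper).

    Assumes `script` is a local copy of the pinned quantize.py.
    """
    return [sys.executable, script,
            "--model-id", model_id,
            "--save-dir", save_dir]


command = build_quantize_command(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    "Meta-Llama-3-8B-Instruct-FP8",
)

# Show where to fetch the pinned script and what would be executed.
print(blob_to_raw(SCRIPT_BLOB_URL))
print(shlex.join(command[1:]))
```

Passing `command` to `subprocess.run(command, check=True)` would start the actual run. The sketch only prints the command, since the run itself downloads the base model and needs the AutoFP8 dependencies installed.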