Qwen2-72B-LC-RTN-W4A16 / recipe.yaml
stan-hua's picture
Push folder to HuggingFace Hub
7107545 verified
raw
history blame contribute delete
128 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
QuantizationModifier:
ignore: [lm_head]
targets: Linear
scheme: W4A16