Commit History
QKV fused and sym
f20a0bf
verified
QKV fused and sym
881d80b
verified
Full symmetric
ed4e81d
verified
Full symmetric
1e690df
verified
QKV fused and all linear layers sym
3cee2a6
verified
QKV fused and all linear layers sym
cf48f0f
verified
QKV fused and sym
6f44cfb
verified
QKV fused and sym
b175bf6
verified
Fused QKV quant_params.json with zp
7e99883
verified
Added vae weights with FP16 fix.
2de7ba8
Fused QKV safetensor with zp
0339659
verified
Fused QKV safetensor
348012d
verified
Fused QKV quant_params.json
a793c5a
verified
Fix model loading
7f81513
verified
Updates to minimal quantization script. (#1)
72eb84b
verified
Update quant params structure (#2)
6b62ce4
verified
Reference inputs
17638f5
verified
Updated quant_params
fb3aa3b
verified
Updated params.safetensors
36c8b73
verified
Output reference tensors
6e61570
verified
Quantization script
ecec5b7
verified
Remove potential overflow / saturation error.
161df88
Added comments - highlight possible overflow situation
3f5851c
Updated math model to target int8 x int8 kernels.
4024f9d
Updated QOp model to fuse SmoothQuant scales with input quantization
dca9b6e
Output reference tensors
8e3c05a
verified
Add config.json from stable-diffusion-xl-base-1.0/unet
54be8be
Stella Laurenzo
commited on