Commit History
NVIDIA All sym, guidance scale 8
4a016ba
verified
NVIDIA All sym, guidance scale 8
209043a
verified
NVIDIA All sym, guidance scale 8
dc6d4e1
verified
NVIDIA All sym, guidance scale 8
799135a
verified
NVIDIA All sym, guidance scale 8
9888c6c
verified
MI250 QKV fused and all layers sym, guidance scale 8
6c705ea
verified
MI250 QKV fused and all layers sym, guidance scale 8
743d6f9
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5, calib steps 12
abd5384
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5, calib steps 12
660100c
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5
48152a7
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5
45f8cad
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5, no conv_out
d30c97c
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5, no conv_out
8ec75e6
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5
ac0d882
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5
0570b3e
verified
MI250 QKV fused and all linear layers sym, guidance scale 7.5
cb00591
verified
QKV fused and all linear layers sym, guidance scale 7.5
70da055
verified
QKV fused and all linear layers sym, guidance scale 7.5
4128ea1
verified
QKV fused and all linear layers sym
51fa88e
verified
QKV fused and all linear layers sym
5d000b6
verified
QKV fused and sym
f20a0bf
verified
QKV fused and sym
881d80b
verified
Full symmetric
ed4e81d
verified
Full symmetric
1e690df
verified
QKV fused and all linear layers sym
3cee2a6
verified
QKV fused and all linear layers sym
cf48f0f
verified
QKV fused and sym
6f44cfb
verified
QKV fused and sym
b175bf6
verified
Fused QKV quant_params.json with zp
7e99883
verified
Added vae weights with FP16 fix.
2de7ba8
Fused QKV safetensor with zp
0339659
verified
Fused QKV safetensor
348012d
verified
Fused QKV quant_params.json
a793c5a
verified
Fix model loading
7f81513
verified
Updates to minimal quantization script. (#1)
72eb84b
verified
Update quant params structure (#2)
6b62ce4
verified
Reference inputs
17638f5
verified
Updated quant_params
fb3aa3b
verified
Updated params.safetensors
36c8b73
verified
Output reference tensors
6e61570
verified
Quantization script
ecec5b7
verified
Remove potential overflow / saturation error.
161df88
Added comments - highlight possible overflow situation
3f5851c
Updated math model to target int8 x int8 kernels.
4024f9d
Updated QOp model to fuse SmoothQuant scales with input quantization
dca9b6e
Output reference tensors
8e3c05a
verified
Add config.json from stable-diffusion-xl-base-1.0/unet
54be8be
Stella Laurenzo
committed on