Why not small fp8 models
#10, opened by ryg81
Hey, why don't you release an fp8 model that can easily be used on consumer-level GPUs?
I think we can do fp8 quantization of the text encoder.