nougat-small-onnx-quant_avx512_vnni

This was quantized from pszemraj/nougat-small-onnx using the --avx512_vnni flag. You need to have a processor with avx512_vnni instructions for this to work properly.

Downloads last month
6
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including pszemraj/nougat-small-onnx-quant_avx512_vnni