quantized models
#3
by Sio-Nii - opened
There are several quantized model files with the same int8 size:
model.onnx + model.onnx.data
model.ort
model.with_runtime_opt.ort
and another model.onnx + model.onnx.data in the fp16fullonnxsdquantized folder.
What is the difference between them?
Which one should we use?
You use the .ort models for Android.
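As a minimal sketch, assuming the ONNX Runtime Android package (the `com.microsoft.onnxruntime:onnxruntime-android` dependency) is available and the `.ort` file has already been copied onto device storage (the path and function name below are just placeholders), loading it could look like this:

```kotlin
import ai.onnxruntime.OrtEnvironment
import ai.onnxruntime.OrtSession

// Hypothetical helper: create an ONNX Runtime session from a .ort file on disk.
fun loadOrtModel(modelPath: String): OrtSession {
    // Shared ONNX Runtime environment for the process
    val env = OrtEnvironment.getEnvironment()
    // Default session options; the .ort format is the mobile-oriented build of the model
    val options = OrtSession.SessionOptions()
    return env.createSession(modelPath, options)
}

// Example usage (path is illustrative only):
// val session = loadOrtModel(context.filesDir.resolve("model.ort").absolutePath)
```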