It would be great if this could be provided as a GGUF (f16 and maybe q8) - that way we could plug it into our existing multi-modal and voice apps.