Nobody felt like quantizing this model?

by ElvisM - opened Dec 14, 2024

ElvisM

Dec 14, 2024

Weird. Usually GGUF quants pop out in the first hour.

gghfez

Dec 15, 2024

New architecture, it'll take time for the popular inference engines / quant libs to be updated to support it.

Looks like there's an MLX pull request with support if you have a mac

alexrs

Cohere Labs org 4 days ago

alexrs changed discussion status to closed 4 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment