QuantLMs, unpacked to FP16 format - compatible with FP16 GEMMs. After unpacking, QuantLMs have the same architecture as LLaMa.