---
license: gemma
base_model: cognitivecomputations/dolphin-2.9.4-gemma2-2b
base_model_relation: quantized
library_name: mlc-llm
pipeline_tag: text-generation
---

4-bit [OmniQuant](https://arxiv.org/abs/2308.13137) quantized version of [dolphin-2.9.4-gemma2-2b](https://huggingface.co/cognitivecomputations/dolphin-2.9.4-gemma2-2b) for inference with the [Private LLM app](https://privatellm.app/).