llmware/gemma-2b-it-onnx


doberst committed on Oct 27, 2024

Commit 17c4801 · verified · 1 Parent(s): 44425c9

Update README.md

Files changed (1)

README.md +4 -4

@@ -5,14 +5,14 @@ tags:
 - green
 - p2
 - llmware-chat
-- ov
+- onnx
 ---
-# gemma-2b-it-ov
-**gemma-2b-it-ov-ov** is an OpenVino int4 quantized version of Google's Gemma-2B with Instruct Training (IT), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
-[**gemma-2b-it-ov**](https://huggingface.co/google/gemma-2b-it) is a leading open source foundation model from Google.
+# gemma-2b-it-onnx
+**gemma-2b-it-onnx** is an ONNX int4 quantized version of Google's Gemma-2B with Instruct Training (IT), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
+[**gemma-2b-it**](https://huggingface.co/google/gemma-2b-it) is a leading open source foundation model from Google.
 ### Model Description