Update README.md
README.md CHANGED

@@ -1,14 +1,14 @@
 ---
 license: llama2
 inference: false
-tags: [green, llmware-chat, p7, ov, emerald]
+tags: [green, llmware-chat, p13, ov, emerald]
 ---
 
-# llama-2-chat-ov
+# llama-2-13b-chat-ov
 
-**llama-2-chat-ov** is an OpenVino int4 quantized version of Llama-2-Chat, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
+**llama-2-13b-chat-ov** is an OpenVino int4 quantized version of Llama-2-13B-Chat, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
 
-[**llama-2-chat**](https://huggingface.co/meta-llama/Llama-2-
+[**llama-2-13b-chat**](https://huggingface.co/meta-llama/Llama-2-13b-chat-hf) is the official 13b chat finetuned version of Llama2, and is one of the classic and best all-around chat models from 2023.
 
 
 ### Model Description

@@ -16,8 +16,8 @@ tags: [green, llmware-chat, p7, ov, emerald]
 - **Developed by:** meta-llama
 - **Quantized by:** llmware
 - **Model type:** llama2
-- **Parameters:**
-- **Model Parent:** meta-llama/Llama-2-
+- **Parameters:** 13 billion
+- **Model Parent:** meta-llama/Llama-2-13b-chat-hf
 - **Language(s) (NLP):** English
 - **License:** LLama2 Community License
 - **Uses:** Chat and general purpose LLM