doberst committed on
Commit 0f7af22
1 Parent(s): a703c0b

Upload README.md

Files changed (1)
  1. README.md +18 -13
README.md CHANGED
@@ -1,28 +1,33 @@
  ---
- license: apache-2.0
- inference: false
- tags: [green, p1, llmware-fx, ov, emerald]
+ license: gemma
+ inference: false
+ tags:
+ - green
+ - p7
+ - llmware-chat
+ - ov
  ---

- # slim-summary-tiny-ov
+ # gemma-7b-it-ov

- **slim-summary-tiny-ov** is a specialized function calling model that summarizes a given text and generates as output a Python list of summary points.
+ **gemma-7b-it-ov** is an OpenVino int4 quantized version of Google's Gemma-7B with Instruct Training (IT), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.

- This is an OpenVino int4 quantized version of slim-summary-tiny, providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
+ [**gemma-7b-it-ov**](https://huggingface.co/google/gemma-7b-it) is a leading open source foundation chat model from Google.


  ### Model Description

- - **Developed by:** llmware
- - **Model type:** tinyllama
- - **Parameters:** 1.1 billion
- - **Model Parent:** llmware/slim-summary-tiny
+ - **Developed by:** Google
+ - **Quantized by:** llmware
+ - **Model type:** gemma-7b
+ - **Parameters:** 7 billion
+ - **Model Parent:** google/gemma-7b-it
  - **Language(s) (NLP):** English
  - **License:** Apache 2.0
- - **Uses:** Summary bulletpoints extracted from complex business documents
+ - **Uses:** General purpose chat
  - **RAG Benchmark Accuracy Score:** NA
  - **Quantization:** int4
-
+

  ## Model Card Contact

@@ -30,4 +35,4 @@ This is an OpenVino int4 quantized version of slim-summary-tiny, providing a ver

  [llmware on hf](https://www.huggingface.co/llmware)

- [llmware website](https://www.llmware.ai)
+ [llmware website](https://www.llmware.ai)