Update README.md
README.md
CHANGED
@@ -7,13 +7,14 @@ license: cc-by-sa-4.0
 <!-- Provide a quick summary of what the model is/does. -->
 
 
-**slim-summary-tool** is a 4_K_M quantized GGUF version of slim-summary, providing a small, fast inference implementation, optimized for multi-model concurrent deployment.
+**slim-summary-tool** is a 4_K_M quantized GGUF version of slim-summary, providing a small, fast inference implementation, optimized for multi-model concurrent deployment, to provide high-quality summarizations of complex business documents, on a small, specialized locally-deployable model.
 
-The size of the self-contained GGUF model binary is 1.71 GB, which is small enough to run locally on a CPU with reasonable inference speed.
+The size of the self-contained GGUF model binary is 1.71 GB, which is small enough to run locally on a CPU with reasonable inference speed, and has been optimized to maximize high-quality with the ability to deploy on a local machine.
 
 The model takes as input a text passage, an optional parameter with a focusing phrase or query, and an experimental optional (N) parameter, which is used to guide the model to a specific number of items return in a summary list.
 
-[**slim-summary**](https://huggingface.co/llmware/slim-summary)
+Please see the usage notes at: [**slim-summary**](https://huggingface.co/llmware/slim-summary)
+
 
 To pull the model via API:
 