junafinity committed • Commit c9fc25c • 1 parent: 694580e
Update README.md
README.md CHANGED
@@ -45,6 +45,19 @@ Nidum-Limitless-Gemma-2B is now officially available. Explore its capabilities a
 ## Contributing:
 We welcome contributions to enhance the model or expand its functionalities. Details on how to contribute will be available in the coming updates.
 
+## Quantized Model Versions
+
+To accommodate different hardware configurations and performance needs, Nidum-Limitless-Gemma-2B-GGUF is available in multiple quantized versions:
+
+| Model Version | Description |
+|------------------------------------------------|-------------------------------------------------------|
+| **Nidum-Limitless-Gemma-2B-Q2_K.gguf** | Optimized for minimal memory usage with lower precision. Suitable for resource-constrained environments. |
+| **Nidum-Limitless-Gemma-2B-Q4_K_M.gguf** | Balances performance and precision, offering faster inference with moderate memory usage. |
+| **Nidum-Limitless-Gemma-2B-Q8_0.gguf** | Provides higher precision with increased memory usage, suitable for tasks requiring more accuracy. |
+| **Nidum-Limitless-Gemma-2B-F16.gguf** | Full 16-bit floating point precision for maximum accuracy, ideal for high-end GPUs. |
+
+It is available here: https://huggingface.co/nidum/Nidum-Limitless-Gemma-2B-GGUF
+
 ## Contact:
 For any inquiries or further information, please contact us at **[email protected]**.
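
As a practical note on choosing among the files added in this commit, the sketch below downloads the Q4_K_M variant listed in the table and runs it locally. It assumes the `huggingface_hub` and `llama-cpp-python` packages; the README itself does not prescribe a runtime, so this is one illustrative option rather than the official usage.

```python
# Minimal sketch: fetch one quantized GGUF file from the repo linked above
# and run a short completion with llama-cpp-python (an assumed runtime,
# not one mandated by the README).
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Q4_K_M balances speed and memory per the table; any listed filename works.
model_path = hf_hub_download(
    repo_id="nidum/Nidum-Limitless-Gemma-2B-GGUF",
    filename="Nidum-Limitless-Gemma-2B-Q4_K_M.gguf",
)

llm = Llama(model_path=model_path, n_ctx=2048)
out = llm("Explain GGUF quantization in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```

Swapping the `filename` argument for another entry in the table trades memory footprint for precision as described there.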