junafinity committed • Commit c9fc25c • 1 parent: 694580e
Update README.md
README.md CHANGED
@@ -45,6 +45,19 @@ Nidum-Limitless-Gemma-2B is now officially available. Explore its capabilities a
 ## Contributing:
 We welcome contributions to enhance the model or expand its functionalities. Details on how to contribute will be available in the coming updates.
 
+## Quantized Model Versions
+
+To accommodate different hardware configurations and performance needs, Nidum-Limitless-Gemma-2B-GGUF is available in multiple quantized versions:
+
+| Model Version | Description |
+|------------------------------------------------|-------------------------------------------------------|
+| **Nidum-Limitless-Gemma-2B-Q2_K.gguf** | Optimized for minimal memory usage with lower precision. Suitable for resource-constrained environments. |
+| **Nidum-Limitless-Gemma-2B-Q4_K_M.gguf** | Balances performance and precision, offering faster inference with moderate memory usage. |
+| **Nidum-Limitless-Gemma-2B-Q8_0.gguf** | Provides higher precision with increased memory usage, suitable for tasks requiring more accuracy. |
+| **Nidum-Limitless-Gemma-2B-F16.gguf** | Full 16-bit floating point precision for maximum accuracy, ideal for high-end GPUs. |
+
+It is available here: https://huggingface.co/nidum/Nidum-Limitless-Gemma-2B-GGUF
+
 ## Contact:
 For any inquiries or further information, please contact us at **[email protected]**.
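
As a practical note on choosing among the files added in this commit, the sketch below downloads the Q4_K_M variant listed in the table and runs it locally. It assumes the `huggingface_hub` and `llama-cpp-python` packages; the README itself does not prescribe a runtime, so this is one illustrative option rather than the official usage.

```python
# Minimal sketch: fetch one quantized GGUF file from the repo linked above
# and run a short completion with llama-cpp-python (an assumed runtime,
# not one mandated by the README).
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Q4_K_M balances speed and memory per the table; any listed filename works.
model_path = hf_hub_download(
    repo_id="nidum/Nidum-Limitless-Gemma-2B-GGUF",
    filename="Nidum-Limitless-Gemma-2B-Q4_K_M.gguf",
)

llm = Llama(model_path=model_path, n_ctx=2048)
out = llm("Explain GGUF quantization in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```

Swapping the `filename` argument for another entry in the table trades memory footprint for precision as described there.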