MoxoffSrL
/

AzzurroQuantized

Text Generation

Inference Endpoints

Model card Files Files and versions Community

marcodambra commited on Apr 4, 2024

Commit

fa7a860

·

verified ·

1 Parent(s): 44a1115

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -13,11 +13,14 @@ tags:
 # Model Information
-XXXX is an updated version of [Mistral-7B-v0.2](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) specifically fine-tuned with SFT and LoRA adjustments.
 - It's trained both on publicly available datasets, like [SQUAD-it](https://huggingface.co/datasets/squad_it), and datasets we've created in-house.
 - it's designed to understand and maintain context, making it ideal for Retrieval Augmented Generation (RAG) tasks and applications requiring contextual awareness.
 # Evaluation
 We evaluated the model using the same test sets as used for the [Open Ita LLM Leaderboard](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard):

 # Model Information
+XXXXQuantized is a compact iteration of the model [XXXX](https://huggingface.co/MoxoffSpA/xxxx), optimized for efficiency.
+It is offered in two distinct configurations: a 4-bit version and an 8-bit version, each designed to maintain the model's effectiveness while significantly reducing its size.
+and computational requirements.
 - It's trained both on publicly available datasets, like [SQUAD-it](https://huggingface.co/datasets/squad_it), and datasets we've created in-house.
 - it's designed to understand and maintain context, making it ideal for Retrieval Augmented Generation (RAG) tasks and applications requiring contextual awareness.
+- It is quantized in a 4-bit version and an 8-bit version suing the prcedure [here](https://github.com/ggerganov/llama.cpp).
+-
 # Evaluation
 We evaluated the model using the same test sets as used for the [Open Ita LLM Leaderboard](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard):