marcodambra commited on
Commit
fa7a860
·
verified ·
1 Parent(s): 44a1115

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -2
README.md CHANGED
@@ -13,11 +13,14 @@ tags:
13
 
14
  # Model Information
15
 
16
- XXXX is an updated version of [Mistral-7B-v0.2](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) specifically fine-tuned with SFT and LoRA adjustments.
 
 
17
 
18
  - It's trained both on publicly available datasets, like [SQUAD-it](https://huggingface.co/datasets/squad_it), and datasets we've created in-house.
19
  - it's designed to understand and maintain context, making it ideal for Retrieval Augmented Generation (RAG) tasks and applications requiring contextual awareness.
20
-
 
21
  # Evaluation
22
 
23
  We evaluated the model using the same test sets as used for the [Open Ita LLM Leaderboard](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard):
 
13
 
14
  # Model Information
15
 
16
+ XXXXQuantized is a compact iteration of the model [XXXX](https://huggingface.co/MoxoffSpA/xxxx), optimized for efficiency.
17
+ It is offered in two distinct configurations: a 4-bit version and an 8-bit version, each designed to maintain the model's effectiveness while significantly reducing its size.
18
+ and computational requirements.
19
 
20
  - It's trained both on publicly available datasets, like [SQUAD-it](https://huggingface.co/datasets/squad_it), and datasets we've created in-house.
21
  - it's designed to understand and maintain context, making it ideal for Retrieval Augmented Generation (RAG) tasks and applications requiring contextual awareness.
22
+ - It is quantized in a 4-bit version and an 8-bit version suing the prcedure [here](https://github.com/ggerganov/llama.cpp).
23
+ -
24
  # Evaluation
25
 
26
  We evaluated the model using the same test sets as used for the [Open Ita LLM Leaderboard](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard):