Raj-Maharajwala committed on
Commit 732082e · 1 Parent(s): 370d564
Update README.md

README.md CHANGED
@@ -39,21 +39,38 @@ This model is a domain-specific language model based on Llama 3, fine-tuned for
 - **Model Type:** Instruction-tuned Language Model
 - **Base Model:** nvidia/Llama3-ChatQA-1.5-8B
 - **Finetuned Model:** Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B
+- **Quantized Model:** Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF
 - **Model Architecture:** Llama
 - **Parameters:** 8.05 billion
 - **Developer:** Raj Maharajwala
 - **License:** llama3
 - **Language:** English
 
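For readers who want to try the fine-tuned checkpoint listed above, here is a minimal loading sketch using the standard transformers API; the dtype, device placement, and generation settings are illustrative assumptions, not values published by the author:

```python
# Minimal sketch: load the fine-tuned checkpoint with transformers
# and run one generation. Settings marked "assumed" are not from the
# model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed; fp16 also works on most GPUs
    device_map="auto",           # requires the accelerate package
)

prompt = "What does a deductible mean in an auto insurance policy?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:],
                       skip_special_tokens=True))
```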
-### Quantized Model
+### Quantized Model
+
 Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF: https://huggingface.co/Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF
 
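For the GGUF build linked above, a minimal inference sketch with llama-cpp-python; the quantization filename pattern and context size are assumptions about the repo contents, not documented values:

```python
# Sketch: run the GGUF quantization with llama-cpp-python
# (pip install llama-cpp-python huggingface-hub).
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF",
    filename="*Q4_K_M.gguf",  # glob pattern; assumed quantization level
    n_ctx=4096,               # assumed context size
)

out = llm.create_completion(
    "Briefly explain what a policy rider is.",
    max_tokens=200,
)
print(out["choices"][0]["text"])
```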
 ## Training Data
 
-The model has been fine-tuned on the InsuranceQA dataset, which contains insurance-specific question-answer pairs and domain knowledge.
-trainable params: 20.97M || all params: 8.05B || trainable%: 0.26%
+The model has been fine-tuned with 8-bit LoRA on the InsuranceQA dataset, which contains insurance-specific question-answer pairs and domain knowledge.
+
+trainable params: 20.97M || all params: 8.05B || trainable%: 0.26%
+```python
+# LoRA adapter configuration used for fine-tuning (peft)
+from peft import LoraConfig
+
+LoraConfig(
+    r=8,
+    lora_alpha=32,
+    lora_dropout=0.05,
+    bias="none",
+    task_type="CAUSAL_LM",
+    target_modules=['up_proj', 'down_proj', 'gate_proj', 'k_proj', 'q_proj', 'v_proj', 'o_proj']
+)
+```
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/66315b34b1c6e12e1c304bf8/ZzHaMo1Kt9XNnFh24H3gt.png)
+
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/66315b34b1c6e12e1c304bf8/0sLiphsQL-j5km4c5_vru.png)
+
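The trainable-parameter line above is the format printed by PEFT's `print_trainable_parameters()`. As a rough sketch of how that number arises from the config, assuming the base model is loaded in 8-bit via bitsandbytes (the loading details are assumptions, not the author's published training script):

```python
# Sketch: attach the LoRA adapter from the config above and report
# trainable parameters. Assumes transformers, peft, bitsandbytes, and
# accelerate are installed; loading details are assumptions.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base = AutoModelForCausalLM.from_pretrained(
    "nvidia/Llama3-ChatQA-1.5-8B",
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
base = prepare_model_for_kbit_training(base)

lora = LoraConfig(
    r=8,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=['up_proj', 'down_proj', 'gate_proj',
                    'k_proj', 'q_proj', 'v_proj', 'o_proj'],
)
model = get_peft_model(base, lora)

# Rank-8 A/B matrices on seven projections across all 32 layers come to
# roughly 20.97M parameters, i.e. about 0.26% of the 8.05B total:
model.print_trainable_parameters()
```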
 
 ## Model Architecture
 
 The model uses the Llama 3 architecture with the following key components: