munish0838 commited on
Commit
8e71420
·
verified ·
1 Parent(s): bad4eff

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +144 -0
README.md ADDED
@@ -0,0 +1,144 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ language:
5
+ - en
6
+ library_name: transformers
7
+ pipeline_tag: text-generation
8
+ tags:
9
+ - Text Generation
10
+ - Transformers
11
+ - llama
12
+ - llama-3
13
+ - 8B
14
+ - nvidia
15
+ - facebook
16
+ - meta
17
+ - LLM
18
+ - insurance
19
+ - research
20
+ - pytorch
21
+ - instruct
22
+ - chatqa-1.5
23
+ - chatqa
24
+ - finetune
25
+ - gpt4
26
+ - conversational
27
+ - text-generation-inference
28
+ datasets:
29
+ - InsuranceQA
30
+
31
+ base_model: "nvidia/Llama3-ChatQA-1.5-8B"
32
+ finetuned: "Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B"
33
+ quantized: "Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF"
34
+ license: llama3
35
+
36
+ ---
37
+
38
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
39
+
40
+
41
+ # QuantFactory/Open-Insurance-LLM-Llama3-8B-GGUF
42
+ This is quantized version of [Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B](https://huggingface.co/Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B) created using llama.cpp
43
+
44
+ # Original Model Card
45
+
46
+
47
+ # Open-Insurance-LLM-Llama3-8B
48
+
49
+ This model is a domain-specific language model based on Nvidia Llama 3 ChatQA, fine-tuned for insurance-related queries and conversations. It leverages the architecture of Llama 3 and is specifically trained to handle insurance domain tasks.
50
+
51
+ ## Model Details
52
+
53
+ - **Model Type:** Instruction-tuned Language Model
54
+ - **Base Model:** nvidia/Llama3-ChatQA-1.5-8B
55
+ - **Finetuned Model:** Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B
56
+ - **Quantized Model:** Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF
57
+ - **Model Architecture:** Llama
58
+ - **Parameters:** 8.05 billion
59
+ - **Developer:** Raj Maharajwala
60
+ - **License:** llama3
61
+ - **Language:** English
62
+
63
+ ### Quantized Model
64
+
65
+ Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF: https://huggingface.co/Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF
66
+
67
+ ## Training Data
68
+
69
+ The model has been fine-tuned on the InsuranceQA dataset using LoRA (8 bit), which contains insurance-specific question-answer pairs and domain knowledge.
70
+ trainable params: 20.97M || all params: 8.05B || trainable %: 0.26%
71
+ ```bash
72
+ LoraConfig(
73
+ r=8,
74
+ lora_alpha=32,
75
+ lora_dropout=0.05,
76
+ bias="none",
77
+ task_type="CAUSAL_LM",
78
+ target_modules=['up_proj', 'down_proj', 'gate_proj', 'k_proj', 'q_proj', 'v_proj', 'o_proj']
79
+ )
80
+ ```
81
+
82
+ ## Model Architecture
83
+
84
+ The model uses the Llama 3 architecture with the following key components:
85
+ - 8B parameter configuration
86
+ - Enhanced attention mechanisms from Llama 3
87
+ - ChatQA 1.5 instruction-tuning framework
88
+ - Insurance domain-specific adaptations
89
+
90
+ ## Files in Repository
91
+
92
+ - **Model Files:**
93
+ - `model-00001-of-00004.safetensors` (4.98 GB)
94
+ - `model-00002-of-00004.safetensors` (5 GB)
95
+ - `model-00003-of-00004.safetensors` (4.92 GB)
96
+ - `model-00004-of-00004.safetensors` (1.17 GB)
97
+ - `model.safetensors.index.json` (24 kB)
98
+
99
+ - **Tokenizer Files:**
100
+ - `tokenizer.json` (17.2 MB)
101
+ - `tokenizer_config.json` (51.3 kB)
102
+ - `special_tokens_map.json` (335 Bytes)
103
+
104
+ - **Configuration Files:**
105
+ - `config.json` (738 Bytes)
106
+ - `generation_config.json` (143 Bytes)
107
+
108
+ ## Use Cases
109
+
110
+ This model is specifically designed for:
111
+ - Insurance policy understanding and explanation
112
+ - Claims processing assistance
113
+ - Coverage analysis
114
+ - Insurance terminology clarification
115
+ - Policy comparison and recommendations
116
+ - Risk assessment queries
117
+ - Insurance compliance questions
118
+
119
+ ## Limitations
120
+
121
+ - The model's knowledge is limited to its training data cutoff
122
+ - Should not be used as a replacement for professional insurance advice
123
+ - May occasionally generate plausible-sounding but incorrect information
124
+
125
+ ## Bias and Ethics
126
+
127
+ This model should be used with awareness that:
128
+ - It may reflect biases present in insurance industry training data
129
+ - Output should be verified by insurance professionals for critical decisions
130
+ - It should not be used as the sole basis for insurance decisions
131
+ - The model's responses should be treated as informational, not as legal or professional advice
132
+
133
+ ## Citation and Attribution
134
+
135
+ If you use this model in your research or applications, please cite:
136
+ ```
137
+ @misc{maharajwala2024openinsurance,
138
+ author = {Raj Maharajwala},
139
+ title = {Open-Insurance-LLM-Llama3-8B},
140
+ year = {2024},
141
+ publisher = {HuggingFace},
142
+ url = {https://huggingface.co/Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B}
143
+ }
144
+ ```