Update README.md
Browse files
README.md
CHANGED
@@ -5,15 +5,15 @@ language:
|
|
5 |
base_model: microsoft/phi-2
|
6 |
pipeline_tag: text-generation
|
7 |
---
|
8 |
-
https://arxiv.org/abs/1710.06071
|
9 |
-
# Model Card for Model ID
|
10 |
![](ft_sections.png)
|
11 |
|
12 |
-
This is a small language model designed for scientific research. It
|
13 |
This makes it easier and faster for researchers to understand and organize important information from clinical studies.
|
14 |
|
15 |
## Model Details
|
16 |
|
|
|
|
|
17 |
- **Developed by: Salvatore Saporito
|
18 |
- **Language(s) (NLP):** English
|
19 |
- **Finetuned from model:** https://huggingface.co/microsoft/phi-2
|
@@ -24,7 +24,7 @@ This makes it easier and faster for researchers to understand and organize impor
|
|
24 |
|
25 |
## Uses
|
26 |
|
27 |
-
Automatic identification of sections in (clinical trial) abstracts.
|
28 |
|
29 |
## How to Get Started with the Model
|
30 |
|
@@ -50,6 +50,17 @@ Usage:
|
|
50 |
text = tokenizer.batch_decode(outputs,skip_special_tokens=True)[0]
|
51 |
print(text)
|
52 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
53 |
|
54 |
## Training Details
|
55 |
|
@@ -66,7 +77,7 @@ Generation of dedicated prompt for Causal_LM modelling.
|
|
66 |
|
67 |
#### Training Hyperparameters
|
68 |
|
69 |
-
bnb_config = BitsAndBytesConfig(load_in_4bit=True,
|
70 |
bnb_4bit_quant_type='nf4',
|
71 |
bnb_4bit_compute_dtype=torch.bfloat16,
|
72 |
bnb_4bit_use_double_quant=True)
|
@@ -83,13 +94,7 @@ bnb_config = BitsAndBytesConfig(load_in_4bit=True,
|
|
83 |
|
84 |
#### Metrics
|
85 |
|
86 |
-
|
87 |
-
|
88 |
-
### Results
|
89 |
-
|
90 |
-
|
91 |
-
#### Summary
|
92 |
-
|
93 |
|
94 |
## Technical Specifications [optional]
|
95 |
|
@@ -115,4 +120,9 @@ bnb_config = BitsAndBytesConfig(load_in_4bit=True,
|
|
115 |
|
116 |
torch einops transformers bitsandbytes accelerate peft
|
117 |
|
118 |
-
## Model Card Contact
|
|
|
|
|
|
|
|
|
|
|
|
5 |
base_model: microsoft/phi-2
|
6 |
pipeline_tag: text-generation
|
7 |
---
|
|
|
|
|
8 |
![](ft_sections.png)
|
9 |
|
10 |
+
This is a small language model designed for scientific research application. It is fine tuned to analyzing randomized clinical trial abstracts and to classify sentences into four key sections: Background, Methods, Results, and Conclusion.
|
11 |
This makes it easier and faster for researchers to understand and organize important information from clinical studies.
|
12 |
|
13 |
## Model Details
|
14 |
|
15 |
+
|
16 |
+
|
17 |
- **Developed by: Salvatore Saporito
|
18 |
- **Language(s) (NLP):** English
|
19 |
- **Finetuned from model:** https://huggingface.co/microsoft/phi-2
|
|
|
24 |
|
25 |
## Uses
|
26 |
|
27 |
+
Automatic identification of sections in (randomized clinical trial) abstracts.
|
28 |
|
29 |
## How to Get Started with the Model
|
30 |
|
|
|
50 |
text = tokenizer.batch_decode(outputs,skip_special_tokens=True)[0]
|
51 |
print(text)
|
52 |
|
53 |
+
|
54 |
+
Example:
|
55 |
+
Application on unseen data (
|
56 |
+
|
57 |
+
)
|
58 |
+
|
59 |
+
PROMPT: ['###Unstruct:\nKawasaki disease (KD) is a systemic vasculitis that causes abnormalities in the coronary arteries. Interleukin (IL)-41 is a novel immunoregulatory cytokine involved in the pathogenesis of some inflammatory and immune-related diseases. However, the role of IL-41 in KD is unclear. The purpose of this study was to detect the expression of IL-41 in the plasma of children with KD and its relationship with the disease.\nA total of 44 children with KD and 37 healthy controls (HC) were recruited for this study. Plasma concentrations of IL-41 were determined by ELISA. Correlations between plasma IL-41 levels and KD-related clinical parameters were analyzed by Pearson correlation and multivariate linear regression analysis. Receiver operating characteristic curve analysis was used to assess the clinical value of IL-41 in the diagnosis of KD.\nOur results showed that plasma IL-41 levels were significantly elevated in children with KD compared with HC. Correlation analysis demonstrated that IL-41 levels were positively correlated with D-dimer and N-terminal pro-B-type natriuretic peptide, and negatively correlated with IgM, mean corpuscular hemoglobin concentration, total protein, albumin and pre-albumin. Multivariable linear regression analysis revealed that IgM and mean corpuscular hemoglobin concentrations were associated with IL-41. Receiver operating characteristic curve analysis showed that the area under the curve of IL-41 was 0.7101, with IL-41 providing 88.64 % sensitivity and 54.05 % specificity.\nOur study indicated that plasma IL-41 levels in children with KD were significantly higher than those in HC, and may provide a potential diagnostic biomarker for KD.\n###Struct:\n
|
60 |
+
|
61 |
+
Answer Phi2_RCT1M-ft-heading:
|
62 |
+
|
63 |
+
BACKGROUND: Kawasaki disease (KD) is a systemic vasculitis that causes abnormalities in the coronary arteries. Interleukin (IL)-41 is a novel immunoregulatory cytokine involved in the pathogenesis of some inflammatory and immune-related diseases. However, the role of IL-41 in KD is unclear. The purpose of this study was to detect the expression of IL-41 in the plasma of children with KD and its relationship with the disease. METHODS: A total of 44 children with KD and 37 healthy controls (HC) were recruited for this study. Plasma concentrations of IL-41 were determined by ELISA. Correlations between plasma IL-41 levels and KD-related clinical parameters were analyzed by Pearson correlation and multivariate linear regression analysis. Receiver operating characteristic curve analysis was used to assess the clinical value of IL-41 in the diagnosis of KD. RESULTS: Our results showed that plasma IL-41 levels were significantly elevated in children with KD compared with HC. Correlation analysis demonstrated that IL-41 levels were positively correlated with D-dimer and N-terminal pro-B-type natriuretic peptide, and negatively correlated with IgM, mean corpuscular hemoglobin concentration, total protein, albumin and pre-albumin. Multivariable linear regression analysis revealed that IgM and mean corpuscular hemoglobin concentrations were associated with IL-41. Receiver operating characteristic curve analysis showed that the area under the curve of IL-41 was 0.7101, with IL-41 providing 88.64 % sensitivity and 54.05 % specificity. CONCLUSIONS: Our study indicated that plasma IL-41 levels in children with KD were significantly higher than those in HC, and may provide a potential diagnostic biomarker for KD.']
|
64 |
|
65 |
## Training Details
|
66 |
|
|
|
77 |
|
78 |
#### Training Hyperparameters
|
79 |
|
80 |
+
bnb_config = BitsAndBytesConfig(load_in_4bit=True,
|
81 |
bnb_4bit_quant_type='nf4',
|
82 |
bnb_4bit_compute_dtype=torch.bfloat16,
|
83 |
bnb_4bit_use_double_quant=True)
|
|
|
94 |
|
95 |
#### Metrics
|
96 |
|
97 |
+
Coming soon
|
|
|
|
|
|
|
|
|
|
|
|
|
98 |
|
99 |
## Technical Specifications [optional]
|
100 |
|
|
|
120 |
|
121 |
torch einops transformers bitsandbytes accelerate peft
|
122 |
|
123 |
+
## Model Card Contact
|
124 |
+
|
125 |
+
|
126 |
+
## References
|
127 |
+
|
128 |
+
https://arxiv.org/abs/1710.06071
|