prajwal967
commited on
Commit
·
8ceb898
1
Parent(s):
abdcb27
add brackets
Browse files
README.md
CHANGED
@@ -23,9 +23,9 @@ license: mit
|
|
23 |
|
24 |
# Model Description
|
25 |
|
26 |
-
* A ClinicalBERT [Alsentzer et al., 2019](https://arxiv.org/pdf/1904.03323.pdf) model fine-tuned for de-identification of medical notes.
|
27 |
* Sequence Labeling (token classification): The model was trained to predict protected health information (PHI/PII) entities (spans). A list of protected health information categories is given by [HIPAA](https://www.hhs.gov/hipaa/for-professionals/privacy/laws-regulations/index.html).
|
28 |
-
* A token can either be classified as non-PHI or as one of the 11 PHI types. Token predictions
|
29 |
* The PHI labels that were used for training and other details can be found here: [Annotation Guidelines](https://github.com/obi-ml-public/ehr_deidentification/blob/master/AnnotationGuidelines.md)
|
30 |
* More details on how to use this model, the format of data and other useful information is present in the GitHub repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).
|
31 |
|
@@ -42,7 +42,7 @@ license: mit
|
|
42 |
|
43 |
# Dataset
|
44 |
|
45 |
-
* The I2B2 2014 [Stubbs and Uzuner, 2015](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978170/) dataset was used to train this model.
|
46 |
|
47 |
| | I2B2 | | I2B2 | |
|
48 |
| --------- | --------------------- | ---------- | -------------------- | ---------- |
|
@@ -81,3 +81,7 @@ license: mit
|
|
81 |
* Dropout: 0.1
|
82 |
|
83 |
# Results
|
|
|
|
|
|
|
|
|
|
23 |
|
24 |
# Model Description
|
25 |
|
26 |
+
* A ClinicalBERT [[Alsentzer et al., 2019]](https://arxiv.org/pdf/1904.03323.pdf) model fine-tuned for de-identification of medical notes.
|
27 |
* Sequence Labeling (token classification): The model was trained to predict protected health information (PHI/PII) entities (spans). A list of protected health information categories is given by [HIPAA](https://www.hhs.gov/hipaa/for-professionals/privacy/laws-regulations/index.html).
|
28 |
+
* A token can either be classified as non-PHI or as one of the 11 PHI types. Token predictions are aggregated to spans by making use of BILOU tagging.
|
29 |
* The PHI labels that were used for training and other details can be found here: [Annotation Guidelines](https://github.com/obi-ml-public/ehr_deidentification/blob/master/AnnotationGuidelines.md)
|
30 |
* More details on how to use this model, the format of data and other useful information is present in the GitHub repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).
|
31 |
|
|
|
42 |
|
43 |
# Dataset
|
44 |
|
45 |
+
* The I2B2 2014 [[Stubbs and Uzuner, 2015]](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978170/) dataset was used to train this model.
|
46 |
|
47 |
| | I2B2 | | I2B2 | |
|
48 |
| --------- | --------------------- | ---------- | -------------------- | ---------- |
|
|
|
81 |
* Dropout: 0.1
|
82 |
|
83 |
# Results
|
84 |
+
|
85 |
+
# Questions?
|
86 |
+
|
87 |
+
Post a Github issue on the repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).
|