obi
/

deid_bert_i2b2

Token Classification

deidentification

Inference Endpoints

Model card Files Files and versions Community

prajwal967 commited on Feb 16, 2022

Commit

8ceb898

·

1 Parent(s): abdcb27

add brackets

Files changed (1) hide show

README.md +7 -3

README.md CHANGED Viewed

@@ -23,9 +23,9 @@ license: mit
 # Model Description
-* A ClinicalBERT [Alsentzer et al., 2019](https://arxiv.org/pdf/1904.03323.pdf) model fine-tuned for de-identification of medical notes.
 * Sequence Labeling (token classification): The model was trained to predict protected health information (PHI/PII) entities (spans). A list of protected health information categories is given by [HIPAA](https://www.hhs.gov/hipaa/for-professionals/privacy/laws-regulations/index.html).
-* A token can either be classified as non-PHI or as one of the 11 PHI types. Token predictions can be aggregated to span (e.g., making use of BILOU tagging).
 * The PHI labels that were used for training and other details can be found here: [Annotation Guidelines](https://github.com/obi-ml-public/ehr_deidentification/blob/master/AnnotationGuidelines.md)
 * More details on how to use this model, the format of data and other useful information is present in the GitHub repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).
@@ -42,7 +42,7 @@ license: mit
 # Dataset
-* The I2B2 2014 [Stubbs and Uzuner, 2015](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978170/) dataset was used to train this model.
 |           | I2B2                  |            |  I2B2                |            |
 | --------- | --------------------- | ---------- | -------------------- | ---------- |
@@ -81,3 +81,7 @@ license: mit
     * Dropout: 0.1
 # Results

 # Model Description
+* A ClinicalBERT [[Alsentzer et al., 2019]](https://arxiv.org/pdf/1904.03323.pdf) model fine-tuned for de-identification of medical notes.
 * Sequence Labeling (token classification): The model was trained to predict protected health information (PHI/PII) entities (spans). A list of protected health information categories is given by [HIPAA](https://www.hhs.gov/hipaa/for-professionals/privacy/laws-regulations/index.html).
+* A token can either be classified as non-PHI or as one of the 11 PHI types. Token predictions are aggregated to spans by making use of BILOU tagging.
 * The PHI labels that were used for training and other details can be found here: [Annotation Guidelines](https://github.com/obi-ml-public/ehr_deidentification/blob/master/AnnotationGuidelines.md)
 * More details on how to use this model, the format of data and other useful information is present in the GitHub repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).
 # Dataset
+* The I2B2 2014 [[Stubbs and Uzuner, 2015]](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4978170/) dataset was used to train this model.
 |           | I2B2                  |            |  I2B2                |            |
 | --------- | --------------------- | ---------- | -------------------- | ---------- |
     * Dropout: 0.1
 # Results
+# Questions?
+Post a Github issue on the repo: [Robust DeID](https://github.com/obi-ml-public/ehr_deidentification).