KEPTlongfomer is a medical knowledge enhanced version of Longformer that was further pre-trained using contrastive learning. The model achieves SOTA performance on auto ICD coding on MIMIC-III as of 11/12/2022. A sister model for better performance is available here.
Pre-training
We initialized this model from clinical longformer.
And then pretrained with Hierarchical Self-Alignment Pretrain (HSAP) using Knowledge Graph UMLS. This includes (a) Hierarchy, (b) Synonym, (c) Abbreviation. For more info, see section 3.3 in paper. The learning rate was 5e-5, weight decay was 0.01, adam epsilon was 1e-5.
Usage
See our github for how to use this with prompts on auto ICD coding.
With the following result:
Metric | Score |
---|---|
rec_micro | =0.5729403619819988 |
rec_macro | =0.11342156911120573 |
rec_at_8 | =0.4094837705486378 |
rec_at_75 | =0.8470734920535119 |
rec_at_50 | =0.8005338782352 |
rec_at_5 | =0.2891628170355805 |
rec_at_15 | =0.5768778119750537 |
prec_micro | =0.6411968713105065 |
prec_macro | =0.12227610414493029 |
prec_at_8 | =0.7760972716488731 |
prec_at_75 | =0.197504942665085 |
prec_at_50 | =0.2768090154211151 |
prec_at_5 | =0.8483392645314354 |
prec_at_15 | =0.6178529062870699 |
f1_micro | =0.6051499904242899 |
f1_macro | =0.11768251595637802 |
f1_at_8 | =0.536107150495997 |
f1_at_75 | =0.32032290907137506 |
f1_at_50 | =0.411373195944102 |
f1_at_5 | =0.43131028155283435 |
f1_at_15 | =0.5966627077602488 |
auc_micro | =0.9651754312635265 |
auc_macro | =0.8566590059725866 |
acc_micro | =0.43384592341105344 |
acc_macro | =0.08639139221100567 |
- Downloads last month
- 305
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.