Towards Automatic ICD Coding via Knowledge Enhanced Multi-Task Learning

The aim of ICD coding is to assign International Classification of Diseases (ICD) codes to unstructured clinical notes or discharge summaries. Numerous methods have been proposed for automatic ICD coding in an effort to reduce human labor and errors. However, existing works disregard the data imbalance problem of clinical notes. In addition, the noisy clinical note issue has not been thoroughly investigated. To address such issues, we propose a knowledge enhanced Graph Attention Network (GAT) under multi-task learning setting. Specifically, multi-level information transitions and interactions have been implemented. On the one hand, a large heterogeneous text graph is constructed to capture both intra- and inter-note correlations between various semantic concepts, thereby alleviating the data imbalance issue. On the other hand, two auxiliary healthcare tasks have been proposed to facilitate the sharing of information across tasks. Moreover, to tackle the issue of noisy clinical notes, we propose to utilize the rich structured knowledge facts and information provided by medical domain knowledge, thereby encouraging the model to focus on the clinical notes' noteworthy portion and valuable information. The experimental results on the widely-used medical dataset, MIMIC-III, demonstrate the advantages of our proposed framework.

[1]  X. Wu,et al.  Doctor Specific Tag Recommendation for Online Medical Record Management , 2023, KDD.

[2]  Ruiming Tang,et al.  Single-shot Feature Selection for Multi-task Recommendations , 2023, SIGIR.

[3]  Zijian Zhang,et al.  AutoSTL: Automated Spatio-Temporal Multi-Task Learning , 2023, AAAI.

[4]  Yong Zhang,et al.  IMF: Interactive Multimodal Fusion Model for Link Prediction , 2023, WWW.

[5]  Kun Gai,et al.  Multi-Task Recommendations with Reinforcement Learning , 2023, WWW.

[6]  Songfang Huang,et al.  Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding , 2022, ACL.

[7]  Enhong Chen,et al.  Interaction-aware Drug Package Recommendation via Policy Gradient , 2022, ACM Trans. Inf. Syst..

[8]  G. Qi,et al.  Conditional Generation Net for Medication Recommendation , 2022, WWW.

[9]  Shang-Chi Tsai,et al.  Modeling Diagnostic Label Correlation for Automatic ICD Coding , 2021, NAACL.

[10]  Xiaoli Li,et al.  HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification , 2021, ACM Trans. Inf. Syst..

[11]  Jimeng Sun,et al.  SafeDrug: Dual Molecular Graph Encoders for Recommending Effective and Safe Drug Combinations , 2021, IJCAI.

[12]  Erik Cambria,et al.  Multitask Recalibrated Aggregation Network for Medical Code Prediction , 2021, ECML/PKDD.

[13]  Enhong Chen,et al.  Drug Package Recommendation via Interaction-aware Graph Induction , 2021, WWW.

[14]  Wei Chu,et al.  Question Directed Graph Attention Network for Numerical Reasoning over Text , 2020, EMNLP.

[15]  Pengtao Xie,et al.  Generalized Zero-Shot Text Classification for ICD Coding , 2020, IJCAI.

[16]  N. Razavian,et al.  BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining , 2020, CLINICALNLP.

[17]  Anthony N. Nguyen,et al.  Inferring Degrees from Incomplete Networks and Nonlinear Dynamics , 2020, IJCAI.

[18]  Fei Li,et al.  ICD Coding from Clinical Text Using Multi-Filter Residual Convolutional Neural Network , 2019, AAAI.

[19]  Philip S. Yu,et al.  EHR Coding with Multi-scale Feature Attention and Structured Knowledge Graph Propagation , 2019, CIKM.

[20]  Yong Zhang,et al.  Hierarchical Inter-Attention Network for Document Classification with Multi-Task Learning , 2019, IJCAI.

[21]  Wei Zhang,et al.  Knowledge-Aware Deep Dual Networks for Text-Based Mortality Prediction , 2019, 2019 IEEE 35th International Conference on Data Engineering (ICDE).

[22]  Yuan Luo,et al.  Graph Convolutional Networks for Text Classification , 2018, AAAI.

[23]  Jimeng Sun,et al.  GAMENet: Graph Augmented MEmory Networks for Recommending Medication Combination , 2018, AAAI.

[24]  Jimeng Sun,et al.  Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review , 2018, J. Am. Medical Informatics Assoc..

[25]  Diego Marcheggiani,et al.  Exploiting Semantics in Neural Machine Translation with Graph Convolutional Networks , 2018, NAACL.

[26]  Jianxin Li,et al.  Large-Scale Hierarchical Text Classification with Recursively Regularized Deep Graph-CNN , 2018, WWW.

[27]  Jimeng Sun,et al.  Explainable Prediction of Medical Codes from Clinical Text , 2018, NAACL.

[28]  Svetha Venkatesh,et al.  Dual Control Memory Augmented Neural Networks for Treatment Recommendations , 2018, PAKDD.

[29]  Qinghua Zheng,et al.  Knowledge Guided Short-Text Classification for Healthcare Applications , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[30]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[31]  Walter Daelemans,et al.  Selecting relevant features from the electronic health record for clinical code prediction , 2017, J. Biomed. Informatics.

[32]  Michael Elhadad,et al.  Multi-Label Classification of Patient Notes a Case Study on ICD Code Assignment , 2017, AAAI 2017.

[33]  Jimeng Sun,et al.  LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity , 2017, KDD.

[34]  Qiang Yang,et al.  A Survey on Multi-Task Learning , 2017, IEEE Transactions on Knowledge and Data Engineering.

[35]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[36]  Nanyun Peng,et al.  Cross-Sentence N-ary Relation Extraction with Graph LSTMs , 2017, TACL.

[37]  Oladimeji Farri,et al.  Condensed Memory Networks for Clinical Diagnostic Inferencing , 2016, AAAI.

[38]  Florian Schmidt,et al.  Neural Document Embeddings for Intensive Care Patient Mortality Prediction , 2016, NIPS 2016.

[39]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[40]  Jimeng Sun,et al.  RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism , 2016, NIPS.

[41]  B. Koopman,et al.  Automatic ICD-10 classification of cancers from free-text death certificates , 2015, Int. J. Medical Informatics.

[42]  Yuan Lu,et al.  An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records , 2015, Artif. Intell. Medicine.

[43]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[44]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[45]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[46]  Frank D. Wood,et al.  Diagnosis code assignment: models and evaluation metrics , 2013, J. Am. Medical Informatics Assoc..

[47]  N. Ahuja,et al.  Robust visual tracking via multi-task sparse learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[49]  Olivier Bodenreider,et al.  From indexing the biomedical literature to coding clinical text: experience with MTI and machine learning approaches , 2007, BioNLP@ACL.

[50]  B. Gage,et al.  Accuracy of ICD-9-CM Codes for Identifying Cardiovascular and Stroke Risk Factors , 2005, Medical care.

[51]  Berthier A. Ribeiro-Neto,et al.  A hierarchical approach to the automatic categorization of medical documents , 1998, CIKM '98.

[52]  Rich Caruana,et al.  Multitask Learning , 1997, Machine Learning.

[53]  Tong Zhou,et al.  Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism , 2021, ACL.

[54]  Jimeng Sun,et al.  Fusion: Towards Automated ICD Coding via Feature Compression , 2021, FINDINGS.