Collaborative residual learners for automatic ICD10 prediction using prescribed medications

Clinical coding is an administrative process that translates diagnostic data from episodes of care into a standard code format such as ICD10. It has many critical applications, such as billing and aetiology research. Automating clinical coding is very challenging due to data sparsity, low interoperability of digital health systems, and the complexity of real-life diagnosis coupled with the huge size of the ICD10 code space. Related work suffers from low applicability due to reliance on many data sources, inefficient modelling and solutions that generalize poorly. We propose a novel collaborative residual learning based model that automatically predicts ICD10 codes using only prescription data. Extensive experiments were performed on two real-world clinical datasets (outpatient and inpatient) from Maharaj Nakorn Chiang Mai Hospital with real case-mix distributions. For multi-label classification, we obtain an average precision of 0.71 and 0.57 and an F1-score of 0.57 and 0.38 for the inpatient and outpatient datasets respectively. For predicting the principal diagnosis, we obtain an accuracy of 0.73 and 0.44 for the inpatient and outpatient datasets respectively.
