A Clinical Decision Support Framework for Heterogeneous Data Sources

To keep pace with the developments in medical informatics, health medical data is being collected continually. But, owing to the diversity of its categories and sources, medical data has become so complicated in many hospitals that it now needs a clinical decision support (CDS) system for its management. To effectively utilize the accumulating health data, we propose a CDS framework that can integrate heterogeneous health data from different sources such as laboratory test results, basic information of patients, and health records into a consolidated representation of features of all patients. Using the electronic health medical data so created, multilabel classification was employed to recommend a list of diseases and thus assist physicians in diagnosing or treating their patients’ health issues more efficiently. Once the physician diagnoses the disease of a patient, the next step is to consider the likely complications of that disease, which can lead to more diseases. Previous studies reveal that correlations do exist among some diseases. Considering these correlations, a k-nearest neighbors algorithm is improved for multilabel learning by using correlations among labels (CML-kNN). The CML- kNN algorithm first exploits the dependence between every two labels to update the origin label matrix and then performs multilabel learning to estimate the probabilities of labels by using the integrated features. Finally, it recommends the top N diseases to the physicians. Experimental results on real health medical data establish the effectiveness and practicability of the proposed CDS framework.

[1]  Marios Anthimopoulos,et al.  Lung Pattern Classification for Interstitial Lung Diseases Using a Deep Convolutional Neural Network , 2016, IEEE Transactions on Medical Imaging.

[2]  K. Nikita,et al.  Personal Health Systems for Diabetes Management, Early Diagnosis and Prevention , 2019, Consumer-Driven Technologies in Healthcare.

[3]  Peter J. Hunter,et al.  Big Data, Big Knowledge: Big Data for Personalized Healthcare , 2015, IEEE Journal of Biomedical and Health Informatics.

[4]  Mihaela van der Schaar,et al.  Online Transfer Learning for Differential Diagnosis Determination , 2015, AAAI Workshop: WWW and Public Health Intelligence.

[5]  Douglas S. Bell,et al.  Interface design principles for usable decision support: A targeted review of best practices for clinical prescribing interventions , 2012, J. Biomed. Informatics.

[6]  Sreekanth Rallapalli,et al.  Impact of Processing and Analyzing Healthcare Big Data on Cloud Computing Environment by Implementing Hadoop Cluster , 2016 .

[7]  Madhuchhanda Mitra,et al.  Application of Cross Wavelet Transform for ECG Pattern Analysis and Classification , 2014, IEEE Transactions on Instrumentation and Measurement.

[8]  Marc De Hert,et al.  The prevalence of diabetes mellitus type 2 in people with alcohol use disorders: a systematic review and large scale meta-analysis , 2016, Psychiatry Research.

[9]  Francis R. Bach,et al.  Online Learning for Latent Dirichlet Allocation , 2010, NIPS.

[10]  Clement J. McDonald,et al.  What can natural language processing do for clinical decision support? , 2009, J. Biomed. Informatics.

[11]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[12]  Stefania Pieroni,et al.  Personalized medicine: Moving from correlation to causality in breast cancer , 2015 .

[13]  L. Sepiashvili,et al.  Overview of Laboratory Testing and Clinical Presentations of Complement Deficiencies and Dysregulation. , 2016, Advances in clinical chemistry.

[14]  Grigorios Tsoumakas,et al.  Mining Multi-label Data , 2010, Data Mining and Knowledge Discovery Handbook.

[15]  Lindsay I. Smith,et al.  A tutorial on Principal Components Analysis , 2002 .

[16]  Mihaela van der Schaar,et al.  Using Contextual Learning to Improve Diagnostic Accuracy: Application in Breast Cancer Screening , 2016, IEEE Journal of Biomedical and Health Informatics.

[17]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[18]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[19]  Rui Guo,et al.  Classifying syndromes in Chinese medicine using multi-label learning algorithm with relevant features for each label , 2016, Chinese Journal of Integrative Medicine.

[20]  Konstantina S. Nikita,et al.  Guest Editorial: Biomedical Informatics in Clinical Environments , 2015, IEEE Journal of Biomedical and Health Informatics.

[21]  Grigorios Tsoumakas,et al.  An Empirical Study of Lazy Multilabel Classification Algorithms , 2008, SETN.

[22]  P. Kantor Foundations of Statistical Natural Language Processing , 2001, Information Retrieval.

[23]  Katrien Verbert,et al.  Recommender Systems for Health Informatics: State-of-the-Art and Future Perspectives , 2016, Machine Learning for Health Informatics.

[24]  Mário Sarcinelli Filho,et al.  Premature Ventricular beat classification using a dynamic Bayesian Network , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[25]  Yonggang Wen,et al.  Toward Scalable Systems for Big Data Analytics: A Technology Tutorial , 2014, IEEE Access.

[26]  Hareton K. N. Leung,et al.  Incremental Semi-Supervised Clustering Ensemble for High Dimensional Data Clustering , 2016, IEEE Transactions on Knowledge and Data Engineering.

[27]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[28]  C. Anandan,et al.  The Impact of eHealth on the Quality and Safety of Health Care: A Systematic Overview , 2011, PLoS medicine.

[29]  TU MarioHermann Design Principles for Industrie 4 . 0 Scenarios , 2015 .

[30]  Emre Çomak,et al.  A decision support system based on support vector machines for diagnosis of the heart valve diseases , 2007, Comput. Biol. Medicine.

[31]  Yu Tian,et al.  A Shared Decision-Making System for Diabetes Medication Choice Utilizing Electronic Health Record Data , 2017, IEEE Journal of Biomedical and Health Informatics.

[32]  Stephen B. Johnson,et al.  A review of approaches to identifying patient phenotype cohorts using electronic health records , 2013, J. Am. Medical Informatics Assoc..

[33]  P. K. Anooj,et al.  Clinical Decision Support System: Risk Level Prediction of Heart Disease Using Decision Tree Fuzzy Rules , 2022 .

[34]  Li Zhu,et al.  Sparse Representation-Based Heartbeat Classification Using Independent Component Analysis , 2012, Journal of Medical Systems.

[35]  Weider D. Yu,et al.  A Modeling Approach to Big Data Based Recommendation Engine in Modern Health Care Environment , 2015, 2015 IEEE 39th Annual Computer Software and Applications Conference.

[36]  Yu Zhang,et al.  Research on the Smartphone Based eHealth Systems for Strengthing Healthcare Organization , 2016, ICSH.

[37]  Hsiao-Pei Tu,et al.  Association Between Periodontal Disease and Osteoporosis by Gender , 2015, Medicine.

[38]  M. Drazner Contemporary Reviews in Cardiovascular Medicine The Progression of Hypertensive Heart Disease , 2011 .

[39]  Christoph Thuemmler,et al.  Health 4.0: How Virtualization and Big Data are Revolutionizing Healthcare , 2017 .

[40]  Avinash C. Kak,et al.  PCA versus LDA , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  M. Mukaka,et al.  Statistics corner: A guide to appropriate use of correlation coefficient in medical research. , 2012, Malawi medical journal : the journal of Medical Association of Malawi.

[42]  R. Haynes,et al.  Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: Methods of a decision-maker-researcher partnership systematic review , 2010, Implementation science : IS.

[43]  Amanda Clare,et al.  Knowledge Discovery in Multi-label Phenotype Data , 2001, PKDD.

[44]  Rong Jin,et al.  Correlated Label Propagation with Application to Multi-label Learning , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).