Mining Heterogeneous Information Graph for Health Status Classification

In the medical domain, there exists a large volume of data from multiple sources such as electronic health records, general health examination results and surveys. The data contain useful information reflecting people's health and provides great opportunities for studies to improve the quality of healthcare. However, how to mine these data effectively and efficiently still remains a critical challenge. In this paper, we propose an innovative classification model for knowledge discovery from patients' personal health repositories. By based on analytics of massive data in National Health and Nutrition Examination Survey, the study builds a classification model to classify patients' health status and reveal the specific disease potentially suffered by the patient. This paper makes significant contributions to the advancement of knowledge in data mining with an innovative classification model specifically crafted for domain-based data. Moreover, this research contributes to the healthcare community by providing a deep understanding of people's health with accessibility to the patterns in various observations.

[1]  Tracy Hall,et al.  Comparing the performance of fault prediction models which report multiple performance measures: recomputing the confusion matrix , 2012, PROMISE '12.

[2]  Shahram Ebadollahi,et al.  Toward personalized care management of patients at risk: the diabetes case study , 2011, KDD.

[3]  Philippe Salembier,et al.  Precision-Recall-Classification Evaluation Framework: Application to Depth Estimation on Single Images , 2014, ECCV.

[4]  Douglas P. Wagner,et al.  Acute physiology and chronic health evaluation (APACHE II) and Medicare reimbursement , 1984, Health care financing review.

[5]  TaeHyun Hwang,et al.  A Heterogeneous Label Propagation Algorithm for Disease Gene Discovery , 2010, SDM.

[6]  Luca Malcovati,et al.  Revised international prognostic scoring system for myelodysplastic syndromes. , 2012, Blood.

[7]  Jae-Kwon Kim,et al.  Adaptive mining prediction model for content recommendation to coronary heart disease patients , 2014, Cluster Computing.

[8]  Jiawei Han,et al.  Ranking-based classification of heterogeneous information networks , 2011, KDD.

[9]  Jackson T. Wright,et al.  2014 evidence-based guideline for the management of high blood pressure in adults: report from the panel members appointed to the Eighth Joint National Committee (JNC 8). , 2014, JAMA.

[10]  Ching-Hsue Cheng,et al.  A predictive model for cerebrovascular disease using data mining , 2011, Expert Syst. Appl..

[11]  Philip S. Yu,et al.  Spectral clustering for multi-type relational data , 2006, ICML.

[12]  Bernard C. Jiang,et al.  Using data mining techniques for multi-diseases prediction modeling of hypertension and hyperlipidemia by common risk factors , 2011, Expert Syst. Appl..

[13]  E. LESTER SMITH,et al.  AND OTHERS , 2005 .

[14]  M. G. Myriam Hunink Decision Making in Health and Medicine: Preface , 2014 .

[15]  Yizhou Sun,et al.  Ranking-based clustering of heterogeneous information networks with star network schema , 2009, KDD.

[16]  Jeff A. Bilmes,et al.  Label Selection on Graphs , 2009, NIPS.

[17]  Swati A. Bhavsar,et al.  Mining Health Examination Records - A Graph Based Approach , 2019 .

[18]  Marco Loog,et al.  Active learning using uncertainty information , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[19]  Y. Miyazaki,et al.  [Revised international prognostic scoring system (IPSS-R) for myelodysplastic syndromes]. , 2013, [Rinsho ketsueki] The Japanese journal of clinical hematology.

[20]  Chien-Chung Chan,et al.  Predicting disease by using data mining based on healthcare information system , 2012, 2012 IEEE International Conference on Granular Computing.

[21]  Yizhou Sun,et al.  Graph Regularized Transductive Classification on Heterogeneous Information Networks , 2010, ECML/PKDD.

[22]  F. Collins,et al.  A new initiative on precision medicine. , 2015, The New England journal of medicine.

[23]  Milos Hauskrecht,et al.  Learning classification models with soft-label information , 2014, J. Am. Medical Informatics Assoc..

[24]  Chen Luo,et al.  HetPathMine: A Novel Transductive Classification Algorithm on Heterogeneous Information Networks , 2014, ECIR.

[25]  Bekele Afessa,et al.  Comparison of APACHE III, APACHE IV, SAPS 3, and MPM0III and influence of resuscitation status on model performance. , 2012, Chest.