MTPGraph: A Data-Driven Approach to Predict Medical Risk Based on Temporal Profile Graph

With the rapid development of information technologies, which facilitates the perfection of healthcare systems, a variety of clinical data is becoming available. The patient Electronic Health Records (EHR) is one of important sources in healthcare data on which conducts personalized medicine. However, it is challenging if the raw EHRs are directly used to conduct related medical prediction due to its heterogeneity, sparsity and the existence of noise. To address this issue, this paper proposes an integrative data driven medical prediction approach called Medical Temporal Profile Graph (MTPGraph). The approach consists of two parts, first of which is a unified representation for each patient raw EHRs, namely patient temporal profile graph. Secondly, based on this representation, an algorithm TRApriori to obtain temporal feature graphs is further developed which is used to reconstruct each patient temporal profiling. The generated coefficient can be efficiently used for executing medical risk prediction. Finally, we validate MTPGraph through two real world clinical scenarios. The experimental results show that the predicted performance of the approach can be improved significantly in both tasks.

[1]  M. Lab,et al.  ARRHYTHMIA IN HEART FAILURE: ROLE OF MECHANICALLY INDUCED CHANGES IN ELECTROPHYSIOLOGY , 1989, The Lancet.

[2]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[3]  Jason Roy,et al.  Prediction Modeling Using EHR Data: Challenges, Strategies, and a Comparison of Machine Learning Approaches , 2010, Medical care.

[4]  Raghunath Nambiar,et al.  A look at challenges and opportunities of Big Data analytics in healthcare , 2013, 2013 IEEE International Conference on Big Data.

[5]  Fei Wang,et al.  Guest editorial: Special issue on data mining for medicine and healthcare , 2015, Data Mining and Knowledge Discovery.

[6]  Anthony K. H. Tung,et al.  Contextual crowd intelligence , 2014, SKDD.

[7]  Ken Williams,et al.  Validation of Prediction of Diabetes by the Archimedes Model and Comparison With Other Predicting Models , 2008, Diabetes Care.

[8]  David Price,et al.  Comparing virtual consults to traditional consults using an electronic health record: an observational case–control study , 2012, BMC Medical Informatics and Decision Making.

[9]  Fei Wang,et al.  A Framework for Mining Signatures from Event Sequences and Its Applications in Healthcare Data , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  M. Ghassemi,et al.  State of the art review: the data revolution in critical care , 2015, Critical Care.

[11]  Samir Chatterjee,et al.  A predictive modeling engine using neural networks: Diabetes management from sensor and activity data , 2012, 2012 IEEE 14th International Conference on e-Health Networking, Applications and Services (Healthcom).

[12]  Fei Wang,et al.  A methodology for interactive mining and visual analysis of clinical event patterns using electronic health record data , 2014, J. Biomed. Informatics.

[13]  Jimeng Sun,et al.  PARAMO: A PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records , 2014, J. Biomed. Informatics.

[14]  Xiaowu Sun,et al.  Development and Validation of a Mortality Risk-Adjustment Model for Patients Hospitalized for Exacerbations of Chronic Obstructive Pulmonary Disease , 2013, Medical care.

[15]  Hui Xiong,et al.  Temporal Phenotyping from Longitudinal Electronic Health Records: A Graph Based Framework , 2015, KDD.

[16]  Jimeng Sun,et al.  Limestone: High-throughput candidate phenotype generation via tensor factorization , 2014, J. Biomed. Informatics.

[17]  Frans Coenen,et al.  Tree-based partitioning of date for association rule mining , 2006, Knowledge and Information Systems.

[18]  Fei Wang,et al.  Supervised patient similarity measure of heterogeneous patient records , 2012, SKDD.

[19]  Fei Wang,et al.  From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records , 2014, KDD.

[20]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.