Collaborative Prediction Model of Disease Risk by Mining Electronic Health Records

Patient Electronic Health Records (EHR) is one of the major carriers for conducting preventative medicine research. However, the heterogeneous and longitudinal properties make EHRs analysis an inherently challenge. To address this issue, this paper proposes CAPM, a Collaborative Assessment Prediction Model based on patient temporal graph representation, which relies only on a patient EHRs using ICD-10 codes to predict future disease risks. Firstly, we develop a temporal graph for each patient EHRs. Secondly, CAPM uses hybrid collaborative filtering approach to predict each patient’s greatest disease risks based on their own medical history and that of similar patients. Moreover, we also calculate the onset risk with the corresponding diseases in order to take action at the earliest signs. Finally, we present experimental results on a real world EHR dataset, demonstrating that CAPM performs well at capturing future disease and its onset risks.

[1]  Yizhou Sun,et al.  Reciprocal recommendation system for online dating , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[2]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[3]  Kai Zhang,et al.  Temporal Skeletonization on Sequential Data: Patterns, Categorization, and Visualization , 2016, IEEE Transactions on Knowledge and Data Engineering.

[4]  Soon Ae Chun,et al.  Collaborative and trajectory prediction models of medical conditions by mining patients' Social Data , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[5]  Darcy A. Davis,et al.  Predicting individual disease risk based on medical history , 2008, CIKM '08.

[6]  Anthony K. H. Tung,et al.  Contextual crowd intelligence , 2014, SKDD.

[7]  Nitesh V. Chawla,et al.  HealthCareND: leveraging EHR and CARE for prospective healthcare , 2010, IHI.

[8]  Fei Wang,et al.  A methodology for interactive mining and visual analysis of clinical event patterns using electronic health record data , 2014, J. Biomed. Informatics.

[9]  Nitesh V. Chawla,et al.  Time to CARE: a collaborative engine for practical disease prediction , 2010, Data Mining and Knowledge Discovery.

[10]  Fei Wang,et al.  Supervised patient similarity measure of heterogeneous patient records , 2012, SKDD.

[11]  Fei Wang,et al.  From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records , 2014, KDD.

[12]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[13]  Asmaa S. Hussein,et al.  Smart collaboration framework for managing chronic disease using recommender system , 2014 .