Time to CARE: a collaborative engine for practical disease prediction

The monumental cost of health care, especially for chronic disease treatment, is quickly becoming unmanageable. This crisis has motivated the drive towards preventative medicine, where the primary concern is recognizing disease risk and taking action at the earliest signs. However, universal testing is neither time nor cost efficient. We propose CARE, a Collaborative Assessment and Recommendation Engine, which relies only on patient’s medical history using ICD-9-CM codes in order to predict future disease risks. CARE uses collaborative filtering methods to predict each patient’s greatest disease risks based on their own medical history and that of similar patients. We also describe an Iterative version, ICARE, which incorporates ensemble concepts for improved performance. Also, we apply time-sensitive modifications which make the CARE framework practical for realistic long-term use. These novel systems require no specialized information and provide predictions for medical conditions of all kinds in a single run. We present experimental results on a large Medicare dataset, demonstrating that CARE and ICARE perform well at capturing future disease risks.

[1]  C. T. Orleans,et al.  Does the chronic care model serve also as a template for improving prevention? , 2001, The Milbank quarterly.

[2]  D. Cherry,et al.  National Ambulatory Medical Care Survey: 2001 summary. , 2003, Advance data.

[3]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[4]  A. Barabasi,et al.  Network medicine--from obesity to the "diseasome". , 2007, The New England journal of medicine.

[5]  Yanxi Liu,et al.  DICOVERY OF "BIOMARKERS" FOR ALZHEIMER'S DISEASE PREDICTION FROM STRUCTURAL MR IMAGES , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[6]  Francisco Herrera,et al.  A Prediction System for Cardiovascularity Diseases Using Genetic Fuzzy Rule-Based Systems , 2002, IBERAMIA.

[7]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[8]  Kenneth Y. Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.

[9]  David Maxwell Chickering,et al.  Dependency Networks for Inference, Collaborative Filtering, and Data Visualization , 2000, J. Mach. Learn. Res..

[10]  R. Snyderman,et al.  Prospective Medicine: The Next Health Care Transformation , 2003, Academic medicine : journal of the Association of American Medical Colleges.

[11]  Carolyn L. Rochester,et al.  MORTALITY AFTER THE HOSPITALIZATION OF A SPOUSE , 2006 .

[12]  W. Kannel,et al.  Factors of risk in the development of coronary heart disease--six year follow-up experience. The Framingham Study. , 1961, Annals of internal medicine.

[13]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[14]  Leroy Hood,et al.  Systems biology, proteomics, and the future of health care: toward predictive, preventative, and personalized medicine. , 2004, Journal of proteome research.

[15]  Darcy A. Davis,et al.  Predicting individual disease risk based on medical history , 2008, CIKM '08.

[16]  David Edelman,et al.  A multidimensional integrative medicine intervention to improve cardiovascular risk , 2006, Journal of General Internal Medicine.

[17]  Dunja Mladenic,et al.  kNN Versus SVM in the Collaborative Filtering Framework , 2006, Data Science and Classification.

[18]  W. Knaus,et al.  Predicting outcome in critical care: the current status of the APACHE prognostic scoring system , 1991, Canadian journal of anaesthesia = Journal canadien d'anesthesie.

[19]  L. Muhlbaier,et al.  Using Medicare Claims for Outcomes Research , 1994, Medical care.

[20]  H P Hartung,et al.  Use of interferon beta in multiple sclerosis: rationale for early treatment and evidence for dose- and frequency-dependent effects on clinical response , 2002, Multiple sclerosis.

[21]  D. Lauderdale,et al.  Epidemiologic uses of Medicare data. , 1993, Epidemiologic reviews.

[22]  J A Knottnerus,et al.  Multimorbidity in general practice: prevalence, incidence, and determinants of co-occurring chronic and recurrent diseases. , 1998, Journal of clinical epidemiology.

[23]  A. Barabasi,et al.  Human disease classification in the postgenomic era: A complex systems approach to human pathobiology , 2007, Molecular systems biology.

[24]  Luo Si,et al.  Flexible Mixture Model for Collaborative Filtering , 2003, ICML.

[25]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[26]  Thomas Hofmann,et al.  Latent semantic models for collaborative filtering , 2004, TOIS.

[27]  P. Allison,et al.  Mortality after the hospitalization of a spouse. , 2006, The New England journal of medicine.

[28]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[29]  Larry Scherwitz,et al.  Improvement in medical risk factors and quality of life in women and men with coronary artery disease in the Multicenter Lifestyle Demonstration Project. , 2003, The American journal of cardiology.

[30]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[31]  Richard F Mould Prediction of long-term survival rates of cancer patients , 2003, The Lancet.

[32]  Eric Horvitz,et al.  Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach , 2000, UAI.

[33]  J. Loscalzo Association studies in an era of too much information: clinical analysis of new biomarker and genetic data. , 2007, Circulation.

[34]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[35]  Charles E. Kahn Collaborative Filtering to Improve Navigation of Large Radiology Knowledge Resources , 2004, Journal of Digital Imaging.

[36]  Ying Wang,et al.  Genomewide association study of leprosy. , 2009, The New England journal of medicine.

[37]  Emily White,et al.  Physician recommendations for dietary change: their prevalence and impact in a population-based sample. , 1995, American journal of public health.

[38]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[39]  Christopher B Forrest,et al.  Comorbidity: Implications for the Importance of Primary Care in ‘Case’ Management , 2003, The Annals of Family Medicine.

[40]  Dunja Mladenic,et al.  Data Sparsity Issues in the Collaborative Filtering Framework , 2005, WEBKDD.

[41]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.