Bayesian Hierarchical Rule Modeling for Predicting Medical Conditions

We propose a statistical modeling technique, called the Hierarchical Association Rule Model (HARM), that predicts a patient's possible future medical conditions given the patient's current and past history of reported conditions. The core of our technique is a Bayesian hierarchical model for selecting predictive association rules (such as "condition 1 and condition 2 $\rightarrow$ condition 3") from a large set of candidate rules. Because this method "borrows strength" using the conditions of many similar patients, it is able to provide predictions specialized to any given patient, even when little information about the patient's history of conditions is available.

[1]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[2]  N. Keenan,et al.  Prevalence of Coronary Heart Disease Risk Factors and Screening for High Cholesterol Levels Among Young Adults, United States, 1999–2006 , 2010, The Annals of Family Medicine.

[3]  Cynthia Rudin,et al.  Sequential Event Prediction with Association Rules , 2011, COLT.

[4]  F. Vogenberg,et al.  Predictive and prognostic models: implications for healthcare decision-making in a modern recession. , 2009, American health & drug benefits.

[5]  K. Furie,et al.  Heart disease and stroke statistics--2007 update: a report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. , 2008, Circulation.

[6]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[7]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[8]  A. Raftery,et al.  The Mixture Transition Distribution Model for High-Order Markov Chains and Non-Gaussian Time Series , 2002 .

[9]  Liang Zhang,et al.  MODELING ITEM-ITEM SIMILARITIES FOR PERSONALIZED RECOMMENDATIONS ON YAHOO! FRONT PAGE , 2011, 1111.0416.

[10]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[11]  Howard J. Hamilton,et al.  Choosing the Right Lens: Finding What is Interesting in Data Mining , 2007, Quality Measures in Data Mining.

[12]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[13]  Shyam Visweswaran,et al.  Bayesian rule learning for biomedical data mining , 2010, Bioinform..

[14]  L. Hood,et al.  Predictive, personalized, preventive, participatory (P4) cancer medicine , 2011, Nature Reviews Clinical Oncology.

[15]  Cynthia Rudin,et al.  A Learning Theory Framework for Association Rules and Sequential Events , 2011 .

[16]  William DuMouchel,et al.  Empirical bayes screening for multi-item associations , 2001, KDD '01.

[17]  Nitesh V. Chawla,et al.  Time to CARE: a collaborative engine for practical disease prediction , 2010, Data Mining and Knowledge Discovery.

[18]  R. Sacco,et al.  Race-ethnic differences in the association between lipid profile components and risk of myocardial infarction: The Northern Manhattan Study. , 2011, American heart journal.

[19]  Vogenberg Fr Predictive and prognostic models: implications for healthcare decision-making in a modern recession. , 2009 .

[20]  Galit Shmueli,et al.  To Explain or To Predict? , 2010, 1101.0891.

[21]  Cynthia Rudin,et al.  Bayesian Hierarchical Rule Modeling for Predicting Medical Conditions , 2012 .

[22]  Cynthia Rudin,et al.  Sequential event prediction , 2013, Machine Learning.

[23]  Christian Posse,et al.  Bayesian Mixed-Effects Models for Recommender Systems , 1999 .