Mining Infrequent Causal Associations in Electronic Health Databases

Discovering infrequent causal relationships can help us prevent or correct negative outcomes caused by their antecedents. In this paper, we propose an innovative data mining framework and apply it to mine potential causal associations in electronic patient datasets where the drug-related events of interest occur infrequently. Specifically, we created a novel interestingness measure, exclusive causal-leverage, based on a computational, fuzzy recognition-primed decision (RPD) model that we previously developed. On the basis of this new measure, a data mining algorithm was developed to mine the causal relationship between drugs and their associated adverse drug reactions (ADRs). The algorithm was tested on real patient data retrieved from the Veterans Affairs Medical Center in Detroit, Michigan. The exclusive causal-leverage was employed to rank the potential causal associations between each of the two selected drugs (i.e., enalapril and pravastatin) and 3,954 recorded symptoms, each of which corresponds to a potential ADR. The top 10 drug-symptom pairs for each drug were evaluated by our physicians on the project team. The results showed that the number of symptoms considered as real ADRs for enalapril and pravastatin was 8 and 7 out of 10, respectively.

[1]  P. Waller,et al.  Stephens' detection of new adverse drug reactions , 2004 .

[2]  Robert Orchard,et al.  Fuzzy Reasoning in JESS: The Fuzzyj Toolkit and Fuzzyjess , 2001, ICEIS.

[3]  J. Woodcock,et al.  The safety of newly approved medicines: do recent market removals mean there is a problem? , 1999, JAMA.

[4]  W. Inman,et al.  Under-reporting of adverse drug reactions. , 1985, British medical journal.

[5]  S. Wolfe,et al.  Timing of new black box warnings and withdrawals for prescription medications. , 2002, JAMA.

[6]  Jie Chen,et al.  Mining Unexpected Temporal Associations: Applications in Detecting Adverse Drug Reactions , 2008, IEEE Transactions on Information Technology in Biomedicine.

[7]  Gary Klein,et al.  Strategies of Decision Making , 1989 .

[8]  Peter A. Flach,et al.  Rule Evaluation Measures: A Unifying View , 1999, ILP.

[9]  Willi Klösgen,et al.  Explora: A Multipattern and Multistrategy Discovery Assistant , 1996, Advances in Knowledge Discovery and Data Mining.

[10]  Gregory F. Cooper,et al.  A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships , 1997, Data Mining and Knowledge Discovery.

[11]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[12]  G. Niklas Norén,et al.  Temporal pattern discovery in longitudinal electronic patient records , 2010, Data Mining and Knowledge Discovery.

[13]  C. Marano,et al.  To err is human. Building a safer health system , 2005 .

[14]  D. Klein,et al.  The Flawed Basis for FDA Post-Marketing Safety Decisions: The Example of Anti-Depressants and Children , 2006, Neuropsychopharmacology.

[15]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[16]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[17]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[18]  P. Corey,et al.  Incidence of Adverse Drug Reactions in Hospitalized Patients , 2012 .

[19]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[20]  David R. Jones,et al.  Bayesian methods in health technology assessment: a review. , 2000, Health technology assessment.

[21]  John Yen,et al.  A fuzzy logic-based computational recognition-primed decision model , 2007, Inf. Sci..

[22]  John Yen,et al.  A Distributed, Collaborative Intelligent Agent System Approach for Proactive Postmarketing Drug Safety Surveillance , 2010, IEEE Transactions on Information Technology in Biomedicine.

[23]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[24]  S. Evans,et al.  Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports , 2001, Pharmacoepidemiology and drug safety.