A Method for Mining Infrequent Causal Associations and Its Application in Finding Adverse Drug Reaction Signal Pairs

In many real-world applications, it is important to mine causal relationships where an event or event pattern causes certain outcomes with low probability. Discovering this kind of causal relationships can help us prevent or correct negative outcomes caused by their antecedents. In this paper, we propose an innovative data mining framework and apply it to mine potential causal associations in electronic patient data sets where the drug-related events of interest occur infrequently. Specifically, we created a novel interestingness measure, exclusive causal-leverage, based on a computational, fuzzy recognition-primed decision (RPD) model that we previously developed. On the basis of this new measure, a data mining algorithm was developed to mine the causal relationship between drugs and their associated adverse drug reactions (ADRs). The algorithm was tested on real patient data retrieved from the Veterans Affairs Medical Center in Detroit, Michigan. The retrieved data included 16,206 patients (15,605 male, 601 female). The exclusive causal-leverage was employed to rank the potential causal associations between each of the three selected drugs (i.e., enalapril, pravastatin, and rosuvastatin) and 3,954 recorded symptoms, each of which corresponded to a potential ADR. The top 10 drug-symptom pairs for each drug were evaluated by the physicians on our project team. The numbers of symptoms considered as likely real ADRs for enalapril, pravastatin, and rosuvastatin were 8, 7, and 6, respectively. These preliminary results indicate the usefulness of our method in finding potential ADR signal pairs for further analysis (e.g., epidemiology study) and investigation (e.g., case review) by drug safety professionals.

[1]  Fabrice Guillet,et al.  Knowledge-Based Interactive Postmining of Association Rules Using Ontologies , 2010, IEEE Transactions on Knowledge and Data Engineering.

[2]  Yen-Liang Chen,et al.  Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism , 2004, Decision Support Systems.

[3]  H. Prade,et al.  FUZZY PATTERN MATCHING , 1982 .

[4]  William DuMouchel,et al.  Bayesian Data Mining in Large Frequency Tables, with an Application to the FDA Spontaneous Reporting System , 1999 .

[5]  W. Inman,et al.  Under-reporting of adverse drug reactions. , 1985, British medical journal.

[6]  Amedeo Napoli,et al.  Towards Rare Itemset Mining , 2007, 19th IEEE International Conference on Tools with Artificial Intelligence(ICTAI 2007).

[7]  Luigi Troiano,et al.  A Fast Algorithm for Mining Rare Itemsets , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[8]  S. Wolfe,et al.  Timing of new black box warnings and withdrawals for prescription medications. , 2002, JAMA.

[9]  Amedeo Napoli,et al.  Finding Minimal Rare Itemsets and Rare Association Rules , 2010, KSEM.

[10]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[11]  Chengqi Zhang,et al.  Association Rule Mining , 2002, Lecture Notes in Computer Science.

[12]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[13]  Wynne Hsu,et al.  Mining association rules with multiple minimum supports , 1999, KDD '99.

[14]  Gary M. Weiss Mining with rarity: a unifying framework , 2004, SKDD.

[15]  Robert Orchard,et al.  Fuzzy Reasoning in JESS: The Fuzzyj Toolkit and Fuzzyjess , 2001, ICEIS.

[16]  Peter A. Flach,et al.  Rule Evaluation Measures: A Unifying View , 1999, ILP.

[17]  Gernot A. Fink,et al.  Markov Models for Pattern Recognition: From Theory to Applications , 2007 .

[18]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[19]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[20]  Jie Chen,et al.  Mining Unexpected Temporal Associations: Applications in Detecting Adverse Drug Reactions , 2008, IEEE Transactions on Information Technology in Biomedicine.

[21]  Joseph M. Tonning,et al.  Pharmacovigilance in the 21st Century: New Systematic Tools for an Old Problem , 2004, Pharmacotherapy.

[22]  A Bate,et al.  From association to alert—a revised approach to international signal analysis , 1999, Pharmacoepidemiology and drug safety.

[23]  Willi Klösgen,et al.  Explora: A Multipattern and Multistrategy Discovery Assistant , 1996, Advances in Knowledge Discovery and Data Mining.

[24]  Tai-Wen Yue,et al.  A Q'tron Neural-Network Approach to Solve the Graph Coloring Problems , 2007 .

[25]  Howard J. Hamilton,et al.  Interestingness measures for data mining: A survey , 2006, CSUR.

[26]  M.H. Hassoun,et al.  Fundamentals of Artificial Neural Networks , 1996, Proceedings of the IEEE.

[27]  Yanqing Ji,et al.  A fuzzy recognition-primed decision model-based causal association mining algorithm for detecting adverse drug reactions in postmarketing surveillance , 2010, International Conference on Fuzzy Systems.

[28]  A. Wall,et al.  Book ReviewTo Err is Human: building a safer health system Kohn L T Corrigan J M Donaldson M S Washington DC USA: Institute of Medicine/National Academy Press ISBN 0 309 06837 1 $34.95 , 2000 .

[29]  P. Waller,et al.  Stephens' detection of new adverse drug reactions , 2004 .

[30]  Isabelle Guyon,et al.  Causality : Objectives and Assessment , 2010 .

[31]  Keun Ho Ryu,et al.  Mining association rules on significant rare data using relative support , 2003, J. Syst. Softw..

[32]  Anthony K. H. Tung,et al.  Efficient Mining of Intertransaction Association Rules , 2003, IEEE Trans. Knowl. Data Eng..

[33]  Shamkant B. Navathe,et al.  Text Mining and Ontology Applications in Bioinformatics and GIS , 2007, International Conference on Machine Learning and Applications.

[34]  E. D. Barnhart Physicians Desk Reference , 1990 .

[35]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[36]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[37]  Manfred Hauben,et al.  Early Postmarketing Drug Safety Surveillance: Data Mining Points to Consider , 2004, The Annals of pharmacotherapy.

[38]  Gregory F. Cooper,et al.  A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships , 1997, Data Mining and Knowledge Discovery.

[39]  John Yen,et al.  A Distributed, Collaborative Intelligent Agent System Approach for Proactive Postmarketing Drug Safety Surveillance , 2010, IEEE Transactions on Information Technology in Biomedicine.

[40]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[41]  S. Evans,et al.  Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports , 2001, Pharmacoepidemiology and drug safety.

[42]  C. Marano,et al.  To err is human. Building a safer health system , 2005 .

[43]  Hao Ying,et al.  Fuzzy Control and Modeling: Analytical Foundations and Applications , 2000 .

[44]  Max Kuhn,et al.  Postmarketing safety information: how useful are spontaneous reports? , 1999, Pharmacoepidemiology and drug safety.

[45]  Yanqing Ji,et al.  A Potential Causal Association Mining Algorithm for Screening Adverse Drug Reactions in Postmarketing Surveillance , 2011, IEEE Transactions on Information Technology in Biomedicine.

[46]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[47]  John Yen,et al.  A fuzzy logic-based computational recognition-primed decision model , 2007, Inf. Sci..

[48]  J. Ghosh Causality: Models, Reasoning and Inference, Second Edition by Judea Pearl , 2011 .

[49]  P. Corey,et al.  Incidence of Adverse Drug Reactions in Hospitalized Patients , 2012 .

[50]  Yun Sing Koh,et al.  Finding Sporadic Rules Using Apriori-Inverse , 2005, PAKDD.

[51]  G. Niklas Norén,et al.  Temporal pattern discovery in longitudinal electronic patient records , 2010, Data Mining and Knowledge Discovery.

[52]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[53]  Lei Wu,et al.  Rare Itemset Mining , 2007, Sixth International Conference on Machine Learning and Applications (ICMLA 2007).