A Multi-relational Association Mining Algorithm for Screening Suspected Adverse Drug Reactions

Existing association mining algorithms generally assume that the data is in a single table (relation). One approach to mining multi-relational data tables is to convert the data into a single table and then apply the existing algorithms. However, the converted table may be too large to fit into memory. Moreover, these algorithms often need structures to store large intermediate data, which further restricts them by available memory. In this study, we developed an efficient SQL-based algorithm that directly dealt with multi-relational data tables that need less allocated memory. We also investigated how database indexes and the number of connections affect the performance of such an algorithm. The proposed algorithm was tested using data from the FDA's (Food and Drug Administration) spontaneous reporting system. The data collected was used for detecting potential adverse drug reactions (ADRs) which represent a serious worldwide problem. Our experiment results indicate that the algorithm performs well and is scalable in terms of the number of association rules that are evaluated and the size of the data.

[1]  Daisuke Koide,et al.  Comparison of data mining methodologies using Japanese spontaneous reports , 2004, Pharmacoepidemiology and drug safety.

[2]  David Madigan,et al.  Disproportionality methods for pharmacovigilance in longitudinal observational databases , 2013, Statistical methods in medical research.

[3]  Joseph M. Tonning,et al.  Pharmacovigilance in the 21st Century: New Systematic Tools for an Old Problem , 2004, Pharmacotherapy.

[4]  S. Evans,et al.  Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports , 2001, Pharmacoepidemiology and drug safety.

[5]  Martijn J. Schuemie,et al.  Using Electronic Health Care Records for Drug Safety Signal Detection: A Comparative Evaluation of Statistical Methods , 2012, Medical care.

[6]  G. Niklas Norén,et al.  Temporal pattern discovery in longitudinal electronic patient records , 2010, Data Mining and Knowledge Discovery.

[7]  Jie Chen,et al.  Signaling Potential Adverse Drug Reactions from Administrative Health Databases , 2010, IEEE Transactions on Knowledge and Data Engineering.

[8]  Yanqing Ji,et al.  A Method for Mining Infrequent Causal Associations and Its Application in Finding Adverse Drug Reaction Signal Pairs , 2013, IEEE Transactions on Knowledge and Data Engineering.

[9]  International Drug Monitoring the Role of the Hospital — A WHO Report , 1970, World Health Organization technical report series.

[10]  Salvatore Orlando,et al.  Fast and memory efficient mining of frequent closed itemsets , 2006, IEEE Transactions on Knowledge and Data Engineering.

[11]  Frantz Thiessard,et al.  Evaluation of statistical association measures for the automatic signal generation in pharmacovigilance , 2005, IEEE Transactions on Information Technology in Biomedicine.

[12]  Lisheng Ma,et al.  An Efficient Algorithm for Frequent Closed Itemsets Mining , 2008, 2008 International Conference on Computer Science and Software Engineering.

[13]  Mohammed J. Zaki,et al.  GenMax: An Efficient Algorithm for Mining Maximal Frequent Itemsets , 2005, Data Mining and Knowledge Discovery.

[14]  Abdallah Alashqur,et al.  RDB-MINER: A SQL-Based Algorithm for Mining True Relational Databases , 2010, J. Softw..

[15]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[16]  Willi Klösgen,et al.  Explora: A Multipattern and Multistrategy Discovery Assistant , 1996, Advances in Knowledge Discovery and Data Mining.

[17]  Peter A. Flach,et al.  Rule Evaluation Measures: A Unifying View , 1999, ILP.

[18]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[19]  P. Waller,et al.  Stephens' detection of new adverse drug reactions , 2004 .

[20]  Jiawei Han,et al.  TFP: an efficient algorithm for mining top-k frequent closed itemsets , 2005, IEEE Transactions on Knowledge and Data Engineering.

[21]  M. Hauben,et al.  Quantitative Methods in Pharmacovigilance , 2003, Drug safety.

[22]  W. Inman,et al.  Under-reporting of adverse drug reactions. , 1985, British medical journal.

[23]  J. O’Donnell Detection of New Adverse Drug Reactions, 4th Edition , 1999 .

[24]  P. Purcell,et al.  Statistical Techniques for Signal Generation , 2002, Drug safety.

[25]  Ehud Gudes,et al.  Association rules mining in vertically partitioned databases , 2006, Data Knowl. Eng..

[26]  J. Woodcock,et al.  The safety of newly approved medicines: do recent market removals mean there is a problem? , 1999, JAMA.

[27]  PeregoRaffaele,et al.  Fast and Memory Efficient Mining of Frequent Closed Itemsets , 2006 .

[28]  P Ryan,et al.  Novel Data‐Mining Methodologies for Adverse Drug Event Discovery and Analysis , 2012, Clinical pharmacology and therapeutics.

[29]  C. Raehl,et al.  Adverse Drug Reactions in United States Hospitals , 2006, Pharmacotherapy.