Biclustering of Adverse Drug Events in the FDA's Spontaneous Reporting System

In this article, we present a new pharmacovigilance data mining technique based on the biclustering paradigm, which is designed to identify drug groups that share a common set of adverse events (AEs) in the spontaneous reporting system (SRS) of the US Food and Drug Administration (FDA). A taxonomy of biclusters is developed, revealing that a significant number of bona fide adverse drug event (ADE) biclusters have been identified. Statistical tests indicate that it is extremely unlikely that the bicluster structures thus discovered, as well as their content, could have arisen by mere chance. Some of the biclusters classified as indeterminate provide support for previously unrecognized and potentially novel ADEs. In addition, we demonstrate the potential importance of the proposed methodology in several important aspects of pharmacovigilance such as providing insight into the etiology of ADEs, facilitating the identification of novel ADEs, suggesting methods and a rationale for aggregating terminologies, highlighting areas of focus, and providing an exploratory tool for data mining.

[1]  Joseph M. Tonning,et al.  Perspectives on the Use of Data Mining in Pharmacovigilance , 2005, Drug safety.

[2]  Lehana Thabane,et al.  Application of data mining techniques in pharmacovigilance. , 2003, British journal of clinical pharmacology.

[3]  K. Solomons,et al.  Toxicity with selective serotonin reuptake inhibitors. , 2005, The American journal of psychiatry.

[4]  A. Fliri,et al.  Analysis of drug-induced effect patterns to link structure and side effects of medicines , 2005, Nature chemical biology.

[5]  William DuMouchel,et al.  Bayesian Data Mining in Large Frequency Tables, with an Application to the FDA Spontaneous Reporting System , 1999 .

[6]  E. Brown,et al.  Effects of Coding Dictionary on Signal Generation , 2002, Drug safety.

[7]  D. Classen,et al.  Adverse drug events in hospitalized patients. Excess length of stay, extra costs, and attributable mortality. , 1997, JAMA.

[8]  Manfred Hauben,et al.  ‘Extreme Duplication’ in the US FDA Adverse Events Reporting System Database , 2007, Drug safety.

[9]  G. Niklas Norén,et al.  Duplicate detection in adverse drug reaction surveillance , 2007, Data Mining and Knowledge Discovery.

[10]  R. I. Breuer Chlorpromazine hepatotoxicity manifested by a selective and sustained rise of serum alkaline phosphatase activity , 1965, The American Journal of Digestive Diseases.

[11]  Lothar Thiele,et al.  A systematic comparison and evaluation of biclustering methods for gene expression data , 2006, Bioinform..

[12]  W. DuMouchel,et al.  Novel Statistical Tools for Monitoring the Safety of Marketed Drugs , 2007, Clinical pharmacology and therapeutics.

[13]  M. Hauben,et al.  Data mining for signals in spontaneous reporting databases: proceed with caution , 2007, Pharmacoepidemiology and drug safety.

[14]  D. Madigan,et al.  The role of data mining in pharmacovigilance , 2005, Expert opinion on drug safety.

[15]  A. Bate,et al.  Quantitative signal detection using spontaneous ADR reporting , 2009, Pharmacoepidemiology and drug safety.

[16]  George Hripcsak,et al.  Automated encoding of clinical documents based on natural language processing. , 2004, Journal of the American Medical Informatics Association : JAMIA.

[17]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  Eckart Zitzler,et al.  BicAT: a biclustering analysis toolbox , 2006, Bioinform..

[19]  S D Small,et al.  The costs of adverse drug events in hospitalized patients. Adverse Drug Events Prevention Study Group. , 1998, JAMA.

[20]  R. O’Neill,et al.  Use of Screening Algorithms and Computer Systems to Efficiently Signal Higher-Than-Expected Combinations of Drugs and Events in the US FDA’s Spontaneous Reports Database , 2002, Drug safety.

[21]  Hans-Peter Kriegel,et al.  Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering , 2009, TKDD.