A pattern discovery framework for adverse event evaluation and inference in spontaneous reporting systems

Safety of medical products is a major public health concern. We present a critical discussion of the currently used analytical tools for mining spontaneous reporting systems (SRS) to identify safety signals after use of medical products. We introduce a pattern discovery framework for the analysis of SRS. The terminology 'pattern discovery' is borrowed from the engineering and artificial intelligence literature and signifies that the basis of the proposed framework is the medical case, formal- izing the cognitive paradigm known to clinicians who evaluate individual patients and individual case safety reports submitted to SRS. The fundamental contribution of this approach is a strong probabilistic component that may account for selection and other biases and facilitates rigorous modeling and inference. We discuss somewhat in depth the concept of signal in pharmacovigilance and connect it with the concept of a pattern; we illustrate this conceptual framework using the example of anaphylaxis. Finally, we propose a research agenda in statistics, informatics, and pharmacovigilance practices needed to advance the pattern discovery framework in both the short and long terms. © 2014 Wiley Periodicals, Inc. Statistical Analysis and Data Mining, 2014

[1]  N. Laird,et al.  Incidence of Adverse Drug Events and Potential Adverse Drug Events: Implications for Prevention , 1995 .

[2]  Marianthi Markatou,et al.  Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection , 2011, J. Am. Medical Informatics Assoc..

[3]  S D Small,et al.  The costs of adverse drug events in hospitalized patients. Adverse Drug Events Prevention Study Group. , 1998, JAMA.

[4]  Maria-Florina Balcan,et al.  A theory of learning with similarity functions , 2008, Machine Learning.

[5]  D. Classen,et al.  Adverse drug events in hospitalized patients. Excess length of stay, extra costs, and attributable mortality. , 1997, JAMA.

[6]  D Madigan,et al.  Logistic Regression in Signal Detection: Another Piece Added to the Puzzle , 2013, Clinical pharmacology and therapeutics.

[7]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[8]  G. Peter,et al.  Intussusception, rotavirus, and oral vaccines: summary of a workshop. , 2002, Pediatrics.

[9]  Alan J. Forster,et al.  A systematic review to evaluate the accuracy of electronic adverse drug event detection , 2012, J. Am. Medical Informatics Assoc..

[10]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[11]  A. Tversky Features of Similarity , 1977 .

[12]  R Ball,et al.  Simulating adverse event spontaneous reporting systems as preferential attachment networks , 2014, Applied Clinical Informatics.

[13]  Guy E. Thwaites,et al.  The Diagnosis and Management of Tuberculous Meningitis , 2002, Practical Neurology.

[14]  Taxiarchis Botsis,et al.  Application of Information Retrieval Approaches to Case Classification in the Vaccine Adverse Event Reporting System , 2013, Drug Safety.

[15]  David R. Cox,et al.  On the consequences of overstratification , 2008 .

[16]  P. McCullagh Sampling bias and logistic models , 2008 .

[17]  D. Bates,et al.  Preventable adverse drug events in hospitalized patients: a comparative study of intensive care and general care units. , 1997, Critical care medicine.

[18]  Maria-Florina Balcan,et al.  Improved Guarantees for Learning via Similarity Functions , 2008, COLT.

[19]  Robert Ball,et al.  Network Analysis of Possible Anaphylaxis Cases Reported to the US Vaccine Adverse Event Reporting System after H1N1 Influenza Vaccine , 2011, MIE.

[20]  Santosh S. Vempala,et al.  A discriminative framework for clustering via similarity functions , 2008, STOC.

[21]  Peter J. Bickel,et al.  The Earth Mover's distance is the Mallows distance: some insights from statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[22]  Jun Li,et al.  Thresholding Events of Extreme in Simultaneous Monitoring of Multiple Risks , 2009 .

[23]  D. Madigan,et al.  The role of data mining in pharmacovigilance , 2005, Expert opinion on drug safety.

[24]  Jan Bonhoeffer,et al.  Anaphylaxis: case definition and guidelines for data collection, analysis, and presentation of immunization safety data. , 2007, Vaccine.

[25]  Stefan Arnborg,et al.  A bayesian recurrent neural network for unsupervised pattern recognition in large incomplete data sets , 2005, Int. J. Neural Syst..

[26]  C Chuang-Stein,et al.  Organization and analysis of safety data using a multivariate approach. , 1992, Statistics in medicine.

[27]  D. Clayton A model for association in bivariate life tables and its application in epidemiological studies of familial tendency in chronic disease incidence , 1978 .

[28]  D. W. Goodall A New Similarity Index Based on Probability , 1966 .

[29]  J Lellouch,et al.  Power and weakness of spontaneous reporting: a probabilistic approach. , 1992, Journal of clinical epidemiology.

[30]  William DuMouchel,et al.  Bayesian Data Mining in Large Frequency Tables, with an Application to the FDA Spontaneous Reporting System , 1999 .

[31]  F Thiessard,et al.  False Discovery Rate Estimation for Frequentist Pharmacovigilance Signal Detection Methods , 2010, Biometrics.

[32]  Pamela W Ewan,et al.  ABC of allergies: Anaphylaxis , 1998 .

[33]  C. Manski Anatomy of the Selection Problem , 1989 .

[34]  Jesse A Berlin,et al.  Recommendations for safety planning, data collection, evaluation and reporting during drug, biologic and vaccine development: a report of the safety planning, evaluation, and reporting team , 2009, Clinical trials.

[35]  N. Laird,et al.  Incidence of adverse drug events and potential adverse drug events , 1995 .

[36]  Fei Wang,et al.  Case-based reasoning in comparative effectiveness research , 2012, IBM J. Res. Dev..

[37]  P. Scheinmann,et al.  Vaccine allergy and pseudo-allergy. , 2003, European journal of dermatology : EJD.

[38]  Claudia Czado,et al.  Pair Copula Constructions for Multivariate Discrete Data , 2012 .

[39]  Robert Ball,et al.  Using simulation to assess the sensitivity and specificity of a signal detection tool for multidimensional public health surveillance data. , 2005, Statistics in medicine.

[40]  Michael Schwenk,et al.  Anaphylaxis , 1919, The Indian medical gazette.

[41]  R Ball,et al.  Can Network Analysis Improve Pattern Recognition Among Adverse Events Following Immunization Reported to VAERS? , 2011, Clinical pharmacology and therapeutics.

[42]  A. Egberts,et al.  Adverse drug events in hospitalized patients A comparison of doctors, nurses and patients as sources of reports , 1999, European Journal of Clinical Pharmacology.

[43]  R. Tiwari,et al.  A Likelihood Ratio Test Based Method for Signal Detection With Application to FDA’s Drug Safety Data , 2011 .

[44]  T. M. F. Smith,et al.  On the validity of interferences from non-random samples , 1983 .

[45]  Andrew Bate,et al.  Terminological Challenges in Safety Surveillance , 2012, Drug Safety.

[46]  George Hripcsak,et al.  A statistical methodology for analyzing co-occurrence data from a large sample , 2007, J. Biomed. Informatics.

[47]  D. Classen,et al.  Adverse drug events in hospitalized patients. Excess length of stay, extra costs, and attributable mortality. , 1997, JAMA.

[48]  N. Shah,et al.  Performance of Pharmacovigilance Signal‐Detection Algorithms for the FDA Adverse Event Reporting System , 2013, Clinical pharmacology and therapeutics.

[49]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[50]  Duncan C. L. Fitzwilliams,et al.  Intussusception , 1903, The Hospital.

[51]  S. Evans,et al.  Use of proportional reporting ratios (PRRs) for signal generation from spontaneous adverse drug reaction reports , 2001, Pharmacoepidemiology and drug safety.

[52]  Johan Hopstadius,et al.  Impact of Stratification on Adverse Drug Reaction Surveillance , 2008, Drug safety.

[53]  D. Bates,et al.  The Costs of Adverse Drug Events in Hospitalized Patients , 1997 .

[54]  Marián Boguñá,et al.  Popularity versus similarity in growing networks , 2011, Nature.

[55]  D. Madigan,et al.  Empirical assessment of methods for risk identification in healthcare data: results from the experiments of the Observational Medical Outcomes Partnership , 2012, Statistics in medicine.

[56]  J J Rybacki,et al.  Adverse drug events in hospitalized patients. , 1997, JAMA.

[57]  Pravin K. Trivedi,et al.  Using Trivariate Copulas to Model Sample Selection and Treatment Effects , 2006 .

[58]  B. Schwartz,et al.  Intussusception among infants given an oral rotavirus vaccine. , 2001, The New England journal of medicine.

[59]  Christy Chuang-Stein,et al.  Safety Analysis in Controlled Clinical Trials , 1998 .

[60]  Patrick B. Ryan,et al.  Evaluation of alternative standardized terminologies for medical conditions within a network of observational healthcare databases , 2012, J. Biomed. Informatics.

[61]  Manfred Hauben,et al.  Defining ‘Signal’ and its Subtypes in Pharmacovigilance Based on a Systematic Review of Previous Definitions , 2009, Drug safety.

[62]  Jelena Savović,et al.  Methods for Causality Assessment of Adverse Drug Reactions , 2008, Drug safety.

[63]  J. Heckman Sample selection bias as a specification error , 1979 .

[64]  Marie Lindquist,et al.  The Need for Definitions in Pharmacovigilance , 2007, Drug safety.

[65]  朱丽琳,et al.  Anaphylaxis , 2006 .

[66]  J. Gower A General Coefficient of Similarity and Some of Its Properties , 1971 .

[67]  J. Copas,et al.  Inference for Non‐random Samples , 1997 .

[68]  F Thiessard,et al.  Pharmacovigilance Data Mining With Methods Based on False Discovery Rates: A Comparative Simulation Study , 2010, Clinical pharmacology and therapeutics.

[69]  D. Oakes,et al.  Bivariate survival models induced by frailties , 1989 .

[70]  David J. Hand,et al.  Pattern Discovery and Detection: A Unified Statistical Methodology , 2004 .

[71]  R. Holman,et al.  Lack of an apparent association between intussusception and wild or vaccine rotavirus infection. , 1998, The Pediatric infectious disease journal.

[72]  W. Amery Why there is a need for pharmacovigilance , 1999, Pharmacoepidemiology and drug safety.

[73]  P Ryan,et al.  Novel Data‐Mining Methodologies for Adverse Drug Event Discovery and Analysis , 2012, Clinical pharmacology and therapeutics.

[74]  D. Madigan,et al.  Evaluating the impact of database heterogeneity on observational study results. , 2013, American journal of epidemiology.

[75]  S D Small,et al.  The incident reporting system does not detect adverse drug events: a problem for quality improvement. , 1995, The Joint Commission journal on quality improvement.

[76]  George Hripcsak,et al.  Use abstracted patient-specific features to assist an information-theoretic measurement to assess similarity between medical cases , 2008, J. Biomed. Informatics.

[77]  Thomas P. Gross,et al.  The US Food and Drug Administration's Sentinel Initiative: Expanding the horizons of medical product safety , 2012, Pharmacoepidemiology and drug safety.

[78]  Peter C Austin,et al.  The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies , 2010, Statistics in medicine.

[79]  Arvind Madaan,et al.  Vaccine allergy: diagnosis and management. , 2003, Immunology and allergy clinics of North America.