Does design matter? Systematic evaluation of the impact of analytical choices on effect estimates in observational studies

Background: Clinical studies that use observational databases, such as administrative claims and electronic health records, to evaluate the effects of medical products have become commonplace. These studies begin by selecting a particular study design, such as a case control, cohort, or self-controlled design, and different authors can and do choose different designs for the same clinical question. Furthermore, published papers invariably report the study design but do not discuss the rationale for the specific choice. Studies of the same clinical question with different designs, however, can generate different results, sometimes with strikingly different implications. Even within a specific study design, authors make many different analytic choices and these too can profoundly impact results. In this paper, we systematically study heterogeneity due to the type of study design and due to analytic choices within study design. Methods and findings: We conducted our analysis in 10 observational healthcare databases but mostly present our results in the context of the GE Centricity EMR database, an electronic health record database containing data for 11.2 million lives. We considered the impact of three different study design choices on estimates of associations between bisphosphonates and four particular health outcomes for which there is no evidence of an association. We show that applying alternative study designs can yield discrepant results, in terms of direction and significance of association. We also highlight that while traditional univariate sensitivity analysis may not show substantial variation, systematic assessment of all analytical choices within a study design can yield inconsistent results ranging from statistically significant decreased risk to statistically significant increased risk. Our findings show that clinical studies using observational databases can be sensitive both to study design choices and to specific analytic choices within study design. Conclusion: More attention is needed to consider how design choices may be impacting results and, when possible, investigators should examine a wide array of possible choices to confirm that significant findings are consistently identified.

[1]  A R Feinstein,et al.  A collection of 56 topics with contradictory results in case-control research. , 1988, International journal of epidemiology.

[2]  HMG-CoA reductase inhibitors and the risk of fractures. , 2000 .

[3]  C. Cooper,et al.  Use of Statins and Risk of Fractures , 2001 .

[4]  G. Davey Smith,et al.  Epidemiology--is it time to call it a day? , 2001, International journal of epidemiology.

[5]  Sander Greenland,et al.  Monte Carlo sensitivity analysis and Bayesian analysis of smoking as an unmeasured confounder in a study of silica and lung cancer. , 2004, American journal of epidemiology.

[6]  Sander Greenland,et al.  Multiple‐bias modelling for analysis of observational data , 2005 .

[7]  Sonia Hernández-Díaz,et al.  Non-steroidal antiinflammatory drugs and the risk of acute myocardial infarction. , 2006, Basic & clinical pharmacology & toxicology.

[8]  Frank de Vries,et al.  Reanalysis of two studies with contrasting results on the association between statin use and fracture risk: the General Practice Research Database. , 2006, International journal of epidemiology.

[9]  C. Kirkness,et al.  Assessment of Cardiometabolic Risk Factors in a National Primary Care Electronic Health Record Database , 2007 .

[10]  An analysis of the exclusion criteria used in observational pharmacoepidemiological studies , 2007 .

[11]  S. Pocock,et al.  Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies , 2007, BMJ : British Medical Journal.

[12]  H. Whitaker The self controlled case series method , 2008, BMJ : British Medical Journal.

[13]  J. Brooks Why most published research findings are false: Ioannidis JP, Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece , 2008 .

[14]  J. Gill,et al.  Quality of Lipid Management in Outpatient Care: A National Study Using Electronic Health Records , 2008, American journal of medical quality : the official journal of the American College of Medical Quality.

[15]  J. Avorn,et al.  High-dimensional Propensity Score Adjustment in Studies of Treatment Effects Using Health Care Claims Data , 2009, Epidemiology.

[16]  Liam Smeeth,et al.  Oral Bisphosphonates and Risk of Atrial Fibrillation and Flutter in Women: A Self-Controlled Case-Series Safety Analysis , 2009, PloS one.

[17]  A. Pariente,et al.  Data mining on electronic health record databases for signal detection in pharmacovigilance: which events to monitor? , 2009, Pharmacoepidemiology and drug safety.

[18]  Albert G Crawford,et al.  Comparison of GE Centricity Electronic Medical Record database and National Ambulatory Medical Care Survey findings on the prevalence of major conditions in the United States. , 2010, Population health management.

[19]  Gabriela Czanner,et al.  Oral bisphosphonates and risk of cancer of oesophagus, stomach, and colorectum: case-control analysis within a UK primary care cohort , 2010, BMJ : British Medical Journal.

[20]  J. Overhage,et al.  Advancing the Science for Active Surveillance: Rationale and Design for the Observational Medical Outcomes Partnership , 2010, Annals of Internal Medicine.

[21]  W. Dixon,et al.  Bisphosphonates and esophageal cancer—a pathway through the confusion , 2011, Nature Reviews Rheumatology.

[22]  G. Bedogni Applying Quantitative Bias Analysis to Epidemiologic Data , 2011 .

[23]  C. Lawton Exposure to Oral Bisphosphonates and Risk of Esophageal Cancer , 2011 .

[24]  Sonal Singh,et al.  Comparative cardiovascular effects of thiazolidinediones: systematic review and meta-analysis of observational studies , 2011, BMJ : British Medical Journal.

[25]  Malcolm Maclure,et al.  Design considerations in an active medical product safety monitoring system , 2012, Pharmacoepidemiology and drug safety.

[26]  Patrick B. Ryan,et al.  Validation of a common data model for active safety surveillance research , 2012, J. Am. Medical Informatics Assoc..

[27]  Patrick B. Ryan,et al.  Evaluation of alternative standardized terminologies for medical conditions within a network of observational healthcare databases , 2012, J. Biomed. Informatics.

[28]  D. Madigan,et al.  Empirical assessment of methods for risk identification in healthcare data: results from the experiments of the Observational Medical Outcomes Partnership , 2012, Statistics in medicine.

[29]  Patrick B. Ryan,et al.  Health Outcomes of Interest in Observational Data: Issues in Identifying Definitions in the Literature , 2012 .