Methodological approaches in analysing observational data: A practical example on how to address clustering and selection bias.

BACKGROUND Because not every scientific question on effectiveness can be answered with randomised controlled trials, research methods that minimise bias in observational studies are required. Two major concerns influence the internal validity of effect estimates: selection bias and clustering. Hence, to reduce the bias of the effect estimates, more sophisticated statistical methods are needed. AIM To introduce statistical approaches such as propensity score matching and mixed models into representative real-world analysis and to conduct the implementation in statistical software R to reproduce the results. Additionally, the implementation in R is presented to allow the results to be reproduced. METHOD We perform a two-level analytic strategy to address the problems of bias and clustering: (i) generalised models with different abilities to adjust for dependencies are used to analyse binary data and (ii) the genetic matching and covariate adjustment methods are used to adjust for selection bias. Hence, we analyse the data from two population samples, the sample produced by the matching method and the full sample. RESULTS The different analysis methods in this article present different results but still point in the same direction. In our example, the estimate of the probability of receiving a case conference is higher in the treatment group than in the control group. Both strategies, genetic matching and covariate adjustment, have their limitations but complement each other to provide the whole picture. CONCLUSION The statistical approaches were feasible for reducing bias but were nevertheless limited by the sample used. For each study and obtained sample, the pros and cons of the different methods have to be weighted.

[1]  Jasjeet S. Sekhon,et al.  Genetic Optimization Using Derivatives , 2011, Political Analysis.

[2]  S Greenland,et al.  Interpretation and choice of effect measures in epidemiologic analyses. , 1987, American journal of epidemiology.

[3]  Michael G. Kenward,et al.  Multiple Imputation and its Application , 2013 .

[4]  D. De Graeve,et al.  Can we account for selection bias? A comparison between bare metal and drug-eluting stents. , 2011, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[5]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[6]  Maria Blettner,et al.  Avoiding bias in observational studies: part 8 in a series of articles on evaluation of scientific publications. , 2009, Deutsches Arzteblatt international.

[7]  G V Ostir,et al.  Logistic Regression: A Nontechnical Review , 2000, American journal of physical medicine & rehabilitation.

[8]  B. Holle,et al.  Structural characteristics of specialised living units for people with dementia: a cross-sectional study in German nursing homes , 2014, International Journal of Mental Health Systems.

[9]  N. Burns,et al.  The Practice of Nursing Research: Appraisal, Synthesis, and Generation of Evidence - Seventh edition Grove Susan K The Practice of Nursing Research: Appraisal, Synthesis, and Generation of Evidence - Seventh edition 752pp Elsevier 9781455707362 1455707368 [Formula: see text]. , 2008, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[10]  G. Collins,et al.  Double-adjustment in propensity score matching analysis: choosing a threshold for considering residual imbalance , 2017, BMC Medical Research Methodology.

[11]  S. Valeika Interpretation of odds and risk ratios. , 2014, Journal of veterinary internal medicine.

[12]  Massimo Cannas,et al.  Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score , 2016, Statistics in medicine.

[13]  J. Hamers,et al.  Are Staffing, Work Environment, Work Stressors, and Rationing of Care Related to Care Workers' Perception of Quality of Care? A Cross-Sectional Study. , 2015, Journal of the American Medical Directors Association.

[14]  Ofer Harel,et al.  Asymptotically Unbiased Estimation of Exposure Odds Ratios in Complete Records Logistic Regression , 2015, American journal of epidemiology.

[15]  Jasjeet S. Sekhon,et al.  Multivariate and Propensity Score Matching Software with Automated Balance Optimization: The Matching Package for R , 2008 .

[16]  Bruno Arpino,et al.  The specification of the propensity score in multilevel observational studies , 2011, Comput. Stat. Data Anal..

[17]  L. Copp Nursing Research: Position Statement. , 1981 .

[18]  W. Stroup Generalized Linear Mixed Models: Modern Concepts, Methods and Applications , 2012 .

[19]  Allan Donner,et al.  Confidence Interval Estimation of the Intraclass Correlation Coefficient for Binary Outcome Data , 2004, Biometrics.

[20]  James Carpenter,et al.  Propensity scores: From naïve enthusiasm to intuitive understanding , 2012, Statistical methods in medical research.

[21]  Justus J. Randolph,et al.  A Step-by-Step Guide to Propensity Score Matching in R , 2014 .

[22]  Peter C Austin,et al.  The performance of different propensity score methods for estimating marginal hazard ratios , 2007, Statistics in medicine.

[23]  D. Rubin Matched Sampling for Causal Effects , 2006 .

[24]  Peter C Austin,et al.  The performance of different propensity score methods for estimating marginal odds ratios, Statistics in Medicine 2007; 26:3078–3094 , 2008 .

[25]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[26]  Peter C Austin,et al.  A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study , 2007, Statistics in medicine.

[27]  B. Giraudeau,et al.  Propensity score methods for estimating relative risks in cluster randomized trials with low‐incidence binary outcomes and selection bias , 2014, Statistics in medicine.

[28]  J. Hardin,et al.  Generalized Estimating Equations , 2002 .

[29]  Frank Yoon,et al.  Variable-ratio matching with fine balance in a study of the Peer Health Exchange. , 2015, Statistics in medicine.

[30]  Fan Li,et al.  Propensity score weighting with multilevel data , 2013, Statistics in medicine.

[31]  Elizabeth A Stuart,et al.  Matching methods for causal inference: A review and a look forward. , 2010, Statistical science : a review journal of the Institute of Mathematical Statistics.

[32]  Ewout W Steyerberg,et al.  Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes , 2011, BMC medical research methodology.

[33]  W. Wong,et al.  Comparison of methods for estimating the intraclass correlation coefficient for binary responses in cancer prevention cluster randomized trials. , 2012, Contemporary clinical trials.

[34]  M Soledad Cepeda,et al.  Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. , 2003, American journal of epidemiology.

[35]  R. D'Agostino Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group. , 2005, Statistics in medicine.

[36]  A. Ridder,et al.  Can we account for selection bias? A comparison between bare metal and drug-eluting stents , 2011 .

[37]  B. Holle,et al.  Differences in Case Conferences in Dementia Specific vs Traditional Care Units in German Nursing Homes: Results from a Cross-Sectional Study. , 2016, Journal of the American Medical Directors Association.

[38]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[39]  Zhiqiang Wang,et al.  Propensity score methods to adjust for confounding in assessing treatment effects: bias and precision , 2008 .

[40]  Luke Keele,et al.  Optimal Multilevel Matching in Clustered Observational Studies: A Case Study of the Effectiveness of Private Schools Under a Large-Scale Voucher System , 2014, 1409.8597.

[41]  Pierfrancesco Agostoni,et al.  Are propensity scores really superior to standard multivariable analysis? , 2011, Contemporary clinical trials.

[42]  J H Ellenberg,et al.  Selection bias in observational and experimental studies. , 1994, Statistics in medicine.

[43]  J. Curtis,et al.  The challenge of selection bias and confounding in palliative care research. , 2009, Journal of palliative medicine.

[44]  Til Stürmer,et al.  A review of the application of propensity score methods yielded increasing use, advantages in specific settings, but not substantially different estimates compared with conventional multivariable methods. , 2006, Journal of clinical epidemiology.

[45]  O. Baser,et al.  Too much ado about propensity score models? Comparing methods of propensity score matching. , 2006, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[46]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[47]  Joseph Hilbe,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2009 .

[48]  Anthonius Boer,et al.  Measuring balance and model selection in propensity score methods , 2011, Pharmacoepidemiology and drug safety.

[49]  N. Burns,et al.  The practice of nursing research , 2013 .

[50]  P. Austin Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples , 2009, Statistics in medicine.