Analyses of ‘change scores’ do not estimate causal effects in observational data

BACKGROUND In longitudinal data, it is common to create 'change scores' by subtracting measurements taken at baseline from those taken at follow-up, and then to analyse the resulting 'change' as the outcome variable. In observational data, this approach can produce misleading causal-effect estimates. The present article uses directed acyclic graphs (DAGs) and simple simulations to provide an accessible explanation for why change scores do not estimate causal effects in observational data. METHODS Data were simulated to match three general scenarios in which the outcome variable at baseline was a (i) 'competing exposure' (i.e. a cause of the outcome that is neither caused by nor causes the exposure), (ii) confounder or (iii) mediator for the total causal effect of the exposure variable at baseline on the outcome variable at follow-up. Regression coefficients were compared between change-score analyses and the appropriate estimator(s) for the total and/or direct causal effect(s). RESULTS Change-score analyses do not provide meaningful causal-effect estimates unless the baseline outcome variable is a 'competing exposure' for the effect of the exposure on the outcome at follow-up. Where the baseline outcome is a confounder or mediator, change-score analyses evaluate obscure estimands, which may diverge substantially in magnitude and direction from the total and direct causal effects. CONCLUSION Future observational studies that seek causal-effect estimates should avoid analysing change scores and adopt alternative analytical strategies.

[1]  Stephen Senn,et al.  Change from baseline and analysis of covariance revisited , 2006, Statistics in medicine.

[2]  Maciej Liskiewicz,et al.  Robust causal inference using Directed Acyclic Graphs: the R package , 2018 .

[3]  A. Heppenstall,et al.  Analysing trajectories of a longitudinal exposure: A causal perspective on common methods in lifecourse research , 2019, PloS one.

[4]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[5]  J. Pearl Lord’s Paradox Revisited – (Oh Lord! Kumbaya!) , 2016 .

[6]  J. Robins,et al.  When is baseline adjustment useful in analyses of change? An example with education and cognitive change. , 2005, American journal of epidemiology.

[7]  Van Breukelen Gj ANCOVA versus change from baseline: more power in randomized studies, more bias in nonrandomized studies [corrected]. , 2006 .

[8]  Peter M. Steiner,et al.  Gain Scores Revisited: A Graphical Models Perspective , 2019, Sociological Methods & Research.

[9]  Yunyun Jiang,et al.  Methods for Analysis of Pre-Post Data in Clinical Research: A Comparison of Five Common Methods , 2017, Journal of biometrics & biostatistics.

[10]  P. D. Oldham,et al.  A note on the analysis of repeated measurements of the same subjects. , 1962, Journal of chronic diseases.

[11]  Andrew J Vickers,et al.  Parametric versus non-parametric statistics in the analysis of randomized trials with non-normally distributed data , 2005, BMC medical research methodology.

[12]  Julius Sim,et al.  Bias, precision and statistical power of analysis of covariance in the analysis of randomized trials with baseline imbalance: a simulation study , 2014, BMC Medical Research Methodology.

[13]  Nan M. Laird,et al.  Further Comparative Analyses of Pretest-Posttest Research Designs , 1983 .

[14]  T. VanderWeele Mediation Analysis: A Practitioner's Guide. , 2016, Annual review of public health.

[15]  F. Lord A paradox in the interpretation of group comparisons. , 1967, Psychological bulletin.

[16]  E. Shahar,et al.  Causal diagrams and change variables. , 2012, Journal of evaluation in clinical practice.

[17]  Patrick Royston,et al.  The design of simulation studies in medical statistics , 2006, Statistics in medicine.

[18]  Lorenzo Richiardi,et al.  Mediation analysis in epidemiology: methods, interpretation and bias. , 2013, International journal of epidemiology.

[19]  Yu-Kang Tu,et al.  Revisiting the relation between change and initial value: a review and evaluation , 2007, Statistics in medicine.

[20]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.