The use of percentage change from baseline as an outcome in a controlled trial is statistically inefficient: a simulation study

Background
Many randomized trials involve measuring a continuous outcome (such as pain, body weight or blood pressure) at baseline and after treatment. In this paper, I compare four possibilities for how such trials can be analyzed: post-treatment score alone; change between baseline and post-treatment; percentage change between baseline and post-treatment; and analysis of covariance (ANCOVA) with baseline score as a covariate. The statistical power of each method was determined for a hypothetical randomized trial under a range of correlations between baseline and post-treatment scores.

Results
ANCOVA has the highest statistical power. Change from baseline has acceptable power when the correlation between baseline and post-treatment scores is high; when the correlation is low, analyzing only post-treatment scores has reasonable power. Percentage change from baseline has the lowest statistical power and was highly sensitive to changes in variance. Theoretical considerations suggest that percentage change from baseline will also fail to protect from bias in the case of baseline imbalance and will lead to an excess of trials with non-normally distributed outcome data.

Conclusions
Percentage change from baseline should not be used in statistical analysis. Trialists wishing to report this statistic should use another method, such as ANCOVA, and convert the results to a percentage change by using mean baseline scores.
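The comparison described above can be sketched as a small simulation. This is a minimal illustration, not the author's code. The trial parameters (50 patients per group, baseline mean 50 and SD 10, a 5-point treatment effect), the function names, and the use of a large-sample critical value of 1.96 in place of the exact t quantile are all assumptions made for the example. Each method is expressed as an ordinary least-squares fit of an outcome on a group indicator, with baseline added as a covariate for ANCOVA.

```python
import numpy as np

Z_CRIT = 1.96  # assumption: large-sample approximation to the two-sided 5% cutoff


def group_t_stat(y, group, covariate=None):
    """t statistic for the group coefficient in an OLS fit of y on group.

    Without a covariate this equals the pooled two-sample t statistic;
    with baseline as the covariate it is the ANCOVA test of treatment.
    """
    cols = [np.ones_like(y), group]
    if covariate is not None:
        cols.append(covariate)
    X = np.column_stack(cols)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    dof = len(y) - X.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(X.T @ X)
    return beta[1] / np.sqrt(cov[1, 1])


def simulate_power(rho, n_per_group=50, effect=5.0, mean=50.0, sd=10.0,
                   n_sims=1000, seed=0):
    """Estimate power of four analyses for a given baseline/post correlation.

    All default values are hypothetical choices for illustration.
    """
    rng = np.random.default_rng(seed)
    group = np.repeat([0.0, 1.0], n_per_group)
    hits = dict.fromkeys(["post", "change", "percent_change", "ancova"], 0)
    for _ in range(n_sims):
        base = rng.normal(mean, sd, 2 * n_per_group)
        noise = rng.normal(0.0, sd, 2 * n_per_group)
        # Post-treatment score: correlated with baseline at rho, same SD,
        # lowered by `effect` points in the treated group.
        post = (mean + rho * (base - mean)
                + np.sqrt(1 - rho**2) * noise - effect * group)
        stats = {
            "post": group_t_stat(post, group),
            "change": group_t_stat(post - base, group),
            "percent_change": group_t_stat(100 * (post - base) / base, group),
            "ancova": group_t_stat(post, group, covariate=base),
        }
        for name, t in stats.items():
            hits[name] += int(abs(t) > Z_CRIT)
    return {name: count / n_sims for name, count in hits.items()}


def percent_of_baseline(effect, baseline_scores):
    """Express an absolute effect estimate as a percentage of mean baseline."""
    return 100.0 * effect / np.mean(baseline_scores)
```

Calling `simulate_power(rho)` for several values of `rho` between 0 and 1 gives one estimated power per method at each correlation. `percent_of_baseline` illustrates the conversion suggested in the conclusions: the treatment effect is estimated with a method such as ANCOVA and only afterwards reported relative to the mean baseline score.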
