Small but important therapeutic effects of new treatments can be most efficiently detected through the study of large randomized prospective series of patients. Such large scale clinical trials are nowadays commonplace. The alternative is years of polemic and debate surrounding several trials each too small to detect plausible differences with any certainty. Such trials produce equivocal and contradictory results, which could be predicted from power calculations based upon sensible pre-trial estimates of treatment differences. Unfortunately such calculations often lead to sample sizes of several thousands.
It is not surprising that investigators tend to be over-optimistic in their estimation of treatment effects (which are necessarily uncertain) especially when the sample size requirements are so stark. In this paper a method is outlined for incorporating into the sample size calculations the uncertainty of the estimate made at the design stage of a clinical trial. In particular a formal scheme is described for deciding how many interim analyses should be performed to satisfy ethical and pragmatic requirements of large clinical trial design. Although the argument will be ‘Bayesian’, the criteria for assessment and comparison will be strictly of a Neyman-Pearson (i.e. significance testing) kind.
[1]
P. Canner.
Monitoring treatment differences in long-term clinical trials.
,
1977,
Biometrics.
[2]
F. J. Anscombe.
Sequential Medical Trials
,
1963
.
[3]
S. Pocock.
Group sequential methods in the design and analysis of clinical trials
,
1977
.
[4]
T. Colton.
A Model for Selecting One of Two Medical Treatments
,
1963
.
[5]
M A Schneiderman,et al.
The role of hypothesis testing in clinical trials. Biometrics seminar.
,
1966,
Journal of chronic diseases.
[6]
K McPherson,et al.
Statistics: the problem of examining accumulating data more than once.
,
1974,
The New England journal of medicine.
[7]
P. Armitage,et al.
Design and analysis of randomized clinical trials requiring prolonged observation of each patient. I. Introduction and design.
,
1976,
British Journal of Cancer.
[8]
P. Armitage,et al.
Repeated Significance Tests on Accumulating Data
,
1969
.
[9]
P. Armitage,et al.
Repeated Significance Tests on Accumulating Data When the Null Hypothesis is Not True
,
1971
.