The UK Medical Research Council (MRC) randomized trial of gastric surgery, ST01, compared conventional (D1) with radical (D2) surgery. Sample size estimation was based upon the consensus opinion of the surgical members of the design team, which suggested that a change in 5-year survival from 20% (D1) to 34% (D2) could be realistic and medically important. On the basis of these survival rates, the sample size for the trial was 400 patients. However, this trial was exceptional in the way that a survey of surgeons’ opinions was made at the start of the trial, in 1986, and again before results were analysed but after termination of the trial in 1994. At the initial survey, the three surgeons from the trial steering committee and 23 other surgeons experienced in treating gastric carcinoma were given detailed questionnaires. They were asked about the expected survival rate in the D1 group, anticipated difference in survival from D2 surgery, and what difference would be medically important and influence future treatment of patients. The consensus opinion of those surveyed was that there might be a survival improvement of 9.4%. In 1994, prior to closure of the trial, and before any survival information was disclosed, the survey was repeated with 21 of the original 26 surgeons. At this second survey, the opinion of the trial steering committee was that 9.5% difference was more realistic. This was in accord with the opinion of the larger group, which remained little changed since 1986. The baseline 5-year D1 survival was thought likely to be about 32%, which corresponded closely to the actual survival of recruited patients. Revised sample size calculations suggested that, on the basis of these more recent opinions, between 800 and 1200 patients would have been required. Both surveys assessed the level of treatment benefit that was deemed to be sufficient for causing surgeons to change their practice. This showed that the 13% difference in survival used as the study target was clinically relevant, but also indicated that many clinicians would remain unwilling to change their practice if the difference is only 9.5%. The experience of this carefully designed trial illustrates the problems of designing long-term, randomized trials. It raises interesting questions about the common practice of basing sample size estimates upon the beliefs of a trial design committee that may include a number of enthusiasts for the trial treatment. If their opinion of anticipated effect sizes drives the design of the trial, rather than the opinion of a larger community of experts that includes sceptics as well as enthusiasts, there is likely to be a serious miscalculation of sample size requirements. © 2000 Cancer Research Campaign
[1]
D J Spiegelhalter,et al.
The CHART trials: Bayesian design and monitoring in practice. CHART Steering Committee.
,
1994,
Statistics in medicine.
[2]
P M Fayers,et al.
Tutorial in biostatistics Bayesian data monitoring in clinical trials.
,
1997,
Statistics in medicine.
[3]
D. Spiegelhalter,et al.
A predictive approach to selecting the size of a clinical trial, based on subjective clinical opinion.
,
1986,
Statistics in medicine.
[4]
P. Fayers,et al.
Sample size: how many patients are necessary?
,
1995,
British Journal of Cancer.
[5]
P. Fayers,et al.
Postoperative morbidity and mortality after D1 and D2 resections for gastric cancer: preliminary results of the MRC randomised controlled surgical trial
,
1996,
The Lancet.
[6]
P. Fayers,et al.
Patient survival after D 1 and D 2 resections for gastric cancer: long-term results of the MRC randomized surgical trial
,
1999,
British Journal of Cancer.
[7]
P. Fayers,et al.
Postoperative morbidity and mortality after D1 and D2 resections for gastric cancer – results of the MRC randomised controlled trial
,
1996
.
[8]
David Machin,et al.
Sample Size Tables for Clinical Studies
,
1997
.