Sufficient trial size to inform clinical practice

Significance A core objective of trials comparing alternative medical treatments is to inform treatment choice in clinical practice, and yet conventional practice in designing trials has been to choose a sample size that yields specified statistical power. Power, a concept in the theory of hypothesis testing, is at most loosely connected to effective treatment choice. This paper develops an alternative principle for trial design that aims to directly benefit medical decision making. We propose choosing a sample size that enables implementation of near-optimal treatment rules. Near optimality means that treatment choices are suitably close to the best that could be achieved if clinicians were to know with certainty mean treatment response in their patient populations. Medical research has evolved conventions for choosing sample size in randomized clinical trials that rest on the theory of hypothesis testing. Bayesian statisticians have argued that trials should be designed to maximize subjective expected utility in settings of clinical interest. This perspective is compelling given a credible prior distribution on treatment response, but there is rarely consensus on what the subjective prior beliefs should be. We use Wald’s frequentist statistical decision theory to study design of trials under ambiguity. We show that ε-optimal rules exist when trials have large enough sample size. An ε-optimal rule has expected welfare within ε of the welfare of the best treatment in every state of nature. Equivalently, it has maximum regret no larger than ε. We consider trials that draw predetermined numbers of subjects at random within groups stratified by covariates and treatments. We report exact results for the special case of two treatments and binary outcomes. We give simple sufficient conditions on sample sizes that ensure existence of ε-optimal treatment rules when there are multiple treatments and outcomes are bounded. These conditions are obtained by application of Hoeffding large deviations inequalities to evaluate the performance of empirical success rules.

[1]  D J Reda,et al.  Single-Drug Therapy for Hypertension in Men -- A Comparison of Six Antihypertensive Agents with Placebo , 1993 .

[2]  D. Berry,et al.  Choosing sample size for a clinical trial using decision analysis , 2003 .

[3]  Richard J. Cook,et al.  Multiplicity Considerations in the Design and Analysis of Clinical Trials , 1996 .

[4]  A. Hrõbjartsson,et al.  Is the placebo powerless? An analysis of clinical trials comparing placebo with no treatment. , 2001, The New England journal of medicine.

[5]  N. Lazar,et al.  The ASA Statement on p-Values: Context, Process, and Purpose , 2016 .

[6]  P. Sedgwick Clinical significance versus statistical significance , 2014, BMJ : British Medical Journal.

[7]  Anthony C. Atkinson,et al.  Optimum Experimental Designs, with SAS , 2007 .

[8]  J. Ioannidis Why Most Published Research Findings Are False , 2005, PLoS medicine.

[9]  Abraham Wald,et al.  Statistical Decision Functions , 1951 .

[10]  C. Manski Statistical treatment rules for heterogeneous populations , 2003 .

[11]  Gábor Lugosi,et al.  Pattern Classification and Learning Theory , 2002 .

[12]  D. Berry Bayesian Statistics and the Efficiency and Ethics of Clinical Trials , 2004 .

[13]  K. Schlag ELEVEN - Tests needed for a Recommendation , 2006 .

[14]  Charles F. Manski,et al.  Minimax-regret treatment choice with missing outcome data , 2007 .

[15]  D. Ellsberg Decision, probability, and utility: Risk, ambiguity, and the Savage axioms , 1961 .

[16]  D. Spiegelhalter Incorporating Bayesian Ideas into Health-Care Evaluation , 2004 .

[17]  Gerald S. Rogers,et al.  Mathematical Statistics: A Decision Theoretic Approach , 1967 .

[18]  D. Altman Statistics and ethics in medical research: III How large a sample? , 1980, British medical journal.

[19]  A. Tetenov Statistical treatment choice based on asymmetric minimax regret criteria , 2009 .

[20]  David J. Spiegelhalter,et al.  Bayesian Approaches to Randomized Trials , 1994, Bayesian Biostatistics.

[21]  J. Karlawish,et al.  The continuing unethical conduct of underpowered clinical trials. , 2002, JAMA.

[22]  K Claxton,et al.  An economic approach to clinical trial design and research priority-setting. , 1996, Health economics.

[23]  Jörg Stoye,et al.  Minimax regret treatment choice with finite samples , 2009 .

[24]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[25]  Jörg Stoye Minimax regret treatment choice with covariates or with limited validity of experiments , 2012 .

[26]  C. Manski,et al.  Admissible treatment rules for a risk-averse planner with experimental data on an innovation , 2007 .

[27]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[28]  Stephen B. Vardeman,et al.  Statistics and Ethics , 2003 .

[29]  V. Bentkus On Hoeffding’s inequalities , 2004, math/0410159.

[30]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[31]  D Meltzer,et al.  Addressing uncertainty in medical cost-effectiveness analysis implications of expected utility maximization for methods to perform sensitivity analysis and the use of cost-effectiveness analysis to set priorities for medical research. , 2001, Journal of health economics.

[32]  Leonard J. Savage,et al.  The Theory of Statistical Decision , 1951 .

[33]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[34]  C. Dunnett A Multiple Comparison Procedure for Comparing Several Treatments with a Control , 1955 .

[35]  Ben Alamar Social Choice With Partial Knowledge of Treatment Response , 2007 .

[36]  K. Hirano,et al.  Asymptotics for Statistical Treatment Rules , 2009 .

[37]  Canada,et al.  STATISTICAL METHODS FOR , 2004 .

[38]  P. Lichtenberg,et al.  The ethics of the placebo in clinical practice , 2004, Journal of Medical Ethics.

[39]  N. Fisher,et al.  Probability Inequalities for Sums of Bounded Random Variables , 1994 .