Interpreting Randomized Controlled Trials

Simple Summary We provide an extensive review of the fundamental principles of statistical science that are needed to accurately interpret randomized controlled trials (RCTs). We use these principles to explain how RCTs are motivated by the powerful but strange idea that flipping a coin to choose each patient’s treatment is the most statistically reliable way to compare treatments. Random treatment assignment ensures fair comparisons between treatments because it does away with bias and confounding from variables other than treatment. If the goal is to estimate treatment effects in a patient population, rather than compare two or more treatments, then random sampling, not random treatment assignment, is required. However, random sampling is virtually impossible to carry out in a clinical trial because patients are accrued over time as they arrive in the clinic, subject to a trial’s entry criteria. Consequently, in practice, a trial provides a nonrepresentative convenience sample. Valid treatment comparisons provided by RCT data subsequently require additional causal assumptions of transportability of between-treatment effects from the sample to the patient population of interest. This may be used as a basis for choosing treatments for future patients. The present paper discusses what this means for practicing physicians who encounter RCT data in the literature. Abstract This article describes rationales and limitations for making inferences based on data from randomized controlled trials (RCTs). We argue that obtaining a representative random sample from a patient population is impossible for a clinical trial because patients are accrued sequentially over time and thus comprise a convenience sample, subject only to protocol entry criteria. Consequently, the trial’s sample is unlikely to represent a definable patient population. We use causal diagrams to illustrate the difference between random allocation of interventions within a clinical trial sample and true simple or stratified random sampling, as executed in surveys. We argue that group-specific statistics, such as a median survival time estimate for a treatment arm in an RCT, have limited meaning as estimates of larger patient population parameters. In contrast, random allocation between interventions facilitates comparative causal inferences about between-treatment effects, such as hazard ratios or differences between probabilities of response. Comparative inferences also require the assumption of transportability from a clinical trial’s convenience sample to a targeted patient population. We focus on the consequences and limitations of randomization procedures in order to clarify the distinctions between pairs of complementary concepts of fundamental importance to data science and RCT interpretation. These include internal and external validity, generalizability and transportability, uncertainty and variability, representativeness and inclusiveness, blocking and stratification, relevance and robustness, forward and reverse causal inference, intention to treat and per protocol analyses, and potential outcomes and counterfactuals.

[1]  Alexei C. Ionan,et al.  Clinical and Statistical Perspectives on the ICH E9(R1) Estimand Framework Implementation , 2023 .

[2]  M. Burotto,et al.  Cabozantinib plus Nivolumab and Ipilimumab in Renal-Cell Carcinoma. , 2023, The New England journal of medicine.

[3]  S. Ruberg,et al.  Application of Bayesian approaches in drug development: starting a virtuous cycle , 2023, Nature Reviews Drug Discovery.

[4]  Susan Mayo,et al.  What Can Be Achieved with the Estimand Framework? , 2023, Statistics in Biopharmaceutical Research.

[5]  Xinru Wang,et al.  The Sequential Multiple Assignment Randomized Trial for Controlling Infectious Diseases: A Review of Recent Developments. , 2023, American journal of public health.

[6]  S. Greenland Divergence versus decision P‐values: A distinction worth making in theory and keeping in practice: Or, how divergence P‐values measure evidence even when decision P‐values do not , 2022, Scandinavian Journal of Statistics.

[7]  L. Andersen,et al.  Adjustment for Baseline Characteristics in Randomized Clinical Trials. , 2022, JAMA.

[8]  N. Kuderer,et al.  Risk Model Development and Validation in Clinical Oncology: Lessons Learned , 2022, Cancer investigation.

[9]  P. Msaouel Less is More? First Impressions From COSMIC-313 , 2022, Cancer investigation.

[10]  M. Hernán,et al.  Effect of Colonoscopy Screening on Risks of Colorectal Cancer and Related Death. , 2022, The New England journal of medicine.

[11]  H. Fu,et al.  Statistical Issues in Drug Development, 3rd ed. , 2022, The American Statistician.

[12]  D. Rubin Interview with Don Rubin , 2022, Observational Studies.

[13]  A. Vickers,et al.  Methods Modernizing Statistical Reporting in Medical Journals: Challenges and Future Directions. , 2022, European urology.

[14]  M. Mansournia,et al.  P-value, compatibility, and S-value , 2022, Global epidemiology.

[15]  P. Thall,et al.  Utility‐based Bayesian personalized treatment selection for advanced breast cancer , 2022, Journal of the Royal Statistical Society. Series C, Applied statistics.

[16]  P. Stark Pay No Attention to the Model Behind the Curtain , 2022, Pure and Applied Geophysics.

[17]  P. Thall,et al.  Bayesian treatment screening and selection using subgroup‐specific utilities of response and toxicity , 2022, Biometrics.

[18]  P. Thall,et al.  A Causal Framework for Making Individualized Treatment Decisions in Oncology , 2022, Cancers.

[19]  R. Taylor,et al.  Informed decision‐making: Statistical methodology for surrogacy evaluation and its role in licensing and reimbursement assessments , 2022, Pharmaceutical statistics.

[20]  S. Greenland,et al.  To curb research misreporting, replace significance and confidence by compatibility: A Preventive Medicine golden jubilee article. , 2022, Preventive medicine.

[21]  P. Msaouel The Big Data Paradox in Clinical Practice , 2022, Cancer investigation.

[22]  Stephen R. Cole,et al.  Toward a clearer definition of selection bias when estimating causal effects. , 2022, Epidemiology.

[23]  A. Ocampo,et al.  Single‐world intervention graphs for defining, identifying, and communicating estimands in clinical trials , 2022, Statistics in medicine.

[24]  K. Ahrar,et al.  A phase 1-2 trial of sitravatinib and nivolumab in clear cell renal cell carcinoma following progression on antiangiogenic therapy , 2022, Science Translational Medicine.

[25]  Radu V. Craiu,et al.  Six Statistical Senses , 2022, Annual Review of Statistics and Its Application.

[26]  K. Ballman,et al.  Evolving Role of Adjuvant Systemic Therapy for Kidney and Urothelial Cancers. , 2022, American Society of Clinical Oncology educational book. American Society of Clinical Oncology. Annual Meeting.

[27]  B. Pijls The Table I Fallacy: P Values in Baseline Tables of Randomized Controlled Trials. , 2022, The Journal of bone and joint surgery. American volume.

[28]  S. Greenland,et al.  Causal Directed Acyclic Graphs. , 2022, JAMA.

[29]  E. Bareinboim,et al.  On Pearl’s Hierarchy and the Foundations of Causal Inference , 2022, Probabilistic and Causal Inference.

[30]  G. Agnelli,et al.  Medicine before and after David Cox. , 2022, European journal of internal medicine.

[31]  P. Sorger,et al.  Independent Drug Action in Combination Therapy: Implications for Precision Oncology , 2022, Cancer discovery.

[32]  Ying‐Qi Zhao Dynamic Treatment Regimes: Statistical Methods for Precision Medicine , 2022, Journal of the American Statistical Association.

[33]  P. Msaouel,et al.  Missing the trees for the forest: most subgroup analyses using forest plots at the ASCO annual meeting are inconclusive , 2022, Therapeutic advances in medical oncology.

[34]  S. Hancock,et al.  Kidney Cancer, Version 3.2022, NCCN Clinical Practice Guidelines in Oncology. , 2022, Journal of the National Comprehensive Cancer Network : JNCCN.

[35]  P. Msaouel,et al.  Fooled by Randomness. The Misleading Effect of Treatment Crossover in Randomized Trials of Therapies with Marginal Treatment Benefit , 2021, Cancer investigation.

[36]  M. Voss,et al.  In silico modeling of combination systemic therapy for advanced renal cell carcinoma , 2021, Journal for ImmunoTherapy of Cancer.

[37]  S. Greenland,et al.  Addressing exaggeration of effects from single RCTs , 2021, Significance.

[38]  C. Porta,et al.  First-line Nivolumab plus Ipilimumab Versus Sunitinib in Patients Without Nephrectomy and With an Evaluable Primary Renal Tumor in the CheckMate 214 Trial. , 2021, European urology.

[39]  P. Msaouel,et al.  Causal Considerations Can Inform the Interpretation of Surprising Associations in Medical Registries , 2021, Cancer investigation.

[40]  Bernie Taylor The Knowledge Machine: How Irrationality Created Modern Science , 2021, Journal of Geography.

[41]  P. Tamboli,et al.  Efficacy and safety of gemcitabine plus doxorubicin in patients with renal medullary carcinoma. , 2021, Clinical genitourinary cancer.

[42]  P. Msaouel Impervious to Randomness: Confounding and Selection Biases in Randomized Clinical Trials , 2021, Cancer investigation.

[43]  J. Burke,et al.  Adjuvant Pembrolizumab after Nephrectomy in Renal-Cell Carcinoma. , 2021, The New England journal of medicine.

[44]  M. Mansournia,et al.  Causal diagrams for immortal time bias. , 2021, International journal of epidemiology.

[45]  P. Thall,et al.  Precision Bayesian phase I‐II dose‐finding based on utilities tailored to prognostic subgroups , 2021, Statistics in medicine.

[46]  P. Thall,et al.  Making Patient-Specific Treatment Decisions Using Prognostic Variables and Utilities of Clinical Outcomes , 2021, Cancers.

[47]  T. Bathala,et al.  Efficacy and Safety of Bevacizumab Plus Erlotinib in Patients with Renal Medullary Carcinoma , 2021, Cancers.

[48]  P. Thall Adaptive Enrichment Designs in Clinical Trials. , 2021, Annual review of statistics and its application.

[49]  Sherri Rose,et al.  A Review of Generalizability and Transportability , 2021, Annual Review of Statistics and Its Application.

[50]  C. Porta,et al.  Lenvatinib plus Pembrolizumab or Everolimus for Advanced Renal Cell Carcinoma. , 2021, The New England journal of medicine.

[51]  P. Thall,et al.  Bayesian group sequential enrichment designs based on adaptive regression of response and survival time on baseline biomarkers , 2021, Biometrics.

[52]  F. Harrell,et al.  Using Bayesian Methods to Augment the Interpretation of Critical Care Trials. An Overview of Theory and Example Reanalysis of the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial , 2020, American journal of respiratory and critical care medicine.

[53]  S. Greenland Analysis goals, error-cost sensitivity, and analysis hacking: Essential considerations in hypothesis testing and multiple comparisons. , 2020, Paediatric and perinatal epidemiology.

[54]  Simon Schwab,et al.  The statistical properties of RCTs and a proposal for shrinkage , 2020, Statistics in medicine.

[55]  S. Zohar,et al.  Bayesian dose regimen assessment in early phase oncology incorporating pharmacokinetics and pharmacodynamics , 2020, Biometrics.

[56]  S. Greenland The Causal Foundations of Applied Probability and Statistics , 2020, Probabilistic and Causal Inference.

[57]  Eric A. Cator,et al.  The significance filter, the winner's curse and the need to shrink , 2020, Statistica Neerlandica.

[58]  L. Lyons,et al.  Reproducibility and Replication of Experimental Particle Physics Results , 2020, Issue 2.4, Fall 2020.

[59]  Edward H Kennedy,et al.  Defining and Identifying Per-protocol Effects in Randomized Trials. , 2020, Epidemiology.

[60]  D. D. Shapiro,et al.  Causal Diagram Techniques for Urologic Oncology Research. , 2020, Clinical genitourinary cancer.

[61]  Aki Vehtari,et al.  Regression and Other Stories , 2020 .

[62]  Sander Greenland,et al.  Surprise! , 2020, American journal of epidemiology.

[63]  Byron C. Wallace,et al.  Trialstreamer: A living, automatically updated database of clinical trial reports , 2020, medRxiv.

[64]  N. Waugh,et al.  Treatment effects may remain the same even when trial participants differed from the target population. , 2020, Journal of clinical epidemiology.

[65]  Yi Wang,et al.  Remdesivir in adults with severe COVID-19: a randomised, double-blind, placebo-controlled, multicentre trial , 2020, The Lancet.

[66]  Sven Ove Hansson,et al.  Introduction to Formal Philosophy , 2020 .

[67]  I. Wistuba,et al.  Comprehensive Molecular Characterization Identifies Distinct Genomic and Immune Hallmarks of Renal Medullary Carcinoma. , 2020, Cancer cell.

[68]  Robin L. Jones,et al.  Effect of Doxorubicin Plus Olaratumab vs Doxorubicin Plus Placebo on Survival in Patients With Advanced Soft Tissue Sarcomas: The ANNOUNCE Randomized Clinical Trial. , 2020, JAMA.

[69]  Juhee Lee,et al.  A phase I‐II design based on periodic and continuous monitoring of disease status and the times to toxicity and death , 2020, Statistics in medicine.

[70]  J. Donovan,et al.  Adjuvant chemotherapy in upper tract urothelial carcinoma (the POUT trial): a phase 3, open-label, randomised controlled trial , 2020, The Lancet.

[71]  E. Suzuki,et al.  Causal Diagrams: Pitfalls and Tips , 2020, Journal of epidemiology.

[72]  U. Manne,et al.  Inclusiveness and ethical considerations for observational, translational, and clinical cancer health disparity research , 2019, Cancer.

[73]  Anastasios A. Tsiatis,et al.  Dynamic Treatment Regimes , 2019 .

[74]  Sally Morton,et al.  The Predictive Approaches to Treatment effect Heterogeneity (PATH) Statement , 2019, Annals of Internal Medicine.

[75]  T. Gooley,et al.  Cubic splines to model relationships between continuous variables and outcomes: a guide for clinicians , 2019, Bone Marrow Transplantation.

[76]  G. Velikova,et al.  Cancer as a chronic illness: support needs and experiences , 2019, BMJ Supportive & Palliative Care.

[77]  S. Greenland,et al.  Semantic and cognitive tools to aid statistical science: replace confidence and significance by compatibility and surprise , 2019, BMC Medical Research Methodology.

[78]  D. Hayes HER2 and Breast Cancer - A Phenomenal Success Story. , 2019, The New England journal of medicine.

[79]  S. Greenland,et al.  Multiple comparisons controversies are about context and costs, not frequentism versus Bayesianism , 2019, European Journal of Epidemiology.

[80]  R. D'Agostino,et al.  New Guidelines for Statistical Reporting in the Journal. , 2019, The New England journal of medicine.

[81]  Issa J. Dahabreh,et al.  Extending inferences from a randomized trial to a target population , 2019, European Journal of Epidemiology.

[82]  Jin Tian,et al.  Adjustment Criteria for Generalizing Experimental Findings , 2019, ICML.

[83]  P. Thall,et al.  Conditioning with busulfan plus melphalan versus melphalan alone before autologous haemopoietic cell transplantation for multiple myeloma: an open-label, randomised, phase 3 trial. , 2019, The Lancet. Haematology.

[84]  Sander Greenland,et al.  Valid P-Values Behave Exactly as They Should: Some Misleading Criticisms of P-Values and Their Resolution With S-Values , 2019, The American Statistician.

[85]  Sander Greenland,et al.  Scientists rise up against statistical significance , 2019, Nature.

[86]  J. Pearl,et al.  Note on ‘‘Generalizability of Study Results‘‘ , 2019, Epidemiology.

[87]  W. Linehan,et al.  Updated Recommendations on the Diagnosis, Management, and Clinical Trial Eligibility Criteria for Patients With Renal Medullary Carcinoma. , 2019, Clinical genitourinary cancer.

[88]  D. DeMets,et al.  Challenges of Non–Intention-to-Treat Analyses , 2019, JAMA.

[89]  Ewout Steyerberg,et al.  Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects , 2018, British Medical Journal.

[90]  Arthur S Slutsky,et al.  Extracorporeal Membrane Oxygenation for Severe Acute Respiratory Distress Syndrome and Posterior Probability of Mortality Benefit in a Post Hoc Bayesian Analysis of a Randomized Clinical Trial , 2018, JAMA.

[91]  Sander Greenland,et al.  Inferential Statistics as Descriptive Statistics: There Is No Replication Crisis if We Don’t Expect Replication , 2018, The American Statistician.

[92]  P. Thall,et al.  A Bayesian Machine Learning Approach for Optimizing Dynamic Treatment Regimes , 2018, Journal of the American Statistical Association.

[93]  J. Drazen,et al.  Learning from a Trial Stopped by a Data and Safety Monitoring Board. , 2018, The New England journal of medicine.

[94]  D. Brodie,et al.  Extracorporeal Membrane Oxygenation for Severe Acute Respiratory Distress Syndrome , 2018, The New England journal of medicine.

[95]  Sarah E. Robertson,et al.  Extending inferences from a randomized trial to a new target population , 2018, Statistics in medicine.

[96]  Michael G Hudgens,et al.  A Practical Example Demonstrating the Utility of Single-world Intervention Graphs. , 2018, Epidemiology.

[97]  Bohuslav Melichar,et al.  Nivolumab plus Ipilimumab versus Sunitinib in Advanced Renal‐Cell Carcinoma , 2018, The New England journal of medicine.

[98]  I. Keilegom,et al.  Cure Models in Survival Analysis , 2018 .

[99]  S. George,et al.  On Enrichment Strategies for Biomarker Stratified Clinical Trials , 2018, Journal of biopharmaceutical statistics.

[100]  J. Cuzick Prognosis vs Treatment Interaction , 2018, JNCI cancer spectrum.

[101]  Peter K. Sorger,et al.  Combination Cancer Therapy Can Confer Benefit via Patient-to-Patient Variability without Drug Additivity or Synergy , 2017, Cell.

[102]  G. Isbary,et al.  Effect of Crossover in Oncology Clinical Trials on Evidence Levels in Early Benefit Assessment in Germany. , 2017, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[103]  Peter F Thall,et al.  Bayesian nonparametric statistics: A new toolkit for discovery in cancer research , 2017, Pharmaceutical statistics.

[104]  Ralph B. D'Agostino,et al.  Challenges in the Design and Interpretation of Noninferiority Trials , 2017, The New England journal of medicine.

[105]  W. Dupont,et al.  Second-generation p-values: Improved rigor, reproducibility, & transparency in statistical analyses , 2017, PloS one.

[106]  Andrew Gelman,et al.  The Prior Can Often Only Be Understood in the Context of the Likelihood , 2017, Entropy.

[107]  David Gal,et al.  Statistical Significance and the Dichotomization of Evidence , 2017 .

[108]  W. Kaelin,et al.  Common pitfalls in preclinical cancer target validation , 2017, Nature Reviews Cancer.

[109]  Eric B. Laber,et al.  Dynamic treatment regimes, past, present, and future: A conversation with experts , 2017, Statistical methods in medical research.

[110]  W. Jacobs,et al.  Origins of Combination Therapy for Tuberculosis: Lessons for Future Antimicrobial Development and Application , 2017, mBio.

[111]  S. Greenland For and Against Methodologies: Some Perspectives on Recent Causal and Statistical Inference Debates , 2017, European Journal of Epidemiology.

[112]  M. Ellis,et al.  Phase III Trial Evaluating Letrozole As First-Line Endocrine Therapy With or Without Bevacizumab for the Treatment of Postmenopausal Women With Hormone Receptor-Positive Advanced-Stage Breast Cancer: CALGB 40503 (Alliance). , 2016, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[113]  Robin L. Jones,et al.  Olaratumab and doxorubicin versus doxorubicin alone for treatment of soft-tissue sarcoma: an open-label phase 1b and randomised phase 2 trial , 2016, The Lancet.

[114]  B. Efron,et al.  Computer Age Statistical Inference: Algorithms, Evidence, and Data Science , 2016 .

[115]  J. Pearl,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[116]  Ying Yuan,et al.  Bayesian Designs for Phase I-II Clinical Trials , 2016 .

[117]  T. Whelan,et al.  Extending Aromatase-Inhibitor Adjuvant Therapy to 10 Years. , 2016, The New England journal of medicine.

[118]  S. Goodman,et al.  Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations , 2016, European Journal of Epidemiology.

[119]  John P A Ioannidis,et al.  Noninferiority is almost certain with lenient noninferiority margins. , 2016, Journal of clinical epidemiology.

[120]  J. Vandenbroucke,et al.  Noninferiority is (too) common in noninferiority trials. , 2016, Journal of clinical epidemiology.

[121]  P. Thall,et al.  Optimization of multi‐stage dynamic treatment regimes utilizing accumulated data , 2015, Statistics in medicine.

[122]  P. Kantoff,et al.  Cabozantinib versus Everolimus in Advanced Renal-Cell Carcinoma. , 2015, The New England journal of medicine.

[123]  Xiao-Li Meng,et al.  There is Individualized Treatment. Why Not Individualized Inference , 2015, 1510.08539.

[124]  Jeffrey N. Rouder,et al.  The fallacy of placing confidence in confidence intervals , 2015, Psychonomic bulletin & review.

[125]  S. Senn Mastering variation: variance components and personalised medicine , 2015, Statistics in medicine.

[126]  S. Greenland,et al.  Limitations of individual causal models, causal graphs, and ignorability assumptions, as illustrated by random confounding and design unfaithfulness , 2015, European Journal of Epidemiology.

[127]  A. Messiah,et al.  Random sample community-based health surveys: does the effort to reach participants matter? , 2014, BMJ Open.

[128]  Christopher Winship,et al.  Endogenous Selection Bias: The Problem of Conditioning on a Collider Variable. , 2014, Annual review of sociology.

[129]  Abdus S Wahed,et al.  Bayesian Nonparametric Estimation for Dynamic Treatment Regimes With Sequential Transition Times , 2014, Journal of the American Statistical Association.

[130]  Susan A. Murphy,et al.  Introduction to SMART designs for the development of adaptive interventions: with application to weight loss research , 2014, Translational behavioral medicine.

[131]  F. Saad,et al.  Randomized controlled trial of early zoledronic acid in men with castration-sensitive prostate cancer and bone metastases: results of CALGB 90202 (alliance). , 2014, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[132]  S. Murphy,et al.  Dynamic Treatment Regimes. , 2014, Annual review of statistics and its application.

[133]  G. Imbens,et al.  Why Ask Why? Forward Causal Inference and Reverse Causal Questions , 2013 .

[134]  A. Giobbie-Hurder,et al.  Challenges of guarantee-time bias. , 2013, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[135]  C. Logothetis,et al.  Molecular classification of prostate cancer progression: foundation for marker-driven treatment of prostate cancer. , 2013, Cancer discovery.

[136]  Shah Ebrahim,et al.  Commentary: Should we always deliberately be non-representative? , 2013, International journal of epidemiology.

[137]  Kenneth J Rothman,et al.  Rebuttal: When it comes to scientific inference, sometimes a cigar is just a cigar. , 2013, International journal of epidemiology.

[138]  Neil Pearce,et al.  Commentary: Representativeness is usually not necessary and often should be avoided. , 2013, International journal of epidemiology.

[139]  J. Gallacher,et al.  Why representativeness should be avoided. , 2013, International journal of epidemiology.

[140]  Erica E. M. Moodie,et al.  Statistical Methods for Dynamic Treatment Regimes: Reinforcement Learning, Causal Inference, and Personalized Medicine , 2013 .

[141]  K. Singh,et al.  Confidence Distribution, the Frequentist Distribution Estimator of a Parameter: A Review , 2013 .

[142]  Christian Melander,et al.  Combination approaches to combat multidrug-resistant bacteria. , 2013, Trends in biotechnology.

[143]  T. Choueiri,et al.  External validation and comparison with other models of the International Metastatic Renal-Cell Carcinoma Database Consortium prognostic model: a population-based study. , 2013, The Lancet. Oncology.

[144]  Abdus S Wahed,et al.  Evaluating joint effects of induction–salvage treatment regimes on overall survival in acute leukaemia , 2013, Journal of the Royal Statistical Society. Series C, Applied statistics.

[145]  S. Senn Tea for three: Of infusions and inferences and milk in first , 2012 .

[146]  D. Stewart Before we throw out progression-free survival as a valid end point... , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[147]  T. VanderWeele Confounding and Effect Modification: Distribution and Measure , 2012, Epidemiologic methods.

[148]  Elias Bareinboim,et al.  Transportability of Causal Effects: Completeness Results , 2012, AAAI.

[149]  Pranita D. Tamma,et al.  Combination Therapy for Treatment of Infections with Gram-Negative Bacteria , 2012, Clinical Microbiology Reviews.

[150]  Daniel Almirall,et al.  SMART Design Issues and the Consideration of Opposing Outcomes: Discussion of "Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer" by by Wang, Rotnitzky, Lin, Millikan, and Thall. , 2012, Journal of the American Statistical Association.

[151]  Peter F Thall,et al.  Evaluation of Viable Dynamic Treatment Regimes in a Sequentially Randomized Trial of Advanced Prostate Cancer , 2012, Journal of the American Statistical Association.

[152]  Geri L. Schmotzer Barriers and facilitators to participation of minorities in clinical trials. , 2012, Ethnicity & disease.

[153]  E. Eisenhauer,et al.  Progression-free survival: meaningful or simply measurable? , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[154]  Guosheng Yin,et al.  Fundamentals of Clinical Trials , 2012 .

[155]  S. Senn Seven myths of randomisation in clinical trials , 2011, Statistics in medicine.

[156]  Jon Williamson,et al.  Foundations of Bayesianism , 2011 .

[157]  Michael Proschan,et al.  Minimize the Use of Minimization with Unequal Allocation , 2011, Biometrics.

[158]  Douglas G Altman,et al.  How to obtain the confidence interval from a P value , 2011, BMJ : British Medical Journal.

[159]  Boris Freidlin,et al.  Overall survival as the outcome for randomized clinical trials with effective subsequent therapies. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[160]  G. Pond,et al.  Statistical issues in the use of dynamic allocation methods for balancing baseline covariates , 2011, British Journal of Cancer.

[161]  Elias Bareinboim,et al.  Controlling Selection Bias in Causal Inference , 2011, AISTATS.

[162]  J. Hutton Misleading Statistics , 2010, Pharmaceutical Medicine.

[163]  J. Robins,et al.  Identifiability, exchangeability and confounding revisited , 2009, Epidemiologic perspectives & innovations : EP+I.

[164]  J. Hutton Number needed to treat and number needed to harm are not the best way to report and assess the results of randomised clinical trials , 2009, British journal of haematology.

[165]  C. Tangen,et al.  Prostate-specific antigen progression predicts overall survival in patients with metastatic prostate cancer: data from Southwest Oncology Group Trials 9346 (Intergroup Study 0162) and 9916. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[166]  Toshiro Tango,et al.  Permutation Test Following Covariate-Adaptive Randomization in Randomized Controlled Trials , 2009, Journal of biopharmaceutical statistics.

[167]  S. Senn Lessons from TGN1412 and TARGET: implications for observational studies and meta‐analysis , 2008, Pharmaceutical statistics.

[168]  J. Ioannidis Why Most Discovered True Associations Are Inflated , 2008, Epidemiology.

[169]  N. S. Hall R. A. Fisher and his advocacy of randomization , 2007, Journal of the history of biology.

[170]  Ting-Chao Chou,et al.  Theoretical Basis, Experimental Design, and Computerized Simulation of Synergism and Antagonism in Drug Combination Studies , 2006, Pharmacological Reviews.

[171]  Sander Greenland,et al.  Bayesian perspectives for epidemiological research: I. Foundations and basic methods. , 2006, International journal of epidemiology.

[172]  A. Gelman The Boxer, the Wrestler, and the Coin Flip , 2006 .

[173]  S. Murphy,et al.  An experimental design for the development of adaptive treatment strategies , 2005, Statistics in medicine.

[174]  Jack Cuzick,et al.  Forest plots and the interpretation of subgroups , 2005, The Lancet.

[175]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[176]  Stephen Senn,et al.  Controversies concerning randomization and additivity in clinical trials , 2004, Statistics in medicine.

[177]  J. Robins,et al.  A Structural Approach to Selection Bias , 2004, Epidemiology.

[178]  P. Thall,et al.  Once-daily intravenous busulfan and fludarabine: clinical and pharmacokinetic results of a myeloablative, reduced-toxicity conditioning regimen for allogeneic stem cell transplantation in AML and MDS. , 2004, Blood.

[179]  M. Hernán A definition of causal effect for epidemiological research , 2004, Journal of Epidemiology and Community Health.

[180]  Sara T Brookes,et al.  Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test. , 2004, Journal of clinical epidemiology.

[181]  H. Marks Rigorous uncertainty: why RA Fisher is important. , 2003, International journal of epidemiology.

[182]  Peter Armitage,et al.  Fisher, Bradford Hill, and randomization. , 2003, International journal of epidemiology.

[183]  T. Peters,et al.  Design, analysis and presentation of factorial randomised controlled trials , 2003, BMC medical research methodology.

[184]  Karl Swedberg,et al.  Valsartan, captopril, or both in myocardial infarction complicated by heart failure, left ventricular dysfunction, or both. , 2003, The New England journal of medicine.

[185]  E. Walker Regression Modeling Strategies , 2003, Technometrics.

[186]  Stephen Senn,et al.  A Conversation with John Nelder , 2003 .

[187]  Douglas G Altman,et al.  Survival plots of time-to-event outcomes in clinical trials: good practice and pitfalls , 2002, The Lancet.

[188]  D. Heisey,et al.  The Abuse of Power , 2001 .

[189]  R. Royall On the Probability of Observing Misleading Statistical Evidence , 2000 .

[190]  H. Sung,et al.  Evaluating multiple treatment courses in clinical trials. , 2000, Statistics in medicine.

[191]  S Greenland,et al.  Principles of multilevel modelling. , 2000, International journal of epidemiology.

[192]  E. Ziegel,et al.  Statistical Issues in Drug Development , 1999 .

[193]  J. Pearl,et al.  Confounding and Collapsibility in Causal Inference , 1999 .

[194]  S Greenland,et al.  Induction versus Popper: substance versus semantics. , 1998, International journal of epidemiology.

[195]  S Greenland,et al.  Probability Logic and Probabilistic Induction , 1998, Epidemiology.

[196]  S. Senn Testing for baseline balance in clinical trials. , 1994, Statistics in medicine.

[197]  J. Ludbrook,et al.  Issues in biomedical statistics: statistical inference. , 1994, The Australian and New Zealand journal of surgery.

[198]  David J. Spiegelhalter,et al.  Bayesian Approaches to Randomized Trials , 1994, Bayesian Biostatistics.

[199]  S J Senn,et al.  Falsificationism and clinical trials. , 1991, Statistics in medicine.

[200]  Sander Greenland,et al.  On the Logical Justification of Conditional Tests for Two-By-Two Contingency Tables , 1991 .

[201]  D. A. Preece,et al.  R. A. Fisher and Experimental Design: A Review , 1990 .

[202]  S Greenland,et al.  Randomization, Statistics, and Causal Inference , 1990, Epidemiology.

[203]  S J Senn,et al.  The graphical representation of clinical trials with particular reference to measurements over time. , 1990, Statistics in medicine.

[204]  W Godolphin,et al.  Studies of the HER-2/neu proto-oncogene in human breast and ovarian cancer. , 1989, Science.

[205]  D L Sackett,et al.  An assessment of clinically useful measures of the consequences of treatment. , 1988, The New England journal of medicine.

[206]  Donald J. Schuirmann A comparison of the Two One-Sided Tests Procedure and the Power Approach for assessing the equivalence of average bioavailability , 1987, Journal of Pharmacokinetics and Biopharmaceutics.

[207]  W. McGuire,et al.  Human breast cancer: correlation of relapse and survival with amplification of the HER-2/neu oncogene. , 1987, Science.

[208]  Vernon T. Farewell,et al.  Mixture models in survival analysis: Are they worth the risk? , 1986 .

[209]  P. Holland Statistics and Causal Inference , 1985 .

[210]  Frederick Mosteller,et al.  Representative Sampling, IV: The History of the Concept in Statistics, 1895-1939 , 1980 .

[211]  Frederick Mosteller,et al.  Representative Sampling, III: The Current Statistical Literature , 1979 .

[212]  J. Cornfield Recent methodological contributions to clinical trials. , 1976, American journal of epidemiology.

[213]  S. Pocock,et al.  Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial. , 1975, Biometrics.

[214]  D R Taves,et al.  Minimization: A new method of assigning patients to treatment and control groups , 1974, Clinical pharmacology and therapeutics.

[215]  Emil Frei,et al.  Studies of Sequential and Combination Antimetabolite Therapy in Acute Leukemia: 6-Mercaptopurine and Methotrexate , 1961 .

[216]  E. Freireich,et al.  A comparative study of two regimens of combination chemotherapy in acute leukemia. , 1958, Blood.

[217]  D. Campbell Factors relevant to the validity of experiments in social settings. , 1957, Psychological bulletin.

[218]  R. Carnap Testability and Meaning , 1936, Philosophy of Science.

[219]  R Fisher,et al.  Design of Experiments , 1936 .

[220]  B. Russell I.—On the Notion of Cause , 1913 .

[221]  Xiao-Li Meng,et al.  Double Your Variance, Dirtify Your Bayes, Devour Your Pufferfish, and Draw your Kidstrogram , 2022, The New England Journal of Statistics in Data Science.

[222]  P. Thall Statistical Remedies for Medical Researchers , 2020, Springer Series in Pharmaceutical Statistics.

[223]  Christoph J. Kemper External Validity , 2020, Encyclopedia of Personality and Individual Differences.

[224]  S. Greenland An introduction to instrumental variables for epidemiologists. , 2018, International journal of epidemiology.

[225]  Julian P T Higgins,et al.  Biases in Randomized Trials: A Conversation Between Trialists and Epidemiologists. , 2017, Epidemiology.

[226]  V. Sheppard,et al.  Enrolling Minority and Underserved Populations in Cancer Clinical Research. , 2016, American journal of preventive medicine.

[227]  F. E. Harrell Regression modeling strategies : with applications to linear models, logistic and ordinal regression, and survival analysis , 2015 .

[228]  R. Groenwold,et al.  [Effect modification and interaction]. , 2015, Nederlands tijdschrift voor geneeskunde.

[229]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[230]  B. Chakraborty,et al.  Statistical Methods for Dynamic Treatment Regimes: Reinforcement Learning, Causal Inference, and Personalized Medicine , 2013 .

[231]  David B. Dunson,et al.  Bayesian data analysis, third edition , 2013 .

[232]  James M. Robins,et al.  Single World Intervention Graphs : A Primer , 2013 .

[233]  Natalie Shlomo,et al.  Estimation of an indicator of the representativeness of survey response , 2012 .

[234]  S. Senn You May Believe You Are a Bayesian But You Are Probably Wrong , 2011 .

[235]  P. Richardson,et al.  Clinically relevant end points and new drug approvals for myeloma , 2008, Leukemia.

[236]  S. Greenland Bayesian perspectives for epidemiological research , 2006 .

[237]  S. Hartmann Bayesian Epistemology , 2005 .

[238]  Bradley Efron,et al.  Modern science and the Bayesian-frequentist controversy , 2005 .

[239]  Karl J. Friston,et al.  Variance Components , 2003 .

[240]  A Lanigan,et al.  HER2 as a prognostic and predictive marker for breast cancer. , 2001, Annals of oncology : official journal of the European Society for Medical Oncology.

[241]  Douglas D. Richman,et al.  HIV chemotherapy , 2001, Nature.

[242]  J. Pearl Bayesianism and Causality, or, Why I am Only a Half-Bayesian , 2001 .

[243]  J. Hutton,et al.  Number needed to treat: properties and problems , 2000 .

[244]  J. Pearl,et al.  Causal diagrams for epidemiologic research. , 1999, Epidemiology.

[245]  K. Popper,et al.  Conjectures and refutations;: The growth of scientific knowledge , 1972 .

[246]  K. Popper,et al.  Conjectures and Refutations , 1963 .