The Fragility Index in Multicenter Randomized Controlled Critical Care Trials*

Objectives: Recent literature has drawn attention to the potential inadequacy of frequentist analysis and threshold p values as tools for reporting outcomes in clinical trials. The fragility index, which is a measure of how many events the statistical significance of a result depends on, has been suggested as a means to aid the interpretation of trial results. This study aimed to calculate the fragility index of clinical trials in critical care medicine reporting a statistically significant effect on mortality (increasing or decreasing mortality). Data Sources: Literature search (PubMed/MEDLINE) to identify all multicenter randomized controlled trials in critical care medicine. Study Selection: We identified 862 trials; of which 56 fulfilled eligibility criteria and were included in our analysis. Data Extraction: Calculation of fragility index for trials reporting a statistically significant effect on mortality, and analysis of the relationship between trial characteristics and fragility index. Data Synthesis: The median fragility index was 2 (interquartile range, 1–3.5), and greater than 40% of trials had a fragility index of less than or equal to 1. 12.5% of trials reported loss to follow-up greater than their fragility index. Trial sample size was positively correlated, and reported p value was negatively correlated, with fragility index. Conclusions: In critical care trials reporting statistically significant effects on mortality, the findings often depend on a small number of events. Critical care clinicians should be wary of basing decisions on trials with a low fragility index. We advocate the reporting of fragility index for future trials in critical care to aid interpretation and decision making by clinicians.

[1]  G. Guyatt,et al.  The fragility of statistically significant findings from randomized trials in spine surgery: a systematic survey. , 2015, The spine journal : official journal of the North American Spine Society.

[2]  Francesco Santini,et al.  Mortality in Multicenter Critical Care Trials: An Analysis of Interventions With a Significant Effect* , 2015, Critical care medicine.

[3]  R. Bellomo,et al.  A protocol for the 0.9% saline versus Plasma-Lyte 148 for intensive care fluid therapy (SPLIT) study. , 2014, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[4]  T. Richards,et al.  The IRONMAN trial: a protocol for a multicentre randomised placebo-controlled trial of intravenous iron in intensive care unit patients with anaemia. , 2014, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[5]  Mark Twain,et al.  Seduced by a P-Value… , 2014, Anaesthesia and intensive care.

[6]  W. Weightman,et al.  An Audit of the Statistical Validity of Conclusions of Clinical Superiority in Anaesthesia Journals , 2014, Anaesthesia and intensive care.

[7]  Dylan S. Small,et al.  Outcomes and statistical power in adult critical care randomized trials. , 2014, American journal of respiratory and critical care medicine.

[8]  Andrew Burke,et al.  The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index. , 2014, Journal of clinical epidemiology.

[9]  Hassane Njimi,et al.  Assessment of the worldwide burden of critical illness: the intensive care over nations (ICON) audit. , 2014, The Lancet. Respiratory medicine.

[10]  David T. Huang,et al.  Protocolized Care for Early Septic Shock (ProCESS) statistical analysis plan. , 2013, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[11]  Kathryn M Rowan,et al.  The Protocolised Management in Sepsis (ProMISe) trial statistical analysis plan. , 2013, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[12]  R. Bellomo,et al.  Statistical analysis plan for the HEAT trial: a multicentre randomised placebo-controlled trial of intravenous paracetamol in intensive care unit patients with fever and infection. , 2013, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[13]  R. Bellomo,et al.  The ADRENAL study protocol: adjunctive corticosteroid treatment in critically ill patients with septic shock. , 2013, Critical care and resuscitation : journal of the Australasian Academy of Critical Care Medicine.

[14]  Bruce M. Psaty,et al.  Minimizing bias in randomized trials: the importance of blinding. , 2010, JAMA.

[15]  S. Aberegg,et al.  Delta inflation: a bias in the design of randomized controlled trials in critical care medicine , 2010, Critical care.

[16]  N. Halpern,et al.  Critical care medicine in the United States 2000–2005: An analysis of bed numbers, occupancy rates, payer mix, and costs* , 2010, Critical care medicine.

[17]  Nick Freemantle,et al.  Interpreting the results of secondary end points and subgroup analyses in clinical trials: should we lock the crazy aunt in the attic? , 2001, BMJ : British Medical Journal.

[18]  R. Bender,et al.  Calculating confidence intervals for the number needed to treat. , 2001, Controlled clinical trials.

[19]  Jonathan A C Sterne,et al.  Sifting the evidence—what's wrong with significance tests? , 2001, BMJ : British Medical Journal.

[20]  S. Goodman Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy , 1999, Annals of Internal Medicine.

[21]  W. W. Rozeboom The fallacy of the null-hypothesis significance test. , 1960, Psychological bulletin.

[22]  K J Rothman,et al.  Significance questing. , 1986, Annals of internal medicine.