Acute Respiratory Distress Syndrome Measurement Error. Potential Effect on Clinical Study Results.

RATIONALE Identifying patients with acute respiratory distress syndrome (ARDS) is a recognized challenge. Experts often have only moderate agreement when applying the clinical definition of ARDS to patients. However, no study has fully examined the implications of low reliability measurement of ARDS on clinical studies. OBJECTIVES To investigate how the degree of variability in ARDS measurement commonly reported in clinical studies affects study power, the accuracy of treatment effect estimates, and the measured strength of risk factor associations. METHODS We examined the effect of ARDS measurement error in randomized clinical trials (RCTs) of ARDS-specific treatments and cohort studies using simulations. We varied the reliability of ARDS diagnosis, quantified as the interobserver reliability (κ-statistic) between two reviewers. In RCT simulations, patients identified as having ARDS were enrolled, and when measurement error was present, patients without ARDS could be enrolled. In cohort studies, risk factors as potential predictors were analyzed using reviewer-identified ARDS as the outcome variable. MEASUREMENTS AND MAIN RESULTS Lower reliability measurement of ARDS during patient enrollment in RCTs seriously degraded study power. Holding effect size constant, the sample size necessary to attain adequate statistical power increased by more than 50% as reliability declined, although the result was sensitive to ARDS prevalence. In a 1,400-patient clinical trial, the sample size necessary to maintain similar statistical power increased to over 1,900 when reliability declined from perfect to substantial (κ = 0.72). Lower reliability measurement diminished the apparent effectiveness of an ARDS-specific treatment from a 15.2% (95% confidence interval, 9.4-20.9%) absolute risk reduction in mortality to 10.9% (95% confidence interval, 4.7-16.2%) when reliability declined to moderate (κ = 0.51). In cohort studies, the effect on risk factor associations was similar. CONCLUSIONS ARDS measurement error can seriously degrade statistical power and effect size estimates of clinical studies. The reliability of ARDS measurement warrants careful attention in future ARDS clinical studies.

[1]  M. Cohen,et al.  Misclassification of acute respiratory distress syndrome after traumatic injury: The cost of less rigorous approaches , 2015, The journal of trauma and acute care surgery.

[2]  G. Rubenfeld,et al.  Interobserver variability in applying a radiographic definition for ARDS. , 1999, Chest.

[3]  I. Holme,et al.  Differences between echocardiographic measurements of left ventricular dimensions and function by local investigators and a core laboratory in a 2-year follow-up study of patients with an acute myocardial infarction. , 2002, European journal of echocardiography : the journal of the Working Group on Echocardiography of the European Society of Cardiology.

[4]  M. Lamy,et al.  The American-European Consensus Conference on ARDS. Definitions, mechanisms, relevant outcomes, and clinical trial coordination. , 1994, American journal of respiratory and critical care medicine.

[5]  D. Schoenfeld,et al.  Ventilation with lower tidal volumes as compared with traditional tidal volumes for acute lung injury and the acute respiratory distress syndrome. , 2000, The New England journal of medicine.

[6]  Ó. Peñuelas,et al.  Comparison of the Berlin definition for acute respiratory distress syndrome with autopsy. , 2013, American journal of respiratory and critical care medicine.

[7]  Ognjen Gajic,et al.  Ventilator settings as a risk factor for acute respiratory distress syndrome in mechanically ventilated patients , 2005, Intensive Care Medicine.

[8]  James F Burke,et al.  Implications of Heterogeneity of Treatment Effect for Reporting and Analysis of Randomized Trials in Critical Care. , 2015, American journal of respiratory and critical care medicine.

[9]  M. Kollef,et al.  Mechanical Ventilation and ARDS in the ED: A Multicenter, Observational, Prospective, Cross-sectional Study. , 2015, Chest.

[10]  D. Schuster Identifying patients with ARDS: time for a different approach , 1997, Intensive Care Medicine.

[11]  M. Shoukri,et al.  Measures of Interobserver Agreement , 2003 .

[12]  S. Bellamy,et al.  The Epidemiology of Acute Respiratory Distress Syndrome in Patients Presenting to the Emergency Department With Severe Sepsis , 2013, Shock.

[13]  D. Wagner,et al.  Veterans Affairs intensive care unit risk adjustment model: Validation, updating, recalibration* , 2008, Critical care medicine.

[14]  R J Cook,et al.  Interobserver variation in interpreting chest radiographs for the diagnosis of acute respiratory distress syndrome. , 2000, American journal of respiratory and critical care medicine.

[15]  David T. Huang,et al.  Improving clinical trial design in acute lung injury , 2003, Critical care medicine.

[16]  Paul N. Lanken,et al.  Research Paper: Validation Study of an Automated Electronic Acute Lung Injury Screening Tool , 2009, J. Am. Medical Informatics Assoc..

[17]  Carlos Flores,et al.  An alternative method of acute lung injury classification for use in observational studies. , 2010, Chest.

[18]  H. Kraemer Ramifications of a population model forκ as a coefficient of reliability , 1979 .

[19]  Diane P. Martin,et al.  Incidence and outcomes of acute lung injury. , 2005, The New England journal of medicine.

[20]  Andrés Esteban,et al.  Acute respiratory distress syndrome: Underrecognition by clinicians and diagnostic accuracy of three clinical definitions* , 2005, Critical care medicine.

[21]  Cheng-Ta Yang,et al.  Diffuse alveolar damage associated mortality in selected acute respiratory distress syndrome patients with open lung biopsy , 2015, Critical Care.

[22]  Arthur S Slutsky,et al.  Acute Respiratory Distress Syndrome The Berlin Definition , 2012 .

[23]  J. R. Landis,et al.  An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. , 1977, Biometrics.

[24]  Salvador Benito,et al.  Characteristics and outcomes in adult patients receiving mechanical ventilation: a 28-day international study. , 2002, JAMA.

[25]  Paul N. Lanken,et al.  The impact of development of acute lung injury on hospital mortality in critically ill trauma patients , 2008, Critical care medicine.

[26]  Vitaly Herasevich,et al.  Validation of an electronic surveillance system for acute lung injury , 2009, Intensive Care Medicine.