Stroke aetiological classification reliability and effect on trial sample size: systematic review, meta-analysis and statistical modelling

BackgroundInter-observer variability in stroke aetiological classification may have an effect on trial power and estimation of treatment effect. We modelled the effect of misclassification on required sample size in a hypothetical cardioembolic (CE) stroke trial.MethodsWe performed a systematic review to quantify the reliability (inter-observer variability) of various stroke aetiological classification systems. We then modelled the effect of this misclassification in a hypothetical trial of anticoagulant in CE stroke contaminated by patients with non-cardioembolic (non-CE) stroke aetiology. Rates of misclassification were based on the summary reliability estimates from our systematic review. We randomly sampled data from previous acute trials in CE and non-CE participants, using the Virtual International Stroke Trials Archive. We used bootstrapping to model the effect of varying misclassification rates on sample size required to detect a between-group treatment effect across 5000 permutations. We described outcomes in terms of survival and stroke recurrence censored at 90 days.ResultsFrom 4655 titles, we found 14 articles describing three stroke classification systems. The inter-observer reliability of the classification systems varied from ‘fair’ to ‘very good’ and suggested misclassification rates of 5% and 20% for our modelling. The hypothetical trial, with 80% power and alpha 0.05, was able to show a difference in survival between anticoagulant and antiplatelet in CE with a sample size of 198 in both trial arms. Contamination of both arms with 5% misclassified participants inflated the required sample size to 237 and with 20% misclassification inflated the required sample size to 352, for equivalent trial power. For an outcome of stroke recurrence using the same data, base-case estimated sample size for 80% power and alpha 0.05 was n = 502 in each arm, increasing to 605 at 5% contamination and 973 at 20% contamination.ConclusionsStroke aetiological classification systems suffer from inter-observer variability, and the resulting misclassification may limit trial power.Trial registrationProtocol available at reviewregistry540.

[1]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[2]  H. Adams,et al.  Interphysician agreement in the diagnosis of subtypes of acute ischemic stroke , 1993, Neurology.

[3]  David B. Matchar,et al.  Improving the Reliability of Stroke Subgroup Classification Using the Trial of ORG 10172 in Acute Stroke Treatment (TOAST) Criteria , 2001, Stroke.

[4]  Matthias J. Müller,et al.  Effects of Interrater Reliability of Psychopathologic Assessment on Power and Sample Size Calculations in Clinical Trials , 2002, Journal of clinical psychopharmacology.

[5]  Yuchiao Chang,et al.  Effect of intensity of oral anticoagulation on stroke severity and mortality in atrial fibrillation. , 2003, The New England journal of medicine.

[6]  Thomas McGinn,et al.  Tips for learners of evidence-based medicine: 3. Measures of observer variability (kappa statistic) , 2004, Canadian Medical Association Journal.

[7]  Jan Potter,et al.  Reliability of the Modified Rankin Scale Across Multiple Raters: Benefits of a Structured Interview , 2005, Stroke.

[8]  B Fure,et al.  TOAST criteria applied in acute ischemic stroke , 2005, Acta neurologica Scandinavica.

[9]  K. Furie,et al.  An evidence‐based causative classification system for acute ischemic stroke , 2005, Annals of neurology.

[10]  B. Kissela,et al.  Interobserver agreement in the trial of org 10172 in acute stroke treatment classification of stroke based on retrospective medical record review. , 2006, Journal of stroke and cerebrovascular diseases : the official journal of National Stroke Association.

[11]  J. Heo,et al.  A New Subtype Classification of Ischemic Stroke Based on Treatment and Etiologic Mechanism , 2006, European Neurology.

[12]  Thomas Benner,et al.  A Computerized Algorithm for Etiologic Classification of Ischemic Stroke: The Causative Classification of Stroke System , 2007, Stroke.

[13]  J. Grotta,et al.  The Virtual International Stroke Trials Archive , 2007, Stroke.

[14]  S. Yusuf,et al.  Dabigatran versus warfarin in patients with atrial fibrillation. , 2009, The New England journal of medicine.

[15]  M. Walters,et al.  Reliability of the Modified Rankin Scale: A Systematic Review , 2009, Stroke.

[16]  M. Walters,et al.  Functional Outcome Measures in Contemporary Stroke Trials , 2009, International journal of stroke : official journal of the International Stroke Society.

[17]  P. Tyrrell,et al.  Classification of Minor Stroke: Intra- and Inter-Observer Reliability , 2009, Cerebrovascular Diseases.

[18]  P. Kelly,et al.  Stroke Subtype Classification to Mechanism-Specific and Undetermined Categories by TOAST, A-S-C-O, and Causative Classification System: Direct Comparison in the North Dublin Population Stroke Study , 2010, Stroke.

[19]  F. Fazekas,et al.  The Causative Classification of Stroke system: An international reliability and optimization study , 2010, Neurology.

[20]  Kam-Wing Ng,et al.  Improving Reliability for , 2010 .

[21]  D. Atar,et al.  Apixaban versus warfarin in patients with atrial fibrillation. , 2011, The New England journal of medicine.

[22]  M. Hennerici,et al.  Comparison of the new ASCO classification with the TOAST classification in a population with acute ischemic stroke , 2012, Journal of Neurology.

[23]  M. Belham,et al.  Towards Understanding the Cause of Stroke in Young Adults Utilising a New Stroke Classification System (A-S-C-O) , 2011, Cerebrovascular Diseases.

[24]  A. Hrõbjartsson,et al.  Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. , 2011, Journal of clinical epidemiology.

[25]  G. Breithardt,et al.  Rivaroxaban versus warfarin in nonvalvular atrial fibrillation. , 2011, The New England journal of medicine.

[26]  Vista Steering Committees Development, expansion, and use of a stroke clinical trials resource for novel exploratory analyses. , 2012 .

[27]  Marc Fisher,et al.  Development, Expansion, and Use of a Stroke Clinical Trials Resource for Novel Exploratory Analyses , 2012, International journal of stroke : official journal of the International Stroke Society.

[28]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[29]  H. S. Nam,et al.  Use of a handheld, computerized device as a decision support tool for stroke classification , 2012, European journal of neurology.

[30]  P. Langhorne,et al.  Improving the Efficiency of Stroke Trials: Feasibility and Efficacy of Group Adjudication of Functional End Points , 2013, Stroke.

[31]  Inter-rater reliability of the A-S-C-O classification system for ischemic stroke , 2013, Journal of Clinical Neuroscience.

[32]  Peter Langhorne,et al.  Reliability (Inter-rater Agreement) of the Barthel Index for Assessment of Stroke Survivors: Systematic Review and Meta-analysis , 2013, Stroke.

[33]  J. Grotta,et al.  How Well Do Standard Stroke Outcome Measures Reflect Quality of Life?: A Retrospective Analysis of Clinical Trial Data , 2013, Stroke.

[34]  Frank Beckers,et al.  Cryptogenic stroke and underlying atrial fibrillation. , 2014, The New England journal of medicine.

[35]  D. Krieger,et al.  EuroHYP-1: European Multicenter, Randomized, Phase III Clinical Trial of Therapeutic Hypothermia plus Best Medical Treatment vs. Best Medical Treatment Alone for Acute Ischemic Stroke , 2014, International journal of stroke : official journal of the International Stroke Society.

[36]  David J Gladstone,et al.  Atrial fibrillation in patients with cryptogenic stroke. , 2014, The New England journal of medicine.

[37]  R. Sacco,et al.  Design of Randomized, Double-Blind, Evaluation in Secondary Stroke Prevention Comparing the Efficacy and Safety of the Oral Thrombin Inhibitor Dabigatran Etexilate vs. Acetylsalicylic Acid in Patients with Embolic Stroke of Undetermined Source (Re-Spect Esus) , 2015, International journal of stroke : official journal of the International Stroke Society.

[38]  H. Diener,et al.  Effects of aspirin on risk and severity of early recurrent stroke after transient ischaemic attack and ischaemic stroke: time-course analysis of randomised trials , 2016, The Lancet.

[39]  PATRICIA VASSELL,et al.  Improving OR Efficiency. , 2016, AORN journal.

[40]  D. Hanley,et al.  Surgical Performance in Minimally Invasive Surgery Plus Recombinant Tissue Plasminogen Activator for Intracerebral Hemorrhage Evacuation Phase III Clinical Trial , 2017, Neurosurgery.

[41]  Helen Rodgers,et al.  Use of a 3-Item Short-Form Version of the Barthel Index for Use in Stroke: Systematic Review and External Validation , 2017, Stroke.

[42]  D. Hanley,et al.  Thrombolytic removal of intraventricular haemorrhage in treatment of severe stroke: results of the randomised, multicentre, multiregion, placebo-controlled CLEAR III trial , 2017, The Lancet.

[43]  S. Connolly,et al.  Rivaroxaban for Stroke Prevention after Embolic Stroke of Undetermined Source , 2018, The New England journal of medicine.

[44]  M. Macleod,et al.  PRECIOUS: PREvention of Complications to Improve OUtcome in elderly patients with acute Stroke. Rationale and design of a randomised, open, phase III, clinical trial with blinded outcome assessment , 2018, European stroke journal.