Retrospective assessment of initial stroke severity with the NIH Stroke Scale.

BACKGROUND AND PURPOSE It is important to adjust stroke outcomes for differences in initial stroke severity. The NIH Stroke Scale (NIHSS) is a commonly used stroke severity measure but has been validated for retrospective scoring only in a subset of stroke clinical trial participants. The purpose of this research was to assess the validity and reliability of an algorithm for retrospective NIHSS scoring in a setting with usual chart documentation. METHODS An algorithm for retrospective NIHSS scoring was developed with written history and physical admission notes. Missing physical examination data were scored as normal. One investigator prospectively scored the admission NIHSS in 32 consecutive stroke patients. Two raters retrospectively scored the NIHSS by applying the algorithm to photocopied admission notes. Linear regression was used to assess interrater reliability and agreement between prospective and retrospective NIHSS scores. The Wilcoxon signed rank test was used to assess systematic scoring bias. Weighted kappa statistics were calculated to assess the level of agreement of individual NIHSS items. RESULTS Only 1 admission note was complete for all NIHSS elements. Interrater reliability was near perfect (r(2)=0.98, P<0. 001). Agreement between prospective and retrospective NIHSS score was also excellent (r(2)=0.94, P<0.001) and there was no systematic bias in retrospective scores. Agreement for individual items was moderate to high for all items except level of consciousness. CONCLUSIONS Retrospective NIHSS scoring with the algorithm is reliable and unbiased even when physical examination elements are missing from the written record. Stroke research using retrospective review of charts or of administrative databases should adjust for differences in stroke severity using such an algorithm.

[1]  A R Localio,et al.  Reliability and validity of estimating the NIH stroke scale score from medical records. , 1999, Stroke.

[2]  K. Pettigrew,et al.  Progression in acute stroke: value of the initial NIH stroke scale score on patient stratification in future trials. , 1999, Stroke.

[3]  A. Rudd,et al.  Casemix and Process Indicators of Outcome in Stroke , 1998, Journal of the Royal College of Physicians of London.

[4]  L. Goldstein,et al.  Retrospective assessment of initial stroke severity with the Canadian Neurological Scale. , 1997, Stroke.

[5]  G. Murray,et al.  Comparison of neurological scales and scoring systems for acute stroke prognosis. , 1996, Stroke.

[6]  C. Warlow,et al.  Effect of correcting outcome data for case mix: an example from stroke medicine , 1996, BMJ.

[7]  B. Söderfeldt,et al.  Functional outcome in patients with lacunar infarction. , 1996, Stroke.

[8]  B. Norrving,et al.  Survival following stroke. A prospective population‐based study of 438 hospitalized cases with prediction according to subtype, severity and age , 1990, Acta neurologica Scandinavica.

[9]  J. Marler,et al.  Measurements of acute cerebral infarction: a clinical examination scale. , 1989, Stroke.

[10]  James N. Davis,et al.  Interrater reliability of the NIH stroke scale. , 1989, Archives of neurology.

[11]  E. Roth,et al.  Multivariate analysis of improvement and outcome following stroke rehabilitation. , 1987, Archives of neurology.

[12]  J. Oxbury,et al.  Predicting the outcome of stroke: acute stage after cerebral infarction. , 1975, British medical journal.

[13]  Jacob Cohen,et al.  The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability , 1973 .

[14]  Gunnar B. Stickler,et al.  Prognosis , 1970, Clinical pediatrics.