(Stroke. 2001;32:1310.)
© 2001 American Heart Association, Inc.
Original Contributions |
From the Departments of Neurology, Veterans Administration Medical Center, San Diego, and Neurosciences, University of California at San Diego School of Medicine (P.D.L.); Department of Biostatistics and Research Epidemiology, Henry Ford Health Science Center, Detroit, Mich (M.L.); Department of Neurology, Wayne State University School of Medicine, Detroit, Mich (S.R.L.); Department of Neurology, Mayo Clinic, Jacksonville, Fla (T.G.B.); and Department of Neurology, University of Cincinnati (Ohio) (J.B.).
Correspondence to Patrick D. Lyden, MD, Stroke Center (8466), 3rd Floor, OPC, Suite #3, 200 W Arbor Dr, San Diego, CA, 92103-8466. E-mail plyden{at}ucsd.edu
| Abstract |
|---|
|
|
|---|
MethodsThe mNIHSS was derived from our prior clinimetric studies of the NIHSS by deleting poorly reproducible or redundant items (level of consciousness, face weakness, ataxia, dysarthria) and collapsing the sensory item into 2 responses. Reliability of the mNIHSS was assessed with the certification data originally collected to assess the reliability of investigators in the National Institute of Neurological Disorders and Stroke (NINDS) rtPA (recombinant tissue plasminogen activator) Stroke Trial. Validity of the mNIHSS was assessed with the outcome results of the NINDS rtPA Stroke Trial.
ResultsReliability was
improved with the mNIHSS: the number of scale items with poor
coefficients on either of the certification tapes decreased from 8
(20%) to 3 (14%) with the mNIHSS. With the use of factor
analysis, the structure underlying the mNIHSS was found
identical to the original scale. On serial use of the scale, goodness
of fit coefficients were higher with the mNIHSS. With data from part I
of the trial data, the proportion of patients who improved
4 points
within 24 hours after treatment was statistically significantly
increased by tPA (odds ratio, 1.3; 95% confidence limits, 1.0, 1.8;
P=0.05). Likewise, the
odds ratio for complete/nearly complete resolution of stroke symptoms 3
months after treatment was 1.7 (95% confidence limits, 1.2, 2.6) with
the mNIHSS. Other outcomes showed the same agreement when the mNIHSS
was compared with the original scale. The mNIHSS showed good
responsiveness, ie, was useful in differentiating patients likely to
hemorrhage or have a good outcome after stroke.
ConclusionsThe mNIHSS appears to be identical clinimetrically to the original NIHSS when the same data are used for validation and reliability. Power appears to be greater with the mNIHSS with the use of 24-hour end points, suggesting the need for fewer patients in trials designed to detect treatment effects comparable to rtPA. The mNIHSS contains fewer items and might be simpler to use in clinical research trials. Prospective analysis of reliability and validity, with the use of an independently collected cohort, must be obtained before the mNIHSS is used in a research setting.
Key Words: outcome outcome assessment stroke stroke assessment
| Introduction |
|---|
|
|
|---|
scores (poor interrater
reliability).9 We found that
the internal structure of the scale consists of 2 basic factors,
relating to each of the 2 cerebral
hemispheres.8 These previous
investigations suggested that the NIHSS might be modified to make it
simpler and easier to use. A scale with excess items, or items that do
not contribute to the score in a meaningful way, wastes time and
effort. Therefore, we propose a simplified version, the modified NIHSS
(mNIHSS). To estimate the reliability and validity of the new scale, we
used the same data we used previously to investigate the NIHSS. Using
the previously published data allows easy comparison of the clinimetric
features of the mNIHSS and facilitates the design of a prospective
investigation, which is required to fully evaluate the
mNIHSS. | Methods |
|---|
|
|
|---|
=0.62) but in a factor
analysis was redundant.8 This item was
dropped, but the 2 remaining "consciousness" items were retained
because they showed higher
values. The reliability of the ataxia
item was poor, and it contributed little to the internal structure of
the scale; this item was therefore
deleted.7 The facial weakness
and dysarthria items exhibited poor reliability and appeared to be
redundant in a factor analysis and therefore were removed. The
sensory item was collapsed from 3 to 2 choices, on the basis of the
poor
seen in the reliability
study.7 The resulting mNIHSS
is shown in Table 1
|
To evaluate the mNIHSS, we used the data from the National
Institute of Neurological Disorders and Stroke (NINDS) rtPA Stroke
Trials, which were 2 randomized, double-blind, placebo-controlled
trials to assess the effect of recombinant tissue
plasminogen activator (rtPA) treatment for
patients with acute ischemic
stroke.10 Eight clinical
centers participated in the study, with >40 hospitals involved. Six
hundred twenty-four patients were enrolled in 2 separate trials (part
I, n=291; part II, n=333). The 2 trials had the same recruitment and
data collection protocol but different primary outcomes. In part I,
rtPA activity (
4-point improvement over baseline) was assessed 24
hours after stroke onset, but all end points were evaluated 90 days
after stroke as well. Part II was a study of the rtPA treatment
efficacy 90 days after stroke onset. In both parts, the NIHSS scores
were collected at baseline, 2 hours after treatment onset, and 24
hours, 7 to 10 days, and 3 months after stroke onset. Lesion volumes
were estimated from unenhanced CT scans obtained 3 months after stroke
with quantitative volumetry. The details of the CT method will be the
subject of another publication.
Reliability
We measured mNIHSS interrater and intrarater
reliability with the certification data collected
previously.7 The details of
the design of the prior version of the scale, as well as our video
certification method, are described in detail in our prior
report.7 Briefly, we printed
the scale instructions on the face of the stroke scale form so that the
examiner always had them available for reference. We videotaped 4
different examiners examining 11 certification patients on 2 different
certification tapes. Each certification tape contained a brief
introduction, intended to standardize the manner in which the
investigators viewed the patients. To measure the effectiveness of the
training process, we summarized the agreement among raters using
statistics for the case of multiple raters who examine a few
subjects.11 12
The unweighted
is qualified as follows:
<0.40 defines poor
concordance,
between 0.40 and 0.75 defines moderate concordance,
and
>0.75 defines excellent concordance.12
Content Validity: Factor Analysis
To assess the underlying structure of mNIHSS items,
we used factor analysis, as we have described in
detail.8 We recalculated the
goodness of fit on the basis of a 4-factor solution restricted to 11
NIHSS items involved in the mNIHSS, using data collected in the NINDS
rtPA Stroke Trials. We evaluated the goodness of fit on data collected
at the baseline in parts I (n=291) and II (n=333) for
reliability.13 To ensure
that the factor structure was free of time or treatment confounding
effects, the goodness of fit was calculated for data collected at 2
hours, 24 hours, 7 to 10 days, and 3 months after rtPA treatment or
placebo treatment. This analysis was intended to determine
whether the mNIHSS behaved in a manner similar to the NIHSS over serial
examinations.
Concurrent and Predictive Validity
To estimate the validity of the scale, we determined
mNIHSS scores from data collected in the NINDS rtPA Stroke Trials on
624 patients.10 To assess
criterion and predictive validity, we compared the relationship between
mNIHSS and NIHSS scores at baseline, 24 hours, and 3 months using
Spearman correlation coefficients. The intention-to-treat approach was
used to impute the worst score (31 for mNIHSS or 42 for NIHSS score)
for patients who died or missed the follow-up. To measure concurrent
validity, we compared the correlation of mNIHSS with the other
neurological functions (Barthel
Index,14 modified Rankin
Scale,15 and Glasgow Coma
Scale16 ) measured at 3
months on the basis of scores and dichotomized variables. The
coefficients were calculated among the binary outcomes. We used the
mNIHSS to test for treatment effect on improvement at 24 hours and
treatment effect on minimal or no disability at 3 months after stroke
(a 3-month favorable outcome), for comparison with the original
report.10 The improvement at
24 hours is defined as a 4-point decrease from baseline in the mNIHSS
or complete resolution of neurological deficits at 24 hours from stroke
onset, as was used in the prior trial. The 3-month favorable outcome is
the summary statistic defined from a set of measures at 3 months
including a Barthel Index >95, Rankin Scale <1, Glasgow Outcome
Scale=1, and mNIHSS <1. The Mantel-Haenszel test was used with
stratification according to clinical centers and/or time when studying
the 0- to 180-minute group for improvement at 24
hours.17 The relative risk
and 95% confidence limits were reported. A relative risk >1 with 95%
confidence limits excluding 1 indicated significant treatment benefit
of rtPA compared with placebo.
Responsiveness
To test whether the mNIHSS could predict differential
stroke outcomes (similar to responsiveness), we used the mNIHSS to
replace the NIHSS in a predictive model of acute intracranial
hemorrhages.18 The
details of this model have been published: the NIHSS proved to be one
of the key predictors in that analysis, suggesting that the
data set would be a good test of the responsiveness of the
mNIHSS.19 We also modeled
the probability of a 3-month favorable outcome in a prior study; again,
we substituted the mNIHSS for the NIHSS in that model.20
| Results |
|---|
|
|
|---|
values decreased from 20% to 14%. For the mNIHSS, 55% of
the items showed excellent agreement, compared with 40% for the
NIHSS. Compared with 8 items for the NIHSS, only 3 mNIHSS items
showed poor agreement: loss of consciousness questions (item 1C), gaze,
and aphasia.
|
Validity
The internal structure of the mNIHSS was identical to
that of the NIHSS, a measure of content validity (sometimes called
structure validity). The 4-factor solution, which was the best in our
prior investigation of the NIHSS, was also found in the mNIHSS
(Table 3
). The 4-factor loading for each new scale item is shown. The higher the loading, the greater is the contribution of
that item to the factor. As with the NIHSS, the first factor
seems related to left hemisphere function, containing language and the
2 items requiring correct verbal responses. The second factor,
containing the gaze, visual field, neglect, and sensory items, can be
related to right hemisphere function, since right cortical neglect
influences the responses on the sensory and visual items, while aphasia
interferes with responses to the gaze, visual, and sensory items. These
2 effects may work together to cause these items to load most heavily
on the right hemisphere factor. Gaze, visual, and sensory items are
influenced by left hemisphere as well; they load on the right
hemisphere because of the combined effects of aphasia and neglect. The
third factor represents left hemisphere motor function, while
the fourth factor represents right hemisphere motor function.
The dysarthria item loads weakly on both motor items, as might be
expected, but loads most heavily on the left hemisphere. The goodness
of fit statistic (comparative fit index=0.96) is equal to that of the
NIHSS. When used over time, and in placebo-treated compared with
rtPA-treated groups, the mNIHSS values were as robust as those of the
NIHSS, ranging from 0.93 to 0.96. The goodness of fit coefficients were
all greater than the comparable values found for the NIHSS.
|
The mNIHSS was valid for detecting drug treatment
effect. The scale scores clearly differentiated the 2 treatment groups
at 24 hours, as illustrated in
Table 4
. In the original trial, early improvement was defined as complete resolution of the neurological deficit or an
improvement from baseline in the NIHSS score by
4 points 24 hours
after the onset of stroke. For comparison with the original
scale, the data previously published are
listed.10 The relative risk
for scoring
4 points better than baseline is shown. Part I of the
study was designed prospectively to detect this effect and was not
statistically significant in the original study. With the use of the
mNIHSS, however, part I was statistically significant in the 91- to
180-minute and the 0- to 180-minute strata. Each analysis in
Table 4
indicates that the confidence levels are generally
comparable with the use of the mNIHSS. Using another definition of
early improvement, such as 5 or
6 points, results in similar
findings: more rtPA-treated patients showed early
improvement.
|
The mNIHSS also differentiated the treatment groups at 3 months
(Table 5
). The results are shown for various strata
from the original study to allow comparison with Table 4
in the original publication.10 In the
original trial, the primary outcome was defined as an improved odds of
recovery 3 months after treatment, using a global odds ratio. The
global odds ratio is computed from all 4 stroke scales. The odds ratios
calculated with the mNIHSS in place of the original NIHSS are
comparable. The slight decreases in the probability values indicate a
possible increase in power. The odds ratios and relative risk ratios
for a score of 0 or 1 at 3 months are also shown. Again, the ratios are
comparable, and the probability values are slightly lower with the use
of the mNIHSS. Power was estimated directly with the use of a method
for multiple correlated binary
outcomes.21 With part
I data, the power to detect a
4-point improvement by 24 hours was
increased from 24% to 51% with the mNIHSS. With part II data, the
power to detect a favorable outcome (score of 0 or 1) 3 months after
stroke did not improve with the mNIHSS.
|
Responsiveness
When we substituted the mNIHSS for the original scale
in the logistic models, the results were identical to our prior
reports. The predication model of hemorrhage was based on the
rtPA-treated group only because there were so few hemorrhages
in the placebo group. The final multivariable
symptomatic hemorrhage prediction model contained
the same 2 variables as in our prior report: early CT findings and
categorized mNIHSS. The odds ratios for symptomatic
hemorrhage in the rtPA-treated patients (n=306) were 1.65 (95%
confidence limits, 1.13, 3.40) for the mNIHSS score and 6.90 (95%
confidence limits, 2.00, 23.78) for early CT findings of
ischemia. These odds ratios and confidence limits are similar
to those obtained in the prior report with the original NIHSS. The
model for predicting all hemorrhages within 36 hours of
treatment, symptomatic and asymptomatic, was
also identical to the previous report, containing the same
variables: smoking, baseline mNIHSS, early-ischemic CT
findings, and admission blood pressure. Odds ratios and confidence
limits were similar to the previous report (data not
shown).
The prediction of a favorable outcome uses baseline variables that may predict a favorable outcome on the global odds ratio, incorporating all 4 outcome variables. The analysis with mNIHSS was identical to the analysis with the original NIHSS, reported previously.20 The odds ratio for a treatment effect was 2.28 (95% confidence limits, 1.62, 3.22), which compares favorably with the first report. Of considerable importance is the observation that the use of the mNIHSS yields the same interaction terms in the final model, showing that the mNIHSS is similar to the original scale in logistic regression analyses.
| Discussion |
|---|
|
|
|---|
The scale proved nearly identical to the original when used
in various statistical modeling procedures. The correlation of various
outcomes, shown in
Table 6
, may be viewed as validity, in the sense that
patients who score well on the Rankin, Glasgow Outcome, or Barthel
scales also score well on the NIHSS. Furthermore, when compared with
the original scale, the correlation coefficients are nearly identical,
confirming that the mNIHSS behaves in a manner similar to the original.
More importantly, the scale imitated the original in the predictive
models, which can be taken as an indicator of responsiveness. That is,
the mNIHSS tends to predict response of patients to tPA as well as the
original scale, when used in the multivariable
model.20 Likewise, the
mNIHSS predicts likelihood of hemorrhage after tPA treatment as
well as the original in the multivariable model of
symptomatic
hemorrhage.19
However, another view of responsiveness includes the property of
predicting within-individual changes; we could not assess this aspect
of responsiveness.
|
A limitation of this investigation is the use of data collected previously. The NIHSS scores collected during the reliability study and during the actual NINDS rtPA Stroke Trial were collected on forms that contained all 15 items of the NIHSS. For the present analysis we simply analyzed the 11 items proposed for the mNIHSS. While reasonable, this exercise has the potential pitfall that scores are biased. It is a theoretical concern that the presence of the 4 unsettled items may inadvertently influence the scores on the remaining items. To truly estimate the reliability statistics, the 11-item scale must be tested directly in a prospective design. Nevertheless, our data support further investigation of the mNIHSS and suggest that it may indeed be more reliable than the NIHSS.
As an indicator of validity, we used the mNIHSS to predict
outcome in the original NINDS rtPA Stroke Trial. The scale certainly
predicted the outcome that was detected with the NIHSS. The values for
relative risk shown in
Table 4
parallel those seen in the original study using the
NIHSS.10 However, the
absolute values are higher, the confidence levels are generally
narrower, and more of the tests are statistically significant, results
predicted from the improved reliability of the mNIHSS. One might
question, however, whether the mNIHSS might somehow overestimate a
putative drug effect. It is true that the original study showed no
significant effect on the main outcome variable at 24 hours: a
4-point improvement on the NIHSS. However, we have shown elsewhere that
in fact there was a highly significant effect on outcome at 24
hours.22 If any other
criterion had been used for early improvement, ie, 5 or
6 points,
then the original study would have been significantly positive.
Therefore, the observation that the mNIHSS would have yielded a
positive study on the 24-hour outcome is consistent with the
improved power of the modified scale. The greater power of the mNIHSS
would imply that fewer patients would be needed in future trials to
detect significant treatment effects with the use of the 24-hour end
points.
The outcome data 3 months after treatment suggest that the
power for the mNIHSS is similar to the original. For each stratum we
studied
(Table 5
), the confidence limits were generally narrower and
the probability values were generally lower than with the NIHSS.
Reassuringly, the global odds ratio, calculated from all 4 outcome
scales, changed only slightly when the mNIHSS replaced the NIHSS. This
confirms the robust nature of the global test and further suggests that
the mNIHSS may be combined with the other scales in the global test. In
confirmation of this, we note that the correlation coefficients among
the scales remained about the same when the mNIHSS was substituted for
the NIHSS
(Table 6
).
Our study does not address the potential bias of the original NIHSS toward hemispheric lateralization. For each 5-point category of the NIHSS <20, the median volume of right hemisphere infarction was approximately double the volume of left hemisphere infarction.23 The significance of this observation is not yet clear, and we plan further studies of both the NIHSS and the mNIHSS with regard to potential hemispheric bias. Additionally, we did not assess the effect of eliminating other, low-reliability items such as gaze or neglect; these items were believed essential to characterizing the stroke study population.
The mNIHSS is not the ideal stroke scale, but it is clearly an improvement over the previous NIHSS. When used by trained investigators, it shows greater reliability and power than the predecessor and is valid for detecting a treatment effect. A further, prospective confirmation is needed before clinical use.
| Appendix 1 |
|---|
|
|
|---|
| Acknowledgments |
|---|
| Footnotes |
|---|
Received August 15, 2001; revision received December 1, 2000; accepted December 20, 2000.
| References |
|---|
|
|
|---|
2. Boysen G. Stroke scores and scales. Cerebrovasc Dis. 1992;2:239247.
3.
de Haan R, Horn J, Limburg M, Van Der Meulen J, Bossuyt P. A comparison of five stroke scales with measures of disability, handicap, and quality of life. Stroke. 1993;24:11781181.
4.
DOlhaberriague L,
Litvan I, Mitsias P, Mansbach H. A reappraisal of reliability and
validity studies in stroke. Stroke. 1996;27:23312336.
5. Lyden PD, Hantson L. Assessment scales for the evaluation of stroke patients. J Stroke Cerebrovasc Dis. 1998;7:113127.
6.
Cote R, Battista
RN, Wolfson C, Boucher J, Adam J. The Canadian Neurological Scale:
validation and reliability assessment. Neurology. 1989;39:638643.
7. Lyden P, Brott T, Tilley B, Welch KMA, Mascha EJ, Levine S, Haley EC, Grotta J, Marler J, for the NINDS tPA Stroke Study Group. Improved reliability of the NIH Stroke Scale using video training. Stroke. 1994;25:22202226.[Abstract]
8.
Lyden P, Lu M,
Jackson C, Marler J, Kothari R, Brott T, Zivin J, for the National
Institute of Neurological Disorders and Stroke rt-PA Stroke Study
Group. Underlying structure of the National Institutes of Health Stroke
Scale: results of a factor analysis. Stroke. 1999;30:23472354.
9.
Goldstein LB,
Bartels C, Davis JN. Interrater reliability of the NIH Stroke Scale.
Arch Neurol. 1989;46:660662.
10.
NINDS rt-PA Stroke Study Group. Tissue plasminogen
activator for acute ischemic stroke. N Engl J Med. 1995;333:15811587.
11. Fleiss JL. Statistical Methods for Rates and Proportions. 2nd ed. New York, NY: John Wiley & Sons; 1981.
12. Siegel S, Castellan NJ Jr. Nonparametric Statistics for the Behavioral Sciences. 2nd ed. New York, NY: McGraw-Hill; 1992:284291.
13. Bentler P, Bonett D. Significance tests and goodness-of-fit in the analysis of covariance structures. Psychol Bull. 1980;88:588606.
14. Mahoney FT, Barthel DW. Functional evaluation: Barthel Index. Md State Med J. 1965;14:6165.[Medline] [Order article via Infotrieve]
15. Rankin J. Cerebral vascular accidents in patients over the age of 60: prognosis. Scott Med J. 1957;2:200215.[Medline] [Order article via Infotrieve]
16. Jennett B, Teasdale G, Braakman R, Minderhoud J, Heiden J, Kurze T. Prognosis of patients with severe head injury. Neurosurgery. 1979;4:283289.[Medline] [Order article via Infotrieve]
17.
Tilley BC, Marler J, Geller NL, Lu M, Legler J, Brott T, Lyden P, Grotta J. Use of a global test for multiple outcomes in stroke trials with application to the National Institute of Neurological Disorders and Stroke t-PA Trial. Stroke. 1996;27:21362142.
18. Kirshner B, Guyatt G. A methodological framework for assessing health indices. J Chron Dis. 1985;38:2736.[Medline] [Order article via Infotrieve]
19.
NINDS t-PA Stroke
Study Group. Intracerebral hemorrhage after
intravenous t-PA therapy for ischemic stroke.
Stroke. 1997;28:21092118.
20.
NINDS t-PA Stroke
Study Group. Generalized efficacy of t-PA for acute stroke: subgroup
analysis of the NINDS t-PA Stroke Trial.
Stroke. 1997;28:21192125.
21. Tilley BC, Lu M, for the NINDS t-PA Trial Investigators. Analytic issues for stroke clinical trials. Presented at: Society for Clinical Trials meeting; May 1996; Pittsburgh, Pa.
22. Haley EC Jr, Lewandowski C, Tilley BC, for the NINDS rt-PA Stroke Study Group. Myths regarding the NINDS rt-PA Stroke Trial: setting the record straight. Ann Emerg Med. 1997;30:676682.[Medline] [Order article via Infotrieve]
23.
Woo D, Broderick
J, Kothari R, Lu M, Brott T, Marler J, Grotta J, for the NINDS rt-PA
Stroke Study Group. Does the National Institutes of Health Stroke Scale
favor left hemisphere strokes? Stroke. 1999;30:23552359.
Department of Neurology and Neurosurgery, McGill University, Montreal, Quebec, Canada
| Introduction |
|---|
|
|
|---|
Because they can be used for different purposes, such as to evaluate a new therapy within a randomized controlled trial or to closely monitor the neurological evolution of stroke patients after admission to hospital to guide management, stroke scales have to answer different needs and thus can vary in their design and content.R6 For example a scale used to measure a therapeutic effect would benefit from being more comprehensive and sensitive to detect even small but beneficial effects of a new drug.R7 On the other hand, a scale that would be used by different health care professionals to monitor a patients evolution and permit the detection of clinically meaningful changes in neurological status would need to put more emphasis on reproducibility, simplicity, and responsiveness to within-individual changes.R8
Regardless of the indication for which they were designed, stroke scales should adhere to some basic and recognized principles that will ensure their reliability and validity.R4 Some of these include (1) simple and nonambiguous definitions for the modalities tested, (2) a minimum number of grades per item to minimize variability, (3) selection of the most relevant neurological deficits, (4) ease of use and interpretation by observers with different medical backgrounds, and (5) brevity and simplicity. Reliability, an important attribute of any scale, can be assessed through different approaches. These may include intraobserver reliability, which determines the stability of a measure at 2 different points in time when applied by the same observer; interrater reliability, which assesses the reproducibility of a measure between 2 or more raters at the same point in time; and internal consistency, which measures the extent to which the scales items substantively measure the same clinical concept. The scale also has to possess validity, which means that it should reflect as closely as possible the clinical phenomenon under study. In general, one should be concerned with 3 types of validity: first, content validity, which is an index of how well the scale reflects the components of what is being measured; second, criterion validity, which determines whether the scale reflects the current neurological status as defined by a gold standard (concurrent validity) or predicts the future health status of the patient (predictive validity); and finally, construct validity, which assesses whether the neurological deficit measured by the stroke scale is different from other types of deficits quantified by another scale (discriminant validity) and also evaluates the capacity of the scale to correlate with other measures of the same construct over time to detect meaningful changes (convergent validity). Convergent validation is equivalent to establishing the responsiveness of the scale, which is another important property of any clinical instrument.
In the past several years, the NIH Stroke Scale (NIHSS) has been the most widely used clinical instrument in clinical trials to assess therapeutic interventions in the acute stroke setting. It is fairly comprehensive and has also been submitted to several assessments to test its reliability and validity.R7 R9 In addition, a certification program using training videotapes has been in use to increase its reliability.R10
In the preceding article, Lyden et al propose a modified and simplified version of the NIHSS. Their goal was to increase both the reliability of the scale and its capacity to reflect more meaningful clinical information. To do this, they used 2 techniques: first, they identified and excluded selected items that had shown either poor reliability or redundancy in previous analyses, and second, they collapsed the choices for the sensory item. The resulting modified NIHSS (mNIHSS) contains 11 items from the 15 in the initial version, with a worse score of 31 compared with 42 in the original scale, a reduction of about 25% in terms of total points. Based on the data sets from the NINDS rt-PA Stroke Trials and previously collected certification data, they then proceeded to test the mNIHSS for validity and reliability. The authors report improved reliability with the mNIHSS, which was to be expected, but interestingly also show that the modified scale retains good content validity when compared with the NIHSS. In addition, the modified scale performs as well as the original NIHSS in terms of correlation with other concurrently administered scales that reflect various other aspects of patient function, and it also appears at least as powerful (if not more so) in predicting certain specific outcomes, such as neurological impairment at 24 hours. This study represents an important advance; however, some issues remain unresolved and could be further explored in future work. For example, this includes the determination of new cut-point criteria to define meaningful clinical changes or to predict outcome and differential weighting of items to address the problem of hemispheric bias, which was alluded to by the authors.
The current report constitutes very good news for all physicians and other healthcare professionals involved in the evaluation of new therapies for acute stroke patients. These results, based on retrospective analyses, show promise and represent a first step toward greater acceptance and utilization of the NIH stroke scale for the quantification of neurological impairment in clinical trials. Appropriately so, the authors recognize that additional prospective validation studies will be required to confirm and strengthen the present findings. We strongly encourage the authors in this undertaking and look forward to using the modified NIHSS in future clinical trials.
Received August 15, 2001; revision received December 1, 2000; accepted December 20, 2000.
| References |
|---|
|
|
|---|
2. Feinstein AR. An additional basic science for clinical medicine, III: the challenges of comparison and measurement. Ann Intern Med. 1983;99:705712.
3. Mathew NT, Rivera VM, Meyer JS, Charney JZ, Hartmann A. Double-blind evaluation of glycerol therapy in acute cerebral infarction. Lancet. 1972;2:13271329.[Medline] [Order article via Infotrieve]
4. Côté R, Battista RN, Wolfson CM, Hachinski V. Stroke assessment scales: guidelines for development, validation, and reliability assessment. Can J Neurol Sci. 1988;15:261265.[Medline] [Order article via Infotrieve]
5. Candelise L (ed). Stroke scores and scales. Cerebrovasc Dis. 1992;2:239247.
6. Hantson L, DeKeyser J. Neurological scales in the assessment of cerebral infarction. Cerebrovasc Dis. 1994;4(suppl 2):714.
7.
Brott T, Adams HP
Jr, Olinger CP, Marler JR, Barsan WG, Biller J, Spiker J, Holleran R,
Eberle R, Hertzberg V, Rorick M, Moomaw CJ, Walker M. Measurements of
acute cerebral infarction: a clinical examination scale.
Stroke. 1989;20:864870.
8. Côté R, Battista RN, Wolfson C, Boucher J, Adam J, Hachinski V. The Canadian Neurological Scale: validation and reliability assessment. Neurology. 1989;39:638643.
9. Goldstein LB, Bertels C, Davis JN. Interrater reliability of the NIH Stroke Scale. Arch Neurol. 1989;46:660662.
10. Lyden P, Brott T, Tilley B, Welch KMA, Mascha EJ, Levine S, Haley EC, Grotta J, Marler J, and the NINDS TPA Stroke Study Group. Improved reliability of the NIH Stroke Scale using video training. Stroke. 1994;25:22202226.
This article has been cited by other articles:
![]() |
L. Govan, P. Langhorne, and C. J. Weir Categorizing Stroke Prognosis Using Different Stroke Scales Stroke, October 1, 2009; 40(10): 3396 - 3399. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. H. Schwamm, R. G. Holloway, P. Amarenco, H. J. Audebert, T. Bakas, N. R. Chumbler, R. Handschu, E. C. Jauch, W. A. Knight IV, S. R. Levine, et al. A Review of the Evidence for the Use of Telemedicine Within Stroke Systems of Care: A Scientific Statement From the American Heart Association/American Stroke Association Stroke, July 1, 2009; 40(7): 2616 - 2634. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.J. Stott, A. Falconer, H. Miller, J.C. Tilston, and P. Langhorne Urinary tract infection after stroke QJM, April 1, 2009; 102(4): 243 - 249. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.R. Lees, G.A. Ford, K.W. Muir, N. Ahmed, A.G. Dyker, S. Atula, L. Kalra, E.A. Warburton, J.-C. Baron, D.F. Jenkinson, et al. Thrombolytic therapy for acute stroke in the United Kingdom: experience from the safe implementation of thrombolysis in stroke (SITS) register QJM, November 1, 2008; 101(11): 863 - 869. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Sellars, L. Bowie, J. Bagg, M. P. Sweeney, H. Miller, J. Tilston, P. Langhorne, and D. J. Stott Risk Factors for Chest Infection in Acute Stroke: A Prospective Cohort Study Stroke, August 1, 2007; 38(8): 2284 - 2291. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Lampl, J. A. Zivin, M. Fisher, R. Lew, L. Welin, B. Dahlof, P. Borenstein, B. Andersson, J. Perez, C. Caparo, et al. Infrared Laser Therapy for Ischemic Stroke: A New Treatment Strategy: Results of the NeuroThera Effectiveness and Safety Trial-1 (NEST-1) Stroke, June 1, 2007; 38(6): 1843 - 1849. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Dominguez, J. F. Vila, F. Augustovski, V. Irazola, P. R. Castillo, R. R. Escalante, T. G. Brott, and J. F. Meschia Spanish Cross-Cultural Adaptation and Validation of the National Institutes of Health Stroke Scale Mayo Clin. Proc., April 1, 2006; 81(4): 476 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lyden, R. Raman, L. Liu, J. Grotta, J. Broderick, S. Olson, S. Shaw, J. Spilker, B. Meyer, M. Emr, et al. NIHSS Training and Certification Using a New Digital Video Disk Is Reliable Stroke, November 1, 2005; 36(11): 2446 - 2449. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. B. Young, C. J. Weir, K. R. Lees, and for the GAIN International Trial Steering Committe Comparison of the National Institutes of Health Stroke Scale With Disability Outcome Measures in Acute Stroke Trials Stroke, October 1, 2005; 36(10): 2187 - 2192. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Lyden, L. Claesson, S. Havstad, T. Ashwood, and M. Lu Factor Analysis of the National Institutes of Health Stroke Scale in Patients With Large Strokes Arch Neurol, November 1, 2004; 61(11): 1677 - 1680. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Davis, A. A. Wong, P. J. Schluter, R. D. Henderson, J. D. O'Sullivan, and S. J. Read Impact of Premorbid Undernutrition on Outcome in Stroke Patients Stroke, August 1, 2004; 35(8): 1930 - 1934. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Aslanyan, C. J. Weir, K. R. Lees, J. L. Reid, and G. T. McInnes Effect of Area-Based Deprivation on the Severity, Subtype, and Outcome of Ischemic Stroke Stroke, November 1, 2003; 34(11): 2623 - 2628. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Weir, S. W. Muir, M. R. Walters, and K. R. Lees Serum Urate as an Independent Predictor of Poor Outcome and Future Vascular Events After Acute Stroke Stroke, August 1, 2003; 34(8): 1951 - 1956. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Kasner, B. L. Cucchiara, M. L. McGarvey, J. M. Luciano, D. S. Liebeskind, and J. A. Chalela Modified National Institutes of Health Stroke Scale Can Be Estimated From Medical Records Stroke, February 1, 2003; 34(2): 568 - 570. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Tirschwell, W.T. Longstreth Jr, K. J. Becker, R. E. Gammans Sr, L. A. Sabounjian, S. Hamilton, and L. B. Morgenstern Shortening the NIH Stroke Scale for Use in the Prehospital Setting Stroke, December 1, 2002; 33(12): 2801 - 2806. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Gladstone, S. E. Black, and A. M. Hakim Toward Wisdom From Failure: Lessons From Neuroprotective Stroke Trials and New Therapeutic Directions Stroke, August 1, 2002; 33(8): 2123 - 2136. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. C. Meyer, T. M. Hemmen, C. M. Jackson, and P. D. Lyden Modified National Institutes of Health Stroke Scale for Use in Stroke Clinical Trials: Prospective Reliability and Validity Stroke, May 1, 2002; 33(5): 1261 - 1266. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. Njemanze, J. Anozie, and I. Okadike 3D Vector Component Analysis of the Modified National Institutes of Health Neurological Stroke Scale Stroke, December 1, 2001; 32(12): 2958 - 2960. [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Stroke Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2001 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |