| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Stroke. 2005;36:2446.)
© 2005 American Heart Association, Inc.
Original Contributions |
From the Departments of Neurosciences (P.L., R.R., S.O., B.M.) and Family and Preventive Medicine (R.R., L.L.), UCSD School of Medicine, and Neurology (P.L.), Veterans Administration Medical Center, San Diego, Calif; the Department of Neurology, University of Texas-Houston Medical School (J.G., S.S.), Houston, Texas; the University of Cincinnati Medical Center (J.B., J.S.), Cincinnati, Ohio, and the National Institute of Neurological Disorders and Stroke (M.E., M.W., J.M.).
Correspondence to Patrick Lyden, 3350 La Jolla Village Dr, San Diego, CA 92161. E-mail plyden{at}ucsd.edu
| Abstract |
|---|
|
|
|---|
Methods After producing a new NIHSS training and demonstration DVD, we selected 18 patients representing all possible scores on 15 scale items for a new certification DVD. Patients were divided into 3 certification groups of 6 patients each, balanced for lesion side, distribution of scale item findings, and total score. We sought to measure interrater reliability of the certification DVD using methodology previously published for the original videotapes. Raters were recruited from 3 experienced stroke centers. Each rater watched the new training DVD and then evaluated one of the 3 certification groups.
Results Responses were received from 112 raters: 26.2% of all responses came from stroke nurses, 34.1% from emergency departments/other physicians, and 39.6% from neurologists. One half (50%) of raters were previously NIHSS-certified. Item responses were tabulated, scoring performed as previously published, and agreement measured with unweighted
coefficients for individual items and an intraclass correlation coefficient for the overall score.
ranged from 0.21±0.05 (ataxia) to 0.92±0.09 (LOC-C questions). Of 15 items, 2 showed poor, 11 moderate, and 2 excellent agreement based on
scores. The intraclass correlation coefficient for total score was 0.94 (95% confidence interval, 0.84 to 1.00). Reliability scores were similar among specialists and centers, and there were no differences between nurses and physicians.
scores trended higher among raters previously certified.
Conclusions These certification DVDs are reliable for NIHSS certification, and scoring sheets have been posted on a web site for real-time, online certification.
Key Words: clinometrics reliability scales stroke
| Introduction |
|---|
|
|
|---|
| Methods |
|---|
|
|
|---|
A total of 26 patients were selected. To cover all possible scale scores, 18 were used for certification and 8 were used for training. The training patients were used during the instruction section; the script for this instruction section was edited and approved by the expert panel. The 18 certification patients were divided into 3 groups of 6 patients each, balanced for severity and stroke side.
To assess the reliability of the certification patients and confirm the use of the training video, DVDs were sent to 3 stroke centers known for prior use of the NIHSS: UCSD, University of Texas-Houston, and University of Cincinnati. To assure sufficient power for assessing overall NIHSS score reliability, each center was sent 51 DVDs with scoring sheets. The power analysis is based on the intraclass correlation coefficient for the overall NIHSS score and was performed using the software PASS. A sample size of 18 subjects with 51 observations per subject achieves 80% power to detect an intraclass correlation of at least 0.60 under the alternative hypothesis, assuming a null hypothesis correlation of 0.39 and a significance level of 0.05. The DVDs were distributed to raters, including neurologists, emergency department (ED) or other physicians, and nurses; previously uncertified examiners were encouraged to participate. Whether previously certified or not, each rater was asked to view the training video, and then score one of the 3 certification groups; each DVD envelope was labeled to indicate which patient group to use and groups were assigned at random among the 51 DVDs for each site. To avoid bias, only 2 UCSD staff involved in taping submitted responses.
Reliability was studied for both the individual items of the NIHSS and the overall score. Scores for each of the individual items were tabulated. Agreement was assessed with the unweighted
statistic (
) for the case of multiple raters.8 Jackknife technique9 was used to compute the standard error of
(estimate of
) and 95% confidence intervals for
was computed by the standard formula,
±1.96 standard error (SE) (
). These methods are similar to the analytic methods described in the initial reliability paper1 and were chosen to allow us to compare the results from the 2 studies. Using the same methods, secondary analyses of the individual items assessed the reliability separately for subgroups of patients by area of specialty (nurse, neurologist, other/ED, MD), site (UCSD, Cincinnati, Houston), and prior certification status (no, yes). Subgroup comparisons were made using Fisher Z transformation for comparisons of
measures across subgroups within each item11 and adjusting for multiple comparisons using Bonferroni correction. In addition, the scatterplot of item scores for each subject were used to visually compare and confirm the reliability graphically and the consistency of the item score by subgroups.
Agreement on the overall NIHSS score was assessed with the intraclass correlation coefficients (ICC) obtained using a one-way random-effects regression model (model 1) for clustered data (with ratings nested within patients).10 The model used included a random effect for patients and assumed that the within-patient variance was the same across all patients. The ICC is calculated by expressing the between-patient variance as a percentage of the total variance (between-patient+within-patient). It represents the correlation between the scores from 2 randomly chosen raters within a randomly chosen patient. Values of the ICC close to 1.0 indicate excellent agreement among the measurements within a patient. Standard error and 95% confidence intervals of the ICC were calculated using jackknife methods with the resampling done at the patient level.
To assess the effect of covariates (specialty, certification status, group) on the variability (and hence the ICC), we fit random-effects regression models for clustered data in which the within-patient and between-patient variance was allowed to vary across the subgroups of the covariate of interest (model 2). Comparing models 1 and 2 using the likelihood ratio test allows us to determine if the assumption of constant variance, and hence of constant ICC, among the subgroups of raters was valid. Similar to individual items, the scatterplot of overall score for each subject was used to visualize the variation of the total score by subgroups.
| Results |
|---|
|
|
|---|
coefficients for individual items and an intraclass correlation coefficient for the overall score. Table 1 indicates the range of values obtained on each item over all 18 patients. In confirmation of the intended patient selection design, the table documents the presence of all possible responses on all individual scale items. The mean NIHSS total score was 8.8±6.6 (median, 7; range, 1 to 33). The spread of responses in individual items and total scores looks similar among the subgroups, namely, sites, specialties, and prior NIHSS certification status.
|
Table 2 compares the agreement obtained using the unweighted
from the 3 different studies: the current DVD study (18 patients), the first certification videotape (5 patients), and the second certification videotape with 6 patients.1 The agreements ranged from 0.21 (ataxia) to 0.92 (LOCC) using the DVD and the values were remarkably similar to those obtained previously using the videotapes, although weighted
was used in the previous article. The agreements obtained from the DVD were closer to those obtained from tape 2 than tape one except for 2 items with poor concordance (facial weakness and ataxia). There are 2 reasons for this: (1) the video approach on tape 2 more closely resembled the approach used on the DVD; tape one used a significantly different approach; and (2) tape 2 was generally viewed 6 months after passing tape one, so all viewers of tape 2 were recently certified.
|
Among all 18 certification patients, the agreement was similar across all subgroups. There were no differences among nurses, ED physicians; among the study sites, or between novice raters and previously certified raters (data not shown).
Table 3 lists the intraclass correlation coefficient for the overall total NIHSS score and total NIHSS by subgroup. There was very good agreement in the total NIHSS score across all ratings (overall intraclass correlation coefficient of 0.94; 95% confidence interval, 0.84 to 1). There are no statistically significant differences in mean NIHSS scores by prior NIHSS certification status, site, specialty or group. Although there were slight differences in ICC across covariates, in all cases, the agreement still remained very high. The ICC was slightly lower among nurses compared with the neurologists and other/ED physicians. Similarly, the raters from Cincinnati had slightly lower agreement scores compared with the raters in UCSD and Houston. However, none of these differences are considered to be clinically significant. These scores have been provided to the American Heart Association for scoring their online NIHSS training and certification procedure.
|
| Discussion |
|---|
|
|
|---|
We found no differences when the DVD was used by neurologists, ED physicians, and nurses, suggesting that the NIHSS may be appropriate for use in clinical research trials, as well as in daily communication among healthcare providers, but the study may be underpowered to detect a subtle difference. In previous studies, agreement between stroke nurses and neurologists was generally good to excellent.6 Agreement among ED physicians is generally somewhat less than among stroke neurologists, but in our data, there was no difference. Larger trials of the DVD certification will be needed to discern more subtle differences among subgroups.
The DVD format has some advantages over videotape. The digital images can be loaded onto a web site, and the American Heart Association successfully implemented a web-based training campus using our images. This web site allows raters to view the training and certification patient videos online (http://www.asatrainingcampus.net). The DVD technology is more widely available now than videotapes, so NIHSS certification should be possible for many more years, even if videotapes become obsolete.
This study contains certain limitations, the most important of which is that the validation process was done in selected stroke centers. We chose 3 very experienced centers to obtain a best-case impression of how the DVD patients should be scored. However, these scores may not be generally applicable when novice users view the training DVD and then attempt certification. For example, agreement among the 3 stroke center directors was significantly higher than among all other groups. Therefore, we continue to collect scores to determine if the same scoring sheet generally works well outside of experienced centers. This DVD was designed for a single user to view at home or in an office. Its use in group settings is not validated, although this study is underway.
Another inherent limitation is that video technology is a poor substitute for direct examination. In the absence of widespread proctored certification, however, no other option is available. Video certification is now widely used in many disciplines with reasonable validity and reliability.2 It is likely that web-based video training and certification will become more widespread, because the cost efficiencies are significant.
As a result of the unbalanced group sizes, we could not use weighted
statistics, as has been used in previous trials. Unweighted
scores may underestimate agreement, yet in this study, the unweighted
scores were comparable the weighted scores obtained in previous studies. Therefore, the agreement among the viewers was at least as good, and likely better, than that seen previously with the videotapes.
| Acknowledgments |
|---|
| Footnotes |
|---|
Received April 11, 2005; revision received July 22, 2005; accepted August 3, 2005.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. D. Lyden, A. Shuaib, K. R. Lees, A. Davalos, S. M. Davis, H.-C. Diener, J. C. Grotta, T. J. Ashwood, H.-G. Hardemark, H. H. Svensson, et al. Safety and Tolerability of NXY-059 for Acute Intracerebral Hemorrhage: The CHANT Trial Stroke, August 1, 2007; 38(8): 2262 - 2269. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Quinn, K. R. Lees, H.-G. Hardemark, J. Dawson, and M. R. Walters Initial Experience of a Digital Training Resource for Modified Rankin Scale Assessment in Clinical Trials Stroke, August 1, 2007; 38(8): 2257 - 2261. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. R. Bates, C. J. D. Babb, D. E. Casey, C. U. Cates, G. R. Duckwiler, T. E. Feldman, W. A. Gray, K. Ouriel, E. D. Peterson, K. Rosenfield, et al. ACCF/SCAI/SVMB/SIR/ASITN 2007 Clinical Expert Consensus Document on Carotid Stenting: A Report of the American College of Cardiology Foundation Task Force on Clinical Expert Consensus Documents (ACCF/SCAI/SVMB/SIR/ASITN Clinical Expert Consensus Document Committee on Carotid Stenting) Vascular Medicine, February 1, 2007; 12(1): 35 - 83. [PDF] |
||||
![]() |
American Society of Interventional & Therapeutic N, Society for Cardiovascular Angiography and Interve, Society for Vascular Medicine and Biology, Society of Interventional Radiology, E. R. Bates, J. D. Babb, D. E. Casey Jr, C. U. Cates, G. R. Duckwiler, T. E. Feldman, et al. ACCF/SCAI/SVMB/SIR/ASITN 2007 Clinical Expert Consensus Document on Carotid Stenting: A Report of the American College of Cardiology Foundation Task Force on Clinical Expert Consensus Documents (ACCF/SCAI/SVMB/SIR/ASITN Clinical Expert Consensus Document Committee on Carotid Stenting) J. Am. Coll. Cardiol., January 2, 2007; 49(1): 126 - 170. [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Stroke Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2005 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |