Donate Help Contact The AHA Sign In Home
American Heart Association
Stroke
Search: search_blue_button Advanced Search
Stroke. 2003;34:200-202
Published online before print December 5, 2002, doi: 10.1161/01.STR.0000047102.61863.E3
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
34/1/200    most recent
01.STR.0000047102.61863.E3v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Johnston, K. C.
Right arrow Articles by Haley, E. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Johnston, K. C.
Right arrow Articles by Haley, E. C., Jr
Right arrowPubmed/NCBI databases
Medline Plus Health Information
*Stroke
Related Collections
Right arrow Health policy and outcome research
Right arrow Cerebrovascular disease/stroke
Right arrow Clinical Studies
Right arrow Acute Cerebral Infarction

(Stroke. 2003;34:200.)
© 2003 American Heart Association, Inc.


Short Communication

Predicting Outcome in Ischemic Stroke

External Validation of Predictive Risk Models

Karen C. Johnston, MD, MSc; Alfred F. Connors, Jr, MD; Douglas P. Wagner, PhD E. Clarke Haley, Jr, MD

From the Departments of Neurology (K.C.J., E.C.H.) and Health Evaluation Sciences (K.C.J., A.F.C., D.P.W.), University of Virginia, Charlottesville.

Correspondence to Karen C. Johnston, MD, University of Virginia Health System, Department of Neurology, #800394, Charlottesville, VA 22908. E-mail kj4v{at}virginia.edu

Abstract

Background— Six multivariable models predicting 3-month outcome of acute ischemic stroke have been developed and internally validated previously. The purpose of this study was to externally validate the previous models in an independent data set.

Summary of Report— We predicted outcomes for 299 patients with ischemic stroke who received placebo in the National Institute of Neurological Disorders and Stroke rt-PA trial. The model equations used 6 acute clinical variables and head CT infarct volume at 1 week as independent variables and 3-month National Institutes of Health Stroke Scale, Barthel Index, and Glasgow Outcome Scale as dependent variables. Previously developed model equations were used to forecast excellent and devastating outcome for subjects in the placebo tissue plasminogen activator data set. Area under the receiver operator characteristic curve was used to measure discrimination, and calibration charts were used to measure calibration. The validation data set patients were more severely ill (National Institutes of Health Stroke Scale and infarct volume) than the model development subjects. Area under the receiver operator characteristic curves demonstrated remarkably little degradation in the validation data set and ranged from 0.75 to 0.89. Calibration curves showed fair to good calibration.

Conclusions— Our models have demonstrated excellent discrimination and acceptable calibration in an external data set. Development and validation of improved models using variables that are all available acutely are necessary.


Key Words: cerebral ischemia • models, statistical • prognosis • stroke outcome

Amultivariable model that could predict outcome after stroke would be useful in clinical trials to assess the balance of treatment groups and to predict expected outcomes of patients who are lost to follow-up. We developed and internally validated a series of predictive risk models for 229 ischemic stroke patients from the Randomized Trial of Tirilazad Mesylate in Patients with Acute Stroke (RANTTAS)1. However, the models have not been validated in an external data set. The purpose of this study was to assess the validity of those predictive models in an independent data set.

Methods

Study Population
Two hundred ninety-nine patients from the placebo group of the National Institute of Neurological Disorders and Stroke (NINDS) rt-PA trial were used for the validation analysis.2 The NINDS rt-PA trial population has been described in detail previously.2 Briefly, this was an ischemic stroke population treated with intravenous tissue plasminogen activator (tPA) or placebo within 3 hours from symptom onset. Only the placebo group was used for this analysis because intravenous tPA is known to improve clinical outcome, and the predictive model being validated was designed to predict outcome without an intervention.

Three hundred twelve patients were treated with placebo in the NINDS rt-PA trial. Thirteen patients were excluded from our analysis for missing variables: 6 were missing 7- to 10-day CT scan infarct volume, 5 were missing stroke history information, and 2 were missing diabetes history information. The remaining 299 were used for this analysis.

Independent Variables
Baseline clinical information including age, National Institutes of Health Stroke Scale (NIHSS) score,3 history of previous stroke, history of diabetes mellitus, and history of prestroke disability were collected acutely. Infarct volume and stroke subtype were collected at 7 to 10 days after stroke onset.

Outcome Variables
The NIHSS, Barthel Index (BI),4 and Glasgow Outcome Scale (GOS)5 were used as outcome measures 3 months after stroke symptom onset. Each was dichotomized into excellent outcome (NIHSS <=1, BI >=95, GOS =1) or devastating outcome (NIHSS >=20 or death, BI <60 or death, GOS >2) as previously defined.1

Statistical Analysis
The previously defined models were forecast to the study population using the previously defined weights. Model discrimination was assessed using area under the receiver operating characteristic (ROC) curve, which was computed by a nonparametric method.6 An area under the ROC curve of 0.5 indicates no ability to discriminate and an area of 1.0 indicates perfect discrimination. We prespecified an acceptable area under the ROC as >=0.8. Calibration was assessed using calibration curves.7 The perfect 45° line demonstrates ideal calibration. The closer the model calibration is to the ideal line, the better the calibration. Hosmer-Lemeshow tests were performed to assess whether the models differed significantly from perfect calibration.8

Results

The clinical and imaging characteristics of the validation population are compared with those of the original population from the RANTTAS trial9 in Table 1. There were more blacks and fewer whites in the validation population. The validation population had substantially more severe strokes, as demonstrated by higher NIHSS scores and greater infarct volumes at 1 week, as well as worse outcomes when compared with the original data set.


View this table:
[in this window]
[in a new window]
 
TABLE 1. Clinical and Imaging Characteristics of the 2 Populations

The models’ ability to discriminate outcome, in both the original and validation data sets, is demonstrated in Table 2. In the original data set, 5 of the 6 models had excellent discrimination above the 0.8 level. The validation models demonstrate very little decline in area under the ROC curve in all 6 data sets and demonstrate excellent discrimination in 5 of the 6 models.


View this table:
[in this window]
[in a new window]
 
TABLE 2. Model Discrimination: Area Under the Receiver Operating Characteristic Curve

Model calibration is demonstrated in the Figure. Five of the 6 models have calibration curves that are very similar to the line of identity. The excellent outcome as measured by the NIHSS model calibrated less well and predicted a greater probability of excellent outcome than was observed in the validation data set for most of the range of predicted probabilities. For example, in the calibration curve "Excellent NIH Outcome," at a model prediction of 70% probability of excellent outcome (fourth point on the curve), only about 40% of patients were actually observed to have excellent outcome by the NIHSS. Hosmer-Lemeshow tests of the calibration accuracy indicate that 5 of the 6 models were detectably (P<0.05) less than perfect, with all of the models tending to predict better outcomes than were observed.



View larger version (63K):
[in this window]
[in a new window]
 
Calibration curves. The 6 calibration curves for the 6 models are shown. The solid 45° line reflects perfect calibration. Each model’s bias-corrected calibration is demonstrated by the observed line. In each case, the bias-corrected observed line deviates from the ideal line to some degree, but the observed line closely resembles the ideal line in 5 of the 6 models. NIH indicates National Institutes of Health Stroke Scale; BI, Barthel Index; GOS, Glasgow Outcome Scale.

Discussion

These 6 models have now been validated in an external data set. Five of the models have excellent discrimination (ROC area >=0.8). The model for devastating outcome defined by the NIHSS (ROC area=0.75) likely discriminates less well due to the small number of devastating outcomes in the model development data set. Five of the 6 models appear to be reasonably well calibrated, as shown in the calibration curves. The Hosmer-Lemeshow test demonstrates that these are not perfectly calibrated but the calibration appears to be adequate.

The validation population clearly had more severe strokes as demonstrated by both the NIHSS and infarct volume. Although the model discrimination was affected little by this, the poorer calibrations in the 6 models compared with the original models may be related to this difference. The fact that the models had such good discrimination and adequate calibration in 5 of the 6 models, even in such a different population, supports the generalizability of these models.

A predictive tool that would allow an accurate prediction of individual outcome at 3 months could be very useful in clinical research. The great heterogeneity among stroke patients may contribute to difficulty identifying treatment effects in clinical trials.10,11 Heterogeneity in a randomized clinical trial is mathematically expected to result in an underestimate of the treatment effect.11 A predictive model that could adjust for heterogeneity would allow a less biased estimate of the treatment effect and demonstrate a larger treatment effect in the same sized sample.

The small number of subjects and least frequent outcomes in both data sets limits how well the models can predict outcome. These models are also limited by the fact that only 5 of the 7 variables used in the prediction were collected acutely. Infarct volume and stroke subtype were collected at 1 week. Although a prediction of 3-month outcome is still valuable to the clinician at 1 week, this limits the use of these models in acute stroke clinical research. The dichotomized outcomes, though identifying the extreme outcomes (excellent outcome suggesting full or nearly full recovery; devastating outcome suggesting nursing home level disability or death), are not designed to predict the clinically relevant recovery levels in between. These extreme outcomes using the NIHSS, BI, and GOS are most useful in the clinical research realm in that they allow more reliable comparisons of standardized outcomes. These models, therefore, function as the proof of concept that predictive models can be developed and then internally and externally validated for the prediction of 3-month outcome. Future models must now be developed using variables that are all available in the acute setting.

Acknowledgments

Dr Johnston is supported by the National Institutes of Health, National Institute of Neurologic Disorders and Stroke (grant No. K23NS02168-01).

The NINDS rt-PA Stroke Trial was funded by the National Institutes of Health, National Institute of Neurologic Disorders and Stroke through contracts to the participating sites. The RANTTAS study was supported, in part, by the National Institutes of Health, National Institute of Neurological Disorders and Stroke (grant No. R01-NS31554), and Pharmacia and Upjohn Company (Kalamazoo, Mich).

The authors gratefully acknowledge the contribution of the NINDS rt-PA Stroke Trial investigators and the RANTTAS investigators, without whose efforts this work would not have been possible.

Footnotes

This work was presented, in part, at the 27th International Stroke Conference of the American Heart Association, San Antonio, Texas, February 7, 2002.

Received May 28, 2002; revision received July 24, 2002; accepted August 2, 2002.

References

1. Johnston KC, Connors AF, Wagner DP, Knaus WA, Wang X, Haley EC Jr. A predictive risk model for outcomes of ischemic stroke. Stroke. 2000; 31: 448–455.[Abstract/Free Full Text]

2. The National Institute of Neurological Disorders and Stroke rt-PA Stroke Study Group. Tissue plasminogen activator for acute ischemic stroke. N Engl J Med. 1995; 333: 1581–1587.[Abstract/Free Full Text]

3. Lyden P, Brott T, Tilley B, Welch KM, Mascha EJ, Levine S, Haley EC, Grotta J, Marler J. Improved reliability of the NIH Stroke Scale using video training: NINDS TPA Stroke Study Group. Stroke. 1994; 25: 2220–2226.[Abstract]

4. Mahoney FT, Barthel DW. Functional evaluation: Barthel Index. Md Med J. 1965; 14: 61–65.

5. Jennett B, Bond M. Assessment of outcome after severe brain damage. Lancet. 1975; 1: 480–484.[Medline] [Order article via Infotrieve]

6. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982; 143: 29–36.[Abstract/Free Full Text]

7. Harrell FE, Lee KL, Mark DB. Tutorial in biostatistics: multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996; 15: 367–387.

8. Lemeshow S, Teres D, Klar J, Avrunin JS, Gehlbach SH, Rapoport J. Mortality probability models (MPM II) based on an international cohort of intensive care unit patients. JAMA. 1993; 270: 2478–2486.[Abstract/Free Full Text]

9. The RANTTAS Investigators. A randomized trial of tirilazad mesylate in patients with acute stroke (RANTTAS). Stroke. 1996; 27: 1453–1458.[Abstract/Free Full Text]

10. DeGraba TJ, Hallenbeck JM, Pettigrew KD, Dutka AJ, Kelly BJ. Progression in acute stroke value of the Initial NIH Stroke Scale Score on patient stratification in future trials. Stroke. 1999; 30: 1208–1212.[Abstract/Free Full Text]

11. Gail MH, Wieand S, Piantadosi S. Biased estimates of treatment effect in randomized experiments with non-linear regressions and omitted covariates. Biometrika. 1984; 71: 431–444.[Abstract/Free Full Text]




This article has been cited by other articles:


Home page
Am. J. Neuroradiol.Home page
L.E. Cipriano, M.L. Steinberg, G.S. Gazelle, and R.G. Gonzalez
Comparing and Predicting the Costs and Outcomes of Patients with Major and Minor Stroke Using the Boston Acute Stroke Imaging Scale Neuroimaging Classification System
AJNR Am. J. Neuroradiol., April 1, 2009; 30(4): 703 - 709.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
B. Kissela, C. J. Lindsell, D. Kleindorfer, K. Alwell, C. J. Moomaw, D. Woo, M. L. Flaherty, E. Air, J. Broderick, and J. Tsevat
Clinical Prediction of Functional Outcome After Ischemic Stroke: The Surprising Importance of Periventricular White Matter Disease and Race
Stroke, February 1, 2009; 40(2): 530 - 536.
[Abstract] [Full Text] [PDF]


Home page
J. Neurol. Neurosurg. PsychiatryHome page
M Uyttenboogaart, R E Stewart, P C Vroomen, G-J Luijckx, and J De Keyser
Utility of the stroke-thrombolytic predictive instrument
J. Neurol. Neurosurg. Psychiatry, September 1, 2008; 79(9): 1079 - 1081.
[Abstract] [Full Text] [PDF]


Home page
J. Neurol. Neurosurg. PsychiatryHome page
H F Lingsma, D W J Dippel, S E Hoeks, E W Steyerberg, C L Franke, R J van Oostenbrugge, G de Jong, M L Simoons, W J M Scholte op Reimer, and The Netherlands Stroke Survey investigators
Variation between hospitals in patient outcome after stroke is only partly explained by differences in quality of care: results from the Netherlands Stroke Survey
J. Neurol. Neurosurg. Psychiatry, August 1, 2008; 79(8): 888 - 894.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
I. R. Konig, A. Ziegler, E. Bluhmki, W. Hacke, P. M.W. Bath, R. L. Sacco, H. C. Diener, C. Weimar, and on behalf of the Virtual International Stroke Tria
Predicting Long-Term Outcome After Acute Ischemic Stroke: A Simple Index Works in Patients From Controlled Clinical Trials
Stroke, June 1, 2008; 39(6): 1821 - 1826.
[Abstract] [Full Text] [PDF]


Home page
J Am Coll CardiolHome page
P. Khatri, R. A. Taylor, V. Palumbo, V. Rajajee, J. M. Katz, J. A. Chalela, A. Geers, J. Haymore, D. M. Kolansky, S. E. Kasner, et al.
The safety and efficacy of thrombolysis for strokes after cardiac catheterization.
J. Am. Coll. Cardiol., March 4, 2008; 51(9): 906 - 911.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
K. C. Johnston, D. P. Wagner, X.-Q. Wang, G. C. Newman, V. Thijs, S. Sen, S. Warach, and for the GAIN, Citicoline, and ASAP Investigators
Validation of an Acute Ischemic Stroke Model: Does Diffusion-Weighted Imaging Lesion Volume Offer a Clinically Significant Improvement in Prediction of Outcome? * Definitions and Explanations
Stroke, June 1, 2007; 38(6): 1820 - 1825.
[Abstract] [Full Text] [PDF]


Home page
NeurologyHome page
M.S.V. Elkind, S. Prabhakaran, J. Pittman, W. Koroshetz, M. Jacoby, K. C. Johnston, and for the GAIN Americas Investigators
Sex as a predictor of outcomes in patients treated with thrombolysis for acute stroke
Neurology, March 13, 2007; 68(11): 842 - 848.
[Abstract] [Full Text] [PDF]


Home page
NeurologyHome page
German Stroke Study Collaboration
Predicting outcome after acute ischemic stroke: An external validation of prognostic models
Neurology, February 24, 2004; 62(4): 581 - 585.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
C. Weimar, I.R. Konig, K. Kraywinkel, A. Ziegler, and H.C. Diener
Age and National Institutes of Health Stroke Scale Score Within 6 Hours After Onset Are Accurate Predictors of Outcome After Cerebral Ischemia: Development and External Validation of Prognostic Models
Stroke, January 1, 2004; 35(1): 158 - 162.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
34/1/200    most recent
01.STR.0000047102.61863.E3v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Johnston, K. C.
Right arrow Articles by Haley, E. C.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Johnston, K. C.
Right arrow Articles by Haley, E. C., Jr
Right arrowPubmed/NCBI databases
Medline Plus Health Information
*Stroke
Related Collections
Right arrow Health policy and outcome research
Right arrow Cerebrovascular disease/stroke
Right arrow Clinical Studies
Right arrow Acute Cerebral Infarction