(Stroke. 2001;32:933.)
© 2001 American Heart Association, Inc.
Original Contributions |
From the Departments of Radiology (O.W., L.Ø., W.A.C., R.G.G., B.R.R., R.M.W., A.G.S.) and Neurology (W.J.K., F.S.B., G.R., L.H.S.), Massachusetts General Hospital, Boston, Mass, and Massachusetts Institute of Technology (O.W.), Cambridge, Mass.
Correspondence to Ona Wu, Mailcode CNY149-2301, MGH-NMR Center, Massachusetts General Hospital, Boston MA 02129. E-mail ona{at}nmr.mgh.harvard.edu
| Abstract |
|---|
|
|
|---|
MethodsDiffusion-weighted MR images (DWI) and perfusion-weighted MR images (PWI) from acute stroke patients scanned within 12 hours of symptom onset were retrospectively studied and used to develop thresholding and generalized linear model (GLM) algorithms predicting tissue outcome as determined by follow-up MRI. The performances of the algorithms were evaluated for each patient by using receiver operating characteristic curves.
ResultsAt their optimal operating points, thresholding algorithms combining DWI and PWI provided 66% sensitivity and 83% specificity, and GLM algorithms combining DWI and PWI predicted with 66% sensitivity and 84% specificity voxels that proceeded to infarct. Thresholding algorithms that combined DWI and PWI provided significant improvement to algorithms that utilized DWI alone (P=0.02) but no significant improvement over algorithms utilizing PWI alone (P=0.21). GLM algorithms that combined DWI and PWI showed significant improvement over algorithms that used only DWI (P=0.02) or PWI (P=0.04). The performances of thresholding and GLM algorithms were comparable (P>0.2).
ConclusionsAlgorithms that combine acute DWI and PWI can assess the risk of infarction with higher specificity and sensitivity than algorithms that use DWI or PWI individually. Methods for quantitatively assessing the risk of infarction on a voxel-by-voxel basis show promise as techniques for investigating the natural spatial evolution of ischemic damage in humans.
Key Words: cerebral ischemia magnetic resonance imaging, diffusion-weighted magnetic resonance imaging, perfusion-weighted stroke, acute
| Introduction |
|---|
|
|
|---|
Attempts have been made to combine DWI and PWI by comparing lesion volumes identified by the 2 techniques. "Diffusion-perfusion mismatches," in which the lesion volumes identified by one modality are larger than those by the other, have been reported by several groups.11 12 13 14 16 Many groups have reported larger lesion enlargement of the acute DWI lesion volume in cases where the acute PWI volume is larger1 13 14 15 16 17 18 19 20 21 than the DWI lesion. In cases where the acute DWI lesion was larger than the PWI lesion, total lesion growth was reduced.13 14 15 22 Based on these observations, many have hypothesized that these DWI-PWI mismatches may allow identification of salvageable tissue in individual patients.
These reported "mismatches" are of volumes of tissue rather than a voxel-by-voxel comparison. Heterogeneity in both ADC12 22 23 24 25 and flow values14 16 19 22 within acute ischemic tissue in humans have been well documented but have not been captured in these initial volumetric approaches. Therefore, volumetric approaches comparing gross differences in DWI and PWI lesion volumes may oversimplify the complex task of assessment of tissue viability in different regions within ischemic tissue. A voxel-by-voxel analysis, such as that developed by Welch and colleagues,23 24 25 26 may provide a more sensitive approach for identifying salvageable tissue. Their studies demonstrated that a combination of T2 and ADC information provided better prediction of cellular necrosis than algorithms that used them separately and that a voxel-by-voxel analysis may better demonstrate the underlying heterogeneity in the lesion.
A natural extension of these signature tissue algorithms is the inclusion of PWI. However, assessing the signatures significance becomes complicated, because each additional parameter leads to an exponential increase in the number of "signatures." Furthermore, assuming only discrete states ignores the variances intrinsic to the data. A more complete algorithm may be one in which inputs are treated as random variables and the output is the probability of infarction for each given tissue voxel. In this study, we investigated a strategy that utilized statistical generalized linear model (GLM) algorithms, in which the output is not a map of stages of infarction but risk of future infarction.
Our hypotheses were therefore 2-fold. First, we sought to determine whether algorithms that combine diffusion and perfusion information provide more sensitive and specific predictors of tissue outcome than algorithms using only subsets of this information. Second, we examined whether a probabilistic algorithm provides an improved indicator of which tissue is at risk of infarction over thresholding-based approaches. We tested both hypotheses by retrospectively applying the different techniques to diffusion and perfusion indices acquired from acute stroke patients and comparing the algorithms voxel-by-voxel performances in predicting which tissue will proceed to infarction.
| Subjects and Methods |
|---|
|
|
|---|
|
Image Acquisition
Imaging was performed on a 1.5-T General Electric
Signa MR instrument with 5.4.2 software (General Electric Medical
Systems) and retrofitted with echo-planar imaging (EPI)
capabilities via an Advanced NMR Systems hardware upgrade that included
the "catch and hold" modification.
Table 1
summarizes the MR acquisition
parameters for the patients. Multislice axial DWI were
acquired by either sampling 3 orthogonal directions at b values of 1010
s/mm2
(n=3)1 or sampling the full
diffusion tensor at b values of 1221 s/mm2
(n=11)28 with single-shot
pulsed field gradient spin-echo EPI using imaging
parameters described in previously published
reports.1 28 The
isotropic DWI was formed from the geometric mean of the high b value
single-shot images. The ADC image was calculated from the slope of the
linear regression fit of the log of the high and low b-value images
versus their b values. PWI were acquired from dynamic susceptibility
contrast images by using either spin-echo (n=10) or gradient-echo (n=4)
EPI pulse sequences. Images were acquired during the first pass of a
bolus of 0.1 mmol/kg (gradient-echo) or 0.2 mmol/kg
(spin-echo) of body weight of gadopentetate dimeglumine contrast agent
(Magnevist; Berlex Laboratories) injected with an MRI-compatible power
injector (Medrad). For both the diffusion and perfusion studies, the
FOV was 400x200 mm2 with an
acquisition matrix of 256x128 acquired with a slice thickness of
6 mm and a 1-mm interslice gap. Relative regional cerebral blood
volume (CBV), relative cerebral blood flow (CBF) and mean transit time
maps were calculated using techniques described in previously published
reports.29 30
Each patient was also imaged with conventional sequences following the
acute stroke protocol previously described in published
reports.1 12 19
Coregistration
The volumetric diffusion, perfusion, and follow-up
data were spatially coregistered with an automated image registration
software package, AIR 3.08 (University of California at Los
Angeles).31 32
The initial low b value T2-weighted EPI, ADC, DWI, and follow-up
T2-weighted FSE images were coregistered to the same dimensions
(128x128x11 or 128x128x10 voxels), orientation, and coordinates as
the perfusion images using an affine, 12-parameter
transformation model and trilinear interpolation. Voxels from
"normal"-appearing gray matter in the unaffected, contralateral
hemisphere from the coregistered initial T2 images were outlined before
generation of the predictive maps. For all 6 acute-stage images, voxel
values were normalized by dividing by the mean of these outlined
regions to produce "relative" values (rT2, rADC, rDWI, rCBF, rCBV,
rMTT).
Development of Generalized Linear Model
Algorithms
In our GLM algorithms, tissue outcome was modeled as
a binary variable (infarcted/noninfarcted) P, where the value 1
represented infarcted tissue and value 0 noninfarcted
tissue. In a GLM, for a binary variable, the probability of tissue
infarcting can be represented by the
logistic function
![]() | (1) |
(x),
the predictor, is a linear function of its
input parameters, x,
![]() | (2) |
the bias or intercept term for the GLM. The
term provides the
base value for P if all of the input parameters
x are zero. The ß coefficients can be
interpreted as the multiplicative effects on P due to changes in the
input
parameters.33 A supervised approach was used to calculate the coefficients in the GLM algorithms. Using commercial image processing software (Alice, Hayden Image Processing Solutions), training regions were selected by outlining brain tissue volumes that were clearly infarcted or noninfarcted in the ipsilateral hemisphere in the coregistered follow-up axial T2 FSE images by a neuroradiologist blinded to the predictive map results. Care was taken to avoid including regions demonstrating chronic changes on T2, such as old stroke lesions or periventricular white matter abnormalities. Selection of "normal" voxels was also limited to the ipsilateral hemisphere in slices that showed evidence of infarction. Combinations of initial rT2 EPI, rADC, rDWI, rCBF, rCBV, and rMTT values from these outlined training regions were used as the input vector x in the training stage. Because GLM algorithms assume independent observations, only every other voxel in the selected ROIs was sampled for the training data in order to reduce correlation. The coefficients for the GLMs, ß, were calculated using an iterative reweighted least-squares algorithm in S-PLUS 3.4 (StatSci). Selection of covariates was on the basis of the Akaike Information Criterion (AIC), whereby terms were included if their addition resulted in reductions in prediction error values that were a function of both training error and complexity.34 The AIC therefore provided an objective means to evaluate the trade-off between minimizing residual training error and complexity.34 The algorithm with the minimum AIC is therefore one with the minimum number of parameters and minimum training error. Automatic parameter selection was not utilized because all the input parameters were not independent with MTT=CBV/CBF and DWI=T2 exp(-b ADC). Therefore, in selecting covariates, independent parameters rT2, rADC, rCBF, and rCBV were considered first for inclusion, followed by the higher-order covariates of rDWI and rMTT. For purposes of comparing the 2 techniques, combinations of DWI and PWI identical to those created for the thresholding algorithms were generated for the GLM algorithms.
To validate the performance of the GLMs, a
jacknifing approach was followed wherein the coefficients for each
patients algorithms were calculated using the other patients in the
study as training data.35
Jacknifing was used to avoid bias that would otherwise occur if the
algorithms performance were evaluated on the same data that
was used to train the algorithm. Using the calculated coefficients, the
risk of a voxel of tissue going on to infarction was calculated with
Equations 1
and 2
. The 95% confidence intervals for the computed risks
were computed from the parameters obtained from S-PLUS
3.4.
To evaluate the jacknifing results for the GLM algorithms,
we compared the computed coefficients for each of the training data
sets to determine if they were significantly different
(P
0.05) from the coefficients
obtained using a data set containing data from all patients. The
average of the coefficients of the GLM algorithms obtained from the 14
training data subsets was also compared with the coefficients of the
aggregate GLM algorithm. Two-tailed
Z tests were used for the
statistical comparisons.
Thresholding Algorithms
For the thresholding algorithms, a strategy similar
to that reported by Welch et
al23 was followed. Tissue
was classified as abnormal if the initial diffusion or perfusion values
were greater than a specified number of standard deviations (SDs) from
the mean value measured in the contralateral noninfarcted gray matter
regions. We generated tissue signature maps by using images calculated
from the diffusion study (T2+ADC+DWI), images calculated from the
perfusion study (CBF+CBV+MTT), and combinations of images from both
studies. For the combined study, we generated signature maps using
combinations of T2 and ADC with 1 perfusion parameter (CBF,
CBV or MTT) and all 6 parameters (T2+ADC+DWI+CBF+CBV+MTT).
The combinations of the parameters used for the
thresholding algorithms were selected to be identical to the
combination of parameters used in the GLM algorithms for
the purpose of comparing the 2 techniques. For creating signature maps,
a threshold of 2 SDs from the mean of the contralateral values was
used. Each of the resulting signatures was taken to represent a
different "state" of infarction. Voxels not meeting any of the
threshold criteria were given a "normal" signature. For the
thresholding algorithms, which are based on an unsupervised approach
not requiring training data from other subjects, the nonnormalized data
sets were used.
Evaluation of Algorithm
Performance
To evaluate the accuracy of the thresholding and GLM
algorithms, the same infarcted and noninfarcted regions used in the
training of the GLM algorithms were used. The performance of
each of the algorithms was evaluated on its ability to accurately
discriminate the infarcted from noninfarcted regions in the ipsilateral
hemisphere. By comparing the predicted maps with lesions demonstrated
on follow-up conventional MR images, the number of voxels predicted to
infarct that actually did infarct (true positives [TP]), and the
number that did not infarct (false-positives [FP]) were tabulated. In
addition, we tracked the number of voxels predicted not to infarct that
remained noninfarcted (true negatives [TN]) as well as those that
became infarcted (false-negatives [FN]). From these counts, the
algorithms sensitivity or true positive ratio, TPR=TP/(TP+FN), and
specificity or true negative ratio, TNR=TN/(TN+FP), were calculated.
Receiver operating characteristic (ROC) curves were then generated for
each algorithm by plotting TPR (sensitivity) against the false-positive
ratio (FPR) (1-specificity). For thresholding algorithms, the number of
SDs was varied from -5 to 5 in 0.1 increments for all
parameters except MTT. For MTT, cutoff values ranged from
-10 to 10 SDs in 0.2 increments due to its larger range of values.
For the GLM algorithms, the probability cutoffs for classifying tissue
to be infarcted were varied from 0 to 1 in 0.01 increments.
The area under the ROC curves (AUC) has been shown to
represent the probability that an image will be correctly
ranked normal or abnormal and therefore used to assess the
performance of diagnostic
systems.36 We calculated the
AUC for the ROC curves for each patient using numerical integration.
The AUCs for the different algorithms were compared by paired 1-tailed
Wilcoxon signed-rank tests. Values of
P
0.05 were considered
significant in all statistical analysis. The
performances of the algorithms were also compared at their
optimal operating points (OOPs) on the ROC
curves.37 As defined by
Halpern,37 the OOP is the
point at which the ROC curve is tangent to the highest
line of slope
![]() | (3) |
| Results |
|---|
|
|
|---|
0.05) from the aggregate
GLM algorithm
(Table 2
|
ROC Analysis
Figure 1
shows the ROC curves of the pooled results from the
thresholding and GLM methods across all 14 patients for the
multivariate and univariate GLM algorithms.
For both approaches, the multivariate GLM algorithms
performed better than the univariate GLM algorithms, as
measured by higher ROC curves. Furthermore, GLM algorithms that
combined diffusion and perfusion data performed better than the
rT2+rADC+rDWI or rCBF+rCBV+rMTT GLM algorithms, as shown by the higher
ROC curves. (In the interest of clarity, only the ROC curve of the
single diffusion-perfusion combinations with the highest curve is
shown. This was the model combining T2, ADC, and MTT.) The full
6-parameter "combined algorithm" has a higher ROC curve
than GLM algorithms using only rT2+rADC+rMTT or rT2+rADC+ rCBF+rCBV
parameters, consistent with the AIC results. For
the diffusion- and perfusion-based GLM algorithms, the
multivariate algorithms provided the best
performance in terms of ROC curves, and therefore the
univariate diffusion and perfusion studies are not
discussed in further detail in this study. Of the combined algorithms,
the algorithm using all 6 parameters provided the best
performance, and therefore the other combined algorithms are
also not discussed in the remainder of this study.
|
Algorithms that use only perfusion imaging appear to have
greater sensitivity in regions of low specificity (FPR>0.3). For
algorithms that use only diffusion imaging, the reverse appear true;
that is, the diffusion-based algorithm had greater sensitivity than
perfusion-based algorithms in ranges of high specificity (FPR<0.3).
When we combine perfusion and diffusion information concurrently, we
obtain an overall increase in sensitivity.
Table 3
shows the specificities associated with the OOPs
for both thresholding and GLM algorithms, along with their
corresponding sensitivities. The OOPs are comparable for both
thresholding and GLM algorithms. For both algorithms, from the ROC
curves shown in
Figure 1
, the combined algorithms have the greatest
sensitivities at each of the specificities listed in
Table 3
.
|
From
Figure 1
, we see that both thresholding and GLM methods
produce similar ROC curves when results were pooled across the 14
subjects. ROC curves were also generated on an individual patient basis
and the area under the curves (AUC) calculated. The differences between
the multivariate algorithms AUCs were calculated for
the thresholding and GLM algorithms. For the thresholding algorithm,
the combined algorithm had significantly higher AUCs than the
diffusion-based algorithm (T2+ADC+DWI)
(P=0.02), indicating better
overall performance of the combined threshold algorithm over
the initially proposed diffusion-only thresholding
algorithm.23 24 25 26
The difference between the combined algorithm and CBF+CBV+MTT threshold
algorithms were not significant
(P=0.21). No significant
difference was found between the performances of threshold
algorithms based purely on diffusion (T2+ADC+DWI) and those based
purely on perfusion (CBF+CBV+MTT)
(P=0.52). For the GLM
algorithms, the combined algorithm showed a significant improvement
over diffusion-based algorithms (rT2+rADC+rDWI)
(P=0.02) and perfusion-based
algorithms (rCBF+rCBV+rMTT)
(P=0.04). There was no
significant difference between multivariate diffusion
and multivariate perfusion GLM algorithms
(P=0.50). The lack of
difference between the diffusion and perfusion algorithms for both GLM
and thresholding algorithms is most likely because diffusion algorithms
have lower sensitivity at low specificity than perfusion algorithms but
higher sensitivity at high specificity, which may in turn translate
into equivalent AUCs.
Differences between the AUCs for the GLM algorithms and their corresponding threshold algorithm counterparts were calculated and compared. The GLM and thresholding algorithms that used diffusion data (P=0.33), perfusion data (P=0.64), or combined algorithms (P=0.27) performed comparably.
Example Cases
Figure 2
shows the acute imaging studies and thresholding
maps for patient 14. The tissue signature maps are the results of using
only hyperacute diffusion data (T2+ADC+DWI), hyperacute perfusion data
(CBF+CBV+MTT), and combining all 6 input parameters
(combined algorithm). The diffusion-based algorithm, though identifying
a smaller region at risk of infarction in the ipsilateral hemisphere
than either the perfusion-based algorithm or combined algorithm, also
demonstrates an abnormal signature in the contralateral hemisphere.
Abnormal tissue signatures in the perfusion-based algorithm are
predominantly limited to the ipsilateral hemisphere, although they
encompass an area much greater than the follow-up infarct volume.
Because misclassifications are cumulative in the thresholding
algorithms, the results in the combined diffusion and perfusion
algorithms have similarly high sensitivity but poor specificity as that
shown for the perfusion-based algorithms. However, a greater number of
tissue states exist in the combined algorithm, which results in greater
heterogeneity than those based on algorithms
incorporating only diffusion or perfusion information.
|
Figure 3
shows the results of the GLM algorithms using the
same imaging data as shown in
Figure 2
. We again observe that algorithms using diffusion
alone (rT2+rADC+rDWI) underestimate the follow-up infarct volume. Maps
that use only perfusion information (rCBF+rCBV+rMTT) overestimate the
follow-up infarct volume. The combined algorithm, however, predicts an
area at high risk of infarction, as evidenced by the red-yellow region,
that correlates well with the follow-up lesion areas, as demonstrated
on the 2-month follow-up T2 FSE image shown in
Figure 2
. In addition, for all algorithms, the regions
predicted to be at high risk of infarction are predominantly localized
to the ipsilateral hemisphere compared with the results of the
thresholding algorithm.
|
The results of applying the statistical algorithms to a
patient with early reperfusion, as defined by follow-up perfusion
studies, are shown in
Figure 4
. The acute MRI studies for patient 11 appear
normal, with the exception of decreased CBF and increased MTT in the
left temporoparietal lobe. The imaging study 8 hours later shows a
slight diffusion abnormality in the area shown abnormal in the initial
perfusion study. However, the remaining perfusion defects appear to
have resolved as demonstrated by the CBF and MTT maps, suggesting the
occurrence of spontaneous reperfusion. Both the thresholding- and the
GLM-based risk maps overpredict the follow-up infarct volume in the
2-month follow-up T2 FSE. The resolution of much of the abnormalities
in the follow-up imaging study was consistent with the
patients improved clinical outcome.
|
| Discussion |
|---|
|
|
|---|
Combined Diffusion and Perfusion
Algorithms
By extending tissue signature algorithms based on
thresholding to include perfusion information, our results show that
such inclusion improves the predictive power of signature maps. While
only a trend toward improved performance was demonstrated in
the case of the combined thresholding algorithm over the
perfusion-based algorithm
(P=0.21), we believe that
further optimization of the threshold approach and refinement in
interpretation of results would improve its utility. Although
statistical significance was not found in the AUCs for the
multivariate diffusion-based algorithm compared with
the multivariate perfusion-based algorithm using either
thresholding or GLM, the ROC curves demonstrate the difference between
the diffusion-based and perfusion-based algorithms in their tradeoffs
between sensitivity and specificity, a characteristic not evident in
simple AUC indices.38 We
observe that diffusion-based algorithms have higher sensitivity in
regions of high specificity or low FPR, whereas perfusion-based
algorithms have higher sensitivity in regions of low specificity.
Combined algorithms appear to provide the best trade-off in terms of
maintaining high sensitivity at high specificity.
The high specificity of diffusion-based algorithms is not unexpected because of the association between high risk of infarction and changes in diffusion parameters, which are believed to detect tissue with altered cellular water homeostasis caused by severe energy depletion and breakdown in Na+-K+ pump activity.22 26 The level of sensitivity of diffusion-based algorithms is time dependent: less sensitive at very early imaging times before DWI reaches its maximum, and more sensitive hours later when DWI lesion size approaches the "final" infarct size. However, a simple reduction of ADC may not be a marker for irreversibly injured tissue, and indeed, a set "threshold" for irreversible ADC reductions may be difficult to determine, because the threshold varies as a function of depth and duration of ischemia.22 The perfusion parameters, on the other hand, presumably reflect the state of nutritive flow to the voxel of tissue. The lack of specificity but high sensitivity in perfusion-based algorithms may be attributed to the presence of metabolically viable hypoperfused tissue at flow levels below the threshold for electrical neuronal failure.39 The likelihood for tissue to infarct is a combined function of the degree and the duration of blood flow reduction, which have been shown to vary spatially and temporally.39 40 41 42 Therefore, perfusion-based algorithms may also have a similar level of time-varying sensitivity and specificity that varies on a voxel-by-voxel basis.
GLM Algorithms
Of the 2 techniques examined in this study for
combining diffusion and perfusion information, the GLM method may
provide results that are straightforward to interpret as additional
parameters are included in the algorithm. In initial
thresholding algorithms, a key feature was the ability to assign each
tissue signature based on imaging to a possible
physiological state of the tissue. However, with
the addition of multiple parameters, each additional term
exponentially complicates output interpretation, since the signature
maps create additional states whose biological significance is not
necessarily clear. Nevertheless, thresholding algorithms may provide
unique insight regarding heterogeneity of the
ischemic lesion at any single point in time. Further
investigations correlating evolution of these signatures with histology
may provide insight into the pathophysiologic significance of the
different signatures. GLM algorithms, on the other hand, provide the
risk of the tissue infarcting as a continuous variable that ranges
between 0 and 1, and therefore, as stroke evolves, the risk of
individual voxels of tissue can be monitored quantitatively by a single
variable. The recruitment of voxels in the presumed
"ischemic penumbra" might therefore be quantified as the
change in risk in the peripheral areas from low probability
to high probability over time.
Our algorithms have been trained on data from patients
who did not receive thrombolytic or neuroprotective
therapy. The 2 patients with spontaneous reperfusion were specifically
not excluded from the training set since their inclusion was believed
to be a better reflection of the naturally occurring ischemic
stroke patient population in which spontaneous reperfusion has been
detected within 24 hours after symptom onset in 24% of patients with
transcranial Doppler
ultrasound.43 Therefore, our
algorithms predictions seem likely to be based on the natural
evolution of ischemic tissue undergoing infarction. However,
our training set is small, and therefore does not yet capture the full
range and frequency of stroke evolution possibilities. For example, if
in a new patient an event occurs to interrupt the progression of
ischemic damage as quantified from the training patient data,
the probability of infarction of individual tissue regions may change
greatly. This was apparent in the case of patient 11, who exhibited
spontaneous reperfusion
(Figure 4
). For such circumstances, progression of infarct
lesion size has been shown to be
diminished.15 44 45 46
A similar change in probabilities might be seen after successful
therapeutic reperfusion or after administration of an effective
neuroprotective agent. This method, therefore, appears to provide a
technique that might be used to monitor this change in risk
quantitatively. Were this approach to be validated, the GLM approach
could become a useful statistical method for evaluating the efficacy of
novel therapies and possibly even develop into a tool to help guide the
choice of appropriate therapy for individual
patients.
Future Investigation
Our data demonstrate in a preliminary fashion the
feasibility of combining diffusion and perfusion information into a
single index of tissue risk. Although collection of additional patient
data will make the specific algorithm parameters more
robust, this would not necessarily change the methodology we have
developed for analyzing and quantifying this natural history data. On
the other hand, we believe there are still many avenues of
investigation for improving these algorithms. Clearly, the
retrospective aspect of this study limited our models. As demonstrated
by the large variance in lesion volumes and etiologies across patients
in this report, prospective studies involving a greater cohort of
patients with standardized MR acquisition parameters and
follow-ups at set intervals are needed to further test the validity of
the algorithmic approaches described here. For example, an
overestimation of "final" lesion volumes in some patients may have
occurred because of the possible presence of vasogenic edema at 5 days
after ictus,47 resulting in
the use of wrongly classified voxels in the training and evaluation of
our algorithms. In addition, inaccuracies in the coregistration may
have introduced errors in both algorithm development and evaluation.
Although intrasubject studies have shown the average misregistration
size to be <1 mm, less than our voxel dimensions, the maximum
misalignment has been reported to be as large as 3.8
mm.32 This suggests that our
algorithms results may be inaccurate for cases involving small
infarct volumes. The addition of acute clinical variables as
covariates may also improve our models performances, as has
been demonstrated by another study that predicted clinical outcome by
combining imaging data with initial clinical
variables.48
A priori assumptions in algorithm design, principally that the risk of infarction changes linearly with the covariates, may also have negatively impacted the performance of both thresholding and GLM algorithms. Several studies have shown that the risk of infarction does not change linearly for some of the algorithm variables. For example, ADC has been well documented to first decrease in acute cerebral ischemia before pseudonormalizing and increasing in the chronic stage.9 This nonlinear behavior may also hold true for perfusion metrics even in the hyperacute stage. Recent studies have found both increased and decreased CBV in acutely imaged lesions (<12 hours) that become infarcted, as shown by follow-up MR studies.16 19 The GLM algorithms we used in this study assume linear behavior. This suggests that additional investigations of algorithms that take into consideration the nonlinear behavior of covariates may provide improved performance.
Finally, there are a few additional technical limitations to our approach. Our models are almost certainly limited because they do not account for the intrinsic anatomic variations in both normal and pathophysiologic conditions. For instance, white matter may be misclassified as territory at risk of infarction because its normal flow values fall within the ischemic range for gray matter. Expert models that can differentiate white from gray matter and apply the appropriate tissue specific model to obtain an assessment of infarction risk can potentially compensate for this limitation.
Conclusion
Despite some limitations, we have shown that algorithms
combining diffusion and perfusion information can assess the risk of
infarction at the acute stage with greater sensitivity and specificity
than algorithms using diffusion and perfusion information separately.
Of the combined algorithms studied, the generalized linear model
algorithm may provide the preferred approach owing to its potentially
greater ease of interpretation with its single index of risk. Although
further investigation and algorithm refinement is necessary, this
method for quantitatively assessing the risk of infarction on a
voxel-by-voxel basis shows promise as a technique for not only gaining
insight into the natural spatial evolution of ischemic damage
in humans but also evaluating the effects that novel therapies may have
on this
process.
| Acknowledgments |
|---|
Received August 17, 2000; revision received December 15, 2000; accepted January 17, 2001.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Ay, E. M. Arsava, J. Rosand, K. L. Furie, A. B. Singhal, P. W. Schaefer, O. Wu, R. G. Gonzalez, W. J. Koroshetz, and A. G. Sorensen Severity of Leukoaraiosis and Susceptibility to Infarct Growth in Acute Stroke Stroke, May 1, 2008; 39(5): 1409 - 1413. [Abstract] [Full Text] [PDF] |
||||
![]() |
G TAN and T GODDARD Neuroimaging applications of multislice CT perfusion Imaging, June 1, 2007; 19(2): 142 - 152. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. P. Adams Jr, G. del Zoppo, M. J. Alberts, D. L. Bhatt, L. Brass, A. Furlan, R. L. Grubb, R. T. Higashida, E. C. Jauch, C. Kidwell, et al. Guidelines for the Early Management of Adults With Ischemic Stroke: A Guideline From the American Heart Association/American Stroke Association Stroke Council, Clinical Cardiology Council, Cardiovascular Radiology and Intervention Council, and the Atherosclerotic Peripheral Vascular Disease and Quality of Care Outcomes in Research Interdisciplinary Working Groups: The American Academy of Neurology affirms the value of this guideline as an educational tool for neurologists. Circulation, May 22, 2007; 115(20): e478 - e534. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. P. Adams Jr, G. del Zoppo, M. J. Alberts, D. L. Bhatt, L. Brass, A. Furlan, R. L. Grubb, R. T. Higashida, E. C. Jauch, C. Kidwell, et al. Guidelines for the Early Management of Adults With Ischemic Stroke: A Guideline From the American Heart Association/ American Stroke Association Stroke Council, Clinical Cardiology Council, Cardiovascular Radiology and Intervention Council, and the Atherosclerotic Peripheral Vascular Disease and Quality of Care Outcomes in Research Interdisciplinary Working Groups: The American Academy of Neurology affirms the value of this guideline as an educational tool for neurologists Stroke, May 1, 2007; 38(5): 1655 - 1711. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M. Menezes, H. Ay, M. Wang Zhu, C. J. Lopez, A. B. Singhal, J. O. Karonen, H. J. Aronen, Y. Liu, J. Nuutinen, W. J. Koroshetz, et al. The Real Estate Factor: Quantifying the Impact of Infarct Location on Stroke Severity Stroke, January 1, 2007; 38(1): 194 - 197. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Wu, S. Christensen, N. Hjort, R. M. Dijkhuizen, T. Kucinski, J. Fiehler, G. Thomalla, J. Rother, and L. Ostergaard Characterizing physiological heterogeneity of infarction risk in acute human ischaemic stroke using MRI Brain, September 1, 2006; 129(9): 2384 - 2393. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. V. Guadagno, P. S. Jones, T. D. Fryer, O. Barret, F. I. Aigbirhio, T. A. Carpenter, C. J. Price, J. H. Gillard, E. A. Warburton, and J.-C. Baron Local Relationships Between Restricted Water Diffusion and Oxygen Consumption in the Ischemic Human Brain Stroke, July 1, 2006; 37(7): 1741 - 1748. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Seitz, S. Meisel, P. Weller, U. Junghans, H.-J. Wittsack, and M. Siebler Initial Ischemic Event: Perfusion-weighted MR Imaging and Apparent Diffusion Coefficient for Stroke Evolution |