| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Stroke. 2004;35:1100.)
© 2004 American Heart Association, Inc.
Original Contributions |
From the Section on Stroke Diagnostics and Therapeutics, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Md.
Correspondence to Dr. Steven Warach, 10 Center Drive, B1-D733, Mail Stop Code 1063, Bethesda, MD 20892-1063. E-mail warachs{at}ninds.nih.gov
| Abstract |
|---|
|
|
|---|
Methods Sixteen patients presenting within 6 hours from symptoms, and having partial reversal of the acute lesion on DWI were studied using conventional CSF-suppressed DWI. Lesions were segmented from coregistered acute DWI and follow-up fluid-attenuated inversion recovery (FLAIR) series. The segmented volumes were applied to conventional (ADCC) and CSF-suppressed ADC (ADCFLIPD) maps to classify each voxel as progressed to infarct or reversed. Individual voxel ADC values were pooled across all patients. Sensitivity to predict reversal, specificity, and accuracy were calculated for both methods.
Results A total of 25 313 voxels were classified as progressed and 31 952 voxels reversed. Across all lesion voxels, ADCFLIPD values more accurately depicted tissue fate compared with ADCC values (P<0.0001). The largest difference in the two methods was in voxels with <75% parenchyma, where the accuracy of ADCC was only 50% compared with 62% for ADCFLIPD.
Conclusion CSF-suppressed ADC measurements gave a more accurate identification of reversible ischemic injury in this sample. We predict that multimodal MRI models of tissue viability in ischemic stroke will be more accurate if CSF-suppressed ADC measurements are used.
Key Words: magnetic resonance imaging cerebral ischemia cerebrospinal fluid stroke, acute
| Introduction |
|---|
|
|
|---|
Several strategies have been used to predict tissue fate-recovery, progression to infarct, or hemorrhagic transformation-using ADC or perfusion parameters alone or in combination.1121 These predictive models have used a range of approaches from identifying thresholds to predictions based on general linear statistical algorithms; from regional analysis to pooling voxel data across all patients. Per voxel analysis is based on the premise that the heterogeneity within the acute lesion may be informative and may be obscured by gross regional volumetric measures that average voxel values within a region.
Within an imaging voxel there can be a mix of normal and ischemic gray and white matter, vasculature and CSF; all with differing ADC.22 As the ADC of CSF is approximately 3 times that of normal tissue, voxels containing CSF can have falsely elevated ADC values due to partial volume averaging effects.23,24 Furthermore, the volume fraction of CSF varies throughout cortical and subcortical areas, eg, adjacent to sulci, ventricles, brain surface, perivascular spaces. Therefore, to best characterize the risk of the underlying tissue, these limitations of imaging need to be addressed.
Effective CSF-suppression has been demonstrated with fluid-attenuated inversion recovery (FLAIR) protocols, commonly used with T2-weighted imaging for chronic lesion identification.25,26 These techniques have been demonstrated to reduce the effect of CSF on ADC measurements when used in a DWI pulse sequence.2729 We hypothesized that CSF-suppressed ADC would have greater predictive accuracy than conventional ADC.
| Methods |
|---|
|
|
|---|
To investigate the relative accuracy of FLIPD in distinguishing voxels that progress to infarct from those that may reverse to normal, we selected patients with evidence of partial lesion recovery. Patients met the following criteria: (1) discharge diagnosis of ischemic stroke; (2) acute imaging prior to the initiation of thrombolytic therapy, in patients who received alteplase; (3) DWI (b=0, b=1000), FLIPD (b=0, b=1000), and perfusion weighted imaging (PWI) exams of good quality within 6 hours of symptom onset; (4) a lesion >2 cm in greatest diameter within the cerebral hemispheres on acute conventional DWI examination; (5) FLAIR examination at least 20 days postictus; and (6) direct or indirect evidence of reperfusion determined by either 50% reduction of the volume of mean transit time (MTT) deficit within 24 hours, or reduction of at least 10% of the ischemic lesion volume on follow-up FLAIR imaging.
Imaging Protocol
Imaging was performed using a 1.5T clinical MRI system and the standard quadrature transmit-receive (TR) head coil. Two series of single shot echo planar (EPI) diffusion-weighted images were acquired: a conventional DWI protocol with repetition and echo times (TR/TE) of 6 s and 72 ms, respectively, and a FLIPD series (TR/TE=9 s/72 ms, inversion time [TI]=2.2 s), both with 128x128 matrix, field of view (FOV)=240, and 20 contiguous but interleaved 7 mm slices. Images were acquired at b=0 and b=1000 with diffusion-weighting gradients along 3 orthogonal axes, for a total of 80 images per series, and a trace-weighted image was calculated by the MRI system software. Residual eddy currents typically resulted in distortion of less than 2 mm in any direction, with worst case of 3 mm in the most inferior slices. The distortion was equivalent in the conventional and CSF-suppressed techniques. Acute conventional FLAIR imaging was conducted with TR/TE=9 s/98 ms, TI=2.2 s, matrix of 256x256 and FOV=240. FLAIR imaging at follow-up was of a higher resolution than the acute series, with slice thickness reduced from 7.0 to 2.0 mm. PWI was perfumed using standard bolus tracking methods. Gadolinium-DTPA was administered at a dose of 0.1 mmol/kg via power injector (5 cc/s) during gradient EPI with TR/TE=2 s/45 ms, matrix of 64x64, FOV=240 and 20 axial slices acquired in 25 time series. Relative MTT maps were calculated using concentration-time curves obtained from the PWI series.
Lesion Segmentation
For each patient, the DWI b=0 examination was first aligned to the anterior-posterior axis and the other image series was coregistered to it using a 7-parameter transformation model with trilinear interpolation and Normalized Mutual Information cost function (Medical Image Processing Analysis and Visualization, version 0.994u).30 Lesion segmentation was performed by an experienced reader, blinded to patient identifiers and clinical data. All acute lesions were segmented from the area of visible hyperintensity on conventional DWI exams. After all DWI exams had been evaluated, the area of hyperintensity on the follow-up FLAIR examination was segmented. In some cases, cortical retraction had developed within the boundaries of the chronic lesion. If the sulcus in that region was not evident on the acute FLAIR examination, the area of the sulcus was included within the follow-up FLAIR segmentation.
ADC maps were calculated from the acute DWI and FLIPD images in IDL (v5.6, Research Systems Incorporated) without thresholding. The segmented regions of interest (ROI) were then applied to both conventional ADC (ADCC) and FLIPD ADC (ADCFLIPD) maps to classify individual voxels as follows: (1) in both acute DWI and chronic FLAIR ROI (ischemic injury progressing to infarct); (2) in acute DWI, but not the chronic FLAIR ROI (ischemic injury reversing to normal); and (3) comparable contralateral hemisphere (normal control). The latter ROI was used for calculation of relative ADCC (rADCC) and was obtained by taking the acute DWI volume and mirroring it about the anterior-posterior axis to the contralateral hemisphere. The rADCC values of each voxel were calculated by dividing each voxel ADCC value by the mean ADCC of the patients contralateral control ROI. The relative parenchyma volume fraction for each of the voxels of interest was calculated from the FLIPD b=0 and conventional DWI b=0 according to previously published methods.24
Data Analysis
For each patient, the mean ADCC and ADCFLIPD were calculated for the progressed and reversed regions. Voxel data then were combined across patients, and differences in the mean and variance among region categories were evaluated with Welch ANOVA, assuming unequal variances. Receiver Operating Characteristic (ROC) curves were created to assess the probability of correctly classifying a voxel given the ADCC, ADCFLIPD, or rADCC value. From the areas under the curve (AUC), standard error, and correlation of diagnostic accuracy between methods, the difference between methods was evaluated using the statistical model of Hanley and McNeil31 for comparing 2 tests applied to the same population. Sensitivity (to predict reversal), specificity (to predict infarct), and accuracy were calculated for each subgroup along the continuum of possible discrimination thresholds for per patient data and pooled voxel data. True positives were data within the reversed category with ADC values that exceeded the discrimination threshold; false-negatives had subthreshold values. False-positives were progressed category data with suprathreshold ADC values; true negatives had subthreshold values.
Variation of ADCC and ADCFLIPD values with the relative parenchymal volume fraction of the voxel was tested with Pearsons correlation. To further examine the effect of CSF on sensitivity, specificity, and accuracy, voxels within each region of interest were divided into subgroups based on the relative parenchymal volume fraction: <75%, 75% to 90%, and >90%. As CSF contamination is minimal for voxels with >90% parenchyma, the threshold values at maximum accuracy for this subgroup were determined for each method, and then applied across all voxels.
For each voxel, its ADCC, ADCFLIPD, and rADCC values were divided by the threshold proposed for that method. Voxels with diffusion measurements greater than the threshold, therefore, had transformed values >1.0, whereas subthreshold values were less than 1. The differences of the paired transformed values were then ranked, and the Wilcoxon paired test was conducted. SPSS statistical software (version 11.0) was used for analyses.
| Results |
|---|
|
|
|---|
|
Pooled Voxel Characteristics
The pooled voxel data of the three categories (progressed, reversed, normal) were significantly distinct from each other within each modality (ADCC: Welch ANOVA F=13912, P<0.001; ADCFLIPD: Welch ANOVA F=33469, P<0.001). Progressed voxels had lower values (ADCC mean 0.671±0.234, ADCFLIPD mean 0.536±0.134), reversed were intermediate (ADCC 0.753±0.253, ADCFLIPD mean 0.622±0.142), and normal voxels had the highest values (ADCC 1.063±0.455, ADCFLIPD 0.807±0.168). Progressed voxels were more often distinguished from normal, based on ADCFLIPD values, than based on ADCC values (AUCFLIPD 0.940, AUCC 0.854; z=49.3, P<0.001). The same was true for distinguishing reversed voxels from normal (AUCFLIPD 0.846, AUCC 0.779; z=40.1, P<0.001). Of interest to this study was the direct comparison of voxels that progressed to infarct versus those that reversed. (Figure 1). For this analysis, the AUC for ADCFLIPD (0.705; 95% CI 0.700 to 0.709) was larger than both ADCC (0.623; 95% CI 0.618 to 0.627) and rADCC (0.691; 95% CI 0.687 to 0.696). ADCFLIPD was significantly more accurate than rADCC (z=7.22, P<0.001) characterizing tissue reversal from progression. As the AUC for ADCC was not large enough to be tested by the Hanley-McNeil method, significance could not be established for the comparison between ADCFLIPD and ADCC.
|
As demonstrated by the overlapping ROC curves for ADCFLIPD and rADCC, the diagnostic efficacy can vary along regions of the ROC plot. The threshold of optimal discrimination accuracy was calculated given the number of true and false-positives and -negatives at the ADC values along the ROC plot. ADCC achieved a maximum diagnostic accuracy of 62% at a threshold of 0.57x103 mm2/s. The maximum accuracy of ADCFLIPD and rADCC were equal at 67%. The ADCFL threshold was 0.54x103 mm2/s. The rADCC threshold was 0.53; when multiplied by the mean ADCC of the normal ROI this equaled 0.56x103 mm2/s.
Impact of CSF Contamination
Lesion ADCC values correlated negatively with
p, the relative parenchymal volume fraction (r=0.80, P<0.01). In this study, 21% of acute DWI lesion voxels had volume fractions <75%, and an additional 34% were in the 75% to 90%
p subgroup. These voxels were evenly distributed among the progress and reverse categories. The maximum accuracy of both methods was similar for voxels >90% parenchyma (69% ADCC; 68% ADCFLIPD), and converged at similar threshold values of 0.51 and 0.52x103 mm2/s respectively. Within this subgroup, the 2 methods differed most in their ability to characterize voxels that progressed to infarct, with ADCFLIPD achieving a higher specificity of 60% compared with 49% for ADCC (Table 2). These discrimination thresholds, 0.51x103 mm2/s and 0.52x103 mm2/s, were then applied to the <75% and 75% to 90%
p subgroups as well as the entire data set. The accuracy of ADCC was observed to decrease to 50% for voxels with <75% parenchyma. In this subgroup, all of the voxels were characterized as reversed due to suprathreshold ADCC values, when, in fact, half of the voxels progressed to infarct. FLIPD methods were less affected by CSF contamination, attaining an accuracy of 67% to 68% for all of the categories except the <75%
p subgroup, where accuracy decreased to 62%. In both conventional and FLIPD methods, all of the normal control voxels were above the threshold and therefore would be classified appropriately. Histograms categorized by volume fraction are shown in Figure 2. The threshold values determined by the pooled data were then applied to patient ADCC and ADCFLIPD maps, to demonstrate spatial agreement with the actual final infarct volume. An example of a representative patient is shown in Figure 3.
|
|
|
Voxel ADCFLIPD was found to significantly differ from both ADCC (W=168.85, P<0.0001) and rADCC (W=65.78, P<0.0001). There were a greater number of voxels where the difference between the ADCC value and the threshold exceeded the difference between the ADCFLIPD value and corresponding threshold. Of the progressed voxels that were correctly classified according to the ADCFLIPD value, 6277 (25%) were misclassified by their ADCC values and 6056 (24%) misclassified by rADCC. At maximum accuracy, rADCC achieved higher sensitivity in characterizing voxels that reversed than did ADCFLIPD. However the false-positive ratio (1-specificity) for rADCC was also higher: 56% as compared with 39% for ADCFLIPD.
| Discussion |
|---|
|
|
|---|
Combined algorithms18 have demonstrated that diffusion-based models differ from those of perfusion parameters, not in their overall accuracy, but in the trade-off between sensitivity and specificity. In the context of decision making for acute thrombolytic therapy, this balance of potential benefit and potential harm is of particular importance. Wu et al18 demonstrated that using rADC values to evaluate infarct progression resulted in a higher false-positive ratio for any given true-positive ratio. Our study was structured so that a higher specificity corresponded to correct classification of infarct progression. While ADCFLIPD and rADCC achieved similar maximum accuracy across the full range of lesion voxels, ADCFLIPD demonstrated greater specificity (61%) than rADCC (44%) and ADCC (40%). When the methods were compared at the optimal discriminant threshold determined by the subgroup of voxels with greater than 90% parenchyma (Table 2), the ability of ADCC values to correctly predict infarct was reduced to only 25%, whereas the ADCFLIPD was only reduced to 54%.
Some models have used automated thresholding to reduce CSF contamination, for example, eliminating voxels above supranormal values such as 1.2x103 mm2/s. When applied to our data set, this filter did not improve the accuracy of either method. Notably, after the filter was applied to ADCC values, 21% of progressed and 14% of reversed voxels remaining still had <75% parenchymal volume fraction. This is a substantial volume of tissue still affected by CSF contamination and could have implications regarding treatment decisions. It should be noted that the CSF contaminated voxels were not restricted to the cortical surfaces (see Figure 3), and, thus, avoiding sampling from the cortical surface would not eliminate this problem.
The FLIPD examination was less than 2 minutes in duration, as part of a 15-minute multiparametric MRI protocol. While the signal to noise ratio of FLIPD is less than that of conventional DWI, all lesions were conspicuous on the FLIPD studies. Additionally, as ADC values can be decreased in perilesion areas, yet maintain normal DWI appearance,12,15 the accuracy of quantitative parameters is more important than visual contrast for the calculation of risk maps.
Several potential limitations of this study require discussion. Ours is a retrospective study and has the limitations inherent to that approach. Prospective confirmation of these results both as a single predictor and as part of a multivariate predictive model is important and is under way. We aimed to make the sample as homogeneous as possible with regard to time from onset, lesion size and location, and evidence of reversibility. The majority of the patients received intravenous recombinant tissue plasminogen activator (rtPA), but the improved accuracy with ADCFLIPD was comparable between the patients who received rtPA and those who did not (results not shown).
The choice of the conventional rather than EPI FLAIR to delineate the chronic lesion was because of greater accuracy afforded by better spatial resolution and contrast to noise than on the b0 EPI, but this may have introduced image coregistration errors. Notwithstanding the noise that may be introduced by such misregistration, we found significant differences on our primary hypothesis, suggesting that such errors were not sufficient to obscure the differences between the acquisition methods.
The pooling of voxel data across patients has been the common approach to identifying thresholds in the prior literature because it can be prospectively applied in the acute setting, and, therefore, we adopted that method here. However, it assumes a complete independence of voxel measurements, which is unlikely to be the case. Voxel measurements within a patient, within a region, and across imaging modalities are likely to be correlated to some degree, and patients with larger lesions would have a relatively greater influence on the results. The effect could be minimizing the magnitude of group differences, while increasing the statistical power. We also compared ADCFLIPD and ADCC for each progressed and reversed regions on a per patient basis (see first paragraph of "Results"), confirmed a greater accuracy with either method, and a greater improvement in accuracies with ADCFLIPD, but that difference was not statistically significant, due to the small sample size. Nonetheless, this regional analysis is consistent with the per voxel analysis.
Acute ischemic lesions in DWI contain heterogeneous ADC values with variable CSF contamination. Removal of CSF contamination provides a more accurate measurement of the underlying tissue ADC. As ADC has been proposed as one measurement to improve patient selection for thrombolytic and other acute stroke therapies, CSF-suppressed ADC measurements may be advantageous in testing the accuracy of predictive models of tissue viability and hemorrhagic risk. Accurate discrimination of salvageable from irreversibly damaged ischemic brain may also have implications for patient management and prognosis in the hyperacute period.
| Acknowledgments |
|---|
Received June 19, 2003; revision received October 14, 2003; accepted December 12, 2003.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. E. Chen, J. E. Simon, M. D. Hill, C.-H. Sohn, P. Dickhoff, W. F. Morrish, R. J. Sevick, and R. Frayne Acute Ischemic Stroke: Accuracy of Diffusion-weighted MR Imaging--Effects of b Value and Cerebrospinal Fluid Suppression Radiology, January 1, 2006; 238(1): 232 - 239. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Stroke Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2004 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |