Donate Help Contact The AHA Sign In Home
American Heart Association
Stroke
Search: search_blue_button Advanced Search
Stroke. 2006;37:2951-2956
Published online before print November 2, 2006, doi: 10.1161/01.STR.0000249416.77132.1a
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Correction (v38,p452)
Right arrow All Versions of this Article:
37/12/2951    most recent
01.STR.0000249416.77132.1av1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Luby, M.
Right arrow Articles by Warach, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Luby, M.
Right arrow Articles by Warach, S.
Related Collections
Right arrow CT and MRI
Right arrow Acute Cerebral Infarction
Right arrow Thrombolysis

(Stroke. 2006;37:2951.)
© 2006 American Heart Association, Inc.


Original Contributions

Intra- and Interrater Reliability of Ischemic Lesion Volume Measurements on Diffusion-Weighted, Mean Transit Time and Fluid-Attenuated Inversion Recovery MRI

Marie Luby, MEng, MS; Julie L. Bykowski, MD; Peter D. Schellinger, MD, PhD; José G. Merino, MD Steven Warach, MD, PhD

From the the National Institute of Neurological Disorders and Stroke, National Institutes of Health (M.L., J.L.B., J.G.M., S.W.), Bethesda, Maryland; and Neurologische Klinik, Universitätsklinikum Erlangen (P.D.S.), Erlangen, Germany.

Correspondence to Marie Luby, MEng, MS, Section on Stroke Diagnostics and Therapeutics, National Institute of Neurological Disorders and Stroke, National Institutes of Health, 10 Center Drive, Rm. B1D733, MSC 1063, Bethesda, MD 20892. E-mail: lubym{at}ninds.nih.gov


*    Abstract
up arrowTop
*Abstract
down arrowIntroduction
down arrowMethods
down arrowResults
down arrowDiscussion
down arrowReferences
 
Background and Purpose— We investigated the intra- and interrater reliability of ischemic lesion volumes measurements assessed by different MRI sequences at various times from onset.

Methods— Ischemic lesion volumes were measured for intrarater reliability using diffusion-weighted (DWI), mean transit time (MTT) perfusion and fluid-attenuated inversion recovery (FLAIR) MRI at chronic (>3 days from stroke onset) time points. A single intrarater reader, blind to clinical information and time point, repeated the volume measurements on two occasions separated by at least 1 week. Interrater reliability was also obtained in the second set of patients using acute DWI, MTT and chronic FLAIR MRI. Four blinded readers performed these volume measurements. Average deviations across repeat measurements per lesion and differences between sample means between the two measurements were calculated globally, ie, across all sequences and time points, and per reader type for each sequence at each time point.

Results— There was good concordance of the mean sample volumes of the 2 intrarater readings (deviations were <4% and 2 mL globally, <2% and 2 mL for DWI, <6% and 7 mL for MTT, and <2% and 1 mL for FLAIR). There was also good concordance of the interrater readings (<5% and 2 mL globally).

Conclusions— Repeat measurements of stroke lesion volumes show excellent intra- and interrater concordance for DWI, MTT and FLAIR at acute through chronic time points.


Key Words: acute stroke • brain imaging • magnetic resonance • neuroradiology • thrombolysis


*    Introduction
up arrowTop
up arrowAbstract
*Introduction
down arrowMethods
down arrowResults
down arrowDiscussion
down arrowReferences
 
Measurements of ischemic lesion volumes have been used in numerous imaging studies. Investigations of stroke outcomes and studies of the effects of therapeutic interventions including randomized clinical trials have included lesion volumes measured acutely with diffusion (DWI) and perfusion (PWI) weighted MRI and chronically with T2-weighted or fluid-attenuated inversion recovery (FLAIR) MRI.1–9 The use of volumetric measurements as an objective quantitative tool depends on intra- and interrater variability, but limited information is available on the reliability of these measurements across MRI sequence type and time from stroke onset. Reliabilities reported have been restricted to small sample sizes using manual or semiautomated techniques. Martel et al reported an intra- and interobserver repeatability coefficient defined as the 95% CI, of <6 mL for their semiautomatic method of measuring lesion volume using DWI (N=10).10 Ritzl et al reported an interobserver error of <3 mL for measurement of chronic FLAIR volume (N=8).11 Baird et al and Barber et al both reported an interrater variability of 5% with manual segmentation of lesion volumes on DWI.12,13 The current study provides intra- and interrater reliability estimates of lesion volume measurements, which may be relevant to stroke trial design and lesion data analysis.


*    Methods
up arrowTop
up arrowAbstract
up arrowIntroduction
*Methods
down arrowResults
down arrowDiscussion
down arrowReferences
 
Patients
This study is part of a prospective, natural history study of MRI in a consecutive series of tissue plasminogen activator (tPA)-treated patients at the National Institutes of Neurological Disorders and Stroke (NINDS) and Suburban Hospital, Bethesda, Md.14,15 The Institutional Review Board at NINDS and Suburban Hospital approved the study. From February 2000 through January 2005, 147 patients were treated with standard intravenous tPA; of those, 81 patients had an MRI before tPA treatment and were the subject of the intrarater analysis. Patients were eligible for this analysis if they received an MRI scan ("acute" scan) followed by standard intravenous tPA within 3 hours from stroke onset. The study design was to obtain serial MRI approximately 3 hours after the acute scan ("3-hour" scan), approximately 24 hours after the start of tPA ("24-hour" scan), and then at 5, 30, and 90 days after stroke onset ("chronic" scan). Chronic (>3 days from stroke onset) FLAIR volumes were taken from the latest available FLAIR for a given patient. Because of clinical care requirements, patient preferences, or death, all posttreatment MRI time points were not obtained in all patients. Only image time points and sequences with identifiable lesions were included.

A second confirmatory sample for intrarater analysis was obtained from August 1999 through July 2005 in 106 patients diagnosed with an acute cerebrovascular event, ischemic stroke, or transient ischemic attack. These patients had an acute MRI on average within 14 hours of onset and were not treated with tPA. The study design was identical to the tPA patients with the exception of the "3- and 24-hour" scans, which were not obtained. These same patients were also measured for the interrater analysis.

Imaging Sequences
MRI sequences were performed on a GE 1.5-T clinical scanner (Twinspeed; General Electric).

Diffusion-Weighted Imaging
In this study, the DWI spin-echo planar sequence included 20 contiguous axial oblique slices with b=0 and b=1000 s/mm2, isotropically weighted, using repetition time (TR)/echo time (TE)= 6000/72 ms, acquisition matrix of 128x128, 7-mm slice thickness, and 24-cm field of view (FOV).

Perfusion-Weighted Imaging
In this study, the PWI gradient-echo planar sequence included 20 contiguous axial oblique slices with single-dose gadolinium injection of 0.1 mmol/kg through a power injector using 25 phase measurements (2 seconds per phase measurement), TR/TE=2000/45 ms, acquisition matrix of 64x64, 7-mm slice thickness, and 24-cm FOV.

Mean Transit Time
Mean transit time (MTT) maps were calculated from PWI using time concentration curves in this study as the first moment of the time concentration curves divided by the zeroeth moment.

Fluid-Attenuated Inversion Recovery
FLAIR images were used for chronic lesion measurement as the suppression of cerebrospinal fluid in these images is advantageous when delineating the ischemic lesions from the adjacent sulci or ventricles. In this study, the high-resolution FLAIR sequence included 66 contiguous axial oblique slices, TR/TE=9000/92 ms, TI=2200 ms, acquisition matrix of 256x128, 2-mm slice thickness, and 24-cm FOV.

Image Analysis
Image analysis software (Cheshire; Perceptive Informatics) was used for measurement of ischemic lesion volumes from DWI, MTT, and FLAIR MRI sequences. Lesion volumes were measured at acute, subacute (3 and 24 hours), and chronic time points. Readers blind to clinical characteristics and time point used a semiautomated technique for initial identification of all the lesions and a manual editing tool for final corrections to the lesion borders. All lesion areas on a slice-by-slice basis were segmented with a semiautomated segmentation tool followed by manual editing. The semiautomated segmentation tool is based on a watershed method dependent on a series of seed points and the subsequent sampled surrounding area placed by the reader. The volumes were automatically produced by the multiplication of the slice thickness times the total lesion area. The intrarater reader (M.L.) measured the volume of each lesion on two occasions separated by at least 1 week. The intrarater reader trained one medical student (J.B.) and two stroke neurologists (P.S., J.M.) to perform interrater measurements using the same semiautomated technique. For the interrater readings, one reader (M.L.) performed one complete set of these measurements; however, the second set was performed by any combination of two of the three additional interrater readers with no repeat measurements. Only one measurement was produced by one of the additional interrater readers to perform direct comparisons with the first set of measurements.

DWI lesion volumes were assessed on the affected slices with hyperintense areas visible from the b=1000 mm/s2 images. The reader paid particular attention to the typical locations of bilateral artifact and produced apparent diffusion coefficient maps as necessary to identify positive DWI lesions. For MTT assessments, the reader paid particular attention to exclude hyperintensities attributable to the typical susceptibility artifacts adjacent to the paranasal sinuses. The reader assessed the MTT as not evaluable if the signal drop from the contrast did not produce at least a 10% drop or if there was significant patient motion causing inconsistency in the confirmation of the perfusion deficit. In evaluating FLAIR, the reader paid particular attention to preexisting chronic lesions present on the acute FLAIR images to avoid replication of these lesion areas in the current FLAIR volume calculations.

Measurements from each sequence were performed independently of the other time points except for the chronic FLAIR measurements in which reference to the acute FLAIR sequence aided identification of the index stroke. Within each time point, the reader had access to the other sequences to aid in accurate identification of the index lesion.

Patients’ intra- and interrater measurements from single slices of the acute DWI volumes are displayed in Figure 1. These same patients’ acute MTT and chronic FLAIR measurements from the corresponding single slices are displayed in Figures 2 and 3Down.


Figure 1
View larger version (108K):
[in this window]
[in a new window]

 
Figure 1. (A) Intrarater volumetric measurements on single DWI image (top left region of interest [ROI])—(top right ROI). (B) Interrater volumetric measurements on single DWI image (bottom left ROI—(bottom right ROI) with arrow indicating main difference between readings.


Figure 2
View larger version (115K):
[in this window]
[in a new window]

 
Figure 2. (A) Intrarater volumetric measurements on single MTT image (top left ROI)—(top right ROI). (B) Interrater volumetric measurements on single MTT image (bottom left ROI)—(bottom right ROI) with arrows indicating main differences between readings.


Figure 3
View larger version (115K):
[in this window]
[in a new window]

 
Figure 3. (A) Intrarater volumetric measurements on single FLAIR image (top left ROI)—(top right ROI). (B) Interrater volumetric measurements on single FLAIR image (bottom left ROI)—(bottom right ROI) with arrows indicating main differences between readings.

Statistical Analysis
Deviations between measurements were evaluated in two ways. The sample means, globally (across all sequences and time points) and within each reader type (intra- or interrater), time point, and sequence were compared for the two reads. The deviation per lesion was computed as the absolute value of the difference between the two readings; percent deviation per lesion was the per lesion absolute deviation divided by the average of the two readings. All volume data were skewed heavily toward mild. Cube root transformation of the raw volumes was performed to normalize the volumetric data. For example, for intrarater, acute DWI volume sample had a skewness of 2.1 and a kurtosis of 3.9; after transformation, the skewness was 0.7 and the kurtosis was –0.3. The absolute volume of the difference and the percentage deviation of the two readings, raw volume and cube root of volume, were calculated for each reader type and sequence and then averaged across all patients.

Bland-Altman plots were generated for the raw volume data to display the spread of data and the limits of agreement, specifically to illustrate how many of the averaged data points lie within 2 SDs from the mean difference.16 The Bland-Altman plots were used to address the key question of whether one set of volumetric measurements is sufficiently representative or if two sets of measurements are required for providing the most accurate results. The 95% confidence limits are proposed as the repeatability coefficients of one type of measurement for another, ie, one set of volumetric measurement is sufficient rather than requiring two sets in this particular study.16


*    Results
up arrowTop
up arrowAbstract
up arrowIntroduction
up arrowMethods
*Results
down arrowDiscussion
down arrowReferences
 
In the intrarater analysis, DWI had measurable lesions in 68 patients at the acute time point, 62 at 3 hours and 41 at 24 hours. MTT images were measured in 62 patients at the acute time point, 38 at 3 hours and 24 at 24 hours. Chronic FLAIR images were measured in 46 patients. The volume statistics by sequence, DWI, MTT, and FLAIR are listed in Table 1. Only DWI, MTT, and FLAIR measurements in which at least one volume measurement is nonzero are reported in Table 1 and all subsequent tables and figures. The corresponding sample sizes are provided for each reader type, sequence, and time point. The correlation coefficients and cube root of the raw volume data are presented in supplemental Table I, available online at http://stroke.ahajournals.org.


View this table:
[in this window]
[in a new window]

 
TABLE 1. Intrarater Volume (mL) Statistics by Time Point and Sequence


View this table:
[in this window]
[in a new window]

 
TABLE I. Intrarater Volume (mL) Statistics by Time Point and Sequence—Cube Root of Raw Data

The confirmatory intrarater sample had measurable lesions in 98 patients at the acute DWI, 72 patients at the acute MTT, and 28 patients at the chronic FLAIR time points. The volume statistics for the confirmatory sample are presented in Table 1.

In the interrater analysis, there were measurable lesions in 103 patients at the acute DWI, 77 patients at the acute MTT, and 29 patients at the chronic FLAIR time points. The volume statistics by sequence, DWI, MTT, and FLAIR are listed in Table 2 for the interrater analysis.


View this table:
[in this window]
[in a new window]

 
TABLE 2. Interrater Volume (mL) Statistics by Time Point and Sequence

Sequence: Diffusion-Weighted Image
The mean volumes, absolute volume (mL) differences, and percent deviations between the two intrarater readings (N=68, 62, and 41) for DWI at acute, 3-hour, and 24-hour time points are reported in Table 1. Supplemental Figure IDown (panels A through C; available online at http://stroke.ahajournals.org) display the Bland-Altman plots for the DWI raw volume data for the intra- acute, 3-hour, and 24-hour time points. The upper and lower limits, shown as thick black solid lines, were calculated for each plot to represent ±2 SD from the mean. A total of 90%, 92%, and 90% data points were within these boundary limits for the DWI data, respectively. There were no significant changes with respect to time point in the reliability of the DWI data.


Figure 4
View larger version (34K):
[in this window]
[in a new window]

 
Figure I. A through E, DWI sequence. A, Intrarater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. B, Intrarater 3-hour time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. C, Intrarater 24-hour time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. D, Confirmatory intrarater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. E, Interrater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. F through J, MTT sequence. F, Intrarater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. G, Intrarater 3-hour time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. H, Intrarater 24-hour time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. I, Confirmatory intrarater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. J, Interrater acute time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. K through M, FLAIR sequence. K, Intrarater 4-145–day time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. L, Confirmatory intrarater 5–90–day time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean. M, Interrater 5–90–day time point plot of mean volume versus volume difference between 2 sets of measurements (raw numbers) where the thick black lines represent the upper and lower boundaries using ±2 SDs from the mean.


Figure 4
View larger version (30K):
[in this window]
[in a new window]

 
Figure I. Continued

The confirmatory intrarater sample statistics are reported in Table 1. Supplemental Figure IUp (panel D) contains the Bland-Altman plot for the DWI raw volume data. At total of 96% data points were within the boundary limits.

The mean volumes, absolute volume (mL) differences, and percent deviations along with the cube root statistics between the two interrater readings at acute DWI (N=103) time points are reported in Table 2. Supplemental Figure IUp (panel E) displays the interrater acute DWI Bland-Altman plot. A total of 94% data points were within the boundary limits. By excluding DWI lesions <10 mL in volume, the percent deviations (average, median, and SD of 25.01, 18.47, and 26.81, respectively) were comparable to those of the intrarater readings.

In addition to the measured samples, 11 and 12 acute DWI sequences were evaluated separately as normal, zero volumes, for the intra- and interrater readings. Included in the measured samples, there were 4 and 11 acute DWI sequences with one intrarater and interrater readings as zero volume. An additional five and four DWI sequences at 3 and 24 hours were evaluated separately as normal for both intrarater readings. There was one DWI sequence at 3 hours with one intrarater read as normal. There were three DWI sequences at 24 hours with one intrarater reading as normal.

Sequence: Mean Transit Time
The mean volumes, absolute volume (mL) differences, and percent deviations between the two intrarater readings (N=62, 38, and 24) for MTT at acute, 3 hour, and 24 hour time points are reported in Table 1. Supplemental Figure IUp (panels F through H) display the Bland-Altman plots for the MTT raw volume data for the intra- acute, 3-hour, and 24-hour time points. A total of 94%, 92%, and 92% data points were within the boundary lines for the MTT data, respectively. There were no significant changes with respect to time point in the reliability of the MTT data.

The confirmatory intrarater sample statistics are reported in Table 1. Supplemental Figure IUp (panel I) displays the Bland-Altman plot for the MTT raw volume data. At total of 99% data points were within the boundary limits.

The mean volumes, absolute volume (mL) differences, and percent deviations along with the cube root statistics between the two interrater readings at acute MTT (N=77) time points are reported in Table 2. Supplemental Figure IUp (panel J) displays the interrater acute MTT Bland-Altman plot. A total of 95% data points were within the boundary limits. When excluding the MTT deficits <10 mL in volume, the percent deviation statistics improved but were still not comparable to the intrarater readings.

In addition to the measured samples, seven and 34 acute MTT sequences were evaluated separately as normal, zero volumes, for the intra- and interrater readings. Included in the measured samples, there were one and 21 acute MTT sequences with one intrarater and interrater readings as zero volume. An additional five and nine MTT sequences at 3 and 24 hours were evaluated separately as normal for both intrarater readings. There were three MTT sequences at 3 hours with one intrarater read as normal.

Sequence: Fluid-Attenuated Inversion Recovery
The mean volumes, absolute volume (mL) differences, and percent deviations between the two intrareadings (N=46) for chronic FLAIR time points are reported in Table 1. Supplemental Figure IUp (panel K) displays the Bland-Altman plot for the intrarater chronic FLAIR time points. A total of 91% data points were within the boundary lines.

The confirmatory intrarater sample statistics are reported in Table 1. Supplemental Figure IUp (panel L) contains the Bland-Altman plot for the FLAIR raw volume data. A total of 89% data points were within the boundary limits.

The mean volumes, absolute volume (mL) differences, and percent deviations along with the cube root statistics between the two interrater readings at chronic FLAIR (N=29) time points are reported in Table 2. Supplemental Figure IUp (panel M) displays the interrater FLAIR Bland-Altman plot. A total of 93% data points were within the boundary limits. By excluding the FLAIR lesions <10 mL, the sample size was reduced to N=6; however, the percent deviations (average, median, and SD of 39.74, 39.73, and 27.69, respectively) were then comparable to those of the intrarater readings.

In addition to the measured sample, 12 chronic FLAIR sequences were evaluated separately as normal, zero volumes, for both intrarater readings. Included in the measured sample, there were five FLAIR sequences with only one intrarater reading as zero volume. There were no FLAIR sequences evaluated separately as normal, zero volumes, for both interrater readings. Included in the measured sample, there were four FLAIR sequences with only one interrater reading as zero volume.

A subset of nine intrarater patients with all sequences and time points was calculated for comparison purposes to Table 1. The mean volumes (N=9) of the two intrarater readings for DWI were 35.3 mL and 35.9 mL, 46.47 mL and 42.14 mL, and 78.53 mL and 75.89 mL at acute, 3 hours, and 24 hour time points, respectively. The mean volumes (N=9) of the two intrarater readings for MTT were 164.56 mL and 162.04 mL, 134.73 mL and 138.07 mL, and 86.39 mL and 83.83 mL at acute, 3 hours, and 24 hour time points, respectively. The mean volumes (N=9) of the two intrarater readings for FLAIR were 58.05 mL and 58.59 mL. The volume statistics of this subset of nine patients was consistent with those of the larger group of patients contained in Table 1.

The global mean volumes (N=341) of the two intrarater readings were 63.0 mL and 60.9 mL. The absolute volume (mL) differences between the reads were 6.54±13.27/1.92 (0.4 to 6.86) (mean±SD/median [interquartile range {IQR} 25% to 75%]). The percent deviations between the reads were 26.94±46.51/11.14 (0.04 to 0.24).

The global mean volumes (N=198) of the two confirmatory intrarater readings were 30.5 mL and 28.4 mL. The absolute volume (mL) differences between the two readings were 5.84±26.52/1.06 (0.24 to 4.3) (mean±SD/median [IQR 25% to 75%]). The percent deviations between the reads were 48.37±66.26/18.28 (0.07 to 0.51).

The global mean volumes (N=209) of the two interrater readings were 28.9 mL and 27.6 mL. The absolute volume (mL) differences between the reads were 8.69±22.76/1.31 (0.34 to 5.98) (mean±SD/median [IQR 25% to 75%]). The percent deviations between the reads were 67.04±68.87/35.64 (0.18 to 0.87).


*    Discussion
up arrowTop
up arrowAbstract
up arrowIntroduction
up arrowMethods
up arrowResults
*Discussion
down arrowReferences
 
The volume of injury, infarction, and ischemia has assumed an increasingly important role in stroke research in general and in stroke clinical trials in particular. Yet, relatively little attention has been paid to the reliability of volumetric measures and the contribution of measurement error to the overall variance of these outcomes in stroke studies. Because measurement error may potentially obscure true biologic effects and is potentially controllable by the investigator, it is important to understand the sources and magnitude of measurement error.

In the present study, technical variables that could affect lesion volume measurement such as the MRI scanner type, pulse sequence parameters, image processing, and analysis software were held constant. Because the intrarater reader’s experience performing stroke volume measurements with this software has been extensive over a period of years, the skill and consistency in measurement technique can be assumed to be constant over the period of the intrarater study. Thus, we have characterized the intrarater reproducibility independent of other sources of measurement error.

In principle, programs to measure lesion volumes in a completely automated fashion could eliminate this source of error. However, because regions of abnormality on these MRI sequences often lack sharp contrast boundaries with the normal brain and may overlap in signal intensity with nonpathologic structures, investigator judgment is needed to avoid inaccuracies. The interrater study attempts to quantify reader subjectivity, the main concern with interpretations of the images by multiple readers. To reduce the measurement error, the intrarater reader trained the three additional interrater readers in the same image processing techniques and software. The interrater readers were also required to be familiar with the clinical reading of serial stroke MRI scans. Furthermore, the patients and imaging protocol used in the interrater analysis were from the same time period, hospital, and stroke population as the intrarater analysis.

The intrarater repeatability seen for DWI was <1 mL at the acute and 3-hour time points and <2 mL at the 24-hour time point. The overall intrarater percent difference ranged from 2% to 5% for DWI. The interrater percent difference was <5% for acute DWI. For DWI, the error of the lesion volume measurement was caused by two main sources. There were six outliers readily identified by the Bland-Altman plots in the interrater DWI readings caused by reader differences in interpretation of subtle changes, differences in exclusion of sulcal areas, and in one case, misidentification of a chronic lesion as acute. Measurement error was also attributable to significantly smaller lesions seen for the interrater readings.

The overall intrarater MTT percent difference ranged from 1% to 6%. The intrarater repeatability coefficient seen for MTT was <1 mL at the 3-hour time point but <7 mL and <6 mL at the acute and 24-hour time points, respectively. The interrater repeatability coefficient seen for acute MTT was <1 mL. The interrater MTT percent difference was <1%. The larger standard deviations seen with MTT are in part attributable to the larger lesions present in this sequence and the lower spatial resolution in the perfusion sequence acquisitions. This was most evidenced by the four outliers in the interrater MTT readings, which are most readily identified by the Bland-Altman plots.

The intrarater repeatability coefficient seen for FLAIR was <0.6 mL and the overall percent difference of <1.5%. The most consistent lesion volume was seen with FLAIR attributable in part to the increased spatial resolution of this sequence compared with DWI and MTT as well as the resolution of edema at the chronic time point allowing for a more stable measurement. The interrater repeatability coefficient seen for FLAIR was <2 mL but an overall percent difference of <50%. The decreased consistency in the interrater lesion volume measurement with FLAIR was mainly attributable to the significantly smaller lesions seen (approximately 6 mL on average). The two outliers displayed in the corresponding Bland-Altman plot in the interrater FLAIR readings were caused by one reader, including areas of cavitations that were excluded by the other reader. Overall, for the DWI, MTT, and FLAIR volumes by requiring a minimum lesion size, ie, 10 mL, the reproducibility of the measurements, especially percent deviation, is improved.

Quantitative volumes by a single expert reader can provide highly consistent and repeatable results of lesion volumes on DWI, MTT, and FLAIR. The variability seen with the interrater measurements increased compared with the intrarater measurements; however, they also demonstrated consistent and repeatable results. This study provides a resource of volumetric statistics from a large homogenous stroke population to researchers interested in potential selection of particular sequences and series of time points for further investigation.


*    Acknowledgments
 
Sources of Funding

This research was supported by the Division of Intramural Research of the National Institute of Neurological Disorders and Stroke, National Institutes of Health.

Disclosures

None.

Received July 1, 2006; revision received August 3, 2006; accepted August 15, 2006.


*    References
up arrowTop
up arrowAbstract
up arrowIntroduction
up arrowMethods
up arrowResults
up arrowDiscussion
*References
 
1. Warach S. Tissue viability thresholds in acute stroke: the 4-factor model. Stroke. 2001; 32: 2460–2461.[Free Full Text]

2. Thijs VN, Lansberg MG, Beaulieu C, Marks MP, Moseley ME, Albers GW. Is early ischemic lesion volume on diffusion-weighted imaging an independent predictor of stroke outcome? A multivariate analysis. Stroke. 2000; 31: 2597–2602.[Abstract/Free Full Text]

3. Baird AE, Dambrosia J, Janket S, Eichbaum Q, Chaves C, Silver B, Barber PA, Parsons M, Darby D, Davis S, Caplan LR, Edelman RR, Warach S. A three-item scale for the early prediction of stroke recovery. Lancet. 2001; 357: 2095–2099.[CrossRef][Medline] [Order article via Infotrieve]

4. Arenillas JF, Rovira A, Molina CA, Grive E, Montaner J, Alvarez-Sabin J. Prediction of early neurological deterioration using diffusion- and perfusion-weighted imaging in hyperacute middle cerebral artery ischemic stroke. Stroke. 2002; 33: 2197–2205.[Abstract/Free Full Text]

5. Beaulieu C, de Crespigny A, Tong DC, Moseley ME, Albers GW, Marks MP. Longitudinal magnetic resonance imaging study of perfusion and diffusion in stroke: evolution of lesion volume and correlation with clinical outcome. Ann Neurol. 1999; 46: 568–578.[CrossRef][Medline] [Order article via Infotrieve]

6. van Everdingen KJ, van der Grond J, Kappelle LJ, Ramos LMP, Mali WPTM. Diffusion-weighted magnetic resonance imaging in acute stroke. Stroke. 1998; 29: 1783–1790.[Abstract/Free Full Text]

7. Warach S, Pettigrew LW, Dashe JF, Pullicino P, Lefkowitz DM, Sabounjian L, Harnett K, Schwiderski U, Gammans R; Citicholine 010 Investigators. Effect of citicholine on ischemic lesions as measured by diffusion-weighted magnetic resonance imaging. Ann Neurol. 2000; 48: 713–722.[CrossRef][Medline] [Order article via Infotrieve]

8. Hacke W, Albers G, Al-Rawi Y, Bogousslavsky J, Davalos A, Eliasziw M, Fischer M, Furlan A, Kaste M, Lees KR, Soehngen M, Warach S. The Desmoteplase in Acute Ischemic Stroke Trial (DIAS). Stroke. 2005; 36: 66–73.[Abstract/Free Full Text]

9. Warach S, Kaufman D, Chiu D, Devlin T, Luby M, Rashid A, Clayton L, Kaste M, Lees KR, Sacco R, Fisher M. Effect of the glycine antagonist gavestinel on cerebral infarcts in acute stroke patients, a randomized placebo-controlled trial: the GAIN MRI Substudy. Cerebrovasc Dis. 2006; 21: 106–111.[Medline] [Order article via Infotrieve]

10. Martel AL, Allder SJ, Delay GS, Morgan PS, Moody AR. Measurement of infarct volume in stroke patients using adaptive segmentation of diffusion weighted MR images. Medical Image Computing and Computer-Assisted Intervention—MICCAI ’99, LNCS. 1999; 1679: 22–31.

11. Ritzl A, Meisel S, Wittsack HJ, Fink GR, Siebler M, Modder U, Seitz RJ. Development of brain infarct volume as assessed by magnetic resonance imaging (MRI): follow-up of diffusion-weighted MRI lesions. J Magn Reson Imaging. 2004; 20: 201–207.[CrossRef][Medline] [Order article via Infotrieve]

12. Baird AE, Benfield A, Schlaug G, Siewert B, Lovblad KO, Edelman RR, Warach S. Enlargement of human cerebral ischemic lesion volumes measured by diffusion-weighted magnetic resonance imaging. Ann Neurol. 1997; 41: 581–589.[CrossRef][Medline] [Order article via Infotrieve]

13. Barber PA, Darby DG, Desmond PM, Yang Q, Gerraty RP, Jolley D, Donnan GA, Tress BM, Davis SM. Prediction of stroke outcome with echoplanar perfusion- and diffusion-weighted MRI. Neurology. 1998; 51: 418–426.[Abstract/Free Full Text]

14. Chalela JA, Kang DW, Luby M, Ezzeddine M, Latour LL, Todd JW, Dunn B, Warach S. Early magnetic resonance imaging findings in patients receiving tissue plasminogen activator predict outcome: insights into the pathophysiology of acute stroke in the thrombolysis era. Ann Neurol. 2004; 55: 105–112.[CrossRef][Medline] [Order article via Infotrieve]

15. Lee DK, Kim JS, Kwon SU, Yoo SH, Kang DW. Lesion patterns and stroke mechanism in atherosclerotic middle cerebral artery disease: early diffusion-weighted imaging study. Stroke. 2005; 36: 2583–2588.[Abstract/Free Full Text]

16. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986; I: 307–310.




This article has been cited by other articles:


Home page
StrokeHome page
K. M. Barrett, Y. H. Ding, D. P. Wagner, D. F. Kallmes, K. C. Johnston, and for the ASAP Investigators
Change in Diffusion-Weighted Imaging Infarct Volume Predicts Neurologic Outcome at 90 Days: Results of the Acute Stroke Accurate Prediction (ASAP) Trial Serial Imaging Substudy
Stroke, July 1, 2009; 40(7): 2422 - 2427.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Neuroradiol.Home page
P.G. Kranz and J.D. Eastwood
Does Diffusion-Weighted Imaging Represent the Ischemic Core? An Evidence-Based Systematic Review
AJNR Am. J. Neuroradiol., June 1, 2009; 30(6): 1206 - 1212.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
M. Ebinger, S. Christensen, D. A. De Silva, M. W. Parsons, C. R. Levi, K. S. Butcher, C. F. Bladin, P. A. Barber, G. A. Donnan, S. M. Davis, et al.
Expediting MRI-Based Proof-of-Concept Stroke Trials Using an Earlier Imaging End Point
Stroke, April 1, 2009; 40(4): 1353 - 1358.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
M. R. Gaudinski, E. C. Henning, A. Miracle, M. Luby, S. Warach, and L. L. Latour
Establishing Final Infarct Volume: Stroke Lesion Evolution Past 30 Days Is Insignificant
Stroke, October 1, 2008; 39(10): 2765 - 2768.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
P. D. Schellinger and J. B. Fiebach
Establishing Final Infarct Volume: Stroke Lesion Evolution Past 30 Days Is Insignificant
Stroke, October 1, 2008; 39(10): 2693 - 2694.
[Full Text] [PDF]


Home page
StrokeHome page
S. B. Coutts, M. D. Hill, C. R. Campos, Y. B. Choi, S. Subramaniam, J. C. Kosior, A. M. Demchuk, and for the VISION study group
Recurrent Events in Transient Ischemic Attack and Minor Stroke: What Events Are Happening and to Which Patients?
Stroke, September 1, 2008; 39(9): 2461 - 2466.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
H. Ay, E. M. Arsava, M. Vangel, B. Oner, M. Zhu, O. Wu, A. Singhal, W. J. Koroshetz, and A. G. Sorensen
Interexaminer Difference in Infarct Volume Measurements on MRI: A Source of Variance in Stroke Research
Stroke, April 1, 2008; 39(4): 1171 - 1176.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
K. Butcher, M. Parsons, L. Allport, S. B. Lee, P. A. Barber, B. Tress, G. A. Donnan, S. M. Davis, and for the EPITHET Investigators
Rapid Assessment of Perfusion-Diffusion Mismatch
Stroke, January 1, 2008; 39(1): 75 - 81.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
J. G. Merino, L. L. Latour, J. W. Todd, M. Luby, P. D. Schellinger, D.-W. Kang, and S. Warach
Lesion Volume Change After Treatment With Tissue Plasminogen Activator Can Discriminate Clinical Responders From Nonresponders
Stroke, November 1, 2007; 38(11): 2919 - 2923.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Correction (v38,p452)
Right arrow All Versions of this Article:
37/12/2951    most recent
01.STR.0000249416.77132.1av1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrowRequest Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Luby, M.
Right arrow Articles by Warach, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Luby, M.
Right arrow Articles by Warach, S.
Related Collections
Right arrow CT and MRI
Right arrow Acute Cerebral Infarction
Right arrow Thrombolysis