(Stroke. 1999;30:61-65.)
© 1999 American Heart Association, Inc.
Original Contributions |
From the Ultrasonic Angiology Laboratory, Department of Radiological Sciences, and Department of Surgery (P.R.T.), Guy's Campus, King's College, London, UK.
Correspondence to Dr T.S. Padayachee, Ultrasonic Angiology Laboratory, 2nd Floor, New Guy's House, Guy's Hospital, London SE1 9RT, UK. E-mail s.padayachee{at}umds.ac.uk
| Abstract |
|---|
|
|
|---|
MethodsGrading of plaque morphology from ultrasound images, stored both digitally and to hard copy, was performed by 2 classification schemes. Interobserver agreement was determined by 4 observers. Within-observer agreement was performed at intervals for up to 6 months. Accuracy of the 2 methods was determined by comparison with histology.
ResultsWithin- and between-observer agreement was moderate to
good for full-color digital image analyses, with pooled
values of
p=0.49±0.10 and
p=0.62±0.07
for the 2-category method and
p=0.53±0.06 and
p=0.52±0.05 for the 4-category method, respectively.
Hard copy data analyses gave lower
values. The more
experienced observers produced higher within-observer agreements and
higher correlation with histology.
ConclusionsReproducible grading of ultrasound images is not consistently achievable among experienced observers, and within-observer agreement may vary with time. The current subjective ultrasound characterization of carotid plaque morphology used in clinical trials may be associated with unacceptable levels of reproducibility in some centers. Variability between observers may be reduced by using the simpler 2-category grading of plaque morphology to interrogate full-color digitally stored images. Observer agreement should be audited regularly.
Key Words: atherosclerosis carotid stenosis ultrasonography, Doppler, duplex
| Introduction |
|---|
|
|
|---|
Ultrasound has been used to evaluate plaque morphology, and, more recently, multicenter trials investigating patients with cerebrovascular disease have included plaque characterization in their protocols.9 10 The Asymptomatic Carotid Surgery Trial used a subjective method for ultrasound plaque characterization based on the approach described by Gray-Weale et al.11 The Asymptomatic Carotid Stenosis and Risk of Stroke study10 used the Gray-Weale method with additional categories for calcified and ulcerated plaques, while the European Carotid Plaque Study Group commented on both plaque echogenicity and surface characteristics.12 In all these studies, grading of plaque morphology was largely based on the ultrasound gray scale appearance, which was assessed subjectively by visual inspection of ultrasound images.
Thus, a range of criteria and classification schemes for assessing plaque morphology are currently in use, but there is no consensus on protocol or methodology between trials.11 12 13 14 15 Subjective grading has been compared with histology; however, a wide range of agreements has been reported.11 13 14 15 16 17 18 Furthermore, there are few studies on reproducibility, which is very important when amalgamating data in multicenter trials.19 In this study we determined the accuracy and intraobserver and interobserver agreement of plaque characterization performed by subjective, visual inspection of ultrasound images as currently used in clinical trials. We also investigated the influence of image storage media on image interpretation and reproducibility issues.
| Subjects and Methods |
|---|
|
|
|---|
Ultrasound images were subjectively graded by 2 methods (Figure
).
The first method was according to the
relative contribution of echogenic (high-intensity) and echolucent
(low-intensity) material using the classification by Gray-Weale et
al,11 as follows: type I, predominantly echolucent plaque
with a thin echogenic cap; type II, substantially echolucent lesions
with small areas of echogenicity; type III, predominantly echogenic
lesions with small areas of echolucency; and type IV, uniformly
echogenic lesions (equivalent to homogeneous).
|
The second subjective method classified the plaques as either homogeneous or heterogeneous. Homogeneous plaques had a relatively uniform texture and, compared with the adjacent adventitia, contained uniform echoes that were medium to high level. Heterogeneous plaque had mixed high-, medium-, and low-level echoes and contained at least one well-defined focal echolucent area. For this study, plaque echogenicity was compared with blood and adjacent adventitia. Low echoes were defined as those closely approaching that of blood, and medium to high echoes were those similar to or greater than adventitia or adjacent soft tissue. Plaques that were obscured because of acoustic shadowing were not included in this study.
Four observers examined the hard copy and digital images independently and were blinded to patient identification. The observers were ultrasound technologists with scanning experience ranging from 4 to 15 years and were listed according to their experience. All observers underwent an extensive training session before analysis of the images for this study to standardize the concepts of echogenicity and echolucency. Images were reexamined after a 1-month interval to determine intraobserver variability of the classification methods. Two observers reanalyzed images at 1, 5, and 6 months to determine short-term and long-term variability. All images were assessed off-line either on the same digital analysis system (8 bit, 256 gray scale levels) or on hard copy (Sony, Mavigraph color printer). The ambient lighting, magnification, and gain level of the image screen were maintained constant throughout the analysis.
The excised carotid specimens were stored in formalin, and
histological examination was performed within 1 week of
surgery (range, 1 to 7 days). Histological assessment
of the plaque in the ICA was performed subsequently, and plaques were
graded as homogeneous or heterogeneous and
divided into 4 histological categories equivalent to
the Gray-Weale classification. Statistical analysis was by
,
which determined the conformity in multicategorical data and simple
statistics for calculation of percent agreement and accuracy.
values of <0.20 indicated poor agreement, 0.21 to 0.40 fair agreement,
0.41 to 0.60 moderate agreement, 0.61 to 0.80 good agreement, and 0.81
to 1.0 very good agreement. Pooled
(
p) was
calculated as described by Fleiss20 and
represented the overall interobserver and intraobserver
for all modality and classification schemes.
| Results |
|---|
|
|
|---|
=0.08
to 0.66) for the 2-category classification and gave an overall
classification of moderate according to the
p
value (Table 1
p=0.65±0.08). The 4-category classification
showed poor (
p=0.15±0.06) or fair
(
p=0.34±0.06) agreement when B-mode or
combined B-mode and color images were analyzed, respectively.
The range of agreement values for the 4 observers was wide and tended
to improve with the more experienced observers.
|
Digital Images
The intraobserver agreement for grading plaques from digital
images showed increased
p values, indicating
moderate agreement for B-mode alone and with color for both the
2-category (
p=0.45±0.08 and 0.49±0.10) and
4-category (
p=0.48±0.06 and 0.53±0.06)
classifications (Table 1
).
Plaque morphology grading was repeated by observers 1 and 2 over a
range of time periods (1 month, 5 months, and 6 months) for full-color
images and demonstrated the following values for
, standard error,
and agreement for the 2-category classification: observer 1: 0.89±0.11
(96%), 0.65±0.18 (88%), 0.50±0.21 (85%); observer 2: 0.49±0.18
(77%), 0.42±0.16 (69%), 0.29±0.15 (62%).
Although the agreements were higher for observer 1, both observers showed a range of agreements over the time period studied.
Interobserver Agreement
Interobserver agreement between the 4 observers for the hard copy
data is summarized in Table 2
. For B-mode
images alone,
p values indicated moderate
agreement (
p=0.52±0.07) for the 2-category
approach, and, surprisingly, agreements were reduced to fair after the
addition of color information (
p=0.28±0.08).
Values for the 4-category system showed fair agreements with B-mode
alone or with color (
p=0.28±0.05 and
0.24±0.05) .
|
Interobserver values for the digital images showed good to moderate
agreement for the combined B-mode and color images according to both
the 2-category (
p=0.62±0.07) and 4-category
(
p=0.52±0.05) classification schemes (Table 2
). This was converse to the agreements demonstrated for the
hard copy data, in which color increased variability between observers.
The highest overall agreement was observed with the 2-category method
for grading color flow mapping images displayed in digital format.
Accuracy
The correlation between histology and grading of plaque morphology
from combined B-mode color flow images was poor to fair according to
for the 4 observers (Table 3
). The
overall accuracy of the Gray-Weale approach compared with histology was
30±10%, whereas the 2-category classification showed improved
accuracy of 72±12%.
|
| Discussion |
|---|
|
|
|---|
The present study was conducted with standardized protocols for image acquisition and standardized hard copy and digital image displays; however, a reproducibility study did not fully support previous findings. Intraobserver agreement for the 2-category classification achieved only moderate agreement for digital B-mode images. The addition of color flow information improved delineation of the plaque border and increased agreement for hard copy data but not for digital images. The 4-category classification showed moderate agreement for digital images but poor to fair agreement for hard copy data. The reduced agreement for the latter may be due to the reduced contrast present in hard copy images. This would have the greatest effect on the 4-category classification, which required more subtle interpretation of the relative contribution of echogenic and echolucent material.
Intraobserver agreement was not consistent when studied over a 6-month period, indicating that a single evaluation of agreement is not necessarily representative of the individual's reproducibility in grading images. Although all observers in this study were experienced in ultrasound imaging, only 1 observer achieved consistently good agreement levels after a training period of 3 months. A recent study demonstrated similar variability in within-observer agreement for grading plaque morphology but higher values for between-observer agreement.22
Interobserver variability showed a wide range of agreement values, with good agreement being achieved for color flow digital images graded according to the 2-category classification. The somewhat puzzling finding of lower intraobserver agreements compared with interobserver agreements may be explained by the variation in intraobserver agreement with time. Correlation with histology was disappointing, with almost no correlation in some instances; interestingly, higher values of accuracy were noted for the more experienced observers.
These findings suggest that the previously reported high values of agreement within and between observers are not easily reproducible. This may be due to one of several factors: the subjective nature of the classification process; the number of images that are routinely studied; the experience of the observers; the plethora of classification schemes; the variability in image storage and display media used; variations in image quality; the statistics used; and the distribution of plaque types within each study. This is illustrated by the improvements in agreement achieved by using the more simplistic 2-category classification and also by interrogating full-color digital data compared with hard copy images.
Comparison of results between studies is limited by the different
statistical approaches used. For example, simple percentage values may
suggest a good test, while more detailed
analysis will
reveal inadequacies. In a study on intravascular ultrasound images of
coronary arteries, a high overall agreement of 95% was
reported; however,
values indicated good ability for identifying
hard (
=0.67) and soft (
=0.61) lesions but an inability to detect
lipid.23
The implications of this study are that results of plaque characterization in clinical studies and multicenter trials could be seriously flawed. Improvements to clinical trials could be achieved by standardizing image parameters, using test objects with a range of echogenicity and echolucency to eliminate individual scanner variations, interrogating full-color digitally stored images, and using the simpler 2-category classification scheme to grade plaque morphology. Observers should not participate in multicenter trials using subjective analysis of ultrasound images until they have achieved high levels of intraobserver agreement that are verified throughout the trial period.
| Acknowledgments |
|---|
Received August 5, 1998; revision received September 28, 1998; accepted October 12, 1998.
| References |
|---|
|
|
|---|
2. European Carotid Surgery Trialists' Collaborative Group. MRC European Carotid Surgery Trial: interim results for symptomatic patients with severe (7099%) or with mild (029%) carotid stenosis. Lancet. 1991;337:12351244.[Medline] [Order article via Infotrieve]
3. Mattos MA, Hodgson KJ, Faught WE, Mansour A, Barkmeier LD, Ramsey DE, Sumner DS, Turnipseed WD, Glover JL, Tsapogas MJ, Sicard GA, DeBord JR, Baker WH. Carotid endarterectomy without angiography: is color-flow duplex scanning sufficient? Surgery. 1994;116:776783.[Medline] [Order article via Infotrieve]
4. Imparato AM, Riles TS, Mintzer R, Baumann FG. The importance of haemorrhage in the relationship between gross morphologic characteristics and cerebral symptoms in 376 carotid artery plaques. Ann Surg. 1983;197:195203.[Medline] [Order article via Infotrieve]
5. Fuster V, Stein B, Ambrose JA, Badimon L, Badimon JJ, Chesebro JH. Atherosclerotic plaque rupture and thrombosis: evolving concepts. Circulation. 1990;82(suppl II):II-47II-49.
6. Falk E. Why do plaques rupture? Circulation. 1992;86(suppl III):III-30III-42.
7. Carr S, Farb A, Pearce WH, Virmani R, Yao JST. Atherosclerotic plaque rupture in symptomatic carotid artery stenosis. J Vasc Surg. 1996;23:755766.
8.
Sitzer M, Muller W, Siebler M, Hort W, Kniemeyer HW,
Janke L, Steinmetz H. Plaque ulceration and lumen thrombus are the main
sources of cerebral microemboli in high-grade internal carotid artery
stenosis. Stroke. 1995;26:12311233.
9. Haliday AW, Thomas D, Mansfield A. The Asymptomatic Carotid Surgery Trial (ACST): rationale and design: Steering Committee. Eur J Vasc Surg. 1994;6:703710.
10. Nicolaides AN. Asymptomatic Carotid Stenosis and Risk of Stroke: identification of a high risk group (ACSRS): a natural history study. Int Angiol. 1995;14:2123.
11. Gray-Weale AC, Graham JC, Burnett JR, Bryne K, Lusby RJ. Carotid artery atheroma: comparison of preoperative B-mode ultrasound appearance with carotid endarterectomy specimen pathology. J Cardiovasc Surg. 1988;2:676681.
12. European Carotid Plaque Study Group. Carotid artery plaque composition: relationship to clinical presentation and ultrasound B-mode imaging. J Vasc Endovasc Surg. 1995;10:2330.
13.
Bluth EI, Kay D, Merritt CRB, Sullivan M, Farr G, Mills
NL, Foreman M, Sloan K, Schlater M, Stewart J. Sonographic
characterization of carotid plaque: detection of hemorrhage.
AJR Am J Roentgenol. 1986;146:10611065.
14. Reilly LM, Lusby RJ, Hughes L, Ferrell LD, Stoney RJ, Ehrenfeld WK. Carotid plaque histology using real-time ultrasonography. Am J Surg. 1983;146:188193.[Medline] [Order article via Infotrieve]
15. Hatsukami TS, Thackray BD, Primozich JF, Ferguson MS, Burns DH, Beach KW, Detmer PR, Alpers C, Gordon D, Strandness DE. Echolucent regions in the carotid artery: preliminary analysis comparing three-dimensional histologic reconstructions to sonographic findings. Ultrasound Med Biol. 1994;20:743749.[Medline] [Order article via Infotrieve]
16.
O'Donnell FT, Erdoes L, Mackey WC, McCullough J,
Sheperd A, Heggerick P, Isner J, Callow AD. Correlation of B-mode
ultrasound imaging and arteriography with pathologic findings at
carotid endarterectomy. Arch Surg. 1985;120:443449.
17. Ratcliffe DA, Gallagher PJ, Hames TK, Humphries KN, Webster JHH, Chant ADB. Characterisation of carotid artery disease: comparison of duplex scanning with histology. Ultrasound Med Biol. 1985;11:835840.[Medline] [Order article via Infotrieve]
18. Bock RW, Lusby RJ. Carotid plaque morphology and interpretation of the echolucent lesion. In: Labs KH, Jager KA, FitzGerald DE, Woodcock JP, Neuerberg-Heusler D, eds. Diagnostic Vascular Ultrasound. London, UK: Edward Arnold; 1992;21:254263.
19. Geroulakos G, Ramaswarmi G, Nicolaides A, James K, Labropoulos N, Belcaro G, Holloway M. Characterization of symptomatic and asymptomatic carotid plaques using high resolution real-time ultrasonography. Br J Surg. 1993;80:12742377.[Medline] [Order article via Infotrieve]
20. Fleiss JL. Statistical Methods for Rates and Proportions. 2nd ed. New York, NY: John Wiley & Sons Inc; 1981.
21.
Eliaziw M, Rankin RN, Fox AJ, Haynes RB, Barnett HJM,
for the North American Symptomatic Carotid
Endarterectomy Trial (NASCET) Group. Accuracy and
prognostic consequences of ultrasonography in identifying severe
carotid artery stenosis. Stroke. 1995;26:17471752.
22.
Joakimsen O, Bonaa KH, Stensland-Bugge E.
Reproducibility of ultrasound assessment of carotid plaque occurrence,
thickness, and morphology: the Tromso Study. Stroke. 1998;28:22012207.
23. Van der Lugt A, Gussenhoven EJ, Pasterkamp G, Bom N, Posthuma DJ, Stijnen T. Interobserver reproducibility of qualitative and quantitative analysis of intravascular ultrasound images before and after peripheral balloon angioplasty. Ultrasound Med Biol. 1996;22:399404.[Medline] [Order article via Infotrieve]
This article has been cited by other articles:
![]() |
E. Touze, J.-F. Toussaint, J. Coste, E. Schmitt, F. Bonneville, P. Vandermarcq, J.-Y. Gauvrit, F. Douvrin, J.-F. Meder, J.-L. Mas, et al. Reproducibility of High-Resolution MRI for the Identification and the Quantification of Carotid Atherosclerotic Plaque Components: Consequences for Prognosis Studies and Therapeutic Trials Stroke, June 1, 2007; 38(6): 1812 - 1819. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Reiter, R. Horvat, S. Puchner, W. Rinner, P. Polterauer, J. Lammer, E. Minar, and R.A Bucek Plaque Imaging of the Internal Carotid Artery--Correlation of B-Flow Imaging with Histopathology AJNR Am. J. Neuroradiol., January 1, 2007; 28(1): 122 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Johnsen, E. B. Mathiesen, E. Fosse, O. Joakimsen, E. Stensland-Bugge, I. Njolstad, and E. Arnesen Elevated High-Density Lipoprotein Cholesterol Levels Are Protective Against Plaque Progression: A Follow-Up Study of 1952 Persons With Carotid Atherosclerosis The Tromso Study Circulation, July 26, 2005; 112(4): 498 - 504. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ouhlous, H. Z. Flach, T. T. de Weert, J. M. Hendriks, M. R. H. M. van Sambeek, D. W. J. Dippel, P. M. T. Pattynama, and A. van der Lugt Carotid Plaque Composition and Cerebral Infarction: MR Imaging Study AJNR Am. J. Neuroradiol., May 1, 2005; 26(5): 1044 - 1049. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.K. Lovett, J.N.E. Redgrave, and P.M. Rothwell A Critical Appraisal of the Performance, Reporting, and Interpretation of Studies Comparing Carotid Plaque Imaging With Histology Stroke, May 1, 2005; 36(5): 1085 - 1091. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sztajzel, S. Momjian, I. Momjian-Mayor, N. Murith, K. Djebaili, G. Boissard, M. Comelli, and G. Pizolatto Stratified Gray-Scale Median Analysis and Color Mapping of the Carotid Plaque: Correlation With Endarterectomy Specimen Histology of 28 Patients Stroke, April 1, 2005; 36(4): 741 - 745. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Ehlermann, W. Mirau, J. Jahn, A. Remppis, and A. Sheikhzadeh Predictive Value of Inflammatory and Hemostatic Parameters, Atherosclerotic Risk Factors, and Chest X-Ray for Aortic Arch Atheromatosis Stroke, January 1, 2004; 35(1): 34 - 39. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Denzel, K. Balzer, K.-M. Muller, F. Fellner, C. Fellner, and W. Lang Relative Value of Normalized Sonographic In Vitro Analysis of Arteriosclerotic Plaques of Internal Carotid Artery Stroke, August 1, 2003; 34(8): 1901 - 1906. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. Rothwell For Severe Carotid Stenosis Found on Ultrasound, Further Arterial Evaluation Prior to Carotid Endarterectomy Is Unnecessary: The Argument Against Stroke, July 1, 2003; 34(7): 1817 - 1819. [Full Text] [PDF] |
||||
![]() |
L. J. Walker, A. Ismail, W. McMeekin, D. Lambert, A. D. Mendelow, and D. Birchall Computed Tomography Angiography for the Evaluation of Carotid Atherosclerotic Plaque: Correlation With Histopathology of Endarterectomy Specimens Stroke, April 1, 2002; 33(4): 977 - 981. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. Tegos, K. J. Kalomiris, M. M. Sabetai, E. Kalodiki, and A. N. Nicolaides Significance of Sonographic Tissue and Surface Characteristics of Carotid Plaques AJNR Am. J. Neuroradiol., September 1, 2001; 22(8): 1605 - 1612. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.M. Sabetai, T.J. Tegos, A.N. Nicolaides, S. Dhanjil, G.J. Pare, and J.M. Stevens Reproducibility of Computer-Quantified Carotid Plaque Echogenicity : Can We Overcome the Subjectivity? Stroke, September 1, 2000; 31(9): 2189 - 2196. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. G. Worthley, G. Helft, V. Fuster, Z. A. Fayad, O. J. Rodriguez, A. G. Zaman, J. T. Fallon, and J. J. Badimon Noninvasive In Vivo Magnetic Resonance Imaging of Experimental Coronary Artery Lesions in a Porcine Model Circulation, June 27, 2000; 101(25): 2956 - 2961. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Yuan, L. M. Mitsumori, K. W. Beach, and K. R. Maravilla Carotid Atherosclerotic Plaque: Noninvasive MR Characterization and Identification of Vulnerable Lesions Radiology, November 1, 2001; 221(2): 285 - 299. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Stroke Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 1999 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |