Abstract
The purpose of this study was to investigate if experienced readers differ when matching polyps shown by both CT colonography (CTC) and optical colonoscopy (OC) and to explore the reasons for discrepancy. Twenty-eight CTC cases with corresponding OC were presented to eight experienced CTC readers. Cases represented a broad spectrum of findings, not completely fulfilling typical matching criteria. In 21 cases there was a single polyp on CTC and OC; in seven there were multiple polyps. Agreement between readers for matching was analyzed. For the 21 single-polyp cases, the number of correct matches per reader varied from 13 to 19. Almost complete agreement between readers was observed in 15 cases (71%), but substantial discrepancy was found for the remaining six (29%) probably due to large perceived differences in polyp size between CT and OC. Readers were able to match between 27 (71%) and 35 (92%) of the 38 CTC detected polyps in the seven cases with multiple polyps. Experienced CTC readers agree to a considerable extent when matching polyps between CTC and subsequent OC, but non-negligible disagreement exists.
Keywords: CT colonography, Colorectal neoplasia, Colorectal polyps, Polyp matching, Colonoscopy
Introduction
Computed tomography colonography (CTC) is an established diagnostic technique for both symptomatic and screening patients [1]. The diagnostic performance of CTC has generally necessitated comparison between CTC and subsequent optical colonoscopy (OC) [2]. Such comparisons are often performed by an experienced radiologist who uses prespecified criteria to match polyps identified by CTC with those found at subsequent OC [2, 3].
A variety of matching criteria have been described, usually based on the location, size, and morphology of polyps [4–8]. For example, a correct matching may be assumed when the polyp identified by CTC is found in the same or adjacent colonic segment as the polyp detected by OC. For convenience the colon is usually divided into six segments: caecum, ascending, transverse and descending colon, sigmoid and rectum [2, 3]. For size matching a frequently described criterion stipulates that the CTC polyp must be within 50% of the diameter measured at colonoscopy. Matching based on general morphology may also be performed.
However, observers using identical criteria may nevertheless match different polyps because the matching procedure is subjective and requires interpretation of both CTC and colonoscopy data. Furthermore, variation in the matching criteria stipulated by different researchers hinders comparisons between different studies.
The purpose of our study was to investigate to what extent experienced readers differ when matching polyps between CTC and OC and to explore the reasons underpinning any differences. Ultimately we aimed to develop criteria to minimize matching disagreement with a view to improve study methodology and facilitate interstudy comparisons.
Methods
Eight highly experienced CTC researchers from six centers in Europe and the USA participated. We administered a questionnaire to document their current criteria for polyp matching. To investigate matching in daily practice, the eight readers were asked to match preselected cases. Reader experience varied from 250 to over 3,000 CTC interpretations and between 100 and 3,000 CTC cases with corresponding OC.
Questionnaire
The questionnaire presented multiple choice questions relating to matching criteria used by readers for their research studies (Table 1). Options were formulated based on descriptions of matching criteria from the literature [4–8].
Table 1.
Patients
A radiology researcher (M. L.) selected 28 cases from two research databases of 170 surveillance patients and 240 fecal occult blood test positive patients who had undergone both CTC and subsequent colonoscopy. These studies had been approved by the local Medical Ethics Committee and informed consent was obtained from all participants.
All CTC examinations had been read prospectively by one of four experienced observers, each with at least 100 CTC interpretations with colonoscopic verification. Observers had marked any polyp and indicated the morphology, size, location, and their confidence.
Colonoscopy with segmental unblinding was performed subsequently by a gastroenterologist, gastroenterology resident, or gastroenterology nurse under supervision. Maximal polyp diameter was estimated by using an opened biopsy forceps and in some cases with a linear measure probe additionally (Olympus America). All colonoscopies were videotaped starting from the caecum.
Case selection
As the present study aimed at evaluating concordance when matching polyps between CTC and OC, selection was biased towards cases likely to prove challenging. A research fellow experienced in matching (>250 matched CTC and OC studies) selected cases with polyps that failed to meet the typical matching criteria specified by the literature [4–8]. For example, a polyp at CTC whose location apparently differed by two segments or more from the location suggested by colonoscopy. Only technically adequate CTC examinations were selected so as not to confound matching, by insufficient distension, for example.
Twenty-one cases were selected where a single polyp at CTC and OC had to be matched by the experienced reader. To evaluate a broad spectrum of potential matching scenarios, in 13 cases the CTC and colonoscopy data were purposely perturbed so that two different patients were combined. In this way, different morphologies and/or locations could be presented to the reader. Figure 1 illustrates examples of three cases. Seven other cases were selected that had multiple polyps at CTC and/or OC. Again, difficult cases were purposely selected whose polyps could not necessarily be matched using established criteria.
Reviewing matching cases
All readers performed the observations at their own department and were free from clinical commitments during the matching procedure. Readers were free to use their own visualization software to read cases, but a laptop with View Forum software (Version 6.2, Philips, Best, Netherlands) was also available. Polyps initially found by the CTC observer (in the original research study) were presented to the readers by a researcher with information on morphology, size, location, and certainty of diagnosis scored by the observer. The experienced readers were able to remeasure the polyps if they wished. Colonoscopic information was also available to the readers: colonoscopy videos, diameter information, and location and morphology.
Polyp matching
Readers completed a data form for each case. For the 21 single-polyp cases the readers indicated whether they considered the CTC and OC polyp a correct match. If readers believed the two polyps were not the same, the researcher queried their reasoning and classified each mismatch as due to disagreement relating to: (1) diameter; (2) morphology; (3) location.
In the seven multiple-polyp cases, readers were invited to indicate which of the polyps presented to them matched and which they believed did not. Again, reasons for mismatching were explored.
Statistical analysis
Because cases were preselected, only descriptive statistics were performed. A per case analysis was performed for the 21 single-polyp cases; for each case the number of the eight observers reporting a match was determined. For size and location matching we determined the number of instances in which a reader did not adhere to their own matching criteria, prespecified by them in the questionnaire.
Of the seven multiple-polyp cases, the number of matched polyps per size category (≥10 mm, 6 to <10 mm, or <6 mm) was counted per case and summarized per observer. Reference diameter was colonoscopic excepting nonmatched CTC polyps.
Results
Questionnaire
All readers stated that they normally use colonoscopy reports or a case record form completed during colonoscopy for polyp matching. Seven readers used video stills of colonoscopic polyps; four readers also used colonoscopy videos; two readers also employed pathology reports for polyp diameter information.
All readers normally required endoscopic information relating to segmental location, size, and morphology. Six readers defined a flat lesion as one whose width must be at least twice its height; two also stated that the polyp must protrude less than 3 mm from the mucosa. The remaining two readers exclusively used this latter definition for flat lesions. Three readers wished to have information about the distance of the polyp from the anus and two wished to have histology information to facilitate matching. Table 2 details the different criteria described by readers for matching.
Table 2.
Morphology | Size | Location |
---|---|---|
CTC polyp needs to have a similar appearance/shape as the colonoscopy polyp (8) | Size of the CTC polyp has to be within 50% of the colonoscopy polyp size (4) | Polyps are in the same or adjacent segment (6) |
And description of morphology by the gastroenterologist resembles that of the CTC observer (6) | Size of the CTC polyp can be maximally 50% smaller or 100% larger than the colonoscopy polyp (1) | Polyps are in the same segment (1) |
Size of the colonoscopy polyp has to be within 50% (1) or 40% (1) of the CTC polyp size | Polyps are within a reasonable distance (judged by the CTC radiologist who is performing the matching) (1) | |
Size of the CTC polyp has to be within 50% of the colonoscopy polyp size or size of the colonoscopy polyp has to be within 50% of the CTC polyp size (1) |
The numbers within parentheses indicate how many readers use the specific matching criterion
Single-polyp cases
Agreement amongst readers could concern agreement on the presence of a match as well as the lack of presence of a match in a case, both important aspects in matching. Disagreement amongst readers in a case means that about half of the readers concern the presence of a match and the other half not. In the 21 cases with a single polyp, readers considered a match between CTC and OC to be present in between 13 (62%) and 19 (90%) of cases, i.e., there was some disagreement as to whether a match between CTC and OC was possible or not due to a perceived unacceptable discrepancy for one or more of the typical matching criteria (size, location, and morphology).
To evaluate the magnitude of this disagreement we analyzed the per case agreement or disagreement. We then found that the readers agreed completely or almost completely in 15 of 21 cases with respect to the presence or lack of a match of CTC and colonoscopy findings. Complete agreement was present in five cases in whom all eight readers agreed on a match. Almost complete agreement on matching (i.e., seven of eight readers indicated a match) was present in seven cases, whereas almost complete agreement on the lack of matching was present in three cases. In six of the 21 cases, however, a considerable disagreement in matching was found. In five cases only four readers indicated a match, and in one case five readers indicated a match. Figure 2 indicates how many cases readers agreed and disagreed in matching the CTC and colonoscopy polyp.
To explore the rationale underpinning this disagreement we evaluated data separately for cases with location, size, and morphology discrepancies. In the five cases that were selected for segmental location difference between the CTC and colonoscopy polyp, there was a high matching agreement across readers. Nearly all readers refused to match polyps where the CTC and colonoscopy location differed by three or more adjacent segments.
Regarding the ten cases where diameter disagreements ostensibly prevented matching between the CTC and colonoscopy, in five cases nearly all readers in practice ignored diameter discrepancies of more than 100% between the CTC and colonoscopy polyp. In the other five cases, while diameter discrepancies of more than 100% again existed between polyps, only four of eight readers (in four cases) and five of eight readers (in one case) found matching possible, indicating poor agreement existed.
In cases with different morphology, agreement for matching was high except for a single case; case 21 demonstrated a fecal residue at CTC and a polyp at colonoscopy in which only four of eight readers performed a match between the fecal residue and a polyp.
Overall, 55 polyps (mean 6.9 polyps per reader) were matched by readers despite individual polyps not fulfilling the criteria for matching on the basis of diameter prespecified by each reader. Overall, 12 polyps (mean 1.5 polyps per reader) were matched despite not fulfilling criteria prespecified by readers for segmental location. Two polyps were matched despite not fulfilling both diameter and segmental criteria.
Multiple-polyp cases
The seven cases with multiple polyps had 11 CTC polyps and 12 colonoscopy polyps of 10 mm or larger, 18 CTC and 12 colonoscopy polyps of 6–9 mm, and nine CTC and 20 colonoscopy polyps smaller than 6 mm. The total number of polyps matched per reader varied from 27 to 35 (Table 3). In case 3, for example, one reader matched five polyps on CTC with OC, while another reader matched nine. For CTC polyps of 10 mm or larger, the number of matches per reader showed less variation, between 9 and 11 (Table 4). For polyps smaller than 6 mm the inter-reader variability was larger, with the number of matches varied from 7 to 14. The number of false negative CTC polyps of 10 mm or larger per reader ranged from one to three and the number of false positive CTC polyps of 10 mm or larger ranged from zero to two. Reasons for not matching CTC and colonoscopy polyps were mismatching due to location, size, and morphology.
Table 3.
Reader | ||||||||
---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
Case 1 | 10 | 9 | 10 | 9 | 10 | 8 | 10 | 11 |
Case 2 | 9 | 9 | 9 | 8 | 8 | 7 | 9 | 9 |
Case 3 | 7 | 8 | 8 | 8 | 8 | 5 | 7 | 9 |
Case 4 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 |
Case 5 | 2 | 4 | 3 | 3 | 3 | 3 | 2 | 3 |
Case 6 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
Case 7 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 |
Total | 31 | 34 | 33 | 31 | 33 | 27 | 32 | 35 |
The numbers of matched polyps (all sizes) are indicated per reader. The total number of matches per reader is presented in the last row
Table 4.
Reader | ||||||||
---|---|---|---|---|---|---|---|---|
1 | 2a | 3 | 4 | 5 | 6 | 7 | 8 | |
TP ≥ 10 mm | 11 | 11 | 11 | 10 | 9 | 10 | 11 | 10 |
TP 6–9 mm | 10 | 10 | 9 | 10 | 11 | 10 | 8 | 11 |
TP < 6 mm | 10 | 13 | 13 | 11 | 13 | 7 | 13 | 14 |
FN ≥ 10 mm | 1 | 1 | 1 | 2 | 3 | 2 | 1 | 2 |
FN 6–9 mm | 2 | 2 | 3 | 2 | 1 | 2 | 4 | 1 |
FN < 6 mm | 10 | 6 | 7 | 9 | 7 | 13 | 7 | 6 |
FP ≥ 10 mm | 0 | 0 | 0 | 1 | 1 | 2 | 0 | 0 |
FP 6–9 mm | 2 | 1 | 1 | 2 | 0 | 6 | 2 | 1 |
FP < 6 mm | 5 | 0 | 3 | 3 | 4 | 3 | 4 | 4 |
TP true positive polyp, i.e., a CTC polyp that was matched with a colonoscopy polyp; FN false negative polyp for CTC, i.e., a colonoscopy polyp that was not matched with a CTC polyp; FP false positive polyp for CTC, i.e., a CTC polyp that was not matched with a colonoscopy polyp
aReader 2 had interpreted one case wrongly and this case was therefore excluded
Observations
Six readers remeasured polyps on CTC when a large diameter difference was apparent between it and the OC polyp. To resolve this, four readers disagreed with the diameter recorded by the colonoscopist and reinterpreted the size of the OC polyp shown on video. To determine polyp location precisely, two readers thoroughly examined the colonoscopic video to clarify the colonic segment or location of the polyp compared to a fold. Four readers occasionally scrutinized the morphology of the polyp at video, especially in pedunculated cases (e.g., to determine stalk length) and cases with flat polyps.
Discussion
We have investigated if disagreement exists between experienced readers when attempting to match polyps identified by CTC and subsequent colonoscopy. Eight experienced CTC researchers apparently used similar matching criteria, based on polyp location, size, and morphology. Readers largely agreed when matching cases with single polyps. We found, however, substantial disagreement in a minority but non-negligible proportion of cases. Disagreement was also present in cases with multiple polyps but predominantly for the least relevant polyps, i.e., those smaller than 6 mm.
The CTC literature describes various matching criteria, based on expert opinion rather than an evidence-based approach. Evidence-based matching criteria are difficult to formulate because a robust reference standard for matching corresponding CTC and colonoscopy polyps poses very substantial methodological difficulties. We did not aim to validate matching criteria. Rather, we investigated how readers matched in practice and the level of disagreement between them.
While we found substantial agreement there was also non-negligible disagreement, predominantly due to a large perceived diameter difference between CTC and OC. This was not a constant observation, however, because some cases with similar discrepancies were matched by most readers. The reasons underpinning this observation were unclear, despite us asking readers for their rationale.
Discrepancy was also noted in those cases with multiple polyps. For the most clinically important polyps (≥10 mm), we observed minimal disagreement. In populations with a low prevalence of polyps, matching is less problematic because few polyps need be matched [6]. However, when the number of polyps per patient increases, matching will likely become less straightforward and we have demonstrated inter-reader disagreement.
After case matching, readers were asked for their matching criteria via a questionnaire. All readers reported practically identical matching criteria (described in the Introduction). While these criteria are apparently straightforward, in practice there are several problems. Difficulties when matching location exist because anatomical borders are ill-defined and colonoscopists frequently cannot locate the endoscope tip with precision [9]. In our study almost all readers took this into account and were prepared to match polyps that were not within the same or adjacent colonic segments.
Another problem exists when matching based on polyp diameter. Colonoscopic estimation of diameter is imprecise [10, 11]. CTC measurements are also variable but are probably more accurate [12–14]. We found that most readers remeasured CTC polyps and often redefined the colonoscopist’s assessment of diameter from the video provided. Hence apparently large diameter differences between polyps did not always preclude a match.
Matching of polyps based on morphology also differed between readers because judgment of morphology is subjective. While definitions of lesion morphology are clearly described [15], we found that readers often used different definitions for flat lesions at CTC.
Because we found that experienced readers did not always adhere to established matching criteria, we propose that disagreement is best resolved by consensus. At the very least, two readers would then have to consider whether a match between a polyp imaged by both CTC and OC was possible, which is likely to reduce error and uncertainty. Such an approach is inevitably time consuming and an alternative is to perform consensus matching only when polyps do not satisfy generally accepted criteria for matching. However, as we have stated, these criteria are not evidence-based and our study was not designed to provide such a base. However, we do propose a matching procedure (Fig. 3) suggesting consensus matching by at least two experienced readers where cases do not satisfy conventional matching criteria. Optimally, at least one observer should be a radiologist and the other a colonoscopist since both have different attributes.
A potential limitation of our study is that we did not present pathology reports or the histological diameter of excised polyps. It is however questionable whether these data would provide useful additional information since polyps often shrink after polypectomy due to electrosurgical tissue effects and vascular collapse [16]. Another limitation is that our cases were purposely biased towards difficult cases. This was done to magnify any inter-reader variation in a pragmatic manner. Although this approach was efficient, as a consequence it was impossible to calculate meaningful metrics applicable to real-world scenarios.
In summary, we found that experienced CTC readers agree to a considerable extent when matching polyps detected by CTC to those found at subsequent OC in difficult cases, but non-negligible disagreement exists. Such disagreement may explain data variation of some studies on the diagnostic accuracy of CTC. We suggest using a consensus to minimize disagreement when matching those cases that do not satisfy established matching criteria.
Acknowledgments
Open Access This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
References
- 1.Levin B, Lieberman DA, McFarland B et al (2008) Screening and surveillance for the early detection of colorectal cancer and adenomatous polyps, 2008: a joint guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. Gastroenterology 134(5):1570–1595 [DOI] [PubMed]
- 2.Dachman AH, Zalis ME (2004) Quality and consistency in CT colonography and research reporting. Radiology 230(2):319–323 [DOI] [PubMed]
- 3.Zalis ME, Barish MA, Choi JR et al (2005) CT colonography reporting and data system: a consensus proposal. Radiology 236(1):3–9 [DOI] [PubMed]
- 4.Iannaccone R, Laghi A, Catalano C et al (2004) Computed tomographic colonography without cathartic preparation for the detection of colorectal polyps. Gastroenterology 127(5):1300–1311 [DOI] [PubMed]
- 5.Johnson CD, Toledano AY, Herman BA et al (2003) Computerized tomographic colonography: performance evaluation in a retrospective multicenter setting. Gastroenterology 125(3):688–695 [DOI] [PubMed]
- 6.Pickhardt PJ, Choi JR, Hwang I et al (2003) Computed tomographic virtual colonoscopy to screen for colorectal neoplasia in asymptomatic adults. N Engl J Med 349(23):2191–2200 [DOI] [PubMed]
- 7.Taylor SA, Charman SC, Lefere P et al (2008) CT colonography: investigation of the optimum reader paradigm by using computer-aided detection software. Radiology 246(2):463–471 [DOI] [PubMed]
- 8.van Gelder RE, Nio CY, Florie J et al (2004) Computed tomographic colonography compared with colonoscopy in patients at increased risk for colorectal cancer. Gastroenterology 127(1):41–48 [DOI] [PubMed]
- 9.Shah SG, Saunders BP, Brooker JC, Williams CB (2000) Magnetic imaging of colonoscopy: an audit of looping, accuracy and ancillary maneuvers. Gastrointest Endosc 52(1):1–8 [DOI] [PubMed]
- 10.Punwani S, Halligan S, Irving P et al (2008) Measurement of colonic polyps by radiologists and endoscopists: Who is most accurate? Eur Radiol 18(5):874–881 [DOI] [PubMed]
- 11.Gupta S, Durkalski V, Cotton P, Rockey DC (2008) Variation of agreement in polyp size measurement between computed tomographic colonography and pathology assessment: clinical implications. Clin Gastroenterol Hepatol 6(2):220–227 [DOI] [PMC free article] [PubMed]
- 12.Burling D, Halligan S, Taylor S et al (2006) Polyp measurement using CT colonography: agreement with colonoscopy and effect of viewing conditions on interobserver and intraobserver agreement. AJR Am J Roentgenol 186(6):1597–1604 [DOI] [PubMed]
- 13.Park SH, Choi EK, Lee SS et al (2007) Polyp measurement reliability, accuracy, and discrepancy: optical colonoscopy versus CT colonography with pig colonic specimens. Radiology 244(1):157–164 [DOI] [PubMed]
- 14.Taylor S, Slater A, Honeyfield L, Burling D, Halligan S (2006) CT colonography: effect of colonic distension on polyp measurement accuracy and agreement—in vitro study. Acad Radiol 13(7):850–859 [DOI] [PubMed]
- 15.Participants in the Paris workshop (2003) he Paris endoscopic classification of superficial neoplastic lesions: esophagus, stomach, and colon: November 30 to December 1, 2002. Gastrointest Endosc 58(6 Suppl):S3–S43 [DOI] [PubMed]
- 16.Gopalswamy N, Shenoy VN, Choudhry U et al (1997) Is in vivo measurement of size of polyps during colonoscopy accurate? Gastrointest Endosc 46(6):497–502 [DOI] [PubMed]