Skip to main content
Journal of Nuclear Medicine logoLink to Journal of Nuclear Medicine
. 2018 Dec;59(12):1857–1864. doi: 10.2967/jnumed.118.217588

Interobserver Agreement for the Standardized Reporting System PSMA-RADS 1.0 on 18F-DCFPyL PET/CT Imaging

Rudolf A Werner 1,2, Ralph A Bundschuh 3, Lena Bundschuh 3, Mehrbod S Javadi 1, Jeffrey P Leal 1, Takahiro Higuchi 2,4, Kenneth J Pienta 5, Andreas K Buck 2, Martin G Pomper 1, Michael A Gorin 1,5, Constantin Lapa 2,*, Steven P Rowe 1,5,*,
PMCID: PMC6278904  PMID: 30190304

Abstract

Recently, the standardized reporting and data system for prostate-specific membrane antigen (PSMA)–targeted PET imaging studies, termed PSMA-RADS version 1.0, was introduced. We aimed to determine the interobserver agreement for applying PSMA-RADS to imaging interpretation of 18F-DCFPyL (2-(3-{1-carboxy-5-[(6-18F-fluoro-pyridine-3-carbonyl)-amino]-pentyl}-ureido)-pentanedioic acid) PET examinations in a prospective setting mimicking the typical clinical workflow at a prostate cancer referral center. Methods: Four readers (2 experienced readers (ERs, >3 y of PSMA-targeted PET interpretation experience) and 2 inexperienced readers (IRs, <1 y of experience)), who had all read the initial publication on PSMA-RADS 1.0, assessed 50 18F-DCFPyL PET/CT studies independently. Per scan, a maximum of 5 target lesions was selected by the observers, and a PSMA-RADS score for every target lesion was recorded. No specific preexisting conditions were placed on the selection of the target lesions, although PSMA-RADS 1.0 suggests that readers focus on the most avid or largest lesions. An overall scan impression based on PSMA-RADS was indicated, and interobserver agreement rates on a target lesion–based, on an organ-based, and on an overall PSMA-RADS score–based level were computed. Results: The number of target lesions identified by each observer was as follows: ER 1, 123; ER 2, 134; IR 1, 123; and IR 2, 120. Among those selected target lesions, 125 were chosen by at least 2 individual observers (all 4 readers selected the same target lesion in 58 of 125 [46.4%] instances, 3 readers in 40 of 125 [32%], and 2 observers in 27 of 125 [21.6%]). The interobserver agreement for PSMA-RADS scoring among identical target lesions was good (intraclass correlation coefficient [ICC] for 4, 3, and 2 identical target lesions, ≥0.60, respectively). For lymph nodes, an excellent interobserver agreement was derived (ICC, 0.79). The interobserver agreement for an overall scan impression based on PSMA-RADS was also excellent (ICC, 0.84), with a significant difference for ER (ICC, 0.97) vs. IR (ICC, 0.74) (P = 0.005). Conclusion: PSMA-RADS demonstrated a high concordance rate in this study, even among readers with different levels of experience. This finding suggests that PSMA-RADS can be effectively used for communication with clinicians and can be implemented in the collection of data for large prospective trials.

Keywords: 18F-DCFPyL, PSMA-RADS, interreader, interobserver, PSMA, prostate cancer


Radiotracers targeting prostate-specific membrane antigen (PSMA), such as the urea-based small-molecule 18F-DCFPyL (2-(3-{1-carboxy-5-[(6-18F-fluoro-pyridine-3-carbonyl)-amino]-pentyl}-ureido)-pentanedioic acid), have demonstrated excellent performance characteristics in identifying sites of disease in subjects with prostate cancer (PCa) (13). However, in patients with an extensive tumor burden (4) or for lesion detection in preoperative lymph node (LN) staging (5), clinical interpreters have to consider certain pitfalls, such as uptake in benign lesions or in nonprostatic malignancies (610). To aid in the interpretation of PSMA-targeted PET imaging studies, multiple structured reporting systems have been proposed. These include the Prostate Cancer Molecular Imaging Standardized Evaluation and the PSMA-Reporting and Data System (PSMA-RADS, version 1.0) (1114). Such frameworks help convey to the reader the level of certainty that an equivocal finding or a finding without a cross-sectional imaging correlate is a site of disease. Striving for a readily applicable system for a clinical observer, PSMA-RADS is simple, easy to memorize and use, and based exclusively on imaging findings (i.e., the site and intensity of radiotracer uptake). Both individual target lesions (maximum of 5 per scan) and the overall impression of the imaging study should receive a PSMA-RADS score. Such scores are on a 5-point scale that reflects the confidence of the interpreting imaging specialist that a given lesion represents a site of PCa (from 1 = definitively benign to 5 = high degree of certainty that PCa is present). PSMA-RADS 1.0 may facilitate the collection of data for larger clinical trials, can serve as a guide for nuclear medicine physicians in interpreting PSMA-targeted PET scans, and can enable efficient communication with referring clinicians (13).

To validate the utility of PSMA-RADS, further confirmatory work on this proposed standardized reporting system is needed and the interobserver agreement among different interpreters has to be addressed. As such, we undertook to determine the interobserver reliability of PSMA-RADS in a prospective setting in which readers with varying experience levels evaluated 50 18F-DCFPyL PET/CT scans randomly selected from a large trial evaluating the clinical utility of the radiotracer. All observers had read the original PSMA-RADS publication but were masked to all information about the patients and were provided no other instructions, thus simulating some elements of a real-world busy clinical PCa practice.

MATERIALS AND METHODS

In total, 50 patients with histologically proven PCa who had undergone 18F-DCFPyL PET/CT imaging were included in this evaluation. All patients were originally imaged as part of an institutional review board–approved protocol (ClinicalTrials.gov identifier NCT02825875), and all patients gave written informed consent. 18F-DCFPyL was used according to Food and Drug Administration Investigational New Drug application 121064.

Imaging Procedure

As per our standard practice, patients were asked to be nil per os (with the exception of water and medications) for at least 4 h before radiotracer injection. 18F-DCFPyL was synthesized as previously described (15). Integrated PET/CT using either a Discovery RX 64-slice PET/CT scanner (GE Healthcare) or a Biograph mCT 128-slice PET/CT scanner (Siemens) operating in 3-dimensional emission mode with CT attenuation correction was performed on all patients. 18F-DCFPyL (≤333 MBq [≤9 mCi]) was administered intravenously, and after an uptake time of approximately 60 min, acquisitions from the mid thigh to the vertex of the skull were conducted, covering 6–8 bed positions (depending on patient height and the scanner) with patients supine. A detailed description was previously published (7).

Imaging Interpretation

PET images were analyzed using XD3 Software (Mirada Medical). PET, CT, and PET/CT images were assessed for all 50 patients. Two experienced readers (a dual board-certified nuclear medicine physician/radiologist [ER 1] and a board-certified nuclear medicine physician [ER 2] with >3 y of experience in reading PSMA-targeted PET scans) and 2 inexperienced readers (a recently board-certified nuclear medicine physician [IR 1] and a resident [IR 2] with <1 y of experience in reading PSMA-targeted PET scans), masked to the clinical status of the patients (other than knowing that the patients had been imaged because of a history of PCa), evaluated all scans independently. Except for ER 1, the remaining 3 readers had no previous experience with reading 18F-labeled PSMA-targeted PET images (i.e., those observers had clinical experience solely in interpreting 68Ga-PSMA-11 or 68Ga-PSMA imaging-and-therapy PET scans). Before beginning the masked independent reads, the IRs underwent a training session with 5 cases to gain familiarity with the workstation and the XD3 Software (Mirada Medical) that was used to display the scans.

PSMA-RADS-1A lesions are benign and have no abnormal radiotracer uptake, whereas PSMA-RADS-1B lesions are benign (often characterized by biopsy or pathognomonic imaging) but do have abnormal radiotracer uptake. Often, characterizing a lesion as PSMA-RADS-1B involves previous conventional imaging or histologic diagnosis; as such, PSMA-RADS-1A and -1B were subsumed under PSMA-RADS-1 in the present masked analysis. No other changes to the PSMA-RADS system were implemented in this study. A complete summary of the PSMA-RADS scoring system (from PSMA-RADS-1 to -5) can be found in a previous publication (13).

In accordance with the specifications of PSMA-RADS 1.0, a maximum of 5 target lesions was selected by the readers. PSMA-RADS suggests that target lesions be those that are largest or have the most intense radiotracer uptake, although ultimately target lesion selection is left to the discretion of the interpreting imaging specialist. Further, a maximum of 3 lesions per organ can be included. The following organ compartments were defined: LNs, skeleton, prostate/local recurrence, soft tissue (other than LNs), liver, thyroid, and lung (16). A PSMA-RADS score had to be assigned to every target lesion. Additionally, all involved organ compartments were identified by the readers, and an overall scan score was assigned. The overall PSMA-RADS score was defined analogously to somatostatin receptor RADS (i.e., the highest PSMA-RADS score of any of the individual target lesions) (17). Moreover, the following general parameters were assessed by each observer in a binary fashion: overall scan result (positive in cases of suggestive radiotracer uptake above the background level), organ involvement, and LN involvement. Additionally, the number of organs affected, the number of organ metastases, the number of LN regions, and the number of LNs had to be indicated on a 5-point scale (from 1 to ≥5 organ metastases, LNs, or number of organs/LN areas affected). The following LN areas were defined: cervical, thoracic/axillary, retroperitoneal, (pre)sacral, and pelvic (16). Moreover, the concordance between both ERs and IRs was evaluated in an interobserver setting for the overall PSMA-RADS score.

Statistical Analysis

Continuous data are presented as mean ± SD. The categoric variables are presented as frequency and percentage. The degrees of agreement were assessed using intraclass correlation coefficients (ICCs) and their 95% confidence intervals (CIs) based on a mean-rating, single-measure, consistency model. According to Cicchetti, an ICC of less than 0.4 indicates poor interobserver agreement, 0.4–0.59 indicates fair agreement, 0.6–0.74 indicates good agreement, and 0.75–1 indicates excellent agreement (18). Statistical analysis was performed using MedCalc statistical software (version 18.2.1; MedCalc Software bvba). The statistical significance level was set at a P value of less than 0.05.

RESULTS

The patients’ characteristics are detailed in Table 1.

TABLE 1.

Patient Characteristics

Parameter Characteristic Data
Median age ± SD (y) 65 ± 8
Race White 38/50 (76%)
Black 9/50 (18%)
Asian/other 3/50 (6%)
Indication for scan Staging 24/50 (48%)
Biochemical recurrence 9/50 (18%)
Biochemical persistence after primary surgery 6/50 (12%)
Primary diagnosis 5/50 (10%)
Potential withdrawal of androgen deprivation therapy 3/50 (6%)
Other 3/50 (6%)
Gleason score (GS) Overall median ± SD (n = 39) 8 ± 1
GS 6 1/39 (2.6%)
GS 7 15/39 (38.4%)
GS 8 7/39 (17.9%)
GS 9 15/39 (38.5%)
GS 10 1/39 (2.6%)
PSA level (ng/mL) Overall median 3.2
Range 0.02–48
Prior therapies Total 41/50 (82%)
Surgery 29/41 (70.7%)
Hormonal therapy 21/41 (51.2%)
Radiation therapy 18/41 (43.9%)
Chemotherapy 6/41 (14.6%)

PSA = prostate specific antigen.

General Parameters

For the 3 parameters that had to be evaluated in a binary fashion (overall scan result, organ involvement, and LN involvement), the interobserver agreement was excellent (ICC, 0.75, 0.80, and 0.78, respectively) (18). Except for the number of organs affected (good interobserver agreement; ICC, 0.74), all general parameters that were evaluated on a 5-point scale demonstrated excellent agreement (number of LN areas affected: ICC, 0.79; number of organ metastases: ICC, 0.92; number of LN metastases: ICC, 0.90). Table 2 summarizes all results for those general scan parameters, and Figure 1 displays the distribution for number of organ and LN metastases for all 4 readers.

TABLE 2.

General Parameters Assessed by the 4 Readers

Level of certainty*
Parameter Parameter Parameter Parameter 0 1 2 3 4 5
Binary fashion
 Overall scan result (negative = 0, positive = 1) 0.75 0.64–0.83 ER 1 13 (26) 37 (74)
ER 2 13 (26) 37 (74)
IR 1 6 (12) 44 (88)
IR 2 13 (26) 37 (74)
 Organ involvement (no = 0, yes = 1) 0.80 0.71–0.88 ER 1 28 (56) 22 (44)
ER 2 27 (54) 23 (46)
IR 1 20 (40) 30 (60)
IR 2 29 (58) 21 (42)
 LN involvement (no = 0, yes = 1) 0.78 0.69–0.86 ER 1 25 (50) 25 (50)
ER 2 27 (54) 23 (46)
IR 1 21 (42) 29 (58)
IR 2 24 (48) 26 (52)
5-point assessment
 Affected organs (n) 0.74 0.62–0.83 ER 1 28 (56) 17 (34) 3 (6) 2 (4)
ER 2 28 (56) 18 (36) 4 (8)
IR 1 20 (40) 20 (40) 9 (18) 1 (2)
IR 2 29 (58) 17 (34) 4 (8)
 Organ metastases (n) 0.92 0.89–0.95 ER 1 28 (56) 10 (20) 1 (2) 3 (6) 2 (4) 6 (12)
ER 2 32 (64) 6 (12) 3 (6) 1 (2) 2 (4) 6 (12)
IR 1 23 (46) 13 (26) 3 (6) 2 (4) 1 (2) 8 (16)
IR 2 29 (58) 8 (16) 3 (6) 3 (6) 2 (4) 5 (10)
 Affected LN areas (n) 0.79 0.70–0.86 ER 1 25 (50) 11 (22) 7 (14) 5 (10) 2 (4)
ER 2 27 (54) 8 (16) 7 (14) 6 (12) 2 (4)
IR 1 21 (42) 19 (38) 9 (18) 1 (2)
IR 2 24 (48) 11 (22) 10 (20) 3 (6) 2 (4)
 LN metastases (n) 0.90 0.85–0.94 ER 1 25 (50) 7 (14) 4 (8) 2 (4) 12 (24)
ER 2 28 (56) 4 (8) 4 (8) 2 (4) 2 (4) 10 (20)
IR 1 25 (50) 8 (16) 2 (4) 3 (6) 3 (6) 9 (18)
IR 2 24 (48) 6 (12) 7 (14) 2 (4) 3 (6) 8 (16)
*

Data are number of patients out of 50 total, followed by percentage in parentheses.

Structured as follows: from 1 to ≥ 5 organ metastases, or number of organs/LN areas affected.

ICC = intraclass coefficient.

FIGURE 1.

FIGURE 1.

Distribution of number of organ (A) and LN (B) metastases for all 4 readers.

Target Lesion- and Compartment-Based Interobserver Agreement

In total, the following numbers of target lesions were recorded by each reader: ER 1, 123; ER 2, 134; IR 1, 123; and IR 2, 120. Among those selected target lesions, 125 were chosen by at least 2 individual observers. The majority of the lesions were assigned to either LN (64/125, 51.2%) or skeleton (39/125, 31.2%) (Table 3).

TABLE 3.

Distribution of Target Lesions Among Compartments, with Agreement Rate and ICC Based on PSMA-RADS

Compartment-based distribution
Agreement rate
ICC
Identical target lesions* LN Bone Prostate/local recurrence Lung Non-LN soft tissue Thyroid gland Liver For all identical target lesions
For minimum agreements For all identical target lesions for LNs
All (n = 125) 64/125 (51.2) 39/125 (31.2) 11/125 (8.8) 5/125 (4.0) 3/125 (2.4) 2/125 (1.6) 1/125 (0.8) NA NA NA NA
4 (n = 58/125, 46.4%) 26/58 (44.8) 19/58 (32.8) 8/58 (13.8) 3/58 (5.2) 1/58 (1.7) 1/58 (1.7) 29/58 (50.0) 46/58 (79.3) 0.60 (0.48–0.71) 0.79 (0.66–0.89)
3 (n = 40/125, 32%) 22/40 (55.0) 12/40 (30.0) 3/40 (7.5) 2/40 (5.0) 1/40 (2.5) 21/40 (52.5) 36/40 (90.0) 0.60 (0.43–0.75) 0.66 (0.44–0.83)
2 (n = 27/125, 21.6%) 16/27 (59.3) 8/27 (29.6) 2/27 (7.4) 1/27 (3.7) 15/27 (55.6) NA 0.62 (0.32–0.81) 0.57 (0.12–0.83)
*

All identical target lesions chosen by minimum of 2 readers, and target lesions identified by 4, 3, and 2 readers.

If 4 readers selected same target lesion, minimum of 3 readers designated same PSMA-RADS score, and if 3 readers selected same target lesion, minimum of 2 readers designated same PSMA-RADS score.

NA = not applicable.

Data in parentheses are 95% CIs for ICC and percentages for all others.

Identical Target Lesion Included by 4 Readers

The identical target lesion was included by all 4 readers in 58 of 125 (46.4%) instances, with the majority of those findings being either LN (26, 44.8%) or bone lesions (19, 32.8%). In 29 (50%) of those 58 target lesions, all 4 readers designated the identical PSMA-RADS score, with another 17 lesions (29.3%) having agreement by 3 readers and the remaining 12 (20.7%) having agreement by 2 readers. The ICC was 0.60 (95% CI, 0.48–0.71). On an organ-based compartment level for all 4 readers selecting the same LN, interobserver agreement was 0.79 (95% CI, 0.66–0.89). Figure 2 illustrates the PSMA-RADS score for 4 identical target lesions among all readers.

FIGURE 2.

FIGURE 2.

Target lesion assessment (identical target lesion included by all 4 readers). PSMA-RADS-1A and -1B were subsumed under PSMA-RADS-1.

Identical Target Lesion Included by 3 Readers

In 40 (32%) of the 125 cases, 3 readers identified an identical target lesion. LNs comprised 22 (55%) of these 40 target lesions, with 12 (30%) being bone findings. In 21 (52.5%), all 3 readers agreed on the same PSMA-RADS score; in 15 (37.5%), 2 readers agreed; and in the remaining 4 (10%), there was no agreement. The ICC was 0.60 (95% CI, 0.43–0.75). Similar to the situation for 4 identical target lesion selections, the interobserver agreement was 0.66 for LN (95% CI, 0.44–0.83).

Identical Target Lesion Included by 2 Readers

In 27 of the 125 identical target lesions (21.6%), a minimum of 2 readers selected the same finding. LNs (16, 59.3%) and bone lesions (8, 29.6%) were seen in the majority of the cases. In approximately half the cases (15, 55.6%), both readers agreed on the PSMA-RADS score, with no concordance being seen in the remaining cases (12, 44.4%). The ICC was 0.62 (95% CI, 0.32–0.81) for 2 identical target lesions (LN) (ICC, 0.57; 95% CI, 0.12–0.83).

Taken together, the ICC for 4, 3, and 2 identical chosen target lesions can be described as good. The number of investigated identical bone lesions by 4, 3, or 2 readers was too small for a reliable assessment of ICCs. Table 3 summarizes the compartment-based and target lesion interobserver agreement findings. Table 4 provides a distribution of the different PSMA-RADS scores for those target lesions that had been included by all 4 readers.

TABLE 4.

Distribution of PSMA-RADS Score for 4 Identical Target Lesions and for Overall PSMA-RADS Score Among the 4 Readers

PSMA-RADS
Parameter Reader 1* 2 3A 3B 3C 3D 4 5
4 identical target lesions ER 1 1/58 (1.7) 4/58 (6.9) 22/58 (37.9) 31/58 (53.5)
ER 2 1/58 (1.7) 1/58 (1.7) 2/58 (3.4) 27/58 (46.6) 27/58 (46.6)
IR 1 3/58 (5.2) 2/58 (3.4) 2/58 (3.4) 4/58 (6.9) 12/58 (20.7) 35/58 (60.3)
IR 2 1/58 (1.7) 1/58 (1.7) 3/58 (5.2) 28/58 (48.3) 25/58 (43.1)
Overall PSMA-RADS ER 1 10/50 (20) 2/50 (4) 1/50 (2) 15/50 (30) 22/50 (44)
ER 2 9/50 (18) 2/50 (4) 3/50 (6) 1/50 (2) 16/50 (32) 19/50 (38)
IR 1 6/50 (12) 5/50 (10) 2/50 (4) 2/50 (4) 3/50 (6) 9/50 (18) 23/50 (46)
IR 2 10/50 (20) 3/50 (6) 1/50 (2) 15/50 (30) 21/50 (42)
*

PSMA-RADS-1A and -1B were subsumed under PSMA-RADS score 1.

n = 58 target lesions and 50 scans. Data in parentheses are percentages.

Overall PSMA-RADS

In the majority of the cases, the readers described the scan impression with an overall PSMA-RADS score of 4 or 5. The ICC was 0.84 (95% CI, 0.77–0.90; that is, excellent agreement). Table 4 gives an overview of the distribution of the different overall PSMA-RADS scores for all 4 readers. Figure 3 illustrates the overall PSMA-RADS distribution among different readers.

FIGURE 3.

FIGURE 3.

Overall-PSMA RADS scoring for all 4 readers. PSMA-RADS-1A and -1B were subsumed under PSMA-RADS-1.

ERs Versus IRs

Compared with ERs serving as a gold standard, the ICC of the ERs for an overall PSMA-RADS score level was 0.97 (95% CI, 0.94–0.98), whereas for the IRs, the ICC was 0.74 (95% CI, 0.58–0.84). A statistically significant difference could be reached for the ICC of the ERs versus the ICC of the IRs (P = 0.005). These findings were further corroborated on a target-based level investigating all identical target lesions that were included by all 4 readers. The ICC for the ERs was 0.80 (95% CI, 0.68–0.88) and was statistically significantly different from the ICC for IRs, 0.53 (95% CI, 0.32–0.60) (P = 0.013). Figures 4 and 5 provide examples of lesions in which reader experience may have played a role in PSMA-RADS scoring.

FIGURE 4.

FIGURE 4.

Images of 60-y-old man undergoing 18F-DCFPyL PET/CT for primary diagnostic assessment (prostate-specific antigen level on date of scan, 13.5; no previous therapies). (A) Whole-body maximum-intensity projection demonstrates multiple sites of suggestive radiotracer uptake (e.g., third right rib [arrowhead] and sixth right rib [arrow]). (B–D) Axial CT (B), axial 18F-DCFPyL PET (C), and axial 18F-DCFPyL PET/CT (D) show mild radiotracer uptake in right iliac LN (arrow). Although ER trained on 18F-DCFPyL PET called this lesion PSMA-RADS-4, the 2 remaining readers trained on 68Ga-PSMA PET called it PSMA-RADS-3A (i.e., suggestive but indeterminate LN) (12). One might speculate that 18F-trained reader had higher confidence in lesion interpretation on 18F-DCFPyL PET scans because of higher sensitivity in detection rate of small lesions using 18F-labeled radiotracers than using 68Ga-PSMA (27). (E–G) All 4 readers classified overall scan impression as PSMA-RADS-5, because CT (E) revealed findings corresponding to sixth right rib metastasis, with discernible radiotracer uptake on axial 18F-DCFPyL PET (F) and axial 18F-DCFPyL PET/CT (G) (doubled arrows). (H–J) Magnification of this sixth-rib suggestive site of uptake provided in axial CT (H), axial 18F-DCFPyL PET (I), and axial 18F-DCFPyL PET/CT (J) further suggested this to be malignant lesion at this uptake site (arrowhead).

FIGURE 5.

FIGURE 5.

Images of 76-y-old man undergoing 18F-DCFPyL PET/CT for staging of metastatic PCa (prostate-specific antigen level on date of scan, 0.63; prior prostatectomy). (A) Whole-body maximum-intensity projection shows radiotracer uptake in right hilar and subcarinal LNs (arrows), lung lesion (arrowhead), and right iliac bone (doubled arrow). (B–D) Axial CT (B), axial 18F-DCFPyL PET (C), and axial 18F-DCFPyL PET/CT (D) show mild to moderate radiotracer uptake in subcarinal LN (arrows). IR called this finding PSMA-RADS-4, whereas ER classified it PSMA-RADS-2 (i.e., likely benign because of low-level uptake in soft-tissue site atypical of metastatic PCa). Hilar and subcarinal LNs remained unchanged on follow-up imaging, suggesting that these are benign. All 4 readers classified overall scan impression as PSMA-RADS-5. (E–G) Axial CT (E), axial 18F-DCFPyL PET (F), and axial 18F-DCFPyL PET/CT (G) show intense radiotracer uptake in right iliac bone (doubled arrows). Apart from that, lung lesion (18F-DCFPyL whole-body maximum intensity projection in A, red arrowhead) was classified as PSMA-RADS-2 by IR. (H–J) Axial CT (H), axial 18F-DCFPyL PET (I), and axial 18F-DCFPyL PET/CT (J) of this lesion further confirm suspicion of benign lesion (arrowheads, most likely peripheral interstitial thickening). Follow-up imaging also corroborated this impression.

DISCUSSION

In light of the growing availability of 68Ga- or 18F-labeled PSMA-targeted imaging agents (1922), the number of molecular imaging specialists that routinely interpret PET scans with these compounds outside controlled clinical trials is expanding (23). However, numerous studies have reported pitfalls in the reading of PSMA-targeted PET studies, including studies of Paget disease, sarcoidosis, or nervous tissue such as ganglia (710). Any systematic approach to the interpretation of PSMA-targeted PET scans should therefore build in a measure of uncertainty as to the presence of PCa. The recently reported system PSMA-RADS 1.0 incorporates such uncertainty with recommended follow-up for indeterminate lesions (12). Further, such a system should also facilitate communication of important findings between image interpreters and referring clinicians, be useful for collecting data in multicenter prospective studies, and allow for the eventual implementation of machine learning algorithms based on the system. For all these applications, high interobserver reproducibility is necessary.

The ICC for the overall PSMA-RADS score for both ERs (0.97) was consistent with excellent interobserver agreement, whereas the 2 IRs still agreed well (0.74) on an overall PSMA-RADS score level (all 4 interpreters, 0.84; Fig. 3). This is in line with previous reports in which ERs demonstrated an almost perfect reproducibility on 68Ga-PSMA-11 PET/CT for specified lesions (low-experience observers, substantial agreement) (16). Notably, these results contrast with other standardized reporting systems, such as prostate imaging (PI)-RADS 2 for prostate MRI (moderate interobserver agreement among experienced radiologists, with a Fleiss k < 0.6) (24). In a similar vein, a significant variation was present in both the PI-RADS distribution between radiologists and, more importantly, in the detection of suspected clinically significant cancer by PI-RADS using multiparametric MRI (25).

On an overall scan impression level, most PET studies were assigned PSMA-RADS-4 or -5 scores by all observers (Table 4). We hypothesize that this finding derives from the high specificity and sensitivity of PSMA-targeted radiotracers. Although PI-RADS highly depends on the experience of the reading radiologists (25), PSMA-RADS seems to be readily applicable even for less experienced readers (ICC, 0.74). These findings were further corroborated on a target lesion level (Fig. 2). Despite the fact that PSMA-RADS provides little specific information on the selection of target lesions, a minimum of 3 readers (i.e., minimum of 1 IR) designated the same PSMA-RADS score within the context of all 4 readers selecting the same target lesion with an agreement rate of more than 79% (Table 3). Moreover, on an organ compartment level, the ICC for LN lesions based on PSMA-RADS was 0.79, which is almost identical to a previous assessment for the interobserver agreement for LNs (Fleiss k = 0.80) (16).

A nuance of the current study is that the ERs gained experience with subtly different PSMA-targeted radiotracers. There is a current trend toward increased use of 18F-labeled PSMA-targeted imaging agents for PCa molecular imaging, although 68Ga-PSMA-11 has been by far the most commonly used radiotracer to date (26). In head-to-head comparisons between 68Ga- and 18F-labeled compounds, a higher detection rate for sites of disease, as well as an increased tumor-to-background ratio, was demonstrated with a radiofluorinated agent (27,28). Some of the differences in interpretation between ER 1 and ER 2 might be related to their relative familiarities with these different PSMA-targeted radiotracers. A common example of a difference in classification between an 18F-trained reader and the 68Ga-trained readers is given in Figure 4: Although ER 1 called uptake in a right iliac LN lesion PSMA-RADS-4 (i.e., PCa highly likely to be present), 2 other readers (ER 2 and IR 1, both trained with 68Ga-PSMA PET imaging agents) classified this lesion as PSMA-RADS-3A (i.e., a suggestive but indeterminate LN) (13). The 18F-trained reader may have higher confidence in lesion interpretation on 18F-DCFPyL PET scans, most likely because of the higher sensitivity in the detection rate of small lesions using 18F-labeled radiotracers than using 68Ga-PSMA PET imaging agents (27,28).

Further corroborating the need for a standardized framework (11,12), one of the IRs classified moderate radiotracer uptake in mediastinal and hilar LNs as PSMA-RADS-4 (Fig. 5), whereas ER 1 called it PSMA-RADS-2 (i.e., likely benign). Even though the IR had potentially misinterpreted the low-level uptake in the LNs (longitudinal follow-up imaging showed no change in these LNs), this potential misinterpretation did not affect the overall scan score. Thus, PSMA-RADS may contribute to a self-learning effect: PSMA-RADS-4 lesions may be downgraded to PSMA-RADS-2 when subsequent imaging confirms stability, which in turn would increase the understanding of the IR about how to differentiate between typical and atypical sites of PCa metastases.

This study had several limitations. First, false-positive findings, in particular on a target lesion level, cannot be ruled out, as histopathologic assessment of the target lesions (many of which are small and not targetable on conventional imaging) would not be feasible. Second, the readers were masked to clinical status and potential corroborative imaging, potentially lowering interobserver agreement; however, the cases in this study were randomly selected and the readers masked to ancillary information in order to create a worst-case-scenario reflection of a busy real-world clinical practice to best test the applicability of PSMA-RADS. Although, in many situations, clinical information would be available to readers, we wished to ascertain the robustness of PSMA-RADS as an imaging-finding–driven construct. Nonetheless, future studies must clarify if providing clinical information has an important impact on the agreement rate of multiple observers and should also include stratification by serum prostate-specific antigen levels. Given the small number of identical bone lesions, ICC could not be provided for bone metastases. However, the readers in this study may have identified different target lesions in some patients with extensive skeletal involvement. Lastly, a larger trial including more scans and readers could further corroborate our preliminary findings. Nonetheless, the agreement rate of the overall PSMA-RADS score was excellent among all observers, and this initial result is promising.

CONCLUSION

In the present prospective study investigating interobserver agreement for the novel structured reporting system PSMA-RADS 1.0, a high concordance rate, even among readers with different experience, was observed. Thus, PSMA-RADS may be a useful framework for interpreting PSMA-targeted imaging studies, which in turn paves the way for implementing PSMA-RADS in the collection of data for larger prospective trials.

DISCLOSURE

Funding was provided by the Prostate Cancer Foundation Young Investigator Award; National Institutes of Health grants CA134675, CA183031, CA184228, and EB024495; and the European Union’s Horizon 2020 research and innovation program under Marie Sklodowska-Curie grant agreement 701983. Martin G. Pomper is a coinventor on a patent covering 18F-DCFPyL and is entitled to a portion of any licensing fees and royalties generated by this technology. This arrangement has been reviewed and approved by the Johns Hopkins University in accordance with its conflict-of-interest policies. He has also received research funding from Progenics Phamaceuticals, the licensee of 18F-DCFPyL. Michael A. Gorin has served as a consultant to, and has received research funding from, Progenics Phamaceuticals. Kenneth J. Pienta has received research funding from Progenics Phamaceuticals. Steven P. Rowe has received research funding from Progenics Phamaceuticals. No other potential conflict of interest relevant to this article was reported.

REFERENCES

  • 1.Giesel FL, Will L, Kesch C, et al. Biochemical recurrence of prostate cancer: initial results with [18F]PSMA-1007 PET/CT. J Nucl Med. 2018;59:632–635. [DOI] [PubMed] [Google Scholar]
  • 2.Calais J, Czernin J, Cao M, et al. 68Ga-PSMA-11 PET/CT mapping of prostate cancer biochemical recurrence after radical prostatectomy in 270 patients with a PSA level of less than 1.0 ng/mL: impact on salvage radiotherapy planning. J Nucl Med. 2018;59:230–237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Szabo Z, Mena E, Rowe SP, et al. Initial evaluation of [18F]DCFPyL for prostate-specific membrane antigen (PSMA)-targeted PET imaging of prostate cancer. Mol Imaging Biol. 2015;17:565–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Schmuck S, von Klot CA, Henkenberens C, et al. Initial experience with volumetric 68Ga-PSMA I&T PET/CT for assessment of whole-body tumor burden as a quantitative imaging biomarker in patients with prostate cancer. J Nucl Med. 2017;58:1962–1968. [DOI] [PubMed] [Google Scholar]
  • 5.Maurer T, Gschwend JE, Rauscher I, et al. Diagnostic efficacy of 68gallium-PSMA positron emission tomography compared to conventional imaging for lymph node staging of 130 consecutive patients with intermediate to high risk prostate cancer. J Urol. 2016;195:1436–1443. [DOI] [PubMed] [Google Scholar]
  • 6.Rowe SP, Deville C, Paller C, et al. Uptake of 18F-DCFPyL in Paget’s disease of bone, an important potential pitfall in clinical interpretation of PSMA PET studies. Tomography. 2015;1:81–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Werner RA, Sheikhbahaei S, Jones KM, et al. Patterns of uptake of prostate-specific membrane antigen (PSMA)-targeted 18F-DCFPyL in peripheral ganglia. Ann Nucl Med. 2017;31:696–702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Rischpler C, Beck TI, Okamoto S, et al. 68Ga-PSMA-HBED-CC uptake in cervical, celiac, and sacral ganglia as an important pitfall in prostate cancer PET imaging. J Nucl Med. 2018;59:1406–1411. [DOI] [PubMed] [Google Scholar]
  • 9.Sheikhbahaei S, Afshar-Oromieh A, Eiber M, et al. Pearls and pitfalls in clinical interpretation of prostate-specific membrane antigen (PSMA)-targeted PET imaging. Eur J Nucl Med Mol Imaging. 2017;44:2117–2136. [DOI] [PubMed] [Google Scholar]
  • 10.Hofman MS, Hicks RJ, Maurer T, Eiber M. Prostate-specific membrane antigen PET: clinical utility in prostate cancer, normal patterns, pearls, and pitfalls. Radiographics. 2018;38:200–217. [DOI] [PubMed] [Google Scholar]
  • 11.Eiber M, Herrmann K, Calais J, et al. Prostate Cancer Molecular Imaging Standardized Evaluation (PROMISE): proposed miTNM classification for the interpretation of PSMA-ligand PET/CT. J Nucl Med. 2018;59:469–478. [DOI] [PubMed] [Google Scholar]
  • 12.Rowe SP, Pienta KJ, Pomper MG, Gorin MA. PSMA-RADS version 1.0: a step towards standardizing the interpretation and reporting of PSMA-targeted PET imaging studies. Eur Urol. 2018;73:485–487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rowe SP, Pienta KJ, Pomper MG, Gorin MA. Proposal for a structured reporting system for prostate-specific membrane antigen-targeted PET imaging: PSMA-RADS version 1.0. J Nucl Med. 2018;59:479–485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fanti S, Minozzi S, Morigi JJ, et al. Development of standardized image interpretation for 68Ga-PSMA PET/CT to detect prostate cancer recurrent lesions. Eur J Nucl Med Mol Imaging. 2017;44:1622–1635. [DOI] [PubMed] [Google Scholar]
  • 15.Ravert HT, Holt DP, Chen Y, et al. An improved synthesis of the radiolabeled prostate-specific membrane antigen inhibitor, [18F]DCFPyL. J Labelled Comp Radiopharm. 2016;59:439–450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Fendler WP, Calais J, Allen-Auerbach M, et al. 68Ga-PSMA-11 PET/CT interobserver agreement for prostate cancer assessments: an international multicenter prospective study. J Nucl Med. 2017;58:1617–1623. [DOI] [PubMed] [Google Scholar]
  • 17.Werner RA, Solnes L, Javadi M, et al. SSTR-RADS version 1.0 as a reporting system for SSTR-PET imaging and selection of potential PRRT candidates: a proposed standardization framework. J Nucl Med. 2018;59:1085–1091. [DOI] [PubMed] [Google Scholar]
  • 18.Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess. 1994;6:284–290. [Google Scholar]
  • 19.Salas Fragomeni RA, Amir T, Sheikhbahaei S, et al. Imaging of nonprostate cancers using PSMA-targeted radiotracers: rationale, current state of the field, and a call to arms. J Nucl Med. 2018;59:871–877. [DOI] [PubMed] [Google Scholar]
  • 20.Giesel FL, Hadaschik B, Cardinale J, et al. F-18 labelled PSMA-1007: biodistribution, radiation dosimetry and histopathological validation of tumor lesions in prostate cancer patients. Eur J Nucl Med Mol Imaging. 2017;44:678–688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Paddubny K, Freitag MT, Kratochwil C, et al. Fluorine-18 prostate-specific membrane antigen-1007 positron emission tomography/computed tomography and multiparametric magnetic resonance imaging in diagnostics of local recurrence in a prostate cancer patient after recent radical prostatectomy. Clin Genitourin Cancer. 2018;16:103–105. [DOI] [PubMed] [Google Scholar]
  • 22.Werner RA, Andree C, Javadi MS, et al. A voice from the past: rediscovering the Virchow node with prostate-specific membrane antigen-targeted 18F-DCFPyL positron emission tomography imaging. Urology. 2018;117:18–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Fendler WP, Eiber M, Beheshti M, et al. 68Ga-PSMA PET/CT: joint EANM and SNMMI procedure guideline for prostate cancer imaging: version 1.0. Eur J Nucl Med Mol Imaging. 2017;44:1014–1024. [DOI] [PubMed] [Google Scholar]
  • 24.Rosenkrantz AB, Ginocchio LA, Cornfeld D, et al. Interobserver reproducibility of the PI-RADS version 2 lexicon: a multicenter study of six experienced prostate radiologists. Radiology. 2016;280:793–804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sonn GA, Fan RE, Ghanouni P, et al. Prostate magnetic resonance imaging interpretation varies substantially across radiologists. Eur Urol Focus. December 6, 2017 [Epub ahead of print]. [DOI] [PubMed] [Google Scholar]
  • 26.Kesch C, Kratochwil C, Mier W, Kopka K, Giesel FL. 68Ga or 18F for prostate cancer imaging? J Nucl Med. 2017;58:687–688. [DOI] [PubMed] [Google Scholar]
  • 27.Dietlein M, Kobe C, Kuhnert G, et al. Comparison of [18F]DCFPyL and [68Ga]Ga-PSMA-HBED-CC for PSMA-PET imaging in patients with relapsed prostate cancer. Mol Imaging Biol. 2015;17:575–584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Dietlein F, Kobe C, Neubauer S, et al. PSA-stratified performance of 18F- and 68Ga-PSMA PET in patients with biochemical recurrence of prostate cancer. J Nucl Med. 2017;58:947–952. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Nuclear Medicine are provided here courtesy of Society of Nuclear Medicine and Molecular Imaging

RESOURCES