Abstract
Objectives
To develop and test zone-specific prostate-specific antigen density (sPSAD) combined with PI-RADS to guide prostate biopsy decision strategies (BDS).
Methods
This retrospective study included consecutive patients, who underwent prostate MRI and biopsy (01/2012–10/2018). The whole gland and transition zone (TZ) were segmented at MRI using a retrained deep learning system (DLS; nnU-Net) to calculate PSAD and sPSAD, respectively. Additionally, sPSAD and PI-RADS were combined in a BDS, and diagnostic performances to detect Grade Group ≥ 2 (GG ≥ 2) prostate cancer were compared. Patient-based cancer detection using sPSAD was assessed by bootstrapping with 1000 repetitions and reported as area under the curve (AUC). Clinical utility of the BDS was tested in the hold-out test set using decision curve analysis. Statistics included nonparametric DeLong test for AUCs and Fisher-Yates test for remaining performance metrics.
Results
A total of 1604 patients aged 67 (interquartile range, 61–73) with 48% GG ≥ 2 prevalence (774/1604) were evaluated. By employing DLS-based prostate and TZ volumes (DICE coefficients of 0.89 (95% confidence interval, 0.80–0.97) and 0.84 (0.70–0.99)), GG ≥ 2 detection using PSAD was inferior to sPSAD (AUC, 0.71 (0.68–0.74)/0.73 (0.70–0.76); p < 0.001). Combining PI-RADS with sPSAD, GG ≥ 2 detection specificity doubled from 18% (10–20%) to 43% (30–44%; p < 0.001) with similar sensitivity (93% (89–96%)/97% (94–99%); p = 0.052), when biopsies were taken in PI-RADS 4-5 and 3 only if sPSAD was ≥ 0.42 ng/mL/cc as compared to all PI-RADS 3-5 cases. Additionally, using the sPSAD-based BDS, false positives were reduced by 25% (123 (104–142)/165 (146–185); p < 0.001).
Conclusion
Using sPSAD to guide biopsy decisions in PI-RADS 3 lesions can reduce false positives at MRI while maintaining high sensitivity for GG ≥ 2 cancers.
Clinical relevance statement
Transition zone-specific prostate-specific antigen density can improve the accuracy of prostate cancer detection compared to MRI assessments alone, by lowering false-positive cases without significantly missing men with ISUP GG ≥ 2 cancers.
Key Points
• Prostate biopsy decision strategies using PI-RADS at MRI are limited by a substantial proportion of false positives, not yielding grade group ≥ 2 prostate cancer.
• PI-RADS combined with transition zone (TZ)-specific prostate-specific antigen density (PSAD) decreased the number of unproductive biopsies by 25% compared to PI-RADS only.
• TZ-specific PSAD also improved the specificity of MRI-directed biopsies by 9% compared to the whole gland PSAD, while showing identical sensitivity.
Supplementary Information
The online version contains supplementary material available at 10.1007/s00330-024-10700-z.
Keywords: Prostatic neoplasms, Magnetic resonance imaging, Prostate-specific antigen density, Clinical decision-making, Image-guided biopsy
Introduction
The introduction of the MRI pathway for the detection of clinically significant prostate cancer (csPCa) has reduced the need for prostate biopsy and lowered the detection of clinically insignificant cancers [1, 2]. The MRI pathway has also increased the accuracy of biopsies through improved targeting and grading with the majority of studies showing non-inferiority of csPCa detection [1–4]. Employing the Prostate Imaging-Reporting and Data System (PI-RADS) before biopsy still encounters a relatively high proportion of false positives in men considered to have csPCa with high inter-center variability, especially in men with PI-RADS 3 scoring [2, 5–7]. Thus, there is an ongoing clinical need for additional markers to reduce the number of false-positive cases to enable a more accurate triage of patients for biopsy.
Recent studies have addressed this unmet clinical need by using prostate volume (PV) normalized prostate-specific antigen (PSA density; PSAD) for biopsy management decisions [8–10]. Several studies have shown that PSAD and prostate MRI suspicion score (e.g., PI-RADS) are independent predictors of csPCa at biopsy [11–13]. In this context, PSAD values greater than 0.15 ng/mL/cc are more indicative of the presence of csPCa, whereas values less than 0.09 ng/mL/cc largely exclude csPCa [4, 14–16]. Furthermore, biopsy decision strategies (BDS), a combination of the PI-RADS scoring and PSAD, have been developed to maximize the accuracy of image-derived information for biopsy planning [14, 15, 17].
The PV is needed for PSAD to be calculated. However, the lack of standardized recommendations for PV determination limits the clinical utility of PSAD, as PV estimates do vary by imaging modality and calculation approach. Both planimetry and the prolate ellipse formula are used clinically [18–21]. Transition zone (TZ)-specific (s)PSAD has been postulated as being a more relevant test because the secretory cells of the TZ are primarily responsible for PSA production [22, 23]. TZ volume due to benign prostatic hyperplasia would better account for PSA values for biopsy decisions [24]. However, there is no specific formula for calculating TZ volume, which poses a barrier to the application of sPSAD in routine clinical practice.
In this context, deep learning (DL) systems have shown greater accuracy for whole prostate and TZ segmentation tasks [25–29]. However, to our knowledge, the diagnostic utility of DL-based automated sPSAD calculation for the detection of csPCa has not been comprehensively explored.
Therefore, the aim of this study was to develop and test a decision strategy combining DL-based sPSAD and PI-RADS to guide prostate biopsies in men at elevated risk of having PCa.
Methods
This study complies with the Checklist for Artificial Intelligence in Medical Imaging and the Standards for Reporting Diagnostic Accuracy studies. The study was approved by the institutional review board and informed consent was waived due to its retrospective design.
Study sample
We included consecutive men undergoing prostate biopsy at our institution between January 2012 and October 2018 as well as transrectal ultrasound (TRUS) and prostate MRI within 6 months before biopsy. Age, serum PSA, and TRUS-based PV were obtained from clinical records. Eligible MRI studies were retrieved from the institution’s PACS and incorporated into a customized DICOM-viewer for image annotation, quality assessment, and post-processing. A subset of this cohort was included in previous analyses, where PSAD was not investigated for PCa detection [30].
Patients with discrepancies in the radiologic-pathologic correlation, inconclusive zonal architecture of the prostate (e.g., indistinct or distorted zonal margins after transurethral resection), MRI using an endorectal coil, and MRI of insufficient quality were excluded (Fig. 1). Finally, the study sample was split into a training (61%), validation (15%), and hold-out test set (24%) comprising consecutive cases from 01/2012 to 01/2016, 01/2016–01/2017, and 01/2017–10/2018, respectively. For external testing, the publicly available prostate-MRI-US-biopsy set was used with a reported csPCa prevalence of 58% (399 of 692); the data set is accessible at https://wiki.cancerimagingarchive.net and detailed in the supplementary material [31].
MRI protocol and PI-RADS scoring
MRI was acquired according to the PI-RADS and ESUR technical recommendation guidelines [5, 32]. Imaging was performed on three MRI scanners, including a 1.5-T (Magnetom Avanto, Siemens Healthineers) and two identical 3-T MRI scanners (Magnetom Skyra, Siemens Healthineers). Prostate MRI protocols comprised T2-weighted imaging (T2W), diffusion-weighted images (DWI), and a dynamic-contrast enhanced (DCE) sequence, if available. Scanning parameters were axial and coronal T2W images (3.0 × 0.47 × 0.47 mm, 18 cm FoV), axial DWI at b values of 0, 50, 500, and 1000 s/mm2 with mono-exponentially calculated apparent diffusion coefficient (ADC) maps, and extrapolated high-b-value DWI images (3.0 × 1.4 × 1.4 mm, 22 cm FoV, calculated b = 1400 s/mm2). All MR images were reviewed for sufficient image quality according to national diagnostic requirements for prostate MRI prior to analysis [33]. Under the supervision of expert radiologists (P.A., B.H., and T.P. each with > 10 years of experience), prostate lesions described by radiology faculty on primary reports were reviewed in consensus by two radiologists (C.H. and N.B. with 3–5 years of experience) using the PI-RADS v.2.1 criteria [5]. Up to four lesions per patient were identified, of which the lesion with the highest reported PI-RADS score was defined as the index lesion for the patient-based analysis. According to PI-RADS v.2.1 guidelines, PV was calculated using the prolate ellipsoid formula: width × height × length × π/6.
TRUS-based volumetry and biopsy procedures
All TRUS examinations and MRI/US fusion-guided targeted biopsies were performed by a team of urologists (H.C., A.M., and K.G. each with > 10 years of experience) at our tertiary university center and in accordance with the European Association of Urology (EAU; 2022) guidelines. According to standard clinical protocol, PV was calculated using the prolate ellipsoid formula.
First, a targeted biopsy of the prostate with three cores per target lesion was performed as described previously [34]. Secondly, a systemic biopsy was performed with 10 cores from the apex, lateral mid-gland, base, and ventral and paraurethral region bilaterally, respectively. If pre-biopsy MRI was negative (PI-RADS 1-2), only systematic biopsy was performed. All cores were potted and documented separately, and examined and analyzed by experienced pathologists (including S.S.) following the ISUP guidelines [4].
Retraining of a deep learning system for prostate segmentation at MRI
Data set and manual prostate segmentation
In the training and validation set, the whole gland (WG) and TZ were manually segmented by two investigators (K.F. & F.D.) under the supervision of two radiologists (C.H. and T.P. with 3 and > 10 years of experience, respectively). Segmentations were performed using the Medical Imaging Interaction Toolkit (MITK; v.2021.10, German Cancer Research Center, Division of Medical Image Computing) and 3D Slicer (v.4.10.2) [35]. Segmentation was performed according to the anatomical landmarks on axial and coronal T2W volumes, specified in the PI-RADS guidelines [5].
DL segmentation model
For automated segmentation of the prostate and TZ, a high-performance, well-established nnU-Net was selected and trained on T2W, ADC, and DWI axial images and the manually generated WG and TZ three-dimensional masks [36]. The training and validation set comprised 972 and 243 men, respectively [36]. The model architecture and training is detailed in the supplementary material. Segmentation performance of the DL system was assessed in the validation set calculating the DICE coefficient ranging from 0 to 1 (0 indicating no spatial overlap of binary segmentations; 1 indicating complete overlap). Moreover, a more comprehensive pairwise agreement test of PVs derived from different MRI- and TRUS-based segmentation approaches was performed separately as detailed in the supplementary material.
Development of a biopsy decision strategy using PI-RADS and sPSAD
For BDS development using PI-RADS and sPSAD, the optimal sPSAD cutoff value was derived from the training set by selecting the cutoff value that achieved the highest specificity without suffering a statistically significant loss in sensitivity in csPCa detection. This was done by (I) calculating sPSAD specificity in incremental steps of 5% from 0 to 100%, (II) measuring the diagnostic performance of the BDS performing biopsy in men with PI-RADS 4-5 or PI-RADS 3 and sPSAD ≥ cutoff value using bootstrapping with 1000 repetitions, and (III) selecting the optimal sPSAD cutoff value using the exact Fisher-Yates test (alpha = 0.05).
Outcomes and statistics
The primary outcome of this study was the detection rate of csPCa (Gleason Score ≥ 3 + 4; ISUP Grade Group (GG) ≥ 2) using sPSAD alone or in combination with PI-RADS scoring.
Patient-based detection accuracy of GG ≥ 2 in the validation set using TRUS- and MRI-based PSAD and sPSAD, respectively, was reported using bootstrapping over 1000 repetitions, and areas under the receiver operating characteristic (ROC) curve (AUCs) were plotted. AUCs were compared using the nonparametric approach proposed by DeLong et al [37].
GG ≥ 2 detection accuracy of the BDS using PI-RADS and sPSAD was evaluated in the validation and the hold-out test set calculating specificity and sensitivity. Finally, the potential added clinical value of BDS for GG ≥ 2 detection was compared with the PI-RADS 3-5 category alone using decision curve analysis (DCA) and the exact Fisher-Yates test. One-sided p-values < 0.05 were considered statistically significant. Statistical analyses were performed using Python (v.3.8.10) with the libraries scipy (v.1.5.4) and Scikit-learn (v.0.23.2).
Results
Study population
This study included 1604 patients, which were split into a training (n = 972), validation (n = 243), and hold-out test set (n = 389; Table 1). Men in this study were 67 (IQR, 61–73) years of age, had PSA of 7.85 (4.86–10.84) ng/mL, and were diagnosed with GG ≥ 2 PCa in 48% (774/1604). GG ≥ 2 disease was suspected at MRI in 1059 men (524 and 535 with PI-RADS 4 and 5, respectively), while 182 men had a PI-RADS category of 3. Seventy-one and 41 patients were excluded due to insufficient image quality and inconclusive zonal architecture of the prostate, respectively.
Table 1.
Characteristics | All men (n = 1604) |
Training set (n = 972) |
Validation set (n = 243) |
Hold-out test set (n = 389) |
---|---|---|---|---|
Age (y; median, IQR) | 67 (61–73) | 66 (61–72) | 67 (61–74) | 66 (60–72) |
PSA level (ng/mL; median, IQR) | 7.85 (4.86–10.84) | 8.55 (5.15–11.95) | 7.82 (5.32–10.32) | 6.31 (4.17–8.45) |
Number of biopsy-naïve patients (%; n) | 48% (763) | 32% (313) | 51% (12 5) | 84% (325) |
Number of patients with PCa (%; n) | 69% (1109) | 65% (632) | 78% (189) | 74% (288) |
Number of patients with ISUP GG ≥ 2 cancers (%; n) | 48% (766) | 46% (449) | 51% (123) | 50% (194) |
ISUP Grade (%; n) | ||||
1 | 21% (343) | 19% (183) | 27% (66) | 33% (94) |
2 | 18% (289) | 15% (149) | 20% (48) | 32% (92) |
3 | 8% (123) | 7% (71) | 8% (20) | 11% (32) |
4 | 16% (264) | 18% (172) | 18% (43) | 17% (49) |
5 | 6% (90) | 6% (57) | 5% (12) | 7% (21) |
Highest patient-based PI-RADS (%; n) | ||||
1/2 | 16% (260) | 20% (198) | 11% (26) | 9% (36) |
3 | 11% (182) | 10% (102) | 9% (21) | 15% (58) |
4 | 33% (524) | 25% (243) | 37% (89) | 49% (191) |
5 | 33% (535) | 35% (338) | 38% (92) | 27% (104) |
NA | 6% (103) | 9% (91) | 6% (15) | - |
PSA prostate-specific antigen, PCa prostate cancer, ISUP GG International Society of Urological Pathology Grade Group, PI-RADS Prostate Imaging-Reporting and Data System
Prostate volume and PSAD
Across the whole study sample prostate volumetry on Segmentation-MRI, Ellipsoid-MRI and Ellipsoid-TRUS revealed a WG volume of 50 (95% confidence interval (95%CI), 22–78) mL, 55 (22–88) mL, and 55 (26–84) mL (p < 0.001, Table 2) with a PSAD of 0.26 (0–0.65) ng/mL/cc, 0.25 (0–0.56) ng/mL/cc, and 0.24 (0–0.63) ng/mL/cc (p < 0.001), respectively. The Ellipsoid-based approach at MRI and TRUS overestimated PV by 10% (55 vs 50 mL; p < 0.001, Supplementary Figure 1). The average MRI-based TZ volume was 31.26 mL and resulted in a sPSAD of 0.55 (0–1.41) ng/mL/cc.
Table 2.
Characteristics | All men (n = 1604) | Training set (n = 972) | Validation set (n = 243) | Hold-out test set (n = 389) |
---|---|---|---|---|
Prostate volume (mL) mean ± SD, range | ||||
Whole gland TRUS-based | 55 ± 29 (6–350) | 57 ± 29 (6–190) | 57 ± 26 (15–180) | 50 ± 31 (16–350) |
Whole gland MRI-based | 50 ± 28 (10–343) | 53 ± 28 (10–205) | 39 ± 22 (11–181) | 49 ± 30 (16–343) |
TZ MRI-based | 31 ± 26 (0.4–332) | 33 ± 26 (0.4–169) | 24 ± 19 (2–144) | 30 ± 29 (4–332) |
PSA density (ng/mL/cc) mean ± SD, range | ||||
Whole gland TRUS-based | 0.24 ± 0.39 (0.01–9.37) | 0.24 ± 0.31 (0.01–4.49) | 0.19 ± 0.15 (0.01–1.25) | 0.24 ± 0.60 (0.01–9.37) |
Whole gland MRI-based | 0.26 ± 0.39 (0.01–7.18) | 0.27 ± 0.35 (0.01–5.80) | 0.30 ± 0.23 (0.04–2.03) | 0.22 ± 0.39 (0.01–7.18) |
TZ MRI-based | 0.55 ± 0.86 (0.01–12.41) | 0.58 ± 0.99 (0.01–12.41) | 0.64 ± 0.6 (0.05–4.61) | 0.43 ± 0.56 (0.03–8.64) |
TRUS transrectal ultrasound, PSA prostate-specific antigen, TZ transition zone
DL-based volume estimates had a segmentation DICE score of 0.89 (95%CI, 0.80–0.97) and 0.84 (0.70–0.99) for the WG and TZ at MRI, respectively, with a mean deviation of 4% from the manually segmented WG volume (2 mL of 50 mL WG; Fig. 2 and Supplementary Figures 1 and 2). The segmentation processing time was 12s (95%CI, 9–15s).
GG ≥ 2 cancer detection using PSAD alone
sPSAD achieved an AUC of 0.73 (95%CI, 0.70–0.76) for GG ≥ 2 detection, outperforming TRUS- and MRI-based PSADs (AUC, 0.69 (0.66–0.72) and 0.71 (0.68–0.74); p < 0.001; Fig. 3a). The performance of PSAD for GG ≥ 2 and any-grade PCa is comparable to literature that distribute within the CI of the plotted AUCs (Supplementary Figure 3).
In patients with clinically challenging PSAD (intermediate-low to intermediate-high risk; 0.1–0.2 ng/mL/cc; Fig. 3b), TRUS- and MRI-based PSAD for GG ≥ 2 was similar (AUC, 0.60 (95%CI, 0.54–0.66) vs. 0.62 (0.57–0.67); p = 0.61) while both were outperformed by the sPSAD (0.66 (0.61–0.71); p = 0.031 and p = 0.023 in comparison to TRUS- and MRI-based PSAD, respectively). On external testing without additional refinements or retraining, sPSAD performed similarly to PSAD (AUC, 0.72 (0.68–0.76) vs 0.71 (0.67–0.75); p = 0.13) but outperformed the whole gland approach in patients with clinically challenging PSAD (0.61 (0.54–0.68) vs 0.54 (0.47–0.61); p = 0.018; Fig. 3c–d).
Biopsy decision strategy using PI-RADS and sPSAD
Performance of the sPSAD-based BDS in the validation set
Performing biopsy in men with PI-RADS 3-5 categories achieved a sensitivity of 100% (95%CI, 100–100%) and specificity of 24% (16–32%) for GG ≥ 2 detection. Following the recommended approach of performing biopsy in men with a PI-RADS 4-5 and PI-RADS 3 if the density is above a designated cutoff, the best-performing BDS achieved a sensitivity of 100% (95%CI, 100–100%; p = 1) and specificity of 36% (28–47%; p = 0.027), utilizing a sPSAD cutoff of 0.42 ng/mL/cc for GG ≥ 2. In comparison, the clinically established BDS using the MRI-based PSAD cutoff of 0.15 ng/mL/cc resulted in a sensitivity of 100% (95%CI, 100–100%; p = 1) and specificity of 33% (23–41%; p = 0.11) for GG ≥ 2 cancers. Any-grade PCa detection results are provided in the supplementary material.
Testing of the sPSAD-based BDS in the hold-out test set
PI-RADS 3-5 achieved a sensitivity and specificity of 97% (95%CI, 94–99%) and 18% (10–20%) for GG ≥ 2, respectively. The developed sPSAD-based BDS achieved a sensitivity of 93% (95%CI, 89–96%; p = 0.053) and specificity of 43% (30–44%; p < 0.001) for GG ≥ 2, when applying a sPSAD cutoff of 0.42 ng/mL/cc. In comparison, the established BDS with a PSAD cutoff of 0.15 achieved a sensitivity of 93% (95%CI, 89–96%; p = 0.053) and a lower specificity of 34% (27–41%; p < 0.001).
Overall impact on biopsy decision and outcome
Applying the proposed PI-RADS and sPSAD-based risk groups, men with a PI-RADS 4-5 or PI-RADS 3 with sPSAD ≥ 0.42 ng/mL/cc showed more men with GG ≥ 2 cancers (prevalence of 38–78%; Fig. 4). Based on the overall EAU accepted risk threshold of 9% for GG ≥ 2 [4], a biopsy could have safely been avoided in men with a PI-RADS 3 and a sPSAD of < 0.42 in the validation set. However, a GG ≥ 2 prevalence of 16% (95%CI, 11–21%) was detected for the same risk group in the hold-out test set. Note also that in the hold-out test set, PI-RADS 1/2 demonstrated an intermediate (16–20%) GG ≥ 2 risk in men with a sPSAD < 0.42 and ≥ 0.42.
DCA revealed that the BDS using PI-RADS 4/5 or PI-RADS 3 and sPSAD ≥ 0.42 resulted in the highest clinical benefit (Fig. 5), avoiding unnecessary biopsies in 18% (95%CI, 12–23%) and 19% (15–22%), and reducing the false-positive rate from 41% (33–48%) to 36% (29–43%; p < 0.001) and 47% (42–52%) to 41% (35–46%; p = 0.027) in the validation and hold-out test sets, respectively (Table 3). At the same time, GG ≥ 2 cancer was only missed in 0% (95%CI, 0–0%) and 7% (3–11%) of cases, respectively. Compared to performing biopsy in all men with PI-RADS 3-5, the number of false-positive findings would have been reduced by 17% (95%CI, 9–25%) and 25% (18–33%) in the validation and hold-out test set (68 (53–82) vs. 82 (67–97; p = 0.027) and 123 (104–142) vs. 165 (146–186; p < 0.001)), respectively. Additionally, the sPSAD-based BDS performance was similar to the PSAD-based BDS in the DCA, minimally reducing the number of false-positive findings (68 (95%CI, 53–82) vs. 73 (58–88); p = 0.28) and improving biopsy avoidance (123 (104–142) vs. 129 (111–148; p = 0.28) with an identical number of missed GG ≥ 2 cancers. Finally, the use of the developed sPSAD-based BDS in the external set, which substantially differed from our cohort (supplementary material), resulted in the highest clinical benefit compared to MRI scoring alone or in combination with PSAD (Fig. 5).
Table 3.
Validation set (n = 228*) | |||||
Biopsy strategy | Men biopsied | Biopsies avoided | ISUP GG ≥ 2 cancers | Biopsies without ISUP GG ≥ 2 cancers (FP) | |
Detected (TP) | Missed (FN) | ||||
PI-RADS 1-5 (reference) | 228 | 0 | 120 | 0 | 108 (47%) |
PI-RADS 3-5 | 202 (89%) | 26 (11%) | 120 (100%) | 0 (0%) | 82 (41%) |
PI-RADS 4-5 | 181 (79%) | 47 (21%) | 117 (98%) | 3 (2%) | 64 (35%) |
PI-RADS 4-5 + 3 & PSAD ≥ 0.15 | 193 (85%) | 35 (15%) | 120 (100%) | 0 (0%) | 73 (38%) |
PI-RADS 4-5 + 3 & sPSAD ≥ 0.42 | 188 (83%) | 40 (18%) | 120 (100%) | 0 (0%) | 68 (36%) |
Hold-out test set (n = 389) | |||||
Biopsy strategy | Men biopsied | Biopsies avoided | ISUP GG ≥ 2 cancers | Biopsies without ISUP GG ≥ 2 cancers (FP) | |
Detected (TP) | Missed (FN) | ||||
PI-RADS 1-5 (reference) | 389 | 0 | 194 | 0 | 195 (50%) |
PI-RADS 3-5 | 353 (91%) | 36 (9%) | 188 (97%) | 6 (3%) | 165 (47%) |
PI-RADS 4-5 | 295 (76%) | 94 (24%) | 176 (91%) | 18 (9%) | 119 (40%) |
PI-RADS 4-5 + 3 & PSAD ≥ 0.15 | 309 (79%) | 80 (21%) | 180 (93%) | 14 (7%) | 129 (42%) |
PI-RADS 4-5 + 3 & sPSAD ≥ 0.42 | 303 (78%) | 86 (22%) | 180 (93%) | 14 (7%) | 123 (41%) |
Fifteen men were missing a Prostate Imaging-Reporting and Data System (PI-RADS) score in the validation set (*). ISUP GG International Society of Urological Pathology Grade Group, TP true positive, FN false negative, FP false positive
Discussion
PSAD is an independent predictor of GG ≥ 2 cancers and is a valuable asset in BDS combined with PI-RADS scoring [4, 17, 38]. However, a TZ-specific (s)PSAD approach, which specifically accounts for PSA increase due to benign prostatic hyperplasia and larger TZ volume, could be a better predictor of biopsy results. To address this, our study presents an approach using PI-RADS and sPSAD for GG ≥ 2 cancer detection, where TZ volumetry was performed semiautomatically using a DL segmentation tool. sPSAD improved the detection of GG ≥ 2 PCa in an internal and external test set compared to PSAD particularly in men with a grey zone PSAD (PSAD of 0.1–0.2 ng/mL/cc). Our BDS using PI-RADS scoring and sPSAD, which recommended biopsy in men with PI-RADS 4/5 or PI-RADS 3 and sPSAD ≥ 0.42, improved the specificity for GG ≥ 2 detection, while sensitivity was not significantly reduced compared to a BDS of PI-RADS 3-5 only. The developed sPSAD-based BDS was the best-performing strategy for GG ≥ 2 diagnosis in DCA, improving biopsy avoidance and reducing the number of false positives while maintaining high sensitivity compared to preforming biopsy in men with a PI-RADS 3-5 only.
The issue of a high false-positive rate of up to 45% in PI-RADS 3-5 cases has been tackled by developing image-based BDS using PI-RADS scoring and PSAD, aiming to improve the positive predictive value of MRI-guided biopsy. In this context, the EAU guideline recommended prostate biopsy in all men with elevated risk for GG ≥ 2 cancer (prevalence ≥ 30%; with PI-RADS 4-5 or PSAD ≥ 0.2 ng/mL/cc), and biopsy should highly be considered in men with an intermediate risk (≥ 20%; PI-RADS 4-5 or PSAD 0.15–0.2) [17]. These EAU risk groups were confirmed by our study achieving similar yields of GG ≥ 2 disease. However, our results show that a zone-specific approach for PSAD calculation would improve both biopsy avoidance and address the need to reduce false positives on prostate MRI.
The adoption of the proposed sPSAD-based BDS is likely to offer immediate clinical advantages in routine practice. This newly proposed strategy demonstrated the highest clinical yield in a well-balanced cohort comprising both biopsy-naive and biopsy-negative men (validation set), as well as in an MRI-first cohort, where over 80% of the individuals were biopsy-naïve (hold-out test set). Specifically, the employment of sPSAD-based BDS led to a 9% increase in specificity for the detection of GG ≥ 2 PCa, as compared to the BDS using whole gland PSAD, while showing identical sensitivity. Moreover, our approach of using a DL tool for sPSAD calculation proved its feasibility in an external test set and can be easily implemented by minor modifications of currently available software, thus enabling clinical translation of the method.
While the study results are promising, there are limitations. First, clinical implications derived from this single-center investigation may be limited by the retrospective study design. However, this study included a large number of consecutive patients, keeping the risk of a selection bias to a minimum. Moreover, clinical utility and feasibility were tested on an internal and external test set and our results on any-grade PCa and GG ≥ 2 detection using PSAD alone and in the context of PI-RADS-based risk groups were similar to published literature [8, 9, 11, 13]. Second, our analysis is limited by the uncertainty over the prevalence of GG ≥ 2 cancer in the study sample, since men did not undergo saturation biopsy. However, this is a limitation of most PCa diagnosis studies. Templated biopsies are the gold standard in the diagnosis context, but our institution preforms MRI/US fusion-guided targeted and systematic biopsy in men with suspicious lesions at MRI, which is in accordance with the EAU guidelines and the MRI diagnosis pathway [2]. Nonetheless, targeted biopsy was not performed in all patients based on MRI risk estimation. However, if only cases with MRI-targeted biopsy had been included, we would have suffered a selection bias in favor of cases with MRI-positive findings, while MRI-invisible PCa potentially would have been excluded. Third, the retrospective PI-RADS consensus review in this study may have differed from the original report, but cases with distinct radiological-pathological discrepancies were excluded. Thus, the overall impact on diagnostic performance can be considered marginal. Fourth, clinical implications of the sPSAD-based BDS at the accepted risk by the EAU guidelines of missing 10% of csPCa could not be demonstrated as PI-RADS 4/5 alone achieved a sensitivity of > 90% [4]; however, our higher population prevalence of 48% is in line with the results of the PAIREDCAP study [39]. Within these limitations, we demonstrated the initial feasibility of using sPSAD for biopsy decision planning.
In conclusion, our findings underscore the concept and use of a TZ-specific (s)PSAD for image-based prostate biopsy decision planning. sPSAD improved GG ≥ 2 cancer detection when compared to PSAD alone and outperformed PI-RADS only based BDS. Specifically, the implementation of sPSAD-based BDS was able to reduce false positives at MRI and improve biopsy avoidance, while maintaining a high detection rate of clinically significant disease. Thus, the use of sPSAD for biopsy decisions should be evaluated in prospective studies.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
C.A.H., N.L.B., L.J.S., and T.B. are currently fellows of the BIH Charité (Junior Digital) Clinician Scientist Program funded by the Charité-Universitätsmedizin Berlin and the Berlin Institute of Health.
Abbreviations
- 95%CI
95% confidence interval
- AUC
Area under the ROC
- BDS
Biopsy decision strategy
- csPCa
Clinically significant prostate cancer
- DCA
Decision curve analysis
- DL
Deep learning
- EAU
European Association of Urology
- ISUP GG
International Society of Urological Pathology Grade Group
- PI-RADS
Prostate Imaging-Reporting and Data System
- PSAD
Prostate-specific antigen density
- PV
Prostate volume
- ROC
Receiver operating characteristic curve
- sPSAD
TZ-specific PSAD
- TRUS
Transrectal ultrasound
- TZ
Transition zone
- WG
Whole gland
Funding
Open Access funding enabled and organized by Projekt DEAL. The authors state that this work has not received any funding.
Declarations
Guarantor
The scientific guarantor of this publication is Tobias Penzkofer.
Conflict of interest
The authors of this manuscript declare relationships with the following companies, which however did not influence the presented study: A.R.P. Stockholder of Lucida, speakers beaureu for Siemens and research support of Siemens; L.J.S., P.A., B.H., and T.P. receive research grants from the Collaborative Research Center (CRC) 1340 “Matrix in Vision” funded by the Deutsche Forschungsgemeinschaft (DFG). L.J.S. Research grants, travel stipends, and lecture honoraria from Guerbet; B.H. Consulting fees from Canon/Toshiba; travel support from Canon and Bayer; stock options from pharmaceutical and medical technology companies. T.P. AGO, Aprea AB, ARCAGY-GINECO, Astellas Pharma Global Inc. (APGD), Astra Zeneca, Clovis Oncology, Dohme Corp, Holaira, Incyte Corporation, Karyopharm, Lion Biotechnologies, MedImmune, Merck Sharp, Millennium Pharmaceuticals, Morphotec Inc., NovoCure Ltd., PharmaMar S.A. and PharmaMar USA, Roche, Siemens Healthineers, and TESARO.
Prof. Bernd Hamm is the Editor-in-Chief of European Radiology. As such, he had no role in handling this manuscript and finalizing decisions.
Statistics and biometry
Several authors have significant statistical expertise (G.L.B., A.P., T.P.).
Informed consent
Informed consent was waived given its retrospective design.
Ethical approval
The Institutional Review Board of the Charité Universitätsmedizin Berlin approval was obtained.
Study subjects or cohorts overlap
A subset of the study population investigated here was previously reported in Hamm et al Radiology 2023 (overlap of 75% (1210 men), development of an explainable deep learning system for prostate cancer detection), Beetz et al 2022 (29%, biopsy accuracy correlated with number of biopsy cores), Rudolph et al European Radiology 2020 (28%, validation of PI-RADS features), Beetz et al RöFo 2022 (20%, intra-reader comparison of PI-RADS version 2 and 2.1), and Rudolph 2020 (6%, intra-reader comparison of PI-RADS version 2 and 2.1). However, zone-specific PSAD and biopsy decision strategies using PSAD and PI-RADS were not investigated for prostate cancer detection.
Methodology
• retrospective
• diagnostic or prognostic study
• performed at one institution
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history
4/17/2024
Sections "Development of a biopsy decision strategy using PI-RADS and sPSAD" and "Outcomes and statistics" have been formatted as sub-sections of section "Methods".
References
- 1.Drost FJH, Osses DF, Nieboer D et al (2019) Prostate MRI, with or without MRI-targeted biopsy, and systematic biopsy for detecting prostate cancer. Cochrane Database Systemat Rev. 10.1002/14651858.CD012663.pub2 10.1002/14651858.CD012663.pub2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Padhani AR, Barentsz J, Villeirs G et al (2019) PI-RADS steering committee: the PI-RADS multiparametric MRI and MRI-directed biopsy pathway. Radiology 292:464–474 10.1148/radiol.2019182946 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.NICE Guideline NG131. Prostate caner: diagnosis and management. Updated December 15th 2021. Accessed September 1, 2022.
- 4.Mottet N, van den Bergh RC, Briers E et al (2021) EAU-EANM-ESTRO-ESUR-SIOG guidelines on prostate cancer—2020 update. Part 1: screening, diagnosis, and local treatment with curative intent. Eur Urol 79:243–262 10.1016/j.eururo.2020.09.042 [DOI] [PubMed] [Google Scholar]
- 5.Turkbey B, Rosenkrantz AB, Haider MA et al (2019) Prostate imaging reporting and data system version 2.1: 2019 update of prostate imaging reporting and data system version 2. Eur Urol 76:340–351 10.1016/j.eururo.2019.02.033 [DOI] [PubMed] [Google Scholar]
- 6.Schoots IG, Padhani AR (2020) Personalizing prostate cancer diagnosis with multivariate risk prediction tools: how should prostate MRI be incorporated? World J Urol 38:531–545 10.1007/s00345-019-02899-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Barkovich EJ, Shankar PR, Westphalen AC (2019) A systematic review of the existing prostate imaging reporting and data system version 2 (PI-RADSv2) literature and subset meta-analysis of PI-RADSv2 categories stratified by Gleason scores. AJR Am J Roentgenol 212:847–854 10.2214/AJR.18.20571 [DOI] [PubMed] [Google Scholar]
- 8.Hansen NL, Barrett T, Kesch C et al (2018) Multicentre evaluation of magnetic resonance imaging supported transperineal prostate biopsy in biopsy-naïve men with suspicion of prostate cancer. BJU Int 122:40–49 10.1111/bju.14049 [DOI] [PubMed] [Google Scholar]
- 9.Falagario UG, Martini A, Wajswol E et al (2020) Avoiding unnecessary magnetic resonance imaging (MRI) and biopsies: negative and positive predictive value of MRI according to prostate-specific antigen density, 4Kscore and risk calculators. Eur Urol Oncol 3:700–704 10.1016/j.euo.2019.08.015 [DOI] [PubMed] [Google Scholar]
- 10.Thompson IM, Pauler DK, Goodman PJ et al (2004) Prevalence of prostate cancer among men with a prostate-specific antigen level≤ 4.0 ng per milliliter. N Engl J Med 350:2239–2246 10.1056/NEJMoa031918 [DOI] [PubMed] [Google Scholar]
- 11.Boesen L, Nørgaard N, Løgager V et al (2019) Prebiopsy biparametric magnetic resonance imaging combined with prostate-specific antigen density in detecting and ruling out Gleason 7–10 prostate cancer in biopsy-naïve men. Eur Urol Oncol 2:311–319 10.1016/j.euo.2018.09.001 [DOI] [PubMed] [Google Scholar]
- 12.Fascelli M, Rais-Bahrami S, Sankineni S et al (2016) Combined biparametric prostate magnetic resonance imaging and prostate-specific antigen in the detection of prostate cancer: a validation study in a biopsy-naive patient population. Urology 88:125–134 10.1016/j.urology.2015.09.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Knaapila J, Jambor I, Perez IM et al (2020) Prebiopsy IMPROD biparametric magnetic resonance imaging combined with prostate-specific antigen density in the diagnosis of prostate cancer: an external validation study. Eur Urol Oncol 3:648–656 10.1016/j.euo.2019.08.008 [DOI] [PubMed] [Google Scholar]
- 14.Van Poppel H, Roobol MJ, Chapple CR et al (2021) Prostate-specific antigen testing as part of a risk-adapted early detection strategy for prostate cancer: European Association of Urology position and recommendations for 2021. Eur Urol 80:703–711 10.1016/j.eururo.2021.07.024 [DOI] [PubMed] [Google Scholar]
- 15.Maggi M, Panebianco V, Mosca A et al (2020) Prostate imaging reporting and data system 3 category cases at multiparametric magnetic resonance for prostate cancer: a systematic review and meta-analysis. Eur Urol Focus 6:463–478 10.1016/j.euf.2019.06.014 [DOI] [PubMed] [Google Scholar]
- 16.Yusim I, Krenawi M, Mazor E, Novack V, Mabjeesh NJ (2020) The use of prostate specific antigen density to predict clinically significant prostate cancer. Sci Rep 10:1–6 10.1038/s41598-020-76786-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schoots IG, Padhani AR (2021) Risk-adapted biopsy decision based on prostate magnetic resonance imaging and prostate-specific antigen density for enhanced biopsy avoidance in first prostate cancer diagnostic evaluation. BJU Int 127:175 10.1111/bju.15277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dianat SS, Ruiz RMR, Bonekamp D, Carter HB, Macura KJ (2013) Prostate volumetric assessment by magnetic resonance imaging and transrectal ultrasound: impact of variation in calculated prostate-specific antigen density on patient eligibility for active surveillance program. J Comput Assist Tomogr 37:589–595 10.1097/RCT.0b013e318296af5f [DOI] [PubMed] [Google Scholar]
- 19.Hamzaoui D, Montagne S, Granger B et al (2022) Prostate volume prediction on MRI: tools, accuracy and variability. Eur Radiol:1-11 [DOI] [PubMed]
- 20.Ko JS, Landis P, Carter HB, Partin AW (2011) Effect of intra-observer variation in prostate volume measurement on prostate-specific antigen density calculations among prostate cancer active surveillance participants. BJU Int 108:1739–1742 10.1111/j.1464-410X.2011.10223.x [DOI] [PubMed] [Google Scholar]
- 21.Lee JS, Chung BH (2007) Transrectal ultrasound versus magnetic resonance imaging in the estimation of prostate volume as compared with radical prostatectomy specimens. Urol Int 78:323–327 10.1159/000100836 [DOI] [PubMed] [Google Scholar]
- 22.Kalish J, Cooner WH, Graham SD Jr (1994) Serum PSA adjusted for volume of transition zone (PSAT) is more accurate than PSA adjusted for total gland volume (PSAD) in detecting adenocarcinoma of the prostate. Urology 43:601–606 10.1016/0090-4295(94)90170-8 [DOI] [PubMed] [Google Scholar]
- 23.Nadler RB, Humphrey PA, Smith DS, Catalona WJ, Ratliff TL (1995) Effect of inflammation and benign prostatic hyperplasia on elevated serum prostate specific antigen levels. J Urol 154:407–413 10.1016/S0022-5347(01)67064-2 [DOI] [PubMed] [Google Scholar]
- 24.Castro HA, Iared W, Santos JEM, Solha RS, Shigueoka DC, Ajzen SA (2018) Impact of PSA density of transition zone as a potential parameter in reducing the number of unnecessary prostate biopsies in patients with PSA levels between 2.6 and 10.0 ng/mL. Int Braz J Urol 44:709–716 10.1590/s1677-5538.ibju.2017.0506 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sunoqrot MR, Saha A, Hosseinzadeh M, Elschot M, Huisman H (2022) Artificial intelligence for prostate MRI: open datasets, available applications, and grand challenges. Eur Radiol Exp 6:1–13 10.1186/s41747-022-00288-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Aldoj N, Biavati F, Michallek F, Stober S, Dewey M (2020) Automatic prostate and prostate zones segmentation of magnetic resonance images using DenseNet-like U-net. Sci Rep 10:14315 10.1038/s41598-020-71080-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cuocolo R, Cipullo MB, Stanzione A et al (2019) Machine learning applications in prostate cancer magnetic resonance imaging. Eur Radiol Exp 3:1–8 10.1186/s41747-019-0109-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cuocolo R, Comelli A, Stefano A et al (2021) Deep learning whole-gland and zonal prostate segmentation on a public MRI dataset. J Magn Reson Imaging 54:452–459 10.1002/jmri.27585 [DOI] [PubMed] [Google Scholar]
- 29.Thimansson E, Bengtsson J, Baubeta E et al (2023) Deep learning algorithm performs similarly to radiologists in the assessment of prostate volume on MRI. Eur Radiol 33:2519–2528 10.1007/s00330-022-09239-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hamm CA, Baumgärtner GL, Biessmann F et al (2023) Interactive explainable deep learning model informs prostate cancer diagnosis at MRI. Radiology 0:222276 [DOI] [PubMed]
- 31.Natarajan S, Priester A, Margolis D, Huang J, Marks L (2020) Prostate MRI and ultrasound with pathology and coordinates of tracked biopsy (prostate-MRI-US-biopsy). Cancer Imaging Arch 10:7937 [Google Scholar]
- 32.de Rooij M, Israël B, Tummers M et al (2020) ESUR/ESUI consensus statements on multi-parametric MRI for the detection of clinically significant prostate cancer: quality requirements for image acquisition, interpretation and radiologists’ training. Eur Radiol 30:5404-5416 %@ 1432-1084 [DOI] [PMC free article] [PubMed]
- 33.Franiel T, Asbach P, Beyersdorff D et al (2021) mpMRI of the prostate (MR-Prostatography): updated recommendations of the DRG and BDR on patient preparation and scanning protocol. Rofo 193:763–777 [DOI] [PubMed]
- 34.Hofbauer SL, Maxeiner A, Kittner B et al (2018) Validation of prostate imaging reporting and data system version 2 for the detection of prostate cancer. J Urol 200:767–773 10.1016/j.juro.2018.05.003 [DOI] [PubMed] [Google Scholar]
- 35.Wolf I, Vetter M, Wegner I et al (2005) The Medical Imaging Interaction Toolkit. Med Image Anal 9:594–604 10.1016/j.media.2005.04.005 [DOI] [PubMed] [Google Scholar]
- 36.Isensee F, Jaeger PF, Kohl SA, Petersen J, Maier-Hein KH (2021) nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nature Meth 18:203–211 10.1038/s41592-020-01008-z [DOI] [PubMed] [Google Scholar]
- 37.DeLong ER, DeLong DM, Clarke-Pearson DL (1988) Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44:837–845 [PubMed]
- 38.Falagario UG, Jambor I, Lantz A et al (2021) Combined use of prostate-specific antigen density and magnetic resonance imaging for prostate biopsy decision planning: a retrospective multi-institutional study using the prostate magnetic resonance imaging outcome database (PROMOD). Eur Urol Oncol 4:971–979 [DOI] [PubMed]
- 39.Elkhoury FF, Felker ER, Kwan L et al (2019) Comparison of targeted vs systematic prostate biopsy in men who are biopsy naive: the prospective assessment of image registration in the diagnosis of prostate cancer (PAIREDCAP) study. JAMA Surg 154:811–818 10.1001/jamasurg.2019.1734 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.