Journal of Clinical Medicine. 2022 Oct 26;11(21):6304. doi: 10.3390/jcm11216304

Radiomics in PI-RADS 3 Multiparametric MRI for Prostate Cancer Identification: Literature Models Re-Implementation and Proposal of a Clinical–Radiological Model

Andrea Corsi 1,2,*, Elisabetta De Bernardi 3,4, Pietro Andrea Bonaffini 1,2, Paolo Niccolò Franco 1,2, Dario Nicoletta 1,2, Roberto Simonini 1,2, Davide Ippolito 2,5, Giovanna Perugini 1, Mariaelena Occhipinti 6, Luigi Filippo Da Pozzo 2,7, Marco Roscigno 2,7, Sandro Sironi 1,2
Editor: Theodoros Tokas
PMCID: PMC9656103  PMID: 36362530

Abstract

The clinical management of PI-RADS 3 prostate lesions is still debated, with high variability among centers. Identifying clinically significant tumors among PI-RADS 3 lesions is crucial. Radiomics applied to multiparametric MRI (mpMRI) seems promising; nevertheless, reproducibility must be assessed by external validation. We retrospectively included all patients with at least one PI-RADS 3 lesion (PI-RADS v2.1) detected on a 3T prostate MRI scan at our Institution (June 2016–March 2021). MRI-targeted biopsy was used as ground truth. We assessed reproducible mpMRI radiomic features found in the literature. Then, we proposed a new model combining PSA density and two radiomic features (texture regularity (T2) and size zone heterogeneity (ADC)). All models were trained and assessed through 100 repetitions of 5-fold cross-validation. Eighty patients were included (26 with GS ≥ 7). In total, 9/20 T2 features (Hectors’ model) and 1 T2 feature (Jin’s model) significantly correlated with biopsy on our dataset. PSA density alone predicted clinically significant tumors (sensitivity: 66%; specificity: 71%). Our model obtained a sensitivity of 80% and a specificity of 76%. Standard-compliant works with detailed methodologies achieve comparable radiomic feature sets. Therefore, efforts to facilitate reproducibility are needed, whereas complex models and imaging protocols do not seem to be required, since our model combining PSA density and two radiomic features from routinely performed sequences appeared to differentiate clinically significant cancers.

Keywords: PI-RADS 3, prostate cancer, MRI, radiomics, texture analysis

1. Introduction

Prostate cancer (PC) is the second leading tumor in the male population worldwide [1]. Multiparametric magnetic resonance imaging (mpMRI) is currently the gold standard for prostate cancer imaging. It has proven helpful in early diagnosis and is employed in the evaluation of prostate gland lesions, local T-staging or recurrence, and in the assessment of pelvic lymph node involvement [2], along with the Prostate Imaging Reporting and Data System version 2.1 (PI-RADS v2.1) guidelines [3]. Many studies have demonstrated a high correlation between PI-RADS and the Gleason score (GS) of prostate lesions [4,5,6,7]. However, while PI-RADS 4/5 lesions are considered highly suspicious for neoplasia, the presence of clinically significant cancer in PI-RADS 3 lesions is equivocal (16–21% reported prevalence) [8,9]. Consequently, there is no consensus on the clinical management of PI-RADS 3 lesions, with high variability in the protocols used in different centers [10]. A prostate biopsy is mandatory for diagnosis, but it is associated with possible complications (prostatitis, urinary tract infections, and sepsis), which may lead to hospitalization and, in the worst cases, even death. Therefore, it is crucial to promptly identify clinically significant tumors (i.e., lesions with a Gleason Score (GS) ≥ 7, according to current literature [11]) among PI-RADS 3 lesions [12,13].

Some single-center studies in the literature have tried to exploit mpMRI radiomic analysis to identify clinically significant prostate cancer (csPCa) with promising results [14,15,16,17,18,19]. However, each center found its own radiomic features pool, likely due to high variability in center-specific population features, gold-standard definition rules, scanners, acquisition parameters, lesion contouring, image preprocessing, and machine learning techniques [20,21]. Furthermore, single-center datasets are almost always unavoidably small, increasing the risk of scarcely robust internal validation. Two papers on PI-RADS 3–5 recently showed that single-center models have a significant performance drop when applied to other centers’ data [22,23]. Efforts must therefore be made to (1) standardize as much as possible (as in radiomic features computation) [24]; (2) build large and multi-center datasets; (3) share developed models for external validation. This will allow us to understand whether general models can work even with center-specific variabilities or if center-specific models are needed instead.

On this basis, the aim of this work is twofold: (a) to assess reproducible csPCa identification models found in the literature on an independent 80-patient dataset while providing details on their architectures; (b) to propose a new csPCa identification model for external validation based on robustly selected and easily obtainable radiomic and clinical features.

2. Materials and Methods

2.1. Study Population

We retrospectively retrieved medical and radiological data from our Institution’s Electronic Medical Records. The initial population included 945 males who underwent prostate MRI on urological indication (June 2016–March 2021) for suspected malignancy or active surveillance. From the original cohort, 706 patients were excluded for the following reasons: (a) lack of one/more PI-RADS 3 lesion(s) as per PI-RADS v2.1 (n = 691); (b) no histopathological data within twelve months from the MRI scan (n = 11); (c) poor image quality of the diffusion-weighted (DWI) and/or T2-weighted sequences (n = 2) or of the apparent diffusion coefficient (ADC) map (n = 1). Accordingly, the final cohort included 80 males.

We collected the following clinical and laboratory data (Table 1): age, the most recent serological value of prostate-specific antigen (PSA; ng/mL), PSA density (total PSA/prostatic volume ratio), final histopathological analysis, and mean ADC value (mm²/s) calculated in a single 2D region of interest (ROI), i.e., the largest trackable circular area in the center of the lesion without exceeding the lesion margins.

Table 1.

Characteristics of the final study population.

Population Data
Total, n 80
Age (years), average ± SD (range) 65.2 ± 7.6 (45–81)
PSA (ng/mL), average ± SD (range) 6.8 ± 4.8 (0.5–29.6)
PSA Density, average ± SD (range) 0.15 ± 0.15 (0.01–1.09)
Mean ADC value within 2D ROI (mm²/s) 0.000825 ± 0.000253 (0.00026–0.00141)
PI-RADS 3 lesions histology, n/total (%)
 GS ≥ 3 + 4 26/80 (32.5%)
 GS ≤ 3 + 3 16/80 (20.0%)
Negative, BPH, atrophy 38/80 (47.5%)
Site of PI-RADS 3 lesions, n/total (%)
 Transitional zone 14/80 (17.5%)
 Peripheral zone 66/80 (82.5%)

PSA: prostate-specific antigen; PSA density is obtained by dividing PSA levels (ng/mL) by the volume of the prostatic gland (mL); PI-RADS: Prostate Imaging-Reporting and Data System; BPH: benign prostatic hyperplasia.
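The PSA density definition given in the table note can be written as a one-line computation. The following is a minimal sketch; the function name and the example values are hypothetical and are not taken from the study data.

```python
def psa_density(psa_ng_ml: float, prostate_volume_ml: float) -> float:
    """PSA density: total PSA (ng/mL) divided by prostate gland volume (mL)."""
    if prostate_volume_ml <= 0:
        raise ValueError("prostate volume must be positive")
    return psa_ng_ml / prostate_volume_ml


# Hypothetical patient (illustrative values only): PSA 6.0 ng/mL, gland volume 40 mL
example_density = psa_density(6.0, 40.0)
```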

2.2. MR Protocol and PI-RADS 3 Lesion Selection

Prostate MRIs were performed on a 3T scanner (Discovery MR750w GEM, GE Healthcare, Chicago, IL, USA), using a 16-channel pelvic anterior-array coil (GE Healthcare, Chicago, IL, USA), with the patient supine. As per PI-RADS v2.1 criteria [3], MRIs were performed at least six weeks after any prostatic biopsy to avoid possible diagnostic errors due to post-procedural bleeding foci. The standard MRI protocol is summarized in Table 2.

Table 2.

MRI acquisition parameters.

Weighting | T1-w | T1-w | T2-w | T2-w | DWI
Acquisition plane | Axial | Axial | Axial, coronal, sagittal | Axial | Axial
Sequence | Fast spin-echo (SSFSE) | Gradient-recalled echo (GRE); before and after intravenous contrast (DCE) | Fast relaxation fast spin echo (FR-FSE) | Single-shot fast spin echo (SS-FSE) | b values: 50, 1000, 2000 s/mm²
Slice thickness | 4 mm | 3 mm | 3 mm | 4 mm | 3 mm
Covered area | Pelvis | Prostate lodge and seminal vesicles | Prostate lodge and seminal vesicles | Pelvis | Prostate lodge

DWI: diffusion-weighted imaging; GRE: gradient-recalled echo; DCE: dynamic contrast enhancement; FRFSE: fast relaxation fast spin echo; SSFSE: single-shot fast spin echo.

Blinded to pathological data, two radiology residents (A.C., P.N.F.; 3 years of experience) reviewed all MRIs in consensus, based on the current standard of care, considering the appearance of the lesions in the T2-w, DWI, ADC, and DCE sequences as per PI-RADS v2.1 [3]. For each patient, we selected a single target lesion (the largest one in case of multiple lesions). A board-certified radiologist (P.A.B.; 10 years of experience) validated the selection.

2.3. Pathological Examination

Each patient underwent a targeted biopsy of PI-RADS 3 lesions (4 cores) at our Institution. Biopsies were executed by a single operator with a total experience of more than 500 target fusion biopsies. We used the trans-rectal access and fusion technique with the reference MRI, a MyLabClassC ultrasound machine, and a virtual navigator fusion system (Esaote S.p.A., Genova, Italy) equipped with an end-fire endorectal probe. Additional systematic biopsies (12–16 cores, according to the following prostate volume: ≤60 mL vs. >60 mL) were performed (Figure 1 and Figure 2) [25]. It was thus possible to choose the prostate parenchymal tissue corresponding to the PI-RADS 3 target lesion as the reference standard. Gleason Score was assigned per 2005 ISUP recommendations (International Society of Urological Pathology) [26]. Each PCa-positive biopsy was evaluated according to the International Society of Urological Pathology 2014 consensus Gleason Grade Group system [11].

Figure 1.

Scheme of systematic template for prostate biopsy. Black dots represent systematic biopsies. Blue dots represent additional systematic biopsies according to prostate volume (>60 mL).

Figure 2.

Illustrations of MRI/TRUS fusion biopsy. (A,B) Peripheral zone target biopsy: (A) trans-rectal ultrasound showing the location of the two lesions (orange and blue dots); (B) same lesions depicted in a T2-w MR (orange and blue dots). (C–F) Anterior zone target biopsy: (C) trans-rectal ultrasound showing the location of the two lesions (orange and blue dots); (D) same lesions depicted in a T2-w MR (orange and blue dots); (E) fusion image overlapping T2-w MR image on top of transrectal ultrasound (lesions represented as orange and blue dots); (F) ADC map of the corresponding lesions.

2.4. Lesion Segmentation

Anonymized DICOM files of FRFSE-T2-weighted sequences, DWI 2000 s/mm2 sequences, and ADC maps were exported and loaded on dedicated segmentation software, ITK-SNAP 3.8.0 (PICSL, University of Pennsylvania, Philadelphia, PA, USA) [27]. The 3D ROIs were manually delineated on every target lesion (Figure 3), both on T2-weighted sequences and DWI sequences/ADC maps in consensus by two radiology residents (A.C. and P.N.F.; 3 years of experience), and then validated by a board-certified radiologist (P.A.B.; 10 years of experience). Peripheral zone lesions were visible on both T2-weighted and DWI sequences/ADC maps. When a transitional zone lesion was not readily discernible on DWI/ADC maps, the segmentation area was delineated according to that traced on the T2-weighted sequence. An additional 3D ROI for each patient was outlined in the peripheral prostate zone to normalize intensity, avoiding potential focal lesions. Images were all corrected for magnetic field inhomogeneity (algorithm N4, 3D Slicer, http://www.slicer.org (accessed on 17 September 2022)).

Figure 3.

A 64-year-old patient with a PI-RADS 3 lesion in the left mid-gland peripheral zone. (A) Lesion on T2-w sequence depicted as a low-signal 5-mm nodule (white arrowhead); (B) same lesion highlighted on ADC map (grey arrowhead); (C) manual segmentation on ITK-SNAP (red label). Target biopsy revealed fibrosis with focal atrophy without evidence of prostate cancer.

2.5. Reproducible Literature Models Search and Assessment

We searched the literature for papers applying mpMRI radiomics as a tool to identify csPCa among PI-RADS 3 lesions. The following inclusion criteria were used: (1) PI-RADS 3 lesions identified according to PI-RADS v2.1 guidelines; (2) targeted biopsy as ground truth; (3) usage of IBSI-compliant tools for radiomic features computation; (4) adequate description of the methodological details (resampling grid, parameters in radiomic feature computation, selected radiomic features list, and model hyperparameters). Selected works’ details are reported in Table 3.

Table 3.

Selected models’ details.

Reference | Hectors 2021 [16] | Jin 2022 [19]
Number of subjects | 240 | 103
Scanner | 3T (GE Signa, Siemens Skyra) | 3T (Siemens Skyra)
Endorectal coil | No | No
Radiomics MR sequences | T2 | T2, ADC, DWI (1500 s/mm²)
ROIs | 3D (1 operator) | 3D (2 operators) on T2 (ADC/DWI registered to T2)
Radiomics platform | Pyradiomics | FeAture Explorer (Pyradiomics)
Intensity normalization | [μ − 3σ:μ + 3σ] inside the VOI | (x − μ)/σ
Resampling | 0.5 × 0.5 × 0.5 mm³ | 1 × 1 × 1 mm³
Quantization | 64 bins | Not specified
Model assessment | Cross-validation + independent test set | Independent test set
Selected radiomic feature details | Yes (20 features) | Yes (4 features)
Clinical parameters in the model | No | Yes (PSA, age)
Model | Random forest with SMOTE | Logistic regression
Radiomics model performances (test set) | AUC 0.76; sensitivity 75.0%; specificity 79.6% | AUC 0.88; sensitivity 83%; specificity 65%

We extracted radiomic features from our 80-patient dataset using the work-specific processing and parameters for each work. We assessed the correlation between selected features and biopsy results through a univariate Mann–Whitney test applied to the entire patient sample. Then, pending the trained model availability, we retrained a model with the work-specific attributes and the work-specific input features on our 80-patient dataset, employing 100 repetitions of 5-fold stratified cross-validation and providing results in terms of sensitivity and specificity on the 500 validation sets. The following details are provided.
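As an illustration of the univariate test mentioned above, the following is a minimal standard-library sketch of a two-sided Mann–Whitney U test. It assumes the normal approximation with tie correction and no continuity correction, which may differ slightly from the implementation actually used in the study; the function names and the example values are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values receive the mean of the positions they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def mann_whitney_u(x, y):
    """Return (U statistic of x, two-sided p-value) using the normal approximation."""
    n1, n2 = len(x), len(y)
    pooled = list(x) + list(y)
    ranks = average_ranks(pooled)
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    n = n1 + n2
    # Tie correction: sum of (t^3 - t) over groups of identical values
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u1, 1.0  # all values identical: no evidence of a difference
    z = (u1 - n1 * n2 / 2.0) / math.sqrt(var_u)
    return u1, math.erfc(abs(z) / math.sqrt(2.0))


# Hypothetical feature values for csPCa vs. non-csPCa lesions (illustrative only)
u_stat, p_value = mann_whitney_u([1, 2, 3, 4, 5], [6, 7, 8, 9, 10])
```

In practice, a tested library routine would normally be preferred; the sketch only makes explicit what the test computes for each feature.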

2.5.1. T2-Based Hectors et al. Model

T2 images were normalized to range between mean ± 3σ (standard deviation) of the intensity in the volume of interest (VOI), resampled on a 0.5 × 0.5 × 0.5 mm3 voxel grid, and discretized to 64 bins. The selected 20 radiomic features were used as input in a scikit-learn (https://scikit-learn.org/stable/ (accessed on 17 September 2022)) Random Forest Classifier (maximum depth = 16; maximum number of features = none; minimum number of samples per leaf = 2; minimum number of samples required to split = 2; maximum number of leaf nodes = 16). A SMOTE oversampling of the minority class was adopted.
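The intensity preprocessing described above can be sketched as follows. This is only an illustration under stated assumptions: "normalized to range between mean ± 3σ" is interpreted here as clipping VOI intensities to [μ − 3σ, μ + 3σ], discretization is implemented as a fixed bin number scheme, and the resampling step is omitted. The function names are hypothetical and the code does not call Pyradiomics.

```python
import statistics


def clip_to_three_sigma(voi_intensities):
    """Clip intensities to [mu - 3*sigma, mu + 3*sigma] computed inside the VOI (assumed reading)."""
    mu = statistics.fmean(voi_intensities)
    sigma = statistics.pstdev(voi_intensities)
    lo, hi = mu - 3.0 * sigma, mu + 3.0 * sigma
    return [min(max(v, lo), hi) for v in voi_intensities]


def fixed_bin_number(values, n_bins=64):
    """Map intensities to integer gray levels 1..n_bins over the [min, max] range."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [1] * len(values)  # flat region: a single gray level
    width = (hi - lo) / n_bins
    # The maximum intensity falls into the last bin rather than a bin n_bins + 1
    return [min(int((v - lo) / width) + 1, n_bins) for v in values]


# Hypothetical VOI intensities with one outlier (illustrative only)
voi = [float(v) for v in range(100)] + [10000.0]
clipped = clip_to_three_sigma(voi)
gray_levels = fixed_bin_number(clipped, n_bins=64)
```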

2.5.2. T2 and DWI-Based Jin et al. Model

T2 and DWI image intensities were standardized; images were then resampled on a 1 × 1 × 1 mm³ voxel grid. Since the paper did not specify image discretization details, we used the Pyradiomics default. The 4 selected radiomic features, along with patient age and PSA, were normalized using Z normalization and given as input to a scikit-learn Logistic Regression classifier.
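The Z normalization of the model inputs, (x − μ)/σ per feature, can be sketched as below. One assumption is made that the text does not state: μ and σ are estimated on the training rows only and then applied to the validation rows, which avoids information leakage across cross-validation folds. The function names and values are hypothetical.

```python
import statistics


def fit_z_params(train_rows):
    """Per-column mean and population standard deviation of the training rows."""
    columns = list(zip(*train_rows))
    means = [statistics.fmean(c) for c in columns]
    stds = [statistics.pstdev(c) for c in columns]
    return means, stds


def apply_z(rows, means, stds):
    """Apply (x - mean) / std column-wise; constant columns are mapped to 0."""
    return [
        [(v - m) / s if s > 0 else 0.0 for v, m, s in zip(row, means, stds)]
        for row in rows
    ]


# Hypothetical rows of (age in years, PSA in ng/mL); illustrative values only
train = [[60.0, 4.0], [70.0, 8.0]]
means, stds = fit_z_params(train)
train_z = apply_z(train, means, stds)
validation_z = apply_z([[65.0, 10.0]], means, stds)
```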

2.6. Proposal of a New Model to Be Validated by Other Centers

As clinical parameters, we assessed PSA, PSA density, age, and mean ADC value within a 2D ROI. Regarding radiomic features, we normalized T2 and ADC images by dividing voxel intensities by the average intensity computed within the corresponding normalization ROI. T2-weighted and ADC volumes were resampled on a 0.4 × 0.4 × 3.0 mm³ and a 0.8 × 0.8 × 3.0 mm³ voxel grid, respectively, through b-spline interpolation. In total, 958 radiomic features per patient were computed with Pyradiomics on original volumes (32-bin quantization) and on HHH, LLL, HHL, and LLH coif1 wavelet decompositions (8-bin quantization for T2 and 16-bin quantization for ADC). The clinical and radiomic features most robustly related to GS were selected by randomly dividing the 80 patients into 5 groups 100 times (maintaining the csPCa balance). In each of the 500 feature selection trials (4 groups at a time, 64 patients), the Mann–Whitney test assessed the univariate association between clinical/radiomic features and biopsy results, and the Spearman rank correlation was computed between features. The feature with the smallest univariate p-value was selected first. Then, features with increasing p-values (if ≤0.01) were added only if their absolute Spearman rank correlation with every already selected feature was <0.5. The final selected feature pool contains the features picked most often across the 500 trials.
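The selection rule applied within each trial, and the selection rate computed across trials, can be sketched as follows. The sketch assumes that the Mann–Whitney p-values and the absolute Spearman correlations have already been computed for the trial, and it takes the text literally in letting the feature with the smallest p-value enter without a p-value check. All names and numbers in the example are hypothetical.

```python
from collections import Counter
from itertools import combinations


def select_features(p_values, abs_rho, p_max=0.01, rho_max=0.5):
    """Greedy selection for one trial.

    p_values: {feature name: univariate p-value}
    abs_rho:  {frozenset({a, b}): absolute Spearman correlation between a and b}
    """
    ranked = sorted(p_values, key=p_values.get)
    if not ranked:
        return []
    selected = [ranked[0]]  # smallest p-value is selected first
    for name in ranked[1:]:
        if p_values[name] > p_max:
            break  # remaining features have even larger p-values
        if all(abs_rho[frozenset((name, s))] < rho_max for s in selected):
            selected.append(name)
    return selected


def selection_rates(trial_selections):
    """Fraction of trials in which each feature was selected."""
    counts = Counter(name for sel in trial_selections for name in set(sel))
    return {name: c / len(trial_selections) for name, c in counts.items()}


# Hypothetical trial (illustrative values only)
p = {"psa_density": 0.001, "t2_a": 0.004, "t2_b": 0.005, "adc_a": 0.008, "age": 0.3}
rho = {frozenset(pair): 0.1 for pair in combinations(p, 2)}
rho[frozenset(("t2_a", "t2_b"))] = 0.9  # two strongly correlated T2 features
chosen = select_features(p, rho)
rates = selection_rates([chosen, ["psa_density"]])
```

In this hypothetical trial, `t2_b` is skipped because it is strongly correlated with the already selected `t2_a`, and `age` is skipped because its p-value exceeds 0.01.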

Univariate and multivariate models’ definitions and assessments were performed through 100 repetitions of a 5-fold stratified cross-validation scheme. Univariate models were defined by selecting thresholds maximizing the Youden index on training sets (4 groups, 64 patients) and assessed in terms of sensitivity and specificity on validation sets (1 group, 16 patients). The mean and standard deviation of sensitivity and specificity over the 500 validation trials were finally reported for each selected feature. For multivariate analysis, the following six classification models were considered: linear discriminant, linear, quadratic, and cubic support vector machine (SVM), classification tree, and K-nearest neighbours (KNN). All the possible feature combinations from the selected feature pool were assessed as classification model inputs. Models and optimal thresholds were identified on training sets and evaluated in terms of sensitivity and specificity on the corresponding 500 validation sets. Finally, a model to be shared for external validation was trained on the entire dataset. All the analysis was implemented in scikit-learn.
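The univariate assessment scheme described above can be sketched with the standard library as follows: stratified folds are drawn repeatedly, a cut-off maximizing the Youden index J = sensitivity + specificity − 1 is chosen on each training set, and sensitivity and specificity are then measured on the held-out fold. The fold-dealing strategy, the use of midpoints between observed values as candidate cut-offs, and all function names are assumptions made for illustration; the study's analysis was implemented in scikit-learn.

```python
import random


def sens_spec(scores, labels, threshold, positive_if_greater=True):
    """Sensitivity and specificity of one cut-off (labels: 1 = csPCa, 0 = other)."""
    tp = fn = tn = fp = 0
    for s, y in zip(scores, labels):
        pred = s > threshold if positive_if_greater else s < threshold
        if y == 1:
            tp += pred
            fn += not pred
        else:
            fp += pred
            tn += not pred
    sens = tp / (tp + fn) if tp + fn else float("nan")
    spec = tn / (tn + fp) if tn + fp else float("nan")
    return sens, spec


def youden_threshold(scores, labels, positive_if_greater=True):
    """Cut-off (midpoint between observed values) maximizing the Youden index."""
    values = sorted(set(scores))
    candidates = [(a + b) / 2.0 for a, b in zip(values, values[1:])] or values
    best_t, best_j = candidates[0], float("-inf")
    for t in candidates:
        sens, spec = sens_spec(scores, labels, t, positive_if_greater)
        j = sens + spec - 1.0
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j


def repeated_stratified_folds(labels, n_splits=5, n_repeats=100, seed=0):
    """Yield (training indices, validation indices), preserving the class balance."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    for _ in range(n_repeats):
        rng.shuffle(pos)
        rng.shuffle(neg)
        folds = [[] for _ in range(n_splits)]
        for k, idx in enumerate(pos + neg):  # deal indices round-robin
            folds[k % n_splits].append(idx)
        for v in range(n_splits):
            train = [i for f, fold in enumerate(folds) if f != v for i in fold]
            yield train, folds[v]


# Class balance of the study cohort: 26 csPCa out of 80 patients
labels = [1] * 26 + [0] * 54
splits = list(repeated_stratified_folds(labels))  # 100 x 5 = 500 pairs
```

For a feature that is lower in csPCa, `positive_if_greater=False` would be passed so that values below the cut-off are classified as positive.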

3. Results

Clinical-pathological results are described in Table 1. The detection rate for csPCa (Gleason score ≥ 3 + 4) at targeted and systematic biopsies in the case of PI-RADS 3, 4, and 5 was 32, 46, and 67%, respectively.

3.1. Assessment of Literature Features/Models

In Table 4, we reported the univariate association between radiomic features contained in Hectors’ [16] and Jin’s [19] models and biopsy results in our 80-patient dataset.

Table 4.

Univariate association between radiomic features contained in Hectors’ [16] and Jin’s [19] models and biopsy results in our 80-patient dataset; features with p-value ≤ 0.05 are in bold.

Hectors’ Features p-Value
T2-original_shape_Elongation 0.13
T2-original_shape_Flatness 0.14
T2-original_firstorder_10Percentile 0.94
T2-original_firstorder_InterquartileRange 0.40
T2-original_firstorder_Mean 0.43
T2-original_firstorder_Median 0.51
T2-original_firstorder_RootMeanSquared 0.38
T2-original_glcm_Autocorrelation 0.01
T2-original_glcm_DifferenceEntropy 0.06
T2-original_glcm_InverseVariance 0.02
T2-original_glcm_JointAverage 0.01
T2-original_glcm_JointEnergy 0.04
T2-original_gldm_LargeDependenceLowGrayLevelEmphasis 0.10
T2-original_glrlm_LongRunEmphasis 0.05
T2-original_glrlm_LongRunHighGrayLevelEmphasis 0.01
T2-original_glszm_GrayLevelVariance 0.12
T2-original_glszm_SizeZoneNonUniformity 0.03
T2-original_glszm_SmallAreaEmphasis 0.01
T2-original_ngtdm_Complexity 0.27
T2-original_ngtdm_Strength 0.05
Jin’s Features p-Value
T2-wavelet-HHL_glcm_ClusterTendency 0.005
DWI-original_glcm_Idmn 0.74
DWI-wavelet-LLL_glrlm_LongRunLowGrayLevelEmphasis 0.11
DWI-wavelet-LLL_glszm_SizeZoneNonUniformityNormalized 0.75

Regarding the performance of the re-implemented multivariate models relying on these features, Hectors’ random forest model obtained a sensitivity of 40% ± 21% and a specificity of 71% ± 15%. Jin’s logistic regression model, which combined radiomic features, age, and PSA, obtained a sensitivity of 36% ± 20% and a specificity of 89% ± 10%.

3.2. Proposed Model

In the 500 feature selection trials, the features most often selected and, therefore, most robustly correlated with biopsy were the following: (a) PSA Density (selection rate 100%); (b) a radiomic texture feature computed on the LLL wavelet band of T2-weighted images (T2-wavelet-LLL_glcm_InverseVariance, selection rate 87%); (c) a radiomic texture feature computed on the LLL wavelet band of ADC maps (ADC-wavelet-LLL_glszm_SizeZoneNonUniformity, selection rate 83%). The results of the univariate models’ assessment on the 500 validation trials are shown in Table 5. Univariate performance was as follows: PSA density, 66% ± 21% sensitivity and 71% ± 13% specificity; the T2 radiomic feature (RF-T2), 74% ± 21% sensitivity and 55% ± 15% specificity; the ADC radiomic feature (RF-ADC), 44% ± 19% sensitivity and 83% ± 13% specificity. Table 5 also shows the results obtained by the best multivariate model, a linear discriminant with the three features as input, which obtained a sensitivity of 80% ± 18% and a specificity of 76% ± 13% on the 500 validation trials.

Table 5.

Selected features and performance of univariate and best multivariate models.

Feature/model | Selection Rate | Sensitivity | Specificity
PSA Density | 100% | 66% ± 21% | 71% ± 13%
T2-wavelet-LLL_glcm_InverseVariance | 87% | 74% ± 21% | 55% ± 15%
ADC-wavelet-LLL_glszm_SizeZoneNonUniformity | 83% | 44% ± 19% | 83% ± 13%
Trivariate linear discriminant model | - | 80% ± 18% | 76% ± 13%

4. Discussion

Two works on mpMRI radiomics in prostate cancer recently showed that single-center models’ performance drops when models are applied to other center data [22,23]. This may be due to the too-small size of the training sample and to differences among centers in MR scanners, acquisition parameters, histological analysis, and segmentation. Protocol standardization, data, and model sharing will hopefully improve models’ reproducibility in the near future. Meanwhile, a step forward toward model generalizability assessment can be made as follows: (1) trying to test radiomics models proposed by others on an external dataset; (2) properly detailing radiomics works so that other groups can assess them on their own data. Unfortunately, to date, few groups in the literature have tested radiomic models developed by other centers. This is often due to the partial lack of details in radiomic papers, which prevents model re-implementation.

In this work, we first tried to apply the reproducible and standard-compliant literature models on mpMRI radiomics for PI-RADS 3 csPCa identification to our 80-patient dataset. We reviewed and summarized parameters, methodological choices, and results to simplify further validation by other groups. Then, we proposed a fully detailed and easily implementable new model for assessment on an external dataset. The following two works in the literature satisfied our inclusion criteria: one from Hectors et al. [16], who proposed a T2-based model, and one from Jin et al. [19], who proposed a model relying on T2, DWI, age, and PSA. In total, 9 of the 20 radiomic features identified by Hectors et al. were significantly correlated with biopsy in our dataset (p-value ranging from 0.01 to 0.05), and 1 of the 4 radiomic features identified by Jin et al. was very significantly related to biopsy in our dataset (p-value 0.005). These features are all computed on T2 images, where peripheral and transitional zone lesion contours are easier to delineate and, therefore, likely less user-dependent.

In developing our radiomic model, we made methodological choices that differed from those of the two groups. Mainly, we normalized intensities through a peripheral zone normalization ROI (as suggested by Bonekamp et al., 2018 [28]) and applied a fixed bin number (FBN) quantization with 32 bins on original images, 16 bins on ADC wavelet sub-bands, and 8 bins on T2 wavelet sub-bands. Other normalization/quantization schemes provided worse results and are not shown. The two radiomic features we found most robustly related to biopsy are both computed on the LLL wavelet sub-band, i.e., on a spatially smoothed version of T2 and ADC intensities inside the lesions.

The first feature, T2-wavelet-LLL_glcm_InverseVariance, reflects texture regularity. It was lower than 0.47 on csPCa, thus indicating that clinically significant tumors are characterized by a larger texture irregularity in the low-frequency sub-band of T2 images. This feature alone has good sensitivity (74%) but low specificity (55%). The second feature, ADC-wavelet-LLL_glszm_SizeZoneNonUniformity, measures the variability in the volumes of lesion zones (groups of connected voxels with similar intensity). It was larger than 17 on csPCa, thus indicating their more extensive zone size heterogeneity. This feature alone has a reasonable specificity (83%) but a low sensitivity (44%). It is worth noting that the normalized version of this feature, computed on DWI, correlated with biopsy in the work of Jin. The lack of significance of Jin’s DWI feature on our dataset may be due to differences in the DWI acquisition protocol: we used a b-value of 2000 s/mm², while Jin used a b-value of 1500 s/mm². We can therefore observe the following: (1) our result is coherent with Jin’s, suggesting that a greater zone size heterogeneity at the microstructural level is more likely related to malignancy; (2) the computation of this feature on ADC maps may be more robust and repeatable, being less dependent on the DWI acquisition b-value.

Similarly to Jin et al., we then developed a multivariate model relying on radiomic and clinical features. However, we did not select clinical and radiomic features independently but included both within a single pool, to which a feature selection strategy was applied. Among clinical features, we selected PSA density, as it alone might help tumor discrimination [29] (sensitivity of 66%, specificity of 71% in our dataset). A tri-variate model built on PSA density and the two readily available T2 and ADC radiomic features appears to discriminate csPCa with good confidence (sensitivity of 80%, specificity of 76%). We provided all the methodological details and are available to share trained models and optimized thresholds for external validation.

The model we are proposing is based on bi-parametric MRI (T2 and ADC sequences only, which are routinely acquired). In an effort to build the simplest possible model able to identify csPCa, it does not require time-consuming sequences such as DCE images, which also expose patients to possible contrast medium-related side effects. This has the added benefit of reducing segmentation times and guaranteeing better standardization, thereby reducing the possible impact of different DCE acquisition protocols and segmentation methods. Additionally, our model is based only on the following three features: the PSA density (routinely obtained in the standard workup of these patients) and the two radiomic features obtained from two standard mpMRI sequences. We are aware that, in machine learning, wrapped and embedded feature selection methods that optimally combine a broader number of features within model optimization, or even deep learning models, as seen in Bertelli et al. [30], are often used. However, we preferred to follow a different approach to try to obtain an explainable radiomic model, i.e., one able to explain lesion characteristics related to malignancy.

We think this approach has relevant implications in driving the adoption of radiomics in the clinical management of PI-RADS 3 lesions. From a radiological standpoint, it might complement the radiologist’s evaluation, increasing the overall diagnostic accuracy; on the other hand, from a clinical perspective, it might allow us to rule out unnecessary biopsies, avoiding the risk of procedure-related possible complications in selected patients.

This study has several limitations, mainly the small number of patients and the lack of an independent testing dataset. However, we tried to provide results as robustly as possible by performing both feature selection and model assessment in multiple subsamples. Furthermore, it is necessary to recognize that the pathological standard of reference should be the radical prostatectomy sample, not the histological result of the biopsy. In fact, our recent experience has shown that the combination of target and systematic biopsies fails to detect about 15% of the foci of csPCa at definitive pathology. However, only 4% turned out to be the index lesion (data not yet published). It could also be argued that the method used at our Institution for targeted biopsies is a rigid fusion, while there are elastic fusion technologies that can allow more accurate targeting. However, in a recent systematic review and meta-analysis, no significant difference in the detection of csPCa was identified when comparing rigid and elastic registration for MRI-TRUS fusion-guided biopsy [31].

Moreover, the high PI-RADS 3 lesion prevalence in the peripheral zone (82.5%) did not allow an investigation of any possible zone-related difference, and the overall sample size did not allow us to evaluate sector-related differences. We think there is a need for future research to assess not only regional differences between the transitional and peripheral zones but also variability related to different sectors. Clinically insignificant prostate cancers (Gleason score 3 + 3) were included in the negative group, as cases in this category were few. However, this choice does not imply a particularly significant clinical limitation, since MRI follow-up, with or without biopsy mapping, is usually performed for PI-RADS 3 lesions. Lastly, since the dataset was quite imbalanced (32.5% of tumors), we decided to optimize thresholds instead of using SMOTE, since optimal thresholds provided better results.

5. Conclusions

Standard-compliant works with robust and detailed methodologies achieve comparable radiomic feature sets. Therefore, efforts to facilitate external validation of csPCa identification models with independent datasets are needed to help radiomics gain an effective role in the clinical workflow. In contrast, complex imaging models and protocols do not seem to be required. We showed indeed that PSA density, combined with two radiomic features computed on two routinely performed sequences (T2 and ADC), may potentially discriminate clinically significant prostate cancers (Gleason score ≥ 3 + 4).

Author Contributions

Conceptualization, A.C., E.D.B. and P.A.B.; methodology, A.C., E.D.B. and P.A.B.; software, E.D.B.; validation, E.D.B.; formal analysis, E.D.B.; investigation, A.C., E.D.B. and P.N.F.; resources, A.C., P.A.B. and P.N.F.; data curation, A.C., E.D.B., P.A.B., P.N.F., D.N., R.S. and G.P.; writing—original draft preparation, A.C., E.D.B., P.A.B. and D.N.; writing—review and editing, A.C., E.D.B., P.A.B., P.N.F., D.I., G.P., M.R., M.O., L.F.D.P. and S.S.; visualization, A.C., E.D.B., P.A.B., P.N.F., D.N., R.S., D.I., M.R. and M.O.; supervision, L.F.D.P. and S.S.; project administration, S.S. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the local Institutional Review Board of ASST Papa Giovanni XXIII Bergamo (protocol code ‘mpMR e Sorveglianza Attiva’, 7 June 2018).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This research received no external funding.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Articles from Journal of Clinical Medicine are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)
