Deep learning model for predicting extraprostatic extension of prostate cancer based on H&E-stained biopsy digital images

Peiling Yu; Nan Liu; Dongli Feng; Yi Jing; MoYu Xia; Hongyu Guo; Yinan Yuan; Weilin Guo; Yini Alatan; Siru Nie; Jinming Zhao; Hongbo Su; Yuan Miao; Qi Miao

doi:10.1080/07853890.2025.2547094

. 2025 Aug 21;57(1):2547094. doi: 10.1080/07853890.2025.2547094

Deep learning model for predicting extraprostatic extension of prostate cancer based on H&E-stained biopsy digital images

Peiling Yu ^a,^#, Nan Liu ^b,^#, Dongli Feng ^c, Yi Jing ^d, MoYu Xia ^d, Hongyu Guo ^a, Yinan Yuan ^a, Weilin Guo ^a, Yini Alatan ^a, Siru Nie ^a, Jinming Zhao ^a, Hongbo Su ^a, Yuan Miao ^b,^*,^✉, Qi Miao ^c,^*,^✉

^aDepartment of Pathology, the First Affiliated Hospital of China Medical University, Shenyang, China

^bDepartment of Pathology, the First Affiliated Hospital and College of Basic Medical Sciences of China Medical University, Shenyang, China

^cDepartment of Radiology, The First Hospital of China Medical University, Shenyang, Liaoning, China

^dNeusoft Research of Intelligent Healthcare Technology, Co. Ltd, Shenyang, China

^{^#}

Peiling Yu and Nan Liu contributed equally to this work as co-first authors.

^{^*}

Qi Miao and Yuan Miao contributed equally to this work as co-correspondence authors.

Supplemental data for this article can be accessed online at https://doi.org/10.1080/07853890.2025.2547094.

^✉

CONTACT Yuan Miao cmumiaoyuan@163.com Department of Pathology, the First Affiliated Hospital and College of Basic Medical Sciences of China Medical University, Shenyang, China

^✉

Qi Miao qmiao@cmu.edu.cn Department of Radiology, The First Hospital of China Medical University, Shenyang, Liaoning, China;

Roles

Peiling Yu: Conceptualization, Data curation, Visualization, Writing – original draft

Nan Liu: Data curation, Funding acquisition, Project administration, Supervision, Writing – review & editing

Dongli Feng: Data curation, Methodology, Resources, Visualization

Yi Jing: Data curation, Formal analysis, Visualization, Writing – original draft

MoYu Xia: Data curation, Formal analysis, Resources, Software, Writing – original draft

Hongyu Guo: Data curation, Formal analysis, Resources, Visualization

Yinan Yuan: Data curation, Formal analysis, Methodology

Weilin Guo: Data curation, Formal analysis, Investigation

Yini Alatan: Data curation, Formal analysis, Visualization

Siru Nie: Data curation, Formal analysis

Jinming Zhao: Data curation, Formal analysis, Software

Hongbo Su: Data curation, Formal analysis, Validation

Yuan Miao: Conceptualization, Methodology, Writing – review & editing

Qi Miao: Conceptualization, Formal analysis, Investigation, Methodology, Validation, Writing – review & editing

PMCID: PMC12372477 PMID: 40841351

Abstract

Background

To develop and validate a deep learning pipeline using prostate biopsy H&E slides to predict extraprostatic extension (EPE) in prostate cancer (PCa) patients.

Methods

A total of 2592 preoperative biopsy H&E slides from 260 consecutive PCa patients who underwent radical prostatectomy were collected from January 2019 to October 2023. Whole-slide images (WSIs) were digitized, tumor regions were annotated, and 224 × 224 pixel patches were extracted. The dataset was randomly divided into training and testing sets at the patient level in a ratio of 8:2. A tumor classification model and an EPE prediction model based on multiple instance learning were trained. Subsequently, we conducted an interpretability analysis of the EPE model and further carried out a correlation analysis between the predicted probabilities of the EPE and the biochemical recurrence (BCR) of the patients.

Results

The ConvNeXt model achieved the best performance in tumor classification, with an accuracy of 0.965 and an area under the curve (AUC) of 0.981 on the test set. For EPE prediction, the model achieved AUCs of 0.943 and 0.886 in the training and test sets, respectively. Key features identified by the model, such as nuclear characteristics, were significantly associated with EPE. Predicted EPE probabilities were strongly correlated with BCR (p = 0.01).

Conclusions

The AI pathology model accurately predicts postoperative EPE via biopsy slides, achieving an AUC of 0.886 on the test set, offering a novel, feasible PCa preoperative risk stratification method to aid personalized treatment.

Keywords: Prostate cancer, deep learning, extraprostatic extension, digital pathology, multiple instance learning

KEY MESSAGES

This study developed a deep learning pipeline based on biopsy H&E slides to predict extraprostatic extension (EPE) with high accuracy.
The model’s predictions were based on key pathological features like nuclear characteristics and were strongly correlated with biochemical recurrence.
This AI-based method offers a feasible tool for preoperative risk stratification and personalized treatment planning in prostate cancer.

1. Introduction

Prostate cancer (PCa) is the most prevalent malignancies in the male urinary system globally, constituting approximately 15% of all male cancers [1]. It ranks second in incidence and the fifth in cancer-related mortality among man [1]. Extraprostatic extension (EPE)—defined as PCa spread through prostate capsule into periprostatic tissues—is observed in 15–40% of radical prostatectomy specimens and portends higher risks of positive surgical margins, biochemical recurrence, and the need for adjuvant therapies [2–4]. Precise identification of EPE is crucial for surgical planning (e.g. neurovascular bundle preservation vs. wide excision) and therapy decision-making [5,6].

Current EPE prediction primarily relies on numerous nomograms integrating PSA levels, digital rectal examination, magnetic resonance imaging (MRI), and biopsy Gleason scores, such as the Partin tables [7], CAPRA score [8], and Memorial Sloan Kettering Cancer Center (MSKCC) nomogram [9]. Among these tools, the Gleason score derived from prostate biopsies plays a crucial role in predicting EPE [10]. However, there exist substantial subjective discrepancies in reporting the Gleason score, with an inter-observer agreement rate merely ranging from 60% to 80% [11]. Moreover, the hematoxylin–eosin (H&E)-stained sections of prostate biopsies contain abundant morphological information that cannot be fully conveyed through pathological reports alone. Consequently, directly extracting morphological features associated with EPE from H&E-stained prostate biopsy sections is of particular importance.

In recent years, artificial intelligence (AI) has emerged as a cutting-edge technology in the medical field, enabling accurate individualized risk assessment. AI shows great potential in predicting EPE. Currently, most of the research on EPE prediction using AI focuses on magnetic resonance imaging (MRI) data. For example, the 3D Swin-Transformer deep learning model developed by Zhao et al. [12], Unfold AI used by Grunden et al. [13], and the machine-learning model developed by Van den Berg et al. [14] have an area under the curve (AUC) ranging from 0.81 to 0.91. There are also some studies that construct AI models based on biopsy information. Sighinolfi et al. [15] externally validated the PRECE model, and Mohamad et al. [16] trained an extreme gradient boosting tree model based on machine learning. Their AUCs are 0.8 and 0.749, respectively.

However, the importance of pathological section images of prostate biopsies has been overlooked in these studies. Although imaging techniques can offer macroscopic information of the entire prostate, not all patients undergo routine MRI examinations prior to radical surgery. Prostate biopsy, serving as the gold standard for diagnosing PCa, is an indispensable step in clinical risk stratification [17]. Currently, clinical methods for evaluating EPE also attach great importance to pathological results. For example, the biopsy Gleason score is one of the key variables in the Partin tables for predicting postoperative staging [7]. Pathological sections not only provide the Gleason score but also contain rich tumor-related details, such as tissue morphology, microenvironment, and tissue distribution patterns. These details can potentially be used to further extract microscopic features associated with EPE through AI technology [18,19].

To the best of our knowledge, no prior study has employed deep-learning techniques to predict EPE based on whole-slide images (WSIs) of prostate biopsies. In this study, we initially developed a classification model to differentiate whole-slide images into normal and tumor tissues. More importantly, we trained a deep-learning model that uses only H&E-stained prostate biopsy sections as input to predict EPE in prostate cancer patients. Our study explored the relationship between the unique microscopic information in pathological sections and EPE, as well as the correlation between the EPE probability predicted by the model and the biochemical recurrence (BCR) of patients.

2. Clinical materials and methods

2.1. Data preparation

This study encompassed patients who underwent systematic biopsy (SBx) or systematic biopsy combined with targeted biopsy (SBx + TBx) at the First Affiliated Hospital of China Medical University between January 2019 and October 2023, were diagnosed with PCa, and subsequently underwent radical prostatectomy. The hematoxylin–eosin-stained sections corresponding to the included patients were scanned at a high resolution of 20× using a NanoZoomer S210 digital slide scanner and evaluated by senior pathologists. The pathological outcomes following radical prostatectomy were regarded as the gold standard for determining the presence of EPE. BCR was defined as two consecutive postoperative prostate-specific antigen (PSA) test values of ≥0.1 ng/ml. For the remaining patients, telephone follow-ups were conducted to confirm the occurrence of BCR.

Inclusion criteria: (1) Diagnosis of PCa via prostate biopsy; (2) Treatment with radical prostatectomy at the First Affiliated Hospital of China Medical University; (3) Availability of complete biopsy H&E-stained section data; (4) Access to complete electronic medical records for review.

Exclusion criteria: (1) History of other urinary system malignancies; (2) Prior receipt of radiotherapy, chemotherapy, or other systemic treatments for PCa before surgery; (3) Incomplete postoperative PSA test records; (4) Incomplete follow-up data or inability to contact the patient for follow-up. (5) The quality of pathological images is poor. The flowchart of patient inclusion in this study is presented in Figure 1.

Figure 1. — Flowchart of inclusion and exclusion: from January 2019 to October 2023, a total of 672 patients were diagnosed with prostate adenocarcinoma by puncture. Among them, 302 patients did not undergo radical surgery, and the remaining 370 patients received radical prostatectomy. Among these 370 patients, 40 received endocrine therapy before surgery, 13 had a history of other urinary system malignancies, and 65 had no PSA results or could not be followed up. Finally, 260 patients were included in the study, with 42 patients used for the tumor classification model and 218 patients used for the EPE prediction model.

This study was approved by the Ethics Committee of the First Affiliated Hospital of China Medical University (Approval No.: AF-SOP-07-1.2-01, KELUNSHEN [2024] No. 1091). Given that the study adopted a retrospective analysis approach using archived pathological specimens and clinical data, and did not involve the exposure of personally identifiable information, the Ethics Committee approved the waiver of informed consent for all participants after review. This exemption complies with the Helsinki Declaration and relevant domestic ethical review regulations, ensuring that the study is conducted within a compliant framework.

2.2. Model construction

This model construction included three main stages: data pre-processing, tumor tissue classification, and EPE prediction using a Multiple Instance Learning (MIL) framework. Figure 2 illustrates the overall technical roadmap of our approach.

Figure 2. — Computational strategy: starting from the patient’s original pathological section, tissue segmentation, gridding, and patch extraction are performed in sequence to obtain patches. Then, the tumor patches are classified, and after patch selection, the selected tumor patches are used for feature extraction. The extracted features are input into the UNI model. The patch features output by the model are classified through modules such as the transformer block to determine the presence or absence of extraprostatic extension.

2.2.1. Digital pathology images pre-processing

To capture fine local details from high-resolution WSI, we converted the color space of all WSI slices from RGB to HSV. We then apply Otsu’s method on the Hue channel to segment cell–tissue regions from the background, followed by morphological operations (noise reduction and hole filling) to refine the segmentation. Using the Histolab algorithm, we segment the tissue regions into 224 × 224 pixel patches. These patches, along with their coordinates, are saved for further analysis, ensuring that each patch contains relevant cell–tissue information.

2.2.2. Tumor tissue segmentation and classification

2.2.2.1. Data selection and labeling

From 414 WSIs of 42 patients, A pathologists (Y.P. with 3-year experience in PCa pathology diagnosis) used QuPath to label PCa areas, generating binary masks. All annotations were reviewed by a senior urology pathologist (M.Y., with 20 years of experience in PCa pathology). Based on these masks, patches with more than 30% cancerous area are classified as cancerous, while patches with no cancer cells are considered non-cancerous. Patches that do not meet either criterion are discarded.

The patches are split at the patient level into training, validation, and test sets with a roughly 7:1:2 ratio (training set: 30 patients, 143657 background patches, 24,255 foreground patches; validation set:4 patients, 15219 background patches, 2034 foreground patches; test set: 8 patients, 39367 background patches, 4718 foreground patches). To address the imbalance (fewer cancerous patches), we augment the cancerous patches with rotations and flips.

2.2.2.2. Model construction

Five pre-trained networks (ResNet, ResNeXt, ViT, Swin-T, and ConvNeXt) are used to build a binary classification model to distinguish cancerous from non-cancerous patches. The network with the best classification result on the test set will be used.

2.2.3. MIL-based EPE prediction

2.2.3.1. Feature extraction

We use a universal self-supervised model (UNI), specifically designed for pathological images, to extract high-dimensional features from the selected cancerous patches. The UNI model, trained on large-scale pathological data, captures subtle morphological details.

2.2.3.2. MIL model design

A novel MIL framework based on the transformer self-attention mechanism (TMIL) is developed. This model includes two transformer encoder layers that integrate multi-head attention, feed-forward neural networks, and layer normalization to capture relationships among patches. The extracted features are then processed by a multilayer perceptron (MLP) and a Softmax layer to output a probability distribution for EPE.

2.2.3.3. Training strategy

The MIL model is trained on patches from 2168 WSIs of 218 patients, with data divided into training and validation sets at an 8:2 ratio. The training uses the Adam optimizer with a dynamic learning rate decay strategy, running for 200 epochs. We employ the SmoothTop1SVM loss function, which focuses on ranking the top predicted probabilities, ensuring sensitivity in global lesion detection.

2.3. Model evaluation

When validating the performance of the tumor tissue classification model, we used accuracy (ACC) and recall (REC) as the primary evaluation metrics. To comprehensively assess the accuracy of the model in predicting the presence of EPE, we utilized the receiver operating characteristic (ROC) curve as the evaluation tool and calculated several key metrics, including the AUC, ACC, precision (PRE), REC, and F1 score.

In addition, we randomly selected patches from those with a high probability of EPEpredicted by the model (the top 20%) and those with a low probability of EPE (the bottom 20%). Then, we used ImageJ software to extract quantifiable morphological vectors and outline the cell nuclei for comparative analysis.

2.4. Statistical analysis

Statistical analysis of the data was carried out using the SPSS 27.0 software. Continuous variables were expressed as the median (interquartile range, IQR), and categorical variables were expressed as counts (percentages). When the quantitative data of two independent samples were normally distributed and had homogeneous variances, the t test was used for comparison. The chi-square test was used for comparing the qualitative data of two independent samples, and the Mann–Whitney U test was used for comparing the ranked data of two independent samples. The Kaplan–Meier survival curve was applied to evaluate the predictive value of the EPE probability values predicted by AI for biochemical BCR during the postoperative follow-up period. Statistical significance was defined as p < 0.05.

3. Results

3.1. Clinical and pathological characteristics of the study cohort

Finally, 260 consecutive PCa patients, including 2592 biopsy H&E sections, were included in this study. The median age was 67 yr (IQR 63–72). Among them, 86 patients (33.08%) were found to have EPE. There were significant differences between the EPE group and the non-EPE group in terms of PSA level, positive biopsy proportion, Biopsy Gleason Grade (Biopsy GG), Prostatectomy Gleason Grade (Prostatectomy GG), Pathological T staging, Pathological N staging, and Nerve Invasion (p < 0.05). However, there were no statistically significant differences in age and Clinical T staging between the two groups (p > 0.05) (Table 1). Among the 260 patients, 25 patients underwent systematic biopsy, and 235 patients underwent targeted fusion biopsy. A total of 61 cases in the EPE group and 106 cases in the non-EPE group underwent multiparametric magnetic resonance imaging (mpMRI) examination. The specific distribution of PI-RADS scores is shown in Supplementary Table 2. Of these, 42 cases were included in the Tumor Tissue Classification Model study. These cases were divided into a training set (30 patients, 143,657 background patches, 24,255 foreground patches), a validation set (4 patients, 15,219 background patches, 2,034 foreground patches), and a test set (8 patients, 39,367 background patches, 4,718 foreground patches) at the patient level in a 7:1:2 ratio. The remaining 218 cases were included in the EPE prediction model study. These cases were divided into training set (n = 174) and test set (n = 44) at the patient level in an 8:2 ratio.

Table 1.

Clinicopathological characteristics of the study cohort.

	EPE	NON-EPE	p value
Sample size (%)^a	86 (33.08%)	174 (66.92%)
Age (years old)^b	67 (63–72)	67 (63–72)	0.467
PSA (ng/ml)^b	22.46 (11.74–43.09)	12.17 (7.85–20.63)	<0.001
Clinical T staging (%)^a			0.657
T1c	28 (32.56%)	66 (37.93%)
T2a	7 (8.14%)	18 (10.34%)
T2b	12 (13.95%)	18 (10.34%)
T2c	39 (45.35%)	72 (41.38%)
Biopsy pathway (%)^a			0.018
Systematic biopsy	83 (96.51%)	22 (12.64%)
Systematic + targeted biopsy	3 (3.49%)	152 (87.36%)
Number of biopsy cores^a	858	1734
Number of positive biopsy cores^a	526	642
Proportion of positive biopsies^b	0.65 (0.4–0.87)	0.3 (0.1–0.6)	<0.001
Biopsy GG (%)^a			<0.001
1	7 (8.14%)	53 (30.46%)
2	6 (6.98%)	36 (20.69%)
3	16 (18.60%)	35 (20.11%)
4	25 (29.07%)	36 (20.69%)
5	32 (37.21%)	14 (8.05%)
Prostatectomy GG (%)^a			<0.001
1	3 (3.49%)	38 (21.84%)
2	16 (18.60%)	55 (31.61%)
3	12 (13.95%)	37 (21.26%)
4	11 (12.79%)	28 (16.09%)
5	44 (51.16%)	16 (9.20%)
Pathological T staging (%)^a			<0.001
PT2	0	174 (100%)
PT3a	56 (65.12%)	0
PT3b	21 (24.42%)	0
PT4	9 (10.47%)	0
Pathological N staging (%)^a			<0.001
N0	67 (77.91%)	163 (93.68%)
N1	19 (22.09%)	11 (6.32%)
Nerve invasion n (%)^a	55 (63.95%)	53 (30.46%)	<0.001

Open in a new tab

^aCounts (percentages).

^bMedian with interquartile range.

3.2. Tumor tissue classification model

The five pre-trained models’ performances on the training set, validation set, and test set are presented in Table 2. On the validation set, ConvNeXt outperformed the others with an accuracy of 96.8%, AUC of 0.979, and recall of 85.3%, and similar performance was observed on the test set (96.5% accuracy, AUC of 0.981, and recall of 84.6%). Supplementary Figure 1 shows strong spatial overlap between pathologist annotations and model predictions, while Figure 3a–d display the ROC, calibration curves, and confusion matrices that confirm the model’s high sensitivity and robustness.

Table 2.

The classification performance of different models.

	Train			Valid			Test
	Acc (%)	AUC	Recall	Acc (%)	AUC	Recall	Acc (%)	AUC	Recall
ResNet	99.2	0.999	0.985	95.3	0.972	0.796	95.2	0.971	0.812
ResNeXt	99.9	1.0	0.999	95.7	0.97	0.727	96.1	0.977	0.767
ViT	98.4	0.998	0.97	95.5	0.972	0.804	95.3	0.976	0.834
Swin-T	98.2	0.997	0.965	96.2	0.971	0.802	95.5	0.982	0.824
ConvNeXt	99.9	1.0	1.0	96.8	0.979	0.853	96.5	0.981	0.846

Open in a new tab

3.3. EPE prediction model

The model achieved an AUC of 0.943 in training set and 0.886 in test set (Figure 3e). In the test set, the model reached an accuracy of 86.4%, precision of 75%, recall of 85.7%, and an F1 score of 0.80. Specifically, 26 of 30 non-EPE patients were correctly predicted as non-EPE, and 12 of 14 EPE patients were correctly identified. The AUCs of MSKCC nomogram, Roach formula, and Partin Table were 0.780, 0.745, and 0.461, respectively in the test set (Figure 3f). The prediction performances of these three models are shown in Table 3. Figure 1g and h present the confusion matrices for the training set and the test set, respectively.

Table 3.

The performance of different models in predicting EPE.

	AUC	ACC	PRE	REC	F1
EPE Prob	0.886	0.864	0.75	0.857	0.8
Partin tables	0.461	0.477	0.378	1	0.549
MSKCC	0.780	0.795	0.632	0.857	0.727
Roach formula	0.745	0.727	0.542	0.929	0.684

Open in a new tab

3.4. The prediction model captures the inherent morphological features from the patches

Key morphological features—including the area, shape, density, and arrangement of the cell nuclei obtained through software-based nuclear segmentation – showed obvious differences between the patches with and without EPE. Patches predicted to have high EPE probability showed tumor nuclei that were larger, more irregular, and denser, often arranged as cords or strands. In contrast, patches with low EPE probability exhibited well-formed, regularly arranged glandular tubules (Figure 4).

Figure 4. — Cell features of the EPE and NON-EPE groups: the tumor cell nuclei were automatically segmented by ImageJ. The tumor cell nuclei in the EPE group tiles exhibited larger volumes, more distorted shapes, and higher densities.

3.5. Correlation between predicted EPE probability and biochemical recurrence

Kaplan–Meier survival analysis (Figure 5) demonstrated that patients classified as EPE positive by our AI model experienced significantly different BCR rates following radical prostatectomy compared to those classified as non-EPE (p = 0.01). This finding confirms that our model’s predictions are not only accurate but also prognostically meaningful. We further explored the relationships between other clinical factors and BCR. Among them, the prediction results of the EPE by the artificial intelligence model, the Gleason Grade Group after radical prostatectomy, and the highest Gleason Grade Group in the biopsy were significantly correlated with biochemical recurrence (see Supplementary Table 1 for details).

4. Discussion

The EPE of PCa is associated with positive surgical margins, biochemical recurrence, and need for adjuvant therapy. However, definitive confirmation of EPE requires postoperative pathological examination. Accurate prediction of EPE during initial diagnosis is crucial in clinical practice [20]. In this study, we developed and validated a deep-learning pipeline for tumor tissue classification and prediction of EPE using digital pathology of prostate biopsy. The model in this study intentionally prioritizes validating the independent diagnostic value of pathological images, aiming to lay a foundation for multimodal integration and first establish the baseline utility of pathological morphology. To our knowledge, this is the first model to use preoperative biopsy images as the sole input for predicting pathological EPE status. Our results demonstrate that biopsy images contain critical microscopic features that correlate with EPE, offering valuable prognostic insights.

Our model achieved an AUC of 0.886 in the test set, outperforming the MSKCC nomogram (AUC = 0.780), Roach formula (AUC = 0.745), and Partin Table (AUC = 0.461), the results are basically consistent with previous studies [21]. The lower performance of Partin Table may be attributed to differences in cohort composition, with relatively few patients with clinical stage higher than T2c in our cohort. Previous AI-based studies on predicting EPE of PCa primarily relied on clinical and MRI data. Kwong et al. developed an AI-based prostate-specific peripheral extension risk assessment tool (SEPERA) by collecting twelve pieces of clinical information, including patient age, PSA, and GS score, to accurately predict lateral-specific EPE in patients with localized PCa, achieving an AUC of 0.77 [22]. Gu et al. applied an effective deep-learning network, NAFNet, to predict adverse pathological events based on MRI images and the AUC could reach 0.799 [23]. Some studies have combined radiomics and clinical data, such as Guerra et al.’s multimodal model, which achieved an AUC of 0.88 [24]. However, these studies overlooked the potential of biopsy pathological images in predicting EPE. Biopsy pathology-based prediction offers several advantages over MRI-based prediction for tumor staging. Unlike MRI, prostate biopsy is the gold standard for diagnosis and an indispensable step in the diagnosis and treatment of PCa. MRI provides a macroscopic view of PCa lesions, biopsy pathology provides direct microscopic evidence of tumor cells, allowing for more precise grading and staging through histological analysis. This detailed tissue information can reveal features—such as cellular atypia, mitotic rate, and specific biomarkers—that MRI may miss. AI-driven analysis of biopsy images enables the identification of subtle features that may not be apparent to pathologists, making it a powerful tool for predicting EPE. Biopsy pathology based DL models could capture microinvasive foci below MRI resolution (typically >3 mm) while avoiding susceptibility artifacts from rectal coils. Additionally, for patients with contraindications to MRI—such as those with pacemakers, metal implants, severe claustrophobia,—biopsy pathology is an effective alternative for EPE prediction.

At the same time, we also conducted a visual analysis of the model. We found that in pathological images with a high probability of EPE, the cell nuclei were larger, the nuclear atypia was more obvious, the density was higher, and the cancer cells did not form glands but were scattered and infiltrated in the stroma. Furthermore, Kaplan–Meier analysis confirmed that our AI-predicted EPE status significantly stratified patients by BCR risk after radical prostatectomy.

Despite these promising results, our study has limitations. First, this is a single-center study, necessitating external validation across multiple institutions before clinical implementation. Differences in tissue processing, staining protocols, and digital scanning methods may affect image quality and consistency. This variability can influence the deep learning model’s performance and may not be easily replicated across institutions. There are still some misjudgments in the current model. False positives may lead to unnecessary neurovascular bundle resection, increasing the risk of erectile dysfunction or urinary incontinence. False negatives may lead to undertreatment, increasing the positive rate of surgical margins. The model should be used as an auxiliary tool for comprehensive clinical evaluation rather than an independent decision-making basis. At present, multi-center validation is a priority to reduce risks. Additionally, the current model relies solely on pathological images without integrating MRI or clinical data. Studies have shown that PCa is an endocrine-responsive tumor, and its occurrence and development are affected by various hormone levels [25]. MpMRI combined with biopsy can improve the diagnostic accuracy of PCa [26], and the integration of MRI and PSA-derived indicators can achieve effective patient risk stratification, thereby providing a valuable decision-making approach [27]. In the future, efforts will focus on developing multimodal models that integrate multi-source data to enhance predictive accuracy.

In conclusion, our deep-learning model enables accurate prediction of EPE using biopsy pathology, supporting early patient stratification and personalized treatment planning. In this study we transformed diagnostic biopsy slides into predictive tools for treatment planning. This study highlights the prognostic value of microscopic pathological features and paves the way for AI-driven advancements in PCa management. In the future, integrating it into a digital pathology platform as a real-time decision support system to further stratify patients’ EPE risks through model probability values may further standardize the quantification of EPE risks in clinical practice and provide more accurate references for surgical planning. Future research will also explore multimodal AI models and novel algorithms to further improve clinical decision-making.

Supplementary Material

Supplemental Material

IANN_A_2547094_SM1623.docx^{(11.3KB, docx)}

supplementary FIG1.png

IANN_A_2547094_SM9513.png^{(475.3KB, png)}

Supplementary Table 2 .docx

IANN_A_2547094_SM9512.docx^{(11.6KB, docx)}

Supplementary Table 1.docx

IANN_A_2547094_SM9511.docx^{(15.6KB, docx)}

Acknowledgements

Not applicable.

Funding Statement

This study was supported by the Natural Science Foundation of Liaoning Province (No. 2024-BS-047).

Availability of data and materials

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Disclosure statement

The authors declare no competing interests.Authors Yi Jing and Moyu Xia are affiliated with Neusoft Medical Systems Co., Ltd. The authors declare that no other financial or non-financial competing interests affect the objectivity of this study. All research processes were independent of the funders, and data analysis and conclusion derivation were not interfered with by the enterprise.

References

1.Cao W, Chen H-D, Yu Y-W, et al. Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin Med J (Engl). 2021;134(7):783–791. doi: 10.1097/CM9.0000000000001474. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Billis A, Watanabe IC, Costa MV, et al. Iatrogenic and non-iatrogenic positive margins: incidence, site, factors involved, and time to PSA progression following radical prostatectomy. Int Urol Nephrol. 2008;40(1):105–111. doi: 10.1007/s11255-007-9198-6. [DOI] [PubMed] [Google Scholar]
3.Mottet N, van den Bergh RCN, Briers E, et al. EAU-EANM-ESTRO-ESUR-SIOG Guidelines on Prostate Cancer—2020 Update. Part 1: screening, diagnosis, and local treatment with curative intent. Eur Urol. 2021;79(2):243–262. doi: 10.1016/j.eururo.2020.09.042. [DOI] [PubMed] [Google Scholar]
4.Holmberg L, Garmo H, Andersson S-O, et al. Radical prostatectomy or watchful waiting in early prostate cancer. N Engl J Med. 2024;391(14):1362–1364. doi: 10.1056/NEJMc2406108. [DOI] [PubMed] [Google Scholar]
5.Diamand R, Roche J-B, Lacetera V, et al. Predicting contralateral extraprostatic extension in unilateral high-risk prostate cancer: a multicentric external validation study. World J Urol. 2024;42(1):247. doi: 10.1007/s00345-024-04966-7. [DOI] [PubMed] [Google Scholar]
6.Nguyen LN, Head L, Witiuk K, et al. The risks and benefits of cavernous neurovascular bundle sparing during radical prostatectomy: a systematic review and meta-analysis. J Urol. 2017;198(4):760–769. doi: 10.1016/j.juro.2017.02.3344. [DOI] [PubMed] [Google Scholar]
7.Partin AW, Yoo J, Carter HB, et al. The use of prostate specific antigen, clinical stage and Gleason score to predict pathological stage in men with localized prostate cancer. J Urol. 1993;150(1):110–114. doi: 10.1016/s0022-5347(17)35410-1. [DOI] [PubMed] [Google Scholar]
8.Cooperberg MR, Pasta DJ, Elkin EP, et al. The University of California, San Francisco Cancer of the Prostate Risk Assessment score: a straightforward and reliable preoperative predictor of disease recurrence after radical prostatectomy. J Urol. 2005;173(6):1938–1942. doi: 10.1097/01.ju.0000158155.33890.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Ohori M, Kattan MW, Koh H, et al. Predicting the presence and side of extracapsular extension: a nomogram for staging prostate cancer. J Urol. 2004;171(5):1844–1849; discussion 1849. doi: 10.1097/01.ju.0000121693.05077.3d. [DOI] [PubMed] [Google Scholar]
10.Anderson BB, Oberlin DT, Razmaria AA, et al. Extraprostatic extension is extremely rare for contemporary gleason score 6 prostate cancer. Eur Urol. 2017;72(3):455–460. doi: 10.1016/j.eururo.2016.11.028. [DOI] [PubMed] [Google Scholar]
11.Melia J, Moseley R, Ball RY, et al. A UK‐based investigation of inter‐ and intra‐observer reproducibility of Gleason grading of prostatic biopsies. Histopathology. 2006;48(6):644–654. doi: 10.1111/j.1365-2559.2006.02393.x. [DOI] [PubMed] [Google Scholar]
12.Zhao L, Bao J, Wang X, et al. Detecting adverse pathology of prostate cancer with a deep learning approach based on a 3D swin‐transformer model and biparametric mri: a multicenter retrospective study. J Magn Reson Imaging. 2024;59(6):2101–2112. doi: 10.1002/jmri.28963. [DOI] [PubMed] [Google Scholar]
13.Priester A, Mota SM, Grunden KP, et al. Extracapsular extension risk assessment using an artificial intelligence prostate cancer mapping algorithm. BJUI Compass. 2024;5(10):986–997. doi: 10.1002/bco2.421. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.van den Berg I, Soeterik TFW, van der Hoeven EJRJ, et al. The development and external validation of artificial intelligence-driven MRI-based models to improve prediction of lesion-specific extraprostatic extension in patients with prostate cancer. Cancers (Basel). 2023;15(22):5452. doi: 10.3390/cancers15225452. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Sighinolfi MC, Assumma S, Cassani A, et al. Pre-operative prediction of extracapsular extension of prostate cancer: first external validation of the PRECE model on an independent dataset. Int Urol Nephrol. 2023;55(1):93–97. doi: 10.1007/s11255-022-03365-4. [DOI] [PubMed] [Google Scholar]
16.Semwal H, Ladbury C, Sabbagh A, et al. Machine learning and explainable artificial intelligence to predict pathologic stage in men with localized prostate cancer. Prostate. 2025;85(1):3–12. doi: 10.1002/pros.24793. [DOI] [PubMed] [Google Scholar]
17.Wei JT, Barocas D, Carlsson S, et al. Early detection of prostate cancer: AUA/SUO Guideline Part II: considerations for a prostate biopsy. J Urol. 2023;210(1):54–63. doi: 10.1097/JU.0000000000003492. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Paris A and Ilias M.. Image analysis in digital pathology utilizing machine learning and deep neural networks. J Pers Med. 2022;12(9):1444. doi: 10.3390/jpm12091444. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Deng S, Zhang X, Yan W, et al. Deep learning in digital pathology image analysis: a survey. Front Med. 2020;14(4):470–487. doi: 10.1007/s11684-020-0782-9. [DOI] [PubMed] [Google Scholar]
20.Cosma G, Acampora G, Brown D, et al. Prediction of pathological stage in patients with prostate cancer: a neuro-fuzzy model. PLoS One. 2016;11(6):e0155856. doi: 10.1371/journal.pone.0155856. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Blas L, Shiota M, Nagakawa S, et al. Validation of user-friendly models predicting extracapsular extension in prostate cancer patients. Asian J Urol. 2023;10(1):81–88. doi: 10.1016/j.ajur.2022.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Kwong JCC, Khondker A, Meng E, et al. Development, multi-institutional external validation, and algorithmic audit of an artificial intelligence-based Side-specific Extra-Prostatic Extension Risk Assessment tool (SEPERA) for patients undergoing radical prostatectomy: a retrospective cohort study. Lancet Digit Health. 2023;5(7):e435–e445. doi: 10.1016/S2589-7500(23)00067-5. [DOI] [PubMed] [Google Scholar]
23.Gu W-J, Liu Z, Yang Y-J, et al. A deep learning model, NAFNet, predicts adverse pathology and recurrence in prostate cancer using MRIs. NPJ Precis Oncol. 2023;7(1):134. doi: 10.1038/s41698-023-00481-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Guerra A, Orton MR, Wang H, et al. Clinical application of machine learning models in patients with prostate cancer before prostatectomy. Cancer Imaging. 2024;24(1):24. doi: 10.1186/s40644-024-00666-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Miro C, Di Giovanni A, Murolo M, et al. Thyroid hormone and androgen signals mutually interplay and enhance inflammation and tumorigenic activation of tumor microenvironment in prostate cancer. Cancer Lett. 2022;532:215581. doi: 10.1016/j.canlet.2022.215581. [DOI] [PubMed] [Google Scholar]
26.Rapisarda S, Bada M, Crocetto F, et al. The role of multiparametric resonance and biopsy in prostate cancer detection: comparison with definitive histological report after laparoscopic/robotic radical prostatectomy. Abdom Radiol (NY). 2020;45(12):4178–4184. doi: 10.1007/s00261-020-02798-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Jin P, Wang X, Ding Z, et al. Development and validation of risk-stratified biopsy decision pathways incorporating MRI and PSA-derived indicators. Ann Med. 2025;57(1):2446695. doi: 10.1080/07853890.2024.2446695. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

IANN_A_2547094_SM1623.docx^{(11.3KB, docx)}

supplementary FIG1.png

IANN_A_2547094_SM9513.png^{(475.3KB, png)}

Supplementary Table 2 .docx

IANN_A_2547094_SM9512.docx^{(11.6KB, docx)}

Supplementary Table 1.docx

IANN_A_2547094_SM9511.docx^{(15.6KB, docx)}

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

[CIT0001] 1.Cao W, Chen H-D, Yu Y-W, et al. Changing profiles of cancer burden worldwide and in China: a secondary analysis of the global cancer statistics 2020. Chin Med J (Engl). 2021;134(7):783–791. doi: 10.1097/CM9.0000000000001474. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0002] 2.Billis A, Watanabe IC, Costa MV, et al. Iatrogenic and non-iatrogenic positive margins: incidence, site, factors involved, and time to PSA progression following radical prostatectomy. Int Urol Nephrol. 2008;40(1):105–111. doi: 10.1007/s11255-007-9198-6. [DOI] [PubMed] [Google Scholar]

[CIT0003] 3.Mottet N, van den Bergh RCN, Briers E, et al. EAU-EANM-ESTRO-ESUR-SIOG Guidelines on Prostate Cancer—2020 Update. Part 1: screening, diagnosis, and local treatment with curative intent. Eur Urol. 2021;79(2):243–262. doi: 10.1016/j.eururo.2020.09.042. [DOI] [PubMed] [Google Scholar]

[CIT0004] 4.Holmberg L, Garmo H, Andersson S-O, et al. Radical prostatectomy or watchful waiting in early prostate cancer. N Engl J Med. 2024;391(14):1362–1364. doi: 10.1056/NEJMc2406108. [DOI] [PubMed] [Google Scholar]

[CIT0005] 5.Diamand R, Roche J-B, Lacetera V, et al. Predicting contralateral extraprostatic extension in unilateral high-risk prostate cancer: a multicentric external validation study. World J Urol. 2024;42(1):247. doi: 10.1007/s00345-024-04966-7. [DOI] [PubMed] [Google Scholar]

[CIT0006] 6.Nguyen LN, Head L, Witiuk K, et al. The risks and benefits of cavernous neurovascular bundle sparing during radical prostatectomy: a systematic review and meta-analysis. J Urol. 2017;198(4):760–769. doi: 10.1016/j.juro.2017.02.3344. [DOI] [PubMed] [Google Scholar]

[CIT0007] 7.Partin AW, Yoo J, Carter HB, et al. The use of prostate specific antigen, clinical stage and Gleason score to predict pathological stage in men with localized prostate cancer. J Urol. 1993;150(1):110–114. doi: 10.1016/s0022-5347(17)35410-1. [DOI] [PubMed] [Google Scholar]

[CIT0008] 8.Cooperberg MR, Pasta DJ, Elkin EP, et al. The University of California, San Francisco Cancer of the Prostate Risk Assessment score: a straightforward and reliable preoperative predictor of disease recurrence after radical prostatectomy. J Urol. 2005;173(6):1938–1942. doi: 10.1097/01.ju.0000158155.33890.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0009] 9.Ohori M, Kattan MW, Koh H, et al. Predicting the presence and side of extracapsular extension: a nomogram for staging prostate cancer. J Urol. 2004;171(5):1844–1849; discussion 1849. doi: 10.1097/01.ju.0000121693.05077.3d. [DOI] [PubMed] [Google Scholar]

[CIT0010] 10.Anderson BB, Oberlin DT, Razmaria AA, et al. Extraprostatic extension is extremely rare for contemporary gleason score 6 prostate cancer. Eur Urol. 2017;72(3):455–460. doi: 10.1016/j.eururo.2016.11.028. [DOI] [PubMed] [Google Scholar]

[CIT0011] 11.Melia J, Moseley R, Ball RY, et al. A UK‐based investigation of inter‐ and intra‐observer reproducibility of Gleason grading of prostatic biopsies. Histopathology. 2006;48(6):644–654. doi: 10.1111/j.1365-2559.2006.02393.x. [DOI] [PubMed] [Google Scholar]

[CIT0012] 12.Zhao L, Bao J, Wang X, et al. Detecting adverse pathology of prostate cancer with a deep learning approach based on a 3D swin‐transformer model and biparametric mri: a multicenter retrospective study. J Magn Reson Imaging. 2024;59(6):2101–2112. doi: 10.1002/jmri.28963. [DOI] [PubMed] [Google Scholar]

[CIT0013] 13.Priester A, Mota SM, Grunden KP, et al. Extracapsular extension risk assessment using an artificial intelligence prostate cancer mapping algorithm. BJUI Compass. 2024;5(10):986–997. doi: 10.1002/bco2.421. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0014] 14.van den Berg I, Soeterik TFW, van der Hoeven EJRJ, et al. The development and external validation of artificial intelligence-driven MRI-based models to improve prediction of lesion-specific extraprostatic extension in patients with prostate cancer. Cancers (Basel). 2023;15(22):5452. doi: 10.3390/cancers15225452. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0015] 15.Sighinolfi MC, Assumma S, Cassani A, et al. Pre-operative prediction of extracapsular extension of prostate cancer: first external validation of the PRECE model on an independent dataset. Int Urol Nephrol. 2023;55(1):93–97. doi: 10.1007/s11255-022-03365-4. [DOI] [PubMed] [Google Scholar]

[CIT0016] 16.Semwal H, Ladbury C, Sabbagh A, et al. Machine learning and explainable artificial intelligence to predict pathologic stage in men with localized prostate cancer. Prostate. 2025;85(1):3–12. doi: 10.1002/pros.24793. [DOI] [PubMed] [Google Scholar]

[CIT0017] 17.Wei JT, Barocas D, Carlsson S, et al. Early detection of prostate cancer: AUA/SUO Guideline Part II: considerations for a prostate biopsy. J Urol. 2023;210(1):54–63. doi: 10.1097/JU.0000000000003492. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0018] 18.Paris A and Ilias M.. Image analysis in digital pathology utilizing machine learning and deep neural networks. J Pers Med. 2022;12(9):1444. doi: 10.3390/jpm12091444. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0019] 19.Deng S, Zhang X, Yan W, et al. Deep learning in digital pathology image analysis: a survey. Front Med. 2020;14(4):470–487. doi: 10.1007/s11684-020-0782-9. [DOI] [PubMed] [Google Scholar]

[CIT0020] 20.Cosma G, Acampora G, Brown D, et al. Prediction of pathological stage in patients with prostate cancer: a neuro-fuzzy model. PLoS One. 2016;11(6):e0155856. doi: 10.1371/journal.pone.0155856. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0021] 21.Blas L, Shiota M, Nagakawa S, et al. Validation of user-friendly models predicting extracapsular extension in prostate cancer patients. Asian J Urol. 2023;10(1):81–88. doi: 10.1016/j.ajur.2022.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0022] 22.Kwong JCC, Khondker A, Meng E, et al. Development, multi-institutional external validation, and algorithmic audit of an artificial intelligence-based Side-specific Extra-Prostatic Extension Risk Assessment tool (SEPERA) for patients undergoing radical prostatectomy: a retrospective cohort study. Lancet Digit Health. 2023;5(7):e435–e445. doi: 10.1016/S2589-7500(23)00067-5. [DOI] [PubMed] [Google Scholar]

[CIT0023] 23.Gu W-J, Liu Z, Yang Y-J, et al. A deep learning model, NAFNet, predicts adverse pathology and recurrence in prostate cancer using MRIs. NPJ Precis Oncol. 2023;7(1):134. doi: 10.1038/s41698-023-00481-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0024] 24.Guerra A, Orton MR, Wang H, et al. Clinical application of machine learning models in patients with prostate cancer before prostatectomy. Cancer Imaging. 2024;24(1):24. doi: 10.1186/s40644-024-00666-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0025] 25.Miro C, Di Giovanni A, Murolo M, et al. Thyroid hormone and androgen signals mutually interplay and enhance inflammation and tumorigenic activation of tumor microenvironment in prostate cancer. Cancer Lett. 2022;532:215581. doi: 10.1016/j.canlet.2022.215581. [DOI] [PubMed] [Google Scholar]

[CIT0026] 26.Rapisarda S, Bada M, Crocetto F, et al. The role of multiparametric resonance and biopsy in prostate cancer detection: comparison with definitive histological report after laparoscopic/robotic radical prostatectomy. Abdom Radiol (NY). 2020;45(12):4178–4184. doi: 10.1007/s00261-020-02798-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0027] 27.Jin P, Wang X, Ding Z, et al. Development and validation of risk-stratified biopsy decision pathways incorporating MRI and PSA-derived indicators. Ann Med. 2025;57(1):2446695. doi: 10.1080/07853890.2024.2446695. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Deep learning model for predicting extraprostatic extension of prostate cancer based on H&E-stained biopsy digital images

Peiling Yu

Nan Liu

Dongli Feng

Yi Jing

MoYu Xia

Hongyu Guo

Yinan Yuan

Weilin Guo

Yini Alatan

Siru Nie

Jinming Zhao

Hongbo Su

Yuan Miao

Qi Miao

Roles

Abstract

Background

Methods

Results

Conclusions

KEY MESSAGES

1. Introduction

2. Clinical materials and methods

2.1. Data preparation

Figure 1.

2.2. Model construction

Figure 2.

2.2.1. Digital pathology images pre-processing

2.2.2. Tumor tissue segmentation and classification

2.2.2.1. Data selection and labeling

2.2.2.2. Model construction

2.2.3. MIL-based EPE prediction

2.2.3.1. Feature extraction

2.2.3.2. MIL model design

2.2.3.3. Training strategy

2.3. Model evaluation

2.4. Statistical analysis

3. Results

3.1. Clinical and pathological characteristics of the study cohort

Table 1.

3.2. Tumor tissue classification model

Table 2.

Figure 3.

3.3. EPE prediction model

Table 3.

3.4. The prediction model captures the inherent morphological features from the patches

Figure 4.

3.5. Correlation between predicted EPE probability and biochemical recurrence

Figure 5.

4. Discussion

Supplementary Material

Acknowledgements

Funding Statement

Availability of data and materials

Disclosure statement

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases