Abstract
Background
Identifying intestinal node-negative gastric adenocarcinoma (INGA) patients with high risk of recurrence could help perceive benefit of adjuvant therapy for INGA patients following surgical resection. This study evaluated whether the computer-extracted image features of nuclear shapes, texture, orientation, and tumor architecture on digital images of hematoxylin and eosin stained tissue, could help to predict recurrence in INGA patients.
Methods
A tissue microarrays cohort of 160 retrospectively INGA cases were digitally scanned, and randomly selected as training cohort (D1 = 60), validation cohort (D2 = 100 and D3 = 100, D2 and D3 are different tumor TMA spots from the same patient), accompanied with immunohistochemistry data cohort (D3′ = 100, a duplicate cohort of D3) and negative controls data cohort (D5 = 100, normal adjacent tissues). After nuclear segmentation by watershed-based method, 189 local nuclear features were captured on each TMA core and the top 5 features were selected by Wilcoxon rank sum test within D1. A morphometric-based image classifier (NGAHIC) was composed across the discriminative features and predicted the recurrence in INGA on D2. The intra-tumor heterogeneity was assessed on D3. Manual nuclear atypia grading was conducted on D1 and D2 by two pathologists. The expression of HER2 and Ki67 were detected by immunohistochemistry on D3 and D3′, respectively. The association between manual grading and INGA outcome was analysis.
Results
Independent validation results showed the NGAHIC achieved an AUC of 0.76 for recurrence prediction. NGAHIC-positive patients had poorer overall survival (P = 0.017) by univariate survival analysis. Multivariate survival analysis, controlling for T-stage, histology stage, invasion depth, demonstrated NGAHIC-positive was a reproducible prognostic factor for poorer disease-specific survival (HR = 17.24, 95% CI 3.93–75.60, P < 0.001). In contrast, human grading was only prognostic for one reader on D2. Moreover, significant correlations were observed between NGAHIC-positive patients and positivity of HER2 and Ki67 labeling index.
Conclusions
The NGAHIC could provide precision oncology, personalized cancer management.
Electronic supplementary material
The online version of this article (10.1186/s12967-019-1839-x) contains supplementary material, which is available to authorized users.
Keywords: Digital H&E images, Predication, Negative-node gastric adenocarcinoma, Quantitative histomorphometric
Background
The gastric cancer (GC) is a common gastrointestinal tumor with high mortality and the second leading cause of death in China [1]. For these cases, 80% are gastric adenocarcinoma (GA). Nodal metastases are a well-known prognostic factor after radical treatment of gastric cancer. Because intestinal node-negative gastric adenocarcinoma (INGA) patients have a good prognosis, it remains controversial whether adjuvant chemotherapy is needed for INGA patients after surgery. There is controversy surrounding the benefit of adjuvant therapy for patients with resected stage IB, especially pT2N0. The National Comprehensive Cancer Network (NCCN) guidelines suggest for some high-risk cases (pT2N0 with a high histologic grade or the presence of lymph vascular or perineural invasion), the decision to pursue adjuvant therapy should be personalized. Observation is appropriate for patients with resected T2N0 stage IB GC as long as they have undergone adequate lymph node dissection. But guidelines from the European Society for Medical oncology (ESMO) suggest adjuvant therapy for all patients with resected stage IB disease, including those with pT2N0 tumors. NCCN and ESMO recommended adjuvant therapy for all patients with pT3-4N0. Observation without adjuvant therapy for patients with T1N0 who have uninvolved section margins. For patients with early-stage gastric cancer, the risk of lymph node metastasis is low (2–28 percent for T1, 20 percent for T2). While chemotherapy has many side effects, such as loss of hair, myelosuppression, damage to liver and kidney, and additional extensive medical burden, it is critical to distinguish recurrence in INGA patients perceived benefit of adjuvant chemoradiation after an R0 resection.
Although there were many predicative factors for recurrence and could be useful to stratify node-negative gastric cancer patients for adjuvant treatment and tailored follow-up, including lymphatic embolization and perineural infiltration, p53 and Ki67 and greater lymph node retrieval. But these assays are tissue destructive and expensive. Pathologic staging (e.g. nuclear atypia grade) is critical in directing optimal treatment for INGA patients. Unfortunately, pathological analysis is a tedious process and suffered from intra/inter-reader variability.
Computer-aided image analysis has great potential to conquer inconsistencies in virtue of subjective interpretation [2–5]. Quantitative histomorphometry (QH) used computer-aided image analysis to decrypt sub-visual differences of tumor morphology in digital pathology images. With the advancement of computer-aided image technologies, a number of quantitative morphology information was extracted and has been approved to be prognostic, such as tumor nests fractal dimension and stromal morphologic features [6, 7].
Recent reports have shown that nuclear architecture was useful in cancer grading and predicting patient outcomes [2, 5, 6, 8–13]. Genetic instability could be displayed by diversify of nuclear shape and texture, playing important role in metastasis and proliferation that result in cancer recurrence potentially [11, 12]. Quantitative histomorphometry of nucleus architecture was utilized to predict disease recurrence in early-stage non-small cell lung cancer [14], biochemical recurrence [11] for prostate cancers [11, 15] and so on [6, 10, 12].
In this study, we constructed a quantitative histomorphometry based model to distinguish INGA patients who suffered from recurrence versus those did not using a cohort of 60 TMA images. We then validate the model in another validation cohort of 100 TMA images. The work flow of this study was illustrated in Fig. 1. With the help of image classifier, we are looking forward to identify patients who have high risk of recurrence and who might thus receive measurable benefit adjuvant chemotherapy after curative resection.
Materials and methods
Study population
With the approval of the ethical board of Renmin Hospital of Wuhan University (Wuhan, Hubei, China) and abided with the Declaration of Helsinki, 1782 candidate patients with GA were collected from Department of Pathology, Renmin Hospital of Wuhan University archives from 2000 to 2012 retrospectively and consecutively. Two pathologists (Z.Z and N.Z) were then assigned to identify the patients with node-negative gastric adenocarcinoma. Subsequently, the corresponding donor blocks and their H&E stained slides were obtained followed by selecting preferred blocks and marking areas of interest for core punching. Three to six cores of 2 mm in diameter was punched from central tumor/leading edge of the donor block ROI using a thin-wall stainless steel tube and transferred onto the recipient blocks to construct the arrays. The digital H&E images were captured under the Aperio Scan Scope XT Slide Scanner at 40× magnification with a resolution of 0.25 μm per pixel. One of the most representative tumor cores were selected by Z.Z and Z.N for use. The enrolled spots were randomly divided into training cohort (D1 = 60 [37.5%]) and test cohort (D2 = 100 [62.5%]), respectively.
Additionally, three 2 mm punches were removed from different tumor region of D2 and assessed by pathologists for prefer spots after digitally scanning under slide scanner, as well as two 2 mm punches from normal adjunct tissue as negative control. Finally, a third dataset, named D3 (n = 100), was also recruited in this study, containing tissue cores corresponding to the same patients in D2 but extracted from different regions of the tumor. D3 was employed to validate the image classifier to cope with tumor heterogeneity and immunohistochemical staining for HER2. A fourth dataset, named D3′, was duplicate cohort of D3, using for immunohistochemical assessment of Ki67. A fifth dataset, named D5 (n = 100), was obtained from the adjacent normal tissues of D2 as negative controls. D1 contains D+ (recurrence) patients and D− (non-recurrence) patients. In contrast, for D2, D3 and D4, only the digital H&E images were used to predict the recurrence status without any pre-knowledge of the patients.
In this paper, the INGA samples dataset was selected based off the after mentioned inclusion/exclusion criteria. Inclusion criteria comprised of: (1) pathological diagnosis of gastric adenocarcinoma; (2) according to the standard of gastric cancer TNM staging by Union for International Cancer Control: gastric adenocarcinoma TMN stage was limited to pT1–T4N0M0 before postoperative pathology; (3) After radical surgery, more than 16 lymph nodes were selected for biopsy; (4) recurrence or metastasis was confirmed by CT or MRI images, endoscopy, and pathology; (5) complete clinic pathological data (through telephone, data collected by clinicians or database of electronic medical records and all patients were followed up for 5 years). Accompanying exclusion criteria contained: (1) patients with other primary malignant tumors; (2) patients who underwent chemotherapy or immunotherapy before surgery; (3) palliative surgery; (4) other diseases or accidental deaths; (5) residual gastric cancer; (6) lost visits or incomplete data; (7) death within 1 month after operation. The period of no recurrence and metastasis was limited from the time after the surgery to the diagnosis of recurrence or the time of final follow-up. The period of recurrence or metastasis was limited from the time of diagnosis, recurrence or metastasis to death or final follow-up time; the overall survival time was from surgery to death or the last follow-up time. The deadline for follow-up was on December 31st, 2017.
Image analysis
Nuclear segmentation
Each individual nucleus, including cancer tissue and tumor stroma, was detected and segmented by Watershed-based nuclear segmentation method [16] at 40× magnification (0.25 μm/pixel resolution) automatically after color deconvolution for isolating the stain. This resulted in a RGB color digital image for each TMA spot, the similar one was shown in Fig. 2.
Feature extraction
Three different types of quantitative histomorphometric cellular features, covering local architectural features, shape/texture features, local Cell Orientation Graphs features, were extracted from local cluster regions [6, 8] in this study. Of these features, 20 nuclear architectural features, with 12 features from Voronoi Diagram and 8 features from Delaunay Graph, were extracted aimed to capture the nuclear architectural disorder in local regions indicating more aggressive tumor behaviors. 39 nuclear orientation disorder features related to nuclear orientation disorder were derived from Cell Orientation Graphs [10]. 100 shape features and 30 nuclear texture features, comprised of invariant moment, Fourier descriptors of boundary, area, length/width ratios, smoothness, perimeter ration, and area ration and so on, were extracted as described in ref. [17] aiming to capture the disorder linked to shape/texture disorder in local cluster regions. Finally, a total of 189 features were yielded for each TMA core (Table 1) in our study. A comprehensive list of all quantitative features was shown in Additional file 1: Table S1.
Table 1.
Feature type | No. | Description |
---|---|---|
Nuclear shape | 100 | Area ratio, distance ratio, SD of distance, distance ratio, perimeter ratio, variance of distance, fractal dimension, smoothness, invariant moment 1–7, Fourier descriptor 1–10: min/max, mean, SD, median |
Nuclear texture | 30 | Contrast, energy, entropy, inverse variance, invariant moment: mean, SD from each channel |
Nuclear orientation map | 39 | Contrast energy, contrast inverse moment, contrast average, contrast variance, contrast entropy, intensity average, intensity variance, intensity entropy, entropy, energy, correlation, information measure 1, information measure 2: mean, SD, range |
VD | 12 | Perimeter, chord, area: SD, min/max, disorder, average |
DT | 8 | Side length, triangle area: min/max, mean, standard deviation, median, disorder |
In total | 189 |
SD standard deviation, VD Voronoi diagram, DT Delaunay triangulation
Feature selection
Three different feature selection schemes, including the minimum redundancy maximum relevance (MRMR), Wilcoxon rank sum test (WRST), and random forest (RF), were employed to identify the most outstanding pathological morphometric features in the training groups. A randomized threefold cross-validation scheme along with 100 iterations, combing with each feature selection method, was used to guarantee the robustness of the preferred the features. These approaches resulted in a total of three accompanying feature bins for disguising the recurrences and non-recurrence cases within the training group, respectively. In this paper we limited the number of candidate features to 5 aimed to avoid curse of dimensionality or over fitting challenges using box and whisker plots. Each feature bin consisted of 5 most distinguished features accordingly, and these features were considered as a prerequisite for inclusion in subsequent classifier construction procedure.
Classifier construction
Four different machine learning schemes, comprising of analysis of linear discriminant (ALD), analysis of quadratic discriminant (AQD), machine of support vector (MSV), and random forest (RF), in conjunction with the 3 feature bins, resulting in 12 different machine learning combination modes, were applied to construct the candidate histopathological image classifiers for INGA patients (denoted as NGAHICs) within training group. This full join between machine learning scheme and feature bin gave rise to 12 different optional NGAHICs successfully. Subsequently, the optimal histological image classifier (NGAHIC) was settled down across each candidate classifier productivity (AUC, area under the receiver operating characteristic curve) within the training group. Of note, each binary histological image classifier yield a predictive probability value for distinguishing recurrence or non-recurrence case. In this study, the recurrence level was set at 0.5 empirically, namely the NGAHIC predictive probability value (> 0.5) on each core was considered to be recurrence case. All the binary classifiers predictive outcomes were compared with the ground truth label for classifier performance evaluation.
Nuclear atypia grade by human readers
Since the histomorphometric features we investigate related to nuclear atypia, key predictors of prognosis in various cancers [2, 5, 6, 8–15, 18–20]. However, only the modest agreements were achieved among experienced readers [13]. We designed the comparative strategies, aiming to illustrate the pathologist’s inter-reader variability in INGA recurrence prediction and compare the prognostic performance of image classifier against subjective manual nuclear atypia grading. The nuclear atypia grade estimation was conducted by two expert pathologists (Z.Z and N.Z) via visual evaluation of the H&E images on training set and test set. Both human readers were blinded to the ground truth information of the 160 cases. Each pathologist was asked to assign a score between 0 and 2 for each digital image in-house. The nuclear atypia grade was defined as 0, 1 referring to low nuclear atypia grade and 2 referring to high nuclear atypia grade, based off the previous work by Nakashima [13].
Immunohistochemistry
All immunohistochemical stains were performed by the following order: deparaffinizing → antigen retrieval → blocking → primary antibody → washing → blocking → biotinylated secondary antibody → washing → blocking → washing → mounting and observation as our previous work [21]. Each core in D3′ and D3 was immunostained using monoclonal antibody against Ki67 (clone MIB-1; 1:200; Dako, Glostrup, Denmark); polyclonal antibody against HER2/neu (1:200; Dako, Glostrup, Denmark), respectively. The HER2 staining results were assigned a score as IHC 0, IHC 1+, IHC 2+ with FISH (fluorescence in situ hybridization) negative referring negative, IHC 2+ with FISH positive and IHC 3+ indicating positive according to the criteria recommended by Min [22]. Ki67 labeling index was scored by the percentage of nuclei-stained cells observing in 5 randomly selected areas of the section with 400× high-power fields; 200 tumor cells were counted in each area. The Ki67 labeling index was determined as positive (≥ 14% reactive tumor cells) and negative (< 14% reactive tumor cells) as described by Goldhirsch [23].
Survival analysis
Two-sided Fisher’s test was used to analyze the correlations among the data of machine classifier, clinical documents, and pathologic features. Five years survival probabilities were evaluated by the Kaplan–Meier method and log-rank tests were performed to detect recurrent differences. Cox regression model was employed to detect the independently predicted survival of probabilities of variable factors after checking clinical data and pathologic features. The average expression rates of HER2 and Ki67 between NGAHIC-positive and NGAHIC-negative were evaluated using Chi square test. All tests were repeated for three times, hazard ratios, associated 95% confidence intervals, and P values were reported, with the significance level set at 0.05.
Results
Baseline characteristics of the study population
One hundred and sixty patients were finally enrolled in the principal cohort. The clinical and pathological features were shown in Table 2. Of the 160 cases of INGA, most of the patients (122/160 [76.3%]) were married and the media age was 62 years. About 60% (95/160) were men, with 41.7% (n = 25) vs. 60% (n = 60) in D1 and D2, respectively. 89 patients (43.3%) were in T1/T2, whereas 71 (44.4%) had advanced disease (T3/T4). 99 of the 160 patients differentiated well vs. 61 cases differentiated poorly. Of those well-differentiated category, 38 (38/60 [63.3%]) were in D1 and 61 (61/100 [61%]) in D2, separately. Approximate 28% (n = 45) NGA patients were treated with postoperative chemotherapy and more than 65% (n = 103) patients’ tumor size < 5 cm. At the endpoint of the follow-up, 36 patients (36/160 [22.5%]) suffered disease recurrence and 40 patients (40/160 [25%]) dead from related cause.
Table 2.
Variable | Sub variables | Total (%) | D1 (%) | D2/D3 (%) |
---|---|---|---|---|
Number of patients | 160 | 60 | 100 | |
Age | 62.1 ± 9.0 | 63.9 ± 5.7 | 59.1 ± 10.1 | |
Sex | Male | 95 (59.4) | 25 (41.7) | 60 (60.0) |
Female | 65 (40.6) | 35 (58.3) | 40 (40.0) | |
Patient status | Alive | 120 (75.0) | 43 (71.7) | 77 (77.0) |
Dead | 40 (25.0) | 17 (28.3) | 23 (23.0) | |
Recurrence | Yes | 36 (22.5) | 15 (25.0) | 21 (21.0) |
No | 124 (77.5) | 45 (75.0) | 79 (79.0) | |
Tumor diameter (cm) | < 5 | 103 (65.6) | 40 (66.7) | 63 (63.0) |
≥ 5 | 57 (35.6) | 20 (33.3) | 37 (37.0) | |
Invasion depth | Out | 92 (57.5) | 35 (58.3) | 57 (57.0) |
In | 68 (42.5) | 25 (41.7) | 43 (43.0) | |
T stage | T1/T2 | 89 (55.6) | 32 (53.3) | 57 (57.0) |
T3/T4 | 71 (44.4) | 28 (46.7) | 43 (43.0) | |
Histology grade | W/M | 99 (61.9) | 38 (63.3) | 61 (61.0) |
Poorly | 61 (38.1) | 22 (36.7) | 39 (39.0) | |
Postoperative-chemotherapy | Yes | 45 (28.1) | 17 (8.3) | 28 (28.0) |
No | 115 (71.9) | 43 (71.7) | 72 (72.0) | |
Manual nuclear atypia grading | Low | 102 (75.0) | 37 (61.7) | 65 (65.0) |
High | 58 (25.0) | 23 (38.3) | 35 (35.0) |
Out out of serosa, in invasion of serosa, W/M well and moderate-differentiated, poorly poorly-differentiated
Discriminative features
The top 5 discriminative morphologic features identified within the training cohort were range of intensity entropy, range of intensity energy, standard deviation (SD) of perimeter ration, SD of intensify average, and disorder of perimeter, respectively. Notable, the nuclear orientation related morphometric features, (range of intensity entropy, range of intensity energy and SD of intensify average), predominated the discriminated features (3 out of 5). Additional file 2: Table S2 referred a more comprehensive discriminated feature list.
Intuitively, the higher value of the feature was observed, indicating the more distorted of nuclear in local cluster region (Fig. 3). The original H&E digital images (Fig. 3a, e), with accompanying nuclear segmentation contour, nuclear architecture feature map and nuclear orientation feature maps in zoomed region, were shown in Fig. 3, representing recurrence and non-recurrence NGA groups from the first column to the fourth column. For recurrence cases, the nuclear appearance (Fig. 3b, f) showed a bigger variation comparing with the non-recurrences ones. In contrast, the checkerboard architecture feature map appeared to sparser (Fig. 3g) and the nuclear orientation tended to more uniform (Fig. 3h) in local cluster regions in TMA. Comparatively, the nuclear appearance, architecture and nuclear orientation seem to more regular and uniform for the negative controls (Fig. 3j–l).
Classifier performance
Twelve different machine learning combination modes, resulted from full join between 4 machine learning algorithms and 3 feature selection methods, were conduct on the training group and the corresponding performance results were summarized in Table 3. It is notable that the combination of SVM and WRST yielded the best AUC as well as accuracy, specificity and sensitivity (AUC = 0.87, accuracy = 0.89, specificity = 0.88 and sensitivity = 0.78) in distinguishing D+ and D− within training group. Therefore, this combination scheme (SVM combined with WRST) was settled down as the optimal histological image classifier for predicting NGA recurrence (NGAHIC). In the validation set, the NGAHIC (SVM combined with WRST) yielded an AUC = 0.76, accuracy = 0.72, with corresponding specificity = 0.74 and sensitivity = 0.68 (Table 3).
Table 3.
Dataset | Classifier | Feature selection | AUC | Accuracy | Specificity | Sensitivity |
---|---|---|---|---|---|---|
D1 | ALD | WRST | 0.77 ± 0.08 | 0.81 ± 0.08 | 0.82 ± 0.05 | 0.67 ± 0.03 |
MRMR | 0.67 ± 0.05 | 0.80 ± 0.04 | 0.84 ± 0.02 | 0.69 ± 0.09 | ||
RF | 0.76 ± 0.03 | 0.79 ± 0.06 | 0.85 ± 0.02 | 0.62 ± 0.05 | ||
AQD | WRST | 0.81 ± 0.02 | 0.74 ± 0.09 | 0.80 ± 0.03 | 0.71 ± 0.08 | |
MRMR | 0.79 ± 0.06 | 0.77 ± 0.01 | 0.82 ± 0.01 | 0.73 ± 0.01 | ||
RF | 0.72 ± 0.06 | 0.87 ± 0.02 | 0.86 ± 0.03 | 0.75 ± 0.03 | ||
RF | WRST | 0.83 ± 0.03 | 0.79 ± 0.06 | 0.82 ± 0.06 | 0.72 ± 0.04 | |
MRMR | 0.81 ± 0.06 | 0.76 ± 0.04 | 0.80 ± 0.08 | 0.70 ± 0.06 | ||
RF | 0.80 ± 0.05 | 0.73 ± 0.08 | 0.79 ± 0.06 | 0.69 ± 0.08 | ||
SVM | WRST | 0.87 ± 0.03 | 0.89 ± 0.02 | 0.88 ± 0.01 | 0.78 ± 0.08 | |
MRMR | 0.84 ± 0.02 | 0.88 ± 0.01 | 0.84 ± 0.02 | 0.72 ± 0.04 | ||
RF | 0.81 ± 0.05 | 0.84 ± 0.04 | 0.82 ± 0.02 | 0.73 ± 0.07 | ||
D2 | SVM | WRST | 0.76 | 0.72 | 0.74 | 0.68 |
Evaluation values in italic indicate the best machine learning combination
MRMR minimum redundancy maximum relevance, RF random forest, WRST Wilcoxon rank sum test, LDA analysis of linear discriminant, AQD analysis of quadratic discriminant, SVM machine of support vector, AUC area under receiver operating curve
Comparison of human-based nuclear atypia grade and image classifier for predicting recurrence in NGA
The Kaplan–Meier curves represented the survival results for both human readers (Z.Z and N.Z) on D1 and D2, respectively (Fig. 4a–d). For reader 1, the estimation of nuclear atypia grade was not significantly correlated with survival outcome for D1 (P = 0.31) nor D2 (P = 0.16). Whereas, for reader 2, there was a statistical significant negative correlation with human-based estimation of nuclear atypia grade and disease outcome for D2 (P = 0.004), but not for D1 (P = 0.24), conversely.
Correlation of immunohistochemical data and image classifier
HER2 staining was observed in the cytoplasmic membrane of the cancer cells in 16 cases (IHC 0: 36 cases, IHC 1+: 43 cases, IHC 2+ with FISH negative: 5 cases, IHC 2+ with FISH positive: 3 cases and IHC 3+: 13 cases), with the positive rate 81.3% vs. 3.6% in NGAHIC-positive and NGAHIC-negative, respectively. The Ki67 labeling index positive rate was much higher (75.0%) in NGAHIC-positive patients, whereas the Ki67 positive rate was relatively lower (2.4%) in NGAHIC-negative cases. There was statistically significant difference between NGAHIC-positive vs. NGAHIC-negative with positive expression of HER2 (P < 0.001) and Ki67 labeling index (P < 0.001), respectively. More details could be found in Additional file 3: Table S3.
Survival analysis
All the patients were followed up for 5 years (median survival time was about 38 months). Table 4 showed that the result calculated by univariate log-rank survival analysis for the clinical-pathologic features of the test group. It clearly depicted that the classifier negative patients, got better prognosis compared with classifier positive patients (P = 0.017). Figure 4 showed that prognostic prediction results for human readers, NGAHIC, T stage and histology grade by Kaplan–Meier survival curves. Table 5 demonstrated the results calculated by multivariate survival analysis for the major clinical pathologic features and image classifier. The data showed that there was a strong correlation between the result of NGAHIC and prognosis (HR = 17.24, 95% confidence interval = 3.93–75.60, P < 0.001), indicating NGAHIC was a negative predictive factor for INGA patients independently.
Table 4.
Variable | HR (95% CI) | P value |
---|---|---|
Age (< 60 vs. ≥ 60) | 0.66 (0.13–3.35) | 0.621 |
T-stage (T1/T2 vs. T3/T4) | 2.18 (1.10–4.31) | 0.024 |
Histology (W/M vs. poorly) | 3.66 (1.12–11.98) | 0.032 |
Chemotherapy (yes vs. no) | 3.89 (0.92–16.47) | 0.065 |
Invasion depth (out vs. in) | 1.87 (1.03–3.39) | 0.039 |
Tumor diameter (< 5 cm vs. ≥ 5 cm) | 2.15 (0.88–5.22) | 0.091 |
Manual nuclear atypia grading (low vs. high) | 3.08 (0.90–10.49) | 0.072 |
NGAHIC (positive vs. negative) | 4.14 (1.28–13.29) | 0.017 |
CI confidence interval, HR hazard ratio; W/M: well and moderate-differentiated, poorly: poorly differentiated, out: out of serosa, in: invasion of serosa
P value in italic is statistically significant, P < 0.05
Table 5.
Variable | P value | HR (95% CI) |
---|---|---|
T-stage (T1/T2 vs. T3/T4) | 0.34 | 1.42 (0.69–2.42) |
Histology stage (W/M vs. poorly) | 0.16 | 3.61 (0.60–21.64) |
Manual nuclear atypia grading (low vs. high) | 0.23 | 2.55 (0.55–11.75) |
Invasion depth (out vs. in) | 0.51 | 0.56 (0.09–3.14) |
Tumor diameter (< 5 cm vs. ≥ 5 cm) | 0.62 | 0.37 (0.09–18.83) |
NGAHIC (positive vs. negative) | 1.6 × 10 −4 | 17.24 (3.93–75.60) |
CI confidence interval, HR hazard ratio, NGAHIC image classifier, out out of serosa, in invasion of serosa, W/M well and moderate-differentiate
P value in italic is statistically significant, P < 0.05
Discussion
Although INGA patients have a better prognosis than those with lymph node involved, INGA patients still suffered from disease recurrence, mostly seeding through peritoneal or hematogenous spread [24]. Once INGA patients experience recurrence, their lifespan is significantly decreased. The recurrent patients need more close attention, such as more aggressive treatment and advance care planning. Hence there is a need to identify patients with high-risk recurrence following surgery. Nuclear atypia refers to changes in nuclear morphological profiles, including nuclear appearance, size, or arrangement, and has proved to be useful hallmark of cancer prognosis and choice of adjuvant therapies determination, in different types of cancers clinicopathologically [6, 11–14, 19, 25]. However, human-based observations often suffered from inter- and intra-reader variation.
Computer-aid for automatic estimation by image analysis technology has been proved to mitigate the subjectivity by pathologists [6, 11, 12, 14, 25]. In this work, we exploited a computer-aid histomorphometric classifier for accurate prediction of recurrence of INGA patients. An image based models was constructed to extract features of nuclear shape, texture and orientation features from H&E stained TMA images. This designation could capture nuclear morphology features quantitatively and precisely in local tumor region. The data revealed that the more the heterogeneous nuclear features were related to the high risk for disease recurrence and worse prognosis of INGA patients. The Kaplan–Meier analysis along utilizing the log-rank test showed a strong association between the predictions of the image classifier and recurrence for D2. In addition, the tumor heterogeneity was also investigated across comparing the image classifier prediction ability on D2 and D3 (tumor punches from different parts of the same tumor). The image classifier was tent to be prognostic in both D2 (P = 1.6 × 10−4) and D3 (P = 0.02), respectively (Fig. 4h, i).
We also inspected the prognostic performance difference between the image classifier and the human-based nuclear atypia grade for INGA. However, the Kaplan–Meier analysis results with log-rank test showed no significant statistical association between the reader 1 human-based nuclear atypia grade estimation and survival outcome for D1 or D2 (P > 0.05), meanwhile a strong negative statistical relationship was observed with reader 2 and patient outcome for D2 (P = 0.004). In converse, A Kaplan–Meier analysis along utilizing the log-rank test showed a strong association between the predictions of the image classifier and recurrence for D2 (P < 0.05). Likewise, a multivariate Cox proportional survival analysis reported a HR of 17.24 (95% confidence interval: 3.93–75.60, P = 1.6 × 10−4). This could be illustrated by human estimation variability. Indeed, patients with early stage gastric adenocarcinoma exhibit a broad survival range, and the nuclear atypia stage only limited the survival outcome prediction, resulting in discordance diagnosis. Additionally, the morphological features for evaluating nuclear atypia grade are generally difficult to spot by human inspection, but can be identified by computer easily and effectively, such as shape/texture, nuclear arrangement et al. Furthermore, the nuclear atypia grading criteria and the prognostic values of nuclear atypia grade in INGA have not been defined clearly. Hence, each pathologist might be focus on different nuclear morphological profiles, such as nuclear shape (enlarged or hyperchromatic nuclei), nuclear area, disordered nuclear polarity, cytoplasmic mucin reduction or other features subjectively. Finally, different pathologists may have variable expertise evaluating or natural individual variation in their perception of colors, shapes, and relative nuclear pleomorphism/polarity proportions. Comparatively, the local nuclear features, encompassing local nuclear orientation, nuclear shape, and nuclear arrangement, were measured and extracted objectively and thereby the image classifier revealed a strong association with tumor outcomes in early gastric adenocarcinoma. Moreover, a significant correlation between HER2 overexpression and NGAHIC-positive has been observed in INGA. 81.3% of NGAHIC-positive carcinomas were positive for HER2 staining vs. that of 3.6% NGAHIC-negative cancers (P < 0.001). Meanwhile, the IHC staining revealed that the NGAHIC-positive patients have higher positive rate (75.7%) vs. that of in NGAHIC-negative patients (2.4%) for Ki67, with P < 0.001.
In this study, the local nuclear orientation features, indicating the heterogeneity of nuclear polarity, were found to be persistently activated and overexpressed with poor tumor outcomes for distinguishing high-risk recurrence patients and low-risk recurrences patients. Namely, the higher expression of nuclear polarity in the cell cultures, the poorer disease outcomes. Intuitively, aggressive tumors tent to exhibit relatively lower degree of structure and organization as rapid disorganized cell regeneration, compared with less aggressive cancer. These findings present a similar pattern of results as previous works [13, 14, 25]. Additionally, we also inspected the relationship between nuclear shape/texture features, nuclear architecture features and disease prognosis. In the INGA group, the most discriminative features also covered the nuclear shape feature (SD of perimeter ration) and the nuclear architecture feature (disorder of perimeter). This showed that both local anatomical structures (shape and cell nuclei) and local architecture of the tumor cell nucleus (Delaunay triangulation of the nuclear, e.t.) are associated with survival outcomes. Nuclear atypia, referring to the alternations in nuclear structure, such as shape, architecture, orientation, tend to be captured by the computer-extracted features quantitatively and used for cancer grading. These findings consistent with the previous researches stance that the nuclear shape and architecture appears to predictive of patient survival [2, 5, 6, 8–11, 14, 18].
The main contributions of this paper were summarized as follow. (1) In this study, it was a preliminary finding that the relationship between more aggressive clustered tumor computer-extracted H&E image features of nuclear cluster graph and recurrence were strong closed in INGA patients. To our best known, it has never been reported in the literature. (2) The simple binary histomorphometric image classifier could stratify the patient into different prognosis groups. Especially, the NGAHIC positive patients, identify high risk of recurrence patients by the image classifier, has worse disease outcome. This preliminary finding resulted in possibility for image classifier as prognosis marker to be used in potential clinic routine. We imagine with the help of NGAHIC, pathologist could identify high recurrent risk patients through H&E stained digital images, including biopsy or surgical specimen. Providing the accurate pathologic diagnosis, clinicians could make an individualized treatment, such as postoperative close chemotherapy and radiation therapy and follow-up. Certainly, NGAHIC needs to be tested in multicenter study of large samples.
We acknowledge the limitations of this work. We only utilized the 2 mm tissue microarrays, containing a relative small portion of tumor characteristics as composed to whole tumors, for digital assessment. However, recent scholars proposed that the important cell morphological diversity present in one tumor tissue could be obtained in tissue microarrays [26–28]. Furthermore, we will expand our study to whole-slide histopathology images as they contain large amount information and multi-view of tumor. Additionally, the entire cohort in our study is relative small and some of clinical parameters, such as nodal extracapsular extension, margin, were not included in for multivariate analysis. Future efforts will be made to investigate our model on multi-institutional study with considerable samples of INGA.
Conclusion
In summary, we demonstrated that the histopathology image classifier based off local nuclear features, related to disorder of nuclear shape, arrangement, and orientation within the tumor cluster area, can predict recurrence and survival outcomes of INGA patients successfully. This capability is superior to the current practice utilization by nuclear atypia grade assessment by pathologists subjectively. Furthermore, our model could facilitate prognostic prediction based off the collected H&E stained slides routinely, and thereby contributing to the precision oncology, personalized cancer management and advance care planning. Future works will involve research on the response to treatment by analyzing the pathological images digitally.
Additional files
Authors’ contributions
Conceptualization: MY J, LY, PXH, WGD and CL; methodology: MYJ, LY and CL; software: LY and CL; validation: MYJ, LY, ZZ, NZ, XDJ, PXH, CL and WGD; formal analysis: LY, ZZ, XDJ, PXH, NZ and MYJ; investigation: MYJ, LY; resources: MYJ, LY, XDJ, PXH, ZZ and NZ; data curation: MYJ, NZ and ZZ; writing—original draft preparation: MYJ and LY; writing—review and editing: MYJ and LY; visualization: MYJ, LY and CL; supervision: CL and WGD; project administration: CL, WGD; funding acquisition: MYJ, CL and ZZ. All authors read and approved the final manuscript.
Acknowledgements
We thank ZZ and NZ for their expert technical assistance with the tissue microarray construction. We also thank CL and the team members in Case Western Reserve University for their wonderful assistance with digital analysis support.
Competing interests
The authors declare that they have no competing interests.
Availability of data and materials
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
Consent for publication
Not applicable.
Ethics approval and consent to participate
Our study was approved by the ethical committee of Renmin Hospital of Wuhan University and abided with the Declaration of Helsinki before using tissue samples for scientific researches purpose only. The written informed consent was waived by the ethical committee for this retrospective study.
Funding
Our experiment was funded by the Hubei Province Natural Science Foundation of China (No. 2018CFB136) and the National Natural Science Foundation of China (Grant No. 61401263, No. 61672333 and No. 81602535), Natural Science Basic Research Plan in Shaanxi Province of China (No. 215JQ6228), Innovation Seed Funding of Wuhan University (TFZZ2018020), Fundamental Research Funds for the Central Universities (GK201903096, GK201901010).
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Meng-Yao Ji, Email: amy_5840@whu.edu.cn.
Lei Yuan, Email: yuanlei009@nwsuaf.edu.cn.
Xiao-Da Jiang, Email: jxd@whu.edu.cn.
Zhi Zeng, Email: zz99@whu.edu.cn.
Na Zhan, Email: zn1980@whu.edu.cn.
Ping-Xiao Huang, Email: hpx@whu.edu.cn.
Cheng Lu, Email: chenglu@snnu.edu.cn.
Wei-Guo Dong, Email: dongweiguo_66@163.com.
References
- 1.Chen W. Cancer statistics: updated cancer burden in China. Chin J Cancer Res. 2015;27:1. doi: 10.3978/j.issn.1000-9604.2015.02.07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Corredor G, Wang X, Zhou Y, Lu C, Fu P, Syrigos KN, Rimm DL, Yang M, Romero E, Schalper KA, et al. Spatial architecture and arrangement of tumor-infiltrating lymphocytes for predicting likelihood of recurrence in early-stage non-small cell lung cancer. Clin Cancer Res. 2018;25(5):1526–1534. doi: 10.1158/1078-0432.CCR-18-2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S. Corrigendum: dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;546:686. doi: 10.1038/nature22985. [DOI] [PubMed] [Google Scholar]
- 4.Madabhushi A, Agner S, Basavanhally A, Doyle S, Lee G. Computer-aided prognosis: predicting patient and disease outcome via quantitative fusion of multi-scale, multi-modal data. Comput Med Imaging Graph. 2011;35:506–514. doi: 10.1016/j.compmedimag.2011.01.008. [DOI] [PubMed] [Google Scholar]
- 5.Yu KH, Zhang C, Berry GJ, Altman RB, Re C, Rubin DL, Snyder M. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat Commun. 2016;7:12474. doi: 10.1038/ncomms12474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ali S, Lewis J, Madabhushi A. Spatially aware cell cluster (spACC1) graphs: predicting outcome in oropharyngeal pl6+ tumors. Med Image Comput Comput Assist Interv. 2013;16:412–419. doi: 10.1007/978-3-642-40811-3_52. [DOI] [PubMed] [Google Scholar]
- 7.Chen JM, Qu AP, Wang LW, Yuan JP, Yang F, Xiang QM, Maskey N, Yang GF, Liu J, Li Y. New breast cancer prognostic factors identified by computer-aided image analysis of HE stained histopathology images. Sci Rep. 2015;5:10690. doi: 10.1038/srep10690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ali AS, Veltri R, Epstein JA, Christudass C, Madabhushi A. Cell cluster graph for prediction of biochemical recurrence in prostate cancer patients from tissue microarrays. In: Medical imaging 2013: digital pathology. 2013:387–93.
- 9.Aurello P, Berardi G, Giulitti D, Palumbo A, Tierno SM, Nigri G, D’Angelo F, Pilozzi E, Ramacciato G. Tumor-Stroma Ratio is an independent predictor for overall survival and disease free survival in gastric cancer patients. Surgeon. 2017;15:329–335. doi: 10.1016/j.surge.2017.05.007. [DOI] [PubMed] [Google Scholar]
- 10.Lee G, Ali S, Veltri R, Epstein JI, Christudass C, Madabhushi A. Cell orientation entropy (COrE): predicting biochemical recurrence from prostate cancer tissue microarrays. Med Image Comput Comput Assist Interv. 2013;16:396–403. doi: 10.1007/978-3-642-40760-4_50. [DOI] [PubMed] [Google Scholar]
- 11.Lee G, Veltri RW, Zhu G, Ali S, Epstein JI, Madabhushi A. Nuclear shape and architecture in benign fields predict biochemical recurrence in prostate cancer patients following radical prostatectomy: preliminary findings. Eur Urol Focus. 2017;3:457–466. doi: 10.1016/j.euf.2016.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lu C, Lewis JS, Jr, Dupont WD, Plummer WD, Jr, Janowczyk A, Madabhushi A. An oral cavity squamous cell carcinoma quantitative histomorphometric-based image classifier of nuclear morphology can risk stratify patients for disease-specific survival. Mod Pathol. 2017;30:1655–1665. doi: 10.1038/modpathol.2017.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nakashima Y, Yao T, Hirahashi M, Aishima S, Kakeji Y, Maehara Y, Tsuneyoshi M. Nuclear atypia grading score is a useful prognostic factor in papillary gastric adenocarcinoma. Histopathology. 2011;59:841–849. doi: 10.1111/j.1365-2559.2011.04035.x. [DOI] [PubMed] [Google Scholar]
- 14.Wang X, Janowczyk A, Zhou Y, Thawani R, Fu P, Schalper K, Velcheti V, Madabhushi A. Prediction of recurrence in early stage non-small cell lung cancer using computer extracted nuclear features from digital H&E images. Sci Rep. 2017;7:13543. doi: 10.1038/s41598-017-13773-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Leo P, Shankar E, Elliott R, Janowczyk A, Janaki N, Maclennan G, Madabhushi A, Gupta S. MP35-09 combination of NF-κB/P65 nuclear localization and gland morphologic features is predictive of biochemical recurrence. J Urol. 2018;199:e450. [Google Scholar]
- 16.Veta M, van Diest PJ, Kornegoor R, Huisman A, Viergever MA, Pluim JP. Automatic nuclei segmentation in H&E stained breast cancer histopathology images. PLoS ONE. 2013;8:e70221. doi: 10.1371/journal.pone.0070221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Doyle S, Feldman MD, Shih N, Tomaszewski J, Madabhushi A. Cascaded discrimination of normal, abnormal, and confounder classes in histopathology: Gleason grading of prostate cancer. BMC Bioinform. 2012;13:282. doi: 10.1186/1471-2105-13-282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Awan R, Sirinukunwattana K, Epstein D, Jefferyes S, Qidwai U, Aftab Z, Mujeeb I, Snead D, Rajpoot N. Glandular morphometrics for objective grading of colorectal adenocarcinoma histology images. Sci Rep. 2017;7:16852. doi: 10.1038/s41598-017-16516-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Baiocchi GL, Molfino S, Baronchelli C, Giacopuzzi S, Marrelli D, Morgagni P, Bencivenga M, Saragoni L, Vindigni C, Portolani N, et al. Recurrence in node-negative advanced gastric cancer: novel findings from an in-depth pathological analysis of prognostic factors from a multicentric series. World J Gastroenterol. 2017;23:8000–8007. doi: 10.3748/wjg.v23.i45.8000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dittmar Y, Schule S, Koch A, Rauchfuss F, Scheuerlein H, Settmacher U. Predictive factors for survival and recurrence rate in patients with node-negative gastric cancer—a European single-centre experience. Langenbecks Arch Surg. 2015;400:27–35. doi: 10.1007/s00423-014-1226-2. [DOI] [PubMed] [Google Scholar]
- 21.Ji MY, Fan DK, Lv XG, Peng XL, Lei XF, Dong WG. The detection of EBP50 expression using quantum dot immunohistochemistry in pancreatic cancer tissue and down-regulated EBP50 effect on PC-2 cells. J Mol Histol. 2012;43:517–526. doi: 10.1007/s10735-012-9424-0. [DOI] [PubMed] [Google Scholar]
- 22.Min KW, Kim DH, Son BK, Kim DH, Kim EK, Seo J, Ahn SB, Jo YJ, Park YS, Ha J. A high Ki67/BCL2 index could predict lower disease-free and overall survival in intestinal-type gastric cancer. Eur Surg Res. 2017;58:158–168. doi: 10.1159/000448945. [DOI] [PubMed] [Google Scholar]
- 23.Goldhirsch A, Wood WC, Coates AS, Gelber RD, Thurlimann B, Senn HJ, Panel M Strategies for subtypes–dealing with the diversity of breast cancer: highlights of the St. Gallen international expert consensus on the primary therapy of early breast cancer 2011. Ann Oncol. 2011;22:1736–1747. doi: 10.1093/annonc/mdr304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gao Y, Liu W, Arjun S, Zhu L, Ratner V, Kurc T, Saltz J, Tannenbaum A. Multi-scale learning based segmentation of glands in digital colonrectal pathology images. In: Proceedings of SPIE—the international society for optical engineering. 2016:9791. [DOI] [PMC free article] [PubMed]
- 25.Lee G, Ali S, Veltri R, Epstein JI, Christudass C, Madabhushi A. Cell orientation entropy (COrE): predicting biochemical recurrence from prostate cancer tissue microarrays. In: International conference on medical image computing and computer-assisted intervention (Miccai 2013), Pt Iii 2013;8151:396–403. [DOI] [PubMed]
- 26.Bubendorf L, Nocito A, Moch H, Sauter G. Tissue microarray (TMA) technology: miniaturized pathology archives for high-throughput in situ studies. J Pathol. 2001;195:72–79. doi: 10.1002/path.893. [DOI] [PubMed] [Google Scholar]
- 27.Luo X, Zang X, Yang L, Huang J, Liang F, Rodriguez-Canales J, Wistuba II, Gazdar A, Xie Y, Xiao G. Comprehensive computational pathological image analysis predicts lung cancer prognosis. J Thorac Oncol. 2017;12:501–509. doi: 10.1016/j.jtho.2016.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Park JA, Atia L, Mitchel JA, Fredberg JJ, Butler JP. Collective migration and cell jamming in asthma, cancer and development. J Cell Sci. 2016;129:3375–3383. doi: 10.1242/jcs.187922. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.