Abstract
This study aimed to develop and validate a practical, reliable assay for prognosis and chemotherapy benefit prediction compared with conventional staging in Gastric cancer (GC). Twenty-three candidate genes with significant correlation between quantitative hybridization and microarray results plus 2 reference genes were selected to form a 25-gene prognostic classifier, which can classify patients into 3 distinct groups of different risk of mortality obtained by analyzing microarray data from 78 frozen tumor specimens. The 25-gene assay was associated with overall survival in both training (P = 0.017) and testing cohort (P = 0.005) (462 formalin-fixed paraffin-embedded samples). The risk prediction in stages I + II is significantly better than that in stages III. Analysis demonstrated that this 25-gene signature is an independent prognostic predictor and show higher prognostic accuracy than conventional TNM staging in early stage patients. Moreover, only high-risk patients in stage I + II were found benefit from adjuvant chemotherapy (P = 0.043), while low-risk patients in stage III were not found benefit from adjuvant chemotherapy. In conclusion, our results suggest that this 25-gene assay can reliably identify patients with different risk for mortality after surgery, especially for stage I + II patients, and might be able to predict patients who benefit from chemotherapy.
Introduction
Gastric cancer (GC) is an aggressive malignancy with high rate of recurrence in patients even undergoing curative resection. Post-operative management depends on accurate prognostic staging to identify the individuals at high risk for relapse. Modest efficacy and considerable toxicities associated with adjuvant chemotherapy. Many factors can affect the prognosis of GC patients, such as TNM stage, while this has limited ability to stratify patients according to their likely outcome1. Moreover, there also have great variation in the histological appearance of GC2. So the current evaluation system for GC is insufficient for predicting the outcome of GC treatment. A more precise staging test would give clinicians the ability to identify patients with statistically heterogeneous outcomes from within otherwise homogeneous clinical groups.
GC is a multistage process involving the accumulation of genetic and epigenetic alterations which may be the important prognostic factors. Gene expression profiling is useful for classifying tumors for various types of cancer patients, and can predict the prognosis for patients with various types of cancer3, 4. However, most prognostic prediction models focused on breast cancer and lung cancer5, 6, only a few studies on GC prognosis7, 8. The profiling genes selected in GC studies have varied considerably. Gene expression profiles were also different according to the microarray platform and the analytic strategy used9–14.
In certain research situations, performing such analyses on archival formalin fixed paraffin-embedded (FFPE) surgical specimens would be advantageous as large libraries of such specimens with long-term follow-up data are widely available. To systematically validate the microarray results, branched DNA signal amplification (Quantigene Plex, QGP), quantitative hybridization assays, which can measure 3 to 80 target RNAs per well with unparalleled accuracy and precision, was used15, 16. In this study, we developed and validated a prognosis-associated 25-gene assay using QGP on FFPE samples. And we also investigated the chemotherapy benefit in different risk group.
Results
Screening of candidate biomarkers by microarray and establishment of a 31-gene prognostic algorithm
Detailed clinicopathological characteristics of the selecting, training, and testing sets were shown in Table 1. For initial screening of candidate biomarkers by microarray profiling, we selected 78 patients with qualified frozen tissues in the first batch.
Table 1.
Clinical and pathological characteristics of patients in three cohorts.
| Selecting Cohort | Training Cohort* | Test Cohort | |
|---|---|---|---|
| N = 78 | N = 102 | N = 360 | |
| Age at resection (years; mean32) | 61.1 ± 9.83 | 61.38 ± 9.51 | 58.97 ± 12.26 |
| Sex | |||
| Male | 58 (74.36%) | 79 (77.45%) | 246 (68.33%) |
| Female | 20 (25.64%) | 23 (22.55%) | 114 (31.67%) |
| Differentiation | |||
| Well-Moderately differentiated | 15 (19.23%) | 11 (10.78%) | 108 (30%) |
| Poorly differentiated | 63 (80.77%) | 91 (89.22%) | 239 (66.39%) |
| Undetermined | 0 | 0 | 13 (3.61%) |
| Lauren Subtype | |||
| Diffuse Type | 26 (33.33%) | 35 (34.31%) | 75 (20.83%) |
| Intestinal Type | 38 (48.72%) | 42 (41.18%) | 242 (67.22%) |
| Mixed Type | 14 (17.95%) | 25 (24.51%) | 43 (11.94%) |
| Location | |||
| Cardia | 27 (34.62%) | 30 (29.41%) | 105 (29.17%) |
| Non-cardia | 51 (65.38%) | 72 (70.59%) | 255 (70.83%) |
| TNM Stage | |||
| I | 3 (3.85%) | 6 (5.88%) | 36 (10%) |
| II | 17 (21.79%) | 23 (22.55%) | 90 (25%) |
| III | 48 (61.54%) | 73 (71. 57%) | 234 (68.82%) |
| IV | 10 (12.82%) | 0 | 0 |
| T Stage | |||
| 1 | 0 (0%) | 1 (0%) | 20 (5.11%) |
| 2 | 7 (8.97%) | 7 (9.84%) | 40 (10.46%) |
| 3 | 26 (33.33%) | 45 (31.15%) | 101 (30.9%) |
| 4 | 45 (57.69%) | 49 (59.02%) | 199 (53.53%) |
| N Stage | |||
| 0 | 14 (17.95%) | 19 (18.63%) | 96 (26.67%) |
| 1 | 11 (14.10%) | 15 (14.71%) | 53 (14.72%) |
| 2 | 19 (24.36%) | 28 (27.45%) | 78 (21.67%) |
| 3 | 34 (43.59%) | 40 (39.22%) | 133 (36.94%) |
| M Stage | |||
| 0 | 68 (87.18%) | 102 (100%) | 360 (100%) |
| 1 | 10 (12.82%) | 0 | 0 |
| Vascular invasion | |||
| V(−) | 31 (39.74%) | 43 (42.16%) | 179 (49.72%) |
| V(+) | 42 (53.85%) | 55 (53.92%) | 181 (50.28%) |
| Not recorded* | 5 (6.41%) | 4 (3.92%) | 0 |
*There is 51 cases overlapped in selecting cohort and training cohort.
On Affymetrix microarray analysis of tumors from the 78 cancer and 24 matched adjacent non-cancerous gastric mucosa, 2880 genes showed significantly differential expression between the GC tissues and adjacent non-cancerous gastric mucosa. We used a Cox proportional hazards modeling as the main analytical test used to develop the prognostic algorithm (to build a prognostic classifier), which selected31 target genes (Table 2) in the 78-patient selecting cohort. Among them, 14 genes were correlated with patient prognosis analyzed by hazard ratios from univariate Cox regression, including 6 protective genes (XAF1, IFITM1, NCOA7, GZF1, APAF1, and TCF7L2, with hazard ratio less than 1), and 8 risk genes (DYRK2, UBA2, EPHB2, PDCD5, FADD, MARCKS, B3GALT6, and ITCH, with hazard ratio more than 1), while 17 genes were related to classical oncogenic pathways or potential therapeutic targets in GC from previous publication, including MMP2, EGFR, MMP7, MET, ERBB2, CDK1, CDK6, SRC, IGF1R, CDK4, PDGFRB, ERBB3, PARP1, FRAP1, CDK3, FLT4, and KDR.
Table 2.
Representative amplified genomic loci and genes by microarray analysis.
| gene | Gene name | Chromosome | Reference sequence | Protein location | Relevant biological functions and pathways | Role in algorithm | P* value | Correlation Coefficient | P# value | HR(95%CI) |
|---|---|---|---|---|---|---|---|---|---|---|
| Survival associated genes from Chip results | ||||||||||
| XAF1 | XIAP associated factor 1 | 17p13.1 | NM_017523 | cytoplasm, nucleus | Affect the progress of the apoptosis signaling pathway | prognosis | 0 | 0.583 | 0 | 0.564 (0.424–0.752) |
| IFITM1 | interferon induced transmembrane protein 1 | 11p15.5 | NM_003641 | membrane, plasma membrane | Influence cell invasion | prognosis | 0 | 0.74 | 0.001 | 0.639 (0.489–0.834) |
| DYRK2 | dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 2 | 12q15 | NM_003583 | cytoplasm, nucleus,membrane | Tyrosine autophosphorylation and catalyzed phosphorylation of histones H3 and H2B | prognosis | 0 | 0.53 | 0.001 | 2.078 (1.347–3.205) |
| NCOA7 | nuclear receptor coactivator 7 | 6q22.32 | NM_181782 | intracellular, nucleus | Cell wall macromolecule catabolic process, positive regulation of transcription | prognosis | 0 | 0.581 | 0.003 | 0.514 (0.330–0.801) |
| UBA2 | ubiquitin-like modifier activating enzyme 2 | 19q12 | NM_005499 | cytoplasm, nucleoplasm | SUMO-activating enzyme for the sumoylation of proteins | prognosis | 0 | 0.577 | 0.011 | 2.064 (1.184–3.599) |
| EPHB2 | EPH receptor B2 | 1p36.1-p35 | NM_004442 | membrane | Angiogenesis, axon guidance | prognosis | 0 | 0.56 | 0.013 | 1.593 (1.101–2.305) |
| PDCD5 | programmed cell death 5 | 19q13.11 | NM_004708 | cytoplasm, nucleus | Apoptotic process | prognosis | 0 | 0.451 | 0.02 | 1.756 (1.094–2.820) |
| FADD | Fas (TNFRSF6)-associated via death domain | 11q13.3 | NM_003824 | membrane, plasma membrane | Apoptotic signaling pathway,TRIF-dependent toll-like receptor signaling pathway | prognosis | 0 | 0.665 | 0.033 | 1.579 (1.038–2.403) |
| B3GALT6 | UDP-Gal:betaGal beta 1,3-galactosyltransferase polypeptide 6 | 1p36.33 | NM_080605 | Golgi membrane | Carbohydrate metabolic process | prognosis | 0 | 0.595 | 0 | 2.256 (1.484–3.428) |
| MARCKS | myristoylated alanine-rich protein kinase C substrate | 6q22.2 | NM_002356 | plasma membrane | Energy reserve metabolic process | prognosis | 0.001 | 0.409 | 0.015 | 1.985 (1.141–3.452) |
| GZF1 | GDNF-inducible zinc finger protein 1 | 20p11.21 | NM_022482 | nucleolus | Negative regulation of transcription, DNA-templated | prognosis | 0.011 | 0.323 | 0 | 0.564 (0.424–0.752) |
| APAF1 | apoptotic peptidase activating factor 1 | 12q23 | NM_013229 | cytoplasm, nucleus | Activation of cysteine-type endopeptidase activity involved in apoptotic process | prognosis | 0.029 | 0.28 | 0.001 | 0.279 (0.127–0.610) |
| ITCH | itchy E3 ubiquitin protein ligase | 20q11.22 | NM_031483 | cytoplasm, nucleus, membrane | Notch signaling pathway, apoptotic process, inflammatory response | prognosis | 0.224 | 0.158 | 0.005 | 2.475 (1.311–4.671) |
| TCF7L2 | transcription factor 7-like 2 | 10q25.3 | NM_030756 | cytoplasm | Canonical Wnt receptor signaling pathway, fat cell differentiation | prognosis | 0.645 | 0.061 | 0 | 0.209 (0.112–0.389) |
| GC therapeutic targets | ||||||||||
| EGFR | epidermal growth factor receptor | 7p12 | NM_005228 | membrane, cytoplasm | MAP kinase kinase kinase activity, cell proliferation | prognosis | 0 | 0.691 | ||
| MMP7 | matrix metallopeptidase 7 | 11q21-q22 | NM_002423 | plasma membrane, extracellular region | Metalloendopeptidase activity, regulation of cell proliferation | prognosis | 0 | 0.656 | ||
| MET | met proto-oncogene | 7q31 | NM_000245 | plasma membrane | Activation of MAPK activity, cell proliferation | prognosis | 0 | 0.534 | ||
| ERBB2 | v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 2 | 17q12 | NM_004448 | plasma membrane | Cell proliferation, ATP binding | prognosis | 0 | 0.743 | ||
| CDK1 | cyclin-dependent kinase 1 | 10q21.1 | NM_001786 | cytoplasm, nucleus | DNA repair, DNA replication | prognosis | 0 | 0.498 | ||
| CDK6 | cyclin-dependent kinase 6 | 7q21-q22 | NM_001259 | cytoplasm, nucleus | G1/S transition of mitotic cell cycle | prognosis | 0 | 0.661 | ||
| IGF1R | insulin-like growth factor 1 receptor | 15q26.3 | NM_000875 | plasma membrane | Inactivation of MAPKK activity, insulin receptor signaling pathway | prognosis | 0 | 0.486 | ||
| CDK4 | cyclin-dependent kinase 4 | 12q14 | NM_001259 | cytoplasm, nucleus | G2/S transition of mitotic cell cycle, cell division | prognosis | 0 | 0.561 | ||
| SRC | v-src avian sarcoma | 20q12-q13 | NM_005417 | cytoplasm, nucleus | Ras protein signal transduction | prognosis | 0.001 | 0.403 | ||
| KDR | kinase insert domain receptor | 4q11-q12 | NM_002253 | plasma membrane | Angiogenesis, endothelial cell differentiation | prognosis | 0.002 | 0.397 | ||
| MMP2 | matrix metallopeptidase 2 | 16q13-q21 | NM_004530 | plasma membrane, extracellular region | Regulation of vascularization and the inflammatory response | prognosis | 0.086 | 0.221 | ||
| PDGFRB | platelet-derived growth factor receptor, beta polypeptide | 5q33.1 | NM_002609 | plasma membrane | G-protein coupled receptor signaling pathway, cell migration | prognosis | 0.148 | 0.187 | ||
| ERBB3 | v-erb-b2 avian erythroblastic leukemia viral oncogene homolog 3 | 12q13 | NM_001982 | plasma membrane | Growth factor binding, negative regulation of cell adhesion | prognosis | 0.154 | 0.185 | ||
| PARP1 | poly (ADP-ribose) polymerase 1 | 1q41-q42 | NM_001618 | nucleolus | DNA damage response, detection of DNA damage | prognosis | 0.458 | 0.097 | ||
| FRAP1 | mechanistic target of rapamycin | 1p36.2 | NM_004958 | membrane, cytoplasm | TOR signaling, ATP binding, drug binding, cell growth | prognosis | 0.738 | 0.044 | ||
| FLT4 | fms-related tyrosine kinase 4 | 5q35.3 | NM_001258 | cytoplasm, plasma membrane | Blood vessel morphogenesis, negative regulation of apoptotic process | prognosis | 0.745 | −0.043 | ||
| CDK3 | cyclin-dependent kinase 3 | 17q25.1 | NM_002021 | cytoplasm, nucleus | Cyclin-dependent protein serine/threonine kinase activity, G0 to G1 transition, cell proliferation | prognosis | 0.892 | 0.018 | ||
| Reference genes | ||||||||||
| TBP | TATA box binding protein | 6q27 | NM_003194 | nucleolus | Transcription initiation; RNA elongation; transcription | Reference | ||||
| PGK1 | phosphoglycerate kinase 1 | Xq13.3 | NM_000291 | cytoplasm | ATP binding; phosphoglycerate kinase activity; carbohydrate metabolic process | Reference | ||||
P* values for the correlation coefficients were estimated by Pearson correlation test.
P# values for the hazard ratios were estimated by univariate Cox regression analysis of the microarray data.
We then derived a formula (Supplementary materials and methods) to calculate the risk score for their risk of mortality for every patient based on their individual 31-gene expression levels. Those GC patients with a high-risk thirty-one gene signature had a shorter median overall survival than the patients with intermediate-risk gene signature and low-risk gene signature (median survival: 13.42 months vs. 32.24 months vs. not reached, P < 0.001, Fig. 1B). Moreover, we also validated our model in the publicly available gastric cancer data set (GSE62254). Similar to our previous results, those GC patients with a high-risk thirty-one gene signature also had a shorter median overall survival than the patients with intermediate-risk gene signature and low-risk gene signature (median survival: 37.93 months vs. 45.77 months vs. not reached, P < 0.001, Supplementary Figure S).
Figure 1.

Study design and the combined gene signature and survival in GC. Panel A showed the study design. Kaplan–Meier survival curve estimated the overall survival according to the 31 gene microarray signature (B) and 25-gene microarray signature33. Kaplan-Meier survival curves for training and testing cohort according to 25-gene signature were showed in Panel D and E.
Gene selection by quantitative hybridization assay in GC
Since in clinical settings where reproducibility, cost, and widespread availability are key priorities, we then aimed to establish a practical prognostic algorithm based on FFPE tissue samples, we systematically measured the expression 31 genes from microarray analysis in 61 matched FFPE tissues by QGP.
For each of the 31 genes, the correlation between relative expression status by QGP and that by microarray was statistically analyzed. Twenty three genes with strong correlation between two assays were selected, including APAF1, NCOA7, XAF1, IFITM1, EGFR, MMP7, MET, ERBB2, CDK1, CDK6, SRC, IGF1R, CDK4, KDR, FADD, EPHB2, PDCD5, MARCKS, GZF1, UBA2, MMP2, DYRK2, and B3GALT6 (Table 2). We test this 23 gene algorithm in 78 frozen microarray setting, the survival difference for each group is obvious, those high-risk gene signature GC patients had a shorter median overall survival than the patients with intermediate-risk gene signature and low-risk gene signature (median survival: 13.42 months vs. 28.95 months vs. 48.13 months, P < 0.001, Fig. 1C).
The twenty-five-gene signature and survival in GC
Then we identified the gene-signature by quantitative hybridization assay. Reference genes were TBP and PGK1 (Supplementary materials and methods). We establish a 25-gene (23 correlated genes plus 2 reference genes) prognostic algorithm based on FFPE tissues. And the coefficient for each of the 23 genes was derived from the previous cohort and formula of risk score calculating for each patients was changed accordingly.
In order to remove excess statistical confounding factors, patients with TNM stage IV were excluded in the training and testing cohort (the characteristics of this group is shown in Table 1). Then we evaluated this 25-gene assay in the FFPE tissue from the training cohort of 102 patients. Those high-risk GC patients were with a shorter median overall survival than the intermediate-risk and low-risk patients (median survival: 18.22 months vs. 37.02 months vs. 48.13 months, P = 0.017, Fig. 1D).
To confirm that the 25-gene algorithm had similar prognostic value in different populations, we tested it in an independent cohort of 360 patients. The general condition between patients in training cohort and test cohort is comparable (Table 3S). We applied the cutoff’s value for categorization in training group to the independent test set of 360 patients. Similar to the training cohort, our results showed that patients with a high-risk gene signature had a shorter median overall survival than those with a low-risk gene signature (median survival: 34.39 months vs. 37.77 months vs. not reached, P = 0.005) (Fig. 1E).
Twenty-five-gene signature and survival in different TNM stages GC patients
We also investigated the gene signature in tumor tissues obtained from GC patients with stage I and II or stage III and IV. First, we analyzed the gene signature in tumor specimens obtained from patients in the selecting cohort including 78 cases with stage I and II or stage III and IV. TNM stage I and II disease combined GC patients, those with a high-risk gene signature had a shorter overall survival than those with a low-risk gene signature (median survival: 9.97 months vs. 59.93 months vs. not reached, P < 0.001, Fig. 2A), among patients with stage III and IV disease, overall survival also showed similar results (median survival: 13.42 months vs. 28.95 months vs. 48.13, P < 0.001, Fig. 2B).
Figure 2.

Twenty-five-gene signature and survival in GC at different TNM stages. Overall survival of patients with stage I + II and III + IV in the first cohort (A and B), stage I + II in training cohort33, and stage I + II in testing cohort (D); Panel E showed the overall survival with stage I + II disease in combined training and testing cohorts; Panel F: ROC curves compare the prognostic accuracy of the gene signature with clinicopathological risk factors in combined training and testing cohorts GC.
Moreover, the associations between the gene signature and prognosis in training and testing cohort were also analyzed with stage I and II or stage III respectively. In the subgroup analysis of 29 patients with TNM stage I and II of training cohort, those with a high-risk gene signature had a shorter overall survival than those with an intermediate-risk gene signature and low-risk gene signature (median survival: 21.45 months vs. 63.39 months vs. not reached, P = 0.002, Fig. 2C). In the subgroup analysis of 126 patients with stage I and II disease of testing cohort, those with a high-risk gene signature also showed a shorter overall survival than those with a low-risk gene signature (median survival: 68.64 months vs. not reached vs. not reached, P = 0.014, Fig. 2D), while either in original training or testing cohort the overall survival in the stage III group did not differ significantly (training cohort: P = 0.194; testing cohort: P = 0.264, figure not shown).
Twenty-five-gene assay is an independent prognostic factor in stage I and II patients
Moreover, we also noted similar results in the patients with stage I and II disease combined training and testing cohort, those with a high-risk gene signature showed a shorter overall survival than those with a low-risk gene signature (median survival: 54.57 months vs. not reached vs. not reached, P < 0.001, Fig. 2E).
Age, sex, gene signature, differentiation, vascular invasion, and TNM stage were included in the Cox multivariate regression analysis. According to the analysis, the high-risk gene signature, differentiation, and tumor stage II were significantly associated with death from any cause among the 155 patients (Table 3) (hazard ratio for the high-risk signature vs. the intermediate-risk signature: 5.325, 95% confidence interval, 2.061 to 13.758, P = 0.001; high-risk signature vs. the low-risk signature: 6.248, 95% confidence interval, 2.320 to 16.826, P < 0.001).
Table 3.
Multivariate analysis of prognostic factors by Cox proportional hazard model in stage I + II GC.
| Variables | Multivariate analysis | |
|---|---|---|
| HR (95%CI) | P value | |
| Age | 1.022 (0.997–1.048) | 0.087 |
| 25-gene signature | ||
| Low-risk | ||
| Intermediate-risk vs. low-risk | 5.325 (2.061–13.758) | 0.001 |
| High-risk vs. low-risk | 6.248 (2.320–16.826) | <0.001 |
| TNM stage | ||
| II vs. I | 3.057 (1.432–6.526) | 0.004 |
| Differentiation | ||
| Poorly vs. Well-Moderately | 2.510 (1.309–4.813) | 0.006 |
The 25-gene signature based classifier also showed significantly higher prognostic accuracy than any clinicopathological risk factor, including TNM stage and differentiation (Fig. 2F). Thus, this signature can add prognostic value to clinicopathological prognostic features.
Twenty-five-gene signature and adjuvant chemotherapy
We noted adjuvant chemotherapy can enhance survival in all 436 patients (another 26 cases chemotherapy information was missed, median survival: 29.62 months vs. 50.79 months, P = 0.003, Fig. 3A), including high-risk median survival (median survival: 23.08 months vs. 48.42 months, P = 0.024, Fig. 3B), and intermediate-risk group (median survival: 29.55 months vs. 44.41 months, P = 0.041, Fig. 3C), while the overall survival in the patients with low-risk did not differ significantly (median survival: 59.30 vs. not reached, P = 0.513, Fig. 3D, Table 4S).
Figure 3.

Prediction effect of chemotherapy benefit in different risk group. Kaplan-Meier survival curves for all patients (A–D), stage I + II (E–H), and III (I–L) in different-risk groups, which were stratified by the receipt of chemotherapy.
In the 149 stage I and II cases, adjuvant chemotherapy did not enhance survival (median survival: not reached vs. not reached, P = 0.101, Fig. 3E). Results from subgroup analysis using our twenty-five-gene signature based classifier showed that patients in the high-risk group had a favorable response to adjuvant chemotherapy (median survival: 36.59 months vs. not reached, P = 0.043, Fig. 3F). While the overall survival in the patients with either intermediate-risk or low-risk did not differ significantly (intermediate-risk group: P = 0.719; low-risk group: P = 0.498, Fig. 3G and H). In the 287 stage III cases, adjuvant chemotherapy can enhance survival (median survival: 19.99 months vs. 29.03 months, P = 0.004, Fig. 3I). Results from subgroup analysis using our twenty-five-gene signature based classifier showed that patients with either the high-risk or intermediate-risk had a favorable response to adjuvant chemotherapy (median survival of high-risk group, 16.54 months vs. 21.80 months, P = 0.054, Fig. 3J; intermediate-risk group, 17.85 months vs. 32.24 months, P = 0.012, Fig. 3K), while the overall survival of patients in low-risk group did not differ significantly (P = 0.912, Fig. 3L, Table 4S).
Moreover, the associations between adjuvant chemotherapy and prognosis in different lymph node metastasis group were also analyzed (N negative, P < 0.001, Fig. 4A; N positive, P = 0.967, Fig. 4E), we found that adjuvant chemotherapy can improve survival in GC patients with either high-risk (P = 0.012, Fig. 4B) or intermediate-risk (P = 0.001, Fig. 4C) group in lymph node metastasis positive group. In the other patients including lymph node metastasis positive group with low-risk (P = 0.454, Fig. 4D) and all lymph node metastasis negative group (High-risk group, P = 0.567, Fig. 4F; Intermediate-risk group, P = 0.51, Fig. 4G; Low-risk group, P = 0.347, Fig. 4H), overall survival is not significantly different between the chemotherapy and no chemotherapy group (Table 4S). The results indicate that our classifier could successfully identify patients who were suitable candidates for adjuvant chemotherapy.
Figure 4.

Prediction effect of chemotherapy benefit in different risk group. Kaplan-Meier survival curves for N (−) (A–D) and N (+) (E–H) in different-risk groups, which were stratified by the receipt of chemotherapy.
Discussion
Our practical, quantitative-hybridization-based assay reliably identified GC patients at high risk for mortality after surgical resection, discriminating such patients with greater accuracy than use of NCCN criteria alone. Among the twenty three genes, most of them can be generally classified into the following types: epidermal growth factor receptor, cell cycle factor, angiogenesis, matrix metalloproteinase, and apoptosis genes17–21. Moreover, some genes also involved in notch signaling pathway, Tor signaling pathway, regulation of transcription22, 23, MAPK signaling pathway, and metabolic process24, 25. Although other groups have developed gene signatures prognostic of survival in GC9, none of these previous studies used FFPE samples. Furthermore, most previous studies did not subject their prognostic signatures to large-scale, independent validation. Taken together, our assay for GC is the first of its kind in these important respects: the performance of the assay in one of the studies in a laboratory that was independent from the laboratory in which the assay was developed, the relatively large sizes of the independent testing cohorts, and the potentially large disparity between the genetic background of one of the test cohorts and that of the original training cohort used for development of the assay.
In this study, we noted that in stage I and II GC patients, those with a high-risk gene signature also showed a poor overall survival than those with a low-risk gene signature, while the overall survival of the patients with stage III did not differ significantly. This is probably because the consideration of palliative nature of surgical treatment in stage III group. The survival of patients with this stage will be affected significantly by many clinical and treatment factors other than genetic background of the cancer, and some gene classifiers study were more willing to focus on early stage cancer26.
Now two large Asian randomized Phase III studies (the ACTS GC and CLASSIC trials) have confirmed the survival benefit for postoperative chemotherapy after curative D2 lymph node dissection in patients with GC27, 28. But not all patients, especially in patients with early stage need chemotherapy and can benefit from it29, 30. In our study, these results indicate that our classifier could successfully identify patients with different stage who may benefit from adjuvant chemotherapy. The results indicate that the 25-gene signature could be used to select early stage GC patients at high risk for adjuvant chemotherapy and advanced stage at either intermediate or high risk for adjuvant chemotherapy. Meanwhile, it may spare early stage at either low-risk or intermediate-risk and advanced stage at low risk patients from for unnecessary chemotherapy.
In conclusion, we identified a 25-gene signature associated with prognosis in GC, and validated it in another 360 cases. Statistical analysis demonstrated it is an independent prognostic predictor. The predicting role of it in stage I and II is significantly better. Moreover, the patients with high-risk assay had a chemotherapy benefit in stage I and II GC, while low-risk patients in stage III were not found benefit from adjuvant chemotherapy.
Materials and Methods
Patients and samples
All of the patients with GC included in this study were diagnosed and surgically treated in Peking University Cancer Hospital between 1996 and 2007 and followed up to January 2013. This investigation was approved by the Institutional Review Boards of the hospital, informed consent was obtained from each patient, and all methods were performed in accordance with the relevant guidelines and regulations. All the frozen samples were collected and stored by Central Biobank Facility and all FFPE samples by Pathological Department of hospital. All frozen sample for this investigation passed the histological re-assessment containing at least 70% tumor cells. All FFPE tissues samples were hematoxylin-eosin (HE) stained and evaluated for one slide by two pathologists (YL and BD) and manually dissected to remove non-cancerous mucosa and mesenchymal tissues to guarantee at least 80% tumor cells. The TNM stage of GC was classified according to the 7th edition of classification recommended by the American Joint Committee on Cancer (AJCC)31. This investigation was performed after approval by Ethics Committee of Peking University Cancer Hospital. Informed general consent was obtained from each patient at the time of collection.
Study design
The study design is shown in Fig. 1. At first, the frozen tissues from 78 patients (78 cancer tissues and 24 matched normal tissues) were profiled by Affymetrix Hu133Plus2 arrays for mRNA expression. Then analysis of microarray data generated candidate biomarkers related to prognosis and consensus therapeutic targets. And only biomarkers with comparable expression detected by quantitative Quantigene assay in the matched FFPE tissues were selected to develop a multiple gene assay, which was then tested and validated in two cohorts of patients with FFPE tissue samples.
In the expression profiling assay, the frozen tissues from all stages (I-IV) patient with complete clinicopathological and follow-up information were randomly selected and retrieved from the Central Biobank Facility. In the following quality control test, samples with tumor percentage <70% by histology evaluation and poor RNA quality were excluded.
In the test and validation phase, only patients with TNM stage I-III GC undergoing curative resection (histologically negative resection margin), and with complete clinicopathological and follow-up information available, were included. The reason for exclusion of stage IV from the validation cohort is because the consideration of palliative nature of surgical treatment for stage IV patients. The survival of patients with stage IV will be affected significantly by many clinical and treatment factors other than genetic background of the cancer. Patients with FFPE tissues not available or fail to pass the quality assessment were also excluded.
Microarray Analysis
Total RNA were extracted from frozen tissues and profiled by Affymetrix Hu133Plus2 arrays for mRNA expression according to the manufacturer’s specifications. Robust Multi-array Analysis (RMA) algorithm provided by software Expression Console was used to call gene level expression values from raw signals. Based on the algorithm published, only the best probe set was selected to represent gene expressions. Any gene without probe set with informative score < 0.5 is removed from this analyses.
Quantitative hybridization Assay in FFPE tissues
After manually dissected the FFPE slides to remove non-cancerous mucosa with scalpels, tissue homogenates were prepared according to the procedure described in the QuantiGene Sample Processing Kit for FFPE Tissues (Panomics, Inc., Fremont, CA). Briefly, 200ul of homogenizing solution supplemented with 2 µl of proteinase K (50 µg/µl) were incubated with 6 deparaffinized 5 µm sections overnight at 65 °C. Then the tissue homogenate was separated from debris by brief centrifugation, and transferred to a new tube.
Standard probe design software was used to design specific oligonucleotide probe sets for detecting target genes by QuantiGene plex 2.0 Reagent Systems (Panomics, Inc.), which gives 400-fold signal amplification. And the assay was performed according to protocols recommended by manufacturer (Panomics, Inc.). Briefly, probe set oligonucleotides were mixed with the sample solution into a 96-well plate. Target RNA was captured during an overnight incubation at 54 °C. Unbound material was removed by three washes with 200 µl of wash buffer followed by sequential hybridization of RNA amplifier molecules, then pre-amplifier hybridization, amplifier hybridization, and label probe hybridization were performed. Finally, plate were prepared for analysis after Streptavidinconjugated Phycoerythrin (SAPE) working reagent was added.
Gene Signature and Statistical Analysis
First, the genes which showed significantly differential expression between the GC tissues and adjacent non-cancerous gastric mucosa were selected from the 78 microarray results. Then we used a Cox proportional hazards modeling as the main analytical test used to develop the prognostic algorithm. Hazard ratios from univariate Cox regression analysis were used to determine which genes were associated with death. Protective genes were defined as those associated with a hazard ratio for death of less than 1; risk genes were defined as those associated with a hazard ratio for death of more than 1. For genes that were significantly correlated with survival, we used a linear combination of the gene-expression coding values weighted by the regression coefficients to calculate a risk score for each patient. Resultant predicted risk scores from the training cohort were divided at the 33rd and 67th percentiles to generate cutoff s for categorization of risk score as low-risk, intermediate-risk, and high-risk. Kaplan–Meier analysisi was used to compare survival the survival distributions of two or more groups of a between-subjects factor with the log-rank test. Multivariate Cox proportional hazards regression analysis with backward, stepwise selection was used to evaluate independent prognostic factors associated with survival. The correlation of the microarray and QGP results was indexed by Pearson’s correlation test. P < 0.05 was considered to indicate statistical significance, and all tests were two-tailed.
Significance
This study develops and validates a practical, reliable assay which can identify patients with different risk for mortality after surgery, and might be able to predict patients who may benefit from chemotherapy in GC.
Electronic supplementary material
Acknowledgements
This work was supported in part by grants from National Key Technology R&D Program (2012AA02A203-B01, 2012AA02A504-B01, 2012AA020101, J. Ji) and Beijing Municipal Science & Technology Commission (Z121100007512010, D131100005313010, L. Zhang, J. Ji).
Author Contributions
X.W., L.Z. and J.J. conceived and designed the experiments; X.W., Y.L., Z.N., R.F., Y.J., L.Z., D.S., H.D., Y.H., X.X., X.C., L.L., T.G., and Z.L. performed the experiments; X.W., L.Z. and Q.J. analyzed the data and contributed to writing and editing the manuscript; L.Z. and J.J. supervised the project and wrote the manuscript.
Competing Interests
Q. Ji has ownership interest (including patents) in AstraZeneca R&D. No potential conflicts of interest were disclosed by the other authors.
Footnotes
Xiaohong Wang, Yiqiang Liu, Zhaojian Niu and Runjia Fu contributed equally to this work.
Electronic supplementary material
Supplementary information accompanies this paper at doi:10.1038/s41598-017-07604-y
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Lianhai Zhang, Email: zlhzlh@hotmail.com.
Jiafu Ji, Email: jijiafu@hsc.pku.edu.cn.
References
- 1.Peters, C. J. et al. A 4-gene signature predicts survival of patients with resected adenocarcinoma of the esophagus, junction, and gastric cardia. Gastroenterology139, 1995–2004 e1915 (2010). [DOI] [PubMed]
- 2.Roberts PO, et al. Pathological factors affecting gastric adenocarcinoma survival in a Caribbean population from 2000–2010. World journal of gastrointestinal surgery. 2014;6:94–100. doi: 10.4240/wjgs.v6.i6.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ji J, et al. Comprehensive analysis of the gene expression profiles in human gastric cancer cell lines. Oncogene. 2002;21:6549–6556. doi: 10.1038/sj.onc.1205829. [DOI] [PubMed] [Google Scholar]
- 4.Cho JY, et al. Gene expression signature-based prognostic risk score in gastric cancer. Clinical cancer research: an official journal of the American Association for Cancer Research. 2011;17:1850–1857. doi: 10.1158/1078-0432.CCR-10-2180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chen HY, et al. A five-gene signature and clinical outcome in non-small-cell lung cancer. The New England journal of medicine. 2007;356:11–20. doi: 10.1056/NEJMoa060096. [DOI] [PubMed] [Google Scholar]
- 6.Kratz JR, et al. A practical molecular assay to predict survival in resected non-squamous, non-small-cell lung cancer: development and international validation studies. Lancet. 2012;379:823–832. doi: 10.1016/S0140-6736(11)61941-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Xu ZY, Chen JS, Shu YQ. Gene expression profile towards the prediction of patient survival of gastric cancer. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie. 2010;64:133–139. doi: 10.1016/j.biopha.2009.06.021. [DOI] [PubMed] [Google Scholar]
- 8.Marchet A, et al. Gene expression profile of primary gastric cancer: towards the prediction of lymph node status. Annals of surgical oncology. 2007;14:1058–1064. doi: 10.1245/s10434-006-9090-0. [DOI] [PubMed] [Google Scholar]
- 9.Wang Z, et al. Identification of a 5-gene signature for clinical and prognostic prediction in gastric cancer patients upon microarray data. Med Oncol. 2013;30:678. doi: 10.1007/s12032-013-0678-5. [DOI] [PubMed] [Google Scholar]
- 10.Busuttil RA, et al. A signature predicting poor prognosis in gastric and ovarian cancer represents a coordinated macrophage and stromal response. Clinical cancer research: an official journal of the American Association for Cancer Research. 2014;20:2761–2772. doi: 10.1158/1078-0432.CCR-13-3049. [DOI] [PubMed] [Google Scholar]
- 11.Yin Y, et al. Converting a microarray signature into a diagnostic test: a trial of custom 74 gene array for clarification and prediction the prognosis of gastric cancer. PloS one. 2013;8:e81561. doi: 10.1371/journal.pone.0081561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Shen J, et al. A three-gene signature as potential predictive biomarker for irinotecan sensitivity in gastric cancer. Journal of translational medicine. 2013;11:73. doi: 10.1186/1479-5876-11-73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rachidi SM, Qin T, Sun S, Zheng WJ, Li Z. Molecular profiling of multiple human cancers defines an inflammatory cancer-associated molecular pattern and uncovers KPNA2 as a uniform poor prognostic cancer marker. PloS one. 2013;8:e57911. doi: 10.1371/journal.pone.0057911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Takeno A, et al. Gene expression profile prospectively predicts peritoneal relapse after curative surgery of gastric cancer. Annals of surgical oncology. 2010;17:1033–1042. doi: 10.1245/s10434-009-0854-1. [DOI] [PubMed] [Google Scholar]
- 15.Ching LK, et al. Transcriptional profiling of TLR-4/7/8-stimulated guinea pig splenocytes and whole blood by bDNA assay. Journal of immunological methods. 2011;373:54–62. doi: 10.1016/j.jim.2011.07.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Flagella M, et al. A multiplex branched DNA assay for parallel quantitative gene expression profiling. Analytical biochemistry. 2006;352:50–60. doi: 10.1016/j.ab.2006.02.013. [DOI] [PubMed] [Google Scholar]
- 17.Murshed F, Farhana L, Dawson MI, Fontana JA. NF-kappaB p65 recruited SHP regulates PDCD5-mediated. apoptosis in cancer cells. Apoptosis: an international journal on programmed cell death. 2014;19:506–517. doi: 10.1007/s10495-013-0939-y. [DOI] [PubMed] [Google Scholar]
- 18.Cappetta A, et al. Advanced gastric cancer (GC) and cancer of the gastro-oesophageal junction (GEJ): focus on targeted therapies. Critical reviews in oncology/hematology. 2012;81:38–48. doi: 10.1016/j.critrevonc.2010.12.006. [DOI] [PubMed] [Google Scholar]
- 19.Zhu, L. M. et al. Tumor suppressor XAF1 induces apoptosis, inhibits angiogenesis and inhibits tumor growth in hepatocellular carcinoma. Oncotarget (2014). [DOI] [PMC free article] [PubMed]
- 20.Yang, C. C., Yao, C. A., Yang, J. C. & Chien, C. T. Sialic Acid Rescues Re-purified Lipopolysaccharide-Induced Acute Renal Failure via Inhibiting TLR4/PKC/gp91-mediated Endoplasmic Reticulum Stress, Apoptosis, Autophagy, and Pyroptosis Signaling. Toxicological sciences: an official journal of the Society of Toxicology (2014). [DOI] [PMC free article] [PubMed]
- 21.Yee C, Yang W, Hekimi S. The intrinsic apoptosis pathway mediates the pro-longevity response to mitochondrial ROS in C. elegans. Cell. 2014;157:897–909. doi: 10.1016/j.cell.2014.02.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nguyen TA, Hoivik D, Lee JE, Safe S. Interactions of nuclear receptor coactivator/corepressor proteins with the aryl hydrocarbon receptor complex. Archives of biochemistry and biophysics. 1999;367:250–257. doi: 10.1006/abbi.1999.1282. [DOI] [PubMed] [Google Scholar]
- 23.Dambara A, et al. Nucleolin modulates the subcellular localization of GDNF-inducible zinc finger protein 1 and its roles in transcription and cell proliferation. Experimental cell research. 2007;313:3755–3766. doi: 10.1016/j.yexcr.2007.07.003. [DOI] [PubMed] [Google Scholar]
- 24.Gadi D, Wagenknecht-Wiesner A, Holowka D, Baird B. Sequestration of phosphoinositides by mutated MARCKS effector domain inhibits stimulated Ca(2+) mobilization and degranulation in mast cells. Molecular biology of the cell. 2011;22:4908–4917. doi: 10.1091/mbc.E11-07-0614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cole SE, Mao MS, Johnston SH, Vogt TF. Identification, expression analysis, and mapping of B3galt6, a putative galactosyl transferase gene with similarity to Drosophila brainiac. Mammalian genome: official journal of the International Mammalian Genome Society. 2001;12:177–179. doi: 10.1007/s003350010241. [DOI] [PubMed] [Google Scholar]
- 26.Zhang JX, et al. Prognostic and predictive value of a microRNA signature in stage II colon cancer: a microRNA expression analysis. The lancet oncology. 2013;14:1295–1306. doi: 10.1016/S1470-2045(13)70491-1. [DOI] [PubMed] [Google Scholar]
- 27.Bang YJ, et al. Adjuvant capecitabine and oxaliplatin for gastric cancer after D2 gastrectomy (CLASSIC): a phase 3 open-label, randomised controlled trial. Lancet. 2012;379:315–321. doi: 10.1016/S0140-6736(11)61873-4. [DOI] [PubMed] [Google Scholar]
- 28.Sasako M, et al. Five-year outcomes of a randomized phase III trial comparing adjuvant chemotherapy with S-1 versus surgery alone in stage II or III gastric cancer. Journal of clinical oncology: official journal of the American Society of Clinical Oncology. 2011;29:4387–4393. doi: 10.1200/JCO.2011.36.5908. [DOI] [PubMed] [Google Scholar]
- 29.Lage H. Molecular analysis of therapy resistance in gastric cancer. Dig Dis. 2003;21:326–338. doi: 10.1159/000075356. [DOI] [PubMed] [Google Scholar]
- 30.Hong L, et al. MicroRNAs in gastrointestinal cancer: prognostic significance and potential role in chemoresistance. Expert opinion on biological therapy. 2014;14:1103–1111. doi: 10.1517/14712598.2014.907787. [DOI] [PubMed] [Google Scholar]
- 31.Washington K. 7th edition of the AJCC cancer staging manual: stomach. Annals of surgical oncology. 2010;17:3077–3079. doi: 10.1245/s10434-010-1362-z. [DOI] [PubMed] [Google Scholar]
- 32.Jeong Y, et al. Nuclear receptor expression defines a set of prognostic biomarkers for lung cancer. PLoS medicine. 2010;7:e1000378. doi: 10.1371/journal.pmed.1000378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Licitra L, et al. Evaluation of EGFR gene copy number as a predictive biomarker for the efficacy of cetuximab in combination with chemotherapy in the first-line treatment of recurrent and/or metastatic squamous cell carcinoma of the head and neck: EXTREME study. Ann Oncol. 2011;22:1078–1087. doi: 10.1093/annonc/mdq588. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
