Skip to main content
Springer logoLink to Springer
. 2025 Oct 13;29(1):132–146. doi: 10.1007/s10120-025-01671-1

A gene-expression signature defines a subtype of Stomach Adenocarcinomas with low levels of Claudins and a high ratio of NF-YA long/NF-YA short splicing variants

Alberto Gallo 1, Mirko Ronzio 1, Maria Barbara Campbell 1, Sofia Polettini 1, Enrico Garattini 2, Roberto Mantovani 1, Diletta Dolfini 1,
PMCID: PMC12830421  PMID: 41082105

Abstract

Background

Claudin-3, Claudin-4, and Claudin-7 are expressed on the surface of epithelial cells. Their absence in neoplastic cells of epithelial origin is an aggressiveness marker in different cancers. The NF-YA gene codes for the Nuclear-Transcription-Factor-Y-Subunit-A, which is overexpressed in various tumors. In tumors, the relative ratio of the two major NF-YA alternative splicing isoforms, NF-YA long and NF-YA short, is associated with a mesenchymal phenotype and a poor prognosis. Based on a high NF-YA long/NF-YA short ratio, we generated a 158-gene signature that is common to Claudinlow Breast Carcinomas (BRCA) and Stomach Adenocarcinomas (STAD).

Methods

To better classify STAD Claudinlow tumors, we employed a hierarchical clustering approach based on our 158-gene signature to classify STAD into a Claudinlow subgroup. We tested the classification potential of our signature in TCGA as well as in two additional datasets of tumors. We used the deep-learning DeepCC tool and the 158-gene signature to classify the STAD cell lines available in the CCLE platform. Obtained data were validated with qRT-PCR and Western blots.

Results

The 158-gene signature resulted in the selection of a STAD subgroup with effective Claudinlow expression and with a high NF-YA long/NF-YA short ratio. This Claudinlow subgroup was separated from the EMT subgroup of STAD and it is characterized by poor clinical outcome. We identified nine Claudinlow STAD cell lines with a high NF-YA long/NF-Y short ratio and validated the expression of selected markers.

Conclusions

Our work supports the notion that three overlapping features—low expression of Claudin-3/4/7, high NF-YA long/NF-YA short ratio and a 158-gene signature—mark a specific subset of STAD characterized by mesenchymal features and poor prognosis.

Supplementary Information

The online version contains supplementary material available at 10.1007/s10120-025-01671-1.

Keywords: Classification of STAD tumors, Cell lines, Claudinlow, NF-YA

Introduction

Stomach Adenocarcinoma (STAD) is a heterogeneous type of tumor characterized by a poor survival rate [1]. The Lauren’s classification groups gastric cancers into intestinal (IT), diffuse (DF), and mixed (MX) subtypes according to their histological features [2, 3]. In addition, gastric cancers can be classified on the basis of specific genetic mutations, chromosomal alterations, epigenomic features, and gene expression patterns [47]. The TCGA-based (The Cancer Genome Atlas) molecular classification of gastric cancers [8] defines four distinct subtypes (Reviewed in [911]), i.e., EBV (EBV infected), MSI (MicroSatellite Instability), GS (Genomically Stable), and CIN (Chromosomal Instability) tumors. Similarly, the ACRG-based (Asian Cancer Research Group) molecular classification identifies an equal number of stomach tumor subtypes, i.e., MSS;TP53 (MicroSatellite Stable; TP53 inactive), MSS/TP53+, MSI, and EMT (Epithelial to Mesenchymal Transition) [12].

Claudins are a large family of proteins which are expressed on epithelial cells and are involved in ordered cell–cell interactions [13]. The EMT process reduces the levels or abolishes the expression of certain Claudins via transcriptional silencing of the corresponding genes [14]. Low expression of Claudin-3, Claudin-4, and Claudin-7 (CLDN3, CLDN4, and CLDN7) is associated with 3 types of epithelial cancers: breast cancer (BRCA), STAD, and bladder cancer (BLCA). In BRCA, tumors with low expression of Claudins were originally included in the large Basal-like or Triple-Negative, TNBC subgroup. Additional work led to the identification of a further subgroup of tumors characterized by distinct molecular features and a transcriptomic signature consisting of 437 genes [15]. Subsequent studies reduced the members of this signature to 19 genes [16]. Claudin-4 is frequently overexpressed in STAD, and membranous Claudin-4 expression correlates with better prognosis and reduced cancer cell migration and invasion, while also enhancing tight junction barrier function [17]. A fifth STAD subgroup (Claudinlow) was added to the four types of tumors mentioned above [8, 12]: this additional subgroup was identified on the basis of a 24-gene transcriptomic signature, which was obtained by analysis of the EMT tumors available in the ACRG classification [18]. In the case of BLCA, a separate Claudinlow subgroup was identified and reported in the scientific literature. This BLCA subgroup is associated with a 40-gene transcriptomic signature known as BCL40 [19, 20].

In the BRCA context, the presence of a distinct Claudinlow subgroup was challenged on the basis of the following reasons: (i) aggressive Claudinlow BRCAs arise not only from mesenchymal Basal-like tumors, but also from Luminal tumors, which are generally characterized by epithelial features [21]; (ii) unlike the Basal-like/TNBC counterparts, most Claudinlow tumors show Estrogen-Receptor (ER) expression, which is a hallmark of Luminal cancers [22]; (iii) immunohistochemistry (IHC) studies on CLDN3, CLDN4, and CLDN7 lack univocal correlations between low expression of the 3 Claudins and clinical features or EMT-markers levels (Reviewed in [13]); (iv) often, co-localization of Claudins on the cellular membrane and inside the cell is inconsistent. Overall, the results obtained in BRCA challenge the presence of a distinct Claudinlow subgroup in STAD as well.

NF-Y is a trimeric transcription factor (TF) playing an important role in the regulation of various genetic loci, with particular reference to those involved in the cell cycle and metabolism [23]. We characterized the expression of the NF-Y subunits in epithelial cancers using a systematic approach [23]. In particular, we reported data on the differential regulation of two alternatively spliced isoforms of NF-YA, NF-YAl (NF-YA long) and NF-YAs (NF-YA short), in aggressive cancers. Our data demonstrate a high NF-YAl/NF-YAs expression ratio in the EMT subgroups of BRCA, STAD, and Head and Neck Squamous Cell Carcinoma (HNSCC). With the use of the NF-YAl/NF-YAs expression ratio, we analyzed the STAD and BRCA cases available in the TCGA database, and we identified a common 158-gene signature predicting poor clinical outcomes [24]. We also deconvoluted the single-cell RNA-sequencing (scRNA-seq) data, which allowed us to predict the EMT features of the neoplastic cells within each tumor. In a series of experimental studies, we ablated NF-YA exon-3 in clones derived from Claudinlow BRCA cell lines with a genome editing approach. This resulted in cellular clones characterized by the sole expression of the “epithelial” NF-YAs splicing variant. The genetically modified cells showed normal growth, although they presented an impairment of the mesenchymal/migratory features in vitro and metastatic behavior in vivo [24]. Thus, at least in BRCA cells, the relative levels of the NF-YA isoforms play a relevant role in shaping the EMT phenotype.

In the present study, we compare our 158-gene signature and the 24-gene signature described by Nishijima et al. [18] to identify a Claudinlow subtype in STAD tumors. In addition, we provide an improved classification of the STAD cell lines available in the CCLE (Cancer Cell Line Encyclopedia) database.

Materials and methods

STAD RNA-seq data and molecular classifications

We downloaded the STAD gene expression data available in the TCGA dataset from the http://firebrowse.org/ web page, which is contained in the illuminahiseq_rnaseqv2-RSEM_genes and illuminahiseq_rnaseqv2-RSEM_isoforms files. As of March 2025, RNA-seq data for 415 primary tumor samples were available. We retrieved the FASTQ files associated with: (i) 50 STAD cell lines available in the CCLE (Cancer Cell Line Encyclopedia, SRA BioProject PRJNA523380) database and an independent dataset (SRA BioProject PRJNA327709). When duplicate samples were available in the 2 datasets, we performed our analyses on the CCLE data; (ii) 231 STAD primary tumor samples and 230 paired normal tissues from Harbin Medical University Cancer Hospital (SRA BioProject PRJNA764173) [25]; (iii) 60 STAD primary tumor samples from Seoul National University Hospital (SRA BioProject PRJNA1119255); (iv) 13 STAD primary tumor samples available inside the Istituto di Ricerche Farmacologiche Mario Negri IRCCS (EMBL-EBI Annotare accession E-MTAB-12385 and SRA BioProject PRJEB57792) [26]; (v) 12 STAD primary non-metastatic tumor samples and 5 STAD tumors with peritoneal metastasis from The Fourth Hospital of Hebei Medical University (SRA BioProject PRJNA1220682) [27]. These data were used to define mRNA expression with the use of RSEM-1.3.3 [28]. For the STAD primary tumor samples of the TCGA, we initially applied the classification outlined in [29]. For the CCLE cell lines, we used the classification available in [30] as a starting point for our re-classification procedure. The WHO International Classification of Diseases for Oncology, 3rd Edition (ICD-O-3, https://www.who.int/standards/classifications/other-classifications/international-classification-of-diseases-for-oncology) was downloaded from the cBioPortal (https://www.cbioportal.org/).

Hierarchical clustering and Claudinlow subtype re-classification of the TCGA STAD tumors

We applied a variance-stabilizing transformation (vst) on gene raw count data of the STAD tumors using the DESeq2 R package [31]. Subsequently, we performed hierarchical clustering with our 158-gene signature [24] using SigClust2 (v1.2.4) [32], applying the Ward.D2 linkage and the manhattan metric. The alpha parameter was set to 0.05, and significance at each parent node was assessed using the built-in FWER control procedure before testing daughter nodes.

Differential gene expression and functional enrichment analyses

Differential gene expression analysis of TCGA subtypes was performed using DESeq2 R package. We used expression fold change (log2FC) to denote upregulation or downregulation, while FDR < 0.01 and |log2FC|> 1 were set as inclusion criteria for DEG selection.

We used DAVID [33] (GO_Direct categories) and the Enrichr website (https://maayanlab.cloud/Enrichr) for Gene Ontology (GO) terms and Reactome Pathways enrichment analyses, respectively. The UpSetR package [34] was used to generate the upset plots shown in Fig. 2, while the heatmap was produced with pheatmap (https://github.com/raivokolde/pheatmap). The volcano plot in Supplementary Fig. 2 was created with the EnhancedVolcano package (10.18129/B9.bioc.EnhancedVolcano).

Fig. 2.

Fig. 2

Differentially Expressed Genes in Claudinlow STAD tumors showing a high NF-YAl/NF-YAs expression ratio. a The upper plots show the number of upregulated (Top) and downregulated (Bottom) genes in Claudinlow samples as compared to the other STAD subtypes. In addition, the Panel illustrates each possible intersection among the different comparisons. b The heatmap depicts the enrichment of the Gene Ontology (GO) terms in upregulated (green) and downregulated (red) genes determined by the comparisons shown in (a)

Prediction of STAD tumors stromal and immune cells infiltration

We assessed the Immune and Stromal scores of STAD primary tumors using the tidyestimate package (version 1.1.1, https://github.com/KaiAragaki/tidyestimate), starting from gene-level expression data in the form of log2(TPM + 1).

Analysis of STAD survival data

We retrieved Overall and Progression-Free Survival (OS and PFS, respectively) time records of STAD patients from the http://xena.ucsc.edu/ web page [35]. We stratified all the tumors for which PFS records were available according to the novel classification proposed here. Survival analysis was performed according to the Kaplan–Meier analysis and log-rank test.

Re-classification of the STAD cell-lines

We first extended the original classification of the cell lines [30] to all the available studies using DeepCC [36], run with the log2(TPM + 1) value as input. Next, we conducted ssGSEA (single sample Gene Set Enrichment Analysis) [37] employing our 158-gene signature. We classified cell lines showing a z-score normalized enrichment > 1 as Claudinlow.

Cell cultures

Human gastric cell lines were cultured under standard ATCC or Riken Cell Bank conditions. HS746T, LMSU, MKN7, NCIN87, MKN45, and AGS cells were grown in RPMI (Euroclone ECB9006L) supplemented with FBS (Euroclone ECS0165L) at 10%. All cell line media were supplemented with 1 mM L-glutamine (Euroclone ECB3000D) and 100 μg/mL penicillin/100 μg/mL streptomycin (Euroclone ECB3001D).

RNA extraction and retro-transcription

RNA was isolated using the Tryzol (Sigma Aldrich T9424) protocol. Pellets (3 × 106 cells) were resuspended and lysed in 1 mL of Tryzol. After lysis, 200 µL of chloroform was added and samples were vortexed for 15″. Following a 5’ incubation at room temperature, samples were centrifuged at 12,000 × g at 4 °C for 15’, isopropanol (500 μL) was added to the RNA-containing aqueous phase and the tubes were inverted gently. After centrifugation as above, the supernatant was removed and the RNA pellet was washed by the addition of 70% RNAse-free ethanol (500 μL). RNA pellets were resuspended in DEPC-treated water. We quantified the concentration of RNA with Nanodrop and we determined RNA integrity by agarose gel electrophoresis. As for retro-transcription, total RNA (1 μg) was reverse transcribed using M-MLV SuperMIX 5X + M-MLV mix (Genespin STS-MSX) according to the Manufacturer’s instructions. The cDNAs were diluted 1:5 in TE (Tris 10 mM pH 7.5, EDTA 1 mM).

qRT-PCR and western blot analyses

Real-time PCR was performed using the BIO-RAD CFX ConnectTM RT-PCR system with oligonucleotides designed to amplify 100–200 bp fragments. The housekeeping gene Rps20 was used to normalize the expression data. The relative sample enrichment was calculated with the formula 2–(ΔΔCt), where ΔΔCt = [(Ct sample – Ct Rps20)x − (Ct sample – Ct Rps20)y], x = sample and y = sample control. The set of primers used is described in Supplementary Table 1.

For the preparation of whole-cell extracts, cells were resuspended in RIPA buffer (10 mM TrisHCl pH 8.0, 1 mM EDTA, 0.5 mM EGTA, 0.1% SDS, 0.1% sodium deoxycholate, 140 mM NaCl, 1% Triton X-100, 1 mM PMSF, Protease inhibitor cocktail) and incubated for 30’ on ice. Samples were centrifuged at 13,000 rpm for 10 min at 4 °C, and the supernatant was recovered and quantified using the Bradford protein assay. 30 µg of extracts were loaded on a 12% SDS–polyacrylamide gel and analyzed by Western blot with anti-Vinculin (Sigma-Aldrich 05–386), anti-β-tubulin (Sigma-Aldrich T5293), anti-NF-YA G2 (Santa Cruz Biotech., sc-17753), anti-CDH1 (BD Biosciences., 610,181), and anti-Vimentin (Cell Signaling., D21H3) primary antibodies. Secondary antibodies were peroxidase-conjugated anti-mouse (A4416 Sigma-Aldrich) and anti-rabbit (A6154 Sigma-Aldrich). Detection was performed with ChemiDoc (Bio-Rad) and exported using the Image-Lab software (Bio-Rad).

Results

A novel classification of Claudinlow STADs with the use of the 158-gene signature based on the NF-YAl/NF-YAs expression ratio

We previously classified all STAD tumors in the TCGA database according to the 4 TCGA and ACRG subtypes [29] and complemented this with a fifth Claudinlow subtype defined on the basis of the 24-gene transcriptomic signature described by Nishijima [18].

The number of Claudinlow tumors (Claudinlow: 79) and the other 4 STAD tumor subtypes (EMT:61; MSI:82; MSS/TP53:104; MSS/TP53+:73) are comparable [24]. As expected, the predicted Claudinlow tumors express significantly lower amounts of CLDN7 relative to the MSI, MSS/TP53, and MSS/TP53+ (Fig. S1). A similar situation is observed when CLDN4 levels in Claudinlow, MSI, and MSS/TP53 tumors are considered. Surprisingly, no statistically significant difference in the levels of CLDN3, CLDN4, and CLDN7 mRNAs is detected in Claudinlow and EMT tumors. In addition, Claudinlow, EMT, MSI, MSS/TP53, and MSS/TP53+ tumors show no difference in CLDN3. Overall, our data indicate that the Claudinlow denomination applied to the subgroup of STAD tumors defined by the 24-gene signature is incorrect, as it does not reflect the CLDN3, CLDN4, and CLDN7 expression levels appropriately.

Interestingly, our transcriptomic analyses indicate that the so-called Claudinlow subgroup can be separated from all the other STAD tumor subtypes with the use of the NF-YAl/NF-YAs expression ratio. Indeed, the NF-YAl/NF-YAs median values are significantly higher in Claudinlow tumors relative to the EMT, MSI, MSS/TP53, and MSS/TP53+ counterparts (Fig. S1). On the basis of this observation, we used the 158-gene signature [24], derived by WGCNA (Weighted Gene Co-expression Network Analysis), as associated with a high threshold value of the NF-YAl/NF-YAs ratio in both BRCA and STAD tumors (Supplementary Table 2). A hierarchical clustering of STAD samples based on the 158-gene signature permits a novel definition of the Claudinlow subset, reducing the original number of Claudinlow tumors from 79 to 56 (Supplementary Table 3 and Fig. 1a). According to the ACRG classification, most of the 79 Claudinlow tumors based on the Nishijima classification originate from the EMT subgroup, while a minor portion derives from other subgroups (Fig. 1b). By contrast, the new classification indicates that all newly identified 56 Claudinlow tumors originate from the EMT subgroup of STADs (Fig. 1b). Figure 1c shows the expression boxplots of the mRNAs coding for CLDN3, CLDN4, CLDN7, NF-YAl, NF-YAs, and NF-YAl/NF-YAs values in our panel of STADs classified according to the 158-gene signature. As expected, Claudinlow tumors show significantly lower levels of the CLDN3, CLDN4, and CLDN7 mRNAs relative to the other subgroups. The exception is represented by CLDN3, whose average expression levels are the same in Claudinlow and EMT tumors. In addition, the Claudinlow specimens present the expected increase of the NF-YAl/NF-YAs ratio, due to a statistically significant increase in NF-YAl levels and a corresponding decrease in NF-YAs (Fig. 1c). Looking at WHO ICD-O-3 categories (International Classification of Diseases for Oncology, 3rd Edition, https://www.who.int/standards/classifications/other-classifications/international-classification-of-diseases-for-oncology), we noticed that Claudinlow tumors were significantly enriched in the mucinous adenocarcinoma fractions (n = 6 out of 18 total, Fisher exact test p value = 1.12 × 10–2), while including a smaller number of tubular adenocarcinomas (n = 2 out of 71, p value = 2.29 × 10–3) (Fig. 1d). Mucinous adenocarcinomas are characterized by an increased invasion capacity, and they are localized in the upper part of the stomach [38].

Fig. 1.

Fig. 1

Re-definition of the Claudinlow subtype of the TCGA STAD tumors. a SigClust2 hierarchical clustering of all STAD tumors of TCGA according to our 158-gene signature. Top: heatmap depicting the expression of signature genes within the tumor groups defined by hierarchical clustering. The proposed Claudinlow node is highlighted by a black square. Bottom: the dotplot shows an aggregate view of the signature-genes expression in STAD tumors, as calculated by the median z-score. b Proportion of samples included in the previous (n = 79, Left) and current (n = 56, Right) Claudinlow groups according to their ACRG classification. c Box plots showing the CLDN3, CLDN4, CLDN7, NF-YAl and NF-YAs expression levels as well as the NF-YAl/NF-YAs expression-ratio values of the STAD tumors in the TCGA dataset. The values are shown in CCLE (Transcripts Per Million). Tumor types are defined according to the new molecular classification. d Barplot showing ACRG + Claudinlow subtypes distribution into WHO ICD-O-3 categories for gastric cancer, expressed as percentages of total samples within each subtype. NOS = not otherwise specified

In conclusion, our results support the appropriateness of the 158-gene signature to identify the Claudinlow subset of gastric cancers.

Patterns of genes differentially expressed in the Claudinlow subset of gastric tumors

We used RNA-seq data from the TCGA database to identify DEGs (Differentially Expressed Genes) in the Claudinlow subset of STADs with a “pairwise-comparisons” approach (Supplementary Table 4). The numbers of upregulated and downregulated DEGs determined in each comparison of subsets are shown in Fig. 2a. The minimum number of exclusive DEGs is observed in the Claudinlow/EMT match (1189 upregulated and 1031 downregulated genes). The other Claudinlow matches show a higher number of exclusive DEGs (> 2000 upregulated genes and > 1491 downregulated genes). The upregulated and downregulated DEGs emerging from each comparison can be grouped under several GO (Gene Ontology) terms whose enrichment is characterized by a different degree of significance (Fig. 2b). As for the down-regulated DEGs, the most significant enrichment values are observed for terms describing various mitotic processes. It is worthwhile mentioning that all the comparisons show a significant enrichment of downregulated genes involved in developmental processes, such as establishment of skin barrier, keratinization, epidermis development and epithelial cell differentiation. This is particularly relevant for the comparisons with the MSS/TP53 and MSS/TP53+ subtypes, which are characterized by an epithelial phenotype. As for the upregulated DEGs, the enrichment in genes participating in processes such as muscle contraction, angiogenesis and cell–cell adhesion, which are typical of cells with a mesenchymal phenotype, is of major interest: these processes are particularly enriched in the comparison between the Claudinlow and the EMT tumor subtypes.

To better define differences of the 5 subgroups in TCGA, and verify partitioning of the EMT and Claudinlow clusters, we performed Principal Component Analysis (PCA), and the results are shown in Fig. S2a: while the epithelial subgroups are largely clustered together, EMT samples are distinct, and Claudinlow samples are further separated. Fig. S2b shows a Volcano plot of the differences in DEG between EMT and Claudinlow samples, with several genes showing a clear differential expression between the two subgroups. This is reflected in the Gene Ontology and Reactome Pathways plots of up- and down-regulated genes in the two cohorts (Fig. S2c), showing highly enriched terms of cell adhesion and muscle contraction in Claudinlow, and cell division and mitotic cell cycle in EMT.

Use of the 158-gene signature to classify additional STAD cases

We wished to verify the TCGA results using two additional RNA-seq datasets, PRJNA764173 of 231 and PRJNA1119255 of 60 patients, both of Asian origin (Fig. 3a). Exploiting the 158-gene signature, hierarchical clustering of the 291 samples yielded a Claudinlow node with 35 samples (Fig. 3b). We remark that the incidence of this cluster −12% in these datasets is similar to the Claudinlow cluster in TCGA – 13%. These samples show low expression of Claudins and of the epithelial marker E-Cadherin, high levels of NF-YAl and NF-YAl/NF-YAs ratio, as well as of the mesenchymal marker Vimentin (Fig. 3c). Finally, we used our 158-gene signature to characterize 13 Italian patients (PRJEB43867) [25]: median z-score values are positive in 7 cases, negative in 4 cases, and are close to 0 in 2 cases (GC15 and GC6) (Fig. S3a). The 7 cases characterized by a positive z-score value and GC15 present with high levels of NF-YAl and low levels of NF-YAs, which results in a high NF-YAl/NF-YAs value (Fig. S3a). With the exception of CLDN3 in GC15, the expression of CLDN3, CLDN4, and CLDN7 is low in all these samples (Fig. S3b). Altogether, these results confirm the presence of a subgroup of STAD that has low expression of Claudin-3/-4/-7.

Fig. 3.

Fig. 3

Claudinlow classification in additional primary tumor datasets. a Pie chart of the gastric cancer samples collected from two independent sources, corresponding to the SRA accessions PRJNA764173 and PRJNA1119255. b SigClust2 hierarchical clustering of primary STAD samples detailed in a according to our 158-gene signature. Top: heatmap showing the expression of signature genes across tumor groups identified by hierarchical clustering, with the proposed Claudinlow node outlined in black. Bottom: Dot plot summarizing the expression of signature genes in STAD tumors, represented by the median z-score. c Box plots display the expression levels of CLDN3, CLDN4, CLDN7, NF-YAl, and NF-YAs, along with the NF-YAl/NF-YAs expression ratio, the correlation with the 158-gene signature, and the marker genes CDH1 and VIM in STAD tumors from the independent datasets PRJNA764173 and PRJNA1119255. Expression values are reported in TPM (Transcripts Per Million), while correlations with the 158-gene signature are represented as median z-scores

Claudinlow samples features unique tumor-stromal interactions

In BRCA, Claudinlow tumors were initially described as exhibiting a pronounced immune and stromal cell infiltration [39]. We decided to assess tumor microenvironment composition with the ESTIMATE algorithm, which employs single-sample Gene Set Enrichment Analysis (ssGSEA) to assign an Immune Score and Stromal Score to each tumor sample based on RNA-seq gene expression [40]. Applying this strategy, we discovered that TCGA STAD Claudinlow tumors were characterized by increased Immune and Stromal Scores compared to epithelial subtypes, while EMT showed intermediate values in the two metrics (Fig. 4a).

Fig. 4.

Fig. 4

Claudinlow gastric tumors show distinct microenvironments. a Immune score and stromal score estimated for each gastric cancer subtype. Claudinlow and EMT tumors display the highest immune and stromal infiltration scores. b Comparison of Claudinlow tumors with the broader gastric cancer cohort from the PRJNA764173 and PRJNA1119255 datasets, highlighting elevated stromal scores in Claudinlow tumors relative to the general gastric cancer population. Error bars represent standard errors of the mean

In the second and third dataset of primary gastric cancers, Claudinlow classified samples had a higher Stromal Score than other samples, but a lower Immune Score (Fig. 4b). Together, these results suggest that Claudinlow gastric tumors share with their breast cancer counterparts a distinctive microenvironmental profile marked by elevated stromal content, while immune infiltration appears more variable across datasets.

Insights into the prognostic value of the newly identified subgroup of Claudinlow STADs with a high NF-YAl/NF-YAs expression ratio

To obtain insights into the progression/mortality rates of the 56 Claudinlow tumors and the other subgroups, we performed an evaluation of the clinical data available in the TCGA database. In particular, we compared the PFS (Progression-Free Survival) and the OS (Overall-Survival) curves of the Claudinlow subgroup of patients with the EMT, MSI, MSS/TP53, and MSS/TP53+. The Kaplan–Meier PFS curves demonstrate that there are no statistically significant differences, whereas the OS curves indicate that Claudinlow shows a significantly lower survival rate/probability than MSS/TP53+ and MSI patients (Fig. 5). On the other hand, there is a lack of significant differences between Claudinlow and EMT cases, both marked by “mesenchymal” phenotypes. Thus, we conclude that the prognoses of Claudinlow and EMT tumors are the worst. Unfortunately, there are no clinical data available for the 231 and 60 patients of PRJNA764173 and PRJNA1119255 classified above. To investigate further the markers above, specifically in Grade IV tumors, we interrogated an available RNA-seq dataset (PRJNA1220682) of gastric cancer with or without peritoneal metastasis. The paucity of the samples precludes the definition of clustering as in Fig. 1 and Fig. 3, but the gene expression data of Fig. 6 shows that, bar one exception, gastric cancers that produced a distant metastasis—the hallmark of stage IV cancer—have lower Claudins and E-Cadherin levels, higher NF-YAl, NF-YAl/NF-Ys ratios, and Vimentin.

Fig. 5.

Fig. 5

Clinical outcome of the Claudinlow STAD tumors showing a high NF-YAl/NF-YAs expression-ratio. The Figure shows the Kaplan–Meier survival curves of STAD patients. The Progression Free Survival values (Left) and Overall Survival values (Right) across the STAD molecular subtypes are illustrated. The p-values were determined using the log-rank test

Fig. 6.

Fig. 6

The 158-gene signature strongly correlates with gastric cancer that originated peritoneal metastasis. Bar plots show expression of CLDN3, CLDN4, CLDN7, NF-YAl, and NF-YAs, the NF-YAl/NF-YAs expression ratio, the 158-gene signature score, and the epithelial and mesenchymal markers CDH1 and VIM in individual samples of gastric cancer with or without peritoneal metastasis. Expression data are presented as Transcripts Per Million (TPM), while 158-gene signature values are expressed as median z-scores

A novel classification of the STAD cell lines

The bulk RNA-seq generated from the TCGA, 291 stomach-tumors dataset and our cohort of Italian patients define genes expressed not only in the neoplastic cells, but also in tumor-associated immune, stromal, and endothelial cells. Thus, it is important to perform the same type of analyses on STAD cell lines of the CCLE portal for which RNA-seq data are available.

A first classification of the cell lines, based on the ACRG subtypes, was provided by Lee et al. [30]. However, as illustrated in Fig. 7a, this classification leaves 21 cell lines as unclassified, and it does not include the Claudinlow subtype. To obtain a more inclusive classification, we employ the DeepCC deep-learning tool [36] with the Lee et al. categories serving as the training set. Thereafter, we use the ssGSEA (single-sample Gene Set Enrichment Analysis) platform and our 158-gene signature to identify the Claudinlow subgroup, following the ranking of the cell lines according to their z-score values. With the exception of 3 cell lines (RERF-GC1B, NUGC3, and TGBC11TKB), all other lines are classified in specific ACRG subtypes (Fig. 7a). Given the >1 ssGSEA z-score values, 9 STAD cell lines classify as Claudinlow (Supplementary Table 5). With the exception of NUGC2 (MSS/TP53) and RERF-GC1B (unclassified), these Claudinlow cell lines derive from the EMT subgroup gathered from the DeepCC analyses.

Fig. 7.

Fig. 7

A novel classification and characterization of STAD cell lines based on our 158-gene signature and the expression of the CLDN3/CLDN4/CLDN7 and NF-YAl/NF-YAs mRNAs. a The left heatmap shows: (i) the original classification (first column) of the STAD cell lines available in the CCLE dataset cell lines [30]; (ii) a second classification of the STAD cell lines obtained with the application of the DeepCC deep learning tool (second column); (iii) the classification of the STAD cell lines generated following integration with ssGSEA data (third column). To define the Claudinlow cell lines we consider a ssGSEA z-score value > 1. The right panel shows barplots indicating the TPM (Transcripts Per Million) expression levels of the CLDN3/CLDN4/CLDN7, VIM (vimentin), CDH1 (E-Cadherin), NF-YAl and NF-YAs mRNAs. The rightmost barplot shows the NF-YAl/NF-YAs expression ratio values. b The panel illustrates qRT-PCR determination of CLDN3/ CLDN4/CLDN7/VIM/CDH1/NF-YAl/NF-YAs mRNA expression levels in HS746T, LMSU, MKN7, NCIN87, MKN45 and AGS cell lines. The data are expressed as the MEAN ± SD of the values determined. c. Western Blot analysis showing the amounts of the NF-YAl, NF-YAs, VIM and CDH1 proteins in the HS746T, LMSU, MKN7, NCIN87, MKN45 and AGS cell lines

In parallel, we evaluated the expression levels of the mRNAs encoding CLDN3/CLDN4/CLDN7, the EMT marker VIM (Vimentin), the epithelial marker CDH1 (Cadherin-1) and the NF-YAl/NF-YAs (Fig. 7a). As expected, the Claudinlow cell lines are characterized by low/undetectable levels of the 3 claudins. The only exception is represented by the NUGC2, HS746T and RERF-GC1B cell lines, which express significant amounts of the CLDN4 transcript. The EMT and MSI cell lines characterized by positive z-score values show low CLDN3, CLDN4 and CLDN7 levels. By converse, the MSS/TP53 and MSS/TP53+ cell lines, with the exception of FU97, present with negative z-score values and high levels of the 3 claudins. The VIM mRNA is expressed in 7 of the 9 Claudinlow, 2 of the 5 EMT and in 1 of the 5 MSI cell lines. The CDH1 transcript is expressed in 2 Claudinlow cell lines presenting with undetectable VIM mRNA levels (NUGC2/RERFGC-1B), one EMT (MKN7) and one MSI (NCC59) cell line (Fig. 7a). As expected, the majority of the cell lines characterized by an “epithelial” phenotype express large amount of CDH1 mRNA. Finally, the expression of NF-YAl and NF-YAs is associated with two separate “high-to-low” and “low-to-high” TPM (Transcripts per Millions) gradients in all the STAD cell lines, with the exception of the MSI SNU1 cell line.

We validated the RNA-seq data by performing qRT-PCR studies to measure the amounts of CLDN3, CLDN4, CLDN7, VIM, CDH1, NF-YAl, and NF-YAs in several STAD cell lines (Fig. 7b). With the exception of CLDN4 in the HS746T cell line, the qRT-PCR data indicate that the selected Claudinlow cell lines lack CLDN3, CLDN4, and CLDN7 expression. In line with their epithelial characteristics, the MSS/TP53NCIN87 and MKN45 cell lines express CLDN3, CLDN4, and CLDN7, while the MSS/TP53+ AGS cell line expresses only CLDN4 and CLDN7. Surprisingly, high CLDN3, CLDN4, and CLDN7 expression levels are also observed in the EMT MKN7 cell line. As for VIM, CDH1, NF-YAl, and NF-YAs, the qRT-PCR and the RNA-seq data are entirely consistent. Overall, the qRT-PCR results confirm the RNA-seq analyses, with a single inconsistency represented by CLDN3 expression in the MKN7 cell line.

The intra-cellular expression of a specific mRNA does not necessarily translate into the production of the encoded polypeptide. We address this point in the case of NF-YAl, NF-YAs, VIM, and CDH1 transcripts, as the presence/absence of the corresponding protein may be of functional importance for the homeostasis of the neoplastic cell. Thus, we determined the levels of the NF-YAl, NF-YAs, VIM, and CDH1 proteins in the STAD cell lines used for the qRT-PCR validation experiments (Fig. 7c). The results obtained indicate that the amounts of the NF-YAl, NF-YAs, VIM, and CDH1 proteins determined in each cell line reflect the expression levels of the corresponding transcripts in our qRT-PCR. Specifically, both the qRT-PCR and RNA-Seq data indicate that CDH1 is absent in the AGS cell line, despite its “epithelial” classification. On the other hand, we note that the only cell line classified as EMT expressing CDH1 is the MKN7 that we use here. We conclude that Western Blot Analysis is in line with the qRT-PCR shown above.

Discussion

Our work confirms the existence of a Claudinlow subgroup of gastric cancers and provides a better definition of this tumor type. In a previous report [29], we used the collective expression of CLDN3, CLDN4, and CLDN7 to identify signature genes capable of identifying specific subcategories of epithelial cancers, with particular reference to STAD.

In BRCA, the notion that Claudinlow tumors represent a distinct subgroup was challenged, as some authors proposed to consider the low levels of Claudins as a secondary feature of “classic” BRCA subtypes [22]. In BRCA IHC studies, the data gathered with the use of Claudins as tumor markers posed problems in terms of consistency, coordinated expression, and staining of the proteins located inside or outside the cell [13]. This prompted us to evaluate CLDN3, CLDN4, and CLDN7 expression in the 79 STAD tumors of the TCGA database that we previously classified as Claudinlow according to the Nishijima signature. The expression differences of the CLDN3, CLDN4, and CLDN7 genes between the Claudinlow and the other STAD subtypes were minor and statistically insignificant (Fig. S1).

For this reason, a more precise definition of the Claudinlow STADs was necessary. This is now obtained with the use of our 158-gene signature common to STAD and BRCA tumors [23]. Application of this signature in hierarchical clustering of the TCGA samples captures a set of 56 Claudinlow cancers, which originates from the EMT subset of the ACRG stomach tumors. Relative to the other subgroups, including EMT, the identified Claudinlow set does show statistically reduced levels of CLDN3, CLDN4, and CLDN7.

In addition, our 158-gene signature identifies the subset in three databases of 231, 60, and 13 STAD tumors.

While gene expression differences between the “epithelial” subtypes, Claudinlow and EMT are quite substantial, they are more subtle between these two latter subtypes. Yet, statistically robust terms emerged in GO and Reactome pathways analysis comparing EMT and Claudinlow. Notably, cell adhesion prevails in Claudinlow, and proliferative terms, particularly linked to mitosis, in EMT: this could suggest a further acquisition of migratory properties of the formers.

We used the AI-based DeepCC tool and the 158-gene signature to obtain a classification of the 50 STAD cell lines present in CCLE into one of the 5 subgroups identified with the approaches discussed above. The results were validated by wet experiments performed on 6 selected cell lines. Overall, our data confirm a previous classification based on 29 cell lines [30], with the sole exception of the SNU719 cell line, which we classify as MSS/TP53+ instead of EBV. In addition, our results provide a specific classification of 19 of the previously unclassified 21 cell lines. Finally, they permit the identification of a group of 9 Claudinlow cell lines previously classified as EMT. It is worth mentioning that while the expression of CLDN4 and CLDN7 are generally concordant, the expression of CLDN3 is not always coherent in the TCGA (Fig. 1c), PRJEB43867 (Fig. S3 sample GC15), and CCLE STAD cell lines (Fig. 7). This notion could be considered in future IHC studies of STAD samples.

In conclusion, our data support the presence of an aggressive subset of STAD tumors, which can be correctly denominated as Claudinlow on the basis of CLDN3, CLDN4, and CLDN7 expression. The Claudinlow and the EMT subgroups of STADs are separated from each other. It remains to be established whether the 158-gene signature represents an equally useful tool for the definition of the controversial Claudinlow subtype in BRCA and in BLCA tumors. With respect to this, it should be emphasized that the fraction of Claudinlow tumors is more numerous in the STAD (12/13%) than the BRCA cases of the TCGA dataset (3%, data not shown), which suggests a greater relevance of this phenotype in the former than the latter cancer type.

Supplementary Information

Below is the link to the electronic supplementary material.

10120_2025_1671_MOESM6_ESM.png (994.1KB, png)

Supplementary file6 (PNG 994 KB) Fig S1. CLDN3, CLDN4, CLDN7, NFYAl and NFYAs expression in STAD tumors. Box plots showing the CLDN3, CLDN4, CLDN7, NFYAl and NFYAs expression levels, as well as the NFYAl/NFYAs expression ratio values of the STAD tumors available in the TCGA dataset. The values are shown in CCLE (Transcripts Per Million). Tumor types are defined according to the molecular classification proposed by Gallo et al [26]. The p-values are calculated using the Wilcoxon rank-sum test

10120_2025_1671_MOESM7_ESM.png (4MB, png)

Supplementary file7 (PNG 4082 KB) Fig S2. Differential Gene Expression between Claudinlow and EMT subtypes of STAD tumors. a. Principal Component Analysis (PCA) of transcriptome profiles showing separation among Claudinlow, EMT, MSI, MSS:TP53⁻, MSS:TP53⁺, and normal adjacent gastric tissue samples. b. Volcano plot displaying differentially expressed genes between Claudinlow and EMT subtypes. Significantly upregulated (right) or downregulated (left) genes in Claudinlow are shown in red, while non-significant genes are in black. Dashed vertical lines indicate the filtering thresholds: |log₂FC| > 1 and -log10FC > 2. c. Functional enrichment analysis of differentially expressed genes depicted in b. Gene Ontology (left) and Reactome Pathways (right) terms are shown for genes upregulated (top panels, green) and downregulated (bottom panels, red) in Claudinlow compared with EMT tumors

10120_2025_1671_MOESM8_ESM.png (2.6MB, png)

Supplementary file8 (PNG 2659 KB) Fig S3. Expression of the CLDN3/CLDN4/CLDN7 and NF-YAl/NF-YAs mRNAs in an Italian cohort of STAD patients. a. The leftmost barplot illustrates the correlation level of the 158-gene signature in the tumor tissues of the13 Italian gastric-cancer (GC) patients belonging to the PRJEB43867 cohort. The correlation level is measured using the median z-score values of the signature genes. The other 3 barplots show the NF-YAl and NF-YAs expression levels as well as the NF-YAl/NF-YAs expression ratio values. The NF-YAl and NF-YAs values are shown in CCLE. b. The Panel shows the CLDN3/CLDN4/CLDN7 mRNA expression levels in the tumor tissues. GC samples are classified as G-DIFF and G-INT, as originally reported [26]

Author contributions

AG, MR, MBC, SP contributed to the production of data. AG, MR, MBC, DD designed the study and assisted in report preparation. RM and DD supervised the study and wrote the manuscript. EG contributed to the manuscript writing. All authors have read and approved the final manuscript.

Funding

Open access funding provided by Università degli Studi di Milano within the CRUI-CARE Agreement. The research leading to these results was supported by MUR under PNRR M4C2I1.3 Heal Italia project PE00000019 CUP B43D22000710006 to EG, by PNRR M4 C2 Investimento 1.4, Progetto CN3 RNA, CN00000041, SPOKE n.2, funded by the European Union -NextGenerationEU– NextGenerationEU (CUP: G43C22001320007) to R.M. and from Università degli Studi di Milano (PSR-Linea2) to D.D.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Siegel R, Naishadham D, Jemal A. Cancer statistics, 2012. CA Cancer J Clin. 2012;62:10–29. [DOI] [PubMed] [Google Scholar]
  • 2.Lauren P. The two histological main types of gastric carcinoma: diffuse and so-called intestinal-type carcinoma. An attempt at a histo-clinical classification. Acta Pathol Microbiol Scand. 1965;64:31–49. [DOI] [PubMed] [Google Scholar]
  • 3.Hartgrink HH, Jansen EPM, van Grieken NCT, van de Velde CJH. Gastric cancer. Lancet. 2009;374:477–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kim B, Bang S, Lee S, Kim S, Jung Y, Lee C, et al. Expression profiling and subtype-specific expression of stomach cancer. Cancer Res. 2003;63:8248–55. [PubMed] [Google Scholar]
  • 5.Jinawath N, Furukawa Y, Hasegawa S, Li M, Tsunoda T, Satoh S, et al. Comparison of gene-expression profiles between diffuse- and intestinal-type gastric cancers using a genome-wide cDNA microarray. Oncogene. 2004;23:6830–44. [DOI] [PubMed] [Google Scholar]
  • 6.Lee Y-S, Cho YS, Lee GK, Lee S, Kim Y-W, Jho S, et al. Genomic profile analysis of diffuse-type gastric cancers. Genome Biol. 2014;15:R55–R55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tanabe S, Aoyagi K, Yokozaki H, Sasaki H. Gene expression signatures for identifying diffuse-type gastric cancer associated with epithelial-mesenchymal transition. Int J Oncol. 2014;44:1955–70. [DOI] [PubMed] [Google Scholar]
  • 8.Bass AJ, Thorsson V, Shmulevich I, Reynolds SM, Miller M, Bernard B, et al. Comprehensive molecular characterization of gastric adenocarcinoma. Nature. 2014;513:202–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chia N-Y, Tan P. Molecular classification of gastric cancer. Ann Oncol. 2016;27:763–9. [DOI] [PubMed] [Google Scholar]
  • 10.Min L, Zhao Y, Zhu S, Qiu X, Cheng R, Xing J, et al. Integrated analysis identifies molecular signatures and specific prognostic factors for different gastric cancer subtypes. Transl Oncol. 2017;10:99–107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Battaglin F, Naseem M, Puccini A, Lenz H-J. Molecular biomarkers in gastro-esophageal cancer: recent developments, current trends and future directions. Cancer Cell Int. 2018;18:99–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cristescu R, Lee J, Nebozhyn M, Kim K-M, Ting JC, Wong SS, et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat Med. 2015;21:449–56. [DOI] [PubMed] [Google Scholar]
  • 13.Popova OP, Kuznetsova AV, Bogomazova SY, Ivanov AA. Claudins as biomarkers of differential diagnosis and prognosis of tumors. J Cancer Res Clin Oncol. 2021;147:2803–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pan C, Xu A, Ma X, Yao Y, Zhao Y, Wang C, et al. Research progress of claudin-low breast cancer. Front Oncol. 2023;13:1226118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Prat A, Parker JS, Karginova O, Fan C, Livasy C, Herschkowitz JI, et al. Phenotypic and molecular characterization of the claudin-low intrinsic subtype of breast cancer. Breast Cancer Res. 2010;12:R68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Fougner C, Bergholtz H, Kuiper R, Norum JH, Sørlie T. Claudin-low-like mouse mammary tumors show distinct transcriptomic patterns uncoupled from genomic drivers. Breast Cancer Res. 2019;21:85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kwon MJ, Kim S-H, Jeong HM, Jung HS, Kim S-S, Lee JE, et al. Claudin-4 overexpression is associated with epigenetic derepression in gastric carcinoma. Lab Invest. 2011;91:1652–67. [DOI] [PubMed] [Google Scholar]
  • 18.Nishijima TF, Kardos J, Chai S, Smith CC, Bortone DS, Selitsky SR, et al. Molecular and clinical characterization of a claudin-low subtype of gastric cancer. JCO Precis Oncol. 2017;1:1–10. [DOI] [PubMed] [Google Scholar]
  • 19.Kardos J, Chai S, Mose LE, Selitsky SR, Krishnan B, Saito R, et al. Claudin-low bladder tumors are immune infiltrated and actively immune suppressed. JCI Insight. 2016;1:e85902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Li Y, Yang K, Li K, Liu H, Zhao S, Jiao M, et al. Clinical and molecular characteristics of bladder urothelial carcinoma subtypes. J Cell Biochem. 2019;120:9956–63. [DOI] [PubMed] [Google Scholar]
  • 21.Rädler PD, Wehde BL, Triplett AA, Shrestha H, Shepherd JH, Pfefferle AD, et al. Highly metastatic claudin-low mammary cancers can originate from luminal epithelial cells. Nat Commun. 2021;12:3742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Fougner C, Bergholtz H, Norum JH, Sørlie T. Re-definition of claudin-low as a breast cancer phenotype. Nat Commun. 2020;11:1787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Dolfini D, Gnesutta N, Mantovani R. Expression and function of NF-Y subunits in cancer. Biochim Biophys Acta Rev Cancer. 2024;1879:189082. [DOI] [PubMed] [Google Scholar]
  • 24.Londero M, Gallo A, Cattaneo C, Ghilardi A, Ronzio M, Del Giacco L, et al. NF-YAl drives EMT in claudinlow tumours. Cell Death Dis. 2023;14:65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Han B, Fang T, Zhang Y, Zhang Y, Gao J, Xue Y. Association of the TGFβ gene family with microenvironmental features of gastric cancer and prediction of response to immunotherapy. Front Oncol. 2022;12:920599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Guarrera L, Kurosaki M, Garattini S-K, Gianni M, Fasola G, Rossit L, et al. Anti-tumor activity of all-trans retinoic acid in gastric-cancer: gene-networks and molecular mechanisms. J Exp Clin Cancer Res. 2023;42:298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ding P, Wu H, Wu J, Li T, Gu R, Zhang L, et al. Non-invasive liquid biopsy based on transcriptomic profiling for early diagnosis of occult peritoneal metastases in locally advanced gastric cancer. NPJ Precis Oncol. 2025;9:109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinform. 2011;12:323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gallo A, Ronzio M, Bezzecchi E, Mantovani R, Dolfini D. NF-y subunits overexpression in gastric adenocarcinomas (STAD). Sci Rep. 2021;11:23764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lee J, Kim H, Lee JE, Shin S-J, Oh S, Kwon G, et al. Selective cytotoxicity of the NAMPT inhibitor FK866 toward gastric cancer cells with markers of the epithelial-mesenchymal transition, due to loss of NAPRT. Gastroenterology. 2018;155:799-814.e13. [DOI] [PubMed] [Google Scholar]
  • 31.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550–550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kimes PK, Liu Y, Neil Hayes D, Marron JS. Statistical significance for hierarchical clustering. Biometrics. 2017;73:811–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sherman BT, Hao M, Qiu J, Jiao X, Baseler MW, Lane HC, et al. DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res. 2022;50:W216–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Conway JR, Lex A, Gehlenborg N. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics. 2017;33:2938–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Goldman MJ, Craft B, Hastie M, Repečka K, McDade F, Kamath A, et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol. 2020;38:675–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gao F, Wang W, Tan M, Zhu L, Zhang Y, Fessler E, et al. DeepCC: a novel deep learning-based framework for cancer molecular subtype classification. Oncogenesis. 2019;8:44–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Barbie DA, Tamayo P, Boehm JS, Kim SY, Moody SE, Dunn IF, et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature. 2009;462:108–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Komori K, Kano K, Ando S, Watanabe H, Takahashi K, Maezawa Y, et al. Clinicopathological characteristics and prognosis of mucinous gastric cancer. Anticancer Res. 2023;43:2865–71. [DOI] [PubMed] [Google Scholar]
  • 39.Dias K, Dvorkin-Gheva A, Hallett RM, Wu Y, Hassell J, Pond GR, et al. Claudin-low breast cancer; clinical & pathological characteristics. PLoS ONE. 2017;12:e0168669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013;4:2612. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

10120_2025_1671_MOESM6_ESM.png (994.1KB, png)

Supplementary file6 (PNG 994 KB) Fig S1. CLDN3, CLDN4, CLDN7, NFYAl and NFYAs expression in STAD tumors. Box plots showing the CLDN3, CLDN4, CLDN7, NFYAl and NFYAs expression levels, as well as the NFYAl/NFYAs expression ratio values of the STAD tumors available in the TCGA dataset. The values are shown in CCLE (Transcripts Per Million). Tumor types are defined according to the molecular classification proposed by Gallo et al [26]. The p-values are calculated using the Wilcoxon rank-sum test

10120_2025_1671_MOESM7_ESM.png (4MB, png)

Supplementary file7 (PNG 4082 KB) Fig S2. Differential Gene Expression between Claudinlow and EMT subtypes of STAD tumors. a. Principal Component Analysis (PCA) of transcriptome profiles showing separation among Claudinlow, EMT, MSI, MSS:TP53⁻, MSS:TP53⁺, and normal adjacent gastric tissue samples. b. Volcano plot displaying differentially expressed genes between Claudinlow and EMT subtypes. Significantly upregulated (right) or downregulated (left) genes in Claudinlow are shown in red, while non-significant genes are in black. Dashed vertical lines indicate the filtering thresholds: |log₂FC| > 1 and -log10FC > 2. c. Functional enrichment analysis of differentially expressed genes depicted in b. Gene Ontology (left) and Reactome Pathways (right) terms are shown for genes upregulated (top panels, green) and downregulated (bottom panels, red) in Claudinlow compared with EMT tumors

10120_2025_1671_MOESM8_ESM.png (2.6MB, png)

Supplementary file8 (PNG 2659 KB) Fig S3. Expression of the CLDN3/CLDN4/CLDN7 and NF-YAl/NF-YAs mRNAs in an Italian cohort of STAD patients. a. The leftmost barplot illustrates the correlation level of the 158-gene signature in the tumor tissues of the13 Italian gastric-cancer (GC) patients belonging to the PRJEB43867 cohort. The correlation level is measured using the median z-score values of the signature genes. The other 3 barplots show the NF-YAl and NF-YAs expression levels as well as the NF-YAl/NF-YAs expression ratio values. The NF-YAl and NF-YAs values are shown in CCLE. b. The Panel shows the CLDN3/CLDN4/CLDN7 mRNA expression levels in the tumor tissues. GC samples are classified as G-DIFF and G-INT, as originally reported [26]


Articles from Gastric Cancer are provided here courtesy of Springer

RESOURCES