Abstract
The 3′ untranslated region (3′-UTR) is the vital element regulating gene expression, but most studies have focused on variations in RNA-binding proteins (RBPs), miRNAs, alternative polyadenylation (APA) and RNA modifications. To explore the posttranscriptional function of 3′-UTR somatic mutations in tumorigenesis, we collected whole-genome data from 2413 patients across 18 cancer types. Our updated algorithm, PIVar, revealed 25,216 3′-UTR posttranscriptional impairment-related SNVs (3′-UTR piSNVs) spanning 2930 genes; 24 related RBPs were significantly enriched. The somatic 3′-UTR piSNV ratio was markedly increased across all 18 cancer types, which was associated with worse survival for four cancer types. Several cancer-related genes appeared to facilitate tumorigenesis at the protein and posttranscriptional regulation levels, whereas some 3′-UTR piSNV-affected genes functioned mainly via posttranscriptional mechanisms. Moreover, we assessed immune cell and checkpoint characteristics between the high/low 3′-UTR piSNV ratio groups and predicted 80 compounds associated with the 3′-UTR piSNV-affected gene expression signature. In summary, our study revealed the prevalence and clinical relevance of 3′-UTR piSNVs in cancers, and also demonstrates that in addition to affecting miRNAs, 3′-UTR piSNVs perturb RBPs binding, APA and m6A RNA modification, which emphasized the importance of considering 3′-UTR piSNVs in cancer biology.
Subject terms: Cancer genomics, Prognostic markers, Predictive markers
Introduction
The 3′ untranslated region (3′-UTR) is a posttranscriptional regulatory region that crucially controls gene regulation, and contains many regulatory elements that regulate a variety of mRNA-fate-related processes, such as mRNA processing, mRNA stabilization, translation initiation, and localization1,2. In recent decades, most cancer studies have mainly focused on the variations in miRNA, RNA-binding proteins (RBPs), alternative polyadenylation (APA), and m6A RNA modification on the 3′-UTR of mRNA, but 3′-UTR mutations could also result in mRNA expression changes, and recurrent 3′-UTR mutations in cancer genes have been identified by whole-genome sequencing (WGS) to play vital roles in tumorigenesis2. Therefore, it is crucial to explore the effect of 3′-UTR mutations on pathological processes at the posttranscriptional level.
RBPs are involved in all aspects of RNA regulation, including splicing, modification, mRNA stabilization, translation, subcellular localization, and decay3–5, while the dysregulation of RBPs could systematically disrupt the stable cellular environment6. Aberrant expression of RBPs such as RBM387, HuR8, and eIF2B3,9 is associated with neurodegenerative disorders and cancer progression. Abnormal expression of cancer-related RBPs has been approved to destroy the posttranscriptional regulation network and contributes to tumorigenesis and cancer progression. For example, EZH2 is an immune-related and prognosis-associated RBP in liver cancer10, and LARP1 promotes ovarian cancer tumorigenesis, progression, and chemotherapy resistance11,12.
RBPs regulate gene expression and function by binding to sequence-specific binding motifs in RNA13,14. In cancer cells, there are many accumulated genetic variants that destroy the protein-RNA interactions binding motifs that prevent RBPs from recognizing RNA substrates15. For instance, mutant R521C stabilizes FUS in amyotrophic lateral sclerosis (ALS) patients, facilitating its interaction with RBM45, and decreases the recruitment of HDAC1 to contribute to the pathogenesis of ALS16. In particular, RBPs can bind to 3′-UTR cis-elements to regulate gene expression2. However, it remains unclear how 3′-UTR single nucleotide variants (SNVs) affect RBP-mediated posttranscriptional regulation in human cancers6.
In this study, we comprehensively characterized posttranscriptional regulation by somatic 3′-UTR mutations, which could contribute to tumorigenesis. In total, we characterized over twenty thousand posttranscriptionally impairment-related SNVs (piSNVs) from whole-genome sequencing data (WGS) of 2413 patients across 18 cancer types, via our updated algorithm PIVar. Moreover, we found that somatic 3′-UTR piSNV ratio could be used as a potential prognostic biomarker in various cancers and could assess immune cell and checkpoint characteristics. These 3′-UTR piSNVs could affect genes by mechanisms of posttranscriptional regulation via RBPs, miRNA, APA, and m6A modification, which are involved in many tumor-related pathways to contribute to cancer development3.
Results
Identification and distribution of 3′-UTR piSNVs across cancers
RBPs play vital roles in regulating the mRNA life cycle by binding 3′-UTRs2,3. To evaluate the potential impact of 3′-UTR mutations on posttranscriptional regulation in tumorigenesis, we downloaded somatic mutations derived from WGS data of 2413 patients across 18 cancer types in the PCAWG project17, including biliary adenocarcinoma (Biliary-AdenoCA), osteosarcoma (Bone-Osteosarc), breast adenocarcinoma (Breast-AdenoCa), medulloblastoma (CNS-Medullo), pilocytic astrocytoma (CNS-PiloAstro), esophagus adenocarcinoma (Eso-AdenoCa), kidney renal cell carcinoma (Kidney-RCC), liver hepatocellular carcinoma (Liver-HCC), lymphoid mature B-cell lymphoma (Lymph-BNHL), lymphoid chronic lymphocytic leukemia (Lymph-CLL), myeloid myeloproliferative neoplasm (Myeloid-MPN), ovarian adenocarcinoma (Ovary-AdenoCA), pancreatic adenocarcinoma (Panc-AdenoCA), pancreatic neuroendocrine tumor (Panc-Endocrine), prostate adenocarcinoma (Prost-AdenoCA), skin melanoma (Skin-Melanoma) and stomach adenocarcinoma (Stomach-AdenoCA), as well as our in-house esophageal squamous cell carcinoma (ESCC) WGS samples (663 patients)18, then we employed an updated PIVAR algorithm6 that could identify the variants that disrupt protein-RNA interaction via the alteration of RNA secondary structure and the regulation of gene expression; we identified 25,216 3′-UTR piSNVs among 50,435 piSNVs from 1750 samples of the PCAWG project17 and 663 samples from our in-house ESCC WGS data18 across 18 cancer types (Fig. 1a).
By comparing the genome-wide gene density distribution of identified piSNVs and 3′-UTR piSNVs (orange peaks) in each chromosome, we observed that piSNVs and 3′-UTR piSNVs were clustered in some specific chromosome regions, while there were certain desert regions for piSNVs and 3′-UTR piSNVs (Fig. 1b). Among the genomic regions, 3′-UTR piSNVs were generally more significantly enriched in the 3′-UTR and exon regions in all 18 cancer types (Supplementary Fig. 1a, b), which suggested the SNVs in the 3′-UTR and exon regions have more regulatory functions. We further calculated the ratio of 3′-UTR piSNVs to total 3′-UTR SNVs in each sample across cancer types and found a significantly higher ratio of 3′-UTR piSNVs in pan-cancer samples compared with that of the control samples from the Database of Somatic Mutations in Normal Cells (DSMNC)19 (Fig. 1c). The significant distinction between cancer samples and normal controls revealed the prevalence of posttranscriptionally impairment-related 3′-UTR mutations in cancer genomes, implying their contribution to cancer development.
To further explore the association between 3′-UTR piSNVs and cancers, we downloaded GWAS SNPs data from GWAS Catalog database20 and CCGD-ESCC database21, and we found that there were two 3′-UTR piSNVs, rs63629260 and rs3767, overlapping with GWAS SNPs from GWAS Catalog database; rs63629260 is the 3′-UTR mutation in SPTBN1 and has association with bone mineral density, osteoporosis and fracture22; and rs3767 is the 3′-UTR mutation in ZNF664 and has relation with morphogenesis, organogenesis, adrenal cell renewal, and cancer23. Moreover, there were 121 3′-UTR piSNVs existing in ESCC GWAS data of CCGD-ESCC database (Supplementary Table 1), which suggested that posttranscriptional regulation by somatic 3′-UTR mutations could contribute to tumorigenesis.
Clinical relevance and immune effect of an elevated 3′-UTR piSNV ratio
To assess the correlation between the 3′-UTR piSNV ratio and clinical phenomena, we classified tumor samples according to the 3′-UTR piSNV ratio and performed survival analysis. We found that the overall survival (OS) of patients with four cancer types, including ESCC (Fig. 2a, b), Biliary-AdenoCA (Fig. 2c), Ovary-AdenoCA (Fig. 2d, e), and Stomach-AdenoCA (Fig. 2f), had a significant correlation with the 3′-UTR piSNV ratio. Moreover, the patients with a high 3′-UTR piSNV ratio in each cancer type had a worse survival situation than the patients with a low 3′-UTR piSNV ratio. These results further showed that an increased 3′-UTR piSNV ratio is related to poor clinical outcomes and implied that the 3′-UTR piSNV ratio could function as a potential prognostic index in several types of cancers.
We speculated whether immune microenvironment changes could contribute to survival outcomes. Then, we used the CIBERSORT24 algorithm to compute the relative abundance of 22 types of immune cells (Supplementary Fig. 2a–d). Since the samples for WGS and RNA-seq were unpaired, we finally selected ESCC and Liver-HCC datasets, which had enough samples in both the high and low 3′-UTR piSNV groups, to calculate immune cell abundance. Overall, there was an obvious difference in immune cell enrichment between the high- and low-3′-UTR piSNV ratio groups. Notably, the level of resting NK cells was significantly higher in the high-3′-UTR piSNV ratio group in ESCC (Supplementary Fig. 2a, b); however, the low-3′-UTR piSNV ratio group of the Liver-HCC cohort showed significantly higher levels of resting NK cells (Supplementary Fig. 2c, d). Moreover, type 2 macrophages [M2], as immunoinhibitory cells, were more enriched in the high-3′-UTR piSNV ratio group (Supplementary Fig. 2c, d).
Immune checkpoints are essential for the immune response and can result in cancer cell immune escape, and immune checkpoint inhibitors, in the form of blocking antibodies, are applied to facilitate an immune response in cancers25–28. We further explored the expression difference in immune checkpoint genes between the high- and low-3′-UTR piSNV ratio groups in tumor samples (Fig. 2g), and we found that IDO1 and CD274 were inhibitory immune checkpoints with significantly higher expression in high-3′-UTR piSNV ratio groups of ESCC and Liver-HCC (Supplementary Fig. 2e, f). IDO1 is a target for cancer immunotherapy and encodes a heme enzyme that acts on multiple tryptophan substrates and plays a crucial role in pathophysiological processes such as immunoregulation, antitumor defense, and antioxidant activity29,30. CD274 (PD-L1) is a well-known ligand that binds with the receptor PD1 in T cells, which can block T-cell activation in cancer31. In addition to the immune inhibitors, we also observed that the immune stimulator IL2 was significantly decreased in the high-3′-UTR piSNV ratio group of ESCC (Supplementary Fig. 2e). IL2 is a primary cytokine for T and NK cell proliferation and activation and affects immune homeostasis by regulating regulatory T (Treg) cells, which play a crucial role in immune cancer therapy32,33. These results suggested that 3′-UTR piSNVs are involved in immune microenvironment heterogeneity and induce the dysregulation of several core immune regulators, which contributes to worse clinical outcome.
Functional effect of 3′-UTR piSNVs on RBP binding
Genetic mutations on RNA substrates can destroy the protein–RNA interactions binding motifs to prevent RBPs from recognizing RNA substrates in cancer cells15. To access the experimental effect of these 3′-UTR piSNVs on RBP binding, we performed electrophoresis mobility shift assays (EMSA) on four randomly selected 3′-UTR piSNVs who predicted to alter the binding of PTBP1 (Fig. 3a, b; Supplementary Fig. 3a, b). As shown in Fig. 3a, there was the strong binding between PTBP1 and unmutated RNA probe of TRIM38 that is associated with Fanconi renotubular syndrome34, while it had visible differences in the probe with the 3′-UTR piSNV of TRIM38 in their binding to PTBP1. There were similar observations existing in 3′-UTR piSNV of KIAA1919, CNTLN, and MOB3B on binding of PTBP1 (Fig. 3b; Supplementary Fig. 3a, b).
We further examined the effect of 3′-UTR piSNVs on RBP binding with the PRIdictor35 webtool, which can predict the binding changes between mutual binding sites in RNA and protein, and we found that 3′-UTR piSNVs could destroy the PTBP1-RNA (Fig. 3c, PDB ID: 2AD9) and SRSF1-RNA (Fig. 3d, PDB ID: 6HPJ) binding complex. In detail, PTBP1 can bind to 5′-CUCUCU-3′ RNA oligonucleotides36. By screening the identified 3′-UTR piSNVs, we found that 21 genes had a mutation from C to T at the fifth position of the above motif mainly in Skin-Melanoma and ESCC (Fig. 3a, Supplementary Fig. 3c, and Supplementary Table 2). Another RBP affected by a 3′-UTR piSNV is SRSF1, which plays an important role in the regulation of alternative splicing events. Since SRSF1 can bind to the 5′-AACAAA-3′ RNA oligonucleotide (PDB ID: 6HPJ)37, we speculated that 3’-UTR piSNVs in the AACAAA sequence motif residing in CHRM3 mRNA could disrupt the binding between SRSF1 and their corresponding RNAs (Fig. 3b, Supplementary Fig. 3d, and Supplementary Table 3).
To determine the effect of these 3′-UTR piSNVs on RBP binding, we integrated RBP motif and CLIP-seq-derived RBP binding data38,39 for all identified 3’-UTR piSNVs, which revealed 24 significantly enriched RBPs. Abnormal expression of RBPs leads to tumorigenesis3, so we detected the differentially expressed RBPs between tumor and corresponding normal tissues in different cancer types (Fig. 3e). The results proved that dysregulation of these RBPs occurs broadly in seven cancer types. Consistent with previous studies40,41, the RBPs PTBP1 and SRSF1 were differentially upregulated in 6 and 3 cancer types, respectively. Moreover, LIN28A/LIN28B and IGF2BP1/2/3 were the most differentially upregulated RBPs in over 5 cancer types. Moreover, IGF2BP1/2 were significantly downregulated in Kidney-RCC and Prost-AdenoCA. LIN28A/LIN28B are LIN-28 family members that are associated with the developmental timing and self-renewal of embryonic stem cells, and their aberrant expression is related to cancer progression42. IGF2BP1/2/3 encode members of the insulin-like growth factor 2 mRNA-binding protein family, such as insulin-like growth factor 2 (IGF2), and bind to the mRNAs of several vital genes, and regulate translation. The dysregulation of IGF2BP1/2/3 was found to be associated with skin squamous cell carcinoma43 and pancreatic cancer44. Thus, these results suggested that the identified 3′-UTR piSNVs affect the binding of some cancer-associated RBPs in cancer development.
Effect of 3′-UTR piSNVs on miRNA binding
At the posttranscriptional level, miRNAs play an irreplaceable role in regulating mRNA expression and controlling many biological processes12,45. Moreover, 3′-UTR piSNVs may also influence posttranscriptional regulation by affecting miRNA binding. By using TargetScan46, miRNASNP47, starBase48, and miRDB45 software to predict the binding miRNA of 3′-UTR piSNVs, we identified a total of 1737 miRNAs that could bind to 1035 3′-UTR piSNV-affected genes. Most recurrent miRNAs are miR-3163 (21), miR-340-5p (19) and miR-186-5p (18), regulating over 18 targeted 3′-UTR piSNV-affected genes.
Network illumination of miRNA and the top 50 differential expressed miRNAs showed that several 3′-UTR piSNV-affected genes are regulated by multiple miRNAs (Fig. 4a and Supplementary Table 4). These genes included PTEN, CRIM1, SMARCA5, VCAN, SRGAP1, and RBM27, which may be regulated by over 10 kinds of miRNAs. For example, PTEN is a tumor suppressor gene that is commonly lost in human cancer, and is associated with prostate cancer, glioblastoma, endometrial cancer, lung cancer and breast cancer to varying degrees49,50. We also observed that the majority of recurrent miRNAs have abnormal expression in cancers. Briefly, 5 miRNAs were significantly differentially upregulated in six kinds of cancers, while 6 miRNAs were downregulated in over five kinds of cancers (Fig. 4b). Our results suggested that several miRNAs show the same dysregulated expression patterns in cancers.
Predicted APA and RNA modification change of 3′-UTR piSNV sites
The 3′-UTR is a unique area of each gene that bind many functional elements, affect transcript isoform generation, and contain RNA modifications, such as APA and m6A RNA modification. To further explore the effect of 3′-UTR piSNVs on APA changes and m6A modification of transcripts, we annotated 3′-UTR piSNVs with RNA modification and APA changes by RMVar51 and APADB52. The results showed that 2.8% (628) and 2.3% (515) of 3′-UTR piSNVs had m6A RNA modification and APA changes, respectively (Fig. 4c), occurring mainly in the SLC16A10, LPGAT1, PRKCB, and USP38 genes.
We further considered the effect of m6A regulators on the m6A RNA modification of transcripts, and we found that IGF2BP1/2/3, as m6A readers, had significant differential expression in seven cancer types. IGF2BP1/2/3 are IGF2 mRNA-binding proteins that enhance mRNA transcript stability by recognizing m6A featured sequences53, and are associated with many cancers, including skin squamous cell carcinoma, testicular cancer, and enchondroma. IGF2BP2 functions as a well-known tumor suppressor gene in lymphoma54. We also paid attention to APA regulator expression changes due to 3′-UTR piSNVs and observed widespread dysregulated expression of APA regulators. Most of these APA regulators were specifically upregulated in seven cancer types, while PPP1CB, PCF11, and PABPC1 were significantly downregulated in ESCC (Supplementary Fig. 4). These results suggested that variants of 3′-UTR piSNVs and the dysregulation of m6A RNA modification and APA regulators could cause posttranscriptional impairment in mRNA transcripts to contribute to cancer pathogenesis.
The cooccurrence of 3′-UTR piSNV-affected genes across 18 cancer types
To investigate the influence of genes affected by 3′-UTR piSNVs, we reviewed the somatic 3′-UTR piSNVs in each cancer type and observed many tumor-specific genes in 2930 nonredundant 3′-UTR piSNV-affected genes. The top 3′-UTR piSNV-affected genes were RNF217, HDAC9, GPATCH2L, DNAJC10, EGFR, PROX1, OGFRL1, and HHIP, and variations in these genes arisen in at least 10 cancer types (Fig. 5a). The 3′-UTR piSNVs in gene RNF217 occurred in 63 patients across 10 cancer types. The RNF217 protein is a member of RING1-IBR-RING24 (RBR) ubiquitin protein ligase family, which contains a transmembrane domain and regulates apoptosis signaling by interacting with HAX1 to promote leukemia development55. In total, 70 and 33 patients (among 2413 total patients) had 3′-UTR piSNVs of HDAC9 and EGFR, respectively. The HDAC9 protein is a member of the histone deacetylase family that can repress transcriptional regulation by catalyzing acetyl group removal from lysine residues and is associated with cutaneous T cell lymphoma and maxillary cancer56,57. Epidermal growth factor receptor (EGFR) is a receptor tyrosine kinase that contains an extracellular ligand-binding domain and binds to epidermal growth factor, thus inducing receptor dimerization and tyrosine autophosphorylation leading to motility, growth, cell proliferation, and the development of many types of cancer58,59. EGFR is an oncogene that is widely amplified and mutated in several cancer types, including non-small cell lung cancer, glioblastoma, and basal-like breast cancers60. The T790M EGFR variant has been shown to confer resistance to several drugs, such as gefitinib and erlotinib, and acts as a resistance marker61,62. Our data indicated that 3′-UTR variations in HDAC9 and EGFR may promote cancer development through posttranscriptional regulation.
The transcriptome data from the PCAWG project and our in-house ESCC cohort were analyzed to estimate the differential expression of these 3′-UTR piSNV-affected genes between tumor and corresponding normal tissues in seven cancer types. We found that the majority of the top 5% (58) most frequent 3′-UTR piSNV-affected genes were differentially expressed in at least one cancer type (Fig. 5b). For example, MAPK8IP2, ADAMTS12, and CHAF1B were upregulated in over five cancer types, while HHIP, PROX1, and PLCG2 were downregulated in over five cancer types.
Then, we investigated the 3′-UTR-length corrected frequency of 3’-UTR piSNV occurrence in 859 genes at the protein function and/or posttranscriptional regulation levels. We found that PREX2, ADAMTS12, and PLXDC2 were the genes most frequently affected by 3′-UTR piSNVs genes in terms of both posttranscriptional changes and functional damage (Fig. 5c). PREX2 is a member of the phosphatidylinositol 3,4,5-trisphosphate (PIP3)-dependent Rac exchanger (PREX) family. The domain of PREX2 interacts with the phosphatase and tensin homolog (PTEN) gene product to inhibit PTEN phosphatase activity, thus activating the phosphoinositide-3 kinase (PI3K) signaling pathway, which plays a role in insulin signaling pathways. PREX2 mutation or overexpression has been observed in some cancers63,64. And we found there were both missense mutations and 3′-UTR piSNVs of PREX2 from the same sample in 5 samples of Skin-Melanoma and 1 sample of Panc-AdenoCA. Interestingly, there was one Skin-Melanoma sample having four different missense mutations and 2 different 3′-UTR piSNVs of PREX2, who had higher levels of malignancy (IV, AJCC 7th Edition 2010), while other samples normally had simply one missense mutation or 3′-UTR piSNV. Thus, we inferred that PREX2, ADAMTS12, and PLXDC2 mutations lead to cancer development at both the protein function and posttranscriptional regulation levels. And some identified 3′-UTR piSNV-affected genes, such as SLC6A11, NUDCD2, CTNNA3, RAB3C, and AJAP1, are well-studied cancer/disease related genes with a high deleterious mutation rate. Intriguingly, we found that some 3′-UTR piSNV-affected genes, such as RNF217, PGAM2, RBBP4, PIK3CA, CDYL2, EGFR, and PIK3R2, involved in carcinogenesis mainly via posttranscriptional regulation. The clarification of the features of these 3′-UTR piSNV-affected genes greatly broadened our understanding of cancer biology.
Functional enrichment and pathological network/pathway analysis of 3′-UTR piSNV-affected genes
To explore the systematic function of 3′-UTR piSNV-affected genes, we used the ‘clusterProfiler’ R package65 to perform functional enrichment analysis and found that 3′-UTR piSNV-affected genes were enriched in some canonical cancer pathways and metabolism-related pathways, such as the Notch/Wnt/PI3K-Akt/MAPK/Ras/Hippo/EGFR/ERBB signaling pathways and insulin/thyroid hormone/estrogen pathways (Fig. 6a, b). Then, we applied the Hotnet266 and activePathway67 workflows to identify pathological networks and pathways in the HINT + HI201268,69, iRefIndex70, and Multinet71 protein-protein interaction networks, and we identified 1 significantly altered network and 5 pathological pathways based on the occurrence frequency of each 3′-UTR piSNV gene. The network consisted of NOTCH1, NOTCH2, NOTCH3, JAG1, JAG2, LFNG, MFNG, NUMB, FBXW7, and LINGO1 (Fig. 6c). These genes are involved in the NOTCH signaling pathway, which controls essential cellular processes, such as proliferation, differentiation and branching morphogenesis72,73. Moreover, five pathological pathways, including the PI3K/AKT/FGFR/MAPK signaling, EGFR-related signaling, AMPA-related signaling, Notch signaling and Wnt signaling pathways, had a high correlation with tumor development (Fig. 6d). The above pathological networks and pathways showed that 3′-UTR piSNV-affected genes were involved in vital tumorigenesis processes at the posttranscriptional level.
Therapeutic implications of 3′-UTR piSNVs and 3′-UTR piSNV-affected genes
To investigate the therapeutic implications of these identified 3′-UTR piSNVs, we employed the Connectivity Map (CMAP) workflow74 to assess gene expression profiles for 1309 compounds to identify clinical drugs based on the expression signature of these 3′-UTR piSNV-affected genes. Finally, we identified a total of 80 compounds, and the majority of them were found to have an effect on a specific cancer type (Fig. 6e). 5155877 (HRF-3) was identified as a highly ranked small molecule that affects gene expression signatures associated with 3 cancer types (Fig. 6e). Overall, these identified drugs and compounds might show potential benefits in treating patients with specific types of cancer.
Discussion
The 3′-UTR can drive or enhance cancer pathogenesis at the posttranscriptional gene regulation level by disrupting regulatory element binding and dysregulating oncogenic gene expression1. Previous cancer genomics studies have mostly focused on genetic aberrations of protein-coding regions. WGS of large tumor samples in the PCAWG project and our in-house ESCC cohorts provided an opportunity to explore the comprehensive effect of somatic 3′-UTR mutations on posttranscriptional regulation in cancer. In previous work, we had proved via experimental and computational analyses that somatic mutations identified by PIVar algorithm could disrupt the binding of the RBP-RNA complex6.
By employing updated PIVar, we identified 25,216 3′-UTR piSNVs spanning 2930 genes across 18 cancer types and observed the striking phenomenon that most SNVs were 3′-UTR piSNVs, which implied that RBPs prefer to bind in 3′-UTR rather than other regions to modulate posttranscriptional regulation. We compared cancer and control samples from the DSMNC database, and observed an elevated ratio of somatic 3′-UTR piSNVs in pan-cancer, in addition, a high ratio of 3′-UTR piSNVs indicated poor patient survival for four cancer types (ESCC, Ovary-AdenoCA, Biliary-AdenoCA and Stomach-AdenoCA), and there was still an association between the 3’-UTR piSNV ratio and clinical survival in ESCC and Ovary-AdenoCA even when adjusting for other variables by multivariate Cox regression. These results suggested that the 3′-UTR piSNV ratio could be a potential prognostic biomarker for these four kinds of cancers. Notably, some identified 3′-UTR piSNV-affected genes, such as PREX2, ADAMTS12, and PLXDC2, were regarded as cancer-related genes with a high deleterious mutation rate. For example, PREX2 is an oncogene that interacts with PTEN gene product to inhibit PTEN phosphatase activity, thus activating the PI3K signaling pathway, which plays a role in insulin signaling pathways, and its mutations or overexpression have been observed in some cancers. PREX2 had the second highest 3′-UTR-length corrected frequency of 3’-UTR piSNV occurrence, and 9 of 2413 cancer patients were also identified to carry protein damaging mutations of PREX2; Thus, the 3′-UTR piSNV-affected genes were able to promote carcinogenesis at both the protein function and posttranscriptional regulation levels. Thrillingly, we found that some 3′-UTR piSNV-affected genes, such as URB1, PIK3CA, ROBO1, and FAT4, might exert their function during carcinogenesis mainly via posttranscriptional mechanisms. The identification of these 3′-UTR piSNV-affected genes that were not previously well characterized at the posttranscriptional level, which greatly broadened our understanding of cancer biology. In this study, we only focused on the impaired structure and posttranscriptional regulation of 3′-UTR piSNV-affected genes, while some genes may function as trans-acting factors or epigenetic regulators and perform their function at the transcriptional level; thus, future studies need to comprehensively explore the functions of these identified 3′-UTR piSNV-affected genes.
It is unclear how posttranscriptional regulation affects the tumor microenvironment, which determines the invasiveness of cancers75. We found that in the worse clinical outcome samples that had a high-3′-UTR piSNV ratio, there was distinct heterogeneity of immune cells in different cancer types, and type 2 macrophages [M2], as immunoinhibitory cells, were more enriched in the high-3′-UTR piSNV ratio groups (Supplementary Fig. 2c, d). Moreover, the inhibitory immune checkpoint IDO1/CD274 (PD-L1) had significantly higher expression in the high-3′-UTR piSNV ratio groups (Supplementary Fig. 2e, f). Many previous studies have proved that immune checkpoint therapies (such as anti-PD-1/PD-L1 therapies) have extraordinary therapeutic effects in cancer patients and revolutionize the treatment standard for multiple cancers76. However, more matched expression data of different cancers are needed to illustrate the immune microenvironment characteristics at the posttranscriptional regulation level. Finally, we identified several potential therapeutic compounds for patients with specific cancer types. Recent studies also show that RBPs play a role in viral infections. The expression of RBP RBM47 is upregulated by Dengue virus (DENV) infection and has an inhibitory effect on DENV replication; RBM47 is an Interferon-stimulated genes (ISG) that is upregulated by multiple viral induction or interferon stimulation and has broad-spectrum antiviral effects. rBM47 had no significant effect on IFN production, but had a significant enhancing effect on the activation of the Interferon (IFN) response element ISRE and the expression of ISGs. In the RBP immunoprecipitation assay, RBM47 specifically bound the 3′-UTR of type I IFN receptor IFNAR1 mRNA, stabilized the mRNA, and subsequently increased the protein level of intracellular IFNAR1, promoted viral infection or interferon-induced phosphorylation of STAT1/2, enhanced interferon-stimulated expression of gene ISGs and amplified the antiviral effect of the host77.
Generally, our study revealed the comprehensive characteristics and clinical relevance of 3′-UTR piSNVs across cancers, providing new insight for investigating posttranscriptional regulation, which contributes to tumor progression and may promote the development of new strategies for cancer treatment.
Methods
Identification of 3′-UTR piSNVs from somatic mutation WGS data of 18 cancer types
We downloaded somatic mutations derived from WGS data of 2413 patients across 18 cancer types in the PCAWG (https://dcc.icgc.org/releases/PCAWG/) project17, including Biliary-AdenoCA, Bone-Osteosarc, Breast-AdenoCa, CNS-Medullo, CNS-PiloAstro, Eso-AdenoCa, Kidney-RCC, Liver-HCC, Lymph-BNHL, Lymph-CLL, Myeloid-MPN, Ovary-AdenoCA, Panc-AdenoCA, Panc-Endocrine, Prost-AdenoCA, Skin-Melanoma, and Stomach-AdenoCA, as well as our in-house ESCC WGS samples (663 patients)18. We used WGS samples from the DSMNC (https://dsmnc.big.ac.cn/) database (Database of Somatic Mutations in Normal Cells)19 as a control, and we obtained 0.77 million somatic SNVs occurring in over 579 human normal cells from the DSMNC database.
PIVAR algorithm was our previously developed to evaluate the impact of mutations on posttranscriptional regulation6. We updated PIVAR algorithm to evaluate the impact of mutations on posttranscriptional regulation by adding the latest 318 eCLIP-seq data from ENCODE (112 RBPs) to identify 3′-UTR piSNVs38,39, which could disrupt the binding between RNAs and RBPs. We used the Wilcoxon rank-sum test to compare the distribution of the 3′-UTR piSNV ratio (ratio of 3′-UTR piSNVs to 3′-UTR SNVs) between the control samples and samples of each cancer type. In detail, we calculated the ratio of piSNVs to SNVs in the 3′-UTR for each sample and used Wilcoxon rank-sum test to evaluate the distribution differences of 3′-UTR piSNV ratio between cancer samples in each cancer type and control samples.
GWAS SNPs data was downloaded from GWAS Catalog database20 (https://www.ebi.ac.uk/gwas/downloads) and CCGD-ESCC database21 (http://db.cbi.pku.edu.cn/ccgd/ESCCdb).
Clinical survival analysis
Corresponding clinical data of each cancer type was collected from the PCAWG (https://dcc.icgc.org/releases/PCAWG/) project and our in-house ESCC cohort. Then, the ‘surv_cutpoint’ function in ‘survminer’ R package was used to determine the optimal 3′-UTR piSNV ratio based on the OS data of the patients. Overall survival analysis was performed with the Kaplan–Meier method and p value was calculated with log-rank test. In the multivariate cox proportional regression analysis, age, sex, tumor-node-metastasis (TNM) stage, and the race of patients was assessed to analyze the relationship between the 3′-UTR piSNV ratio and clinical outcomes.
Characteristics of immune cell types and immune regulators between the high- and low-3′-UTR piSNV ratio groups
We employed the CIBERSORT24 algorithm to calculate the abundance scores of 22 immune cell types to evaluate the cellular heterogeneity landscape in ESCC and Liver-HCC by RNA-seq data. We obtained immune checkpoint genes from previous studies76,78 and used the ‘DESeq2’ R package79 to calculate the expression difference between the high-3’-UTR piSNV ratio group and the low-3’-UTR piSNV ratio group. We compared the abundance of each immune cell type and gene expression between the high- and low-3’-UTR piSNV ratio groups by the Wilcoxon rank-sum test.
RBP motif enrichment of 3′-UTR piSNV loci and their impact on RBP binding
Many RBPs interact with mRNAs via a limited set of modular RNA-binding domains, including the RNA recognition motif, heterogeneous nuclear ribonucleoprotein K-homology domain, and zinc fingers80. The RNA-binding domains of RBPs initially determine the specificity and preferences of RNA binding with specific sequence motifs81. Therefore, we downloaded 247 positional weight matrices (PWMs) of the inferred RNA binding motif from the AURA database82, which were used to call motif matches in the transcriptome. RBP motif and CLIP-seq-derived RBP binding were integrated to identify 3′-UTR piSNVs38,39; FDR and OR values were calculated with Fisher’s exact tests to identify the significantly enriched RBPs affected by 3′-UTR piSNVs (FDR < 0.05 and OR > 1).
To further estimate the influence of piSNVs on RBP binding, the crystal structures of the PTBP1-RNA (PDB ID: 2AD9) and SRSF1-RNA (PDB ID: 6HPJ) complexes were downloaded from the Protein Data Bank83 (https://www.rcsb.org/); the binding strength was predicted with the PRIdictor35 webtool (http://bclab.inha.ac.kr/pridictor), and the structures were visualized with PyMOL software (https://pymol.org/).
Chemiluminescent electrophoresis mobility shift assays
To evaluate the functional effect of RNA mutation on binding of RBP, we used LightShift Chemiluminescent RNA EMSA kit (Catalog # 20158; Thermo Scientific, Rockford, USA) to performe electrophoresis mobility shift assays. RBP PTBP1 kept on −80 °C was used in the assay. Then, 200 ng PTBP1 protein was pre-incubated with 0.2 µL tRNA (10 mg/mL) in 1× RNA EMSA binding buffer containing 5% glycerol for 10 min at room temperature to block unspecific binding as much as possible. After that, 100 fmol synthesized 3′-biotin-labeled wild-type or point-mutated RNA oligos (Supplementary Table 5) for each reaction were respectively added to the mixture to a final volume of 20 µl and incubated for 20 min at room temperature. Then, 1× loading buffer was added to the RBP-RNA mixture and immediately loaded into the pre-run 6% TBE polyacrylamide gel, and ran at 100 V for 45–60 min in cooled 0.5× TBE buffer. Samples were then transferred to positively charged nylon membrane (Catalog # FFN15, Beyotime, Shanghai, China), and crosslinked with UV-light crosslinking instrument equipped with 254 nm bulbs for 5 min. The subsequent blocking, washing and detection were performed according to the manufacturer’s instructions.
Predicted RNA modification and APA influence on 3′-UTR piSNVs
We annotated RNA modifications and APA changes in 3′-UTR piSNVs with RMVar51 and APADB52. We collected 22 kinds of APA regulators and 20 kinds of m6A RNA modification regulators, including 7 m6A writers, 2 erasers and 11 readers.
miRNA annotation of 3′-UTR piSNVs
We identified possible binding miRNAs of 3′-UTR piSNVs with TargetScan v7.246, miRNASNP v347, starBase v3.048, and miRDB v6.045 software, and we built a network of the top miRNAs related to 3′-UTR piSNV-affected genes.
Differential expression analysis
RNA-seq/miRNA raw read count data of seven types of cancer tissues and corresponding normal tissues were collected from the PCAWG and our in-house ESCC cohorts. The R package ‘DESeq2’79 was used to evaluate the expression of 2930 3′-UTR piSNV-affected genes, 248 enriched RBPs, miRNA and m6A/APA regulators identified in the previous steps, and genes with fold-change >2 and FDR < 0.05 were considered to be significant differentially expressed genes.
Identification of coexisting protein damage-related mutations in 3′-UTR piSNV-affected genes
Functional consequences on proteins of identified 2930 3′-UTR piSNV-affected genes were predicted with activeDriverWGS tool84. Then, the 3’-UTR-length of each gene was applied to corrected occurrence frequency of protein damaging mutations and 3’-UTR posttranscriptional impaired mutations in these genes was analyzed in all 2413 cancer patients85.
Functional enrichment and pathological network/pathway analysis
We performed functional enrichment analysis via the ‘clusterProfiler’ R package65 with a Bonferroni correction test and identified significant pathways with FDR values < 0.05. Then we used the HotNet266 and activePathway67 workflows to mine significantly mutated subnetworks and pathways according to HINT + HI2012 (a combination of the HINT network68 and the HI-201269), iRefIndex70 and Multinet71 protein–protein interaction networks, and the sample frequency of 3′-UTR piSNVs and damage score for the affected genes were taken as the network heat score to identify significant subnetworks and pathways with default parameters.
Potential clinical drug analysis
To further explore the therapeutic implications of 3′-UTR piSNV-affected genes, drug response signatures assembled in CMAP build 02 (Broad Institute)74 were downloaded to compare with the gene expression of identified 3′-UTR piSNV-affected gene, which contains information on gene expression profiles and the sensitivity to 1309 compounds.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We would like to thank Dr. Huajing Teng (Peking University Cancer Hospital) and Dr. Fengbiao Mao (Peking University Third Hospital) for kindly suggestions. We thank PCAWG projects for the data access. We are also grateful to computational support from Shanghai Mingma Technologies Co., Ltd. This work was supported by funding from the National Key R&D Program of China (2021YFC2501000), the National Natural Science Foundation of China (82030089), the CAMS Innovation Fund for Medical Sciences (2021-1-I2M-018), and the Sanming Project of Medicine in Shenzhen (No. SZSM201812062).
Author contributions
Z.L. conceptualized and supervised the study. W.W. collected data and performed data processing. W.W., W.G., and Y.L. performed computational analyses and statistical analysis. Q.L. performed experimental validation. Z.L., W.W., H.C., Z.S., and Y.C. prepared and revised the manuscript. W.W., W.G., and Q.L. are considered co-first authors, and all authors reviewed and approved the final paper.
Data availability
The majority of the datasets analyzed in this study were downloaded from the Pan-Cancer Analysis of Whole Genomes (PCAWG) study (https://dcc.icgc.org/releases/PCAWG/); NGS and clinical data of ESCC samples are available in Genome Sequence Archive (GSA) (https://ngdc.cncb.ac.cn/gsa-human/) under accession HRA000021; Somatic mutations in normal cells were downloaded from the DSMNC database (https://dsmnc.big.ac.cn/).
Code availability
PIVar algorithm are available at GitHub (https://github.com/WeiWenqing/PIVar). All computational tools used with default parameters in this study are available for download as open-source software and are detailed in “Methods”.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Wenqing Wei, Wenyan Gao, Qinglan Li.
Contributor Information
Zhongsheng Sun, Email: sunzs_biols@163.com.
Zhihua Liu, Email: liuzh@cicams.ac.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41525-022-00305-0.
References
- 1.Schuster SL, Hsieh AC. The untranslated regions of mRNAs in cancer. Trends Cancer. 2019;5:245–262. doi: 10.1016/j.trecan.2019.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Mayr C. Regulation by 3’-untranslated regions. Annu. Rev. Genet. 2017;51:171–194. doi: 10.1146/annurev-genet-120116-024704. [DOI] [PubMed] [Google Scholar]
- 3.Gebauer, F. et al. RNA-binding proteins in human genetic disease. Nat. Rev. Genet.10.1038/s41576-020-00302-y (2020). [DOI] [PubMed]
- 4.Li YE, et al. Identification of high-confidence RNA regulatory elements by combinatorial classification of RNA-protein binding sites. Genome Biol. 2017;18:169. doi: 10.1186/s13059-017-1298-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chen J, et al. The RNA-binding protein ROD1/PTBP3 cotranscriptionally defines AID-loading sites to mediate antibody class switch in mammalian genomes. Cell Res. 2018;28:981–995. doi: 10.1038/s41422-018-0076-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Teng H, et al. Prevalence and architecture of posttranscriptionally impaired synonymous mutations in 8320 genomes across 22 cancer types. Nucleic Acids Res. 2020;48:1192–1205. doi: 10.1093/nar/gkaa019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zou C, et al. RBM38 in cancer: Role and mechanism. Cell Mol. Life Sci. 2021;78:117–128. doi: 10.1007/s00018-020-03593-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wang X, et al. UDP-glucose accelerates SNAI1 mRNA decay and impairs lung cancer metastasis. Nature. 2019;571:127–131. doi: 10.1038/s41586-019-1340-y. [DOI] [PubMed] [Google Scholar]
- 9.Pavitt GD. Regulation of translation initiation factor eIF2B at the hub of the integrated stress response. Wiley Interdiscip. Rev. RNA. 2018;9:e1491. doi: 10.1002/wrna.1491. [DOI] [PubMed] [Google Scholar]
- 10.Yang S, et al. Identification of an immune-related RNA-binding protein signature to predict survival and targeted therapy responses in liver cancer. Genomics. 2021;113:795–804. doi: 10.1016/j.ygeno.2021.01.021. [DOI] [PubMed] [Google Scholar]
- 11.Hopkins TG, et al. The RNA-binding protein LARP1 is a post-transcriptional regulator of survival and tumorigenesis in ovarian cancer. Nucleic Acids Res. 2016;44:1227–46. doi: 10.1093/nar/gkv1515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pereira B, Billaud M, Almeida R. RNA-binding proteins in cancer: Old players and new actors. Trends Cancer. 2017;3:506–528. doi: 10.1016/j.trecan.2017.05.003. [DOI] [PubMed] [Google Scholar]
- 13.Zhu Y, et al. POSTAR2: Deciphering the post-transcriptional regulatory logics. Nucleic Acids Res. 2019;47:D203–D211. doi: 10.1093/nar/gky830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Su R, et al. Global profiling of RNA-binding protein target sites by LACE-seq. Nat. Cell Biol. 2021;23:664–675. doi: 10.1038/s41556-021-00696-9. [DOI] [PubMed] [Google Scholar]
- 15.Hentze MW, et al. A brave new world of RNA-binding proteins. Nat. Rev. Mol. Cell Biol. 2018;19:327–341. doi: 10.1038/nrm.2017.130. [DOI] [PubMed] [Google Scholar]
- 16.Gong J, et al. RBM45 competes with HDAC1 for binding to FUS in response to DNA damage. Nucleic Acids Res. 2017;45:12862–12876. doi: 10.1093/nar/gkx1102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Consortium, I.T.P.-C.A.o.W.G. Pan-cancer analysis of whole genomes. Nature. 2020;578:82–93. doi: 10.1038/s41586-020-1969-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cui Y, et al. Whole-genome sequencing of 508 patients identifies key molecular features associated with poor prognosis in esophageal squamous cell carcinoma. Cell Res. 2020;30:902–913. doi: 10.1038/s41422-020-0333-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Miao X, et al. DSMNC: A database of somatic mutations in normal cells. Nucleic Acids Res. 2019;47:D971–D975. doi: 10.1093/nar/gky1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Buniello A, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47:D1005–D1012. doi: 10.1093/nar/gky1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Peng L, et al. CCGD-ESCC: A comprehensive database for genetic variants associated with esophageal squamous cell carcinoma in Chinese population. Genomics Proteom. Bioinform. 2018;16:262–268. doi: 10.1016/j.gpb.2018.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kim SK. Identification of 613 new loci associated with heel bone mineral density and a polygenic risk score for bone mineral density, osteoporosis, and fracture. PLoS One. 2018;13:e0200785. doi: 10.1371/journal.pone.0200785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Christakoudi S, et al. GWAS of allometric body-shape indices in UK Biobank identifies loci suggesting associations with morphogenesis, organogenesis, adrenal cell renewal, and cancer. Sci. Rep. 2021;11:10688. doi: 10.1038/s41598-021-89176-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Newman AM, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods. 2015;12:453–7. doi: 10.1038/nmeth.3337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Carlino MS, Larkin J, Long GV. Immune checkpoint inhibitors in melanoma. Lancet. 2021;398:1002–1014. doi: 10.1016/S0140-6736(21)01206-X. [DOI] [PubMed] [Google Scholar]
- 26.Griffin ME, et al. Enterococcus peptidoglycan remodeling promotes checkpoint inhibitor cancer immunotherapy. Science. 2021;373:1040–1046. doi: 10.1126/science.abc9113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ribas A, Wolchok JD. Cancer immunotherapy using checkpoint blockade. Science. 2018;359:1350–1355. doi: 10.1126/science.aar4060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ju M, et al. Pan-cancer analysis of NLRP3 inflammasome with potential implications in prognosis and immunotherapy in human cancer. Brief Bioinform. 2021;22:bbaa345. doi: 10.1093/bib/bbaa345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Munn DH, Mellor AL. Indoleamine 2,3 dioxygenase and metabolic control of immune responses. Trends Immunol. 2013;34:137–43. doi: 10.1016/j.it.2012.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhu Y, et al. CaCO3-assisted preparation of pH-responsive immune-modulating nanoparticles for augmented chemo-immunotherapy. Nanomicro Lett. 2020;13:29. doi: 10.1007/s40820-020-00549-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Liu J, et al. Co-delivery of IOX1 and doxorubicin for antibody-independent cancer chemo-immunotherapy. Nat. Commun. 2021;12:2425. doi: 10.1038/s41467-021-22407-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hsu EJ, et al. A cytokine receptor-masked IL2 prodrug selectively activates tumor-infiltrating lymphocytes for potent antitumor therapy. Nat. Commun. 2021;12:2768. doi: 10.1038/s41467-021-22980-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Arenas-Ramirez N, Woytschak J, Boyman O. Interleukin-2: Biology, design, and application. Trends Immunol. 2015;36:763–777. doi: 10.1016/j.it.2015.10.003. [DOI] [PubMed] [Google Scholar]
- 34.Arya R, et al. A genetic association study of carotid intima-media thickness (CIMT) and plaque in Mexican Americans and European Americans with rheumatoid arthritis. Atherosclerosis. 2018;271:92–101. doi: 10.1016/j.atherosclerosis.2017.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Tuvshinjargal N, et al. PRIdictor: Protein-RNA interaction predictor. Biosystems. 2016;139:17–22. doi: 10.1016/j.biosystems.2015.10.004. [DOI] [PubMed] [Google Scholar]
- 36.Oberstrass FC, et al. Structure of PTB bound to RNA: Specific binding and implications for splicing regulation. Science. 2005;309:2054–7. doi: 10.1126/science.1114066. [DOI] [PubMed] [Google Scholar]
- 37.Clery A, et al. Structure of SRSF1 RRM1 bound to RNA reveals an unexpected bimodal mode of interaction and explains its involvement in SMN1 exon7 splicing. Nat. Commun. 2021;12:428. doi: 10.1038/s41467-020-20481-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhang J, et al. RADAR: Annotation and prioritization of variants in the post-transcriptional regulome of RNA-binding proteins. Genome Biol. 2020;21:151. doi: 10.1186/s13059-020-01979-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Davis CA, et al. The encyclopedia of DNA elements (ENCODE): Data portal update. Nucleic Acids Res. 2018;46:D794–D801. doi: 10.1093/nar/gkx1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yap K, et al. A short tandem repeat-enriched RNA assembles a nuclear compartment to control alternative splicing and promote cell survival. Mol. Cell. 2018;72:525–540. doi: 10.1016/j.molcel.2018.08.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Lv Y, et al. SRSF1 inhibits autophagy through regulating Bcl-x splicing and interacting with PIK3C3 in lung cancer. Signal Transduct. Target Ther. 2021;6:108. doi: 10.1038/s41392-021-00495-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zhao M, et al. Expression of RNA-binding protein LIN28 in classic gastric hepatoid carcinomas, gastric fetal type gastrointestinal adenocarcinomas, and hepatocellular carcinomas: An immunohistochemical study with comparison to SALL4, alpha-fetoprotein, glypican-3, and Hep Par1. Pathol. Res Pract. 2018;214:1707–1712. doi: 10.1016/j.prp.2018.07.037. [DOI] [PubMed] [Google Scholar]
- 43.Liu Z, et al. IGF2BP1 over-expression in skin squamous cell carcinoma cells is essential for cell growth. Biochem. Biophys. Res. Commun. 2018;501:731–738. doi: 10.1016/j.bbrc.2018.05.057. [DOI] [PubMed] [Google Scholar]
- 44.Atay S. Integrated transcriptome meta-analysis of pancreatic ductal adenocarcinoma and matched adjacent pancreatic tissues. PeerJ. 2020;8:e10141. doi: 10.7717/peerj.10141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Chen Y, Wang X. miRDB: An online database for prediction of functional microRNA targets. Nucleic Acids Res. 2020;48:D127–D131. doi: 10.1093/nar/gkz757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Agarwal V, et al. Predicting effective microRNA target sites in mammalian mRNAs. Elife. 2015;4:e05005. doi: 10.7554/eLife.05005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Liu CJ, et al. miRNASNP-v3: A comprehensive database for SNPs and disease-related variations in miRNAs and miRNA targets. Nucleic Acids Res. 2021;49:D1276–D1281. doi: 10.1093/nar/gkaa783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Li JH, et al. starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42:D92–7. doi: 10.1093/nar/gkt1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Sun Y, et al. PTENalpha functions as an immune suppressor and promotes immune resistance in PTEN-mutant cancer. Nat. Commun. 2021;12:5147. doi: 10.1038/s41467-021-25417-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Wu RC, et al. Identification of the PTEN-ARID4B-PI3K pathway reveals the dependency on ARID4B by PTEN-deficient prostate cancer. Nat. Commun. 2019;10:4332. doi: 10.1038/s41467-019-12184-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Luo, X. et al. RMVar: An updated database of functional variants involved in RNA modifications. Nucleic Acids Res.10.1093/nar/gkaa811 (2020). [DOI] [PMC free article] [PubMed]
- 52.Muller S, et al. APADB: A database for alternative polyadenylation and microRNA regulation events. Database. 2014;2014:bau076. doi: 10.1093/database/bau076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Huang H, et al. Publisher correction: Recognition of RNA N(6)-methyladenosine by IGF2BP proteins enhances mRNA stability and translation. Nat. Cell Biol. 2020;22:1288. doi: 10.1038/s41556-020-00580-y. [DOI] [PubMed] [Google Scholar]
- 54.Shen C, et al. m(6)A-dependent glycolysis enhances colorectal cancer progression. Mol. Cancer. 2020;19:72. doi: 10.1186/s12943-020-01190-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Fontanari Krause LM, et al. Identification and characterization of OSTL (RNF217) encoding a RING-IBR-RING protein adjacent to a translocation breakpoint involving ETV6 in childhood ALL. Sci. Rep. 2014;4:6565. doi: 10.1038/srep06565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Das S, Natarajan R. HDAC9: An inflammatory link in atherosclerosis. Circ. Res. 2020;127:824–826. doi: 10.1161/CIRCRESAHA.120.317723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cui J, et al. IL-4 inhibits regulatory T cells differentiation by HDAC9-mediated epigenetic regulation. Cell Death Dis. 2021;12:501. doi: 10.1038/s41419-021-03769-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kim DH, Triet HM, Ryu SH. Regulation of EGFR activation and signaling by lipids on the plasma membrane. Prog. Lipid Res. 2021;83:101115. doi: 10.1016/j.plipres.2021.101115. [DOI] [PubMed] [Google Scholar]
- 59.Jin H, et al. EGFR activation limits the response of liver cancer to lenvatinib. Nature. 2021;595:730–734. doi: 10.1038/s41586-021-03741-7. [DOI] [PubMed] [Google Scholar]
- 60.Qiao M, et al. Immune checkpoint inhibitors in EGFR-mutated NSCLC: Dusk or dawn? J. Thorac. Oncol. 2021;16:1267–1288. doi: 10.1016/j.jtho.2021.04.003. [DOI] [PubMed] [Google Scholar]
- 61.Janjigian YY, et al. Dual inhibition of EGFR with afatinib and cetuximab in kinase inhibitor-resistant EGFR-mutant lung cancer with and without T790M mutations. Cancer Discov. 2014;4:1036–45. doi: 10.1158/2159-8290.CD-14-0326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Du Z, et al. Structure-function analysis of oncogenic EGFR kinase domain duplication reveals insights into activation and a potential approach for therapeutic targeting. Nat. Commun. 2021;12:1382. doi: 10.1038/s41467-021-21613-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Yang MH, et al. Somatic mutations of PREX2 gene in patients with hepatocellular carcinoma. Sci. Rep. 2019;9:2552. doi: 10.1038/s41598-018-36810-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yang J, et al. PREX2 promotes the proliferation, invasion, and migration of pancreatic cancer cells by modulating the PI3K signaling pathway. Oncol. Lett. 2016;12:1139–1143. doi: 10.3892/ol.2016.4688. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 65.Yu G, et al. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–7. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Leiserson MD, et al. Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes. Nat. Genet. 2015;47:106–14. doi: 10.1038/ng.3168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Paczkowska M, et al. Integrative pathway enrichment analysis of multivariate omics data. Nat. Commun. 2020;11:735. doi: 10.1038/s41467-019-13983-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Das J, Yu H. HINT: High-quality protein interactomes and their applications in understanding human disease. BMC Syst. Biol. 2012;6:92. doi: 10.1186/1752-0509-6-92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Yu H, et al. Next-generation sequencing to generate interactome datasets. Nat. Methods. 2011;8:478–80. doi: 10.1038/nmeth.1597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Razick S, Magklaras G, Donaldson IM. iRefIndex: A consolidated protein interaction database with provenance. BMC Bioinform. 2008;9:405. doi: 10.1186/1471-2105-9-405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Khurana E, et al. Interpretation of genomic variants using a unified biological network approach. PLoS Comput. Biol. 2013;9:e1002886. doi: 10.1371/journal.pcbi.1002886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Meurette O, Mehlen P. Notch signaling in the tumor microenvironment. Cancer Cell. 2018;34:536–548. doi: 10.1016/j.ccell.2018.07.009. [DOI] [PubMed] [Google Scholar]
- 73.Majumder S, et al. Targeting notch in oncology: The path forward. Nat. Rev. Drug Discov. 2021;20:125–144. doi: 10.1038/s41573-020-00091-3. [DOI] [PubMed] [Google Scholar]
- 74.Lamb J, et al. The connectivity map: Using gene-expression signatures to connect small molecules, genes, and disease. Science. 2006;313:1929–35. doi: 10.1126/science.1132939. [DOI] [PubMed] [Google Scholar]
- 75.Thorsson V, et al. The immune landscape of cancer. Immunity. 2018;48:812–830. doi: 10.1016/j.immuni.2018.03.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Tang J, Shalabi A, Hubbard-Lucey VM. Comprehensive analysis of the clinical immuno-oncology landscape. Ann. Oncol. 2018;29:84–91. doi: 10.1093/annonc/mdx755. [DOI] [PubMed] [Google Scholar]
- 77.Wang K, et al. RNA-binding protein RBM47 stabilizes IFNAR1 mRNA to potentiate host antiviral activity. EMBO Rep. 2021;22:e52205. doi: 10.15252/embr.202052205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Mariathasan S, et al. TGFbeta attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature. 2018;554:544–548. doi: 10.1038/nature25501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Lunde BM, Moore C, Varani G. RNA-binding proteins: Modular design for efficient function. Nat. Rev. Mol. Cell Biol. 2007;8:479–90. doi: 10.1038/nrm2178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Ray D, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499:172–7. doi: 10.1038/nature12311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Dassi E, et al. AURA: Atlas of UTR regulatory activity. Bioinformatics. 2012;28:142–4. doi: 10.1093/bioinformatics/btr608. [DOI] [PubMed] [Google Scholar]
- 83.Burley SK, et al. RCSB Protein Data Bank: Powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering, and energy sciences. Nucleic Acids Res. 2021;49:D437–D451. doi: 10.1093/nar/gkaa1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Zhu, H. et al. Candidate cancer driver mutations in distal regulatory elements and long-range Chromatin Interaction Networks. Mol Cell10.1016/j.molcel.2019.12.027 (2020). [DOI] [PubMed]
- 85.Hong W, et al. APAatlas: Decoding alternative polyadenylation across human tissues. Nucleic Acids Res. 2020;48:D34–D39. doi: 10.1093/nar/gkz876. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The majority of the datasets analyzed in this study were downloaded from the Pan-Cancer Analysis of Whole Genomes (PCAWG) study (https://dcc.icgc.org/releases/PCAWG/); NGS and clinical data of ESCC samples are available in Genome Sequence Archive (GSA) (https://ngdc.cncb.ac.cn/gsa-human/) under accession HRA000021; Somatic mutations in normal cells were downloaded from the DSMNC database (https://dsmnc.big.ac.cn/).
PIVar algorithm are available at GitHub (https://github.com/WeiWenqing/PIVar). All computational tools used with default parameters in this study are available for download as open-source software and are detailed in “Methods”.