Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2025 Aug 4;21(8):e1013351. doi: 10.1371/journal.pcbi.1013351

Mutations in tumor signaling, metastases, and synthetic lethality establish distinct patterns

Bengi Ruken Yavuz 1, Ugur Sahin 2, Hyunbum Jang 1,3, Ruth Nussinov 1,3,4,*, Nurcan Tuncbag 5,6,7,*
Editor: Pedro Mendes8
PMCID: PMC12342271  PMID: 40758706

Abstract

Effective identification of oncogenic mutations is essential for diagnosis, forecasting resistance, and metastasis in remission. It is required for an optimal drug regimen. We develop a framework to discover mutations that co-exist in different oncoproteins, and those that are excluded, likely encoding oncogene-induced senescence. First, mapping the proteins onto pathways assists combinatorial drug selections and helps to detect metastases. Second, it provides the molecular basis for synthetic lethality, to date investigated at the genome level. Our pan-cancer profiles of ~60,000 tumor sequences, detect 3424 co-existing tumor-specific mutations. Mapping them onto pathways indicates that they preferentially promote specific primary tumors. We uncover metastatic mutations and provide metastatic breast-cancer markers. This work not only clarifies the mechanistic basis of intratumor mutational diversity but usefully reveals markers for metastasis in patients’ genomes and introduces a novel computational framework for detecting metastasis based on tumor mutational profiles. Mapping the mutations onto pathways provides an invaluable metastasis-targeting resource, guiding drug combinations.

Author summary

Cancer is driven by combinations of mutations in specific proteins in certain cellular pathways; identifying these combinations can reveal new ways to treat the disease more effectively. We analyzed nearly 60,000 tumor genomes from a wide range of cancers to identify mutations that frequently occur together in different cancer driving proteins, as well as combinations that rarely appear in the same tumor. Co-occurring mutations may indicate signaling pathways that are prone to cancer growth and metastasis. In contrast, mutually exclusive mutations may point to processes like senescence or vulnerability to specific treatments. By mapping these mutation patterns onto known cellular pathways, we found thousands of tumor-specific mutation pairs, many linked to distinct cancer types. We also identified key mutations associated with metastasis, including markers specific to metastatic breast cancer. Our study introduces a new computational framework for detecting metastasis and guides drug selection based on the tumor mutational profile. This approach reframes tumor heterogeneity from cell signaling perspective and provides a resource for developing personalized cancer therapies.

Introduction

Here we discover and interpret patterns of mutations in tumor signaling and metastases. Decades ago, it was proposed that cancer evolution requires more than a single mutation [14]. The minimal number has been debated since, and their cancer-specific identification in the proteins, and pathways, were sought [57]. This is crucial since subclones harboring rare resistant mutations proliferate following decimation of sensitive cells [8]. These couple with gene rearrangements, fusions, and overexpression. All increase the number of active protein states [9]. Passenger mutations were observed to be unevenly distributed across cancer genomes, appearing to stem from regional chromatin accessibility [10,11]. Below, we briefly overview mutations in tumor signaling and metastases from our perspective.

Co-occurrence (or exclusivity) of two mutations in different genes (i.e., in trans) are a random event in cancer evolution. If two mutations in different genes occur less or more frequently than expected, they are mutually exclusive or co-occurring, respectively [12,13]. Mutual exclusivity was mostly observed in genes in the same or redundant pathways; co-occurrence was mostly observed in different, or parallel pathways [14]. In our definition, if the pathways recruit the same downstream protein families, they are redundant; if evolutionary-independent, they are parallel. Exclusivity has been attributed to the expectation that acquiring a second strong mutation in the same pathway is unsustainable for the cell since it risks its death [12,13]. It was proposed as associated with tumor subtype, synthetic lethality, and positive selection [15]. We clarified that co-occurrence is restricted [16,17], since a sustained, additive effect on signaling strength of same — or redundant— pathway driver mutations is likely to hyperactivate the proliferation signal, triggering an oncogene-induced senescence (OIS) cellular program [1820].

OIS is a mechanism the cancer cell uses to proliferate- yet dodge cell death. Cancer cells opt for combinations of potent mutations. As early as 1953 [1], and again in 1969 [2], 1971 [3], 1999 [4], 2002 [5], 2015 [21], and more recently, it has been established that multiple co-existing mutations are required for the emergence of cancer. We also know that the number increases during proliferation. Yet, cells survive, suggesting that OIS combinations were sidestepped, or more likely, since somatic mutations emerge sporadically, cells harboring excluded combinations did not endure [22].

OIS, a tumor-suppressive mechanism arresting cell cycle progression, can be the main reason for excluding the co-occurrence of driver mutations in the same, here MAPK, pathway [2325]. Irreversible senescence expresses strong oncogenic stimuli [26]. Overexpression of oncogenes such as KRAS, BRAF, and MYC, which also generate strong proliferation signal, induce OIS, as does increased PI3K/AKT signaling. Loss of PTEN, which negatively regulates the PI3K/AKT pathway, can trigger OIS through a p53-dependent pathway [27], or via coupling with strong PI3K mutational variants. Both constitutively generate signaling lipid PIP3. Sustained hyperactivation of the PI3K/AKT/mTORC1 pathway results in OIS [28,29]. Examples of mutually exclusive relations between gene pairs include BRCA2-TP53, BRCA1-PARP1, and PTEN-PIK3CA in breast cancer; and BRCA1-CCNE1, BRAF-KRAS, ERBB2-KRAS in ovary cancer [30]. Tumor suppressors can be advantageous and detrimental contingent upon the oncogenic context [31]. A third hit—within the same or a different gene in the pathway—may represent alternative routes to tumorigenesis [32].

The mechanisms underlying the mutual exclusivity of oncogenes may depend on the tumor type and interacting oncoproteins. KRASG12D and BRAFV600E are mutually exclusive in lung cancer. Their co-expression triggers oncogene-induced senescence (OIS) [33]. Impaired B-Raf activation may avoid OIS, contributing to mutual exclusivity among certain BRAF mutations [34]. Activating mutations in RAS- and RAF-family proteins, especially the strong BRAFV600E hotspot is one example [35,36]. Absence of co-occurring NRAS and BRAF variants, such as BRAFV600E, in the TCGA skin cutaneous melanoma cohort illustrates mutual exclusivity. Similarly, BRAF and RAS gene family mutations are mutually exclusive in metastatic colorectal cancer tumors [35].

Here we aim to discover (i) cancer-specific mutations that co-exist (doublets or double mutations) in different proteins in tumor-specific signaling, in primary tumors and in metastases, and (ii) those that are excluded. These are significant aims. For the first (i), knowledge of mutations that co-exist in different proteins and mapped onto pathways, can aid in knowledge-based combination drug strategy [37,38], and identify cancer metastases. For the second (ii) synthetic lethality, where a strong mutation in one gene product can be sustained but not in two, parallels excluded multiple strong drivers, clarifying its molecular basis. Related to (i), is the cardinal question of whether same tumor type cells which harbor common driver mutations are likely to adopt common drug resistance mechanisms.

Here we reveal rare but impactful mutation patterns across proteins that drive distinct phenotypes, informing oncology of drug combinations. We observe that mutations disrupting similar functions are mutually exclusive. As to metastatic tumors, some mutations in trans are significantly higher. We provide a list of actionable mutations for metastatic tumors in breast cancer cohort and a tree of double, triple, and quadruple co-existing patterns across the pan-cancer metastatic tumors. These also address the questions of why these signatures, and what can they tell us about the possible molecular mechanisms which are involved. As to synthetic lethality, which has been of broad interest due to its pharmacological potential [3942], we provide a list, a rationale, and raise the possibility that OIS can provide the molecular basis of synthetic lethality.

Results

Tissue distribution and functional relevance of co-existing mutation pairs in cancer

To understand how mutations influence cancer evolution and how pre-existing mutations affect tumor initiation and metastasis [43], we analyzed statistically significant mutation pairs in different proteins using ~60,000 tumor samples from TCGA and GENIE pan-cancer datasets [44]. After identifying significant pairs, we mapped them to pathways and tissues and used Frequency Pattern Growth Tree analysis (treating each tumor as a transaction and alterations as items) to identify potential metastatic markers from co-existing mutations. Our analytical approach is outlined in Fig 1.

Fig 1. Overview of the study.

Fig 1

We obtained tumor mutations from TCGA and AACR GENIE datasets. Then, we applied Fisher’s Exact Test followed by Benjamini-Hochberg multiple corrections. After obtaining 3424 significant mutation pairs, we mapped their components to pathways and tissues for functional interpretation. This helped identify co-altered pathways and protein pairs harboring the mutations specific to tissues. We also identified metastatic tumor markers by utilizing the Frequency Pattern (FP) Growth Tree algorithm.

The distribution of metastatic tumors across tissues with at least 50 samples is shown in Fig 2A. Notably, skin, breast, intestine, lung, ovary, and prostate cancers include a relatively higher proportion of metastatic cases, although primary tumors still outnumber metastases in all tissues.

Fig 2. Statistics of mutation pairs in different proteins.

Fig 2

(A) The windrose plot shows the number of primary (blue) and metastatic (light blue) tumors among tissues with at least 50 samples. The number of all tumors for each cancer type is specified in parenthesis. Breast, bowel, and lung have the highest number of primary tumors. Although the total number of metastatic tumors are less than primary tumors overall, skin, breast, intestine, lung, ovary, and prostate are among the relatively more metastatic. (B) There are 3424 significant mutation pairs in different gene (Fisher’s Exact Test p < 0.05, Benjamini-Hochberg q < 0.3). Their compositions are D + P (n = 1554), D + D (n = 1268) and P + P (n = 602) where D is “Driver” and P is “Passenger” mutation. (C) Box plot showing the frequency of mutation pairs constituents. For each pair, the component with lower frequency is put into “Constituent 1” and the other into “Constituent 2”. The figure shows that most pairs are composed of a frequent and a rare mutation. (D) Stacked bar plot of the mutation pairs in different proteins for 3307 co-existing and 117 excluded. For co-existing mutations, the distribution is as follows: D + D = 1155, D + P = 1550, P + P = 602. Excluded pairs include 113 D + D and 4 D + P. D (Driver), P (Passenger). (E) We call a protein pair A|B if there are mutations on proteins A and B. We divided the pairs into two groups: if the pair belongs to a common pathway, we put it into “Shared Signaling Pathway”, otherwise “Not Share Signaling Pathway”. 1284 protein pairs share at least one signaling pathway where the corresponding mutations are: 754 D + D, 480 D + P, 36 P + P). 638 of them do not share a signaling pathway, and 205 are D + D, 324 D + P, 108 P + P. For the remaining pairs, at least one of the proteins harboring them does not have pathway information. Co-existing mutations are usually present on protein pairs residing in different pathways and this could imply some driver mutations need a helper passenger mutation from a different pathway.

Analyzing missense mutations yielded 3,424 significant mutation pairs across 285 proteins (Fisher Exact Test p < 0.05, Benjamini-Hochberg q < 0.3) (S1 Table). Using the Catalog of Validated Oncogenic Mutations from the Cancer Genome Interpreter [45], mutations were labeled as drivers (D) or passengers (P). The pairs comprised 1554 D + P, 1268 D + D, and 602 P + P combinations (Fig 2B). We categorized mutation pairs by placing the lower frequency mutation as “Constituent 1” and the higher as “Constituent 2” (Fig 2C, and Fig A in S1 Text). Most pairs showed one frequent and one rare mutation. Of the total pairs, 3307 co-existed and 117 were excluded (Fig 2D, S1 Text). Co-existing pairs included 1155 D + D, 1550 D + P, and 602 P + P, while excluded pairs comprised 113 D + D and 4 D + P.

To assess whether mutually exclusive gene pairs in our analysis were enriched for synthetic lethal interactions, we compared our results with established synthetic lethality databases (SynLethDB and BioGRID [46,47]). We identified that 32% (12/37) of mutually exclusive pairs overlapped with known synthetic lethal interactions. Statistical analysis using Fisher’s exact test revealed significant enrichment (p-value = 9.33 x 10-6, OR = 6.32), indicating that mutual exclusivity strongly correlates with synthetic lethality. The overlapping pairs included prominent cancer-related genes such as AKT1|PIK3CA, BRAF|KRAS, and EGFR|TP53. In contrast, co-occurring gene pairs showed no significant enrichment for synthetic lethality (p-value = 0.895, OR = 0.54), suggesting that mutual exclusivity serves as a more reliable indicator of synthetic lethal relationships and may be valuable for identifying clinically relevant therapeutic targets.

To address whether mutually exclusive mutations potentially encoding oncogene-induced senescence (OIS), we conducted Gene Ontology enrichment analysis at MSigDB of the mutually exclusive mutation set. This analysis revealed significant overrepresentation of processes critical to cellular stress responses, including apoptotic processes (FDR q-value = 3.13 x 10-10), regulation of cell population proliferation (FDR q-value = 1.2e-9), and regulation of programmed cell death (FDR q-value = 7.04 x 10-9). Additional enrichment was observed in pathways involving phosphorus metabolic process regulation, protein modification processes, and phosphorylation regulation (FDR q-values between 7.38 x 10-10 and 3.31 x 10-9). These enriched terms reflect key hallmarks of oncogene-induced senescence, particularly the balance between proliferative signaling and growth arrest characterizing the senescent state, providing substantial evidence supporting our hypothesis that mutually exclusive mutations are enriched in pathways contributing to or responding to oncogene-induced stress [48,49].

Our analysis identified 3424 pairs with significant co-existing mutations (denoted as A|B when mutations occur in proteins A and B in the same tumors). Analysis of pathway relationships using the KEGG database (detailed methods in S1 Text) revealed 1280 protein pairs sharing at least one pathway (754 D + D, 487 D + P, 39 P + P), while 638 pairs (206 D + D, 324 D + P, 108 P + P) did not share pathways (Fig 2E). We identified 1,284 protein pairs containing 3,424 doublets and examined their tissue distribution in two categories: oncogene-oncogene (OG-OG) and oncogene-tumor suppressor (OG-TSG) pairs (detailed methods in S1 Text). Node colors show the fraction of samples with mutations in both proteins, normalized to samples with mutations in either protein per tissue. For clarity, we displayed OG-OG pairs with >10 mutation pairs (47 total) (Fig B in S1 Text) and OG-TSG pairs with >20 mutation pairs (54 total) (Panel B of Fig B in S1 Text).

The odds ratio analysis assessing whether the co-existing mutation constituents occur more (or less) frequently than expected under the null hypothesis of independence identified 3,307 co-existing mutation pairs. Notably, 45% of the significant co-occurring pairs involved a driver–passenger (D–P) combination, suggesting that passenger mutations may not arise entirely at random. Instead, they may be functionally tolerated or even selected in the presence of specific driver mutations, hinting at a possible cooperative or permissive relationship. In this context, the passenger mutation could contribute to pathway crosstalk, fine-tune signaling, or influence the tumor phenotype in a driver-dependent context. Our findings suggest a non-random pattern of co-occurrence, possibly indicating that certain passenger mutations are selectively maintained in the presence of specific drivers, potentially supporting tumor progression through complementary or compensatory mechanisms across pathways.

Most mutually exclusive pairs comprise driver mutations of varying frequencies, indicating strong oncogenic signals are complemented by less frequent mutations. These patterns highlight the prognostic potential of mutation combinations (detailed methods in S1 Text), and functional evaluation for therapeutics.

Co-existing KRAS and TP53 mutations alter transcriptional networks in pancreatic cancer

A driver alteration can influence protein signaling and transcriptional profiles. In transcriptional regulation networks, altered proteins may impact upstream activities via feedback loops [50]. We analyzed transcriptional differences between pancreatic adenocarcinomas (PAAD) with mutation pairs and those with single mutations.

KRASG12D is one of the most frequent mutations in pancreatic cancer, often paired with TP53 mutations that impair DNA binding. Among 1502 PAAD tumors, 1007 harbor KRASG12D mutations. Of these, 397 tumors (from TCGA and ACCR GENIE) exhibit significant mutation pairs involving KRASG12D and TP53, including mutations at positions 248 and 273 (DNA contact sites) and position 175 (affecting Zinc ion binding, destabilizing p53). Group 1 includes 24 PAAD tumors with KRASG12D-TP53 significant mutation pairs, while Group 2 (71 tumors) contains single mutations in either KRAS or TP53, or mutations not contributing to significant pairs. Group1 and Group 2 contain tumors only from TCGA with the available transcriptomic data in PAAD.

Transcriptomic analysis revealed 394 differentially expressed genes (Fig 3, 137 upregulated, 257 downregulated; adjusted p-value < 0.05, |log2FC| > 0.5) between Groups 1 and 2, with p-values corrected for multiple hypothesis testing using the Benjamini-Hochberg method. Group 1 tumors showed enrichment of proteins involved in immune response, cell proliferation, and cell-cell signaling. Using the TRRUST (v2) dataset, 55 upstream transcription factors (TFs) were identified (Fig 3), including SP1 (Sp1), SP3 (Sp3), NFKB1, JUN and TP53 (p53). A regulatory network constructed with CancerGeneNet highlighted upregulated proliferation phenotypes, aligning with the transcriptional impact of TP53 driver mutations compared to single mutations [51,52].

Fig 3. Transcriptome analysis of tumors with mutation pairs.

Fig 3

Comparison of gene expression profiles of double mutant and single mutant TP53 cases in KRASG12 mutant PAAD tumors revealed 394 differentially expressed genes (137 upregulated and 257 downregulated) between Group 1 and Group 2. Group 1: PAAD tumors having at least one significant pair composed of KRASG12 and TP53 mutations); Group 2: PAAD tumors having mutation either in KRAS or in TP53 or mutation in any of these proteins that does not contribute to a significant pair. The heatmap shows the z-scores of differentially expressed genes (DEGs) across patient tumors. 55 TFs are retrieved as the transcription factors regulating the differentially expressed genes including the main regulators SP1, SP3, NFKB1, JUN and TP53 obtained from TRRUST version 2 dataset, a manually curated database of human and mouse transcriptional regulatory networks. SP1 (encodes Sp1), SP3 (encodes Sp3), NFKB1, JUN and TP53 (encodes p53).

Sp1 (Specificity Protein 1) is overexpressed in various tumor types and correlates with poor patient survival. High levels of Sp1, Sp3, and Sp4 contribute to cancer cell development, survival, and migration to tissues, including the pancreas [53,54]. In pancreatic ductal adenocarcinoma (PDAC), co-occurring TP53 and KRAS mutations drive tumor growth and metastasis through interaction with CREB-1 (cAMP responsive element binding protein 1) [55]. Mutant p53 and CREB-1 upregulate FOXA1, a pro-metastatic pioneer transcription factor that activates its network and promotes WNT/β-catenin signaling, a key driver of PDAC metastasis. Inhibiting CREB-1 reduces FOXA1 and β-catenin expression, limiting PDAC metastasis. Additionally, TP53 missense mutations in KRAS-transformed PDAC are linked to poor differentiation and exhibit gain-of-function properties [56].

Collectively, these show that co-existing KRAS and TP53 mutations alter transcriptional networks in pancreatic cancer, calling for reconsideration of PDAC drug regimen combinations accounting for both to restrain drug resistance.

Functionally similar alterations do not co-exist in tumors

We define alterations with similar phenotypic outcomes as “functionally equivalent,” which rarely or never co-occur and are often mutually exclusive. One example involves KRASG12D and CDKN2A deletions in pancreatic cancer. Mutually exclusive pairs may act in separate pathways but yield similar phenotypic effects. While co-existing mutations in KRAS are rare, they drive downstream changes promoting a proliferation phenotype. Fig 4A highlights mutations in KRASG12D-mutated PAAD tumors, where alterations in SMAD4, CDKN2A, U2AF1, and GNAS are mutually exclusive and rarely co-occur. CDKN2A and SMAD4 mutations, located in binding regions, rarely coexist in tumor proteins. CDKN2A mutations (R58*/Q, R80*, H83N/R/Y, and D84G) occur at the CDK6 interface (PDB: 1BI7) (Fig 4B), with all but R58*/Q spatially clustered. These mutations disrupt CDKN2A-CDK6 interactions, inhibiting cyclin D binding and retinoblastoma phosphorylation, thereby suppressing proliferation. SMAD4 mutations at D351 and R361 localize to the SMAD4/SMAD3 heterodimer interface [57] (PDB: 1U7F (trimer)) (Fig 4C). These mutations disrupt SMAD homo- and hetero-oligomerization. All mutually exclusive mutations shown in Fig 4 are on non-overlapping paths linked to SMAD4 (encodes Smad4) and their functional outcome may be similar. U2AF1 and GNAS (encodes Gαs) paths are linked to Ras and downstream proliferation phenotype (Fig 4D).

Fig 4. Co-existing mutations in PAAD tumors.

Fig 4

(A) Oncoprint of the mutations that coexist with KRASG12. KRASG12 partners with CDKN2A, SMAD4, GNAS, and U2AF1 which show a mutually exclusive pattern. (B) The structure of CDKN2A/CDK6 complex shows that the mutations R58, R80, H83, and D84 are in the interface region between CDKN2A and CDK6. (C) SMAD4 mutations D351 and R361 are in the interface between SMAD4/SMAD3 complex. (D) A subnetwork of highly frequent alterations co-existing with KRASG12 leading to proliferation and oncogenic signals.

Co-existence of MYC amplification and CDKN2A deletion is relatively rare. MYC drives proliferation, while CDKN2A, a tumor suppressor, restricts cell cycle progression in response to oncogenic or cellular stress. Their cooperation is linked to TP53 mutations in the DNA-binding domain. This aligns with the absence of multiple strong alterations in the same or functionally similar pathways, as such combinations can trigger OIS [58,59].

Altogether, such data pointing that functionally similar alterations do not co-exist in tumors, bespeak of which drug combination may be ineffective.

Distinct oncogenic signaling networks drive breast and pancreatic cancer subtypes

The frequency of co-occurring mutations varies across tissue types, with affected proteins spanning multiple signaling pathways. We analyzed 3,424 significant mutation pairs, examining their tissue-specific distribution across 46 KEGG pathways (see S1 Text). Bubble plot (Fig 5A) illustrates the number and fraction of tumors with mutation-associated pathways across 10 tissues and 25 pathways. Bowel and uterus tissues show mutation pairs across all pathways, while breast, lung, and pancreas tissues exhibit accumulation in specific pathways, including PI3K/AKT, MAPK, FoxO, Neurotrophin, and mTOR. Mutation pairs in breast, lung, and pancreas tissues accumulating in specific pathways prompt the construction of cancer-specific subnetworks for breast and pancreatic tissues. Seed proteins with co-existing mutations were selected based on thresholds: breast (≥0.5%) and pancreas (≥1.5%).

Fig 5. Breast cancer specific subnetwork.

Fig 5

(A) Bubble plot representing the number and fraction of pathway-mutation pair- tumors in each tissue by node size and color, respectively. On the x-axis and y-axis there are 10 tissues and 25 pathways, respectively. Bowel and uterus have mutation pairs across all listed pathways while in breast, lung and pancreas tissues they accumulate in specific pathways including PI3K/AKT, MAPK, FoxO, Neurotrophin, mTOR. (B) Breast cancer specific subnetwork shows seed proteins, oncogenes and tumor suppressors obtained with the Page Rank algorithm from Omni Path with the seed proteins that have mutation pairs, and the tissue specific fraction is greater than 0.5. Blue nodes are the proteins from the PPI network, pink nodes are the seed proteins which are mutation pair components. Transcription factors (TFs) are V-shaped nodes. The border color is green if the gene is tumor suppressor gene and red if oncogene. The edges are solid lines if there is an edge between nodes in the PPI network that contribute to a mutation pair. Dashed lines depict the direct interaction in the PPI. Source and target shape are derived from TRRUST representing the activation or repression. TFs RB1 (encodes the protein Rb1), APC (encodes the protein APC), CTNNB1 (encodes Catenin Beta-1), MYC (encodes the protein Myc), TP53 (encodes the protein p53).

From 8,512 breast and 1,853 pancreatic tumors, a directed PPI network with 20,029 nodes and 276,179 edges was derived from OmniPath [60] (see S1 Text). For breast cancer, 15 seed proteins were used, with initialization scores set to 1 for the PageRank algorithm (α = 0.85). The resulting subnetworks contained 59 nodes and 343 edges for breast (Fig 5B and Fig C in S1 Text) and 40 nodes and 267 edges for pancreas (Fig D in S1 Text). Transcription factors regulating subnetwork nodes were identified, along with their functional roles (activation or repression) to determine arrow types.

To evaluate the robustness of gene prioritization to the choice of the damping factor (α) in the personalized PageRank algorithm, we performed a sensitivity analysis across multiple α values (0.70, 0.75, 0.80, 0.90, and 0.95). Using the same tissue-specific seed genes and the OmniPath protein–protein interaction (PPI) network [60], we compared gene rankings generated with each α value to those obtained with α = 0.85 using Spearman rank correlation. The correlations were consistently high across all tested values, demonstrating strong agreement in rankings. For the pancreas-specific analysis, Spearman coefficients ranged from ρ = 0.9922 to ρ = 0.9997, while for the breast-specific analysis, they ranged from ρ = 0.9944 to ρ = 0.9998. These findings indicate that gene prioritizations are highly robust to variations in the damping factor, supporting the reliability of results obtained with α = 0.85.

For breast cancer, the resulting subnetwork includes the TFs RB1 (encodes the protein Rb1), APC (encodes the protein APC), CTNNB1 (encodes Catenin Beta-1), MYC (encodes the protein Myc), TP53 (encodes the protein p53). Oncogenes or tumor suppressor proteins (nodes) have red and green borders, respectively. Gene set enrichment analysis in MsigDB [61,62] and Webgestalt [63] yielded subnetwork proteins enriched in the following pathways prominently (panels A and B of Fig E in S1 Text): GPCR Signaling (41 proteins overlapped), G-alpha signaling events (28 genes), Chemokine signaling pathway (25 genes), GnRH signaling pathway (21 genes), Gap junction (20 genes), Endothelins (19 genes), LPA receptor mediated events (19 genes). GPCRs are crucial in tumorigenesis, angiogenesis, and metastasis by modulating heterotrimeric G-proteins pathways [64]. GnRH agonists inhibit cancer cell growth and invasion while suppressing ovarian steroid production [65,66]. Chemokines activate pathways like PI3K/AKT/NF-κB and MAPK/ERK, promoting proliferation [67,68]. Targeting these pathways offers potential for breast cancer treatments [69]. In the pancreas cancer subnetwork, key TFS such as APC (encodes APC), RB1 (encodes pRb), DNMT3A, TP53 (p53), MYC, CTNNB1 (Beta Catenin-1) and SMAD4 (Smad4) contribute to mutation pairs. Gene set enrichment analysis with WebGestalt (Panels A and B of Fig F in in S1 Text) revealed upregulated pathways like PI3K/AKT/NF-κB and MAPK/ERK signaling, and platelet activation.

To contextualize our results and distinguish biologically meaningful signals from random network effects, we examined pathways enriched in fewer than 50% of subnetworks reconstructed from randomized interactomes in breast and pancreatic cancers. These infrequently enriched pathways may represent more robust, non-random features of the original network topology. Notably, pathways such as IL-17 signaling, Hedgehog signaling, and Necroptosis—known to regulate inflammation, cell death, and immune evasion—appeared in only 39–44% of random subnetworks, suggesting genuine biological relevance in breast cancer subnetworks. In contrast, pathways like Autoimmune thyroid disease, Phagosome, and systemic lupus erythematosus were rarely enriched (≤10%), and several—including Olfactory transduction and Lysosome—were never enriched in the random subnetworks.

Pathways such as GPCR signaling, GnRH signaling, and chemokine-mediated cascades, while repeatedly enriched, have broad and pleiotropic roles in cellular signaling and may dominate enrichment outputs due to their general connectivity in the interactome. Despite this, the consistent enrichment of key oncogenic pathways—such as PI3K/AKT and MAPK signaling—in both the original and randomized settings support their centrality in cancer progression and aligns with the transcription factor disruptions identified in the pancreatic cancer subnetwork. While these pathways appear most prominent in terms of overlap size, their recurrence in analyses of randomized networks suggests that they may arise due to inherent network topology rather than specific biological signal.

In the pancreas cancer subnetwork, several signaling pathways—including IL-17, Adipocytokine, and Hedgehog signaling—were each enriched in 40% of subnetworks, while Pertussis signaling, involving GPCR activation, appeared in 44%. These pathways are known to influence inflammation, metabolism, and developmental signaling, all central to tumor biology. Oocyte meiosis (25%) may reflect dysregulation in cells. Their selective presence in the original network, but low recurrence across randomized versions, suggests these enrichments reflect disease-specific biological features.

Complex biological functions are mostly managed through protein-protein interactions (PPIs), and mutations occurring in proteins may alter the cell phenotypes [70]. Alterations perturbing PPIs can impact drug outcomes [7173]. Detection of druggable oncoPPIs [71] is promising since some could constitute more cancer-specific therapeutic targets [74,75], although the pre-existing mutation load can impact the efficacy [8]. Among the Fujian cohort mutations classified as metastatic markers in gastric carcinomas, one is a PTPRT mutation [76], and another, Chondrosarcoma TERT promoter mutation is a metastasis marker [77]. A real-world clinicogenomic dataset permitted the discovery of biomarkers that predict treatment outcomes that affect patients’ survival [78]. Treatment-specific genetic alterations in non-small cell lung cancer (NSCLC) include mutually exclusive and co-occurring mutations [79,80]. A whole-genome sequencing of a large Swedish cohort study [81] aimed to identify prognostically relevant co-occurring mutations in microsatellite stable colorectal cancers. The analysis revealed 23 significant co-mutations, with specific pairs such as APC-TCF7L2 and BRAFV600E-RNF43 showing distinct associations with favorable or poor survival outcomes, highlighting the added prognostic value of co-mutation profiling over single-gene analysis.

Overall, coupled with the literature, our analysis indicates that pathways carrying co-existing mutations can help in fathoming the evolution of certain cancer types and guide target selection.

Co-existing mutations are mostly in primary tumors and some rare mutations are signatures of breast cancer metastatic tumors

Genomic comparisons between primary and metastatic tumors show only moderate increases in mutation burden, with common mutations in TP53, MYC, and CDKN2A [8285]. While these studies reveal genomic similarities, key questions remain about the molecular mechanisms driving metastatic phenotypes and their activation [86,87]. Analysis of 40,979 tumors across 25 cancer types found no distinct mutational patterns associated with metastasis [85].

Analysis of the AACR GENIE cohort’s primary and metastatic breast cancer samples revealed that while over 70% of mutation pairs exist in primary tumors, some pairs are metastasis specific. Using frequency pattern growth tree analysis (Fig 6), we examined mutation patterns in the Metastatic Breast Cohort (n = 2385), comprising 52% IDC, 25% BRCA, 9% ILC, 4.5% BRCNOS, and other subtypes. The FP growth tree, analyzing 456 mutations across 2385 samples, yielded 31 nodes and 51 edges, with 40 significant mutation pairs serving as potential metastatic markers (Fig 6A, S2 Table). Strong associations emerged between mutations in ESR1, GATA3, and PIK3CA in metastatic tumors, particularly ESR1 mutations at positions 380, 536, 537, and 538 pairing with PIK3CA mutations at positions 542, 545, and 1047 (Fig 6B). A volcano plot analysis (Fig 7A) illustrated co-existence and mutual exclusivity patterns of metastatic markers, while further analysis confirmed ESR1|PIK3CA mutation pairs as significant metastatic indicators (Fig 7B, p-value < 10-13). GATA3 mutations at positions 293, 335, and 408 showed coupling with ESR1 or PIK3CA mutations, consistent with GATA3’s known functional relationship with ESR1 in transcriptional regulation [88]. The relationships between mutations were visualized in a bubble plot (Fig 7C), excluding 330 pairs with fewer than three co-existing mutations. Analysis of 393 metastatic breast tumors from the Hartwig Medical Foundation (HMF) dataset [89] reveals that 22 tumors harboring both PIK3CA and ESR1 mutations. Specifically, PIK3CA mutations at positions 542, 545, and H1047 paired with ESR1 mutations at positions 380, 536, 537, and 538, confirming the metastatic markers identified by the FP growth tree algorithm.

Fig 6. Rare mutations among the co-existing mutations are specific to metastatic tumors.

Fig 6

(A) Frequency pattern growth tree of missense mutations shows breast cancer metastatic markers where links represent association between mutations. Edges represent correlation strength. Ancestral mutations are shown as dark blue nodes, leading to consequential light blue mutations. Mutation frequencies are represented by node sizes. Mutation pairs accommodated by ESR1, GATA3, and PIK3CA proteins are enriched in metastatic tumors. 45 mutation pairs from 31 nodes and 51 edges in the FP growth tree are among those evaluated as metastatic markers. (B) ESR1|PIK3CA mutations are enriched in breast cancer metastatic tumors derived from FP Growth Tree in panel A. ESR1 mutations at positions 380, 536, 537 and 538 are paired with the major drivers of PIK3CA at positions 542, 545, and 1047.

Fig 7. Metastatic marker statistics.

Fig 7

(A) We visualize each alteration pair with a volcano plot, where the x-axis represents log10(OR) and the y-axis represents log10(Fisherpvalue). Pairs with |log10(OR)|>0.75 indicate strong effect sizes, and a threshold of log10(p)>1.65 corresponds to p < 0.05, our chosen significance level as this threshold is widely used in biological research to denote statistical significance. (B) Propensity of mutation pairs from ESR1|PIK3CA and from other proteins where one side is from ESR1 or PIK3CA in metastatic tumors. (C) The bubble plot evaluating all possible relationships of the mutations at the FP growth tree, where node color depicts coexistence characteristics, and node radius is proportional to the Chi-Square value of the same cohort.

In addition to these, we provide a tree of double, triple, and quadruple co-existing patterns across the pan-cancer metastatic tumors (Panels A and B of Fig G in S1 Text). To construct the tree in Panel B of Fig G, a minimum support of 7 × 10 ⁻ ⁵ and a minimum confidence threshold of 0.95 were used. In the tree, the node TP53R273 is followed by either one of the mutations on ATRX at the positions 1045, 1049, and 1426 where the third node of the triplet is IDH1R132. IDH1, TP53, and ATRX mutations are associated with low-grade astrocytomas collaborate to inhibit SOX2 (a nucleosome-binding pioneer TF acting to make genes accessible to the transcription machinery) and prevent the development of human neural stem cells [90,91]. Quadruples include mutations from BRAF, MSH3, BLM, RNF43, PLK2, NBN. In response to the context-dependent nature of cancer genomics, we derived a breast cancer–specific co-mutation tree illustrating doublet, and triplet mutation patterns, rather than performing a pan-cancer analysis. Due to the limited sample size of the metastatic breast cancer cohort, we applied less stringent thresholds (minimum support = 0.0007, minimum threshold = 0.01). The triplets in metastatic breast cancer include ESR1D538+PIK3CAH1047 + SF3B1K700 and PIK3CAE545 + GATA3M293+NUP93E14. To improve interpretability, direct mutation pairs connected only to the null root were excluded. The resulting tree is shown in Fig H in S1 Text, and the corresponding data is provided in S3 Table.

As breast cancer represents 25% of female cancers globally, with hormone receptor-positive being the most prevalent subtype, marked by estrogen receptor (ER) expression. While endocrine therapy targeting ER is standard, resistance often develops through ESR1 mutations, affecting up to 36% of metastatic breast cancer (MBC) patients [92]. Studies by Gerratana et al. revealed that ESR1 codon 537 variants associated with ER and RAF pathway SNVs, MYC CNVs, and bone metastases, while codon 538 variants correlate with cell cycle SNVs and liver metastases. PIK3CA codons 1047 and 542 showed associations with CNVs, informing therapeutic strategies [93].

Thus, although primary and metastatic tumors share driver mutations [85], most are in primary tumors. Some rare mutations appear enriched in metastatic tumors and may serve as potential markers, although further validation with larger datasets is required to establish their significance as metastatic signatures. A more comprehensive cross-cancer metastases analysis is vital for early detection.

Discussion

Tumorigenesis involves complex protein alterations [9497], where co-existing mutations in different proteins (in trans) undergo evolutionary selection [12,13]. While mutations in parallel pathways often co-exist, those in identical or redundant pathways typically show mutual exclusion, reflecting cellular sustainability limits [14]. This exclusion pattern links synthetic lethality and oncogene-induced senescence (OIS) [1520,98], where hyperactivation of proliferation signals becomes unsustainable.

Understanding cancer evolution requires systematic investigation of mutation epistasis [15,98,99102], supported by both driver-passenger mutation classification tools [30] and methods for identifying mutually exclusive gene sets [103]. There are several tools for identifying biologically meaningful, mutually exclusive gene sets [30], including statistical frameworks (MEScan [103], MEGSA [104], CoMET [105]) and driver module detection (MEMo [106], CoMDP [107]). These approaches, ranging from statistical frameworks to driver module detection [104108], culminate in tools like SELECT [108] that capture evolutionarily dependent alterations affecting drug response. Our analysis of in trans mutations identified 3424 significant co-existing mutations, predominantly comprising one rare and one frequent mutation, with 3307 co-existing pairs and the remainder being mutually exclusive. Co-existing mutations can enhance oncogenic signaling, while mutual exclusivity may result from unsustainable hyperactivity through same-pathway mutations or combinations with pioneer transcription factors like FOXA1 [109]. The FP-growth tree method [110] revealed increased frequency of PIK3CA and ESR1 mutation pairs in metastatic versus primary tumors.

Previous research on in cis mutations (multiple mutations in the same proteins) [44,51,111113] showed increased oncogenic activity and drug sensitivity, with double mutations enhancing PI3K signaling and tumor growth. Clinical trials suggest double-mutant breast cancers respond better to PI3K inhibitors than single-mutant cases [111], though exceptions exist, as seen in EGFRL858/T790’s increased resistance compared to EGFRL858. Building on this, studies of PI3Kα variants showed that co-existing mutations—often combining a hotspot with a weaker mutation—can shift conformational ensembles toward more active states and expose mutation-specific cryptic pockets, underscoring the need for variant-specific combination allosteric therapies [114].

Molecular studies have advanced our understanding of metastatic mechanisms and prognostic factors in cancers, particularly focusing on mutation patterns and their clinical implications. Wang et al. analyzed 114 Chinese NSCLC patients, finding higher metastasis in older patients with TP53 mutations, while EGFR, ASXL2, and STK11 mutations correlated with better survival [115]. Studies on thyroid cancer highlighted advances in biomarker detection enabling personalized treatment [116]. Analysis of 154 lung cancer brain metastases revealed greater tumor heterogeneity than primary tumors, with shared mutations (TTN, TP53), enhanced mitochondrial metabolism but suppressed immunity; combining mitochondrial-targeting drugs with immunotherapy improved mouse survival [117]. In 1817 KRAS-mutant lung adenocarcinomas, KEAP1 and SMARCA4 mutations correlated with metastasis and poor survival, while STK11’s impact varied by metastatic site and KEAP1 status [118]. Potential prognostic biomarkers in lung adenocarcinoma were identified through integrative analysis of mRNAs, miRNAs, and lncRNAs and cell cycle–related ceRNA network [119].

A limitation of our approach is the lack of redundancy reduction by tissue type, which may have allowed cancers with larger sample sizes—such as breast and pancreas—to dominate some signals. While this enabled pan-cancer discovery, it may introduce tissue bias. We later addressed this by identifying the tissue distribution of key doublets and selecting case studies from cancer types with both enrichment and sufficient sample size. Yet another limitation of our approach is the reliance on KEGG as pathway database; while KEGG is widely used and well-curated, different pathway resources such as Reactome [120] or WikiPathways [121] may yield alternative pathway overlaps due to variations in pathway definitions and gene assignments.

This study addresses two critical yet understudied challenges in cancer research: (i) proposing candidate markers potentially linked to metastatis; (ii) mutations whose combinations may be linked to oncogene-induced senescence (OIS), thus are rare, or absent in cancer genomes. As to point (ii), we conjecture that the absence of these combinations may be because the oncogenic signal they generate is too strong for the cell to tolerate. Mutations related to OIS likely emerge sporadically during cancer evolution, and further work is needed to validate this hypothesis. Cells do not have a mechanism for a priori negative selection. Rather, negative selection, or exclusion, is the outcome of cell death. Here we provide a list, a rationale, and suggest that OIS can provide the molecular basis to synthetic lethality, which has been investigated on the gene level. As to (i), to our knowledge, we provide a first candidate list of metastatic markers specific to metastatic samples, including actionable mutations for metastatic tumors, as well as a tree of double, triple, and quadruple co-occurrence patterns across the pan-cancer metastatic. This list (S2 Table) may help in early metastases detection. Advanced computational approaches allow detection of patterns. Co-existing mutation signatures for metastatic tumors, coupled with identification of the respective proteins and pathways, can create interactive maps. Linking them with drugs can create immensely useful tools for the attending physicians [122].

Finally, while the usefulness of metastatic markers to drug discovery is self-evident, exactly how to identify OIS mutations in protein pairs is less so, as indeed is also the case in synthetic lethality [123]. OIS restriction is observed in same- and redundant-pathways. Classically, drugs aim to block protein activity and downstream signaling. Introducing drugs that recreate their action is expected to lead to cell death. At the same time, controlling the signal is challenging, as strong signals can be associated with cell proliferation, but too strong a signal may result in oncogene-induced senescence.

Materials and methods

Data collection and processing

The data and preprocessing protocol is same as in [44] and the corresponding code and input data files are available at https://github.com/bengiruken.

Identification of significant co-existing mutations

Out of a total of 395,801 mutations with a variant allele frequency (VAF) greater than 0.125 [44], 20,157 mutations were found in at least three non-hypermutated tumor samples. We generated all possible binary combinations of these 20,157 mutations, excluding pairs from the same gene. For each unique mutation pair, we created a contingency table. This process yielded 366,573 potential co-existing mutations in different genes, which we evaluated across 59,048 non-hypermutated tumor samples using Fisher’s Exact Test for statistical significance where the contingency table is as follows (Table 1):

Table 1. Contingency table.

a (tumors with both mutations) b (has only first mutation)
c (has only second mutation) d (has neither of the mutations)

where d = 59,048 − (a + b + c) and 59,048 is the number of non-hypermutated samples. We did not apply any initial filter based on the number of tumors with co-existing mutations—cases where a = 0 were also tested. We then adjusted for multiple comparisons using the Benjamini-Hochberg method and proceeded with downstream analyses using 3,424 mutation pairs that had a q-value below 0.3 and were observed in at least three tumor samples. If a co-existing mutation constituent is among the 5601 driver mutations from the Catalog of Validated Oncogenic Mutations (Cancer Genome Interpreter [45]), it is labeled as known driver (D), otherwise passenger (P).

Transcriptome analysis

To identify differentially expressed genes in the group of patients with co-existing mutations compared to the single mutant patient group, we downloaded RNA-seq transcriptome data of the TCGA project from the cBioPortal database (https://www.cbioportal.org). We used median Transcripts Per Kilobase Million (TPM) values of RNA-seq data of PAAD cohort of TCGA. For the PAAD cohort, 177 patients with TPM values, we constructed two groups, where Group 1 is tumors having at least one significant co-existing mutation of type KRASG12D/V/C + TP53mutation and Group 2 is tumors having either single mutant KRASG12D/V/C or single mutant TP53. We calculated the log2FC value of each gene between the double mutant and single mutant groups by using the formula:

log2FC=Mean(GenesTPMvaluesamongDoubleGroupPatients)Mean(GenesTPMvaluesamongSingleGroupPatients) (1)

We identified differentially expressed genes between the double mutant tumors group and the single mutant tumors group (comparison of means of TPM values by Mann-Whitney U-Test). If |log2FC| > 0.5 and adjusted p-value < 0.05 we considered the corresponding gene as differentially upregulated or downregulated in the double mutant group. Benjamini-Hochberg procedure to control the false discovery rate (FDR) for multiple correction was performed using the multipletests function in the statsmodels Python package. We continued our analysis with these differentially expressed genes, and calculated z-scores of each gene as follows:

z=TPMvalueofGeneμσ (2)

where is µ is the mean of TPM values and σ is the standard deviation across all samples in the double mutant and the single mutant groups. After obtaining z-scores for each gene, we sorted the genes as downregulated and upregulated and represented these values as a heatmap (https://seaborn.pydata.org).

We used Webgestalt (http://www.webgestalt.org) for the gene set enrichment analysis where the functional database is selected from Reactome and significantly up- or down-regulated pathways are found. FDR threshold is selected 0.05 and the list of genes ranked by their logFC values are given as input.

Personalized PageRank algorithm

We used a network diffusion-based algorithm to find the most affected region of the interactome given a set of nodes. Given a directed or undirected graph G(v,e) where v ∈ V and e ∈ E and a set of seed nodes S ⊆ V, the personalized PageRank algorithm solves the seed set expansion problem, where it finds which additional nodes may exist in the community besides the nodes in S and ranks them according to their importance. We used the PageRank [124] function implemented in Python networkx library [125]. The damping parameter, alpha, is selected as 0.85 and the number of iterations are 100.

We applied the personalized PageRank algorithm to prioritize genes based on their proximity to tissue-specific seed genes within the OmniPath PPI network [60]. Following standard practice in network analyses and the original PageRank formulation [126], we set the damping factor α to 0.85, which balances local and global network exploration. To assess sensitivity to this parameter, we recomputed PageRank scores using α values of 0.70, 0.75, 0.80, 0.90, and 0.95, keeping the network and seed genes constant. We then compared gene rankings derived at each α to the rankings obtained with α = 0.85 using Spearman rank correlation, a non-parametric statistic appropriate for evaluating rank-order consistency.

Tissue specific subnetworks

To determine tissue specific seed proteins to perform Personalized PageRank algorithm for obtaining tissue specific subnetworks, we calculated the fraction of double-mutant tumors among all tumors in the corresponding tissue for each doublet. Then we evaluated the gene couples harboring co-existing mutation components as seed proteins if the fraction of the tumors with co-existing mutations is greater than 25%. We assigned initial weights as 1. We run the PageRank algorithm on the PPI network from OmniPath [60] with alpha = 0.85. We added the edges with score greater than the threshold 0.001 to the breast specific subnetworks.

We evaluated 46 signaling pathways from KEGG. We obtained pathway-double-mutant tumors by taking the union of all double mutants when at least one of the proteins belong to the corresponding pathway. We filtered the pathways that have less than 80 double mutant tumors and tissues and pathways that have less than 10% double mutant tumor rate. We constructed cancer type specific subnetworks for breast and pancreas tissues with seed proteins that have co-existing mutations and tissue specific double mutant fraction is greater than a certain threshold (thresholds are as follows: breast, 0.5; pancreas, 1.5). The number of seed proteins for breast cancer and pancreas cancer are 12 and 23, respectively.

Shared pathway membership between genes was determined based on overlap in KEGG gene sets, not by matching pathway names, to ensure functional relevance rather than semantic similarity.

To evaluate the robustness of pathway enrichment results against the native topology of the PPI network, we generated 100 randomized directed networks using a degree-preserving edge-swapping algorithm. For each randomized network, PageRank scores were computed using a damping factor of 0.85 and a fixed personalization vector. Genes exceeding a threshold score of 0.002 were used to reconstruct subnetworks.

We then performed pathway enrichment using the KEGG 2021 Human gene sets in gmt format with the GSEApy [127] package. A local Enrichr-based enrichment approach with retry mechanisms was used to ensure reliable results. For each pathway enriched in the original subnetwork, its frequency across the 100 randomized subnetworks was calculated and expressed as a percentage. This allowed identification of pathways that were specifically enriched in the original network and not due to network structural properties.

FP Growth Tree construction

We used the mlextend library’s FreqItems and AssociationRules functions for mining frequent item sets and association rules. The FP growth algorithm is selected for tree construction where each node represents one alteration and each edge in the tree represents the association of the nodes [110]. In the constructed tree, all nodes in the path from root to the distant node are associated with each other and strongly present together in the tumors. The implementation of this workflow or additional documentation on how the FP-Tree algorithm is adapted for this specific use case is available at https://github.com/ugur0sahin/FMPSeeker.

The tendency of the alterations to be specific to metastatic tumor is calculated by

propensity(i)=xi/NiX/N (3)

where xi is the number of metastatic tumors having co-existing mutation i, Ni is the number of tumors having co-existing mutation i, X is the number of metastatic tumors and N is the number tumors in our dataset.

To assess co-occurrence among alterations in the metastatic breast cancer cohort, we first restricted our alteration matrix to this subset of patients. We then evaluated all unique pairs of alterations using itertools.combinations() in Python.

For each alteration pair (A, B), we extracted binary presence/absence vectors across samples from a pandas DataFrame and constructed a 2 × 2 contingency table based on the four possible joint states:

  • a: both alterations present,

  • b: only alteration A present,

  • c: only alteration B present,

  • d: neither present.

These counts form the basis for statistical testing. We used scipy.stats.fisher_exact to compute both the odds ratio (OR), calculated as

OR=adbc

and the Fisher exact test p-value for each pair. Additionally, we applied scipy.stats.chi2_contingency to compute Pearson’s chi-squared statistic and corresponding p-value.

Note: We present further details of the downstream analyses of co-existing mutations in the supplemental information S1 Text.

Supporting information

S1 Text. Contains supplemental text, methods and figures.

(DOCX)

pcbi.1013351.s001.docx (24.5MB, docx)
S1 Table. Statistics of co-existing mutations on different proteins.

(XLSX)

pcbi.1013351.s002.xlsx (529.2KB, xlsx)
S2 Table. Metastatic breast cancer co-existing mutation markers.

(XLSX)

pcbi.1013351.s003.xlsx (12.9KB, xlsx)
S3 Table. Metastatic breast cancer triplet mutations in FP Growth tree.

(XLSX)

pcbi.1013351.s004.xlsx (9.3KB, xlsx)

Data Availability

The results shown here are in whole or part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga. All the data and codes are available at https://github.com/bengiruken/DifferentGeneDoublets. The codes and data for obtaining metastatic markers with the FP growth tree is available at https://github.com/ugur0sahin/FMPSeeker.

Funding Statement

This project has been funded in whole or in part with federal funds from the National Cancer Institute, National Institutes of Health, under contract HHSN261201500003I. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. This Research was supported [in part] by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research. NT is supported by funding from TUBITAK the Scientific and Technological Research Projects Funding Programs under the project number 121E245. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Nordling CO. A new theory on cancer-inducing mechanism. Br J Cancer. 1953;7(1):68–72. doi: 10.1038/bjc.1953.8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ashley DJ. The two “hit” and multiple “hit” theories of carcinogenesis. Br J Cancer. 1969;23(2):313–28. doi: 10.1038/bjc.1969.41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Knudson AG Jr. Mutation and cancer: statistical study of retinoblastoma. Proc Natl Acad Sci U S A. 1971;68(4):820–3. doi: 10.1073/pnas.68.4.820 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Boland CR, Ricciardiello L. How many mutations does it take to make a tumor? Proc Natl Acad Sci U S A. 1999;96(26):14675–7. doi: 10.1073/pnas.96.26.14675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tomlinson I, Sasieni P, Bodmer W. How many mutations in a cancer? Am J Pathol. 2002;160(3):755–8. doi: 10.1016/S0002-9440(10)64896-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kur J, Hasan N, Szybalski W. Physical and biological consequences of interactions between integration host factor (IHF) and coliphage lambda late p’R promoter and its mutants. Gene. 1989;81(1):1–15. doi: 10.1016/0378-1119(89)90331-4 [DOI] [PubMed] [Google Scholar]
  • 7.Liu Y, Zhang W, Jang H, Nussinov R. SHP2 clinical phenotype, cancer, or RASopathies, can be predicted by mutant conformational propensities. Cell Mol Life Sci. 2023;81(1):5. doi: 10.1007/s00018-023-05052-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nussinov R, Tsai C-J, Jang H. Anticancer drug resistance: An update and perspective. Drug Resist Updat. 2021;59:100796. doi: 10.1016/j.drup.2021.100796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Nussinov R, Yavuz BR, Jang H. Molecular principles underlying aggressive cancers. Signal Transduct Target Ther. 2025;10(1):42. doi: 10.1038/s41392-025-02129-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Polak P, Karlić R, Koren A, Thurman R, Sandstrom R, Lawrence M, et al. Cell-of-origin chromatin organization shapes the mutational landscape of cancer. Nature. 2015;518(7539):360–4. doi: 10.1038/nature14221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Salvadores M, Mas-Ponte D, Supek F. Passenger mutations accurately classify human tumors. PLoS Comput Biol. 2019;15(4):e1006953. doi: 10.1371/journal.pcbi.1006953 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Campbell PJ. Cliques and Schisms of Cancer Genes. Cancer Cell. 2017;32(2):129–30. doi: 10.1016/j.ccell.2017.07.009 [DOI] [PubMed] [Google Scholar]
  • 13.Nussinov R, Tsai C-J, Jang H. Are Parallel Proliferation Pathways Redundant? Trends Biochem Sci. 2020;45(7):554–63. doi: 10.1016/j.tibs.2020.03.013 [DOI] [PubMed] [Google Scholar]
  • 14.Nussinov R, Zhang M, Maloney R, Jang H. Drugging multiple same-allele driver mutations in cancer. Expert Opin Drug Discov. 2021;16(8):823–8. doi: 10.1080/17460441.2021.1905628 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.van de Haar J, Canisius S, Yu MK, Voest EE, Wessels LFA, Ideker T. Identifying Epistasis in Cancer Genomes: A Delicate Affair. Cell. 2019;177(6):1375–83. doi: 10.1016/j.cell.2019.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Nussinov R, Tsai C-J, Jang H. A New View of Activating Mutations in Cancer. Cancer Res. 2022;82(22):4114–23. doi: 10.1158/0008-5472.CAN-22-2125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nussinov R, Tsai C-J, Jang H. Allostery, and how to define and measure signal transduction. Biophys Chem. 2022;283:106766. doi: 10.1016/j.bpc.2022.106766 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cisowski J, Sayin VI, Liu M, Karlsson C, Bergo MO. Oncogene-induced senescence underlies the mutual exclusive nature of oncogenic KRAS and BRAF. Oncogene. 2016;35(10):1328–33. doi: 10.1038/onc.2015.186 [DOI] [PubMed] [Google Scholar]
  • 19.Cavalli G, Biavasco R, Borgiani B, Dagna L. Oncogene-induced senescence as a new mechanism of disease: the paradigm of erdheim-chester disease. Front Immunol. 2014;5:281. doi: 10.3389/fimmu.2014.00281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang M, Jang H, Nussinov R. PI3K Driver Mutations: A Biophysical Membrane-Centric Perspective. Cancer Res. 2021;81(2):237–47. doi: 10.1158/0008-5472.CAN-20-0911 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tomasetti C, Marchionni L, Nowak MA, Parmigiani G, Vogelstein B. Only three driver gene mutations are required for the development of lung and colorectal cancers. Proc Natl Acad Sci U S A. 2015;112(1):118–23. doi: 10.1073/pnas.1421839112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Nussinov R, Jang H, Tsai C-J, Cheng F. Precision medicine review: rare driver mutations and their biophysical classification. Biophys Rev. 2019;11(1):5–19. doi: 10.1007/s12551-018-0496-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Schmitt CA, Wang B, Demaria M. Senescence and cancer - role and therapeutic opportunities. Nat Rev Clin Oncol. 2022;19(10):619–36. doi: 10.1038/s41571-022-00668-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wang B, Kohli J, Demaria M. Senescent Cells in Cancer Therapy: Friends or Foes? Trends Cancer. 2020;6(10):838–57. doi: 10.1016/j.trecan.2020.05.004 [DOI] [PubMed] [Google Scholar]
  • 25.Liu X-L, Ding J, Meng L-H. Oncogene-induced senescence: a double edged sword in cancer. Acta Pharmacol Sin. 2018;39(10):1553–8. doi: 10.1038/aps.2017.198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zhu H, Blake S, Kusuma FK, Pearson RB, Kang J, Chan KT. Oncogene-induced senescence: From biology to therapy. Mech Ageing Dev. 2020;187:111229. doi: 10.1016/j.mad.2020.111229 [DOI] [PubMed] [Google Scholar]
  • 27.Jung SH, Hwang HJ, Kang D, Park HA, Lee HC, Jeong D, et al. mTOR kinase leads to PTEN-loss-induced cellular senescence by phosphorylating p53. Oncogene. 2019;38(10):1639–50. doi: 10.1038/s41388-018-0521-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chan KT, Blake S, Zhu H, Kang J, Trigos AS, Madhamshettiwar PB, et al. A functional genetic screen defines the AKT-induced senescence signaling network. Cell Death Differ. 2020;27(2):725–41. doi: 10.1038/s41418-019-0384-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Tu Z, Aird KM, Zhang R. RAS, cellular senescence and transformation: the BRCA1 DNA repair pathway at the crossroads. Small GTPases. 2012;3(3):163–7. doi: 10.4161/sgtp.19884 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Deng Y, Luo S, Deng C, Luo T, Yin W, Zhang H, et al. Identifying mutual exclusivity across cancer genomes: computational approaches to discover genetic interaction and reveal tumor vulnerability. Brief Bioinform. 2019;20(1):254–66. doi: 10.1093/bib/bbx109 [DOI] [PubMed] [Google Scholar]
  • 31.Blair LM, Juan JM, Sebastian L, Tran VB, Nie W, Wall GD, et al. Oncogenic context shapes the fitness landscape of tumor suppression. Nat Commun. 2023;14(1):6422. doi: 10.1038/s41467-023-42156-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Park S, Supek F, Lehner B. Higher order genetic interactions switch cancer genes from two-hit to one-hit drivers. Nat Commun. 2021;12(1):7051. doi: 10.1038/s41467-021-27242-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Cisowski J, Bergo MO. What makes oncogenes mutually exclusive? Small GTPases. 2017;8(3):187–92. doi: 10.1080/21541248.2016.1212689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Köhler M, Röring M, Schorch B, Heilmann K, Stickel N, Fiala GJ, et al. Activation loop phosphorylation regulates B-Raf in vivo and transformation by B-Raf mutants. EMBO J. 2016;35(2):143–61. doi: 10.15252/embj.201592097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ahronian LG, Sennott EM, Van Allen EM, Wagle N, Kwak EL, Faris JE, et al. Clinical Acquired Resistance to RAF Inhibitor Combinations in BRAF-Mutant Colorectal Cancer through MAPK Pathway Alterations. Cancer Discov. 2015;5(4):358–67. doi: 10.1158/2159-8290.CD-14-1518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rajagopalan H, Bardelli A, Lengauer C, Kinzler KW, Vogelstein B, Velculescu VE. Tumorigenesis: RAF/RAS oncogenes and mismatch-repair status. Nature. 2002;418(6901):934. doi: 10.1038/418934a [DOI] [PubMed] [Google Scholar]
  • 37.Nussinov R, Yavuz BR, Jang H. Anticancer drugs: How to select small molecule combinations? Trends Pharmacol Sci. 2024;45(6):503–19. doi: 10.1016/j.tips.2024.04.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Nussinov R, Weichhart T, Dlamini Z, Gibbons DL, Van Seuningen I, Konen J, et al. Directions to overcome therapy resistance in cancer. Trends Pharmacol Sci. 2024;45(6):467–71. doi: 10.1016/j.tips.2024.05.001 [DOI] [PubMed] [Google Scholar]
  • 39.Huang A, Garraway LA, Ashworth A, Weber B. Synthetic lethality as an engine for cancer drug target discovery. Nat Rev Drug Discov. 2020;19(1):23–38. doi: 10.1038/s41573-019-0046-z [DOI] [PubMed] [Google Scholar]
  • 40.Shaheen M, Allen C, Nickoloff JA, Hromas R. Synthetic lethality: exploiting the addiction of cancer to DNA repair. Blood. 2011;117(23):6074–82. doi: 10.1182/blood-2011-01-313734 [DOI] [PubMed] [Google Scholar]
  • 41.Liu Q-W, Yang Z-W, Tang Q-H, Wang W-E, Chu D-S, Ji J-F, et al. The power and the promise of synthetic lethality for clinical application in cancer treatment. Biomed Pharmacother. 2024;172:116288. doi: 10.1016/j.biopha.2024.116288 [DOI] [PubMed] [Google Scholar]
  • 42.Setton J, Zinda M, Riaz N, Durocher D, Zimmermann M, Koehler M, et al. Synthetic Lethality in Cancer Therapeutics: The Next Generation. Cancer Discov. 2021;11(7):1626–35. doi: 10.1158/2159-8290.CD-20-1503 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Tomlinson IP, Novelli MR, Bodmer WF. The mutation rate and cancer. Proc Natl Acad Sci U S A. 1996;93(25):14800–3. doi: 10.1073/pnas.93.25.14800 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Yavuz BR, Tsai C-J, Nussinov R, Tuncbag N. Pan-cancer clinical impact of latent drivers from double mutations. Commun Biol. 2023;6(1):202. doi: 10.1038/s42003-023-04519-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tamborero D, Rubio-Perez C, Deu-Pons J, Schroeder MP, Vivancos A, Rovira A, et al. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 2018;10(1):25. doi: 10.1186/s13073-018-0531-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wang J, Wu M, Huang X, Wang L, Zhang S, Liu H, et al. SynLethDB 2.0: a web-based knowledge graph database on synthetic lethality for novel anticancer drug discovery. Database (Oxford). 2022;2022:baac030. doi: 10.1093/database/baac030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Breitkreutz B-J, Stark C, Reguly T, Boucher L, Breitkreutz A, Livstone M, et al. The BioGRID Interaction Database: 2008 update. Nucleic Acids Res. 2008;36(Database issue):D637-40. doi: 10.1093/nar/gkm1001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74. doi: 10.1016/j.cell.2011.02.013 [DOI] [PubMed] [Google Scholar]
  • 49.Rodier F, Campisi J. Four faces of cellular senescence. J Cell Biol. 2011;192(4):547–56. doi: 10.1083/jcb.201009094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Alon U. Network motifs: theory and experimental approaches. Nat Rev Genet. 2007;8(6):450–61. doi: 10.1038/nrg2102 [DOI] [PubMed] [Google Scholar]
  • 51.Gorelick AN, Sánchez-Rivera FJ, Cai Y, Bielski CM, Biederstedt E, Jonsson P, et al. Phase and context shape the function of composite oncogenic mutations. Nature. 2020;582(7810):100–3. doi: 10.1038/s41586-020-2315-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Iannuccelli M, Micarelli E, Surdo PL, Palma A, Perfetto L, Rozzo I, et al. CancerGeneNet: linking driver genes to cancer hallmarks. Nucleic Acids Res. 2020;48(D1):D416–21. doi: 10.1093/nar/gkz871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hedrick E, Cheng Y, Jin U-H, Kim K, Safe S. Specificity protein (Sp) transcription factors Sp1, Sp3 and Sp4 are non-oncogene addiction genes in cancer cells. Oncotarget. 2016;7(16):22245–56. doi: 10.18632/oncotarget.7925 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Safe S, Shrestha R, Mohankumar K, Howard M, Hedrick E, Abdelrahim M. Transcription factors specificity protein and nuclear receptor 4A1 in pancreatic cancer. World J Gastroenterol. 2021;27(38):6387–98. doi: 10.3748/wjg.v27.i38.6387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Kim MP, Li X, Deng J, Zhang Y, Dai B, Allton KL, et al. Oncogenic KRAS Recruits an Expansive Transcriptional Network through Mutant p53 to Drive Pancreatic Cancer Metastasis. Cancer Discov. 2021;11(8):2094–111. doi: 10.1158/2159-8290.CD-20-1228 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zhou Y, Jin J, Ji Y, Zhang J, Fu N, Chen M, et al. TP53 missense mutation reveals gain-of-function properties in small-sized KRAS transformed pancreatic ductal adenocarcinoma. J Transl Med. 2023;21(1):872. doi: 10.1186/s12967-023-04742-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Shi Y, Hata A, Lo RS, Massagué J, Pavletich NP. A structural basis for mutational inactivation of the tumour suppressor Smad4. Nature. 1997;388(6637):87–93. doi: 10.1038/40431 [DOI] [PubMed] [Google Scholar]
  • 58.Schmitt CA, Wang B, Demaria M. Senescence and cancer - role and therapeutic opportunities. Nat Rev Clin Oncol. 2022;19(10):619–36. doi: 10.1038/s41571-022-00668-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kumari R, Jat P. Mechanisms of Cellular Senescence: Cell Cycle Arrest and Senescence Associated Secretory Phenotype. Front Cell Dev Biol. 2021;9:645593. doi: 10.3389/fcell.2021.645593 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Türei D, Korcsmáros T, Saez-Rodriguez J. OmniPath: guidelines and gateway for literature-curated signaling pathway resources. Nat Methods. 2016;13(12):966–7. doi: 10.1038/nmeth.4077 [DOI] [PubMed] [Google Scholar]
  • 61.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. doi: 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Mootha VK, Lindgren CM, Eriksson K-F, Subramanian A, Sihag S, Lehar J, et al. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet. 2003;34(3):267–73. doi: 10.1038/ng1180 [DOI] [PubMed] [Google Scholar]
  • 63.Wang J, Vasaikar S, Shi Z, Greer M, Zhang B. WebGestalt 2017: a more comprehensive, powerful, flexible and interactive gene set enrichment analysis toolkit. Nucleic Acids Res. 2017;45(W1):W130–7. doi: 10.1093/nar/gkx356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lappano R, Maggiolini M. GPCRs and cancer. Acta Pharmacol Sin. 2012;33(3):351–62. doi: 10.1038/aps.2011.183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Huerta-Reyes M, Maya-Núñez G, Pérez-Solis MA, López-Muñoz E, Guillén N, Olivo-Marin J-C, et al. Treatment of Breast Cancer With Gonadotropin-Releasing Hormone Analogs. Front Oncol. 2019;9:943. doi: 10.3389/fonc.2019.00943 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Garrido MP, Hernandez A, Vega M, Araya E, Romero C. Conventional and new proposals of GnRH therapy for ovarian, breast, and prostatic cancers. Front Endocrinol (Lausanne). 2023;14:1143261. doi: 10.3389/fendo.2023.1143261 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Mishra P, Banerjee D, Ben-Baruch A. Chemokines at the crossroads of tumor-fibroblast interactions that promote malignancy. J Leukoc Biol. 2011;89(1):31–9. doi: 10.1189/jlb.0310182 [DOI] [PubMed] [Google Scholar]
  • 68.Mollica Poeta V, Massara M, Capucetti A, Bonecchi R. Chemokines and Chemokine Receptors: New Targets for Cancer Immunotherapy. Front Immunol. 2019;10:379. doi: 10.3389/fimmu.2019.00379 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Liu H, Yang Z, Lu W, Chen Z, Chen L, Han S, et al. Chemokines and chemokine receptors: A new strategy for breast cancer therapy. Cancer Med. 2020;9(11):3786–99. doi: 10.1002/cam4.3014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Jubb HC, Pandurangan AP, Turner MA, Ochoa-Montaño B, Blundell TL, Ascher DB. Mutations at protein-protein interfaces: Small changes over big surfaces have large impacts on human health. Prog Biophys Mol Biol. 2017;128:3–13. doi: 10.1016/j.pbiomolbio.2016.10.002 [DOI] [PubMed] [Google Scholar]
  • 71.Cheng F, Zhao J, Wang Y, Lu W, Liu Z, Zhou Y, et al. Comprehensive characterization of protein-protein interactions perturbed by disease mutations. Nat Genet. 2021;53(3):342–53. doi: 10.1038/s41588-020-00774-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Cheng F, Liang H, Butte AJ, Eng C, Nussinov R. Personal Mutanomes Meet Modern Oncology Drug Discovery and Precision Health. Pharmacol Rev. 2019;71(1):1–19. doi: 10.1124/pr.118.016253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Nussinov R, Jang H, Tsai C-J, Cheng F. Review: Precision medicine and driver mutations: Computational methods, functional assays and conformational principles for interpreting cancer drivers. PLoS Comput Biol. 2019;15(3):e1006658. doi: 10.1371/journal.pcbi.1006658 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Ivanov AA. Explore Protein-Protein Interactions for Cancer Target Discovery Using the OncoPPi Portal. Methods Mol Biol. 2020;2074:145–64. doi: 10.1007/978-1-4939-9873-9_12 [DOI] [PubMed] [Google Scholar]
  • 75.Ivanov AA, Revennaugh B, Rusnak L, Gonzalez-Pecchi V, Mo X, Johns MA, et al. The OncoPPi Portal: an integrative resource to explore and prioritize protein-protein interactions for cancer target discovery. Bioinformatics. 2018;34(7):1183–91. doi: 10.1093/bioinformatics/btx743 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Chen C, Shi C, Huang X, Zheng J, Zhu Z, Li Q, et al. Molecular Profiles and Metastasis Markers in Chinese Patients with Gastric Carcinoma. Sci Rep. 2019;9(1):13995. doi: 10.1038/s41598-019-50171-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Zhang Y, Chen Y, Yang C, Seger N, Hesla AC, Tsagkozis P, et al. TERT promoter mutation is an objective clinical marker for disease progression in chondrosarcoma. Mod Pathol. 2021;34(11):2020–7. doi: 10.1038/s41379-021-00848-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Liu R, Rizzo S, Waliany S, Garmhausen MR, Pal N, Huang Z, et al. Systematic pan-cancer analysis of mutation-treatment interactions using large real-world clinicogenomics data. Nat Med. 2022;28(8):1656–61. doi: 10.1038/s41591-022-01873-5 [DOI] [PubMed] [Google Scholar]
  • 79.Smith MR, Wang Y, D’Agostino R Jr, Liu Y, Ruiz J, Lycan T, et al. Prognostic Mutational Signatures of NSCLC Patients treated with chemotherapy, immunotherapy and chemoimmunotherapy. NPJ Precis Oncol. 2023;7(1):34. doi: 10.1038/s41698-023-00373-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Zhang F, Wang J, Xu Y, Cai S, Li T, Wang G, et al. Co-occurring genomic alterations and immunotherapy efficacy in NSCLC. NPJ Precis Oncol. 2022;6(1):4. doi: 10.1038/s41698-021-00243-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Nunes L, Stenersen JM, Kryeziu K, Sjöblom T, Glimelius B, Lothe RA, et al. Co-occurring mutations identify prognostic subgroups of microsatellite stable colorectal cancer. Mol Cancer. 2024;23(1):264. doi: 10.1186/s12943-024-02173-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Zhao Z-M, Zhao B, Bai Y, Iamarino A, Gaffney SG, Schlessinger J, et al. Early and multiple origins of metastatic lineages within primary tumors. Proc Natl Acad Sci U S A. 2016;113(8):2140–5. doi: 10.1073/pnas.1525677113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Liu G, Zhan X, Dong C, Liu L. Genomics alterations of metastatic and primary tissues across 15 cancer types. Sci Rep. 2017;7(1):13262. doi: 10.1038/s41598-017-13650-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Martínez-Jiménez F, Movasati A, Brunner SR, Nguyen L, Priestley P, Cuppen E, et al. Pan-cancer whole-genome comparison of primary and metastatic solid tumours. Nature. 2023;618(7964):333–41. doi: 10.1038/s41586-023-06054-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Christensen DS, Ahrenfeldt J, Sokač M, Kisistók J, Thomsen MK, Maretty L, et al. Treatment Represents a Key Driver of Metastatic Cancer Evolution. Cancer Res. 2022;82(16):2918–27. doi: 10.1158/0008-5472.CAN-22-0562 [DOI] [PubMed] [Google Scholar]
  • 86.Birkbak NJ, McGranahan N. Cancer Genome Evolutionary Trajectories in Metastasis. Cancer Cell. 2020;37(1):8–19. doi: 10.1016/j.ccell.2019.12.004 [DOI] [PubMed] [Google Scholar]
  • 87.Patel SA, Rodrigues P, Wesolowski L, Vanharanta S. Genomic control of metastasis. Br J Cancer. 2021;124(1):3–12. doi: 10.1038/s41416-020-01127-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Theodorou V, Stark R, Menon S, Carroll JS. GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility. Genome Res. 2013;23(1):12–22. doi: 10.1101/gr.139469.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Priestley P, Baber J, Lolkema MP, Steeghs N, de Bruijn E, Shale C, et al. Pan-cancer whole-genome analyses of metastatic solid tumours. Nature. 2019;575(7781):210–6. doi: 10.1038/s41586-019-1689-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Modrek AS, Golub D, Khan T, Bready D, Prado J, Bowman C, et al. Low-Grade Astrocytoma Mutations in IDH1, P53, and ATRX Cooperate to Block Differentiation of Human Neural Stem Cells via Repression of SOX2. Cell Rep. 2017;21(5):1267–80. doi: 10.1016/j.celrep.2017.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Ohba S, Kuwahara K, Yamada S, Abe M, Hirose Y. Correlation between IDH, ATRX, and TERT promoter mutations in glioma. Brain Tumor Pathol. 2020;37(2):33–40. doi: 10.1007/s10014-020-00360-4 [DOI] [PubMed] [Google Scholar]
  • 92.Herzog SK, Fuqua SAW. ESR1 mutations and therapeutic resistance in metastatic breast cancer: progress and remaining challenges. Br J Cancer. 2022;126(2):174–86. doi: 10.1038/s41416-021-01564-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Gerratana L, Davis AA, Velimirovic M, Clifton K, Hensing WL, Shah AN, et al. Interplay between ESR1/PIK3CA codon variants, oncogenic pathway alterations and clinical phenotype in patients with metastatic breast cancer (MBC): comprehensive circulating tumor DNA (ctDNA) analysis. Breast Cancer Res. 2023;25(1):112. doi: 10.1186/s13058-023-01718-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Boca SM, Kinzler KW, Velculescu VE, Vogelstein B, Parmigiani G. Patient-oriented gene set analysis for cancer mutation data. Genome Biol. 2010;11(11):R112. doi: 10.1186/gb-2010-11-11-r112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Reiter JG, Baretti M, Gerold JM, Makohon-Moore AP, Daud A, Iacobuzio-Donahue CA, et al. An analysis of genetic heterogeneity in untreated cancers. Nat Rev Cancer. 2019;19(11):639–50. doi: 10.1038/s41568-019-0185-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Mendiratta G, Ke E, Aziz M, Liarakos D, Tong M, Stites EC. Cancer gene mutation frequencies for the U.S. population. Nat Commun. 2021;12(1):5961. doi: 10.1038/s41467-021-26213-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Mendiratta G, Jones MK, Stites EC. How often is each gene mutated within the cancer patient population? Mol Cell Oncol. 2022;9(1):2065176. doi: 10.1080/23723556.2022.2065176 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.El Tekle G, Bernasocchi T, Unni AM, Bertoni F, Rossi D, Rubin MA, et al. Co-occurrence and mutual exclusivity: what cross-cancer mutation patterns can tell us. Trends Cancer. 2021;7(9):823–36. doi: 10.1016/j.trecan.2021.04.009 [DOI] [PubMed] [Google Scholar]
  • 99.Rogers ZN, McFarland CD, Winters IP, Naranjo S, Chuang C-H, Petrov D, et al. A quantitative and multiplexed approach to uncover the fitness landscape of tumor suppression in vivo. Nat Methods. 2017;14(7):737–42. doi: 10.1038/nmeth.4297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Harris TJR, McCormick F. The molecular pathology of cancer. Nat Rev Clin Oncol. 2010;7(5):251–65. doi: 10.1038/nrclinonc.2010.41 [DOI] [PubMed] [Google Scholar]
  • 101.Sanchez-Vega F, Mina M, Armenia J, Chatila WK, Luna A, La KC, et al. Oncogenic Signaling Pathways in The Cancer Genome Atlas. Cell. 2018;173(2):321-337.e10. doi: 10.1016/j.cell.2018.03.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Sinkala M. Mutational landscape of cancer-driver genes across human cancers. Sci Rep. 2023;13(1):12742. doi: 10.1038/s41598-023-39608-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Liu S, Liu J, Xie Y, Zhai T, Hinderer EW, Stromberg AJ, et al. MEScan: a powerful statistical framework for genome-scale mutual exclusivity analysis of cancer mutations. Bioinformatics. 2021;37(9):1189–97. doi: 10.1093/bioinformatics/btaa957 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Hua X, Hyland PL, Huang J, Song L, Zhu B, Caporaso NE, et al. MEGSA: A Powerful and Flexible Framework for Analyzing Mutual Exclusivity of Tumor Mutations. Am J Hum Genet. 2016;98(3):442–55. doi: 10.1016/j.ajhg.2015.12.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Leiserson MDM, Wu H-T, Vandin F, Raphael BJ. CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer. Genome Biol. 2015;16(1):160. doi: 10.1186/s13059-015-0700-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Ciriello G, Cerami E, Aksoy BA, Sander C, Schultz N. Using MEMo to discover mutual exclusivity modules in cancer. Curr Protoc Bioinformatics. 2013;Chapter 8:8.17.1-8.17.12. doi: 10.1002/0471250953.bi0817s41 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Zhang J, Wu L-Y, Zhang X-S, Zhang S. Discovery of co-occurring driver pathways in cancer. BMC Bioinformatics. 2014;15(1):271. doi: 10.1186/1471-2105-15-271 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Mina M, Raynaud F, Tavernari D, Battistello E, Sungalee S, Saghafinia S, et al. Conditional Selection of Genomic Alterations Dictates Cancer Evolution and Oncogenic Dependencies. Cancer Cell. 2017;32(2):155-168.e6. doi: 10.1016/j.ccell.2017.06.010 [DOI] [PubMed] [Google Scholar]
  • 109.Iwafuchi-Doi M, Donahue G, Kakumanu A, Watts JA, Mahony S, Pugh BF, et al. The Pioneer Transcription Factor FoxA Maintains an Accessible Nucleosome Configuration at Enhancers for Tissue-Specific Gene Activation. Mol Cell. 2016;62(1):79–91. doi: 10.1016/j.molcel.2016.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation. ACM SIGMOD Rec. 2000;29(2):1–12. doi: 10.1145/335191.335372 [DOI] [Google Scholar]
  • 111.Vasan N, Razavi P, Johnson JL, Shao H, Shah H, Antoine A, et al. Double PIK3CA mutations in cis increase oncogenicity and sensitivity to PI3Kα inhibitors. Science. 2019;366(6466):714–23. doi: 10.1126/science.aaw9032 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Saito Y, Koya J, Araki M, Kogure Y, Shingaki S, Tabata M, et al. Landscape and function of multiple mutations within individual oncogenes. Nature. 2020;582(7810):95–9. doi: 10.1038/s41586-020-2175-2 [DOI] [PubMed] [Google Scholar]
  • 113.Saito M, Momma T, Kono K. Targeted therapy according to next generation sequencing-based panel sequencing. Fukushima J Med Sci. 2018;64(1):9–14. doi: 10.5387/fms.2018-02 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Jang H, Yavuz BR, Zhang M, Liu Y, Nussinov R. The mechanism of oncogenic PI3K lipid kinase variants at the membrane and their cryptic pockets. bioRxiv. 2025;:2025.06.26.661751. doi: 10.1101/2025.06.26.661751 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Wang K, Ye M, Mo Z, Huang X, Li Y, Wei S. Comprehensive analysis of genomic alterations and novel prognostic biomarkers, and establishment of prediction models of metastasis in metastatic non-small cell lung cancer. J Cancer. 2025;16(1):339–50. doi: 10.7150/jca.97070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Cendra AS, Pekarek L, Pedrejón BI, Bernier L, Cervantes EDR, Cendra CS, et al. New Horizons of Biomarkers in Metastatic Thyroid Cancer. J Cancer. 2025;16(1):241–64. doi: 10.7150/jca.101395 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Duan H, Ren J, Wei S, Yang Z, Li C, Wang Z, et al. Integrated analyses of multi-omic data derived from paired primary lung cancer and brain metastasis reveal the metabolic vulnerability as a novel therapeutic target. Genome Med. 2024;16(1):138. doi: 10.1186/s13073-024-01410-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Boiarsky D, Lydon CA, Chambers ES, Sholl LM, Nishino M, Skoulidis F, et al. Molecular markers of metastatic disease in KRAS-mutant lung adenocarcinoma. Ann Oncol. 2023;34(7):589–604. doi: 10.1016/j.annonc.2023.04.514 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Yang Y, Zhang S, Guo L. Characterization of Cell Cycle-Related Competing Endogenous RNAs Using Robust Rank Aggregation as Prognostic Biomarker in Lung Adenocarcinoma. Front Oncol. 2022;12:807367. doi: 10.3389/fonc.2022.807367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Wu G, Haw R. Functional Interaction Network Construction and Analysis for Disease Discovery. Methods Mol Biol. 2017;1558:235–53. doi: 10.1007/978-1-4939-6783-4_11 [DOI] [PubMed] [Google Scholar]
  • 121.Agrawal A, Balcı H, Hanspers K, Coort SL, Martens M, Slenter DN, et al. WikiPathways 2024: next generation pathway database. Nucleic Acids Res. 2024;52(D1):D679–89. doi: 10.1093/nar/gkad960 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Zhou Y, Zhao J, Fang J, Martin W, Li L, Nussinov R, et al. My personal mutanome: a computational genomic medicine platform for searching network perturbing alleles linking genotype to phenotype. Genome Biol. 2021;22(1):53. doi: 10.1186/s13059-021-02269-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Hopkins BD, Goncalves MD, Cantley LC. Insulin-PI3K signalling: an evolutionarily insulated metabolic driver of cancer. Nat Rev Endocrinol. 2020;16(5):276–83. doi: 10.1038/s41574-020-0329-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Dominich S. The modern algebra of information retrieval. Springer Science & Business Media. 2008. p. 330. [Google Scholar]
  • 125.Exploring network structure, dynamics, and function using Networkx2008. 2008.
  • 126.Brin S, Page L. The anatomy of a large-scale hypertextual web search engine (Reprint from Computer Networks and ISDN Systems, vol 30, pg 107-117, 1998). Comput Netw. 2012;56(18):3825–33. doi: 10.1016/j.comnet.2012.10.007 WOS:000 [DOI] [Google Scholar]
  • 127.Fang Z, Liu X, Peltz G. GSEApy: a comprehensive package for performing gene set enrichment analysis in Python. Bioinformatics. 2023;39(1):btac757. doi: 10.1093/bioinformatics/btac757 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Text. Contains supplemental text, methods and figures.

(DOCX)

pcbi.1013351.s001.docx (24.5MB, docx)
S1 Table. Statistics of co-existing mutations on different proteins.

(XLSX)

pcbi.1013351.s002.xlsx (529.2KB, xlsx)
S2 Table. Metastatic breast cancer co-existing mutation markers.

(XLSX)

pcbi.1013351.s003.xlsx (12.9KB, xlsx)
S3 Table. Metastatic breast cancer triplet mutations in FP Growth tree.

(XLSX)

pcbi.1013351.s004.xlsx (9.3KB, xlsx)

Data Availability Statement

The results shown here are in whole or part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga. All the data and codes are available at https://github.com/bengiruken/DifferentGeneDoublets. The codes and data for obtaining metastatic markers with the FP growth tree is available at https://github.com/ugur0sahin/FMPSeeker.


Articles from PLOS Computational Biology are provided here courtesy of PLOS

RESOURCES