Skip to main content
BMC Medical Genomics logoLink to BMC Medical Genomics
. 2025 Dec 12;19:14. doi: 10.1186/s12920-025-02289-z

A pathway-informed mutual exclusivity framework to detect genetic interactions in pediatric cancer

Anastasia Spinou 1, Richard Gremmen 1, Jarno Drost 1,2, Patrick Kemmeren 1,3,
PMCID: PMC12822016  PMID: 41388546

Abstract

Background

The exponential increase of sequenced cancer genomes has enabled the in-silico study of genetic interactions in tumors across many cancer types—particularly synthetic lethality, where two gene alterations lead to cell death—and identify new candidate therapeutic targets. This rise is primarily present in adult cancer, while in-silico investigation of genetic interactions remains challenging in pediatric cancer especially in tumors of low incidence. Consequently, this underscores the need for specialized approaches to advance our understanding of genetic interactions in pediatric oncology.

Methods

Here, we present the pathway-informed genetic interaction framework (PIGI) that employs mutual exclusivity and co-occurrence testing and leverages biological pathways to infer candidate genetic interactions. Pathways facilitate the detection of hidden biology by grouping genes affected by single nucleotide variants or small indels in functional units, as well as alleviate key confounders of these analyses: pathway epistasis and cancer subtypes—thereby highlighting genes of greater interest.

Results

PIGI detected candidate genetic interactions by assessing pathway mutual exclusivity and co-occurrence in two primary pediatric cancer datasets, DKFZ and TARGET. PIGI detected, from high significance pathway relationships, 35 mutually exclusive and 2 co-occurring mutated gene pairs. The already known mutually exclusive gene pair of TP53-DROSHA is detected in the much smaller collection of the DKFZ Wilms tumors. Over half of the identified gene pairs represent new discoveries that have not been previously described in the literature. Four of them could be promising candidates for synthetic lethal genetic interactions.

Conclusion

These findings highlight the benefits of genetic interactions inference by exploring a different aspect of pediatric cancer data through pathways and propose new gene pairs for follow-up synthetic lethality experimentation.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12920-025-02289-z.

Keywords: Genetic interactions, Pediatric cancer, Mutual exclusivity, Synthetic lethality, Pathways

Background

Genetic interactions (GIs), the combination of two mutated genes that produces an unexpected phenotype, have been extensively investigated in cancer [14]. In its most extreme form, these interactions can be either synthetic lethal (SL) or cooperative. Synthetic lethality is particularly interesting for uncovering new therapeutic targets by identifying cancer cell vulnerabilities. On the other hand, cooperation suggests an advantage for tumor cells and can offer insights into the mechanisms of cancer development.

The increased availability of sequenced cancer genomes has enabled the exploration of the genomic landscapes of tumors across many cancer types, giving the opportunity to perform in-silico studies to infer GIs [5]. Several inference methods leverage tumor mutations to test gene co-occurrence (CO) and mutual exclusivity (ME), testing the hypothesis whether two mutated genes occur together more or less often than expected [69]. Significant co-occurrence between two genes may indicate a cooperative GI. Conversely, significant mutual exclusivity between two genes may indicate a synthetic sick or SL GI. Key confounders in ME analysis include cancer subtypes and pathway epistasis, which often can result in findings that are not true synthetic sick/lethal candidates. Pathway epistasis, also known as functional redundancy, describes functionally related genes belonging to the same pathway, and mutating either of them leads to the same downstream effect. A mutation in any one of the genes is sufficient to alter pathway function, eliminating selective pressure for additional mutations in other genes within the same pathway [5, 10, 11]. As a consequence, the gene mutations will appear mutually exclusive in the CO/ME test but are likely not SL. While CO/ME testing identifies candidate gene pairs for SL and cooperation, it provides only statistical predictions and does not confirm functional relationships. Validation through literature review or experimental evidence is necessary to distinguish the biologically meaningful interactions from spurious associations and strengthen the confidence of the findings.

The application of GI inference methodologies in pediatric cancer continues to face challenges, as power limitations persist despite the expansion of collected cancer genomes, largely due to small sample sizes per cancer type [12, 13]. However, pediatric cancer genomes remain a valuable resource for uncovering new biological insights. Incorporating prior knowledge can be instrumental for CO/ME testing. Biological pathways are summarized biological knowledge that aid in grouping functionally related genes. Aggregating genes in biologically meaningful functional groups or pathways, decreases the dimensionality of the data and contributes to increasing statistical power [14]. This can be a compelling way to perform an informed and biologically meaningful CO/ME testing to explore different aspects of the data.

Here, we present the pathway-informed genetic interactions framework (PIGI) applied to two pediatric cancer datasets, TARGET and DKFZ [15, 16]. PIGI uses pathway knowledge from Reactome [17] to perform a biologically informed test on genes mutated by single nucleotide variants (SNVs) or small indels. It assesses CO and ME from pathway relationships and results in a set of co-occurring and mutually exclusive gene pairs. It uncovers new gene pair relationships and spotlights potential SL candidates by decreasing pathway epistasis, subtype effects, and cooperative interactions. Finally, PIGI reveals several ME gene pairs with an unknown relationship and four ME gene pairs, reflecting candidate SL GIs.

Methods

Data acquisition

WGS and WES data of the TARGET dataset consisting of 1669 primary tumors of 6 cancer types [16], and of the DKFZ dataset containing 961 primary tumors of 23 cancer (sub)types [15], as shown in Suppl. Figure 1, were retrieved (mutation data are available to inspect in the following portals, https://www.cancer.gov/ccg/ and https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?dscope=DKFZ_PED&option=about_dscope). Following the same processing criteria as described in Daub et al. [12], the WGS/WES data were processed to derive somatic SNVs and small indels with likely functional effects per tumor. The same criteria were applied, also, in excluding hypermutators; a hypermutator sample is considered to have more than 225 coding mutations. Shared observation windows across patients have not been accounted for in tumor selection. Biological duplicates were excluded from the analysis. This processing resulted in a total of 1632 tumors in the TARGET dataset, comprised of 649 WGS samples and 1105 WES samples, and 829 tumors in the DKFZ dataset, comprised of 523 WGS samples and 306 WES samples.

Reactome pathway database and pathway selection

Reactome, a curated and experimentally validated biological pathway database, was used for this study. The complete human pathway gene sets of version v86 and the corresponding pathway hierarchy was retrieved at https://reactome.org/download-data and all genes were used by their ENSEMBL IDs. Reactome groups its pathways in a hierarchical tree structure. The tree begins with level 0 (root) pathways that represent biologically broad and general functions. Moving deeper into the tree’s structure, level 1, 2, 3, …, x, the pathways become increasingly smaller and reflect greater biological specificity. Out of the 2647 pathways, a subset of pathways was selected to maintain statistical power, by excluding small pathways (< 10 genes) and large general pathways (> 300 genes) while still retaining biological relevance. The following criteria were applied: pathways at the second level in the tree that contain < 300 and > 10 genes were selected. Pathways that had fewer than 10 genes were omitted, whereas pathways that contained more than 300 genes were replaced by pathways of the next level deeper in the tree that satisfied the same rule (< 300 & > 10 genes). This process was repeated iteratively until for each pathway the rule was satisfied. This resulted in a set of 413 pathways that contain 9460 genes.

Assignment of mutated genes to pathways

Each tumor’s mutated genes were assigned to their corresponding pathways. If a gene is mutated, then it is assumed that its corresponding pathway is also mutated. If a gene belongs to more than one pathway, it is assumed that all these pathways are mutated. This assignment resulted in a list of pathways that are mutated per tumor and per cancer type. Each mutated pathway was retained in the test if it was mutated in 6 or more tumors. The final number of tumors tested via pathways per cancer type is shown in Suppl. Table 1.

Co-occurrence & mutual exclusivity test

Pathway test with rediscover

Rediscover [18] was applied on mutated pathways per cancer type or cancer subtype as provided by the published datasets. Rediscover uses a statistical test that employs the Poisson-Binomial distribution to identify co-occurring and mutually exclusive events. Rediscover is a reimplementation of DISCOVER [6] and therefore, it takes into account the overall alteration rates of each tumor. This test was designed to detect genetic interactions on gene mutations but here we apply it to pathway mutations. Version 0.3.2 of “Rediscover” R package was used.

FDR estimation

Rediscover (in contrast to DISCOVER) does not perform an FDR control or estimation. FDR control methods were not suitable for this study due to the lack of continuous p-values and uniformity (p-value distribution skewed to 1). Therefore, an empirical FDR was estimated as described by Daub et al. 2021 [12]. This included calculating a null distribution by permuting the input matrices (nperm = 300), applying rediscover, and after obtaining the null distribution, the found p-values were compared to the null distribution. The binary matrices were permuted using permatswap() function (mtype = “prab”, times = 300, rest default settings) of “vegan” R package version 2.6.4.

Post-filtering

Removal of pathway pairs with shared genes

Shared genes between pathways led to identification of false significantly co-occurring pathway pairs, which would not have occurred without the shared genes. Therefore, all pathway pairs with at least one shared gene were removed from further analysis.

Omitting pathway pairs caused by mutation load effects

Mutation load differences among the tumors can yield false positives in mutual exclusivity testing. To account for these potential false positives, the Wilcoxon-Mann–Whitney test (wilcox.exact() function of R package “exactRankTests” version 0.8.35) was applied to the pathway mutation load of the tumors. Pathway mutation load was defined as the number of mutated genes (with functional impact on encoded protein) present in a given tumor that has a specific mutated pathway. If the mutation load of pathway A and pathway B were found significantly different (p-value < = 0.05) then they were excluded from the results.

Gene pair retracing

For each pathway relationship found, the same relationship was applied to their underlying gene pairs. For example, pathway relationship A(genes: a, b, c)-B(genes: e, f) is mutually exclusive, therefore their corresponding genes are also marked as mutually exclusive: a-e, a-f, b-e, b-f, c-e, c-f. Per cancer type for each pathway pair these gene relationships were extracted (by omitting duplicates) to construct a network in Cytoscape.

Gene pair filtering

Filtering out gene pairs occurring in the same pathway

Gene pairs within the same pathway were excluded from the analysis. Gene pairs were assessed within the pathway subset that is used in this study.

Omitting mutually exclusive gene pairs caused by mutation load effects

Mutation load can confound the identification of mutually exclusive gene pairs, potentially resulting in false positives, as described above about pathway mutation load effect. To account for these potential false positives, for each gene pair, mutation load differences were assessed between tumors harboring mutations in each gene of the pair using a Wilcoxon-Mann–Whitney test. The pairs that have a p-value < 0.05 are omitted from the results.

Pathway-gene mutation frequency threshold and gene pair filtering

For each cancer type, the distribution of gene counts in relation to their mutation frequency within each mutated pathway was calculated. The elbow point was identified from each distribution, as shown in Suppl. Figure 2 and Suppl. Table 2, and the mean of all elbow points across all pathways within a cancer type was calculated to establish a cancer-specific mutation frequency threshold. A global minimum threshold for gene mutation frequency was set to >= 3. For cancer types where the mean elbow point for mutation frequency was below 3, gene pairs in which both genes have a mutation frequency >= 3 were retained. For cancer types where the cancer-specific mutation frequency threshold exceeded 3, this higher value was used as the threshold: only gene pairs with both genes meeting or exceeding that higher threshold were selected.

Gene pair selection for TARGET—B-ALL

For facilitating annotation, a subset of significant B-ALL gene pairs was selected due to the large number of findings. The subset consists of maximum 3 gene pairs per gene of the top 15 high mutation frequency genes found in the B-ALL tumors.

Gene pair annotation strategy

Each gene pair was annotated using google scholar to find matching articles describing mutual exclusivity, co-occurrence, co-mutations, outcome, function of each gene, co-operation and synthetic lethality in the relevant cancer type they were found in or other cancer types.

dNdS ratio

The dNdS ratio was calculated to identify genes in gene pairs that are likely cancer drivers. The dndscv method (version 0.0.1.0) [19] was applied per cancer type and per sequencing method, i.e. WGS or WES with default settings. Significance was set at 0.1. More details for numbers of tumors per sequencing method and dNdS output is available in Suppl. Table 3.

Cancer gene census significance

The cancer gene census was obtained on the 24th April 2025 through https://cancer.sanger.ac.uk/cosmic/. The hypergeometric test was used to evaluate the enrichment of cancer gene census genes in the PIGI results.

Results

A pathway-informed genetic interaction framework to uncover genetic interactions in pediatric cancer

We developed the pathway-informed genetic interaction framework (PIGI) to detect candidate genetic interactions by assessing CO and ME of mutated pathways in pediatric cancers. Testing for genetic interactions at the pathway level increases statistical power and exploits data in a biologically meaningful way [17]. This is of particular importance for pediatric cancer, which has low sample numbers per tumor type. Furthermore, it allows easier interpretation of affected biological pathways, as well as reduces the number of detected pathway epistatic relationships [5, 10, 11]. Shortly, the PIGI framework involves four major steps (Fig. 1; Methods). First, data are processed using single-nucleotide variants (SNVs) and indels, resulting in a gene mutation profile for each tumor sample (Fig. 1, step 1). Then, these variants are matched with their corresponding biological pathways obtained from Reactome [17] to derive pathway mutation profiles per tumor. Second, pathway mutation profiles are then tested for CO and ME (Fig. 1, step 2). Rediscover examines the pathway mutation profiles per cancer type (or cancer subtype as provided by the published datasets) for significant CO and ME pathway relationships [6, 18]. In the third step, genes associated with the CO and ME pathway pairs are retraced to gene pairs that are either CO or ME (Fig. 1, step 3). The final step involves filtering to prioritize the most significant CO and ME gene pairs (Fig. 1, step 4). We then seek evidence in literature to propose potential biological explanations of the findings. Here, we apply the PIGI framework to investigate candidate genetic interactions in two pediatric cancer genome sequencing datasets, TARGET [16] and DKFZ [15]. The TARGET dataset contains 1699 tumors across six cancer type and focuses mostly on leukemias (Suppl. Fig. 1). The DKFZ dataset is focused on central nervous tumors and comprises 23 cancer types amounting to 961 tumors in total (Suppl. Fig. 1). Taken together, these datasets provide a diverse panel of tumors that allows for investigating the strengths of the PIGI framework and its ability to detect candidate genetic interactions.

Fig. 1.

Fig. 1

Overview of the PIGI framework. Schematic representation of the PIGI framework describing the steps for obtaining and interpreting ME and CO gene pairs. Step 1, processing of the data. Step 2, applying a co-occurrence test on pathway profiles. Step 3, associate genes with their corresponding pathways to extract ME and CO gene pairs. Step 4, filtering of gene pairs and finally, interpreting the resulting gene pairs using literature

Pathway ME and CO relationships are mostly cancer type specific

Applying the PIGI framework to the TARGET and DKFZ datasets revealed a high number of ME and several CO significant relationships between pathways (Fig. 2a, b, Suppl. Table 4a, b). In the TARGET dataset, PIGI detected 388 ME and 66 CO relationships between pathways (Fig. 2a). In DKFZ, 33 ME and 8 CO relationships are detected between pathways (Fig. 2b). As expected, most of the pathway pairs are directly or indirectly related to the hallmarks of cancer [20, 21] such as proliferation-related signaling, DNA repair and cell death regulation. Individual pathways and pathway CO/ME pairs show similar properties to what has been found before for individual genes and CO/ME gene pairs regarding their association with certain cancer types [2227]. First, individual pathways are associated either with specific cancer types such as “Notch-HLH transcription pathway” (B-cell Acute Lymphoblastic Leukemia (B-ALL), Fig. 2a) and “MHC class II antigen presentation” (Retinoblastoma (RB), Fig. 2b), or found across several cancer types, such as “NCAM signaling for neurite out-growth” (Fig. 2a) or “PIP3 activates AKT signalling” (Fig. 2b) [28]. Though overall, the majority of individual pathways are found only within one cancer type (Suppl. Table 4). Second, CO/ME pathway pairs are, also, mostly cancer-type specific. Significant CO/ME pathway pairs are unique for each tested cancer type except for a few pairs (e.g. “Cellular Senescence” – “Costimulation by CD28 family”, “Cellular Senescence” – “Interferon Signaling” in Medulloblastoma-SHH (MB-SHH) and RB, Fig. 2b). This aligns with previous results for individual gene pairs which have also been found to be mostly cancer-type specific [6, 12]. Third, we detect many more ME pathway pairs than CO pathway pairs (Suppl. Fig. 3a, b), similar to what has been reported before for gene pairs found in the two datasets [12]. We observe both CO/ME pathway pairs across most cancer types while in a few cancer types we find exclusively CO or ME pathway pairs (Suppl. Fig. 3a, b). A high number of ME pathway pairs are found for B-ALL and T-cell Acute Lymphoblastic Leukemia (T-ALL) tumors in TARGET, which is likely due to the large sample size (Fig. 2a, Suppl. Fig. 1a, 3a). Most pathway pairs in DKFZ are found in retinoblastoma (RB) and High-grade glioma (HGG) (Suppl. Figure 3b). Interestingly, Neuroblastoma (NBL), Osteosarcoma (OS), Medulloblastoma (MB), and Pilocytic astrocytoma (Fig. 2b, Suppl. Fig. 3a, 3b) had only few pathway pairs, despite having a considerable number of samples. This may, at least in part, be due to the exclusion of variant types other than SNVs and indels, and these tumor types may be driven by other events. For example, OS, NBL, and MB – Group 3/4 tumors frequently contain structural variants (SVs) and copy number variants (CNVs) events contributing to pathogenesis [2936]. Taken together, these results show that most ME and CO pathway pairs follow previously found patterns in gene pairs and are mostly cancer-type specific.

Fig. 2.

Fig. 2

Significant pathway relationships detected by PIGI are largely cancer-type specific. Graph representation of mutually exclusive (ME) and co-occurring (CO) pathway pairs found by the PIGI framework in the TARGET and DKFZ datasets. Pathways are shown as nodes, ME/CO pathway pairs as solid/dashed lines respectively and colored by cancer type. The legend depicts the number of ME/CO pathway pairs in parentheses for each cancer type. a ME & CO in TARGET tumors per cancer type, ME: 381 and CO: 66. Node labels (nodes in gray) shown only for pathways with 12 or more pathway pairs to prevent visual overload. b ME & CO in DKFZ tumors, ME: 33 and CO: 8. Node labels shown only for pathways with two or more pathway pairs

PIGI controls for mutation load effects between pathways

The increased tumor mutation load is associated with increased mutations accumulated in genes and their associated pathways [37]. High mutation load differences between tumors can therefore lead to false detection of mutual exclusivity between genes and should be accounted for [37]. Similarly, mutation load differences between pathways likely exist. PIGI accounts for these by applying a Wilcoxon-Mann–Whitney test on the pathway mutation load between the tumors for which a particular pathway pair was found to be ME (Methods). By comparing the number of pathway pairs per cancer type with the corresponding cancer type-mutation loads (Suppl. Fig. 3a, c (TARGET) and 3b, d (DKFZ)), the number of pathway pairs per cancer type appears irrespective of the median tumor mutation load. B-ALL has a lower median mutation load compared to T-ALL, and OS, probably due to enrichment of subtypes with underlying aneuploidy or chromosomal rearrangements [38, 39], even though it has the highest number of ME gene pairs (Suppl. Fig. 3a, c). OS tumors show one of the highest median mutation load but have one of the lowest numbers of pathway pairs. Similarly, in DKFZ, HGG-K27M tumors show the highest median mutation load but have low number of pathways (Suppl. Fig. 3b, d). The highest number of pathway pairs is observed in RB which has one of lowest median mutation load. Overall, PIGI test sufficiently accounts for mutation load effects on pathway pairs.

Fig. 3.

Fig. 3

Co-occurring (CO) and mutually exclusive (ME) genes found by PIGI test in TARGET and DKFZ. a Graph depicting CO and ME gene pairs found in tumors of both TARGET and DKFZ. Genes are shown as nodes, ME/CO gene pairs as solid/dashed lines respectively and colored by cancer type. The legend depicts the color per cancer type, shape for ME/CO, and in parentheses, the number of tested tumors and number of gene pairs per cancer type. b Bar plot showing the numbers of CO and ME gene pairs found per cancer type in TARGET and DKFZ, c Graph depicting CO and ME gene pairs (after prioritization of B-ALL findings) found in tumors of both TARGET and DKFZ. Legend as in a, except for the number of B-ALL gene pairs: 26

PIGI identifies several pathway-associated candidate genetic interactions

We next examined the results from the gene retracing step to identify individual CO and ME gene pairs (Fig. 3a, Suppl. Table 5). The gene pairs were retraced from CO/ME pathway pairs (FDR < 0.05), followed by removing gene pairs found in the same pathway, correcting for mutation load differences and applying a pathway-gene-mutation frequency threshold (see Methods section). We observe a considerably higher number of B-ALL gene pairs compared to the other cancer types, which are also exclusively ME (Fig. 3b). This likely reflects both the higher statistical power in B-ALL and the stringency of the p-value threshold. A more lenient threshold would benefit the detection of gene pairs in smaller cancer types (FDR < 0.2; Suppl. Table 6), albeit at the cost of increasing the number of likely false positives. Also, we observe a bias towards ME pairs, which has been observed before and can be attributed to primary tumors not being subjected to selective pressure, e.g., treatment [6]. It is also likely that the low number of co-occurrences is due to the aggregation of protein complex genes or functionally related genes that co-occur in pathways that are advantageous for the cancer cells. This appears somewhat counterintuitive, as a single gene mutation is anticipated to be enough to disrupt a function; thus, gene mutations in these protein complexes or pathways are expected to appear mutually exclusive. However, this disrupting effect is only possible if the gene mutation has a complete functional penetrance. That is not always the case, as we observe co-occurring gene pairs found previously by the CO/ME gene test (CO/ME-GT) on the same dataset by Daub et al. [12] (from now on referred as CO/ME-GT referring to the gene test results from this study) to be aggregated within pathways. For example, the co-occurring JAK1-JAK3 in T-ALL tumors are aggregated within the “MAPK1/MAPK3 signaling” pathway (Suppl. Fig. 4a) and the ATRX-H3F3A in HGG-other tumors are aggregated within the “Telomere Maintenance” pathway (Suppl. Fig. 4b).

Next, for annotation purposes, we restricted B-ALL gene pairs to the ones involving the top 15 highest mutation frequency genes (Methods). These results (Fig. 3c) consisted largely of new gene pairs and candidate genetic interactions that have not been observed before in these datasets in CO/ME-GT gene pairs [12]. The only previously detected pair is the TP53-DROSHA ME in Wilms tumors (WT), which was observed in 111 TARGET Wilms tumors. Whereas here, TP53-DROSHA ME was detected in the much smaller set of 40 DKFZ WTs. PIGI was able to retrieve this finding with ~ 60% fewer tumor samples. Interestingly, PIGI does not detect a significant TP53-DROSHA ME in TARGET WTs. This is likely due to the number of ME tumors in TARGET and DKFZ, which may not be sufficient to reach statistical significance under the current FDR thresholds. To investigate this further, we compared the number of tumors supporting the TP53-DROSHA ME as detected by PIGI and CO/ME-GT in both TARGET and DKFZ (Suppl. Fig. 5a, b, c, d) [12]. In TARGET, CO/ME-GT found TP53-DROSHA ME supported by 31/115 tumors (~ 27%) (Suppl. Fig. 5a), with complete mutual exclusivity and an FDR ranging from 0.0867 to 0.1033. In contrast, PIGI identified TP53-DROSHA related pathways (here presenting the ME pathways with the lowest FDR detected) in 44/110 tumors (~ 40%), with two tumors overlapping and FDR of 0.143. Despite the increase in tumors supporting the mutual exclusivity, the increase is not large enough to reach statistical significance. On the other hand, CO/ME-GT in DKFZ tested TP53-DROSHA ME in 8/44 tumors (~ 18%) and was not considered significant (FDR > 0.2). While PIGI detected TP53-DROSHA ME through pathways in 19/33 tumors (~ 56%) with pathway FDR of 0.0031. Here, the substantial increase in ME supporting tumors—more than half of the dataset—was sufficient to reach statistical significance. Taken together, pathway-level analysis with PIGI can amplify ME signals in small datasets such as DKFZ WT and underscore the importance of the chosen FDR threshold. A more lenient threshold, as used by CO/ME-GT, allows PIGI to detect significantly ME pathways that contain TP53-DROSHA in TARGET WT. However, these results are disregarded from the downstream analysis due to significantly different mutation load between the tumors of the two pathways. Albeit a sole case of recovering known findings in smaller datasets, it indicates the potential of using pathways for identifying genetic interactions in smaller datasets.

A prioritized set of gene pairs reveals candidate synthetic lethal genetic interactions

The literature search revealed several candidate synthetic lethal gene pairs (Fig. 4 and Suppl. Table 7). We grouped CO/ME gene pairs according to their likely biological explanation: 4 candidate synthetic sick/lethal, 20 unknown, 7 pathway epistasis, 4 false positives, and 2 cancer subtype. The candidate synthetic sick/lethal genetic interactions consist of TP53-DROSHA, observed before in CO/ME-GT [12], the new TP53-DGCR8 pair in WT, and CREBBP-WHSC1 and SYNE1-TP53 in B-ALL. The candidate synthetic lethal gene pairs represent ~ 11% (4/35 of the found CO/ME gene pairs by PIGI, only slightly lower than the ~ 13% (3/24) found in CO/ME-GT [12] for both datasets combined (Suppl. Table 8). The largest proportion of the gene pairs lacks supportive literature evidence. Thus, their biological explanation remains unknown and warrants further investigation. Despite the unknown gene pairs, the results are enriched with genes that play an important role in cancer. The dNdS ratio, ratio of non-synonymous to synonymous substitutions, is a measure for natural selection in genes and positive selection indicates likely cancer driver genes. The majority of genes in the gene pairs represent cancer drivers, as 27/33 had a significant dNdS ratio for their relevant cancer types. Also, the genes pairs are significantly (p-value 1.76e-29) enriched for cancer gene census genes (25/33). Additionally, frequently mutated genes such as TP53, KRAS, and NRAS are mutually exclusive with less often mutated genes such as RYR1 and UBA2. PIGI does not explicitly address gene size. Consequently, this could lead to artifacts caused by mutation load differences when genes of varying lengths are included in the analysis, particularly for long genes (gene length > 100 kb). However, that is not the case for most of the findings. For the majority of gene pairs, the Wilcoxon-Mann–Whitney p-value for mutation load differences is above 0.1. Only 32/301 gene pairs have a p-value between 0.1 and 0.05 that could warrant further attention regarding their mutation loads. On the other hand, these low-frequency genes are relatively understudied within the context of the cancer type in which they were found, leading to an inconclusive biological explanation. For example, KRAS and CTCF are mutually exclusive and have largely different biological roles according to literature. While KRAS is well-known for its role in MAPK and PI3K signaling and thereby regulating proliferation [40, 41], CTCF is involved in regulating 3D genome organization, mRNA splicing and, DNA recombination [42]. These processes are not directly related to each other and therefore cannot explain their potential interaction. Few ME gene pairs (n = 2) can likely be explained through cancer subtype not recorded in the datasets (2/35, ~ 6%), a number much lower than the ~ 38% (9/24) proposed before [12]. This may be due to the aggregation of genes into pathways. For example, in B-ALL low-hypodiploid subtype, TP53, RB1 and IKZF2 are frequently mutated, whereas in the near-haploid B-ALL subtype, IKZF3 and genes involved in RTK and RAS signaling are targeted. Despite both having different genetic alterations, both have been shown to display activation of the RAS and PI3K signaling pathways [39]. If tumors from these two subtypes are included in PIGI, due their shared pathway effects, they would be grouped in the pathway analysis, subsequently removing the potential ME confounded by their subtypes. When examining the results of PIGI and CO/ME-GT [12], KRAS-TP53, previously found to be ME in the CO/ME-GT [12] and each gene mutation being associated with different cancer subtype, is now aggregated through the pathway ‘Transcriptional regulation by RUNX3’ (Suppl. Figure 6) and no longer detected as ME by PIGI. Thus, pathway-level aggregation in PIGI can resolve some of the false mutual exclusivity caused by subtype-specific gene mutations. Overall, PIGI indicated four new candidate synthetic lethal genetic interactions and 20 new gene pairs with, to our knowledge, roles yet to be discovered.

Fig. 4.

Fig. 4

Biological interpretation of prioritized CO/ME gene pairs found in tumors of TARGET & DKFZ and examples of PIGI decreasing pathway epistasis. a Graph representation of CO and ME gene pairs demonstrating their likely biological interpretation: synthetic lethality (red), false positive (brown), unknown (blue), subtype (grey), or pathway epistasis (green). Colored nodes show genes that have significant dNdS ratio. Bold gene names describe genes of the cancer gene census, b Heatmap of “PIP3 activates AKT signaling” pathway in AML tumors (TARGET) showing mutual exclusivity of FLT3-KIT within the pathway. c Heatmap of the FLT3 Signaling pathway in B-ALL tumors (TARGET) showing mutual exclusivity between FLT3-KRAS. d Heatmap of the “Hedgehog ‘on’ state” in MB-SHH tumors (DKFZ) showing mutual exclusivity between PTCH1-SMO

Pathways in PIGI framework decrease pathway epistasis

Pathway epistasis, where the same downstream functional effect is caused by mutating individual genes within the same pathway, often confounds mutual exclusivity analyses when searching for synthetic lethality [5, 10, 11]. Despite PIGI employing pathways, ME findings (Fig. 4a) still contain a few gene pairs characterized by pathway epistasis, e.g., TP53-PTCH1 in MB-SHH [43], NRAS-KMT2D [44], PTPN11-UBA2 in B-ALL [45]. This reflects the incompleteness of the pathway database, and pathway epistasis that also assumes pathways are linear with no crosstalk between pathways. Therefore, employing pathways will not completely avoid pathway epistasis. Still, we observe a decrease of pathway epistasis findings, ~ 20% (7/35) compared to the ~ 42% (10/24 ME gene pairs) observed in Daub et al. [12]. Three examples of pathway epistasis no longer confounding mutual exclusivity analyses are depicted in Fig. 4b, c, d. FLT3-KIT (Acute Myeloid Leukemia (AML), Fig. 4b), FLT3-KRAS (B-ALL, Fig. 4c), and PTCH1-SMO (MB-SSH, Fig. 4d) all show clear mutual exclusivity within the same pathway. PIGI removes these ME findings by aggregating individual genes within pathways, thereby decreasing the confounding effects of pathway epistasis when searching for synthetic sick/lethal gene pairs.

New candidate synthetic lethal genetic interactions involving oncogenes/tumor suppressors genes or DNA repair genes

Investigating vulnerabilities, such as those arising from synthetic lethal genetic interactions within DNA damage repair (DDR) pathways, has enabled the development of new targeted therapies. This approach has led to the breakthrough discovery of PARP inhibitors, which exploit defects in homologous recombination, but also other inhibitors for ATR, ATM, WEE1, and PRMT [4651]. Here, we investigated PIGI findings (TP53-DROSHA, TP53-DGCR8, SYNE1-TP53, CREBBP-WHSC1) in the literature and their relation to DDR. These genes are implicated directly or indirectly in DDR, and it is probable that mutations in these gene pairs can lead to a synthetic sick/lethal phenotype. Also, some of these genes are known tumor suppressor genes (TP53, CREBBP) or oncogenes (WHSC1).

Here, we postulate a literature-based biological hypothesis of the function of these candidate synthetic sick/lethal genetic interactions. TP53 and DROSHA play an important role in DDR. TP53 controls G1 to S cell cycle progression, and loss of TP53 is associated with a higher number of double-stranded breaks (DSBs) [52]. DROSHA plays, in addition to its well-described function in microRNA processing, a role in DDR pathways by processing DNA-damage-induced long non-coding RNAs [5355]. Loss of either DROSHA in combination with loss of TP53, could increase the number of DSBs to a state that can be intolerable and fatal for the cell (Fig. 5a). The same effect is expected for TP53-DGCR8, as DGCR8 is part of the microprocessor complex together with DROSHA [54, 56], and loss of DGCR8 could potentially disrupt the functionality of this protein complex. Secondly, SYNE1 and TP53 constitute another case of potential SL. SYNE1 is part of the LINC complex which is important for correct DNA repair by surrounding the damaged DNA locus with a nuclear envelope fold (Fig. 5b) [5759]. We anticipate that upon loss of TP53 and SYNE1, DNA repair is hampered, possibly leading to synthetic sickness or lethality. Finally, CREBBP and WHSC1 are both playing an important role in DDR. WHSC1 is an oncogene that initiates non-homologous end joining and is involved in epithelial-to-mesenchymal transition [60]. Whereas CREBBP is a direct regulator of DDR [61]. Due to their substantial role in DDR, targeting both genes could result in high levels of unrepaired DNA and subsequently a synthetic sick/lethal effect (Suppl. Figure 7). Taken together, these examples highlight the potential of detecting likely synthetic sick/lethal relationships with PIGI, although follow up experimentation is necessary to assess the validity of these hypotheses.

Fig. 5.

Fig. 5

Biological interpretation of candidate synthetic lethal gene pairs found by PIGI test. a Top: function of wild-type TP53, DROSHA in DNA strand break repair processes; homologous recombination (HR) and non-homologous end joining (NHEJ). Bottom: loss-of-function TP53, DROSHA and their effect in DNA damage repair processes could lead to increased accumulation of double-strand breaks and genomic instability. b Top: wild-type function of TP53 and SYNE1 via the LINC complex in DNA damage repair (DDR). Bottom: loss-of-function TP53, SYNE1, and their effect in DDR processes. Failure to pause cell cycle progression in combination with suboptimal nuclear mechanics could lead to increased accumulation of unrepaired DNA. Part of the figure is adapted from Lottersberger et al. 2015

Overall, PIGI framework successfully identified new mutually exclusive (ME) gene pairs in the two primary pediatric cancer datasets, TARGET and DKFZ. It uncovered new gene pair relationships, spotlighted potential SL candidates by decreasing pathway epistasis, subtype effects, and cooperative interactions. It detected a known ME gene pair in the much smaller dataset of the DKFZ Wilms tumors. Finally, PIGI revealed several ME gene pairs of unknown roles and four ME gene pairs that are promising candidate SL genetic interactions.

Discussion

Here we present the pathway-informed genetic interaction framework (PIGI) to leverage tumor mutations in inferring candidate GIs. To our knowledge, this is the first study that employs mutated pathways to infer genetic interactions in pediatric cancer. The PIGI framework employs a statistical test to explore ME and CO between pairs of pathways to retrieve CO and ME relationships between gene pairs. The framework detected 35 ME and 2 CO relationships between gene pairs across 29 cancer types. PIGI spotlighted potential SL candidates by decreasing pathway epistasis, subtype effects, and cooperative interactions. Supporting evidence indicated 4 gene pairs as candidate synthetic lethal, CREBBP-WHSC1, SYNE1-TP53 in B-ALL, and TP53-DROSHA, TP53-DGCR8 in WT, and 20 gene pairs of unknown impact.

PIGI provides a first analysis that exploits pathways for genetic interaction inference in pediatric cancer. Several studies have previously used prior knowledge, such as pathways or protein–protein/functional networks, to detect genetic interactions in adult cancer [6, 10, 62, 63]. Among these approaches, Mutex and BriDGE utilize pathways. Mutex employs signaling pathways to detect gene sets with a shared downstream effect and identify mutual exclusivity in TCGA cancer genomes [62]. BriDGE incorporates a large selection of pathways (833) from KEGG, Biocarta and Reactome to identify genetic interactions in genome-wide association studies [63]. PIGI, however, uses a set of 413 pathways that cover many biological processes to detect candidate genetic interactions. Also, this set of pathways is comprised only by Reactome to ensure up-to-date pathways are included which are recorded in the same consistent and systematic manner. While the aforementioned studies focused on adult cancer, PIGI was designed to investigate pediatric cancer genomes which often face limitations in statistical power. The shorter but comprehensive set of pathways in the PIGI framework aids in decreasing the number of tested variables while still decreasing pathway epistasis and cancer subtype effects. Essentially, PIGI provides a biologically informed testing approach towards ME to identify potentially synthetic sick/lethal genetic interactions.

Pathways confer advantages in pathway analyses, but they also entail limitations. Results are dependent on the completeness of the pathways and their definition. Crosstalk or tissue-specific functional relationships are not always adequately recorded [5, 17, 64]. Consequently, this still retains some pathway epistasis ME with the PIGI framework. Another challenge is the lack of dedicated pathway resources describing the processes across different developmental stages and ages to describe pediatric cancer, a disease stemming from developing tissues [65, 66]. One way to cope with the aforementioned challenges is to retain only active pathways within each tumor by inspecting transcriptome data. However, this analysis can be challenging due to the uncertainty about to what extent developmental pathways in Reactome are representative of human developing tissues. Finally, crosstalk between pathways leads to several pathways having overlapping genes, e.g., TP53 is present in several pathways in Reactome. Albeit biologically sound, it can lead to statistical inflation and incorrect findings. The PIGI framework accounts for that by applying strict thresholds, removing ME/CO pathways with overlapping genes and ME genes that are part of the same pathway.

Confounders pose an extra challenge in the PIGI framework and other similar methods. Cancer subtypes confound mutual exclusivity testing, leading to false candidate synthetic lethal pairs. This can be solved by performing a mutual exclusivity test per cancer subtype [5]. However, this solution is not well-suited for pediatric cancer datasets due to the already limited number of samples per cancer type. The PIGI framework decreases cancer subtype effects by grouping the subtype-associated mutated genes into their common pathway. Conversely, tumor mutation load can affect mutual exclusivity detection. Applying a mutual exclusivity test to tumors with high differences between their mutation loads leads to false-positive mutually exclusive gene findings [37]. Likewise, mutated pathways in tumors with high mutation load will falsely appear as significantly mutually exclusive with pathways derived from tumors with lower mutation load. Unlike BriDGE and Mutex, PIGI controls mutation load effects for both pathway and gene pairs by applying a Wilcoxon-Mann–Whitney test between the tumors of the two pathways or genes.

Completeness of pediatric cancer datasets is a significant parameter in applying genetic interaction inference methodologies. A low level of dataset completeness leads to low sample sizes. Thus, combining pediatric cancer datasets can be beneficial in increasing dataset completeness and, consequently, uncovering new candidate genetic interactions in rare cancer types. However, combining datasets can introduce confounders, such as variations in experimental protocols and analysis pipelines, making it crucial to account for these differences during analysis. Exploring other layers of the mutational landscape of pediatric cancer may also be informative to increase dataset completeness. SNVs and indels constitute an incomplete representation of the mutational landscape within tumor cells, especially for cancers mainly driven by copy number variants (CNVs) or structural variants (SVs). Gradually, more pediatric cancer studies have been integrating SVs and CNVs in ME testing [67, 68] as it confers an advantage of describing the complete mutational landscape of each cancer type and increasing the chance of revealing candidate SL genetic interactions in more cancer types. We deem that gene variant integration will be valuable, not only in applying PIGI or other pathway CO/ME testing methods but also on gene-level CO/ME testing, in maximizing the discovery potential of candidate SL genetic interactions in pediatric cancer.

Lastly, the identification of candidate genetic interactions for experimental validation still remains a challenge. Determining the biological explanation or mechanism of the CO/ME gene pairs entails a great deal of uncertainty due to lack of literature, low recall of search engines and human error. Consequently, most of the gene pairs in this study have an unknown effect (20/37), and a small fraction are candidate SL or sick (4/37). This emphasizes the need for improved strategies to interpret or even predict the underlying mechanism. Literature review currently is the preferred approach for inference studies, but remains laborious, time-consuming and forces the interpretation of only a prioritized subset of gene pairs. Emerging AI-based approaches could address this challenge by enabling more systematic, faster and comprehensive literature search and integrating multiple lines of evidence [6971]. Therefore, allowing the identification of more candidates which otherwise may be missed through manual searches.

Conclusions

In summary, PIGI is a valuable testing framework for studying pediatric cancer genomes to identify new candidate synthetic lethal genetic interactions using publicly available data. While genome-wide screens of tumor-derived cell lines are the only direct way to detect genetic interactions, PIGI provides candidate tumor-derived genetic interactions for guided, follow-up experimentation in pediatric cancer.

Supplementary Information

Supplementary Material 2. (504.2KB, zip)

Acknowledgements

The authors thank Puck Veen, Femke Albertsboer, Joanna von Berg and fellow lab members for insightful discussions and helpful feedback.

Abbreviations

AML

Acute Myeloid leukemia

B-ALL

B-cell Acute Lymphoblastic Leukemia

CO

Co-occurrence

CNV

Copy Number Variant

DDR

DNA Damage Repair

DSB

Double Strand Break

FDR

False Discovery Rate

GI

Genetic Interaction

HGG-other

High-grade glioma—other

MB

Medulloblastoma

MB-SHH

Medulloblastoma—SHH

ME

Mutual Exclusivity

NBL

Neuroblastoma

OS

Osteosarcoma

PIGI

Pathway-Informed Genetic Interaction framework

RB

Retinoblastoma

SL

Synthetic Lethal

SNV

Single Nucleotide Variant

SV

Structural Variant

T-ALL

T-cell Acute Lymphoblastic Leukemia

WGS

Whole Genome Sequencing

WES

Whole Exome Sequencing

WT

Wilms tumor

Authors’ contributions

Conceptualization: A.S., J.D., P.K., Methodology: A.S., R.G., P.K., Analysis: A.S., Supervision: J.D., P.K. Manuscript writing: A.S., P.K., Manuscript revision: A.S., R.G., J.D., P.K. All authors have read and approved the final version.

Funding

We acknowledge with gratitude the financial assistance for this study provided by Foundation Children Cancer Free (KiKa core funding) and the Dutch Cancer Society (KWF), grant #10354.

Data availability

All data are available with accession code/identifiers as indicated in Ethics approval section. All mutation data of TARGET and DKFZ datasets can be inspected in their respective portals [https://www.cancer.gov/ccg/](https://www.cancer.gov/ccg) and [https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?dscope=DKFZ\_PED&option=about\_dscope](https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?dscope=DKFZ_PED&option=about_dscope). Code availability on: [https://doi.org/10.5281/zenodo.15881284](https://doi.org/10.5281/zenodo.15881284).

Declarations

Ethics approval and consent to participate

The Data Access Committees of TARGET and DKFZ datasets have approved the sharing and use of the tumor data for this study. Accession codes for DKFZ: RP012816, PRJEB11430 (European Nucleotide Archive); EGAS00001001139, EGAS00001001953, EGAS00001000607, EGAS00001000381, EGAS00001000906, EGAS00001001297, EGAS00001000443, EGAS00001000213, EGAS00001000263, EGAS00001000192, EGAS00001000255, EGAS00001000254, EGAS00001000253, EGAS00001000256, EGAS00001000246, EGAS00001000379, EGAS00001000380, EGAS00001000346, EGAS00001000349, EGAS00001000347, EGAS00001000192 (European Genome-Phenome Archive). Study identifiers for TARGET: phs000218, phs000463, phs000464, phs000465, phs000467, phs000471, and phs000468.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Ashworth A, Lord CJ, Reis-Filho JS. Genetic interactions in cancer progression and treatment. Cell. 2011;145:30–8. 10.1016/j.cell.2011.03.020. [DOI] [PubMed] [Google Scholar]
  • 2.Dharia NV, Kugener G, Guenther LM, Malone CF, Durbin AD, Hong AL, et al. A first-generation pediatric cancer dependency map. Nat Genet. 2021;53:529–38. 10.1038/s41588-021-00819-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Dixit A, Parnas O, Li B, Chen J, Fulco CP, Jerby-Arnon L, et al. Perturb-Seq: dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell. 2016;167:1853-1866.e17. 10.1016/j.cell.2016.11.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tsherniak A, Vazquez F, Montgomery PG, Weir BA, Kryukov G, Cowley GS, et al. Defining a cancer dependency map. Cell. 2017;170:564-576.e16. 10.1016/j.cell.2017.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Deng Y, Luo S, Deng C, Luo T, Yin W, Zhang H, et al. Identifying mutual exclusivity across cancer genomes: computational approaches to discover genetic interaction and reveal tumor vulnerability. Brief Bioinform. 2019;20:254–66. 10.1093/bib/bbx109. [DOI] [PubMed] [Google Scholar]
  • 6.Canisius S, Martens JWM, Wessels LFA. A novel independence test for somatic alterations in cancer shows that biology drives mutual exclusivity but chance explains most co-occurrence. Genome Biol. 2016;17:261. 10.1186/s13059-016-1114-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kim Y-A, Madan S, Przytycka TM. WeSME: uncovering mutual exclusivity of cancer drivers and beyond. Bioinformatics. 2017;33:814–21. 10.1093/bioinformatics/btw242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Leiserson MDM, Blokh D, Sharan R, Raphael BJ. Simultaneous identification of multiple driver pathways in cancer. PLoS Comput Biol. 2013;9:e1003054. 10.1371/journal.pcbi.1003054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Srihari S, Singla J, Wong L, Ragan MA. Inferring synthetic lethal interactions from mutual exclusivity of genetic events in cancer. Biol Direct. 2015;10:57. 10.1186/s13062-015-0086-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ciriello G, Cerami E, Sander C, Schultz N. Mutual exclusivity analysis identifies oncogenic network modules. Genome Res. 2012;22:398–406. 10.1101/gr.125567.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Cisowski J, Bergo MO. What makes oncogenes mutually exclusive? Small GTPases. 2017;8:187–92. 10.1080/21541248.2016.1212689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Daub JT, Amini S, Kersjes DJE, Ma X, Jäger N, Zhang J, et al. A systematic analysis of genetic interactions and their underlying biology in childhood cancer. Commun Biol. 2021;4:1139. 10.1038/s42003-021-02647-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kattner P, Strobel H, Khoshnevis N, Grunert M, Bartholomae S, Pruss M, et al. Compare and contrast: pediatric cancer versus adult malignancies. Cancer Metastasis Rev. 2019;38:673–82. 10.1007/s10555-019-09836-y. [DOI] [PubMed] [Google Scholar]
  • 14.Ideker T, Dutkowski J, Hood L. Boosting signal-to-noise in complex biology: prior knowledge is power. Cell. 2011;144:860–3. 10.1016/j.cell.2011.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Gröbner SN, Worst BC, Weischenfeldt J, Buchhalter I, Kleinheinz K, Rudneva VA, et al. The landscape of genomic alterations across childhood cancers. Nature. 2018. 10.1038/nature25480. [DOI] [PubMed] [Google Scholar]
  • 16.Ma X, Liu Y, Liu Y, Alexandrov LB, Edmonson MN, Gawad C, et al. Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours. Nature. 2018;555:371–6. 10.1038/nature25795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Croft D, O’Kelly G, Wu G, Haw R, Gillespie M, Matthews L, et al. Reactome: a database of reactions, pathways and biological processes. Nucleic Acids Res. 2011;39:D691-697. 10.1093/nar/gkq1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ferrer-Bonsoms JA, Jareno L, Rubio A. Rediscover: an R package to identify mutually exclusive mutations. Bioinformatics. 2022;38:844–5. 10.1093/bioinformatics/btab709. [DOI] [PubMed] [Google Scholar]
  • 19.Martincorena I, Raine KM, Gerstung M, Dawson KJ, Haase K, Van Loo P, et al. Universal patterns of selection in cancer and somatic tissues. Cell. 2017;171:1029-1041.e21. 10.1016/j.cell.2017.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hanahan D. Hallmarks of cancer: new dimensions. Cancer Discov. 2022;12:31–46. 10.1158/2159-8290.CD-21-1059. [DOI] [PubMed] [Google Scholar]
  • 21.Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100:57–70. 10.1016/S0092-8674(00)81683-9. [DOI] [PubMed] [Google Scholar]
  • 22.Fodde R. The APC gene in colorectal cancer. Eur J Cancer. 2002;38:867–71. 10.1016/S0959-8049(02)00040-0. [DOI] [PubMed] [Google Scholar]
  • 23.Kiyoi H, Naoe T. FLT3 mutations in acute myeloid leukemia. Methods Mol Med. 2006;125:189–97. 10.1385/1-59745-017-0:189. [DOI] [PubMed] [Google Scholar]
  • 24.Midha A, Dearden S, McCormack R. EGFR mutation incidence in non-small-cell lung cancer of adenocarcinoma histology: a systematic review and global map by ethnicity (mutMapII). Am J Cancer Res. 2015;5:2892–911. [PMC free article] [PubMed] [Google Scholar]
  • 25.Segditsas S, Tomlinson I. Colorectal cancer and genetic alterations in the Wnt pathway. Oncogene. 2006;25:7531–7. 10.1038/sj.onc.1210059. [DOI] [PubMed] [Google Scholar]
  • 26.Olivier M, Hollstein M, Hainaut P. TP53 mutations in human cancers: origins, consequences, and clinical use. Cold Spring Harb Perspect Biol. 2010;2:a001008. 10.1101/cshperspect.a001008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Prior IA, Lewis PD, Mattos C. A comprehensive survey of Ras mutations in cancer. Cancer Res. 2012;72:2457–67. 10.1158/0008-5472.CAN-11-2612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Ylipää A, Yli-Harja O, Zhang W, Nykter M. Characterization of aberrant pathways across human cancers. BMC Syst Biol. 2013;7:S1. 10.1186/1752-0509-7-S1-S1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chen X, Bahrami A, Pappo A, Easton J, Dalton J, Hedlund E, et al. Recurrent somatic structural variations contribute to tumorigenesis in pediatric osteosarcoma. Cell Rep. 2014;7:104–12. 10.1016/j.celrep.2014.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Cortés-Ciriano I, Lee JJ-K, Xi R, Jain D, Jung YL, Yang L, et al. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat Genet. 2020;52:331–41. 10.1038/s41588-019-0576-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Huang M, Weiss WA. Neuroblastoma and MYCN. Cold Spring Harb Perspect Med. 2013;3:a014415. 10.1101/cshperspect.a014415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Valentijn LJ, Koster J, Zwijnenburg DA, Hasselt NE, Van Sluis P, Volckmann R, et al. TERT rearrangements are frequent in neuroblastoma and identify aggressive tumors. Nat Genet. 2015;47:1411–4. 10.1038/ng.3438. [DOI] [PubMed] [Google Scholar]
  • 33.Rodriguez-Fos E, Planas-Fèlix M, Burkert M, Puiggròs M, Toedling J, Thiessen N, et al. Mutational topography reflects clinical neuroblastoma heterogeneity. Cell Genom. 2023;3:100402. 10.1016/j.xgen.2023.100402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Northcott PA, Shih DJH, Peacock J, Garzia L, Sorana Morrissy A, Zichner T, et al. Subgroup-specific structural variation across 1,000 medulloblastoma genomes. Nature. 2012;488:49–56. 10.1038/nature11327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Northcott PA, Buchhalter I, Morrissy AS, Hovestadt V, Weischenfeldt J, Ehrenberger T, et al. The whole-genome landscape of medulloblastoma subtypes. Nature. 2017;547:311–7. 10.1038/nature22973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Hovestadt V, Ayrault O, Swartling FJ, Robinson GW, Pfister SM, Northcott PA. Medulloblastomics revisited: biological and clinical insights from thousands of patients. Nat Rev Cancer. 2020;20:42–56. 10.1038/s41568-019-0223-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.van de Haar J, Canisius S, Yu MK, Voest EE, Wessels LF, Ideker T. Identifying epistasis in cancer genomes: a delicate affair. Cell. 2019;177:1375. 10.1016/j.cell.2019.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Brady SW, Roberts KG, Gu Z, Shi L, Pounds S, Pei D, et al. The genomic landscape of pediatric acute lymphoblastic leukemia. Nat Genet. 2022;54:1376–89. 10.1038/s41588-022-01159-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Woo JS, Alberti MO, Tirado CA. Childhood B-acute lymphoblastic leukemia: a genetic update. Exp Hematol Oncol. 2014;3:16. 10.1186/2162-3619-3-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ryan MB, Corcoran RB. Therapeutic strategies to target RAS-mutant cancers. Nat Rev Clin Oncol. 2018;15:709–20. 10.1038/s41571-018-0105-0. [DOI] [PubMed] [Google Scholar]
  • 41.Pantsar T. The current understanding of KRAS protein structure and dynamics. Comput Struct Biotechnol J. 2020;18:189–98. 10.1016/j.csbj.2019.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Ong C-T, Corces VG. CTCF: an architectural protein bridging genome topology and function. Nat Rev Genet. 2014;15:234–46. 10.1038/nrg3663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Abe Y, Oda-Sato E, Tobiume K, Kawauchi K, Taya Y, Okamoto K, et al. Hedgehog signaling overrides p53-mediated tumor suppression by activating Mdm2. Proc Natl Acad Sci U S A. 2008;105:4838–43. 10.1073/pnas.0712216105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Pan Y, Han H, Hu H, Wang H, Song Y, Hao Y, et al. KMT2D deficiency drives lung squamous cell carcinoma and hypersensitivity to RTK-RAS inhibition. Cancer Cell. 2023;41:88-105.e8. 10.1016/j.ccell.2022.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Deng R, Zhao X, Qu Y, Chen C, Zhu C, Zhang H, et al. Shp2 SUMOylation promotes ERK activation and hepatocellular carcinoma development. Oncotarget. 2015;6:9355–69. 10.18632/oncotarget.3323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Cheng B, Pan W, Xing Y, Xiao Y, Chen J, Xu Z. Recent advances in DDR (DNA damage response) inhibitors for cancer therapy. Eur J Med Chem. 2022;230:114109. 10.1016/j.ejmech.2022.114109. [DOI] [PubMed] [Google Scholar]
  • 47.Dominici C, Sgarioto N, Yu Z, Sesma-Sanz L, Masson J-Y, Richard S, et al. Synergistic effects of type I PRMT and PARP inhibitors against non-small cell lung cancer cells. Clin Epigenetics. 2021;13:54. 10.1186/s13148-021-01037-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Kaufman B, Shapira-Frommer R, Schmutzler RK, Audeh MW, Friedlander M, Balmaña J, et al. Olaparib monotherapy in patients with advanced cancer and a germline BRCA1/2 mutation. J Clin Oncol. 2015;33:244–50. 10.1200/JCO.2014.56.2728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kwok M, Davies N, Agathanggelou A, Smith E, Oldreive C, Petermann E, et al. ATR inhibition induces synthetic lethality and overcomes chemoresistance in TP53- or ATM-defective chronic lymphocytic leukemia cells. Blood. 2016;127:582–95. 10.1182/blood-2015-05-644872. [DOI] [PubMed] [Google Scholar]
  • 50.Peng S, Long M, Chen Q, Yin Z, Zeng C, Zhang W, et al. Perspectives on cancer therapy—synthetic lethal precision medicine strategies, molecular mechanisms, therapeutic targets and current technical challenges. Cell Death Discov. 2025;11:1–17. 10.1038/s41420-025-02418-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wang M, Ran X, Leung W, Kawale A, Saxena S, Ouyang J, et al. ATR inhibition induces synthetic lethality in mismatch repair-deficient cells and augments immunotherapy. Genes Dev. 2023;37:929–43. 10.1101/gad.351084.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Speidel D. The role of DNA damage responses in p53 biology. Arch Toxicol. 2015;89:501–17. 10.1007/s00204-015-1459-z. [DOI] [PubMed] [Google Scholar]
  • 53.Cabrini M, Roncador M, Galbiati A, Cipolla L, Maffia A, Iannelli F, et al. DROSHA is recruited to DNA damage sites by the MRN complex to promote non-homologous end joining. J Cell Sci. 2021;134:jcs249706. 10.1242/jcs.249706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hang Q, Zeng L, Wang L, Nie L, Yao F, Teng H, et al. Non-canonical function of DGCR8 in DNA double-strand break repair signaling and tumor radioresistance. Nat Commun. 2021;12:4033. 10.1038/s41467-021-24298-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Lu W-T, Hawley BR, Skalka GL, Baldock RA, Smith EM, Bader AS, et al. Drosha drives the formation of DNA:RNA hybrids around DNA break sites to facilitate DNA repair. Nat Commun. 2018;9:532. 10.1038/s41467-018-02893-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ha M, Kim VN. Regulation of microRNA biogenesis. Nat Rev Mol Cell Biol. 2014;15:509–24. 10.1038/nrm3838. [DOI] [PubMed] [Google Scholar]
  • 57.Lottersberger F, Karssemeijer RA, Dimitrova N, de Lange T. 53BP1 and the LINC complex promote microtubule-dependent DSB mobility and DNA repair. Cell. 2015;163:880–93. 10.1016/j.cell.2015.09.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Sur I, Neumann S, Noegel AA. Nesprin-1 role in DNA damage response. Nucleus. 2014;5:173–91. 10.4161/nucl.29023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Sur-Erdem I, Hussain MS, Asif M, Pınarbası N, Aksu AC, Noegel AA. Nesprin-1 impact on tumorigenic cell phenotypes. Mol Biol Rep. 2020;47:921–34. 10.1007/s11033-019-05184-w. [DOI] [PubMed] [Google Scholar]
  • 60.Chen R, Chen Y, Zhao W, Fang C, Zhou W, Yang X, et al. The role of methyltransferase NSD2 as a potential oncogene in human solid tumors. OTT. 2020;13:6837–46. 10.2147/OTT.S259873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Dutto I, Scalera C, Prosperi E. CREBBP and p300 lysine acetyl transferases in the DNA damage response. Cell Mol Life Sci. 2018;75:1325–38. 10.1007/s00018-017-2717-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Babur Ö, Gönen M, Aksoy BA, Schultz N, Ciriello G, Sander C, et al. Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations. Genome Biol. 2015;16:45. 10.1186/s13059-015-0612-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Fang G, Wang W, Paunic V, Heydari H, Costanzo M, Liu X, et al. Discovering genetic interactions bridging pathways in genome-wide association studies. Nat Commun. 2019;10:4274. 10.1038/s41467-019-12131-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480-484. 10.1093/nar/gkm882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Behjati S, Gilbertson RJ, Pfister SM. Maturation block in childhood cancer. Cancer Discov. 2021;11:542–4. 10.1158/2159-8290.CD-20-0926. [DOI] [PubMed] [Google Scholar]
  • 66.Chen X, Yang W, Roberts CWM, Zhang J. <article-title update="modified" original="Developmental origins shape the pediatric cancer genome">Developmental origins shape the paediatric cancer genome. Nat Rev Cancer. 2024;24:382–98. 10.1038/s41568-024-00684-9. [DOI] [PubMed] [Google Scholar]
  • 67.Brady SW, Liu Y, Ma X, Gout AM, Hagiwara K, Zhou X, et al. Pan-neuroblastoma analysis reveals age- and signature-associated driver alterations. Nat Commun. 2020;11:5183. 10.1038/s41467-020-18987-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Umeda M, Ma J, Westover T, Ni Y, Song G, Maciaszek JL, et al. A new genomic framework to categorize pediatric acute myeloid leukemia. Nat Genet. 2024;56:281–93. 10.1038/s41588-023-01640-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Blaizot A, Veettil SK, Saidoung P, Moreno-Garcia CF, Wiratunga N, Aceves-Martins M, et al. Using artificial intelligence methods for systematic review in health sciences: a systematic review. Res Synth Methods. 2022;13:353–62. 10.1002/jrsm.1553. [DOI] [PubMed] [Google Scholar]
  • 70.Fabiano N, Gupta A, Bhambra N, Luu B, Wong S, Maaz M, et al. How to optimize the systematic review process using AI tools. JCPP Adv. 2024;4:e12234. 10.1002/jcv2.12234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Bernard N, Sagawa Y Jr, Bier N, Lihoreau T, Pazart L, Tannou T. Using artificial intelligence for systematic review: the example of elicit. BMC Med Res Methodol. 2025;25:75. 10.1186/s12874-025-02528-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 2. (504.2KB, zip)

Data Availability Statement

All data are available with accession code/identifiers as indicated in Ethics approval section. All mutation data of TARGET and DKFZ datasets can be inspected in their respective portals [https://www.cancer.gov/ccg/](https://www.cancer.gov/ccg) and [https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?dscope=DKFZ\_PED&option=about\_dscope](https://hgserver1.amc.nl/cgi-bin/r2/main.cgi?dscope=DKFZ_PED&option=about_dscope). Code availability on: [https://doi.org/10.5281/zenodo.15881284](https://doi.org/10.5281/zenodo.15881284).


Articles from BMC Medical Genomics are provided here courtesy of BMC

RESOURCES