Abstract
Decomposing somatic mutation spectra into mutational signatures and their corresponding etiologies provides a powerful approach for investigating the mechanism of DNA damage and repair. Assessing microsatellite (in)stability (MSI/MSS) status and interpreting their clinical relevance in different malignancies offers significant diagnostic and prognostic value. However, little is known about microsatellite (in)stability and its interactions with other DNA repair mechanisms such as homologous recombination (HR) in different cancer types. Based on whole-genome/exome mutational signature analysis, we showed HR deficiency (HRd) and mismatch repair deficiency (MMRd) occur in a significantly mutually exclusive manner in stomach and colorectal adenocarcinomas. ID11 signature with currently unknown etiology was prevalent in MSS tumors, co-occurred with HRd and was mutually exclusive with MMRd. Apolipoprotein B mRNA editing enzyme, Catalytic polypeptide-like (APOBEC) signature co-occurred with HRd and was mutually exclusive with MMRd in stomach tumors. The HRd signature in MSS tumors and the MMRd signature in MSI tumors were the first or second dominant signatures wherever detected. HRd may drive a distinct subgroup of MSS tumors and lead to poor clinical outcome. These analyses offer insight into mutational signatures in MSI and MMS tumors and reveal opportunities for improved clinical diagnosis and personalized treatment of MSS tumors.
Subject terms: Cancer genomics, High-throughput screening
Introduction
Perturbation of cellular DNA damage response and repair systems leads to a high frequency of mutations1,2 and predisposes to development of cancer3. Emerging evidence indicates that inhibition of DNA repair pathways is a therapeutic option for targeted treatment of DNA repair-impaired cancers. Defects in one DNA repair pathway can be compensated by other pathways, suggesting that simultaneous defects in compensating pathways may result in synthetic lethality. Therefore, defects that occur in mutually exclusive patterns can be identified and employed for treatment of DNA repair-defective tumors4,5.
Homologous recombination (HR) is a multistep DNA repair process that is essential to the repair of DNA double-stranded breaks6. HR deficiency (HRd) is prevalent among various tumor types, especially in those of breast, ovaries, and pancreas, which as such are known as HRd-cancers7–9. HRd is a therapeutically actionable marker and a predictor of response to immunotherapy, chemotherapy and poly (ADP-ribose) polymerase inhibitor (PARPi) therapy10. Therefore, adequate assessment of HRd can improve outcome of such therapies. HRd is broadly defined from harboring deleterious alterations of HR pathway related genes to complex genomic scars11. Germline testing of BRCA1/2 bi-allelic inactivation is the current HRd assessment approach in clinic12. However, mechanisms of HRd extends beyond functional loss of BRCA1/2 indicating the need for more comprehensive evaluation approaches. To this end, HRD score was developed based on three independent factors: loss of heterozygosity (LOH)13, telomeric allelic imbalance (TAI)14, and large-scale state transitions (LST)15 to better indicate the extent of underlying genomic instability due to HR16. Discordant results of different HRd measuring approaches make patient stratification and therapeutic selection challenging11, which underscore the requirement for integrating efficient alternative measurement approaches.
The DNA mismatch repair (MMR) system plays an important role in maintaining genomic stability. The MMR system is primarily responsible for base-base mismatches and insertion/deletion mispaired nucleotides. Impairment of MMR limits correction of spontaneous mutations in microsatellites –i.e. DNA elements containing short repeating motifs– resulting in microsatellite instability (MSI) hypermutable tumor phenotype17. Accordingly, MSI-affected tumors may arise from a genetically or epigenetically perturbed MMR system. MSI-affected tumors, also known as MMR deficient (MMRd)-tumors, have been reported in diverse malignancies including colorectal, gastric and endometrial cancer etc., with varying MSI-positivity rates; colorectal and gastric cancers are among the most affected18–22.
Generally based on MSI status, colorectal and gastric cancers can be classified into two categories: (i) The microsatellite instability-high (MSI-H) group, which is caused by defects in the MMR system and accounts for ~15% of colorectal and ~10% of gastric tumors18,23. MSI-H tumors have a slightly better prognosis and do benefit from immune checkpoint blockade (ICB) therapy. (ii) The microsatellite stable (MSS) group, which exhibits chromosomal instability and accounts for the remaining ~85% and ~90% of colorectal and gastric tumors, respectively. MSS tumors are defined based on absence of MSI markers, are rarely sensitive to ICB therapy, and have limited treatment options24–26. Therefore, discovering the characteristics of MSS tumors and predicting their sensitivity to therapeutic agents is a need in the clinic today. Frequent silencing of HR and MMR pathways has been reported in gastrointestinal malignancies. In particular, colorectal cancer is known to have high mutational burden as well as high frequency of mutations in DNA damage and response pathway genes16. The presence and prevalence of HRd as well as its prognostic roles relative to levels of mismatch repair deficiency in colorectal and gastric tumors remains to be studied.
The total number of mutations presents in a tumor specimen is known as the tumor mutational burden (TMB) and has emerged as a novel therapeutic biomarker27,28. Factors contributing to high number of mutations, besides microsatellite instability, are not well-studied, and since TMB only represents the accumulation of somatic mutations, it does not provide evidence for underlying mechanisms. To fill this gap, mutational signatures that reflect both the patterns of mutations and their causative etiology can potentially elucidate relevant biological and mechanistic insights.
Diversity of somatic mutations can be decomposed into individual mutational signatures, describing patterns of mutagenesis that arise because of DNA damage and defective DNA repair processes. By considering the entire coding and non-coding catalogs, mutational signatures have become a powerful tool for identifying processes that generate somatic mutations during tumorigenesis in different cancer types29–31. In this study, we comprehensively investigated mutations occurring in colorectal and gastric tumors. This revealed distinct mutational signature profiles associated with clinical outcome in these two common MMRd-cancer types.
Methods
Mutation data source and cleaning
We accessed, cleaned, and reformatted Simple Somatic Mutation (SSM) files for whole-exome sequencing (WES) and whole-genome sequencing (WGS) of STAD and COAD samples from the data portal of the International Cancer Genome Consortium (ICGC, https://dcc.icgc.org/projects). In brief, Mutations including Single-Base Substitutions (SBSs), Double-Base Substitutions (DBSs) and small Insertions/Deletions (IDs) were extracted from SSM files, converted to mutation matrices and utilized for subsequent analyses. The raw data is publicly accessible via ICGC portal using each unique Sample ID. Information regarding the entire sample set and processed data are provided in Supplementary Table S1(A–X).
The MSI-H/MSS status for WES of stomach adenocarcinoma (STAD) tumors were retrieved and quality-checked for MSI-H (n = 25) and MSS (n = 73); and colorectal adenocarcinoma (COAD) tumors for MSI-H (n = 42) and MSS (n = 164) from two independent sources32,33. We used the samples that had consistent and reliable MSI-H and MSS status available. The SSM files for the WGS data of 75 STAD and 90 COAD tumors from the ICGC portal were obtained similarly.
HRd, LOH, TAI and LST scores were retrieved from16 (https://gdc.cancer.gov/about-data/publications/PanCan-DDR-2018).
Mutational signatures analysis
Using GRCh37 and Catalogue of Somatic Mutations in Cancer (COSMIC, V3.1) (http://cancer.sanger.ac.uk/cosmic/signatures), a non-Negative Matrix Factorization (NMF)-based de novo mutational signatures analysis34,35 (https://github.com/alexandrovlab) were run over WES and WGS of tumor samples.
In addition, the computational tool SigMA36 using hg19 was applied for confirmatory mutational signature analysis. SigMA uses a multivariate approach to accurately detect the mutational signature associated with HR deficiency (SBS3) from WGS, WES and targeted gene panels, even from low mutation counts.
The first and second dominant signatures were extracted and visualized based on their contributions over WES /WGS data of STAD and COAD tumors. R packages of “car”, “ComplexHeatmap”, “vioplot”,“circlize”, and “ggplot”37–39 were used for visualization. Consensus molecular subtype (CMS) for COAD tumors were annotated as described in40,41.
Survival analysis
Progression-free survival (PFS) and overall survival (OS) data were downloaded from cbioportal (https://www.cbioportal.org/). R packages “Survminer” and “survival”42 were used for survival analysis.
Statistical analyses
P values were calculated as appropriate for continuous or categorical variables using Mann Whitney U test, Hypergeometric test or Fisher’s exact test were used. All analyses were conducted in the R statistical environment (R version 4.0.0 http://www.r-project.org/). All reported P values were two tailed; ≤0.05 was considered significant.
Results
We performed mutational signature analyses using two independent and complementary methods. First, we used a NMF-based de novo mutational signature detection approach, decomposed by the COSMIC, V3.1, and analyzed SBSs, IDs, and DBSs detected by WES of MSI-H (n = 25) and MSS (n = 73) STAD, and MSI-H (n = 42) and MSS (n = 164) COAD tumors as well as WGS of STAD (n = 75) and COAD (n = 90) samples.
Mutational signatures of MSI-H exomes reveal MMRd signatures
We detected MMRd SBS signatures in all MSI-H STAD tumors, including SBS15, SBS21, SBS26 and SBS44. The only other signatures in these tumors were SBS1 and SBS5 (Fig. 1a). Similarly, we detected MMRd signatures, including SBS6, SBS15, SBS26 and SBS44, in all but one of the MSI-H COAD tumors, in addition to SBS1 and SBS5 signatures (Fig. 1b). SBS26 in MSI-H STAD and SBS6 MSI-H COAD tumors, had the highest TMB with a median of >10 somatic mutations per Megabase (Fig. 2). MMRd SBS signature had significantly higher contribution in MSI-H compared to MSS tumors in both STAD and COAD (Fig. 2e, left). In contrast, the HRd signature(SBS3) had a significantly higher contribution in MSS tumors compared to MSI-H tumors in both STAD and COAD (Fig. 2e, right). The distribution shapes of MMRd signatures in MSI-H STAD and MSI-H COAD tumors are compared as density plots in (Fig. 2f). The average contribution value of MMRd signature was 0.51 (range: 0.30–0.97) in MSI-H STAD compared to 0.59 (range: 0.33–0.93) in MSI-H COAD tumors.
We detected the ID1 signature in all MSI-H STAD and COAD tumors with a median TMB of ~1 somatic mutation per Megabase (range: 0.1–10), one-tenth of the TMB measured for the SBS signatures (range: 0.1–100) (Fig. 2). Replication slippage has been suggested as the etiology of ID131.
Both MSI-H STAD and MSI-H COAD tumors showed DBS78A. In MSI-H STAD tumors, DBS78A had cosine similarity of 0.57 with COSMIC’s DBS4 and DBS11. In MSI-H COAD tumors, DBS78A had cosine similarity of 0.75 with DBS4 and DBS10. According to31, etiology of DBS10 and DBS11 signatures is associated with MMRd and, respectively, Apolipoprotein B mRNA editing enzyme, Catalytic polypeptide-like (APOBEC); while the etiology of DBS4 is unknown (Figs. 1, 2). DBS signatures are difficult to interpret because they have relatively low mutation counts, and based on our results, their pattern might be shaped by a combination of different etiologies. Supplementary Table S1.
Mutational signatures of MSS exomes reveals distinct group of patients with HRd signature
In 23% of MSS STAD and 21% of MSS COAD tumors, we detected the SBS3 signature, which is caused by HRd8,31 (Fig. 1a,b). In both MSS STAD and MSS COAD tumors, SBS3 had the highest TMB with medians of >1 somatic mutation per Megabase (Fig. 2b,d). The average contribution value of HRd signature was 0.57 (range: 0.33–0.94) in MSS STAD compared to 0.46 (range: 0.33–0.77) in MSS COAD tumors (Fig. 2e,f).
Mutational signatures in the MSS STAD tumors without SBS3 consisted of SBS17 whose etiology is unknown (Fig. 1a). In the MSS COAD tumors without SBS3, a subset of samples was dominated by the SBS10a, b (POLE) mutation signatures. In 24% of MSS COAD tumors, we detected the SBS15 signature, which is associated with the MMRd etiology (Fig. 1b).
MSS STAD tumors had ID1 and ID11 as well as ID16 and ID17 while MSS COAD tumors only had ID1 and ID11. Although the etiology of the ID1 signature is known as replication slippage, etiologies of ID11, ID16, and ID17 remain unknown (Fig. 1a,b). Recently, defects in topoisomerase without defective mismatch repair is proposed as the etiology of ID1743. Our data also supports that ID17 is only detected in a subset of MSS but not in MSI tumors.
MSS STAD tumors showed DBS2, DBS4, and DBS9, whereas MSS COAD tumors showed DBS1, DBS2, DBS4 and DBS8. The etiologies of these signatures remain unknown (Fig. 1a,b). The TMB range was 0.01–10 and 0.01–100 per SBS signature in MSS STAD and MSS COAD tumors, respectively. In contrast, the TMB was 0.01–1 per ID signature. The TMB range was 0.01–1 per DBS signature for both MSS STAD and MSS COAD tumors (Fig. 2b,d) Supplementary Table S1.
Dominant signatures distinguish MSS and MSI-H tumors
All cases of MSI-H STAD tumors but one had MMRd as their first or second dominant signature. MMRd SBS signatures were the first dominant signature in 76%, and the second dominant signature in 20% of MSI-H STAD tumors. In 24% of MSI-H STAD tumors in which MMRd signatures were not the first, SBS1 and SBS5 were the first and/or second dominant signatures (Fig. 3a).
In 23% of MSS STAD tumors, HRd was the first dominant signature; the remaining tumors had SBS1, SBS5, and SBS17 as their first dominant signature. SBS1, SBS5, SBS2 and 13 (APOBEC), and SBS17 signatures were second dominant signatures of MSS STAD tumors. Of note, wherever present, HRd was the first dominant signature in all cases of MSS STAD tumors (Fig. 3b).
MSI-H STAD tumors had only ID1 as the dominant ID signature. MSS STAD tumors had ID1, ID11, ID16, and ID17 as the first dominant ID signatures, and ID1 and ID11 as second dominant ID signatures (Supplementary Fig. S1a). Considering the number of SBS and ID signatures harbored, MSS STAD tumors were more heterogeneous compared to MSI-H STAD tumors (Fig. 3a,b).
MSI-H STAD tumors had only DBS78A (Cosine similarity of 0.57 with DBS4 and DBS11) as their DBS dominant signature. MSS STAD tumors had DBS2, DBS4, and DBS9 as their first dominant and DBS4 as their second dominant DBS signatures (Supplementary Fig. S1b).
Consistent with the pattern observed in STAD tumors, MMRd signatures were the first dominant SBS signature in 88%, and the second dominant signature in the remaining MSI-H COAD tumors. In 12% of MSI-H COAD tumors in which MMRd signatures were not the first dominant, SBS1 (aging) was the first dominant SBS signature. The second dominant signatures of COAD tumors were SBS1, SBS5, and MMRd. In all cases of MSI-H COAD tumors, MMRd was the first or second dominant SBS signature, highlighting the ability of NMF-based de novo mutation signature analysis to detect the MMRd status of MSI-H tumors (Fig. 3a,b).
HRd (SBS3) was the first dominant signature in 16.5% of MSS COAD tumors; SBS1, SBS5, and SBS10a, b (POLE) were the first dominant in the remaining. HRd was the second dominant SBS signatures in 4.3% of MSS COAD tumors with SBS1, SBS5 and MMRd in the remaining. HRd, wherever presented, had the highest TMB and was the first or second dominant signature among MSS COAD tumors (Fig. 3a,b).
As noted earlier, MSI-H COAD tumors had only ID1 as the dominant ID signature. MSS COAD tumors had ID1 and ID11 as the first and second dominant signature (Supplementary Fig. S1c). Considering the number of SBS and ID signatures harbored, MSS COAD tumors were more heterogeneous than MSI-H COAD tumors, similar to MSS STAD tumors (Fig. 3).
MSI-H COAD tumors had only DBS78A (cosine similarity of 0.75 with DBS4 and DBS10) as their dominant signature. However, MSS COAD tumors had DBS1, DBS2, DBS4, DBS8 as the first dominant and DBS1, DBS2, DBS4 as the second dominant signatures (Supplementary Fig. S1d).
Association analysis between CMS of COAD tumors and mutation signatures showed CMS1 and CMS2 were significantly enriched in MSI-H.MMRd and MSS-HRd tumors respectively (Fisher’s exact test, p = < 0.01e-3) (Supplementary Fig. S1e).
Mutational signatures of STAD and COAD WES tumors reveal mutual exclusivity of MMRd and HRd signatures
In both STAD and COAD WES tumors, MMRd and HRd signatures occurred in a mutually exclusive manner. In STAD WES tumors, like HRd, the APOBEC signatures were mutually exclusive with MMRd. In STAD tumors, DBS9 was mutually exclusive with MMRd and co-occurred with SBS17. In addition, SBS17 was mutually exclusive with MMRd signatures in STAD WES tumors. Notably, ID11 with unknown etiology was mutually exclusive with MMRd in both tumors while co-occurred with the HRd(SBS3) in COAD. Similarly, DBS4 and 8 co-occurred with the HRd(SBS3) in COAD (Fig. 3c,d).
MSS tumors with HRd had the poorest patient outcome and showed higher HRd scores
Consistent with previous reports that MSI-H status is associated with slightly improved patient outcome44, we did not observe significant differences in progression-free survival (PFS) of MSI-H and MSS tumors (Fig. 4a). Median PFS times were 19 months for MSI-H and 18 months for MSS tumors. PFS rates at 20 months were 43% for MSI and 40% for MSS tumors (Fig. 4b). However, we did find survival differences between MSI/MSS status when differentiating according to MMRd and HRd signatures. Figure 4b indicates significantly higher likelihood of PFS for MSI-H tumors with MMRd signatures compared to MSS tumors with the HRd signature, at least within the first five years after tumor excision. PFS at 20 months were 43% for MSI-H tumors with MMRd compared to only 28% for MSS tumors with HRd. Median PFS times for MSI-H tumors with MMRd, MSS tumors with HRd and other tumors were 19,13,18 months respectively. (Fig. 4b). OS data were consistent with PFS (Fig. 4c,d) (Supplementary Fig. S2). Because there were few survivors at 60 months, the uncertainly ranges prevent comparison of data beyond 5 years.
To further confirm the HRd status of identified MSS tumors with HRd signature, we compared HRd, LOH, TAI, LST scores which are the measures of genomic instability. In both STAD and COAD, MSS tumors with HRd had significantly higher HRd, LOH, TAI, LST scores compared to MMRd tumors and the rest of tumors (Fig. 4e–h) (Supplementary Fig. S2c–h).
Mutational signature multivariate analysis of MSS and MSI-H tumors portrays HRd subgroup in MSS tumors
As a second approach, and to further confirm the results of NMF-based de novo mutation signatures, we used a complimentary, multivariate method for extracting SBS mutational signature according to COSMIC V2.0.
Consistent with de novo mutation signatures, this approach confirmed the presence of a subgroup of MSS tumors that exhibit the HRd signature and verified the absence of the HRd signature in MSI-H tumors for both STAD and COAD. We also further confirmed that HRd was highly prevalent and was detected as dominant in 27% of STAD MSS, and in 14% of COAD MSS tumors (Supplementary Figs. S3, S4). These results suggested that MMRd and HRd occur in a mutually exclusive manner with no overlap between groups. We also detected SBS18 as a Reactive Oxygen Species (ROS) etiology signature in MSS tumors of both STAD and COAD. It was not detected by de novo mutational signature analysis. Interestingly, we observed the POLH –i.e. SBS9 (AID)– signature in a few MSS STAD tumors. MSI-H tumors showed only SBS1, SBS5, and MMRd signatures while MSS tumors showed a combination of SBS1, SBS5, HRd, and ROS signatures. In both STAD and COAD, heterogeneity of signatures in MSS tumors was higher than in MSI-H tumors. Multivariate analysis also confirmed the pattern of co-occurrence and exclusiveness among HRd, APOBEC and MMRd (Supplementary Figs. S3b, S4b).
Analysis of WGS data reveals presence of distinct groups of MMRd and HRd
Next, we applied the mutational signature analyses described above to the WGS data from STAD and COAD tumors. Twenty percent of STAD tumors had MMRd SBS signature including SBS15, SBS26, and SBS44. A 12% (9 out of 75) subset of STAD tumors had SBS3 as the HRd signature and 24% had only SBS1 and SBS5 signatures. The remaining 44%, in addition to SBS1 and SBS5, mainly exhibited a combination of ROS, SBS28, and SBS2,13 (APOBEC). SBS44 followed by SBS28, SBS18 (ROS), and SBS17 showed the highest TMB (Fig. 5). Colibactin genotoxin has been suggested as a possible etiology of SBS2845.
These analyses also showed that 27% of COAD tumors had MMRd signatures, including SBS15 and SBS20. A 7% (6 out of 90) subset of COAD tumors had the HRd signature, 20% showed the SBS10a, b (POLE) signature, while the rest mainly showed a combination of SBS1, SBS5, SBS40. SBS54 followed by SBS 40 and SBS58 had the highest TMB (Fig. 6).
Both STAD and COAD tumors had a group of cases (11%) with the ID7 (MMRd) signature, which also had the highest ID TMB (Fig. 6, Fig. 6). ID1, ID2, ID3 were present in both COAD and STAD. We also detected ID5 with unknown etiology in STAD tumors. Interestingly we detected ID8, a marker of non-homologous end joining (NHEJ), in these tumors further highlighting the role of HRd in STAD (Figs. 5, 6).
STAD tumors showed combination of DBS2, DBS4, DBS6, DBS9, DBS7 (MMR), and DBS5 associated with prior platinum therapy (PT). DBS4 had the highest TMB. COAD tumors showed DBS78B with cosine similarity <0.5 with COSMIC’s DBS7 and DBS11. The proposed etiology of DBS7 and DBS10 are MMRd (Figs. 5, 6).
Dominant signature analysis of WGS results also confirmed the presence of the HRd subgroup with patterns that were consistent with WES data. Similar to WES, SBS17 was mutually exclusive with MMRd signatures in STAD WGS tumors. In addition, DBS10 and SBS54 co-occurred in COAD WGS tumors, suggesting possible MMRd etiology (Figs. 5c,d, 6c,d, Supplementary Fig. S5).
Signature multivariate analysis of WGS data confirms MMRd and HRd groups
We also applied a complimentary multivariate method to extract SBS signatures, as described above. We detected and distinguished tumors with MMRd and HRd signatures in an unsupervised manner and without the knowledge of the samples’ MSI or MSS molecular subtypes. In both STAD and COAD, MMRd and HRd signatures were mutually exclusive. MMRd signatures were dominant among 8% of STAD and 26% of COAD tumors, while HRd signatures was dominant among 24% of STAD and 7% of COAD tumors (Supplementary Figs. S6, S7).
Consistent with de novo mutation signatures, in both STAD and COAD, mutational signature heterogeneity of signatures in the subset of tumors with MMRd was lower than the rest of tumors. Moreover, we detected the SBS18(ROS) signature using the multivariate method in both STAD and COAD; and observed POLH, i.e. SBS9(AID) signature in a subset of STAD tumors Notably, multivariate analysis of WGS tumors also confirmed the pattern of co-occurrence and exclusiveness among HRd, APOBEC and MMRd (Supplementary Figs. S6, S7).
Discussion
We uncovered possible diagnostic and prognostic roles of mutational signatures as a biomarker for stratification of HRd in MSS tumors. Our findings provide relevant biological insights regarding the relationship between the tumors’ MSI/MSS status and presence of HRd. We found that MSS tumors had a greater degree of heterogeneity in their mutational signatures compared to MSI-H tumors. We showed that MSI-H tumors harbored SBS signatures that faithfully reflected a strong fingerprint of MMRd, which dominated the mutational signature spectra, and suggested selective advantages of these signatures and their possible driver role in shaping tumors mutational landscape. Notably, our findings revealed mutual exclusivity of MMRd and HRd mutational signatures in colorectal and stomach cancers, which is consistent with previous report on gynecological malignancies20.
Although there are appropriate targeted therapies available for MSI-H tumors, treatment options for patients with MSS tumors, which constitute the majority of COAD and STAD cases, have been limited. This is largely due to the fact that MSS tumors are highly heterogeneous, and their genetic and molecular characteristics remain to be fully characterized. To prevent patients from receiving incorrect treatment or missing out on possible treatment opportunities, development of better biomarkers and therapies for MSS patients is necessary. HRd targeted therapy with PARPi is mostly used for treating breast and ovarian cancers12, and was recently approved by the US Food and Drug Administration for treatment of pancreatic cancers46. Our results strongly suggest that the presence of HRd signature can be used as actionable marker to identify a distinct subset of MSS STAD and COAD tumors which may benefit from PARPi therapy. Additionally, as several studies reported HRd patients with diverse cancer types are sensitive to platinum-based chemotherapies47,48, a distinct subset of COAD and STAD MSS tumors with HRd mutational signature may also benefit for this therapeutic regimen.
Previous studies reported that WES can detect clonal mutational signatures that are active in the majority of cancer cells, while WGS is required to extract subclonal mutational signatures49,50. Accordingly, we propose that ID11 and SBS6, which have relatively high TMB and are observed in WES data, are clonal. In contrast, ID7 and SBS54, which also have high TMB, co-occur with other MMRd signatures, and are commonly detected in WGS data, may be active subclonally. WES may not have the power and/or the scope to detect such signatures. Accordingly, it is reasonable to suggest that SBS18, which is an effective signature present in STAD tumors, occurs mainly across the whole genome. Collectively, these results suggest that both WGS and WES can generate reliable mutational signature profiles while WGS data are superior for detecting particular mutational signatures that may affect the non-coding more than the coding genome. Therefore, the appropriate input data for mutational signature analysis can be selected depending on the purpose of experiments.
Extraction of SBS3, 5, 8, and 40, which are known as flat signatures, by NMF-based de novo method is often mathematically challenging31. Our analyses using SigMA36, which is based on a multivariate analysis, showed consistent results for both methodologies and demonstrated that HRd and MMRd signatures were robustly detected by either approach. Consistency of HRd scores with the mutational signature results further confirms presence of a subset of MSS tumors with HRd signature. Unlike our study which considers the entire coding and non-coding mutations to stratify tumors using mutation signature profiles, a recent study employed mutation in limited numbers of preselected HR genes to define HRd status of tumors51. They reported HRd tumors to be more frequent among MSI than MSS tumors. Their data contradicts ours and several other comprehensive studies16,20. Moretto et al. using colorectal tumors without normal match samples could not differentiate biallelic/monoallelic and germline/somatic alterations which leaded to overestimation of HRd status and mis-annotation of a high percentage of MSI tumors as HRd. Several studies have demonstrated that not all HR gene mutations result in identical consequences and only biallelic inactivation of many HR genes are functional and reflect underlying DNA-repair deficiency16,52,53. Therefore, simple annotation of tumors based on presence or absence of mutations without differentiating biallelic/monoallelic and deleterious/passenger alterations leads to erroneous prediction of HRd status.
Notably, we detected SBS9 in STAD but not in COAD tumors. SBS9 is characterized by patterns of mutations that contribute to POLH i.e. AID. POLH encodes for a specialized error-prone polymerase which promotes somatic hypermutation in the variable regions of immunoglobulin genes. Dysregulation and mistargeting of POLH can compromise genome integrity54. POLH signature has been reported in different cancer types55,56, but has been mainly studied in hematological malignancies57. Helicobacter pylori infection has also been suggested to trigger aberrant expression of AID, which induces mutations that may lead to gastric carcinogenesis58. Clinical and biological roles of the POLH signature in STAD tumors, and whether there is an association between Helicobacter pylori infection and the POLH signature remains to be studies.
Several studies have demonstrated that SBS mutational signatures may serve as prognostic or predictive biomarkers across different cancer types9,59,60. Collectively, we propose that the understanding of the mechanistic basis of mutational signatures, as well as their etiology, improves cancer diagnosis and holds prognostic value. Effective determination of MSI/MSS as well as HRd status in different cancer types including gastric and colorectal tumors may improve selection of appropriate therapy with implications for specialized management of patients. We believe further studies are necessary to fully investigate the biological and clinical impact of MMRd and HRd in all malignancies.
Taken together, we presented complementary mutational signature analyses of stomach and colorectal adenocarcinomas using WES and WGS data. We showed that deficiency in HR and MMR result in mutually exclusive mutational signatures. Mutational signatures in MSI tumors were dominated by those caused by MMR deficiency; while mutational signatures in MSS tumors were diverse and identified a distinct group of HR-deficient tumors with the poorest patient outcome. Etiologies detected by our mutational signature analysis, including the prevalence of HR-deficiency, provide a means for selection of therapeutic targets and increasing positive outcomes for cancer patients.
Supplementary information
Acknowledgements
This work was supported by a “Kakenhi” grant from the Japan Society for the Promotion of Science (17K15044) and Tokyo Biochemical Research Foundation (TBRF-RF 170-104). A.F. was recipient of Otsuka Toshimi Scholarship Foundation.
Author contributions
A.F. and S.F. conceptualized and designed the study, developed the methodology, performed all analyses, and interpreted the results. R.K. contributed to medical and biological interpretation of the results. S.F. supervised the study. All authors wrote, reviewed, and approved the manuscript.
Data availability
All data generated or analyzed during this study are included in this published article and its supplementary information files. Supplementary table S1 is deposited on Figshare (10.6084/m9.figshare.22818080)61. All R packages and scripts used for the analyses are available publicly as described in the methods section. ICGC dataset is available at https://dcc.icgc.org/projects,
Code availability
No custom code was used.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Amir Farmanbar, Sanaz Firouzi.
Supplementary information
The online version contains supplementary material available at 10.1038/s41597-023-02331-8.
References
- 1.Kunkel TA, Erie DA. DNA mismatch repair. Annual review of biochemistry. 2005;74:681–710. doi: 10.1146/annurev.biochem.74.082803.133243. [DOI] [PubMed] [Google Scholar]
- 2.Reha-Krantz LJ. DNA polymerase proofreading: Multiple roles maintain genome stability. Biochimica et biophysica acta. 2010;1804:1049–1063. doi: 10.1016/j.bbapap.2009.06.012. [DOI] [PubMed] [Google Scholar]
- 3.Loeb LA. Human cancers express mutator phenotypes: origin, consequences and targeting. Nature reviews. Cancer. 2011;11:450–457. doi: 10.1038/nrc3063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Helleday T. Amplifying tumour-specific replication lesions by DNA repair inhibitors - a new era in targeted cancer therapy. European journal of cancer. 2008;44:921–927. doi: 10.1016/j.ejca.2008.02.044. [DOI] [PubMed] [Google Scholar]
- 5.Ceccaldi R, Rondinelli B, D’Andrea AD. Repair Pathway Choices and Consequences at the Double-Strand Break. Trends in cell biology. 2016;26:52–64. doi: 10.1016/j.tcb.2015.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tham KC, Kanaar R, Lebbink JHG. Mismatch repair and homeologous recombination. DNA repair. 2016;38:75–83. doi: 10.1016/j.dnarep.2015.11.010. [DOI] [PubMed] [Google Scholar]
- 7.Heeke, A. L. et al. Prevalence of Homologous Recombination-Related Gene Mutations Across Multiple Cancer Types. JCO precision oncology2018, 10.1200/PO.17.00286 (2018). [DOI] [PMC free article] [PubMed]
- 8.Polak P, et al. A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer. Nature genetics. 2017;49:1476–1486. doi: 10.1038/ng.3934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Davies H, et al. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nature medicine. 2017;23:517–525. doi: 10.1038/nm.4292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pellegrino B, et al. Homologous Recombination Repair Deficiency and the Immune Response in Breast Cancer: A Literature Review. Translational oncology. 2020;13:410–422. doi: 10.1016/j.tranon.2019.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Stewart MD, et al. Homologous Recombination Deficiency: Concepts, Definitions, and Assays. The Oncologist. 2022;27:167–174. doi: 10.1093/oncolo/oyab053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hoppe MM, Sundar R, Tan DSP, Jeyasekharan AD. Biomarkers for Homologous Recombination Deficiency in Cancer. Journal of the National Cancer Institute. 2018;110:704–713. doi: 10.1093/jnci/djy085. [DOI] [PubMed] [Google Scholar]
- 13.Abkevich V, et al. Patterns of genomic loss of heterozygosity predict homologous recombination repair defects in epithelial ovarian cancer. British Journal of Cancer. 2012;107:1776–1782. doi: 10.1038/bjc.2012.451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Birkbak NJ, et al. Telomeric Allelic Imbalance Indicates Defective DNA Repair and Sensitivity to DNA-Damaging Agents. Cancer discovery. 2012;2:366–375. doi: 10.1158/2159-8290.cd-11-0206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Popova T, et al. Ploidy and Large-Scale Genomic Instability Consistently Identify Basal-like Breast Carcinomas with BRCA1/2 Inactivation. Cancer Research. 2012;72:5454–5462. doi: 10.1158/0008-5472.can-12-1470. [DOI] [PubMed] [Google Scholar]
- 16.Knijnenburg TA, et al. Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas. Cell Reports. 2018;23:239–254.e236. doi: 10.1016/j.celrep.2018.03.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Boland CR, et al. A National Cancer Institute Workshop on Microsatellite Instability for cancer detection and familial predisposition: development of international criteria for the determination of microsatellite instability in colorectal cancer. Cancer research. 1998;58:5248–5257. [PubMed] [Google Scholar]
- 18.Vilar E, Gruber SB. Microsatellite instability in colorectal cancer-the stable evidence. Nature reviews. Clinical oncology. 2010;7:153–162. doi: 10.1038/nrclinonc.2009.237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Maruvka YE, et al. Analysis of somatic microsatellite indels identifies driver events in human tumors. Nature biotechnology. 2017;35:951–959. doi: 10.1038/nbt.3966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Farmanbar A, Firouzi S, Kneller R, Khiabanian H. Mutational signatures reveal ternary relationships between homologous recombination repair, APOBEC, and mismatch repair in gynecological cancers. Journal of translational medicine. 2022;20:65. doi: 10.1186/s12967-022-03259-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bonneville, R. et al. Landscape of Microsatellite Instability Across 39 Cancer Types. JCO precision oncology, 1–15, 10.1200/PO.17.00073 (2017). [DOI] [PMC free article] [PubMed]
- 22.Hause RJ, Pritchard CC, Shendure J, Salipante SJ. Classification and characterization of microsatellite instability across 18 cancer types. Nat Med. 2016;22:1342–1350. doi: 10.1038/nm.4191. [DOI] [PubMed] [Google Scholar]
- 23.Wang K, et al. Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer. Nature genetics. 2014;46:573–582. doi: 10.1038/ng.2983. [DOI] [PubMed] [Google Scholar]
- 24.Cancer Genome Atlas, N. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487:330–337. doi: 10.1038/nature11252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sargent DJ, et al. Defective mismatch repair as a predictive marker for lack of efficacy of fluorouracil-based adjuvant therapy in colon cancer. Journal of clinical oncology: official journal of the American Society of Clinical Oncology. 2010;28:3219–3226. doi: 10.1200/JCO.2009.27.1825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Samowitz WS, et al. Poor survival associated with the BRAF V600E mutation in microsatellite-stable colon cancers. Cancer research. 2005;65:6063–6069. doi: 10.1158/0008-5472.CAN-05-0404. [DOI] [PubMed] [Google Scholar]
- 27.Yarchoan M, Hopkins A, Jaffee EM. Tumor Mutational Burden and Response Rate to PD-1 Inhibition. The New England journal of medicine. 2017;377:2500–2501. doi: 10.1056/NEJMc1713444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chalmers ZR, et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome medicine. 2017;9:34. doi: 10.1186/s13073-017-0424-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Alexandrov LB, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Helleday T, Eshtad S, Nik-Zainal S. Mechanisms underlying mutational signatures in human cancers. Nature reviews. Genetics. 2014;15:585–598. doi: 10.1038/nrg3729. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Alexandrov LB, et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578:94–101. doi: 10.1038/s41586-020-1943-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kim TM, Laird PW, Park PJ. The landscape of microsatellite instability in colorectal and endometrial cancer genomes. Cell. 2013;155:858–868. doi: 10.1016/j.cell.2013.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Cortes-Ciriano I, Lee S, Park WY, Kim TM, Park PJ. A molecular portrait of microsatellite instability across multiple cancers. Nature communications. 2017;8:15180. doi: 10.1038/ncomms15180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bergstrom EN, et al. SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events. BMC genomics. 2019;20:685. doi: 10.1186/s12864-019-6041-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Islam SMA, et al. Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor. Cell Genomics. 2022;2:100179. doi: 10.1016/j.xgen.2022.100179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gulhan DC, Lee JJ, Melloni GEM, Cortes-Ciriano I, Park PJ. Detecting the mutational signature of homologous recombination deficiency in clinical samples. Nature genetics. 2019;51:912–919. doi: 10.1038/s41588-019-0390-2. [DOI] [PubMed] [Google Scholar]
- 37.Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32:2847–2849. doi: 10.1093/bioinformatics/btw313. [DOI] [PubMed] [Google Scholar]
- 38.Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize Implements and enhances circular visualization in R. Bioinformatics. 2014;30:2811–2812. doi: 10.1093/bioinformatics/btu393. [DOI] [PubMed] [Google Scholar]
- 39.Adler, D. & Thomas Kelly, T. vioplot: violin plot. (2020).
- 40.Guinney J, et al. The consensus molecular subtypes of colorectal cancer. Nature medicine. 2015;21:1350–1356. doi: 10.1038/nm.3967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hu, F. et al. Comprehensive Analysis of Subtype-Specific Molecular Characteristics of Colon Cancer: Specific Genes, Driver Genes, Signaling Pathways, and Immunotherapy Responses. Frontiers in Cell and Developmental Biology9, 10.3389/fcell.2021.758776 (2021). [DOI] [PMC free article] [PubMed]
- 42.Therneau, T. M. & Grambsch, P. M. Modeling survival data: extending the Cox model. (Springer, 2000).
- 43.Stantial N, et al. Trapped topoisomerase II initiates formation of de novo duplications via the nonhomologous end-joining pathway in yeast. Proceedings of the National Academy of Sciences of the United States of America. 2020;117:26876–26884. doi: 10.1073/pnas.2008721117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Samowitz WS, et al. Microsatellite instability in sporadic colon cancer is associated with an improved prognosis at the population level. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2001;10:917–923. [PubMed] [Google Scholar]
- 45.Dziubanska-Kusibab PJ, et al. Colibactin DNA-damage signature indicates mutational impact in colorectal cancer. Nature medicine. 2020;26:1063–1069. doi: 10.1038/s41591-020-0908-2. [DOI] [PubMed] [Google Scholar]
- 46.Golan T, et al. Maintenance Olaparib for Germline BRCA-Mutated Metastatic Pancreatic Cancer. The New England journal of medicine. 2019;381:317–327. doi: 10.1056/NEJMoa1903387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhao EY, et al. Homologous Recombination Deficiency and Platinum-Based Therapy Outcomes in Advanced Breast Cancer. Clinical cancer research: an official journal of the American Association for Cancer Research. 2017;23:7521–7530. doi: 10.1158/1078-0432.CCR-17-1941. [DOI] [PubMed] [Google Scholar]
- 48.Waddell N, et al. Whole genomes redefine the mutational landscape of pancreatic cancer. Nature. 2015;518:495–501. doi: 10.1038/nature14169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Davies H, et al. Whole-Genome Sequencing Reveals Breast Cancers with Mismatch Repair Deficiency. Cancer research. 2017;77:4755–4762. doi: 10.1158/0008-5472.CAN-17-1083. [DOI] [PubMed] [Google Scholar]
- 50.Nik-Zainal S, et al. The life history of 21 breast cancers. Cell. 2012;149:994–1007. doi: 10.1016/j.cell.2012.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Moretto R, et al. Homologous Recombination Deficiency Alterations in Colorectal Cancer: Clinical, Molecular, and Prognostic Implications. JNCI: Journal of the National Cancer Institute. 2021;114:271–279. doi: 10.1093/jnci/djab169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Reid S, et al. Biallelic mutations in PALB2 cause Fanconi anemia subtype FA-N and predispose to childhood cancer. Nature Genetics. 2007;39:162–164. doi: 10.1038/ng1947. [DOI] [PubMed] [Google Scholar]
- 53.Nguyen, L. & W. M. Martens, J. Pan-cancer landscape of homologous recombination deficiency. 11, 5584, 10.1038/s41467-020-19406-4 (2020). [DOI] [PMC free article] [PubMed]
- 54.Rogozin IB, et al. Mutational signatures and mutable motifs in cancer genomes. Briefings in bioinformatics. 2018;19:1085–1101. doi: 10.1093/bib/bbx049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Kasar S, et al. Whole-genome sequencing reveals activation-induced cytidine deaminase signatures during indolent chronic lymphocytic leukaemia evolution. Nature communications. 2015;6:8866. doi: 10.1038/ncomms9866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rogozin IB, et al. DNA polymerase eta mutational signatures are found in a variety of different types of cancer. Cell cycle. 2018;17:348–355. doi: 10.1080/15384101.2017.1404208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Chapuy B, et al. Molecular subtypes of diffuse large B cell lymphoma are associated with distinct pathogenic mechanisms and outcomes. Nature medicine. 2018;24:679–690. doi: 10.1038/s41591-018-0016-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Matsumoto Y, et al. Helicobacter pylori infection triggers aberrant expression of activation-induced cytidine deaminase in gastric epithelium. Nature medicine. 2007;13:470–476. doi: 10.1038/nm1566. [DOI] [PubMed] [Google Scholar]
- 59.Trucco LD, et al. Ultraviolet radiation-induced DNA damage is prognostic for outcome in melanoma. Nature medicine. 2019;25:221–224. doi: 10.1038/s41591-018-0265-6. [DOI] [PubMed] [Google Scholar]
- 60.Alexandrov LB, Nik-Zainal S, Siu HC, Leung SY, Stratton MR. A mutational signature in gastric cancer suggests therapeutic strategies. Nature communications. 2015;6:8683. doi: 10.1038/ncomms9683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Farmanbar A, Kneller R, Firouzi S. 2023. Mutational signatures reveal mutual exclusivity of homologous recombination deficiency and mismatch repair deficiency in colorectal and stomach tumors, guiding improved clinical diagnosis and possible personalized treatment. figshare. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Farmanbar A, Kneller R, Firouzi S. 2023. Mutational signatures reveal mutual exclusivity of homologous recombination deficiency and mismatch repair deficiency in colorectal and stomach tumors, guiding improved clinical diagnosis and possible personalized treatment. figshare. [DOI] [PMC free article] [PubMed]
Supplementary Materials
Data Availability Statement
All data generated or analyzed during this study are included in this published article and its supplementary information files. Supplementary table S1 is deposited on Figshare (10.6084/m9.figshare.22818080)61. All R packages and scripts used for the analyses are available publicly as described in the methods section. ICGC dataset is available at https://dcc.icgc.org/projects,
No custom code was used.