Abstract
All cancers originate from a single cell that starts to behave abnormally due to the acquired somatic mutations in its genome. Until recently, the knowledge of the mutational processes that cause these somatic mutations has been very limited. Recent advances in sequencing technologies and the development of novel mathematical approaches have allowed deciphering the patterns of somatic mutations caused by different mutational processes. Here, we summarize our current understanding of mutational patterns and mutational signatures in light of both the somatic cell paradigm of cancer research and the recent developments in the field of cancer genomics.
Current Opinion in Genetics & Development 2014, 24:52–60
This review comes from a themed issue on Cancer genomics
Edited by David J Adams and Ultan McDermott
For a complete overview see the Issue and the Editorial
Available online 29th December 2013
0959-437X/$ – see front matter, © 2013 The Authors. Published by Elsevier Ltd. All rights reserved.
Introduction
Long before the discovery of the double helix [1], it was well established that ultraviolet light (UV) can cause tumours of the skin [2]. While the mechanism was unclear at this time, it was hypothesized that successive doses of UV radiation result in accelerating the relative rate of cell proliferation [3]. The paradigm shifting discovery that the genetic material is contained within a deoxyribonucleic acid led to many studies in the late 1950s and throughout the 1960s examining how organisms protect their DNA from endogenous and exogenous mutations, and a focus was given to ultraviolet induced mutations (reviewed in Ref. [4•]). It was established that exposure to UV light can lead to the formation of dimers of any two adjacent pyrimidine bases on the same DNA strand with a preference for thymine–thymine dimers [4•]. It was further shown that UV irradiation damage predominantly results in cytosine to thymine or cytosine–cytosine to thymine–thymine changes, preferentially occurring at these pyrimidine dimers (i.e. C > T or CC > TT DNA mutations at dipyrimidine sites) [5,6]. This was the first detailed characterization of the pattern of DNA changes occurring due to the activity of an exogenous mutagen and, as such, the very first description of a signature of a mutational process.
While these early studies established the mutational signature of UV light, it was unclear whether UV induced mutations are present and involved in the neoplastic expansion of human cancers. The development of the DNA sequencing technique with chain-terminating inhibitors by Sanger et al. [7] allowed rapid examination of the genetic material contained in cancer cells. In the early 1990s, two studies sequenced exons of the gene TP53 [8•,9•] from several patients and provided experimental evidence that aflatoxin and UV light leave distinct patterns (consistent with the ones observed in experimental systems) of DNA mutations respectively in hepatocellular and squamous-cell carcinomas. These studies confirmed that the mutational signatures of carcinogens are left as ‘evidence’ in the genomes of cancer cells [10] thus spawning research which first examined the mutations across TP53 and later across multiple genes and even whole cancer genomes in order to provide a better understanding of the mutational processes involved in human carcinogenesis.
Mutational patterns of TP53
Multiple independent studies used Sanger sequencing of some (or all) exons of a cancer gene to provide clues to the aetiology of both endogenous and exogenous factors of human carcinogenesis. TP53 was usually selected for this analysis due to its high prevalence of somatic mutations in almost all tumour classes [11••]. Commonly, each of these studies involved multiple samples of a cancer type that were examined for somatic mutations in TP53 (studies reviewed in Refs. [11••,12,13]). The TP53 somatic mutations were aggregated, their spectrum was reported as specific for the given cancer type, and this spectrum was then compared to mutations generated experimentally in in vitro or in vivo systems [11••,13]. It should be noted that the mutational spectra of other genes, albeit rarely, were also used for such analysis [14].
These early studies revealed a significant heterogeneity of the TP53 spectra across different cancer types, which allowed associating some patterns of mutation to known carcinogens. Here, we provide a brief summary of some of the more important findings while details could be found in Refs. [11••,12,13]. The TP53 spectrum of skin carcinomas exhibited C > T and CC > TT mutations at dipyrimidines (all substitutions and dinucleotide substitutions are referred to by the pyrimidine(s) of the mutated Watson-Crick base pair). This was consistent with the in vitro described mutational signature of UV light. The TP53 mutational spectrum derived from lung cancers in tobacco smokers was overwhelmed by C > A substitutions, which coincided with the class of mutation produced experimentally as a result of bulky adduct formation by tobacco carcinogens on guanine [15]. In other tobacco associated cancers, such as oesophageal and head and neck tumours, C > A mutations (while still ubiquitous) were less common while there was a significant increase of T > C mutations. Interestingly, in both smokers and non-smokers, C > T and C > G mutations at non-CpG sites were elevated when compared to all other cancer types, with bladder tumours harbouring the most C > G mutations [11••]. Additionally, it was demonstrated that C > A transversions were common in hepatocellular cancers and these mutations were believed to be associated with aflatoxin, a known carcinogen commonly found in food from southern Africa and Asia [16]. Lastly, all cancer types harboured at least some C > T mutations at CpG dinucleotides (mutated base underlined), a process attributed to the normal cellular event of deamination of 5-methylcytosine [11••].
The analyses of TP53 spectra were the first attempts to bridge the gap between molecular cancer genetics and epidemiology [17]. The large number of studies examining TP53 spectra required a computational resource to facilitate and retrieve the already identified somatic mutations. At first these data were managed by the researchers that were generating it but in 1994 the International Agency for Research on Cancer (IARC) started to maintain a database while providing a free access to it [17]. The first release of the IARC TP53 database contained ∼3 000 somatic mutations [18] while the most recent version (R16) released in November of 2012, which can be found at http://p53.iarc.fr/, contains almost 30 000 somatic mutations in TP53.
Though extremely informative, the data gathered from single gene studies have significant limitations. In these studies, the spectrum of a cancer type is reported by aggregating mutations from multiple samples. This may be adequate when a single mutational process generates the majority of mutations in the particular cancer (e.g. UV light is the predominant mutational process in melanoma [19••]). However, usually multiple mutational processes are operative in a single cancer sample, and combining their mutations generates a mixed composition of the patterns of somatic mutations. In most cases, reporting this jumbled spectrum is uninformative for the diversity of mutational processes operative in a single cancer type or in a single cancer sample [20••]. Moreover, the examined TP53 exons are both under selection and also have a specific nucleotide sequence. This affects the opportunity for observing a somatic mutation and as such the reported spectrum can be a reflection of the processes of selection and/or the nucleotide architecture of the TP53 gene in addition to the processes of mutation [21,22].
Two studies tried to overcome some of the single gene limitations by leveraging a targeted capillary sequencing approach of large number of genes. A survey of the 518 protein kinase genes in 25 human breast cancer samples revealed 92 somatic mutations (90 substitutions and 2 indels) in which C > T transitions and C > G transversions preceded by thymine (i.e. C > T and C > G at TpC, mutated base is underlined) occurred with a higher than expected frequency [23]. This survey was later expanded to 210 cancer samples and it revealed more than 1 000 somatic mutations with significant variations in their patterns across the examined twelve cancer types [24]. Only a small fraction of the mutations reported in these screens are likely to be affected by selection [25], thus indicating that the observed mutational patterns reflect the operative mutational processes in the analyzed samples and not the processes of negative or positive selection.
Mutational patterns identified in next generation sequencing data
The development of second-generation sequencing technologies allowed examination of cancer exomes (i.e. the combined protein coding exons) and even whole cancer genomes. Sequencing cancer exomes has been generally preferred as the majority of known cancer-causing driver somatic substitutions, indels, and copy number changes (although generally not rearrangements) [21] are located in protein coding genes. As the nucleotide sequence of protein coding genes is ∼1% of the whole genome, analysis of exomes is considered an advantageous and cost effective methodology for discovering the genes involved in neoplastic development. As a result, many studies have focused predominantly on the generation and analysis of exome sequences [26].
Early next generation sequencing studies started revealing patterns of somatic substitutions in different cancer types. In 2010, two back-to-back studies in Nature reported the patterns of somatic mutations in a malignant melanoma [27•] and small cell lung carcinoma [28•]. As expected, a strong signature of tobacco carcinogens was found in the genome of the lung cancer, while the mutational signature of ultraviolet light overwhelmed the melanoma genome. These studies demonstrated the value of whole genome sequencing for evaluating signatures of mutational processes by providing greater resolution and mechanistic insight into mutational signatures due to known carcinogens, for example through the identification of a lower prevalence of mutations over the footprints of genes.
Multiple independent studies and international consortiums started sequencing large numbers of samples from both cancer genomes and exomes [26]. An integrated genomic characterization was reported for many different cancer types including: acute lymphoblast leukemia [29–31], acute myeloid leukemia [32], breast cancer [33••,34,35], chronic lymphocytic leukemia [36,37], colorectal cancer [38,39], oesophageal cancer [40], glioblastoma [41], cancers of the head and neck [42,43], kidney cancer [44–46], liver cancer [47,48], lung cancer [49–54], lymphomas [55,56], melanoma [57–60], multiple myeloma [61], ovarian cancer [62], pancreatic cancer [63,64], prostate cancer [65–68], stomach cancer [69–71], uterine cancer [72], and several different types of pediatric tumours [73–79]. While these studies focused on the identification of novel cancer genes, mutational spectra were usually reported for each of the examined samples and some studies even tried to associate certain types of somatic mutations with the activity of mutagens or the failure of DNA repair mechanisms. A brief summary of the mutational patterns identified in these cancer genomics studies is provided in the next paragraph.
In lung cancer, comparison between tobacco smokers and non-smokers revealed that smokers have on average 10-fold increase in the burden of somatic mutations in their cancer genomes [50,51]. Consistent with the experimental evidence for tobacco carcinogens, this elevation is mainly due to the increase of the number of C > A transversions [15]. Examination of the cancer genomes of melanomas confirmed that the majority of mutations are C > T and CC > TT at dipyrimidines in the ultraviolet-associated tumours, while acral melanomas exhibit predominantly C > T transitions at CpG sites [59,60]. In glioblastoma multiforme, it was demonstrated that treatment with an alkylating agent, such as temozolomide, significantly elevates the numbers of somatic mutations and results in a distinct mutational pattern of C > T transitions [41]. In chronic lymphocytic leukemia, it was observed that samples with mutations in the immunoglobulin genes have a higher proportion of T > G transversions [36]. This mutational pattern and its immediate sequencing context are consistent with the activity of the error-prone polymerase η during somatic hypermutation [36,80]. In endometrial and colorectal tumours, a set of ultra-hypermutators with increased mutational frequency of transversions was associated with somatic mutations in polymerase ɛ [44,72]. Microsatellite unstable gastric cancer were observed to have a higher mutation prevalence of both C > T transitions and C > A transversions [71]. Examining the cancer exomes of patients with urothelial carcinoma (of the upper urinary tract) revealed a large number of somatic mutations with an unique pattern of T > A transversions predominately located at CpTpG sites and possessing a very strong transcription strand bias [81]. This pattern of mutations was associated with exposure to aristolochic acid. In oesophageal cancer, a high prevalence of T > G transversions was observed [40] while certain breast cancer genomes were found to be overwhelmed with C > T and C > G mutations at TpC sites [35].
These next generation sequencing studies provided an unbiased look into the patterns of DNA changes across cancer genomes. While they resolved some of the previous limitations from TP53 studies (mostly by examining large portions of the human genome which are usually not under selection and which have a nucleotide context that is representative of the whole human genome) they still did not address the important issue of examining mixtures of mutations generated by different mutational processes.
Mutational signatures derived from patterns of somatic mutations
The somatic mutations in a cancer genome are the cumulative result of the mutational processes that have been operative since the very first division of the fertilized egg, from which the cancer cell was derived [21,22]. Each of these mutations was caused by the activity of endogenous and/or exogenous mutational processes with different strengths (Figure 1). Some of these processes have been active throughout the whole lifetime of the cancer patient while others have been sporadically triggered, for example, due to lifestyle choices (Figure 1). While examining patterns of somatic mutations can provide an indication of the aetiology of the operative mutational processes, it does not allow deciphering the individual mutational signatures that are operative in each sample as usually the pattern of a sequenced cancer genome does not resemble any of the operative mutational processes (Figure 1).
Recently, a theoretical model and computational framework that allows decomposing distinct patterns of somatic mutations from a set of cancer samples was developed [20••]. The mathematical model was an extension of the well-known blind source separation problem, in which original signals need to be separated from a set of mixed signals [82], and the algorithm was based on a method used in face recognition software that allows meaningfully learning distinct parts of objects [83]. The algorithm deciphers the minimal set of mutational signatures that optimally explains the proportion of each mutation type found in each cancer sample and then the method estimates the contribution of each signature to each cancer sample (see Ref. [20••] for more details about this method, including a discussion of its limitations).
Initial application of this approach was performed on the somatic substitutions derived from the whole genomes of 21 breast cancer patients [33••]. In order to increase the resolution of the derived mutational signatures, substitutions were examined using their immediate sequencing context. This included the base immediately 5′ before the somatic mutation and the base immediately 3′ after the somatic mutation; thus resulting in 96 mutation types — 16 different for each of the six types of somatic substitutions. For example, C > T mutations were extended to include C > T with (5′ adenine): ApCpA, ApCpC, ApCpG, ApCpT; (5′ cytosine): CpCpA, CpCpC, CpCpG, CpCpT; (5′ guanine): GpCpA, GpCpC, GpCpG, GpCpT; and (5′ thymine): TpCpA, TpCpC, TpCpG, TpCpT. Including the immediate sequence context allows better differentiation between different mutational processes; for example, distinguishing between C > T mutations due to the formation UV-light induced photodimers (i.e. C > T mutations at dipyrimidine sites such as TpCpC or CpCpC) from C > T mutations due to deamination of 5-methylcytosine (i.e. C > T mutations at CpG sites). The mutational catalogues of the 21 breast cancer genomes were generated, including each of the 96 mutation types, and applying the newly developed method to these catalogues revealed multiple distinct mutational signatures of substitutions. As expected, a mutational signature with features of C > T mutations at CpG sites was identified in most samples, thus reflecting the activity of normal endogenous cellular processes. Further, a mutational signature with C > X mutations at TpC sites was identified and based on similarity between its mutational pattern and in vivo experimental data, it was proposed that this process is due to the activity of the APOBEC family of deaminases and more specifically APOBEC1, APOBEC3A, and/or APOBEC3B [84,85]. Additionally, a rather uniform mutational signature (no prominent features across trinucleotides) was also identified and, interestingly, the activity of this mutational signature in each of the 21 samples allowed separation (by unsupervised hierarchical clustering) of BRCA1 and BRCA2 wild-type breast tumours from BRCA1 and BRCA2 germline mutants. Another mutational signature with unknown aetiology and mutations predominately at C > G at TpC was also identified. In addition to these genome-wide signatures, a localized hypermutation (termed kataegis) was observed in some of the breast cancer samples. This localized hypermutation was predominantly constituted of C > T and C > G substitutions at TpC trinucleotides and it was speculated that it is also due to the activity of the APOBEC enzymes. Lastly, deciphering the independent mutational signatures operative in these breast cancer samples provided the means for timing their activity across different cancer subclones [86].
This initial analysis of the mutational signatures operative in the 21 breast cancer genomes revealed several intriguing mutational processes but its focus was predominantly on substitutions. However, the newly developed approach for deciphering mutational signatures also allows extending mutational signature analysis over an arbitrary selected set of biologically meaningful mutation types [20••]. To demonstrate its applicability, the mutational catalogues of the 21 breast cancer genomes were extended to include double nucleotide substitutions, indels at microhomologies, indels at mono/polynucleotide repeats, and even a complex mutation type such as kataegis. Reanalysing these mutational catalogues demonstrated that kataegis separates as its own mutational process. Further, double nucleotide substitutions and indels at microhomologies associated predominantly with the activity of the previously identified uniform mutational process. Lastly, indels at mono/polynucleotide repeats did not strongly associate with any of the previously described mutational processes [20••].
Extending the previously defined mutational catalogues illustrated the possibility of incorporating additional mutation types and it revealed some associations between substitutions and indels thus providing more biological insight into the identified mutational processes [20••]. Further biological insight was derived by analysing mutational catalogues that incorporate the transcriptional strand on which a substitution resides in the footprints of a gene. Thus, the previously defined 96 substitution types were extended to 192 mutation types. For example, the number of C > T mutations at TpCpA were split into two categories: the number of C > T mutations at TpCpA occurring on the untranscribed strand of a gene and the number of C > T mutations at TpCpA occurring on the transcribed strand. In general, one would expect that these two numbers are approximately the same unless the mutational processes are influenced by activity of the transcriptional machinery. This could happen, for example, due to recruitment of the transcription-coupled component of nucleotide excision repair (NER) [87•]. If a mutational process has a higher number of C > A substitutions on the transcribed strand compared to the C > A substitutions on the untranscribed strand (i.e. note that C > A mutations on the untranscribed strand is the same as G > T mutations on the transcribed strand), this could indicate that the mutations caused by this process are being repaired by NER. As such, this analysis provides a further insight into the operative mutational processes and their interaction with cellular repair processes. A known example of such strand bias due to interplay between a mutational process and a repair mechanism is the formation of photodimers due to UV-light exposure that are repaired by NER and result in a higher number of C > T mutations on the untranscribed strand [87•].
Analysing the transcriptional strand bias of the mutational signatures operative in the 21 breast genomes revealed a weak strand bias of C > A mutations with unknown aetiology [20••]. Interestingly, deciphering mutational signatures from 100 breast cancer exomes revealed exactly the same trinucleotide mutational signatures but with a different strand bias. Specifically, there was an elevation of C > X mutations at TpCpT on the transcribed strand of exomes, which was absent in the complete gene footprints derived from the 21 whole genome sequences [20••]. This transcriptional strand bias could be indicative of exon-specific repair processes that are active in the cell.
The extensive mutational signature analysis performed on the 21 breast cancer genomes was recently expanded and mutational signatures (including substitutions, indels, dinucleotide substitutions, kataegis, and strand bias) were deciphered from 30 different types of human cancer [19••]. The previously developed computational framework was applied to almost five million somatic mutations identified in 7 042 cancer samples (507 from whole genome and 6 535 from whole exome sequences). This included both previously published samples and newly sequenced whole genomes. The analysis revealed 27 distinct mutational signatures [19••]. 22 of these 27 mutational signatures were validated (i.e. confirmed by orthogonal technologies or other approaches), three were associated with technology-specific sequencing artefacts, and two of the mutational signatures remain un-validated due to the lack of access to the relevant biological samples.
This largest cancer genomics analysis to date provided the first global roadmap describing the signatures of mutational processes in human cancer. Each of the cancer types had at least two mutational signatures operative in it, while some (e.g. cancers of the liver and uterus) had up to six distinct mutational processes. Remarkably, most of the cancer samples had at least two mutational signatures active in them. Aetiology was proposed for 11 of the 22 validated mutational signatures. Two of the mutational signatures were associated with age of patient at cancer diagnosis and these signatures were present in 26 of the 30 cancer types and more than 70% of the samples. These two processes exhibit clear features of C > T at CpG sites and most likely reflect mutations due to normal cellular processes (e.g. deamination of 5-methylcytosine, errors due to DNA replication, and so on) and probably account for the majority of somatic mutations prior to neoplastic development.
Based on similarity with in vivo experimental data, two mutational processes (termed Signature 2 and 13) were associated with the activity of the APOBEC family of deaminases. These two signatures exhibit predominantly C > T and C > G mutations at TpC sites and were observed in 16 of the 30 cancer types (∼17% of all examined cancer samples) [19••]. As such, the activity of these mutational signatures (and respectively the APOBEC enzymes) is one of the most significant human carcinogens with prevalence superseding that of tobacco smoking and exposure to UV light. Recently, further evidence was provided for the involvement of APOBEC3B in human cancers, as its expression was elevated in tumours compared to their matched normal samples [88,89].
By comparing the substitution patterns of all signatures with experimental data, one of the mutational signatures was associated with exposure to ultraviolet light while another with benzo[a]pyrene, a known tobacco carcinogen. The signature associated with UV-light exhibited a higher presence of CC > TT dinucleotide substitutions as well as a strand bias indicative of the formation of photodimers, which further confirmed the association. In contrast, a mutational signature associated in lung cancer exhibited predominantly C > A mutations with a transcriptional strand bias suggesting the formation of bulky adducts on guanine. Interestingly, this mutational signature was also associated with CC > AA dinucleotide substitutions with a strong strand bias. Statistical tests comparing smokers with non-smokers in two cancer types (viz. lung adenocarcinoma and tumours of the head and neck) confirmed a highly significant elevation of this ‘tobacco smoking signature’ in smokers indicating that it was due to tobacco mutagens.
Further statistical analysis was performed to associate mutations in genes with the presence of mutational signatures. Distinct mutational signatures were associated with: mutations in BRCA1/2 in breast and pancreatic cancers; failure of the DNA mismatch repair pathway (e.g. due to methylation of the MLH1 promoter) in colorectal cancers; hypermutation of the immunoglobulin gene in CLL; recurring polymerase ɛ mutations in uterine and colorectal cancers. Interestingly, the mutational signature associated with failure of DNA mismatch repair was observed in nine different cancer types. While this process was operative in ∼20% of colorectal cancers and ∼15% of uterine cancers, it was also found in at least 1% of cancer samples in another seven cancer types. Another interesting observation was that while almost all BRCA1/2 mutants exhibit a specific mutational signature, there were also BRCA1/2 wild-type samples with high number of mutations due to this mutational process. Thus, it is possible that some BRCA1/2 wild-type samples might harbour somatic mutations or other abnormalities that result in a failure of homologous repair and activation of this mutational process.
Chemotherapy treatment could cause its own set of somatic mutations [24]. Examining the pre-treatment history of all 7 042 cancer samples revealed that melanomas and glioblastomas pre-treated with an alkylating agent exhibit a distinct mutational signature.
The performed global analysis was able to propose an association for 11 of the 22 validated mutation signatures, while the origins and aetiology of the other 11 mutational signatures remains unknown. Lastly, this study also examined the presence of loci of kataegis across human cancer and it revealed that kataegis is not confined only to breast cancer but it is also present in at least another seven cancer types including pancreas, lung, liver, medulloblastoma, CLL, B-cell lymphomas, and ALL.
Conclusions and future promises
In the past five decades, analysis of mutational patterns has evolved from in vitro observation of DNA changes caused by ultraviolet light, to examination of the mutational spectra generated by sequencing single cancer genes in multiple samples, to performing targeted capillary sequencing screens of multiple genes across hundreds of samples, and more recently to large-scale analysis of the genomes of thousands of cancer patients revealing the signatures of the mutational processes involved in the development of their tumours. In the next decade, thousands of new whole cancer genomes across the majority of cancer types [26] will be generated, which will allow the creation of a final and comprehensive map of mutational signatures. The generation of such a mutagenesis map will most likely require the refinement of existing mathematical methods to accurately examine all known types of somatic mutations: substitutions, indels, copy number variations, structural rearrangements, and potentially even epigenetic changes. These analyses of next generation sequencing data must be complemented with experimental work revealing the aetiology of the identified mutational processes.
References and recommended reading
Papers of particular interest, published within the period of review, have been highlighted as:
• of special interest
•• of outstanding interest
Footnotes
This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
References
- 1.Watson J.D., Crick F.H. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature. 1953;171:37–738. doi: 10.1038/171737a0. [DOI] [PubMed] [Google Scholar]
- 2.Rusch H.P., Baumann C.A. Tumor production in mice with ultraviolet irradiation. Am J Cancer. 1939;35:55–62. [Google Scholar]
- 3.Blum H.F. On the mechanism of cancer induction by ultraviolet radiation. J Natl Cancer Inst. 1950;11:463–495. [PubMed] [Google Scholar]
- Witkin E.M. Ultraviolet-induced mutation and DNA repair. Annu Rev Microbiol. 1969;23:487–514. doi: 10.1146/annurev.mi.23.100169.002415. [DOI] [PubMed] [Google Scholar]; This review summarizes the very early effors to understand UV-light induced DNA mutations.
- 5.Howard B.D., Tessman I. Identification of the altered bases in mutated single-stranded DNA. II. In vivo mutagenesis by 5-bromodeoxyuridine and 2-aminopurine. J Mol Biol. 1964;9:364–371. doi: 10.1016/s0022-2836(64)80213-8. [DOI] [PubMed] [Google Scholar]
- 6.Setlow R.B., Carrier W.L. Pyrimidine dimers in ultraviolet-irradiated DNA's. J Mol Biol. 1966;17:237–254. doi: 10.1016/s0022-2836(66)80105-5. [DOI] [PubMed] [Google Scholar]
- 7.Sanger F., Nicklen S., Coulson A.R. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA. 1977;74:5463–5467. doi: 10.1073/pnas.74.12.5463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brash D.E., Rudolph J.A., Simon J.A., Lin A., McKenna G.J., Baden H.P., Halperin A.J., Ponten J. A role for sunlight in skin cancer: UV-induced p53 mutations in squamous cell carcinoma. Proc Natl Acad Sci USA. 1991;88:10124–10128. doi: 10.1073/pnas.88.22.10124. [DOI] [PMC free article] [PubMed] [Google Scholar]; This article provides one of the early evidences that UV light leaves a specific mutational imprint on the DNA of a cancer cell, viz., C > T substitutions at dipyrimidines. This imprint is consistent with the one observed in previous in vitro experiments.
- Ozturk M. P53 mutation in hepatocellular carcinoma after aflatoxin exposure. Lancet. 1991;338:1356–1359. doi: 10.1016/0140-6736(91)92236-u. [DOI] [PubMed] [Google Scholar]; This article provides one of the early evidences that aflatoxin leaves a specific mutational imprint on the DNA of a cancer cell, viz., C > A substitutions. This imprint is consistent with the one observed in previous experiments.
- 10.Vogelstein B., Kinzler K.W. Carcinogens leave fingerprints. Nature. 1992;355:209–210. doi: 10.1038/355209a0. [DOI] [PubMed] [Google Scholar]
- Greenblatt M.S., Bennett W.P., Hollstein M., Harris C.C. Mutations in the p53 tumor suppressor gene: clues to cancer etiology and molecular pathogenesis. Cancer Res. 1994;54:4855–4878. [PubMed] [Google Scholar]; This very detailed review summarizes the studies analyzing TP53 mutational spectra and their relationship with epidemiology.
- 12.Hollstein M., Sidransky D., Vogelstein B., Harris C.C. P53 mutations in human cancers. Science. 1991;253:49–53. doi: 10.1126/science.1905840. [DOI] [PubMed] [Google Scholar]
- 13.Hollstein M., Hergenhahn M., Yang Q., Bartsch H., Wang Z.Q., Hainaut P. New approaches to understanding p53 gene tumor mutation spectra. Mutat Res. 1999;431:199–209. doi: 10.1016/s0027-5107(99)00162-1. [DOI] [PubMed] [Google Scholar]
- 14.Capella G., Cronauer-Mitra S., Pienado M.A., Perucho M. Frequency and spectrum of mutations at codons 12 and 13 of the c-k-ras gene in human tumors. Environ Health Perspect. 1991;93:125–131. doi: 10.1289/ehp.9193125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rodin S.N., Rodin A.S. Origins and selection of p53 mutations in lung carcinogenesis. Semin Cancer Biol. 2005;15:103–112. doi: 10.1016/j.semcancer.2004.08.005. [DOI] [PubMed] [Google Scholar]
- 16.Wogan G.N. Aflatoxins as risk factors for hepatocellular carcinoma in humans. Cancer Res. 1992;52(7 Suppl):2114s–2118s. [PubMed] [Google Scholar]
- 17.Hainaut P., Olivier M., Pfeifer G.P. Tp53 mutation spectrum in lung cancers and mutagenic signature of components of tobacco smoke: lessons from the iarc tp53 mutation database. Mutagenesis. 2001;16:551–553. doi: 10.1093/mutage/16.6.551. [author reply 555–6] [DOI] [PubMed] [Google Scholar]
- 18.Cariello N.F., Cui L., Beroud C., Soussi T. Database and software for the analysis of mutations in the human p53 gene. Cancer Res. 1994;54:4454–4460. [PubMed] [Google Scholar]
- Alexandrov L.B., Nik-Zainal S., Wedge D.C., Aparicio S.A., Behjati S., Biankin A.V., Bignell G.R., Bolli N., Borg A., Borresen-Dale A.L., Boyault S. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study provides the first comprehensive map of the signatures of mutational processes across 30 different types of human cancer derived from 7 042 patients
- Alexandrov L.B., Nik-Zainal S., Wedge D.C., Campbell P.J., Stratton M.R. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 2013;3:246–259. doi: 10.1016/j.celrep.2012.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study provides a mathematical approach and computational framework that allow identifying mutational signatures from next generation sequencing data.
- 21.Stratton M.R. Exploring the genomes of cancer cells: progress and promise. Science. 2011;331:1553–1558. doi: 10.1126/science.1204040. [DOI] [PubMed] [Google Scholar]
- 22.Stratton M.R., Campbell P.J., Futreal P.A. The cancer genome. Nature. 2009;458:719–724. doi: 10.1038/nature07943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Stephens P., Edkins S., Davies H., Greenman C., Cox C., Hunter C., Bignell G., Teague J., Smith R., Stevens C., O’Meara S. A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer. Nat Genet. 2005;37:590–592. doi: 10.1038/ng1571. [DOI] [PubMed] [Google Scholar]
- 24.Greenman C., Stephens P., Smith R., Dalgliesh G.L., Hunter C., Bignell G., Davies H., Teague J., Butler A., Stevens C., Edkins S. Patterns of somatic mutation in human cancer genomes. Nature. 2007;446:153–158. doi: 10.1038/nature05610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rubin A.F., Green P. Mutation patterns in cancer genomes. Proc Natl Acad Sci USA. 2009;106:21766–21770. doi: 10.1073/pnas.0912499106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.International Cancer Genome C., Hudson T.J., Anderson W., Artez A., Barker A.D., Bell C., Bernabe R.R., Bhan M.K., Calvo F., Eerola I., Gerhard D.S. International network of cancer genome projects. Nature. 2010;464:993–998. doi: 10.1038/nature08987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pleasance E.D., Cheetham R.K., Stephens P.J., McBride D.J., Humphray S.J., Greenman C.D., Varela I., Lin M.L., Ordonez G.R., Bignell G.R., Ye K. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature. 2010;463:191–196. doi: 10.1038/nature08658. [DOI] [PMC free article] [PubMed] [Google Scholar]; The Pleasance et al. back-to-back articles in Nature demonstrate the value of whole genome sequencing for evaluating signatures of mutational processes by providing greater resolution and mechanistic insight.
- Pleasance E.D., Stephens P.J., O’Meara S., McBride D.J., Meynert A., Jones D., Lin M.L., Beare D., Lau K.W., Greenman C., Varela I. A small-cell lung cancer genome with complex signatures of tobacco exposure. Nature. 2010;463:184–190. doi: 10.1038/nature08629. [DOI] [PMC free article] [PubMed] [Google Scholar]; The Pleasance et al. back-to-back articles in Nature demonstrate the value of whole genome sequencing for evaluating signatures of mutational processes by providing greater resolution and mechanistic insight.
- 29.De Keersmaecker K., Atak Z.K., Li N., Vicente C., Patchett S., Girardi T., Gianfelici V., Geerdens E., Clappier E., Porcu M., Lahortiga I. Exome sequencing identifies mutation in cnot3 and ribosomal genes rpl5 and rpl10 in t-cell acute lymphoblastic leukemia. Nat Genet. 2013;45:186–190. doi: 10.1038/ng.2508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Holmfeldt L., Wei L., Diaz-Flores E., Walsh M., Zhang J., Ding L., Payne-Turner D., Churchman M., Andersson A., Chen S.C., McCastlain K. The genomic landscape of hypodiploid acute lymphoblastic leukemia. Nat Genet. 2013;45:242–252. doi: 10.1038/ng.2532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zhang J., Ding L., Holmfeldt L., Wu G., Heatley S.L., Payne-Turner D., Easton J., Chen X., Wang J., Rusch M., Lu C. The genetic basis of early t-cell precursor acute lymphoblastic leukaemia. Nature. 2012;481:157–163. doi: 10.1038/nature10725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ding L., Ley T.J., Larson D.E., Miller C.A., Koboldt D.C., Welch J.S., Ritchey J.K., Young M.A., Lamprecht T., McLellan M.D., McMichael J.F. Clonal evolution in relapsed acute myeloid leukaemia revealed by whole-genome sequencing. Nature. 2012;481:506–510. doi: 10.1038/nature10738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nik-Zainal S., Alexandrov L.B., Wedge D.C., Van Loo P., Greenman C.D., Raine K., Jones D., Hinton J., Marshall J., Stebbings L.A., Menzies A. Mutational processes molding the genomes of 21 breast cancers. Cell. 2012;149:979–993. doi: 10.1016/j.cell.2012.04.024. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study demostrates that mutplie mutational processes are operative in 21 breast cancer sample. It also demonstrates the existance of localized somatic hypermutation, termed kataegis, for the first time.
- 34.Shah S.P., Roth A., Goya R., Oloumi A., Ha G., Zhao Y., Turashvili G., Ding J., Tse K., Haffari G., Bashashati A. The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature. 2012;486:395–399. doi: 10.1038/nature10933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Stephens P.J., Tarpey P.S., Davies H., Van Loo P., Greenman C., Wedge D.C., Nik-Zainal S., Martin S., Varela I., Bignell G.R., Yates L.R. The landscape of cancer genes and mutational processes in breast cancer. Nature. 2012;486:400–404. doi: 10.1038/nature11017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Puente X.S., Pinyol M., Quesada V., Conde L., Ordonez G.R., Villamor N., Escaramis G., Jares P., Bea S., Gonzalez-Diaz M., Bassaganyas L. Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia. Nature. 2011;475:101–105. doi: 10.1038/nature10113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Quesada V., Conde L., Villamor N., Ordonez G.R., Jares P., Bassaganyas L., Ramsay A.J., Bea S., Pinyol M., Martinez-Trillos A., Lopez-Guerra M. Exome sequencing identifies recurrent mutations of the splicing factor sf3b1 gene in chronic lymphocytic leukemia. Nat Genet. 2012;44:47–52. doi: 10.1038/ng.1032. [DOI] [PubMed] [Google Scholar]
- 38.Cancer Genome Atlas N. Comprehensive molecular characterization of human colon and rectal cancer. Nature. 2012;487:330–337. doi: 10.1038/nature11252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Seshagiri S., Stawiski E.W., Durinck S., Modrusan Z., Storm E.E., Conboy C.B., Chaudhuri S., Guan Y., Janakiraman V., Jaiswal B.S., Guillory J. Recurrent r-spondin fusions in colon cancer. Nature. 2012;488:660–664. doi: 10.1038/nature11282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dulak A.M., Stojanov P., Peng S., Lawrence M.S., Fox C., Stewart C., Bandla S., Imamura Y., Schumacher S.E., Shefler E., McKenna A. Exome and whole-genome sequencing of esophageal adenocarcinoma identifies recurrent driver events and mutational complexity. Nat Genet. 2013;45:478–486. doi: 10.1038/ng.2591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Parsons D.W., Jones S., Zhang X., Lin J.C., Leary R.J., Angenendt P., Mankoo P., Carter H., Siu I.M., Gallia G.L., Olivi A. An integrated genomic analysis of human glioblastoma multiforme. Science. 2008;321:1807–1812. doi: 10.1126/science.1164382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Agrawal N., Frederick M.J., Pickering C.R., Bettegowda C., Chang K., Li R.J., Fakhry C., Xie T.X., Zhang J., Wang J., Zhang N. Exome sequencing of head and neck squamous cell carcinoma reveals inactivating mutations in notch1. Science. 2011;333:1154–1157. doi: 10.1126/science.1206923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Stransky N., Egloff A.M., Tward A.D., Kostic A.D., Cibulskis K., Sivachenko A., Kryukov G.V., Lawrence M.S., Sougnez C., McKenna A., Shefler E. The mutational landscape of head and neck squamous cell carcinoma. Science. 2011;333:1157–1160. doi: 10.1126/science.1208130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Cancer Genome Atlas Research Network Comprehensive molecular characterization of clear cell renal cell carcinoma. Nature. 2013;499:43–49. doi: 10.1038/nature12222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Guo G., Gui Y., Gao S., Tang A., Hu X., Huang Y., Jia W., Li Z., He M., Sun L., Song P. Frequent mutations of genes encoding ubiquitin-mediated proteolysis pathway components in clear cell renal cell carcinoma. Nat Genet. 2012;44:17–19. doi: 10.1038/ng.1014. [DOI] [PubMed] [Google Scholar]
- 46.Pena-Llopis S., Vega-Rubin-de-Celis S., Liao A., Leng N., Pavia-Jimenez A., Wang S., Yamasaki T., Zhrebker L., Sivanand S., Spence P., Kinch L. Bap1 loss defines a new class of renal cell carcinoma. Nat Genet. 2012;44:751–759. doi: 10.1038/ng.2323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kan Z., Zheng H., Liu X., Li S., Barber T.D., Gong Z., Gao H., Hao K., Willard M.D., Xu J., Hauptschein R. Whole-genome sequencing identifies recurrent mutations in hepatocellular carcinoma. Genome Res. 2013;23:1422–1433. doi: 10.1101/gr.154492.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fujimoto A., Totoki Y., Abe T., Boroevich K.A., Hosoda F., Nguyen H.H., Aoki M., Hosono N., Kubo M., Miya F., Arai Y. Whole-genome sequencing of liver cancers identifies etiological influences on mutation patterns and recurrent mutations in chromatin regulators. Nat Genet. 2012;44:760–764. doi: 10.1038/ng.2291. [DOI] [PubMed] [Google Scholar]
- 49.Seo J.S., Ju Y.S., Lee W.C., Shin J.Y., Lee J.K., Bleazard T., Lee J., Jung Y.J., Kim J.O., Shin J.Y., Yu S.B. The transcriptional landscape and mutational profile of lung adenocarcinoma. Genome Res. 2012;22:2109–2119. doi: 10.1101/gr.145144.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Imielinski M., Berger A.H., Hammerman P.S., Hernandez B., Pugh T.J., Hodis E., Cho J., Suh J., Capelletti M., Sivachenko A., Sougnez C. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell. 2012;150:1107–1120. doi: 10.1016/j.cell.2012.08.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Govindan R., Ding L., Griffith M., Subramanian J., Dees N.D., Kanchi K.L., Maher C.A., Fulton R., Fulton L., Wallis J., Chen K. Genomic landscape of non-small cell lung cancer in smokers and never-smokers. Cell. 2012;150:1121–1134. doi: 10.1016/j.cell.2012.08.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ding L., Getz G., Wheeler D.A., Mardis E.R., McLellan M.D., Cibulskis K., Sougnez C., Greulich H., Muzny D.M., Morgan M.B., Fulton L. Somatic mutations affect key pathways in lung adenocarcinoma. Nature. 2008;455:1069–1075. doi: 10.1038/nature07423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Peifer M., Fernandez-Cuesta L., Sos M.L., George J., Seidel D., Kasper L.H., Plenker D., Leenders F., Sun R., Zander T., Menon R. Integrative genome analyses identify key somatic driver mutations of small-cell lung cancer. Nat Genet. 2012;44:1104–1110. doi: 10.1038/ng.2396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Rudin C.M., Durinck S., Stawiski E.W., Poirier J.T., Modrusan Z., Shames D.S., Bergbower E.A., Guan Y., Shin J., Guillory J., Rivers C.S. Comprehensive genomic analysis identifies sox2 as a frequently amplified gene in small-cell lung cancer. Nat Genet. 2012;44:1111–1116. doi: 10.1038/ng.2405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Love C., Sun Z., Jima D., Li G., Zhang J., Miles R., Richards K.L., Dunphy C.H., Choi W.W., Srivastava G., Lugar P.L. The genetic landscape of mutations in burkitt lymphoma. Nat Genet. 2012;44:1321–1325. doi: 10.1038/ng.2468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Morin R.D., Mendez-Lago M., Mungall A.J., Goya R., Mungall K.L., Corbett R.D., Johnson N.A., Severson T.M., Chiu R., Field M., Jackman S. Frequent mutation of histone-modifying genes in non-hodgkin lymphoma. Nature. 2011;476:298–303. doi: 10.1038/nature10351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Huang F.W., Hodis E., Xu M.J., Kryukov G.V., Chin L., Garraway L.A. Highly recurrent tert promoter mutations in human melanoma. Science. 2013;339:957–959. doi: 10.1126/science.1229259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Stark M.S., Woods S.L., Gartside M.G., Bonazzi V.F., Dutton-Regester K., Aoude L.G., Chow D., Sereduk C., Niemi N.M., Tang N., Ellis J.J. Frequent somatic mutations in map3k5 and map3k9 in metastatic melanoma identified by exome sequencing. Nat Genet. 2012;44:165–169. doi: 10.1038/ng.1041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Berger M.F., Hodis E., Heffernan T.P., Deribe Y.L., Lawrence M.S., Protopopov A., Ivanova E., Watson I.R., Nickerson E., Ghosh P., Zhang H. Melanoma genome sequencing reveals frequent prex2 mutations. Nature. 2012;485:502–506. doi: 10.1038/nature11071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Hodis E., Watson I.R., Kryukov G.V., Arold S.T., Imielinski M., Theurillat J.P., Nickerson E., Auclair D., Li L., Place C., Dicara D. A landscape of driver mutations in melanoma. Cell. 2012;150:251–263. doi: 10.1016/j.cell.2012.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Chapman M.A., Lawrence M.S., Keats J.J., Cibulskis K., Sougnez C., Schinzel A.C., Harview C.L., Brunet J.P., Ahmann G.J., Adli M., Anderson K.C. Initial genome sequencing and analysis of multiple myeloma. Nature. 2011;471:467–472. doi: 10.1038/nature09837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jones S., Wang T.L., Shih Ie M., Mao T.L., Nakayama K., Roden R., Glas R., Slamon D., Diaz L.A., Jr., Vogelstein B., Kinzler K.W. Frequent mutations of chromatin remodeling gene arid1a in ovarian clear cell carcinoma. Science. 2010;330:228–231. doi: 10.1126/science.1196333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Jiao Y., Shi C., Edil B.H., de Wilde R.F., Klimstra D.S., Maitra A., Schulick R.D., Tang L.H., Wolfgang C.L., Choti M.A., Velculescu V.E. Daxx/atrx, men1, and mtor pathway genes are frequently altered in pancreatic neuroendocrine tumors. Science. 2011;331:1199–1203. doi: 10.1126/science.1200609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wu J., Jiao Y., Dal Molin M., Maitra A., de Wilde R.F., Wood L.D., Eshleman J.R., Goggins M.G., Wolfgang C.L., Canto M.I., Schulick R.D. Whole-exome sequencing of neoplastic cysts of the pancreas reveals recurrent mutations in components of ubiquitin-dependent pathways. Proc Natl Acad Sci USA. 2011;108:21188–21193. doi: 10.1073/pnas.1118046108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Baca S.C., Prandi D., Lawrence M.S., Mosquera J.M., Romanel A., Drier Y., Park K., Kitabayashi N., MacDonald T.Y., Ghandi M., Van Allen E. Punctuated evolution of prostate cancer genomes. Cell. 2013;153:666–677. doi: 10.1016/j.cell.2013.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Grasso C.S., Wu Y.M., Robinson D.R., Cao X., Dhanasekaran S.M., Khan A.P., Quist M.J., Jing X., Lonigro R.J., Brenner J.C., Asangani I.A. The mutational landscape of lethal castration-resistant prostate cancer. Nature. 2012;487:239–243. doi: 10.1038/nature11125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Barbieri C.E., Baca S.C., Lawrence M.S., Demichelis F., Blattner M., Theurillat J.P., White T.A., Stojanov P., Van Allen E., Stransky N., Nickerson E. Exome sequencing identifies recurrent spop, foxa1 and med12 mutations in prostate cancer. Nat Genet. 2012;44:685–689. doi: 10.1038/ng.2279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Berger M.F., Lawrence M.S., Demichelis F., Drier Y., Cibulskis K., Sivachenko A.Y., Sboner A., Esgueva R., Pflueger D., Sougnez C., Onofrio R. The genomic complexity of primary human prostate cancer. Nature. 2011;470:214–220. doi: 10.1038/nature09744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Zang Z.J., Cutcutache I., Poon S.L., Zhang S.L., McPherson J.R., Tao J., Rajasegaran V., Heng H.L., Deng N., Gan A., Lim K.H. Exome sequencing of gastric adenocarcinoma identifies recurrent somatic mutations in cell adhesion and chromatin remodeling genes. Nat Genet. 2012;44:570–574. doi: 10.1038/ng.2246. [DOI] [PubMed] [Google Scholar]
- 70.Wang K., Kan J., Yuen S.T., Shi S.T., Chu K.M., Law S., Chan T.L., Kan Z., Chan A.S., Tsui W.Y., Lee S.P. Exome sequencing identifies frequent mutation of arid1a in molecular subtypes of gastric cancer. Nat Genet. 2011;43:1219–1223. doi: 10.1038/ng.982. [DOI] [PubMed] [Google Scholar]
- 71.Nagarajan N., Bertrand D., Hillmer A.M., Zang Z.J., Yao F., Jacques P.E., Teo A.S., Cutcutache I., Zhang Z., Lee W.H., Sia Y.Y. Whole-genome reconstruction and mutational signatures in gastric cancer. Genome Biol. 2012;13:R115. doi: 10.1186/gb-2012-13-12-r115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Cancer Genome Atlas Research Network, Kandoth C., Schultz N., Cherniack A.D., Akbani R., Liu Y., Shen H., Robertson A.G., Pashtan I., Shen R., Benz C.C. Integrated genomic characterization of endometrial carcinoma. Nature. 2013;497:67–73. doi: 10.1038/nature12113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Robinson G., Parker M., Kranenburg T.A., Lu C., Chen X., Ding L., Phoenix T.N., Hedlund E., Wei L., Zhu X., Chalhoub N. Novel mutations target distinct subgroups of medulloblastoma. Nature. 2012;488:43–48. doi: 10.1038/nature11213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Rausch T., Jones D.T., Zapatka M., Stutz A.M., Zichner T., Weischenfeldt J., Jager N., Remke M., Shih D., Northcott P.A., Pfaff E. Genome sequencing of pediatric medulloblastoma links catastrophic DNA rearrangements with tp53 mutations. Cell. 2012;148:59–71. doi: 10.1016/j.cell.2011.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Pugh T.J., Weeraratne S.D., Archer T.C., Pomeranz Krummel D.A., Auclair D., Bochicchio J., Carneiro M.O., Carter S.L., Cibulskis K., Erlich R.L., Greulich H., et al Medulloblastoma exome sequencing uncovers subtype-specific somatic mutations. Nature. 2012;488:106–110. doi: 10.1038/nature11329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Jones D.T., Jager N., Kool M., Zichner T., Hutter B., Sultan M., Cho Y.J., Pugh T.J., Hovestadt V., Stutz A.M., Rausch T. Dissecting the genomic complexity underlying medulloblastoma. Nature. 2012;488:100–105. doi: 10.1038/nature11284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Pugh T.J., Morozova O., Attiyeh E.F., Asgharzadeh S., Wei J.S., Auclair D., Carter S.L., Cibulskis K., Hanna M., Kiezun A., Kim J. The genetic landscape of high-risk neuroblastoma. Nat Genet. 2013;45:279–284. doi: 10.1038/ng.2529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Sausen M., Leary R.J., Jones S., Wu J., Reynolds C.P., Liu X., Blackford A., Parmigiani G., Diaz L.A., Jr., Papadopoulos N., Vogelstein B. Integrated genomic analyses identify arid1a and arid1b alterations in the childhood cancer neuroblastoma. Nat Genet. 2013;45:12–17. doi: 10.1038/ng.2493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Zhang J., Wu G., Miller C.P., Tatevossian R.G., Dalton J.D., Tang B., Orisme W., Punchihewa C., Parker M., Qaddoumi I., Boop F.A. Whole-genome sequencing identifies genetic alterations in pediatric low-grade gliomas. Nat Genet. 2013;45:602–612. doi: 10.1038/ng.2611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Spencer J., Dunn-Walters D.K. Hypermutation at a–t base pairs: the a nucleotide replacement spectrum is affected by adjacent nucleotides and there is no reverse complementarity of sequences flanking mutated a and t nucleotides. J Immunol. 2005;175:5170–5177. doi: 10.4049/jimmunol.175.8.5170. [DOI] [PubMed] [Google Scholar]
- 81.Hoang M.L., Chen C.H., Sidorenko V.S., He J., Dickman K.G., Yun B.H., Moriya M., Niknafs N., Douville C., Karchin R., Turesky R.J. Mutational signature of aristolochic acid exposure as revealed by whole-exome sequencing. Sci Transl Med. 2013;5:197ra102. doi: 10.1126/scitranslmed.3006200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Comon P. Elsevier; 2010. Handbook of blind source separation: independent component analysis and blind deconvolution. [Google Scholar]
- 83.Lee D.D., Seung H.S. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401:788–791. doi: 10.1038/44565. [DOI] [PubMed] [Google Scholar]
- 84.Taylor B.J., Nik-Zainal S., Wu Y.L., Stebbings L.A., Raine K., Campbell P.J., Rada C., Stratton M.R., Neuberger M.S. DNA deaminases induce break-associated mutation showers with implication of apobec3b and 3a in breast cancer kataegis. eLife. 2013;2:e00534. doi: 10.7554/eLife.00534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Harris R.S., Petersen-Mahrt S.K., Neuberger M.S. Rna editing enzyme apobec1 and some of its homologs can act as DNA mutators. Mol Cell. 2002;10:1247–1253. doi: 10.1016/s1097-2765(02)00742-6. [DOI] [PubMed] [Google Scholar]
- 86.Nik-Zainal S., Van Loo P., Wedge D.C., Alexandrov L.B., Greenman C.D., Lau K.W., Raine K., Jones D., Marshall J., Ramakrishna M., Shlien A. The life history of 21 breast cancers. Cell. 2012;149:994–1007. doi: 10.1016/j.cell.2012.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Zeeland A.A., Vreeswijk M.P., de Gruijl F.R., van Kranen H.J., Vrieling H., Mullenders L.F. Transcription-coupled repair: impact on uv-induced mutagenesis in cultured rodent cells and mouse skin tumors. Mutat Res. 2005;577:170–178. doi: 10.1016/j.mrfmmm.2005.03.018. [DOI] [PubMed] [Google Scholar]; This article explains the interplay between transcription-coupled repair and mutations induced by UV-light.
- 88.Roberts S.A., Lawrence M.S., Klimczak L.J., Grimm S.A., Fargo D., Stojanov P., Kiezun A., Kryukov G.V., Carter S.L., Saksena G., Harris S. An apobec cytidine deaminase mutagenesis pattern is widespread in human cancers. Nat Genet. 2013;45:970–976. doi: 10.1038/ng.2702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Burns M.B., Temiz N.A., Harris R.S. Evidence for apobec3b mutagenesis in multiple human cancers. Nat Genet. 2013;45:977–983. doi: 10.1038/ng.2701. [DOI] [PMC free article] [PubMed] [Google Scholar]