Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Dec 3.
Published in final edited form as: Cell Rep. 2014 Nov 20;9(4):1228–1234. doi: 10.1016/j.celrep.2014.10.031

Transcription restores DNA repair to heterochromatin, determining regional mutation rates in cancer genomes

Christina L Zheng 1,2, Nicholas J Wang 3, Jongsuk Chung 4, Homayoun Moslehi 5, J Zachary Sanborn 6, Joseph S Hur 7, Eric A Collisson 8, Swapna S Vemula 9, Agne Naujokas 9, Kami E Chiotti 10, Jeffrey B Cheng 5, Hiva Fassihi 11, Andrew J Blumberg 12, Celeste V Bailey 13, Gary M Fudem 14, Frederick G Mihm 15, Bari B Cunningham 16, Isaac M Neuhaus 5, Dennis H Oh 5,17, James E Cleaver 5, Philip E LeBoit 9, Joseph F Costello 18, Alan R Lehmann 19, Joe W Gray 2,3, Paul T Spellman 2,10, Sarah T Arron 5, Nam Huh 4, Elizabeth Purdom 20,*, Raymond J Cho 5,*
PMCID: PMC4254608  NIHMSID: NIHMS638559  PMID: 25456125

Summary

Somatic mutations in cancer are more frequent in heterochromatic and late-replicating regions of the genome. We report that regional disparities in mutation density are virtually abolished within transcriptionally silent genomic regions of cutaneous squamous cell carcinomas (cSCCs) arising in an XPC−/− background. XPC−/− cells lack global genome nucleotide excision repair (GG-NER), thus establishing differential access of DNA repair machinery within chromatin-rich regions of the genome as the primary cause for the regional disparity. Strikingly, we find that increasing levels of transcription reduce mutation prevalence on both strands of gene bodies embedded within H3K9me3-dense regions, and only to those levels observed in H3K9me3-sparse regions, also in an XPC-dependent manner. Therefore, transcription appears to reduce mutation prevalence specifically by relieving the constraints imposed by chromatin structure on DNA repair. We model this novel relationship between transcription, chromatin state and DNA repair, revealing a new, personalized determinant of cancer risk.

Introduction

Somatic point mutations and chromosomal aberrations in cancer are not distributed uniformly throughout the genome (Alexandrov et al., 2013; Jäger et al., 2013; Lawrence et al., 2013; Polak et al., 2014). Despite the myriad mutational processes active in human cancers (Alexandrov et al., 2013), similar regional patterns of somatic mutation density are observed across many malignancy types, suggesting a common underlying mechanism (Hodgkinson et al., 2012; Lawrence et al., 2013). Chromatin organization heavily influences regional mutation rate, with higher densities of mutation observed in tightly packaged DNA, corresponding to late-replicating portions of the genome and genes with lower expression level (Liu et al., 2013; Schuster-Böckler and Lehner, 2012). For example, more than 40% of mutation frequency variation is correlated with the heterochromatin-associated histone modification H3K9me3 in both solid and hematologic cancer types (Schuster-Böckler and Lehner, 2012).

The reason why chromatin density and replication timing predict regional heterogeneity in mutation prevalence is unclear. Mutation rate correlates most strongly with H3K9me3 and to a lesser degree with H4K20me3 and H3K79me3 (Schuster-Böckler and Lehner, 2012). All three marks correlate with constitutively closed chromatin states, cytogenetically recognized as heterochromatin (Barski et al., 2007; Mikkelsen et al., 2007), suggesting a specific chromatin conformation may underlie the variance. Higher transcription rates correlate with lower prevalence of mutations originating on transcribed strands of genes (Pleasance et al., 2010), but transcription-coupled nucleotide excision repair (TC-NER) explains only a fraction of observed regional heterogeneity. It has been speculated that late-replicating regions suffer from lower fidelity DNA synthesis because of depletion of the free nucleotide pool (Liu et al., 2013; Stamatoyannopoulos et al., 2009). However a direct functional effect of specific chromatin state or replication timing on NER has not been established in humans (Gospodinov and Herceg, 2013). Recently, some melanomas with acquired mutations in NER genes were shown to demonstrate weaker association of mutation density with transcription and DNase I hypersensitivity sites (Polak et al., 2014).

Results

We sought to understand whether observed differences in regional mutation frequency within cancer genomes were driven primarily by NER activity. We studied tumors from patients with xeroderma pigmentosum (XP), a spectrum of genetic disorders associated with defects in NER (Cleaver, 2005). Patients with loss of function in XPC are defective in GG-NER, but proficient in TC-NER. If regional mutation frequency were caused by NER, in an XPC−/− background, we would expect regional disparities in mutation to persist within transcriptionally active portions of the genome, but not within transcriptionally silent regions. To test this hypothesis, whole-genome sequences were obtained from cSCCs arising in five patients with homozygous frameshift mutations (C940del-1) in the XPC gene (Cleaver et al., 2007), as well as from eight patients with no known major germline DNA repair deficiency (repair wild-type, WT) (Table S1). A total of 3,543,126 point mutations were identified. As expected for skin cancers, transitions (C>T/G>A) typical of UV damage predominated among detected mutations, representing 76% of point mutations in WT cSCCs and 86% in XPC−/− cSCCs. Mutation frequency, measured as transition mutations per Kb, was explored in relation to chromatin structure, replication time, and gene expression using ENCODE data derived from keratinocytes (ENCODE Project Consortium et al., 2012).

Regional mutation disparities in cancer genomes result primarily from DNA repair

Consistent with recent work (Liu et al., 2013; Schuster-Böckler and Lehner, 2012), we report that mutation prevalence correlated directly with both H3K9me3 density (P < 0.001) and replication time (P < 0.001), and anti-correlated with density of the repressive mark H3K27me3, within both expressed and non-expressed portions of WT cancer genomes (Figure 1). Strikingly, in all five examined XPC−/− cancers, these associations were virtually abolished in non-expressed portions of the genome, with mutation density at most 10% of that of WT cancers (Figure 1) and reduced to about half of that of WT cancers in expressed portions of the genome, where only TC-NER would be expected to remain active. Increased mutation density was also associated with sparser active histone marks such as H3K27ac and H3K4me1, and these relationships were once again absent within non-expressed regions of XPC−/− cancers (Figure S1).

Figure 1. Regional disparities in mutation density are absent in non-expressed portions of the genome of germline XPC−/− squamous cell carcinomas.

Figure 1

The x-axis of each graph shows increasing ChIP intensity of the heterochromatin-associated histone mark H3K9me3 (ENCODE data, Broad Institute, Panels A, C) and increasing inverse median RepliSeq values representing later replication time (ENCODE data, University of Washington, Panels B, D). The y-axis represents the mutation density per Kb divided by the individual mean. Plotted are values for either 8 aggregated repair wild-type (WT) cancers (solid blue line) or 5 aggregated XPC−/− cancers (broken orange line) for 8 equally sized genomic bins covering approximately 2Gb of expressed genome and 1Gb of non-expressed genome (+/− STD). Whereas mutation density correlates positively with increasing H3K9me3 and later replication time for expressed regions in repair wild-type cancers, these associations are diminished in XPC−/− samples (Panels A, B). In non-expressed portions of the genome, regional disparities in mutation density are almost completely abolished in XPC−/− samples (Panels C, D), indicating loss in the absence of GG-NER. See Figure S1 for additional data with sparser active marks H3K27ac and H3K4me1 and Table S1 for additional information on tumor samples.

In WT cSCC genomic regions with the lowest H3K9me3 density and highest transcription levels, our measure of TC-NER (the reduction of mutation density resulting from lesions on the transcribed strand, as a proportion of all expected mutations) was 29–34% (Table S2). Interestingly, in regions with the highest H3K9me3 density and highest transcription levels, this reduction was only 16–25%, suggesting that exclusion of TC-NER machinery within tightly packaged DNA may decrease its activity. In WT cancers, differences in TC-NER comprised on average only 1.4% of differences between the highest and lowest H3K9me3 densities, at the 70th percentile of most highly expressed genome (Table S3). In contrast, in XPC−/− cancers, 44% of the differences in mutation prevalence between the highest and lowest H3K9me3 levels could be ascribed to differences in TC-NER. Because TC-NER is not affected by loss-of-function in XPC, it is expected that TC-NER would be responsible for a greater proportion of residual disparities in mutation density in XPC−/− cancers (van Hoffen et al., 1995). Collectively, these findings reveal that the primary cause of regional disparities in mutation prevalence is differential access of DNA repair proteins imposed by chromatin state, specifically NER in cSCCs. Because global patterns of H3K9me3 density correlate with mutation prevalence across many different cancer types (Polak et al., 2014; Schuster-Böckler and Lehner, 2012), it is possible that this mechanism is active in other neoplasms and forms of mutagenesis.

Transcription enhances DNA repair only in chromatin-dense portions of the genome

We further analyzed the quantitative effects of GG-NER and TC-NER on mutation density in cancer genomes. In WT cSCCs, regions with greater expression levels showed a significantly decreased density of mutation originating both on the transcribed and untranscribed strands. The magnitude of this effect increased with greater H3K9me3 density and replication time (Figures 2 and S2). Notably, in XPC−/− cancers, higher expression levels only reduced the frequency of mutations resulting from lesions on the transcribed strand, an effect that can be attributed to TC-NER. However, the transcription-dependent (but TC-NER-independent) DNA repair observed on the untranscribed strand of WT cSCCs is possibly identical to an XPC-dependent phenomenon termed transcription domain-associated repair (DAR) (Nouspikel and Hanawalt, 2000; Nouspikel et al., 2006), which affects both strands in expressed regions.

Figure 2. Domain-associated repair restores low mutation rate only to highly transcribed genes in tightly-packaged DNA.

Figure 2

The x-axis denotes increasing expression in NHEK, measured in RPKM (plotted on a log scale). On the y-axis is the mutation density per Kb. Values are plotted for three independent WT cSCCs (A–C) and three independent XPC−/− cSCCs (D–F). The plots show six different H3K9me3 densities representing different chromatin levels, represented by distinct colors, for the transcribed (solid line) and untranscribed (broken line) strands. In WT cancers, both strands show decreasing mutation density in tightly-packaged DNA, illustrating robust domain-associated repair (DAR). DAR restores mutation rate in the most heterochromatic genomic regions to that of euchromatic regions, evidencing a dominant effect over chromatin state, but negligible additional impact in euchromatin (low H3K9me3). Even lower mutation density is seen from lesions on the transcribed strand, presumably representing TC-NER. In contrast, the XPC/− cancers show an absence of DAR, represented by an absence of transcription-dependent repair on the untranscribed strand, but intact TC-NER. See Figure S2 for additional samples and Table S2S6 for more detailed mutation density information.

Although DAR is active on both strands of expressed genes, a representative measure of DAR activity is limited to the untranscribed strand where TC-NER is absent. In the WT cSCC genome, the impact of DAR, measured as decreasing mutation frequency from lesions on the untranscribed strand with increasing expression, was substantial. For example, within non-expressed portions of WT cSCC genomes (RPKM < 0.01), mutation frequencies in regions with high H3K9me3 levels were approximately three-fold greater than those with low H3K9me3 levels, consistent with recent estimates (Lawrence et al., 2013). In contrast, for highly expressed genes (e.g. RPKM = 400), this difference disappeared, with frequency of mutations originating on the untranscribed strand of all regions approaching that of DNA with low chromatin levels (grey dashed line at H3K9me3 = 1, Figure 2). This effect was also seen in three WT basal cell carcinomas (Figures 3F, 3G, and 3H). However, expression levels showed no effect on mutation frequencies in genomic regions with the lowest H3K9me3 levels.

Figure 3. Gene expression significantly alters tumor suppressor mutation rates.

Figure 3

The x-axis shows increasing H3K9me3 intensity, representing a more repressive chromatin state. The y-axis shows the fold increase of the probability of a mutation, given a 50% decrease in expression level, referred to here as θ. Plotted is θ for 20,841 1Kb segments covering transcribed portions of 261 genes recently identified as recurrently mutated in human cancers. Highlighted are 1Kb fragments containing exons for the SCC tumor suppressors TP53 (A), NOTCH1 (B), IRF6 (C), as well as for the gene with exons of greatest average level of such mutation variance, CDC27 (D), which has been shown to be mutated at about 4% in melanomas and 2% in head and neck SCCs. Exons with the highest variance and its corresponding θ are indicated. See Table S7 for θ for all 20,841 1Kb segments.

Proto-oncogene transcription level significantly influences mutation frequency

We noted that the differences in mutation frequency associated with both transcription and chromatin state were of comparable magnitude to those caused by XPC loss-of-function. On average, XPC−/− tumors harbor about a 5-fold greater mutation burden compared to WT cancers in transcribed regions (Table S4) illustrating how modest differences in mutation frequency can confer a large increase in cancer susceptibility. For reference, if five to six independent mutations were required for cSCC formation, a 5-fold increase in frequency of each mutation would raise the cancer rate by about 4,000-fold, approximately the observed increase in XPC patients (DiGiovanna and Kraemer, 2012; Lehmann et al., 2011). For genes in regions of the greatest H3K9me3 density in WT cancers, overall mutation density was lowered up to 4.7-fold as a result of higher expression, resulting from combined activities of GG-NER (in the form of DAR) and TC-NER (Table S5). Furthermore, in WT tumors, we found a 3–4 fold reduction in mutation prevalence resulting from TC-NER of lesions on the transcribed strand (this reduction is 30-fold in XPC−/− tumors, possibly as the result of TC-NER acting in a compensatory role) (Table S6).

These observations led us to explore the possibility that natural variation in mRNA expression levels could exert an important influence on the mutation frequency of oncogenes located in tightly packaged DNA. In expression data obtained from the Genotype-Tissue Expression database (GTEx Consortium, 2013), we found that 72% of genes expressed in skin samples showed a two-fold or greater variation in expression within a group of about 150 individuals. Similarly within ~660 lymphocytic cell lines in the 1000 Genomes Project (Lappalainen et al., 2013), approximately 80% of genes demonstrated at least a two-fold difference. Thus, we assessed the potential impact of a two-fold expression variance in our model. First, the variable θ was modeled: the fold increase in mutation frequency resulting from a 50% decrease in expression level, for a given H3K9me3 level, based on our data in WT cSCCs (Supplemental Experimental Procedures). We then examined θ for 261 genes recently identified in a meta-analysis as recurrently mutated in human cancers (Lawrence et al., 2013). Genes were divided into 20,841 1Kb genomic segments for analysis (Figure 3 and Table S7).

Our estimates of θ predict that a 50% reduction in expression level would increase mutation frequency by 10%–20% or more for multiple exons in the SCC tumor suppressor genes TP53, NOTCH1 and IRF6 (Agrawal et al., 2011; Wang et al., 2011). The exon with the highest θ in this set, 1.21, belongs to CDC27, a gene demonstrating a 2% mutation frequency in head and neck SCCs and 4% in melanomas (Cerami et al., 2012), cancers whose tissues of origin depend on NER to control mutation frequency. The clinical impact of such effects in a population could be evaluated by determining both gene mutation density and expression level in a large series of tumors. We estimate that to have an 80% chance of detecting a 15% increase in mutation density in a gene within highly mutated cancers such as ours, a study would need to be powered with a minimum of approximately 600 samples at each transcript level (Supplemental Experimental Procedures).

Discussion

Our data shows that in the germline absence of GG-NER, regional disparities in mutation density associated with chromatin-dense regions are virtually abolished in non-expressed portions of cancer genomes while the residual differences in mutation prevalence within expressed portions of the genome predominantly arise from disparities in TC-NER. Therefore we establish that DNA repair efficiency is the main source of regional disparities in mutation density in cSCCs.

Unexpectedly, we also find that transcription and chromatin state do not influence mutation density independently. The decrease in mutation prevalence resulting from increasing levels of transcription was found to be correlated with H3K9me3 density and is in fact absent at the lowest H3K9me3 levels. A parsimonious interpretation of these data is that DAR acts solely and dominantly to restore GG-NER to expressed areas within tightly-packaged DNA, perhaps as a result of transcriptional complexes increasing DNA accessibility to damage sensors such as XPC, rather than by a directed process such as TC-NER. This hypothesis agrees with previous observations that DAR proceeds even in the presence of RNA polymerase II inhibitors (Nouspikel et al., 2006) and suggests a mechanism by which active genes maintain lower mutation frequencies, even in heterochromatic regions with reduced access to NER machinery. Highly expressed gene segments with greater H3K9me3 density have these marks concentrated in gene bodies but not promoters. We therefore conclude that gene expression plays a critical role by relieving the structural constraints imposed by densely-packed chromatin on DNA repair machinery, rather than simply influencing mutation density alongside chromatin state either independently or in correlation.

We further establish that the natural variation in transcription level of proto-oncogenes, between individuals, is sufficient to significantly influence their mutation rate. Our results therefore not only reveal a mechanistic basis for variable mutation density within cancer genomes, but also show how to estimate proto-oncogene mutation rates of individuals in a population based on gene expression and chromatin state. These differences represent a novel modulator of risk for specific cancers, deserving further investigation in population-based studies.

Experimental Procedures

Study design, tumor samples and DNA sequencing

Tumor samples were obtained for 5 germline XPC−/− cSCCs following a UCSF Committee on Human Research protocol addressing isolation of these tumors during surgery. At least 5 µg of DNA were collected from dissected tissue or peripheral blood and sequenced using the Illumina HiSeq2000 systems. More than 85% of targeted regions received 70-fold coverage at >90% of bases. Processing of raw sequencing data was performed using BWA (Li et al., 2008), samtools (Li et al., 2009), and GATKsoftware packages [http://www.broadinstitute.org/gatk/#Introduction]. A detailed description of these methods is provided in Supplemental Experimental Procedures.

Processing of NHEK chromatin and replication time data

Hg19 ChIP-seq signal intensity of H3k4me1, H3k9me3, H3k27ac (Ram et al., 2011) and percentage-normalized signal Repli-Seq data (Hansen et al., 2010) were obtained from ENCODE for normal human epidermal keratinocytes (NHEK) (ENCODE Project Consortium et al., 2012). Signal intensities were averaged for 1Kb intervals across the genome. Genomic regions overlapping gaps (e.g. centromeres, telomeres) within the genomic assembly (Hg19 gap track from the UCSC Genome Browser) were excluded.

Identification of expressed and non-expressed genomic regions in NHEK

Whole NHEK cell long polyA and non-polyA RNA fastq files generated by CSHL were downloaded from ENCODE and aligned to Hg19 using STAR V2.3.0.e(Dobin et al., 2013) with default parameters except for allowing for a maximum of 2 mismatches. Non-expressed genomic regions were then identified as 1Kb regions with 0 reads mapped to that region and expressed genomic regions were identified as 1Kb regions with >= 1 read mapped to that region. Genomic regions encompassed by spliced reads (e.g. introns) were included as expressed genomic regions. As defined by these parameters, 2,065,687 1Kb regions were identified as expressed and 1,032,437 1Kb regions were identified as non-expressed.

Processing of NHEK expression data

NHEK RPKM data for Hg19 Gencode v.10 annotated genes were downloaded from the Encode RNA Dashboard (http://genome.crg.es/encode_RNA_dashboard/hg19/). RPKM values were assigned to 1Kb genomic intervals spanning the length of the entire gene, including introns. Modeling of Mutation Density vs. G

Modeling of mutation density vs. genomic feature

For each 1Kb region of the genome, the numbers of mutations were calculated as well as the total number of `callable’ nucleotide positions – i.e. which met the criteria for being called mutated (at least 8× coverage for the tumor and 4× coverage for the normal). Additionally, for the 1Kb bins within annotated gene regions (Hg19 Gencode v.10), the number of mutations on the transcribed and untranscribed strands was counted separately. The expressed/non-expressed portions of the genome were divided into 8 equal bins with respect to increasing intensities of individual histone density signals based on ChIP from ENCODE data. For replication timing, expressed and non-expressed portions of the genome were divided into 8 equal bins based on (1/Repli-seq intensity) ×103. Mutations per megabase within each bin was calculated as the total number of mutations normalized by the total number of callable bases. To aggregate WT and XPC−/− samples, respectively, mutations per megabase for each sample were normalized by the mean mutation rate of each sample.

Relationship of mutation density to histone and expression levels

For analysis, we considered only regions with 1) at least 100 bp of callable positions, 2) non-missing RPKM and Histone values, and 3) annotated genes (Hg19 Gencode v.10). This resulted in a total of 1,160,378 1Kb regions. We fit a generalized linear model (GLM) separately to the transcribed and untranscribed counts for each sample in order to estimate the proportion of bases mutated as a function of the RPKM and Histone values at a base. The probability of a mutation at a position i, denoted as pi, was modeled as a function of its RPKM value (RPKMi) and Histone value (Histonei). We used a standard GLM model for binomial data,

logpi1pi=β0+βRPKM(logRPKMi+e5)+βHistonslogHistonei+βInt(logRPKMi+e5)(logHistonei)

In order to work on the log scale and handle zero valued data, RPKM values were shifted by exp(−5)≈ 0.006, as noted in the above equation. The input to the model was the number of mutated positions and the total number of callable positions, per 1Kb region. We fit this model using the glm function in R, allowing for overdispersion in the data via the standard quasilikelihood option for the binomial family. Confidence intervals for the fitted model were provided via the predict function in R.

Relationship of mutation rate to histone and replication time for non-expressed regions

A similar strategy was performed for calculating the relationship between histone and mutation rate for non-expressed regions. In this analysis, only regions with at least 100bp of callable positions and that were manually identified as non-expressed in the sample as described above, were included, a number that varied per sample. The GLM model was the same as above, only without the terms involving RPKM,

logpi1pi=β0+βHistonelogHistonei

For replication timing, the same model was used, only log Histonei was replaced by 1/Reptimei, where Reptimei refers to Repli-seq intensity for the region. The reported p-value for the significance of histone or replication in predicting mutation rate was the p-value determined by testing the null hypothesis that βHistone=0 or βReptime=0, respectively.

Fold increase in mutation density resulting from 50% decrease in expression

The overall mutation rate (resulting from lesions on the transcribed and untranscribed strands combined) was modeled as a function of histone and RPKM on the log scale using a generalized linear model, in the same manner as described above. Using this model, we calculated the rate of change of the log-odds of mutation rate as a function of RPKM for a fixed level of histone. This was computed as the partial derivative of the log-odds of a mutation with respect to the log of RPKM, which simplifies to

d(logodds)=(βRPKM+βIntlog(Histonei))d(logRPKM)

Because the probability of a mutation is very small, the log-odds are approximately equivalent to the log of the mutation probability. Then for a change in RPKM of X1 to X2, and histone levels held constant, we can approximate the fold change in the mutation rate as

p1p2(X1X2)βRPKM+βIntlog(Histone)

Supplementary Material

1
2
3
4
5
6
7

ACKNOWLEDGEMENTS

All sequencing data were deposited in dbGAP on acceptance of this manuscript. We thank Peggy Tuttle and Maria Damen for help in sample procurement and Dr. Leslie Cope for manuscript comments. This work was supported by the Well Aging Research Center, Samsung Advanced Institute of Technology, under the auspices of Professor Sang Chul Park, the Dermatology Foundation, National Institutes of Health, National Cancer Institute grants K08 CA169865 (RJC) and U54 CA112970 and by the OHSU Knight Cancer Institute (JWG). We appreciate assistance in artwork from Sarah Pyle and Eli Blair Media.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES

  1. Agrawal N, Frederick MJ, Pickering CR, Bettegowda C, Chang K, Li RJ, Fakhry C, Xie T-X, Zhang J, Wang J, et al. Exome sequencing of head and neck squamous cell carcinoma reveals inactivating mutations in NOTCH1. Science. 2011;333:1154–1157. doi: 10.1126/science.1206923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SAJR, Behjati S, Biankin AV, Bignell GR, Bolli N, Borg A, Børresen-Dale A-L, et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barski A, Cuddapah S, Cui K, Roh T-Y, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K. High-resolution profiling of histone methylations in the human genome. Cell. 2007;129:823–837. doi: 10.1016/j.cell.2007.05.009. [DOI] [PubMed] [Google Scholar]
  4. Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2:401–404. doi: 10.1158/2159-8290.CD-12-0095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cleaver JE. Cancer in xeroderma pigmentosum and related disorders of DNA repair. Nat. Rev. Cancer. 2005;5:564–573. doi: 10.1038/nrc1652. [DOI] [PubMed] [Google Scholar]
  6. Cleaver JE, Feeney L, Tang JY, Tuttle P. Xeroderma pigmentosum group C in an isolated region of Guatemala. J. Invest. Dermatol. 2007;127:493–496. doi: 10.1038/sj.jid.5700555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. DiGiovanna JJ, Kraemer KH. Shining a light on xeroderma pigmentosum. J. Invest. Dermatol. 2012;132:785–796. doi: 10.1038/jid.2011.426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. ENCODE Project Consortium. Bernstein BE, Birney E, Dunham I, Green ED, Gunter C, Snyder M. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Gospodinov A, Herceg Z. Shaping chromatin for repair. Mutat. Res. 2013;752:45–60. doi: 10.1016/j.mrrev.2012.10.001. [DOI] [PubMed] [Google Scholar]
  11. GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hansen RS, Thomas S, Sandstrom R, Canfield TK, Thurman RE, Weaver M, Dorschner MO, Gartler SM, Stamatoyannopoulos JA. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc. Natl. Acad. Sci. U. S. A. 2010;107:139–144. doi: 10.1073/pnas.0912402107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Hodgkinson A, Chen Y, Eyre-Walker A. The large-scale distribution of somatic mutations in cancer genomes. Hum. Mutat. 2012;33:136–143. doi: 10.1002/humu.21616. [DOI] [PubMed] [Google Scholar]
  14. Van Hoffen A, Venema J, Meschini R, van Zeeland AA, Mullenders LH. Transcription-coupled repair removes both cyclobutane pyrimidine dimers and 6–4 photoproducts with equal efficiency and in a sequential way from transcribed DNA in xeroderma pigmentosum group C fibroblasts. EMBO J. 1995;14:360–367. doi: 10.1002/j.1460-2075.1995.tb07010.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Jäger N, Schlesner M, Jones DTW, Raffel S, Mallm J-P, Junge KM, Weichenhan D, Bauer T, Ishaque N, Kool M, et al. Hypermutation of the inactive X chromosome is a frequent event in cancer. Cell. 2013;155:567–581. doi: 10.1016/j.cell.2013.09.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Lappalainen T, Sammeth M, Friedländer MR, Hoen PAC‘t, Monlong J, Rivas MA, Gonzàlez-Porta M, Kurbatova N, Griebel T, Ferreira PG, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501:506–511. doi: 10.1038/nature12531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Lawrence MS, Stojanov P, Polak P, Kryukov GV, Cibulskis K, Sivachenko A, Carter SL, Stewart C, Mermel CH, Roberts SA, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature. 2013;499:214–218. doi: 10.1038/nature12213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lehmann AR, McGibbon D, Stefanini M. Xeroderma pigmentosum. Orphanet J. Rare Dis. 2011;6:70. doi: 10.1186/1750-1172-6-70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Li H, Ruan J, Durbin R. Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008;18:1851–1858. doi: 10.1101/gr.078212.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R 1000 Genome Project Data Processing Subgroup. The Sequence Alignment/Map format and SAMtools. Bioinforma. Oxf. Engl. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Liu L, De S, Michor F. DNA replication timing and higher-order nuclear organization determine single-nucleotide substitution patterns in cancer genomes. Nat. Commun. 2013;4:1502. doi: 10.1038/ncomms2502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Mikkelsen TS, Ku M, Jaffe DB, Issac B, Lieberman E, Giannoukos G, Alvarez P, Brockman W, Kim T-K, Koche RP, et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007;448:553–560. doi: 10.1038/nature06008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Nouspikel T, Hanawalt PC. Terminally differentiated human neurons repair transcribed genes but display attenuated global DNA repair and modulation of repair gene expression. Mol. Cell. Biol. 2000;20:1562–1570. doi: 10.1128/mcb.20.5.1562-1570.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Nouspikel TP, Hyka-Nouspikel N, Hanawalt PC. Transcription domain-associated repair in human cells. Mol. Cell. Biol. 2006;26:8722–8730. doi: 10.1128/MCB.01263-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Pleasance ED, Cheetham RK, Stephens PJ, McBride DJ, Humphray SJ, Greenman CD, Varela I, Lin M-L, Ordóñez GR, Bignell GR, et al. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature. 2010;463:191–196. doi: 10.1038/nature08658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Polak P, Lawrence MS, Haugen E, Stoletzki N, Stojanov P, Thurman RE, Garraway LA, Mirkin S, Getz G, Stamatoyannopoulos JA, et al. Reduced local mutation density in regulatory DNA of cancer genomes is linked to DNA repair. Nat. Biotechnol. 2014;32:71–75. doi: 10.1038/nbt.2778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Ram O, Goren A, Amit I, Shoresh N, Yosef N, Ernst J, Kellis M, Gymrek M, Issner R, Coyne M, et al. Combinatorial patterning of chromatin regulators uncovered by genome-wide location analysis in human cells. Cell. 2011;147:1628–1639. doi: 10.1016/j.cell.2011.09.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Schuster-Böckler B, Lehner B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature. 2012;488:504–507. doi: 10.1038/nature11273. [DOI] [PubMed] [Google Scholar]
  29. Stamatoyannopoulos JA, Adzhubei I, Thurman RE, Kryukov GV, Mirkin SM, Sunyaev SR. Human mutation rate associated with DNA replication timing. Nat. Genet. 2009;41:393–395. doi: 10.1038/ng.363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Wang NJ, Sanborn Z, Arnett KL, Bayston LJ, Liao W, Proby CM, Leigh IM, Collisson EA, Gordon PB, Jakkula L, et al. Loss-of-function mutations in Notch receptors in cutaneous and lung squamous cell carcinoma. Proc. Natl. Acad. Sci. 2011 doi: 10.1073/pnas.1114669108. 201114669. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2
3
4
5
6
7

RESOURCES