Abstract
Non-coding mutations in the TERT promoter (TERTp), typically at one of two bases −124 and −146 bp upstream of the start codon, are among the most prevalent driver mutations in human cancer. Several additional recurrent TERTp mutations have been reported but their functions and origins remain largely unexplained. Here, we show that atypical TERTp mutations arise secondary to canonical TERTp mutations in a two-step process. Canonical TERTp mutations create de novo binding sites for ETS family transcription factors that induce favourable conditions for DNA damage formation by UV light, thus creating a hotspot effect but only after a first mutational hit. In agreement, atypical TERTp mutations co-occur with canonical driver mutations in large cancer cohorts and arise subclonally specifically on the TERTp driver mutant chromosome homolog of melanoma cells treated with UV light in vitro. Our study gives an in-depth view of TERTp mutations in cancer and provides a mechanistic explanation for atypical TERTp mutations.
Subject terms: Cancer genomics, Computational biology and bioinformatics, Genomic analysis, DNA
Mutations at two key positions in the TERT promoter are frequent drivers of cancer, but the role of rare, atypical mutations in this region remains unclear. Here, the authors demonstrate that atypical TERT promoter mutations develop after canonical ones due to locally elevated UV-induced DNA damage.
Introduction
Reactivation of telomerase is a key mechanism that allows cancer cells to divide continuously without undergoing senescence or cell death1. In 2013, two studies demonstrated that point mutations in the Telomerase reverse transcriptase (TERT) promoter (TERTp) can activate TERT transcription, thereby increasing levels of the gene product, the catalytic subunit of telomerase2,3. TERTp mutations are now known to be among the most frequent somatic driver events in human cancer and have been reported in > 50 cancer types4–7.
Recurrent TERTp mutations normally occur as mutually exclusive C > T single nucleotide variants (SNVs) at one of two specific positions, −124 and −146 bp upstream of the ATG start codon (also known as C228T and C250T, respectively). These mutations both create a de novo activating binding site for E26 transformation specific (ETS) family transcription factors (TFs), and both result in formation of an identical sequence, 5’-CCCCTTCCGG-3’ (ETS motif in bold). Mechanistically, the ETS protein GABPA has been proposed to bind the de novo sites as a heterotetramer together with GABPB while interacting with a native (pre-existing) ETS site in the TERTp (known as ETS-200), positioned such that the helical phase is preserved8,9. The −124 and −146 bp regions are thus “proto-ETS sites” primed to be mutationally converted into activating functional sites that recruit the GABP/ETS complex, in both cases through interaction with a native ETS site.
While recurrent TERTp mutations are found exclusively at the −124 and −146 bp positions in most cancer types, increasingly larger whole genome and targeted panel sequencing studies have also uncovered several atypical recurrent TERTp mutations, often in melanoma or other skin cancers (Supplementary Table 1). The most frequent ones include non-ETS-forming C > T SNVs at −126 and −149 bp, close to the canonical driver mutations, and at positions flanking the native ETS site2,10–16. Additionally, a rare recurrent CC > TT dinucleotide variant (DNV) at −139/−138 bp introduces a de novo ETS site shown to activate TERT transcription in reporter assays17. Despite repeated reports of recurrent mutations at several additional positions, the role of such non-canonical TERTp mutations and the reason for their preferential emergence in skin cancers has remained elusive.
Studies dating back to the 1980’s, as well as recent investigations using sequencing-based methods, have shown that many TFs have the ability, through their impact on DNA geometry and flexibility, to modulate DNA damage formation by UV light18–22. Of note, ETS factors have been shown to induce particularly favourable conditions for formation of cyclobutane pyrimidine dimers (CPDs), the principal UV DNA damage lesions, leading to strong C > T mutation hotspots at numerous ETS binding sites (TTCCK motifs, where K is G/T) in skin cancers23–27. These mutations form primarily at the two bases preceding the core motif or, to a lesser extent, at the first central cytosine24,27. Mutations have been shown to form at highly elevated frequency at such sites following UV irradiation in vitro, further supporting that ETS hotspot mutations are generally passengers rather than due to positive selection23,25. Structural analysis of ETS-bound DNA demonstrated DNA conformational changes that facilitate CPD formation24.
Here, we use available TERTp targeted sequencing data, melanoma whole genome sequencing data, and in vitro experiments to explain the aetiology of non-canonical TERTp mutations. Our analyses support that these typically arise in a two-step process, wherein a canonical TERTp driver mutation at a proto-ETS site, allowing ETS to bind, represents a first mutational hit. This sets the stage for a second hit in the form of a passenger UV hotspot mutation at either the newly formed ETS site or the native ETS site, owing to the ability of ETS factors to stimulate UV damage formation where they bind. Our study gives an overview of TERTp mutations in human cancers and clarifies the origin of most atypical recurrent TERTp mutations.
Results
Most atypical TERT promoter mutations are restricted to skin cancers
To obtain an overview, we probed 19,755 independent tumours from a broad range of TERTp-mutated ( ≥ 1% of samples) human cancers with the help of targeted sequencing data in GENIE28 (Supplementary Table 2). Recurrent TERTp mutations were observed predominantly at the −124 and −146 bp positions as expected, and were most prevalent in cancers arising from cells with low rates of self-renewal such as gliomas (reaching up to 96% in anaplastic oligodendroglioma) as well as in skin cancers (77% of skin cutaneous melanomas, SKCM), as previously reported (Fig. 1)29. TERTp mutation frequency varied considerably, with cancer types such as lung and colon adenocarcinoma being at the other end of the spectrum (1.1% and 1.8%, respectively), also in agreement with prior findings30. While the −124 bp mutation dominated over −146 bp in most TERTp mutated cancer types, skin cancers exhibited the two mutations at almost equal frequency (Fig. 1).
Additional recurrent TERTp mutations (present in ≥ 10/19,755 samples) were essentially restricted to skin cancers exhibiting a high fraction of UV light-derived mutations (single and double base mutational signatures SBS7 and DBS1, respectively). These included the ETS-forming DNV at −139/−138 bp, the SNVs at −149 and −126 bp, and at −101 and −100 bp overlapping the native ETS site (Fig. 1). The recurrent mutations were almost exclusively (99.1%) C > T SNVs or CC > TT DNVs in dipyrimidine sequence contexts, compatible with UV mutagenesis. Together, these results implicate UV-related mutational processes as a source of atypical TERTp mutations.
Atypical TERTp events arise at UV damage hotspot positions within the ETS motif
Given the presence of proto-ETS sites (convertible to functional sites) as well as a native (preexisting) ETS site in the TERTp, we hypothesized that elevated UV damage and increased mutation rate at ETS binding sites may play a role in the aetiology of atypical TERTp mutations24–27. To test this, we defined a subcohort, “High TERTp, UV” (1,569 samples), comprising of cancer types having a high average TERTp mutation frequency and high UV mutation burden, while still exhibiting considerable inter-sample variability in these variables (Fig. 1). These were contrasted against a “High TERTp, no UV” subcohort (4,065 samples), with similar levels of TERTp mutations but lacking UV exposure (Fig. 1).
The recurrent TERTp mutations were limited to a small, 56 bp, upstream region, and nearly all atypical mutations were found in the UV-exposed set as expected (Fig. 2a, Supplementary Table 3). Of these, the most common were two ETS-forming CC > TT DNVs: −139/−138 bp, previously suggested to be a driver event2,17, and −125/−124 bp, a driver at the canonical −124 bp (C228T) site2,3 (Fig. 2a). Presumably, strongly elevated CC > TT DNV frequency in UV-exposed cells provides a substrate for positive selection acting on DNVs at these sites in skin cancers (Fig. 1, Supplementary Table 4)31.
Of the remaining, non-ETS-forming, atypical mutations, the most frequent were at the native ETS site, at cytosines upstream of the ETS binding motif known to exhibit a strong UV damage hotspot effect when occupied7,23–25,27 (Fig. 2a, indicated in the sequence). Specifically, SNVs or DNVs at −101/−100 bp (CCTTCCG; mutations underscored, ETS motif bolded) or to a lesser extent at −97 bp (CCTTCCT), a position in the motif known to exhibit a weaker UV hotspot effect25, were detected in 44/1,569 samples (2.8%) in the UV-exposed subcohort but were absent in the non-exposed set. Similarly, atypical mutations at −149 bp and, more rarely, at −148 bp were found in 22/1,569 samples (1.4%) in the UV-exposed set while being absent in non-exposed samples (Fig. 2a). These positions are just upstream of the −146 bp (C250T) canonical proto-ETS driver site compatible with a CPD hotspot effect at this locus. In the same way, variants at −126 bp, present in 14/1,569 samples (0.9%), arise at a predicted hotspot position upstream of the −124 bp (C228T) canonical proto-ETS site (Fig. 2a). The observed patterns of mutagenesis, as well as the association with UV-exposure, thus suggest that the majority of atypical SNVs may be passengers arising due to elevated UV damage at TERTp ETS sites.
Mutation co-occurrence implies a two-step progression
The canonical ETS-forming driver mutations at −146 and −124 bp activate TERT by recruitment of a larger ETS dimer or multimer complex that interacts both with the de novo-formed and the native ETS site (proposedly GABPA/GABPB or ETS1/p52)8,9,32 (illustrated in Fig. 2a). As ETS sites should only exhibit elevated UV mutagenesis when occupied, we next subdivided the UV-associated subcohort by presence or absence of a primary TERTp driver mutation (Fig. 2b).
In the UV-exposed set, all 22 samples with mutations at −149 or −148 bp also carried a corresponding (−146 bp) primary driver event (P = 4.5 × 10−12; Fig. 2b, Supplementary Fig. 1). Of the 14 mutations at −126 bp, 10 occurred in samples having the corresponding (−124 bp) driver event (P = 9.4 × 10−4), while the remaining 4 co-occurred with −146 bp events (Fig. 2b, Supplementary Fig. 1). The closely spaced co-occurring mutations were called together as oligonucleotide variants (ONVs), thus linking them to the same chromosome homologue, in some but not all cases, potentially explained by heterogenous ONV calling in the GENIE cohort33 (Fig. 2b, Supplementary Table 3). Furthermore, nearly all (41/44; 93.2%) native site mutations occurred in samples also having primary driver events, present in 67.4% of all samples in this subcohort, in agreement with the native site only being occupied in tumours following a TERTp driver mutation (P = 6.0 × 10−5; Fig. 2b, Supplementary Fig. 1). Together, these results support that most atypical TERTp SNVs arise in a two-step sequence, wherein a primary driver event first recruits the ETS/GABP complex thus paving the way for a second hit in the form of a UV-induced hotspot mutation at a now-occupied ETS site.
Mutation distribution supports neutral selection for non-ETS-forming atypical mutations
Our results suggest that non-ETS-forming atypical TERTp mutations are passenger events. To further test this, we binned the TERTp and UV-associated skin cancers in GENIE (“High TERTp, UV” subcohort in Fig. 1) by UV mutation burden (C > T substitutions in dipyrimidine contexts) and investigated the distribution of TERTp mutations across the bins. We found that the frequencies of ETS-forming TERTp mutations, both the canonical drivers and the rare DNVs seen in skin cancers, showed relatively limited correlation with UV mutation burden (r = 0.44, P = 0.45, all combined) and were thus not markedly dictated by the degree of UV mutagenesis, as noted previously for driver mutations in melanoma (Fig. 3a)23,34. In contrast, the atypical TERTp mutations correlated strongly with UV mutation burden across samples (r = 0.99, P = 1.3 × 10−3, all combined), as would be expected for UV-induced mutations under neutral selection (Fig. 3b)23. Even within the group of UV-associated skin cancers, atypical mutations thus occur preferably in highly UV-mutated samples. These results are consistent with ETS-forming TERTp mutations being drivers and with non-ETS-forming atypical events arising passively due to UV exposure thus being passengers.
Allele phasing molecularly links atypical TERTp mutations to primary drivers
The proposed two-step model implies that canonical TERTp mutations and secondary atypical events should co-exist on the same chromosome homologue. While GENIE provided evidence for mutation co-occurrence at the patient level, allele phase information was limited due to lack of access to raw sequencing data and possible heterogenous protocols for calling of complex variants33.
To address this, we screened for TERTp mutations in 335 melanomas subjected to whole genome sequencing within the Genomics England 100 k Genomes Project. Of nine identified cases with UV hotspot mutations at the native ETS site (six at −101 bp and three at −100 bp), eight also had a canonical primary TERTp driver event, in agreement with the results from GENIE (Fig. 4a). Similarly, three cases mutated at the −149/−148 bp secondary UV hotspot site all carried −146 bp primary driver events (Fig. 4a). Interestingly, one of these also carried a −126 bp mutation, again mirroring the GENIE analysis. An additional sample mutated at the −126 bp secondary hotspot site simultaneously carried a −124 bp primary driver event. The atypical events (−101, −100, −149 and −126 bp) were more prevalent in high-UV samples, as expected for UV-induced passengers, while established positively selected driver events were more uniformly distributed, again corroborating our results from GENIE (Fig. 4a).
We next phased the atypical TERTp mutations relative to primary driver events in samples where these co-occurred (12 cases described above), by extracting sequencing reads that covered all the key positions (Supplementary Fig. 2). We found that mutations at the native ETS site were always in cis with a primary mutation thus having arisen on the same chromosome homologue (Fig. 4b). Similarly, atypical mutations at −149, −148 bp and −126 always occurred in cis with primary driver mutations (Fig. 4b). Compared to primary mutations, secondary events had similar or only slightly lower allele frequencies, supporting that they occurred early before expansion of major cell clones (Fig. 4b). A sequential model is thus further supported by direct molecular linkage between primary and atypical secondary events in UV-exposed melanomas.
Elevated UV damage formation at TERTp ETS sites upon GABP binding
To test whether ETS binding stimulates UV damage in the TERTp, we determined CPD formation levels following UV irradiation of radiolabelled wild-type or mutant TERTp dsDNA fragments encompassing the de novo and native sites, with or without recombinant GABP complex. Binding of GABP to the TERTp fragments was first established by gel-shift experiments (Supplementary Fig. 3) as well as DNase I footprinting, which revealed interactions at the −124 bp and −146 bp proto-sites specifically when mutated as expected (Fig. 5a, Supplementary Fig. 4). Although the wild-type TERTp is considered incapable of significant GABP complex recruitment in cells8,9, we found that all sequences interacted with recombinant GABP via the native site in this in vitro system (Fig. 5a).
CPD profiling by T4 endonuclease V digestion revealed CPD damage to be consistently elevated at the predicted hotspot CC positions upstream of the mutated, but not wild-type, −146 and −124 bp proto-ETS sites, only in the presence of GABP as predicted (Fig. 5b, Supplementary Fig. 5). Weak yet consistent GABP-dependent CPD induction upstream of the native site was also seen in all fragments, in agreement with the protein binding data (Fig. 5b). Atypical TERTp SNVs thus generally arise at base positions exhibiting elevated UV damage formation following GABP binding in vitro.
Driver-linked secondary mutations arise following UV irradiation in cultured cells
To recapitulate the mutagenic process for atypical TERTp mutations in cellular conditions, we first developed an amplicon sequencing assay to enable detection of low-frequency, subclonal, mutations in the TERTp. The TERTp is notoriously GC rich (>80%), making PCR-based assays challenging35,36. Here, we were able to combine SiMSen-seq ultrasensitive amplicon sequencing37 with a specific polymerase and a GC enhancer additive to sequence the TERTp at high fidelity. We next treated A375 melanoma cells, which carry a heterozygous -146 bp C > T (C250T) TERTp driver mutation, with daily low doses of UVC light during 6 weeks and compared mutation frequencies in untreated and UV-exposed cells using our SiMSen-seq assay. All reads were then phased in relation to mutation status at the −146 bp position to determine if mutations were arising on the TERTp mutant (C250T) or wild-type chromosome homologue (Fig. 6a).
A total of 15,923 and 12,006 error-corrected consensus reads were obtained for these two conditions, respectively (minimum 3 reads oversampling). In non-exposed cells, only a small number of atypical recurrent TERTp mutations were detected, present in at most four consensus reads, likely attributable to sequencing noise or pre-existing genetic heterogeneity (Fig. 6b). In contrast, subclonal recurrent mutations were found at expected hotspot positions after UV exposure, at the native ETS site (−101 and −100 bp) and at −149 and −148 bp, which are predicted secondary UV hotspot bases at the −146 bp proto-ETS site (Fig. 6c). Notably, these atypical mutations were almost perfectly phased together with the −146 bp driver event: 21/23 native site mutations and 21/21 mutations at −149/−148 bp occurred in cis with the −146 bp mutation. Atypical TERTp mutations frequently seen in UV-exposed cancers can thus be induced by UV exposure in cultured cells, but only when preceded by an ETS-recruiting TERTp driver mutation on the same chromosome homologue.
Discussion
Somatic TERTp mutations are the most frequent driver alterations in many cancers and are known to emerge early during oncogenesis, thus being of fundamental importance in human tumour development with several possible uses as prognostic biomarkers1,38–40. Our analyses place recurring TERTp mutation events into three main categories, the first being the canonical driver C > T SNVs at −146 bp (C250T) and −124 bp (C228T), which activate the TERTp through formation of de novo ETS TF factor binding sites. The second encompasses additional, and much less frequent, driver events in the form of CC > TT DNVs at −125/−124 bp and −139/−138 bp, the former being functionally equivalent to the −124 bp driver while the latter is ETS-forming at an additional, third, proto-ETS site. These two have previously been suggested to be drivers, which is reinforced by our results supporting that positive selection is acting on these DNVs2,3,17. Their low frequency may be explained by a general scarcity of DNVs in most cancers; a model supported the observation that the two events are near-exclusive to skin cancers, where UV-induced CC > TT DNVs are relatively abundant31. A small number of additional cases were found in bladder/urothelial tumours, indeed known to harbour CC > TT substitutions stemming from APOBEC mutagenesis (DBS11)31.
The third category, and main focus of this study, encompasses non-ETS forming recurrent TERTp mutations (mainly at −149, −126, −101 and −100 bp), which all exhibited mutational patterns compatible with neutral selection thus deviating clearly from the ETS-forming events. Through genomic analyses of human tumours and in vitro CPD damage and mutagenesis studies, we show that these atypical events stem from UV-hypersensitive bases upstream of ETS-bound sites (CCTTCCK, underscored), whether pre-existing (native) or de novo-formed by driver mutations, further supporting that they are passengers. ETS-bound sites are known to exhibit weaker UV-hypersensitivity also at the central TC dipyrimidine23. While reflected in our CPD formation data as well as in UV mutagenesis results from cultured cells, mutations at this position were lacking in tumours. This is likely due to negative selection, as mutations at the centre of the core motif are expected to counteract the primary driver event by disrupting ETS binding.
The native ETS site in the TERTp has been shown to act as a partner to de novo-formed sites at −146 and −124 bp, enabling recruitment of a multimeric ETS (GABP) complex to both sites8,32. There is strong support for a dependency on dual ETS sites for GABP complex formation and promoter activation including preferential site pair spacing that preserves the helical phase, both in the TERTp and in general8,9. While not recapitulated in our cell-free in vitro system, where recombinant GABP interacted also with a wild-type TERTp fragment via the native site, this model provides a plausible mechanism explaining a dependency on a primary driver event for native site mutations (at −101, −100 and −97 bp) in tumours. Notably, we also observed this dependency when inducing mutations in cultured melanoma cells by UV light. In further agreement with a dual sites model, in melanoma tumours we find that somatically mutated paired ETS sites in general exhibit preferential spacing reflecting the helical period as well as strong overlaps with GABP ChIP peaks (Supplementary Fig. 6). Given the mutant TERTp has been shown to undergo extensive chromatin remodelling41 it is also plausible that the inactive wild-type allele is generally less accessible to regulatory proteins, which may contribute to an interaction between primary and native site mutations.
The native site is adjacent to a second pre-existing ETS site closer to the TSS, and of these tandem sites, the first (ETS−200) has been suggested to be the preferential GABP interaction partner8,9. It may be noted that this is consistently supported by our results, as no mutation hotspot effect was observed at the second site (ETS−195) neither in tumours nor after in vitro UV mutagenesis.
Dependency on a primary driver event is, in principle, mechanistically straight-forward in the case of atypical mutations at the two main proto-ETS sites. UV hotspot mutations at −149 bp and, rarely, at −148 bp (CCTTCCG) were indeed observed exclusively in conjunction with a −146 bp ETS-forming driver mutation that enables ETS docking and thus subsequent stimulation of UV damage (CPD) formation. The situation is analogous for hotspot mutations at −126 bp, which associated strongly with −124 bp ETS-forming driver mutations. However, in a few cases (4/14 in the GENIE), −126 bp SNVs were found to instead co-exist with −146 bp driver mutations. While mechanistically unclear, one may speculate that multimeric ETS complexes could still bind with sufficient affinity to this proto-ETS site following a −146 bp event, or that activation and remodelling of the −146 bp mutant TERTp allele allows another protein to bind and modulate damage at this coordinate.
The study of non-coding driver mutations has often been confounded by unexplained mutagenic phenomena7, motivating careful deciphering of the origins of recurrent mutations in regulatory DNA. By clarifying the mechanism underlying atypical mutations in the TERTp, this study provides a more complete understanding of somatic alterations in one of the most frequently mutated regions in human cancer.
Methods
Processing of GENIE data for TERTp mutation analysis
Targeted sequencing data was obtained in tab-delimited Mutation Annotation Format (MAF) from GENIE v11 via the Synapse platform (http://synapse.org/genie). The initial dataset encompassed 118,094 tumour samples from 756 cancer subtypes (OncoTree codes) and 19 centres. For this study, samples sequenced with assays that lacked the TERTp region or that captured <300,000 bp were removed. For patients with multiple samples only the first reported sample was retained, thus ensuring that all samples were from unique donors. To enable robust calculation of UV mutation signature and mutation burden, cancer subtypes with <40 samples or <500 mutations in total in all samples were further disregarded. Finally, to focus only on cancer subtypes with TERTp mutations, cancer subtypes having <1% TERTp driver mutation frequency were removed. The final dataset comprised of 59 cancer types and 19,755 tumours (16.7% of GENIE v11) from Yale (assay YALE-OCP-V3) and Memorial Sloan Kettering Cancer Center (assays MSK-IMPACT341, MSK-IMPACT410, MSK-IMPACT468) (Supplementary Table 2).
TERTp sites mutated in at least ten out of the 19,755 patients in the cohort were considered as recurrent in Fig. 1. As some single nucleotide variants (SNVs) were reported as being part of oligonucleotide variants (ONV), ONV were at this initial stage split into their corresponding SNP components, whereas dinucleotide variants (DNV) mutations were considered as a single events.
Single base substitution (SBS) and double base substitution (DBS) mutation signatures were calculated using the “DeconstructSigs” R package42 (version 1.9.0 together with COSMIC SBS signatures version 2 and DBS signatures version 3) using a maximum of 5 signatures and with the “exome2genome” option, with remaining parameters set to their default values. Recurrent driver mutations (SNVs present in ≥ 4 samples and DNVs present in more ≥ 2 samples in a given cohort) were removed before the analysis. To enable relevant observations about the role of UV light in the formation of TERTp mutations, cancer subtypes were classified into subcohorts based on the frequency of TERTp driver mutations (−124, −139/−138, −146 bp) in the samples and the fraction of mutations attributed to the UV mutation signatures SBS7 and DBS1. Specifically, the subcohorts were “High TERTp, UV” (≥20% of samples having TERTp driver mutations and ≥ 10% of all mutations being UV-attributed), “High TERTp, no UV” (≥20% TERTp drivers and < 10% UV mutations) and “Low TERTp” (<20% TERTp drivers). Per-sample UV mutation burdens in the “High TERTp, UV” subcohort in Fig. 3 were calculated by considering C > T substitutions in dipyrimidine contexts, following removal of recurrent driver mutations as describe above. P-values for co-occurrence of atypical and driver mutations in patients were calculated using the two-sided Fisher’s exact test without correction for multiple comparisons.
For several samples sequenced using the MSK-IMPACT assays, an error was identified in the GENIE database that incorrectly annotated a subset of −146 bp (C250T) mutations as ONVs together with −150 bp substitutions, confirmed by MSK (Prof. Michael Berger, personal communication). These artifacts were corrected for the data used in our analysis.
Processing of Genomics England data for TERTp mutation analysis
Whole genome sequencing data from Genomics England was accessed through the Genomics England Research Environment. Patients clinically diagnosed with malignant melanoma (345 primary tumours) were initially selected using Genomics England LabKey database. Patients with multiple samples were filtered out, leaving 335 unique primary tumours for further processing. Alignment files (BAM format) and variant call files (VCF) were collected for all samples. Tumour mutation burdens were determined based on the VCF data, and per-sample mutational signature information was extracted using LabKey. Mutations at the key positions of interest (−97, −100, −101, −124, −125, −126, −138, −139, −146, −148 and −149 bp) were initially identified and visualized based on VCF mutations calls. In samples where atypical mutations co-occurred with ETS driver mutations, the events were phased based on the raw alignment data. Briefly, BAM files were uncompressed into SAM format using Samtools43 while only maintaining reads that covered all the positions of interest indicated above. Mutations were called at each such position in each individual read by pinpointing all deviations from the reference sequence in the SAM data. For each sample, the reads were subsequently split by mutation status at the main driver positions, and mutation frequencies at all positions of interest were visualized separately for the two subsets.
CPD profiling and DNase footprinting and EMSA
To create double stranded template, 90 bp oligos containing both the native ETS site and proto-ETS site at −124 and −146 bp in the TERTp (Supplementary Table 5) were labelled with γ-32P. 100 pmol of forward oligo (C-rich strand) was incubated with 5 units of T4 PNK (Cat. No. M0201L, NEB) and 30 uCi of γ-32P-ATP (Cat. No. SRP−301, Hartmann Analytic GmbH) in PNK buffer (70 mM Tris HCl, 10 mM MgCl2, 5 mM DTT pH 7.6) in 50 ul for 60 min at 37 °C and heat inactivated for 20 min at 95 °C and cleaned up with a G25 column (Cat. No. 27532501, Cytiva). 100 pmol of reverse oligo (G-rich strand) was annealed to the forward oligo in a total volume of 100 ul in annealing buffer (15 mM Tris, 100 mM NaCl), incubated at 95 °C for 5 min and the heat block switched off and left overnight.
Recombinant GABPA/B heterodimer complex was produced by cloning genes for GABP β1L and GABP α, the former with an N-terminal His-tag, into the pET-Duet-1 vector followed by expression in Rosetta DE3 E. coli cells9. To determine the appropriate concentration of GABP protein complex to use in the UV footprinting experiment, we performed electrophoretic mobility shift assays (EMSA). GABP protein was diluted from 80 μM stock solution (final conc: 10 μM, 5 μM, 2.5 μM, 1 μM, 0.5 μM and 0.1 μM) and incubated with 20 nM labelled oligo in 15 μl reaction buffer (20 mM Tris-Cl (pH 7.5), 5% glycerol, 5.2 mM MgCl2, 50 mM KCl, 1 mM TCEP) for 1 h at room temperature and run on a 4% native PAGE gel in 0.5x TBE at 4 °C and imaged on a Typhoon scanner after 1 h exposure.
To perform CPD footprinting, 20 nM annealed oligo was incubated for 1 h at room temperature with or without 5 μM GABP protein in 15 μl reaction buffer in triplicate. 3 μl of this reaction was run on a 4% native PAGE gel in 0.5x TBE at 4 °C to confirm binding by EMSA and the remaining 12 μl was irradiated with 5000 J/m2 UVC as a drop on parafilm. Irradiated DNA was diluted to 100 μl in H2O and subjected to phenol chloroform extraction followed by ethanol precipitation to remove GABP protein. DNA was resuspended in 15 μl T4 endonuclease V buffer (100 μg/μl BSA, 100 mM NaCl, 1 mM DTT, 1 mM EDTA, 25 mM Na2HPO4 pH 7.2) and cleaved at CPDs by incubating for 1 hr at 37 °C with 1 unit of T4 endonuclease V (Cat. No. M0308S, NEB). The reaction was stopped with the addition of 2x formamide buffer. Samples were loaded in appropriate amounts such that uncut oligo density, as determined using Multi Gauge v3.0 (Fujifilm), was similar across the wells. Samples were heated to 95% before running on an 8% native PAGE sequencing gel pre-heated to 55 °C and run for 1 h at 1500 V in 1x TBE. Cleavage products were visualized using a Typhoon imager with overnight exposure. For DNAse footprinting, 1 U DNAse I (Cat. No. EN0521, Thermo Fisher) was added to 20 nM oligo with or without GABP protein in reaction buffer and incubated for 5 min at RT. The reaction was stopped with the addition of 2x formamide buffer and 5 ul resolved on native PAGE 8% sequencing gels. Gel bands were quantified with ImageJ using the Analyze tool44. Uncropped gel images are included as Source Data.
Ultrasensitive mutation analysis
A375 melanoma cells (a gift from the Jonas Nilsson laboratory, University of Gothenburg), which have a heterozygous TERTp mutation at −146 (C250T), were grown in DMEM + 10% FCS + gentamycin (Cat. No. 11965092, A5670701 and 15710064, respectively, Thermo Scientific). Cells were treated with 36 J/m2 UVC (254 nm) using a CL−1000 UV crosslinker (UVP) in DMEM in 10 cm plates without lids, 5 days a week for 6 weeks. Cells were reseeded at 1:5 density when confluent, and subsequently frozen at −20 °C. DNA was extracted using the DNeasy Blood and Tissue Kit (Cat. No. 69504, Qiagen). To identify subclonal mutations in the TERTp which arose following UV treatment, primers were designed to sequence the region using SiMSen-seq37 (Supplementary Table 6). The high GC content required optimization of polymerases for amplification of TERTp. Barcoding of 100 ng genomic DNA and amplification proceeded as previously described but with Phusion Plus Mastermix and GC enhancer (Cat. No. F631S, Thermo Fisher). Sequencing was performed on an Illumina Novaseq 6000 instrument in 150 bp single end mode.
The fastq files containing the UMI-barcoded reads were preprocessed and subsequently aligned to the human reference genome (hg38) with UMIErrorCorrect45. Only consensus (error-corrected) reads based on at least three raw reads were maintained and indel-containing consensus reads were removed. Mutations that were present in more than one consensus read were considered in the final analysis.
Analysis of somatic mutations at paired ETS sites
Whole genome somatic mutation calls from 183 melanoma patients from the MELA-AU project46 was downloaded from the International Cancer Genome Consortium’s (ICGC) database (http://dcc.icgc.org). Chromosomes X and Y were excluded from the analysis and only unique calls were considered for each donor. Matches to the GABP/ETS motif (NNTTCCK or NNTTCCG) were mapped in 1 kb upstream regions, defined based on NCBI RefSeq genes and using the 5′-most coding transcript for each gene. Pairs of sites on the same strand and with a start-to-start distance between 7 and 100 bp were retained for further analysis, thus avoiding overlapping sites. Site pairs were subdivided based on the number of overlapping mutations (non-mutated, ≥ 1 mutation/site and ≥ 2 mutations/site). Distance distributions for the different sets were plotted, applying a moving-average smoothening with a window size of 3 bp before normalization (division by total counts). The proportions of site pairs that overlapped with strong GABP ChIP-seq peaks (both site starts within the peak) were computed using ChIP-seq data for 8 cell lines (A549, GM12878, HeLa-S3, HepG2, HL-60, K562, MCF-7 and SK-N-SH) from ENCODE47 available via the UCSC browser in the Encode Haib Tfbs track, using the broadPeak files. Site pairs were required to overlap with at least one peak in one cell line and replicate, and only the top quartile of peaks (based on enrichment score, column 7 in the ENCODE files) were considered for each experiment.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
The authors would like to acknowledge the American Association for Cancer Research and its financial and material support in the development of the AACR Project GENIE registry, as well as members of the consortium for their commitment to data sharing. This research was made possible through access to data in the National Genomic Research Library, which is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The National Genomic Research Library holds data provided by patients and collected by the NHS as part of their care and data collected as part of their participation in research. The National Genomic Research Library is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. Jesslyn E. Park and Dr. Gavin Knott are acknowledged for their assistance in optimizing GABP expression and purification. E.L. was supported by grants from the Knut and Alice Wallenberg Foundation, the Swedish Medical Research Council and the Swedish Cancer Society. T.L and J.V.d.E was funded by Kom op tegen Kanker (Stand up to Cancer) and the Flemish Cancer Society. M.F. was funded by the Swedish Cancer Society and the Knut and Alice Wallenberg Foundation.
Author contributions
E.L. and K.E. designed the study. K.E., K.M.S. and E.H. performed experiments. V.K.S., A.B., M.L., L.Ö., T.L., K.E. and E.L. analysed data. A.S., M.F., J.D. and J.V.d.E. acted in a supervisory capacity. K.E. and E.L. wrote the manuscript.
Peer review
Peer review information
Nature Communications thanks Jason Wong and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Funding
Open access funding provided by University of Gothenburg.
Data availability
Sequencing data generated for this study has been deposited in NCBI Sequence Read Archive (SRA) with BioProject accession number PRJNA1062776. Melanoma mutation data from the Australian Melanoma Genome Project via the ICGC data portal (https://docs.icgc-argo.org/docs/data-access/icgc-25k-data). NCBI RefSeqAll gene annotations for hg19, via the UCSC table browser (https://genome.ucsc.edu/cgi-bin/hgTables). ENCODE ChIP-seq data for 8 cell lines (A549, GM12878, HeLa-S3, HepG2, HL-60, K562, MCF-7 and SK-N-SH) from UCSC. GENIE v11 (https://www.synapse.org/#!Synapse:syn26706564). Genomics England (https://www.genomicsengland.co.uk/research/research-environment). Source data are provided with this paper.
Code availability
Code is provided as a Supplementary Data 1.
Competing interests
A.S. is a co-inventor of the SiMSen-Seq technology that is patent protected (U.S. Serial No.:15/552,618). A.S. declares stock ownership in Tulebovaasta, Iscaff Pharma and SiMSen Diagnostics, and is a board member of Tulebovaasta. The other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-024-54158-5.
References
- 1.Bell, R. J. et al. Understanding TERT promoter mutations: a common path to immortality. Mol. Cancer Res14, 315–323 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Horn, S. et al. TERT promoter mutations in familial and sporadic melanoma. Science339, 959–961 (2013). [DOI] [PubMed] [Google Scholar]
- 3.Huang, F. W. et al. Highly recurrent TERT promoter mutations in human melanoma. Science339, 957–959 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Heidenreich, B. & Kumar, R. Altered TERT promoter and other genomic regulatory elements: occurrence and impact. Int J. Cancer141, 867–876 (2017). [DOI] [PubMed] [Google Scholar]
- 5.Heidenreich, B. & Kumar, R. TERT promoter mutations in telomere biology. Mutat. Res Rev. Mutat. Res771, 15–31 (2017). [DOI] [PubMed] [Google Scholar]
- 6.Rachakonda, S., Hoheisel, J. D. & Kumar, R. Occurrence, functionality and abundance of the TERT promoter mutations. Int J. Cancer149, 1852–1862 (2021). [DOI] [PubMed] [Google Scholar]
- 7.Elliott, K. & Larsson, E. Non-coding driver mutations in human cancer. Nat. Rev. Cancer21, 500–509 (2021). [DOI] [PubMed] [Google Scholar]
- 8.Bell, R. J. et al. Cancer. The transcription factor GABP selectively binds and activates the mutant TERT promoter in cancer. Science348, 1036–1039 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Barger, C. J. et al. Conserved features of TERT promoter duplications reveal an activation mechanism that mimics hotspot mutations in cancer. Nat. Commun.13, 5430 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gupta, S. et al. A pan-cancer study of somatic TERT promoter mutations and amplification in 30,773 tumors profiled by clinical genomic sequencing. J. Mol. Diagn.23, 253–263 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med23, 703–713 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Andres-Lencina, J. J. et al. TERT promoter mutation subtypes and survival in stage I and II melanoma patients. Int J. Cancer144, 1027–1036 (2019). [DOI] [PubMed] [Google Scholar]
- 13.Fredriksson, N. J., Ny, L., Nilsson, J. A. & Larsson, E. Systematic analysis of noncoding somatic mutations and gene expression alterations across 14 tumor types. Nat. Genet46, 1258–1263 (2014). [DOI] [PubMed] [Google Scholar]
- 14.Heidenreich, B. et al. Telomerase reverse transcriptase promoter mutations in primary cutaneous melanoma. Nat. Commun.5, 3401 (2014). [DOI] [PubMed] [Google Scholar]
- 15.Shain, A. H. et al. Genomic and transcriptomic analysis reveals incremental disruption of key signaling pathways during melanoma evolution. Cancer Cell34, 45–55 e44 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vinagre, J. et al. Frequency of TERT promoter mutations in human cancers. Nat. Commun.4, 2185 (2013). [DOI] [PubMed] [Google Scholar]
- 17.Huang, D. S. et al. Recurrent TERT promoter mutations identified in a large-scale study of multiple tumour types are associated with increased TERT expression and telomerase activation. Eur. J. Cancer51, 969–976 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Becker, M. M., Wang, Z., Grossmann, G. & Becherer, K. A. Genomic footprinting in mammalian cells with ultraviolet light. Proc. Natl Acad. Sci. USA86, 5315–5319 (1989). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hu, J., Adebali, O., Adar, S. & Sancar, A. Dynamic maps of UV damage formation and repair for the human genome. Proc. Natl Acad. Sci. USA114, 6758–6763 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Pfeifer, G. P., Drouin, R., Riggs, A. D. & Holmquist, G. P. Binding of transcription factors creates hot spots for UV photoproducts in vivo. Mol. Cell Biol.12, 1798–1804 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang, Z. & Becker, M. M. Selective visualization of gene structure with ultraviolet light. Proc. Natl Acad. Sci. USA85, 654–658 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mao, P., Smerdon, M. J., Roberts, S. A. & Wyrick, J. J. Chromosomal landscape of UV damage formation and repair at single-nucleotide resolution. Proc. Natl Acad. Sci. USA113, 9057–9062 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fredriksson, N. J. et al. Recurrent promoter mutations in melanoma are defined by an extended context-specific mutational signature. PLoS Genet13, e1006773 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Mao, P. et al. ETS transcription factors induce a unique UV damage signature that drives recurrent mutagenesis in melanoma. Nat. Commun.9, 2626 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Elliott, K. et al. Elevated pyrimidine dimer formation at distinct genomic bases underlies promoter mutation hotspots in UV-exposed cancers. PLoS Genet14, e1007849 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Elliott, K., Singh, V. K., Bostrom, M. & Larsson, E. Base-resolution UV footprinting by sequencing reveals distinctive damage signatures for DNA-binding proteins. Nat. Commun.14, 2701 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Selvam, K., Sivapragasam, S., Poon, G. M. K. & Wyrick, J. J. Detecting recurrent passenger mutations in melanoma by targeted UV damage sequencing. Nat. Commun.14, 2702 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Consortium, A. P. G. AACR project GENIE: powering precision medicine through an international consortium. Cancer Discov.7, 818–831 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Killela, P. J. et al. TERT promoter mutations occur frequently in gliomas and a subset of tumors derived from cells with low rates of self-renewal. Proc. Natl Acad. Sci. USA110, 6021–6026 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Barthel, F. P. et al. Systematic analysis of telomere length and somatic alterations in 31 cancer types. Nat. Genet49, 349–357 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature578, 94–101 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li, Y. et al. Non-canonical NF-kappaB signalling and ETS1/2 cooperatively drive C250T mutant TERT promoter activation. Nat. Cell Biol.17, 1327–1338 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pugh, T. J. et al. AACR project GENIE: 100,000 cases and beyond. Cancer Discov.12, 2044–2057 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bostrom, M. & Larsson, E. Somatic mutation distribution across tumour cohorts provides a signal for positive selection in cancer. Nat. Commun.13, 7023 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Colebatch, A. J. et al. Optimizing amplification of the GC-Rich TERT promoter region using 7-Deaza-dGTP for droplet digital PCR quantification of TERT promoter mutations. Clin. Chem.64, 745–747 (2018). [DOI] [PubMed] [Google Scholar]
- 36.Lim, K. W. et al. Coexistence of two distinct G-quadruplex conformations in the hTERT promoter. J. Am. Chem. Soc.132, 12331–12342 (2010). [DOI] [PubMed] [Google Scholar]
- 37.Stahlberg, A. et al. Simple multiplexed PCR-based barcoding of DNA for ultrasensitive mutation detection by next-generation sequencing. Nat. Protoc.12, 664–682 (2017). [DOI] [PubMed] [Google Scholar]
- 38.Powter, B. et al. Human TERT promoter mutations as a prognostic biomarker in glioma. J. Cancer Res Clin. Oncol.147, 1007–1017 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Rheinbay, E. et al. Analyses of non-coding somatic drivers in 2,658 cancer whole genomes. Nature578, 102–111 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Shain, A. H. et al. The genetic evolution of melanoma from precursor lesions. N. Engl. J. Med.373, 1926–1936 (2015). [DOI] [PubMed] [Google Scholar]
- 41.Stern, J. L., Theodorescu, D., Vogelstein, B., Papadopoulos, N. & Cech, T. R. Mutation of the TERT promoter, switch to active chromatin, and monoallelic TERT expression in multiple cancers. Genes Dev.29, 2219–2224 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rosenthal, R., McGranahan, N., Herrero, J., Taylor, B. S. & Swanton, C. DeconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol.17, 31 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics25, 2078–2079 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods9, 676–682 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Osterlund, T., Filges, S., Johansson, G. & Stahlberg, A. UMIErrorCorrect and UMIAnalyzer: software for consensus read generation, error correction, and visualization using unique molecular identifiers. Clin. Chem.68, 1425–1435 (2022). [DOI] [PubMed] [Google Scholar]
- 46.Hayward, N. K. et al. Whole-genome landscapes of major melanoma subtypes. Nature545, 175–180 (2017). [DOI] [PubMed] [Google Scholar]
- 47.Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature489, 57–74 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kundra, R. et al. OncoTree: A cancer classification system for precision oncology. JCO Clin. Cancer Inf.5, 221–230 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data generated for this study has been deposited in NCBI Sequence Read Archive (SRA) with BioProject accession number PRJNA1062776. Melanoma mutation data from the Australian Melanoma Genome Project via the ICGC data portal (https://docs.icgc-argo.org/docs/data-access/icgc-25k-data). NCBI RefSeqAll gene annotations for hg19, via the UCSC table browser (https://genome.ucsc.edu/cgi-bin/hgTables). ENCODE ChIP-seq data for 8 cell lines (A549, GM12878, HeLa-S3, HepG2, HL-60, K562, MCF-7 and SK-N-SH) from UCSC. GENIE v11 (https://www.synapse.org/#!Synapse:syn26706564). Genomics England (https://www.genomicsengland.co.uk/research/research-environment). Source data are provided with this paper.
Code is provided as a Supplementary Data 1.