Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2023 Feb 8:2023.02.08.527468. [Version 1] doi: 10.1101/2023.02.08.527468

Inverse-folding design of yeast telomerase RNA increases activity in vitro

Kevin J Lebo 2, David C Zappulla 1,2,*
PMCID: PMC9934677  PMID: 36798419

Abstract

Saccharomyces cerevisiae telomerase RNA, TLC1, is an 1157 nt non-coding RNA that functions as both a template for DNA synthesis and a flexible scaffold for telomerase RNP holoenzyme protein subunits. The tractable budding yeast system has provided landmark discoveries about telomere biology in vivo, but yeast telomerase research has been hampered by the fact that the large TLC1 RNA subunit does not support robust telomerase activity in vitro. In contrast, 155–500 nt miniaturized TLC1 alleles comprising the catalytic core domain and lacking the RNA’s long arms do reconstitute robust activity. We hypothesized that full-length TLC1 is prone to misfolding in vitro. To create a full-length yeast telomerase RNA predicted to fold into its biological relevant structure, we took an inverse RNA folding approach, changing 59 nucleotides predicted to increase the energetic favorability of folding into the modeled native structure based on the p-num feature of Mfold software. The sequence changes lowered the predicted ΔG in this “determined-arm” allele, DA-TLC1, by 61 kcal/mol (−19%) compared to wild type. We tested DA-TLC1 for reconstituted activity and found it to be ~5-fold more robust than wild-type TLC1, suggesting that the inverse-folding design indeed improved folding in vitro into a catalytically active conformation. We also tested if DA-TLC1 functions in vivo and found that it complements a tlc1Δ strain, allowing cells to avoid senescence and maintain telomeres of nearly wild-type length. However, all inverse-designed RNAs that we tested had reduced abundance in vivo. In particular, inverse-designing nearly all of the Ku arm caused a profound reduction in telomerase RNA abundance in the cell and very short telomeres. Overall, these results show that inverse design of S. cerevisiae telomerase RNA increases activity in vitro, while reducing abundance in vivo. This study provides a biochemically and biologically tested approach to inverse-design RNAs using Mfold that could be useful for controlling RNA structure in basic research and biomedicine.

INTRODUCTION

Telomeres are the protein-bound repeated segments of DNA at the ends of linear chromosomes that help to define the natural DNA ends. Because canonical DNA polymerases are unable to fully replicate the ends of the chromosome, telomeres shorten with each successive generation1. If left unchecked, this will result in a G2/M cell cycle arrest known as senescence2,3. Many eukaryotes counteract this “end-replication problem” with a ribonucleoprotein complex known as telomerase4. Telomerase is minimally composed of a telomerase reverse transcriptase (TERT) protein and a telomerase RNA component5,6. TERT uses a portion of the RNA as a template for reverse transcription to lengthen the telomeric DNA7. In addition, several accessory protein subunits are required for telomere maintenance in vivo8.

In the budding yeast Saccharomyces cerevisiae, the 1157 nt telomerase RNA is TLC1, which adopts an overall Y-shaped secondary structure with three long arms radiating out from the central catalytic core9,10. This central core secondary structure conforms to the conserved telomerase RNA core consensus model, containing the template and template-boundary element, a pseudoknot with catalytically important base triples, and a core-enclosing helix, all coordinated through an area of required connectivity1113. Yeast TERT, Est2 (Ever-Shorter Telomeres 2), binds to this central core and functions with it to reconstitute core-enzyme activity1316.

In addition to providing the template for telomere synthesis by reverse transcription, TLC1 acts as a flexible scaffold to tether telomerase accessory proteins to the holoenzyme10,1720. The accessory proteins, including the essential Est1 protein and the important Ku heterodimer and Sm7 complex, each bind to the end of a different long RNA arm9,10,2123. Each of these accessory-protein binding sites can be relocated on the RNA with retention of function; i.e., these subunits are flexibly scaffolded10,19,24. The bulk of the long RNA arms between the core and the protein-binding sites can be deleted without eliminating telomerase function in vivo, although RNA domains near the protein-binding sites may still have roles in telomerase function beyond protein tethering14,25. Overall, TLC1 provides a flexible scaffold that tethers holoenzyme protein subunits and RNA domains together without conferring specific orientations for function.

Although telomerase accessory protein subunits are required for telomere-length homeostasis in vivo, the RNA and TERT core enzyme components are minimally sufficient to reconstitute basic enzymatic activity in vitro5,6,26. In S. cerevisiae this minimal telomerase RNP requires the TLC1 RNA and Est2 protein. However, full-length TLC1 only shows robust in vitro function when it has been co-immunopurified from yeast cells14,23,26. When yeast telomerase is in-vitro reconstituted in a rabbit reticulocyte lysate transcription-translation system, wild-type TLC1 is quasi-nonfunctional, displaying only traces of telomerase activity14,27,28.

To allow for in vitro telomerase studies, several mutant variants of TLC1 have been developed. These include a series of miniaturized telomerase RNAs, which involve extensive deletions of the TLC1 arms to make “Mini-T”14 and “Micro-T” variants11,16, or telomerase RNAs with significant structural modifications in the arms, such as triple-stiff arm TLC1 (TSA-T)27. Micro-T RNAs lack the essential protein-binding arms, and would therefore be nonfunctional in vivo11,16. Mini-T and TSA-T are able to support telomere maintenance in cells, and therefore can be used to study the RNA both in vivo and in vitro14,27. However, these TLC1 variants are highly mutated; Mini-T(500) has 657 nucleotides deleted from the three RNA arms, while Triple-Stiff Arm TLC1 has 201 nts deleted from the arms, and several other nucleotide substitutions. Furthermore, both Mini-T and TSA-T have dramatic structural changes from wild-type TLC1. The truncated arms of Mini-T bring the accessory proteins in closer to the central catalytic core, while the stiffened arms in TSA-T hold the accessory proteins away from the core on dsRNA struts. These mutations limit the amount of information that Mini-T and TSA-T can provide about the relationships between structure and function in yeast telomerase RNA. Therefore, we sought to engineer a version of TLC1 with fewer mutations that would be highly functional both in vivo and in vitro.

To design a full-length TLC1 RNA with sequence changes that promote folding into its active conformation (i.e., “inverse RNA folding”2931), we employed the Mfold RNA secondary structure prediction software32,33. Mfold can model both the secondary structure and calculate the predicted ΔG of folded conformations. In addition, a feature of Mfold provides information about how “well-determined” the RNA structure is, based on a P-num heuristic descriptor32,34,35. P-num describes the propensity of a particular nucleotide to make alternative base pairings in iterations of Mfold folding simulations35. A high P-num value indicates that a nucleotide is predicted to pair with many other nucleotides, while a low P-num value reflects few energetically favorable pairing interactions with other nucleotides. Mfold uses a ROYGBIV spectrum scale to annotate the P-num values of each nucleotide, with red representing nucleotides with a low P-num value, and purple or black representing nucleotides with a high value. Red nucleotides in Mfold structures have a high probability of folding as drawn by Mfold in the lowest-free-energy structural conformation; domains with many red nucleotides are therefore more “well-determined,” with a relatively high confidence in the secondary structure model.

Here, we present “Determined Arm TLC1” (DA-TLC1), which retains the wild-type TLC1 length, but has engineered RNA arm sequence that promotes these regions to be more determined to fold into their physiologically relevant structure. Compared to wild-type TLC1, the inverse-designed DA-TLC1 allele reconstitutes ~5-fold more enzymatic activity in vitro when paired with the Est2 protein, although not as robust as Mini-T or TSA-T RNAs. When expressed in vivo, DA-TLC1 provides basic functionality, allowing cells to avoid senescence and maintain shorter telomeres, with the alleles exhibiting reduced RNA abundance. Overall, DA-TLC1 represents a highly functional yeast telomerase RNA that will be useful for future studies of telomerase mechanism both in vivo and in vitro, and demonstrates that long noncoding RNAs with increased in vitro function can be inverse-designed using Mfold.

RESULTS

Inverse design of a TLC1 RNA that folds more readily into its active biological structure: Determined-Arm TLC1 (DA-TLC1)

To create a full-size TLC1 RNA that would fold into a functional conformation in vitro, we made a series of single-nucleotide substitutions in the RNA to optimize lowest-free energy folding of paired regions. The current model of TLC1 secondary structure along with Mfold software predictions were used to guide the inverse design of mutations that would create the most “well-determined” arms35. It is important to note that the Mfold secondary structure prediction for wild-type TLC1 (Fig. 1A) varies from the phylogenetic models9,10 in several locations. First, Mfold does not predict pseudoknots (tertiary structure), although it predicts the formation of the two hairpins in the central core (later shown to be folding intermediates36). Second, some portions of the arms are predicted to fold differently; in particular, the region of the Ku arm distal to the core is very different between the phylogenetically derived9,10 and Mfold model.

Figure 1. Determined-arm DA-TLC1 RNA: inverse-designed to fold more stably than wild-type TLC1 into the biologically relevant secondary structure.

Figure 1.

Shown are the lowest-free-energy Mfold P-num computational secondary structure predictions for wild-type and Determined-Arm TLC1 (DA-TLC1). (A) Mfold secondary structure model for wild-type TLC1. Nucleotides are colored according to the P-num feature of the Mfold web server, which conveys the “determinedness” of low-free energy RNA folding predictions, with the most well-determined nucleotides on the red end of the ROYGBIV spectrum35. The initial ΔG of folding of wild-type TLC1 was predicted by Mfold to be −321 kcal/mol. (B) Lowest-free energy Mfold secondary model for DA-TLC1, with nucleotides colored in the P-num format as in A. The determined Ku arm was designed to fold into the phylogenetically derived structure for the wild-type arm10. The initial ΔG of folding was predicted to be −382 kcal/mol.

We performed ~36 rounds of manually testing mutations and the effects of them on Mfold P-num folding predictions to achieve an inverse-designed full-length telomerase RNA expected to fold more readily into its active secondary structure. “Determined Arm TLC1” (DA-TLC1) RNA has 59 substituted nucleotides across the 1157-nt RNA (Fig. 1B; Supp. Fig. 13). There are 11 nucleotide substitutions in the Ku arm, 33 substitutions in the Est1 arm, and 15 in the terminal arm (Supp. Figs. 1, 3). All the mutations were made within in core-proximal regions of the arms, where no subunits have been shown to bind. These areas of TLC1 have been previously deleted in Mini-T14 and stiffened in TSA-T27 without abolishing telomerase holoenzyme function.

The lowest-free-energy Mfold secondary structure model for DA-TLC1 (Fig. 1B) closely matches the secondary structure of the phylogenetically-derived structure for wild-type TLC1, except in two major ways9,10. First, the central catalytic core is predicted to fold differently from either the phylogenetically supported model10 or Mfold of wild-type TLC1 (n.b., Mfold does not predict pseudoknots, and there is a pseudoknot in this region). Rather than forming the two-hairpin structure seen in the lowest-free-energy Mfold model of wild-type TLC1 (Fig. 1A), the core of DA-TLC1 has only the 3′ hairpin, and the template is partially paired with nucleotides from the opposite side of the core. This is the same secondary structure predicted in the Mfold model for TSA-T27, and could represent a biologically relevant folding conformation. Second, the distal region of the Ku arm in DA-TLC1 is predicted to fold similarly to the Mfold model for wild-type TLC1, rather than either of the phylogenetically predicted structures9,10 (nts 176–409, Fig. 1B). Because the two published phylogenetically supported structures differed in this region, we refrained from inverse-designing DA-TLC1 to conform to either one, rather leaving nucleotides 176–410 the same as wild type.

The Mfold model for DA-TLC1 also shows that each of its arms are energetically more “well determined” than those of wild-type TLC1 (Fig. 1). More nucleotides in each of the three arms of DA-TLC1 are colored red on the P-num scale than in wild-type TLC1, indicating a higher degree of confidence in the indicated folded conformation, based on the program’s free-energy folding simulations (Fig. 1). The arms of DA-TLC1 are therefore energetically more likely to fold into the predicted state than those in wild-type TLC1. Accordingly, DA-TLC1 is predicted to have 61 kcal/mol (19%) lower initial free energy of folding than TLC1 (−382 kcal/mol compared to −321 kcal/mol for wild type).

DA-TLC1 retains function in vivo

We first tested if DA-TLC1 functions in vivo. DA-TLC1 RNA was expressed from a centromere-containing (CEN) plasmid, with the gene’s endogenous promoter and terminator, in a tlc1Δ rad52Δ strain. Deletion of rad52Δ prevents alternative recombination-based survivor pathways for telomere lengthening, and ensures that all cellular telomere lengthening is telomerase-dependent3739. Cells were then restreaked 10 times, representing approximately 250 generations of growth.

DA-TLC1-expressing cells survived without senescing through the course of the ~20 days of cell passaging (Fig. 2A). In contrast, cells lacking a functional telomerase RNA senesced by 125 generations of growth (i.e., within the first ~10 days). This demonstrates that the essential RNA domains in DA-TLC1, including the catalytic core and the Est1-binding arm, fold into functional structures in vivo.

Figure 2. DA-TLC1 RNA can function in place of wild-type TLC1 in vivo, maintaining slightly shorter telomeres and exhibiting reduced abundance.

Figure 2.

(A) Like wild-type TLC1, DA-TLC1 allows yeast cells to grow perpetually and avoid senescing. Each telomerase RNA allele was harbored on a centromere-containing plasmid in a tlc1Δ rad52Δ yeast strain, and restreaked for over 250 generations. Growth shown at 250 generations (10 restreaks on plates). The tlc1Δ strains senesce by ~125 generations. (B) Telomeres in DA-TLC1 cells are ~23 bp shorter than wild-type cells. Shown is a Southern blot of XhoI-digested genomic DNA probed with telomeric repeat sequences. Change in telomere length relative to wild-type TLC1 is indicated for two isolates, ± S.D. (C) The abundance of DA-TLC1 RNA is reduced compared to wild-type. Two independent isolates at 250 generations are shown. Northern blot showing reduced DA-TLC1 RNA levels compared to wild-type. (D) Dot blot of total cellular RNA probed for the shared 3′ end of TLC1 (upper panel), which is common to all RNA variants, and normalized to U1 snRNA control spots (lower panel). Normalized telomerase RNA abundance relative to TLC1 from four isolates is indicated, ± standard error. Dot blots are required to study TSA-T since this RNA allele does not denature in urea-PAGE gels used for northern blotting (as in C) due to its long dsRNA arms.

To test quantitatively how well DA-TLC1 telomerase functions in vivo, we examined average telomere length and it to be similar to wild-type cells. Telomeres were analyzed by Southern blotting genomic DNA from cells expressing DA-TLC1 for 250 generations. DA-TLC1 telomeres were slightly shorter than wild type, while telomeres maintained by TSA-T were moderately shortened, and those maintained by Mini-T were very short (Fig. 2B). Thus, DA-TLC1 is able to maintain nearly wild-type telomeres in vivo, and better than the in vitro-functional mutants TSA-T or Mini-T.

To measure DA-TLC1 RNA abundance compared to wild-type TLC1, we isolated total cellular RNA and analyzed by Northern blot, probing for the Mini-T sequence that is present in both TLC1 and DA-TLC1 transcripts. DA-TLC1 abundance in vivo was lower than wild-type (Fig. 2C). However, DA-TLC1 abundance was somewhat higher than that of Mini-T RNA and similar to that of TSA-T27 (Fig. 2D). Taken together with the fact that DA-TLC1 cells have nearly wild-type telomere length, DA-TLC1 maintains telomeres effectively, especially considering its lower concentration in vivo.

DA-TLC1 has increased activity in vitro compared to wild-type TLC1

The wild-type 1157 nt TLC1 RNA is barely able to support detectable telomerase enzyme function in an in vitro reconstituted telomerase activity assay14,27. We hypothesize that this is due to misfolding of the central core and that the three long TLC1 arms interfere with the folding landscape of the catalytic core region. To test whether the more well-determined secondary structure in the arms of DA-TLC1 promotes telomerase activity in vitro, we performed RNP enzyme activity assays. DA-TLC1 was transcribed by T7 RNA polymerase alongside ProA-tagged yeast TERT, Est2, in a rabbit reticulocyte in vitro transcription and translation system14. Immunopurified telomerase RNP was incubated with a telomeric DNA primer and [α-32P]-dGTP, and products were separated by urea-polyacrylamide gel electrophoresis. DA-TLC1 successfully reconstituted telomerase function in vitro (Fig. 3). The in vitro activity was reproducibly approximately 5-fold greater than wild-type TLC1 yet still substantially lower than Mini-T or TSA-T, each of which support very robust reconstituted activity. These data provide evidence that the well-determined inverse-designed arms of DA-TLC1 improve telomerase catalytic core function in vitro.

Figure 3. DA-TLC1 telomerase activity is increased compared to wild-type TLC1.

Figure 3.

The indicated RNAs were synthesized in a rabbit reticulocyte lysate transcript-translation system along with Est42 (TERT), immunopurified and assayed for activity on a telomeric primer in the presence of [alpha-32P]-dGTP. Activity quantification is normalized to internal loading control and is the average ± standard deviation relative to Mini-T activity.

The secondary structure of the Ku arm affects telomerase function in vivo and in vitro

The Mfold secondary structure model for the Ku-binding arm of DA-TLC1 is similar to the Mfold-predicted folding of the wild-type TLC1 Ku arm (Fig. 4A; Supp. Fig. 3A). However, this conformation differs from the models for the Ku-arm secondary structure that are constrained by covarying base pairs — i.e., phylogenetic support of folding that may not be, or be predicted to be, the lowest-free-energy structure9,10. Thus, we also inverse-designed the Ku arm with alternative-nucleotide sequence to Mfold into a phylogenetically supported structure10 (Fig. 4A, bottom). This “Determined Phylogenetic-structure Ku arm” (DPhyK) has an additional 51 point mutations compared to the DA-TLC1 Ku arm, extending distally out towards the Ku-binding site (Supp. Figs. 1E, 1F, 2E, 2F, and 3A). We then compared a series of determined-arm variants for function in vivo and in vitro: DA-TLC1, DK-TLC1 (with the determined Ku arm from DA-TLC1 and wild-type Est1 and terminal arms), DET-TLC1 (with a wild-type Ku arm and the determined Est1 and terminal arms from DA-TLC1), DPhyK-DA-TLC1 (with the inverse-designed phylogenetically supported Ku arm conformation along with the determined Est1 and terminal arms found in DA-TLC1), and DPhyK-TLC1 (with the inverse-designed phylogenetically supported Ku arm conformation and the wild-type Est1 and terminal arms) (Fig. 4 and Supp. Figs. 13).

Figure 4. Individual inverse-designed arms each contribute to increased DA-TLC1 activity in vitro and decreased RNA abundance in vivo.

Figure 4.

(A) Mfold P-num models for the Ku arm of wild-type, DK-TLC1, and DPhyK-TLC1 alleles. (B) DA-TLC1 supports telomerase function in an in vitro reconstituted telomerase assay. As in Figure 3, TLC1 variants were co-expressed with ProA-Est2 in an in vitro transcription and translation system, co-immunopurified, and reacted with a telomeric DNA primer. Averages ± standard deviation of 2–4 activity-assay replicates of each enzyme preparation are shown. (C) Cells expressing Determined-Arm TLC1 variants grow well and do not senesce. Telomerase RNA alleles were expressed from CEN plasmids in a tlc1Δ, rad52Δ strain. Growth at 250 generations is shown. (D) Telomere Southern blot showing that DA-, DK-, and DET-TLC1 alleles support telomeres near wild-type length, whereas DPhyK-TLC1 and -DA-TLC1 exhibit very short telomeres. (E) Northern blot showing that whereas most inverse-designed TLC1 alleles have modestly reduced RNA abundance, DPhyK alleles have very low levels.

We first tested the Determined-Arm TLC1 RNA variants in vitro. Each of the various Ku-arm variants was able to reconstitute telomerase activity at levels near or slightly higher than DA-TLC1, although still lower than Mini-T (Fig. 4B). These data suggest that driving the folding of the RNA arms by using Mfold to guide inverse-folding design supports increased telomerase core enzymatic activity. This is likely due to indirectly stabilizing the structure of the RNA core by limiting mispairing between nucleotides of the core and the Ku arm. We see this trend when we used a wild-type Ku arm with determined Est1 and terminal arms (Fig. 4B, DET-TLC1), or even when only inverse-designing the Ku arm (Fig. 4B, DK-TLC1 and DPhyK-TLC1).

After demonstrating that the catalytic cores of all of the determined-arm variants were active in vitro, we next tested in vivo function for these alleles, as performed above. All of the determined-arm variants complemented a tlc1Δ strain, without senescing over the course of the ~20 days of cell passaging (Fig. 4C), indicating that the essential regions of TLC1, including the catalytic core and the Est1-binding arm, folded into functional structures in cells.

We next analyzed telomere lengths from cells expressing the determined arm variants. Genomic DNA was isolated after 250 generations of growth and run on a Southern blot (Fig. 4D). Like DA-TLC1, both DK-TLC1 and DET-TLC1 maintain near-wild-type telomeres, suggesting they form highly functional telomerase holoenzymes in vivo (Fig.4D, lanes 2–4). However, RNAs with the secondary-structure driven Ku arm had very short telomeres, ~171–185 bp shorter than wild-type (lanes 5 and 6).

The telomere phenotypes may be the result of differences in RNA abundance. Telomerase RNA levels were assessed by Northern blots performed on total RNA isolated after 250 generations of growth. DK-TLC1 and DET-TLC1 had reduced RNA abundance compared to wild-type TLC1, with 25% and 40% of TLC1 levels respectively, similar to DA-TLC1 (Fig. 4E). This indicates that the mutations made in the determined arms result in lower telomerase RNA levels, despite the ability of the determined-arm TLC1 RNAs to maintain telomeres near wild-type length. On the other hand, constraining the Ku arm to the phylogenetically determined structure in DPhyK-DA-TLC1 and DPhyK-TLC1 results in extremely low RNA abundance, with 2–3% of TLC1 levels (Fig. 4E). This reduction in telomerase RNA is likely responsible for the pronounced decrease in telomere length observed in these two extensively inverse-designed Ku-arm RNAs.

DISCUSSION

The budding yeast Saccharomyces cerevisiae has been a powerful model organism for studying the genetics and molecular mechanisms of telomerase function, leading to landmark discoveries, including the first telomerase subunit gene and genetic discovery of the TERT catalytic protein subunit6,8,40,41. However, biochemical studies have been limited by the misfolding of full-length TLC1 in vitro, as evidenced by the failure to support telomerase enzyme function in an in vitro reconstituted telomerase assay. To bypass this problem, a series of miniaturized mutants of TLC1 (Mini-T alleles) were designed that restore in vitro activity14.

Beyond Mini-T RNAs, yet further-reduced “Micro-T” RNAs have each of the three accessory-protein-binding arms entirely deleted, leaving just a ~170-nt central catalytic core11,16. However, although Micro-T is robustly functional in vitro, it would certainly be unable to function in vivo because it lacks essential accessory-protein-binding sites, such as the binding sites for Est1 and Pop1/6/7 proteins22,25,28. The slightly larger Mini-T RNAs are functional both in vitro and in vivo14. With 657 nts deleted from the RNA arms, Mini-T(500) is highly mutated in both sequence and structure, and the truncated arms draw the accessory-protein binding sites in much closer to the catalytic core. Additionally, Mini-T RNA abundance is ~15% of wild type in vivo, and telomeres maintained at a stable but much shorter length. Finally, TSA-T has the three accessory-protein-binding arms stiffened by the deletion of 201 nts and several nucleotide substitutions 27. TSA-T is functional in vitro, although slightly less than Mini-T and Micro-T. TSA-T is also highly functional in vivo, despite low RNA abundance. On a per-RNA basis, TSA-T actually maintains longer telomeres than wild-type TLC1, despite the fact that the stiffened arms of TSA-T significantly alter the overall architecture of the telomerase RNP. While the many heavily mutated TLC1 RNA mutants have helped make important advances in understanding the molecular mechanisms and organization of the yeast telomerase RNP, a minimally mutated RNA that folds into its native state in vitro would allow for examination of the most biologically accurate RNA conformation and, ultimately, assembly of the biologically relevant RNP complex. Being able to study such a minimally mutated TLC1 biochemically in vitro would advance understanding of both telomerase and the growing field of long noncoding RNAs (lncRNAs).

Paired regions in RNA structure are often replaceable with different base pairs without disturbing structure or function of the RNA or RNP complex. In fact, this is a central tenet of determining RNA structure phylogenetically: nucleotides in a sequence alignment that are paired in the RNA transcript’s folded state may co-vary during evolution, allowing a researcher to infer that the nucleotides are paired. Thus, considering the extensive amount of natural covariation in RNA structures during evolution, it should not be expected that manually changing the identity of base pairs would necessarily disturb the structure or function at most positions of many RNAs. Furthermore, many unpaired nucleotides in loops and junctions often tolerate sequence substitutions; often unpaired residues serve as flexible linkers between paired elements. These paradigms are particularly likely to hold true for telomerase RNA, given its rapid evolution in sequence and its demonstrated function in yeast as a flexible scaffold for holoenzyme subunits10,11,19,20.

To design a minimally mutated wild-type-length TLC1 RNA that folds into its biologically relevant conformation and functions both in vivo and in vitro, we employed Mfold RNA secondary structure prediction software32. Using Mfold’s P-num heuristic descriptor function32,35, which outputs colored nucleotides based on confidence in lowest-free-energy secondary structure modeling (i.e., “determinedness”), we introduced sequence substitutions in TLC1 to design a telomerase RNA with well-determined accessory-protein-binding arms. This was an iterative, manual process of introducing sequence changes and then testing if they had their intended effect on promoting the RNA fold into its physiologically relevant conformation based on Mfold P-num folding predictions. This inverse-folding design approach resulted in “Determined-Arm TLC1” (DA-TLC1), which is wild-type length (1157 nts) with 59 changed nucleotides (Fig. 1 and Supp. Figs. 13). The Mfold secondary structure model for DA-TLC1 is more similar than the wild-type TLC1 Mfold to the phylogenetically derived structure models for TLC19,10, particularly at the base of the Terminal and Est1 arms (Fig. 1; Supp. Figs. 2 and 3), and also with a lower ΔG and increased P-num-based structural confidence (Figs. 1 and 4A).

Because DA-TLC1 is predicted to adopt a secondary structure similar to wild-type TLC1, it may prove to be a useful tool for studying the yeast telomerase flexible scaffold. Like wild-type TLC1, DA-TLC1 can prevent senescence and maintain nearly wild-type-length telomeres in vivo (Fig. 2). Unlike TLC1, however, DA-TLC1 is substantially functional in vitro, extending a telomeric DNA primer in a reconstituted telomerase activity assay along with Est2 protein (Fig. 3). The ability to transition between studying DA-TLC1 in vivo and in vitro could allow for further biochemical and biophysical analyses of yeast telomerase, advancing understanding of yeast telomerase RNP’s structure and function.

DA-TLC1 may also be superior to previously designed TLC1 variants with in vitro activity. While both Mini-T and TSA-T have higher in vitro function than DA-TLC1 (Fig. 4), both of these RNAs have substantially shortened telomeres in vivo. Furthermore, with only 9.5% of the nucleotides mutated, DA-TLC1 is much more similar to wild-type TLC1 in sequence; Mini-T(500) has 657 nts deleted from the long RNA arms, while TSA-T has 223 nts deleted/mutated, resulting in long rigid dsRNA helices between the core and accessory proteins’ binding sites. In contrast, DA-TLC1 was designed to retain wild-type-like secondary structure and is predicted to form an overall holoenzyme structure much more similar to that of TLC1. Therefore, despite lower in vitro activity, DA-TLC1 is likely more biologically relevant than previously reported in vitro-functioning TLC1 variants.

Variants of DA-TLC1 with fewer determined arms may also prove useful for future studies. Promoting lower energy folding of the Est1 and terminal arms together, or just the Ku arm alone, was sufficient to maintain telomeres at near wild-type length in vivo, while also rescuing telomerase activity in vitro at a level similar to DA-TLC1 (Fig. 4). Because these RNA variants have fewer nucleotide mutations than DA-TLC1, they could be considered to be closer to wild-type TLC1 in structure and function. Furthermore, the determined-arm variants could be used strategically to provide a highly functional telomerase RNA while avoiding mutating regions of interest. For example, an RNA with only the “determined” Ku arm could facilitate both in vivo and in vitro studies of the essential Est1-binding arm.

While DA-TLC1 may provide a useful tool, there remain some limitations. Importantly, DA-TLC1 has reduced RNA abundance in vivo (Fig. 2B). Since it maintains nearly wild-type-length telomeres despite low RNA levels, DA-TLC1 may actually have higher per-RNA activity than wild-type TLC1 (similar to TSA-T27). However, this may impact the relevance of DA-TLC1 to TLC1 function. Further study will be required to fully understand the reasons for the low RNA abundance, and to assess what effects this has on telomerase holoenzyme function. Altering the mutations used to create the determined arms may help to mitigate the reduced RNA levels while maintaining a highly functional telomerase RNA.

Another possible limitation of DA-TLC1 concerns the folding predictions for the distal region of the Ku-binding arm. The predicted folding for the DA-TLC1 Ku arm is similar to the Mfold prediction for the wild-type Ku arm (Fig. 4A; Supp. Fig. 3A). However, the Ku arm is one of the most poorly determined regions in the Mfold P-num model of wild type TLC1, so it is not clear whether this structure is truly biologically relevant. The challenge of modeling the full extent of the Ku are is also reflected by the fact that two different structure models have been proposed for it based on phylogenetic analysis9,10. We attempted to drive folding of the Ku arm into a secondary structure based the model that used the four most closely related Saccharomyces species in order to reduce the influence from more divergent species10. However, while the resulting “DPhyK” Ku arm allele was able to support greater telomerase function in vitro than TLC1, the RNA had very low RNA abundance and greatly shortened telomeres (Fig. 4). These phenotypes were more severe than those observed when deleting the Ku-binding site in TLC1Δ48 19,42, indicating that the RNA has greater defects than simple loss of Ku heterodimer binding. This may even suggest a previously unknown role for portions of the Ku arm RNA in TLC1 stability. Thus, is unclear why the secondary-structure-driven Ku arm caused poor telomerase RNA function in cells. It is possible that some of the specific point mutations had unexpected deleterious effects on RNA biogenesis or stability, and/or it may reveal incomplete understanding of the biologically relevant secondary structure in this region. It is also possible that RNA folding within the wild-type Ku arm is inherently structurally dynamic, and that limiting the secondary structure diversity within the distal region of the determined Ku arm prevented transitions into important conformations. Overall, our attempts to create a fully structurally determined Ku arm demonstrate the complexity of this region of the telomerase RNA and show that further examination of the structural nature of this arm is required.

Here, we have reported on a newly designed variant of yeast telomerase RNA that is functional both in vivo and in vitro. Because the overall secondary structure of this determined arm TLC1 is more similar to the wild-type RNA than previously described in vitro-functioning TLC1 mutants, DA-TLC1 and its variants may prove to be useful tools for studying the structure-function relationships in the holoenzyme. While further optimization of the specific mutations may create an even more robust DA-TLC1, we have demonstrated that in vitro function of yeast telomerase RNA can be improved by using structural prediction programs to design the RNA arms to fold into more highly determined, lowest-free energy structures.

Overall, this study provides biochemical and biological evidence that a trans-acting, RNP-forming lncRNA with a largely known secondary structure that does not fold readily in vitro into its native conformation can be modified in sequence to promote the biologically relevant secondary structure formation in vitro. This inverse-folding design is achieved here by converting base pairs and unpaired nucleotides to lower the Mfold-predicted free energy of folding, along with the probabilistic folding predictions of the P-num heuristic feature, to promote folding into the biologically relevant active conformation in vitro. This paradigm should prove useful to studying other RNAs and RNPs, provided that the modifications associated with inverse folding design do not disrupt sequence-specific binding to key factors or activity of the RNA.

METHODS

Design of DA-TLC1

DA-TLC1 was engineered by manually introducing a series of nucleotide substitutions in the arms of TLC1 and evaluating these mutant transcripts using Mfold P-num RNA secondary structure prediction software32,35, with two goals: (1) to drive the lowest-free energy folded state into the phylogenetically predicted structure10, and (2) to increase the “determinedness” of the three RNA arms, guided by the P-num output. Over 30 inverse-designed variations of TLC1 were examined bioinformatically with this approach, ultimately culminating in the reported DA-TLC1, with more well-determined arms than wild-type TLC1, and a lower predicted initial ΔG (−382 kcal/mol, compared to −321 kcal/mol in TLC1; see Figure 1). DA-TLC1 has 59 nucleotide substitutions over its 1157 nt length; i.e., 9.5% of its nucleotides mutated (see Figure 1). 15 nts are substituted in the terminal arm, 11 in the Ku arm, and 33 in the Est1 arm (Supp. Figs. 1 and 3). A gene-synthesis fragment for DA-TLC1 was ordered (Genewiz/Azenta) and ligated into a CEN vector harboring upstream and downstream sequences from genomic TLC1.

The DA-TLC1 variants in Figures 1 and 4 (see sequences in Supp. Fig. 1) were cloned using restriction enzyme sites in TLC1. DK-TLC1 was created by ligating the StuI to NcoI fragment of DA-TLC1 into TLC1, adding the determined Ku arm to wild-type TLC1. DET-TLC1 was created by ligating the StuI to NcoI fragment of TLC1 into DA-TLC1, restoring the wild-type Ku arm sequence to DA-TLC1.

Experiments in yeast

All telomerase RNA alleles were expressed from TRP1-marked centromeric (CEN) plasmids. These plasmids were transformed into strain TCy43 (MAT-a ura3–53 lys2–801 ade2–101 trp1-Δ1 his3-Δ200 leu2-Δ1 VR::ADE2-TEL adh4::URA3-TEL tlc1Δ::LEU2 rad52Δ::HIS3 [pTLC1-LYS2-CEN])23. After shuffling out TLC1 (LYS2/CEN) on solid medium containing α-aminoadipate, colonies were restreaked 10 times (~250 generations; approximately 20 days) on synthetic-compete media lacking tryptophan, with each restreak representing ~25 generations of yeast growth.

Nucleic acid blots

Southern blots were performed as previously described14,19,27. Briefly, cell pellets for genomic DNA isolation were prepared from cultures set from serially restreaked plates. Genomic DNA was isolated (Gentra Puregene system), equal amounts were digested with XhoI and electrophoresed through a large 1.1% agarose gel at 70 V for 17 h, and transferred to Hybond-N+ Nylon membrane (GE). The blot was then probed for yeast telomeric sequence and a 1627-bp non-telomeric fragment of chromosome IV. Average Y′ telomere fragment length was quantified using the weighted average mobility (WAM) method described previously19.

Northern blots were performed as previously described (Zappulla et al., 2005). Briefly, total cellular RNA was isolated from yeast cultures using a modified hot-phenol RNA isolation method43. After boiling, ~10μg of total RNA was separated by Urea-PAGE, transferred to a Hybond-N+ Nylon Membrane (GE), UV- crosslinked (Spectrolinker XL-1500 UV Crosslinker, using two runs of the “Optimal Crosslink” setting), and pre-hybridized in Church buffer for 10 min at 55°C. The membrane was then probed for the 3′ end of TLC1 (nts 906–1140; Lebo and Zappulla, 2012). Relative abundances were determined by normalized TLC1 levels to U1 snRNA. Since U1 is far more abundant, 100-fold less U1 probe was used relative to TLC1 probe.

RNA dot blots were performed as previously described27. These blots are necessary to detect TSA-T RNA abundance, since it does not denature in urea-PAGE gels due to its very long dsRNA arms. Total cellular RNA was extracted from late-log or early-stationary phase yeast cultures by a slightly modified version of the “hot phenol” RNA isolation method43. After boiling, 5 μg of RNA was spotted twice onto Hybond-N+ Nylon Membrane (GE). The membrane was cut in half such that the two dots of each sample were separate membrane sections, air-dried, UV-crosslinked (SpectroLinker XL-1500 UV Crosslinker, using two runs of the “Optimal crosslink” setting), and pre-hybridized in Church buffer at 55°C for 10 minutes. One membrane was probed for the 3′ region of TLC1 shared by the analyzed alleles (nucleotides 906 to 1140), while the other was probed for the U1 snRNA44. Telomerase RNA levels were normalized to U1 levels.

Reconstituted telomerase activity assays

Linearized “run-off” dsDNA templates for T7 transcription of telomerase RNAs were made using PCR (with low-error DNA polymerase) of the gene with a T7 promoter included at the 5¢ end of the forward primer. In vitro telomerase activity assays were performed as described previously14. Briefly, linear DNA template encoding each telomerase RNA was mixed with plasmid containing T7-ProA-Est2 in an RRL transcription and translation system. Telomerase RNP was immunopurified with IgG-Sephadex beads and the beads were washed. Bead-bound telomerase was then incubated with a telomeric primer, dNTPs, and [α−32P]-dGTP. Products were electrophoresed through a 10% polyacrylamide/TBE/urea gel, and imaged using phosphor screens and a Typhoon 9410 Variable Mode Imager. As an internal control for product recovery and loading, ~1 nM [g-32P] end-labeled primer was added before the telomerase reaction. Activity levels were normalized to the internal control.

Supplementary Material

Supplement 1
media-1.pdf (1.9MB, pdf)

ACKNOWLEDGEMENTS

This research was supported in part by the National Institute of General Medical Sciences of the National Institutes of Health award numbers R01 GM118757 and R00 GM080400 and by a March of Dimes Basil O’Connor Starter Scholar Award to DCZ.

REFERENCES

  • 1.Watson J.D. Origin of concatemeric T7 DNA. Nat New Biol 239, 197–201 (1972). [DOI] [PubMed] [Google Scholar]
  • 2.Hayflick L. & Moorhead P.S. The serial cultivation of human diploid cell strains. Exp Cell Res 25, 585–621 (1961). [DOI] [PubMed] [Google Scholar]
  • 3.Weinert T.A. & Hartwell L.H. The RAD9 gene controls the cell cycle response to DNA damage in Saccharomyces cerevisiae. Science 241, 317–22 (1988). [DOI] [PubMed] [Google Scholar]
  • 4.Greider C.W. & Blackburn E.H. Identification of a specific telomere terminal transferase activity in Tetrahymena extracts. Cell 43, 405–13 (1985). [DOI] [PubMed] [Google Scholar]
  • 5.Weinrich S.L. et al. Reconstitution of human telomerase with the template RNA component hTR and the catalytic protein subunit hTRT. Nat Genet 17, 498–502 (1997). [DOI] [PubMed] [Google Scholar]
  • 6.Lingner J. et al. Reverse transcriptase motifs in the catalytic subunit of telomerase. Science 276, 561–7 (1997). [DOI] [PubMed] [Google Scholar]
  • 7.Greider C.W. & Blackburn E.H. A telomeric sequence in the RNA of Tetrahymena telomerase required for telomere repeat synthesis. Nature 337, 331–7 (1989). [DOI] [PubMed] [Google Scholar]
  • 8.Lendvay T.S., Morris D.K., Sah J., Balasubramanian B. & Lundblad V. Senescence mutants of Saccharomyces cerevisiae with a defect in telomere replication identify three additional EST genes. Genetics 144, 1399–412 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dandjinou A.T. et al. A phylogenetically based secondary structure for the yeast telomerase RNA. Curr Biol 14, 1148–58 (2004). [DOI] [PubMed] [Google Scholar]
  • 10.Zappulla D.C. & Cech T.R. Yeast telomerase RNA: a flexible scaffold for protein subunits. Proc Natl Acad Sci U S A 101, 10024–9 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mefford M.A., Rafiq Q. & Zappulla D.C. RNA connectivity requirements between conserved elements in the core of the yeast telomerase RNP. EMBO Journal 32, 2980–2993 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Niederer R.O. & Zappulla D.C. Refined secondary-structure models of the core of yeast and human telomerase RNAs directed by SHAPE. RNA 21, 254–61 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lin J. et al. A universal telomerase RNA core structure includes structured motifs required for binding the telomerase reverse transcriptase protein. Proc Natl Acad Sci U S A 101, 14713–8 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zappulla D.C., Goodrich K. & Cech T.R. A miniature yeast telomerase RNA functions in vivo and reconstitutes activity in vitro. Nat Struct Mol Biol 12, 1072–7 (2005). [DOI] [PubMed] [Google Scholar]
  • 15.Mefford M.A., Hass E.P. & Zappulla D.C. A 4-base pair core-enclosing helix in telomerase RNA is essential and binds to the TERT catalytic protein subunit. bioRxiv, 2020.01.10.902601 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Qiao F. & Cech T.R. Triple-helix structure in telomerase RNA contributes to catalysis. Nat Struct Mol Biol 15, 634–40 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zappulla D.C. & Cech T.R. RNA as a flexible scaffold for proteins: yeast telomerase and beyond. Cold Spring Harb Symp Quant Biol 71, 217–24 (2006). [DOI] [PubMed] [Google Scholar]
  • 18.Zappulla D.C. Yeast Telomerase RNA Flexibly Scaffolds Protein Subunits: Results and Repercussions. Molecules 25 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zappulla D.C. et al. Ku can contribute to telomere lengthening in yeast at multiple positions in the telomerase RNP. RNA 17, 298–311 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hass E.P. & Zappulla D.C. Repositioning the Sm-Binding Site in Saccharomyces cerevisiae Telomerase RNA Reveals RNP Organizational Flexibility and Sm-Directed 3’-End Formation. Noncoding RNA 6(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Peterson S.E. et al. The function of a stem-loop in telomerase RNA is linked to the DNA repair protein Ku. Nat Genet 27, 64–7 (2001). [DOI] [PubMed] [Google Scholar]
  • 22.Seto A.G., Livengood A.J., Tzfati Y., Blackburn E.H. & Cech T.R. A bulged stem tethers Est1p to telomerase RNA in budding yeasts. Genes Dev 16, 2800–2810 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Seto A.G., Zaug A.J., Sobel S.G., Wolin S.L. & Cech T.R. Saccharomyces cerevisiae telomerase is an Sm small nuclear ribonucleoprotein particle. Nature 401, 177–80 (1999). [DOI] [PubMed] [Google Scholar]
  • 24.Hass E.P. & Zappulla D.C. Repositioning the Sm-binding site in S. cerevisiae telomerase RNA reveals RNP organizational flexibility and Sm-directed 3′-end formation. bioRxiv doi: 10.1101/167361 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lebo K.J., Niederer R.O. & Zappulla D.C. A second essential function of the Est1-binding arm of yeast telomerase RNA. RNA 21, 862–876 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cohn M. & Blackburn E.H. Telomerase in yeast. Science 269, 396–400 (1995). [DOI] [PubMed] [Google Scholar]
  • 27.Lebo K.J. & Zappulla D.C. Stiffened yeast telomerase RNA supports RNP function in vitro and in vivo. RNA 18, 1666–78 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lemieux B. et al. Active Yeast Telomerase Shares Subunits with Ribonucleoproteins RNase P and RNase MRP. Cell 165, 1171–81 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Churkin A. et al. Design of RNAs: comparing programs for inverse RNA folding. Brief Bioinform 19, 350–358 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bonnet E., Rzazewski P. & Sikora F. Designing RNA Secondary Structures Is Hard. J Comput Biol 27, 302–316 (2020). [DOI] [PubMed] [Google Scholar]
  • 31.Hofacker I.L. et al. Fast Folding and Comparison of RNA Secondary Structures. Monatshefte fur Chemie 125, 167–188 (1994). [Google Scholar]
  • 32.Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 31, 3406–15 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Mathews D.H., Sabina J., Zuker M. & Turner D.H. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J Mol Biol 288, 911–40 (1999). [DOI] [PubMed] [Google Scholar]
  • 34.Zuker M. & Jacobson A.B. “Well-determined” regions in RNA secondary structure prediction: analysis of small subunit ribosomal RNA. Nucleic Acids Res 23, 2791–8 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zuker M. & Jacobson A.B. Using reliability information to annotate RNA secondary structures. RNA 4, 669–79 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Liu F., Kim Y., Cruickshank C. & Theimer C.A. Thermodynamic characterization of the Saccharomyces cerevisiae telomerase RNA pseudoknot domain in vitro. RNA 18, 973–91 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Le S., Moore J.K., Haber J.E. & Greider C.W. RAD50 and RAD51 define two pathways that collaborate to maintain telomeres in the absence of telomerase. Genetics 152, 143–52 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lundblad V. & Blackburn E.H. An alternative pathway for yeast telomere maintenance rescues est1-senescence. Cell 73, 347–60 (1993). [DOI] [PubMed] [Google Scholar]
  • 39.McEachern M.J. & Blackburn E.H. Cap-prevented recombination between terminal telomeric repeat arrays (telomere CPR) maintains telomeres in Kluyveromyces lactis lacking telomerase. Genes Dev 10, 1822–34 (1996). [DOI] [PubMed] [Google Scholar]
  • 40.Singer M.S. & Gottschling D.E. TLC1: template RNA component of Saccharomyces cerevisiae telomerase. Science 266, 404–9 (1994). [DOI] [PubMed] [Google Scholar]
  • 41.Lundblad V. & Szostak J.W. A mutant with a defect in telomere elongation leads to senescence in yeast. Cell 57, 633–43 (1989). [DOI] [PubMed] [Google Scholar]
  • 42.Stellwagen A.E., Haimberger Z.W., Veatch J.R. & Gottschling D.E. Ku interacts with telomerase RNA to promote telomere addition at native and broken chromosome ends. Genes Dev 17, 2384–95 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kohrer K. & Domdey H. Preparation of high molecular weight RNA. Methods Enzymol 194, 398–405 (1991). [DOI] [PubMed] [Google Scholar]
  • 44.Friedman K.L. & Cech T.R. Essential functions of amino-terminal domains in the yeast telomerase catalytic subunit revealed by selection for viable mutants. Genes Dev 13, 2863–74 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (1.9MB, pdf)

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES