Abstract
Mobile elements comprise close to one half of the mass of the human genome. Only LINE-1 (L1), an autonomous non-Long Terminal Repeat (LTR) retrotransposon, and its non-autonomous partners—such as the retropseudogenes, SVA, and the SINE, Alu—are currently active human retroelements. Experimental evidence shows that Alu retrotransposition depends on L1 ORF2 protein, which has led to the presumption that LINEs and SINEs share the same basic insertional mechanism. Our data demonstrate clear differences in the time required to generate insertions between marked Alu and L1 elements. In our tissue culture system, the process of L1 insertion requires close to 48 hours. In contrast to the RNA pol II-driven L1, we find that pol III transcribed elements (Alu, the rodent SINE B2, and the 7SL, U6 and hY sequences) can generate inserts within 24 hours or less. Our analyses demonstrate that the observed retrotransposition timing does not dictate insertion rate and is independent of the type of reporter cassette utilized. The additional time requirement by L1 cannot be directly attributed to differences in transcription, transcript length, splicing processes, ORF2 protein production, or the ability of functional ORF2p to reach the nucleus. However, the insertion rate of a marked Alu transcript drastically drops when driven by an RNA pol II promoter (CMV) and the retrotransposition timing parallels that of L1. Furthermore, the “pol II Alu transcript” behaves like the processed pseudogenes in our retrotransposition assay, requiring supplementation with L1 ORF1p in addition to ORF2p. We postulate that the observed differences in retrotransposition kinetics of these elements are dictated by the type of RNA polymerase generating the transcript. We present a model that highlights the critical differences of LINE and SINE transcripts that likely define their retrotransposition timing.
Author Summary
SINE retroelement amplification has been extremely successful in the human genome. Although these non-autonomous elements parasitize factors from LINEs, both the human Alu and the cumulative rodent SINEs have generated over one million copies in their respective hosts. Alu-induced mutagenesis is responsible for the majority of the documented instances of human retroelement insertion-induced disease. Our data indicate that SINEs require a shorter period of time to complete insertion than L1s, possibly contributing to the ability of Alu elements to effectively parasitize L1 components. We demonstrate that RNA polymerase changes the timing Alu requires to complete retrotransposition and creates the need for the L1 ORF1protein in addition to ORF2p. We postulate that the way cells manage pol III and pol II (mRNA) transcripts affects the timing of a transcript going through the retrotransposition pathway. We propose a model that highlights some of the critical differences of LINE and SINE transcripts that likely play a crucial role in their retrotransposition process.
Introduction
Mobile elements have constantly assaulted genomes, shaping and molding their structure and organization. In particular, mobile elements have flourished in mammals generating between 40–50% of their genomic sequence [1]–[3]. About one third of the human genome can be attributed directly or indirectly to the activity of the non-LTR retroelements also referred to as LINEs (Long INterspersed Elements). LINE-1 (L1) and its non-autonomous partners Alu, SVA, and retropseudogenes continue to amplify in the human genome. L1 and the SINE (Short INterspersed Element), Alu, are by far the most numerous, adding up to 1.5 million copies [1]. Although Alu mobilization depends on L1 proteins [4], they outnumber L1 inserts by 2 to 1. Similarly, the sum of the total copies of all rodent SINEs outnumber L1 copies about 2 to 1 [2],[3]. Alu and the rodent SINE inserts have been more successful than other non-autonomous retroelements, such as the retropseudogenes [5]. Size and sequence composition differences between SINEs and LINEs may allow the mammalian genome to better tolerate SINE insertions, reviewed in [6]. Negative selection has clearly played a role in reducing L1 copy number through ectopic recombination and elimination of many full length and nearly full length L1 inserts [7]. However, processes other than negative selection must influence the observed differences. The updated reports of diseases caused by de novo inserts (where little, or no, selection has occurred) show that Alu inserts outnumber those of L1 by about 2 to 1 [6],[8].
Tissue culture assay systems indicate that L1 retrotransposition rates are consistently higher than those observed for SINEs [4],[9]. This is possibly a reflection of the strong cis-preference contained by L1 [10],[11], while Alu must compete for L1 proteins in trans. How is it that Alu with a lower retrotransposition rate than L1, contributes more de novo disease cases? It is likely that multiple factors are involved, such as the ability to bind SRP9/14 [12],[13].
Retroelements are mobile elements that amplify through an RNA intermediate in a process known as retrotransposition [14]. There are limited data on the details of the mechanism of LINE retrotransposition, and even less for SINE retrotransposition. The process begins with the generation of RNA (Figure 1A). Active L1 elements express two proteins from a bicistronic mRNA: ORF1p[15] and ORF2p (Figure 1B and C). Both L1 proteins are needed for L1 retrotransposition [16]. In contrast to L1, ORF2p expression is sufficient for SINE retrotransposition [4],[9],[17], while ORF1p may enhance the process [17]. ORF1p possesses nucleic acid chaperone activity [18],[19], an essential property for L1 retrotransposition [19],[20]. ORF2p is a multifunctional protein with endonuclease and reverse transcriptase activities [21],[22]. Both proteins are proposed to interact in cis [10],[11] with the L1 RNA to form a cytoplasmic RNP complex interacting with polyribosomes [20],[23]. SINE RNA is predominantly found in the cytoplasm as an RNP complex [12],[24],[25] (Figure 1C) and uses L1 protein(s) in trans for its mobilization. The endonuclease of the L1 ORF2p generates the first nick within the L1 endonuclease recognition sequence generating single stranded DNA that primes the reverse transcription [22],[26]. Both L1 and Alu are proposed to undergo integration through a target-primed reverse transcription (TPRT) reaction [27].
To generate a new insertion, L1 and SINE elements must return to the nucleus either together or independently (Figure 1D). Reported data suggest that retrotransposition-competent L1 RNPs may transit through the nucleolus [28]. The 3′ poly-A stretch or “A-tail” of LINEs, SINEs and processed pseudogenes is required for the priming of reverse transcription (Figure 1E) [4],[29]. Unlike the post-transcriptionally generated A-tail of pol II RNAs (mRNA), SINE A-tails are included within their sequence and play an important role in SINE retrotransposition [30],[31]. The details of the final integration and ligation of the L1 or Alu inserts into the host DNA remain unclear. Recent reports indicate that cellular factors, such as DNA repair enzymes, may aid in the L1 retrotransposition process [32],[33]. The final inserted sequence is typically flanked by direct repeats (Figure 1G). Non-autonomous retrotransposed inserts, such as Alu, SVA, hYs and retropseudogenes share these hallmarks with L1 inserts, strongly suggesting that these elements use the L1 ORF2p endonuclease generated nick for their integration [34]–[36].
To date, all known SINEs are ancestrally derived from RNA pol III transcribed RNA genes, reviewed in [37]. The vast majority are derived from different tRNA genes and only two (Alu and the rodent B1) originated from the 7SL RNA gene, a component of the signal recognition particle (SRP) [38]. Other examples of pol III transcribed repeats include the four hY genes (hY1, hY3, hY4 and hY5) that likely contributed directly or indirectly to the generation of almost 1000 copies in the human genome by retrotransposition [36],[39]. In contrast to SINEs, an internal RNA pol II promoter drives LINE transcription with the unusual ability to start transcription upstream of its location. Like other pol II RNAs, L1 transcription is regulated by different mechanisms, including promoter methylation [40], transcriptional attenuation due to A-richness [41], premature polyadenylation [42], and the generation of different splice variants [6]. Additional studies suggest that at least some portion of the L1 mRNAs are capped [43] and that the capping enhances L1 translation [44].
Previously, an L1 element tagged with a green fluorescent protein (EGFP) retrotransposition cassette was used to detect L1 retrotransposition “near real time” [45]. The earliest detection of an L1 retrotransposition event was 48 h post-transfection. In this manuscript, we evaluate the timing of retrotransposition (defined as the time required for a retroelement from the initial transcription step to complete an insertion) of tagged Alu and L1. We demonstrate that Alu elements only require about half of the amount of time as L1 to generate an insert. Our data demonstrate that the type of RNA polymerase dictates the retrotransposition timing, but does not determine the retrotransposition rate (defined as the number of inserts a given element can generate, i.e. the “efficiency” of an element). After evaluating several potential time limiting steps, we show that the RNA polymerase type is an important early factor contributing to the divergent retrotransposition kinetics between LINEs and SINEs.
Results
The Use of an HIV Reverse Transcriptase Inhibitor d4t as a Suppressor of L1 and Alu Retrotransposition in Culture
Reverse transcriptase (RT) domains of multiple sources can be grouped into a family of shared sequence homology [NCBI cdd pfam00078.12] [46], including the RT of the human immunodeficiency virus and L1 ORF2 protein. Endogenous RT activity is inhibited by two antiretroviral agents nevirapine and efavirenz [47]. L1 retrotransposition in a culture assay system can be suppressed by the addition of a variety of HIV RT inhibitors [48],[49]. This system utilizes a tagged vector designed to allow expression of the reporter gene only when the retroelement goes through its reverse transcriptase-dependent amplification process (Figure 2A). Therefore, only the newly inserted element will express the reporter gene (e.g. neo).
Using the established L1 and Alu retrotransposition tissue culture assays [4],[16], we evaluated the dose of, 2′,3′-didehydro-3′-deoxy-thymidine (d4t) required to abolish retrotransposition of L1 and L1 ORF2p driven Alu without adversely affecting cell growth and viability. Treatment of transiently transfected HeLa cells showed that both L1 and Alu activities presented a d4t activity inhibitory concentration 50 (IC50) of about 2 µM (Figure S1). For our subsequent experiments we utilized d4t treatments at 50 µM (25 fold the IC50) to inhibit SINE and LINE retrotransposition in tissue culture. We selected this dose for its efficient inhibition of retrotransposition and lack of observed negative effects, determined by colony formation of an unrelated plasmid that expresses a functional neomycin resistance gene and integrates into genomic DNA by random integration rather than by an L1-dependent mechanism (data not shown).
Generation of L1NE-1 and Alu Insertions Have Different Time Requirements
We took advantage of the d4t inhibition to determine L1 and Alu retrotransposition kinetics in cultured cells. By treating cells with d4t at different time points after the transient transfection with the vectors expressing the tagged L1 or Alu plus ORF2p, we specifically inhibited the retrotransposition process at designated time periods (shown in Figure 2B). Any detected L1 or Alu inserts are presumed to have completed the insertion process prior to the addition of the d4t, as inhibition of ORF2p RT activity would prevent the generation of the cDNA. Using this approach, we show that L1 inserts are not detected in cultured cells during the first 24 h post-transfection (Figure 2C). Similar results were previously observed using a green fluorescent protein (EGFP)-tagged L1 element [45],[50]. The earliest detection of L1 inserts occurred at 32 h post-transfection (Figure S2). In contrast, we can easily detect Alu inserts 24 h and sometimes as early as 18 h post-transfection (Figure 2C).
Availability of L1 RNA Is Not a Limiting Factor
Generation of an RNA transcript is an essential first step of the retrotransposition cycle (Figure 1A). Besides serving as a template for protein translation, L1 RNA acts as the insertion template during retrotransposition. Thus, transcriptional limitations or variations can directly impact retrotransposition of L1 elements as well as other retroelements. Previous studies demonstrate that L1 elements generate low amounts of full-length transcripts due to premature polyadenylation [42], transcriptional inefficiency due to A-richness [41], and multiple splicing events [6]. In all these studies, a decrease in the amount of L1 mRNA contributed to reduced retrotransposition and, conversely, the rate increased with higher amounts of full-length L1 RNA [42],[51],[52]. To determine whether L1 RNA transcription and processing contributes to the observed timing difference between L1 and Alu inserts, we performed a time course to evaluate the generation of the spliced RNA product in cells transiently transfected with L1mneo, AluYa5neo TET, and L1neo TET (Figure 3). Because the Alu construct is driven by RNA polymerase III, its tag (neo TET) contains a self splicing intron disrupting the neomycin gene. Therefore, we included an additional L1 construct that contains the exact same self splicing (neo TET) tag present in the Alu vector to control for any potential variations introduced by splicing dynamics. Full-length spliced and unspliced transcripts from Alu and both L1 constructs could be detected as early as 3 hours post-transfection (northern blots shown in Figure S3). The mneo and neo TET tagged L1 constructs exhibited similar kinetics for the spliced transcript (only RNA that will generate G418R colonies when retrotransposed), peaking by 24 h and declining by 72 hours (Figure 3). Splicing efficiency of the RNA produced by different expression vectors was evaluated (Table S1). Equivalent splicing efficiency was observed for the L1 and Alu transcripts sharing the same neomycin cassette (neo TET or mneo). Alu-tag transcripts were only detected in the cytoplasmic fraction at any of the time points evaluated (data not shown), consistent with what has been previously reported for the authentic Alu “untagged” RNA [53]. Despite early L1 mRNA availability, no L1 inserts were observed at the 24 h time point. Spliced Alu transcripts peak around 48 h, declining by 72 h, much like L1 mRNA (Figure 3). However, in contrast to L1, numerous Alu inserts are readily detectable by 24 h. These results demonstrate that the full-length properly spliced L1 RNA is generated in the same time period as the Alu RNA. Thus, it is unlikely that RNA transcription or variation in the type of splicing within the neo cassette account for the observed time difference between the generation of Alu and L1 inserts.
Another difference between the Alu and L1 elements involves the length of the transcript, which could alter the time required by the reverse transcriptase to generate a full-length cDNA. In this assay system full length inserts are not required to generate a G418R colony. In both Alu and L1 assays, inserts are detected with the retrotransposition of the minimal unit of a functional neomycin gene, which is identical in length in both transcripts once the intron is removed. Therefore, the timing differences observed between these two elements should be independent of the transcript length.
L1 ORF2p Activity Can Be Detected within 24 Hours
We next assessed whether the delay reflects the time required for translation of the L1 proteins and the ability to reach the nucleus (Figure 1B–E). ORF2 protein has been notoriously difficult to observe by conventional techniques, such as western blot analysis [28]. As an alternative, the ORF2p activities can be evaluated.
Because Alu elements require ORF2p for retrotransposition, evaluation of Alu retrotransposition serves as an alternate method to detect ORF2p activity. Therefore, we exploited the trans-complementation assay to monitor the ability of L1 to trans-mobilize Alu, using AluYa5neo TET as a reporter construct. We determined the Alu insertion kinetics in cells cotransfected with the AluYa5neo TET plus the L1 no tag vector. Multiple Alu inserts were detected as early as 24 h post-transfection (Figure 4), corroborating the availability of the ORF2p expressed from the L1 vector in the nucleus by 24 h. Equivalent results were observed when using a blasticidin tagged L1 to drive Alu retrotransposition (data not shown). Under our experimental conditions, endogenous L1 present in HeLa cells does not significantly contribute to the generation of the G418R colonies as the Alu vector was unable to generate any inserts without L1 supplementation at 24, 32 and 48 h post-transfection (vector control, Figure 4). A few solitary colonies (2 and 1) were observed at the 42 and 72 h time points. This observation clearly demonstrates that a full-length L1 vector generates enough ORF2p to reach the nucleus within 24 h and to mobilize a tagged Alu element in our assay system. Our observations are in agreement with previously published data demonstrating that cells transiently transfected with L1 exhibit extensive double strand breaks at 24 h post-transfection [33]. The observed DNA breaks are dependent on the endonuclease activity of the L1 ORF2p. Our data strongly suggest that translation and nuclear localization of ORF2p is unlikely to be the main limiting step for the observed differences between the L1 and Alu time requirements.
In addition, pre-transfection of high amounts of ORF2p or any of the L1 factors (proteins and/or RNPs) in trans did not alter L1 retrotransposition timing (Figure S4). This is not surprising considering that L1 RNA exhibits a strong cis-preference for its own translated proteins for retrotransposition [10],[11]. Pre-transfection with ORF2p showed a few more Alu inserts at early time points (data not shown). However, this slight increase was not statistically significant (Student's paired t-test, p = 0.297).
The RNA Polymerase Dictates SINE and LINE Retrotransposition Timing
Transcripts generated from RNA polymerase II and III promoters differ in their capping, 3′ end processing, folding structures, post-transcriptional processing, interaction with translation factors and degradation pathways, reviewed in [54]–[56]. In addition, these two transcriptional complexes can be observed in different spatial locations in the nucleus indicating discrete transcriptional sites [57],[58]. To evaluate the timing of retrotransposition of other pol III-driven genes we generated “tagged” versions of 6 human genes (7SL, U6, hY1, hY3, hY4 and hY5) by cloning the genes with at least 300 bp of their upstream enhancer sequence 5′ of the neo TET cassette (details in materials and methods). Although the “functional” genes are not SINEs per se, we selected these as examples of pol III-driven genes. The human genome contains multiple examples of retrotransposed copies with sequence homology to these genes [36],[59]. Thus, these serve as our best examples of other human pol III-driven constructs. We also included in our analysis the pol III-driven B2 element as a known active rodent SINE [9],[60]. In our d4t-assay system, all tagged pol III-driven elements generated inserts by 24 h post-transfection when supplemented with just L1 ORF2p (Figure 5).
To better understand the RNA polymerase influence on retrotransposition, we also evaluated the time requirement of two pol II-driven (CMV) constructs: ORF1mneo and pol II Alu (Figure 6A). We selected ORF1mneo because it generates a transcript of L1 ORF1, which has previously been used to reflect retropseudogene activity [10]. The ORF1mneo vector can retrotranspose when a source of ORF2p is supplied in trans [10]. The pol II Alu (pCMVYa5mneo) contains an Alu tagged with the “mneo” cassette from the L1-tagged construct [61], which contains pol III terminators (4 Ts) that would generate truncated transcripts if the internal pol III A and B boxes in the Alu sequence are used for transcription. The “normal A-tail” at the end of the Alu sequence and 5′ of the neo cassette (Figure 6A) was not included in order to prevent potential internal priming for TPRT in the cDNA extension step (Figure 1E), which would circumvent inclusion of the neo reporter gene in the retrotransposed copy. Thus, only the Alu body sequence was utilized in the construct. Just like the L1 construct, the A-tail used in the TPRT step is generated from the transcript polyadenylation by the RNA polymerase II from the SV40pA signal at the 3′ end of the neo cassette (Figure 6A). Spliced and unspliced transcripts were detected from both constructs by 24 h (Figure 6B). The tagged ORF1p transcript driven by an ORF2p generated one single insert at 24 hours (Figure 6C), while the total number of colonies generated were 136 and 226 for 48 h and 72 h respectively. It is possible that the endogenous L1 expression in HeLa cells [6] affected the timing. However, our data on Alu retrotransposition indicates that effects from endogenous L1 expression under our experimental conditions are negligible (Figure 4). Most likely, the single G418R colony observed at 24 hours is due to a rare event that escaped d4t inhibition. A quantitative time course evaluation of the spliced RNA product in cells transiently transfected with ORF1mneo and AluYa5mneo further indicates that the availability of spliced product is not limiting retrotransposition timing (Figure 6E).
No pol II-generated Alu inserts were ever observed when supplemented with ORF2p under any conditions tested, representing a rate of less than 1×106 cells/µg of plasmid. However, retrotransposition of the pol II-driven Alu transcript occurred when it was supplemented with both ORF2p and ORF1p expression plasmid (Figure 6D). Under these conditions, G418R colonies were observed at 48 h post-transfection, much like L1 and retropseudogene behavior. No colonies were ever observed at the 24 h time point in 5 independent experiments using triplicates for each time point. Swapping the RNA pol III for an RNA pol II promoter changed the retrotransposition requirements of the tagged Alu to reflect those observed for pseudogenes and LINEs.
The Timing of Retrotransposition Does Not Predict Insertion Rate
Recent data demonstrate that one amino acid substitution in the mouse L1 ORF1 protein dramatically affects retrotransposition rate and the ability to detect new inserts earlier [50]. We evaluated the insertion timing of the most efficient L1 available at the time, the synthetic mouse L1 (L1m syn) previously reported to increase retrotransposition efficiency by more than 200 fold relative to the wildtype L1spa element [52]. Despite having a much higher retrotransposition rate, L1m syn required 48 h to generate inserts even when spliced RNA could be detected as early as 3 hours post-transfection (Figure 7). There were a few (1 to 2) colonies at 24 hours or earlier but these are likely outlier observations as they only represent 0.001 of the total observed G418R colonies. Our data are consistent with the observation that all of the evaluated pol II-driven constructs require 48 h, while all of the pol III-driven constructs generate inserts by 24 h despite their very low retrotransposition rates relative to L1 (Table 1). Because of the large variation in retrotransposition rates, we opted to show the relative number of inserts in the figures for each construct by designating the 48 or 72 hour time point as 100%. While both U6 and Alu tagged transcripts, for example, can generate inserts by 24 hours, their retrotransposition rates (i.e., the actual number of observed inserts) differ dramatically. The same is true for the tagged L1 and ORF1 RNAs.
Table 1. Relative retrotransposition rate of the different tagged constructs in HeLa cells under the same transfection conditions.
Mobile element | Rate G418R colonies (×106/µg at 72 h) mean±SD† | “Relative” rate* | Time requirement for insert (h) |
L1¥ | 7880.0±321.0 | 5000 | ∼48 |
Alu (pol III)$ | 1053.3±160.8 | 500 | ∼24 |
7SL$ | 66.7±12.6 | 25 | ∼24 |
U6$ | 67.1±11.3 | 25 | ∼24 |
hY1$ | 16.7±1.6 | 10 | ∼24 |
hY3$ | 36.9±7.0 | 20 | ∼24 |
hY4$ | 57.5±6.4 | 25 | ∼24 |
hY5$ | 47.5±9.4 | 25 | ∼24 |
ORF1$ (pol II) | 23.1±5.5 | 10 | ∼48 |
Alu (pol II)^ | 1.3±0.9 | 1 | ∼48 |
†: Rates were calculated by determining the number of G418 resistant colonies generated at 72 h, per 1 million cells per µg of transfected plasmid. Rates should not be considered as absolute numbers, as results will vary for different cell lines and conditions.
*: Due to the intrinsic experimental variation, a rough approximation was used to determine the relative retrotransposition rates. The lowest observed rate was arbitrarily designated as “1.”
¥: Data from the JM101/L1.3 construct.
Retrotransposition of the element was driven by an optimized L1 ORF2 expression plasmid (pBudORF2opt). Rates were much lower when a full-length wild type L1 was used as the driver for retrotransposition (not shown).
Retrotransposition of the element was driven by cotransfection with L1 ORF1 and ORF2 expression plasmids (pBudORF1opt and pBudORF2opt).
Discussion
Throughout mammalian evolution different mobile elements have flourished within genomes. Retroelements such as LINEs and SINEs have been particularly successful, generating more than one third of human sequence mass. Interestingly, the parasitic non-autonomous SINE elements outnumber their autonomous LINE partners in the primate and rodent genomes. The success of SINEs is especially evident when compared to the copy numbers of other non-autonomous elements such as the retropseudogenes.
Our data reveal differences between retropseudogenes, Alu, and L1 retrotransposition. When evaluating Alu and L1 retrotransposition kinetics, the tagged Alu transcript required less time to generate an insert. This timing difference can not be attributed to differences in the time required to generate functional transcripts or availability of L1 proteins. It is clear that full-length functional L1 transcripts can be detected as early as 3 hours post-transfection and are abundant by 24 h post-transfection. In addition, the difference observed between Alu and L1 kinetics could not be attributed to the type of detection cassette system (self splicing or not) or to the differences in the retrotransposition rates. L1 colonies were rarely observed (Figure 7) at time points earlier than 48 h. These few observed G418R colonies possibly represent the rare event that circumvented inhibition by d4t (in one experiment a colony was observed even at the zero time point). In our assay, production of L1 ORF2p is not limiting. Our data demonstrate that enough ORF2p is generated from an L1 construct to drive Alu insertions within 24 hours post-transfection, which indicates that ORF2p is made and readily available for Alu transcript mobilization. However, at this time we do not know if the ORF2p reaches the nucleus as a “free” protein or as part of an RNP with the L1 RNA or Alu RNA. As expected, due to the L1 cis-preference [10], pre-transfections with ORF1p, ORF2p or other L1 components, such as full-length transcripts or RNPs, did not affect the L1 time requirement.
Although unexpected, it is not totally surprising that Alu and L1 present different retrotransposition time requirements. Previous data show that, although Alu and L1 share the same insertion hallmarks, the two elements can exhibit differences in their behavior. For example, of two HeLa “cell lines,” only one supports Alu retrotransposition while both support L1 retrotransposition [62]. In addition, Alu and L1 are selectively inhibited by different APOBEC3 proteins [62]. This corroborates our observations that cellular components differentiate between Alu and L1 retrotransposition.
Our data suggest that the observed time differences are dependent on the type of RNA polymerase generating the transcript. Multiple features that distinguish these two transcript types may collectively or individually contribute to the observed differences in the retrotransposition timing between L1 and Alu elements. RNA capping, association with the translational machinery and ORF1 requirement are plausible factors that could influence SINE and LINE retrotransposition kinetics. As a pol II product, L1 mRNA is likely capped. Experimental evidence indicates that at least part of the L1 mRNA is capped [43] and that capping enhances L1 translation in vitro [44]. In contrast, pol III genes lack the 7-methylguanosine cap and are subjected to different processing in a spatially separate location of the nucleus [57],[58]. L1 mRNA likely interacts with most, if not all, of the pol II protein complexes that assemble with the transcription of generic mRNAs, as evidenced by the premature polyadenylation and splicing of L1 transcripts [6],[42].
Even though both pol II and pol III produced RNAs form complexes with various cellular proteins, the structure and composition of these RNPs varies dramatically. As a rule, pol III transcripts do not code for proteins and therefore interact with the translational machinery in a different manner than mRNA. Most known pol III transcripts fold to form a structured RNA and associate with a variety of proteins to form RNPs. Specifically, Alu interaction with SRP9 and SRP14 [12] is thought to transiently provide proximity to the ribosomal complexes and translating L1 RNA, allowing the Alu transcript to efficiently compete for the L1 factors required for retrotransposition [26]. It is also likely that the ability of the dimeric Alu to bind these proteins contributes to the dramatic difference in retrotransposition rates observed between Alu and other SINEs [9],[13]. In contrast, the polyribosomes and translation machinery assemble with the L1 mRNA in a more stable complex to undergo translation. The cis-preference displayed by L1 [20] suggests that the L1 RNA must dissociate from the cellular translation machinery to form L1 RNPs as an intermediate step in the retrotranspositional process. These L1 complexes are composed of L1 RNA, ORF1p [20], and likely ORF2 protein [11]. All three components are shown to co-purify in the polyribosomal fraction of the cytoplasm [11],[23]. It is plausible that ORF1p directly competes with the cellular translation machinery for access to L1 mRNAs, transitioning the L1 transcript away from the polyribosomal fraction and into the retrotranspositionally competent RNPs. Because of their nature and subcellular localization, SINEs completely avoid these two potentially time consuming steps in their mobilization. Therefore, SINE transcripts may enter their retrotransposition cycle as soon as L1 ORF2p becomes available.
The pol II-driven Alu transcripts that are most likely to associate with the cellular translational machinery, at least transiently, require L1 ORF1 protein in addition to ORF2 protein for retrotransposition in a manner reminiscent of retropseudogenes [29]. The retrotransposition time of the pol II-driven Alu parallels that of L1. At this stage it is unclear what the role of ORF1p is in the trans-mobilization of retropseudogenes or the pol II Alu transcript. However, it is consistent with the above-discussed hypothesis implicating ORF1 protein in removing pol II RNAs from their expected cycle of translation and degradation. Thus, the pol II L1 and the pol III Alu transcript interactions with different cellular components may dictate the timing difference between L1 and Alu RNAs to form their respective retrotranspositionally competent complexes.
The inefficient retrotransposition rate of the pol II-driven Alu construct suggests that the presence of an Alu sequence within an mRNA would not facilitate its retrotransposition by L1 factors. Although there is no available data on the SVA promoter, it is unlikely that the pol III polymerase drives SVA transcription due to the presence of numerous pol III terminators within its sequence. Thus, it is questionable whether the truncated antisense Alu-like sequences present in the SVA element contribute to the L1 trans-complementation of this retroposon as previously suggested [35].
In addition to assisting its own retrotransposition, the cis-preference exhibited by L1 may decrease cell damage by limiting random retrotransposition of cellular mRNA. A previous study demonstrated the co-localization of ORF1p and cellular proteins to stress granules[63]. The authors suggest that the sequestering of ORF1 protein in stress granules for degradation may prevent promiscuous binding of ORF1p to non-L1 mRNAs. Thus, as a side effect of L1 self-preference, retropseudogene formation is less likely [5]. In addition, this “cis-preference” could help the L1 transcript “escape” the ribosomal complex and degradation pathways. Once translation is completed, most transcripts decay by several known mRNA degradation pathways, reviewed in [56]. In contrast, pol III transcripts are meant to perform their function as RNA molecules in the cytoplasm or nucleus before degradation by the exosome [64]. Essentially, the functional molecule of pol III genes is the RNA, while for pol II genes the mRNA is an intermediary prior to the generation of the functional protein. In the case of L1, the ORF1p may play an additional role by protecting the L1 RNA from degradation, increasing the chance of returning to the nucleus where the involvement of ORF1p in the L1 integration process has been previously suggested [18],[23]. Thus, the requirement for both ORF1 and ORF2 proteins could contribute to the longer time needed for L1 transcripts to generate inserts. In addition, it is plausible that interactions with different cellular components during insertion, mediated by ORF1p, may contribute to the timing differences observed.
We postulate that the differences observed in retrotransposition kinetics are dictated by the type of RNA polymerase generating the transcript. We propose an initial model where the cytoplasmic interactions of pol II (L1 and mRNA) and pol III transcripts and pathways influence the amplification kinetics of LINEs and SINEs (Figure 8). Overall, it is evident that the type of RNA polymerase generating the transcript alters the timing of mobile element insertion and remains a critical parameter in the classification of different types of retroelements.
Materials and Methods
Plasmids
L1 related vectors
JM101/L1.3 referred to “L1mneo” contains a full-length copy of the L1.3 element and the mneoI indicator cassette cloned in pCEP4 (Invitrogen) [61],[65].
JM101/L1.3 no tag, referred as “L1 no tag” contains a full-length copy of the L1.3 element cloned in pCEP4 [10].
TAM102/L1.3 contains the full-length copy of L1.3 element and the mblastI indicator cassette cloned in pCEP4 [32].
ORF1mneo contains the L1.3 5′UTR, L1.3 ORF1, and the mneoI indicator cassette cloned in pCEP4 [10].
psmL1 contains the codon optimized L1spa sequence and the neomycin indicator cassette cloned in pCEP4 [52].
L1neo TET contains the codon optimized L1RP driven by CMV promoter and tagged with the self splicing intron neo cassette [4] in pBluescript.
JM101/L1.3, JM101/L1.3 no tag, TAM102/L1.3 and ORF1mneo were kind gifts from Dr. John Moran. psmL1 was a kind gift from Dr. Jef Boeke [52]. pBudORF2opt [33]and pBudORF1opt [66] have been previously described. The open reading frames are cloned into the expression vector pBudCE4.1 (Invitrogen), under control of the CMV promoter.
SINE related vectors
Alu-neo TET containing a 7SL upstream enhancer region - Alu core sequence followed by the neo TET self-splicing indicator cassette and 44 A-stretch followed by a pol III terminator [4] was a kind gift from Dr. Thierry Heidmann.
The “SINE”neo TET constructs listed below were created by initially modifying the Aluneo TET vector. The QuickChange site-directed mutagenesis kit (Stratagene) was used to introduce an AatII site (underlined) at the 3′ end of the Alu element with a set of complementary 5′ phosphorylated primers to the following sequence: 5′- AGCCTGGGCGACAGAGCGAGTCGACGTCTCAAATCCCCTCAG -3′ following the manufacturer's recommended protocol. The new construct, referred to as AluYa5neo TET AatII, was then used to introduce the different individual elements and their corresponding upstream enhancer sequences using the BamHI (5′ of 7SL promoter enhancer sequence) and the AatII sites (schematic of the basic vector shown in Figure 2A). The BamHI and AatII sites are underlined.
AluYa5neo TET, contains a larger amount of the upstream pol III enhancer sequence of the 7SL gene (113 bp) and the AluYa5 consensus sequence from p7SLYa5BC1 [67].
7SLneo TET, contains a 413 bp fragment of the Human 7SL RNA gene Accession # M20910 and its upstream pol III enhancer sequence, PCR amplified with primers F7SL: 5′-GGATCCGCCCAGTGTGGGTGTGTCC-3′ and R7SL: 5′ GACGTCAAGAGACGGGGTCTCGCTATG-3′
U6neo TET, contains a 512 bp fragment of the human small nuclear RNA gene HUMUG6 Accession# M14486 and its upstream pol III enhancer sequence PCR amplified with primers FU6: 5′-GGATCCGCAGACACTGCTCGGTAGTT-3′ and RU6: 5′ GACGTCAAGAGACGGGGTCTCGCTATG-3′
hY1neo TET, contains a 478 bp fragment of the human RNA gene hY1 encoding Ro RNA Accession# V00584 and its upstream pol III enhancer sequence PCR amplified with primers FhY1: 5′-GGATCCGTCCCACAGAGCTGTCCGGAGG-3′ and RhY1: 5′-GACGTCAAGACTAGTCAAGTGCAGTAGTGAG-3′
hY3neo TET, contains a 430 bp fragment of the human RNA gene hY3 encoding Ro RNA Accession# V00585 and its upstream pol III enhancer sequence PCR amplified with primers FhY3: 5′-GGATCCCGCTCTAGACGTCCTGGCC-3′ and RhY3: 5′-GACGTCAAGGCTAGTCAAGTGAAGCAG-3′
hY4neo TET, contains a 494 bp fragment of the human RNA gene hY4 encoding Ro RNA Accession# L32608 and its upstream pol III enhancer sequence PCR amplified with primers FhY4: 5′-GGATCCACAGGCAGGGAFACGACAAA-3′ and RhY4: 5′-GACGTCAAGCCAGTCAAATTTAGCAGTGG-3′
hY5neo TET, contains a 667 bp fragment of the human RNA gene hY5 encoding Ro RNA Accession# K01564 and its upstream pol III enhancer sequence PCR amplified with primers FhY5: 5′-GGATCCTGATGATGAAACAAAGCC-3′ and RhY5: 5′-GACGTCAACAGCAAGCTAGTCAAGC-3′.
B2neo TET, contains 113 bp of the upstream pol III enhancer sequence of the 7SL gene and the B2 sequence from p7SLB2BC1 [67].
CMVYa5mneo (“pol II Alu”) the mneoI cassette including the SV40 polyadenylation signal from JM101/L1.3 was amplified by PCR with the high fidelity phusion DNA polymerase (New England Biolabs) using primers sets to introduce the 5′ FseI and a 3′ BglII used for introduction into the compatible FseI-BamHI sites in pGL3CMVYa5BC1-SV40pA [68]. The construct was then modified to eliminate the internal polyA-stretch immediately downstream of the Alu sequence and 5′ to the selection cassette (Figure 6). The pol II Alu transcripts generated from this construct are polyadenylated from the SV40pA signal.
pIRES2-EGFP (BD Biosciences Clontech) was used as the G418R expression plasmid for toxicity control.
All plasmid DNA was purified by alkaline lysis and twice purified by cesium chloride buoyant density centrifugation. DNA quality was also evaluated by the visual assessment of ethidium bromide stained agarose gel electrophoresed aliquots. All new constructs were sequence verified
Retrotransposition Assays
The basic transient L1 [16] or Alu [4] retrotransposition assay was performed as previously described with some minor modifications. Briefly, HeLa cells (ATCC CCL2) were seeded in T-75 flasks at a density of 5×105 cells/flask or in 6 well plates at a density of 2.5–5×104/well. Transient transfections were performed the next day with Lipofectamine Plus following the manufacturer's protocol (Invitrogen), with 3 µg of SINE-neo TET vector plus 1 µg pBud-ORF2opt or 1 µg of L1 no tag. For L1 assays 1 µg of JM101/L1.3 was used. Inhibitory effects on cellular growth or colony formation capabilities by the d4t treatment was evaluated by transfecting cells in parallel with 0.3 µg of a plasmid expressing neomycin resistance (pIRES2-EGFP; BD Biosciences Clontech) as a “toxicity” control. Following removal of transfection cocktail, the cells were treated with the appropriate media containing 400 µg/ml Geneticin/G418 (Fisher Scientific) alone or in combination with 50 µM d4t for selection and/or reverse transcriptase inhibition. After 14 days, cells were fixed and stained for 30 minutes with crystal violet (0.2% crystal violet in 5% acetic acid and 2.5% isopropanol). The inhibitor d4t- (2′,3′-Didehydro-3′-deoxy-thymidine; Sigma-Aldrich) was freshly added to the selection media at the indicated time period. During the inhibitor treatment period all cells in the experiment were refreshed daily for the first week with the appropriate media. The rate of insertion efficiency (retrotransposition rate) was determined as the number of visible G418R-resistant colonies obtained at 72 h after transient transfection of 1×106 seeded HeLa cells with 1 µg of the neo tagged construct.
Northern Blot Analysis
RNA extraction and poly(A) selection was performed as previously described [42]. Total RNA was extracted using the recommended protocol for TRIzol Reagent (Invitrogen) from two 75 cm2 cell culture flasks at 3, 6, 24, 48, and 72 hours post-transfection. The PolyATract mRNA isolation system III (Promega) was used to select polyadenylated RNA species following the manufacturer's protocol. After separation in a 1% (L1) or a 2% (pol III constructs) agarose-formaldehyde gel, the RNA was transferred to a Hybond-N nylon membrane (Amersham Biosciences). The RNA was cross-linked to the membrane using a UV-light (GS Gene linker, BioRad) and pre-hybridized in 30% formamide, 1× Denhardt's solution, 1% SDS, 1 M NaCl, 100 µg/ml salmon sperm DNA, 100 µg/ml-1 yeast t-RNA at 60°C for at least 6 h. The 3′ region of the neomycin gene was amplified by PCR using the following primers T7neo (−): 5′-TAATACGACTCACTATAAGGACGAGGCAGCG-3′ and Neo northern (+): 5″- GAAGAACTCGTCAAGAAGG-3′. The isolated PCR product was used as a DNA template to generate a 32P-CTP (Amersham Biosciences) labeled single strand-specific RNA probe using the MAXIscript T7 kit (Ambion) following the manufacturer's recommended protocol. We utilized material included in the kit to generate the riboprobe for the β-actin. The radiolabeled probe was purified by filtration through a NucAway Spin column (Ambion). Hybridization with the probe (final concentration of 4–12×106 cpm/ml) was carried out overnight in the pre-hybridization solution at 60°C. Two ten-minute washes were performed at high stringency (0.1×SSC, 0.1%SDS) at 60°C. The results of the northern blot assays were evaluated using a Typhoon Phosphorimager (Amersham Biosciences) and the ImageQuant software.
Supporting Information
Acknowledgments
We are indebted and very grateful to Louisiana State University in Baton Rouge and specifically Dr. Mark Batzer for hosting us in his laboratory during our evacuation for hurricane Katrina. We thank the members of Dr. Batzer's laboratory for their support. We also thank Prescott Deininger for his continuous support and insightful comments.
Footnotes
The authors have declared that no competing interests exist.
This research was supported by National Institutes NIH/NCRR P20 RR020152 (http://www.ncrr.nih.gov) and NIH R01GM079709A (http://www.nigms.nih.gov) awarded to AMRE. Competitive Advantage Funds from the Louisiana Cancer Research Consortium (LCRC) were also awarded to AMRE. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of NCRR or NIH. BJW is supported by a Louisiana Cancer Research Consortium (LCRC) Fellowship and a Matching Funds Fellowship Award, 2006–2008 provided from the developmental funds of the Tulane Cancer Center. VPB is supported by NIH/NIA 5K01AG030074-02 and The Ellison Medical Foundation New Scholar in Aging award. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
- 2.Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi: 10.1038/nature01262. [DOI] [PubMed] [Google Scholar]
- 3.Gibbs RA, Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, et al. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature. 2004;428:493–521. doi: 10.1038/nature02426. [DOI] [PubMed] [Google Scholar]
- 4.Dewannieux M, Esnault C, Heidmann T. LINE-mediated retrotransposition of marked Alu sequences. Nat Genet. 2003;35:41–48. doi: 10.1038/ng1223. [DOI] [PubMed] [Google Scholar]
- 5.Goncalves I, Duret L, Mouchiroud D. Nature and structure of human genes that generate retropseudogenes. Genome Res. 2000;10:672–678. doi: 10.1101/gr.10.5.672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Belancio VP, Hedges DJ, Deininger P. LINE-1 RNA splicing and influences on mammalian gene expression. Nucleic Acids Research. 2006;34:1512–1521. doi: 10.1093/nar/gkl027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Song M, Boissinot S. Selection against LINE-1 retrotransposons results principally from their ability to mediate ectopic recombination. Gene. 2006;390:206–213. doi: 10.1016/j.gene.2006.09.033. [DOI] [PubMed] [Google Scholar]
- 8.Chen JM, Stenson PD, Cooper DN, Ferec C. A systematic analysis of LINE-1 endonuclease-dependent retrotranspositional events causing human genetic disease. Hum Genet. 2005;117:411–427. doi: 10.1007/s00439-005-1321-0. [DOI] [PubMed] [Google Scholar]
- 9.Dewannieux M, Heidmann T. L1-mediated retrotransposition of murine B1 and B2 SINEs recapitulated in cultured cells. J Mol Biol. 2005;349:241–247. doi: 10.1016/j.jmb.2005.03.068. [DOI] [PubMed] [Google Scholar]
- 10.Wei W, Gilbert N, Ooi SL, Lawler JF, Ostertag EM, et al. Human L1 retrotransposition: cis preference versus trans complementation. Mol Cell Biol. 2001;21:1429–1439. doi: 10.1128/MCB.21.4.1429-1439.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kulpa DA, Moran JV. Cis-preferential LINE-1 reverse transcriptase activity in ribonucleoprotein particles. Nat Struct Mol Biol. 2006;13:655–660. doi: 10.1038/nsmb1107. [DOI] [PubMed] [Google Scholar]
- 12.Hsu K, Chang DY, Maraia RJ. Human signal recognition particle (SRP) Alu-associated protein also binds Alu interspersed repeat sequence RNAs. Characterization of human SRP9. J Biol Chem. 1995;270:10179–10186. doi: 10.1074/jbc.270.17.10179. [DOI] [PubMed] [Google Scholar]
- 13.Bennett EA, Keller H, Mills RE, Schmidt S, Moran JV, et al. Active Alu retrotransposons in the human genome. Genome Res. 2008;18:1875–1883. doi: 10.1101/gr.081737.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Boeke J, Garfinkel DJ, Styles CA, Fink CR. Ty elements transpose through an RNA intermediate. Cell. 1985;40:491–500. doi: 10.1016/0092-8674(85)90197-7. [DOI] [PubMed] [Google Scholar]
- 15.Holmes SE, Singer MF, Swergold GD. Studies on p40, the leucine zipper motif-containing protein encoded by the first open reading frame of an active human LINE-1 transposable element. J Biol Chem. 1992;267:19765–19768. [PubMed] [Google Scholar]
- 16.Moran JV, Holmes SE, Naas TP, DeBerardinis RJ, Boeke JD, et al. High frequency retrotransposition in cultured mammalian cells. Cell. 1996;87:917–927. doi: 10.1016/s0092-8674(00)81998-4. [DOI] [PubMed] [Google Scholar]
- 17.Wallace N, Wagstaff BJ, Deininger PL, Roy-Engel AM. LINE-1 ORF1 protein enhances Alu SINE retrotransposition. Gene. 2008;419:1–6. doi: 10.1016/j.gene.2008.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Martin SL, Bushman FD. Nucleic acid chaperone activity of the ORF1 protein from the mouse LINE-1 retrotransposon. Mol Cell Biol. 2001;21:467–475. doi: 10.1128/MCB.21.2.467-475.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Martin SL, Cruceanu M, Branciforte D, Wai-Lun LP, Kwok SC, et al. LINE-1 retrotransposition requires the nucleic acid chaperone activity of the ORF1 protein. J Mol Biol. 2005;348:549–561. doi: 10.1016/j.jmb.2005.03.003. [DOI] [PubMed] [Google Scholar]
- 20.Basame S, Wai-Lun LP, Howard G, Branciforte D, Keller D, et al. Spatial Assembly and RNA Binding Stoichiometry of a LINE-1 Protein Essential for Retrotransposition. J Mol Biol. 2006;357:351–357. doi: 10.1016/j.jmb.2005.12.063. [DOI] [PubMed] [Google Scholar]
- 21.Mathias SL, Scott AF, Kazazian HH, Jr, Boeke JD, Gabriel A. Reverse transcriptase encoded by a human transposable element. Science. 1991;254:1808–1810. doi: 10.1126/science.1722352. [DOI] [PubMed] [Google Scholar]
- 22.Feng Q, Moran JV, Kazazian HH, Jr, Boeke JD. Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell. 1996;87:905–916. doi: 10.1016/s0092-8674(00)81997-2. [DOI] [PubMed] [Google Scholar]
- 23.Kulpa DA, Moran JV. Ribonucleoprotein particle formation is necessary but not sufficient for LINE-1 retrotransposition. Hum Mol Genet. 2005;14:3237–3248. doi: 10.1093/hmg/ddi354. [DOI] [PubMed] [Google Scholar]
- 24.Kremerskothen J, Zopf D, Walter P, Cheng JG, Nettermann M, et al. Heterodimer SRP9/14 is an integral part of the neural BC200 RNP in primate brain. Neurosci Lett. 1998;245:123–126. doi: 10.1016/s0304-3940(98)00215-8. [DOI] [PubMed] [Google Scholar]
- 25.West N, Roy-Engel A, Imataka H, Sonenberg N, Deininger P. Shared Protein Components of SINE RNPs. J Mol Biol. 2002;321:423–432. doi: 10.1016/s0022-2836(02)00542-9. [DOI] [PubMed] [Google Scholar]
- 26.Boeke JD. LINEs and Alus–the polyA connection. Nat Genet. 1997;16:6–7. doi: 10.1038/ng0597-6. [DOI] [PubMed] [Google Scholar]
- 27.Luan DD, Korman MH, Jakubczak JL, Eickbush TH. Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition. Cell. 1993;72:595–605. doi: 10.1016/0092-8674(93)90078-5. [DOI] [PubMed] [Google Scholar]
- 28.Goodier JL, Ostertag EM, Engleka KA, Seleme MC, Kazazian HH., Jr A potential role for the nucleolus in L1 retrotransposition. Hum Mol Genet. 2004;13:1041–1048. doi: 10.1093/hmg/ddh118. [DOI] [PubMed] [Google Scholar]
- 29.Esnault C, Maestre J, Heidmann T. Human LINE retrotransposons generate processed pseudogenes. Nat Genet. 2000;24:363–367. doi: 10.1038/74184. [DOI] [PubMed] [Google Scholar]
- 30.Roy-Engel AM, Salem AH, Oyeniran OO, Deininger L, Hedges DJ, et al. Active alu element “A-Tails”: size does matter. Genome Res. 2002;12:1333–1344. doi: 10.1101/gr.384802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Dewannieux M, Heidmann T. Role of poly(A) tail length in Alu retrotransposition. Genomics. 2005;86:378–381. doi: 10.1016/j.ygeno.2005.05.009. [DOI] [PubMed] [Google Scholar]
- 32.Morrish TA, Gilbert N, Myers JS, Vincent BJ, Stamato TD, et al. DNA repair mediated by endonuclease-independent LINE-1 retrotransposition. Nat Genet. 2002;31:159–165. doi: 10.1038/ng898. [DOI] [PubMed] [Google Scholar]
- 33.Gasior SL, Wakeman TP, Xu B, Deininger PL. The human LINE-1 retrotransposon creates DNA double-strand breaks. J Mol Biol. 2006;357:1383–1393. doi: 10.1016/j.jmb.2006.01.089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Jurka J. Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons. Proc Natl Acad Sci U S A. 1997;94:1872–1877. doi: 10.1073/pnas.94.5.1872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Ostertag EM, Goodier JL, Zhang Y, Kazazian HH., Jr SVA elements are nonautonomous retrotransposons that cause disease in humans. Am J Hum Genet. 2003;73:1444–1451. doi: 10.1086/380207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Perreault J, Noel JF, Briere F, Cousineau B, Lucier JF, et al. Retropseudogenes derived from the human Ro/SS-A autoantigen-associated hY RNAs. Nucleic Acids Res. 2005;33:2032–2041. doi: 10.1093/nar/gki504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kramerov DA, Vassetzky NS. Short retroposons in eukaryotic genomes. Int Rev Cytol. 2005;247:165–221. doi: 10.1016/S0074-7696(05)47004-7. [DOI] [PubMed] [Google Scholar]
- 38.Walter P, Blobel G. Signal recognition particle contains a 7S RNA essential for protein translocation across the endoplasmic reticulum. Nature. 1982;299:691–698. doi: 10.1038/299691a0. [DOI] [PubMed] [Google Scholar]
- 39.Perreault J, Perreault JP, Boire G. The Ro Associated Y RNAs in Metazoans: Evolution and Diversification. Mol Biol Evol. 2007;24:1678–1689. doi: 10.1093/molbev/msm084. [DOI] [PubMed] [Google Scholar]
- 40.Thayer RE, Singer MF, Fanning TG. Undermethylation of specific LINE-1 sequences in human cells producing a LINE-1-encoded protein. Gene. 1993;133:273–277. doi: 10.1016/0378-1119(93)90651-i. [DOI] [PubMed] [Google Scholar]
- 41.Han JS, Szak ST, Boeke JD. Transcriptional disruption by the L1 retrotransposon and implications for mammalian transcriptomes. Nature. 2004;429:268–274. doi: 10.1038/nature02536. [DOI] [PubMed] [Google Scholar]
- 42.Perepelitsa-Belancio V, Deininger PL. RNA truncation by premature polyadenylation attenuates human mobile element activity. Nat Genet. 2003;35:363–366. doi: 10.1038/ng1269. [DOI] [PubMed] [Google Scholar]
- 43.Athanikar JN, Badge RM, Moran JV. A YY1-binding site is required for accurate human LINE-1 transcription initiation. Nucleic Acids Res. 2004;32:3846–3855. doi: 10.1093/nar/gkh698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Dmitriev SE, Andreev DE, Terenin IM, Olovnikov IA, Prassolov VS, et al. Efficient Translation Initiation Directed by the 900 Nucleotides-Long and GC-Rich 5′ UTR of the Human Retrotransposon LINE-1 mRNA is Strictly Cap-Dependent Rather Than IRES-Mediated. Mol Cell Biol. 2007;27:4685–4697. doi: 10.1128/MCB.02138-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ostertag EM, Prak ET, DeBerardinis RJ, Moran JV, Kazazian HH., Jr Determination of L1 retrotransposition kinetics in cultured cells. Nucleic Acids Res. 2000;28:1418–1423. doi: 10.1093/nar/28.6.1418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, et al. CDD: a Conserved Domain Database for protein classification. Nucleic Acids Res. 2005;33:D192–D196. doi: 10.1093/nar/gki069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Sciamanna I, Landriscina M, Pittoggi C, Quirino M, Mearelli C, et al. Inhibition of endogenous reverse transcriptase antagonizes human tumor growth. Oncogene. 2005;24:3923–3931. doi: 10.1038/sj.onc.1208562. [DOI] [PubMed] [Google Scholar]
- 48.Kubo S, Seleme MC, Soifer HS, Perez JL, Moran JV, et al. L1 retrotransposition in nondividing and primary human somatic cells. Proc Natl Acad Sci U S A. 2006;103:8036–8041. doi: 10.1073/pnas.0601954103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jones RB, Garrison KE, Wong JC, Duan EH, Nixon DF, et al. Nucleoside analogue reverse transcriptase inhibitors differentially inhibit human LINE-1 retrotransposition. PLoS ONE. 2008;3:e1547. doi: 10.1371/journal.pone.0001547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Martin SL, Bushman D, Wang F, Li PW, Walker A, et al. A single amino acid substitution in ORF1 dramatically decreases L1 retrotransposition and provides insight into nucleic acid chaperone activity. Nucleic Acids Res. 2008;36:5845–5854. doi: 10.1093/nar/gkn554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Yang N, Zhang L, Zhang Y, Kazazian HH., Jr An important role for RUNX3 in human L1 transcription and retrotransposition. Nucleic Acids Res. 2003;31:4929–4940. doi: 10.1093/nar/gkg663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Han JS, Boeke JD. A highly active synthetic mammalian retrotransposon. Nature. 2004;429:314–318. doi: 10.1038/nature02535. [DOI] [PubMed] [Google Scholar]
- 53.Liu WM, Maraia RJ, Rubin CM, Schmid CW. Alu transcripts: cytoplasmic localisation and regulation by DNA methylation. Nucleic Acids Res. 1994;22:1087–1095. doi: 10.1093/nar/22.6.1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Shilatifard A, Conaway RC, Conaway JW. The RNA polymerase II elongation complex. Annu Rev Biochem. 2003;72:693–715. doi: 10.1146/annurev.biochem.72.121801.161551. [DOI] [PubMed] [Google Scholar]
- 55.Geiduschek EP, Kassavetis GA. The RNA polymerase III transcription apparatus. J Mol Biol. 2001;310:1–26. doi: 10.1006/jmbi.2001.4732. [DOI] [PubMed] [Google Scholar]
- 56.Garneau NL, Wilusz J, Wilusz CJ. The highways and byways of mRNA decay. Nat Rev Mol Cell Biol. 2007;8:113–126. doi: 10.1038/nrm2104. [DOI] [PubMed] [Google Scholar]
- 57.Pombo A, Jackson DA, Hollinshead M, Wang Z, Roeder RG, et al. Regional specialization in human nuclei: visualization of discrete sites of transcription by RNA polymerase III. EMBO J. 1999;18:2241–2253. doi: 10.1093/emboj/18.8.2241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Pombo A, Jones E, Iborra FJ, Kimura H, Sugaya K, et al. Specialized transcription factories within mammalian nuclei. Crit Rev Eukaryot Gene Expr. 2000;10:21–29. [PubMed] [Google Scholar]
- 59.Ullu E, Weiner AM. Human genes and pseudogenes for the 7SL RNA component of signal recognition particle. EMBO J. 1984;3:3303–3310. doi: 10.1002/j.1460-2075.1984.tb02294.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Kass DH, Kim J, Rao A, Deininger PL. Evolution of B2 repeats: the muroid explosion. Genetica. 1997;99:1–13. doi: 10.1007/BF02259494. [DOI] [PubMed] [Google Scholar]
- 61.Sassaman DM, Dombroski BA, Moran JV, Kimberland ML, Naas TP, et al. Many human L1 elements are capable of retrotransposition. Nat Genet. 1997;16:37–43. doi: 10.1038/ng0597-37. [DOI] [PubMed] [Google Scholar]
- 62.Hulme AE, Bogerd HP, Cullen BR, Moran JV. Selective inhibition of Alu retrotransposition by APOBEC3G. Gene. 2007;390:199–205. doi: 10.1016/j.gene.2006.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Goodier JL, Zhang L, Vetter MR, Kazazian HH., Jr LINE-1 ORF1 Protein Localizes in Stress Granules with Other RNA-Binding Proteins, Including Components of RNA Interference RNA-Induced Silencing Complex. Mol Cell Biol. 2007;27:6469–6483. doi: 10.1128/MCB.00332-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Houseley J, LaCava J, Tollervey D. RNA-quality control by the exosome. Nat Rev Mol Cell Biol. 2006;7:529–539. doi: 10.1038/nrm1964. [DOI] [PubMed] [Google Scholar]
- 65.Dombroski BA, Scott AF, Kazazian HH., Jr Two additional potential retrotransposons isolated from a human L1 subfamily that contains an active retrotransposable element. Proc Natl Acad Sci U S A. 1993;90:6513–6517. doi: 10.1073/pnas.90.14.6513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Wallace NA, Belancio VP, Deininger PL. L1 mobile element expression causes multiple types of toxicity. Gene. 2008;419:75–81. doi: 10.1016/j.gene.2008.04.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Roy AM, West NC, Rao A, Adhikari P, Alemán C, et al. Upstream flanking sequences and transcription of SINEs. J Mol Biol. 2000;302:17–25. doi: 10.1006/jmbi.2000.4027. [DOI] [PubMed] [Google Scholar]
- 68.Roy-Engel AM, El Sawy M, Farooq L, Odom GL, Perepelitsa-Belancio V, et al. Human retroelements may introduce intragenic polyadenylation signals. Cytogenet Genome Res. 2005;110:365–371. doi: 10.1159/000084968. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.