Skip to main content
eLife logoLink to eLife
. 2024 Apr 3;12:RP87811. doi: 10.7554/eLife.87811

A concerted increase in readthrough and intron retention drives transposon expression during aging and senescence

Kamil Pabis 1,2,3,, Diogo Barardo 2,3, Olga Sirbu 4, Kumar Selvarajoo 4,5,6,7, Jan Gruber 1,8, Brian K Kennedy 1,2,3
Editors: Pablo A Manavella9, Kathryn Song Eng Cheah10
PMCID: PMC10990488  PMID: 38567944

Abstract

Aging and senescence are characterized by pervasive transcriptional dysfunction, including increased expression of transposons and introns. Our aim was to elucidate mechanisms behind this increased expression. Most transposons are found within genes and introns, with a large minority being close to genes. This raises the possibility that transcriptional readthrough and intron retention are responsible for age-related changes in transposon expression rather than expression of autonomous transposons. To test this, we compiled public RNA-seq datasets from aged human fibroblasts, replicative and drug-induced senescence in human cells, and RNA-seq from aging mice and senescent mouse cells. Indeed, our reanalysis revealed a correlation between transposons expression, intron retention, and transcriptional readthrough across samples and within samples. Both intron retention and readthrough increased with aging or cellular senescence and these transcriptional defects were more pronounced in human samples as compared to those of mice. In support of a causal connection between readthrough and transposon expression, analysis of models showing induced transcriptional readthrough confirmed that they also show elevated transposon expression. Taken together, our data suggest that elevated transposon reads during aging seen in various RNA-seq dataset are concomitant with multiple transcriptional defects. Intron retention and transcriptional readthrough are the most likely explanation for the expression of transposable elements that lack a functional promoter.

Research organism: Human, Mouse

Introduction

Repetitive DNA makes up at least 50% of the human genome (de Koning et al., 2011) and has been linked to genomic instability, cancer and – somewhat less consistently – species longevity (Pabis, 2021; Khristich and Mirkin, 2020). Transposons are one particularly abundant class of repetitive sequences found in the nuclear genome of eukaryotes and are strongly associated with cancer-causing structural variants like DNA deletions, insertions, or inversions (Rodriguez-Martin et al., 2020). Although transposons are often considered parasitic, harmful or, at best, neutral recent evidence suggests they may contribute to genomic diversification on evolutionary timescales and to the cellular stress-response by providing transcription factor-binding sites (Villanueva-Cañas et al., 2019).

The major transposon families in the human genome are long interspersed nuclear element (LINE), short interspersed nuclear element (SINE), long terminal repeat (LTR) transposons, and DNA transposons comprising 21%, 11%, 8%, and 3% of the human genome, respectively (Kazazian and Moran, 2017).

In recent years, transposon expression, particularly of elements belonging to the LINE-1 family, has been hypothesized to be a near universal marker of aging. Several lines of evidence support a link between transposon reactivation and aging (Gorbunova et al., 2021).

Using RNA-seq, for example, to quantify reads mapping to transposable elements it was shown that expression of these was higher in 10- vs 1-day-old nematodes (LaRocca et al., 2020), 40- vs 10-day-old flies (Wood et al., 2016) and in muscle and liver of aged mice (De Cecco et al., 2019). Transposon expression also increases during in vitro senescence (Colombo et al., 2018) and in fibroblasts isolated from people between the ages of 1 and 96 years (LaRocca et al., 2020), whereas lifespan extending mutations and interventions in mice reduce transposon expression (Wahl et al., 2021).

However, not all studies report such consistent age-related increases across all transposon classes (Ghanam et al., 2019). Moreover, it would be a mistake to conflate changes in RNA-seq-, RNA-, DNA-, and protein-based measurements of transposon expression as evidence for one and the same phenomenon.

Quantification of transposon expression using RNA-seq or PCR techniques remains challenging given their repetitive nature and the fact that 98–99% of all transposons are co-expressed with neighboring transcriptional units (Deininger et al., 2017; Stow et al., 2021). In fact, only a small fraction of LINE-1 and SINE elements, the latter relying on co-transposition by LINEs, are expressed from a functional promoter. It is believed that such ‘hot’ LINE-1 loci drive most transposition events and genomic instability (Deininger et al., 2017; Rodriguez-Martin et al., 2020).

The large number of co-expressed transposons would likely mask any signal from autonomous elements in standard RNA-seq experiments. Therefore, alternative explanations are needed for increased transposon expression in such datasets and these could include age-related changes in transposon adjacent genes for example intron retention and transcriptional readthrough.

Most eukaryotic genes contain introns which need to be removed during a complex co-transcriptional process called splicing. Defects in splicing are the cause of many, often neurologic, hereditary conditions (Scotti and Swanson, 2016). One such type of splicing defect is intron retention. Although basal levels of intron retention may be benign or even physiologic, high levels are considered harmful (Zheng et al., 2020). Aging is accompanied by many changes to splicing patterns (Wang et al., 2018), including intron retention, which is increased during cellular senescence (Yao et al., 2020) and in aging Drosophila, mouse hippocampus and human prefrontal cortex. In addition, levels in human AD brain tissues are elevated even further than in the aged brain (Adusumalli et al., 2019).

Transcription is terminated by the cleavage and polyadenylation machinery when the polymerase transcribes through the polyadenylation signal leading to pre-mRNA cleavage, polymerase pausing, conformational change, and eventually polymerase detachment (Rosa-Mercado et al., 2021). However, this termination may fail leading to so-called readthrough transcription which increases during stress conditions (Rosa-Mercado and Steitz, 2022) and during cellular senescence, at least for a subset of genes (Muniz et al., 2017).

Emerging evidence suggests that intron retention, readthrough, and transposon expression are linked (Rosa-Mercado et al., 2021; Hadar et al., 2022), but this has not been studied in the context of aging. In this manuscript, we show that intron retention and readthrough taken together can explain most of the apparent increase in transposon expression seen in RNA-seq datasets of aging.

Results

Genomic location of transposons implicates them as a marker of multiple transcriptional defects

During aging and senescence the expression of all four major transposon families (LTR, DNA, LINE, and SINE) increases. However, such a global increase in transposon expression cannot be explained by transposon biology alone since most transposons are ancient, and, having accumulated many mutations, lack a functional promoter (Deniz et al., 2019). Instead, we reasoned that transposon expression is associated with the expression of nearby genes.

Consistently, out of 4.5 million annotated transposons 60% were intragenic (most of them intronic), 10% within 5 kb of a gene and the remaining 30% were intergenic, albeit usually still within 100 kb (Table 1). For transposons whose expression significantly changed with age or senescence, the number of intronic and downstream-of-gene transposons is even higher.

Table 1. Fraction of genic or intronic transposons and of transposons downstream (ds) or upstream (us) of genes in two different datasets.

Dataset Genic Intronic ds us ds us n
All transposons 0.604 0.493 0.045 0.049 0.187 0.174 4,520,928
Fleischer et al.
(aging, significant)
0.857 0.564 0.082 0.025 0.105 0.037 7673
Colombo et al.
(senescence, significant)
0.698 0.540 0.107 0.053 0.177 0.118 8730
Within genes Wthin 5 kb Within 100 kb

Based on these findings, we studied intronic transposons as a potential marker for age-related intron retention and downstream-of-gene transposons as a marker of age-related readthrough transcription.

Elevated transposon expression and intron retention with age

To confirmed that age-related transposon expression is indeed elevated as shown by LaRocca et al., 2020, we analyzed the transcriptomic dataset originally generated by Fleischer et al., 2018, which included 143 fibroblasts isolated from donors between the ages of 0–96. This dataset will be hereafter referred to as the ‘aging dataset’. We find that transposon expression increases slowly and non-linearly during aging, with a sudden and sharp increase around 80 years of age (Figure 1A, Figure 1—figure supplement 1) and this increase was highly strand specific (Figure 1—figure supplement 2).

Figure 1. A concerted increase in transposon and intron expression with aging and senescence.

Transposon (A) and intron (B) expression is increased in skin fibroblasts isolated from aged individuals. What is more, expression of both transposons and introns shows a significant correlation (C). Similarly, transposon (D) and intron (E) expression is increased in MDAH041 cells induced into senescence (sen) by treatment with drugs or via passaging. Normalized reads from the top 1000 differentially expressed genes, transposons and introns were used in this analysis. (A) Transposon reads normalized by the expression of adjacent genes plotted across five age groups (adolescent n = 32, young n = 31, middle-aged n = 22, old n = 37, and very old n = 21). (B) Intron reads normalized by the expression of adjacent genes plotted across five age groups (adolescent n = 32, young n = 31, middle-aged n = 22, old n = 37, and very old n = 21). (C) Scatterplot between transposon and intron expression (normalized as in A and B) for all 143 samples. Each sample is colored by age. (D) Transposon reads normalized by the expression of adjacent genes plotted across four senescent conditions (H2O2, 5-azacytidine, adriamycin, and replicative senescence) and four other conditions (serum-starved, immortalized, intermediate passage, and early passage). N = 3 per group. (E) Intron reads normalized by the expression of adjacent genes plotted across four senescent conditions (H2O2, 5-azacytidine, adriamycin, and replicative senescence) and four other conditions (serum-starved, immortalized, intermediate passage, and early passage). N = 3 per group. (F) Scatterplot between transposon and intron expression (normalized as in E and F) for all 24 samples. Each sample is colored by senescence status.

Figure 1.

Figure 1—figure supplement 1. Aging promotes readthrough, transposon expression, and intron retention.

Figure 1—figure supplement 1.

Readthrough, transposon expression, and intron retention are elevated with aging when the expression of each element is normalized to the nearest gene (A) and similarly when all the reads are normalized to library size (B). Reads from the top 1000 differentially expressed elements were used in this analysis (n < 1000 for readthrough after filtering).
Figure 1—figure supplement 2. Transposons show strand-specific expression.

Figure 1—figure supplement 2.

Transposons with inverted strandedness (reverse) show lower expression levels (log counts; A) and no differential expression with age (B) when compared to matched differentially expressed transposons (actual). For this analysis, we selected all transposons showing significant differential expression with age in the actual dataset that also showed at least minimal expression in the strand-inverted analysis (n = 226). Data from Fleischer et al., 2018. (A) The log (counts) are clipped because we only used transposons that passed minimal read filtering in this analysis. (B) The distribution of expression values in the actual dataset is bimodal and positive since some transposons are significantly up- or downregulated. This bimodal distribution is lost in the strand-inverted analysis.
Figure 1—figure supplement 3. Stronger upregulation of transposons and introns than of genic transcripts.

Figure 1—figure supplement 3.

Intron, readthrough, and transposon elements are more strongly elevated with age (A) and cellular senescence (B) than are genic transcripts. The percentage of upregulated transcripts is indicated above each violin plot and the median log10-fold change for genic transcripts is indicated with a dashed red line. (A) Log10-fold changes plotted for all elements that significantly change with age (genes, introns, readthrough, transposon, and read-in transcripts). (B) Log10-fold changes plotted for all elements that significantly change with induced senescence (genes, introns, readthrough, transposon, and read-in transcripts). The four senescent conditions (H2O2, 5-Aza, adriamycin, and replicative senescence) are grouped together and compared with four control conditions (serum-starved, immortalized, intermediate passage, and early passage).
Figure 1—figure supplement 4. Increased transposon expression and intron retention in liver of aged mice.

Figure 1—figure supplement 4.

Transposon expression (A) and intron retention (B) are increased in the liver of 26-month-old mice. In contrast, transposon expression (C) and intron retention (D) are decreased with replicative senescence of mouse fibroblasts. Normalized counts from all (A, B) and top 1000 (C, D) differentially expressed genes, transposons and introns were used in this analysis. Mouse liver data are from Hahn et al., 2017 (GSE92486) and mouse replicative senescence data from Wang et al., 2022 (GSE179880). (A) Transposon reads normalized by the expression of adjacent genes plotted for each sample (ad libitum [AL] 5-month-old, AL 26-month-old, dietary restricted [DR] 5-month-old, and DR 26-month-old, n = 3 per group) (B) Intron reads normalized by the expression of adjacent genes plotted for each sample as in (A). (C) Transposon reads normalized by the expression of adjacent genes plotted for each sample (fibroblasts from passage [P] 1, 3, 5, and 7, n = 4 per group). (D) Intron reads normalized by the expression of adjacent genes plotted for each sample as in (C).
Figure 1—figure supplement 5. Aging promotes readthrough, transposon expression, and intron retention in mouse liver.

Figure 1—figure supplement 5.

Intron, readthrough, and transposon elements are elevated in the liver of aging mice (26- vs 5-month-old, n = 6 per group). Readthrough and transposon expression is especially elevated even when compered to genic transcripts. The percentage of upregulated transcripts is indicated above each violin plot and the median log10-fold change for genic transcripts is indicated with a dashed red line.
Figure 1—figure supplement 6. Correlated age-related increase of transposon and intron loci.

Figure 1—figure supplement 6.

Intron and transposon expression show a significant correlation. For every transposon within an intron, the correlation between the log-fold change of the intron and the transposon is plotted. Scatter plots are shown for all transposons and introns (A, B) and all transposons and introns that are significantly changed (C, D) in the aging dataset (A, C) and with cellular senescence (B, D).
Figure 1—figure supplement 7. Correlated expression of transposon and intron loci.

Figure 1—figure supplement 7.

Intronic counts are correlated with the counts of intronic transposons in the aging dataset (A) and in the senescence dataset (B). In contrast, intragenic reads aligning outside of introns do not correlate with the counts of intronic transposons (labeled ‘outside’). Intronic counts also show no correlation with randomized transposon counts (labeled ‘rand’). Pearson correlation was performed for each sample in these datasets and the R-values for all the samples are shown here. Intronic counts for a gene were defined as the sum of all its intronic counts.
Figure 1—figure supplement 8. Transposons are depleted at splice junctions.

Figure 1—figure supplement 8.

Transposons are evenly distributed within introns except for the region close to splice junctions (A–E). Transposons appear to be excluded from the splice junction-adjacent region both in all introns (A, D) and in significantly retained introns (B, E). In addition, transposon density of all introns and significantly retained introns is comparable (C, F). We included only introns containing at least one transposon in this analysis and normalized their length to 1. (A) Distribution of 2,292,769 transposons within 163,498 introns among all annotated transposons. (B) Distribution of 195,190 transposons within 14,100 introns significantly retained with age. (C) Density (transposon/1 kb of intron) of transposons in all introns (n = 163,498) compared to significantly retained introns (n = 14,100). (D) as in (A). (E) Distribution of 428,130 transposons within 13,205 introns significantly retained with induced senescence. (F) Density (transposon/1 kb of intron) of transposons in all introns (n = 163,498) compared to significantly retained introns (n = 13,205).
Figure 1—figure supplement 9. Introns with and without transposons are retained during aging and senescence.

Figure 1—figure supplement 9.

We split the set of introns that significantly change with cellular aging (A) or cell senescence (B) into introns that contain at least one transposon (has_t) and those that do not contain any transposons (has_no_t). Intron retention is increased in both groups. In this analysis, we included all introns that passed minimal read filtering (n = 63,782 in A and n = 124,173 in B). Median log-fold change indicated with a dashed red line for the group of introns without transposons.

Next, we found that intron retention is elevated with aging (Figure 1B), consistent with the study by Yao et al., 2020, and that samples with high transposon expression also showed elevated intron retention (Figure 1C).

In our analysis, we corrected for gene co-expression by normalizing the expression levels of introns and transposons to the expression levels of the closest gene. Another complementary approach would be to plot and compare the raw, unadjusted log-fold changes for genes, introns, transposons, and other elements. In this analysis, we found that, on average, age-related upregulation of transposons and introns is more pronounced than upregulation of genes (Figure 1—figure supplement 3A), consistent with Figure 1A, B.

Next, we reanalyzed a dataset originally published by Colombo et al., 2018 which will be referred to as the ‘senescence dataset’. Here, we tested whether MDAH041 cells induced into senescence with H2O2, 5-azacytidine, adriamycin, or serial passaging show elevated transposon expression and intron retention. Consistent with the original findings, all these treatments increased the expression of transposons (Figure 1D, Figure 1—figure supplement 3B). The retention of introns was also increased (Figure 1E) and expression levels of introns and transposons correlated with each other (Figure 1F).

Having established that transposons reads and retained introns are elevated with aging and senescence, we tested whether this is conserved between mouse and human. Transposon expression (p < 0.0001) was elevated, and intron retention showed a trend toward elevation (p = 0.14), in the liver of 26-month-old mice, suggesting these age-related transcriptional defects might be conserved between species (Figure 1—figure supplement 4A, B; Figure 1—figure supplement 5). In contrast, we found that increased expression of transposons and intron retention in models of cellular senescence is not conserved between species. To the contrary, early passage mouse fibroblasts showed elevated transposon expression and intron retention compared to senescent fibroblasts (Figure 1—figure supplement 4C, D), which contrasts with the mouse liver and human senescence data.

Transposon expression as a marker of intron retention

So far these age-related events have been considered independently, but given the localization of transposons being mostly intragenic we reasoned that intron retention could bias the transposon findings. Indeed, we found that samples with high intron retention also show high transposon expression. This is true both in the aging dataset (Figure 1C) and in the cellular senescence dataset (Figure 1F), where intronic reads accounted for most of the observed variability in transposon expression (or vice versa). Looking at individual genomic loci, we also found a strong correlation between the log-fold change of a transposon and the log-fold change of the intron it is located in (Figure 1—figure supplement 6).

As a more direct test of this hypothesis, we determined the correlation between the counts of retained introns and the counts of intronic transposons for each sample individually. Non-intronic transposons and randomized transposons served as a negative control. Consistent with the log-fold change data, we found a specific correlation between intronic transposon and intron counts in both datasets, with stronger correlations in the senescence dataset (Figure 1—figure supplement 7).

Although causality is hard to establish from our datasets, we also tested whether transposon expression could affect intron retention. If transposon expression was causally linked to intron retention, the most likely mechanism would be via an impairment of the splice junction. In this case, we would expect more transposons near the splice junctions of retained (differentially expressed) introns. However, we observed a depletion of transposons at the splice junctions of all introns (Figure 1—figure supplement 8A) and of differentially expressed introns (Figure 1—figure supplement 8B, E). Transposon densities were also comparable between these two classes of introns (Figure 1—figure supplement 8C, F). Moreover, introns with and without expressed transposons showed similar levels of age-related upregulation (Figure 1—figure supplement 9). Based on our data, we propose a model (Figure 2) where intron retention contributes to age-related upregulation of transposons, rather than transposons contributing to intron retention.

Figure 2. A schematic model of transposon expression.

Figure 2.

In our model, represented in this schematic, transcription (A) can give rise to mRNAs and pre-mRNAs that contain retained introns when co-transcriptional splicing is impaired. This is often seen during aging and senescence, and these can contain transposon sequences (B). In addition, transcription can give rise to mRNAs and pre-mRNAs that contain transposon sequences toward the 3′-end of the mRNA when co-transcriptional termination at the polyadenylation signal (PAS) is impaired (C, D) as seen with aging and senescence. Some of these RNAs may be successfully polyadenylated (as depicted here) whereas others will be subject to nonsense mediated decay. Image created with Biorender.

© 2024, BioRender Inc

Figure 2 was created using BioRender, and is published under a CC BY-NC-ND license. Further reproductions must adhere to the terms of this license.

Expression of extragenic transposons as a marker of readthrough transcription

Most transposons showing increased expression with aging or cellular senescence mapped within genes or introns and their expression changes may be attributed to intron retention. How can we explain expression of the remaining 14–30% of transposons (Table 1)?

To answer this question, we analyzed the distribution of transposons mapping to extragenic regions. Out of all the transposons, showing significantly changed expression during aging or cellular senescence, most were located within 5 kb of genes, primarily downstream of genes (Figure 3A). The distribution of transposons in the aging and senescence dataset was very similar except for a wider distribution of senescence-associated transposons within a 100-kb window around genes (Figure 3B). This contrasts with the almost completely even distribution of randomly permuted transposons.

Figure 3. Biased distribution of extragenic transposons relative to genes.

(A) Extragenic transposons with significantly (sig.) changed expression during aging or cellular senescence show a biased distribution, when mapped back onto the genome, with a preference toward a 5-kb region at the 3′-end of genes when compared to all annotated transposons (all) or randomly permuted transposons (random). Permutation was performed using the bedtools shuffle function. (B) Extragenic transposons whose expression changes during cellular senescence are spread out further from genes compared to aging-associated transposons.

Figure 3.

Figure 3—figure supplement 1. Increased readthrough levels with aging and senescence.

Figure 3—figure supplement 1.

Readthrough transcription is increased in fibroblasts isolated from the very old (A, B) and after induction of senescence in vitro (C, D). (A) Readthrough was determined in a region 0–10 kb downstream of genes for a subset of genes that were at least 10 kb away from the nearest neighboring gene (n = 684 regions). The log2 ratio of readthrough to gene expression is plotted across five age groups (adolescent n = 32, young n = 31, middle-aged n = 22, old n = 37, and very old n = 21). (B) As in (A), but data are plotted on a per sample basis. (C) Readthrough was determined in a region 0–10 kb downstream of genes for a subset of genes that were at least 10 kb away from the nearest neighboring gene (n = 1045 regions). The log2 ratio of readthrough to gene expression is plotted for the groups comprising senescence (n = 12) and the non-senescent group (n = 6). (D) As in (D), but data are plotted on a per sample basis and for additional control datasets (serum-starved, immortalized, intermediate passage, and early passage). N = 3 per group.

Given the proximity of significantly changed transposons and genes, this suggested that an age-related increase in transcriptional readthrough could explain elevated transposon expression. To test this, we next quantified readthrough in a 10- to 20-kb region downstream of genes. We chose a region not immediately adjacent to genes to avoid biases through basal levels of readthrough, although the data were comparable for closer distances (Figure 3—figure supplement 1).

Indeed, readthrough transcription was increased with aging (Figure 4A, B), and in the senescence dataset, levels of readthrough transcription increased with replicative or chemically induced senescence (Figure 4D, E). Transposon expression correlated with readthrough levels on a per sample basis in both the aging and the senescence dataset (Figure 4C, F).

Figure 4. Increased readthrough levels with aging and senescence.

Readthrough transcription is increased in fibroblasts isolated from the very old (A, B) and after induction of senescence in vitro (D, E). Readthrough transcription is correlated with transposon expression (C, F). (A) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that were at least 20 kb away from the nearest neighboring gene (n = 200 regions). The log2 ratio of readthrough to gene expression is plotted across five age groups (adolescent n = 32, young n = 31, middle-aged n = 22, old n = 37, and very old n = 21). (B) As in (A), but data are plotted on a per sample basis. (C) Scatterplot between transposon expression and readthrough levels (normalized as in A and Figure 1) for all 143 samples. Each sample is colored by age. (D) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that were at least 20 kb away from the nearest neighboring gene (n = 299 regions). The log2 ratio of readthrough to gene expression is plotted across four senescent conditions (H2O2, 5-Aza, adriamycin, and replicative senescence) and for early passage cells. N = 3 per group. (E) As in (D), but data are plotted on a per sample basis and for additional control datasets (serum-starved, immortalized, intermediate passage, and early passage). N = 3 per group. (F) Scatterplot between transposon expression and readthrough levels (normalized as in A and Figure 1) for all 24 samples. Each sample is colored by senescence status.

Figure 4.

Figure 4—figure supplement 1. Readthrough transcription might be increased with age in mouse liver, but not during in vitro senescence of mouse fibroblasts.

Figure 4—figure supplement 1.

Readthrough transcription is non-significantly increased with age in mouse liver (A, B) and this increase is not attenuated by dietary restriction (B). In contrast, there is no increase in readthrough when mouse fibroblasts undergo replicative senescence (C, D). (A) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that were at least 20 kb away from the nearest neighboring gene (n = 474 genes). The log2 ratio of readthrough to gene expression is plotted for liver samples from 5- and 26-month-old mice (n = 6 per group). (B) As in (A), but data are plotted on a per-sample-basis (n = 3 per group). (C) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that were at least 20 kb away from the nearest neighboring gene (n = 119 genes). The log2 ratio of readthrough to gene expression is plotted across five age groups (n = 4 per group). (D) As in (C), but data are plotted on a per-sample-basis.
Figure 4—figure supplement 2. Readthrough is increased downstream of genes over a wide range of distances.

Figure 4—figure supplement 2.

Transcriptional readthrough observed with aging or cellular senescence is elevated over a wide region downstream of genes. Readthrough levels were determined for each 10-kb interval downstream of genes, in the range between 0 and 100 kb using the same approach as in Figure 4. Values for each gene and treatment group were log-transformed and pooled. The difference between treatment (either senescence or aging) and controls is shown here. Higher numbers indicate stronger readthrough. The senescence group comprises data from H2O2 treatment, adriamycin, 5-azacytidine, and replicative senescence. The aging group comprises data from the very old subgroup (85–96 years-old) and was compared to the middle-aged group.
Figure 4—figure supplement 3. Read-in does not change with age and is increased by senescence.

Figure 4—figure supplement 3.

Read-in expression is unchanged with aging (A) whereas it is increased with cellular senescence (B). (A) Read-in normalized by the expression of adjacent genes plotted across five age groups (adolescent n = 32, young n = 31, middle-aged n = 22, old n = 37, and very old n = 21). (B) Read-in normalized by the expression of adjacent genes plotted across four senescent conditions (H2O2, 5-azacytidine, adriamycin, and replicative senescence) and four other conditions (serum-starved, immortalized, intermediate passage, and early passage). N = 3 per group.
Figure 4—figure supplement 4. Readthrough and transposon counts are higher closer to genes.

Figure 4—figure supplement 4.

Readthrough counts (rt_counts) decrease exponentially downstream of genes, both in the aging dataset (A) and in the cellular senescence dataset (B). Although noisier, the pattern for transposon counts (transp_cum_counts) is similar with higher counts closer to gene terminals, both in the aging dataset (C) and in the cellular senescence dataset (D). Readthrough counts are the cumulative counts across all genes and samples. Readthrough was determined in 10 kb bins and the values are assigned to the midpoint of the bin for easier plotting. Transposon counts are the cumulative counts across all samples for each transposon that did not overlap a neighboring gene. n = 801 in (C) and n = 3479 in (D).
Figure 4—figure supplement 5. Age-related transposon expression downstream of high readthrough genes is elevated.

Figure 4—figure supplement 5.

Transposons found downstream of genes with high readthrough (hi_RT) show a more pronounced log-fold change (transp_logfc) than transposons downstream of genes with low readthrough (low_RT). This is true in fibroblasts isolated from aged donors (A) and with cellular senescence (B). Furthermore, the difference between high and low readthrough region transposons is diminished for transposons that are more than 10 kb downstream of genes (Transp >10 kb). Transposons in high readthrough regions were defined as those in the top 20% of readthrough log-fold change. Readthrough was measured between 0 and 10 kb downstream from genes. n = 2124 transposons in (A) and n = 6061 transposons in (B) included in the analysis.
Figure 4—figure supplement 6. Intergenic transposons are rarely expressed.

Figure 4—figure supplement 6.

Total counts are the sum of all counts from transposons located in introns, genes, downstream (ds), or upstream (us) of genes (distance to gene <25 kb) or in intergenic regions (distance to gene >25 kb). Counts were defined as cumulative counts across all samples.
Figure 4—figure supplement 7. Age-related changes in readthrough are correlated with changes in transposon expression.

Figure 4—figure supplement 7.

Readthrough and transposon expression are correlated. For every transposon downstream of a gene, the correlation between the log-fold change (logFC) of the transposon and the adjacent readthrough region is plotted. Readthrough was measured in a 10- to 20-kb region downstream of the gene. (A) Log-fold changes for all significant transposons plotted against log-fold changes for all significant, adjacent readthrough regions in the aging dataset. (B) Log-fold changes for all significant transposons plotted against log-fold changes for all significant, adjacent readthrough regions in the senescence dataset. (C) Log-fold changes for all transposons plotted against log-fold changes for all adjacent readthrough regions in the aging dataset. (D) Log-fold changes for all transposons plotted against log-fold changes for all adjacent readthrough regions in the senescence dataset.
Figure 4—figure supplement 8. Positive correlation between readthrough counts and transposons downstream of genes.

Figure 4—figure supplement 8.

Readthrough counts are correlated with the counts of transposons downstream of genes in the aging dataset (A) and in the senescence dataset (B). In contrast, readthrough counts do not correlate with counts of transposons upstream of genes. Readthrough counts also show no correlation with randomized transposon counts (labeled ‘rand’). Pearson correlation was performed for each sample in these datasets and the R-values for all the samples are shown here. Readthrough and transposons were restricted to a region 10-kb up- or downstream of genes.

As shown before for intron retention, these findings may not translate to other species. We found that readthrough was unchanged in the liver of 26-month-old mice (p = 0.22, Figure 4—figure supplement 1A). In addition, readthrough decreased with replicative senescence of mouse fibroblasts, contrary to our expectations (Figure 4—figure supplement 1C, D).

Next, we quantified readthrough across a 100-kb region downstream of genes. We found that both aging and senescence lead to readthrough across the whole region, although senescence was associated with more extensive readthrough further downstream of genes (Figure 4—figure supplement 2, Figure 3B). As another line of evidence we quantified transcription upstream of genes, which is called read-in. This kind of transcription is often attributed to widespread readthrough extending from the end of a gene to the start of another (Roth et al., 2020). Consistent with the extensive readthrough levels observed we found elevated read-in transcription during cellular senescence (Figure 4—figure supplement 3A) but not aging (Figure 4—figure supplement 3B).

Transposon counts followed a similar decaying pattern to readthrough counts downstream of genes (Figure 4—figure supplement 4) and genes with high age-related readthrough were associated with strong age-related upregulation of gene proximal transposons (Figure 4—figure supplement 5). Intergenic transposons showed such low expression levels that their contribution to total counts was negligible (Figure 4—figure supplement 6).

Having shown that samples with higher levels of readthrough also have higher transposon expression (per sample correlation), we asked whether transposons are more often than expected located in readthrough regions (per locus correlation). Indeed as expected, in both datasets, transposons significantly changed with age or senescence were more often found in significant readthrough regions compared to all transposons. Moreover, there was a significant correlation between the log-fold changes of transposons downstream of genes with the log-fold changes of readthrough downstream of the same genes (Figure 4—figure supplement 7). Transposons located elsewhere showed a weaker correlation with readthrough.

To further test whether these correlations between log-fold changes are also reflected on the count level, we determined the correlation between readthrough counts and the counts of transposons downstream of genes for each sample individually. Transposons located upstream of genes and randomized transposons served as a negative control. Consistent with the above data, we found a specific correlation between transposon and readthrough counts in both datasets (Figure 4—figure supplement 8).

Finally, we asked whether readthrough levels are predictive of transposon expression on a per sample basis independently of intron retention. Using the aging dataset with 143 samples we compared a linear model just including intron retention with one also including readthrough and found the second model to perform significantly better (p < 0.0001). In addition, we found that the correlation between readthrough levels and transposon expression remained significant even when correcting for intron retention using partial correlation (R = 0.53, p < 0.0001). We did not test this in the senescence dataset due to its limited size.

Readthrough is a causal inducer of transposon expression

Based on this correlative evidence (Figure 4), we hypothesized that specific inducers of readthrough would also increase transposon expression. To test this, we reanalyzed literature data from studies involving multiple treatments that promote readthrough. These included viral infection, hyperosmotic stress (KCl treatment), heat shock, and oxidative stress.

We found that KCl treatment of HEK293 cells for 1 hr (Rosa-Mercado et al., 2021) indeed strongly induced both readthrough (Figure 5A) and transposon expression (Figure 5B) as determined by RNA-seq and TTLseq. Consistent with this, in our reanalysis of a study using mNET-seq profiling of nascent mRNA (Bauer et al., 2018), KCl treatment of HEK293 cells also induced readthrough (Figure 5D) and transposon expression (Figure 5E), whereas influenza infection caused an intermediate phenotype. Finally, we analyzed RNA-seq data from NIH-3T3 mouse fibroblasts subjected to three stressors over 2 hr (Vilborg et al., 2017). Heatshock caused extensive readthrough (Figure 5—figure supplement 1A) and transposon expression (Figure 5—figure supplement 1B), whereas hydrogen peroxide treatment and KCl caused moderate expression of these.

Figure 5. Elevated transposon expression after induced readthrough.

Readthrough transcription is increased after KCl treatment of HEK293 cells (A, D) and to a lesser extent during influenza infection (D). These readthrough inducing treatments also promote increased transposon expression (B, E). Giving rise to a correlation between readthrough and transposon expression on a per sample basis (C, F). Data in (A–C) are from Rosa-Mercado et al., 2021. In this study, HEK293 cells were subjected to 1 hr of hyperosmotic stress (KCl) after which RNA-seq and TT-TL-seq were performed. Data in (D–F) are from Bauer et al., 2018. In this study, HEK293 cells were subjected to 1 hr of hyperosmotic stress (KCl) after which mNETseq was performed. (A) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that was at least 20 kb away from the nearest neighboring gene (n = 1420 regions). (B) To normalize transposon expression counts for each transposon were corrected for the expression of the nearest gene. Normalized transposon counts for each sample are shown as box-whisker plot. (C) Normalized transposon counts (as in B) and readthrough levels are plotted for each sample (n = 12). (D) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that was at least 20 kb away from the nearest neighboring gene (n = 2222 regions). (E) To normalize transposon expression counts for each transposon were corrected for the expression of the nearest gene. Normalized transposon levels for each sample are shown as box-whisker plot. (F) Normalized transposon counts (as in B) and readthrough counts are plotted for each sample (n = 22). The shapes indicate the cell type (circles = HEK293, triangles = A549).

Figure 5.

Figure 5—figure supplement 1. Elevated transposon expression after induced readthrough.

Figure 5—figure supplement 1.

Readthrough transcription is increased after heatshock treatment of NIH-3T3 fibroblasts and to a lesser extend after KCl and hydrogen peroxide (A). These readthrough inducing treatments also promote increased transposon expression (B). Therefore readthrough and transposon expression are correlated on a per-sample-basis (C). Data from Vilborg et al., 2017. (A) Readthrough was determined in a region 10- to 20-kb downstream of genes for a subset of genes that was at least 20 kb away from the nearest neighboring gene (n = 732 genes). (B) To normalize transposon expression counts for each transposon were corrected for the expression of the nearest gene. Normalized transposon counts for each sample are shown as box-whisker plot. (C) Normalized transposon counts (as in B) and readthrough counts (medians) are plotted for each sample (n = 8).
Figure 5—figure supplement 2. Heatshock and osmotic stress promote expression of gene proximal transposons.

Figure 5—figure supplement 2.

Specific readthrough induction leads to stronger differential expression of gene proximal transposons (A) and stronger upregulation of extragenic transposons (B) as compared with cellular senescence and aging, which are associated with more modest readthrough. Differential expression was performed for KCl treated HEK293 cells vs control (mNETseq; Bauer et al., 2018), 44°C heatshocked NIH-3T3 vs control (RNA-seq; Vilborg et al., 2017), KCl treated HEK293 cells vs control (RNA-seq; Rosa-Mercado et al., 2021), senescent cells and aged fibroblasts (as detailed in Figure 1). The fraction of all genic and gene proximal transposons (distance <25 kb) among differentially regulated transposons is shown in (A); the remaining transposons are intergenic (distance >25 kb). The fraction of all extra-genic transposons among the significantly upregulated transposons is shown in (B).

We also reasoned that readthrough should specifically increase the expression of transposons that are close to genes. Consistent with this notion, induced readthrough led to preferential expression of gene proximal transposons (i.e. those within 25 kb of genes), when compared with senescence or aging (Figure 5—figure supplement 2A). Moreover, the fraction of significantly upregulated transposons found outside of genes was higher with induced readthrough than with senescence or aging (Figure 5—figure supplement 2B).

Increased expression of autonomous LINE-1 elements with senescence but not aging

Many expressed LINE-1 elements are found within genes, which poses a challenge for the quantification of active LINE-1s. Given this proximity it is unsurprising that a substantial correlation between the expression of transposons and neighboring genes (Figure 6) is seen in RNA-seq.

Figure 6. Increased expression of active LINE-1 elements with senescence but not aging.

Figure 6.

A scatterplot between the log2-fold change of every transposon (p < 0.05) differentially expressed with age or senescence and the log2-fold change of the nearest gene. The percentage of active LINE-1s among transposons with higher-than-expected expression levels is unchanged in aging (A) and increased during senescence (B). For the tables, we compared the percentage of active LINE-1s among all LINE-1 elements for 250 transposons with higher-than-expected expression levels (in orange; labeled ‘outliers’), all significantly changed transposons (blue plus orange; labeled ‘significant’), and all transposons passing read filtering (not plotted; labeled ‘all expressed’). The p-value is based on the comparison with ‘all expressed’.

To mitigate this problem, we selected 250 transposons that showed higher than expected expression levels compared to the nearest gene. We reasoned that LINE-1s with a functional promoter (‘active’ LINE-1s) would be more likely autonomous and should be enriched among transposons that show a divergent expression profile from their neighboring genes.

We found no enrichment for active LINE-1 elements in the aging dataset (Figure 6A). In contrast, transposons significantly upregulated with senescence and, among those, especially the ones with higher-than-expected expression, showed significant enrichment for active LINE-1s (Figure 6B). Moreover, we found that almost 100% of active LINE-1s were increased with senescence which was higher than for all LINE-1s and other transposons.

Finally, we performed the same analysis with a different definition of active LINE-1s as those potentially encoding functional ORF1p and ORF2p. In this analysis, we included all elements encoding an open reading frame >900 nt with high sequence similarity to ORF1p and >3400 nt with high sequence similarity to ORF2p (p < 0.05 for pairwise alignment). Out of 122,626 annotated LINE-1s 585 encoded a putative ORF1p and 133 encoded an ORF2p.

ORF1p-encoding LINE-1s were significantly enriched among transposons with higher-than-expected expression and among transposons significantly changed with senescence or aging. These findings were more robust for senescence than for aging. ORF2p-encoding LINE-1s, in contrast, were too rare to be studied reliably as to their enrichment.

Discussion

We show that transposons would serve as a useful biomarker of aging, even if most of the transposon signal in RNA-seq was due to co-expression with nearby transcriptional units, because they capture different kinds of transcriptional dysfunction.

Introns

Intron retention is one prominent example of dysfunctional splicing during aging, but the association with age-related transposon expression has not been evaluated. A recent study by Gualandi et al., 2022 using RNA-seq data from the 1000 Genomes project found a strong per sample correlation between intron and transposon expression in the general population. In line with these results, we showed that intron retention may explain a large fraction of transposon reads in aging and senescence datasets (Figure 1).

Readthrough, read-in, and transposons

In a study of oncogene-induced senescence, Muniz et al., 2017 found increased readthrough transcription at 91 so-called START loci, which was associated with suppression of neighboring genes through the production of antisense RNAs and several of these antisense RNAs were necessary for the maintenance of the senescent state. Our data are consistent with their model, although we find even more extensive readthrough transcription. Analyzing hundreds of loci we show that senescence promotes transcriptional readthrough leading to the overexpression of intergenic transposons (>25 kb from the nearest gene) and read-in at neighboring genes (Figure 3B, Figure 4—figure supplement 3B). In contrast, aging-related readthrough was qualitatively different, producing shorter readthrough and no read-in (Figure 3B, Figure 4—figure supplement 3A).

Our correlative data suggested that readthrough transcription could account for the expression of many transposons that are outside of genes and thus cannot be explained by intron retention (Figure 4C, F). To test this hypothesis, we reanalyzed data from models of induced readthrough showing that readthrough is indeed a causal inducer for the expression of gene proximal transposons after hyperosmotic stress and viral infection (Figure 5). Given the technical challenges of measuring readthrough and the unreliability of intron retention detection tools (David et al., 2022), our results are likely underestimating the true magnitude of correlation between readthrough and transposon expression.

In a previous study, it was observed that senescence leads to increased expression of transposons and of genes with low abundances (Zhang et al., 2021). This ‘leakage’ was attributed to loss of H3K9me3 histone marks and increased heterochromatin accessibility, but is also consistent with readthrough transcription, as we observed. Indeed, it has been suggested that readthrough is the most likely cause for apparent intergenic transcription (Agostini et al., 2021).

Is there genuine expression of autonomous transposons during aging?

The distribution of transposons showing apparent age-related expression within the genome is an important question since intragenic transposons are more likely to be co-expressed with genes. Using Sanger sequencing of long-range amplicons to study replicative senescence, De Cecco et al., 2019 found a large fraction of transposons mapping to intergenic (69%) regions and few to intragenic/gene proximal regions (31%). In contrast using RNA-seq data, we find that most transposons are located within genes (≥70%; Table 1), although, there are more intergenic transposons significantly changed with senescence than with aging (Figure 3). One explanation for this discrepancy is the mapping approach used. Since their analysis did not use an expectation–maximization algorithm to map back transposons onto the genome, it may underestimate the number of intragenic and gene-adjacent transposons in contrast to our analysis using TELocal.

Given that inactive LINE-1 loci greatly outnumber active loci (Deininger et al., 2017; Stow et al., 2021), and we saw only limited evidence for active LINE-1 transcription in fibroblasts isolated from donors of varying ages (Figure 6A), our data appear to question the role of transposons in organismal aging. An intriguing explanation is provided by De Cecco et al., 2019, who found that LINE-1 ORF1 protein-positive cells colocalize with a small subset of senescent cells. Most likely, such changes would not be obvious from bulk sequencing, perhaps explaining why we did not see elevated LINE-1 expression in the aging dataset. In contrast, the signal would be readily detectable in cultures induced into senescence where most cells do enter senescence, which is consistent with our findings (Figure 6B).

As for another explanation, we speculate that LINE-1 elements expressed through intron retention and readthrough are compatible with translation via leaky scanning or re-initiation (Mouilleron et al., 2016), accounting for increased LINE-1 ORF1 protein levels detected in senescent cells, even if leaky scanning is orders of magnitude less efficient than normal initiation.

We did not study the expression of LTR or SINE elements in detail, which have been implicated in organismal aging (Liu et al., 2023) and age-related diseases such as macular degeneration (de Koning et al., 2011), because we lack consensus annotations for active and inactive members of these families.

Summary and limitations

Taken together, our work and existing literature suggest that RNA-seq, and by extension qPCR, of transposable elements overestimates their true expression and their age-related changes. Such data alone are insufficient to support a ‘transposon hypothesis of aging’ that postulates an increase in autonomous expression of such elements. Instead, a concerted increase in transposon expression, intron retention, and transcriptional readthrough points to a global increase in transcriptional dysfunction during aging and senescence, at least in human cells. Nevertheless, there are several other techniques such as immunoblotting for ORF1 protein, rapid amplification of cDNA ends (RACE), or genomic PCR that may provide evidence in favor of age-related reactivation of potentially autonomous transposons, specifically LINE-1 elements (De Cecco et al., 2019).

Furthermore, since most of our analysis focused on cellular models of aging and senescence, we cannot exclude that RNA-seq data performed on tissues samples would more faithfully and specifically capture the expression of autonomous transposable elements.

Although we have shown increased transcriptional dysfunction with aging and cellular senescence, so far we lack a model to explain these findings. In this context, it is noteworthy that the elongation rate of Pol II is correlated with readthrough transcription (Fong et al., 2015), intron processing, and may increase with aging (Debès et al., 2023), potentially providing a unifying explanation. Another potential explanation that could account for both elevated readthrough and expression of intergenic transposon is loss of repressive heterochromatin (Zhang et al., 2021) during aging or senescence. We were unable to test this hypothesis directly since we only had access to RNA-seq data for these samples.

Methods

Dataset selection, read preprocessing and alignment

We searched the SRA for suitable datasets that used total RNA extraction because this improves the detection of non-canonical transcripts that may be degraded before successful polyadenylation. Publicly available human RNA-seq datasets of aging and senescence, respectively, were obtained from Colombo et al., 2018 (GSE60340) and Fleischer et al., 2018 (GSE113957). Human RNA-seq and TT-TimeLapse-seq (TT-TL-seq) data were obtained from Rosa-Mercado et al., 2021 (GSE152059), human native elongating transcript sequencing (mNET-seq) data from Bauer et al., 2018 (PRJNA432639), and mouse RNA-seq data from Vilborg et al., 2017 (GSE98906).

Mouse liver RNA-seq data from ad libitum and dietary restricted mice were obtained from Hahn et al., 2017 (GSE92486) and mouse replicative senescence data from Wang et al., 2022 (GSE179880).

Reads were filtered, adaptor-trimmed and repaired using fastp (Chen et al., 2018) with a min Phred score of 25. Afterwards reads passing these filters were aligned to the human genome (GRCh38) or mouse genome (GRCm39) using STAR 2.7.8a (Dobin et al., 2013) in single-pass mode. To optimize the quantification of transposons the STAR parameters --outFilterMultimapNmax and --winAnchorMultimapNmax were set to 100. The full pipeline with all the configuration files is available on github (https://github.com/pabisk/aging_transposons copy archived at Pabis, 2024).

Detection of readthrough and intron retention

We employed a modified Automatic Readthrough Transcription Detection (ARTDeco) pipeline, using their provided annotations (Roth et al., 2020) but with a different quantification approach. Readthrough candidate regions in the ARTDeco pipeline were defined based on the longest transcript to exclude the effects of differential 3′-UTR usage. Genic and readthrough reads were counted using featureCounts from the Rsubread package in 10 kb bins downstream of genes, in a window between 0 and 100 kb. Readthrough regions with low coverage, in close proximity to, or overlapping, nearby protein-coding genes were excluded from the analysis. Intron retention was detected using the intron REtention Analysis and Detector (iREAD), which is a tool that detects retention events based on both splice junction reads and intron expression levels (Li et al., 2020). All samples were treated as unstranded.

Locus-specific detection of transposon reads and transcripts

We used TELocal v1.1.1 (mhammell-laboratory/TElocal) to quantify transposon reads, which is functionally similar to TETranscripts (Jin et al., 2015) except it provides locus-specific expression levels. Both tools use an expectation maximization algorithm to assign ambiguous reads. Annotation files were downloaded from the mhammell lab and from genecode (v39 for human and M27 for mouse). TELocal was also used to quantify transcriptomic counts. Again, all samples were treated as unstranded.

Expression and normalization of genes, introns, transposons, and readthrough transcripts

Transcripts and elements with low expression counts were filtered. After filtering, intron, readthrough, read-in, or transposon elements were combined with transcript counts for the analysis in DESeq2. For the aging fibroblast dataset age was normalized and centered and the design model considered age + gender + ethnicity. For the induced senescence dataset all four inducers were included together and the remaining treatment groups were considered non-senescent.

In order to study expression of these elements unbiased by gene co-expression, we selected the top 1000 overexpressed elements and divided their read counts by the read counts of the nearest gene. When not specified in the annotation the nearest genes or elements were identified with the bedtools closest or intersect function (Quinlan and Hall, 2010), as appropriate. Finally, to provide a single expression value for each sample we used the mean log of the normalized expression values.

Expression of potentially active and protein-coding LINE-1 elements

We defined potentially active LINE-1 elements as those that have a functional promoter (McKerrow and Fenyö, 2020). Whereas to define potentially protein-coding LINE-1 elements we predicted open reading frames for each annotated element using the predORF function from the systemPipeR package. Global pairwise alignment to ORF1p (uniprot: Q9UN81) and ORF2p (uniprot: O00370) was performed and only elements with a significant alignment were included for further analyses (p < 0.05).

Acknowledgements

We would like to thank Juliane Liepe, Marco Malavolta, and Chin-Tong Ong for support and/or their help in improving this manuscript and VitaDAO for financial support.

Funding Statement

The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.

Contributor Information

Kamil Pabis, Email: Kamil.pabis@gmail.com.

Pablo A Manavella, Universidad Nacional del Litoral-CONICET, Argentina.

Kathryn Song Eng Cheah, University of Hong Kong, Hong Kong.

Funding Information

This paper was supported by the following grant:

  • National University of Singapore NUHSRO/2020/114 to Brian K Kennedy.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Investigation, Methodology, Writing - original draft.

Conceptualization, Writing - review and editing.

Methodology, Writing - review and editing.

Conceptualization, Supervision, Methodology, Writing - review and editing.

Formal analysis, Supervision, Writing - review and editing.

Conceptualization, Supervision, Writing - review and editing.

Additional files

MDAR checklist
Supplementary file 1. Supplementary figures and tables.
elife-87811-supp1.docx (28KB, docx)

Data availability

The current manuscript is a computational study, so no data have been generated for this manuscript.

The following previously published datasets were used:

Purcell M, Tainsky M, Kruger A. 2015. Pathway Profiling of Replicative and Induced Senescence. NCBI Gene Expression Omnibus. GSE60340

Fleisher JG, Schulte R, Tsai H, Tyagi S, Ibarra A, Shokhirev MN, Huang L, Hetzer MW, Navlakha S. 2018. Predicting age from the transcriptome of human dermal fibroblasts. NCBI Gene Expression Omnibus. GSE113957

Rosa-Mercado NA, Zimmer JT, Apostolidi M, Rinehart J, Simon MD, Steitz JA. 2021. TT-TL-seq reveals transcriptional profiles that accompany DoG induction after hyperosmotic stress. NCBI Gene Expression Omnibus. GSE152059

Bauer et al. 2018. Meromictic Lake Metagenome. NCBI BioProject. PRJNA257655

Vilborg A, Shalgi R. 2017. Comparative analysis reveals genomic features of stress-induced transcriptional readthrough. NCBI Gene Expression Omnibus. GSE98906

Hahn et al. 2017. Dietary restriction protects from age-associated DNA methylation and induces epigenetic reprogramming of lipid metabolism. NCBI Gene Expression Omnibus. GSE92486

Wang Y, Liu L, Song Y. 2022. Unveiling E2F4, TEAD1 and AP-1 as regulatory transcription factors of the replicative senescence program by multi-omics analysis. NCBI Gene Expression Omnibus. GSE179880

References

  1. Adusumalli S, Ngian ZK, Lin WQ, Benoukraf T, Ong CT. Increased intron retention is a post-transcriptional signature associated with progressive aging and Alzheimer’s disease. Aging Cell. 2019;18:e12928. doi: 10.1111/acel.12928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Agostini F, Zagalak J, Attig J, Ule J, Luscombe NM. Intergenic RNA mainly derives from nascent transcripts of known genes. Genome Biology. 2021;22:136. doi: 10.1186/s13059-021-02350-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bauer DLV, Tellier M, Martínez-Alonso M, Nojima T, Proudfoot NJ, Murphy S, Fodor E. Influenza virus mounts a two-pronged attack on host RNA polymerase II transcription. Cell Reports. 2018;23:2119–2129. doi: 10.1016/j.celrep.2018.04.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Colombo AR, Elias HK, Ramsingh G. Senescence induction universally activates transposable element expression. Cell Cycle. 2018;17:1846–1857. doi: 10.1080/15384101.2018.1502576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. David JK, Maden SK, Wood MA, Thompson RF, Nellore A. Retained introns in long RNA-Seq reads are not reliably detected in sample-matched short reads. bioRxiv. 2022 doi: 10.1101/2022.03.11.484016. [DOI] [PMC free article] [PubMed]
  7. Debès C, Papadakis A, Grönke S, Karalay Ö, Tain LS, Mizi A, Nakamura S, Hahn O, Weigelt C, Josipovic N, Zirkel A, Brusius I, Sofiadis K, Lamprousi M, Lu YX, Huang W, Esmaillie R, Kubacki T, Späth MR, Schermer B, Benzing T, Müller RU, Antebi A, Partridge L, Papantonis A, Beyer A. Ageing-associated changes in transcriptional elongation influence longevity. Nature. 2023;616:814–821. doi: 10.1038/s41586-023-05922-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. De Cecco M, Ito T, Petrashen AP, Elias AE, Skvir NJ, Criscione SW, Caligiana A, Brocculi G, Adney EM, Boeke JD, Le O, Beauséjour C, Ambati J, Ambati K, Simon M, Seluanov A, Gorbunova V, Slagboom PE, Helfand SL, Neretti N, Sedivy JM. L1 drives IFN in senescent cells and promotes age-associated inflammation. Nature. 2019;566:73–78. doi: 10.1038/s41586-018-0784-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Deininger P, Morales ME, White TB, Baddoo M, Hedges DJ, Servant G, Srivastav S, Smither ME, Concha M, DeHaro DL, Flemington EK, Belancio VP. A comprehensive approach to expression of L1 loci. Nucleic Acids Research. 2017;45:e31. doi: 10.1093/nar/gkw1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. de Koning APJ, Gu W, Castoe TA, Batzer MA, Pollock DD. Repetitive elements may comprise over two-thirds of the human genome. PLOS Genetics. 2011;7:e1002384. doi: 10.1371/journal.pgen.1002384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Deniz Ö, Frost JM, Branco MR. Regulation of transposable elements by DNA modifications. Nature Reviews. Genetics. 2019;20:417–431. doi: 10.1038/s41576-019-0106-6. [DOI] [PubMed] [Google Scholar]
  12. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fleischer JG, Schulte R, Tsai HH, Tyagi S, Ibarra A, Shokhirev MN, Huang L, Hetzer MW, Navlakha S. Predicting age from the transcriptome of human dermal fibroblasts. Genome Biology. 2018;19:221. doi: 10.1186/s13059-018-1599-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fong N, Brannan K, Erickson B, Kim H, Cortazar MA, Sheridan RM, Nguyen T, Karp S, Bentley DL. Effects of Transcription Elongation Rate and Xrn2 Exonuclease Activity on RNA Polymerase II Termination Suggest Widespread Kinetic Competition. Molecular Cell. 2015;60:256–267. doi: 10.1016/j.molcel.2015.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Ghanam AR, Cao J, Ouyang X, Song X. New Insights into Chronological Mobility of Retrotransposons In Vivo. Oxidative Medicine and Cellular Longevity. 2019;2019:2818415. doi: 10.1155/2019/2818415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gorbunova V, Seluanov A, Mita P, McKerrow W, Fenyö D, Boeke JD, Linker SB, Gage FH, Kreiling JA, Petrashen AP, Woodham TA, Taylor JR, Helfand SL, Sedivy JM. The role of retrotransposable elements in ageing and age-associated diseases. Nature. 2021;596:43–53. doi: 10.1038/s41586-021-03542-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gualandi N, Iperi C, Esposito M, Ansaloni F, Gustincich S, Sanges R. Meta-Analysis Suggests That Intron Retention Can Affect Quantification of Transposable Elements from RNA-Seq Data. Biology. 2022;11:826. doi: 10.3390/biology11060826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hadar S, Meller A, Saida N, Shalgi R. Stress-induced transcriptional readthrough into neighboring genes is linked to intron retention. iScience. 2022;25:105543. doi: 10.1016/j.isci.2022.105543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hahn O, Grönke S, Stubbs TM, Ficz G, Hendrich O, Krueger F, Andrews S, Zhang Q, Wakelam MJ, Beyer A, Reik W, Partridge L. Dietary restriction protects from age-associated DNA methylation and induces epigenetic reprogramming of lipid metabolism. Genome Biology. 2017;18:56. doi: 10.1186/s13059-017-1187-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jin Y, Tam OH, Paniagua E, Hammell M. TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets. Bioinformatics. 2015;31:3593–3599. doi: 10.1093/bioinformatics/btv422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kazazian HH, Moran JV. Mobile DNA in health and disease. The New England Journal of Medicine. 2017;377:361–370. doi: 10.1056/NEJMra1510092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Khristich AN, Mirkin SM. On the wrong DNA track: Molecular mechanisms of repeat-mediated genome instability. The Journal of Biological Chemistry. 2020;295:4134–4170. doi: 10.1074/jbc.REV119.007678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. LaRocca TJ, Cavalier AN, Wahl D. Repetitive elements as a transcriptomic marker of aging: evidence in multiple datasets and models. Aging Cell. 2020;19:e13167. doi: 10.1111/acel.13167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Li HD, Funk CC, Price ND. iREAD: a tool for intron retention detection from RNA-seq data. BMC Genomics. 2020;21:128. doi: 10.1186/s12864-020-6541-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Liu X, Liu Z, Wu Z, Ren J, Fan Y, Sun L, Cao G, Niu Y, Zhang B, Ji Q, Jiang X, Wang C, Wang Q, Ji Z, Li L, Esteban CR, Yan K, Li W, Cai Y, Wang S, Zheng A, Zhang YE, Tan S, Cai Y, Song M, Lu F, Tang F, Ji W, Zhou Q, Belmonte JCI, Zhang W, Qu J, Liu GH. Resurrection of endogenous retroviruses during aging reinforces senescence. Cell. 2023;186:287–304. doi: 10.1016/j.cell.2022.12.017. [DOI] [PubMed] [Google Scholar]
  26. McKerrow W, Fenyö D. L1EM: a tool for accurate locus specific LINE-1 RNA quantification. Bioinformatics. 2020;36:1167–1173. doi: 10.1093/bioinformatics/btz724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Mouilleron H, Delcourt V, Roucou X. Death of a dogma: eukaryotic mRNAs can code for more than one protein. Nucleic Acids Research. 2016;44:14–23. doi: 10.1093/nar/gkv1218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Muniz L, Deb MK, Aguirrebengoa M, Lazorthes S, Trouche D, Nicolas E. Control of gene expression in senescence through transcriptional read-through of convergent protein-coding genes. Cell Reports. 2017;21:2433–2446. doi: 10.1016/j.celrep.2017.11.006. [DOI] [PubMed] [Google Scholar]
  29. Pabis K. Triplex and other DNA motifs show motif-specific associations with mitochondrial DNA deletions and species lifespan. Mechanisms of Ageing and Development. 2021;194:111429. doi: 10.1016/j.mad.2021.111429. [DOI] [PubMed] [Google Scholar]
  30. Pabis K. Aging_Transposons. swh:1:rev:408f6c28826dee3dcb6895a2ce562fe036de372dSoftware Heritage. 2024 https://archive.softwareheritage.org/swh:1:dir:c721a387602eba72db0a1f63cc8d0b057cb0256b;origin=https://github.com/pabisk/aging_transposons;visit=swh:1:snp:5af3eecc20f11af17e3558be1630a11b213dbed9;anchor=swh:1:rev:408f6c28826dee3dcb6895a2ce562fe036de372d
  31. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Rodriguez-Martin B, Alvarez EG, Baez-Ortega A, Zamora J, Supek F, Demeulemeester J, Santamarina M, Ju YS, Temes J, Garcia-Souto D, Detering H, Li Y, Rodriguez-Castro J, Dueso-Barroso A, Bruzos AL, Dentro SC, Blanco MG, Contino G, Ardeljan D, Tojo M, Roberts ND, Zumalave S, Edwards PA, Weischenfeldt J, Puiggròs M, Chong Z, Chen K, Lee EA, Wala JA, Raine KM, Butler A, Waszak SM, Navarro FCP, Schumacher SE, Monlong J, Maura F, Bolli N, Bourque G, Gerstein M, Park PJ, Wedge DC, Beroukhim R, Torrents D, Korbel JO, Martincorena I, Fitzgerald RC, Van Loo P, Kazazian HH, Burns KH, PCAWG Structural Variation Working Group. Campbell PJ, Tubio JMC, PCAWG Consortium Pan-cancer analysis of whole genomes identifies driver rearrangements promoted by LINE-1 retrotransposition. Nature Genetics. 2020;52:306–319. doi: 10.1038/s41588-019-0562-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Rosa-Mercado NA, Zimmer JT, Apostolidi M, Rinehart J, Simon MD, Steitz JA. Hyperosmotic stress alters the RNA polymerase II interactome and induces readthrough transcription despite widespread transcriptional repression. Molecular Cell. 2021;81:502–513. doi: 10.1016/j.molcel.2020.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rosa-Mercado NA, Steitz JA. Who let the DoGs out? - biogenesis of stress-induced readthrough transcripts. Trends in Biochemical Sciences. 2022;47:206–217. doi: 10.1016/j.tibs.2021.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Roth SJ, Heinz S, Benner C. ARTDeco: automatic readthrough transcription detection. BMC Bioinformatics. 2020;21:214. doi: 10.1186/s12859-020-03551-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Scotti MM, Swanson MS. RNA mis-splicing in disease. Nature Reviews. Genetics. 2016;17:19–32. doi: 10.1038/nrg.2015.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Stow EC, Kaul T, deHaro DL, Dem MR, Beletsky AG, Morales ME, Du Q, LaRosa AJ, Yang H, Smither E, Baddoo M, Ungerleider N, Deininger P, Belancio VP. Organ-, sex- and age-dependent patterns of endogenous L1 mRNA expression at a single locus resolution. Nucleic Acids Research. 2021;49:5813–5831. doi: 10.1093/nar/gkab369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Vilborg A, Sabath N, Wiesel Y, Nathans J, Levy-Adam F, Yario TA, Steitz JA, Shalgi R. Comparative analysis reveals genomic features of stress-induced transcriptional readthrough. PNAS. 2017;114:E8362–E8371. doi: 10.1073/pnas.1711120114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Villanueva-Cañas JL, Horvath V, Aguilera L, González J. Diverse families of transposable elements affect the transcriptional regulation of stress-response genes in Drosophila melanogaster. Nucleic Acids Research. 2019;47:6842–6857. doi: 10.1093/nar/gkz490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Wahl D, Cavalier AN, Smith M, Seals DR, LaRocca TJ. Healthy aging interventions reduce repetitive element transcripts. The Journals of Gerontology. Series A, Biological Sciences and Medical Sciences. 2021;76:805–810. doi: 10.1093/gerona/glaa302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wang K, Wu D, Zhang H, Das A, Basu M, Malin J, Cao K, Hannenhalli S. Comprehensive map of age-associated splicing changes across human tissues and their contributions to age-associated diseases. Scientific Reports. 2018;8:10929. doi: 10.1038/s41598-018-29086-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wang Y, Liu L, Song Y, Yu X, Deng H. Unveiling E2F4, TEAD1 and AP-1 as regulatory transcription factors of the replicative senescence program by multi-omics analysis. Protein & Cell. 2022;13:742–759. doi: 10.1007/s13238-021-00894-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Wood JG, Jones BC, Jiang N, Chang C, Hosier S, Wickremesinghe P, Garcia M, Hartnett DA, Burhenn L, Neretti N, Helfand SL. Chromatin-modifying genetic interventions suppress age-associated transposable element activation and extend life span in Drosophila. PNAS. 2016;113:11277–11282. doi: 10.1073/pnas.1604621113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Yao J, Ding D, Li X, Shen T, Fu H, Zhong H, Wei G, Ni T. Prevalent intron retention fine-tunes gene expression and contributes to cellular senescence. Aging Cell. 2020;19:e13276. doi: 10.1111/acel.13276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Zhang X, Liu X, Du Z, Wei L, Fang H, Dong Q, Niu J, Li Y, Gao J, Zhang MQ, Xie W, Wang X. The loss of heterochromatin is associated with multiscale three-dimensional genome reorganization and aberrant transcription during cellular senescence. Genome Research. 2021;31:1121–1135. doi: 10.1101/gr.275235.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Zheng JT, Lin CX, Fang ZY, Li HD. Intron Retention as a Mode for RNA-Seq data analysis. Frontiers in Genetics. 2020;11:586. doi: 10.3389/fgene.2020.00586. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife assessment

Pablo A Manavella 1

This study presents fundamental findings on the role of transcription readout and intron retention in transposon expression during aging in mammals. The evidence supporting the claims of the authors is compelling, strongly supporting the authors' claims. The work will be of interest to scientists studying aging, transcription regulation, and epigenetics.

Joint Public Review:

Anonymous

In this manuscript, the authors examined the role of transcription readout and intron retention in increasing transcription of transposable elements during aging in mammals. It is assumed that most transposable elements have lost the regulatory elements necessary for transcription activation. Using available RNA-seq datasets, the authors showed that an increase in intron retention and readthrough transcription during aging contributes to an increase in the number of transcripts containing transposable elements.

Previously, it was assumed that the activation of transposable elements during aging is a consequence of a gradual imbalance of transcriptional repression and a decrease in the functionality of heterochromatin (de repression of transcription in heterochromatin). Therefore, this is an interesting study with important novel conclusion.

The authors revised the manuscript in accordance with the comments. Overall, the manuscript is useful because it shows that there is no direct connection between increased levels of transposon RNA and aging, and further demonstrates the disorganization of the transcriptional apparatus during aging.

eLife. 2024 Apr 3;12:RP87811. doi: 10.7554/eLife.87811.3.sa2

Author Response

Kamil Pabis 1, Diogo Barardo 2, Olga Sirbu 3, Kumar Selvarajoo 4, Jan Gruber 5, Brian K Kennedy 6

The following is the authors’ response to the original reviews.

Public Reviews:

Reviewer #1 (Public Review):

Transcriptional readthrough, intron retention, and transposon expression have been previously shown to be elevated in mammalian aging and senescence by multiple studies. The current manuscript claims that the increased intron retention and readthrough could completely explain the findings of elevated transposon expression seen in these conditions. To that end, they analyze multiple RNA-seq expression datasets of human aging, human senescence, and mouse aging, and establish a series of correlations between the overall expression of these three entities in all datasets.

While the findings are useful, the strength of the evidence is incomplete, as the individual analyses unfortunately do not support the claims. Specifically, to establish this claim there is a burden of proof on the authors to analyze both intron-by-intron and gene-by-gene, using internal matched regions, and, in addition, thoroughly quantify the extent of transcription of completely intergenic transposons and show that they do not contribute to the increase in aging/senescence. Furthermore, the authors chose to analyze the datasets as unstranded, even though strand information is crucial to their claim, as both introns and readthrough are stranded, and if there is causality, than opposite strand transposons should show no preferential increase in aging/senescence. Finally, there are some unclear figures that do not seem to show what the authors claim. Overall, the study is not convincing.

Major concerns:

1. Why were all datasets treated as unstanded? Strand information seems critical, and should not be discarded. Specifically, stranded information is crucial to increase the confidence in the causality claimed by the authors, since readthrough and intron retention are both strand specific, and therefore should influence only the same strand transposons and not the opposite-strand ones.

This is an excellent suggestion. Since only one of our datasets was stranded, we did not run stranded analyses for the sake of consistency. We would like to provide two analyses here that consider strandedness:

First, we find that within the set of all expressed transposons (passing minimal read filtering), 86% of intronic transposons match the strand of the intron (3147 out of 3613). In contrast, the number is 51% after permutation of the strands. Similarly, when we randomly select 1000 intronic transposons 45% match the strandedness of the intron (here we select from the set of all transposons). This is consistent with the idea that most transposons are only detectable because they are co-expressed on the sense strand of other features that are highly expressed.

As for the readthrough data, 287 out of 360 transposons (79%) within readthrough regions matched the strand of the gene and its readthrough.

Second, in the model we postulate, the majority of transposon transcription occurs as a co-transcriptional artifact. This applies equally to genic transposons (gene expression), intronic (intron retention) and gene proximal (readthrough or readin) transposons. Therefore, we performed the following analysis for the set of all transposons in the Fleischer et al. fibroblast dataset.

When we invert the strand annotation for transposons, before counting and differential expression, we would expect the counts and log fold changes to be lower compared to using the “correct” annotation file.

Indeed, we show that out of 6623 significantly changed transposons with age only 226 show any expression in the “inverted run” (-96%). (Any expression is defined as passing basic read filtering.)

Out of the 226 transposons that can be detected in both runs most show lower counts (A) and age-related differential expression converging towards zero (B) in the inverted run (Fig. L1).

Author response image 1. Transposons with inverted strandedness (“reverse”) show lower expression levels (log counts; A) and no differential expression with age (B) when compared to matched differentially expressed transposons (“actual”).

Author response image 1.

For this analysis we selected all transposons showing significant differential expression with age in the actual dataset that also showed at least minimal expression in the strand-inverted analysis (n=226). Data from Fleischer et al. (2018). (A) The log (counts) are clipped because we only used transposons that passed minimal read filtering in this analysis. (B) The distribution of expression values in the actual dataset is bimodal and positive since some transposons are significantly up- or downregulated. This bimodal distribution is lost in the strand-inverted analysis.

1. "Altogether this data suggests that intron retention contributes to the age-related increase in the expression of transposons" - this analysis doesn't demonstrate the claim. In order to prove this they need to show that transposons that are independent of introns are either negligible, or non-changing with age.

We would like to emphasize that we never claimed that intron retention and readthrough can explain all of the age-related increases in transposon expression. In fact, our data is compatible with a multifactorial origin of transposons expression. Age- and senescence-related transposon expression can occur due to: 1/ intron retention, 2/ readthrough, 3/ loss of intergenic heterochromatin. Specifically, we do not try to refute 3.

However, since most transposons are found in introns or downstream of genes, this suggests that intron retention and readthrough will be major, albeit non-exclusive, drivers of age-related changes in transposons expression. Even if the fold-change for intergenic transposons with aging or senescence were higher this would not account for the broadscale expression patterns seen in RNAseq data.

To further illustrate this, we analyzed transposons located in introns, genes, downstream (ds) or upstream (us) of genes (distance to gene < 25 kb) or in intergenic regions (distance to gene > 25 kb). Indeed, we find that although intergenic transposons show similar log-fold changes to other transposon classes (Fig. L2A), their total contribution to read counts is negligible (Fig. L2B, Fig. Fig. S15). We have also now added a more nuanced explanation of this issue to the discussion.

Author response image 2. We analyzed transposons located in introns, genes, downstream (ds) or upstream (us) of genes (distance to gene < 25 kb) or in intergenic regions (distance to gene > 25 kb).

Author response image 2.

Independent of their location, transposons show similar differential expression with aging or cellular senescence (A). In contrast, the expression of transposons (log counts) is highly dependent on their location and the median log(count) value decreases in the order: genic > intronic > ds > us > intergenic.

Author response image 3. Total counts are the sum of all counts from transposons located in introns, genes, downstream (ds) or upstream (us) of genes (distance to gene < 25 kb) or in intergenic regions (distance to gene > 25 kb).

Author response image 3.

Counts were defined as cumulative counts across all samples.

1. Additionally, the correct control regions should be intronic regions other than the transposon, which overall contributed to the read counts of the intron.

1. Furthermore, analysis of read spanning intron and partly transposons should more directly show this contribution.

Thank you for this comment. To rephrase this, if we understand correctly, the concern is that an increase in transposon expression could bias the analysis of intron retention since transposons often make up a substantial portion of an intron. We would like to address this concern with the following three points:

First, if the concern is the correlation between log fold-change of transposons vs log fold-change of their containing introns, we do not think that this kind of data is biased. While transposons make up much of the intron, a single transposon on average only accounts for less than 10% of an intron.

Second, to address this more directly, we show here that even introns that do not contain expressed transposons are increased in aging fibroblasts and after induction of cellular senescence (Fig. S8). This shows that intron retention is universal and most likely not heavily biased by the presence or absence of expressed transposons.

Author response image 4. We split the set of introns that significantly change with cellular aging (A) or cell senescence (B) into introns that contain at least one transposon (has_t) and those that do not contain any transposons (has_no_t).

Author response image 4.

Intron retention is increased in both groups. In this analysis we included all transposons that passed minimal read filtering (n=63782 in A and n=124173 in B). Median log-fold change indicated with a dashed red line for the group of introns without transposons.

Third, we provide an argument based on the distribution of transposons within introns (Fig. L3).

Author response image 5.

The 5’ and 3’ splice sites show the highest sequence conservation between introns, whereas the majority of the intronic sequence does not. This is because these sites contain binding sites for splicing factors such as U1, U2 and SF1 (A). Transposons could affect splicing and we present a biologically plausible mechanism and two ancillary hypotheses here (B). If transposons affect the splicing (retention) of introns the most likely mechanism would be via impairment of splice site recognition because a transposon close to the site forms a secondary structure, binds an effector protein or provides inadequate sequences for pairing. Hypothesis 1: Transposons impair splicing because they are close to the splice site. Hypothesis 2: Transposons do not impair splicing because they are located away from the splice junction. Retained introns should show a similar depletion of transposons around the junction.

Image adapted from: Ren, Pingping, et al. "Alternative splicing: a new cause and potential therapeutic target in autoimmune disease." Frontiers in Immunology 12 (2021): 713540.

Consistent with hypothesis 2 (“transposons do not impair splicing”), we show that the distribution of transposons within introns is similar for the set of all transposons and all significant transposons within significantly overexpressed introns (Fig. S7. A and B is similar in the case of aged fibroblasts; D and E is similar in the case of cellular senescence). If transposon expression was causally linked to changes in intron retention, the most likely mechanism would be via an impairment of splicing. We would expect transposons to be located close to the splice junction, which is not what we observed. Instead, the data is more consistent with intron retention as a driver of transposon expression.

Author response image 6. Transposons are evenly distributed within introns except for the region close to splice junctions (A-E).

Author response image 6.

Transposons appear to be excluded from the splice junction-adjacent region both in all introns (A, D) and in significantly retained introns (B, E). In addition, transposon density of all introns and significantly retained introns is comparable (C, F). We included only introns containing at least one transposon in this analysis. (A) Distribution of 2292769 transposons within 163498 introns among all annotated transposons. (B) Distribution of 195190 transposons within 14100 introns significantly retained with age. (C) Density (transposon/1kb of intron) of transposons in all introns (n=163498) compared to significantly retained introns (n=14100). (D) as in (A) (E) Distribution of 428130 transposons within 13205 introns significantly retained with induced senescence. (F) Density (transposon/1kb of intron) of transposons in all introns (n=163498) compared to significantly retained introns (n=13205).

1. "This contrasts with the almost completely even distribution of randomly permuted transposons." How was random permutation of transposons performed? Why is this contract not trivial, and why is this a good control?

Permutation was performed using the bedtools shuffle function (Quinlan et al. 2010). We use the set of all annotated transposons and all reshuffled transposons as a control. It is interesting to observe that these two show a very similar distribution with transposons evenly spread out relative to genes. In contrast, expressed transposons are found to cluster downstream of genes. This gave rise to our initial working hypothesis that readthrough should affect transposon expression.

1. Fig 4: the choice to analyze only the 10kb-20kb region downstream to TSE for readthrough regions has probably reduced the number of regions substantially (there are only 200 left) and to what extent this faithfully represent the overall trend is unclear at this point.

This is addressed in Suppl. Fig. 7, we repeated the analysis for every 10kb region between 0 and 100kb, showing similar results.

Furthermore, we show below in a new figure that the results are comparable when we measure readthrough in the 0 to 10kb region, while the sample size of readthrough regions is increased.

Finally, it is commonly accepted to remove readthrough regions overlapping genes, which while reducing sample size, increases accuracy for readthrough determination (Rosa-Mercado et al. 2021). Without filtering readthrough regions can overlap neighboring genes which is reflected in an elevated ratio of Readthrough_counts/Genic_counts (Fig. S9).

Author response image 7. Readthrough was determined in a region 0 to 10 kb downstream of genes for a subset of genes that were at least 10 kb away from the nearest neighboring gene (n=684 regions).

Author response image 7.

The log2 ratio of readthrough to gene expression is plotted across five age groups (adolescent n=32, young n=31, middle-aged n=22, old n=37 and very old n=21). (B) As in (A) but data is plotted on a per sample basis. (C) Readthrough was determined in a region 0 to 10 kb downstream of genes for a subset of genes that were at least 10 kb away from the nearest neighboring gene (n=1045 regions). The log2 ratio of readthrough to gene expression is plotted for the groups comprising senescence (n=12) and the non-senescent group (n=6). (D) As in (D) but data is plotted on a per sample basis and for additional control datasets (serum-starved, immortalized, intermediate passage and early passage). N=3 per group.

1. Fig. 5B shows the opposite of the authors claims: in the control samples there are more transposon reads than in the KCl samples.

Thank you for pointing this out. During preparation of the manuscript the labels of Fig. 5B were switched (however, the color matching between Fig. 5A-C is correct). We apologize for this mistake, which we have now corrected.

1. "induced readthrough led to preferential expression of gene proximal transposons (i.e. those within 25 kb of genes), when compared with senescence or aging". A convincing analysis would show if there is indeed preferential proximity of induced transposons to TSEs. Since readthrough transcription decays as a function of distance from TSEs, the expression of transposons should show the same trends if indeed simply caused by readthrough. Also, these should be compared to the extent of transposon expression (not induction) in intergenic regions without any readthrough, in these conditions.

This is a very good suggestion. We now provide two new supplementary figures analyzing the distance-dependence of transposon expression.

In the first figure (Fig. S13) we show that readthrough decreases with distance (A, B) and we show that transposon counts are higher for transposons close to genes, following a similar pattern to readthrough. This is true in fibroblasts isolated from aged donors (A) and with cellular senescence (B).

Author response image 8. Readthrough counts (rt_counts) decrease exponentially downstream of genes, both in the aging dataset (A) and in the cellular senescence dataset (B).

Author response image 8.

Although noisier, the pattern for transposon counts (transp_cum_counts) is similar with higher counts closer to gene terminals, both in the aging dataset (C) and in the cellular senescence dataset (D). Readthrough counts are the cumulative counts across all genes and samples. Readthrough was determined in 10 kb bins and the values are assigned to the midpoint of the bin for easier plotting. Transposon counts are the cumulative counts across all samples for each transposon that did not overlap a neighboring gene. n=801 in (C) and n=3479 in (D).

In the second figure (Fig. S14) we show that transposons found downstream of genes with high readthrough show a more pronounced log-fold change (differential expression) than transposons downstream of genes with low readthrough (defined based on log-fold change). This is true in fibroblasts isolated from aged donors (A) and with cellular senescence (B). Furthermore, the difference between high and low readthrough region transposons is diminished for transposons that are more than 10 kb downstream of genes, as would be expected given that readthrough decreases with distance.

Author response image 9. Transposons found downstream of genes with high readthrough (hi_RT) show a more pronounced log-fold change (transp_logfc) than transposons downstream of genes with low readthrough (low_RT).

Author response image 9.

This is true in fibroblasts isolated from aged donors (A) and with cellular senescence (B). Furthermore, the difference between high and low readthrough region transposons is diminished for transposons that are more than 10 kb downstream of genes (“Transp > 10 kb”). Transposons in high readthrough regions were defined as those in the top 20% of readthrough log-fold change. Readthrough was measured between 0 and 10 kb downstream from genes. n=2124 transposons in (A) and n=6061 transposons in (B) included in the analysis.

Reviewer #2 (Public Review):

In this manuscript, the authors examined the role of transcription readout and intron retention in increasing transcription of transposable elements during aging in mammals. It is assumed that most transposable elements have lost the regulatory elements necessary for transcription activation. Using available RNA-seq datasets, the authors showed that an increase in intron retention and readthrough transcription during aging contributes to an increase in the number of transcripts containing transposable elements.

Previously, it was assumed that the activation of transposable elements during aging is a consequence of a gradual imbalance of transcriptional repression and a decrease in the functionality of heterochromatin (de repression of transcription in heterochromatin). Therefore, this is an interesting study with important novel conclusion. However, there are many questions about bioinformatics analysis and the results obtained.

Major comments:

1. In Introduction the authors indicated that only small fraction of LINE-1 and SINE elements are expressed from functional promoters and most of LINE-1 are co-expressed with neighboring transcriptional units. What about other classes of mobile elements (LTR mobile element and transposons)?

We thank the reviewer for this comment. Historically, most repetitive elements, e.g. DNA elements and retrotransposon-like elements, have been considered inactive, having accrued mutations which prevent them from transposition. On the other hand, based on recent data it is indeed very possible that certain LTR elements become active with aging as suggested in several manuscripts (Liu et al. 2023, Autio et al. 2020). However, these elements are not well annotated and our final analysis (Fig. 6) relies on a well-defined distinction between active and inactive elements. (See also question 2 for further discussion.)

Finally, we would like to point out some of the difficulties with defining expression and re-activation of LTR/ERV elements based on RNAseq data that have been highlighted for the Liu manuscript and are concordant with several of our results:https://pubpeer.com/publications/364E785636ADF94732A977604E0256

Liu, Xiaoqian, et al. "Resurrection of endogenous retroviruses during aging reinforces senescence." Cell 186.2 (2023): 287-304.

Autio A, Nevalainen T, Mishra BH, Jylhä M, Flinck H, Hurme M. Effect of ageing on the transcriptomic changes associated with expression at the HERV-K (HML-2) provirus at 1q22. Immun Ageing. 2020;17(1):11.

1. Results: Why authors considered all classes of mobile elements together? It is likely that most of the LTR containing mobile elements and transposons contain active promoters that are repressed in heterochromatin or by KRAB-C2H2 proteins.

We do not consider LTR containing elements because there is uncertainty regarding their overall expression levels and their expression with aging (Nevalainen et al. 2018). Furthermore, we believe that substantial activity of LTR elements in human genomes should have been detectable through patterns of insertional mutagenesis. Yet studies generally show low to negligible levels of LTR (ERV) mutagenesis. Here, for example, at a 200-fold lower rate than for LINEs (Lee et al. 2012).

Importantly, our analysis in Fig. 6 relies on well-annotated elements like LINEs, which is why we do not include LTR or SINE elements that could be potentially expressed. However, for other analyses we did consider element families independently as can be seen in Table S1, for example.

Nevalainen, Tapio, et al. "Aging-associated patterns in the expression of human endogenous retroviruses." PLoS One 13.12 (2018): e0207407.

Lee, Eunjung, et al. "Landscape of somatic retrotransposition in human cancers." Science 337.6097 (2012): 967-971.

1. Fig. 2. A schematic model of transposon expression is not presented clearly. What is the purpose of showing three identical spliced transcripts?

This is indeed confusing. There are three spliced transcripts to schematically indicate that the majority of transcripts will be correctly spliced and that intron retention is rare (estimated at 4% of all reads in our dataset). We have clarified the figure now, please see below:

Author response image 10. A schematic model of transposon expression.

Author response image 10.

In our model, represented in this schematic, transcription (A) can give rise to mRNAs and pre-mRNAs that contain retained introns when co-transcriptional splicing is impaired. This is often seen during aging and senescence, and these can contain transposon sequences (B). In addition, transcription can give rise to mRNAs and pre-mRNAs that contain transposon sequences towards the 3’-end of the mRNA when co-transcriptional termination at the polyadenylation signal (PAS) is impaired (C, D) as seen with aging and senescence. Some of these RNAs may be successfully polyadenylated (as depicted here) whereas others will be subject to nonsense mediated decay. Image created with Biorender.

© 2024, BioRender Inc

Figure 2 (Author response image 10) was created using BioRender, and is published under a CC BY-NC-ND license. Further reproductions must adhere to the terms of this license.

1. The study analyzed the levels of RNA from cell cultures of human fibroblasts of different ages. The annotation to the dataset indicated that the cells were cultured and maintained. The cells were cultured in high-glucose (4.5mg/ml) DMEM (Gibco) supplemented with 15% (vol/vol) fetal bovine serum (Gibco), 1X glutamax (Gibco), 1X non-essential amino acids (Gibco) and 1% (vol/vol) penicillin-streptomycin (Gibco). How correct that gene expression levels in cell cultures are the same as in body cells? In cell cultures, transcription is optimized for efficient division and is very different from that of cells in the body. In order to correlate a result on cells with an organism, there must be rigorous evidence that the transcriptomes match.

We agree and have updated the discussion to reflect this shortcoming. While we do not have human tissue data, we would like to draw the reviewer’s attention to Fig. S3 where we presented some liver data for mice. We now provide an additional supplementary figure (in a style similar to Fig. S2) showing how readthrough, transposon expression and intron retention changes in 26 vs 5-month-old mice (Fig. S4). Indeed, intron, readthrough and transposons increase with age in mice, although this is more pronounced for transposons and readthrough.

Author response image 11. Intron, readthrough and transposon elements are elevated in the liver of aging mice (26 vs 5-month-old, n=6 per group).

Author response image 11.

Readthrough and transposon expression is especially elevated even when compered to genic transcripts. The percentage of upregulated transcripts is indicated above each violin plot and the median log10-fold change for genic transcripts is indicated with a dashed red line.

Finally, just to elaborate, we used the aging fibroblast dataset by Fleischer et al. for three reasons:

1. Yes, aging fibroblasts could be a model of human aging, with important caveats as you correctly point out,

2. it is one of the largest such datasets allowing us to draw conclusions with higher statistical confidence and do things such as partial correlations

3. it has been analyzed using similar techniques before (LaRocca, Cavalier and Wahl 2020) and this dataset is often used to make strong statements about transposons and aging such as transposon expression in this dataset being “consistent with growing evidence that [repetitive element] transcripts contribute directly to aging and disease”. Our goal was to put these statements into perspective and to provide a more nuanced interpretation.

LaRocca, Thomas J., Alyssa N. Cavalier, and Devin Wahl. "Repetitive elements as a transcriptomic marker of aging: evidence in multiple datasets and models." Aging Cell 19.7 (2020): e13167.

1. The results obtained for isolated cultures of fibroblasts are transferred to the whole organism, which has not been verified. The conclusions should be more accurate.

We agree and have updated the discussion accordingly.

1. The full pipeline with all the configuration files IS NOT available on github (pabisk/aging_transposons).

Thank you for pointing this out, we have now uploaded the full pipeline and configuration files.

1. Analysis of transcripts passing through repeating regions is a complex matter. There is always a high probability of incorrect mapping of multi-reads to the genome. Things worsen if unpaired short reads are used, as in the study (L=51). Therefore, the authors used the Expectation maximization algorithm to quantify transposon reads. Such an option is possible. But it is necessary to indicate how statistically reliable the calculated levels are. It would be nice to make a similar comparison of TE levels using only unique reads. The density of reads would drop, but in this case it would be possible to avoid the artifacts of the EM algorithm.

We thank the reviewer for this suggestion. We show here that mapping only unique alignments (outFilterMultimapNmax=1 in STAR) leads to similar results.

For the aging fibroblast dataset:

Author response image 12.

Author response image 12.

For the induced senescence dataset:

Author response image 13.

Author response image 13.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Purcell M, Tainsky M, Kruger A. 2015. Pathway Profiling of Replicative and Induced Senescence. NCBI Gene Expression Omnibus. GSE60340 [DOI] [PMC free article] [PubMed]
    2. Fleisher JG, Schulte R, Tsai H, Tyagi S, Ibarra A, Shokhirev MN, Huang L, Hetzer MW, Navlakha S. 2018. Predicting age from the transcriptome of human dermal fibroblasts. NCBI Gene Expression Omnibus. GSE113957 [DOI] [PMC free article] [PubMed]
    3. Rosa-Mercado NA, Zimmer JT, Apostolidi M, Rinehart J, Simon MD, Steitz JA. 2021. TT-TL-seq reveals transcriptional profiles that accompany DoG induction after hyperosmotic stress. NCBI Gene Expression Omnibus. GSE152059
    4. Bauer et al. 2018. Meromictic Lake Metagenome. NCBI BioProject. PRJNA257655
    5. Vilborg A, Shalgi R. 2017. Comparative analysis reveals genomic features of stress-induced transcriptional readthrough. NCBI Gene Expression Omnibus. GSE98906 [DOI] [PMC free article] [PubMed]
    6. Hahn et al. 2017. Dietary restriction protects from age-associated DNA methylation and induces epigenetic reprogramming of lipid metabolism. NCBI Gene Expression Omnibus. GSE92486 [DOI] [PMC free article] [PubMed]
    7. Wang Y, Liu L, Song Y. 2022. Unveiling E2F4, TEAD1 and AP-1 as regulatory transcription factors of the replicative senescence program by multi-omics analysis. NCBI Gene Expression Omnibus. GSE179880 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    MDAR checklist
    Supplementary file 1. Supplementary figures and tables.
    elife-87811-supp1.docx (28KB, docx)

    Data Availability Statement

    The current manuscript is a computational study, so no data have been generated for this manuscript.

    The following previously published datasets were used:

    Purcell M, Tainsky M, Kruger A. 2015. Pathway Profiling of Replicative and Induced Senescence. NCBI Gene Expression Omnibus. GSE60340

    Fleisher JG, Schulte R, Tsai H, Tyagi S, Ibarra A, Shokhirev MN, Huang L, Hetzer MW, Navlakha S. 2018. Predicting age from the transcriptome of human dermal fibroblasts. NCBI Gene Expression Omnibus. GSE113957

    Rosa-Mercado NA, Zimmer JT, Apostolidi M, Rinehart J, Simon MD, Steitz JA. 2021. TT-TL-seq reveals transcriptional profiles that accompany DoG induction after hyperosmotic stress. NCBI Gene Expression Omnibus. GSE152059

    Bauer et al. 2018. Meromictic Lake Metagenome. NCBI BioProject. PRJNA257655

    Vilborg A, Shalgi R. 2017. Comparative analysis reveals genomic features of stress-induced transcriptional readthrough. NCBI Gene Expression Omnibus. GSE98906

    Hahn et al. 2017. Dietary restriction protects from age-associated DNA methylation and induces epigenetic reprogramming of lipid metabolism. NCBI Gene Expression Omnibus. GSE92486

    Wang Y, Liu L, Song Y. 2022. Unveiling E2F4, TEAD1 and AP-1 as regulatory transcription factors of the replicative senescence program by multi-omics analysis. NCBI Gene Expression Omnibus. GSE179880


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES