Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2021 Apr 22;17(4):e1009438. doi: 10.1371/journal.pgen.1009438

Toxic Y chromosome: Increased repeat expression and age-associated heterochromatin loss in male Drosophila with a young Y chromosome

Alison H Nguyen 1, Doris Bachtrog 1,*
Editor: R Scott Hawley2
PMCID: PMC8061872  PMID: 33886541

Abstract

Sex-specific differences in lifespan are prevalent across the tree of life and influenced by heteromorphic sex chromosomes. In species with XY sex chromosomes, females often outlive males. Males and females can differ in their overall repeat content due to the repetitive Y chromosome, and repeats on the Y might lower survival of the heterogametic sex (toxic Y effect). Here, we take advantage of the well-assembled young Y chromosome of Drosophila miranda to study the sex-specific dynamics of chromatin structure and repeat expression during aging in male and female flies. Male D. miranda have about twice as much repetitive DNA compared to females, and live shorter than females. Heterochromatin is crucial for silencing of repetitive elements, yet old D. miranda flies lose H3K9me3 modifications in their pericentromere, with heterochromatin loss being more severe during aging in males than females. Satellite DNA becomes de-repressed more rapidly in old vs. young male flies relative to females. In contrast to what is observed in D. melanogaster, we find that transposable elements (TEs) are expressed at higher levels in male D. miranda throughout their life. We show that epigenetic silencing via heterochromatin formation is ineffective on the TE-rich neo-Y chromosome, presumably due to active transcription of a large number of neo-Y linked genes, resulting in up-regulation of Y-linked TEs already in young males. This is consistent with an interaction between the evolutionary age of the Y chromosome and the genomic effects of aging. Our data support growing evidence that “toxic Y chromosomes” can diminish male fitness and a reduction in heterochromatin can contribute to sex-specific aging.

Author summary

Y chromosomes can be toxic. The Y chromosome of many species contains a large number of transposable elements (TEs), which are transcriptionally constrained by repressive chromatin marks. When relieved of these epigenetic modifications, many TEs can readily move from one genomic location to another. We show that TEs located on the Y chromosome are less effectively silenced in male Drosophila, and the toxic Y effect appears more pronounced in a species that contains a larger Y chromosome with more repeats and more actively transcribed genes. Our data demonstrate that repeat-rich Y chromosomes are a genomic liability for males.

Introduction

Males and females differ in many life history traits, and sexual dimorphism in fitness-related traits is often driven by natural selection [1]. However, some traits could vary between the sexes in a non-adaptive manner. For example, males and females often differ in their lifespan [2], and the chromosomal sex determination system has been shown to influence sex-specific longevity. In particular, several studies have suggested that the sex with the heteromorphic sex chromosomes (males in XY species; females in ZW species) has a shorter lifespan on average [3, 4]. The aging process is associated with an overall loss of heterochromatin in many species [5, 6], and differences in the heterochromatin content between sexes could in principle contribute to sex-specific mortality (the toxic Y effect; [7]). In species with heteromorphic sex chromosomes, the amount of repetitive DNA and thus heterochromatin can vary dramatically between males and females, due to the presence of a large, repetitive Y (or W) chromosome in the heterogametic sex. Heterochromatin loss can result in de-repression and mobilization of silenced transposable elements (TEs) [811], and might disproportionally affect the sex with the higher heterochromatin content.

We previously showed that heterochromatin loss differs between the sexes in D. melanogaster [12]. TEs are up-regulated in old male flies, and Y-linked repeats are especially prone to become de-repressed during aging in D. melanogaster. Genetic manipulations of sex chromosome karyotypes suggested that the Y chromosome directly contributes to sex-specific aging in flies: females containing a Y chromosome (XXY females) lived shorter than wildtype XX females, males lacking a Y (X0 males) outlived wildtype males, and males containing two Y chromosomes (XYY males) showed a drastically reduced lifespan compared to XY males [12]. However, repeat-rich Y chromosomes are typically poorly assembled (only 14.6 Mb of the roughly 40-Mb large Y chromosome of D. melanogaster is present in the latest assembly; [13]), making it impossible to directly observe chromatin changes on the heterochromatic Y chromosome and link them to expression changes of repeats using genomic approaches.

Here, we test for an association between heteromorphic sex chromosomes and sex-specific heterochromatin formation and silencing of repetitive DNA in a species with a well-assembled Y chromosome [14]. We assay chromatin and gene expression profiles in young and aged males and females of D. miranda (Fig 1A), a model system for studying the molecular basis of sex chromosome differentiation [1419]. D. miranda has a recently evolved neo-sex chromosome that was formed only 1.5MY ago from an ordinary autosome that fused to the ancestral Y [20]. The neo-X and neo-Y are in the process of differentiating into heteromorphic sex chromosomes, and the neo-Y has dramatically increased in size mainly through an accumulation of TEs [14, 15, 17, 21]. Specifically, a high-quality genome assembly that includes large fractions of heterochromatin contained >90 Mb of neo-Y-linked sequence (compared to ~25 Mb of neo-X linked sequence), and 65 Mb are derived from repetitive elements (compared to 5 Mb on the neo-X) [14]. TEs are uniformly enriched along the neo-Y chromosome and a main contributor to its dramatically increased genome size [14, 15, 17, 21]. For example, the most abundant repeat on the neo-Y is the ISY element, a helitron transposon that is inserted about 22,000 times on the neo-Y/Y chromosome and amounts to more than 16 Mb of DNA (i.e., 16% of neo-Y sequence), while its former homolog, the neo-X chromosome, only harbors about 1,500 copies (3% of the neo-X; less than 1 Mb; [14]). The second most common repeat class that has amplified on the neo-Y are gypsy TEs; roughly 14,300 insertions were found on the neo-Y (15% of neo-Y sequence, 15 Mb) and less than 1 Mb on the neo-X (about 800 insertions, i.e., 3% of its sequence; [14]). Thus, a huge number of TEs has accumulated on the neo-Y chromosome since its origination 1.5 MY ago. Yet this chromosome still contains thousands of protein-coding genes that are embedded within these islands of TEs [14], and many of the neo-Y linked protein-coding genes are still actively transcribed [18, 22]. Thus, unlike the Y chromosome of D. melanogaster, the neo-Y of D. miranda contains thousands of transcribed, euchromatic genes, which are intermingled with repetitive heterochromatin.

Fig 1. Drosophila miranda karyotype and lifespan.

Fig 1

A. Karyotype of D. miranda males and females. Chromosome arms (Muller elements) are color-coded. B. Polytene chromosome squash of D. miranda male stained with HP1 (magenta) marking heterochromatin. C. Genome-wide repetitive content per chromosome. Shown is repeat content (% repeat masked bp) in 50kb windows. D. Kaplan-Meier survivorship curves [72] for MSH22 males (blue) and females (red), with the shaded region indicating the upper and lower 95% confidence interval calculated from the Kaplan-Meier curves. The number of flies counted for each sex (n) to obtain the survivorship curves is indicated.

We use chromatin and transcription profiling to show that heterochromatin formation is compromised on the repeat-rich neo-Y of D. miranda already in young males, resulting in global up-regulation of TEs in males. Age-associated heterochromatin loss is more pronounced in male flies compared to females, which also live shorter, and accompanied by de-repression of satellite repeats in old males. Our data provide empirical support that toxic Y chromosomes can diminish male fitness, and contribute to sex-specific aging in species with heteromorphic sex chromosomes.

Results

Drosophila miranda males have more repeats and live shorter

We chose the standard lab strain MSH22 of D. miranda, a fly that has served as a model for Y chromosome evolution (Fig 1A) [14, 18, 22] to investigate how the ‘toxic’ Y chromosome may influence sex-specific behaviors of repetitive sequences, such as TE expression, the dynamics of heterochromatin during the lifespan of a fly, and sex-specific aging. In D. miranda, an autosome fused to the ancestral sex chromosome about 1.5MY ago [20], creating a neo-Y chromosome which is at an intermediate stage of degeneration. The neo-Y still contains many of its ancestral genes, but has expanded in size by about 3-fold, mostly due to the accumulation of TEs [14, 17]. This increase in repeat content is accompanied by a change in its chromatin structure with many regions along the neo-Y becoming heterochromatic (Fig 1B) [23]. D. miranda has a high-quality genome assembly that comprise large stretches of repetitive heterochromatin, including major parts of the pericentromeres and the Y chromosome [14] (Fig 1C). In particular, D. miranda males have a enormous, ~92 Mb large recently formed and highly repetitive and heterochromatic neo-Y chromosome [14], while the pericentromeric heterochromatin on the X/neo-X only amounts to ~24 Mb (Table 1) [14, 24]. Thus, males contain significantly more repetitive DNA than females, and flow cytometry estimates agree with our assembled genome sizes (Table 1).

Table 1. Assembled length for each chromosome, size of chromosome arms and pericentromere, and repeat-masked bases (in Mb).

The total genome size inferred from flow cytometry is given in brackets.

Total Assembly (Mb) Euchromatic Arms (Mb) Pericentric Heterochromatin Repeatmasked (Mb)
X chr (A/D) 77.6 57.1 20.5 28.9
neo-X (C) 25.3 21.9 3.4 6.1
Y/neo-Y chr 92.1 n/a n/a 76.8
chr4 (B) 32.5 29.1 3.4 6.6
chr2 (E) 35.3 33.3 2 6.2
dot 2.4 0 2.4 1.3
total 173.1 141.4 31.7 49.1
male (2n) 335.4 (354) 203.8 131.6 140.0
female (2n) 346.3 (351.2) 282.9 63.4 98.2

In most Drosophila species male flies live significantly shorter than female flies [25, 26]. We determined longevity for males and females for the reference lab strain from D. miranda (MSH22) at 18°C (from 3 biological replicates, see Materials & Methods). Median survivorship for males is 78 days, and >98 days for females (Fig 1D). Thus, lifespan assays confirm that males live significantly shorter than females, consistent with multiple studies on sex-specific lifespan in Drosophila [25, 26]. Longevity patterns are thus consistent with the notion that the repeat-rich Y chromosome reduces survivorship in the heterogametic sex [3, 4], but note that other factors besides sex (such as male-male aggression) may contribute to differences in longevity between males and females [27].

Heterochromatin loss is more pronounced in males

We collected replicate ChIP-seq data (4 replica for each sex and age) for a histone modification typical of repressive heterochromatin (H3K9me3) from young (9–11 days) and old (80-90-day) D. miranda male and female brains (MSH22) (S1 and S2 Tables). In order to increase mappability of our ChIP-seq data (both to repetitive regions, and to distinguish between reads derived from the neo-X and neo-Y), we collected 100-bp paired-end read data. We spiked-in a fixed amount of chromatin from D. melanogaster to each D. miranda chromatin sample prior to ChIP and sequencing, to compare the genomic distribution of chromatin marks across samples using a ‘spike in’ normalization method [12, 28]. This species pair is sufficiently diverged from each other that there is very little ambiguity in the assignment of reads to the correct species, especially for fast-evolving repetitive DNA (S1 Appendix), and this strategy allows us to compare levels of H3K9me3 enrichment across sexes and time.

Fig 2 shows the genomic distribution of the repressive histone modification H3K9me3 for young and old male and female flies. As expected, heterochromatin is enriched at repetitive regions, including pericentromeres, the small dot chromosome and the repeat-rich Y chromosome (Fig 2A). Overall heterochromatin enrichment is similar for repetitive regions at the X and autosomes in young males and females (Fig 2B). Consistent with heterochromatin being lost during aging, both males and females show a reduction in H3K9me3 enrichment at their pericentromeres in old compared to young flies (Figs 2 and 3). Strikingly, heterochromatin loss is more pronounced in old male flies than in old females (Fig 2A and 2B). This finding is reproducible across individual replicates (S1S6 Figs and S3 Table), and robust with regards to normalization strategy (S1 Fig and S4 Table) and using only uniquely mapping reads (S3 Fig and S5 Table). D. miranda males show significantly more regions that lose H3K9me3 signal (1.5-fold or more in 5-kb windows) during aging compared to females (2936 vs. 316, p < 0.00001; Fisher’s exact test; S7 Fig). Almost all regions (78% of windows) that lose heterochromatin in males are located within the pericentromeres of the autosome/X (1963 windows) and the neo-Y (328 windows) (S7S9 Figs). In females, 25.6% of windows (81) losing heterochromatin occur within the pericentromeres. Many fewer regions gain H3K9me3 signal (1.5-fold or more) during aging (947 in males and 155 in females, p < 0.00001, Fisher’s exact test; S7 Fig). Almost all regions that gain H3K9me3 signal are found along the chromosome arms (S7 Fig), consistent with the idea that some heterochromatin becomes re-distributed in old flies [29]. Faster heterochromatin loss in old males is in agreement with recent findings in D. melanogaster [12]. Note that genes involved in heterochromatin formation and organization show little change in their expression during aging in both males and females, suggesting that heterochromatin loss in males is not due to a male-specific decline in proteins required for heterochromatin formation (S6 Table). These data suggest that increased deterioration of heterochromatin during aging may be a general feature of the sex with more repetitive DNA.

Fig 2. Heterochromatin loss and aging.

Fig 2

A. Genome-wide enrichment of H3K9me3 for young and old D. miranda males and females along the different chromosome arms. Values represent the mean of 4 biological replicates for Young/Old Females and Young Males and the mean of 3 biological replicates for Old Males in 25kb windows. Subtraction plots show the absolute difference in signal of 25kb windows between young and aged flies along the chromosome arms, with females in red and males in blue. One tick mark is equal to 1 Mb on the x-axis. B. Boxplot showing ChIP enrichment differences between biological samples at euchromatin and pericentromeres. Enrichment resolution is in 5kb windows. Significance values calculated (* p<0.05, ** p<1e-6, *** p<1e-12, Wilcoxon test) for males (blue, cyan) and females (red, pink), with pericentromere boundaries defined by [24].

Fig 3. Heterochromatin enrichment across chromosomes.

Fig 3

A. ChIP enrichment across samples grouped by chromosome and binned by amount of repeats (%) per 5kb--window. Values represent the mean of 4 replicates with standard error bars with the exception of the “Old Males” where we omit 1 failed ChIP replicate. B. Immunofluorescence staining for H3K9me3 in male mitotic chromosomes. Arrowhead denotes neo-Y, arrow denotes MullerA-AD (old-X), scale bar is 50μm.

Heterochromatin content and loss on the neo-Y chromosome

The neo-Y is highly repetitive (Fig 1 and Table 1) and enriched for heterochromatin (Fig 2), yet to a lesser extent than pericentromeric regions (Fig 2A and 2B). Note that while the neo-X and neo-Y still show considerable homology at protein-coding genes, there is little mis-mapping between reads derived from the neo-X or neo-Y chromosome (S2 Appendix), indicating that lower H3K9me3 enrichment on the neo-Y is not an artifact due to mis-mapping. The neo-Y harbors several thousand protein-coding genes embedded in highly repeat-rich regions [14, 22], and overall repeat content on the neo-Y is slightly less than of pericentromeres on the X or autosomes (72% of bp are repeat-masked on the neo-Y chromosome, compared to 82–89% of the pericentromeres on X and autosomes).

To evaluate if lower repeat content on the neo-Y can account for lower levels of heterochromatin, we compared H3K9me3 enrichment versus repeat content for 5-kb windows across chromosomes (Fig 3A). As expected, genomic regions with a higher repeat density show more heterochromatin, and absolute levels of H3K9me3 enrichment are very similar between the X and autosomes, and young males and females. However, for repeat-rich windows (>70% repeat masked), the neo-Y chromosome shows considerably lower levels of H3K9me3 enrichment than other chromosomes, even in young males (Fig 3A). In particular, average H3K9me3 enrichment in highly repetitive windows (>70% of bases repeat masked) is about 30% lower on the neo-Y compared to X-linked or autosomal regions in young males, and 24% lower in old males. The same conclusion holds if the top repeat-rich regions (>99.5% repeat masked) are considered (Fig 3A). Thus, this suggests that repetitive DNA on the neo-Y shows less heterochromatin-induced epigenetic silencing than other repeat-rich regions of the genome. Immunofluorescence staining of H3K9me3 in mitotic chromosomes confirms much weaker labeling of the neo-Y chromosome relative to the large pericentromeric region of the X chromosome in male larvae (Figs 3B and S10). While we observe consistent and robust H3K9me3 staining along the repetitive pericentromere on the X chromosome and neo-X/autosomes, only the pericentromeric region of the neo-Y shows intense H3K9me3 staining; the highly repetitive chromosome arms of the neo-Y show considerable weaker H3K9me3 signal (Fig 3B), consistent with our ChIP-seq results (Fig 2). Thus, Immunofluorescence staining independently confirms that the neo-Y chromosome contains lower levels of heterochromatin, compared to other repetitive regions in the genome.

Multiple factors could contribute to lower levels of H3K9me3 enrichment on the neo-Y. The neo-Y chromosome still contains many unique regions with transcribed protein-coding genes [14, 18, 22] and thus encompasses a patchwork of heterochromatic repeats interspersed with unique and genetically active euchromatic regions. In Drosophila, the boundary between heterochromatin and euchromatin is not fixed but determined by the local balance between factors that promote either heterochromatin or euchromatin [30]. Active transcription could limit spreading of heterochromatin on the neo-Y and reduce overall H3K9me3 enrichment [31]. Indeed, the gene density on the neo-Y is substantially higher than in the pericentromeres. The D. miranda annotation contains 255 genes located in the pericentromeres (that is, roughly 9 genes/Mb) and 5625 genes on the Y/neo-Y (61 genes/Mb) [14]. Most genes are transcribed in adult brain samples (246 of all the pericentromeric genes and 4428 of the neo-Y genes; see below), consistent with the idea that active transcription of neo-Y genes could impede heterochromatin formation on this chromosome. Indeed, we find that H3K9me3 enrichment is lower around transcription start sites (TSS) for genes located on the neo-Y, similar to genes located in pericentromeric regions (Fig 4A), supporting the idea that active transcription counteracts local heterochromatin formation. Importantly, if we group neo-Y linked genes into those that are highly expressed from the neo-Y (top 20%) versus those lowly expressed (bottom 40%), we find that H3K9me3 enrichment is substantially lower neighboring genes that are highly expressed (Fig 4B). Lower H3K9me3 levels near highly transcribed genes supports the notion that active transcription interferes with heterochromatin formation on the neo-Y.

Fig 4. Heterochromatin enrichment at transcription start sites (TSS) on the neo-Y and the pericentromeres.

Fig 4

A. The average young male H3K9me3 ChIP enrichment profile +/- 5kb of TSS in 100bp resolution grouped by gene location (neo-Y or pericentromere). B. The average young male enrichment profile at TSS neo-Y genes grouped and plotted by expression level. The resolution is also 100bp.

In addition, heterochromatin on the neo-Y is evolutionarily young, and its overall repeat structure may differ from pericentromeres. The neo-Y was an ordinary autosome until 1.5 MY ago and only acquired its high repeat content and heterochromatin structure after it became fused to the ancestral Y [15, 21]. The repeats present on the neo-Y should therefore encompass mostly recently active TEs, and it is possible that these repeats may initiate less heterochromatin [32]. Indeed, the relative abundance of different TE families differs between the neo-Y chromosome and the rest of the genome (S11 Fig; [14]); we find the neo-Y contains more LTR transposons and helitrons but less simple repeats in comparison to other chromosomes (S11 Fig).

While overall TE composition differs between the neo-Y and other chromosomes, transposons belonging to the same family may also show lower levels of H3K9me3 enrichment if present on the neo-Y. As expected, H3K9me3 profiles at TE families show that there is a general enrichment of this repressive mark in both male and female flies (Fig 5A). To test if neo-Y linked TEs show less H3K9me3 enrichment, we calculated average H3K9me3 enrichment for each TE family based on genome-wide mapping of H3K9me3 separately for copies located on the neo-Y versus other genomic locations. Indeed, copies of TEs inserted onto the neo-Y typically show lower levels of H3K9me3 enrichment than copies on other chromosomes. Global H3K9me3 enrichment is about 30% lower for neo-Y TEs compared to their X- and autosomal homologs in young flies (and 27% lower in old flies), and 92/16 show significantly less/more H3K9me3 if located on the neo-Y relative to copies on the X/autosome in young males (and 25/6 in old ones; Fig 5B). Thus, this suggests that most TEs experience less epigenetic silencing on the neo-Y relative to other regions of the genome.

Fig 5. Heterochromatin enrichment at TEs across sex, time and chromosomes.

Fig 5

Shown is enrichment of H3K9me3 at different repeat families for young and old females and males. A. Log2 H3K9me3 enrichment of all transposable elements (from [55]) in young (5–9 day) and old (80–98 day) male and female MSH22, averaged across 4 replicates for Young/Old Females and Young Males and across 3 replicates for Old Males. H3K9me3 enrichment in males is separated by autosomes/X and neo-Y. Boxplots below with significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5). B. Log2(H3K9me3 enrichment) differences between TEs found on the Autosomes/X and the neo-Y for young (top) and old (bottom) males. Data represents averages of 4 replicates for Young/Old Females and Young Males and of 3 replicates for Old Males. The bars denote standard error. Data highlighted in yellow are TEs that show significant differences in H3K9me3 enrichment on autosomes/X and neo-Y (p < 0.05, two-tailed Student’s t-test).

Like other repeat-rich regions, overall levels of heterochromatin on the neo-Y decrease during aging; however, the absolute amount of H3K9me3 loss appears less on the Y in old males, compared to pericentric regions, or the dot chromosome (Figs 2 and S12). Overall, our results show that male D. miranda lose heterochromatin marks more rapidly than female, consistent with our previous findings in D. melanogaster [12]. However, we also find that the neo-Y shows considerably lower levels of heterochromatin than what we might expect given its repeat content. Indeed, levels of H3K9me3 enrichment on the neo-Y in young males resemble reduced levels of heterochromatin at X-linked and autosomal pericentromeres in old males.

Repeat de-repression in young male flies

Heterochromatin silences repetitive DNA [31], and reduced levels of H3K9me3 along the large repeat-rich neo-Y chromosome could have profound consequences on TE silencing in male D. miranda even in young flies. To study TE expression changes between sexes and during aging, we gathered replicated stranded RNA-seq data from young and old D. miranda (3 biological replicates, 100-bp paired-end reads to increase mappability; S7S9 Tables). Indeed, comparison of TE expression between young and old male and female D. miranda reveals that repeat expression is strikingly increased in male flies compared to females, irrespective of age (Figs 6A and 6B and S13). In particular, 135 TE families are significantly upregulated, while 1 is expressed at lower levels in male D. miranda compared to females already in 9-day old flies, and the fraction of transcripts derived from TEs is more than twice as high in males compared to females (the fraction of repetitive reads in all RNA-seq reads is on average 8.0% at young males and 3.7% at young females, S8S10 Tables).

Fig 6. TEs are upregulated in male D. miranda.

Fig 6

Shown is differential TE expression within and between sexes. A. Overall log2 expression by sex and age with rows sorted by log2 female/male TE coverage (right histogram). Boxplots below with significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5). Data represents averaged expression across samples. B. Log2 expression differences of TEs between young (left) and old (right) males versus females. Data indicates mean value of 3 biological replicates and bars denote standard error. Color-code indicates significant comparisons (50% significant higher or lower expression; Wald test, p <0.05) and Y-enriched repeats (based on genomic coverage): cyan: Y-enriched, p <0.05; yellow: unbiased, p <0.05; blue: Y-enriched, n.s.; black: unbiased; n.s. Grey line indicates 1:1 expression levels. C. Same as B but comparison between young vs. old males (left) and females (right). D. Log2 TE expression by sex and age grouped by TEs enriched on the neo-Y (blue) and TEs not enriched on the neo-Y (yellow). Significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5).

It is difficult to identify the genomic copy from which a TE transcript originated, especially for recently active families with highly similar copies. We used several approaches to test if increased TE expression in males is primarily driven by repeats located on the neo-Y. First, we mapped RNA seq reads to the D. miranda genome, and estimated expression of neo-Y linked copies versus copies on other chromosomes for each TE family [33]. We find that 53% of all TE transcripts in young males are derived from the autosomes and X and 47% are from the neo-Y, suggesting that most of the excess TE transcription in males could be attributed to expression from the neo-Y chromosome (S14 and S15 Figs).

We also quantify expression of TEs enriched on the neo-Y independent of our assembly (Fig 6A and 6B). TE families with more copies on the neo-Y chromosome will show higher coverage in male DNA sequencing libraries compared to female libraries [12]. We ranked TE families based on their male/female genomic coverage and classified 81 TEs as Y-enriched (based on log2(female/male gDNA) coverage; Wei et al. 2020). Indeed, TE families that are enriched on the neo-Y show higher expression in both males and females than the remaining TE families that are not enriched on the neo-Y (Fig 6D). In addition, most of the neo-Y enriched TE families are significantly more expressed in males, with 85% of the neo-Y enriched TEs (69 families) being more highly expressed in young males compared to females. Thus, this confirms the expectation that it is more recently active TEs that invaded the neo-Y chromosome since it stopped recombining with the neo-X 1.5MY ago. Increased copy number of these TEs and lower levels of heterochromatin on the neo-Y might drive their higher expression in males. Note that increased TE expression in male holds even if we account for higher copy number of repeats in males by normalizing by genomic DNA read counts or by estimated copy numbers in the genome assembly (S16 Fig). Thus, higher repeat expression in males is not simply a product of higher TE copy number in males.

No change in TE expression in old flies

A loss of heterochromatin in old individuals can lead to a re-activation of previously silenced TEs [811]. In D. melanogaster, we found that TEs are efficiently repressed in young male and female brain tissue [12]. While old females maintained heterochromatin and silencing of repeats, males aged for a similar amount of time showed a dramatic reduction in heterochromatin at pericentric regions and on their Y chromosome. Heterochromatin loss in males was accompanied by de-repression of TEs, and repeats from the Y chromosome were found to be disproportionally up-regulated in old D. melanogaster males [12].

Female D. miranda efficiently suppress TEs throughout their life, similar to D. melanogaster. In females, 4 TEs showed increased and 36 decreased expression during aging (Fig 6A, using abs(log2(fold-change)) = 0.58), and the total fraction of transcripts derived from repeats decreases during aging (the fraction of repetitive reads in all RNA-seq reads is on average 3.7% at 9 days, vs. 2.9% at 80 days, S3 Table). In contrast to repeat de-repression found in old D. melanogaster males, however, expression of TEs does not change during aging in male D. miranda (Figs 6C and S13). In particular, 6/8 TEs show a in/decrease in expression during aging in D. miranda males (Figs 6C and S13; same fold-change as females), and the total fraction of transcripts derived from repeats slightly decreases during aging (from 8.0% at 9 days to 7.6% at 80 days, S3 Table). Thus, while old D. miranda males show a drastic loss in heterochromatin at pericentromeres on the X and autosomes, this does not result in global expression changes at TEs. However, unlike in D. melanogaster, overall levels of TE expression are substantially increased in male D. miranda relative to females, already in young flies (Figs 6B and S13).

Satellite mis-expression in old male flies

Another type of repetitive DNA that is targeted by heterochromatin is satellite DNA repeats [31]. Satellites are often transcribed, and satellite transcripts may have important biological roles, including heterochromatin formation or centromere assembly [3437]. Unlike D. melanogaster, which has large arrays of satellite DNA composed of simple repeats (sometimes several megabases in size), the genomes of D. miranda and its close relatives contain many fewer, often shorter satellites [24, 38]. Of the 780 different satellite repeat units that were identified in the D. miranda genome assembly [14], only 9 are enriched on the neo-Y (based on male/female genomic coverage), compared to 81 neo-Y/Y enriched TEs (out of 303 TE families total; see above). Thus, unlike TE’s, satellites are not preferentially found on the neo-Y. Satellite DNA is highly enriched for H3K9me3 in young flies (Fig 7A), in both males and females. Mimicking overall patterns of heterochromatin enrichment, we see a clear drop in H3K9me3 levels in old male flies at satellites, but not females (Fig 7A).

Fig 7. Heterochromatin and expression at satellite DNA.

Fig 7

A. Average log2 H3K9me3 enrichment at different satellite DNA for young and old females and males. Boxplots with significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5). B. Log2 expression differences between young (left) and old (right) males and females. Data represents mean value of 3 replicates with bars denoting standard error, data in yellow denote significant differentially expressed satellites (50% higher or lower expression, Wald test, p < 0.05). Grey line denotes 1:1 expression levels between the samples. C. Log2 expression differences between young vs. old males (left) and females (right). Plots made in similar fashion as in B.

Many satellite repeats are transcribed, and at similar levels in both sexes; 21/16 satellites show higher/lower expression in young males compared to young females (Figs 7B and S17). Thus, unlike TEs that are highly enriched on the neo-Y and inefficiently silenced in males (Fig 6), no such differences in satellite expression are seen in young males and females. Old males show a dramatic loss in repressive heterochromatin at satellites, and a simultaneous up-regulation at satellite DNA expression. In males, 21 satellites show increased expression during aging compared to 1 satellite showing decreased expression, while expression at 2/1 satellites in/decreased in old females (Figs 7C and S17). Thus, a loss of silencing heterochromatin at satellites in old male D. miranda is accompanied by a de-repression of satellite transcripts.

Mis-expression of neo-Y genes and genes embedded in heterochromatin in old flies

Changes in heterochromatin during aging could also affect protein-coding genes; while most genes are located in repeat-poor euchromatin, some are embedded in heterochromatin. In addition to protein-coding genes located on the neo-Y (5586 in our combined ChIP and RNA analysis), D. miranda also has a total of 376 genes in pericentromeres and the dot chromosome. Genes that are naturally located in heterochromatin, such as pericentric genes in Drosophila or genes on the dot chromosome, have evolved mechanisms to assure gene transcription in this normally suppressive chromatin environment, and often are dependent on it [3941]. Indeed, genes located on the pericentromeres appear to be expressed at slightly lower levels in old flies, compared to young ones, in both sexes (6/18 significantly up/downregulated in aged males and 3/29 in aged females; S18 Fig). Neo-Y genes, on the other hand, only became embedded in their heterochromatic environment within the past 1.5MY. Indeed, as flies age and lose H3K9me3, neo-Y genes become slightly upregulated (249/141 are significantly up/downregulated; S18 Fig). Thus, these results are consistent with a dynamic view of co-evolution of heterochromatin and its host genome; genes embedded in old heterochromatin, such as the pericentromere, had millions of years of evolution to properly function in this typically repressive environment, and depend on it. A loss of heterochromatin during aging thus results in downregulation of pericentric genes. In contrast, genes that only became invaded by repeats and thus heterochromatin more recently, such as neo-Y genes in D. miranda, have not yet had enough time to adopt to this novel environment; losing heterochromatin, therefore, results in slight upregulation of neo-Y genes.

Discussion

Repetitive DNA is composed of TEs and satellite DNA, and their misregulation can induce genomic instability [42]. Transposable elements (TEs) are mobile DNA sequences that can change their position and comprise a substantial fraction of eukaryotic genomes [43]. Satellite DNA is often found near centromeres, and incorrect packaging or expression of some satellite DNA can have profound negative fitness consequences [3437]. Repetitive elements and their host genome are in a constant arms race, which need to co-evolve to evade each other’s response [44, 45]. Mobilization of TEs can cause insertional mutations [43] and natural selection acts to remove deleterious insertions [46]. Several host-encoded mechanisms have emerged during eukaryotic evolution to suppress TE activity [43, 47]. One defense mechanisms widely employed across eukaryotes is transcriptional silencing of repetitive DNA through heterochromatin formation [31]. A host-encoded defense mechanism that prevents movement of a TE family, however, creates selective pressure on these elements to evolve escape mechanisms to avoid the host defense, forcing the host to evolve further mechanisms to silence TEs. Thus, repetitive elements and their host genome are in a constant arms race, where they need to co-evolve to evade each other’s response [44, 45].

The effectiveness of host defense mechanisms to eliminate or silence repetitive elements can vary in space and time. Epigenetic defense mechanisms can vary both on evolutionary time scales, and throughout an individual’s life. Significant chromatin structural changes occur during aging, with a loss of heterochromatin commonly observed in old individuals [5, 29]. Disruption of heterochromatin during an individual’s life can result in mobilization of previously silenced TEs [9, 48]. The effectiveness of epigenetic silencing mechanisms can also vary across evolutionary time scales. In particular, a novel TE that is invading a naïve host genome can escape host surveillance mechanisms and propagate across the genome, until silencing mechanisms evolve to target that TE [32, 49].

Natural selection to remove TE insertions can also be compromised. In the absence of recombination, natural selection is less efficient, and can result in fixation of deleterious mutations, including the accumulation of TEs [50, 51]. Low or non-recombining regions of eukaryotic genomes such as pericentromeres are often an agglomerate of TEs [24, 46, 52], and Y chromosomes of many species, including Drosophila, are comprised almost entirely of repetitive DNA [52, 53]. In fact, the neo-Y of D. miranda has accumulated tens of thousands of TE insertions since it stopped recombining 1.5MY ago [14, 21], demonstrating just how quickly TEs can invade a non-recombining chromosome. Non-recombining Y chromosomes of many species can thus act as reservoirs for TEs [53, 54], and dramatically increase the repeat content in males compared to females [12].

The accumulation of deleterious mutations and repetitive elements on the Y chromosome might lower the survival of the heterogametic sex (“toxic Y” hypothesis) [3, 4]. While TEs may be efficiently silenced in young individuals, deteriorating heterochromatin can affect males disproportionally, and result in shorter lifespans in males [12]. Under the toxic Y model, we expect males to show higher levels of TE expression than females, at least during some stages in their life (for example, in old males when heterochromatin is known to be compromised). Secondly, we would also expect that larger Y chromosomes that contain more repetitive elements are more ‘toxic’ to males than Y chromosomes with fewer or less active repeats.

Our data support both predictions of the toxic Y effect. We show that repeats from the Y are mis-expressed in male D. miranda. We find that the overall number of TE-derived transcripts is about twice as much in males than it is in females, and that TEs located on the Y are primarily responsible for increased TE transcript abundance in males. Secondly, comparison of our data to a recent analysis in D. melanogaster suggests that the larger neo-Y of D. miranda is a much bigger burden for males than the ancient Y of D. melanogaster [12]. Young D. melanogaster males are effectively suppressing their Y-linked repeats and epigenetic defense mechanisms are only breaking down in old males [12]. In contrast, TE suppression is compromised even in young D. miranda males. The Y chromosome of D. melanogaster is substantially smaller than the neo-Y of D. miranda and fully heterochromatic [53, 54]. About ¼ of the 40 Mb large Y chromosome of D. melanogaster is composed of TEs (that is, about 10 Mb), while the rest of the Y mainly consists of satellite DNA [13]. In contrast, almost 75 Mb of Y/neo-Y derived sequence in D. miranda is classified as TE-derived.

In addition to having substantially more repeats, the neo-Y shows lower levels of heterochromatin enrichment compared to repetitive regions on the X or autosomes. The neo-Y of D. miranda contains 1000s of expressed genes, and we find that active transcription of these genes appears to impede the formation of silencing heterochromatin at adjacent TEs on the neo-Y. Thus, not only do male D. miranda have more repeats than females and their Y-chromosome has more TEs than other species (such as D. melanogaster), the TEs present on the neo-Y are particularly poorly silenced, due to the large number of active genes present. Therefore, while Y chromosomes can generally be toxic, the recently formed neo-Y of D. miranda is especially harmful [55]. This increased toxicity of the D. miranda neo-Y can explain why Y-linked TEs are ineffectively silenced throughout the life of a fly, and not only in old males. Note, however, that our comparative study does not allow us to directly demonstrate that the neo-Y of D. miranda reduces male lifespan. Careful genetic manipulations of the sex chromosomes, as done in D. melanogaster [12], are necessary to link the absence or presence of the neo-Y to changes in lifespan.

Overall, we detect higher levels of expression of TEs in male flies relative to female flies, both for young and old flies. Our analysis suggests that low levels of heterochromatin on the neo-Y may be inefficient to fully suppress repetitive elements on this chromosome even in young males. Genome-wide deterioration of heterochromatin in old males may further decrease TE silencing, but may not substantially contribute to overall TE transcript abundance in old males that already have a large, inefficiently silenced highly repeat-rich neo-Y. Indeed, this resembles our findings in D. melanogaster flies with additional Y chromosomes [12]. While wildtype D. melanogaster males efficiently suppress their Y-linked repeats when young, flies with additional Y chromosomes (XXY females or XYY males) show inefficient silencing of Y-linked repeats already in young flies [12]. These patterns are consistent with a dynamic view of co-evolution between selfish TEs and host defense mechanisms. Flies typically can suppress wildtype levels of TE activity when young, but these defense mechanisms may fail as flies age (especially in males which have more repetitive DNA) or when they are being challenged with unusually high numbers of active TEs (extra Y chromosomes in XXY and XYY D. melanogaster, or the recently formed, highly repetitive neo-Y of D. miranda).

The molecular mechanisms underlying increased TE expression during aging, and the contribution of TE activity to aging are only beginning to be understood. A direct link between neurodegeneration and heterochromatin was found in Drosophila: The microtubule-associated protein tau is involved in a number of neurodegenerative disorders, including Alzheimer’s disease. Wide-spread heterochromatin loss was found in tau transgenic Drosophila and mice, and oxidative stress and subsequent DNA damage were identified as mechanistic links between transgenic tau expression and heterochromatin deterioration [56]. Several TEs become highly active in Drosophila brain during normal aging, and exacerbated TE expression in brain results in age-dependent memory impairment and shortened lifespan [11]. Movements of TEs result in DNA damage, and may provide the mechanistic link between TE activity and physiological aging. In mouse, L1 retrotransposons become active during the course of aging; SIRT6—a powerful repressor of L1 activity—binds to the 5’UTR of L1 elements, and facilitates heterochromatin formation [57]. During aging, and also in response to DNA damage, SIRT6 becomes depleted from L1, allowing the activation of these previously silenced TEs [57]. De-repression of L1 elements results in activation of the type-I interferon response and age-associated inflammation, a phenotype of late senescence [58]. A direct link between systemic inflammation and heterochromatin was also observed in Drosophila: genes involved in immune response are enriched in lamin-associated heterochromatin in both flies and mammals, and fat body immunosenecence is caused by age-associated lamin-B reduction, contributing to loss of heterochromatin and de-repression of genes involved in immune responses [59]. Future research will further establish the mechanistic connections between repetitive DNA, TE activity, heterochromatin loss, and sex-specific aging.

Materials & methods

Drosophila strains

We used the reference genome strain MSH22 for analysis [14]. For chromatin and gene expression analyses, flies were grown in incubators at 18°C, 60% relative humidity, and 12h light for the indicated number of days following eclosion, and were then flash-frozen in dry ice and stored at -80°C. Flies were reared on standard molasses fly food: 0.68% agar (w/v), 2.7% yeast (w/v), 6.67% cornmeal (w/v), 0.456% propionic acid (v/v), 1.6% sucrose (w/v), 0.76% of 95% ethanol (v/v), 0.09% Tegosept (w/v), 8.2% molasses (v/v), 0.0625% CaCl2 (w/v), 0.75% Na Tartrate (w/v).

Lifespan assays and tissue preparation

Lifespan assays were conducted following {Linford, 2013 #2008} using the same rearing conditions as described above. Briefly, we collected synchronized embryos on a molasses plate with yeast paste for 16–20 hours. We washed embryos with 1x PBS pH 7.4 three times and dispensed 10μl of embryos per culture vial by pipette. To obtain synchronized adult flies, we collected emerging adults over a 3 day window and aged the adults for at least 7 days to mature and copulate. Afterwards, we separated males and females into separate vials, placing 20 individuals per vial, maintaining them at 18°C with a light:dark setting of 12h:12h. We moved flies to new vials every 3–5 days, without using CO2, and fly deaths were recorded. Flies that were observed escaping the vial were censored. To collect samples for the RNA-seq and ChIP-seq experiment, we censored the entire lifespan experiment once it reached less than 50% survivorship for one sex and flash-froze the remaining flies in liquid nitrogen. In total, 1765 female flies in 107 vials and 1304 male flies in 75 vials were counted for the lifespan assays reported here. We dissected brains from males and females aged 5–9 days (young flies) and 80–98 days old in 1X PBS pH 7.4 (1 brain/sample). We then snap-froze and stored the tissue at -80°C prior to use in the ChIP or RNA-seq assay. For the spike-in, we dissected brains from D. melanogaster yw strain females and pooled 2 brains/sample.

Chromatin immunoprecipitation and sequencing

We modified the native and ultra-low input ChIP protocol [60] for single-brain ChIP-Seq. First, we added 40μL EZ nuclei lysis buffer to frozen tissues and homogenized them with a pestle grinder on ice. We spun cells down at 1000xg for approximately 10 minutes and decanted 20μL of the supernatant. We froze the cell pellet in the lysis buffer at -80°C. We then followed the 100,000 cell count digestion protocol from [60] with a few modifications. For each aged D. miranda sample and D. melanogaster spike-in sample, we added 4.4μL of 1% DOC and 1% Triton-X100 to resuspended cells and mixed thoroughly. Briefly, we digested each sample at 37C for 04:58 (mm:ss) with MNase and quenched the reaction with 4.9μL of 100mM EDTA pH 8.0. We then added the fragmented spike-in samples to each D. miranda sample to account for approximately 20% of the final pooled sample volume (i.e. final sample consisted of 80% D. miranda, 20% D. melanogaster). We reserved 10% of the pooled fragmented sample for the input (fragment control) and used the remaining 90% to perform the chromatin pull-down (ChIP sample, target antibody: H3K9me3 polyclonal classic Diagenode C15410056) according to the 100,000 cell specifications. We used the Rubicon Genomics ThruPlex kit to prepare ChIP-Seq libraries for sequencing with 10 PCR amplification cycles for the input samples and 12 cycles for the ChIP pull-down samples. We sequenced libraries on the Illumina HiSeq 4000 Platform at the Vincent J. Coates Genomics Sequencing Center (Berkeley, CA). In total, we made 4 biological replicates: 2 replicates from the first aging trial, 1 replicate from the second, and 1 replicate from the third.

RNA extraction and RNA-seq

We extracted total RNA from 20 brains per sex per age (3 biological replicas, one from each aging trial) following a standard TRIzol protocol. Briefly, we homogenized tissues first in 100μl TRIzol and then added 900μl TRIzol to proceed with the full-volume protocol. We generated libraries from total RNA using Illumina’s TruSeq Stranded Total RNA Library Prep kit with Ribo-Zero ribosomal RNA reduction chemistry, which depletes the highly abundant ribosomal RNA transcripts (Illumina RS-122-2201). Libraries were run on a Bioanalyzer for fragment traces and sequenced on the Illumina HiSeq 4000 at the Vincent J Coates Genomics Sequencing Center.

Mapping of sequencing reads and data normalization

We mapped all ChIP-seq datasets to the D. miranda genome assembly [14] to the D. melanogaster Release 6 of the genome assembly and annotation [61]. For all ChIP-seq datasets, we used Bowtie2 [62] to map reads to both genomes separately, using the parameters “-D 15 –R 2 –N 0 –L 22 –i S,1,0.50—no-1-mm-upfront”, which allowed us to reduce cross-mapping to the D. melanogaster genome to approximately 1% of 100bp-paired end reads [63]. All ChIP and input samples were processed with bedtools coverage–counts [64]. To calculate the ChIP signal without the D. melanogaster spike-in normalization, we normalized all counts files by library size, and calculated the coverage across 5kb windows for both the ChIP and the input D. miranda alignments. These values were then normalized by the median autosome coverage of euchromatic arms (Muller B and Muller E) to obtain ChIP signal (ChIP/Input) for each sample. To calculate signals normalized by the D. melanogaster spike-in, we used three different approaches (see S1 Fig). First, following the approach from [63], we took the ChIP enrichment from the no-spike normalization method and multiplied these values by a scale factor. The scale factor c is calculated as follows with normal script indicating the number of reads mapped for the given sequencing sample and subscripts indicating the specific species to which reads were mapped:

c = 1 + ChIPD.miranda/(InputD. melanogaster/InputD. miranda)/(InputD. melanogaster+ChIPD. melanogaster). In this respect, we calculated normalization factor c based on the number of mapped reads and this is irrespective of mapping location. We also used a regression-based method from [65]. Briefly, we calculated the median ChIP signal for each sample’s D. melanogaster spike-ins. Then, the ChIP signal in each D. miranda sample was adjusted by a logistic regression based on each sample’s genome-wide D. melanogaster spike-in signal. In a separate spike-in normalization approach, we used a quantile-quantile method [55]. A reference ChIP signal for all D. melanogaster spike-ins was made by aggregating all D. melanogaster ChIP and input coverages, respectively. Then, all individual D. melanogaster spike-ins were adjusted to this reference and the corresponding D. miranda ChIP signals were adjusted accordingly. We emphasize that all spike-in normalization methods do not use specific regions of the genome and are based only on the mapping and/or ChIP enrichment of the D. melanogaster spike-in samples. All three spike-in normalization methods as well as the no-spike normalization procedure yielded quantitatively similar results: young males and females show similar enrichment of H3K9me3 along their pericentromeres; both old males and females show less H3K9me3 enrichment than young flies, and the loss of heterochromatin is more pronounced in males than females, and finally, the neo-Y chromosome generally shows less heterochromatin than pericentromeres (S1 Fig). We further confirmed the consistency of these normalizations by showing that 5kb windows in euchromatin with elevated H3K9me3 enrichment are found consistently in all normalization methods (S10 Table). We used the no-spike normalization procedure for our analysis, and one sample with low ChIP efficacy (old male 1; see S1 Fig) was excluded from the analysis.

The boundaries of euchromatic chromosome arms versus pericentromeric heterochromatin was defined using repeat-enrichment, using a cut-off of 40% repeat-masked DNA across 100 kb sliding windows away from the centromere (from [24]. For the neo-Y chromosome, we identified the pericentromeric region based on increased H3K9me3 signal surrounding the centromeric satellite sequences (see [14]), which roughly corresponds to 9.8Mb of sequences at the end of neo-Y scaffold 1 (S19 Fig).

Gene expression analysis

For each replicate of RNA-seq data, we first mapped RNA-seq reads to a repository of ribosomal DNA scaffolds from NCBI and removed all reads that mapped to this scaffold. Although total RNA library preparation aims to remove the bulk of rRNA transcript, any additional differences in rRNA abundance in sequenced samples are likely to be technical artifacts. We then mapped the remaining reads to the D. miranda genome and to the repeat libraries separately using HISAT2 [66] with default parameters. We then used Subread FeatureCounts [67] to calculate gene counts and repeat counts and used DESeq2 [68] to normalize the libraries and perform differential expression analysis between male/female flies of different ages. In an analogous approach, we used TECount from TEtranscripts [33] to estimate gene and TE expression counts across samples. We used TECounts with default parameters on RNA-seq reads mapped to the D. miranda genome. We also used DESeq2 [68] to perform the differential expression analysis with these counts tables. Both counting methods for repeats yielded similar results for each sample.

For a non-reference based approach, we used dnaPipeTE [69] on RNA-seq reads using the following parameters: -coverage 0.5 -genome-size 200000000 -sample-number 2. We then calculated the proportion of TEs and repeats identified in each sample.

Repeat libraries

We used two repeat libraries for the ChIP-seq and RNA-seq analyses. Our first library was the consensus sequences of known repetitive elements identified in the D. pseudoobscura group [70] and the second was based on the consensus sequences of known satellites identified in D. miranda [14]. The following methods were employed for both repeat libraries.

ChIP enrichment/RNA expression of specific repeats (and by chromosome)

To assess H3K9me2 signal in repetitive elements, we took a similar approach as we did for calculating ChIP enrichment profiles across the genome. First, we mapped both ChIP and input sequencing reads to the D. pseudoobscura family TE library/satellite DNA library using Bowtie2 [62] and the parameters “-D 15 –R 2 –N 0 –L 22 –i S,1,0.50—no-1-mm-upfront”. We then calculated the mean coverage across each repetitive element using Bedtools [64] coverage, and normalized the coverage by the median autosome coverage. We then calculated the ChIP signal at each repeat by dividing the ChIP coverage by the corresponding input coverage. This method accounts for differences in copy number of the repetitive elements by dividing the ChIP coverage by each repeat’s coverage in the input. In a complementary approach, we took ChIP mappings to the entire genome and called enrichment at repetitive elements through TECounts (part of TETranscripts) [33]. To call for repeat enrichment on a per chromosome basis, we split mappings by chromosome and ran TECounts separately for each split mapping. We then normalized counts at repeats in both the ChIP and input by mean autosomal coverage and calculated ChIP enrichment as ChIP/Input.

ChIP profiling at transcription start sites (TSS)

To calculate H3K9me3 enrichment around genes on the neo-Y or pericentromeric regions, we followed the BedTools pipeline for estimating transcription factor occupancy at transcription start sites [64]. Briefly, we calculated genome-wide ChIP enrichment at 100bp resolution as described previously. We then generated intervals flanking transcription start sites of genes on the neo-Y and pericentromeric regions using BedTools slop. We then used Bedtools map to map our ChIP enrichments to these flanking regions and Bedtools groupby to aggregate enrichment profiles across all TSS.

Immunofluorescence staining in mitotic chromosome spreads

We followed an immunofluorescence protocol from [71]. We dissected brains from male 3rd instar larvae in 0.7% NaCl. We incubated brains first in 0.5% sodium citrate for 8 minutes and then in 2% paraformaldehyde in 45% acetic acid fixative solution for 8 minutes. We transferred 4 brains to small drops of fixative solution on a 18mmx18mm siliconized coverslip and placed a clean glass slide on top. The slides were pressed with a thumb for 1.5 minutes and then frozen in liquid nitrogen until bubbling stopped. We used a razor blade to pop off the coverslip and immediately immersed slides in PBS (pH 7.4). The slides were then treated with 100uL of 20units/mL DNAse1 (Thermofisher #89836) in 50mM Tris-HCl pH 8.0, 5mM MgCl2, and 0.05mg/mL BSA for 4.5 minutes. Treatment with DNAse1 increases antibody binding to the H3K9me3 histone mark as reported in [71]. The slides were then washed 3x in PBS for 5 minutes/wash and incubated in 1% Triton X-100 in PBS for 10 minutes. Slides were blocked in 5% nonfat dry milk in PBS for 30 minutes, washed in PBS for 5 minutes, and then incubated with primary antibody (1:300 dilution, Abcam ab8898) in 3% BSA in PBS for 1 hour first at room temperature and later overnight at 4°C in a box lined with wet paper towels. We washed slides 3x in PBS for 5 minutes/wash and then incubated them with secondary antibody Goat anti-rabbit conjugated with Alexa Fluor-568 in 3% BSA in PBS (1:1000 dilution, Thermofisher #A-11011) for 1 hour at room temperature in a box lined with wet paper towels and covered with aluminum foil. Slides were washed 3x in PBS for 5 minutes/wash and dried for 10 minutes in the dark. We mounted slides using Vectashield antifade mounting medium with DAPI (Vector Laboratories H-1200-10) on a coverslip and then sealed the edges with nail polish. Slides were imaged using a Nikon Spinning Disc Confocal microscope and the Nikon Elements AR software at the UC Berkeley Cancer Research Laboratory Molecular Imaging Center. We used Fiji/ImageJ to adjust image brightness and contrast.

Supporting information

S1 Fig. ChIP enrichment across samples using four different normalization strategies (5kb windows; see Materials & Methods for details).

From top to bottom: No spike in normalization: ChIP enrichment normalized by coverage of the autosomes for all biological samples and replicates, ignoring the spike-in. Brown et al. (2020) normalization: ChIP enrichment normalized by assessing proportion of reads derived from sample and spike-in both the ChIP and Input. Bonhoure et al. (2014) normalization: ChIP enrichment normalized by employing a regression-based method using the D. melanogaster spike-in. Wei et al. (2020) normalization: ChIP enrichment normalized by employing a quantile-quantile method using the D. melanogaster spike-in. A. Enrichment for each sample replicate. B. Average enrichment across samples. We omitted the first Old Male sample for all averaged Old Male values because it had lower enrichment signal compared to other samples in both the sample and the spike-in, indicative of inefficient ChIP pull-down.

(PDF)

S2 Fig. ChIP signal after subsampling reads to 37 million reads for each replicate.

A. Enrichment for each sample replicate. B. Average enrichment across samples. Enrichment is normalized without spike-in and omits the failed Old Male replicate.

(PDF)

S3 Fig. ChIP enrichment across samples (average across replicates) with only uniquely mapped reads (5kb windows) using ChIP signal normalized by autosome coverage (3 replicates used for the Old Male category).

(PDF)

S4 Fig. Input coverage across sexes and ages.

(Top) Coverage of the input DNA across samples normalized by the median autosomal coverage. (Bottom) Mean coverage of the input DNA across samples.

(PDF)

S5 Fig. Genome-wide ChIP enrichment with ± 1 standard error (denoted in yellow).

1 tick mark = 1Mb on the x-axis.

(PDF)

S6 Fig. Differential ChIP enrichment between the sexes.

Each data point represents enrichment of old/young male (left) or female (right) samples at a 5---kb window. Data highlighted in yellow denote windows with 50% higher/lower enrichment between samples (p < 0.05, two-tailed Student’s t-test).

(PDF)

S7 Fig. Regions with at least 1.5x H3K9me3 up-/down-regulation within sexes over aging in 5kb-window resolution.

Data values represent the difference of means from old and young samples. One tick mark on the x-axis is equal to 2Mb.

(PDF)

S8 Fig. Enrichment of H3K9me3 (in 5kb windows) at the pericentromeres and 2Mb downstream for A.

males and B. females. Boundaries are indicated by darker region on the chromosome diagrams of the major chromosome arms. Subtraction plots show higher H3K9me2 signal in young (blue) or old (red) flies. Data represents the mean of 4 biological replicates (3 for old males; in black) or the difference of means (subtraction plots, blue or red).

(PDF)

S9 Fig. Sex-specific H3K9me3 gain/loss by compartment (euchromatin/heterochromatin/neo-Y) over aging.

(PDF)

S10 Fig. Immunofluorescence staining for H3K9me3 in males (A-I, replicates from different slides) and females (J).

For all figures, arrowhead denotes neo-Y, arrows denote MullerA-AD, DAPI/DNA is blue channel, H3K9me3 is orange channel, and scale bar is 50μm.

(PDF)

S11 Fig. Repeat composition per chromosome.

Plotted for each Muller element is the total length of repeats (bp) divided by the entire chromosome length (bp).

(PDF)

S12 Fig. ChIP enrichment differences between young and old females and males.

Values represent the difference of means between young and old samples, grouped by chromosome and binned by amount of repeats (%) per 5kb-window.

(PDF)

S13 Fig. Volcano plots for TE expression by sex and age.

Left: (Top) TEs in young females v. young males, (bottom) TEs in old females vs old males. Fold change is indicative of male/female. Right: (Top) TEs in young males vs old males, (bottom) TEs in young females vs old females. Fold change is indicative of young/old.

(PDF)

S14 Fig. TE expression by sex/age and by chromosome group (Autosomes/X and Neo-Y).

(PDF)

S15 Fig. Male TE expression from the Autosomes/X and the Neo-Y in young (left) and old males (right).

Diagonal line denotes 1:1 equal expression for TEs from both chromosome groups.

(PDF)

S16 Fig. Sex-differences in TE expression normalized by sex-differences in TE copy number.

A-C. Shown are expression values for each biological sample accounting for TE copy number differences in the males and females when copies are calculated from A. adult gDNA, B. embryo gDNA, and C. genome repeat annotation. Expression values for each TE were normalized by their respective copy number in the male or female genome. Data plotted represents the averaged values of old and young female and male samples (3 replicates). Significance values calculated (n.s. not significant, * p<0.05, Wilcoxon test). D-F. Shown are Log2(Male/Female Expression) values for each TE along with the corresponding Log2(Male/Female DNA) values calculated from D. adult gDNA, E. embryo gDNA, and F. genome repeat annotation. Line indicates male/female expression is proportional to the male/female copy number of a specific TE.

(PDF)

S17 Fig. Volcano plots for satellite DNA expression by sex and age.

Left: (Top) Satellites in young females v. young males, (bottom) satellites in old females vs old males. Fold change is indicative of male/female. Right: (Top) Satellites in young males vs old males, (bottom) satellites young females vs old females. Fold change is indicative of young/old.

(PDF)

S18 Fig. Heterochromatin and expression of pericentromeric and neo-Y genes.

A. Log2 expression of genes located in the pericentromeres within sexes. Data represents mean values of 3 replicates with bars denoting standard error. Data highlighted in yellow denotes genes with significantly 50% higher or lower expression when comparing fold-change between samples (Wald test, p < 0.05). Grey line indicates 1:1 expression between samples. Boxplot values represent mean values of 3 replicates with significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5). B. Log2 expression of genes located on the neo-Y between young and old males. Data represented in a similar manner as in A and data highlighted shows genes that are up-regulated (top-left) and/or down-regulated (bottom-right) by 50% during aging (Wald test, p < 0.05). C. Age-related differences in log2 H3K9me3 enrichment and log2 expression of neo-Y genes. Each data represents the difference of mean values between young and old males. Color-code indicates differentially expressed/enriched comparisons: black: absolute log2(fold-change) < 1in both H3K9me3 and expression; yellow: absolute log2(fold-change)> = 1 in both H3K9me3 and expression. D. Log2 expression differences in genes on the neo-Y and their respective gene copy number. Values denote the difference of mean values between young and old males. Grey line denotes 0 expression difference between the ages.

(PDF)

S19 Fig

Heterochromatin cutoffs (dashed orange line) by % repeats overlap with regions of elevated H3K9me3 enrichment on (A) Muller B, (B) Muller E, (C) Muller C, and (D) Muller AD.

(PDF)

S1 Table. Overview of ChIP data generated.

Description of samples/tissues used for 4 rounds/batches of ChIP-seq experiments.

(PDF)

S2 Table. Mapping statistics of ChIP data.

Description of mapping statistics for ChIP-experiment data for each batch. Reported numbers of reads mapped to the sample and spike-in genomes.

(PDF)

S3 Table. Pearson coefficients between replicates of same biological sample.

We used H3K9me3 enrichment in 5kb windows from the autosomes and X chromosomes to conduct Pearson correlations between each pair of replicates. Pearson correlations were calculated using the base R function cor.test(replicateA, replicateB, method =“pearson”).

(PDF)

S4 Table. Proportion of 5kb-windows in top 20 percentile H3K9me3 enrichment shared between normalization methods.

For each normalization method, we identified the top 20% most enriched 5kb windows in euchromatin (N = 4646). We then identified how many 5kb regions were shared between each normalization method and reported those results as a percentage (no. windows shared between two methods / no. windows in top 20% enrichment).

(PDF)

S5 Table. Mapping statistics of ChIP data for MAPQ > 3 alignments.

(PDF)

S6 Table. Heterochromatin-related genes and their expression change in old males and female.

(PDF)

S7 Table. Overview of RNA data generated.

Description of samples/tissues used for 3 rounds/batches of RNA---seq experiments.

(PDF)

S8 Table. Mapping and counts statistics of RNA data.

Description of mapping statistics for RNA-seq experiment data for each batch. Reported gene and repeat counts by Subread featurecounts and TEtranscripts where applicable.

(PDF)

S9 Table. Mapping and counts statistics of RNA data filtered for MAPQ > 40.

(PDF)

S10 Table. DNAPipeTE estimates of repeats in RNASeq data.

(PDF)

S1 Appendix. D. melanogaster as a spike-in to normalize D. miranda ChIP’s.

(PDF)

S2 Appendix. Cross-mapping between the D. miranda neo-X and neo-Y chromosome.

(PDF)

Acknowledgments

We thank Laura Fanti for help with the immunostaining protocol.

Data Availability

All the sequencing data have been posted on GenBank under BioProject PRJNA644734.

Funding Statement

This work was supported by NIH grants (nos. R01GM076007, R01GM101255 and R01AG057029) to DB. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Tarka M, Guenther A, Niemelä PT, Nakagawa S, Noble DWA. Sex differences in life history, behavior, and physiology along a slow-fast continuum: a meta-analysis. Behav Ecol Sociobiol 2018; 72:132–13. 10.1007/s00265-018-2534-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Maklakov AA, Lummaa V. Evolution of sex differences in lifespan and aging: causes and constraints. Bioessays. 2013; 35:717–724. 10.1002/bies.201300021 [DOI] [PubMed] [Google Scholar]
  • 3.Xirocostas ZA, Everingham SE, Moles AT. The sex with the reduced sex chromosome dies earlier: a comparison across the tree of life. Biol Lett. 2020; 16:20190867. 10.1098/rsbl.2019.0867 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Pipoly I, Bókony V, Kirkpatrick M, Donald PF, Székely T, Liker A. The genetic sex-determination system predicts adult sex ratios in tetrapods. Nature. 2015; 527:91–94. 10.1038/nature15380 [DOI] [PubMed] [Google Scholar]
  • 5.Larson K, Yan S-J, Tsurumi A, Liu J, Zhou J, Gaur K, Guo D, Eickbush TH, Li WX. Heterochromatin formation promotes longevity and represses ribosomal RNA synthesis. PLoS Genet. 2012; 8:e1002473. 10.1371/journal.pgen.1002473 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Haithcock E, Dayani Y, Neufeld E, Zahand AJ, Feinstein N, Mattout A, Gruenbaum Y, Liu J. Age-related changes of nuclear architecture in Caenorhabditis elegans. Proc Natl Acad Sci USA 2005; 102:16690–16695. 10.1073/pnas.0506955102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Marais GAB, Gaillard J-M, Vieira C, Plotton I, Sanlaville D, Gueyffier F, Lemaitre JF. Sex gap in aging and longevity: can sex chromosomes play a role? Biol Sex Differ. 2018; 9:33–14. 10.1186/s13293-018-0181-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Wood JG, Jones BC, Jiang N, Chang C, Hosier S, Wickremesinghe P, Garcia M, Hartnett DA, Burhenn L, Neretti N, Helfand SL. Chromatin-modifying genetic interventions suppress age-associated transposable element activation and extend life span in Drosophila. Proc Natl Acad Sci USA. 2016; 113:11277–11282. 10.1073/pnas.1604621113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wood JG, Hillenmeyer S, Lawrence C, Chang C, Hosier S, Lightfoot W, Mukherjee E, Jiang N, Schorl C, Brodsky AS, Neretti N, Helfand SL. Chromatin remodeling in the aging genome of Drosophila. Aging Cell. 2010; 9:971–978. 10.1111/j.1474-9726.2010.00624.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.De Cecco M, Criscione SW, Peckham EJ, Hillenmeyer S, Hamm EA, Manivannan J, Peterson AL, Kreiling JA, Neretti N, Sedivy JM. Genomes of replicatively senescent cells undergo global epigenetic changes leading to gene silencing and activation of transposable elements. Aging Cell. 2013; 12:247–256. 10.1111/acel.12047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Li W, Prazak L, Chatterjee N, Grüninger S, Krug L, Theodorou D, Dubnau J. Activation of transposable elements during aging and neuronal decline in Drosophila. Nat Neurosci. 2013; 16:529–531. 10.1038/nn.3368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Brown EJ, Nguyen AH, Bachtrog D. The Y chromosome may contribute to sex-specific ageing in Drosophila. Nat Ecol Evol. 2020; 37:466–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chang C-H, Larracuente AM. Heterochromatin-Enriched Assemblies Reveal the Sequence and Organization of the Drosophila melanogaster Y Chromosome. Genetics. 2019; 211:333–348. 10.1534/genetics.118.301765 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Mahajan S, Wei KH-C, Nalley MJ, Gibilisco L, Bachtrog D. De novo assembly of a young Drosophila Y chromosome using single-molecule sequencing and chromatin conformation capture. PLoS Biol 2018; 16:e2006348. 10.1371/journal.pbio.2006348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Steinemann M, Steinemann S. Degenerating Y chromosome of Drosophila miranda: a trap for retrotransposons. Proc Natl Acad Sci USA. 1992; 89:7591–7595. 10.1073/pnas.89.16.7591 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bachtrog D. Expression profile of a degenerating neo-y chromosome in Drosophila. Curr Biol. 2006; 16:1694–1699. 10.1016/j.cub.2006.07.053 [DOI] [PubMed] [Google Scholar]
  • 17.Bachtrog D, Hom E, Wong KM, Maside X, de Jong P. Genomic degradation of a young Y chromosome in Drosophila miranda. Genome Biol. 2008; 9:R30. 10.1186/gb-2008-9-2-r30 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhou Q, Bachtrog D. Sex-specific adaptation drives early sex chromosome evolution in Drosophila. Science. 2012; 337:341–345. 10.1126/science.1225385 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ellison CE, Bachtrog D. Dosage compensation via transposable element mediated rewiring of a regulatory network. Science. 2013; 342:846–850. 10.1126/science.1239552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bachtrog D, Charlesworth B. Reduced adaptation of a non-recombining neo-Y chromosome. Nature. 2002; 416:323–326. 10.1038/416323a [DOI] [PubMed] [Google Scholar]
  • 21.Bachtrog D. Accumulation of Spock and Worf, two novel non-LTR retrotransposons, on the neo-Y chromosome of Drosophila miranda. Mol Biol Evol. 2003; 20:173–181. 10.1093/molbev/msg035 [DOI] [PubMed] [Google Scholar]
  • 22.Bachtrog D, Mahajan S, Bracewell R. Massive gene amplification on a recently formed Drosophila Y chromosome. Nat Ecol Evol. 2019; 3:1587–1597. 10.1038/s41559-019-1009-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhou Q, Ellison CE, Kaiser VB, Alekseyenko AA, Gorchakov AA, Bachtrog D. The epigenome of evolving Drosophila neo-sex chromosomes: dosage compensation and heterochromatin formation. PLoS Biol. 2013; 11:e1001711. 10.1371/journal.pbio.1001711 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bracewell R, Chatla K, Nalley MJ, Bachtrog D. Dynamic turnover of centromeres drives karyotype evolution in Drosophila. Elife. 2019; 8:923. 10.7554/eLife.49002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Yoon JS, Gagen KP, Zhu DL. Longevity of 68 species of Drosophila. Ohio J Sci. 1990; 90:16–32. [Google Scholar]
  • 26.Tower J, Arbeitman M. The genetics of gender and life span. J Biol. 2009; 8:38–3. 10.1186/jbiol141 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Brenman-Suttner DB, Yost RT, Frame AK, Robinson JW, Moehring AJ, Simon AF. Social behavior and aging: A fly model. Genes Brain Behav. 2020; 19:e12598. 10.1111/gbb.12598 [DOI] [PubMed] [Google Scholar]
  • 28.Li X-Y, Harrison MM, Villalta JE, Kaplan T, Eisen MB. Establishment of regions of genomic activity during the Drosophila maternal to zygotic transition. Elife. 2014; 3:e1003428. 10.7554/eLife.03737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sun L, Yu R, Dang W. Chromatin Architectural Changes during Cellular Senescence and Aging. Genes. 2018; 9:211. 10.3390/genes9040211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Elgin SCR, Reuter G. Position-effect variegation, heterochromatin formation, and gene silencing in Drosophila. Cold Spring Harb Perspect Biol. 2013; 5:a017780–a017780. 10.1101/cshperspect.a017780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Allshire RC, Madhani HD. Ten principles of heterochromatin formation and function. Nat Rev Mol Cell Biol. 2018; 19:229–244. 10.1038/nrm.2017.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Molaro A, Malik HS. Hide and seek: how chromatin-based pathways silence retroelements in the mammalian germline. Curr Opin Genet Dev. 2016; 37:51–58. 10.1016/j.gde.2015.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jin Y, Tam OH, Paniagua E, Hammell M. TEtranscripts: a package for including transposable elements in differential expression analysis of RNA-seq datasets. Bioinformatics 2015; 31:3593–3599. 10.1093/bioinformatics/btv422 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kuhn GCS. Satellite DNA transcripts have diverse biological roles in Drosophila. Heredity 2015; 115:1–2. 10.1038/hdy.2015.12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Usakin L, Abad J, Vagin VV, de Pablos B, Villasante A, Gvozdev VA. Transcription of the 1.688 satellite DNA family is under the control of RNA interference machinery in Drosophila melanogaster ovaries. Genetics 2007; 176:1343–1349. 10.1534/genetics.107.071720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Topp CN, Zhong CX, Dawe RK. Centromere-encoded RNAs are integral components of the maize kinetochore. Proc Natl Acad Sci USA. 2004; 101:15986–15991. 10.1073/pnas.0407154101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mills WK, Lee YCG, Kochendoerfer AM, Dunleavy EM, Karpen GH. RNA from a simple-tandem repeat is required for sperm maturation and male fertility in Drosophila melanogaster. Elife. 2019; 8:229. 10.7554/eLife.48940 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wei KH-C, Lower SE, Caldas IV, Sless TJS, Barbash DA, Clark AG. Variable Rates of Simple Satellite Gains across the Drosophila Phylogeny. Mol Biol Evol. 2018; 35:925–941. 10.1093/molbev/msy005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Riddle NC, Shaffer CD, Elgin SCR. A lot about a little dot—lessons learned from Drosophila melanogaster chromosome 4. Biochem Cell Biol. 2009; 87:229–241. 10.1139/O08-119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Yasuhara JC, DeCrease CH, Wakimoto BT. Evolution of heterochromatic genes of Drosophila. Proc Natl Acad Sci USA. 2005; 102:10958–10963. 10.1073/pnas.0503424102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Yasuhara JC, Wakimoto BT. Molecular landscape of modified histones in Drosophila heterochromatic genes and euchromatin-heterochromatin transition zones. PLoS Genet. 2008; 4:e16. 10.1371/journal.pgen.0040016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Ivics Z, Izsvák Z. Repetitive elements and genome instability. Semin Cancer Biol. 2010; 20:197–199. 10.1016/j.semcancer.2010.08.002 [DOI] [PubMed] [Google Scholar]
  • 43.Bourque G, Burns KH, Gehring M, Gorbunova V, Seluanov A, Hammell M, Imbeault M, Izsvák Z, Levin HL, Macfarlan TS, Mager DL, Feschotte C. Ten things you should know about transposable elements. Genome Biol. 2018; 19:199. 10.1186/s13059-018-1577-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Werren JH. Selfish genetic elements, genetic conflict, and evolutionary innovation. Proc Natl Acad Sci USA 2011; 108 Suppl 2:10863–10870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hurst GD, Werren JH. The role of selfish genetic elements in eukaryotic evolution. Nat Rev Genet 2001; 2:597–606. 10.1038/35084545 [DOI] [PubMed] [Google Scholar]
  • 46.Charlesworth B, Langley CH. The evolution of self-regulated transposition of transposable elements. Genetics 1986; 112:359–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Cosby RL, Chang N-C, Feschotte C. Host-transposon interactions: conflict, cooperation, and cooption. Genes Dev. 2019; 33:1098–1116. 10.1101/gad.327312.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.O’Sullivan RJ, Karlseder J. The great unravelling: chromatin as a modulator of the aging process. Trends Biochem Sci. 2012; 37:466–476. 10.1016/j.tibs.2012.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bousios A, Nützmann H-W, Buck D, Michieletto D. Integrating transposable elements in the 3D genome. Mob DNA. 2020; 11:8–10. 10.1186/s13100-020-0202-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Charlesworth B, Charlesworth D. The degeneration of Y chromosomes. Philos Trans R Soc Lond, B, Biol Sci. 2000; 355:1563–1572. 10.1098/rstb.2000.0717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Hill WG, Robertson A. The effect of linkage on limits to artificial selection. Genet Res. 1966; 8:269–294. [PubMed] [Google Scholar]
  • 52.Bachtrog D Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration. Nat Rev Genet. 2013; 14:113–124. 10.1038/nrg3366 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Pimpinelli S, Berloco M, Fanti L, Dimitri P, Bonaccorsi S, Marchetti E, Caizzi R, Caggese C, Gatti M. Transposable elements are stable structural components of Drosophila melanogaster heterochromatin. Proc Natl Acad Sci USA. 1995; 92:3804–3808. 10.1073/pnas.92.9.3804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Gatti M, Pimpinelli S. Functional elements in Drosophila melanogaster heterochromatin. Annu Rev Genet. 1992; 26:239–275. 10.1146/annurev.ge.26.120192.001323 [DOI] [PubMed] [Google Scholar]
  • 55.Wei KH-C, Gibilisco L, Bachtrog D. Epigenetic conflict on a degenerating Y chromosome increases mutational burden in Drosophila males. Nat Commun. 2020;11: 5537–9. 10.1038/s41467-020-19134-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Frost B, Hemberg M, Lewis J, Feany MB. Tau promotes neurodegeneration through global chromatin relaxation. Nat Neurosci. 2014;17: 357–366. 10.1038/nn.3639 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Van Meter M, Kashyap M, Rezazadeh S, Geneva AJ, Morello TD, Seluanov A, Gorbunova V. SIRT6 represses LINE1 retrotransposons by ribosylating KAP1 but this repression fails with stress and age. Nat Commun. 2014;5: 5011–10. 10.1038/ncomms6011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.De Cecco M, Ito T, Petrashen AP, Elias AE, Skvir NJ, Criscione SW, et al. L1 drives IFN in senescent cells and promotes age-associated inflammation. Nature. 2019;566: 73–78. 10.1038/s41586-018-0784-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Chen H, Zheng X, Zheng Y. Age-associated loss of lamin-B leads to systemic inflammation and gut hyperplasia. Cell. 2014;159: 829–843. 10.1016/j.cell.2014.10.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Brind’Amour J, Liu S, Hudson M, Chen C, Karimi MM, Lorincz MC. An ultra-low-input native ChIP-seq protocol for genome-wide profiling of rare cell populations. Nat Commun. 2015; 6:6033. 10.1038/ncomms7033 [DOI] [PubMed] [Google Scholar]
  • 61.Hoskins RA, Carlson JW, Wan KH, Park S, Mendez I, Galle SE, et al. The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Res. 2015; 25:445–458. 10.1101/gr.185579.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9:357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Brown EJ, Nguyen AH, Bachtrog D. The Drosophila Y chromosome affects heterochromatin integrity genome-wide. Mol. Biol. Evol. 2020; 37: 2808–2824. 10.1093/molbev/msaa082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010; 26:841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Bonhoure N, Bounova G, Bernasconi D, Praz V, Lammers F, Canella D, Willis IM, Herr W, Hernandez N, Delorenzi M. Quantifying ChIP-seq data: a spiking method providing an internal reference for sample-to-sample normalization. Genome Res. 2014; 24:1157–1168. 10.1101/gr.168260.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015; 12:357–360. 10.1038/nmeth.3317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014; 30:923–930. 10.1093/bioinformatics/btt656 [DOI] [PubMed] [Google Scholar]
  • 68.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15:550–21. 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Goubert C, Modolo L, Vieira C, ValienteMoro C, Mavingui P, Boulesteix M. De novo assembly and annotation of the Asian tiger mosquito (Aedes albopictus) repeatome with dnaPipeTE from raw genomic reads and comparative analysis with the yellow fever mosquito (Aedes aegypti). Genome Biol Evol. 2015; 7: 1192–1205. 10.1093/gbe/evv050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Hill T, Betancourt AJ. Extensive exchange of transposable elements in the Drosophila pseudoobscura group. Mob DNA 2018; 9:20. 10.1186/s13100-018-0123-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Fanti L, Perrini B, Piacentini L, Berloco M, Marchetti E, Palumbo G, et al. The trithorax group and Pc group proteins are differentially involved in heterochromatin formation in Drosophila. Chromosoma. 2008; Springer-Verlag;;117: 25–39. 10.1007/s00412-007-0123-7 [DOI] [PubMed] [Google Scholar]
  • 72.Kaplan EL, Meier P. Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association. 1958; 53:457–481. [Google Scholar]

Decision Letter 0

R Scott Hawley, Kirsten Bomblies

27 Oct 2020

Dear Dr Bachtrog,

Thank you very much for submitting your Research Article entitled 'Toxic Y chromosome: increased repeat expression and age-associated heterochromatin loss in male Drosophila with a young Y chromosome' to PLOS Genetics. Your manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. The reviers' most significant concerns appear to be:

Two issues need to be fully addressed before this study can be accepted for publication in PLoS Gen.

1. Orthogonal validation that H3K9me3 is indeed lower in the neo-Y chromosome

2. Detailed description and demonstrated robustness of their analysis with respect to repetitive sequences.

These will ensure that this study's two major findings are true (neo-Y has lower H3K9me3 enrichment and TEs are de-repressed in young males already).

There are also numerous concerns detailed in the two reviews.  While we are very aware  of this limits the pandemic places on doing additional experiments, we do ask that you do the experiments you can and repeat and detail the analyses and writing as fully as possible

We hope to be be able to review again a much-revised version.

Your evisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If possible, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

R. Scott Hawley

Associate Editor

PLOS Genetics

Kirsten Bomblies

Section Editor: Evolution

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this study, Nguyen and Bachtrog addressed an outstanding question in evolutionary genomics – how does the evolution of sex chromosome influence epigenomes and, accordingly, individual phenotypes ("aging"). Studies from the Bachtrog lab previously reported that, compared to females, silenced transposable elements (TEs) and satellite repeats are derepressed in aged D. melanogaster males due to the "toxic Y" effect. In this study, authors reported that in a species with recently evolved neo-Y chromosome, D. miranda, there is similar derepression of TEs and satellite repeats in males. However, the derepression of TEs is not restricted to aged males, but also present in young males. Authors suggested that the much larger neo-Y chromosome is a bigger "burden" and thus more toxic. Another major finding from the study is that, compared to other heterochromatic regions, neo-Y has lower enrichment of heterochromatic marks, and the loss of heterochromatin differs between males and females. While these findings are exciting and could provide new insights for the field, there are several potential caveats that need to be formally addressed to ensure the robustness of the authors' findings. Below I listed my specific comments.

Major issues

1. Most of the major conclusions from this study were drawn from the analysis of H3K9me3 enrichment and the expression of repetitive sequences (satellite repeats or TEs). Because of their repetitive nature, reads from these sequences will appear as "multi-mapped reads" in ChIP-seq or RNA-seq data. However, the authors did not provide enough details about how they address the multiple-mapping issue.

Some of the reported analyses made me especially confused. For instance, in page 6, the authors stated that the results are consistent when using 5kb windows with different percentages of repetitive sequences, even for windows with >99.5% repetitive sequences. This makes it sound like that the authors only focused on unique sequences when estimating H3K9me3 enrichment (but also see below). However, 0.005*5,000 = 25, meaning that the H3K9me3 estimates were based on merely 25bp of unique sequence?

If authors only used unique sequences, it is unclear what the filtering criteria are (e.g., mapping quality). If authors allowed multi-mapped reads, then how did bowtie assign multi-mapped reads, or how authors decided where a repetitive read came from?

The authors also compared the H3K9me3 enrichment level between TEs on the neo-Y and other pericentromeric heterochromatin. How did the authors distinguish repetitive TEs in different genomic locations? Did they use the H3K9me3 enrichment in the unique flanking sequence (if so, data need to be shown)?

In addition, how the RNA-seq comparison is normalized with respect to TE copy number is not clear. Especially, there are TE copies inside and outside heterochromatin regions. How the authors distinguished repetitive copies within and outside heterochromatin (and even more, on neo-Y and other chromosomes)?

A related question - the authors used D. melanogaster spike-in as a control to normalize between samples. While there is enough genetic divergence in unique sequences between the two species, it is unclear whether repetitive sequences also diverge enough to avoid cross-mapping. Could it be possible that D. melanogaster repeats from the spike-in confound the estimates for D. miranda?

2. One major finding from this study is that, compared to pericentromeric heterochromatin of other chromosomes, neo-Y has lower enrichment of H3K9me. Several aspects of this finding need to be further addressed.

- How non-unique sequences (not necessarily repetitive) were treated in the analysis? I presume that the neo-Y still has many shared sequences with the neo-X. A neo-Y ChIP-seq read could also map to neo-X, leading to averaging out the H3K9me3 enrichment level of estimated for neo-Y.

- Because of the uncertainty of how repetitive (#1) and non-unique (see above) sequences are treated in the reported ChIP-seq analysis, I feel it is necessary to use orthogonal methods to demonstrate that neo-Y indeed has lower enrichment of H3K9me3. A straightforward IF experiment should be able to address this.

- Authors suggested that the lower enrichment of H3K9m3 on the neo-Y could be due to the active transcription of genes on the neo-Y. This possibility could and should be further investigated, potentially with existing data. For example, authors can compare the H3K9me3 enrichment level in windows with different distances to transcribing genes, genes with different transcription levels etc, for both neo-Y and heterochromatin on other chromosomes.

- A related and important question - it is intriguing how authors defined heterochromatin differs between neo-Y (using H3K9me3 gradient) and other chromosomes (using % of repeats). Heterochromatin is genomic regions enriched with H3K9me3. I am not sure why the authors chose other metrics for chromosomes other than the neo-Y. I also wonder whether some results could have been driven by this difference in methods between neo-Y and other heterochromatin.

Other comments

3. Authors should include line numbers – it is hard to refer to specific passages without line numbers.

4. In order to know that authors' normalization methods work, it would be helpful to show some euchromatic loci/genes that are known to be stably enriched for K9. Especially, I am concerned about the cross-mapping of D. melanogaster sequences from the spike-in (#1).

A related question – Did the authors use orthologous euchromatic regions? Or any euchromatic regions? Only unique reads? If allowing any euchromatic regions and could be multi-mapped reads, I am worrying that the normalization could be influenced by the difference in H3K9me3 level on the neo-Y between samples (since neo-X and neo-Y still share some sequences, also see #2).

5. I am confused with the authors' argument that active TE families are less epigenetically silenced, and this could explain the lower K9 enrichment on the neo-Y (page 6). At least in D. melanogaster, LTR families are recently active, targeted by a large amount of small RNAs, and more epigenetically silenced.

6. Authors need to explain why the epigenetic silencing of simple repeats and TEs in young and old males differ in D. miranda and D. melanogaster. The toxic Y hypothesis suggests that the source/sink relationship is the key, but does not distinguish between repeat types. This is central to the main theme of this study and needs to be discussed in depth.

Related to this - discussion on page 12 is confusing.

Authors started with that repeats on the Y lead to the toxic effect, but then stated that the differences between D. melanogaster and D. simulans are due to their difference in TE content on the Y. Authors failed to explain why the different TE content matters and whether the toxic Y theory is relevant only for TEs or both repeats.

7. (Page 16) Authors excluded one old male sample (old male 1) based on the boxplot in Figure S1. However, the criterion for excluding that sample is unclear. By eyeballing, I feel that young female 2 is an outlier to other replicates and could also be excluded. Authors may consider more formal analysis (e.g., IDR) to include/exclude a replicate.

8. (Page 5) Authors reported that the majority of the windows that lost K9 are in pericentromeric heterochromatin or neo-Y. However, these genomic regions are also enriched with K9-enriched windows. It could be that the whole genome is losing K9. A more appropriate analysis would be to compare the proportion of windows in each genomic compartment (Het, neo-Y, Eu) that lost K9.

Related to this - it is intriguing that the number of windows with increased K9 is also more for males than females. I wonder whether this has to do with the uneven coverage between males and females (assuming sequencing depth is the same, the average coverage would be higher for females due to the overall smaller genome size). This could lead to a larger variance for estimates in males. This could potentially be addressed by downsampling female data to the same genome coverage.

9. Authors mentioned that male-male aggression could also lead to lower survival of the males. Since the authors mentioned it, it would be helpful to provide either a rationale why this did not impact the authors' results or some empirical validation (e.g., rear flies individually to estimate the lifespan).

10. In Figure S13, no p-values were provided (neither in the text). By looking at the figure, there are no obvious differences in TE expression between males and females (which contradicts with authors' claim on page 9).

11. I am not sure what H3K9me3 Enrichment "Signal" refers to in Figure 2 and some other figures. I think the authors meant "H3K9me3 Fold Enrichment"?

12. Some of the numberings of supplementary figures are wrong (e.g., Figure S11 on page 9, some of those should be Figure S12).

Reviewer #2: This is a very nice manuscript describing sex-specific derepression of transposable elements with age in a Drosophila species with a large Y chromosome. The manuscript is well-written and clear, though I would like to see some minor points addressed.

An alternative to the toxic-Y explanation for some of the results is a male-specific decline in proteins required for heterochromatin formation. I'd agree that this is less likely, but perhaps they can address this alternative with a few more analyses of the RNAseq and ChIPseq data already collected-- e.g., is there a decline in expression in males with age of any relevant genes that is strong relative to that in females? Are overall H3K9me3 levels similar between males and females of the same age? This may be particularly important to show for X-linked genes, particularly as the neo-X in this group is, as I recall, not fully dosage compensated.

While I appreciate that the authors focussed on well-founded conclusions, I did feel they could speculate a bit more about the proximate causes of the hypothesized toxic Y-effect. e.g., TE activity leading to DNA damage and cell-death in the soma. On a related point, there is something of a lack of non-fly literature on this topic cited (e.g.,de Cecco et al Nature volume 566, pages73–78(2019)), where some of these details have been hammered out.

Similarly, I was surprised to see no mention of the essentially similar work of Lemos et al (e.g., PNAS September 7, 2010 107 (36) 15826-15831)

Finally, I am a wondering how well the attempt to correct expression for TE copy numbers works. The standardisation used, as I understand it, is to divide RNAseq reads by DNA reads. While logical, it is worth remembering that heterochromatin is underreplicated, so that TE copies that are effectively heterochromatin-ised by be underrepresented in the DNA data, and therefore undercounted. I don't think this materially affects their conclusions, but would appreciate an argument showing this, as it is effected by exactly the process they are trying to measure. Or, alternatively, they could correct with early embryo DNAseq data if any exists. (As heterochromatin does not yet form in early embryos.)

p8-- the word 'access' is used, but I think 'excess' is meant.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: No: There is a Genbank Bioproject number in the ms, but a search of Genbank brought nothing up. Presumably the data are currently embargoed, and will be available upon publication.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Decision Letter 1

R Scott Hawley, Kirsten Bomblies

22 Feb 2021

Dear Dr Bachtrog,

We commend the authors for a beautiful Response to Reviewers. The authors so much more than 'did the work'.  The Review Figures were both well done and very helpful.  (Perhaps the authors might wish to add them to the Supplementary Figures?) Either way, this is truly scholarly.  Thank you and congratulations on a terrific paper!

That said, we are pleased to inform you that your manuscript entitled "Toxic Y chromosome: increased repeat expression and age-associated heterochromatin loss in male Drosophila with a young Y chromosome" has been editorially accepted for publication in PLOS Genetics. Congratulations!

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

R. Scott Hawley

Associate Editor

PLOS Genetics

Kirsten Bomblies

Section Editor: Evolution

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01425R1

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

R Scott Hawley, Kirsten Bomblies

26 Mar 2021

PGENETICS-D-20-01425R1

Toxic Y chromosome: increased repeat expression and age-associated heterochromatin loss in male Drosophila with a young Y chromosome

Dear Dr Bachtrog,

We are pleased to inform you that your manuscript entitled "Toxic Y chromosome: increased repeat expression and age-associated heterochromatin loss in male Drosophila with a young Y chromosome" has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Katalin Szabo

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. ChIP enrichment across samples using four different normalization strategies (5kb windows; see Materials & Methods for details).

    From top to bottom: No spike in normalization: ChIP enrichment normalized by coverage of the autosomes for all biological samples and replicates, ignoring the spike-in. Brown et al. (2020) normalization: ChIP enrichment normalized by assessing proportion of reads derived from sample and spike-in both the ChIP and Input. Bonhoure et al. (2014) normalization: ChIP enrichment normalized by employing a regression-based method using the D. melanogaster spike-in. Wei et al. (2020) normalization: ChIP enrichment normalized by employing a quantile-quantile method using the D. melanogaster spike-in. A. Enrichment for each sample replicate. B. Average enrichment across samples. We omitted the first Old Male sample for all averaged Old Male values because it had lower enrichment signal compared to other samples in both the sample and the spike-in, indicative of inefficient ChIP pull-down.

    (PDF)

    S2 Fig. ChIP signal after subsampling reads to 37 million reads for each replicate.

    A. Enrichment for each sample replicate. B. Average enrichment across samples. Enrichment is normalized without spike-in and omits the failed Old Male replicate.

    (PDF)

    S3 Fig. ChIP enrichment across samples (average across replicates) with only uniquely mapped reads (5kb windows) using ChIP signal normalized by autosome coverage (3 replicates used for the Old Male category).

    (PDF)

    S4 Fig. Input coverage across sexes and ages.

    (Top) Coverage of the input DNA across samples normalized by the median autosomal coverage. (Bottom) Mean coverage of the input DNA across samples.

    (PDF)

    S5 Fig. Genome-wide ChIP enrichment with ± 1 standard error (denoted in yellow).

    1 tick mark = 1Mb on the x-axis.

    (PDF)

    S6 Fig. Differential ChIP enrichment between the sexes.

    Each data point represents enrichment of old/young male (left) or female (right) samples at a 5---kb window. Data highlighted in yellow denote windows with 50% higher/lower enrichment between samples (p < 0.05, two-tailed Student’s t-test).

    (PDF)

    S7 Fig. Regions with at least 1.5x H3K9me3 up-/down-regulation within sexes over aging in 5kb-window resolution.

    Data values represent the difference of means from old and young samples. One tick mark on the x-axis is equal to 2Mb.

    (PDF)

    S8 Fig. Enrichment of H3K9me3 (in 5kb windows) at the pericentromeres and 2Mb downstream for A.

    males and B. females. Boundaries are indicated by darker region on the chromosome diagrams of the major chromosome arms. Subtraction plots show higher H3K9me2 signal in young (blue) or old (red) flies. Data represents the mean of 4 biological replicates (3 for old males; in black) or the difference of means (subtraction plots, blue or red).

    (PDF)

    S9 Fig. Sex-specific H3K9me3 gain/loss by compartment (euchromatin/heterochromatin/neo-Y) over aging.

    (PDF)

    S10 Fig. Immunofluorescence staining for H3K9me3 in males (A-I, replicates from different slides) and females (J).

    For all figures, arrowhead denotes neo-Y, arrows denote MullerA-AD, DAPI/DNA is blue channel, H3K9me3 is orange channel, and scale bar is 50μm.

    (PDF)

    S11 Fig. Repeat composition per chromosome.

    Plotted for each Muller element is the total length of repeats (bp) divided by the entire chromosome length (bp).

    (PDF)

    S12 Fig. ChIP enrichment differences between young and old females and males.

    Values represent the difference of means between young and old samples, grouped by chromosome and binned by amount of repeats (%) per 5kb-window.

    (PDF)

    S13 Fig. Volcano plots for TE expression by sex and age.

    Left: (Top) TEs in young females v. young males, (bottom) TEs in old females vs old males. Fold change is indicative of male/female. Right: (Top) TEs in young males vs old males, (bottom) TEs in young females vs old females. Fold change is indicative of young/old.

    (PDF)

    S14 Fig. TE expression by sex/age and by chromosome group (Autosomes/X and Neo-Y).

    (PDF)

    S15 Fig. Male TE expression from the Autosomes/X and the Neo-Y in young (left) and old males (right).

    Diagonal line denotes 1:1 equal expression for TEs from both chromosome groups.

    (PDF)

    S16 Fig. Sex-differences in TE expression normalized by sex-differences in TE copy number.

    A-C. Shown are expression values for each biological sample accounting for TE copy number differences in the males and females when copies are calculated from A. adult gDNA, B. embryo gDNA, and C. genome repeat annotation. Expression values for each TE were normalized by their respective copy number in the male or female genome. Data plotted represents the averaged values of old and young female and male samples (3 replicates). Significance values calculated (n.s. not significant, * p<0.05, Wilcoxon test). D-F. Shown are Log2(Male/Female Expression) values for each TE along with the corresponding Log2(Male/Female DNA) values calculated from D. adult gDNA, E. embryo gDNA, and F. genome repeat annotation. Line indicates male/female expression is proportional to the male/female copy number of a specific TE.

    (PDF)

    S17 Fig. Volcano plots for satellite DNA expression by sex and age.

    Left: (Top) Satellites in young females v. young males, (bottom) satellites in old females vs old males. Fold change is indicative of male/female. Right: (Top) Satellites in young males vs old males, (bottom) satellites young females vs old females. Fold change is indicative of young/old.

    (PDF)

    S18 Fig. Heterochromatin and expression of pericentromeric and neo-Y genes.

    A. Log2 expression of genes located in the pericentromeres within sexes. Data represents mean values of 3 replicates with bars denoting standard error. Data highlighted in yellow denotes genes with significantly 50% higher or lower expression when comparing fold-change between samples (Wald test, p < 0.05). Grey line indicates 1:1 expression between samples. Boxplot values represent mean values of 3 replicates with significance values calculated using the Wilcoxon test (* p< 0.05, ** p<0.01, *** p<1e-5). B. Log2 expression of genes located on the neo-Y between young and old males. Data represented in a similar manner as in A and data highlighted shows genes that are up-regulated (top-left) and/or down-regulated (bottom-right) by 50% during aging (Wald test, p < 0.05). C. Age-related differences in log2 H3K9me3 enrichment and log2 expression of neo-Y genes. Each data represents the difference of mean values between young and old males. Color-code indicates differentially expressed/enriched comparisons: black: absolute log2(fold-change) < 1in both H3K9me3 and expression; yellow: absolute log2(fold-change)> = 1 in both H3K9me3 and expression. D. Log2 expression differences in genes on the neo-Y and their respective gene copy number. Values denote the difference of mean values between young and old males. Grey line denotes 0 expression difference between the ages.

    (PDF)

    S19 Fig

    Heterochromatin cutoffs (dashed orange line) by % repeats overlap with regions of elevated H3K9me3 enrichment on (A) Muller B, (B) Muller E, (C) Muller C, and (D) Muller AD.

    (PDF)

    S1 Table. Overview of ChIP data generated.

    Description of samples/tissues used for 4 rounds/batches of ChIP-seq experiments.

    (PDF)

    S2 Table. Mapping statistics of ChIP data.

    Description of mapping statistics for ChIP-experiment data for each batch. Reported numbers of reads mapped to the sample and spike-in genomes.

    (PDF)

    S3 Table. Pearson coefficients between replicates of same biological sample.

    We used H3K9me3 enrichment in 5kb windows from the autosomes and X chromosomes to conduct Pearson correlations between each pair of replicates. Pearson correlations were calculated using the base R function cor.test(replicateA, replicateB, method =“pearson”).

    (PDF)

    S4 Table. Proportion of 5kb-windows in top 20 percentile H3K9me3 enrichment shared between normalization methods.

    For each normalization method, we identified the top 20% most enriched 5kb windows in euchromatin (N = 4646). We then identified how many 5kb regions were shared between each normalization method and reported those results as a percentage (no. windows shared between two methods / no. windows in top 20% enrichment).

    (PDF)

    S5 Table. Mapping statistics of ChIP data for MAPQ > 3 alignments.

    (PDF)

    S6 Table. Heterochromatin-related genes and their expression change in old males and female.

    (PDF)

    S7 Table. Overview of RNA data generated.

    Description of samples/tissues used for 3 rounds/batches of RNA---seq experiments.

    (PDF)

    S8 Table. Mapping and counts statistics of RNA data.

    Description of mapping statistics for RNA-seq experiment data for each batch. Reported gene and repeat counts by Subread featurecounts and TEtranscripts where applicable.

    (PDF)

    S9 Table. Mapping and counts statistics of RNA data filtered for MAPQ > 40.

    (PDF)

    S10 Table. DNAPipeTE estimates of repeats in RNASeq data.

    (PDF)

    S1 Appendix. D. melanogaster as a spike-in to normalize D. miranda ChIP’s.

    (PDF)

    S2 Appendix. Cross-mapping between the D. miranda neo-X and neo-Y chromosome.

    (PDF)

    Attachment

    Submitted filename: 0_CoverLetter.docx

    Data Availability Statement

    All the sequencing data have been posted on GenBank under BioProject PRJNA644734.


    Articles from PLoS Genetics are provided here courtesy of PLOS

    RESOURCES