Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2020 Mar 27;12(5):736–749. doi: 10.1093/gbe/evaa064

piRNA Clusters Need a Minimum Size to Control Transposable Element Invasions

Robert Kofler 1,
Editor: Sarah Schaack
PMCID: PMC7259680  PMID: 32219390

Abstract

piRNA clusters are thought to repress transposable element (TE) activity in mammals and invertebrates. Here, we show that a simple population genetics model reveals a constraint on the size of piRNA clusters: The total size of the piRNA clusters of an organism must exceed 0.2% of a genome to repress TE invasions. Moreover, larger piRNA clusters accounting for up to 3% of the genome may be necessary when populations are small, transposition rates are high, and TE insertions are recessive. If piRNA clusters are too small, the load of deleterious TE insertions that accumulate during a TE invasion may drive populations extinct before an effective piRNA-based defense against the TE can be established. Our findings are solely based on three well-supported assumptions: 1) TEs multiply within genomes, 2) TEs are mostly deleterious, and 3) piRNA clusters act as transposon traps, where a single insertion in a cluster silences all TE copies in trans. Interestingly, the piRNA clusters of some species meet our observed minimum size requirements, whereas the clusters of other species do not. Species with small piRNA clusters, such as humans and mice, may experience severe fitness reductions during invasions of novel TEs, which is possibly even threatening the persistence of some populations. This work also raises the important question of how piRNA clusters evolve. We propose that the size of piRNA clusters may be at an equilibrium between evolutionary forces that act to expand and contract piRNA clusters.

Keywords: transposable elements, piRNA clusters, population genetics, forward simulations

Introduction

Transposable elements (TEs) are short stretches of DNA that selfishly propagate within genomes (Doolittle and Sapienza 1980; Orgel and Crick 1980). It is thought that the proliferation of TEs is mostly deleterious to hosts, where negative effects of TEs can arise by three mechanisms: 1) Ectopic recombination among TEs can lead to deleterious chromosomal rearrangements, 2) TE insertions may have direct negative effects, for example, by disrupting genes or regulatory regions, and 3) the products of TEs, such as the transposase, can generate deleterious effects (e.g., DNA breaks) (Nuzhdin 1999). Despite this largely selfish activity, some TE insertions may confer beneficial effects on hosts, such as resistance to insecticides (Daborn et al. 2002; González et al. 2008; Casacuberta and González 2013). However, the distribution of the fitness effects of TE insertions remains an important open question (Arkhipova 2018; Kofler 2019).

The unrestrained proliferation of TEs can drive host populations to extinction (Brookfield and Badge 1997; Kofler 2019); thus, the spread of TEs needs to be controlled. The proliferation of TEs may be controlled at the population level, by negative selection against TE insertions, and at the host level, for example, by small RNAs that repress TE activity (Charlesworth and Charlesworth 1983; Charlesworth and Langley 1989; Brennecke et al. 2007; Blumenstiel 2011; Kofler 2019). In mammals and invertebrates, the host defense system relies on piRNAs, small RNAs between 23 and 29 nt in size (Gunawardane et al. 2007; Brennecke et al. 2008). These piRNAs bind to PIWI-clade proteins that direct the repression of TEs at both transcriptional and posttranscriptional levels (Brennecke et al. 2007; Gunawardane et al. 2007; Sienski et al. 2012; Le Thomas et al. 2013). piRNAs are derived from discrete genomic loci, called piRNA clusters, which may make up substantial portions of genomes (Brennecke et al. 2007). In Drosophila, for example, piRNA clusters account for 3.5% of the genome (Brennecke et al. 2007).

It is believed that the proliferation of an invading TE is stopped when one copy of the TE jumps into a piRNA cluster, which triggers the production of piRNAs against the TE that ultimately silence all TE copies in trans (Bergman et al. 2006; Malone and Hannon 2009; Zanni et al. 2013; Goriaux et al. 2014; Yamanaka et al. 2014; Duc et al. 2019; Ozata et al. 2019). This view is known as the trap model, because piRNA clusters act as genomic traps for active TEs (Bergman et al. 2006). The trap model is currently widely supported by many different observations. For example, artificial sequences inserted into piRNA clusters yield piRNAs complementary to the inserted sequence, and a single insertion in a piRNA cluster is sufficient to silence a reporter construct in trans (Josse et al. 2007; Muerdter et al. 2012). Furthermore, a study directly observing TE invasions noted a rapid emergence of piRNA cluster insertions and piRNAs complementary to the invading TE (Kofler et al. 2018). Finally, computer simulations confirmed that piRNA clusters can stop TE invasions (Lu and Clark 2010; Kelleher et al. 2018; Kofler 2019). In particular, large clusters were able to control TE invasions under a wide range of different conditions (Kofler 2019). However, it is unclear if this observation holds for clusters of any size. Here, we tested the hypothesis that piRNA clusters have a minimum size using computer simulations of TE invasions under the trap model. We found that piRNA clusters must account for at least 0.2% of a genome to stop TE invasions over a wide range of parameters. Even larger piRNA clusters may be necessary when populations are small or TE insertions are recessive.

Results

We performed forward simulations of TE invasions under the trap model to test the hypothesis that piRNA clusters need to have a minimum size (Bergman et al. 2006). The trap model holds that the proliferation of an invading TE is stopped when a TE copy jumps into a piRNA cluster, which triggers the production of piRNAs that silence all TE copies in trans (fig. 1A). Accordingly, we assumed that TEs multiply at a transposition rate of u >0 in individuals without a cluster insertion and at a rate of u =0 in individuals with a cluster insertion. Excisions were ignored as they have a minor influence on the dynamics of TE invasions with piRNA clusters (Kofler 2019). We simulated diploid organisms, with five chromosomes of size 10 Mb and a recombination rate of 4 cM/Mb (fig. 1B). If not mentioned otherwise, the population size was N =1,000. A dual-strand piRNA cluster was simulated at one end of each chromosome (fig. 1B). The size of the piRNA clusters was identical for all individuals within a population. We measured the size of piRNA clusters as a percentage of the genome, as the relative size of piRNA clusters (as a percent), not the absolute size (in bp), determines the invasion dynamics of TEs (Kofler 2019; Kelleher et al. 2018). In our simulations, piRNA clusters with a size of 1% correspond to one cluster of 100 kb at one end of each chromosome (fig. 1B). Initially, we assumed that each TE reduces the fitness of the host (w) by a constant factor (x) such that w=1xn, where n is the number of TE insertions per diploid (Charlesworth and Charlesworth 1983). Furthermore, we assumed that TE insertions in piRNA clusters incur no negative fitness effects (i.e., x =0 for cluster insertions). To avoid the stochastic early stages of an invasion, where TEs may get lost due to genetic drift (Le Rouzic and Capy 2005; Kofler 2019), we launched each TE invasion by randomly distributing 1,000 insertions in individuals of the initial population. These insertions have an initial population frequency of 1/2N.

Fig. 1.

Fig. 1.

—Dynamics of TE invasions with piRNA clusters and deleterious TE insertions. (A) Under the trap model, the proliferation of an active TE (green) is stopped when one copy jumps into a piRNA cluster (i.e., the trap; hatched area), which deactivates all TE copies in trans (red). (B) We simulated five chromosomes with a size of 10 Mb for a diploid organism. A piRNA cluster was simulated at one end of each chromosome and a recombination rate of 4 cM/Mb was used. (C) The abundance of TE insertions during an invasion. Fifty replicates are shown. (D) Number of cluster insertions per diploid during an invasion (av. clu. ins). (E) Average fitness during an invasion.

Invasions of deleterious TEs under the trap model show a characteristic pattern (fig. 1CE; with u =0.1, x =0.01, and a cluster size of 3%; see also Kelleher et al. 2018). Initially, the TE rapidly multiplies within the genome, which markedly reduces the fitness of the host (fig. 1C and E). Next, individuals accumulate increasing numbers of cluster insertions, which slow down the spread of the TE (fig. 1C and D). Finally, negative selection removes deleterious TE insertions, which restores the fitness of the host to nearly initial levels (fig. 1C and E). In our example, the TE invasion temporarily reduced host fitness by ∼20–30% (fig. 1E). In these simulations, piRNA clusters accounted for 3% of the genome. The size of the piRNA clusters is a major factor determining the number of TEs that accumulate during an invasion, that is, more TE insertions will accumulate when piRNA clusters are small (Kofler 2019; Kelleher et al. 2018). The fitness reduction during a TE invasion will be more dramatic for smaller piRNA clusters (Kelleher et al. 2018). We surmised that populations may go extinct when piRNA clusters are very small.

To test these hypotheses, we simulated TE invasions with cluster sizes ranging from 0.001% to 10% (fig. 2A). Throughout this work, we refer to the lowest fitness of a population during an invasion as the minimum fitness (fig. 2A). Furthermore, we assumed that a population went extinct if its average fitness dropped below 0.1 (i.e., min. fitness < 0.1). With an average fitness of 0.1, ∼16–23% of the individuals in a population have a fitness of zero (assuming a linear fitness function; the effect of nonlinear fitness functions will be explored later). These individuals will not contribute any offspring to the next generation. We introduced the extinction threshold of 0.1 to avoid unrealistic simulation conditions where, for example, very few survivors with fitness just above zero could reconstitute the entire population of the next generation.

Fig. 2.

Fig. 2.

—piRNA clusters need a minimum size to control TE invasions. (A) Average fitness of populations during TE invasions with different sizes of piRNA clusters (top panel). Fifty replicates are shown. We refer to the lowest fitness during an invasion as the “minimum fitness” (arrow). Red dashed lines indicate the extinction threshold (w <0.1). With a cluster size of 0.001%, all populations went extinct. (B) Influence of the cluster size on the extinction of populations. Each dot represents the result of a single simulation with a randomly drawn cluster size. For nonextinct populations, the minimum fitness is shown. The size of piRNA clusters in Drosophila melanogaster is indicated as a dashed black line (3%). Random numbers are used for the y-axis. (C) Histogram showing the fraction of extinct populations for different cluster sizes. No extinct populations were found for clusters >0.1%.

As expected, the minimum fitness was close to 1.0 for large clusters (fig. 2A; 10% cluster size). However, the minimum fitness dropped substantially with decreasing cluster sizes (fig. 2A). For clusters with a size of 0.001%, the minimum fitness dropped below 0.1 for all simulated populations, that is, all populations went extinct. This demonstrates that transposon traps, such as piRNA clusters, need a minimum size to prevent the extinction of populations. To identify the required size of piRNA clusters, we performed 2,000 simulations with randomly chosen cluster sizes (fig. 2B). As expected, for clusters with a size of 0.001% the vast majority of the populations went extinct (fig. 2B and C). No extinct populations were observed for clusters >0.1% (fig. 2B and D). Hence, in the simulated scenario, the minimum size of piRNA clusters was ∼0.1%.

Thus far, we have assumed that cluster insertions have no deleterious fitness effects. However, it is possible that TE insertions in piRNA clusters reduce host fitness. For example, cluster insertions will increase genome size, and a large genome may be deleterious (Petrov 2001). We found that negative selection on cluster insertions only had a minor influence on the minimum fitness during an invasion, and consequently, on the extinction rate of populations (supplementary fig. S1, Supplementary Material online). The minimum size of piRNA clusters was slightly larger when all TE insertions, including cluster insertions, had negative fitness effects (x =0.01; supplementary fig. S1 and table S1, Supplementary Material online). For the remainder of the manuscript, we assumed that cluster insertions have no fitness cost to the host (i.e., x =0.0 for cluster insertions).

To identify regions of the parameter space where populations with small piRNA clusters are vulnerable to extinction, we performed simulations with randomly chosen transposition rates (u) and negative effects of TEs (x). We followed the invasions for 5,000 generations and recorded the results (fig. 3). In the absence of piRNA clusters, three principal outcomes can be observed (Kofler 2019). First, populations can go extinct when ux (fig. 3A). Second, populations may lose all TE insertions when negative selection against TEs is strong (x > u; fig. 3A). The minimum fitness of these populations is usually close to the maximum fitness (wmax = 1.0) (supplementary fig. S2, Supplementary Material online). Third, stable TE copy numbers may be attained when the number of TEs removed by negative selection equals the number of novel TE insertions gained by transposition (transposition selection balance) (Charlesworth and Charlesworth 1983; Charlesworth and Langley 1989). In the absence of piRNA clusters, stable TE copy numbers are solely observed in a narrow region of the parameter space (fig. 3A; assuming a linear fitness function; the window is larger with nonlinear fitness functions [Kofler 2019]). If large piRNA clusters (3%) are introduced into the model, the extinction of populations is prevented over the entire parameter space (fig. 3B; see also Kofler 2019). Furthermore, stable TE copy numbers are observed for a wide range of parameters (u > x; fig. 3B). However, when piRNA clusters are small (0.01%), extinct populations reemerge (fig. 3C and D). Extinct populations are mainly observed when transposition rates are high and negative effects of TEs are intermediate (fig. 3C). This raises the question: Why do intermediate deleterious effects of TEs lead to extinctions when piRNA clusters are small? Strongly deleterious TEs are usually quickly removed by negative selection. These TEs are thus unable to accumulate copy numbers high enough to drive populations to extinction. Weakly deleterious TEs may accumulate high copy numbers before cluster insertions stop an invasion. However, the cumulative deleterious effect of TEs with weak effects may be insufficient to drive populations to extinction. Only TEs with intermediate effects can accumulate high enough numbers and be sufficiently deleterious to drive a population to extinction.

Fig. 3.

Fig. 3.

—Small piRNA clusters offer insufficient protection from the extinction of host populations. Each dot represents the outcome of a single simulated TE invasion. The transposition rate (u) and negative effect of TEs (x) were randomly selected. (A) In the absence of piRNA clusters, extinct populations are observed when ux. (B) Extinct populations are not observed when piRNA clusters are large (3% of the genome). (C) For small piRNA clusters (0.01%), extinct populations are observed when transposition rates are high and negative effects are intermediate. (D) Extinct populations are common for very small piRNA clusters (0.001%). Depending on the efficacy of negative selection and transposition (N*u>1 and N*x>1 with N =1,000), the parameter space can be divided into four quadrants. Factors that are effective in a quadrant are shown in brackets.

Next, we aimed to identify the minimum size of piRNA clusters under different scenarios. We first investigated the influence of the transposition rate (u) and the negative effect of TEs (x) using >20,000 simulations with randomly chosen parameters (fig. 4). As TE insertions are usually rapidly purged from populations when x > u (fig. 3), we performed simulations mostly with transposition rates larger than the negative effects of TEs (fig. 4). We identified regions of the parameter space that do not require piRNA clusters for controlling TEs using simulations performed without piRNA clusters (fig. 4A and B; nonwhite dots in left panels). piRNA clusters had a minimum size whenever piRNA clusters were necessary to control TE invasions (fig. 4A and B). The largest piRNA clusters were necessary when the negative effects of TEs were intermediate (fig. 4A) and the transposition rates were high (fig. 4B). The largest piRNA clusters of an extinct population had a size of 0.16%. However, this extinction was observed for a simulation with a high transposition rate of u =0.96. So far, the largest observed transposition rates were slightly smaller, with u=0.20.6 (Robillard et al. 2016; Kofler et al. 2018).

Fig. 4.

Fig. 4.

—The influence of the negative effects of TEs (A) and transposition rates (B) on the minimum size of piRNA clusters. Each dot represents the outcome of a single simulation. The cluster size, the transposition rate, and the negative effect of TEs were randomly selected. To identify regions of the parameter space where piRNA clusters are not necessary to control TEs (nonwhite dots in the left panel), we show results for simulations performed without piRNA clusters. Dashed lines indicate the size of piRNA clusters in Drosophila melanogaster (3%). Large piRNA clusters with sizes of 0.1–1% are necessary to control TEs when transposition rates are high and negative effects of TEs are intermediate.

As the efficacy of negative selection, an important factor counteracting TEs, depends on population sizes, we investigated the influence of this factor (Gillespie 2010). We performed 2,000 simulations with randomly chosen cluster sizes for three different population sizes (20,000, 2,000, and 200; fig. 5A). The population size had a significant influence on the minimum size of piRNA clusters, where small populations required the largest piRNA clusters (fig. 5B; for effect sizes see supplementary table S1, Supplementary Material online). With the smallest evaluated population size (N =200), the largest cluster of an extinct population had a size of 0.13%.

Fig. 5.

Fig. 5.

—Influence of the population size (A, B) and the heterozygous effect (C, D) on the minimum size of piRNA clusters. (A, C) Each dot represents the outcome of a single simulation with randomly chosen cluster sizes, population sizes (20020,000), or heterozygous effects of TEs (dom dominant, add additive, rec recessive). Dashed lines indicate the size of piRNA clusters in Drosophila melanogaster (3%). Random numbers are used for the y-axis. (B, D) The 50 largest clusters of extinct populations. Significance was estimated with Wilcoxon rank-sum tests. *** P <0.001.

Thus far, we have assumed that TE insertions have additive fitness effects, such that homozygous TE insertions reduce host fitness twice as much as heterozygous insertions. However, it is feasible that TE insertions have recessive deleterious effects (Charlesworth and Langley 1989; Lee and Langley 2010). For example, TE insertions may disrupt genes for which a single intact copy is sufficient to generate the wild type (haplosufficient). On the other hand, it is feasible that TE insertions are dominant, for example, due to trans-epigenetic effects (Lee et al. 2020). To investigate the impact of heterozygous effects on the minimum cluster size, we used the fitness function w=12xihom2hxihet, where ihom and ihet are the number of homozygous and heterozygous insertion sites, respectively. For the additive case (h =0.5), this equation simplifies to our standard fitness formula, w=1xn, as the number of TE insertions per diploid genome is n=2ihom+ihet. This approach allows us to model dominant (h =1.0), additive (h =0.5), and recessive (h =0.0) effects of TE insertions. During some simulations with recessive effects, the number of accumulated TE insertions exceeded our computational resources. We terminated these simulations (>25,000 insertions per diploid; fig. 5C, black dots). However, as this problem solely occurred for very small piRNA clusters, an order of magnitude smaller than the clusters where extinctions were first observed, this limitation will not influence our conclusions (fig. 5C). The heterozygous effect had a significant influence on the minimum size of piRNA clusters, where the largest clusters were required for recessive insertions and the smallest for dominant insertions (fig. 5D; for effect sizes, see supplementary table S1, Supplementary Material online). With recessive insertions, the largest cluster of an extinct population had a size of 0.16%, and with dominant insertions, a size of 0.04%. Our findings highlight the influence of the efficacy of negative selection against TEs on the minimum size of piRNA clusters. When the efficacy of negative selection is high (large populations or dominant insertions), small clusters are sufficient to control TEs, whereas large clusters are necessary when the efficacy of negative selection is weak (small populations or recessive insertions).

Thus far, we have assumed that all TE insertions reduce host fitness by the same amount, irrespective of the genomic insertion site. However, it is likely that different TE insertions have very diverse fitness effects (Pasyukova et al. 2004). For example, insertions into coding sequences are probably more harmful than insertions into intergenic regions. We evaluated the effect of heterogeneous fitness effects of TE insertions using the fitness function: w=1i=1nxi. This equation simplifies to our standard fitness formula (w=1xn) when all TE insertions have identical effects. We mixed TE insertions with strong (x =0.1), moderate (x =0.01), weak (x =0.001), and very weak (x =0.0001) deleterious effects at different proportions (supplementary fig. S3A, Supplementary Material online). The heterogeneity of the effect sizes had a significant influence on the minimum size of piRNA clusters (Kruskal–Wallis rank-sum test with the 50 largest clusters of extinct populations; χ2=235,P<2.2e16; supplementary fig. S3B, Supplementary Material online). In all simulated scenarios, piRNA clusters had a minimum size, ranging from 0.005% to 0.05% (supplementary fig. S3B, Supplementary Material online).

Up to this point, we have assumed that each novel TE insertion reduces host fitness by the same amount, irrespective of the number of TE copies already present in the host genome. Hence, we assumed that the host fitness declines linearly with the TE copy number. Nevertheless, it is feasible that host fitness decreases more steeply with the TE copy number (Charlesworth and Charlesworth 1983). For example, ectopic recombination among distant TEs could lead to highly deleterious genomic rearrangements (e.g., inversions or translocations), and the number of ectopic recombination events may increase exponentially with TE abundance (Charlesworth and Charlesworth 1983; Charlesworth and Langley 1989; Montgomery et al. 1991). On the other hand, it is also conceivable that the decline of host fitness slows down with TE copy number. For example, the products of a few TE insertions may suffice to reduce host fitness to some extent, but the products of further TE copies may only have a minor impact. Following Charlesworth and Charlesworth (1983), we modeled interactions among TEs using the equation w=1xnt. This formula allows us to model the linear (t =1; yielding our standard fitness formula w=1xn), steep (t >1), and slow (t <1) decrease of host fitness with TE copy number (fig. 6A). We performed 10,000 simulations with randomly chosen cluster sizes and interaction terms ranging from 0.75 (slow decline) to 1.5 (steep decline; fig. 6). In the simulated scenario, piRNA clusters were not necessary to control TE invasions when t >1.25 (nonwhite dots; fig. 6B). This is in agreement with previous work showing that a steep fitness decline (t >1) extends the parameter space over which TEs may be controlled in the absence of piRNA clusters (Charlesworth and Charlesworth 1983; Kofler 2019). Nevertheless, large piRNA clusters can by themselves protect populations from extinction over the entire parameter space (Kofler 2019). However, at less steep fitness declines (t <1.25), extinct populations were observed for small piRNA clusters. Interestingly, the minimum size of piRNA clusters only varied moderately among the different epistatic interactions of TEs (0.017% with t0.75; 0.045% with t1.0; 0.015% with t1.2; ANOVA with the twenty largest clusters of extinct populations P<3.2e14). Our finding that piRNA clusters have a minimum size is thus robust and applies to different nonlinear declines of host fitness with TE copy number.

Fig. 6.

Fig. 6.

—Influence of epistatic interactions (t) among TEs on the minimum size of piRNA clusters. (A) The fitness of the host may decrease linearly (t =1), steeply (t >1), or slowly (t <1) with the TE copy number. The red dashed line indicates the extinction threshold (w <0.1). (B) Parameter space with epistatic TE interactions. Each dot represents the outcome of a single simulation with a random cluster size and an epistatic interaction term (t). Results of simulations performed without piRNA clusters are shown to identify regions of the parameter space where piRNA clusters are not necessary to control TEs (nonwhite dots in the left panel). The dashed line indicates the size of piRNA clusters in Drosophila melanogaster (3%).

Under the previously simulated scenarios, piRNA clusters with a size of 0.2% were sufficient to protect populations from extinction. However, the joint effect of some important parameters may necessitate even larger clusters. We thus investigated the minimum size of piRNA clusters during a worst-case scenario, that is, 1) small population size (20–100), 2) recessive TE insertions, 3) high transposition rates (u =0.2), and 4) intermediate negative effects of TEs (x =0.01; fig. 7). In these scenarios, piRNA clusters need a size of 1–3% to protect populations from extinction (fig. 7; for effect sizes see supplementary table S1, Supplementary Material online).

Fig. 7.

Fig. 7.

—The minimum size of piRNA clusters under the worst-case scenario, that is, recessive TE insertions of intermediate effects, high transposition rates, and three different small population sizes (right panel). (A) Influence of cluster size on the extinction of populations. Each dot represents the result of a single simulation with a randomly drawn cluster size. The minimum fitness is shown for nonextinct populations. The size of piRNA clusters in Drosophila melanogaster is shown as a dashed black line (3%). Random numbers are used for the y-axis. (B) The 50 largest piRNA clusters of extinct populations.

In summary, we conclude that piRNA clusters need a size of at least 0.2% to protect populations from extinction over a wide range of parameters. In a worst-case scenario involving small populations and recessive insertions, clusters accounting for up to 3% of the genome may be necessary to control TEs.

Discussion

Here, we used a simple population genetics model to reveal a constraint on the size of transposon traps, such as piRNA clusters. To control TE invasions over a wide range of parameters, piRNA clusters need to have a minimum size of 0.2% of the genome. When clusters are smaller, populations may go extinct from the load of deleterious TE insertions that accumulate during a TE invasion.

Our finding that piRNA clusters have a minimum size relies on three well-supported assumptions. First, TEs amplify within genomes at a certain rate. Second, as TE insertions are mostly deleterious, host fitness decreases with TE copy number such that hosts will eventually die from the cumulative effects of the TE insertions. Third, piRNA clusters act as transposon traps such that a TE insertion in a piRNA cluster represses all other copies of the TE in trans.

There is little doubt about our assumption that TEs multiply within host genomes (Wicker et al. 2007; Burt and Trivers 2008). However, we also rely on the widely used assumption that the transposition rate is constant during an invasion (in the absence of piRNAs) (e.g., Charlesworth and Charlesworth 1983; Charlesworth 1991; Lu and Clark 2010; Kelleher et al. 2018; Kofler 2019). It is not entirely clear whether this assumption holds, as very few studies estimated transposition rates during TE invasions. Kofler et al. (2018) traced P-element invasions in experimental Drosophila simulans populations for 40–60 generations. Before the TE was silenced by piRNAs, the P-element had a rather uniform transposition rate of about u =0.15 in hot conditions and u =0.05 in cold conditions (Kofler et al. 2018). Hence, a constant transposition rate is currently a reasonable assumption.

Our assumption that TE insertions are mostly deleterious is also well supported (Yukuhiro et al. 1985; Mackay 1989; Mackay et al. 1991; Houle and Nuzhdin 2004; Pasyukova et al. 2004; Blumenstiel et al. 2014). Moreover, it seems highly unlikely that a complex host defense mechanism, such as the piRNA pathway, could have evolved if TEs were not deleterious. Nevertheless, several major questions remain. Most importantly, the distribution of the fitness effects of TE insertions and the rate at which fitness decays with TE copy number remain unclear (Arkhipova 2018; Kofler 2019). However, our finding that piRNA clusters need a minimum size to control TE invasions is robust and applies under different assumptions about the deleterious effects of TEs. Our conclusion would not hold under a model where host fitness never approaches zero, even when an organism accumulates vast amounts of TEs, but such a model seems unlikely. Indeed, there is ample support for our assumption that TEs may cause the death (or sterility) of hosts. The well-described hybrid dysgenesis systems are good examples (Kidwell et al. 1977). As the piRNA-mediated defense against TEs is maternally transmitted, TEs are reactivated in offspring resulting from crosses of males with a TE and females without the same TE (Brennecke et al. 2008). This TE reactivation may lead to diverse defects. The I-element causes developmental defects that prevent the hatching of larvae (Bucheton et al. 1976; Wang et al. 2018). Hobo and P-element activity lead to dysgenic and mostly sterile ovaries (Kidwell et al. 1977; Kidwell 1983; Blackman et al. 1987; Hill et al. 2016). Sterile gonads may also result from reactivation of multiple TE families, as observed in a hybrid dysgenesis syndrome in Drosophila virilis (Evgen'ev et al. 1997; Erwin et al. 2015). Furthermore, individuals with defects in components of the piRNA pathway, such as Aub or Piwi, are frequently infertile (Lin and Spradling 1997; Ma et al. 2009; Czech et al. 2013; Théron et al. 2018). This infertility may, at least partially, be due to a high TE activity in individuals without functional TE defense (Ma et al. 2009).

Finally, our assumption that piRNA clusters act as traps that control TE activity is well supported. First, it was observed that a single P-element insertion in a piRNA cluster is sufficient to repress a reporter construct in trans (Josse et al. 2007). Second, the insertion of an artificial sequence into a piRNA cluster triggered the production of piRNAs complementary to the sequence (Muerdter et al. 2012). Third, the transposons ZAM and Idefix are inactive in strains with an insertion of these two TEs in the somatic piRNA cluster flamenco, but active in strains without these cluster insertions (Zanni et al. 2013). Fourth, a de novo insertion of ZAM into a germline piRNA cluster restores the repression of ZAM (Duc et al. 2019). Fifth, piRNA clusters are frequently composed of TE sequences (Brennecke et al. 2007), supporting the view that clusters carry the trapped remnants of past TE invasions (but see pachytene clusters in mice [Girard et al. 2006; Ernst et al. 2017]). Finally, computer simulations confirm that genomic traps, such as piRNA clusters, may stop TE invasions (Lu and Clark 2010; Kelleher et al. 2018; Kofler 2019).

The minimum size of piRNA clusters identified in this work (0.2%) rests on the further assumption that TE insertion sites are random. Some transposons, however, have pronounced insertion preferences (Sultana et al. 2017) that may lead to over or underrepresentation of TE insertions within piRNA clusters. It has also been surmised that piRNA clusters can somehow attract TE insertions (Brennecke et al. 2007). Our estimates of the minimum size of piRNA clusters will be too small if TEs avoid inserting into piRNA clusters and too large if TEs preferentially jump into piRNA clusters (or if piRNA clusters attract TEs). As insertion biases vary among TE families (Linheiro and Bergman 2012; Jakšić et al. 2017), robust protection from extinction may require piRNA clusters larger than the estimated 0.2%.

Furthermore, we simulated dual-strand clusters where TE insertions of any orientation generate piRNAs that silence TEs (Malone et al. 2009). For uni-strand clusters, which silence somatic TEs, cluster insertions need to be antisense (Malone et al. 2009). As this basically halves the probability of obtaining a piRNA-producing cluster insertion, we expect that uni-strand clusters need to be larger than dual-strand clusters.

Our estimated minimum size of piRNA clusters is based solely on a population genetics model. It is likely that biochemical processes also constrain the size of piRNA clusters. For example, de Vanssay et al. (2012) showed that loci with seven tandem copies of a transgene may be converted into a piRNA cluster by maternally inherited small RNAs, whereas loci with fewer tandem copies could not be converted.

Our finding that piRNA clusters need a minimum size of 0.2% (3% in the worst case) to control TE invasions raises the important question of whether clusters of different species actually meet this requirement. The size of germline clusters in Drosophila melanogaster is ∼3% of the genome (Brennecke et al. 2007). These clusters thus provide comprehensive protection from TEs, even under our worst-case scenario. Nevertheless, a TE with a strong insertion bias against piRNA clusters could overcome this defense and drive populations to extinction. In D. melanogaster, a distinct set of transposons is active in the somatic tissue surrounding the germline (Song et al. 1997; Blumenstiel 2011; Barckmann et al. 2018). These TEs are controlled by a single somatic cluster, flamenco (Li et al. 2009; Malone et al. 2009), which has a size of ∼0.15% (assuming a flamenco size of 300 kb and a genome size of 200 Mb; Brennecke personal communication; Bosco et al. 2007). As another example, in Koala, piRNA clusters account for 0.17% of the genome (Yu et al. 2019). Somatic clusters in D. melanogaster and piRNA clusters in Koala provide sufficient protection under the majority of the simulated scenarios. However, under a worst-case scenario, that is, small populations and recessive TE insertions, these clusters may be too small to control TEs. In mice, humans, and rats, pachytene piRNA clusters account for ∼0.1% of the genome (Girard et al. 2006). These clusters likely offer insufficient protection from TE invasions under multiple simulated scenarios. Nevertheless, pachytene piRNA clusters may be able to stop TE invasions if the TEs have an insertion bias into these clusters (Ernst et al. 2017). Alternatively, some organisms may have multiple layers of defense against TEs. For example, Kruppel-associated box zinc-finger proteins (KRAB-ZFPs) may silence TEs in some mammals (Yang et al. 2017). KRAB-ZFPs bind to TE sequences and induce heterochromatin formation (Yang et al. 2017). However, some TEs continue to be active even when targeted by KRAB-ZFPs, and KRAB-ZFPs only target some TE families (Yang et al. 2017). Furthermore, it not clear if KRAB-ZFPs are mostly responsible for the long-term maintenance of TE repression, or if they also silence newly invading TEs. However, it is difficult to see how the sequence specificity of KRAB-ZFPs, which is determined by tandem arrays of C2H2 zinc fingers (Yang et al. 2017), could rapidly adapt to the sequence of a newly invading TE. Thus, in contrast to the piRNA pathway, which provides a generalized defense mechanism capable of trapping and silencing any mobile element, KRAB-ZFPs likely constitute a more specialized defense layer, requiring adaptation of the DNA binding sites to the sequence of each novel TE. It would be interesting to estimate the size of piRNA clusters in more species. Blatant violations of our minimum size requirements may indicate that the piRNA pathway is not the primary defense against TEs in some organisms or that the simple version of the trap model, which presumes that a TE invasion is controlled by random TE insertions into piRNA clusters, is incomplete. For example, it may be necessary to incorporate paramutations into the trap model (de Vanssay et al. 2012). Under this scenario, maternally transmitted piRNAs may convert TE insertions into piRNA-producing loci (Le Thomas et al. 2014). Silencing of a novel TE invasion with paramutations could thus proceed like a chain reaction wherein increasing amounts of TE insertions in different individuals are converted into piRNA-producing loci. The source of the first piRNAs that trigger this chain reaction may again be a random insertion into piRNA clusters. In contrast to the simple trap model, where several cluster insertions are required in each individual (Kofler 2019), cluster insertions are only expected in very few individuals under a trap model with paramutations (because euchromatic TE insertions may also yield piRNAs [see also Mohn et al. 2014; Shpiz et al. 2014]). The minimum size of piRNA clusters would thus be much reduced. Currently, many questions about paramutations remain to be answered: 1) Are paramutations important for silencing novel TE invasions, or mostly for maintaining the silencing of a TE?; 2) which fraction of the TE insertions could potentially be converted into piRNA-producing loci?; 3) are paramutated loci stably inherited over generations?; and 4) what is the role of the environment in paramutations (e.g., Casier et al. 2019)?

Finally, it is also feasible that in some organisms, piRNAs against a newly invading TE emerge independently of cluster insertions. For example, it was proposed that the emergence of piRNAs against the KoRV virus in koala is triggered by unspliced TE transcripts (Yu et al. 2019). The mechanism responsible for recognizing unspliced TE transcripts is, however, still unclear. Furthermore, it is not clear if this process silences DNA transposons and how this mechanism distinguishes between transcripts derived from TEs and genes.

Our work also raises the important question of which evolutionary forces shape the evolution of piRNA clusters. In our simulations, populations with small piRNA clusters went extinct whereas populations with large clusters survived. However, it is not necessary to invoke such group selection arguments (Maynard Smith 1964, 1976) to explain the evolution of piRNA clusters in natural populations. The size of piRNA clusters could be polymorphic in natural populations, such that individuals with small and large clusters can be found within a single population. If individuals with large clusters end up with fewer deleterious TE insertions than individuals with small clusters, large clusters may be positively selected. This hypothesis is supported by recent simulation studies, which found that TE insertions within piRNA clusters may be positively selected (Lu and Clark 2010; Kelleher et al. 2018; Kofler 2019). Large clusters will on average end up with more TE insertions than small clusters. Due to the perfect linkage between piRNA clusters and its TE insertions, large clusters will be positively selected. The size of piRNA clusters may thus grow to a level where extinctions of populations are no longer expected.

On the other hand, evolutionary forces may exist that restrict the size of piRNA clusters. If the TE defense machinery comes at a cost to the host (Koonin et al. 2020), large piRNA clusters may be costlier to maintain than small clusters, especially when novel TE invasions are rare. The fitness cost of piRNA clusters may stem from ectopic recombination among cluster insertions or the cellular resources expended for generating vast amounts of piRNAs derived from large portions of genomes. This raises the intriguing possibility that the size of piRNA clusters is at an equilibrium between evolutionary forces that act to expand and contract them. If this hypothesis is true, natural populations should exhibit variations in the size of piRNA clusters. Such variations could exist. A recent study found that the subtelomeric piRNA cluster X-TAS is present in most wild strains of D. melanogaster but is frequently lost in lab strains (Asif-Laidin et al. 2017). Future studies may shed light on the variation of piRNA clusters in natural populations.

Materials and Methods

Simulation Software

All simulations were performed with the Java tool Invade (Kofler 2019). Invade performs individual-based forward simulations of TE activity in diploid organisms. For this work, we released a version (0.8.07) that implements the following new features: 1) computes the minimum fitness during an invasion, 2) offers support for heterozygous effects of TE insertions, and 3) offers support for mixed deleterious fitness effects of TE insertions. In each generation, Invade performs the following steps in the given order: 1) Mate pairs are formed based on the fitness of the individuals, 2) haploid gametes are generated based on the recombination map, 3) novel TE insertions are introduced into the gametes of parents without an insertion in a piRNA cluster, 4) zygotes are formed, 5) the fitness and the number of cluster insertions in the new individuals are computed, and 6) the output is generated (optional).

Simulated Scenarios

We simulated a genome with five chromosomes of 10 Mb (–genome Mb:10,10,10,10,10) and a recombination rate of 4 cM/Mb (–rr cM_Mb:4,4,4,4,4). Invasions were launched by randomly introducing 1,000 TE insertions in individuals of the initial population (–basepop seg:1000). Simulations under our worst-case scenario were launched by randomly distributing 100 insertions in the starting population. This was necessary because 1,000 insertions with an x =0.01 in a small population of 20 reduce the fitness of individuals in the starting population by 50% (with 100 insertions, fitness is only reduced by 5%). The transposition rate (e.g., –u 0.1), the negative effect of TE insertions (e.g., –x 0.01), the population size (e.g., –N 1000), and the cluster size (e.g., –cluster kb:each:100) varied among simulated scenarios. The key parameters used for each scenario are shown at the top of the figures. All TE invasions were simulated for 5,000 generations (–gen 5000), except for the scenario in which we varied the transposition rate from 0.005 to 1.0, in which 10,000 generations were used. To cover the parameter space in one or two dimensions, we used Python scripts that launched between 2,000 and 12,500 simulations with randomly chosen parameter combinations.

Different rates of fitness decay with increasing TE copy numbers were simulated with an epistatic interaction term (e.g., –t 1.0; with the fitness function w=1xnt). To simulate dominant, recessive, and additive TE effects, a heterozygous effect was provided (recessive –nsmodel het:0.0; additive –nsmodel het:0.5; dominant –nsmodel het:1.0). Mixed deleterious effects of TE insertions were simulated by providing the effect sizes of the insertion sites (–nsmodel site:0.1,0.001). Equal amounts of sites were simulated for each effect size.

All statistical analyses were performed using R (R Core Team 2019), and visualizations were done with the ggplot2 library (Wickham 2016).

The sizes of piRNA clusters of humans, mice, and rats were computed using the data of Girard et al. (2006) (supplementary tables S2–S4, Supplementary Material online). We summed up the length of each cluster and divided this sum by the approximate genome size of 3,000 Mb (cluster sizes: mouse 2.70 Mb, human 3.16 Mb, rat 3.29 Mb).

Supplementary Material

evaa064_Supplementary_Data

Acknowledgments

We thank Florian Schwarz for comments, Nayuta Yamashita for critical reading of the manuscript, and all members of the Institute of Population Genetics for feedback and support. This work was supported by an Austrian Science Foundation (FWF) (Grant No. P30036-B25) to R.K.

Data deposition: Invade is available at https://sourceforge.net/projects/invade/. All Python and R scripts used for the analysis are available at SourceForge https://sourceforge.net/projects/te-tools/ (folder “minsize”).

Literature Cited

  1. Arkhipova IR. 2018. Neutral theory, transposable elements and eukaryotic genome evolution. Mol Biol Evol. 35(6):1332–1337. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Asif-Laidin A, et al. 2017. Short and long-term evolutionary dynamics of subtelomeric piRNA clusters in Drosophila. DNA Res. 24(5):459–472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barckmann B, et al. 2018. The somatic piRNA pathway controls germline transposition over generations. Nucleic Acids Res. 46(18):9524–9536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bergman CM, Quesneville H, Anxolabéhère D, Ashburner M.. 2006. Recurrent insertion and duplication generate networks of transposable element sequences in the Drosophila melanogaster genome. Genome Biol. 7(11):R112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blackman RK, Grimaila R, Macy M, Koehler D, Gelbart WM.. 1987. Mobilization of hobo elements residing within the decapentaplegic gene complex: suggestion of a new hybrid dysgenesis system in Drosophila melanogaster. Cell 49(4):497–505. [DOI] [PubMed] [Google Scholar]
  6. Blumenstiel JP. 2011. Evolutionary dynamics of transposable elements in a small RNA world. Trends Genet. 27(1):23–31. [DOI] [PubMed] [Google Scholar]
  7. Blumenstiel JP, Chen X, He M, Bergman CM.. 2014. An age-of-allele test of neutrality for transposable element insertions. Genetics 196(2):523–538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bosco G, Campbell P, Leiva-Neto JT, Markow TA.. 2007. Analysis of Drosophila species genome size and satellite DNA content reveals significant differences among strains as well as between species . Genetics 177(3):1277–1290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Brennecke J, et al. 2007. Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila. Cell 128(6):1089–1103. [DOI] [PubMed] [Google Scholar]
  10. Brennecke J, et al. 2008. An epigenetic role for maternally inherited piRNAs in transposon silencing. Science 322(5906):1387–1392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Brookfield JF, Badge RM.. 1997. Population genetics models of transposable elements. Genetica 100(1/3):281–294. [PubMed] [Google Scholar]
  12. Bucheton A, Lavige J, Picard G, L’heritier P.. 1976. Non-mendelian female sterility in Drosophila melanogaster: quantitative variations in the efficiency of inducer and reactive strains. Heredity 36(3):305–314. [DOI] [PubMed] [Google Scholar]
  13. Burt A, Trivers R.. 2008. Genes in conflict: the biology of selfish genetic elements. Cambridge: Belknap Press. [Google Scholar]
  14. Casacuberta E, González J.. 2013. The impact of transposable elements in environmental adaptation. Mol Ecol. 22(6):1503–1517. [DOI] [PubMed] [Google Scholar]
  15. Casier K, et al. 2019. Environmentally-induced epigenetic conversion of a piRNA cluster. eLife 8:e39842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Charlesworth B. 1991. Transposable elements in natural populations with a mixture of selected and neutral insertion sites. Genet Res. 57(2):127–134. [DOI] [PubMed] [Google Scholar]
  17. Charlesworth B, Charlesworth D.. 1983. The population dynamics of transposable elements. Genet Res. 42(1):1–27. [Google Scholar]
  18. Charlesworth B, Langley CH.. 1989. The population genetics of Drosophila transposable elements. Annu Rev Genet. 23(1):251–287. [DOI] [PubMed] [Google Scholar]
  19. Czech B, Preall JB, McGinn J, Hannon GJ.. 2013. A transcriptome-wide RNAi screen in the Drosophila ovary reveals factors of the germline piRNA pathway. Mol Cell. 50(5):749–761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Daborn PJ, et al. 2002. A single P450 allele associated with insecticide resistance in Drosophila. Science 297(5590):2253–2256. [DOI] [PubMed] [Google Scholar]
  21. de Vanssay A, et al. 2012. Paramutation in Drosophila linked to emergence of a piRNA-producing locus. Nature 490(7418):112–115. [DOI] [PubMed] [Google Scholar]
  22. Doolittle WF, Sapienza C.. 1980. Selfish genes, the phenotype paradigm and genome evolution. Nature 284(5757):601–603. [DOI] [PubMed] [Google Scholar]
  23. Duc C, et al. 2019. Trapping a somatic endogenous retrovirus into a germline piRNA cluster immunizes the germline against further invasion. Genome Biol. 20(1):1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Ernst C, Odom DT, Kutter C.. 2017. The emergence of piRNAs against transposon invasion to preserve mammalian genome integrity. Nat Commun. 10(8):1411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Erwin AA, et al. 2015. piRNAs are associated with diverse transgenerational effects on gene and transposon expression in a hybrid dysgenic syndrome of D. virilis. PLOS Genet. 11(8):e1005332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Evgen'ev MB, et al. 1997. Penelope, a new family of transposable elements and its possible role in hybrid dysgenesis in Drosophila virilis. Proc Natl Acad Sci USA. 94(1):196–201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gillespie JH. 2010. Population genetics: a concise guide. Baltimore (MD): JHU Press. [Google Scholar]
  28. Girard A, Sachidanandam R, Hannon GJ, Carmell MA.. 2006. A germline-specific class of small RNAs binds mammalian Piwi proteins. Nature 442(7099):199–202. [DOI] [PubMed] [Google Scholar]
  29. González J, Lenkov K, Lipatov M, Macpherson JM, Petrov DA.. 2008. High rate of recent transposable element-induced adaptation in Drosophila melanogaster. PLOS Biol. 6(10):e251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Goriaux C, Théron E, Brasset E, Vaury C.. 2014. History of the discovery of a master locus producing piRNAs: the flamenco/COM locus in Drosophila melanogaster. Front Genet. 5:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gunawardane LS, et al. 2007. A slicer-mediated mechanism for repeat-associated siRNA 5′ end formation in Drosophila. Science 315(5818):1587–1590. [DOI] [PubMed] [Google Scholar]
  32. Hill T, Schlötterer C, Betancourt AJ.. 2016. Hybrid dysgenesis in Drosophila simulans associated with a rapid invasion of the P-element. PLOS Genet. 12(3):e1005920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Houle D, Nuzhdin SV.. 2004. Mutation accumulation and the effect of copia insertions in Drosophila melanogaster. Genet Res. 83(1):7–18. [DOI] [PubMed] [Google Scholar]
  34. Jakšić AM, Kofler R, Schlötterer C.. 2017. Regulation of transposable elements: interplay between TE-encoded regulatory sequences and host-specific trans-acting factors in Drosophila melanogaster. Mol Ecol. 26(19):5149–5159. [DOI] [PubMed] [Google Scholar]
  35. Josse T, et al. 2007. Telomeric trans-silencing: an epigenetic repression combining RNA silencing and heterochromatin formation. PLOS Genet. 3(9):e158–1643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kelleher ES, Azevedo RBR, Zheng Y.. 2018. The evolution of small-RNA-mediated silencing of an invading transposable element. Genome Biol Evol. 10(11):3038–3057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kidwell MG. 1983. Evolution of hybrid dysgenesis determinants in Drosophila melanogaster. Proc Natl Acad Sci USA. 80(6):1655–1659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kidwell MG, Kidwell JF, Sved JA.. 1977. Hybrid dysgenesis in Drosophila melanogaster: a syndrome of aberrant traits including mutations, sterility and male recombination. Genetics 86(4):813–833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kofler R. 2019. Dynamics of transposable element invasions with piRNA clusters. Mol Biol Evol. 36(7):1457–1472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kofler R, Senti K-A, Nolte V, Tobler R, Schlötterer C.. 2018. Molecular dissection of a natural transposable element invasion. Genome Res. 28(6):824–835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Koonin EV, Makarova KS, Wolf YI, Krupovic M.. 2020. Evolutionary entanglement of mobile genetic elements and host defence systems: guns for hire. Nat Rev Genet. 21(2):119–131. [DOI] [PubMed] [Google Scholar]
  42. Le Rouzic A, Capy P.. 2005. The first steps of transposable elements invasion: parasitic strategy vs. genetic drift. Genetics 169(2):1033–1043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Le Thomas A, et al. 2013. Piwi induces piRNA-guided transcriptional silencing and establishment of a repressive chromatin state. Genes Dev. 27(4):390–399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Le Thomas A, et al. 2014. Transgenerationally inherited piRNAs trigger piRNA biogenesis by changing the chromatin of piRNA clusters and inducing precursor processing. Genes Dev. 28(15):1667–1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lee YCG, Langley CH.. 2010. Transposable elements in natural populations of Drosophila melanogaster. Philos Trans R Soc B. 365(1544):1219–1228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Lee YCG, et al. 2020. Pericentromeric heterochromatin is hierarchically organized and spatially contacts H3K9me2 islands in euchromatin. PLoS Genet. 16(3):e1008673. [DOI] [PMC free article] [PubMed]
  47. Li C, et al. 2009. Collapse of germline piRNAs in the absence of argonaute3 reveals somatic piRNAs in flies. Cell 137(3):509–521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lin H, Spradling A.. 1997. A novel group of pumilio mutations affects the asymmetric division of germline stem cells in the Drosophila ovary. Development 124(12):2463–2476. [DOI] [PubMed] [Google Scholar]
  49. Linheiro RS, Bergman CM.. 2012. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster. PLoS One 7(2):e30008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lu J, Clark AG.. 2010. Population dynamics of PIWI-RNAs (piRNAs) and their targets in Drosophila. Genome Res. 20(2):212–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ma L, et al. 2009. GASZ is essential for male meiosis and suppression of retrotransposon expression in the male germline. PLOS Genet. 5(9):e1000635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Mackay TF. 1989. Transposable elements and fitness in Drosophila melanogaster. Genome 31(1):284–295. [DOI] [PubMed] [Google Scholar]
  53. Mackay TF, Lyman RF, Jackson MS.. 1991. Effects of P element insertions on quantitative traits in Drosophila melanogaster. Genetics 130(2):315–332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Malone CD, Hannon GJ.. 2009. Small RNAs as guardians of the genome. Cell 136(4):656–668. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Malone CD, et al. 2009. Specialized piRNA pathways act in germline and somatic tissues of the Drosophila ovary. Cell 137(3):522–535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Maynard Smith J. 1964. Group selection and kin selection. Nature 201(4924):1145–1147. [Google Scholar]
  57. Maynard Smith J. 1976. Group selection. Q Rev Biol. 51(2):277–283. [Google Scholar]
  58. Mohn F, Sienski G, Handler D, Brennecke J.. 2014. The Rhino–Deadlock–Cutoff complex licenses noncanonical transcription of dual-strand piRNA clusters in Drosophila. Cell 157(6):1364–1379. [DOI] [PubMed] [Google Scholar]
  59. Montgomery EA, Huang SM, Langley CH, Judd BH.. 1991. Chromosome rearrangement by ectopic recombination in Drosophila melanogaster: genome structure and evolution. Genetics 129(4):1085–1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Muerdter F, et al. 2012. Production of artificial piRNAs in flies and mice. RNA 18(1):42–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Nuzhdin SV. 1999. Sure facts, speculations, and open questions about the evolution of transposable element copy number. Genetica 107(1/3):129–137. [PubMed] [Google Scholar]
  62. Orgel LE, Crick FH.. 1980. Selfish DNA: the ultimate parasite. Nature 284(5757):604–607. [DOI] [PubMed] [Google Scholar]
  63. Ozata DM, Gainetdinov I, Zoch A, O’Carroll D, Zamore PD.. 2019. PIWI-interacting RNAs: small RNAs with big functions. Nat Rev Genet. 20(2):89–108. [DOI] [PubMed] [Google Scholar]
  64. Pasyukova EG, Nuzhdin SV, Morozova T, Mackay TF.. 2004. Accumulation of transposable elements in the genome of Drosophila melanogaster is associated with a decrease in fitness. J Hered. 95(4):284–290. [DOI] [PubMed] [Google Scholar]
  65. Petrov DA. 2001. Evolution of genome size: new approaches to an old problem. Trends Genet. 17(1):23–28. [DOI] [PubMed] [Google Scholar]
  66. R Core Team 2019. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  67. Robillard É, Le Rouzic A, Zhang Z, Capy P, Hua-Van A.. 2016. Experimental evolution reveals hyperparasitic interactions among transposable elements. Proc Natl Acad Sci USA. 113(51):14763–14768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Shpiz S, Ryazansky S, Olovnikov I, Abramov Y, Kalmykova A.. 2014. Euchromatic transposon insertions trigger production of novel pi- and endo-siRNAs at the target sites in the Drosophila germline. PLOS Genet. 10(2):e1004138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Sienski G, Dönertas D, Brennecke J.. 2012. Transcriptional silencing of transposons by Piwi and Maelstrom and its impact on chromatin state and gene expression. Cell 151(5):964–980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Song SU, Kurkulos M, Boeke JD, Corces VG.. 1997. Infection of the germ line by retroviral particles produced in the follicle cells: a possible mechanism for the mobilization of the gypsy retroelement of Drosophila. Development 124(14):2789–2798. [DOI] [PubMed] [Google Scholar]
  71. Sultana T, Zamborlini A, Cristofari G, Lesage P.. 2017. Integration site selection by retroviruses and transposable elements in eukaryotes. Nat Rev Genet. 18(5):292–308. [DOI] [PubMed] [Google Scholar]
  72. Théron E, et al. 2018. The interplay between the argonaute proteins Piwi and Aub within Drosophila germarium is critical for oogenesis, piRNA biogenesis and TE silencing. Nucleic Acids Res. 46(19):10052–10065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Wang L, Dou K, Moon S, Tan FJ, Zhang ZZ.. 2018. Hijacking oogenesis enables massive propagation article hijacking oogenesis enables massive propagation of LINE and retroviral transposons. Cell 174(5):1082–1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Wicker T, et al. 2007. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 8(12):973–982. [DOI] [PubMed] [Google Scholar]
  75. Wickham H. 2016. ggplot2: elegant graphics for data analysis. Basel (Switzerland): Springer Nature. [Google Scholar]
  76. Yamanaka S, Siomi MC, Siomi H.. 2014. piRNA clusters and open chromatin structure. Mob DNA. 5(1):22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Yang P, Wang Y, Macfarlan TS.. 2017. The role of KRAB-ZFPs in transposable element repression and mammalian evolution. Trends Genet. 33(11):871–881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Yu T, et al. 2019. The piRNA response to retroviral invasion of the koala genome. Cell 179(3):632–643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Yukuhiro BYK, Harada K, Mukai T.. 1985. Viability mutations induced by the P elements in Drosophila melanogaster. Jpn J Genet. 60(6):531–537. [Google Scholar]
  80. Zanni V, et al. 2013. Distribution, evolution, and diversity of retrotransposons at the flamenco locus reflect the regulatory properties of piRNA clusters. Proc Natl Acad Sci USA. 110(49):19842–19847. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

evaa064_Supplementary_Data

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES