Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2026 Jan 30.
Published before final editing as: EMBO J. 2025 Nov 13:10.1038/s44318-025-00631-w. doi: 10.1038/s44318-025-00631-w

A mammalian-like piRNA pathway in Axolotl reveals the origins of piRNA-directed DNA methylation

Xinyu Xiang 1,2,3, Anni Gao 3, Dominik Handler 4, Francisco Falcon 5,6, Diego Rodriguez-Terrones 5, Sergej Nowoshilow 5, Wanlu Liu 3,7,8, Elly M Tanaka 5, Dónal O’Carroll 1,2,
PMCID: PMC7618676  EMSID: EMS211638  PMID: 41233617

Abstract

The piRNA pathway protects animal germlines from active transposons. Mammals employ a cytoplasmic pathway to destroy transposon transcripts during germline reprogramming. This post-transcriptional mechanism is ancient and found throughout the animal kingdom. A nuclear piRNA pathway mediates transposon DNA re-methylation, which is believed to be bespoke to mammals. However, when exactly piRNA-directed DNA methylation evolved remains unknown. We found that a mammalian-like piRNA pathway evolved early in tetrapod evolution and is found and expressed in its current configuration in the axolotl salamander. Analysis of axolotl testes and oocytes revealed diverse repertoires of piRNAs and pervasive post-transcriptional targeting of young transposons. We identified high levels of genome methylation in axolotl spermatozoa, with full-length transposons being heavily methylated. Our findings reveal that the mammalian nuclear piRNA pathway has ancient vertebrate origins, and it has likely been safeguarding the germline throughout most of tetrapod evolution. Thus, the emergence of piRNA-directed DNA methylation is a pivotal epigenetic evolutionary event that may have laid the foundation for germline reprogramming and genomic imprinting.

Keywords: piRNA, Germline, Transposon, DNA Methylation, Germline Reprogramming, Genomic Imprinting

Subject Categories: Evolution & Ecology, RNA Biology

Introduction

The germline is the cell lineage that gives rise to the gametes and ensures the continuity of life. Germline specification during embryonic development occurs by two distinct mechanisms: epigenesis and preformation. Epigenesis is the ancestral mode of germ cell acquisition, in which primordial germ cells (PGCs) are extrinsically induced from pluripotent cells by signals from surrounding tissues (Extavour and Akam, 2003). In contrast, preformation, which has independently evolved multiple times during evolution, involves the autonomous specification of PGCs by maternally derived molecules known as germ plasm (Mahowald, 1977). The fundamental distinction between these two mechanisms is that preformation allows direct germline continuity through germ plasm and occurs early in embryogenesis, while epigenesis relies on inductive signals to specify PGCs later during embryogenesis (Lawson et al, 1999). The convergent evolution of preformation suggests potential selective advantages. It has been hypothesized to confer increased evolvability (Johnson and Alberio, 2015), the capacity to generate heritable, selective, and phenotypic variation (Kirschner and Gerhart, 1998). Among tetrapods, germline specification predominantly follows epigenesis, with the exception of frogs and birds, which utilize preformation (Extavour and Akam, 2003).

Active transposons pose an existential threat to the integrity and continuity of the germline. The mammalian germline is particularly vulnerable during the period of germline reprogramming when transposon-repressing DNA methylation is erased (Seisenberger et al, 2012; Greenberg and Bourc’his, 2019) and transposon expression is unleashed (Bourc’his and Bestor, 2004; Aravin et al, 2007; Carmell et al, 2007). When expressed, active transposons can integrate into new genomic locations, potentially causing gene mutations (Bourque et al, 2018). In addition, transposition is also a major source of DNA damage (Gasior et al, 2006; Farkash and Prak, 2006; Belgnaoui et al, 2006). Given that germ cells are exquisitely sensitive to DNA damage, transposon-inflicted DNA double-strand breaks often result in germ cell death (Goodier and Kazazian, 2008). Indeed, failure to repress transposon activity in the germline broadly results in defective gametogenesis and infertility (Carmell et al, 2007; Bourc’his and Bestor, 2004). That said, transposition does occur in the germline at a very low frequency. For instance, in humans, approximately one in every 200 live births carries a novel LINE1 transposition event (Xing et al, 2009). These insertions are often non-deleterious and can generate selectable genetic variation, providing a potential source for genomic innovation. However, genomes expand when transposons excessively proliferate. An extreme example of genome expansion in vertebrates is the lungfish, a lobe-finned fish with a genome size ranging from 30 to 90 Gb and 60–90% of transposon-derived content (Wang et al, 2021; Schartl et al, 2024). Similarly, the axolotl salamander (Ambystoma mexicanum) possesses a huge 32 Gb genome with ~70% transposon content (Nowoshilow et al, 2018). Interestingly, such genomic gigantism and high transposon burden in vertebrates appear to occur in these water-to-land transition species (Falcon et al, 2023).

While a significant portion of vertebrate genomes originates from transposons, only a tiny fraction of transposon loci retain transposition activity, with the vast majority existing as dead mutated copies (Bourque et al, 2018). For example, only ~100 copies of LINE1 remain active in humans (Brouha et al, 2003), although 53% of the genome is derived from transposons. Time and transposon silencing ultimately kill active transposon copies and eventually transposon families. Over time, transposons accumulate inactivating mutations, and as such they can only survive if they manage to transpose in the germline. So, the fight for transposon survival and the opposing genomic battle to tame transposons occurs in the germline. The PIWI-interacting RNA (piRNA) pathway protects metazoan genomes and germlines from active transposons (Ozata et al, 2019; Wang et al, 2023). Not only does it acutely protect the germline, but it is also the principal transposon-taming force.

At the core of the piRNA pathway are the piRNAs and the PIWI proteins (Ozata et al, 2019; Wang et al, 2023). piRNAs are short non-coding RNAs that are bound to PIWI proteins. Through base complementarity, piRNAs act as guides to recruit the PIWI proteins to cellular RNAs. The piRNA pathway consists of two major components: a cytoplasmic and a nuclear pathway. The cytoplasmic piRNA pathway is responsible for piRNA biogenesis and post-transcriptional transposon silencing (Ozata et al, 2019; Wang et al, 2023). In vertebrates, the cytoplasmic PIWI proteins, PIWIL1 and PIWIL2, are piRNA-guided endonucleases that cleave transposon transcripts to suppress transposition (Houwing et al, 2008; De Fazio et al, 2011; Reuter et al, 2011; Houwing et al, 2007). On the other hand, the nuclear piRNA pathway mediates transcriptional silencing of transposons through DNA methylation and chromatin modifications (Ozata et al, 2019; Wang et al, 2023). The cytoplasmic piRNA pathway is common to most animals (Grimson et al, 2008). In vertebrates, the exoribonuclease PNLDC1 (PARN Like Ribonuclease Domain Containing Exonuclease 1) is responsible for trimming the 3′ of piRNA precursors to their mature length during piRNA biogenesis (Zhang et al, 2017; Ding et al, 2017; Nagirnaja et al, 2021). However, invertebrates apply different enzymes to trim piRNAs, such as Trimmer in silkworms (Izumi et al, 2016), PARN-1 in C. elegans (Tang et al, 2016), and Nibbler in flies (Hayashi et al, 2016; Han et al, 2011; Liu et al, 2011). Although these enzymes share the common function of processing piRNA 3′ termini, their evolutionary divergence reflects lineage-specific adaptations in piRNA biogenesis between vertebrates and invertebrates. The nuclear piRNA pathway has also arisen independently during evolution (Ozata et al, 2019; Wang et al, 2023) and is not present in all animals. Worms, flies, and mammals utilize piRNA-guided nuclear PIWI proteins to repress transposon transcription; however, the downstream factors and molecular mechanisms are distinct (Ozata et al, 2019; Wang et al, 2023). In C. elegans and Drosophila, transposon silencing is achieved through histone methylation and heterochromatinization (Rangan et al, 2011; Wang and Elgin, 2011; Klenov et al, 2011; Sienski et al, 2012; Shirayama et al, 2012; Ashe et al, 2012; Luteijn et al, 2012). In contrast, the mammalian nuclear piRNA pathway primarily silences young active transposons through DNA methylation during the process of de novo genome methylation that follows germline reprogramming (Aravin et al, 2007; Carmell et al, 2007; Kuramochi-Miyagawa et al, 2008). piRNA-directed LINE1 methylation is installed by a two-step authentication process (Dias Mirandela et al, 2024). Firstly, chromatin modification recruits the piRNA factor SPOCD1 through the chromatin reader SPIN1 to transposon promoters (Dias Mirandela et al, 2024). Secondly, through base complementarity piRNAs tether PIWIL4 (MIWI2) to nascent transposon transcripts (De Fazio et al, 2011; Schöpp et al, 2020). The outcome of these events is the recruitment of the de novo DNA methylation machinery (DNMT3L and DNMT3B/C) and piRNA-guided transposon methylation (Bourc’his and Bestor, 2004; Barau et al, 2016; Jain et al, 2017; Zoch et al, 2020, 2024; Stallmeyer et al, 2024). This silencing mechanism also involves other factors such as C19ORF84 (Zoch et al, 2020, 2024) and TEX15 (Yang et al, 2020; Schöpp et al, 2020). In summary, the factors PIWIL4, SPOCD1, SPIN1, DNMT3B/C, C19ORF84, and TEX15 that mediate piRNA-directed DNA methylation define the mammalian nuclear piRNA pathway. Currently, the piRNA-mediated transposon methylation mechanism has only been reported in mammals, with its true origin remaining unknown.

Results

Axolotl has a mammalian-like piRNA pathway

To investigate the origin of the modern mammalian piRNA pathway, we searched for orthologues of mammalian piRNA factors throughout vertebrate evolution (Fig. 1A,B). An intact cytoplasmic piRNA pathway is present in all tetrapods and lobe-finned fish irrespective of germline specification (Fig. 1B; Dataset EV1). Ray-finned fish (Actinopterygii) genomes contain the Piwil1 and Piwil2 genes that encode the key cytoplasmic PIWI proteins responsible for piRNA-guided post-transcriptional transposon silencing and piRNA amplification (De Fazio et al, 2011; Reuter et al, 2011; Di Giacomo et al, 2013) (Fig. 1B). However, not all piRNA biogenesis factors are detected in ray-finned species. For example, both Pnldc1 and Mael are absent in Actinopterygii genomes (Fig. 1B). The elephant shark representative of the cartilaginous fish contained all genes encoding the cytoplasmic piRNA pathway apart from Pnldc1. Pnldc1 is first observed in Coelacanth as well as two lungfish species (Fig. 1B), and as such the modern form of the mammalian cytoplasmic piRNA pathway first evolved in the common ancestor of lobe-finned fishes.

Figure 1. Modern mammalian piRNA pathway might originate from common tetrapod ancestors with the axolotl.

Figure 1

(A) Schematic of piRNA pathway in mammals. Major piRNA biogenesis factors and nucleus factors are listed in box with dashed lines. (B) The piRNA pathway and germline specification mechanism for vertebrates. Left, phylogenetic tree of representative vertebrate species. Middle, genome size, transposon percentage, and the types of germline specification mechanism. Germline specification mechanisms for each species are from Extavour and Akam, 2003 and Hansen and Pelegri, 2021 (see “Methods”). Right, the existence of piRNA factors in vertebrate genomes. For lungfishes, South American lungfish (SA), African lungfish (AF), Australian lungfish (AU). Source data are available online for this figure.

We next analyzed the evolution of the nuclear branch of the mammalian piRNA pathway. Piwil4, which encodes the key nuclear PIWI protein, appears to have originated in the common ancestor of jawed vertebrates and is retained in cartilaginous fish such as the elephant shark, but has been lost in ray-finned fish (Gutierrez et al, 2021). Notably, Piwil4 is present in lobe-finned fish and further retained across all tetrapods, except for birds (Fig. 1B). We next analyzed the recently identified PIWIL4-associated nuclear piRNA factors SPOCD1 (Zoch et al, 2020), C19ORF84 (Zoch et al, 2024), TEX15 (Schöpp et al, 2020; Yang et al, 2020) and SPIN1 (Dias Mirandela et al, 2024) (Fig. 1A). Spocd1 is present in lobe-finned fish, including coelacanth and one lungfish genome (Fig. 1B). Spocd1 is retained in all tetrapod genomes apart from birds. C19orf84 is first observed in axolotl and also found in turtle and all mammals except for platypus (Fig. 1B). The Spin1 gene is present in all jawed vertebrates. TEX15 is found in jawless and cartilaginous fish but lost in some ray- and lobe-finned fish (Fig. 1B), indicating an early origin in vertebrate evolution (Schöpp et al, 2023). Again, TEX15 is found in all tetrapods apart from birds. Next, we analyzed the de novo DNA methylation machinery. As previously reported (Barau et al, 2016; Jain et al, 2017), the Dnmt3c methyltransferase gene that arose from a gene duplication of Dnmt3b is muroid-specific (Fig. 1B). The de novo methyltransferase Dnmt3 gene is ancient and arose in eukaryotic ancestors (Ponger and Li, 2005). Dnmt3a and Dnmt3b arose in the whole-genome duplication event in early vertebrate ancestors (Liu et al, 2020). We found the Dnmt3a gene in nearly all species analyzed except for frog (Fig. 1B), while Dnmt3b is not found in jawless or cartilaginous fish but is present in most other vertebrates except for frogs (Fig. 1B). The Dnmt3l gene encoding the adapter protein is not found in jawless, cartilaginous, ray- or lobe-finned fish. Dnmt3l is present throughout tetrapods but has been lost in birds, frogs, and platypus (Fig. 1B). In summary, the components of the nuclear piRNA pathway arose at different times in evolution, but they are found all together for the first time in axolotl, and strikingly they have been lost in birds.

Axolotl piRNA pathways genes are predominantly expressed in the male and female axolotl germlines

Given that a mammalian-like piRNA pathway is first seen in axolotl, we next sought to explore the axolotl pathway and to search for evidence of its functionality. In mammals, most piRNA pathway factors’ expression is restricted to the germline. The cytoplasmic piRNA pathway is expressed in both male and female human (Girard et al, 2006; Gainetdinov et al, 2017; Williams et al, 2015; Roovers et al, 2015; Yang et al, 2019), mouse (Kuramochi-Miyagawa et al, 2004; Deng and Lin, 2002; Flemr et al, 2013) and golden hamster (Loubalova et al, 2021; Ishino et al, 2021; Hasuwa et al, 2021; Zhang et al, 2021) germlines. In contrast, the nuclear piRNA pathway is restricted to the mammalian male germline (Carmell et al, 2007; Dias Mirandela et al, 2024; Aravin et al, 2008; Kuramochi-Miyagawa et al, 2008). We examined the expression of piRNA factors in axolotl tissues from previously published datasets (Bryant et al, 2017; Ye et al, 2022). Strikingly, the expression of the vast majority of piRNA factors is restricted to gonads (Fig. 2). Tdrd7 was an exception and broadly expressed in the soma (Fig. 2). Spin1 was also expressed in the soma as is the case in mammals (The Human Protein Atlas, 2025; Mouse Genome Informatics Web Site et al, 2025) (Fig. 2). Both the cytoplasmic and nuclear piRNA pathways were expressed in both the male and female axolotl germlines. In summary, the expression of the axolotl piRNA pathway genes is mostly restricted to the germline.

Figure 2. The axolotl has a mammalian-like piRNA pathway.

Figure 2

(A) Heatmap for expression of piRNA pathway factors in axolotl somatic and gonadal tissues. Source data are available online for this figure.

The landscape of active axolotl transposons

It is the active transposons that retain the ability to transpose which pose an acute threat to the germline and are the targets of the piRNA pathway. We therefore sought to identify which transposon families in the axolotl genome retain active copies. We applied the standard transposon annotation pipeline by RepeatMasker (Smit et al, 2013), with the unknown transposons identified by RepeatMasker further annotated by DeepTE (Yan et al, 2020) (see “Axolotl transposon annotation”). Approximately 70% of the axolotl genome is derived from transposon remnants (Nowoshilow et al, 2018). Among these, Long Terminal Repeats (LTRs) were the most abundant class of transposons (Nowoshilow et al, 2018) (Fig. 3A,B), comprising 24.5% of the genome (Fig. 3A,B). To roughly estimate which transposon copies could retain activity, we used the following criteria. Firstly, we selected copies with a length distribution close to the full-length consensus sequence (Dataset EV2). Copies with truncations have lost key elements or factors required for autonomous transposition. Secondly, we selected for young copies based on divergence from the consensus sequence. The accumulation of mutations over time often renders transposons inactive (Ozata et al, 2019). Transposon loci that were both full-length and young were defined as active copies. Using these criteria, for families where we could identify active copies, the frequency of active transposons within that family ranged between 0.003-0.6% (Figs. 3C and EV1A). This frequency translates into two active copies for LINE-I family to >13,000 for LTR-Gypsy family (Dataset EV2). Furthermore, for transposon families that have already proliferated in the axolotl genome, they had numerous active copies. To be specific, six families had more than 1000 likely active copies, encompassing LTR-Gypsy, LTR-ERV1, DNA-hAT, DNA-PIF-Harbinger, DNA-MULE, and DIRS (Fig. 3D; Dataset EV2). These six families constituted the majority of active transposons, accounting for 93.2% of all active copies (Fig. EV1B). Notably, the LTR class, which occupied the largest portion of the genome, had the highest proportion of likely active copies (0.21%, a total of 21,945 likely active copies), suggesting that LTRs might still be expanding in the axolotl genome (Fig. EV1C). In summary, we estimate that the vast majority of the full-length, close to consensus matching and thus likely active axolotl transposons are principally derived from six families and constitute less than 1% of total copies within those respective families.

Figure 3. The landscape of potentially active axolotl transposons.

Figure 3

(A, B) Genomic composition of transposon classes in axolotl. Both the whole genome (A) and each chromosome (B) are displayed. (C) Distribution of substitution rate and sequence length for every single copy in the representative family. Defined potentially active copies are highlighted in red. (D) Lollipop plots display the copy number for each transposon family. Total copy number (left) and potentially active copy number (right) are shown. The red dashed line indicates transposon families with copy numbers greater than 1000. Bar plot indicates potentially active copy number percentage within transposon family. Source data are available online for this figure.

The axolotl genome encodes a diverse repertoire of transposon-targeting piRNAs

Given the piRNA pathway is expressed in both the male and female axolotl germlines, using small RNA sequencing (sRNA-seq), we characterized axolotl piRNAs from two testis and oocyte samples (Fig. 4A; Dataset EV3). To focus on piRNAs, we examined small RNAs with a length greater than 23 nucleotides to exclude 19–22 nucleotide microRNAs (miRNAs) and short interfering RNAs (siRNAs) (Aravin et al, 2006; Girard et al, 2006; Vagin et al, 2006; Lau et al, 2006; Grivna et al, 2006; Saito et al, 2006). Our data captured the widespread existence of piRNAs, with a median length of 29 nt (Fig. 4A; Dataset EV3), consistent with recent findings (Schartl et al, 2024). piRNAs could be identified from both male and female axolotl germlines with the same length distribution, consistent with the piRNA pathway expression in both sexes (Fig. 2). A total number of 12–38 million distinct piRNAs could be identified among each of the samples (Fig. EV2A). Around 88 million unique piRNAs were identified across all samples in total (Fig. EV2A). Notably, piRNA sequences from different samples varied greatly, suggesting that the sequencing was not saturated (Fig. EV2A). We then mapped piRNAs to the genome and found that around 60–70% piRNAs were transposon-derived (Fig. 4B). The annotation of piRNAs between male and female germline differed slightly (Fig. 4B). When mapping piRNAs to transposon families, all the active transposon families identified generate great quantities of piRNAs (Fig. 4C, Fig. EV2B). For example, the active transposon families LTR-ERV1, DNA-PIF-Harbinger and LINE-RTE were among the top five piRNA-producing families (Figs. 4C and EV2B). The piRNA signal over the consensus sequence of representative transposon families indicated that in both male and female germlines, piRNA mapped to sense and antisense strands across the entire consensus sequence (Figs. 4D and EV2C). The pattern of piRNA formation also differed between the sexes (Figs. 4D and EV2C).

Figure 4. The axolotl genome encodes a diverse repertoire of transposon-targeting piRNAs.

Figure 4

(A) Nucleotide (nt) length distribution from small RNA libraries. Mean and S.E.M. are presented. (B) Classification of piRNAs mapped to the axolotl genome. (C) Mean piRNA signal level over four testis and oocyte samples for each transposon family. ****, active copy number >1000; ***, active copy number >100; **, active copy number >10; *, active copy number >0. (D) Tracks for piRNAs targeting both sense and antisense strands of representative transposon consensus sequences. (E) Number of annotated piRNA clusters in the genome. (F) piRNA clusters ranked by the cumulative fraction of piRNA RPMs (orange). The RPKM of piRNAs from each cluster is displayed (gray). The red dashed line indicates the number of top-ranked clusters that collectively account for 90% of cluster-derived piRNAs (90th percentile). (G) Length distribution of annotated piRNA clusters in the genome. (H) The number of piRNA clusters that overlap between the two genders. For all panels, testis, n = 2; oocyte, n = 2. Source data are available online for this figure.

We next identified piRNA clusters in the axolotl genome (Fig. 4E). Approximately 2500 piRNA clusters were identified in the male and female germlines (Fig. 4E). Ranking the piRNA clusters by their cumulative fraction of piRNA reads revealed axolotl piRNAs were predominantly derived from a small number of highly productive clusters (Fig. 4F), similar to observations in other vertebrate and invertebrate species (Konstantinidou et al, 2024). The top 500–1000 clusters accounted for 90% of all cluster-derived piRNAs (Fig. 4F). The majority of axolotl piRNA clusters were unistrand (Fig. 4E), consistent with the unidirectional transcription mechanism typical of mammalian piRNA clusters like mouse (Girard et al, 2006; Aravin et al, 2006; Grivna et al, 2006; Li et al, 2013). Moreover, piRNA clusters in axolotl had a median length of 10 kb (Fig. 4G), a size distribution similar to mammals (Lau et al, 2006; Aravin et al, 2006; Girard et al, 2006; Grivna et al, 2006; Li et al, 2013). Collectively, the annotated piRNA clusters covered approximately 0.13% of the axolotl genome, with an average of 47% of piRNA sequences assigned to these clusters (Dataset EV4). Interestingly, while there is overlap between clusters expressed in the respective germlines, most clusters show sex-specific expression (Figs. 4H and 5A,B). Collectively, these data show abundant and diverse piRNA expression in both male and female axolotl germlines.

Figure 5. The axolotl piRNA clusters exhibit sex-specific expression patterns.

Figure 5

(A) Genome-wise distribution of piRNA signal and piRNA clusters (inner circle), with zoom-in view for chr3p and chr13p (outer circle). (B) Tracks on both sense and antisense strands for piRNA clusters are highlighted in (A). Genic regions and transposon regions are shown. For all panels, testis, n = 2; oocyte, n = 2. Source data are available online for this figure.

The cytoplasmic axolotl piRNA pathway post-transcriptionally silences transposons

The abundance of piRNAs derived from active transposon families indicates a role for these piRNAs in post-transcriptional silencing through piRNA-mediated transposon transcript cleavage. The ping-pong cycle (Brennecke et al, 2007; Gunawardane et al, 2007; Aravin et al, 2007; De Fazio et al, 2011) or piRNA amplification (Fig. 6A) initiates when a complementary piRNA guides PIWI-mediated cleavage of the transposon transcript. Endonucleolytic PIWI-mediated cleavage is at the nucleotide opposed to the tenth nucleotide from the 5′ end of the piRNA. Primary piRNAs have a 5′U (1U) bias, therefore when a primary piRNA directs cleavage of a transcript followed by processing of the 3′ fragment into a secondary piRNA, this secondary piRNA will contain a bias for A at position 10 (10A) (Brennecke et al, 2007). Furthermore, there is also an overlap of 10 nucleotides between the primary and secondary piRNA pair (Brennecke et al, 2007). A second round of piRNA biogenesis mediated by secondary piRNAs results in the amplification of the initiating primary piRNAs. Thus, this amplification cycle not only destroys the transposon transcript to neutralize the acute threat of transposition but also serves to reinforce post-transcriptional silencing through the amplification of transposon-targeting piRNAs. The 10-nucleotide complementarity between amplification pairs as well as the 1U/10A bias can be used to determine the presence of piRNA amplification (Fig. 6A). A robust 10 nucleotide overlap between complementary piRNAs was observed across all classes of transposons in both the male and female germlines (Figs. 6B,D and EV3A). Furthermore, the bias for 10A and 1U was detected in both male and female transposon-related piRNA populations (Figs. 6C,E and EV3B). In summary, robust piRNA amplification is detected and the pathway post-transcriptionally silences transposons in both the adult male and female axolotl germlines.

Figure 6. piRNA pathway post-transcriptionally silences transposons in both male and female germlines.

Figure 6

(A) Schematic of the ping-pong cycle and transposon silencing. Two features of piRNAs generated from the ping-pong cycle for transposon transcript cleavage are displayed. One is the 10 nt overlap between 5′ of complementary piRNA pairs, the other is the 1U/10A preference. (B, D) Relative frequency of the nucleotide distance between 5′ of complementary piRNA pairs. 5′ nucleotide overlap frequency over all families from major transposon classes (B) and representative transposon families (D) are shown. Mean and S.E.M. are presented in (D). (C, E) Nucleotide composition of the first and tenth position in piRNAs. Nucleotide proportion over all families from major transposon classes (C) and representative transposon families (E) are shown. A adenine, U uracil, C cytosine, G guanine. For all panels, testis, n = 2; oocyte, n = 2. Source data are available online for this figure.

Evidence for an active nuclear piRNA pathway in axolotl

In mammals, piRNA-directed DNA methylation epigenetically silences young transposons (Aravin et al, 2007; Carmell et al, 2007; Kuramochi-Miyagawa et al, 2008; Schöpp et al, 2020; Zoch et al, 2020, 2024; Dias Mirandela et al, 2024). Low-coverage maps of axolotl DNA methylation from somatic tissues exist (Klughammer et al, 2023), but they lack the resolution to confidently define features of genomic DNA methylation. To understand the characteristics of genome and transposon methylation in the male axolotl germline, we isolated spermatozoa and performed Enzymatic Methyl-seq (EM-seq) using biological triplicates. Each replicate had a depth/coverage of 1.5×/50% with a combined depth/coverage of 4.5×/77% (Dataset EV5), permitting a confident analysis of genome methylation patterns. In general, axolotl exhibited a relatively high CpG methylation level in sperm (~90%) (Fig. 7A), compared to human and mouse (75% and 85%, respectively) (Molaro et al, 2011; Hammoud et al, 2014). Methylation over axolotl gene bodies averaged around 80%, with a drop to 60% in transcription start site (TSS) regions (Fig. 7A,B). In comparison, the whole region of axolotl transposons was heavily methylated (Figs. 7C,D and EV4A). Active transposons showed higher methylation when compared to all copies, except for Rolling Circle (RC) transposons (Fig. 7C,D). Notably, the promoter regions of axolotl transposons were also heavily methylated, such as the 5′ flanking LTR repeats for LTR elements as well as the 5′ UTR for LINE elements (Fig. 7D).

Figure 7. The genome of axolotl spermatozoa is highly methylated.

Figure 7

(A) Percentages of CpG methylation levels over the whole genome or certain genomic elements. (B) Metaplots of mean CpG methylation over gene bodies and adjacent 2 kb. (C) Percentages of CpG methylation levels over all genomic transposons or certain transposon classes. Methylation levels over all copies or only active copies are shown. (D) Metaplots of mean CpG methylation for selected transposon families over all transposon/active transposon bodies and adjacent 2 kb. (E) Distribution of CpG methylation levels across the whole genome or CGI regions. (F) Genomic annotation for CGIs or shuffled regions. (G) CpG methylation levels over all transposon/TSS regions or regions overlapped with CGIs. (H) Distribution of CpG methylation level and substitution rate for all transposon copies or copies overlapped with CGIs. (I) Percentages of transposons overlapped with CGIs. For boxplots, the middle line represents the median; boxes represent the 25th (bottom) and 75th (top) percentiles; whiskers represent median ± 1.5× interquartile range; and outside values are not shown. Rhombus, mean level. For statistical tests, ***P value <0.001, n.s. not significant; (E, F) Chi-square test, (C, G) unpaired two-sided t test. The exact P values for (C, E–G) are provided in Source Data. For all panels, spermatozoa, n = 3; pooled for analysis. Source data are available online for this figure.

To better understand the methylation landscape in axolotl, we also annotated CpG islands (CGIs) in axolotl. With standard filtering criteria by Gardiner and Frommer (Gardiner-Garden and Frommer, 1987), we identified 248,988 CGIs in axolotl, which mostly ranged from 20 to 1000 bp in length (Fig. EV4B). In mammals, CGIs are associated with low methylation levels (Illingworth and Bird, 2009; Deaton and Bird, 2011). In stark contrast, axolotl CGIs are associated with high methylation levels (Fig. 7A). Approximately 95% of CGIs are highly methylated in axolotl sperm, significantly higher than the genome level (Fig. 7E). Annotation revealed that the majority (93%) of CGIs in axolotl significantly overlapped with transposons rather than TSS/genic regions (Fig. 7F); which is the case for mammalian CGIs (Deaton and Bird, 2011; Smallwood et al, 2011; Sharif et al, 2010). Consistently, there was only a slight CGI enrichment within 100 bp TSS adjacent regions (Fig. EV4C). Notably, the methylation level of CGI-overlapped transposons was significantly higher than that of all transposons, but not significant at gene TSSs (Fig. 7G). This suggests that CGIs in axolotl play a role in transposon silencing. To be specific, all major transposon classes overlapping with CGIs exhibited higher methylation levels and less divergence from the consensus sequence (Figs. 7H and EV4D), indicating a role for CGIs in young transposon silencing. For DNA transposons and DIRS, ~60–80% of likely active copies were covered by CGIs (Figs. 7I and EV4E), implying potential regulation of DNA/DIRS activity. Taken together, we describe a confident map of axolotl genome methylation. The axolotl spermatozoan genome is highly methylated, and CGIs in axolotl are associated predominantly with transposons. In summary, promoter DNA methylation of active transposons is a feature of axolotl spermatozoan genome methylation.

Discussion

Our findings reveal that the mammalian version of the piRNA pathway comprising both cytoplasmic and nuclear components arose early in tetrapod evolution. PNLDC1, a pre-piRNA 3′ trimming exonuclease, mediates precursor piRNA trimming in mammals (Zhang et al, 2017; Ding et al, 2017; Nagirnaja et al, 2021). The presence of this enzyme distinguishes mammalian piRNA biogenesis from more ancient animal pathways. For instance, in Drosophila, a different enzyme Nibbler is employed (Hayashi et al, 2016; Han et al, 2011; Liu et al, 2011). We first observe the presence of Pnldc1 gene in Coelacanth and as such propose that mammalian-like piRNA biogenesis arose early in lobe-finned fish evolution. In contrast, the evolution of a mammalian-like nuclear piRNA pathway occurred later than the more conserved cytoplasmic one. PIWIL4, SPOCD1, and TEX15 are key components of the mammalian nuclear piRNA pathway which arose at different stages of fish evolution. However, we find that all key piRNA pathway components and the de novo methylation machinery first co-occur in axolotl. The presence of PIWI proteins and mammalian piRNA pathway factors in the axolotl genome, with gonad specific expression, underscores the conserved nature of this pathway from amphibians to mammals (Figs. 1B and 2A). Furthermore, both male and female axolotl germlines exhibit abundant transposon-derived piRNAs, with sex-specific profiles (Fig. 4B,C). These piRNAs effectively target active transposons, as evidenced by robust ping-pong amplification cycles (Fig. 6B–E). This indicates an operational cytoplasmic piRNA pathway and post-transcriptional transposon silencing in both axolotl sexes. Remarkably, our study also suggests the existence of a piRNA-directed DNA methylation mechanism targeting young transposons in axolotl, a mechanism previously reported exclusively in mammals. Homologs of all key PIWIL4- and SPOCD1-associated nuclear factors identified in mammals are present in the axolotl genome and expressed in gonads (Fig. 2A). In addition, we observed that active transposons in the axolotl male germ cells are heavily DNA methylated (Fig. 7C,D). CGIs in axolotl are mostly associated with transposons (Fig. 7F), and in contrast to mammals, CGIs are heavily methylated in axolotl spermatozoa (Fig. 7E). Collectively, we provide evidence for a role of the nuclear piRNA pathway and DNA methylation in axolotl transposon control. However, genetic studies will be required to formally demonstrate a functional role for piRNA-directed transposon methylation in safeguarding the axolotl germline.

We also noted that not all tetrapods have a mammalian-like piRNA pathway (Fig. 1B). In frogs, several key nuclear factors are absent. While in birds, the crux of the nuclear pathway, such as PIWIL4, SPOCD1, C19ORF84, and TEX15, have been lost. This begs the question as to why these lineages have lost the nuclear branch of the piRNA pathway. The answer may lie in the mechanism of germ cell acquisition. In tetrapods, the loss of the nuclear piRNA pathway correlates with preformation as the germline specification method that is used in birds and frogs (Extavour and Akam, 2003). While we do not understand the basis of the correlation between the presence of a nuclear piRNA pathway and the usage of epigenesis for germline specification in tetrapods, it might be linked to the evolution of epigenetic germline reprogramming. Preformation is mediated by germ plasm that is maternally inherited in the oocyte cytoplasm and instructs germ cell specification as one of the first embryonic cellular differentiation events (Mahowald, 1977). In this strategy, there is a continuity in the germline through the germ plasm, and maternally deposited piRNAs may confer continuous anti-transposon immunity as is the case in Drosophila (Brennecke et al, 2008). Both C. elegans and Drosophila have nuclear piRNA pathways and utilize preformation (Shirayama et al, 2012; Ashe et al, 2012; Luteijn et al, 2012; Rangan et al, 2011; Wang and Elgin, 2011; Klenov et al, 2011; Sienski et al, 2012; Extavour and Akam, 2003). The nuclear pathway in Drosophila is active in the germline and germline-associated somatic cells, where it prevents endogenous retroviruses that have the capacity to form infectious particles from attacking the germline (Sarot et al, 2004; Li et al, 2009a; Malone et al, 2009). This brings us to the curious case of lungfishes, which is similar to the axolotl, employs epigenesis for germline specification and possesses gigantic genomes with high transposon content (Nowoshilow et al, 2018; Wang et al, 2021; Schartl et al, 2024). However, while there is no evidence showing the axolotl genome size is currently growing (Wang et al, 2021; Schartl et al, 2024), lungfish genomes continue to expand, driven by young transposon activity (Schartl et al, 2024; Wang et al, 2021). Indeed, a recent study revealed that South American and African lungfish express few canonically sized piRNAs, leading to deficiencies in transposon control and contributing to massive genome expansion (Schartl et al, 2024). Consistent with this, we find that the piRNA pathway in lungfish lacks many genes involved in both the cytoplasmic and nuclear branches. Notably, among the sequenced lungfish, the South American lungfish has the largest genome (91 Gb) with 92.5% transposon content, and is currently expanding (Schartl et al, 2024). This further correlates with the fact that the South American lungfish has lost more piRNA pathway genes than the other lungfish species (Fig. 1B). The most significant difference between the lungfish and the axolotl is that the axolotl possesses a complete mammalian-like piRNA pathway in both the cytoplasm and nucleus. We present evidence indicating that both branches are active in axolotl, suggesting that stringent transposon control is in place. Thus, the current large size of the axolotl genome (32 Gb) (Nowoshilow et al, 2018) could reflect rapid transposon expansion in the past or a gradual expansion due to the number of remaining active copies combined with poor genome contraction processes (Sun et al, 2012).

Tracing back to earlier evolutionary events, the emergence of lobe-finned fish, the ancestors of modern lungfish, marked the beginning of the water-to-land transition in vertebrates ~390 million years ago (Narkiewicz and Narkiewicz, 2015). These ancestral species possessed robust fins with skeletal structures that offered support and movement potential, but they remained fully aquatic. Remarkably, a pivotal evolutionary step occurred when a group of early semiaquatic animals emerged from lobe-finned fish, which were the ancient tetrapods around 360 million years ago (Coates et al, 2008). They finally became the common ancestors of modern amphibians, reptiles, birds, and mammals. Our findings suggest that the piRNA pathway underwent a refinement during the water-to-land transition. In lungfish, the crux of the modern piRNA pathway is present but flawed through gene loss, while the axolotl demonstrates a functional and seemingly efficient piRNA pathway, capable of both post-transcriptional and epigenetic silencing of transposons. This suggests that the piRNA pathway in mammals might trace its origins to a conserved mechanism present in ancient tetrapods. The evolution of these efficient piRNA-mediated transposon restriction mechanisms, especially piRNA-mediated DNA methylation, was pivotal to the evolution of germline reprogramming and genomic imprinting. Without the possibility to protect a hypomethylated genome and to precisely reinstall transposon methylation, germline reprogramming would be non-permissive. Thus, the emergence of an efficient piRNA pathway halted rapid or sustained transposon-mediated genome expansion to enable genome contraction, carving the compact terrestrial tetrapod genomes observed today.

Methods

Reagents and tools table

Reagent/resource Reference or source Identifier or catalog number
Experimental models
White axolotl strain (d/d) Elly M. Tanaka Lab DD151
Recombinant DNA
Antibodies
Oligonucleotides and other sequence-based reagents
Chemicals, enzymes, and other reagents
TRIzol Thermo Fisher 15596026
T4 RNA Ligase 2 truncated KQ New England Biolabs M0373L
T4 RNA Ligase 1 New England Biolabs M0204L
SuperScript II Thermo Fisher 18064014
Kapa polymerase Kapa Biosystems KK2602
Pmel New England Biolabs R0560L
Monarch HMW
DNA Extraction Kit
New England Biolabs T3050L
NEBnext Enzymatic
Methyl-seq kit
New England Biolabs E7120L
Software
FastQC V0.11.8 Babraham Bioinformatics
Cutadapt v2.9 https://github.com/marcelm/cutadapt/
Trim Galore V0.6.7 Babraham Bioinformatics
STAR v2.7.0e https://github.com/alexdobin/STAR/
Bowtie2 v2.4.2 http://bowtie-bio.sourceforge.net/bowtie2/index.shtml
SAMtools v1.9 http://www.htslib.org/
FeatureCounts v2.0.0 http://subread.sourceforge.net/
RepeatMasker v4.1.5 http://www.repeatmasker.org/
RepeatModeler v 2.0.5 https://github.com/Dfam-consortium/RepeatModeler/
DeepTE https://github.com/LiLabAtVT/DeepTE/
proTRAC v 2.4.3 https://sourceforge.net/projects/protrac/
circlize https://github.com/jokergoo/circlize
gCluster https://github.com/Xiangyang1984/Gcluster/
Bismark v0.24.0 https://www.bioinformatics.babraham.ac.uk/projects/bismark/
SeqMonk v1.48.1 Babraham Bioinformatics
ggplot2 v3.3.5 https://sourceforge.net/projects/ggplot2.mirror/files/v3.3.5/
RStudio v2022.07.0 https://posit.co/download/rstudio-desktop/
R v3.5.1 https://www.r-project.org/
Igv v2.3.72 g http://software.broadinstitute.org/software/igv/
Illustrator software Adobe Inc.
Other
Illumina HiSeq2500 Illumina
lllumina
NextSeq2000
Illumina

Axolotl strains and sample collection

White axolotl strain (d/d) was used for sample collection. Axolotl husbandry was described in detail previously (Khattak et al, 2014). All lines were bred and maintained at the Research Institute of Molecular Pathology (IMP) facilities. Handling and surgical procedures adhered to local ethics committee guidelines, and animal experiments were conducted with approval from the Magistrate of Vienna (Genetically Modified Organism Office and MA58, City of Vienna, Austria, license GZ51072/2019/16 and license GZ665226/2019/21).

Small RNA-seq was performed on axolotl testis and oocyte samples, each with two biological replicates. EM-seq was performed on axolotl spermatozoa with three biological replicates. Samples were collected from the adult axolotl with hCG (human chorionic gonadotropin) injection to induce gamete maturation.

Small RNA sequencing for axolotl germline

RNA was extracted from two testis and two oocyte samples of the d/d white axolotl strain using TRIzol reagent (15596026), following the manufacturer’s protocol. Small RNA libraries were prepared as described in a previous publication (Jayaprakash et al, 2011). Briefly, 15 µg of total RNA was spiked with radioactively labeled marker RNAs of 19 and 35 nucleotides in length and resolved on a denaturing polyacrylamide gel. Small RNAs were excised from the gel and subjected to 3′ linker ligation using pre-adenylated DNA linkers containing four random nucleotides at the 5′ end to minimize ligation bias, following established protocols (Jayaprakash et al, 2011). The ligation was performed using T4 RNA Ligase 2 truncated KQ (M0373L).

The ligation products were purified on a denaturing polyacrylamide gel, excised based on the shifted spike-in signal, and ligated to RNA 5′ adapters containing four random nucleotides using T4 RNA Ligase 1 (M0204L). After a final purification step via denaturing polyacrylamide gel, the small RNA libraries were completed by reverse transcription using SuperScript II (18064014), followed by PCR amplification with Kapa polymerase (KK2602). The spike-in RNAs were digested using PmeI (R0560L), and the libraries were further purified using low-melt agarose gel excision. Sequencing was performed on an Illumina HiSeq2500 platform in 50-bp single-end read mode.

Whole-genome methylation sequencing for axolotl spermatozoa

DNA was extracted from three sperm samples of the white axolotl strain using Monarch HMW DNA Extraction Kit (NEB, T3050L) and solubilized in 10 mM Tris, pH 9.0, 0.5 mM EDTA. Methyl-seq libraries were prepared using the NEBnext Enzymatic Methyl-seq kit (NEB, E7120L) according to the manufacturer’s instructions. Sequencing was performed on an Illumina NextSeq2000 platform in 150-bp paired-end read mode.

Reference genome for bioinformatics analysis

The axolotl reference genome AmexG_v6.0 from https://www.axolotl-omics.org was employed (Nowoshilow et al, 2018). Axolotl gene annotation AmexT_v47 from https://www.axolotl-omics.org was employed (Nowoshilow et al, 2018). De novo transposon annotation was obtained using custom pipeline (see “Axolotl transposon annotation”).

Phylogeny analysis for piRNA pathway and germline specification mechanism in vertebrates

A phylogenetic tree was constructed according to the UCSC Genome Browser (Perez et al, 2024) (https://genome.ucsc.edu/). Icons for vertebrate species were obtained from PhyloPic (https://www.phylopic.org/), and all icons have been dedicated to the public domain.

Genome size and transposon percentage were obtained from UCSC assembly hubs (Perez et al, 2024) (https://hgdownload.soe.ucsc.edu/hubs/vertebrate/asmStats.html). For lungfish, genome size and transposon percentage were from Schartl et al (Schartl et al, 2024). Germline specification mechanisms for vertebrate species were collected from Extavour et al (Extavour and Akam, 2003) and Hansen et al (Hansen and Pelegri, 2021).

For piRNA pathway-related gene annotation, genome annotated files were obtained from UCSC Genome Browser (Perez et al, 2024) (https://genome.ucsc.edu/), Ensembl Gene Annotation (Howe et al, 2021) (https://www.ensembl.org/) or NCBI Gene Database (Sayers et al, 2022) (https://www.ncbi.nlm.nih.gov/datasets/gene/). For axolotl and lungfishes, genome annotated files were from Now-oshilow et al (Nowoshilow et al, 2018) and Schartl et al (Schartl et al, 2024), respectively. To identify potentially unannotated genes, additional searches were performed using NCBI tBLASTn against the reference genomes. For details, please refer to Dataset EV1.

RNA-seq analysis for axolotl tissues

RNA-seq datasets for axolotl testis, ovary, and 16 somatic tissues were obtained from previously published datasets GSE92429 (Bryant et al, 2017) and GSE182746 (Ye et al, 2022). FastQC v0.11.8 (Andrews, 2010) was applied for raw reads quality control. Alignment was performed using STAR v2.7.0e (Dobin et al, 2013), with no more than three mismatches (--outFilterMismatchNmax 3). The output BAM files were sorted and indexed with SAMtools v1.9 (Li et al, 2009b) for downstream analysis. FeatureCounts v2.0.0 (Liao et al, 2014) was used to quantify gene expression by exon counts (-g gene_id -t exon), and only the unique mapped reads were used for quantification. Read counts were normalized to RPKM for visualization. Heatmap was plotted by ggplot2 v3.3.5 (Wickham, 2016) in RStudio (RStudio Team, 2020).

Axolotl transposon annotation

Axolotl transposon annotation was constructed using a multi-step pipeline by combining the standard RepeatMasker annotation workflow (Smit et al, 2013) with the deep-learning-based annotation tool DeepTE (Yan et al, 2020).

First, transposon consensus sequences were generated. Transposon ancestor sequences were retrieved from the Dfam database using famdb.py from RepeatMasker v4.1.5 (Smit et al, 2013) (--ancestors). De novo transposon consensus sequences were identified with RepeatModeler v2.0.5 (Flynn et al, 2020) employing RMBlast as the search engine (-engine rmblast) and incorporating the LTR structural discovery pipeline alongside the RepeatScout/RECON pipeline (-LTRStruct). For unclassified transposon consensus sequences, DeepTE was used to further categorize them into Metazoan transposon classes (-sp M). The three consensus libraries were combined for genome annotation with RepeatMasker.

Then the genome was annotated in three iterative rounds using RepeatMasker v4.1.5. In the first round, simple repeats were annotated and masked (-e rmblast -noint -xsmall). For the second round, known transposons were annotated using the combined consensus library and applied to the simple repeats masked genome (-e rmblast -nolow), with the results masked prior to the third round. In the third round, the still unclassified transposons from DeepTE were annotated (-e rmblast -nolow). Finally, the outputs from all three rounds were merged to generate a comprehensive transposon annotation for the axolotl genome.

Potentially active transposon identification

For axolotl, the de novo transposon annotation file was used for potentially active transposon identification. Potentially active transposon copies were identified using two criteria. First, copies were required to have lengths close to the full-length consensus sequence, excluding truncated elements. Length thresholds were determined according to published references for each transposon family (Dataset EV2). Second, only young copies with less than 5% sequence divergence from the consensus (percDiv) were included, as higher divergence indicates older copies with accumulated mutations (Novick et al, 2011; Zoch et al, 2020). Transposon copies meeting both criteria were classified as potentially active transposons (Dataset EV2).

Small RNA analysis

FastQC v0.11.8 (Andrews, 2010) was applied for raw reads quality control. The NEB adapter sequence (AGATCGGAAGA) was trimmed from the 3′ ends of raw FASTQ files using Cutadapt v2.9 (Martin, 2011). Reads shorter than 16 bases after trimming were discarded (-m 16), as well as those lacking adapters (--trimmed-only). Four random nucleotides were clipped off from each end. Small RNA sequences with a minimum length of 23 bases were defined as piRNAs for downstream analysis to exclude miRNAs and siRNAs (Dataset EV3). For alignment, reads were mapped to the axolotl genome or transposon consensus sequences with STAR v2.7.0e (Dobin et al, 2013) with the following parameters: up to 1000 multimapped reads permitted (--outFilterMultimapNmax 1000), a maximum of three mismatches allowed (--outFilterMismatchNmax 3), and a minimum of 15 matched bases required (--outFilterMatchNmin 15). All multimapped reads were exported (--outSAMmultNmax -1) and were included for quantification over genic or transposon regions, with each read contributing 1/n to the counts. For genomic annotation, mapped reads were annotated based on the axolotl gene and transposon annotations. Reads that did not align to any genomic features were classified as ‘non-coding’.

piRNA cluster annotation

piRNA clusters were detected using the proTRAC v2.4.3 (Rosenkranz and Zischler, 2012) standard pipeline (Dataset EV4). First, redundant sequences and low-complexity reads were removed with TBr2_collapse.pl and TBr2_duster.pl with default parameters, respectively. The processed reads were then aligned to the axolotl genome using sRNAmapper.pl, allowing up to three mismatches, with a minimum of 15 matched bases (-mismatch 3 -seedmatch 15). Finally, piRNA cluster identification was conducted with proTRAC_2.4.3.pl, using a sliding window size of 5000, a piRNA density threshold of 0.01, and restricting piRNA lengths to 23–33 nucleotides (-swsize 5000 -pdens 0.01 -pimin 23 -pimax 33). piRNA clusters and signals were visualized using the circlize R package (Gu et al, 2014). piRNA signals were aggregated in 1 Mb genomic bins by summing 1/n-weighted scores of all mapped piRNAs, where n is the number of mapping sites per piRNA. The aggregated signals were then normalized to RPM and log2-transformed for visualization.

Ping-pong cycle analysis

Ping-pong analysis was performed using only reads mapped to transposons. For each overlapping sense and antisense read pair, the distance between their 5′ ends was calculated, and the counts were presented as relative frequencies. The nucleotide composition at the first and tenth positions of piRNAs was analyzed and expressed as proportions. Heatmap, line charts, and bar plots were generated using ggplot2 v3.3.5 (Wickham, 2016) in RStudio (RStudio Team, 2020).

CpG island annotation

CpG islands (CGIs) in axolotl were annotated by gCluster (Li et al, 2020) with the canonical Gardiner-Garden and Frommer criteria (Gardiner-Garden and Frommer, 1987). Genomic regions with CG contexts were extracted (pattern = CG), and were then filtered to define CGIs based on G + C content (≥50%), length (≥200 bp), observed/expected CpG ratio (≥0.6), and P value (≤1e-5). A total of 248,988 CGIs could be identified in the axolotl genome.

Whole-genome methylation analysis

For axolotl spermatozoa, three biological replicate samples were prepared and sequenced as described in the “Whole-genome methylation sequencing for axolotl spermatozoa” under “Methods”. To increase sequencing coverage and depth, the three replicates were merged for downstream analysis.

Raw sequencing reads were assessed for quality using FastQC v0.11.8 (Andrews, 2010) and processed using Trim Galore v0.6.7 (Krueger, 2019) with Cutadapt v3.4 (Martin, 2011) to remove adapter sequences and low-quality bases (--trim-n --clip_R2 5 --quality 20 --length 25). Trimmed paired-end reads were aligned to the reference genome or transposon consensus with the standard pipeline from Bismark v0.24.0 (Krueger and Andrews, 2011) with Bowtie2 v2.4.2 (Langmead and Salzberg, 2012), allowing up to 1 bp mismatch and permitting multimapping, with one random alignment reported for ambiguous mappings (-N 1 --ambig_bam). The mapped reads were deduplicated by deduplicate_bismark. CpG methylation data were then extracted using bismark_methylation_extractor with default parameters. For axolotl spermatozoa samples, the methylation conversion rate was determined by aligning all reads to unmethylated lambda and methylated pUC19 spike-in controls using the same Bismark pipeline (Dataset EV5).

The methylation coverage data generated by Bismark v0.24.0 (Krueger and Andrews, 2011) were imported into SeqMonk v1.48.1 (Andrews, 2022) for quantification and visualization. Methylation analysis was performed using a 50-CpG running window approach (Positions per window = 50), retaining only probes with at least 10 read counts (Minimum observations to include feature = 10). For each probe, the mean methylation level of the read counts was calculated. CpG methylation levels over genomic features were determined as the mean methylation of probes overlapping the respective features.

For genomic features, transcription start sites (TSS) were the first base of annotated transcripts, promoters were defined as the 2 kb regions upstream of the TSS, and genes were divided into exon and intron regions based on gene annotations. All transposon copies and potentially active transposon copies were identified as described (see “Axolotl transposon annotation and Potentially active transposon identification”). Transposon methylation analysis in this study excluded repeats overlapping exonic regions. Boxplots, scatterplots and correlation analysis were generated using ggplot2 v3.3.5 (Wickham, 2016) in RStudio (RStudio Team, 2020). Statistical significance was assessed using unpaired two-sided t test.

Statistics and reproducibility

Sample size was not predetermined using statistical methods, and no data were excluded from analyses. Allocation and outcome assessments were not blinded. Statistical analyses were performed in R v3.5.1 (R Core Team, 2018), with specific test methods detailed in the figure legends.

Expanded View Figures

Figure EV1. Identification and distribution of potentially active transposons in the axolotl genome.

Figure EV1

(A) Distribution of substitution rate and sequence length for every single copy in each transposon family. Defined potentially active copies are highlighted in red. (B) Composition of potentially active copies. (C) Scatter plot of percentage of potentially active copy number and total copy number for transposon classes. Transposon classes without potentially active copies are omitted.

Figure EV2. piRNA repertoires and transposon targeting in axolotl germlines.

Figure EV2

(A) The overlap of unique piRNA sequences among the four samples. (B) Mean piRNA signal level over testis or oocyte for each transposon family. ****, active copy number >1000; ***, active copy number >100; **, active copy number >10; *, active copy number >0. (C) Tracks for piRNAs targeting both sense and antisense strands of representative transposon consensus sequence. For all panels, testis, n = 2; oocyte, n = 2.

Figure EV3. Features of ping-pong cycle in axolotl germlines.

Figure EV3

(A) Relative frequency of the nucleotide distance between 5′ ends of complementary piRNA pairs over representative transposon families. Mean and S.E.M. are presented. (B) Nucleotide composition of the first and tenth position in piRNAs over representative transposon families. A, Adenine; U, Uracil; C, Cytosine; G, Guanine. For all panels, testis, n = 2; oocyte, n = 2.

Figure EV4. DNA methylation landscape of minor transposon classes and CpG islands in axolotl spermatozoa.

Figure EV4

(A) Percentages of CpG methylation levels over all copies of minor transposon classes in the axolotl. For boxplots, the middle line represents the median; boxes represent the 25th (bottom) and 75th (top) percentiles; whiskers represent median ± 1.5× interquartile range; and outside values are not shown. Rhombus, mean level. (B) Length distribution of all CGI regions (n = 248,988). (C) CGI enrichment over TSS and adjacent regions. For statistical tests, ***P value < 0.001, n.s. not significant; Chi-square test. (D) Distribution of CpG methylation level and substitution rate for all transposon copies or copies overlapped with CGI of minor transposon classes. (E) Percentages of transposons overlapped with shuffled region. For all panels, spermatozoa, n = 3; pooled for analysis.

Supplementary Material

Supplementary file
Source data 1
Source data 2
Source data 3
Source data 4
Source data 5
Source data 6
Source data 7
Author check list
Author Checklist
Dataset EV1
Dataset EV2
Dataset EV3
Dataset EV4
Dataset EV5
Reagent table

Acknowledgements

This research was supported by funding from Wellcome to D.O’C. (106144 and 225237), International Scientific Research Cooperation Cultivation Project of Zhejiang University International Campus to WL and DO’C, ZJU-UoE Joint Research Project funding from the ZJE institute to WL and DO’C, Natural Science Foundation of China to WL (32170551). We also sincerely acknowledge the funding from the Wellcome Centre for Cell Biology (203149 and multi-user equipment grants 108504 and 092076); the Wellcome Discovery Research Platform for Hidden Cell Biology (226791); and support from the bioinformatics cores of the Wellcome Discovery Research Platform for Hidden Cell Biology. We thank Dr. Yuka Kabayama and Dr. Rebecca V Berrens for their generous assistance on sRNA-seq and EM-seq analysis, and Dr. Julius Brennecke, Dr. Ansgar Zoch, David MacLeod and Tamoghna Chowdhury for their insightful discussions and help with manuscript review. We also extend our gratitude to staffs at the EMBL GeneCore facility in Heidelberg, Germany, for preparing the EM-seq libraries and sequencing all libraries.

Footnotes

Author contributions

Xinyu Xiang: Resources; Data curation; Formal analysis; Validation; Investigation; Visualization; Methodology; Writing—original draft; Project administration; Writing—review and editing. Anni Gao: Formal analysis; Visualization; Methodology. Dominik Handler: Validation; Methodology; Writing—review and editing. Francisco Falcon: Validation; Methodology; Writing—review and editing. Diego Rodriguez-Terrones: Formal analysis; Investigation; Methodology; Writing—review and editing. Sergej Nowoshilow: Supervision; Visualization; Methodology; Writing—review and editing. Wanlu Liu: Supervision; Funding acquisition; Writing—review and editing. Elly M Tanaka: Resources; Supervision; Validation; Methodology; Project administration; Writing—review and editing. Dónal O’Carroll: Conceptualization; Resources; Supervision; Funding acquisition; Investigation; Writing—original draft; Project administration; Writing—review and editing.

Source data underlying figure panels in this paper may have individual authorship assigned. Where available, figure panel/source data authorship is listed in the following database record: biostudies:S-SCDT-10_1038-S44318-025-00631-w.

Disclosure and competing interests statement

The authors declare no competing interests.

Peer review information

A peer review file is available at https://doi.org/10.1038/s44318-025-00631-w

Data availability

All sRNA-seq and EM-seq data have been deposited at the GEO database under the accession number GSE290049. Scripts used for transposon annotation, RNA-seq, sRNA-seq and EM-seq analyses and plots are available on github (https://github.com/XinyuXiang/AxolotlpiRNApathway/).

The source data of this paper are collected in the following database record: biostudies:S-SCDT-10_1038-S44318-025-00631-w.

Expanded view data, supplementary information, appendices are available for this paper at https://doi.org/10.1038/s44318-025-00631-w.

References

  1. Andrews S. FastQC: A quality control tool for high throughput sequence data. 2010. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
  2. Andrews S. Seqmonk: a tool to visualise and analyse high throughput mapped sequence data. 2022. Available online at: https://www.bioinformatics.babraham.ac.uk/projects/seqmonk/
  3. Aravin A, Gaidatzis D, Pfeffer S, Lagos-Quintana M, Landgraf P, Iovino N, Morris P, Brownstein MJ, Kuramochi-Miyagawa S, Nakano T, et al. A novel class of small RNAs bind to MILI protein in mouse testes. Nature. 2006;442:203–207. doi: 10.1038/nature04916. [DOI] [PubMed] [Google Scholar]
  4. Aravin AA, Sachidanandam R, Bourc’his D, Schaefer C, Pezic D, Toth KF, Bestor T, Hannon GJ. A piRNA pathway primed by individual transposons is linked to de novo DNA methylation in mice. Mol Cell. 2008;31:785–799. doi: 10.1016/j.molcel.2008.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aravin AA, Sachidanandam R, Girard A, Fejes-Toth K, Hannon GJ. Developmentally regulated piRNA clusters implicate MILI in transposon control. Science. 2007;316:744–747. doi: 10.1126/science.1142612. [DOI] [PubMed] [Google Scholar]
  6. Ashe A, Sapetschnig A, Weick E-M, Mitchell J, Bagijn MP, Cording AC, Doebley A-L, Goldstein LD, Lehrbach NJ, Le Pen J, et al. piRNAs can trigger a multigenerational epigenetic memory in the germline of C. elegans. Cell. 2012;150:88–99. doi: 10.1016/j.cell.2012.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Barau J, Teissandier A, Zamudio N, Roy S, Nalesso V, Hérault Y, Guillou F, Bourc’his D. The DNA methyltransferase DNMT3C protects male germ cells from transposon activity. Science. 2016;354:909–912. doi: 10.1126/science.aah5143. [DOI] [PubMed] [Google Scholar]
  8. Belgnaoui SM, Gosden RG, Semmes OJ, Haoudi A. Human LINE-1 retrotransposon induces DNA damage and apoptosis in cancer cells. Cancer Cell Int. 2006;6:13. doi: 10.1186/1475-2867-6-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bourc’his D, Bestor TH. Meiotic catastrophe and retrotransposon reactivation in male germ cells lacking Dnmt3L. Nature. 2004;431:96–99. doi: 10.1038/nature02886. [DOI] [PubMed] [Google Scholar]
  10. Bourque G, Burns KH, Gehring M, Gorbunova V, Seluanov A, Hammell M, Imbeault M, Izsvák Z, Levin HL, Macfarlan TS, et al. Ten things you should know about transposable elements. Genome Biol. 2018;19:199. doi: 10.1186/s13059-018-1577-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R, Hannon GJ. Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila. Cell. 2007;128:1089–1103. doi: 10.1016/j.cell.2007.01.043. [DOI] [PubMed] [Google Scholar]
  12. Brennecke J, Malone CD, Aravin AA, Sachidanandam R, Stark A, Hannon GJ. An epigenetic role for maternally inherited piRNAs in transposon silencing. Science. 2008;322:1387–1392. doi: 10.1126/science.1165171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Brouha B, Schustak J, Badge RM, Lutz-Prigge S, Farley AH, Moran JV, Kazazian HH. Hot L1s account for the bulk of retrotransposition in the human population. Proc Natl Acad Sci USA. 2003;100:5280–5285. doi: 10.1073/pnas.0831042100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bryant DM, Johnson K, DiTommaso T, Tickle T, Couger MB, Payzin-Dogru D, Lee TJ, Leigh ND, Kuo T-H, Davis FG, et al. A Tissue-mapped axolotl de novo transcriptome enables identification of limb regeneration factors. Cell Rep. 2017;18:762–776. doi: 10.1016/j.celrep.2016.12.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Carmell MA, Girard A, van de Kant HJG, Bourc’his D, Bestor TH, de Rooij DG, Hannon GJ. MIWI2 is essential for spermatogenesis and repression of transposons in the mouse male germline. Dev Cell. 2007;12:503–514. doi: 10.1016/j.devcel.2007.03.001. [DOI] [PubMed] [Google Scholar]
  16. Coates MI, Ruta M, Friedman M. Ever since owen: changing perspectives on the early evolution of tetrapods. Annu Rev Ecol Evol Syst. 2008;39:571–592. [Google Scholar]
  17. De Fazio S, Bartonicek N, Di Giacomo M, Abreu-Goodger C, Sankar A, Funaya C, Antony C, Moreira PN, Enright AJ, O’Carroll D. The endonuclease activity of Mili fuels piRNA amplification that silences LINE1 elements. Nature. 2011;480:259–263. doi: 10.1038/nature10547. [DOI] [PubMed] [Google Scholar]
  18. Deaton AM, Bird A. CpG islands and the regulation of transcription. Genes Dev. 2011;25:1010–1022. doi: 10.1101/gad.2037511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Deng W, Lin H. miwi, a murine homolog of piwi, encodes a cytoplasmic protein essential for spermatogenesis. Dev Cell. 2002;2:819–830. doi: 10.1016/s1534-5807(02)00165-x. [DOI] [PubMed] [Google Scholar]
  20. Di Giacomo M, Comazzetto S, Saini H, De Fazio S, Carrieri C, Morgan M, Vasiliauskaite L, Benes V, Enright AJ, O’Carroll D. Multiple epigenetic mechanisms and the piRNA pathway enforce LINE1 silencing during adult spermatogenesis. Mol Cell. 2013;50:601–608. doi: 10.1016/j.molcel.2013.04.026. [DOI] [PubMed] [Google Scholar]
  21. Dias Mirandela M, Zoch A, Leismann J, Webb S, Berrens RV, Valsakumar D, Kabayama Y, Auchynnikava T, Schito M, Chowdhury T, et al. Two-factor authentication underpins the precision of the piRNA pathway. Nature. 2024;634:979–985. doi: 10.1038/s41586-024-07963-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ding D, Liu J, Dong K, Midic U, Hess RA, Xie H, Demireva EY, Chen C. PNLDC1 is essential for piRNA 3’ end trimming and transposon silencing during spermatogenesis in mice. Nat Commun. 2017;8:819. doi: 10.1038/s41467-017-00854-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Extavour CG, Akam M. Mechanisms of germ cell specification across the metazoans: epigenesis and preformation. Development. 2003;130:5869–5884. doi: 10.1242/dev.00804. [DOI] [PubMed] [Google Scholar]
  25. Falcon F, Tanaka EM, Rodriguez-Terrones D. Transposon waves at the water-to-land transition. Curr Opin Genet Dev. 2023;81:102059. doi: 10.1016/j.gde.2023.102059. [DOI] [PubMed] [Google Scholar]
  26. Farkash EA, Prak ETL. DNA damage and L1 retrotransposition. BioMed Res Int. 2006;2006:037285. doi: 10.1155/JBB/2006/37285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Flemr M, Malik R, Franke V, Nejepinska J, Sedlacek R, Vlahovicek K, Svoboda P. A retrotransposon-driven dicer isoform directs endogenous small interfering RNA production in mouse oocytes. Cell. 2013;155:807–816. doi: 10.1016/j.cell.2013.10.001. [DOI] [PubMed] [Google Scholar]
  28. Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, Smit AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci USA. 2020;117:9451–9457. doi: 10.1073/pnas.1921046117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Gainetdinov I, Skvortsova Y, Kondratieva S, Funikov S, Azhikina T. Two modes of targeting transposable elements by piRNA pathway in human testis. RNA. 2017;23:1614–1625. doi: 10.1261/rna.060939.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Gardiner-Garden M, Frommer M. CpG islands in vertebrate genomes. J Mol Biol. 1987;196:261–282. doi: 10.1016/0022-2836(87)90689-9. [DOI] [PubMed] [Google Scholar]
  31. Gasior SL, Wakeman TP, Xu B, Deininger PL. The human LINE-1 retrotransposon creates DNA double-strand breaks. J Mol Biol. 2006;357:1383–1393. doi: 10.1016/j.jmb.2006.01.089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Girard A, Sachidanandam R, Hannon GJ, Carmell MA. A germline-specific class of small RNAs binds mammalian Piwi proteins. Nature. 2006;442:199–202. doi: 10.1038/nature04917. [DOI] [PubMed] [Google Scholar]
  33. Goodier JL, Kazazian HH. Retrotransposons revisited: the restraint and rehabilitation of parasites. Cell. 2008;135:23–35. doi: 10.1016/j.cell.2008.09.022. [DOI] [PubMed] [Google Scholar]
  34. Greenberg MVC, Bourc’his D. The diverse roles of DNA methylation in mammalian development and disease. Nat Rev Mol Cell Biol. 2019;20:590–607. doi: 10.1038/s41580-019-0159-6. [DOI] [PubMed] [Google Scholar]
  35. Grimson A, Srivastava M, Fahey B, Woodcroft BJ, Chiang HR, King N, Degnan BM, Rokhsar DS, Bartel DP. Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals. Nature. 2008;455:1193–1197. doi: 10.1038/nature07415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Grivna ST, Beyret E, Wang Z, Lin H. A novel class of small RNAs in mouse spermatogenic cells. Genes Dev. 2006;20:1709–1714. doi: 10.1101/gad.1434406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize Implements and enhances circular visualization in R. Bioinformatics. 2014;30:2811–2812. doi: 10.1093/bioinformatics/btu393. [DOI] [PubMed] [Google Scholar]
  38. Gunawardane LS, Saito K, Nishida KM, Miyoshi K, Kawamura Y, Nagami T, Siomi H, Siomi MC. A slicer-mediated mechanism for repeat-associated siRNA 5’ end formation in Drosophila. Science. 2007;315:1587–1590. doi: 10.1126/science.1140494. [DOI] [PubMed] [Google Scholar]
  39. Gutierrez J, Platt R, Opazo JC, Ray DA, Hoffmann F, Vandewege M. Evolutionary history of the vertebrate Piwi gene family. PeerJ. 2021;9:e12451. doi: 10.7717/peerj.12451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hammoud SS, Low DHP, Yi C, Carrell DT, Guccione E, Cairns BR. Chromatin and transcription transitions of mammalian adult germline stem cells and spermatogenesis. Cell Stem Cell. 2014;15:239–253. doi: 10.1016/j.stem.2014.04.006. [DOI] [PubMed] [Google Scholar]
  41. Han BW, Hung J-H, Weng Z, Zamore PD, Ameres SL. The 3’-to-5’ exoribonuclease Nibbler shapes the 3’ ends of microRNAs bound to Drosophila Argonaute1. Curr Biol. 2011;21:1878–1887. doi: 10.1016/j.cub.2011.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hansen CL, Pelegri F. Primordial Germ Cell Specification in Vertebrate Embryos: Phylogenetic Distribution and Conserved Molecular Features of Preformation and Induction. Front Cell Dev Biol. 2021;9:730332. doi: 10.3389/fcell.2021.730332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hasuwa H, Iwasaki YW, Au Yeung WK, Ishino K, Masuda H, Sasaki H, Siomi H. Production of functional oocytes requires maternally expressed PIWI genes and piRNAs in golden hamsters. Nat Cell Biol. 2021;23:1002–1012. doi: 10.1038/s41556-021-00745-3. [DOI] [PubMed] [Google Scholar]
  44. Hayashi R, Schnabl J, Handler D, Mohn F, Ameres SL, Brennecke J. Genetic and mechanistic diversity of piRNA 3’-end formation. Nature. 2016;539:588–592. doi: 10.1038/nature20162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Houwing S, Berezikov E, Ketting RF. Zili is required for germ cell differentiation and meiosis in zebrafish. EMBO J. 2008;27:2702–2711. doi: 10.1038/emboj.2008.204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Houwing S, Kamminga LM, Berezikov E, Cronembold D, Girard A, van den Elst H, Filippov DV, Blaser H, Raz E, Moens CB, et al. A role for Piwi and piRNAs in germ cell maintenance and transposon silencing in Zebrafish. Cell. 2007;129:69–82. doi: 10.1016/j.cell.2007.03.026. [DOI] [PubMed] [Google Scholar]
  47. Howe KL, Achuthan P, Allen J, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Azov AG, Bennett R, Bhai J, et al. Ensembl 2021. Nucleic Acids Res. 2021;49:D884–D891. doi: 10.1093/nar/gkaa942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Illingworth RS, Bird AP. CpG islands—‘A rough guide’. FEBS Lett. 2009;583:1713–1720. doi: 10.1016/j.febslet.2009.04.012. [DOI] [PubMed] [Google Scholar]
  49. Ishino K, Hasuwa H, Yoshimura J, Iwasaki YW, Nishihara H, Seki NM, Hirano T, Tsuchiya M, Ishizaki H, Masuda H, et al. Hamster PIWI proteins bind to piRNAs with stage-specific size variations during oocyte maturation. Nucleic Acids Res. 2021;49:2700–2720. doi: 10.1093/nar/gkab059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Izumi N, Shoji K, Sakaguchi Y, Honda S, Kirino Y, Suzuki T, Katsuma S, Tomari Y. Identification and functional analysis of the Pre-piRNA 3’ trimmer in silkworms. Cell. 2016;164:962–973. doi: 10.1016/j.cell.2016.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Jain D, Meydan C, Lange J, Bouuaert CC, Lailler N, Mason CE, Anderson KV, Keeney S. rahu is a mutant allele of Dnmt3c, encoding a DNA methyltransferase homolog required for meiosis and transposon repression in the mouse male germline. PLoS Genet. 2017;13:e1006964. doi: 10.1371/journal.pgen.1006964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Jayaprakash AD, Jabado O, Brown BD, Sachidanandam R. Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing. Nucleic Acids Res. 2011;39:e141. doi: 10.1093/nar/gkr693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Johnson AD, Alberio R. Primordial germ cells: the first cell lineage or the last cells standing? Development. 2015;142:2730–2739. doi: 10.1242/dev.113993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Khattak S, Murawala P, Andreas H, Kappert V, Schuez M, Sandoval-Guzmán T, Crawford K, Tanaka EM. Optimized axolotl (Ambystoma mexicanum) husbandry, breeding, metamorphosis, transgenesis and tamoxifen-mediated recombination. Nat Protoc. 2014;9:529–540. doi: 10.1038/nprot.2014.040. [DOI] [PubMed] [Google Scholar]
  55. Kirschner M, Gerhart J. Evolvability. Proc Natl Acad Sci USA. 1998;95:8420–8427. doi: 10.1073/pnas.95.15.8420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Klenov MS, Sokolova OA, Yakushev EY, Stolyarenko AD, Mikhaleva EA, Lavrov SA, Gvozdev VA. Separation of stem cell maintenance and transposon silencing functions of Piwi protein. Proc Natl Acad Sci USA. 2011;108:18760–18765. doi: 10.1073/pnas.1106676108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Klughammer J, Romanovskaia D, Nemc A, Posautz A, Seid CA, Schuster LC, Keinath MC, Lugo Ramos JS, Kosack L, Evankow A, et al. Comparative analysis of genome-scale, base-resolution DNA methylation profiles across 580 animal species. Nat Commun. 2023;14:232. doi: 10.1038/s41467-022-34828-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Konstantinidou P, Loubalova Z, Ahrend F, Friman A, Almeida MV, Poulet A, Horvat F, Wang Y, Losert W, Lorenzi H, et al. A comparative roadmap of PIWI-interacting RNAs across seven species reveals insights into de novo piRNA-precursor formation in mammals. Cell Rep. 2024;43:114777. doi: 10.1016/j.celrep.2024.114777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Krueger F. Trim galore. 2019. Available online at: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/
  60. Krueger F, Andrews SR. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics. 2011;27:1571–1572. doi: 10.1093/bioinformatics/btr167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kuramochi-Miyagawa S, Kimura T, Ijiri TW, Isobe T, Asada N, Fujita Y, Ikawa M, Iwai N, Okabe M, Deng W, et al. Mili, a mammalian member of Piwi family gene, is essential for spermatogenesis. Development. 2004;131:839–849. doi: 10.1242/dev.00973. [DOI] [PubMed] [Google Scholar]
  62. Kuramochi-Miyagawa S, Watanabe T, Gotoh K, Totoki Y, Toyoda A, Ikawa M, Asada N, Kojima K, Yamaguchi Y, Ijiri TW, et al. DNA methylation of retrotransposon genes is regulated by Piwi family members MILI and MIWI2 in murine fetal testes. Genes Dev. 2008;22:908–917. doi: 10.1101/gad.1640708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Lau NC, Seto AG, Kim J, Kuramochi-Miyagawa S, Nakano T, Bartel DP, Kingston RE. Characterization of the piRNA complex from rat testes. Science. 2006;313:363–367. doi: 10.1126/science.1130164. [DOI] [PubMed] [Google Scholar]
  65. Lawson KA, Dunn NR, Roelen BA, Zeinstra LM, Wright CV, Korving JP, Hogan BL. Bmp4 is required for the generation of primordial germ cells in the mouse embryo. Genes Dev. 1999;13:424–436. doi: 10.1101/gad.13.4.424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Li C, Vagin VV, Lee S, Xu J, Ma S, Xi H, Seitz H, Horwich MD, Syrzycka M, Honda BM, et al. Collapse of germline piRNAs in the absence of Argonaute3 reveals somatic piRNAs in flies. Cell. 2009a;137:509–521. doi: 10.1016/j.cell.2009.04.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009b;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Li X, Chen F, Chen Y. Gcluster: a simple-to-use tool for visualizing and comparing genome contexts for numerous genomes. Bioinformatics. 2020;36:3871–3873. doi: 10.1093/bioinformatics/btaa212. [DOI] [PubMed] [Google Scholar]
  69. Li XZ, Roy CK, Dong X, Bolcun-Filas E, Wang J, Han BW, Xu J, Moore MJ, Schimenti JC, Weng Z, et al. An ancient transcription factor initiates the burst of piRNA production during early meiosis in mouse testes. Mol Cell. 2013;50:67–81. doi: 10.1016/j.molcel.2013.02.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
  71. Liu J, Hu H, Panserat S, Marandel L. Evolutionary history of DNA methylation related genes in chordates: new insights from multiple whole genome duplications. Sci Rep. 2020;10:970. doi: 10.1038/s41598-020-57753-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Liu N, Abe M, Sabin LR, Hendriks G-J, Naqvi AS, Yu Z, Cherry S, Bonini NM. The exoribonuclease Nibbler controls 3’ end processing of microRNAs in Drosophila. Curr Biol. 2011;21:1888–1893. doi: 10.1016/j.cub.2011.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Loubalova Z, Fulka H, Horvat F, Pasulka J, Malik R, Hirose M, Ogura A, Svoboda P. Formation of spermatogonia and fertile oocytes in golden hamsters requires piRNAs. Nat Cell Biol. 2021;23:992–1001. doi: 10.1038/s41556-021-00746-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Luteijn MJ, van Bergeijk P, Kaaij LJT, Almeida MV, Roovers EF, Berezikov E, Ketting RF. Extremely stable Piwi-induced gene silencing in Caenorhabditis elegans. EMBO J. 2012;31:3422–3430. doi: 10.1038/emboj.2012.213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Mahowald A. The germ plasm of Drosophila: an experimental system for the analysis of determination. Am Zool. 1977;17:551–563. [Google Scholar]
  76. Malone CD, Brennecke J, Dus M, Stark A, McCombie WR, Sachidanandam R, Hannon GJ. Specialized piRNA pathways act in germline and somatic tissues of the Drosophila ovary. Cell. 2009;137:522–535. doi: 10.1016/j.cell.2009.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–12. [Google Scholar]
  78. Molaro A, Hodges E, Fang F, Song Q, McCombie WR, Hannon GJ, Smith AD. Sperm methylation profiles reveal features of epigenetic inheritance and evolution in primates. Cell. 2011;146:1029–1041. doi: 10.1016/j.cell.2011.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Mouse Genome Informatics Web Site, The Jackson Laboratory, Bar Harbor & Maine. Spin1 expression in mouse tissues. Mouse Genome Database (MGD) 2025. https://www.informatics.jax.org/
  80. Nagirnaja L, Mørup N, Nielsen JE, Stakaitis R, Golubickaite I, Oud MS, Winge SB, Carvalho F, Aston KI, Khani F, et al. Variant PNLDC1, defective piRNA processing, and azoospermia. N Engl J Med. 2021;385:707–719. doi: 10.1056/NEJMoa2028973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Narkiewicz K, Narkiewicz M. The age of the oldest tetrapod tracks from Zachełmie, Poland. Lethaia. 2015;48:10–12. [Google Scholar]
  82. Novick PA, Smith JD, Floumanhaft M, Ray DA, Boissinot S. The evolution and diversity of DNA transposons in the genome of the lizard Anolis carolinensis. Genome Biol Evol. 2011;3:1–14. doi: 10.1093/gbe/evq080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Nowoshilow S, Schloissnig S, Fei J-F, Dahl A, Pang AWC, Pippel M, Winkler S, Hastie AR, Young G, Roscito JG, et al. The axolotl genome and the evolution of key tissue formation regulators. Nature. 2018;554:50–55. doi: 10.1038/nature25458. [DOI] [PubMed] [Google Scholar]
  84. Ozata DM, Gainetdinov I, Zoch A, O’Carroll D, Zamore PD. PIWI-interacting RNAs: small RNAs with big functions. Nat Rev Genet. 2019;20:89–108. doi: 10.1038/s41576-018-0073-3. [DOI] [PubMed] [Google Scholar]
  85. Perez G, Barber GP, Benet-Pages A, Casper J, Clawson H, Diekhans M, Fischer C, Gonzalez JN, Hinrichs AS, Lee CM, et al. The UCSC Genome Browser database: 2025 update. Nucleic Acids Res. 2024;53:D1243–D1249. doi: 10.1093/nar/gkae974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Ponger L, Li W-H. Evolutionary diversification of DNA methyltransferases in eukaryotic genomes. Mol Biol Evol. 2005;22:1119–1128. doi: 10.1093/molbev/msi098. [DOI] [PubMed] [Google Scholar]
  87. Posit team. RStudio: Integrated Development Environment for R. Posit Software, PBC; Boston, MA: 2025. http://www.posit.co/ [Google Scholar]
  88. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2023. R Core Team https://www.R-project.org/ [Google Scholar]
  89. Rangan P, Malone CD, Navarro C, Newbold SP, Hayes PS, Sachidanandam R, Hannon GJ, Lehmann R. piRNA production requires heterochromatin formation in Drosophila. Curr Biol. 2011;21:1373–1379. doi: 10.1016/j.cub.2011.06.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Reuter M, Berninger P, Chuma S, Shah H, Hosokawa M, Funaya C, Antony C, Sachidanandam R, Pillai RS. Miwi catalysis is required for piRNA amplification-independent LINE1 transposon silencing. Nature. 2011;480:264–267. doi: 10.1038/nature10672. [DOI] [PubMed] [Google Scholar]
  91. Roovers EF, Rosenkranz D, Mahdipour M, Han C-T, He N, Chuva de Sousa Lopes SM, van der Westerlaken LAJ, Zischler H, Butter F, Roelen BAJ, et al. Piwi proteins and piRNAs in mammalian oocytes and early embryos. Cell Rep. 2015;10:2069–2082. doi: 10.1016/j.celrep.2015.02.062. [DOI] [PubMed] [Google Scholar]
  92. Rosenkranz D, Zischler H. proTRAC - a software for probabilistic piRNA cluster detection, visualization and analysis. BMC Bioinforma. 2012;13:5. doi: 10.1186/1471-2105-13-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Saito K, Nishida KM, Mori T, Kawamura Y, Miyoshi K, Nagami T, Siomi H, Siomi MC. Specific association of Piwi with rasiRNAs derived from retrotransposon and heterochromatic regions in the Drosophila genome. Genes Dev. 2006;20:2214–2222. doi: 10.1101/gad.1454806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Sarot E, Payen-Groschêne G, Bucheton A, Pélisson A. Evidence for a piwi-dependent RNA silencing of the gypsy endogenous retrovirus by the Drosophila melanogaster flamenco gene. Genetics. 2004;166:1313–1321. doi: 10.1534/genetics.166.3.1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Sayers EW, Bolton EE, Brister JR, Canese K, Chan J, Comeau DC, Connor R, Funk K, Kelly C, Kim S, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2022;50:D20–D26. doi: 10.1093/nar/gkab1112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Schartl M, Woltering JM, Irisarri I, Kneitz S, Pippel M, Brown T, Franchini P, Li J, Li M, et al. The genomes of all lungfish inform on genome expansion and tetrapod evolution. Nature. 2024;634:96–103. doi: 10.1038/s41586-024-07830-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Schöpp T, Prigozhin DM, Douse C, Kaji K, Cook AG, O’Carroll D. The DUF3715 domain has a conserved role in RNA-directed transposon silencing. RNA. 2023;29:1471–1480. doi: 10.1261/rna.079693.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Schöpp T, Zoch A, Berrens RV, Auchynnikava T, Kabayama Y, Vasiliauskaitė L, Rappsilber J, Allshire RC, O’Carroll D. TEX15 is an essential executor of MIWI2-directed transposon DNA methylation and silencing. Nat Commun. 2020;11:3739. doi: 10.1038/s41467-020-17372-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Seisenberger S, Andrews S, Krueger F, Arand J, Walter J, Santos F, Popp C, Thienpont B, Dean W, Reik W. The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol Cell. 2012;48:849–862. doi: 10.1016/j.molcel.2012.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Sharif J, Endo TA, Toyoda T, Koseki H. Divergence of CpG island promoters: a consequence or cause of evolution? Dev Growth Differ. 2010;52:545–554. doi: 10.1111/j.1440-169X.2010.01193.x. [DOI] [PubMed] [Google Scholar]
  101. Shirayama M, Seth M, Lee H-C, Gu W, Ishidate T, Conte D, Mello CC. piRNAs initiate an epigenetic memory of nonself RNA in the C. elegans germline. Cell. 2012;150:65–77. doi: 10.1016/j.cell.2012.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Sienski G, Dönertas D, Brennecke J. Transcriptional silencing of transposons by Piwi and maelstrom and its impact on chromatin state and gene expression. Cell. 2012;151:964–980. doi: 10.1016/j.cell.2012.10.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Smallwood SA, Tomizawa S, Krueger F, Ruf N, Carli N, Segonds-Pichon A, Sato S, Hata K, Andrews SR, Kelsey G. Dynamic CpG island methylation landscape in oocytes and preimplantation embryos. Nat Genet. 2011;43:811–814. doi: 10.1038/ng.864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Smit AFA, Hubley R, Green P. RepeatMasker Open-40. 2013. 2015 http://www.repeatmasker.org.
  105. Stallmeyer B, Bühlmann C, Stakaitis R, Dicke A-K, Ghieh F, Meier L, Zoch A, MacKenzie MacLeod D, Steingröver J, Okutman Ö, et al. Inherited defects of piRNA biogenesis cause transposon de-repression, impaired spermatogenesis, and human male infertility. Nat Commun. 2024;15:6637. doi: 10.1038/s41467-024-50930-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Sun C, López Arriaza JR, Mueller RL. Slow DNA loss in the gigantic genomes of salamanders. Genome Biol Evol. 2012;4:1340–1348. doi: 10.1093/gbe/evs103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Tang W, Tu S, Lee H-C, Weng Z, Mello CC. The RNase PARN-1 trims piRNA 3’ ends to promote transcriptome surveillance in C. elegans. Cell. 2016;164:974–984. doi: 10.1016/j.cell.2016.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. The Human Protein Atlas. Spin1 expression in human tissues. The Human Protein Atlas. 2025 [Google Scholar]
  109. Vagin VV, Sigova A, Li C, Seitz H, Gvozdev V, Zamore PD. A distinct small RNA pathway silences selfish genetic elements in the germline. Science. 2006;313:320–324. doi: 10.1126/science.1129333. [DOI] [PubMed] [Google Scholar]
  110. Wang K, Wang J, Zhu C, Yang L, Ren Y, Ruan J, Fan G, Hu J, Xu W, Bi X, et al. African lungfish genome sheds light on the vertebrate water-to-land transition. Cell. 2021;184:1362–1376.:e18. doi: 10.1016/j.cell.2021.01.047. [DOI] [PubMed] [Google Scholar]
  111. Wang SH, Elgin SCR. Drosophila Piwi functions downstream of piRNA production mediating a chromatin-based transposon silencing mechanism in female germ line. Proc Natl Acad Sci USA. 2011;108:21164–21169. doi: 10.1073/pnas.1107892109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Wang X, Ramat A, Simonelig M, Liu M-F. Emerging roles and functional mechanisms of PIWI-interacting RNAs. Nat Rev Mol Cell Biol. 2023;24:123–141. doi: 10.1038/s41580-022-00528-0. [DOI] [PubMed] [Google Scholar]
  113. Wickham H. ggplot2: elegant graphics for data analysis. Springer-Verlag; New York: 2016. [Google Scholar]
  114. Williams Z, Morozov P, Mihailovic A, Lin C, Puvvula PK, Juranek S, Rosenwaks Z, Tuschl T. Discovery and characterization of piRNAs in the human fetal ovary. Cell Rep. 2015;13:854–863. doi: 10.1016/j.celrep.2015.09.030. [DOI] [PubMed] [Google Scholar]
  115. Xing J, Zhang Y, Han K, Salem AH, Sen SK, Huff CD, Zhou Q, Kirkness EF, Levy S, Batzer MA, et al. Mobile elements create structural variation: analysis of a complete human genome. Genome Res. 2009;19:1516–1526. doi: 10.1101/gr.091827.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Yan H, Bombarely A, Li S. DeepTE: a computational method for de novo classification of transposons with convolutional neural network. Bioinformatics. 2020;36:4269–4275. doi: 10.1093/bioinformatics/btaa519. [DOI] [PubMed] [Google Scholar]
  117. Yang F, Lan Y, Pandey RR, Homolka D, Berger SL, Pillai RS, Bartolomei MS, Wang PJ. TEX15 associates with MILI and silences transposable elements in male germ cells. Genes Dev. 2020;34:745–750. doi: 10.1101/gad.335489.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Yang Q, Li R, Lyu Q, Hou L, Liu Z, Sun Q, Liu M, Shi H, Xu B, Yin M, et al. Single-cell CAS-seq reveals a class of short PIWI-interacting RNAs in human oocytes. Nat Commun. 2019;10:3389. doi: 10.1038/s41467-019-11312-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Ye F, Zhang G, Chen H, Yu C, Yang L, Fu Y, Li J, Fu S, Sun Z, et al. Construction of the axolotl cell landscape using combinatorial hybridization sequencing at single-cell resolution. Nat Commun. 2022;13:4228. doi: 10.1038/s41467-022-31879-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. Zhang H, Zhang F, Chen Q, Li M, Lv X, Xiao Y, Zhang Z, Hou L, Lai Y, Zhang Y, et al. The piRNA pathway is essential for generating functional oocytes in golden hamsters. Nat Cell Biol. 2021;23:1013–1022. doi: 10.1038/s41556-021-00750-6. [DOI] [PubMed] [Google Scholar]
  121. Zhang Y, Guo R, Cui Y, Zhu Z, Zhang Y, Wu H, Zheng B, Yue Q, Bai S, Zeng W, et al. An essential role for PNLDC1 in piRNA 3′ end trimming and male fertility in mice. Cell Res. 2017;27:1392–1396. doi: 10.1038/cr.2017.125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  122. Zoch A, Auchynnikava T, Berrens RV, Kabayama Y, Schöpp T, Heep M, Vasiliauskaitė L, Pérez-Rico YA, Cook AG, Shkumatava A, et al. SPOCD1 is an essential executor of piRNA-directed de novo DNA methylation. Nature. 2020;584:635–639. doi: 10.1038/s41586-020-2557-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Zoch A, Konieczny G, Auchynnikava T, Stallmeyer B, Rotte N, Heep M, Berrens RV, Schito M, Kabayama Y, Schöpp T, et al. C19ORF84 connects piRNA and DNA methylation machineries to defend the mammalian germ line. Mol Cell. 2024;84:1021–1035.:e11. doi: 10.1016/j.molcel.2024.01.014. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary file
Source data 1
Source data 2
Source data 3
Source data 4
Source data 5
Source data 6
Source data 7
Author check list
Author Checklist
Dataset EV1
Dataset EV2
Dataset EV3
Dataset EV4
Dataset EV5
Reagent table

Data Availability Statement

All sRNA-seq and EM-seq data have been deposited at the GEO database under the accession number GSE290049. Scripts used for transposon annotation, RNA-seq, sRNA-seq and EM-seq analyses and plots are available on github (https://github.com/XinyuXiang/AxolotlpiRNApathway/).

The source data of this paper are collected in the following database record: biostudies:S-SCDT-10_1038-S44318-025-00631-w.

Expanded view data, supplementary information, appendices are available for this paper at https://doi.org/10.1038/s44318-025-00631-w.

RESOURCES