Skip to main content
Life Science Alliance logoLink to Life Science Alliance
. 2026 Apr 20;9(7):e202603650. doi: 10.26508/lsa.202603650

Flamenco plasticity tunes somatic piRNAs and rewires isoforms, with implications for heritable transposon spread

Mai Moritoh 1, Chikara Takeuchi 2,3, Yurika Namba 1, Hidenori Nishihara 4, Shigeki Hirakata 1, Chie Owa 5, Shinichi Morishita 5, Yuka W Iwasaki 2, Mikiko C Siomi 1,
PMCID: PMC13097621  PMID: 42009568

Springer drives host–TE hybrid splicing in OSCs, whereas Flamenco remodeling reshapes somatic piRNAs and weakens TE silencing, potentially creating conditions permissive for germline entry.

Abstract

Transposons drive genome innovation, yet how they evade somatic piRNA defenses, reach the germline, and rewire host genes with limited cost remain unclear. Using comparative long-read sequencing in cultured Drosophila ovarian somatic cells, we show that the long terminal repeat transposon Springer modulates host gene expression through promoter-proximal intronic insertions at an AT-rich motif. Its 5′ long terminal repeat initiates transcription that splices into downstream host exons, expanding isoform diversity without adding coding sequence. We catalog 72 Springer-driven isoforms, highlighting its broad mutagenic potential. In parallel, we find that the Flamenco (Flam) uni-strand piRNA cluster, the principal source of somatic piRNAs, undergoes structural remodeling in ovarian somatic cells, replacing antisense transposon fragments with forward-oriented copies. This remodeling reshapes piRNA populations and erodes silencing of specific elements, including Springer. This relaxation of somatic repression may create conditions permissive for continued transposon activity, with possible broader implications for genome change. Our findings suggest that Flam plasticity can couple transposon activity to genome rewiring, with potential evolutionary consequences.

Introduction

Transposons (hereafter referred to as TEs) are pervasive genomic parasites that threaten genome integrity across all domains of life (Kazazian, 2004; Feschotte, 2023). Their mobilization disrupts essential genes and, when occurring in the germline, compromise fertility and transmit deleterious mutations (Engels, 1983; Kazazian et al, 1988). Animals counter this threat with the Piwi-interacting RNA (piRNA) pathway, a small RNA system that enforces TE silencing and preserves reproductive fitness (Cox et al, 1998; Siomi et al, 2011; Czech et al, 2018; Ozata et al, 2019).

piRNAs derive from long single-stranded precursors transcribed from discrete genomic loci known as piRNA clusters (Aravin et al, 2003; Brennecke et al, 2007; Chirn et al, 2015). In Drosophila, these clusters fall into uni-strand and dual-strand types (Brennecke et al, 2007; Malone et al, 2009). Both act as genomic archives of fragmented TEs, generating antisense piRNAs that target active elements. Dual-strand clusters employ non-canonical RNA polymerase II (Pol II) and associated cofactors for bidirectional transcription and are restricted to germ cells (Mohn et al, 2014; Zhang et al, 2014; Andersen et al, 2017). Ovarian follicle cells lack this machinery and rely solely on uni-strand clusters, most prominently Flamenco (Flam) (Vaury et al, 1989; Pélisson et al, 1994; Prud’homme et al, 1995; Malone et al, 2009; Zanni et al, 2013). Loss of Flam function de-represses somatic TEs, establishing it as their master suppressor.

Because follicle cells encapsulate germ cells, TEs first encounter these somatic cells, where Flam captures and silences them (Song et al, 1997; Brennecke et al, 2007; Malone et al, 2009; Peccoud et al, 2017; Varoqui et al, 2025). Cross-species long-read sequencing revealed that Flam is variable even among closely related species, reflecting rapid evolutionary diversification (Signor et al, 2023). Some Drosophila species lack a syntenic Flam yet harbor Flam-like loci that repress long terminal repeat (LTR)-type TEs encoding an envelope (Env) protein, paralleling Drosophila melanogaster (van Lopik et al, 2023). These loci are enriched for antisense-oriented LTR-type TE insertion.

Host defenses constrain TE activity, but many TEs persist and can drive regulatory and evolutionary innovation. Realizing such benefits requires germline establishment, yet developmental barriers and robust somatic piRNA defenses sharply restrict access. How, then, do TEs evade piRNA surveillance to colonize the germline? Once established, they can be vertically transmitted across generation (Senti et al, 2026).

Cultured Drosophila ovarian somatic cells (OSCs), composed solely of follicle cells (Saito et al, 2009), provide a tractable system to prove these dynamics. We previously discovered that the OSC gene Lethal (3) malignant brain tumor (L(3)mbt) harbors an intronic insertion of the LTR-type TE Springer, which drives hybrid splicing (TE−host fusion transcript) from the 5′ LTR to exon 4, generating a truncated isoform, L(3)mbt-S (Yamamoto-Matsuda et al, 2022). Loss of L(3)mbt de-represses piRNA amplification factors, such as Vasa and Aubergine (Aub), and causes female infertility (Sumiyoshi et al, 2016; Coux et al, 2018; Yamamoto-Matsuda et al, 2022). Yet OSCs tolerate Springer, offering an unprecedented window into TE−host interplay.

Here, comparative long-read sequencing of cultured OSCs shows that Springer can influence host gene expression via promoter-proximal intronic insertions that initiate transcription and drive host–TE hybrid splicing without adding coding sequence. Beyond L(3)mbt, we identified 71 additional Springer-driven isoforms, underscoring its mutagenic potential. We also find that Flam shows evidence of dynamic remodeling that can reshuffle TE fragments, reshape piRNA output, and potentially erode silencing of specific elements, including Springer. Such remodeling may modulate somatic control of TE activity across ovarian cell types, with possible implications for TE movement between them. TE mobilization in the germline can contribute to heritable variation and, over evolutionary timescales, may facilitate innovation. Together, our findings support a model in which Flam acts not only as a barrier to TE activity but also as a context-dependent modulator of genome rewiring. In this framework, transient TE up-regulation associated with Flam remodeling could, in some settings, expand isoform diversity and potentially yield evolutionary consequences.

Results

Intronic Springer initiates transcription and hybrid splicing without adding coding sequence

To investigate the mechanism underlying OSC-specific L(3)mbt-S isoform (Yamamoto-Matsuda et al, 2022), we generated a haplotype-resolved OSC genome by PacBio HiFi long-read sequencing and integrated the data with Micro-C XL profiles (Takeuchi et al, 2022). Contigs were assembled with HiFiasm and scaffolded with RagTag, yielding two haplotypes: one containing the canonical L(3)mbt gene (FlyBase; FBgn0002441) and the other carrying a Springer insertion in intron 3 (L3-Springer) (Fig 1A). We hereafter refer to the former as Haplotype_1 (Hap_1) and the latter as Haplotype_2 (Hap_2). For convenience, chromosomes other than 3R harboring L(3)mbt were also assigned to Hap_1 or Hap_2. Karyotyping confirmed that OSCs used in this study are diploid (Fig S1A).

Figure 1. L3-Springer introduces a 5′ splice site that drives hybrid splicing to generate the truncated L(3)mbt-S isoform in cultured OSCs.

Figure 1.

(A) Genome assemblies of OSC haplotypes: Hap_1 carries an intact L(3)mbt locus, whereas Hap_2 harbors Springer (L3-Springer) inserted within the locus. (B) Structures of L(3)mbt in the dm6 genome (upper) and Hap_2 in the OSC genome (lower). Boxes represent exons (gray) and Springer (green). The L3-Springer 5′ splice site (GTAAGTG) is highlighted in red. Schematics of L(3)mbt pre-mRNAs and mRNAs are also shown. TSS, transcription start site; TIS, translation initiation site. The TSS was determined experimentally, whereas the TIS was estimated to be either Met311 or Met325 in our previous study (Yamamoto-Matsuda et al, 2022). (C) Plasmid-based recapitulation of Springer-driven hybrid splicing in OSCs. FL-LS-mCh supports splicing and mCherry expression (red). FL-LS, full-length L3-Springer; mCh, mCherry. Nuclei are shown in blue. Scale bar, 5 μm. TSS, transcription start site; PAS, polyadenylation signal. (D) Deletion of the Springer body (Δ-LS-mCh) preserves mCherry expression. (E) Mutation of the 5′ splice site (Δ-LS-mCh-mut) abolishes mCherry expression. (D) The splice site within Δ-LS-mCh (D) was mutated to CATTCAC (blue).

Figure S1. L(3)mbt isoforms in OSCs and conservation across species.

Figure S1.

(A) Representative confocal image of a DAPI-stained chromosome spread prepared from colchicine-treated OSCs. Eight chromosomes (2n = 8) are visible in this cell. Quantification (n = 30) showed that OSCs used in this study are predominantly diploid (n = 28: 93.3%), with a minor fraction likely polyploid (n = 2: 6.7%). Thirty randomly selected ruptured chromosome spreads, countable, were scored for ploidy. Scale bar, 5 μm. (B) Left, Western blot showing the levels of L(3)mbt-L and L(3)mbt-S in control OSCs, cells depleted of both isoforms (siL(3)mbt), and cells selectively depleted of L(3)mbt-L (siL(3)mbt-L). Right, RT-qPCR analysis of the relative expression levels of piRNA amplification factors (Vasa, Aub, and Ago3), showing that L(3)mbt-S retains repressive activity toward these genes. (C) Conservation of L(3)mbt homologs in fly L(3)mbt-L and L(3)mbt-S, zebrafish L3mbtl1b (Gene ID: 559572), mouse L3mbtl3 (Gene ID: 237339), and human L3MBTL3 (Gene ID: 84456).

Alignment of Hap_2 with L(3)mbt-S (Yamamoto-Matsuda et al, 2022) revealed a 7-nt splice donor (GTAAGTG) located 43 nt downstream of the 5′ LTR of L3-Springer (Fig 1B). Transcription initiates within the 5′ LTR, omitting exons 1–3 and including a 297-nt Springer fragment that lacks an AUG start codon. Translation therefore begins at Met311 (or Met325) in exon 4, yielding L(3)mbt-S, an N-terminally truncated isoform relative to the full-length L(3)mbt-L (Yamamoto-Matsuda et al, 2022). Thus, L3-Springer elicits cryptic transcription initiation and hybrid splicing yet contributes no peptide sequence to the product. Hap_1 expresses the canonical transcript (L(3)mbt-L), consistent with the D. melanogaster reference genome (BDGP Release 6 plus ISO1 MT/dm6; GenBank assembly GCA_000001215.4) (Fig 1B).

Functionally, L(3)mbt-S compensates for loss of L(3)mbt-L in OSCs by repressing targets such as Vasa and Aub, which are piRNA amplification factors (Yamamoto-Matsuda et al, 2022), indicating that N-terminal truncation preserves core activity (Fig S1B). Notably, L(3)mbt-S resembles its vertebrate orthologs (Fig S1C), supporting the possibility that TE-driven isoform variation may have evolutionary consequences. Nevertheless, we cannot exclude the possibility that loss of the N terminus reflects a more ancestral state.

Springer (FlyBase; FBte0000333; 7,546 nt), an endogenous errantivirus (Touret et al, 2014), produces Gag/Pol from unspliced RNAs and Env from spliced RNAs (Fig S2A). Long-read RNA-seq from OSCs detected both unspliced and spliced Springer RNAs and confirmed use of the GTAAGTG donor without auxiliary Springer sequences (Fig S2B). Despite an intact polyadenylation signal (PAS) in the 3′ UTR (Fig S2C), polyadenylated L3-Springer RNAs were not detected in OSC RNA-seq. In S2 cells, however, L3-Springer was efficiently polyadenylated (Fig S2D), indicating that the PAS is intrinsically functional but not recognized in the OSC context. L3-Springer harbors a splice acceptor site (GTTACAG) (Fig S2B). Although the core CAG motif is maintained, the upstream 5′ adenine is suboptimal (Mount, 1982), likely reducing the splicing efficiency. This may represent an additional mechanism that permits L(3)mbt-S expression in OSCs.

Figure S2. The Springer element and L3-Springer.

Figure S2.

(A) Schematic of the Springer element, showing the Gag, Pol, and Env ORFs and the structures of its unspliced and spliced transcripts. (B) Splice donor (red) and splice acceptor (blue) sites within Springer. (C) DNA sequence of the L3-Springer segment, extending from the 5′ end of the Env coding region to the 3′ end of L3-Springer. The Env TIS and stop codon are highlighted in red, and the L3-Springer–specific 29-nt deletion is indicated by a red triangle. The corresponding lowercase sequence is shown below. The PAS is shaded gray. (D) The L3-Springer element was introduced into S2 cells and ectopically expressed, and the resulting transcripts were examined for polyadenylation. All cDNA clones analyzed (n = 10) contained poly(A) tails.

A genomic reporter spanning the L3-Springer 5′ end through L(3)mbt exon 4 (FL-LS-mCh) yielded robust mCherry fluorescence and the expected junctions in OSCs (Figs 1C and S3A and B). A minimum construct (Δ-LS-mCh) retaining the GTAAGTG donor also supported splicing and mCherry fluorescence, whereas mutating this donor (Δ-LS-mCh-mut harboring CATTCAC) abrogated fluorescence (Figs 1D and E, and S3A, C, and D), demonstrating that the identified donor is necessary and sufficient for hybrid splicing.

Figure S3. Reporter assays.

Figure S3.

(A) Visualization of RT–PCR products. RNAs were isolated from OSCs transfected with FL-LS-mCh, Δ-LS-mCh, or Δ-LS-mCh-mut (Fig 1C–E), reverse-transcribed, and amplified using the same primer set (Table S6). (B, C, D) Partial sequences of RNAs transcribed from FL-LS-mCh (B), Δ-LS-mCh (C), and Δ-LS-mCh-mut (D). RNAs were reverse-transcribed using an mCherry-specific primer. TIS, Met311; lowercase letters, partial mCherry sequence; Mutation, mutated splice donor site.

Springer-driven hybrid splicing pervasively rewires host isoforms in OSCs

The dm6 genome contains six full-length or near–full-length Springer copies (>7,500 nt; >99.5%) (Fig 2A and Table S1). In OSCs, we identified 219 insertions in Hap_1 and 209 in Hap_2; none matched the six dm6 loci (Fig 2B and C). Of these, 150 (Hap_1) and 147 (Hap_2) insertions were genic; 129 (Hap_1) and 126 (Hap_2) were intronic, with 64 and 57 oriented forward relative to host transcription (Springer-intF; Fig 2C and Table S2). Analysis of all 428 Springer insertions (219 in Hap_1 and 209 in Hap_2) showed a marked enrichment in intronic regions (Fig 2D), consistent with stronger negative selection against exonic disruption.

Figure 2. Intronic Springer elements broadly promote hybrid splicing and create OSC-specific isoforms.

Figure 2.

(A) Genomic distribution of Springer elements ≥7.5 kb (blue dots) in the dm6 reference genome. The L(3)mbt locus is highlighted in orange. (B) Genomic distribution of Springer elements ≥7.5 kb (blue dots) in OSC Hap_1 and Hap_2. (C) Classification of Springer insertions in the OSC genome. (D) Proportions of exonic (blue), intronic (orange), and intergenic (gray) regions in the OSC genome, and the distribution of Springer loci across exonic (blue), intronic (orange), and intergenic (gray) regions. Springer insertions are enriched in introns. (E) Map of genes harboring Springer-intF that induce hybrid splicing. *, genes with Springer in both haplotypes, genes with two Springer insertions. (F) Examples of host genes (Rcc1, Sap-r, Cf2) generating Springer-induced isoforms. Boxes denote exons (gray) and Springer (green). TSS, transcription start site. (G) Marf1 carries an intronic Springer but shows no fusion transcripts (X, orange). The orange line (top) indicates the region between the Springer insertion and exon 4. (H) Distances between Springer and downstream exons distinguish splicing-competent (Hybrid splicing) versus non-competent insertions (no hybrid splicing). Each dot represents a single Springer insertion and that the plot shows the log10-transformed distance (bp) to the end of the nearest downstream exon, grouped according to the presence or absence of detected hybrid splicing. Red bar, means. Mann–Whitney U test, P = 5.022 × 10−5.

Table S2. Springer-intF elements. (44KB, xls)

Long-read RNA-seq revealed hybrid splicing for 37 of 64 (Hap_1) and 35 of 57 (Hap_2) Springer-intF sites, including L3-Springer (Figs 2C, E, and F and S4A; Table S3). Six genes carried Springer-intF at identical positions in both haplotypes (Fig 2E). In contrast, dm6 carries no Springer-IntF (Fig S4B) and lacks short Springer segments akin to Δ-LS (Fig 1D), suggesting that Springer-dependent isoforms are specific to the OSC genome.

Figure S4. Springer-intF insertions within host genes.

Figure S4.

(A) Six additional examples of Springer-intF insertions that drive host–TE hybrid splicing. (B) Genome-wide classification of Springer insertions in the dm6 genome. (C) Two additional examples of Springer-intF insertions that do not promote hybrid splicing.

Roughly 40% of Springer-intF insertions (27 [Hap_1] and 22 [Hap_2]/121) showed no hybrid splicing (Figs 2C and G, and S4C; Table S4). These sites lay farther from the nearest downstream exon: median 9,572 nt without splicing versus 2,661 nt with splicing (Δ = 6,911 nt; Fig 2H). We propose competition between splicing and premature polyadenylation: long downstream introns favor PAS usage within Springer before the acceptor is reached, yielding Springer mRNAs rather than hybrid isoforms. Additional parameters, such as Pol II elongation kinetics and splice acceptor strength, may also influence this balance.

OSCs produce abundant but sense-biased Springer piRNAs

Although ovarian Springer-piRNAs effectively silence Springer (Sienski et al, 2012; Muerdter et al, 2013), OSCs harbor numerous Springer insertions. Reanalysis of OSC piRNAs (Murano et al, 2019) showed that Springer piRNAs are abundant (Fig 3A), yet Piwi depletion failed to de-repress Springer (Fig S5A). Interestingly, Springer piRNAs in OSCs are strongly sense-biased compared with those produced in ovaries of the Drosophila ISO1 strain (Shpiz et al, 2014) (Fig 3B and C), consistent with limited silencing efficacy. Copia, gypsy1, Transpac, and Xanthias showed similar sense biases and, like Springer, were not markedly de-repressed in Piwi-deficient OSCs (Figs 3B and C, and S5A), indicating that residual antisense piRNAs play little role in their silencing. This is consistent with a previous report showing that antisense piRNAs in limited amounts are insufficient to trigger silencing (Post et al, 2014; Genzor et al, 2021; Ariura et al, 2024).

Figure 3. OSCs are enriched in sense-oriented Springer piRNAs and allow accumulation of Springer elements in active, promoter-proximal introns.

Figure 3.

(A) Counts of Piwi-bound piRNAs obtained by immunoprecipitation (Murano et al, 2019) mapped to LTR TEs (top 30) in OSCs. Green, Springer. (A, B) Strand orientation of the top 30 LTR TE-mapped piRNAs shown in (A). Orange, sense; blue, antisense. (B, C) Comparison of the population of sense piRNAs (%) between ovaries and OSCs for the TE-mapped piRNAs in (B). In OSCs, piRNAs from Springer, copia, Transpac, gypsy1, and Xanthias are sense-biased (orange), whereas those from other TEs (gray), including HMS-Beagle (blue), are not. (D) Enrichment of genomic insertions for selected LTR TEs in OSCs relative to the dm6 reference genome. For each TE family, enrichment was calculated in copy number in the OSC assembly relative to dm6. Springer showed the greatest enrichment in OSCs, followed by copia, Xanthias, and Transpac. The dm6 reference genome served as the baseline. (E) Sequence motif enriched at Springer insertion sites. The bar graph shows insertion frequency across the motif. (E, F) Genomic distribution of the motif in (E) across exonic (blue), intronic (orange), and intergenic (gray) regions of the OSC genome, alongside the distribution of Springer insertions across these regions. Springer insertions are biased toward introns. (G) Springer insertions preferentially occur near the 5′ ends of host genes. (H) Springer insertions preferentially occur within highly expressed genes. (I) Reanalysis of ATAC-seq data (Iwasaki et al, 2016) indicates that Springer insertions are preferentially located in open chromatin. Random, control.

Figure S5. Copia, Xanthias, Transpac, and gypsy1 insertions in the OSC genome.

Figure S5.

(A) RT-qPCR analysis showing the expression levels of various TEs in naïve and Piwi-depleted OSCs. Expression levels for each TE were normalized to the naïve control sample (set to 1). Western blot analysis confirms the efficiency of Piwi knockdown (siPiwi). (B) Genomic classification of copia insertions. (C) Copia-intF insertions that promote hybrid splicing in the OSC genome. (D, E, F) Genomic classification of Xanthias, Transpac, and gypsy1 insertions.

Copia rarely drives hybrid splicing and often perturbs coding frames

Springer showed the strongest copy number enrichment in the OSC genome, increasing by ∼36-fold relative to the dm6 reference (Fig 3D). Copia increased by ∼3-fold, whereas Xanthias and Transpac each increased by ∼2.5-fold. In contrast, gypsy1 showed no increase (fold change < 1), for reasons that remain unclear (Fig 3B).

We identified 119 (Hap_1) and 129 (Hap_2) copia insertions, with 35 and 45 intronic forward-oriented sites (copia-intF; Fig S5B). Only 1/35 (Hap_1) and 4/45 (Hap_2) yielded detectable hybrid splicing (Fig S5C), far below Springer-intF frequencies. We attribute this to weaker internal donor strength (GTATGTT), competition with premature polyadenylation, and potential negative selection. Indeed, copia-derived exonic segments are long (1,484 nt; FlyBase; FBte0000023) and contain multiple AUGs, predisposing to frameshifts or fusion peptides that may compromise cell viability. Xanthias-intF and Transpac-intF showed no hybrid splicing (Fig S5D and E), consistent with the absence of intrinsic splicing in these elements. Genic gypsy1 insertions were not detected (Fig S5F). Together, Springer uniquely and efficiently drives coding sequence–free hybrid splicing.

Springer targets open, promoter-proximal introns of active genes

We investigated the genomic contexts of Springer insertions in OSCs (219 Hap_1; 209 in Hap_2). These insertions were preferentially located within a distinctive AT-rich repeat array and displayed a central positional bias within this array (Fig 3E). Genome-wide, insertions are enriched in introns relative to exons and intergenic regions (Fig 3F). Notably, although exonic regions comprise 23.8% of the OSC genome (Fig 2D), only 5.9% of Springer insertions fall within exons, substantially below the level expected from motif frequency. Nonetheless, these data do not provide definitive evidence for a novel Springer integration bias; rather, they are more consistent with insertions acquired during OSC establishment and/or subsequent long-term culture.

Insertions were enriched in first/second introns and in highly expressed genes (Fig 3G and H). ATAC-seq meta-profiles centered on insertion sites revealed a sharp accessibility peak immediately upstream of integrations, absent from matched random controls (Fig 3I), indicating a preference for open chromatin near 5′ gene regions. We hypothesize that transcription initiation and local Pol II density promote Springer integration, consistent with known roles of host factors in TE targeting.

Extensive Flam remodeling reshapes piRNA populations and silencing capacity

We compared Flam for Springer, copia, Xanthias, Transpac, and gypsy1 fragments between dm6 and OSC haplotypes, focusing on inserts >300 nt because of repetitiveness. In dm6, two reverse-oriented Springer fragments (dm6-Sp#1 and #2) produce Springer-targeting piRNAs (Figs 4A and S6A). In OSCs, these are absent; instead, multiple near–full-length, forward-oriented Springer fragments populate Flam in both haplotypes (Sp#3, Sp#4, Sp#5 in Hap_1 and Sp#4 in Hap_2; Fig 4B and C), generating predominantly sense Springer piRNAs (Fig S6A) and explaining ineffective Springer silencing.

Figure 4. The Flam locus exhibits extensive remodeling in cultured OSCs.

Figure 4.

(A) Organization of the Flam locus in the dm6 reference genome, focusing on Springer, gypsy1, Transpac, copia, and Xanthias fragments (>300 nt). The Flam locus was defined as the genomic region between the flanking genes CG32820 (upstream) and CG14621 (downstream) (see the Materials and Methods section for details). The 0.0 position corresponds to the 5′ boundary of the Flam locus. In the dm6 genome, the Flam locus harbors two Springer fragments and five gypsy1 fragments, but no Transpac, copia, and Xanthias fragments. Notably, the two Springer fragments, Sp#1 and Sp#2 in the dm6 genome (pale green), and four gypsy1 fragments, gy1#1–4 (pale purple), are oriented opposite (reverse) to the direction of Flam transcription. Springer-targeting piRNAs map to Sp#1 and Sp#2 (Fig S6A), indicating that these regions serve as sources of Springer-targeting piRNAs. The orange line represents the region absent from Hap_1 and Hap_2. (B) Organization of the Flam locus in Hap_1. Springer fragments Sp#1 and Sp#3–5 (green), but not Sp#2 (pale green), are oriented in the same (forward) direction as Flam transcription. Transpac (Tr#1, blue) and Xanthias (Xa#1, yellow green) fragments are also oriented in the same (forward) direction as Flam transcription. (C) The blue line indicates the region absent from the dm6 genome; this region is identical to that in Hap_2 (see in (C)). (C) Organization of the Flam locus in Hap_2. copia (co#1, yellow) fragment is oriented in the same (forward) direction as Flam transcription. (D) Structural rearrangements in the Flam region (pink bar) and its 50-kb flanking sequences (black lines) across the dm6 genome and the OSC Hap_1 and Hap_2 assemblies. The 0.0 position corresponds to the 5′ boundary of the Flam locus, defined as the genomic region between CG32820 and CG14621.

Figure S6. piRNA mapping onto Springer and gypsy1 elements.

Figure S6.

(A) Mapping of ovarian and OSC Springer piRNAs to the Springer consensus sequence. Left, ovary; right, OSC. Upper, piRNAs mapped to the sense strand (above the horizontal line, orange) and antisense strand (below the horizontal line, blue). The y-axis indicates counts per million. An enlarged view of the piRNA mapping (part) is also shown. Lower, positions of Springer fragments within the Flam locus. (B) Mapping of ovarian and OSC gypsy1 piRNAs to the gypsy1 consensus sequence. Left, ovary; right, OSC. Upper, piRNAs mapped to the sense strand (above the horizontal line, orange) and antisense strand (below the horizontal line, blue). The y-axis indicates counts per million. Lower, positions of gypsy1 fragments within the Flam locus.

Gypsy1 fragments also revealed remodeling: multiple reverse-oriented copies in dm6 correspond to a reconfigured set across Hap_1/Hap_2, with evidence of an ∼300-kb gap relative to dm6 (Figs 4A–C and S6B). Copia, Xanthias, and Transpac fragments are undetectable in dm6 Flam (down to 20 nt searches) but present in OSC Flam (Tr#1 and Xa#1 in Hap_1 and co#1, Tr#1, and Xa#1/#2 in Hap_2), which produces corresponding piRNAs (Figs 3B and C, and S7A–C). Ovarian piRNAs from these TEs map mainly to dual-strand clusters (Table S5). Collectively, Flam undergoes pervasive remodeling that rewires piRNA populations and impacts TE activity in OSCs.

Figure S7. piRNA mapping onto Xanthias, Transpac, and copia elements.

Figure S7.

(A, B, C) Mapping of ovarian and OSC Xanthias-piRNAs (A), Transpac-piRNAs (B), and copia-piRNAs (C) to their consensus sequences. Left, ovary; right, OSC. Upper, piRNAs mapped to the sense strand (above the horizontal line, orange) and antisense strand (below the horizontal line, blue). The y-axis indicates counts per million. Lower, positions of Xanthias, Transpac, and copia fragments within the Flam locus.

Flam is a hotspot of genome rearrangement

Synteny mapping of dm6 against Hap_1 and Hap_2 revealed intact continuity at the boundaries of Flam but fragmented synteny across most of the locus (Fig 4D). Non-syntenic sequence comprised 52.9% (versus Hap_1) and 38.9% (versus Hap_2) of dm6 Flam, including a minor translocation/inversion event. Although Hap_1 and Hap_2 are broadly similar, Hap_2 uniquely harbors an ∼330-kb 5′ segment densely populated with TE fragments (Figs 4D and S8A). Dot plots further showed that whereas Flam in dm6 and OSC (Hap_1 or Hap_2) is rich in structural variation, the remainder of the X chromosome is comparatively quiescent (Fig 5A–F): 93.5% and 92.6% of the dm6 non-Flam X retain synteny with Hap_1 and Hap_2, respectively.

Figure S8. Architecture and comparative genomics of major piRNA clusters.

Figure S8.

(A) Structural rearrangements in the Flam region (pink bar) and its 50-kb flanking sequences (orange lines) across the OSC Hap_1 and Hap_2 assemblies. (B) Summary statistics and annotations for piRNA clusters 80F, 38C, 42AB, 20A, and Flam. (C, D, E, F) Pairwise comparisons between dm6 and Hap_2 for 80F (C), 38C (D), 42AB (E), and 20A (F).

Figure 5. The Flam locus is a hotspot of genome rearrangement.

Figure 5.

(A) Dot plot showing conservation of the Flam locus between dm6 and Hap_1, revealing large-scale rearrangements. (B) Conservation of the Flam locus between dm6 and Hap_2. (C) Conservation of the Flam locus between Hap_1 and Hap_2. (D) Dot plot showing conservation of the X chromosome between dm6 and Hap_1, with limited rearrangements relative to Flam (small open box), underscoring Flam’s exceptional plasticity. (A) Enlarged view of the boxed region is shown in (A). (E) Conservation of the X chromosome between dm6 and Hap_2. (F) Conservation of the X chromosome between Hap_1 and Hap_2. (G) Conservation of dual-strand cluster 80F between dm6 and Hap_1. (H) Conservation of 80F between Hap_1 and Hap_2. (I) Conservation of dual-strand cluster 38C between dm6 and Hap_1. (J) Conservation of 38C between Hap_1 and Hap_2. (K) Conservation of dual-strand cluster 42AB between dm6 and Hap_1. Although patches of reduced conservation are apparent in the central portion, a continuous diagonal is detected across the entire plot, indicating a high degree of overall conservation. (L) Conservation of 42AB between Hap_1 and Hap_2. (M) Conservation of uni-strand cluster 20A between dm6 and Hap_1. (N) Conservation of 20A between Hap_1 and Hap_2.

Among major dual-strand clusters (80F on 3L [∼44 kb, dm6], 38C on 2L [∼127 kb], 42AB on 2R [∼253 kb]) (Brennecke et al, 2007; Mohn et al, 2014) (Fig S8B), 80F and 38C are largely invariant, whereas 42AB shows modest variation (Figs 5G–L and S8B–E). The tiny uni-strand cluster 20A (∼44 kb, dm6), proximal to Flam, seems more variant than 80F, 38C, and 42AB (Figs 5M and N, and S8B and F). However, notably, 80F, 38C, 42AB, and 20A retain complete synteny between Hap_1 and Hap_2, whereas Flam exhibits marked haplotype disparity (Fig S8A and B), establishing Flam as a hotspot of genome rearrangement in OSCs.

Low frequency of detectable structural change at Flam over a decade

To assess how frequently the Flam locus undergoes structural changes, we compared our OSC Flam assemblies with a previously published OSC short-read genomic dataset (Sienski et al, 2012). This comparison revealed broadly uniform read coverage across Flam in both haplotypes (Fig 6A). We next compared Springer copy number in our OSC Flam assemblies with two earlier OSC datasets (Sienski et al, 2012; Sytnikova et al, 2014). Relative to Sienski et al (2012), we detected only two additional Springer insertions over approximately the past decade, and comparison with Sytnikova et al identified only three additional insertions (Fig 6B). Together, these results suggest that cultured OSCs largely maintain their genomic and biological properties over this period despite low levels of Springer-targeting piRNAs and high Springer expression. Importantly, this stability does not undermine our central model—that Flam plasticity can be linked to TE activation, genomic rewiring, and potentially evolutionary consequences—because the effects of Flam plasticity are not limited to Springer.

Figure 6. Flam remains structurally robust for at least a decade after OSC establishment.

Figure 6.

(A) Comparison of the Flam locus in the OSC Hap_1 and Hap_2 assemblies (this study) with a previously published OSC genomic short-read dataset (Sienski et al, 2012). Reads were mapped to the Hap_1 and Hap_2 assemblies. The y-axis shows normalized genomic sequencing read coverage in OSCs across the Flam locus, indicating that this region has remained structural uniform over at least a decade. The x-axis represents the Flam locus (Mb). (B) Comparison of Springer insertions in the OSC Hap_1 and Hap_2 assemblies with earlier OSC genome analyses. “+2” indicates that two additional Springer insertions were detected in OSCs (this study) relative to Sienski et al (2012), and “+3” indicates that three additional insertions were detected relative to Sytnikova et al (2014). (C) A proposed model for how Flam plasticity shapes host–TE interactions. Left, Flam incorporates fragments of TEs that enter ovarian follicle (somatic) cells and generates piRNAs that repress cognate elements. Flam structural plasticity may also reshape piRNA populations and weaken silencing of specific TEs, potentially making germline entry and vertical transmission more permissive. Right, intronic Springer-driven hybrid splicing may rewire host isoforms, with possible heritable effects and evolutionary consequences.

Discussion

Long-read, haplotype-resolved genome and RNA transcriptome analyses of cultured Drosophila OSCs uncovered two features that connect TE activity to host gene regulation. First, forward-oriented intronic insertions of the LTR element Springer can use the 5′ LTR as a promoter to initiate transcription that splices into downstream host exons, thereby expanding host transcriptional output. Second, the somatic piRNA source locus Flam appears unusually susceptible to local rearrangements that can permit the reactivation of embedded TE segments. Together, these properties outline a route by which piRNA-based somatic defenses can be retuned, reshaping TE repression and, in turn, rewiring host transcriptional programs, including the emergence of new isoforms. In vivo, ovarian follicle cells are heterogeneous and persist for only a single generation during oogenesis, limiting opportunities to observe the full sequence of events from TE activation to defense retuning and transcriptome remodeling, particularly when genomic changes accrue slowly. By contrast, cultured OSCs are comparatively uniform and can be passaged for extended periods, a tractability that likely enabled us to capture these links with unusual clarity.

Importantly, somatic relaxation of TE repression driven by Flam plasticity could also occur in the ovary. Because de-repressed TEs in follicle cells could, in principle, gain access to neighboring germ cells, such somatic relaxation may create conditions permissive for germline entry and, ultimately, heritable change (Fig 6C). We emphasize, however, that this is a mechanistic inference based on insertion directionality and piRNA profiling in cultured OSCs. Direct measurements of germline integration rates and multigenerational transmission will be essential tests for future work, and the frequency and detectability of such events in vivo remain open questions.

Intronic TE–host hybrid transcripts arising from de novo TE insertions have been described, but prior studies have largely described isolated examples, most often detected using short-read data (Sela et al, 2007; Faulkner et al, 2009; Flemr et al, 2013; Modzelewski et al, 2021; Azad et al, 2024). Here, we systematically analyzed genome-wide de novo TE insertions that emerge in the OSC genome in association with shifts in piRNA directionality, focusing on intronic, forward-oriented events, and defined their consequences in this cell type. This approach uncovers an unrecognized mechanism that positions Springer as a genome-wide driver of host transcriptome remodeling and, in some cases, proteome remodeling. Consistent with this possibility, insertion of a Springer mini-cassette (Δ-LS) derived from Δ-LS-mCh (Fig 1D) into Piwi intron 2 disrupted the nuclear localization signal encoded across exons 1 and 2, converting Piwi into a predominantly cytoplasmic protein (Fig S9). Over longer timescales, such TE-driven alterations could plausibly contribute to evolutionary change.

Figure S9. The Springer mini-cassette converts Piwi to be a cytoplasmic protein.

Figure S9.

Upper left, the Piwi gene isolated from the OSC genome was fused to a DNA fragment encoding a FLAG tag to generate the plasmid, Piwi, which was introduced into OSCs to express Piwi-FLAG. Lower left, the Piwi+Δ-LS construct was generated by inserting the Δ-LS fragment (Fig 1D) into intron 2 of the Piwi plasmid and then introduced into OSCs. Right, immunofluorescence detection of Piwi-FLAG (green) in OSCs. Nuclei are shown in blue. Scale bar, 5 μm.

Cultured OSCs can maintain, for extended periods, a sense-biased piRNA pool against Springer and other TEs, including copia and Transpac, a population unlikely to support robust silencing. We speculate that these “non-beneficial” piRNAs largely represent by-products of piRNA biogenesis and, because they impose little fitness cost under culture conditions, are subject to only weak negative selection. Notably, Springer itself may also carry relatively modest fitness costs: Springer-intF splices into host exons without adding coding sequence, expanding isoform diversity while largely decoupling transcriptome diversification from proteome expansion. This limited proteomic burden may help explain why cultured OSCs can tolerate high Springer copy numbers and sustain long-term coexistence.

Key open questions include the molecular basis of Flam remodeling, the host factors that shape Springer’s insertion, and the extent to which similar coding sequence–free hybrid splicing operates across TE families, strains, and species. Resolving these issues will clarify how genomes balance defense with innovation, not by simply turning TE activity off, but by shaping when, where, and how it is allowed to matter.

Materials and Methods

Cell culture

OSCs (Saito et al, 2009) (DGRC Stock 288; https://dgrc.bio.indiana.edu//stock/288; RRID:CVCL_IY73) were cultured at 26°C in Shields and Sang M3 Insect Medium with L-Glutamine (US Biological) supplemented with 10% fly extract (Saito et al, 2009), 10% FBS (NICHIREI), 10 mU/ml insulin, and 0.6 mg/ml glutathione. S2 cells were cultured at 26°C in Schneider’s Drosophila Medium (Gibco) with 10% FBS.

Genome sequencing

For HiFi sequencing, genomic DNA was extracted from OSC using a Genomic-tip 20/G (QIAGEN) and a Genomic DNA Buffer Set (QIAGEN). To assess the quality and integrity of the genomic DNA samples, we ran them for sequencing on Pippin Pulse pulsed-field electrophoresis (Sage Science). We used 0.75% agarose gel (SeaKem Gold; Lonza) on a pre-set default program of 5–80 kb (75 V, 16 h run time) with ∼500 ng DNA/lane. According to the PN101-853-100 protocol (PacBio), the genomic DNA was sheared using a Megaruptor 2 (Diagenode), and the fragments were collected using 0.45× AMPure PB beads (PacBio). The size of the DNA fragments was checked using a TapeStation System (Agilent Technologies). SMRTbell libraries were prepared using an SMRTbell Express Template Prep Kit 2.0 (PacBio) and a BluePippin DNA Size Selection System (Sage Science). The selected libraries were bound with Sequencing Primer v2 and Sequel II Polymerase 2.0 using a Sequel II Binding Kit 2.0 (PacBio). Sequencing was performed on a Sequel II instrument (PacBio) in CCS mode using an SMRT Cell 8 M and a Sequencing Plate 2.0. This process included 8 h of pre-extension and 30 h of movie viewing.

Assembly of the OSC genome

A haplotype-resolved genome of the OSC cell line was assembled using the HiFiasm software (Cheng et al, 2021). This assembly was performed using PacBio long-read sequencing data generated in this study, in conjunction with publicly available Micro-C XL data from OSC cells (Takeuchi et al, 2022). The resulting contigs were subsequently aligned and scaffolded against dm6 reference genome with the Ragtag toolkit (Alonge et al, 2022).

Chromosome preparation and DAPI staining

OSCs were treated with 50 μg/ml colchicine to inhibit microtubule polymerization. Cells were incubated at 26°C for 1 h, washed once with PBS (without Ca2+ and Mg2+), and collected into 1.5-ml microcentrifuge tubes. After removal of the supernatant, the cells were resuspended in 500 μl of 75 mM KCl prepared in distilled water and incubated at 26°C for 30 min to induce hypotonic swelling. Carnoy’s fixative (methanol:acetic acid = 3:1) was added dropwise (500 μl) to the cell suspension, followed by centrifugation. Cells were washed three times with Carnoy’s fixative and then incubated in 500 μl of fresh fixative at 4°C for 1 h. Fixed cells were mounted with VECTASHIELD Mounting Medium with DAPI (Vector Laboratories). Chromosome spreads were imaged using a confocal laser scanning microscope (LSM 980; Carl Zeiss) equipped with a Plan-Apochromat 63×/1.4 Oil DIC M27 objective lens. DAPI was excited with a 405-nm laser.

OSC gene annotation

To annotate the OSC genome, gene annotations were transferred from the dm6 reference genome. First, we generated chain files to map coordinates between the dm6 reference and each OSC haplotype (Hap_1 and Hap_2) using the flo (https://github.com/wurmlab/flo) (Pracana et al, 2017). Using these chain files, we then transferred the dm6 gene annotations (UCSC) to our Hap_1 and Hap_2 assemblies with the liftOver utility from the UCSC toolkit (Hinrichs et al, 2006).

RNAi and transfection

For RNAi, OSCs were first electroporated with 200 pmol of siRNA in 20 μl of Solution SF (Cell Line Nucleofector Kit SF; Lonza) using a Nucleofector 96-well Shuttle device (Lonza). After electroporation, the cells were left at RT for 10 min before plating into culture dishes, and then incubated at 26°C for 48 h. The cells were subsequently subjected to a second electroporation with 600 pmol of siRNA dissolved in 100 μl of transfection buffer (180 mM sodium phosphate, pH 7.2, 5 mM KCl, 15 mM MgCl2, and 50 mM D-mannitol) using the Nucleofector 2b system (Lonza). After a 10-min rest at RT, the cells were plated again and incubated at 26°C for an additional 48 h before collection. siRNA sequences are listed in Table S6. For plasmid transfection, 1.0 × 107 OSCs or S2 cells were mixed with 6–15 μg of plasmid DNA in 100 μl of the same transfection buffer modified from a previous report (Nye et al, 2014) and electroporated with a Nucleofector 2b device (Lonza). Cells were maintained at 26°C and harvested 48 h after transfection.

Table S6. Primer and siRNA sequences. (32.5KB, xls)

Nuclear fractionation

Nuclear fractionation was carried out following a modified version of a previously described protocol (Onishi et al, 2020). Briefly, the cells were suspended in hypotonic buffer (10 mM HEPES-KOH [pH 7.3], 10 mM KCl, 1.5 mM MgCl2, 0.5 mM DTT, 2 μg/ml leupeptin, 2 μg/ml pepstatin A, and 0.5% aprotinin), gently mixed by pipetting, and disrupted by repeated passage through a 25-gauge needle. Lysates were centrifuged at 400g for 10 min, and the pellet was collected as the crude nuclear fraction, while the supernatant was retained as the cytoplasmic fraction. The cytoplasmic fraction was adjusted to 200 mM KCl and clarified by centrifugation at 20,000g for 20 min. The nuclear pellet was washed with hypotonic buffer, resuspended in chromatin co-immunoprecipitation buffer (50 mM HEPES-KOH [pH 7.3], 200 mM KCl, 1 mM EDTA, 1% Triton X-100, 0.1% sodium deoxycholate, 2 μg/ml leupeptin, 2 μg/ml pepstatin A, and 0.5% aprotinin), then sonicated and centrifuged at 20,000g for 20 min. The resulting supernatant was used as the nuclear extract for Western blotting.

Western blotting

Proteins from nuclear extracts were resolved by SDS–polyacrylamide gel electrophoresis and transferred onto Immobilon-P membranes (Millipore). Membranes were blocked in PBS containing 0.1% Tween-20 (T-PBS) supplemented with 5% skim milk and then incubated with anti-L(3)mbt antibody (1:1,000 dilution) (Yamamoto-Matsuda et al, 2022). After washing with T-PBS, membranes were incubated with horseradish peroxidase–conjugated anti-mouse IgG secondary antibody (1:10,000 dilution; cat. no. 55558; Cappel). Protein bands were visualized using Clarity Western ECL Substrate (Bio-Rad), and chemiluminescence signals were captured with a ChemiDoc XRS Plus imaging system (Bio-Rad).

RT-qPCR

Total RNA was extracted from OSCs using ISOGEN II (NIPPON GENE) and treated with TURBO DNase at a final concentration of 0.04 U/μl (Thermo Fisher Scientific) to remove residual genomic DNA. Reverse transcription was performed with ReverTra Ace (Toyobo) according to the manufacturer’s protocol. Quantitative PCR (qPCR) was carried out on a StepOnePlus Real-Time PCR System (Thermo Fisher Scientific) using THUNDERBIRD Next SYBR qPCR Mix (Toyobo). Relative transcript abundance was determined using the ΔΔCt method, with rp49 as the internal reference gene. Primer sequences are listed in Table S6.

Iso-seq and bioinformatic analysis

Total RNA was extracted from cultured cells using ISOGEN II (NIPPON GENE), followed by DNase treatment, phenol/chloroform purification, and ethanol precipitation. Full-length cDNA libraries were prepared with the SMRTbell Prep Kit 3.0 (Pacific Biosciences) according to the manufacturer’s protocol. Sequencing was performed on a PacBio Sequel II platform in HiFi read mode (1-cell run), generating 4,504,293 HiFi reads comprising 9.73 Gb of sequence data, with a read N50 of 2,234 bp, an average read length of 2,160 bp, and an average read quality of Q45. HiFi reads were processed using the Iso-Seq3 pipeline (v.4.0.0) (https://github.com/ylipacbio/IsoSeq3). Demultiplexing and primer removal were conducted with lima in Iso-Seq mode, specifying custom 5′ and 3′ primer sequences (5′-GCA​ATG​AAG​TCG​CAG​GGT​TGG​G-3′ and 3′-GTA​CTC​TGC​GTT​GAT​ACC​ACT​GCT​T-5′). Full-length non-concatemer (FLNC) reads were refined with the --require-polya option to confirm poly(A) tails, and high-quality consensus isoforms were obtained by clustering with quality value scoring enabled. The curated isoforms were aligned independently to OSC genome Hap_1 and Hap_2 using minimap2 (v.2.24-r1122) (Li, 2018) in splice-aware mode with quality value support, while suppressing secondary alignments. Alignment files were processed with SAMtools (v.1.16.1) (Danecek et al, 2021) to convert, sort, and index BAM files for downstream analysis.

Plasmid construction

For the L3-Springer plasmid (Fig S2D), the L3-Springer sequence from the OSC Hap_2 genome was PCR-amplified from OSC genomic DNA and inserted into a modified pAc-based vector lacking the promoter region using Mighty Cloning (TaKaRa Bio). For FL-LS-mCh (Fig 1C), a genomic fragment spanning the L(3)mbt locus from the L(3)-Springer insertion site in intron 3 to the putative start codon of exon 4 (OSC Hap_2 genome) was PCR-amplified from OSC genomic DNA. This fragment, together with an mCherry sequence PCR-amplified from pIB_DDX4N_mCherry_CRY2PHR, was assembled into a modified pAc-based expression vector lacking the promoter using NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs). The Δ-LS-mCh (Fig 1D) and Δ-LS-mCh-mut (Fig 1E) constructs were generated by inverse PCR. For the Piwi plasmid (Fig S9), the Piwi genomic region (including introns) was PCR-amplified from OSC genomic DNA and assembled into the same modified vector with a C-terminal FLAG tag using NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs). To generate the Piwi+Δ-LS construct (Fig S9), the Δ-LS fragment was amplified from the Δ-LS-mCh template and incorporated into the Piwi plasmid using NEBuilder HiFi DNA Assembly Master Mix. Primer sequences used in these plasmid constructions are listed in Table S6.

3′ RACE

S2 cells were transfected with the L3-Springer plasmid by electroporation using the Nucleofector 2b device (Lonza) with program N-20 and cultured at 26°C for 2 d. Total RNA was extracted with ISOGEN II (NIPPON GENE), followed by DNase treatment and phenol/chloroform extraction with ethanol precipitation. 3′ RACE-ready cDNA was synthesized using the SMARTer RACE 5′/3′ Kit (TaKaRa Bio) according to the manufacturer’s instructions. The resulting 20 μl cDNA was diluted with 90 μl Tricine-EDTA buffer. PCR amplification was carried out with SeqAmp DNA Polymerase (TaKaRa Bio) using a gene-specific primer containing a 15-bp overlap sequence for cloning (5′-gat​tac​gcc​aag​ctt​GCT​CCA​TCT​TCA​GTG​AGC​TCG​TGT​GCG​C-3′) under touchdown PCR conditions (Program 1). PCR products were resolved on a 1% TBE agarose gel, and the target band was excised and purified with the FastGene Gel/PCR Extraction Kit (Nippon Genetics). Purified fragments were assembled into the pRACE vector using NEBuilder HiFi DNA Assembly Master Mix (New England Biolabs) and transformed into NEB 5-alpha Competent E. coli (New England Biolabs). Colony PCR was performed with SapphireAmp Fast PCR Master Mix (TaKaRa Bio) using the gene-specific primer and M13 forward primer. PCR products were treated with Exonuclease I (New England Biolabs) and Shrimp Alkaline Phosphatase (New England Biolabs) and sequenced using the BigDye Terminator v3.1 Cycle Sequencing Kit (Thermo Fisher Scientific).

RT–PCR

Total RNA was extracted from OSCs using ISOGEN II (NIPPON GENE) and treated with TURBO DNase at a final concentration of 0.04 U/μl (Thermo Fisher Scientific) to eliminate residual genomic DNA. Reverse transcription was performed with the Transcriptor First Strand cDNA Synthesis Kit (Roche) using oligo(dT) primers. The resulting cDNA was amplified with Q5 High-Fidelity DNA Polymerase (New England Biolabs). PCR products were resolved on a 1% TBE agarose gel, stained with SYBR Gold (Thermo Fisher Scientific) for 10 min, and visualized with a transilluminator (Fig S3A). Primer sequences are listed in Table S6.

Live-cell imaging

OSCs were cultured on glass-bottom dishes (MATSUNAMI). Hoechst 33342 (Lonza) was added to the medium at a final concentration of 1 μg/ml, and cells were incubated at 26°C for 30 min. Live-cell imaging was performed as previously described (Yamazaki et al, 2023), using a confocal laser scanning microscope (LSM 980; Carl Zeiss) equipped with a Plan-Apochromat 63×/1.4 Oil DIC M27 objective lens at RT. Hoechst 33342 and mCherry were excited with 405 and 561 nm lasers, respectively, and images were acquired under identical acquisition settings across all samples within each experiment.

Springer insertion analysis

The dm6 and OSC genomes were analyzed with RepeatMasker (v.4.1.5) (Smit et al, 2015) to annotate TEs. From the resulting GFF files, entries corresponding to the LTR TE Springer were extracted. Insertions ≥7,500 nt in length (≥99.5% of the reference consensus length) were classified as full-length Springer elements. Genomic insertion sites of these elements were visualized using Phenogram (https://visualization.ritchielab.org/phenograms/plot; Fig 2A–C) (Wolfe et al, 2013). Phenogram was also used to visualize selected insertions in Fig 2E.

Springer insertion site analysis

We categorized novel Springer insertions based on their location relative to the lifted-over gene annotations. Insertions were classified as “genic” if they fell within an annotated exon or intron, and “intergenic” otherwise. Genic insertions were further subdivided into “exonic” or “intronic.” To analyze the distance to the nearest splice site, we focused on intronic insertions located on the sense strand relative to the host gene and calculated the distance from the insertion site to the start of the nearest downstream exon using a custom script. Insertions that were found within transcripts identified by our Iso-seq data, potentially creating TE-gene fusion transcripts, were classified as “hybrid splicing” events (see the Data and Materials Availability section for details). For the analysis in Fig 3E–H, we annotated the genomic features of both the FIMO-identified motif sites and the novel Springer insertion sites by mapping their coordinates to the dm6 reference genome. Feature distribution was determined using annotatePeaks.pl. 3′ UTR, 5′ UTR, exon, and noncoding were categorized as “exonic,” whereas intergenic, TTS, promoter-TSS were grouped as “intergenic.” To assess the expression level of genes targeted by insertions or containing the motif, we used OSC gene expression data from FlyBase (release FB2025_01). The intron number (e.g., first intron, second intron) for each intronic insertion was also determined from the HOMER output (Heinz et al, 2010).

TE insertion site and motif analyses

To identify Springer and copia insertion sites in the dm6 genome, 100-bp flanking sequences of these insertions from the OSC genome were mapped back to the dm6 reference. The center points of successfully mapped flanking pairs were defined as the insertion site. Annotations of these insertion sites are analyzed using Homer (annotatePeaks.pl) (Heinz et al, 2010). De novo motif discovery on these insertion sites was performed using MEME-ChIP (Machanick & Bailey, 2011). Springer motif sites were defined by FIMO (Grant et al, 2011) at the threshold of P = 0.001 and annotated in the same way.

Small RNA analysis

Previously published small RNA-seq datasets were re-analyzed: ovary small RNAs from the D. melanogaster ISO1 strain y[1]; cn[1] bw[1] sp[1] (SRR827770) (Shpiz et al, 2014) and Piwi-bound small RNA-seq from OSCs (SRR9158321) (Murano et al, 2019). For the ovary dataset, the Illumina 3′ adapter sequences (TGGAATTCTCGGGTGCCAAGGAACTCCAGTCAC) were removed using Cutadapt (v.4.6) (Martin, 2011). Reads outside the 20–35 nt size range were discarded, leaving ∼13 million reads for analysis. For the OSC dataset, preprocessing was performed following the previous study (Murano et al, 2019), in which four random nucleotides from each end and adapter sequences were removed using Cutadapt, and only reads between 20–35 nt size range were retained for analysis, resulting in ∼3.7 million reads. Trimmed ovary reads were aligned to dm6, whereas OSC reads were mapped to a merged OSC genome assembly (Hap_1 and Hap_2). In both cases, alignments were performed using Bowtie (v.1.3.1) (Langmead et al, 2009) with identical parameters (-v 1, ≤1 mismatch; --best), ensuring consistency between datasets. Only reads that successfully mapped to the dm6 or merged OSC genome were retained for subsequent analyses. For piRNA production analysis (Fig 3A–C), genome-mapped reads were further aligned to consensus sequences of Drosophila TEs obtained from FlyBase using Bowtie with no mismatches allowed, and random multi-mapping was performed with the -k 1 option. Read counts were normalized to counts per million mapped reads (CPM). To quantify strand-specific mapping, custom Python scripts were used to count the number of sense and antisense reads for each TE, based on SAM alignment flags. For TE locus-specific analyses (Figs S6 and S7), the previously generated alignment files were converted to BAM format, sorted, and indexed using SAMtools. Normalized coverage tracks were then produced with deepTools (bamCoverage, v3.5.5) (Ramírez et al, 2016) using counts per million mapped reads (CPM) normalization. The resulting BigWig files were visualized in Integrative Genomics Viewer (Robinson et al, 2011).

TE enrichment analysis

The dm6 and the merged OSC genome assembly (Hap_1 + Hap_2) were annotated with RepeatMasker. From the resulting GFF files, insertions ≥99.5% of the TE consensus length were defined as full-length. For each TE, genomic intervals were merged, the occupied bases were summed, and genome occupancy was calculated as the fraction of occupied bases relative to total genome size. Enrichment was defined as the ratio of occupancy in OSC genome to that in dm6. Custom Python scripts (Biopython-based) were used for these calculations (Fig 3D).

ATAC-seq analysis

Previously published ATAC-seq data (Iwasaki et al, 2016) were re-analyzed for this study. First, raw reads were processed for quality control: read errors were corrected using Rcorrector (v1.0.4) (Song & Florea, 2015) and TranscriptomeAssemblyTools (https://github.com/harvardinformatics/TranscriptomeAssemblyTools), and adapter sequences were removed using TrimGalore (v0.6.6) (https://github.com/FelixKrueger/TrimGalore). Reads shorter than 50 bp were subsequently discarded. The filtered reads were aligned to each of the Hap_1 and Hap_2 haplotypes of the OSC genome using Bowtie2 (v2.5.4) (Langmead & Salzberg, 2012) with the following parameters: “--maxins 4000 --very-sensitive --no-mixed --no-discordant.” Uniquely mapped reads were selected using SAMtools (v1.21) (Danecek et al, 2021). Read coverage was then calculated and normalized to reads per kilobase per million mapped reads (RPKM) using the bamCoverage tool from the deepTools suite (v 3.5.4) (Ramírez et al, 2016), setting the bin size to 10 bp and the minimum mapping quality to 10. For datasets with biological replicates, the signal was averaged using the bigwigAverage tool from deepTools to generate a single representative BigWig file per haplotype. These files were used to compute signal distributions centered on Springer insertion sites (Hap_1: 219 sites; Hap_2: 209 sites) using the computeMatrix tool from deepTools in scale-regions mode. The signal was calculated across each 10-kb window (5 kb upstream and downstream of each site), which was divided into 10-bp bins, with missing data treated as zero. As a control, 200 genomic windows (7,546 bp each, matching the length of a full-length Springer element) were randomly selected using the BEDTools random command (v2.31.1) (Quinlan & Hall, 2010). These control regions were processed through the same computeMatrix pipeline. The resulting signal matrices for both Springer and control sites were used to generate mean profile plots with the plotProfile tool from deepTools. The signal profiles for Hap_1 and Hap_2 were then averaged and plotted using R (R Core Team, 2025).

Definition of piRNA cluster regions

The Flam locus was defined as the genomic region between CG32820 and CG14621, which are conserved flanking genes identified in both the dm6 annotation and the lifted-over OSC gene annotations. Other piRNA cluster regions were defined based on the D. melanogaster piRNA cluster annotations from the ProTRAC database (Rosenkranz & Zischler, 2012), using sequence coordinates corresponding to the reported cluster boundaries. See Table S7 for locus information.

Table S7. Locus information. (28.5KB, xls)

Flam structural rearrangement analysis

Structural differences among the dm6 and OSC genome assemblies were analyzed using minimap2 (Li, 2018) and SyRI (Goel et al, 2019). Genome alignments were performed with minimap2 (v2.29) using the “-cx asm5 –eqx” option in two runs: OSC Hap_2 assembly was aligned to dm6, and dm6 was aligned to the OSC Hap_1 reference assembly. The resulting PAF files were analyzed with SyRI (v1.7.1) using the “--cigar –nosnp” option. Rearrangements in the Flam region and its 50-kb flanking sequences were visualized by excluding highly diverged regions, using plotsr (v1.1.1) (Goel & Schneeberger, 2022) with the “—itx” mode.

Dot plot analysis

Dot plots were generated with ModDotPlot (Sweeten et al, 2024) to compare, for each genomic region, the following pairs: dm6 versus OSC Hap_1, dm6 versus OSC Hap_2, and OSC Hap_1 versus OSC Hap_2. Unless noted otherwise, default settings were used with an identity threshold of -id 90. The sliding window parameter -r was set per analysis as follows: 600 for the Flam locus (Fig 5A–C), 1,000 for the X chromosomes (Fig 5D–F), 800 for 80F (Figs 5G and H, and S8C), 600 for 38C (Figs 5I and J, and S8D), 600 for 42AB (Figs 5K and L, and S8E), and 800 for 20A (Figs 5M and N, and S8F).

Genome comparison analysis

DNA-Seq data from previous study (Sienski et al, 2012) were mapped to the haploid genome (Hap_1/Hap_2) using Bowtie2 (Langmead & Salzberg, 2012) with default settings. A TE insertion was considered preexisting if reads spanning the TE and its flanking sequences were found in the data. Conversely, TE insertions that lacked support from such spanning reads were identified as novel insertions.

Identification of insertion sites in other OSC genomes

The genomic DNA-Seq datasets from previous studies (Sienski et al, 2012; Sytnikova et al, 2014) are mapped to Hap_1 and Hap_2, respectively, using Bowtie2. Then, if the reads which intersect TE and flanking sequencings are detected, these TEs are considered as existing in the genome of the other dataset.

Immunofluorescence

OSC cells were fixed and processed for immunostaining as described previously (Saito et al, 2010). Cells were incubated with an anti-FLAG monoclonal antibody (M2, 1:1,000 dilution; Sigma-Aldrich) as the primary antibody, followed by incubation with an Alexa Fluor 488–conjugated anti-mouse IgG secondary antibody (1:500 dilution; Thermo Fisher Scientific). After washing, samples were mounted in VECTASHIELD Mounting Medium with DAPI (Vector Laboratories) and imaged using a confocal laser scanning microscope (LSM 980; Carl Zeiss).

Statistics and reproducibility

RT-qPCR (Figs S1B and S5A) was performed over three independent experiments, and other live-cell imaging (Fig 1C–E), Western blotting (Fig S1B), 3′ RACE (Fig S2D), RT–PCR (Fig S3A), and immunofluorescence (Fig S9) were performed over two independent experiments. Statistical procedures are described in the figure legends. Sample sizes were not predetermined, and experiments were performed without randomization or blinding.

Supplementary Material

Reviewer comments

Acknowledgements

We thank H Yamazaki, S Yamanaka, R Saito, K Saito, and other members of the Siomi laboratory at the University of Tokyo for useful discussions. We thank I Oka and T Morita for technical assistances. We dedicate this work to the memory of Greg Hannon, a pioneeing leader in this field, whose contributions continue to inspire researchers worldwide. Japan Society for the Promotion of Science 24K18093 (Y Namba), Japan Society for the Promotion of Science 25H01308 (H Nishihara), Japan Society for the Promotion of Science 24K09376 (S Hirakata), Japan Society for the Promotion of Science 25H01305 (YW Iwasaki), Japan Society for the Promotion of Science 25H01303 (MC Siomi), Japan Society for the Promotion of Science Research Fellowship for Young Scientists (DC2) 22KJ2696 (C Takeuchi), and Fusion Oriented REsearch for disruptive Science and Technology JPMJFR224L (YW Iwasaki).

Author Contributions

  • M Moritoh: data curation, formal analysis, investigation, visualization, and writing—original draft, review, and editing.

  • C Takeuchi: data curation, formal analysis, and investigation.

  • Y Namba: formal analysis, validation, and investigation.

  • H Nishihara: formal analysis, investigation, and visualization.

  • S Hirakata: conceptualization, supervision, and investigation.

  • C Owa: formal analysis, validation, and investigation.

  • S Morishita: supervision, methodology, project administration, and writing—review and editing.

  • YW Iwasaki: conceptualization, supervision, and project administration.

  • MC Siomi: conceptualization, supervision, funding acquisition, project administration, and writing—original draft, review, and editing.

Conflict of Interest Statement

The authors declare that they have no conflict of interest.

Data and Materials Availability

All sequencing data supporting this study are available at NCBI. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under accession numbers JBVFKA000000000 and JBVFKB000000000. The versions described in this article are JBVFKA010000000 and JBVFKB010000000. Raw genome HiFi reads and Iso-Seq reads are available in the Sequence Read Archive (SRA) under accession numbers SRR37248010 and SRR37247793 within BioProject PRJNA1417524. Code and data are available at GitHub (https://github.com/Chikara-Takeuchi/2025_moritoh_TE/tree/master).

References

  1. Alonge M, Lebeigle L, Kirsche M, Jenike K, Ou S, Aganezov S, Wang X, Lippman ZB, Schatz MC, Soyk S (2022) Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol 23: 258. 10.1186/s13059-022-02823-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andersen PR, Tirian L, Vunjak M, Brennecke J (2017) A heterochromatin-dependent transcription machinery drives piRNA expression. Nature 549: 54–59. 10.1038/nature23482 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Aravin AA, Lagos-Quintana M, Yalcin A, Zavolan M, Marks D, Snyder B, Gaasterland T, Meyer J, Tuschl T (2003) The small RNA profile during Drosophila melanogaster development. Dev Cell 5: 337–350. 10.1016/s1534-5807(03)00228-4 [DOI] [PubMed] [Google Scholar]
  4. Ariura M, Solberg T, Ishizu H, Takahashi H, Carninci P, Siomi H, Iwasaki YW (2024) Drosophila Piwi distinguishes transposons from mRNAs by piRNA complementarity and abundance. Cell Rep 43: 115020. 10.1016/j.celrep.2024.115020 [DOI] [PubMed] [Google Scholar]
  5. Azad MF, Tong T, Lau NC (2024) Transposable element (TE) insertion predictions from RNAseq inputs and TE impact on RNA splicing and gene expression in Drosophila brain transcriptomes. Mob DNA 15: 20. 10.1186/s13100-024-00330-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R, Hannon GJ (2007) Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila. Cell 128: 1089–1103. 10.1016/j.cell.2007.01.043 [DOI] [PubMed] [Google Scholar]
  7. Cheng H, Concepcion GT, Feng X, Zhang H, Li H (2021) Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods 18: 170–175. 10.1038/s41592-020-01056-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chirn G-W, Rahman R, Sytnikova YA, Matts JA, Zeng M, Gerlach D, Yu M, Berger B, Naramura M, Kile BT, et al. (2015) Conserved piRNA expression from a distinct set of piRNA cluster loci in eutherian mammals. PLoS Genet 11: e1005652. 10.1371/journal.pgen.1005652 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Coux R-X, Teixeira FK, Lehmann R (2018) L(3)mbt and the LINT complex safeguard cellular identity in the Drosophila ovary. Development 145: dev160721. 10.1242/dev.160721 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cox DN, Chao A, Baker J, Chang L, Qiao D, Lin H (1998) A novel class of evolutionarily conserved genes defined by piwi are essential for stem cell self-renewal. Genes Dev 12: 3715–3727. 10.1101/gad.12.23.3715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Czech B, Munafò M, Ciabrelli F, Eastwood EL, Fabry MH, Kneuss E, Hannon GJ (2018) piRNA-guided genome defense: From biogenesis to silencing. Annu Rev Genet 52: 131–157. 10.1146/annurev-genet-120417-031441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, et al. (2021) Twelve years of SAMtools and BCFtools. Gigascience 10: giab008. 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Engels WR (1983) The P family of transposable elements in Drosophila. Annu Rev Genet 17: 315–344. 10.1146/annurev.ge.17.120183.001531 [DOI] [PubMed] [Google Scholar]
  14. Faulkner GJ, Kimura Y, Daub CO, Wani S, Plessy C, Irvine KM, Schroder K, Cloonan N, Steptoe AL, Lassmann T, et al. (2009) The regulated retrotransposon transcriptome of mammalian cells. Nat Genet 41: 563–571. 10.1038/ng.368 [DOI] [PubMed] [Google Scholar]
  15. Feschotte C (2023) Transposable elements: McClintock’s legacy revisited. Nat Rev Genet 24: 797–800. 10.1038/s41576-023-00652-3 [DOI] [PubMed] [Google Scholar]
  16. Flemr M, Malik R, Franke V, Nejepinska J, Sedlacek R, Vlahovicek K, Svoboda P (2013) A retrotransposon-driven dicer isoform directs endogenous small interfering RNA production in mouse oocytes. Cell 155: 807–816. 10.1016/j.cell.2013.10.001 [DOI] [PubMed] [Google Scholar]
  17. Genzor P, Konstantinidou P, Stoyko D, Manzourolajdad A, Marlin Andrews C, Elchert AR, Stathopoulos C, Haase AD (2021) Cellular abundance shapes function in piRNA-guided genome defense. Genome Res 31: 2058–2068. 10.1101/gr.275478.121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Goel M, Schneeberger K (2022) Plotsr: Visualizing structural similarities and rearrangements between multiple genomes. Bioinformatics 38: 2922–2926. 10.1093/bioinformatics/btac196 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Goel M, Sun H, Jiao W-B, Schneeberger K (2019) SyRI: Finding genomic rearrangements and local sequence differences from whole-genome assemblies. Genome Biol 20: 277. 10.1186/s13059-019-1911-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Grant CE, Bailey TL, Noble WS (2011) FIMO: Scanning for occurrences of a given motif. Bioinformatics 27: 1017–1018. 10.1093/bioinformatics/btr064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK (2010) Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38: 576–589. 10.1016/j.molcel.2010.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hinrichs AS, Karolchik D, Baertsch R, Barber GP, Bejerano G, Clawson H, Diekhans M, Furey TS, Harte RA, Hsu F, et al. (2006) The UCSC genome browser database: Update 2006. Nucleic Acids Res 34: D590–D598. 10.1093/nar/gkj144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Iwasaki YW, Murano K, Ishizu H, Shibuya A, Iyoda Y, Siomi MC, Siomi H, Saito K (2016) Piwi modulates chromatin accessibility by regulating multiple factors including histone H1 to repress transposons. Mol Cell 63: 408–419. 10.1016/j.molcel.2016.06.008 [DOI] [PubMed] [Google Scholar]
  24. Kazazian HH, Jr (2004) Mobile elements: Drivers of genome evolution. Science 303: 1626–1632. 10.1126/science.1089670 [DOI] [PubMed] [Google Scholar]
  25. Kazazian HH, Jr., Wong C, Youssoufian H, Scott AF, Phillips DG, Antonarakis SE (1988) Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism for mutation in man. Nature 332: 164–166. 10.1038/332164a0 [DOI] [PubMed] [Google Scholar]
  26. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9: 357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25. 10.1186/gb-2009-10-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Li H (2018) Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34: 3094–3100. 10.1093/bioinformatics/bty191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Machanick P, Bailey TL (2011) MEME-ChIP: Motif analysis of large DNA datasets. Bioinformatics 27: 1696–1697. 10.1093/bioinformatics/btr189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Malone CD, Brennecke J, Dus M, Stark A, McCombie WR, Sachidanandam R, Hannon GJ (2009) Specialized piRNA pathways act in germline and somatic tissues of the Drosophila ovary. Cell 137: 522–535. 10.1016/j.cell.2009.03.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Martin M (2011) Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17: 10. 10.14806/ej.17.1.200 [DOI] [Google Scholar]
  32. Modzelewski AJ, Shao W, Chen J, Lee A, Qi X, Noon M, Tjokro K, Sales G, Biton A, Anand A, et al. (2021) A mouse-specific retrotransposon drives a conserved Cdk2ap1 isoform essential for development. Cell 184: 5541–5558.e22. 10.1016/j.cell.2021.09.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Mohn F, Sienski G, Handler D, Brennecke J (2014) The rhino-deadlock-cutoff complex licenses noncanonical transcription of dual-strand piRNA clusters in Drosophila. Cell 157: 1364–1379. 10.1016/j.cell.2014.04.031 [DOI] [PubMed] [Google Scholar]
  34. Mount SM (1982) A catalogue of splice junction sequences. Nucleic Acids Res 10: 459–472. 10.1093/nar/10.2.459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Muerdter F, Guzzardo PM, Gillis J, Luo Y, Yu Y, Chen C, Fekete R, Hannon GJ (2013) A genome-wide RNAi screen draws a genetic framework for transposon control and primary piRNA biogenesis in Drosophila. Mol Cell 50: 736–748. 10.1016/j.molcel.2013.04.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Murano K, Iwasaki YW, Ishizu H, Mashiko A, Shibuya A, Kondo S, Adachi S, Suzuki S, Saito K, Natsume T, et al. (2019) Nuclear RNA export factor variant initiates piRNA-guided co-transcriptional silencing. EMBO J 38: e102870. 10.15252/embj.2019102870 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Nye J, Buster DW, Rogers GC (2014) The use of cultured Drosophila cells for studying the microtubule cytoskeleton. Methods Mol Biol 1136: 81–101. 10.1007/978-1-4939-0329-0_6 [DOI] [PubMed] [Google Scholar]
  38. Onishi R, Sato K, Murano K, Negishi L, Siomi H, Siomi MC (2020) Piwi suppresses transcription of Brahma-dependent transposons via Maelstrom in ovarian somatic cells. Sci Adv 6: eaaz7420. 10.1126/sciadv.aaz7420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Ozata DM, Gainetdinov I, Zoch A, O’Carroll D, Zamore PD (2019) PIWI-Interacting RNAs: Small RNAs with big functions. Nat Rev Genet 20: 89–108. 10.1038/s41576-018-0073-3 [DOI] [PubMed] [Google Scholar]
  40. Peccoud J, Loiseau V, Cordaux R, Gilbert C (2017) Massive horizontal transfer of transposable elements in insects. Proc Natl Acad Sci U S A 114: 4721–4726. 10.1073/pnas.1621178114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Pélisson A, Song SU, Prud’homme N, Smith PA, Bucheton A, Corces VG (1994) Gypsy transposition correlates with the production of a retroviral envelope-like protein under the tissue-specific control of the Drosophila flamenco gene. EMBO J 13: 4401–4411. 10.1002/j.1460-2075.1994.tb06760.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Post C, Clark JP, Sytnikova YA, Chirn G-W, Lau NC (2014) The capacity of target silencing by Drosophila PIWI and piRNAs. RNA 20: 1977–1986. 10.1261/rna.046300.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Pracana R, Priyam A, Levantis I, Nichols RA, Wurm Y (2017) The fire ant social chromosome supergene variant Sb shows low diversity but high divergence from SB. Mol Ecol 26: 2864–2879. 10.1111/mec.14054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Prud’homme N, Gans M, Masson M, Terzian C, Bucheton A (1995) Flamenco, a gene controlling the gypsy retrovirus of Drosophila melanogaster. Genetics 139: 697–711. 10.1093/genetics/139.2.697 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Quinlan AR, Hall IM (2010) BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 26: 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. R Core Team (2025) R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
  47. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T (2016) deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res 44: W160–W165. 10.1093/nar/gkw257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29: 24–26. 10.1038/nbt.1754 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Rosenkranz D, Zischler H (2012) proTRAC--a software for probabilistic piRNA cluster detection, visualization and analysis. BMC Bioinformatics 13: 5. 10.1186/1471-2105-13-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Saito K, Inagaki S, Mituyama T, Kawamura Y, Ono Y, Sakota E, Kotani H, Asai K, Siomi H, Siomi MC (2009) A regulatory circuit for piwi by the large Maf gene traffic jam in Drosophila. Nature 461: 1296–1299. 10.1038/nature08501 [DOI] [PubMed] [Google Scholar]
  51. Saito K, Ishizu H, Komai M, Kotani H, Kawamura Y, Nishida KM, Siomi H, Siomi MC (2010) Roles for the Yb body components Armitage and Yb in primary piRNA biogenesis in Drosophila. Genes Dev 24: 2493–2498. 10.1101/gad.1989510 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sela N, Mersch B, Gal-Mark N, Lev-Maor G, Hotz-Wagenblatt A, Ast G (2007) Comparative analysis of transposed element insertion within human and mouse genomes reveals Alu’s unique role in shaping the human transcriptome. Genome Biol 8: R127. 10.1186/gb-2007-8-6-r127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Senti K-A, Rafanel B, Handler D, Kosiol C, Schlötterer C, Brennecke J (2026) Co-evolving infectivity and expression patterns drive the diversification of endogenous retroviruses. EMBO J 45: 1889–1908. 10.1038/s44318-025-00471-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Shpiz S, Ryazansky S, Olovnikov I, Abramov Y, Kalmykova A (2014) Euchromatic transposon insertions trigger production of novel Pi- and endo-siRNAs at the target sites in the drosophila germline. PLoS Genet 10: e1004138. 10.1371/journal.pgen.1004138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Sienski G, Dönertas D, Brennecke J (2012) Transcriptional silencing of transposons by Piwi and maelstrom and its impact on chromatin state and gene expression. Cell 151: 964–980. 10.1016/j.cell.2012.10.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Signor S, Vedanayagam J, Kim BY, Wierzbicki F, Kofler R, Lai EC (2023) Rapid evolutionary diversification of the flamenco locus across simulans clade Drosophila species. PLoS Genet 19: e1010914. 10.1371/journal.pgen.1010914 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Siomi MC, Sato K, Pezic D, Aravin AA (2011) PIWI-Interacting small RNAs: The vanguard of genome defence. Nat Rev Mol Cell Biol 12: 246–258. 10.1038/nrm3089 [DOI] [PubMed] [Google Scholar]
  58. Smit AFA, Hubley R, Green P (2015) RepeatMasker Open-4.0. Available at: http://www.repeatmasker.org. [Google Scholar]
  59. Song L, Florea L (2015) Rcorrector: Efficient and accurate error correction for Illumina RNA-seq reads. Gigascience 4: 48. 10.1186/s13742-015-0089-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Song SU, Kurkulos M, Boeke JD, Corces VG (1997) Infection of the germ line by retroviral particles produced in the follicle cells: A possible mechanism for the mobilization of the gypsy retroelement of Drosophila. Development 124: 2789–2798. 10.1242/dev.124.14.2789 [DOI] [PubMed] [Google Scholar]
  61. Sumiyoshi T, Sato K, Yamamoto H, Iwasaki YW, Siomi H, Siomi MC (2016) Loss of l(3)mbt leads to acquisition of the ping-pong cycle in Drosophila ovarian somatic cells. Genes Dev 30: 1617–1622. 10.1101/gad.283929.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Sweeten AP, Schatz MC, Phillippy AM (2024) ModDotPlot-rapid and interactive visualization of tandem repeats. Bioinformatics 40: btae493. 10.1093/bioinformatics/btae493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Sytnikova YA, Rahman R, Chirn G-W, Clark JP, Lau NC (2014) Transposable element dynamics and PIWI regulation impacts lncRNA and gene expression diversity in Drosophila ovarian cell cultures. Genome Res 24: 1977–1990. 10.1101/gr.178129.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Takeuchi C, Yokoshi M, Kondo S, Shibuya A, Saito K, Fukaya T, Siomi H, Iwasaki YW (2022) Mod(mdg4) variants repress telomeric retrotransposon HeT-A by blocking subtelomeric enhancers. Nucleic Acids Res 50: 11580–11599. 10.1093/nar/gkac1034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Touret F, Guiguen F, Greenland T, Terzian C (2014) In between: Gypsy in Drosophila melanogaster reveals new insights into endogenous retrovirus evolution. Viruses 6: 4914–4925. 10.3390/v6124914 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. van Lopik J, Alizada A, Trapotsi M-A, Hannon GJ, Bornelöv S, Czech Nicholson B (2023) Unistrand piRNA clusters are an evolutionarily conserved mechanism to suppress endogenous retroviruses across the Drosophila genus. Nat Commun 14: 7337. 10.1038/s41467-023-42787-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Varoqui M, Mohamed M, Mugat B, Gourion D, Lemoine M, Pélisson A, Grimaud C, Chambeyron S (2025) Temporal and spatial niche partitioning in a retrotransposon community of the Drosophila melanogaster genome. Nucleic Acids Res 53: gkaf516. 10.1093/nar/gkaf516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Vaury C, Bucheton A, Pelisson A (1989) The beta heterochromatic sequences flanking the I elements are themselves defective transposable elements. Chromosoma 98: 215–224. 10.1007/BF00329686 [DOI] [PubMed] [Google Scholar]
  69. Wolfe D, Dudek S, Ritchie MD, Pendergrass SA (2013) Visualizing genomic information across chromosomes with PhenoGram. BioData Min 6: 18. 10.1186/1756-0381-6-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Yamamoto‐Matsuda H, Miyoshi K, Moritoh M, Yoshitane H, Fukada Y, Saito K, Yamanaka S, Siomi MC (2022) Lint‐O cooperates with L(3)mbt in target gene suppression to maintain homeostasis in fly ovary and brain. EMBO Rep 23: e53813. 10.15252/embr.202153813 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Yamazaki H, Namba Y, Kuriyama S, Nishida KM, Kajiya A, Siomi MC (2023) Bombyx Vasa sequesters transposon mRNAs in nuage via phase separation requiring RNA binding and self-association. Nat Commun 14: 1942. 10.1038/s41467-023-37634-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Zanni V, Eymery A, Coiffet M, Zytnicki M, Luyten I, Quesneville H, Vaury C, Jensen S (2013) Distribution, evolution, and diversity of retrotransposons at the flamenco locus reflect the regulatory properties of piRNA clusters. Proc Natl Acad Sci U S A 110: 19842–19847. 10.1073/pnas.1313677110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Zhang Z, Wang J, Schultz N, Zhang F, Parhad SS, Tu S, Vreven T, Zamore PD, Weng Z, Theurkauf WE (2014) The HP1 homolog rhino anchors a nuclear complex that suppresses piRNA precursor splicing. Cell 157: 1353–1363. 10.1016/j.cell.2014.04.030 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S2. Springer-intF elements. (44KB, xls)

Table S6. Primer and siRNA sequences. (32.5KB, xls)

Table S7. Locus information. (28.5KB, xls)

Reviewer comments

Data Availability Statement

All sequencing data supporting this study are available at NCBI. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under accession numbers JBVFKA000000000 and JBVFKB000000000. The versions described in this article are JBVFKA010000000 and JBVFKB010000000. Raw genome HiFi reads and Iso-Seq reads are available in the Sequence Read Archive (SRA) under accession numbers SRR37248010 and SRR37247793 within BioProject PRJNA1417524. Code and data are available at GitHub (https://github.com/Chikara-Takeuchi/2025_moritoh_TE/tree/master).


Articles from Life Science Alliance are provided here courtesy of Life Science Alliance LLC

RESOURCES