Skip to main content
eLife logoLink to eLife
. 2022 Oct 4;11:e80067. doi: 10.7554/eLife.80067

The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters

Lisa Baumgartner 1,2, Dominik Handler 1, Sebastian Wolfgang Platzer 1, Changwei Yu 1, Peter Duchek 1, Julius Brennecke 1,
Editors: Deborah Bourc'his3, Kevin Struhl4
PMCID: PMC9531945  PMID: 36193674

Abstract

RNA interference systems depend on the synthesis of small RNA precursors whose sequences define the target spectrum of these silencing pathways. The Drosophila Heterochromatin Protein 1 (HP1) variant Rhino permits transcription of PIWI-interacting RNA (piRNA) precursors within transposon-rich heterochromatic loci in germline cells. Current models propose that Rhino’s specific chromatin occupancy at piRNA source loci is determined by histone marks and maternally inherited piRNAs, but also imply the existence of other, undiscovered specificity cues. Here, we identify a member of the diverse family of zinc finger associated domain (ZAD)-C2H2 zinc finger proteins, Kipferl, as critical Rhino cofactor in ovaries. By binding to guanosine-rich DNA motifs and interacting with the Rhino chromodomain, Kipferl recruits Rhino to specific loci and stabilizes it on chromatin. In kipferl mutant flies, Rhino is lost from most of its target chromatin loci and instead accumulates on pericentromeric Satellite arrays, resulting in decreased levels of transposon targeting piRNAs and impaired fertility. Our findings reveal that DNA sequence, in addition to the H3K9me3 mark, determines the identity of piRNA source loci and provide insight into how Rhino might be caught in the crossfire of genetic conflicts.

Research organism: D. melanogaster

eLife digest

The genes within our DNA encode the essentials of our body plan and how each task in the body is achieved. However, our genome also contains many repetitive regions of DNA that do not encode functional genes. Some of these regions are genetic parasites known as transposons that try to multiply and spread around the DNA of their host. To prevent transposon DNA from interfering with the way the body operates, humans and other animals have evolved elaborate defense mechanisms to identify transposons and prevent them from multiplying.

In one such mechanism, known as the piRNA pathway, the host makes small molecules known as piRNAs that have sequences complementary to those of transposons, and act as guides to silence the transposons. The instructions to make these piRNAs are stored in the form of transposon fragments in dedicated regions of host DNA called piRNA clusters. These clusters thereby act as genetic memory, allowing the host to recognize and silence specific transposons in other locations within the host’s genome. In fruit flies, a protein called Rhino binds to piRNA clusters that are densely packed to allow piRNAs to be made. However, it remained unclear how Rhino is able to identify and bind to piRNA clusters, but not to other similarly densely packed regions of DNA.

Baumgartner et al. used a combination of genetic, genomic, and imaging approaches to study how Rhino finds its way in the fruit fly genome. They found that another protein called Kipferl interacts with Rhino and is required for Rhino to bind to nearly all piRNA clusters. Since Kipferl can by itself bind to the sequences that Rhino needs to find, the results suggest that Kipferl acts to recruit and initiate Rhino binding within densely packed piRNA clusters. Further experiments found that, in flies lacking Kipferl, Rhino binds to regions of DNA called Satellite repeats, hinting that these selfish sequences may compete for Rhino for their own benefit.

The finding that Kipferl and Rhino work together to define the memory system of the piRNA pathway strongly advances our understanding of how a sequence-specific defense system based on small RNAs can be established.

Introduction

Eukaryotic genomes are littered with transposable elements. By constantly aiming to increase their copy number, transposons pose a major threat to genomic integrity and must therefore be tightly controlled. An important transposon silencing mechanism in animals is the PIWI-interacting small RNA (piRNA) pathway (Siomi et al., 2011; Czech et al., 2018; Ozata et al., 2019, Senti and Brennecke, 2010). Its loss leads to transposon de-repression and mobilization, DNA damage, and sterility. At the center of the piRNA pathway are Argonaute proteins of the PIWI clade loaded with 23-30nt long piRNAs which enable sequence specific repression of transposons at the transcriptional and post-transcriptional levels. To control the highly diverse transposon sequence repertoire, an equally diverse set of piRNAs must be generated. Transposon targeting piRNAs are typically encoded in specialized genomic regions (Brennecke et al., 2007; Houwing et al., 2007; Aravin et al., 2008). These so-called piRNA clusters are enriched in transposon sequences, range from a few to several hundred kilobases in length, and act as heritable and adaptive transposon sequence libraries for the piRNA pathway.

Through their transcription, piRNA clusters provide the essential small RNA precursors with sequence information antisense to active transposon transcripts. The definition of specific genomic loci as piRNA precursor sources is therefore of central importance to determine the pathway’s target spectrum. In Drosophila gonads, two general types of piRNA clusters are distinguished based on their mode of transcription: Uni-strand clusters are transcribed on one genomic strand from canonical RNA polymerase II promoters, giving rise to long, single stranded, and poly-adenylated transcripts (Brennecke et al., 2007; Mohn et al., 2014; Goriaux et al., 2014). The second type are dual-strand piRNA clusters, which are transcribed on both genomic strands (Brennecke et al., 2007; Mohn et al., 2014; Klattenhoff et al., 2009; Zhang et al., 2014b; Chen et al., 2016; Andersen et al., 2017). Dual-strand clusters are active in germline cells and are embedded within heterochromatin. They differ from canonical RNA polymerase II transcription units by a lack of defined promoters, and by suppression of splicing and cleavage- and polyadenylation signals.

The molecular identity of dual-strand piRNA clusters is conferred by Rhino, a germline-specific variant of the canonical Heterochromatin Protein 1 a (Su(var)2–5) (Vermaak and Malik, 2009; Klattenhoff et al., 2009). While Su(var)2–5 mediates gene silencing and chromatin compaction, Rhino acts in an opposite manner: It enables the recruitment of several germline-specific proteins, which are required to engage the cellular gene expression machinery at piRNA clusters and enable the nuclear export of the emerging transcripts (ElMaghraby et al., 2019; Andersen et al., 2017; Mohn et al., 2014; Kneuss et al., 2019; Zhang et al., 2014b; Zhang et al., 2012a; Zhang et al., 2018; Hur et al., 2016; Chen et al., 2016). In rhino mutant flies, dual-strand piRNA clusters lose their transcriptional capacity and convert into canonical heterochromatin. Consequently, piRNA production collapses, transposons are de-repressed, and flies are sterile. Based on these data, Rhino is the defining feature of dual-strand piRNA clusters, and its specific deposition is at the very center of steering piRNA populations to target ‘non-self’ (transposons) but not ‘self’ (genic loci).

Despite an increasing understanding of how Rhino orchestrates piRNA cluster expression, little is known about how cells control the specific deposition of Rhino onto chromatin. Current models involve a role of maternally inherited Piwi-piRNA complexes in the specification of piRNA cluster identity through Piwi-dependent deposition of H3K9 methylation, which is postulated to define dual-strand clusters at the chromatin level (Mohn et al., 2014; Le Thomas et al., 2014; Shpiz et al., 2014; Akkouche et al., 2017). In this manner, Rhino can be recruited adaptively to a changing transposon insertion profile. Indeed, Rhino’s N-terminal chromodomain displays specific affinity to H3K9me3, which is consistently present at piRNA clusters (Mohn et al., 2014; Le Thomas et al., 2014; Yu et al., 2015). However, a large fraction of the pericentromeric, H3K9me3 enriched heterochromatin is not or only weakly bound by Rhino. Since its discovery, a major open question has therefore been how Rhino’s genomic binding profile is defined.

Two observations indicate that, besides H3K9me3, cellular and genomic context impact Rhino’s binding pattern. First, while Rhino is expressed both in ovaries and testes, the identity of dual-strand piRNA clusters differs in the two gonads despite identical DNA sequence (Mohn et al., 2014; Klattenhoff et al., 2009; Chen et al., 2021). Moreover, the genomic Rhino profile is dynamic in testes where the X-chromosomal AT-chX cluster attracts most of the cellular Rhino pool in differentiating spermatocytes but not at earlier developmental stages (Chen et al., 2021). Second, while all stand-alone insertions of active transposons are silenced through piRNA-guided heterochromatin formation, only around 20% of the insertions of any given transposon are, by unknown mechanisms, bound by Rhino (Mohn et al., 2014; Shpiz et al., 2014; Radion et al., 2019, Akulenko et al., 2018). Based on these two observations, Rhino-domains must be defined through a combination of H3K9me3 with additional, unknown activities that bind chromatin via Piwi-independent mechanisms.

Here, we show that the zinc finger protein CG2678/Kipferl defines the majority of Rhino’s chromatin binding pattern in ovaries. The combinatorial readout of Kipferl’s sequence-specific DNA binding together with Rhino’s affinity to the H3K9me3 mark underlies the selective recognition of heterochromatic loci as substrates for piRNA precursor transcription. Our findings also suggest that distinct chromatin loci might compete for the cellular Rhino pool, offering insight into why Rhino is a fast-evolving protein.

Results

The H3K9me3 mark alone cannot explain the large diversity of genomic Rhino domains

As a foundation for studying how Rhino’s chromatin binding specificity is defined molecularly, and how variable it is in fly strains with different transposon insertion profiles, we determined Rhino’s chromatin occupancy genome-wide in two commonly used laboratory strains (w1118 and MTD-Gal4) and in iso1, the strain underlying the D. melanogaster reference genome (Hoskins et al., 2015). Chromatin immunoprecipitation experiments followed by next-generation sequencing (ChIP-seq) confirmed that Rhino binds mostly to extended domains that vary greatly in size and often lack clearly defined boundaries (Mohn et al., 2014). We therefore divided the genome into nonoverlapping genomic 1-kb tiles and calculated the Rhino enrichment per tile in each strain. This revealed that a substantial fraction of the genome (e.g. 3.9 Mbp corresponding to ~3% of the analyzable genome in the w1118 strain) showed greater than 4-fold enrichment (p<0.036, Z-score >2.1) for Rhino over input.

To analyze Rhino’s chromatin binding profile, we divided the genome into pericentromeric heterochromatin and the generally euchromatic chromosome arms (based on H3K9me3 and Su(var)2–5 ChIP-seq data) (Figure 1A). The three well-described Rhino-dependent dual-strand piRNA clusters – cluster 38C, 42AB, and 80F – were not assigned to either category but were analyzed separately as reference loci. Rhino enrichment at major dual-strand clusters but also within the recombination-poor, pericentromeric heterochromatin followed a highly similar pattern in all three fly strains, with only few loci displaying strain-specific quantitative differences (Figure 1B left, middle; Figure 1—figure supplement 1A). Many Rhino domains in pericentromeric heterochromatin (e.g. Figure 1C panel 1) resembled the large piRNA clusters like cluster 80F (Figure 1C panel 2): they exhibited strong H3K9me3 signal, gave rise to abundant piRNAs, and were enriched in diverse transposon sequences. Within chromosome arms, several strain-specific Rhino domains were apparent (Figure 1B right). Based on manual inspection, these corresponded to genomic loci flanking stand-alone transposon insertions (e.g. a flea insertion found only in the w1118 strain (Figure 1C panel 3)). The majority of euchromatic Rhino domains, however, were shared among all three strains (Figure 1B right; Figure 1—figure supplement 1A). Based on the iso1 data and the reference genome, many of these shared domains were not associated with transposon insertions. Upstream of the headcase locus, for example, Rhino was strongly enriched in all strains, although this locus lacks transposon sequences and does not produce abundant piRNAs (Figure 1C panel 4). Nonetheless, these loci displayed dual-strand transcription based on RIP-seq data for Nxf3 (ElMaghraby et al., 2019), the piRNA precursor specific nuclear export factor, indicating that these loci were functional Rhino domains (Figure 1C). The poor piRNA output from these transposon-devoid loci is therefore not due to non-functional Rhino, but most likely due to a lack of piRNA target sites for Aub/Ago3-mediated transcript cleavage in the emerging transcripts, which greatly stimulate piRNA biogenesis (Mohn et al., 2015; Han et al., 2015). Altogether, our data demonstrate that Rhino, in a largely strain-independent manner, binds to thousands of remarkably diverse genomic loci that include, but are not limited to sources of dual-strand piRNAs.

Figure 1. H3K9me3 is insufficient to define the large diversity of genomic Rhino domains.

(A) ChIP-seq enrichment (genome unique reads; 1-kb tiles, one representative replicate each) of H3K9me3 and Su(var)2–5 along the assembled chromosome 3 sequence in w1118 ovaries. Pericentromeric heterochromatin and euchromatic chromosome arms are indicated. (B) Scatter plot comparing average log2 ChIP-seq enrichments for Rhino in ovaries from w1118 (n=2) versus MTD-Gal4 >w-sh (n=3) strains (1-kb tiles separated into pericentromeric heterochromatin and chromosomal arms; piRNA clusters 38C, 42AB, and 80F are shown separately; colored 1-kb tiles correspond to example loci in panel C). (C) UCSC genome browser tracks depicting the diversity of Rhino domains. 1: heterochromatic, transposon-rich locus; 2: piRNA cluster 80F; 3: strain-specific flea insertion (ElMaghraby et al., 2019) in w1118; 4: Rhino domain proximal to euchromatic headcase locus. Unless indicated otherwise, data are from w1118ovaries (ChIP and RIP signal are depicted as coverage per million reads, piRNA coverage normalized to miRNA reads, data is displayed for one representative replicate). GFP-RIP-seq serves as control for non-specific mRNA binding. (D, E) Violin plots showing average log2 fold enrichment of Rhino ChIP-seq over input for 1-kb tiles (n=2) from w1118 ovaries. Tiles were grouped into Rhino-bound and non-Rhino-bound based on a cutoff of fourfold enrichment (corresponding to p=0.036, Z-score=2.1) of Rhino ChIP-seq signal over input in each replicate experiment. Rhino-dependent piRNA clusters 38C, 42AB, and 80F were analyzed separately (shown as box plots due to low number of tiles). Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (F) UCSC genome browser tracks depicting a pericentromeric heterochromatin locus marked by H3K9me3 and bound by Su(var)2–5, but not Rhino (ChIP-seq signal: coverage per million reads; piRNA coverage normalized to miRNA reads; all data obtained from w1118 ovaries, data is displayed for one representative replicate).

Figure 1.

Figure 1—figure supplement 1. Relationships between methylated H3K9 and chromatin target sites of HP1 family members.

Figure 1—figure supplement 1.

(A) Scatter plots contrasting average log2 ChIP-seq enrichment for Rhino in w1118 (n=2) or MTD-Gal4 (n=3) versus iso1 (n=2) ovaries. Depicted are 1-kb tiles separated into pericentromeric heterochromatin and chromosomal arms. piRNA clusters 38C, 42AB, and 80F are excluded from the categorization into heterochromatin and euchromatin and are depicted separately. Indicated colored 1-kb tiles correspond to example loci in Figure 1F. (B, C) Violin plots showing the average log2-fold ChIP-seq enrichment of endogenous and germline specific Su(var)2–5 (transgene expressed under the control of the rhino promoter) and H3K9me2 (C) over input on 1-kb tiles. Tiles are grouped into Rhino-bound and non-Rhino-bound analogous to Figure 1D and E with Rhino-dependent piRNA clusters 38C, 42AB, and 80F depicted separately for comparison. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (D) Heat map centered on HP1b (left) or HP1c (right) peaks, respectively, depicting the indicated ChIP-seq signal (n=1) in ovaries from the w1118 strain.

Consistent with a reported role of H3K9 methylation in dual-strand piRNA cluster biology, Rhino domains in both pericentromeric heterochromatin and within chromosome arms consistently showed enrichment in H3K9me3 (Figure 1D). However, H3K9me3 levels did not directly correlate with Rhino enrichments: First, enrichments of H3K9me3 were considerably higher for Rhino domains within pericentromeric heterochromatin compared to those in chromosome arms, despite comparable Rhino enrichments (Figure 1D). Second, numerous loci in heterochromatin with high H3K9me3 signal were not or only poorly bound by Rhino (Figure 1E and F). Besides H3K9me3, the recombinant Rhino chromodomain also binds to H3K9me2, yet with lower affinity (Mohn et al., 2014; Le Thomas et al., 2014; Yu et al., 2015). As for H3K9me3, however, a large fraction of H3K9me2 was found outside of Rhino-bound loci (Figure 1—figure supplement 1). In contrast, Su(var)2–5 enrichments followed H3K9me2/3 levels at almost all genomic loci (Figure 1—figure supplement 1C compared to Figure 1—figure supplement 1B). This was also true for a tagged Su(var)2–5 protein expressed exclusively in germline cells where it competes with Rhino for binding to H3K9me3. Thus, methylation of H3K9 alone, while being a hallmark of Rhino domains, cannot explain Rhino’s chromatin profile.

The ZAD-zinc finger CG2678 interacts with Rhino and binds Rhino domains genome-wide

Sequence-specific DNA binding proteins can guide HP1 family proteins to chromatin. HP1b and HP1c, for example, depend on the zinc finger proteins Woc and Row for chromatin binding to thousands of sites that lack signal for Su(var)2–5 and even H3K9me2/3 (Figure 1—figure supplement 1D; Font-Burgada et al., 2008). Considering this, we hypothesized that undiscovered proteins with affinity to DNA motifs or to other histone marks collaborate with H3K9me3 to determine Rhino’s genomic target loci.

To identify Rhino-interacting proteins, we performed a yeast two-hybrid screen using full length Rhino as bait and a cDNA library obtained from ovary mRNAs as prey. We identified 26 putative interactors among 175 sequenced colonies (Supplementary file 2), among them the known Rhino-interactor Deadlock (83 independent clones) (Mohn et al., 2014). With twelve independent clones, the uncharacterized protein CG2678 was the second most enriched screen hit. CG2678 further stood out among the other identified interactors because of its domain architecture: CG2678 encodes a predicted DNA binder whose two annotated protein isoforms carry one or two arrays of C2H2 zinc fingers (ZnFs) (Figure 2A, Figure 2—figure supplement 1). Based on an N-terminal ZnF associated domain (ZAD), CG2678 is a member of the large group of ZAD-ZnF proteins that have radiated extensively in insects (Chung et al., 2007; Chung et al., 2002). Only a handful of the >90 D. melanogaster ZAD-ZnF proteins have been studied, and these are involved in transcription (Bag et al., 2021; Harms et al., 2000), genome organization (Maksimenko et al., 2015; Sabirov et al., 2021), and heterochromatin biology (Kasinathan et al., 2020). As a predicted DNA-binding protein and a putative Rhino interactor, CG2678 was an intriguing candidate for a potential Rhino specificity factor. To ask whether Rhino and CG2678 interacted also in vivo, we performed a proximity labeling experiment (Branon et al., 2018). We generated transgenic flies expressing GFP-tagged Rhino and low levels of the biotin TurboID-Ligase fused to a GFP nanobody, isolated biotinylated proteins under denaturing conditions from ovary lysate, and performed quantitative mass spectrometry. This revealed that CG2678 was enriched in the Rhino-GFP sample versus a control sample at similar levels as the piRNA cluster factors Deadlock, Cutoff, and Bootlegger (Figure 2B; Figure 2—figure supplement 2A). Together, these data implicated CG2678 as a promising candidate for further characterization.

Figure 2. The ZAD-zinc finger CG2678 interacts with Rhino and binds Rhino domains genome-wide.

(A) Genomic CG2678 locus depicting the two annotated transcripts and CG2678 protein domain architecture (location of the frame shift mutation (red asterisk), internal 3xFLAG affinity tag (red circle), and cleavage sites for full locus deletion (red arrows) are indicated). (B) Volcano plot showing fold enrichment of proteins determined by quantitative mass spectrometry (Doblmann et al., 2019) in GFP-Rhino TurboID samples versus nuclear GFP TurboID control (n = 3 biological replicates; statistical significance based on two-sided t-test; P values corrected for multiple testing (Benjamini–Hochberg), see also Figure 2—figure supplement 2A and Figure 2—source data 1). (C, D) Confocal images of nurse cell nuclei (C) and germarium (D). Single channel and merged color images depict immunofluorescence signal for endogenous CG2678 (left, cyan) and Rhino (middle, magenta). Scale bar: 5 µm, dotted line: nuclear outline based on DAPI. (E) Scatter plot depicting correlation of log2-fold Rhino versus CG2678 ChIP-seq enrichment in w1118 ovaries (average of 2 replicates each). CG2678-only tiles are highlighted in green and were defined by significantly higher enrichment for Rhino than CG2678 (n=2, Z-score=3), plus a Rhino enrichment of max. 4-fold in two independent experimental replicates. (F, G) UCSC browser tracks illustrating CG2678 signal at diverse Rhino domains (F) and at CG2678-only peaks (G). ChIP-seq signal is shown as coverage per million sequenced reads for w1118 ovaries for one representative replicate. (H) Violin plots depicting log2-fold enrichment of H3K9me2 (orange, n=1) and H3K9me3 (brown, n=2) at euchromatic 1-kb tiles bound by neither Rhino nor CG2678, both proteins, or CG2678 only. Classification into groups was performed based on binary cutoffs for Rhino (fourfold) and a linear fit for CG2678 co-occupancy in two independent replicate ChIP-seq experiments from w1118 ovaries to extract CG2678-only tiles highlighted in (E).

Figure 2—source data 1. Imputed enrichment and p-values (n=3) for quantitative mass spectrometry data underlying Figure 2B.

Figure 2.

Figure 2—figure supplement 1. Multiple sequence alignment of selected reciprocal best CG2678 homologs generated with mafft v7.505 (Rozewicki et al., 2016) and visualized with espript v3.0 (Robert and Gouet, 2014).

Figure 2—figure supplement 1.

Protein sequences included in the alignment were obtained from the NCBI non-redundant database, their protein accessions being XP_033162862, XP_032579476, EDX12039, XP_039231961, XP_039493628, XP_043654173, XP_044252272, XP_017119505, XP_017057177.
Figure 2—figure supplement 2. CG2678 is expressed in ovaries but not in testes and binds Rhino domains on chromatin.

Figure 2—figure supplement 2.

(A) Cartoon illustrating flexible GFP-nanobody-TurboID setup used for the characterization of proteins in close proximity to GFP-tagged Rhino. (B) Bar graph indicating the expression pattern of CG2678 (Flybase modENCODE Anatomy RNA-Seq; L: larva; P: pupa; A: adult; M: male; F: female). (C) Confocal image of anterior tip of a testes stained for endogenous CG2678 (left, cyan) and Rhino (middle, magenta) with enlarged view of one nucleus (dotted line: nuclear outline based on DAPI; scale bar: 5 µm). (D, E) Scatter plots depicting the correlation of log2-fold Rhino versus CG2678 ChIP-seq enrichments in MTD-Gal4 (D; n=2) or iso1 ovaries (E; n=1).

Like for rhino, CG2678 mRNA levels were detected primarily in ovaries (Flybase gene expression atlas, Figure 2—figure supplement 2B, Larkin et al., 2021). Immunofluorescence experiments revealed CG2678 expression specifically in ovarian germline cells, with no detectable signal above background in somatic cells which lack Rhino-defined piRNA clusters (Figure 2C). In germline nurse cells, CG2678 was enriched in nuclei and accumulated in numerous discrete foci in a pattern that was almost indistinguishable from that of a Rhino staining (Figure 2C). Expression of CG2678 in ovaries, however, was not uniform: Single cell RNA-seq data from adult ovaries indicated low mRNA levels in germline stem cells and increased levels in differentiating nurse cells (Rust et al., 2020). Consistent with this, CG2678 protein, while abundant in differentiating cysts and polyploid nurse cells, was barely detectable in germline stem cells and cystoblasts despite these cells expressing Rhino (Figure 2D). CG2678 was further not detectable in testes (Figure 2—figure supplement 2B, C). These data were intriguing given that the identity of Rhino-dependent dual-strand piRNA clusters differs, for unknown reasons, between males and females (Chen et al., 2021).

ChIP-seq experiments using anti-CG2678 antibody confirmed that the co-localization of CG2678 and Rhino in nuclear foci was due to both proteins binding the same chromatin sites, as CG2678 co-occupied Rhino domains genome-wide in all three wild-type strains (w1118, MTD-Gal4, and iso1 strains) (Figure 2E; Figure 2—figure supplement 2D, E). Closer inspection revealed that CG2678 and Rhino bind to chromatin in a virtually indistinguishable pattern, often occupying broad chromatin domains. This was true at major piRNA clusters (e.g. cluster 80F), Rhino domains in pericentromeric heterochromatin, stand-alone transposon insertions in chromosome arms as well as loci like headcase that lack transposon sequences and piRNA output (Figure 2F).

Notably, 647 genomic 1-kb tiles exhibited significantly higher enrichment for CG2678 than for Rhino (Z-score >3) and were not enriched for Rhino (CG2678-only loci; 15.4% of all Kipferl-bound tiles) (Figure 2E). At these loci, the CG2678 ChIP-seq signal formed narrow peaks (e.g. Figure 2G). 98.7% of the CG2678-only loci resided within chromosome arms, suggesting that lack of H3K9 methylation at these sites might prevent stable Rhino binding. Indeed, almost all CG2678-only loci were devoid of H3K9 methylation (Figure 2G and H). Overall, our findings revealed a molecular relationship between Rhino and the potentially sequence-specific DNA binder CG2678.

Rhino’s chromatin occupancy changes dramatically in CG2678/kipferl mutants

To investigate the role of CG2678 in Rhino biology, we generated mutant fly lines carrying frameshift alleles or a complete deletion of the CG2678 locus (Figure 2A). CG2678 null mutant flies were viable and, in western blot analysis, did not express detectable CG2678 protein (Figure 3A). CG2678 mutant females showed normal egg-laying rates but strong fertility defects (Figure 3B). Insertion of the CG2678 genomic sequence with an internal FLAG-tag (Figure 2A) at the deleted locus restored fertility to wildtype levels (Figure 3B).

Figure 3. Rhino’s chromatin occupancy changes dramatically in CG2678/kipferl mutants.

(A) Western blot analysis verifying CG2678 frame shift (fs1) and locus deletion (Δ1) alleles using a monoclonal antibody against CG2678 (top; CG2678-PB is a minor protein isoform) and depicting Rhino levels in the absence of CG2678 (bottom). Ponceau staining: loading control. (B) Time-resolved hatching rates for eggs laid by w1118 control females in comparison to females carrying a CG2678 frame shift (fs1), locus deletion (Δ1), or tagged rescue construct instead of the CG2678 locus, respectively (AM, PM indicates egg laying time). Total number of eggs laid is indicated for each genotype. (C) Confocal images illustrating localization of GFP-Rhino in nurse cell nuclei of w1118, CG2678 locus deletion (Δ1), and CG2678,nxf3 double mutant females (scale bar: 5 µm). (D) Scatter plot of genomic 1-kb tiles contrasting average log2-fold Rhino ChIP-seq enrichment in ovaries with MTD-Gal4 driven CG2678/kipferl knock down versus control ovaries (average of two replicate experiments each). (E) Violin plots showing average log2-fold Rhino ChIP-seq enrichment in control (n=3) as well as CG2678/kipferl (n=2) or rhino (n=1) germline knockdown ovaries on Rhino-bound 1-kb tiles (defined in Figure 1D) in heterochromatin (HC) and chromosome arms (EC). piRNA clusters 38C, 42AB, and 80F are depicted separately. *** corresponds to P<0,001 based on student’s t-test. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (F) UCSC browser tracks (ChIP-seq) depicting diverse Rhino domains in control and CG2678/kipferl germline knockdown ovaries (signal shown as coverage per million sequenced reads for one representative replicate). (G) Violin plots showing average log2-fold H3K9me3 ChIP-seq enrichment in control (n=3) and CG2678/kipferl (n=2) germline knockdown for Rhino-bound 1-kb tiles (defined in Figure 1D) in heterochromatin (HC) and along chromosome arms (EC). piRNA clusters 38C, 42AB, and 80F are depicted separately. *** and n.s. corresponds to p<0.001 or p>0.05, respectively, based on student’s t-test. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (H, I) Jitter plots depicting the log2-fold Rhino ChIP-seq enrichments on transposon (H) and Satellite (I) consensus sequences in indicated genetic backgrounds. (J) Confocal images showing Rsp and 1.688 Satellite RNA FISH signal and GFP-Rhino in nurse cells of w1118 or CG2678/kipferl mutant flies (scale bar: 5 µm).

Figure 3.

Figure 3—figure supplement 1. CG2678/Kipferl impacts Rhino's nuclear localization and chromatin occupancy.

Figure 3—figure supplement 1.

(A) Western blots showing CG2678 or Rhino protein levels in ovarian lysate from indicated genotypes (Ponceau staining served as loading control). Changes in protein levels and isoform ratios between control and rhino depleted ovaries were not reproducible and are likely caused by differences in ovary morphology, which distort the protein composition across samples. (B) Confocal images showing immunofluorescence signal for CG2678 and Rhino in nurse cells of MTD-Gal4-driven control or CG2678 knockdown ovaries (scale bar: 5 µm). (C) Confocal images showing FLAG-tagged CG2678 and Rhino signal in nurse cells expressing an internally FLAG-tagged CG2678 in the CG2678 mutant background (see Figure 2B for location of FLAG tag; scale bar: 5 µm). (D) Confocal images showing the localization of GFP-Rhino with H3K9me3 (top), Deadlock (middle), or Nxf3 (bottom) in CG2678 null mutant nurse cells (scale bar: 5 µm). (E) Scatter plot of genomic 1-kb tiles contrasting average log2-fold enrichment for Rhino ChIP-seq signal in CG2678 knock out versus w1118 control ovaries (three and two replicate experiments, respectively). (F) Violin plots showing average log2-fold H3K9me2 ChIP-seq enrichment for Rhino-bound 1-kb tiles in MTD-Gal4 background in heterochromatin (HC) and along chromatin arms (EC) contrasting control (n=2) to RNAi-mediated CG2678/kipferl knockdown (n=1). Rhino-dependent piRNA clusters 38C, 42AB, and 80F are shown separately. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (G) Confocal images showing the localization of piRNA cluster RNA via RNA FISH in respect to GFP-Rhino in nurse cells of w1118 or CG2678/kipferl locus deletion flies (scale bar: 5 µm). Imaging conditions were chosen to avoid photo bleaching and to optimally display the large Rhino-GFP accumulations to show that these are not corresponding to either of these clusters. (H) Confocal images showing Rsp and 1.688 Satellite RNA FISH signal in respect to GFP-Rhino in nurse cells of CG2678/kipferl,nxf3 double mutant flies (scale bar: 5 µm).

Loss of CG2678 had no impact on Rhino levels (Figure 3A) but resulted in pronounced changes in Rhino localization: In wildtype nurse cells, Rhino accumulated in many distinct foci throughout the nucleus. In CG2678 mutants, almost all Rhino signal gathered in a few large, often continuous structures adjacent to the nuclear envelope (Figure 3C). Depletion of CG2678 mRNA by germline-specific RNAi caused a similar phenotype (Figure 3—figure supplement 1A, B), and expression of a tagged wildtype CG2678 protein in CG2678 mutants restored wildtype Rhino localization to small, dispersed foci (Figure 3—figure supplement 1C). The prominent Rhino accumulations in CG2678 mutants were enriched in H3K9me3 and the Rhino co-factors Deadlock and Nxf3, indicating that they were genuine chromosomal Rhino domains (Figure 3—figure supplement 1D).

Rhino domains tend to localize at the nuclear periphery also in wildtype ovaries, resulting in a putative piRNA precursor export and biogenesis compartment continuous with cytoplasmic nuage (Zhang et al., 2012a). We hypothesized that the pulling force provided by the piRNA precursor export pathway centered on Nxf3 drags Rhino domains to nuclear pore complexes (ElMaghraby et al., 2019). Indeed, in CG2678,nxf3 double mutant flies, Rhino still accumulated in few, large foci but these were no longer confined to the nuclear envelope (Figure 3C). We termed CG2678 'kipferl' because of the prominent enrichments of Rhino in crescent-shaped structures that reminded us of a famous Austrian pastry.

To explore whether Rhino’s altered nuclear localization was linked to changes in its chromatin occupancy, we performed Rhino ChIP-seq experiments in kipferl-depleted ovaries. In the absence of Kipferl, Rhino’s chromatin association was severely reduced at most genomic sites (Figure 3D and E). These included the major piRNA cluster 80F, nearly all Rhino domains in pericentromeric heterochromatin, as well as stand-alone transposon insertions and euchromatic Rhino domains devoid of transposon insertions (Figure 3E and F). A similar phenotype was apparent in ovaries from flies carrying kipferl null mutant alleles (Figure 3—figure supplement 1E). Importantly, Rhino loss at these various genomic loci was not caused by reduced H3K9 methylation levels, as H3K9me2/3 levels remained high at Rhino-bound regions upon depletion of Kipferl (Figure 3G; Figure 3—figure supplement 1F).

Only 4.6% of all 1-kb tiles exhibiting a more than fourfold Rhino enrichment (n=2) in control flies remained Rhino-bound in kipferl-depleted ovaries. Among these were mainly tiles of piRNA cluster 42AB and, to a lesser extent, cluster 38C (Figure 3D). Fluorescence in situ hybridization (FISH) experiments confirmed that clusters 42AB and 38C are transcribed in kipferl mutants (Figure 3—figure supplement 1G). In line with only low residual Rhino binding especially at cluster 38C, we observe only weak overlap between GFP-Rhino and the RNA FISH signal for these piRNA clusters. Most importantly, however, their respective nuclear RNA-FISH signal did not co-localize with the strong, elongated Rhino accumulations at the nuclear envelope (Figure 3—figure supplement 1G). We therefore hypothesized that the prominent Rhino foci in kipferl mutants correspond to repetitive loci not assembled in the reference genome or not identifiable using genome-unique reads. Based on an analysis of all ChIP-seq reads, the average Rhino enrichment on transposon consensus sequences varied between two and 30-fold in wildtype ovaries, was non-detectable in ovaries depleted for Rhino, and was strongly reduced in the absence of Kipferl (Figure 3H). In contrast, several Satellite sequences, foremost the Responder and 1.688 family Satellites that give rise to Rhino-dependent piRNAs in wildtype ovaries and testes (Chen et al., 2021; Wei et al., 2021), maintained Rhino occupancy or accumulated even more Rhino than in wildtype ovaries (Figure 3I).

Consistent with elevated Rhino occupancy at both Satellites, we observed a corresponding increase in nascent transcripts as indicated by the enhanced RNA FISH signal in nurse cell nuclei of kipferl mutant ovaries compared to wildtype controls (Figure 3J). The FISH signal for Rsp and 1.688 transcripts in kipferl mutants overlapped precisely with the prominent Rhino accumulations at the nuclear envelope. Rsp and 1.688 Satellites form large repetitive tandem arrays in pericentromeric heterochromatin (Wu et al., 1988; Abad et al., 2000). The enormous size of these Satellite loci, particularly the 359 bp array (1.688 family) on the X-chromosome which extends over several megabasepairs, likely explains why Rhino foci in kipferl mutants are so prominent (Figure 3J). The strong Rhino accumulations found in the center of nurse cell nuclei of kipferl,nxf3 double mutant flies also overlapped with RNA FISH signal for Rsp and 1.688 Satellites, indicating that loss of nxf3 did not alter Rhino’s chromatin occupancy pattern (Figure 3—figure supplement 1H). In summary, Rhino’s re-distribution from hundreds of genomic loci to the large, pericentromeric Rsp and 1.688 satellite arrays, which accumulate at the nuclear periphery in an Nxf3-dependent manner, explain the name-giving Rhino localization phenotype in kipferl mutants.

kipferl mutant ovaries display piRNA losses and transposon de-repression

To investigate whether Rhino’s altered chromatin occupancy in kipferl mutants affects piRNA production, we compared Argonaute-bound small RNA populations from kipferl-depleted ovaries to those from rhino-depleted and control ovaries. Total piRNA levels, normalized to miRNA reads, were reduced 4.5-fold in rhino-depleted but only 1.5-fold in kipferl-depleted ovaries (Figure 4—figure supplement 1A). We grouped genomic piRNA sources into somatic source loci (e.g. flamenco), Rhino-independent germline source loci (e.g. uni-strand cluster 20A) and Rhino-dependent germline source loci (e.g. dual-strand clusters 38C, 42AB, 80F) (Figure 4—figure supplement 1B, Mohn et al., 2014). kipferl-depleted ovaries exhibited reduced piRNA levels only from Rhino-dependent germline source loci (Figure 4A). A prominent example was cluster 80F, where piRNA production collapsed in the absence of Kipferl to the same extent as seen in rhino-depleted ovaries (Figure 4A and B). At many other sites, piRNA losses were less severe compared to rhino-depleted ovaries. Among them were piRNA clusters 38C and 42AB, consistent with residual Rhino binding to both clusters in kipferl-depleted ovaries (Figure 4A and B; Figure 3D and E).

Figure 4. kipferl mutant ovaries display piRNA losses and transposon de-repression.

(A) Violin plots showing log2-fold changes in levels of uniquely mapping piRNAs on 1-kb tiles relative to control upon MTD-Gal4 mediated knock down of rhino or kipferl (1-kb tiles were categorized into somatic source loci, Rhino-independent germline source loci and Rhino-dependent germline source loci according to Figure 4—figure supplement 1B, n=1). *** and n.s. corresponds to p<0.001 or p>0.05, respectively, based on Wilcoxon signed-rank test. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (B) UCSC genome browser tracks displaying piRNA levels at clusters 80F and 42AB in control, kipferl, and rhino knock down ovaries (ChIP-seq signal from MTD-Gal4 control ovaries is depicted as coverage per million reads, piRNA coverage was normalized to miRNA reads, data is given for one replicate). (C) Jitter plot depicting log2-fold changes for piRNA levels mapping antisense to transposon consensus sequences in indicated MTD-Gal4 mediated knock downs compared to control (transposons classified analogous to panel A, n=1). *** and n.s. corresponds to p<0.001 or p>0.05, respectively, based on Wilcoxon signed-rank test. (D, E) Jitter plots showing piRNA levels (per kb sequence) in indicated genotypes mapping to transposons (antisense only) giving rise to Rhino-dependent piRNAs (D) or to Satellite repeats (E). Blue and green dots in panel D indicate fragments of the respective transposon in piRNA clusters 38C, 42AB, or 80F (n=1, compare to Figure 4—figure supplement 1C for full mutant data, and see Figure 4—source data 1). (F) piRNA profiles across the consensus sequences for a representative transposon (Max) and the 359bp 1.688 Satellite. piRNA counts (normalized to miRNAs) are displayed for indicated genotypes. (G) Volcano plot depicting the log2-fold changes in poly-adenylated transposon transcripts in kipferl-depleted versus control ovaries (n=3). (H) Confocal images showing RNA FISH signal for 3S18 transcripts in w1118 and kipferl null mutant ovaries (scale bar: 20 µm).

Figure 4—source data 1. Normalized small RNA-seq counts and their mappings to transposable element sequences.

Figure 4.

Figure 4—figure supplement 1. Loss of Kipferl affects the expression of repetitive elements but not mRNAs in ovaries.

Figure 4—figure supplement 1.

(A) Size profiles of Argonaute-bound small RNAs (miRNA-normalized) isolated from ovaries of indicated genotypes. microRNA reads are shown separately (grey bars). Percentages represent total piRNA levels (23-33nt) relative to control (100%). (B) Classification of genomic 1-kb tiles into somatic (e.g. flamenco), Rhino-independent (e.g. cluster 20A), and Rhino-dependent germline source loci (e.g. clusters 38C, 42AB, 80F). Binary cutoffs are applied at a soma index of 2 do distinguish somatic from germline source loci (see Methods section) and a fourfold reduction in piRNAs upon rhino depletion to determine dependency on Rhino. (C) Jitter plots showing piRNA levels (per kb sequence) in indicated genotypes mapping to transposons (antisense only) giving rise to Rhino-dependent piRNAs or to Satellite repeats. (D) Confocal images showing RNA FISH signal of transcripts for the indicated transposons in w1118 and kipferl whole locus deletion egg chambers (scale bar: 20 µm; purple signal: DAPI). (E) Scatter plots contrasting RNAseq signal (poly-A plus) for expressed genes in indicated genotypes. Values are displayed as Gene length corrected trimmed mean of M-values (GeTMM; Blue dot: rhino. Green dot: kipferl. n=3).

The selective impact of kipferl depletion on germline-specific, Rhino-dependent piRNA source loci was also reflected when piRNAs mapping in antisense orientation to transposon consensus sequences were analyzed (Figure 4C). piRNAs targeting soma-controlled transposons (e.g. Tabor, gypsy5, ZAM; mostly originating from flamenco) were not affected in rhino- or kipferl-depleted ovaries. Among the transposons with dominating germline piRNA populations, nine are targeted by Rhino- and Kipferl-independent piRNAs originating from Rhino-independent source loci like cluster 20A (e.g. 297, roo). All other transposons were targeted by Rhino-dependent piRNAs. For most of these elements, piRNA pools were reduced in kipferl-depleted ovaries (Figure 4D left, E). The few transposons that retained piRNAs or even displayed increased piRNA levels had insertions in piRNA clusters 38C or 42AB, where Rhino binding was largely Kipferl-independent (Figure 4D right).

We finally analyzed the TAS, Rsp, and 1.688 Satellite repeats, which give rise to Rhino-dependent piRNAs (Figure 4F, Chen et al., 2021; Wei et al., 2021). While piRNA levels from sub-telomeric TAS Satellites remained unchanged, those derived from the Rsp and 1.688 Satellites increased strongly in kipferl-depleted ovaries compared to control, consistent with increased Rhino occupancy at these loci (Figure 4E and F; Figure 3I). As a result, Satellite piRNAs which accounted for only 4% of all piRNAs in wildtype ovaries, accounted for more than one in five piRNAs (21%) in kipferl-depleted ovaries. We observed a highly similar redistribution of the piRNA pool in kipferl null mutant ovaries (outliers were likely due to strain-specific differences in the transposon profile) (Figure 4—figure supplement 1C). Thus, in the absence of Kipferl, redistribution of Rhino from its natural binding sites to pericentromeric Satellite repeats results in a substantial loss of transposon targeting piRNAs and increased Satellite piRNAs.

The levels of poly-adenylated transcripts for a number of transposable elements increased significantly in kipferl-depleted ovaries (Figure 4G). RNA FISH experiments confirmed the de-repression of these elements in kipferl null mutant ovaries (Figure 4H; Figure 4—figure supplement 1D). In line with their upregulation, levels of piRNAs antisense to these elements were reduced in Kipferl-depleted ovaries. We note that several other elements with similar losses of antisense piRNAs were not derepressed. In general, transposon de-repression in ovaries lacking Kipferl was less severe compared to ovaries lacking Rhino (Mohn et al., 2014). This might be due to the milder loss of piRNAs in kipferl mutants compared to rhino mutants, or due to Kipferl being expressed only upon germ cell differentiation.

Our combined genetic data suggested that Kipferl acts specifically in the piRNA pathway. Consistent with this, loss of Kipferl had hardly any effect on overall gene expression, similar to the piRNA-specific factor Rhino (Figure 4—figure supplement 1E). By defining most of Rhino’s genomic binding sites and by preventing aberrant Rhino accumulations at Satellite repeats, Kipferl enables nurse cells to mount a robust and diverse piRNA defense that is essential for the tight repression of transposable elements. Based on these findings, we set out to understand the molecular connections between Kipferl, Rhino, and DNA/chromatin.

Kipferl nucleates Rhino domains within H3K9me2/3 loci and binds guanosine-rich DNA motifs

Considering the strong impact of Kipferl depletion on Rhino’s chromatin occupancy, together with the C2H2 ZnF arrays contained in Kipferl, we hypothesized that Kipferl is a direct, sequence-specific DNA binder that functions upstream of Rhino and recruits and/or stabilizes Rhino on chromatin at its binding sites if they are located within an H3K9me2/3 domain. To test this, we investigated Kipferl’s chromatin binding capacity in the absence of Rhino. In rhino mutant ovaries, Kipferl was distributed throughout nurse cell nuclei rather than being enriched in discrete nuclear foci as in wildtype ovaries (Figure 5A). ChIP-seq experiments revealed that Kipferl nevertheless remained bound at most genomic 1-kb tiles that it occupied in wildtype ovaries (Figure 5B). Kipferl enrichment levels, however, often differed between rhino-depleted and control ovaries, with a strong decrease observed at several regions, including the largely Kipferl-independent piRNA clusters 42AB and 38C.

Figure 5. Kipferl binds guanosine-rich DNA motifs and nucleates Rhino domains within H3K9me2/3 loci.

(A) Confocal images of nurse cell nuclei expressing FLAG-tagged Kipferl in indicated genetic backgrounds (scale bar: 5 µm; nuclear outlines based on DAPI as orange dotted line). (B) Scatter plot of genomic 1-kb tiles contrasting average log2-fold Kipferl ChIP-seq enrichment in ovaries with MTD-Gal4 driven rhino knock down (n=2) versus control ovaries (n=3). (C) Heat maps depicting ChIP-seq signal for Kipferl and Rhino in indicated genotypes from representative replicate experiments, centered on broad Kipferl peaks detected in two independent ChIP-seq experiments of w1118 ovaries (data sorted by Kipferl signal in w1118). (D) Meta profile showing Kipferl signal in wildtype and rhino mutant ovaries at all narrow Kipferl peaks determined in rhino mutant ovaries (top). The same peaks are divided into Rhino-Kipferl co-occupied (dotted line) and Kipferl-only peaks (dashed line) based on intersection with wildtype Rhino domains (bottom). (E) Heat maps depicting indicated ChIP-seq signal in indicated genotypes from representative replicate experiments centered on narrow Kipferl peaks detected in two independent ChIP-seq experiments of rhino mutant ovaries (data sorted by Rhino signal in w1118). (F, G) UCSC genome browser tracks depicting Kipferl and Rhino ChIP-seq signals in indicated genotypes for diverse Rhino domains with (F) or without (G) pronounced Kipferl nucleation sites. Zoomed in view of cluster 42AB is given to appreciate small Kipferl enrichments in the peripheral part of the cluster (ChIP-seq signal depicted as coverage per million reads; piRNA coverage normalized to miRNA reads, data is given for one representative replicate). (H) Schematic representation of Kipferl protein domain architecture and binding motifs predicted for the two ZnF arrays (Princeton Cys2His2 PWM predictor). (I) Position weight matrix of Kipferl consensus motif determined by HOMER from top 3000 narrow Kipferl peaks found in two independent ChIP-seq replicates from rhino mutant ovaries. The GRGGN motif was found at least once in 83.4% of peaks versus 50.3% in control sequences (P<10–322). (J) Heat map depicting Rhino-independent Kipferl ChIP-seq signal and GRGGN motif enrichment centered on narrow Kipferl peaks analogous to panel E, sorted by Kipferl signal in rhi[-/-] (motif count: # of motifs per non-overlapping genomic 100-bp window).

Figure 5.

Figure 5—figure supplement 1. Kipferl binds guanosine-rich DNA motifs.

Figure 5—figure supplement 1.

(A) Position weight matrices for highest ranking additional motifs recovered in the top 3000 narrow Kipferl peaks present in two independent replicates of rhino mutant ovaries (p-values indicate significance of motif enrichment over sequence matched control regions). (B) Heat map centered on Kipferl peaks detected via FLAG ChIP-seq in cultured ovarian somatic stem cells (OSCs) stably expressing FLAG-tagged Kipferl under control of the ubi63E promoter. GRGGN motif count is given as count per non-overlapping genomic 100-bp window. ChIP signal is depicted as coverage per million reads for one experiment.

To further investigate the relationship of Rhino and Kipferl at chromatin, we determined Kipferl-bound regions in wildtype ovaries and compared the chromatin occupancy of both proteins at these sites in each other’s absence (Figure 5C). This revealed a strong interdependence of Rhino and Kipferl at regions they co-occupied in wildtype ovaries, which included most heterochromatic Kipferl binding sites, as well as roughly 60% of the euchromatic sites. In agreement with our previous results, Rhino’s chromatin occupancy was highly dependent on Kipferl. Loss of Rhino had a more complex effect on Kipferl’s binding pattern: while the Kipferl ChIP-seq signal was reduced at sites co-occupied by Rhino, residual signal was visible at many of these loci. Furthermore, Kipferl binding at sites lacking Rhino enrichment was fully maintained in the absence of Rhino, supporting Kipferl’s ability to bind to chromatin without Rhino. Prompted by these observations, we determined Kipferl peaks in the absence of Rhino, refocusing our analysis on Kipferl’s intrinsic, Rhino-independent chromatin binding sites. In rhino mutants, Kipferl was enriched at thousands of narrow peaks. Comparison of the Kipferl signal between wildtype and rhino mutants at these sites confirmed that Kipferl’s chromatin occupancy was reduced at sites that were co-occupied by Rhino in wildtype ovaries, while it was unaffected at Kipferl-only peaks (Figure 5D). Moreover, while Kipferl bound to chromatin in similarly narrow peaks at stand-alone and Rhino-occupied regions in rhino mutants, Rhino seemingly supported the spreading of Kipferl to larger domains in wildtype ovaries.

Kipferl binding sites in rhino mutants were found both in pericentromeric heterochromatin (802 peaks) and within chromosomal arms (3334 peaks), and generally coincided with regions bound by Kipferl and Rhino in wildtype ovaries, demonstrating that Kipferl binds the same sites in both genotypes, yet in different patterns (Figure 5E). Kipferl’s chromatin enrichment was strengthened and expanded from the narrow peaks seen in rhino mutant ovaries to broad domains in wildtype ovaries, specifically at peaks where Rhino was present. Kipferl peaks that did not show Rhino binding, on the other hand, did not widen into extended domains. Importantly, Rhino recruitment to Kipferl peaks depended on the presence of H3K9me2/3: Rhino occupied the majority of Kipferl peaks in pericentromeric heterochromatin where H3K9me2/3 is abundant. Within chromosomal arms, Rhino accumulated preferentially at those Kipferl sites that were within a local H3K9me2/3 domain. These findings supported a model in which Kipferl binds chromatin independently of Rhino, Kipferl binding sites act as Rhino nucleation sites when inside local heterochromatin, and Kipferl and Rhino cooperate and spread from nucleation sites into flanking heterochromatic regions, resulting in extended Rhino/Kipferl domains. Consistent with this, Rhino occupancy at and around Kipferl binding sites was dependent on Kipferl and overexpression of Kipferl in germline cells resulted in increased Kipferl binding at all sites, leading to the strengthening of Rhino/Kipferl domains within H3K9me2/3 domains, and the formation of additional small Rhino domains at pre-existing Kipferl sites with low, but detectable H3K9me2/3 levels (Figure 5E).

Kipferl’s intrinsic chromatin binding profile in rhino mutants was strongly predictive of Rhino’s chromatin occupancy, often mirroring the non-uniform enrichment of Rhino in wildtype ovaries, only at lower levels (Figure 5F). Indeed, 60% of Kipferl-dependent Rhino bound regions contained or were within 5 kb of a Kipferl nucleation site. However, at several loci (e.g. the eyeless gene), the extended Rhino domain in wildtype flies encompassed only very weak or no putative Kipferl nucleation sites (Figure 5G). Considering that Rhino depended on Kipferl also at these loci, we speculate that here, both proteins, supported by local H3K9-methylation and putative additional factors, are required for formation of a stable Rhino/Kipferl domain. Finally, in line with the largely Kipferl-independent Rhino occupancy at piRNA clusters 42AB and 38C, we find only weak intrinsic Kipferl binding at these loci (Figure 5G). Nevertheless, Kipferl bound strongly to these clusters in wildtype ovaries, implying that Rhino is capable of stabilizing Kipferl on chromatin. Upon closer inspection, Kipferl influenced Rhino’s chromatin profile even at the Kipferl-independent piRNA cluster 42AB: While Rhino remained bound at the highly repetitive central regions of this cluster in the absence of Kipferl, several small Kipferl nucleation sites towards the periphery of the cluster seem to support the formation of an extended Rhino domain in wildtype ovaries (Figure 5G, zoom). Based on these data, we conclude that Kipferl is a major specificity factor for Rhino in ovaries, and that Kipferl cooperates with Rhino to form extended Rhino domains from defined nucleation sites, with both proteins supporting each other’s chromatin occupancy. Our data further indicate that Kipferl is not the only Rhino specificity factor but demonstrate that it is also required for the stabilization of Rhino domains nucleated by alternative means.

According to the nucleation-site model, Kipferl is expected to bind to specific DNA motifs. Consistent with this, its two ZnF arrays are predicted to bind DNA with a specificity for guanosine-rich motifs (Figure 5H, Persikov and Singh, 2014). We determined sequence motifs that were enriched in Kipferl ChIP-seq peaks identified in rhino mutant ovaries. The top enriched motif (found in >80% of all peaks; p<10–322) closely matched the in silico predictions for Kipferl’s ZnF arrays (Figure 5I). This GRGGN motif was locally enriched at experimentally determined Kipferl peaks, regardless of whether the peak was located in pericentromeric heterochromatin or chromosomal arms, or whether it was within a Rhino domain or constituted a Kipferl-only site (Figure 5J). Additional motifs detected at lower frequency and confidence often displayed variations of the same guanosine-rich motif (Figure 5—figure supplement 1A). In support of Kipferl’s specificity for a GRGGN sequence, stable overexpression of FLAG-tagged Kipferl in cultured ovarian somatic stem cells (OSCs), which do not express Kipferl or Rhino, resulted in Kipferl binding at thousands of defined sites that were enriched in the GRGGN sequence motif (Figure 5—figure supplement 1B). Taken together, although future biochemical experiments will be required to confirm a direct interaction with DNA, our data support a model where the sequence-specific chromatin binding of Kipferl underlies the recruitment and/or stabilization of Rhino at chromatin.

Structure-function analysis of Kipferl’s DNA and Rhino binding activities

The yeast two-hybrid results (Supplementary file 2) and ChIP-seq analyses (Figure 5) suggested that Kipferl binds to both, Rhino and DNA. To examine how these two molecular activities are encoded within the Kipferl protein, we created flies that expressed truncated FLAG-tagged variants instead of endogenous Kipferl, lacking either one of the ZnF arrays (KipferlΔ1st-array, KipferlΔ2nd-array) or the N-terminal ZAD (KipferlΔZAD) (Figure 6A). KipferlΔ1st-array showed strongly reduced chromatin binding capabilities (Figure 6B). Deletion of the second ZnF array (KipferlΔ2nd-array) instead had only mild impacts. The N-terminal ZAD, which is characteristic for the >90 ZAD-ZnF family members in D. melanogaster, promotes anti-parallel homodimerization but has not been directly linked to DNA binding (Jauch et al., 2003; Bonchuk et al., 2021). Yeast two-hybrid experiments confirmed the dimerization capability of Kipferl’s ZAD (Figure 6—figure supplement 1A). Deletion of the ZAD, and therefore abrogation of Kipferl dimerization, resulted in a global loss of Kipferl ChIP-seq signal (Figure 6B), consistent with studies about other ZAD-ZnF proteins (Maksimenko et al., 2020). Together, these findings indicated that Kipferl binds chromatin as a dimer, primarily via its first ZnF array.

Figure 6. Structure-function analysis of Kipferl’s DNA and Rhino binding activities.

(A) Schematic representation of Kipferl rescue constructs harboring the wildtype protein sequence or indicated deletions, as well as an internal 3xFLAG tag. Rescue constructs were introduced into the endogenous kipferl locus via RMCE. (B) Heat map displaying Kipferl variant ChIP-seq signal centered on peaks bound by Kipferl in rhino mutants (data sorted by the ChIP signal detected for the wildtype Kipferl rescue construct; shown are only peaks that are Kipferl-bound in ovaries expressing wildtype tagged Kipferl). (C) Schematic overview of yeast two-hybrid experiments determining the minimal Rhino-interacting fragment in the Kipferl protein. Positive interactions between Kipferl fragments fused to the Gal4 activating domain (AD) and Rhino fused to the Gal4 DNA binding domain (DBD) are indicated. The minimal Kipferl fragment required for the Rhino interaction is highlighted in red (the grey bar indicates a 25 amino acid exon not contained in the reference genome which we identified in a subset of lab strains and included here for completeness). (D) Confocal images of a representative nurse cell nucleus depicting localization of Kipferl lacking ZnF#4 and Rhino in flies expressing only KipferlΔZnF#4 (scale bar: 5 µm). (E) Violin plots showing the average log2-fold enrichment of Rhino ChIP-seq signal over input for Rhino-bound 1-kb tiles (classified in Figure 1D) in w1118 (n=2), kipferl mutant (n=3), KipferlΔZnF#4 (n=1) and rhino mutant (n=1) ovaries. *** corresponds to p<0.001 based on student’s t-test. Box plots show median (center line), with interquartile range (box) and whiskers indicate 1.5x interquartile range. (F) USCS genome browser tracks showing ChIP-seq signal (coverage per million sequenced reads for one representative replicate) for indicated proteins and genotypes at piRNA cluster 80F (left) and the headcase locus (right). (G) Heat map showing indicated ChIP-seq signal, centered on narrow Kipferl peaks detected in two independent ChIP-seq experiments of rhino mutant ovaries (data sorted by ChIP-seq signal detected for wildtype Kipferl rescue). (H) Schematic overview of yeast two-hybrid experiments determining the minimal Kipferl-interacting fragment in Rhino. Positive interactions between Rhino fragments fused to the Gal4 activating domain (AD) and the first ZnF array of Kipferl fused to the Gal4 DNA binding domain (DBD) are indicated. The minimal Rhino fragment required for the Kipferl interaction is highlighted in red. (I) USCS genome browser tracks showing ChIP-seq signal (coverage per million sequenced reads for one replicate) for indicated Rhino variants at piRNA cluster 80F (left) or the headcase locus (right).

Figure 6.

Figure 6—figure supplement 1. Kipferl's 4th zinc finger is required and sufficient for the interaction with Rhino.

Figure 6—figure supplement 1.

(A, C, D) Yeast two-hybrid assay on selective (left) and nonselective medium (right). AD: activation domain; DBD: DNA binding domain. Fragment identity is indicated in Figure 6C. (B) Confocal images showing entire egg chambers with immunofluorescence stainings for Rhino (magenta) and indicated FLAG-tagged Kipferl variants (cyan) (scale bars: 20 µm). (E) USCS genome browser tracks showing indicated ChIP-seq signal as coverage per million sequenced reads in indicated genotypes at piRNA cluster 42AB for one experiment. (F) Bar graph indicating average embryo hatching rate of eggs laid by w1118 control females in comparison to females harboring a deletion of Kipferl’s ZnF #4. Data represents average of three technical replicates for KipferΔZnF#4 (with error bar representing the standard deviation of the mean) and one replicate for w1118. Total number of eggs laid is indicated as n. (G) Yeast two-hybrid assay on selective (left) and nonselective medium (right). FL: full length. AD: activation domain; DBD: DNA binding domain. Fragment identity is indicated in Figure 6H.

To dissect the interaction of Kipferl with Rhino, we first examined the characteristic colocalization of Kipferl and Rhino in nurse cell nuclei. This revealed that both chromatin binding defective variants, KipferlΔ1st-array and KipferlΔZAD, failed to colocalize with Rhino (Figure 6—figure supplement 1B). In both genotypes, Rhino accumulated in prominent domains at the nuclear envelope, as seen in kipferl null mutant ovaries. Artificial dimerization of the KipferlΔZAD variant via the heterologous dimerization domain from the yeast Gcn4 transcription factor or the ZAD of Ouija board, a ZAD-ZnF protein not expressed in ovaries, restored co-localization of Kipferl with Rhino (Figure 6—figure supplement 1B, KipferlGCN4 and Kipferlouib). Thus, neither the ZAD, nor the second ZnF array are critical for binding to Rhino, suggesting that Kipferl’s first ZnF array, besides its central role in chromatin binding, might also enable Rhino binding.

Yeast two-hybrid experiments, probing full length Rhino against Kipferl fragments, confirmed that Rhino interacts with Kipferl’s first ZnF array, with no additional interaction interfaces being identified (Figure 6C, Figure 6—figure supplement 1C). To disentangle the putative DNA-binding and Rhino-binding activities of the first ZnF array, we further narrowed down the interaction between Rhino and Kipferl, revealing a critical role of the 4th ZnF (ZnF#4, Figure 6C; Figure 6—figure supplement 1C, D). We tested this putative split-of-function mutant in vivo. In flies expressing KipferlΔZnF#4 instead of the endogenous protein, Rhino did not co-localize with Kipferl, and displayed the characteristic kipferl mutant phenotype (Figure 6D; Figure 6—figure supplement 1B). Moreover, Rhino lost its chromatin occupancy in kipferlΔZnF#4 flies in a pattern also seen in kipferl null mutants (Figure 6E). This included a complete loss of Rhino at Kipferl-dependent loci like cluster 80F or the headcase locus, but also remaining Rhino signal at Kipferl-independent loci like cluster 42AB, where Rhino’s ChIP-seq signal in kipferlΔZnF#4 or kipferl null mutant flies was comparable (Figure 6F; Figure 6—figure supplement 1E). KipferlΔZnF#4 was diffusely localized in nurse cell nuclei akin to wildtype Kipferl in rhino mutants, and kipferlΔZnF#4 females were sub-fertile (Figure 6D; Figure 6—figure supplement 1F). Strikingly, KipferlΔZnF#4 retained full chromatin binding ability as it was enriched at Kipferl binding sites in a pattern closely mirroring that of wildtype Kipferl in a rhino mutant (Figure 6G). The Rhino- and chromatin-binding activities of Kipferl can therefore be uncoupled: ZnFs 1–3, supplemented by the second ZnF array, allow for putative sequence specific DNA binding, while ZnF 4 interacts with Rhino.

The finding that the 4th C2H2 ZnF fold in Kipferl is sufficient (Figure 6C) and required (Figure 6D and E) for Rhino binding was intriguing. Canonical HP1 interactors typically bind the chromoshadow domain dimer of HP1 proteins via PxVxL motif-containing peptides (Thiru et al., 2004). No such motif was found in Kipferl’s 4th ZnF. We inverted the yeast two-hybrid assay to determine which region of Rhino interacts with Kipferl and discovered that Kipferl interacts with Rhino’s chromodomain (Figure 6H; Figure 6—figure supplement 1G). Kipferl did not interact with the Rhino chromoshadow domain, the Rhino hinge region, or the chromodomain of Su(var)2–5. To test these findings in vivo, we generated flies that expressed Rhino variants with an artificial hinge or with a Gcn4 dimerization domain instead of the chromoshadow domain. Both RhinoCSD-Gcn4 and Rhinoart.-hinge failed to form extended Rhino domains like the wildtype protein, but were enriched at prominent Kipferl nucleation sites such as those in cluster 80F or upstream of the headcase gene (Figure 6I). Flies that expressed a Rhino variant with the chromodomain of Su(var)2–5 had rudimentary ovaries, precluding a meaningful ChIP-seq analysis, further supporting a model where the chromodomain contributes more than just binding to methylated H3K9. Our combined data indicate that Kipferl recruits Rhino to chromatin via a direct contact between its 4th ZnF and the Rhino chromodomain, and that functional Rhino is required for the extension and strengthening of Rhino/Kipferl domains.

Kipferl is required for Rhino domains at diverse stand-alone transposon loci

The piRNA pathway’s primary role is to silence transposable elements. Given their sequence diversity, it was surprising to find that a DNA binding protein with affinity for a short nucleotide motif is required for Rhino’s chromatin occupancy and hence the determination of the ovarian piRNA pool. In fact, transposon sequences overall do not harbor more GRGGN motif occurrences than random genomic tiles (Figure 7—figure supplement 1A). Instead, we observed a large spread of motif density among transposons (Figure 7—figure supplement 1B). The GC-rich gypsy8 and Rt1a/b/c elements harbored between 29 and 38 motifs, while the AT-rich Rsp and 1.688 Satellites harbored none (Figure 7A; Figure 7—figure supplement 1C). Motif density per kilobase showed a significant correlation with Kipferl’s intrinsic ability to bind to transposon sequences (as measured by Kipferl ChIP-seq enrichment in rhino mutant ovaries; R=0.64, p<2.2e-16) (Figure 7B). For most elements, the extent of Rhino-independent Kipferl enrichment was moreover directly correlated with their respective Rhino enrichment levels in wildtype ovaries (Figure 7A and C; R=0.83, p<2.2e-16). Exceptions were telomeric transposons and a few elements contained in clusters 42AB or 38C; these transposons were occupied by Rhino in wildtype ovaries despite no baseline Kipferl binding, and they maintained Rhino binding in kipferl-depleted ovaries, supporting the previous notion that Kipferl-independent chromatin recruitment mechanisms for Rhino likely exist (Figure 7C; Figure 7—figure supplement 1D). As the occurrence of the guanosine-rich Kipferl motif correlated with overall GC-content, enrichments for both Kipferl and Rhino on transposons also correlated with the elements GC-content (Figure 7—figure supplement 1B, E, F; Rmotif-GC=0.78, p<2.2e-16; RGC-Kipf=0.71, p<2.2e-16; RGC-Rhi=0.59, p<1.1e-13). The correlation between GC-content and Rhino enrichment was abolished in ovaries depleted for Kipferl, indicating that Rhino’s preferential binding to GC-rich transposons is due to Kipferl-mediated recruitment and/or stabilization of Rhino on DNA sequences with Kipferl motifs (Figure 7—figure supplement 1G, R = 0.08, p=0.34).

Figure 7. Kipferl is required for the establishment of Rhino domains at stand-alone transposon loci.

(A) ChIP-seq enrichment profiles on consensus sequences of transposons with high (gypsy8 and Rt1b) or low (Burdock and 3S18) number of Kipferl DNA binding motifs per kb sequence. Indicated ChIP-seq signals are displayed as average enrichment over input in two (Kipferl) or three replicates (Rhino) of ovaries from rhino mutant (top tracks) or MTD-Gal4 >wsh control ovaries (bottom tracks). Red bars indicate motif instances on the sense or antisense strand. Numbers indicate average enrichment across the entire element. (B) Scatter plot correlating the GRGGN motif count (normalized to element length) to the Rhino-independent Kipferl ChIP-seq enrichment for each transposon (ChIP-seq enrichments depict average of two independent experiments). (C) Scatter plot depicting the relation between wildtype Rhino ChIP-seq enrichments and Rhino-independent Kipferl ChIP-seq enrichments per transposon (average of two (Kipferl) or three (Rhino) independent experiments; elements indicated in grey are bound by Rhino in a largely Kipferl-independent manner, see also Figure 7—figure supplement 1D). (D) Heat maps depicting indicated ChIP-seq signal in the genomic regions flanking 285 euchromatic stand-alone insertions (red triangles) of Rhino-dependent transposons with low Rhino-independent Kipferl binding (data sorted by the ChIP-seq signal detected for Rhino in MTD-Gal4 >w-sh ovaries). (E) UCSC genome browser tracks of stand-alone transposon insertions found in MTD-Gal4 or w1118 strains, depicting examples of different potential modes of Kipferl dependency. ChIP-seq signal is shown as coverage per million reads for one representative replicate.

Figure 7.

Figure 7—figure supplement 1. GC content and number of GRGGN motifs are predictive of Kipferl and Rhino binding at transposon loci.

Figure 7—figure supplement 1.

(A) Bar graphs showing the distribution of motif frequency in all genomic 1-kb tiles (top) and in transposon consensus sequences (bottom). (B) Scatter plot of overall GC content per transposon sequence versus the number of GRGGN motifs normalized to transposon length. (C) ChIP-seq enrichment profiles (over input) along the length of representative members of the Rsp and 1.688 family of Satellite repeats (both repeats lack Kipferl GRGGN motifs). Indicated ChIP-seq signals are displayed as average enrichment over input in two (Kipferl) or three replicates (Rhino) of ovaries from rhino mutant or control (MTD-Gal4 >w-sh) ovaries. Average enrichment across the entire element is indicated for each ChIP. (D) Scatter plot contrasting Rhino ChIP-seq enrichment (over input) on transposon sequences in control (n=3) versus kipferl-depleted (n=2) ovaries. Elements bound by Rhino independently of Kipferl (see Figure 7C) are enlarged and highlighted in grey (those elements with insertions in piRNA clusters 42AB and 38C are additionally indicated in light and dark blue, respectively, and telomeric elements are indicated in red). (E, F) Scatter plots correlating the overall GC nucleotide content per transposon to the ChIP-seq enrichment detected for Kipferl in rhino knock out ovaries (F) and for Rhino in wildtype (G) or kipferl-depleted ovaries. (G) Scatter plot comparing the average Rhino ChIP-seq enrichment in two replicate experiments on kipferl-depleted ovaries to the overall GC nucleotide content per transposon. (H, I) Left: Classification of transposons into different grades of Rhino (H) and Kipferl (I) dependency based on poly-adenylated sense transcripts detected in wildtype control ovaries versus rhino or kipferl-depleted ovaries (three replicates each). Values are displayed as Gene length corrected trimmed mean of M-values (GeTMM). Right: Scatter plot analogous to Figure 7C depicting the distribution of Rhino (H) or Kipferl (I) dependent elements in respect to their occupancy by Rhino in MTD-Gal4 >w-sh ovaries, as well as their Rhino-independent Kipferl affinity.

Many transposons did not show baseline Kipferl binding, yet their low-level Rhino enrichment in wildtype ovaries still depended on Kipferl. Moreover, several of these transposons (e.g. Burdock, HMS-Beagle, 3S18) were strongly dependent on Rhino for their silencing, and were deregulated also upon loss of Kipferl, although to a weaker extent (Figure 7A; Figure 7—figure supplement 1H, I). The recent observation that flies mutant for the three piRNA clusters 38C, 42AB, and 20A are fully fertile and do not show upregulation of these elements (Gebert et al., 2021) supports the proposal that stand-alone insertions likely act as independent Rhino domains (Mohn et al., 2014; Shpiz et al., 2014), providing piRNAs capable of targeting other insertions in trans. Considering that only about 20% of transposon insertions are transformed into Rhino domains (Akulenko et al., 2018), the average Rhino enrichment mapped to the consensus sequence would be low. We determined all stand-alone insertions of transposons lacking baseline Kipferl-binding in the MTD-Gal4 strain and displayed the levels of Rhino, Kipferl, H3K9me2/3, and piRNAs in the flanking genomic regions of these insertions (Figure 7D). While nearly all insertions were embedded in a local H3K9me2/3 domain, Rhino and Kipferl were only enriched at a subset of these insertions. RNAi-mediated depletion of Piwi or Kipferl resulted in loss of Rhino and piRNAs at these insertions. However, while Piwi loss impaired local heterochromatin formation, Kipferl loss did not (Figure 7D). We conclude that a combination of Kipferl and Piwi-dependent H3K9me2/3 is required to stabilize Rhino even on those transposons that do not contain strong Kipferl nucleation sites.

Based on manual inspection of individual transposon insertions in the different examined fly strains (illustrated in Figure 7E), we propose that Kipferl supports Rhino at stand-alone transposon insertions in one of two ways. In some instances, we find that Rhino-bound insertions occurred nearby a genomic Kipferl binding site, which might be a critical factor to establish a local Rhino domain. In other instances, we find no nearby Kipferl binding site. Here, Kipferl likely acts by stabilizing Rhino which might have been recruited through alternative nucleation factors akin to Kipferl’s role at piRNA clusters 38C and 42AB, where Rhino remains chromatin bound in Kipferl mutant ovaries. Taken altogether, our data indicate that Kipferl acts as a recruitment factor for Rhino, and that it stabilizes the formation of extended Rhino domains, both at its own recruitment sites and at domains nucleated by alternative specificity factors.

Discussion

This study provides direct evidence that DNA sequence is an important determinant of how germline cells define the chromatin binding sites of the fast-evolving HP1 variant protein Rhino. The discovery of Kipferl, a putative DNA-binding zinc finger protein, that cooperates with the H3K9me3 chromatin mark and serves as Rhino guidance factor critically advances our understanding of how dual-strand piRNA source loci are specified.

By systematically comparing different Drosophila melanogaster strains, we find that Rhino domains in ovaries are largely identical between strains yet highly diverse in their genomic location, underlying sequence content, and piRNA output. Given the affinity of Rhino to the histone mark H3K9me3 (Mohn et al., 2014; Le Thomas et al., 2014; Yu et al., 2015), current models postulate that maternally inherited piRNAs define the cellular Rhino profile through epigenetic mechanisms during early embryogenesis (Mohn et al., 2014; Le Thomas et al., 2014; Shpiz et al., 2014; Akkouche et al., 2017). Indeed, Rhino domains are invariably accompanied by local di- and/or tri-methylation of H3K9. However, despite being targeted by maternally inherited piRNAs, not all transposon insertions are converted into a Rhino domain (Akulenko et al., 2018). Moreover, while Piwi depletion leads to a loss of Rhino from chromatin at stand-alone transposon insertions, where H3K9me2/3 is dependent on Piwi, most piRNA source loci, including the major piRNA clusters, retain Rhino binding in the absence of Piwi (Mohn et al., 2014). Finally, we discovered several transposon-free Rhino domains that produce only trace amounts of piRNAs. These domains are difficult to explain by maternally deposited piRNAs as Rhino specifiers, suggesting that additional, locus-specific mechanisms must be in place to define Rhino’s chromatin binding profile. With Kipferl we present a factor capable of chromatin binding upstream of Rhino that has a major impact on nearly all Rhino domains in ovaries, irrespective of their position, transposon content, or piRNA output. Importantly, Kipferl does not influence the H3K9 methylation status, but rather stabilizes Rhino at most of its genomic targets, where H3K9me2/3 is provided through parallel pathways. In this manner, Rhino domains could be established at loci where heterochromatin coincides with Kipferl’s intrinsic chromatin binding pattern. This model provides a possible explanation for the relatively low fraction of transposon insertions that are bound by Rhino, despite piRNA targeting at many other sites. Thereby, Kipferl, with its specific chromatin binding activity, serves as the first example of the long sought-after guidance cue(s) required for specifying Rhino’s binding profile within heterochromatin.

While Kipferl binds to chromatin upstream of Rhino at a multitude of pronounced Rhino domains, not all Kipferl-dependent Rhino domains contain obvious Kipferl nucleation sites. Here, a combination of low intrinsic affinities of Kipferl and Rhino might underlie their observed co-dependence. Alternatively, additional guidance factors might exist, able to facilitate the recruitment, but not the stable binding of Rhino at these sites. The rare, Kipferl-independent Rhino binding at certain loci (e.g. clusters 42AB and 38C) argues in favor of additional specificity factors. An intriguing observation from our work is that Kipferl’s spatio-temporal expression profile differs from that of Rhino: Kipferl levels in ovarian germline stem cells and cystoblasts are very low, and the protein is absent in testes. Kipferl’s ovary-specific expression is therefore a likely contributor for the pronounced differences in Rhino landscapes between ovaries and testes (Chen et al., 2021). At the same time, it implies the existence of additional Kipferl-like factors or complementary molecular mechanisms that collaborate with H3K9me2/3 in testes or during early oogenesis where Rhino is functional in the absence of Kipferl. It will be interesting to see whether other Rhino guidance factors share Kipferl’s function as Rhino stabilizer, or if Kipferl presents a special example of a Rhino guidance and support factor.

Our data indicate that Rhino binds to Kipferl via its chromodomain. Binding of client proteins has so far been assigned to the dimeric C-terminal chromoshadow domain of HP1 family proteins. By recruiting and/or stabilizing Rhino on chromatin via the chromodomain, Kipferl-binding would be compatible with the recruitment of downstream factors like Deadlock via Rhino’s chromoshadow domain (Yu et al., 2018). Our genetic data imply that DNA binding of Kipferl as well as H3K9me2/3 binding of Rhino are required for the formation of a stable Kipferl-Rhino complex on chromatin. Biochemical and structural work will be required to understand how the Kipferl-Rhino interaction evolved to be specific to the Rhino chromodomain and how it is compatible with simultaneous H3K9me2/3 binding. We were unable to generate Kipferl as a recombinant protein to study its DNA-binding potential or the Rhino-Kipferl interaction in vitro. It will further be interesting to investigate whether other chromatin readers utilize a similar mode of specialized recruitment for additional specificity. Known examples where ZnF proteins are involved in HP1 recruitment are the Drosophila HP1 variants HP1b and c, who rely on their cofactors Woc and Row for chromatin binding (Font-Burgada et al., 2008). In mouse embryonic stem cells, HP1beta/gamma bind to selected chromatin sites in a complex with the chromatin remodeler CHD4 and the activity-dependent neuroprotective protein (ADNP) (Ostapcuk et al., 2018). In both cases, interaction with ZnF proteins allows the H3K9me3-independent recruitment of HP1 variants to sites where their function has not been fully elucidated. This mode of recruitment is distinct from Kipferl’s mode of action, which synergizes with underlying H3K9me2/3 at its binding sites to recruit Rhino to chromatin.

With an N-terminal ZAD, Kipferl belongs to the largest group of ZnF proteins in insects, with 92 members in Drosophila melanogaster (Chung et al., 2007; Chung et al., 2002). The functions of the few characterized ZAD-ZnF proteins are diverse and range from transcriptional regulation to chromatin organization and heterochromatin biology. Many ZAD proteins are preferentially expressed in ovaries and during early embryogenesis (Shapiro-Kulnane et al., 2021), and some of them, despite having essential functions, are fast evolving (Kasinathan et al., 2020). It is therefore possible that additional ZAD-ZnF proteins function as guidance factors for Rhino or other HP1 proteins in the germline. If that were the case, the transposon-genome arms race might be a key driver of the diversification of ZAD-ZnF proteins. Diversification of ZnF containing genes is also observed for KRAB or SCAN domain zinc fingers, whose radiation in tetrapods is believed to be fueled by the transposon conflict (Bruno et al., 2019). While KRAB-ZnFs act directly as repressors of transposon expression and show fast evolution of their DNA binding specificity, signatures of positive selection are not concentrated on ZnF domains in ZAD proteins (Kasinathan et al., 2020), indicating different evolutionary pressures acting on the two families of ZnF proteins.

At this point, we can only speculate about the processes that forced the evolution of a factor like Kipferl. The phenotype of kipferl mutants, however, holds important clues. In the absence of Kipferl, the piRNA profile in ovaries is strongly distorted. Most transposons exhibit reduced antisense piRNA levels, and for some this results in their de-repression and accumulation of transposon transcripts in the developing oocyte, potentially causative of the fertility defects of kipferl mutant females. One model would therefore be that Kipferl evolved as a dedicated Rhino specificity factor that targets the DNA sequence of some transposons, active in differentiating ovarian germline cells. While only few transposons contain strong Kipferl nucleation sites, we find that the strongest Kipferl binding sites genome-wide are within sequence fragments of the gypsy8 and DMRT1 family transposons. This indicates that the DNA motif recognized by Kipferl possibly originated from an ancient invasion of these now inactive elements, offering a potential evolutionary requirement for Kipferl as a direct recruitment factor for Rhino. The Rhino decoration of transposons lacking Kipferl nucleation sites might rely on sporadic genomic Kipferl binding sites which allow their specification as piRNA source loci as local chromatin state permits. The major piRNA cluster 80F would be a prominent example, where fragments of gypsy8 and Rt1A and B insertions induce the Kipferl-dependent recruitment of Rhino and subsequent piRNA production also from neighboring transposons. In support of this, the entire cluster 80F is not a strong piRNA source in testes, where Kipferl is not expressed (Chen et al., 2021). Thus, an initially specific homing of Rhino to a subset of transposons via Kipferl might have evolved into a network of transposon insertions within heterochromatin and at selected sites in euchromatic chromosomal arms that co-depend on Kipferl. This system would offer the potential for robust Rhino recruitment, at the cost of occasional off-target Rhino domains, when Kipferl motifs and H3K9 methylation coincide at regions lacking transposon sequences.

We can, however, also envision an alternative scenario, based on the second major phenotype in kipferl mutants: The dramatic re-localization of Rhino from hundreds of domains to Satellite arrays located within pericentromeric heterochromatin. Intriguingly, both the Rsp and the 1.688 Satellites are involved in genetic conflicts (Larracuente and Presgraves, 2012; Ferree and Barbash, 2009; Chen et al., 2021; Ferree and Prasad, 2012). The largest 1.688 Satellite array, the X-chromosomal 359 bp repeat, spans more than 10 Mbp on the pericentromeric X-chromosome and acts as a hybrid lethality locus in crosses between D. melanogaster and D. simulans (Ferree and Barbash, 2009; Chen et al., 2021). Females lacking this repeat, when mated to wildtype males, generate non-viable offspring while no defects occur in the reciprocal cross. This phenotype might stem from a requirement for maternally deposited small RNAs (siRNAs or piRNAs) to prevent mitotic catastrophe caused by uncontrolled 1.688 repeats (Usakin et al., 2007). The Rsp Satellite has been identified genetically as part of the Segregation Distorter system, a meiotic drive system in males (Larracuente and Presgraves, 2012). Both Rsp and 1.688 Satellites give rise to abundant piRNAs in a Rhino-dependent manner in testes and ovaries (Wei et al., 2021; Chen et al., 2021). If piRNA production from Rsp and 1.688 Satellites is important to maintain control of 1.688 repeats or to suppress the Segregation distorter locus during spermatogenesis, Satellites might have evolved to take advantage of the Rhino system and force piRNA production from their own loci. In this scenario, different genomic loci would compete for the cellular Rhino pool to recruit the Rhino-dependent transcription and export machinery. It is conceivable that proteins working in an analogous way to Kipferl are present at Satellites to sequester Rhino to these repeats. Intriguingly, Rhino is among the fastest evolving proteins in the fly genome (Vermaak et al., 2005). Previous models have postulated that the positive selection in Rhino is a consequence of the transposon-genome arms race (Yu et al., 2018). Kipferl is conserved among Drosophila species with stronger divergence outside the melanogaster clade. We do not find pronounced amino acid changes in Kipferl’s zinc fingers involved in DNA binding or interaction with Rhino, nor in the ZAD, although the lack of amino acid polymorphisms at the kipferl locus precludes a conclusive analysis of selective forces acting on the protein (see also Figure 2—figure supplement 1). Nevertheless, our findings point to the provocative possibility that selfish Satellite sequences might be among the central drivers behind Rhino’s fast evolution. We speculate that Kipferl might have evolved out of a necessity for a stabilizer of Rhino that allows it to bind its diverse genomic target loci and to avoid being sequestered by selfish Satellite repeats. While Kipferl’s affinity for guanosine-rich sequences optimally opposes the AT-rich Satellite sequences, its low abundance together with the relatively simple DNA motif it recognizes might constitute an optimal level of promiscuous binding across the genome, allowing the targeting of diverse transposon families.

Materials and methods

Fly strains and husbandry

All fly stocks were kept at 25 °C with 12 hr dark/light cycles. Fly strains with genotypes, identifiers, and original sources are listed in the Key Resource Table and strains generated for this study are available from VDRC (http://stockcenter.vdrc.at/control/main). For ovary dissections, flies were aged for 2–6 days, and put on apple juice plates with fresh yeast paste for 2 days.

Generation of transgenic and mutant fly strains

Frame-shift mutant alleles for CG2678 and rhino were generated in isogenised white embryos after co-injection of plasmids pBS-Hsp70-Cas9 (Addgene #46294) and pU6-BbsI-chiRNA (Addgene #45946) modified to express sgRNAs. Whole locus CG2678 deletion for RMCE was achieved following co-injection into ZH-2A(Act5C-Cas9) embryos (derived from Bloomington stock #58492) of plasmids pXZ13 (Zhang et al., 2014a) containing 1 kb homology arms around a 3xP3-dsRed marker flanked by attP sites, together with pCFD4 (Addgene 49411) (Port et al., 2014) expressing two sgRNAs. sgRNA sequences are given in the Key Resource Table.

Fly strains harboring short hairpin RNA (shRNA) expression cassettes for germline knockdown were generated by cloning shRNA sequences into the Valium-20 vector (Ni et al., 2011) modified with a white selection marker (oligos: Key Resource Table). Transgenic flies harboring GFP tagged wildtype or engineered Rhino constructs were generated via insertion of desired tag sequences under the control of the rhino promoter region and the vasa 3’UTR into the attP40 landing site (Markstein et al., 2008) in flies harboring a rhino frame shift mutation on the same chromosome. Overexpression constructs for CG2678 were injected as pUASz plasmids (DeLuca and Spradling, 2018) containing the full intron-containing sequence of CG2678 into the attP2 landing site (Groth et al., 2004). Transgenic flies expressing TurboID-vhhGFP-3xHA-NLS (cloned from on Addgene #107171) under the eggless enhancer were obtained through integration of a mini-white containing plasmid into the attP40 landing site (Markstein et al., 2008). CG2678 rescue constructs were introduced into the CG2678 whole locus deletion flies through co-injection of pRVV578 plasmid (Addgene #108279) harboring the endogenous CG2678 locus flanked by attB sites, with pBS130 (Addgene #26290) for expression of phiC31 integrase. Successful cassette exchange was monitored via loss of dsRed in eyes and the orientation of the inserted construct was verified by PCR. Rescue constructs harbor an internal 3xFLAG tag at residue S161, as neither amino- nor carboxy-terminal tagging of CG2678 yielded fully functional protein. Exceptions are KipferlΔZAD and KipferlGCN4 constructs, which harbor an N-terminal 3xFLAG tag. Of note, we found that the CG2678 locus harbors an additional 25 amino acid exon, flanked by 109 and 31 nucleotides of intronic sequence up and downstream, respectively, in several laboratory fly strains. Comparison with protein sequences annotated for other Drosophila species confirm the presence of the additional in frame coding sequence, inserted between P347 and K348 of the annotated melanogaster CG2678 protein (see also Figure 2—figure supplement 1). PCR analysis confirmed the absence of the respective DNA sequence from iso1 genomic DNA, which served as the basis of the reference genome. The additional sequence is included in the CG2678 overexpression construct. All RMCE rescue constructs harbor the reference genome locus.

Generation of endogenous knock-in fly strains

Generation of endogenously tagged lines for HP1b and HP1c was achieved through co-injections of pU6-BbsI-chiRNA together with pBS donor plasmids containing 1 kb homology arms into embryos from vas-Cas9; attP2 flies or ZH-2A(Act5C-Cas9) embryos for HP1b and HP1c, respectively. sgRNA sequences are listed in the Key Resource Table.

Antibody generation

Mouse monoclonal antibodies against His-tagged CG2678 (aa M2-K188) and His-tagged Rhino (full length) were generated by the Max Perutz Labs Antibody Facility. Antigens were cloned in pET-15b, transformed in BL21(DE3) E. coli, and purified using Ni-NTA resin (QIAGEN) according to standard protocols. Polyclonal antibodies were raised against a CG2678 peptide (aa R171-I190) at Eurogentec.

Cell lines

Drosophila ovarian somatic cells (OSC) cells were cultured as previously described (Niki et al., 2006; Saito et al., 2009). Stable OSC lines expressing internally tagged CG2678 under control of the ubi63E promoter and an SV40 3’UTR were generated by integration into an RMCE landing site.

Western blot

Five pairs of ovaries were mechanically disrupted in lysis buffer (20 mM Tris pH 8.0, 1% Triton X-100, 2 mM MgCl2, Benzonase, protease inhibitors) using a plastic pestle. Protein concentration of whole ovary lysate was determined via Bradford assay to allow equal loading, and SDS-PAGE gel electrophoresis was performed according to standard procedures. Primary antibodies were incubated at 4 °C overnight, secondary antibodies for 1 hr at RT and the blots were developed using ECL (BioRad). Antibodies and dilutions are listed in the Key Resource Table.

RNA fluorescent in situ hybridization

RNA FISH for piRNA clusters 42AB and 38C, as well as HMS-Beagle, Max, diver, and 3S18 transposons was performed using Stellaris probes (Biosearch Technologies). Probe sequences are listed in Supplementary file 3. RNA FISH for 1.688 and Rsp Satellites was performed using a single fluorescent oligo or an in-house labelled probe set of 48 oligos, respectively (Wei et al., 2021; Gaspar et al., 2017). FISH was performed according to the manufacturers protocol with slight modification. Five ovaries were dissected into ice-cold PBS, fixed at room temperature for 20 min (4% formaldehyde, 0.3% Triton X-100 in PBS), washed three times 5 min at RT (PBS containing 0.3% Triton X-100) and incubated at 4 °C overnight in 70% EtOH to enhance permeabilization. Prior to hybridization, ovaries were rehydrated for 5 min in wash buffer (10% formamide in 2 x SSC). Hybridization was done in 50 μl hybridization buffer (100 mg/ml dextran sulfate and 10% formamide in 2 X SSC) overnight at 37 °C using 0.5 μl Stellaris and Rsp FISH probe per sample and a final concentration of 100 nM for the 1.688 FISH oligo. Samples were rinsed two times in wash buffer and then washed in wash buffer two times for 30 min at 37 °C. Ovaries were counterstained for DNA (DAPI 1:5000 in 2 x SSC) for 5 min at RT and washed two times 5 min with 2 x SSC. Finally, ovaries were mounted on microscopy slides using DAKO mounting medium (Agilent) and equalized at RT for at least 24 hr prior to imaging on a Zeiss LSM 880 inverted Airyscan microscope. Images are given as Z-stack across a maximum of 2 micrometers.

Immunofluorescence staining of ovaries and testes

Five to 10 ovary pairs or testes were dissected into ice cold PBS and subsequently incubated in fixation solution (4% formaldehyde, 0.3% Triton X-100, 1 x PBS) for 20 min at room temperature with rotation. Fixed ovaries were washed 3 x 5 min in PBX (0.3% Triton X-100, 1 x PBS) and blocked with BBX (0.1% BSA, 0.3% Triton X-100, 1 x PBS) for 30 min, all at room temperature with rotation. Primary antibody incubation was performed by incubation at 4 °C overnight with antibodies diluted in BBX followed by three 5-min washes in PBX. Ovaries were then incubated overnight at 4 °C with fluorophore-coupled secondary antibodies, washed three times in PBX including DAPI in the first wash to stain DNA (1:50,000 dilution). The final wash buffer was carefully removed before addition of ∼40 μL DAKO mounting to each sample. The samples were imaged on a Zeiss LSM-880 Axio Imager confocal-microscope and image processing was done using FIJI/ImageJ (Schindelin et al., 2012). Images are given as Z-stack across a maximum of 2 micrometer. All relevant antibodies and dilutions are listed in the Key Resource Table.

Yeast two-hybrid assay

Screening for Rhino interactors

Yeast two-hybrid screening for Rhino interactors was performed by Hybrigenics Services, S.A.S., Paris, France (https://www.hybrigenics-services.com). The coding sequence for full-length Rhino was cloned into pB27 (derived from pBTM116) (Vojtek and Hollenberg, 1995) as an N-terminal fusion to LexA (LexA-Rhi). The construct was sequence verified and used as a bait to screen against a random-primed Drosophila ovary cDNA library constructed into pP6 (derived from pGADGH Bartel et al., 1993). A mating approach with YHGX13 (Y187 ade2-101::loxP-kanMX-loxP, matα) and L40ΔGal4 (matA) yeast strains was used to screen 65 million interactions as previously described (Fromont-Racine et al., 1997). A total of 173 colonies were selected on a medium lacking tryptophan, leucine, and histidine. The prey fragments of the positive clones were amplified by PCR and sequenced at their 5′ and 3′ junctions. The resulting sequences were used to identify the corresponding interacting proteins in the GenBank database (NCBI).

Validation and interaction mapping

Yeast strains were grown in YPD or SC selective medium at 30 °C. pOAD and pOBD used as backbone for cloning were described in Miller and Stagljar, 2004 (see Key Resource Table for yeast strains used in this study). Direct protein interactions were probed as described (Miller and Stagljar, 2004). In brief, assayed proteins were fused to the activation domain (AD) and DNA-binding domain (DBD) of the Gal4 transcription factor and transformed into yeast strains PJ694A (AD) and PJ694α (DBD). Individually transformed colonies were selected, picked and mated. Interactions were detected upon spotting of a dilution series of mated yeast on selective (-LTH) plates. Parallel plating on non-selective (-LT) plates controlled for presence of both plasmids.

Biotin proximity labeling

Flies expressing GFP-Rhino or GFP fused to a nuclear localization signal were crossed to flies expressing low levels of TurboID biotin ligase fused to GFP-nanobody and the progeny was kept in cages on apple juice plates and fed with yeast paste containing 100 µM biotin (Sigma) for 16 hr prior to ovary dissection (adapted from Roux et al., 2018). A total of 100  µl of ovaries were washed once with ice-cold PBS and dounced (6 times) in 1.2  ml of pre-extraction buffer (10  mM Tris-HCl pH 7.5, 2  mM MgCl2, 3  mM CaCl2, 0.5% NP40, 10% glycerol, cOmplete Protease Inhibitor Cocktail (Roche)) and incubated at 4  °C with nutation for 15 min followed by centrifugation for 5  min at 20,000 g at 4 °C for mild pre-extraction of cytoplasmic contaminants. The nuclear-enriched fraction was resuspended in lysis buffer (50  mM Tris-HCl pH 7.5, 150  mM NaCl, 0.1% SDS, 0.5% Na-Deoxycholate, 1% Triton-X, 1 mM DTT, Benzonase, cOmplete Protease Inhibitor Cocktail (Roche)) and homogenized using first an electric plastic pestle tool (20 s on ice), followed by further douncing (20 times) and incubation at 4  °C with nutation for 2 hr. The lysate was cleared twice by centrifugation at 18,000 g for 10 min before mixing with 100 µL magnetic Pierce Streptavidin beads (ThermoFischer) pre-equilibrated in lysis buffer, followed by overnight incubation at 4  °C. The beads were washed once with lysis buffer for 10  min at 4  °C, with 2% SDS (10 min at room temperature), and 10 minutes each with wash buffers 1 (50mM HEPES pH 7.5, 500  mM NaCl, 1  mM EDTA, 0.1% Na-Deoxycholate, 1% Triton-X) and 2 (10mM Tris-HCl pH 7.5, 250  mM LiCl, 1  mM EDTA, 0.5% Na-Deoxycholate, 1% NP40) for a minimum of 10  min at 4  °C, followed by five washes without detergent (20 mM Tris-HCl pH 7.5, 137 mM NaCl) before further downstream mass spectrometry analysis.

Mass spectrometry

Mass spectrometry was carried out as described in Batki et al., 2019.

Scoring of embryo hatching rates

To measure female fertility, 10 virgin females were collected and aged for 2–3 days with at least 24 hr of mating with three w1118 males. The hatching rate of fertilized eggs laid onto apple juice plates within a period of 4–7 hr was determined 30 hr after egg laying (25 degrees), as percentage of hatched eggs from total. Plates with less than 50 eggs were disregarded in the analysis. Wildtype females were included in every experiment as control.

Definition and Curation of 1-kb genomic tiles

The four assembled chromosomes of the Drosophila melanogaster genome (dm6 assembly) were split into non-overlapping 1-kb tiles. The tiles were annotated by intersection with genomic annotations for piRNA clusters. Tile mappability was determined by intersection with genomic blocks of continuous mappability using bedtools coverage. Tiles with mappability below 25% were excluded from all analyses (2761 1-kb tiles). Further exclusion criteria included a more than threefold deviation from median values for representative input libraries for either of the three wildtype genotypes used in this study (w1118, MTD-Gal4 >w-sh, iso1; affecting 18,268 1-kb tiles), as well as tiles showing strong residual Rhino or CG2678 signal in ChIP-seq libraries prepared from the respective knock out ovaries (20 and 495 tiles, respectively).

Heterochromatin and euchromatin definitions used in this study

We used ovary H3K9me3 ChIP-seq data to define the extent of pericentromeric heterochromatin and euchromatic chromosome arms. The heavily H3K9me3 covered pericentric regions of the assembled chromosomes, as well as the entire chromosome 4 were classified as heterochromatic, while the rest was annotated as euchromatic. Detailed coordinates can be found in . Small genome contigs not assembled into the four major chromosomes were excluded from all analyses. For 1-kb tile analyses, piRNA clusters 42AB, 38C, and 80F were not included into either category, but were analyzed separately as reference loci.

ChIP-Seq

ChIP was performed as previously described (Lee et al., 2006). In brief, 150 μl of ovaries were dissected into ice-cold PBS, crosslinked with 1.8% formaldehyde in PBS for 10 min at room temperature, quenched with Glycine, rinsed in PBS and flash frozen in liquid nitrogen after removing all PBS. Frozen ovaries were disrupted in PBS using a dounce homogenizer, centrifuged at low speed and the pellet was resuspended in lysis buffer. For ChIP from OSCs 5–10 million cells were crosslinked, quenched, and lysed. Sonication (Bioruptor) resulted in DNA fragment sizes of 200–800 bp. Immunoprecipitation with specific antibodies was done overnight at 4 °C in 350–700 μl total volume using 1/3 to 1/4 of chromatin per ChIP (antibodies are listed in Key Resource Table). Then, 40 μl Dynabeads (equal mixture of Protein G and A, Invitrogen) were added and incubated for 1 hr at 4°. After multiple washes, immuno-precipitated protein-DNA complexes were eluted with 1% SDS, treated with RNAse-A, decrosslinked overnight at 65 °C, and proteins were digested with proteinase K before clean-up using ChIP DNA Clean & Concentrator columns (Zymo Research). Barcoded libraries were prepared according to manufacturer’s instructions using the NEBNext Ultra II DNA Library Prep Kit for Illumina (NEB), and sequenced on a HiSeqV4, NextSeq550, or NovaSeqSP (Illumina).

RNA-Seq

Strand-specific RNA seq was performed as described previously (Zhang et al., 2012b). In brief, total RNA was extracted from 5 to 10 ovaries from 7-day-old flies using Trizol (Invitrogen). Total RNA was purified using RNAeasy columns (QIAGEN). Six micrograms of total RNA were subjected to polyA selection and subsequent fragmentation, reverse transcription, and library preparation according to manufacturer’s instructions using the NEBNext Ultra DNA Library Prep Kit for Illumina (NEB) for sequencing on an Illumina NovaSeqSP instrument.

Small RNA-Seq

Small RNA cloning was performed as described in Grentzinger et al., 2020. In brief, ovaries were lysed and Argonaute- sRNA complexes were isolated using TraPR ion exchange spin columns. sRNAs were subsequently purified using Trizol and subjected to ligations of 3′ and 5′ barcoded adapters containing 4 random nucleotides at the ends to reduce ligation biases, reverse transcribed, PCR amplified, and sequenced on an Illumina NextSeq550 instrument.

Computational analysis

ChIP-Seq analysis

ChIP-seq reads were trimmed to remove the adaptor sequences and to adjust all reads to 50 bp irrespective of sequencing mode. Reads were mapped to the dm6 genome using Bowtie (version.1.3.0, settings: -f -v 3 a --best --strata --sam), allowing up to three mismatches. Genome unique reads were mapped to 1-kb tiles and a pseudocount of 1 was added after normalization to library depth, before enrichment over input values were determined. Each ChIP-seq sample was adjusted with a correction factor determined from median input levels and median background levels to reach median background enrichment of 1 to correct for unequal ChIP efficiency. To classify genomic regions into Rhino domains and non-Rhino domains, we used a binary cutoff of 4-fold enrichment calculated from two independent replicate experiments of the relevant wildtype genotypes. This cutoff corresponds to a p-value of <0.05. Kipferl-only 1-kb tiles were those that had no Rhino enrichment (below 4-fold) and that were significantly enriched in Kipferl over Rhino (Z-score=3). Replicates were averaged for genomic 1-kb tile analyses.

ChIP-seq peak calling

We used MACS2 (Zhang et al., 2008) with --broad --broad-cutoff 0.1 for Kipferl and Rhino in wildtype ovaries due to the broad extent of Rhino/Kipferl domains. The ‘narrow peak’ setting was used for the remaining experiments. Peaks mapping to genomic contigs outside the four main chromosomes were discarded, and peaks were filtered for a score of 50 (broad peaks; p<10–5) and 30 (narrow peaks; p<10–3). Kipferl-Rhino shared versus Kipferl-only peaks were distinguished by intersection of narrow peaks called for Kipferl in two independent replicate experiments of rhino mutant ovaries, with broad peaks called for Rhino in two independent replicate experiments of w1118 ovaries using bedtools intersect with -u -f 0.75 for shared domains and -v for Kipferl-only domains. Rhino-independent Kipferl peaks that were detected independently in two replicate experiments were grouped into heterochromatic and euchromatic by intersection with heterochromatin coordinates outlined above. Kipferl DNA binding motifs were recovered from the top ~3000 summits of Rhino-independent Kipferl peaks (achieved through a score cutoff of 7 on summits) using HOMER (Heinz et al., 2010). Heatmaps display one representative replicate and were produced through deeptools (Ramírez et al., 2016).

ChIP-seq analysis on transposon consensus sequences

Genome mapping reads longer than 23 nucleotides were mapped to TE consensus sequences (Supplementary file 4) using bowtie (v.1.3.0; settings: -f -v 3 a --best --strata --sam) allowing up to three mismatches. Reads mapping to multiple elements were assigned to the best mapping position. Reads mapping to multiple positions were randomly distributed. Library depth normalized ChIP and input reads, respectively, were averaged over all nucleotide positions of each element to give one value per element. ChIP-seq enrichment was calculated after adding a pseudo count of 1 and adjusted using sample-specific correction factors determined from background 1-kb tiles to reach median background enrichments of 1. Corrected per-base enrichment was calculated for TE ChIP-seq profiles and replicates were averaged.

Motif instances

Occurrences of Kipferl DNA binding motifs were determined using PWMScan (ccg.epfl.ch/pwmtools /pwmscan.php) on the dm6 genome and on TE consensus sequences. For display in heatmaps, cumulative motif counts on both genomic strands were intersected with non-overlapping 100-bp windows.

smallRNA-Seq Analysis:

Raw reads were trimmed for linker sequences and the 4 random nucleotides flanking the small RNA before mapping to the Drosophila melanogaster genome (dm6), using Bowtie (version.1.3.0, settings: -f -v 3 a --best --strata --sam) with 0 mismatch allowed. Genome mapping reads were intersected with Flybase genome annotations (r6.40) using Bedtools to allow the removal of reads mapping to rRNA, tRNA, snRNA, snoRNA loci and the mitochondrial genome. For TE mappings, all genome mappers were used allowing no mismatches. Reads mapping to multiple elements (Supplementary file 4) were assigned to the best match. Reads mapping equally well to multiple positions were randomly distributed. Libraries were normalized to 1 Mio miRNA reads. For the 1-kb tiles analysis, a pseudocount of 1 was added after normalization to library depth and correction for the mappability of the respective 1-kb tile. Tiles with fewer than 10 mapping piRNAs in all libraries were disregarded for sRNA analysis to avoid distortion due to very low abundant piRNAs. For calculation of piRNAs mapping to TEs, sense and antisense piRNAs were kept separate, and counts were normalized to TE length. For classification of tiles and transposons into somatic, Rhino-independent, and Rhino-dependent source loci, the soma index was determined as the log2 ratio of somatic (Piwi-IP in piwi GLKD, normalized to library depth) and germline (GL-Piwi IP, normalized to library depth) piRNAs mapping to each tile or TE (Mohn et al., 2014). Classification by Kipferl-dependency of TEs was achieved by a binary cutoff of at a twofold reduction in antisense piRNA levels in kipferl knock down compared to control.

RNA-Seq analysis

For the RNAseq analysis, genome matching reads (STAR v2.7.10a; settings: --outSAMmode NoQS --readFilesCommand cat --alignEndsType Local --twopassMode Basic --outReadsUnmapped Fastx --outMultimapperOrder Random --outSAMtype SAM --outFilterMultimapNmax 1000 --winAnchorMultimapNmax 2000 --outFilterMismatchNmax 3 --seedSearchStartLmax 30 --alignSoftClipAtReferenceEnds No --outFilterType BySJout --alignSJoverhangMin 15 --alignSJDBoverhangMin 1) were randomized in order and quantified using Salmon (v.1.7.0; settings: --dumpEqWeights --seqBias --gcBias --useVBOpt --numBootstraps 100 l SF --incompatPrior 0.0 --validateMappings). For the analysis we used the FlyBase transcriptome (r6.40) which has been masked for sequences similar to transposons. To include both strands of transposons in the analysis, TE-consensus sequences were added to the FlyBase transcriptome in sense and antisense orientation. For gene expression visualization Salmon results were further processed to GeTMM values using edgeR (v3.34.0). For differential gene expression analysis Salmon results were processed using DeSeq2 (v1.32.0).

TE insertion Calling

Euchromatic TE insertions were extracted from insertions called previously (Mohn et al., 2014), and nearby insertions were merged using bedtools merge -d 100, as these mostly corresponded to the same insertions called at slightly different positions. Further, all insertions overlapping UCSC repeat masker track annotations were discarded. For analysis of non-Kipferl nucleation site containing, but Rhino-dependent TEs, the resulting list was subsequently filtered for elements with no Kipferl enrichment in rhino knock out ovaries and at least 10-fold difference in piRNA levels between control and MTD-Gal4 mediated rhino germline knock down. This retrieved 285 euchromatic solo TE insertion sites in the genome of our experimental MTD-Gal4 strains (Supplementary file 5).

Acknowledgements

We thank the VBCF core facilities (NGS, VDRC, Protein Chemistry) as well as the IMBA/GMI/IMP BioOptics facility for excellent support, and the IMBA Fly House for generating transgenic and CRISPR-edited fly lines. Mia Levine gave invaluable input on the analysis of positive selection at the kipferl locus. The Max Perutz Laboratories Monoclonal Antibody facility generated the CG2678 and Rhino hybridoma cell lines. Steven DeLuca shared the pUASz plasmid. Sarah Barnes helped with cloning of the KipferlΔZnF#4 construct, Bernardo Almeida gave advice on motif analyses, and Maria Novatchkova supported bioinformatic analyses. We thank the Brennecke laboratory for help throughout this project and A Larracuente, F Mohn, J Schnabl, and S Barnes for comments on the manuscript. This research was funded by the Austrian Academy of Sciences, the European Community (ERC-2015-CoG-682181) and the Austrian Science Fund (F4303 and W1207). LB was funded by a Boehringer Ingelheim Fond PhD fellowship. CY was funded by a VIP2 postdoc fellowship as part of the EU Horizon 2020 research and innovation program (Marie Skłodowska-Curie grant no. 847548).

Appendix 1

Appendix 1—key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Antibody anti-CG2678#2
(Rabbit polyclonal)
This paper CG2678#2_4P39glyc, raised against Kipferl peptide R171-I190 Anti-Kipferl polyclonal antibody,
available from Brennecke
lab; ChIP (7 µL per IP)
Antibody anti-CG2678 M4
(Mouse monoclonal)
This paper M4 3A6-3E1-E11, raised against Kipferl amino acids M2-K188 Anti-Kipferl monoclonal
WB antibody, available
from Brennecke lab; WB (1:250)
Antibody anti-CG2678 M3
(Mouse monoclonal)
This paper M3 2 C5-3C3, raised against Kipferl amino acids M2-K188 Anti-Kipferl monoclonal
IF antibody, available
from Brennecke lab; IF (1:500)
Antibody anti-Rhino
(Mouse monoclonal)
This paper 6B7-F2, raised against full length denatured Rhino protein Anti-Rhino monoclonal
WB antibody, available
from Brennecke lab; WB (1:10)
Antibody anti-Rhino
(Rabbit polyclonal)
Mohn et al., 2014 Rhino#1_3573gly ChIP (5 µL per IP),
IF (1:1000)
Antibody anti-Deadlock
(Mouse monoclonal)
Andersen et al., 2017 5B5-6D7-3H10 IF (1:100)
Antibody anti-Nxf3 (Mouse monoclonal) ElMaghraby et al., 2019 8E4-F1 IF (1:50)
Antibody Histone H3K9me3
antibody (Rabbit polyclonal)
Active motif ID_source:39161 ChIP (5 µL per IP), IF (1:100)
Antibody Anti-Histone H3
(di methyl K9)
(mouse monoclonal)
Abcam ID_source:mAbcam1220 ChIP (5 µL per ChIP)
Antibody C1A9 HP1 (Su(var)2–5)
(Mouse monoclonal)
DSHB ID_source:C1A9 ChIP (5 µL per IP), IF (1:100)
Antibody ANTI-FLAG(R) M2
(Mouse monoclonal)
Sigma Aldrich ID_source:F1804-1MG ChIP (2 µL per IP), IF (1:2000)
genetic reagent (D. melanogaster) w1118;;; Bloomington stock 3605 w 1118 wildtype, cultivated in
our lab for several years
genetic reagent (D. melanogaster) w;;; Susan Celniker iso1 wildtype
genetic reagent (D. melanogaster) Cog-GAL4; NGT-GAL4; nos-GAL4; Bloomington stock 31777 Referred to as
'maternal triple
driver' (MTD)
Gal4 driver
genetic reagent (D. melanogaster) w;; pW20>w_sh[attP2]/TM3, Sb; Mohn et al., 2014, VDRC-ID 313772 white (CG2759) RNAi line
genetic reagent (D. melanogaster) w;; pW20>rhi_sh[attP2]/TM3, Sb; Mohn et al., 2014, VDRC-ID 313156 Rhino (CG10683) RNAi line
genetic reagent (D. melanogaster) w;; pW20>CG2678_sh2
[attP2]/TM3, Sb;
this paper Kipferl (CG2678) Kipferl RNAi line,
available from VDRC;
sh-oligo sequence:
ctagcagtCTCGAAGG
CTTTCATGCGTAA
tagttatattcaagcata
TTACGCATGAAA
GCCTTCGAGgcg
genetic reagent (D. melanogaster) w;; pGLKD >piwi_sh2
[attP2]/TM3,Sb;
Senti and Brennecke, 2010, VDRC-ID 313199 Piwi (CG6122) RNAi line
genetic reagent (D. melanogaster) w;; CG2678[Δ1]
(dsRed+)/TM3,Sb;
this paper Kipferl (CG2678) Kipferl mutant allele,
available from VDRC;
LB1-RMCEm31
genetic reagent (D. melanogaster) w;; CG2678[Δ2](dsRed+)/TM3,Sb; this paper Kipferl (CG2678) Kipferl mutant allele,
available from VDRC;
LB1-RMCEm21; has
aberrant deregulation
of blood transposon
genetic reagent (D. melanogaster) w;; CG2678[fs1]/TM3,Sb; this paper Kipferl (CG2678) Kipferl mutant allele,
available from VDRC;
LB1-FSm52; indel (–7);
sequence CCTGCGT
CCTGGCCGTGC-------
TTTCCGGTTCAAGT
GGCAAAGCGAGCAGAG
genetic reagent (D. melanogaster) w;; CG2678RMCE[S161-
3xFLAG]/TM3,Sb;
this paper Kipferl (CG2678) Kipferl tagged construct
(RMCE), available from
VDRC; wildtype 3xFLAG
tagged rescue construct
inserted into LB1-RMCEm31
genetic reagent (D. melanogaster) w; prhino >3xFLAG/V5/
Precission/GFP-
Rhino[attP40],rhi[g2m11]/CyO;
CG2678[Δ1](dsRed+),
nxf3[A2-2]/TM3,Sb;
this paper Rhino, Kipferl, Nxf3 mutant allele combination,
available from VDRC;
Nxf3[A2-2] allele
published in
ElMaghraby et al., 2019
genetic reagent (D. melanogaster) w; prhino >3xFLAG/V5/Precission/GFP-Rhino[attP40],rhi[g2m11]/CyO;; this paper Rhino (CG10683) tagged Rhino construct,
available from VDRC
genetic reagent (D. melanogaster) w; prhino >3xFLAG/V5/Precission/GFP-Rhino[attP40],rhi[g2m11]/CyO; CG2678[Δ1](dsRed+)/TM3,Sb; this paper Rhino, Kipferl mutant allele combination,
available from VDRC
genetic reagent (D. melanogaster) w; rhi[18-7]/CyO;; Andersen et al., 2017, VDRC-ID 313488 Rhino (CG10683) mutant allele
genetic reagent (D. melanogaster) w; rhi[g2m11]/CyO;; this paper Rhino (CG10683) Rhino mutant allele,
available from VDRC;
indel –7; seq: ATGTCT
CGCAACCA-------cc-A
ATCTTGGTCTGGTC
GATGCACCGCCTAATG
genetic reagent (D. melanogaster) w;; pUASz >CG2678[attP2]; this paper Kipferl (CG2678) tagged Kipferl construct,
available from VDRC;
intron containing CG2678
sequence including
non-mapped exon3
genetic reagent (D. melanogaster) w;; CG2678RMCE
[ΔZnFarray1-S161-
3xFLAG]/TM3,Sb;
this paper Kipferl (CG2678) tagged Kipferl construct
(RMCE), available from
VDRC; 3xFLAG tagged
ZnF array 1 deletion
construct inserted into
LB1-RMCEm31
genetic reagent (D. melanogaster) w;; CG2678RMCE
[ΔZnFarray2-S161-
3xFLAG]/TM3,Sb;
this paper Kipferl (CG2678) tagged Kipferl construct
(RMCE), available from
VDRC; 3xFLAG tagged
ZnF array 2 deletion
construct inserted into
LB1-RMCEm31
genetic reagent (D. melanogaster) w;; CG2678RMCE
[3xFLAG-ΔZAD]/TM3,Sb;
this paper Kipferl (CG2678) tagged Kipferl construct
(RMCE), available from
VDRC; 3xFLAG tagged
ZAD deletion construct
inserted into LB1-RMCEm31
genetic reagent (D. melanogaster) w;; CG2678RMCE[3xFLAG-ZAD::GCN4]/TM3,Sb; this paper Kipferl (CG2678) tagged Kipferl construct
(RMCE), available from
VDRC; 3xFLAG tagged
ZAD GCN4 replacement
construct inserted into
LB1- RMCEm31
genetic reagent (D. melanogaster) w;; CG2678RMCE
[3xFLAG-ouibZAD]/TM3,Sb;
this paper Kipferl (CG2678) Tagged Kipferl construct
(RMCE), available from VDRC;
3xFLAG tagged ZAD
replacement construct
inserted into LB1-RMCEm31
genetic reagent (D. melanogaster) w;; CG2678RMCE[ΔZnF#4-S161-3xFLAG]/TM3,Sb; this paper Kipferl (CG2678) tagged Kipferl construct
(RMCE), available from
VDRC; 3xFLAG tagged ZnF4
deletion construct inserted
into LB1-RMCEm31
genetic reagent (D. melanogaster) w; prhino >3xFLAG/V5/
Precission/GFP-
Rhino(CSD::GCN4)
[attP40],rhi[g3m13]/CyO;;
this paper Rhino (CG10683) tagged Rhino construct,
available from VDRC;
Rhino indel –14; seq:
TGGGCGTCCCCAGG---------------AGCGGTTTTCCGAA
CGAGAACAACACC,
Rhino CSD was replaced
by the Gcn4 dimerization
domain, homozygous not
viable, crossed to w;
rhi[18-7]/CyO;; for experiments
genetic reagent (D. melanogaster) w; prhino >3xFLAG/
V5/Precission/GFP-
Rhino(art.hinge)
[attP40],rhi[g3m13]/CyO;;
this paper Rhino (CG10683) tagged Rhino construct,
available from VDRC;
Rhino indel –14; seq:
TGGGCGTCCCCAGG---------------
AGCGGTTTTCCGA
ACGAGAACAACACC,
Rhino hinge was replaced
by a scrambled amino acid
sequence of the same length,
homozygous not viable,
crossed to w; rhi[18-7]/CyO;;
for experiments
genetic reagent (D. melanogaster) w;; prhino >3xFLAG/
V5/Precission/GFP-
Su(var)2-5[attP2];
this paper HP1a (CG8409) tagged HP1a construct,
available from VDRC
genetic reagent (D. melanogaster) w, 3xFLAG/V5/
Precission/GFP-HP1b;;;
this paper HP1b (CG7041) Endogenously tagged HP1b,
available from VDRC
genetic reagent (D. melanogaster) w;; 3xFLAG/V5/
Precission/GFP-HP1c;
this paper HP1c (CG6990) Endogenously tagged HP1c,
available from VDRC
genetic reagent (D. melanogaster) w; peggless >TurboID-
linker-vhhGFP-3xHA-
NLS[attP40]/CyO;;
this paper - Transgenic construct,
available from VDRC; TurboID
biotin ligase fused to
GFP nanobody
genetic reagent (D. melanogaster) w;; 3xFLAG/V5/Precission/
GFP(replacing
CG13741 CDS)-NLS
[attP2]/TM3,Sb;
ElMaghraby et al., 2019 Boot (CG13741) tagged construct; nuclear
GFP used as contol
Sequence-based reagent Rhino_g2 This paper CRISRP guide RNA (indel) GACCAAGATTTGGTCGCTGA
Sequence-based reagent Rhino_g3 This paper CRISRP guide RNA (indel) GTCCCCAGGTTCTGGTGAAG
Sequence-based reagent CG2678_g1 This paper CRISRP guide RNA (RMCE left) GTACAAATGATCAGTGCGA
Sequence-based reagent CG2679_g2 This paper CRISRP guide RNA (RMCE right) GAAGGCATTAAGTAGCATG
Sequence-based reagent CG2678_g3 This paper CRISRP guide RNA (indel) GAACCGGAAAGCATTCTGCA
Sequence-based reagent HP1b_g2 This paper CRISRP guide RNA (N-terminal endogenous tagging) CACAATGGCCGAATTCTCAG
Sequence-based reagent HP1c_g2 This paper CRISRP guide RNA (N-terminal endogenous tagging) GATGCGCTCCACCACGAAGT
Strain, strain background (Saccharomyces cerevisiae) Matα, trp1-901, leu2-3,
112, ura3-52, his3-200,
gal4Δ, gal80Δ, GAL2-
ADE2, LYS2::GAL1-HIS3,
met2::GAL7-lacZ
Mohn et al., 2014
Strain, strain background (Saccharomyces cerevisiae) MatA, trp1-901, leu2-3,
112, ura3-52, his3-200,
gal4Δ, gal80Δ, GAL2-
ADE2, LYS2::GAL1-HIS3,
met2::GAL7-lacZ
Mohn et al., 2014
Strain, strain background (Saccharomyces cerevisiae) pOAD Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-Rhino Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOBD-Rhino Mohn et al., 2014 in YGS1 (Matα)
Strain, strain background (Saccharomyces cerevisiae) pOAD-HP1a Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-Rhino(1-100) Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-Rhino(101-300) Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-Rhino(301-418) Mohn et al., 2014 in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOBD-CG2678-ZF4nolinker this paper Y2H construct, available
from Brennecke lab; in
YGS1 (Matα)
Strain, strain background (Saccharomyces cerevisiae) pOBD-CG2678-190-311 this paper Y2H construct, available
from Brennecke lab; in
YGS1 (Matα)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678FL this paper Y2H construct, available
from Brennecke lab; in
YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678frag1 this paper Y2H construct, available
from Brennecke lab; in
YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678frag2 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678frag3 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678frag4 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678frag5 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678-ZF1-2 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678-ZF1-3 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678ZF2-4 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678ZF3-4 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678ZF4nolinker this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-OuibZAD this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-Rhi19-85 this paper Y2H construct, available from
Brennecke lab; in YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOAD-CG2678-190-311 this paper Y2H construct, available
from Brennecke lab; in
YGS2 (MatA)
Strain, strain background (Saccharomyces cerevisiae) pOBD-CG2678frag1 this paper Y2H construct, available
from Brennecke lab; in
YGS1 (Matα)
Strain, strain background (Saccharomyces cerevisiae) pOAD-HP1CD this paper Y2H construct, available
from Brennecke lab; in
YGS2 (MatA)

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Julius Brennecke, Email: julius.brennecke@imba.oeaw.ac.at.

Deborah Bourc'his, Institut Curie, France.

Kevin Struhl, Harvard Medical School, United States.

Funding Information

This paper was supported by the following grants:

  • European Research Council ERC-2015-CoG-682181 to Julius Brennecke.

  • Marie Curie 847548 to Changwei Yu.

  • Austrian Science Fund F4303 and W1207 to Julius Brennecke.

  • Boehringer Ingelheim Fonds to Lisa Baumgartner.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Data curation, Software, Formal analysis.

Investigation, Methodology.

Resources, CY established and optimized the versatile GFP-nanobody based biotin proximity labeling assay in flies.

Methodology, PD designed and generated all CRISPR engineered flies.

Conceptualization, Resources, Supervision, Funding acquisition, Writing – original draft, Project administration, Writing – review and editing.

Ethics

All work described in this paper uses Drosophila melanogaster as model organism. Relevant safety procedures in terms of handling transgenic flies are in place.

Additional files

Supplementary file 1. Chromosomal coordinates of major heterochromatic and euchromatic compartments used in this study.
Supplementary file 2. Putative protein-protein interactors of Rhino identified through a yeast two-hybrid screen.
elife-80067-supp2.xlsx (9.1KB, xlsx)
Supplementary file 3. Sequences of RNA FISH probes used in this study.
elife-80067-supp3.xlsx (15KB, xlsx)
Supplementary file 4. Consensus sequences of transposable elements analyzed in this study.
elife-80067-supp4.xlsx (186.9KB, xlsx)
Supplementary file 5. Chromosomal coordinates of stand-alone transposon insertions identified in the MTD-Gal4 background that are absent in the reference genome.
elife-80067-supp5.xlsx (17.3KB, xlsx)
MDAR checklist

Data availability

Sequencing data sets have been deposited to the NCBI GEO archive (GSE202468). All fly strains generated for this study are available via the VDRC. All genome-wide sequencing data (ChIP-seq, small RNA-seq, RNA-seq) can be browsed at https://genome-euro.ucsc.edu/s/balisa/dm6_elife220506.

The following dataset was generated:

Brennecke J. 2022. The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters. NCBI Gene Expression Omnibus. GSE202468

References

  1. Abad JP, Agudo M, Molina I, Losada A, Ripoll P, Villasante A. Pericentromeric regions containing 1.688 satellite DNA sequences show anti-kinetochore antibody staining in prometaphase chromosomes of Drosophila melanogaster. Molecular & General Genetics. 2000;264:371–377. doi: 10.1007/s004380000331. [DOI] [PubMed] [Google Scholar]
  2. Akkouche A, Mugat B, Barckmann B, Varela-Chavez C, Li B, Raffel R, Pélisson A, Chambeyron S. Piwi is required during Drosophila embryogenesis to license dual-strand pirna clusters for transposon repression in adult ovaries. Molecular Cell. 2017;66:411–419. doi: 10.1016/j.molcel.2017.03.017. [DOI] [PubMed] [Google Scholar]
  3. Akulenko N, Ryazansky S, Morgunova V, Komarov PA, Olovnikov I, Vaury C, Jensen S, Kalmykova A. Transcriptional and chromatin changes accompanying de novo formation of transgenic pirna clusters. RNA. 2018;24:574–584. doi: 10.1261/rna.062851.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Andersen PR, Tirian L, Vunjak M, Brennecke J. A heterochromatin-dependent transcription machinery drives pirna expression. Nature. 2017;549:54–59. doi: 10.1038/nature23482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aravin AA, Sachidanandam R, Bourc’his D, Schaefer C, Pezic D, Toth KF, Bestor T, Hannon GJ. A pirna pathway primed by individual transposons is linked to de novo DNA methylation in mice. Molecular Cell. 2008;31:785–799. doi: 10.1016/j.molcel.2008.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bag I, Chen S, Rosin LF, Chen Y, Liu C-Y, Yu G-Y, Lei EP. M1BP cooperates with CP190 to activate transcription at TAD borders and promote chromatin insulator activity. Nature Communications. 2021;12:4170. doi: 10.1038/s41467-021-24407-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bartel P, Chien CT, Sternglanz R, Fields S. Elimination of false positives that arise in using the two-hybrid system. BioTechniques. 1993;14:920–924. [PubMed] [Google Scholar]
  8. Batki J, Schnabl J, Wang J, Handler D, Andreev VI, Stieger CE, Novatchkova M, Lampersberger L, Kauneckaite K, Xie W, Mechtler K, Patel DJ, Brennecke J. The nascent RNA binding complex sfinx licenses pirna-guided heterochromatin formation. Nature Structural & Molecular Biology. 2019;26:720–731. doi: 10.1038/s41594-019-0270-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bonchuk A, Boyko K, Fedotova A, Nikolaeva A, Lushchekina S, Khrustaleva A, Popov V, Georgiev P. Structural basis of diversity and homodimerization specificity of zinc-finger-associated domains in Drosophila. Nucleic Acids Research. 2021;49:2375–2389. doi: 10.1093/nar/gkab061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Branon TC, Bosch JA, Sanchez AD, Udeshi ND, Svinkina T, Carr SA, Feldman JL, Perrimon N, Ting AY. Efficient proximity labeling in living cells and organisms with turboid. Nature Biotechnology. 2018;36:880–887. doi: 10.1038/nbt.4201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Brennecke J, Aravin AA, Stark A, Dus M, Kellis M, Sachidanandam R, Hannon GJ. Discrete small RNA-generating loci as master regulators of transposon activity in Drosophila. Cell. 2007;128:1089–1103. doi: 10.1016/j.cell.2007.01.043. [DOI] [PubMed] [Google Scholar]
  12. Bruno M, Mahgoub M, Macfarlan TS. The arms race between KRAB-zinc finger proteins and endogenous retroelements and its impact on mammals. Annual Review of Genetics. 2019;53:393–416. doi: 10.1146/annurev-genet-112618-043717. [DOI] [PubMed] [Google Scholar]
  13. Chen Y-CA, Stuwe E, Luo Y, Ninova M, Le Thomas A, Rozhavskaya E, Li S, Vempati S, Laver JD, Patel DJ, Smibert CA, Lipshitz HD, Toth KF, Aravin AA. Cutoff suppresses RNA polymerase II termination to ensure expression of pirna precursors. Molecular Cell. 2016;63:97–109. doi: 10.1016/j.molcel.2016.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chen P, Luo Y, Aravin AA. RDC complex executes a dynamic pirna program during Drosophila spermatogenesis to safeguard male fertility. PLOS Genetics. 2021;17:e1009591. doi: 10.1371/journal.pgen.1009591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chung H-R, Schäfer U, Jäckle H, Böhm S. Genomic expansion and clustering of ZAD-containing C2H2 zinc-finger genes in Drosophila. EMBO Reports. 2002;3:1158–1162. doi: 10.1093/embo-reports/kvf243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chung H-R, Löhr U, Jäckle H. Lineage-specific expansion of the zinc finger associated domain ZAD. Molecular Biology and Evolution. 2007;24:1934–1943. doi: 10.1093/molbev/msm121. [DOI] [PubMed] [Google Scholar]
  17. Czech B, Munafò M, Ciabrelli F, Eastwood EL, Fabry MH, Kneuss E, Hannon GJ. PiRNA-guided genome defense: from biogenesis to silencing. Annual Review of Genetics. 2018;52:131–157. doi: 10.1146/annurev-genet-120417-031441. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. DeLuca SZ, Spradling AC. Efficient expression of genes in the Drosophila germline using a UAS promoter free of interference by hsp70 pirnas. Genetics. 2018;209:381–387. doi: 10.1534/genetics.118.300874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Doblmann J, Dusberger F, Imre R, Hudecz O, Stanek F, Mechtler K, Dürnberger G. ApQuant: accurate label-free quantification by quality filtering. Journal of Proteome Research. 2019;18:535–541. doi: 10.1021/acs.jproteome.8b00113. [DOI] [PubMed] [Google Scholar]
  20. ElMaghraby MF, Andersen PR, Pühringer F, Hohmann U, Meixner K, Lendl T, Tirian L, Brennecke J. A heterochromatin-specific RNA export pathway facilitates pirna production. Cell. 2019;178:964–979. doi: 10.1016/j.cell.2019.07.007. [DOI] [PubMed] [Google Scholar]
  21. Ferree PM, Barbash DA. Species-specific heterochromatin prevents mitotic chromosome segregation to cause hybrid lethality in Drosophila. PLOS Biology. 2009;7:e1000234. doi: 10.1371/journal.pbio.1000234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Ferree PM, Prasad S. How can satellite DNA divergence cause reproductive isolation? let us count the chromosomal ways. Genetics Research International. 2012;2012:430136. doi: 10.1155/2012/430136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Font-Burgada J, Rossell D, Auer H, Azorín F. Drosophila hp1c isoform interacts with the zinc-finger proteins WOC and relative-of-WOC to regulate gene expression. Genes & Development. 2008;22:3007–3023. doi: 10.1101/gad.481408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fromont-Racine M, Rain JC, Legrain P. Toward a functional analysis of the yeast genome through exhaustive two-hybrid screens. Nature Genetics. 1997;16:277–282. doi: 10.1038/ng0797-277. [DOI] [PubMed] [Google Scholar]
  25. Gaspar I, Wippich F, Ephrussi A. Enzymatic production of single-molecule FISH and RNA capture probes. RNA. 2017;23:1582–1591. doi: 10.1261/rna.061184.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Gebert D, Neubert LK, Lloyd C, Gui J, Lehmann R, Teixeira FK. Large Drosophila germline pirna clusters are evolutionarily labile and dispensable for transposon regulation. Molecular Cell. 2021;81:3965–3978. doi: 10.1016/j.molcel.2021.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Goriaux C, Desset S, Renaud Y, Vaury C, Brasset E. Transcriptional properties and splicing of the flamenco pirna cluster. EMBO Reports. 2014;15:411–418. doi: 10.1002/embr.201337898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Grentzinger T, Oberlin S, Schott G, Handler D, Svozil J, Barragan-Borrero V, Humbert A, Duharcourt S, Brennecke J, Voinnet O. A universal method for the rapid isolation of all known classes of functional silencing small rnas. Nucleic Acids Research. 2020;48:e79. doi: 10.1093/nar/gkaa472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Groth AC, Fish M, Nusse R, Calos MP. Construction of transgenic Drosophila by using the site-specific integrase from phage phic31. Genetics. 2004;166:1775–1782. doi: 10.1534/genetics.166.4.1775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Han BW, Wang W, Li C, Weng Z, Zamore PD. Noncoding RNA pirna-guided transposon cleavage initiates zucchini-dependent, phased pirna production. Science. 2015;348:817–821. doi: 10.1126/science.aaa1264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Harms E, Chu T, Henrion G, Strickland S. The only function of grauzone required for Drosophila oocyte meiosis is transcriptional activation of the cortex gene. Genetics. 2000;155:1831–1839. doi: 10.1093/genetics/155.4.1831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Molecular Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hoskins RA, Carlson JW, Wan KH, Park S, Mendez I, Galle SE, Booth BW, Pfeiffer BD, George RA, Svirskas R, Krzywinski M, Schein J, Accardo MC, Damia E, Messina G, Méndez-Lago M, de Pablos B, Demakova OV, Andreyeva EN, Boldyreva LV, Marra M, Carvalho AB, Dimitri P, Villasante A, Zhimulev IF, Rubin GM, Karpen GH, Celniker SE. The release 6 reference sequence of the Drosophila melanogaster genome. Genome Research. 2015;25:445–458. doi: 10.1101/gr.185579.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Houwing S, Kamminga LM, Berezikov E, Cronembold D, Girard A, van den Elst H, Filippov DV, Blaser H, Raz E, Moens CB, Plasterk RHA, Hannon GJ, Draper BW, Ketting RF. A role for piwi and pirnas in germ cell maintenance and transposon silencing in zebrafish. Cell. 2007;129:69–82. doi: 10.1016/j.cell.2007.03.026. [DOI] [PubMed] [Google Scholar]
  35. Hur JK, Luo Y, Moon S, Ninova M, Marinov GK, Chung YD, Aravin AA. Splicing-independent loading of TREX on nascent RNA is required for efficient expression of dual-strand pirna clusters in Drosophila. Genes & Development. 2016;30:840–855. doi: 10.1101/gad.276030.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jauch R, Bourenkov GP, Chung HR, Urlaub H, Reidt U, Jäckle H, Wahl MC. The zinc finger-associated domain of the Drosophila transcription factor Grauzone is a novel zinc-coordinating protein-protein interaction module. Structure. 2003;11:1393–1402. doi: 10.1016/j.str.2003.09.015. [DOI] [PubMed] [Google Scholar]
  37. Kasinathan B, Colmenares SU, McConnell H, Young JM, Karpen GH, Malik HS. Innovation of heterochromatin functions drives rapid evolution of essential ZAD-ZNF genes in Drosophila. eLife. 2020;9:e63368. doi: 10.7554/eLife.63368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Klattenhoff C, Xi H, Li C, Lee S, Xu J, Khurana JS, Zhang F, Schultz N, Koppetsch BS, Nowosielska A, Seitz H, Zamore PD, Weng Z, Theurkauf WE. The Drosophila HP1 homolog rhino is required for transposon silencing and pirna production by dual-strand clusters. Cell. 2009;138:1137–1149. doi: 10.1016/j.cell.2009.07.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kneuss E, Munafò M, Eastwood EL, Deumer U-S, Preall JB, Hannon GJ, Czech B. Specialization of the Drosophila nuclear export family protein nxf3 for pirna precursor export. Genes & Development. 2019;33:1208–1220. doi: 10.1101/gad.328690.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Larkin A, Marygold SJ, Antonazzo G, Attrill H, Dos Santos G, Garapati PV, Goodman JL, Gramates LS, Millburn G, Strelets VB, Tabone CJ, Thurmond J, FlyBase Consortium FlyBase: updates to the Drosophila melanogaster knowledge base. Nucleic Acids Research. 2021;49:D899–D907. doi: 10.1093/nar/gkaa1026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Larracuente AM, Presgraves DC. The selfish segregation distorter gene complex of Drosophila melanogaster. Genetics. 2012;192:33–53. doi: 10.1534/genetics.112.141390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Le Thomas A, Stuwe E, Li S, Du J, Marinov G, Rozhkov N, Chen Y-CA, Luo Y, Sachidanandam R, Toth KF, Patel D, Aravin AA. Transgenerationally inherited pirnas trigger pirna biogenesis by changing the chromatin of pirna clusters and inducing precursor processing. Genes & Development. 2014;28:1667–1680. doi: 10.1101/gad.245514.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Lee TI, Johnstone SE, Young RA. Chromatin immunoprecipitation and microarray-based analysis of protein location. Nature Protocols. 2006;1:729–748. doi: 10.1038/nprot.2006.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Maksimenko O, Bartkuhn M, Stakhov V, Herold M, Zolotarev N, Jox T, Buxa MK, Kirsch R, Bonchuk A, Fedotova A, Kyrchanova O, Renkawitz R, Georgiev P. Two new insulator proteins, pita and ZIPIC, target CP190 to chromatin. Genome Research. 2015;25:89–99. doi: 10.1101/gr.174169.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Maksimenko O, Kyrchanova O, Klimenko N, Zolotarev N, Elizarova A, Bonchuk A, Georgiev P. Small Drosophila zinc finger C2H2 protein with an N-terminal zinc finger-associated domain demonstrates the architecture functions. Biochimica et Biophysica Acta. Gene Regulatory Mechanisms. 2020;1863:194446. doi: 10.1016/j.bbagrm.2019.194446. [DOI] [PubMed] [Google Scholar]
  46. Markstein M, Pitsouli C, Villalta C, Celniker SE, Perrimon N. Exploiting position effects and the gypsy retrovirus insulator to engineer precisely expressed transgenes. Nature Genetics. 2008;40:476–483. doi: 10.1038/ng.101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Miller J, Stagljar I. Using the yeast two-hybrid system to identify interacting proteins. Methods in Molecular Biology. 2004;261:247–262. doi: 10.1385/1-59259-762-9:247. [DOI] [PubMed] [Google Scholar]
  48. Mohn F, Sienski G, Handler D, Brennecke J. The rhino-deadlock-cutoff complex licenses noncanonical transcription of dual-strand pirna clusters in Drosophila. Cell. 2014;157:1364–1379. doi: 10.1016/j.cell.2014.04.031. [DOI] [PubMed] [Google Scholar]
  49. Mohn F, Handler D, Brennecke J. Noncoding RNA. pirna-guided slicing specifies transcripts for zucchini-dependent, phased pirna biogenesis. Science. 2015;348:812–817. doi: 10.1126/science.aaa1039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Ni J-Q, Zhou R, Czech B, Liu L-P, Holderbaum L, Yang-Zhou D, Shim H-S, Tao R, Handler D, Karpowicz P, Binari R, Booker M, Brennecke J, Perkins LA, Hannon GJ, Perrimon N. A genome-scale shrna resource for transgenic rnai in Drosophila. Nature Methods. 2011;8:405–407. doi: 10.1038/nmeth.1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Niki Y, Yamaguchi T, Mahowald AP. Establishment of stable cell lines of Drosophila germ-line stem cells. PNAS. 2006;103:16325–16330. doi: 10.1073/pnas.0607435103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Ostapcuk V, Mohn F, Carl SH, Basters A, Hess D, Iesmantavicius V, Lampersberger L, Flemr M, Pandey A, Thomä NH, Betschinger J, Bühler M. Activity-dependent neuroprotective protein recruits HP1 and CHD4 to control lineage-specifying genes. Nature. 2018;557:739–743. doi: 10.1038/s41586-018-0153-8. [DOI] [PubMed] [Google Scholar]
  53. Ozata DM, Gainetdinov I, Zoch A, O’Carroll D, Zamore PD. PIWI-interacting rnas: small rnas with big functions. Nature Reviews Genetics. 2019;20:89–108. doi: 10.1038/s41576-018-0073-3. [DOI] [PubMed] [Google Scholar]
  54. Persikov AV, Singh M. De novo prediction of DNA-binding specificities for cys2his2 zinc finger proteins. Nucleic Acids Research. 2014;42:97–108. doi: 10.1093/nar/gkt890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Port F, Chen HM, Lee T, Bullock SL. Optimized CRISPR/cas tools for efficient germline and somatic genome engineering in Drosophila. PNAS. 2014;111:E2967–E2976. doi: 10.1073/pnas.1405500111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Radion E, Sokolova O, Ryazansky S, Komarov PA, Abramov Y, Kalmykova A. The integrity of pirna clusters is abolished by insulators in the Drosophila germline. Genes. 2019;10:E209. doi: 10.3390/genes10030209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. DeepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Robert X, Gouet P. Deciphering key features in protein structures with the new endscript server. Nucleic Acids Research. 2014;42:W320–W324. doi: 10.1093/nar/gku316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Roux KJ, Kim DI, Burke B, May DG. BioID: A screen for protein-protein interactions. Current Protocols in Protein Science. 2018;91:19. doi: 10.1002/cpps.51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Rozewicki J, Li S, Amada KM, Standley DM, Katoh K. MAFFT-DASH: integrated protein sequence and structural alignment. Nucleic Acids Research. 2016;47:W5–W10. doi: 10.1093/nar/gkz342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rust K, Byrnes LE, Yu KS, Park JS, Sneddon JB, Tward AD, Nystul TG. A single-cell atlas and lineage analysis of the adult Drosophila ovary. Nature Communications. 2020;11:5628. doi: 10.1038/s41467-020-19361-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Sabirov M, Kyrchanova O, Pokholkova GV, Bonchuk A, Klimenko N, Belova E, Zhimulev IF, Maksimenko O, Georgiev P. Mechanism and functional role of the interaction between CP190 and the architectural protein pita in Drosophila melanogaster. Epigenetics & Chromatin. 2021;14:16. doi: 10.1186/s13072-021-00391-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Saito K, Inagaki S, Mituyama T, Kawamura Y, Ono Y, Sakota E, Kotani H, Asai K, Siomi H, Siomi MC. A regulatory circuit for piwi by the large maf gene traffic jam in Drosophila. Nature. 2009;461:1296–1299. doi: 10.1038/nature08501. [DOI] [PubMed] [Google Scholar]
  64. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez J-Y, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Senti K-A, Brennecke J. The pirna pathway: a fly’s perspective on the guardian of the genome. Trends in Genetics. 2010;26:499–509. doi: 10.1016/j.tig.2010.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shapiro-Kulnane L, Bautista O, Salz HK. An RNA-interference screen in Drosophila to identify ZAD-containing C2H2 zinc finger genes that function in female germ cells. G3. 2021;11:jkaa016. doi: 10.1093/g3journal/jkaa016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Shpiz S, Ryazansky S, Olovnikov I, Abramov Y, Kalmykova A. Euchromatic transposon insertions trigger production of novel pi- and endo-sirnas at the target sites in the Drosophila germline. PLOS Genetics. 2014;10:e1004138. doi: 10.1371/journal.pgen.1004138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small rnas: the vanguard of genome defence. Nature Reviews. Molecular Cell Biology. 2011;12:246–258. doi: 10.1038/nrm3089. [DOI] [PubMed] [Google Scholar]
  69. Thiru A, Nietlispach D, Mott HR, Okuwaki M, Lyon D, Nielsen PR, Hirshberg M, Verreault A, Murzina NV, Laue ED. Structural basis of HP1/PXVXL motif peptide interactions and HP1 localisation to heterochromatin. The EMBO Journal. 2004;23:489–499. doi: 10.1038/sj.emboj.7600088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Usakin L, Abad J, Vagin VV, de Pablos B, Villasante A, Gvozdev VA. Transcription of the 1.688 satellite DNA family is under the control of RNA interference machinery in Drosophila melanogaster ovaries. Genetics. 2007;176:1343–1349. doi: 10.1534/genetics.107.071720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Vermaak D, Henikoff S, Malik HS. Positive selection drives the evolution of rhino, a member of the heterochromatin protein 1 family in Drosophila. PLOS Genetics. 2005;1:10009. doi: 10.1371/journal.pgen.0010009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Vermaak D, Malik HS. Multiple roles for heterochromatin protein 1 genes in Drosophila. Annual Review of Genetics. 2009;43:467–492. doi: 10.1146/annurev-genet-102108-134802. [DOI] [PubMed] [Google Scholar]
  73. Vojtek AB, Hollenberg SM. Ras-raf interaction: two-hybrid analysis. Methods in Enzymology. 1995;255:331–342. doi: 10.1016/s0076-6879(95)55036-4. [DOI] [PubMed] [Google Scholar]
  74. Wei X, Eickbush DG, Speece I, Larracuente AM. Heterochromatin-dependent transcription of satellite dnas in the Drosophila melanogaster female germline. eLife. 2021;10:e62375. doi: 10.7554/eLife.62375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Wu CI, Lyttle TW, Wu ML, Lin GF. Association between a satellite DNA sequence and the responder of segregation distorter in D. melanogaster. Cell. 1988;54:179–189. doi: 10.1016/0092-8674(88)90550-8. [DOI] [PubMed] [Google Scholar]
  76. Yu B, Cassani M, Wang M, Liu M, Ma J, Li G, Zhang Z, Huang Y. Structural insights into rhino-mediated germline pirna cluster formation. Cell Research. 2015;25:525–528. doi: 10.1038/cr.2015.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Yu B, Lin YA, Parhad SS, Jin Z, Ma J, Theurkauf WE, Zhang ZZ, Huang Y. Structural insights into rhino-deadlock complex for germline pirna cluster specification. EMBO Reports. 2018;19:e45418. doi: 10.15252/embr.201745418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of chip-seq (MACS) Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Zhang F, Wang J, Xu J, Zhang Z, Koppetsch BS, Schultz N, Vreven T, Meignin C, Davis I, Zamore PD, Weng Z, Theurkauf WE. UAP56 couples pirna clusters to the perinuclear transposon silencing machinery. Cell. 2012a;151:871–884. doi: 10.1016/j.cell.2012.09.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Zhang Z, Theurkauf WE, Weng Z, Zamore PD. Strand-specific libraries for high throughput RNA sequencing (RNA-seq) prepared without poly(A) selection. Silence. 2012b;3:9. doi: 10.1186/1758-907X-3-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhang X, Koolhaas WH, Schnorrer F. A versatile two-step CRISPR- and RMCE-based strategy for efficient genome engineering in Drosophila. G3: Genes, Genomes, Genetics. 2014a;4:2409–2418. doi: 10.1534/g3.114.013979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Zhang Z, Wang J, Schultz N, Zhang F, Parhad SS, Tu S, Vreven T, Zamore PD, Weng Z, Theurkauf WE. The HP1 homolog rhino anchors a nuclear complex that suppresses pirna precursor splicing. Cell. 2014b;157:1353–1363. doi: 10.1016/j.cell.2014.04.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Zhang G, Tu S, Yu T, Zhang X-O, Parhad SS, Weng Z, Theurkauf WE. Co-dependent assembly of Drosophila pirna precursor complexes and pirna cluster heterochromatin. Cell Reports. 2018;24:3413–3422. doi: 10.1016/j.celrep.2018.08.081. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Deborah Bourc'his 1

This paper reports fundamental insights into host-transposon interactions, and more specifically how specific genomic loci may be elected for producing anti-transposon piRNAs in Drosophila. The authors provide here compelling evidence that Kipferl, a ZAD zinc-finger protein, helps guide Rhino to G-rich motifs present in piRNA-producing loci in the female germline, demonstrating for the first time the existence of sequence-specific factors in piRNA biogenesis. The findings are of broad interest to the fields of heterochromatin and transposon biology.

Decision letter

Editor: Deborah Bourc'his1
Reviewed by: Zhao Zhang2

Our editorial process produces two outputs: i) public reviews designed to be posted alongside the preprint for the benefit of readers; ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Kevin Struhl as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Zhao Zhang (Reviewer #2).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

The reviewers have greatly enjoyed reading the manuscript and recognized the quality, novelty, and impact of the findings, supported by a large breadth of complementary approaches. How active piRNA-producing loci can emerge from heterochromatin and how they can be distinguished from other heterochromatic loci is an essential question, which more generally touches upon the question of the transcriptional activity of supposedly inert heterochromatin. Importantly, the identification of Kipferl may lead the path for the identification of other sequence-specific determinants of piRNA-producing loci, as H3K9me3-enrichment seems unlikely to be the sole feature allowing piRNA loci to be transcribed.

We identified five points that need further attention and are listed below. Most of them involve toning down some of the claims and /or integrating prior knowledge in the novel model, such as the epistatic position of Kipferl towards maternally deposited piRNAs in the specification of piRNA-producing loci. Please address them in a revised version of your manuscript. In addition, answer the individual comments of the reviewers in a detailed rebuttal letter.

1 – Kipferl was identified as a Rhino partner by yeast two-hybrid. However, independent confirmation by co-IP experiments seems to be missing. Please provide such experimental validation.

2 – Regarding the potential position of Kipferl at the interface of a genetic conflict, reviewers were interested to know whether Kipferl was conserved outside of melanogaster and more particularly, in simulans, and if so, whether some rapidly evolving amino-acids could be identified between the two species.

3 – The claim that Kipferl directly binds to DNA should be toned down, as recruitment to Rhino-occupied loci through protein/protein interactions is equally possible at that stage, unless biochemical evidence is provided.

4 – Please integrate the role of maternally deposited piRNAs into the model of Rhino recruitment to piRNA loci, and where they stand regarding Kipferl. For example, are these two modes of recruitment temporally separated (maternal piRNAs being more in stem/progenitor cells of the female germline, while Kipferl may be acting in more differentiated female germ cells, and be therefore involved in maintenance rather than original specification)? Do maternal piRNAs and Kipferl activate different piRNA clusters? Or do they act redundantly? No specific experiments are required here, but the view(s) of the authors on that matter would be very useful to share, as maternal piRNA-dependent specification of piRNA-producing loci is a well-accepted model.

5 – The authors propose that Kipferl acts upstream from Rhino but the hierarchy between Kipferl and Rhino seems more complicated than this simple model, as Kipferl binding at piRNA loci can also be reduced or even completely gone in Rhino mutants. The authors indeed discuss that Rhino is required for ectopic Kipferl spreading to Satellite loci, but it seems that Rhino is also required to some extent for Kipferl to bind at canonical piRNA loci. The authors are not required to carry out a tethering experiment-as suggested by reviewer #2- but the text should be modulated accordingly.

Figure revisions:

– Please provide statistics in violin and box plots. Currently, the reader may have the impression that changes may be present but non-significant.

– Genomic coordinates have to be provided on UCSC genome tracks.

– Please homogenize your scales in figures of genome tracks, particularly when comparing different genotypes for a given locus (see Figures 1C and 5C for example).

Reviewer #1 (Recommendations for the authors):

1. Without Kipferl, H3K9me3 seems to mildly (but visibly) increase (Figure 3G) while H3K9me2 seems to mildly (but visibly) decrease (Figure S3G). Is there a statistically significant difference between the two genotypes for K9me2/3? Is there a correlation between change in H3K9me2/3 and change in Rhino binding and what is their correlation coefficient?

2. In Abstract, Intro and Discussion, authors describe the exciting possibility that Rhino enrichment on satDNA in kipferl-/- could reflect a conflict. Given that Rhino binds satDNA in wildtype, what about the alternative possibility that it is merely a passive consequence of Rhino having to re-localize to Kipferl-independent loci in kipferl-/-? Or is there a disproportionate, higher-than-expected enrichment onto satDNA, with respect to 42AB and 38C, upon loss of Kipferl? Is there an increase in 42AB and 38C transcripts in kipferl-/- by RNA FISH? What makes up the remaining 79% piRNAs when 21% are satDNA piRNAs at line 307?

3. It is unclear which sequencing experiment is done with replicates and how many replicates, which should be clearly described. For replicated experiment, is a representative dataset OR the average of replicates shown? In sequencing experiments without replicates, how could one be convinced of the result?

4. In all confocal images, is only one z-slice shown? Other than Figure 3C, Rhino seems to have additional foci besides the pastry domain at the absence of Kipferl (e.g., Figures 6D, S3C, S3H). Showing the z-projection for single nurse cells might make statements about Rhino localization more convincing, especially the claim that kipferl, nxf3 double mutants restore Rhino foci that are absent in kipferl-/-.

5. The correlation coefficient with respective p-value should be reported whenever a correlation argument is made throughout the manuscript. e.g., lines 161, 347 and throughout Figures7 and S7.

6. Text description should not exaggerate the result. e.g., "every" at line 556. Text often says Kipferl is important for most/majority of Rhino domains, could it be supported by actual numbers (# out of total #, either # domain or # 1kb tiles)? Given that most active piRNA loci 42AB and 38C, as well as telomeric transposons, are all Kipferl-independent, one wonders if Kipferl really controls most/majority of Rhino domains. Since there are many Rhino-only and Kipferl-only domains/1kb tiles, listing actual numbers helps readers contextualize the relationship between Rhino and Kipferl as well as the prevalence of exceptions.

Reviewer #2 (Recommendations for the authors):

This is a compelling story to establish the function of Kipferl in recruiting Rhino to the piRNA-producing loci in Drosophila ovarian genome. I enjoyed reading and considering it in the past few days. I felt that there are some questions still unanswered--particularly how Rhino is recruited to 42AB. However, given the authors already generated massive amount of data and the complexity of Rhino recruiting mechanisms, I don't think it would be fair to ask for an answer here. Here are a few suggestions:

1. I think the authors should comment on the potential homolog of Kipferl in other Drosophila species. Rhino appears to be rapidly evolving, even simulans Rhino could not be compatible with the melanogaster piRNA machineries. Does Kipferl homolog exist in simulans. If yes, how different it is with the melanogaster Kipferl, especially the amino acids involving in Rhino and DNA binding. Answering these questions could potentially broaden the impact of this story.

2. Line159, H3K9me3 or H3K9me2?

3. I am confused about the message from Line 365-367. Do you mean Rhino peaks towards the periphery?

4. Line 452: motives or motifs.

5. The last paragraph is way too long. It is good to only deliver one message per paragraph.

Reviewer #3 (Recommendations for the authors):

The manuscript, although well written, is very dense with a lot of figures which sometimes makes it difficult to follow.

– In all the manuscript there is no mention of significance in the violin and box plots when the control KD and the kipferl KD are compared. As the manuscript now stands, the reader can have the impression that the changes might be important but non-significant.

– For each UCSC genome browser track, the genomic coordinates of the presented region should be added either on the figure or in the legend.

Figure Sup 3:

– Based on the figure legend, there are some inversions the E is the B, the B◊D, the D◊E.

– The picture of the FISH experiments of the 42AB and 38C are not really convincing. Furthermore, it is surprising not to see any 80F FISH experiments.

– Figure 4D. the name of the TEs and the presence in cluster 42AB, 38C is very informative. It should also be added for the Kipferl dependent TE piRNAs. The presence in the cluster 80F should also be described.

– The scale of the UCSC genome browser tracks are variable. For a given locus it is important to keep the same scale between different genotypes. Figure 1C, considering the heterochromatic region and the Cluster 80F, the Rhino ChIP-seq track is at 300 for the w1118 and MTD-Gal4 genotypes whereas it is at 150 for the Iso1 genotype. It is the same problem in Figure 5C for the Kipferl ChIP-seq results on the 42AB cluster.

eLife. 2022 Oct 4;11:e80067. doi: 10.7554/eLife.80067.sa2

Author response


Essential revisions:

1 – Kipferl was identified as a Rhino partner by yeast two-hybrid. However, independent confirmation by co-IP experiments seems to be missing. Please provide such experimental validation.

We have attempted this experiment several times using various buffer conditions. A co-IP of Rhino and Kipferl has proven to be very challenging. In our experience, most of Kipferl is tightly associated with chromatin and cannot be solubilized under buffer conditions that are compatible with the Rhino interaction. Moreover, co-expression of the two proteins in S2 cells for pull-down Western experiments did not allow any conclusions, as we observed background binding of Kipferl to the beads. However, we have conducted a complementary experiment that strongly supports that Rhino and Kipferl do not only interact in a heterologous yeast two-hybrid assay, but that the two proteins are also in very tight physical proximity in vivo. Briefly, a TurboID experiment (taking advantage of GFP-tagged Rhino in flies co-expressing the TurboID Biotin ligase fused to a GFP nanobody) recovered the complete set of known Rhino interactors, and in addition CG2678/Kipferl. As a control, we used flies expressing nuclear GFP alone alongside TurboID-GFP_nanobody (in a triplicate experiment coupled to label free quantitative mass spectrometry). We added this important complementary evidence for a Rhino-Kipferl interaction to Figure 2.

2 – Regarding the potential position of Kipferl at the interface of a genetic conflict, reviewers were interested to know whether Kipferl was conserved outside of melanogaster and more particularly, in simulans, and if so, whether some rapidly evolving amino-acids could be identified between the two species.

Based on sequence conservation and gene synteny, Kipferl orthologs can be confidently identified in several other Drosophilid species, including D. simulans. We added the protein sequence alignment of Kipferl paralogs to Figure 2 Figure supplement 1. As far as we can tell, there are no obvious signs for rapidly evolving amino acids in the region that underlies the Rhino interaction (ZnF#4). We also did not find clear signs of positive selection within Kipferl when analyzing D. melanogaster natural strains. The absence of signs for positive selection has been mentioned in the previous Discussion and we slightly expanded that statement (last paragraph). In the future, it will be interesting to perform genetic swap experiments with Kipferl orthologs from other species.

3 – The claim that Kipferl directly binds to DNA should be toned down, as recruitment to Rhino-occupied loci through protein/protein interactions is equally possible at that stage, unless biochemical evidence is provided.

We agree that this is formally possible. We do, however, like to point out that Kipferl does bind chromatin also independently of Rhino (at Kipferl-only loci), foremost based on the data where we expressed Kipferl in the OSC line that does not express detectable Rhino. We toned down the argumentation in the manuscript accordingly throughout the text relating to Figures 5 and 6.

4 – Please integrate the role of maternally deposited piRNAs into the model of Rhino recruitment to piRNA loci, and where they stand regarding Kipferl. For example, are these two modes of recruitment temporally separated (maternal piRNAs being more in stem/progenitor cells of the female germline, while Kipferl may be acting in more differentiated female germ cells, and be therefore involved in maintenance rather than original specification)? Do maternal piRNAs and Kipferl activate different piRNA clusters? Or do they act redundantly? No specific experiments are required here, but the view(s) of the authors on that matter would be very useful to share, as maternal piRNA-dependent specification of piRNA-producing loci is a well-accepted model.

This is indeed an important point. Our data do not allow strong conclusions at this point. We extended this part of the discussion to reflect our current view on this topic (second paragraph).

5 – The authors propose that Kipferl acts upstream from Rhino but the hierarchy between Kipferl and Rhino seems more complicated than this simple model, as Kipferl binding at piRNA loci can also be reduced or even completely gone in Rhino mutants. The authors indeed discuss that Rhino is required for ectopic Kipferl spreading to satellite loci, but it seems that Rhino is also required to some extent for Kipferl to bind at canonical piRNA loci. The authors are not required to carry out a tethering experiment-as suggested by reviewer #2- but the text should be modulated accordingly.

The reviewers are correct in pointing out that our data is not consistent with a strict Kipferl -> Rhino hierarchy. This is as Rhino has a clear impact on Kipferl’s chromatin binding capacity as well. As we do find Kipferl-only loci but not any clear Rhino-only loci, we argue that Kipferl is capable of binding DNA/chromatin in the absence of Rhino, but that Rhino has a strong stabilizing and spreading impact on Kipferl at sites that are within an H3K9-methyl domain. It is, however, important to point out that Rhino is capable of binding chromatin also independently of Kipferl in situations, where Kipferl is experimentally deleted (kipferl mutant) or in early stages of oogenesis where Kipferl is barely detectable. We added specific comments to this in the text (Results relating to Figure 5 and paragraph 3 of the Discussion).

Figure revisions:

– Please provide statistics in violin and box plots. Currently, the reader may have the impression that changes may be present but non-significant.

We added the mean of the fold change for each violin plot (biological effect size), as well as the statistical significance as calculated via student’s t-test (for ChIP-seq enrichments) or Wilcoxon signed-rank test (for sRNA-seq fold changes).

– Genomic coordinates have to be provided on UCSC genome tracks.

We added the coordinates of each screenshot below the locus name in the various figures.

– Please homogenize your scales in figures of genome tracks, particularly when comparing different genotypes for a given locus (see Figures 1C and 5C for example).

We agree with this overall comment. For Figure 1, however, we think it makes more sense to leave the data display as is and to add a comment in the text saying that “iso1 has the same pattern, but at some loci shows Rhino ChIP-seq signal at a lower level”. We speculate that this might be due to the fact that the kipferl locus in iso1 differs from the kipferl locus in all examined experimental fly strains as it lacks part of the linker between the two zinc finger arrays (this is mentioned in the method section, line 810-816). For Figure 5, we made the y-axis scale consistent between genotypes. We also added panels with a zoom-in to show the remaining Kipferl enrichment pattern in better detail.

Reviewer #1 (Recommendations for the authors):

1. Without Kipferl, H3K9me3 seems to mildly (but visibly) increase (Figure 3G) while H3K9me2 seems to mildly (but visibly) decrease (Figure S3G). Is there a statistically significant difference between the two genotypes for K9me2/3? Is there a correlation between change in H3K9me2/3 and change in Rhino binding and what is their correlation coefficient?

The reviewer is right. Our data indicate a small shift from H3K9me2 to me3 upon depletion of Kipferl. However, this was not limited to Rhino-bound genomic tiles. In the absence of more quantitative experiments using spike-in normalization approaches, we refrain from making a strong conclusion from this. Our key argument is that Rhino chromatin occupancy is lost at most sites despite high levels of remaining H3K9-methylation.

2. In Abstract, Intro and Discussion, authors describe the exciting possibility that Rhino enrichment on satDNA in kipferl-/- could reflect a conflict. Given that Rhino binds satDNA in wildtype, what about the alternative possibility that it is merely a passive consequence of Rhino having to re-localize to Kipferl-independent loci in kipferl-/-? Or is there a disproportionate, higher-than-expected enrichment onto satDNA, with respect to 42AB and 38C, upon loss of Kipferl? Is there an increase in 42AB and 38C transcripts in kipferl-/- by RNA FISH? What makes up the remaining 79% piRNAs when 21% are satDNA piRNAs at line 307?

Other than piRNAs originating from clusters 42AB or 38C (whose levels remain unchanged or decrease slightly), piRNAs derived from several Satellites increase strongly (up to 10-fold) although total piRNA levels in kipferl mutant ovaries decrease by about 40%. We did not observe increased RNA FISH signal for 42AB and 38C in the kipferl mutants compared to wildtype, while a strong increase in RNA FISH signal is seen for Satellites. This argues in favor of sequestration of the piRNA cluster machinery to Satellites, rather than just a distribution of free Rhino to remaining loci capable of binding Rhino. The remaining 80% piRNAs in kipferl mutants are somatic piRNAs (~20%) and piRNAs derived from Rhino dependent source loci where Kipferl is less important (e..g. cluster 42AB, cluster 38C, telomeres).

3. It is unclear which sequencing experiment is done with replicates and how many replicates, which should be clearly described. For replicated experiment, is a representative dataset OR the average of replicates shown? In sequencing experiments without replicates, how could one be convinced of the result?

We specified the number of biological replicates in the figure legends throughout the manuscript and provide now additional data (Figure 4 - Figure supplement 1C) for full mutant sRNA-seq data (control, kipferl -/- and rhino -/-) to complement the missing replicates for the previous analysis. These results confirm our analysis from the RNAi-mediated knockdown samples.

4. In all confocal images, is only one z-slice shown? Other than Figure 3C, Rhino seems to have additional foci besides the pastry domain at the absence of Kipferl (e.g., Figures6D, S3C, S3H). Showing the z-projection for single nurse cells might make statements about Rhino localization more convincing, especially the claim that kipferl, nxf3 double mutants restore Rhino foci that are absent in kipferl-/-.

All confocal images are no more than 5-10 z-sections of 0.2 μm distance and are therefore displaying only a part of the entire nucleus. This is stated now in the methods section (relating to RNA FISH and immunofluorescence stainings). In kipferl mutant nurse cells, the large peri-nuclear accumulations (the ‘Kipferl’) is the most prominent, but not the only visible Rhino domain. Regarding the Nxf3 comment: This might have been misleading in the text (now formulated more clearly, text relating to Figure 3, last paragraph): In nxf3-kipferl double mutants, no Rhino domains are restored. It is just that the Kipferl-structure is not anymore at the nuclear periphery, but rather internalized.

5. The correlation coefficient with respective p-value should be reported whenever a correlation argument is made throughout the manuscript. e.g., lines 161, 347 and throughout Figures7 and S7.

We changed the wording in lines 161 (now 164) and 347 (now 404-405) and added coefficients throughout the text relating to Figure 7.

6. Text description should not exaggerate the result. e.g., "every" at line 556. Text often says Kipferl is important for most/majority of Rhino domains, could it be supported by actual numbers (# out of total #, either # domain or # 1kb tiles)? Given that most active piRNA loci 42AB and 38C, as well as telomeric transposons, are all Kipferl-independent, one wonders if Kipferl really controls most/majority of Rhino domains. Since there are many Rhino-only and Kipferl-only domains/1kb tiles, listing actual numbers helps readers contextualize the relationship between Rhino and Kipferl as well as the prevalence of exceptions.

In our data, Kipferl in fact localizes to every Rhino domain in wildtype ovaries (see Figure 2), even though it does not bind upstream of Rhino at all of these sites. When using binary cutoffs of peak-calling algorithms, 97% of all analyzable Rhino domains (in kb) are dependent on Kipferl. Of these, 65% are found within 5 kb up or downstream of a Rhino-independent Kipferl peak.

Reviewer #2 (Recommendations for the authors):

This is a compelling story to establish the function of Kipferl in recruiting Rhino to the piRNA-producing loci in Drosophila ovarian genome. I enjoyed reading and considering it in the past few days. I felt that there are some questions still unanswered--particularly how Rhino is recruited to 42AB. However, given the authors already generated massive amount of data and the complexity of Rhino recruiting mechanisms, I don't think it would be fair to ask for an answer here. Here are a few suggestions:

1. I think the authors should comment on the potential homolog of Kipferl in other Drosophila species. Rhino appears to be rapidly evolving, even simulans Rhino could not be compatible with the melanogaster piRNA machineries. Does Kipferl homolog exist in simulans. If yes, how different it is with the melanogaster Kipferl, especially the amino acids involving in Rhino and DNA binding. Answering these questions could potentially broaden the impact of this story.

Please see our response to the major point #2 above.

2. Line159, H3K9me3 or H3K9me2?

Here we are indeed talking about H3K9me2. We adjusted the text to make this more clear.

3. I am confused about the message from Line 365-367. Do you mean Rhino peaks towards the periphery?

Within the large piRNA cluster 42AB we observe a restriction of Rhino toward more central regions of the cluster upon loss of Kipfer, with a reduction of Rhino towards the boundaries of the cluster. It is at these more peripheral sites (where Rhino depends on Kipferl even at cluster 42AB) that we observe small Kipferl enrichment peaks in the rhino mutant Kipferl ChIP-seq data set. These low, but detectable Kipferl binding sites likely contribute to the stabilization of Rhino on these more “peripheral” regions of the cluster. We adjusted the text to convey this better (lines 427-436).

4. Line 452: motives or motifs.

All instances were changed to motifs.

5. The last paragraph is way too long. It is good to only deliver one message per paragraph.

We split this paragraph into two.

Reviewer #3 (Recommendations for the authors):

The manuscript, although well written, is very dense with a lot of figures which sometimes makes it difficult to follow.

– In all the manuscript there is no mention of significance in the violin and box plots when the control KD and the kipferl KD are compared. As the manuscript now stands, the reader can have the impression that the changes might be important but non-significant.

We added statistical significance parameters to the relevant plots as well as mean effect sizes.

– For each UCSC genome browser track, the genomic coordinates of the presented region should be added either on the figure or in the legend.

Coordinates have been added.

Figure Sup 3:

– Based on the figure legend, there are some inversions the E is the B, the B◊D, the D◊E.

This has been corrected.

– The picture of the FISH experiments of the 42AB and 38C are not really convincing. Furthermore, it is surprising not to see any 80F FISH experiments.

The piRNA cluster FISH experiments aim to establish that the large Rhino accumulations observed in the absence of Kipferl do not correspond to the remaining Rhino-bound piRNA cluster loci. To that end the presented images were selected to depict the large Rhino-GFP domain in the same z plane with foci for both clusters. Due to photo bleaching effects we are not able to image deeper stacks, and likely miss weak Rhino signal that would be expected at 38C and 42AB foci. We believe that the low levels of Rhino that remain at 42AB and especially at 38C are another reason why we don’t see stronger overlap. As cluster 80F lacks Rhino occupancy as well as piRNAs, we did not see the importance of a dedicated cluster 80F FISH experiment.

– Figure 4D. the name of the TEs and the presence in cluster 42AB, 38C is very informative. It should also be added for the Kipferl dependent TE piRNAs. The presence in the cluster 80F should also be described.

We added the information about cluster 80F to the existing panel, as well as the cluster information about the top 20 elements losing antisense piRNA counts in the kipferl knock down.

– The scale of the UCSC genome browser tracks are variable. For a given locus it is important to keep the same scale between different genotypes. Figure 1C, considering the heterochromatic region and the Cluster 80F, the Rhino ChIP-seq track is at 300 for the w1118 and MTD-Gal4 genotypes whereas it is at 150 for the iso1 genotype. It is the same problem in Figure 5C for the Kipferl ChIP-seq results on the 42AB cluster.

We changed this. Please also see our comment above (main revision requirements).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Brennecke J. 2022. The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters. NCBI Gene Expression Omnibus. GSE202468 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Figure 2—source data 1. Imputed enrichment and p-values (n=3) for quantitative mass spectrometry data underlying Figure 2B.
    Figure 4—source data 1. Normalized small RNA-seq counts and their mappings to transposable element sequences.
    Supplementary file 1. Chromosomal coordinates of major heterochromatic and euchromatic compartments used in this study.
    Supplementary file 2. Putative protein-protein interactors of Rhino identified through a yeast two-hybrid screen.
    elife-80067-supp2.xlsx (9.1KB, xlsx)
    Supplementary file 3. Sequences of RNA FISH probes used in this study.
    elife-80067-supp3.xlsx (15KB, xlsx)
    Supplementary file 4. Consensus sequences of transposable elements analyzed in this study.
    elife-80067-supp4.xlsx (186.9KB, xlsx)
    Supplementary file 5. Chromosomal coordinates of stand-alone transposon insertions identified in the MTD-Gal4 background that are absent in the reference genome.
    elife-80067-supp5.xlsx (17.3KB, xlsx)
    MDAR checklist

    Data Availability Statement

    Sequencing data sets have been deposited to the NCBI GEO archive (GSE202468). All fly strains generated for this study are available via the VDRC. All genome-wide sequencing data (ChIP-seq, small RNA-seq, RNA-seq) can be browsed at https://genome-euro.ucsc.edu/s/balisa/dm6_elife220506.

    The following dataset was generated:

    Brennecke J. 2022. The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters. NCBI Gene Expression Omnibus. GSE202468


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES