Abstract
To limit transposable element (TE) mobilization, most eukaryotes have evolved small RNAs to silence TE activity via homology-dependent mechanisms. Small RNAs, 20–30 nucleotides in length, bind to PIWI proteins and guide them to nascent transcripts by sequence complementarity, triggering the recruitment of histone methyltransferase enzymes on chromatin to repress the transcriptional activity of TEs and other repeats. In the ciliate Paramecium tetraurelia, 25-nt scnRNAs corresponding to TEs recruit Polycomb Repressive Complex 2 (PRC2), and trigger their elimination during the formation of the somatic nucleus. Here, we sequenced sRNAs during the entire sexual cycle with unprecedented resolution. Our data confirmed that scnRNAs are produced from the entire germline genome, from TEs and non-TE sequences, during meiosis. Non-TE scnRNAs are selectively degraded, which results in the specific selection of TE-scnRNAs. We demonstrate that PRC2 is essential for the selective degradation of non-TE-scnRNAs, independently of its histone methyltransferase activity. We further show that the PRC2 cofactor Rf4 is required for the physical interaction between the scnRNA-binding protein Ptiwi09 and the zinc finger protein Gtsf1, pointing to an architectural role of PRC2 in scnRNA degradation.
Graphical Abstract
Graphical Abstract.
Introduction
Small silencing RNAs bound to Argonaute/Piwi proteins target RNAs for degradation. Conversely, endogenous RNAs are able to trigger degradation of their cognate small RNA. Instances of target-directed degradation of microRNAs have been reported in mammalian cells, Caenorhabditis elegans and Drosophila [1]. In ciliates, a different class of small RNAs, the scnRNAs, that resemble the metazoan Piwi-interacting RNAs (piRNAs), undergo selective degradation during developmentally programmed DNA elimination [2–6].
In the ciliate Paramecium tetraurelia, two types of nuclei coexist in the same cell: the germline micronucleus (MIC) and the somatic macronucleus (MAC). During the self-fertilization process of autogamy, the MIC undergoes meiosis and karyogamy to produce the zygotic nucleus. New MICs and new MACs develop from mitotic products of the zygotic nucleus (Fig. 1) [7]. The MAC contains a reduced genome (72 Mb) compared to the MIC genome (108 Mb) as a result of massive and reproducible genome elimination (∼30 Mb) that occurs at each sexual cycle [8–10]. Eliminated sequences include 45 000 internal eliminated sequences (IESs) that are remnants of transposable elements (TEs) scattered throughout the genome [10]. Other eliminated sequences (OES) [11] correspond to large regions comprising repeats such as TEs or satellites [9].
Figure 1.
The genome scanning model. The progression of nuclear developmental events during autogamy is represented above, and below, the molecular steps of the scanning model. (i) scnRNA biogenesis. During meiosis, 25-nt scnRNA are produced by the Dicer-like proteins Dcl2/3 from the whole MIC genome (MAC-destined sequences in black, MIC-specific sequences in orange) and are loaded onto Ptiwi01/09. (ii) scnRNA selection. Ptiwi–scnRNA complexes are transported to the maternal MAC. Homologous pairing between scnRNAs and MAC non-coding RNAs (ncRNA) leads to the degradation of MAC-scnRNAs (black) mediated by the zinc finger protein Gtsf1, and selection of MIC-specific scnRNAs (orange) (iii) scnRNA-guided histone mark deposition and DNA elimination. scnRNAs are transported to the new developing MAC, where they recognize homologous sequences by base-pairing interactions with nascent ncRNAs. Interaction between Ptiwi09 and PRC2 guides the deposition of H3K9me3 and H3K27me3 on MIC-specific sequences. These histone marks would guide the recruitment of the excision complex, allowing elimination of MIC-specific sequences (orange).
The sites of DNA elimination do not seem to be specified by the DNA sequence in the MIC genome because, beyond an invariable TA at IES extremities, no strictly conserved sequence motif has been identified in or around IESs. Instead, DNA elimination is epigenetically controlled by the maternal MAC. For a subset of IESs, the artificial introduction of an IES in the maternal MAC was shown to inhibit the elimination of the zygotic IES in the new MAC of the sexual progeny [12, 13]. To explain the sequence specificity of this maternal control, a model has been proposed whereby a crosstalk between the maternal MAC and the new MAC is achieved by a genome-wide sequence comparison mediated by RNA molecules [3]. Around 25-nt long scnRNAs produced from germline MIC transcripts by the Dicer-like proteins Dcl2 and Dcl3 [2, 4, 6] are bound to Ptiwi01/09 proteins and transported to the maternal MAC [14]. It is believed that scnRNAs are then sorted out by pairing interactions with nascent non-coding RNAs produced by the MAC genome (Fig. 1) [3, 15, 16]. Non-coding RNAs are thought to trigger the degradation of their cognate small RNA (MAC-scnRNAs), while scnRNAs corresponding to MIC-specific sequences (hereafter called MIC-scnRNAs), which by definition cannot pair with MAC transcripts, are thought to be retained. This selective degradation of MAC-scnRNAs would thus result in the specific selection of the subpopulation corresponding to MIC-scnRNAs. In support of this model, the requirement of complementary MAC transcripts for scnRNA selection has been directly demonstrated in Paramecium [16].
The complexes formed by MIC-scnRNAs bound to Ptiwi01/09 are transported from the maternal MAC to the new MAC. In a mechanism very similar to that described in fungi and metazoans [17], scnRNA/Piwi complexes pair to complementary nascent transcripts [18] and guide heterochromatin formation in the new MAC, thereby providing specificity despite the lack of conserved motifs on eliminated sequences (Fig. 1). A physical interaction between Ptiwi09 and the Polycomb Repressive Complex 2 (PRC2-Ezl1) was recently shown to mediate the establishment of heterochromatin at scnRNA-targeted regions in the new MAC (Fig. 1) [19, 20]. PRC2-Ezl1 comprises a core complex and additional cofactors; the catalytic subunit Ezl1 trimethylates histone H3 on lysines 27 and 9 (H3K27me3 and H3K9me3) in the new MAC [21]. Deposition of these repressive marks is essential for the elimination of TEs and of 70% of IESs [21, 22]. Both H3K27me3 and H3K9me3 marks are associated with TEs [21]. A second class of sRNAs called iesRNAs (26–29 nt), produced from excised IES transcripts by a distinct Dicer protein, Dcl5, and bound to Ptiwi10/11, is reported to enhance the excision of a subset of IESs [4, 23]. Both classes of sRNAs appear to act synergistically in the new MAC. Indeed, RNAi-mediated co-depletion of the 3 Dicer proteins Dcl2, Dcl3, and Dcl5 inhibited the excision of 50% of IESs, while depletion of Dcl5 alone or co-depletion of Dcl2 and Dcl3 inhibited the excision of a much smaller subset of IESs (5% and 7% of IESs, respectively) [4, 22].
Although scnRNA selection is a key process in the regulation of DNA elimination, its molecular mechanism remains elusive. We and others recently identified the Paramecium zinc-finger protein Gtsf1, which associates with Ptiwi09, as essential for the degradation of MAC-scnRNAs [24, 25]. To further understand the mechanism of scnRNA selection, we sequenced small RNAs during the entire sexual cycle of autogamy with unprecedented resolution. Our results provide a comprehensive picture of the dynamics of scnRNAs and of their timing of production. Thanks to these data, we confirm recent reports that PRC2 and its cofactors are required for scnRNA selection, thus making an essential contribution to DNA elimination earlier than previously realized. We further show that scnRNA selection relies on a histone methyltransferase-independent function of PRC2, demonstrating a novel mode of action for PRC2 in small RNA biology. Our data point to an architectural role of PRC2 in bringing together Ptiwi09 and Gtsf1.
Materials and methods
Paramecium strains, cultivation, and autogamy
Experiments were carried out with the entirely homozygous strain 51 of P. tetraurelia (wild-type or mutants nd7-1). We used two Rdr mutant strains isolated in [26]. The rdr1-5.28 line corresponds to a putative null allele for Rdr1, containing a frameshift mutation leading to a premature stop codon upstream of the catalytic domain. The rdr2–1.24 line is a strong hypomorphic allele that retains an intragenic IES (located downstream of the conserved RdRP domain region), due to a T-to-A mutation in one of the boundaries required for excision during macronuclear development. Cells were grown in wheat grass powder (WGP) infusion medium bacterized the day before use with Klebsiella pneumoniae, unless otherwise stated, and supplemented with 0.8 μg/mL β-sitosterol. Cultivation and autogamy (self-fertilization process) were carried out at 27°C as described [27, 28].
Gene silencing experiments
For T7Pol-driven dsRNA production in silencing experiments plasmids L4440 with fragments of each gene and Escherichia coli strain HT115 DE3 were used. Sequences used for silencing of ICL7a, EZL1, SUZ12.like, RF2, RF4, EAP1, DCL2, DCL3, GTSF1 and EMA1 were segments 1–580 of PTET.51.1.G0700039 (ICL7a); 989–1501 of PTET.51.1.G1740049 (EZL1); 590–946 of PTET.51.1.G0190277 (SUZ12.like); 1514–1916 of PTET.51.1.G1190062 (RF2); 623–1038 of PTET.51.1.G0570234 (RF4); 298–486 of PTET.51.1.G1310069 (EAP1); 434–1032 of PTET.51.1.G0210241 (DCL2); 630–1468 of PTET.51.1.G0990073 (DCL3); and 249–479 of PTET.51.1.G0490019 (GTSF1) (pOC19) as previously published [2, 19, 22, 24, 29]. For silencing of PTIWI01/09, segments 51–439 of PTET.51.1.G0660118 (PTIWI09) and 41–441 of PTET.51.1.G0710112 (PTIWI01) (pMG11) were cloned between T7 promoters in plasmid L4440. Preparation of silencing medium and RNAi during autogamy were performed as described in [30].
Transformation with tagged fusion transgenes
The GFP-PTIWI09, GFP-EZL1H526A, 3XFLAG-HA-EZL1, 3XFLAG-HA-PTIWI09 (pAH30), 3XFLAG-HA-GTSF1 (pOC17) transgenes were previously described [21, 24, 31]. For the construction of the 3XFLAG-HA-PTIWI09DA mutant transgene (pMG18) the aspartic acid-coding triplet GCT from nucleotide 1674 to 1676 (D541) of PTET.51.1.G0660118 was replaced by GAT (alanine). pAH30 and pMG18 are RNAi-resistant. For the construction of H3K27M mutant transgenes, the lysine-coding triplet AAA from nucleotide 85 to 87 (K27) was replaced by ATG (methionine) in the H3P1 (PTET.51.1.G1450006) and in the H3P3 (PTET.51.1.G0270094) genes. Each gene was expressed under the control of their transcription signals (promoter and 3′UTR). The H3P1 (PTET.51.1.G1450006) (pAH2) transcription signals contains 150-bp upstream and 192-bp downstream of its open reading frame. The H3P3 (PTET.51.1.G0270094) (pAH11) transcription signals contains 89-bp upstream and 147-bp downstream of its open reading frame. For the H3P3K27M-GFP construct (pAH13), a GFP tag codon-optimized for the P. tetraurelia genetic code (same as GFP-PTIWI09 and GFP-EZL1H526A) was added to the 3′ end of the gene. Plasmids carrying the transgenes were linearized with restriction enzymes and microinjected into the MAC of vegetative 51 cells in the MAC.
To measure the copy number of the transgene per haploid genome (cphg), the transgene copy number is normalized to the endogenous EZL1 gene copy number by qPCR. qPCR was performed using LightCycler® 480 SYBR Green I Master (catalog number 04 707 516 001, Roche) on the Light Cycler 480 system (Roche). qPCR amplification was done with primers listed in Supplementary Table S1.
Production of sexual progeny
Lethality after autogamy was assessed by transferring 30–60 individual post-autogamous cells to standard growth medium and measuring their ability to resume vegetative growth. Cells with a functional new MAC were identified as normally growing survivors unable to undergo a novel round of autogamy if starved after ∼8 divisions.
Indirect immunofluorescence and GFP localization
For immunofluorescence, cells were fixed as previously described [21] and incubated overnight at room temperature with primary antibodies as follows: rabbit anti-H3K9me3 (1:200) [21], rabbit anti-H3K27me3 (1:1000) [21] and mouse anti-FLAG (1:200) (Thermo Fisher Scientific Cat#MA1-91878; RRID: AB_1 957 945). Cells were labeled with Alexa Fluor 568-conjugated goat anti-rabbit IgG (Thermo Fisher Scientific Cat#A-11036; RRID: AB_10 563 566) or Alexa Fluor 568-conjugated goat anti-mouse IgG (Thermo Fisher Scientific Cat#A-11031; RRID: AB_144 696) at 1:500 for 1 h, stained with 1 μg/mL Hoechst for 5–10 min and finally mounted in Citifluor AF2 glycerol solution (Biovalley Cat#AF2-25). For GFP detection, cells were fixed for 15 min in PHEM 1X, 4% formaldehyde, 4% sucrose at room temperature and washed in Tris-buffered saline-Tween 20 0.1% (TBST). Hoechst staining was performed with 1 μg/mL Hoechst for 5–10 min. Cells were washed in TBST and mounted in Citifluor AF2 glycerol solution. Images were acquired using a Zeiss LSM 980 laser-scanning confocal microscope and a Plan-Apochromat 63 × /1.40 oil DIC M27 objective. Z-series were performed with Z-steps of 0.35 μm.
Immunoprecipitation
Paramecium nuclear protein extracts were performed as previously described [21]. Around 106 autogamous cells (T0) were lysed with a Potter-Elvehjem homogenizer in 3 volumes of lysis buffer (10 mM Tris pH 6.8, 10 mM MgCl2, 0.2% Nonidet P-40, 1 mM PMSF, 4 mM benzamidine, 1x Complete EDTA-free Protease Inhibitor Cocktail tablets (Roche)). The nuclei-containing pellet was collected by centrifugation and washed with the addition of 2.5 volumes of washing solution (0.25 M sucrose, 10 mM MgCl2, 10 mM Tris pH 7.4, 1 mM PMSF, 4 mM benzamidine, 1x Complete EDTA-free Protease Inhibitor Cocktail tablets (Roche)). The pellet was incubated in 1 volume of nuclear extraction buffer 2 × (100 mM Hepes pH 7.8, 100 mM KCl, 300 mM NaCl, 0.2 mM EDTA, 20% Glycerol, 2 mM DTT, 0.02% Nonidet P-40, 2 mM PMSF, 2x Complete EDTA-free Protease Inhibitor Cocktail tablets (Roche)) for 1 h at 4°C. The salt-extractable nuclear fraction at 150 mM NaCl was recovered following centrifugation for 3 min at 10 000 g at 4°C. Nuclear extracts were incubated overnight at 4°C with 150 μl anti-FLAG M2 magnetic beads (M8823, Sigma) that were pre-washed with 1 mL TEGN buffer (20 mM Tris pH 8, 0.1 mM EDTA, 10% Glycerol, 150 mM NaCl, 0.01% Nonidet P-40). Beads were washed five times with TEGN buffer and eluted with 3xFLAG peptide (F4799, Sigma-Aldrich) (45 μL) and the same volume of TEGN buffer at 4°C for 5 h. RNase I treatment in anti-FLAG Gtsf1 IP was performed as described in [19] except for the use of 2 mM Ribonucleoside Vanadyl Complex (NEB) instead of 4 mM benzamidine. After incubation of half of the beads with 200 U of RNase I (Thermo Fisher Scientific) for 5 h at 4°C, beads were washed three times with TEGN buffer and boiled in Laemmli sample buffer.
Mass spectrometry
Sample preparation
Gel plugs were washed with a destaining solution of ACN/NH4HCO3 50 mM (50/50), then reduced with Tris(2-Carboxyethyl) Phosphine Hydrochloride (TCEP-HCl) 10 mM and alkylated with S-Methyl methanethiosulphonate (MMTS) 20 mM. Proteins were then in gel-digested overnight at 37°C with 20 μL of 50 mM NH4HCO3 buffer containing 1 μg of sequencing-grade trypsin/Lys C mix (Promega). The digested peptides were loaded and desalted on evotips provided by Evosep (Odense, Denmark) according to manufacturer's procedure before LC–MS/MS analysis.
LC-MS/MS acquisition
Samples were analyzed on a timsTOF Pro 2 mass spectrometer (Bruker Daltonics, Bremen, Germany) coupled to an Evosep one system (Evosep, Odense, Denmark) operating with the 30SPD method developed by the manufacturer. Briefly, the method is based on a 44-min gradient and a total cycle time of 48 min with a C18 analytical column (0.15 × 150 mm, 1.9μm beads, ref EV-1106) equilibrated at 40°C and operated at a flow rate of 500 nL/min. H2O/0.1% FA was used as solvent A and ACN/ 0.1% FA as solvent B. The timsTOF Pro 2 was operated in Data-Dependent Analysis-Parallel Accumulation Serial Fragmentation (DDA-PASEF) mode over a 1.3-s cycle time. Mass spectra for MS and MS/MS scans were recorded between 100 and 1700 m/z.
Data analysis
MS raw files were processed using PEAKS Online 11 (build 1.9, Bioinformatics Solutions Inc.). Data were searched against the Paramecium tetraurelia database downloaded from ParameciumDB website (parameciumtetraurelia_mac_51_annotation_v2.0.protein.fasta, 40 460 entries). Parent mass tolerance was set to 20 ppm, with fragment mass tolerance to 0.05 Da. Specific tryptic cleavages were selected and a maximum of two missed cleavages were allowed. The following post-translational modifications were considered for identification: Oxidation (M), Deamidation (NQ), Acetylation (Protein N-term) as variable and Beta-methylthiolation (C) as fixed. Identifications were filtered based on a 1% FDR (False Discovery Rate) threshold at both peptide and protein group levels. Label-free quantification was performed using the PEAKS Online 11 quantification module, allowing a mass tolerance of 5 ppm, a CCS error tolerance of 0.01, and a 0.25-min retention time shift tolerance for match between runs. Protein abundance was inferred using the top N peptide method and TIC was used for normalization. Multivariate statistics on protein measurements were performed using Qlucore Omics Explorer 3.9 (Qlucore AB, Lund, SWEDEN). A positive threshold value of 1 was set to allow a log2 transformation of abundance data for normalization, i.e. all abundance data values below the threshold are replaced by 1 before transformation. The transformed data were finally used for statistical analysis, i.e. the evaluation of differentially present proteins between two groups using a bilateral Student’s t-test.
Western blot and silver staining
Electrophoresis and blotting were carried out according to standard procedures. Samples were run on 10% Tris-Glycine polyacrylamide gels (Bio-Rad). Blotting was performed for 1 h at 100 V using a nitrocellulose membrane (GE10600002, Merck). FLAG (1:1000) (MAI-91878, Thermo Fisher Scientific), α-tubulin (1:5000) (sc-8035, Santa Cruz), Paramecium H3 (1:5000) [21], HA (1:2000) (H6908, Merck), and Ezl1 (1:1000) [19] were used for primary antibodies. Secondary horseradish peroxidase-conjugated anti-mouse or anti-rabbit IgG antibodies (Promega) were used at 1:2500 dilution followed by detection by ECL. Alternatively, secondary fluorescent anti-mouse (DyLight 800, BioRad) or anti-rabbit (StarBright700) IgG antibodies were used at 1:2000 dilution.
For silver staining, samples were run on 4–20% Tris-Glycine extended polyacrylamide gels (Cat #4 568 095, BioRad) for 20 min at 200 V and the fast protocol of the SilverQuest Silver Staining Kit (Thermo) was used according the manufacturer’s instructions.
Small RNA isolation and immunoprecipitation
Around 200 mL of cells were collected and frozen at T0 after the onset of autogamy. FLAG-Ptiwi09 immunoprecipitation (IP) was performed as described in [14]. Cells were lysed using a dounce homogenizer in 2 mL lysis buffer (50 mM Tris HCL pH 8, 150 mM NaCl, 5 mM MgCl2, 1 mM DTT, 0.5% sodium deoxycholate, 2 mM vanadyl ribonuclease complex (VRC), 1% Triton X-100, 10% glycerol, 1x Complete EDTA-free Protease Inhibitor Cocktail tablets (Roche)). Around 1 mL of lysate was used to perform IP with 70 μL of anti-FLAG M2 magnetic beads (M8823, Sigma) (pre-washed five times in lysis buffer) overnight at 4°C. The unbound fraction was kept, and the beads were washed five times with IP buffer (50 mM Tris HCL pH 8, 150 mM NaCl, 5 mM MgCl2, 1 mM DTT, 0.5% sodium deoxycholate, 2 mM VRC, 1% Triton X-100, 10% glycerol, 1x Complete EDTA-free Protease Inhibitor Cocktail tablets (Roche)) and resuspended in 1 mL IP buffer. Around 1/10 of the resuspended beads was kept for western blot analysis. After removal of the supernatant, 300 μl of proteinase K (10 μg/mL) in Tris pH 7.5 (10 mM), EDTA pH 8 (5 mM), 0.5% SDS is added to the rest of the beads and incubated for 20 min at 42°C. RNA extraction was performed as previously described [21].
DNA extraction and PCR
DNA samples were prepared from ≤1000 autogamous cells using the NucleoSpin Tissue kit (Macherey-Nagel). For PCR analysis of IES excision, PCR amplifications were performed with Expand Long Template Enzyme mix (Expand Long Template PCR system, Roche). Primers are available in Supplementary Table S1.
RNA extraction, RT-PCR and sequencing
RNA samples were extracted from 200 to 400 mL cultures at 2000–4000 cells/mL as previously described [21] at different time points during autogamy. For RT-PCR, RNA samples were reverse-transcribed with RevertAid H Minus Reverse Transcriptase (Thermo Scientific) using random hexamer primers (Thermo Scientific) according to the manufacturer’s instructions. It was then followed by PCR amplifications in a final volume of 25 μL, with 10 pmol of each primer, 10 nmol of each dNTP and 2 U of DyNAzyme II DNA polymerase (Thermo Scientific). For sRNA-seq, small RNAs of 15–30 nt (or 10–30 nt for the analysis of the PTIWI09DA mutant and Ptiwi09 IP) were purified from total RNA on a 15% TBE/Urea gel. Small RNA libraries were constructed using the NEBNext Small RNA kit according to the manufacturer recommendations. The quality of the final libraries was assessed with an Agilent Bioanalyzer, using an Agilent High Sensitivity DNA Kit. Libraries were pooled in equimolar proportions and sequenced using a single read 75 pb run on an Illumina NextSeq500 instrument, using NextSeq 500 High Output 75 cycles kit.
Bioinformatic analyses
Sequencing data were demultiplexed using CASAVA (v1.8.2) and bcl2fastq2 (v2.18.12). Illumina adapters were removed using cutadapt (v1.12), only keeping reads with a minimal length of ten nucleotides. Paired-end reads were mapped on expected contaminants (mitochondrial genomes, ribosomal DNA, and bacterial genomes). sRNA-seq data were mapped on reference genomes using BWA (v0.7.15 -n 0). sRNA-seq data were analyzed as previously [32]. Sequencing metrics are available in Supplementary Table S2.
Reference genomes, software, and R packages
Sequencing reads were mapped on expected contaminants (mitochondrial genomes, ribosomal DNA, and bacterial genomes) then mapped on Paramecium genome references using BWA (v0.7.15) without any mismatches. The P. tetraurelia strain 51 reference genomes used in this study were: the MAC (ptetraurelia_mac_51.fa), the MAC + IES (ptetraurelia_mac_51_with_ies.fa) and the MIC (ptetraurelia_mic2.fa) [9, 33]. Gene annotation v2.0 (ptetraurelia_mac_51_annotation_v2.0.gff3), IES annotation v1 (internal_eliminated_sequence_PGM_ParTIES.pt_51.gff3) were also used [33, 34]. All files are available from the ParameciumDB download section (https://paramecium.i2bc.paris-saclay.fr/download/Paramecium/tetraurelia/51/) [34]. R (v4.0.4) packages were used to generate images (ggplot2 v3.3.5; ComplexHeatmap v2.6.2; GenomicRanges v1.42; rtracklayer v1.50).
sRNA quantification
Small RNA reads were successively mapped on the MAC, MAC + IES, and MIC genomes to attribute them to a specific genome compartment: MAC-destined sequence (MDS), IES, or OES. In the case of a silencing experiment, the 23-nt siRNA reads that map to the RNAi targets were removed. Read counts were normalized using the number of sequenced reads using a G + C content < 50%, compatible with a Paramecium G + C genomic content (∼27%).
Scaled genome coverage of scnRNAs
To analyse the evolution of sRNA genome coverage, the sequencing data were sampled to 15M reads between 20 and 30 nt long. After mapping on the MIC genome, only 25-nt sRNA were kept. The coverage of the MIC genome was calculated using bamCoverage (deeptools v2.0) and a bin window of 100 nt. Using this window coverage, the percentage of genome covered by at least 1X was determined for each sample.
sRNA clusters
Only sRNAs of 23-nt in length mapped on the MAC genome were considered. Sequencing data from all the wild-type autogamy time course were merged to stringently annotate 136 short read clusters (SRC) using the program ShortStack (v3.8.5 –nohp –mincov 1000 –pad 100) [35]. The boundaries of these SRCs were manually adjusted and compared to, already published, vegetative SRCs with a minimum overlap of 23-nt [36–38](see Supplementary Table S3). The number of 23-nt sRNAs were counted on each SRC, with the information regarding transcription orientation, and normalized by the total number of sequenced sRNAs (20–30 nt with GC < 50%) and their length (RPKM). A SRC and a gene were associated with a minimum overlap of 20 nt. K-means clustering (iter.max = 100) was used to define three groups of SRCs based on their RPKM coverage during wild-type autogamy time course experiments. siRNA transcription orientation ratio was calculated using counts obtained with SAMTools (samtools view). Unidirectional and bidirectional transcription ratios were calculated only for SRCs overlapping a gene and covered by at least five siRNA reads. mRNA RPKM coverage of SRCs was calculated using htseq-count (v0.11.2 stranded = no -mode = intersection-nonempty) on TopHat2 mappings (v2.1.1 –min-intron-length 15 –max-intron-length 100 –read-mismatch 1 –coverage-search) [39].
Results
Dynamics of scnRNAs during the sexual cycle
To get a comprehensive picture of scnRNA dynamics, we sequenced small RNA populations at 11 time points covering with unprecedented resolution the entire sexual cycle of autogamy (self-fertilization) (Fig. 2). The time course experiments covered the whole duration of pre- and post-zygotic nuclear events, including the time window during which DNA elimination occurs [40]. Small RNA populations were mapped with no mismatches against the reference genomes and subdivided in three categories according to their length: 25-nt for scnRNAs; 26–29 nt for iesRNAs and 23-nt for small interfering RNA (siRNAs) (Supplementary Fig. S1). (For the analysis of iesRNAs and siRNAs, see Supplementary Text, Supplementary Figs S1–S3, Supplementary Tables S3–S4).
Figure 2.
scnRNA dynamics during Paramecium sexual cycle. (A). Analysis of 25-nt-scnRNA populations in vegetative cells and at different time points during autogamy. Bar plots show the normalized counts, using the total number of sRNAs (see Materials and Methods), of 25-nt reads for each sample that are unmapped or map the MAC genome, IESs, or OES reference sequences (top) and the proportion of 25-nt reads for each category (middle). Genome: proportion of each category (MAC, OES, IES) in the MIC genome. Progression of autogamy is followed by cytology with Hoechst staining (bottom) and is not synchronous in the cell population [76]. Five different morphological states were scored for each autogamy time-course sample: “Vegetative” cells with two MICs and a single MAC; “Meiosis” is scored by observation of MIC division at different stages of meiosis I and II; “Fragmented maternal MAC” corresponds to maternal MAC into many small pieces. (Fragments are only completely lost, by dilution, during the first vegetative cell divisions after autogamy); clearly visible “developing new MACs”; at the end of autogamy, the first cellular division distributes the two new MACs to the daughter cells (“karyonides”). Two independent time course (tc1 and tc2) experiments are displayed: tc1 covers veg, t-2 to t0 and tc2 covers T0 to T45. Overlap between tc1 and tc2 is ensured by t0 and T0, which are alike. The numbers correspond to hours before (e.g. t-2) or after T0 (e.g. T5). At least 100 cells were scored for each time point by fluorescence microscopy. (B). Representative genomic region depicting 25-nt scnRNAs at different time points of autogamy. DNA-seq coverage (bin = 1) for the MIC and the MAC DNA samples on the MIC genome (NODE_1273_length_39 901_cov_28.471216 between 10 and 40 kb), and IES and TE annotations are indicated. ChIP-seq enrichment (deeptools bamCompare bin = 5) of H3K9me3 and H3K27me3 compared to the DNA Input sample from Pgm-depleted cells at T50 (ENA accessions ERS6678598, ERS6678600 and ERS6678597) [19] is displayed, indicating that the OES is enriched for both histone marks in the new developing MAC. scnRNA coverage (bin = 100) on the MIC genome at different time points of autogamy (T-2 to T45) is indicated at the bottom. To compare time points, the 25-nt sRNA coverages were calculated on equivalent read datasets (15e6 sRNA between 20 and 30-nt long). (C). Using the scaled normalized datasets presented in B., the percentage of the MIC genome covered by at least 1X of 25-nt scnRNA was calculated at each time point of autogamy.
25-nt scnRNAs begin to accumulate as soon as meiotic cells are detected in the culture (t-2) and persist until cells undergoing meiosis are no longer detected (T10) (Fig. 2A), in line with the idea that scnRNA biogenesis occurs during the meiotic phase.
Because the cell populations are not synchronous, however (Fig. 2A), we cannot determine with certainty which cells produce scnRNAs. We were able to overcome to some degree by taking advantage of an observation we made with a GFP-PTIWI09 transgene [31]. Cells were transformed with this fusion gene, whose expression is under the control of the endogenous sequences up- and downstream of PTIWI09, and sexual events were triggered by starvation. GFP fluorescence was detected in the cytoplasm and in the maternal MAC during meiosis I, before it concentrated in the new MAC, as previously reported [31]. However, we noted that, in transformants with high copy numbers of the transgene and high GFP-Ptiwi09 protein levels, as analyzed by western blot, 100% of cells were arrested in early meiosis I before each MIC yields four haploid nuclei (Supplementary Fig. S1). In these arrested cells, the GFP fluorescence was detected in the meiotic MICs, as well as in the cytoplasm and in the maternal MAC (Supplementary Fig. S1). This localization appears different from that of the other transformants that normally progress through autogamy (Supplementary Fig. S1), for which GFP was always excluded from the meiotic MIC [31]. We used cells expressing GFP-Ptiwi09 arrested in meiosis I as a tool to assess whether scnRNAs can be detected at this stage. Small RNA sequencing experiments showed that 25-nt scnRNAs accumulate in meiosis I-arrested cells expressing GFP-Ptiwi09 (Supplementary Fig. S1), suggesting that scnRNAs are produced prior to or during early meiosis I.
As previously reported from the sequencing of small RNAs from three time points (early conjugation time point in [6], and early and late developmental stages during autogamy in [4]), our dataset of 11 time points during autogamy confirmed that scnRNAs are produced from both strands (Supplementary Fig. S1) and cover the whole MIC genome (Fig. 2B and C). This can be clearly observed when examining the mapping of 25-nt scnRNAs across an individual genomic region (Fig. 2B), where MAC sequences, as well MIC sequences (IES or OES), are covered. Using the draft MIC genome assembly [9], we found that >88% of the MIC genome is covered by at least 1X of normalized sequencing depth, from the moment scnRNAs are detected (t-2) and until T5 (Fig. 2C). No bias in scnRNA coverage between IESs and MAC-destined sequences could be observed at the early time points (until T0) (Supplementary Table S5). We conclude that scnRNAs are produced from the whole MIC genome in Paramecium tetraurelia.
As autogamy proceeds, we observed an increase in the proportion of 25-nt scnRNAs mapping to OES or IES over scnRNAs mapping to MAC sequences (Fig. 2A-B). This increase started from T3 in our time course experiment, and by T20, scnRNAs mostly corresponded to MIC-specific (IES + OES) sequences (Fig. 2A). As illustrated for an individual genomic region, scnRNAs cover the entire OES region, that comprises annotated TEs, enriched in H3K9me3 and H3K27me3 histone marks in the new MAC, and the IESs scattered in the flanking regions (Fig. 2B). Genome-wide analysis of scnRNA coverage showed that it is rather homogeneous whatever the enrichment in histone marks (H3K27me3 and H3K9me3) at T0, as expected. As development proceeds (from T0 to T45), the correlation between scnRNA coverage and histone mark enrichment increases (Supplementary Fig. S1), in agreement with the data presented in Fig. 2B. As reported for one early and one late developmental time points [4], the enrichment in MIC-scnRNAs is consistent with the predicted effect of scnRNA selection, resulting from the degradation of MAC-scnRNAs, which correspond to regions that are not enriched in histone marks.
The slicer activity of Ptiwi09 is required for the removal of the passenger strand
Argonaute proteins can carry an endoribonuclease (slicer) activity responsible of the cleavage of the RNA complementary to the guide RNA loaded to the protein. The slicer activity is provided by a conserved catalytic tetrad composed of a DEDX (X is usually H or D) motif in the Piwi domain that adopts an RNaseH fold [41]. The scnRNA binding protein Ptiwi09 possesses a DEDH motif, suggesting it may carry a slicer activity [31]. Given that MAC-scnRNAs are thought to be degraded upon pairing with complementary RNA molecules (Fig. 1), we decided to investigate the role of Ptiwi09 slicer activity in scnRNA selection. We obtained Paramecium transformants expressing full-length, 3xFLAG-HA-tagged mutant Ptiwi09 carrying an aspartic acid-to-alanine substitution (D541A) in the first aspartic acid residue of the conserved DEDH motif within the PIWI domain, expected to abolish slicer activity [42–45]. Endogenous Ptiwi09 and Ptiwi01 protein expression was specifically depleted through RNA interference while the mutant 3xFLAG-HA-PTIWI09DA transgene was designed to be RNAi-resistant. In vivo complementation assay for three independent transformants expressing full-length, 3xFLAG-HA-tagged wild-type Ptiwi09 indicated that the 3xFLAG-HA-Ptiwi09WT (Ptiwi09WT) protein is functional and can rescue the lethality and DNA elimination defects associated with Ptiwi01/09 depletion (Fig. 3A and Supplementary Fig. S4). In contrast, the Ptiwi09DA mutant protein is unable to complement these defects in two independent transformants (Fig. 3A and Supplementary Fig. S4), indicating that the slicer activity of Ptiwi09 exerts an essential function during sexual events.
Figure 3.
The slicer activity of Ptiwi09 is required for the removal of the passenger strand. (A). Production of sexual progeny of non-injected cells and cells transformed with 3XFLAG-HA-PTIWI09 (PTIWI09WT) or 3XFLAG-HA-PTIWI09DA (PTIWI09DA), following PTIWI01/09 or ICL7 (CTL) RNAi. PTIWI09WT transformants #1, #2 and #3 contain respectively 0.3 copies of the transgene per haploid genome (cphg), 8.4 cphg and 33 cphg. PTIWI09DA transformants #1 and #2 contain, respectively, 9.6 and 57 cphg. The total number of cells analyzed for each condition and the number of independent experiments (in parentheses) are indicated. (B). Western blot analysis of whole-cell extracts from non-injected (NI) cells and cells expressing a 3XFLAG-HA-Ptiwi09 (PTIWI09WT #2) fusion protein or a mutated 3XFLAG-HA-Ptiwi09DA (PTIWI09DA #2) fusion protein, with FLAG antibodies to detect 3XFLAG-HA-Ptiwi09WT or 3XFLAG-HA-Ptiwi09DA and alpha-tubulin antibodies for normalization. Whole cell extracts were done at T0, following PTIWI01/09 or ICL7 (CTL) RNAi. (C). FLAG immunostaining at T0 of cells transformed with 3XFLAG-HA-PTIWI09 (PTIWI09WT #2) or 3XFLAG-HA-PTIWI09DA (PTIWI09DA #2), following PTIWI01/09 RNAi. Representative confocal images are displayed. Overlay of Z-projections are presented. Scale bar is 10 μm. (D). Analysis of 25-nt scnRNAs at T0 in non-injected (NI) cells and cells transformed with 3XFLAG-HA-PTIWI09 transgenes (PTIWI09WT #2 or PTIWI09DA #2) in PTIWI01/09 RNAi conditions. Proportion of 25-nt scnRNAs with the 5′-UNG signature, characteristics of the guide strand (black), or those that do not start with a 5′-U (A, G or C: V) and end with a CNANN-3′ signature, characteristics of the passenger strand (grey).
Comparable amounts of Ptiwi09WT and Ptiwi09DA proteins were detected by western blot analysis at T0 (Fig. 3B), indicating that the slicer activity is not required for the accumulation of Ptiwi09. Yet the mutant protein displays a different localization compared to the control (Fig. 3C and Supplementary Fig. S4). Indeed, immunofluorescence experiments with FLAG antibodies indicated that the Ptiwi09WT protein localizes in the cytoplasm and in the maternal MAC at T0, as observed for the GFP-Ptiwi09 fusion protein [31]. In contrast, the Ptiwi09DA mutant protein is detected in the cytoplasm throughout autogamy and does not strongly accumulate in the maternal MAC. This indicates that the slicer activity is required for normal Ptiwi09 accumulation in the maternal MAC.
We analyzed the expression of scnRNAs in the absence of the slicer activity of Ptiwi09 and found that scnRNAs accumulate at T0 in the slicer mutant and in wildtype, indicating that the slicer activity is not required for the biogenesis of scnRNAs (Supplementary Fig. S4). This is consistent with previous reports showing that the Dcl2 and Dcl3 proteins are responsible for scnRNA production [2, 4, 46]. We did not detect shorter small RNAs products that would indicate they are degraded in the slicer mutant (Supplementary Fig. S4). scnRNA duplexes are produced by the Dicer-like enzymes with 2-nt 3′ overhangs at both ends [46]. 5′-UNG scnRNAs, or ‘guide’ strands, are preferentially stabilized (70% of the population), while the complementary CNA of the other strand, or ‘passenger’ strands, are less abundant in the scnRNA population (10%) in wild type conditions (Fig. 3D and Supplementary Fig. S4). When the slicer Ptiwi09 mutant protein is expressed at T0, we found that the proportion of 5′-UNG scnRNAs is reduced, and that of the passenger strand is increased, whether or not the endogenous Ptiwi01/09 proteins are depleted (Fig. 3D and Supplementary Fig. S4). We conclude that the slicer activity is used for removal of the passenger strand after loading duplex scnRNAs, and this would allow the formation of functional scnRNA-Ptiwi01/09 complexes. Indeed, catalytically active Argonaute proteins use their slicer activity to cleave the passenger strand of perfect or nearly perfect duplexes generated by Dicer [47–49]. Thus, similarly to the Tetrahymena Twi1p protein [50], Ptiwi09 slicer activity appears to be essential for strand selection, and precludes the possibility to examine the effects of the slicer activity in downstream steps.
PRC2-Ezl1 core complex and its associated cofactors are required for scnRNA selection
ScnRNA selection, which results in the enrichment in MIC-scnRNAs, is thought to occur in the maternal MAC during MIC meiosis (Fig. 1). Proteins involved in this process would thus be localized in the maternal MAC at this developmental time. While the PRC2-Ezl1 complex catalyzes deposition of H3K27me3 and H3K9me3 in the new MAC, it is first localized in the maternal MAC during MIC meiosis where it catalyzes solely H3K27me3 [19–22]. To assess whether PRC2-Ezl1 is involved in the selection process, we sequenced small RNA populations during autogamy at three different time points (T0; T10 and T35) in cells depleted for the PRC2-Ezl1 core components Ezl1 and Suz12.like and for the accessory proteins Rf2, Eap1 and Rf4 (Fig. 4A and Supplementary Fig. S5). As previously described upon depletion of PRC2-Ezl1 components [19], the 25-nt scnRNAs are produced from the MIC genome at the beginning of the sexual cycle (T0) with the same proportions of MAC, OES and IES scnRNAs as in control RNAi conditions. In contrast, depletion of the scnRNA biogenesis factors Dcl2/3 leads to the loss of scnRNAs (Fig. 4A) [4, 14]. However, as development proceeds (T10 and T35), MIC-specific scnRNAs (OES + IES) are not enriched over MAC-specific scnRNAs in PRC2-Ezl1-depleted cells, indicating an absence of MIC-specific scnRNA selection (Fig. 4A and B and Supplementary Fig. S5). Thus, PRC2-Ezl1 components are broadly required for MAC-scnRNA degradation. This result is in agreement with a previous study showing scnRNA selection defects at a late developmental stage during autogamy upon depletion of the PRC2 core subunit Caf1 [51], as well as a more recent report similarly implicating Ezl1 and Rf4 [20]. In cells depleted for Ezl1, Suz12.like, Caf1, and Rf2, the PRC2–Ezl1 complex collapses [19, 20]. Therefore, we conclude that PRC2-Ezl1 function in scnRNA selection requires the integrity of the complex.
Figure 4.
PRC2-Ezl1 core complex and its associated cofactors are required for scnRNA selection. (A). Left: Normalized counts of 25 nt-long scnRNAs at T0 in ICL7 (CTL), EZL1, SUZ12.like, RF2, EAP1, RF4, and DCL2/3 RNAi conditions. Bar plots show the reads that are unmapped or match the MAC genome, IESs and OES reference sequences. Right: Proportion of 25nt-long scnRNAs that match the MAC genome, IESs and OES reference sequences at different time points of the sexual cycle in control (CTL), EZL1, SUZ12.like, RF2, EAP1, and RF4 RNAi conditions. Genome: proportion of each category (MAC, OES, IES) in the MIC genome. (B). Representative genomic region (same as in Fig. 2B) depicting 25 nt scnRNAs at T35 in ICL7 (CTL), EZL1, SUZ12.like, RF2, EAP1, and RF4 RNAi. DNA-seq coverage (bin = 1) for the MIC and the MAC DNA samples on the MIC genome (NODE_1273_length_39 901_cov_28.471216 between 10 and 40 kb), and IES and TE annotations are indicated. ChIP-seq enrichment (deeptools bamCompare bin = 5) of H3K9me3 and H3K27me3 in PGM RNAi conditions (data from [19]) is displayed. Scaled scnRNA coverages (bin = 100) were calculated on equivalent read datasets (15e6 sRNA between 20 and 30-nt long) for all samples except RF2, for which all available reads (3.2e6) were considered. (C). GFP-Ptiwi09 localization in CTL, DCL2/3 and EZL1 RNAi conditions in meiosis and during MAC development. Representative confocal images are displayed. Overlay of Z-projections are presented. Dashed white circles indicate the new developing MACs. Scale bar is 10 μm. Note that, in contrast to control cells, basal bodies on the cell cortex are not decorated with GFP fluorescence in Ezl1-depleted cells. (D). Sequencing analysis of scnRNA populations after Ptiwi09 IP in EZL1 KD. Left: Western blot analysis of whole cell extracts at T0 and T20 from cells expressing the 3XFLAG-HA-Ptiwi09 protein (Pt09WT) or wild-type cells (WT), before (input) and after FLAG IP (Ptiwi09 IP) upon EZL1 RNAi or no RNAi (CTL). Anti-HA antibodies are used to detect Ptiwi09 and alpha-tubulin for normalization. Right: Proportion of 25-nt-long scnRNAs that match the MAC genome, IES and OES reference sequences after Ptiwi09 IP at T0 and T20 in EZL1 RNAi. Genome: proportion of each category (MAC, OES, IES) in the MIC genome.
Importantly, because we still detect maternal MAC ncRNA production in the EZL1 knockdown experiments, the lack of scnRNA selection cannot be explained by a lack of maternal MAC ncRNA transcription (Supplementary Fig. S5). To determine whether impairment of scnRNA selection could be explained by defects in the localization of scnRNAs, we analyzed the localization of the scnRNA-binding protein Ptiwi09 with a GFP-PTIWI09 transgene. In control cells, the GFP-Ptiwi09 fusion protein was detected in the cytoplasm and in the maternal MAC during MIC meiosis then localized exclusively in the developing MAC, as expected [31](Fig. 4C). In contrast, in Dcl2/3-depleted cells, in which scnRNA biogenesis is abrogated, the GFP-Ptiwi09 fusion protein was no longer detected during MIC meiosis and MAC development (Fig. 4C). This observation is consistent with unloaded Argonaute proteins being generally unstable [52, 53]. Thus, GFP-Ptiwi09 localization likely reflects scnRNA localization. In Ezl1-depleted cells, where scnRNA selection is defective, GFP-Ptiwi09 still localized in the cytoplasm and in the maternal MAC during MIC meiosis, and then in the new developing MACs, as in control conditions (Fig. 4C). We conclude that the lack of MIC-specific scnRNA selection in the absence of Ezl1 does not appear to be associated with the mis-localization of scnRNAs.
To assess whether unselected scnRNAs still bind to Ptiwi09, we performed Ptiwi09 IP in EZL1 RNAi conditions (Fig. 4D) followed by small RNA sequencing, when Ptiwi09 is present in the maternal MAC (T0) and when it is in the new developing MACs (T20). 25 nt-scnRNAs are present at both time points (Supplementary Fig. S5) and the same proportions of MIC- and MAC scnRNAs are detected in the IP at both time points (Fig. 4D). This indicates that the non-selected scnRNAs that accumulate in Ezl1-depleted cells are bound to Ptiwi09.
H3K27me3 in the maternal MAC is dispensable for scnRNA selection
PRC2-Ezl1 catalyzes H3K27me3 in the maternal MAC [21, 22]. To directly assess whether H3K27me3 is required for scnRNA selection, we sought to identify an experimental condition where H3K27me3 no longer accumulates in the maternal MAC. We designed a mutant H3 transgene carrying a lysine to methionine substitution in position 27 (H3K27M), a mutation described to cause a dominant negative effect preventing H3K27me3 accumulation at the endogenous H3 by inhibiting the enzymatic activity of PRC2 and resulting in genome-wide depletion of H3K27me3 [54, 55].
Five H3 proteins are encoded in the Paramecium tetraurelia MAC genome. H3P1 and H3P3 are H3.1 histones and are constitutively expressed during the life cycle [56]. We transformed Paramecium cells with histone H3P1 or H3P3 transgenes bearing K27M mutations and induced sexual events. Transformation with H3P1K27M or H3P3K27M transgenes abolished H3K27me3 accumulation in the maternal MAC without affecting H3K27me3 deposition in the MIC during meiosis I or in new developing MACs based on immunofluorescence experiments (Fig. 5A, Supplementary Fig. S6). This does not appear to be due to a mislocalization of Ezl1 because in cells co-transformed with H3P3K27M and FLAG-EZL1 transgenes, the fusion protein FLAG-Ezl1 localized in the MIC and the maternal MAC during meiosis I then in new developing MACs (Supplementary Fig. S6), as was previously reported for a GFP-EZL1 transgene [22]. The H3P1K27M or H3P3K27M transgenes had no major effect on the accumulation of H3K9me3 in the new MAC (Supplementary Fig. S6). Testing elimination of some IESs did not reveal any DNA elimination defect (Supplementary Fig. S6) and the survival of the sexual progeny was not affected (Fig. 5B), indicating that deposition of H3K27me3 in the maternal MAC did not appear to be required for IES elimination and viability of the sexual progeny. We sequenced the small RNA populations at two time points in H3P1K27M transformed cells (Supplementary Fig. S6). The scnRNAs were produced normally at early developmental stages (T0) and were enriched in MIC-specific sequences at later stages (T20) (Fig. 5C and Supplementary Fig. S6). Thus, the scnRNA selection process does not appear to require H3K27 trimethylation in the maternal MAC. Consistent with this idea, H3K27me3 deposition is not affected in the maternal MAC of Eap1-depleted cells [19], yet scnRNA selection is impaired (Fig. 4A and B), suggesting that PRC2-Ezl1 integrity is critical for scnRNA selection for other reasons.
Figure 5.
Ezl1 histone methyltransferase activity is not required for scnRNA selection. (A). H3K27me3 immunostaining of non-injected cells (NI), and of cells transformed with H3P1K27M or H3P3K27M during MAC development. Representative confocal images are displayed. Overlay of Z-projections are presented. Dashed white circles indicate the new developing MACs. Note the abnormal accumulation of H3K27me3 in the new MICs in cells transformed by H3P1K27M or H3P3K27M, or with GFP-EZL1H526A mutant transgene (panel D). The same abnormal localization was observed in RNAi-mediated Ezl1-depleted cells [21]. Scale bar is 10 μm. (B). Production of sexual progeny in cells transformed with H3P1K27M or H3P3K27M. The total number of analyzed cells and the number of independent experiments (in parenthesis) are indicated. (C). Left: Normalized counts of 25-nt scnRNAs at T0 in control (CTL) RNAi, in cells transformed with H3P1K27M, and in cells transformed with GFP-EZL1H526A upon endogenous EZL1 RNAi. Bar plots show the reads that are unmapped or match the MAC genome, IES, and OES reference sequences. Right: Proportion of 25-nt scnRNAs that match the MAC genome, IES and OES reference sequences at T0 and T20-35 of the sexual cycle. Genome: proportion of each category (MAC, OES, IES) in the MIC genome. (D). H3K27me3 immunostaining and GFP-EZL1H526A localization during new MAC development in cells transformed with GFP-EZL1H526A upon endogenous EZL1 RNAi. Representative confocal images are displayed. Overlay of Z-projections of magnified views of Hoechst staining, H3K27me3-specific antibodies and GFP signal are presented. Dashed white circles indicate the new developing MACs. Note the abnormal accumulation of H3K27me3 in the new MICs in cells transformed with the GFP-EZL1H526A mutant transgene. Scale bar is 10 μm.
Ezl1 histone methyltransferase activity is not required for scnRNA selection
Our data indicate that PRC2-Ezl1, but not its signature H3K27me3 in the maternal MAC, is necessary for scnRNA selection. We hypothesized that PRC2-Ezl1 may function in scnRNA selection by methylating substrates other than H3K27. To test this possibility, we used a catalytic mutant of Ezl1. The H526A mutation in the SET domain of Ezl1 was previously shown to abolish Ezl1 catalytic activity in vitro and in vivo [21]. In cells expressing the RNAi-resistant GFP-EZL1H526A mutant transgene and silenced for the endogenous EZL1 gene, H3K27me3 no longer accumulated in the maternal MAC nor in the new developing MACs, even though the mutant Ezl1 protein still correctly localized (Fig. 5D and Supplementary Fig. S6). In this context, we performed small RNA sequencing. Contrary to our expectations, scnRNAs were produced normally at the beginning of the sexual cycle (T0) and MIC-specific scnRNAs were correctly selected at later stages (T35) (Fig. 5C and Supplementary Fig. S6). Thus, we conclude that the function of PRC2-Ezl1 in scnRNA selection is independent of its histone methyltransferase activity.
The PRC2 cofactor Rf4 is required for the interaction between Ptiwi09 and Gtsf1 in the maternal MAC
Previous work showed that the Ptiwi09 protein interacts in the maternal MAC with the tandem zinc finger protein Gtsf1, which is essential for scnRNA selection, and PRC2 [24, 25]. We therefore reasoned that the interaction between Ptiwi09 and Gtsf1 might be mediated by PRC2. To test this hypothesis, we compared the proteins interacting with Gtsf1 in control conditions with those found in cells depleted for Rf4, the PRC2 cofactor known to bridge PRC2 to Ptiwi09 in the new developing MAC [19]. The localization in the maternal MAC of the 3xFLAG-HA-Gtsf1 fusion protein in control and Rf4-depleted cells was confirmed by immunofluorescence (Fig. 6A). The Gtsf1 nuclear signal appears lower in RF4 RNAi compared to control RNAi, based on immunofluorescence and western blot analysis of nuclear extracts (Fig. 6A and Supplementary Fig. S7). Yet, Rf4 is not required for Gtsf1 expression, given the similar levels of the Gtsf1 fusion protein in whole cells extracts from control and Rf4-depleted cells (Supplementary Fig. S7). Immunoprecipitation (IP) of the nuclear Gtsf1 fusion protein, in 3 replicates for the control and Rf4-depleted cells, respectively (Materials and Methods), was followed by quantitative label-free mass spectrometry (Fig. 6B). Statistical analyses revealed 161 differential proteins (out of 2613 identified proteins) in Rf4-depleted cells compared with control (fold change ≥ 2; P value ≤ 0.05; unique peptide ≥ 2) (Fig. 6B; Supplementary Fig. S7). We recovered Gtsf1 in both conditions, as expected (Fig. 6B). Among the 64 proteins that were significantly depleted upon RF4 RNAi, not only did we find Rf4 as expected, but we also found PRC2 components (Ezl1, Eed, and Suz12-like) and cofactors (Rf2 and Eap1), Ptiwi09 and its interacting proteins (Ptiwi01, Ptiwi03 and Ema1a) (Fig. 6B) [19]. This result is in agreement with a previous study reporting that PRC2 composition at early stages of autogamy includes Ezl1, Caf1, Eed, Suz12-like, Rf2, and Rf4 [20]. Therefore, we conclude that depletion of the PRC2-Ezl1 cofactor Rf4 disrupts the interaction between Gtsf1, Ptiwi09, and PRC2.
Figure 6.
The PRC2 cofactor Rf4 is required for the interaction between Ptiwi09 and Gtsf1 in the maternal MAC. (A). Gtsf1 IP experiments upon ICL7 (CTL) or RF4 RNAi. Anti-FLAG immunostaining of cells transformed with a 3XFLAG-HA-GTSF1 transgene are shown, at the same time point as the preparation of nuclear extracts for FLAG IP (T0) followed by quantitative mass spectrometry. Scale bar, 10 μm. (B). Volcano plot of the quantitative label-free mass-spectrometry analysis of 3xFLAG-HA-Gtsf1 affinity purification upon RF4 versus ICL7 (CTL) RNAi. Three replicates were analyzed for each condition. Significantly enriched proteins in Gtsf1 IP upon RF4 RNAi over Gtsf1 IP upon CTL RNAi are shown in dark grey (statistical t-test), Fold Change ≥ 2, and P-value ≤ 0.05, only proteins identified with more than two unique peptides are considered. Gtsf1 is highlighted in blue. PRC2 components and Ptiwi09-associated proteins are highlighted in yellow. (C). Ptiwi09 IP experiments upon CTL (no RNAi) or RF4 RNAi. Anti-FLAG immunostaining of cells transformed with a 3XFLAG-HA-PTIWI09 transgene are shown, at the same time point as the preparation of nuclear extracts for FLAG IP (T0). Scale bar, 10 μm. (D). Western blot analysis of nuclear extracts from cells expressing the 3XFLAG-HA-Ptiwi09 protein (Ptiwi09WT) or 3XFLAG-HA (CTL) before (input) or after IP. Anti-FLAG antibodies are used to detect Ptiwi09, anti-Ezl1 to detect Ezl1, and anti-histone H3 antibodies for normalization. (E). Working model. The presence of PRC2 core subunits and cofactors in the maternal MAC but not the deposition of H3K27me3 appears to be required for the degradation of MAC-scnRNAs. A histone methyltransferase-independent function of PRC2-Ezl1 in the degradation of MAC-scnRNAs may contribute to the stabilization of the interaction between MAC-scnRNA/Ptiwi09 complexes and nascent non-coding RNAs. Through an architectural role, PRC2-Ezl1 would bring together Ptiwi09 and Gtsf1 through the Rf4 protein, leading to Ptiwi09 ubiquitylation and degradation via the proteasome.
To assess whether the interaction between Gtsf1 and PRC2-Ezl1 might involve RNA, we examined Gtsf1 IP after control and RNase I treatment, in two independent biological replicates (Supplementary Fig. S7). We detected Ezl1 in the Gtsf1 IP but not in the control IP, as expected. After treatment with RNase I, Ezl1 was still found in the Gtsf1 IP, consistent with a RNA-independent interaction between Gtsf1 and PRC2-Ezl1. This observation is in agreement with previous work showing that the interaction between Gtsf1 and Ptiwi09 is RNA-independent [24].
Given that Rf4 impairs Gtsf1 localization in the maternal MAC, we examined the localization and levels of Ptiwi09 fusion protein in control and Rf4-depleted cells. Western blot analysis of whole cells extracts and immunostaining indicated that RF4 RNAi has no effect on Ptiwi09 (Supplementary Fig. S7).
To assess whether Rf4 is required for the interaction between Ptiwi09 and PRC2-Ezl1 in the maternal MAC, we performed Ptiwi09 IP in control, and RF4 RNAi conditions and examined the presence of Ezl1 by western blot. Cells expressing the 3xFLAG-HA-Ptiwi09 fusion protein in the maternal MAC were used for the IP of the FLAG tag from nuclear extracts (Fig. 6C and Supplementary Fig. S7). Western blot analysis indicated that Ezl1 is found in the control Ptiwi09 IP, as expected, while it is not detected in Rf4-depleted cells (Fig. 6D). Consistently, we found that Ezl1 is no longer detected in the maternal MAC upon RF4 RNAi (Supplementary Fig. S7). Thus, the interaction between Ptiwi09 and Ezl1 is mediated, directly or indirectly, by the PRC2 cofactor Rf4 in the maternal MAC, as it is the case in the new developing MAC [19].
Discussion
This study provides a comprehensive description of scnRNA populations during the sexual process of autogamy in Paramecium tetraurelia (Fig. 2). Using cells arrested in meiosis I, we demonstrate that scnRNAs begin accumulating prior to or during early meiosis I. Covering the entire sexual cycle of autogamy with sRNA sequencing at 11 time points, we determined, at an unprecedented resolution, that scnRNAs are produced from both strands for the whole MIC genome in Paramecium tetraurelia, with no bias in scnRNA coverage between IESs and MAC-destined sequences. The mechanism underlying this genome-wide transcription is being thoroughly studied but remains elusive [57, 58].
After scnRNAs are produced from the entire MIC genome, genome-wide degradation of MAC-scnRNAs occurs; the earliest time at which degradation is detected corresponds to the onset of DNA elimination (Fig. 2). As a result of this specific degradation, the subpopulation of scnRNAs corresponding to MIC-specific sequences is selected and persists until the end of MAC development. The dynamics of scnRNAs we observed during Paramecium sexual events contrasts with that described in another ciliate, Tetrahymena thermophila [5]. Indeed, during Tetrahymena conjugation, scnRNAs are preferentially produced from eliminated sequences, resulting in a biased production of scnRNAs from MIC-specific sequences. In addition, there is a rather limited selection process. In contrast, the massive degradation of MAC-scnRNAs (>70 Mb of complexity) makes Paramecium a very good model to decipher the mechanisms involved in scnRNA selection.
Here, we showed that RNAi-mediated knockdown of genes encoding core components of the PRC2-Ezl1 core complex (Ezl1 and Suz12.like) and its associated cofactors (Rf2, Rf4, and Eap1) prevented scnRNA selection (Fig. 4). Thus, the PRC2-Ezl1 complex is involved in MAC-scnRNA degradation, in agreement with previous studies [20, 51]. Ezl1 has been shown to transiently localize in the maternal MAC, where the selection process likely occurs, before it is detected in the new developing MACs [22]. Likewise, Ezl1-interacting partners are also transiently localized in the maternal MAC before they localize in the new MAC [19, 20]. Wang et al. [20] report that the PRC2 complex exhibits a similar composition in the maternal MAC and the new MAC. Consistent with these observations, pull down of the Gtsf1 protein localized in the maternal MAC (Fig. 6) identified known core PRC2 components and cofactors, including Eap1 which was not detected in [20]. We thus conclude that PRC2-Ezl1 function in scnRNA selection requires the integrity of the complex (Fig. 4), its interaction with Ptiwi09 via Rf4 (Fig. 6), and the accessory factor Eap1 (Fig. 4), whose exact role remains to be determined.
Monitoring the localization of the scnRNA-binding protein Ptiwi09 indicated that the scnRNA-Ptiwi09 complexes are correctly localized, despite the lack of selection in PRC2-Ezl1 depleted cells (Fig. 4). Consistently, Ptiwi09 proteins present in the new MAC are loaded with both MAC- and MIC-scnRNAs in Ezl1-depleted cells (Fig. 4). Thus, in absence of selection, both MAC- and MIC- scnRNAs are present in the new MAC, a situation that is radically different from wild-type cells, where only MIC-scnRNAs are present in the new MAC and guide histone modifications at target DNA sequences [19]. We conclude that the mere presence of scnRNA-Ptiwi09 complexes in the new MAC does not appear to be sufficient to trigger DNA elimination. This confirms that the availability of PRC2-Ezl1 components in the new MAC is also critical for DNA elimination to occur, as implied by recent work from us and others [19, 20].
We showed that the specific abrogation of H3K27me3 in the maternal MAC, the signature of PRC2-Ezl1, had no impact on scnRNA selection, demonstrating that this mark does not play a role in this process (Fig. 5). To suppress H3K27me3, we expressed dominant negative forms of the H3.1 protein in which lysine 27 is replaced by a methionine (H3P1K27M and H3P3K27M) (Fig. 5). These mutant proteins prevented the accumulation of H3K27me3 in the maternal MAC, but not in the new MAC, even though the mutated H3 and the PRC2 catalytic subunit Ezl1 were detected in both nuclei (Supplementary Fig. S6). The reason why the dominant negative effect of the H3.1 mutant protein is restricted to the maternal MAC is not known. The molecular mechanism by which H3K27M reduces H3K27me2/3 in cells remains a matter of debate more generally [59, 60]. Post-translational modifications of the same H3 tail, both proximal and distal to K27M, can greatly diminish the inhibition of PRC2 [61]. Furthermore, the H3K27M-to-PRC2 stoichiometric ratio is believed to dictate the degree of PRC2 inhibition [62]. Thus, we speculate that H3.1 mutant proteins are unable to exert a dominant negative effect in the developing MACs because (i) H3P1 and H3P3 are not the substrates of PRC2-Ezl1 in the developing MACs; other H3 proteins are expressed during MAC development [56]; (ii) H3P1 and H3P3 in the developing MACs bear post-translational modifications that impede the dominant negative effect of the H3K27M mutation; or (iii) the amounts of PRC2 relative to H3K27M in the developing MACs are too high to be sensitive to H3K27M-mediated inhibition of PRC2 activity.
We did not observe any apparent consequence of the lack of H3K27me3 in the maternal MAC on DNA elimination and sexual progeny production. Therefore, we conclude that H3K27me3 deposition in the maternal MAC is not required for DNA elimination or sexual progeny production. At this point, the role of H3K27me3 in the maternal MAC remains elusive. If H3K27me3 controls gene expression, it might not involve the expression of genes necessary for the completion of sexual events.
Our findings prompted us to investigate whether PRC2-Ezl1 might function through a different mechanism, independent of H3K27 methylation. Like many histone-modifying enzymes, the mammalian PRC2 has non-histone substrates, including the nuclear receptor Retinoid-Related Orphan Receptor Alpha [63], the transcription factor GATA4 [64], the facultative PRC2 cofactors JARID2 [65] and PALI1 [66], and the RNAPII transcription elongation factor Elongin A [67]. To assess whether PRC2-Ezl1 function in the scnRNA selection process may be decoupled from its methyltransferase activity, we tested a catalytic dead mutant of Ezl1, carrying a histidine-to-alanine substitution in its catalytic SET domain (H526A) [21]. We showed that scnRNA selection occurred normally in the EZL1H526A catalytic mutant, demonstrating that PRC2-Ezl1 has a function independent of its histone methyltransferase activity (Fig. 5). One example of a non-catalytic function of the mammalian PRC2-Ezh1 was previously shown in an experimental setup where the artificial recruitment of a catalytically inactive Ezh1 led to transcriptional repression of a reporter via chromatin compaction [68].
The scanning model proposes that the discrimination between MAC- and MIC- scnRNA/Ptiwi09 complexes results from the pairing of scnRNAs with complementary nascent non-coding transcripts produced by the maternal MAC. Only MAC-scnRNAs are degraded. One potential mechanism involves the post-transcriptional modification 2′-O-methylation at the 3′ terminus by the methyltransferase HEN1. Tetrahymena scnRNAs, like piRNAs in metazoans, are 2′-O-methylated at their 3′ ends [69, 70]. Hen1p-mediated 2′-O-methylation was reported to stabilize scnRNAs. Loss of Hen1p causes a reduction in the abundance and length of scnRNAs, defects in programmed genome elimination, and low production of sexual progeny [70]. One could imagine that the scanning process leads to the differential 2′-O-methylation of MAC- and MIC- scnRNAs: MIC-scnRNAs would be 2′-O-methylated, which would protect them from degradation, while MAC-scnRNAs would not. The re-analysis of published data [38] contradicts this hypothesis, however, because HEN1 KD during Paramecium autogamy did not impair scnRNA selection (Supplementary Fig. S8). Thus, 2′-O-methylation of scnRNAs does not appear to be involved in the selection process.
Only MAC-scnRNAs display extensive complementarity with the non-coding RNA targets. We propose that the pairing between MAC-scnRNAs and MAC non-coding RNA leads to the degradation of the Ptiwi09 protein and its bound scnRNAs. In agreement with this model, recent work provided evidence that Ptiwi09 is ubiquitylated in a Gtsf1-dependent manner and Ptiwi09 degradation involves the proteasome [24]. In essence, the proposed mechanism is similar to microRNA target-directed degradation, in which pairing of the miRNA to the cognate RNA leads to ubiquitination and proteasomal degradation of Ago and of the miRNA [71, 72]. Based on our findings, we propose that the non-histone methyltransferase function of PRC2-Ezl1 in the degradation of MAC-scnRNAs contributes to the stabilization of the interaction between MAC-scnRNA/Ptiwi09 complexes and nascent non-coding RNAs.
What could be the role of PRC2 in scnRNA selection? We show that the PRC2 cofactor Rf4 interacts, directly or indirectly, with Gtsf1 and Ptiwi09 (Fig. 6B), and that Rf4 controls Gtsf1 nuclear localization (Fig. 6A and Supplementary Fig. S7), but not that of Ptiwi09 (Supplementary Fig. S7). We envision two non-exclusive hypotheses for the role of PRC2 that takes into account its interaction with Gtsf1 and Ptiwi09. In the first hypothesis, PRC2 may bring together Ptiwi09 and Gtsf1, suggestive of an architectural role for PRC2-Ezl1. The physical interaction mediated by the PRC2 cofactor Rf4 might be necessary for the nuclear anchoring of Gtsf1 in the maternal MAC. The impaired nuclear localization of Gtsf1 in Rf4-depleted cells would thus be a consequence of the loss of interaction between Rf4 and Gtsf1 (Fig. 6B). If the physical interaction between Ptiwi09 and Gtsf1 requires Rf4, it raises the possibility that Ptiwi09 may not interact directly with Gtsf1, unlike the direct binding of PIWI and GTSF1 observed in Drosophila and mice [73, 74]. In the second hypothesis, the PRC2 cofactor Rf4 may be important to trigger a conformational change or a post-translational modification (other than lysine methylation) required for Gtsf1-Ptiwi09 interaction. Loss of this interaction, in the absence of Rf4, would impair Gtsf1 anchoring in the nucleus. Whether Rf4, Gtsf1 and Ptiwi09 interact directly or indirectly, and how these proteins interact with each other will require further investigation.
In addition to its catalytic role in the new MAC, where it works in concert with Ptiwi09 to deposit histone marks on TEs [19], our study reveals that PRC2 performs a distinct, histone methyltransferase-independent function in the maternal MAC, where, in association with Ptiwi09 and Gtsf1, it participates in scnRNA degradation (Fig. 6E). Future work will be needed to elucidate the mechanisms by which PRC2-Ezl1 exerts its function independently of its histone methyltransferase activity and whether they are analogous to those involved in microRNA target-directed degradation.
Supplementary Material
Acknowledgements
We wish to thank the members of the Duharcourt lab for fruitful discussions, and Sébastien Léon for his insightful advice. We thank Daniel Holoch and Eric Meyer for comments on the manuscript. We acknowledge the ImagoSeine facility, member of the FranceBioImaging infrastructure supported by the ANR-10-INSB-04, and the sequencing and bioinformatics expertise of the I2BC High-throughput sequencing facility, supported by France Génomique (funded by the French National Program “Investissement d’Avenir” ANR-10-INBS-09).
Author contributions: C.M.-P. (Conceptualization [equal], Investigation [equal], Methodology [equal], Validation [equal], Visualization [equal], Writing – original draft [equal]), O.C. (Conceptualization [equal], Investigation [equal], Validation [equal], Visualization [equal], Writing – original draft [equal], Writing – review & editing [equal]), M.G. (Investigation [equal], Validation [equal], Visualization [equal], Writing – original draft [equal], Writing – review & editing [equal]), A.V. (Investigation [equal], Resources [equal]), A.F. (Investigation [equal]), A.H. (Investigation [equal]), M.L.-A. (Investigation [equal], G.C. (Data curation [equal], Formal Analysis [equal], Investigation [equal]), O.A. (Conceptualization [equal], Data curation [equal], Formal Analysis [equal], Investigation [equal], Visualization [equal], Writing – original draft [equal], Writing – review & editing [equal]), S.D. (Conceptualization [lead], Data curation [equal], Funding acquisition [lead], Investigation [equal], Supervision [lead], Visualization [equal], Writing – original draft [lead], Writing – review & editing [lead]).
Contributor Information
Caridad Miró-Pina, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Olivia Charmant, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Marina Giovannetti, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Augustin de Vanssay, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Andrea Frapporti, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Adeline Humbert, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Maoussi Lhuillier-Akakpo, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Guillaume Chevreux, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Olivier Arnaiz, Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette cedex 91198, France.
Sandra Duharcourt, Université Paris Cité, CNRS, Institut Jacques Monod, Paris F-75013, France.
Supplementary data
Supplementary data is available at NAR online.
Conflict of interest
None declared.
Funding
This work was supported by the Centre National de la Recherche Scientifique, the Agence Nationale pour la Recherche (ANR) [project “SELECTION” ANR-23-CE12-0027 to S.D. and G.C.]; [project “POLYCHROME” ANR-19-CE12-0015 to S.D. and O.A.] and [project “MODIFICATION” ANR-25-CE12-7757 to S.D.]; the LABEX Who Am I? to S.D. (ANR-11-LABX-0071; ANR-11-IDEX-0005–02); the Fondation de la Recherche Médicale “Equipe FRM EQU202203014643” to S.D. C.M.P. was recipient of PhD fellowships from Université Paris Cité and Fondation ARC, and a LABEX Who Am I? transition postdoc fellowship, O.C. of PhD fellowships from Université Paris Cité and Fondation de la Recherche Médicale (FDT202204014919) and of a EUR G.E.N.E. transition postdoc fellowship (ANR-17-EURE-0013); Université Paris Cité IdEx #ANR-18-IDEX-0001 funded by the French Government through its “Investments for the Future” program), and M.G. of a PhD fellowship from Université Paris Cité. Funding to pay the Open Access publication charges for this article was provided by Agence Nationale pour la Recherche (ANR) project “SELECTION” ANR-23-CE12-0027.
Data availability
sRNA-seq datasets generated for this study have been deposited at the ENA with Study Accession numbers PRJEB59349 and PRJEB88919. The details can be found in Supplementary Table S2. Mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE [75] partner repository accession number PXD063336. The statistical data and R scripts used to generate the bioinformatic images, the original, uncropped blots and numerical values for each figure, the maps of the plasmids generated in this study, and the processed mass spectrometry data were deposited at https://doi.org/10.5281/zenodo.17154025. All other relevant data supporting the key findings of this study are available within the article and its Supplementary data file or from the corresponding author upon reasonable request.
References
- 1. Han J, Mendell JT MicroRNA turnover: a tale of tailing, trimming, and targets. Trends Biochem Sci. 2023; 48:26–39. 10.1016/j.tibs.2022.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Lepere G, Nowacki M, Serrano V et al. Silencing-associated and meiosis-specific small RNA pathways in Paramecium tetraurelia. Nucleic Acids Res. 2009; 37:903–15. 10.1093/nar/gkn1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Mochizuki K, Fine NA, Fujisawa T et al. Analysis of a piwi-related gene implicates small RNAs in genome rearrangement in tetrahymena. Cell. 2002; 110:689–99. 10.1016/S0092-8674(02)00909-1. [DOI] [PubMed] [Google Scholar]
- 4. Sandoval PY, Swart EC, Arambasic M et al. Functional diversification of dicer-like proteins and small RNAs required for genome sculpting. Dev Cell. 2014; 28:174–88. 10.1016/j.devcel.2013.12.010. [DOI] [PubMed] [Google Scholar]
- 5. Schoeberl UE, Kurth HM, Noto T et al. Biased transcription and selective degradation of small RNAs shape the pattern of DNA elimination in Tetrahymena. Genes Dev. 2012; 26:1729–42. 10.1101/gad.196493.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Singh DP, Saudemont B, Guglielmi G et al. Genome-defence small RNAs exapted for epigenetic mating-type inheritance. Nature. 2014; 509:447–52. 10.1038/nature13318. [DOI] [PubMed] [Google Scholar]
- 7. Betermier M, Duharcourt S Programmed rearrangement in ciliates: paramecium. Microbiol Spectr. 2014; 2: 10.1128/microbiolspec.MDNA3-0035-2014. [DOI] [PubMed] [Google Scholar]
- 8. Aury JM, Jaillon O, Duret L et al. Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia. Nature. 2006; 444:171–8. 10.1038/nature05230. [DOI] [PubMed] [Google Scholar]
- 9. Guérin F, Arnaiz O, Boggetto N et al. Flow cytometry sorting of nuclei enables the first global characterization of Paramecium germline DNA and transposable elements. BMC Genomics. 2017; 18:327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Sellis D, Guérin F, Arnaiz O et al. Massive colonization of protein-coding exons by selfish genetic elements in Paramecium germline genomes. PLoS Biol. 2021; 19:e3001309. 10.1371/journal.pbio.3001309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Swart EC, Denby Wilkes C, Sandoval PY et al. Identification and analysis of functional associations among natural eukaryotic genome editing components. F1000Res. 2017; 6:1374. 10.12688/f1000research.12121.1. [DOI] [Google Scholar]
- 12. Duharcourt S, Keller AM, Meyer E Homology-dependent maternal inhibition of developmental excision of internal eliminated sequences in Paramecium tetraurelia. Mol Cell Biol. 1998; 18:7075–85. 10.1128/MCB.18.12.7075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Duharcourt S, Butler A, Meyer E Epigenetic self-regulation of developmental excision of an internal eliminated sequence on Paramecium tetraurelia. Genes Dev. 1995; 9:2065–77. 10.1101/gad.9.16.2065. [DOI] [PubMed] [Google Scholar]
- 14. Furrer DI, Swart EC, Kraft MF et al. Two sets of piwi proteins are involved in distinct sRNA pathways leading to elimination of germline-specific DNA. Cell Rep. 2017; 20:505–20. 10.1016/j.celrep.2017.06.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Aronica L, Bednenko J, Noto T et al. Study of an RNA helicase implicates small RNA-noncoding RNA interactions in programmed DNA elimination in Tetrahymena. Genes Dev. 2008; 22:2228–41. 10.1101/gad.481908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Lepere G, Betermier M, Meyer E et al. Maternal noncoding transcripts antagonize the targeting of DNA elimination by scanRNAs in Paramecium tetraurelia. Genes Dev. 2008; 22:1501–12. 10.1101/gad.473008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Martienssen R, Moazed D RNAi and heterochromatin assembly. Cold Spring Harb Perspect Biol. 2015; 7:a019323. 10.1101/cshperspect.a019323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Maliszewska-Olejniczak K, Gruchota J, Gromadka R et al. TFIIS-dependent non-coding transcription regulates developmental genome rearrangements. PLoS Genet. 2015; 11:e1005383. 10.1371/journal.pgen.1005383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Miró-Pina C, Charmant O, Kawaguchi T et al. Paramecium Polycomb repressive complex 2 physically interacts with the small RNA-binding PIWI protein to repress transposable elements. Dev Cell. 2022; 57:1037–52. 10.1016/j.devcel.2022.03.014. [DOI] [PubMed] [Google Scholar]
- 20. Wang C, Solberg T, Maurer-Alcalá XX et al. A small RNA-guided PRC2 complex eliminates DNA as an extreme form of transposon silencing. Cell Rep. 2022; 40:111263. 10.1016/j.celrep.2022.111263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Frapporti A, Miró Pina C, Arnaiz O et al. The Polycomb protein Ezl1 mediates H3K9 and H3K27 methylation to repress transposable elements in Paramecium. Nat Commun. 2019; 10:2710. 10.1038/s41467-019-10648-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Lhuillier-Akakpo M, Frapporti A, Denby Wilkes C et al. Local effect of enhancer of zeste-like reveals cooperation of epigenetic and cis-acting determinants for zygotic genome rearrangements. PLoS Genet. 2014; 10:e1004665. 10.1371/journal.pgen.1004665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Allen SE, Hug I, Pabian S et al. Circular concatemers of ultra-short DNA segments produce regulatory RNAs. Cell. 2017; 168:990–9. 10.1016/j.cell.2017.02.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Charmant O, Gruchota J, Arnaiz O et al. The PIWI-interacting protein Gtsf1 controls the selective degradation of small RNAs in Paramecium. Nucleic Acids Res. 2025; 53:gkae1055. 10.1093/nar/gkae1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Wang C, Lyv L, Solberg T et al. GTSF1 is required for transposon silencing in the unicellular eukaryote Paramecium tetraurelia. Nucleic Acids Res. 2024; 52:13206–23. 10.1093/nar/gkae925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Marker S, Carradec Q, Tanty V et al. A forward genetic screen reveals essential and non-essential RNAi factors in Paramecium tetraurelia. Nucleic Acids Res. 2014; 42:7268–80. 10.1093/nar/gku223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Beisson J, Betermier M, Bre MH et al. Maintaining clonal Paramecium tetraurelia cell lines of controlled age through daily reisolation. Cold Spring Harb Protoc. 2010; 2010:pdb prot5361. 10.1101/pdb.prot5361. [DOI] [PubMed] [Google Scholar]
- 28. Beisson J, Betermier M, Bre MH et al. Mass culture of Paramecium tetraurelia. Cold Spring Harb Protoc. 2010; 2010:pdb prot5362. 10.1101/pdb.prot5362. [DOI] [PubMed] [Google Scholar]
- 29. Garnier O, Serrano V, Duharcourt S et al. RNA-mediated programming of developmental genome rearrangements in Paramecium tetraurelia. Mol Cell Biol. 2004; 24:7370–9. 10.1128/MCB.24.17.7370-7379.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Baudry C, Malinsky S, Restituito M et al. PiggyMac, a domesticated piggyBac transposase involved in programmed genome rearrangements in the ciliate Paramecium tetraurelia. Genes Dev. 2009; 23:2478–83. 10.1101/gad.547309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Bouhouche K, Gout JF, Kapusta A et al. Functional specialization of Piwi proteins in Paramecium tetraurelia from post-transcriptional gene silencing to genome remodelling. Nucleic Acids Res. 2011; 39:4249–64. 10.1093/nar/gkq1283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. de Vanssay A, Touzeau A, Arnaiz O et al. The Paramecium histone chaperone Spt16-1 is required for Pgm endonuclease function in programmed genome rearrangements. PLoS Genet. 2020; 16:e1008949. 10.1371/journal.pgen.1008949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Arnaiz O, Mathy N, Baudry C et al. The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences. PLoS Genet. 2012; 8:e1002984. 10.1371/journal.pgen.1002984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Arnaiz O, Meyer E, Sperling L ParameciumDB 2019: integrating genomic data across the genus for functional and evolutionary biology. Nucleic Acids Res. 2020; 48:D599–605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Johnson NR, Yeoh JM, Coruh C et al. Improved placement of multi-mapping small RNAs. G3 Bethesda Md. 2016; 6:2103–11. 10.1534/g3.116.030452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Carradec Q, Gotz U, Arnaiz O et al. Primary and secondary siRNA synthesis triggered by RNAs from food bacteria in the ciliate Paramecium tetraurelia. Nucleic Acids Res. 2015; 43:1818–33. 10.1093/nar/gku1331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Karunanithi S, Oruganti V, Marker S et al. Exogenous RNAi mechanisms contribute to transcriptome adaptation by phased siRNA clusters in Paramecium. Nucleic Acids Res. 2019; 47:8036–49. 10.1093/nar/gkz553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Solberg T, Mason V, Wang C et al. Developmental mRNA clearance by PIWI-bound endo-siRNAs in Paramecium. Cell Rep. 2023; 42:112213. 10.1016/j.celrep.2023.112213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Arnaiz O, Cain S, Cohen J et al. ParameciumDB: a community resource that integrates the Paramecium tetraurelia genome sequence with genetic data. Nucleic Acids Res. 2007; 35:D439–44. 10.1093/nar/gkl777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Zangarelli C, Arnaiz O, Bourge M et al. Developmental timing of programmed DNA elimination in Paramecium tetraurelia recapitulates germline transposon evolutionary dynamics. Genome Res. 2022; 32:2028–42. 10.1101/gr.277027.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Nakanishi K, Weinberg DE, Bartel DP et al. Structure of yeast Argonaute with guide RNA. Nature. 2012; 486:368–74. 10.1038/nature11211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Baumberger N, Baulcombe DC Arabidopsis ARGONAUTE1 is an RNA Slicer that selectively recruits microRNAs and short interfering RNAs. Proc Natl Acad Sci USA. 2005; 102:11928–33. 10.1073/pnas.0505461102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Liu J, Carmell MA, Rivas FV et al. Argonaute2 Is the catalytic engine of mammalian RNAi. Science. 2004; 305:1437–41. 10.1126/science.1102513. [DOI] [PubMed] [Google Scholar]
- 44. Reuter M, Berninger P, Chuma S et al. Miwi catalysis is required for piRNA amplification-independent LINE1 transposon silencing. Nature. 2011; 480:264–7. 10.1038/nature10672. [DOI] [PubMed] [Google Scholar]
- 45. Rivas FV, Tolia NH, Song J-J et al. Purified Argonaute2 and an siRNA form recombinant human RISC. Nat Struct Mol Biol. 2005; 12:340–9. 10.1038/nsmb918. [DOI] [PubMed] [Google Scholar]
- 46. Hoehener C, Hug I, Nowacki M Dicer-like enzymes with sequence cleavage preferences. Cell. 2018; 173:234–47. 10.1016/j.cell.2018.02.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Matranga C, Tomari Y, Shin C et al. Passenger-strand cleavage facilitates assembly of siRNA into Ago2-containing RNAi enzyme complexes. Cell. 2005; 123:607–20. 10.1016/j.cell.2005.08.044. [DOI] [PubMed] [Google Scholar]
- 48. Miyoshi K, Tsukumo H, Nagami T et al. Slicer function of Drosophila argonautes and its involvement in RISC formation. Genes Dev. 2005; 19:2837–48. 10.1101/gad.1370605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Rand TA, Petersen S, Du F et al. Argonaute2 cleaves the anti-guide strand of siRNA during RISC activation. Cell. 2005; 123:621–9. 10.1016/j.cell.2005.10.020. [DOI] [PubMed] [Google Scholar]
- 50. Noto T, Kurth HM, Kataoka K et al. The tetrahymena argonaute-binding protein Giw1p directs a mature argonaute-siRNA complex to the nucleus. Cell. 2010; 140:692–703. 10.1016/j.cell.2010.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Ignarski M, Singh A, Swart EC et al. Paramecium tetraurelia chromatin assembly factor-1-like protein PtCAF-1 is involved in RNA-mediated control of DNA elimination. Nucleic Acids Res. 2014; 42:11952–64. 10.1093/nar/gku874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Martinez NJ, Gregory RI Argonaute2 expression is post-transcriptionally coupled to microRNA abundance. RNA. 2013; 19:605–12. 10.1261/rna.036434.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Smibert P, Yang J-S, Azzam G et al. Homeostatic control of Argonaute stability by microRNA availability. Nat Struct Mol Biol. 2013; 20:789–95. 10.1038/nsmb.2606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Bender S, Tang Y, Lindroth AM et al. Reduced H3K27me3 and DNA hypomethylation are major drivers of gene expression in K27M mutant pediatric high-grade gliomas. Cancer Cell. 2013; 24:660–72. 10.1016/j.ccr.2013.10.006. [DOI] [PubMed] [Google Scholar]
- 55. Lewis PW, Müller MM, Koletsky MS et al. Inhibition of PRC2 activity by a gain-of-function H3 mutation found in pediatric glioblastoma. Science. 2013; 340:857–61. 10.1126/science.1232245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Lhuillier-Akakpo M, Guérin F, Frapporti A et al. DNA deletion as a mechanism for developmentally programmed centromere loss. Nucleic Acids Res. 2016; 44:1553–65. 10.1093/nar/gkv1110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Gruchota J, Denby Wilkes C, Arnaiz O et al. A meiosis-specific Spt5 homolog involved in non-coding transcription. Nucleic Acids Res. 2017; 45:4722–32. 10.1093/nar/gkw1318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Owsian D, Gruchota J, Arnaiz O et al. The transient Spt4-Spt5 complex as an upstream regulator of non-coding RNAs during development. Nucleic Acids Res. 2022; 50:2603–20. 10.1093/nar/gkac106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Jain SU, Rashoff AQ, Krabbenhoft SD et al. H3 K27M and EZHIP impede H3K27-methylation spreading by inhibiting allosterically stimulated PRC2. Mol Cell. 2020; 80:726–35. 10.1016/j.molcel.2020.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Sarthy JF, Meers MP, Janssens DH et al. Histone deposition pathways determine the chromatin landscapes of H3.1 and H3.3 K27M oncohistones. Life. 2020; 9:e61090. 10.7554/eLife.61090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Brown ZZ, Müller MM, Jain SU et al. Strategy for “Detoxification” of a cancer-derived histone mutant based on mapping its interaction with the methyltransferase PRC2. J Am Chem Soc. 2014; 136:13498–501. 10.1021/ja5060934. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Stafford JM, Lee C-H, Voigt P et al. Multiple modes of PRC2 inhibition elicit global chromatin alterations in H3K27M pediatric glioma. Sci Adv. 2018; 4:eaau5935. 10.1126/sciadv.aau5935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Lee JM, Lee JS, Kim H et al. EZH2 generates a methyl degron that is recognized by the DCAF1/DDB1/CUL4 E3 ubiquitin ligase complex. Mol Cell. 2012; 48:572–86. 10.1016/j.molcel.2012.09.004. [DOI] [PubMed] [Google Scholar]
- 64. He A, Shen X, Ma Q et al. PRC2 directly methylates GATA4 and represses its transcriptional activity. Genes Dev. 2012; 26:37–42. 10.1101/gad.173930.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Sanulli S, Justin N, Teissandier A et al. Jarid2 methylation via the PRC2 complex regulates H3K27me3 deposition during cell differentiation. Mol Cell. 2015; 57:769–83. 10.1016/j.molcel.2014.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Zhang Q, Agius SC, Flanigan SF et al. PALI1 facilitates DNA and nucleosome binding by PRC2 and triggers an allosteric activation of catalysis. Nat Commun. 2021; 12:4592. 10.1038/s41467-021-24866-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Ardehali MB, Anselmo A, Cochrane JC et al. Polycomb repressive complex 2 methylates elongin A to regulate transcription. Mol Cell. 2017; 68:872–84. 10.1016/j.molcel.2017.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Margueron R, Li G, Sarma K et al. Ezh1 and Ezh2 maintain repressive chromatin through different mechanisms. Mol Cell. 2008; 32:503–18. 10.1016/j.molcel.2008.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Ji L, Chen X Regulation of small RNA stability: methylation and beyond. Cell Res. 2012; 22:624–36. 10.1038/cr.2012.36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Kurth HM, Mochizuki K 2′-O-methylation stabilizes Piwi-associated small RNAs and ensures DNA elimination in tetrahymena. RNA. 2009; 15:675–85. 10.1261/rna.1455509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Han J, LaVigne CA, Jones BT et al. A ubiquitin ligase mediates target-directed microRNA decay independently of tailing and trimming. Science. 2020; 370:eabc9546. 10.1126/science.abc9546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Shi CY, Kingston ER, Kleaveland B et al. The ZSWIM8 ubiquitin ligase mediates target-directed microRNA degradation. Science. 2020; 370:eabc9359. 10.1126/science.abc9359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Ohtani H, Iwasaki YW, Shibuya A et al. DmGTSF1 is necessary for Piwi-piRISC-mediated transcriptional transposon silencing in the Drosophila ovary. Genes Dev. 2013; 27:1656–61. 10.1101/gad.221515.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Yoshimura T, Watanabe T, Kuramochi-Miyagawa S et al. Mouse GTSF1 is an essential factor for secondary piRNA biogenesis. MBO Rep. 2018; 19:e42054. 10.15252/embr.201642054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Perez-Riverol Y, Bai J, Bandla C et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 2022; 50:D543–52. 10.1093/nar/gkab1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Berger JD Autogamy in Paramecium. Cell cycle stage-specific commitment to meiosis. xp Cell Res. 1986; 166:475–85. 10.1016/0014-4827(86)90492-1. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
sRNA-seq datasets generated for this study have been deposited at the ENA with Study Accession numbers PRJEB59349 and PRJEB88919. The details can be found in Supplementary Table S2. Mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE [75] partner repository accession number PXD063336. The statistical data and R scripts used to generate the bioinformatic images, the original, uncropped blots and numerical values for each figure, the maps of the plasmids generated in this study, and the processed mass spectrometry data were deposited at https://doi.org/10.5281/zenodo.17154025. All other relevant data supporting the key findings of this study are available within the article and its Supplementary data file or from the corresponding author upon reasonable request.