Abstract
Most endogenous retroviruses (ERVs) in mammals are incapable of retrotransposition; therefore, why ERV derepression is associated with lethality during early development has been a mystery. Here, we report that rapid and selective degradation of the heterochromatin adapter protein TRIM28 triggers dissociation of transcriptional condensates from loci encoding super-enhancer (SE)-driven pluripotency genes and their association with transcribed ERV loci in murine embryonic stem cells. Knockdown of ERV RNAs or forced expression of SE-enriched transcription factors rescued condensate localization at SEs in TRIM28-degraded cells. In a biochemical reconstitution system, ERV RNA facilitated partitioning of RNA polymerase II and the Mediator coactivator into phase-separated droplets. In TRIM28 knockout mouse embryos, single-cell RNA-seq analysis revealed specific depletion of pluripotent lineages. We propose that coding and noncoding nascent RNAs, including those produced by retrotransposons, may facilitate ‘hijacking’ of transcriptional condensates in various developmental and disease contexts.
Subject terms: Gene regulation, Stem cells, Epigenomics
TRIM28 depletion in embryonic stem cells disconnects transcriptional condensates from super-enhancers, which is rescued by knockdown of endogenous retroviruses.
Main
ERVs make up around 10% of mammalian genomes, and ERVs are repressed by multiple mechanisms including heterochromatin, DNA methylation and modification of their RNA transcripts1–13. One of the best-studied repressive pathways involves the TRIM28 heterochromatin corepressor that is recruited by KRAB-ZFP transcription factors to ERVs in pluripotent embryonic stem cells (ESCs), where it recruits the histone H3 K9 methyltransferase SETDB1 and the heterochromatin protein HP1α that together establish a repressive chromatin environment7,9,14–17. ERV derepression is associated with lethality at various embryonic stages and in ESCs deficient for the TRIM28-HP1α pathway7–10, although most ERVs in mice and humans have lost their ability to undergo retrotransposition1–6, and deletion of entire clusters of KRAB-ZFP factors does not lead to elevated transposition rates in mice18. These findings suggest that RNA transcripts produced by ERVs may contribute to developmental phenotypes associated with ERV derepression.
RNA has long been recognized as a component of phase-separated biomolecular condensates, including stress granules, splicing speckles and Cajal bodies19, and recent studies indicate that RNA may make important contributions to nuclear condensates formed by transcriptional regulatory proteins20. During transcription, nascent RNA is thought to promote formation of transcriptional condensates enriched in RNA polymerase II (RNAPII) and the Mediator coactivator through electrostatic interactions that contribute to phase separation21. ERV RNAs can be transcribed at hundreds, if not thousands, of genomic loci, and many nuclear noncoding RNAs localize to the loci where they are produced22. These data lead us to hypothesize that ERV RNA transcripts may impact the genomic distribution of transcriptional condensates in cells deficient for ERV repression.
Here, we test the model that ERV RNA transcripts contribute to lethality associated with ERV derepression through disrupting the genomic distribution of transcriptional condensates. We found that RNAPII-containing condensates, typically associating with SE-driven pluripotency genes, are hijacked by transcribed ERV loci upon acute perturbation of the machinery responsible for ERV repression in ESCs. Condensate association was dependent on ERV RNAs and was rescued by ERV RNA knockdown or forced expression of pluripotency transcription factors. The results highlight an important role of ERV RNA transcripts in nuclear condensates in pluripotent cells.
Results
Rapid and selective degradation of TRIM28 in mESCs
ERVs, including intracisternal A-type particles (IAPs), are bound by members of the TRIM28-HP1α pathway and marked by H3K9me3 in murine ESCs (mESCs)7,9,14–17 (Fig. 1a,b and Supplementary Fig. 1a–g). In contrast, heterochromatin components tend not to occupy enhancers bound by the pluripotency transcription factors (TFs) OCT4, SOX2 and NANOG that drive the cell-type-specific transcriptional program of mESCs7,23,24 (Fig. 1a,b and Supplementary Fig. 1h–j).
Resolving the direct consequences of ERV derepression has been impeded, in part, by limitations of classic gene disruption strategies and the essential nature of the TRIM28-HP1α pathway in mESCs7–10. To overcome these challenges, we generated an mESC line that encodes degradation-sensitive TRIM28-FKBP alleles using the dTAG system (Fig. 1c and Supplementary Fig. 2a,b)25. Directed differentiation and tetraploid aggregation assays confirmed that TRIM28-FKBP mESCs maintained pluripotency and a gene expression profile similar to parental V6.5 mESCs (Supplementary Fig. 2d–n). Endogenously tagged TRIM28 experienced reversible, ligand-dependent proteolysis with near-complete degradation after 6 h of exposure to the dTAG-13 ligand (Fig. 1d and Supplementary Fig. 2b). Quantitative mass spectrometry confirmed that TRIM28 degradation was highly selective up to 24 h of dTAG-13 treatment (Supplementary Fig. 2c). Short-term (up to 24 h) TRIM28 degradation did not substantially alter the protein levels of pluripotency markers (for example, OCT4, SOX2, SSEA-1) (Fig. 1d and Supplementary Fig. 2j-l), suggesting that acute TRIM28 degradation did not markedly alter the pluripotent state.
Reduced SE transcription in TRIM28-degraded mESCs
To monitor changes in transcriptional activity upon acute TRIM28 degradation, we used TT-SLAM-seq, a recently developed genome-wide nascent transcription readout26. TT-SLAM-seq combines metabolic labeling and chemical nucleoside conversion (SLAM-seq)27 with selective enrichment of newly synthesized RNA (TT-seq)28 to detect nascent RNA transcription with high temporal resolution and sensitivity (Supplementary Fig. 3a–c). Consistent with previous reports7,29, we observed derepression of several main classes of ERVs, including IAPs, MMERVK10C and MMERVK9C elements in TRIM28-degraded ESCs (Fig. 1e and Supplementary Fig. 4a–f) and loss of H3K9me3 at these sites (Supplementary Fig. 4g). Derepression of ERVs was also confirmed with extended TRIM28 degradation for 96 h and RNA-seq (Fig. 1e and Supplementary Fig. 4a–e). The TT-SLAM-seq data revealed around 250 genes whose transcription was significantly induced and around 300 genes whose transcription was significantly reduced upon 24 h of TRIM28 degradation (greater than twofold, false discovery rate (FDR) < 0.05) (Fig. 1f,g). The downregulated genes were enriched for SE-associated pluripotency genes (NES = −1.6, P < 10−3) (Fig. 1h). Downregulation of these genes was associated with the reduction of nascent transcription at the SEs (Fig. 1i and Extended Data Fig. 1a–d), which tended to precede the reduction of transcription at the SE-driven gene (Fig. 1f and Extended Data Fig. 1a–c). These results were unexpected, as TRIM28 binds to ERVs in mESCs and is not bound at enhancers or SEs (Fig. 1b and Supplementary Fig. 1i,j). These data reveal the direct transcriptional response to the loss of TRIM28 and suggest that acute TRIM28 degradation leads to reduction of SE transcription in ESCs.
Reduced SE-condensate association in TRIM28-degraded mESCs
Components of the transcription machinery, for example, RNAPII and the Mediator coactivator, form biomolecular condensates that associate with SEs in ESCs30–33, and the presence of RNAPII condensates at genomic sites correlates with elevated transcriptional activity32. We thus hypothesized that reduction of SE transcription in TRIM28-degraded ESCs may be caused by reduced association of transcriptional condensates with SE loci. To test this idea, we visualized the genomic region containing the well-studied SE at the miR290-295 locus using nascent RNA-fluorescence in situ hybridization (FISH) and transcriptional condensates using immunofluorescence (IF) against RNAPII31. RNAPII puncta consistently colocalized with the miR290-295 locus in control ESCs, and the colocalization was reduced after 24 h of TRIM28 degradation (Fig. 1j and Extended Data Fig. 2a), while the overall level of RNAPII did not change (Extended Data Fig. 2b,c). Similar results were observed at the Fgf4 SE locus (Extended Data Fig. 2d,e). These data indicate that transcriptional condensates associate less with SE loci in TRIM28-degraded ESCs.
To further probe colocalization between RNAPII condensates and SEs, we used live-cell super-resolution photoactivated localization microscopy (PALM)32. We used an mESC line that encodes 24 copies of an MS2 stem-loop integrated into the SE-driven Sox2 gene, a transgene encoding the MCP MS2-binding protein with a SNAP tag (MCP-SNAP) and Rpb1 RNAPII subunit endogenously tagged with the Dendra2 photoconvertible fluorophore. We integrated the degradation-sensitive FKBP tag into the Trim28 locus in these cells, enabling acute TRIM28 degradation (Extended Data Fig. 2f). In this system, the MCP-SNAP protein can be used to visualize nascent RNA produced by the Sox2 gene, and the Dendra2 tag can be used to track RNAPII clusters32. We then visualized RNAPII clusters for 2 min in live mESCs using PALM and measured the size and distance of the RNAPII cluster nearest to the Sox2 locus. We found that 24 h dTAG treatment led to a significant reduction in the size of the RNAPII cluster nearest to the Sox2 locus (P = 2 × 10−4, Wilcoxon–Mann–Whitney test) (Fig. 1k) and an increase in the distance between the locus and the nearest RNAPII cluster (P = 0.04, Wilcoxon–Mann–Whitney test) (Fig. 1k), while the global size of RNAPII clusters in the cells and the average number of RNAPII clusters per cell did not change (Fig. 1k). These data indicate reduced association of RNAPII condensates at the Sox2 SE locus upon acute TRIM28 degradation in live cells.
Derepressed IAP RNA foci overlap RNAPII condensates
To investigate whether transcriptional condensates colocalize with derepressed ERVs, we visualized IAP ERV loci with RNA-FISH. Nuclear IAP foci became progressively apparent after 24–48 h of TRIM28 degradation (Fig. 2a and Extended Data Fig. 3a), and some nuclear IAP foci colocalized with RNAPII puncta visualized with IF (mean Manders’ overlap coefficient (MOC), 0.193; n = 24 cells) (Fig. 2a and Extended Data Fig. 3a). Colocalization of IAP foci was similarly observed with Mediator puncta visualized with IF using antibodies against the MED1 (MOC, 0.135; n = 24 cells) (Fig. 2b) and MED23 Mediator subunits (Extended Data Fig. 3b). Overall, ~20% of IAP foci were located within 200 nm of an RNAPII or MED1 puncta, a distance range compatible with regulatory interactions32 (Fig. 2c). Consistent with the colocalization of transcriptional condensates with IAP foci and their reduced colocalization with SEs, the occupancy of RNAPII, Mediator and the transcription-associated H3K27Ac chromatin mark increased at various ERV families already after 24 h of TRIM28 degradation, while their enrichment was reduced at SEs (Fig. 2d and Supplementary Fig. 5a,b).
As transcriptional condensates may associate with multiple distant DNA sites, we explored the possibility that condensates associating with ERVs incorporate ERV-proximal genes. To this end, we visualized the Cthrc1 locus using nascent RNA-FISH, as Cthrc1 was among the top upregulated genes in the TT-SLAM-seq data after 24 h of TRIM28 degradation and is located within 100 kb of three ERVs (Fig. 2e). We found that the Cthrc1 locus colocalized with RNAPII puncta in TRIM28-degraded ESCs (Fig. 2f and Extended Data Fig. 4a). The locus also colocalized with puncta formed by the NFY TF, whose motif is highly enriched in the long terminal repeat (LTR) of IAPs and other ERVs (Fig. 2g and Extended Data Fig. 4b,c), but not with puncta of a control TF NRF1 (Extended Data Fig. 4d). Transient (30 min) treatment of the cells with 1.5% 1,6 hexanediol (1-6 HD)—a short chain aliphatic alcohol that dissolves various biomolecular condensates including RNAPII condensates32 (Extended Data Fig. 3c,d)— reduced the level of Cthrc1 nascent RNA (twofold, P < 0.05, t-test) in TRIM28-degraded cells, indicating that RNAPII condensates contribute to the upregulation of this gene (Extended Data Fig. 3e). We then used CRISPR–Cas9 to delete the three ERVs at the Cthrc1 locus and found that, in the absence of the three ERVs, induction of Cthrc1 and other genes in the locus was compromised upon TRIM28 degradation (Fig. 2h and Supplementary Fig. 6a–e). To further probe contacts between derepressed ERVs and genes, we performed in situ Hi-C in control and TRIM28-degraded ESCs. We found that 24 h of TRIM28 degradation did not lead to marked genome-wide changes in chromatin contacts (Supplementary Figs. 7a,b and 8a,b) but did lead to a shift of the most-induced ERV taxa from the inactive ‘B’ towards the active ‘A’ compartment (Fig. 2i and Supplementary Fig. 7c) and a moderate increase in the contact frequency of ERVs with transcribed genes and SEs (Fig. 2j and Supplementary Fig. 7d). These results demonstrate that transcriptional condensates may incorporate genes proximal to derepressed ERVs.
SE-enriched TFs rescue condensate localization
RNAPII- and Mediator-containing condensates are thought to be anchored at SEs by TFs that are enriched at these sites34. One would thus expect that overexpression of SE-enriched TFs rescues the reduced association of transcriptional condensates with SEs in TRIM28-degraded cells. To test this idea, we generated degradation-sensitive TRIM28-FKBP alleles in an induced pluripotent stem cell (iPSC) line that contains integrated transgenes encoding OCT4, SOX2, KLF4 and MYC under a doxycycline-inducible promoter (Fig. 3a,b and Extended Data Fig. 5a,b)35. The OCT4, SOX2 and KLF4 TFs are highly enriched at SEs in ESCs36. TRIM28 degradation in the iPSCs led to the appearance of IAP foci as revealed by IAP RNA-FISH (Fig. 3c,d). Overexpression of OCT4, SOX2, KLF4 and MYC substantially reduced the fraction of iPSCs containing IAP foci (Fig. 3c,d) and overall IAP RNA level in the cell population (Fig. 3e). Furthermore, OCT4, SOX2, KLF4 and MYC overexpression rescued the extent of colocalization of RNAPII puncta with the miR290-295 SE locus in TRIM28-degraded cells (Fig. 3f) while the overall levels of RNAPII subunits did not change (Extended Data Fig. 5c,d). OCT4, SOX2, KLF4 and MYC overexpression also partially rescued the downregulation of the miR290-295 SE RNA and Pri-miR290-295 transcript in TRIM28-degraded cells (Fig. 3g) and nascent transcript levels at the Klf4, Fgf4, Oct4 and Mycn SE loci (Extended Data Fig. 5e–h). These results suggest that forced expression of SE-binding TFs prevents the loss of transcriptional condensates at the miR290-295 SE locus and attenuates IAP induction in TRIM28-degraded cells.
Roles of ERV RNA in condensate formation and localization
RNA is a key component of numerous biomolecular condensates19 and nascent RNA can enhance phase separation of transcriptional regulatory proteins21. Therefore, we hypothesized that RNA produced at ERV loci may contribute to the genomic localization of RNAPII-containing condensates. To test this idea, we knocked down various ERV RNAs in TRIM28-degraded cells. Expression of shRNAs targeting the four most prominent ERV families (IAPs, MMERVK10Cs, MMERVK9Cs, MMETns) partially rescued the downregulation of SEs and their associated genes after 24 h of TRIM28 degradation while knocking down IAPs alone did not (Extended Data Fig. 6a–d). However, expression of the shRNAs for 24 h before inducing TRIM28 degradation (for 24 h) almost entirely rescued the upregulation of ERV transcript levels (Fig. 4a,b and Extended Data Fig. 6e,g), the appearance of IAP RNA-FISH foci (Extended Data Fig. 6h), reduced transcription at SEs and their associated genes (Fig. 4b,c and Extended Data Fig. 6f) and the reduced association of RNAPII condensates at the mir290-295 SE locus (Fig. 4d and Extended Data Fig. 6i–j). These results indicate that knockdown of ERV RNAs rescues the decrease of SE transcription and reduced condensate localization at SEs in TRIM28-degraded cells.
To further dissect the relationship between ERV RNA and transcriptional condensates, we performed in vitro reconstitution experiments. We purified recombinant, mCherry-tagged C-terminal domain (CTD) of RNAPII, which was shown previously to form condensates in vitro30,33, and mixed it with fluorescein-labeled in vitro transcribed IAP RNA fragments. The IAP RNA fragments facilitated RNAPII CTD droplet formation in a dose-dependent manner (Fig. 4e,f and Extended Data Fig. 7a), and the IAP RNA was enriched within RNAPII CTD droplets in a dose-dependent manner (Fig. 4e,g). IAP RNA also facilitated condensation of the intrinsically disordered region (IDR) of the MED1 Mediator subunit (Extended Data Fig. 7b–e)—a frequently used in vitro model of Mediator31,33,34. Furthermore, IAP RNA enhanced droplet formation of purified recombinant HP1α (an in vitro model of heterochromatin37), but the optimal concentration of the RNA for HP1α was about fivefold lower than that for MED1 IDR in this in vitro system (Fig. 4h and Extended Data Fig. 8a,b). As expected, various other RNAs for example SE RNA21 and RNA from main satellite repeats38 also enhanced droplet formation of MED1 IDR and HP1α in vitro, but the difference in the optimal RNA concentration stayed consistently about fivefold (Extended Data Fig. 8a,b). Moreover, IAP RNA fragments facilitated partitioning of both the MED1 IDR and NFYC-IDR into IAP-RNA-containing heterotypic droplets (Fig. 4i,j and Extended Data Fig. 7f–i). These results indicate that IAP RNA can enhance droplet formation of key transcriptional regulatory proteins, and suggest a mechanistic basis for the difference of the effect of RNA on heterochromatin and transcriptional condensates.
Transgenic ERVs compete with SEs for activators
Derepressed ERV loci seem to compete for transcriptional condensates with SEs, in part through producing RNA that facilitates condensation of transcriptional activators. To probe this competition directly, we investigated whether simultaneously activated transcription at repetitive loci (for example, ERVs) could compromise transcription at SEs. First, we attempted to activate IAPs using CRISPRa39, but targeting a dCas9-VP64 protein to IAPs with several guide RNAs failed to produce meaningful transcription at those elements. We then mimicked the effect of simultaneous ERV induction by generating an mESC line containing multiple copies of an integrated PiggyBac transposon (Extended Data Fig. 9a–d). The transposon encoded a Dox-inducible green fluorescent protein (GFP) transgene with a polyA between two loxP sites, and ~900 bp fragments of IAPEz ERVs (Fig. 4k). Transfection of a plasmid encoding a Cre-recombinase enabled the generation of an isogenic line (‘IAPEz line’) encoding Dox-inducible IAPEz transgenes with the same copy number and insertion sites as GFP in the parental mESC line (‘GFP line’) (Fig. 4k). Quantitative PCR with reverse transcription (qRT–PCR) analyses confirmed induction of either GFP or IAPEz transcription upon Dox treatment in the respective lines (Fig. 4l). Moreover, induction of IAPEz transcription led to a rapid reduction of SE transcription at the miR290-295, Klf4 and Fgf4 loci, and reduced transcript levels of the associated genes, whereas it generally did not affect transcript levels of typical enhancer-associated genes (Fig. 4m and Extended Data Fig. 9e). In contrast, induction of GFP transcription had only a mild effect on SEs (Fig. 4m and Extended Data Fig. 9e). Consistent with the specific effect of IAPEz RNA induction, cellular fractionation experiments revealed that about twice as much of the IAPEz RNA is retained in the nuclear fraction compared with the GFP RNA (Extended Data Fig. 9f). Similar results were observed in a second pair of mESC lines in which the IAPEz fragment was substituted with fragments of MMERVK10C ERVs of around 900 bp (Extended Data Fig. 9g–i). Induction of MMERVK10C transcription from a PiggyBac transposon compromised transcription at SEs and their associated genes (Extended Data Fig. 9g–i). These results demonstrate that simultaneous activation of transgenic ERVs may compromise SE transcription in mESCs, and the ERV RNA seems to play an important role in this process.
ERV derepression correlates with loss of pluripotent cells
The above results suggest that pluripotent stem cells fail to maintain transcription of SE-driven genes when ERV repression is compromised. This model predicts that the amount of ERV products would correlate with the inability of embryos to maintain a pluripotent compartment. To test this model in vivo, we used our recently developed zygotic perturbation platform (Fig. 5a)40–42. We generated zygotic deletion mutants of TRIM28, SETDB1, HP1α and other epigenetic regulators implicated in ERV repression, and assayed the timing and amount of the GAG protein produced by IAPs (Fig. 5b). IAP GAG foci were detected in E3.5 blastocysts of TRIM28, SETDB1 and KDM1A knockout (KO) mutants, and these mutations were lethal at around E6.5 in embryos (Fig. 5b,c) suggesting that early appearance of IAP GAG foci may correlate with the onset of embryonic lethality.
To probe which cell types are affected by derepression of ERVs, we used single-cell RNA-seq (scRNA-seq). We created a reference cell state map of an early mouse embryo spanning a window of developmental stages (E5.5 to E7.0) that encompass the onset of lethality in TRIM28 KO mice (around E6.5) (Extended Data Fig. 10a–f,i)7,43,44. We then generated TRIM28-deficient embryos using Cas9/sgRNA delivery into zygotes (Extended Data Fig. 10a–c) and mapped cell states using scRNA-seq (Fig. 5d and Extended Data Fig. 10g). The E6.5 TRIM28 KO scRNA-seq data revealed a dramatic scarcity of epiblast cells normally derived from pluripotent cells of the inner cell mass (ICM) (Fig. 5d), and an abundance of extraembryonic lineages (for example, parietal endoderm) (Fig. 5e and Supplementary Fig. 9a,b). Quantifying the fraction of reads mapping to various ERV taxa revealed that IAPs and MMERVK10Cs were derepressed in all cell types, while MMERVK9Cs and MMETns were derepressed specifically in epiblast cells, resulting in a higher total fraction of reads from ERVs (Fig. 5e and Extended Data Fig. 10h). These data indicate that the amount of ERV RNA transcripts is especially high in pluripotent cells in TRIM28 KO embryos.
IF imaging corroborated specific depletion of pluripotent cells in early embryos. The pluripotency factors NANOG, OCT4, SOX2 and KLF4 were already virtually absent in the ICM of TRIM28 KO E3.5 blastocysts (Fig. 5f and Supplementary Fig. 10a-b), and the inner part of the blastocysts was instead populated by cells expressing the endoderm marker GATA6 (Fig. 5f). This phenotype is reminiscent of NANOG knockout embryos, in which the pluripotent ICM is replaced by GATA6-expressing parietal endoderm-like cells (Fig. 5f)45. These data are consistent with upregulation of endoderm markers observed in TRIM28-degraded ESCs (Supplementary Fig. 11a-c). Overall, these findings suggest that extended ERV derepression results in the loss of expression of pluripotency genes and consequent depletion of pluripotent cells in early mouse embryos.
Discussion
The results presented here support a model in which ERV retrotransposons have the capacity to hijack biomolecular condensates formed by key transcriptional regulatory proteins in pluripotent cells (Fig. 5g). This model may help explain why thousands of transposition-incapable ERVs are repressed in mammals and how their reactivation could alter cellular fates in the absence of retrotransposition7,9,18,46–50.
Derepressed ERVs seem to compete with SEs for transcriptional condensates in pluripotent cells, in part through the RNA transcripts they produce. In vitro, various RNA species facilitate phase separation of transcriptional regulators and heterochromatin proteins, primarily via engaging in electrostatic interactions21,38. In ESCs, forced transcription of ERV RNA from multiple loci led to a profound decrease in SE transcription while transcription of GFP RNA had a moderate effect (Fig. 4k–m and Extended Data Fig. 9g–i), suggesting sequence contribution to the impact of ERV RNAs on RNAPII-containing condensates in vivo. Recent studies reported that m6A methylation plays an important role in repressing ERV transcripts11–13. The contribution of ERV RNA to the hijacking of transcriptional condensates from SEs may explain the in vivo role and importance of ERV RNA modifications.
Many nuclear noncoding RNAs are known to localize to the loci where they are produced22, but their functions are mysterious. Transient expression of ERVs and nuclear IAP foci have been described in early-stage human and mouse embryos48,49,51,52 and adult immune cells53, suggesting that ERV RNAs may play important roles in transcriptional programs during mammalian development. Consistently, previous studies suggested that derepressed transposons can act as enhancers and alternative promoters of cellular genes29,48,53–55. RNA transcripts produced by ERVs may thus contribute to the genomic distribution of condensates in various developmental contexts.
Condensate hijacking by ERVs may contribute to disease. Trim28 haploinsufficiency is associated with obesity56 and predisposes to Wilms’ tumor57. Some ERVs may function as enhancers in acute myeloid leukemia58, and ERV transcription is associated with neurological diseases59 such as amyotrophic lateral sclerosis60 and schizophrenia61. The capacity of ERV RNAs to hijack transcriptional condensates may shed light on the molecular basis of these conditions.
Methods
Licenses
All animal procedures were performed in our specialized facility, following all relevant animal welfare guidelines and regulations, approved by the Max Planck Institute for Molecular Genetics and LAGeSo, Berlin (license number, G0247/18-SGr1; and Harvard University (IACUC protocol 28-21)). S2 work was performed following all relevant guidelines and regulations, approved by the Max Planck Institute for Molecular Genetics and the local authorities LAGeSo, Berlin (license number, 222/15-17a).
Cell culture
The V6.5 mESCs and iPSCs were cultured on irradiated primary mouse embryonic fibroblasts (MEFs) under standard serum/leukemia inhibitory factor (LIF) conditions (KO DMEM containing 15% fetal bovine serum (FBS), supplemented with 1× penicillin/streptomycin, 1× GlutaMAX supplement, 1× nonessential amino acids, 0.05 mM β-mercaptoethanol (all from Gibco) and 1,000 U ml–1 LIF).
For ChIP–seq, TT-SLAM–seq and RNA-seq experiments, mESCs were depleted from MEFs by incubating them on gelatin-coated cell culture plates for 45 min at 37 °C, allowing MEFs to attach while mESCs remain in suspension. MEF depletion was performed twice, after which mESCs were seeded on gelatin-coated plates and maintained in serum/LIF conditions with 2,000 U ml–1 LIF.
For RNA-FISH combined with IF, MEF-depleted cells were grown on round 18-mm glass coverslips (Roth LH23.1). Coverslips were coated with 5 µg ml–1 of poly-l-ornithine (Sigma-Aldrich, catalog no. P4957) for 30 min at 37 °C and with 5 µg ml–1 of Laminin (Corning, catalog no. 354232) overnight at 37 °C.
To perturb RNAPII condensates, cells were treated for 30 min with 1.5% 1-6 HD (Sigma) in serum/LIF conditions with 2,000U ml–1 LIF (Extended Data Fig. 3c,d).
Generation of the TRIM28-FKBP ESC line
To knock in the degradation-sensitive FKBPF36V tag at the N-terminus of TRIM28, a repair template containing homology arms spanning upstream and downstream of the target site was cloned into a pUC19 vector (NEB) (Supplementary Fig. 2a). The repair template included a mRuby2 fluorescent protein sequence, P2A linker and the FKBP tag sequence (Supplementary Fig. 2a)25. mRuby2 sequence was amplified from the mRuby2-N1 plasmid (Addgene, catalog no. 54614), and the P2A-FKBP sequence was amplified from the PITCh dTAG donor vector (Addgene, catalog no. 91792). A guide RNA (Supplementary Table 1b) targeting the N-terminus of TRIM28 was cloned into the sgRNA-Cas9 vector pX458 (Addgene, catalog no. 48138). The repair template and the sgRNA-Cas9 vector were transfected into V6.5 mESCs and iPSCs by nucleofection using Amaxa 4D Nucleofector X Unit (Lonza) according to the manufacturer’s instructions. To screen for positive integrations, the transfected cells were sorted for mRuby2 fluorescent protein expression with flow cytometry. The sorted cells were seeded as single cells and expanded for a few days. Single colonies were picked and genotyped for the correct integration with Western blot.
TRIM28 degradation
Before treatment, cells were seeded on 0.2% gelatin-coated plates after two rounds of MEF depletion. For degradation of TRIM28, 500 nM of dTAG-13 compound25 was mixed with mESC medium (supplemented with 2,000 U mL–1 LIF) and incubated for the time indicated; medium was changed daily for fresh dTAG-13.
RNA-FISH combined with IF
RNA-FISH combined with IF was performed essentially as described31. For IF, dTAG-13- or DMSO- treated cells were fixed in 4% paraformaldehyde for 10 min at RT and stored in PBS at 4 °C. All buffers and antibodies were diluted in RNase-free PBS (Thermo Fisher, catalog no. AM9624). Cells were permeabilized with 0.5% Triton X-100 (Thermo Fisher, catalog no. 85111) for 10 min at RT, followed by three consecutive 5 min PBS washes. Cells were then incubated in the primary antibody (RNAPII (Abcam, catalog no. ab817) at 1:500, NFY-A (Santa Cruz, catalog no. sc-17753 X) at 1:250, NRF1 (Abcam, catalog no. ab55744) at 1:500, MED1 (Abcam, catalog no. ab64965) at 1:500 and MED23 (Bethyl Labs, catalog no. A300-425A) in PBS overnight. After two 5 min PBS washes, cells were incubated in the secondary antibody (Invitrogen, goat anti-mouse Alexa Fluor 488 (catalog no. A-11001) or goat anti-rabbit Alexa Fluor 488 (catalog no. A-11008)) at 1:500 in PBS for 60 min at room temperature. Cells were washed twice in PBS for 5 min and re-fixed with 4% paraformaldehyde in PBS for 10 min at room temperature. Following two 5 min PBS washes, cells were washed once with 20% Stellaris RNA-FISH Wash Buffer A (Biosearch Technologies, catalog no. SMF-WA1-60) and 10% deionized formamide (EMD Millipore, catalog no. S4117) in RNase-free water (Invitrogen, catalog no. 10977035) for 5 min at RT. Cells were hybridized with 90% Stellaris RNA-FISH Hybridization Buffer (Biosearch Technologies, catalog no. SMF-HB1-10), 10% deionized formamide and 12.5 or 25 µM Stellaris RNA-FISH probes. Probes were hybridized in a humidified chamber overnight at 37 °C. Cells were washed with Wash Buffer A for 30 min at 37 °C and stained with 0.24 µg ml–1 4,6-diamidino-2-phenylindole (DAPI) in Wash Buffer A for 3 min at room temperature. Cells were washed with Stellaris RNA-FISH Wash Buffer B (Biosearch Technologies, catalog no. SMF-WB1-20) for 5 min at RT, mounted onto glass microscopy slides with Vectashield mounting medium (Vector Laboratories, catalog no. H-1900) and sealed using transparent nail polish. Images were acquired with LSM880 Airyscan microscope equipped with a Plan-Apochromat ×63/1.40 oil differential interference contrast objective or Z1 Observer (Zeiss) microscope with ×100 magnification with Zen 2.3 v.2.3.69.1016 (blue edition) or Zen (black edition). Images were processed with ZEN 3.1 (Zeiss) and ImageJ software v.2.1.0/1.53i (Figs. 1j, 2a–b, 2f–g, 3f and 4d and Extended Data Figs. 2a, 3a–b,d, 4d and 6h). ImageJ colocalization plugins were used for colocalization analysis of ERV IAP RNA-FISH with RNAPII and MED1 IF62,63. For nearest RNAPII cluster distance analysis in the miR290-295 RNA-FISH dataset, z-projections consisting of ±4.5 slices around the FISH spot were obtained in both channels and thresholded to allow detection of individual RNAPII clusters. Center of mass distances to the nearest cluster were calculated using FIJI (DiAna)63. RNA-FISH probes were designed and generated by Biosearch Technologies Stellaris RNA-FISH to target introns of miR290-295 primary transcript and Cthrc1, and IAPEz transcripts. Sequences of RNA-FISH probes are available in Supplementary Table 1a.
Live-cell PALM
Live-cell PALM imaging was carried out as described before32,64,65. mESCs used for live-cell PALM imaging were derived from R1 background, with the Sox2 gene tagged with 24 repeats of MS2 stemloops at its mRNA 3′ end, Rpb1 tagged with Dendra2 at its N-terminus, EF1α-NLS-MCP-SNAP inserted stably into the genome and both alleles of Trim28 tagged with the degradation-sensitive FKBP tag (Extended Data Fig. 2f). Cells were simultaneously illuminated with 1.3 W cm–2 near UV light (405 nm) for photoconversion of Dendra2 and 3.2 kW cm–2 (561 nm) for fluorescence detection with an exposure time of 50 ms. We acquired images of Dendra2-RNAPII for 100 s (2,000 frames) for quantification of Pol II clusters. For dual-color imaging, cells were incubated with 100 nM JF646 SNAP ligand for 20 min and washed with 2i medium, followed by 30 min incubation in 2i media without JF646-HaloTaq ligands, to wash out unbound SNAP ligands before fluorescence imaging in L-15 medium. We acquired 50 frames (2.5 s) with 642 nm excitation with a power intensity of 2.5 kW cm–2 and quickly switched to simultaneous 405/561 imaging for PALM. Super-resolution images were reconstructed and analyzed using MTT66 and qSR67. RNAPII cluster size was defined as the total number of localizations within the image acquisition time (100 s). The distance was calculated as the distance between the center of the MS2 nascent transcription site and the center of the nearest RNAPII cluster (Fig. 1k).
TT-SLAM-seq
TT-SLAM-seq was performed as described previously26. Briefly, cells were treated with DMSO or 500 nM dTAG-13 for 2, 6 or 24 h and subjected to 15 min of 4-thiouridine (4sU) labeling using 500 µM 4sU. Total RNA was extracted with Trizol (Ambion) and 24:1 chloroform:isoamylalcohol (Sigma) while using 0.1 mM dithiothreitol (DTT) in isopropanol precipitation and ethanol washes. For each sample, 50 µg of total RNA was fragmented with Magnesium RNA Fragmentation Module (NEB), and fragmentation buffer was removed from samples with ethanol precipitation in presence of 0.1 mM DTT. RNA was then resuspended in 350 µl RNase-free water, diluted in biotinylation buffer (200 mM HEPES pH 7.5, and 10 mM EDTA) and topped up with 5 µg MTS-Biotin (previously diluted to 50 µg ml–1 in dimethylformamide) to reach a final volume of 500 µl. The biotinylation reaction was incubated for 30 min at room temperature while keeping samples in rotation and protected from light. Unbound biotin was removed with acid-phenol:chloroform extraction (125:24:1, Ambion) and isopropanol precipitation. Biotinylated RNA was resuspended in 100 µl RNase-free water, denatured in 65 °C for 10 min and then cooled on ice for 5 min. The biotinylated RNA was captured with 100 µl µMACS streptavidin beads (Miltenyi) by incubating for 15 min in rotation while keeping samples protected from light. µMACS columns were equilibrated on magnetic stand with nucleic acid equilibration buffer and two times with biotinylation buffer (20 mM HEPES, 1 mM EDTA, pH 8). Beads were transferred to columns and washed three times with wash buffer (100 mM Tris-HCl pH 7.5, 10 mM EDTA, 1 M NaCl and 0.1 % Tween 20), and labeled RNA was eluted twice with a total 200 µl of 100 mM DTT. RNA was cleaned up with RNeasy Minelute columns (Qiagen) and eluted to RNase-free water with 1 mM DTT. 4sU residues of RNA were alkylated with iodoacetamide treatment (10 mM iodoacetamide in 50 mM NaPO4, pH 8 and 50 % DMSO) by incubating samples in 50 °C for 15 min, followed by quenching with 20 mM DTT. RNA samples were purified with ethanol precipitation and treated with Turbo DNase (Invitrogen). Sequencing libraries were prepared with NEBNext Ultra II Directional RNA Library Prep Kit and NEBNext Multiplex Oligos (NEB), according to manufacturer’s instructions, except using 8 min incubation time in fragmentation step.
Generating wild-type and mutant mouse embryos
Zygotes were generated by in vitro fertilization (IVF) as previously described68. Briefly, B6D2F1 female mice aged 7–9 weeks were superovulated with two rounds of hormone injections (5 IU of pregnant mare serum gonadotrophin followed by 5 IU of human chorionic gonadotrophin after 46 h). Oocytes were isolated and cultured in pre-gassed KSOM medium before IVF. F1 (C57BL/6J × Castaneous) sperm isolated from the cauda epididymis were thawed and used for IVF. At 6 h after fertilization, zygotes were washed in M2 medium for multiple rounds and then prepared for electroporation. Alt-R CRISPR–Cas9 and guide RNAs ribonucleoproteins were prepared as described previously40. Guide RNAs used to target the genes are listed in Supplementary Table 1b. Zygotes were washed in three drops of OptiMEM Reduced Serum Medium (Thermo Fisher Scientific) before electroporation. NEPA21 electroporator (NEPAgene) was used for electroporating zygotes with the following settings for a small chamber: four poring pulses of 34 V for 2.5 ms with an interval of 50 ms were used to generate pores in the zona pellucida layer. Voltage decay was set at 10% and (+) polarity. To enable intake of the ribonucleoproteins, five transfer pulses of 5 V were applied for 50 ms each with an interval of 50 ms. Voltage decay for the transfer was set at 40% with an alternating polarity of (+) and (−). Electroporated zygotes were washed in three drops of KSOM medium and cultured in pre-gassed KSOM drops until blastocyst stage under standard embryo culture conditions. Blastocysts were scored for viability and morphology and retransferred bilaterally in a clutch of 15 blastocysts per uterine horn into day 2.5 pseudopregnant CD-1 surrogate female mice. E6.5 embryos were dissected from the uterus in 1× Hanks’ Balanced Salt Solution and used for further analysis. E5.5 wild-type embryos were generated with the setup, and mock electroporation with guide targeting GFP sequence was used.
scRNA-seq of embryos
E5.5 wild-type and E6.5 TRIM28 mutant embryos were dissected from the decidua in 1× Hanks’ Balanced Salt Solution and then washed in 1× PBS. Reichert’s membrane was removed carefully with sharp forceps and glass capillaries, and the embryos were washed in 1× PBS with 0.4% BSA. The embryos were disaggregated with TrypLE Express (Gibco) with gentle pipetting every 10 min up to a total of 40 min at 37 °C. The dissociated cells were counted for viability and then washed in 1× PBS with 0.4% BSA for a total of three washes at 4 °C and 1,200 r.p.m. for 5 min. The cells were subjected to scRNA-seq using a 10x Genomics Chromium Single Cell 3′ v.2 kit. Single-cell libraries were generated following the manufacturer’s instructions with the exception of the cycle number used. Libraries were sequenced on a Novaseq6000 with asymmetric reads and a depth of 300–350 million fragments per library.
Average image and radial distribution analysis
The image analysis pipeline used for the colocalization analysis of RNA-FISH combined with IF was described previously31. Briefly, MATLAB scripts were used to identify RNA-FISH foci in z stacks through intensity thresholding (the same threshold was used for image sets shown on the same figure panels) and create RNA-FISH signal centroids (x, y, z) that were stitched together and positioned in a box of size l = 1.5 μm. For identified FISH foci, signal from corresponding location in the IF channel was collected in the l × l square centered at the RNA-FISH focus at each corresponding z-slice. The IF signal centered at FISH foci for each FISH and IF pair were then combined to calculate an average intensity projection, providing averaged data for IF signal intensity within a l × l square centered at FISH foci. The same process was carried out for the FISH signal intensity centered on its own coordinates, providing averaged data for FISH signal intensity within a l × l square centered at FISH foci. As a control, this same process was carried out for IF signal centered at random nuclear positions generated using custom Python scripts. These average intensity projections were then used to generate two-dimensional contour maps of the signal intensity or radial distribution plots. Contour plots are generated using inbuilt functions in MATLAB. The intensity radial function ((r)) is computed from the average data. For the contour plots of the IF channel, an intensity colormap consisting of 14 bins with gradients of black, violet and green was generated. For the FISH channel, black to magenta was used. The generated colormap was employed to 14 evenly spaced intensity bins for all IF plots. The averaged IF centered at FISH or at randomly selected nuclear locations were plotted using the same color scale. For the radial distribution plots, the Spearman correlation coefficients, r, were computed and reported between the FISH and IF (centered at FISH) signal. A two-tailed Student’s t-test, comparing the Spearman correlation calculated for all pairs, was used to generate P values (Figs. 1j, 2f–g, 3f and 4d and Extended Data Figs. 2a and 4d).
Bioinformatics
All analyses were carried out using R v.3.6.3 unless stated otherwise.
TT-SLAM-seq processing
Raw reads were trimmed by quality, Illumina adapter content and polyA content analogous to the RNA-seq samples and aligned with STAR with parameters ‘–outFilterMultimapNmax 50–outReadsUnmapped Fastx’ to the SILVA database69 (downloaded 6 March 2020) to remove rRNA content. Unaligned reads were afterwards reverse-complemented using the seqtk ‘seq’ command (https://github.com/lh3/seqtk, v.1.3-r106; parameters: -r). Reverse-complemented reads were processed using SLAM-DUNK70 with the ‘all’ command (v.0.4.1; parameters: -rl 100 -5 0) with the GENCODE gene annotation (VM19) as ‘-b’ option. Reads with a ‘T > C’ conversion representing nascent transcription were filtered from the BAM files using alleyoop (provided together with SLAM-DUNK) with the ‘read-separator’ command. Counts per gene were quantified based on the ‘T > C’-converted reads using htseq-count (v.0.11.4; parameters:–stranded=yes,–nonunique=all)71. FPKM values were calculated based on the resulting counts. For genome-wide coverage tracks, technical replicates were merged using samtools ‘merge’72. Coverage tracks for single and merged replicates were obtained using deepTools bamCoverage72 (v.3.4.3; parameters:–normalizeUsing CPM) separately for the forward and reverse strand based on the ‘T > C’-converted reads.
Enhancer and SE annotation
The annotation of SErs, enhancers and enhancer constituents was taken from Whyte et al.73. Coordinates were lifted from mm9 to mm10 using UCSC liftOver. These coordinates were used throughout this study for all enhancer-associated analyses (Supplementary Table 2).
Retrotransposon element definition
The genome-wide retrotransposon annotation of LTR, LINE and SINE elements was downloaded from Repbase74. Based on the Repbase classification system, we used the element annotation as LTR, LINE or SINE as the retrotransposon classes. Retrotransposon families considered in this study were L1 and L2 elements (LINE), ERV1, ERV3, ERVK, ERVL and MALR (LTR), as well as Alu, B2, B4 and MIR elements (SINE). Repeat subfamilies used in this study were subdivided into IAP, MMERVK and MMETn (ERVK) elements. IAPs and MMERVKs consist of multiple different subfamilies as annotated by Repbase (Supplementary Figs. 1a and 4a), which we summarized under these broader terms. The classification is consistent with retrotransposon classification described in previous studies1,75,76.
Full-length retrotransposons were defined based on the Repbase repeat annotation. For full-length ERVK elements, we required the element to consist of an inner part with two flanking LTRs. First, elements annotated as inner parts (containing the keyword ‘int’) were merged if they belonged to the same subfamily and were located within maximal 200 bp of each other. Second, only the merged inner parts with an annotated ERVK LTR within a distance of, at most, 50 bp on each side were selected as full-length element candidates. For IAPs specifically, only LTRs that belonged to an IAP subfamily were considered. No size restrictions were applied on the inner parts or LTRs, which could lead to potential false positive candidates that are too truncated to be able to be transcribed, but, on the other hand, provides an unbiased definition of full-length repeat elements. The subfamily per element was defined based on the inner part. Inner parts flanked by only one LTR were termed half-length elements. LTRs without an inner part were termed solo LTRs. To provide a broad overview of potential full-length L1 elements, only annotated elements with a size of greater than 6 kb were shown. The genomic coordinates of retrotransposons are listed in Supplementary Table 2a–e.
scRNA-seq processing
Fastq files for the wild-type timepoints E6.5 and E7.0 were downloaded from GEO (Supplementary Table 1c)77. For the wild-type time point E5.5 and the Trim28 KO, raw reads (fastq) were generated using Cell Ranger (https://support.10xgenomics.com/single-cell-gene-expression/software/downloads/latest) (v.4) from 10x Genomics Inc. with the command ‘cellranger mkfastq.’ Reads from all timepoints were aligned against the mouse genome (mm10), and barcodes and unique molecular identifiers were counted using ‘cellranger count’. Multiple sequencing runs were combined using ‘cellranger aggr.’
Retrotransposon expression quantification
Global repeat expression quantification from RNA-seq, TT-SLAM-seq and scRNA-seq (Fig. 1e and Supplementary Fig. 4b-e) was carried out as described40. Briefly, to estimate the expression for each retrotransposon subfamily without bias due to gene expression, only reads not overlapping any gene were considered for the analysis. Reads overlapping splice sites, as well as reads with a high polyA content, were removed. The remaining reads were counted per subfamily only if they aligned uniquely or multiple times to elements of the same subfamily. Here, any annotated element of a specific subfamily from Repbase was considered independent of our full-length ERVK annotations. Reads aligning to multiple elements were counted only once. For scRNA-seq samples, reads were counted per subfamily, sample and cell state. The number of reads per subfamily was normalized by library size for RNA-seq and TT-SLAM-seq samples and normalized by reads aligning to genes and repeats for scRNA-seq samples. Fold change (FC) was calculated with respect to the DMSO or wild-type samples.
Statistical tests
The statistical significance of the difference of IAP expression between DMSO control and dTAG timepoints for TT-SLAM-seq and RNA-seq was calculated using an unpaired two-sided t-test (Fig. 1e). Statistical significance of differences in FC (versus DMSO) in control versus 1-6 HD-treated cells was estimated with unpaired two-sided t-test (Extended Data Fig. 3e). All other tests are described in the figure legends.
Definition of boxplot elements
In Figs. 1i and 4c,f–h,j, Extended Data Figs. 7c,e and 8b and Supplementary Fig. 7c, elements depicted in boxplots are as follows: middle line, median; box limits, upper and lower quartile; whiskers, 1.5× interquartile range. In Extended Data Figs. 2b–d and 5c, elements depicted in dot plots are as follows: middle line, mean; whiskers, s.d.; points, all data points.
Statistics and reproducibility
For all RNA-FISH combined with IF experiments, the target combination of gene transcript and transcriptional activator was probed on one coverslip of mESCs and at least two viewpoints were acquired. The number of detected foci included in the radial plot analysis is indicated under nfoci in Figs. 1j, 2f,g, 3f and 4d and Extended Data Figs. 2b,d, 4d and 6j. For Fig. 2a,b, n indicates the number of analyzed nuclei collected from at least three viewpoints, whereas the total number of detected IAPez foci is indicated in Fig. 2c (1,774 for RNAPII and 2,735 for MED1). Colocalizing foci (distance <200 nm) from Fig. 2a,b are indicated in Fig. 2c (344 of 1,774 for RNAPII and 381 of 2,735 for MED1). In Fig. 1h, enhancer constituents with significant transcription (FPKM > 1) are included (n = 117 for super-enhancers, n = 153 for typical enhancers).
IAP RNA-FISH–RNAPII IF experiments were repeated three times. Images and analysis of one representative experiment are displayed in Fig. 2a, and those from a second replicate experiments in Extended Data Fig. 3a. IAP RNA-FISH–MED1/MED23 IF images are from one biological replicate staining experiment (Fig. 2b and Extended Data Fig. 3b). IF images of 1-6 HD-treatment experiments (Extended Data Fig. 3d) and Cthrc1 RNA–NRF1 IF images are from one biological replicate staining experiment (Extended Data Fig. 4d).
For in vitro biochemistry experiments (Fig. 4e,i and Extended Data Figs. 7a,b,d,g,h and 8a), at least one independent slide containing the indicated mix was imaged and at least five independent viewpoints were acquired for each slide. Data are displayed as boxplots (Fig. 4f,g,h,j and Extended Data Figs. 7c,e,i and 8b), and each dot represents an individual droplet (n is the total number of droplets). In the boxplots, the lower box limit was set to the 25th percentile, upper box limit was set to the 75th percentile, the center line indicates the median and the whiskers represent the range within 1.5× interquartile. The following numbers of viewpoints and droplets were analyzed (formatted as condition per number of replicates per number of viewpoints per experiment per total number of droplets shown if the figure contains a boxplot): Fig. 4e,f,g, (IAP RNA: 0 nM; Fluorescein: 73 nM)/3/5/4,152, (3.7 nM)/3/5/1,337, (37 nM)/3/5/8,142, (73 nM)/3/5/11,018; Fig. 4h, (all conditions)/1/5/5 (each dot represents an image); Fig. 4i,j, (IAP RNA: 0 nM)/2/5/308, (15 nM)/2/5/332; Extended Data Fig. 7a–e, (IAP RNA: 0 nM; Fluorescein: 73 nM)/2/5/945, (3.7 nM)/2/5/293, (37 nM)/2/5/217, (73 nM)/2/5/321; Extended Data Fig. 7g, (all conditions)/2/5; Extended Data Fig. 7h,i, (mEGFP)/2/5/337, (NFYC-IDR-mEGFP)/2/5/434; Extended Data Fig. 8a,b, (IAP RNA-Cy5 all combinations)/1/5/5 images, (Maj Sat Repeat RNA-Cy5 all combinations)/1/10/10 images, (MiR290-295 SE RNA-Cy5 with MED1 IDR-mEGFP)/1/10/10 images and (MiR290-295 SE RNA-Cy5 with HP1α-mCherry)/1/20/20 images).
Sample sizes in Fig. 1k are as follows: left, sample size: 157 (DMSO), 112 (dTAG-13) cells; middle left, sample size: 157 (DMSO), 112 (dTAG-13) cells; middle, right, sample size: 2,591 (DMSO), 1,572 (dTAG-13) RNAPII clusters; right, number of RNAPII clusters per cell: sample size: 94 (DMSO), 59 (dTAG-13) cells.
Sample sizes in Fig. 4c are as follows: left panel shows SE constituents with FPKM > 0.05 (n = 163), middle panel contains SE-associated genes (n = 185) and right panel includes all active genes FPKM > 1 (n = 11,525).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41588-022-01132-w.
Supplementary information
Acknowledgements
We thank T. Aktas and A. Bulut-Karslioglu for comments on the manuscript, C. Haggerty (MPIMG) for S2 cells, G. Winter (CeMM, Vienna), M. Jäger (CeMM, Vienna), M. Biesaga (IRB, Barcelona) and X. Salvatella (IRB, Barcelona) for sharing RNAPII antibody, A. Dall’Agnese and E. Guo for advice on FISH-IF, S. Basu for helping with colocalization analysis, S. Mackowiak for help with code, C. Althoff for help with Western blots, B. Lukaszewsa-McGreal for help with mass spectrometry, C. Hillgardt, C. Franke and the MPIMG transgenic facility for help with mouse work and the MPI-MG Sequencing core for sequencing. This work was funded by the Max Planck Society and partially supported by grants from the NIH (1P50HG006193 to A.M.; 5R01GM134734 to I.I.C.), the Deutsche Forschungsgemeinschaft (DFG) Priority Program SPP 2202 Grant HN 4/1-1 and HN 4/3-1 (to D.H.), the Austrian Academy of Sciences and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (ERC-CoG-866166, RiboTrace) (to S.L.A.). H.N. is supported by fellowships from the Emil Aaltonen, Orion Research Foundation and Instrumentarium Science Foundation.
Extended data
Source data
Author contributions
V.A. conceived the study, generated mESC lines, designed and performed RNA-FISH-IF experiments, analyzed and interpreted data, supported TT-SLAM-seq experiments and revised the manuscript. A.S.K. conceived the study, generated and characterized mESC lines, performed directed differentiation experiments, scRNA-seq, repeat knockdown, IAP-gag RNA-FISH, in vivo KO experiments, tetraploid aggregation experiments, whole-embryo IF, flow cytometry experiments and analysis and revised the manuscript. H.N. conceived the study, performed ChIP–seq, TT-SLAM-seq and in situ Hi-C experiments, analyzed and interpreted the data from TT-SLAM-seq and Hi-C experiments, assisted in RNA-seq analysis, performed and supported qRT–PCR experiments, generated ERV-TKO cell lines and revised the manuscript. C.R. conceived the study, designed and performed OSKM ectopic expression and ERV overexpression experiments, generated and characterized iPSC-OSKM-TRIM28-FKBP, mESC-IAPEz and mESC-MMERVK10C cell lines and revised the manuscript. S.H. performed initial processing of RNA-seq, TT-SLAM-seq, ChIP–seq and scRNA-seq datasets, analyzed and interpreted RNA-seq, ChIP–seq and scRNA-seq data, assisted in the TT-SLAM-seq data analysis and revised the manuscript. J.N. performed protein purification and droplet assays. N.F., N.P. and S.L.A. supported TT-SLAM-seq methodology and data analysis. M.D. and I.I.C. designed, performed and analyzed time-correlated PALM experiments. H.K. assisted in scRNA-seq analysis. Z.D.S. and L.W. performed in vivo Trim28 KO and tetraploid aggregation experiments. R.W. generated and performed NPC experiments. M.W. supported cloning and qRT–PCR experiments. R.B. performed quantification for IAP-gag RNA-FISH. S.M. wrote code for image analysis. D.M. performed mass spectrometry. B.T. supported next-generation sequencing methodology. A.M. conceived the study and supervised the work. D.H. conceived the study, supervised the work and wrote the manuscript.
Peer review
Peer review information
Nature Genetics thanks Alessio Zippo, Todd Macfarlan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Funding
Open access funding provided by Max Planck Society.
Data availability
All data are available in the Supplementary Information. Sequence data were deposited at GEO under the accession number GSE159468. Mass spectrometry data were deposited at ProteomeXchange under the accession ID PDX021895. Plasmids generated in the study are available at Addgene. Source data are provided with this paper.
Code availability
All custom code used in this study was deposited along with the raw data at Zenodo (10.5281/zenodo.6521914).
Competing interests
S.L.A. declares competing interest based on a granted patent related to SLAMseq. S.L.A. is co-founder, advisor and member of the board of QUANTRO Therapeutics GmbH
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Vahid Asimi, Abhishek Sampath Kumar, Henri Niskanen, Christina Riemenschneider.
Extended data
is available for this paper at 10.1038/s41588-022-01132-w.
Supplementary information
The online version contains supplementary material available at 10.1038/s41588-022-01132-w.
References
- 1.Thompson PJ, Macfarlan TS, Lorincz MC. Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol. cell. 2016;62:766–776. doi: 10.1016/j.molcel.2016.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Grewal SI, Jia S. Heterochromatin revisited. Nat. Rev. Genet. 2007;8:35–46. doi: 10.1038/nrg2008. [DOI] [PubMed] [Google Scholar]
- 3.Cordaux R, Batzer MA. The impact of retrotransposons on human genome evolution. Nat. Rev. Genet. 2009;10:691–703. doi: 10.1038/nrg2640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Payer LM, Burns KH. Transposable elements in human genetic disease. Nat. Rev. Genet. 2019;20:760–772. doi: 10.1038/s41576-019-0165-8. [DOI] [PubMed] [Google Scholar]
- 5.Rowe HM, Trono D. Dynamic control of endogenous retroviruses during development. Virology. 2011;411:273–287. doi: 10.1016/j.virol.2010.12.007. [DOI] [PubMed] [Google Scholar]
- 6.Wells JN, Feschotte CA. A field guide to eukaryotic transposable elements. Annu Rev. Genet. 2020;54:539–561. doi: 10.1146/annurev-genet-040620-022145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rowe HM, et al. KAP1 controls endogenous retroviruses in embryonic stem cells. Nature. 2010;463:237–240. doi: 10.1038/nature08674. [DOI] [PubMed] [Google Scholar]
- 8.Cammas F, et al. Mice lacking the transcriptional corepressor TIF1beta are defective in early postimplantation development. Development. 2000;127:2955–2963. doi: 10.1242/dev.127.13.2955. [DOI] [PubMed] [Google Scholar]
- 9.Matsui T, et al. Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET. Nature. 2010;464:927–931. doi: 10.1038/nature08858. [DOI] [PubMed] [Google Scholar]
- 10.Hu G, et al. A genome-wide RNAi screen identifies a new transcriptional module required for self-renewal. Genes Dev. 2009;23:837–848. doi: 10.1101/gad.1769609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Xu W, et al. METTL3 regulates heterochromatin in mouse embryonic stem cells. Nature. 2021;591:317–321. doi: 10.1038/s41586-021-03210-1. [DOI] [PubMed] [Google Scholar]
- 12.Chelmicki T, et al. m(6)A RNA methylation regulates the fate of endogenous retroviruses. Nature. 2021;591:312–316. doi: 10.1038/s41586-020-03135-1. [DOI] [PubMed] [Google Scholar]
- 13.Liu J, et al. The RNA m(6)A reader YTHDC1 silences retrotransposons and guards ES cell identity. Nature. 2021;591:322–326. doi: 10.1038/s41586-021-03313-9. [DOI] [PubMed] [Google Scholar]
- 14.Lachner M, O’Carroll D, Rea S, Mechtler K, Jenuwein T. Methylation of histone H3 lysine 9 creates a binding site for HP1 proteins. Nature. 2001;410:116–120. doi: 10.1038/35065132. [DOI] [PubMed] [Google Scholar]
- 15.Wolf G, et al. The KRAB zinc finger protein ZFP809 is required to initiate epigenetic silencing of endogenous retroviruses. Genes Dev. 2015;29:538–554. doi: 10.1101/gad.252767.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Schultz DC, Ayyanathan K, Negorev D, Maul GG, Rauscher FJ., 3rd SETDB1: a novel KAP-1-associated histone H3, lysine 9-specific methyltransferase that contributes to HP1-mediated silencing of euchromatic genes by KRAB zinc-finger proteins. Genes Dev. 2002;16:919–932. doi: 10.1101/gad.973302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yang BX, et al. Systematic identification of factors for provirus silencing in embryonic stem cells. Cell. 2015;163:230–245. doi: 10.1016/j.cell.2015.08.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wolf G, et al. KRAB-zinc finger protein gene expansion in response to active retrotransposons in the murine lineage. eLife. 2020;9:e56337. doi: 10.7554/eLife.56337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Roden C, Gladfelter AS. RNA contributions to the form and function of biomolecular condensates. Nat. Rev. Mol. Cell Biol. 2021;22:183–195. doi: 10.1038/s41580-020-0264-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sharp PA, Chakraborty AK, Henninger JE, Young RA. RNA in formation and regulation of transcriptional condensates. RNA. 2022;28:52–57. doi: 10.1261/rna.078997.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Henninger JE, et al. RNA-mediated feedback control of transcriptional condensates. Cell. 2021;184:207–225.e24. doi: 10.1016/j.cell.2020.11.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Quinodoz SA, et al. RNA promotes the formation of spatial compartments in the nucleus. Cell. 2021;184:5775–5790.e30. doi: 10.1016/j.cell.2021.10.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Elsasser SJ, Noh KM, Diaz N, Allis CD, Banaszynski LA. Histone H3.3 is required for endogenous retroviral element silencing in embryonic stem cells. Nature. 2015;522:240–244. doi: 10.1038/nature14345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bulut-Karslioglu A, et al. Suv39h-dependent H3K9me3 marks intact retrotransposons and silences LINE elements in mouse embryonic stem cells. Mol. Cell. 2014;55:277–290. doi: 10.1016/j.molcel.2014.05.029. [DOI] [PubMed] [Google Scholar]
- 25.Nabet B, et al. The dTAG system for immediate and target-specific protein degradation. Nat. Chem. Biol. 2018;14:431–441. doi: 10.1038/s41589-018-0021-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Reichholf B, et al. Time-resolved small RNA sequencing unravels the molecular principles of microRNA homeostasis. Mol. Cell. 2019;75:756–768 e757. doi: 10.1016/j.molcel.2019.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Muhar M, et al. SLAM-seq defines direct gene-regulatory functions of the BRD4-MYC axis. Science. 2018;360:800–805. doi: 10.1126/science.aao2793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schwalb B, et al. TT-seq maps the human transient transcriptome. Science. 2016;352:1225–1228. doi: 10.1126/science.aad9841. [DOI] [PubMed] [Google Scholar]
- 29.Rowe HM, et al. TRIM28 repression of retrotransposon-based enhancers is necessary to preserve transcriptional dynamics in embryonic stem cells. Genome Res. 2013;23:452–461. doi: 10.1101/gr.147678.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Boehning M, et al. RNA polymerase II clustering through carboxy-terminal domain phase separation. Nat. Struct. Mol. Biol. 2018;25:833–840. doi: 10.1038/s41594-018-0112-y. [DOI] [PubMed] [Google Scholar]
- 31.Sabari BR, et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science. 2018;361:eaar3958. doi: 10.1126/science.aar3958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cho WK, et al. Mediator and RNA polymerase II clusters associate in transcription-dependent condensates. Science. 2018;361:412–415. doi: 10.1126/science.aar4199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Guo YE, et al. Pol II phosphorylation regulates a switch between transcriptional and splicing condensates. Nature. 2019;572:543–548. doi: 10.1038/s41586-019-1464-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Boija A, et al. Transcription factors activate genes through the phase-separation capacity of their activation domains. Cell. 2018;175:1842–1855 e1816. doi: 10.1016/j.cell.2018.10.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wernig M, et al. A drug-inducible transgenic system for direct reprogramming of multiple somatic cell types. Nat. Biotechnol. 2008;26:916–924. doi: 10.1038/nbt1483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Whyte WA, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153:307–319. doi: 10.1016/j.cell.2013.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Larson AG, et al. Liquid droplet formation by HP1alpha suggests a role for phase separation in heterochromatin. Nature. 2017;547:236–240. doi: 10.1038/nature22822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Huo X, et al. The nuclear matrix protein SAFB cooperates with major satellite RNAs to stabilize heterochromatin architecture partially through phase separation. Mol. Cell. 2020;77:368–383.e7. doi: 10.1016/j.molcel.2019.10.001. [DOI] [PubMed] [Google Scholar]
- 39.Konermann S, et al. Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Nature. 2015;517:583–588. doi: 10.1038/nature14136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Grosswendt S, et al. Epigenetic regulator function through mouse gastrulation. Nature. 2020;584:102–108. doi: 10.1038/s41586-020-2552-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Smith ZD, et al. Epigenetic restriction of extraembryonic lineages mirrors the somatic transition to cancer. Nature. 2017;549:543–547. doi: 10.1038/nature23891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Andergassen D, Smith ZD, Kretzmer H, Rinn JL, Meissner A. Diverse epigenetic mechanisms maintain parental imprints within the embryonic and extraembryonic lineages. Dev. Cell. 2021;56:2995–3005 e2994. doi: 10.1016/j.devcel.2021.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Messerschmidt DM, et al. Trim28 is required for epigenetic stability during mouse oocyte to embryo transition. Science. 2012;335:1499–1502. doi: 10.1126/science.1216154. [DOI] [PubMed] [Google Scholar]
- 44.Sampath Kumar A, et al. Loss of maternal Trim28 causes male-predominant early embryonic lethality. Genes Dev. 2017;31:12–17. doi: 10.1101/gad.291195.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Mitsui K, et al. The homeoprotein Nanog is required for maintenance of pluripotency in mouse epiblast and ES cells. Cell. 2003;113:631–642. doi: 10.1016/S0092-8674(03)00393-3. [DOI] [PubMed] [Google Scholar]
- 46.Mouse Genome Sequencing, C. et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi: 10.1038/nature01262. [DOI] [PubMed] [Google Scholar]
- 47.Maksakova IA, et al. Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line. PLoS Genet. 2006;2:e2. doi: 10.1371/journal.pgen.0020002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Macfarlan TS, et al. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature. 2012;487:57–63. doi: 10.1038/nature11244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wang J, et al. Primate-specific endogenous retrovirus-driven transcription defines naive-like stem cells. Nature. 2014;516:405–409. doi: 10.1038/nature13804. [DOI] [PubMed] [Google Scholar]
- 50.Kunarso G, et al. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat. Genet. 2010;42:631–634. doi: 10.1038/ng.600. [DOI] [PubMed] [Google Scholar]
- 51.Fadloun A, et al. Chromatin signatures and retrotransposon profiling in mouse embryos reveal regulation of LINE-1 by RNA. Nat. Struct. Mol. Biol. 2013;20:332–338. doi: 10.1038/nsmb.2495. [DOI] [PubMed] [Google Scholar]
- 52.Grow EJ, et al. Intrinsic retroviral reactivation in human preimplantation embryos and pluripotent cells. Nature. 2015;522:221–225. doi: 10.1038/nature14308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chuong EB, Elde NC, Feschotte C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science. 2016;351:1083–1087. doi: 10.1126/science.aad5497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Jachowicz JW, et al. LINE-1 activation after fertilization regulates global chromatin accessibility in the early mouse embryo. Nat. Genet. 2017;49:1502–1510. doi: 10.1038/ng.3945. [DOI] [PubMed] [Google Scholar]
- 55.Fueyo R, Judd J, Feschotte C, Wysocka J. Roles of transposable elements in the regulation of mammalian transcription. Nat. Rev. Mol. Cell Biol. 2022;23:481–497. doi: 10.1038/s41580-022-00457-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Dalgaard K, et al. Trim28 haploinsufficiency triggers bi-stable epigenetic obesity. Cell. 2016;164:353–364. doi: 10.1016/j.cell.2015.12.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Diets IJ, et al. TRIM28 haploinsufficiency predisposes to Wilms tumor. Int. J Cancer. 2019;145:941–951. doi: 10.1002/ijc.32167. [DOI] [PubMed] [Google Scholar]
- 58.Deniz O, et al. Endogenous retroviruses are a source of enhancers with oncogenic potential in acute myeloid leukaemia. Nat. Commun. 2020;11:3506. doi: 10.1038/s41467-020-17206-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Geis FK, Goff SP. Silencing and transcriptional regulation of endogenous retroviruses: an overview. Viruses. 2020;13:884. doi: 10.3390/v12080884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Li W, et al. Human endogenous retrovirus-K contributes to motor neuron disease. Sci. Transl. Med. 2015;7:307ra153. doi: 10.1126/scitranslmed.aac8201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Karlsson H, et al. Retroviral RNA identified in the cerebrospinal fluids and brains of individuals with schizophrenia. Proc. Natl Acad. Sci. USA. 2001;98:4634–4639. doi: 10.1073/pnas.061021998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Bolte S, Cordelieres FP. A guided tour into subcellular colocalization analysis in light microscopy. J. Microsc. 2006;224:213–232. doi: 10.1111/j.1365-2818.2006.01706.x. [DOI] [PubMed] [Google Scholar]
- 63.Gilles JF, Dos Santos M, Boudier T, Bolte S, Heck N. DiAna, an ImageJ tool for object-based 3D co-localization and distance analysis. Methods. 2017;115:55–64. doi: 10.1016/j.ymeth.2016.11.016. [DOI] [PubMed] [Google Scholar]
- 64.Cho WK, et al. RNA polymerase II cluster dynamics predict mRNA output in living cells. eLife. 2016;5:e13617. doi: 10.7554/eLife.13617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Cisse II, et al. Real-time dynamics of RNA polymerase II clustering in live human cells. Science. 2013;341:664–667. doi: 10.1126/science.1239053. [DOI] [PubMed] [Google Scholar]
- 66.Serge A, Bertaux N, Rigneault H, Marguet D. Dynamic multiple-target tracing to probe spatiotemporal cartography of cell membranes. Nat. Methods. 2008;5:687–694. doi: 10.1038/nmeth.1233. [DOI] [PubMed] [Google Scholar]
- 67.Andrews JO, et al. qSR: a quantitative super-resolution analysis tool reveals the cell-cycle dependent organization of RNA polymerase I in live human cells. Sci. Rep. 2018;8:7424. doi: 10.1038/s41598-018-25454-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Nakagata N. Cryopreservation of mouse spermatozoa and in vitro fertilization. Methods Mol. Biol. 2011;693:57–73. doi: 10.1007/978-1-60761-974-1_4. [DOI] [PubMed] [Google Scholar]
- 69.Quast C, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590–D596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Neumann T, et al. Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets. BMC Bioinf. 2019;20:258. doi: 10.1186/s12859-019-2849-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Ramirez F, Dundar F, Diehl S, Gruning BA, Manke T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 2014;42:W187–W191. doi: 10.1093/nar/gku365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Hnisz D, et al. Super-enhancers in the control of cell identity and disease. Cell. 2013;155:934–947. doi: 10.1016/j.cell.2013.09.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Jurka J, et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 2005;110:462–467. doi: 10.1159/000084979. [DOI] [PubMed] [Google Scholar]
- 75.Stocking C, Kozak CA. Murine endogenous retroviruses. Cell. Mol. Life Sci. 2008;65:3383–3398. doi: 10.1007/s00018-008-8497-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Crichton JH, Dunican DS, Maclennan M, Meehan RR, Adams IR. Defending the genome from the enemy within: mechanisms of retrotransposon suppression in the mouse germline. Cell. Mol. Life Sci. 2014;71:1581–1605. doi: 10.1007/s00018-013-1468-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Chan MM, et al. Molecular recording of mammalian embryogenesis. Nature. 2019;570:77–82. doi: 10.1038/s41586-019-1184-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available in the Supplementary Information. Sequence data were deposited at GEO under the accession number GSE159468. Mass spectrometry data were deposited at ProteomeXchange under the accession ID PDX021895. Plasmids generated in the study are available at Addgene. Source data are provided with this paper.
All custom code used in this study was deposited along with the raw data at Zenodo (10.5281/zenodo.6521914).