Abstract
The placenta is an organ with extraordinary phenotypic diversity in eutherian mammals. Recent evidence suggests that numerous human placental enhancers are evolved from lineage-specific insertions of endogenous retroviruses (ERVs), yet the transcription factors (TFs) underlying their regulation remain largely elusive. Here, by first focusing on MER41, a primate-specific ERV family previously linked to placenta and innate immunity, we uncover the binding motifs of multiple crucial trophoblast TFs (GATA2/3, MSX2, GRHL2) in addition to innate immunity TFs STAT1 and IRF1. Integration of ChIP-seq data confirms the binding of GATA2/3, MSX2, and their related factors on the majority of MER41-derived enhancers in human trophoblast stem cells (TSCs). MER41-derived enhancers that are constitutively active in human TSCs are distinct from those activated upon interferon stimulation, which is determined by the binding of relevant TFs and their subfamily compositions. We further demonstrate that GATA2/3 and MSX2 have prevalent binding to numerous other ERV families — indicating their broad impact on ERV-derived enhancers. Functionally, the derepression of many syncytiotrophoblast genes after MSX2 knockdown is likely to be mediated by regulatory elements derived from ERVs — suggesting ERVs are also important for mediating transcriptional repression. Overall, this study characterizes the regulation of ERV-derived regulatory elements by GATA2/3, MSX2, and their cofactors in human TSCs, and provides mechanistic insights into the importance of ERVs in human trophoblast regulatory network.
Endogenous retroviruses (ERVs) make up ∼8% of the human genome. ERVs originate from germline infection by exogenous retroviruses, which can then be vertically inherited and expanded in the host during evolution (Stoye 2012; Johnson 2019). The intact structure of ERVs includes the internal protein-coding genes (e.g., gag, pol, and env) flanked by long terminal repeats (LTRs) at both ends, yet the internal genes frequently got lost via LTR recombination resulting in a high proportion of “solo-LTRs” left in the genome (Johnson 2019). ERVs have long been ignored as “junk DNA,” partly because most of them are epigenetically repressed (Sharif et al. 2016; Deniz et al. 2019; Wolf et al. 2020), yet accumulating evidence suggests that specific ERV families could be activated during embryonic development, tumorigenesis, or immune response (Macfarlan et al. 2012; Chuong et al. 2016; Schoenfelder et al. 2018; Todd et al. 2019; Ito et al. 2020). Many ERV families are lineage-specific and rich with transcription factor binding sites (TFBSs) in their LTRs, and therefore can be co-opted as cis-regulatory elements to boost the genetic novelty of the host (Chuong et al. 2016, 2017; Senft and Macfarlan 2021; Sun et al. 2021; Buttler and Chuong 2022).
The placenta is a temporary organ crucial for nutrient/waste exchange, hormone secretion, and maternal-fetus immune tolerance during pregnancy (Maltepe and Fisher 2015; Ander et al. 2019). Even though shared by all eutherian mammals, placentae are extraordinarily diversified in their shape, structure, and cellular composition (Ramsey et al. 1976; Mossman 1987; Hemberger et al. 2020). At the molecular level, numerous genes have altered expression patterns in human placenta relative to rodents or even other primates (Rosenkrantz et al. 2021; Sun et al. 2021). Correspondingly, the enhancers of the placenta are the least conserved across species compared with other tissues (Sun et al. 2021), with ERVs playing crucial roles for lineage-specific enhancer evolution. For example, comparison between mouse and rat trophoblast stem cells (TSCs) indicates that the mouse-specific RLTR13D5 family creates hundreds of enhancers cobound by the core trophoblast transcription factors (TFs) CDX2, EOMES, and ELF5 (Chuong et al. 2013). The functional importance of ERVs in primate placentae has also been analyzed recently. For example, one primate-specific THE1B element creates an enhancer driving CRH expression to influence gestation length (Dunn-Fletcher et al. 2018). Recent transcriptomic and epigenomic comparisons for human, macaque, and mouse placentae further identified dozens of ERV families that are significantly enriched in human placental enhancers (Sun et al. 2021).
Despite the well-recognized importance of ERVs in the placenta, the transcription factors that regulate ERV activation in the human placenta remain largely obscure. Previously we demonstrated that the primate-specific MER41 is among the top most enriched ERV families in human placental enhancers (Sun et al. 2021). MER41 has six subfamilies, including A/B/C/D/E/G (Kojima 2018), that have been shown to play important roles in innate immunity (Buttler and Chuong 2022). Multiple studies suggest that MER41 elements facilitate primate-specific innate immunity evolution by creating hundreds of interferon (IFN)-stimulated cis-elements bound by STAT1 and IRF1 (Schmid and Bucher 2010; Chuong et al. 2016; Raviram et al. 2018). This raises the question of which TFs mediate the activation of MER41-associated enhancers in placenta. We previously identified the serum response factor (SRF) as one TF that binds dozens of MER41-associated enhancers (Sun et al. 2021), including one adjacent to FBN2, a human-placenta-expressed gene crucial for cell invasion (Yu et al. 2020). However, SRF only binds a relatively small percentage (∼6%) of MER41-associated enhancers, indicating the existence of additional upstream TFs. Uncovering the regulators of ERV-associated enhancers will be crucial for an in-depth understanding of the gene regulatory network in human placenta.
In this study, we aim at identifying the TFs that potentially regulate the cis-regulatory elements associated with MER41 transposons and other ERV families in human trophoblast stem cells. By in-depth characterization of the genome-wide binding and regulatory function of the identified candidate TFs, we expect this study will improve our understanding of the regulatory mechanism and function of ERV-associated enhancers in human placenta.
Results
Tissue-specific activation of MER41-associated enhancers in human placenta and trophoblast cells
The primate-specific MER41 family creates numerous lineage-specific enhancers in human placenta (Sun et al. 2021). To better characterize MER41-associated enhancers (hereafter abbreviated as MER41-enhancers), we examined the epigenetically annotated enhancers (i.e., H3K27ac peaks that are >500 bp away from any TSSs) in different human tissues and trophoblast cells. Reanalysis of the ChIP-seq data of H3K27ac from the ENCODE Project (Supplemental Table S1) demonstrates that multiple subfamilies of MER41 (A/B/C/D) are specifically enriched in the enhancers of placenta (as well as the chorion, a fetally derived extraembryonic membrane) relative to other human tissues (Fig. 1A; Supplemental Figs. S1, S2). Furthermore, genes adjacent to MER41-enhancers have increased expression in human placenta relative to other tissues and mouse placenta (Fig. 1B; Supplemental Fig. S3), and some enriched Gene Ontology (GO) terms (e.g., endocytosis, chordae embryonic development) for MER41-enhancers are placenta-related (Supplemental Fig. S4). Further inspection suggests that the genes with increased expression in human placenta relative to mouse are most significantly enriched surrounding enhancers derived from MER41B/D, followed by moderate enrichment surrounding MER41C-enhancers (Supplemental Fig. S5). Integration of the promoter Capture-HiC data for human trophoblast-like cells (Jung et al. 2019) showed that MER41 elements (particularly MER41B) are significantly enriched in the enhancers that contact promoters, and genes that loop with MER41B-enhancers have higher expression in placenta relative to other tissues (Supplemental Fig. S6).
Figure 1.
Characterization of MER41-enhancers in different human tissues and trophoblast cells. (A) Enrichment of MER41 subfamilies in the epigenetically annotated enhancers for different human tissues. Enhancers were annotated as H3K27ac peaks that are more than 500 bp away from any TSSs. (B) The bar plot shows that, relative to distal genes, higher proportion of MER41-enhancer adjacent genes have increased expression in human placenta relative to mouse. MER41-enhancer adjacent genes are defined as those with their TSSs within 50 kb from any MER41-enhancers. P-value calculated using Fisher's exact test is indicated. (C) Enrichment of MER41 subfamilies in the enhancers annotated for human TSC, STB, and EVT. (D) Overlapping of the MER41-enhancers annotated for human TSC, STB, and EVT. (E) Heatmap shows the H3K27ac and H3K9me3 intensity for the shared or cell-specific MER41-enhancers for different human trophoblast cells. (F) IGV tracks showing the transcriptomic and epigenomic profiles surrounding the representative MER41-enhancer adjacent to FBN2.
Through integration of published epigenomic data (Okae et al. 2018), we further annotated the enhancers for human TSC (hTSC), syncytiotrophoblast (STB), and extravillous trophoblast (EVT) cells, which show substantial differences among these cell types (Supplemental Fig. S7). MER41 elements are enriched in the enhancers for all three trophoblast cell types (Fig. 1C). Hundreds of MER41-enhancers are specifically activated in one or two trophoblast cell types (Fig. 1D,E; Supplemental Table S2), and their chromatin state correlates with the transcription of at least some adjacent genes, such as FBN2 and C1QTNF6 (Fig. 1F; Supplemental Fig. S8). Together, these results confirm the tissue-specific activation of MER41-enhancers in human placenta and indicate their potential regulatory effects on some adjacent genes.
Identification of GATA2/3 and MSX2 as putative regulators of MER41-enhancers in human TSCs
To screen for candidate TFs that bind MER41-enhancers, we first determined the motif composition of MER41B consensus sequence and one representative MER41-enhancer adjacent to FBN2. Apart from several TFs (i.e., STAT1, IRF1, SRF) already known to bind MER41 elements (Chuong et al. 2016; Sun et al. 2021), the binding motifs for dozens of other TFs are predicted (Fig. 2A,B). Four of them, including GATA2/3, MSX2, and GRHL2, have placenta-enriched expression (Fig. 2C; Supplemental Fig. S9), and comparison among different trophoblast cell types showed that GATA3 and MSX2 have relatively high expression in TSC whereas GATA2 and GRHL2 have relatively high expression in STB (Supplemental Fig. S10). Particularly, GATA2/3 are key pioneer TFs for human and mouse TSC specification (Home et al. 2017; Krendl et al. 2017; Paul et al. 2017), and MSX2 is essential in restraining STB gene expression in human TSCs (Hornbachner et al. 2021). Even though GRHL2 has not been analyzed in human placenta, it is known to regulate trophoblast branching morphogenesis in mice (Walentin et al. 2015). Inspection of other subfamilies, including MER41A/C/D, also gave similar results (Supplemental Fig. S11), and direct comparison of the MER41-enhancers to MER41 elements lacking the H3K27ac mark showed significant enrichment of the motif for GATA3 (Supplemental Fig. S12), but not MSX2, which demonstrates a role in transcriptional repression (Hornbachner et al. 2021). Of note, the motif of GRHL2 is enriched in MER41-enhancers 1.6-fold, yet the P-value is insignificant (P = 0.135; Supplemental Fig. S12). Together, motif prediction and transcriptomic analysis uncovered GATA2/3, MSX2, and GRHL2 as candidate trophoblast TFs that bind MER41-enhancers in human placenta.
Figure 2.
Integrative analysis identifies GATA2/3 and MSX2 as candidate regulators of MER41-enhancers in human TSCs. (A,B) Motif configurations of MER41B consensus (A) and one representative MER41-enhancer adjacent to FBN2. (B) The motifs for several known (STAT1, SRF) and novel (GATA2/3, MSX2, GRHL2) TFs of MER41 are labeled in red color. (C) Expression profiles of the four TFs, including GATA2/3, MSX2, and GRHL2, that have placenta-enriched expression. The color gradient indicates the column Z-score based on normalized TPM values. (D) Venn diagram showing the peak overlapping of GATA2/3 and MSX2. (E) Enrichment of MER41 subfamilies in the peaks for GATA2/3, MSX2, and cofactors. (F) Bar plots demonstrate the percentages of MER41-enhancers that are bound by GATA2/3, MSX2, and cofactors. The overlap of MER41-enhancers with TF peaks were determined by using the window function of BEDTools with setting: −w 0. (G) Heatmaps demonstrate the prevalent binding of GATA2/3, MSX2, and cofactors on MER41-enhancers in human TSCs.
Recent studies have profiled the genome-wide binding of GATA2/3 (together with TFAP2A/C, which are collectively coined as the “trophectoderm four”) and MSX2 (together with its cofactors ARID1A and SMARCA4) in human TSCs by using ChIP-seq (Krendl et al. 2017; Hornbachner et al. 2021). Through reanalysis of these data, we demonstrate that GATA2/3 and MSX2 have substantially overlapped binding (Fig. 2D), and all these TFs and cofactors have enriched binding to MER41-enhancers in human TSCs (Fig. 2E; Supplemental Table S3). The majority (>60%) of 634 MER41-enhancers annotated for human TSC are bound by GATA3 and MSX2 (and its cofactors ARID1A and SMARCA4), as indicated by both overlapping analysis and heatmap visualization (Fig. 2F,G). The remaining TFs also have widespread binding to MER41-enhancers, even though of lower frequency (Fig. 2F,G). These results were further confirmed by manual inspection of a few representative loci, such as FBN2 and C1QTNF6 (Supplemental Fig. S13). Together, our integrative analyses identified several crucial trophoblast TFs, including GATA2/3 and MSX2, as putative regulators of MER41-enhancers in human TSCs.
Direct comparison of MER41-enhancers that are constitutively active or IFN-stimulated in human TSCs
Intrigued by previous findings that MER41 elements facilitate the evolution of lineage-specific enhancers that are either constitutively activated in normal placenta (Sun et al. 2021) or interferon-stimulated in different human cells (Chuong et al. 2016), we directly compared between placenta- and immune-related MER41-enhancers using human TSC as model. For this purpose, we performed transcriptomic and epigenomic (H3K27ac) profiling of human TSCs with or without interferon gamma (IFNG) stimulation (Supplemental Table S1). As expected, IFNG stimulates hundreds of genes in human TSCs that are highly associated with immune response (Supplemental Fig. S14). We next determined thousands of IFN-stimulated enhancers (Supplemental Fig. S15) and further examined their overlap with MER41 subfamilies. Although MER41A/B/C are enriched with similar folds within constitutively active enhancers, MER41B is enriched within IFN-stimulated enhancers to a higher degree (Fig. 3A). Impressively, the constitutively active enhancers, in contrast to IFN-stimulated ones, are indeed unresponsive to IFN stimulation (Fig. 3B,C; Supplemental Table S4), and closer inspection of representative loci adjacent to FBN2 and AIM2 confirmed their distinct IFN response (Fig. 3D,E). As expected, the motifs for GATA3 and STAT1 are enriched within constitutively active and IFN-stimulated MER41-enhancers, respectively (Fig. 3F). Accordingly, GATA2/3, MSX2, and cofactors bind the constitutively active group with higher frequency and intensity (Fig. 3G,H), and in contrast, immune TFs including STAT1 and IRF1 prefer to bind IFN-stimulated enhancers based on the data for multiple human cell types (Supplemental Fig. S16), which partly explains why one group of MER41-enhancers are IFN-stimulated whereas the other group is constitutively activated in human TSCs. Together, substantial differences regarding the subfamilies and TF binding exist between the placenta- and immune-related MER41-enhancers, suggesting the functional divergence of MER41 elements during evolution.
Figure 3.
Characterization and comparison of the MER41-enhancers that are constitutively active or IFN-stimulated in human TSCs. (A) Enrichment of MER41 subfamilies in the two groups of MER41-enhancers that are constitutively active or IFN-stimulated in human TSCs. (B,C) Heatmaps show the H3K27ac intensity flanking the two groups of MER41-enhancers with or without IFNG stimulation. (D) Frequency of the motifs for GATA3 and STAT1 in constitutively active or IFN-stimulated MER41-enhancers, respectively. P-values calculated by using Fisher's exact test are indicated. (E,F) Representative IGV tracks show the alterations of gene expression and H3K27ac occupancy flanking the MER41-enhancers adjacent to FBN2 and AIM2, respectively. The color gradients indicate the RPKM values (ChIP – Input) calculated from the ChIP-seq data. (G) Bar plots for the binding frequency of GATA2/3, MSX2, and cofactors on the two groups of MER41-enhancers. P-values calculated by using Fisher's exact test are indicated. (H) Heatmaps show the binding intensity of GATA2/3, MSX2, and cofactors on the two groups of MER41-enhancers. The color gradients indicate the RPKM values (ChIP – Input) calculated from the ChIP-seq data.
Prevalent binding of GATA2/3 and MSX2 on different families of ERV-derived enhancers in human TSCs
Given the enriched binding of the aforementioned trophoblast TFs on MER41-enhancers, we are eager to check if other families of ERV-associated enhancers are also bound by these TFs in human TSCs. Globally, GATA2/3, MSX2, and cofactors preferentially occupy ERVs; specifically, 30.0% and 23.8% of the peaks for GATA2 and GATA3 overlap ERV elements, which is higher than the 8.8% as expected in randomly shuffled peaks across genome (Fig. 4A). We further examined their binding to each ERV family, and as expected, MER41A/B/C are among the top enriched (Fig. 4B). Dozens of other ERV families, such as LTR8, MER11A, MER4B, LTR5_Hs, and MER4A1/D1/D/E, are also significantly enriched (Fig. 4B; Supplemental Fig. S17). Many of the enriched ERV families are primate- or even human-specific (Supplemental Table S5). The ERV families enriched in TSC enhancers and the peaks for GATA2/3 and MSX2 correlate well (Fig. 4C), and correspondingly, 49.2% (7637 out of 15,504) of the TSC enhancers that overlap ERVs are bound by GATA2/3 or MSX2 (Fig. 4D), significantly higher than the percentage for other enhancers (P < 2.2 × 10−16, OddRatio = 3.1). The degree of enriched binding to ERV-derived enhancers is higher for GATA2/3 + MSX2 and GATA-only peaks relative to MSX2-only peaks (Supplemental Figs. S18–S20). Indeed, the top enriched ERV families not only have remarkable binding by GATA3 and MSX2, but also possess high intensity of H3K27ac marks (Fig. 4E), suggesting these ERV elements also form enhancers. Closer inspection further confirmed the binding of these TFs on one LTR8-associated enhancer adjacent to the imprinted gene H19 (Fig. 4F) and one MER11A-associated enhancer adjacent to FSTL3 (Fig. 4G); these two genes are both tightly associated with placenta development and diseases (Fowden et al. 2006; Xie et al. 2018; Gong et al. 2021). Together, these results suggest that apart from MER41 elements, these core trophoblast TFs also have prevalent binding to many other families of ERV-associated enhancers, suggesting the broad involvement of these TFs and the bound ERV-associated enhancers in human TSCs.
Figure 4.
Binding of GATA2/3 and cofactors on different families of ERV-associated enhancers in human TSCs. (A) Percentages of the peaks that bind ERV elements for GATA2/3, MSX2, and cofactors. The gray line represents the expected percentage calculated through the comparison of the randomly shuffled peaks (after pooling the peaks for GATA2/3, MSX2, and cofactors) against ERV elements. (B) Enrichment of the top 20 ERV families that have enriched binding by GATA2/3, MSX2, or cofactors. These ERV families are selected based on the ranking according to the enrichment P-values against the TF peaks. (C) Correlation for the ERV enrichment in annotated enhancers versus GATA2/3 and MSX2 peaks. (D) Comparison of the percentages of enhancers (with or without ERV overlapping) that are bound by at least one TF among GATA2/3 and MSX2. P-value calculated by Fisher's exact test is denoted. (E) Averaged profiles show the intensity of H3K27ac, GATA3, and MSX2 occupancy on the top ten ERV families that have enriched binding by GATA2/3, MSX2, and cofactors. (F,G) Representative IGV tracks show the epigenetic profiles and trophoblast TF binding near the LTR8- and MER11A-associated enhancers adjacent to H19 and FSTL3, respectively.
MSX2 represses numerous STB genes in human TSCs through binding ERV-associated regulatory elements
Despite the prevalent binding of GATA2/3 and MSX2 on ERV-associated enhancers, it should be noted that these TFs have distinct functions: whereas GATA2/3 are recognized as pioneer TFs in human TSCs for transcriptional activation (Krendl et al. 2017), MSX2 was recently identified as a transcriptional repressor to restrain the undesired expression of STB genes in human TSCs (Hornbachner et al. 2021). Given that the way ERV elements mediate the function of transcription repressors is rarely reported, in-depth analysis of the binding and function of MSX2 on ERV-associated enhancers is highly desirable. In order to examine the role of MSX2 specifically on the function of ERV-derived enhancers in TSCs, we evaluated published RNA-seq and H3K27ac ChIP-seq data in WT and MSX2 knockdown (MSX2KD) TSCs (Hornbachner et al. 2021). After confirming the repression of STB genes by MSX2 (Supplemental Fig. S21), we further found that MSX2 has enriched binding to dozens of ERV families, including MER41A/B/C, LTR8, LTR8B, MER21A, and LTR3A (Fig. 5A). To clarify the function of MSX2 on ERVs, we compared the H3K27ac marks on the top ten enriched ERV families after the knockdown of MSX2. As expected, knockdown of MSX2 causes increased H3K27ac levels on the MSX2-bound ERV elements, yet the degrees of changes differ among ERV families (Fig. 5B). For example, the H3K27ac level for LTR8B is increased more than threefold, whereas LTR3A is increased more moderately. As control, the H3K27ac level on unbound ERV families shows no remarkable change (Supplemental Fig. S22). These results suggest that MSX2 has a global repressive effect on the bound ERV-derived enhancers in human TSCs.
Figure 5.
MSX2 restricts the expression of some STB genes in human TSCs through the binding of ERV-derived enhancers. (A) Enrichment of ERV families within MSX2 peaks. The top 20 families as ranked by P-values are presented. (B) Averaged curves show the ChIP intensity of MSX2 and H3K27ac (WT vs. MSX2KD) flanking the MSX2-bound elements from the top ten ERV families. (C) MA plot shows the differential H3K27ac peaks between WT and MSX2KD human TSCs. (D) Comparison of the MSX2-bound percentages between genomic loci with increased or decreased H3K27ac levels after MSX2KD. P-value calculated with Fisher's exact test is denoted. (E) Enrichment of ERV families within MSX2-bound loci that have increased H3K27ac levels after MSX2KD. (F) Comparison of the altered expression (MSX2KD vs. WT) across different groups of genes defined based on their association with MSX2-regulated enhancers. A threshold of <10 kb from TSSs was used to group genes. MSX2-regulated enhancers are further classified as two groups (ERV-derived or non-ERV-derived) based on their overlapping with ERV elements. P-values calculated from two-sided student's t-test are denoted. (G,H) Representative IGV tracks showing several LTR8B- or LTR8-derived enhancers that are likely mediating the MSX2-dependent repression of STB genes, including PSG1/3/4, DPEP2NB, DDX28, and DYSF. The RNA-seq and H3K27ac ChIP-seq data in WT and MSX2KD TSCs were adopted from Hornbachner et al. (2021).
MSX2 is known to restrict the expression of STB genes in human TSCs (Hornbachner et al. 2021). To uncover if ERV-associated cis-elements participate in this process, we determined the loci with significantly altered H3K27ac levels after knockdown of MSX2 (MSX2KD) for further analysis by integrating recently published data (Hornbachner et al. 2021). The majority (2519 out of 2706) of these differential loci have increased H3K27ac levels (Fig. 5C), and MSX2 peaks overlap 54.4% of those with increased H3K27ac, significantly higher than those with decreased H3K27ac (Fig. 5D). Several ERV families are also significantly overrepresented within the MSX2-regulated cis-elements (defined as genomic loci that overlap MSX2 peaks and have increased H3K27ac levels in MSX2KD), including LTR8B, HERVK3-int, and LTR8 (Fig. 5E). We further examined the genes adjacent to MSX2-regulated cis-elements, which are further classified as ERV-derived or non-ERV-derived, and as expected, they both have increased expression after MSX2KD (Fig. 5F). Manual inspection of the canonical STB genes repressed by MSX2 indicates that some are likely to be regulated through ERV-derived cis-elements. For example, multiple PSG genes (PSG1/3/4/6/8/9) that get derepressed in MSX2KD harbor LTR8B-derived cis-elements that underlie the regulation of MSX2 (Fig. 5G; Supplemental Fig. S23). In addition, one LTR8-derived cis-element between DPEP2NB and DDX28 and another within DYSF are also likely regulated by MSX2 (Fig. 5H). Together, these results suggest that MSX2 restricts the undesired expression of many STB genes in human TSCs through ERV-associated regulatory elements.
Discussion
In eutherian mammals, the placenta is a transient organ characterized by extraordinary phenotypic diversity (Gerri et al. 2020; Hemberger et al. 2020). At the molecular level, hundreds of placental genes have lineage-specific expression (Dunn-Fletcher et al. 2018; Soncin et al. 2018; Rosenkrantz et al. 2021; Sun et al. 2021), and many are driven by the fast-evolving enhancers which frequently overlap specific ERV families (Chuong et al. 2013; Sun et al. 2021). It is highly desirable to delineate how ERV-associated enhancers participate in the trophoblast gene network, which is under the cooperative regulation of many TFs (Papuchova and Latos 2022).
Our study was initiated by focusing on the primate-specific MER41 family, which is among the top enriched in human placental enhancers (Sun et al. 2021). MER41 elements are also known to create thousands of IFN-stimulated enhancers to drive innate immunity evolution (Schmid and Bucher 2010; Chuong et al. 2016; Raviram et al. 2018). Yet unlike immune-related MER41 elements that have already been well characterized, how MER41-enhancers are regulated in human placenta remains obscure. Motif scanning of the consensus and representative sequences of MER41 indicates that they harbor the binding motifs for multiple TFs, including GATA2/3, MSX2, and GRHL2 which are important regulators for trophoblast lineage maintenance and development in human and/or mice (Ralston et al. 2010; Walentin et al. 2015; Home et al. 2017; Krendl et al. 2017; Rhee et al. 2017; Hornbachner et al. 2021). Integration of public data (Krendl et al. 2017; Hornbachner et al. 2021) verified that the majority of MER41-enhancers are bound by GATA3, MSX2, and their cofactors (TFAP2A/2C, ARID1A, and SMARCA4). Therefore, the trophoblast TFs, particularly GATA2/3 and MSX2, were identified as promising regulators of MER41-enhancers in human TSCs. Functionally, it is tempting to speculate that, through the binding of GATA2/3, MSX2, and their cofactors, MER41-enhancers participate in the human trophoblast regulatory network.
Although MER41 is among the top enriched in human placental enhancers, dozens of other ERV families are also highly enriched; therefore, we wondered if they are regulated through the same set of TFs. We found that GATA2/3 and MSX2 have prevalent binding to dozens of other families of ERV-associated enhancers, including many that are primate-specific (e.g., LTR8, MER4, and MER11) or even human-specific (e.g., LTR5_Hs). Indeed, the ERV families frequently bound by these TFs are well correlated with those enriched in TSC enhancers, suggesting GATA2/3 and MSX2 have a broad effect on ERV-associated TSC enhancers. GATA2/3 are generally recognized as transcriptional activators (Takaku et al. 2016; Krendl et al. 2017), whereas MSX2 is recognized as a transcriptional repressor for STB genes (Hornbachner et al. 2021). In addition, GATA2 and GATA3 are known to have functional redundancy in mouse placenta (Home et al. 2017). Therefore, the cooperation or even competition of these TFs in regulating ERV-associated enhancers is expected. Currently, the molecular functions of these TFs have still not been systematically compared. We expect that the combinatorial function of these TFs on the cobinding loci, including ERV-derived ones, will be clarified through the knockout of each TF and the manipulation of their binding motifs in representative cis-elements.
Even though GATA2/3 and MSX2 are most abundant in placenta, they also express and function in a few other tissues or cell types. For example, GATA3 is also expressed in T cells (Wei et al. 2011; Van de Walle et al. 2016) and mammary gland (Theodorou et al. 2013). So, why are the ERV-associated enhancers, which are frequently bound by these TFs, preferably active in placenta? One possible reason is the unique placenta-specific epigenetic and chromatin signatures. Unlike most other tissues which have global hypermethylation, placenta is abundant with partially methylated domains (Schroeder et al. 2013; Decato et al. 2017) which are also featured in many tumors (Hansen et al. 2011). Mechanistically, one previous study speculated that placenta and tumor share converged DNA methylation pathways that mediate the establishment of such epigenetic features (Lorincz and Schübeler 2017). Given the well-recognized function of DNA methylation for epigenetic repression, it is possible that many genomic loci (particularly ERV-derived ones) are more accessible in the placenta relative to other tissues; thus, these loci are more likely to be activated in placenta. The derepression of ERVs has also been observed in numerous types of tumors, presumably due to the attenuation of the epigenetic silencing (Chuong et al. 2017; Ito et al. 2020). Therefore, it is possible that the prevalent binding of trophoblast TFs on ERV-associated enhancers may also rely on the relatively relaxed epigenetic environment in trophoblast cells. Apart from the epigenetic/chromatin environment, additional mechanisms, such as the existence of specific cofactors, may also contribute to the tissue-specific binding and regulation of ERV-associated enhancers by these TFs in placenta.
Most previous studies about ERV-derived enhancers focused on their activation, yet how they mediate the function of transcription repressors is rarely reported. Aiming at mechanistic insights into the involvement of ERVs for transcriptional repression, we performed in-depth analysis on MSX2, a homeobox gene which is known to be crucial in restraining the undesired expression of STB genes in human TSCs (Hornbachner et al. 2021). Although hundreds of KRAB zinc-finger proteins evolved to repress ERVs in mammals (Imbeault et al. 2017; Wolf et al. 2020), how homeobox genes participate in the repression of ERVs remains poorly understood. The knockdown of MSX2 resulted in globally increased H3K27ac levels on the ERV-associated enhancers it bound, which suggests that it has a global repressive effect on the bound ERVs. For the genomic loci under the repression of MSX2, several ERV families are also significantly overrepresented. Genes adjacent to MSX2-bound enhancers, ERV-derived or not, usually have increased expression after MSX2KD. Furthermore, some STB genes are likely to be regulated through ERV-derived enhancers, such as LTR8B-derived enhancers for multiple PSG genes, an LTR8-derived adjacent to DPEP2NB, DDX28, and DYSF. Therefore, MSX2 restricts the undesired expression of many STB genes in human TSCs through ERV-associated enhancers. These results further suggest that through the recruiting of different transcriptional activators and repressors, ERVs can be co-opted as a cis-regulatory module to mediate the accurate expression of associated genes in the host.
Taken together, this study uncovered the prevalent regulation of multiple trophoblast TFs, particularly GATA2/3, MSX2, and their cofactors, on numerous ERV-derived enhancers in human TSCs. Through comprehensive analysis of the links between ERV-derived enhancers, upstream regulators, and downstream targets, this study provides novel mechanistic insights into the functional involvement of ERVs in the human trophoblast regulatory network.
Methods
Human TSC culture and treatment
Human TSCs derived from human cytotrophoblast cells (Okae et al. 2018) were a gift from the Okae lab. They were cultured in a trophoblast stem cell medium (TSM), as described previously (Sun et al. 2021). Interferon stimulation was performed by adding 1000 U/mL of IFNG (PBL 11500-2) into the culturing medium of human TSCs. After 24 h, untreated and IFNG-treated samples were both collected for subsequent ChIP-seq (H3K27ac) and RNA-seq experiments.
RNA-seq
Total RNA for human TSCs was extracted using RNeasy Micro kit (Qiagen 74004) with on-column DNase digestion, and then submitted for library construction by TruSeq stranded mRNA sample preparation kit (Illumina). Two biological replicates were used per sample. RNA-seq libraries were sequenced as 75 bp paired-end reads with a HiSeq 2500 (Illumina) platform.
Raw reads were trimmed with Trim Galore! v0.6.4 (https://github.com/FelixKrueger/TrimGalore). Transcript per million (TPM) values were calculated with RSEM v1.3.2 (Li and Dewey 2011). To perform differential expression analysis, we aligned trimmed reads to the reference genome (GRCh38 for human) using STAR v2.7.3 (Dobin et al. 2013), and then obtained gene-level read counts using the featureCount function from subread v2.0.0 (Liao et al. 2013). At last, differentially expressed genes were identified using DESeq2 v1.30.1 (Love et al. 2014) with the cutoff: FDR < 0.05 and |log2Foldchange|>1.
ChIP-seq
ChIP-seq for human TSCs was performed following our previous study (Sun et al. 2021). Chromatin fragmentation was performed using Diagenode Bioruptor Plus Sonicator. The antibody for H3K27ac (Abcam ab4729) is used. The amount of chromatin is 20 μg per reaction. Two biological replicates were used per sample. ChIP-seq libraries were constructed using Takara SMARTer ThruPLEX DNA-seq Kit (Takara R400674) and sequenced as 50 bp paired-end reads with HiSeq 2500 (Illumina) platform.
Reads were trimmed with Trim Galore! v0.6.4 and then aligned to the corresponding reference genome (GRCh38 for human) using Bowtie 2 v2.3.5 (Langmead and Salzberg 2012) with default settings. Polymerase chain reaction (PCR) duplicates were removed using the rmdup function of SAMtools v1.13 (Li et al. 2009). After confirming the data reproducibility, reads from biological replicates were pooled together for further analysis. Peak calling was performed with MACS v2.2.6 (Zhang et al. 2008) with settings: −g hs −q 0.05. The peaks were further cleaned by removing those that overlap ENCODE Blacklist V2 regions (Amemiya et al. 2019). Differential binding analysis was performed using DiffBind v3.4.11 (Ross-Innes et al. 2012) with settings: minOverlap = 1, summits = 400, method = DBA_EDGER.
Reference genome and annotation
Reference genome and gene annotation for human (GRCh38) were downloaded from the Ensembl database (release 102) (Yates et al. 2020). Transposable element annotations were downloaded from the RepeatMasker website (http://www.repeatmasker.org/) on May 27, 2016. The clades for ERV families were obtained from the Dfam database (Hubley et al. 2016). Genome mappability along the reference genome was calculated using the GEM-mappability program from GEM (GEnome Multitool) suite (Derrien et al. 2012).
Gene Ontology enrichment analysis
Gene Ontology enrichment analyses for differentially expressed genes were performed using Metascape (Zhou et al. 2019). Gene Ontology enrichment analyses for genomic regions (e.g., peaks and putative enhancers) were performed with GREAT (McLean et al. 2010).
Motif analysis
Motif scanning on MER41 consensus sequences (downloaded from the RepeatMasker website on May 27, 2016) was performed against the nonredundant JASPAR2022 CORE vertebrates motif database (JASPAR2022_CORE_vertebrates_nonredundant) using FIMO (Grant et al. 2011) with settings: ‐‐text ‐‐thresh 1E-3. The locations of predicted motifs were visualized with Integrative Genomics Viewer (IGV) v2.11.1 (Thorvaldsdottir et al. 2013), and redundant predictions (e.g., multiple prediction results of the same TF on overlapped position) were manually adjusted to get optimized visualization. Motif occurrence on representative MER41 elements was retrieved directly from the UCSC Genome Browser (Lee et al. 2020). Motif enrichment analysis on MER41-derived enhancers relative to MER41 elements that were lacking the H3K27ac mark was performed by using the findMotifs.pl script from HOMER v4.11 (Heinz et al. 2010) with default settings.
Epigenetic annotation of regulatory elements
Putative regulatory elements are defined based on histone modifications and genomic distribution. Promoters are defined as H3K4me3 occupied regions, and enhancers as H3K27ac peaks that are more than 500 bp away from TSSs.
Annotation of the genes adjacent to given genomic regions
The genes adjacent to given genomic regions (e.g., peaks, enhancers, ERV elements) were determined by comparing gene TSSs against specified genomic regions using the window function of BEDTools v2.29.2 (Quinlan and Hall 2010), with the distance threshold specified by the parameter w. The TSS annotation file was retrieved from Ensembl database (release 102) by using BioMart (Yates et al. 2020).
Differential expression analysis between human and mouse
Interspecies differential expression analysis between human and mouse placentae was performed using ExprX (https://github.com/mingansun/ExprX) as previous described (Sun et al. 2021). In brief, only 1-to-1 orthologous gene pairs annotated by Ensembl database (Yates et al. 2020) were used for analysis, and differentially expressed genes were determined using the Rank Product method with the cutoff of FDR < 0.05 and |log2foldchange| > 1.
ERV enrichment analysis
To determine if certain ERV families are overrepresented within given genomic regions (e.g., enhancers or peaks), we adopted the fisher function of BEDTools v2.29.2 (Quinlan and Hall 2010), which determines the enrichment fold and P-value between two lists of genomic intervals by using Fisher's exact test. A wrapper named TEenrich-FET (https://github.com/mingansun/TE_analysis) was further implemented to simplify the analysis of multiple ERV families. To control for Family-Wise Error Rate, the calculated P-values were further adjusted with Bonferroni method.
Statistical analysis and data visualization
All statistical analyses were performed with R statistical programming language (R Core Team 2020). Heatmaps for ChIP-seq data were generated using deepTools v3.5.1 (Ramírez et al. 2014), with ChIP signal normalized as RPKM values. Heatmaps from gene expression clustering analysis were generated using pheatmap (https://github.com/raivokolde/pheatmap). RNA-seq and ChIP-seq tracks were visualized using IGV v2.11.1 (Thorvaldsdottir et al. 2013).
Data access
All raw and processed sequencing data generated in this study have been submitted to the NCBI Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) under accession number GSE209541.
Supplementary Material
Acknowledgments
This study was supported by grants from the National Natural Science Foundation of China (No. 32270584 and No. 31900422 to M.A.S.; No. 82130047 and No. 81971414 to B.C.), the National Key Research and Development Program of China (No. 2022YFC2702403 to B.C.), the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health (T.S.M.), the 111 Project No. D18007, the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), and the Natural Sciences Foundation of Fujian Province of China (No. 2020J06003 to B.C.). We thank Dr. Hiroaki Okae and Dr. Takahiro Arima for kindly providing human trophoblast stem cells. This study utilized the computational resources of Yangzhou University College of Veterinary Medicine High-Performance Computing cluster.
Footnotes
[Supplemental material is available for this article.]
Article published online before print. Article, supplemental material, and publication date are at https://www.genome.org/cgi/doi/10.1101/gr.277150.122.
Competing interest statement
The authors declare no competing interests.
References
- Amemiya HM, Kundaje A, Boyle AP. 2019. The ENCODE blacklist: identification of problematic regions of the genome. Sci Rep 9: 9354. 10.1038/s41598-019-45839-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ander SE, Diamond MS, Coyne CB. 2019. Immune responses at the maternal-fetal interface. Sci Immunol 4: eaat6114. 10.1126/sciimmunol.aat6114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buttler CA, Chuong EB. 2022. Emerging roles for endogenous retroviruses in immune epigenetic regulation. Immunol Rev 305: 165–178. 10.1111/imr.13042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chuong EB, Rumi MA, Soares MJ, Baker JC. 2013. Endogenous retroviruses function as species-specific enhancer elements in the placenta. Nat Genet 45: 325–329. 10.1038/ng.2553 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chuong EB, Elde NC, Feschotte C. 2016. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 351: 1083–1087. 10.1126/science.aad5497 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chuong EB, Elde NC, Feschotte C. 2017. Regulatory activities of transposable elements: from conflicts to benefits. Nat Rev Genet 18: 71–86. 10.1038/nrg.2016.139 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Decato BE, Lopez-Tello J, Sferruzzi-Perri AN, Smith AD, Dean MD. 2017. DNA methylation divergence and tissue specialization in the developing mouse placenta. Mol Biol Evol 34: 1702–1712. 10.1093/molbev/msx112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deniz O, Frost JM, Branco MR. 2019. Regulation of transposable elements by DNA modifications. Nat Rev Genet 20: 417–431. 10.1038/s41576-019-0106-6 [DOI] [PubMed] [Google Scholar]
- Derrien T, Estellé J, Marco Sola S, Knowles DG, Raineri E, Guigó R, Ribeca P. 2012. Fast computation and applications of genome mappability. PLoS One 7: e30377. 10.1371/journal.pone.0030377 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. 2013. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29: 15–21. 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunn-Fletcher CE, Muglia LM, Pavlicev M, Wolf G, Sun MA, Hu YC, Huffman E, Tumukuntala S, Thiele K, Mukherjee A, et al. 2018. Anthropoid primate-specific retroviral element THE1B controls expression of CRH in placenta and alters gestation length. PLoS Biol 16: e2006337. 10.1371/journal.pbio.2006337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fowden AL, Sibley C, Reik W, Constancia M. 2006. Imprinted genes, placental development and fetal growth. Horm Res 65 Suppl 3: 50–58. 10.1159/000091506 [DOI] [PubMed] [Google Scholar]
- Gerri C, McCarthy A, Alanis-Lobato G, Demtschenko A, Bruneau A, Loubersac S, Fogarty NME, Hampshire D, Elder K, Snell P, et al. 2020. Initiation of a conserved trophectoderm program in human, cow and mouse embryos. Nature 587: 443–447. 10.1038/s41586-020-2759-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gong S, Gaccioli F, Dopierala J, Sovio U, Cook E, Volders PJ, Martens L, Kirk PDW, Richardson S, Smith GCS, et al. 2021. The RNA landscape of the human placenta in health and disease. Nat Commun 12: 2639. 10.1038/s41467-021-22695-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grant CE, Bailey TL, Noble WS. 2011. FIMO: scanning for occurrences of a given motif. Bioinformatics 27: 1017–1018. 10.1093/bioinformatics/btr064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hansen KD, Timp W, Bravo HC, Sabunciyan S, Langmead B, McDonald OG, Wen B, Wu H, Liu Y, Diep D, et al. 2011. Increased methylation variation in epigenetic domains across cancer types. Nat Genet 43: 768–775. 10.1038/ng.865 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. 2010. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38: 576–589. 10.1016/j.molcel.2010.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hemberger M, Hanna CW, Dean W. 2020. Mechanisms of early placental development in mouse and humans. Nat Rev Genet 21: 27–43. 10.1038/s41576-019-0169-4 [DOI] [PubMed] [Google Scholar]
- Home P, Kumar RP, Ganguly A, Saha B, Milano-Foster J, Bhattacharya B, Ray S, Gunewardena S, Paul A, Camper SA, et al. 2017. Genetic redundancy of GATA factors in the extraembryonic trophoblast lineage ensures the progression of preimplantation and postimplantation mammalian development. Development 144: 876–888. 10.1242/dev.145318 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hornbachner R, Lackner A, Papuchova H, Haider S, Knöfler M, Mechtler K, Latos PA. 2021. MSX2 safeguards syncytiotrophoblast fate of human trophoblast stem cells. Proc Natl Acad Sci 118: e2105130118. 10.1073/pnas.2105130118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hubley R, Finn RD, Clements J, Eddy SR, Jones TA, Bao W, Smit AF, Wheeler TJ. 2016. The Dfam database of repetitive DNA families. Nucleic Acids Res 44: D81–D89. 10.1093/nar/gkv1272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Imbeault M, Helleboid PY, Trono D. 2017. KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks. Nature 543: 550–554. 10.1038/nature21683 [DOI] [PubMed] [Google Scholar]
- Ito J, Kimura I, Soper A, Coudray A, Koyanagi Y, Nakaoka H, Inoue I, Turelli P, Trono D, Sato K. 2020. Endogenous retroviruses drive KRAB zinc-finger protein family expression for tumor suppression. Sci Adv 6: eabc3020. 10.1126/sciadv.abc3020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson WE. 2019. Origins and evolutionary consequences of ancient endogenous retroviruses. Nat Rev Microbiol 17: 355–370. 10.1038/s41579-019-0189-2 [DOI] [PubMed] [Google Scholar]
- Jung I, Schmitt A, Diao Y, Lee AJ, Liu T, Yang D, Tan C, Eom J, Chan M, Chee S, et al. 2019. A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat Genet 51: 1442–1449. 10.1038/s41588-019-0494-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kojima KK. 2018. Human transposable elements in Repbase: genomic footprints from fish to humans. Mob DNA 9: 2. 10.1186/s13100-017-0107-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krendl C, Shaposhnikov D, Rishko V, Ori C, Ziegenhain C, Sass S, Simon L, Müller NS, Straub T, Brooks KE, et al. 2017. GATA2/3-TFAP2A/C transcription factor network couples human pluripotent stem cell differentiation to trophectoderm with repression of pluripotency. Proc Natl Acad Sci 114: E9579–E9588. 10.1073/pnas.1708341114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9: 357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee CM, Barber GP, Casper J, Clawson H, Diekhans M, Gonzalez JN, Hinrichs AS, Lee BT, Nassar LR, Powell CC, et al. 2020. UCSC Genome Browser enters 20th year. Nucleic Acids Res 48: D756–D761. 10.1093/nar/gkz1012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li B, Dewey CN. 2011. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12: 323. 10.1186/1471-2105-12-323 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25: 2078–2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao Y, Smyth GK, Shi W. 2013. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res 41: e108. 10.1093/nar/gkt214 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lorincz MC, Schübeler D. 2017. Evidence for converging DNA methylation pathways in placenta and cancer. Dev Cell 43: 257–258. 10.1016/j.devcel.2017.10.009 [DOI] [PubMed] [Google Scholar]
- Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15: 550. 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Macfarlan TS, Gifford WD, Driscoll S, Lettieri K, Rowe HM, Bonanomi D, Firth A, Singer O, Trono D, Pfaff SL. 2012. Embryonic stem cell potency fluctuates with endogenous retrovirus activity. Nature 487: 57–63. 10.1038/nature11244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maltepe E, Fisher SJ. 2015. Placenta: the forgotten organ. Annu Rev Cell Dev Biol 31: 523–552. 10.1146/annurev-cellbio-100814-125620 [DOI] [PubMed] [Google Scholar]
- McLean CY, Bristor D, Hiller M, Clarke SL, Schaar BT, Lowe CB, Wenger AM, Bejerano G. 2010. GREAT improves functional interpretation of cis-regulatory regions. Nat Biotechnol 28: 495–501. 10.1038/nbt.1630 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mossman HW. 1987. Vertebrate fetal membranes: comparative ontogeny and morphology, evolution, phylogenetic significance, basic functions, research opportunities. Rutgers University Press, New Brunswick, NJ. [Google Scholar]
- Okae H, Toh H, Sato T, Hiura H, Takahashi S, Shirane K, Kabayama Y, Suyama M, Sasaki H, Arima T. 2018. Derivation of human trophoblast stem cells. Cell Stem Cell 22: 50–63.e6. 10.1016/j.stem.2017.11.004 [DOI] [PubMed] [Google Scholar]
- Papuchova H, Latos PA. 2022. Transcription factor networks in trophoblast development. Cell Mol Life Sci 79: 337. 10.1007/s00018-022-04363-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paul S, Home P, Bhattacharya B, Ray S. 2017. GATA factors: master regulators of gene expression in trophoblast progenitors. Placenta 60 Suppl 1: S61–S66. 10.1016/j.placenta.2017.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinlan AR, Hall IM. 2010. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26: 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ralston A, Cox BJ, Nishioka N, Sasaki H, Chea E, Rugg-Gunn P, Guo G, Robson P, Draper JS, Rossant J. 2010. Gata3 regulates trophoblast development downstream of Tead4 and in parallel to Cdx2. Development 137: 395–403. 10.1242/dev.038828 [DOI] [PubMed] [Google Scholar]
- Ramírez F, Dundar F, Diehl S, Gruning BA, Manke T. 2014. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res 42: W187–W191. 10.1093/nar/gku365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramsey EM, Houston ML, Harris JW. 1976. Interactions of the trophoblast and maternal tissues in three closely related primate species. Am J Obstet Gynecol 124: 647–652. 10.1016/0002-9378(76)90068-5 [DOI] [PubMed] [Google Scholar]
- Raviram R, Rocha PP, Luo VM, Swanzey E, Miraldi ER, Chuong EB, Feschotte C, Bonneau R, Skok JA. 2018. Analysis of 3D genomic interactions identifies candidate host genes that transposable elements potentially regulate. Genome Biol 19: 216. 10.1186/s13059-018-1598-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team. 2020. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. https://www.R-project.org/. [Google Scholar]
- Rhee C, Lee BK, Beck S, LeBlanc L, Tucker HO, Kim J. 2017. Mechanisms of transcription factor-mediated direct reprogramming of mouse embryonic stem cells to trophoblast stem-like cells. Nucleic Acids Res 45: 10103–10114. 10.1093/nar/gkx692 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosenkrantz JL, Gaffney JE, Roberts VHJ, Carbone L, Chavez SL. 2021. Transcriptomic analysis of primate placentas and novel rhesus trophoblast cell lines informs investigations of human placentation. BMC Biol 19: 127. 10.1186/s12915-021-01056-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, et al. 2012. Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481: 389–393. 10.1038/nature10730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmid CD, Bucher P. 2010. MER41 repeat sequences contain inducible STAT1 binding sites. PLoS One 5: e11425. 10.1371/journal.pone.0011425 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenfelder S, Mifsud B, Senner CE, Todd CD, Chrysanthou S, Darbo E, Hemberger M, Branco MR. 2018. Divergent wiring of repressive and active chromatin interactions between mouse embryonic and trophoblast lineages. Nat Commun 9: 4189. 10.1038/s41467-018-06666-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schroeder DI, Blair JD, Lott P, Yu HO, Hong D, Crary F, Ashwood P, Walker C, Korf I, Robinson WP, et al. 2013. The human placenta methylome. Proc Natl Acad Sci 110: 6037–6042. 10.1073/pnas.1215145110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Senft AD, Macfarlan TS. 2021. Transposable elements shape the evolution of mammalian development. Nat Rev Genet 22: 691–711. 10.1038/s41576-021-00385-1 [DOI] [PubMed] [Google Scholar]
- Sharif J, Endo TA, Nakayama M, Karimi MM, Shimada M, Katsuyama K, Goyal P, Brind'Amour J, Sun MA, Sun Z, et al. 2016. Activation of endogenous retroviruses in Dnmt1−/− ESCs involves disruption of SETDB1-mediated repression by NP95 binding to hemimethylated DNA. Cell Stem Cell 19: 81–94. 10.1016/j.stem.2016.03.013 [DOI] [PubMed] [Google Scholar]
- Soncin F, Khater M, To C, Pizzo D, Farah O, Wakeland A, Arul Nambi Rajan K, Nelson KK, Chang CW, Moretto-Zita M, et al. 2018. Comparative analysis of mouse and human placentae across gestation reveals species-specific regulators of placental development. Development 145: dev156273. 10.1242/dev.156273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stoye JP. 2012. Studies of endogenous retroviruses reveal a continuing evolutionary saga. Nat Rev Microbiol 10: 395–406. 10.1038/nrmicro2783 [DOI] [PubMed] [Google Scholar]
- Sun MA, Wolf G, Wang Y, Senft AD, Ralls S, Jin J, Dunn-Fletcher CE, Muglia LJ, Macfarlan TS. 2021. Endogenous retroviruses drive lineage-specific regulatory evolution across primate and rodent placentae. Mol Biol Evol 38: 4992–5004. 10.1093/molbev/msab223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takaku M, Grimm SA, Shimbo T, Perera L, Menafra R, Stunnenberg HG, Archer TK, Machida S, Kurumizaka H, Wade PA. 2016. GATA3-dependent cellular reprogramming requires activation-domain dependent recruitment of a chromatin remodeler. Genome Biol 17: 36. 10.1186/s13059-016-0897-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Theodorou V, Stark R, Menon S, Carroll JS. 2013. GATA3 acts upstream of FOXA1 in mediating ESR1 binding by shaping enhancer accessibility. Genome Res 23: 12–22. 10.1101/gr.139469.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thorvaldsdottir H, Robinson JT, Mesirov JP. 2013. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14: 178–192. 10.1093/bib/bbs017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Todd CD, Deniz O, Taylor D, Branco MR. 2019. Functional evaluation of transposable elements as enhancers in mouse embryonic and trophoblast stem cells. eLife 8: e44344. 10.7554/eLife.44344 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van de Walle I, Dolens AC, Durinck K, De Mulder K, Van Loocke W, Damle S, Waegemans E, De Medts J, Velghe I, De Smedt M, et al. 2016. GATA3 induces human T-cell commitment by restraining notch activity and repressing NK-cell fate. Nat Commun 7: 11171. 10.1038/ncomms11171 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walentin K, Hinze C, Werth M, Haase N, Varma S, Morell R, Aue A, Pötschke E, Warburton D, Qiu A, et al. 2015. A Grhl2-dependent gene network controls trophoblast branching morphogenesis. Development 142: 1125–1136. 10.1242/dev.113829 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei G, Abraham BJ, Yagi R, Jothi R, Cui K, Sharma S, Narlikar L, Northrup DL, Tang Q, Paul WE, et al. 2011. Genome-wide analyses of transcription factor GATA3-mediated gene regulation in distinct T cell types. Immunity 35: 299–311. 10.1016/j.immuni.2011.08.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wolf G, de Iaco A, Sun MA, Bruno M, Tinkham M, Hoang D, Mitra A, Ralls S, Trono D, Macfarlan TS. 2020. KRAB-zinc finger protein gene expansion in response to active retrotransposons in the murine lineage. eLife 9: e56337. 10.7554/eLife.56337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xie J, Xu Y, Wan L, Wang P, Wang M, Dong M. 2018. Involvement of follistatin-like 3 in preeclampsia. Biochem Biophys Res Commun 506: 692–697. 10.1016/j.bbrc.2018.10.139 [DOI] [PubMed] [Google Scholar]
- Yates AD, Achuthan P, Akanni W, Allen J, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Azov AG, Bennett R, et al. 2020. Ensembl 2020. Nucleic Acids Res 48: D682–D688. 10.1093/nar/gkz966 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu Y, He JH, Hu LL, Jiang LL, Fang L, Yao GD, Wang SJ, Yang Q, Guo Y, Liu L, et al. 2020. Placensin is a glucogenic hormone secreted by human placenta. EMBO Rep 21: e49530. 10.15252/embr.201949530 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, et al. 2008. Model-based Analysis of ChIP-Seq (MACS). Genome Biol 9: R137. 10.1186/gb-2008-9-9-r137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. 2019. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun 10: 1523. 10.1038/s41467-019-09234-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





