Abstract
Background
The endothelial-to-hematopoietic transition (EHT) process during definitive hematopoiesis is highly conserved in vertebrates. Stage-specific expression of transposable elements (TEs) has been detected during zebrafish EHT and may promote hematopoietic stem cell (HSC) formation by activating inflammatory signaling. However, little is known about how TEs contribute to the EHT process in human and mouse.
Results
We reconstructed the single-cell EHT trajectories of human and mouse and resolved the dynamic expression patterns of TEs during EHT. Most TEs presented a transient co-upregulation pattern along the conserved EHT trajectories, coinciding with the temporal relaxation of epigenetic silencing systems. TE products can be sensed by multiple pattern recognition receptors, triggering inflammatory signaling to facilitate HSC emergence. Interestingly, we observed that hypoxia-related signals were enriched in cells with higher TE expression. Furthermore, we constructed the hematopoietic cis-regulatory network of accessible TEs and identified potential TE-derived enhancers that may boost the expression of specific EHT marker genes.
Conclusions
Our study provides a systematic vision of how TEs are dynamically controlled to promote the hematopoietic fate decisions through transcriptional and cis-regulatory networks, and pre-train the immunity of nascent HSCs.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12915-024-01939-5.
Keywords: Endothelial-to-hematopoietic transition, Transposable element, Hematopoietic stem cell, Inflammatory signaling, Cis-regulatory element, Hypoxia
Background
HSCs are pluripotent cells that possess the ability of self-renewal and lineage differentiation to maintain lifelong hematopoiesis. In embryos, HSCs arise from endothelial cells (ECs) with hematopoietic potential in the aorta-gonad-mesonephro (AGM) region, as tracked by time-lapse imaging [1–5]. The EHT process is highly conserved across vertebrate embryos [6], which occurs at embryonic day (E) 10.5–11.5 in mouse [7, 8] and at Carnegie stage 13–17 (4–6 weeks) in human [9, 10]. A sub-cluster of arterial endothelial cells (AECs) is found to undergo fate decisions to become hematopoietic endothelial cells (HECs) [11–13]. Furthermore, at least two types of HSC precursors (distinguished by CD45 expression) have been identified in mouse intra-aortic hematopoietic clusters (IAHCs) [14–17]. Single-cell technologies have expanded our understanding of cellular heterogeneity and complex relationships during developmental hematopoiesis [18, 19]. Utilizing single-cell RNA sequencing (scRNA-seq) and single-cell sequencing assay for transposase-accessible chromatin (scATAC-seq), the continuous EHT trajectory and an intermediate cell population proximal to HECs (termed pre-HECs) have been delineated in mouse [20, 21]. Recently, a signature gene set (RUNX1+HOXA9+MLLT3+MECOM+HLF+SPINK2+) was discovered that can distinguish human HSCs from other hematopoietic progenitor cells, providing the first single-cell landscape of HSC development from origination to maturation in the human embryo [22]. However, the comparability of cell types and conservation of marker genes in human and mouse EHT remain to be systematically investigated.
The EHT process is strictly regulated by multiple factors at the transcriptional and epigenetic levels [23, 24]. Transcription factors such as RUNX1, GFI1, and GATA2 play vital roles in HSC development [25–27]. Signaling pathways, including NOTCH, WNT, YAP, and VEGF, are also involved in HSC fate decisions [24]. Additionally, inflammatory signals [28, 29], such as interleukins (IL-1, IL-3, and IL-6) [30, 31], tumor necrosis factors (TNF), and interferon signals (IFN), have been highlighted to regulate HSC emergence [32, 33]. In the innate immune system, pattern recognition receptors (PRRs) such as Toll-like receptors (TLRs), RIG-I-like receptors (RLRs), NOD-like receptors (NLRs), and C-type lectin receptors (CLRs), are key activators of inflammatory responses [34]. Toll-like receptor 4 (TLR4) has been shown to regulate HSC formation by promoting NOTCH activity through MyD88-mediated NF-κB signaling [29]. A recent zebrafish study revealed that RLRs (RIG-I, MDA5, and LGP2) are involved in HSC formation through the activation of downstream inflammatory signaling, such as TNF receptor-associated factors (TRAFs) [35]. Typically, PRRs induce antiviral immune responses by recognizing nucleic acids produced by exogenous pathogens [36, 37]. During HSC formation, the ligands for PRRs are puzzling because the AGM region is supposed to be a sterile niche. However, abundant transposable elements (TEs) may provide endogenous nucleic acids for PRRs [38]. Interestingly, TE expression has been detected during zebrafish EHT and is demonstrated to affect HSC generation through the RLR pathway [35]. Nevertheless, the contributions of TEs to human and mouse EHT remain largely unknown.
TEs consist of retrotransposons and DNA transposons (DNAs). Retrotransposons are the most abundant TEs in human and mouse, whether long interspersed nuclear elements (LINEs), short interspersed nuclear elements (SINEs) or hominid SVAs (SINE-VNTR-Alu), and long terminal repeats (LTRs), which are also known as endogenous retroviruses (ERVs). TE jumping may induce genome instability and diseases [39, 40]. Therefore, various defense systems have evolved to domesticate TEs, such as chromatin modification, small RNA silencing, and post-transcriptional repression [41, 42]. In vertebrates, the Krüppel-associated box zinc finger protein (KRAB-ZFP) is a prominent silencing system that inhibits TEs through interactions with KAP1 to recruit DNA methyltransferases (DNMT), SETDB1, HP1, and the nucleosome remodeling deacetylase (NuRD) complex [41, 43]. The human silencing hub (HUSH) complex, coupled with the ATPase MORC2, deposits H3K9me3 on TEs. Although TEs are usually silenced, they can be activated in a temporary or tight fashion at both transcriptional and epigenetic levels to shape embryonic development [44, 45]. Single-cell studies have shown that TEs exhibit cell type-specific expression during gastrulation and organogenesis and participate in the dynamic regulation of pluripotency reprogramming and lineage differentiation [46–48]. Related evidence also suggests that TEs can contribute to hematopoietic regeneration and fate decisions [49–51].
In this study, we conducted a comprehensive survey of the genomic landscape and potential regulatory functions of TEs in human and mouse. Through a systematic single-cell investigation, we provide new insights into the contribution of TEs to the expression and regulatory landscape of EHT, which may shed light on the role of TEs in the context of stem cell development and other cell type transition systems.
Results
Widespread TEs harbor great regulatory potential in human and mouse
TEs in human and mouse genomes are classified into 4 classes: LINEs, SINEs (including hominid SINE-VNTR-Alu retrotransposons, SVAs), LTRs, and DNAs. These TEs are further categorized into 42 superfamilies (1176 families) and 41 superfamilies (1256 families) in human and mouse, respectively (Additional file 1: Table S1 and S2). TEs constitute approximately 46.38 and 41.76% of the human and mouse genomes (Fig. 1A, E; Additional file 1: Table S3 and S4), with the majority located in non-coding regions, such as intergenic regions, introns, and UTRs (Fig. 1B, F; Additional file 1: Table S5 and S6). Notably, TEs are less abundant near transcription start sites (TSS) and transcription termination sites (TTS) (Fig. 1C, G), potentially maintaining gene transcription specificity [52]. To investigate the regulatory potential of TEs, we assessed the overlap between TE superfamilies and CpG islands, as well as candidate cis-regulatory elements (cCREs) obtained from ENCODE-SCREEN [53]. The cCREs annotations include the genomic locations of promoter-like sites (PLS), proximal enhancer-like signatures (pELS), proximal enhancer-like signatures (pELS), distal enhancer-like signatures (dELS), CTCF signatures, and DNase-H3K4me3 signatures derived from ChIP-seq (H3K4me3, H3K27ac, and CTCF) and DNase-seq data. We employed BEDTools [54] to calculate the intersection between TE annotations and regulatory element coordinates. To enhance the annotation accuracy, we considered a TE to have a potential regulatory function if more than 50% of its length overlapped with a cCRE. In human, SINEs/SVAs contribute to more than 39% of CpG islands (Fig. 1D; Additional file 2: Table S1 and S7). While in mouse, SINEs overlap with only 1.47% of CpG islands, and LINEs (especially L1) and LTRs (ERV1 and ERVK) each contribute to more than 15% (Fig. 1H; Additional file 3: Table S1 and S7). Abundant CpG sites typically maintain TEs in a repressed state. Potentially, TEs can be activated and play a role in embryonic development through demethylation processes, such as epigenetic reprogramming [55, 56]. Among cCREs, a considerable proportion (36.39–57.55% in human and 21.34–42.95% in mouse) of pELS, dELS, CTCF, and DNase-H3K4me3 signatures intersect with TEs (Fig. 1D, H; Additional file 2: Table S2-S6; Additional file 3: Table S2-S6), suggesting that TEs may have evolved regulatory potential and may contribute to pluripotency and early embryogenesis at both transcriptional and epigenetic levels [57–61]. In this study, we focus on analyzing TE expression and chromatin accessibility during EHT to elucidate the potential regulatory mechanisms of TEs in HSC formation.
TEs are detected with low cell type specificity on the conserved EHT trajectories of human and mouse
By using the published scRNA-seq data of human and mouse AGM (Additional file 4: Table S1) [21, 22], we reconstructed the EHT trajectories (Fig. 2A, E ). To characterize the EHT cell types, we identified ECs and HSCs from the AGM UMAP, and annotated cell clusters by using marker genes of VECs (CDH5, NRP2, NR2F2), AECs (GJA4, HEY1, DLL4), pre-HECs (TMEM100, GJA5, EDN1), and HSCs (RUNX1, MYB, HLF) (Fig. 2B, F; Additional file 5: Fig. S1A-C, F and Fig. S2A-C, F). Identification of HECs in UMAP clusters is challenging, and therefore they were annotated based on the signatures of CDH5+ RUNX1+ MYCN+ PTPRC− (Fig. 2B, F; Additional file 5: Fig. S1D-F and Fig. S2D-F). To confirm the consistency of EHT cell types between the two species, we integrated the human and mouse EHT data based on the shared homologous genes. The results showed that the EHT cell types were highly mapped with each other and the majority of EHT markers were conserved (Additional file 5: Fig. S3A, B). For instance, ACE is positive, CD44 is low, and KIT is negative in both human and mouse pre-HECs [20, 62]. However, IL33 and SPINK2 are only expressed in human pre-HECs and HSCs, respectively, while in mouse, Ikzf2 is more enriched in HECs and HSCs (Additional file 5: Fig. S3D).
To explore the TE dynamics on the EHT trajectories, we computed the family-level TE expression using scTE [48]. In scTE, reads mapping to any TE copy in the genome are collapsed to a single TE subtype, reducing errors in multi-mapped read allocation. The differential expression analysis showed that the overall average log2FC values of TEs were relatively low. We selected 0.25, which is close to the median of average log2FC values, as the threshold to obtain relatively more differentially expressed TEs (named marker TEs). Finally, we identified 214 and 96 marker TEs in human and mouse EHT clusters (average log2FC≥0.25 and adjusted P-value≤0.05), respectively. Among these, 198 TEs (92.52%) in human were enriched in pre-HECs, while 72 TEs (75%) in mouse belonged to AECs and pre-HECs (Additional file 6: Table S1 and Table S2). Notably, in human EHT, HERV15-int and HERVK-int appeared to be enriched in pre-HECs and HECs, whereas in mouse EHT, IAPEY4-I-int and RMER20B were highly expressed in HSCs (Fig. 2B, F). We calculated marker TE specificity as the difference between the percentage expression of a TE in a cluster and its percentage expression in the remaining clusters. In cell types where TEs were expressed, most of them show low cell type specificities (Fig. 2C, G). The marker TEs that showed relatively higher specificities (percent difference>25%) are mostly LTRs or ERVs, which is consistent in both human and mouse. The top marker TEs for each cell type of human and mouse EHT were displayed in Fig. 2D, H.
TEs form a distinguished co-upregulation pattern during human pre-HEC and mouse AEC/pre-HEC specification
To identify modules that may participate in common regulatory processes during EHT, we performed co-expression network analysis using hdWGCNA [63]. This approach allows us to identify groups of genes and transposable elements (TEs) that exhibit similar expression patterns across different cell types during EHT. A total of 988 filtered TEs, 528 marker genes (average log2FC≥0.5 and adjusted P-value≤0.05) in human and 864 filtered TEs, 421 marker genes in mouse were included for co-expression analysis (Additional file 6: Table S3 and S4). Those selected genes and TEs were clustered into 5 modules (HME1-5 and MME1-5) in both human and mouse, in which HME1-4 (MME1-4) were enriched in VECs, AECs, pre-HECs, and HECs/HSCs, respectively (Fig. 3A–D; Additional file 7: Table S1 and S2). The top 10 hub genes/TEs of each module are listed in Fig. 3B and D. Conserved hub genes, such as GJA5 and TMEM100 in pre-HECs, and MYB, SPI1, and CORO1A in HSCs, were found in both human and mouse. Notably, most of the filtered TEs (645 of 670 in human and 659 of 699 in mouse) were clustered in HME5 and MME5 (Fig. 3B, D). These TEs were expressed throughout the entire EHT peaking in human pre-HECs. In mouse samples, TEs were significantly upregulated at the AEC stage, continuing into the pre-HEC stage. Although LTRs accounted for the largest proportion of TEs in HME5 and MME5 (39.84 and 56.90% in human and mouse, respectively), their module connectivity scores (kME scores) were relatively low (Fig. 3E, F). The kME score, defined as the correlation between the expression profile of a feature and the module in a co-expression network by WGCNA [64], measures how strongly a TE is connected to a particular module. Therefore, the lower kME scores of LTRs indicate that they may not be the hub nodes of the network. In contrast, SINEs and LINEs exhibited higher kME scores in both HME5 and MME5, suggesting their potential role as core regulators in the network. Among TEs with kME scores greater than 0.3, L1 was the most abundant in both human and mouse (94 in 190 and 68 in 179) (Additional file 7: Table S1 and S2). Furthermore, 260 common TEs in human and mouse are found to be enriched in HME5 and MME5 (Additional file 7: Table S3). Some top-ranked (according to kME scores) common TEs (L1, L2, and MIR) and species-specific TEs (Alu and mouse-specific B2 and B4) were displayed in Fig. 3G, H.
TE silencers are transiently relaxed in human pre-HECs and mouse AECs/pre-HECs
Differential expression analysis revealed pervasive upregulation of TEs in pre-HECs against VECs and HSCs (Fig. 4A, B; Additional file 8: Table S1-S4). However, in mouse, TE activation was already observed in AECs, possibly due to the presence of more pre-HEC-primed cells (Fig. 3D). As TEs are normally repressed, their transient activation in pre-HECs is likely due to the downregulation of TE silencers. We therefore calculated the relative expression (i.e., module score) of each TE silencing module during EHT using AddModuleScore in Seurat [65]. The majority of TE silencers were downregulated in pre-HECs compared to VECs and HSCs (for example, 84.07 and 77.49% of KRAB-ZFPs in human and mouse, respectively) (Fig. 4C, F; Additional file 8: Table S5 and S6; KRAB-ZFP genes are available from [66]). Among the downregulated KRAB-ZFPs, ZNF84, ZNF382, and ZNF429 were found to bind significantly to L1 superfamily in human [67]. In embryonic stem cells, ZNF91 and ZNF93 can respectively repress SVAs and L1 in human [68], while Zfp932 regulates ERVK in mouse. ZNF268, ZNF300, and ZNF589 were shown to be related to hematopoietic differentiation [69]. Co-factors recruited by KRAB-ZFPs (such as TRIM28, CBX3, and SETDB1) and TE silencers closely related to KRAB-ZFPs (such as DNMTs and NuRD complex), also exhibited relatively low expression levels in pre-HECs (Fig. 4C, E, F and H). Additionally, the HUSH complex (HUSHs), P-element induced Wimpy testis-related genes (PIWIs), and other TE silencers were also expressed relatively lower in pre-HECs than in other cell types, despite overall low expression levels (Fig. 4C, F). These findings suggested that various TE silencers were relaxed by specific mechanisms, leading to transient TE activation during pre-HEC specification (Fig. 4D, G). Interestingly, these TE silencers were upregulated after the pre-HEC stage to re-suppress TE activity, explaining why some members of the DNMT complex (e.g., DNMT1 [70] and EZH2 [71]) and NuRD complex (e.g., HDAC1 and HDAC2 [72]) are required for HSC formation [73].
TE product sensing facilitates immune activation during HSC orientation
As a main source of endogenous nucleic acids [38, 41], TE products likely activate downstream inflammatory signaling pathways through PRRs. We identified upregulation of a large number of RNA and DNA sensors both in human and mouse HECs/HSCs (Fig. 5A, E; Additional file 9: Table S1 and S2). In human HECs/HSCs, IFIH1 (MDA5) and DDX58 (RIG-I) of RLRs, as well as NLRP1, NLRP2, NLRC3, and NLRX1 of NLRs, were significantly upregulated. Although some TLRs also showed upregulation in HECs/HSCs, their expression levels were low in both species (expression percentage<10%). In addition, protein kinase R genes (PKRs), including EIF2A, EIF2AK1, EIF2AK2, and EIF2AK4, were significantly upregulated in both human and mouse HECs/HSCs. DNA sensors, such as cGAS/STING (TMEM173), were possibly activated by cDNA intermediates of retrotransposons [74, 75]. Typical downstream intermediates of RLRs and cGAS/STING, including MAVS, TRAF3, TBK1, IRF3, and NF-κB, showed highly conserved upregulation patterns (Fig. 5A, E; Additional file 9: Table S3 and S4). Functional enrichment analysis detected both interferon alpha (IFNα) and interferon gamma (IFNγ) response pathways in HECs/HSCs, confirming the activation of these PRRs (Additional file 9: Table S5 and S6). Interestingly, IFNAR1 and IFNGR2 consistently appeared to function earlier in both species (immediately after TE upregulation), while IFNAR2 and IFNGR1 showed complementary patterns (Additional file 9: Table S3 and S4). Gene set enrichment analysis (GSEA) on gene ontology (GO) revealed inflammatory signals (e.g., TNF and IL6) and immune response in HECs/HSCs (Fig. 5B, F; Additional file 9: Table S5 and S6). Taken together, we speculated that TE products (elevated in human pre-HECs or earlier in mouse AECs) could induce inflammatory signals through PRRs during EHT, and trigger immune response pathways to activate HSC progression (Fig. 5C, G and D, H ).
TE accessibility is dynamically controlled during EHT
To explore the potential cis-regulatory function of TEs on HSC origination, we systematically analyzed the scATAC-seq data in the E10.5 mouse AGM. Using cell types transferred from scRNA-seq data, a coherent EHT process was achieved based on scATAC-seq data (Fig. 6A; Additional file 10: Fig. S1A-D). The gene activities of EHT signatures obtained from scRNA-seq were well-fitted to the EHT clusters (Fig. 6B; Additional file 10: Fig. S1E). Next, we calculated the TE activities in each cell (reflecting the degree of TE accessibility) at the locus level. Differential accessibility analysis revealed that TEs were more accessible in pre-HECs compared to endothelial cells (Fig. 6C; Additional file 11: Table S1), aligning with the elevated TE expression in AECs/pre-HECs (Fig. 4A, B). A total of 148 differentially accessible TEs (DATEs, average log2FC≥0. 25 & adjusted P-value≤0.05) were enriched in pre-HECs (Fig. 6C), while few DATEs were identified between pre-HECs and HECs/HSCs (Additional file 11: Table S2). Notably, more differentially accessible peaks (DAPs) were found in pre-HECs compared to endothelial cells when analyzing all peaks, suggesting chromatin reprogramming in pre-HECs. Unlike the low specificity of TE expression in scRNA-seq data, a considerable number of cell type-specific open TEs were identified in scATAC-seq data (Fig. 6D; Additional file 11: Table S3). DNAs accounted for the least amount of these cell type-specific open TEs, and LINEs accounted for more in pre-HECs than in other cell types (Fig. 6E). As expected, the majority of open TEs overlapped with the distal enhancers annotated by ENCODE, suggesting their potential enhancer roles.
Differential accessibility analysis for all peaks identified 4230 cell type-specific DAPs, of which 2918 overlapped with TEs, named TEPs (Fig. 6F; Additional file 11: Table S4). The gene regions closest to these TEPs contained many EHT-associated signatures, such as Gja4 (AECs), Gja5, Edn1 (pre-HECs), and Gata2, Cd44, Runx1 (HECs/HSCs) (Additional file 10: Fig. S2; Additional file 11: Table S4). Gja5, a member of the connexin gene family, had elevated expression in mouse AECs and pre-HECs (Fig. 2B, F). The promoter of two Gja5 transcripts (ENSMUST00000072600 and ENSMUST00000132256, annotated in EPD [76]) was specifically more accessible in pre-HECs (Fig. 6G), while two upstream enhancer-like peaks showed high accessibility in both AECs and pre-HECs, potentially permitting earlier Gja5 expression in AECs. Two TEs (chr3:96937220-96937315:L2c and chr3:96954448-96954510:ID_B1) inside the Gja5 gene body may function as enhancers (termed eL2c and eID_B1) to promote Gja5 expression in pre-HECs. These TE regions are also annotated as ELSs in ENCODE. We applied Cicero [77] to predict the cis-co-accessibility networks (CCANs) among peaks detected near or inside Gja5. Cicero calculates a co-access score (representing the strength of co-accessibility) for each pair of peaks, and links with a co-access score lower than 0.4 are filtered to reduce false positives. Although the potential enhancer eID_B1 had the greatest increase in accessibility in pre-HECs, it was also open in AECs and may interact with the two upstream promoter-like regions (Fig. 6G). The potential enhancer eL2c was only opened in pre-HECs, consistent with the accessibility pattern of the proximal promoter, and thus could be more likely to cooperatively increase Gja5 expression. However, the reason for Gja5 upregulation in pre-HECs remains to be further explored, although a recent study indicated its importance for HSCs to dampen oxidative stress [78].
Cell type-specific accessible TEs shape the hematopoietic cis-regulatory networks
While some TEs have been found to act as enhancers driving the expression of hematopoietic-related genes, the co-regulation mode of these TEs remains unclear. To address this, we used Cicero to construct the CCANs of all cell type-specific DAPs (including TEPs and non-TEPs) by filtering out links with a co-access score less than 0.4 (Fig. 7A; Additional file 11: Table S5). Analysis of TE compositions of cell type-specific TEPs revealed that ID_B1 (Alu superfamily) was abundant in all cell types. The top-ranked TEs showed high consistency across cell clusters but were enriched to different motifs in different EHT stages (Fig. 7B; Additional file 11: Table S6), possibly related to the variation accumulated on different TE copies during evolution [79, 80]. Surprisingly, TEs were found to participate in shaping most cis-regulatory networks closely related to the EHT process. For example, TE-involved SOX and GATA binding sites were mostly open in VECs, while TE-involved RUNX binding sites gained increased accessibility in HECs/HSCs. A joint analysis of the enriched motifs and the corresponding transcription factors (TFs) (Fig. 7B, C) revealed that although KLF motifs (Klf7, Klf10 and Klf12) were active in AECs and later stages, these TFs were downregulated to control the developmental fate of AECs. SOX motifs significantly increased activity in AECs in advance, but the expression of TFs (Sox4, Sox6, Sox13, and Sox17) peaked after entering the pre-HEC stage. These motif regions may be cooperatively bound by other TFs in addition to the SOX family, as supported by a recent study showing NF-κB collaborating with IRF3 and other factors to promote nucleosome remodeling [81]. This suggests that when a specific TF is not expressed, its potential collaborators make it possible that the motif occupancy can still be detected. Likewise, Gata3 and Gata6 had higher motif activities in both pre-HECs and HECs/HSCs but were only highly expressed in pre-HECs. This dual regulation via motif binding activity and TF expression precisely shapes the lineage determination during EHT.
Prediction of cell type-specific TF-target networks based on interactions from TRRUST [82] revealed that some EHT signatures were involved downstream of TE-bound TFs (Additional file 10: Fig. S3; Additional file 11: Table S7), such as Kdr, Flt1 (VECs), Smad6, Vegfc (pre-HECs) and Kit, Ikzf1 (HECs/HSCs). GO enrichment analysis showed that these cis-regulatory networks shaped by cell type-specific TEs were enriched in various known functional modules during EHT (Fig. 7D; Additional file 11: Table S8).
The hypoxic AGM niche may be partially responsible for the transient TE activation preceding hematopoietic fate commitment
The downregulation of TE silencing systems in pre-HECs may be the main reason for the enhanced TE activity; however, the underlying mechanisms regulating these TE silencers remain unclear. The RNA velocity estimated by Velocyto [83] and scVelo [84] showed that more unspliced RNAs were found in pre-HECs, indicating a significantly high differentiation rate and lack of cell cycle activity (Fig. 8A, B). GSEA analysis showed that mRNA splicing, RNA catabolic process, and cell cycle were downregulated in the pre-HEC stage (Fig. 8C; Additional file 12: Table S1). The downregulation of oxidative phosphorylation and upregulation of lipid metabolic process suggested that pre-HECs may undergo metabolic reprogramming [62]. Notably, genes related to epithelial-to-mesenchymal transition (EMT) and response to hypoxia were upregulated in pre-HECs. Hypoxia has been shown to promote zebrafish HSC formation through hypoxia-inducible factors (HIFs; hif-1a and hif-2a) and Notch signaling [85], and to induce HSPC-like cells from human embryonic stem cells (hESCs) in vitro [86]. The IAHC cluster region in the mouse embryo (E10) was directly observed to be hypoxic by staining [87, 88]. Analysis of hypoxia-related genes in human EHT revealed high expression of EPAS1 (HIF2A) and HIF3A [89] in pre-HECs, while HIF1A was more expressed in VECs and AECs (Fig. 8D; Additional file 12: Table S2). Many hypoxia-induced downstream genes were also found to be enriched in pre-HECs, such as SLC2A3 [90], CXCL12/CXCR4 [91], NOTCH1, VEGFC, EDN1, MMP2/MMP14, GATA6, TGFB2, and THBS1. Spatial transcriptome analysis of human embryo (CS15) demonstrated enrichment of these hypoxia-induced genes in IAHCs (Fig. 8F). The expression patterns of hypoxia-induced genes were opposite to those of TE silencers (Fig. 8D, E), especially DNMT1 and UHRF1 (Fig. 8D, F). To investigate the relationship between hypoxia and TE activation, the TE expression landscape was recalculated in the human AGM dataset. The results indicated local hypoxic areas besides pre-HECs, including stromal cells, which also exhibited higher TE expression levels (Additional file 13: Fig. S1A-C). Few cell type-specific TEs were identified in the AGM region (Additional file 13: Fig. SD), consistent with findings during the EHT trajectory (Fig. 2C, G).
Furthermore, we included a comprehensive time-series RNA-seq study [92] examining the hypoxia response of human umbilical vein endothelial cells (HUVECs) to confirm the relationship between hypoxia and TE activation. TEs were found to be broadly activated after 12 h under extremely low oxygen concentrations (0.1 and 1%), while no significant upregulation was observed in groups with oxygen greater than 3% (Fig. 8G; Additional file 12: Table S3). Different TE classes showed similar upregulation patterns under hypoxia (Additional file 13: Fig. S2). Coincidentally, many TE silencers (e.g., KRAB-ZFP members ZNF268, ZNF300, ZNF589) were greatly downregulated after 12 h of hypoxic culture. HIF3A appeared more correlated with TE expression patterns than HIF1A and EPAS1 (Fig. 8G). Collectively, we hypothesized that the hypoxic AGM niche might induce transient TE activation by inhibiting the expression of TE silencers, which is postulated to be critical for the EHT process (Fig. 5E, J). Analysis of pre-HEC-specific markers in HUVEC data revealed that SOX17, HEY1, and HEY2 were not upregulated under hypoxia (Additional file 13: Fig. S2E, Fig. S3), suggesting distinct roles for these genes during pre-HEC specification.
Discussion
TEs are abundant in eukaryotic genomes and have evolved essential roles in transcriptional and epigenetic regulation [41, 42, 57, 59, 79]. Recent single-cell sequencing technologies have revealed the broad expression and crucial roles of TEs in developing embryos [46–48]. Although TEs are expressed during definitive hematopoiesis and HSC regeneration [35, 49, 50], the mechanisms of TE activation and their cis-regulatory roles during EHT remain to be investigated. In this work, we demonstrate how cells conservatively program the EHT process and drive HSC formation by dynamically regulating the expression and chromatin accessibility of TEs.
Leveraging the single-cell datasets of human and mouse AGM [21, 22], we reconstructed the EHT trajectories and presented the dynamic landscape of TE expression. Consistent with findings in zebrafish [35], only a few cell type-specific TEs (mainly LTRs) were identified during EHT (Fig. 2C, G). Among marker TEs in human, primate-restricted HERVK transcripts and HERVS71 have been reported to be abundant during human gastrulation [47]. We unexpectedly observed that many TEs were consistently upregulated during human pre-HEC and mouse AEC/pre-HEC specification (Fig. 3B, D; Fig. 4A, B). Coincidentally, TE silencing systems were at relatively low levels from AECs to pre-HECs (Fig. 4C–H), which could partially account for TE activation in this period. Interestingly, two RNA transferases METTL3 and METTL14, which can form nuclear complexes and control TE activity through m6A modification [93, 94], were also downregulated in pre-HECs.
Screening common PRRs revealed that many PRRs were upregulated in HECs/HSCs (Fig. 5A, E), suggesting their activation by TE products or other non-coding RNAs [35, 38]. Notably, TE activation and TE sensing are not synchronized, with TEs largely transcribed in human pre-HECs and mouse AECs/pre-HECs not activating PRRs and inflammatory signals until HECs/HSCs. A possible explanation might be that the RNA catabolic process and cell metabolism are quiescent in pre-HECs (Fig. 8B, C), which delays the TE sensing (Fig. 9). When cells are transdifferentiated from pre-HECs into HECs/HSCs, energy metabolism is reactivated, allowing them to respond to various TE products through PRRs. This conserved TE activation and sensing resembles a rehearsal mechanism during the EHT process, allowing nascent HSCs to learn antigen-like properties from endogenous nucleic acid repertoire (e.g., TEs) and complete a pluripotent immune activation.
TEs also exert genome regulatory functions through cis-regulatory elements, especially enhancers [57, 59, 61, 79, 80]. We observed cell type-specific dynamic accessible patterns of TEs (Fig. 6D) and identified two TE-associated enhancers, eL2c and eID_B1, with increased accessibility in pre-HECs and predicted interaction with the Gja5 promoter (Fig. 6G). Through motif prediction and TF expression analysis of cell type-specific TEPs, we recovered the TE-involved cis-regulatory networks during EHT (Fig. 7).
Evidence shows that environmental stress (including heat shock, oxidative, and chemotherapy) [49, 95] is one of the driving forces to induce TE activation. In our study, we inferred that the AGM hypoxic niche may be partially responsible for the activation of TEs in pre-HECs. Hypoxia in the AGM region (IAHC cluster) has already been observed and shown to promote HSC formation [85–88], and our study attributes this role in part to TE activation. Interestingly, stromal and epithelial cells in AGM also exhibited hypoxia and relatively high TE expression (Fig. 8F; Additional file 13: Fig. S1A), indicating that TE activation by hypoxia appears to be cell type insensitive. HIF3A seems to be more related to the TE expression, as evidenced by the upregulation of TEs in stromal and epithelial cells that do not express EPAS1 but express HIF3A (Additional file 13: Fig. S1B, E).
Conclusions
TEs are domesticated during the evolution of eukaryotic genomes and mediate the emergence of novel regulatory elements [41]. Our study extends the understanding of the potential upstream and downstream effects of TE transcription during EHT at the single-cell level and fills the gap in knowledge of TEs as cis-regulatory elements driving HSC development. Many TEs were found to be upregulated during human pre-HEC and mouse AEC/pre-HEC specification, coinciding with the downregulation of TE silencers. PRR-mediated TE product sensing and activation of inflammatory signaling were delayed until the HSC stage, which may be due to the metabolic reprogramming in pre-HECs. Analysis of scATAC-seq data revealed that dynamically accessible TEs shape the hematopoietic cis-regulatory network to coordinate the EHT process. We additionally reported that the hypoxic AGM niche may be partially responsible for the transient TE activation before hematopoietic fate commitment. Further investigations are required to confirm such a hypothesis. In summary, this study provides a systematical single-cell analysis to uncover how TEs, through dynamic expression and chromatin accessibility, orchestrate the EHT process and drive HSC formation. Our findings contribute to a better understanding of the regulatory roles of TEs in developmental hematopoiesis and provide a foundation for future research in this field.
Methods
TE coverage, distribution, and regulatory potential analysis
TE annotations, genomic annotations (intergenic, intron, 3′ UTR, 5′ UTR, CDS), and unmasked CpG islands of human (hg38) and mouse (mm10) were obtained from the UCSC Genome Browser database [96]. The intersection of TEs and gene structures was measured using BEDTools (v2.30.0) [54]. ChIPseeker (v1.34.1) [97] was used to visualize the TE distributions. The reference annotations of cCREs (including PLS, pELS, dELS, CTCF-only, and DNase-H3K4me3) were downloaded from ENCODE [53]. To enhance the annotation accuracy of TE regulatory potential, the overlapping between TE and cCRE should exceed 50% of the TE’s length. The heatmap representations were generated using the R package pheatmap (v1.0.12).
Single-cell RNA-seq data processing
Raw sequencing data of human and mouse AGM were downloaded from GEO with accession numbers GSE162950 [22] and GSE137117 [21]. The detailed information on samples used in this study can be found in Additional file 4: Table S1. Reads were mapped to the human (refdata-gex-GRCh38-2020-A) and mouse (refdata-gex-mm10-2020-A) reference genomes using CellRanger (v7.1.0). Seurat (v4.3.0) [65] was used to perform downstream analysis. Cells with less than 200 unique molecular identifiers (UMIs) or greater than 15% mitochondrial expression were removed and clusters with unusually low RNA features or counts were also filtered. SCTransform (v0.3.5) [98] was used to normalize the clean data. Batch effects were corrected by Harmony (v0.1.1) [99]. Marker genes were identified using FindAllMarkers with MAST [100]. Cell types were annotated according to the marker genes provided in [22] (Additional file 4). Integration of the human and mouse EHT data was achieved by Seurat CCA based on the shared homologous genes. The R package biomaRt (v2.54.1) [101] was used to map gene symbols of mouse to human.
Single-cell TE quantification and differential expression analysis
We applied scTE (v1.0) [48] to quantify the TE expression at the family level. To keep the consistency of the read counting results, we incorporated the same gene annotations as CellRanger and TE annotations from UCSC to build the genome indices. The count matrix of only LINEs, SINEs/SVAs, LTRs, and DNAs was kept and merged into the Seurat object. Cell type-specific marker TEs were identified using FindAllMarkers. Differential expression analysis of TEs was performed using FindMarkers.
Co-expression gene and TE module analysis
Cell type-specific marker genes (average log2FC≥0.5 and adjusted P-value≤0.05) and TEs counting more than 50 were extracted for co-expression analysis using hdWGCNA (v0.2.16) [63, 102], which extends the standard WGCNA [64] pipeline into scRNA-seq analysis. Genes and TEs with low expression (less than 50 cells with expression >0) across cells were filtered out. The single cells were first aggregated into pseudobulk (meta) cells to reduce the dropout effect. A similarity matrix was built by calculating the pairwise Pearson correlations between genes and TEs. The similarity matrix was transformed into an adjacency matrix by applying a soft-thresholding power to emphasize strong correlations. Hierarchical clustering was performed to identify modules of genes and TEs with similar expression patterns. The co-expression networks were visualized with UMAP. The kME score was computed based on module eigengene to measure how strongly a gene or TE is connected to a particular module within the network. The module scores of TEs with kME≥0.3 in HME5 and MME5 were calculated using AddModuleScore in Seurat.
TE silencing and sensing analysis
Genes related to TE silencing were collected from the literature [41, 43]. Potential KRAB-ZFP genes in human and mouse were obtained from [66]. The whole list of TE silencers analyzed in this study can be found in Additional file 8: Table S5 and S6. Genes associated with TE sensing (including PRRs and downstream intermediates) were extracted from publications [34, 36, 38, 74, 75]. TE sensing genes and inflammatory factors are listed in Additional file 9: Table S3 and S4. Differential expression analysis of genes between hematopoietic cells (HECs/HSCs) and endothelial cells (VECs/AECs) was performed using FindMarkers.
Functional enrichment analysis
Functional enrichment analysis was performed using clusterProfiler (v4.6.2) [103]. GO (biological process) and Molecular Signatures Database (hallmark gene sets) are included.
Single-cell ATAC-seq data processing
Raw sequencing data of mouse AGM (E10.5) was downloaded from GEO with accession GSE137115 [21]. Reads were mapped to the mouse reference genome (refdata-cellranger-arc-mm10-2020-A) using cellranger-atac (v2.1.0). Signac (v1.9.0) [104] was used to perform downstream analysis, including quality control, normalization, dimension reduction, and clustering. After estimating the gene activities, the cell types of scATAC-seq data were annotated through cross-modality integration and label transfer from scRNA-seq data using CCA [98]. The final cell types were corrected according to the gene activities of known EHT markers.
Single-cell TE accessibility estimation and differential accessible analysis
The count matrix of TEs was estimated using FeatureMatrix in Signac. Cell type-specific open TEs were identified by FindAllMarkers. Differentially accessible peaks and TEs between cell types were identified using FindMarkers. Each of the open TEs was assigned to the closest gene using ClosestFeature. TE-related differentially accessible peaks (Additional file 10: Fig. S2) were plotted on the mouse genome using karyoloteR (v1.24.0) [105].
CCAN construction
The cis-co-accessible peaks were identified using Cicero (v1.3.9) [77]. The links with the co-access score of more than 0.4 were extracted to construct the CCAN network, which was visualized in Cytoscape (v3.9.0) [106].
Motif enrichment and TF expression analysis
The motif enrichment analysis was performed in Signac. The motif position frequency matrices were from JASPAR [107]. Motifs enriched in TE-related differentially accessible peaks were found by FindMotifs. The motif activity was computed by chromVAR (v1.20.2) [108]. Active motifs were selected according to the expression of corresponding TFs from scRNA-seq data. The cell type-specific TF-target network (Additional file 10: Fig. S3) was constructed based on interactions from TRRUST [82]. The average expression of the target genes in the target cell type was required to be more than 0.25.
Spatial transcriptome data processing
Raw sequencing data of human AGM (CS15, sample 7) spatial transcriptome was downloaded from GEO with accession GSE162950 [22]. Reads were mapped to the human reference genome (refdata-gex-GRCh38-2020-A) using Space Ranger (v2.0.1).
HUVEC bulk RNA-seq data processing
Raw sequencing data of HUVECs against hypoxia was downloaded from SRA with accession PRJNA561635 [92]. The detailed information on samples used in this study can be found in Additional file 4: Table S2. We treated each sample as a single cell and thus can still use scTE to quantify TE and gene expression. The gene modules of pre-HEC markers (Additional file 13: Fig. S3) in HUVEC data were predicted using WGCNA [64].
Statistics
Statistical analysis was conducted in R (version 4.2.3). The differential expression testing was achieved with Seurat FindMarkers using Wilcoxon Rank Sum test and P-values were adjusted by Bonferroni correction method. GSEA was conducted in clusterProfiler using permutation test and P-values were adjusted by Benjamini-Hochberg method. The scRNA-seq and spatial transcriptome data for human AGM that were analyzed in this study are available from GEO (GSE162950) [22]. The scRNA-seq and scATAC-seq data for mouse AGM are available from GEO (GSE137117) [21]. The time series RNA-seq data for HUVEC are available from SRA (PRJNA561635) [92]. The detailed information on samples used in this study can be found in Additional file 4. All analysis pipelines, in-house scripts and files for reproducing the results in this study can be accessed at GitHub [109] (https://github.com/ventson/hscTE). We also provide a web interface (https://bis.zju.edu.cn/hscTE, implemented using UCSC Cell Browser [110]) to visualize TE and gene expression during human and mouse EHT. The multi-faceted display (including TEs, CpG, cCREs, peaks, and genome coverages) of mouse EHT scATAC-seq data is available from https://bis.zju.edu.cn/hscTE/jbrowse/?data=mouse, which is implemented by JBrowse [111].
Supplementary Information
Acknowledgements
The authors would like to thank members in Ming Chen’s lab for discussion and valuable suggestions.
Authors’ contributions
MC, HH, and WDL conceived and supervised the study. CF, RXT, and SGX processed the data and prepared the manuscript. YHC helped RNA velocity and trajectory analysis. SDL and XTH helped the spatial transcriptome analysis. YFC, YCZ, YJL, YMH, YSH, HP, and ZXW contributed to the analysis pipeline construction and modification. HYC, SLZ, and QYN helped the scATAC-data analysis. JYH provided constructive discussion and suggestions. CF, RXT, SGX, MC, HH, and WDL wrote and revised the manuscript with input from all the other authors. All authors read and approved the final manuscript.
Funding
This work was supported by the National Natural Sciences Foundation of China (32300532), National Key Research and Development Program of China (2023YFE0112300; 2018YFC0310602; 2016YFA0501704), the Science and Technology Innovation Leading Scientist (2022R52035), the 151 Talent Project of Zhejiang Province (first level), Postdoctoral Fellowship Program of CPSF (GZC20232322), Natural Science Foundation of Shanxi Province (201901D111381), and Doctor Foundation of The Second Hospital of Shanxi Medical University (201701-4).
Availability of data and materials
The scRNA-seq and spatial transcriptome data for human AGM that were analyzed in this study are available from GEO (GSE162950) [22]. The scRNA-seq and scATAC-seq data for mouse AGM are available from GEO (GSE137117) [21]. The time series RNA-seq data for HUVEC are available from SRA (PRJNA561635) [92]. The detailed information on samples used in this study can be found in Additional file 4 . All analysis pipelines, in-house scripts and files for reproducing the results in this study can be accessed at GitHub [109] (https://github.com/ventson/hscTE). We also provide a web interface (https://bis.zju.edu.cn/hscTE , implemented using UCSC Cell Browser [110]) to visualize TE and gene expression during human and mouse EHT. The multi-faceted display (including TEs, CpG, cCREs, peaks, and genome coverages) of mouse EHT scATAC-seq data is available from https://bis.zju.edu.cn/hscTE/jbrowse/?data=mouse , which is implemented by JBrowse [111].
Declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cong Feng, Ruxiu Tie and Saige Xin have contributed equally to this work.
Contributor Information
Wenda Luo, Email: luowd@enzemed.com.
He Huang, Email: hehuangyu@126.com.
Ming Chen, Email: mchen@zju.edu.cn.
References
- 1.Bertrand JY, Chi NC, Santoso B, Teng S, Stainier DY, Traver D. Haematopoietic stem cells derive directly from aortic endothelium during development. Nature. 2010;464(7285):108–111. doi: 10.1038/nature08738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Boisset JC, van Cappellen W, Andrieu-Soler C, Galjart N, Dzierzak E, Robin C. In vivo imaging of haematopoietic cells emerging from the mouse aortic endothelium. Nature. 2010;464(7285):116–120. doi: 10.1038/nature08764. [DOI] [PubMed] [Google Scholar]
- 3.Eilken HM, Nishikawa S, Schroeder T. Continuous single-cell imaging of blood generation from haemogenic endothelium. Nature. 2009;457(7231):896–900. doi: 10.1038/nature07760. [DOI] [PubMed] [Google Scholar]
- 4.Kissa K, Herbomel P. Blood stem cells emerge from aortic endothelium by a novel type of cell transition. Nature. 2010;464(7285):112–115. doi: 10.1038/nature08761. [DOI] [PubMed] [Google Scholar]
- 5.Lancrin C, Sroczynska P, Stephenson C, Allen T, Kouskoff V, Lacaud G. The haemangioblast generates haematopoietic cells through a haemogenic endothelium stage. Nature. 2009;457(7231):892–895. doi: 10.1038/nature07679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Jagannathan-Bogdan M, Zon LI. Hematopoiesis. Development. 2013;140(12):2463–2467. doi: 10.1242/dev.083147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.de Bruijn MF, Speck NA, Peeters MC, Dzierzak E. Definitive hematopoietic stem cells first develop within the major arterial regions of the mouse embryo. EMBO J. 2000;19(11):2465–2474. doi: 10.1093/emboj/19.11.2465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Medvinsky A, Dzierzak E. Definitive hematopoiesis is autonomously initiated by the AGM region. Cell. 1996;86(6):897–906. doi: 10.1016/S0092-8674(00)80165-8. [DOI] [PubMed] [Google Scholar]
- 9.Ivanovs A, Rybtsov S, Ng ES, Stanley EG, Elefanty AG, Medvinsky A. Human haematopoietic stem cell development: from the embryo to the dish. Development. 2017;144(13):2323–2337. doi: 10.1242/dev.134866. [DOI] [PubMed] [Google Scholar]
- 10.Ivanovs A, Rybtsov S, Welch L, Anderson RA, Turner ML, Medvinsky A. Highly potent human hematopoietic stem cells first emerge in the intraembryonic aorta-gonad-mesonephros region. J Exp Med. 2011;208(12):2417–2427. doi: 10.1084/jem.20111688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hou S, Li Z, Zheng X, Gao Y, Dong J, Ni Y, Wang X, Li Y, Ding X, Chang Z, et al. Embryonic endothelial evolution towards first hematopoietic stem cells revealed by single-cell transcriptomic and functional analyses. Cell Res. 2020;30(5):376–392. doi: 10.1038/s41422-020-0300-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhao S, Feng S, Tian Y, Wen Z. Hemogenic and aortic endothelium arise from a common hemogenic angioblast precursor and are specified by the Etv2 dosage. Proc Natl Acad Sci U S A. 2022;119(13):e2119051119. doi: 10.1073/pnas.2119051119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zeng Y, He J, Bai Z, Li Z, Gong Y, Liu C, Ni Y, Du J, Ma C, Bian L, et al. Tracing the first hematopoietic stem cell generation in human embryo by single-cell RNA sequencing. Cell Res. 2019;29(11):881–894. doi: 10.1038/s41422-019-0228-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Taoudi S, Gonneau C, Moore K, Sheridan JM, Blackburn CC, Taylor E, Medvinsky A. Extensive hematopoietic stem cell generation in the AGM region via maturation of VE-cadherin+CD45+ pre-definitive HSCs. Cell Stem Cell. 2008;3(1):99–108. doi: 10.1016/j.stem.2008.06.004. [DOI] [PubMed] [Google Scholar]
- 15.Zhou F, Li X, Wang W, Zhu P, Zhou J, He W, Ding M, Xiong F, Zheng X, Li Z, et al. Tracing haematopoietic stem cell formation at single-cell resolution. Nature. 2016;533(7604):487–492. doi: 10.1038/nature17997. [DOI] [PubMed] [Google Scholar]
- 16.Baron CS, Kester L, Klaus A, Boisset JC, Thambyrajah R, Yvernogeau L, Kouskoff V, Lacaud G, van Oudenaarden A, Robin C. Single-cell transcriptomics reveal the dynamic of haematopoietic stem cell production in the aorta. Nat Commun. 2018;9(1):2517. doi: 10.1038/s41467-018-04893-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Vink CS, Calero-Nieto FJ, Wang X, Maglitto A, Mariani SA, Jawaid W, Gottgens B, Dzierzak E. Iterative single-cell analyses define the transcriptome of the first functional hematopoietic stem cells. Cell Rep. 2020;31(6):107627. doi: 10.1016/j.celrep.2020.107627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ye F, Huang W, Guo G. Studying hematopoiesis using single-cell technologies. J Hematol Oncol. 2017;10(1):27. doi: 10.1186/s13045-017-0401-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhang P, Li X, Pan C, Zheng X, Hu B, Xie R, Hu J, Shang X, Yang H. Single-cell RNA sequencing to track novel perspectives in HSC heterogeneity. Stem Cell Res Ther. 2022;13(1):39. doi: 10.1186/s13287-022-02718-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Fadlullah MZH, Neo WH, Lie ALM, Thambyrajah R, Patel R, Mevel R, Aksoy I, Do Khoa N, Savatier P, Fontenille L, et al. Murine AGM single-cell profiling identifies a continuum of hemogenic endothelium differentiation marked by ACE. Blood. 2022;139(3):343–356. doi: 10.1182/blood.2020007885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhu Q, Gao P, Tober J, Bennett L, Chen C, Uzun Y, Li Y, Howell ED, Mumau M, Yu W, et al. Developmental trajectory of prehematopoietic stem cell formation from endothelium. Blood. 2020;136(7):845–856. doi: 10.1182/blood.2020004801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Calvanese V, Capellera-Garcia S, Ma F, Fares I, Liebscher S, Ng ES, Ekstrand S, Aguade-Gorgorio J, Vavilina A, Lefaudeux D, et al. Mapping human haematopoietic stem cells from haemogenic endothelium to birth. Nature. 2022;604(7906):534–540. doi: 10.1038/s41586-022-04571-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Orkin SH, Zon LI. Hematopoiesis: an evolving paradigm for stem cell biology. Cell. 2008;132(4):631–644. doi: 10.1016/j.cell.2008.01.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ding Y, Liu Z, Liu F. Transcriptional and epigenetic control of hematopoietic stem cell fate decisions in vertebrates. Dev Biol. 2021;475:156–164. doi: 10.1016/j.ydbio.2021.03.003. [DOI] [PubMed] [Google Scholar]
- 25.de Bruijn M, Dzierzak E. Runx transcription factors in the development and function of the definitive hematopoietic system. Blood. 2017;129(15):2061–2069. doi: 10.1182/blood-2016-12-689109. [DOI] [PubMed] [Google Scholar]
- 26.Katsumura KR, Bresnick EH, Group GFM The GATA factor revolution in hematology. Blood. 2017;129(15):2092–2102. doi: 10.1182/blood-2016-09-687871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Thambyrajah R, Mazan M, Patel R, Moignard V, Stefanska M, Marinopoulou E, Li Y, Lancrin C, Clapes T, Moroy T, et al. GFI1 proteins orchestrate the emergence of haematopoietic stem cells through recruitment of LSD1. Nat Cell Biol. 2016;18(1):21–32. doi: 10.1038/ncb3276. [DOI] [PubMed] [Google Scholar]
- 28.Clapes T, Lefkopoulos S, Trompouki E. Stress and non-stress roles of inflammatory signals during HSC emergence and maintenance. Front Immunol. 2016;7:487. doi: 10.3389/fimmu.2016.00487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.He Q, Zhang C, Wang L, Zhang P, Ma D, Lv J, Liu F. Inflammatory signaling regulates hematopoietic stem and progenitor cell emergence in vertebrates. Blood. 2015;125(7):1098–1106. doi: 10.1182/blood-2014-09-601542. [DOI] [PubMed] [Google Scholar]
- 30.Collins A, Mitchell CA, Passegue E. Inflammatory signaling regulates hematopoietic stem and progenitor cell development and homeostasis. J Exp Med. 2021; 218(7):e20201545. [DOI] [PMC free article] [PubMed]
- 31.Tie R, Li H, Cai S, Liang Z, Shan W, Wang B, Tan Y, Zheng W, Huang H. Interleukin-6 signaling regulates hematopoietic stem cell emergence. Exp Mol Med. 2019;51(10):1–12. doi: 10.1038/s12276-019-0320-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Espin-Palazon R, Stachura DL, Campbell CA, Garcia-Moreno D, Del Cid N, Kim AD, Candel S, Meseguer J, Mulero V, Traver D. Proinflammatory signaling regulates hematopoietic stem cell emergence. Cell. 2014;159(5):1070–1085. doi: 10.1016/j.cell.2014.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li Y, Esain V, Teng L, Xu J, Kwan W, Frost IM, Yzaguirre AD, Cai X, Cortes M, Maijenburg MW, et al. Inflammatory signaling regulates embryonic hematopoietic stem and progenitor cell production. Genes Dev. 2014;28(23):2597–2612. doi: 10.1101/gad.253302.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Takeuchi O, Akira S. Pattern recognition receptors and inflammation. Cell. 2010;140(6):805–820. doi: 10.1016/j.cell.2010.01.022. [DOI] [PubMed] [Google Scholar]
- 35.Lefkopoulos S, Polyzou A, Derecka M, Bergo V, Clapes T, Cauchy P, Jerez-Longres C, Onishi-Seebacher M, Yin N, Martagon-Calderon NA, et al. Repetitive elements trigger RIG-I-like receptor signaling that regulates the emergence of hematopoietic stem and progenitor cells. Immunity. 2020;53(5):934–951 e939. doi: 10.1016/j.immuni.2020.10.007. [DOI] [PubMed] [Google Scholar]
- 36.Bowie AG, Unterholzner L. Viral evasion and subversion of pattern-recognition receptor signalling. Nat Rev Immunol. 2008;8(12):911–922. doi: 10.1038/nri2436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kowalinski E, Lunardi T, McCarthy AA, Louber J, Brunel J, Grigorov B, Gerlier D, Cusack S. Structural basis for the activation of innate immune pattern-recognition receptor RIG-I by viral RNA. Cell. 2011;147(2):423–435. doi: 10.1016/j.cell.2011.09.039. [DOI] [PubMed] [Google Scholar]
- 38.Chen YG, Hur S. Cellular origins of dsRNA, their recognition and consequences. Nat Rev Mol Cell Biol. 2022;23(4):286–301. doi: 10.1038/s41580-021-00430-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Burns KH. Repetitive DNA in disease. Science. 2022;376(6591):353–354. doi: 10.1126/science.abl7399. [DOI] [PubMed] [Google Scholar]
- 40.Payer LM, Burns KH. Transposable elements in human genetic disease. Nat Rev Genet. 2019;20(12):760–772. doi: 10.1038/s41576-019-0165-8. [DOI] [PubMed] [Google Scholar]
- 41.Almeida MV, Vernaz G, Putman ALK, Miska EA. Taming transposable elements in vertebrates: from epigenetic silencing to domestication. Trends Genet. 2022;38(6):529–553. doi: 10.1016/j.tig.2022.02.009. [DOI] [PubMed] [Google Scholar]
- 42.Senft AD, Macfarlan TS. Transposable elements shape the evolution of mammalian development. Nat Rev Genet. 2021;22(11):691–711. doi: 10.1038/s41576-021-00385-1. [DOI] [PubMed] [Google Scholar]
- 43.Ecco G, Imbeault M, Trono D. KRAB zinc finger proteins. Development. 2017;144(15):2719–2729. doi: 10.1242/dev.132605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hutchins AP, Pei D. Transposable elements at the center of the crossroads between embryogenesis, embryonic stem cells, reprogramming, and long non-coding RNAs. Sci Bull (Beijing) 2015;60(20):1722–1733. doi: 10.1007/s11434-015-0905-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Low Y, Tan DEK, Hu Z, Tan SYX, Tee WW. Transposable element dynamics and regulation during zygotic genome activation in mammalian embryos and embryonic stem cell model systems. Stem Cells Int. 2021;2021:1624669. doi: 10.1155/2021/1624669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Chang NC, Rovira Q, Wells J, Feschotte C, Vaquerizas JM. Zebrafish transposable elements show extensive diversification in age, genomic distribution, and developmental expression. Genome Res. 2022;32(7):1408–1423. doi: 10.1101/gr.275655.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pontis J, Pulver C, Playfoot CJ, Planet E, Grun D, Offner S, Duc J, Manfrin A, Lutolf MP, Trono D. Primate-specific transposable elements shape transcriptional networks during human development. Nat Commun. 2022;13(1):7178. doi: 10.1038/s41467-022-34800-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.He J, Babarinde IA, Sun L, Xu S, Chen R, Shi J, Wei Y, Li Y, Ma G, Zhuang Q, et al. Identifying transposable element expression dynamics and heterogeneity during development at the single-cell level with a processing pipeline scTE. Nat Commun. 2021;12(1):1456. doi: 10.1038/s41467-021-21808-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Clapes T, Polyzou A, Prater P, Sagar, Morales-Hernandez A, Ferrarini MG, Kehrer N, Lefkopoulos S, Bergo V, Hummel B, et al. Chemotherapy-induced transposable elements activate MDA5 to enhance haematopoietic regeneration. Nat Cell Biol. 2021;23(7):704–717. doi: 10.1038/s41556-021-00707-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Clapes T, Trompouki E. Hematopoietic regeneration under the spell of epigenetic-epitranscriptomic factors and transposable elements. Curr Opin Hematol. 2020;27(4):264–272. doi: 10.1097/MOH.0000000000000585. [DOI] [PubMed] [Google Scholar]
- 51.Ye M, Goudot C, Hoyler T, Lemoine B, Amigorena S, Zueva E. Specific subfamilies of transposable elements contribute to different domains of T lymphocyte enhancers. Proc Natl Acad Sci U S A. 2020;117(14):7905–7916. doi: 10.1073/pnas.1912008117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Feng C, Dai M, Liu Y, Chen M. Sequence repetitiveness quantification and de novo repeat detection by weighted k-mer coverage. Brief Bioinform. 2021;22(3):bbaa086. [DOI] [PubMed]
- 53.Moore JE, Purcaro MJ, Pratt HE, Epstein CB, Shoresh N, Adrian J, Kawli T, Davis CA, Dobin A, Kaul R, et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020;583(7818):699–710. doi: 10.1038/s41586-020-2493-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Walter M, Teissandier A, Perez-Palacios R, Bourc'his D. An epigenetic switch ensures transposon repression upon dynamic loss of DNA methylation in embryonic stem cells. Elife. 2016;5:e11418. [DOI] [PMC free article] [PubMed]
- 56.Zhou W, Liang G, Molloy PL, Jones PA. DNA methylation enables transposable element-driven genome expansion. Proc Natl Acad Sci U S A. 2020;117(32):19359–19366. doi: 10.1073/pnas.1921719117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.de Souza FS, Franchini LF, Rubinstein M. Exaptation of transposable elements into novel cis-regulatory elements: is the evidence always strong? Mol Biol Evol. 2013;30(6):1239–1251. doi: 10.1093/molbev/mst045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lee HJ, Hou Y, Maeng JH, Shah NM, Chen Y, Lawson HA, Yang H, Yue F, Wang T. Epigenomic analysis reveals prevalent contribution of transposable elements to cis-regulatory elements, tissue-specific expression, and alternative promoters in zebrafish. Genome Res. 2022;32(7):1424–1436. doi: 10.1101/gr.276052.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Sundaram V, Wysocka J. Transposable elements as a potent source of diverse cis-regulatory sequences in mammalian genomes. Philos Trans R Soc Lond B Biol Sci. 2020;375(1795):20190347. doi: 10.1098/rstb.2019.0347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Todd CD, Deniz O, Taylor D, Branco MR. Functional evaluation of transposable elements as enhancers in mouse embryonic and trophoblast stem cells. Elife. 2019;8:e44344. [DOI] [PMC free article] [PubMed]
- 61.Andrews G, Fan K, Pratt HE, Phalke N, Zoonomia Consortium section s. Karlsson EK, Lindblad-Toh K, Gazal S, Moore JE, Weng Z, et al. Mammalian evolution of human cis-regulatory elements and transcription factor binding sites. Science. 2023;380(6643):eabn7930. doi: 10.1126/science.abn7930. [DOI] [PubMed] [Google Scholar]
- 62.Oatley M, Bolukbasi OV, Svensson V, Shvartsman M, Ganter K, Zirngibl K, Pavlovich PV, Milchevskaya V, Foteva V, Natarajan KN, et al. Single-cell transcriptomics identifies CD44 as a marker and regulator of endothelial to haematopoietic transition. Nat Commun. 2020;11(1):586. doi: 10.1038/s41467-019-14171-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Morabito S, Reese F, Rahimzadeh N, Miyoshi E, Swarup V. hdWGCNA identifies co-expression networks in high-dimensional transcriptomics data. Cell Rep Methods. 2023;3(6):100498. doi: 10.1016/j.crmeth.2023.100498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. doi: 10.1186/1471-2105-9-559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Hao Y, Hao S, Andersen-Nissen E, Mauck WM, 3rd, Zheng S, Butler A, Lee MJ, Wilk AJ, Darby C, Zager M, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184(13):3573–3587 e3529. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Zhang Y, He F, Zhang Y, Dai Q, Li Q, Nan J, Miao R, Cheng B. Exploration of the regulatory relationship between KRAB-Zfp clusters and their target transposable elements via a gene editing strategy at the cluster specific linker-associated sequences by CRISPR-Cas9. Mob DNA. 2022;13(1):25. doi: 10.1186/s13100-022-00279-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Imbeault M, Helleboid PY, Trono D. KRAB zinc-finger proteins contribute to the evolution of gene regulatory networks. Nature. 2017;543(7646):550–554. doi: 10.1038/nature21683. [DOI] [PubMed] [Google Scholar]
- 68.Jacobs FM, Greenberg D, Nguyen N, Haeussler M, Ewing AD, Katzman S, Paten B, Salama SR, Haussler D. An evolutionary arms race between KRAB zinc-finger genes ZNF91/93 and SVA/L1 retrotransposons. Nature. 2014;516(7530):242–245. doi: 10.1038/nature13760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ecco G, Cassano M, Kauzlaric A, Duc J, Coluccio A, Offner S, Imbeault M, Rowe HM, Turelli P, Trono D. Transposable elements and their KRAB-ZFP controllers regulate gene expression in adult tissues. Dev Cell. 2016;36(6):611–623. doi: 10.1016/j.devcel.2016.02.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Li Y, Tang C, Liu F, Zhu C, Liu F, Zhu P, Wang L. DNA methylation safeguards the generation of hematopoietic stem and progenitor cells by repression of Notch signaling. Development. 2022;149(10):dev200390. [DOI] [PMC free article] [PubMed]
- 71.Soto RA, Najia MAT, Hachimi M, Frame JM, Yette GA, Lummertz da Rocha E, Stankunas K, Daley GQ, North TE. Sequential regulation of hemogenic fate and hematopoietic stem and progenitor cell formation from arterial endothelium by Ezh1/2. Stem Cell Rep. 2021;16(7):1718–1734. doi: 10.1016/j.stemcr.2021.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Thambyrajah R, Fadlullah MZH, Proffitt M, Patel R, Cowley SM, Kouskoff V, Lacaud G. HDAC1 and HDAC2 modulate TGF-beta signaling during endothelial-to-hematopoietic transition. Stem Cell Rep. 2018;10(4):1369–1383. doi: 10.1016/j.stemcr.2018.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Kasper DM, Nicoli S. Epigenetic and epitranscriptomic factors make a mark on hematopoietic stem cell development. Curr Stem Cell Rep. 2018;4(1):22–32. doi: 10.1007/s40778-018-0113-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Gazquez-Gutierrez A, Witteveldt J. S RH, Macias S: Sensing of transposable elements by the antiviral innate immune system. RNA. 2021;27(7):735–752. doi: 10.1261/rna.078721.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Yang Y, Huang Y, Zeng Z. Advances in cGAS-STING signaling pathway and diseases. Front Cell Dev Biol. 2022;10:800393. doi: 10.3389/fcell.2022.800393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Meylan P, Dreos R, Ambrosini G, Groux R, Bucher P. EPD in 2020: enhanced data visualization and extension to ncRNA promoters. Nucleic Acids Res. 2020;48(D1):D65–D69. doi: 10.1093/nar/gkz1014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Pliner HA, Packer JS, McFaline-Figueroa JL, Cusanovich DA, Daza RM, Aghamirzaie D, Srivatsan S, Qiu X, Jackson D, Minkina A, et al. Cicero predicts cis-regulatory DNA interactions from single-cell chromatin accessibility data. Mol Cell. 2018;71(5):858–871 e858. doi: 10.1016/j.molcel.2018.06.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Cacialli P, Mahony CB, Petzold T, Bordignon P, Rougemont AL, Bertrand JY. A connexin/ifi30 pathway bridges HSCs with their niche to dampen oxidative stress. Nat Commun. 2021;12(1):4484. doi: 10.1038/s41467-021-24831-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Feschotte C. Transposable elements and the evolution of regulatory networks. Nat Rev Genet. 2008;9(5):397–405. doi: 10.1038/nrg2337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Floc'hlay S, Wong E, Zhao B, Viales RR, Thomas-Chollier M, Thieffry D, Garfield DA, Furlong EEM. Cis-acting variation is common across regulatory layers but is often buffered during embryonic development. Genome Res. 2020;31(2):211–224. doi: 10.1101/gr.266338.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Feng AC, Thomas BJ, Purbey PK, de Melo FM, Liu X, Daly AE, Sun F, Lo JH, Cheng L, Carey MF et al. The transcription factor NF-kappaB orchestrates nucleosome remodeling during the primary response to Toll-like receptor 4 signaling. Immunity. 2024;57(3):462–477 e9. [DOI] [PMC free article] [PubMed]
- 82.Han H, Cho JW, Lee S, Yun A, Kim H, Bae D, Yang S, Kim CY, Lee M, Kim E, et al. TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions. Nucleic Acids Res. 2018;46(D1):D380–D386. doi: 10.1093/nar/gkx1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.La Manno G, Soldatov R, Zeisel A, Braun E, Hochgerner H, Petukhov V, Lidschreiber K, Kastriti ME, Lonnerberg P, Furlan A, et al. RNA velocity of single cells. Nature. 2018;560(7719):494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat Biotechnol. 2020;38(12):1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]
- 85.Gerri C, Marass M, Rossi A, Stainier DYR. Hif-1alpha and Hif-2alpha regulate hemogenic endothelium and hematopoietic stem cell formation in zebrafish. Blood. 2018;131(9):963–973. doi: 10.1182/blood-2017-07-797795. [DOI] [PubMed] [Google Scholar]
- 86.Wang N, Chen C, Cheng Y, Fu Y, Zhong Z, Yang Y, Lv L, Chen H, Huang J, Duan Y. Hypoxia drives hematopoiesis with the enhancement of T lineage through eliciting arterial specification of hematopoietic endothelial progenitors from hESC. Stem Cell Res Ther. 2022;13(1):282. doi: 10.1186/s13287-022-02967-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Imanirad P, Dzierzak E. Hypoxia and HIFs in regulating the development of the hematopoietic system. Blood Cells Mol Dis. 2013;51(4):256–263. doi: 10.1016/j.bcmd.2013.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Imanirad P, Solaimani Kartalaei P, Crisan M, Vink C, Yamada-Inagawa T, de Pater E, Kurek D, Kaimakis P, van der Linden R, Speck N, et al. HIF1alpha is a regulator of hematopoietic progenitor and stem cell development in hypoxic sites of the mouse embryo. Stem Cell Res. 2014;12(1):24–35. doi: 10.1016/j.scr.2013.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Jaskiewicz M, Moszynska A, Serocki M, Kroliczewski J, Bartoszewska S, Collawn JF, Bartoszewski R. Hypoxia-inducible factor (HIF)-3a2 serves as an endothelial cell fate executor during chronic hypoxia. EXCLI J. 2022;21:454–469. doi: 10.17179/excli2021-4622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Mimura I, Nangaku M, Kanki Y, Tsutsumi S, Inoue T, Kohro T, Yamamoto S, Fujita T, Shimamura T, Suehiro J, et al. Dynamic change of chromatin conformation in response to hypoxia enhances the expression of GLUT3 (SLC2A3) by cooperative interaction of hypoxia-inducible factor 1 and KDM3A. Mol Cell Biol. 2012;32(15):3018–3032. doi: 10.1128/MCB.06643-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Sugiyama T, Kohara H, Noda M, Nagasawa T. Maintenance of the hematopoietic stem cell pool by CXCL12-CXCR4 chemokine signaling in bone marrow stromal cell niches. Immunity. 2006;25(6):977–988. doi: 10.1016/j.immuni.2006.10.016. [DOI] [PubMed] [Google Scholar]
- 92.Klomp J, Hyun J, Klomp JE, Pajcini K, Rehman J, Malik AB. Comprehensive transcriptomic profiling reveals SOX7 as an early regulator of angiogenesis in hypoxic human endothelial cells. J Biol Chem. 2020;295(15):4796–4808. doi: 10.1074/jbc.RA119.011822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Chelmicki T, Roger E, Teissandier A, Dura M, Bonneville L, Rucli S, Dossin F, Fouassier C, Lameiras S, Bourc'his D. m(6)A RNA methylation regulates the fate of endogenous retroviruses. Nature. 2021;591(7849):312–316. doi: 10.1038/s41586-020-03135-1. [DOI] [PubMed] [Google Scholar]
- 94.Xiong F, Wang R, Lee JH, Li S, Chen SF, Liao Z, Hasani LA, Nguyen PT, Zhu X, Krakowiak J, et al. RNA m(6)A modification orchestrates a LINE-1-host interaction that facilitates retrotransposition and contributes to long gene vulnerability. Cell Res. 2021;31(8):861–885. doi: 10.1038/s41422-021-00515-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Villanueva-Canas JL, Horvath V, Aguilera L, Gonzalez J. Diverse families of transposable elements affect the transcriptional regulation of stress-response genes in Drosophila melanogaster. Nucleic Acids Res. 2019;47(13):6842–6857. doi: 10.1093/nar/gkz490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Haeussler M, Zweig AS, Tyner C, Speir ML, Rosenbloom KR, Raney BJ, Lee CM, Lee BT, Hinrichs AS, Gonzalez JN, et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res. 2019;47(D1):D853–D858. doi: 10.1093/nar/gky1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Yu G, Wang LG, He QY. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31(14):2382–2383. doi: 10.1093/bioinformatics/btv145. [DOI] [PubMed] [Google Scholar]
- 98.Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, 3rd, Hao Y, Stoeckius M, Smibert P, Satija R. Comprehensive integration of single-cell data. Cell. 2019;177(7):1888–1902 e1821. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K, Baglaenko Y, Brenner M, Loh PR, Raychaudhuri S. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods. 2019;16(12):1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, Slichter CK, Miller HW, McElrath MJ, Prlic M, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. doi: 10.1186/s13059-015-0844-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc. 2009;4(8):1184–1191. doi: 10.1038/nprot.2009.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Morabito S, Miyoshi E, Michael N, Shahin S, Martini AC, Head E, Silva J, Leavy K, Perez-Rosendahl M, Swarup V. Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer's disease. Nat Genet. 2021;53(8):1143–1155. doi: 10.1038/s41588-021-00894-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Stuart T, Srivastava A, Madad S, Lareau CA, Satija R. Single-cell chromatin state analysis with Signac. Nat Methods. 2021;18(11):1333–1341. doi: 10.1038/s41592-021-01282-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Gel B, Serra E. karyoploteR: an R/Bioconductor package to plot customizable genomes displaying arbitrary data. Bioinformatics. 2017;33(19):3088–3090. doi: 10.1093/bioinformatics/btx346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Castro-Mondragon JA, Riudavets-Puig R, Rauluseviciute I, Lemma RB, Turchi L, Blanc-Mathieu R, Lucas J, Boddie P, Khan A, Manosalva Perez N, et al. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2022;50(D1):D165–D173. doi: 10.1093/nar/gkab1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat Methods. 2017;14(10):975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Data processing pipeline for exploring transposable elements (TEs) during the endothelial-to-hematopoietic transition (EHT). GitHub. 2023. https://github.com/ventson/hscTE.
- 110.Speir ML, Bhaduri A, Markov NS, Moreno P, Nowakowski TJ, Papatheodorou I, Pollen AA, Raney BJ, Seninge L, Kent WJ, et al. UCSC Cell Browser: visualize your single-cell data. Bioinformatics. 2021;37(23):4578–4580. doi: 10.1093/bioinformatics/btab503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Buels R, Yao E, Diesh CM, Hayes RD, Munoz-Torres M, Helt G, Goodstein DM, Elsik CG, Lewis SE, Stein L, et al. JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol. 2016;17:66. doi: 10.1186/s13059-016-0924-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The scRNA-seq and spatial transcriptome data for human AGM that were analyzed in this study are available from GEO (GSE162950) [22]. The scRNA-seq and scATAC-seq data for mouse AGM are available from GEO (GSE137117) [21]. The time series RNA-seq data for HUVEC are available from SRA (PRJNA561635) [92]. The detailed information on samples used in this study can be found in Additional file 4 . All analysis pipelines, in-house scripts and files for reproducing the results in this study can be accessed at GitHub [109] (https://github.com/ventson/hscTE). We also provide a web interface (https://bis.zju.edu.cn/hscTE , implemented using UCSC Cell Browser [110]) to visualize TE and gene expression during human and mouse EHT. The multi-faceted display (including TEs, CpG, cCREs, peaks, and genome coverages) of mouse EHT scATAC-seq data is available from https://bis.zju.edu.cn/hscTE/jbrowse/?data=mouse , which is implemented by JBrowse [111].