Abstract
Angiogenesis is extensively involved in embryonic development and requires complex regulation networks, whose defects can cause a variety of vascular abnormalities. Cis-regulatory elements control gene expression at all developmental stages, but they have not been studied or profiled in angiogenesis yet. In this study, we exploited public DNase-seq and RNA-seq datasets from a VEGFA-stimulated in vitro angiogenic model, and carried out an integrated analysis of the transcriptome and chromatin accessibility across the entire process. Totally, we generated a bank of 47,125 angiogenic cis-regulatory elements with promoter (marker by H3K4me3) and/or enhancer (marker by H3K27ac) activities. Motif enrichment analysis revealed that these angiogenic cis-regulatory elements interacted preferentially with ETS family TFs. With this tool, we performed an association study using our WES data of TAPVC and identified rs199530718 as a cis-regulatory SNP associated with disease risk. Altogether, this study generated a genome-wide bank of angiogenic cis-regulatory elements and illustrated its utility in identifying novel cis-regulatory SNPs for TAPVC, expanding new horizons of angiogenesis as well as vascular abnormality genetics.
Subject terms: Angiogenesis, Data mining, Development
Introduction
Angiogenesis refers to the physiology process of forming new blood vessels from existing vascular networks, which is essential for vascular morphogenesis in almost all tissues in the body1. Any defects in angiogenesis can lead to vascular abnormality, including arteriovenous malformation2, congenital heart disease3 and infantile hemangioma4. Such kind of disease is common in children and adults, and may cause damage to health to varying degrees. Abnormal angiogenesis is also the hallmark of cancer and inflammatory and ischemic diseases5. It not only contributes to disease progression but serves as a promising target for drug treatment. Therefore, owing to its crucial role in human health, angiogenesis has gained substantial interest among researchers over the past decade.
With continued efforts and research, tremendous advances have been made in inspecting the complex molecular and genetic mechanisms underlying angiogenesis in human6,7. Particularly, endothelial cell proliferation, adhesion, migration and tube formation are thought to be key events for the angiogenic process7. Such endothelial cell behaviors can be regulated by external signaling components (e.g., BMP68 and DLL49), transcription factors (TFs) (e.g., ETS110 and FOXC111) and epigenetic enzymes (e.g., EP30012 and EZH213), thereinto vascular endothelial growth factor A (VEGFA) exemplifies one of the most powerful regulators. As an extracellular signaling factor, VEGFA activates a cascade of endothelial cell gene transcription and controls almost every stage of angiogenesis in human physiology and diseases14.
In recent years, cis-regulation has emerged as an important mechanism of controlling gene expression in embryonic development, which requires the combinatorial interplay of TFs with defined cis-regulatory elements in the genome15,16. There is no unified definition of cis-regulatory elements yet, but in most cases they comprise promoters, enhancers, silencers and insulators17. Notably, comprehensive chromatin and epigenetic landscapes act as efficient tools for genome-wide characterization of cis-regulatory elements18,19. It is extremely noteworthy that cis-regulatory elements vary among different cell-types and tissues20, implying the necessity of compendium analysis of such elements during human embryo morphogenesis. Up till now, cis-regulatory elements have been carefully mapped in several human organs across early developmental time points, including the brain21, heart22 and adipose23. But few studies have concentrated on the field of angiogenesis, so that cis-regulatory elements associated with this process (hereafter referred to as ‘angiogenic cis-regulatory elements’) still remain obscure.
In this study, to profile angiogenic cis-regulatory elements, we conducted an integrated analysis of the transcriptome and chromatin accessibility in a VEGFA-stimulated in vitro angiogenic model, as shown in Fig. 1. We generated a bank of 47,125 angiogenic cis-regulatory elements with promoter and/or enhancer activities. The angiogenic cis-regulatory elements were all located outside ‘gene desert’ regions and enriched for motifs of angiogenesis-relevant TFs. Using this bank, we performed a post exome-wide association study (EWAS) of total anomalous pulmonary venous connection (TAPVC) and found rs199530718 as a novel cis-regulatory single nucleotide polymorphism (SNP). These results provide a general landscape of cis-regulation in angiogenesis, and demonstrate the utility of angiogenic cis-regulatory elements in elucidating the genetics of vascular abnormality.
Fig. 1.
Overall workflow of this study.
Results
Examining temporal transcriptome changes in VEGFA-stimulated HUVECs
The VEGFA-induced stimulation of human umbilical vein endothelial cells (HUVECs) (hereafter referred to as ‘VEGFA-HUVEC angiogenic model’) is an excellent in vitro system for studying the cis-regulation of angiogenesis10,24–26. To profile angiogenic cis-regulatory elements, we retrieved public genetic and epigenetic sequencing datasets of HUVECs before (H-0) and after VEGFA stimulation for 1 (H-1), 4 (H-4) and 12 (H-12) h (Fig. 2a)10,24. As open chromatin regions shared a similar genome-wide distribution with cis-regulatory elements27, we performed correlation analysis of the DNase-seq tag densities among these four stages to determine the relevance of their chromatin accessibility. Intriguingly, the entire stimulation process could be grouped into two time periods, that is, the early (H-0 and H-1) and late (H-4 and H-12) periods (Fig. 2b). This result, which had never been reported in VEGFA-HUVEC angiogenic model, demonstrated that the chromatin accessibility of HUVECs underwent temporal changes when stimulated by VEGFA. Given the landscapes of chromatin accessibility and gene expression were reciprocal causation in in vitro cardiogenesis22, we next examined the temporal transcriptome features of VEGFA-HUVEC angiogenic model. PCA analysis of the retrieved RNA-seq datasets discovered remarkable heterogeneity between the early and late periods of stimulation (Fig. 2c). Collectively, we speculated that in the late stimulation period, VEGFA might reprogrammed, or remodeled, HUVECs into a hitherto undescribed cell type.
Fig. 2.
Temporal transcriptome dynamics of VEGFA-stimulated HUVECs. (a) Schematic illustration of the overall experimental design. Public genetic and epigenetic sequencing datasets of HUVECs before and after VEGFA stimulation for 1, 4 and 12 h were retrieved for analysis. (b) Correlation heat map of the DNase-seq tag densities in H-0, H-1, H-4 and H-12. Samples with similar chromatin accessibility were highlighted by solid or dashed borders. (c) PCA plot showing the first two components of H-0, H-1, H-4 and H-12. (d) Distribution of DEGs in pairwise comparison. (e) Time series patterns of DEGs during VEGFA stimulation. (f) Dot plots showing enriched GO BP terms of DEGs in Clusters 1 and 2 (left panel) as well as Clusters 3 and 4 (right panel), respectively. (g) Heat map of representative genes of Clusters 1 to 4.
We then screened differentially expressed genes (DEGs) in VEGFA-HUVEC angiogenic model to unveil the nature of H-4 and H-12. There were a total of 839 DEGs identified in the entire stimulation process (Fig. 2d), which was identical with Wang’s report24. These DEGs were further classified into four patterns (215 genes in Cluster 1, 197 in Cluster 2, 262 in Cluster 3, and 175 in Cluster 4) according to time-series clustering analysis (Fig. 2e, Data 1). Genes in Clusters 1 and 2 showed high expression level in the early stimulation period, but were monotonically downregulated in the late period. By contrast, genes in Clusters 3 and 4 were continuously upregulated in the early stimulation period, and showed high expression level in the late period. We thus categorized the genes in Cluster 1 as H-0 specific enriched, Cluster 2 as H-1 specific enriched, Cluster 3 as H-4 specific enriched, and Cluster 4 as H-12 specific enriched, respectively.
Gene Ontology (GO) analysis of Cluster 1 or 2 gene set obtained GO Biological Process (BP) terms closely aligned with endothelial function (e.g., wound healing28, BMP signal pathway29 and inflammatory response30) (Fig. 2f-left panel). Further investigation of these two gene sets identified a DEG subset essential for endothelial identity, such as ZEB231, EDN132 and PROX133 (Fig. 2g, Supplementary Fig. 1a). As for Cluster 3 or 4 gene set, we got GO BP terms related with progenitor cell function (e.g., angiogenesis34, stem cell division and cell differentiation) (Fig. 2f-right panel). Specially, these two gene sets were defined by progenitor cell markers like CD3435, NR5A236 and MEF2C37 (Fig. 2g, Supplementary Fig. 1a). Taken together, our data suggested VEGFA reprogrammed HUVECs into a progenitor-like fate, and H-4 and H-12 exhibited angiogenic transcriptome features.
Temporal transitions in VEGFA-stimulated HUVECs reflected by chromatin accessibility
Considering transcriptome as a readout of the cis-regulatory network, we investigated both the DNase-seq signals and the RNA-seq signals of DEGs at different stages in VEGFA-HUVEC angiogenic model. In Cluster 1 and 2 gene sets, we examined the ZEB2 and SOX7 gene loci due to their crucial roles in maintaining endothelial cell fate31,38. Compared with H-4 and H-12, H-0 and H-1 had higher enrichment of the DNase-seq signals at putative promoters and enhancers at both ZEB2 and SOX7 gene loci, which was consistent with the stages when these two genes were highly expressed (Fig. 3a,b,e,f). The MEF2C and NR5A2 gene loci in Cluster 3 and 4 gene sets were then examined since they participated in pluripotency maintenance36,37. Their putative promoters and enhancers had more enriched DNase-seq signals in H-4 and H-12 than in H-0 or H-1, showing consistency with their respective mRNA expression dynamics (Fig. 3c,d,g,h). From these results, we observed temporal changes in chromatin accessibility related to VEGFA stimulation and correlated with gene transcriptions. The temporal transitions in VEGFA-stimulated HUVECs could be precisely reflected by the epigenetic dynamics. Specifically, the chromatin accessibility landscapes of H-4 and H-12 revealed the cis-regulatory network of angiogenesis.
Fig. 3.
Temporal chromatin accessibility dynamics of VEGFA-stimulated HUVECs. (a–d) Normalized expression levels of ZEB2 (a), SOX7 (b), MEF2C (c) and NR5A2 (d) during VEGFA stimulation, respectively. Data represented means ± SEM (n = 2 per group). e-h Normalized epigenetic and expression profiles at the ZEB2 (e), SOX7 (f), MEF2C (g) and NR5A2 (h) loci during VEGFA stimulation, respectively.
Identifying angiogenic cis-regulatory elements
To identify angiogenic cis-regulatory elements, we searched for open chromatin regions in H-4 and H-12 based on the retrieved DNase-seq datasets. There were 72,113 significant DNase-seq peaks in H-4 as well as 75,280 in H-12, which were recognized as their respective open chromatin regions (Fig. 4a). Then a total of 90,572 angiogenic open chromatin regions was identified by merging the above genomic regions in H-4 and H-12 (Fig. 4b). Of all the angiogenic open chromatin regions, 29,929 (33.1%) were in promoters, 4,952 (5.5%) were in exons, 30,228 (33.4%) were in introns, 4,180 (4.6%) were in UTR5/UTR3, and 21,283 (23.4%) were in intergenic regions (Fig. 4c, Data 2). The genomic distribution of angiogenic open chromatin regions was similar with that of other human tissues39, suggesting our strategy was accurate for profiling such genomic regions.
Fig. 4.
Epigenetic profiling predicted angiogenic cis-regulatory elements. (a) Read density heat maps showing normalized DNase-seq enrichments in H-4 (left panel) and H-12 (right panel) at their respective peak regions (center ± 2 kb). Peak regions in each heat map were represented as horizontal rows, and ranked by decreasing signal strength. (b) Number of open chromatin regions in H-4, H-12 and the whole late stimulation period. (c) Genomic distribution of angiogenic open chromatin regions. (d,e) Average profiles and read density heat maps showing normalized H3K27ac (d) and H3K4me3 (e) enrichments in H-4 and H-12 at angiogenic open chromatin regions (center ± 2 kb). Black solid and dashed borders were used to highlight genomic regions with and without ChIP-seq signals, respectively. (f) Typical examples of angiogenic open chromatin regions with histone modification of H3K27ac (i), H3K4me3 (ii) and both (iii). (g) Number of angiogenic open chromatin regions with H3K27ac and/or H3K4me3 modifications.
Since most of cis-regulatory elements in vertebrate genomes were enhancers and promoters40, we here confined angiogenic cis-regulatory elements to angiogenic open chromatin regions with enhancer or promoter activity. Public ChIP-seq datasets for two histone modifications, H3K27ac and H3K4me3, in H-4 and H-12 were retrieved in subsequent analysis. These two marks were widely used to label enhancers and promoters, respectively41,42. In H-4, there were 40,186 (44.4%) angiogenic open chromatin regions with H3K27ac enrichment and 22,654 (25.0%) with H3K4me3 enrichment (Fig. 4d,e). In H-12, there were 39,824 (44.0%) angiogenic open chromatin regions with H3K27ac enrichment and 22,010 (24.3%) with H3K4me3 enrichment (Fig. 4d,e). After merging these genomic regions in H-4 and H-12, we found 42,378 angiogenic open chromatin regions with enhancer activity (H3K27ac modification), and 23,745 with promoter activity (H3K4me3 modification) (Fig. 4g).
Noteworthy, most of the above open chromatin regions were monofunctional with either enhancer or promoter activity, as exemplified by the HDAC5 and NOTCH2 gene loci (Fig. 4fi,ii). The rest were bifunctional with both enhancer and promoter activities, as exemplified by the NR5A2 gene locus (Fig. 4fiii). It was in accordance with a previous conclusion that some genomic regions might switch between enhancer and promoter signatures43,44. Thereby, we merged all open chromatin regions with H3K27ac and/or H3K4me3 modifications in H-4 and H-12, and obtained a total of 47,125 angiogenic cis-regulatory elements (Fig. 4g).
Depicting epigenetic signatures of angiogenic cis-regulatory elements
We next compared angiogenic cis-regulatory elements with known features of the human genome. Of all the angiogenic cis-regulatory elements, 20,887 (44.3%) were in promoters, 2,111 (4.5%) were in exons, 14,458 (30.7%) were in introns, 2,361 (5.0%) were in UTR5/UTR3, and 7,308 (15.5%) were in intergenic regions (Fig. 5a, Data 3). Obviously, the vast majority of angiogenic cis-regulatory elements were resided in genome noncoding regions, which conformed to the basic characteristic of regulatory DNA sequences17. On the other hand, we analyzed the genomic locations of angiogenic cis-regulatory elements according to gene annotation. 30,785 (65.3%) angiogenic cis-regulatory elements were found to locate within 5 kb upstream or downstream from their respective nearest TSSs, whereas the rest were distal from their respective neighboring genes (5 kb to 100 kb) (Fig. 5b). None of angiogenic cis-regulatory elements were settled in ‘gene desert’ regions (>500 kb that were devoid of protein coding genes). Collectively, our identified angiogenic cis-regulatory elements were almost noncoding sequences and might regulate gene transcription in angiogenesis via long-range interactions.
Fig. 5.
General features of angiogenic cis-regulatory elements. (a) Genomic distribution of angiogenic cis-regulatory elements. (b) Distance of each angiogenic cis-regulatory element from its nearest TSS. (c) Significant MSigDB pathways and phenotypes enriched in angiogenic cis-regulatory elements. (d) TF binding motifs enriched in angiogenic cis-regulatory elements.
Further functional annotation of angiogenic cis-regulatory elements was conducted via GREAT. As expected, we found that the enriched MSigDB pathways were tightly related with angiogenesis, including Notch signal pathway45, elongation arrest and recovery46, NFAT signal pathway47 and transcription regulated by SMAD2/3:SMAD4 heterotrimer (Fig. 5c). These angiogenic cis-regulatory elements were also involved in abnormal vascular endothelial cell development (Fig. 5c), suggesting their important role in angiogenesis regulation. We then used HOMER to predict TFs that could potentially bind with angiogenic cis-regulatory elements. The TF with known motifs enriched in angiogenic cis-regulatory elements as well as high expression level in both H-4 and H-12 was recognized as a candidate. Over 15% of angiogenic cis-regulatory elements were enriched for motifs of TFs crucial for angiogenesis, such as FLI148 (17.1%), ETV449 (17.6%), ERG50 (20.6%), ETS110 (14.2%) and ETV151 (18.7%) (Fig. 5d). These five TFs also had persistently detectable mRNA levels in HUVECs during VEGFA stimulation (Supplementary Fig. 1b–f). With these results, we concluded that our identified angiogenic cis-regulatory elements contained comprehensive information on angiogenesis cis-regulation.
Using angiogenic cis-regulatory elements as instrument for identifying cis-regulatory SNPs associated with TAPVC risk
A relevant usage of angiogenic cis-regulatory elements was to guide post-EWAS studies by identifying vascular abnormality-associated cis-regulatory SNPs. We employed this instrument to screen cis-regulatory SNPs associated with TAPVC risk, a congenital heart disease mainly caused by aberrant angiogenesis3. The analysis pipeline had been put forward in our previous study22, and was shown in Supplementary Fig. 2a. Whole-exome sequencing (WES) data of 78 TAPVC cases and 100 controls passed quality control, and a subset of 121,107 common SNPs with high quality was selected for exome-wide association analysis. Of note, there was no population stratification between cases and controls (Supplementary Fig. 2b,c). We thus examined the exome-wide association in an additive logistic regression model without adjustment for any covariates. 25 SNPs showed statistical evidence of exome-wide association with TAPVC and were listed in Data 4 (Fig. 6a, Supplementary Fig. 2d). To avoid any potential impact of linkage disequilibrium (LD) on the findings, we further set the threshold of r2 < 0.6 and obtained 7 independent lead SNPs among the exome-wide associated SNPs (Fig. 6a). LD expansion with a cutoff r2 value of 0.2 revealed another 34 SNPs that were in LD with at least one of the independent lead SNPs. Together, our EWAS discovered a total of 41 SNPs in association with TAPVC risk (Fig. 6a, Data 5). Of all the TAPVC-associated SNPs, 26 (63.4%) were in exons, 9 (22.0%) were in introns, and 6 (14.6%) were in intergenic regions (Fig. 6b).
Fig. 6.
Using angiogenic cis-regulatory elements to find new cis-regulatory SNPs for TAPVC. (a) Number of exome-wide associated, independent lead, LD and TAPVC-associated SNPs. (b) Genomic distribution of TAPVC-associated SNPs. (c) Schematic illustration of the analysis pipeline. TAPVC-associated SNPs located in angiogenic cis-regulatory elements were recognized as cis-regulatory SNP candidates for TAPVC. (d) Predicted TFs bound to the rs199530718 locus via JASPAR database.
Next, our goal was to find if there were any cis-regulatory SNPs for TAPVC among 41 TAPVC-associated SNPs. Two TAPVC-associated SNPs were located within angiogenic cis-regulatory elements and thereby were recognized as cis-regulatory SNP candidates, namely rs199530718 and rs201538928 (Fig. 6c). rs199530718 was predicted to interact with PRDM1 to form the ‘SNP-TF’ circuit (Fig. 6d), whereas rs201538928 was not located at any TF motifs (data not shown). PRDM1 was a well-studied regulator of embryonic stem cell pluripotency and could affect the process of endothelial cell differentiation52,53. Once its DNA-binding motif was disrupted by rs199530718, PRDM1 would fail to bind to the angiogenic cis-regulatory element containing the rs199530718-A allele. It could disable the PRDM1-mediated pluripotent transcription network in endothelial cells, making them hard to adopt the progenitor-like fate in angiogenesis. In summary, our analyses indicated that rs199530718 was a cis-regulatory SNP linked with TAPVC, validating the angiogenic cis-regulatory elements as an important tool to investigate vascular abnormality genetics.
Discussion
This study carried out a comprehensive assessment of the transcriptome and chromatin signatures in VEGFA-HUVEC angiogenic model, and generated a bank of 47,125 angiogenic cis-regulatory elements. We used this bank to analyze the TAPVC-associated SNPs and discovered a novel cis-regulatory SNP for TAPVC, namely rs199530718. The risk allele rs199530718-A was predicted to disrupt the PRDM1-binding site on an angiogenic cis-regulatory element, thereby causing aberrant angiogenesis. Generally, this study provided a valuable tool for epigenetic dissection of angiogenesis and genetics of vascular abnormality.
Although the VEGFA-HUVEC angiogenic model is not new, the novelty of this study lies in its pure computational approach, integrating DNase-seq, RNA-seq and ChIP-seq datasets deposited on the Gene Expression Omnibus (GEO). Previous analysis of these datasets, combined with cellular assay, has revealed VEGFA-induced transcriptional responses and VEGFA-responsive enhancers in endothelial cells. These results not only elucidated the basic features of VEGFA-HUVEC angiogenic model but also suggested that it was reliable for mimicking angiogenesis. Here, we focused on the angiogenic process rather than the endothelial cell itself. Specifically, based on VEGFA-HUVEC angiogenic model, our computational approach re-exploited the above GEO datasets to study the cis-regulation of angiogenesis.
In vitro differentiation models recapitulating organ and tissue development have been widely applied to explore the cis-regulation of embryonic morphogenesis. Typical examples include in vitro differentiation of cardiomyocytes22,54, retina55 and hypothalamic neurons23. In this study, VEGFA-HUVEC angiogenic model was used to investigate the cis-regulation of angiogenesis. Although this model had been established over a decade, we reached a previously unreported conclusion that HUVECs would adopt a progenitor-like fate in the late stimulation period. Concretely, H-1 showed the endothelial transcriptome characteristics, whereas H-4 and H-12 had enriched BP terms related to angiogenesis and multi-lineage differentiation potential. Under this condition, VEGFA might act as a chemical molecule to reprogram HUVECs into progenitor-like cells to trigger angiogenesis. It was a reasonable speculation as the reprogramming effect of VEGFA had already been observed on endothelial cells in hepatocellular carcinoma56. Besides, our conclusion conformed to the classic sprouting angiogenesis theory, which highlighted the indispensable role of endothelial progenitor cells during the angiogenic process34. Therefore in subsequent analysis, diverse epigenetic datasets of H-4 and H-12 were integratedly examined to profile angiogenic cis-regulatory elements.
It is worth mentioning that none of the techniques available today can provide direct information on genomic cis-regulatory elements. Thus in the study, we adopted an indirect profiling method by firstly mapping angiogenic open chromatin regions with enhancer and/or promoter activities. Briefly, the angiogenic open chromatin regions were detected via DNase-seq datasets, and the enhancer or promoter activity was detected via histone ChIP-seq datasets. This method is mature for profiling genomic cis-regulatory elements, of note, as evidenced by recent achievements in the field of somite and heart development22,57. On the other hand, our identified angiogenic cis-regulatory elements were enriched for DNA-binding motifs of ETS family TFs (e.g., FLI1, ETV4, ERG and ETS1). TFs belonging to the ETS family are master regulators of endothelial cell gene transcription and participate actively in angiogenic signal transduction46. Depleting these TFs can impair angiogenesis and lead to vascular abnormality during embryogenesis58. Moreover, Zhang et al.10 have demonstrated that almost all of the angiogenic enhancers contain ETS TF motif sequences. All these evidences prove that the angiogenic cis-regulatory element bank we generated is comprehensive and reflects the nature of angiogenesis.
The angiogenic cis-regulatory element bank serves as a valuable resource for investigating the angiogenesis and genetics of vascular abnormality. In this study, the bank was used in a post-EWAS analysis to annotate potential cis-regulatory functions of the TAPVC-associated SNPs. Typical examples of such application can be seen in the etiological study of common diseases, such as ventricular septal defect22, acute lymphoblastic leukemia59 and Parkinson’s disease60. Such kind of cis-regulatory element banks also contributes to researches including gene transcription control61, targeted gene finding62, multigenome DNA sequence conservation63 and gene therapy64. For instance, Lee et al. screened proximal cis-regulatory elements in the IL-10 gene loci of Th1 and Th2 cells, and reported a new enhancer that can regulate IL-10 expression in distinct T helpers. Here, while we advocate the use of the angiogenic cis-regulatory element bank to recognize cis-regulatory SNPs for vascular abnormality, we also emphasize that, the ‘SNP-TF’ circuit is a vital clue to prioritize that TF for future follow-up studies.
To sum up, our integrated genetic and epigenetic analysis has generated a genome-wide bank of angiogenic cis-regulatory elements. Browsing the bank enables recognition and understanding of novel cis-regulatory SNPs linked with TAPVC. This study is limited by the lack of evidence from molecular and cellular experiments, which hinders our efforts to further explore angiogenic cis-regulatory elements. Nevertheless, the angiogenic cis-regulatory element bank and the study itself have provided a tool for investigating the cis-regulation of angiogenesis, and contribute to understand genetics of vascular abnormality.
Methods
High-throughput datasets
For this study, high-throughput data from HUVECs before and/or after VEGFA stimulation were reanalyzed. Raw FASTQ files for RNA-seq and DNase-seq were downloaded from GEO series GSE4116610,65, and for H3K27ac and H3K4me3 ChIP-seq were downloaded from GEO series GSE10962624,66. Before alignment, raw sequencing reads were trimmed to generate clean reads via Trim Galore (version 0.6.7) with parameters ‘-q 20 --length 20 --stringency 4 --e 0.1’.
Reanalysis of RNA-seq datasets
Clean reads were aligned to the hg19 reference genome via Hisat2 (version 2.2.1) with default parameters67, and then SAMtools (version 1.9) was used to remove duplicate reads68. Total reads that overlapped the exons of the genes were counted via HTSeq (version 0.13.5) with parameters ‘-s n -t exon’69. Raw gene expression values were computed as counts per million mapped reads (CPM).
For visualization, post-filtered BAM files were normalized and converted to BIGWIG format via deepTools2 bamCoverage with parameters ‘--normalizeUsing RPGC --effectiveGenomeSize 2864785220 --binSize 10’70. Gene tracks were visualized via Integrative Genomics Viewers (IGV).
For principal component analysis (PCA), principal components of gene expression data from all samples were calculated via R function ‘prcomp’. The first two components were then visualized via R package ‘ggplot2’.
For differential analysis, differential expression was assessed by performing all pairwise comparisons among samples. R package ‘DESeq 2’ was used to identify DEGs following the criteria of |log2 (fold change)| ≥ 0.58 and adjust p ≤ 0.01. Time-series clustering of DEGs was analyzed via R package ‘Mfuzz’ with parameter ‘c = 4’.
For functional annotation, GO enrichment analysis for each time-series cluster of DEGs was carried out via DAVID database (https://david.ncifcrf.gov/)71. The GO terms with p < 0.05 were considered as significant and visualized via R package ‘ggplot2’.
Reanalysis of DNase-seq datasets
Clean reads were aligned to hg19 genome via Bowtie2 (version 2.4.4) with default parameters72. Aligned BAM files were then processed to remove low quality mapped and duplicate reads. Peak calling was performed via MACS2 (version 2.1.1.20160309) with parameters ‘--nomodel --shift 100 --extsize 200 -q 0.05’.
For visualization, the pipelines of generating BIGWIG files and visualizing gene tracks were the same as those for RNA-seq datasets. Particularly, significant DNase-seq peaks were visualized via deepTools2 plotHeatmap.
For correlation analysis, genome-wide correlation matrix was calculated via deepTools2 multiBamSummary and plotCorrelation with parameters ‘--corMethod pearson --binSize 10000’. Post-filtered BAM files of all samples were imported as inputs. The correlation heat map was generated via R package ‘pheatmap’.
Reanalysis of ChIP-seq datasets
The analysis pipeline for ChIP-seq reads was the same as that for DNase-seq datasets. Particularly, broad peaks were called via MACS2 with parameters ‘—broad --broad-cutoff 0.1’ and then visualized via IGV.
For read density analysis, the read density matrix was counted via deepTools2 computeMatrix with parameters ‘--referencePoint center -a 2000 -b 2000’, and then was visualized via deepTools2 plotHeatmap.
Identification and annotation of open chromatin regions
The DNase-seq peaks in each sample constituted the initial set of its respective open chromatin regions. Genomic location annotation of open chromatin regions was performed via R package ‘ChIPseeker’.
Identification and annotation of cis-regulatory elements
The DNase-seq peaks that had an overlap with H3K27ac and/or H3K4me3 peaks in each sample constituted the initial set of its respective cis-regulatory elements. Genomic location annotation of cis-regulatory elements was performed via R package ‘ChIPseeker’. Pathway and other enriched functions were predicted via GREAT (version 3.0.0; http://great.stanford.edu/public/html/)73. The enriched terms with p < 0.05 were considered as significant. TF motif enrichment analysis was performed via HOMER with the algorithm ‘findMotifsGenome.pl’74. The enriched motifs with p < 1 × 10−20 were considered as significant.
Exome-wide association analysis
WES data of 78 TAPVC cases and 100 healthy controls were derived from our previous study3. All of the study population was unrelated and recruited from Xinhua Hospital affiliated to Shanghai Jiao Tong University. Before enrollment, written informed consents were signed by participants or their guardians.
For individual quality control, no individuals were filtered out owing to sex discrepancies or low genotyping rate (<95%). For SNP quality control, SNPs were excluded if they were located on sex chromosomes, if their call rate was <95%, if the minor allele frequency (MAF) was <0.05 among controls, or if the p value in Hardy-Weinberg equilibrium test was < 1 × 10−5 among controls. A total of 121,107 high-quality SNPs passed quality control testing and was included for exome-wide association analysis.
For population stratification analysis, PCA of 78 TAPVC cases and 100 controls was performed via PLINK (version 1.90) using all high-quality SNPs75. The first two eigenvectors were visualized via R package ‘ggplot2’.
For association analysis, exome-wide associations were assessed in an additive logistic regression model via PLINK without adjustment for any covariates. SNPs with p < 1 × 10−5 were considered as exome-wide associated. The quantile-quantile (Q-Q) plot and the Manhattan plot were both generated via R package ‘qqman’.
For LD analysis, independent lead SNPs were extracted from exome-wide associated SNPs which were independent from each other at r2 < 0.6. LD SNPs were extracted from high-quality SNPs which were in LD (r2 > 0.2) with at least one independent lead SNP. TAPVC-associated SNPs were the union of independent lead SNPs and LD SNPs. Particularly, functional annotation of the TAPVC-associated SNPs was carried out via ANNOVAR76.
Supplementary information
Acknowledgements
We thank Zhongli Jiang (the University of Purdue, USA) and Hongyu Shao (the University of Sheffield, UK) for their assistance with the EWAS analysis. This study was supported by the National Natural Science Foundation of China (81802699, 82170304, 82370371).
Author contributions
L.H.J., Z.Y.H., L.W., Y.Y. and K.S. conceptualized and designed the study. L.H.J. and Z.Y.H. generated and analyzed the data. X.T.M., J.R.L., Y.W.L., B.Q.Y. and Y.W.L. wrote the manuscript. L.H.J., Z.Y.H., L.L.L. and L.W. performed the revision process. All authors contributed to critical editing of the manuscript.
Data availability
All data analyzed in this study were summarized in Supplementary Table 1. GEO datasets were available at https://ncbi.nlm.nih.gov/geo/. All high-throughput tools and R packages used in this study were public resources as described in Methods. All data generated were accessible on the figshare repository (ref. 77), as listed below:
1. Data 1: DEGs identified in the VEGFA-HUVEC angiogenic model.
2. Data 2: angiogenic open chromatin regions.
3. Data 3: angiogenic cis-regulatory elements.
4. Data 4: exome-wide associated SNPs for TAPVC.
5. Data 5: TAPVC-associated SNPs.
Code availability
No custom code was used.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Lihui Jin, Zhenyuan Han.
Contributor Information
Lihui Jin, Email: jinlh1995@sjtu.edu.cn.
Zhenyuan Han, Email: zhenyuan_han@163.com.
Lin Wang, Email: linwang@bjmu.edu.cn.
Yu Yu, Email: yuyu@xinhuamed.com.cn.
Kun Sun, Email: sunkun@xinhuamed.com.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41597-024-03272-6.
References
- 1.Liu T, Zhang L, Joo D, Sun S-C. NF-κB signaling in inflammation. Signal Transduct. Target. Ther. 2017;2:17023. doi: 10.1038/sigtrans.2017.23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lee H-W, et al. Role of venous endothelial cells in developmental and pathologic angiogenesis. Circulation. 2021;144:1308–1322. doi: 10.1161/CIRCULATIONAHA.121.054071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shi X, et al. Next-generation sequencing identifies novel genes with rare variants in total anomalous pulmonary venous connection. EBioMedicine. 2018;38:217–227. doi: 10.1016/j.ebiom.2018.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Queisser A, Seront E, Boon LM, Vikkula M. Genetic basis and therapies for vascular anomalies. Circ. Res. 2021;129:155–173. doi: 10.1161/CIRCRESAHA.121.318145. [DOI] [PubMed] [Google Scholar]
- 5.Potente M, Gerhardt H, Carmeliet P. Basic and therapeutic aspects of angiogenesis. Cell. 2011;146:873–887. doi: 10.1016/j.cell.2011.08.039. [DOI] [PubMed] [Google Scholar]
- 6.Rogers, M. S. & D’Amato, R. J. Common polymorphisms in angiogenesis. Cold Spring Harb. Perspect. Med. 2 (2012). [DOI] [PMC free article] [PubMed]
- 7.Lamalice L, Le Boeuf F, Huot J. Endothelial cell migration during angiogenesis. Circ. Res. 2007;100:782–794. doi: 10.1161/01.RES.0000259593.07661.1e. [DOI] [PubMed] [Google Scholar]
- 8.Pulkkinen HH, et al. BMP6/TAZ-Hippo signaling modulates angiogenesis and endothelial cell response to VEGF. Angiogenesis. 2021;24:129–144. doi: 10.1007/s10456-020-09748-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Pitulescu ME, et al. Dll4 and Notch signalling couples sprouting angiogenesis and artery formation. Nat. Cell Biol. 2017;19:915–927. doi: 10.1038/ncb3555. [DOI] [PubMed] [Google Scholar]
- 10.Zhang B, et al. A dynamic H3K27ac signature identifies VEGFA-stimulated endothelial enhancers and requires EP300 activity. Genome Res. 2013;23:917–927. doi: 10.1101/gr.149674.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Prasitsak T, et al. Foxc1 is required for early stage telencephalic vascular development. Dev. Dyn. 2015;244:703–711. doi: 10.1002/dvdy.24269. [DOI] [PubMed] [Google Scholar]
- 12.Yao TP, et al. Gene dosage-dependent embryonic development and proliferation defects in mice lacking the transcriptional integrator p300. Cell. 1998;93:361–372. doi: 10.1016/S0092-8674(00)81165-4. [DOI] [PubMed] [Google Scholar]
- 13.Lu C, et al. Regulation of tumor angiogenesis by EZH2. Cancer Cell. 2010;18:185–197. doi: 10.1016/j.ccr.2010.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Leung DW, Cachianes G, Kuang WJ, Goeddel DV, Ferrara N. Vascular endothelial growth factor is a secreted angiogenic mitogen. Science. 1989;246:1306–1309. doi: 10.1126/science.2479986. [DOI] [PubMed] [Google Scholar]
- 15.Arnone MI, Davidson EH. The hardwiring of development: organization and function of genomic regulatory systems. Development. 1997;124:1851–1864. doi: 10.1242/dev.124.10.1851. [DOI] [PubMed] [Google Scholar]
- 16.Ghazi A, VijayRaghavan KV. Developmental biology. Control by combinatorial codes. Nature. 2000;408:419–420. doi: 10.1038/35044174. [DOI] [PubMed] [Google Scholar]
- 17.Schmitz RJ, Grotewold E, Stam M. Cis-regulatory sequences in plants: Their importance, discovery, and future challenges. Plant Cell. 2022;34:718–741. doi: 10.1093/plcell/koab281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.de Laat W, Duboule D. Topology of mammalian developmental enhancers and their regulatory landscapes. Nature. 2013;502:499–506. doi: 10.1038/nature12753. [DOI] [PubMed] [Google Scholar]
- 19.Klemm SL, Shipony Z, Greenleaf WJ. Chromatin accessibility and the regulatory epigenome. Nat. Rev. Genet. 2019;20:207–220. doi: 10.1038/s41576-018-0089-8. [DOI] [PubMed] [Google Scholar]
- 20.Nott A, et al. Brain cell type-specific enhancer-promoter interactome maps and disease-risk association. Science. 2019;366:1134–1139. doi: 10.1126/science.aay0793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Song M, et al. Mapping cis-regulatory chromatin contacts in neural cells links neurop sychiatric disorder risk variants to target genes. Nat. Genet. 2019;51:1252–1262. doi: 10.1038/s41588-019-0472-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jin L, et al. Integrated genomic analysis identifies novel low-frequency cis-regulatory variant rs2279658 associated with VSD risk in Chinese children. Front. Cell Dev. Biol. 2022;10:1062403. doi: 10.3389/fcell.2022.1062403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Joslin AC, et al. A functional genomics pipeline identifies pleiotropy and cross-tissue effects within obesity-associated GWAS loci. Nat. Commun. 2021;12:5253. doi: 10.1038/s41467-021-25614-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang S, et al. A dynamic and integrated epigenetic program at distal regions orchestrates transcriptional responses to VEGFA. Genome Res. 2019;29:193–207. doi: 10.1101/gr.239053.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Abhinand CS, et al. Temporal phosphoproteomic analysis of VEGF-A signaling in HUVECs: an insight into early signaling events associated with angiogenesis. J. Cell Commun. Signal. 2023;17:1067–1079. doi: 10.1007/s12079-023-00736-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sunitha P, et al. Temporal VEGFA responsive genes in HUVECs: Gene signatures and potential ligands/receptors fine-tuning angiogenesis. J. Cell Commun. Signal. 2019;13:561–571. doi: 10.1007/s12079-019-00541-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ackermann AM, Wang Z, Schug J, Naji A, Kaestner KH. Integration of ATAC-seq and RNA-seq identifies human alpha cell and be ta cell signature genes. Mol. Metab. 2016;5:233–244. doi: 10.1016/j.molmet.2016.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Malinda KM, et al. Thymosin alpha 1 stimulates endothelial cell migration, angiogenesis, and wound healing. J. Immunol. 1998;160:1001–1006. doi: 10.4049/jimmunol.160.2.1001. [DOI] [PubMed] [Google Scholar]
- 29.Han O, Pak B, Jin S-W. The role of BMP signaling in endothelial heterogeneity. Front. Cell Dev. Biol. 2021;9:673396. doi: 10.3389/fcell.2021.673396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ambrozova G, et al. Nitro-oleic acid inhibits vascular endothelial inflammatory responses and the endothelial-mesenchymal transition. Biochim. Biophys. Acta. 2016;1860:2428–2437. doi: 10.1016/j.bbagen.2016.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Liu Y, et al. MicroRNA-200c-3p inhibits proliferation and migration of renal artery endothelial cells by directly targeting ZEB2. Exp. Cell Res. 2020;387:111778. doi: 10.1016/j.yexcr.2019.111778. [DOI] [PubMed] [Google Scholar]
- 32.Schönbach, C. et al. NKX2-3 transcriptional regulation of endothelin-1 and VEGF signaling in human intestinal microvascular endothelial cells. PLoS ONE6 (2011). [DOI] [PMC free article] [PubMed]
- 33.Hong Y-K, et al. Prox1 is a master control gene in the program specifying lymphatic end othelial cell fate. Dev. Dyn. 2002;225:351–357. doi: 10.1002/dvdy.10163. [DOI] [PubMed] [Google Scholar]
- 34.Laurenzana A, et al. Endothelial progenitor cells in sprouting angiogenesis: proteases pave the way. Curr. Mol. Med. 2015;15:606–620. doi: 10.2174/1566524015666150831131214. [DOI] [PubMed] [Google Scholar]
- 35.Sidney LE, Branch MJ, Dunphy SE, Dua HS, Hopkinson A. Concise review: evidence for CD34 as a common marker for diverse progenitors. Stem Cells. 2014;32:1380–1389. doi: 10.1002/stem.1661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Festuccia N, Owens N, Chervova A, Dubois A, Navarro P. The combined action of Esrrb and Nr5a2 is essential for murine naïve pluripotency. Development. 2021;148:dev199604. doi: 10.1242/dev.199604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vitali C, Tripodo C, Colombo MP. MEF2C and SOCS2 in stemness regulation. Oncoscience. 2015;2:936–937. doi: 10.18632/oncoscience.279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yao Y, Yao J, Boström KI. SOX transcription factors in endothelial differentiation and endothelial-mesenchymal transitions. Front. Cardiovasc. Med. 2019;6:30. doi: 10.3389/fcvm.2019.00030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Li J, et al. Accurate annotation of accessible chromatin in mouse and human primordial germ cells. Cell Res. 2018;28:1077–1089. doi: 10.1038/s41422-018-0096-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lu F, et al. Establishing chromatin regulatory landscape during mouse preimplantation development. Cell. 2016;165:1375–1388. doi: 10.1016/j.cell.2016.05.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Creyghton MP, et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Nat. Acad. Sci. USA. 2010;107:21931–219316. doi: 10.1073/pnas.1016071107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lauberth SM, et al. H3K4me3 interactions with TAF3 regulate preinitiation complex assembly and selective gene activation. Cell. 2013;152:1021–1036. doi: 10.1016/j.cell.2013.01.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Leung D, et al. Integrative analysis of haplotype-resolved epigenomes across human tissues. Nature. 2015;518:350–354. doi: 10.1038/nature14217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Roadmap Epigenomics Consortium, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ramasamy SK, Kusumbe AP, Wang L, Adams RH. Endothelial notch activity promotes angiogenesis and osteogenesis in bone. Nature. 2014;507:376–380. doi: 10.1038/nature13146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Chen J, et al. VEGF amplifies transcription through ETS1 acetylation to enable angiogenesis. Nat. Commun. 2017;8:383. doi: 10.1038/s41467-017-00405-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Graef IA, Chen F, Chen L, Kuo A, Crabtree GR. Signals transduced by Ca(2+)/calcineurin and NFATc3/c4 pattern the developing vasculature. Cell. 2001;105:863–875. doi: 10.1016/S0092-8674(01)00396-8. [DOI] [PubMed] [Google Scholar]
- 48.Toyama T, et al. The impact of transcription factor Fli1 deficiency on the regulation o f angiogenesis. Exp. Dermatol. 2017;26:912–918. doi: 10.1111/exd.13341. [DOI] [PubMed] [Google Scholar]
- 49.Harel S, et al. ETS1, ELK1, and ETV4 transcription factors regulate angiopoietin-1 signaling and the angiogenic response in endothelial cells. Front. Physiol. 2021;12:683651. doi: 10.3389/fphys.2021.683651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Shah AV, Birdsey GM, Randi AM. Regulation of endothelial homeostasis, vascular development and angiogenesis by the transcription factor ERG. Vascul. Pharmacol. 2016;86:3–13. doi: 10.1016/j.vph.2016.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Petit FG, Salas R, Tsai M-J, Tsai SY. The regulation of COUP-TFII gene expression by Ets-1 is enhanced by the steroid receptor co-activators. Mech. Ageing Dev. 2004;125:719–732. doi: 10.1016/j.mad.2004.03.009. [DOI] [PubMed] [Google Scholar]
- 52.Chu LF, Surani MA, Jaenisch R, Zwaka TP. Blimp1 expression predicts embryonic stem cell development in vitro. Curr. Biol. 2011;21:1759–1765. doi: 10.1016/j.cub.2011.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Niimi K, Nakae J, Inagaki S, Furuyama T. FOXO1 represses lymphatic valve formation and maintenance via PRDM1. Cell Rep. 2021;37:110048. doi: 10.1016/j.celrep.2021.110048. [DOI] [PubMed] [Google Scholar]
- 54.Bertero A, et al. Dynamics of genome reorganization during human cardiogenesis reveal an RBM20-dependent splicing factory. Nat. Commun. 2019;10:1538. doi: 10.1038/s41467-019-09483-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Xie H, et al. Chromatin accessibility analysis reveals regulatory dynamics of developing human retina and hiPSC-derived retinal organoids. Sci. Adv. 2020;6:eaay5247. doi: 10.1126/sciadv.aay5247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sharma A, et al. Onco-fetal reprogramming of endothelial cells drives immunosuppressive macrophages in hepatocellular carcinoma. Cell. 2020;183:377–394.e321. doi: 10.1016/j.cell.2020.08.040. [DOI] [PubMed] [Google Scholar]
- 57.Mok GF, et al. Characterising open chromatin in chick embryos identifies cis-regulatory elements important for paraxial mesoderm formation and axis extension. Nat. Commun. 2021;12:1157. doi: 10.1038/s41467-021-21426-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wei G, et al. Ets1 and Ets2 are required for endothelial cell survival during embryonic angiogenesis. Blood. 2009;114:1123–1130. doi: 10.1182/blood-2009-03-211391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Yang H, et al. Noncoding genetic variation in GATA3 increases acute lymphoblastic leukemia risk through local and global changes in chromatin conformation. Nat. Genet. 2022;54:170–179. doi: 10.1038/s41588-021-00993-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lee AJ, et al. Characterization of altered molecular mechanisms in Parkinson’s disease through cell type-resolved multiomics analyses. Sci. Adv. 2023;9:eabo2467. doi: 10.1126/sciadv.abo2467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lee C-G, et al. A distal cis-regulatory element, CNS-9, controls NFAT1 and IRF4-mediated IL-10 gene activation in T helper cells. Mol. Immunol. 2009;46:613–621. doi: 10.1016/j.molimm.2008.07.037. [DOI] [PubMed] [Google Scholar]
- 62.Zhang W, et al. Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana. Bioinformatics. 2005;21:3074–3081. doi: 10.1093/bioinformatics/bti490. [DOI] [PubMed] [Google Scholar]
- 63.Kuntz SG, et al. Multigenome DNA sequence conservation identifies Hox cis-regulatory elements. Genome Res. 2008;18:1955–1968. doi: 10.1101/gr.085472.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Antoniou P, et al. Base-editing-mediated dissection of a γ-globin cis-regulatory element for the therapeutic reactivation of fetal hemoglobin expression. Nat. Commun. 2022;13:6618. doi: 10.1038/s41467-022-34493-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.2013. NCBI Sequence Read Archive. SRP015904
- 66.2019. NCBI Sequence Read Archive. https://identifers.org/ncbi/insdc.sra:SRP131219
- 67.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Li H, et al. The sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Anders S, Pyl PT, Huber W. HTSeq–a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Ramírez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 72.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.McLean CY, et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 2010;28:495–501. doi: 10.1038/nbt.1630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Heinz S, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Purcell S, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Jin LH, 2024. Genome-wide profiling of angiogeniccis-regulatory elements unravelscis-regulatory SNPs for vascular abnormality. figshare. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- 2013. NCBI Sequence Read Archive. SRP015904
- 2019. NCBI Sequence Read Archive. https://identifers.org/ncbi/insdc.sra:SRP131219
- Jin LH, 2024. Genome-wide profiling of angiogeniccis-regulatory elements unravelscis-regulatory SNPs for vascular abnormality. figshare. [DOI] [PMC free article] [PubMed]
Supplementary Materials
Data Availability Statement
All data analyzed in this study were summarized in Supplementary Table 1. GEO datasets were available at https://ncbi.nlm.nih.gov/geo/. All high-throughput tools and R packages used in this study were public resources as described in Methods. All data generated were accessible on the figshare repository (ref. 77), as listed below:
1. Data 1: DEGs identified in the VEGFA-HUVEC angiogenic model.
2. Data 2: angiogenic open chromatin regions.
3. Data 3: angiogenic cis-regulatory elements.
4. Data 4: exome-wide associated SNPs for TAPVC.
5. Data 5: TAPVC-associated SNPs.
No custom code was used.






