Skip to main content
eLife logoLink to eLife
. 2024 Feb 9;13:e85235. doi: 10.7554/eLife.85235

Characterization of caffeine response regulatory variants in vascular endothelial cells

Carly Boye 1, Cynthia A Kalita 1, Anthony S Findley 1, Adnan Alazizi 1, Julong Wei 1, Xiaoquan Wen 2, Roger Pique-Regi 1,3,, Francesca Luca 1,3,4,
Editors: Stephen CJ Parker5, Detlef Weigel6
PMCID: PMC10901511  PMID: 38334359

Abstract

Genetic variants in gene regulatory sequences can modify gene expression and mediate the molecular response to environmental stimuli. In addition, genotype–environment interactions (GxE) contribute to complex traits such as cardiovascular disease. Caffeine is the most widely consumed stimulant and is known to produce a vascular response. To investigate GxE for caffeine, we treated vascular endothelial cells with caffeine and used a massively parallel reporter assay to measure allelic effects on gene regulation for over 43,000 genetic variants. We identified 665 variants with allelic effects on gene regulation and 6 variants that regulate the gene expression response to caffeine (GxE, false discovery rate [FDR] < 5%). When overlapping our GxE results with expression quantitative trait loci colocalized with coronary artery disease and hypertension, we dissected their regulatory mechanisms and showed a modulatory role for caffeine. Our results demonstrate that massively parallel reporter assay is a powerful approach to identify and molecularly characterize GxE in the specific context of caffeine consumption.

Research organism: Human

Introduction

Caffeine is the most widely consumed stimulant in the world (Planning Committee for a Workshop on Potential Health Hazards Associated with Consumption of Caffeine in Food and Dietary Supplements, Food and Nutrition Board, Board on Health Sciences Policy, Institute of Medicine, 2014). Caffeine produces a vascular response in the endothelium, causing vasodilation. The vascular endothelium, the innermost layer of arteries, is involved in several important functions, including regulation of blood flow, angiogenesis, thrombosis, and coagulation (Hadi et al., 2005; Krüger-Genge et al., 2019). Endothelial dysfunction occurs in diseases such as atherosclerosis and hypertension (Xu et al., 2021), eventually leading to coronary artery disease (CAD) (Matsuzawa and Lerman, 2014). Multiple studies have investigated the role of caffeine in cardiovascular disease (CVD), and more broadly, vascular health in general, with conflicting results (Chieng et al., 2022; Ding et al., 2014; Turnbull et al., 2017) on the role of caffeine in CVD risk. Ding et al., 2014 meta-analyzed 36 studies and found no association between heavy coffee consumption and increased risk of CVD (Ding et al., 2014). Similarly, Turnbull et al., 2017 observed that moderate caffeine consumption was not associated with an increased risk of CVD or other cardiovascular events such as heart failure (Turnbull et al., 2017). Multiple studies suggested that caffeine may be beneficial in reducing the risk of CAD (Choi et al., 2015; Miranda et al., 2018; Voskoboinik et al., 2019), while others provided evidence that caffeine may reduce the risk of heart failure, but had no significant effect on the risk of coronary heart disease or CVD (Stevens et al., 2021). Most recently, Chieng et al., 2022 found that decaffeinated, ground, and instant coffee significantly decreased CVD risk and mortality (Chieng et al., 2022). The conflicting results from these epidemiological studies may have several causes, including potential interactions between caffeine consumption and other environmental and genetic risk factors. Recent molecular studies investigated the consequences of caffeine exposure on chromatin accessibility and gene expression in vascular endothelial cells (Findley et al., 2019). This study identified response factor motifs for caffeine, defined as transcription factor motifs that are enriched in differentially accessible regions, and demonstrated that caffeine can induce changes in gene regulation in endothelial cells.

Analyzing the changes in gene expression upon exposure to environmental stimuli is a powerful approach to discover genotype–environment interactions (GxE). These molecular GxE result in a different response depending on genotype (Knowles et al., 2018; Knowles et al., 2017; Moyerbrailean et al., 2016b), potentially through allele-specific effects (ASE) on response factor binding or other environmental-specific gene regulatory mechanisms. Yet regulatory sequences that are differentially bound in response to environmental perturbations are poorly annotated. Single-nucleotide polymorphisms (SNPs) within caffeine response factor binding sites were enriched for artery expression quantitative trait loci (eQTLs) colocalized with CAD risk variants (Findley et al., 2019). The results of this study thus suggested that SNPs within regulatory elements active in the presence of caffeine may play a role in CAD risk and pointed to GxE in gene regulation as a potential mechanism underlying caffeine modulation of genetic risk for CAD. However, only a limited number of molecular GxE for caffeine have been studied so far, thus the transcription factors and regulatory sequences involved in caffeine GxE remain uncharacterized. Furthermore, it is important to study GxE in the relevant cell type; that is, endothelial cells which constitute the vascular endothelium. For these reasons, it is crucial to investigate and validate the mechanisms behind caffeine GxE in vascular endothelial cells.

Massively parallel reporter assays (MPRA) have allowed studies of noncoding genetic variants and their role in gene regulation, at unprecedented scale (Arnold et al., 2013; Gordon et al., 2020; Kalita et al., 2018; Melnikov et al., 2012; Patwardhan et al., 2012; Tewhey et al., 2016; Ulirsch et al., 2016; Vockley et al., 2015; Wang et al., 2018). Originally developed to study the gene regulatory potential of promoters and enhancer sequences, MPRA protocols have been further developed to study regulatory genetic variation and fine map association signals (Kalita et al., 2018; Tewhey et al., 2016; Ulirsch et al., 2016; Vockley et al., 2015). MPRAs with synthetic regulatory sequences can test allelic activity for candidate regulatory variants independently of their allele frequency in the population (Kalita et al., 2018; Tewhey et al., 2016; Ulirsch et al., 2016; Vockley et al., 2015). In MPRAs, DNA sequences containing each allele are transfected into cells and RNA-seq is used to quantify the transcripts for each allele. To directly test allelic effects of tens of thousands of candidate regulatory sequences predicted to affect transcription factor binding (CentiSNPs; Moyerbrailean et al., 2016a), we used an MPRA called Biallelic Targeted STARR-Seq (BiT-STARR-Seq) (Kalita et al., 2018). Only two previous studies have used MPRAs to investigate DNA sequences that regulate the transcriptional response to treatments (Johnson et al., 2018; Shlyueva et al., 2014). One study utilized STARR-Seq to characterize enhancer activity in Drosophila cells upon treatment with the hormone ecdysone; however, it did not investigate GxE (Shlyueva et al., 2014). The other study utilized STARR-Seq to investigate the response to glucocorticoid treatment in the human cell line A549 (Johnson et al., 2018). Although this study investigated GxE interactions, only a small number of variants were tested as this study was limited to preexisting variation within the samples (as opposed to designed target sequences) and only two variants had significant GxE (Johnson et al., 2018). We aim to identify and validate the DNA sequences that regulate the transcriptional response to caffeine in the vascular endothelium and how genetic variation present in these regulatory elements may affect the transcriptional response to caffeine (Figure 1A).

Figure 1. Study design.

(A) Genetic variants modulate transcriptional response dependent on environmental conditions. The pink bars represent different alleles present in our targets, and the green triangle represents a bound transcription factor. These target sequences are transfected into cells, RNA is extracted and sequenced, and then activity is measured for targets for both alleles. (B) Library composition based on annotation category: single-nucleotide polymorphisms (SNPs) predicted to alter transcription factor binding using a combination of sequence information and experimental data (centiSNPs) (Moyerbrailean et al., 2016b), SNPs associated with complex traits (genome-wide association studies [GWAS]), and expression quantitative trait loci (eQTL) in GTEx.

Figure 1.

Figure 1—figure supplement 1. Coverage histograms for all 12 libraries used in this study.

Figure 1—figure supplement 1.

Caffeine libraries are on the top, and water libraries are on bottom.

Results

Active regulatory regions in response to caffeine

For this study, we used a library of targets that was designed to capture a large number of predicted gene regulatory variants in motifs for hundreds of transcription factor binding sites (Figure 1B). The targets in our library consisted of self-transcribing enhancer regions containing a minimal promoter, a reporter gene, and the oligonucleotide containing the candidate regulatory SNP. These candidate regulatory SNPs belong to multiple categories, including SNPs predicted to alter transcription factor binding using a combination of sequence information and experimental data (centiSNPs) (Moyerbrailean et al., 2016b), SNPs associated with complex traits (genome-wide association studies [GWAS]), and eQTLs in GTEx. In addition, we included 1676 negative control sequences. To test if these putative regulatory sequences mediate the response to caffeine, we used DESeq2 to test for differential activity of the targets in cells treated with caffeine compared to cells in the control group (see ‘Materials and methods’ for the specific model). The library contained motifs in both the forward and reverse orientations within separate targets. Since these motifs may induce direction-specific effects, we performed the differential activity analysis per each direction separately (see Figure 2—figure supplement 1; see also Figure 2—figure supplements 2 and 3), and considered any target with false discovery rate (FDR) < 10% in either direction as significant. We observed 772 significantly differentially active targets: 546 upregulated targets and 226 downregulated targets (Figure 2A, Supplementary file 1), showing that caffeine overall increases the activity of the regulatory elements.

Figure 2. Active regulatory regions in caffeine response in vascular endothelial cells.

(A) Volcano plot for DESeq2 results showing targets differentially active in caffeine. The light red points are significant (false discovery rate [FDR] < 10%) targets containing caffeine response factor binding sites, the black points are significant targets not containing a caffeine response factor binding site, and the gray points are nonsignificant targets. The inset contains a QQ plot for targets containing a caffeine response factor binding site (red), or no caffeine response factor binding site (black). (B) Motifs enriched via test of proportions (p<0.05) within differentially active targets. Names of caffeine response factors are bolded. For B-D panels error bars represent the 95% confidence interval (motif occurrence, n >100). (C) Motifs enriched via test of proportions within upregulated targets (p<0.05). (D) Motifs enriched via test of proportions within downregulated targets (p<0.05).

Figure 2.

Figure 2—figure supplement 1. Principal component analysis (PCA) plots from read count data in the first (A) and second (B) batch (experiment), annotated by direction.

Figure 2—figure supplement 1.

Figure 2—figure supplement 2. Principal component analysis (PCA) plots from read count data in the first (A) and second (B) batch (experiment), annotated by allele.

Figure 2—figure supplement 2.

Figure 2—figure supplement 3. Heatmap plots from read count data in the first (A) and second (B) batch (experiment).

Figure 2—figure supplement 3.

We then focused on differentially active targets containing a known caffeine response factor as determined based on chromatin accessibility data from endothelial cells treated with caffeine (Findley et al., 2019). We observed that these targets had lower p-values, as observed in the QQ plot in Figure 2A (inset). To identify any additional transcription factors that may be important for the response to caffeine, we conducted a motif scan for 838 known transcription factor binding motifs using the JASPAR CORE Vertebrates 2022 database (Castro-Mondragon et al., 2022; Supplementary file 2). We found 19 motifs that were enriched for being within differentially active targets (Figure 2B, Supplementary file 3). We found the motif for ZNF423, one of the caffeine response factors, was enriched within the differentially active targets. The three most enriched motifs were NFATC1, NFATC4, and NFATC2. The NFAT transcription factor family is known for their involvement in the Ca2+/NFAT pathway. This signaling pathway plays an important role in maintaining the homeostasis of vascular endothelial cells (Wang et al., 2020) and contributes to the mediation of proliferation and migration (Johnson et al., 2003; Wang et al., 2020). Thus, improper signaling of the Ca2+/NFAT pathway can induce endothelial dysfunction (Garcia-Vaz et al., 2020; Wang et al., 2020). In diabetic mice, NFAT expression exacerbated atherosclerosis (Blanco et al., 2018; Zetterqvist et al., 2014) and increased foam cell formation (Du et al., 2021). In human coronary artery smooth muscle cells, NFAT signaling mediates vascular calcification (Goettsch et al., 2011). To better understand the regulatory response to caffeine, we then investigated which motifs were enriched for being within upregulated or downregulated targets separately (Supplementary file 3). We observed 19 motifs enriched for being within upregulated targets (Figure 2C) and 23 motifs enriched for being within downregulated targets (Figure 2D). Motifs enriched for being within upregulated targets include the previously mentioned NFATC1 and ZNF423. Motifs enriched for being within downregulated targets include the previously mentioned NFATC2 and ZNF423. We also observed that the motif for SREBF2, also called SREBP2, is enriched for being within downregulated targets. In hepatocytes, caffeine is known to suppress SREBF2 activity, which reduces PCSK9 expression, and thus increases LDLR expression, which could be protective against CVD (Lebeau et al., 2022). The corresponding transcription factors for these motifs could also play a role in mediating the response to caffeine in vascular endothelial cells. The motif for TEAD4 was also identified as enriched for being within downregulated targets. Interestingly, a CAD GWAS risk variant disrupts binding of TEAD4 in smooth muscle cells, causing lower expression of p16, which could potentially contribute to the risk identified at this locus (Almontashiri et al., 2015).

Allelic effects on gene regulation within conditions and in response to caffeine

To investigate how genetic variation affects regulatory sequences and their function in cells treated with caffeine and in the control samples, we tested for ASE. Since the library contained the same sequence in both the forward and reverse orientations in independent targets and the regulatory effect may be direction-dependent, we tested for ASE in each SNP/direction pair separately (Supplementary file 4; Figure 3—figure supplement 1, see ‘BiT-STARR-Seq Library Design’ section for a more detailed description of terminology used). We observed 689 SNP/direction pairs (corresponding to 665 SNPs) with significant ASE out of 50,914 SNP/direction pairs (30,680 SNPs) tested (2.2%, FDR < 10%) (Figure 3A). Additionally, our library contained negative control sequences that were predicted to not have an allelic regulatory effect. These negative control sequences tend to have higher p-values than other sequences in our library, as predicted (Figure 3A). These results demonstrate that genetic variation within regulatory sequences within our library can modulate gene expression levels in vascular endothelial cells.

Figure 3. Allelic effects on gene regulation within conditions and in response to caffeine.

(A) QQ plot depicting the p-values for allele-specific effects (ASE), with negative control sequences labeled in lighter red (caffeine) or lighter blue (control). (B) QQ plot depicting the p-values for conditional allele-specific effects (cASE), with targets containing caffeine response factor motifs annotated in pink and all other sequences in blue. (C) Enrichment via Fisher’s exact test (p<0.05) of ASE (green) and cASE (purple) variants in open chromatin regions and targets containing artery expression quantitative trait loci (eQTL). Error bars indicate 95% confidence intervals. The sample sizes for each test are reported in Supplementary file 3.

Figure 3.

Figure 3—figure supplement 1. Z-score scatter plot from allele-specific effect (ASE) analysis comparing caffeine and water.

Figure 3—figure supplement 1.

Figure 3—figure supplement 2. Distribution of ZT-ZC (case_z) scores from conditional allele-specific effect (cASE) analysis.

Figure 3—figure supplement 2.

Figure 3—figure supplement 3. Artery expression quantitative trait loci (eQTL) enrichment via Torus for footprint single-nucleotide polymorphisms (SNPs) (control, black) or caffeine response factors (caffeine, red).

Figure 3—figure supplement 3.

‘Meta’ is the combination of all three artery tissue types in GTEx. Error bars indicate 95% confidence interval. 3,401,115 SNPs were used in this analysis.

To directly test for GxE in the molecular response to caffeine, we tested for conditional allele-specific effects (cASE), where ASE is only significant in one condition, or significantly different between the two conditions. When testing for cASE, we observed 6 significant SNP/direction pairs (corresponding to 6 SNPs) out of 23,814 (15,927 SNPs) tested (FDR < 5%) (Figure 3B, Supplementary file 4, Figure 3—figure supplement 2). Additionally, we annotated which of these cASE targets contains a known caffeine response factor as defined in Findley et al., 2019, based on ATAC-seq data. There is an enrichment for these targets containing caffeine response factors, as expected (Figure 3B). The size of this enrichment may be underestimated due to the relatively small amount of caffeine response factor motifs (4) present in the designed library. Importantly, these variants contribute to inter-individual variation in response to caffeine. Thus, we sought to further characterize these variants.

To investigate the regulatory architecture underlying these genetic effects on gene expression, we asked whether ASE (FDR < 10%) and cASE variants (here defined at a nominal p<0.0215, N = 569) were enriched in open chromatin regions as annotated in Findley et al., 2019. ASE variants were 2.4-fold enriched within open chromatin regions (p<2.2e-16, Figure 3C, green), while a more moderate trend was observed for cASE (1.2-fold, p=0.088, Figure 3C, purple). This difference in enrichment could be due to the difference between the native chromatin context versus the reporter assay context. Environmental effects on gene regulatory sequences may have a more complex regulatory architecture influenced by the chromatin context that may explain the difference in the enrichment results between ASE and cASE (Supplementary file 3).

Genetic regulation of gene expression can be context-dependent, with factors such as cell type (Donovan et al., 2020; Kim-Hellmuth et al., 2020), developmental states (Cuomo et al., 2020; Strober et al., 2019), and environmental stimuli all contributing to GxE (GxE-eQTLs, also known as response eQTLs, dynamic eQTLs, context-eQTLs) (e.g., see Alasoo et al., 2019; Barreiro et al., 2012; Çalışkan et al., 2015; Findley et al., 2021; Kim-Hellmuth et al., 2017; Maranville et al., 2011; Moyerbrailean et al., 2016b). These context-specific effects can be captured without large cohorts if the appropriate experimental design is applied (Findley et al., 2021). Allele-specific expression experiments in two different conditions can detect GxE in small sample sizes compared to eQTL studies (Moyerbrailean et al., 2016b). To investigate the abundance of GxE missing from large databases such as GTEx (Consortium, 2020), we tested if cASE variants were enriched for artery eQTLs. Using variants within the open chromatin regions described above, we conducted a Fisher’s exact test and found that ASE variants were 1.38 times more likely to be artery eQTLs (p=0.01, Figure 3C, green) compared to variants that did not show ASE. We observed a more moderate trend for cASE variants (1.2 times more likely to be artery eQTLs, p=0.26, Figure 3C, purple, Supplementary file 3).

Characterizing ASE and cASE across transcription factor motifs

We hypothesized that the regulatory context defined by the transcription factor motifs present in each target determines the effect of a genetic variant on expression in caffeine treated cells. We conducted a motif scan of the library of targets for 838 known transcription factor binding motifs from JASPAR (Castro-Mondragon et al., 2022; Supplementary file 2). We then used a test of proportions to identify any motifs that were disproportionately within targets containing significant ASE or cASE variants. For targets containing ASE variants, we observed 44 enriched motifs (Figure 4). For targets containing cASE variants, we observed 18 enriched motifs (Figure 4, Supplementary file 3 ). Factors of interest for cardiovascular function include NRF1, enriched for targets containing cASE and ASE variants, which is known to regulate lipid metabolism (Hirotsu et al., 2012; Huss and Kelly, 2004), and is annotated as part of the lipid metabolism pathway in Reactome (Fabregat et al., 2018). KLF15 and KLF14 are also enriched in targets with cASE and ASE. KLF15 is involved in cardiac lipid metabolism (Prosdocimo et al., 2014; Prosdocimo et al., 2015), and KLF14 has previously been associated with CVD (Fryar et al., 2012; Hu et al., 2018). Lastly, SREBF2, which was identified as enriched within our differential activity results, also has implications for disease as discussed previously. This implies that SREBF2 are important both in the interindividual response to caffeine as well as disease state, linking our identified GxE with atherosclerotic disease.

Figure 4. Transcription factors contributing to allele-specific effect (ASE) and conditional allele-specific effect (cASE).

Figure 4.

Motifs enriched via test of proportions (p<0.05) for significant ASE (green) or cASE (purple). The dotted lines represent the baseline proportion (mean number of significant variants within any motif) for ASE (green) and cASE (purple). Bolded factors are caffeine response factors as defined in Findley et al., 2019. Error bars represent the 95% confidence interval, motif occurrence n>100.

We also wanted to investigate if ASE and cASE variants were disproportionately present in caffeine response factor binding sites, which may indicate that caffeine response factors’ regulatory function may be modified by genetic variation. For this analysis, we used annotations from Findley et al., 2019, which defines caffeine response factors as transcription factors with motifs that were significantly enriched or depleted in differentially accessible chromatin after treatment with caffeine. Factors INSM1 and PLAG1 are caffeine response factors as defined in Findley et al., 2019, confirming that genetic variation may modulate the response to caffeine by increasing binding activity of these transcription factors.

Validation of the regulatory mechanism for fine-mapped artery eQTLs

Computational fine-mapping is a commonly used method to identify causal variants, often for complex traits; however, further functional validation is usually needed to confirm the regulatory mechanism underlying fine-mapped causal variants. We previously showed that artery eQTLs are enriched in caffeine response factor motifs (also see Figure 3—figure supplement 3; Findley et al., 2019). We now leverage this finding to fine-map artery eQTLs using DAP-G and the caffeine response factor annotation from Findley et al., 2019. In our library, we tested 187 fine-mapped variants. We identified significant ASE for six SNPs (six SNP/direction pairs), thus validating the regulatory function of these fine-mapped causal eQTLs (Supplementary file 5). We also identified two fine-mapped artery eQTLs with significant cASE (p<0.0215; 2 SNP/direction; Supplementary file 5), which may represent hidden GxE in GTEx.

We then investigated if the genes shown to be linked to CAD and hypertension risk using both TWAS and colocalization analysis (INTACT, Okamoto et al., 2023) can be further modulated by GxE with caffeine. INTACT combines TWAS and colocalization approaches, thus overcoming the TWAS inherent linkage disequilibrium limitations. We identified eight cASE variants that regulate the expression of genes associated with CAD and hypertension (<10% FDR for INTACT analysis, p<0.0215 for cASE analysis, Supplementary file 6). We used available experimental data and computational predictions to dissect the regulatory mechanisms underlying context-specific allelic effects for two of these SNPs and their impact on CAD risk (Figures 5 and 6). We considered allelic effects on transcription factor binding, expression of the transcription factors, and disease risk. Specifically, we considered (i) predicted effects on transcription factor binding from CentiSNP, which uses a combination of sequence information and chromatin accessibility data to predict alleles that increase binding of specific transcription factors (Moyerbrailean et al., 2016a); (ii) changes in the expression of the genes encoding for the relevant transcription factors and changes in chromatin accessibility (Findley et al., 2019); and (iii) allelic effect on the target genes (this study and GTEx).

Figure 5. Example genetic variant with GxE with caffeine where caffeine may increase genetic risk of CAD.

Figure 5.

(A) Potential mechanism for rs4938344. (B) Transcription factors containing rs4938344 in a binding site are upregulated (via DESeq2) upon caffeine exposure (error bars are +/- standard error, FDR<10%, n=14). (C) Logos of transcription factor motifs with rs4938344 highlighted. (D) GTEx violin plot for AP000892.6. (E) Effect size from the BiT-STARR-Seq assay for this single-nucleotide polymorphism (SNP) within each condition (error bars are +/- standard error, n=4 replicates per condition, cASE p<0.0215, see ‘cASE Analysis’ section of methods). (F) Locus zoom plots showing genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) data for hypertension (left) and coronary artery disease (CAD) (right) in tibial artery tissue.

Figure 6. Example genetic variant with GxE with caffeine where caffeine may decrease genetic risk of CAD.

Figure 6.

(A) Potential mechanism for rs4527034. (B) TERF2IP is upregulated (via DESeq2) upon caffeine exposure (error bars are +/- standard error, FDR<10%, n = 14). (C) Logos of TERF2IP motif with rs4527034 highlighted. (D) GTEx violin plot for KAT8. (E) Effect size from the BiT-STARR-Seq assay for this single-nucleotide polymorphism (SNP) within each condition (error bars are +/- standard error, n=4 replicates per condition, cASE p<0.0215, see ‘cASE Analysis’ section of methods). (F) Locus zoom plots showing genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) data for hypertension in coronary artery tissue.

rs4938344 is an eQTL regulating the long noncoding RNA AP000892.6. The reference allele at this locus, G, results in decreased expression of AP000892.6 (as measured in GTEx and in the caffeine condition of our assay, Figure 5D and E, respectively). INTACT associated high expression of AP000892.6 with decreased risk of hypertension and CAD (Figure 5F). This SNP is predicted to modulate binding of GABP (a known repressor of transcription; Genuario and Perry, 1996) and ETV1 at this site (Figure 5C). These transcription factors are upregulated in caffeine-exposed endothelial cells (Findley et al., 2019; Figure 5B). This increase in expression uncovers allelic differences in gene regulation which are not detected in the absence of caffeine, likely because of the low expression of the repressor. The allelic differences in binding of these factors should lead to allelic differences in the expression of AP000892.6. Accordingly, the reference allele for this variant exhibited lower activity in response to caffeine in our BiT-STARR-Seq experiments (Figure 5E). This effect is consistent with the GTEx artery eQTL for AP000892.6 (Figure 5D). In summary, caffeine induces higher expression of the ETV1 and GABP transcription factors, which then bind preferentially to the reference allele at rs4938344; this results in lower expression of AP000892.6 and increased risk for CAD and hypertension (Figure 5A). AP000892.6 interacts with the RB1CC1 RNA (Gong et al., 2018), which may play a role in atherosclerosis via its function in forming the autophagosome (Chen et al., 2021).

rs4527034 is an eQTL regulating the KAT8 gene. The reference allele at this locus, A, results in decreased expression of KAT8 (as measured in GTEx and in the control condition of our assay, Figure 6D and E, respectively). INTACT associated high expression of KAT8 with increased risk of hypertension (Figure 6F). This SNP is predicted to modulate binding of the TERF2IP transcription factor at this site (Figure 6C). TERF2IP is upregulated in caffeine-exposed endothelial cells (Findley et al., 2019; Figure 6B). This increase in expression may saturate all binding sites in the caffeine condition, while the transcription factor may only bind to the preferential allele in the control condition. The allelic differences in binding of these factors should lead to allelic differences in expression of KAT8 in the control condition, which is what we observe both in our BiT-STARR-Seq experiments (Figure 6E) and in GTEx artery eQTL for KAT8 (Figure 6D). In summary, in the absence of caffeine, TERF2IP binds preferentially to the reference allele at rs4527034, which results in lower expression of KAT8 and reduced risk for hypertension. In the presence of caffeine, TERF2IP is upregulated, resulting in increased binding and lower expression of KAT8, independently of the genotype, with an expected overall protective effect on hypertension. Confirming this potential mechanism for disease risk, TERF2IP expression levels were found to affect plaque formation in a mouse model (Kotla et al., 2019). High expression of KAT8, a histone acetyltransferase, also coincides with atherosclerotic progression, and histone acetylation increases in plaques within vascular endothelial cells (Greißel et al., 2016; Zhang et al., 2018).

Discussion

This study utilized the MPRA BiT-STARR-Seq to identify gene regulatory activity in vascular endothelial cells exposed to caffeine. By utilizing BiT-STARR-Seq, we were able to identify a molecular response to caffeine, ASE, and cASE. By combining our results with preexisting annotations, we were able to characterize variants exhibiting cASE and identify potential mechanisms for some of these variants.

Heart disease is one of the leading causes of death in the United States according to the CDC (http://wonder.cdc.gov/ucd-icd10.html). The most common type of heart disease is CAD, which affects over 18 million adults over the age of 20 y (Fryar et al., 2012). The common risk factors of CAD include hypertension, high cholesterol levels, and family history (Brown et al., 2022; Hajar, 2017). CAD occurs when plaques form in the arteries (atherosclerosis), causing a narrowing of the artery, which reduces blood flow to the heart. The innermost layer of the artery is composed of endothelial cells. The endothelium is involved in several important functions, including regulation of blood flow, angiogenesis, thrombosis, and coagulation (Hadi et al., 2005; Krüger-Genge et al., 2019). Endothelial dysfunction occurs in diseases such as atherosclerosis and hypertension (Xu et al., 2021), eventually leading to CAD (Matsuzawa and Lerman, 2014). The molecular mechanisms behind endothelial dysfunction and the resulting diseases are largely unknown. Characterizing these molecular mechanisms is crucial in order to gain a more complete understanding of these disease phenotypes. Additionally, although caffeine is known to produce a vascular response, the current literature does not come to a consensus on the role of caffeine in CAD risk. Here, we characterized the regulatory response of noncoding variants to caffeine in vascular endothelial cells using an MPRA.

BiT-STARR-Seq, the MPRA used in this study, has several advantages over other methods used to detect GxE. One common method of detecting GxE is response eQTL mapping, which includes collecting samples from large cohorts and exposing those cells to environmental perturbations (e.g., see Alasoo et al., 2019; Alasoo et al., 2018; Barreiro et al., 2012; Çalışkan et al., 2015; Fairfax et al., 2014; Huang et al., 2020; Kim-Hellmuth et al., 2017; Knowles et al., 2018; Lee et al., 2014; Mangravite et al., 2013; Manry et al., 2017; Maranville et al., 2011; Nédélec et al., 2016; Quach et al., 2016). This method has several disadvantages as it cannot easily interrogate rare variants, relies on variation existing in a cohort (instead of investigating variants of interest), and requires larger cohort sizes to have enough power to detect GxE. In contrast, because our method uses a designed library of targets (Kalita et al., 2018), we are able to interrogate rare variants easily as our targets are synthesized. Similarly, we can design a library of specific variants to investigate (such as candidate regulatory variants) instead of relying on variation within a cohort (Kalita et al., 2018). BiT-STARR-Seq also allows us to directly compare two alleles within the same sequence context without requiring a large cohort. Despite the advantages of BiT-STARR-Seq, unlike response eQTL mapping, we are not interrogating these SNPs in their native chromatin context. Future work may include using genome-editing tools such as CRISPR to directly insert the desired variants in their endogenous locations in the genome. For our study, we determined BiT-STARR-Seq to be the ideal assay to determine GxE for a large number of SNPs.

We observed a regulatory response to caffeine treatment, consistent with previous studies in the same cell type (Findley et al., 2019; Moyerbrailean et al., 2016b). These results suggest that caffeine exposure significantly changes the regulatory activity of vascular endothelial cells, which may have important implications regarding the impact of lifestyle in CAD. As caffeine may modulate gene regulatory activity, the resulting impact on gene expression may increase or decrease CAD risk. In addition, we identified novel transcription factors contributing to the regulatory response to caffeine including several NFAT transcription factors, and SREBF2. NFAT transcription factors are largely known for their role in the Ca2+/NFAT signaling pathway, where Ca2+ binds to calmodulin, stimulating calcineurin, which then causes NFAT factors to localize in the nucleus (Crabtree and Olson, 2002; Klee et al., 1998). Caffeine is known to cause an increase in Ca2+ in human aortic endothelial cells (Corda et al., 1995), so it is understandable that we find these factors enriched for being within targets that respond to caffeine exposure. SREBF2, also known as SREBP2, is involved in sterol homeostasis (Horton et al., 2003). This result implies these novel transcription factors important for the regulatory response to caffeine may also contribute to understanding the role of caffeine in CAD risk. This coincides with findings that caffeine exposure can alter expression of genes, including those for transcription factors in mouse cardiomyocytes (Fang et al., 2014). Another study aimed to uncover mechanisms relevant to CVD upon caffeine exposure and found that caffeine inhibits the transcription factor SREBP2, which causes an overall protective effect against CVD (Lebeau et al., 2022). These results coincide with our findings.

Noncoding regions of the genome contain regulatory variants that modulate gene expression. In this study, we identify and characterize over 600 variants exhibiting ASE. Numerous noncoding variants have been implicated in CAD risk via GWAS (Hartmann et al., 2022; Kessler and Schunkert, 2021; Koyama et al., 2020; Nikpay et al., 2015; Temprano‐Sagrera et al., 2022; van der Harst and Verweij, 2018), but they are generally uncharacterized. Few lead noncoding variants have been thoroughly investigated. One of these led the authors to propose and validate a molecular mechanism connecting expression of the EDN1 gene to the phenotypic outcome (Gupta et al., 2017; Wang and Musunuru, 2018). The specific mechanisms that detail how noncoding variants contribute to CAD will be critical in understanding CAD risk and ultimately developing clinical treatments. While understanding various genetic risk factors for CAD is important, GxE for these variants also have an impact on phenotype and have not been widely studied.

Since previous studies have shown that GxE-eQTL can modulate complex disease risk, we expect that GxE detected in our assay may be relevant to CAD (Alasoo et al., 2019; Alasoo et al., 2018; Barreiro et al., 2012; Çalışkan et al., 2015; Fairfax et al., 2014; Findley et al., 2021; Huang et al., 2020; Kim-Hellmuth et al., 2017; Knowles et al., 2018; Lee et al., 2014; Mangravite et al., 2013; Manry et al., 2017; Maranville et al., 2011; Nédélec et al., 2016; Quach et al., 2016). We tested for cASE, which occurs when ASE are only significant in one condition, or significantly different between the two conditions. This analysis identifies GxE which are important in understanding disease risk while accounting for genetic and environmental context. We identified 6 variants that regulate the gene expression response to caffeine and demonstrated that context-aware MPRAs can be used to dissect molecular mechanisms underlying cardiovascular health.

By fine-mapping artery eQTLs and combining the data collected from our assay with preexisting annotations, we produced potential mechanisms for two cASE variants through altered transcription factor expression and binding in response to caffeine. Our results indicated that both genetic and environmental factors are important in determining risk, and that the interaction between these factors can be informative to mechanisms and phenotypic consequences. Importantly, by utilizing multiple functional annotations, we are able to identify variants that may be relevant to disease but did not reach genome-wide significance in GWAS, possibly because of their context-specific effects. By studying different environmental contexts, we can identify that, in these instances, the presence of caffeine can impact the risk of poor cardiovascular health outcomes. If environmental context was not considered and this work was conducted solely in the control condition, the caffeine modulatory effect on risk would have been missed.

Although we investigate GxE for caffeine in vascular endothelial cells, our experimental approach can be applied to various different complex diseases and their relevant cell types and treatments. To further validate our work, genome-editing tools could be used to investigate the effect of these variants in their native chromatin context. Additional validation could include allele-specific and condition-specific transcription factor binding assays (such as electrophoretic mobility shift assays) for the fine-mapped variants.

Our study demonstrates the importance of considering environmental contexts when investigating gene regulatory activity as we identify several thousand instances of GxE in our library of candidate regulatory variants. Our data, combined with preexisting annotations, allowed us to identify transcription factors involved in GxE in caffeine and describe specific potential molecular mechanisms for some of these GxE. Our results provide important insights into the molecular regulatory effect of caffeine exposure and GxE for caffeine in vascular endothelial cells.

Materials and methods

Cell culture

Human umbilical vein endothelial cells (HUVECs) were obtained from Lonza (Cat# CC-2517-0000315288). Cells were cultured at 37°C with 5% CO2 and seeded at 5000 cells/cm2. EGM-2 growth medium (Lonza) was used to culture the cells.

Treatment

Treatment concentration was the same as used in previous studies (Findley et al., 2019; Moyerbrailean et al., 2016b). We used a caffeine concentration of 1.16 × 10–3 M. In addition, water was used as a vehicle control as that was the solvent used to prepare the caffeine treatment.

BiT-STARR-Seq library design

We designed 43,556 target regulatory regions each containing an SNP in the middle and with a total length of 200 nucleotides. This set of targets corresponds to 87,112 constructs each containing only one of two alleles at the test SNP. Additionally, each construct can be integrated in the forward or reverse orientation, leading to a maximum of 174,224 constructs in either direction. Please also see below for a description of how we use library-related terms throughout the article. The library used is the same as reported in Kalita et al., 2018. Briefly, the library of target regulatory sequences consisted of several categories of regulatory variants, including eQTLs (Innocenti et al., 2011; Wen et al., 2015), SNPs predicted to disrupt transcription factor binding (centiSNPs) (Moyerbrailean et al., 2016a), and SNPs associated with complex traits in GWAS (Pickrell, 2014). Negative controls that were not predicted to have a regulatory effect were also included in the library (Moyerbrailean et al., 2016a). It is important to note that these negative controls are only predicted not to have a regulatory effect via computational annotation (Moyerbrailean et al., 2016a), so they may not be representative of true negative controls. This is why we largely do not utilize these SNPs as negative controls within our analyses. Our predictions of regulatory activity also did not account for environmental context, thus these sequences are also not suited to annotate our cASE results.

  • SNP (n = 43,556): Refers to a genetic variant tested for allelic effects on gene regulation.

  • Target (n = 43,556): 200-nucleotide-long oligonucleotide sequence that contains the test SNP in the middle of the target.

  • Construct (n = 87,112): Synthesized 200-nucleotide-long oligonucleotide sequence that contains only one of the two possible alleles at the test SNP. Each target corresponds to two constructs.

  • Direction: Constructs can integrate in either the forward or reverse direction relative to the direction of transcription in the BiT-STARR-Seq plasmid. Therefore, two directions are possible for each construct.

  • SNP/direction pair (n = 87,112): An SNP tested for allelic effects on gene regulation contrasting the expression of two constructs that are integrated in the same direction. All statistical tests are performed at this level, testing in each direction separately.

BiT-STARR-Seq experiments

Oligonucleotides were synthesized and used to create a DNA plasmid library, which was sequenced and used as a subsequent input for the ASE analysis. The DNA library was transfected into HUVECs using the Lonza Nucleofector X platform. Cells were electroporated using the DS-120 setting with primary cell solution P5. Caffeine was added at 1.16 × 10–3 M after transfection. Cells were incubated for 24 hr and lysed. We completed 6 replicates per treatment condition (caffeine and the water vehicle control) or 12 replicates in total.

Library preparation and sequencing

RNA was extracted using the RNeasy Plus Mini kit (QIAGEN, Cat# 74136). A cDNA library was prepared using the Superscript III First-Strand Synthesis kit (Invitrogen, Cat# 18080-400). Sequencing was completed using the Illumina Nextseq 500 to generate 125 cycles for read 1, 30 cycles for read 2, 8 cycles for the fixed multiplexing index 2, and 10 cycles for index 1 (variable barcode). The average sequencing depth per library was 39,235,611 reads, for a total of 470,827,333 reads (Figure 1—figure supplement 1, Supplementary file 7).

Processing sequence data

To analyze the RNA-seq data, we began by demultiplexing our data using the bcl2fastq software to create demultiplexed FASTQ files. We then aligned to hg19 using HISAT2. Afterward, we applied a filter to ensure the UMIs present match the expected UMI pattern (RDHBVDHBVD). Reads with short UMIs or those that do not match the expected sequence were removed. The resulting BAM files were then deduplicated using UMItools. We ran samtools mpileup followed by bcftools query to output read counts per each allele/direction combination.

Differential activity analysis

To test for a molecular response to caffeine, we used the R package DESeq2 (Love et al., 2014). To determine which model would best test for a molecular response to caffeine, we completed principal component analysis to identify major sources of variation. We identified that the first PC clearly represented allelic effects (Figure 2—figure supplement 2, also see Figure 3—figure supplements 1 and 3), thus we included allele (reference or alternate) as part of our model. Our model tested the effect of treatment, correcting for allele (~allele + treatment), as we observed a strong allelic effect. We ran DESeq2 for each direction (see Figure 2—figure supplement 1) as the library contained motifs in both the forward and reverse orientations within separate target sequences. We considered targets as significant with Benjamini–Hochberg FDR <10%.

ASE analysis

To test for ASE, we utilized the R package quantitative allele-specific analysis of reads (QuASAR-MPRA) (Kalita et al., 2017). QuASAR-MPRA is an extension of the QuASAR package which allows for analysis of barcoded MPRA data. QuASAR-MPRA uses a beta-binomial model and accounts for uneven initial allelic proportions present in the DNA library. We used the fitQuasarMpra() function to test for ASE in each experiment separately, estimating the ASE effect and its standard error. For each SNP/direction pair, we meta-analyzed the effect size using a weighted mean utilizing inverse-variance weighting for each condition separately. The z-score for each SNP-direction pair is subsequently calculated as the meta-analyzed effect size minus the DNA proportion, divided by the meta-analyzed standard error of the effect size. We then required each identifier to be within four or greater replicates (out of the six total replicates) and performed multiple test correction using the Benjamini–Hochberg procedure. Significant ASE was then defined as having an FDR <10%.

cASE analysis

To test for cASE, we used a method previously developed by our lab called differential allele-specific test, or ΔAST. The calculation for this parameter ΔZ is provided in Moyerbrailean et al., 2016a as well as below. The QuASAR-MPRA package outputs betas for the treatment (βT) and the control (βC), as well as the standard error (se) for both groups, which are used to calculate a Z score for each condition independently (βse). To contrast ASE between conditions we define the cASE statistic (ΔZ)2 , as (ΔZ)2 = (ZT - ZC)2 /2. From this (ΔZ)2 statistic, we used the genomic control procedure (Devlin et al., 2001; Devlin and Roeder, 1999) to calibrate the median χ2 to the expected value and we calculated the p-values using the χ2 distribution. We then used the p.adjust() function to perform multiple test correction using the Benjamini–Hochberg procedure. Significant cASE was then defined as having an FDR<5%.

Motif enrichment analysis

The motif scan was completed using the PWMScan tool, using all PWMs within the JASPAR (Castro-Mondragon et al., 2022) 2022 CORE database (838 motifs). A threshold of -t 10 (base 2) was used for the motif scan, which was restricted to the regions of our designed targets. Once the scan was complete, motifs that were present less than 100 times in the library were removed. For differentially active targets in response to caffeine, 222 motifs passed this filter. For motifs containing ASE variants, 359 motifs passed this filter. For motifs containing cASE variants, 417 motifs passed the filter. A test of proportion (prop.test() in R) was performed per each transcription factor, where the null proportion was the total number of significant targets/variants containing/within motifs (differentially active target, ASE or cASE) divided by the total number of nonsignificant targets/variants containing/within motifs. The test was done per motif, where the proportion being tested is the same as the null proportion, but conducted per motif rather than across all motifs. The related calculations are shown below:

  • Expected null proportion = na/nb

  • Observed motif proportion = nc/nd

  • na = number of targets/variants of interest containing/within any motif

  • nb = number of targets/variants containing/within any motif

  • nc = number of targets/variants of interest containing/within specific motif

  • nd = number of targets/variants containing/within specific motif

ASE (FDR <10%) and cASE (p<0.0215) were considered significant. For follow-up analyses of cASE features, we relaxed the significance threshold to nominal p-value <0.0215.

Open chromatin region enrichment analysis

To test if certain variants were within open chromatin regions, we obtained the list of differentially accessible regions tested in Findley et al., 2019. We considered any accessible region (annotated as differentially accessible or not). Bedtools bed intersect tool was used to complete the overlap with the appropriate datasets (DESeq2, ASE, or cASE results), resulting in a list of targets or SNPs that were within open chromatin regions. This annotation was then used to complete the Fisher’s exact test. Enrichments for significant ASE or cASE variants were performed separately.

Artery eQTL enrichment analysis

To determine if certain variants were artery eQTLs, we obtained GTEx v8 (Castro-Mondragon et al., 2022) data for aorta, coronary, and tibial artery tissues. We then intersected the list of variants of interest with the list of artery eQTLs. To determine if artery eQTLs were within differentially active targets, bedtools intersect was used, resulting in a list of targets that contained artery eQTLs. We further subset this list to variants within open chromatin regions (see previous section). This annotation was then used to complete the Fisher’s exact test. Enrichments for significant ASE or cASE variants were performed separately.

Fine-mapping analysis with DAP-G

Based on a previous study (Findley et al., 2019), we define caffeine response factors as transcription factors with motifs that were significantly enriched or depleted in differentially accessible chromatin after treatment with caffeine. We annotated genetic variants into two categories: (1) genetic variants in motifs for response factors and (2) genetic variants in motifs for transcription factors that are not caffeine response factors. By integration of these genetic variants annotation, we estimated the probability of each SNP regulating gene expression in a Bayesian hierarchical model using TORUS (Wen, 2016). These probabilities are then used in DAP-G (Zhang et al., 2020) to fine-map eQTLs from all three artery tissues in GTEx V8. A total of 364,427,888 eQTLs were fine-mapped across three artery tissues. We filtered for a posterior inclusion probability of >0.9.

Identification of putative causal genes

To better understand how cASE variants may impact traits related to cardiovascular health, we performed colocalization analysis of the causal GWAS variants with eQTLs using fastENLOC (Wen et al., 2017) by integration of fine-mapped eQTLs in the three artery tissues (see above section) and fine-mapped GWAS signals for CAD and hypertension. We fine-mapped GWAS using DAP (Wen, 2016). From fastENLOC, we obtained the gene locus-level colocalization probability for each gene which is used to evaluate how the gene is associated with complex traits or diseases. Intuitively, colocalization analysis identified the overlap of causal eQTLs and GWAS hits. However, it lacked the sensitivity due to the failure in distinguishing between the vertical pleiotropy (genetic effects on traits mediated by gene expression) and horizontal pleiotropy (independent effects on gene expression and traits) (Okamoto et al., 2023). To overcome the limitations in a single approach, we combined the evidence from colocalization and TWAS to estimate the probability of putative causal genes using the R package INTACT (Okamoto et al., 2023). Here, the TWAS data we utilized were from PTWAS (Zhang et al., 2020). We determined putative causal genes with FDR < 10%. We then identified variants regulating these putative causal genes using the DAP-G fine-mapping results, requiring a SNP-level colocalization probability >0.5, with no threshold on PIP (Supplementary file 6).

Acknowledgements

We thank the members of the Luca and Pique-Regi laboratories for helpful comments and discussions.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Roger Pique-Regi, Email: rpique@wayne.edu.

Francesca Luca, Email: fluca@wayne.edu.

Stephen CJ Parker, University of Michigan, United States.

Detlef Weigel, Max Planck Institute for Biology Tübingen, Germany.

Funding Information

This paper was supported by the following grants:

  • National Institute of General Medical Sciences R01GM109215 to Francesca Luca, Roger Pique-Regi.

  • National Institute of Environmental Health Sciences R01ES033634 to Francesca Luca, Roger Pique-Regi, Xiaoquan Wen.

Additional information

Competing interests

No competing interests declared.

Author contributions

Data curation, Formal analysis, Validation, Visualization, Writing – original draft, Writing – review and editing.

Data curation, Investigation, Methodology, Writing – review and editing.

Data curation, Formal analysis, Writing – review and editing.

Investigation, Writing – review and editing.

Formal analysis, Visualization, Writing – original draft, Writing – review and editing.

Software, Methodology, Writing – review and editing.

Conceptualization, Supervision, Funding acquisition, Writing – original draft, Project administration, Writing – review and editing.

Conceptualization, Supervision, Funding acquisition, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Differential activity results.

Output from DESeq2. Column ‘V1’ is the genomic position (hg19) of the SNP within the construct being tested, ‘baseMean’ refers to normalized expression, ‘log2FoldChange’ refers to the effect size, ‘lfcSE’ is the standard error of the effect size, ‘stat’ is the log2FoldChange divided by lfcSE, ‘pvalue’ is the nominal p-value, and ‘padj’ is the adjusted p-value. Please refer to DESeq2 documentation for additional details.

elife-85235-supp1.txt (2.1MB, txt)
Supplementary file 2. PWMScan results.

Includes all variants in the designed library that are within a motif from the JASPAR 2022 CORE Vertebrates database determined by PWMScan. Columns 1–3 are the genomic position of the test SNP, column 4 is the rsID for the test SNP, and column 5 is the ID of the motif in JASPAR.

elife-85235-supp2.txt (11.5MB, txt)
Supplementary file 3. Characterization of regulatory regions and variants.

Includes contingency tables for all Fisher’s exact tests reported in this study. Please see the legend provided within the file. Briefly, A1–C5 is the contingency table and summary of results for ASE variants within open chromatin regions, A6–C10 for ASE variants overlapping with artery eQTLs, A11–C15 for cASE variants within open chromatin, and A16–C20 for cASE variants overlapping with artery eQTLs.

elife-85235-supp3.xlsx (10.1KB, xlsx)
Supplementary file 4. Allele-specific effects and conditional allele-specific effects results.

Output from ASE/cASE analysis. The ‘identifier’ column is the SNP/direction pair, ‘meta_estimate’ is the effect size (of the ASE), ‘meta_se’ is the standard error of the effect size, ‘n’ is the number of replicates containing nonzero read counts, ‘DNA_prop’ is the proportion of reads (reference/alternate) present in the DNA library, ‘meta_z’ is the z-score, ‘meta_p’ is the nominal ASE p-value, ‘meta_padj’ is the adjusted ASE p-value before filtering for SNP/direction pairs where n > 3, ‘new_padj’ is the adjusted ASE p-value after filtering for SNP/direction pairs where n > 3, ‘group’ refers to the applicable condition for the statistic (.x suffix for caffeine, .y suffix for control), ‘case_z’ is ZT-ZC, ‘case_p’ is the nominal p-value for cASE, and ‘case_padj’ is the adjusted p-value for cASE.

elife-85235-supp4.txt (11.9MB, txt)
Supplementary file 5. Fine-mapped artery eQTLs with significant ASE or cASE.

The ‘chr’ column contains the chromosome information, ‘pos’ is the genomic position (0-based, hg19), ‘pos1’ is the genomic position (1-based, hg19), ‘identifier’ is the SNP/direction pair, ‘rsID’ refers to rsID, ‘padj_min’ refers to the minimum p-adjusted value for ASE (across conditions), ‘ID’ is an ID to identify the genomic position in the format of chr_pos1, ‘PIP’ is the posterior inclusion probability as calculated by DAP-G, ‘gene’ refers to the Ensembl gene ID, ‘tissue’ refers to the GTEx artery tissues (tibial, aorta, or coronary), ‘group’ identifies if the variant has significant ASE or significant cASE, ‘case_p’ refers to the cASE nominal p-value, and ‘case_padj’ refers to the cASE adjusted p-value.

Supplementary file 6. Variants that regulate putatively casual genes as identified via INTACT.

The ‘Identifier’ column is the SNP/direction pair, ‘ID’ is an ID to identify the genomic position in the format of chr_pos1 (see above), ‘Gene ID’ refers to the Ensembl gene ID, ‘Gene symbol’ refers to the gene symbol, ‘Trait’ is CAD or hypertension (HTN), ‘Tissue’ refers to the GTEx artery tissues (tibial, aorta, or coronary), ‘DAP-G PIP’ is the posterior inclusion probability as calculated by DAP-G, ‘GWAS z-score’ is the z-score for the GWAS of the corresponding trait, ‘GWAS pvalue’ is the nominal p-value for the GWAS of the corresponding trait, ‘PTWAS-INTACT PCG’ is the probability of being a putative causal gene as calculated by INTACT, ‘PTWAS-INTACT FDR’ is the FDR statistic as calculated by INTACT, ‘INTACT z-score’ is the z-score statistic as calculated by INTACT, ‘cASE_p’ is the cASE nominal p-value, ‘cASE_padj_(original)’ is the cASE adjusted p-value, ‘cASE_z’ is ZT-ZC, ‘DE_construct?’ indicates if the construct is significantly differentially active as defined by our assay, ‘Caffeine_ASE_z’ is the z-score for ASE (caffeine condition), ‘Water_ASE_z’ is the z-score for ASE (control condition), ‘centiSNP motifs’ refers to motifs the SNP is within as identified by centiSNP, and ‘centiSNP category’ refers to if the motif is a footprintSNP, effectSNP, or switchSNP as identified by centiSNP.

elife-85235-supp6.xlsx (11.8KB, xlsx)
Supplementary file 7. Read counts.

Read counts for all libraries at different steps of the data processing pipeline. ‘Library Name’ is an identifier used for the sample, ‘Treatment’ is the condition (caffeine or control), ‘Dedup’ refers to the number of reads after deduplication, and ‘>1 read either direction’ describes the number of reads that have >1 read for each allele in either direction.

MDAR checklist

Data availability

FASTQ files and read count data are available at the GEO accession number GSE221514. Supplemental files are available at https://doi.org/10.5281/zenodo.7327508.

The following datasets were generated:

Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Luca F, Pique-Regi R. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. NCBI Gene Expression Omnibus. GSE221514

Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Pique-Regi R, Luca F. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. Zenodo.

The following previously published dataset was used:

Findley AS, Richards AL, Petrini C, Alazizi A, Shanku AG, Davis GO, Hauff N, Sorokin Y, Wen X, Luca F, Doman E, Pique-Regi R. 2019. Gene-Environment Interactions (GxE) and Complex Traits. NCBI Gene Expression Omnibus. phs001176.v3.p1

References

  1. Alasoo Kaur, Rodrigues J, Mukhopadhyay S, Knights AJ, Mann AL, Kundu K, Hale C, Dougan G, Gaffney DJ, HIPSCI Consortium Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response. Nature Genetics. 2018;50:424–431. doi: 10.1038/s41588-018-0046-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alasoo K, Rodrigues J, Danesh J, Freitag DF, Paul DS, Gaffney DJ. Genetic effects on promoter usage are highly context-specific and contribute to complex traits. eLife. 2019;8:e41673. doi: 10.7554/eLife.41673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Almontashiri NAM, Antoine D, Zhou X, Vilmundarson RO, Zhang SX, Hao KN, Chen HH, Stewart AFR. 9p21.3 coronary artery disease risk variants disrupt TEAD transcription factor-dependent transforming growth factor β regulation of p16 expression in human aortic smooth muscle cells. Circulation. 2015;132:1969–1978. doi: 10.1161/CIRCULATIONAHA.114.015023. [DOI] [PubMed] [Google Scholar]
  4. Arnold CD, Gerlach D, Stelzer C, Boryń ŁM, Rath M, Stark A. Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science. 2013;339:1074–1077. doi: 10.1126/science.1232542. [DOI] [PubMed] [Google Scholar]
  5. Barreiro LB, Tailleux L, Pai AA, Gicquel B, Marioni JC, Gilad Y. Deciphering the genetic architecture of variation in the immune response to Mycobacterium tuberculosis infection. PNAS. 2012;109:1204–1209. doi: 10.1073/pnas.1115761109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Blanco F, Heinonen SE, Gurzeler E, Berglund LM, Dutius Andersson AM, Kotova O, Jönsson-Rylander AC, Ylä-Herttuala S, Gomez MF. In vivo inhibition of nuclear factor of activated T-cells leads to atherosclerotic plaque regression in IGF-II/LDLR-/-ApoB100/100 mice. Diabetes & Vascular Disease Research. 2018;15:302–313. doi: 10.1177/1479164118759220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brown JC, Gerhardt TE, Kwon E. Risk Factors For Coronary Artery DiseaseStatPearls. Treasure Island: StatPearls Publishing; 2022. [PubMed] [Google Scholar]
  8. Çalışkan M, Baker SW, Gilad Y, Ober C. Host genetic variation influences gene expression response to rhinovirus infection. PLOS Genetics. 2015;11:e1005111. doi: 10.1371/journal.pgen.1005111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Castro-Mondragon JA, Riudavets-Puig R, Rauluseviciute I, Lemma RB, Turchi L, Blanc-Mathieu R, Lucas J, Boddie P, Khan A, Manosalva Pérez N, Fornes O, Leung TY, Aguirre A, Hammal F, Schmelter D, Baranasic D, Ballester B, Sandelin A, Lenhard B, Vandepoele K, Wasserman WW, Parcy F, Mathelier A. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Research. 2022;50:D165–D173. doi: 10.1093/nar/gkab1113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chen Y, Zeng A, He S, He S, Li C, Mei W, Lu Q. Autophagy-related genes in atherosclerosis. Journal of Healthcare Engineering. 2021;2021:6402206. doi: 10.1155/2021/6402206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chieng D, Canovas R, Segan L, Sugumar H, Voskoboinik A, Prabhu S, Ling LH, Lee G, Morton JB, Kaye DM, Kalman JM, Kistler PM. The impact of coffee subtypes on incident cardiovascular disease, arrhythmias, and mortality: long-term outcomes from the UK Biobank. European Journal of Preventive Cardiology. 2022;29:2240–2249. doi: 10.1093/eurjpc/zwac189. [DOI] [PubMed] [Google Scholar]
  12. Choi Y, Chang Y, Ryu S, Cho J, Rampal S, Zhang Y, Ahn J, Lima JAC, Shin H, Guallar E. Coffee consumption and coronary artery calcium in young and middle-aged asymptomatic adults. Heart. 2015;101:686–691. doi: 10.1136/heartjnl-2014-306663. [DOI] [PubMed] [Google Scholar]
  13. Consortium Gte. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Corda S, Spurgeon HA, Lakatta EG, Capogrossi MC, Ziegelstein RC. Endoplasmic reticulum Ca2+ depletion unmasks a caffeine-induced Ca2+ influx in human aortic endothelial cells. Circulation Research. 1995;77:927–935. doi: 10.1161/01.res.77.5.927. [DOI] [PubMed] [Google Scholar]
  15. Crabtree GR, Olson EN. NFAT signaling: choreographing the social lives of cells. Cell. 2002;109 Suppl:S67–S79. doi: 10.1016/s0092-8674(02)00699-2. [DOI] [PubMed] [Google Scholar]
  16. Cuomo ASE, Seaton DD, McCarthy DJ, Martinez I, Bonder MJ, Garcia-Bernardo J, Amatya S, Madrigal P, Isaacson A, Buettner F, Knights A, Natarajan KN, Consortium H, Vallier L, Marioni JC, Chhatriwala M, Stegle O. Single-cell RNA-sequencing of differentiating iPS cells reveals dynamic genetic effects on gene expression. Nature Communications. 2020;11:810. doi: 10.1038/s41467-020-14457-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Devlin B, Roeder K. Genomic control for association studies. Biometrics. 1999;55:997–1004. doi: 10.1111/j.0006-341x.1999.00997.x. [DOI] [PubMed] [Google Scholar]
  18. Devlin B, Roeder K, Wasserman L. Genomic control, a new approach to genetic-based association studies. Theoretical Population Biology. 2001;60:155–166. doi: 10.1006/tpbi.2001.1542. [DOI] [PubMed] [Google Scholar]
  19. Ding M, Bhupathiraju SN, Satija A, van Dam RM, Hu FB. Long-term coffee consumption and risk of cardiovascular disease: a systematic review and a dose-response meta-analysis of prospective cohort studies. Circulation. 2014;129:643–659. doi: 10.1161/CIRCULATIONAHA.113.005925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Donovan MKR, D’Antonio-Chronowska A, D’Antonio M, Frazer KA. Cellular deconvolution of GTEx tissues powers discovery of disease and cell-type associated regulatory variants. Nature Communications. 2020;11:955. doi: 10.1038/s41467-020-14561-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Du M, Yang L, Liu B, Yang L, Mao X, Liang M, Huang K. Inhibition of NFAT suppresses foam cell formation and the development of diet-induced atherosclerosis. FASEB Journal. 2021;35:e21951. doi: 10.1096/fj.202100947R. [DOI] [PubMed] [Google Scholar]
  22. Fabregat A, Sidiropoulos K, Viteri G, Marin-Garcia P, Ping P, Stein L, D’Eustachio P, Hermjakob H. Reactome diagram viewer: data structures and strategies to boost performance. Bioinformatics. 2018;34:1208–1214. doi: 10.1093/bioinformatics/btx752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fairfax BP, Humburg P, Makino S, Naranbhai V, Wong D, Lau E, Jostins L, Plant K, Andrews R, McGee C, Knight JC. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science. 2014;343:1246949. doi: 10.1126/science.1246949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fang X, Mei W, Barbazuk WB, Rivkees SA, Wendler CC. Caffeine exposure alters cardiac gene expression in embryonic cardiomyocytes. American Journal of Physiology-Regulatory, Integrative and Comparative Physiology. 2014;307:R1471–R1487. doi: 10.1152/ajpregu.00307.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Findley AS, Richards AL, Petrini C, Alazizi A, Doman E, Shanku AG, Davis GO, Hauff N, Sorokin Y, Wen X, Pique-Regi R, Luca F. Interpreting coronary artery disease risk through gene–environment interactions in gene regulation. Genetics. 2019;213:651–663. doi: 10.1534/genetics.119.302419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Findley AS, Monziani A, Richards AL, Rhodes K, Ward MC, Kalita CA, Alazizi A, Pazokitoroudi A, Sankararaman S, Wen X, Lanfear DE, Pique-Regi R, Gilad Y, Luca F. Functional dynamic genetic effects on gene regulation are specific to particular cell types and environmental conditions. eLife. 2021;10:e67077. doi: 10.7554/eLife.67077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Fryar CD, Chen TC, Li X. Prevalence of uncontrolled risk factors for cardiovascular disease: United States, 1999-2010. NCHS Data Brief. 2012;1:1–8. [PubMed] [Google Scholar]
  28. Garcia-Vaz E, McNeilly AD, Berglund LM, Ahmad A, Gallagher JR, Dutius Andersson A-M, McCrimmon RJ, Zetterqvist AV, Gomez MF, Khan F. Inhibition of NFAT signaling restores microvascular endothelial function in diabetic mice. Diabetes. 2020;69:424–435. doi: 10.2337/db18-0870. [DOI] [PubMed] [Google Scholar]
  29. Genuario RR, Perry RP. The GA-binding protein can serve as both an activator and repressor of ribosomal protein gene transcription. The Journal of Biological Chemistry. 1996;271:4388–4395. doi: 10.1074/jbc.271.8.4388. [DOI] [PubMed] [Google Scholar]
  30. Goettsch C, Rauner M, Hamann C, Sinningen K, Hempel U, Bornstein SR, Hofbauer LC. Nuclear factor of activated T cells mediates oxidised LDL-induced calcification of vascular smooth muscle cells. Diabetologia. 2011;54:2690–2701. doi: 10.1007/s00125-011-2219-0. [DOI] [PubMed] [Google Scholar]
  31. Gong J, Shao D, Xu K, Lu Z, Lu ZJ, Yang YT, Zhang QC. RISE: a database of RNA interactome from sequencing experiments. Nucleic Acids Research. 2018;46:D194–D201. doi: 10.1093/nar/gkx864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Gordon MG, Inoue F, Martin B, Schubach M, Agarwal V, Whalen S, Feng S, Zhao J, Ashuach T, Ziffra R, Kreimer A, Georgakopoulos-Soares I, Yosef N, Ye CJ, Pollard KS, Shendure J, Kircher M, Ahituv N. lentiMPRA and MPRAflow for high-throughput functional characterization of gene regulatory elements. Nature Protocols. 2020;15:2387–2412. doi: 10.1038/s41596-020-0333-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Greißel A, Culmes M, Burgkart R, Zimmermann A, Eckstein HH, Zernecke A, Pelisek J. Histone acetylation and methylation significantly change with severity of atherosclerosis in human carotid plaques. Cardiovascular Pathology. 2016;25:79–86. doi: 10.1016/j.carpath.2015.11.001. [DOI] [PubMed] [Google Scholar]
  34. Gupta RM, Hadaya J, Trehan A, Zekavat SM, Roselli C, Klarin D, Emdin CA, Hilvering CRE, Bianchi V, Mueller C, Khera AV, Ryan RJH, Engreitz JM, Issner R, Shoresh N, Epstein CB, de Laat W, Brown JD, Schnabel RB, Bernstein BE, Kathiresan S. A genetic variant associated with five vascular diseases is a distal regulator of endothelin-1 gene expression. Cell. 2017;170:522–533. doi: 10.1016/j.cell.2017.06.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hadi HAR, Carr CS, Al Suwaidi J. Endothelial dysfunction: cardiovascular risk factors, therapy, and outcome. Vascular Health and Risk Management. 2005;1:183–198. [PMC free article] [PubMed] [Google Scholar]
  36. Hajar R. Risk factors for coronary artery disease: historical perspectives. Heart Views. 2017;18:109–114. doi: 10.4103/HEARTVIEWS.HEARTVIEWS_106_17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hartmann K, Seweryn M, Sadee W. Interpreting coronary artery disease GWAS results: a functional genomics approach assessing biological significance. PLOS ONE. 2022;17:e0244904. doi: 10.1371/journal.pone.0244904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hirotsu Y, Hataya N, Katsuoka F, Yamamoto M. NF-E2-related factor 1 (Nrf1) serves as a novel regulator of hepatic lipid metabolism through regulation of the Lipin1 and PGC-1β genes. Molecular and Cellular Biology. 2012;32:2760–2770. doi: 10.1128/MCB.06706-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Horton JD, Shah NA, Warrington JA, Anderson NN, Park SW, Brown MS, Goldstein JL. Combined analysis of oligonucleotide microarray data from transgenic and knockout mice identifies direct SREBP target genes. PNAS. 2003;100:12027–12032. doi: 10.1073/pnas.1534923100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hu W, Lu H, Zhang J, Fan Y, Chang Z, Liang W, Wang H, Zhu T, Garcia-Barrio MT, Peng D, Chen YE, Guo Y. Krüppel-like factor 14, a coronary artery disease associated transcription factor, inhibits endothelial inflammation via NF-κB signaling pathway. Atherosclerosis. 2018;278:39–48. doi: 10.1016/j.atherosclerosis.2018.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Huang QQ, Tang HHF, Teo SM, Mok D, Ritchie SC, Nath AP, Brozynska M, Salim A, Bakshi A, Holt BJ, Khor CC, Sly PD, Holt PG, Holt KE, Inouye M. Neonatal genetics of gene expression reveal potential origins of autoimmune and allergic disease risk. Nature Communications. 2020;11:3761. doi: 10.1038/s41467-020-17477-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Huss JM, Kelly DP. Nuclear receptor signaling and cardiac energetics. Circulation Research. 2004;95:568–578. doi: 10.1161/01.RES.0000141774.29937.e3. [DOI] [PubMed] [Google Scholar]
  43. Innocenti F, Cooper GM, Stanaway IB, Gamazon ER, Smith JD, Mirkov S, Ramirez J, Liu W, Lin YS, Moloney C, Aldred SF, Trinklein ND, Schuetz E, Nickerson DA, Thummel KE, Rieder MJ, Rettie AE, Ratain MJ, Cox NJ, Brown CD. Identification, replication, and functional fine-mapping of expression quantitative trait loci in primary human liver tissue. PLOS Genetics. 2011;7:e1002078. doi: 10.1371/journal.pgen.1002078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Johnson EN, Lee YM, Sander TL, Rabkin E, Schoen FJ, Kaushal S, Bischoff J. NFATc1 mediates vascular endothelial growth factor-induced proliferation of human pulmonary valve endothelial cells. Journal of Biological Chemistry. 2003;278:1686–1692. doi: 10.1074/jbc.M210250200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Johnson GD, Barrera A, McDowell IC, D’Ippolito AM, Majoros WH, Vockley CM, Wang X, Allen AS, Reddy TE. Human genome-wide measurement of drug-responsive regulatory activity. Nature Communications. 2018;9:5317. doi: 10.1038/s41467-018-07607-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kalita CA, Moyerbrailean GA, Brown C, Wen X, Luca F, Pique-Regi R. QuASAR-MPRA: accurate allele-specific analysis for massively parallel reporter assays. bioRxiv. 2017 doi: 10.1101/105627. [DOI] [PMC free article] [PubMed]
  47. Kalita CA, Brown CD, Freiman A, Isherwood J, Wen X, Pique-Regi R, Luca F. High-throughput characterization of genetic effects on DNA–protein binding and gene transcription. Genome Research. 2018;28:1701–1708. doi: 10.1101/gr.237354.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Kessler T, Schunkert H. Coronary artery disease genetics enlightened by genome-wide association studies. JACC. Basic to Translational Science. 2021;6:610–623. doi: 10.1016/j.jacbts.2021.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kim-Hellmuth S, Bechheim M, Pütz B, Mohammadi P, Nédélec Y, Giangreco N, Becker J, Kaiser V, Fricker N, Beier E, Boor P, Castel SE, Nöthen MM, Barreiro LB, Pickrell JK, Müller-Myhsok B, Lappalainen T, Schumacher J, Hornung V. Genetic regulatory effects modified by immune activation contribute to autoimmune disease associations. Nature Communications. 2017;8:266. doi: 10.1038/s41467-017-00366-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kim-Hellmuth S, Aguet F, Oliva M, Muñoz-Aguirre M, Kasela S, Wucher V, Castel SE, Hamel AR, Viñuela A, Roberts AL, Mangul S, Wen X, Wang G, Barbeira AN, Garrido-Martín D, Nadel BB, Zou Y, Bonazzola R, Quan J, Brown A, Martinez-Perez A, Soria JM, Getz G, Dermitzakis ET, Small KS, Stephens M, Xi HS, Im HK, Guigó R, Segrè AV, Stranger BE, Ardlie KG, Lappalainen T, Aguet F, Anand S, Ardlie KG, Gabriel S, Getz GA, Graubert A, Hadley K, Handsaker RE, Huang KH, Kashin S, Li X, MacArthur DG, Meier SR, Nedzel JL, Nguyen DT, Segrè AV, Todres E, Balliu B, Barbeira AN, Battle A, Bonazzola R, Brown A, Brown CD, Castel SE, Conrad DF, Cotter DJ, Cox N, Das S, de Goede OM, Dermitzakis ET, Einson J, Engelhardt BE, Eskin E, Eulalio TY, Ferraro NM, Flynn ED, Fresard L, Gamazon ER, Garrido-Martín D, Gay NR, Gloudemans MJ, Guigó R, Hame AR, He Y, Hoffman PJ, Hormozdiari F, Hou L, Im HK, Jo B, Kasela S, Kellis M, Kim-Hellmuth S, Kwong A, Lappalainen T, Li X, Liang Y, Mangul S, Mohammadi P, Montgomery SB, Muñoz-Aguirre M, Nachun DC, Nobel AB, Oliva M, Park Y, Park Y, Parsana P, Rao AS, Reverter F, Rouhana JM, Sabatti C, Saha A, Stephens M, Stranger BE, Strober BJ, Teran NA, Viñuela A, Wang G, Wen X, Wright F, Wucher V, Zou Y, Ferreira PG, Li G, Melé M, Yeger-Lotem E, Barcus ME, Bradbury D, Krubit T, McLean JA, Qi L, Robinson K, Roche NV, Smith AM, Sobin L, Tabor DE, Undale A, Bridge J, Brigham LE, Foster BA, Gillard BM, Hasz R, Hunter M, Johns C, Johnson M, Karasik E, Kopen G, Leinweber WF, McDonald A, Moser MT, Myer K, Ramsey KD, Roe B, Shad S, Thomas JA, Walters G, Washington M, Wheeler J, Jewell SD, Rohrer DC, Valley DR, Davis DA, Mash DC, Branton PA, Barker LK, Gardiner HM, Mosavel M, Siminoff LA, Flicek P, Haeussler M, Juettemann T, Kent WJ, Lee CM, Powell CC, Rosenbloom KR, Ruffier M, Sheppard D, Taylor K, Trevanion SJ, Zerbino DR, Abell NS, Akey J, Chen L, Demanelis K, Doherty JA, Feinberg AP, Hansen KD, Hickey PF, Jasmine F, Jiang L, Kaul R, Kibriya MG, Li JB, Li Q, Lin S, Linder SE, Pierce BL, Rizzardi LF, Skol AD, Smith KS, Snyder M, Stamatoyannopoulos J, Tang H, Wang M, Carithers LJ, Guan P, Koester SE, Little AR, Moore HM, Nierras CR, Rao AK, Vaught JB, Volpi S, GTEx Consortium Cell type–specific genetic regulation of gene expression across human tissues. Science. 2020;369:aaz8528. doi: 10.1126/science.aaz8528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Klee CB, Ren H, Wang X. Regulation of the calmodulin-stimulated protein phosphatase, calcineurin. Journal of Biological Chemistry. 1998;273:13367–13370. doi: 10.1074/jbc.273.22.13367. [DOI] [PubMed] [Google Scholar]
  52. Knowles DA, Davis JR, Edgington H, Raj A, Favé M-J, Zhu X, Potash JB, Weissman MM, Shi J, Levinson DF, Awadalla P, Mostafavi S, Montgomery SB, Battle A. Allele-specific expression reveals interactions between genetic variation and environment. Nature Methods. 2017;14:699–702. doi: 10.1038/nmeth.4298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Knowles DA, Burrows CK, Blischak JD, Patterson KM, Serie DJ, Norton N, Ober C, Pritchard JK, Gilad Y. Determining the genetic basis of anthracycline-cardiotoxicity by molecular response QTL mapping in induced cardiomyocytes. eLife. 2018;7:e33480. doi: 10.7554/eLife.33480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kotla S, Vu HT, Ko KA, Wang Y, Imanishi M, Heo K-S, Fujii Y, Thomas TN, Gi YJ, Mazhar H, Paez-Mayorga J, Shin J-H, Tao Y, Giancursio CJ, Medina JL, Taunton J, Lusis AJ, Cooke JP, Fujiwara K, Le N-T, Abe J-I. Endothelial senescence is induced by phosphorylation and nuclear export of telomeric repeat binding factor 2-interacting protein. JCI Insight. 2019;4:e124867. doi: 10.1172/jci.insight.124867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Koyama S, Ito K, Terao C, Akiyama M, Horikoshi M, Momozawa Y, Matsunaga H, Ieki H, Ozaki K, Onouchi Y, Takahashi A, Nomura S, Morita H, Akazawa H, Kim C, Seo J, Higasa K, Iwasaki M, Yamaji T, Sawada N, Tsugane S, Koyama T, Ikezaki H, Takashima N, Tanaka K, Arisawa K, Kuriki K, Naito M, Wakai K, Suna S, Sakata Y, Sato H, Hori M, Sakata Y, Matsuda K, Murakami Y, Aburatani H, Kubo M, Matsuda F, Kamatani Y, Komuro I. Population-specific and trans-ancestry genome-wide analyses identify distinct and shared genetic risk loci for coronary artery disease. Nature Genetics. 2020;52:1169–1177. doi: 10.1038/s41588-020-0705-3. [DOI] [PubMed] [Google Scholar]
  56. Krüger-Genge A, Blocki A, Franke RP, Jung F. Vascular endothelial cell biology: an update. International Journal of Molecular Sciences. 2019;20:4411. doi: 10.3390/ijms20184411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lebeau PF, Byun JH, Platko K, Saliba P, Sguazzin M, MacDonald ME, Paré G, Steinberg GR, Janssen LJ, Igdoura SA, Tarnopolsky MA, Wayne Chen SR, Seidah NG, Magolan J, Austin RC. Caffeine blocks SREBP2-induced hepatic PCSK9 expression to enhance LDLR-mediated cholesterol clearance. Nature Communications. 2022;13:770. doi: 10.1038/s41467-022-28240-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Lee MN, Ye C, Villani A-C, Raj T, Li W, Eisenhaure TM, Imboywa SH, Chipendo PI, Ran FA, Slowikowski K, Ward LD, Raddassi K, McCabe C, Lee MH, Frohlich IY, Hafler DA, Kellis M, Raychaudhuri S, Zhang F, Stranger BE, Benoist CO, De Jager PL, Regev A, Hacohen N. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science. 2014;343:1246980. doi: 10.1126/science.1246980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Mangravite LM, Engelhardt BE, Medina MW, Smith JD, Brown CD, Chasman DI, Mecham BH, Howie B, Shim H, Naidoo D, Feng Q, Rieder MJ, Chen YDI, Rotter JI, Ridker PM, Hopewell JC, Parish S, Armitage J, Collins R, Wilke RA, Nickerson DA, Stephens M, Krauss RM. A statin-dependent QTL for GATM expression is associated with statin-induced myopathy. Nature. 2013;502:377–380. doi: 10.1038/nature12508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Manry J, Nédélec Y, Fava VM, Cobat A, Orlova M, Thuc NV, Thai VH, Laval G, Barreiro LB, Schurr E. Deciphering the genetic control of gene expression following Mycobacterium leprae antigen stimulation. PLOS Genetics. 2017;13:e1006952. doi: 10.1371/journal.pgen.1006952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Maranville JC, Luca F, Richards AL, Wen X, Witonsky DB, Baxter S, Stephens M, Di Rienzo A. Interactions between glucocorticoid treatment and cis-regulatory polymorphisms contribute to cellular response phenotypes. PLOS Genetics. 2011;7:e1002162. doi: 10.1371/journal.pgen.1002162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Matsuzawa Y, Lerman A. Endothelial dysfunction and coronary artery disease: assessment, prognosis, and treatment. Coronary Artery Disease. 2014;25:713–724. doi: 10.1097/MCA.0000000000000178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Melnikov A, Murugan A, Zhang X, Tesileanu T, Wang L, Rogov P, Feizi S, Gnirke A, Callan CG, Kinney JB, Kellis M, Lander ES, Mikkelsen TS. Systematic dissection and optimization of inducible enhancers in human cells using a massively parallel reporter assay. Nature Biotechnology. 2012;30:271–277. doi: 10.1038/nbt.2137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Miranda AM, Steluti J, Goulart AC, Benseñor IM, Lotufo PA, Marchioni DM. Coffee consumption and coronary artery calcium score: cross-sectional results of ELSA-Brasil (Brazilian Longitudinal Study of Adult Health) Journal of the American Heart Association. 2018;7:e007155. doi: 10.1161/JAHA.117.007155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Moyerbrailean GA, Kalita CA, Harvey CT, Wen X, Luca F, Pique-Regi R. Which genetics variants in DNase-Seq footprints are more likely to alter binding? PLOS Genetics. 2016a;12:e1005875. doi: 10.1371/journal.pgen.1005875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Moyerbrailean GA, Richards AL, Kurtz D, Kalita CA, Davis GO, Harvey CT, Alazizi A, Watza D, Sorokin Y, Hauff N, Zhou X, Wen X, Pique-Regi R, Luca F. High-throughput allele-specific expression across 250 environmental conditions. Genome Research. 2016b;26:1627–1638. doi: 10.1101/gr.209759.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Nédélec Y, Sanz J, Baharian G, Szpiech ZA, Pacis A, Dumaine A, Grenier JC, Freiman A, Sams AJ, Hebert S, Pagé Sabourin A, Luca F, Blekhman R, Hernandez RD, Pique-Regi R, Tung J, Yotova V, Barreiro LB. Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell. 2016;167:657–669. doi: 10.1016/j.cell.2016.09.025. [DOI] [PubMed] [Google Scholar]
  69. Nikpay M, Goel A, Won H-H, Hall LM, Willenborg C, Kanoni S, Saleheen D, Kyriakou T, Nelson CP, Hopewell JC, Webb TR, Zeng L, Dehghan A, Alver M, Armasu SM, Auro K, Bjonnes A, Chasman DI, Chen S, Ford I, Franceschini N, Gieger C, Grace C, Gustafsson S, Huang J, Hwang S-J, Kim YK, Kleber ME, Lau KW, Lu X, Lu Y, Lyytikäinen L-P, Mihailov E, Morrison AC, Pervjakova N, Qu L, Rose LM, Salfati E, Saxena R, Scholz M, Smith AV, Tikkanen E, Uitterlinden A, Yang X, Zhang W, Zhao W, de Andrade M, de Vries PS, van Zuydam NR, Anand SS, Bertram L, Beutner F, Dedoussis G, Frossard P, Gauguier D, Goodall AH, Gottesman O, Haber M, Han B-G, Huang J, Jalilzadeh S, Kessler T, König IR, Lannfelt L, Lieb W, Lind L, Lindgren CM, Lokki M-L, Magnusson PK, Mallick NH, Mehra N, Meitinger T, Memon FUR, Morris AP, Nieminen MS, Pedersen NL, Peters A, Rallidis LS, Rasheed A, Samuel M, Shah SH, Sinisalo J, Stirrups KE, Trompet S, Wang L, Zaman KS, Ardissino D, Boerwinkle E, Borecki IB, Bottinger EP, Buring JE, Chambers JC, Collins R, Cupples LA, Danesh J, Demuth I, Elosua R, Epstein SE, Esko T, Feitosa MF, Franco OH, Franzosi MG, Granger CB, Gu D, Gudnason V, Hall AS, Hamsten A, Harris TB, Hazen SL, Hengstenberg C, Hofman A, Ingelsson E, Iribarren C, Jukema JW, Karhunen PJ, Kim B-J, Kooner JS, Kullo IJ, Lehtimäki T, Loos RJF, Melander O, Metspalu A, März W, Palmer CN, Perola M, Quertermous T, Rader DJ, Ridker PM, Ripatti S, Roberts R, Salomaa V, Sanghera DK, Schwartz SM, Seedorf U, Stewart AF, Stott DJ, Thiery J, Zalloua PA, O’Donnell CJ, Reilly MP, Assimes TL, Thompson JR, Erdmann J, Clarke R, Watkins H, Kathiresan S, McPherson R, Deloukas P, Schunkert H, Samani NJ, Farrall M. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nature Genetics. 2015;47:1121–1130. doi: 10.1038/ng.3396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Okamoto J, Wang L, Yin X, Luca F, Pique-Regi R, Helms A, Im HK, Morrison J, Wen X. Probabilistic integration of transcriptome-wide association studies and colocalization analysis identifies key molecular pathways of complex traits. American Journal of Human Genetics. 2023;110:44–57. doi: 10.1016/j.ajhg.2022.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Patwardhan RP, Hiatt JB, Witten DM, Kim MJ, Smith RP, May D, Lee C, Andrie JM, Lee SI, Cooper GM, Ahituv N, Pennacchio LA, Shendure J. Massively parallel functional dissection of mammalian enhancers in vivo. Nature Biotechnology. 2012;30:265–270. doi: 10.1038/nbt.2136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Pickrell JK. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. American Journal of Human Genetics. 2014;94:559–573. doi: 10.1016/j.ajhg.2014.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Planning Committee for a Workshop on Potential Health Hazards Associated with Consumption of Caffeine in Food and Dietary Supplements, Food and Nutrition Board, Board on Health Sciences Policy, Institute of Medicine . Caffeine in Food and Dietary Supplements: Examining Safety: Workshop Summary. Washington: National Academies Press; 2014. [DOI] [PubMed] [Google Scholar]
  74. Prosdocimo DA, Anand P, Liao X, Zhu H, Shelkay S, Artero-Calderon P, Zhang L, Kirsh J, Moore D, Rosca MG, Vazquez E, Kerner J, Akat KM, Williams Z, Zhao J, Fujioka H, Tuschl T, Bai X, Schulze PC, Hoppel CL, Jain MK, Haldar SM. Kruppel-like factor 15 is a critical regulator of cardiac lipid metabolism. The Journal of Biological Chemistry. 2014;289:5914–5924. doi: 10.1074/jbc.M113.531384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Prosdocimo DA, John JE, Zhang L, Efraim ES, Zhang R, Liao X, Jain MK. KLF15 and PPARα cooperate to regulate cardiomyocyte lipid gene expression and oxidation. PPAR Research. 2015;2015:201625. doi: 10.1155/2015/201625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Quach H, Rotival M, Pothlichet J, Loh YHE, Dannemann M, Zidane N, Laval G, Patin E, Harmant C, Lopez M, Deschamps M, Naffakh N, Duffy D, Coen A, Leroux-Roels G, Clément F, Boland A, Deleuze JF, Kelso J, Albert ML, Quintana-Murci L. Genetic adaptation and neandertal admixture shaped the immune system of human populations. Cell. 2016;167:643–656. doi: 10.1016/j.cell.2016.09.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Shlyueva D, Stelzer C, Gerlach D, Yáñez-Cuna JO, Rath M, Boryń ŁM, Arnold CD, Stark A. Hormone-responsive enhancer-activity maps reveal predictive motifs, indirect repression, and targeting of closed chromatin. Molecular Cell. 2014;54:180–192. doi: 10.1016/j.molcel.2014.02.026. [DOI] [PubMed] [Google Scholar]
  78. Stevens LM, Linstead E, Hall JL, Kao DP. Association between coffee intake and incident heart failure risk: a machine learning analysis of the FHS, the ARIC Study, and the CHS. Circulation. Heart Failure. 2021;14:e006799. doi: 10.1161/CIRCHEARTFAILURE.119.006799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Strober BJ, Elorbany R, Rhodes K, Krishnan N, Tayeb K, Battle A, Gilad Y. Dynamic genetic regulation of gene expression during cellular differentiation. Science. 2019;364:1287–1290. doi: 10.1126/science.aaw0040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Temprano‐Sagrera G, Sitlani CM, Bone WP, Martin‐Bornez M, Voight BF, Morrison AC, Damrauer SM, de Vries PS, Smith NL, Sabater‐Lleal M, Dehghan A, Heath AS, Morrison AC, Reiner AP, Johnson A, Richmond A, Peters A, van Hylckama Vlieg A, McKnight B, Psaty BM, Hayward C, Ward‐Caviness C, O’Donnell C, Chasman D, Strachan DP, Tregouet DA, Mook‐Kanamori D, Gill D, Thibord F, Asselbergs FW, Leebeek FWG, Rosendaal FR, Davies G, Homuth G, Temprano G, Campbell H, Taylor HA, Bressler J, Huffman JE, Rotter JI, Yao J, Wilson JF, Bis JC, Hahn JM, Desch KC, Wiggins KL, Raffield LM, Bielak LF, Yanek LR, Kleber ME, Sabater‐Lleal M, Mueller M, Kavousi M, Mangino M, Liu M, Brown MR, Conomos MP, Jhun M, Chen M, de Maat MPM, Pankratz N, Smith NL, Peyser PA, Elliot P, de Vries PS, Wei P, Wild PS, Morange PE, van der Harst P, Yang Q, Le N, Marioni R, Li R, Damrauer SM, Cox SR, Trompet S, Felix SB, Völker U, Tang W, Koenig W, Jukema JW, Guo X, Lindstrom S, Wang L, Smith EN, Gordon W, van Hylckama Vlieg A, de Andrade M, Brody JA, Pattee JW, Haessler J, Brumpton BM, Chasman DI, Suchon P, Chen M, Turman C, Germain M, Wiggins KL, MacDonald J, Braekkan SK, Armasu SM, Pankratz N, Jackson RD, Nielsen JB, Giulianini F, Puurunen MK, Ibrahim M, Heckbert SR, Bammler TK, Frazer KA, McCauley BM, Taylor K, Pankow JS, Reiner AP, Gabrielsen ME, Deleuze J, O’Donnell CJ, Kim J, McKnight B, Kraft P, Hansen J, Rosendaal FR, Heit JA, Psaty BM, Tang W, Kooperberg C, Hveem K, Ridker PM, Morange P, Johnson AD, Kabrhel C, Trégouët D, Smith NL, Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, Rutten‐Jacobs L, Giese A, van der Laan SW, Gretarsdottir S, Anderson CD, Chong M, Adams HH, Ago T, Almgren P, Amouyel P, Ay H, Bartz TM, Benavente OR, Bevan S, Boncoraglio GB, Brown RD, Butterworth AS, Carrera C, Carty CL, Chasman DI, Chen W, Cole JW, Correa A, Cotlarciuc I, Cruchaga C, Danesh J, de Bakker PI, DeStefano AL, den Hoed M, Duan Q, Engelter ST, Falcone GJ, Gottesman RF, Grewal RP, Gudnason V, Gustafsson S, Haessler J, Harris TB, Hassan A, Havulinna AS, Heckbert SR, Holliday EG, Howard G, Hsu F, Hyacinth HI, Arfan Ikram M, Ingelsson E, Irvin MR, Jian X, Jiménez‐Conde J, Johnson JA, Jukema JW, Kanai M, Keene KL, Kissela BM, Kleindorfer DO, Kooperberg C, Kubo M, Lange LA, Langefeld CD, Langenberg C, Launer LJ, Lee J, Lemmens R, Leys D, Lewis CM, Lin W, Lindgren AG, Lorentzen E, Magnusson PK, Maguire J, Manichaikul A, McArdle PF, Meschia JF, Mitchell BD, Mosley TH, Nalls MA, Ninomiya T, O’Donnell MJ, Psaty BM, Pulit SL, Rannikmäe K, Reiner AP, Rexrode KM, Rice K, Rich SS, Ridker PM, Rost NS, Rothwell PM, Rotter JI, Rundek T, Sacco RL, Sakaue S, Sale MM, Salomaa V, Sapkota BR, Schmidt R, Schmidt CO, Schminke U, Sharma P, Slowik A, Sudlow CL, Tanislav C, Tatlisumak T, Taylor KD, Thijs VN, Thorleifsson G, Thorsteinsdottir U, Tiedt S, Trompet S, Tzourio C, van Duijn CM, Walters M, Wareham NJ, Wassertheil‐Smoller S, Wilson JG, Wiggins KL, Yang Q, Yusuf S, Amin N, Aparicio HS, Arnett DK, Attia J, Beiser AS, Berr C, Buring JE, Bustamante M, Caso V, Cheng Y, Hoan Choi S, Chowhan A, Cullell N, Dartigues J, Delavaran H, Delgado P, Dörr M, Engström G, Ford I, Gurpreet WS, Hamsten A, Heitsch L, Hozawa A, Ibanez L, Ilinca A, Ingelsson M, Iwasaki M, Jackson RD, Jood K, Jousilahti P, Kaffashian S, Kalra L, Kamouchi M, Kitazono T, Kjartansson O, Kloss M, Koudstaal PJ, Krupinski J, Labovitz DL, Laurie CC, Levi CR, Li L, Lind L, Lindgren CM, Lioutas V, Mei Liu Y, Lopez OL, Makoto H, Martinez‐Majander N, Matsuda K, Minegishi N, Montaner J, Morris AP, Muiño E, Müller‐Nurasyid M, Norrving B, Ogishima S, Parati EA, Reddy Peddareddygari L, Pedersen NL, Pera J, Perola M, Pezzini A, Pileggi S, Rabionet R, Riba‐Llena I, Ribasés M, Romero JR, Roquer J, Rudd AG, Sarin A, Sarju R, Sarnowski C, Sasaki M, Satizabal CL, Satoh M, Sattar N, Sawada N, Sibolt G, Sigurdsson Á, Smith A, Sobue K, Soriano‐Tárraga C, Stanne T, Colin Stine O, Stott DJ, Strauch K, Takai T, Tanaka H, Tanno K, Teumer A, Tomppo L, Torres‐Aguila NP, Touze E, Tsugane S, Uitterlinden AG, Valdimarsson EM, van der Lee SJ, Völzke H, Wakai K, Weir D, Williams SR, Wolfe CD, Wong Q, Xu H, Yamaji T, Sanghera DK, Melander O, Jern C, Strbian D, Fernandez‐Cadenas I, Longstreth WT, Rolfs A, Hata J, Woo D, Rosand J, Pare G, Hopewell JC, Saleheen D, Stefansson K, Worrall BB, Kittner SJ, Seshadri S, Fornage M, Markus HS, Howson JM, Kamatani Y, Debette S, Dichgans M. Multi‐phenotype analyses of hemostatic traits with cardiovascular events reveal novel genetic associations. Journal of Thrombosis and Haemostasis. 2022;20:1331–1349. doi: 10.1111/jth.15698. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Tewhey R, Kotliar D, Park DS, Liu B, Winnicki S, Reilly SK, Andersen KG, Mikkelsen TS, Lander ES, Schaffner SF, Sabeti PC. Direct identification of hundreds of expression-modulating variants using a multiplexed reporter assay. Cell. 2016;165:1519–1529. doi: 10.1016/j.cell.2016.04.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Turnbull D, Rodricks JV, Mariano GF, Chowdhury F. Caffeine and cardiovascular health. Regulatory Toxicology and Pharmacology. 2017;89:165–185. doi: 10.1016/j.yrtph.2017.07.025. [DOI] [PubMed] [Google Scholar]
  83. Ulirsch JC, Nandakumar SK, Wang L, Giani FC, Zhang X, Rogov P, Melnikov A, McDonel P, Do R, Mikkelsen TS, Sankaran VG. Systematic functional dissection of common genetic variation affecting red blood cell traits. Cell. 2016;165:1530–1545. doi: 10.1016/j.cell.2016.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. van der Harst P, Verweij N. Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease. Circulation Research. 2018;122:433–443. doi: 10.1161/CIRCRESAHA.117.312086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Vockley CM, Guo C, Majoros WH, Nodzenski M, Scholtens DM, Hayes MG, Lowe WL, Reddy TE. Massively parallel quantification of the regulatory effects of noncoding genetic variation in a human cohort. Genome Research. 2015;25:1206–1214. doi: 10.1101/gr.190090.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Voskoboinik A, Koh Y, Kistler PM. Cardiovascular effects of caffeinated beverages. Trends in Cardiovascular Medicine. 2019;29:345–350. doi: 10.1016/j.tcm.2018.09.019. [DOI] [PubMed] [Google Scholar]
  87. Wang X, He L, Goggin SM, Saadat A, Wang L, Sinnott-Armstrong N, Claussnitzer M, Kellis M. High-resolution genome-wide functional dissection of transcriptional regulatory regions and nucleotides in human. Nature Communications. 2018;9:5380. doi: 10.1038/s41467-018-07746-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wang X, Musunuru K. Confirmation of causal rs9349379- PHACTR1 expression quantitative trait locus in human-induced pluripotent stem cell endothelial cells. Circulation. Genomic and Precision Medicine. 2018;11:e002327. doi: 10.1161/CIRCGEN.118.002327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Wang Y, Hu J, Liu J, Geng Z, Tao Y, Zheng F, Wang Y, Fu S, Wang W, Xie C, Zhang Y, Gong F. The role of Ca2+/NFAT in dysfunction and inflammation of human coronary endothelial cells induced by sera from patients with Kawasaki disease. Scientific Reports. 2020;10:4706. doi: 10.1038/s41598-020-61667-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wen X, Luca F, Pique-Regi R. Cross-population joint analysis of eQTLs: fine mapping and functional annotation. PLOS Genetics. 2015;11:e1005176. doi: 10.1371/journal.pgen.1005176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wen X. Molecular QTL discovery incorporating genomic annotations using Bayesian false discovery rate control. The Annals of Applied Statistics. 2016;10:1619–1638. doi: 10.1214/16-AOAS952. [DOI] [Google Scholar]
  92. Wen X, Pique-Regi R, Luca F. Integrating molecular QTL data into genome-wide genetic association analysis: Probabilistic assessment of enrichment and colocalization. PLOS Genetics. 2017;13:e1006646. doi: 10.1371/journal.pgen.1006646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Xu S, Ilyas I, Little PJ, Li H, Kamato D, Zheng X, Luo S, Li Z, Liu P, Han J, Harding IC, Ebong EE, Cameron SJ, Stewart AG, Weng J. Endothelial Dysfunction in atherosclerotic cardiovascular diseases and beyond: from mechanism to pharmacotherapies. Pharmacological Reviews. 2021;73:924–967. doi: 10.1124/pharmrev.120.000096. [DOI] [PubMed] [Google Scholar]
  94. Zetterqvist AV, Berglund LM, Blanco F, Garcia-Vaz E, Wigren M, Dunér P, Andersson AMD, To F, Spegel P, Nilsson J, Bengtsson E, Gomez MF. Inhibition of nuclear factor of activated T-cells (NFAT) suppresses accelerated atherosclerosis in diabetic mice. PLOS ONE. 2014;8:e65020. doi: 10.1371/journal.pone.0065020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Zhang W, Song M, Qu J, Liu GH. Epigenetic modifications in cardiovascular aging and diseases. Circulation Research. 2018;123:773–786. doi: 10.1161/CIRCRESAHA.118.312497. [DOI] [PubMed] [Google Scholar]
  96. Zhang Y, Quick C, Yu K, Barbeira A, Consortium Gte, Luca F, Pique-Regi R, Kyung Im H, Wen X. PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis. Genome Biology. 2020;21:232. doi: 10.1186/s13059-020-02026-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Stephen CJ Parker 1

This important study identifies context-specific regulatory variants by an MPRA screen in vascular endothelial cells exposed to caffeine. The authors use a compelling and creative approach to pinpoint potential molecular mechanisms of gene-by-environment effects on gene regulation. The variants they identify are likely linked to complex disease risk.

Decision letter

Editor: Stephen CJ Parker1
Reviewed by: Adelaide Tovar2

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "Characterization of caffeine response regulatory variants in vascular endothelial cells" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Detlef Weigel as the Senior Editor.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

Please see the reviewer comments below, which elaborate on these overall essential revisions. Importantly, all three reviewers had a productive internal discussion and are in agreement about these revisions.

1. Clarify p-value calculations, N oligos, Figure 2 ambiguities, selection of null SNPs, and method details. Convincingly demonstrate the overall robustness of the underlying data.

2. Show a more convincing locus than the PIP4K2B example, which is only supported by a pTWAS result (with LD contamination issues) with no fine mapping or even nominal p-value evidence.

Reviewer #1 (Recommendations for the authors):

It would be good to have the number of ASE and cASE variants clearly stated.

Figure 1 illustrates only parts of the questions of this study. It would be helpful to include also an illustration of the regulatory element activity underwater/caffeine conditions, and allelic comparison under the two comparisons.

"Only one lead non-coding variant has been thoroughly investigated" (referring to EDN1) but what about e.g. SORT1? One could debate what's "thorough" but I'd soften this statement.

Supplemental figure 2: what is the ethanol experiment here? I'm confused.

The quality of the supplementary figures and their legends could be improved. There are some errors in references to figure panels (SFigure 2 & 3), and axis legends that are way too small (SFigure 5). Please arrange the layout so that the figure and legend are on the same page.

"Only two previous studies have used MPRAs to investigate DNA sequences that regulate the transcriptional response to treatments"- it is of course a matter of taste if preprints count here, but I'd add this one too: https://www.biorxiv.org/content/10.1101/2020.09.30.321323v1.full

Reviewer #2 (Recommendations for the authors):

1. After performing differential activity analysis to determine significantly differentially active constructs, the authors move to investigate the allele effects of gene regulation in both treatment conditions. one point needs clarification:

When ASE is tested for each SNP/direction in the control condition, 30096 pairs were tested, vs 49441 pairs in the caffeine treatment condition. How was the number of test pairs determined? Was this based on the number of differentially active constructs in that condition?

2. Likewise with the cASE test: how was the initial SNP list of 28922 SNPs determined?

3. In figure 5, identifying fine-mapped artery eQTLs with significant cASE, pTWAS was used to investigate the genes regulated by the eQTLs and their phenotypic consequences. Although the code is cited, it would be beneficial to have a brief summary of pTWAS, similar to the summary given for DAP-G in the methods section.

4. The legend in supplemental figure 2 says "Z-score scatter plots from ASE analysis comparing water and ethanol (A) and caffeine and water (B)". However, there doesn't appear to be a Suppl. Figure 2A and 2B, just a single scatter plot. Likewise, ethanol is mentioned here as a treatment condition and does not appear elsewhere in the manuscript.

5. The authors acknowledge the limitation of their MPRA with respect to native chromatin context in the discussion. Likewise, although HUVECs are a common model cell line for endothelial cells, they have their own limitations with respect to CAD pathophysiology. Adding this to the discussion would be beneficial.

6. In figure five the SNP rs228271 is shown as an example of the prioritization approach, along with a proposed example mechanism. It would be interesting to see if part of this mechanism can be validated. For example: if the HUVECs cells used in this study are homozygous for the alternate allele of rs228271, does caffeine treatment decrease PIP4K2B activity?

7. Columns in supplementary table 2 need to be explained.

8. There are several places where the authors cite the literature but it is not clear how the cited studies and results relate to their study. Some examples are below:

a. Authors state that "In diabetic mice, NFAT expression exacerbated atherosclerosis (Blanco et al., 2018; Zetterqvist et al., 2014) and increased foam cell formation (Du et al., 2021)" In which cells did NFAT expression exacerbated atherosclerosis? In endothelial cells or other cells? How is this study relevant to caffeine response in endothelial cells? Is there a known causal link between caffeine consumption and atherosclerosis?

b. Authors state that "In hepatocytes, caffeine is known to suppress SREBF2 activity, which reduces PCSK9 expression, and thus increases LDLR expression, which could be protective against cardiovascular disease (Lebeau et al., 2022)." How do the authors translate this knowledge to ECs and caffeine response? Does the same LDLR/PCSK9 pathway exist in ECs? Does SREBF2 regulate it? What is known about the relationship between caffeine and lipid signaling in ECs?

c. Authors state that "Factors of interest for cardiovascular function include NRF1, which is enriched for constructs containing both ASE and cASE variants, is known to regulate lipid metabolism (Hirotsu et al., 2012; Huss and Kelly, 2004), and is annotated as part of the lipid metabolism pathway in Reactome (Fabregat et al., 2018). KLF15 and KLF14 are enriched only in constructs with ASE. KLF15 is involved in cardiac lipid metabolism (Prosdocimo et al., 2014, 2015), and KLF14, has previously been associated with cardiovascular disease (Chen et al., 2012; Hu et al., 2018)." what is the relationship between lipid metabolism and caffeine response in ECs?

9. Authors perform enrichment analysis of cASE variants in GTEx artery eQTLs to ask if GTEx artery eQTLs are missing GxE eQTL effects. Artery eQTLs are enriched for gene expression from smooth muscle cells. There are endothelial eQTL datasets (PMIDs 32442411, 20170901, 23667179) Those are the appropriate datasets to test the authors' hypothesis and should be used in this study.

Reviewer #3 (Recommendations for the authors):

1. You include some detail about experimental characteristics including a heatmap of counts per sample and PCA plots – elaborating to explicitly describe inter-replicate correlation and range of oligo activity (not just allelic bias) would be appreciated. It would also be helpful to give even limited details about where these variants come from, including which tissues the eQTLs correspond to, which diseases/traits for GWAS, etc. to set expectations for some of the enrichment analyses that are performed.

2. Because you seem to be testing different sets of variants/construct orientations (i.e., denominators) across different analyses, it would be helpful to give greater clarity on where these numbers are coming from and to add sample sizes to your figure legends. Additionally, the abstract also points out some very specific numbers that are never mentioned in the main text (e.g., 7,152 variants that regulate the expression of caffeine, only mentioned in the discussion).

3. The methods section mentions the use of QuASAR-MPRA, which enables "analysis of barcoded MPRA data." You mention the use of UMIs to deduplicate reads (accounting for PCR duplicates), but are the oligos in this library barcoded during plasmid preparation? If so, what was the average and range of barcode depth across oligos?

4. Negative control oligos are included in the library, but no description of how they were used for the analysis is given. They could be highlighted in a different color on the heat map and/or a description could be given whether they were used for setting thresholds.

5. You show data and a proposed model for a specific variant in figure 5, but the inclusion of actual BiT-STARR-seq data would augment your interpretation and descriptive model.

6. One additional study to cite in the introduction is "Functional characterization of T2D-associated SNP effects on baseline and ER stress-responsive β cell transcriptional activation" (PMID: 34475398). This study examined treatment-specific effects on allelic bias in an MPRA, including variants known to be e- and caQTL.

7. Though you present a model for GxE at a specific variant in figure 5, the study design cartoon in figure 1 only demonstrates allelic effects. Given that context-specific effects are the focal point of the manuscript, it would help readers to illustrate some hypothetical G, E, and GxE effects on MPRA activity.

eLife. 2024 Feb 9;13:e85235. doi: 10.7554/eLife.85235.sa2

Author response


Essential revisions:

Please see the reviewer comments below, which elaborate on these overall essential revisions. Importantly, all three reviewers had a productive internal discussion and are in agreement about these revisions.

1. Clarify p-value calculations, N oligos, Figure 2 ambiguities, selection of null SNPs, and method details. Convincingly demonstrate the overall robustness of the underlying data.

We thank the reviewers and editor for these comments which we have addressed as described below:

P-value calculations: Previously, to summarize the DESeq results we had plotted the lowest pvalue per construct (either the p-value for the forward or reverse orientation). By selecting the pvalues this way, the plot seemed to have inflated p-values. Instead, we have changed the plot to show all p-values (for both directions when available).

Author response image 1.

Author response image 1.

N oligos and selection of null SNPs: we have clarified the nomenclature used across the manuscript to refer to SNP/direction pairs or SNPs and the number of oligos tested for each analysis. We have also clarified the selection of null SNPs. All these changes were introduced in the “BiT-STARR-Seq Library Design” section of methods and are also copied below:“We designed 43,556 target regulatory regions each containing a SNP in the middle and with a total length of 200 nucleotides. This set of targets corresponds to 87,112 constructs each containing only one of two alleles at the test SNP. Additionally, each construct can be integrated in the forward or reverse orientation, leading to a maximum of 174,224 constructs in either direction. Please also see below for a description of how we use library-related terms throughout the paper. The library used is the same as reported in Kalita et al. (Kalita et al., 2018). Briefly, the library of target regulatory sequences consisted of several categories of regulatory variants, including eQTLs (Innocenti et al., 2011; Wen et al., 2015), SNPs predicted to disrupt transcription factor binding (centiSNPs) (Moyerbrailean et al., 2016a), and SNPs associated with complex traits in GWAS(Pickrell, 2014). Negative controls that were not predicted to have a regulatory effect were also included in the library (Moyerbrailean et al., 2016a). It is important to note that these negative controls are only predicted not to have a regulatory effect via computation annotation (Moyerbrailean et al., 2016b), so they may not be representative of true negative controls. This is why we largely do not utilize these SNPs as negative controls within our analyses. Our predictions of regulatory activity also did not account for environmental context, thus these sequences are also not suited to annotate our cASE results.

SNP (n = 43,556): Refers to a genetic variant tested for allelic effects on gene regulation. Target (n = 43,556): 200 nucleotide-long oligonucleotide sequence that contains the test SNP in the middle of the target.

Construct (n = 87,112): Synthesized 200 nucleotide-long oligonucleotide sequence that contains only one of the two possible alleles at the test SNP. Each target corresponds to two constructs.. Direction: Constructs can integrate in either the forward or reverse direction relative to the direction of transcription in the BiT-STARR-seq plasmid. Therefore two directions are possible for each construct.

SNP/direction pair (n = 87,112): A SNP tested for allelic effects on gene regulation contrasting the expression of two constructs that are integrated in the same direction. All statistical tests are performed at this level, testing in each direction separately.”

Additionally, we now utilize the NULL sequences to annotate our ASE results, and show a depletion for these sequences. See Figure 3A. We have also prepared a plot zoomed-in to show the deviation in the 0-1 region (Author response image 2).

Author response image 2.

Author response image 2.

Figure 2 ambiguities: Thanks for pointing out this inconsistency in Supplemental figure 2. We had used the wrong caption by mistake and have now corrected it.Methods: We have added a brief description of pTWAS in the Results section to aid the reader in interpreting the results we present. Since we added additional analyses (INTACT), we have expanded our methods section to include this as well. We have also generally added more detail in the methods section.

2. Show a more convincing locus than the PIP4K2B example, which is only supported by a pTWAS result (with LD contamination issues) with no fine mapping or even nominal p-value evidence.

We agree that a stronger example could be provided. We have utilized INTACT, which identifies putative causal genes, to provide better example variants. This method combines TWAS and colocalization approaches to mitigate issues with LD contamination. We now provide the following 2 examples in the text, and include 8 total examples in supplemental table 6:

rs4938344 is an eQTL regulating the long non-coding RNA AP000892.6. The reference allele at this locus, G, results in decreased expression of AP000892.6. (as measured in GTEx and in the caffeine condition of our assay, Figure 5D and 5E respectively). INTACT associated high expression of AP000892.6 with decreased risk of hypertension and CAD (Figure 5F). This SNP is predicted to modulate binding of GABP (a known repressor of transcription(Genuario and Perry, 1996)) and ETV1 at this site (Figure 5C). These transcription factors are upregulated in caffeine exposed endothelial cells (Findley et al., 2019) (Figure 5B). This increase in expression uncovers allelic differences in gene regulation which are not detected in the absence of caffeine, likely because of the low expression of the repressor. The allelic differences in binding of these factors should lead to allelic differences in expression of AP000892.6. Accordingly, the reference allele for this variant exhibited lower activity in response to caffeine in our BiT-STARR-seq experiments (Figure 5E). This effect is consistent with the GTEx artery eQTL for AP000892.6 (Figure 5D). In summary, caffeine induces higher expression of the ETV1 and GABP transcription factors, which then bind preferentially to the reference allele at rs4938344, this results in lower expression of AP000892.6 and increased risk for CAD and hypertension (Figure 5A). AP000892.6 interacts with the RB1CC1 RNA (Gong et al., 2018), which may play a role in atherosclerosis via its function in forming the autophagosome (Chen et al., 2021).

rs4527034 is an eQTL regulating the KAT8 gene. The reference allele at this locus, A, results in decreased expression of KAT8 (as measured in GTEx and in the control condition of our assay, Figure 6D and 6E respectively). INTACT associated high expression of KAT8 with increased risk of hypertension (Figure 6F). This SNP is predicted to modulate binding of the TERF2IP transcription factor at this site (Figure 6C). TERF2IP is upregulated in caffeine exposed endothelial cells (Findley et al., 2019) (Figure 6B). This increase in expression may saturate all binding sites in the caffeine condition, while the transcription factor may only bind to the preferential allele in the control condition. The allelic differences in binding of these factors should lead to allelic differences in expression of KAT8 in the control condition, which is what we observe both in our BiT-STARR-seq experiments (Figure 6E) and in GTEx artery eQTL for KAT8 (Figure 6D). In summary, in the absence of caffeine, TERF2IP binds preferentially to the reference allele at rs4527034, this results in lower expression of KAT8 and reduced risk for hypertension. In the presence of caffeine, TERF2IP is upregulated, resulting in increased binding and lower expression of KAT8, independently of the genotype, with an expected overall protective effect on hypertension. Confirming this potential mechanism for disease risk, TERF2IP expression levels were found to affect plaque formation in a mouse model (Kotla et al., 2019). High expression of KAT8, a histone acetyltransferase, also coincides with atherosclerotic progression, and histone acetylation increases in plaques within vascular endothelial cells (Greißel et al., 2016; Zhang et al., 2018).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Luca F, Pique-Regi R. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. NCBI Gene Expression Omnibus. GSE221514 [DOI] [PMC free article] [PubMed]
    2. Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Pique-Regi R, Luca F. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. Zenodo. [DOI] [PMC free article] [PubMed]
    3. Findley AS, Richards AL, Petrini C, Alazizi A, Shanku AG, Davis GO, Hauff N, Sorokin Y, Wen X, Luca F, Doman E, Pique-Regi R. 2019. Gene-Environment Interactions (GxE) and Complex Traits. NCBI Gene Expression Omnibus. phs001176.v3.p1 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Supplementary file 1. Differential activity results.

    Output from DESeq2. Column ‘V1’ is the genomic position (hg19) of the SNP within the construct being tested, ‘baseMean’ refers to normalized expression, ‘log2FoldChange’ refers to the effect size, ‘lfcSE’ is the standard error of the effect size, ‘stat’ is the log2FoldChange divided by lfcSE, ‘pvalue’ is the nominal p-value, and ‘padj’ is the adjusted p-value. Please refer to DESeq2 documentation for additional details.

    elife-85235-supp1.txt (2.1MB, txt)
    Supplementary file 2. PWMScan results.

    Includes all variants in the designed library that are within a motif from the JASPAR 2022 CORE Vertebrates database determined by PWMScan. Columns 1–3 are the genomic position of the test SNP, column 4 is the rsID for the test SNP, and column 5 is the ID of the motif in JASPAR.

    elife-85235-supp2.txt (11.5MB, txt)
    Supplementary file 3. Characterization of regulatory regions and variants.

    Includes contingency tables for all Fisher’s exact tests reported in this study. Please see the legend provided within the file. Briefly, A1–C5 is the contingency table and summary of results for ASE variants within open chromatin regions, A6–C10 for ASE variants overlapping with artery eQTLs, A11–C15 for cASE variants within open chromatin, and A16–C20 for cASE variants overlapping with artery eQTLs.

    elife-85235-supp3.xlsx (10.1KB, xlsx)
    Supplementary file 4. Allele-specific effects and conditional allele-specific effects results.

    Output from ASE/cASE analysis. The ‘identifier’ column is the SNP/direction pair, ‘meta_estimate’ is the effect size (of the ASE), ‘meta_se’ is the standard error of the effect size, ‘n’ is the number of replicates containing nonzero read counts, ‘DNA_prop’ is the proportion of reads (reference/alternate) present in the DNA library, ‘meta_z’ is the z-score, ‘meta_p’ is the nominal ASE p-value, ‘meta_padj’ is the adjusted ASE p-value before filtering for SNP/direction pairs where n > 3, ‘new_padj’ is the adjusted ASE p-value after filtering for SNP/direction pairs where n > 3, ‘group’ refers to the applicable condition for the statistic (.x suffix for caffeine, .y suffix for control), ‘case_z’ is ZT-ZC, ‘case_p’ is the nominal p-value for cASE, and ‘case_padj’ is the adjusted p-value for cASE.

    elife-85235-supp4.txt (11.9MB, txt)
    Supplementary file 5. Fine-mapped artery eQTLs with significant ASE or cASE.

    The ‘chr’ column contains the chromosome information, ‘pos’ is the genomic position (0-based, hg19), ‘pos1’ is the genomic position (1-based, hg19), ‘identifier’ is the SNP/direction pair, ‘rsID’ refers to rsID, ‘padj_min’ refers to the minimum p-adjusted value for ASE (across conditions), ‘ID’ is an ID to identify the genomic position in the format of chr_pos1, ‘PIP’ is the posterior inclusion probability as calculated by DAP-G, ‘gene’ refers to the Ensembl gene ID, ‘tissue’ refers to the GTEx artery tissues (tibial, aorta, or coronary), ‘group’ identifies if the variant has significant ASE or significant cASE, ‘case_p’ refers to the cASE nominal p-value, and ‘case_padj’ refers to the cASE adjusted p-value.

    Supplementary file 6. Variants that regulate putatively casual genes as identified via INTACT.

    The ‘Identifier’ column is the SNP/direction pair, ‘ID’ is an ID to identify the genomic position in the format of chr_pos1 (see above), ‘Gene ID’ refers to the Ensembl gene ID, ‘Gene symbol’ refers to the gene symbol, ‘Trait’ is CAD or hypertension (HTN), ‘Tissue’ refers to the GTEx artery tissues (tibial, aorta, or coronary), ‘DAP-G PIP’ is the posterior inclusion probability as calculated by DAP-G, ‘GWAS z-score’ is the z-score for the GWAS of the corresponding trait, ‘GWAS pvalue’ is the nominal p-value for the GWAS of the corresponding trait, ‘PTWAS-INTACT PCG’ is the probability of being a putative causal gene as calculated by INTACT, ‘PTWAS-INTACT FDR’ is the FDR statistic as calculated by INTACT, ‘INTACT z-score’ is the z-score statistic as calculated by INTACT, ‘cASE_p’ is the cASE nominal p-value, ‘cASE_padj_(original)’ is the cASE adjusted p-value, ‘cASE_z’ is ZT-ZC, ‘DE_construct?’ indicates if the construct is significantly differentially active as defined by our assay, ‘Caffeine_ASE_z’ is the z-score for ASE (caffeine condition), ‘Water_ASE_z’ is the z-score for ASE (control condition), ‘centiSNP motifs’ refers to motifs the SNP is within as identified by centiSNP, and ‘centiSNP category’ refers to if the motif is a footprintSNP, effectSNP, or switchSNP as identified by centiSNP.

    elife-85235-supp6.xlsx (11.8KB, xlsx)
    Supplementary file 7. Read counts.

    Read counts for all libraries at different steps of the data processing pipeline. ‘Library Name’ is an identifier used for the sample, ‘Treatment’ is the condition (caffeine or control), ‘Dedup’ refers to the number of reads after deduplication, and ‘>1 read either direction’ describes the number of reads that have >1 read for each allele in either direction.

    MDAR checklist

    Data Availability Statement

    FASTQ files and read count data are available at the GEO accession number GSE221514. Supplemental files are available at https://doi.org/10.5281/zenodo.7327508.

    The following datasets were generated:

    Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Luca F, Pique-Regi R. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. NCBI Gene Expression Omnibus. GSE221514

    Boye C, Kalita C, Findley A, Alazizi A, Wei J, Wen X, Pique-Regi R, Luca F. 2024. Characterization of caffeine response regulatory variants in vascular endothelial cells. Zenodo.

    The following previously published dataset was used:

    Findley AS, Richards AL, Petrini C, Alazizi A, Shanku AG, Davis GO, Hauff N, Sorokin Y, Wen X, Luca F, Doman E, Pique-Regi R. 2019. Gene-Environment Interactions (GxE) and Complex Traits. NCBI Gene Expression Omnibus. phs001176.v3.p1


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES