Skip to main content
Nature Communications logoLink to Nature Communications
. 2018 Dec 3;9:5141. doi: 10.1038/s41467-018-07340-5

GWAS and colocalization analyses implicate carotid intima-media thickness and carotid plaque loci in cardiovascular outcomes

Nora Franceschini 1,#, Claudia Giambartolomei 2,#, Paul S de Vries 3, Chris Finan 4, Joshua C Bis 5, Rachael P Huntley 4, Ruth C Lovering 4, Salman M Tajuddin 6, Thomas W Winkler 7, Misa Graff 1, Maryam Kavousi 8, Caroline Dale 9, Albert V Smith 10,11, Edith Hofer 12,13, Elisabeth M van Leeuwen 8, Ilja M Nolte 14, Lingyi Lu 15, Markus Scholz 16,17, Muralidharan Sargurupremraj 18, Niina Pitkänen 19, Oscar Franzén 20,21, Peter K Joshi 22, Raymond Noordam 23, Riccardo E Marioni 24,25, Shih-Jen Hwang 26,27, Solomon K Musani 28, Ulf Schminke 29, Walter Palmas 30, Aaron Isaacs 8,31, Adolfo Correa 28, Alan B Zonderman 6, Albert Hofman 8,32, Alexander Teumer 33,34, Amanda J Cox 35,36, André G Uitterlinden 8,37, Andrew Wong 38, Andries J Smit 39, Anne B Newman 40, Annie Britton 41, Arno Ruusalepp 21,42,43, Bengt Sennblad 44,45, Bo Hedblad 46, Bogdan Pasaniuc 2,47, Brenda W Penninx 48, Carl D Langefeld 15, Christina L Wassel 49, Christophe Tzourio 18, Cristiano Fava 46,50, Damiano Baldassarre 51,52, Daniel H O’Leary 53, Daniel Teupser 17,54, Diana Kuh 38, Elena Tremoli 52,55, Elmo Mannarino 56, Enzo Grossi 57, Eric Boerwinkle 3,58, Eric E Schadt 20,21, Erik Ingelsson 59,60,61, Fabrizio Veglia 52, Fernando Rivadeneira 8,37, Frank Beutner 62, Ganesh Chauhan 18,63, Gerardo Heiss 1, Harold Snieder 14, Harry Campbell 22, Henry Völzke 33,34, Hugh S Markus 64, Ian J Deary 24,65, J Wouter Jukema 66, Jacqueline de Graaf 67, Jacqueline Price 22, Janne Pott 16,17, Jemma C Hopewell 68, Jingjing Liang 69, Joachim Thiery 17,70, Jorgen Engmann 4, Karl Gertow 44, Kenneth Rice 71, Kent D Taylor 72, Klodian Dhana 73, Lambertus A L M Kiemeney 74, Lars Lind 75, Laura M Raffield 76, Lenore J Launer 6, Lesca M Holdt 17,54, Marcus Dörr 34,77, Martin Dichgans 78,79, Matthew Traylor 64, Matthias Sitzer 80, Meena Kumari 41,81, Mika Kivimaki 41, Mike A Nalls 82,83, Olle Melander 46, Olli Raitakari 19,84, Oscar H Franco 8,85, Oscar L Rueda-Ochoa 8,86, Panos Roussos 20,87,88, Peter H Whincup 89, Philippe Amouyel 90,91,92, Philippe Giral 93, Pramod Anugu 28, Quenna Wong 94, Rainer Malik 78, Rainer Rauramaa 95,96, Ralph Burkhardt 17,97,98, Rebecca Hardy 38, Reinhold Schmidt 12, Renée de Mutsert 99, Richard W Morris 100, Rona J Strawbridge 44,101, S Goya Wannamethee 102, Sara Hägg 103, Sonia Shah 4, Stela McLachlan 22, Stella Trompet 23,66, Sudha Seshadri 104, Sudhir Kurl 105, Susan R Heckbert 5,106, Susan Ring 107,108, Tamara B Harris 6, Terho Lehtimäki 109,110, Tessel E Galesloot 74, Tina Shah 4, Ulf de Faire 111,112, Vincent Plagnol 113, Wayne D Rosamond 1, Wendy Post 114, Xiaofeng Zhu 69, Xiaoling Zhang 27,115, Xiuqing Guo 72,116, Yasaman Saba 117; MEGASTROKE Consortium, Abbas Dehghan 8,118, Adrie Seldenrijk 119, Alanna C Morrison 3, Anders Hamsten 44, Bruce M Psaty 106,120, Cornelia M van Duijn 8,68, Deborah A Lawlor 107,108, Dennis O Mook-Kanamori 99,121, Donald W Bowden 122, Helena Schmidt 117, James F Wilson 22,123, James G Wilson 124, Jerome I Rotter 72,116, Joanna M Wardlaw 24,125, John Deanfield 4, Julian Halcox 126, Leo-Pekka Lyytikäinen 109,110, Markus Loeffler 16,17, Michele K Evans 6, Stéphanie Debette 18, Steve E Humphries 127, Uwe Völker 34,128, Vilmundur Gudnason 10,11, Aroon D Hingorani 4, Johan L M Björkegren 20,21,42,129,, Juan P Casas 9, Christopher J O’Donnell 130,131,132,
PMCID: PMC6277418  PMID: 30510157

Abstract

Carotid artery intima media thickness (cIMT) and carotid plaque are measures of subclinical atherosclerosis associated with ischemic stroke and coronary heart disease (CHD). Here, we undertake meta-analyses of genome-wide association studies (GWAS) in 71,128 individuals for cIMT, and 48,434 individuals for carotid plaque traits. We identify eight novel susceptibility loci for cIMT, one independent association at the previously-identified PINX1 locus, and one novel locus for carotid plaque. Colocalization analysis with nearby vascular expression quantitative loci (cis-eQTLs) derived from arterial wall and metabolic tissues obtained from patients with CHD identifies candidate genes at two potentially additional loci, ADAMTS9 and LOXL4. LD score regression reveals significant genetic correlations between cIMT and plaque traits, and both cIMT and plaque with CHD, any stroke subtype and ischemic stroke. Our study provides insights into genes and tissue-specific regulatory mechanisms linking atherosclerosis both to its functional genomic origins and its clinical consequences in humans.


Carotid intima-media thickness (cIMT) and plaque are associated with subclinical atherosclerosis and coronary heart disease (CHD). Here, the authors identify and prioritize genetic loci for cIMT and plaque by GWAS and colocalization approaches and further demonstrate genetic correlation with CHD and stroke.

Introduction

Atherosclerosis is characterized by an accumulation of lipid-rich and inflammatory deposits (plaques) in the sub-intimal space of medium and large arteries. Plaque enlargement leads to blood flow limitation, organ ischemia, and/or tissue necrosis. Plaque rupture can lead to abrupt vascular occlusion, which underlies clinical cardiovascular events, including myocardial infarction and ischemic stroke. Coronary heart disease (CHD) accounts for one in seven deaths, and stroke accounts for one in 20 deaths in the US1. Because atherosclerosis has a long pre-clinical phase, early detection of atherosclerosis using non-invasive methods may help identify individuals at risk for atherosclerotic clinical events2, and provides an opportunity for prevention. Subclinical atherosclerosis can be detected by B-mode ultrasound measurement of common carotid artery intima-media thickness (cIMT) or carotid plaques1.

Subclinical and clinical atherosclerosis has known genetic components3. Genome-wide association studies (GWAS) of subclinical atherosclerosis have previously identified three loci significantly associated with cIMT at ZHX2, APOC1, and PINX1, and two loci associated with common carotid artery plaque at PIK3CG and EDNRA4. An exome-wide-association study identified significant associations of the APOE ε2 allele with cIMT and coronary artery calcification5. The APOE single nucleotide polymorphism (SNP) rs7412 is in linkage disequilibrium (LD) with the APOC1 variant, thus representing the same signal. Additional GWAS-identified associations were reported for carotid plaque at the 9p21 and SFXN2 loci6, and for cIMT at the CFDP1-TMEM170A locus7. However, these prior studies were of limited sample size and genomic coverage, and failed to investigate the etiological role that subclinical atherosclerosis may have on atherosclerotic clinical events.

Herein, we perform a large meta-analysis of GWAS of subclinical atherosclerosis by analyzing 1000 Genomes imputed genotype data obtained from collaborations between the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium8 and the University College London-Edinburgh-Bristol (UCLEB) consortium9. One of the greatest challenges in the translation of GWAS findings to biological understanding is related to the limited access to RNA expression data from disease-relevant tissues. Consequently, we sought to reliably identify the tissue-specific gene regulatory functions responsible for the GWAS signals by prioritizing candidate genes for established and novel loci of cIMT and carotid plaque using statistical methods for colocalization10. These methods integrate identified loci with expression quantitative loci (eQTLs) inferred from cardiovascular disease-relevant genetics of RNA expression, the Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task (STARNET) study, where arterial wall and metabolic-related RNA samples were collected from up to 600 patients with CHD11. We also evaluate the relationships of cIMT and carotid plaque with clinically apparent CHD and stroke using summary data from two large consortia. In summary, our study sequentially assesses the genetic epidemiology and tissue-specific patterns of gene regulation involved in the formation of subclinical atherosclerosis traits across cardiovascular disease-related tissues.

Results

Study description

The study design is shown in Fig. 1. We undertook meta-analysis of GWAS in individuals of European ancestry for cIMT (up to 71,128 participants from 31 studies) and carotid plaque (up to 48,434 participants from 17 studies; 21,540 with defined carotid plaque) (Supplementary Table 1). cIMT and plaque were evaluated using high-resolution B-mode ultrasonography and reading protocols as previously reported4. Carotid plaque was defined by atherosclerotic thickening of the common carotid artery wall or the proxy measure of luminal stenosis greater than 25% (Supplementary Table 2). Each cohort performed association analyses using standardized protocols (Methods) for variants imputed based on the 1000 Genomes Project (1000G) phase 1 v3 reference. Extensive quality control (QC) was applied to data, and there was little evidence for population stratification in any of the studies for either trait (Supplementary Table 3). The study-specific results were combined using fixed-effect meta-analyses, given the low heterogeneity across studies (0% heterogeneity)12.

Fig. 1.

Fig. 1

Overall study design. a GWAS meta-analyses of cIMT and carotid plaque for gene discovery. b Local and genome-wide shared genetic basis using gene expression and clinical outcomes GWAS data

GWAS meta-analyses of cIMT and carotid plaque

For cIMT, 11 loci had at least one SNP association that reached the genome-wide association threshold (p < 5 × 10−8), of which eight were newly described and three have been previously reported (Table 1). The closest genes for the eight loci were: 1q32.2 intergenic (rs201648240), ATP6AP1L (rs224904), AIG1 (rs6907215), PIK3CG (rs13225723), MCPH1 (rs2912063), SGK223 (rs11785239), VTI1 (rs1196033), and CBFA2T3 (rs844396). For three loci previously reported, the closest genes were ZHX2 (rs148147734), PINX1 (rs200482500), and APOE (rs7412).

Table 1.

Loci significantly associated with cIMT and plaque GWAS

SNP Chr:position Nearest coding gene Alleles (effect/other) Effect allele freq. Beta (SE) p N
Newly identified loci for cIMT
rs201648240 1:208953176-indel LINC01717 −/AA 0.83 −0.0062 (0.0011) 4 × 10−9 54,752
rs224904 5:81637916 ATP6AP1L C/G 0.95 −0.0088 (0.0016) 5 × 10−8 68,962
rs6907215 6:143608968 AIG1 T/C 0.60 −0.0040 (0.0007) 5 × 10−8 64,586
rs13225723 7:106416467 PIK3CG A/G 0.22 0.0052 (0.0009) 3 × 10−9 68,070
rs2912063 8:6486033 MCPH1 A/G 0.71 0.0045 (0.0008) 9 × 10−9 67,401
rs11785239 8:8205010 SGK223 T/C 0.65 −0.0043 (0.0008) 9 × 10−9 67,107
rs11196033 10:114410998 VTI1A A/C 0.48 0.0042 (0.0008) 4 × 10−8 57,995
rs844396 16:88966667 CBFA2T3 T/C 0.30 −0.0051 (0.0009) 6 × 10−9 50,377
Newly identified loci for plaque
rs200495339 19:11189298-indel LDLR −/G 0.11 −0.1023 (0.0179) 1  × 10−8 36,569
Known loci for cIMT
rs148147734a 8:123401537-indel ZHX2 −/G 0.54 0.0050 (0.0007) 3  × 10−11 58,141
rs200482500a 8:10606223-indel PINX1 −/GTACC 0.52 0.0056 (0.0008) 7 × 10−12 58,141
rs7412a 19:45412079 APOE T/C 0.08 −0.0119 (0.0015) 1 × 10−14 44,607
Known loci for plaque
rs11413744b 4:148395284-indel EDNRA −/T 0.86 −0.1586 (0.0253) 4 × 10−10 39,577
rs17477177b 7:106411858 PIK3CG T/C 0.79 −0.1305 (0.0197) 4 × 10−11 47,863
rs9632884b 9:22072301 9p21 C/G 0.48 0.1127 (0.0163) 5 × 10−12 45,943
rs113309773b 16:75432686-indel CFDP1- TMEM170A −/C 0.46 −0.1259 (0.0194) 9 × 10−11 37,104

p = p-values of association from linear regression analysis, N = total number in meta-analyses

aPublished cIMT SNP in LD with our most significant SNP: rs11781551 (r2 = 0.95 with rs148147734), rs6601530 (r2 = 0 with rs200482500), and rs445925 (r2 = 0.60 with rs7412)

bPublished plaque SNP in LD with our most significant SNP: rs1878406 (r2 = 0.98 with rs11413744), rs17398575 (r2 = 0.8 with rs17477177), rs9644862 (r2 = 0.79 with rs9632884), and rs4888378 (r2 = 0.94 with rs113309773)

The PIK3CG is a newly described locus for cIMT, but has been previously reported in a GWAS of carotid plaque4. The two signals on chromosome 8 near MCPH1 (rs2912063) and SGK223 (rs11785239) were confirmed to be independent through conditional analysis (Supplementary Table 4). At the PINX1 locus, the lowest association p-value variant (rs200482500) was not in LD with the previously reported associated variant in the region (rs6601530, r2 = 0.0, Table 1), thus representing an independent signal at this locus. Two additional loci for cIMT had an SNP that reached suggestive evidence for association (p < 1.0 × 10−7) including an SNP nearby APOB (rs515135) and an intronic low frequency variant at ATG4B (rs139302128, minor allele frequency [MAF] = 0.03) (Supplementary Table 5).

The GWAS meta-analysis for carotid plaque identified five loci, of which one has not been previously described (nearby gene LDLR) (Table 1). At four known loci associated with carotid plaque (nearby genes EDNRA, PIK3CG, CFDP1-TMEM170A, and at the 9p21 region), the most significantly associated variants were in LD with the previously reported SNPs (Table 1)4,6,7, indicating that these SNPs mark the same association at each locus. Two suggestive loci (p < 10−7) were also identified nearby the genes TMCO5B and STEAP2-AS1 (Supplementary Table 5). Conditional analyses confirmed the presence of a single independent signal at each locus. Manhattan and QQ plots from the meta-analysis of cIMT and carotid plaque are shown in Supplementary Figure 1 and regional plots in Supplementary Figure 2. Forest Plots for all loci are shown in Supplementary Figure 3.

Regulatory annotations of GWAS SNPs for cIMT/carotid plaque

To better define potentially causal variants within the identified genetic risk loci, we jointly analyzed the GWAS data with functional genomic information such as annotations on active transcription sites or open chromatin regions (i.e., performed a fine-mapping functional genome-wide association analysis using fGWAS13). Only variants in the PINX1 region were found to have a high probability that its association with cIMT is driven by SNPs that fall within transcription sites in adipose-derived mesenchymal stem cells at a DNaseI-hypersensitive site (Supplementary Figure 4), a finding that provides a down-stream mechanistic explanation for the cIMT signal in the PINX1 locus.

To further explore the regulatory functions of variants in the identified loci for cIMT and carotid plaque, we investigated whether the identified lead SNPs were also eQTLs using vascular RNAseq data from GTEx (aorta, coronary and tibial arteries, heart atrial appendage, and heart left ventricle) and from the coronary artery disease cohort of STARNET (i.e., from the atherosclerotic-lesion-free internal mammary artery [MAM] and atherosclerotic aortic root [AOR]). Lead SNP associated with cIMT and carotid plaque (rs13225723) in the PIK3CG locus was found to be vascular-specific eQTLs for CCDC71L and PRKAR2B in GTEx aorta as well as in STARNET AOR and MAM tissues (Table 2, Fig. 2), suggesting that the genetic regulation of these two genes are responsible for risk variation in cIMT and carotid plaque development in this locus.

Table 2.

Gene expression results for significant SNPs in GTEx and STARNET tissues

SNP eQTLa (Gene, p) GTEx eQTLa (Gene, p) STARNET tissues
AORb HEART (ATR/VEN)c AOR MAM
rs201648240 CAMK1G, 0.0094
AL031316.1, 0.0040
CD34,0.00532
TRAF3IP3, 0.0097
rs6907215 AL023584.1, 0.005384704 (VEN) ENSG00000217648, 0.00046 ENSG00000217648, 0.8 × 10−5
rs13225723 AC005050.1, 1 × 10−10
ENSG00000177820.5, 7.0 × 10−5
CCDC71L, 5 × 10−6
PRKAR2B, 4 × 10−8
PIK3CG, 10 × 10−3
CCDC71L, 6 × 10−36
PRKAR2B, 7 × 10−7
SYPL1, 0.0043
CCDC71L, 3 × 10−33
PRKAR2B, 6 × 10−8
NAMPT, 6 × 10−6
rs2912063 MCPH1, 0.0041 ENSG00000271743.1, 0.0093 (VEN) MCPH1-AS1, 0.0020
rs11785239 AC022784.1, 0.0078 (VEN) ERI1, 0.0069 PPP1R3B, 0.0036
rs844396 ENSG00000141012.8, 0.003
AC092384.2, 0.001
CBFA2T3, 1 × 10−7
ZNF469, 0.004 (ATR)
AC092384.3, 5 × 10−6 (ATR)
AC092384.1, 0.002 (ATR)
CBFA2T3, 0.0004 (ATR)
ZNF469, 0.002 (VEN)
AC138028.4, 0.001 (VEN)
ENSG00000224888.3, 0.009(VEN)
PIEZO1, 0.0004 (VEN)
GALNS, 0.004 (VEN)
RPL13, 0.0024
ZNF276, 0.0070
TRAPPC2L, 0.0091
TRAPPC2L, 0.0040
ZNF276, 0.0059
rs200495339 ENSG00000267105.1, 0.0005 (VEN)
rs148147734 DERL1, 0.0082
rs200482500 AF131215.6, 0.005
AF131215.5, 0.001
AF131215.5, 0.002 (ATR)
AF131215.6, 0.003 (VEN)
AF131215.5, 0.004 (VEN)
rs7412 ENSG00000267163.1, 0.007
rs11413744 PRMT9, 0.004
rs17477177 ENSG00000267052.1, 6 × 10−11
ENSG00000177820.5, 5 × 10−6
CCDC71L, 4 × 10−7
PRKAR2B, 2 × 10−8
BCAP29, 0.002 (ATR) CCDC71L, 2 × 10−37
PRKAR2B, 6 × 10−7
SYPL1, 0.0091
CCDC71L,1 × 10−33
PRKAR2B, 2 × 10−8
NAMPT, 1 × 10−5
rs9632884 DMRTA1, 0.007 (ATR) CDKN2B, 2 × 10−3 CDKN2B, 2 × 10−3
rs113309773 BCAR1, 6 × 10−11
ENSG00000261783.1, 2 × 10−16
GABARAPL2, 0.004
ENSG00000261783.1, 1 × 10−5 (ATR)
ENSG00000166822.8, 0.005 (ATR)
ENSG00000261783.1, 0.0003 (VEN)
ZFP1, 4 × 10−4
AC009078.2, 0.002
BCAR1, 3 × 10−12
CFDP1, 0.002
TMEM170A, 0.009

p = p-values of association from linear regression analysis

aThe lead SNP from GWAS is considered an eQTL if the cis-association has a nominal p-value of association <0.01. Multiple but not all lead SNPs reach genome-wide significance (p < 10−4).

bThis includes aorta (AOR)

cThis includes heart atrial (ATR) and heart left ventricle (VEN)

Fig. 2.

Fig. 2

Pairwise colocalization results for genes identified for cIMT and carotid plaque GWAS meta-analysis with STARNET expression datasets. Red indicates a high posterior probability of colocalization and blue a high probability of no colocalization of the same SNP with tissue eQTLs

Colocalization analysis of GWAS data and STARNET eQTLs

To identify further candidate genes in tissues affected by atherosclerosis that had strong evidence of sharing the same variant for cIMT and carotid plaque as found in our GWAS, we conducted pairwise colocalization analysis of these genetic variants with cis-eQTLs in the STARNET study10.

The pairwise colocalization analysis is based on coloc, a Bayesian statistical methodology that tests pairwise colocalization of SNPs in GWAS with eQTLs and, in this fashion, generates posterior probabilities for each locus weighting the evidence for competing hypothesis of either no colocalization or sharing of a distinct SNP at each locus10. We used summary statistics from all SNPs within a 200-kb window around each gene covered by the eQTL datasets (N = 18,705, see Methods), and analyzed each eQTL-GWAS dataset pair (Supplementary Table 6). A posterior probability of ≥75% was considered strong evidence of the tissue-specific eQTL-GWAS pair influencing both the expression and GWAS trait at a particular region. Results for this analysis are shown in Table 3 and Supplementary Figure 5. The strongest evidence for an effect on gene expression within the regions identified in our standard GWAS meta-analysis was for the CCDC71L and PRKAR2B genes at the previously described chromosome 7 cIMT locus (PIK3CG in Table 2, Fig. 2). These genes showed evidence of colocalization for both cIMT and carotid plaque in AOR and MAM tissues (Table 3, Fig. 3). CCDC71L had the highest probability (>95%) for colocalization for cIMT, and MAM and AOR tissue eQTLs, and for carotid plaque, and MAM and AOR tissue eQTLs. We found a low probability of colocalization of the SNP with the PIK3CG gene expression (<1%).

Table 3.

Colocalization of cIMT and plaque with eQTLs in tissues from patients with CHD in STARNET tissues for genes/tissues combinations that have more than 75% probability to share the same associated variant

Region (chr:start-stop) Trait Gene SNP with best joint probability p, BETA (SE), Tissue posterior probability (PPA)a Direction of effect GWAS/eQTL
cIMT /plaque GWAS AOR eQTL MAM eQTL
chr3:63561280-65833136 cIMT ADAMTS9 rs17676309 (T/C) 2 × 10−6,
-0.0035 (0.0007)
2 × 10−25,
−0.65 (0.06)
PPA=0.93
1 × 10−23,
−0.61 (0.06)
PPA=0.89
−/−
chr10:99017729-101017321 cIMT LOXL4 rs55917128 (T/C) 5 × 10−7,
0.0037 (0.0007)
6 × 10−8,
0.33 (0.06)
PPA=0.79
+/+
chr7:105299372-107743409 cIMT CCDC71L
PRKAR2B
rs12705390 (A/G) 5 × 10−9,
0.0049 (0.0008)
2 × 10−37,
0.81 (0.06)
PPA=0.97
6 × 10−7,
0.34 (0.07)
PPA=0.93
1 × 10−33,
0.755 (0.06)
PPA=0.97
2 × 10−8,
0.368 (0.06)
PPA=0.96
+/+
+/+
Plaque CCDC71L
PRKAR2B
rs12705390 (A/G) 4 × 10−8,
0.12 (0.022)
2 × 10−37,
0.80 (0.06)
PPA=0.97
6 × 10−7,
0.33 (0.07)
PPA=0.93
1 × 10−33,
0.75 (0.06)
PPA=0.97
2 × 10−8,
0.37 (0.06)
PPA=0.96
+/+
+/+

PPA posterior probability of sharing same SNP higher than 75%, cIMT common carotid artery intima-media thickness, AOR aorta, MAM mammary artery

aThis signal reaches genome-wide significance in cIMT/plaque, and reaches a high probability of being mediated by the genes in AOR and MAM

Fig. 3.

Fig. 3

Association results at the CCDC71L locus (chromosome 7), showing a high posterior probability of a shared variant for cIMT and carotid plaque in AOR and MAM eQTLs. −log10(p) SNP association p-values for cIMT (plot A) and carotid plaque (plot B), and eQTL in AOR (plot C) and eQTL in SF (plot D). Association results in SF tissue have a low probability of a shared signal with cIMT and carotid plaque, possibly indicating a different mechanism in this tissue. eQTLs in MAM are identical to AOR and not shown. The p-values were calculated by fitting a linear regression model with cIMT or plaque as dependent variable and imputed SNPs as independent variables. Each dot is an SNP and the color indicates linkage disequilibrium (r2) with the best hit (in purple)

The eQTL associations at two additional loci (ADAMTS9, LOXL4) in MAM or AOR showed evidence of colocalization with cIMT or carotid plaque, although GWAS association p-values at these loci did not meet the genome-wide significance threshold (Table 3, Supplementary Figure 5). Albeit with weaker magnitudes, the expression of these two genes were also associated with the top colocalizing SNPs as detected in RNAseq data in GTEx aorta (rs17676309, chr3:64730121, ADAMTS9, p = 0.0003 and rs55917128, chr10:100023359, LOXL4, p = 0.0005).

Colocalization of CHD and stroke GWAS and STARNET eQTLs

We next assessed if the four genes (CCDC71L, PRKAR2B, ADAMTS9, LOXL4) identified through colocalization of cIMT/carotid plaque with tissue-specific eQTLs also showed evidence for colocalization with CHD and stroke traits (Supplementary Data 1 and Supplementary Figure 6). We used GWAS summary data for CHD (CARDIoGRAMPlusC4D), and stroke subtypes (MEGASTROKE) and AOR and MAM STARNET tissue eQTLs for these analyses. CCDC71L and PRKAR2B had suggestive evidence of sharing the same variant with large vessel disease stroke in both AOR and MAM tissues (probability of colocalization ≥20%, Supplementary Data 1). In contrast, there was strong evidence (≥75%) to reject a shared variant for CHD and eQTLs at this locus, thus suggesting there is atherosclerotic outcome specificity at vascular level for this locus (Supplementary Figure 5). Three of these genes, CCDC71L, PRKAR2B, and ADAMTS9, showed evidence for shared genetic influences of cIMT or carotid plaque on CHD/stroke outcomes when testing the joint association using moloc, a multiple-trait extension of coloc14 (Supplementary Table 7). We also highlight the expression of KIAA1462 gene in MAM, carotid plaque/cIMT, and CHD, which were positively correlated (Supplementary Figure 7). This gene has suggestive evidence of pairwise colocalization with carotid plaque (67% of probability of shared variant between carotid plaque and eQTL in MAM), as well as a high probability of shared variant between MAM eQTL expression of this gene, GWAS carotid plaque or cIMT, and CHD traits (Supplementary Table 7). We note, however, that the GWAS signal for outcomes across the datasets did not reach genome-wide significance and larger sample sizes may be needed to strengthen the evidence for involvement in disease outcomes.

Genetic correlations of cIMT/carotid plaque and clinical outcomes

To provide etiological insights into the role of measures of subclinical atherosclerosis and major atherosclerotic disease outcomes such as CHD and ischemic stroke, we quantified the genetic correlation using cross-trait LD score regression, a method that estimates genetic correlation across different traits using summary level data15. We used summary statistics between cIMT/carotid plaque with CHD and stroke meta-analysis of GWAS. Both cIMT and carotid plaque had positive significant genetic correlations with CHD (all p < 0.05 after adjusting for multiple testing), though the magnitude of the correlation was twice as strong for carotid plaque (0.52) as for cIMT (0.20) (Table 4). There was also evidence for genetic correlations between cIMT with any stroke and ischemic stroke subtype.

Table 4.

Genetic correlation between CHD and stroke traits with cIMT and plaque, and cIMT with plaque using LD score and meta-GWAS

Cardiovascular disease trait Subclinical atherosclerosis trait Genetic correlation SE z p
CHDa cIMT 0.20 0.05 4.1114 4 × 10−5
Any stroke cIMT 0.30 0.07 4.2301 2.3 × 10−5
Ischemic strokeb cIMT 0.31 0.07 4.646 3.4 × 10−6
Cardio-embolic strokeb cIMT 0.10 0.09 1.0729 0.28
Small vessel disease strokeb cIMT 0.33 0.18 1.8728 0.06
CHDa Carotid plaque 0.52 0.08 6.4263 1.3 × 10−10
Any strokeb Carotid plaque 0.28 0.10 2.7097 0.007
Ischemic strokeb Carotid plaque 0.27 0.10 2.6578 0.008
Cardio-embolic strokeb Carotid plaque 0.06 0.14 0.4684 0.64
Small vessel disease strokeb Carotid plaque −0.03 0.24 −0.1344 0.89
Plaque cIMT 0.40 0.10 3.9667 7.3 × 10−5

aCARDIoGRAMPlusC4D

bMEGASTROKE consortium. Unable to estimate the genetic correlations with large vessel disease

Pathway analysis and druggability

Gene Ontology (GO) analyses of genes identified in the loci for cIMT and carotid plaque according to our meta-analysis of GWAS (Table 1 and Supplementary Table 5) and in the colocalization analyses (Table 3, Supplementary Table 7) showed that cIMT genes are enriched in lipoprotein-related terms and cholesterol efflux, whereas carotid plaque genes are enriched in terms associated with fibroblast apoptosis (Supplementary Figure 8). Analysis of the cIMT genes using a GO Slim additionally identified several of the genes that were associated with terms describing cardiovascular development, cell adhesion, and immune processes, processes already considered relevant to atherosclerosis. Specifically, there is corroborating evidence from GO that CCDC71L, PRKAR2B, and TWIST1 are associated with cIMT/carotid plaque as they are involved in lipid metabolism, with similar support that ADAMTS9, CDH13, and KIAA1462 are associated with cIMT or carotid plaque risk as they are all involved in cell adhesion and, together with TWIST1, in cardiovascular system development (Supplementary Data 2).

From the loci associated with cIMT and carotid plaque, we identified seven genes (ATG4B, ALPL, LDLR, APOB, EDNRA, APOE, and ADAMTS9) whose encoded proteins are targets at various stages of the drug development process (Supplementary Tables 8 and 9). ADAMTS9 gene encodes a protein likely to be druggable16. ATG4B, ALPL, and LDLR are proteins being targeted by compounds in pre-clinical phase (tier 2), while APOB and EDNRA are proteins targeted by drugs in clinical phase or licensed (tier 1). APOB is the target of an approved FDA drug for treatment of familial hypercholesterolemia. EDNRA gene encodes for endothelin A receptor, against which several antagonists have been developed for the treatment of pulmonary arterial hypertension or which are in advanced clinical phase development for non-small cell lung cancer and diabetic nephropathy.

Discussion

We provide results of a large meta-analysis of GWAS of subclinical atherosclerosis and we integrate our results with tissue-specific gene expression data using eQTLs from both the early (MAM) and late advanced (AOR) atherosclerotic arterial wall from the STARNET study to enable reliable discovery of genes with biological evidence of an increased probability for conferring inherited risk of atherosclerosis development. Our discovery approach using GWAS meta-analyses identified 16 loci significantly associated with either cIMT or carotid plaque, of which nine are novel.

The integration of GWAS and tissue-specific cis-eQTLs for the joint analyses of tissue-specific eQTLs from CHD patients identified two potentially additional loci colocalizing with cIMT or carotid plaque: chr3:63561280-65833136 (ADAMTS9), chr10:99017729-101017321 (LOXL4). ADAMTS9 is a metalloproteinase involved in thrombosis and angiogenesis and has been associated with cardiometabolic traits (waist-to-hip ratio, waist circumference, and type 2 diabetes) in GWAS, and with coronary artery calcification in a gene-by-smoking interaction GWAS17,18. LOXL4 encodes a lysyl oxidase involved in crosslinks of collagen and elastin in the extracellular matrix. This family of proteins are involved in the development of elastic vessels and mechanical strength of the vessel wall, and their inhibition was associated with the development of abdominal aortic aneurysms and more severe atherosclerosis in experimental models19.

Some loci identified in our meta-analysis of GWAS include genes in known pathways for atherosclerosis, including LDLR, which is related to lipid pathways and CHD, and identified for associations with carotid plaque in our study. For most of the loci, however, the underlying gene implicated in signals are unknown. Our colocalization approach found both CCDC71L and PRKAR2B as the most likely genes at the chromosome 7 locus, where PIK3CG was previously the suggested gene. This finding is in agreement with a targeted sequencing study of subclinical atherosclerosis15. An additional SNP (rs342286) at this locus has been associated with platelets volume and reactivity, and cardiovascular traits. However, rs342286 is not in LD with our most significant SNP and it is not associated with cIMT or carotid plaque in our studies (p = 0.49 and 0.01, respectively). Of interest, the variant we identified in this study showed evidence for colocalization with cIMT/carotid plaque and large vessel disease stroke but not CHD, therefore showing tissue and outcome-specificity. CCDC71L has unknown function. PRKAR2B codes for one of the several regulatory subunits of cAMP-dependent protein kinase and its expression is ubiquitous. In vitro studies have shown that adenosine-induced apoptosis of arterial smooth muscle cells involves a cAMP-dependent pathway20.

Measures of cIMT and carotid plaque reflect vascular pathophysiologic and atherosclerosis processes, respectively, with carotid plaque more strongly reflecting atherosclerotic clinical events. An important contribution of this study is the supporting evidence for overall genetic correlations of CHD and stroke (any cause and ischemic stroke) with subclinical atherosclerosis traits, estimated using LD score methods. Further highlighting the potential biological relevance of our findings, the genetic correlations estimates for CHD were stronger for carotid plaque than for cIMT. However, cIMT and carotid plaque GWAS were correlated, and the genetic correlations estimates with stroke were similar for cIMT or carotid plaque, and not significant for carotid plaque. The colocalization analyses provided additional insights in the relationships between subclinical atherosclerosis, clinical outcomes, and tissue-specific regulation at specific genomic regions. For example, our suggestive top gene association in multi-trait colocalization for KIAA1462 included MAM eQTLs, carotid plaque, and CHD, supporting the shared genetic effects at this locus of atherosclerosis in carotid and coronary arteries. KIAA1462 has been previously reported in the same locus identified by GWAS for CHD21. This gene encodes a protein involved in cell–cell junctions in endothelial cells22, which was recently shown to be involved in pathologic angiogenic process in in vitro and in vivo experimental models23. These findings suggest that there may be important differences in vascular bed regulation at distinctive regions for atherosclerotic cardiovascular and stroke outcomes that may help to identify genes and specific targets for CHD or stroke prevention and treatment.

Additional studies in diverse and large samples across the multiple datasets are needed to explore these results further. As more summary statistics become available for other clinical end-points beyond stroke and CHD (both in terms of larger sample size and richer genome coverage), and as further refinements in clinical phenotypes emerge (e.g. from CHD to acute coronary syndrome sub-components), strategies to integrate this knowledge using methods such as moloc10 and eCAVIAR24 will continue to be essential for harnessing genome-wide findings in the drug-discovery process.

In summary, our study is a large GWAS meta-analysis of cIMT and carotid plaque. Through a sequential approach of discovery and colocalization studies, we provide deeper insights into disease causal genes of subclinical cIMT and carotid plaque formation. We confirmed three loci and identified nine novel loci in the meta-analyses of cIMT and carotid plaque. Additionally, we provide strong evidence for the role of three novel genes from our integrative analysis of GWAS and eQTL data. Moreover, the identified correlations with CHD and stroke highlight novel biological pathways that merit further assessments as novel targets for drug development.

Methods

Ethics statement

All human research was approved by the relevant institutional review boards for each study, and conducted according to the Declaration of Helsinki. All participants provided written informed consent.

Populations and phenotypes

The discovery GWAS in this study consists of a collaboration between the CHARGE 8 and the UCLEB consortia9, for genetic studies of cIMT and carotid plaque among individuals of European ancestry (Supplementary Note 1). All studies followed standardized protocols for phenotype ascertainment and statistical analyses. The descriptive characteristics of participating studies are shown in Supplementary Table 1.

cIMT and carotid plaque measures were evaluated using high-resolution B-mode ultrasonography and reading protocols as previously reported4. We used data from the baseline examination or the first examination in which carotid ultrasonography was obtained. cIMT was defined by the mean of the maximum of several common carotid artery measurements, measured at the far wall or the near wall. For most studies, this was an average of multiple measurements from both the left and right arteries. We also examined a carotid plaque phenotype, defined by atherosclerotic thickening of the carotid artery wall or the proxy measure of luminal stenosis greater than 25% (Supplementary Table 2).

Genotyping, imputation, and study-level quality control

Genotyping arrays and QC pre-imputation are shown in Supplementary Table 3. Each GWAS study conducted genome-wide imputation using a Phase 1 integrated (March 2012 release) reference panel from the 1000G Consortium using IMPUTE225 or MaCH/minimac26, and used Human Reference Genome Build 37. Sample QC was performed with exclusions based on call rates, extreme heterozygosity, sex discordance, cryptic relatedness, and outlying ethnicity. SNP QC excluded variants based on call rates across samples and extreme deviation from Hardy–Weinberg equilibrium (Supplementary Table 3). Non-autosomal SNPs were excluded from imputation and association analysis.

Pre-meta-analysis GWAS study-level QC was performed using EasyQC software27. This QC excluded markers absent in the 1000G reference panel; non A/C/G/T/D/I markers; duplicate markers with low call rate; monomorphic SNPs and those with missing values in alleles, allele frequency, and beta estimates; SNPs with large effect estimates or standard error (SE) ≥10; and SNPs with allele frequency difference >0.3 compared to 1000G reference panel. There was a total of 9,574,088 SNPs for the cIMT meta-analysis and 8,578,107 SNPs for the carotid plaque meta-analysis.

Statistical analyses

Within each study, we used linear and logistic regression to model cIMT and carotid plaque, respectively, and an additive genetic model (SNP dosage) adjusted for age, sex, and up to 10 principal components. We combined summary estimates from each study and each trait using an inverse variance weighted meta-analysis. Additional filters were applied during meta-analyses including imputation quality (MACH r2 < 0.3 and IMPUTE info <0.4), a minor allele frequency (MAF) <0.01, and SNPs that were not present in at least four studies. The genome-wide significance threshold was considered at p < 5.0 × 10−8.

To assess the evidence for independent associations at each locus attaining genome-wide significance, we performed conditional analysis in a 1-Mb genomic interval flanking the lead SNP using GCTA28. This approach uses summary meta-analysis statistics and a LD matrix from an ancestry-matched sample to perform approximate conditional SNP association analysis. The estimated LD matrix was based on 9713 unrelated individuals of European ancestry from the ARIC study, which was genotyped using an Affymetrix 6.0 array and imputed to the 1000G panel using IMPUTE225.

Gene expression analysis using GTEx

GTEx Analysis V6 (dbGaP Accession phs000424.v6.p1) eQTL results were downloaded from GTEx portal for 44 tissues, and then mapped to SNPs listed in Table 1. We used a false discovery rate (FDR) of ≤0.05.

Colocalization analyses using eQTLs

We integrated our GWAS results with cis-eQTL data using a Bayesian method (coloc)10. This method evaluates whether the GWAS and eQTL associations best fit a model in which the associations are due to a single shared variant (summarized by the posterior probability). We used gene expression datasets from multiple tissues from patients with CHD of the STARNET study, including blood, MAM, AOR, subcutaneous fat (SF), visceral fat (VAF), skeletal muscle (SKLM), and liver (LIV) obtained from 600 patients during open heart surgery11. Pairwise colocalization was tested between these expression disease tissue datasets and GWAS results from our cIMT/carotid plaque GWAS meta-analysis. We used GWAS and eQTL summary statistics of SNPs within a 200-kb window around each gene covered by the eQTL datasets. A posterior probability of colocalization ≥0.75 was considered a strong evidence for a causal gene. Next, we reported the gene(s) in the STARNET datasets that had the strongest evidence of sharing the same variant with cIMT or carotid plaque genome-wide. In an alternative analysis, we also tested loci with an SNP that reached a threshold of significant or suggestive genome-wide significance for cIMT or carotid plaque (reported in Table 1, Supplementary Table 5). For each region 200kb around the SNP with the lowest association p-value, we report the gene with the highest probability of being responsible for the GWAS signal (Supplementary Table 6).

Pairwise colocalization for these genes was also tested for publicly available GWAS for CHD case-controls (CARDIoGRAMPlusC4D) and stroke case-controls (MEGASTROKE consortium). The MEGASTROKE dataset uses genotypes imputed to the 1000G phase I haplotype panel. The European ancestry sample used to generate these results consisted of 40,585 stroke cases and 406,111 controls from 15 cohorts and two consortia: the METASTROKE and CHARGE consortia29. The phenotypes used in this analysis were any stroke (n = 39,067 cases, total n = 442,142), ischemic stroke (IS, n = 32,686 cases, total n = 423,266), and etiologic stroke subtypes:cardioembolic stroke (CE, n = 6,820 cases, total n = 314,368), large vessel disease (n = 4,113, total n = 202,263), and small vessel disease (SVD, n = 4,975, total n = 242,250). To explore multi-trait colocalizations, we used moloc14 with prior probabilities of 10−4 for GWAS/GWAS/eQTL, 10−6 for GWAS+eQTL/GWAS or GWAS+GWAS/eQTL, and 10−7 for colocalization of all three association signals.

Functional annotation and epigenetic enrichment analyses

From the Epigenome Roadmap Project30,31, we obtained regulatory information using broad classes of chromatin states (n = 127 tissues) capturing promoter-associated, transcription-associated, active intergenic, and large-scale repressed and repeat-associated states. From ENCODE32, we obtained chromatin states, uniformly processed transcription factor (TF) Chip assays and DNaseI Hypersensitivity sites (DHS) for nine cells lines. From FANTOM533, we used information from expression of enhancers in each tissue (n = 112), and enhancers that are positively differentially expressed against any other tissue (n = 110).

We used fGWAS13 to identify genomic annotations that are enriched within the cIMT results and to select the variants with support for a functional role based on the most informative annotations. We only considered cIMT for these analyses because of the small number of identified loci for carotid plaque. We first estimated the enrichment parameters for each annotation individually and identified the set of annotations with significant marginal associations. We then applied 10-fold cross-validation likelihood and forward selection to identify the set of annotations that significantly improve the model fit, and reverse selection of each annotation included in the model, as suggested in the fGWAS workflow. We reported the model with the highest cross-validation likelihood and SNPs that have regional posterior probability of association (PPA) >0.9 and directly overlap the genomic annotations considered.

Overall genetic correlation analysis

Genetic correlation between cIMT/carotid plaque, CHD, and stroke traits were calculated using LD score regression approach LD-score, which uses GWAS summary statistics and is not affected by sample overlap. This method relies on the fact that the χ2 association statistic for a given SNP includes the effects of all SNPs that are in LD with it and it calculates genetic correlation by partitioning the SNP heritabilities15. Genetic correlations between stroke traits (IS, CE, large vessel disease, and SVD) and cIMT and carotid plaque were calculated using software available at http://github.com/bulik/ldsc with GWAS summary statistics for our cIMT/carotid plaque GWAS, CARDIOGRAMPlusC4D data, and stroke GWAS. We used the LD-scores15, which are based on the 1000 Genomes European population and estimated within 1-cM windows. Based on ten tests performed (two subclinical traits and five outcomes), we set the significance threshold to p = 0.005.

PATHWAY ANALYSES. Methods for GO Slim: The Ensembl identifiers of all protein-coding genes identified as in LD with the 12 variants for cIMT and 15 variants for carotid plaque (including variants from main and suggestive signals, Table 1 and Supplementary Table 5), and five genes for which there is strong evidence of colocalization (Table 3), were mapped to UniProt accession numbers, using the UniProt ID mapping service (http://www.uniprot.org/uploadlists/). A GO Slim analysis was performed on this list using QuickGO (www.ebi.ac.uk/QuickGO) and the Generic GO Slim. The GO terms used in the final slim analysis were further refined by adding/removing GO terms to provide more detailed information about the processes covered.

Methods for GO term enrichment analysis: The VLAD gene list analysis and visualization tool (http://proto.informatics.jax.org/prototypes/vlad/) was used to perform a GO term enrichment analysis on the same UniProt accessions as listed for the GO Slim. The background annotation set was obtained from the goa_human.gaf file (dated 21 November 2017, downloaded from ftp://ftp.ebi.ac.uk/pub/databases/GO/goa/HUMAN/) and the ontology data was obtained from the go-basic.obo file provided in the VLAD tool (analysis run 28 November 2017).

The LD block around top SNPs associated with cIMT and carotid plaque was constructed using LD information from the 1000 Genomes panel, as previously outlined in Finan et al.16. Briefly, the boundaries of the LD region were defined as the positions of the variants furthest upstream and downstream of a GWAS SNP with an r2 value of ≥0.5 and within a 1-Mbp flank on either side of the GWAS variant. Associated variants that were not present in the 1000 Genomes panel that were not in LD with any other variants were given a nominal flank of 2.5 kbp on either side of the association. Gene annotations using Ensembl version 79 were then overlapped to the LD region.

Druggable genes

We examined the druggability status for the nearest coding genes identified in our GWAS analysis on cIMT and carotid plaque, including significant (novel and replicated) and suggestive ones, as well as genes identified through colocalization analysis. The druggable gene set was calculated using the previously described criteria: novel targets of first-in-class drugs licensed since 2005; the targets of drugs currently in late phase clinical development; pre-clinical phase small molecules with protein binding measurements reported in the ChEMBL database; and genes encoding secreted or plasma membrane proteins that are targets of monoclonal antibodies and other bio-therapeutics16. We defined three tiers of druggable gene sets based on their drug development. In Tier 1, 1427 genes were targets of approved small molecules and biotherapeutic drugs and clinical-phase drug candidates. Tier 2 comprised 682 genes encoding targets with known bioactive drug-like small molecule binding partners and those with significant sequence similarity to approved drug targets. Tier 3 contained 2370 genes encoding secreted or extracellular proteins, proteins with more distant similarity to approved drug targets, and druggable genes not included in Tier 1 or 2 such as GPCRs, nuclear hormone receptors, ion channels, kinases, and phosphodiesterases.

URLs

For GTEx, see http://gtexportal.org/. For Coloc, see https://cran.r-project.org/web/packages/coloc/coloc.pdf. For, Moloc, see https://github.com/clagiamba/moloc/blob/master/man/moloc-package.Rd. For CARDIoGRAMPlusC4D, see www.cardiogramplusc4d.org/. For LD scores, www.broadinstitute.org/~bulik/eur_ldscores/. For UniProt ID, www.uniprot.org/uploadlists/. For QuickGO, www.ebi.ac.uk/QuickGO. For VLAD tool, see http://proto.informatics.jax.org/prototypes/vlad/.

Electronic supplementary material

Supplementary Data 1 (192.2KB, pdf)
Peer Review File (38.3KB, xlsx)
Supplementary Data 2 (103.4KB, xlsx)
41467_2018_7340_MOESM5_ESM.pdf (93.8KB, pdf)

Description of Additional Supplementary Files

Acknowledgements

The work was supported by the following grants: National Institute of Health grants: R21HL123677, R21-HL140385, DK104806-01A1, R01-MD012765-01A1 (NF), National Institutes of Health awards R01HG009120, R01HG006399, U01CA194393, T32NS048004 (CG), the American Heart Association Grant #17POST33350042 (PV), the British Heart Foundation (RG/13/5/30112) and the National Institute for Health Research University College London Hospitals Biomedical Research Centre (RCL and RPH), the British Heart Foundation FS/14/55/30806 (JCH), the German Federal Ministry of Education and Research (BMBF) in the context of the e:Med program (e:AtheroSysMed), the DFG as part of the CRC 1123 (B3), and the FP7/2007-2103 European Union project CVgenes@target (grant agreement number Health-F2-2013-601456). We thank Li-Ming Gan for assistance with the STARNET study and Jon White for assistance with UCLEB analyses. Additional acknowledgements are included in Supplementary Note 2.

Author contributions

N.F., C.G., J.C.B., M.K., C.D., M.S., S.K.M., U.S., W.P., A.B.Z., A.H., A.T., A.G.U., A.B.N., B.W.P., C.D.L., E.T., F.R., H.V., I.J.D., L.J.L., M.D., O.R., O.H.F., R.S., R.M., T.B.H., T.L., U.F., W.P., A.D., A.S., A.H., C.M.D., D.A.L., D.O.M.-K., D.W.B., H.S., J.F.W., J.G.W., J.I.R., J.M.W., M.L., M.K.E., S.E.H., U.V., V.G., A.D.H., J.P.C., and C.J.O. contributed to study concept and design. C.D., M.S., S.K.M., U.S., W.P., A.I., A.C., A.B.Z., A.T., A.G.U., A.W., A.J.S., A.B., A.R., B.H., B.W.P., C.F., D.B., D.H.O., D.T., D.K., E.T., E.M., E.G., E.B., E.E.S., E.I., F.R., F.B., G.H., H.C., H.V., H.S.M., I.J.D., J.W.J., J.G., J.P., J.T., J.E., K.D.T., L.K., L.L., L.J.L., L.H., M.D., M.S., M.K., M.K., M.A.N., O.M., O.R., P.H.W., P.G., P.A., R.R., R.B., R.H., R.S., R.M., R.W.M.,. S.G.W., S.M.L., S.T., S.K., S.R.H., S.R., T.B.H., T.L., T.G., T.S., U.F., V.P., W.R., W.P., X.Z., A.S., A.H., B.M.P., C.M.D., D.A.L., D.O.M.-K., D.W.B., H.S., J.F.W., J.G.W., J.I.R., J.C.H., J.M.W., J.D., J.H., M.K.E., S.E.H., U.V., V.G., A.D.H., J.L.M.B., J.P.C., and C.J.O. contributed to acquisition of genotyping or phenotypic data. N.F., C.G., C.F., J.C.B., R.P.H., R.C.L., S.M.T., T.W.W., M.G., M.K., C.D., A.V.S., E.H., E.M.L., I.M.N., L.L., M.S., M.S., N.P., O.F., P.K.J., R.N., R.E.M., S.-J.H., S.K.M., U.S., A.I., A.T., K.R., A.J.C., B.S., C.D.L., C.W., F.V., G.C., H.S., J.P., J.L., K.G., L.M.R., M.T., M.A.N., O.M., P.R., P.A., Q.W., R.J.S., S.H., S.S., S.M.L., T.S., X.Z., X.Z., X.G., Y.S., and L.-.P.L. contributed to statistical analysis and interpretation of the data. N.F., C.G., P.S.V., J.C.B., M.K., S.K.M., A.D.H., and J.P.C. contributed to drafting of the manuscript. All authors contributed to the critical revision of the manuscript.

Data availability

All relevant summary statistics data that support the findings of this study have been deposit in the database of Genotypes and Phenotypes (dbGaP) under the CHARGE acquisition number (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000930.v6.p1; accession phs000930.v6.p1). GWAS data for most US studies are already available in dbGAP.

Competing interests

C.F. received a fee for speaking at a course by Springer Healthcare/Malesci. E.I. is a scientific advisor for Precision Wellness, Cellink and Olink Proteomics for work unrelated to the present project. M.A.N.’s participation in this project was supported by a consulting contract between Data Tecnica International and the National Institute on Aging, NIH, Bethesda, MD, USA. M.A.N. also consults for Illumina Inc., the Michael J. Fox Foundation, and University of California Healthcare. B.M.P. serves on the DSMB of a clinical trial funded by Zoll LifeCor and on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. D.A.L. has received support from Roche Diagnostics and Medtronic for biomarker research unrelated to this paper. J.P.C. has received funding from GSK regarding methodological work around electronic health records, and -omics for drug discovery. All remaining authors declare no competing interests.

Footnotes

A full list of consortium members can be found at the end of the article.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Nora Franceschini, Claudia Giambartolomei.

Contributor Information

Johan L. M. Björkegren, Email: johan.bjorkegren@mssm.edu

Christopher J. O’Donnell, Email: Christopher.ODonnell@va.gov

MEGASTROKE Consortium:

Yukinori Okada, Aniket Mishra, Loes Rutten-Jacobs, Anne-Katrin Giese, Sander W. van der Laan, Solveig Gretarsdottir, Christopher D. Anderson, Michael Chong, Hieab H. H. Adams, Tetsuro Ago, Peter Almgren, Philippe Amouyel, Hakan Ay, Traci M. Bartz, Oscar R. Benavente, Steve Bevan, Giorgio B. Boncoraglio, Robert D. Brown, Jr., Adam S. Butterworth, Caty Carrera, Cara L. Carty, Daniel I. Chasman, Wei-Min Chen, John W. Cole, Ioana Cotlarciuc, Carlos Cruchaga, John Danesh, Paul I. W. de Bakker, Anita L. DeStefano, Marcel den Hoed, Qing Duan, Stefan T. Engelter, Guido J. Falcone, Rebecca F. Gottesman, Raji P. Grewal, Stefan Gustafsson, Jeffrey Haessler, Tamara B. Harris, Ahamad Hassan, Aki S. Havulinna, Elizabeth G. Holliday, George Howard, Fang-Chi Hsu, Hyacinth I. Hyacinth, M. Arfan Ikram, Marguerite R. Irvin, Xueqiu Jian, Jordi Jiménez-Conde, Julie A. Johnson, J. Wouter Jukema, Masahiro Kanai, Keith L. Keene, Brett M. Kissela, Dawn O. Kleindorfer, Charles Kooperberg, Michiaki Kubo, Leslie Lange, Carl D. Langefeld, Claudia Langenberg, Jin-Moo Lee, Robin Lemmens, Didier Leys, Cathryn M. Lewis, Wei-Yu Lin, Arne G. Lindgren, Erik Lorentzen, Patrik K. Magnusson, Jane Maguire, Ani Manichaikul, Patrick F. McArdle, James F. Meschia, Thomas H. Mosley, Toshiharu Ninomiya, Martin J. O’Donnell, Sara L. Pulit, Kristiina Rannikmäe, Alexander P. Reiner, Kathryn M. Rexrode, Stephen S. Rich, Paul M. Ridker, Natalia S. Rost, Peter M. Rothwell, Tatjana Rundek, Ralph L. Sacco, Saori Sakaue, Michele M. Sale, Veikko Salomaa, Bishwa R. Sapkota, Reinhold Schmidt, Carsten O. Schmidt, Ulf Schminke, Pankaj Sharma, Agnieszka Slowik, Cathie L. M. Sudlow, Christian Tanislav, Turgut Tatlisumak, Vincent N. S. Thijs, Gudmar Thorleifsson, Unnur Thorsteinsdottir, Steffen Tiedt, Stella Trompet, Matthew Walters, Nicholas J. Wareham, Sylvia Wassertheil-Smoller, Kerri L. Wiggins, Qiong Yang, Salim Yusuf, Tomi Pastinen, Arno Ruusalepp, Eric E. Schadt, Simon Koplev, Veronica Codoni, Mete Civelek, Nick Smith, David A. Trégouët, Ingrid E. Christophersen, Carolina Roselli, Steven A. Lubitz, Patrick T. Ellinor, E. Shyong Tai, Jaspal S. Kooner, Norihiro Kato, Jiang He, Pim van der Harst, Paul Elliott, John C. Chambers, Fumihiko Takeuchi, Andrew D. Johnson, Dharambir K. Sanghera, Olle Melander, Christina Jern, Daniel Strbian, Israel Fernandez-Cadenas, W. T. Longstreth, Jr, Arndt Rolfs, Jun Hata, Daniel Woo, Jonathan Rosand, Guillaume Pare, Danish Saleheen, Kari Stefansson, Bradford B. Worrall, Steven J. Kittner, Joanna M. M. Howson, and Yoichiro Kamatani

Electronic supplementary material

Supplementary Information accompanies this paper at 10.1038/s41467-018-07340-5.

References

  • 1.Mozaffarian D, et al. Heart Disease and Stroke Statistics-2016 Update: a report from the American Heart Association. Circulation. 2016;133:e38–e360. doi: 10.1161/CIR.0000000000000350. [DOI] [PubMed] [Google Scholar]
  • 2.Frieden TR, Berwick DM. The Million Hearts initiative--preventing heart attacks and strokes. N. Engl. J. Med. 2011;365:e27. doi: 10.1056/NEJMp1110421. [DOI] [PubMed] [Google Scholar]
  • 3.O’Donnell CJ, Nabel EG. Genomics of cardiovascular disease. N. Engl. J. Med. 2011;365:2098–2109. doi: 10.1056/NEJMra1105239. [DOI] [PubMed] [Google Scholar]
  • 4.Bis JC, et al. Meta-analysis of genome-wide association studies from the CHARGE consortium identifies common variants associated with carotid intima media thickness and plaque. Nat. Genet. 2011;43:940–947. doi: 10.1038/ng.920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Natarajan, P. et al. Multiethnic Exome-Wide Association Study of Subclinical Atherosclerosis. Circ. Cardiovasc. Genet.9, 511–520 (2016). [DOI] [PMC free article] [PubMed]
  • 6.Pott J, et al. Genome-wide meta-analysis identifies novel loci of plaque burden in carotid artery. Atherosclerosis. 2017;259:32–40. doi: 10.1016/j.atherosclerosis.2017.02.018. [DOI] [PubMed] [Google Scholar]
  • 7.Gertow K, et al. Identification of the BCAR1-CFDP1-TMEM170A locus as a determinant of carotid intima-media thickness and coronary artery disease risk. Circ. Cardiovasc. Genet. 2012;5:656–665. doi: 10.1161/CIRCGENETICS.112.963660. [DOI] [PubMed] [Google Scholar]
  • 8.Psaty BM, et al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ. Cardiovasc. Genet. 2009;2:73–80. doi: 10.1161/CIRCGENETICS.108.829747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Shah S, et al. Causal relevance of blood lipid fractions in the development of carotid atherosclerosis: Mendelian randomization analysis. Circ. Cardiovasc. Genet. 2013;6:63–72. doi: 10.1161/CIRCGENETICS.112.963140. [DOI] [PubMed] [Google Scholar]
  • 10.Giambartolomei C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Franzen O, et al. Cardiometabolic risk loci share downstream cis- and trans-gene regulation across tissues and diseases. Science. 2016;353:827–830. doi: 10.1126/science.aad6970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Pfeiffer RM, Mitchell HG, Pee D. On combining data from genome-wide assocition studies to discover disease-associated SNPs. Stat. Sci. 2009;24:547–560. doi: 10.1214/09-STS286. [DOI] [Google Scholar]
  • 13.Pickrell JK. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. 2014;94:559–573. doi: 10.1016/j.ajhg.2014.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Giambartolomei, C. et al. A Bayesian framework for multiple trait colo-calization from summary association statistics. Bioinformatics34, 2538–2545 (2018). [DOI] [PMC free article] [PubMed]
  • 15.Finucane HK, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 2015;47:1228–1235. doi: 10.1038/ng.3404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Finan Chris, Gaulton Anna, Kruger Felix A., Lumbers R. Thomas, Shah Tina, Engmann Jorgen, Galver Luana, Kelley Ryan, Karlsson Anneli, Santos Rita, Overington John P., Hingorani Aroon D., Casas Juan P. The druggable genome and support for target identification and validation in drug development. Science Translational Medicine. 2017;9(383):eaag1166. doi: 10.1126/scitranslmed.aag1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zeggini E, et al. Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat. Genet. 2008;40:638–645. doi: 10.1038/ng.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Polfus LM, et al. Genome-wide association study of gene by smoking interactions in coronary artery calcification. PLoS ONE. 2013;8:e74642. doi: 10.1371/journal.pone.0074642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Remus EW, et al. The role of lysyl oxidase family members in the stabilization of abdominal aortic aneurysms. Am. J. Physiol. Heart Circ. Physiol. 2012;303:H1067–H1075. doi: 10.1152/ajpheart.00217.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Peyot ML, et al. Extracellular adenosine induces apoptosis of human arterial smooth muscle cells via A(2b)-purinoceptor. Circ. Res. 2000;86:76–85. doi: 10.1161/01.RES.86.1.76. [DOI] [PubMed] [Google Scholar]
  • 21.Erdmann J, et al. Genome-wide association study identifies a new locus for coronary artery disease on chromosome 10p11.23. Eur. Heart J. 2011;32:158–168. doi: 10.1093/eurheartj/ehq405. [DOI] [PubMed] [Google Scholar]
  • 22.Akashi M, Higashi T, Masuda S, Komori T, Furuse M. A coronary artery disease-associated gene product, JCAD/KIAA1462, is a novel component of endothelial cell-cell junctions. Biochem. Biophys. Res. Commun. 2011;413:224–229. doi: 10.1016/j.bbrc.2011.08.073. [DOI] [PubMed] [Google Scholar]
  • 23.Hara T, et al. Targeted disruption of JCAD (Junctional Protein Associated With Coronary Artery Disease)/KIAA1462, a coronary artery disease-associated gene product, inhibits angiogenic processes in vitro and in vivo. Arterioscler. Thromb. Vasc. Biol. 2017;37:1667–1673. doi: 10.1161/ATVBAHA.117.309721. [DOI] [PubMed] [Google Scholar]
  • 24.Hormozdiari F, Kostem E, Kang EY, Pasaniuc B, Eskin E. Identifying causal variants at loci with multiple signals of association. Genetics. 2014;198:497–508. doi: 10.1534/genetics.114.167908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5:e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat. Genet. 2012;44:955–959. doi: 10.1038/ng.2354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Winkler TW, et al. Quality control and conduct of genome-wide association meta-analyses. Nat. Protoc. 2014;9:1192–1212. doi: 10.1038/nprot.2014.071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Malik R, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat. Genet. 2018;50:524–537. doi: 10.1038/s41588-018-0058-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Roadmap Epigenomics Consortium, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zhu LJ. Integrative analysis of ChIP-chip and ChIP-seq dataset. Methods Mol. Biol. 2013;1067:105–124. doi: 10.1007/978-1-62703-607-8_8. [DOI] [PubMed] [Google Scholar]
  • 32.ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Andersson R, et al. An atlas of active enhancers across human cell types and tissues. Nature. 2014;507:455–461. doi: 10.1038/nature12787. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data 1 (192.2KB, pdf)
Peer Review File (38.3KB, xlsx)
Supplementary Data 2 (103.4KB, xlsx)
41467_2018_7340_MOESM5_ESM.pdf (93.8KB, pdf)

Description of Additional Supplementary Files

Data Availability Statement

All relevant summary statistics data that support the findings of this study have been deposit in the database of Genotypes and Phenotypes (dbGaP) under the CHARGE acquisition number (https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000930.v6.p1; accession phs000930.v6.p1). GWAS data for most US studies are already available in dbGAP.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES