Abstract
Cardiovascular disease is among the most significant health problems in the United States today, with its major risk factor, hypertension, disproportionately affecting African Americans (AAs). Although genome-wide association studies (GWAS) have identified dozens of common variants associated with blood pressure (BP) and hypertension in European Americans; these variants collectively explain less than 2.5% of BP variance, and most of the genetic variants remain yet to be identified. Here we report the results from rare variant analysis of systolic blood pressure (SBP) using 94,595 rare and low frequency variants (MAF < 5%) from the Illumina exome array genotyped in 2,045 HyperGEN AA. In addition to single-variant analysis, four gene-level association tests were used for analysis: burden and famSKAT tests using minor allele frequency (MAF) cutoffs of 1% and 5%. Gene-based methods often provided lower p-values than the single-variant approach. Some consistency was observed across these four gene-based analysis options. While neither gene-based analysis nor single variant analysis produced genome-wide significant results, the top signals, which had supporting evidence from multiple gene-based methods, were of borderline significance. Though additional molecular validations are required, six of the 16 most promising genes are biologically plausible with physiological connections to blood pressure regulation.
Keywords: Rare variants, Exome chip, Gene-based analysis, burden tests, SKAT, SBP, family studies
Introduction
Cardiovascular diseases are among the most significant health problems in the United States today, with hypertension affecting African Americans (AAs) disproportionately. Blood pressure (BP) and hypertension have been challenging phenotypes for gene discovery due to their complex nature. Although genome-wide association studies (GWAS) have successfully identified thousands of loci affecting human complex diseases, these identified loci only explain small proportions of heritability. For example, three recent GWAS consortia [1–3] identified dozens of common variants associated with blood pressure (BP) and hypertension in European Americans (EAs). However, these variants collectively explain less than 2.5% of BP variance, and most of the genetic variants remain yet to be identified [4]. The missing heritability might be partially explained by rare and low frequency variants, with minor allele frequencies (MAF) below 5% [5].
Because the power to detect individual rare variants is limited by their low frequency, the development of novel statistical approaches for the analysis of rare variants has been an active area of research. These methods are broadly classified as burden and non-burden tests. Burden tests collapse rare variants in a genomic region (typically a gene) into a single burden variable to test for the combined effects of all rare variants in the region or gene. Such tests include the cohort allelic sum test (CAST) [6], the combined multivariate and collapsing (CMC) method [7], the group-wise weighted sum test [8], and the proportion and presence tests of Morris and Zeggini [9]. Since these burden tests implicitly assume that all variants influence the trait in the same direction and with the same magnitude of effect, their power is reduced if their effects are not homogeneous [10]. Several non-burden tests have been developed that are less sensitive to heterogeneous effects [11–13]. Two one-sided statistics proposed by Ionita-Laza et al [11] quantify enrichment in risk variants and protective variants. The C-alpha test uses the distribution of genetic variation observed in cases and controls, maintaining consistent power when the target set contains both risk and protective variants [12]. The sequence kernel association test (SKAT) provides a flexible and computationally efficient approach using a score-based variance component test and applicable to both continuous and discrete traits [13]. Using the framework of linear mixed effect models, SKAT has been recently extended to handle family data, which is called the family-based SKAT (famSKAT) [14].
Here we report the results from rare variant analysis using 94,595 rare and low frequency variants from the Illumina exome array (ExomeChip) genotyped in HyperGEN African Americans (AA). We performed four gene-level association analyses of systolic blood pressure (SBP): burden and famSKAT tests with minor allele frequency (MAF) cutoffs of 1% and 5% accounting for family correlation. To account for population stratification in our AA subjects, we used principal components (PCs) of the common variants (with MAF ≥ 5%) on the ExomeChip as covariates in our analysis. A varying number of PCs (no PC, first 2 PCs and first 10 PCs) were used to evaluate the impact of population stratification on the analysis of rare variants.
Method
Study Sample
We studied African Americans (AA) in the Hypertension Genetic Epidemiology Network (HyperGEN) from the Family Blood Pressure Program [15]. The study recruited AA and EA participants at five field centers to investigate the genetic causes of hypertension and related conditions. Study participants were one of three types: 1) individuals in hypertensive sibships with at least two siblings diagnosed with hypertension; 2) random subjects, who were age-matched with hypertensive siblings; or 3) unmedicated adult offspring of one or more of the hypertensive siblings. The study obtained informed consent from participants and approval from the appropriate institutional review boards.
Genotype and Phenotype Data
A total of 2,147 HyperGEN AAs were genotyped using the Illumina exome array that covers the genome-wide exome variants. Illumina’s clustering algorithm (GenTrain version 1.0), as implemented in GenomeStudio, was used for genotype calling. To check the quality and consistency of the genotyping process, 19 blind duplicate samples were included. We removed these duplicate samples and corrected sample mix-ups and pedigree errors, resulting in 2,111 subjects. We removed monomorphic markers and SNPs with missing rate > 5% or Hardy-Weinberg p-value <10−6 and removed any genotypes with a non-Mendelian pattern of inheritance. The number of variants after quality control and exclusions was 124,759. There were 30,164 common variants (with MAF ≥ 5%) and 94,595 low frequency variants (with MAF < 5%). There were also extremely rare variants including 15,017 singletons and 9,323 doubletons.
We included 2,045 subjects with genotype, phenotype and covariates information. Systolic blood pressure (SBP) was measured (on an mmHg scale) with a DINAMAP model 1846 SX/P automated oscillometric device using a standard protocol. Briefly, after a 5-minute rest period, resting BP was measured six times, with two minutes gap between measurements. The average of 3 SBP measurements was used for each subject. In all analyses, we added 15 mmHg to their SBP values for subjects taking antihypertensive or blood pressure lowering medications. Age, sex, and body mass index (BMI) were included as covariates. In addition, a varying number of PCs derived from common variants were included as covariates, as discussed below.
Statistical Analysis
We performed single-variant analysis and the four sets of genome-wide gene-level association analysis. The four sets of gene-level association analysis are burden tests with minor allele frequency (MAF) cutoffs of 1% and 5% (Burden-1 and Burden-5) and famSKAT tests with MAF cutoffs of 1% and 5% (famSKAT-1 and famSKAT-5). We used the annotation file for the Illumina exome array data, available from the CHARGE (Cohorts for Heart and Aging Research in Genetic Epidemiology) study [16]. For single-variant analysis, we excluded singletons because false positive results may arise from an influential phenotype value of one subject. For gene-level analyses, we excluded genes with only one variant as their results were identical to single-variant analysis. We included singletons for gene-level analyses. For example, when a gene included multiple singletons, all these singletons were jointly used for our gene-based analysis.
To perform the burden tests in HyperGEN AAs, we extended the burden test by Morris and Zeggini [9] to account for family structure. For the combined effect of multiple rare variants in a genomic region, Morris and Zeggini [6] developed the burden test in a linear regression framework. For family data, a linear mixed model with a random polygenic component is commonly used to account for phenotypic correlations among related individuals. Therefore, we used a linear mixed effect modeling framework and performed our analysis with GenABEL [17] and ProbABEL [18], which uses the kinship matrix based on the pedigree structure.
Population stratification is known to create spurious associations if not properly controlled. The common practice in GWA studies has been to control for stratification by using a number of top principal components (PCs) of the GWAS markers as covariates in association analysis. However, the optimal number of PCs has not been well established even for common variant analysis, as shown by Peloso and Lunetta [19]. Therefore, to account for population stratification in our AA subjects, we used a varying number of the leading PCs as covariates in our analysis: no PC, the top 2 PCs and the top 10 PCs. Although the impact of population stratification on the analysis of rare variants has shown to be more subtle and complex [20–23], several papers have demonstrated advantages of using PCs derived from common variants (with MAF ≥ 5%) [24–27]. Therefore, we computed PCs derived from 30,164 common variants in our exome chip data. The genomic inflation factor λ was calculated as the ratio of the observed median chi-square value to the expected median chi-square value.
Results
We performed single-variant analysis and the four sets of genome-wide gene-based analysis using 79,578 low frequency variants: Burden-1, Burden-5, famSKAT-1, famSKAT-5. For a varying number of PCs, genomic inflation factors λ are shown in Table 1. Single-variant analysis exhibited no genomic inflation whether PCs were included or not (with all λ ≤ 1.002). Likewise, burden tests did not exhibit much inflation, with 2 PCs leading to the lowest inflation (λ=1.025 and 1.006 for Burden-1 and Burden-5, respectively). In contrast, SKAT analyses exhibited modest inflation, with no PCs leading to the lowest inflation (λ=1.075 and 1.094 for famSKAT-1 and famSKAT-5, respectively). Based on Table 1 and also because our subjects were of African ancestry, we decided to present the results using 2 PCs. Quantile-quantile (QQ) plots are shown in Supplemental Figure 1.
Table 1.
Genomic inflation values (λ) for the five genome-wide analysis options using each of principal components (PCs) adjustments
| Number of PCs | Single-variant | Burden-1 | Burden-5 | famSKAT-1 | famSKAT-5 |
|---|---|---|---|---|---|
| 0 | 1.002 | 1.028 | 1.007 | 1.075 | 1.094 |
| 2 | 0.999 | 1.025 | 1.006 | 1.079 | 1.102 |
| 10 | 0.999 | 1.041 | 1.010 | 1.107 | 1.119 |
In addition to single-variant analysis, four gene-level association tests were used to analyze the data: burden tests with a minor allele frequency (MAF) cutoff of 1% and 5% (Burden-1 and Burden-5), famSKAT tests with a frequency cutoff of 1% and 5% (famSKAT-1 and famSKAT-5). Quantile-quantile (QQ)-plots are shown in Supplemental Figure 1.
Manhattan plots for single-variant analysis and the four sets of gene-based analysis using 2 PCs are shown in Figure 1. The genome-wide Bonferroni-corrected threshold is 6.4×10−7 (=0.05/79,578) for a single-variant analysis. Gene-based tests with MAF cutoff of 1% and 5% included 11,958 and 12,895 genes, respectively; the corresponding Bonferroni-corrected thresholds were 4.2×10−6 and 3.9×10−6, respectively. Table 2 presents the top 11 low frequency variants from our single-variant analysis (with P < 6.0×10−5) and Table 3 shows the top 16 genes from the four gene-based results (with P < 2.0 ×10−4). We found no genome-wide significant results based on the Bonferroni-corrected threshold. However, we found that gene-based methods often provided lower p-values than the single-variant approach, as shown in Table 3.
Figure 1.
Manhattan plots of the single-variant analysis and the four gene-level analysis options (Burden-1, Burden-5, famSKAT-1, and famSKAT-5). For single-variant analysis, the −log10(p) of each SNP was plotted at the chromosomal location of the SNP, and for each gene-level analysis, the −log10(p) of each gene was plotted at the chromosomal location of the first SNP of the gene. These results were based on using 2 PCs.
Table 2.
Top low frequency variants from single-variant analysis with p-value < 6×10−5 (All these variants were exonic non-synonymous.)
| Rank | SNP | Chr | BP | Gene | MAF | Beta | SE | P |
|---|---|---|---|---|---|---|---|---|
| 1 | rs138085317 | 4 | 114,290,810 | ANK2 | 0.0042 | 27.3 | 5.6 | 1.2×10−6 |
| 2 | rs143104022 | 18 | 30,791,983 | C18orf34 | 0.0012 | 43.3 | 9.9 | 1.2×10−5 |
| 3 | rs144164391 | 6 | 25,779,375 | SLC17A4 | 0.0012 | 42.8 | 9.9 | 1.4×10−5 |
| 4 | rs75307540 | 11 | 6,452,889 | HPX | 0.0093 | 15.9 | 3.8 | 2.9×10−5 |
| 5 | rs145926130 | 16 | 1,536,575 | PTX4 | 0.0017 | 35.8 | 8.6 | 3.2×10−5 |
| 6 | rs35124934 | 2 | 3,744,990 | ALLC | 0.0342 | 8.1 | 2.0 | 4.3×10−5 |
| 7 | rs6665210 | 1 | 145,532,823 | ITGA10 | 0.0193 | −10.7 | 2.6 | 4.4×10−5 |
| 8 | rs75508009 | 5 | 111,519,758 | EPB41L4A | 0.0022 | 30.2 | 7.4 | 4.5×10−5 |
| 9 | rs144714540 | 1 | 45,809,001 | TOE1 | 0.0010 | 48.8 | 12 | 4.6×10−5 |
| 10 | rs139543898 | 1 | 158,389,863 | OR10K2 | 0.0005 | 63.4 | 15.6 | 5.0×10−5 |
| 11 | rs146171474 | 4 | 122,833,191 | TRPC3 | 0.0007 | 56 | 13.9 | 5.8×10−5 |
Genome-wide Bonferroni-corrected threshold corresponds to 6.4×10−7 (=0.05/79,578). Bold-faced genes are biologically plausible, providing physiological connection to BP regulation (as presented in Discussion); except for TRPC3, which contained only one variant, these were included in Table 3.
Table 3.
16 top genes from each gene-based analysis with p-value < 2.0 ×10−4 (The smallest P value for each gene is shown in bold.)
| Group | Gene | Gene Positiona | Gene-level association p-values | Direction of single-variant associationb |
Top single-variant signal (MAF < 5%) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Burden-1 | Burden-5 | famSKAT-1 | famSKAT-5 | SNPc | MAF | P-value | ||||
| 1 | DOPEY2 | Chr 21: 37.5 Mb | 8.3×10−5 | 2.8×10−3 | 8.1×10−4 | 2.6×10−3 | +++-+++-++-(−)-++++-+++ | rs142091518 | 0.0002 | 6.1×10−4 |
| NSUN5 | Chr 7: 72.7 Mb | 8.6×10−5 | 8.6×10−5 | 3.6×10−3 | 3.6×10−3 | ++++ | rs77133388 | 0.0002 | 6.5×10−3 | |
| FGF4 | Chr 11: 69.6 Mb | 1.2×10−4 | 1.2×10−4 | 5.6×10−4 | 5.6×10−4 | ++ | rs140567674 | 0.0002 | 2.9×10−3 | |
| OR52H1 | Chr 11: 5.6 Mb | 1.2×10−4 | 0.73 | 5.5×10−4 | 0.27 | +(+)(+)+(−)(−)+++(−)(+)+ | rs143228643 | 0.0037 | 4.0×10−4 | |
| ANKRD50 | Chr 4: 125.6 Mb | 1.5×10−4 | 0.77 | 1.6×10−4 | 0.55 | (−)+(+)(−) | rs140645873 | 0.0002 | 1.5×10−4 | |
| ADC | Chr 1: 33.6 Mb | 1.5×10−4 | 0.17 | 6.2×10−3 | 8.6×10−3 | (−)++++-++ | rs150169064 | 0.0044 | 0.02 | |
| RAB3GAP1 | Chr 2: 135.8 Mb | 1.7×10−4 | 0.13 | 5.1×10−3 | 0.03 | -(+)+++++(+)(−)(−)- | rs150478342 | 0.0017 | 1.7×10−4 | |
| 2 | FANK1 | Chr 10: 127.7 Mb | 0.05 | 1.7×10−4 | 0.51 | 9.3×10−3 | ---(−)--(−)-- | rs141545587* | 0.0005 | 0.01 |
| ANKRD6 | Chr 6: 90.3 Mb | 0.26 | 1.7×10−4 | 0.51 | 4.3×10−3 | (+)(+)++ | rs61736690* | 0.0472 | 7.3×10−3 | |
| 3 | HPX | Chr 11: 6.5 Mb | 3.2×10−4 | 0.41 | 7.1×10−5 | 4.0×10−3 | +++++(−)(−)(−) | rs75307540 | 0.0093 | 2.9×10−5 |
| MARCH8 | Chr 10: 46.0 Mb | 0.01 | 0.01 | 1.2×10−4 | 1.2×10−4 | --+ | rs140989641 | 0.0022 | 1.0×10−4 | |
| ARHGAP1 | Chr 11: 46.7 Mb | 7.5×10−3 | 7.5×10−3 | 1.4×10−4 | 1.4×10−4 | -+- | rs11822837* | 0.0044 | 1.5×10−4 | |
| CD3E | Chr 11: 118.2 Mb | 4.6×10−4 | 4.6×10−4 | 1.5×10−4 | 1.5×10−4 | +++ | rs140639753 | 0.0015 | 1.1×10−4 | |
| 4 | MOCS1 | Chr 6: 39.9 Mb | 0.11 | 2.3×10−5 | 0.01 | 1.4×10−5 | ++-(+)+- | rs7762875 | 0.0208 | 6.2×10−5 |
| ALLC | Chr 2: 3.7 Mb | 0.95 | 1.5×10−4 | 0.03 | 1.7×10−5 | -+--+(+)(+)-(+) | rs35124934 | 0.0342 | 4.3×10−5 | |
| ITGA10 | Chr 1: 145.5 Mb | 0.15 | 8.8×10−3 | 1.3×10−3 | 8.6×10−5 | -+(−)+-+-(+)--++(−)++-(−)- | rs6665210 | 0.0193 | 4.4×10−5 | |
Four gene-level association tests were used to analyze the data: burden tests with minor allele frequency (MAF) cutoffs of 1% and 5% (Burden-1 and Burden-5), famSKAT tests with MAF cutoffs of 1% and 5% (famSKAT-1 and famSKAT-5). Bold-faced genes are biologically plausible, providing physiological connection to blood pressure regulation (as presented in Discussion).
Gene position is defined according to hg19, GRCh37 (Genome Reference Consortium Human Reference 37).
Direction of single-variant association statistics for variants with MAF< 5%. Variants in parentheses have MAF >1%. Bonferroni-corrected threshold corresponds to 4.2×10−6 and 3.9×10−6 for MAF <1% and MAF <5%, respectively.
All these most strongly associated variants were exonic non-synonymous.
The three variants with * were also splicing.
We found modest consistency across these four gene-based analysis options. The results for the burden test and famSKAT test within each MAF threshold were moderately correlated (Spearman correlation = 0.55). The MAF thresholds (1% and 5%) made more difference for the burden tests than famSKAT tests (Spearman correlation = 0.50 and 0.72, respectively). The scatterplots of pairwise −log10(p) values across four gene-based analysis options are presented in Supplemental Figure 2.
We found that all 16 top genes in Table 3 indeed had supporting evidence from multiple gene-based methods. To further evaluate the relative performance of each gene-based method, we classified these genes in Table 3 into 4 groups. Note that the low frequency variants (with 1% ≤ MAF < 5%) are presented in parenthesis in the “direction of single-variant association” column. For genes in group 1, the Burden-1 approach provided the strongest evidence; at these genes, rare variants had mostly unidirectional effects. We found that famSKAT also provided corroborating evidence for some of those genes (notably NSUN5 and FGF4), which is assuring. When the low frequency variants had the opposite effects as compared to rare variants (as OR52H1), the Burden-5 approach provided much weaker evidence than Burden-1. In contrast, when the low frequency variants had the same effects as rare variants, the Burden-5 approach provided stronger evidence than Burden-1, as shown in group 2. We found that famSKAT indeed performed better on genes with multidirectional variants, as shown in groups 3 and 4 (notably on HPX).
Some genes in Table 3 had the most strongly associated single variant with the same magnitude of p-value as the gene-level results. To assess if the signal from gene-level analyses is primarily driven by the one single variant or aggregated evidence from multiple low-frequency variants, we performed conditional analysis by excluding this most strongly associated variant and using it as an additional covariate. Supplemental Table 1 provides p-values obtained by our conditional analysis. The signal for DOPEY2 appears to reflect aggregate evidence from multiple low-frequency variants. Five other genes including NSUN5 also have a hint of aggregated effects, whereas the remaining genes have no significant conditional p-values, indicating their signals mostly driven by one variant.
To improve the selection of variants to be included in the gene-level tests, we also considered functional annotation information. It turned out that most (90,875 out of 94,595) low frequency variants were non-synonymous, stop-gain/loss, splice, or missense variants. Therefore, when we repeated all four gene-level analyses with these functional variants only, the results were very similar (Spearman correlation = 0.97) as shown in the first row of Supplemental Figure 3. Roughly half (N=41,393) of the low frequency variants were stop-gain/loss, splice or missense and predicted to be damaging by 2 of the 4 algorithms in dbNSFP (Mutation Taster, Polyphen HDIV, SIFT or LRT). We further performed four gene-level analyses with these damaging variants and found a subset of genes with lower p-values (shown in the second row of Supplemental Figure 3). The most notable one was MOCS1, where Burden-5 result was enhanced (P = 4.5 ×10−6). Our results illustrate that excluding neutral variants may help improve the power of gene-level tests.
Although our focus was on low frequency variants, we also performed single-variant analysis on common variants. Among 30,164 common variants, our strongest signal was observed at rs11711441 on chromosome 3 (with P = 1.7 ×10−5). Out of 60 known BP variants identified by the 9 articles [3, 28–35], only 13 were available in our ExomeChip data. Among these 13 known variants, the smallest p-value from our analysis was observed at rs3184504 in SB2B3 (with P = 0.0029); the effect was the same direction as [3], but with lower MAF (0.057). Our results may not be a surprise given a limited number of common variants in the Exome Chip data.
Discussion
We performed analysis of systolic blood pressure (SBP) using 79,578 rare and low frequency variants from the Illumina exome array genotyped in 2,045 HyperGEN AAs. While neither gene-based analysis nor single variant analysis produced genome-wide significant results, the most promising genes from each of the 4 methods had supporting evidence from some of the other gene-based methods. Of the genes for which the burden approach provided strongest evidence, three genes are of particular interest as potential BP candidate genes: NSUN5 (NOP2/Sun domain family), ADC (arginine decarboxylase), and RAB3GAP1 (RAB3 GTPase activating protein subunit 1). All 3 genes had corroborating evidence from SKAT, as well. As the burden approach provided more significant evidence, the SNP effects in these genes are likely homogeneous. NSUN5 is shown to be deleted in Williams-Beuren syndrome [36], a multi-system developmental disorder whose symptoms include hypertension [37]. ADC contains a conformational change-inducing missense mutation (substituting a polar amino acid, serine, for a non-polar one, alanine) that has been previously associated with SBP [38]. RAB3GAP1 has been previously identified to be associated with sudden cardiac death in the MetaboChip study [39].
Of the genes with strong evidence from famSKAT analysis, three have potential connections to BP: HPX (Hemopexin), ARHGAP1 (Rho GTPase activating protein 1), and ALLC (allantoicase). HPX is a plasma glycoprotein that transports heme to the liver, and it is a major BP candidate gene owing to its role in regulating expression of the angiotensin II type I receptor AGTR1 and modulating sensitivity to angiotensin [40] as well as its identification as a risk factor for late renal graft failure [41]. ARHGAP1 is a guanosine-triphosphate metabolizing enzyme that plays a role in the regulation of tubule formation, an important early step in angiogenesis. ALLC is involved in the uric acid degradation pathway; uric acid levels have been associated with BP [42].
Single-variant analysis provided evidence for two additional potential candidate genes: ANK2 (ankyrin 2) and TRPC3 (transient receptor potential cation channel). ANK2 is a member of the ankyrin family of proteins and is required for the targeting and stability of the Na+/Ca2+ exchanger in cardiomyocytes. Mutations in this gene lead to long-QT syndrome and cardiac arrhythmia [43], and ANK2 was located in a 1-lod support interval linked to resting heart rate [44]. TRPC3 is a renal Ca2+ channel expressed in principal cells of the collecting duct. Expression levels of TRPC3 have been associated with systolic hypertension in humans [45] and are higher in spontaneously hypertensive rats than in normotensive Wistar Kyoto rats [46].
To evaluate the impact of population stratification on the analysis of rare variants, we used a varying number of leading principal components (PCs) as covariates in our analysis. We found that type I error rates were well controlled for the single-variant and burden approaches, whereas the famSKAT approach exhibited modest inflation, as also noted by the famSKAT developers [14]. Furthermore, a recent article by Zawistowski et al [23] also reports higher levels of inflation from the other non-burden tests including the C-alpha test [12] and the original SKAT [13]. These non-burden tests have greater power to detect association in genes containing rare variants with heterogeneous effects but may be more vulnerable to population stratification. Given the lack of significant findings due to low power to detect rare variants, consistent findings from multiple gene-based methods including both burden and non-burden approaches with different thresholds is reassuring. These findings are compelling candidates for external validation.
Supplementary Material
Acknowledgements
We thank anonymous reviewers for their constructive and insightful comments, which substantially improved the manuscript. The authors thank Maria Sosa for lab assistance. The work was partly supported by multiple Grants from the National Heart, Lung, and Blood Institute (NHLBI), National Institutes of Health (NIH): HL086694, HL111249, HL055673, and HL121091.
References
- 1.Newton-Cheh C, Johnson T, Gateva V, Tobin MD, Bochud M, Coin L, Najjar SS, Zhao JH, Heath SC, Eyheramendy S, et al. Genome-wide association study identifies eight loci associated with blood pressure. Nature genetics. 2009;41(6):666–676. doi: 10.1038/ng.361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Levy D, Ehret GB, Rice K, Verwoert GC, Launer LJ, Dehghan A, Glazer NL, Morrison AC, Johnson AD, Aspelund T, et al. Genome-wide association study of blood pressure and hypertension. Nature genetics. 2009;41(6):677–687. doi: 10.1038/ng.384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.International Consortium for Blood Pressure Genome-Wide Association S. Ehret GB, Munroe PB, Rice KM, Bochud M, Johnson AD, Chasman DI, Smith AV, Tobin MD, Verwoert GC, et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature. 2011;478(7367):103–109. doi: 10.1038/nature10405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, et al. Finding the missing heritability of complex diseases. Nature. 2009;461(7265):747–753. doi: 10.1038/nature08494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McCarthy MI, Hirschhorn JN. Genome-wide association studies: potential next steps on a genetic journey. Human molecular genetics. 2008;17(R2):R156–R165. doi: 10.1093/hmg/ddn289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Morgenthaler S, Thilly WG. A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST) Mutation research. 2007;615(1–2):28–56. doi: 10.1016/j.mrfmmm.2006.09.003. [DOI] [PubMed] [Google Scholar]
- 7.Li B, Leal SM. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. American journal of human genetics. 2008;83(3):311–321. doi: 10.1016/j.ajhg.2008.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Madsen BE, Browning SR. A groupwise association test for rare mutations using a weighted sum statistic. PLoS Genet. 2009;5(2):e1000384. doi: 10.1371/journal.pgen.1000384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Morris AP, Zeggini E. An evaluation of statistical approaches to rare variant analysis in genetic association studies. Genet Epidemiol. 2010;34(2):188–193. doi: 10.1002/gepi.20450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cohen JC, Pertsemlidis A, Fahmi S, Esmail S, Vega GL, Grundy SM, Hobbs HH. Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels. Proceedings of the National Academy of Sciences of the United States of America. 2006;103(6):1810–1815. doi: 10.1073/pnas.0508483103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ionita-Laza I, Buxbaum JD, Laird NM, Lange C. A new testing strategy to identify rare variants with either risk or protective effect on disease. PLoS genetics. 2011;7(2):e1001289. doi: 10.1371/journal.pgen.1001289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Neale BM, Rivas MA, Voight BF, Altshuler D, Devlin B, Orho-Melander M, Kathiresan S, Purcell SM, Roeder K, Daly MJ. Testing for an unusual distribution of rare variants. PLoS genetics. 2011;7(3):e1001322. doi: 10.1371/journal.pgen.1001322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. American journal of human genetics. 2011;89(1):82–93. doi: 10.1016/j.ajhg.2011.05.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chen H, Meigs JB, Dupuis J. Sequence kernel association test for quantitative traits in family samples. Genetic epidemiology. 2013;37(2):196–204. doi: 10.1002/gepi.21703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Williams RR, Rao DC, Ellison RC, Arnett DK, Heiss G, Oberman A, Eckfeldt JH, Leppert MF, Province MA, Mockrin SC, et al. NHLBI family blood pressure program: methodology and recruitment in the HyperGEN network. Hypertension genetic epidemiology network. Annals of epidemiology. 2000;10(6):389–400. doi: 10.1016/s1047-2797(00)00063-6. [DOI] [PubMed] [Google Scholar]
- 16.Psaty BM, O'Donnell CJ, Gudnason V, Lunetta KL, Folsom AR, Rotter JI, Uitterlinden AG, Harris TB, Witteman JC, Boerwinkle E, et al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: Design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circulation Cardiovascular genetics. 2009;2(1):73–80. doi: 10.1161/CIRCGENETICS.108.829747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Aulchenko YS, Ripke S, Isaacs A, van Duijn CM. GenABEL: an R library for genome-wide association analysis. Bioinformatics. 2007;23(10):1294–1296. doi: 10.1093/bioinformatics/btm108. [DOI] [PubMed] [Google Scholar]
- 18.Aulchenko YS, Struchalin MV, van Duijn CM. ProbABEL package for genome-wide association analysis of imputed data. BMC bioinformatics. 2010;11:134. doi: 10.1186/1471-2105-11-134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Peloso GM, Lunetta KL. Choice of population structure informative principal components for adjustment in a case-control study. BMC genetics. 2011;12:64. doi: 10.1186/1471-2156-12-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mathieson I, McVean G. Differential confounding of rare and common variants in spatially structured populations. Nature genetics. 2012;44(3):243–246. doi: 10.1038/ng.1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Jiang Y, Epstein MP, Conneely KN. Assessing the impact of population stratification on association studies of rare variation. Human heredity. 2013;76(1):28–35. doi: 10.1159/000353270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.O'Connor TD, Kiezun A, Bamshad M, Rich SS, Smith JD, Turner E, Project NES, Esp Population Genetics SAWG. Leal SM, Akey JM. Fine-scale patterns of population stratification confound rare variant association tests. PloS one. 2013;8(7):e65834. doi: 10.1371/journal.pone.0065834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zawistowski M, Reppell M, Wegmann D, St Jean PL, Ehm MG, Nelson MR, Novembre J, Zollner S. Analysis of rare variant population structure in Europeans explains differential stratification of gene-based tests. European journal of human genetics : EJHG. 2014 doi: 10.1038/ejhg.2013.297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. American journal of human genetics. 2007;81(3):559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.He H, Zhang X, Ding L, Baye TM, Kurowski BG, Martin LJ. Effect of population stratification analysis on false-positive rates for common and rare variants. BMC proceedings. 2011;5(Suppl 9):S116. doi: 10.1186/1753-6561-5-S9-S116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Babron MC, de Tayrac M, Rutledge DN, Zeggini E, Genin E. Rare and low frequency variant stratification in the UK population: description and impact on association tests. PloS one. 2012;7(10):e46519. doi: 10.1371/journal.pone.0046519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Price AL, Zaitlen NA, Reich D, Patterson N. New approaches to population stratification in genome-wide association studies. Nature reviews Genetics. 2010;11(7):459–463. doi: 10.1038/nrg2813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Johnson T, Gaunt TR, Newhouse SJ, Padmanabhan S, Tomaszewski M, Kumari M, Morris RW, Tzoulaki I, O'Brien ET, Poulter NR, et al. Blood pressure loci identified with a gene-centric array. American journal of human genetics. 2011;89(6):688–700. doi: 10.1016/j.ajhg.2011.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Padmanabhan S, Melander O, Johnson T, Di Blasio AM, Lee WK, Gentilini D, Hastie CE, Menni C, Monti MC, Delles C, et al. Genome-wide association study of blood pressure extremes identifies variant near UMOD associated with hypertension. PLoS genetics. 2010;6(10):e1001177. doi: 10.1371/journal.pgen.1001177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kato N, Takeuchi F, Tabara Y, Kelly TN, Go MJ, Sim X, Tay WT, Chen CH, Zhang Y, Yamamoto K, et al. Meta-analysis of genome-wide association studies identifies common variants associated with blood pressure variation in east Asians. Nature genetics. 2011;43(6):531–538. doi: 10.1038/ng.834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wain LV, Verwoert GC, O'Reilly PF, Shi G, Johnson T, Johnson AD, Bochud M, Rice KM, Henneman P, Smith AV, et al. Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure. Nature genetics. 2011;43(10):1005–1011. doi: 10.1038/ng.922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Franceschini N, Fox E, Zhang Z, Edwards TL, Nalls MA, Sung YJ, Tayo BO, Sun YV, Gottesman O, Adeyemo A, et al. Genome-wide association analysis of blood-pressure traits in African-ancestry individuals reveals common associated genes in African and non-African populations. American journal of human genetics. 2013;93(3):545–554. doi: 10.1016/j.ajhg.2013.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ganesh SK, Chasman DI, Larson MG, Guo X, Verwoert G, Bis JC, Gu X, Smith AV, Yang ML, Zhang Y, et al. Effects of long-term averaging of quantitative blood pressure traits on the detection of genetic associations. American journal of human genetics. 2014;95(1):49–65. doi: 10.1016/j.ajhg.2014.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tragante V, Barnes MR, Ganesh SK, Lanktree MB, Guo W, Franceschini N, Smith EN, Johnson T, Holmes MV, Padmanabhan S, et al. Gene-centric meta-analysis in 87,736 individuals of European ancestry identifies multiple blood-pressure-related loci. American journal of human genetics. 2014;94(3):349–360. doi: 10.1016/j.ajhg.2013.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Simino J, Shi G, Bis JC, Chasman DI, Ehret GB, Gu X, Guo X, Hwang SJ, Sijbrands E, Smith AV, et al. Gene-age interactions in blood pressure regulation: a large-scale investigation with the CHARGE, Global BPgen, and ICBP Consortia. American journal of human genetics. 2014;95(1):24–38. doi: 10.1016/j.ajhg.2014.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Doll A, Grzeschik KH. Characterization of two novel genes, WBSCR20 and WBSCR22, deleted in Williams-Beuren syndrome. Cytogenetics and cell genetics. 2001;95(1–2):20–27. doi: 10.1159/000057012. [DOI] [PubMed] [Google Scholar]
- 37.Schubert C. The genomic basis of the Williams – Beuren syndrome. Cell Mol Life Sci. 2009;66(7):1178–1197. doi: 10.1007/s00018-008-8401-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hong KW, Jin HS, Lim JE, Cho YS, Go MJ, Jung J, Lee JE, Choi J, Shin C, Hwang SY, et al. Non-synonymous single-nucleotide polymorphisms associated with blood pressure and hypertension. Journal of human hypertension. 2010;24(11):763–774. doi: 10.1038/jhh.2010.9. [DOI] [PubMed] [Google Scholar]
- 39.Huertas-Vazquez A, Nelson CP, Guo X, Reinier K, Uy-Evanado A, Teodorescu C, Ayala J, Jerger K, Chugh H, Wtccc, et al. Novel loci associated with increased risk of sudden cardiac death in the context of coronary artery disease. PloS one. 2013;8(4):e59905. doi: 10.1371/journal.pone.0059905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Krikken JA, Lely AT, Bakker SJ, Borghuis T, Faas MM, van Goor H, Navis G, Bakker WW. Hemopexin activity is associated with angiotensin II responsiveness in humans. Journal of hypertension. 2013;31(3):537–541. doi: 10.1097/HJH.0b013e32835c1727. discussion 542. [DOI] [PubMed] [Google Scholar]
- 41.Krikken JA, van Ree RM, Klooster A, Seelen MA, Borghuis T, Lems SP, Schouten JP, Bakker WW, Gans RO, Navis G, et al. High plasma hemopexin activity is an independent risk factor for late graft failure in renal transplant recipients. Transplant international : official journal of the European Society for Organ Transplantation. 2010;23(8):805–812. doi: 10.1111/j.1432-2277.2010.01055.x. [DOI] [PubMed] [Google Scholar]
- 42.Loeffler LF, Navas-Acien A, Brady TM, Miller ER, 3rd, Fadrowski JJ. Uric acid level and elevated blood pressure in US adolescents: National Health and Nutrition Examination Survey, 1999–2006. Hypertension. 2012;59(4):811–817. doi: 10.1161/HYPERTENSIONAHA.111.183244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Sedlacek K, Stark K, Cunha SR, Pfeufer A, Weber S, Berger I, Perz S, Kaab S, Wichmann HE, Mohler PJ, et al. Common genetic variants in ANK2 modulate QT interval: results from the KORA study. Circulation Cardiovascular genetics. 2008;1(2):93–99. doi: 10.1161/CIRCGENETICS.108.792192. [DOI] [PubMed] [Google Scholar]
- 44.Martin LJ, Comuzzie AG, Sonnenberg GE, Myklebust J, James R, Marks J, Blangero J, Kissebah AH. Major quantitative trait locus for resting heart rate maps to a region on chromosome 4. Hypertension. 2004;43(5):1146–1151. doi: 10.1161/01.HYP.0000122873.42047.17. [DOI] [PubMed] [Google Scholar]
- 45.Thilo F, Baumunk D, Krause H, Schrader M, Miller K, Loddenkemper C, Zakrzewicz A, Krueger K, Zidek W, Tepel M. Transient receptor potential canonical type 3 channels and blood pressure in humans. Journal of hypertension. 2009;27(6):1217–1223. doi: 10.1097/HJH.0b013e32832a5a9f. [DOI] [PubMed] [Google Scholar]
- 46.Liu D, Yang D, He H, Chen X, Cao T, Feng X, Ma L, Luo Z, Wang L, Yan Z, et al. Increased transient receptor potential canonical type 3 channels in vasculature from hypertensive rats. Hypertension. 2009;53(1):70–76. doi: 10.1161/HYPERTENSIONAHA.108.116947. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

