Skip to main content
PLOS One logoLink to PLOS One
. 2020 Apr 21;15(4):e0232048. doi: 10.1371/journal.pone.0232048

Admixture mapping identifies genetic regions associated with blood pressure phenotypes in African Americans

Zhi Liu 1, Daniel Shriner 2, Nancy F Hansen 1, Charles N Rotimi 2, James C Mullikin 1,3,*; on behalf of the NISC Comparative Sequencing Program3,
Editor: Heming Wang4
PMCID: PMC7173845  PMID: 32315356

Abstract

Hypertension occurs at a higher rate in African Americans than in European Americans. Based on the assumption that causal variants are more frequently found on DNA segments inherited from the ancestral population with higher disease risk, we employed admixture mapping to identify genetic loci with excess local African ancestry associated with blood pressure. Chromosomal regions 1q21.2–21.3, 4p15.1, 19q12 and 20p13 were significantly associated with diastolic blood pressure (β = 5.28, -7.94, -6.82 and 5.89, P-value = 6.39E-04, 2.07E-04, 6.56E-05 and 5.04E-04, respectively); 1q21.2–21.3 and 19q12 were also significantly associated with mean arterial pressure (β = 5.86 and -6.40, P-value = 5.32E-04 and 6.37E-04, respectively). We further selected SNPs that had large allele frequency differences within these regions and tested their association with blood pressure. SNP rs4815428 was significantly associated with diastolic blood pressure after Bonferroni correction (β = -2.42, P-value = 9.57E-04), and it partially explained the admixture mapping signal at 20p13. SNPs rs771205 (β = -1.99, P-value = 3.37E-03), rs3126067, rs2184953 and rs58001094 (the latter three exhibit strong linkage disequilibrium, β = -2.3, P-value = 1.4E-03) were identified to be significantly associated with mean arterial pressure, and together they fully explained the admixture signal at 1q21.2–21.3. Although no SNP at 4p15.1 showed large ancestral allele frequency differences in our dataset, we detected association at low-frequency African-specific variants that mapped predominantly to the gene PCDH7, which is most highly expressed in aorta. Our results suggest that these regions may harbor genetic variants that contribute to the different prevalence of hypertension.

Introduction

Hypertension is a strong risk factor for coronary artery disease. Hypertension is a heritable disease, with an estimated heritability of 30–50% [1]. So far, genome-wide association studies (GWAS) have been successful at identifying genetic loci associated with hypertension, but the effect sizes of individual loci have been small, and currently-identified loci only explain a small fraction of the total heritability (4–6%) [2]. Other genetic determinants for blood pressure are still yet to be discovered.

One way to investigate the unexplained heritability is to study less represented populations, such as African Americans. In the United States, hypertension is diagnosed more frequently in African Americans than European Americans, representing 40% of African Americans vs. 28% of European Americans among people from all age groups [3], and 60% of African Americans vs. 38% of European Americans among individuals aged between 45 to 85 years [4]. African Americans form an admixed population in which each individual not only displays different proportions of European and African ancestry, but their chromosomes also show ancestral mosaicisms resulting from recombination across generations. To identify causal genetic loci in admixed populations, admixture mapping serves as a powerful tool when large allele frequency differences are present in the ancestral populations. Ancestral allele frequency differences at causal loci may contribute to disease prevalence differences in different populations.

Previously, GWAS for hypertension have identified genetic loci such as those containing mutations in genes PDE3A [5], NOS3 [6] and CYP17A1-CNNM2-NT5C2 [7]. In addition to GWAS, admixture mapping has also been successful at identifying ancestral haplotypes significantly associated with hypertension. Up until now, several loci have been reported to be associated with blood pressure, such as 6q24 and 21q21 [8]; within 21q21, CXADR was likely to play a role in blood pressure in African Americans [9]; by utilizing CARe consortium data, 5p13 was identified to be associated with diastolic blood pressure (DBP), with 3 uncorrelated SNPs within this region adequately accounting for the observed association [10]. For Hispanics, 6p12.3 was found to be associated with local African ancestry for mean arterial pressure (MAP) and DBP, but no variants were identified that drove these associations [11]. Compared to GWAS, admixture mapping has a much lower testing burden and thus requires much smaller sample sizes.

In this study, we utilized the ClinSeq® cohort exome sequencing data to identify genetic regions where local African ancestry was associated with blood pressure phenotypes. We fine-mapped these regions and identified genetic variants with large ancestral allele frequency differences that drove local ancestral associations. Furthermore, we replicated these variants identified from ClinSeq® in an independent cohort of Africans.

Materials and methods

Patient samples and exome-sequencing

The ClinSeq® study was approved by the Institutional Review Boards at the National Institutes of Health and informed consent was obtained from each participant. The ClinSeq® A2 cohort, which consists of 503 unrelated participants of self-reported African descent, aged between 45–65, were recruited between 2012–2017 and seen at the Clinical Center of the National Institutes of Health. Participants were not ascertained based on any particular phenotype and were interviewed and measured for various anthropometric and clinical variables. Blood samples were collected, from which DNA was isolated and exomic regions of interest were captured using the Integrated DNA Technologies (IDT) capture kit. Whole exome sequencing was performed at the NIH Intramural Sequencing Center, Rockville, MD. Variant calling was performed using bam2mpg [12]. As a quality control (QC) step, single nucleotide polymorphisms (SNPs) were filtered for GQ ≥ 10 and GQ/DP > 0.5; only autosomal SNPs were retained; SNPs discovered to be out of Hardy-Weinberg equilibrium (P-value < 5.7E-07) based on an exact test [13], monomorphic SNPs, SNPs with call rate < 0.98, and SNPs with minor allele frequency (MAF) < 0.005 were removed. In a subsequent QC step, samples with mismatched sex, samples from related individuals and samples without phenotypes were removed. Based on principal component (PC) plots, samples outside of ± 4 standard deviations from the African American cluster by the first two eigenvalues were removed.

Phenotypes

DBP and systolic blood pressure (SBP) were measured using a Dinamap instrument and obtained from the left arm after five minutes of rest with the subject in a sitting position with legs uncrossed. Two of these measurements were obtained and an average was taken for our analysis. For patients who took anti-hypertension medication, 5 mmHg and 10 mmHg were added to DBP and SBP, respectively [14]. MAP values were calculated as 1/3 * medically adjusted SBP + 2/3 * medically adjusted DBP. To check that values were normally distributed, we used a Shapiro-Wilk test. Untransformed DBP (W = 0.99528, P-value = 0.1424), MAP (W = 0.9942, P-value = 0.05881) and Log10-transformed SBP (W = 0.996, P-value = 0.2496) were found to be normally distributed (S1 Fig).

Admixture mapping

Local ancestry was inferred using SEQMIX v0.12 [15]. For ancestry inference purposes, SNPs were pruned differently than in the association study. All markers, regardless of MAF, were retained. To ensure SNPs that were free of linkage disequilibrium (LD) with each other, SNPs were further pruned based on LD and sequencing depth: within a window size of 200, each step of 20, and r2 threshold of 0.1, only one of a pair of two markers that had a higher sequencing depth was retained. After pruning, 316,761 markers were retained for local ancestry analyses. The CEU and TSI datasets from the 1000 Genome Project [16] were used as European population references; the YRI and LWK datasets were used as African population references. Global ancestry was inferred as principal components using LASER v2.04 (Locating Ancestry from SEquence Reads) (S2 Fig) [17]. The overall percentage of European and African ancestry for each individual was also estimated by averaging local ancestry across the individual’s entire exome. Correlations were calculated between European, African ancestral percentage and principal components estimated by LASER (S3 Fig).

For admixture mapping, the following regression equation was used:

MedicallyadjustedBP=β0+β1GAA+β2(GAALAA)+β3age+β4age2+β5sex+β6BMI+ε

GAA represents global African ancestry, and LAA represents local African ancestry at a specific locus. By definition, global ancestry and local ancestry are correlated. Therefore, we used the difference between global and local ancestry instead of just local ancestry for the purpose of easier interpretation of the resulting β2. β2 is interpreted as the local response accounting for the global ancestral effect, which every locus carries, and it predicts how the additional ancestry at each particular locus would contribute to the phenotype.

To estimate the effective number of tests, we used the method described by Shriner et al. [18]. The R package “coda” [19] was used for the estimation. More specifically, an autoregressive (AR) model was fitted to the vector of local African ancestry and the spectral density at frequency zero was evaluated. The order of the fitted AR model is chosen by minimizing the Akaike Information Criterion (AIC). The effective number of tests for chromosomes for each individual was summed and then averaged across all individuals. In ClinSeq®, the total effective number of tests was equivalent to 66.84. Therefore, the genome-wide significant level α was 0.05/66.84 = 7.48E-04 (-log10 α = 3.13), and significant regions were identified as contiguous regions within 1 unit drop of the peak LOD score, which yields approximately a 95% confidence interval.

Regional association study

To identify SNPs that accounted for association signals discovered by admixture mapping, regional association studies were performed within admixture mapping significant regions. Since associated regions show a significant correlation between local ancestry and phenotype, we expect SNPs that drive the association signal to show a substantial allele frequency difference in different ancestral populations. Therefore, we only tested SNPs with allele frequency differences over 0.4 (δ > 0.4) between European and African ancestral populations from the 1000 Genome Project. PLINK [20] was used to test the association between medically adjusted BP and genotypes at these SNPs, adjusting for the first 10 principal components to account for population structure. The following regression equation was used to perform the association analysis:

MedicallyadjustedBP=β0+β1genotype+β2age+β3age2+β4sex+β5BMI+i=110βi+5PCi+ε

Conditional admixture mapping

We performed conditional admixture mapping in order to estimate the degree to which associated SNPs explained the observed admixture mapping signals. For this step, the genotypes of significant SNPs from the regional association study were included as covariates in the admixture mapping equation to test whether one or more SNPs were able to account for admixture mapping signals. P-values for the local ancestry coefficient β2 were recorded to investigate if they remained significant after adjusting for those SNPs. If β2 was not significant after inclusion of a SNP, we interpreted it as the included SNPs were able to account for local ancestry effects observed in admixture mapping.

Replication cohort description

We attempted to replicate our findings in the Africa America Diabetes Mellitus (AADM) cohort [21]. AADM is a study of type 2 diabetes in sub-Saharan Africans. The study is comprised of 5,231 participants recruited from university medical centers in Accra and Kumasi in Ghana; Enugu, Ibadan, and Lagos in Nigeria; and Eldoret in Kenya. Blood pressure was measured in the sitting position using an oscillometric device (Omron Healthcare, Kyoto, Japan). Three readings were taken with a ten-minute interval between readings. The reported DBP and SBP values were the average of the second and third readings. Weight was measured in light clothes on an electronic scale to the nearest 0.1 kg. Height was measured with a stadiometer to the nearest 0.1 cm. Body mass index was calculated as weight (kg) divided by the square of height (m2). Individuals taking antihypertensive medication were excluded, leaving 2,957 individuals for analysis. For both DBP and SBP, values were inverse-normalized after adjusting for sex, age, and age2. Genotyping was performed using the Affymetrix® Axiom® Genome-Wide PanAFR Array Set (n = 1,808) and the Illumina Infinium MEGA BeadChip, versions 1 (n = 3,046) and 2 (n = 377). For each array, quality control was performed as described previously [22]. After excluding SNPs with a minor allele frequency < 5%, a genotyping call rate < 90%, or a Hardy-Weinberg P-value < 0.001, principal components analysis was performed on 124,266 SNPs common to all three genotyping arrays. For each array, imputation was performed using the African Genome Resources reference panel available from the Sanger Imputation Service, using EAGLE2 [23] for pre-phasing and PBWT [24] for imputation. Association testing was performed using a linear mixed model in EPACTS (https://github.com/statgen/EPACTS) [25], with body mass index and the first three principal components as fixed effects and the genetic relatedness matrix as a random effect. The reason for only adjusting three principal components is that according to the Tracy-Widom test [26], only three principal components were significant. The first PC separated Kenyans from Ghanaians and Nigerians and also separated the Kenyans. The second PC separated Ghanaians from Nigerians. The third PC separated 11 Yoruba. Additional PCs did not explain significant amounts of variance (S4 Fig). To account for the fact that AADM is enriched for cases of type 2 diabetes, we included type 2 diabetes status as a covariate in the association analysis. Ancestry proportions were inferred by projecting genotype data onto a previously described reference panel [27] using ADMIXTURE version 1.3.0 [28]. This study was approved by the Institutional Review Boards at each study site, Howard University, and the National Institutes of Health and informed consent was obtained from each participant.

Results

ClinSeq® cohort characteristics

484 individuals passed QC and were included in the final admixture mapping analyses. The cohort characteristics are shown in Table 1. The correlation between the average African ancestry and DBP, log10(SBP), and MAP were 0.121, 0.064 and 0.108, respectively (P-value = 0.007, 0.159 and 0.017, respectively), indicating that all three blood pressure phenotypes increased as the percentage of African ancestry increased.

Table 1. Data description.

Characteristics ClinSeq® AADM
Age (SD) 56 (6) 50 (13)
Sex (Female %) 74% 63%
BMI (SD) 32 (10) 27 (6)
Diabetes 17% 50%
Anti-Hypertension Meds 45% 36%
Median African Ancestry (IQR) 79% (16%) 92% (13%)
Mean SBP (SD) 125 (14) 137 (24)
Mean DBP (SD) 73 (9) 82 (13)

Local ancestry and global ancestry inference

The average estimated African ancestry in our study sample was 76.7±12.8%. The average number of ancestral switch points per individual was 127. European ancestry percentage was highly correlated with PC1, and African ancestry percentage was highly correlated with PC2, both with a correlation coefficient of r < -0.97 (S3 Fig).

Admixture mapping

We next performed admixture mapping to identify regions where local African ancestry was significantly associated with blood pressure phenotypes. For DBP, four regions reached exome-wide significance: 1q21.2–21.3 (β = 5.28, P-value = 6.39E-04), 4p15.1 (β = -7.94, P-value = 2.07E-04), 19q12 (β = -6.82, P-value = 6.56E-05), and 20p13 (β = 5.89, P-value = 5.04E-04) (Fig 1A and Table 2). Among these four regions, two overlapped regions where local African ancestry was significantly associated with MAP: 1q21.2–21.3 (β = 5.86, P-value = 5.32E-04) and 19q12 (β = -6.40, P-value = 6.37E-04) (Fig 1B and Table 2). For SBP, no region reached exome-wide significance.

Fig 1. Manhattan plot indicates chromosomal regions where local African ancestry is associated with DBP or MAP.

Fig 1

A) At chromosomal regions 1q21.2–21.3 (β = 5.28, P-value = 6.39E-04), 4p15.1 (β = -7.94, P-value = 2.07E-04), 19q12 (β = -6.82, P-value = 6.56E-05), and 20p13 (β = 5.89, P-value = 5.04E-04), local African ancestry was significantly associated with DBP. B) Two of the above regions overlapped regions where local African ancestry was significantly associated with MAP: 1q21.2–21.3 (β = 5.86, P-value = 5.32E-04) and 19q12 (β = -6.40, P-value = 6.37E-04). The red bar indicates the exome-wide significance threshold of 3.13.

Table 2. Significant regions showing the association between blood pressure and African ancestry.

Chr Region Region (Mb) Top signal location Beta Std. error P-value SNPs remained after QC SNPs with δ > 0.4
DBP
1 1q21.2–21.3 148.2–154.7 1:151499346 5.28 1.54 6.39E-04 1034 16
4 4p15.1 30.7–31.1 4:31144153 -7.94 2.12 2.07E-04 17 0
19 19q12 29.7–31.0 19:30020063 -6.82 1.69 6.56E-05 84 0
20 20p13 0.6–5.3 20:2187986 5.89 1.68 5.04E-04 636 5
MAP
1 1q21.2–21.3 148.2–153.3 1:151501906 5.86 1.68 5.32E-04 782 13
19 19q12 29.7–32.1 19:30101339 -6.40 1.86 6.37E-04 101 1

Regional association test

Since SNPs with large allele frequency differences between ancestral populations carry the most information about ancestry, we identified SNPs that had over 40% allele frequency differences between European and African ancestral populations from the 1000 Genome Project within significantly associated regions. For DBP, 21 SNPs had δ > 0.4 and for MAP, 14 SNPs had δ > 0.4 (Table 2). These SNPs were tested for associations with DBP and MAP, respectively. Multiple testing thresholds were calculated using a Bonferroni correction. For DBP, the significance threshold was 0.05/21 = 0.00238 and for MAP, it was 0.05/14 = 0.00357. For DBP, SNP rs4815428, which is a 3’ UTR variant downstream of the gene TMC2, reached significance after Bonferroni correction. For MAP, four SNPs reached significance after Bonferroni correction: rs3126067, rs2184953, rs58001094, and rs771205. SNPs rs3126067, rs2184953 and rs58001094 are in high linkage disequilibrium with each other and are all coding variants in FLG. SNP rs771205 is a missense variant in MINDY1 (Table 3).

Table 3. Significant SNPs within admixture mapping peak regions.

Chr BP rsID Nearby genes Std. error Beta A1 A2 A1 frequency δ P-valuea Mutation type
EUR AFR
DBP
20 2597978 rs4815428 TMC2 0.73 -2.42 G A 0.34 0.85 0.51 9.57E-04 Non-coding transcript exon
MAP
1 150975108 rs771205 MINDY1 0.67 -1.99 T C 0.03 0.68 0.65 3.37E-03 Missense
1 152276889 rs3126067 FLG 0.74 -2.37 G A 0.15 0.79 0.64 1.39E-03 Synonymous
1 152280782 rs2184953 FLG 0.73 -2.33 G A 0.18 0.79 0.61 1.44E-03 Missense
1 152283862 rs58001094 FLG 0.73 -2.33 C G 0.18 0.78 0.60 1.44E-03 Missense

a Multiple testing thresholds were calculated as 0.05/21 = 2.38E-03 for DBP and 0.05/14 = 3.57E-03 for MAP. Bold SNPs indicate they are in high linkage disequilibrium with each other (r2 > 0.99, D’ = 1, estimated in the dataset).

Conditional admixture mapping test

After performing the regional association study, we tested whether significant SNPs can explain admixture mapping signals by including SNP genotypes as covariates in the admixture mapping model. If local African ancestry in the admixture mapping model fails to reach significance after adjusting for SNP genotypes, it indicates that the included SNPs can explain the admixture mapping signal. For DBP, only SNP rs4815428 was significant and passed the Bonferroni correction. By including rs4815428 as a covariate in the admixture mapping model, the peak P-value at chr20:2187986 increased from 5.04E-04 to 0.025. Local African ancestry failed to reach exome-wide significance. At the P-value < 0.05 level, the peak was still significant, indicating that there might be other markers weakly contributing to the admixture mapping signal (Fig 2A). For MAP, inclusion of rs3126067 increased the peak P-value from 5.32E-04 to 0.022 (Fig 2B). After adjusting for both rs3126067 and rs771205, the peak P-value increased to 0.146 (Fig 2B). This means the admixture mapping signal was completely explained by SNPs rs3126067 and rs771205 (or variants tagged by those two SNPs).

Fig 2. Conditional admixture mapping study in DBP significant region chr20:0.6–5.3 Mb and MAP significant region chr1:148.2–153.3 Mb.

Fig 2

A) For DBP in the region of chr20:0.6–5.3 Mb, after adjusted for SNP rs4815428, the significant signal partially disappeared. SNP rs4815428 partially explained the significant admixture mapping signal in this region. B) For MAP in the region of chr1:148.2–153.3 Mb, after adjusted for SNPs rs3126067 and rs771205, the significant signal completely disappeared. These two SNPs fully explained the significant admixture mapping signal in this region. The x-axis is not to scale due to missing intronic regions.

Replication analyses

We attempted to replicate the association of five SNPs with blood pressure phenotypes in the AADM dataset of sub-Saharan African samples. No SNPs passed the genome-wide significant threshold used in AADM (S1 Table). We also performed an association study in the four identified admixture mapping peak regions (Table 4). In chromosome 4, the top SNP rs145765242 maps to gene PCDH7, which is most highly expressed in aorta according to GTEx [29]. The top SNP rs145765242 has an alternative allele frequency of 0.6% in the African samples in the Genome Aggregation Database (gnomAD) [30].

Table 4. Top associated SNPs within admixture mapping peak regions for DBP in the AADM dataset.

Chr BP REF ALT rsID Nearby gene Alt. freq Stat P-value Beta Std. error R2 Alt. freqAFR Alt. freqEUR
1 150579702 TA T rs11360645 ENSA 0.198 -3.63 2.84E-04 -0.12 0.03 0.0045 0.192 0.549
4 30813550 C A rs145765242 PCDH7 0.00947 3.77 1.65E-04 0.48 0.13 0.0048 0.00794 0
19 30950681 C T rs150339768 ZNF536 0.00118 -4.01 6.27E-05 -1.45 0.36 0.0054 0.00099 0.00398
20 3200824 CT C NA ITPA 0.425 -4.10 4.31E-05 -0.10 0.03 0.0057 NA NA

Discussion

We utilized admixture mapping methods to identify genetic regions associated with blood pressure phenotypes in African Americans. We identified four regions for diastolic blood pressure and two regions for mean arterial pressure that reached exome-wide significance in our admixture mapping study. Two MAP regions overlapped with the DBP regions, consistent with the fact that MAP is defined partially as a function of DBP.

The significant admixture mapping region on chromosome 4 predominantly mapped to protocadherin 7 (PCDH7), which is most highly expressed in aorta according to GTEx [29]. Meta-analysis of nearly 35,000 individuals with African ancestry found SNP rs11931572, which tagged PCDH7, to be significantly associated with DBP [31]. The SNP rs11931572 has a low alternative allele frequency of 5% in Africans according to 1000 Genomes [16] and gnomAD [30]. This is consistent with our finding that this genetic region was significantly associated with African ancestry in both ClinSeq® and AADM datasets. However, utilizing admixture mapping drastically decreased the required sample size to identify this gene.

Within admixture mapping significant regions, five SNPs that had large ancestral allele frequency deviations (δ > 0.4) were significantly associated with BP. For DBP, SNP rs4815428 partially explained the admixture signal on chromosome 20. For MAP, SNPs rs3126067, rs2184953 and rs58001094 were in high LD with each other, and along with SNP rs771205, fully explained the admixture signal on chromosome 1.

SNPs rs3126067, rs2184953 and rs58001094 had previously been reported to be associated with ichthyosis and atopy [32] and atopic dermatitis [33]. All three SNPs are in the coding region of the gene filaggrin (FLG), an intermediate filament-associated protein that aggregates keratin intermediate filaments in mammalian epidermis. Previously, this gene had been reported to be associated with asthma [34], ichthyosis [32] and abnormal inflammatory response [35, 36]. In a study of rheumatoid arthritis patients, first-degree relatives who were negative for rheumatoid arthritis but were positive for antibodies to citrullinated filaggrin had higher SBP and DBP than those who were antibody-negative [37], suggesting that there may be a correlation, although not necessarily a causation, between blood pressure and FLG.

SNP rs771205 is in the coding region of MINDY1, which encodes a hydrolase that removes lysine-48-linked conjugated ubiquitin from proteins [38]. It has exodeubiquitinase activity with a preference for long polyubiquitin chains and may play a regulatory role at the level of protein turnover [39]. This genetic region was previously reported to be linked to late-onset Alzheimer’s disease [40, 41].

SNP rs4815428 is located in TMC2, which encodes transmembrane channel-like protein 2. TMC2 is a potential ion channel required for the mechano-transduction of cochlear hair cells [42]. A study in the Han Chinese population reported that TMC2 was among the top genes associated with BP response to the cold pressor test (CPT), which is associated with an increased risk of cardiovascular disease [43].

Previous admixture mapping studies of BP in African Americans were mostly performed on genotyping chip data with imputation, without much emphasis on whole exome sequencing (WES) data. WES data provide greater coverage of rarer variants at exonic regions than chip data, therefore enabling us to test more variants for local ancestry association signals. Although the five SNPs we identified were all common SNPs, it is also possible for rare variants to drive the admixture mapping signals as well. With 484 African American samples, we could not identify any rare variants that passed multiple testing correction (results not shown). In addition, utilizing WES data may leave out ancestral switch points occurring within non-exonic regions, causing the average number of switch points to be lower than those estimated using WGS or chip data.

Our five significantly-associated SNPs were not replicable in the AADM study, possibly due to the following reasons. 1) The African American individuals recruited in the ClinSeq® study were relatively healthy individuals, mostly without an identifiable disease trait such as diabetes or cardiovascular disease, whereas AADM was enriched for cases of type 2 diabetes. However, the inclusion of type 2 diabetes status as a covariate made negligible differences in the association statistics. 2) The African ancestry in the African Americans ClinSeq® study might not be sufficiently matched to the Africans in the AADM study. Participants were recruited for the AADM study from three countries: Nigeria, Ghana, and Kenya. Individuals from Nigeria and Ghana are expected to share ancestry with African Americans, based on historical records of the trans-Atlantic slave trade. However, the contribution to African Americans from other places such as Senegal is not as well captured by AADM. 3) The AADM participants were recruited on the African continent, while the ClinSeq® participants were recruited in North America. Differences in environmental factors between the studies may have contributed to relatively smaller effect sizes in AADM than in ClinSeq®. 4) The causal variants may be specific to European ancestry. 5) The SNPs associated in ClinSeq® are not the SNPs underlying differential risk in the regions identified by admixture mapping.

To demonstrate that these five SNPs are not just ancestry-informative markers (i.e., confounders), we stratified samples based on their local ancestry at each SNP and performed genotypic association within each stratum, which, by definition, cannot be confounded by local ancestry. We then combined all strata and performed a random effects meta-analysis to get an overall estimation of the association for each SNP. We were able to demonstrate that genotypic association at rs4815428 was not confounded by local ancestry. This SNP has the largest effect size among all five SNPs. We were not able to demonstrate that the rest of the four SNPs were not confounded (S2 Table). After stratification, each stratum has a very small sample size. We believe that increasing the study sample size may give a more definitive answer.

In summary, we performed admixture mapping analyses on the ClinSeq® African American cohort and identified four genetic regions associated with blood pressure phenotypes. We fine-mapped these regions and identified five SNPs that are the main driving forces of associations between local African ancestry and blood pressure phenotypes at two of the four regions. SNP rs3126067, rs2184953 and rs58001094, all located in FLG, and SNP rs771205, located in MINDY1, were significantly associated with MAP. SNP rs4815428, located in TMC2, was significantly associated with DBP. Finally, region 4p15.1, despite containing no SNPs exhibiting large allele frequency deviation in our dataset, maps predominantly to PCDH7, which is most highly expressed in aorta.

Supporting information

S1 Fig. Distribution of systolic blood pressure, diastolic blood pressure, and mean arterial pressure for the ClinSeq® A2 dataset.

The phenotype distributions are approximately normal.

(TIFF)

S2 Fig. Scatter plot for ClinSeq® A2 dataset in HGDP panel.

This figure shows principal components 1 and 2 of the ClinSeq® A2 dataset using the HGDP reference panel. The red cluster represents African ancestral populations; the blue cluster represents European ancestral populations; the yellow cluster represents East Asian populations, and the purple cluster represents Native American populations from the HGDP reference panel. The black cluster represents the ClinSeq® study.

(TIFF)

S3 Fig. Correlation between global European ancestry and principal component 1 and global African ancestry and principal component 2.

This figure shows the global European ancestry estimated by averaging local ancestry across the entire exome plotted against principal component 1 as estimated by LASER, and the global African ancestry plotted against principal component 2. The X axis denotes global ancestry; the Y axis denotes principal components. Each dot denotes an African American individual who has passed QC.

(TIFF)

S4 Fig. Principal component analysis plots of 5,231 participants from the AADM Study.

Principal component 1 separates Kenyans from Ghanaians and Nigerians and also separates the Kenyans. Principal component 2 separates Ghanaians from Nigerians. Principal component 3 separates 11 Yoruba.

(TIFF)

S1 Table. Replication study in the AADM dataset.

(DOCX)

S2 Table. Meta-analyses of local-ancestry stratified genotypic association tests.

(DOCX)

Acknowledgments

We sincerely thank the patients and their families for their participation and support of this project. We thank the medical staffs for taking clinical measurements and patient samples collection. We thank all the authors for their work towards completion of this project. We specially thank Dr. Leslie Biesecker for his initiation of the ClinSeq® project, reviewing and commenting on the manuscript.

NISC Comparative Sequencing Program author contributors: Beatrice B. Barnabas, MPH, MSc; Sean Black, MSc; Gerard G. Bouffard, PhD; Shelise Y. Brooks, BS; Holly Coleman, MSc; Lyudmila Dekhtyar, MSc; Xiaobin Guan, PhD; Joel Han, BS; Shi-ling Ho, BS; Richelle Legaspi, MSc; Quino L. Maduro, BS; Catherine A. Masiello, MSc; Jennifer C. McDowell, PhD; Casandra Montemayor, MSc; Morgan Park, PhD; Nancy L. Riebow, BS; Karen Schandler, MSc; Chanthra Scharer, BS; Brian Schmidt, BS; Christina Sison, BS; Sirintorn Stantripop, BS; James W. Thomas, PhD; Pamela J. Thomas, PhD; Meghana Vemulapalli, MSc; Alice C. Young, BA. The lead author for NISC Comparative Sequencing Program: James C. Mullikin, PhD. Email: mullikin@mail.nih.gov.

Data Availability

Data is available on dbGaP: https://www.ncbi.nlm.nih.gov/gap/advanced_search/?TERM=clinseq.

Funding Statement

This research was supported by the Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892.

References

  • 1.Ehret GB, Caulfield MJ. Genes for blood pressure: an opportunity to understand hypertension. Eur Heart J. 2013;34(13):951–61. Epub 2013/01/11. 10.1093/eurheartj/ehs455 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ehret GB, Bakris B, Forman J. Genetic factors in the pathogenesis of hypertension. UpToDate. 2017;Topic 3847 Version 17.0.
  • 3.Fryar CD, Ostchega Y, Hales CM, Zhang G, Kruszon-Moran D. Hypertension Prevalence and Control Among Adults: United States, 2015–2016 NCHS Data Brief No. 2892017. Available from: https://www.cdc.gov/nchs/products/databriefs/db289.htm. [PubMed]
  • 4.Kramer H, Han C, Post W, Goff D, Diez-Roux A, Cooper R, et al. Racial/ethnic differences in hypertension and hypertension treatment and control in the multi-ethnic study of atherosclerosis (MESA). Am J Hypertens. 2004;17(10):963–70. 10.1016/j.amjhyper.2004.06.001 . [DOI] [PubMed] [Google Scholar]
  • 5.Maass PG, Aydin A, Luft FC, Schachterle C, Weise A, Stricker S, et al. PDE3A mutations cause autosomal dominant hypertension with brachydactyly. Nat Genet. 2015;47(6):647–53. Epub 2015/05/12. 10.1038/ng.3302 . [DOI] [PubMed] [Google Scholar]
  • 6.Salvi E, Kutalik Z, Glorioso N, Benaglio P, Frau F, Kuznetsova T, et al. Genomewide association study using a high-density single nucleotide polymorphism array and case-control design identifies a novel essential hypertension susceptibility locus in the promoter region of endothelial NO synthase. Hypertension. 2012;59(2):248–55. Epub 2011/12/21. 10.1161/HYPERTENSIONAHA.111.181990 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Schunkert H, Konig IR, Kathiresan S, Reilly MP, Assimes TL, Holm H, et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature genetics. 2011;43(4):333–8. 10.1038/ng.784 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhu X, Luke A, Cooper RS, Quertermous T, Hanis C, Mosley T, et al. Admixture mapping for hypertension loci with genome-scan markers. Nat Genet. 2005;37(2):177–81. Epub 2005/01/25. 10.1038/ng1510 . [DOI] [PubMed] [Google Scholar]
  • 9.Shetty PB, Tang H, Tayo BO, Morrison AC, Hanis CL, Rao DC, et al. Variants in CXADR and F2RL1 are associated with blood pressure and obesity in African-Americans in regions identified through admixture mapping. J Hypertens. 2012;30(10):1970–6. Epub 2012/08/24. 10.1097/HJH.0b013e3283578c80 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zhu X, Young JH, Fox E, Keating BJ, Franceschini N, Kang S, et al. Combined admixture mapping and association analysis identifies a novel blood pressure genetic locus on 5p13: contributions from the CARe consortium. Human molecular genetics. 2011;20(11):2285–95. 10.1093/hmg/ddr113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Sofer T, Baier LJ, Browning SR, Thornton TA, Talavera GA, Wassertheil-Smoller S, et al. Admixture mapping in the Hispanic Community Health Study/Study of Latinos reveals regions of genetic associations with blood pressure traits. PLoS One. 2017;12(11):e0188400 Epub 2017/11/21. 10.1371/journal.pone.0188400 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Teer JK, Bonnycastle LL, Chines PS, Hansen NF, Aoyama N, Swift AJ, et al. Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing. Genome Res. 2010;20(10):1420–31. Epub 2010/09/03. 10.1101/gr.106716.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wigginton JE, Cutler DJ, Abecasis GR. A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet. 2005;76(5):887–93. Epub 2005/03/25. 10.1086/429864 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Levy D, Ehret GB, Rice K, Verwoert GC, Launer LJ, Dehghan A, et al. Genome-wide association study of blood pressure and hypertension. Nat Genet. 2009;41(6):677–87. Epub 2009/05/12. 10.1038/ng.384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hu Y, Willer C, Zhan X, Kang HM, Abecasis GR. Accurate local-ancestry inference in exome-sequenced admixed individuals via off-target sequence reads. Am J Hum Genet. 2013;93(5):891–9. 10.1016/j.ajhg.2013.10.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Genomes Project C, Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wang C, Zhan X, Liang L, Abecasis GR, Lin X. Improved ancestry estimation for both genotyping and sequencing data using projection procrustes analysis and genotype imputation. Am J Hum Genet. 2015;96(6):926–37. Epub 2015/06/02. 10.1016/j.ajhg.2015.04.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shriner D, Adeyemo A, Rotimi CN. Joint ancestry and association testing in admixed individuals. PLoS Comput Biol. 2011;7(12):e1002325 Epub 2012/01/05. 10.1371/journal.pcbi.1002325 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Plummer M, Best N, Cowles K, Vines K. CODA: Convergence Diagnosis and Output Analysis for MCMC. R News. 2006;6:7–11. [Google Scholar]
  • 20.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Rotimi CN, Dunston GM, Berg K, Akinsete O, Amoah A, Owusu S, et al. In search of susceptibility genes for type 2 diabetes in West Africa: the design and results of the first phase of the AADM study. Ann Epidemiol. 2001;11(1):51–8. Epub 2001/02/13. 10.1016/s1047-2797(00)00180-0 . [DOI] [PubMed] [Google Scholar]
  • 22.Adeyemo AA, Tekola-Ayele F, Doumatey AP, Bentley AR, Chen G, Huang H, et al. Evaluation of Genome Wide Association Study Associated Type 2 Diabetes Susceptibility Loci in Sub Saharan Africans. Front Genet. 2015;6:335 Epub 2015/12/05. 10.3389/fgene.2015.00335 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Loh PR, Danecek P, Palamara PF, Fuchsberger C, Y AR, H KF, et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat Genet. 2016;48(11):1443–8. Epub 2016/10/28. 10.1038/ng.3679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Durbin R. Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT). Bioinformatics. 2014;30(9):1266–72. Epub 2014/01/15. 10.1093/bioinformatics/btu014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kang HM. EPACTS: Efficient and Parallelizable Association Container Toolbox, version 3.2.6. https://github.com/statgen/EPACTS.
  • 26.Shriner D. Improved eigenanalysis of discrete subpopulations and admixture using the minimum average partial test. Hum Hered. 2012;73(2):73–83. Epub 2012/03/24. 10.1159/000335899 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Baker JL, Rotimi CN, Shriner D. Human ancestry correlates with language and reveals that race is not an objective genomic classifier. Sci Rep. 2017;7(1):1572 Epub 2017/05/10. 10.1038/s41598-017-01837-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Shringarpure SS, Bustamante CD, Lange K, Alexander DH. Efficient analysis of large datasets and sex bias with ADMIXTURE. BMC Bioinformatics. 2016;17:218 Epub 2016/05/25. 10.1186/s12859-016-1082-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Consortium GT. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45(6):580–5. Epub 2013/05/30. 10.1038/ng.2653 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. 2019:531210 10.1101/531210%JbioRxiv [DOI] [Google Scholar]
  • 31.Sung YJ, Winkler TW, de Las Fuentes L, Bentley AR, Brown MR, Kraja AT, et al. A Large-Scale Multi-ancestry Genome-wide Study Accounting for Smoking Behavior Identifies Multiple Significant Loci for Blood Pressure. Am J Hum Genet. 2018;102(3):375–400. Epub 2018/02/20. 10.1016/j.ajhg.2018.01.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kiritsi D, Valari M, Fortugno P, Hausser I, Lykopoulou L, Zambruno G, et al. Whole-exome sequencing in patients with ichthyosis reveals modifiers associated with increased IgE levels and allergic sensitizations. J Allergy Clin Immunol. 2015;135(1):280–3. Epub 2014/12/03. 10.1016/j.jaci.2014.09.042 . [DOI] [PubMed] [Google Scholar]
  • 33.Kim M, Yoo J, Kim J, Park J, Han E, Jang W, et al. Association of FLG single nucleotide variations with clinical phenotypes of atopic dermatitis. PLoS One. 2017;12(12):e0190077 Epub 2017/12/28. 10.1371/journal.pone.0190077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Rice NE, Patel BD, Lang IA, Kumari M, Frayling TM, Murray A, et al. Filaggrin gene mutations are associated with asthma and eczema in later life. J Allergy Clin Immunol. 2008;122(4):834–6. Epub 2008/09/02. 10.1016/j.jaci.2008.07.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Barnes KC. An update on the genetics of atopic dermatitis: scratching the surface in 2009. J Allergy Clin Immunol. 2010;125(1):16–29 e1-11; quiz 30–1. Epub 2010/01/30. 10.1016/j.jaci.2009.11.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Boguniewicz M, Leung DY. Atopic dermatitis: a disease of altered skin barrier and immune dysregulation. Immunol Rev. 2011;242(1):233–46. Epub 2011/06/21. 10.1111/j.1600-065X.2011.01027.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hughes-Austin JM, Gan RW, Deane KD, Weisman MH, Demoruelle MK, Sokolove J, et al. Association of Antibodies to Citrullinated Protein Antigens with Blood Pressure in First-Degree Relatives of Rheumatoid Arthritis Patients: The Studies of the Etiology of Rheumatoid Arthritis. Am J Nephrol. 2017;46(6):481–7. Epub 2017/12/14. 10.1159/000485259 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kristariyanto YA, Abdul Rehman SA, Weidlich S, Knebel A, Kulathu Y. A single MIU motif of MINDY-1 recognizes K48-linked polyubiquitin chains. EMBO Rep. 2017;18(3):392–402. Epub 2017/01/14. 10.15252/embr.201643205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Abdul Rehman SA, Kristariyanto YA, Choi SY, Nkosi PJ, Weidlich S, Labib K, et al. MINDY-1 Is a Member of an Evolutionarily Conserved and Structurally Distinct New Family of Deubiquitinating Enzymes. Mol Cell. 2016;63(1):146–55. Epub 2016/06/14. 10.1016/j.molcel.2016.05.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Liu F, Arias-Vasquez A, Sleegers K, Aulchenko YS, Kayser M, Sanchez-Juan P, et al. A genomewide screen for late-onset Alzheimer disease in a genetically isolated Dutch population. Am J Hum Genet. 2007;81(1):17–31. Epub 2007/06/15. 10.1086/518720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Butler AW, Ng MY, Hamshere ML, Forabosco P, Wroe R, Al-Chalabi A, et al. Meta-analysis of linkage studies for Alzheimer's disease—a web resource. Neurobiol Aging. 2009;30(7):1037–47. Epub 2009/04/14. 10.1016/j.neurobiolaging.2009.03.013 . [DOI] [PubMed] [Google Scholar]
  • 42.Martinac B. The ion channels to cytoskeleton connection as potential mechanism of mechanosensitivity. Biochim Biophys Acta. 2014;1838(2):682–91. Epub 2013/07/28. 10.1016/j.bbamem.2013.07.015 . [DOI] [PubMed] [Google Scholar]
  • 43.Yang X, Gu D, He J, Hixson JE, Rao DC, Lu F, et al. Genome-wide linkage and regional association study of blood pressure response to the cold pressor test in Han Chinese: the genetic epidemiology network of salt sensitivity study. Circ Cardiovasc Genet. 2014;7(4):521–8. Epub 2014/07/17. 10.1161/CIRCGENETICS.113.000332 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Heming Wang

28 Jan 2020

PONE-D-19-36060

Admixture mapping identifies genetic regions associated with blood pressure phenotypes in African Americans

PLOS ONE

Dear Dr. Mullikin,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Mar 13 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Heming Wang, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (if provided):

In general this manuscript is technically sound and well-written. One key question raised by reviewer 1 is lack of replication. Can you try to look up those variants in African-American exome array data such as FBPP, ARIC, or MESA? Since you mentioned that those variants may reflect European ancestry association, replication using European cohorts may also be interesting.

Journal Requirements:

When submitting your revision, we need you to address these additional requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. One of the noted authors is a group or consortium [NISC Comparative Sequencing Program]. In addition to naming the author group, please list the individual authors and affiliations within this group in the acknowledgments section of your manuscript. Please also indicate clearly a lead author for this group along with a contact email address.

3. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This is a well-written manuscript focusing on identifying genetic loci associated with blood pressure traits in African Americans using admixture mapping.

Major concern/suggestion:

Overall, the discovery analyses are adequate but the authors have failed to show whether the replication cohort would be a suitable sample for replication analysis. ClinSeq participants were African Americans who were not ascertained for any particular phenotype but the AADM participants were Africans with T2D. I'd like the authors to explain the rationale behind using AADM as a replication cohort instead of another African American population-based study, with similar estimated African ancestry. I would strongly suggest incorporating additional replication analysis considering how poorly AADM replicated the discovery findings. In Table 1, please update the sample characteristics to include the replication cohort(s) so that readers can easily examine the similarities and differences between the discovery and replication samples.

Minor questions/suggestions:

-Page 9: The authors stated "For both DBP and SBP, values were inverse-normalized after adjusting for sex, age, and age^2." Other studies have included BMI as a covariate in this step, what was the reason to omit BMI in this instance?

-Page 10: What was the reason for only adjusting 3 PCs for AADM?

-Page 10: Please add median & SD of SBP and DBP in Table 1.

Reviewer #2: The paper addresses an important topic related to genetic susceptibility to hypertension in African Americans, who have a high burden of hypertension compared to other ethnic/racial groups in the US. Genome-wide association studies (GWAS) have identified a small number of blood pressure loci in individuals of African ancestry, with some challenges being sample size required for association studies and the ancestral admixture in this population. In this paper, the authors used an approach that leverage the European and African admixture (which have been shown to identify novel loci not uncover in GWAS) and exome sequencing data from a cohort of African Americans to identify genetic regions where local African ancestry was associated with hypertension-related phenotypes. Using this approach, they identified four genomic regions where local African ancestry was significantly associated with DBP, with two of them overlapping with significant findings for MAP. Some interesting findings are related to subsequent association analyses within these regions that identified SNPs that partially or completely account for the admixture mapping findings in conditional analyses. The authors attempted to replicate findings in an African study of diabetes, although results were not significant. This is a well-though and carefully done study with multiple strengths in the design, approach and follow-up analyses. The authors are well-known leaders in the field. I have some minor comments to help clarify some of the aspects of the study.

For the admixture mapping, models used the difference between global and local ancestry instead of just local ancestry. This may be of interest to highlight in the paper and perhaps include the reason for this approach.

Page 11, please clarify what you mean by average number of ancestral switch points per individuals was 127.

Table 1, add the interquartiles for African ancestry %, so one can see the range of the African admixture in the discovery sample.

It may be of interest to include the interpretation of betas for African local ancestry shown in Table 2.

Table 3. For the SNPs in LD (chromosome 1), include in the footnote id the LD was estimated in the data (or 1000G Project reference panel).

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Apr 21;15(4):e0232048. doi: 10.1371/journal.pone.0232048.r002

Author response to Decision Letter 0


13 Mar 2020

Please see attached PDF "Response to Reviewers.pdf" to see figures in the response. However, here is the text of the response:

March 10th, 2020

Dear Reviewers,

We sincerely thank you for your time and efforts on reviewing this manuscript. We have addressed all comments in a revised version of the manuscript. Please see our responses to each specific comment below.

1. Additional Editor Comments (if provided):

In general this manuscript is technically sound and well-written. One key question raised by reviewer 1 is lack of replication. Can you try to look up those variants in African-American exome array data such as FBPP, ARIC, or MESA? Since you mentioned that those variants may reflect European ancestry association, replication using European cohorts may also be interesting.

ARIC and MESA are prospective studies of atherosclerosis in middle-aged adults. These studies are not population-based but rather are ascertained for individuals asymptomatic for cardio-vascular disease. Hence, participants in these studies are expected to not have hypertension at baseline. The FBPP is the parent program of GENOA, which is a cohort study with ascertainment for sibships in which at least two siblings had essential hypertension diagnosed prior to 60 years of age. Like ARIC and MESA, this study is not population-based. Thus, these studies would not satisfy Reviewer 1.

2. Reviewer #1: This is a well-written manuscript focusing on identifying genetic loci associated with blood pressure traits in African Americans using admixture mapping.

Overall, the discovery analyses are adequate but the authors have failed to show whether the replication cohort would be a suitable sample for replication analysis. ClinSeq participants were African Americans who were not ascertained for any particular phenotype but the AADM participants were Africans with T2D. I'd like the authors to explain the rationale behind using AADM as a replication cohort instead of another African American population-based study, with similar estimated African ancestry. I would strongly suggest incorporating additional replication analysis considering how poorly AADM replicated the discovery findings. In Table 1, please update the sample characteristics to include the replication cohort(s) so that readers can easily examine the similarities and differences between the discovery and replication samples.

We are unclear what the reviewer’s basis was for claiming that AADM “poorly replicated the discovery findings” and we do not agree that “considering how poorly AADM replicated the discovery findings” necessarily leads to the conclusion that non-replication is a failure and hence that AADM is an unsuitable replication study. It is plausible that AADM is performing as expected to rule out variants that do not explain the admixture signal. A genetic epidemiology study of West Africans that reflects descendants of either the parental African populations or populations genetically close to the parental African populations, which AADM does (and is currently the only such study), is suitable for replicating associations of specific variants. Even if we use another African American population with similar estimated mean percentage of African ancestry, that population could represent different parental ancestries of African and European combinations. To account for the fact that AADM is enriched for cases of type 2 diabetes, we included T2D status as a covariate in the association analysis (p. 10). This change is shown in Table 4 and Table S1.

For a rare European-specific variant (minor allele frequency less than 1%), in a typical African American population (e.g., 20% European and 80% African ancestry), the chance to find it in regions of European ancestry in African Americans is about 20% x 1%, which is close to 0%. This drastically reduces the power we have to replicate rare variants in African Americans, so we opted to focus on common or African-specific variants and used an African replication cohort.

We updated the sample characteristics and included the replication cohort in Table 1.

2.1. Page 9: The authors stated "For both DBP and SBP, values were inverse-normalized after adjusting for sex, age, and age^2." Other studies have included BMI as a covariate in this step, what was the reason to omit BMI in this instance?

We included BMI as a covariate during the association analysis using EPACTS. Not adjusting for BMI at the earlier step facilitated analyses of different models using EPACTS, i.e., including or excluding BMI as a covariate, without having to generate multiple versions of the blood pressure phenotypes.

2.2. Page 10: What was the reason for only adjusting 3 PCs for AADM?

The Tracy-Widom test (https://www.ncbi.nlm.nih.gov/pubmed/22441298) revealed three significant PCs. The first PC separates Kenyans from Ghanaians and Nigerians and also separates the Kenyans. The second PC separates Ghanaians from Nigerians. The third PC separates 11 Yoruba. Additional PCs do not explain significant amounts of variance. This explanation is added to the “Replication cohort description” section.

PCA plots of 5,231 participants from the AADM Study

2.3. Page 10: Please add median & SD of SBP and DBP in Table 1.

The mean and standard deviation of SBP and DBP are added in Table 1.

3. Reviewer #2: The paper addresses an important topic related to genetic susceptibility to hypertension in African Americans, who have a high burden of hypertension compared to other ethnic/racial groups in the US. Genome-wide association studies (GWAS) have identified a small number of blood pressure loci in individuals of African ancestry, with some challenges being sample size required for association studies and the ancestral admixture in this population. In this paper, the authors used an approach that leverage the European and African admixture (which have been shown to identify novel loci not uncover in GWAS) and exome sequencing data from a cohort of African Americans to identify genetic regions where local African ancestry was associated with hypertension-related phenotypes. Using this approach, they identified four genomic regions where local African ancestry was significantly associated with DBP, with two of them overlapping with significant findings for MAP. Some interesting findings are related to subsequent association analyses within these regions that identified SNPs that partially or completely account for the admixture mapping findings in conditional analyses. The authors attempted to replicate findings in an African study of diabetes, although results were not significant. This is a well-though and carefully done study with multiple strengths in the design, approach and follow-up analyses. The authors are well-known leaders in the field. I have some minor comments to help clarify some of the aspects of the study.

3.1. For the admixture mapping, models used the difference between global and local ancestry instead of just local ancestry. This may be of interest to highlight in the paper and perhaps include the reason for this approach.

We used the difference between global and local ancestry instead of just local ancestry for the purpose of easier interpretation of the resulting beta. Global ancestry is the average of local ancestry across all loci; consequently, global and local ancestry are correlated. Our beta is interpreted as the local response accounting for the global ancestral effect, which every locus carries, i.e., it predicts how the additional ancestry at each particular locus would contribute to the phenotype. We highlighted this in the fourth paragraph of the admixture mapping method section.

3.2. Page 11, please clarify what you mean by average number of ancestral switch points per individuals was 127.

Individual 1, Chromosome 1

In the illustrated figure above, for individual 1, chromosome 1, the ancestry switched five times for the first haplotype and two times for the second haplotype, so the total number of switch points for individual 1, chromosome 1 is 5+2=7.

We calculated the total times of ancestry switching from African to European or vice versa for each chromosome of each individual, and then added that up for all chromosomes of each individual. We added the total switch points for all chromosome of all individuals and divided by the total number of individuals to get the average number of ancestral switch points per person as 127.

The denser the markers used to detect ancestry, the more ancestry switches are likely to be detected. We used exome sequencing data, which does not target intergenic regions. Off-target reads may not be adequate to identify all ancestry switches in such regions. Therefore, our average switch points are lower than those identified using whole genome sequencing or genotyping data.

3.3. Table 1, add the interquartiles for African ancestry %, so one can see the range of the African admixture in the discovery sample.

The median and interquartile ranges of the percent African ancestry for both the discovery and replication studies have been added to Table 1.

3.4. It may be of interest to include the interpretation of betas for African local ancestry shown in Table 2.

Beta can be interpreted as how much can excess local African ancestry contribute to an individual’s blood pressure at that particular locus.

3.5. Table 3. For the SNPs in LD (chromosome 1), include in the footnote id the LD was estimated in the data (or 1000G Project reference panel).

The LD level is R2 > 0.99 and D’ = 1 for any pair of the three SNPs, which was estimated from the dataset. This was indicated in Table 3.

We would like to thank you again for reviewing our work. If you have any questions or concerns regarding this manuscript, please feel free to address them to me at mullikin@mail.nih.gov. Thank you for your time and consideration of this manuscript.

Sincerely,

James C. Mullikin

Head, Comparative Genomics Unit

Director, NIH Intramural Sequencing Center

National Human Genome Research Institute

National Institutes of Health

5625 Fishers Lane

Room 5N-01Q

Rockville, MD 20852

Tel (301) 496-2416, Fax (301) 435-6170

Attachment

Submitted filename: Response to Reviewers.pdf

Decision Letter 1

Heming Wang

7 Apr 2020

Admixture mapping identifies genetic regions associated with blood pressure phenotypes in African Americans

PONE-D-19-36060R1

Dear Dr. Mullikin,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Heming Wang, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

The authors have sufficiently addressed the reviewers' questions. I understand their challenge in acquiring additional replication data and eager to publish this paper as soon as possible. Just to clear, ARIC and MESA are considered as population-based studies.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I'd like to thank the authors for addressing my comments and suggestions in the revised manuscript. The only part that I am still unclear about is the rationale behind the authors stating that AADM is the only study that "reflects descendants of either the parental African populations or populations genetically close to the parental African populations". There is no indication within the manuscript that ClinSeq participants are descendants of West Africans (and if it's the case, then it should be specified). If the discovery cohort is a population-based cohort not ascertained for any specific phenotypes/diseases, then have the authors considered using an African American biobank cohort for additional replication?

Reviewer #2: The authors have answered appropriately all my questions and I have no further suggestions for this paper.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Heming Wang

9 Apr 2020

PONE-D-19-36060R1

Admixture mapping identifies genetic regions associated with blood pressure phenotypes in African Americans

Dear Dr. Mullikin:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Heming Wang

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Distribution of systolic blood pressure, diastolic blood pressure, and mean arterial pressure for the ClinSeq® A2 dataset.

    The phenotype distributions are approximately normal.

    (TIFF)

    S2 Fig. Scatter plot for ClinSeq® A2 dataset in HGDP panel.

    This figure shows principal components 1 and 2 of the ClinSeq® A2 dataset using the HGDP reference panel. The red cluster represents African ancestral populations; the blue cluster represents European ancestral populations; the yellow cluster represents East Asian populations, and the purple cluster represents Native American populations from the HGDP reference panel. The black cluster represents the ClinSeq® study.

    (TIFF)

    S3 Fig. Correlation between global European ancestry and principal component 1 and global African ancestry and principal component 2.

    This figure shows the global European ancestry estimated by averaging local ancestry across the entire exome plotted against principal component 1 as estimated by LASER, and the global African ancestry plotted against principal component 2. The X axis denotes global ancestry; the Y axis denotes principal components. Each dot denotes an African American individual who has passed QC.

    (TIFF)

    S4 Fig. Principal component analysis plots of 5,231 participants from the AADM Study.

    Principal component 1 separates Kenyans from Ghanaians and Nigerians and also separates the Kenyans. Principal component 2 separates Ghanaians from Nigerians. Principal component 3 separates 11 Yoruba.

    (TIFF)

    S1 Table. Replication study in the AADM dataset.

    (DOCX)

    S2 Table. Meta-analyses of local-ancestry stratified genotypic association tests.

    (DOCX)

    Attachment

    Submitted filename: Response to Reviewers.pdf

    Data Availability Statement

    Data is available on dbGaP: https://www.ncbi.nlm.nih.gov/gap/advanced_search/?TERM=clinseq.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES