Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Dec 2.
Published in final edited form as: Hum Genet. 2008 Jul 25;124(2):147–154. doi: 10.1007/s00439-008-0532-6

Exploration of the utility of ancestry informative markers for genetic association studies of African Americans with type 2 diabetes and end stage renal disease

Keith L Keene 1,2,3, Josyf C Mychaleckyj 1,2,4,5, Tennille S Leak 1, Shelly G Smith 1, Peter S Perlegas 6, Jasmin Divers 7, Carl D Langefeld 7, Barry I Freedman 5, Donald W Bowden 1,5,6, Michèle M Sale 1,2,3,5,8
PMCID: PMC2786006  NIHMSID: NIHMS157274  PMID: 18654799

Abstract

Admixture and population stratification are major concerns in genetic association studies. We wished to evaluate the impact of admixture using empirically derived data from genetic association studies of African Americans (AA) with type 2 diabetes (T2DM) and end-stage renal disease (ESRD). Seventy ancestry informative markers (AIMs) were genotyped in 577 AA with T2DM-ESRD, 596 AA controls, 44 Yoruba Nigerian (YRI) and 39 European American (EA) controls. Genotypic data and association results for eight T2DM candidate gene studies in our AA population were included. Ancestral estimates were calculated using FRAPPE, ADMIXMAP and STRUCTURE for all AA samples, using varying numbers of AIMs (25, 50, and 70). Ancestry estimates varied significantly across all three programs with the highest estimates obtained using STRUCTURE, followed by ADMIXMAP; while FRAPPE estimates were the lowest. FRAPPE estimates were similar using varying numbers of AIMs, while STRUCTURE estimates using 25 AIMs differed from estimates using 50 and 70 AIMs. Female T2DM-ESRD cases showed higher mean African proportions as compared to female controls, male cases, and male controls. Age showed a weak but significant correlation with individual ancestral estimates in AA cases (r2=0.101; P=0.019) and in the combined set (r2=0.131; P=3.57×10−5). The absolute difference between frequencies in parental populations, absolute δ, was correlated with admixture impact for dominant, additive, and recessive genotypic models of association. This study presents exploratory analyses of the impact of admixture on studies of AA with T2DM-ESRD and supports the use of ancestral proportions as a means of reducing confounding effects due to admixture.

Keywords: Population structure, African American, diabetes

Introduction

In the United States, 20.8 million people, or 7% of the total population, have diabetes (Kaiser Family Foundation 2007). Diabetes is a growing epidemic whose risk factors include obesity, race, and family history. An African American (AA) is twice as likely to develop diabetes as compared to a Caucasian peer (Brancati et al. 2000). Evidence of a genetic contribution to the susceptibility of type 2 diabetes mellitus (T2DM) has been established through twin (Hawkes 1997), familial aggregation (Rotimi et al. 1994), and segregation studies (Goodman and Chung 1975).

There are a limited number of genetic association studies evaluating the susceptibility of minority populations, especially AA, to developing T2DM (Elbein 2007). Genetic association studies using admixed populations are presented with the additional challenge of addressing the contributions due to admixture. An admixed population is the product of mating between individuals from reproductively isolated ancestral populations (Tang et al. 2005). The two largest minority groups in the United States, African Americans and Hispanic Americans, are both examples of admixed populations. Markers that are differentially distributed between two parental populations provide more information regarding their ancestral origin (Pfaff et al. 2004). This divergence in frequencies may result in deviations from Hardy Weinberg Equilibrium (HWE) and produce apparent but spurious association results with disease traits in the presence of admixture linkage disequilibrium (ALD) (Long 1991).

One approach for locating and identifying disease genes is through pedigree-linkage analysis followed by subsequent genetic-association studies. Association studies often attempt to identify susceptibility loci using population samples containing unrelated individuals. Spurious associations may occur if the samples are obtained from two or more subpopulations (population substructure) and if the distribution of the trait and the allele frequency of the polymorphism varies among the subpopulations (Reiner et al. 2005). Although there is a growing debate concerning the contributions of admixture in association studies, few projects address this issue using empirically-derived (Fernandez and Shiver 2004), rather than simulated, data.

In an effort to evaluate the impact of admixture on individual estimates of ancestry and association studies of T2DM-ESRD, 70 AIMs were genotyped in a population of 577 AA with T2DM-ESRD, 596 AA controls, 44 Yoruba Nigerians and 39 EA controls. Additionally, genotypic results from studies of eight T2DM candidate genes (TCF7L2, PPARγ, CAPN10, KCNJ11, TCF1, HNF4α, ESR1 and ENPP1) in AA with T2DM-ESRD were also included to further explore the impact of admixture on estimates of association. This study represents the first exploratory study of the impact of admixture in genetic association studies of AA with T2DM-ESRD.

Materials and Methods

Subjects

This study was conducted under Institutional Review Board approval from Wake Forest University School of Medicine, and adhered to the tenets of the Declaration of Helsinki. Identification, clinical characteristics, and recruitment of AA and European American (EA) patients and controls have been described previously (Freedman et al. 1997; Yu et al. 1996). Briefly, 577 unrelated AA patients with T2DM born in North Carolina, South Carolina, Georgia, Tennessee or Virginia were recruited from dialysis facilities. Individuals with type 1 diabetes, as diagnosed in patients with a history of diabetic ketoacidosis, or who developed diabetes mellitus prior to age 25 and received continuous insulin therapy since diagnosis, were excluded. A diagnosis of T2DM was based upon an initial diagnosis of diabetes mellitus after age 35 years, receiving oral hypoglycemic agents or dietary therapy without insulin for at least one year after initial diagnosis, and active treatment with diabetes medications. Cases had T2DM diagnosed at least 5 years prior to initiating renal replacement therapy, background or greater diabetic retinopathy and/or > 3+ proteinuria on urinalysis in the absence of other causes of nephropathy. Five hundred ninety-six unrelated AA controls born in North Carolina, South Carolina, Georgia, Tennessee or Virginia, who did not have a current diagnosis of T2DM or renal disease, were recruited. Thirty-nine unrelated EA controls without known T2DM or renal disease were recruited using the same recruitment strategy as our AA controls. DNA extraction was performed using the PureGene system (Gentra Systems, Minneapolis, MN). DNA was also obtained from 44 Yoruba Nigerians (YRI) from the National Institute of General Medical Sciences (NIGMS) Human Variation Collection (Coriell Cell Repositories, Camden, NJ).

AIMs selection and genotyping

Seventy biallelic Ancestry Informative Markers (AIMs) were genotyped by Illumina Inc.’s Custom Genotyping Service (San Diego, CA) or using a MassARRAY genotyping system (Sequenom Inc., San Diego, CA) (Buetow KH 2001) in 577 AA with T2DM-ESRD, 596 AA controls, 44 YRI and 39 EA controls (Supplementary Table 1). The AIMs were chosen from the previous literature (Shriver et al. 2003; Smith et al. 2004), or to ensure at least one SNP on each autosome, and adequate physical map separation to ensure linkage equilibrium. Fifty-three AIMs were genotyped using Illumina Inc.’s Custom Genotyping Service (San Diego, CA). The remaining seventeen AIMs were genotyped using iPLEX or hME methodology on a MassARRAY genotyping system (Sequenom Inc., San Diego, CA) (Buetow KH 2001): rs2814778, rs6003, rs2065160, rs2752, rs3287, rs3340, rs455726, rs2763, rs2161, rs3176921, rs680273, rs1042602, rs1800498, rs1800404, rs4646, rs2228478, and rs2816. Primer sequences are available on request.

Statistical analyses

Hardy Weinberg equilibrium (HWE) values were determined by calculating a χ2 statistic and corresponding P-value.

Ancestral proportions were calculated using the programs STRUCTURE (Pritchard et al. 2000), FRAPPE (Frequentist Estimation of individual ancestry proportion) (Tang et al. 2005) and ADMIXMAP (McKeigue et al. 2000) under a two population model. Estimates of “pseudo-ancestral” allele frequencies were obtained from genotyped YRI and EA samples.

Delta values (δ), defined as the difference between the YRI major allele frequency and corresponding allele frequency in EA, were calculated for each AIM using the genotyped data from the 44 YRI and 39 EA individuals. Absolute δ values were used to rank the AIMs, with the highest and lowest absolute δ values earning a ranking of 1 and 70 respectively (Supplementary Table 1). AIMs were then grouped into three sets, representing the subset of AIMs with the highest δ values: “Top 25” (the 25 SNPs with the largest absolute δ values), “Top 50” (50 highest absolute δ values), and all 70 AIMs (“All”) with both programs.

Statistical analysis for correlation and analysis of variance (ANOVA) were determined using Graph Pad Prism 5 (www.graphpad.com). Non parametric (Spearman) correlation coefficients and corresponding P-values were calculated for each pair of data sets. Non parametric ANOVA (Kruskal-Wallis) tests were performed to determine differences in means across groups.

African American association with T2DM-ESRD

Genotypic association results for 208 SNPs from eight T2DM candidate genes: two Calpain 10 (CAPN10) SNPs, one Peroxisome proliferator–activated receptor γ (PPARG) SNP, five Transcription factor 7-like 2 (TCF7L2) SNPs, one ATP-sensitive inwardly rectifying potassium channel subunit Kir6.2 (KCNJ11) SNP, one Hepatic transcription factor 1 (TCF1) SNP, three Hepatocyte nuclear factor 4-α (HNF4A) SNPs (Sale et al. 2007), 146 Estrogen receptor α gene (ESR1) SNPs (Keene et al 2008b) and 49 Ectonucleotide Pyrophosphatase/Phosphodiesterase 1 (ENPP1) SNPs, (Keene et al 2008a), genotyped in this same African American (AA) case-control population, were included in this analysis to determine the impact of admixture adjustments in AA genetic association studies of T2DM-ESRD. Briefly, one hundred fifty SNPs, spanning more than 476 kb, located in the coding and flanking regions of ESR1 were genotyped in 577 AA T2DM-ESRD cases and 596 AA controls. The genotyping success rates for the 150 SNPs in the AA cases and controls range from 94.1–100%. Forty-nine ENPP1 SNPs were genotyped in the same 577 AA T2DM-ESRD cases and 596 controls. The genotyping success rates for the 49 SNPs in the AA cases and controls range from 90.4–100%. TCF7L2 SNP genotyping success rates were 100% in case subjects and >99.8% in control subjects. Genotyping success rates for the remaining candidate SNPs were >97.7% in case subjects and >96.5% in control subjects.

Unadjusted genotypic tests for the dominant, additive, and recessive models were evaluated by calculating a χ2 statistic and corresponding P-value using the program SNPGWA (C. Langefeld, personal communication). Due to a lack of validity of the large sample χ2 test statistic, only the dominant model P-values were considered for SNPs with 10 or fewer individuals that were homozygous for the minor allele. Admixture adjusted P-values were subsequently determined by performing covariate adjustment of individual estimates of African ancestry using 70 AIMs (using FRAPPE estimates) for AA subjects in logistic regression tests of dominant, additive, and recessive models of association as implemented in the program SNPADMIX (C. Langefeld, personal communication).

Results

Population characteristics

Characteristics of the AA case and control populations have been published previously (Sale et al. 2007) and are shown in Supplementary Table 2. Controls were significantly younger than cases (P<0.0001), although they were significantly older than the mean age at T2DM diagnosis in cases (P<0.0001). Additionally, age data were unavailable for 148 controls recruited during the early phase of the study. There was a higher proportion of females cases (61%) than female controls (51%).

Admixture Informative Markers

Genotyping success rates for the 70 AIMs were 97.2–100% in AA cases, 94.1–100% in AA controls, 95.5–100% in Yoruban Africans, and 94.9–100% in Caucasian Europeans (CE). The mean absolute delta values (±SD) were 0.836±0.054, 0.743±0.120, and 0.617±0.237 for 25, 50, and 70 AIMs respectively. Sixty-one of the 70 AIMs had an absolute delta value ≥0.300, while the remaining nine had values ranging from 0.055 to 0.278.

Deviations fromHWE

Using a Hardy Weinberg Equilibrium (HWE) threshold of P<0.01, three AIMs deviated from expected HWE proportions in the AA cases (rs2228478, P=2.80 × 10−9; rs2814778, P=0.00008; and rs4821667, P=0.0022) and eight SNPs (rs2228478, P= 1.5 × 10−10; rs1941141, P=0.00033; rs7957445, P=0.00094; rs3287, P=0.0014; rs3176921, P=0.0015; rs1374092, P=0.0054; rs7349, P=0.0065; and rs4436849, P=0.0086) were inconsistent in the AA controls.

For SNPs genotyped in candidate genes (without regard to ancestral frequencies), two ENPP1 SNPs, rs858341 (P= 0.004) and rs9493105 (P= 0.001) deviated from expected HWE proportions in the AA controls while all SNPs were consistent with HWE in the AA cases. Five ESR1 SNPs deviated from expected HWE proportions in the AA cases (rs9397459, rs9341070, rs9340969, rs1569788, and rs722208) and one SNP (rs9341070) was inconsistent with HWE in the AA controls. One additional SNP was inconsistent with HWE in both case and control subjects (HNF4A SNP rs4810424; P < 0.0001 in both case and control subjects).

Estimates of ancestral proportions

Estimates of African ancestral proportions from STRUCTURE, FRAPPE and ADMIXMAP, calculated using varying numbers of AIMs (25, 50, and 70), are shown in Figure 1. The mean proportion (±SD) of African ancestry in the combined sample set was 0.91±0.14, 0.90±0.14, and 0.90±0.14 for STRUCTURE, 0.79±0.14, 0.80±0.13, and 0.80±0.13 for FRAPPE and 0.87±0.11, 0.86±0.12, and 0.85±0.12 for ADMIXMAP using 25, 50, and 70 AIMs respectively. The individual ancestral estimates for each program and each subset were highly correlated (r2>0.90; P<0.0001) (Figure 2 a, b, c), however, the mean group ancestral estimates were highest using STRUCTURE, followed by ADMIXMAP, while FRAPPE estimates were the lowest. Supplementary Figure 1 displays the distribution of admixture estimates for the combined set for all programs using the three subsets of AIMs. Ancestral estimates obtained from FRAPPE were similar across the three subsets, as determined by performing one way ANOVA analysis (P=0.070). In contrast, ADMIXMAP estimates comparing 70 AIMs versus 25 AIMs were statistically different (P<0.05), while 50 AIMs versus 25 AIMs and 70 AIMs versus 50 AIMs were similar. However, using STRUCTURE, only estimates for 70 and 50 AIMs were similar.

Figure 1. African ancestral proportions in AA cases and controls using varying numbers of AIMs.

Figure 1

Mean ancestral estimates using FRAPPE (F; circles), ADMIXMAP (A; squares), and STRUCTURE (S; diamonds) with varying numbers of AIMs (25, 50, and 70) in AA cases and controls (Combined, n=1,173), AA cases only (n=577), and AA controls only (n=596).

Figure 2. Correlation between individual ancestral estimates for 70 AIMs using.

Figure 2

Figure 2

Figure 2

a) FRAPPE and ADMIXMAP b) FRAPPE and STRUCTURE and c) ADMIXMAP and STRUCTURE.

Gender stratified distribution of admixture estimates for male and female cases and controls are presented in Figure 3. Gender stratified distribution of admixture estimates for male and female cases and controls are presented in Figure 3. The mean proportion of African ancestry (as determined using FRAPPE, 70 AIMs ±SD) was 0.781±0.144 (n= 306) for the female controls, and 0.826±0.123 (n= 352) for the female cases. The mean proportion of African ancestry was 0.800±0.115 (n= 290) for the male controls, and 0.803±0.148 (n= 225) for the male cases. Using ANOVA, the mean African ancestry estimates for the female cases differed significantly from the female controls (P< 0.0001), all samples combined (P<0.05), all controls (P<0.0001), and male controls (P<0.01), but were not significantly different from male case estimates.

Figure 3. Estimates of individual African ancestry (from FRAPPE) stratified by gender.

Figure 3

Histogram displaying the distribution of individual ancestral estimates from FRAPPE for AA male cases (n=225), female cases (n=352), male controls (n=290), and female controls (n=306).

There was a significant but weak correlation between age and individual admixture estimates in the AA cases (r2= 0.101; P= 0.019) and combined (AA cases and controls) (r2=0.131; P=3.57 × 10−5) that was not observed in the AA controls (r2=0.016; P=0.733) alone (Supplementary Figure 2 a,b,c).

Impact of admixture on genotypic tests of association

Admixture adjustments were performed for all genotypic associations of candidate genes. Seven of the nine ENPP1 SNPs and 25 of the 31 ESR1 SNPs remained significantly associated with T2DM-ESRD in one or more genotypic models of association, after adjusting for admixture. Of the remaining 13 SNPs, seven SNPs were associated with T2DM-ESRD, six of which remained significant after adjusting for admixture. The impact of admixture on association results is depicted in figure 4, showing the difference between the −log(unadjusted) and −log(admixture adjusted) P-value, plotted against the corresponding absolute δ value for that SNP. The impact of admixture was significantly correlated with absolute delta value for the dominant (r2=0.1997; P=0.0051), additive (r2=0.2662; P=0.0015), and recessive (r2=0.1735; P=0.0410) models of association.

Figure 4. Impact of adjusting for admixture on genotypic association results for 208 SNPs from eight T2DM candidate genes.

Figure 4

Relationship between absolute δ and the impact of admixture (using FRAPPE generated individuals ancestral estimates) on genotypic tests of association for dominant, additive, and recessive models in 208 SNPs from eight candidate gene studies for T2DM. Absolute δ, defined as the absolute difference between the Yoruban African major allele frequency and corresponding allele frequency in European Americans, is plotted along the x-axis while the admixture impact, defined as the difference between the −log(unadjusted) and −log(admixture adjusted) P-value is plotted along the y-axis.

Discussion

Several studies have investigated the theoretical impact of admixture association with disease traits using simulated data. We have evaluated the impact of admixture on individual estimates of ancestry generated using three methods, correlations with gender and age, and the impact on association results.

Individual ancestral estimates, calculated using FRAPPE, ADMIXMAP, and STRUCTURE were highly correlated (r2>0.90; P<0.0001). However, the overall group means for STRUCTURE were significantly higher compared to ADMIXMAP and FRAPPE, which is consistent with reports using simulated data (Tsai et al. 2005). These differences in estimations can be contributed to the methodologies implemented in the various programs (Maximum Likelihood Estimation (FRAPPE) and Bayesian methodologies (ADMIXMAP and STRUCTURE)) as well as the way each program generates ancestral allele frequencies from the two parental populations. Using ANOVA, intra-program mean ancestral estimates using FRAPPE were similar for all three subsets of AIMS (25, 50, 70, Figure 1). In contrast mean ancestral estimates from STRUCTURE were similar only for estimates obtained using 70 and 50 AIMS. However, ADMIXMAP estimates were similar when comparing 70 and 50 AIMs, as well as 50 and 25 AIMs but the mean group estimates using 25 AIMs were significantly higher than the estimates obtained using the total set of 70 AIMs, suggesting that 25 highly informative AIMs may not be sufficient to determine “accurate” individual ancestral proportions, but 50 highly informative AIMs may suffice. Although using 100 AIMs for estimating ancestry in Latino subjects has been proposed by Tsai et al (2005), their data also suggests that more informative markers (FST higher than 0.5) could reduce the number of markers were needed to decrease the type I error rate. When they simulated 25 AIMs with FST higher than 0.5, the number of AIMs required for controlling the excess of false positives was less than 50 (Tsai et al. 2005).

The mean ancestral estimate (using FRAPPE, 70 AIMs) for the combined AA in this study (0.80±0.13) was similar to previous AA estimates from Winston Salem, NC (0.791) (Reiner et al. 2005) , Washington, DC (0.787) (Shriver et al. 2003) , Chicago, IL (.806) (Smith et al. 2004), Birmingham, AL (0.812) (Reiner et al. 2007) , and New York (0.802), but different from Gullah speaking AA from the Sea Islands of coastal South Carolina (0.965) (Parra et al. 2001).

Gender stratified ancestral estimates (±SD) (as determined using FRAPPE, 70 AIMs) were 0.781±0.145 for female controls (n= 306), 0.826±0.123 for female cases (n= 352), 0.800±0.115 for male controls (n= 290), and 0.803±0.148 for male cases (n= 225). ANOVA analysis showed the mean ancestral estimate in female cases differed significantly from male (P<0.01), and female controls (P< 0.0001), but were similar to male case estimates. These findings also hold true when performing logit transformations of ancestry estimates (data not shown). Although ancestral lineage differences have been supported by studies utilizing X, Y, and mitochondrial markers, (Lind et al. 2007; McLean et al. 2005), the divergent frequencies in the female cases and controls suggests that females from our recruitment region with higher African proportion may have an increased risk for developing T2DM-ESRD. This theory is supported by the unexplained increase in risk for having a family history of ESRD that is present among African American women, relative to men (Freedman et al. 2005; McClellan et al. 2007) . The higher proportion of ancestry in female cases may hint at a biological relationship between gender and greater African ancestry. A modification of the thrifty gene hypothesis (Neel 1999) is one possible explanation for the higher African ancestry observed in AA female T2DM-ESRD patients. If AA females with T2DM have genes of “African descent” that are involved in energy storage and expenditure, but also influence or are influenced by gender-specific pathways, when exposed to a westernized lifestyle, these individuals will be at an increased risk of obesity and subsequently developing T2DM. African ancestry and markers for ancestry have previously shown to be correlated with body composition traits in older AA (Shaffer et al. 2007) and associated with resting metabolic rate and obesity in AA women (n=145) (Fernandez et al. 2003) . One possible pathway, suggested by our previous association studies, is through estrogen receptor mediated actions. Estrogen has shown to have antidiabetic actions in both rodent models as well as clinical trials (Louet et al. 2004; Margolis et al. 2004; Seed 2002) . Our association results suggest a differential association with ESR1 variants and T2DM-ESRD in males (7 SNPs) versus females (35 SNPs) (Keene et al 2008b).

There was a significant but weak correlation between age and individual admixture estimates in the AA cases (r2= 0.101; P= 0.019) and combined (AA cases and controls) (r2=0.131; P=3.57×10−5), that was not observed in the AA controls (r2=−0.016; P=0.733) alone. The slope of the trend line for the cases suggests that older AA patients may have higher African ancestry. The lack of significance in the controls may be due to missing age data for 148 controls. Eighty-two individuals with missing age data had an individual ancestry estimate greater than 0.79 (the control group mean) while only 12 individuals with missing age data had an individual ancestry estimate less than 0.60, suggesting a bias towards missing age data in AA controls with higher ancestral data may have impacted our ability to detect the modest relationship observed in cases. The trend in the AA cases does suggest that younger generations of AA may have increasing levels of European admixture, therefore it may be important to test for this correlation before performing age- and admixture adjusted analyses.

The impact of admixture on association results was significantly correlated with absolute δ value for the dominant, additive, and recessive genotypic models of association. Additionally, there was no correlation between minor allele frequency or genotypic frequency with the impact of admixture, suggesting that there is no bias of admixture effect based on the actual allele frequency in the admixed population. Of the 208 SNPs, 47 (22.6%) were nominally associated with T2DM-ESRD (before adjusting for admixture) in this AA population. Of those 47 SNPs, nine were no longer associated with T2DM-ESRD after adjusting for African ancestry (Supplementary Table 3). Although these nine SNPs only represent 4% of the total SNP count, they account for nearly 20% of all of the associated SNPs, thus demonstrating the increased chance of type I error due to admixture. Admixture also affected the most highly associated SNP in this study, TCF7L2 SNP rs7903146, with an admixture-adjusted dominant model P-value of 2.55 × 10−8, compared with the unadjusted dominant model P-value of 1.79 × 10−8.

Associations for this SNP have been replicated in T2DM association studies from several populations (Sale et al. 2007) and the SNP has very similar allele frequencies for Africans and Caucasians, with an absolute δ value of only 0.0149. We conclude that admixture adjustments should be performed for all markers in an admixed population regardless of parental population frequencies, δ values, or actual allele and genotypic frequencies in the admixed population; however it is especially important to account for admixture in genetic association studies where the frequency of the trait/disease of interest and the allele frequency of the polymorphism varies significantly among the ancestral populations (subpopulations).

Increasing the sample size of the parental populations may provide more accurate estimates of the parental frequencies for each marker, which in turn could improve individual estimates of ancestry. Additionally, sampling from more divergent African populations may also increase the accuracy of these estimates, although historical records and genetic studies (Reed and Tishkoff 2006) suggest that the majority of AA have ancestral origins from West Africa. Ancestry estimates can be determined and may vary depending on the DNA source (i.e. autosomes, sex chromosomes and mitochondrial DNA). One limitation of this study was the lack of ancestral estimates using mitochondrial DNA and sex chromosomes, however the estimates obtained using 70 autosomes appear to be informative for ancestry and useful for association studies.

This study is the first such admixture study of its kind to analyze the effects of admixture in AA populations with T2DM-ESRD. Investigation of differing number of AIMs suggest as few as 50 AIMs may be sufficient to generate reliable ancestral estimates in AA, although more studies are necessary to verify this finding. Additionally, this study provides evidence that higher proportions of African ancestry show a modest correlation with age in AA cases from this population, and even greater correlation with T2DM-ESRD status in AA females. The higher ancestry in AA females suggests that there may be a biological relationship between African ancestry and sex as it relates to T2DM-ESRD, however further evaluations are necessary in order to identify the exact relationship. This study provides insight into the influence and critical importance of considering admixture in AA genetic association studies.

Supplementary Material

Supplementary Material

Acknowledgments

We thank the individuals recruited as cases and controls for their participation, recruiters Joyce Byers and Mitzie Spainhour, technician Candace Gordon, programmer Matt Stiegert, and Mark Hansen and colleagues at Illumina Inc. This work was supported by grants DK066358, DK072550, DK070941, the Wake Forest University General Clinical Research Center M01 RR07122, and a Career Development Award from the American Diabetes Association (MMS).

References

  1. Brancati FL, Kao WH, Folsom AR, Watson RL, Szklo M. Incident type 2 diabetes mellitus in African American and white adults: the Atherosclerosis Risk in Communities Study. Jama. 2000;283:2253–2259. doi: 10.1001/jama.283.17.2253. [DOI] [PubMed] [Google Scholar]
  2. Buetow KH, Edmonson M, MacDonald R, Clifford R, Yip P, Kelley J, Little DP, Strausberg R, Koester H, Cantor CR, Braun A. High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Proc Natl Acad Sci U S A. 2001;98:581–584. doi: 10.1073/pnas.021506298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Elbein SC. Evaluation of polymorphisms known to contribute to risk for diabetes in African and African-American populations. Curr Opin Clin Nutr Metab Care. 2007;10:415–419. doi: 10.1097/MCO.0b013e3281e2c99a. [DOI] [PubMed] [Google Scholar]
  4. Fernandez JR, Shiver MD. Using genetic admixture to study the biology of obesity traits and to map genes in admixed populations. Nutr Rev. 2004;62:S69–S74. doi: 10.1111/j.1753-4887.2004.tb00091.x. [DOI] [PubMed] [Google Scholar]
  5. Fernandez JR, Shriver MD, Beasley TM, Rafla-Demetrious N, Parra E, Albu J, Nicklas B, Ryan AS, McKeigue PM, Hoggart CL, Weinsier RL, Allison DB. Association of African genetic admixture with resting metabolic rate and obesity among women. Obes Res. 2003;11:904–911. doi: 10.1038/oby.2003.124. [DOI] [PubMed] [Google Scholar]
  6. Freedman BI, Volkova NV, Satko SG, Krisher J, Jurkovitz C, Soucie JM, McClellan WM. Population-based screening for family history of end-stage renal disease among incident dialysis patients. Am J Nephrol. 2005;25:529–535. doi: 10.1159/000088491. [DOI] [PubMed] [Google Scholar]
  7. Freedman BI, Yu H, Spray BJ, Rich SS, Rothschild CB, Bowden DW. Genetic linkage analysis of growth factor loci and end-stage renal disease in African Americans. Kidney Int. 1997;51:819–825. doi: 10.1038/ki.1997.115. [DOI] [PubMed] [Google Scholar]
  8. Goodman MJ, Chung CS. Diabetes mellitus: discrimination between single locus and multifactorial models of inheritance. Clin Genet. 1975;8:66–74. doi: 10.1111/j.1399-0004.1975.tb01956.x. [DOI] [PubMed] [Google Scholar]
  9. Hawkes CH. Twin studies in diabetes mellitus. Diabet Med. 1997;14:347–352. doi: 10.1002/(SICI)1096-9136(199705)14:5<347::AID-DIA332>3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
  10. Lind JM, Hutcheson-Dilks HB, Williams SM, Moore JH, Essex M, Ruiz-Pesini E, Wallace DC, Tishkoff SA, O'Brien SJ, Smith MW. Elevated male European and female African contributions to the genomes of African American individuals. Hum Genet. 2007;120:713–722. doi: 10.1007/s00439-006-0261-7. [DOI] [PubMed] [Google Scholar]
  11. Long JC. The genetic structure of admixed populations. Genetics. 1991;127:417–428. doi: 10.1093/genetics/127.2.417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Kaiser Family Foundation. State Health Facts Online. 2007 http://www.statehealthfacts.org/
  13. Keene KL, Mychaleckyj JC, Smith SG, Leak TS, Perlegas PS, Langefeld CD, Freedman BI, Rich SS, Bowden DW, Sale MM. Association of the distal region of the Ectonucleotide Pyrophosphatase/Phosphodiesterase 1 (ENPP1) gene with Type 2 Diabetes in an African American Population enriched for nephropathy. Diabetes. 2008a;57(4):1057–1062. doi: 10.2337/db07-0886. [DOI] [PubMed] [Google Scholar]
  14. Keene KL, Mychaleckyj JC, Smith SG, Leak TS, Perlegas PS, Langefeld CD, Herrington DM, Freedman BI, Rich SS, Bowden DW, Sale MM. Comprehensive evaluation of the Estrogen Receptor Alpha Gene reveals further evidence for association with type 2 diabetes enriched for nephropathy in an African American population. Hum Genet. 2008b;123(4):333–341. doi: 10.1007/s00439-008-0482-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Louet JF, LeMay C, Mauvais-Jarvis F. Antidiabetic actions of estrogen: insight from human and genetic mouse models. Curr Atheroscler Rep. 2004;6:180–185. doi: 10.1007/s11883-004-0030-9. [DOI] [PubMed] [Google Scholar]
  16. Margolis KL, Bonds DE, Rodabough RJ, Tinker L, Phillips LS, Allen C, Bassford T, Burke G, Torrens J, Howard BV. Effect of oestrogen plus progestin on the incidence of diabetes in postmenopausal women: results from the Women's Health Initiative Hormone Trial. Diabetologia. 2004;47:1175–1187. doi: 10.1007/s00125-004-1448-x. [DOI] [PubMed] [Google Scholar]
  17. McClellan W, Speckman R, McClure L, Howard V, Campbell RC, Cushman M, Audhya P, Howard G, Warnock DG. Prevalence and characteristics of a family history of end-stage renal disease among adults in the United States population: Reasons for Geographic and Racial Differences in Stroke (REGARDS) renal cohort study. J Am Soc Nephrol. 2007;18:1344–1352. doi: 10.1681/ASN.2006090952. [DOI] [PubMed] [Google Scholar]
  18. McKeigue PM, Carpenter JR, Parra EJ, Shriver MD. Estimation of admixture and detection of linkage in admixed populations by a Bayesian approach: application to African-American populations. Ann Hum Genet. 2000;64:171–186. doi: 10.1017/S0003480000008022. [DOI] [PubMed] [Google Scholar]
  19. McLean DC, Jr, Spruill I, Argyropoulos G, Page GP, Shriver MD, Garvey WT. Mitochondrial DNA (mtDNA) haplotypes reveal maternal population genetic affinities of Sea Island Gullah-speaking African Americans. Am J Phys Anthropol. 2005;127:427–438. doi: 10.1002/ajpa.20047. [DOI] [PubMed] [Google Scholar]
  20. Neel JV. Diabetes mellitus: a "thrifty" genotype rendered detrimental by "progress"? 1962. Bull World Health Organ. 1999;77:694–703. discussion 692-3. [PMC free article] [PubMed] [Google Scholar]
  21. Parra EJ, Kittles RA, Argyropoulos G, Pfaff CL, Hiester K, Bonilla C, Sylvester N, Parrish-Gause D, Garvey WT, Jin L, McKeigue PM, Kamboh MI, Ferrell RE, Pollitzer WS, Shriver MD. Ancestral proportions and admixture dynamics in geographically defined African Americans living in South Carolina. Am J Phys Anthropol. 2001;114:18–29. doi: 10.1002/1096-8644(200101)114:1<18::AID-AJPA1002>3.0.CO;2-2. [DOI] [PubMed] [Google Scholar]
  22. Pfaff CL, Barnholtz-Sloan J, Wagner JK, Long JC. Information on ancestry from genetic markers. Genet Epidemiol. 2004;26:305–315. doi: 10.1002/gepi.10319. [DOI] [PubMed] [Google Scholar]
  23. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–959. doi: 10.1093/genetics/155.2.945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Reed FA, Tishkoff SA. African human diversity, origins and migrations. Curr Opin Genet Dev. 2006;16:597–605. doi: 10.1016/j.gde.2006.10.008. [DOI] [PubMed] [Google Scholar]
  25. Reiner AP, Carlson CS, Ziv E, Iribarren C, Jaquish CE, Nickerson DA. Genetic ancestry, population sub-structure, and cardiovascular disease-related traits among African-American participants in the CARDIA Study. Hum Genet. 2007;121:565–575. doi: 10.1007/s00439-007-0350-2. [DOI] [PubMed] [Google Scholar]
  26. Reiner AP, Ziv E, Lind DL, Nievergelt CM, Schork NJ, Cummings SR, Phong A, Burchard EG, Harris TB, Psaty BM, Kwok PY. Population structure, admixture, and aging-related phenotypes in African American adults: the Cardiovascular Health Study. Am J Hum Genet. 2005;76:463–477. doi: 10.1086/428654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Rotimi C, Cooper R, Cao G, Sundarum C, McGee D. Familial aggregation of cardiovascular diseases in African-American pedigrees. Genet Epidemiol. 1994;11:397–407. doi: 10.1002/gepi.1370110502. [DOI] [PubMed] [Google Scholar]
  28. Sale MM, Smith SG, Mychaleckyj JC, Keene KL, Langefeld CD, Leak TS, Hicks PJ, Bowden DW, Rich SS, Freedman BI. Variants of the transcription factor 7-like 2 (TCF7L2) gene are associated with type 2 diabetes in an African-American population enriched for nephropathy. Diabetes. 2007;56(10):2638–2642. doi: 10.2337/db07-0012. [DOI] [PubMed] [Google Scholar]
  29. Seed M. The choice of hormone replacement therapy or statin therapy in the treatment of hyperlipidemic postmenopausal women. Atheroscler Suppl. 2002;3:53–63. doi: 10.1016/s1567-5688(01)00009-5. [DOI] [PubMed] [Google Scholar]
  30. Shaffer JR, Kammerer CM, Reich D, McDonald G, Patterson N, Goodpaster B, Bauer DC, Li J, Newman AB, Cauley JA, Harris TB, Tylavsky F, Ferrell RE, Zmuda JM. Genetic markers for ancestry are correlated with body composition traits in older African Americans. Osteoporos Int. 2007;18:733–741. doi: 10.1007/s00198-006-0316-6. [DOI] [PubMed] [Google Scholar]
  31. Shriver MD, Parra EJ, Dios S, Bonilla C, Norton H, Jovel C, Pfaff C, Jones C, Massac A, Cameron N, Baron A, Jackson T, Argyropoulos G, Jin L, Hoggart CJ, McKeigue PM, Kittles RA. Skin pigmentation, biogeographical ancestry and admixture mapping. Hum Genet. 2003;112:387–399. doi: 10.1007/s00439-002-0896-y. [DOI] [PubMed] [Google Scholar]
  32. Smith MW, Patterson N, Lautenberger JA, Truelove AL, McDonald GJ, Waliszewska A, Kessing BD, Malasky MJ, Scafe C, Le E, De Jager PL, Mignault AA, Yi Z, De The G, Essex M, Sankale JL, Moore JH, Poku K, Phair JP, Goedert JJ, Vlahov D, Williams SM, Tishkoff SA, Winkler CA, De La Vega FM, Woodage T, Sninsky JJ, Hafler DA, Altshuler D, Gilbert DA, O'Brien SJ, Reich D. A high-density admixture map for disease gene discovery in african americans. Am J Hum Genet. 2004;74:1001–1013. doi: 10.1086/420856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Tang H, Peng J, Wang P, Risch NJ. Estimation of individual admixture: analytical and study design considerations. Genet Epidemiol. 2005;28:289–301. doi: 10.1002/gepi.20064. [DOI] [PubMed] [Google Scholar]
  34. Tsai HJ, Choudhry S, Naqvi M, Rodriguez-Cintron W, Burchard EG, Ziv E. Comparison of three methods to estimate genetic ancestry and control for stratification in genetic association studies among admixed populations. Hum Genet. 2005;118:424–433. doi: 10.1007/s00439-005-0067-z. [DOI] [PubMed] [Google Scholar]
  35. Yu H, Bowden DW, Spray BJ, Rich SS, Freedman BI. Linkage analysis between loci in the renin-angiotensin axis and end-stage renal disease in African Americans. J Am Soc Nephrol. 1996;7:2559–2564. doi: 10.1681/ASN.V7122559. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material

RESOURCES