Abstract
Objectives: Using a novel candidate SNP approach, we aimed to identify a possible genetic basis for the higher glioma incidence in Whites relative to East Asians and African-Americans. Methods: We hypothesized that genetic regions containing SNPs with extreme differences in allele frequencies across ethnicities are most likely to harbor susceptibility variants. We used International HapMap Project data to identify 3,961 candidate SNPs with the largest allele frequency differences in Whites compared to East Asians and Africans and tested these SNPs for association with glioma risk in a set of White cases and controls. Top SNPs identified in the discovery dataset were tested for association with glioma in five independent replication datasets. Results: No SNP achieved statistical significance in either the discovery or replication datasets after accounting for multiple testing or conducting meta-analysis. However, the most strongly associated SNP, rs879471, was found to be in linkage disequilibrium with a previously identified risk SNP, rs6010620, in RTEL1. We estimate rs6010620 to account for a glioma incidence rate ratio of 1.34 for Whites relative to East Asians. Conclusion: We explored genetic susceptibility to glioma using a novel candidate SNP method which may be applicable to other diseases with appropriate epidemiologic patterns.
Keywords: glioma, candidate SNP association study, ancestry informative markers, admixture, race, ethnicity, brain cancer
Introduction
Incidence rates of adult primary malignant brain tumors (PMBT), most of which are gliomas (Kohler et al., 2011), vary among ethnic groups (Darefsky and Dubrow, 2009; Dubrow and Darefsky, 2011). The age-standardized incidence rate for northern American non-Hispanic Whites is 2.5–3.0 times the rate among East Asians and around twice the rate among African-Americans. The latter ratio is likely to be higher for comparisons of White to African populations, given the ∼20% European content of the African-American genome (Patterson et al., 2004); however presently there are no data allowing an evaluation (Darefsky and Dubrow, 2009). These ethnic differences in PMBT incidence are unlikely to be solely ascribable to factors such as access to care or diagnostic facilities; in particular, the White-East Asian difference is observed in comparisons among different countries as well as within the United States, where both groups have similar access (Darefsky and Dubrow, 2009; Dubrow and Darefsky, 2011). Notably, ethnic incidence variation has been observed for both grade IV glioma (glioblastoma, or GBM) and non-GBM tumors (Dubrow and Darefsky, 2011).
The only established environmental risk factor for glioma is exposure to high-dose ionizing radiation (Bondy et al., 2008; Ostrom and Barnholtz-Sloan, 2011), which accounts for a small number of cases; furthermore, studies have demonstrated a consistent inverse association with history of allergy (Schoemaker et al., 2010; Lachance et al., 2011) as well as evidence of interaction effects between history of allergy and several established glioma risk alleles (Schoemaker et al., 2010). However, epidemiologic studies have provided no conclusive evidence for diagnostic radiation (Davis et al., 2011), electromagnetic field exposure from residential power lines (Wrensch et al., 1999), smoking (Mandelzweig et al., 2009), alcohol consumption (Efird et al., 2004), nutritional factors (Bondy et al., 2008), or cell phone use (Cardis et al., 2010) as risk factors. Collectively, these observations suggest that ethnic group associated genetic variants, rather than environmental factors, underscore variation in glioma incidence among ethnic groups. Such an assertion is supported by a number of studies suggesting genetic pathways to glioma may differ across ethnicities (Mochizuki et al., 1999; Chen et al., 2001; Das et al., 2002; Wiencke et al., 2005). Following on from this it is possible that the frequencies of haplotypes associated with glioma susceptibility will differ between Whites and East Asians/Africans, such that haplotypes harboring alleles associated with an increased glioma risk would be more prevalent among Whites and conversely haplotypes associated with decreased glioma risk would be more prevalent among East Asians and Africans. Identification of these haplotypes offers the prospect of gaining valuable insight into genes influencing glioma risk.
Here we employed a candidate SNP approach to identify previously unknown genetic variants associated with glioma risk through the identification of SNPs that may tag glioma-related haplotypes. Our primary hypothesis is based on the premise that the same alleles confer protection against glioma in both East Asians and Africans. Consequently, we propose that these alleles are carried at a greater frequency by both East Asians and Africans than by Whites and that genetic regions (i.e., haplotypes) containing SNPs with the greatest allele frequency differences between Whites and both East Asians and Africans (with the same direction of difference) are particularly likely to harbor these alleles. To take into account the possibility that alleles that confer protection in East Asians differ from alleles that confer protection in Africans, we also propose a secondary hypothesis that genetic regions containing SNPs with the greatest allele frequency differences between Whites and either East Asians or Africans, but not both, are likely to harbor protective alleles which are distinct from those identified under the primary hypothesis.
Given that ethnic incidence differences are broadly similar for GBM and non-GBM glioma (Dubrow and Darefsky, 2011), we postulate that polymorphisms driving these incidence differences are common across these glioma subtypes, and therefore consider all gliomas combined without stratification. Since large differences in allele frequency are needed to account for even a relatively small portion of the White/East Asian or White/African incidence rate ratio (Figure 1), we restrict our analyses to SNPs showing the largest frequency differences.
Materials and Methods
Selection of candidate SNPs
To select candidate SNPs we used allele frequency data on unrelated individuals from six populations included in the International HapMap Project Phase III (Altshuler et al., 2010): 113 Utah residents with Northern and Western European ethnicity (CEU); 102 Toscans from Italy (TSI); 137 Han Chinese from Beijing, China (CHB); 113 Japanese from Tokyo, Japan (JPT); 147 Yoruba from Ibadan, Nigeria (YRI); and 110 Luhya from Webuye, Kenya (LWK). We grouped CEU and TSI together as Whites, CHB and JPT as East Asians, and YRI and LWK as Africans. Data from Phase III, release 28 were downloaded from the HapMap Project File Transfer Protocol1. In this release, frequency of genotype missingness per SNP was required to be <0.05 per population, and SNPs were excluded with Hardy Weinberg P < 10−7. We used in-house Perl scripts to calculate allele frequencies and call rates, according to ethnic group, for SNPs genotyped previously for our discovery genome-wide association study (GWAS) of high-grade adult glioma (Wrensch et al., 2009). SNPs with call rates <95% among Whites, East Asians, or Africans, respectively, were excluded. For each SNP, differences in reference allele frequencies were calculated for Whites vs. East Asians and Whites vs. Africans, as well as the average difference if differences were in the same direction (e.g., the frequency of allele A is low in Whites but high in both East Asians and Africans).
To test our primary hypothesis, we selected SNPs with the greatest average allele frequency differences (provided equivalent directionality) as defined by three categories: “Highest” (≥0.70), “High” (0.60 to <0.70), and “Moderate” (0.40 to <0.60). To test our secondary hypothesis, we selected SNPs for which the allele frequency difference was Highest (≥0.70) in one population comparison, but Low (<0.40) in the other (“Highest/Low”).
Discovery dataset
Descriptive characteristics of discovery set cases and controls are presented in Table 1. Subjects providing genotype data for the discovery phase included 692 high-grade glioma cases and 3,992 controls originally assembled for the 2009 GWAS of glioma by Wrensch et al. (2009). Briefly, cases included 622 individuals of European ethnicity from the San Francisco Adult Glioma Study (AGS) and 70 from The Cancer Genome Atlas (TCGA; McLendon et al., 2008), aged 20 or older with incident histologically confirmed anaplastic astrocytoma (n = 97) or GBM (n = 595; International Classification of Diseases for Oncology, morphology codes 9380–9481). Controls included 602 subjects from AGS identified using random digit dialing and frequency matched to cases on age, sex, and ethnicity, as well as 3,390 subjects from the Illumina iControl Database2. All subjects were confirmed to be unrelated and of European ethnicity by multidimensional scaling analysis.
Table 1.
Characteristicb | Cases |
Controls |
||
---|---|---|---|---|
Adult Glioma Study (n = 622) | The Cancer Genome Atlas (n = 70) | Adult Glioma Study (n = 602) | Illumina iControls (n = 3,390) | |
Age | 55 ± 0.5 | 54 ± 1.7 | 56 ± 0.6 | 29 ± 0.4 |
GENDER | ||||
Male | 398 (64%) | 40 (57%) | 319 (53%) | 1,254 (37%) |
Female | 224 (36%) | 30 (43%) | 283 (47%) | 2,136 (63%) |
TUMOR SUBTYPE | ||||
Grade III | 97 (16%) | 0 (0%) | – | – |
Grade IV | 525 (84%) | 70 (100%) | – | – |
aFrom Wrensch et al. (2009).
bTable values are mean ± standard error for continuous variables and n (column%) for categorical variables.
Discovery dataset genotyping
Details of sample preparation and genotyping have been provided previously (Wrensch et al., 2009). Briefly, DNA from all AGS cases and controls was isolated from whole blood using Qiagen’s Gentra Puregene DNA isolation kit, and genotyping was conducted using Illumina’s HumanCNV370-Duo BeadChip. AGS samples were required to have a call rate of at least 98%. SNPs deviating from Hardy Weinberg equilibrium in AGS or Illumina controls (P < 10−5) were excluded from further analysis, as were those with greater than 5% missing data in any of the four subject groups (AGS cases or controls, TCGA cases, Illumina controls).
Replication datasets
We investigated our top candidate SNPs from the discovery dataset in five independent sets of cases and controls. Detailed procedures of subject selection and genotyping have been described previously (Shete et al., 2009; Wrensch et al., 2009; Sanson et al., 2011). Mayo Clinic cases (n = 176), 65% with GBM and 35% with grade III glioma, were diagnosed in Rochester, Minnesota between 2005 and 2008. Controls (n = 174) were identified from among individuals who had a general medical exam at the Mayo Clinic, and were matched to cases on sex, age, race, and residence. All cases and controls were genotyped using Illumina Human 610Quad arrays.
The four other replication datasets (UK, US, French, and German) were previously included in a pooled GWAS of glioma (Shete et al., 2009; Sanson et al., 2011). Briefly, the UK GWAS comprised 631 cases ascertained through the INTERPHONE study (Cardis et al., 2010) and 2,699 controls from the 1958 Birth Cohort (Power and Elliott, 2006). The US GWAS comprised 1,247 cases recruited through MD Anderson Cancer Center in Houston, Texas and 2,236 controls from the Cancer Genetic Markers of Susceptibility study (Hunter et al., 2007). The French GWAS comprised 1,423 cases from the Service de Neurologie Mazarin, Groupe Hospitalier Pitié-Salpêtrière Paris, and 1,190 controls from the SU.VI.MAX study (Hercberg et al., 2004). The German GWAS comprised 846 cases recruited from the University of Bonn Medical Center, and 1,310 controls from the KORA (Holle et al., 2005; Wichmann et al., 2005), POPGEN (Krawczak et al., 2006), and Heinz Nixdorf RECALL studies (Schmermund et al., 2002). Cases in the UK and US GWAS were genotyped using Illumina Human 610Quad arrays, and cases from the French and German GWAS were genotyped using Illumina HumanHap660 arrays. Controls in the UK GWAS were genotyped using Illumina Human 1M Duo arrays; the US controls on Illumina HumanHap240, 300, and 500 arrays; the French controls on Illumina HumanHap660 arrays; and the German controls using Illumina HumanHap550 arrays.
Statistical analyses
Odds ratios and 95% confidence intervals for the association of candidate SNPs with glioma in the discovery and replication sets were calculated using unconditional logistic regression under an additive model (0, 1, or 2 copies of the minor allele). Potential population stratification was adjusted for using principal components derived by the EIGENSTRAT method and included in the logistic regression model (Price et al., 2006).
Discovery set results were evaluated in comparison to Bonferroni-adjusted significance thresholds based on a study-wide significance threshold of 0.05, calculated separately for each of the four subgroups of candidate SNPs [Highest allele frequency difference (≥0.70), High (0.60 to <0.70), Moderate (0.40 to <0.60), and Highest/Low (≥0.70 in one population, <0.40 in the other population)] such that the significance thresholds accounted for the prior probability of association with glioma according to our hypotheses (i.e., P = 0.0125 per subgroup). With 38 SNPs in the Highest allele frequency difference category, the significance threshold for this category was 0.0125/38 = 3.29 × 10−4. Statistical thresholds for the High, Moderate, and Highest/Low subgroups were 4.70 × 10−5, 4.34 × 10−6, and 1.61 × 10−5, respectively (Table 2). For replication set analyses we used a nominal significance level of 0.05. All P-values reported (discovery and replication) are one-sided because of the directionality inherent in the hypothesis being tested. The generic inverse variance method was used (assuming a fixed effects model) to obtain meta-analysis results for combined discovery and replication set data.
Table 2.
Category | Number of SNPs | Significance threshold | |
---|---|---|---|
Primary hypothesis | Highest mean allele frequency difference (≥0.70) | 38 | 3.29 × 10−4 |
High mean allele frequency difference (0.60 to <0.70) | 266 | 4.70 × 10−5 | |
Moderate mean allele frequency difference (0.40 to <0.60) | 2,883 | 4.34 × 10−6 | |
Secondary hypothesis | Highest/low allele frequency differences (≥0.70 in one comparison and <0.40 in the other) | 774 | 1.61 × 10−5 |
Analyses were conducted and data visualized using SAS software version 9.2 (SAS Institute, Cary, NC, USA), Haploview version 4.2 (Barrett et al., 2005), R version 2.13.1, Microsoft Excel, Galaxy Project software (Blankenberg et al., 2001), and Review Manager Version 5.1.4 (Cochrane Collaboration, Oxford, UK).
Results
Selected candidate SNPs
Of 275,895 SNPs for which genotype data on discovery glioma cases and controls were previously available, HapMap data were not available or were of insufficient quality for 1,188 (0.43%). Predetermined allele frequency difference criteria were met for 3,961 of the 274,707 remaining SNPs (Figure 2). We identified 2,883 SNPs in the Moderate (0.40 to <0.60) allele frequency difference category, 266 in the High (0.60 to < 0.70) category, 38 in the Highest (≥0.70) category, and 774 in the Highest/Low category (≥0.70 in one population comparison, but <0.40 in the other; Table 2; Figure 3).
Discovery set
A Manhattan plot of the 3,961 SNP-glioma associations is shown in Figure 4. The most strongly associated SNP, rs879471 in STMN3 on chromosome 20q13 (P = 1.72 × 10−4), maps 39.9 kb from rs6010620, a SNP intronic to RTEL1 that was previously identified as a top hit in the GWAS conducted by Wrensch et al. (2009). Conditioning rs879471 on rs6010620 did not, however, provide evidence of a separate signal (P = 0.22), so rs879471 was excluded from further analysis.
While no SNP association attained our predetermined Bonferroni-adjusted significance levels (Table 2), 10 genes (SMARCA2, BRE, SLCO3A1, MORN5, C10orf11, RBM27, PTPRJ, SPIB, NMNAT1, and RPUSD3) were identified containing at least one SNP with P < 0.01. In order to investigate SNPs that may be markers of glioma risk but were excluded by our strict allele frequency criteria, we tested 260 additional SNPs in these 10 genes and within 5 kb upstream and downstream for association with glioma risk (regardless of ethnic allele frequency differences). We excluded four genes (SLCO3A1, C10orf11, PTPRJ, and NMNAT1) from further analysis because the direction of association of additional tested SNPs with glioma risk was inconsistent with our hypothesis. All SNPs with P < 0.01 across the six remaining genes (SMARCA2, BRE, MORN5, RBM27, SPIB, and RPUSD3) were selected for replication (n = 20 SNPs, nine of which were identified in the secondary discovery analysis).
Replication sets
Discovery and replication set results for the 20 selected SNPs are presented in Table 3. Genotype data were not available for rs4464229 or rs3863 in the UK, US, French, or German sets, and data were not available for rs4666022 in any of the replication sets. For the remaining SNPs, four achieved nominal significance (P < 0.05) with an odds ratio in the same direction as in the discovery set in one of the five replication sets and one achieved nominal significance in two of the replication sets; however, none achieved statistical significance in a replication set after correcting for multiple comparisons, and none achieved nominal significance in the meta-analysis.
Table 3.
rsID | Gene | Discovery set P-valuea | UCSF discovery set (n = 692)b | Mayo Clinic (n = 176) | UK series (n = 631) | US series (n = 1,247) | French series (n = 1,423) | German series (n = 846) | Meta-analysis (n = 5,015)d | P-value for heterogeneitye |
---|---|---|---|---|---|---|---|---|---|---|
Odds ratio (two-sided 95% confidence interval) |
||||||||||
rs11127125 | BRE | 5.75E-03 | 0.83 (0.71, 0.96) | 1.25 (0.87, 1.81) | 0.89 (0.77, 1.04) | 1.17 (1.03, 1.31)* | 1.06 (0.93, 1.21) | 1.00 (0.86, 1.17) | 0.99 (0.93, 1.05) | 0.61 |
rs4464229 | BRE | 5.05E-04 | 0.77 (0.65, 0.90) | 1.37 (0.93, 2.01) | NAc | NA | NA | NA | 0.91 (0.81, 1.02) | 0.28 |
rs4666020 | BRE | 2.90E-04 | 0.76 (0.64, 0.89) | 1.37 (0.93, 2.02) | 0.91 (0.77, 1.07) | 1.16 (1.02, 1.31)* | 1.04 (0.90, 1.20) | 1.02 (0.87, 1.19) | 0.98 (0.92, 1.04) | 0.41 |
rs4666022 | BRE | 3.38E-03 | 0.82 (0.70, 0.95) | NA | NA | NA | NA | NA | NA | - |
rs10175508 | BRE | 7.40E-04 | 0.79 (0.68, 0.91) | 1.21 (0.84, 1.74) | 0.89 (0.77, 1.04) | 1.13 (1.01, 1.27)* | 1.06 (0.93, 1.21) | 1.00 (0.86, 1.16) | 0.98 (0.93, 1.04) | 0.46 |
rs10173528 | BRE | 7.80E-03 | 0.86 (0.77, 0.97) | 1.11 (0.82, 1.50) | 0.92 (0.81, 1.05) | 1.02 (0.92, 1.13) | 1.00 (0.90, 1.12) | 1.10 (0.97, 1.24) | 0.98 (0.94, 1.03) | 0.77 |
rs10173426 | BRE | 4.09E-03 | 0.85 (0.75, 0.96) | 1.10 (0.80, 1.50) | 0.89 (0.78, 1.01)* | 1.00 (0.90, 1.10) | 0.97 (0.87, 1.09) | 1.07 (0.94, 1.22) | 0.98 (0.93, 1.02) | 0.83 |
rs13402525 | BRE | 5.40E-03 | 0.85 (0.75, 0.96) | 1.08 (0.79, 1.48) | 0.89 (0.78, 1.01)* | 1.00 (0.90, 1.11) | 0.98 (0.87, 1.09) | 1.08 (0.95, 1.23) | 0.98 (0.93, 1.03) | 0.81 |
rs3863 | BRE | 9.10E-03 | 1.15 (1.02, 1.30) | 0.87 (0.63, 1.21) | NA | NA | NA | NA | 1.03 (0.92, 1.16) | 0.57 |
rs11131194 | RPUSD3 | 1.98E-03 | 0.84 (0.75, 0.95) | 0.73 (0.53, 1.02)* | 0.90 (0.79, 1.02) | 1.01 (0.91, 1.12) | 1.06 (0.95, 1.19) | 0.93 (0.82, 1.05) | 0.97 (0.92, 1.01) | 0.61 |
rs2279895 | RBM27 | 6.95E-03 | 1.17 (1.03, 1.33) | 0.96 (0.69, 1.34) | 1.04 (0.91, 1.19) | 1.01 (0.91, 1.12) | 1.10 (0.98, 1.25) | 0.99 (0.86, 1.13) | 1.02 (0.97, 1.08) | 0.97 |
rs11953506 | RBM27 | 7.75E-04 | 1.23 (1.08, 1.40) | 0.86 (0.62, 1.21) | 1.04 (0.90, 1.19) | 1.03 (0.92, 1.15) | 1.10 (0.97, 1.24) | 0.97 (0.84, 1.12) | 1.02 (0.96, 1.08) | 0.89 |
rs11953090 | RBM27 | 1.12E-03 | 1.22 (1.07, 1.39) | 0.87 (0.62, 1.22) | 1.04 (0.90, 1.19) | 1.01 (0.91, 1.13) | 1.10 (0.97, 1.25) | 0.96 (0.84, 1.11) | 1.02 (0.96, 1.07) | 0.89 |
rs2304033 | RBM27 | 1.03E-03 | 1.22 (1.08, 1.39) | 0.87 (0.62, 1.22) | 1.03 (0.90, 1.19) | 1.01 (0.91, 1.13) | 1.10 (0.97, 1.25) | 0.96 (0.84, 1.11) | 1.02 (0.96, 1.07) | 0.87 |
rs13155119 | RBM27 | 1.17E-03 | 1.22 (1.07, 1.39) | 0.87 (0.62, 1.22) | 1.04 (0.91, 1.20) | 1.01 (0.91, 1.13) | 1.10 (0.97, 1.24) | 0.97 (0.84, 1.11) | 1.02 (0.96, 1.07) | 0.90 |
rs3829070 | SMARCA2 | 2.69E-03 | 0.76 (0.63, 0.92) | 0.72 (0.42, 1.24) | 0.91 (0.75, 1.11) | 0.93 (0.80, 1.09) | 1.15 (0.97, 1.36) | 0.93 (0.77, 1.13) | 0.95 (0.89, 1.02) | 0.69 |
rs3829072 | SMARCA2 | 2.17E-04 | 0.71 (0.58, 0.86) | 1.00 (0.61, 1.66) | 1.07 (0.89, 1.28) | 0.87 (0.75, 1.01)* | 1.09 (0.92, 1.29) | 0.84 (0.69, 1.01)* | 0.94 (0.89, 1.01) | 0.53 |
rs10964907 | SMARCA2 | 2.19E-03 | 0.74 (0.61, 0.91) | 0.88 (0.51, 1.52) | 1.00 (0.82, 1.22) | 0.88 (0.75, 1.04) | 1.10 (0.92, 1.31) | 0.85 (0.69, 1.04) | 0.94 (0.88, 1.01) | 0.73 |
rs870272 | MORN5 | 1.03E-03 | 0.83 (0.73, 0.93) | 1.04 (0.75, 1.44) | 0.95 (0.84, 1.08) | 0.97 (0.88, 1.07) | 0.95 (0.85, 1.06) | 0.89 (0.79, 1.01)* | 0.96 (0.92, 1.01) | 0.93 |
rs3745516 | SPIB | 1.60E-03 | 0.81 (0.70, 0.93) | 0.78 (0.54, 1.14) | 1.03 (0.89, 1.19) | 1.10 (0.98, 1.23) | 0.97 (0.85, 1.09) | 0.94 (0.82, 1.09) | 0.98 (0.93, 1.03) | 0.64 |
ap-Values are one-sided.
bn = number of cases.
cNA, data not available.
dn = 868 for rs4464229 and rs3863.
eHeterogeneity assessed using Cochran’s Q statistic (Cochran, 1954).
*Statistical significance at P < 0.05.
Discussion
Here we have applied a novel candidate SNP method to identify glioma risk alleles, taking advantage of ethnic group differences in glioma incidence. Specifically, we tested the hypothesis that genetic regions containing SNPs with extreme differences in allele frequencies across ethnicities harbor variants that drive the ethnic group variation in glioma incidence. Although in the present study no SNPs from our discovery set reached our predetermined significance thresholds, we identified 10 genes containing one or more SNPs with P < 0.01, and selected 20 SNPs across six genes for replication. However, no SNP was statistically significant in any of the replication sets after accounting for multiple comparisons, and no SNP was nominally significant in the meta-analysis.
There were some limitations of this study which may have prevented the detection of glioma-associated SNPs. While our allele frequency difference criteria for candidate SNPs were designed to be inclusive of SNPs that could be responsible for a meaningful risk difference across ethnic groups, it is possible that these criteria excluded SNPs that are, in fact, associated with glioma risk but did not meet our criteria. Additionally, given 692 cases and 3,992 controls in our discovery set, our power calculations demonstrate that for a moderately common putative risk allele (0.20 allele frequency), we had 80% power to detect an odds ratio as low as 1.46. Yet for a relatively rare risk allele (0.05 allele frequency), we had 80% power to detect an odds ratio no lower than 1.88. Thus, it is plausible that our set of candidate SNPs includes one or more variants with low to moderate association with glioma risk, but that we were underpowered to detect such an association. Furthermore, it should be noted that although we postulated that polymorphisms driving ethnic group incidence differences are common across glioma subtypes, it is possible that differences in the glioma subtype distribution in the discovery and replication sets impacted the replicability of our findings. The ability of our approach to detect SNPs that tag glioma-related haplotypes may also have been degraded by heterogeneity across ethnic groups in the haplotype that a given tagging SNP represents. Finally, we note that our study was unable to assess potential interaction effects between risk loci, gene-environment interactions, or the role of rare variants.
Although this study did not lead to the discovery of novel glioma-associated SNPs, it is noteworthy that our most strongly associated candidate SNP, rs879471, was in strong linkage disequilibrium with rs6010620 (D′ = 0.78 in HapMap CEU + TSI, data from Haploview version 4.2), a top hit from the Wrensch et al. (2009) GWAS. This suggests the successful identification of a haplotype that differs in frequency across ethnic groups and is related to glioma risk. In this respect the A allele of rs6010620, which is protective against glioma (OR = 0.68, 95% CI: 0.58–0.79), is considerably more common in East Asians than Whites (frequency of 0.697 vs. 0.228, respectively), but did not meet our strict allele frequency difference criteria because the frequency in Africans was 0.019. On the basis of a calculation similar to that presented in Figure 1, we would conclude that this SNP is sufficient to account for an incidence rate ratio of 1.34 for Whites relative to East Asians. Notably, another top hit from the same GWAS, rs1412829, has a risk allele C (OR = 1.39, 95% CI: 1.24–1.57) that is more common in Whites than East Asians or Africans but also did not meet our allele frequency criteria (frequency of 0.402, 0.104, and 0.009, respectively). Based on our calculations, this SNP can account for a White to East Asian incidence rate ratio of 1.22, and a White to African incidence rate ratio of 1.30. Thus, these two SNPs alone may account for a meaningful proportion of the observed inter-ethnic incidence rate ratios. None of the other five established glioma susceptibility loci contribute to the inter-ethnic incidence rate differences (Table 4).
Table 4.
SNP | Reference allele | Reference allele frequency |
Odds ratio (95% confidence interval)a | White-East Asian incidence rate ratio | White-African incidence rate ratio | ||
---|---|---|---|---|---|---|---|
White | East Asian | African | |||||
rs6010620 | A | 0.23 | 0.70 | 0.02 | 0.68 (0.58, 0.79)b | 1.34 | 0.90 |
rs1412829 | C | 0.40 | 0.10 | 0.01 | 1.39 (1.24, 1.57)b | 1.22 | 1.30 |
rs2736100 | G | 0.54 | 0.41 | 0.42 | 1.27 (1.19, 1.37)c | 1.05 | 1.05 |
rs4295627 | G | 0.15 | 0.24 | 0.16 | 1.36 (1.29, 1.43)c | 0.95 | 0.99 |
rs4977756 | G | 0.37 | 0.23 | 0.34 | 1.24 (1.19, 1.30)c | 1.06 | 1.01 |
rs498872 | T | 0.28 | 0.24 | 0.08 | 1.18 (1.13, 1.24)c | 1.01 | 1.07 |
rs2252586 | T | 0.28 | 0.02 | 0.32 | 1.18 (1.11, 1.25)d | 1.09 | 0.99 |
The candidate SNP approach used in this study provides a viable alternative to admixture mapping, which investigates the genetic makeup of recently admixed groups to localize disease-related variants (Patterson et al., 2004; Smith and O’Brien, 2005). In the context of glioma research, the admixture method would ideally be applied to a set of African-American cases and controls, where excess European ethnicity shared among African-American cases would be suggestive of genetic regions that may play a role in glioma risk. For conditions like hypertension, where prevalence differs widely between ethnicities and samples from an admixed group are available, the pairing of admixture mapping with GWAS has proven to be an effective means of identifying disease-related variants (Levy et al., 2009; Zhu et al., 2011). However, when samples from an admixed group are not available, we present our approach as an alternative complement to GWAS. Specifically, this approach to identify disease-causing variants is attractive where (1) the disease incidence differs substantially across two or more ethnic groups; (2) ethnic group differences tend to persist independent of geographic location, suggesting a genetic etiology; (3) there is an availability of cases and controls of uniform ethnicity; and (4) an admixture approach is not feasible given the unavailability of a sufficient number of appropriate admixed cases.
While we did not identify novel glioma susceptibility variants in this analysis, we conclude that the additional risk in White populations conferred by rs6010620 and rs1412829 lends support to our initial hypothesis, and provides an impetus for a larger discovery set and/or pursuing admixture mapping. Given the rarity of glioma among African-Americans and the resultant difficulties inherent in collecting enough African-American cases to perform an admixture mapping study, further application of our method may be the preferred approach.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
This study was supported by the Nancy Hildreth Memorial Summer Fellowship in Chronic Disease Epidemiology, Yale School of Public Health. Work at University of California, San Francisco was supported by the National Institutes of Health (grant numbers R01CA52689, P50CA097257, R25CA126831, and CA112355), as well as the National Brain Tumor Foundation, the UCSF Lewis Chair in Brain Tumor Research and by donations from families and friends of John Berardi, Helen Glaser, Elvera Olsen, Raymond E. Cooper, and William Martinusen. Assembly of the French cohort was supported by Ligue Nationale contre le Cancer and the Institut National du Cancer (INCA; PL 046). Computational work was supported by the Yale University Biomedical High Performance Computing Center and the National Institutes of Health (RR19895). We thank Drs. Josephine Hoh, Yuri Galachyants, and Alan Wu for helpful early conceptual discussions.
Footnotes
References
- Altshuler D. M., Gibbs R. A., Peltonen L., Dermitzakis E., Schaffner S. F., Yu F., et al. (2010). Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 10.1038/nature09298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrett J. C., Fry B., Maller J., Daly M. J. (2005). Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265 10.1093/bioinformatics/bti464 [DOI] [PubMed] [Google Scholar]
- Blankenberg D., Kuster G. V., Coraor N., Ananda G., Lazarus R., Mangan M., et al. (2001). Galaxy: a web-based genome analysis tool for experimentalists. Curr. Protoc. Mol. Biol. 89, 1–21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bondy M. L., Scheurer M. E., Malmer B., Barnholtz-Sloan J. S., Davis F. G., Il’Yasova D., et al. (2008). Brain tumor epidemiology: consensus from the Brain Tumor Epidemiology Consortium. Cancer 113, 1953–1968 10.1002/cncr.23741 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cardis E., Deltour I., Vrijheid M., Combalot E., Moissonnier M., Tardy H., et al. (2010). Brain tumour risk in relation to mobile telephone use: results of the INTERPHONE international case–control study. Int. J. Epidemiol. 39, 675–694 10.1093/ije/dyq079 [DOI] [PubMed] [Google Scholar]
- Chen P., Aldape K., Wiencke J. K., Kelsey K. T., Miike R., Davis R. L., et al. (2001). Ethnicity delineates different genetic pathways in malignant glioma. Cancer Res. 61, 3949–3954 [PubMed] [Google Scholar]
- Cochran W. G. (1954). The combination of estimates from different experiments. Biometrics 10, 101–129 10.2307/3001666 [DOI] [Google Scholar]
- Darefsky A. S., Dubrow R. (2009). International variation in the incidence of adult primary malignant neoplasms of the brain and central nervous system. Cancer Causes Control 20, 1593–1604 10.1007/s10552-009-9404-1 [DOI] [PubMed] [Google Scholar]
- Das A., Tan W.-L., Teo J., Smith D. R. (2002). Glioblastoma multiforme in an Asian population: evidence for a distinct genetic pathway. J. Neurooncol. 60, 117–125 10.1023/A:1020622415786 [DOI] [PubMed] [Google Scholar]
- Davis F., Il’Yasova D., Rankin K., McCarthy B., Bigner D. D. (2011). Medical diagnostic radiation exposures and risk of gliomas. Radiat. Res. 175, 790–796 10.1667/RR2186.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dubrow R., Darefsky A. S. (2011). Demographic variation in incidence of adult glioma by subtype, United States, 1992–2007. BMC Cancer 11, 325–325 10.1186/1471-2407-11-325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Efird J., Friedman G., Sidney S., Klatsky A., Habel L., Udaltsova N., et al. (2004). The risk for malignant primary adult-onset glioma in a large, multiethnic, managed-care cohort: cigarette smoking and other lifestyle behaviors. J. Neurooncol. 68, 57–69 10.1023/B:NEON.0000024746.87666.ed [DOI] [PubMed] [Google Scholar]
- Hercberg S., Galan P., Preziosi P., Bertrais S., Mennen L., Malvy D., et al. (2004). The SU.VI.MAX study: a randomized, placebo-controlled trial of the health effects of antioxidant vitamins and minerals. Arch. Intern. Med. 164, 2335–2342 10.1001/archinte.164.21.2335 [DOI] [PubMed] [Google Scholar]
- Holle R., Happich M., Löwel H., Wichmann H. E. (2005). KORA – a research platform for population based health research. Gesundheitswesen 67, 19–25 10.1055/s-2005-858235 [DOI] [PubMed] [Google Scholar]
- Hunter D. J., Kraft P., Jacobs K. B., Cox D. G., Yeager M., Hankinson S. E., et al. (2007). A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat. Genet. 39, 870–874 10.1038/ng2075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kohler B. A., Ward E., McCarthy B. J., Schymura M. J., Ries L. G., Eheman C., et al. (2011). Annual report to the nation on the status of cancer, 1975–2007, featuring tumors of the brain and other nervous system. J. Natl. Cancer Inst. 103, 1–23 10.1093/jnci/djq540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krawczak M., Nikolaus S., Von Eberstein H., Croucher P. J. P., El Mokhtari N. E., Schreiber S. (2006). PopGen: population-based recruitment of patients and controls for the analysis of complex genotype-phenotype relationships. Public Health Genomics 9, 55–61 [DOI] [PubMed] [Google Scholar]
- Lachance D. H., Yang P., Johnson D. R., Decker P. A., Kollmeyer T. M., McCoy L. S., et al. (2011). Associations of high-grade glioma with glioma risk alleles and histories of allergy and smoking. Am. J. Epidemiol. 174, 574–581 10.1093/aje/kwr124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levy D., Ehret G. B., Rice K., Verwoert G. C., Launer L. J., Dehghan A., et al. (2009). Genome-wide association study of blood pressure and hypertension. Nat. Genet. 41, 677–687 10.1038/ng.384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandelzweig L., Novikov I., Sadetzki S. (2009). Smoking and risk of glioma: a meta-analysis. Cancer Causes Control 20, 1927–1938 10.1007/s10552-009-9386-z [DOI] [PubMed] [Google Scholar]
- McLendon R., Friedman A., Bigner D., Meir E. G. V. (2008). Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455, 1061–1068 10.1038/nature07385 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mochizuki S., Iwadate Y., Namba H., Yoshida Y., Yamaura A., Sakiyama S., et al. (1999). Homozygous deletion of the p16/MTS-1/CDKN2 gene in malignant gliomas is infrequent among Japanese patients. Int. J. Oncol. 15, 983–992 [DOI] [PubMed] [Google Scholar]
- Ostrom Q. T., Barnholtz-Sloan J. S. (2011). Current state of our knowledge on brain tumor epidemiology. Curr. Neurol. Neurosci. Rep. 11, 329–335 10.1007/s11910-011-0189-8 [DOI] [PubMed] [Google Scholar]
- Patterson N., Hattangadi N., Lane B., Lohmueller K. E., Hafler D. A., Oksenberg J. R., et al. (2004). Methods for high-density admixture mapping of disease genes. Am. J. Hum. Genet. 74, 979–1000 10.1086/420871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Power C., Elliott J. (2006). Cohort profile: 1958 British birth cohort (National Child Development Study). Int. J. Epidemiol. 35, 34–41 10.1093/ije/dyi183 [DOI] [PubMed] [Google Scholar]
- Price A. L., Patterson N. J., Plenge R. M., Weinblatt M. E., Shadick N. A., Reich D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 10.1038/ng1847 [DOI] [PubMed] [Google Scholar]
- Sanson M., Hosking F. J., Shete S., Zelenika D., Dobbins S. E., Ma Y., et al. (2011). Chromosome 7p11.2 (EGFR) variation influences glioma risk. Hum. Mol. Genet. 20, 2897–2904 10.1093/hmg/ddr192 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmermund A., Möhlenkamp S., Stang A., Grönemeyer D., Seibel R., Hirche H., et al. (2002). Assessment of clinically silent atherosclerotic disease and established and novel risk factors for predicting myocardial infarction and cardiac death in healthy middle-aged subjects: rationale and design of the Heinz Nixdorf RECALL Study. Am. Heart J. 144, 212–218 10.1067/mhj.2002.123579 [DOI] [PubMed] [Google Scholar]
- Schoemaker M. J., Robertson L., Wigertz A., Jones M. E., Hosking F. J., Feychting M., et al. (2010). Interaction between 5 genetic variants and allergy in glioma risk. Am. J. Epidemiol. 171, 1165–1173 10.1093/aje/kwq075 [DOI] [PubMed] [Google Scholar]
- Shete S., Hosking F. J., Robertson L. B., Dobbins S. E., Sanson M., Malmer B., et al. (2009). Genome-wide association study identifies five susceptibility loci for glioma. Nat. Genet. 41, 899–904 10.1038/ng.407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith M. W., O’Brien S. J. (2005). Mapping by admixture linkage disequilibrium: advances, limitations and guidelines. Nat. Rev. Genet. 6, 623–632 10.1038/nrg1657 [DOI] [PubMed] [Google Scholar]
- Wichmann H. E., Gieger C., Illig T. (2005). KORA-Gen: resource for population genetics, controls and a broad spectrum of disease phenotypes. Gesundheitswesen 67, 26–30 10.1055/s-2005-858226 [DOI] [PubMed] [Google Scholar]
- Wiencke J. K., Aldape K., McMillan A., Wiemels J., Moghadassi M., Miike R., et al. (2005). Molecular features of adult glioma associated with patient race/ethnicity, age, and a polymorphism in O6-methylguanine-DNA-methyltransferase. Cancer Epidemiol. Biomarkers Prev. 14, 1774–1783 10.1158/1055-9965.EPI-05-0089 [DOI] [PubMed] [Google Scholar]
- Wrensch M., Jenkins R. B., Chang J. S., Yeh R.-F., Xiao Y., Decker P. A., et al. (2009). Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility. Nat. Genet. 41, 905–908 10.1038/ng.408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wrensch M., Yost M., Miike R., Lee G., Touchstone J. (1999). Adult glioma in relation to residential power frequency electromagnetic field exposures in the San Francisco Bay area. Epidemiology 10, 523–527 10.1097/00001648-199909000-00010 [DOI] [PubMed] [Google Scholar]
- Zhu X., Young J. H., Fox E., Keating B. J., Franceschini N., Kang S., et al. (2011). Combined admixture mapping and association analysis identifies a novel blood pressure genetic locus on 5p13: contributions from the CARe consortium. Hum. Mol. Genet. 20, 2285–2295 10.1093/hmg/ddr113 [DOI] [PMC free article] [PubMed] [Google Scholar]