To the Editors:
In a recent issue of the journal, the research article “Association of polymorphism in cytochrome P450 2C9 with susceptibility to head and neck cancer and treatment outcome”, by Yadav et al. (2014) reported that cytochrome P450 2C9 single nucleotide polymorphisms (SNPs), CYP2C9*2 (rs1799853) and CYP2C9*3 (rs1057910), showed a risk association with head and neck squamous cell carcinoma (HNSCC). They also reported the statistical interaction of these polymorphisms with tobacco smoking, tobacco chewing, and alcohol consumption. In this study, the author underscored several important points, however, failed to address several limitations.
The author reported significant deviation from the Hardy-Weinberg equilibrium (HWE) for controls for both polymorphisms (Yadav et al., 2014). The Hardy-Weinberg disequilibrium (HWD) in controls is a major limitation and results should be interpreted cautiously (Salanti et al., 2005). Use of the Cochran-Armitage test of trend has been suggested if there is a significant deviation (P-value < 0.05) from HWE (Lewis and Knight, 2012). HWE elucidates that in a large population, alleles segregate randomly (Lewis and Knight, 2012, Lunetta, 2008). Allele and genotype frequencies in the population do not change until there is selection, mutation, or migration. Departure from HWE in controls may be due to several issues, including cryptic population structure (Salanti et al., 2005, Lewis and Knight, 2012, Lunetta, 2008). It has been recommended to exclude SNPs which show significant deviation from HWE in controls to avoid spurious associations. However, the criterion of exclusion depends on several factors (Lunetta, 2008).
The Indian population is very heterogeneous, with several sub-population structures among ethnic groups existing across India (Basu et al., 2003, Indian Genomic Variation Consortium, 2008, Rich et al., 2009). The North Indian population is also not homogenous. Even pooling samples by state of origin can mask population substructure (Indian Genomic Variation Consortium, 2008). Counting them as a homogenous population will increase false-positive disease associations due to systematic ancestry difference between cases and controls (Indian Genomic Variation Consortium, 2008, Rich et al., 2009). In this study, the author did not account for population stratification as a major confounder. Population stratification can be avoided by classifying individuals into known subgroups (e.g., by ethnicity or caste) in cases and controls, to avoid confounding (Lewis and Knight, 2012, Lunetta, 2008). Most importantly, the results should be adjusted for any covariates related to population structure (Lunetta, 2008). Genomic controls can also be added to reduce confounding by alleles.
The second limitation of this study is the use of different genetic models (Fig. 1) (Lewis and Knight, 2012) for the same polymorphism. In this study, the author used the Cochran-Armitage test of trend (odds ratio per allele), as suggested in the case of HWE deviation for association analysis (Lewis and Knight, 2012), and the dominant model (allele positivity) for the interaction analysis, which may suggest data dredging (Smith, 2002). The genetic model should be same for all statistical analyses pertaining to the same polymorphism. For example, in this study, interaction analysis should be done by using the Cochran-Armitage test of trend as well. The dominant model is also not suggested in the case of HWD (Lewis and Knight, 2012). Another major issue is that data provided in the text do not match the provided tables. Odds ratios (ORs) and confidence interval (CI) in tables II and III have some ambiguity with the text.
Fig. 1.
Analysis methods in a case-control genetic association study used in Yadav et al. Cochran-Armitage test was used for association analysis and dominant model was used for interaction analysis.
The interaction analysis conducted in this study is actually a stratified analysis by exposure variables (exposure vs non-exposure) (Yadav et al., 2014). Stratified analysis is used to judge confounding or effect modification. Stratified analysis should also be adjusted for major confounders to avoid false-positive associations. It is not clear whether there is any statistical adjustment for major confounders. To compare the two strata (exposure vs non-exposure), an overall analysis combining the ORs for exposure and non-exposure for the dominant model is necessary for any interpretation in this study.
| (1) |
In the case of effect modification, stratified analysis gives different ORs for each stratum. A statistically significant difference (P-value < 0.05) between these two ORs suggests statistical interaction (ORinteraction) (Table 1) (Liu et al., 2004, Clayton and McKeigue, 2001) and can easily be done by adding an interaction term in a logistic regression model (Kraft et al., 2007). Here G denotes genotype, E denotes environmental variable and G × E denotes interaction term. Kraft et al. (2007) have elucidated a detailed explanation of gene-environment interaction in genetic association studies. It is not pragmatic to claim an interaction without proper statistical interaction analysis.
| (2) |
Table 1.
Complete frequency data for cases and controls for interaction analysis.
| Disease Status | Genotype present |
Genotype absent |
||
|---|---|---|---|---|
| Exposed | Unexposed | Exposed | Unexposed | |
| Case | a | b | e | f |
| Control | c | d | g | h |
| Odds ratio | ORGE = ah/cf | ORG = bh/df | ORE = eh/gf | OR00 = 1 (ref) |
In any genetic association study, cautious interpretation is pivotal to avoid spurious associations. Any methodological error during the study design or statistical analysis may introduce bias which may reduce the validity of the study (Salanti et al., 2005, Price et al., 2006), and of course replication failure. HWD raises several questions for additional thinking about a study. Statistical adjustment for sub-population and genomic controls are not a panacea, but play a critical role to reduce type I error.
References
- Basu A. Ethnic India: a genomic view, with special reference to peopling and structure. Genome Res. 2003;13:2277–2290. doi: 10.1101/gr.1413403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clayton D., McKeigue P.M. Epidemiological methods for studying genes and environmental factors in complex diseases. Lancet. 2001;358:1356–1360. doi: 10.1016/S0140-6736(01)06418-2. [DOI] [PubMed] [Google Scholar]
- Indian Genomic Variation Consortium Genetic landscape of the people of India: a canvas for disease gene exploration. J. Genet. 2008;87(3–20) doi: 10.1007/s12041-008-0002-x. [DOI] [PubMed] [Google Scholar]
- Kraft P. Exploiting gene-environment interaction to detect genetic associations. Hum. Hered. 2007;63:111–119. doi: 10.1159/000099183. [DOI] [PubMed] [Google Scholar]
- Lewis C.M., Knight J. Introduction to genetic association studies. Cold Spring Harb. Protoc. 2012;2012(3):297–306. doi: 10.1101/pdb.top068163. [DOI] [PubMed] [Google Scholar]
- Liu X. Genetic dissection methods: designs used for tests of gene-environment interaction. Curr. Opin. Genet. Dev. 2004;14:241–245. doi: 10.1016/j.gde.2004.04.011. [DOI] [PubMed] [Google Scholar]
- Lunetta K.L. Genetic association studies. Circulation. 2008;118:96–101. doi: 10.1161/CIRCULATIONAHA.107.700401. [DOI] [PubMed] [Google Scholar]
- Price A.L. Principal component analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006;38(8):904–909. doi: 10.1038/ng1847. [DOI] [PubMed] [Google Scholar]
- Rich D. Reconstructing Indian population history. Nature. 2009;461:489–494. doi: 10.1038/nature08365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salanti G. Hardy-Weinberg equilibrium in genetic association studies: an empirical evaluation of reporting, deviations and power. Eur. J. Hum. Genet. 2005;13:840–848. doi: 10.1038/sj.ejhg.5201410. [DOI] [PubMed] [Google Scholar]
- Smith G.D. Data dredging, bias, or confounding. BMJ. 2002;325(7378):1437–1438. doi: 10.1136/bmj.325.7378.1437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yadav S.S. Association of polymorphism in cytochrome P450 2C9 with susceptibility to head and neck cancer and treatment outcome. App. Tran. Gen. 2014;3:8–13. doi: 10.1016/j.atg.2013.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

