Abstract
Introduction
Allelic frequency distribution of drug metabolizing enzyme genes among populations is important to identify risk groups for adverse drug reaction and to select representative populations for clinical trials. Although India emerged as an important hub for clinical trials, information about the pharmacogenetic diversity for this region is still lacking. Here, we investigated genetic diversity of cytochrome-P450-2C9 (CYP2C9) gene which metabolizes wide range of drugs and is highly expressed in the human liver.
Methods
In total, 1278 individuals from 36 diverse Indian populations, 210 individuals from in-house data-repository and 489 other South Asian samples from the 1000 Genomes Project were selected. Variants observed in CYP2C9 gene were subjected to various statistical analyses.
Results
High frequency of CYP2C9*3 (~13%) and CYP2C9*3/*3 (~1%) was observed among South Asians, compared to 21 populations living outside the Indian subcontinent. The allelic/genotypic frequency does not correlate with geographical location or linguistic affiliation, except populations speaking Tibeto-Burmans language, who have lower frequency of CYP2C9*3 and CYP2C9*3/*3. Since, South Asians practice strict endogamy, presence of unique mutation and high frequency of homozygous genotypes not surprising. CYP2C9*3 has been associated with therapeutic response.The effect of CYP2C9*3/*3 is more pronounced compared to heterozygous and wild type homozygous genotypes as evident in many in vitro studies. As South Asians have high frequency, it would be interesting to explore potential of CYP2C9*3 as a marker for personalized therapy. Our study revealed several rare functional variants, which form eight novel and rare haplotypes of CYP2C9 (CYP2C9*63–*70). Of which, CYP2C9*64, *65, *66, *68, *69 and *70 haplotypes are South Asian-specific.
Conclusion
Overall, we find high genetic heterogeneity within South Asians and identified South Asian-specific putative functional CYP2C9 haplotypes. High frequency of CYP2C9*3 and CYP2C9*3/*3 was observed in South Asian populations. Taken together, current study greatly enriches the knowledge of naturally occurring CYP2C9 variants and its diversity in South Asia, which are relevant to further CYP2C9-related functional research and for personalized medicine.
Keywords: pharmacogenetics, CYP2C9, South Asians, genetic diversity
Introduction
Heterogeneous drug response is the major hurdle in the successful treatment of diseases, which is due to genetic variations in the drug metabolizing enzyme genes. Knowledge of allelic frequency distribution of drug metabolizing enzymes within populations can be useful to identify risk groups for adverse drug reaction and to optimize drug doses. It can be utilized to select representative populations in clinical trials. The cytochrome P450 (CYP) family is an important enzyme of ADME (related to absorption, distribution, metabolism and excretion of drug) genes, of which CYP2C9 is the major constituent of CYP2C subfamily in the human liver. It metabolizes a wide range of drugs including anticoagulant (warfarin), nonsteroidal anti-inflammatory (celecoxib, diclofenac), antidiabetic (nateglinide, tolbutamide), antihypertensive (irbesartan, losartan) and anti-epileptic (phenytoin).1 Several variations in CYP2C9 have been reported, which affect metabolism of the drug. Most notable variations are CYP2C9*2 (R144C) and CYP2C9*3 (I359L), which significantly decreases enzyme activity.2 Interestingly, these variations are highly heterogeneous among world population; (1) 8–19% and 3.3–16.3% in Caucasian; (2) 0–0.1% and 1.1–3.6% in Asian; (3) 2.9% and 2.0% in African-American; and (4) 0–4.3% and 0–2.3% in Black/African, respectively.3 In addition, other rare and functionally relevant variations were also reported in various populations, which includes; (1) CYP2C9*6, 0.6% frequency in African-Americans;4 (2) CYP2C9*4, 0.5% in African-Americans and 6% in Caucasians;2,5 and (3) CYP2C9*13, 0.19–0.45% in Asian.6 Dai et al reported several rare variants in the Han Chinese population.7
Several studies have been performed on CYP2C9 in Indian populations. However, most of studies have focused only on CYP2C9*3 and CYP2C9*2 variants. Grik et al observed CYP2C9*3 only in the Indo-European population (0.38–1.85%), whereas it was absent in Dravidian, Austroasiatic and Tibeto-Burman populations.8 Indian populations are well known for their genetic diversity and practice of endogamy, hence they are expected to have high frequency of homozygous allele9. Many studies have shown that the variations in CYP2C9 are associated with therapeutic heterogeneity in Indian populations. CYP2C9*2 and *3 has been reported with less hydroxylation (or metabolism) of phenytoin in vivo in South Indian populations,10 compared to wild type CYP2C9*1. Ramasamy et al reported phenytoin toxicity in a patient with normal dose of 300 mg/day, who had CYP2C9*3/*3 genotype.11 The same symptoms were also reported by Thakkar et al in South Indian populations.12 Both of these drugs are metabolized by CYP2C9. Some of the drugs, metabolized by CYP2C9 have narrow therapeutic index eg warfarin, phenytoin, and tolbutamide. This is the reason that small change in the metabolizing activity of CYP2C9 may cause major changes in an individual’s response against a drug. Considering this, we explored genetic diversity of functionally relevant variations of CYP2C9 within the Indian subcontinent and compared with other world populations. The outcome of this study may be useful to understand heterogeneous therapeutic response and development of personalized therapy for the populations of Indian subcontinent. Moreover, identification of South Asian-specific putative functional variants and associated haplotypes will open opportunity for further study.
Materials and Methods
Details of Samples
A total of 1278 samples from 36 diverse Indian populations, in terms of ethnicity, linguistic and geographical locations, were included in this study (Table 1).9,13 Furthermore, 210 samples of South Asian origin were selected from our collection of whole genome/exome datasets. For comparison, 489 and 598 samples of South Asian origin were selected from the 1000 Genomes Project and GenomeAsia 100K Project, respectively.14,15 This work has been approved by the Institutional Ethical Committee of CSIR-Centre for Cellular and Molecular Biology (CSIR-CCMB), Hyderabad, India. Informed written consent has been obtained from all the participants. The present study is conducted in accordance with the Declaration of Helsinki.
Table 1.
Source | Populations | State/Geographical Region | Latitude | Longitude | Linguistic | Sample Size | Missing Data (%) | Allele Frequency | HWE p-value | |||
---|---|---|---|---|---|---|---|---|---|---|---|---|
A (*1) | C (*3) | |||||||||||
Count | Frequency (95%CI)a | Count | Frequency (95%CI)a | |||||||||
Current study | Mahli | Jharkhand | 85 | 23.46 | Austroasiatic | 38 | 4 (10.526) | 60 | 0.882 (0.781–0.948) | 8 | 0.118 (0.052–0.219) | 0.382 |
Gond | Chattisgarh | 81.6 | 19.87 | Austroasiatic | 37 | 8 (21.622) | 56 | 0.966 (0.881–0.996) | 2 | 0.034 (0.004–0.119) | 7.24×10−8* | |
Kharia | Chattisgarh | 85.44 | 23.33 | Austroasiatic | 86 | 14 (16.279) | 134 | 0.931 (0.876–0.966) | 10 | 0.069 (0.034–0.124) | 0.527 | |
Gond | Madhya Pradesh | 77.4 | 26.12 | Austroasiatic | 38 | 7 (18.421) | 56 | 0.903 (0.801–0.964) | 6 | 0.097 (0.036–0.199) | 0.551 | |
Ho | Jharkhand | 85.33 | 23.35 | Austroasiatic | 67 | 2 (2.985) | 110 | 0.846 (0.772–0.903) | 20 | 0.154 (0.097–0.228) | 0.660 | |
Kolhas | Andhra Pradesh | 79.98 | 14.46 | Dravidian | 14 | 2 (14.286) | 22 | 0.917 (0.73–0.99) | 2 | 0.083 (0.01–0.27) | 0.752 | |
Adi Dravidar | Tamil Nadu | 77.73 | 11.35 | Dravidian | 15 | 1 (6.667) | 23 | 0.821 (0.631–0.939) | 5 | 0.179 (0.061–0.369) | 0.416 | |
Telaga | Andhra Pradesh | 83.53 | 18.17 | Dravidian | 12 | 0 (0) | 18 | 0.75 (0.533–0.902) | 6 | 0.25 (0.098–0.467) | 5.32×10−4* | |
Thoti | Andhra Pradesh | 80.64 | 16.51 | Dravidian | 29 | 0 (0) | 40 | 0.69 (0.555–0.805) | 18 | 0.31 (0.195–0.445) | 7.24×10−8* | |
Naidu | Andhra Pradesh | 79.6 | 13.22 | Dravidian | 21 | 11 (52.381) | 19 | 0.95 (0.751–0.999) | 1 | 0.05 (0.001–0.249) | 0.868 | |
Reddy | Andhra Pradesh | 78.48 | 17.37 | Dravidian | 24 | 1 (4.167) | 40 | 0.87 (0.737–0.951) | 6 | 0.13 (0.049–0.263) | 0.472 | |
Mudaliar | Tamil Nadu | 79.13 | 12.92 | Dravidian | 48 | 3 (6.25) | 82 | 0.911 (0.832–0.961) | 8 | 0.089 (0.039–0.168) | 1.97×10−11* | |
Gammavokklu | Karnataka | 74.83 | 12.93 | Dravidian | 19 | 4 (21.053) | 28 | 0.933 (0.779–0.992) | 2 | 0.067 (0.008–0.221) | 0.782 | |
Vysya | Andhra Pradesh | 77.65 | 14.68 | Dravidian | 60 | 10 (16.667) | 90 | 0.9 (0.824–0.951) | 10 | 0.1 (0.049–0.176) | 0.432 | |
Gawli | Karnataka | 74.77 | 13.33 | Dravidian | 89 | 10 (11.236) | 136 | 0.861 (0.797–0.911) | 22 | 0.139 (0.089–0.203) | 2.33×10−5* | |
Medari | Andhra Pradesh | 80.61 | 16.56 | Dravidian | 4 | 0 (0) | 7 | 0.875 (0.473–0.997) | 1 | 0.125 (0.003–0.527) | 0.775 | |
Madar | Karnataka | 75.05 | 15.33 | Dravidian | 70 | 9 (12.857) | 111 | 0.91 (0.844–0.954) | 11 | 0.09 (0.046–0.156) | 2.07×10−12* | |
Patkar | Andhra Pradesh | 78.1 | 15.8 | Dravidian | 20 | 1 (5) | 24 | 0.632 (0.46–0.782) | 14 | 0.368 (0.218–0.54) | 1.3×10−5* | |
Raj-Gond | Madhya Pradesh | 78.7 | 23.87 | Dravidian | 28 | 19 (67.857) | 18 | 1 (0.815–1) | 0 | 0 (0–0.185) | 1 | |
Adhiyan | Tamil Nadu | 79.41 | 13.72 | Dravidian | 44 | 4 (9.091) | 77 | 0.963 (0.894–0.992) | 3 | 0.038 (0.008–0.106) | 0.805 | |
Kurumba | Tamil Nadu | 79.09 | 12.94 | Dravidian | 15 | 2 (13.333) | 24 | 0.923 (0.749–0.991) | 2 | 0.077 (0.009–0.251) | 0.764 | |
Chenchu | Andhra Pradesh | 78.47 | 17.37 | Dravidian | 27 | 2 (7.407) | 34 | 0.68 (0.533–0.805) | 16 | 0.32 (0.195–0.467) | 5.73×10−7* | |
Kurumba | Madhya Pradesh | 75.83 | 22.71 | Dravidian | 26 | 6 (23.077) | 28 | 0.7 (0.535–0.834) | 12 | 0.3 (0.166–0.465) | 7.74×10−6* | |
Vaddera | Andhra Pra desh | 79.48 | 18.72 | Dravidian | 8 | 0 (0) | 10 | 0.625 (0.354–0.848) | 6 | 0.375 (0.152–0.646) | 4.67×10−3* | |
Brahmin-Tiwari | Uttar Pradesh | 82.68 | 25.73 | Indo-European | 44 | 13 (29.545) | 59 | 0.952 (0.865–0.99) | 3 | 0.048 (0.01–0.135) | 0.777 | |
Kashmiri pandit | Jammu and Kashmir | 75.83 | 34.37 | Indo-European | 21 | 0 (0) | 37 | 0.881 (0.744–0.96) | 5 | 0.119 (0.04–0.256) | 0.144 | |
Bhil | Gujarat | 72.67 | 23.03 | Indo-European | 4 | 0 (0) | 8 | 1 (0.631–1) | 0 | 0 (0–0.369) | 1 | |
Gamit | Gujrat | 72.83 | 21.17 | Indo-European | 45 | 7 (15.556) | 73 | 0.961 (0.889–0.992) | 3 | 0.039 (0.008–0.111) | 0.8 | |
Tharu | Uttarakhand | 79.5 | 29.38 | Indo-European | 30 | 3 (10) | 49 | 0.907 (0.797–0.969) | 5 | 0.093 (0.031–0.203) | 0.078 | |
Warli | Maharastra | 72.95 | 19.17 | Indo-European | 70 | 7 (10) | 111 | 0.881 (0.811–0.932) | 15 | 0.119 (0.068–0.189) | 0.283 | |
Baiswar | Uttar Pradesh | 82.6 | 25.15 | Indo-European | 40 | 6 (15) | 57 | 0.838 (0.729–0.916) | 11 | 0.162 (0.084–0.271) | 0.260 | |
Pandit | Haryana | 76.87 | 29.96 | Indo-European | 40 | 12 (30) | 47 | 0.839 (0.717–0.924) | 9 | 0.161 (0.076–0.283) | 4.41×10−6* | |
Bhilala | Madhya Pradesh | 75.3 | 22.6 | Indo-European | 49 | 9 (18.367) | 71 | 0.888 (0.797–0.947) | 9 | 0.113 (0.053–0.203) | 0.018 | |
Chakhesang_Naga | Nagaland | 94.48 | 26.12 | Tibeto-Burman | 33 | 19 (57.576) | 28 | 1 (0.877–1) | 0 | 0 (0–0.123) | 1 | |
Naga-sema | Nagaland | 93.81 | 25.7 | Tibeto-Burman | 40 | 21 (52.5) | 35 | 0.921 (0.786–0.983) | 3 | 0.079 (0.017–0.214) | 0.708 | |
Mizo | Mizoram | 92.83 | 23.2 | Tibeto-Burman | 23 | 7 (30.435) | 29 | 0.906 (0.75–0.98) | 3 | 0.094 (0.02–0.25) | 0.679 | |
Total | South Asia | 79.51 | 23.66 | - | 1278 | 224 (17.53) | 1265b | 0.905 (0.888–0.92)b | 133b | 0.095 (0.802–0.112)b | 0.767b | |
South Asians (NGS data repository) |
South Asians | 79.51 | 23.66 | - | 210 | – | 360 | 0.86 (0.82–0.89) | 60 | 0.14 (0.111–0.180) | 0.58 | |
1000 Genomes Project |
ACB | Africa | −59.61 | 13.19 | - | 96 | – | 191 | 0.995 (0.971–1) | 1 | 0.005 (0–0.029) | 0.959 |
ASW | Africa | −88.62 | 36.07 | - | 61 | – | 120 | 0.984 (0.942–0.998) | 2 | 0.016 (0.002–0.058) | 0.896 | |
ESN | Africa | 3.33 | 6.53 | - | 99 | – | 198 | 1 (0.982–1) | 0 | 0 (0–0.018) | 1 | |
GWD | Africa | −15.87 | 13.43 | - | 113 | – | 226 | 1 (0.984–1) | 0 | 0 (0–0.016) | 1 | |
LWK | Africa | 34.76 | 0.60 | - | 99 | – | 198 | 1 (0.982–1) | 0 | 0 (0–0.018) | 1 | |
MSL | Africa | −12.91 | 8.45 | - | 85 | – | 170 | 1 (0.979–1) | 0 | 0 (0–0.021) | 1 | |
YRI | Africa | 3.83 | 7.42 | - | 108 | – | 216 | 1 (0.983–1) | 0 | 0 (0–0.017) | 1 | |
CLM | America | −75.67 | 6.27 | - | 94 | – | 176 | 0.936 (0.891–0.967) | 12 | 0.064 (0.033–0.109) | 0.509 | |
MXL | America | −99.08 | 19.30 | - | 64 | – | 125 | 0.977 (0.933–0.995) | 3 | 0.023 (0.005–0.067) | 0.848 | |
PEL | America | −77.06 | −12.06 | - | 85 | – | 168 | 0.988 (0.958–0.999) | 2 | 0.012 (0.001–0.042) | 0.913 | |
PUR | America | −66.91 | 18.20 | - | 104 | – | 199 | 0.957 (0.919–0.980) | 9 | 0.043 (0.020–0.081) | 0.645 | |
CDX | East Asian | 100.67 | 21.98 | - | 93 | – | 181 | 0.973 (0.938–0.991) | 5 | 0.027 (0.009–0.062) | 0.79 | |
CHB | East Asian | 116.12 | 39.94 | - | 103 | – | 198 | 0.961 (0.925–0.983) | 8 | 0.039 (0.017–0.075) | 0.682 | |
CHS | East Asian | 109.81 | 26.67 | - | 105 | – | 200 | 0.952 (0.914–0.977) | 10 | 0.048 (0.023–0.086) | 0.101 | |
JPT | East Asian | 139.57 | 35.67 | - | 104 | – | 204 | 0.981 (0.951–0.995) | 4 | 0.019 (0.005–0.049) | 0.842 | |
KHV | East Asian | 106.41 | 10.77 | - | 99 | – | 191 | 0.965 (0.929–0.986) | 7 | 0.035 (0.014–0.071) | 0.715 | |
CEU | European | 3.42 | 46.72 | - | 99 | – | 185 | 0.934 (0.890–0.965) | 13 | 0.066 (0.035–0.110) | 0.484 | |
FIN | European | 24.97 | 60.15 | - | 99 | – | 187 | 0.944 (0.903–0.972) | 11 | 0.056 (0.028–0.097) | 0.558 | |
GBR | European | −0.16 | 51.49 | - | 91 | – | 169 | 0.929 (0.881–0.961) | 13 | 0.071 (0.039–0.119) | 0.463 | |
IBS | European | −3.82 | 40.44 | - | 107 | – | 196 | 0.916 (0.870–0.949) | 18 | 0.084 (0.051–0.130) | 0.342 | |
TSI | European | 12.48 | 41.94 | - | 107 | – | 196 | 0.916 (0.870–0.949) | 18 | 0.084 (0.051–0.130) | 0.760 | |
BEB | South Asia | 90.39 | 23.65 | Indo-European | 86 | – | 152 | 0.884 (0.826–0.928) | 20 | 0.116 (0.072–0.174) | 0.222 | |
GIH | South Asia | 72.44 | 23.02 | Indo-European | 103 | – | 179 | 0.869 (0.815–0.912) | 27 | 0.131 (0.088–0.185) | 0.506 | |
ITU | South Asia | 78.48 | 17.39 | Dravidian | 102 | – | 183 | 0.897 (0.847–0.935) | 21 | 0.103 (0.065–0.153) | 0.931 | |
PJL | South Asia | 72.91 | 33.67 | Indo-European | 96 | – | 173 | 0.901 (0.850–0.939) | 19 | 0.099 (0.061–0.150) | 0.945 | |
STU | South Asia | 79.86 | 6.92 | Dravidian | 102 | –- | 184 | 0.902 (0.853–0.939) | 20 | 0.098 (0.061–0.147) | 0.272 | |
GA projectc | South Asians | South Asia | 73.45 | 27.36 | NA | 1448 | – | 1290 | 0.891 (0.874–0.907) | 158 | 0.109 (0.094–0.126) | 0.598 |
Notes: aConfidence interval was calculated using Clopper–Pearson method. bComputed after removing samples which were not in HWE. cGenomeAsia 100K project. *P-values for those populations which were not in Hardy–Weinberg equilibrium.
Sample Collection and DNA Isolation
Ten milliliter intravenous blood samples of subjects were collected in an EDTA vacutainer, after obtaining informed written consent. Genomic DNA was extracted from whole blood, using the protocol described previously.16 These steps were followed for all samples which were subjected to either Sanger sequencing or next-generation sequencing (exome/genome).
Re-sequencing of CYP2C9, Genotyping and Analysis
All the nine exons, their respective intron-exon boundary, 3ʹ and 5ʹ UTR of CYP2C9 have been re-sequenced. For designing of primer, DNA sequence of ENST00000260682 from Ensembl (v75) has been used. Out of 3 mRNA of CYP2C9, only ENST00000260682 translate to protein. Primer3.0 web-based tool (http://simgene.com/Primer3) was used for designing the primers and further primers specificity were checked with NCBI-primer blast. The details of primer sequences are given in Supplementary Table 1. Polymerase chain reaction (PCR) was performed in 10.0 µL volume, which contains 5.0 µL of 2⨰ EmeraldAmp GT PCR master mix, 10.0 ng of genomic DNA and 0.1 p mole (final concentration) of each primer. Thermal cycling conditions used are as follows: initial denaturation step of five minutes at 94°C, followed by 35 cycles of denaturation step of 30 seconds. at 94°C, annealing step of 30 seconds. at 55°C, extension step of two minutes at 72°C, followed by single step of final extension of seven minutes at 72°C. PCR products were cleaned with Exo-SAP-IT (USB, Affymetrix, USA) with recommended protocol of the manufacturer. Cleaned PCR products (1.0 µL) were subjected to sequencing using BigDye terminator (v3.1) cycle sequencing kit (Thermo Fisher Scientific, USA) and analyzed using ABI 3730XL DNA Analyzer. Sequences were edited and assembled using AutoAssembler (v1.0) software. Statistical analysis was performed using R packages. Gap package was used to calculate HWE equilibrium. The 95% confidence interval of allelic and genotypic percentage was calculated with Clopper–Pearson and Sison–Glanz method using DescTools package of R. Surfer trial version (18.1.186) was used to interpolate frequency spectrum with Kriging gridding method and plots were generated using maps and spaMM package of R.
Next-Generation Sequencing (NGS)
For whole genome and exome sequencing, libraries were prepared as per manufacturer’s protocol using Illumina Nextera DNA Flex Library Prep kit and Illumina TrueSeq DNA LP for enrichment kit, respectively. Sequencing of above library was performed on Illumina NovaSeq 6000 system. On an average of 30⨰ and 100⨰ coverage was generated for the whole genome and exome, respectively.
Variants Calling, Annotation and Phasing
The sequencing data from all the samples was trimmed for adapters using Cutadapt (v2.7). The whole-genome datasets were aligned and processed to call variants using the pipeline of DRAGEN (v3.6.3), a Bio-IT platform for genome sequence data analysis. In case of whole-exome datasets, reads were aligned using the BWA tool (v0.7.10) and variants were called using the recommended pipeline of GATK4. The human reference genome version GRCh38 was used for the alignments of reads. The BCF tool was used to extract variants present in the CYP2C9. In the next step, all VCF files were combined with option “CombineGVCFs“ of GATK. Variants were annotated using “Variant Effect Predictor” tool of Ensembl (v95.3). For phasing of the variants, “PopgenPipeline Platform” (PPP) was used with PHASE algorithm of BEAGLE. Novel haplotypes obtained in the current study are deposited to PharmVar (https://www.pharmvar.org/).
Results
Diversity of CYP2C9*3 in Indian Populations
The A>C (rs1057910/CYP2C9*3) is a non-synonymous mutation, which replace isoleucine with leucine (ATT>CTT; Ile359Leu) and decreases enzyme activity. To explore the “C” allele frequency in Indian populations, initially we confirmed Hardy–Weinberg equilibrium (HWE). It was observed that 11 populations were not in HWE (p-value <0.01), which include one Indo-European population, Haryana Pandit (p-value=4.41×10−6), one Austroasiatic, Gond (p-value=7.24×10−8) and nine Dravidian populations; Mudaliar and Nadar from Tamil Nadu (p-value=1.97×10−11 and 2.07×10−12, respectively), Gawali from Karnataka (p-value=2.33×10−5), Kurumba from Kerala (p-value=7.74×10−6) and Thoti, Chenchu, Patkar and Vaddera from Andhra Pradesh (p-value=5.32×10−4, 7.24×10−8, 5.73×10−7, 1.3×10−5 and 4.67×10−3, respectively) (Table 1).
Initially, we excluded those samples, which were not in HWE and estimated 9.51% (133 out of 1398) “C” allele in Indian populations, similar (p-value=0.286 and 0.2425) to South Asian populations of the 1000 Genomes Project (107 out of 978) and the GenomeAsia 100K Project (158 out of 1448) (Figure 1A). Further, we categorized samples on the basis of their linguistic affiliation and observed that Tibeto-Burman have lowest percentage of “C” allele (6.12%; 6 out of 98). Moreover, we observed 9.82% (44 out of 448), 8.41% (32 out of 380) and 9.88% (51 out of 516) of “C” allele frequency in Austro-Asiatic, Dravidian and Indo-European populations, respectively (Table 1). Interestingly, Tibeto-Burmans are insignificantly different (p-value=0.1127) from East Asians (27 out of 1001). Adi Dravidiars (scheduled caste) of Tamil Nadu, Ho (scheduled tribe) of Jharkhand and Baiswar (caste) of Uttar Pradesh have 17.857%, 15.385% and 16.176% of CYP2C9*3, respectively, which are higher in their respective linguistic group; while “C” allele is completely absent in Bhil of Gujarat, Raj-Gond of Madhya Pradesh and Chakesang Naga of Nagaland (Table 1). Our findings suggest that a high level of local heterogeneity exists in Indian subcontinent and we did not find any correlation with geographical distance (Figure 1B and Table 1). It is evident in the allele frequency map that Indian populations have a high frequency of CYP2C9*3, compared to other world populations (Figure 1A and Table 1). We observed a decreasing gradient of “C” allele frequency from the Indian subcontinent to Europeans (Figure 1A).
On the basis of founder events and longtime practice of endogamy, we have already predicted a high frequency of homozygous alleles in Indian populations.9,17 Since CYP2C9*3/*3 significantly decreases metabolic activity of enzymes compared to both CYP2C9*1/*3 and CYP2C9*1/*1, it would be interesting to explore genotype frequencies also in Indian populations. As expected, we observed a higher percentage (<5%) of CYP2C9*3/*3 among Indians, comparative to other world populations, who have 0–1% (Figure 1C and Table 2). Out of 21 populations of the 1000 Genomes Project, who lived outside the Indian subcontinent, only TSI (Italian populations) and CHS (South Chinese populations) have homozygous genotype (0.9 and 1%), while out of five populations who are living in the Indian subcontinent, three (PJL, ITU, and GIH) have 1% of CYP2C9*3/*3 (Table 2). Moreover, 1.25% South Asian samples of the GenomeAsia 100K project, were homozygous for the CYP2C9*3 allele. In the present study, we observed 0–5% CYP2C9*3/*3, of which Bhilala of Madhya Pradesh and Ho of Jharkhand have 5% and 3%, respectively; higher in Indo-Europeans and Austro-Asiatic linguistic groups (Table 2 and Figure 1D). We did not observe homozygous genotype CYP2C9*3/*3 in Tibeto-Burman as well as in Dravidian populations after excluding the populations, which were not in HWE (Figure 1D). In the NGS data repository, “C” allele was observed in 14.28% (60 out of 420). Out of 210 subjects, five (2.39%) and 50 (23.81%) were homozygous and heterozygous for the “C” allele, respectively.
Table 2.
Source | Populations | State/Geographical Region | Lat. | Long. | Linguistic | Sample Size | Missing Data | Genotype Frequency | HWE p-value | |||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AA (*1/*1) | AC (*1/*3) | CC (*3/*3) | ||||||||||||
Count | Frequency (95%CI)a | Count | Frequency (95% CI)a | Count | Frequency (95%CI)a | |||||||||
Current study | Mahli | Jharkhand | 85 | 23.46 | Austroasiatic | 38 | 4 (10.526) | 27 | 0.794 (0.676–0.918) | 6 | 0.176 (0.059–0.3) | 1 | 0.029 (0–0.153) | 0.382 |
Gond | Chattisgarh | 81.6 | 19.87 | Austroasiatic | 37 | 8 21.622) | 28 | 0.966 (0.931–1) | 0 | 0 (0–0.058) | 1 | 0.034 (0–0.093) | 7.24×10−8* | |
Kharia | Chattisgarh | 85.44 | 23.33 | Austroasiatic | 86 | 14 (16.279) | 62 | 0.861 (0.792–0.937) | 10 | 0.139 (0.069–0.215) | 0 | 0 (0–0.076) | 0.527 | |
Gond | Madhya Pradesh | 77.4 | 26.12 | Austroasiatic | 38 | 7 (18.421) | 25 | 0.806 (0.71–0.958) | 6 | 0.194 (0.097–0.345) | 0 | 0 (0–0.151) | 0.551 | |
Ho | Jharkhand | 85.33 | 23.35 | Austroasiatic | 67 | 2 (2.985) | 47 | 0.723 (0.631–0.839) | 16 | 0.246 (0.154–0.362) | 2 | 0.031 (0–0.146) | 0.660 | |
Kolhas | Andhra Pradesh | 79.98 | 14.46 | Dravidian | 14 | 2 (14.286) | 10 | 0.833 (0.75–1) | 2 | 0.167 (0.083–0.409) | 0 | 0 (0–0.242) | 0.752 | |
Adi Dravidar | Tamil Nadu | 77.73 | 11.35 | Dravidian | 15 | 1 (6.667) | 9 | 0.643 (0.429–0.878) | 5 | 0.357 (0.143–0.593) | 0 | 0 (0–0.236) | 0.416 | |
Telaga | Andhra Pradesh | 83.53 | 18.17 | Dravidian | 12 | 0 (0) | 9 | 0.75 (0.583–1) | 0 | 0 (0–0.259) | 3 | 0.25 (0.083–0.509) | 5.32×10−4* | |
Thoti | Andhra Pradesh | 80.64 | 16.51 | Dravidian | 29 | 0 (0) | 20 | 0.69 (0.552–0.869) | 0 | 0 (0–0.179) | 9 | 0.31 (0.172–0.489) | 7.24×10−8* | |
Naidu | Andhra Pradesh | 79.6 | 13.22 | Dravidian | 21 | 11 (52.381) | 9 | 0.9 (0.8–1) | 1 | 0.1 (0–0.265) | 0 | 0 (0–0.165) | 0.868 | |
Reddy | Andhra Pradesh | 78.48 | 17.37 | Dravidian | 24 | 1 (4.167) | 17 | 0.739 (0.609–0.935) | 6 | 0.261 (0.13–0.457) | 0 | 0 (0–0.196) | 0.472 | |
Mudaliar | Tamil Nadu | 79.13 | 12.92 | Dravidian | 48 | 3 (6.25) | 41 | 0.911 (0.844–0.984) | 0 | 0 (0–0.073) | 4 | 0.089 (0.022–0.162) | 1.97×10−11* | |
Gammavokklu | Karnataka | 74.83 | 12.93 | Dravidian | 19 | 4 (21.053) | 13 | 0.867 (0.8–1) | 2 | 0.133 (0.067–0.329) | 0 | 0 (0–0.196) | 0.782 | |
Vysya | Andhra Pradesh | 77.65 | 14.68 | Dravidian | 60 | 10 (16.667) | 40 | 0.8 (0.7–0.904) | 10 | 0.2 (0.1–0.304) | 0 | 0 (0–0.104) | 0.432 | |
Gawli | Karnataka | 74.77 | 13.33 | Dravidian | 89 | 10 (11.236) | 63 | 0.797 (0.722–0.884) | 10 | 0.127 (0.051–0.214) | 6 | 0.076 (0–0.163) | 2.33×10−5* | |
Medari | Andhra Pradesh | 80.61 | 16.56 | Dravidian | 4 | 0 (0) | 3 | 0.75 (0.5–1) | 1 | 0.25 (0–0.601) | 0 | 0 (0–0.351) | 0.775 | |
Madar | Karnataka | 75.05 | 15.33 | Dravidian | 70 | 9 (12.857) | 55 | 0.902 (0.852–0.98) | 1 | 0.016 (0–0.095) | 5 | 0.082 (0.033–0.161) | 2.07×10−12* | |
Patkar | Andhra Pradesh | 78.1 | 15.8 | Dravidian | 20 | 1 (5) | 12 | 0.632 (0.474–0.876) | 0 | 0 (0–0.244) | 7 | 0.368 (0.211–0.612) | 1.3×10−5* | |
Raj-Gond | Madhya Pradesh | 78.7 | 23.87 | Dravidian | 28 | 19 (67.857) | 9 | 1 (1–1) | 0 | 0 (0–0.181) | 0 | 0 (0–0.181) | 1 | |
Adhiyan | Tamil Nadu | 79.41 | 13.72 | Dravidian | 44 | 4 (9.091) | 37 | 0.925 (0.875–1) | 3 | 0.075 (0.025–0.16) | 0 | 0 (0–0.085) | 0.805 | |
Kurumba | Tamil Nadu | 79.09 | 12.94 | Dravidian | 15 | 2 (13.333) | 11 | 0.846 (0.769–1) | 2 | 0.154 (0.077–0.379) | 0 | 0 (0–0.225) | 0.764 | |
Chenchu | Andhra Pradesh | 78.47 | 17.37 | Dravidian | 27 | 2 (7.407) | 17 | 0.68 (0.52–0.861) | 0 | 0 (0–0.181) | 8 | 0.32 (0.16–0.501) | 5.73×10−7* | |
Kurumba | Madhya Pradesh | 75.83 | 22.71 | Dravidian | 26 | 6 (23.077) | 14 | 0.7 (0.55–0.919) | 0 | 0 (0–0.219) | 6 | 0.3 (0.15–0.519) | 7.74×10−6* | |
Vaddera | Andhra Pradesh | 79.48 | 18.72 | Dravidian | 8 | 0 (0) | 5 | 0.625 (0.375–0.959) | 0 | 0 (0–0.334) | 3 | 0.375 (0.125–0.709) | 4.67×10−3* | |
Brahmin-Tiwari | Uttar Pradesh | 82.68 | 25.73 | Indo-European | 44 | 13 (29.545) | 28 | 0.903 (0.839–1) | 3 | 0.097 (0.032–0.205) | 0 | 0 (0–0.109) | 0.777 | |
Kashmiri pandit | Jammu and Kashmir | 75.83 | 34.37 | Indo-European | 21 | 0 (0) | 17 | 0.81 (0.714–0.995) | 3 | 0.143 (0.048–0.328) | 1 | 0.048 (0–0.233) | 0.144 | |
Bhil | Gujarat | 72.67 | 23.03 | Indo-European | 4 | 0 (0) | 4 | 1 (1–1) | 0 | 0 (0–0.416) | 0 | 0 (0–0.416) | 1 | |
Gamit | Gujrat | 72.83 | 21.17 | Indo-European | 45 | 7 (15.556) | 35 | 0.921 (0.868–1) | 3 | 0.079 (0.026–0.168) | 0 | 0 (0–0.089) | 0.8 | |
Tharu | Uttarakhand | 79.5 | 29.38 | Indo-European | 30 | 3 (10) | 23 | 0.852 (0.778–0.998) | 3 | 0.111 (0.037–0.258) | 1 | 0.037 (0–0.184) | 0.078 | |
Warli | Maharastra | 72.95 | 19.17 | Indo-European | 70 | 7 (10) | 48 | 0.762 (0.667–0.866) | 15 | 0.238 (0.143–0.342) | 0 | 0 (0–0.104) | 0.283 | |
Baiswar | Uttar Pradesh | 82.6 | 25.15 | Indo-European | 40 | 6 (15) | 23 | 0.676 (0.529–0.825) | 11 | 0.324 (0.176–0.472) | 0 | 0 (0–0.148) | 0.260 | |
Pandit | Haryana | 76.87 | 29.96 | Indo-European | 40 | 12 (30) | 23 | 0.821 (0.714–0.962) | 1 | 0.036 (0–0.176) | 4 | 0.143 (0.036–0.283) | 4.41×10−6* | |
Bhilala | Madhya Pradesh | 75.3 | 22.6 | Indo-European | 49 | 9 (18.367) | 33 | 0.825 (0.725–0.931) | 5 | 0.125 (0.025–0.231) | 2 | 0.05 (0–0.156) | 0.018 | |
Chakhesang_Naga | Nagaland | 94.48 | 26.12 | Tibeto-Burman | 33 | 19 57.576) | 14 | 1 (1–1) | 0 | 0 (0–0.116) | 0 | 0 (0–0.116) | 1 | |
Naga-sema | Nagaland | 93.81 | 25.7 | Tibeto-Burman | 40 | 21 (52.5) | 16 | 0.842 (0.737–1) | 3 | 0.158 (0.053–0.331) | 0 | 0 (0–0.173) | 0.708 | |
Mizo | Mizoram | 92.83 | 23.2 | Tibeto-Burman | 23 | 7 (30.435) | 13 | 0.813 (0.688–1) | 3 | 0.188 (0.063–0.39) | 0 | 0 (0–0.202) | 0.679 | |
Total | South Asia | 79.51 | 23.66 | – | 1278 | 224 (17.53) | 573b | 0.82 (0.793–0.848)b | 119b | 0.17 (0.143–0.199) | 7b | 0.010 (0–0.039)b | 0.767b | |
South Asians (NGS data repository) |
South Asians | 79.51 | 23.66 | – | 210 | – | 155 | 0.74 (0.681–0.798) | 50 | 0.24 (0.181–0.298) | 5 | 0.024 (0–0.084) | 0.58 | |
1000 Genomes Project | ACB | Africa | −59.61 | 13.19 | – | 96 | – | 95 | 0.990 (0.979–1) | 1 | 0.010 (0–0.028) | 0 | 0 (0–0.018) | 0.959 |
ASW | Africa | −88.62 | 36.07 | – | 61 | – | 59 | 0.967 (0.934–1) | 2 | 0.033 (0–0.067) | 0 | 0 (0–0.034) | 0.896 | |
ESN | Africa | 3.33 | 6.53 | – | 99 | – | 99 | 1 (1–1) | 0 | 0 (0–0.016) | 0 | 0 (0–0.016) | 1 | |
GWD | Africa | −15.87 | 13.43 | – | 113 | – | 113 | 1 (1–1) | 0 | 0 (0–0.014) | 0 | 0 (0–0.014) | 1 | |
LWK | Africa | 34.76 | 0.60 | – | 99 | – | 99 | 1 (1–1) | 0 | 0 (0–0.016) | 0 | 0 (0–0.016) | 1 | |
MSL | Africa | −12.91 | 8.45 | – | 85 | – | 85 | 1 (1–1) | 0 | 0 (0–0.019) | 0 | 0 (0–0.019) | 1 | |
YRI | Africa | 3.83 | 7.42 | – | 108 | – | 108 | 1 (1–1) | 0 | 0 (0–0.015) | 0 | 0 (0–0.015) | 1 | |
CLM | America | −75.67 | 6.27 | – | 94 | – | 82 | 0.872 (0.819–0.943) | 12 | 0.128 (0.074–0.198) | 0 | 0 (0–0.070) | 0.509 | |
MXL | America | −99.08 | 19.30 | – | 64 | – | 61 | 0.953 (0.922–1) | 3 | 0.047 (0.016–0.100) | 0 | 0 (0–0.053) | 0.848 | |
PEL | America | −77.06 | −12.06 | – | 85 | – | 83 | 0.976 (0.953–1) | 2 | 0.024 (0–0.048) | 0 | 0 (0–0.024) | 0.913 | |
PUR | America | −66.91 | 18.20 | – | 104 | – | 95 | 0.913 (0.875–0.971) | 9 | 0.087 (0.048–0.144) | 0 | 0 (0–0.057) | 0.645 | |
CDX | East Asian | 100.67 | 21.98 | – | 93 | – | 88 | 0.946 (0.914–0.992) | 5 | 0.054 (0.022–0.100) | 0 | 0 (0–0.046) | 0.79 | |
CHB | East Asian | 116.12 | 39.94 | – | 103 | – | 95 | 0.922 (0.883–0.975) | 8 | 0.078 (0.039–0.130) | 0 | 0 (0–0.053) | 0.682 | |
CHS | East Asian | 109.81 | 26.67 | – | 105 | – | 96 | 0.914 (0.876–0.970) | 8 | 0.076 (0.038–0.132) | 1 | 0.010 (0–0.065) | 0.101 | |
JPT | East Asian | 139.57 | 35.67 | – | 104 | – | 100 | 0.962 (0.933–0.994) | 4 | 0.038 (0.010–0.071) | 0 | 0 (0–0.033) | 0.842 | |
KHV | East Asian | 106.41 | 10.77 | – | 99 | – | 92 | 0.929 (0.889–0.978) | 7 | 0.071 (0.030–0.119) | 0 | 0 (0–0.048) | 0.715 | |
CEU | European | 3.42 | 46.72 | – | 99 | – | 86 | 0.869 (0.808–0.930) | 13 | 0.131 (0.071–0.192) | 0 | 0 (0–0.061) | 0.484 | |
FIN | European | 24.97 | 60.15 | – | 99 | – | 88 | 0.889 (0.838–0.951) | 11 | 0.111 (0.061–0.173) | 0 | 0 (0–0.062) | 0.558 | |
GBR | European | −0.16 | 51.49 | – | 91 | – | 78 | 0.857 (0.802–0.934) | 13 | 0.143 (0.088–0.220) | 0 | 0 (0–0.077) | 0.463 | |
IBS | European | −3.82 | 40.44 | – | 107 | – | 89 | 0.832 (0.766–0.899) | 18 | 0.168 (0.103–0.236) | 0 | 0 (0–0.067) | 0.342 | |
TSI | European | 12.48 | 41.94 | – | 107 | – | 90 | 0.841 (0.785–0.914) | 16 | 0.150 (0.093–0.222) | 1 | 0.009 (0–0.082) | 0.760 | |
BEB | South Asia | 90.39 | 23.65 | Indo-European | 86 | – | 66 | 0.767 (0.686–0.857) | 20 | 0.233 (0.151–0.322) | 0 | 0 (0–0.090) | 0.222 | |
GIH | South Asia | 72.44 | 23.02 | Indo-European | 103 | – | 77 | 0.748 (0.670–0.832) | 25 | 0.243 (0.165–0.327) | 1 | 0.010 (0–0.094) | 0.506 | |
ITU | South Asia | 78.48 | 17.39 | Dravidian | 102 | – | 82 | 0.804 (0.735–0.881) | 19 | 0.186 (0.118–0.264) | 1 | 0.010 (0–0.087) | 0.931 | |
PJL | South Asia | 72.91 | 33.67 | Indo-European | 96 | – | 78 | 0.812 (0.740–0.885) | 17 | 0.177 (0.104–0.250) | 1 | 0.010 (0–0.083) | 0.945 | |
STU | South Asia | 79.86 | 6.92 | Dravidian | 102 | – | 82 | 0.804 (0.735–0.882) | 20 | 0.196 (0.127–0.274) | 0 | 0 (0–0.078) | 0.272 | |
GA projectc | South Asians | South Asia | 73.45 | 27.36 | NA | 724 | – | 576 | 0.796 (0.768–0.826) | 138 | 0.191 (0.163–0.221) | 10 | 0.014 (0–0.044) | 0.598 |
Notes: aConfidence interval was calculated using Sison–Glanz method; bComputed after removing samples which were not in HWE; cGenomeAsia 100K project. *P-values for those populations which were not in Hardy–Weinberg equilibrium.
Other Putative Functional Variants and Novel Haplotypes
A few rare nonsynonymous variants have also been observed in the current study. In 1278 samples, nonsynonymous C>T variant (rs28371685) which replaces the amino acid arginine with tryptophane (p.Arg335Trp) and determines the CYP2C9*11 haplogroup was found in three samples (one each in Chenchu, Telagas of Andhra Pradesh, and Mudliar of Tamil Nadu). Besides this, other functional variants rs1799853 (p.Arg144Cys) and rs72558189 (p.Arg335Trp) were observed in 10 and six samples of NGS data repository, respectively. These variants are associated with CYP2C9*2 and *14 haplotypes (Table 3).
Table 3.
Haplotype | Haplotype Countsa | Haplotype Frequency | rsID | Type of Mutation | Amino Acid | SIFT; Polyphen |
---|---|---|---|---|---|---|
Other rare haplotypes | ||||||
CYP2C9*2 | 200/10/0 | 0.024 | rs1799853 | Nonsynonymous | p.Arg144Cys | Tolerated (0.05); probably damaging (0.986) |
CYP2C9*11 | 1053/3/0 | 0.0014 | rs28371685 | Nonsynonymous | p.Arg335Trp | Tolerated (1); benign (0) |
CYP2C9*14 | 204/6/0 | 0.014 | rs72558189 | Nonsynonymous | p.Arg125His | Deleterious (0.05); benign (0.445) |
Novel haplotypes | ||||||
CYP2C9*63 | 209/1/0 | 0.0024 | rs141489852 | Nonsynonymous | p.Arg144His | Deleterious (0.01); probably damaging (0.95) |
CYP2C9*64 | 209/1/0 | 0.0024 | Novel (c.839C>G) | Nonsynonymous | p.Ser280Cys | Deleterious (0.01); possibly damaging (0.45) |
CYP2C9*65 | 209/1/0 | 0.0024 | Novel (c.978G>T) | Nonsynonymous | p.Glu326Asp | Deleterious (0.01); probably damaging (0.998) |
CYP2C9*66 | 209/1/0 and1054/2/0 | 0.0024 and 0.001 | rs578144976 | Nonsynonymous | p.Leu362Val | Tolerated (1); benign (0) |
CYP2C9*67 | 209/1/0 | 0.0024 | rs4918758;rs776908257 | Upstream; non-synonymous | p.Arg433Trp | Deleterious (0.01); probably damaging (0.965) |
CYP2C9*68 | 209/1/0 | 0.0024 | rs9332092; rs9332093; rs61604699; rs4918758; rs9332098; rs1057910; rs542577750; rs1057911 | Upstream (5); nonsynonymous; splice_donor; synonymous | p.Ile359Leu | Deleterious (0.02); benign (0.045) |
CYP2C9*69 | 208/2/0 | 0.0048 | rs4918758; novel (c.572A>G) | Upstream; nonsynonymous | p.Asp191Gly | Deleterious (0.01); probably damaging (0.98) |
CYP2C9*70 | 209/1/0 | 0.0024 | rs4918758; novel (c.1325G>T) | Upstream; nonsynonymous | p.Gly442Val | Deleterious (0.01); benign (0.003) |
Note: aMajor allele homozygous/heterozygous/minor allele homozygous.
In total, eight rare and putative functional variants were not present in any reported CYP2C9 haplotypes. To determine the haplotypes, variants present within 3000 base-pair upstream and 250 base-pair downstream of CYP2C9 were utilized. In total, eight haplotypes were identified and annotation was obtained from PharmVar consortium (Table 3, Figure 2A and B). The haplotype CYP2C9*69 was identified in two subjects, CYP2C9*66 was identified in three subjects while other haplotypes were observed in only one subject. The nonsynonymous variants present in CYP2C9*63, *64, *65, *67 and *69 are predicted to be deleterious in both SIFT and Polyphen predictions. The p.Leu362Val present within CYP2C9*66 is predicted to be tolerated/benign. The Leu362 is present within hydrophobic substrate binding pocket of CYP2C9 and conversion from leucine to valine can affect assess of drug to the heme group of active site.24 A rare splice-site donor variant rs542577750 is present within CYP2C9*68 which can affect splicing of intron-7 (Figure 2B).
In the Genome Aggregation Database project (gnomAD), rs578144976 and rs542577750 is reported only in South Asian samples (allele frequency=0.00085 and 0.00049). Moreover, the c.839C>G, c.978G>T, c.572A>G and c.1325G>T was not observed in any subjects of the gnomAD project. Besides South Asian subjects, the rs141489852 and rs776908257 was observed in American and non-Finnish European populations also. It suggests that CYP2C9*64, *65, *66, *68, *69 and *70 haplotypes are South Asian-specific.
Discussion
CYP2C9 is highly expressed in the human liver and metabolizes a wide range of drugs. Several nonsynonymous mutations have been associated with less catalytic activity of CYP2C9 and intrinsic clearance of drugs. The CYP2C9*3 allele has been reported with hypersensitive reaction against phenytoin in epilepsy patients,18 and decreased metabolism of celecoxib.19 It was also reported with high incidence of response rate against sulfonamides, and urea derivatives.20 The in vitro studies suggest that CYP2C9*2 and CYP2C9*3 alleles reduce enzyme activity 29–94% and 71–91%, respectively, clearance rate of many drugs, which includes S-warfarin, tolbutamide, fluvastatin, glimepiride, tenoxicam, candesartan, celecoxib and phenytoin.21 Of which, S-warfarin, phenytoin and tolbutamide have a narrow therapeutic index and patients need the right amount of drug depending upon age, gender, and genetic make-up for successful treatment of disease. Moreover, homozygous mutations have more effect compared to heterozygous. The CYP2C9*3/*3 reduces 95% compared to 64% clearance rate by CYP2C9*1/*3.22 Considering the higher level of evidence of association between CYP2C9*3 and drug response, CPIC (Clinical Pharmacogenomics Implementation Consortium) categorized CYP2C9*3 under level-1A.23
Many studies have shown that the variations in CYP2C9 are associated with therapeutic heterogeneity in Indian populations. CYP2C9*2 and *3 have been reported with less hydroxylation (or metabolism) of phenytoin in vivo in South Indian populations,10 compared to wild type CYP2C9*1. Ramasamy et al reported phenytoin toxicity in a patient with normal dose of 300 mg/day, who had CYP2C9*3/*3 genotype.11 The same symptoms were also reported by Thakkar et al in South Indian populations.12 South Asians have a unique evolutionary history and have been practicing endogamy for many centuries, hence the high frequency of homozygous CYP2C9*3/*3 identified in the current study is not surprising. A similar trend was also observed in samples of the 1000 Genomes Project in which South Asians have high allelic and genotypic frequency of CYP2C9*3. Since CYP2C9*3/*3 has a more pronounced effect, we predict heterogeneous drug response in South Asians compared to other world populations. It would be interesting to find out if all South Asian populations have a high frequency of CYP2C9*3 and *3/*3 alleles. We explored the frequency distribution, but did not find any correlation with linguistic or geographical location. Some of the populations have a high frequency of CYP2C9*3, eg 35.7% of individuals from the Adi Dravidars have the CYP2C9*3 allele, while some of the populations have a low frequency of the CYP2C9*3 allele. Approximately 14–28%, 0–36%, 0–32%, and 0–19% of individuals speaking Austro-Asiatic, Dravidian, Indo-European and Tibeto-Burman languages had the CYP2C9*3 allele. This suggests that South Asians are highly heterogeneous for this locus. Moreover, patients from Vysya, Mahli, Warli, Medari, Reddy, Ho, Baiswar, and Adi Dravidar populations, who have >20% individuals with CYP2C9*3 allele, should be genotyped for better treatment of disease. But this approach must be established first and its efficacy must be evaluated. We also find other rare haplotypes. Of which, three were already reported and eight were novel. Out of eight novel haplotypes, CYP2C9*64, *65, *66, *68, *69*70 and haplotypes are South Asian-specific as variants present within these haplotypes are reported only in South Asian subjects of the gnomAD project. All of the novel haplotypes are predicted to be deleterious and may have effects on protein function. It would be interesting to explore the effects of these novel haplotypes on the metabolic activity of CYP2C9 and find genetic association with therapeutic response in large samples.
Conclusions
In conclusion, we identified high genetic heterogeneity in CYP2C9 locus among South Asian populations. We observed higher frequency of CYP2C9*3 and CYP2C9*3/*3 alleles among South Asian populations, compared to populations from the rest of the world. The CYP2C9*3 has been associated with therapeutic response. Moreover, in the in vitro studies, the effect of CYP2C9*3/*3 allele was seen more pronounced compared to heterozygous and wild type homozygous genotype. As South Asians have a high frequency of CYP2C9*3, it would be interesting to explore the potential of CYP2C9*3 as marker for personalized therapy. Furthermore, it would be interesting to compare frequency of responder and nonresponder patients among populations and to find correlation with frequency spectrum of pharmacologically important variations. We also observed several nonsynonymous rare variants and novel haplotypes (CYP2C9*63-*70) in the present study. Of which, CYP2C9*64, *65, *66, *68, *69 and *70 haplotypes are South Asian-specific. The SIFT and PolyPhen algorithm predicts that these variants are deleterious and damaging. Therefore, individuals having CYP2C9 haplotypes with deleterious variants may have different metabolic activity compared to wild type. Collectively, our data provide fundamental knowledge of CYP2C9 genetic polymorphisms in South Asia, which could be relevant to further CYP2C9-related functional research and for personalized medicine.
Acknowledgments
We express our deepest condolence on the passing away of Mr Saurav Sharma. This work was supported by Council of Scientific and Industrial Research (CSIR), Government of India. Sheikh Nizamuddin was supported by ICMR JRF-SRF research fellowship. KT was supported by J C Bose Fellowship from Science and Engineering Research Board (SERB), Department of Science and Technology, Government of India (GAP0542). We thank Prof. Andrea Gaedigk for her help in submission of haplotypes to the PharmVar consortium.
Disclosure
The authors report no conflicts of interest in this work.
References
- 1.Miners JO, Birkett DJ. Cytochrome P4502C9: an enzyme of major importance in human drug metabolism. Br J Clin Pharmacol. 1998;45(6):525–538. doi: 10.1046/j.1365-2125.1998.00721.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sullivan-Klose TH, Ghanayem BI, Bell DA, et al. The role of the CYP2C9-Leu359 allelic variant in the tolbutamide polymorphism. Pharmacogenetics. 1996;6(4):341–349. doi: 10.1097/00008571-199608000-00007 [DOI] [PubMed] [Google Scholar]
- 3.Yasuda SU, Zhang L, Huang SM. The role of ethnicity in variability in response to drugs: focus on clinical pharmacology studies. Clin Pharmacol Ther. 2008;84(3):417–423. [DOI] [PubMed] [Google Scholar]
- 4.Kidd RS, Curry TB, Gallagher S, Edeki T, Blaisdell J, Goldstein JA. Identification of a null allele of CYP2C9 in an African-American exhibiting toxicity to phenytoin. Pharmacogenetics. 2001;11(9):803–808. doi: 10.1097/00008571-200112000-00008 [DOI] [PubMed] [Google Scholar]
- 5.Kimura M, Ieiri I, Mamiya K, Urae A, Higuchi S. Genetic polymorphism of cytochrome P450s, CYP2C19, and CYP2C9 in a Japanese population. Ther Drug Monit. 1998;20(3):243–247. doi: 10.1097/00007691-199806000-00001 [DOI] [PubMed] [Google Scholar]
- 6.Si D, Wang J, Zhang Y, Zhong D, Zhou H. Distribution of CYP2C9*13 allele in the Chinese Han and the long-range haplotype containing CYP2C9*13 and CYP2C19*2. Biopharm Drug Dispos. 2012;33(6):342–345. doi: 10.1002/bdd.1804 [DOI] [PubMed] [Google Scholar]
- 7.Dai DP, Xu RA, Hu LM, et al. CYP2C9 polymorphism analysis in Han Chinese populations: building the largest allele frequency database. Pharmacogenomics J. 2014;14(1):85–92. doi: 10.1038/tpj.2013.2 [DOI] [PubMed] [Google Scholar]
- 8.Giri AK, Khan NM, Grover S, et al. Genetic epidemiology of pharmacogenetic variations in CYP2C9, CYP4F2 and VKORC1 genes associated with warfarin dosage in the Indian population. Pharmacogenomics. 2014;15(10):1337–1354. doi: 10.2217/pgs.14.88 [DOI] [PubMed] [Google Scholar]
- 9.Reich D, Thangaraj K, Patterson N, Price AL, Singh L. Reconstructing Indian population history. Nature. 2009;461(7263):489–494. doi: 10.1038/nature08365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Rosemary J, Surendiran A, Rajan S, Shashindran CH, Adithan C. Influence of the CYP2C9 AND CYP2C19 polymorphisms on phenytoin hydroxylation in healthy individuals from south India. Indian J Med Res. 2006;123(5):665–670. [PubMed] [Google Scholar]
- 11.Ramasamy K, Narayan SK, Chanolean S, Chandrasekaran A. Severe phenytoin toxicity in a CYP2C9*3*3 homozygous mutant from India. Neurol India. 2007;55(4):408–409. doi: 10.4103/0028-3886.33300 [DOI] [PubMed] [Google Scholar]
- 12.Thakkar AN, Bendkhale SR, Taur SR, Gogtay NJ, Thatte UM. Association of CYP2C9 polymorphisms with phenytoin toxicity in Indian patients. Neurol India. 2012;60(6):577–580. doi: 10.4103/0028-3886.105189 [DOI] [PubMed] [Google Scholar]
- 13.Moorjani P, Thangaraj K, Patterson N, et al. Genetic evidence for recent population mixture in India. Am J Hum Genet. 2013;93(3):422–438. doi: 10.1016/j.ajhg.2013.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Genomes Project C, Auton A, Brooks LD, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.GenomeAsia KC. The GenomeAsia 100K Project enables genetic discoveries across Asia. Nature. 2019;576(7785):106–111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Thangaraj K, Joshi MB, Reddy AG, Gupta NJ, Chakravarty B, Singh L. CAG repeat expansion in the androgen receptor gene is not associated with male infertility in Indian populations. J Androl. 2002;23(6):815–818. [PubMed] [Google Scholar]
- 17.Nakatsuka N, Moorjani P, Rai N, et al. The promise of discovering population-specific disease-associated genes in South Asia. Nat Genet. 2017;49(9):1403–1407. doi: 10.1038/ng.3917 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ramasamy K, Narayan SK, Shewade DG, Chandrasekaran A. Influence of CYP2C9 genetic polymorphism and undernourishment on plasma-free phenytoin concentrations in epileptic patients. Ther Drug Monit. 2010;32(6):762–766. doi: 10.1097/FTD.0b013e3181fa97cc [DOI] [PubMed] [Google Scholar]
- 19.Tang C, Shou M, Rushmore TH, et al. In-vitro metabolism of celecoxib, a cyclooxygenase-2 inhibitor, by allelic variant forms of human liver microsomal cytochrome P450 2C9: correlation with CYP2C9 genotype and in-vivo pharmacokinetics. Pharmacogenetics. 2001;11(3):223–235. doi: 10.1097/00008571-200104000-00006 [DOI] [PubMed] [Google Scholar]
- 20.Zhou K, Donnelly L, Burch L, et al. Loss-of-function CYP2C9 variants improve therapeutic response to sulfonylureas in type 2 diabetes: a Go-DARTS study. Clin Pharmacol Ther. 2010;87(1):52–56. [DOI] [PubMed] [Google Scholar]
- 21.Lee CR, Goldstein JA, Pieper JA. Cytochrome P450 2C9 polymorphisms: a comprehensive review of the in-vitro and human data. Pharmacogenetics. 2002;12(3):251–263. doi: 10.1097/00008571-200204000-00010 [DOI] [PubMed] [Google Scholar]
- 22.Yasar U, Tybring G, Hidestrand M, et al. Role of CYP2C9 polymorphism in losartan oxidation. Drug Metab Dispos. 2001;29(7):1051–1056. [PubMed] [Google Scholar]
- 23.Thorn CF, Klein TE, Altman RB. PharmGKB: the pharmacogenomics knowledge base. Methods Mol Biol. 2013;1015:311–320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Williams PA, Cosme J, Ward A, Angove HC, MatakVinkovic D, Jhoti H. Crystal structure of human cytochrome P450 2C9 with bound warfarin. Nature. 2003;424(6947):464–468. doi: 10.1038/nature01862 [DOI] [PubMed] [Google Scholar]