Abstract
Glucose-6-phosphate dehydrogenase deficiency (G6PD deficiency; OMIM #300908) is the most common inborn error disorders worldwide. While the G6PD is the key enzyme of removing oxidative stress in erythrocytes, the early diagnosis is utmost vital to prevent chronic and drug-, food- or infection-induced hemolytic anemia. The characterization of the mutations is also important for the subsequent genetic counseling, especially for female carrier with ambiguous enzyme activities and males with mild mutations. While multiplex SNaPshot assay and Sanger sequencing were performed on 500 G6PD deficient males, five newly discovered variations, namely c.187G > A (p.E63K), c.585G > C (p.Q195H), c.586A > T (p.I196F), c.743G > A (p.G248D), and c.1330G > A (p.V444I) were detected in the other six patients. These variants were previously named as the Pingtung, Tainan, Changhua, Chiayi, and Tainan-2 variants, respectively. The in silico analysis, as well as the prediction of the structure of the resultant mutant G6PD protein indicated that these five newly discovered variants might be disease causing mutations.
Keywords: G6PD deficiency, Mutation analysis, In silico analysis, Structural predication
Subject area | Genetics, Genomics and Molecular Biology |
More specific subject area | Inborn errors of metabolism |
Type of data | Tables, Figures |
How data was acquired | DNA sequencing using 3730xl Genetic Analyzer (Thermo Fisher Scientific, Waltham, MA, USA), mutation severity prediction softwares, structural effect prediction software |
Data format | Analyzed |
Experimental factors | DNA extracted from dried blood spot used in newborn screening |
Experimental features | Bioinformatic tools |
Data source location | Taiwan |
Data accessibility | Provided within this article |
Related research article | Chiu YH, Chen HJ, Chang YC, Liu YN, Kao SM, Liu MY, Weng YY, Hsiao KJ, Liu TT. Applying a multiplexed primer extension method on dried blood spots increased the detection of carriers at risk of glucose-6-phosphate dehydrogenase deficiency in newborn screening program. Clin. Chim. Acta 495 (2019) 271–277. https://doi.org/10.1016/j.cca.2019.04.074[1]. |
Value of the Data
|
1. Data
This dataset presented the in silico and structural analysis of the five newly discovered variations, namely c.187G > A (p.E63K), c.585G > C (p.Q195H), c.586A > T (p.I196F), c.743G > A (p.G248D), and c.1330G > A (p.V444I) (Fig. 1), detected in the six Taiwanese G6PD deficient patients using Sanger Sequencing (Table 1).
Table 1.
Patient Number | A129 | A244 | A281 | A367 | A397 | A453 |
---|---|---|---|---|---|---|
Sex | Male | Male | Male | Male | Male | Male |
Place of Birth | Changhua | Chiayi | Tainan | Tainan | Pingtung | Tainan |
Age at newborn screening (day) | 2 | 2 | 2 | 2 | 3 | 3 |
G6PD activity in newborn screening (U/gHb)a | 0.2 | 5.5 | 5.3 | 1.7 | 5.7 | 5.1 |
Age when confirmed (day) | 34 | 9 | 22 | 15 | 14 | 11 |
Confirmed G6PD activity (U/gHb)b | 0.1 | 6.1 | 5.5 | 0.2 | 8.6 | 6.5 |
Variation found | c.586A > T | c.743G > A | c.1330G > A | c.585G > C | c.187G > A | c.1330G > A |
Clinical referral was recommended for those enzyme activity ≦6.0 U/gHb.
The confirmed diagnosis was performed through a quantitative enzyme activity assay by using fresh whole blood. G6PD-deficiency would be suggested for those with G6PD activity ≦10.0 U/gHb.
The comparison sequence of these variants in G6PD protein of different species [2], including Homo sapiens, Mus musculus, Danio rerio (zebrafish), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans were presented in Fig. 2. The in silico analysis using SIFT [3], PolyPhen-2 [3], Mutation Taster [4] and Slicing Finder [5] softwares, as well as the conservation between species and allele frequency in Taiwanese population [6] were summarized in Table 2. Furthermore, the amino acid alterations were presented in the functional domains [7] (Fig. 3) and in partial 3D model of G6PD [8] (Fig. 4). The structure of the resultant mutant G6PD protein were analyzed by HOPE, Have yOur Protein Explained [9] (Table 3).
Table 2.
Nucleotide substitution | Amino acid substitution | SIFT | PolyPhen-2 | Mutation Taster | Splicing finder | Conservationa | Allele Frequencyb | Predicted Classc |
---|---|---|---|---|---|---|---|---|
c.187G > A | p.E63K | Tolerated | Benign | Disease causing | Potential alteration | Moderately | <2/1417d | III-IV |
c.585G > C | p.Q195H | Damaging | Probably damaging | Disease causing | Potential alteration | Highly | <1/1000 | II |
c.586A > T | p.I196F | Damaging | Probably damaging | Disease causing | Potential alteration | Highly | <1/1000 | II |
c.743G > A | p.G248D | Damaging | Probably damaging | Disease causing | Probably no impact | Highly | <1/1000 | III |
c.1330G > A | p.V444I | Tolerated | Possibly damaging | Disease causing | Potential alteration | Highly | <1/1000 | III |
Sequence comparison between Homo sapiens, Mus musculus, Danio rerio (zebrafish), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans and Saccharomyces cerevisiae as shown in Fig. 2.
Allele frequency in Taiwanese population (https://taiwanview.twbiobank.org.tw/browse38, accessed on 25 April 2019) [6].
Classification of G6PD variants in the study according to the WHO definition [7].
Two alleles in 1417 people with indeterminate sex.
Table 3.
Mutants | Structure prediction by HOPE algorithma |
---|---|
p.E63K | The wide-type residue forms a salt bridge with arginine at position 104. The difference in charge will disturb the ionic interaction made by the original, wild-type residue. |
p.Q195H | The wild-type residue forms a hydrogen bond with arginine at position 192. The size difference between wild-type and mutant residue makes that the new residue is not in the correct position to make the same hydrogen bond as the original wild-type residue did. |
p.I196F | The mutant residue is bigger than the wild-type residue and is located in a domain that is important for the activity of the protein and in contact with residues in another domain. The mutation can affect this interaction and as such affect protein function. |
p.G248D | The wild-type residue is a glycine, the most flexible of all residues. This flexibility might be necessary for the protein's function. Mutation of this glycine can abolish this function. |
p.V444I | The mutant residue is bigger than the wild-type residue and is located in a domain that is important for binding of other molecules. The mutation might affect this interaction and thereby disturb signal transfer from binding domain to the activity domain. |
Using software Have yOur Protein Explained (HOPE, http://www.cmbi.ru.nl/hope/) [9].
2. Experimental design, materials and methods
2.1. Mutation identification: sanger sequencing
In 500 G6PD-deficient male newborns detected by G6PD enzyme activity assay [10], nine of which do not carry any of the 21 common mutations described in Taiwan and Southeast Asia using multiplex SNaPshot assay [1]. Their dried blood spots used in newborn screening were subsequently subjected to mutational analysis by sequencing. The whole coding exons and exon-intron boundary sequences of G6PD gene were amplified and analyzed by forward and reverse Sanger sequencing. Putative mutations were confirmed by sequencing of an independent PCR product. The study protocol was reviewed and approved by the Institutional Review Board of Taipei City Hospital, Taiwan.
2.2. Sequence alignments between species
Conservation of the peptide sequence around the affected residues was assessed by alignment of orthologous and human G6PD sequences with ClustalW2, [2].
2.3. Severity prediction and allele frequency in population
Different online algorithms were used to predict the functional consequences of the five variants. The in silico analyses were performed using the SIFT [3], PolyPhen-2 [3], MutationTaster2 [4], and Human Splicing Finder [5] programs. Furthermore, the allele frequency of the alterations in Taiwanese population was listed as provided in Taiwan Biobank [6].
2.4. Distribution of mutations along the coding region and protein sequence
Distribution of alterations was highlighted in the coding region and the functional domains [7]. The A at the ATG translational initiation codon was numbered as 1 in reference accession number NM_001042351. The amino acid numbers were counted from the N-terminal Met of human G6PD protein.
2.5. 3D structure model of wide type G6PD protein
The 3D structure of G6PD variations observed in this study were presented based on the X-ray crystal structure available at the Protein Data Bank from human G6PD protein (PDB code 1QKI) [8].
2.6. Prediction of structural effects of variations
When protein structure is important to predict the effects of variants [11], effect of mutations over G6PD protein structure was determined using HOPE (Have yOur Protein Explained) software [9].
Acknowledgments
This research was supported by the Taipei City Government, Taiwan [grant number 10501-62-058], and Taipei City Hospital, Taiwan [grant number TPCH-103-002].
Contributor Information
Kwang-Jen Hsiao, Email: hsiao@pmf.tw.
Tze-Tze Liu, Email: ttliu@ym.edu.tw, tze@pmf.tw.
Conflict of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- 1.Chiu Y.H., Chen H.J., Chang Y.C., Liu Y.N., Kao S.M., Liu M.Y. Applying a multiplexed primer extension method on dried blood spots increased the detection of carriers at risk of glucose-6-phosphate dehydrogenase deficiency in newborn screening program. Clin. Chim. Acta. 2019;495:271–277. doi: 10.1016/j.cca.2019.04.074. [DOI] [PubMed] [Google Scholar]
- 2.Larkin M.A., Blackshields G., Brown N.P., Chenna R., McGettigan P.A., McWilliam H. Clustal W and clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
- 3.Flanagan S.E., Patch A.M., Ellard S. Using SIFT and PolyPhen to predict loss-of-function and gain-of-function mutations. Genet. Test. Mol. Biomark. 2010;14:533–537. doi: 10.1089/gtmb.2010.0036. https://doi:10.1089/gtmb.2010.0036 [DOI] [PubMed] [Google Scholar]
- 4.Schwarz J.M., Cooper D.N., Schuelke M., Seelow D. MutationTaster2: mutation prediction for the deep-sequencing age. Nat. Methods. 2014;11:361–362. doi: 10.1038/nmeth.2890. https://doi:10.1038/nmeth.2890 [DOI] [PubMed] [Google Scholar]
- 5.Desmet F.O., Hamroun D., Lalande M., Collod-Béroud G., Claustres M., Béroud C. Human Splicing Finder: an online bioinformatics tool to predict splicing signals. Nucleic Acids Res. 2009;37:e67. doi: 10.1093/nar/gkp215. https://doi:10.1093/nar/gkp215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Biobank Taiwan. 2019. Genetic and Medical Information for Taiwan.https://taiwanview.twbiobank.org.tw/browse38 accessed. [Google Scholar]
- 7.Cappellini M.D., Fiorelli G. Glucose-6-phosphate dehydrogenase deficiency. Lancet. 2008;37:64–74. doi: 10.1016/S0140-6736(08)60073-2. https://doi:10.1016/S0140-6736(08)60073-2 [DOI] [PubMed] [Google Scholar]
- 8.Au S.W., Gover S., Lam V.M., Adams M.J. Human glucose-6-phosphate dehydrogenase: the crystal structure reveals a structural NADP(+) molecule and provides insights into enzyme deficiency. Structure. 2000;8:293–303. doi: 10.1016/s0969-2126(00)00104-0. [DOI] [PubMed] [Google Scholar]
- 9.Venselaar H., Te Beek T.A., Kuipers R.K., Hekkelman M.L., Vriend G. Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces. BMC Bioinf. 2010;11:548. doi: 10.1186/1471-2105-11-548. https://doi:10.1186/1471-2105-11-548 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Chiang S.H., Fan M.L., Hsiao K.J. External quality assurance programme for newborn screening of glucose-6-phosphate dehydrogenase deficiency. Ann. Acad. Med. Singapore. 2008;37:84. [PubMed] [Google Scholar]
- 11.Muniz J.R.C., Szeto N.W., Frise R., Lee W.H., Wang X.S., Thöny B. Role of protein structure in variant annotation: structural insight of mutations causing 6-pyruvoyl-tetrahydropterin synthase deficiency. Pathology. 2019;51:274–280. doi: 10.1016/j.pathol.2018.11.011. https://doi:10.1016/j.pathol.2018.11.011 [DOI] [PubMed] [Google Scholar]