Abstract
Missense mutations in the α-galactosidase A (GLA) gene comprising the majority of mutations responsible for Fabry disease result in heterogeneous phenotypes ranging from the early onset severe “classic” form to the “later-onset” milder form. To elucidate the molecular basis of Fabry disease from the viewpoint of structural biology, we comprehensively examined the effects of different substitutions at the same residue in the amino acid sequence of GLA on the structural change in the enzyme molecule and the clinical phenotype by calculating the number of atoms affected and the root-mean-square-distance value, and by coloring of the atoms influenced by the amino acid replacements. The results revealed that the severity of the structural change influences the disease progression, i.e., a small structural change tends to lead to the later-onset form and a large one to the classic form. Furthermore, the study revealed the residues important for expression of the GLA activity, i.e., residues involved in construction of the active site, a disulfide bond or a dimer. Structural study from such a viewpoint is useful for elucidating the basis of Fabry disease.
Introduction
Fabry disease (MIM 301500) is an X-linked genetic disorder resulting from a deficiency of α-galactosidase A (GLA; EC 3.2.1.22) activity [1]. GLA deficiency causes the progressive accumulation of glycolipids, predominantly globotriaosylceramide, in lysosomes of cells. The disease exhibits a wide range of clinical phenotypes, from the early-onset severe “classic” form to the “later-onset” milder one [2]. Generally, male patients with the classic form of Fabry disease, who have little or no GLA activity, develop pain in the peripheral extremities, hypohidrosis, angiokeratomas and corneal opacities in childhood or adolescence, and manifest renal, cardiac, and cerebrovascular complications in the fourth to fifth decade of life [3]. On the other hand, male patients with the later-onset form, who have residual GLA activity, develop heart and kidney disorders without the childhood symptoms [4]. Heterozygous Fabry females exhibit a wide spectrum of disease severity ranging from asymptomatic to presentation with the classic disease due to random X-chromosomal inactivation [5].
The GLA gene is localized to Xq22.1 and encodes a precursor GLA comprising a 429-amino acid polypeptide, the enzyme being glycosylated and then processed to the mature form comprising 398 amino acids, and it exists as a homodimer in lysosomes [1]. Each monomer contains a (β/α)8 barrel domain containing the active site and an anti- parallel β-sheet domain [6]. So far, more than 600 genetic mutations causing Fabry disease have been identified, and it is known that gross alterations, nonsense mutations, and most of the splicing mutations of the GLA gene lead to the classic form. However missense mutations comprising the majority of mutations result in heterogeneous phenotypes ranging from the classic form to the later-onset one.
Previously, Garman and his research group determined the GLA structure by means of X-ray crystallography and analyzed the locations of missense and nonsense mutations in the three-dimensional structure [6], [7]. Our research group studied structural changes caused by missense mutations responsible for Fabry disease by calculating the numbers of affected atoms and the root-mean-square distance (RMSD) values [8], and proposed a phenotype prediction model based on sequential and structural information [9].
In this study, we comprehensively examined different substitutions at the same residue in the amino acid sequence of GLA, focusing on their effects on the structural change in the enzyme protein and the clinical phenotype, as such investigation will provide us with information about the relationship between the enzyme structure and the disease.
Materials and Methods
GLA missense mutations
We collected GLA missense mutations and polymorphisms registered on the Human Gene Mutation Database (http://www.hgmd.cf.ac.uk/) and Fabry database (http://fabry-database.org/). From them, we selected cases in which more than two substitutions at the same residue in the amino acid sequence of GLA have been reported. Finally, we analyzed 157 amino acid substitutions at 67 residues in this study.
Structural modeling of mutant GLAs
Structural models of mutant GLA monomers were built by means of homology modeling using molecular modeling software, TINKER (http://dasher.wustl.edu/tinker/) [10]–[14]. The crystal structure of human GLA (PDB: 1R46) [6] was used as a template, and energy minimization was performed. The root-mean-square gradient value was set at 0.05 kcal/mol·Å.
Calculation of the number of atoms influenced by an amino acid substitution and the RMSD values between the wild type GLA and mutant GLAs
Each mutant model was superimposed on the wild type GLA structure based on the Cα atoms by the least-square-mean fitting algorithm, in which the optimal rotations and translations are found by minimizing the sum of the squared distances among all structures in the superposition [15]–[19]. We defined that the atom was affected by an amino acid substitution when the position of the atom in a mutant differed from that in the wild type structure by more than 0.15 Å. We calculated the numbers of atoms affected in the main chain and in the side chain of the enzyme, and in the active site (E170 and E231). Then, we calculated the RMSD values between the wild type GLA and mutant GLAs [15]–[19].
Determination of the solvent-accessible surface area (ASA) value
The ASA value of an amino acid residue in the wild type GLA was calculated using Stride (http://webclu.bio.wzw.tum.de/stride/) to evaluate the location of the residue in the GLA molecule.
Coloring of the atoms influenced by an amino acid substitution
To determine the influence of the amino acid substitutions geographically and semi- quantitatively, coloring of the influenced atoms in the three-dimensional structure of the enzyme molecule was performed for 12 mutants (M72I, M72R, M72V, E66G, E66K, E66Q, C56G, C56F, C56Y, W236C, W236L, and W236R) as to four positions (M72, E66, C56, and W236) in the GLA structure. The colors of affected atoms were shown on the basis of the distance between the wild type and mutant one.
Statistical analysis
To determine the differences in the number of the affected atoms and the RMSD value between the classic Fabry group and later-onset one, statistical analysis was performed using Excel 2013 (Microsoft, Redmond, WA) by means of one side Welch's t test, it being taken that there was a significant difference if p<0.05. Then, power analysis (http://www.statmethods.net/stats/power.html) was performed using G*POWER3 to evaluate statistical power for this Welch's t test [20]. In power analysis calculation, sample sizes of two groups and significant level were set to 134, 11, and 0.05, respectively.
Results
Different substitutions at the same residue in the amino acid sequence of GLA
We examined the numbers of affected atoms for the whole enzyme protein and for the active site, and the RMSD and ASA values. The results are shown in Table 1. The numbers of atoms affected in the main chain and in the side chain, and the RMSD values in the classic Fabry group were 107±129 (134), 131±152 (134), and 0.089±0.074 Å (134), respectively. The values are expressed as average ± standard deviation (number of cases). On the other hand, in the later-onset Fabry group, they were 23±36 (11), 28±50 (11), and 0.033±0.038 Å (11), respectively. The statistical analysis showed significant differences between the classic Fabry group and the later-onset Fabry group in numbers of affected atoms in the main chain (P<0.001, Welch's t test) and in the side chain (P<0.001, Welch's t test), and RMSD (P<0.001, Welch's t test). The results of the power analysis revealed that the estimated values of power were 0.70, 0.72, and 0.80 for numbers of affected atoms in the main chain and in the side chain, and RMSD, respectively. This suggests that the structural change resulting from the amino acid substitutions leading to the classic phenotype is essentially greater than that in the later-onset one, although there are some exceptional cases, i.e., in R112H and R301Q, the numbers of affected atoms and the RMSD values are apparently large, although the patients with these mutations exhibited the later-onset phenotype (Table 1). Furthermore, the results revealed that there were no later-onset Fabry cases in which the structure of the active site was affected, although there were 57 affected cases among the 134 classic Fabry ones. This suggests that a defect of the active site tends to lead to the classic phenotype.
Table 1. Different substitutions at the same residue of the amino acid sequence of α-galactosidase A.
| Genotype | Phenotype | Number of affected atoms | RMSD | ASA | Reference | ||
| Main chain | Side chain | Active site | (Å) | (Å2) | |||
| N34K | Classic | 165 | 153 | 0 | 0.196 | 25.7 | Hum Genomics 2006, 2: 297–309 |
| N34S | Classic | 6 | 1 | 0 | 0.029 | 25.7 | Am J Hum Genet 1993, 53: 1186–97 |
| P40L | Classic | 80 | 83 | 0 | 0.111 | 0.8 | J Invest Med 2000, 48: 227–35 |
| P40S | Classic | 15 | 12 | 0 | 0.032 | 0.8 | FEBS Lett 1990, 259: 353–6 |
| M42T | Classic | 9 | 9 | 0 | 0.023 | 3.8 | Mol Genet Metab 2002, 76: 23–30 |
| M42V | Classic | 12 | 13 | 0 | 0.028 | 3.8 | Eur J Hum Genet 1996, 4: 219–24 |
| G43R | Classic | 413 | 471 | 12 | 0.177 | 0.0 | Mol Med 2002, 8: 306–12 |
| G43V | Classic | 219 | 250 | 1 | 0.126 | 0.0 | Mol Genet Metab 2002, 76: 23–30 |
| H46R | Classic | 25 | 27 | 0 | 0.043 | 0.0 | Mol Med 1997, 3: 174–82 |
| H46Y | Hetero | 83 | 119 | 0 | 0.088 | 0.0 | Hum Mutat 2001, 18: 459 |
| R49C | Classic | 125 | 136 | 0 | 0.109 | 55.3 | Pharmacogenet Genomics 2008, 18: 773–80 |
| R49G | Classic | 182 | 197 | 0 | 0.169 | 55.3 | Mol Med 2002, 8: 306–12 |
| R49L | Classic | 240 | 259 | 4 | 0.201 | 55.3 | Hum Mol Genet 1994, 3: 667–9 |
| R49P | Classic | 174 | 209 | 2 | 0.125 | 55.3 | Hum Mutat 2001, 18: 459 |
| R49S | Classic | 361 | 365 | 2 | 0.251 | 55.3 | Eur J Hum Genet 1996, 4: 219–24 |
| C52R | Classic | 195 | 242 | 15 | 0.136 | 49.0 | Hum Mutat 1996, 8: 38–43 |
| C52S | Classic | 2 | 1 | 0 | 0.022 | 49.0 | Hum Mol Genet 1994, 3: 1795–9 |
| C52Y | Classic | 56 | 75 | 0 | 0.073 | 49.0 | Biochim Biophys Acta 2010, 1802: 247–52 |
| C56F | Classic | 67 | 78 | 0 | 0.128 | 38.4 | Hum Mol Genet 1994, 3: 1795–9 |
| C56G | Classic | 51 | 60 | 0 | 0.131 | 38.4 | Am J Hum Genet 1993, 53: 1186–97 |
| C56Y | Classic | 58 | 65 | 0 | 0.132 | 38.4 | Eur J Hum Genet 1996, 4: 219–24 |
| E66G | Classic | 45 | 74 | 0 | 0.062 | 29.2 | Am J Hum Genet 2006, 79: 31–40 |
| E66K | Classic | 422 | 503 | 7 | 0.361 | 29.2 | Hum Mutat 2005, 25: 412 |
| E66Q | Polymorphism | 23 | 32 | 0 | 0.048 | 29.2 | Hum Genet 1992, 89: 29–32. |
| M72I | Classic | 38 | 46 | 0 | 0.054 | 0.0 | Mol Med 2002, 8: 306–12 |
| M72R | Classic | 145 | 198 | 1 | 0.119 | 0.0 | Ned Tijdschr Geneeskd 200, 144: 2412–5 |
| M72V | Later-onset | 7 | 6 | 0 | 0.026 | 0.0 | Hum Mutat 1998, Suppl 1: S213–16 |
| L89P | Classic | 6 | 12 | 0 | 0.023 | 0.0 | Mol Med 1997, 3: 174–82 |
| L89R | Classic | 286 | 337 | 4 | 0.164 | 0.0 | Hum Mol Genet 1994, 3: 1795–9 |
| D92H | Classic | 324 | 482 | 14 | 0.140 | 0.2 | Eur J Hum Genet 1996, 4: 219–24 |
| D92Y | Classic | 408 | 580 | 16 | 0.182 | 0.2 | Mol Med 1997, 3: 174–82 |
| D93G | Classic | 216 | 264 | 14 | 0.133 | 0.4 | Eur J Hum Genet 1996, 4: 219–24 |
| D93N | Classic | 60 | 131 | 8 | 0.073 | 0.4 | J Mol Med 2005, 83: 647–54 |
| D93V | Classic | 200 | 284 | 12 | 0.128 | 0.4 | Hum Genomics 2006, 2: 297–309 |
| C94S | Classic | 20 | 19 | 0 | 0.031 | 0.6 | Hum Mutat 2001, 18: 459 |
| C94Y | Classic | 180 | 210 | 0 | 0.142 | 0.6 | Mol Med 1997, 3: 174–82 |
| A97P | Classic | 33 | 31 | 0 | 0.054 | 9.3 | Br J Dermatol 2002, 147: 545–8 |
| A97V | Later-onset | 10 | 12 | 0 | 0.025 | 9.3 | Mol Med 1997, 3: 174–82 |
| R100K | Classic | 37 | 22 | 0 | 0.043 | 28.8 | Hum Mol Genet 1994, 3: 1795–9 |
| R100T | Classic | 205 | 232 | 5 | 0.137 | 28.8 | Mol Med 1997, 3: 174–82 |
| R112C | Classic | 26 | 40 | 0 | 0.037 | 26.1 | J Invest Med 2000, 48: 227–35 |
| R112H | Later-onset | 70 | 70 | 0 | 0.082 | 26.1 | Hum Mol Genet 1994, 3: 1795–9 |
| R112S | Classic | 25 | 35 | 0 | 0.034 | 26.1 | Hum Mutat 2005, 25: 299–305 |
| F113L | Later-onset | 3 | 2 | 0 | 0.014 | 4.4 | Mol Med 1997, 3: 174–82 |
| F113S | Hetero | 0 | 0 | 0 | 0.005 | 4.4 | Hum Mutat 2001, 18: 459 |
| G138E | Classic | 167 | 203 | 1 | 0.121 | 0.0 | Mol Med 2002, 8: 306–12 |
| G138R | Classic | 209 | 259 | 4 | 0.277 | 0.0 | Mol Med 1997, 3: 174–82 |
| C142R | Classic | 50 | 89 | 13 | 0.065 | 37.2 | Mol Med 1999, 5: 806–11 |
| C142Y | Classic | 18 | 25 | 9 | 0.046 | 37.2 | Hum Genet 1995, 95: 557–61 |
| S148N | Classic | 21 | 25 | 0 | 0.039 | 0.0 | J Invest Med 2000, 48: 227–35 |
| S148R | Classic | 128 | 177 | 9 | 0.104 | 0.0 | Mol Med 1997, 3: 174–82 |
| W162C | Classic | 35 | 35 | 0 | 0.045 | 26.1 | Hum Genet 1996, 98: 719–26 |
| W162R | Classic | 28 | 44 | 0 | 0.043 | 26.1 | Am J Hum Genet 1993, 53: 1186–97 |
| L166G | Classic | 91 | 101 | 1 | 0.081 | 0.6 | Hum Genomics 2006, 2: 297–309 |
| L166V | Classic | 14 | 14 | 0 | 0.031 | 0.6 | Hum Genet 1995, 95: 557–61 |
| D170H | Classic | 236 | 302 | 8 | 0.120 | 0.0 | Hum Mutat 2003 Sup, 22: 258 |
| D170V | Classic | 88 | 134 | 4 | 0.072 | 0.0 | Mol Med 1997, 3: 174–82 |
| G171C | Classic | 69 | 85 | 7 | 0.065 | 4.5 | J Dermatol Sci 2008, 52: 61–4 |
| G171D | Classic | 88 | 103 | 8 | 0.095 | 4.5 | Hum Mutat 2005, 25: 299–305 |
| G171R | Classic | 315 | 347 | 12 | 0.284 | 4.5 | J Mol Med 2005, 83: 647–54 |
| C172F | Classic | 12 | 25 | 2 | 0.039 | 33.5 | J Hum Genet 2001, 46: 192–6 |
| C172G | Classic | 10 | 7 | 2 | 0.028 | 33.5 | Hum Mutat 2003, 22: 486–92 |
| C172R | Classic | 18 | 45 | 3 | 0.049 | 33.5 | J Invest Med 2000, 48: 227–35 |
| C172Y | Classic | 12 | 26 | 2 | 0.041 | 33.5 | Hum Mol Genet 1994, 3: 1795–9 |
| G183A | Hetero | 35 | 44 | 0 | 0.063 | 2.1 | Biochim Biophys Acta 2010, 1802: 247–52 |
| G183D | Classic | 262 | 296 | 8 | 0.204 | 2.1 | Mol Med 1999, 5: 806–811 |
| G183S | Classic | 62 | 103 | 0 | 0.088 | 2.1 | Mol Genet Metab 2002, 76: 23–30 |
| M187T | Classic | 0 | 0 | 0 | 0.006 | 0.0 | Hum Genomics 2006, 2: 297–309 |
| M187V | Classic | 8 | 13 | 0 | 0.030 | 0.0 | J Invest Med 2000, 48: 227–35 |
| S201F | Classic | 0 | 2 | 0 | 0.007 | 6.6 | Hum Mutat 2005, 25: 299–305 |
| S201Y | Classic | 0 | 2 | 0 | 0.011 | 6.6 | Hum Genomics 2006, 2: 297–309 |
| C202Y | Classic | 382 | 434 | 12 | 0.209 | 0.0 | Mol Med 1997, 3: 174–82 |
| C202W | Hetero | 276 | 313 | 7 | 0.175 | 0.0 | Hum Mol Genet 1994, 3: 503–5 |
| P205R | Classic | 455 | 571 | 9 | 0.257 | 0.2 | Mol Genet Metab 2002, 76: 23–30 |
| P205T | Classic | 6 | 16 | 0 | 0.022 | 0.2 | Eur J Hum Genet 1996, 4: 219–24 |
| Y207C | Classic | 3 | 4 | 3 | 0.017 | 50.9 | J Inherit Metab Dis 2009, 32: 424–40 |
| Y207S | Classic | 4 | 5 | 3 | 0.019 | 50.9 | Mol Genet Metab 2002, 76: 23–30 |
| P210L | Later-onset | 0 | 0 | 0 | 0.003 | 93.3 | New case |
| P210S | Later-onset | 0 | 0 | 0 | 0.003 | 93.3 | New case |
| Y216C | Classic | 1 | 1 | 0 | 0.005 | 7.5 | Biochim Biophys Acta 2010, 1802: 247–52 |
| Y216D | Classic | 187 | 246 | 6 | 0.145 | 7.5 | Mol Med 1997, 3: 174–82 |
| C223R | Classic | 518 | 589 | 15 | 0.255 | 0.0 | Mol Genet Metab 2002, 76: 23–30 |
| C223Y | Classic | 451 | 546 | 16 | 0.220 | 0.0 | Mol Genet Metab 2002, 76: 23–30 |
| N224D | Classic | 65 | 67 | 0 | 0.057 | 0.0 | Hum Mutat 1998, Suppl 1; S288–90 |
| N224S | Classic | 42 | 47 | 0 | 0.054 | 0.0 | J Invest Med 2000, 48: 227–35 |
| D231G | Classic | 20 | 58 | 3 | 0.045 | 54.9 | Exp Mol Med 2009, 31: 1–7 |
| D231V | Classic | 22 | 64 | 3 | 0.041 | 54.9 | Hum Mutat 2008, 29: 331 |
| D234E | Classic | 20 | 27 | 1 | 0.043 | 40.9 | Hum Mutat 2005, 25: 299–305 |
| D234Y | Classic | 352 | 456 | 13 | 0.270 | 40.9 | Mol Genet Metab 2002, 76: 23–30 |
| S235C | Classic | 0 | 0 | 0 | 0.004 | 53.9 | Mol Med 1999, 5: 806–811 |
| S235F | Classic | 0 | 1 | 0 | 0.004 | 53.9 | Hum Mutat 2008, 29: 331 |
| W236C | Classic | 2 | 7 | 0 | 0.012 | 40.6 | Eur J Hum Genet 1996, 4: 219–24 |
| W236L | Classic | 0 | 2 | 0 | 0.005 | 40.6 | Mol Med 1999, 5: 806–811 |
| W236R | Classic | 6 | 23 | 0 | 0.025 | 40.6 | Hum Genomics 2006, 2: 297–309 |
| D244H | Hetero | 226 | 287 | 5 | 0.122 | 74.8 | Mol Med 1999, 5: 806–811 |
| D244N | Classic | 20 | 42 | 0 | 0.037 | 74.8 | Hum Mol Genet 1994, 3: 1795–9 |
| G258R | Hetero | 97 | 87 | 0 | 0.101 | 0.8 | Hum Mutat 2001, 18: 459 |
| G258V | Classic | 67 | 67 | 0 | 0.087 | 0.8 | Hum Mutat 2008, 29: 331 |
| D264V | Classic | 132 | 194 | 9 | 0.097 | 11.2 | Am J Hum Genet 1993, 53: 1186–97 |
| D264Y | Classic | 103 | 132 | 6 | 0.087 | 11.2 | Hum Mutat 2005, 25: 299–305 |
| D266E | Classic | 40 | 69 | 8 | 0.061 | 5.9 | Mol Med 2002, 8: 306–12 |
| D266H | Hetero | 445 | 596 | 16 | 0.200 | 5.9 | J Investig Med 2000, 48: 227–35 |
| D266N | Classic | 48 | 75 | 10 | 0.059 | 5.9 | Clin Genet 2000, 58: 228–33 |
| D266V | Classic | 26 | 43 | 7 | 0.039 | 5.9 | Am J Hum Genet 1993, 53: 1186–97 |
| D266Y | Classic | 39 | 79 | 10 | 0.047 | 5.9 | Mol Genet Metab 2008, 95: 224–8 |
| M267I | Classic | 75 | 105 | 16 | 0.095 | 4.2 | Mol Med 1999, 5: 806–811 |
| M267R | Classic | 132 | 192 | 15 | 0.097 | 4.2 | Hum Genomics 2006, 2: 297–309 |
| V269A | Classic | 10 | 18 | 2 | 0.029 | 0.0 | Hum Mol Genet 1993, 2: 1051–3 |
| V269M | Classic | 113 | 129 | 8 | 0.104 | 0.0 | Hum Genomics 2006, 2: 297–309 |
| G271C | Classic | 58 | 57 | 0 | 0.066 | 0.2 | Mol Genet Metab 2002, 76: 23–30 |
| G271S | Classic | 45 | 65 | 0 | 0.057 | 0.2 | Hum Genomics 2006, 2: 297–309 |
| G271V | Classic | 195 | 239 | 7 | 0.149 | 0.2 | Hum Genomics 2006, 2: 297–309 |
| N272K | Classic | 60 | 100 | 0 | 0.065 | 2.8 | Hum Mol Genet 1994, 3: 1795–9 |
| N272S | Classic | 2 | 3 | 0 | 0.013 | 2.8 | Eur J Hum Genet 2004, 12: 678–81 |
| Q279E | Later-onset | 34 | 26 | 0 | 0.035 | 20.4 | Hum Genet 1992, 89: 29–32 |
| Q279H | Classic | 102 | 123 | 4 | 0.094 | 20.4 | Hum Mutat 2001, 18: 459 |
| Q279R | Classic | 55 | 56 | 0 | 0.061 | 20.4 | Hum Mutat 2003 Sup, 22: 258 |
| Q280H | Hetero | 87 | 104 | 0 | 0.079 | 0.0 | Hum Mutat 2001, 18: 459 |
| Q280K | Classic | 30 | 41 | 0 | 0.040 | 0.0 | J Mol Med 2005, 83: 647–54 |
| T282A | Classic | 3 | 0 | 0 | 0.008 | 1.2 | Hum Mutat 2008, 29: 331 |
| T282N | Classic | 6 | 9 | 0 | 0.023 | 1.2 | J Hum Genet 2001, 46: 192–6 |
| A285D | Classic | 32 | 29 | 0 | 0.041 | 0.0 | Hum Genomics 2006, 2: 297–309 |
| A285P | Classic | 21 | 29 | 0 | 0.043 | 0.0 | Hum Mutat 2005, 25: 299–305 |
| W287C | Classic | 16 | 14 | 0 | 0.028 | 1.6 | Mol Med 1997, 3: 174–82 |
| W287G | Classic | 11 | 12 | 0 | 0.026 | 1.6 | Eur J Hum Genet 1996, 4: 219–24 |
| A288D | Classic | 86 | 96 | 0 | 0.106 | 0.0 | Hum Mol Genet 1994, 3: 1795–9 |
| A288P | Classic | 50 | 60 | 0 | 0.068 | 0.0 | Mol Genet Metab 2002, 76: 23–30 |
| P293A | Classic | 75 | 58 | 0 | 0.069 | 2.6 | Mol Genet Metab 2002, 76: 23–30 |
| P293T | Classic | 13 | 18 | 0 | 0.032 | 2.6 | Hum Genomics 2006, 2: 297–309 |
| M296I | Later-onset | 5 | 12 | 0 | 0.018 | 0.0 | New Eng J Med 1995, 333: 288–93 |
| M296V | Later-onset | 7 | 13 | 0 | 0.023 | 0.0 | New Eng J Med 1991, 324: 395–399 |
| S297C | Classic | 0 | 1 | 0 | 0.011 | 0.0 | Mol Med 2002, 8: 306–12 |
| S297F | Classic | 215 | 268 | 8 | 0.145 | 0.0 | Am J Hum Genet 1993, 53: 1186–97 |
| N298H | Hetero | 140 | 181 | 0 | 0.089 | 0.0 | Eur J Hum Genet 1996, 4: 219–24 |
| N298K | Classic | 106 | 183 | 0 | 0.092 | 0.0 | Hum Mutat 1996, 8: 38–43 |
| N298S | Classic | 18 | 14 | 0 | 0.029 | 0.0 | Mol Med 1997, 3: 174–82 |
| R301Q | Later-onset | 112 | 166 | 0 | 0.126 | 40.5 | Am J Hum Genet 1990, 47: 784–9 |
| R301P | Classic | 73 | 94 | 0 | 0.074 | 40.5 | J Hum Genet 2001, 46: 192–6 |
| N320K | Classic | 150 | 154 | 0 | 0.105 | 0.0 | Biochem Biophys Res Commun 1995, 214: 1219–24 |
| N320Y | Classic | 239 | 286 | 0 | 0.183 | 0.0 | J Invest Med 2000, 48: 227–35 |
| Q321E | Classic | 124 | 144 | 0 | 0.088 | 42.6 | Mol Med 1999, 5: 806–811 |
| Q321R | Classic | 22 | 24 | 0 | 0.034 | 42.6 | Hum Genomics 2006, 2: 297–309 |
| G328A | Classic | 145 | 139 | 0 | 0.100 | 0.0 | Am J Hum Genet 1993, 53: 1186–97 |
| G328R | Classic | 560 | 584 | 4 | 0.283 | 0.0 | Hum Genet 1992, 89: 29–32 |
| G328V | Classic | 222 | 239 | 0 | 0.160 | 0.0 | Hum Mutat 2005, 25: 299–305 |
| E358A | Classic | 102 | 175 | 0 | 0.091 | 90.8 | Hum Mutat 2005, 25: 299–305 |
| E358G | Classic | 77 | 136 | 0 | 0.083 | 90.8 | Mol Med 2002, 8: 306–12 |
| E358K | Classic | 571 | 661 | 9 | 0.293 | 90.8 | Hum Mutat 1998, Suppl 1: S139–40 |
| G373D | Classic | 103 | 116 | 0 | 0.116 | 1.0 | Hum Mutat 2001, 17: 353 |
| G373S | Classic | 1 | 0 | 0 | 0.004 | 1.0 | Biochem Biophys Res Commun 1995, 214: 1219–24 |
| C382W | Hetero | 271 | 265 | 0 | 0.205 | 0.2 | Intern Med J 2002, 32: 575–84 |
| C382Y | Classic | 241 | 245 | 0 | 0.203 | 0.2 | Hum Mutat 2003 Sup, 22: 258 |
| P409A | Classic | 13 | 11 | 0 | 0.049 | 37.2 | Hum Mutat 2001, 18: 459 |
| P409S | Classic | 10 | 4 | 0 | 0.024 | 37.2 | Mol Med 2002, 8: 306–12 |
| P409T | Hetero | 21 | 18 | 0 | 0.078 | 37.2 | Hum Mutat 2001, 18: 459 |
| T410A | Later-onset | 0 | 0 | 0 | 0.005 | 18.6 | Clin Genet 2003, 63: 205–9 |
| T410P | Classic | 77 | 93 | 0 | 0.089 | 18.6 | Hum Mutat 2008, 29: 331 |
Classic, the classic form of Fabry disease; Later-onset, the later-onset form; Hetero, heterozygote of Fabry disease; and Polymorphism, GLA polymorphism.
Structural analysis of representative amino acid substitutions
We examined different amino acid substitutions at M72, E66, C56, and W236, because they are expected to provide us with useful information for elucidating the mechanism by which structural changes caused by them influence the severity of the disease and for identifying residues essential for the maintenance of proper folding. The localization of these residues in the dimer is shown in Fig. 1. The residues are widely distributed over the GLA molecule and are distant from the catalytic residues (D170 and D231).
Figure 1. Structure of the GLA dimer and positions of the amino acid residues involved in the substitutions.
The backbone is shown as a line. Subunit A and subunit B comprising the dimer are shown in light blue and green, respectively. The amino acids involved in the substitutions (C56, E66, M72 and W236) and the catalytic residues (D170 and D231) are indicated as a CPK model. Front view (left) and back view (right).
M72 (M72I, M72R, and M72V)
M72 is located on the α-helix (66–84) of the (β/α)8 barrel domain. The ASA value of this residue is 0 Å2, suggesting that it is fully buried. The numbers of atoms influenced by M72I in the main chain, side chain and active site are 38, 46 and 0, respectively, the RMSD value being 0.054 Å. The numbers of atoms influenced by M72R in the main chain, side chain and active site are 145, 198, and 1, respectively, the RMSD value being 0.119 Å. Considering the results, the structural changes in GLA caused by these amino acid substitutions are thought to be large. The patients with these mutations exhibited the classic form of Fabry disease. On the other hand, as to M72V, the numbers of atoms influenced in the main chain, side chain and active site are 7, 6 and 0, respectively, the RMSD value being 0.026 Å. This suggests that the structural change caused by M72V is small, and that it does not affect the active site. The patients with M72V exhibited the later-onset Fabry disease. Coloring of the influenced atoms allowed clear visualization of the differences in the structural changes between these cases (Fig. 2a).
Figure 2. Coloring of the atoms in the three-dimensional structure of GLA influenced by amino acid substitutions at M72 (a), E66 (b), C56 (c), and W236 (d).
The backbone of GLA is shown as a line. The atoms of the substituted residues are indicated as small black spheres and the influenced atoms as large spheres. The colors of the influenced atoms show the distances between the wild type and mutant ones as follows: 0.15 Å≤cyan <0.30 Å, 0.30 Å≤green <0.45 Å, 0.45 Å≤yellow <0.60 Å, 0.60 Å≤orange <0.75 Å, and red ≥0.75 Å.
E66 (E66G, E66K and E66Q)
E66 is located on the α-helix (66–84) of the (β/α)8 barrel domain. The ASA value is 29.2 Å2, suggesting that the residue is half-exposed to the solvent. For the E66G substitution, the numbers of atoms influenced in the main chain, side chain and active site are 45, 74, and 0, respectively, the RMSD value being 0.062 Å. For the E66K substitution, the numbers of atoms affected in the main chain, side chain and active site are 422, 503, and 7, respectively, the RMSD value being 0.361 Å. The patients with such large structural changes exhibited the classic form of Fabry disease. On the other hand, as to the E66Q substitution, which has been reported to be a functional polymorphism [21], the numbers of atoms affected in the main chain, side chain, and active site are 23, 32, and 0, respectively, the RMSD value being 0.048 Å. These results suggest that the structural change is moderate and that it does not affect the active site. Fig. 2b clearly shows that the structural change caused by E66Q is restricted to a small region on the molecular surface, although those caused by E66G and E66K extend over a broad area around the substituted residue.
C56 (C56G, C56F, and C56Y)
C56 is located between two α-helices (47–50 and 66–84). The ASA value of the residue is 38.4 Å2, suggesting that it is exposed to the solvent. The C56 residue forms a disulfide bond with C63 (Fig. 3), and it plays an important role in conformation of the enzyme molecule. Fig. 2c shows the structural changes caused by the C56G, C56F, and C56Y amino acid substitutions. These amino acid substitutions at the C56 position are predicted to disturb the formation of disulfide bond between C56 and C63, and thus the mutant proteins would be excessively degraded before they are transported to the lysosomes. All of the patients with these mutations presented the classic form of Fabry disease.
Figure 3. GLA structure and residues involving a disulfide bond.

The backbone of GLA is shown as a line. The atoms involved in the formation of a disulfide bond (C52-C94, C56-C63, C142-C172, C202-C223, and C378-C382) and the catalytic residues (D170 and D231) are shown as a CPK model. Front view (left) and back view (right).
W236 (W236C, W236L, and W236R)
W236 is located on the α-helix (236–247) of the (β/α)8 barrel domain, the ASA value being 40.6 Å2, suggesting that the residue is exposed to the solvent. As Fig. 2d shows, the structural changes caused by W236C, W236L, and W236R are small (The numbers of atom in the main chain affected by W236C, W236L, and W236R are 2, 0, and 6, respectively, and those in the side chain are 7, 2, and 23, respectively. The RMSD values for them are 0.012 Å, 0.005 Å, and 0.025 Å, respectively). None of them affects the active site. However, as W236 is located on the dimer interface of GLA (Fig. 1), and the side chain of W236 forms a hydrogen bond with E358 (Fig. 4), the amino acid substitution is thought to affect the conformation of the GLA molecule.
Figure 4. The hydrogen bond between W236 and E358.

The side chain of W236 forms a hydrogen bond with E358. The backbone of GLA is shown as a ribbon model, and W236 and E358 are indicated as a stick. The hydrogen bond is shown as a yellow dotted line. Front view (left) and back view (right).
Discussion
Recently, the results of newborn screening revealed a high incidence (1 in ∼1, 250–9,000) of Fabry disease [22]–[24]. As Fabry disease can be treated with recombinant human GLAs [25]–[27], it is very important to understand the basis of the disease and to predict the outcome for patients found on screening. For this purpose, a structural study will provide us with valuable information. Garman and Garboczi reported that there are at least two classes of mutations in GLA that lead to disease progression: those near the active site and those of buried residues distant from the active site that adversely affect the folded state of the molecule, and a mild phenotype tends to be more solvent-accessible than a severe one [6]. Our research group obtained essentially the same results as those of Garman and Garboczi. Our previous study revealed that structural changes in the classic Fabry group were generally large and tended to be localized to the core region or located in the functionally important region including the active site, and that those in the later-onset group were small and localized on the surface of the molecule [8].
As further structural study, we focused on different substitutions at the same residue in the amino acid sequence of GLA, because such specific cases are useful for examining the influence of the severity of the structural changes on the disease progression and for identifying the residues important for the expression of GLA activity.
In this study, we could select 157 amino acid substitutions at 67 residues from two databases, and examined the correlation between the structural changes in GLA and the clinical phenotype. The results revealed that the structural changes leading to the later-onset Fabry disease tend to be smaller than those for the classic Fabry disease, i.e., M72 is buried and E66 is exposed to the solvent, and at both residues, amino acid substitutions causing a small structural change (M72V and E66Q) lead to later-onset Fabry disease or a functional polymorphism, and ones causing a large structural change (M72I, M72R, E66G, and E66K) result in classic Fabry disease. This study also revealed residues important for expression of the GLA activity. A structural change affecting the active site tends to lead to the classic form. C56 and W236 are thought to be involved in the formation of a disulfide bond and the dimer, respectively. Substitutions at these residues should affect proper folding and lead to classic Fabry disease, even if the structural change is small.
In conclusion, we investigated the effects of different substitutions at the same residue in the amino acid sequence of GLA on structural changes in the enzyme molecule and the clinical phenotype. The results revealed that structural changes influence the disease progression. Structural study from such a unique viewpoint is useful for elucidation of the basis of Fabry disease.
Acknowledgments
We wish to thank Dr. J. Ponder (Department of Biochemistry and Molecular Biophysics, Washington University) for providing us with the TINKER software.
Funding Statement
This work was supported by the Program for Research on Intractable Diseases of Health and Labor Science Research (to HS); the Program for the Promotion of Fundamental Studies in Health Sciences of the National Institute of Biomedical Innovation (ID: 09-15, to HS); and the Japan Society for the Promotion of Science (JSPS ID: 23659527, to HS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Desnick RJ, Ioannou YA, Eng CM (2001) α-Galactosidase A deficiency: Fabry disease. In: Scriver CR, Sly WA, Beaudet A L, Valle D, editors. The Metabolic and Molecular Bases of Inherited Disease, eighth edition. New York: McGraw-Hill. pp. 3733–3774.
- 2. Nance CS, Klein CJ, Banikazemi M, Dikman SH, Phelps RG, et al. (2006) Later-onset Fabry disease: an adult variant presenting with the cramp-fasciculation syndrome. Arch Neurol 63: 453–457. [DOI] [PubMed] [Google Scholar]
- 3. MacDermot KD, Holmes A, Miners AH (2001) Anderson-Fabry disease: clinical manifestations and impact of disease in a cohort of 98 hemizygous males. J Med Genet 38: 750–760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Nakao S, Takenaka T, Maeda M, Kodama C, Tanaka A, et al. (2001) An atypical variant of Fabry's disease in men with left ventricular hypertrophy. New Engl J Med 333: 288–293. [DOI] [PubMed] [Google Scholar]
- 5. MacDermot KD, Holmes A, Miners AH (2001) Anderson-Fabry disease: clinical manifestations and impact of disease in a cohort of 60 obligate carrier females. J Med Genet 38: 769–775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Garman SC, Garboczi DN (2004) The molecular defect leading to Fabry disease: Structure of human α-galactosidase. J Mol Biol 337: 319–335. [DOI] [PubMed] [Google Scholar]
- 7.Garman SC (2007) Structure-function relationships in alpha-galactosidase A. Acta Paediatr Suppl 96: 6–16. [DOI] [PMC free article] [PubMed]
- 8. Sugawara K, Ohno K, Saito S, Sakuraba H (2008) Structural characterization of mutant alpha-galactosidases causing Fabry disease. J Hum Genet 53: 812–824. [DOI] [PubMed] [Google Scholar]
- 9. Saito S, Ohno K, Sese J, Sugawara K, Sakuraba H (2010) Prediction of the clinical phenotype of Fabry disease based on protein sequential and structural information. J Hum Genet 55: 175–178. [DOI] [PubMed] [Google Scholar]
- 10. Kundrot C, Ponder J, Richards F (1991) Algorithms for calculating excluded volume and its derivatives as a function of molecular conformation and their use in energy minimization. J Comp Chem 12: 402–409. [Google Scholar]
- 11. Dudek M, Ponder J (1995) Accurate modeling of the intramolecular electrostatic energy of proteins. J Comput Chem 16: 791–816. [Google Scholar]
- 12. Kong Y, Ponder J W (1997) Reaction field methods for off-center multipoles. J Chem Phys 107: 481–492. [Google Scholar]
- 13. Pappu RV, Hart RW, Ponder JW (1998) Analysis and application of potential energy smoothing for global optimization. J Phys Chem B102: 9725–9742. [Google Scholar]
- 14. Ren P, Ponder JW (2003) Polarizable atomic multipole water model for molecular mechanics simulation. J Phys Chem B107: 5933–5947. [Google Scholar]
- 15. Kabsch W (1976) A solution for the best rotation to relate two sets of vectors. Acta Crystallogr A32: 827. [Google Scholar]
- 16. Kabsch W (1978) A discussion of the solution for the best rotation to relate two sets of vectors. Acta Crystallogr A34: 922–923. [Google Scholar]
- 17. Sakuraba H, Matsuzawa F, Aikawa S, Doi H, Kotani M, et al. (2002) Molecular and structural studies of the GM2 gangliosidosis 0 variant. J Hum Genet 47: 176–183. [DOI] [PubMed] [Google Scholar]
- 18. Sakuraba H, Matsuzawa F, Aikawa S, Doi H, Kotani M, et al. (2004) Structural and immunocytochemical studies on α-N-acetylgalactosaminidase deficiency (Schindler/Kanzaki disease). J Hum Genet 49: 1–8. [DOI] [PubMed] [Google Scholar]
- 19. Saito S, Ohno K, Sugawara K, Sakuraba H (2008) Structural and clinical implications of amino acid substitutions in N-acetylgalactosamine-4-sulfatase: Insight into mucopolysaccharidosis type VI. Mol Genet Metab 93: 419–425. [DOI] [PubMed] [Google Scholar]
- 20. Faul F, Erdfelder E, Buchner A, Lang A-G (2009) Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods 41: 1149–1160. [DOI] [PubMed] [Google Scholar]
- 21. Togawa T, Tsukimura T, Kodama T, Tanaka T, Kawashima I, et al. (2012) Fabry disease: Biochemical, pathological and structural studies of the α-galactosidase A with E66Q amino acid substitution. Mol Genet Metab 105: 615–620. [DOI] [PubMed] [Google Scholar]
- 22. Spada M, Pagliardini S, Yasuda M, Tukel T, Thiagarajan G, et al. (2006) High incidence of later-onset Fabry disease revealed by newborn screening. Am J Hum Genet 79: 31–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Hwu WL, Chien YH, Lee NC, Chiang SC, Dobrovolny R, et al. (2009) Newborn screening for Fabry disease in Taiwan reveals a high incidence of the later-onset GLA mutation c.936+919G>A (IVS4+919G>A). Hum Mutat 30: 1397–1405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Nakamura K, Hattori K, Endo F (2011) Newborn screening for lysosomal storage disorders. Am J Med Genet 157: 63–71. [DOI] [PubMed] [Google Scholar]
- 25. Eng CM, Banikazemi M, Gordon RE, Goldman M, Phelps R, et al. (2001) A phase 1/2 clinical trial of enzyme replacement in Fabry disease: Pharmacokinetic, substrate clearance, and safety studies. Am J Hum Genet 68: 711–722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Eng CM, Guffon N, Wilcox WR, Germain DP, Lee P, et al. (2001) Safety and efficacy of recombinant human alpha-galactosidase A-replacement therapy in Fabry's disease. N Engl J Med. 345: 9–16. [DOI] [PubMed] [Google Scholar]
- 27. Schiffmann R, Murray GJ, Treco D, Daniel P, Sellos-Moura M, et al. (2000) Infusion of alpha-galactosidase A reduces tissue globotriaosylceramide storage in patients with Fabry disease. Proc Natl Acad Sci USA 97: 3653–3670. [DOI] [PMC free article] [PubMed] [Google Scholar]


