Introduction
Myo-Inositol is commonly found in nature, particularly in soil. Various microorganisms, including soil bacteria, are able to grow on inositol as the sole source of carbon.1 These findings suggest that myo-inositol catabolism might be conserved among these different microorganisms, although the molecular genetics of the many genes involved have not yet been well characterized.1 In B. subtilis genome, the inositol catabolism pathway consists of several genes including iolI. The iolI gene (NCBI accession P42419) codes for a conserved 278-residue protein of unknown function that shows no sequence homology with proteins of known structure [cluster 2500 in PROTOMAP, www.protomap.cs.huji.ac.il]. Therefore, it was suggested that this protein may display a previously unobserved structural fold. There are sequence homologies to other structurally undetermined proteins, predominately from the B. subtilis family, including one in the inositol catabolism pathway (iolH). PSI-BLAST2 analysis revealed 335 sequence homologs of IolI in bacteria, archaea, and eukaryota (using inclusion threshold 0.005). As a part of the Midwest Center for Structural Genomics (MCSG) initiative (www.mcsg.anl.gov), we have determined the crystal structure of IolI protein at 1.6 Å resolution by using a semiautomated, high-throughput approach. After acquisition of the crystallographic data, structure solution and refinement were completed by using <6.5 h of CPU time. Rapid data acquisition and fast structure determination capabilities are essential components of the high-throughput structural genomics programs. Analysis of the IolI crystallographic structure, however, reveals that the protein adopts a beta-barrel (TIM) configuration3 [Fig.1(a)], suggesting structural homology to both endonuclease IV,4 a DNA repair enzyme, and xylose isomerase,5 a sugar-metabolizing enzyme (PDB accession 1QTW and 4XIS, respectively).
Fig. 1.
Structure of IolI protein and its metal-binding site. a: Ribbon diagram showing the spatial arrangement of α-helices and β-sheets comprising beta barrel conformation adopted by IolI protein. Side-chains of conserved residues are shown, including those coordinating Zn ion (green). View is approximately perpendicular to the axis of the barrel. b: 2Fo-Fc map of IolI protein in the presence of 2 mM ZnCl2. c: 2Fo-Fc map of IolI protein in the presence of 2 mM MnCl2. A water molecule (H2O92) is residing in roughly the same location as the zinc atom of Figure 1(b).
Materials and Methods
The open reading frame of B. subtilis IolI protein was amplified from genomic DNA with Pfx DNA polymerase by using conditions and reagents provided by the vendor (Invitrogen, Carlsbad, CA). The gene was cloned into the pET30XaLIC (Novagen, Madison, WI) by using the Ligation Independent Cloning protocol.6 The resulting expression clone expressed a fusion protein with a thrombin-cleavable His-tag and a factor Xaclevable S-tag. The selenomethionine (SeMet) derivative protein was overexpressed in E. coli BL21-Gold(DE3) (Stratagene) as in Ref. 9. The cells were lysed in binding buffer (300 mM NaCl, 5% glycerol, 50 mM Na phosphate, pH 8.0, 10 mM imidazole, and 10 mM β-mercaptoethanol) by sonication after the addition of lysozyme to 1 mg/mL and 1 mM PMSF. The lysate was clarified by centrifugation, passed through a 0.2-µm filter and applied to Ni-NTA Superflow resin (Qiagen). The protein was eluted from the column with 250 mM imidazole and dialyzed into thrombin cleavage/capture buffer A (20 mM Tris-HCl, pH 8.4, 0.15 mM NaCl, 2.5 mM CaCl2). The His-tag was cleaved from the protein by treatment with biotinylated thrombin by following the manufacturer’s protocol (Novagen) The cleaved protein was then purified from the cleaved His-tag and uncleaved protein by passing the mixture through a Ni-NTA column. SeMet-labeled protein was purified by using identical protocol.
Crystals of SeMet-derivitized IolI protein were grown in hanging drops containing 1.50–1.75 M ammonium sulphate, 0.050 M Tris buffer, pH 8.00, 0.10 M sodium chloride, 1.0 mM ethylenediaminetetraacetic acid (EDTA), and 3.0 mM dithiothreitol (DTT). A single crystal of approximately 0.3 × 0.3 × 0.2 mm was cryoprotected by using 25% w/v sucrose made up in reservoir solution, and flash-frozen in liquid nitrogen. The crystal belonged to orthorhombic space group P21212, with cell dimensions a = 74.6 Å, b = 104.7 Å, c = 48.2 Å,α = β = γ = 90°. The absorption edge of Se was determined by using a fluorescence scan of the IolI crystal, followed by examination of the fluorescence data using CHOOCH.8 A three-wave-length MAD data set was collected in < 1 hour using the Structural Biology Center 19ID beamline of the Advanced Photon Source (APS), Argonne National Laboratory (ANL), and data were processed in near real time by using the HKL2000 suite.8 Diffraction data extended to 1.6 Å resolution. All six Se atoms in asymmetric unit were found by using SOLVE10 and CNS11 programs. The 1.6 Å MAD map produced by CNS was of superior quality and allowed autotracing of 276 residues with wARP12 and remaining two residues with QUANTA.13 Final refinement was completed by using CNS (annealing, water molecule identification, individual isotropic B refinement). The entire process of data processing, MAD phasing, autotracing, and refinement took a total of 6.5 h CPU time. Crystal characteristics, data collection, and structure solution results and structure refinement statistics are shown in Table I.
TABLE I.
Summary of IolI Crystal Data, MAD Data Collection, and Refinement
| Unit cell parameters (angstroms, degrees) | a = 74.29 Å, b = 104.77 Å, c = 48.42 Å, α = β = γ = 90 |
||||
| Space group | P 21212 (#18) | ||||
| Molecular weight [278 residues (SeMet)] | 31,937 Da | ||||
| Molecules per asymmetric unit (a.u.) | 1 | ||||
| Selenomethionine residues per a.u. | 6 | ||||
| MAD data | Edge energy | Peak energy | High energy | ||
| Wavelength (Å) | 0.9791 | 0.9793 | 0.9466 | ||
| Resolution limit (Å) | 1.60 | 1.60 | 1.60 | ||
| No. of unique reflections: | 49,179 | 49,182 | 49,100 | ||
| Overall data completeness (%) | 96.9 | 96.9 | 96.7 | ||
| Overall data redundancy | 6.35 | 7.22 | 7.34 | ||
| Overall Rmerge (%) | 9.0 | 8.4 | 6.7 | ||
| Figure of merit (FOM) | 0.709 | ||||
| Phasing power | 2.43 | 2.85 | 1.60 | ||
| Resolution range (Å) | 50.0–1.6 | ||||
| No. of reflections (all) | 96,476 | ||||
| No. of reflections (observed) | 90,306 | ||||
| Percent reflections observed | 93.6 | ||||
| σ Cutoff | 0.0 | ||||
| Overall R-value (%) | 0.213 | ||||
| Free R-value (%) | 0.217 | ||||
| RMSD from ideal geometry | |||||
| Bond length | 0.006 | ||||
| Angle | 1.5 | ||||
| Dihedral | 21.8 | ||||
| Improper | 0.75 | ||||
| No. of protein non-hydrogen atoms | 2,229 | ||||
| No. of water molecules | 182 | ||||
| Mean B-factor (Å 2) | 12.5 | ||||
| Ramachandran plot statistics (%) | |||||
| Residues in most favored regions | 92.5 | ||||
| Residues in additional allowed regions | 7.1 | ||||
| Residues in generously allowed regions | 0.4 | ||||
| Residues in disallowed region | 0.0 | ||||
IolI protein shows structural homology with E. coli endonuclease IV and xylose isomerase. All three proteins show a TIM barrel fold and bind metal ions. However, there are striking differences in the metal-binding pocket. Endonuclease IV binds three Zn+2 ions,4 whereas xylose isomerase binds two Mn+2 ions.5 To test the metal-binding properties of the IolI protein, crystals were soaked for 2 h in 2 mM ZnCl2 or 2 mM MnCl2. Single wavelength data sets were collected to 1.8 Å resolution. The resulting structures revealed a single Zn ion bound via residues Asp174, His177, His200, and Glu246 (Fig. 1, Table II). The apparent inability to bind Mn+2 ions suggests the metal-binding region of the IolI protein better resembles endonuclase IV catalytic site. However, IolI binds only one Zn ion (site 3) and apparently is unable to bind Zn at sites 1 and 2. The superposition of the two proteins shows that endonuclease IV’s His69 residue coordinating Zn in site 1 has been replaced with Asn66 in IolI, whereas the Zn2 site has His231 replaced with Arg217. Analysis of the Zn3-binding site, however, shows that all four zinc-binding residues are preserved in both endonuclease IV and IolI; it is this site that actively binds a single zinc ion in IolI protein (Fig. 1). Therefore, introduction of just two mutations, His66→Asn and His217→Arg, can explain the lost of the Zn1 and Zn2 binding. The substitution of critical binding-pocket residues at zinc sites 1 and 2 suggest IolI may have a different biochemical function than endonuclase IV. In fact, among IolI sequence homologs, several sugar/alcohol-metabolizing enzymes are found such as hexulose-6-phosphate isomerase, sugar-phosphate isomerase, D-tagatose 3-epimerase, 4-hydroxyphenylpyruvate dioxygenase, hydroxy-pyruvate isomerase. Our data show that catalytic sites of proteins can undergo significant protein sequence modifications that alter the metal ion-binding properties. This may have significant role in evolution of protein function.
TABLE II.
Comparison of Zinc-Binding Site Residues in E. coli Endonuclease IV and B. subtilis IolI
| Endonuclease IV (E. coli) | IOLI (B. subtilis) |
|---|---|
| His69 Zn1 | Asn66 |
| Glu145 Zn1 | Glu142 |
| Asp229 Zn2 | Asp216 |
| His231 Zn2 | Arg217 |
| Asp179 Zn3 | Asp174Zn1 |
| His182 Zn3 | His177 Zn1 |
| His216 Zn3 | His200 Zn1 |
| Glu261 Zn3 | Glu246Zn1 |
Acknowledgments
We thank all members of the Structural Biology Center at Argonne National Laboratory for their help in conducting experiments, Dr. Roman Las-kowski for help with CATH analysis, and Lindy Keller for help in preparation of this manuscript.
Grant sponsor: National Institutes of Health Grant; Grant number: GM62414-01; Grant sponsor: U.S. Department of Energy, Office of Biological and Environmental Research; Grant number: W-31-109-Eng-38.
Footnotes
Atomic coordinates for IOLI plus zinc and IOLI minus zinc have been deposited into the Protein Data Bank (PDB) as entries 1I6Nand 1I6O, respectively.
REFERENCES
- 1.Yoshida KI, Aoyama D, Ishio I, Shibayama T, Fujita Y. Organization and transcription of the myo-inositol operon, iol, of Bacillus subtilis. J Bacteriol. 1997;179:4591–4598. doi: 10.1128/jb.179.14.4591-4598.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Altschul SF, Thomas LM, Schäffer AA, Zhang JH, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wierenga RK. The TIM-barrel fold: a versatile framework for efficient enzymes. FEBS Lett. 2001;492:193–198. doi: 10.1016/s0014-5793(01)02236-0. [DOI] [PubMed] [Google Scholar]
- 4.Hosfield DJ, Guan Y, Haas BJ, Cunningham RP, Tainer JA. Structure of the DNA repair enzyme endonuclease IV and its DNA complex: double-nucleotide flipping at abasic sites and three-metal-ion catalysis. Cell. 1999;98:397–408. doi: 10.1016/s0092-8674(00)81968-6. [DOI] [PubMed] [Google Scholar]
- 5.Whitlow M, Howard AJ, Finzel BC, Poulos TL, Winborne E, Gilliland GL. A metal-mediated hydride shift mechanism for xylose isomerase based on the 1.6 A Streptomyces rubiginosus structures with xylitol and D-xylose. Proteins. 1991;9:153–173. doi: 10.1002/prot.340090302. [DOI] [PubMed] [Google Scholar]
- 6.Aslanidis C, de Jong PJ. Ligation-independent cloning of PCR products (LIC-PCR) Nucleic Acids Res. 1990;18:6069–6074. doi: 10.1093/nar/18.20.6069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Evans G, Pettifer RF. CHOOCH: a program for deriving anomalous-scattering factors from X-ray fluorescence spectra. J Appl Crystallogr. 2001;34:82–86. [Google Scholar]
- 8.Otwinowski Z, Minor W. Processing of x-ray diffraction data collected in oscillation mode. Methods Enzymol. 1997;276:307–326. doi: 10.1016/S0076-6879(97)76066-X. [DOI] [PubMed] [Google Scholar]
- 9.Walsh MA, Dementieva I, Evans G, Sanishvili R, Joachimiak A. Taking MAD to the extreme: ultrafast protein structure determination. Acta Crystallogr. 1999;D55:1168–1173. doi: 10.1107/s0907444999003698. [DOI] [PubMed] [Google Scholar]
- 10.Terwilliger TC, Berendzen J. Automated MAD and MIR structure solution. Acta Crystallogr. 1999;D55:849–861. doi: 10.1107/S0907444999000839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Brunger AT, et al. Crystallography and NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. 1998;D54:905–921. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
- 12.Perrakis A, Morris RJ, Lamzin VS. Automated protein model building combined with iterative structure refinement. Nat Struct Biol. 1999;6:458–463. doi: 10.1038/8263. [DOI] [PubMed] [Google Scholar]
- 13.QUANTA. San Diego: Molecular Simulations Inc; 2000. [Google Scholar]

