Abstract
The recently discovered prokaryotic signal transducer HemAT, which has been described in both Archaea and Bacteria, mediates aerotactic responses. The N-terminal regions of HemAT from the archaeon Halobacterium salinarum (HemAT-Hs) and from the Gram-positive bacterium Bacillus subtilis (HemAT-Bs) contain a myoglobin-like motif, display characteristic heme–protein absorption spectra, and bind oxygen reversibly. Recombinant HemAT-Hs and HemAT-Bs shorter than 195 and 176 residues, respectively, do not bind heme effectively. Sequence homology comparisons and three-dimensional modeling predict that His-123 is the proximal heme-binding residue in HemAT from both species. The work described here used site-specific mutagenesis and spectroscopy to confirm this prediction, thereby providing direct evidence for a functional domain of prokaryotic signal transducers that bind heme in a globin fold. We postulate that this domain is part of a globin-coupled sensor (GCS) motif that exists as a two-domain transducer having no similarity to the PER-ARNT-SIM (PAS)-domain superfamily transducers. Using the GCS motif, we have identified several two-domain sensors in a variety of prokaryotes. We have cloned, expressed, and purified two potential globin-coupled sensors and performed spectral analysis on them. Both bind heme and show myoglobin-like spectra. This observation suggests that the general function of GCS-type transducers is to bind diatomic oxygen and perhaps other gaseous ligands, and to transmit a conformational signal through a linked signaling domain.
Keywords: proximal histidine, transducer
Globins are heme-containing proteins that are involved in binding and/or transport of diatomic oxygen. Presently, more than 700 globin sequences are known (1). It has been proposed that all globins have evolved from an ancestral redox protein of about 17 kDa that displayed the globin fold, which is characterized by the presence of eight helices, designated A through H (2). The residues absolutely conserved among all globins are the proximal histidine in the F helix (F8) and phenylalanine in the CD region (CD1) (3, 4). Highly conserved residues include the distal histidine in the E helix (E7), phenylalanine in the CD4 region, and proline at the beginning of the C helix (C2).
We recently discovered heme-containing transducers in the archaeon Halobacterium salinarum (HemAT-Hs) and the Gram-positive bacterium Bacillus subtilis (HemAT-Bs). These proteins bind diatomic oxygen and mediate an aerotactic response (5). The N termini of these transducers resemble myoglobin, and their C termini are homologous to the cytoplasmic signaling domain of bacterial chemoreceptors. We have also described three-dimensional homology models of the putative oxygen-sensing domain of HemATs (6). In these models the overall globin topology, including the orientation of the heme prosthetic group, is preserved, as is the hydrophobic core of the heme-binding pocket and the electrostatic stabilization of the CD region. Therefore, an experimental determination of the organization of the heme-binding pocket is of particular interest because this domain regulates the activity of the signaling domain in response to the binding of oxygen.
In this study we aimed to identify the regions of HemATs required for heme binding. To localize the minimal heme-binding region, we generated several C-terminal truncated derivatives of HemATs. The HemAT-Hs195 and HemAT-Bs176 fragments retained the heme- and oxygen-binding properties of the respective native proteins, whereas shorter versions of either protein did not bind heme effectively. We also determined the effect of replacing each histidine in these fragments with alanine and identified His-123 as the proximal heme-binding residue in both HemATs. Using this information, we constructed a 90-residue myoglobin-like domain transducer motif and identified several globin-coupled two-domain sensors in a variety of prokaryotes. We then cloned, expressed, purified, and performed spectral analysis of globin-coupled sensors (GCSs) from Caulobacter crescentus and Thiobacillus ferrooxidans. Both of them bind heme and show myoglobin-like spectra. We propose that globin-coupled sensors are a class of two-domain transducers having no similarity to the PER-ARNT-SIM (PAS)-domain superfamily transducers (7–10).
Materials and Methods
Strains.
Escherichia coli competent cells JM109 from Promega were used for vector transformation. BL21(DE3) pLysS cells from Novagen were used for expression of the recombinant proteins. TOP10 cells from Invitrogen were used for TOPO PCR cloning.
Homology Modeling and Sequence Extraction and Alignment.
Homology modeling of HemATs was performed according to Hou (6). Using the blast program (11), the entire nonredundant database at the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/blast) was searched. Sequencing of C. crescentus, Bacillus anthracis and T. ferrooxidans was accomplished at The Institute of Genomic Research (TIGR) (http://www.tigr.org). Sequencing of Bordetella bronchiseptica and Bordetella pertussis was performed at the Sanger Centre (http://www.sanger.ac.uk). Subsequent blast searches were performed on the finished and unfinished genomic sequences at TIGR and the Sanger Centre. Extracted sequences were initially aligned by using clustalx (Windows-based version of clustalw is available at ftp://ftp.ebi.ac.uk/pub/software/dos/clustalw/clustalx) and were manually adjusted by using MegAlign (DNAStar, Madison, WI).
Construction of HemAT-Hs and HemAT-Bs Truncations.
C-terminal six-His-tagged HemAT-Hs and HemAT-Bs truncations were constructed by a two-step PCR strategy. NdeI top primer and a site-specific primer were used in the first PCR to amplify different lengths of gene fragments. For HemAT-Hs, four different gene fragments encoding polypeptide fragments were created. Site-specific primer sequences are listed as follows: hemAT-Hs185 (5′-GTGGTGGTGGTGGTGGTGTGCGATCTGCTGATCGAAGGTCAACAGC-3′), hemAT-Hs190 (5′- GTGGTGGTGGTGGTGGTGGATGTAGGTGTCCATTGCGATCTGC-3′), hemAT-Hs195 (5′-GTGGTGGTGGTGGTGGTGCTGGGCGTACGAGTCGATGTAGGTG-3′), and hemAT-Hs200 (5′-GTGGTGGTGGTGGTGGTGCTCGTCGTGGAGGCGCTGGGCGTACG-3′). For HemAT-Bs, three different polypeptide fragments were created. Site-specific primers are AS FOLLOWS: hemAT-Bs171 (5′-TTAGTGGTGGTGGTGGTGGTGTAAGTTCAAGATTTTTGTTGTTGC-3′), hemAT-Bs176 (5′-TTAGTGGTGGTGGTGGTGGTGGACAAGCTGCTGTTCTAAGTTCAAG-3′), and hemAT-Bs180 (5′-TTAGTGGTGGTGGTGGTGGTGTTAAAACGCTTAAAGGACAAGC-3′). The PCR products were subcloned into E. coli expression vector pET3a.
Construction of N-Terminal His-Tagged HemAT-Hs and Its Histidine Mutation Derivatives.
A recombinant gene fragment encoding the N-terminal His-tag HemAT-Hs construction (12) was used as the template for PCR site-directed mutagenesis. Individual histidine mutants were generated by using the QuickChange Site-Directed Mutagenesis protocol (Stratagene). Each mutated hemAT-Hs in pET14b vector was confirmed by DNA sequencing. Final constructions containing individual mutations were transformed into E. coli pLysS cells for expression.
Construction of C-Terminal His-Tagged HemAT-Bs and Its Histidine Mutation Derivatives.
A recombinant hemAT-Bs gene was constructed by using a two-step PCR strategy. In the first PCR step, a six-His codon was fused to the hemAT-Bs in front of the stop codon. In the second PCR step, a BamHI restriction site was introduced at the 3′ end of the gene. A TOPO cloning kit (Invitrogen) was used to clone the PCR product. NdeI/BamHI restriction sites were used to subclone the hemAT-BsC-6x-His fragment into the E. coli expression vector pET3a. This construction was used as a template for site-directed mutagenesis PCR according to the procedure described above. Mutated genes were confirmed by DNA sequencing and the final vectors were transformed into the E. coli pLysS cells for expression.
Cloning and Construction of Putative GCS Gene Expression Vectors.
Genomic DNAs of C. crescentus and T. ferrooxidans were purchased from the American Type Culture Collection. Putative GCS genes were amplified by PCR. The primers are listed as follows. For C. crescentus 5′-GGCATATGGTCAACCAATTCTTACCAGCGAGTCGG-3′ and 5′-CTAGTGGTGGTGGTGGTGGTGGCGCCCGGGACGGGCGAAAGCGGC-3′; for T. ferrooxidans 5′-CGCATATGATCTTGATCGATTGAAAAAAGC-3′ and 5′-TAGTGGTGGTGGTGGTGGTGCTTGGCCGGCAAGTCGTCGCAG-3′. The final PCR products were cloned into a TOPO vector, and genes were subcloned into E. coli expression vector pET3a.
Expression and Purification of Recombinant HemATs and Putative GCSs in E. coli.
Expression and one-step purification of HemATs, their histidine mutant derivatives, truncated HemATs, and putative GCSs were performed according to Piatibratov et al. (12).
Overexpression of HemAT-H123A in Their Native Hosts.
hemAT-Hs/pKJ427 and hemAT-Bs/pEB112 constructions (5) were used as templates for PCR site-directed mutagenesis of His-123 as described above. Mutations were confirmed by DNA sequencing, and vectors carrying mutated genes were then transformed into the H. salinarum ΔhemAT-Hs strain and B. subtilis Δten strain (5) by using standard halobacterial and B. subtilis transformation procedures (5).
SDS/PAGE and Heme Staining.
The purified HemAT and truncation samples were analyzed by modified dimethoxy-benzidine heme staining as described by Piatibratov et al. (12).
Determination of Protein Concentration and Heme Content.
Protein concentrations were determined by using the Pierce Coomassie Plus Protein Assay Reagent kit according to the manufacturer's protocol. The heme content of purified HemATs was determined by the alkaline hematin method (13).
Results
Identification of the Oxygen-Binding Domains of HemAT-Hs and HemAT-Bs.
We began by generating fragments of hemAT-Hs that encode polypeptides spanning the first 185, 190, 195, and 200 residues, respectively, of the normal gene product. These recombinant proteins were constructed with N-terminal His tags and were purified (Fig. 1A). The effect of C-terminal truncation was explored by examining their absorption spectra in the near-UV and visible region for peaks characteristic of oxygen-bound heme proteins. Fragments HemAT-Hs195 and HemAT-Hs200 bound heme more effectively than the shorter fragments. In particular, HemAT-Hs185 displayed relatively little heme binding as indicated by the low absorption peak at 410 nm (Fig. 1C). The low absorption peak of this polypeptide may be due to changes in the conformation of the heme-binding pocket or to improper folding of the domain.
Similarly, we generated His-tagged fragments of hemAT-Bs that encode polypeptides spanning the first 171, 176, and 180 residues of the protein (Fig. 1B). HemAT-Bs171 formed inclusion bodies and required 6–8 M urea to solubilize. Therefore, absorption spectra were determined only with the other two fragments (Fig. 1D). As seen previously with full-length HemAT-Bs, the oxygen-bound forms of these fragments were relatively short lived, converting to the met [Fe(III)] form after about 5 h at room temperature and after about 24 h at 4°C. The spectra of the met forms were very similar to that of the met form of hemoglobin from Paramecium (14) and cyanobacteria (15). In contrast, the oxygen-bound form of HemAT-Hs195 has a lifetime of 24–48 h at room temperature and of several weeks at 4°C (data not shown).
HemAT-Hs195 and HemAT-Bs176 Bind Oxygen Reversibly.
Both HemAT-Hs195 and HemAT-Bs176 have absorption spectra typical of oxygen-bound heme proteins, with maxima at 410 nm (Soret), 580 nm (α band), and 542 nm (β band) (Fig. 2 A and B). After deoxygenation with sodium dithionite, the Soret bands of both fragments shifted to 428 nm, and the α and β bands converged to a broad peak at 554 nm for HemAT-Hs195 and 560 nm for HemAT-Bs176. This behavior is consistent with the formation of the deoxygenated forms and is similar to the effect seen with deoxymyoglobin (5). When the deoxygenated forms of HemAT-Hs195 and HemAT-Bs176 were reexposed to atmospheric oxygen, their absorption spectra reverted to that of the oxygenated forms (Fig. 2 A and B). Thus, both HemAT-Hs195 and HemAT-Bs176 behave like other well characterized, heme-containing proteins and exhibit a reversible oxygen-binding capacity such as that seen with their full-length counterparts (5). We measured the oxygen-binding affinities of HemAT-Hs and HemAT-Bs. The association constants for oxygen binding to both HemAT-Hs and HemAT-Bs are 8 × 106 M−1 and 4 × 105 M−1, respectively, as compared with 1.2 × 106 M−1 for sperm whale myoglobin (SWMb) and 2.0 × 104 M−1 for RmFixLH (16).
His-123 Is the Proximal Residue in Both HemATs.
All globins have a histidine residue that binds directly to the fifth coordinate of the heme iron. The His-123 residue in both HemAT-Hs and HemAT-Bs can be aligned with this histidine residue (His-93, F8) in myoglobin (5). We generated models of the three-dimensional structures of HemAT-Hs195 and HemAT-Bs176 (Fig. 3 A and C) based on the overall fold of myoglobin (Fig. 3B). The side chain of the relevant heme-binding histidine residue is shown in each structure along with the heme prosthetic group itself. The models for HemAT-Hs195 and HemAT-Bs176 are both consistent with the idea that His-123 is the residue that directly contacts the heme iron.
To determine experimentally whether His-123 is involved in heme binding, the four histidine residues in the N-terminal domains of HemAT-Hs (His-20, His-71, His-123, and His-198) and HemAT-Bs (His-75, His-86, His-99, and His-123) were changed to alanine residues by site-directed mutagenesis. Modified heme-specific staining was performed on these proteins after subjecting them to nondenaturing PAGE. Heme staining was observed with the wild-type protein and all of the mutant proteins from both species except for the Ala-123 variants (data not shown).
Fig. 4 shows the optical properties of purified HemAT-Hs and HemAT-Bs and their alanine-substituted derivatives. The spectra for most of the variants resemble those of the parental proteins, but both of the Ala-123 mutant proteins lack the characteristic spectral features associated with heme binding. The Ala-123 proteins did have a low absorbance near 410 nm that might be attributable to nonspecific heme binding. In capillary assays, no aerotactic response was observed with either H. salinarum or B. subtilis strains expressing the respective Ala-123 proteins, whereas the other mutant proteins exhibited aerotactic responses similar to those of cells producing the corresponding wild-type proteins (data not shown). Taken together, these data provide compelling support for the identity of His-123 as the proximal heme-binding residue in both HemATs.
GCS Motif.
The heme-containing oxygen sensors FixL (17, 18) and Dos (19) are members of the PAS-domain superfamily (7–10). The PAS domain has a distinctive fold with a six-stranded β sheet, and different proteins containing this domain are known to sense redox potential, oxygen, proton motive force, or light. However, HemAT proteins do not contain a PAS domain. We speculated that the oxygen-sensing globin-fold domain of HemAT could be more widespread than previously thought. To test this idea, we designed a 90-residue template that contains the globin-coupled transducer motif. This sequence was used to search for potential two-domain GCSs. A critical feature of the GCS motif is the spacing between the highly conserved phenylalanine in the CD1 region and the highly conserved histidine at the F8 position (3).
We performed a blast search with this GCS motif in all available genome databases and found six additional HemAT homologs (Fig. 5A). The C. crescentus, B. anthracis, and Bacillus halodurans homologs are classified as methyl-accepting chemotaxis proteins, and an alignment of the highly conserved signaling domain diagnostic for these proteins is shown in Fig. 5B. The remaining proteins showing the GCS motif include putative histidine kinase from Bordetella spp. and phosphodiesterase from T. ferrooxidans. Members of this last group of proteins contain potential diguanylate cyclase/phosphodiesterase domains (Fig. 5C). We cloned, expressed, purified, and performed spectral analysis of two predicted GCSs from C. crescentus and T. ferrooxidans. Both displayed similar absorption spectra in the near-UV and visible regions, as is characteristic of oxygen-bound heme proteins (Fig. 5D).
Discussion
Myoglobin folds into a single monomeric, globular domain that contains eight α-helices and one high-affinity heme-binding site. The work presented here identified a similar heme-binding unit in archaeal and bacterial aerotaxis transducers, which we call HemATs. The N-terminal 195 residues of HemAT from the archaeon H. salinarum (HemAT-Hs195) and the 176 N-terminal residues of HemAT from the Gram-positive bacterium B. subtilis (HemAT-Bs176) comprise minimal domains that bind heme as effectively as the full-length proteins. These sequences coincide approximately with the regions in which the HemATs can be aligned with the myoglobin sequence (5).
The function of His-123 as the primary heme-binding residue in both HemAT-Hs and HemAT-Bs distinguishes these proteins from the CooA CO sensors, which contain a thiolate ligand in the Fe(III) state and a histidine-bound ligand in the Fe(II) state (20, 21). Other heme-based sensors, such as FixL and guanylate cyclase, bind heme exclusively through coordination with histidine (20). The structural homology between HemAT and SWMb is apparent when the amino acid sequences are aligned (5). His-123 of HemATs is in a position corresponding to the heme-binding His (F8) of SWMb. Using this alignment, helical regions in HemATs can be assigned and shown to correspond to the A, B, C, D, E, F, G, and H helices of SWMb. HemATs also contain loop regions connecting the helical domains, although their length varies significantly from the length of the loops in SWMb (6). Identification of this structural homology depends critically on the demonstration that His-123 is the proximal heme-binding residue in HemAT.
None of the previously described heme-based sensors, including FixL (O2 sensor), soluble guanylate cyclase (NO sensor), Dos protein (O2 sensor), and CooA protein (CO sensor) use the globin fold (18, 19, 20). HemAT-Hs and HemAT-Bs are thus, to our knowledge, the first group of heme-based sensors shown to employ the globin motif and to couple it to the signaling functions of the methyl-accepting chemotaxis proteins. The GCS motif that we describe also identified two additional proteins that are thought to be a histidine kinase and a phosphodiesterase, respectively. The spectral properties of GCS from C. crescentus and T. ferrooxidans indeed confirmed our prediction of the existence of a GCS class. Thus, the presence of HemAT homologs in divergent microorganisms (from Archaea to Bacteria) indicates that GCSs are likely to be early arising components during evolution of the sensory arsenal of life.
Our data are consistent with a two-domain model for globin-coupled sensors. In the case of HemATs, the N-terminal domain serves as an oxygen sensor by virtue of its bound heme, whereas the C-terminal domain shows homology to chemotactic signal transducers (Fig. 5B) and presumably interfaces with the pathway that controls motility. This transducer-like domain begins at residue 222 for HemAT-Hs and at residue 198 for HemAT-Bs. We postulate that the intervening “linker regions” are involved in communication between the sensing and signaling domains, probably through propagation of conformational changes initiated upon binding oxygen to heme (Fig. 5E).
In summary, we propose that the GCS represents a class of sensors distinct from the PAS-domain superfamily. These sensors may have evolved as two-domain proteins with the capacity to sense diatomic oxygen and other gaseous ligands through a sensor domain that binds heme with globin fold. The sensor domain transmits the information of ligand binding to the rest of the cell through a variety of linked signaling domains (Fig. 5E).
Acknowledgments
We thank Gerald Hazelbauer, Mike Manson, Sandy Parkinson, Barry Taylor, Paul Patek, and JoAnn Radway for helpful comments on the manuscript. We also thank Hyung Suk Yu, Claude Belisle, and Summer Lum for participation in different stages of protein purification. This investigation was supported by National Science Foundation Grant MSB-087124 and by a University of Hawaii intramural grant to M.A.
Abbreviations
- GCS
globin-coupled sensor
- PAS
PER-ARNT-SIM
- SWMb
sperm whale myoglobin
- TIGR
The Institute of Genomic Research
Footnotes
This paper was submitted directly (Track II) to the PNAS office.
References
- 1.Kapp O H, Moens L, Vanfleteren J, Trotman C N, Suzuki T, Vinogradov S N. Protein Sci. 1995;4:2179–2190. doi: 10.1002/pro.5560041024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Moens L, Vanfleteren J, Van de Peer Y, Peters K, Kapp O, Czeluzniak J, Goodman M, Blaxter M, Vinogradov S N. Mol Biol Evol. 1996;13:324–333. doi: 10.1093/oxfordjournals.molbev.a025592. [DOI] [PubMed] [Google Scholar]
- 3.Bashford D, Chothia C, Lesk A M. J Mol Biol. 1987;196:199–216. doi: 10.1016/0022-2836(87)90521-3. [DOI] [PubMed] [Google Scholar]
- 4.Vinogradov S N, Walz D A, Pohajdak B. Comp Biochem Physiol. 1992;103:759–773. doi: 10.1016/0305-0491(92)90193-u. [DOI] [PubMed] [Google Scholar]
- 5.Hou S, Larsen R W, Boudko D, Riley C W, Karatan E, Zimmer M, Ordal G W, Alam M. Nature (London) 2000;403:540–544. doi: 10.1038/35000570. [DOI] [PubMed] [Google Scholar]
- 6.Hou S. Ph.D. thesis. Honolulu: Univ. of Hawaii; 2000. [Google Scholar]
- 7.Taylor B L, Zhulin I B. Microbiol Mol Biol Rev. 1999;63:479–506. doi: 10.1128/mmbr.63.2.479-506.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zhulin I B, Taylor B L. Mol Microbiol. 1998;29:1522–1523. [PubMed] [Google Scholar]
- 9.Zhulin I B, Taylor B L, Dixon R. Trends Biochem Sci. 1997;22:331–333. doi: 10.1016/s0968-0004(97)01110-9. [DOI] [PubMed] [Google Scholar]
- 10.Gong W, Hao B, Mansy S S, Gonzalez G, Gilles-Gonzalez M A, Chan M K. Proc Natl Acad Sci USA. 1998;95:15177–15182. doi: 10.1073/pnas.95.26.15177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Altschul S F, Gish W, Miller W, Myers E W, Lipman D J. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 12.Piatibratov M, Hou S, Brooun A, Yang J, Chen H, Alam M. Biochim Biophys Acta. 2000;1524:149–154. doi: 10.1016/s0304-4165(00)00151-3. [DOI] [PubMed] [Google Scholar]
- 13.Zander L, Lang W, Wolf H. Clin Chim Acta. 1984;136:83–93. doi: 10.1016/0009-8981(84)90250-x. [DOI] [PubMed] [Google Scholar]
- 14.Tsubamoto Y, Matsuoka A, Yusa K, Shikama K. Eur J Biochem. 1990;193:55–59. doi: 10.1111/j.1432-1033.1990.tb19303.x. [DOI] [PubMed] [Google Scholar]
- 15.Thorsteinsson M V, Bevan D R, Ebel R E, Weber R E, Potts M. Biochim Biophys Acta. 1996;1292:133–139. doi: 10.1016/0167-4838(95)00178-6. [DOI] [PubMed] [Google Scholar]
- 16.Gilles-Gonzalez M A, Gonzalez G, Perutz M F. Biochemistry. 1994;33:8067–8073. doi: 10.1021/bi00192a011. [DOI] [PubMed] [Google Scholar]
- 17.Gilles-Gonzalez M A, Ditta G S, Helinski D R. Nature (London) 1991;350:170–172. doi: 10.1038/350170a0. [DOI] [PubMed] [Google Scholar]
- 18.Monson E K, Weinstein M, Ditta G S, Helinski D R. Proc Natl Acad Sci USA. 1992;89:4280–4284. doi: 10.1073/pnas.89.10.4280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Delgado-Nixon V M, Gonzalez G, Gilles-Gonzalez M A. Biochemistry. 2000;39:2685–2691. doi: 10.1021/bi991911s. [DOI] [PubMed] [Google Scholar]
- 20.Rodgers K R. Curr Opin Chem Biol. 1999;3:158–167. doi: 10.1016/S1367-5931(99)80028-3. [DOI] [PubMed] [Google Scholar]
- 21.Shelver D, Kerby R L, He Y, Roberts G P. Proc Natl Acad Sci USA. 1997;94:11216–11220. doi: 10.1073/pnas.94.21.11216. [DOI] [PMC free article] [PubMed] [Google Scholar]