Abstract
Surface (S)-layers, para-crystalline arrays of protein, are deposited in the envelope of most bacterial species. These surface organelles are retained in the bacterial envelope through the non-covalent association of proteins with cell wall carbohydrates. Bacillus anthracis, a Gram-positive pathogen, produces S-layers of the protein Sap, which uses three consecutive repeats of the surface-layer homology (SLH) domain to engage secondary cell wall polysaccharides (SCWP). Using x-ray crystallography, we reveal here the structure of these SLH domains, which assume the shape of a three-prong spindle. Each SLH domain contributes to a three-helical bundle at the spindle base, whereas another α-helix and its connecting loops generate the three prongs. The inter-prong grooves contain conserved cationic and anionic residues, which are necessary for SLH domains to bind the B. anthracis SCWP. Modeling experiments suggest that the SLH domains of other S-layer proteins also fold into three-prong spindles and capture bacterial envelope carbohydrates by a similar mechanism.
Keywords: Bacteria, Protein Domains, Protein Sorting, Protein Structure, Protein Targeting, Bacillus anthracis, S-layer, SAP, SLH Domains, Secondary Cell Wall Polysaccharide
Introduction
Surface layers (S-layers)3 are para-crystalline sheets of protein, which self-assemble on the surface of microbial cells to form contiguous layers (1, 2). Most organisms that elaborate S-layers do so by abundantly producing and secreting a single protein species (3). Whether an organism produces an S-layer as a component of its envelope structure is assessed by electron microscopy of the cell surface (4). In this manner, species from nearly every branch of the Bacteria and Archaea have been discovered to produce S-layers (2). Proteins within S-layers fulfill variable functions in that they act either as a scaffold or enzyme in the bacterial envelope (5), promote nutrient diffusion or transport (6), or contribute to virulence by enabling microbial adhesion to infected host tissues (7).
Most, but not all, S-layer proteins of bacteria share three tandem ∼55 amino acid repeats of the Surface Layer Homology (SLH) domain (8–10). Secreted proteins encoding three tandem SLH domains are tethered to the bacterial envelope by non-covalent interactions between the SLH domains and a secondary cell wall carbohydrate (11). SLH domains are remarkable for being both necessary and sufficient for the incorporation of chimeric proteins into S-layers (12, 13). The SbsC protein of Geobacillus stearothermophilus is an example for a class of protein that forms S-layers without SLH domains (14). SbsC binds to the secondary cell wall polysaccharide (SCWP) of G. stearothermophilus via its N-terminal domain, which consists of three triple-helical bundles that are connected by two contiguous helices (14). The N-terminal domain of SbsC has high similarity with S-layer proteins from G. stearothermophilus, Geobacillus kaustophilus, and Geobacillus tepidamans (14) and is not similar to proteins with SLH domains.
The Gram-positive bacterium Bacillus anthracis is a rod-shaped, spore-forming pathogen of mammalian hosts (15). The envelope of its vegetative forms is composed of a plasma membrane and peptidoglycan layer with attached secondary cell wall polysaccharide (SCWP) (16) and poly-d-γ-glutamic acid capsule (PDGA) (17, 18). The genome of B. anthracis encompasses 24 open reading frames whose predicted translation products each contain a secretion signal and three tandem SLH domains (19). An operon of two such genes, sap and eag, encodes the main S-layer proteins, Sap and EA1, and is adjacent to csaB, a gene required for decorating SCWP with ketal-pyruvate (11, 20). B. anthracis SCWP is a polymer with the repeating structure [→6)-α-GlcNAc-(1→4)-β-ManNAc-(1→4)-β-GlcNAc-(1→]n, where α-GlcNAc is substituted with α-Gal and β-Gal at O3 and O4, respectively, and the β-GlcNAc is substituted with α-Gal at O3 (21). The position of the ketal-pyruvate modification on the SCWP is not known. B. anthracis can form S-layers from both extractable antigen 1 (EA1) and surface array protein (Sap) by tethering the SLH domains of these polypeptides to pyruvylated SCWP (20, 22, 23). C-terminal to the SLH domains, S-layer proteins encode crystallization domains, sequences predicted to enable subunit-subunit interactions within the S-layer (22, 24, 25). A simple model for S-layer assembly is that secreted subunits are recruited to the edge of an extant S-layer network via enthalpy-driven interactions between crystallization domains and are then tethered to the SCWP via the SLH domains (11). This model matches growth of the S-layer(s) with increases in the avidity of these networks for the cell wall. In this manner, bacilli assemble an S-layer on top of their peptidoglycan and thread PDGA capsule between S-layer protein subunits (26, 27). To gain insight into the molecular mechanisms of S-layer assembly, we determined here the three-dimensional structure of the SLH domains of Sap by x-ray crystallography and further explored the mechanism of binding to the SCWP.
EXPERIMENTAL PROCEDURES
Bacterial Strains and Plasmids
B. anthracis Sterne 34F2 or its variants were grown in brain heart infusion broth (BHI) or Luria broth (LB) at 37 °C. Escherichia coli strains grown in LB and supplemented with ampicillin (100 μg ml−1) where appropriate. psapSLH was generated by PCR amplification from the B. anthracis sap open reading frame with the primers P281 (5′-GGAATTCCATATGGGTAAAACATTCCCAGACGTTC-3′) and P282 (5′-TTT CTCGAGTTCTGTACCGAACTGCTTGTCAG-3′). PCR product and pET16b vector were restricted with NdeI and XhoI, ligated and transformed into E. coli. pgst-SLH1–3 was generated by PCR amplification from the B. anthracis sap open reading frame with the primers P277 (5′-TTTGGATCCGGTAAAACATTCCCAGACGTTC-3′) and P278 (5′-TTTGAATTCTTCTGTACCGAACTGCTTGTCAG-3′). The PCR product and pGEX-2TK vector were restricted with EcoRI and BamHI, ligated and transformed into E. coli. Truncation variants GST-SLH1 and GST-SLH1–2 were generated by PCR amplification with primers P277 and P603 (5′-TTTGAATTCGTAGCTGCTTCTGCACGAGTTAA-3′) or P604 (5′-TTTGAATTCTTCTACAAGAAGAGATGCCATAGAAAC-3′), respectively.
Protein Crystallization
E. coli BL21(DE3) cells harboring psapSLH were grown under conditions that promote SeMet incorporation (28). Labeled protein was subjected to immobilized metal affinity chromatography (IMAC) (29), concentrated and purified further by size exclusion chromatography on a Superdex 75 16/60 column equilibrated with 20 mm HEPES, 250 mm NaCl, 2 mm DTT, pH 8.0. A Mosquito liquid dispenser (TTP LabTech) was used to dispense 0.4 μl of protein (20 mg/ml) and 0.4 μl of reservoir solution into 96-well CrystalQuick plates (Greiner); droplets were equilibrated against 140 μl of reservoir solution. Crystals formed in 0.1 m citrate pH 5.5, 2.0 m ammonium sulfate at 4 °C were treated with cryoprotectant (crystallization buffer containing 3.5 m ammonium sulfate), mounted on CryoLoops (Hampton Research) and frozen in liquid nitrogen.
X-ray Diffraction and Structure Determination
Single-wavelength anomalous diffraction (SAD) data were collected near the selenium absorption peak at 100 °K from a single SeMet-labeled crystal at the 19ID beamline of the Structural Biology Center at the Advanced Photon Source (Argonne National Laboratory) using the program SBCcollect. Intensities were integrated and scaled with the HKL3000 suite (30). Heavy atom sites were located using the program SHELXD (31) and phased with the program MLPHARE (32). The density modified map output was submitted for model building with programs ARP/wARP. The structure was completed with manual model building using the program COOT (33) and refined with REFMAC (34). The final R was 16.6% and the free R was 18.9% with zero Σ cutoff. The stereochemistry of the structure was checked with PROCHECK (35). Comparison of SapSLH structure with other proteins in the Protein Data Bank (PDB) via the DALI server (Holm) identified only very limited structural homology. The closest homologue is a domain of X-prolyl dipeptidyl aminopeptidase (PDB ID 1LNS, Z-score 5.8, RMSD 3.1 Å) (36) that matches about half of SapSLH.
GST-SLH Purification and S-layer Assembly
E. coli BL21 (DE3) carrying pgst-SLH or its variants was grown in 1-liter cultures to A600 1 and induced with 1 mm IPTG for 3 h at 37 °C. Cells were harvested by centrifugation, suspended in PBS and disrupted by French press. Cleared lysate was subjected to affinity chromatography on glutathione-Sepharose and eluted with 20 mm reduced glutathione. For circular dichroism studies, GST-SLH variants were bound to glutathione-Sepharose and treated with 50 units of thrombin (Sigma) for 3 h at room temperature. Eluate was treated with 100 μl of benzamidine-Sepharose to remove thrombin and dialyzed against 8 mm NaH2PO4, 1.5 mm Na2HPO4. Purified protein (100 μg ml−1) was subjected to circular dichroism using an AVIV 202 CD Spectrometer at room temperature.
For S-layer assembly assays, B. anthracis Sterne was grown overnight in BHI. Vegetative forms were sedimented by centrifugation at 10,000 × g, suspended in 3 m urea and heated in a boiling water bath for 30 min to strip the S-layer from murein sacculi with attached SCWP. Cell wall preparations were washed with water and then with PBS and stored in aliquots at −20 °C. GST or GST-SLH were mixed with cell wall suspensions at the indicated A600 and incubated at room temperature for 10 min followed by centrifugation at 10,000 × g. Supernatant and sediment were mixed with sample buffer, heated to 95 °C and analyzed by SDS-PAGE. Protein was quantified by measuring pixel intensity of acquired images with Adobe Photoshop CS3.
RESULTS
Crystal Structure of the SLH Domains of Sap
The 814 amino acid Sap precursor harbors a N-terminal signal peptide (residues 1–30), three SLH domains (residues residues 34–197) as well as a large C-terminal domain (residues 210–814) that promotes the crystallization of the S-layer protein (25). The nucleotide sequence of the B. anthracis sap gene, i.e. codons 31 through 210, was cloned into the expression vector pET16b to generate psapSLH (Fig. 1A). This plasmid was transformed into E. coli BL21(DE3) and T7 RNA polymerase-mediated expression of SapSLH in bacterial cultures was induced with IPTG. SapSLH encompasses an N-terminal ten histidyl tag and amino acids Gly31-Glu210 of Sap (Fig. 1C), and was purified from cleared bacterial lysates by affinity chromatography on Ni-NTA-Sepharose and analyzed for purity by Coomassie-stained SDS-PAGE (Fig. 1B). SapSLH crystallized in the tetragonal space group P412121 with one monomer in the asymmetric unit. Its structure was determined using the single-wavelength anomalous diffraction (SAD) approach with Se-Met labeled crystals and refined to 1.80 Å resolution. All residues could be assigned either to the most favored or the allowed regions of Ramachandran plot statistics (Table 1). The structural model that could be derived from these data accounts for the positions of amino acid residues Lys32-Thr209 of Sap (Fig. 1C).
TABLE 1.
Data Collection | |
Space group | P41212 |
Unit cell (Å) | a = b = 77.77, c = 100.41 |
MW Da (residue) | 24353.4 (223)1 |
Mol (AU) | 1 |
SeMet (AU) | 6 |
Wavelength(Å) | 0.9794 (peak) |
Resolution range (Å) | 54.96–1.80 (1.80–1.85)2 |
Number of unique reflections | 27750 (2003) |
Redundancy | 11.1 (6.3)2 |
Completeness (%) | 99.86 (98.6) |
Rmerge (%) | 0.099 (0.748)2 |
I/ σ(I) | 26.83 (1.61)2 |
Phasing | |
RCullis (anomalous) (%) | 0.70 |
Figure of merit (%) | 0.2498 |
Refinement | |
Resolution | 54.96–1.80 |
Reflections (work/test) | 27750/1486 |
Rcrystal/Rfree (%) | 16.6/18.9 |
Rms deviation from ideal geometry | 0.017/1.485 |
Bond length (Å)/angle (°) | |
Number of protein atoms | 1741 |
Number of sulfate/water molecules | 1/241 |
Mean B-value (Å2) (mainchain/sidechain) | 1.399/3.177 |
Ramachandran plot statistic (%) | |
Residues in most favored regions | 92.0 |
Residues in additional allowed regions | 8.0 |
1 Includes His tag.
2 Numbers in parenthesis include highest resolution shell.
The overall structure of the three SLH domains resembles a three-prong spindle, where each prong is derived from a single SLH domain (Fig. 2). The base of the spindle is assembled from all three domains, each of which contributes a single helix that associates into a three-helical bundle (Fig. 2). The three SLH domains of SapSLH, SLH1 (residues 31–90), SLH2 (91–151) and SLH3 (152–209), share limited sequence identity: SLH1 versus SLH2 26%, SLH1 versus SLH3 39%, and SLH2 versus SLH3 26%. Nevertheless, the structures formed by each of the three SLH domains are nearly identical (Fig. 3). Thus, SapSLH can be considered as a pseudo-trimer that is assembled from its three SLH domains. When inspected from the C terminus (cargo view), the SLH domain prongs of SapSLH proceed clockwise from the N terminus and surround the three helical bundle base of the spindle (a, b, and c in Fig. 2A). Each SLH domain includes a helix on the lateral side of the molecule, two loops (A and B in Fig. 2B) and the beginning of one linker helix (Fig. 2B).
A Model for SCWP Binding to SapSLH
A group of five residues, named the ITRAE motif for its consensus sequence (Fig. 1C), is partially conserved among the SLH domains of bacterial S-layer proteins (10) and occupies the last four residues of loop B and the first residue of the central helix bundle (Fig. 2C). Within the SLH domains of Sap, these motifs have the sequences LTRAE, IDRVS, and VTKAE and contain the cationic residues Arg72, Arg131, and Lys193, respectively (Fig. 1C). The corresponding positively charged residues of the ITRAE motif are considered crucial for the incorporation of protein into the S-layer of Thermoanaerobacterium thermosulfurigenes (13). Analysis of the solvent accessible surface of SapSLH revealed three small tunnels at the spindle base (Fig. 4). The tunnels and ITRAE motifs are arranged such that Arg72 from SLH1 penetrates the tunnel in SLH2, appearing on the surface between SLH2 and SLH3 (Fig. 2D). Similarly, Arg131 from SLH2 penetrates the tunnel in SLH3 and appears on the surface between SLH1 and SLH3, whereas Lys193 is inserted into the SLH1 tunnel and displayed on the surface between SLH2 and SLH3 (Fig. 2D). Thus, all three SLH domains contribute residues to the surface structure of each of the inter-prong grooves (IPG) that are formed by the adjacent prongs of S-layer proteins (Fig. 5, A and B).
We wondered whether the inter-prong grooves formed by adjacent SLH domains promote association of the S-layer protein with its carbohydrate ligand. In agreement with this conjecture, six amino acids corresponding to the most conserved residues among B. anthracis SLH domains contribute to the surface of the inter-prong grooves, for example Arg72, Phe95, Asp97, Lys143, Trp164, and Lys168 of IPG2 (Figs. 5C and 1C). The SCWPs of different bacterial species represent a complex set of carbohydrates with hexose units organized in linear and branched fashion (37). We modeled B. anthracis SCWP into the structure of SLH domains to analyze the anchoring of carbohydrates within their three-prong spindle (Fig. 5).
The SCWP molecule was constructed as described in Choudhury et al. (21). A selection of low energy conformers of the molecule were manually modeled into the presumed binding sites of IPG1, IPG2, and IPG3 by performing rigid body rotations and translations of the SCWP to minimize steric clashing with SLH molecule. Once a suitable pose was identified, a short conjugate gradient minimization procedure was performed (38).
The conserved residues in IPG2 and the modeled SCWP are shown in Fig. 5CD. Similar SCWP poses and conserved residue arrangements were observed for the equivalent residues on the surface of IPG1 and IPG3, but are not shown for visual clarity. The conserved phenylalanine residues (Phe34, Phe95, and Phe156) are located at the domain interfaces, most likely playing an important role in domain packing (Fig. 5AB). However, the conserved aspartic acid residues (Asp36, Asp97, and Asp156) are located outside of the SCWP binding domains. Their planar arrangement on the bottom of the molecule suggests that they may be involved in interacting with peptidoglycan or the cell wall linkage units, which provide a tether for SCWP and the envelope of bacilli (20). Of note, our modeling experiments with SLH domains cannot consider the essential contribution of the SCWP ketal-pyruvyl, as the position of this carbohydrate modification is not yet known. We therefore sought to test whether the conserved Asp and Arg/Lys residues in the inter-prong grooves of SapSLH can indeed contribute to binding SCWPs in the envelope of bacilli.
Functional Studies of SapSLH Variants
Glutathione S-transferase (GST) hybrids were used to examine the contribution of individual Sap SLH domains and their key residues toward B. anthracis SCWP binding. Purified GST-SLH1–3, a hybrid encompassing amino acids 31–210 of Sap fused to the C terminus of GST, was incubated with B. anthracis vegetative forms that had been stripped of their S-layer. As a measure of SCWP binding, we monitored co-sedimentation of GST hybrids with bacilli following centrifugation (Fig. 6). As a control, GST alone did not sediment with vegetative forms, whereas all GST-SLH1–3 molecules co-sedimented with 10 A600 units of bacilli (Fig. 6, A and B). A reduction in the number of bacilli led to reduced co-sedimentation of GST-SLH1–3 (Fig. 6, A and B). These results are in agreement with the general concept that the SLH domains of S-layer proteins and the SCWP of bacilli represent a receptor-ligand interaction. The removal of individual SLH domains in GST-SLH1–2 and GST-SLH1 abolished the association of GST hybrids with the SCWP of bacilli (Fig. 6, A and B). When compared with SLH1–3, the CD spectra of isolated SLH1–2 and SLH1 domains displayed stepwise decreases in helix content but retained the overall pattern expected of helical proteins (Fig. 6C). We presume that the lack of GST-SLH1–2 and GST-SLH1 association with SCWP is due to the inability of these variants to form the three-pronged spindle structure rather than defects in the overall folding of individual SLH domains. These data demonstrate that a functional binding interface requires the presence of all three tandem SLH domains.
A striking feature of our SapSLH structure is the juxtaposition of conserved, basic residues extending into the inter-prong grooves of adjacent SLH domains (vide supra), to conserved aspartic acid residues positioned in the loop region of each spindle prong (Fig. 2D). To test whether these residues are required for SCWP association we generated two variants. GST-SLHRRK/AAA harbors alanine substitutions in all three basic residues (R72A, R131A, and K193A), whereas GST-SLHDDD/AAA carries substitutions in the three aspartic acid residues (D36A, D97A, and D158A). GST-SLHRRK/AAA displayed a reduction in its SCWP binding capacity as approximately half of the protein failed to associate with vegetative forms even at 10 A600 units (Fig. 6, A and B). A similar phenotype was observed for GST-SLHDDD/AAA variants. Thus, charged residues conserved in the inter-prong grooves of SLH domains contribute to the binding of S-layer proteins to the SCWP of B. anthracis.
In Silico Prediction of the Three-prong Spindle Structure of SLH Domains from Other S-layer Proteins
Excepting the amino acids Phe34, Asp36, Trp42, Gly54, Gly57, Glu64, Pro65, Arg72, Ala76, and Asn84 which are conserved in many B. anthracis SLH domains, the SLH domain peptide sequence is variable (Fig. 1C). To ascertain whether the sequence of these domains can satisfy the spatial constraints imposed by the empirically derived SapSLH structure, we modeled aligned sequences of tandem SLH domain triplets from the B. anthracis EA1 S-layer protein and 22 S-layer associated proteins. Each SLH sequence was first aligned with Sap residues 31–209 using ClustalW.
These alignments and the SapSLH PDB file were used to produce in silico predicted structures with MODELLER 9v8 using a standard Python script and models refined with PyMOL (MacPyMOL 1.1r1). In silico modeled SLH structures were compared with the empirically derived SapSLH structure in two ways. First, we used the PyMol align function to measure root mean squared deviation (RMSD) in angstroms (Å) between a set of all atoms in the modeled SLH domains and the SapSLH structure (Table 2). Using these criteria, every set of three SLH domains produced average RMSD values from SapSLH equal to or less than 0.461 Å (Table 2). Performing the same calculation with only the α-carbon backbone, we derived alignments where the largest average RMSD distance was calculated to be 0.269 Å for 123 carbons (Table 2). Second, we subjected all 24 SLH domain PDB files to analysis using PROCHECK (35) to judge the stereochemical quality of each modeled SLH domain (and SapSLH). PROCHECK measures whether or not all residues lay within the allowed portions of the Ramachandran plot and scores the likeliness of all modeled bond lengths and angles occurring. Supplemental Table S1 records the PROCHECK outputs for SapSLH and all 23 modeled SLH domains. Many of our in silico determined structures satisfy all criteria while some fail by containing a single or as many as two amino acids with disallowed F/Ψ angles. As the observed RMSD values, derived from greater than 100 atoms, are low and each of these modeled SLH domains satisfy most, if not all, PROCHECK criteria, we propose that, though highly divergent in primary sequence (Supplemental Table S1), all SLH domains within the B. anthracis genome likely adopt a similar fold (Fig. 7).
TABLE 2.
Polypeptide |
Aligment of SLH domains |
||||
---|---|---|---|---|---|
NCBI locus | Name | All atoms | RMSD | Cα | RMSD |
BAS0841 | Sap | – | – | – | – |
BAS0842 | EA1 | 884 | 0.172 | 152 | 0.119 |
pXO1–90 | BslA | 749 | 0.362 | 123 | 0.269 |
pXO1–54 | BslB | 758 | 0.3 | 129 | 0.215 |
BAS0916 | BslC | 697 | 0.461 | 116 | 0.3 |
BAS1048 | BslD | 735 | 350 | 116 | 0.214 |
BAS1049 | BslE | 759 | 0.287 | 130 | 0.205 |
BAS1050 | BslF | 694 | 0.34 | 118 | 0.254 |
BAS1787 | BslG | 762 | 0.292 | 132 | 0.21 |
BAS2160 | BslH | 749 | 0.323 | 125 | 0.232 |
BAS3093 | BslI | 758 | 0.312 | 125 | 0.203 |
BAS3425 | BslJ | 752 | 0.317 | 127 | 0.225 |
BAS1021 | BslK | 831 | 0.216 | 135 | 0.128 |
BAS1246 | BslL | 755 | 0.227 | 139 | 0.168 |
BAS2613 | BslM | 739 | 0.274 | 120 | 0.175 |
BAS4693 | BslN | 748 | 0.281 | 127 | 0.222 |
BAS1683 | BslO | 714 | 0.257 | 121 | 0.179 |
BAS0829 | BslP | 639 | 0.327 | 106 | 0.2 |
BAS3089 | BslQ | 660 | 0.292 | 114 | 0.201 |
BAS3463 | BslR | 616 | 0.283 | 110 | 0.194 |
BAS0851 | BslS | 832 | 0.2 | 134 | 0.118 |
BAS1682 | BslT | 934 | 0.224 | 146 | 0.122 |
BAS2351 | BslU | 797 | 0.19 | 141 | 0.133 |
pXO2–42 | AmiA | 808 | 0.18 | 129 | 0.1 |
The high degree of similarity in shape and position can be appreciated by viewing the SapSLH structure and the modeled structure for the SLH domains of EA1 (EA1SLH) when displayed side-by-side or superimposed (Fig. 7, A and B). The modeled EA1SLH structure aligns closely with the SapSLH structure (Fig. 7, A--C), producing an RMSD of 0.119 Å for 152 α-carbons (Table 2) with all amino acids within the allowed regions of the Ramachandran plot (Supplemental Table S1). These results explain earlier observations that the SLH domains of Sap and EA1 bind to the envelope of B. anthracis with similar affinity (39). A superimposition of all 23 modeled SLH domains aligned to SapSLH is provided in Fig. 7D. Perfect alignment is not achieved in all cases, as some SLH domain sequences, when aligned to Sap, exhibit higher disparity in their primary sequences (Fig. 7D). Nevertheless, these structures still contain remarkably similar secondary structures with all models containing helices a, b, c, x, y, and z in the same position and of the same length. We therefore hypothesize that all SLH domains of S-layer proteins in B. anthracis form a three-prong spindle structure and bind to the SCWP in a manner similar to the SLH domain of Sap. Analyzing the predicted secondary structure of SLH domains from S-layer proteins of diverse microbes, Engelhardt and Peters predicted that SLH domains may assume a similar structure (40). Their prediction is corroborated by the results presented in Fig. 7. Structural coordinates for SapSLH have been deposited in the Protein Database, PDB identification number 3PYW.
DISCUSSION
Binding of SLH domains to pyruvylated secondary cell wall carbohydrates is thought to be an ancestral mechanism for the anchoring of S-layer proteins to the envelope of bacteria (41). B. anthracis elaborates S-layers from two secreted polypeptides with N-terminal SLH domains, Sap and EA1 (22, 42). The SLH domains of Sap and EA1 bind to pyruvylated SCWP, a carbohydrate that is tethered via peptidoglycan linkage units to the murein sacculus of this microbe (20). Sap and EA1 are synthesized by bacilli in great abundance and form two-dimensional para-crystalline arrays on solid surfaces, a feature that is encrypted in their large C-terminal crystallization domains (25). Twenty-two B. anthracis S-layer-associated proteins (BSLs) also harbor N-terminal SLH domains (19). Unlike Sap and EA1, BSLs are minor components of the envelope that use their SLH domains to engage SCWPs but do not form para-crystalline arrays (19, 20). One of the S-layer associated proteins, BslA, promotes the adhesion of bacilli to host tissues and is required for the pathogenesis of anthrax disease (7, 19). The S-layer protein BslK binds to hemin, thereby contributing to an iron-uptake pathway that retrieves heme from hemoglobin for subsequent transport across the many layers of the B. anthracis envelope (6, 43). The AmiA protein contains three tandem SLH domains and is a peptidoglycan hydrolase (44). The molecular function of other S-layer proteins is not yet known, however, a set of eight, including AmiA, are putative hydrolases, which may act to shape the peptidoglycan layer of bacilli.
Here we used x-ray crystallography to reveal the three-dimensional structure of the SLH domain of Sap, whose overall shape resembles that of a three-prong spindle. Molecular modeling experiments suggest that the SLH domains of other proteins assume a similar structure. Further, the inter-prong grooves of this domain can accommodate both linear and branched carbohydrates, i.e. the secondary cell wall polysaccharides that are known to function as the docking platform for the assembly of S-layers. Our experiments identify conserved residues in the SLH domains of Sap and other S-layer proteins of bacilli (Phe34, Asp36, Trp42, and Arg72) and we show that two of these, Asp36 and Arg72, contribute to their interaction with SCWP.
The SCWP of B. anthracis is a polymer with the repeating structure [→6)-α-GlcNAc-(1→4)-β-ManNAc-(1→4)-β-GlcNAc-(1→]n, where α-GlcNAc is substituted α-Gal and β-Gal at O3 and O4, respectively, and the β-GlcNAc is substituted with α-Gal at O3 (21). CsaB is thought to generate pyruvyl-ketal modifications of SCWP, which are a pre-requisite for S-layer assembly (11). The pyruvyl-ketal introduces a strong negative charge modification into the polysaccharide that promotes binding of the SLH domains in S-layer proteins to the bacterial envelope (20). We surmise that some of the conserved positively charged residues in the inter-prong grooves enable S-layer protein binding to the pyruvyl-ketal of SCWP (20).
Cholera toxin represents another secreted bacterial polypeptide that binds to carbohydrates, specifically the GM1 glycosphingolipid of host cell membranes (45). Cholera toxin B assembles into a pentameric ring structure (46) and docks onto the host cell receptor in a manner whereby subunit interfaces capture the glycosphingolipid ligand (47). Choleragenoid, the toxin B subunit alone, is used as a vaccine antigen to elicit protective antibodies that prevent association of toxin with GM1 receptor. A similar strategy may be plausible to prevent S-layer assembly, and this may provide protection against B. anthracis or other infectious agents that also use S-layer proteins for the pathogenesis of their associated diseases (16).
Supplementary Material
Acknowledgments
The submitted manuscript has been created by University of Chicago Argonne, LLC, Operator of Argonne National Laboratory (“Argonne”). Argonne, a U.S. Dept. of Energy Office of Science laboratory, is operated under Contract No. DE-AC02-06CH11357. The U.S. Government retains for itself, and others acting on its behalf, a paid-up nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government.
This work was supported, in whole or in part, by Grants GM074942 (to A. J.) and AI69227 (to O. S.) from the National Institutes of Health.
The atomic coordinates and structure factors (code 3PYW) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
The on-line version of this article (available at http://www.jbc.org) contains supplemental Fig. S1.
- S-layer
- surface layer
- SLH
- surface layer homology
- RMSD
- root mean square deviation
- SCWP
- secondary cell wall polysaccharide
- EA
- extractable antigen
- Sap
- surface array protein
- IMAC
- immobilized metal affinity chromatography
- SAD
- single-wavelength anomalous diffraction
- BSL
- B. anthracis S-layer-associated protein
- PDB
- Protein Data Bank.
REFERENCES
- 1. Houwink A. L., Le Poole J. B. (1952) Physikalische Verhandlungen 3, 98 [Google Scholar]
- 2. Engelhardt H. (2007) J. Struct. Biol. 160, 115–124 [DOI] [PubMed] [Google Scholar]
- 3. Sleytr U. B. (1997) FEMS Microbiol. Rev. 20, 5–12 [Google Scholar]
- 4. Houwink A. L. (1953) Biochim. Biophys. Acta 10, 360–366 [DOI] [PubMed] [Google Scholar]
- 5. Houwink A. L. (1956) J. Gen. Microbiol. 15, 146–150 [DOI] [PubMed] [Google Scholar]
- 6. Tarlovsky Y., Fabian M., Solomaha E., Honsa E., Olson J. S., Maresso A. W. (2010) J. Bacteriol. 192, 3503–3511 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Kern J. W., Schneewind O. (2010) Mol. Microbiol. 75, 324–332 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Fujino T., Béguin P., Aubert J. P. (1993) J. Bacteriol. 175, 1891–1899 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Lupas A., Engelhardt H., Peters J., Santarius U., Volker S., Baumeister W. (1994) J. Bacteriol. 176, 1224–1233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Lupas A. (1996) Mol. Microbiol. 20, 897–898 [DOI] [PubMed] [Google Scholar]
- 11. Mesnage S., Fontaine T., Mignot T., Delepierre M., Mock M., Fouet A. (2000) EMBO J. 19, 4473–4484 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Mesnage S., Tosi-Couture E., Fouet A. (1999) Mol. Microbiol. 31, 927–936 [DOI] [PubMed] [Google Scholar]
- 13. May A., Pusztahelyi T., Hoffmann N., Fischer R. J., Bahl H. (2006) Arch. Microbiol. 185, 263–269 [DOI] [PubMed] [Google Scholar]
- 14. Pavkov T., Egelseer E. M., Tesarz M., Svergun D. I., Sleytr U. B., Keller W. (2008) Structure 16, 1226–1237 [DOI] [PubMed] [Google Scholar]
- 15. Koch R. (1876) Beiträge zur Biologie der Pflanzen 2, 277–310 [Google Scholar]
- 16. Leoff C., Saile E., Rauvolfova J., Quinn C. P., Hoffmaster A. R., Zhong W., Mehta A. S., Boons G. J., Carlson R. W., Kannenberg E. L. (2009) Glycobiology 19, 665–673 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Bruckner V., Kovacs J., Denes G. (1953) Nature 172, 508 [DOI] [PubMed] [Google Scholar]
- 18. Fouet A. (2009) Mol. Aspects Med. 30, 374–385 [DOI] [PubMed] [Google Scholar]
- 19. Kern J. W., Schneewind O. (2008) Mol. Microbiol. 68, 504–515 [DOI] [PubMed] [Google Scholar]
- 20. Kern J., Ryan C., Faull K., Schneewind O. (2010) J. Mol. Biol. 401, 757–775 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Choudhury B., Leoff C., Saile E., Wilkins P., Quinn C. P., Kannenberg E. L., Carlson R. W. (2006) J. Biol. Chem. 281, 27932–27941 [DOI] [PubMed] [Google Scholar]
- 22. Mesnage S., Tosi-Couture E., Mock M., Gounon P., Fouet A. (1997) Mol. Microbiol. 23, 1147–1155 [DOI] [PubMed] [Google Scholar]
- 23. Mignot T., Mesnage S., Couture-Tosi E., Mock M., Fouet A. (2002) Mol. Microbiol. 43, 1615–1627 [DOI] [PubMed] [Google Scholar]
- 24. Couture-Tosi E., Delacroix H., Mignot T., Mesnage S., Chami M., Fouet A., Mosser G. (2002) J. Bacteriol. 184, 6448–6456 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Candela T., Mignot T., Hagnerelle X., Haustant M., Fouet A. (2005) Microbiology 151, 1485–1490 [DOI] [PubMed] [Google Scholar]
- 26. Mesnage S., Tosi-Couture E., Gounon P., Mock M., Fouet A. (1998) J. Bacteriol. 180, 52–58 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Richter S., Anderson V. J., Garufi G., Lu L., Budzik J. M., Joachimiak A., He C., Schneewind O., Missiakas D. (2009) Mol. Microbiol. 71, 404–420 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. van Duyne G. D., Standaert R. F., Karplus P. A., Schreiber S. L., Clardy J. (1993) J. Mol. Biol. 229, 105–124 [DOI] [PubMed] [Google Scholar]
- 29. Kim Y., Dementieva I., Zhou M., Wu R. Y., Lezondra L., Quartey P., Joachimiak G., Korolev O., Li H., Joachimiak A. (2004) J. Struct. Funct. Genomics 5, 111–118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Minor W., Cymborowski M., Otwinowski Z., Chruszcz M. (2006) Acta Crystallogr. D Biol. Crystallogr. 62, 859–866 [DOI] [PubMed] [Google Scholar]
- 31. Schneider T. R., Sheldrick G. M. (2002) Acta Crystallogr. D Biol. Crystallogr. 58, 1772–1779 [DOI] [PubMed] [Google Scholar]
- 32. CCP4 (COLLABORATIVE COMPUTATIONAL PROJECT, N.) (1994) Acta Crystallogr. D Biol. Crystallogr. 50, 760–763 [DOI] [PubMed] [Google Scholar]
- 33. Emsley P., Cowtan K. (2004) Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 [DOI] [PubMed] [Google Scholar]
- 34. Murshudov G. N., Vagin A. A., Dodson E. J. (1997) Acta Crystallogr. D Biol. Crystallogr. 53, 240–255 [DOI] [PubMed] [Google Scholar]
- 35. Laskowski R. A., MacArthur M. W., Moss D. S., Thornton J. M. (1993) J. Appl. Crystallogr. 26, 283–291 [Google Scholar]
- 36. Chich J. F., Rigolet P., Nardi M., Gripon J. C., Ribadeau-Dumas B., Brunie S. (1995) Proteins 23, 278–281 [DOI] [PubMed] [Google Scholar]
- 37. Schäffer C., Messner P. (2005) Microbiology 151, 643–651 [DOI] [PubMed] [Google Scholar]
- 38. Case D. A., Cheatham T. E., 3rd, Darden T., Gohlke H., Luo R., Merz K. M., Jr., Onufriev A., Simmerling C., Wang B., Woods R. J. (2005) J. Comput. Chem. 26, 1668–1688 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Chauvaux S., Matuschek M., Beguin P. (1999) J. Bacteriol. 181, 2455–2458 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Engelhardt H., Peters J. (1998) J. Struct. Biol. 124, 276–302 [DOI] [PubMed] [Google Scholar]
- 41. Cava F., De Pedro M. A., Schwarz H., Henne A., Berenguer J. (2004) Mol. Microbiol. 52, 677–690 [DOI] [PubMed] [Google Scholar]
- 42. Etienne-Toumelin I., Sirard J. C., Duflot E., Mock M., Fouet A. (1995) J. Bacteriol. 177, 614–620 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Maresso A. W., Garufi G., Schneewind O. (2008) PLoS pathogens 4, e1000132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Mesnage S., Fouet A. (2002) J. Bacteriol. 184, 331–334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. van Heyningen W. E., Carpenter C. C., Pierce N. F., Greenough W. B., 3rd (1971) J. Infect. Dis. 124, 415–418 [DOI] [PubMed] [Google Scholar]
- 46. Zhang R. G., Westbrook M. L., Westbrook E. M., Scott D. L., Otwinowski Z., Maulik P. R., Reed R. A., Shipley G. G. (1995) J. Mol. Biol. 251, 550–562 [DOI] [PubMed] [Google Scholar]
- 47. Merritt E. A., Sarfaty S., van den Akker F., L'Hoir C., Martial J. A., Hol W. G. (1994) Protein Sci. 3, 166–175 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.