Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2012 Aug 3;287(41):34856–34865. doi: 10.1074/jbc.M112.371054

Targeted Protein Engineering Provides Insights into Binding Mechanism and Affinities of Bacterial Collagen Adhesins*

Caná L Ross ‡,1, Xiaowen Liang ‡,1, Qing Liu , Barbara E Murray §, Magnus Höök , Vannakambadi K Ganesh ‡,2
PMCID: PMC3464587  PMID: 22865854

Background: Collagen adhesins play critical roles in bacterial colonization.

Results: Several structural aspects influence binding affinity, binding ratio, and kinetics of binding to type I collagen.

Conclusion: Discrete structural elements within the bacterial collagen adhesin control ligand affinity and binding mechanism.

Significance: Understanding the collagen/MSCRAMM interaction sheds insight on how domain architecture influences the mechanism of collagen binding and affinity.

Keywords: Bacterial Adhesion, Biophysics, Collagen, Extracellular Matrix Proteins, Infectious Diseases, Microbial Pathogenesis, Molecular Modeling, Protein Chimeras, Protein Conformation, Protein Engineering

Abstract

The collagen-binding bacterial proteins, Ace and Cna, are well characterized on the biochemical and structural level. Despite overall structural similarity, recombinant forms of the Ace and Cna ligand-binding domains exhibit significantly different affinities and binding kinetics for collagen type I (CI) in vitro. In this study, we sought to understand, in submolecular detail, the bases for these differences. Using a structure-based approach, we engineered Cna and Ace variants by altering specific structural elements within the ligand-binding domains. Surface plasmon resonance-based binding analysis demonstrated that mutations that are predicted to alter the orientation of the Ace and Cna N1 and N2 subdomains significantly affect the interaction between the MSCRAMM (microbial surface components recognizing adhesive matrix molecule) and CI in vitro, including affinity, association/dissociation rates and binding ratio. Moreover, we utilized this information to engineer an Ace variant with an 11,000-fold higher CI affinity than the parent protein. Finally, we noted that several engineered proteins that exhibited a weak interaction with CI recognized more sites on CI, suggesting an inverse correlation between affinity and specificity.

Introduction

Bacterial colonization of host tissues is a critical step in the infection process, and many pathogenic bacteria produce proteins that promote bacterial adhesion. One such family of proteins, the MSCRAMMs3 (microbial surface components recognizing adhesive matrix molecules), interacts with components of the host extracellular matrices (13). MSCRAMMs found on Gram-positive bacteria typically belong to a family of cell wall anchored proteins that have similar structural organizations. In general, the N-terminal portion of these proteins contains the ligand-binding domain in the so-called A region, whereas the C-terminal half contains repeated motifs often known as the B domains that appear to extend and project the ligand-binding domain away from the bacterial surface (4). The C terminus also contains the cell wall sorting motifs, including the LPXTG sequence, which is essential for sortase-dependent anchoring to the cell wall, followed by a stretch of hydrophobic residues and a short segment of positively charged amino acids (5, 6).

Despite a conserved structural organization, MSCRAMMs are largely species-specific, and homologues with substantial sequence similarities within the ligand-binding domains are not found in other bacteria. One exception is a family of collagen-binding MSCRAMMs, which includes Cna from Staphylococcus aureus (7), Ace from Enterococcus faecalis (8), Acm from Enterococcus faecium (9), Cne from Streptococcus equi (10), Cnm from Streptococcus mutans (11), RspA and RspB from Erysipelothrix rhusiopathiae (12), and BA0871 and BA5258 from Bacillus anthracis (13).

Cna and Ace are the most extensively studied members of this family of collagen-binding MSCRAMMs. The Ace and Cna ligand-binding regions are composed of N1 and N2, subdomains that each adopt an IgG fold variant (1416). Structural comparisons of Cna31–344 in the apo form and in complex with a synthetic collagen peptide led to the proposal of a multistep ligand-binding mechanism called the Collagen Hug (15). In this model, ligand binding is initiated by a low affinity interaction between the ligand and residues present within a shallow “trench” located on the N2 subdomain. Next, the interdomain linker wraps around the triple helical collagen and repositions the N1 subdomain. Finally, the complex is stabilized when the C-terminal N2 extension, acting as a “latch,” inserts into a cleft and complements a β-sheet within the neighboring N1 subdomain (15). In the resulting complex, the MSCRAMM forms an interdomain hole through which the collagen projects. Structural comparisons and mutagenesis studies indicate that Ace employs a similar ligand-binding mechanism (16).

The crystal structures of Cna31–344 and Ace32–367 indicate that the hole (consisting of the interdomain linker and the N1 and N2 subdomains) created by ligand hugging can only accommodate a single collagen triple helix. Therefore, it is likely that Cna and Ace interact with monomeric triple helix collagen but not with fibrillar collagen (15, 16). Furthermore, Cna31–344 does not recognize gelatin or denatured collagen, indicating that the rod-like shape of the triple helix is critical for binding (15).

Expression of Ace and Cna on the surface of E. faecalis and S. aureus, respectively, results in increased bacterial adherence to immobilized collagen type I (CI) (8, 1719). In addition, surface expression of Ace mediates E. faecalis adherence to collagen type IV (CIV), laminin, and dentin (20, 21), whereas the presence of the cna gene has been associated with adherence of S. aureus clinical isolates to laminin and CIV (22). Furthermore, several studies demonstrate that Ace and Cna are virulence factors in different animal models. S. aureus strains harboring the cna gene exhibit increased virulence in animal models of staphylococcal infection, including endocarditis, arthritis, osteomyelitis, mastitis, and keratitis (18, 2326). It has also been demonstrated that ace-null strains of E. faecalis are attenuated in a murine model of urinary tract infection and a rat endocarditis model (27). In addition, recombinant forms of Cna can be used as effective vaccine components, whereas passive immunization with Cna antibodies is protective in a S. aureus septic death mouse model (28). Also, active immunization of rats with Ace32–367 significantly decreases the incidence of endocarditis, whereas prophylactic treatment of rats with purified Ace antibodies significantly reduces the numbers of E. faecalis isolated from vegetations as well as the number infected (27).

Despite the high degree of structural similarity between Ace32–367 and Cna31–344, these proteins demonstrate significantly different affinities and binding kinetics for CI in vitro (8, 15, 16). To understand the bases for these differences, we used a structure-based approach to engineer proteins harboring alterations within specific structural elements. Surface plasmon resonance (SPR)-based binding analysis demonstrates that sequence alterations within the N1 subdomains of Ace and Cna significantly affect the stability of the interaction between the MSCRAMM and the ligand in vitro. The mutations, which are predicted to alter the orientation and range of motion of the N1 and N2 subdomains, influenced collagen affinity, rates of association/dissociation, and binding site preference. We also found that the sequence and length of the interdomain linker influenced binding of the proteins to immobilized CI and determined that although there is an optimal length for the interdomain linker, the sequence of the interdomain linker is more important.

EXPERIMENTAL PROCEDURES

Medium and Growth Conditions

Overnight cultures of Escherichia coli strains were grown at 30 °C in LB medium (37 °C for plasmid extraction) with shaking (200 rpm). For protein expression, E. coli TOPP3 was diluted 1:100 into fresh LB medium and grown at 37 °C with shaking (250 rpm). During logarithmic growth phase, 200 μm isopropyl-1-thio-β-d-galactopyranoside was added, and growth was continued at 30 °C for 16 h, after which cells were harvested by centrifugation at 3500 rpm for 20 min using a Sorvall RC 3B Plus centrifuge. Pellets were frozen at −20 °C. Ampicillin (Sigma-Aldrich) or carbenicillin (Sigma-Aldrich) was used at concentrations of 100 μg/ml.

DNA Manipulation and Plasmid Construction

Sequence changes to the DNA regions encoding the interdomain linker and the C-D loop of ace and cna were introduced using site-directed mutagenesis (29) or overlap extension PCR (30) using primers listed in supplemental Table S1 in a mixture containing Phusion polymerase (Thermo Scientific, Vantaa, Finland), HF buffer (Thermo Scientific), 50 ng of DNA template, and 200–250 nm dNTP mix (Invitrogen). Following digestion with DpnI (New England Biolabs, Ipswich, MA), PCRs generated from site-directed mutagenesis were transformed into E. coli TOPP10 using standard procedures. Purified overlap PCRs were incubated at 72 °C with Taq polymerase (New England Biolabs) and 200 nm dATP (Invitrogen) and purified using DNA clean and concentrator 5 (Zymo Research, Carlsbad, CA) prior to ligation into pGEMTeasy (Promega, Madison, WI).

Sequencing was performed to verify site-directed mutagenesis and pGEMTeasy constructs containing inserts (Genewiz, Houston, TX). Following digestion with BamHI and SalI (New England Biolabs), the sequence of interest was purified from pGEMTeasy using the gel extraction kit (Zymo Research) and ligated into the previously digested expression vector, pQE30, using T4 DNA ligase (Promega). Standard procedures were followed for DNA transformation into E. coli TOPP10 cells. Final plasmids are described in supplemental Table S2.

Protein Purification

The His-tagged recombinant proteins were purified with a 5-ml nickel-charged HiTrap chelating column (GE Healthcare, Uppsala, Sweden) and a 5-ml anion exchange Sepharose column (GE Healthcare) as described by Barbu et al. (31). Fractions obtained from the anion exchange column were concentrated and dialyzed against PBS, 10 mm EDTA, pH 7.4. Recombinant proteins were >95% pure. The concentration of the recombinant protein was determined by absorbance measured at 280 nm and calculated using the extinction coefficient for each protein.

Molecular Modeling

Molecular modeling studies were performed using Coot (32) and Insight II (Accelrys, Inc., San Diego, CA). To build a model of Ace in the Cna conformation, individual N1 and N2 domains of Ace (Protein Data Bank (PDB) id: 2Z1P) were superimposed on corresponding N1 and N2 domains of the crystal structures of Cna (PDB id: 2F68). Superposition based on secondary structure matching (33) available in Coot (32) was used for the superposition. One hundred and eighteen Cα atoms in the N1 subdomain and 137 Cα atoms in the N2 subdomain aligned with root mean square deviations of 1.67 and 1.50 Å, respectively. The model of the Ace molecule in the Cna conformation showed steric clashes between the N1 and N2 subdomain. To build a molecular model of the AceC4 harboring the Cna N1 region (C-D loop, D-strand, and D-D′ loop), a homology model was built using the HOMOLOGY module in the Insight II package (Accelrys, Inc.). After building a model of the Ace N1 domain with the Cna N1 region, this domain was superimposed on the Ace with Cna conformation to build a model of AceC4.

SPR Analysis

The interactions of Cna31–344, Ace32–367, and the mutant derivatives with immobilized rat tail CI (R&D Systems, Inc., Minneapolis, MN) were characterized using a Biacore 3000 (GE Healthcare/Biacore) at 25 °C. Sensor chip C1 and amine-coupling kits were obtained from the same company and used to covalently attach CI onto the sensor surface using the amine coupling procedure as recommended by the manufacturer. The matrix-free C1 chip was chosen because it can achieve low density immobilization and avoid mass transfer limitation. Also, the flat surface allows interactions to take place closer to the surface, which is beneficial when working with multivalent interaction partners and large molecules. Briefly, after the surface was activated for 2 min, 10 μl of 5 μg/ml CI solution (stock of 5 mg/ml in 0.01 n HCl was diluted in 10 mm sodium acetate, pH 5.5) was injected into the flow cell at a flow rate of 5 μl/min. Approximately 440 response units (RU) of collagen were immobilized. Lower (∼150 and 180 RU) density collagen surfaces were also prepared by adjusting the activation time and volume of CI injected. A Cna surface was prepared on a separate sensor chip using 5 μg/ml Cna31–344 in 10 mm sodium acetate buffer, pH 5.5. The reference flow cells were prepared with activation and deactivation steps where no protein was coupled. Phosphate-buffered saline (8 mm Na2HPO4, 1.5 mm KH2PO4, pH 7.4, 2.7 mm KCl, 137 mm NaCl, and 0.005% Tween 20) was used as running buffer. Binding was performed at a relatively high flow rate of 30 μl/min, although no mass transfer was observed in the system. To regenerate the sensor surface, bound protein was removed by flowing 10 mm glycine (pH 1.5) over the surface for 1 min.

Base line-corrected SPR response curves (with buffer blank run further subtracted) were used for affinity determination. For steady-state analysis, equilibrium response (Req) of each injection was collected and plotted against the concentration (A) of injected protein. A binding isotherm was fitted to the data using equation 1 (GraphPad Prism 4, GraphPad Software, Inc., La Jolla, CA) to obtain the equilibrium dissociation constant (KD) and binding response maximum (Rmax).

graphic file with name zbc04112-2448-m01.jpg

Nonequilibrium data were globally fit to a predefined two-state model using the BIAevaluation software (Version 4.1). The Rmax, association and dissociation rate constants (ka1, kd1) for the binding state (A + B ↔ AB), and forward and backward rate constants (ka2, kd2) for the conformational change state (AB ↔ AB*) were obtained from the fitting with T-value ≥100 as criteria for acceptable fit. This model describes 1:1 binding of analyte to immobilized ligand followed by a conformational change in the complex (34). It simply assumes that the only way AB* can dissociate to release free A is through prior conversion to the form AB, where A is Cna or Ace protein in solution, B is immobilized collagen, AB is the complex formed by binding A to B, and AB* is the complex formed by conformational change from AB. The apparent dissociation constant (KDapp) was calculated from the kinetic parameters obtained from the curve fitting.

graphic file with name zbc04112-2448-m02.jpg

Theoretical maximum response for 1:1 binding (Rmax1:1) was calculated using Equation 3, where Rimmob is the SPR response of the protein immobilized on the sensor surface and MWsol and MWimmob are molecular weight of soluble and immobilized protein, respectively. Molecular weights for Ace and Cna are ∼39,000 and ∼36,000, respectively. Because the collagen molecule was kept at very low concentration in a low salt and low pH buffer during immobilization, the monomer (∼300 kDa) conformation was maintained.

graphic file with name zbc04112-2448-m03.jpg

Molar binding ratio (N) is defined as the number of soluble proteins bound to a single immobilized molecule and is determined by comparing the experimental value Rmax with Rmax1:1.

graphic file with name zbc04112-2448-m04.jpg

RESULTS AND DISCUSSION

Comparisons of the Cna31–344 and Ace32–367 Structures and Interactions with Collagen

Superpositions of apo crystal structures of Cna31–344 and Ace32–367 are shown in Fig. 1A. Structural comparisons of the two structures revealed that the interdomain orientations of the Ace32–367 (PDB id: 2Z1P) N1 and N2 subdomains are altered by about 36° when compared with that of Cna31–344 (PDB id: 2F68) (16) with the interdomain linker residues disordered in both the apo structures. As a result, the apo-Ace32–367 structure is in a partially open conformation with the latch partially closed, whereas Cna31–344 exhibits a latch-closed conformation (16). Consistent with previous studies, Ace32–367 and Cna31–344 exhibit different binding affinities and kinetics in SPR analysis of MSCRAMM binding to immobilized CI (Table 1, Fig. 1B) (8, 15, 16). Cna31–344 has a higher affinity for collagen (KDapp ∼66.0 nm) than Ace32–367 (KD ∼43.2 μm). Also, the association and complete dissociation of Ace32–367 with immobilized CI are rapid, and the reaction follows a one-step steady-state binding model (Fig. 1, B and C, Table 1). However, the interaction of CI with Cna31–344 is more complex, consistent with a two-state conformational change model (Fig. 1, B and C, Table 1).

FIGURE 1.

FIGURE 1.

Structural and biochemical comparisons of recombinant Cna31–344 and Ace32–367. A, stereo view of the superposition of crystal structures of apo-Cna31–344 (PDB id: 2F68) and Ace32–367 (PDB id: 2Z1P). The N2 subdomains were used for superposition. The Cna31–344 and Ace32–367 molecules are green and cyan, respectively. The N1-N2 interdomain linker regions, which are disordered in the crystal structures, are denoted by dotted lines. B, SPR profiles of interactions. Biacore sensograms were generated by injecting Cna31–344 or Ace32–367 at the indicated concentration over immobilized CI. C, reactions representing the proposed binding models describing the observed one-step reaction between Ace32–367 and CI and the two-state conformational change reaction between Cna31–344 and CI. D, structure-based amino acid alignment of the Cna and Ace interdomain linkers.

TABLE 1.

Kinetic parameters for MSCRAMM/CI interaction

Binding of soluble proteins to immobilized collagen were analyzed using Biacore (shown in Figs. 4 and 5). See ‘Experimental Procedures’ for determination of equilibrium dissociation constants KD, apparent dissociation constants KDapp, and molar binding ratios N. Values are mean (with standard error in parentheses) values obtained from a minimum of two independent experiments.

Protein ka1 kd1 ka2 kd2 KDapp KD N
× 104m1s1 × 102 s1 × 103 s1 × 103 s1 nm μm
Cna31–344 7.33 (0.47) 1.51 (0.14) 3.28 (0.07) 1.52 (0.08) 66.0 (12.0) 11.7 (0.4)
CnaΔEAG 0.30 (0.04) 3.86 (0.05) 14.5 (0.1) 0.73 (0.01) 625 (115) 5.0 (0.4)
CnaQ̂IE 2.70 (0.15) 16.0 (2.0)
CnaA1 17.6 (2.5) 3.89 (0.08) 4.85 (0.4) 0.46 (0.01) 19.0 (3.0) 12.8 (1.1)
CnaA2 11.8 (0.5) 2.46 (0.07) 2.32 (0.1) 1.71 (0.08) 88.5 (6.5) 10.0 (0.4)
CnaA3 11.6 (1.5) 15.9 (1.7)
CnaA4 37.7 (2.0) 15.5 (1.8)
CnaA1A4 26.2 (10.5) 13.6 (1.4)
Ace32–367 43.2 (8.6) 13.7 (1.4)
AceΔQIE 29.0 (0.3) 11.3 (1.2)
AceΔETGQIE 30.1 (0.8) 9.6 (1.3)
AceC1 63.0 (3.0) 3.5 (0.5)
AceC2 51.7 (1.8) 4.6 (0.6)
AceC3 36.8 (0.4) 13.7 (1.8)
AceC4 34.4 (3.2) 0.93 (0.05) 6.68 (0.30) 0.62 (0.16) 2.2 (0.3) 5.0 (0.3)
AceC1C4 1.52 (0.15) 1.65 (0.17) 7.34 (0.49) 0.28 (0.01) 41.5 (12.5) 5.6 (0.3)

This simplified two-state reaction model involves two steps: 1) the formation of the CI·Cna31–344 complex (ka1 and kd1) and 2) a conformational change that may occur after formation of the complex (ka2 and kd2) (Fig. 1C, Table 1). The interaction of Cna31–344 with collagen involves several steps. We propose that after the initial interaction between collagen and the N2 subdomain of a Cna31–344 molecule in the closed form, additional conformational changes occur, including opening of the N1-N2 subdomains, reorientation of the N1 subdomain, which allows the interdomain linker to wrap around the collagen, and the latching event that stabilizes the collagen·Cna31–344 complex (supplemental Fig. S1). Although we believe that this model is most likely, we cannot exclude a mechanism where the open form and closed form of Cna31–344 are in an equilibrium and only the open form is capable of binding the collagen molecule (15).

In an effort to understand the molecular bases for these biochemical differences, we compared the structures of the Ace and Cna ligand-binding domains and found significant variation in the interdomain linkers and interdomain orientation (Fig. 1A). The interdomain hole of Ace32–367, formed by the interdomain linker and the N1-N2 subdomains, is more spacious than that of Cna31–344. Notably, structure-based sequence alignment showed that the Ace32–367 interdomain linker is 40% longer than that of Cna31–344 and likely contributes to the larger interdomain space (Fig. 1D). In addition, the crystal structure of Cna31–344 in complex with a collagen peptide shows the interdomain linker tightly wrapped around the ligand, suggesting that 8 amino acids is the optimal interdomain linker length. Therefore, it is possible that the Ace32–367 interdomain linker may be longer than required for monomeric collagen binding. Furthermore, the sequences of the Ace and Cna interdomain linkers share little sequence homology (Fig. 1D). Considering that structural data demonstrate specific contacts between the Cna31–344 interdomain linker and residues within a synthetic collagen peptide, it is possible that sequence differences within this structural element could influence ligand binding (15).

Structural comparisons also revealed that the interdomain orientations of the Ace32–367 N1 and N2 subdomains are altered by about 36° when compared with that of Cna31–344 (16). As a result, the apo-Ace structure is in a partially open conformation with the latch partially closed (16). Although it is possible that this altered orientation represents apo-Ace32–367 in equilibrium between the open and closed conformation (16), molecular modeling predicts that Ace32–367 cannot adopt the interdomain orientation exhibited by Cna31–344. Specifically, the C-D loop at the N terminus of strand D located within the Ace32–367 N1 subdomain is longer than the corresponding C-D loop of Cna31–344 (Fig. 2A). Molecular modeling of Ace32–367 in the Cna31–344 conformation predicts that the Ace32–367 C-D loop located within the N1 subdomain would clash with the A-B loop of the Ace32–367 N2 subdomain (Fig. 2B). Therefore, it is likely that Ace32–367 utilizes a more open conformation than Cna31–344 to avoid an apparent steric conflict between the two subdomains.

FIGURE 2.

FIGURE 2.

Implications of the Ace32–367 interdomain orientation. A, structure of Ace32–367. The open conformation exhibited by Ace32–367 allows the A-B loop within the N2 subdomain (yellow ribbon) to avoid steric conflicts with the C-D loop of the N1 subdomain (green ribbon). For clarity, the shorter Cna31–344 C-D loop (red) is superimposed on the Ace N1 subdomain. B, structural model of Ace32–367 in the Cna31–344 conformation. The long C-D loop within the N2 subdomain (green ribbon) makes severe clashes with the A-B loop of the N1 subdomain (yellow ribbon), suggesting that Ace32–367 cannot adopt a Cna-like conformation.

The altered interdomain orientation of the Ace32–367 N1-N2 subdomains and/or the more spacious interdomain hole of Ace32–367 could result in a loose or “slippery” binding region, which might contribute to the lower binding affinity and less stable binding, reflected by a faster dissociation when compared with Cna31–344 (Fig. 1B). To understand how these structural differences contribute to the observed differences in ligand binding, we evaluated the binding kinetics of several recombinant Ace and Cna proteins containing mutations that are predicted to alter the possible orientation of the N1 and N2 subdomains as well as the sequence and length of the interdomain linkers (Fig. 3).

FIGURE 3.

FIGURE 3.

Description of parent and mutant recombinant Ace and Cna proteins. A, crystal structure of Ace32–367 demonstrating the location of interdomain linker (blue dotted line) and C-D loop (red) within the N2 subdomain where sequences were altered. B, list describing the parent and mutant recombinant proteins and corresponding sequences of the interdomain linker (blue) and C-D loop region (red). The Cna sequences are in uppercase, and the Ace sequences are lowercase and bolded. C, diagram of N1 subdomain and the latching segment following the N2 subdomain (yellow) with labeled β-strands. The segment of the N1 subdomain that is altered in the protein chimeras is shown in red.

Linker Length Affects Binding to Type I Collagen

To address the contribution of interdomain linker length to collagen binding, we engineered Cna and Ace proteins containing elongated and/or truncated interdomain linkers (Fig. 3) and compared the affinity and binding kinetics to immobilized CI using SPR (Figs. 4 and 5). We determined that the interdomain linker length influenced multiple aspects of in vitro CI binding, including affinity and binding ratio (number of MSCRAMMs bound to a single collagen molecule).

FIGURE 4.

FIGURE 4.

Biacore analysis of the interactions between CI and the recombinant Cna proteins. 2-fold increasing concentrations of Cna proteins were injected over the immobilized CI (180 RU for A, D, and E; and 400 RU for B, C, F, G, and H). Sensorgrams are shown in black with lower curve corresponding to lower concentration of Cna protein injected. Kinetic analysis for A, C, D, and E (the fitted curves are shown in red) was performed to obtain the rate constants (shown in Table 1) KDapp and Rmax. For steady-state interactions (B, F, G, and H), a binding isotherm was created (inset) to determine equilibrium KD and Rmax. Drawings in each graph indicate the general location of the mutations. The presence of the mutations within the interdomain linker is represented as a filled in arc, and the mutations affecting the interdomain orientation are represented as a filled circle.

FIGURE 5.

FIGURE 5.

Biacore analysis of the interactions between CI and the recombinant Ace proteins. Ace proteins were injected over the immobilized CI (440 RU for A–F; and 150 RU for G and H). For steady-state interaction (A–F), a binding isotherm was created (inset) to determine equilibrium KD and Rmax. Kinetic analysis was performed for G and H to obtain the rate constants (shown in Table 1) KDapp and Rmax. Drawings in each graph indicate the general location of the mutations. The presence of the mutations within the interdomain linker is represented as a filled in arc, and the mutations affecting the interdomain orientation are represented as a filled circle.

Cna proteins containing interdomain linkers harboring insertions or deletions demonstrated significantly lower CI binding affinities than Cna31–344, indicating that the length of the Cna interdomain linker is optimal for CI affinity in vitro (Fig. 4, A–C, Table 1). The addition of 3 amino acids to the Cna interdomain linker resulted in a protein (Cna∧QIE) with an ∼40-fold decrease in binding affinity (Table 1, Fig. 4B), whereas deleting three residues from the interdomain linker yielded a protein (CnaΔEAG) with an ∼9-fold decrease in affinity for CI (Table 1, Fig. 4C). The length of the interdomain linker also influenced the number of sites in CI recognized by the MSCRAMM; Cna∧QIE showed an increased binding ratio of 16, and CnaΔEAG exhibited a decreased binding ratio of 5. A binding ratio of 13 was calculated for Cna31–344 (Table 1). Taken together, these data demonstrate that the native Cna interdomain linker length is optimal for CI binding in vitro as revealed by affinity calculations. However, a longer interdomain linker may allow the protein to recognize a greater number of binding sites within the collagen molecule (Table 1).

Removing residues from the Ace interdomain linker yielded proteins with CI affinities that were only marginally higher than the parent protein, Ace32–366 (KD ∼43.2 μm); AceΔQIE exhibited a KD of ∼29.0 μm, and AceΔETGQIE exhibited a KD of ∼30.1 μm (Fig. 5, A–C, Table 1). However, deleting residues within the Ace interdomain linker negatively affected the number of sites recognized on the collagen molecule. Although the parent protein exhibited a binding ratio (N) of 14, binding ratios of 11 and 10 were calculated for AceΔQIE and AceΔETGQIE, respectively (Table 1). These results indicate that decreasing the length of the Ace interdomain linker does not strongly influence affinity of Ace for CI, but as with Cna, the longer interdomain linker may allow the protein to recognize a larger number of binding sites within CI.

Because of the repetitive nature and linear structure of the CI molecule, the multiple MSCRAMM-binding sites observed in SPR likely represent independent binding sites. However, to confirm that the data are not due to aggregation of the injected proteins, we examined binding of solubilized CI to immobilized Cna31–344 (supplemental Fig. S2). The resulting binding ratio of 1:11 demonstrates that one CI molecule was recognized by ∼11 Cna31–344 molecules on the sensor surface, which is close to the N of 11.7 observed in experiments utilizing soluble Cna31–344 and immobilized CI (Table 1). Differences in binding ratio are expected because immobilization is random. Therefore, a portion of the immobilized Cna31–344 will not be able to interact with the CI molecule. Furthermore, the extremely slow off-rate observed in the SPR sensorgrams in supplemental Fig. S2 is due to the effects of valency, which occurs when a single CI molecule is anchored by multiple immobilized Cna31–344.

Linker Sequence Affects Affinity and Binding Site Recognition for Type I Collagen

The Ace and Cna interdomain linker sequences share little homology, and structural analysis of the MSCRAMM·ligand complex revealed that amino acids within the Cna interdomain linker make specific contacts with the collagen peptide (15). Specifically, Val-172, located near the C terminus of the interdomain linker, interacts with a proline residue from the leading chain of the collagen peptide. This interaction may facilitate repositioning of the N1 subdomain, which is an important step in the ligand binding process of Cna (15). Currently, a structure for the Ace ligand-binding domain in complex with collagen has not been resolved. Therefore, specific residues within the interdomain linker that make contact with the collagen ligand are unknown. However, based on position within the Ace interdomain linker, it is likely that the C-terminal residues Tyr-176 and/or Pro-177 play a role similar to that of Val-172.

To understand how the sequence of the interdomain linker contributes to collagen binding, we replaced the 8-amino acid interdomain linker from Cna31–344 with 1) the full-length, 14-amino acid Ace32–367 linker, 2) a 12-amino acid Ace linker lacking two N-terminal residues, and 3) a 10-amino acid Ace linker lacking two N- and two C-terminal residues, creating chimeric proteins CnaA1, CnaA2, and CnaA3, respectively (Fig. 3). Similarly, the Ace32–367 interdomain linker was replaced with 1) the 8-amino acid Cna linker, 2) a 10-amino acid sequence containing the Cna interdomain linker plus two C-terminal Ace linker residues, and 3) a 12-amino acid sequence containing the Cna interdomain linker flanked by 2 amino acids each from the N and C termini of the Ace linker to generate the chimeric proteins AceC1, AceC2, and AceC3, respectively (Fig. 3). We found that changes to the length and sequence of the interdomain linkers impacted the binding kinetics of the proteins and underscored the importance of specific residues within the interdomain linker sequence.

SPR analysis revealed that the sequence of the interdomain linker influenced the CI affinity and binding ratio. CnaA1, which contains the full-length Ace interdomain linker, exhibited a higher affinity for CI (KDapp ∼19.0 nm) than the native Cna31–344, indicating that the sequence of the longer Ace interdomain linker can facilitate high affinity binding to CI in vitro (Fig. 4, A and D, Table 1). Additionally, removing the two N-terminal Ace interdomain linker residues, as in CnaA2, yielded a protein with an affinity (KDapp ∼88.5 nm) similar to that of Cna31–344 (Fig. 4, A and E, Table 1), whereas CnaA3, which is lacking the N- and C-terminal Ace interdomain linker residues, exhibited a much lower affinity for CI (KD ∼11.6 μm) (Fig. 4, A and F, Table 1). The binding ratio for CnaA3 was the highest (N ∼16), whereas the binding ratios of CnaA1 (N ∼13) and CnaA2 (N ∼10) were more similar to Cna31–344 (N ∼12) (Table 1).

We determined that changes to the length and sequence of the Ace interdomain linkers resulted in modest changes in CI affinity (Table 1, Fig. 5, A, D, and E). However, the numbers of CI-binding sites were significantly altered (Table 1). Specifically, Ace32–367 and AceC3, which harbor interdomain linkers containing the N-terminal residues, Val-164 and Thr-165, and the C-terminal residues, Tyr-176 and Pro-177, exhibited binding ratios of 14. Proteins containing interdomain linkers lacking the N- and C-terminal residues, AceC1 (lacks Ace residues Val-164, Thr-165, Tyr-176, and Pro-177) and AceC2 (lacks Tyr-176 and Pro-177), exhibited binding ratios ranging from 4 to 5.

Taken together, the SPR data obtained from the engineered Cna and Ace proteins harboring interdomain linkers with altered sequences and lengths suggest that the N- and C- terminal residues of the Ace interdomain linker positively influence binding to CI in vitro. Moreover, it is possible that the C-terminal residues, YP, present within the Ace interdomain linker sequence stack better with the CI proline residues than the corresponding SV amino acids from the Cna interdomain linker (Fig. 3). As a result, proteins containing the YP residues may be able to anchor the collagen molecule within the N2 domain, suggesting that the ligand affinity is affected more by the sequence than by the length of the interdomain linker.

The Interdomain Orientation Affects Affinity and Binding Kinetics for Type I Collagen

Changes to the Ace interdomain sequence and length did not result in recombinant proteins that demonstrate stable interactions with immobilized CI, as was observed for the engineered Cna proteins, indicating that another aspect of the Ace32–367 structure is responsible for the low affinity and fast on- and off-rate. Analysis of the structure of apo-Ace32–367 reveals a protein with a partially open conformation. The Ace32–367 N1 subdomain contains a short D-D′ loop, a short D strand, and a large C-D loop, whereas the Cna31–344 N1 subdomain contains a large D-D′ loop, a longer D strand, and a short C-D loop (Figs. 3 and 2C). The larger C-D loop likely precludes the Ace32–367 protein from adopting a conformation that is optimal for ligand binding, thus preventing a tight interaction with collagen (Fig. 2B). In addition, the larger D-D′ loop found in Cna31–344 forms part of the latching trench and may yield a stronger latching event than the shorter Ace32–367 D-D′ loop. The open conformation and/or weaker latching event may explain why Ace32–367 has a lower affinity and more rapid off-rate for immobilized CI when compared with Cna31–344.

To understand how structural regions that do not interact with the ligand can influence orientation of the N1 and N2 subdomains, we engineered a protein that was predicted to adopt a more closed conformation than Ace32–367. We replaced the amino acid sequence spanning the Ace32–367 C-D loop, D-strand, and D-D′ loop with the corresponding residues from Cna, creating AceC4 (Fig. 3). In addition, we replaced the Cna31–344 C-D loop, D-strand, and D-D′ loop with the corresponding sequence from Ace to create CnaA4 (Fig. 3), which should adopt a conformation that is more open than that exhibited by Cna31–344.

SPR analysis demonstrated that changes predicted to influence the N1 and N2 interdomain orientation affect several aspects of the MSCRAMM/CI interaction. The CI affinity of AceC4 was ∼11,000-fold higher than Ace32–367 (Fig. 5, A and G, Table 1), whereas CnaA4 exhibited an ∼25-fold decrease in affinity for CI when compared with Cna31–344 (Fig. 4, A and F, Table 1). These results suggest that a more closed conformation allows a tighter interaction with CI.

The partially open conformation observed for apo-Ace32–367 may explain why the purified Ace proteins containing the shortest interdomain linkers (AceC1 and AceΔETGQIE) exhibited the lowest binding ratios for CI. In an effort to assess the relationship between the interdomain linker and the interdomain orientation, we created AceC1C4, which contains the shorter Cna interdomain linker and the shorter Cna C-D loop and CnaA1A4 that harbors the full-length Ace interdomain linker and the larger Ace C-D loop (Fig. 4). AceC1C4 showed a 600-fold increase in CI affinity when compared with AceC1, which harbors the Cna linker but not the smaller C-D loop (Fig. 5, A and F, Table 1). However, CnaA1A4 (KD ∼26.2 μm), which contains the Ace interdomain linker sequence and the larger C-D loop, showed an affinity for C1 that was comparable with CnaA4 (KD ∼37.7 μm). Together, these data suggest that removing the apparent steric hindrance from the Ace N1 and N2 subdomains allows the Ace protein to accommodate a shorter interdomain linker.

We also found that mutations to the C-D loop region, which would likely influence the interdomain orientation of Ace32–367 and Cna31–344 proteins, significantly altered the overall binding kinetics of the engineered proteins, resulting in an “Ace-like” Cna and a “Cna-like” Ace. The Cna proteins (CnaA4 and CnaA1A4) containing the larger C-D loop exhibited similar affinities and binding profiles (a rapid association and complete dissociation) to CI as Ace32–366 (Fig. 4, A, G and H, Table 1). On the other hand, SPR analysis of the AceC4 and AceC1C4 interaction with immobilized CI revealed kinetic data that were more similar to results obtained with Cna31–344 (Figs. 4A and 5, G and H, Table 1). This is most evident when comparing the dissociation of the purified proteins. Dissociation from CI by AceC4 and AceC1C4 is slow and incomplete (Fig. 5, G and H, Table 1) and suggests an interaction with CI that is more complex than the parent protein Ace32–367 and may involve a conformational change.

Altering the C-D loop also affected the number of CI-binding sites recognized by the proteins. The binding ratios (N) for the purified Ace proteins containing the smaller C-D loop were significantly lower (AceC4 N ∼5, AceC1C4 N ∼4) than the parent protein (Ace32–367 N ∼14), whereas the binding ratios for CnaA4 (N ∼15.5) and CnaA1A4 (N ∼13.6) were higher than the binding ratio for Cna31–344 (N ∼11.7) (Table 1), suggesting that a more open conformation may allow the protein to recognize a greater number of binding sites within the CI molecule (Table 1).

Our data demonstrate that the lower affinity and unstable binding kinetics for immobilized CI exhibited by Ace32–367 are in large part due to the presence of a longer loop that connects the C and D strands of the Ace32–367 N1 subdomain, which likely results in an altered orientation of the N1 and N2 subdomains when compared with Cna31–344. Furthermore, these results indicate that the longer Ace interdomain linker may be necessary to accommodate the altered interdomain orientation observed in the apo crystal structure of Ace (16). Surprisingly, the SPR data revealed that a more open conformation may allow the proteins to recognize a greater number of binding sites on the CI molecule as all proteins containing the Ace N1 sequence, and presumably a more open conformation, exhibited higher binding ratios.

Concluding Remarks

Our studies point to an inverse correlation between affinity and binding site specificity. However, it is not clear whether a high affinity interaction or the ability to recognize multiple binding sites would be more advantageous for mediating bacterial attachment to a collagenous surface. A high affinity interaction between a surface-anchored protein and a host ligand would facilitate attachment and proliferation. On the other hand, recognition of a greater number of binding sites would result in increased avidity, and disassociation at low affinity motifs by a fraction of the MSCRAMMs may not result in detachment of the bacterium from the substrate. We are currently working to understand how alterations within the ligand-binding domains of Ace and Cna affect adherence of E. faecalis and S. aureus to CI, CIV, and Ln in vitro. We are also examining how the presence of the ligand-binding domains affects virulence in different animal models. These studies may also yield insight into how affinity and binding site specificity affect the host/pathogen interaction.

This study demonstrates that the MSCRAMM/CI interaction is complex as several structural features influence ligand binding affinity, specificity, and kinetics. For the Cna-like MSCRAMMs, the interdomain orientation appears to be most important followed by the interdomain linker sequence and finally interdomain linker length. Our data indicate that although the regions within Ace32–367 that are predicted to interact with collagen should facilitate a tighter interaction, the more open orientation and larger interdomain hole adopted by Ace32–367 are not optimal for high affinity CI binding. This structural variation may allow Ace to bind additional rod-shaped ligands with a relatively larger diameter or recognize different collagen arrangements. Therefore, it is possible that small variations in the structure could dictate ligand specificity, underscoring the need to study key MSCRAMM/ligand interactions individually in submolecular detail.

Acknowledgment

We thank Jose Rivera for editing the manuscript.

*

This work was supported, in whole or in part, by National Institutes of Health Grant R01AI047923 from the NIAID (to B. E. M. and M. H.). This work was also supported by American Heart Association Fellowship 10POST4160090 (to C. L. R.). A patent (pending) application has been filed based on some results in the manuscript.

3
The abbreviations used are:
MSCRAMM
microbial surface components recognizing adhesive matrix molecule
CI
collagen type I
CIV
collagen type IV
SPR
surface plasmon resonance
RU
response units.

REFERENCES

  • 1. Patti J. M., Allen B. L., McGavin M. J., Höök M. (1994) MSCRAMM-mediated adherence of microorganisms to host tissues. Annu. Rev. Microbiol. 48, 585–617 [DOI] [PubMed] [Google Scholar]
  • 2. Hawiger J., Timmons S., Strong D. D., Cottrell B. A., Riley M., Doolittle R. F. (1982) Identification of a region of human fibrinogen interacting with staphylococcal clumping factor. Biochemistry 21, 1407–1413 [DOI] [PubMed] [Google Scholar]
  • 3. Greene C., McDevitt D., Francois P., Vaudaux P. E., Lew D. P., Foster T. J. (1995) Adhesion properties of mutants of Staphylococcus aureus defective in fibronectin-binding proteins and studies on the expression of fnb genes. Mol. Microbiol. 17, 1143–1152 [DOI] [PubMed] [Google Scholar]
  • 4. Foster T. J., Höök M. (1998) Surface protein adhesins of Staphylococcus aureus. Trends Microbiol. 6, 484–488 [DOI] [PubMed] [Google Scholar]
  • 5. Schneewind O., Model P., Fischetti V. A. (1992) Sorting of protein A to the staphylococcal cell wall. Cell 70, 267–281 [DOI] [PubMed] [Google Scholar]
  • 6. Schneewind O., Fowler A., Faull K. F. (1995) Structure of the cell wall anchor of surface proteins in Staphylococcus aureus. Science 268, 103–106 [DOI] [PubMed] [Google Scholar]
  • 7. Patti J. M., Boles J. O., Höök M. (1993) Identification and biochemical characterization of the ligand-binding domain of the collagen adhesin from Staphylococcus aureus. Biochemistry 32, 11428–11435 [DOI] [PubMed] [Google Scholar]
  • 8. Rich R. L., Kreikemeyer B., Owens R. T., LaBrenz S., Narayana S. V., Weinstock G. M., Murray B. E., Höök M. (1999) Ace is a collagen-binding MSCRAMM from Enterococcus faecalis. J. Biol. Chem. 274, 26939–26945 [DOI] [PubMed] [Google Scholar]
  • 9. Nallapareddy S. R., Weinstock G. M., Murray B. E. (2003) Clinical isolates of Enterococcus faecium exhibit strain-specific collagen binding mediated by Acm, a new member of the MSCRAMM family. Mol. Microbiol. 47, 1733–1747 [DOI] [PubMed] [Google Scholar]
  • 10. Lannergård J., Frykberg L., Guss B. (2003) CNE, a collagen-binding protein of Streptococcus equi. FEMS Microbiol. Lett. 222, 69–74 [DOI] [PubMed] [Google Scholar]
  • 11. Sato Y., Okamoto K., Kagami A., Yamamoto Y., Igarashi T., Kizaki H. (2004) Streptococcus mutans strains harboring collagen-binding adhesin. J. Dent. Res. 83, 534–539 [DOI] [PubMed] [Google Scholar]
  • 12. Shimoji Y., Ogawa Y., Osaki M., Kabeya H., Maruyama S., Mikami T., Sekizaki T. (2003) Adhesive surface proteins of Erysipelothrix rhusiopathiae bind to polystyrene, fibronectin, and type I and IV collagens. J. Bacteriol. 185, 2739–2748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Xu Y., Liang X., Chen Y., Koehler T. M., Höök M. (2004) Identification and biochemical characterization of two novel collagen-binding MSCRAMMs of Bacillus anthracis. J. Biol. Chem. 279, 51760–51768 [DOI] [PubMed] [Google Scholar]
  • 14. Symersky J., Patti J. M., Carson M., House-Pompeo K., Teale M., Moore D., Jin L., Schneider A., DeLucas L. J., Höök M., Narayana S. V. (1997) Structure of the collagen-binding domain from a Staphylococcus aureus adhesin. Nat. Struct. Biol. 4, 833–838 [DOI] [PubMed] [Google Scholar]
  • 15. Zong Y., Xu Y., Liang X., Keene D. R., Höök A., Gurusiddappa S., Höök M., Narayana S. V. (2005) A “Collagen Hug” model for Staphylococcus aureus CNA binding to collagen. EMBO J. 24, 4224–4236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Liu Q., Ponnuraj K., Xu Y., Ganesh V. K., Sillanpää J., Murray B. E., Narayana S. V., Höök M. (2007) The Enterococcus faecalis MSCRAMM ACE binds its ligand by the Collagen Hug model. J. Biol. Chem. 282, 19629–19637 [DOI] [PubMed] [Google Scholar]
  • 17. Switalski L. M., Speziale P., Höök M. (1989) Isolation and characterization of a putative collagen receptor from Staphylococcus aureus strain Cowan 1. J. Biol. Chem. 264, 21080–21086 [PubMed] [Google Scholar]
  • 18. Patti J. M., Bremell T., Krajewska-Pietrasik D., Abdelnour A., Tarkowski A., Rydén C., Höök M. (1994) The Staphylococcus aureus collagen adhesin is a virulence determinant in experimental septic arthritis. Infect. Immun. 62, 152–161 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Xu Y., Rivas J. M., Brown E. L., Liang X., Höök M. (2004) Virulence potential of the staphylococcal adhesin CNA in experimental arthritis is determined by its affinity for collagen. J. Infect. Dis. 189, 2323–2333 [DOI] [PubMed] [Google Scholar]
  • 20. Nallapareddy S. R., Qin X., Weinstock G. M., Höök M., Murray B. E. (2000) Enterococcus faecalis adhesin, Ace, mediates attachment to extracellular matrix proteins collagen type IV and laminin as well as collagen type I. Infect. Immun. 68, 5218–5224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Hubble T. S., Hatton J. F., Nallapareddy S. R., Murray B. E., Gillespie M. J. (2003) Influence of Enterococcus faecalis proteases and the collagen-binding protein, Ace, on adhesion to dentin. Oral Microbiol. Immunol. 18, 121–126 [DOI] [PubMed] [Google Scholar]
  • 22. de Bentzmann S., Tristan A., Etienne J., Brousse N., Vandenesch F., Lina G. (2004) Staphylococcus aureus isolates associated with necrotizing pneumonia bind to basement membrane type I and IV collagens and laminin. J. Infect. Dis. 190, 1506–1515 [DOI] [PubMed] [Google Scholar]
  • 23. Hienz S. A., Schennings T., Heimdahl A., Flock J. I. (1996) Collagen binding of Staphylococcus aureus is a virulence factor in experimental endocarditis. J. Infect. Dis. 174, 83–88 [DOI] [PubMed] [Google Scholar]
  • 24. Mamo W., Fröman G., Müller H. P. (2000) Protection induced in mice vaccinated with recombinant collagen-binding protein (CnBP) and α-toxoid against intramammary infection with Staphylococcus aureus. Microbiol. Immunol. 44, 381–384 [DOI] [PubMed] [Google Scholar]
  • 25. Rhem M. N., Lech E. M., Patti J. M., McDevitt D., Höök M., Jones D. B., Wilhelmus K. R. (2000) The collagen-binding adhesin is a virulence factor in Staphylococcus aureus keratitis. Infect. Immun. 68, 3776–3779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Elasri M. O., Thomas J. R., Skinner R. A., Blevins J. S., Beenken K. E., Nelson C. L., Smeltzer M. S. (2002) Staphylococcus aureus collagen adhesin contributes to the pathogenesis of osteomyelitis. Bone 30, 275–280 [DOI] [PubMed] [Google Scholar]
  • 27. Singh K. V., Nallapareddy S. R., Sillanpää J., Murray B. E. (2010) Importance of the collagen adhesin Ace in pathogenesis and protection against Enterococcus faecalis experimental endocarditis. PLoS Pathog. 6, e1000716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Nilsson I. M., Patti J. M., Bremell T., Höök M., Tarkowski A. (1998) Vaccination with a recombinant fragment of collagen adhesin provides protection against Staphylococcus aureus-mediated septic death. J. Clin. Invest. 101, 2640–2649 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Papworth C., Bauer J. C., Braman J., Wright D. (1996) Site-directed mutagenesis in one day with >80% efficiency. Strategies 9, 3–4 [Google Scholar]
  • 30. Ho S. N., Hunt H. D., Horton R. M., Pullen J. K., Pease L. R. (1989) Site-directed mutagenesis by overlap extension using the polymerase chain reaction. Gene 77, 51–59 [DOI] [PubMed] [Google Scholar]
  • 31. Barbu E. M., Ganesh V. K., Gurusiddappa S., Mackenzie R. C., Foster T. J., Sudhof T. C., Höök M. (2010) β-Neurexin is a ligand for the Staphylococcus aureus MSCRAMM SdrC. PLoS Pathog. 6, e1000726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Emsley P., Cowtan K. (2004) Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol. Crystallogr. 60, 2126–2132 [DOI] [PubMed] [Google Scholar]
  • 33. Krissinel E., Henrick K. (2004) Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr D Biol. Crystallogr. 60, 2256–2268 [DOI] [PubMed] [Google Scholar]
  • 34. Biacore Life Sciences (2004) BIAevaluation Software Handbook, GE Healthcare, Pittsburgh, PA [Google Scholar]

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES