Abstract
Gram-positive bacteria contain sortase enzymes on their cell surfaces that catalyze transpeptidation reactions critical for proper cellular function. In vitro, sortases are used in sortase-mediated ligation (SML) reactions for a variety of protein engineering applications. Historically, sortase A from Staphylococcus aureus (saSrtA) has been the enzyme of choice to catalyze SML reactions. However, the stringent specificity of saSrtA for the LPXTG sequence motif limits its uses. Here, we describe the impact on substrate selectivity of a structurally conserved loop with a high degree of sequence variability in all classes of sortases. We investigate the contribution of this β7–β8 loop by designing and testing chimeric sortase enzymes. Our chimeras utilize natural sequence variation of class A sortases from eight species engineered into the SrtA sequence from Streptococcus pneumoniae. While some of these chimeric enzymes mimic the activity and selectivity of the WT protein from which the loop sequence was derived (e.g., that of saSrtA), others results in chimeric Streptococcus pneumoniae SrtA enzymes that are able to accommodate a range of residues in the final position of the substrate motif (LPXTX). Using mutagenesis, structural comparisons, and sequence analyses, we identify three interactions facilitated by β7–β8 loop residues that appear to be broadly conserved or converged upon in class A sortase enzymes. These studies provide the foundation for a deeper understanding of sortase target selectivity and can expand the sortase toolbox for future SML applications.
Keywords: sortases, enzymes, target selectivity, structural biology, protein biochemistry, protein engineering
Abbreviations: Abz, 2-aminobenzoyl fluorophore; baSrtA, Bacillus anthracis SrtA; CWSS, cell wall sorting signal; Dnp, 2,4-dinitrophenyl quencher; lmSrtA, Listeria monocytogenes SrtA; saSrtA, sortase A from Staphylococcus aureus; SEC, size-exclusion chromatography; SML, sortase-mediated ligation; spSrtA, Streptococcus pneumoniae SrtA; spSrtAfaecalis, β7–β8 loop residues from Enterococcus faecalis; spSrtAlactis, β7–β8 loop residues from Lactococcus lactis; spSrtAmonocytogenes, β7–β8 loop residues from Listeria monocytogenes; spSrtAoralis, β7–β8 loop residues from Streptococcus oralis; spSrtAsuis, β7–β8 loop residues from Streptococcus suis
Sortases are cysteine transpeptidase enzymes that gram-positive bacteria use to covalently attach proteins to their cell wall for various functions, including the assembly of pili or display of virulence factors (1, 2, 3). There are six recognized classes of sortase enzymes (classes A–F), with roles in vivo ranging from general purpose or “housekeeping” functions (classes A and E), to more specific roles such as the construction of the bacterial pilus (class C) (1, 4). These enzymes recognize a cell wall sorting signal (CWSS) on the outer membrane of gram-positive bacteria (1, 5). For class A sortases, the CWSS is the sequence LPXTG (1, 5). Using previously published numbering (L = P4, P = P3, X = P2, T = P1, and G = P1′), P4, P3, and/or P1′ of this motif vary among different classes (5). After target recognition, a His–Cys–Arg catalytic triad facilitates a transpeptidation reaction whereby the CWSS is first cleaved between the P1 and P1′ residues via nucleophilic attack by the catalytic Cys, resulting in a thioester linkage with the P1 position of the CWSS. Resolution of this acyl-enzyme intermediate is then achieved by nucleophilic attack by an amino group displayed on the cell wall building block lipid II, or in the case of pilus formation, displayed on a separate protein subunit (1, 3, 5, 6). The final result is the formation of a new amide linkage, with the portion of the substrate N-terminal to the CWSS now covalently attached at its C terminus to the amine nucleophile ligation partner.
The ability to cleave a substrate sequence and subsequently ligate a second component (for example a protein or synthetic peptide derivative) makes sortases an attractive tool for protein engineering efforts, commonly called sortase-mediated ligation (SML) or sortagging (3). Sortase A from Staphylococcus aureus (saSrtA) was the first of these enzymes discovered and continues to see widespread use for in vitro SML experiments (1, 7). Notable improvements in SML technology have occurred in recent years, including strategies for limiting the reversibility of the ligation reaction and the development of saSrtA variants with dramatically improved catalytic efficiency (3, 8, 9). However, as a consequence of the narrow substrate selectivity of saSrtA (10), the majority of SML examples rely on the combination of one ligation partner displaying an LPXTG motif near its C-terminus with another possessing one or more N-terminal glycines. This restricted substrate scope can be advantageous, for example, in the use of SML for labeling specific polypeptides in complex mixtures, but it also represents a limitation for certain applications (9, 11, 12). Highlighting this point, an increasing number of studies have demonstrated that the use of naturally occurring sortases or engineered sortases with altered substrate selectivity offers distinct advantages such as reducing the necessity for point mutations in protein semisynthesis applications (12), enabling the labeling of endogenous proteins that do not naturally contain the LPXTG motif (11, 13), and allowing labeling of multiple sites within the same protein target (11, 14). Thus, the engineering and discovery of sortases with altered substrate profiles, along with a better understanding of the biochemical basis for sortase substrate selectivity, represent important areas for the continued development of SML technology.
Previous mutagenesis and structural studies of various sortases provide a wealth of knowledge about substrate recognition, including initial ligand recognition and subsequent cleavage (thioesterification), as well as nucleophile recognition and mechanistic details of peptide ligation (transpeptidation) (1, 2, 15). Specifically, the catalytic residues of all native sortases identified to date are (using saSrtA numbering unless specified otherwise) as follows: His120 (general acid/base), Cys184 (nucleophile, acyl-enzyme intermediate), and Arg197 (transition state stabilization) (Fig. 1A) (1, 15). In addition, directed evolution studies have identified mutations (P94R/D160N/D165A/K190E/K196T) that are together able to boost the catalytic efficiency of saSrtA by 120-fold (8). Of these five mutations, several are in two of the three structurally conserved loops in class A sortases located near the peptide-binding cleft: those between the β4, β5 strands (β4–β5 loop), the β6, β7 strands (β6–β7 loop, where D165A occurs), and the β7, β8 strands (β7–β8 loop, where K190E and K196T are located). Notably, while the increase in enzyme activity afforded by these mutations included a 3.6-fold increase in kcat, the effect was dominated by a 33-fold decrease in KM, suggesting these loop residues may be important in CWSS recognition (8).
Figure 1.
The sortase-fold is conserved in all classes of bacterial sortases. A, the peptide-bound structure of S. aureus SrtA (saSrtA) is shown in cartoon representation, with β-strands colored and labeled (PDB ID: 2KID) (19). The side chains of the catalytic residues (H120, C184, and R197) are shown as sticks, colored by heteroatom (O = red, N = blue, S = yellow), and labeled. The disulfide-linked peptide analog, Cbz-LPAT∗, where Cbz is a carbobenzyloxy protecting group and T∗ is (2R,3S)-3-amino-4-mercapto-2-butanol, is shown as black sticks and colored by heteroatom (19). A zoomed-in version of the active site is shown in the black box, with features indicated as in panel A. The variable loops are labeled and indicated by gray arrows. B, the overall sortase fold is well conserved in proteins of different classes. Here, structures for class B (PDB ID: 1NG5), class C (3O0P), class D (2LN7), class E (5CUW), and class F (5UUS) sortases are shown in cartoons, with conserved β-strands colored as in panel A, highlighting the 8-stranded sortase fold. The conserved catalytic triad is shown in sticks (and colored by heteroatom) for all.
Additional evidence for the role of loop residues was obtained from more targeted directed evolution and mutagenesis studies. For example, it was demonstrated that the β6–β7 loop of saSrtA directly confers specificity at P4 of the recognition motif (LPXTG), and residues other than leucine (L) can be accommodated using sortases with mutations in the β6–β7 loop (12, 16, 17). Indeed, substitution of the β6–β7 loop residues from saSrtB into the saSrtA enzyme alters substrate recognition to that of a sortase B protein (NPQTN) (18). Turning to the β7–β8 loop, the NMR structure of saSrtA covalently bound to a modified LPAT∗ peptide mimetic revealed a noncovalent interaction between W194 in saSrtA and the Thr residue in P1 (LPXTG) (19, 20). Mutation of W194 in saSrtA decreased the reaction rate, although it was not essential to catalysis (20). Taken together, these past studies reveal that sequence variation within sortase loops directly affects both activity and selectivity for target ligands. Furthermore, conservation of the closed eight-stranded β-barrel core in all sortase A-F structures that have been reported to date suggests that these principles may apply to non–class A sortases as well (Fig. 1B) (2).
In this work, we specifically look at natural sequence variation in the β7–β8 loop of class A sortases, using Streptococcus pneumoniae SrtA (spSrtA) as a model system. The β7–β8 loop was initially identified after sequence and evolutionary conservation analyses as a region of notable variability in class A (and other) sortases. We find that the β7–β8 loop sequence dramatically affects both overall enzyme activity and selectivity at P1′ of the CWSS. Our data are consistent with a recent publication that investigated the grafting of β7–β8 loop sequences from saSrtA and Bacillus anthracis SrtA (baSrtA) into Streptococcus pyogenes SrtA (18). This work also suggested that W194 (saSrtA numbering) may play a role in the substrate recognition of the reported chimeras (21). Here, we have profiled the substrate preferences of over a dozen loop chimeras and single- or double-mutants targeting the β7–β8 loop. While we also observe a role for W194 in substrate recognition, our data suggest that it is unique to saSrtA and not broadly applicable to describe β7–β8 loop–mediated class A sortase function. Indeed, the combination of functional enzyme assays and analysis of reported sortase structures in the present work suggests three different β7–β8 loop–mediated interactions that affect selectivity and activity.
Results
Sequence analyses of bacterial sortases
To investigate general sequence variation in class A sortases, we first created a multiple sequence alignment of the eight SrtA sequences used in our previous work, proteins from: B. anthracis, Enterococcus faecalis, Lactococcus lactis, Listeria monocytogenes, S. aureus, Streptococcus oralis, S. pneumoniae, and Streptococcus suis (Fig. 2A) (22). Although this is a small subset of SrtA sequences, we reasoned that this representative group would reveal general sequence trends or sequence variations as these enzymes had exhibited clear differences in substrate preferences in our previous study (19). Indeed, variations were present in loop regions, specifically those of the β2–β3, β4–β5, β6–β7, and β7–β8 loops (Fig. 2A).
Figure 2.
Sequence analysis and evolutionary conservation of class A sortases. A, multiple sequence alignment of previously studied class A sortase sequences (22). Alignment was performed using T-Coffee, and the figure was prepared using BoxShade. Black boxes indicate loop regions in saSrtA, as labeled. The black box for the β7–β8 loop cuts off the final five residues of the saSrtA loop (GVWEK) because the alignment is not correct in this region. B, the ConSurf server was used to investigate evolutionary conservation, using saSrtA (PDB ID: 2KID) as a template and a multiple sequence alignment of 400 class A sortase sequences. The results are shown on 2KID in surface representation (key below) and the LPAT∗ peptidomimetic is in black sticks and colored by heteroatom. Conservation is highest in the peptide-binding cleft and on the backside of the active site, where lipid II is hypothesized to bind. C, cartoon representation of the ConSurf results as described in panel B, with the side chain sticks of the β4–β5, β6–β7, and β7–β8 loops shown. SaSrtA binds calcium, which is shown as a black sphere and labeled. saSrtA, sortase A from Staphylococcus aureus.
In addition to our sequence analysis, we also wanted to look more broadly at global evolutionary conservation in class A sortases. We extracted 400 SrtA sequences from the NCBI and used MAFFT to create a multiple sequence alignment (23). To visualize evolutionary conservation, we used the ConSurf server, with saSrtA (PDB ID: 2KID) as our structural template (Fig. 2B) (24, 25). This analysis confirmed that class A sortase sequences are quite variable, with very few residues showing a high degree of conservation (dark maroon color). However, validating our analysis, the peptide-binding cleft (occupied by the peptidomimetic LPAT∗ in 2KID) is very well conserved, as is the presumed endogenous lipid II–binding site on the backside of the enzyme (Fig. 2B) (2, 26).
As suggested by our initial multiple sequence alignment (Fig. 2A), ConSurf confirmed that the three structural loops that border the peptide binding cleft, the β4–β5, β6–β7, and β7–β8 loops, are all quite variable (Fig. 2C). Any conservation in these loops that we do see may also be an artifact of the multiple sequence alignment because of variable lengths; for example, E195 saSrtA in the β7–β8 loop appears to be highly conserved, but this is because of the fact that out of the 400 sequences used in the alignment, only 10 have any residue at this position (9 of which are Glu) and all are of Staphylococcus sequences.
Given that the β6–β7 loop has been shown to be intimately involved in sortase substrate recognition, we were intrigued that our analysis revealed similar levels of variability in the β4–β5 and β7–β8 loops (18). In the case of β7–β8, we were also motivated by its proximity to the enzyme active site and CWSS P1′ position (15). Therefore, we sought to further explore how the β7–β8 loop affects the activity and substrate specificity of a sortase with narrow substrate tolerance (saSrtA) versus one that is more promiscuous (spSrtA).
Loop-swapped β7–β8 variants reveal differences in position P1′ selectivity for S. aureus and S. pneumoniae SrtA enzymes
In our previous work, we found that the most striking differences in substrate tolerances among the class A sortases studied were observed at the P1′ (LPXTG) of the substrate motif (19). For example, while saSrtA is specific for a Gly residue at P1′, SrtA from S. pneumoniae (spSrtA) recognizes over 10 of the 20 amino acids at this position in a 24-h end-point assay (22). To determine whether the β7–β8 loop played a role in these differing substrate preferences, we began by engineering two loop-swapped variants: saSrtApneumoniae (which contains the β7–β8 loop residues from spSrtA (CEDLAATER, where the catalytic cysteine and arginine are underlined)) and spSrtAaureus (with β7–β8 residues CDDYNEKTGVWEKR from saSrtA). Notably, the length of the saSrtA β7–β8 loop contains an additional five residues, as compared with the spSrtA β7–β8 loop. The saSrtA loop also uniquely contains W194, which is predicted to directly contact the P1 threonine of the LPXTG motif (19). In addition, while both loops are predicted to have an overall net negative charge at the physiological pH, the saSrtA loop contains two positively charged lysine residues that are not present in spSrtA. Both chimeric sortases were expressed and purified from Escherichia coli and were isolated as soluble, monomeric enzymes as described previously and in Experimental procedures (Fig. S1) (22). Based on the migration of these variants using size-exclusion chromatography (SEC), the variants are not aggregated and retain a similar radius of gyration and oligomeric status (Fig. S1).
To monitor enzymatic activity and selectivity of the saSrtApneumoniae, spSrtAaureus, and their WT counterpart proteins, we utilized well-established FRET quencher probes consisting of different substrate motifs flanked by a 2-aminobenzoyl fluorophore (Abz) and a 2,4-dinitrophenyl quencher (Dnp) (20, 27, 28). Probes containing three substrate variants were initially prepared (Abz-LPATAG-K(Dnp), Abz-LPATGG-K(Dnp), Abz-LPATSG-K(Dnp), varying only at P1′ in bold) and used to test the relative activity of our WT and chimeric enzymes. For simplicity, we have hereafter omitted the Abz, K(Dnp), and C-terminal glycine from peptide descriptions. For comparing enzyme activity, a standard 2-h reaction time was utilized and an excess of H2NOH was included to resolve the acyl enzyme intermediates. For consistency, all reactions were also conducted in the presence of Ca2+, which is a required cofactor for saSrtA. A reaction end point (indicated by the increase in Abz fluorescence) for all enzyme/substrate pairings was then expressed relative to averaged benchmark reactions of WT saSrtA with the standard LPATG substrate (Figs. 3A and S2A). This benchmark reaction was consistently found to give ∼84% conversion to the expected transacylation products when independently monitored via RP-HPLC (Fig. S2B).
Figure 3.
Interchanging β7–β8 loops in class A sortases modulates substrate selectivity and activity for target sequences that vary at position P1′ of the canonical LPXTG motif. A, comparison of substrate selectivity for WT saSrtA and spSrtA proteins, as well as β7–β8 loop chimeras saSrtApneumoniae, spSrtAaureus, and W194A spSrtAaureus. Substrate cleavage was monitored via an increase in fluorescence at 420 nm from reactions of the fluorophore-quencher probes Abz-LPATGG-K(Dnp), Abz-LPATAG-K(Dnp), and Abz-LPATSG-K(Dnp) (represented as LPATG, LPATA, and LPATS) in the presence of excess hydroxylamine. Bar graphs represent the mean normalized fluorescence (± SD) from at least three independent experiments at the 2-h reaction timepoint, as compared with saSrtA and the peptide LPATG. B, representative HPLC chromatogram for the reaction of Abz-LPATGG-K(Dnp) and H2NOH in the presence of spSrtAaureus. This reaction was conducted in the presence of Ca2+. Selective cleavage between the threonine (T) and glycine (G) residues was observed, with an overall conversion of 80% (∗ = Abz-LPAT-NHOH reaction product. Low peak intensity is due to the weak absorbance of Abz at 360 nm). Additional HPLC data for select substrate/enzyme pairings are provided in Figure S2, B–F. saSrtA, sortase A from Staphylococcus aureus; spSrtAmonocytogenes, β7–β8 loop residues from Listeria monocytogenes; Abz, 2-aminobenzoyl fluorophore; Dnp, 2,4-dinitrophenyl quencher.
Based on our previous results, we predicted that spSrtA would show activity for all three peptides, while saSrtA would be selective for LPATG (22). Consistent with this prediction, our results confirmed that spSrtA was equally capable of processing all three substrates, whereas saSrtA was restricted to LPATG (Fig. 3A). Our assay also revealed a marked reduction in spSrtA activity versus saSrtA, which was not captured in our previous study, likely because of the extended reaction time (24 h) used in that work (22). With respect to the chimeric enzymes, our results clearly showed that the sequence of the β7–β8 loop was a major determinant of activity and specificity. Specifically, the saSrtApneumoniae protein was completely inactive while spSrtAaureus functionally mimicked the narrow substrate preference of the WT saSrtA enzyme (Fig. 3A). This result is consistent with recently published data (21). To verify that our sortases were cleaving substrates at the expected site, reactions exhibiting a normalized fluorescence value of 0.2 or higher were independently monitored by RP-HPLC and LC-MS, which confirmed cleavage between P1 and P1′ (Figs. 3B and S2, B–F, Table S1). Notably, reactions for HPLC and LC-MS characterization were conducted in the presence and absence of Ca2+, which demonstrated that this cofactor was not required for the activity of spSrtA and spSrtAaureus.
Continuing on with the SpSrtAaureus chimera, we next wanted to determine if the Trp residue derived from the saSrtA loop played a significant role in enzyme activity. In WT saSrtA, the W194 residue (using saSrtA numbering) is known to affect enzyme activity of saSrtA and has been shown to interact with the threonine of the LPXTG motif (Fig. S3) (19, 20). We therefore expressed and purified the corresponding “W194A” mutant of spSrtAaureus and tested this variant with A-, G-, and S-containing peptides in our assay. This mutation in saSrtA has previously been characterized with respect to enzymatic activity but was not previously investigated with respect to possible effects on P1′ selectivity (20). Indeed, our W194A spSrtAaureus protein exhibited a 17% reduction in reaction progress for LPATG, while retaining its preference for Gly-containing peptides (Fig. 3A). However, W194A spSrtAaureus also revealed activity for the A- and S-containing peptides suggesting that the Trp residue acts as a selectivity filter (Fig. 3A).
Variability in position P1′ selectivity and transpeptidase activity in S. pneumoniae SrtA β7–β8 variants
In addition to the profound shift in substrate scope observed for spSrtAaureus, we were also intrigued that the overall reactivity of this chimera for LPATG was comparable with that seen with WT saSrtA. This stood in sharp contrast to the reaction of LPATG with WT spSrtA, where reaction progress was nearly two-thirds lower within the 2-h reaction time of our assay (Fig. 3A). Based on this, we wondered if similar gains in reactivity for substrates other than LPATG could be achieved by substituting in residues from additional SrtA proteins (22, 27). To test this, we created an additional six spSrtA variants containing loop residues from SrtA proteins that we had evaluated previously (22). These chimeras included the β7–β8 loop residues from B. anthracis, E. faecalis (spSrtAfaecalis), L. lactis (spSrtAlactis), L. monocytogenes (spSrtAmonocytogenes), S. oralis (spSrtAoralis), and S. suis (spSrtAsuis) (Fig. 4A) (22). To avoid confusion in the numbering of loops with variable lengths, we will hereafter refer to the N-terminal positions of each β7–β8 loop by numbering with respect to the catalytic Cys (β7–β8+1, β7–β8+2, etc.) that precedes the loop, whereas the C-terminal loop residue will be numbered relative to the catalytic Arg (β7–β8−1) (Fig. 4A).
Figure 4.
The sequence of the β7–β8 loop dramatically affects selectivity and activity for spSrtA. A, the β7–β8 loop sequences of the chimeric proteins used are listed, with representative numbering for residues in the β7–β8 loop labeled for spSrtA and spSrtAaureus. B and C, substrate selectivity profiles for WT spSrtA (B) and chimeric spSrtA variants (C). Substrate cleavage monitored via an increase in fluorescence at 420 nm from reactions of fluorophore-quencher probes with the generic structure Abz-LPATXG-K(Dnp) (LPATX) in the presence of hydroxylamine. Bar graphs represent the mean normalized fluorescence (±SD) from at least three independent experiments. spSrtAfaecalis, spSrtA with the β7-β8 loop residues from Enterococcus faecalis; spSrtAlactis, spSrtA with the β7-β8 loop residues from Lactococcus lactis; spSrtAoralis, spSrtA with the β7-β8 loop residues from Streptococcus oralis; spSrtAsuis, spSrtA with the β7-β8 loop residues from Streptococcus suis; Abz, 2-aminobenzoyl fluorophore; Dnp, 2,4-dinitrophenyl quencher.
The six chimeric proteins were expressed and purified using the same protocol as spSrtAaureus, and as described in Experimental procedures. The purity of all proteins was validated by SDS-PAGE, and SEC was consistent with the isolated proteins being predominantly monomeric (Fig. S1). With the new chimeras in hand, we conducted an initial evaluation of relative activity using the LPATG, LPATA, and LPATS substrates described above. While the majority of constructs exhibited significant reactivity across all three substrates, spSrtAanthracis and spSrtAmonocytogenes, containing the β7–β8 loop residues from B. anthracis and L. monocytogenes, respectively, proved to be mostly inactive (Fig. S4A). For the remaining enzymes, the spSrtAoralis protein behaved similarly to WT spSrtA, while spSrtAfaecalis, spSrtAlactis, and spSrtAsuis showed improved performance for A-, G-, and S-containing substrates (Fig. 4, B and C and S4A). This was particularly interesting in the case of spSrtAfaecalis, given that the WT SrtA enzyme from E. faecalis was previously shown to have poor reactivity for the same test substrates despite the use of higher enzyme loading and considerably longer reaction times (22).
Based on initial experiments with the A-, G-, and S-containing peptides, we next wanted to expand our peptide pool to assess the relative reactivity of our active chimeric spSrtA variants for peptides containing all 20 amino acids at P1′. For comparison, a similar substrate profile was generated for WT spSrtA. As shown in Figure 4B, within the 2-h time frame of our assay, the WT protein was rather selective in its substrate recognition, with reactivity limited to A-, G-, and S-containing peptides. We note here that this somewhat limited substrate scope that appears to differ from the more promiscuous behavior reported previously for spSrtA. We attribute this to the fact that longer reaction times (24 h) and higher enzyme loadings (5-fold higher than the loading used here) were utilized in this earlier work (22). Similar to WT spSrtA, the spSrtAoralis was limited to A-, G-, and S-containing peptides, albeit with slightly elevated reactivity in the case of LPATA and LPATG. Finally, we were intrigued to find that our spSrtAfaecalis, spSrtAlactis, and spSrtAsuis proteins all show increased promiscuity for a variety of amino acids at P1′ in our assay (Fig. 4C).
Overall, the spSrtAfaecalis, spSrtAlactis, and spSrtAsuis proteins showed the largest increase in activity and promiscuity for this library of peptides. The spSrtAfaecalis and spSrtAlactis proteins each recognized 15 of the 20 amino acids at P1′ with normalized fluorescence values of ≥0.05, while spSrtAsuis recognized 14 of the 20 (Fig. 4C). We chose 0.05 as a cut-off value to compare with the peptide activities of the spSrtA protein, which shows normalized fluorescence values of -0.02 to 0.02 for all non-G-, S-, or A-containing peptides, with the exception of LPATC (at 0.04 ± 0.01). Furthermore, spSrtAfaecalis and spSrtAsuis exhibited ∼3-fold higher reaction progress for the G-, S-, and A-containing peptides than spSrtA.
As verification of the results of our fluorescence assay, we also characterized a subset of enzyme/substrate combinations using RP-HPLC and LC-MS. Focusing on spSrtAfaecalis, we repeated reactions that exhibited normalized fluorescence values of ≥0.1 (LPATX, X = A, F, G, I, L, M, N, S, V, W, Y) (Fig. S4B). Reactions were conducted in the absence of Ca2+ to confirm that this cofactor was not required for activity. Successful substrate cleavage was observed in all cases, ranging from a high of 78% conversion in the case of LPATG, to only 6% conversion in the case of LPATW over 2 h at room temperature (RT) (Fig. S4B). Notably, the trends in relative substrate preferences observed by HPLC were consistent with those found in our original fluorescence assay (Fig. S5). In addition, while LC-MS characterization confirmed that substrate cleavage was occurring between the P1 and P1′ of all sequences, certain substrates (LPATX, X = W,F,L,M,Y) containing bulky hydrophobic residues also produced alternate products arising from cleavage on the C-terminal side of P1′ (Table S1, Fig. S4). In the case of LPATL, this alternate cleavage product was actually the major species obtained after reaction with spSrtAfaecalis. We note here that this capacity for alternate cleavage has been reported previously for WT spSrtA and thus appears to be maintained in the spSrtAfaecalis chimera (22).
Variability in position P1′ selectivity and ligase activity in S. pneumoniae SrtA β7–β8 variants
As a final assessment of the reactivity of the spSrtAfaecalis chimera, we next evaluated its ability to ligate amino acid nucleophiles in place of the H2NOH that was utilized in our fluorescence assay. For a series of test substrates (LPATX, X = A,S,V), spSrtAfaecalis was able to successfully ligate the corresponding free amino acid carboxamides (X-NH2 = A-NH2, S-NH2, V-NH2) with very good efficiency (Fig. 5). As expected from our fluorescence assay results, reaction progress with LPATV was slower than that observed for LPATA and LPATS. Specifically, reactions with LPATV required 8 h at RT to consume 85% of the initial peptide substrate, whereas reactions with LPATA/S exhibited >95% substrate conversion within 3 h. Importantly, the desired LPATX-NH2 species was the major ligation product in all reactions as determined by LC-MS (Fig. 5, Table S1). Trace levels of substrate hydrolysis were also observed via LC-MS; however, the ratio of successful ligation to hydrolysis was 15:1 or better as estimated from mass spectral peak intensities. In reactions involving LPATV, we also detected low levels of substrate cleavage on the C-terminal side of the P1′ valine residue. The extent of this alternate cleavage pathway was minimal, accounting for only ∼4% of the substrate cleavage events based on comparisons of HPLC peak areas for G-K(Dnp) and VG-K(Dnp) (Fig. 5). Interestingly, LC-MS characterization of these same reactions involving LPATV, V-NH2, and spSrtAfaecalis failed to show clear evidence for the formation of ligation or hydrolysis products derived from the alternate cleavage pathway, potentially because of their low levels in solution.
Figure 5.
spSrtAfaecalisoutperforms WT spSrtA in model amino acid ligation reactions. HPLC chromatograms (320 nm) for model ligations between Abz-LPATXG-K(Dnp) and excess X-NH2 nucleophiles catalyzed by spSrtAfaecalis (black curves) or WT spSrtA (blue curves). Ligations were conducted in the absence of Ca2+. Chromatograms for LPATA/S represent the 3-h reaction timepoint, and chromatograms for LPATV correspond to the 8-h timepoint. All peak identities were confirmed via LC-MS (Table S1). spSrtAfaecalis, spSrtA with the β7–β8 loop residues from Enterococcus faecalis; Abz, 2-aminobenzoyl fluorophore; Dnp, 2,4-dinitrophenyl quencher.
For comparison, we also performed the same set of test ligations with WT spSrtA. In all cases, reaction progress was significantly reduced as compared with that of spSrtAfaecalis (Fig. 5). In particular, spSrtA exhibited minimal product formation with the LPATV system, representing a 10-fold reduction in reaction progress relative to spSrtAfaecalis for this atypical sortase substrate motif. Building from this result, an initial attempt to utilize the LPATV sequence as a handle for site-specific protein modification was made by installing this motif at the C-terminus of a full-size protein target. However, this protein substrate proved to be unreactive in the presence of both spSrtAfaecalis and WT spSrtA (data not shown).
Mutagenic investigation of the contribution of β7–β8 loop residues
To dissect the contribution of each residue in the β7–β8 loop of spSrtA, we made a series of alanine mutations. Specifically, we mutated all non-Ala residues to alanine to produce the following spSrtA variants: E208A, D209A, L210A, T213A, and E214A. Analytical SEC of the final protein preparations suggested that these mutants were predominantly monomeric (Fig. S6). When tested using our FRET-based activity assay, we did not see major effects on P1′ selectivity in these mutants for G-, S-, and A-containing peptides; however, we did observe striking effects on overall reactivity (Fig. 6A). Specifically, a 2-fold increase in reaction progress was observed for the E208A and E214A mutations, suggesting that the native glutamic acid residues in spSrtA have a negative effect on its activity. The L210A and T213A mutations resulted in ∼50% reduction in reaction conversion, and the D209A spSrtA protein was entirely unreactive (Fig. 6A).
Figure 6.
Residues in the β7–β8 loop participate in interactions that affect enzyme activity and selectivity, based on structural analyses and mutagenesis. A, enzyme assays on mutant spSrtA and spSrtAfaecalis variants (fluorophore-quencher substrate assay conditions identical to those described in the legend to Fig. 3). Bar graphs represent the mean normalized fluorescence values (± SD) from at least three independent experiments. B and C, all SrtA structures are in gray ribbons. The β7–β8 loop side chains are all in stick representation and colored by heteroatom. Residues that participate in hydrogen bonds have non–gray carbons and are labeled. The hydrogen bonds are shown with a black dashed line, with measurements indicated. For NMR structures (PDB IDs 2KID and 2RUI), the state that was used for the image is labeled. Although not all NMR states contained the interaction indicated, in both cases, there were several states that revealed measurements consistent with a noncovalent interaction. Any side-chain sticks are colored by heteroatom (O = red, N = blue). B, β7–β8 intraloop hydrogen bond. The structures in this figure are as follows: S. aureus SrtA (2KID, blue carbons), S. pyogenes SrtA (3FN5, cyan carbons), S. mutans SrtA (4TQX, pink carbons), and L. monocytogenes SrtA (5HU4, orange carbons). C, interaction between the β7–β8 loop and β6 strand. The structures in this figure are as follows: S. aureus SrtA (PDB ID 2KID, blue carbons), B. anthracis SrtA (2RUI, dark pink carbons), S. pneumoniae SrtA (4O8L, gray carbons), and A. oris SrtA (5UTT, gold carbons). spSrtAfaecalis, spSrtA with the β7–β8 loop residues from Enterococcus faecalis.
Stereochemical basis of β7–β8 variant selectivity and activity—(1) A stabilizing intraloop hydrogen bond
To gain a stereochemical understanding of our biochemical results, we analyzed available structures of class A sortases in the Protein Data Bank. To our knowledge, the 3D structure of an active, monomeric form of spSrtA has yet to be reported. Available crystal structures of the domain-swapped dimer show that the β7–β8 loop is located at, and participates in, the dimer interface (PDB codes 4O8L, 4O8T, and 5DV0). Therefore, we chose to broaden our search to non-spSrtA structures, and in doing so, we identified three putative β7–β8 loop-mediated interactions in class A sortases, which will be described in the following three sections.
First, we observed an intra-loop hydrogen bond in the β7–β8 loops of several class A sortases (Fig. 6B). This Thr-mediated hydrogen bond is evident in SrtA proteins from S. aureus (PDB code 2KID), S. pyogenes (3FN5), Streptococcus mutans (4TQX), and L. monocytogenes SrtA (lmSrtA) (5HU4), among others not shown (Fig. 6B). Notably, the S. pyogenes SrtA protein (3FN5) includes multiple rotamers in one protomer at the Thr position (T214), and the hydrogen bond with D210 is conserved in both (Fig. S7A). We predict that in spSrtA, this hydrogen bond will be formed between the side chains of residues β7–β8+2 D209 and β7–β8+6 T213. Indeed, we see this interaction in a homology model of the monomeric spSrtA protein generated using SWISS-MODEL (Fig. S7B).
For this model, the S. pyogenes SrtA structure (PDB code 3FN5) was used as a template because the crystallized form of this enzyme (S. pyogenes SrtA residues S81-T249) has 63% sequence identity with spSrtA (29, 30, 31). An alignment of our spSrtA model with 3FN5 revealed an overall RMSD of 0.083 Å over 567 main chain atoms. We further validated our model using structural alignments with a monomer extracted from the domain-swapped dimer structure (RMSD of 0.603 Å over 483 main chain atoms), as well as other SrtA structures from Streptococcus species, including those from Streptococcus agalactiae and S. mutans (PDB codes 3RCC (RMSD of 0.773 Å over 384 main chain atoms) and 4TQX (RMSD of 0.456 Å over 530 main chain atoms)), respectively (Fig. S7, C and D).
Our mutagenesis results are consistent with this proposed interaction. Specifically, the T213A mutation, which disrupts a potential intraloop hydrogen bond between the β7–β8+2 D209 and β7–β8+6 T213, reduced spSrtA activity by 54 to 73% for G-, S-, and A-containing peptides (Fig. 6A). When we attempted to reverse the hydrogen bond geometry with a D209T/T213D double mutant, the resulting enzyme exhibited only trace reactivity (Fig. 6A). Taken together, our findings are consistent with at least one stabilizing intraloop hydrogen bond being a generally conserved and functionally relevant feature of β7–β8 SrtA loops. It should be noted, however, that the exact nature and location of this interaction likely varies. For example, in Actinomyces oris SrtA, this hydrogen bond is observed between β7–β8+3 D227 and β7–β8+11 S219 (Fig. S7E).
Stereochemical basis of β7–β8 variant selectivity and activity—(2) A noncovalent interaction between the β6−2 residue and β7–β8 loop
In addition to an intraloop hydrogen bond, we observe interactions between the β7–β8 loop and β6−2 residues in multiple class A sortase structures (Fig. 6C). For example, K196 (β7–β8−1) of saSrtA interacts with the β6−2 D160 in several of the states of the reported NMR structure, PDB code: 2KID (Fig. 6C) (19). We also see a reasonable electrostatic interaction distance for the β7–β8−1 and β6−2 residues (K195 and E165, respectively) in several of the NMR states for baSrtA (PDB code 2RUI), as well as in the domain-swapped dimer structure of spSrtA (PDB code: 4O8L), which shows the E214 β7–β8 loop residue of one protomer interacting with R184 (Fig. 6C). Interestingly, the nature of this interaction can change, as in the case of Actinomyces oris SrtA, where both residues are hydrophobic leucine residues (L186 and L228 for the β6−2 and β7–β8−1 residues, respectively) (Fig. 6C).
Our results suggest that this β7–β8 loop/β6−2 interaction has a negative effect on sortase activity. This is supported by the >2-fold increase in substrate conversion for both the β7–β8−1 E214A and β7–β8+1 E208A spSrtA mutants as compared with WT spSrtA (Fig. 6A). While the reported domain-swapped dimer structure of spSrtA does not exhibit an obvious interaction between β7–β8+1 (E208) and β6−2 (R184), we note our spSrtA homology model does suggest this interaction is likely, with a distance between a guanidinium nitrogen atom of R184 and a side-chain carboxylate oxygen atom on E208 equal to 2.7 Å (Fig. S7B). As further evidence, given that this β7–β8+1 position is a glycine in the more active spSrtAfaecalis chimera, we expressed, purified, and tested the activity of two contrasting mutants at this site: E208G spSrtA (which would eliminate the putative β7–β8+1/β6−2 interaction) and G145E spSrtAfaecalis (E. faecalis SrtA numbering, which would restore the putative β7–β8+1–β6−2 interaction). Our results are consistent with our predictions, and we saw a 21 to 49% reduction in substrate conversion for the reaction of G-, S-, and A-containing peptides with G145E spSrtAfaecalis as compared with the initial spSrtAfaecalis chimera (Fig. 6A). In contrast, a >2-fold increase in reaction progress was observed for E208G spSrtA relative to WT spSrtA (Fig. 6A).
Overall, our data are consistent with recent work where a triple mutant of S. pyogenes SrtA (E189H/V206I/E215A, where E215A is a mutation at the β7–β8−1 position) resulted in 6.6-fold enhanced catalytic efficiency (32). In addition, K196T in the catalytically enhanced pentamutant saSrtA protein is also located at the β7–β8−1 position (8). Taken together, the data support that a noncovalent interaction between the β7–β8 loop and β6 strand negatively affects SrtA activity.
Stereochemical basis of β7–β8 variant selectivity and activity—(3) An interaction between the β7–β8 and β4–β5 loops
NMR structures of saSrtA and baSrtA proteins in the unbound and bound states (PDB codes: 1IJA, 2KID, 2KW8, and 2RUI) suggest distinct mechanisms of substrate binding (Fig. S8). In saSrtA, the β7–β8 loop is ordered in both states but moves upon ligand binding (Fig. S8A). In contrast, a unique N-terminal appendage in baSrtA regulates active site accessibility, as previously described, and the β7–β8 loop transitions from a disordered-to-ordered state upon substrate binding (Fig. S8B) (33). In both proteins, however, binding of ligand corresponds to a shift in the position of the β7–β8 loop such that it is located much closer to the β4–β5 loop (Fig. 7).
Figure 7.
Interaction between the β7–β8 and β4–β5 loops. All SrtA structures are in gray ribbons. The β7–β8 loop side chains are all in stick representation and colored by heteroatom. The structures in this figure are as follows: S. aureus (unbound: 1IJA and bound: 2KID, blue carbons), B. anthracis (unbound: 2KW8 and bound: 2RUI, dark pink carbons), L. monocytogenes (5HU4, orange carbons), and Streptococcus pneumoniae (homology model (Fig. S7), green carbons). The arrows indicate residue movement from the unbound to bound structures.
In analyses of the previously published baSrtA structures, the authors of this work mention that the β4–β5+2/β4–β5+3 positions (for baSrtA, these are β4–β5+2 M128 and β4–β5+3 S129) play a role in stabilizing a hydrophobic residue, the β7–β8+3 V190, upon ligand binding (Fig. 7) (33). The corresponding residues in saSrtA are F64, I65, and Y129, respectively, which would also enable a favorable hydrophobic interaction (Fig. 7). The lmSrtA structure (PDB ID: 5HU4) also shows a similar interaction: β4–β5+2 M129 and β4–β5+3 R130 with P191 in the β7–β8+3 position (note: M129 is the full-length protein numbering according to UniProt ID SRTA_LISMO, but is M58 in 5HU4, R130 is R59, and P191 is P120), in the unbound state (Fig. 7). Finally, our spSrtA model suggests a potential interaction between the β4–β5+2 I63 and β4–β5+3 F64 residues with β7–β8+3 L210 (Fig. 7).
In agreement with this proposed interaction between the β7–β8 and β4–β5 loops, we observed that our L210A spSrtA mutant reduced the activity of the protein by 46%, 59%, and 77% for the A-, G-, and S-containing peptides, respectively (Fig. 6A). While the alanine methyl side chain retains the hydrophobic character of the WT leucine, we speculate that its reduced size is insufficient to space the gap between the β7–β8 and β4–β5 loops, and thus this critical interaction is disrupted. Along these lines, we expressed, purified, and tested an L210P spSrtA mutant, which substitutes L210 with the proline residue found in spSrtAmonocytogenes, a chimeric protein which was essentially inactive in our hands (Fig. S4A). In our substrate assay, this mutant produced results identical to those of L210A, further supporting the importance of a specific interaction between the β7–β8+3 residue and β4–β5 loop for sortase activity (Fig. 6A).
Discussion
Although target sequence recognition by S. aureus SrtA is rigidly selective for a P1′ glycine, this is not true of all class A sortases, such as those from S. pneumoniae and S. pyogenes (10, 14, 29, 34). Building from our previous work, in which spSrtA was found to accept peptides containing Gly, Ala, and Ser and other residues at P1′, we have shown here that this broadened substrate scope can be attributed to the sequence of the β7–β8 loop (26). Moreover, variations in β7–β8 loop sequences can substantially impact the overall enzyme activity, affording chimeric sortases that outperform their WT counterpart in vitro. Together with others, the present study implicates all of the variable loops in class A sortases as being important determinants of enzyme function (12, 18, 21, 33).
With respect to structure, we propose three interactions that are facilitated by residues in the β7–β8 loop of spSrtA. They are as follows: (1) an intraloop hydrogen bond that positively affects catalytic efficiency, typically mediated by a threonine residue at the β7–β8+6 or β7–β8+7 position, (2) an interaction that hinders enzyme activity between the β7–β8 loop and β6−2 residues, and (3) a positive interaction between the β7–β8+3 and β4–β5+2/β4–β5+3 residues, typically of hydrophobic nature. Notably, there appear to be other features in this structurally conserved loop that are unique to certain class A sortases. These include the W194 residue of saSrtA, which specifically interacts with the P1 position of the CWSS and likely acts as a selectivity filter, based on our W194A spSrtAaureus data (Fig. 3A) (19). Others identified a disordered-to-ordered transition of the baSrtA β7–β8 loop, as well as regulation by an N-terminal appendage, although more research is needed to determine whether or not this is shared by other class A sortases (33). Furthermore, while the spSrtA β7–β8 loop is seven residues in length, several class A sortase loops, for example, those of saSrtA, baSrtA, and lmSrtA studied here, are longer than seven residues. Stabilizing interactions mediated by backbone atoms likely vary in loops of differing lengths, a characteristic that was not studied in detail here. Future work is also needed to assess how the loop length influences the identified interactions described here in the context of sortase A enzymes, as well as other position-specific interactions found in other sortase classes.
In addition to informing our fundamental understanding of sortase substrate recognition, this work also has implications for the further development of SML as a protein engineering tool (3, 35). Through exchange of β7–β8 loop residues between class A sortases, we have generated chimeras such as spSrtAfaecalis and spSrtAlactis, with measurable activity against peptides possessing 15 of the 20 amino acids at P1′. Notably, stability or minor changes in the folded structures between variants were not determined and these could manifest as some of the differences in enzyme function observed. With additional development, each of these sortase chimera/substrate combinations potentially offers a new handle for in vitro SML applications. While preliminary attempts here to modify a protein target displaying an LPATV sequence using spSrtAfaecalis were unsuccessful, we consider it likely that optimization of the placement of the LPATV site may restore reactivity. This includes examination of the accessibility requirements for the LPATV sequence and assessment of the impact of residues N- or C-terminal to the core LPATV motif. Similar factors are known to affect the success of SML reactions with the widely used saSrtA/LPXTG system (36, 37, 38) and may need to be evaluated for our chimeras.
If successful, the development of these new sortase/substrate pairs has exciting consequences for SML engineering efforts: (1) it increases options for dual-labeling single proteins or multiplexed labeling of multiple proteins in the same systems (11, 39, 40), and (2) it may reduce the need to mutate naturally occurring protein sequences to render their termini compatible with SML. For example, using our previously published program, MotifAnalyzer, we found 190 instances of LPXTG in 189 unique proteins in the human proteome. However, if the P1′ position is now flexible, this number becomes 3606 instances of LPXTX in 2930 unique proteins (41).
Finally, the three variable loops mentioned here (β4–β5, β6–β7, and β7–β8) are conserved in all classes of sortases (Fig. 1B), and previous work determining and engineering sortase selectivity of different classes, for example, sortase B, suggests similar roles for these loops in substrate recognition (5, 18, 42). Developing a deeper understanding of how residues in these loops affect substrate selectivity in all sortase classes may enable dramatic expansion of the sortase “toolbox” (Fig. 8), potentially allowing the development of ligases that are tailored to the needs of specific protein targets while also limiting off-target effects (5, 11, 12, 13, 14, 17, 22). In the over 20 years since saSrtA was discovered, the sortase superfamily has proven to be both a workhorse for protein engineering efforts and an exciting system for future discoveries and insight into the stereochemistry and mechanisms of target recognition.
Figure 8.
Building a sortase toolbox for SMLexperiments. Work from ourselves and others can be used to create a sortase “toolbox” for SML experiments, taking advantage of the various sequence motifs, both endogenous and engineered. Recognition sequences for various sortase subclasses are described in (5, 11, 12, 13, 14, 17, 22). SML, sortase-mediated ligation.
Experimental procedures
Protein expression and purification
WT spSrtA and saSrtA proteins were expressed and purified as previously described (22). All other constructs, including chimeric and mutant proteins, were purchased from GenScript in the pET28a(+) vector. In general, protein expression and purification protocols were very similar to those previously described (22). Briefly, plasmids were transformed into E. coli BL21 (DE3) competent cells and grown in LB media, with protein induction at A600 0.6 to 0.8 using 0.15 M IPTG for 18 to 20 h at 18 °C.
After cell harvest in the lysis buffer [0.05 M Tris, pH 7.5, 0.15 M NaCl, and 0.0005 M EDTA], the protein was purified using a 5-ml HisTrap HP column (GE Life Sciences, now Cytiva), using wash [0.05 M Tris, pH 7.5, 0.15 M NaCl, 0.02 M imidazole, pH 7.5, and 0.001 M TCEP] and elution [wash buffer, with 0.3 M imidazole, pH 7.5] buffers. SEC was conducted using a HiLoad 16/600 Superdex 75 column (GE Life Sciences, now Cytiva) in the SEC running buffer (0.05 M Tris, pH 7.5, 0.15 M NaCl, and 0.001 M TCEP). Purified protein corresponding to the monomeric peak was concentrated using an Amicon Ultra-15 Centrifugal Filter Unit (10,000 NWML) and analyzed by SDS-PAGE and analytical SEC (Figs. S1 and S6). Protein not immediately used was flash-frozen in the SEC running buffer and stored at −80 °C.
Peptide synthesis
Detailed synthetic procedures are provided in the supporting information. Briefly, all peptides were synthesized via manual Fmoc solid-phase peptide synthesis. Peptides were synthesized either individually or in tandem using Fmoc Rink amide MBHA resin or SynPhase lantern solid supports. All other materials, including suitably protected Fmoc amino acids, and reagents for coupling, deprotection, and resin cleavage were obtained from commercial sources and used without further purification. All peptides were purified using RP-HPLC, and their identities were confirmed via ESI-MS. Before use in sortase-catalyzed transacylation reactions, each purified peptide was prepared as a concentrated stock solution in DMSO and/or water (see supporting information for details).
Fluorescence assay for sortase activity
Reactions were performed in a Costar round-bottom, black, 96-well plate at a 100-μl reaction volume under the following conditions: 5 μM sortase, 50 μM peptide substrate, and 5 mM hydroxylamine nucleophile. All reactions contained 10% (v/v) 10× sortase reaction buffer (500 mM Tris, pH 7.5, 1500 mM NaCl, and 100 mM CaCl2). Reactions also contained residual DMSO from the peptide stock solutions (0.5–1.5% (v/v), with the exception of the Phe- and Val-containing peptides at 5%). The peptides containing phenylalanine or valine required 5% (v/v) DMSO for solubility under the reaction conditions. 1 mM TCEP was also included in reactions utilizing the Abz-LPATCG-K(Dnp) substrate. Reactions were initiated by the addition of the sortase enzyme, which were prepared as 10× stock solutions in 50 mM Tris, pH 7.5, 150 mM NaCl, and 1 mM TCEP. Microplates were analyzed using a BioTek Synergy H1 plate reader. The fluorescence intensity of each well was measured at 2-min time intervals over a 2-h period at RT (λex = 320 nm, λem = 420 nm, and detector gain = 75). All reactions were performed at least in triplicate, and all of the raw fluorescence data utilized in this study are provided in Table S2. For each substrate sequence, the background fluorescence of the intact peptide in the absence of enzyme was subtracted from the observed experimental data. Background-corrected fluorescence data were then normalized to the fluorescence intensity of a benchmark reaction between WT saSrtA and Abz-LPATGG-K(Dnp) (Fig. S2A).
HPLC and LC-MS characterization of sortase-catalyzed reactions
Select pairings of sortase enzyme (5 μM or 10 μM for the X-NH2 reactions), substrate (50 μM), and nucleophile (5 mM H2NOH or X-NH2) were repeated in the presence or absence of Ca2+ under reaction conditions that were otherwise identical to those described above for the fluorescence assay. These reactions were then analyzed using a Dionex UltiMate 3000 HPLC system interfaced with an Advion CMS expression mass spectrometer. Separations were achieved with a Phenomenex Kinetex 2.6 μM C18 100 Å column (100 × 2.1 mm) (aqueous [95% water, 5% MeCN, 0.1% formic acid]/MeCN [0.1% formic acid] mobile phase at 0.3 ml/min, method: hold 10% MeCN for 0.0–0.5 min, linear gradient of 10–90% MeCN for 0.5–7.0 min, hold 90% MeCN for 7.0–8.0 min, linear gradient of 90–10% MeCN for 8.0–8.1 min, re-equilibrate at 10% MeCN for 8.1–13.25 min]).
Sequence and structural analyses
All sequences were downloaded from either the NCBI database or UniProt, as indicated (43, 44, 45). Sequence alignments were performed using MAFFT, T-Coffee, or BlastP (23, 46, 47). Visualization of our T-Coffee alignment was done using BoxShade. ConSurf analyses were performed using the online server, with 2KID as a template and insertion or our own multiple sequence alignment performed using MAFFT (24, 25). Alignments were visualized using Jalview (48). Homology modeling was performed using the SWISS-MODEL web interface (30, 31). Structural analyses and figure rendering were done using PyMOL. Enzyme assay graphs were prepared using GraphPad Prism 9.1.2.
Data availability
All data are contained in the article and the supporting information.
Supporting information
This article contains supporting information (22, 49, 50).
Conflict of interests
The authors declare that they have no conflicts of interest with the contents of this article.
Acknowledgments
The authors would like to thank the other members of the Amacher and Antos laboratories for helpful discussions and assistance. Start-up funds from Western Washington University also contributed to this project.
Author contributions
I. M. P., S. A. S., J. D. V., K. L. H., J. M. A., and J. F. A. data curation; I. M. P., S. A. S., J. D. V., K. L. H., and J. M. A. formal analysis; I. M. P., J. D. V., and J. M. A. validation; I. M. P., S. A. S., J. D. V., D. A. J., M. G., K. J., J. E. S., H. M. K., K. L. H., J. M. A., and J. F. A. investigation; I. M. P., J. D. V., J. M. A., and J. F. A. methodology; I. M. P., S. A. S., J. D. V., D. A. J., M. G., K. J., J. E. S., H. M. K., K. L. H., J. M. A., and J. F. A. writing–review and editing; J. D. V., K. L. H., J. M. A., and J. F. A. visualization; K. L. H., J. M. A., and J. F. A. funding acquisition; J. M. A. and J. F. A. conceptualization; J. M. A. and J. F. A. resources; J. M. A. and J. F. A. supervision; J. M. A. and J. F. A. writing–original draft.
Funding and additional information
J. F. A. was funded by NSF CHE-1904711 and J. M. A. was funded by a Cottrell Scholar Award from the Research Corporation for Science Advancement. K. L. H. was supported by the NIAID, National Institutes of Health, Award Number F32AI145111. In addition, I. M. P. received an Elwha Undergraduate Summer Research Award and D. A. J. received a Joseph and Karen Morse Student Research in Chemistry Fellowship to fund summer research. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Edited by Joseph Jez
Footnotes
Present address for Jordan D. Valgardson: Department of Chemical Systems Biology, Stanford University, Palo Alto, CA, USA.
Contributor Information
John M. Antos, Email: antosj@wwu.edu.
Jeanine F. Amacher, Email: amachej@wwu.edu.
Supporting information
References
- 1.Spirig T., Weiner E.M., Clubb R.T. Sortase enzymes in Gram-positive bacteria. Mol. Microbiol. 2011;82:1044–1059. doi: 10.1111/j.1365-2958.2011.07887.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jacobitz A.W., Kattke M.D., Wereszczynski J., Clubb R.T. Sortase transpeptidases: Structural biology and catalytic mechanism. Adv. Protein Chem. Struct. Biol. 2017;109:223–264. doi: 10.1016/bs.apcsb.2017.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Antos J.M., Truttmann M.C., Ploegh H.L. Recent advances in sortase-catalyzed ligation methodology. Curr. Opin. Struct. Biol. 2016;38:111–118. doi: 10.1016/j.sbi.2016.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Di Girolamo S., Puorger C., Castiglione M., Vogel M., Gébleux R., Briendl M., Hell T., Beerli R.R., Grawunder U., Lipps G. Characterization of the housekeeping sortase from the human pathogen propionibacterium acnes: First investigation of a class F sortase. Biochem. J. 2019;476:665–682. doi: 10.1042/BCJ20180885. [DOI] [PubMed] [Google Scholar]
- 5.Bradshaw W.J., Davies A.H., Chambers C.J., Roberts A.K., Shone C.C., Acharya K.R. Molecular features of the sortase enzyme family. FEBS J. 2015;282:2097–2114. doi: 10.1111/febs.13288. [DOI] [PubMed] [Google Scholar]
- 6.Marraffini L.A., DeDent A.C., Schneewind O. Sortases and the art of anchoring proteins to the envelopes of gram-positive bacteria. Microbiol. Mol. Biol. Rev. 2006;70:192–221. doi: 10.1128/MMBR.70.1.192-221.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mazmanian S.K., Liu G., Ton-That H., Schneewind O. Staphylococcus aureus sortase, an enzyme that anchors surface proteins to the cell wall. Science. 1999;285:760–763. doi: 10.1126/science.285.5428.760. [DOI] [PubMed] [Google Scholar]
- 8.Chen I., Dorr B.M., Liu D.R. A general strategy for the evolution of bond-forming enzymes using yeast display. Proc. Natl. Acad. Sci. U. S. A. 2011;108:11399–11404. doi: 10.1073/pnas.1101046108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Freund C., Schwarzer D. Engineered sortases in peptide and protein chemistry. Chembiochem. 2021;22:1347–1356. doi: 10.1002/cbic.202000745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kruger R.G., Otvos B., Frankel B.A., Bentley M., Dostal P., McCafferty D.G. Analysis of the substrate specificity of the Staphylococcus aureus sortase transpeptidase SrtA. Biochemistry. 2004;43:1541–1551. doi: 10.1021/bi035920j. [DOI] [PubMed] [Google Scholar]
- 11.Dorr B.M., Ham H.O., An C., Chaikof E.L., Liu D.R. Reprogramming the specificity of sortase enzymes. Proc. Natl. Acad. Sci. U. S. A. 2014;111:13343–13348. doi: 10.1073/pnas.1411179111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Piotukh K., Geltinger B., Heinrich N., Gerth F., Beyermann M., Freund C., Schwarzer D. Directed evolution of sortase A mutants with altered substrate selectivity profiles. J. Am. Chem. Soc. 2011;133:17536–17539. doi: 10.1021/ja205630g. [DOI] [PubMed] [Google Scholar]
- 13.Podracky C.J., An C., DeSousa A., Dorr B.M., Walsh D.M., Liu D.R. Laboratory evolution of a sortase enzyme that modifies amyloid-β protein. Nat. Chem. Biol. 2021;17:317–325. doi: 10.1038/s41589-020-00706-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Antos J.M., Chew G.-L., Guimaraes C.P., Yoder N.C., Grotenbreg G.M., Popp M.W.-L., Ploegh H.L. Site-specific N- and C-terminal labeling of a single polypeptide using sortases of different specificity. J. Am. Chem. Soc. 2009;131:10800–10801. doi: 10.1021/ja902681k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Clancy K.W., Melvin J.A., McCafferty D.G. Sortase transpeptidases: Insights into mechanism, substrate specificity, and inhibition. Biopolymers. 2010;94:385–396. doi: 10.1002/bip.21472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bentley M.L., Lamb E.C., McCafferty D.G. Mutagenesis studies of substrate recognition and catalysis in the sortase A transpeptidase from Staphylococcus aureus. J. Biol. Chem. 2008;283:14762–14771. doi: 10.1074/jbc.M800974200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schmohl L., Bierlmeier J., Gerth F., Freund C., Schwarzer D. Engineering sortase A by screening a second-generation library using phage display. J. Pept. Sci. 2017;23:631–635. doi: 10.1002/psc.2980. [DOI] [PubMed] [Google Scholar]
- 18.Bentley M.L., Gaweska H., Kielec J.M., McCafferty D.G. Engineering the substrate specificity of Staphylococcus aureus Sortase A. The beta6/beta7 loop from SrtB confers NPQTN recognition to SrtA. J. Biol. Chem. 2007;282:6571–6581. doi: 10.1074/jbc.M610519200. [DOI] [PubMed] [Google Scholar]
- 19.Suree N., Liew C.K., Villareal V.A., Thieu W., Fadeev E.A., Clemens J.J., Jung M.E., Clubb R.T. The structure of the Staphylococcus aureus sortase-substrate complex reveals how the universally conserved LPXTG sorting signal is recognized. J. Biol. Chem. 2009;284:24465–24477. doi: 10.1074/jbc.M109.022624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ton-That H., Mazmanian S.K., Alksne L., Schneewind O. Anchoring of surface proteins to the cell wall of Staphylococcus aureus. Cysteine 184 and histidine 120 of sortase form a thiolate-imidazolium ion pair for catalysis. J. Biol. Chem. 2002;277:7447–7452. doi: 10.1074/jbc.M109945200. [DOI] [PubMed] [Google Scholar]
- 21.Wójcik M., Szala K., van Merkerk R., Quax W.J., Boersma Y.L. Engineering the specificity of Streptococcus pyogenes sortase A by loop grafting. Proteins. 2020;88:1394–1400. doi: 10.1002/prot.25958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nikghalb K.D., Horvath N.M., Prelesnik J.L., Banks O.G.B., Filipov P.A., Row R.D., Roark T.J., Antos J.M. Expanding the scope of sortase-mediated ligations by using sortase homologues. Chembiochem. 2018;19:185–195. doi: 10.1002/cbic.201700517. [DOI] [PubMed] [Google Scholar]
- 23.Katoh K., Rozewicki J., Yamada K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ashkenazy H., Abadi S., Martz E., Chay O., Mayrose I., Pupko T., Ben-Tal N. ConSurf 2016: An improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Landau M., Mayrose I., Rosenberg Y., Glaser F., Martz E., Pupko T., Ben-Tal N. ConSurf 2005: The projection of evolutionary conservation scores of residues on protein structures. Nucleic Acids Res. 2005;33:W299–W302. doi: 10.1093/nar/gki370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Perry A.M., Ton-That H., Mazmanian S.K., Schneewind O. Anchoring of surface proteins to the cell wall of Staphylococcus aureus. III. Lipid II is an in vivo peptidoglycan substrate for sortase-catalyzed surface protein anchoring. J. Biol. Chem. 2002;277:16241–16248. doi: 10.1074/jbc.M109194200. [DOI] [PubMed] [Google Scholar]
- 27.Schmohl L., Bierlmeier J., von Kügelgen N., Kurz L., Reis P., Barthels F., Mach P., Schutkowski M., Freund C., Schwarzer D. Identification of sortase substrates by specificity profiling. Bioorg. Med. Chem. 2017;25:5002–5007. doi: 10.1016/j.bmc.2017.06.033. [DOI] [PubMed] [Google Scholar]
- 28.Kruger R.G., Dostal P., McCafferty D.G. Development of a high-performance liquid chromatography assay and revision of kinetic parameters for the Staphylococcus aureus sortase transpeptidase SrtA. Anal. Biochem. 2004;326:42–48. doi: 10.1016/j.ab.2003.10.023. [DOI] [PubMed] [Google Scholar]
- 29.Race P.R., Bentley M.L., Melvin J.A., Crow A., Hughes R.K., Smith W.D., Sessions R.B., Kehoe M.A., McCafferty D.G., Banfield M.J. Crystal structure of Streptococcus pyogenes sortase A: Implications for sortase mechanism. J. Biol. Chem. 2009;284:6924–6933. doi: 10.1074/jbc.M805406200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Arnold K., Bordoli L., Kopp J., Schwede T. The SWISS-model workspace: A web-based environment for protein structure homology modelling. Bioinformatics. 2006;22:195–201. doi: 10.1093/bioinformatics/bti770. [DOI] [PubMed] [Google Scholar]
- 31.Bordoli L., Kiefer F., Arnold K., Benkert P., Battey J., Schwede T. Protein structure homology modeling using SWISS-MODEL workspace. Nat. Protoc. 2009;4:1–13. doi: 10.1038/nprot.2008.197. [DOI] [PubMed] [Google Scholar]
- 32.Zou Z., Nöth M., Jakob F., Schwaneberg U. Designed Streptococcus pyogenes sortase A accepts branched amines as nucleophiles in sortagging. Bioconjug. Chem. 2020;31:2476–2481. doi: 10.1021/acs.bioconjchem.0c00486. [DOI] [PubMed] [Google Scholar]
- 33.Chan A.H., Yi S.W., Terwilliger A.L., Maresso A.W., Jung M.E., Clubb R.T. Structure of the Bacillus anthracis sortase A enzyme bound to its sorting signal: A flexible amino-terminal appendage modulates substrate access. J. Biol. Chem. 2015;290:25461–25474. doi: 10.1074/jbc.M115.670984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Raeeszadeh-Sarmazdeh M., Parthasarathy R., Boder E.T. Site-specific immobilization of protein layers on gold surfaces via orthogonal sortases. Colloids Surf. B Biointerfaces. 2015;128:457–463. doi: 10.1016/j.colsurfb.2015.02.044. [DOI] [PubMed] [Google Scholar]
- 35.Li J., Zhang Y., Soubias O., Khago D., Chao F.-A., Li Y., Shaw K., Byrd R.A. Optimization of sortase A ligation for flexible engineering of complex protein systems. J. Biol. Chem. 2020;295:2664–2675. doi: 10.1074/jbc.RA119.012039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pritz S., Wolf Y., Kraetke O., Klose J., Bienert M., Beyermann M. Synthesis of biologically active peptide nucleic acid-peptide conjugates by sortase-mediated ligation. J. Org. Chem. 2007;72:3909–3912. doi: 10.1021/jo062331l. [DOI] [PubMed] [Google Scholar]
- 37.Guimaraes C.P., Witte M.D., Theile C.S., Bozkurt G., Kundrat L., Blom A.E.M., Ploegh H.L. Site-specific C-terminal and internal loop labeling of proteins using sortase-mediated reactions. Nat. Protoc. 2013;8:1787–1799. doi: 10.1038/nprot.2013.101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Popp M.W., Artavanis-Tsakonas K., Ploegh H.L. Substrate filtering by the active site crossover loop in UCHL3 revealed by sortagging and gain-of-function mutations. J. Biol. Chem. 2009;284:3593–3602. doi: 10.1074/jbc.M807172200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hess G.T., Guimaraes C.P., Spooner E., Ploegh H.L., Belcher A.M. Orthogonal labeling of M13 minor capsid proteins with DNA to self-assemble end-to-end multiphage structures. ACS Synth. Biol. 2013;2:490–496. doi: 10.1021/sb400019s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wuethrich I., Peeters J.G.C., Blom A.E.M., Theile C.S., Li Z., Spooner E., Ploegh H.L., Guimaraes C.P. Site-specific chemoenzymatic labeling of aerolysin enables the identification of new aerolysin receptors. PLoS One. 2014;9 doi: 10.1371/journal.pone.0109883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Valgardson J., Cosbey R., Houser P., Rupp M., Van Bronkhorst R., Lee M., Jagodzinski F., Amacher J.F. MotifAnalyzer-PDZ: A computational program to investigate the evolution of PDZ-binding target specificity. Protein Sci. 2019;28:2127–2143. doi: 10.1002/pro.3741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kang C.-Y., Huang I.-H., Chou C.-C., Wu T.-Y., Chang J.-C., Hsiao Y.-Y., Cheng C.-H., Tsai W.-J., Hsu K.-C., Wang S. Functional analysis of Clostridium difficile sortase B reveals key residues for catalytic activity and substrate specificity. J. Biol. Chem. 2020;295:3734–3745. doi: 10.1074/jbc.RA119.011322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Magrane M., Consortium U. UniProt Knowledgebase: A hub of integrated protein data. Database (Oxford) 2011;2011 doi: 10.1093/database/bar009. bar009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.UniProt Consortium UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47:D506–D515. doi: 10.1093/nar/gky1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.NCBI Resource Coordinators Database resources of the national center for biotechnology information. Nucleic Acids Res. 2017;45:D12–D17. doi: 10.1093/nar/gkw1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Notredame C., Higgins D.G., Heringa J. T-coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000;302:205–217. doi: 10.1006/jmbi.2000.4042. [DOI] [PubMed] [Google Scholar]
- 47.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 48.Clamp M., Cuff J., Searle S.M., Barton G.J. The Jalview Java alignment editor. Bioinformatics. 2004;20:426–427. doi: 10.1093/bioinformatics/btg430. [DOI] [PubMed] [Google Scholar]
- 49.Johanning K., Juliano M.A., Juliano L., Lazure C., Lamango N.S., Steiner D.F., Lindberg I. Specificity of prohormone convertase 2 on proenkephalin and proenkephalin-related substrates. J. Biol. Chem. 1998;273:22672–22680. doi: 10.1074/jbc.273.35.22672. [DOI] [PubMed] [Google Scholar]
- 50.Bennett N.R., Jarvis C.M., Alam M.M., Zwick D.B., Olson J.M., Nguyen H.V., Johnson J.A., Cook M.E., Kiessling L.L. Modular polymer antigens to optimize immunity. Biomacromolecules. 2019;20:4370–4379. doi: 10.1021/acs.biomac.9b01049. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are contained in the article and the supporting information.