Abstract
Sialidases (3.2.1.18) may exhibit trans-sialidase activity to catalyze sialylation of lactose if the active site topology is congruent with that of the Trypanosoma cruzi trans-sialidase (EC 2.4.1.-). The present work was undertaken to test the hypothesis that a particular aromatic sandwich structure of two amino acids proximal to the active site of the T. cruzi trans-sialidase infers trans-sialidase activity. On this basis, four enzymes with putative trans-sialidase activity were identified through an iterative alignment from 2909 native sialidases available in GenBank, which were cloned and expressed in Escherichia coli. Of these, one enzyme, SialH, derived from Haemophilus parasuis had an aromatic sandwich structure on the protein surface facing the end of the catalytic site (Phe168; Trp366), and was indeed found to exhibit trans-sialidase activity. SialH catalyzed production of the human milk oligosaccharide 3’-sialyllactose as well as the novel trans-sialylation product 3-sialyllactose using casein glycomacropeptide as sialyl donor and lactose as acceptor. The findings corroborated that Tyr119 and Trp312 in the T. cruzi trans-sialidase are part of an aromatic sandwich structure that confers trans-sialylation activity for lactose sialylation. The in silico identification of trans-glycosidase activity by rational active site topology alignment thus proved to be a quick tool for selecting putative trans-sialidases amongst a large group of glycosyl hydrolases. The approach moreover provided data that help understand structure-function relations of trans-sialidases.
Introduction
The CAZy database for carbohydrate-active enzymes contains more than 280000 identified glycosyl hydrolases (GH) [1]. A recent trend in the field of glycobiology is the screening for trans-glycosylation activity among enzymes from various GH families [2]. In trans-glycosylation, the glycosyl residue retained in the covalent intermediate (CI) is attacked by a hydroxyl group of an acceptor molecule, e.g. a sugar or an alcohol, to which transfer of the glycosyl residue occurs, rather than to a water molecule as is the case in hydrolysis [3]. Thus, a trans-glycosidase can be defined as a GH whose primary activity at a given set of conditions is the transfer of a glycoside residue from a donor molecule to an acceptor molecule different from water.
In nature, saccharide synthesis is typically carried out by glycosyltransferases. However, glycosyltransferases are not suitable for industrial application in vitro as they require the use of nucleotide-activated glycosides as donor substrates [4]. In the trans-glycosylation reaction, however, a specific monosaccharide is transferred from one glycan (donor) to another glycan (acceptor), enabling the use of cheap biomass donors.
Despite the interest in trans-glycosidases, their rate of identification has been slow. Experimentally, GH enzymes are often identified using synthetic chromogenic substrates amenable to high-throughput screening, in which a chromophore is released irrespective of whether the monosaccharide is transferred to water (hydrolysis) or to a glycan or an aglycone moiety (trans-glycosylation). Very few screening assays for the specific identification of trans-glycosylation enzymes have been suggested [5] and consequently the vast majority of trans-glycosidases identified thus far have been identified via different routes. Therefore, it seems enticing to use bioinformatics to search for trans-glycosidases, but this strategy is hampered by the very limited number of known trans-glycosidases in different GH families. Furthermore, trans-glycosidases tend to be structurally more closely related to hydrolytic enzymes of the same GH family than to trans-glycosidases of different families, suggesting that subtle molecular adjustments rather than major modifications lead to the evolution of trans-glycosidases and questioning the existence of conserved traits among trans-glycosidases [6]. The structure-function relationship of trans-glycosidases is far from fully understood, but the importance of acceptor binding, substrate orientation, and exclusion of water from the active site has been pointed out in several cases [7–10].
As part of our quest to develop biocatalytic routes for human milk oligosaccharide synthesis, the aim of this study was to identify a trans-sialidase activity marker through the investigation of the trans-sialidase of the protozoan Trypanosoma cruzi (TcTS). The 3D structure and mechanism of TcTS was first reported in 2002 [9] and, due to its significant contribution to T. cruzi virulence, TcTS is to date the best characterized trans-sialidase. Other trans-sialidases have been identified, all originating from the Trypanosoma genus [11–14].
TcTS was identified as the ideal case study for several reasons. Firstly, it is extremely selective towards trans-sialylation with minimal hydrolytic activity [15]. Furthermore, the strictly hydrolytic sialidase of T. rangeli (TrSA), which has a sequence identity of 70% to TcTS, has recently been engineered to primarily carry out trans-sialylation under certain reaction conditions [10, 16, 17]. Trans-sialidases are also of commercial interest as it has been demonstrated that both TcTS and mutants of TrSA can efficiently sialylate a variety of glycans, enabling the production of sialylated human milk oligosaccharides [10, 16, 18].
In TcTS, two residues proximal to the active site (Y119 and W312) forming an aromatic sandwich have been reported to contribute to the trans-sialylation activity through stacking interactions with the acceptor glycan and creation of a narrow, water-excluding entry to the active site [9]. Introducing the Y119S mutation in TcTS caused an almost complete loss of trans-sialidase activity [19]. On the other hand, introduction of the opposite mutation in TrSA was not sufficient to confer trans-sialidase activity on TrSA, although the hydrolase activity was halved [19]. These results underline that Y119 is necessary, albeit not sufficient, for trans-sialidase activity in TcTS. Molecular dynamics (MD) simulations have shown the conformational flexibility of W312: when in the apo form, the catalytic cleft of TcTS is in an open form, but upon binding of a sialic acid (SA) moiety, W312 changes conformation to form a catalytic cleft, which is considerably more closed than the one of TrSA [20]. This conformational change leads to reduced solvent exposure of the TcTS covalent intermediate, which is indeed the most sensitive phase in catalysis when it comes to solvent exposure. Y119 affects W312 mobility, again emphasizing the importance of this aromatic sandwich [20]. It has also been suggested from MD simulations that conformational changes of the aromatic sandwich is responsible for expulsion of the donor aglycone leaving group from the active site followed by loading of the acceptor substrate into a position where the subsequent bond formation can take place. All this happens in a movement where W312 assisted by Y119 functions as a molecular shovel, and the aromatic sandwich thus plays a critical role in orientation of the acceptor [21]. Furthermore, Y119 has been introduced into all mutants of TrSA successfully displaying trans-sialidase activity [10, 15]. Although remarkably improved trans-sialidase activity has been obtained, the trans-sialidase activity of TcTS is still superior, underlining that the creation of a trans-sialidase–even from a well-characterized template–is not trivial [10, 15].
As an alternative, this work undertakes the task of using a rational 3D alignment approach in the search for novel, native trans-sialidases using the knowledge that the Y119-W312 aromatic sandwich is of paramount importance for trans-sialidase activity and thus a suitable trans-sialidase identification marker due to its specific structural characteristics facilitating the definition of a search motif. The hypothesis that was tested in this study is therefore: A sialidase with an aromatic sandwich proximal to the active site, as it is found in TcTS, will be a trans-sialidase. In the study an iterative in silico screening process a database of 2909 sialidases was gradually reduced to four candidates, one with an aromatic sandwich and three with similarly narrow binding clefts, but missing one of the aromatic sandwich residues. Recombinant expression followed by enzymatic activity studies were used to evaluate the four identified candidates. The candidate having an aromatic sandwich, a sialidase from Haemophilus parasuis, indeed proved to be a trans-sialidase.
Materials & Methods
Bioinformatics and in silico screening
A search motif was created to identify possible trans-sialidase candidates. Adopting the nomenclature proposed by Aasland et al. [22], the five amino acid search motif is given as ΩxRDR, where ‘Ω’ represents an aromatic amino acid and ‘x’ is any residue (Box 1). Thus, the search motif identified TcTS with W312 and R314 (bold) as shown in Box 1. The terminal residues (‘DR’) were included in the very short search motif to achieve sufficient specificity and enable a meaningful search.
Box 1. The search motif ΩxRDR used for in silico screening. ‘Ω’ represents an aromatic amino acid and ‘x’ is any residue. The bottom row indicates the corresponding residues in TcTS.
Motif: | Ω x R D R |
● ● ● ● | |
TcTS (W312—R316): | W L R D R |
The initial library dataset was assembled by acquisition of all genes in the NCBI databank matching a description of ‘sialidase’ or ‘neuraminidase’. Putative enzymes were included and redundant sequences were removed. In total, 2909 (putative) sialidases were included and screened for the presence of the motif. Candidate enzymes were modeled by the HHPred server [23] using the TcTS model (PDB ID: 1S0I) [11] as the template when possible (otherwise, the top 5 template hits were used). Subsequently, each individual model was structurally aligned to the TcTS crystal structure model using the structure alignment function of the PyMOL Molecular Graphics System (Version 1.7.4 Schrödinger, LLC).
Micromonospora viridifaciens neuraminidase (MvSA; PDB ID: 1W8O) [24], Vibrio cholerae (VcSA; PDB ID: 1W0P) [25], Clostridium perfringens (CpSA; PDB ID: 2VK5) [26], and Salmonella typhimurium (StSA; PDB ID: 3SIL) were used for analysis of the active site structure. Again, PyMOL was used for preparation of figures.
Enzyme production
The candidate genes as they appear in GenBank were prepared for expression in the following manner: Signaling sequences were identified and removed. At the 5’ end of the protein coding sequence, a His6-tag sequence followed by a protease site sequence was added. The resulting genes were synthesized and inserted into the pJ414 expression vector by DNA2.0 (Menlo Park, CA, USA) allowing for IPTG-induced expression of E. coli BL21 DE3 transformed with the resulting plasmid. Transformants were picked and used to inoculate 1 L of auto-induction medium ZYM-5052 [27] containing 100 μg/mL ampicillin for 16 h with shaking at 25°C in a 5 L shake-flask. The cells were harvested by centrifugation (20 min at 5000 g) and re-suspended in binding buffer (20 mM citrate-phosphate buffer, 100 mM NaCl, 15 mM imidazole, pH 7.4). Cells were lysed by addition of 50 μg/mL lysozyme, followed by a 10 min period of incubation on ice and 6 x 30 s sonication, separated by equal breaks. The cell debris was pelleted by centrifugation at 20000 g for 20 min. The supernatant was filtered through a 0.45 μm filter and subsequently loaded onto a 1 mL Ni2+-sepharose gravity column and washed by addition of 10 mL of a wash buffer containing 24 mM imidazole, 100 mM NaCl and 20 mM citrate-phosphate buffer (pH 7.4). The protein was eluted from the column in 3 mL of elution buffer containing 500 mM imidazole, 100 mM NaCl and 20 mM citrate-phosphate buffer (pH 7.4). Imidazole was removed from the samples using PD10 columns (GE Healthcare, Uppsala, Sweden) following the manufacturer’s instructions. The purity of the final enzyme preparations was confirmed by SDS-PAGE.
Time course experiment
Casein glycomacropeptide (CGMP) was obtained and purified as described previously [28]. Lactose was added to the CGMP solution to obtain final concentrations of 137.5 g/L and 50.6 g/L, respectively. Enzyme preparations were diluted to equal concentrations of total protein and an appropriate volume was heat inactivated at 90°C for 30 min. For each time point, reaction duplicates with active enzyme were started by addition of 135 μL enzyme to 365 μL substrate mix (both preheated to 30°C) and the resulting reaction mixture (100 g/L lactose, 37 g/L CGMP corresponding to approximately 4 mM 3’-bound and 4 mM 6’-bound SA) was incubated at 30°C with shaking (700 rpm). Inactivated enzymes were used as controls. Reactions were stopped by heat inactivation at 90°C for 10 min before the reaction was transferred to a 5 kDa spin filter (Sartorius AG, Göttingen, Germany) and centrifuged for 10 min at 5000 g and 4°C. As heat inactivation was unsuccessful for the SialA enzyme, these samples were stopped directly by 5 kDa filtration as described above. To evaluate the trans-sialylation activity of the enzymes, product-to-hydrolysis ratios were calculated as:
where (at a given time point) CP is the concentration of a given sialyllactose (SL) variant and CSA is the concentration of SA. A trans-sialidase was defined as an enzyme exhibiting an RP,H > 1 at any time point.
Sample analysis by HPAEC-PAD
Concentrations of 3’SL, 6’SL and SA in the reaction permeate were measured by high-performance anion exchange chromatography with pulsed amperometric detection (HPAEC-PAD) analysis using a CarboPacTM PA100 (4 mm × 250 mm) analytical column equipped with a CarboPacTM PA100 (4 mm × 50 mm) guard column (Dionex Corp., Sunnyvale, CA) on a Dionex ICS-3000 system (Dionex Corp., Sunnyvale, CA). The operating conditions and analysis procedure have been described previously [17], and resulted in base line separation of the analytes. SA (Sigma-Aldrich, Steinheim, Germany), 3’SL and 6’SL (Carbosynth Ltd, Compton, UK) were used as external standards. For novel reaction products where no standard was available, the product formation was estimated using the extinction coefficient of 3’SL. As part of the identification of the novel trans-sialylation compound 3-sialyllactose, the reaction mixture was treated with β-galactosidase from Aspergillus oryzae (Sigma-Aldrich, Steinheim, Germany) prior to HPAEC-PAD analysis and purification by anion exchange chromatography.
Anion exchange chromatography
Anion exchange chromatography for the purification of sialylated compounds prior to MALDI-TOF and NMR analysis was performed with an ÄKTA purifier 100 (GE Healthcare, Uppsala, Sweden) equipped with a SepharoseQ column (GE Healthcare, Uppsala, Sweden) as described previously [18].
Matrix-assisted laser desorption/ionization—time of flight (MALDI-TOF)
A 1 μL sample and 1 μL of 10 mg/mL norharmane matrix in 50% acetonitrile was applied to a MTP 384 target plate ground steel BC (Bruker Daltonics GmbH, Bremen, Germany), mixed on the target and dried under a stream of air. The samples were analyzed with an Ultraflex MALDI-TOF/TOF instrument (Bruker Daltonics GmbH, Bremen, Germany). The instrument was operated in negative acquisition mode and controlled by the FlexControl 3.3 software package. All spectra were obtained using the linear mode with an acceleration voltage of 20 kV, and pulsed ion extraction of 10 ns. The acquisition range used was from m/z 400 to 2500. Post-source decay (PSD) spectra using the Bruker Daltonics LIFT system were recorded at 8 kV precursor ion acceleration voltage and fragment acceleration (LIFT voltage 19 kV). The reflector voltage 1 and 2 were set to 29.6 and 13.9 kV, respectively.
NMR spectroscopy
One-dimensional 1H, 2D 1H1H and 1H13C NMR spectra were acquired at 298 K on a Bruker Avance III 600 spectrometer, operating at a 1H frequency of 600.13 MHz, equipped with a 5-mm 1H TXI probe (Bruker BioSpin, Germany). Proton and carbon chemical shifts were measured in reference to an internal acetone standard (Sigma-Aldrich; 1H δ 2.225 and 13C δ 31.08 ppm). After synthesis, samples were lyophilized and redissolved in 650 μL D2O (Sigma-Aldrich). 1D 1H NMR spectra were recorded with a spectral width of 6000 Hz, and a total of 64 scans were collected into 32 K complex data points. Water signal was irradiated using pre-saturation. 1H1H COSY spectra were recorded with 256 increments in 4096 complex data points with a spectral width of 6000 Hz. 1H13C HSQC spectra were recorded with a spectral sweep width of 6000 Hz in t2 and 22637 Hz in t1 direction. All spectra were acquired and processed using Topspin 3.0 (Bruker BioSpin, Germany). With regard to evaluation of spectra, all resonances are referenced to internal acetone (δ 2.225).
Results & Discussion
TcTS analysis and trans-sialidase identification strategy
A visual representation of the TcTS Michaelis complex with 3’SL is presented in Fig 1 and an overview of the 11 residues interacting with the substrate in the Michaelis complex is given in Table 1 [9, 11]. Of these, R35, R245, and R314 form an Arg triad responsible for sialyl carboxylate fixation, whereas D59 is the general acid/base catalyst and Y342 acts as the nucleophile; all of these are conserved in all sialidases and trans-sialidases [9, 24]. In addition, R53, D96, W120, and Q195 also take part in sialyl fixation, while W312 and Y119 interacts with and orient the acceptor (Table 1; Fig 1). Aiming to identify new trans-sialidases, this study focused on the residues interacting in the Michaelis complex of TcTS (Fig 1; Table 1), which are conserved in all trypanosomal trans-sialidases, rather than looking for the seven amino acid residues conserved in all hydrolytic sialidases, which apart from the Arg triad, the general acid/base catalyst, and the nucleophile also include E230 acting as a base catalyst for the nucleophile and E357 stabilizing R35 through a salt bridge [9, 24]. Several factors pointed to the aromatic sandwich comprised of W312 and Y119, which are unique to trypanosomal trans-sialidases, as a good trans-sialidase identification marker. Apart from being reported to be essential for the trans-sialidase activity of TcTS, W312 in the aromatic sandwich is located close to R314, which is one of the conserved residues in all sialidases and trans-sialidases [9, 11]. The close proximity of the two residues enabled the creation of a search motif which could be used for a database search, namely ΩxRDR, where ‘Ω’ represents an aromatic amino acid and ‘x’ is any residue (Box 1).
Table 1. Summary of the interactions between TcTS and 3’-sialyllactose in the Michaelis complex; SA is sialyl [11, 24].
TcTS properties | Residues | Role of residues | Hydrogen bonds(to substrate) | Hydrophobic interactions | Strictly conserved |
---|---|---|---|---|---|
Sialic acid moiety fixation | R35 | Carboxylate fixation | SA_O1A, SA_ O1B | - | + |
R245 | - | + | |||
R314 | - | + | |||
D96 | Acetamide fixation | SA_N5 | - | - | |
W120 | Glycerol fixation | SA_O9 | - | - | |
Q195 | SA_O8, SA_O9 | - | - | ||
R53 | Ring fixation | SA_O4 | - | - | |
Catalysis | D59 | Acid/base catalyst | SA_O2 | - | + |
Y342 | Enzymatic nucleophile | (covalently bound to C2) | - | + | |
Lactose moiety fixation | Y119 | Aromatic sandwich | -* | + | - |
W312 | - | + | - |
*Y119 goes through a conformational change during the catalytic process, where a hydrogen bond to the substrate is formed [11].
The initial database search was an identification of putative sialidases with an aromatic residue in a position relative to the conserved Arg residue where it could compose one half of an aromatic sandwich. The terminal ‘DR’ part of the motif corresponding to residues 315–316 in TcTS is not well conserved in non-trypanosomal sialidases, but the risk of losing potential trans-sialidases due to the specificity of the search motif was accepted as a premise for obtaining a manageable quantity of sequences for the subsequent homology modeling. Furthermore, false positives could be identified due to the presence of an aromatic residue not being a part of an acceptor binding site.
Therefore, a secondary screen was devised to validate the identified candidates, that is, to determine whether the other half of the sandwich was present. This was done by construction of homology models of the sequences in the reduced library. These homology models were aligned with TcTS and evaluated manually based on two criteria: 1) restoration of the active site with nine sialyl interacting residues, i.e. the seven residues involved in sialyl fixation and the two catalytic residues (Table 1), and 2) the formation of a narrow (shielded) binding cleft above the active site, preferably with an aromatic sandwich composed of two aromatic amino acid residues.
Evaluation of initial database reduction
The initial database screening using the ΩxRDR search motif (Box 1) resulted in the identification of 46 sequences, of which 10 were redundant and 20 were sequences of already known trans-sialidases from T. cruzi and T. brucei (S1 Table). The resulting 16 was an acceptable number of sequences to enter into the next round of screening where homology models were created. Since the initial sialidase database included putative enzymes, faulty classification of enzymes identified during the sequence screening would also result in identification of false positives. However, the presence of false positives amongst the identified sequences did not pose a problem since such sequences were highly unlikely to produce homology models with an intact active site.
Evaluation of homology models
For two of the 16 sequences, no template for homology modeling was available (S2 Table). Homology models for the other 14 candidates were constructed and were–without regard to the quality of the models–aligned to TcTS for inspection (Fig 2; S1 Fig). Of these, 12 sequences fell for criterion 1 as they did not contain the nine sialyl interacting active site residues (S1 Fig; S2 Table). In an effort to identify any sequences that might have been missed during the initial sequence search, the two remaining sequences of the enzyme candidate models were submitted to an NCBI BLAST search and the two closest homologues were inspected to identify sequences with an aromatic residue in a position equivalent to W312 in TcTS. This led to the identification of two such sequences that were introduced in the study, modeled, and successfully evaluated against the two selection criteria (S2 Table). In total, four candidates fulfilled the two criteria, i.e. they had all nine sialyl interacting active site residues, as well as a narrow binding cleft sandwich structure, namely (putative) sialidases from Actinomyces oris, Haemophilus parasuis, Manheimia heamolytica, and Pasteurella multocida, which were named SialA, SialH, SialM, and SialP, respectively (Table 2; Fig 2). SialM, SialP, and SialH had sequence identities to each other of 58–62%, whereas SialA had sequence identities to the three others of 25–29%.
Table 2. An overview of the origin and selected properties of the candidate genes.
Candidate | Organism name | Uniprot ID | Submitted Name | MW | Length |
---|---|---|---|---|---|
SialA | Actinomyces oris/Actinomyces viscosus* | Q44562* | NanH sialidase* | 73,089 | 685 AA |
SialH | Haemophilus parasuis | I7AJQ9 | NanH sialidase | 89,948 | 803 AA |
SialM | Manheimia heamolytica | E2P8L4 | NanH sialidase | 89,245 | 795 AA |
SialP | Pasteurella multocida | I1VL53** | NanH sialidase | 89,217 | 798 AA |
*The sequence of Actinomyces oris (sequence ID given in S1 Table) was cloned and expressed in this work, but is not available in UniProt. Hence, the UniProt ID and submitted of the closest homologue (A. viscosus; 97% sequence identity), which was deselected out of redundancy in this work (S1 Table), is given here.
**This UniProt entry has recently been made redundant; UniProt ID Q9EZV6 lists a 51 amino acids shorter P. multocida NanH sialidase with a sequence identity of 95%.
One candidate enzyme, SialH, fulfilled both predefined selection criteria: In addition to a perfectly aligned active site, the SialH model predicted the presence of an aromatic sandwich comprised by W366 and F168, aligning to the TcTS W312 and Y119, respectively (Table 3; Fig 2). The remaining three candidates, SialP, SialA, and SialM, did not appear to harbor aromatic sandwiches. As predicted, the enzymes displayed aromatic amino acid residues corresponding to TcTS W312, but the other half of the aromatic sandwich, corresponding to TcTS Y119 was missing, although in SialA a Trp was found adjacent to the residue G404 that aligns with TcTS Y119, which could be a modeling error. Modeling errors were also suspected with respect to SialM G296 and SialA T473, since the expected amino acids were found in direct adjacency (Table 3).
Table 3. Summary of the 3D alignment visualized in Fig 2.
TcTS properties | TcTS | SialH | SialP | SialA | SialM |
---|---|---|---|---|---|
Sialic acid moiety fixation | R35 | R80 | R73 | R311 | R79 |
R245 | R298 | R291 | R524 | G296 (R297)1 | |
R314 | R368 | R359 | R592 | R360 | |
D96 | D143 | D136 | D379 | D142 | |
W120 | W169 | W162 | W403 | W168 | |
Q195 | Q245 | Q238 | T473 (Q472)1 | Q243 | |
R53 | R99 | R92 | R330 | R98 | |
Catalysis | D59 | D105 | D98 | D340 | D104 |
Y342 | Y402 | Y393 | Y620 | Y394 | |
Lactose moiety fixation | Y119 | F1683 | N1612 | G404 (W403)1 | Q1672 |
W312 | W366 | Y3573 | W590 | W358 |
1 Modeling error suspected—alternative alignment in brackets
2 Non-aromatic residue
3 Alternative aromatic residue
Although an aromatic sandwich was absent in three enzymes, they were included in the study because of the closed, shielded nature of their binding clefts, which might facilitate the exclusion of water from the active site. The exclusion of water from the active site has previously been suggested to inhibit hydrolytic activity and in turn promote trans-sialidase activity in engineered sialidases, albeit by a different suggested mechanism [10]. A series of sialidase structure alignments were carried out to assert the notion that a closed active site will render trans-sialidase activity. The sialidase of Micromonospora viridifaciens (MvSA), which is reported to exhibit no trans-sialidase activity [29], showed a very open active site structure compared to TcTS, thus supporting selection criterion 2 (Fig 3). In addition, the sialidases of Vibrio cholerae, Clostridium perfringens and Salmonella typhimurium have previously been reported to exhibit limited trans-sialidase activity [30]. Indeed, their active sites are less open than the one of MvSA, but still considerably more open than the one of TcTS (Fig 3).
Experimental characterization of trans-sialidase candidates
The four identified candidate enzymes were examined for trans-sialidase activity upon successful expression in E. coli as His6-tagged proteins. As previously, casein glycomacropeptide (CGMP) was used as donor substrate and lactose as the acceptor substrate for 3’SL production [16, 28, 31] and this reaction was performed in a time study setup, to evaluate trans-sialidase activity (Table 4; Fig 4). CGMP is a side stream product from cheese production and contains 9 mM bound α-2,3- and α-2,6-linked sialic acid in a ratio of 1:1 [32], which makes it an interesting substrate for industrial production of sialylated compounds. Our primary candidate, SialH, was capable of catalyzing trans-sialylation with a product-to-hydrolysis ratio peaking at above 2.5 for the production of 3’SL, making it the first bacterial trans-sialidase to be identified (Table 4). This novel trans-sialidase produced a relatively stable 3’SL concentration of 1.5 mM (Fig 4). Interestingly, another reaction product was formed besides 3’SL and SA. Analyzed by MALDI-TOF, the product gave rise to an m/z 632-ion suggesting that it is an SL compound (Fig 5), but HPAEC-PAD analysis ruled out that it was 6´SL (S2 Fig). The compound was susceptible to β-galactosidase hydrolysis, indicating that the compound was not sialylated on the Gal moiety at the non-reducing end (S2 Fig). The lack of sialylation at the non-reducing galactose was confirmed by NMR (S3 Fig), and the novel compound was identified as 3-sialyllactose (3SL). 3’-sialylated glycans are the primary product of trans-sialylation reactions using the Trypanosoma trans-sialidases, but also 6’-sialylation has been reported using other sialidases [30]. So far, 3SL is a novel trans-sialylation product.
Table 4. Best product-to-hydrolysis ratio (RP,H) for the four candidates calculated for each trans-sialylation product.
3’SL | 6’SL | 3SL | |
---|---|---|---|
SialH | 2.5 | 0.010 | 0.70 |
SialP | 0.066 | 0.004 | 0.020 |
SialA | n.a. | 0.016 | n.a. |
SialM | 0.088 | 0.020 | n.a. |
For the three remaining enzyme candidates selected based on the narrow acceptor site, only very limited trans-sialylation activity was observed and they primarily hydrolyzed the donor-bound SA (Fig 4; Table 4). Product specificity in the trans-sialylation varied among the candidates, which is not uncommon for this class of enzymes [30]: SialP preferentially catalyzed synthesis of 3’SL, but the novel product 3SL was also observed. SialM preferred synthesis of 3’SL over 6’SL, whereas SialA appeared to exclusively catalyze synthesis of 3’SL. However, concentrations of these trans-sialylation products never exceeded 0.16 mM (Fig 4), and they were completely hydrolyzed after 30 h. The finding that SialH is the only trans-sialidase among the identified candidates supports the hypothesis that an aromatic sandwich above the active site is a suitable marker for identification of trans-sialidases.
Conclusions
The rational identification of novel trans-sialidases using a simple computational approach based on a putative acceptor binding sites was explored. The study was based on the hypothesis that an aromatic sandwich present immediately above the active site can be used as a trans-sialidase identification marker. From a database of 2909 sequences coding for genes categorized as sialidases, one candidate–SialH from H. parasuis–was predicted to have an active site displaying an aromatic sandwich comprised of W366 and F168. Three additional candidates displayed a narrow binding cleft (but no aromatic sandwich) and were included in experimental evaluation of trans-sialidase activity. The results supported the initial hypothesis regarding the importance of the aromatic sandwich with SialH being the only trans-sialidase among the four analyzed candidates. SialH was shown to exhibit a product-to-hydrolysis ratio (RP,H) of 2.5, categorizing the enzyme as a trans-sialidase since RP,H>1. Surprisingly, a total of three sialylation products were identified during this study, including the novel trans-sialylation product 3SL, which was produced efficiently by SialH (R3SL,H = 0.7). It is the first time that a sequence analysis approach has been successful in identifying a trans-sialidase and the first time a rational approach has led to the discovery of a trans-sialidase outside the Trypanosoma genus.
Supporting Information
Acknowledgments
We thank Arla Foods Ingredients Group for the gift of the CGMP.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by the funding from Danish Dairy Research Foundation (Mejeribrugets Forskningsfond; www.ddrf.dk) (grant to RTN). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42: D490–D495. 10.1093/nar/gkt1178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zeuner B, Jers C, Mikkelsen JD, Meyer AS. Methods for improving enzymatic trans-glycosylation for synthesis of human milk oligosaccharide biomimetics. J Agric Food Chem. 2014;62: 9615–9631. 10.1021/jf502619p [DOI] [PubMed] [Google Scholar]
- 3.Koshland DE. Stereochemistry and mechanism of enzymatic reactions. Biol Rev. 1953;28: 416–436. [Google Scholar]
- 4.Desmet T, Soetaert W, Bojarová P, Křen V, Dijkhuizen L, Eastwick-Field V, et al. Enzymatic glycosylation of small molecules: challenging substrates require tailored catalysts. Chem Eur J. 2012;18: 10786–10801. 10.1002/chem.201103069 [DOI] [PubMed] [Google Scholar]
- 5.Koné FMT, Béchec ML, Sine JP, Dion M, Tellier C. Digital screening methodology for the directed evolution of transglycosidases. Protein Eng, Des Sel. 2009;22: 37–44. [DOI] [PubMed] [Google Scholar]
- 6.Bissaro B, Monsan P, Fauré R, O’Donohue MJ. Glycosynthesis in a waterworld: new insight into the molecular basis of transglycosylation in retaining glycoside hydrolases. Biochem J. 2015;467: 17–35. 10.1042/BJ20141412 [DOI] [PubMed] [Google Scholar]
- 7.Feng HY, Drone J, Hoffmann L, Tran V, Tellier C, Rabiller C, et al. Converting a β-glycosidase into a β-transglycosidase by directed evolution. J Biol Chem. 2005;280: 37088–37097. [DOI] [PubMed] [Google Scholar]
- 8.Osanjo G, Dion M, Drone J, Solleux C, Tran V, Rabiller C, et al. Directed evolution of the α-L-fucosidase from Thermotoga maritima into an α-L-transfucosidase. Biochemistry 2007;46: 1022–1033. [DOI] [PubMed] [Google Scholar]
- 9.Buschiazzo A, Amaya MF, Cremona ML, Frasch AC, Alzari PM. The crystal structure and mode of action of trans-sialidase, a key enzyme in Trypanosoma cruzi pathogenesis. Mol Cell. 2002;10: 757–768. [DOI] [PubMed] [Google Scholar]
- 10.Jers C, Michalak M, Larsen DM, Kepp KP, Li H, Guo Y, et al. Rational design of a new Trypanosoma rangeli trans-sialidase for efficient sialylation of glycans. PLoS One 2014;9: e83902 10.1371/journal.pone.0083902 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Amaya MF, Watts AG, Damager I, Wehenkel A, Nquyen T, Buschiazzo A, et al. Structural insights into the catalytic mechanism of Trypanosoma cruzi trans-sialidase. Structure 2004;12: 775–784. [DOI] [PubMed] [Google Scholar]
- 12.Ammar Z, Plazolles N, Baltz T, Coustou V. Identification of trans-sialidases as a common mediator of endothelial cell activation by African trypanosomes. PLoS Pathogens 2013;9: e1003710 10.1371/journal.ppat.1003710 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Montagna G, Cremona ML, Paris G, Amaya MF, Buschiazzo A, Alzari PM, et al. The trans-sialidase from the african trypanosome Trypanosoma brucei. Eur J Biochem. 2002;269:2941–2950. [DOI] [PubMed] [Google Scholar]
- 14.Tiralongo E, Martensen I, Grötzinger J, Tiralongo J, Schauer R. Trans-sialidase-like sequences from Trypanosoma congolense conserve most of the critical active site residues found in other trans-sialidases. Biol Chem. 2003;384: 1203–1213. [DOI] [PubMed] [Google Scholar]
- 15.Paris G, Ratier L, Amaya MF, Nguyen T, Alzari PM, Frasch ACC. A sialidase mutant displaying trans-sialidase activity. J Mol Biol. 2005;345: 923–934. [DOI] [PubMed] [Google Scholar]
- 16.Michalak M, Larsen DM, Jers C, Almeida JRM, Willer M, Li H, et al. Biocatalytic production of 3′-sialyllactose by use of a modified sialidase with superior trans-sialidase activity. Process Biochem. 2014;49: 265–270. [Google Scholar]
- 17.Zeuner B, Luo J, Nyffenegger C, Aumala V, Mikkelsen JD, Meyer AS. Optimizing the biocatalytic productivity of an engineered sialidase from Trypanosoma rangeli for 3’-sialyllactose production. Enzyme Microb Technol. 2014;55: 85–93. 10.1016/j.enzmictec.2013.12.009 [DOI] [PubMed] [Google Scholar]
- 18.Holck J, Larsen DM, Michalak M, Li H, Kjærulff L, Kirpekar F, et al. Enzyme catalysed production of sialylated human milk oligosaccharides and galactooligosaccharides by Trypanosoma cruzi trans-sialidase. New Biotechnol 2014;31: 156–165. [DOI] [PubMed] [Google Scholar]
- 19.Buschiazzo A, Tavares GA, Campetella O, Spinelli S, Cremona ML, Paris G, et al. Structural basis of sialyltransferase activity in trypanosomal sialidases. EMBO J. 2000;19: 16–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Demir Ö, Roitberg AE. Modulation of catalytic function by differential plasticity of the active site: case study of Trypanosoma cruzi trans-sialidase and Trypanosoma rangeli sialidase. Biochemistry 2009;48: 3398–3406. 10.1021/bi802230y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mitchell FL, Miles SM, Neres J, Bichenkova EV, Bryce RA. Tryptophan as a molecular shovel in the glycosyl transfer activity of Trypanosoma cruzi trans-sialidase. Biophys J. 2010;98: L38–L40. 10.1016/j.bpj.2010.01.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Aasland R, Abrams C, Ampe C, Ball LJ, Bedford MT, Cesareni G, et al. Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett. 2002;513: 141–144. [DOI] [PubMed] [Google Scholar]
- 23.Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res. 2005;33: W244–W248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Watson JN, Newstead S, Dookhun V, Taylor G, Bennet AJ. Contribution of the active site aspartic acid to catalysis in the bacterial neuraminidase from Micromonospora viridifaciens. FEBS Lett. 2004;577: 265–269. [DOI] [PubMed] [Google Scholar]
- 25.Moustafa I, Connaris H, Taylor M, Zaitsev V, Wilson JC, Kiefel MJ, et al. Sialic acid recognition by Vibrio cholerae neuraminidase. J Biol Chem. 2004;279: 40819–40826. [DOI] [PubMed] [Google Scholar]
- 26.Newstead SL, Potter JA, Wilson JC, Xu G, Chien CH, Watts AG, et al. The structure of Clostridium perfringens NanI sialidase and its catalytic intermediates. J Biol Chem. 2008;283: 9080–9088. 10.1074/jbc.M710247200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Studier FW. Protein production by auto-induction in high-density shaking cultures. Protein Expression Purif. 2005;41: 207–234. [DOI] [PubMed] [Google Scholar]
- 28.Luo J, Nordvang RT, Morthensen ST, Zeuner B, Meyer AS, Mikkelsen JD, et al. An integrated membrane system for the biocatalytic production of 3’-sialyllactose from dairy by-products. Bioresour Technol. 2014;166C: 9–16. [DOI] [PubMed] [Google Scholar]
- 29.Smith LE, Eichinger D. Directed mutagenesis of the Trypanosoma cruzi trans-sialidase enzyme identifies two domains involved in its sialyltransferase activity. Glycobiology 1997;7: 445–451. [DOI] [PubMed] [Google Scholar]
- 30.Schmidt D, Sauerbrei B, Thiem J. Chemoenzymatic synthesis of sialyl oligosaccharides with sialidases employing transglycosylation methodology. J Org Chem. 2000;65: 8518–8526. [DOI] [PubMed] [Google Scholar]
- 31.Nordvang RT, Luo J, Zeuner B, Prior R, Andersen MF, Mikkelsen JD, et al. Separation of 3′-sialyllactose and lactose by nanofiltration: A trade-off between charge repulsion and pore swelling induced by high pH. Sep Purif Technol. 2014;138: 77–83. [Google Scholar]
- 32.Villumsen NS, Jensen HB, Thu Le TT, Møller HS, Nordvang RT, Nielsen LR, et al. Self-assembly of caseinomacropeptide as a potential key mechanism in the formation of visible storage induced aggregates in acidic whey protein isolate dispersions. Int Dairy J. 2015;49: 8–15. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.