Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2017 Apr 5;292(21):8856–8863. doi: 10.1074/jbc.M117.777383

Production of homogeneous glycoprotein with multisite modifications by an engineered N-glycosyltransferase mutant

Qitao Song ‡,1, Zhigang Wu §,1, Yueyuan Fan , Woran Song , Peiru Zhang , Li Wang , Faxing Wang , Yangyang Xu , Peng G Wang ‡,2, Jiansong Cheng ‡,3
PMCID: PMC5448120  PMID: 28381551

Abstract

Naturally occurring N-glycoproteins exhibit glycoform heterogeneity with respect to N-glycan sequon occupancy (macroheterogeneity) and glycan structure (microheterogeneity). However, access to well-defined glycoproteins is always important for both basic research and therapeutic purposes. As a result, there has been a substantial effort to identify and understand the catalytic properties of N-glycosyltransferases, enzymes that install the first glycan on the protein chain. In this study we found that ApNGT, a newly discovered cytoplasmic N-glycosyltransferase from Actinobacillus pleuropneumoniae, has strict selectivity toward the residues around the Asn of N-glycosylation sequon by screening a small library of synthetic peptides. The inherent stringency was subsequently demonstrated to be closely associated with a critical residue (Gln-469) of ApNGT which we propose hinders the access of bulky residues surrounding the occupied Asn into the active site. Site-saturated mutagenesis revealed that the introduction of small hydrophobic residues at the site cannot only weaken the stringency of ApNGT but can also contribute to enormous improvement of glycosylation efficiency against both short peptides and proteins. We then employed the most efficient mutant (Q469A) other than the wild-type ApNGT to produce a homogeneous glycoprotein carrying multiple (up to 10) N-glycans, demonstrating that this construct is a promising biocatalyst for potentially addressing the issue of macroheterogeneity in glycoprotein preparation.

Keywords: glycoprotein, glycosyltransferase, N-linked glycosylation, post-translational modification (PTM), protein engineering

Introduction

N-Glycosylation is one of the most prominent post-translational protein modifications and plays a role in various cellular functions including protein folding, sorting, and stability and cell-cell interactions. The modification is associated with all three domains of life (Eukarya, Bacteria, and Archaea) and has similar molecular mechanisms. In short, an oligosaccharide is assembled in a stepwise manner on a lipid anchor, giving rise to a lipid-linked oligosaccharide, and then transferred en bloc onto the acceptor asparagine residues by oligosaccharyltransferase (OST)4 (13).

Recently, an alternative N-glycosylation system has been discovered in a few species in the class of γ-proteobacteria (46). A striking difference from the canonical N-glycosylation is that this novel pathway begins with the attachment of the linking monosaccharide from nucleotide-activated donors to asparagine side chains, which is catalyzed by a cytoplasmic N-glycosyltransferase (NGT) (7). The resolved crystal structure of NGT from Actinobacillus pleuropneumoniae (ApNGT) revealed that the N-glycosyltransferase contains an N-terminal α-helical domain fold and a C-terminal GT-B fold with two Rossmann-like domains. The extensive contacts between the α-helical domain and the GT-B domains generate a unique funnel-shaped groove adjacent to the UDP-hexose-binding site, which was proposed to accommodate the acceptor protein (8). Quite interestingly, ApNGT is neither phylogenetically nor structurally related to known OSTs. The GT-B domain of ApNGT is instead closely related to that of an O-GlcNAc transferase from Xanthomonas campestris (XcOGT) (8, 9), which involves in the regulation of the cellular protein O-GlcNAcylation (10).

So far, ApNGT is the most extensively investigated NGT-like N-glycosyltransferase and is capable of transferring both UDP- and GDP-activated glucose to an asparagine residue in the conserved Asn-X-Ser/Thr (where X ≠ Pro) sequon. UDP-Gal and UDP-Xyl can also be recognized by ApNGT, but substituted hexoses (GlcA, GlcNAc, GalNAc and Neu5Ac) are not tolerated (7). More recently, ApNGT has been employed for synthesis of polypeptides bearing defined complex oligosaccharides by coupling with an endoglycosidase (ENGase)-catalyzed transglycosylation. Because the Glc-Asn linkage is resistant to peptide-N-glycosidase (PNGase) cleavage, it was proposed to be able to improve the in vivo stability of glycopeptides/proteins (11).

As we know, bacterial OSTs recognize a more specific sequon that is extended by a negatively charged amino acid (Asp or Glu) at the −2 position relative to the Asn ((D/E)YNX(S/T); Y and X ≠ Pro) (12, 13). In contrast, ApNGT is capable of modifying polypeptides featuring sequences of eukaryotic N-glycoproteins despite no orthologous structure shared between ApNGT and OSTs, which highlights the hypothesis of convergent evolution of these two N-glycosylation systems (7). Heterogeneous proteins such as human erythropoietin (hEPO) and cholera toxin subunit B (CtxB) have been reported to be glycosylated when co-expressed with ApNGT, suggesting a novel route for glycoengineering in bacteria (14). In addition, the introduction of N-glycosylation sequons to increase carbohydrate content has also proved to be an effective strategy for improving properties of protein therapeutics (15, 16). Therefore, systematic investigation of position-specific residue preference within and surrounding the Asn-X-Ser/Thr sequon would facilitate the application of ApNGT in glycoprotein production. In this study we investigated the preference feature of ApNGT by screening three panels of synthetic peptide substrates and found a critical residue (Gln-469) being closely associated with the stringency of ApNGT. Site-saturated mutagenesis at this site gave rise to a series of valuable mutants with significantly enhanced glycosylation efficiency and expanded acceptor specificity. The most efficient mutant (Q469A) was successfully used for the preparation of homogeneous glycoprotein carrying multiple (up to 10) N-glycans, whereas the wild-type ApNGT produced heterogeneous glycoprotein.

Results

Investigation of position-specific residue preference of ApNGT

The residue preference within and surrounding the N-glycosylation sequon (position −1, +1, and +3) was investigated by screening three panels of synthetic peptide substrates in a systematic manner. For each position, at least five representative amino acids containing basic, acidic, hydroxyl, small hydrophobic, or aromatic side chains were evaluated. As shown in Fig. 1, A and B, both acidic residues and basic residues are disfavored for ApNGT at either the −1 or +1 position, whereas basic residues are particularly preferred at the +3 position. Also, residues with small side chains are relatively preferred at positions −1 and +1. Unexpectedly, the peptide with Gly at +1 position nearly abolished glycosylation. In light of the investigation, a shortest optimal sequence motif recognized by ApNGT would be given as “ANVTK.” Further investigation showed that only the peptide optimized at all three positions (containing the optimal motif) could achieve the maximum glycosylation (supplemental Fig. S1).

Figure 1.

Figure 1.

The residue preference within and surrounding the N-glycosylation sequon (−1, +1, and +3 positions). For each position at least five representative amino acids containing basic, acidic, hydroxyl, small hydrophobic, or aromatic side chains were evaluated. X presents variant amino acids. Contrasted to the mutant Q469A (0.4 μm), an ∼23-fold excess of the wild-type ApNGT (9.2 μm) was used in the assays due to its low efficiency.

Gln-469 contributes to acceptor specificity

Structural alignment of ApNGT and XcOGT, which share the closely related GT-B domains, showed that an extended loop-connecting sheet β10 and helix α19 (Lβ10α19) sticks out toward UDP complexed with ApNGT (Fig. 2A). Residues Gln-469 within Lβ10α19 and His-277 are structurally adjacent to each other and form the narrow “neck” of the acceptor binding funnel. Because the pair of residues locates above the UDP-binding site, the narrow entryway formed by them might block the access of amino acids with bulky side chains into the active site and thus plays a role in determining the residue preference surrounding the Asn of N-glycosylation sequons. His-277, which is conserved across all GT41 family members (classification according to CAZy database; Ref. 17), has been proposed to be involved in catalysis. The replacement of His-277 with Ala resulted in loss of ∼95% catalytic activity (8). However, the mutation of Gln-469 to Ala led to an ∼100-fold increase in the turnover rate constant (kcat) and an ∼162-fold increase in the catalytic efficiency (kcat/Km) against a peptide GGFNLTYTIER (supplemental Table S1). We next explored the residue preference feature of Q469A in the same manner as done for the wild-type ApNGT previously. As shown in Fig. 1, the mutant Q469A and the wild-type ApNGT show a similar preference toward charged residues at all three positions, whereas the former exhibits better tolerance at both the −1 and +1 positions for residues harboring the bulky side chain, such as Leu, Phe, and the acylated Lys at the −1 position as well as Ile and Leu at the +1 position. In addition, the mutant Q469A exhibits better tolerance toward the Gln-X-Ser/Thr sequon (supplemental Fig. S2, A and B), which has been reported to be glycosylated by the wild-type ApNGT at an extremely low rate (>15,000-fold reduction in turnover rate as compared with Asn-X-Ser/Thr; Ref. 7).

Figure 2.

Figure 2.

Putative peptide-binding sites of ApNGT. Shown is a comparative representation of the active sites of ApNGT (PDB ID 3Q3H) and XcOGT (PDB ID 2XGO). The extended loop (Lβ10α19) connecting sheet β10 and helix α19 in ApNGT is highlighted in red (A). The potential acceptor binding groove of ApNGT is shown in surface representation and colored according to the electrostatic potential (blue in the positive regions and red in the negative regions). The electrostatic potential surface was calculated by using PyMOL (Schrödinger, LLC) (B). UDP complexed in the structure of ApNGT is shown as sticks.

Site-saturated mutagenesis

To fully understand the role of Gln-469 in catalysis and determination of acceptor specificity, site-saturated mutagenesis was performed at this critical site. Finally, 17 mutants were successfully constructed, except two mutants (Q469M and Q469R) that were not expressed in Escherichia coli. The relative activity of the variants was assessed against a synthetic peptide containing the optimal motif ANVTK. The residue preference around the N-glycosylation site for each mutant was also probed in single catalytic system against multiple peptides in one pot (multipeptide one-pot). The glycosylated product profile was then analyzed by MALDI-TOF mass spectrometry. As we expected, the replacement of the target glutamine with small hydrophobic amino acids, such as Gly, Ala, Val, Ile, and Leu, resulted in more flexible residue selectivity at −1 and +1 positions (Table 1). Given that the mutant Q469A and the wild-type enzyme share the exact same residue preference at +3 position of the Asn-X-Ser/Thr sequon (Fig. 1C), the apparent difference of preference at the +3 position revealed by these variants was probably due to their different catalytic efficiency. To our surprise, most ApNGT variants including the wild-type enzyme showed comparable activity against a peptide (GGANVTKTIER) containing the optimal motif ANVTK, although the mutant Q469A is most efficient (Fig. 3).

Table 1.

The effect of substitutions at Gln-469 on the preference profile of ApNGT

The same amount of enzymes (1.4 μm) was used for all assays. Three sets of mixed peptides used were (GG[G/A/S/V/L/D/K/F]NLTYTIER, GGSN[G/A/S/P/V/L/D/K/F]TYTIER and GGSNLT[A/S/D/K/R/Y]TIER). −, products were not detected.

Mutants Residue preference at various positions
−1 +1 +3
Wild type Val
Q469G Gly Ala Ser Val Leu Phe Val Leu Phe Ala Ser Lys Arg Tyr
Q469A Gly Ala Ser Val Leu Phe Ala Ser Val Leu Phe Ala Ser Lys Arg Tyr
Q469V Ala Ser Leu Ala Ser Val Leu Phe Ala Ser Lys Arg Tyr
Q469I Ala Val Phe Lys
Q469L Gly Ala Ser Val Phe
Q469S Ala Phe Val Lys
Q469N Phe Val Lys
Q469T Val
Q469H Val
Q469C Val
Q469E Val
Q469F Val
Q469Y Val
Q469D
Q469K
Q469W
Q469P

Figure 3.

Figure 3.

The relative activity of 18 ApNGT variants against a peptide (GGANVTKTIER) containing the optimal sequence ANVTK. Assays were performed in a total volume of 30 μl containing 1 mm various peptides, 20 mm UDP-Glc, 0.4 μm enzyme, and buffered with 100 mm Tris-HCl (pH 8.0) at 37 °C for 45 min. All assays were analyzed by RP-HPLC and carried out in duplicate.

Protein glycosylation

To compare the catalytic efficiency of the wild-type ApNGT and the mutant Q469A, which is the most efficient N-glycosyltransferase obtained in this study (Fig. 3), in protein glycosylation, a soluble fragment corresponding to amino acids 1205–1536 of a mature HMW1 adhesin (HMW1ct) was employed as a protein acceptor. HMW1 mediates adherence to respiratory epithelial cells and undergoes heavy N-glycosylation at 31 different sites (18). HMW1ct carries 12 Asn-X-Ser/Thr sequons (Fig. 4A), and 3 of them (AANVTP5LNTTP6GT and NINATSGP7) have been identified as the modification sites of ApNGT (19). We first assessed whether the wild-type ApNGT and the mutant Q469A are capable of modifying all the potential N-glycosylation sites. As shown in Fig. 5, A–D, the mutant Q469A catalyzed much more productive glycosylation as expected. However, uniform glycoform could not be produced by both isoforms no matter by raising the ratio of enzyme to acceptor protein or elongating the reaction time.

Figure 4.

Figure 4.

Twelve Asn-X-Ser/Thr sequons of HMW1ct acceptor (A) and the corresponding glycosylation reactions catalyzed by the mutant Q469A (B). Site-directed mutagenesis was used to generate 12 mutants (P1-P12) from a previously synthesized template carrying 12 inactive sequons with the Asn substituted by Ala. Each mutant has a single Asn-X-Ser/Thr sequon. Protein glycosylation was analyzed by MALDI-TOF mass spectrometry. Acceptor proteins and the potential products were labeled with gray lines and colored lines, respectively. The matrix adduct ion appears to be the addition of a matrix molecule to the protein ions with an accompanying loss of water (37, 38).

Figure 5.

Figure 5.

Mass spectrometry analysis of protein (HMW1ct) glycosylation catalyzed by the wild-type ApNGT (red line) or the mutant Q469A (blue line). Various molar ratios of acceptor protein (HMW1ct possessing 12 Asn-X-Ser/Thr sequons (A, B, C, and D) or engineered HMW1ct with 7 (E) or 10 (F) Asn-X-Ser/Thr sequons) to enzyme (120:1, 30:1, or 10:1) were used in glycosylation reactions, which were incubated for 1 or 12 h. The numbers of added N-glucoses were labeled. The sequences of optimized N-glycosylation sites (P4, P10, and P12) are QVNLTAQ, NANVTGS, and EANVTIT, respectively. a.u., absorbance units.

To find out which sites are incompetent, site-directed mutagenesis was used to generate 12 mutants from a previously synthesized template that carries 12 inactive sequons with the Asn substituted by Ala. Each mutant bears a single Asn-X-Ser/Thr sequon. Finally, we detected five incompetent sequons (EANVTSAP1, QVNLSAQP4, NANGSGSP10, RVNITGDP11, and EPNNTITP12). As shown in Fig. 4B, mutant acceptors carrying P1, P4, P11, and P12 sequons were not glycosylated by the mutant Q469A at all, although that carrying P10 was partially glycosylated. Because charged residues are found in P1 (Glu at the −2 position) and P11 (Arg and Asp at the −2 and +4 positions, respectively) sites, the residue preference at these two distal positions (−2 and +4) was further explored in a multipeptide one-pot manner as described above. As shown in the supplemental Fig. S3, significant inhibition of charged, aromatic, and bulky uncharged residues was not observed at −2 and +4 positions for the Q469A mutant. According to this discovery, the inefficiency of P1 (EANVTSA) and P11 (RVNITGD) might lead from some undesired secondary structures. By contrast, proline, and asparagine and glycine are disfavored at the −1 and +1 positions, respectively, by both the mutant Q469A and the wild-type enzyme (Fig. 1, A and B). Also, the threonine has been reported to be much preferred (>10-fold) by ApNGT, as opposed to serine, in position +2 (7). Thus, the potential N-glycosylation sites of P4 (QVNLSAQ), P10 (NANGSGS), and P12 (EPNNTIT) could be further optimized to enhance the glycosylation level.

After the five incompetent sites (P1, P4, P10, P11, and P12) were eliminated by converting the Asn of Asn-X-Ser/Thr sequons to Ala, the Q469A-catalyzed glycosylation yielded a uniform glycoprotein as shown in Fig. 5E. We next optimized the sequons of P4, P10, and P12 by replacing five disfavored residues with the preferred ones (QVNLTAQP4′, NANVTGSP10′, and EANVTITP12′) in which and P1 and P11 remained inactive. Ultimately, a homogeneous glycoprotein carrying up to 10 N-glucoses was produced by using the mutant Q469A (Fig. 5F). In contrast, the wild-type ApNGT could not fulfill the uniform glycosylation for proteins bearing multiple N-glycosylation sequons. In comparison with Q469A, several mutants, such as Q469G and Q469V, also exhibited analogous efficiency against HMW1ct beyond relatively flexible acceptor selectivity (supplemental Fig. S4 and Table 1).

Donor specificity of the mutant Q469A

Although Gln-469 structurally positions close to the UDP moiety and has been reported to make hydrogen-bond interactions with one phosphate group of UDP (via a water molecule) (8), the mutant Q469A possesses quite similar donor specificity to the wild-type ApNGT (active to both UDP-Glc and UDP-Gal but not their acetylated derivatives; data not shown).

Residue preference feature of known N-glycoproteins in the PDB database

Based on the amino acid composition analysis of a set of 2250 non-redundant N-glycan sites from 1433 glycoproteins in the Protein Data Bank crystallographic database (20), charged residues were found in quite low abundance at the positions adjacent to the occupied Asn (−1 and +1), although a strong preference for any single residue was not observed (Fig. 6). Because the mutant Q469A also dislikes sequons harboring charged residues at −1 and +1 positions, the most known N-glycan sites could be well-tolerated by Q469A.

Figure 6.

Figure 6.

Amino acid preferences in occupied N-glycan sites. 1433 glycoproteins containing N-linked GlcNAc were retrieved from the PDB database by using a web-based glycan searching tool (Glycan Modeling and Stimulation). Total 2250 non-redundant N-glycan sites derived from these glycoproteins were identified by using pdb2linucs (39) and used to generate the sequence logo with WebLogo (40). Neighboring residues located downstream (positions +1 to +5) and upstream (positions −1 to −3) from the occupied asparagine residue (position 0) are shown. The size of each letter represents the residue prevalence at the putative position.

Discussion

In this study the systematic investigation of position-specific residue preference within and around the N-glycan sequon indicated that ApNGT has strict specificity for residues especially at the −1 and +1 positions. The optimal N-glycosylation motif recognized by ApNGT (ANVTK) is unexpectedly similar to a previously reported one (PYNVTK) preferred by an archaeal OST from Pyrococcus furiosus particularly at positions +1 and +3 (21). Because the NGT involved N-glycosylation system has been proposed to evolved independently (22), it remains mysterious how ApNGT and the archaeal OST have such a similar residue preference feature. More interestingly, NGTs from either β- or γ-proteobacteria seem not to evolve independently of each other, suggesting that horizontal gene transfer might occur between the two classes (supplemental Fig. S5).

Gln-469, whose side chain sways into the UDP-binding pocket and forms the narrow neck of the acceptor-binding funnel with His-277, was herein established to be involved in determining the acceptor specificity of enzyme. The introduction of small hydrophobic residues at Gln-469, such as Gly, Ala, and Val, not only weakened the stringency of ApNGT but also contributed to an enormous improvement of glycosylation efficiency against both peptides and proteins. A plausible hypothesis is that the substitutions can facilitate the accommodation of peptides/proteins into the active site and thereby enhanced the turnover rate constant. The deduction can also be supported by the observation that a very short peptide (SNLTY) was well-glycosylated by the mutant Q469A instead of the wild-type ApNGT (supplemental Fig. S2, C and D). On the other hand, it is not difficult to understand why most ApNGT variants including the wild-type enzyme and even Q469F and Q469Y can well accept a peptide (GGANVTKTIER) containing the optimal motif ANVTK with small residues Ala and Val at the −1 and +1 positions, respectively (Fig. 3). In contrast to bacterial OSTs, which regulate acceptor specificity by requiring an additional acidic residue at −2 position, ApNGT evolved (relatively strict) selectivity by recruiting a critical residue (Gln-469) within the catalytic site. In addition, the introduction of variant residues at Gln-469 of ApNGT is structurally feasible, but exchange of the loop (Lβ10α19) with the corresponding fragment of XcOGT resulted in a completely inactive protein (data not shown), suggesting that the unique loop might be required for substrate binding and/or structure maintaining.

Further inspection of the potential acceptor-binding site of ApNGT revealed that residues of Phe-39 and His-272 introduce bulky bumps in the van der Waals surface of the funnel-shaped groove. Although the “bump-and-hole” strategies have been successfully applied for retuning the selectivity of proteins and protein-ligand engineering (2325), replacement of them with a smaller alanine was previously demonstrated to decrease the enzyme activity to various extents (8).

Naturally occurring N-glycoproteins usually exhibit glycoform heterogeneity with respect to N-glycan sequon occupancy (macroheterogeneity) and glycan structure (microheterogeneity) (2628). In recent years great efforts have been made for preparation of glycan-defined glycoproteins including in vivo glycoengineering and in vitro chemical or chemoenzymatic synthesis (2834). Additionally, introducing extra N-linked glycosylation sites into therapeutic proteins, such as erythropoietin, has also been proven to improve their serum half-life and therapeutic efficiency (35). In this study the engineered Q469A (the most efficient mutant) has been proven to efficiently modify proteins with multiple N-glycosylation sites and, therefore, presents a promising biocatalyst for potentially addressing the issue of glycoprotein macroheterogeneity. Meanwhile, sequon-optimization was also demonstrated to be a useful strategy for the generation of uniform glycoprotein. Moreover, the relatively extended acceptor selectivity of the mutant Q469A would allow fewer alterations toward acceptor proteins for reaching uniform glycosylation.

On the other hand, inaccessible Asn-X-Ser/Thr sequon or nearby exceptional secondary structures will obviously inhibit the glycosylation. For example, incompetent P1 and P11 sites probably resulted from structural elements because no disfavored residues are found at −1, +1, and +3 positions. Therefore, known the three-dimensional structure would greatly benefit the N-glycosylation site engineering either for modifying an existing Asn-X-Ser/Thr sequon or inserting a new one.

Taken together, Gln-469, which is conserved in NGTs of subgroup 2 (supplemental Fig. S6), was demonstrated to be a critical amino acid for ApNGT to maintain its stringency. Introduction of small hydrophobic residues at the site has created a series of efficient mutants with expanded selectivity on Asn-X-Ser/Thr sequon. Because signals that control the efficiency of N-linked glycosylation at individual Asn residues have not been fully defined, efficient N-glycosylation machines, such as the mutant Q469A, are always valuable for either in vivo or in vitro glycosylation methods. Additionally, this is the first report that homogeneous glycoprotein carrying multiple (up to 10) N-glycans were in vitro enzymatically prepared.

Experimental procedures

Materials

UDP-Glc and UDP-Gal used were purchased from Sigma. UDP-GlcNAc and UDP-GalNAc were enzymatically synthesized as previously described (36).

Cloning and mutant construction

apNGT was cloned from genomic DNA of A. pleuropneumoniae ATCC 27088D (purchased from ATCC, Manassas, VA) and then cloned into pET15b. hMW1ct and its isoform bearing 12 inactive Asn-X-Ser/Thr sequons with the Asn substituted by Ala were codon-optimized (for the E. coli expression system) and synthesized by Genewiz (Suzhou, China). Mutagenesis was performed using a fast site-directed mutagenesis kit (TransGen, Beijing, China). All mutations were confirmed by DNA sequencing.

Expression and purification of recombinant proteins

The E. coli BL21 (DE3) strains, transformed with the corresponding expression plasmids, were cultured in LB medium supplemented with ampicillin (100 μg ml−1). Overexpression of the target protein was achieved by inducing the E. coli culture with 0.1 mm isopropyl-β-d-thiogalactopyranoside (IPTG) when the A600 reached 0.8 followed by incubating at 18 °C for 20 h. The cells were harvested by centrifugation at 6000 × g for 10 min, resuspended in 100 mm Tris-HCl buffer (pH 8.0), and broken by sonication. After centrifugation (13,523 × g, 20 min) of the cell lysate, the supernatant was applied onto a Ni2+-nitrilotriacetic acid affinity column (Qiagen, Hilden, Germany). The column was pre-equilibrated with 8 column volumes of binding buffer (5 mm imidazole, 0.5 m NaCl, and 20 mm Tris-HCl (pH 8.0)) before the lysate was loaded. After washing with 8 column volumes of washing buffer (50 mm imidazole, 0.5 m NaCl, and 20 mm Tris-HCl (pH 8.0)), the target protein was eluted with elution buffer (200 mm imidazole, 0.5 m NaCl, and 20 mm Tris-HCl (pH 8.0)). Elution fractions were analyzed by 12% SDS-PAGE and Coomassie staining. The concentration of purified enzymes was obtained using a BCA Protein Assay kit (Pierce) with bovine serum albumin as a protein standard.

Peptide synthesis

Wang resin (GL Biochem, Shanghai, China) (1 g) was swelled with DCM (10 ml) for 30 min. l-Arg (l-type arginine) (1 mmol eq), DIC (1 mmol), and 4-dimethylaminopyridine (1 mmol) in DMF (10 ml) were added to the resin and bubbled with N2 for 3 h. Then pyridine (1 ml) and acetic anhydride (1 ml) were added and removed after 30 min. The resin was washed with DCM and DMF. The protecting group Fmoc was removed subsequently by 20% piperidine in DMF (monitored by a ninhydrin test). The second amino acid l-Glu (3 mmol), 1-hydroxybenzotriazole (3 mmol), and DIC (3 mmol) were added to the resin and bubbled with N2 for 1 h. The resin was washed with DCM and DMF (monitored by the ninhydrin test; no blue appeared), whereupon the remaining amino acids were added sequentially repeating the above steps. Acp (6-aminocaproic acid) (3 mmol), 1-hydroxybenzotriazole (3 mmol), and DIC (3 mmol) in DMF were added and bubbled with N2 for 1 h. Then FITC (3 mmol) and ethyldiisopropylamine (3 mmol) in DMF were added and bubbled with N2 for 4 h (monitored by the ninhydrin test; no blue appeared). At last, 95% TFA in water was added and oscillated for 2 h to give the crude peptide product. Purification by preparative HPLC afforded target compound (purity ≥98%) following by lyophilization to give a pale yellow solid powder ultimately.

Enzymatic assays for glycosylation of peptides and proteins

For systematic investigation of position-specific residue preference within and surrounding N-glycan sequon, assays were performed in a total volume of 30 μl containing 1 mm concentrations of various peptides, 20 mm UDP-Glc, 9.2 μm ApNGT (wild type), or a 0.4 μm concentration of mutant Q469A and buffered with 50 mm Tris-HCl (pH 8.0) at 37 °C for 45 min. Reactions were then stopped by the addition of 20 μl of glycine stop solution (pH 10.6) and analyzed by reverse phase HPLC (RP-HPLC, Phenomenex Aeris PEPTIDE 3.6 μm XB-C18, 250 × 4.6-mm column, gradient elution from 14–42% acetonitrile over 30 min). Glycosylated peptides were further identified by mass spectrometry, which was performed on an Autoflex III TOF/TOF200 mass spectrometer (Bruker Daltonics, Leipzig, Germany). Apparent kinetic parameters were obtained by varying the concentration of FITC-labeled peptide from 5 to 80 μm and keeping the UDP-glucose concentration fixed (20 mm). The FITC-labeled peptide/glycopeptide was detected by excitation at 495 nm and emission at 520 nm. All assays were carried out in duplicate, and the data were fitted into the Michaelis-Menten equation using Grafit 6.0 (Erithacus Software Ltd). Multipeptide one-pot assays were carried out in 50 mm Tris-HCl (pH 8.0) containing a 1 mm mixture of various peptides, 20 mm UDP-Glc, and 1.4 μm enzyme at 37 °C for 45 min. The glycosylated product profile was analyzed by MALDI-TOF mass spectrometry. 10 mg ml−1 α-cyano-4-hydroxy-cinnamic acid (Sigma) was used as the matrix and premixed with samples in a ratio of 1:1 (v/v).

The assays for glycosylation of proteins were performed in a total volume of 200 μl containing 12.8 μm various HMW1ct derivatives, 5 mm UDP-Glc, appropriate amounts of ApNGT (WT), or the mutant Q469A and buffered with 50 mm Tris-HCl (pH 8.0). Glycosylated products were analyzed by mass spectrometry, which was performed on an AutoflexIII LRF200-CID mass spectrometer (Bruker Daltonics). Samples were concentrated using centrifugal ultrafiltration devices (Amicon® Ultra 0.5 ml Centrifugal Filters, Merck-Millipore). 10 mg ml−1 sinapinic acid (Sigma) was used as the matrix and premixed with samples in a ratio of 1:1 (v/v).

Author contributions

Q. S. and Z. W. conducted most of the experiments and analyzed the results. Y. F., W. S., and P. Z. performed and analyzed the experiments shown in Fig. 4. L. W., F. W., and Y. X. conducted the experiments shown in Fig. 6. P. G. W. and J. C. conceived the idea for the project and wrote the paper.

Supplementary Material

Supplemental Data

Acknowledgment

We acknowledge Y. Liang for assistance in mass spectrometry analysis.

This work was supported by Natural Science Foundation Grants 21372130 and 81302682. The authors declare that they have no conflicts of interest with the contents of this article.

4
The abbreviations used are:
OST
oligosaccharyltransferase
NGT
N-glycosyltransferase
ApNGT
A. pleuropneumoniae NGT
XcOGT
X. campestris OGT
DMF
N,N-dimethylformamide
DIC
N,N′-diisopropylcarbodiimide.

References

  • 1. Dell A., Galadari A., Sastre F., and Hitchen P. (2010) Similarities and differences in the glycosylation mechanisms in prokaryotes and eukaryotes. Int. J. Microbiol. 2010, 148178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Abu-Qarn M., Eichler J., and Sharon N. (2008) Not just for Eukarya anymore: protein glycosylation in bacteria and archaea. Curr. Opin. Struct. Biol. 18, 544–550 [DOI] [PubMed] [Google Scholar]
  • 3. Burda P., and Aebi M. (1999) The dolichol pathway of N-linked glycosylation. Biochim. Biophys. Acta 1426, 239–257 [DOI] [PubMed] [Google Scholar]
  • 4. Grass S., Lichti C. F., Townsend R. R., Gross J., and St Geme J. W. 3rd. (2010) The Haemophilus influenzae HMW1C protein is a glycosyltransferase that transfers hexose residues to asparagine sites in the HMW1 adhesin. PLoS Pathog. 6, e1000919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. McCann J. R., and St Geme J. W. 3rd. (2014) The HMW1C-like glycosyltransferases: an enzyme family with a sweet tooth for simple sugars. PLoS Pathog. 10, e1003977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Fleckenstein J. M., Roy K., Fischer J. F., and Burkitt M. (2006) Identification of a two-partner secretion locus of enterotoxigenic Escherichia coli. Infect. Immun. 74, 2245–2258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Naegeli A., Michaud G., Schubert M., Lin C. W., Lizak C., Darbre T., Reymond J. L., and Aebi M. (2014) Substrate specificity of cytoplasmic N-glycosyltransferase. J. Biol. Chem. 289, 24521–24532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Kawai F., Grass S., Kim Y., Choi K. J., St Geme J. W. 3rd, and Yeo H. J. (2011) Structural insights into the glycosyltransferase activity of the Actinobacillus pleuropneumoniae HMW1C-like protein. J. Biol. Chem. 286, 38546–38557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Martinez-Fleites C., Macauley M. S., He Y., Shen D. L., Vocadlo D. J., and Davies G. J. (2008) Structure of an O-GlcNAc transferase homolog provides insight into intracellular glycosylation. Nat. Struct. Mol. Biol. 15, 764–765 [DOI] [PubMed] [Google Scholar]
  • 10. Clarke A. J., Hurtado-Guerrero R., Pathak S., Schüttelkopf A. W., Borodkin V., Shepherd S. M., Ibrahim A. F., and van Aalten D. M. (2008) Structural insights into mechanism and specificity of O-GlcNAc transferase. EMBO J. 27, 2780–2788 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Lomino J. V., Naegeli A., Orwenyo J., Amin M. N., Aebi M., and Wang L. X. (2013) A two-step enzymatic glycosylation of polypeptides with complex N-glycans. Bioorg. Med. Chem. 21, 2262–2270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Kowarik M., Young N. M., Numao S., Schulz B. L., Hug I., Callewaert N., Mills D. C., Watson D. C., Hernandez M., Kelly J. F., Wacker M., and Aebi M. (2006) Definition of the bacterial N-glycosylation site consensus sequence. EMBO J. 25, 1957–1966 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Gerber S., Lizak C., Michaud G., Bucher M., Darbre T., Aebi M., Reymond J. L., and Locher K. P. (2013) Mechanism of bacterial oligosaccharyltransferase: in vitro quantification of sequon binding and catalysis. J. Biol. Chem. 288, 8849–8861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Naegeli A., Neupert C., Fan Y. Y., Lin C. W., Poljak K., Papini A. M., Schwarz F., and Aebi M. (2014) Molecular analysis of an alternative N-glycosylation machinery by functional transfer from Actinobacillus pleuropneumoniae to Escherichia coli. J. Biol. Chem. 289, 2170–2179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Sinclair A. M., and Elliott S. (2005) Glycoengineering: the effect of glycosylation on the properties of therapeutic proteins. J. Pharm. Sci. 94, 1626–1635 [DOI] [PubMed] [Google Scholar]
  • 16. Mazola Y., Chinea G., and Musacchio A. (2011) Integrating bioinformatics tools to handle glycosylation. PLoS Comput. Biol. 7, e1002285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Cantarel B. L., Coutinho P. M., Rancurel C., Bernard T., Lombard V., and Henrissat B. (2009) The carbohydrate-active EnZymes database (CAZy): an expert resource for glycogenomics. Nucleic Acids Res. 37, D233–D238 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Gross J., Grass S., Davis A. E., Gilmore-Erdmann P., Townsend R. R., and St Geme J. W. 3rd. (2008) The Haemophilus influenzae HMW1 adhesin is a glycoprotein with an unusual N-linked carbohydrate modification. J. Biol. Chem. 283, 26010–26015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Choi K. J., Grass S., Paek S., St Geme J. W 3rd, Yeo H. J. (2010) The Actinobacillus pleuropneumoniae HMW1C-like glycosyltransferase mediates N-linked glycosylation of the Haemophilus influenzae HMW1 adhesin. PLoS ONE 5, e15888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., and Bourne P. E. (2000) The Protein Data Bank. Nucleic Acids Res. 28, 235–242 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Igura M., and Kohda D. (2011) Quantitative assessment of the preferences for the amino acid residues flanking archaeal N-linked glycosylation sites. Glycobiology 21, 575–583 [DOI] [PubMed] [Google Scholar]
  • 22. Schwarz F., and Aebi M. (2011) Mechanisms and principles of N-linked protein glycosylation. Curr. Opin. Struct. Biol. 21, 576–582 [DOI] [PubMed] [Google Scholar]
  • 23. Baud M. G., Lin-Shiao E., Cardote T., Tallant C., Pschibul A., Chan K. H., Zengerle M., Garcia J. R., Kwan T. T., Ferguson F. M., and Ciulli A. (2014) Chemical biology: a bump-and-hole approach to engineer controlled selectivity of BET bromodomain chemical probes. Science 346, 638–641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Clackson T., Yang W., Rozamus L. W., Hatada M., Amara J. F., Rollins C. T., Stevenson L. F., Magari S. R., Wood S. A., Courage N. L., Lu X., Cerasoli F. Jr, Gilman M., and Holt D. A. (1998) Redesigning an FKBP-ligand interface to generate chemical dimerizers with novel specificity. Proc. Natl. Acad. Sci. U.S.A. 95, 10437–10442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Doerr A. (2015) A bumpy, holey method to probe proteins. Nat. Methods 12, 14. [DOI] [PubMed] [Google Scholar]
  • 26. Spahn P. N., and Lewis N. E. (2014) Systems glycobiology for glycoengineering. Curr. Opin. Biotechnol. 30, 218–224 [DOI] [PubMed] [Google Scholar]
  • 27. Rudd P. M., and Dwek R. A. (1997) Glycosylation: heterogeneity and the 3D structure of proteins. Crit. Rev. Biochem. Mol. Biol. 32, 1–100 [DOI] [PubMed] [Google Scholar]
  • 28. Rich J. R., and Withers S. G. (2009) Emerging methods for the production of homogeneous human glycoproteins. Nat. Chem. Biol. 5, 206–215 [DOI] [PubMed] [Google Scholar]
  • 29. Grogan M. J., Pratt M. R., Marcaurelle L. A., and Bertozzi C. R. (2002) Homogeneous glycopeptides and glycoproteins for biological investigation. Annu. Rev. Biochem. 71, 593–634 [DOI] [PubMed] [Google Scholar]
  • 30. Wang P., Dong S., Shieh J. H., Peguero E., Hendrickson R., Moore M. A., and Danishefsky S. J. (2013) Erythropoietin derived by chemical synthesis. Science 342, 1357–1360 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Yang Z., Wang S., Halim A., Schulz M. A., Frodin M., Rahman S. H., Vester-Christensen M. B., Behrens C., Kristensen C., Vakhrushev S. Y., Bennett E. P., Wandall H. H., and Clausen H. (2015) Engineered CHO cells for production of diverse, homogeneous glycoproteins. Nat. Biotechnol. 33, 842–844 [DOI] [PubMed] [Google Scholar]
  • 32. Wu Z., Jiang K., Zhu H., Ma C., Yu Z., Li L., Guan W., Liu Y., Zhu H., Chen Y., Li S., Li J., Cheng J., Zhang L., and Wang P. G. (2016) Site-directed glycosylation of peptide/protein with homogeneous O-linked eukaryotic N-glycans. Bioconjug. Chem. 27, 1972–1975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Izumi M., Makimura Y., Dedola S., Seko A., Kanamori A., Sakono M., Ito Y., and Kajihara Y. (2012) Chemical synthesis of intentionally misfolded homogeneous glycoprotein: a unique approach for the study of glycoprotein quality control. J. Am. Chem. Soc. 134, 7238–7241 [DOI] [PubMed] [Google Scholar]
  • 34. Schwarz F., Huang W., Li C., Schulz B. L., Lizak C., Palumbo A., Numao S., Neri D., Aebi M., and Wang L. X. (2010) A combined method for producing homogeneous glycoproteins with eukaryotic N-glycosylation. Nat. Chem. Biol. 6, 264–266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Elliott S., Egrie J., Browne J., Lorenzini T., Busse L., Rogers N., and Ponting I. (2004) Control of rHuEPO biological activity: the role of carbohydrate. Exp. Hematol. 32, 1146–1155 [DOI] [PubMed] [Google Scholar]
  • 36. Muthana M. M., Qu J., Li Y., Zhang L., Yu H., Ding L., Malekan H., and Chen X. (2012) Efficient one-pot multienzyme synthesis of UDP-sugars using a promiscuous UDP-sugar pyrophosphorylase from Bifidobacterium longum (BLUSP). Chem. Commun. (Camb) 48, 2728–2730 [DOI] [PubMed] [Google Scholar]
  • 37. Beavis R. C., and Chait B. T. (1990) High-accuracy molecular mass determination of proteins using matrix-assisted laser desorption mass spectrometry. Anal. Chem. 62, 1836–1840 [DOI] [PubMed] [Google Scholar]
  • 38. Beavis R. C., and Chait B. T. (1989) Cinnamic acid derivatives as matrices for ultraviolet laser desorption mass spectrometry of proteins. Rapid Commun. Mass Spectrom. 3, 432–435 [DOI] [PubMed] [Google Scholar]
  • 39. Lütteke T., Frank M., and von der Lieth C. W. (2004) Data mining the protein data bank: automatic detection and assignment of carbohydrate structures. Carbohydr. Res. 339, 1015–1020 [DOI] [PubMed] [Google Scholar]
  • 40. Crooks G. E., Hon G., Chandonia J. M., and Brenner S. E. (2004) WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES