Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Aug 20.
Published in final edited form as: Chem Biol. 2015 Jul 16;22(8):1052–1062. doi: 10.1016/j.chembiol.2015.06.017

Enhanced aromatic sequons increase oligosaccharyltransferase glycosylation efficiency and glycan homogeneity

Amber N Murray 1,2,6, Wentao Chen 1,2,6, Aristotelis Antonopoulos 3, Sarah R Hanson 1, R Luke Wiseman 2,4, Anne Dell 3, Stuart M Haslam 3, David L Powers 5, Evan T Powers 1,*, Jeffery W Kelly 1,2,*
PMCID: PMC4546532  NIHMSID: NIHMS705618  PMID: 26190824

SUMMARY

N-glycosylation plays an important role in protein folding and function. Previous studies demonstrate that a phenylalanine residue introduced at the n-2 position relative to an Asn-Xxx-Thr/Ser N-glycosylation sequon increases the sequon’s glycan occupancy in insect cells. Here, we show that any aromatic residue at n-2 increases glycan occupancy in human cells, and we show that this effect is dependent upon oligosaccharyltransferase substrate preferences rather than differences in other cellular processing events such as degradation or trafficking. Moreover, aromatic residues at n-2 alter glycan processing in the Golgi, producing proteins with less complex N-glycan structures. These results demonstrate that manipulating the sequence space surrounding N-glycosylation sequons is useful both for controlling glycosylation efficiency, thus enhancing glycan occupancy, and for influencing the N-glycan structures produced.

Graphical Abstract

graphic file with name nihms705618u1.jpg

INTRODUCTION

N-glycosylation is a prevalent co- and/or post-translational modification of proteins that traverse the cellular secretory pathway (Apweiler, et al., 1999). N-glycans play crucial roles in protein homeostasis, modulating protein stability (Culyba, et al., 2011; Hanson, et al., 2009; Hebert, et al., 2014; Imperiali and Rickert, 1995; Joao and Dwek, 1993; Price, et al., 2012; Wang, et al., 1996; Wormald and Dwek, 1999), chaperone-mediated folding (Hammond, et al., 1994; Hebert, et al., 2014; Helenius and Aebi, 2001; Jitsuhara, et al., 2002; Oliver, et al., 1997; Ou, et al., 1993; Schmaltz, et al., 2011; Ware, et al., 1995), vesicular trafficking (Lu, et al., 1997; Martina, et al., 1998), and endoplasmic reticulum-associated degradation (ERAD) (Parodi, 2000; Vembar and Brodsky, 2008). N-glycans also strongly influence biological function, including immune/inflammatory responses and circulating glycoprotein clearance, wherein carbohydrate binding proteins (lectins) recognize specific glycoforms of surface glycoproteins (Ahrens, 1993; Bertozzi and Kiessling, 2001; Chui, et al., 2001; Sorensen, et al., 2012).

During N-glycosylation, the heteromeric oligosaccharyltransferase enzyme complex (OST) in the lumen of the endoplasmic reticulum (ER) catalyzes the transfer of the Glc3Man9GlcNAc2 oligosaccharide from a dolichol phosphate donor to the side-chain amide nitrogen of an acceptor Asn (Kornfeld and Kornfeld, 1985). Proteins translocated into the ER are scanned by OST for the N-glycosylation sequon Asn-Xxx-Ser/Thr (or, rarely, Asn-Xxx-Cys), where Xxx is any residue but Pro, and upon recognition of this sequon, the oligosaccharide is transferred to Asn either co- or post-translationally (Bas, et al., 2011; Ruiz-Canada, et al., 2009). Removal of two terminal glucoses from the N-glycan by ER glucosidases allows N-glycoproteins to engage the lectin chaperones calnexin and/or calreticulin, promoting proper folding (Caramelo and Parodi, 2008). Once folded, N-glycoproteins are packaged into vesicles and proceed to the Golgi, where they can interact with N-glycan processing enzymes that trim and elaborate glycans, affording a complex array of glycoforms (Lowe and Marth, 2003; Nairn, et al., 2008).

It is estimated that one-third of sequons are not occupied by N-glycans (Apweiler, et al., 1999; Petrescu, et al., 2004; Surleac, et al., 2012). It is not known whether signals exist in a protein’s structure that discourage OST from glycosylating proteins at a site that would render them unfoldable. Similarly, it is not known whether cues exist in a protein’s primary or secondary structure that increase OST’s glycosylation efficiency for sequons in β-turns, a context that can lead to an increase in protein stability (Culyba, et al., 2011; Hanson, et al., 2009; Price, et al., 2011).

We have previously shown that introduction of a Phe residue at the n-2 position relative to the Asn of a sequon stabilizes glycosylated Rattus norvegicus cluster of differentiation 2 adhesion domain (CD2ad), Homo sapiens muscle acylphosphatase, and Homo sapiens Pin1 WW domain (Culyba, et al., 2011; Hanson, et al., 2009; Price, et al., 2011). This stabilization is due to protein-carbohydrate interactions driven largely by dispersion forces between Phe, the N-glycan, and Thr in the context of a reverse turn (Chen, et al., 2013). These forces create an enhanced aromatic sequon (EAS) structural motif that can be incorporated into glycosylation-naïve proteins to confer glycosylation-dependent native state stabilization (Culyba, et al., 2011). Phe at n-2 also increases glycan occupancy in CD2ad and muscle acylphosphatase secreted from Sf9 insect cells (Culyba, et al., 2011), although it was not known whether this results from decreased ERAD, increased folding and trafficking, or increased OST glycosylation efficiency. Statistical analysis of the primary sequence surrounding sequons indicates an increased occurrence of aromatic residues immediately preceding occupied sequons as compared to unoccupied sequons (Petrescu, et al., 2004; Surleac, et al., 2012), suggesting a potential OST preference for aromatic residues at n-2. Moreover, the residue at n-2 must be Asp or Glu for bacterial OST to glycosylate a sequon, demonstrating that this position can influence glycosylation efficiency (Kowarik, et al., 2006; Wacker, et al., 2002).

Here, we report that human OST preferentially glycosylates substrates containing EASs. We express the rat ortholog of CD2ad, a protein that is natively glycosylation naïve, without or with an EAS containing Phe, His, Tyr, or Trp at n-2 in human cells and observe increased glycan occupancy for all EASs in this human cell line. We also express a protein, transthyretin (TTR), with a native sequon that is poorly glycosylated and observe an increase in glycan occupancy after introduction of Phe at n-2. To directly test OST’s preferences for Phe at n-2, and to remove any contributions of cellular processing differences between differentially-glycosylated proteins, we express tandem repeats of two different N-glycoproteins, CD2ad and fibroblast growth factor 9 (FGF9), each as a single polypeptide chain. These tandem repeat sequences show that, for both N-glycoproteins, the EAS is a better OST substrate than the standard sequon within the same polypeptide chain. Lastly, we report that EASs alter glycan processing, with different residues at n-2 leading to different glycoform product ensembles. Thus, our results show that the sequence surrounding N-glycosylation sequons affects both OST-mediated glycosylation and glycosyltransferase-mediated glycan elaboration in the Golgi, suggesting that primary sequence manipulation can be used to alter glycoprotein structure and function.

RESULTS

EASs increase glycan occupancy of CD2ad

To probe the effect of aromatic residues at n-2 on glycan occupancy, we used the glycosylation-naïve rat ortholog of CD2ad as a model protein. Previously, we showed that an N-glycosylation sequon introduced into CD2ad’s type I β-bulge reverse turn is glycosylated in Sf9 insect cells, and we determined the folding energetics of its glycoforms (Culyba, et al., 2011; Hanson, et al., 2009). However, the effects of EASs on glycan occupancy in proteins expressed in human cells and the mechanisms involved are unknown. In HEK 293 cells, we expressed variants of CD2ad containing an N-terminal preprotrypsin leader sequence (PPTLS) to target the protein to the ER, an N-terminal hexahistidine tag preceding a FLAG tag, and an Asp67Thr mutation (to introduce an N-glycosylation sequon; Figure 1A). The sequence surrounding the glycosylation site of the wild-type variant (CD2-L) is Leu63Ala64Asn65Gly66Thr67, with Leu at n-2. We created variants with Phe (CD2-F), His (CD2-H), Trp (CD2-W), or Tyr (CD2-Y) at n-2 to determine whether these aromatic residues increase glycan occupancy.

Figure 1. Aromatic residues at n-2 decrease the amount of non-glycosylated CD2ad and increase the amount of secreted glycosylated CD2ad.

Figure 1

(A) CD2ad expression constructs, both CD2-L (Leu) and aromatic (Aro) variants, allow for targeting to the secretory pathway via the PPTLS and Western blot analysis via the FLAG tag. (B) Anti-FLAG Western blots of HEK 293 cell lysate and media display bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) CD2ad variants. A faint nonspecific band is detected by anti-FLAG in the cell lysate, as seen in the EGFP negative control lanes. (C) Quantification of Western blots shows variants with EASs display less non-glycosylated protein in cell lysate and media compared to CD2-L. Band intensities are normalized to that of the CD2-L + glycan band. Solid bars quantify glycosylated protein, and open bars quantify non-glycosylated protein. Error bars indicate standard deviations from three biological replicates; ** denotes P < 0.01. See also Figure S1.

HEK 293 cells were transfected with expression vectors for CD2ad variants, and cell lysates and media were harvested three days post-transfection. For each sample, cell lysate (25 μg total protein) was subjected to SDS-PAGE and visualized by anti-FLAG Western blotting. Cell media samples were analyzed similarly, where volumes loaded per lane were normalized to corresponding cell lysate protein concentrations (Figure 1B). The band intensities for glycosylated (green arrows) and non-glycosylated (blue arrows) CD2ad variants were quantified and normalized to the + glycan band of CD2-L (Figure 1C). In cell lysates, the + glycan band (solid bars) is not significantly affected by aromatic residues at n-2. However, the − glycan band (open bars) decreases for all aromatic variants. In media samples, there is significantly more + glycan band for all EAS-containing proteins and significantly less − glycan band for all EAS-containing proteins than for CD2-L. To ensure that glycosylated and non-glycosylated species were correctly identified, all variants were digested with PNGase F to remove N-glycans, causing the + glycan bands to decrease and the − glycan bands to increase (Figure S1). These results suggest that incorporating any aromatic residue at n-2 can increase glycan occupancy of proteins expressed in human cells.

EASs increase glycan occupancy of D18G transthyretin

We next attempted to boost the glycan occupancy of a poorly glycosylated native sequon by introducing Phe at n-2. Transthyretin (TTR) contains one native sequon in a type IV β-turn, Asn98Asp99Ser100, which is poorly glycosylated, owing both to Asp at n+1 and Ser at n+2 (Kasturi, et al., 1997). D18G TTR is an amyloidogenic disease-associated mutant that is inefficiently secreted and subject to ERAD (Hammarstrom, et al., 2003; Sekijima, et al., 2005). This variant has been shown to be post-translationally glycosylated by OST to prevent aggregation and to facilitate degradation (Sato, et al., 2012).

We expressed FLAG-tagged variants of D18G TTR in HEK 293 cells, one with the wild type sequence Thr96Ala97Asn98Asp99Ser100 and one with Phe at n-2 (T96F). We then performed pulse-chase experiments in the presence of 10 μM lactacystin, to minimize proteasomal degradation effects. After a 20-min pulse and a 3-h chase, cell lysates and media were harvested, and 35S-labeled D18G TTR variants were immunoprecipitated with anti-FLAG affinity resin and quantified by phosphorimaging. D18G T96F glycan occupancy is twice that of D18G (0.12 ± 0.01 versus 0.06 ± 0.01) (Figure 2A–B). There is less total D18G than D18G T96F, indicating that proteasomal inhibition may be incomplete over the duration of the experiment or that D18G is being degraded in another way (Figure S2A). PNGase F digestion of samples causes the + glycan bands (green arrow) to disappear, indicating that this band corresponds to glycosylated TTR (Figure S2B). These results suggest that incorporating an EAS can increase glycan occupancy of a poor native sequon.

Figure 2. Phe at n-2 of D18G TTR increases the glycan occupancy of its poorly-glycosylated native sequon.

Figure 2

(A) Autoradiogram of immunoprecipitated [35S]-labeled D18G TTR variants from HEK 293 cell lysates displays bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) variants. Pulse-chase was performed in the presence of 10 μM lactacystin, and samples were harvested after a 3 h chase. (B) Quantification of bands shows that Phe at n-2 increases D18G TTR glycan occupancy. Error bars indicate standard deviations from three biological replicates; ** denotes P < 0.01. See also Figure S2.

EASs increase glycan occupancy in single-chain tandem repeats of CD2ad

The enhanced glycan occupancy of the EAS in CD2ad and TTR could come not only from OST’s glycosylation preferences, but also from differential intracellular processing of the CD2ad variants. To determine more directly whether OST preferentially glycosylates EASs, we created single-chain tandem domain constructs that allow for expression of 2 CD2ad sequences within a single polypeptide chain. Because both CD2ad repeats are on the same polypeptide chain, they are unlikely to be differentially synthesized, trafficked, or degraded. Each construct contained a PPT leader sequence, a FLAG tag, a CD2ad sequence, a Tobacco Etch Virus (TEV) protease cleavage site, a hemagglutinin (HA) tag, and a second CD2ad sequence (Figure 3A). Quantitative comparison of the glycan occupancy of each CD2ad from the same polypeptide allowed us to determine the contribution of OST preference for particular sequons independent of other cellular processes. Four such constructs were created: L-F, where the N-terminal CD2ad has Leu at n-2 and the C-terminal CD2ad has Phe at n-2; F-L, where the N-terminal CD2ad has Phe at n-2 and the C-terminal CD2ad has Leu at n-2; L-L, where both domains have Leu at n-2; and F-F, where both domains have Phe at n-2.

Figure 3. Single-chain tandem CD2ad experiments confirm OST’s preference for the EAS.

Figure 3

(A) The tandem CD2ad cassette allows for expression of two CD2ad variants within a single polypeptide chain. Incubation with TEV protease cleaves the single-chain tandem protein into an N-terminal protein containing the FLAG epitope and the first CD2ad and a C-terminal protein containing the HA epitope and the second CD2ad. (B) Anti-FLAG Western blots of cell lysate and media display bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) N-terminal CD2ad. Anti-HA Western blots display bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) C-terminal CD2ad. (C) Quantification of Western blots of cleaved CD2ad shows that glycan occupancy is higher for domains containing Phe at n-2 than for domains containing Leu at n-2, independent of whether the Phe-containing domain is N-terminal or C-terminal. Uncorrected values are glycan occupancies calculated from raw gel band intensities, and corrected values are corrected for low TEV cleavage efficiency of non-glycosylated tandem protein, as explained in the text. Solid bars quantify glycan occupancy of the N-terminal CD2ad, and open bars quantify glycan occupancy of the C-terminal CD2ad. Error bars indicate standard deviations from three biological replicates; # denotes P = 0.05, * denotes P < 0.05, and ** denotes P < 0.01. See also Figure S3 and Table S1.

At three days post-transfection in HEK 293 cells, cell lysates and media were harvested and analyzed by Western blotting. When probed with either anti-FLAG antibody or anti-HA antibody, tandem proteins exhibit three bands, corresponding to species with two (red triangles), one (green triangles), or zero N-glycans (blue triangles) (Figure S3A). The disappearance of the higher two bands following PNGase F digestion, and the appearance of the lower band, confirms that the upper bands contain glycosylated tandem proteins (Figure S3B). Total intensity for all three bands was quantified for each sample to confirm that anti-FLAG (solid bars) and anti-HA (open bars) antibodies reported the same relative intensities (Figure S3C). In cell lysate samples, total tandem protein remains constant across constructs. However, total tandem protein in cell media samples differs, in that there is less L-L tandem protein and more F-F tandem protein (Figure S3C), which may reflect the known native-state stabilization provided by Phe at n-2 in CD2ad (Culyba, et al., 2011).

We next digested the tandem proteins in each cell lysate and each cell media sample using TEV protease. Cleaved samples were analyzed by Western blot, where anti-FLAG reports on the N-terminal CD2ad and anti-HA reports on the C-terminal CD2ad (Figure 3B). Following TEV cleavage, the ~28 kDa bands corresponding to uncleaved tandem proteins decrease in intensity, and new ~14 kDa bands appear corresponding to cleaved proteins (Figure S3D). Glycan occupancy is higher for the CD2ad with Phe at n-2 than for the CD2ad with Leu at n-2, regardless of whether the Phe-containing CD2ad is at the N- or C-terminus in cell lysate or in media samples (Figure 3C).

The decrease in intensity of uncleaved tandem proteins in lysate samples differs for the three ~28 kDa tandem protein bands, which correspond to doubly, singly, and non-glycosylated tandem proteins, indicating that the efficiency of TEV protease cleavage in lysate depends on the extent of glycosylation of the tandem proteins. TEV cleavage efficiency decreases with decreased glycosylation: doubly, singly, and non-glycosylated tandem proteins have cleavage efficiencies of ~70–80%, ~40–60%, and ~ 5–15%, respectively (Table S1). We do not yet understand the cause of this differential cleavage efficiency, but because each pair of domains is part of a single polypeptide chain before TEV cleavage (Figure 3A), direct comparisons of glycan occupancies between the N- and C-terminal fragments following TEV cleavage are still meaningful. The uneven TEV protease cleavage efficiencies for a given tandem protein construct in lysate lead to over-representation of the absolute sequon occupancies for both the N- and C-terminal domains because of the relatively high cleavage efficiency of the doubly glycosylated tandem proteins and the relatively low cleavage efficiency of the non-glycosylated tandem proteins. However, the effect on the relative glycan occupancies of the N- and C-terminal domains is small. Furthermore, this direct comparison is independent of differences in total protein concentrations between different constructs (e.g., that more F-F is present than L-L), because only domains cleaved from a single polypeptide chain are analyzed.

We address the differential TEV cleavage efficiency in lysate in two ways. First, we have attempted to correct the gel band intensities of the 14 kDa glycosylated and non-glycosylated N- and C-terminal domains by subtracting the excess band intensity due to the high cleavage efficiency of the doubly glycosylated tandem protein from the glycosylated band and adding the deficient band intensity due to the low cleavage efficiency of the non-glycosylated tandem protein to the non-glycosylated band (Table S1). These corrections lower the observed glycan occupancy by ~0.2–0.3 in each case, but the differences between the sequon occupancies of domains with Leu at n-2 vs. Phe at n-2 are maintained (Figure 3C, corrected values). Second, we have increased TEV cleavage efficiency by first immunoprecipitating tandem proteins from cell lysates using anti-FLAG affinity resin. The extent to which this increases the fraction of cleaved tandem protein is difficult to assess because of interference from the anti-FLAG light chain band, which is near the 28 kDa tandem protein bands (Figure S3E, asterisk). Nevertheless, quantification of the 14 kDa bands corroborates that the glycan occupancy is higher when Phe is the n-2 residue instead of Leu (cf. Figure S3F to Figure 3C).

EASs increase glycan occupancy in single-chain tandem repeats of fibroblast growth factor 9 (FGF9)

To understand whether the OST preference for EASs is CD2ad specific or more universal, we created analogous single-chain tandem constructs using Homo sapiens FGF9, a protein with a native EAS (Phe77Pro78Asn79Gly80Thr81) within a type I β-bulge turn conformation, similar to that of CD2ad (Culyba, et al., 2011) (Figure 4A). Initial experiments (data not shown) indicated that the Asn79-Gly80-Thr81 sequon for FGF9 domains exhibited nearly 100% glycan occupancy, irrespective of the presence or absence of Phe at the n-2 position. Therefore, the native Thr81 residue was mutated to Ser81. Modifying the sequon to the less efficiently glycosylated Asn-Gly-Ser decreased “baseline” glycan occupancy (Kasturi, et al., 1995; Petrescu, et al., 2004) of the sequon to allow for a more sensitive measurement of the effects of the EAS on glycan occupancy. Four such constructs were created: A-F, where the N-terminal FGF9 has Ala at n-2 and the C-terminal FGF9 has Phe at n-2; F-A, where the N-terminal FGF9 has Phe at n-2 and the C-terminal FGF9 has Ala at n-2; A-A, where the N-terminal FGF9 has Ala at n-2 and the C-terminal FGF9 has Ala at n-2; and F-F, where the N-terminal FGF9 has Phe at n-2 and the C-terminal FGF9 has Phe at n-2. As is common practice when evaluating the effect of an amino acid side chain on a process of interest, Ala was chosen as the n-2 residue in the domains without an EAS to limit any side chain contributions that might be introduced from other residues (Morrison and Weiss, 2001). For these tandem constructs, a V5 epitope precedes the N-terminal FGF9, and an HA epitope precedes the C-terminal FGF9.

Figure 4. Single-chain tandem FGF9 experiments confirm OST’s preference for the EAS.

Figure 4

(A) The tandem FGF9 cassette allows for expression of two FGF9 variants within a single polypeptide chain. Incubation with TEV protease cleaves the single-chain tandem protein into an N-terminal protein containing the V5 epitope and the first FGF9 and a C-terminal protein containing the HA epitope and the second FGF9. (B) Anti-V5 Western blots of cell lysate and media display bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) N-terminal FGF9. Anti-HA Western blots display bands corresponding to glycosylated (green arrow) and non-glycosylated (blue arrow) C-terminal FGF9. (C) Quantification of Western blots of cleaved FGF9 shows that glycan occupancy is higher for domains containing Phe at n-2 than for domains containing Leu at n-2 from cell lysates, independent of whether the Phe-containing domain is N-terminal or C-terminal. Glycan occupancy in cell lysates is also higher in general for the N-terminal FGF9 than for the C-terminal FGF9, as highlighted in the A-A and F-F controls, independent of which residue is at n-2. In cell media, the N-terminal domain’s glycan occupancy overshadows EAS effects. Solid bars quantify glycan occupancy of the N-terminal FGF9, and open bars quantify glycan occupancy of the C-terminal FGF9. Error bars indicate standard deviations from three biological replicates; * denotes P < 0.05 and ** denotes P < 0.01. See also Figure S4, Table S1, and Table S2.

At two days post-transfection in HEK 293T cells, cell lysates and media were harvested and analyzed by Western blotting, as before. When probed with either anti-V5 antibody or anti-HA antibody, tandem proteins exhibit three bands, corresponding to species with two (red triangles), one (green triangles), or zero N-glycans (blue triangles) (Figure S4A). The disappearance of the upper two bands following PNGase F digestion, and the appearance of the lower band, confirms that the upper bands contain glycosylated tandem proteins (Figure S4B). The amount of tandem protein expressed between variants differs, with less overall protein for the A-A and F-F variants than for the A-F and F-A variants. However, total band intensity for all three bands is the same whether reported by anti-V5 (solid bars) or anti-HA (open bars) for each tandem protein (Figure S4C).

Following TEV cleavage, the ~ 40 kDa bands corresponding to uncleaved tandem proteins decrease in intensity, and new ~20 kDa bands appear corresponding to cleaved proteins (Figure S4D). The A-A and F-F controls indicate that the N-terminal FGF9 has higher glycan occupancy than the C-terminal FGF9 (Figure 4C), which is likely because sequons close to the C-terminus are glycosylated relatively inefficiently (Ben-Dor, et al., 2004; Gavel and von Heijne, 1990; Shrimal, et al., 2013). Sequons near the C-terminus can be glycosylated post-translationally by only the STT3B-containing isoform of OST, whereas internal sequons can be glycosylated by both the STT3A- and STT3B-containing isoforms of OST (Shrimal, et al., 2013). Thus, the lower effective abundance of OST isoforms that are competent to glycosylate C-terminal sequons could explain their lower glycan occupancy. Nevertheless, in cell lysate samples, glycan occupancy is higher for the cleaved FGF9 with Phe at n-2 than for the cleaved FGF9 with Ala at n-2, regardless of whether the Phe-containing FGF9 domain is at the N- or C-terminus (Figure 4B–C). In cell media samples, the N-terminal glycosylation preference of OST overshadows EAS effects on sequon occupancy. These results reinforce our hypothesis that OST prefers EASs; however, they also suggest that EAS effects on glycan occupancy are context dependent.

To quantify the effect of EASs on glycan occupancy, independent of the effects of N- versus C-terminal domain placement, we fit the glycan occupancy data to linear models with an indicator variable for the effect of Phe at n-2, an indicator variable for the effect of C-terminal placement of the EAS, and an interaction term to determine whether these effects are additive or whether they modulate each other (see Supplemental Information for details). These models fit the data well and indicate that Phe at n-2 significantly increases glycan occupancy for FGF9 lysate and FGF9 media, CD2ad lysate, and CD2ad media. These models also indicate that C-terminal placement of the EAS significantly decreases glycan occupancy for FGF9, although this effect is not as clear in CD2ad. The results of the regression are summarized in Table S2.

EASs are selected for in proteins that can be N-glycosylated

Given that OST glycosylation efficiency is increased in EASs, a selection pressure may exist that favors aromatic amino acids at n-2 in proteins that can be N-glycosylated (i.e., proteins that traverse the secretory pathway). To evaluate this hypothesis, we compared the frequencies of amino acids in positions five residues immediately before and after Asn-Xxx-Ser/Thr sequences between proteins that can and cannot be N-glycosylated (“glycosylatable” vs. “non-glycosylatable” proteins). We defined glycosylatable proteins as those that have a Gene Ontology (GO) annotation indicating that they reside in the endoplasmic reticulum, the Golgi, the lysosome, the plasma membrane, or the extracellular space. We defined non-glycosylatable proteins as those that have a GO annotation indicating that they reside in the cytosol or the mitochondria. We excluded proteins present in both lists.

The final set of glycosylatable proteins comprises 3,076 proteins, the mean length of which is 562 amino acids. The sequences of these proteins include 9,770 Asn-Xxx-Ser/Thr sequences that are more than five residues away from the N- and C-termini, for an overall occurrence rate of 0.62 sequons per 100 amino acids. The final list of non-glycosylatable proteins consists of 3,198 proteins, the mean length of which is 575 amino acids.

These sequences include 7,766 Asn-Xxx-Ser/Thr sequences that are more than five residues away from the N- and C-termini, for an overall occurrence rate of 0.45 Asn-Xxx-Ser/Thr sequences per 100 amino acids. The overall frequencies of aromatic amino acids (Phe, Tyr, Trp, His) at all positions within the proteome are 0.109 and 0.097 for glycosylatable and non-glycosylatable proteins, respectively. The difference between the overall frequency of aromatic amino acids and the frequency of aromatic amino acids at positions near Asn-Xxx-Ser/Thr is shown in Figure 5. (Raw frequency data is shown in Figure S5.) For glycosylatable proteins, aromatic amino acids are enriched generally in positions N-terminal to potential sequons (n-5 to n-1). The P-values that result from testing the hypothesis that aromatic amino acids are more enriched at these positions in glycosylatable than in non-glycosylatable proteins are < 0.01 at all positions except for n-3, where there is no discernable enrichment of aromatic amino acids (Figure 5). This effect is especially strong at n-2, where the enrichment of aromatic amino acids is approximately fourfold higher in glycosylatable than in non-glycosylatable proteins (0.0346 ± 0.0036 vs. 0.0081 ± 0.0035). This result shows that, on a genome-wide scale, aromatic amino acids are favored at the n-2 position relative to Asn-Xxx-Ser/Thr sequences in glycosylatable proteins, consistent with the hypothesis that EASs are selected for to improve OST glycosylation efficiency.

Figure 5. Aromatic residues at n-2 are enriched in glycosylatable proteins compared to non-glycosylatable proteins.

Figure 5

The differential frequency of aromatic amino acids (frequency at given position – overall frequency) for glycosylatable proteins that traverse the secretory pathway (solid bars) and non-glycosylatable proteins in the cytosol/mitochondria (open bars) are plotted vs. their position relative to the Asn of N-glycosylation sequons. Error bars indicate standard deviations; * denotes P < 0.01 for a large sample, one-tailed test of the differences between binomial proportions. See also Figure S5.

EASs affect the N-glycan structures in CD2ad and FGF9

In addition to affecting OST glycosylation efficiency, aromatic amino acids at n-2 alter the SDS-PAGE banding patterns of glycosylated CD2ad harvested from cell media. Whereas glycosylated, secreted CD2-L exhibits one band, aromatic variants exhibit two bands (Figure 1B, Figure S6A), which collapse to the non-glycosylated size after digestion with PNGase F (Figure S6B). We hypothesized that residues at n-2 might influence the activities of Golgi enzymes involved in N-glycan processing, leading to different glycoform populations that have different gel banding patterns.

We analyzed the glycan composition of CD2-L, CD2-F, and CD2-H purified from HEK 293 media (Figure S6A) using MALDI-TOF-MS glycoform analysis (Ceroni, et al., 2008; Jang-Lee, et al., 2006). A multi-institutional assessment of glycomics methodologies demonstrated that MALDI-TOF-MS is a reliable method for relative quantification based on the signal intensities of permethylated glycans, especially when comparing signals over a small mass range within the same spectrum (Wada, et al., 2007). Therefore, while absolute quantification of candidate N-glycans by this method is not strictly possible, valuable quantitative information can be extracted from mass spectrometric data.

MALDI-TOF-MS demonstrated that CD2-L has a diverse array of glycoforms, including 15% oligomannose, 54% hybrid, and 31% complex N-glycans (Figure 6A top panel, B). In the EAS-containing CD2-F variant, glycoforms are far more homogeneous, comprising 92% oligomannose N-glycans (Figure 6A middle panel, B). This observation is consistent with previous work on the EAS-containing human ortholog of CD2ad, which has been shown to retain the oligomannose glycoforms when expressed in CHO cells (Recny, et al., 1992; Rudd, et al., 1999; Wyss, et al., 1995). CD2-H has a glycoform population of intermediate complexity, with 46% oligomannose, 39% hybrid, and 15% complex N-glycans (Figure 6A bottom panel, B).

Figure 6. Different residues at n-2 alter glycoform populations of secreted variants of CD2ad.

Figure 6

(A) MALDI-TOF mass spectra of permethylated N-glycans released by PNGase F digestion indicate that CD2-L N-glycans are highly heterogeneous and comprise a diverse set of oligomannose, hybrid, and complex structures. The heterogeneity is greatly reduced for CD2-F N-glycans, which are mainly oligomannose. The heterogeneity of CD2-H N-glycans is intermediate between that of CD2-L and that of CD2-F. All molecular ions are [M+Na]+. Putative structures are based on composition, tandem MS/MS and biosynthetic knowledge. Structures that show sugars outside of a bracket have not been unequivocally defined. (B) Comparison of N-glycan peak intensities confirms the reduction in complex glycoforms for CD2-F and CD2-H variants, compared to CD2-L. See also Figure S6.

Given that the MALDI-TOF-MS results indicate that CD2-F is more homogeneous than CD2-L, it is perhaps surprising that CD2-F runs as two bands on SDS-PAGE while CD2-L runs as a single band. To further understand these observations, we analyzed the gel-extracted upper and lower + glycan CD2-F gel bands by MALDI-TOF-MS. We found that the upper band contains multiple glycoforms (including oligomannose), whereas the lower band contains only oligomannose glycans (Figure S6C). Thus, it appears that the oligomannose glycoforms of our CD2 constructs co-migrate with the more processed hybrid and complex-type glycoforms up to a point, but when the oligomannose glycoforms are highly enriched—as they are in CD2-F—the excess runs as a separate band. To corroborate this notion, we treated purified samples of CD2-F, CD2-L, and CD2-H with α1-2,3 mannosidase (which catalyzes the hydrolysis of α1-2 and α1-3 linked mannose residues from N-glycans). This treatment changed the gel mobility of the lower CD2-F + glycan band, while the mobilities of all of the other CD2ad variant bands were insensitive to mannosidase cleavage (Figure S6B). While it is clear that the digestion of the N-glycans with mannosidase was not complete (since the mobilities of the oligomannose glycoforms in the upper bands did not change), this result nevertheless qualitatively corroborates the mass spectrometric findings that the oligomannose glycoforms are highly enriched in CD2-F.

To test whether EAS alteration of glycoform populations extends to other proteins, we used MALDI-TOF-MS to analyze the glycan composition of FGF9-A (with Ala at n-2) and FGF9-F (with Phe at n-2) secreted from HEK 293F cells. FGF9-A and FGF9-F were expressed as fusions to non-glycosylated IgG Fc to allow affinity purification using Protein A (Figure S7). FGF9-A has a more diverse array of glycoforms, including 6% oligomannose, 2% hybrid, and 92% complex N-glycans, whereas the EAS-containing FGF9-F variant has less glycoform heterogeneity, with 1% oligomannose, 21% hybrid, and 78% complex N-glycans (Figure 7A–B).

Figure 7. Different residues at n-2 alter glycoform populations of secreted variants of FGF9-Fc fusions.

Figure 7

(A) MALDI-TOF mass spectra of permethylated N-glycans released by PNGase F digestion indicate that FGF9-A N-glycans are more complex than FGF9-F N-glycans. All molecular ions are [M+Na]+. Putative structures are based on composition, tandem MS/MS and biosynthetic knowledge. Structures that show sugars outside of a bracket have not been unequivocally defined. (B) Comparison of N-glycan peak intensities confirms the reduction in complex glycoforms for FGF9-F compared to FGF9-A. See also Figure S7.

Thus, in both CD2ad and FGF9, EASs alter Golgi elaboration of N-glycans, shifting glycoform populations away from complex N-glycans and toward hybrid or oligomannose N-glycans. These results indicate that the primary sequence surrounding an N-glycosylation sequon modulates which glycoforms are produced.

DISCUSSION

N-glycosylation greatly expands the functionality of the proteome. The human N-glycome is much more complex than the human proteome (Bertozzi and Kiessling, 2001; Freeze, 2006), and differences in glycan occupancy and glycoform diversity exist between copies of the same protein produced by a given cell. A vast N-glycan processing network containing over 100 enzymes affects both glycosylation efficiency and subsequent N-glycan processing. Deficiencies in this network account for many congenital disorders of glycosylation (Chui, et al., 1997; Jaeken, 2010; Shrimal, et al., 2013) and loss-of-function diseases (Glozman, et al., 2009), while alterations can generate new biological function. Thus, variability in glycan occupancy and diversity strongly influence biological function.

We have shown that the primary sequence surrounding an N-glycosylation sequon strongly affects how efficiently it is glycosylated by OST. EASs are preferred by OST, exhibit higher glycan occupancy, and can result in native state stabilization (Culyba, et al., 2011; Hanson, et al., 2009; Price, et al., 2011). OST may therefore have evolved to prefer EASs, because of the likelihood that glycosylating such proteins would stabilize the native state and increase the overall levels of functional protein.

We have also shown that the primary sequence surrounding a sequon in a folded protein can strongly affect N-glycan processing. The extent to which an N-glycan is processed is determined, at least in part, by the accessibility of the N-glycan to the Golgi-resident glycan elaboration and/or trimming enzymes (Hsieh, et al., 1983; Hubbard, 1988; Rudd, et al., 1999; Thaysen-Andersen and Packer, 2012; Trimble, et al., 1983). Based on the decrease in complex-type glycoforms exhibited in EAS-containing variants of CD2ad and FGF9, it appears that an interaction between an N-glycan and a single protein side chain, like the interaction between the N-glycan and an aromatic side chain in an EAS (Chen, et al., 2013; Culyba, et al., 2011; Price, et al., 2011), can decrease the accessibility of an N-glycan enough to profoundly alter the extent to which it is processed in the Golgi.

Since EASs are excellent substrates for OST-mediated glycosylation and can reduce glycan elaboration in the secretory pathway, EASs can influence both intrinsic protein stabilization and the N-glycan structures produced. EASs can be engineered into therapeutic proteins to increase stability, glycan occupancy, and glycoform homogeneity, boosting total protein yields and decreasing immunogenicity. Additionally, manipulating the sequence space surrounding N-glycosylation sequons in therapeutic proteins may provide control over half-life and clearance by shifting glycoform populations toward or away from structures that can be cleared by lectins (Sorensen, et al., 2012). There are likely many other primary and secondary protein structure effectors of the glycoproteome that are yet to be discovered. Further exploration of the EAS, and perhaps the flanking sequence, should elucidate engineering guidelines for enhancing glycosylation by OST and provide strategies for controlling glycan structural homogeneity. This structure-based approach offers the exciting prospect of manipulating protein stability, immunogenicity, targeting, and a host of other properties conveyed by protein N-glycosylation.

SIGNIFICANCE

N-glycosylation, the oligosaccharyltransferase (OST)-mediated conjugation of oligosaccharides onto the Asn side chain NH of proteins, occurs to the majority of proteins that pass through the cellular secretory pathway. These N-glycans add an additional level of functional and structural diversity to the eukaryotic proteome. N-glycans affect protein folding intrinsically through native state protein–glycan interactions and extrinsically by influencing N-glycoprotein engagement with proteostasis network pathways, including chaperone-mediated folding and proteasomal degradation pathways. N-glycans also engage with glycan binding proteins, affecting myriad biological processes. Herein we report that incorporating an aromatic amino acid (Phe, Tyr, His, or Trp) at the n-2 position relative to an N-glycosylation sequon, and thereby creating an “enhanced aromatic sequon”, increases sequon occupancy, due to OST substrate preferences rather than differences in other cellular processing events such as degradation or trafficking. Moreover, the preference for aromatic amino acids in the n-2 position is evident not just biochemically, but also evolutionarily, given the increased frequency of aromatic amino acids in this position in glycosylatable vs. non-glycosylatable proteins. The enhanced aromatic sequon also influences the composition of the attached N-glycans by altering glycan remodeling in the Golgi. The tandem domain experimental approach employed is also significant because it should be useful for further exploration of the sequence preferences of OST and those of the Golgi N-glycan processing enzymes. In summary, this approach offers the opportunity to manipulate N-glycoprotein structure, stability, immunogenicity, and targeting, thereby augmenting protein therapeutic stability, shelf-life, and half-life via sequence engineering.

EXPERIMENTAL PROCEDURES

Expression construct preparation

Standard protocols were used for vector construction. Briefly, all expression constructs were cloned into vectors allowing for CMV promoter-driven transcription. Mutations were introduced via QuikChange site-directed mutagenesis (Agilent Technologies 200515). The protocols for vector construction are described in detail in the Supplemental Experimental Procedures.

Cell culture

CD2ad-containing proteins were produced from HEK 293 cells transfected with FuGENE 6 (Roche 11814443001) and cultured as monolayers in D-MEM/F12 (Gibco 11330032) supplemented with 1% Penicillin-Streptomycin-Glutamine (Gibco 10378-016) and 10% Fetal Bovine Serum (Omega Scientific FB-02) media under standard conditions. Tandem FGF9 repeat proteins were produced from HEK 293T cells transfected with Xtreme Gene 9 (Roche 06365787001) and cultured as described above. FGF9-Fc fusions were produced from suspension HEK 293F cells transfected with 293Fectin (Invitrogen 12347-019) and cultured in FreeStyle 293 (Invitrogen 12338) under standard conditions. A detailed protocol for the pulse-chase experiments is available in the Supplemental Experimental Procedures.

Purification of CD2ad and FGF9 variants

CD2-L, CD2-F, and CD2-H were expressed as described above. Media from transfected cells was harvested every 2 days, and cells were split and expanded as necessary to maintain 50–80% confluency. Total harvested media volume for each variant was ~500 ml. Urea powder was added to media samples to 8 M to denature proteins and reduce interactions between CD2ad variants and BSA or other secreted proteins. Variants were purified from denatured media samples by FPLC. Samples were first run over 5 ml Histrap FF columns (GE Healthcare 17-5255-01) in 8 M urea + 150 mM NaCl + 30 mM Bis-Tris pH 6.8. CD2ad was eluted with a linear gradient of imidazole. Fractions containing CD2ad were pooled and concentrated in Amicon Ultra-15 Centrifugal Filter Units (Millipore UFC901008). 1 mM TCEP was added to concentrated samples, which were then purified by size exclusion chromatography in 8 M urea + 30 mM Tris pH 7.4 + 0.5% SDS using a Superdex 200 column (GE Healthcare 17-5175-01). Fractions containing CD2ad were again concentrated, dialyzed against 50 mM ammonium bicarbonate pH 8.4, and lyophilized. Samples were analyzed for purity by SDS-PAGE and silver staining (Pierce 21642).

FGF9-A and FGF9-F were expressed in HEK 293F suspension cultures, harvested 6 days post-transfection, and centrifuged at 6000 × g for 30 min to pellet cells before filtering by 0.22 μm filter to remove residual cells. Supernatant was run over a 5 mL HiTrap Protein A column (GE 17-0403-01), and Fc-fusion proteins were eluted with 0.1 M Glycine pH 2.7. Following dialysis into PBS, purified Fc-fusion proteins were analyzed for purity by SDS-PAGE and Coomassie blue staining.

Western blotting

Three days post-transfection for CD2ad proteins and two days post-transfection for FGF9 proteins, media was harvested and centrifuged at 1,000 × g for 2 min to pellet cell contaminants, after which the supernatant was transferred to fresh tubes. Cells were harvested at the same time from culture dishes by trypsinization (Gibco 12605-010) and centrifuged at 1,000 × g for 2 min. Cell pellets were washed with DPBS (Gibco 14190-136) and lysed in lysis buffer (DPBS + 1% Triton X-100 + 1 mM EDTA) for 30 min at 4 °C. Lysed cells were centrifuged at 10,000 × g for 10 min, and post-nuclear supernatants were transferred to fresh tubes. Total protein in cell lysates was quantified by Micro BCA (Thermo Scientific 23235). For PNGase F digests, samples were incubated with 1 μl PNGase F (NEB P0704S) for 2 h at 37 °C, according to manufacturer’s instructions. For TEV digests, samples were incubated with 2 μl AcTEV Protease (Invitrogen 12575-015) at 30 °C overnight, according to manufacturer’s instructions. For mannosidase digests, purified CD2ad variants (25 ng) were incubated with 0.5 μl α1-2,3 mannosidase (NEB P0729S) for 2 hr at 37 °C, according to manufacturer’s instructions. For CD2ad single domain constructs and CD2ad and FGF9 tandem constructs, Laemmli buffer containing 1 mM TCEP was added to 25 μg lysate per sample, and media samples volumes were proportional to the total protein concentrations of their respective lysates. For tandem constructs, Laemmli buffer containing 1 mM TCEP was added to 10 μg lysate per sample, and media samples volumes were proportional to the amount of cells present in their respective lysates. For purified CD2ad variants, Laemmli buffer containing 1 mM TCEP was added to 25 ng purified protein. Lanes labeled “C” contain 20 ng non-glycosylated CD2ad control expressed in and purified from E. coli. (Culyba, et al., 2011) All samples were boiled in Laemmli buffer for 5 min and subjected to SDS-PAGE using 4–12% or 12% NuPAGE Novex Bis-Tris pre-cast gels (Invitrogen NP0323BOX, NP0343BOX) and MOPS running buffer (Invitrogen NP0001). Proteins were transferred to nitrocellulose membranes, which were stained with Ponceau S to visualize sample loading and transfer. Western blotting then proceeded using Odyssey blocking buffer (LI-COR 927-40000) and mouse anti-FLAG antibody (Agilent 2004 72-21, 1:2000), mouse anti-HA antibody (Covance MMS-101P, 1:2000) or mouse anti-V5 antibody (Life Technologies R960-25, 1:5000). Proteins were visualized and quantified using goat anti-mouse IRDye 800CW (LI-COR 827-08364, 1:15000) and the Odyssey Infrared Imaging System.

Glycomic analysis

Glycomic analysis was carried out using GlycoWorkBench according to previous protocols (Ceroni, et al., 2008; Jang-Lee, et al., 2006). A detailed protocol for the glycomic analysis is available in the Supplemental Experimental Procedures.

Linear Regression models and amino acid frequency analysis

Detailed protocols for the linear regression modeling of glycan occupancies and amino acid frequency analysis are available in the Supplemental Experimental Procedures.

Supplementary Material

supplement

HIGHLIGHTS.

  • aromatic amino acids two residues before sequons increase glycosylation efficiency

  • aromatic amino acids two residues before sequons decrease N-glycan heterogeneity

  • increased glycan occupancy results from oligosaccharyltransferase preferences

  • decreased N-glycan heterogeneity results from suppressed Golgi glycan remodeling

Acknowledgments

The authors thank Professor Ian A. Wilson for the Fc fusion expression vector, Dr. Joseph Genereux for the D18G TTR expression construct, Dr. Elizabeth Culyba for purified non-glycosylated CD2ad, and Dr. Colleen Fearns for critical reading of and assistance in the preparation of this publication. This work was supported by the NIH R01 grant (GM051105 to J.W.K. and E.T.P), the Skaggs Institute for Chemical Biology, the Lita Annenberg Hazen Foundation, and the Biotechnology and Biological Sciences Research Council grant (BB/K016164/1 to A.D. and S.M.H).

Footnotes

AUTHOR CONTRIBUTIONS

A.N.M. and W.C. designed and executed experiments testing glycan occupancy/glycosylation efficiency and expressed and purified proteins for glycoform analysis. A.A. and S.M.H. identified all glycoforms described. S.R.H. and R.L.W. helped to design experiments. D.L.P. performed statistical analyses of the data. E.T.P. performed statistical analyses of the data, amino acid frequency analysis, and helped to design experiments. A.D. and J.W.K. directed the project. A.N.M., E.T.P. and J.W.K. wrote the manuscript. All authors edited the manuscript.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Ahrens PB. Role of target cell glycoproteins in sensitivity to natural killer cell lysis. J Biol Chem. 1993;268:385–391. [PubMed] [Google Scholar]
  2. Apweiler R, Hermjakob H, Sharon N. On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim Biophys Acta. 1999;1473:4–8. doi: 10.1016/s0304-4165(99)00165-8. [DOI] [PubMed] [Google Scholar]
  3. Bas T, Gao GY, Lvov A, Chandrasekhar KD, Gilmore R, Kobertz WR. Post-translational N-glycosylation of type I transmembrane KCNE1 peptides: implications for membrane protein biogenesis and disease. J Biol Chem. 2011;286:28150–28159. doi: 10.1074/jbc.M111.235168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ben-Dor S, Esterman N, Rubin E, Sharon N. Biases and complex patterns in the residues flanking protein N-glycosylation sites. Glycobiology. 2004;14:95–101. doi: 10.1093/glycob/cwh004. [DOI] [PubMed] [Google Scholar]
  5. Bertozzi CR, Kiessling LL. Chemical glycobiology. Science. 2001;291:2357–2364. doi: 10.1126/science.1059820. [DOI] [PubMed] [Google Scholar]
  6. Caramelo JJ, Parodi AJ. Getting in and out from calnexin/calreticulin cycles. J Biol Chem. 2008;283:10221–10225. doi: 10.1074/jbc.R700048200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Ceroni A, Maass K, Geyer H, Geyer R, Dell A, Haslam SM. GlycoWorkbench: a tool for the computer-assisted annotation of mass spectra of glycans. J Proteome Res. 2008;7:1650–1659. doi: 10.1021/pr7008252. [DOI] [PubMed] [Google Scholar]
  8. Chen W, Enck S, Price JL, Powers DL, Powers ET, Wong CH, Dyson HJ, Kelly JW. Structural and energetic basis of carbohydrate-aromatic packing interactions in proteins. J Am Chem Soc. 2013;135:9877–9884. doi: 10.1021/ja4040472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen W, Enck S, Price JL, Powers DL, Powers ET, Wong CH, Dyson HJ, Kelly JW. Structural and energetic basis of carbohydrate-aromatic packing interactions in proteins. J Am Chem Soc. 2013;135:9877–9884. doi: 10.1021/ja4040472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chui D, Oh-Eda M, Liao YF, Panneerselvam K, Lal A, Marek KW, Freeze HH, Moremen KW, Fukuda MN, Marth JD. Alpha-mannosidase-II deficiency results in dyserythropoiesis and unveils an alternate pathway in oligosaccharide biosynthesis. Cell. 1997;90:157–167. doi: 10.1016/s0092-8674(00)80322-0. [DOI] [PubMed] [Google Scholar]
  11. Chui D, Sellakumar G, Green R, Sutton-Smith M, McQuistan T, Marek K, Morris H, Dell A, Marth J. Genetic remodeling of protein glycosylation in vivo induces autoimmune disease. Proc Natl Acad Sci U S A. 2001;98:1142–1147. doi: 10.1073/pnas.98.3.1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Culyba EK, Price JL, Hanson SR, Dhar A, Wong CH, Gruebele M, Powers ET, Kelly JW. Protein native-state stabilization by placing aromatic side chains in N-glycosylated reverse turns. Science. 2011;331:571–575. doi: 10.1126/science.1198461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Freeze HH. Genetic defects in the human glycome. Nat Rev Genet. 2006;7:537–551. doi: 10.1038/nrg1894. [DOI] [PubMed] [Google Scholar]
  14. Gavel Y, von Heijne G. Sequence differences between glycosylated and non-glycosylated Asn-X-Thr/Ser acceptor sites: implications for protein engineering. Protein Eng. 1990;3:433–442. doi: 10.1093/protein/3.5.433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Glozman R, Okiyoneda T, Mulvihill CM, Rini JM, Barriere H, Lukacs GL. N-glycans are direct determinants of CFTR folding and stability in secretory and endocytic membrane traffic. J Cell Biol. 2009;184:847–862. doi: 10.1083/jcb.200808124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hammarstrom P, Sekijima Y, White JT, Wiseman RL, Lim A, Costello CE, Altland K, Garzuly F, Budka H, Kelly JW. D18G transthyretin is monomeric, aggregation prone, and not detectable in plasma and cerebrospinal fluid: a prescription for central nervous system amyloidosis? Biochemistry. 2003;42:6656–6663. doi: 10.1021/bi027319b. [DOI] [PubMed] [Google Scholar]
  17. Hammond C, Braakman I, Helenius A. Role of N-linked oligosaccharide recognition, glucose trimming, and calnexin in glycoprotein folding and quality control. Proc Natl Acad Sci U S A. 1994;91:913–917. doi: 10.1073/pnas.91.3.913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hanson SR, Culyba EK, Hsu TL, Wong CH, Kelly JW, Powers ET. The core trisaccharide of an N-linked glycoprotein intrinsically accelerates folding and enhances stability. Proc Natl Acad Sci U S A. 2009;106:3131–3136. doi: 10.1073/pnas.0810318105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hebert DN, Lamriben L, Powers ET, Kelly JW. The intrinsic and extrinsic effects of N-linked glycans on glycoproteostasis. Nat Chem Biol. 2014;10:902–910. doi: 10.1038/nchembio.1651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Helenius A, Aebi M. Intracellular functions of N-linked glycans. Science. 2001;291:2364–2369. doi: 10.1126/science.291.5512.2364. [DOI] [PubMed] [Google Scholar]
  21. Hsieh P, Rosner MR, Robbins PW. Selective cleavage by endo-beta-N-acetylglucosaminidase H at individual glycosylation sites of Sindbis virion envelope glycoproteins. J Biol Chem. 1983;258:2555–2561. [PubMed] [Google Scholar]
  22. Hubbard SC. Regulation of glycosylation. The influence of protein structure on N-linked oligosaccharide processing. J Biol Chem. 1988;263:19303–19317. [PubMed] [Google Scholar]
  23. Imperiali B, Rickert KW. Conformational implications of asparagine-linked glycosylation. Proc Natl Acad Sci U S A. 1995;92:97–101. doi: 10.1073/pnas.92.1.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Jaeken J. Congenital disorders of glycosylation. Ann N Y Acad Sci. 2010;1214:190–198. doi: 10.1111/j.1749-6632.2010.05840.x. [DOI] [PubMed] [Google Scholar]
  25. Jang-Lee J, North SJ, Sutton-Smith M, Goldberg D, Panico M, Morris H, Haslam S, Dell A. Glycomic profiling of cells and tissues by mass spectrometry: fingerprinting and sequencing methodologies. Methods Enzymol. 2006;415:59–86. doi: 10.1016/S0076-6879(06)15005-3. [DOI] [PubMed] [Google Scholar]
  26. Jitsuhara Y, Toyoda T, Itai T, Yamaguchi H. Chaperone-like functions of high-mannose type and complex-type N-glycans and their molecular basis. J Biochem. 2002;132:803–811. doi: 10.1093/oxfordjournals.jbchem.a003290. [DOI] [PubMed] [Google Scholar]
  27. Joao HC, Dwek RA. Effects of glycosylation on protein structure and dynamics in ribonuclease B and some of its individual glycoforms. Eur J Biochem. 1993;218:239–244. doi: 10.1111/j.1432-1033.1993.tb18370.x. [DOI] [PubMed] [Google Scholar]
  28. Kasturi L, Chen H, Shakin-Eshleman SH. Regulation of N-linked core glycosylation: use of a site-directed mutagenesis approach to identify Asn-Xaa-Ser/Thr sequons that are poor oligosaccharide acceptors. Biochem J. 1997;323(Pt 2):415–419. doi: 10.1042/bj3230415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kasturi L, Eshleman JR, Wunner WH, Shakin-Eshleman SH. The hydroxy amino acid in an Asn-X-Ser/Thr sequon can influence N-linked core glycosylation efficiency and the level of expression of a cell surface glycoprotein. J Biol Chem. 1995;270:14756–14761. doi: 10.1074/jbc.270.24.14756. [DOI] [PubMed] [Google Scholar]
  30. Kornfeld R, Kornfeld S. Assembly of asparagine-linked oligosaccharides. Annu Rev Biochem. 1985;54:631–664. doi: 10.1146/annurev.bi.54.070185.003215. [DOI] [PubMed] [Google Scholar]
  31. Kowarik M, Young NM, Numao S, Schulz BL, Hug I, Callewaert N, Mills DC, Watson DC, Hernandez M, Kelly JF, et al. Definition of the bacterial N-glycosylation site consensus sequence. EMBO J. 2006;25:1957–1966. doi: 10.1038/sj.emboj.7601087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lowe JB, Marth JD. A genetic approach to Mammalian glycan function. Annu Rev Biochem. 2003;72:643–691. doi: 10.1146/annurev.biochem.72.121801.161809. [DOI] [PubMed] [Google Scholar]
  33. Lu X, Mehta A, Dadmarz M, Dwek R, Blumberg BS, Block TM. Aberrant trafficking of hepatitis B virus glycoproteins in cells in which N-glycan processing is inhibited. Proc Natl Acad Sci U S A. 1997;94:2380–2385. doi: 10.1073/pnas.94.6.2380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Martina JA, Daniotti JL, Maccioni HJ. Influence of N-glycosylation and N-glycan trimming on the activity and intracellular traffic of GD3 synthase. J Biol Chem. 1998;273:3725–3731. doi: 10.1074/jbc.273.6.3725. [DOI] [PubMed] [Google Scholar]
  35. Morrison KL, Weiss GA. Combinatorial alanine-scanning. Curr Opin Chem Biol. 2001;5:302–307. doi: 10.1016/s1367-5931(00)00206-4. [DOI] [PubMed] [Google Scholar]
  36. Nairn AV, York WS, Harris K, Hall EM, Pierce JM, Moremen KW. Regulation of glycan structures in animal tissues: transcript profiling of glycan-related genes. J Biol Chem. 2008;283:17298–17313. doi: 10.1074/jbc.M801964200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Oliver JD, van der Wal FJ, Bulleid NJ, High S. Interaction of the thiol-dependent reductase ERp57 with nascent glycoproteins. Science. 1997;275:86–88. doi: 10.1126/science.275.5296.86. [DOI] [PubMed] [Google Scholar]
  38. Ou WJ, Cameron PH, Thomas DY, Bergeron JJ. Association of folding intermediates of glycoproteins with calnexin during protein maturation. Nature. 1993;364:771–776. doi: 10.1038/364771a0. [DOI] [PubMed] [Google Scholar]
  39. Parodi AJ. Role of N-oligosaccharide endoplasmic reticulum processing reactions in glycoprotein folding and degradation. Biochem J. 2000;348 (Pt 1):1–13. [PMC free article] [PubMed] [Google Scholar]
  40. Petrescu AJ, Milac AL, Petrescu SM, Dwek RA, Wormald MR. Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding. Glycobiology. 2004;14:103–114. doi: 10.1093/glycob/cwh008. [DOI] [PubMed] [Google Scholar]
  41. Price JL, Culyba EK, Chen W, Murray AN, Hanson SR, Wong CH, Powers ET, Kelly JW. N-glycosylation of enhanced aromatic sequons to increase glycoprotein stability. Biopolymers. 2012;98:195–211. doi: 10.1002/bip.22030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Price JL, Powers DL, Powers ET, Kelly JW. Glycosylation of the enhanced aromatic sequon is similarly stabilizing in three distinct reverse turn contexts. Proc Natl Acad Sci U S A. 2011;108:14127–14132. doi: 10.1073/pnas.1105880108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Price JL, Powers DL, Powers ET, Kelly JW. Glycosylation of the enhanced aromatic sequon is similarly stabilizing in three distinct reverse turn contexts. Proc Natl Acad Sci U S A. 2011;108:14127–14132. doi: 10.1073/pnas.1105880108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Recny MA, Luther MA, Knoppers MH, Neidhardt EA, Khandekar SS, Concino MF, Schimke PA, Francis MA, Moebius U, Reinhold BB, et al. N-glycosylation is required for human CD2 immunoadhesion functions. J Biol Chem. 1992;267:22428–22434. [PubMed] [Google Scholar]
  45. Rudd PM, Wormald MR, Harvey DJ, Devasahayam M, McAlister MS, Brown MH, Davis SJ, Barclay AN, Dwek RA. Oligosaccharide analysis and molecular modeling of soluble forms of glycoproteins belonging to the Ly-6, scavenger receptor, and immunoglobulin superfamilies expressed in Chinese hamster ovary cells. Glycobiology. 1999;9:443–458. doi: 10.1093/glycob/9.5.443. [DOI] [PubMed] [Google Scholar]
  46. Ruiz-Canada C, Kelleher DJ, Gilmore R. Cotranslational and posttranslational N-glycosylation of polypeptides by distinct mammalian OST isoforms. Cell. 2009;136:272–283. doi: 10.1016/j.cell.2008.11.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sato T, Sako Y, Sho M, Momohara M, Suico MA, Shuto T, Nishitoh H, Okiyoneda T, Kokame K, Kaneko M, et al. STT3B-dependent posttranslational N-glycosylation as a surveillance system for secretory protein. Mol Cell. 2012;47:99–110. doi: 10.1016/j.molcel.2012.04.015. [DOI] [PubMed] [Google Scholar]
  48. Schmaltz RM, Hanson SR, Wong CH. Enzymes in the synthesis of glycoconjugates. Chem Rev. 2011;111:4259–4307. doi: 10.1021/cr200113w. [DOI] [PubMed] [Google Scholar]
  49. Sekijima Y, Wiseman RL, Matteson J, Hammarstrom P, Miller SR, Sawkar AR, Balch WE, Kelly JW. The biological and chemical basis for tissue-selective amyloid disease. Cell. 2005;121:73–85. doi: 10.1016/j.cell.2005.01.018. [DOI] [PubMed] [Google Scholar]
  50. Shrimal S, Ng BG, Losfeld ME, Gilmore R, Freeze HH. Mutations in STT3A and STT3B cause two congenital disorders of glycosylation. Hum Mol Genet. 2013;22:4638–4645. doi: 10.1093/hmg/ddt312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Shrimal S, Trueman SF, Gilmore R. Extreme C-terminal sites are posttranslocationally glycosylated by the STT3B isoform of the OST. J Cell Biol. 2013;201:81–95. doi: 10.1083/jcb.201301031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sorensen AL, Clausen H, Wandall HH. Carbohydrate clearance receptors in transfusion medicine. Biochim Biophys Acta. 2012;1820:1797–1808. doi: 10.1016/j.bbagen.2012.07.008. [DOI] [PubMed] [Google Scholar]
  53. Surleac MD, Spiridon LN, Tacutu R, Milac AL, Petrescu SM, Petrescu AJ. The Structural Assessment of Glycosylation Sites Database - SAGS - An Overall View on N-Glycosylation. In: Petrescu SM, editor. In Glycosylation. InTech; 2012. pp. 3–20. [Google Scholar]
  54. Thaysen-Andersen M, Packer NH. Site-specific glycoproteomics confirms that protein structure dictates formation of N-glycan type, core fucosylation and branching. Glycobiology. 2012;22:1440–1452. doi: 10.1093/glycob/cws110. [DOI] [PubMed] [Google Scholar]
  55. Trimble RB, Maley F, Chu FK. GlycoProtein biosynthesis in yeast. protein conformation affects processing of high mannose oligosaccharides on carboxypeptidase Y and invertase. J Biol Chem. 1983;258:2562–2567. [PubMed] [Google Scholar]
  56. Vembar SS, Brodsky JL. One step at a time: endoplasmic reticulum-associated degradation. Nat Rev Mol Cell Biol. 2008;9:944–957. doi: 10.1038/nrm2546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Wacker M, Linton D, Hitchen PG, Nita-Lazar M, Haslam SM, North SJ, Panico M, Morris HR, Dell A, Wren BW, et al. N-linked glycosylation in Campylobacter jejuni and its functional transfer into E. coli. Science. 2002;298:1790–1793. doi: 10.1126/science.298.5599.1790. [DOI] [PubMed] [Google Scholar]
  58. Wada Y, Azadi P, Costello CE, Dell A, Dwek RA, Geyer H, Geyer R, Kakehi K, Karlsson NG, Kato K, et al. Comparison of the methods for profiling glycoprotein glycans--HUPO Human Disease Glycomics/Proteome Initiative multi-institutional study. Glycobiology. 2007;17:411–422. doi: 10.1093/glycob/cwl086. [DOI] [PubMed] [Google Scholar]
  59. Wang C, Eufemi M, Turano C, Giartosio A. Influence of the carbohydrate moiety on the stability of glycoproteins. Biochemistry. 1996;35:7299–7307. doi: 10.1021/bi9517704. [DOI] [PubMed] [Google Scholar]
  60. Ware FE, Vassilakos A, Peterson PA, Jackson MR, Lehrman MA, Williams DB. The molecular chaperone calnexin binds Glc1Man9GlcNAc2 oligosaccharide as an initial step in recognizing unfolded glycoproteins. J Biol Chem. 1995;270:4697–4704. doi: 10.1074/jbc.270.9.4697. [DOI] [PubMed] [Google Scholar]
  61. Wormald MR, Dwek RA. Glycoproteins: glycan presentation and protein-fold stability. Structure. 1999;7:R155–160. doi: 10.1016/s0969-2126(99)80095-1. [DOI] [PubMed] [Google Scholar]
  62. Wyss DF, Choi JS, Li J, Knoppers MH, Willis KJ, Arulanandam AR, Smolyar A, Reinherz EL, Wagner G. Conformation and function of the N-linked glycan in the adhesion domain of human CD2. Science. 1995;269:1273–1278. doi: 10.1126/science.7544493. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

RESOURCES