Background: O-Glycans are known to modulate the biological activity and physicochemical properties of type I interferon.
Results: A single sugar unit modulates the structure and dynamics of the protein.
Conclusion: In this study, we propose an explanation of the effects of glycosylation on biological activity.
Significance: The results will facilitate the study of any O-GalNAc glycoprotein.
Keywords: Glycoprotein, Glycosyltransferases, Interferon, Nuclear Magnetic Resonance, Structural Biology
Abstract
Enzymatic addition of GalNAc to isotopically labeled IFNα2a produced in Escherichia coli yielded the O-linked glycoprotein GalNAcα-[13C,15N]IFNα2a. The three-dimensional structure of GalNAcα-IFNα2a has been determined in solution by NMR spectroscopy at high resolution. Proton-nitrogen heteronuclear Overhauser enhancement measurements revealed that the addition of a single monosaccharide unit at Thr-106 significantly slowed motions of the glycosylation loop on the nanosecond time scale. Subsequent addition of a Gal unit produced Gal(β1,3)GalNAcα-[13C,15N]IFNα2a. This extension resulted in a further decrease in the dynamics of this loop. The methodology used here allowed the first such description of the structure and dynamics of an O-glycoprotein and opens the way to the study of this class of proteins.
Introduction
Cells infected by viruses produce type I interferons to signal the presence of the invader to uninfected neighboring cells (1). These cytokines are called interferon because they bind to specific cell-surface receptors to activate specific genes that will interfere with virus replication. The discovery of their biological functions led to the development of these proteins as therapeutics. In fact, type I interferons are among the most widely used human recombinant protein therapeutics for the treatment of several cancers and viral infections. Within the 13 α-subtypes, only two, IFNα2a (Roferon-A®) and IFNα2b (Intron-A®), are marketed for therapeutic use. Recombinant IFNα2 is a polypeptide of 165 amino acids. Subtypes a and b differ only by one residue at position 23 (Lys to Arg). The three-dimensional structure was determine by Klaus et al. (2) using NMR spectroscopy in solution. The fold consists of five α-helices (labeled A–E), four of which are arranged in an antiparallel fashion to form a left-handed four-helix bundle. Two disulfide bonds stabilize the conformation. During the development of recombinant versions of IFNα2 from Escherichia coli for therapeutic use, Adolf et al. (3) discovered that three subtypes (a–c) of native IFNα2 are O-glycosylated at Thr-106. This residue is located in the CD loop at one end of the helix bundle and may not interact with the receptor. In fact, the recently reported structures of complexes between IFNα2a and its receptor by NMR (4) and x-ray (5) do not shed light on such possible interactions. This may explain the similar biological activity observed for both the recombinant IFNα2 produced in bacteria and the glycosylated version purified from human leukocytes. However, Loignon et al. (6) reported that O-glycosylated IFNα2b produced in HEK293 cells inhibits viral activity more than IFNα2b prepared in E. coli. This suggests that the glycan does modulate biological activity.
In general, glycosylation of serine/threonine residues of proteins plays important roles in protein localization and trafficking, protein solubility, antigenicity, and cell-cell interactions (7). Recent evidence suggested that the glycan at Thr-106 might not improve the stability of the conformation under thermal stress. Using far-UV circular dichroism and calorimetry to monitor thermal unfolding, Johnston et al. (8) showed that glycosylated IFNα2b is less thermostable by ∼2 °C than the non-glycosylated form. In contrast, the opposite effect was observed for glycosylated granulocyte colony-stimulating factor. The latter is more thermostable by ∼5 °C compared with the non-glycosylated E. coli-derived version (9). These reports thus suggest that glycosylation modulates conformational stability. It may also play a role in protein solubility and half-life in serum (10). In addition, IFNα2 with covalently linked PEG exhibits a significantly increased bioavailability. One injection per week of the N-terminally PEGylated version of IFNα2b achieved the same therapeutic result as three injections per week of the non-PEGylated protein (11). Moreover, a PEGylated version of GalNAcα-IFNα2b, in which the PEG moiety was linked to the sugar, had a significantly greater plasma concentration and half-life and 25-fold greater antiviral activity than E. coli-derived IFNα2b (12). The role played by the glycan in modulating these biophysical properties (thermostability and solubility) may be explained in terms of perturbations of the conformation and/or dynamics of the polypeptide chain. Therefore, we aimed to apply NMR spectroscopy techniques to study the effects of glycosylation of IFNα2a on its structure and dynamics.
The main challenge of using NMR spectroscopy lies in the production of glycoprotein samples enriched with stable isotopes such as carbon-13 and nitrogen-15, which are required to determine the three-dimensional structure and measure parameters sensitive to protein dynamics. Recently, a eukaryotic expression system based on human cell lines (HEK293) has been reported to produce high levels of glycosylated IFNα2b (6). Although isotope incorporation may be feasible using defined serum-free media with such eukaryotic expression systems, it is a very expensive endeavor. In addition, this approach produces glycosylated IFNα2b with a heterogeneous glycan composition. Adolf et al. (3) detected three major glycan structure variants in naturally produced IFNα2: the core-1 (or T antigen) structure Gal(β1,3)GalNAc and the latter substituted with either lactosamine linked to the GalNAc unit or sialic acid linked to the Gal moiety. Less abundant structures may incorporate fucose (3). Therefore, to address both challenges of isotope incorporation and glycan heterogeneity, we chose a strategy based on the production of labeled IFNα2a in E. coli, followed by an in vitro enzymatic stepwise addition of the various monosaccharide units. This approach, first proposed by Wong and co-workers (13), provides a flexible labeling scheme and facilitates the production of a homogeneous glycoprotein for NMR studies. A priori, one may consider making a fully 13C,15N-labeled O-glycoprotein to take full advantage of triple resonance NMR techniques. Because UDP-[13C,15N]GalNAc is not commercially available, its synthesis from the very expensive [13C,15N]GlcNAc (14, 15) would be required. The many steps involved would further increase the final cost of producing a fully labeled sample. Instead, we opted for the preparation of a fully labeled polypeptide chain bearing unlabeled glycans. This labeling scheme was more practical in view of our intention to first study the effects of a single GalNAc residue on the protein structure and dynamics.
EXPERIMENTAL PROCEDURES
Preparation of GalNAcα-[13C,15N]IFNα2a
The procedure for the preparation of [13C,15N]IFNα2a has been described (16). Enzymatic addition of GalNAc to [13C,15N]IFNα2a was carried out using human polypeptide GalNAc-transferase 2 (GalNAc-T2).2 The enzyme was produced in E. coli as a fusion protein with maltose-binding protein.3 The reaction was performed at 25 °C for 3 h with 1 mg/ml [13C,15N]IFNα2a in 20 mm MES buffer (pH 6.0) containing 4 mm MnCl2, 150 mm NaCl, 0.005% NaN3, 0.005% Tween 80, 1 mm UDP-GalNAc, and 20 milliunits/ml MalE-GalNAc-T2. The reaction was followed by capillary electrophoresis analysis using a Beckman Coulter P/ACE MDQ system equipped with a photodiode array detector. The aliquots were denatured at 50 °C for 5 min in 0.1% SDS, 1 mm DTT, and 2.5 mm EDTA. The separation was performed at 18 kV in 20 mm Na2B4O7 (pH 9.4) and 50 mm SDS using a bare fused silica capillary (50 μm × 50 cm) equilibrated at 25 °C. GalNAcα-[13C,15N]IFNα2a was purified as described below before galactosylation.
Preparation of Gal(β1,3)GalNAcα-[13C,15N]IFNα2a
Enzymatic addition of a Gal unit to purified GalNAcα-[13C,15N]IFNα2a was carried out using a MalE fusion of the Campylobacter jejuni β1,3-galactosyltransferase (MalE-CgtB) (17). The reaction was performed at 25 °C for 4 h with 1 mg/ml GalNAcα-[13C,15N]IFNα2a in 50 mm MES buffer (pH 6.0) containing 6 mm MnCl2, 1.0 mm DTT, 2.5 mm UDP-Gal, and 100 milliunits/ml MalE-CgtB. The reaction was followed by capillary electrophoresis as described above.
Purification of Labeled Glycoproteins
The reaction mixtures were desalted on a HiPrep 26/10 desalting column (GE Healthcare). The column was eluted at 5 ml/min with 1.5 column volumes of 25 mm NaOAc (pH 5.0). The desalted fractions were pooled and loaded on a 5-ml HiTrap SP HP column (GE Healthcare) equilibrated in 25 mm NaOAc buffer (pH 5.0). The material was eluted at 5 ml/min using a 0–0.8 m NaCl gradient over 16 column volumes. Fractions (2 ml) absorbing at 280 nm were analyzed by SDS-PAGE (15% acrylamide) to determine which fractions contained the labeled glycoprotein.
NMR Spectroscopy
NMR measurements were performed at 25 °C on Bruker AVANCE III 600-MHz and AVANCE 700-MHz spectrometers equipped with a triple resonance cryogenic probe head. Samples were made of 0.7–0.9 mm labeled proteins ([15N]IFNα2a, [13C,15N]IFNα2a, GalNAcα-[13C,15N]IFNα2a, or Gal(β1,3)GalNAcα-[13C,15N]IFNα2a) in buffer containing 25 mm deuterated acetic acid (pH 3.5), 0.1 mm 3-(trimethylsilyl)-1-propanesulfonic acid sodium, and 5% deuterium oxide. Resonance assignments of the backbone atoms were obtained from two-dimensional 1H,15N heteronuclear single quantum correlation (HSQC) and 1H,13C heteronuclear multiple quantum correlation and three-dimensional HNCO, HN(CO)CA, HNCA, HNCACB, and CBCA(CO)NH experiments. Side chain resonances were assigned using three-dimensional HCCH total correlation spectroscopy, three-dimensional HCCH COSY, Aro-CB(CGCD)HD, and Aro-CB(CGCDCE)HE. For structure calculation purposes, three-dimensional 15N- and 13C-edited NOESY-HSQC, four-dimensional CC NOESY, and four-dimensional CN NOESY experiments were recorded with mixing times of 150 ms. Resonances for GalNAc were assigned using two-dimensional filtered NOESY with 0.7 mm GalNAcα-[13C,15N]IFNα2a in buffer containing 100% D2O. All spectra were referenced relative to 3-(trimethylsilyl)-1-propanesulfonic acid sodium, processed using NMRPipe (18), and analyzed with NMRViewJ (19) and Sparky (30). The protein chemical shifts were deposited in the Biological Magnetic Resonance Data Bank (BMRB) under accession numbers 18135 and 18344 for GalNAcα-IFNα2a and Gal(β1–3)GalNAcα-IFNα2a, respectively. The 15N{1H} steady-state heteronuclear Overhauser enhancement experiment, described by Farrow et al. (20), was used to measure backbone dynamics at 298 K. The pulse sequence from the Bruker library was modified to add a relaxation delay of 3 s at the start of the pulse sequence. Three measurements at both fields (600 and 700 MHz) were performed for error estimation.
Structure Calculation
Structure calculations were performed using CYANA 2.1 (21). The amino acid library of CYANA had been modified to incorporate torsion angle parameters for an O-glycosylated threonine residue, where the hydroxyl oxygen of threonine was covalently attached to the reducing end (C1) of the GalNAc moiety in an axial orientation, resulting in an α-glycosidic bond. Cartesian coordinates of the GalNAcα-Thr residue were obtained by linking a threonine residue and GalNAc using the Chimera software package (22). A new CYANA entity was then created using the generated atomic coordinates and editing torsion angle definitions following the template for threonine (from the CYANA standard library) and the template for GalNAc. A total of 258 backbone torsion angles were derived from chemical shifts using TALOS+ (23). In addition, calculations were performed with 66 hydrogen bond constraints inferred from the sequential short- and medium-range NOE pattern (Table 1). Structure validation was performed using the following packages: the Protein Structure Validation Software Suite (PSVS; version 1.4) from the Northeast Structural Genomics Consortium (24), the structure validation program PROCHECK (25), and the structure validation software suites PDBsum (26) and WHATIF (27). No further refinement was necessary to obtain a well resolved NMR ensemble of structures. The NMR structure ensemble was deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics (RCSB), under code 2LMS. Visualization of the structure was performed with the Chimera software package (22).
TABLE 1.
Statistics of the 20 conformers of GalNAcα-IFNα2a
| Intramolecular NOE restraints | |
| Total | 5016 |
| Short-range (|i − j| ≤ 1) | 3476 |
| Medium-range (1 ≤ |i − j| < 5) | 830 |
| Long-range (|i − j| ≥ 5) | 710 |
| No. of non-redundant distance restraints | |
| Total | 2824 |
| Intraresidual (i = j ) | 731 |
| Sequential (|i − j| = 1 ) | 712 |
| Medium-range (1 < |i − j| ≤ 4) | 721 |
| Long-range (|i − j| ≥ 5 ) | 660 |
| GalNAc-protein | 20 |
| Hydrogen bondsa | 66*2 |
| Disulfide bridges | 2*3 |
| Dihedral angle restraints | |
| Total | 258 |
| Φ | 129 |
| Ψ | 129 |
| Restraint violations | |
| Distance > 0.5 Å | 0 |
| Dihedral > 5° | 0 |
| r.m.s.d. to averaged coordinates (Å) | |
| Backbone atoms (N, Cα, C′) | 0.9 ± 0.3 |
| All heavy atoms | 1.4 ± 0.3 |
| Ramachandran plot quality (%)b | |
| Residues in most favored regions | 93.2 |
| Residues in additionally allowed regions | 6.8 |
| Residues in generously allowed regions | 0.0 |
| Residues in disallowed regions | 0.0 |
| r.m.s.d. | |
| For covalent bonds (Å) | 0.001 |
| For covalent angles | 0.2° |
a For each hydrogen bond, two distance restraints were defined: 1.8 Å ≤ rNH-O ≤ 2.0 Å and 2.7 Å ≤ rN-O ≤ 3.0 Å. For each disulfide bridge, three distance restraints were used: 2.0 Å ≤ rSγ-Sγ ≤ 2.1 Å, 3.0 Å ≤ rCβ-Sγ ≤ 3.1 Å, and 3.0 Å ≤ rSγ-Cβ ≤ 3.1 Å.
b The residue ranges used were 2–32, 38–43, 46–100, and 102–158.
RESULTS
Sample Preparation
The preparation of glycosylated IFNα2a was carried out in two steps. First, the polypeptide was produced with carbon-13 and nitrogen-15 enrichment in E. coli. Then, an unlabeled GalNAc unit was added enzymatically in vitro to the fully labeled folded protein. This labeling scheme has two advantages: the cost effectiveness of using unlabeled monosaccharide units and the possibility to use isotope-filtered NMR experiments that can distinguish between protein and glycan resonances.
The polypeptide contains 14 serine and 10 threonine residues, half of which are well exposed to the bulk water. Considering that there are no consensus sequences for the addition of O-GalNAc at Ser/Thr residues, the choice of the appropriate glycosyltransferase isoform was determined from the work of DeFrees et al. (12). The authors subjected a synthetic peptide (residues 100–113) covering the potential glycosylation site of IFNα2a to six isoforms of GalNAc-T. Analysis of the reaction products by matrix-assisted laser desorption ionization time-of-flight mass spectrometry suggested that GalNAc-T2 added a single GalNAc unit at Thr-106. The addition of the first unit of the core-1 glycan, also referred to as the Tn antigen, was quantitative as showed by SDS-PAGE (supplemental Fig. 1). The initial analysis of the glycosylation reaction product was carried out using two-dimensional 1H,15N HSQC experiments to verify that GalNAc-T2 added the GalNAc unit at Thr-106 only. Comparison of the proton-nitrogen NMR correlation spectra of GalNAcα-IFNα2a and non-glycosylated IFNα2a showed that the small glycan induced the largest chemical shift perturbations at the glycosylation site and surrounding residues (Fig. 1A). The overlay revealed that both Thr-106 and Thr-108 showed large chemical shift differences. Considering that amide chemical shifts are very sensitive to the smallest change in the local magnetic environment, the large differences indicated either that Thr-108 was highly perturbed by the glycosylation of Thr-106 or that both residues had been glycosylated. Fortunately, carbon chemical shifts are better probes to detect changes in the covalent structure of the side chains of amino acids after post-translational modification. The large change in chemical shift (+9.92 ppm) observed for the β-carbon of Thr-106 in the HNCACB and CBCA(CO)NH experiments confirmed this residue as the glycosylation site (Fig. 1B). Indeed, an average difference of 0.17 ppm was measured for the β-carbon chemical shifts of all other Ser/Thr residues.
FIGURE 1.
A, overlay of two-dimensional HSQC results for IFNα2a (black) and GalNAcα-IFNα2a (red). B, strips from the HNCACB for IFNα2a and GalNAcα-IFNα2a. The resonance for Cβ of Thr-106 was significantly altered upon glycosylation, whereas that of Thr-108 was unchanged. Note that the Cβ resonance for Thr-106 is aliased and appears at the top portion of the strip (asterisk).
Resonance Assignment
Protein resonance assignment was carried out following the standard protocol based on the usual array of triple resonance NMR techniques. Resonance assignment of the GalNAc unit was carried out using 13C,15N-filtered experiments that select glycan (unlabeled) resonances by filtering out all carbon- and nitrogen-bound proton signals. After several attempts, direct observation of the GalNAc anomeric proton using filtered experiments was not successful. However, spectra from filtered experiments did show what could be attributed to a correlation between H1 and H2, from which we could obtain the chemical shift of H1. However, the symmetrical correlation (H2 to H1) was not observed (supplemental Fig. 2A). Using a two-dimensional 1H,13C heteronuclear multiple quantum correlation experiment recorded over 5 days, it was possible to measure unambiguously the unique anomeric 1H and 13C chemical shift pair at natural abundance. Although this proton-carbon correlation resonates in a specific region of the spectrum that is sufficiently distant from the intense protein signals (supplemental Fig. 2B), it is not the case for all other proton-carbon correlations of the pyranose ring. These resonate at the edge of the fully labeled α-carbon region and are masked by the latter. The remaining GalNAc proton chemical shifts were assigned using two-dimensional 13C,15N-filtered NOESY (supplemental Fig. 2A).
Backbone Dynamics
The steady-state heteronuclear Overhauser effect between nitrogen-15 and proton was measured at two different field strengths to assess the effects of glycosylation on the protein backbone dynamics. The addition of a single GalNAc unit at Thr-106 resulted in a reduction of the backbone motions of the neighboring residues (Fig. 2), whereas the motions of all other residues were not significantly changed (supplemental Fig. 3). Elongation of the glycan chain with a galactose unit further reduced the motions of the loop surrounding the glycosylation site. Although the effects of the disaccharide on the dynamics were small, they were nevertheless significant. However, we did not pursue the determination of the three-dimensional structure of Gal(β1,3)GalNAcα-IFNα2a.
FIGURE 2.
Plot of the normalized 15N{1H} NOE measured at 25 °C and 700 MHz for [13C,15N]IFNα2a (red bars), GalNAcα-[13C,15N]IFNα2a (green bars), and Gal(β1,3)GalNAcα-[13C,15N]IFNα2a (black bars) plotted for residues 90–120. The decrease in mobility is prominent for residues 105, 106, 108, and 112 (green bars). It is noteworthy that although the alteration on a nanosecond time scale motion was unaffected upon addition of the second saccharide moiety for these residues, residues 90 and 95 displayed a further decrease in mobility when the second saccharide unit was attached (black bars). For the complete plot, see supplemental Fig. 3.
Three-dimensional Structure
The ensemble of structures of GalNAcα-IFNα2a shown in Fig. 3A was calculated using NOE-derived distance constraints and backbone torsion angle constraints derived from chemical shifts. The calculations were carried out with a fully extended polypeptide chain as a starting point to eliminate the possibility of any bias. The ensemble of structures shows the same overall conformation as reported by Klaus et al. (2) for the non-glycosylated protein in solution (Protein Data Bank code 1ITF) with two differences. The first nine residues at the N-terminal end of GalNAcα-IFNα2a adopt a well defined pseudo-helical (twisted coil) conformation, whereas these residues are not as well defined in the IFNα2a structure. The glycosylation loop (residues 101–108) is also better defined in the GalNAcα-IFNα2a ensemble compared with the IFNα2a ensemble (Fig. 3B). The orientation of the GalNAc unit (Fig. 3C) results from the measurement of 20 distance constraints between the glycosylation loop residues and the methyl and anomeric protons of the GalNAc moiety. Among the sugar-protein NOE contacts, most constraints are measured between the acetyl group methyl and Val-105 on one side and between the anomeric proton and residues Glu-107 and Thr-108 on the other.
FIGURE 3.
A, ensemble of 20 conformers of GalNAcα-IFNα2a (Protein Data Bank code 2LMS). B, NMR ensemble of 25 conformers of IFNα2a (Protein Data Bank code 1ITF). C, ensemble of 20 conformers of the GalNAc unit in which residues 105–107 were superimposed using pairwise least-squares fitting performed on all atoms of these three residues with the software Chimera (22). The average r.m.s.d. values for the backbone atoms of residues 105–107 and the atoms of the pyranose rings are 0.5 and 1.3 Å, respectively. D, model of GalNAcα-IFNα2a superimposed on the IFNα2a structure within the ternary complex (Protein Data Bank code 3SE3).
DISCUSSION
The structure of IFNα2a is a five-helix bundle fold with mobile segments. These are the first eight residues in N-terminal residues 42–52 and the glycosylation loop comprising residues 101–108 linking helices 3 and 4. The dynamic behavior of the N terminus is somewhat surprising considering that the first residue in the sequence is a cysteine involved in a disulfide bridge with Cys-98 located in helix 3. In contrast, residues 22–44 adopt an elongated peptide stretch terminated by a one-turn helix that displays a similar rigidity to all α-helices. Therefore, the presence of backbone motions produced an averaging of interproton distances, resulting in the measurement of fewer NOE-derived distance constraints (see Fig. 1a in Ref. 2). Thus, these mobile segments were not well defined (low root mean square deviation (r.m.s.d.)) in the ensemble of conformers shown in Fig. 3B. This dynamic behavior is probably an important factor responsible for the absence of coordinates for these residues in the x-ray structure of the IFNα2a-IFN receptor complex (5).
Attenuation of backbone dynamics was induced upon addition of a single GalNAc unit at Thr-106 (Fig. 2). A priori, this attenuation should reduce the averaging of interproton distances, thus facilitating the measurement of more distance constraints involving flexible segments. The ensemble of structures (Fig. 3A) determined in solution by NMR for GalNAcα-IFNα2a shows that the glycosylation loop and the N-terminal residues have a lower r.m.s.d. compared with the ensemble obtained for IFNα2a (Fig. 3B). This suggests that glycosylation-induced rigidification of the mobile segment neighboring Thr-106 may have contributed to obtaining a better defined structure; it is most probably not the major factor. The total number of restraints used by the software CYANA to calculate the structure of GalNAcα-IFNα2a is almost double the number used for IFNα2a. Although the same three- and four-dimensional versions of NOESY experiments were recorded on both proteins, the use of cryogenic probe head technology in this study was certainly the major factor explaining the difference. Indeed, a plot of the number of restraints per residues for each amino acid in the sequence (Fig. 4) revealed that the N terminus and the glycosylation loop both show a similar increase in the number of restraints. Although it is not possible to tease apart the respective contributions of the more sensitive instrumentation and the glycosylation to this increase in the number of constraints, the latter probably benefited from the less mobile glycosylation loop.
FIGURE 4.
Number of sequential (white bars), medium-range (gray bars), and long-range (black bars) restraints per residue for each amino acid in the protein sequence.
A first glance at Fig. 3A suggests that the conformation of the GalNAc unit is not well defined. This is partly due to the fact that the glycosylation loop experiences mobility, and it is not as well defined as the helices. Therefore, we assessed the conformation of the GalNAc unit in relationship to the protein backbone of tripeptide 105–107. This was accomplished with a superimposition of the backbone atoms from residues 105–107 (Fig. 3C) to use these atoms as a common reference for all conformers. Considering that few distance restraints were measured between the sugar (anomeric and methyl protons of the acetyl group only) and protons on the glycosylation loop, the conformation of the GalNAc unit was well defined with respect to the backbone of this loop (average r.m.s.d. of 1.3 Å). The orientation of the GalNAc moiety brings the amide proton of the acetyl group in proximity to the backbone carbonyl of Thr-106 with an average distance of 3.0 Å over all 20 structures. A similar conformation has been observed in synthetic O-GalNAc glycopeptides (28), suggesting that the acetyl group modulates the orientation of the GalNAc unit with the peptide backbone. In addition, the 3-hydroxyl, the attachment point of the next sugar, is pointing away from the protein, toward the bulk of the solvent. This indicates that the sugar unit adopts an appropriate conformation to allow recognition by glycoprotein-GalNAc 3β-galactosyltransferase (C1GALT1). This enzyme is responsible for elongation of the chain with galactose to make the core-1 glycan. Because a three-dimensional structure of C1GALT1 was not available, it was not possible to model the interaction.
The higher potency observed for glycosylated IFNα2b (6) triggered our interest to determine whether glycan-protein interactions could be present during the receptor-binding event. Using the structure of the ternary complex that includes both the α- and β-chains of the type I interferon α/β receptor (5), we substituted IFNα2a with our NMR structure of GalNAcα-IFNα2a into the complex to look for possible glycan-protein interactions (Fig. 3D). Before searching potential interactions with the receptor, it is worth mentioning that polar (hydrogen bonds) and hydrophobic contacts stabilize carbohydrate-protein complexes. Usually, a residue with an aromatic side chain mediates the latter. One example of this can be seen in the structure of the complex between lectin II from Ulex europaeus and its Gal(β1,3)GalNAc ligand (29), where a tyrosine interacts with the hydrophobic side of the GalNAc residue. Examination of the receptor area susceptible to interaction with the GalNAc unit revealed that a number of side chain atoms of residues on the α-chain near the glycosylation loop of GalNAcα-IFNα2a were missing from the crystal structure of the ternary complex. Of particular interest, tryptophan at position 183 was missing, but it was manually added. The resulting model suggests that an interaction between the hydrophobic side of the pyranose ring of the sugar and Trp-183 is possible. The ideal geometry to properly position the hydrophobic face of the sugar on top of the aromatic ring is not present. However, a minor reorientation of the backbone would achieve this accommodation. In fact, among the 20 conformers of the NMR ensemble, the model was built using the one conformer that best positioned the sugar near the tryptophan side chain. We refrained from manually fitting the interactions, but a model from a theoretical simulation might produce a realistic model. In addition, the heteronuclear Overhauser enhancement data indicate that this loop does remain sufficiently mobile and could accommodate the appropriate conformation with a negligible energy penalty. Furthermore, this reorientation of GalNAc would still leave the 3-hydroxyl pointing away toward the solvent, thus indicating that the interaction between GalNAc and Trp-183 could exist with the core-1 glycan or longer chains. This interaction may also explain the higher potency of O-glycosylated IFNα2b versus the non-glycosylated version. The additional contact between the GalNAc unit and Trp-183 would provide an increase in affinity between the cytokine and its receptor. Therefore, additional sugar residues on the glycan that increase glycan complexity and heterogeneity would display a similar potency if the longer chains did not possess excessive dynamics or steric interference. Moreover, the lower thermostability observed by Johnston et al. (8) for O-glycosylated IFNα2b may be explained by intermolecular interactions involving hydrophobic surfaces of the protein exposed during the unfolding process and the hydrophobic faces of sugar units of the glycan. The absence of such carbohydrate-protein interactions in non-glycosylated IFNα2b might explain the higher melting temperature observed.
In summary, the application of the reported enzymatic synthesis approach and the simple labeling scheme utilized to produce GalNAc-αIFNα2a allowed the determination of a three-dimensional structure at high resolution and the study of the dynamics. It is noteworthy that a single GalNAc unit suffices to modulate the backbone dynamics near the glycosylation site. To our knowledge, this is the first example of the characterization of the structure and dynamics of an O-GalNAc glycoprotein in solution. In conjunction with the recently reported structures of IFN complexes, our results shed light on the biological role of glycans.
Acknowledgments
We thank Drs. Jeremy Kunkel and Michael Johnston for critical reading of the manuscript and Dr. Warren W. Wakarchuk for useful discussions.
This work was supported by a Faculty of Graduate and Postdoctoral Affairs M.S. scholarship (to P. J. F. B.) and a National Science and Engineering Research Council Discovery Grant (to Y. A.).

This article contains supplemental Figs. 1–3.
The atomic coordinates and structure factors (code 2LMS) have been deposited in the Protein Data Bank (http://wwpdb.org/).
The protein chemical shifts have been deposited in the Biological Magnetic Resonance Data Bank under BMRB accession numbers 18135 and 18344.
Data not shown.
- GalNAc-T2
- GalNAc-transferase 2
- HSQC
- heteronuclear single quantum correlation
- r.m.s.d.
- root mean square deviation.
REFERENCES
- 1. Parham P. (2005) The Immune System, 2nd Ed., p. 250, Garland Science, New York [Google Scholar]
- 2. Klaus W., Gsell B., Labhardt A. M., Wipf B., Senn H. (1997) The three-dimensional high resolution structure of human interferon α2a determined by heteronuclear NMR spectroscopy in solution. J. Mol. Biol. 274, 661–675 [DOI] [PubMed] [Google Scholar]
- 3. Adolf G. R., Kalsner I., Ahorn H., Maurer-Fogy I., Cantell K. (1991) Natural human interferon α2 is O-glycosylated. Biochem. J. 276, 511–518 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Nudelman I., Akabayov S. R., Scherf T., Anglister J. (2011) Observation of intermolecular interactions in large protein complexes by 2D-double difference nuclear Overhauser enhancement spectroscopy: application to the 44 kDa interferon-receptor complex. J. Am. Chem. Soc. 133, 14755–14764 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Thomas C., Moraga I., Levin D., Krutzik P. O., Podoplelova Y., Trejo A., Lee C., Yarden G., Vleck S. E., Glenn J. S., Nolan G. P., Piehler J., Schreiber G., Garcia K. C. (2011) Structural linkage between ligand discrimination and receptor activation by type I interferons. Cell 146, 621–632 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Loignon M., Perret S., Kelly J., Boulais D., Cass B., Bisson L., Afkhamizarreh F., Durocher Y. (2008) Stable high volumetric production of glycosylated human recombinant IFNα2b in HEK293 cells. BMC Biotechnol. 8, 65–80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Brockausen I., Schachter H., Stanley P. (1999) in Essentials of Glycobiology (Varki A., Cummings R., Esko J., Freeze H., Hart G., Marth J., eds) pp. 115–128, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY [Google Scholar]
- 8. Johnston M. J., Frahm G., Li X., Durocher Y., Hefford M. A. (2011) O-Linked glycosylation leads to decreased thermal stability of interferon α2b as measured by two orthogonal techniques. Pharm. Res. 28, 1661–1667 [DOI] [PubMed] [Google Scholar]
- 9. Oh-eda M., Hasegawa M., Hattori K., Kuboniwa H., Kojima T., Orita T., Tomonou K., Yamazaki T., Ochi N. (1990) O-Linked sugar chain of human granulocyte colony-stimulating factor protects it against polymerization and denaturation, allowing it to retain its biological activity. J. Biol. Chem. 265, 11432–11435 [PubMed] [Google Scholar]
- 10. Youngster S., Wang Y. S., Grace M., Bausch J., Bordens R., Wyss D. F. (2002) Structure, biology, and therapeutic implications of pegylated interferon α2b. Curr. Pharm. Des. 8, 2139–2157 [DOI] [PubMed] [Google Scholar]
- 11. Glue P., Fang J. W., Rouzier-Panis R., Raffanel C., Sabo R., Gupta S. K., Salfi M., Jacobs S. (2000) Pegylated interferon α2b: pharmacokinetics, pharmacodynamics, safety, and preliminary efficacy data. Clin. Pharmacol. Ther. 68, 556–567 [DOI] [PubMed] [Google Scholar]
- 12. DeFrees S., Wang Z.-G., Xing R., Scott A. E., Wang J., Zopf D., Gouty D. L., Sjoberg E. R., Panneerselvam K., Brinkman-Van der Linden E. C., Bayer R. J., Tarp M. A., Clausen H. (2006) GlycoPEGylation of recombinant therapeutic proteins produced in Escherichia coli. Glycobiology 16, 833–843 [DOI] [PubMed] [Google Scholar]
- 13. Brik A., Ficht S., Wong C.-H. (2006) Strategies for the preparation of homogenous glycoproteins. Curr. Opin. Chem. Biol. 10, 638–644 [DOI] [PubMed] [Google Scholar]
- 14. Zhao G., Guan W., Cai L., Wang P. G. (2010) Enzymatic route to preparative-scale synthesis of UDP-GlcNAc/GalNAc, their analogues and GDP-fucose. Nat. Protoc. 5, 636–646 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Yamamoto K., Kawai H., Tochikura T. (1981) Preparation of uridine diphosphate-N-acetylgalactosamine from uridine diphosphate-N-acetylglucosamine by using microbial enzymes. Appl. Environ. Microbiol. 41, 392–395 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Panjwani N., Hodgson D. J., Sauvé S., Aubin Y. (2010) Assessment of the effects of pH, formulation and deformulation on the conformation of interferon α2 by NMR. J Pharm. Sci. 99, 3334–3342 [DOI] [PubMed] [Google Scholar]
- 17. Bernatchez S., Gilbert M., Blanchard M.-C., Karwaski M.-F., Li J., DeFrees S., Wakarchuk W. W. (2007) Variants of the β1,3-galactosyltransferase CgtB from the bacterium Campylobacter jejuni have distinct specificities. Glycobiology 17, 1333–1343 [DOI] [PubMed] [Google Scholar]
- 18. Delaglio F., Grzesiek S., Vuister G. W., Zhu G., Pfeifer J., Bax A. (1995) NMRpipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293 [DOI] [PubMed] [Google Scholar]
- 19. Johnson B. A., Blevins R. A. (1994) NMRView: a computer program for the visualization and analysis of NMR data. J. Biomol. NMR 4, 603–614 [DOI] [PubMed] [Google Scholar]
- 20. Farrow N. A., Zhang O., Forman-Kay J. D., Kay L. E. (1995) Comparison of the backbone dynamics of a folded and an unfolded SH3 domain existing in equilibrium in aqueous buffer. Biochemistry 34, 868–878 [DOI] [PubMed] [Google Scholar]
- 21. Güntert P. (2004) Automated NMR protein structure calculation with CYANA. Methods Mol. Biol. 278, 353–378 [DOI] [PubMed] [Google Scholar]
- 22. Pettersen E. F., Goddard T. D., Huang C. C., Couch G. S., Greenblatt D. M., Meng E. C., Ferrin T. E. (2004) UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 [DOI] [PubMed] [Google Scholar]
- 23. Shen Y., Delaglio F., Cornilescu G., Bax A. (2009) TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J. Biomol. NMR 4, 213–223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Bhattacharya A., Tejero R., Montelione G. T. (2007) Evaluating protein structures determined by structural genomics consortia. Proteins 66, 778–795 [DOI] [PubMed] [Google Scholar]
- 25. Laskowski R. A., MacArthur M. W., Moss D. S., Thornton J. M. (1993) PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26, 283–291 [Google Scholar]
- 26. Laskowski R. A. (2007) Enhancing the functional annotation of PDB structures in PDBsum using key figures extracted from the literature. Bioinformatics 23, 1824–1827 [DOI] [PubMed] [Google Scholar]
- 27. Vriend G. (1990) WHAT IF: a molecular modeling and drug design program. J. Mol. Graph. 8, 52–56 [DOI] [PubMed] [Google Scholar]
- 28. Coltart D. M., Royyuru A. K., Williams L. J., Glunz P. W., Sames D., Kuduk S. D., Schwarz J. B., Chen X.-T., Danishefsky S. J., Live D. H. (2002) Principles of mucin architecture: structural studies on synthetic glycopeptides bearing clustered mono-, di-, tri-, and hexasaccharide glycodomains. J. Am. Chem. Soc. 124, 9833–9844 [DOI] [PubMed] [Google Scholar]
- 29. Loris R., De Greve H., Dao-Thi M. H., Messens J., Imberty A., Wyns L. (2000) Structural basis of carbohydrate recognition by lectin II from Ulex europaeus, a protein with a promiscuous carbohydrate-binding site. J. Mol. Biol. 301, 987–1002 [DOI] [PubMed] [Google Scholar]
- 30. Goddard T. D., Kneller D. G. (2008) SPARKY 3, University of California, San Francisco [Google Scholar]




