Abstract
HPLC has been employed for decades to enhance detection sensitivity and quantification of complex analytes within biological mixtures. Among these analytes, glycans released from glycoproteins and glycolipids have been characterized as underivatized or fluorescently tagged derivatives by HPLC coupled to various detection methods. These approaches have proven extremely useful for profiling the structural diversity of glycoprotein and glycolipid glycosylation but require the availability of glycan standards and secondary orthogonal degradation strategies to validate structural assignments. A robust method for HPLC separation of glycans as their permethylated derivatives, coupled with in-line multidimensional ion fragmentation (MSn) to assign structural features independent of standards, would significantly enhance the depth of knowledge obtainable from biological samples. Here, we report an optimized workflow for LC-MS analysis of permethylated glycans that includes sample preparation, mobile phase optimization, and MSn method development to resolve structural isomers on-the-fly. We report baseline separation and MSn of isomeric N- and O-glycan structures, aided by supplementing mobile phases with Li+, which simplifies adduct heterogeneity and facilitates cross-ring fragmentation to obtain valuable monosaccharide linkage information. Our workflow has been adapted from standard proteomics-based workflows and, therefore, provides opportunities for laboratories with expertise in proteomics to acquire glycomic data with minimal deviation from existing buffer systems, chromatography media, and instrument configurations. Furthermore, our workflow does not require a mass spectrometer with high-resolution/accurate mass capabilities. The rapidly evolving appreciation of the biological significance of glycans for human health and disease requires the implementation of high-throughput methods to identify and quantify glycans harvested from sample sets of sufficient size to achieve appropriately powered statistical significance. The LC-MSn approach we report generates glycan isomeric separations and robust structural characterization and is amenable to autosampling with associated throughput enhancements.
Keywords: liquid chromatography, permethylation, glycan, reverse phase, mass spectrometry, nanoflow, nano spray ionization, ion trap, isomer
Abbreviations: ACN, acetonitrile; AcOH, acetic acid; CFG, Consortium for Functional Glycomics; DP, degree of polymerization; EIC, extracted ion chromatogram; ITMS, ion trap mass spectrometer; LiOAc, lithium acetate; NaOH, sodium hydroxide; MeI, methyl iodide; mTEC, mouse tracheal epithelial cell; nanoLC-NSI-MS, nanoflow Liquid Chromatography-Nanospray Ionization-Mass Spectrometry; MSn, multidimensional ion fragmentation; RP, reverse phase
Graphical Abstract
Highlights
-
•
LC separation of N- and O-linked glycans as their permethylated derivatives.
-
•
Glycan isomers resolved on reversed phase columns commonly used for peptides.
-
•
Analytes identified by exact mass and retention time relative to glycan standards.
-
•
Structural topologies validated by on-the-fly MSn using accessible instrumentation.
In Brief
Structural analysis of glycoprotein N-linked and O-linked glycans is greatly enhanced by permethylation following release from the protein backbone. HPLC of permethylated glycans resolves isomeric glycans that are difficult to discern by direct infusion methodologies. An approach is described that utilizes standard peptide separation columns and accessible instrumentation. The methodology is easily adoptable by laboratories already familiar with LC-based peptide separations to achieve isomeric resolution of glycans released from biological materials.
Every living cell is covered in a dense and complex array of glycoconjugates that modulate various biological processes such as molecular recognition, cell–cell, and cell–matrix interactions. The structural diversity of glycans arises from differences in monosaccharide compositions, anomeric confirmation, glycosidic linkages, branching, and their attachment to protein or lipid (1). A hallmark of protein glycosylation is the phenomenon of microheterogeneity or the property that any single glycosylation site may be unmodified or modified by more than one glycan structure when comparing molecules of the same protein, even if produced in the same cell type. This diversity presents unique challenges for characterizing glycoprotein glycosylation and creates vital needs for technologies that robustly capture glycan structural details (2). The biochemical and analytical tools currently deployed for glycomic analysis are almost as diverse as the biological roles of glycans and are highly influenced by the research purpose.
Given the complexity of glycan structures, with special reference to the broad range of isomeric possibilities, glycan separation by LC has emerged as a state-of-the-art approach in glycomics. Almost all previous LC work has been done on underivatized or fluorescently tagged glycans, allowing high sensitivity of detection and high resolution separations based on size, charge, and hydrophobicity (3). These approaches require previously characterized glycan standards as reference materials or sequential enzymatic (e.g., exoglycosidase sequencing) and/or chemical treatments (e.g., hydrofluoric acid) for complete glycan characterization. While offline or online MS-based methods can be combined with tagged-glycan separations, the sensitivity and information content of the resulting data is significantly influenced by the ionization characteristics and chemical nature of the tag. In that respect, permethylation has emerged as a routine derivatization procedure that improves sensitivity by enhancing MS ionization. Glycan permethylation converts the oxygen of all hydroxyl groups, including the sialic acid carboxylates, into an O-methyl groups (O-CH3). Because glycans generally have a large number of free hydroxyls, the O-methyl groups of permethylated glycans dominate the chemical behavior of the molecule, enhancing and equalizing ionization across diverse glycan structures (4). Permethylation thereby allows for the analysis and quantification of acidic and neutral structures in the mass spectrometer's positive ion detection mode.
In sequential fragmentation MS/MS and MSn analysis, “scars” of a free hydroxyl group instead of O-methyl are detected at the original position of glycosidic linkages, thereby providing valuable linkage information through cross-ring cleavages. Permethylation and heavy metal cations facilitate these cross-ring fragments enabling in-depth structural characterization of glycans by direct infusion into the mass spectrometer without LC separation (5, 6, 7, 8). However, structural elucidation of isomeric permethylated glycans poses analytic challenges in direct infusion MS because they are detected at the same m/z. This complication can be conquered by combining LC separation of permethylated glycans with intelligent MSn in realtime. Various LC-MS methods for analyzing permethylated glycans have been published recently (9, 10). Here, we report our solution, which is designed to be rapidly adaptable by laboratories familiar with standard proteomic workflows. Using an Ultimate 3000 RSLC HPLC system coupled to a Velos Pro Dual Pressure Linear Ion Trap, we describe parameters of sample preparation, mobile phase and gradient optimization, as well as postacquisition data analysis of permethylated chemically synthesized standards and glycans released from either standard proteins or biological sample mixtures. We have achieved baseline separation and MSn of isomeric N- and O-glycan structures to aid in confident identification and quantification by pushing the ionization into a single adduct form, namely lithium. Our workflow simplifies postacquisition data analyses and enhances throughput of glycomic data collection without drastically changing chromatography media used for proteomics or requiring a mass spectrometer with high-resolution/accurate mass capabilities.
Experimental Procedures
Materials and Reagents
PNGase F (N-glycanase) was obtained from the Complex Carbohydrate Research Center, University of Georgia, (Dr Kelley Moremen). PNGase A was acquired from NEB. N-Glycan standards were obtained from the Consortium for Functional Glycomics (CFG) and from The Scripps Research Institute (Dr James Paulson). O-Glycan standards were obtained as part of an NIH Common Fund grant (R21AI123161) from the University of Georgia (Dr Christopher M. West). Sodium hydroxide (NaOH) (50%) was purchased from Fisher Scientific. Sep-Pak C18 disposable extraction columns were obtained from Waters Corporation. AG-50W-X8 cation exchange resin (H+ form) was purchased from Bio-Rad, and trifluoroacetic acid from Pierce. Malto-series oligosaccharides were obtained from Wako Chemicals. Trypsin, Chymotrypsin, bovine pancreatic ribonuclease B (RNaseB), Fetuin, Dextran (Leuconostoc spp., Cat# 31388), and all other chemical reagents were purchased from Sigma-Aldrich. Fly powder, a dried preparation of Drosophila embryo glycoproteins, was prepared as previously described and used as a source of pauci-mannose glycans (11). Mouse brain extracts were prepared as previously described for analysis of brain O-glycans (12). HIV gp120 was recombinantly expressed in glyco-engineered Pichia pastoris strains obtained from GlycoFi to generate proteins with restricted glycan diversity (13, 14).
Preparation of protein-rich powder and subsequent O-glycan release
Primary mouse tracheal epithelial cells (mTECs), mouse brain (Pel-Freez Biologicals), or HEK293 cells (kind gift from Dr Henrik Clausen, University of Copenhagen, Denmark) were homogenized in ice-cold 50% (v/v) aqueous methanol and delipidated with chloroform/methanol/water (4:8:3, v/v/v) as described previously (11, 15). Insoluble proteins were precipitated by centrifugation, and protein pellets were washed twice with ice-cold acetone before drying under a gentle Nitrogen stream to produce fine protein powder. Two to 3 mg of protein powder was then subjected to reductive β-elimination, and the released O-glycan alditols were purified as described previously (16). Briefly, protein powder was resuspended in 100 mM NaOH containing 1 M sodium borohydride and incubated for 18 h at 45 °C in a glass tube sealed with a Teflon-lined screw top. After incubation, the reaction mixture was neutralized with 10% acetic acid (AcOH) on ice and desalted using a AG-50W-X8 (H+ form) column (1 ml bed volume) before borate removal and Sep-pack C18 cartridge clean-up.
Release and reduction of N-glycans
Preparation of glycopeptides and release of N-glycans was performed as described previously (11). Briefly, 20 to 75 μg of glycoproteins (RNAse B, Fetuin, gp120) were dried and resuspended in trypsin buffer (0.1 M Tris-HCl, pH 8.2, containing 1 mM CaCl2) by sonication and boiled for 5 min before addition of trypsin and chymotrypsin solutions. After incubation for 18 h at 37 °C, the (glyco)peptide mixture was boiled for 5 min and adjusted to 5% AcOH before a Sep-Pak C18 cartridge column clean up. Glycopeptides were eluted stepwise in 20% isopropyl alcohol in 5% AcOH and 40% isopropyl alcohol in 5% AcOH. Both, the 20 and 40% isopropyl alcohol fractions were pooled and evaporated to dryness. Dried glycopeptides were resuspended in 25 mM sodium phosphate buffer, pH 7.5, for digestion with PNGaseF (fetuin, ribonuclease B) or in 50 mM ammonium acetate buffer, pH 4.5, for digestion with PNGaseA (fly powder) before incubation for 18 h at 37 °C. PNGase-released oligosaccharides were separated from residual (glyco)peptides by another round of Sep-Pak C18 cartridge clean-up.
For the reduction of PNGase-released N-glycans and standards, dried glycans were resuspended in 100 μl of 50 mM NaOH, vortexed, and ultrasonic water bath sonicated before the addition of 100 μl 2% (w/v) sodium borohydride in 50 mM NaOH. After incubation for at least 4 h at RT, 200 μl of 10% (v/v) AcOH were added to neutralize the reaction. Samples were dried down under a gentle nitrogen stream before borate removal by adding 400 μl of 10% (v/v) AcOH in methanol and repeated evaporation (up to three times).
Permethylation
All released N- and O-linked glycans as well as standards were permethylated before MS analysis according to the method by Anumula and Taylor (17). Dextran, malto-series oligosaccharide standards and CFG N-glycan standards were permethylated with 13C methyliodide (13C-MeI).
nanoLC-MS/MS of permethylated glycans
Dried permethylated glycans and standards were dissolved in 100% methanol, and an aliquot (usually 5–10% of the total sample) was combined with an internal standard mix (13C-permethylated degree of polymerization [DP]4) and mobile phase A. For each nanoLC-MS/MS analysis, 3 to 6 μl of the prepared sample mix was injected for LC separation at 60 °C and a constant flow rate of 300 nl/min using an Ultimate 3000 RSLC (Thermo Fisher Scientific/Dionex) equipped with a PepMap Acclaim analytical C18 column (75 μm × 15 cm, 2 μm pore size) coupled to a Thermo Fisher Scientific Velos Pro Dual-Pressure Linear Ion Trap mass spectrometer (ITMS), ending with sample ionization via a stainless steel emitter. The three solvent systems used were either 0.1% formic acid in water for mobile phase A and 0.1% formic acid in 80% acetonitrile (ACN) for mobile phase B (“Proteomics buffers”); 0.1% AcOH containing 0.1 mM lithium acetate (LiOAc) for mobile phase A and 0.1% AcOH in 80% ACN containing 0.1 mM LiOAc for mobile phase B (“Low lithium buffers”); or 0.02% AcOH containing 1 mM LiOAc for mobile phase A and 0.02% AcOH in 80% ACN containing 1 mM LiOAc for mobile phase B (“High lithium buffers”).
For O-glycans, after equilibrating the column in 99% mobile phase A for 5 min, separation was achieved using a linear gradient from 30% to 70% mobile phase B over 150 min. For N-glycans, after equilibrating the column in 99% mobile phase A for 5 min, separation was achieved using a linear gradient from 45% to 70% mobile phase B over 150 min. The analytical column was regenerated after each run by ramping to 99% mobile phase B for 10 min and then returning to 99% mobile phase A to re-equilibrate.
The Velos Pro Dual Pressure Linear ITMS was operated in positive ion mode with a spray voltage of 1.8 to 2.2 kV and capillary temperature set at 210 °C. The MS method consisted of first collecting a Full ITMS (MS1) survey scan in profile mode and m/z mass range from 500 to 2000 with automatic gain control Target set to 30,000.00, followed by data-dependent MS2 fragmentation of the top 3 to 5 most intense peaks using collision-induced dissociation at 40 to 42% collision energy and an isolation window of 2 m/z. Dynamic exclusion parameters were set to exclude ions for fragmentation for 15 s if they were detected and fragmented five times in 15 s. For targeted MSn approaches, intelligent fragmentation methods were generated using neutral loss function and product dependent MS3-5 acquisition to determine sialic acid linkage of terminally sialylated N-glycan structures. To normalize elution times on the RP-column in terms of g.u., 13C-permethylated dextran (4–10 g.u.) was analyzed before and after a sequence of samples. After the completion of a sample set, in preparation for limited HPLC use for an extended period of time, the postcolumn-fused silica lines were manually flushed with 50% methanol in water to avoid clogging of the stainless steel emitter with lithium salts. All data were processed manually using the Xcalibur software package 2.0. GRITS Toolbox; a freely available tool for semiautomated annotation of glycomic MS data was also employed to generate candidates for structural assignment (18). The GRITS Toolbox parameters used for annotations were set as follows: all glycosidic and cross-ring cleavage types enabled (B, Y, C, Z, A, X); maximum number of cleavages and cross-ring cleavages, 2; mass type, average; mass accuracy for full MS, 1 Da; mass accuracy for MSn, 250 ppm; precursor intensity cut-off, 5%; database, “N-glycans (topology)” with prefilter human monosaccharides enabled; reducing end, reduced; PerDeriv type, perMe; derivative mass and adduct were set according to experimental conditions, isotopically heavy methyl (13C-MeI) or Na/Li as appropriate. Raw mass spectrometry data files have been deposited at GlycoPOST (https://glycopost.glycosmos.org/) under the Accession ID number GPST000123.
Results
Sample preparation
Our workflow can be applied to any biological starting material. In the first preparation steps, glycosphingolipids are extracted from the biological sample before acetone precipitation to obtain protein rich powder (Fig. 1). Starting with 1 to 2 mg of protein powder each, we can prepare N- and O-glycans. For N-glycans, the glycoproteins are proteolytically cleaved into glycopeptides (usually by trypsin and/or chymotrypsin) and purified using a C18 solid-phase extraction column. N-glycans are then enzymatically released with PNGase F. For liquid chromatography-nanospray ionization-mass spectrometry (LC-NSI-MS) analysis, released N-glycans can also be reduced in mild sodium borate conditions before borate removal and permethylation for improved chromatographic separation (see supplemental Fig. S1). O-glycans are chemically released by reductive β-elimination, followed by desalting, borate removal, and C18 clean up. Following permethylation and phase partitioning, we subjected the organic phase, which contains neutral and nonsulfated glycans, to analysis by LC-NSI-MS using various buffer and gradient conditions.
O-glycan separation and mobile phase optimization
To test the separation of permethylated O-glycans using our platform, initial RP C18 nanoLC-separations were performed on O-glycans released by reductive beta-elimination from primary mTEC. These initial separations used conventional acidic proteomic mobile phases at an elevated column oven temperature (60 °C) to facilitate baseline separation (Fig. 2A). As has been previously described and also in our analysis, lower column oven temperatures (40 °C and 50 °C) resulted in significant peak broadening (10). By generating extracted ion chromatograms (EICs) of expected glycan species, the major mTEC O-glycans were resolved, and multiple EIC peaks were identified for some, revealing the possibility of isomeric structures (Fig. 2B).
One disadvantage of using acidic mobile phases that are standard for proteomic analyses is the formation of multiple alkali metal glycan adducts which makes data analysis and, importantly, quantification more difficult. As demonstrated for a simple sialyl Tn O-glycan structure, multiple ions including protonated [M + H]+, lithiated [M+Li]+, sodiated [M+Na]+, and potassiated [M + K]+ adducts were detected in a Full MS scan using conventional proteomics mobile phases (Fig. 2C). To promote the ionization into only one molecular ion species and to enhance cross-ring cleavages, LiOAc was supplemented in the mobile phases at 0.1 mM resulting in an improvement but not complete shift toward lithiated ion species. By increasing the concentration to 1 mM LiOAc and reducing acid concentration, adduct ions were shifted to essentially lithiated-only forms (Fig. 2C). Combining EICs with the data-dependent MS/MS data acquisition in real-time, the structures of two isomeric sialyl Tn O-glycans were resolved with the minor branched structure eluting earlier (35 min) than the more abundant linear structure (42 min) (Fig. 2D).
N-glycan separation and gradient optimization
To assess the feasibility of separating permethylated N-glycans, we used PNGaseF-released N-glycans from RNAseB as our glycan source. Initial peak-splitting during the LC-separation was observed, presumably because of mutarotation of the anomeric configuration at the free reducing end before permethylation (α- versus β-anomer; supplemental Fig. S1A). To ameliorate this complication, reduction of PNGase-released N-glycans before permethylation is strongly recommended for simplifying glycans to elute as one peak rather than two α- and β-anomer peaks (supplemental Fig. 1B). Peak splitting was not observed in our mTEC O-glycan analysis because the reducing termini of the released O-glycans are reduced during the release by reductive β-elimination as part of our standard workflow (Fig. 1).
Using an initial separation gradient of 30% to 70% mobile phase B over 150 min, baseline separation was achieved for reduced and permethylated paucimannosidic, high-mannose, and complex type N-glycan structures released from fly protein powder, RNAseB, and Fetuin, respectively (Fig. 3A). As this gradient might be insufficient for separation of larger N-glycans such as tetra-antennary structures, a modified gradient with a rapid increase in the first 5 min to 45% B followed by a shallow gradient of 45% to 70%B over 150 min was applied and was observed to not change the overall elution pattern of the tested N-glycans (Fig. 3B). Using our platform and elution conditions, attempts to shorten the gradient to less than 180 min as described by others resulted in suboptimal separation of glycans and increased numbers of co-eluting species (3, 9, 10).
Chromatographic retention times can be converted to glucose units
In many instances, the combination of LC retention time and accurate mass of a detected analyte can lead to structural assignment by reference to well-characterized standards. However, assignment/annotation solely based on retention time and mass is frequently impractical because of technical challenges associated with variability in column performance, clogged tubing, clogged emitters, etc. Therefore, a more robust relationship between glycan structure and retention time can be achieved by referencing glycan elution positions to a hexose polymer ladder, yielding retention times in glucose units or g.u (19, 20). Dextran polymers ranging in length (DP) from 4 to 10 monosaccharide units where reduced and permethylated with 13C-MeI to produce a dextran reference ladder of g.u. standards (DP4-10). Aliquots of the reference ladder were injected before and after sample acquisition queues, and the resulting retention times were averaged from EICs for each predicted mass within 10 ppm (Fig. 4A).
To test the robustness of our method for characterizing the elution positions of glycans with greater complexity than the dextran ladder, we analyzed four CFG standards (107, 108, 109 and 110) which represent bisected, complex branched N-glycans and one CFG standard (121), a sialylated biantennary N-glycan. Separation of all five reduced and 13C-permethylated CFG standards was achieved resulting in standard 107 (bisected, biantennary N-glycan) eluting before standard 108 (bisected, triantennary N-glycan with two GlcNAc residues attached to the core α1,3 mannose). Interestingly, a gap of 6 min in retention time was detected between the elution of the isomeric structure of standard 108, which is standard 109 (bisected, triantennary N-glycan with two GlcNAc residues attached to the core α1,6 mannose) demonstrating the power of our modified gradient to achieve separations based on size and hydrophobicity (Fig. 4B). CFG standard 110 (bisected, tetra-antennary N-glycan) eluted approximately 2.6 min after standard 109 and before the final sialylated, biantennary N-glycan standard 121. Using the linear regression fitting from the dextran ladder runs before and after the sample queue, the observed retention times were converted to reverse phase glucose units (RP g.u.) to express the elution positions of the analyzed CFG standards (Fig. 4C). Dextran ladder was analyzed before a large sample acquisition queue and subsequently after 25 days of continuous operation (supplemental Fig. S2). Retention times of isomaltose polymers after nearly 1 month of continuous use were consistent within 2 to 3%. Furthermore, retention times were consistent regardless of amount injected, in a dynamic range of 15.6 to 1000 fmol of injected standard (supplemental Fig. S3).
Similar to the N-glycan standards, we tested the ability of our method to separate chemically synthesized, reduced, and 13C-permethylated O-glycan isomeric standards derived from the human pathogen Trypanosoma cruzi, the causative agent for Chagas disease in South America (21, 22). The four tetrasaccharide O-glycan isomers analyzed only differ by either the cyclic form of a single monosaccharide (pyranose or furanose) or a single linkage. Owing to the very small differences in isomeric confirmation between these four monosaccharides, a modified gradient was applied (35–40% B over 170 min) (Fig. 5). The four standards were analyzed separately (Fig. 5, A–D) or as a mixture (Fig. 5E) and demonstrated nearly baseline separation using the modified gradient program. Of the four isomeric O-glycan structures, the pyranose structures eluted over 20 min earlier than furanose-containing isoforms. As described previously, the observed retention times were converted to RP g.u. using the linear regression fitting from dextran ladder runs before and after the sample queue using the same gradient program (Fig. 5F).
Characterization of N-glycans released from recombinant HIV gp120 expressed in glyco-engineered P. pastoris
The data described above demonstrates the ability of our method to resolve N- and O-glycan structures derived from abundant glycoproteins or prepared as synthetic standards. To assess the applicability of our method for detecting biologically relevant glycosylation patterns, we analyzed N-glycans released from recombinantly expressed HIV gp120 envelope glycoproteins harvested from six different glyco-engineered strains of P. pastoris (13, 14). Each of these strains lack glycosyltransferase activities that result in the generation of restricted cellular N-glycan profiles predicted to be to predominated by specific high-mannose or complex type glycans. After release, reduction, and permethylation, N-glycans from each gp120 preparation were first separated and analyzed individually, demonstrating that the detected N-glycans are consistent with the targeted glycosyltransferases in each glyco-engineered strain (Fig. 6, A–F). For example, N-glycans of the composition Hex3HexNAc2-ol were only detected in the strain engineered to generate Man3GlcNAc2 structures on gp120 glycoprotein (Fig. 6A) and Hex5HexNAc2-ol in the Man5GlcNAc2 engineered strain (Fig. 6B). A similar pattern was observed in engineered strains producing only Man8GlcNAc2 and Man9GlcNAc2 as well as Hex8HexNAc2-Hex10HexNAc2 structures (Fig. 6, C and D). Pichia strains engineered to synthesize N-glycan structures with terminal galactose residues or terminal sialic acid modifications also exhibited Man5GlcNAc2 structures in addition to the expected complex type structures (Fig. 6, E and F). To assess the capacity of our LC-MS system to resolve the range of glycans released from these expression platforms, we mixed equal amounts of each sample and analyzed as described above. We were able to accurately detect and resolve all of the various N-glycan structures derived from the recombinant gp120 glycoforms that were detected when each were analyzed individually (Fig. 6G). Furthermore, we applied an automated MSn workflow which targeted the neutral loss of permethylated sialic acid (Δm/z = 375) to trigger product dependent MS3-5 acquisitions. Because lithium was included in the LC separation mobile phases, this workflow revealed that the terminal sialic acids of the N-glycan structures released from these gp120 preparations are α2,6-linked (Fig. 6, H and I).
Analysis of O-glycan isomers from biological samples
O-linked glycans linked to Ser/Thr residues can be divided into various subtypes depending on the initiating monosaccharide linked directly to the polypeptide backbone. The most abundant class is the mucin-type O-GalNAc-initiated structures. Other types of O-glycans include O-Mannose, O-Fucose, O-Glucose, O-GlcNAc, O-Xylose, all of which have diverse biological functions (23, 24). Previous studies have documented the detection of both O-GalNAc and O-Mannose-initiated disaccharides and Galβ3GalNAc and GlcNAcβ2Man, respectively, in mouse brain (12, 25). In direct infusion MS, these glycans are detected at the same m/z because they have the same Hex1HexNAc1 composition. While MS/MS fragmentation of the permethylated disaccharides can detect the presence of these two glycans, the fragmentation patterns of the isolated ions can be complicated as they are a mix of the two species and quantification of each individual disaccharide is problematic. To identify the presence or absence of Hex1HexNAc1 isomers in mouse brain, we analyzed the released and permethylated glycans by our LC-MS workflow (Fig. 7A). An EIC of the expected singly charged, lithiated structures (m/z 518.3) revealed two peaks of differing relative abundance at 17.8 and 19.2 min. MS2 fragmentation of m/z 518 at the respective times is consistent with the HexNAc-initiated disaccharide (O-GalNAc) eluting earlier than the Hexose-initiated structure (O-mannose), as determined by unique fragments at the reducing terminus (Fig. 7A, insets).
We also analyzed O-glycans harvested from cells that were glyco-engineered to be deficient in the O-Mannosylation pathway implicated in congenital muscular dystrophies (26, 27, 28). The glycosyltransferase POMGNT1 (Protein O-Linked Mannose N-Acetyl-glucosaminyltransferase 1) transfers β2-linked GlcNAc to an underlying α-mannose linked to Ser or Thr to generate Core M1-type glycans which can be further extended by B4GALT1 with β4-linked galactose (29). O-glycans were released from glycoproteins harvested from HEK293 wildtype cells or HEK293 cells harboring a Crispr-Cas9-mediated global disruption of the POMGNT1 locus (30). As a control, a non-O-Mannose structure (Xylα3Xylα3Glc-ol), which modifies a consensus found in the EGF repeats of a relatively small number of proteins (31, 32, 33), was detected (m/z 593.3 [M+Li]+) in both samples (Fig. 7B). An EIC of m/z 722.1 ([M+Li]+) which corresponds to the composition Hex2HexNAc1 (O-mannose Core M1 structure, Galβ4GlcNAcβ2Man-ol) demonstrates the presence of the glycan in HEK293 WT but not in the POMGNT1-deficient cell line.
Discussion
The field of glycomics has historically been eager to engage new analytic platforms based on their promise to deliver greater sensitivity and enhanced detection of structural features (34, 35, 36, 37). Despite tremendous advances in various analytic technologies, it remains true that no single technique, besides NMR, is capable of routinely and completely defining glycan structures. Thus, the field will continue to benefit from the development of an expanding array of analytic approaches. We have combined two powerful technologies, glycan permethylation and LC, in a manner that should be adoptable by analytic laboratories familiar with standard LC-MS/MS proteomic techniques. To facilitate adoption of our nanoLC-NSI-MSn approach, we have optimized run conditions and buffer compositions for analysis of N-linked or O-linked glycans. Using these conditions, we achieved baseline separations of challenging isomeric glycans and also demonstrated the power of on-the-fly MSn to distinguish structural features such as the linkage position of sialic acids.
Other investigators have previously described LC-based separations of glycans and, more particularly, of permethylated glycans (3, 9, 10). These approaches are all valuable contributions to the arsenal of glycomic technology. In many cases, these techniques make use of glycan standards to define the retention times of known structures, generating a database of RT values that, when combined with accurate mass detection, can provide confident identification of components in complex mixtures. Glycans with novel RT values can be at least partially characterized using these methods by applying off-line exoglycosidase digestions (sequencing) and re-analysis (37). Our approach is also amenable to off-line exoglycosidase digestion with re-analysis, takes advantage of RT and accurate mass, and will, over-time build a library of RP g.u. values to help define structural features of novel glycans. The resolving power of our method is most clearly demonstrated by its ability to generate near-baseline separation of four tetrasaccharides that share the same monosaccharide composition and topology but differ only in a single linkage position (see Fig. 5). To this resolving power, we add the capability of capturing MSn data on-the-fly to help assign structures with novel RT values in a single analytic run (see Figs. 6 and 7). The inclusion of lithium in the mobile phase enhances this capability by facilitating the generation of cross-ring fragments indicative of specific structural features (5).
We optimized the lithium concentration in the mobile phase so that we detect glycans as a single adduct species. This simplification allows quantification to be done from peak areas detected in MS1 for a single adduct, enhancing sensitivity and resolution of glycan species across the chromatogram. As mentioned above, our method incorporates the inclusion of RT standards (DP4-10) which allows the calculation of relative retention times in Glc units (RP g.u.), facilitating cross-platform and run-to-run identification of glycans of interest. We have previously demonstrated that permethylated glycans yield nearly the same signal intensity regardless of their mass or complexity, unless modified with sulfate or phosphate (4). Thus, known amounts of DP standards can not only serve as RP g.u. references but may facilitate quantification of molar amounts of analyte. However, additional work is required to determine whether this approach can be validated as semiquantitative or can provide absolute quantitation across various classes of glycans. A current limitation of our approach is the need for well-defined glycan standards to fully characterize complex mixtures of unknown structures. Ideally, a standard would be available for every glycan in the analyte and, over time, extensive tables of RP g.u. values would be generated to facilitate rapid identification of components of interest. Confidence in the usefulness of RP g.u. tables populated by our approach will be heightened by the MSn data that are generated to support structural features of standards or mixtures of standards.
Previously published glycomic studies from our laboratories and many others, more than can be reasonably cited, have relied heavily on direct infusion of permethylated glycans. These experiments, while valuable for in-depth structural analysis, require the presence of the operator for sample load, run, and subsequent wash before load of next sample. The nanoLC-NSI-MSn method we describe here, like other LC-based methods, is amenable to autosampling, greatly increasing analytic throughput. Furthermore, data collection is almost entirely automated compared with more labor-intensive direct infusion experiments. This marked benefit also highlights a bottleneck in all glycomic analysis, but especially for LC-based analysis, namely, facile analysis of large data sets. Software platforms capable of handling and annotating large glycan data sets generated by nanoLC-NSI-MSn approaches are only in their infancy (38). We made use of our in-house developed GRITS Toolbox to sieve through the data by providing structural candidates based on MS/MS data (18). However, neither this platform nor any other is currently capable of integrating MSn data for permethylated glycans across an entire LC run. Furthermore, software packages developed for LC-based proteomic applications are not ideal for interpreting LC of permethylated glycans because the underlying assumptions regarding isotopic distributions of amino acids are not transferable to permethylated monosaccharides.
Fully automated annotation packages for handling permethylated glycan LC-separation data are a significant need for expanding the adoption of this powerful approach for glycomics. Here, we have provided a useful and robust separation platform that will facilitate the generation of data essential for developing and testing such tools. In the meantime, as these new software tools are developed and implemented, the expanded ability of our nanoLC-NSI-MSn platform to resolve and quantify glycan isomers, combined with the capacity for high-throughput analysis, will provide new opportunities for investigating the diversity of glycosylation at scales capable of answering biologically important questions.
Data availability
Raw mass spectrometry data files have been deposited at GlycoPOST (https://glycopost.glycosmos.org/) under the accession ID number GPST000123.
Conflict of interest
The authors declare no competing interests.
Acknowledgments
We wish to gratefully acknowledge Brent Weatherly for his graciously given advice, encouragement, and assistance regarding computational analysis and informatic considerations over the course of this project and others, before his untimely death. We also thank Rob Bridger for his technical support of our mass spectrometry instrumentation, Dr Christopher M. West (University of Georgia) for providing the T. cruzi O-Glycan isomeric standards and Dr Henrik Clausen (University of Copenhagen) for providing HEK293 cell lines. We are grateful for technical advice from Drs. Christa Feasley, Julian Saba, and Sergei Snovida (ThermoFisher Scientific). Four N-linked glycan standards (standards 107–110) were received from the Consortium for Functional Glycomics (CFG), and an additional N-linked glycan standard (standard 121) was received as a gift from Dr James Paulson (The Scripps Research Institute). The instrumentation utilized in this analysis was provided by ThermoFisher in partial support of the ThermoFisher Center of Excellence for Glycomics and Glycoproteomics at the CCRC, Lance Wells and Michael Tiemeyer, Co-Directors.
Funding and additional information
This work was supported in part by grants from National Institute of Health/National Institute of General Medical Sciences (R01GM111939 to L. W., P41GM103490 to M. T. and L. W., R21AI123161 to L. W., and R01GM130915 to L. W.) and National Institutes of Health/National Institute of Allergy and Infectious Disease (R21AI123161 to L. W.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author contributions
S. K., M. O. S, M. T., and L. W. designed research; S. K. and M. O. S performed research; S. L. contributed key reagents; S. K., M. O. S, M. T., and L. W. analyzed data; S. K., M. O. S, M. T., and L. W. wrote the paper.
Footnotes
This article contains supporting information.
Contributor Information
Lance Wells, Email: lwells@ccrc.uga.edu.
Michael Tiemeyer, Email: mtiemeyer@ccrc.uga.edu.
Supporting Information
References
- 1.Werz D.B., Ranzinger R., Herget S., Adibekian A., von der Lieth C.W., Seeberger P.H. Exploring the structural diversity of mammalian carbohydrates ("glycospace") by statistical databank analysis. ACS Chem. Biol. 2007;2:685–691. doi: 10.1021/cb700178s. [DOI] [PubMed] [Google Scholar]
- 2.Marino K., Bones J., Kattla J.J., Rudd P.M. A systematic approach to protein glycosylation analysis: A path through the maze. Nat. Chem. Biol. 2010;6:713–723. doi: 10.1038/nchembio.437. [DOI] [PubMed] [Google Scholar]
- 3.Vreeker G.C., Wuhrer M. Reversed-phase separation methods for glycan analysis. Anal. Bioanal. Chem. 2017;409:359–378. doi: 10.1007/s00216-016-0073-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Mehta N., Porterfield M., Struwe W.B., Heiss C., Azadi P., Rudd P.M., Tiemeyer M., Aoki K. Mass spectrometric quantification of N-linked glycans by reference to exogenous standards. J. Proteome Res. 2016;15:2969–2980. doi: 10.1021/acs.jproteome.6b00132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ashline D., Singh S., Hanneman A., Reinhold V. Congruent strategies for carbohydrate sequencing. 1. Mining structural details by MSn. Anal. Chem. 2005;77:6250–6262. doi: 10.1021/ac050724z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lapadula A.J., Hatcher P.J., Hanneman A.J., Ashline D.J., Zhang H., Reinhold V.N. Congruent strategies for carbohydrate sequencing. 3. OSCAR: An algorithm for assigning oligosaccharide topology from MSn data. Anal. Chem. 2005;77:6271–6279. doi: 10.1021/ac050726j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Reinhold V., Zhang H., Hanneman A., Ashline D. Toward a platform for comprehensive glycan sequencing. Mol. Cell Proteomics. 2013;12:866–873. doi: 10.1074/mcp.R112.026823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zhang H., Singh S., Reinhold V.N. Congruent strategies for carbohydrate sequencing. 2. FragLib: An MSn spectral library. Anal. Chem. 2005;77:6263–6270. doi: 10.1021/ac050725r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hsiao C.T., Wang P.W., Chang H.C., Chen Y.Y., Wang S.H., Chern Y., Khoo K.H. Advancing a high throughput glycotope-centric glycomics workflow based on nanoLC-MS(2)-product dependent-MS(3) analysis of permethylated glycans. Mol. Cell Proteomics. 2017;16:2268–2280. doi: 10.1074/mcp.TIR117.000156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhou S., Hu Y., Mechref Y. High-temperature LC-MS/MS of permethylated glycans derived from glycoproteins. Electrophoresis. 2016;37:1506–1513. doi: 10.1002/elps.201500568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Aoki K., Perlman M., Lim J.M., Cantu R., Wells L., Tiemeyer M. Dynamic developmental elaboration of N-linked glycan complexity in the Drosophila melanogaster embryo. J. Biol. Chem. 2007;282:9127–9142. doi: 10.1074/jbc.M606711200. [DOI] [PubMed] [Google Scholar]
- 12.Stalnaker S.H., Aoki K., Lim J.M., Porterfield M., Liu M., Satz J.S., Buskirk S., Xiong Y., Zhang P., Campbell K.P., Hu H., Live D., Tiemeyer M., Wells L. Glycomic analyses of mouse models of congenital muscular dystrophy. J. Biol. Chem. 2011;286:21180–21190. doi: 10.1074/jbc.M110.203281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hamilton S.R., Davidson R.C., Sethuraman N., Nett J.H., Jiang Y., Rios S., Bobrowicz P., Stadheim T.A., Li H., Choi B.K., Hopkins D., Wischnewski H., Roser J., Mitchell T., Strawbridge R.R., et al. Humanization of yeast to produce complex terminally sialylated glycoproteins. Science. 2006;313:1441–1443. doi: 10.1126/science.1130256. [DOI] [PubMed] [Google Scholar]
- 14.Hamilton S.R., Gerngross T.U. Glycosylation engineering in yeast: The advent of fully humanized yeast. Curr. Opin. Biotechnol. 2007;18:387–392. doi: 10.1016/j.copbio.2007.09.001. [DOI] [PubMed] [Google Scholar]
- 15.You Y., Richer E.J., Huang T., Brody S.L. Growth and differentiation of mouse tracheal epithelial cells: Selection of a proliferative population. Am. J. Physiol. Lung Cell Mol. Physiol. 2002;283:L1315–1321. doi: 10.1152/ajplung.00169.2002. [DOI] [PubMed] [Google Scholar]
- 16.Aoki K., Porterfield M., Lee S.S., Dong B., Nguyen K., McGlamry K.H., Tiemeyer M. The diversity of O-linked glycans expressed during Drosophila melanogaster development reflects stage- and tissue-specific requirements for cell signaling. J. Biol. Chem. 2008;283:30385–30400. doi: 10.1074/jbc.M804925200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Anumula K.R., Taylor P.B. A comprehensive procedure for preparation of partially methylated alditol acetates from glycoprotein carbohydrates. Anal. Biochem. 1992;203:101–108. doi: 10.1016/0003-2697(92)90048-c. [DOI] [PubMed] [Google Scholar]
- 18.Weatherly D.B., Arpinar F.S., Porterfield M., Tiemeyer M., York W.S., Ranzinger R. GRITS Toolbox-a freely available software for processing, annotating and archiving glycomics mass spectrometry data. Glycobiology. 2019;29:452–460. doi: 10.1093/glycob/cwz023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lee Y.C., Lee B.I., Tomiya N., Takahashi N. Parameterization of contribution of sugar units to elution volumes in reverse-phase HPLC of 2-pyridylaminated oligosaccharides. Anal. Biochem. 1990;188:259–266. doi: 10.1016/0003-2697(90)90603-7. [DOI] [PubMed] [Google Scholar]
- 20.Takahashi N., Wada Y., Awaya J., Kurono M., Tomiya N. Two-dimensional elution map of GalNAc-containing N-linked oligosaccharides. Anal. Biochem. 1993;208:96–109. doi: 10.1006/abio.1993.1013. [DOI] [PubMed] [Google Scholar]
- 21.Sheikh M.O., Gas-Pascual E., Glushka J.N., Bustamante J.M., Wells L., West C.M. Trypanosoma cruzi 13C-labeled O-Glycan standards for mass spectrometry. Glycobiology. 2019;29:280–284. doi: 10.1093/glycob/cwy111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tarleton R.L. Chagas disease: A solvable problem, ignored. Trends Mol. Med. 2016;22:835–838. doi: 10.1016/j.molmed.2016.07.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Joshi H.J., Narimatsu Y., Schjoldager K.T., Tytgat H.L.P., Aebi M., Clausen H., Halim A. SnapShot: O-glycosylation pathways across kingdoms. Cell. 2018;172:632–632.e632. doi: 10.1016/j.cell.2018.01.016. [DOI] [PubMed] [Google Scholar]
- 24.Varki A., Kornfeld S. In: Essentials of Glycobiology. Varki A., Cummings R.D., Esko J.D., Stanley P., Hart G.W., Aebi M., Darvill A.G., Kinoshita T., Packer N.H., Prestegard J.H., Schnaar R.L., Seeberger P.H., editors. Cold Spring Harbor Laboratory Press; Cold Spring Harbor, NY: 2015. Historical background and overview; pp. 1–18. [Google Scholar]
- 25.Stalnaker S.H., Hashmi S., Lim J.M., Aoki K., Porterfield M., Gutierrez-Sanchez G., Wheeler J., Ervasti J.M., Bergmann C., Tiemeyer M., Wells L. Site mapping and characterization of O-glycan structures on alpha-dystroglycan isolated from rabbit skeletal muscle. J. Biol. Chem. 2010;285:24882–24891. doi: 10.1074/jbc.M110.126474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Falsaperla R., Pratico A.D., Ruggieri M., Parano E., Rizzo R., Corsello G., Vitaliti G., Pavone P. Congenital muscular dystrophy: From muscle to brain. Ital. J. Pediatr. 2016;42:78. doi: 10.1186/s13052-016-0289-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Liu J., Ball S.L., Yang Y., Mei P., Zhang L., Shi H., Kaminski H.J., Lemmon V.P., Hu H. A genetic model for muscle-eye-brain disease in mice lacking protein O-mannose 1,2-N-acetylglucosaminyltransferase (POMGnT1) Mech. Dev. 2006;123:228–240. doi: 10.1016/j.mod.2005.12.003. [DOI] [PubMed] [Google Scholar]
- 28.Sheikh M.O., Halmo S.M., Wells L. Recent advancements in understanding mammalian O-mannosylation. Glycobiology. 2017;27:806–819. doi: 10.1093/glycob/cwx062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Praissman J.L., Wells L. Mammalian O-mannosylation pathway: Glycan structures, enzymes, and protein substrates. Biochemistry. 2014;53:3066–3078. doi: 10.1021/bi500153y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Narimatsu Y., Joshi H.J., Nason R., Van Coillie J., Karlsson R., Sun L., Ye Z., Chen Y.H., Schjoldager K.T., Steentoft C., Furukawa S., Bensing B.A., Sullam P.M., Thompson A.J., Paulson J.C., et al. An atlas of human glycosylation pathways enables display of the human glycome by gene engineered cells. Mol. Cell. 2019;75:394–407.e395. doi: 10.1016/j.molcel.2019.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hase S., Kawabata S., Nishimura H., Takeya H., Sueyoshi T., Miyata T., Iwanaga S., Takao T., Shimonishi Y., Ikenaka T. A new trisaccharide sugar chain linked to a serine residue in bovine blood-coagulation factor-Vii and factor-Ix. J. Biochem-tokyo. 1988;104:867–868. doi: 10.1093/oxfordjournals.jbchem.a122571. [DOI] [PubMed] [Google Scholar]
- 32.Hase S., Nishimura H., Kawabata S.I., Iwanaga S., Ikenaka T. The structure of (Xylose)2glucose-O-serine 53 found in the 1st epidermal growth factor-like domain of bovine blood-clotting factor-Ix. J. Biol. Chem. 1990;265:1858–1861. [PubMed] [Google Scholar]
- 33.Nishimura H., Kawabata S., Kisiel W., Hase S., Ikenaka T., Takao T., Shimonishi Y., Iwanaga S. Identification of a disaccharide (Xyl-Glc) and a trisaccharide (Xyl2-Glc) O-glycosidically linked to a serine residue in the 1st epidermal growth factor-like domain of human factor-Vii and factor-Ix and protein-Z and bovine protein-Z. J. Biol. Chem. 1989;264:20320–20325. [PubMed] [Google Scholar]
- 34.Li Q., Xie Y., Wong M., Lebrilla C.B. Characterization of cell glycocalyx with mass spectrometry methods. Cells. 2019;8 doi: 10.3390/cells8080882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mulloy B., Dell A., Stanley P., J H.P. In: Essentials of Glycobiology. rd, Varki A., Cummings R.D., Esko J.D., Stanley P., Hart G.W., Aebi M., Darvill A.G., Kinoshita T., Packer N.H., Prestegard J.H., Schnaar R.L., Seeberger P.H., editors. Cold Spring Harbor Laboratory Press; Cold Spring Harbor, NY: 2015. Structural analysis of glycans; pp. 639–652. [Google Scholar]
- 36.Reiding K.R., Bondt A., Hennig R., Gardner R.A., O'Flaherty R., Trbojevic-Akmacic I., Shubhakar A., Hazes J.M.W., Reichl U., Fernandes D.L., Pucic-Bakovic M., Rapp E., Spencer D.I.R., Dolhain R., Rudd P.M., et al. High-throughput serum N-glycomics: Method comparison and application to study rheumatoid arthritis and pregnancy-associated changes. Mol. Cell Proteomics. 2019;18:3–15. doi: 10.1074/mcp.RA117.000454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Rudd P.M., Guile G.R., Kuster B., Harvey D.J., Opdenakker G., Dwek R.A. Oligosaccharide sequencing technology. Nature. 1997;388:205–207. doi: 10.1038/40677. [DOI] [PubMed] [Google Scholar]
- 38.Abrahams J.L., Campbell M.P., Packer N.H. Building a PGC-LC-MS N-glycan retention library and elution mapping resource. Glycoconj. J. 2018;35:15–29. doi: 10.1007/s10719-017-9793-4. [DOI] [PubMed] [Google Scholar]
- 39.Varki A., Cummings R.D., Aebi M., Packer N.H., Seeberger P.H., Esko J.D., Stanley P., Hart G., Darvill A., Kinoshita T., Prestegard J.J., Schnaar R.L., Freeze H.H., Marth J.D., Bertozzi C.R., et al. Symbol nomenclature for graphical representations of glycans. Glycobiology. 2015;25:1323–1324. doi: 10.1093/glycob/cwv091. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw mass spectrometry data files have been deposited at GlycoPOST (https://glycopost.glycosmos.org/) under the accession ID number GPST000123.