Skip to main content
Molecular & Cellular Proteomics : MCP logoLink to Molecular & Cellular Proteomics : MCP
. 2021 Feb 6;20:100060. doi: 10.1074/mcp.R120.002090

Recent Advances in Software Tools for More Generic and Precise Intact Glycopeptide Analysis

Weiqian Cao 1,2,3,, Mingqi Liu 1, Siyuan Kong 1, Mengxi Wu 1,4, Yang Zhang 1,3, Pengyuan Yang 1,2,3,4,
PMCID: PMC8724820  PMID: 33556625

Abstract

Intact glycopeptide identification has long been known as a key and challenging barrier to the comprehensive and accurate understanding the role of glycosylation in an organism. Intact glycopeptide analysis is a blossoming field that has received increasing attention in recent years. MS-based strategies and relative software tools are major drivers that have greatly facilitated the analysis of intact glycopeptides, particularly intact N-glycopeptides. This article provides a systematic review of the intact glycopeptide-identification process using MS data generated in shotgun proteomic experiments, which typically focus on N-glycopeptide analysis. Particular attention is paid to the software tools that have been recently developed in the last decade for the interpretation and quality control of glycopeptide spectra acquired using different MS strategies. The review also provides information about the characteristics and applications of these software tools, discusses their advantages and disadvantages, and concludes with a discussion of outstanding tools.

Keywords: software tools, intact glycopeptide analysis, mass spectrometry, bottom-up experimental strategies, quality control, software applications

Abbreviations: CID, collision-induced dissociation; ETD, electron-transfer dissociation; FDR, false discovery rate; HCD, higher-energy collision dissociation; PD, product dependent; Q-TOF, quadrupole-time of flight; SCE, stepped collision energy

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • Summarize the development of software tools for intact glycopeptide analysis.

  • Survey the MS acquisition strategies for glycopeptide spectrum acquisition.

  • Conclude the quality control methods for glycopeptide-spectrum matching.

  • Show the application of intact glycopeptide analysis in glycoproteomic research.

In Brief

This article provides a systematic review of the most recent MS-based strategies and corresponding software tools for the analysis of intact glycopeptides, particularly intact N-glycopeptides, reported in the last decade, including the process of identifying N-glycopeptides from MS data, the existing methods of MS data acquisition and interpretation, the quality control methods, the display of results, and the software applications.


Protein glycosylation has long been known as a heterogeneous post-translational modification generating greater protein diversities than other post-translational modifications (1). These diversities alter the functions of proteins and subsequently exert a profound effect on various biological processes (2, 3). The ability to perform an in-depth and precise identification of glycoproteins is the key to obtaining a comprehensive and accurate understanding of the role of glycosylation in an organism.

MS has been recognized as an ideal tool for glycoprotein analysis since the first report in 1978 (4). MS-based glycoproteomics, which emerged with the goal of defining the glycoproteome of a biological system, has made strides in deglycosylation-centric glycoproteomic studies in the past few decades (5, 6, 7, 8, 9). Previously, enriched glycopeptides were usually deglycosylated to generate an identifiable mass tag on glycosylation sites before the MS analysis that can be directly searched using routine proteomic software. Although the use of the deglycosylation-centric strategy as a compromise greatly simplified the MS identification and expanded our knowledge of glycoproteins, the site-specific glycan information was lost during the removal of glycans from glycopeptides. Subsequently, a more challenging issue in the high-throughput and precise identification of intact glycopeptides (also known as site-specific glycan analysis) was identified to better assess the biological attributes of the glycosylation.

Since then, an analysis of intact glycopeptides has been a challenge for several reasons. First, the inherently low abundance and ionization efficiency of glycopeptides make them difficult to identify using MS. Second, MS dissociation energy for glycans and peptides differ, hampering the acquisition of informative glycopeptide spectra using the single MS fragmentation method. Third, glycopeptides, which comprise a group of various types of monosaccharides linked to amino acid residues with different compositions and linkages, are so complicated that routine proteomic search engines are not qualified for an analysis of such enormous numbers of variable modifications. Many robust and mature glycopeptide enrichment methods developed in the past decades have dramatically improved the abundance of glycopeptides in MS analysis and considerably reduced the signal suppression of glycopeptide in the mass spectra by nonglycopeptides. Subsequently, the urgent needs for intact glycopeptide analysis are the ability to obtain informative glycopeptide spectra data and reliably decipher those data with expert software.

In recent years, substantial progress has been achieved in addressing the immediate needs for an analysis of intact glycopeptides, particularly N-glycopeptides (10). Some generic analytical pipelines and useful software became available and enabled a high-throughput and precise analysis of intact glycopeptides. Through intact glycopeptide analyses, glycopeptide sequences with attached glycan composition can be conceivably gained. This review will specify the most recent MS-based strategies and corresponding software tools for the analysis of intact glycopeptides, particularly intact N-glycopeptides, reported in the last decade, while reviewing the process of identifying N-glycopeptides from MS data, the existing methods of MS data acquisition and interpretation, the quality control methods, the display of results, and the software applications (Table 1).

Table 1.

A summary table of the most recent MS-based strategies and corresponding software tools for the analysis of intact N-glycopeptides

Pipeline Spectral collection
Scoring/matching
Quality control
MS/MS
MS3/de- glycopeptide Peptide
Glycan
Diagnostic ions Peptide Glycan Intact glycopeptide
HCD CID ETD Method Database Method Database
GlycoFragwork, etc. (Tang's laboratory) BY-C Y-a c/z Probability matches Selected proteins De novo Glycan classification Decoy sequence
GlycoPep Detector, etc. (Desaire's laboratory) Y-a c/z Probability matches Selected proteins Probability matches Manual input Decoy sequence
GlycoPeptide Search Y-a b/y Existing algorithm Selected proteins Core Y-ions matches GlycomeDB Spectral filtration Decoy sequence
GP Finder Y-a Probability matches Selected proteins Probability matches Glycomic analysis Spectral filtration Decoy precursor mass
Sweet-Heart, etc. (Khoo's laboratory) BY-c Y-a c/z b/y from MS3 Existing algorithm Proteome Probability matches Semi de-novo Product-trigger Decoy sequence
ArMone Y-c De-glyco neededb Mass compare Sub- Proteomee Core Y-ions matches GlycomeDB
GRIP Y-c Y-a De-glyco neededb Mass compare Sub- Proteomee Probability matches GlycomeDB
GPQuest Y-a De-glyco neededc Spectral library Sub- Proteomee Mass compare Glycomic analysis Spectral filtration Decoy sequence
GlycoFinder BY-a Mass compare Selected proteins Core Y-ions matches Manual input Spectral filtration
GlycoMaster DB Y-a c/z Existing algorithm Proteome Probability matches GlycomeDB Spectral filtration Decoy sequence
MAGIC B b/y Y-a Existing algorithmd Proteome Core Y-ions matches Semi de-novo Spectral filtration Decoy sequence
I-GPA, etc. (Yoo's laboratory) B b/y Y-a Probability matches Sub- Proteomef Probability matches GlycomeDB Glycan classification Decoy glycopeptide
GlycoPAT b/y Y-a c/z Probability matches Selected proteins Probability matches Semi de-novo Spectral filtration Decoy glycopeptide
SugarQB B b/y Y-a Existing algorithmd Proteome Core Y-ions matches Manual input Spectral filtration Decoy sequence
glyXtoolMS B b/y Y-a Probability matches Selected proteins Core Y-ions matches GlyTouCan Glycan classification
pMatchGlyco B b/y Y-a De-glyco neededc Spectral library Sub- Proteomee Mass compare GlycomeDB Decoy glycopeptide
GPSeeker b/y Y-a Existing algorithm Proteome Probability matches Glycomic analysis Spectral filtration Decoy sequence Decoy fragmentg
O-Search BY-c Existing algorithmd Proteome Core Y-ions matches Semi de-novo Spectral filtration Decoy sequence
pGlyco BY-c Y-a b/y from MS3 Existing algorithm Proteome Probability matches GlycomeDB Product-trigger Decoy sequence Decoy fragment
pGlyco 2.0 B b/y Y-a Analyzeda Existing algorithm Proteome Probability matches GlycomeDB Spectral filtration Decoy sequence Decoy fragment Comprehensive probability

CID, collision-induced dissociation; ETD, electron-transfer dissociation; HCD, higher-energy collision dissociation; Y-a, all Y ions; Y-c, Y ions from N-glycan core.

a

Different collisional energies.

b

Pregenerated de-glycopeptide database.

c

Pregenerated de-glycopeptide spectra.

d

Substitution of precursor mass.

e

Candidates from de-glyco experiment.

f

Candidates from the sample-dependent database.

g

The decoy method from pGlyco.

MS Fragmentation Methods for Glycopeptides

In the past 10 years, software tools for the automatic identification of intact glycopeptides using different LC-MS/MS strategies were developed. Regardless of the type of strategy used, sufficient fragments from both peptide backbones and the attached glycans are required. Informative glycopeptide mass spectra are difficult to acquire because the fragmentation varies widely between the two components because of the differences in the physical and chemical properties of the glycosidic bonds and peptide bonds. Dissociation methods, including resonance activation (ion trap) collision-induced dissociation (CID), beam-type (quadrupole-time of flight [Q-TOF]) CID, higher-energy collision dissociation (HCD), and electron-transfer dissociation (ETD), are the most commonly used approaches in glycopeptide identification.

CID Fragmentation

Mass spectra generated by ion-trap CID are dominated by Y-ions but contain a few B-ions and a limited number of b/y-ions because of the single-bond cleavages of precursor ions, low dissociation energy, and the “1/3” cutoff in the low m/z region (11), which can be used to identify the conjugated glycans but not for confident identification of the location of the glycosylation site and peptide sequences, particularly in complex samples. Thus, this technique has only been applied to analyze glycopeptides in simple samples in some previous studies (12, 13, 14, 15).

HCD Fragmentation

In contrast, HCD tends to produce abundant diagnostic oxonium ions, such as m/z 138.055, 204.087 and 366.140 fragments. It also yields B- and Y-ions for glycan identification and partial b/y-ion series for peptide identification. Studies showed an increase in the b/y-ion series as the HCD collision energy increased (16). Different collision energies in HCD-MS/MS produce complementary fragments of the glycan and peptides (17, 18). GlycoFinder used HCD collision energies of 30% and 50% to analyze glycopeptide in simple samples (19). The fragmentation patterns produced by the beam-type CID in the Q-TOF spectra resemble the patterns produced by HCD in C-trap on Orbitrap mass spectrometers.

Many studies have investigated the behaviors of glycopeptide fragmentation using different dissociation methods (20, 21, 22, 23, 24, 25). In 2016, we used various MS/MS collision parameters, including CID and HCD, each with nine different energies, as well as electron-transfer/collision-induced dissociation/electron-transfer/higher-energy collision dissociation. Stepped collision energies (SCEs)–HCD–MS/MS at 20-30-40% energies generated the most informative and abundant fragment ions for both the glycan and peptide of a glycopeptide in a single spectrum (26). Since then, SCE–HCD–MS/MS has been widely used in high-throughput identification of intact glycopeptides (27, 28).

ETD Fragmentation

ETD primarily breaks N-Cα bonds and generates c/z-ions. Ideally, ETD generates abundant c/z-ions from peptide backbone fragments, which are very useful for the identification of glycosylation sites and peptide sequences. However, ETD often suffers from incomplete fragmentation, leading to a large amount of residual precursor ions (29). Thus, ETD is rarely used alone and is frequently combined with HCD or CID methods for glycopeptide identification. With the development of the ion-trigger technique in MS, HCD–product-dependent (PD)–ETD increases the efficiency of combining fragments because ion-trigger technology generally avoids the reaction time of ETD wasted on nonglycopeptide dissociation. Hybrid dissociation techniques developed for an Orbitrap Fusion mass spectrometer, such as electron-transfer/collision-induced dissociation (30, 31), electron-transfer/higher-energy collision dissociation (293233), and activated ion electron transfer dissociation (34, 35), where a supplementary energy source is applied to all fragment ions formed by ETD, have shown great potential in N-glycopeptide identification (11, 35, 36, 37, 38, 39).

Bottom-Up Experimental and MS Acquisition Strategies

In a typical bottom-up intact glycopeptide-identification strategy, glycopeptides are first enriched to reduce the complexity and increase the detectability of glycopeptides. Then, enriched intact glycopeptides are analyzed using different LC-MS/MS strategies and interpreted with different software tools. After surveying the development of software tools for MS-based glycopeptide-identification strategies in last 10 years, two major types of software have been used according to the experimental strategy with or without a deglycosylation procedure: one is the software combining the results from two LC-MS/MS analyses (one with deglycosylated peptides and another with native intact glycopeptide) and the other is the software that directly identifies mass spectra from an intact glycopeptide analysis.

Experimental and MS Acquisition Strategy With the Deglycosylation Procedure

Generally, in the experimental strategy with the deglycosylation step, two experimental workflows, deglycosylated peptide analysis and intact glycopeptide analysis, are performed. A spectral library of peptides containing glycosylation sites in the samples is first built by analyzing the isolated deglycosylated peptides using HCD/CID LC-MS/MS. Spectra of intact glycopeptides are selected using diagnostic oxonium ions. The glycan composition at a glycosylation site is determined by matching the mass difference between the precursor ion of the intact glycopeptide and the deglycosylated peptide to the glycan database. Three main software tools support this identification strategy: GRIP developed by us in 2014 (40), ArMone 2.0 developed by Cheng et al. in 2014 (41), and GPQuest developed by Toghi Eshghi et al. in 2015 (42). Although GRIP, ArMone, and GPQuest have been used for high-throughput profiling of intact N-glycopeptides in complex samples, the two workflows make the experimental operation tedious. GlycoSeq developed by Yu et al. in 2016 (43), which aims to identify glycopeptides in isolated glycoproteins from their CID spectra alone, also requires a preliminary step for identifying the deglycosylated sites.

Experimental and MS Acquisition Strategy Without the Deglycosylation Procedure

In the experimental strategy without the deglycosylation step, software tools were developed to directly identify intact glycopeptides in mass spectra. The software tools were designed either for the interpretation of MS/MS data from one type of fragmentation method or for MS/MS data from multiple fragmentation methods. In early years, many software tools were expertly designed for a specific dissociation-based LC-MS/MS analysis of glycopeptides. For example, GlycoPeptide Search (14, 15), GP Finder (12), and MAGIC (44) were all developed for a Q-TOF–CID MS/MS analysis, in which GP Finder was specifically designed for a nonspecific analysis of digested glycopeptides using Pronase. ProteinProspector (45, 46) and GlycoPep Detector (47) were designed for an ETD–MS/MS analysis. GlycoPep Grader (13), which was developed by the same laboratory as GlycoPep Detector, was used for CID–MS/MS analyses. Although the experimental procedures of these software-supported analytical strategies are easy to operate, information about both glycan and peptide fragments in a spectrum obtained from a single dissociation method is insufficient and substantially limits the throughput and reliability of the identification. All of these software tools, except MAGIC (44), are only suitable for an analysis of simple samples.

Combined dissociation methods have been applied to analyze glycopeptides and obtain informative glycopeptide mass spectra. HCD, ETD, and ion-trap CID are often combined to dissociate the same precursor ions because of their complementary nature when applied to N-glycopeptide dissociation. Corresponding software tools were developed for deciphering a precursor with multiple fragment types in MS/MS data. Byonic, which is a commercial and most commonly used software (48), supports searching MS/MS data with multiple fragment types. gFinder (49) and I-GPA (50) developed by Yoo et al. were both designed to interpret CID/HCD–MS/MS spectra. GlypID 2.0 improves upon the GlypID tool (51) developed by Tang et al. and supports searches for HCD and CID fragmentation ions. GlycoFragwork developed by the same laboratory characterizes glycopeptide in biological proteome samples by scoring complementary fragmentation techniques, including CID, HCD, and ETD. GlycoPAT (52) supports searches of CID/ETD–MS/MS or CID/HCD/ETD–MS/MS data. GlycoMaster DB (53) was designed to decipher HCD/ETD combined with MS/MS data and HCD-MS/MS data alone. Sweet-Heart, a series of software tools developed by Khoo et al. substantially improved the analytical procedure of glycopeptide identification from multiple fragmentation mass data. Sweet-Heart was first proposed in 2013 as an integrated suite of computational tools enabling the sequencing and identification of glycopeptides from low-resolution and low accuracy CID-MS2 and targeted-CID-MS3 (54). In 2014, taking advantage of the new trihybrid Orbitrap configuration, the authors experimented with adding a parallel ion-trap CID-MS2 data acquisition mode to the HCD-PD-ETD function and optimized Sweet-Heart for this analytical procedure (55).

The combination of HCD/CID with ETD is one of the widely used methods in multiple dissociation strategies. However, the long reaction time and low dissociation efficiency of ETD have limited the throughput of glycopeptide analysis. Recently, a preferable strategy using SCE-HCD-MS/MS and a dedicated search engine pGlyco 2.0 (26) to completely utilize the abundant information in SCE-HCD-MS/MS spectra were developed, enabling a high-throughput analysis of N-glycopeptides.

Interpretation of Glycopeptide Mass Spectra

The method used to interpret complex tandem mass spectra generated from glycopeptides is the core of a search tool. Generally, processes used to interpret glycopeptide mass spectra include screening glycopeptide mass spectra and matching the glycan moiety and carrier peptide with glycopeptide spectra.

Selection of Glycopeptide Mass Spectra

A screen of glycopeptide mass spectra is preferred to avoid interference from nonglycosylated peptide spectra and improve the reliability of the identification. The abundant glycopeptide-specific oxonium ions obtained upon HCD are normally used to determine whether the spectrum is attributed to a glycopeptide. The HexNAc-derived fragment ions m/z 138.055, m/z 204.087, and m/z 366.140 derived from Hex+HexNAc are mostly used as diagnostic ions for the selection of glycopeptide spectra. These diagnostic ions have been utilized either “online” in the “HCD-PD” mode to enable a more efficient time of dissociation of glycopeptides instead of nonglycopeptides in the subsequent triggered ETD or CID or “offline” as a diagnostic ion assistant for the software-based selection of glycopeptide spectra. In addition, fragment ions m/z 292.103/274.092 and m/z 308.098/290.087 for NeuAc/NeuAc-H2O and NeuGc/NeuGc-H2O, respectively, are specific for sialic acid–containing glycopeptides. Other diagnostic oxonium ions, such as m/z 109.028, m/z 115.035, m/z 127.039, and 145.050 from fragments of Hex, m/z 126.055, m/z 144.066, m/z 168.066, and m/z 186.076 from fragments of HexNAc, m/z 657.140 from Hex+HexNAc+NeuAc, and m/z 673.230 from Hex+HexNAc+NeuGc, are also used alone or in combination for either the selection of glycopeptide spectra or glycan matching.

Another strategy for selecting glycopeptide spectra is based on precursor ions (56, 57, 58). Isotope-targeted glycoproteomics (IsoTag), in which metabolic labeling of glycoproteome is combined with the chemical enrichment and isotopic recoding of glycopeptides with a dibromide motif developed by Woo et al., (58) is the most representative approach for that strategy. The natural abundance of the stable isotopes, 79Br and 81Br (1:1), were used in the study by Palaniappan et al. (57) to provide a ready source for isotope recoding and convoluting the mass envelope of a glycopeptide with a triplet signature (1:2:1(M, M + 2, M + 4)), which can be detected computationally using a pattern-recognition algorithm developed in-house.

Matching of the Glycan Moiety and Carrier Peptide

Computational tools tackle the matching of the glycan moiety and carrier peptides using different approaches probably according to the different experimental and MS acquisition strategies they used (Fig. 1). Generally, for an analysis of a glycan moiety, most tools perform probability matches against the glycan database from GlycomeDB (currently merged into GlyTouCan). A few tools developed semi de novo glycan profiling algorithms; for carrier peptide analysis, software tools perform probability matching against select protein sequences. When a complete proteome database is used in a large-scale analysis, the incorporation of existing proteomic search engines (i.e., Mascot, Sequest, and pFind) is a routine approach (59, 60).

Fig. 1.

Fig. 1

General approaches for matching of the glycan moiety and carrier peptide.

The approach for the strategy with the deglycosylation step considers the carrier peptide as a predefined mass increment according to their pre-established spectral library of peptides containing glycosylation sites to deduce the glycan moiety from glycopeptide spectra with different glycan matching strategies. The core of GRIP is a network-centric algorithm for deciphering glycan fragmentation CID MS/MS spectra (40). ArMone 2.0 identifies glycans by searching a constructed glycan database with a calculated molecular weight of glycans and MS/MS spectra of the glycopeptide, while only scoring the pentasaccharide core structure observed in the N-glycosylation (Man3GlcNAc2) of mammalian proteins. GPQuest assigns the glycan composition by matching the mass difference between the precursor ion of the intact glycopeptide and the glycosite-containing peptide to a glycan database (42). It is practically impossible to distinguish two glycan compositions with the same molecular weight when deducing the glycan moiety based only on the molecular weight. The same limitation also exists when directly matching the carrier peptide from the glycopeptide spectra with the peptide library, although common retention times have been used to filter between the two experiments (identification of the intact glycopeptide and deglycosylated peptide) in ArMone 2.0.

The interpretation approach for the strategy using combined fragmentation methods without the deglycosylation step generally assigns the glycan composition from acquired CID-MS/MS spectra that are rich in glycan fragment ions and define the carrier peptide sequences by an additional mode of MS2 or MS3 analysis (61). The corresponding computational tools mentioned in the “Experimental and MS Acquisition Strategy Without the Deglycosylation Procedure” section all use this approach to deduce the glycan composition and peptide sequences of glycopeptides. For example, Sweet-Heart identifies the glycan composition in low-resolution and low-accuracy ion-trap MS2 data and putative peptide backbone in targeted MS3 data (54).

The approaches used by software tools to interpret the single fragmentation patterns differ. Some mandatorily detect the glycan and peptide ions in one spectrum. Meanwhile, other search tools, such as MAGIC (44) and SugarQB (62) for HCD/Q-TOF spectra interpretation, detect Y-ion patterns from the fragmentation of the common trimannosyl core of N-glycopeptides, and search the subsequent peptide sequence using common database search engines, such as Mascot, through the generation of in silico spectra by overwriting the original precursor with the naked peptide m/z and removing all of the glycan-related ions.

A superior interpretation strategy was used by pGlyco 2.0 (26), which was developed for the interpretation of abundant information in SCE-HCD-MS/MS spectra. It performed an integrated open search of each spectrum: a spectrum was first scored against the glycome database to identify the glycan candidates and then scored against the proteome database to identify the peptide candidates. This process substantially increased the efficiency and accuracy of the interpretation.

Quality Control Methods for Glycopeptide-Spectrum Matching

The false discovery rate (FDR) is an important measurement and a crucial parameter used to define the specificity of the identification and quality control of results. Since this technique was first proposed by Gygi in proteomics (63, 64), the target-decoy approach has been adopted as an acceptable method to calculate the FDRs for glycopeptides.

Before 2016, almost all identification software estimated the FDR of glycopeptide identification only at the peptide level by using a decoy database created based on the target protein sequence database. For example, GPQuest (42) creates a decoy database by randomly arranging amino acid sequences of those glycosite-containing peptides identified in their two-step experimental strategy with the same length as the target database. GlycoPep Evaluator (65) is a software program designed for the FDR analysis of small datasets that generates a decoy database with fixed mass of attached glycan for each glycopeptide in a 1:20 target-to-decoy ratio, according to a user-specified mass tolerance interval.

However, a high FDR would be obtained for glycopeptide identification results even if the peptide-spectrum match score is high because the FDR is only calculated at the peptide level and does not control for glycan identification. For example, Wu et al. (55) manually compared the results of HCD spectra identified using Sweet-Heart and Byonic (48) and validated as high as 37% false positives from 551 positive spectrum matches despite a claim of zero FDR using the Byonic criteria because they lack an FDR estimation of the glycan identity. As shown in the study by us, Byonic might substantially underestimate the metric of FDR for glycopetide identification (26).

Researchers also realized that an FDR evaluation at all three levels of matches to glycans, peptides, and glycopeptides is needed for comprehensive quality control (66). GP Finder (12) was a pioneer program attempting to resolve the problem of glycan decoy database construction. This software mainly focuses on identifying glycopeptides in simple samples. The issues with the mathematical model and algorithm of FDR for glycopeptide identification remained unsolved until pGlyco developed a novel target-decoy approach to estimate the FDR of the glycan identification. The scoring scheme for glycan identification by pGlyco was a revised version of the previously reported algorithm for the CID-MS/MS spectral analysis of glycopeptides (40). pGlyco 2.0 conducts comprehensive quality control, including an FDR analysis of glycans and peptides, and uses a new model for the glycopeptide FDR estimation (Fig. 2A). In addition, a new quantitative analysis pipeline using metabolically labeled glycoproteome samples was established to validate the FDR reported by the pGlyco 2.0 (Fig. 2B) (26).

Fig. 2.

Fig. 2

A dedicated software pGlyco 2.0 for site-specific glycan interpretation.A, the flowchart of pGlyco 2.0. B, the FDR validation workflow. Reprinted with permission from Ref (26), under Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/). FDR, false discovery rate.

Mass Spectrum Annotation and Result Display

Automated annotation of glycopeptide spectra and a proper result display greatly facilitate subsequent analyses of intact glycopeptides. However, a range of previously developed software tools, including many well-designed search engines such as Sweet-Heart (54), ArMone (41), and GPQuest (42), do not include these functions. Some subsequently developed software tools, such as pGlyco (61), glyXtoolMS (67), and GPSeeker (68), conduct simple mass spectrum annotation steps. A web server, MAGIC-web (69), which was established by the MAGIC (44) software development team, enables mass spectrum annotation and glycan structure visualization. This function was designed as one of the functional modules on the web server and not integrated into the search engine of MAGIC and MAGIC+, requiring users have to complete additional operation themselves.

The dedicated search engine pGlyco 2.0 (26) considered that issue. In pGlyco 2.0, all identified spectra were automatically annotated and displayed by the software tool gLabel embedded in pGlyco 2.0, which greatly facilitates manual verification and data analysis (Fig. 3A). Another impressive tool for visualizing results was proposed by Riley et al. (35). The authors developed several approaches to visualize identified glycopeptides and explore profiles of heterogeneity present at multiple levels of proteomic information, from glycosites to subcellular regions (Fig. 3, B and C), using a dataset identified by an activated ion electron transfer dissociation–enabled method, which proved useful for analyzing intact glycopeptides in future studies. In addition, some specialized glycopeptide mass spectrum annotation and visualization tools, such as PepSweetener (70) and gpAnnotate (71), were also designed to facilitate the identification and analysis of results.

Fig. 3.

Fig. 3

Glycopeptide spectrum annotation and result display.A, automated annotation of glycopeptide spectra by pGlyco 2.0. B, glycoprotein–glycan network maps. C, a glycan co-occurrence heat map represents the number of times glycan pairs appeared together at the same glycosite. Reprinted with permission from Ref (26) and Ref (35), under Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).

Software Applications

Advanced and dedicated software tools for high-throughput glycopeptide identification and efficient identification strategies have facilitated the increase in the N-glycoproteome coverage in recent years (48, 62, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81). The intact glycopeptide identification scale increased by three orders of magnitude from 2010 to the present. Before 2014, only tens to hundreds intact glycopeptides were identified because almost all identification strategies and software tools were only suitable for glycopeptide analysis in simple or standard glycoprotein sample and the analytical throughput was low. For example, only 45 N-glycopeptides were identified using GlycoPep Grader in 2012 (13). Pompach et al. (15) identified 57 glycopeptides in haptoglobin and 14 glycopeptides using GlycoPeptide Search. The development of advanced search tools, more powerful instrumentation, and well-designed MS acquisition methods have increased the scale of identification from approximately 100 (42, 54, 82) to more than 1000 (40, 53, 76, 83). One of the most impressive and representative studies was published in 2016 by Sun et al. (83), who used GPQuest software and described an innovative method called solid-phase extraction of N-linked glycans and glycosite-containing peptides for N-glycopeptide identification. The authors successfully identified 1562 N-glycopeptides in different cell lines using the GPQuest software they developed in 2015 (42), which was a highlight and substantial achievement at that time.

In 2017, a major breakthrough was achieved by pGlyco 2.0 in the global characterization of intact N-glycopeptides (10, 26). The powerful search engine pGlyco 2.0 with comprehensive quality control and the ingenious experimental strategy identified more than 10,000 glycopeptides and reached the deepest and largest scale ever reported. A total of 10,009 intact N-glycopeptides were identified in five mouse tissues. The generic, precise, and open-access characters (download freely from http://pfind.ict.ac.cn/software/pGlyco/index.html) attracted increasing attention to pGlyco 2.0, and this method adopted by an increasing number of studies (28, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95). For instance, pGlyco 2.0 was used to identify 3524 intact N-glycopeptides in the APP/PS1 mouse model of Alzheimer’s disease and WT mice. Based on the further integration and identification of N-glycopeptides, proteome and lectin microarray data for glycan epitopes and dysfunctions of N-glycoproteins were shown to affect the development of Alzheimer’s disease (27). Using pGlyco 2.0, Zhang et al. (92). established an in-depth N-glycopeptide profile of human plasma and identified 1644 intact N-glycopeptides.

Byonic has become commercially available around 2012 (48). Since then, it has been widely used in glycopeptide identification (96, 97, 98, 99, 100). In addition, Byonic enables the user to perform glycopeptide searches by specifying glycan masses using Byonic’s usual modification fine-control mechanism, which greatly facilitates many studies (101, 102).

Conclusions and Future Perspectives

Powerful software tools are tremendously advancing the depth and precision of the identification of intact glycopeptides at a rapid rate and will greatly facilitate a comprehensive and accurate understanding of the role of glycosylation in organisms. Here, we reviewed the recent advances in software tools for MS-based identification of intact glycopeptides, typically for intact N-glycopeptides (Table 1). Throughout its development in the last decade, the field of N-glycopeptide analysis recently advanced in the last 5 years, changing the status of software development that lagged behind several years ago (103).

Glycopeptide analysis is moving toward the next stage of more comprehensive and quantitative identification. In our view, however, several problems must be solved in this process. The first is to develop dedicated strategies and search engines for intact O-glycopeptides. The problem of identifying O-glycopeptides are many folds, such as lack of efficient enrichment method, hard to obtain informative O-glycopeptide mass spectra. Although strategies and interpretation tools for O-glycoepeptides are emerging (104, 105, 106), they are immature and not as generic and precise as software tools for N-glycopeptides. The second issue to be addressed is the accurate site-specific localization of the glycan, particularly for O-glycopeptides, because several glycosylation sites occur adjacently in one digested O-glycopeptides, which presents a substantial challenge for both MS identification and software interpretation. The third is that generic software tools supporting both MS and tandem MS quantitation of intact glycopeptides urgently needed, which is extremely important for explorations of the roles of glycosylation in different biological and pathological stages. Fourth and importantly, because only the composition and a small degree of topology (such as fucose being on chitobiose core, etc.) can be generated by the current glycopeptide analysis approaches, the analyses of glycan structures and glycan isoforms on glycopeptides are additional and important challenges due to the inherent complexity of glycans. Extensive developments in techniques and software tools are needed to achieve more exquisite and superb tools for intact glycopeptide characterization and are anticipated in the next 5 to 10 years.

Conflict of interest

The authors declare no competing interests.

Acknowledgments

This work was supported by grants from the National Natural Science Foundation of China Project (91853102), the National Key Research and Development Program (2016YFA0501303, 2018YFC0910300, and 2016YFB0201702), and the Innovative Research Team of High-level Local Universities in Shanghai.

Author contributions

W. C. wrote the manuscript and supervised this work. M. L. helped to draft the manuscript. S. K., W. M., and Y. Z. participated in reference collection. P. Y. supervised this work.

Contributor Information

Weiqian Cao, Email: wqcao@fudan.edu.cn.

Pengyuan Yang, Email: pyyang@fudan.edu.cn.

References

  • 1.Hart G.W., Copeland R.J. Glycomics hits the big time. Cell. 2010;143:672–676. doi: 10.1016/j.cell.2010.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Feizi T., Mulloy B. Carbohydrates and glycoconjugates. Glycomics: The new era of carbohydrate biology. Curr. Opin. Struct. Biol. 2003;13:602–604. doi: 10.1016/j.sbi.2003.09.001. [DOI] [PubMed] [Google Scholar]
  • 3.Varki A. Biological roles of glycans. Glycobiology. 2017;27:3–49. doi: 10.1093/glycob/cww086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Morris H.R., Thompson M.R., Osuga D.T., Ahmed A.I., Chan S.M., Vandenheede J.R., Feeney R.E. Antifreeze glycoproteins from the blood of an antarctic fish. The structure of the proline-containing glycopeptides. J. Biol. Chem. 1978;253:5155–5162. [PubMed] [Google Scholar]
  • 5.Mirgorodskaya E., Roepstorff P., Zubarev R.A. Localization of O-glycosylation sites in peptides by electron capture dissociation in a Fourier transform mass spectrometer. Anal. Chem. 1999;71:4431–4436. doi: 10.1021/ac990578v. [DOI] [PubMed] [Google Scholar]
  • 6.Medzihradszky K.F., Gillece-Castro B.L., Settineri C.A., Townsend R.R., Masiarz F.R., Burlingame A.L. Structure determination of O-linked glycopeptides by tandem mass spectrometry. Biomed. Environ. Mass Spectrom. 1990;19:777–781. doi: 10.1002/bms.1200191205. [DOI] [PubMed] [Google Scholar]
  • 7.Zielinska D.F., Gnad F., Wisniewski J.R., Mann M. Precision mapping of an in vivo N-glycoproteome reveals rigid topological and sequence constraints. Cell. 2010;141:897–907. doi: 10.1016/j.cell.2010.04.012. [DOI] [PubMed] [Google Scholar]
  • 8.Kaji H., Saito H., Yamauchi Y., Shinkawa T., Taoka M., Hirabayashi J., Kasai K., Takahashi N., Isobe T. Lectin affinity capture, isotope-coded tagging and mass spectrometry to identify N-linked glycoproteins. Nat. Biotechnol. 2003;21:667–672. doi: 10.1038/nbt829. [DOI] [PubMed] [Google Scholar]
  • 9.Zhang H., Li X.J., Martin D.B., Aebersold R. Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry. Nat. Biotechnol. 2003;21:660–666. doi: 10.1038/nbt827. [DOI] [PubMed] [Google Scholar]
  • 10.Cao W.Q., Liu M.Q., Kong S.Y., Wu M.X., Huang Z.Z., Yang P.Y. Novel methods in glycomics: A 2019 update. Expert Rev. Proteomics. 2020;17:11–25. doi: 10.1080/14789450.2020.1708199. [DOI] [PubMed] [Google Scholar]
  • 11.Hu H., Khatri K., Klein J., Leymarie N., Zaia J. A review of methods for interpretation of glycopeptide tandem mass spectral data. Glycoconj. J. 2016;33:285–296. doi: 10.1007/s10719-015-9633-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Strum J.S., Nwosu C.C., Hua S., Kronewitter S.R., Seipert R.R., Bachelor R.J., An H.J., Lebrilla C.B. Automated assignments of N- and O-site specific glycosylation with extensive glycan heterogeneity of glycoprotein mixtures. Anal. Chem. 2013;85:5666–5675. doi: 10.1021/ac4006556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Woodin C.L., Hua D., Maxon M., Rebecchi K.R., Go E.P., Desaire H. GlycoPep grader: A web-based utility for assigning the composition of N-linked glycopeptides. Anal. Chem. 2012;84:4821–4829. doi: 10.1021/ac300393t. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chandler K.B., Pompach P., Goldman R., Edwards N. Exploring site-specific N-glycosylation microheterogeneity of haptoglobin using glycopeptide CID tandem mass spectra and glycan database search. J. Proteome Res. 2013;12:3652–3666. doi: 10.1021/pr400196s. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Pompach P., Chandler K.B., Lan R., Edwards N., Goldman R. Semi-automated identification of N-glycopeptides by hydrophilic interaction chromatography, nano-reverse-phase LC-MS/MS, and glycan database search. J. Proteome Res. 2012;11:1728–1740. doi: 10.1021/pr201183w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Khatri K., Staples G.O., Leymarie N., Leon D.R., Turiak L., Huang Y., Yip S., Hu H., Heckendorf C.F., Zaia J. Confident assignment of site-specific glycosylation in complex glycoproteins in a single step. J. Proteome Res. 2014;13:4347–4355. doi: 10.1021/pr500506z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cao Q., Zhao X., Zhao Q., Lv X., Ma C., Li X., Zhao Y., Peng B., Ying W., Qian X. Strategy integrating stepped fragmentation and glycan diagnostic ion- based spectrum refinement for the identification of core fucosylated glycoproteome using mass spectrometry. Anal. Chem. 2014;86:6804–6811. doi: 10.1021/ac501154a. [DOI] [PubMed] [Google Scholar]
  • 18.Reiding K.R., Bondt A., Franc V., Heck A.J.R. The benefits of hybrid fragmentation methods for glycoproteomics. Trends Analyt. Chem. 2018;108:260–268. [Google Scholar]
  • 19.Cao L., Tolic N., Qu Y., Meng D., Zhao R., Zhang Q., Moore R.J., Zink E.M., Lipton M.S., Pasa-Tolic L., Wu S. Characterization of intact N- and O-linked glycopeptides using higher energy collisional dissociation. Anal. Biochem. 2014;452:96–102. doi: 10.1016/j.ab.2014.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hinneburg H., Stavenhagen K., Schweiger-Hufnagel U., Pengelley S., Jabs W., Seeberger P.H., Silva D.V., Wuhrer M., Kolarich D. The art of destruction: Optimizing collision energies in quadrupole-time of flight (Q-TOF) instruments for glycopeptide-based glycoproteomics. J. Am. Soc. Mass Spectrom. 2016;27:507–519. doi: 10.1007/s13361-015-1308-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hoffmann E.d., Stroobant V. J. Wiley; Chichester, West Sussex, England; Hoboken, NJ: 2007. Mass Spectrometry: Principles and Applications. [Google Scholar]
  • 22.Kolli V., Roth H.A., De La Cruz G., Fernando G.S., Dodds E.D. The role of proton mobility in determining the energy-resolved vibrational activation/dissociation channels of N-glycopeptide ions. Anal. Chim. Acta. 2015;896:85–92. doi: 10.1016/j.aca.2015.09.013. [DOI] [PubMed] [Google Scholar]
  • 23.Zhang Y., Xie X., Zhao X., Tian F., Lv J., Ying W., Qian X. Systems analysis of singly and multiply O-glycosylated peptides in the human serum glycoproteome via EThcD and HCD mass spectrometry. J. Proteomics. 2018;170:14–27. doi: 10.1016/j.jprot.2017.09.014. [DOI] [PubMed] [Google Scholar]
  • 24.Aboufazeli F., Kolli V., Dodds E.D. A comparison of energy-resolved vibrational activation/dissociation characteristics of protonated and sodiated high mannose N-glycopeptides. J. Am. Soc. Mass Spectrom. 2015;26:587–595. doi: 10.1007/s13361-014-1070-1. [DOI] [PubMed] [Google Scholar]
  • 25.Kuo C.W., Guu S.Y., Khoo K.H. Distinctive and complementary MS(2) fragmentation characteristics for identification of sulfated sialylated N-glycopeptides by nanoLC-MS/MS workflow. J. Am. Soc. Mass Spectrom. 2018;29:1166–1178. doi: 10.1007/s13361-018-1919-9. [DOI] [PubMed] [Google Scholar]
  • 26.Liu M.Q., Zeng W.F., Fang P., Cao W.Q., Liu C., Yan G.Q., Zhang Y., Peng C., Wu J.Q., Zhang X.J., Tu H.J., Chi H., Sun R.X., Cao Y., Dong M.Q., et al. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat. Commun. 2017;8:438. doi: 10.1038/s41467-017-00535-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Fang P., Xie J., Sang S., Zhang L., Liu M., Yang L., Xu Y., Yan G., Yao J., Gao X., Qian W., Wang Z., Zhang Y., Yang P., Shen H. Multilayered N-glycoproteome profiling reveals highly heterogeneous and dysregulated protein N-glycosylation related to Alzheimer's disease. Anal. Chem. 2020;92:867–874. doi: 10.1021/acs.analchem.9b03555. [DOI] [PubMed] [Google Scholar]
  • 28.Zhang Y., Zhao W., Zhao Y., Mao Y., Su T., Zhong Y., Wang S., Zhai R., Cheng J., Fang X., Zhu J., Yang H. Comparative glycoproteomic profiling of human body fluid between healthy controls and patients with papillary thyroid carcinoma. J. Proteome Res. 2019 doi: 10.1021/acs.jproteome.9b00672. [DOI] [PubMed] [Google Scholar]
  • 29.Frese C.K., Altelaar A.F., van den Toorn H., Nolting D., Griep-Raming J., Heck A.J., Mohammed S. Toward full peptide sequence coverage by dual fragmentation combining electron-transfer and higher-energy collision dissociation tandem mass spectrometry. Anal. Chem. 2012;84:9668–9673. doi: 10.1021/ac3025366. [DOI] [PubMed] [Google Scholar]
  • 30.Campbell J.L., Hager J.W., Le Blanc J.C. On performing simultaneous electron transfer dissociation and collision- induced dissociation on multiply protonated peptides in a linear ion trap. J. Am. Soc. Mass Spectrom. 2009;20:1672–1683. doi: 10.1016/j.jasms.2009.05.009. [DOI] [PubMed] [Google Scholar]
  • 31.Kolbowski L., Mendes M.L., Rappsilber J. Optimizing the parameters governing the fragmentation of cross-linked peptides in a tribrid mass spectrometer. Anal. Chem. 2017;89:5311–5318. doi: 10.1021/acs.analchem.6b04935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Chen G., Zhang Y., Trinidad J.C., Dann C., 3rd Distinguishing sulfotyrosine containing peptides from their phosphotyrosine counterparts using mass spectrometry. J. Am. Soc. Mass Spectrom. 2018;29:455–462. doi: 10.1007/s13361-017-1854-1. [DOI] [PubMed] [Google Scholar]
  • 33.Li S., Zhou Y., Xiao K., Li J., Tian Z. Selective fragmentation of the N-glycan moiety and protein backbone of ribonuclease B on an Orbitrap Fusion Lumos Tribrid mass spectrometer. Rapid Commun. Mass Spectrom. 2018;32:2031–2039. doi: 10.1002/rcm.8273. [DOI] [PubMed] [Google Scholar]
  • 34.Riley N.M., Hebert A.S., Durnberger G., Stanek F., Mechtler K., Westphall M.S., Coon J.J. Phosphoproteomics with activated ion electron transfer dissociation. Anal. Chem. 2017;89:6367–6376. doi: 10.1021/acs.analchem.7b00212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Riley N.M., Hebert A.S., Westphall M.S., Coon J.J. Capturing site-specific heterogeneity with large-scale N-glycoproteome analysis. Nat. Commun. 2019;10:1311. doi: 10.1038/s41467-019-09222-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Caval T., Zhu J., Heck A.J.R. Simply extending the mass range in electron transfer higher energy collisional dissociation increases confidence in N-glycopeptide identification. Anal. Chem. 2019;91:10401–10406. doi: 10.1021/acs.analchem.9b02125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen Z., Yu Q., Hao L., Liu F., Johnson J., Tian Z., Kao W.J., Xu W., Li L. Site-specific characterization and quantitation of N-glycopeptides in PKM2 knockout breast cancer cells using DiLeu isobaric tags enabled by electron-transfer/higher-energy collision dissociation (EThcD) Analyst. 2018;143:2508–2519. doi: 10.1039/c8an00216a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Parker B.L., Thaysen-Andersen M., Fazakerley D.J., Holliday M., Packer N.H., James D.E. Terminal galactosylation and sialylation switching on membrane glycoproteins upon TNF-alpha-induced insulin resistance in adipocytes. Mol. Cell. Proteomics. 2016;15:141–153. doi: 10.1074/mcp.M115.054221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Yu Q., Wang B., Chen Z., Urabe G., Glover M.S., Shi X., Guo L.W., Kent K.C., Li L. Electron-transfer/higher-energy collision dissociation (EThcD)-enabled intact glycopeptide/glycoproteome characterization. J. Am. Soc. Mass Spectrom. 2017;28:1751–1764. doi: 10.1007/s13361-017-1701-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Liu M., Zhang Y., Chen Y., Yan G., Shen C., Cao J., Zhou X., Liu X., Zhang L., Shen H., Lu H., He F., Yang P. Efficient and accurate glycopeptide identification pipeline for high- throughput site-specific N-glycosylation analysis. J. Proteome Res. 2014;13:3121–3129. doi: 10.1021/pr500238v. [DOI] [PubMed] [Google Scholar]
  • 41.Cheng K., Chen R., Seebun D., Ye M., Figeys D., Zou H. Large-scale characterization of intact N-glycopeptides using an automated glycoproteomic method. J. Proteomics. 2014;110:145–154. doi: 10.1016/j.jprot.2014.08.006. [DOI] [PubMed] [Google Scholar]
  • 42.Toghi Eshghi S., Shah P., Yang W., Li X., Zhang H. GPQuest: A spectral library matching algorithm for site-specific assignment of tandem mass spectra to intact N-glycopeptides. Anal. Chem. 2015;87:5181–5188. doi: 10.1021/acs.analchem.5b00024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yu C.Y., Mayampurath A., Zhu R., Zacharias L., Song E., Wang L., Mechref Y., Tang H. Automated glycan sequencing from tandem mass spectra of N-linked glycopeptides. Anal. Chem. 2016;88:5725–5732. doi: 10.1021/acs.analchem.5b04858. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lynn K.S., Chen C.C., Lih T.M., Cheng C.W., Su W.C., Chang C.H., Cheng C.Y., Hsu W.L., Chen Y.J., Sung T.Y. MAGIC: An automated N-linked glycoprotein identification tool using a Y1-ion pattern matching algorithm and in silico MS(2) approach. Anal. Chem. 2015;87:2466–2473. doi: 10.1021/ac5044829. [DOI] [PubMed] [Google Scholar]
  • 45.Baker P.R., Trinidad J.C., Chalkley R.J. Modification site localization scoring integrated into a search engine. Mol. Cell. Proteomics. 2011;10 doi: 10.1074/mcp.M111.008078. M111.008078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Medzihradszky K.F., Kaasik K., Chalkley R.J. Tissue-specific glycosylation at the glycopeptide level. Mol. Cell. Proteomics. 2015;14:2103–2110. doi: 10.1074/mcp.M115.050393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zhu Z., Hua D., Clark D.F., Go E.P., Desaire H. GlycoPep detector: A tool for assigning mass spectrometry data of N-linked glycopeptides on the basis of their electron transfer dissociation spectra. Anal. Chem. 2013;85:5023–5032. doi: 10.1021/ac400287n. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Bern M., Kil Y.J., Becker C. Byonic: Advanced peptide and protein identification software. Curr. Protoc. Bioinformatics. 2012 doi: 10.1002/0471250953.bi1320s40. Chapter 13:Unit13.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kim J.W., Hwang H., Lim J.S., Lee H.J., Jeong S.K., Yoo J.S., Paik Y.K. gFinder: A web-based bioinformatics tool for the analysis of N-glycopeptides. J. Proteome Res. 2016;15:4116–4125. doi: 10.1021/acs.jproteome.6b00772. [DOI] [PubMed] [Google Scholar]
  • 50.Park G.W., Kim J.Y., Hwang H., Lee J.Y., Ahn Y.H., Lee H.K., Ji E.S., Kim K.H., Jeong H.K., Yun K.N., Kim Y.S., Ko J.H., An H.J., Kim J.H., Paik Y.K., et al. Integrated GlycoProteome analyzer (I-GPA) for automated identification and quantitation of site-specific N-glycosylation. Sci. Rep. 2016;6:21175. doi: 10.1038/srep21175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wu Y., Mechref Y., Klouckova I., Mayampurath A., Novotny M.V., Tang H. Mapping site-specific protein N-glycosylations through liquid chromatography/mass spectrometry and targeted tandem mass spectrometry. Rapid Commun. Mass Spectrom. 2010;24:965–972. doi: 10.1002/rcm.4474. [DOI] [PubMed] [Google Scholar]
  • 52.Liu G., Cheng K., Lo C.Y., Li J., Qu J., Neelamegham S. A comprehensive, open-source platform for mass spectrometry-based glycoproteomics data analysis. Mol. Cell. Proteomics. 2017;16:2032–2047. doi: 10.1074/mcp.M117.068239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.He L., Xin L., Shan B., Lajoie G.A., Ma B. GlycoMaster DB: Software to assist the automated identification of N-linked glycopeptides by tandem mass spectrometry. J. Proteome Res. 2014;13:3881–3895. doi: 10.1021/pr401115y. [DOI] [PubMed] [Google Scholar]
  • 54.Wu S.W., Liang S.Y., Pu T.H., Chang F.Y., Khoo K.H. Sweet-Heart - an integrated suite of enabling computational tools for automated MS2/MS3 sequencing and identification of glycopeptides. J. Proteomics. 2013;84:1–16. doi: 10.1016/j.jprot.2013.03.026. [DOI] [PubMed] [Google Scholar]
  • 55.Wu S.W., Pu T.H., Viner R., Khoo K.H. Novel LC-MS(2) product dependent parallel data acquisition function and data analysis workflow for sequencing and identification of intact glycopeptides. Anal. Chem. 2014;86:5478–5486. doi: 10.1021/ac500945m. [DOI] [PubMed] [Google Scholar]
  • 56.Froehlich J.W., Dodds E.D., Wilhelm M., Serang O., Steen J.A., Lee R.S. A classifier based on accurate mass measurements to aid large scale, unbiased glycoproteomics. Mol. Cell. Proteomics. 2013;12:1017–1025. doi: 10.1074/mcp.M112.025494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Palaniappan K.K., Pitcher A.A., Smart B.P., Spiciarich D.R., Iavarone A.T., Bertozzi C.R. Isotopic signature transfer and mass pattern prediction (IsoStamp): An enabling technique for chemically-directed proteomics. ACS Chem. Biol. 2011;6:829–836. doi: 10.1021/cb100338x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Woo C.M., Iavarone A.T., Spiciarich D.R., Palaniappan K.K., Bertozzi C.R. Isotope-targeted glycoproteomics (IsoTaG): A mass-independent platform for intact N- and O-glycopeptide discovery and analysis. Nat. Methods. 2015;12:561–567. doi: 10.1038/nmeth.3366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Eng J.K., McCormack A.L., Yates J.R. An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 1994;5:976–989. doi: 10.1016/1044-0305(94)80016-2. [DOI] [PubMed] [Google Scholar]
  • 60.Chi H., Liu C., Yang H., Zeng W.F., Wu L., Zhou W.J., Wang R.M., Niu X.N., Ding Y.H., Zhang Y., Wang Z.W., Chen Z.L., Sun R.X., Liu T., Tan G.M., et al. Comprehensive identification of peptides in tandem mass spectra using an efficient open search engine. Nat. Biotechnol. 2018;36:1059–1061. doi: 10.1038/nbt.4236. [DOI] [PubMed] [Google Scholar]
  • 61.Zeng W.F., Liu M.Q., Zhang Y., Wu J.Q., Fang P., Peng C., Nie A., Yan G., Cao W., Liu C., Chi H., Sun R.X., Wong C.C., He S.M., Yang P. pGlyco: A pipeline for the identification of intact N-glycopeptides by using HCD- and CID-MS/MS and MS3. Sci. Rep. 2016;6:25102. doi: 10.1038/srep25102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Stadlmann J., Taubenschmid J., Wenzel D., Gattinger A., Durnberger G., Dusberger F., Elling U., Mach L., Mechtler K., Penninger J.M. Comparative glycoproteomics of stem cells identifies new players in ricin toxicity. Nature. 2017;549:538–542. doi: 10.1038/nature24015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Elias J.E., Gygi S.P. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods. 2007;4:207–214. doi: 10.1038/nmeth1019. [DOI] [PubMed] [Google Scholar]
  • 64.Elias J.E., Haas W., Faherty B.K., Gygi S.P. Comparative evaluation of mass spectrometry platforms used in large-scale proteomics investigations. Nat. Methods. 2005;2:667–675. doi: 10.1038/nmeth785. [DOI] [PubMed] [Google Scholar]
  • 65.Zhu Z., Su X., Go E.P., Desaire H. New glycoproteomics software, GlycoPep evaluator, generates decoy glycopeptides de novo and enables accurate false discovery rate analysis for small data sets. Anal. Chem. 2014;86:9212–9219. doi: 10.1021/ac502176n. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Shipman J.T., Su X., Hua D., Desaire H. DecoyDeveloper: An on-demand, de novo decoy glycopeptide generator. J. Proteome Res. 2019;18:2896–2902. doi: 10.1021/acs.jproteome.9b00203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Pioch M., Hoffmann M., Pralow A., Reichl U., Rapp E. glyXtool(MS): An open-source pipeline for semiautomated analysis of glycopeptide mass spectrometry data. Anal. Chem. 2018;90:11908–11916. doi: 10.1021/acs.analchem.8b02087. [DOI] [PubMed] [Google Scholar]
  • 68.Xiao K., Tian Z. GPSeeker enables quantitative structural N-glycoproteomics for site- and structure-specific characterization of differentially expressed N-glycosylation in hepatocellular carcinoma. J. Proteome Res. 2019;18:2885–2895. doi: 10.1021/acs.jproteome.9b00191. [DOI] [PubMed] [Google Scholar]
  • 69.Lih T.M., Choong W.K., Chen C.C., Cheng C.W., Lin H.N., Chen C.T., Chang H.Y., Hsu W.L., Sung T.Y. MAGIC-web: A platform for untargeted and targeted N-linked glycoprotein identification. Nucleic Acids Res. 2016;44:W575–W580. doi: 10.1093/nar/gkw254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Domagalski M.J., Alocci D., Almeida A., Kolarich D., Lisacek F. PepSweetener: A web-based tool to support manual annotation of intact glycopeptide MS spectra. Proteomics Clin. Appl. 2018;12 doi: 10.1002/prca.201700069. [DOI] [PubMed] [Google Scholar]
  • 71.Cheng K., Pawlowski G., Yu X., Zhou Y., Neelamegham S. DrawGlycan-SNFG & gpAnnotate: Rendering glycans and annotating glycopeptide mass spectra. Bioinformatics. 2019;36:1942–1943. doi: 10.1093/bioinformatics/btz819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Hu Y., Shah P., Clark D.J., Ao M., Zhang H. Reanalysis of global proteomic and phosphoproteomic data identified a large number of glycopeptides. Anal. Chem. 2018;90:8065–8071. doi: 10.1021/acs.analchem.8b01137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Mayampurath A., Song E., Mathur A., Yu C.Y., Hammoud Z., Mechref Y., Tang H. Label-free glycopeptide quantification for biomarker discovery in human sera. J. Proteome Res. 2014;13:4821–4832. doi: 10.1021/pr500242m. [DOI] [PubMed] [Google Scholar]
  • 74.Wu J., Qin H., Li T., Cheng K., Dong J., Tian M., Chai N., Guo H., Li J., You X., Dong M., Ye M., Nie Y., Zou H., Fan D. Characterization of site-specific glycosylation of secreted proteins associated with multi-drug resistance of gastric cancer. Oncotarget. 2016;7:25315–25327. doi: 10.18632/oncotarget.8287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Yang G., Hu Y., Sun S., Ouyang C., Yang W., Wang Q., Betenbaugh M., Zhang H. Comprehensive glycoproteomic analysis of Chinese hamster ovary cells. Anal. Chem. 2018;90:14294–14302. doi: 10.1021/acs.analchem.8b03520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Shu Q., Li M., Shu L., An Z., Wang J., Lv H., Yang M., Cai T., Hu T., Fu Y., Yang F. Large-scale identification of N-linked intact glycopeptides in human serum using HILIC enrichment and spectral library search. Mol. Cell. Proteomics. 2020;19:672–689. doi: 10.1074/mcp.RA119.001791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Stadlmann J., Hoi D.M., Taubenschmid J., Mechtler K., Penninger J.M. Analysis of PNGase F-resistant N-glycopeptides using SugarQb for proteome discoverer 2.1 reveals cryptic substrate specificities. Proteomics. 2018;18 doi: 10.1002/pmic.201700436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Xu F., Wang Y., Xiao K., Hu Y., Tian Z., Chen Y. Quantitative site- and structure-specific N-glycoproteomics characterization of differential N-glycosylation in MCF-7/ADR cancer stem cells. Clin. Proteomics. 2020;17:3. doi: 10.1186/s12014-020-9268-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Choo M.S., Wan C., Rudd P.M., Nguyen-Khuong T. GlycopeptideGraphMS: Improved glycopeptide detection and identification by exploiting graph theoretical patterns in mass and retention time. Anal. Chem. 2019;91:7236–7244. doi: 10.1021/acs.analchem.9b00594. [DOI] [PubMed] [Google Scholar]
  • 80.An Z., Shu Q., Lv H., Shu L., Wang J., Yang F., Fu Y. N-linked glycopeptide identification based on open mass spectral library search. Biomed. Res. Int. 2018;2018:1564136. doi: 10.1155/2018/1564136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Nasir W., Toledo A.G., Noborn F., Nilsson J., Wang M., Bandeira N., Larson G. SweetNET: A bioinformatics workflow for glycopeptide MS/MS spectral analysis. J. Proteome Res. 2016;15:2826–2840. doi: 10.1021/acs.jproteome.6b00417. [DOI] [PubMed] [Google Scholar]
  • 82.Mayampurath A., Yu C.Y., Song E., Balan J., Mechref Y., Tang H. Computational framework for identification of intact glycopeptides in complex samples. Anal. Chem. 2014;86:453–463. doi: 10.1021/ac402338u. [DOI] [PubMed] [Google Scholar]
  • 83.Sun S., Shah P., Eshghi S.T., Yang W., Trikannad N., Yang S., Chen L., Aiyetan P., Hoti N., Zhang Z., Chan D.W., Zhang H. Comprehensive analysis of protein glycosylation by solid-phase extraction of N-linked glycans and glycosite-containing peptides. Nat. Biotechnol. 2016;34:84–88. doi: 10.1038/nbt.3403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Cao L., Qu Y., Zhang Z., Wang Z., Prytkova I., Wu S. Intact glycopeptide characterization using mass spectrometry. Expert Rev. Proteomics. 2016;13:513–522. doi: 10.1586/14789450.2016.1172965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Chen Z., Huang J., Li L. Recent advances in mass spectrometry (MS)-based glycoproteomics in complex biological samples. Trends Analyt. Chem. 2019;118:880–892. doi: 10.1016/j.trac.2018.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Qin H., Chen Y., Mao J., Cheng K., Sun D., Dong M., Wang L., Wang L., Ye M. Proteomics analysis of site-specific glycoforms by a virtual multistage mass spectrometry method. Anal. Chim. Acta. 2019;1070:60–68. doi: 10.1016/j.aca.2019.04.025. [DOI] [PubMed] [Google Scholar]
  • 87.Qin H., Dong X., Mao J., Chen Y., Dong M., Wang L., Guo Z., Liang X., Ye M. Highly efficient analysis of glycoprotein sialylation in human serum by simultaneous quantification of glycosites and site-specific glycoforms. J. Proteome Res. 2019;18:3439–3446. doi: 10.1021/acs.jproteome.9b00332. [DOI] [PubMed] [Google Scholar]
  • 88.Gao W., Li H., Liu L., Huang P., Wang Z., Chen W., Ye M., Yu X., Tian R. An integrated strategy for high-sensitive and multi-level glycoproteome analysis from low micrograms of protein samples. J. Chromatogr. A. 2019;1600:46–54. doi: 10.1016/j.chroma.2019.04.041. [DOI] [PubMed] [Google Scholar]
  • 89.Li Q., Xie Y., Wong M., Lebrilla C.B. Characterization of cell glycocalyx with mass spectrometry methods. Cells. 2019;8:882. doi: 10.3390/cells8080882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Park D.D., Xu G., Wong M., Phoomak C., Liu M., Haigh N.E., Wongkham S., Yang P., Maverakis E., Lebrilla C.B. Membrane glycomics reveal heterogeneity and quantitative distribution of cell surface sialylation. Chem. Sci. 2018;9:6271–6285. doi: 10.1039/c8sc01875h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Suttapitugsakul S., Sun F., Wu R. Recent advances in glycoproteomic analysis by mass spectrometry. Anal. Chem. 2020;92:267–291. doi: 10.1021/acs.analchem.9b04651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Zhang Y., Mao Y., Zhao W., Su T., Zhong Y., Fu L., Zhu J., Cheng J., Yang H. Glyco-CPLL: An integrated method for in-depth and comprehensive N-glycoproteome profiling of human plasma. J. Proteome Res. 2020;19:655–666. doi: 10.1021/acs.jproteome.9b00557. [DOI] [PubMed] [Google Scholar]
  • 93.Zhu J., Warner E., Parikh N.D., Lubman D.M. Glycoproteomic markers of hepatocellular carcinoma-mass spectrometry based approaches. Mass Spectrom. Rev. 2019;38:265–290. doi: 10.1002/mas.21583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Brown C.J., Grassmyer K.T., MacDonald M.L., Clemmer D.E., Trinidad J.C. Glycoproteome analysis of human serum and brain tissue. bioRxiv. 2019 doi: 10.1101/647081. [DOI] [Google Scholar]
  • 95.Zhang S., Cao X., Liu C., Li W., Zeng W., Li B., Chi H., Liu M., Qin X., Tang L., Yan G., Ge Z., Liu Y., Gao Q., Lu H. N-glycopeptide signatures of IgA2 in serum from patients with hepatitis B Virus-related Liver diseases. Mol. Cell. Proteomics. 2019;18:2262–2272. doi: 10.1074/mcp.RA119.001722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Lin Y.H., Zhu J., Meijer S., Franc V., Heck A.J.R. Glycoproteogenomics: A frequent gene polymorphism affects the glycosylation pattern of the human serum fetuin/alpha-2-HS-glycoprotein. Mol. Cell. Proteomics. 2019;18:1479–1490. doi: 10.1074/mcp.RA119.001411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Zhang H., Lv Y., Du J., Shao W., Jiao F., Xia C., Gao F., Yu Q., Liu Y., Zhang W., Zhang Y., Qin W., Qian X. A GSH functionalized magnetic ultra-thin 2D-MoS2 nanocomposite for HILIC- based enrichment of N-glycopeptides from urine exosome and serum proteins. Anal. Chim. Acta. 2020;1098:181–189. doi: 10.1016/j.aca.2019.11.012. [DOI] [PubMed] [Google Scholar]
  • 98.Zhao X., Zheng S., Li Y., Huang J., Zhang W., Xie Y., Qin W., Qian X. An integrated mass spectroscopy data processing strategy for fast identification, in-depth, and reproducible quantification of protein O-glycosylation in a large cohort of human urine samples. Anal. Chem. 2020;92:690–698. doi: 10.1021/acs.analchem.9b02228. [DOI] [PubMed] [Google Scholar]
  • 99.Dang L., Jia L., Zhi Y., Li P., Zhao T., Zhu B., Lan R., Hu Y., Zhang H., Sun S. Mapping human N-linked glycoproteins and glycosylation sites using mass spectrometry. Trends Analyt. Chem. 2019;114:143–150. doi: 10.1016/j.trac.2019.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Reiding K.R., Franc V., Huitema M.G., Brouwer E., Heeringa P., Heck A.J.R. Neutrophil myeloperoxidase harbors distinct site-specific peculiarities in its glycosylation. J. Biol. Chem. 2019;294:20233–20245. doi: 10.1074/jbc.RA119.011098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Yang L., Sun Z., Zhang L., Cai Y., Peng Y., Cao T., Zhang Y., Lu H. Chemical labeling for fine mapping of IgG N-glycosylation by ETD-MS. Chem. Sci. 2019;10:9302–9307. doi: 10.1039/c9sc02491c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Di Y., Zhang L., Zhang Y., Zhao H., Yan G., Yao J., Zhang S., Lu H. MdCDPM: A mass defect-based chemical-directed proteomics method for targeted analysis of intact sialylglycopeptides. Anal. Chem. 2019;91:9986–9992. doi: 10.1021/acs.analchem.9b01798. [DOI] [PubMed] [Google Scholar]
  • 103.Desaire H. Glycopeptide analysis, recent developments and applications. Mol. Cell. Proteomics. 2013;12:893–901. doi: 10.1074/mcp.R112.026567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Mao J., You X., Qin H., Wang C., Wang L., Ye M. A new searching strategy for the identification of O-linked glycopeptides. Anal. Chem. 2019;91:3852–3859. doi: 10.1021/acs.analchem.8b04184. [DOI] [PubMed] [Google Scholar]
  • 105.Yang W., Ao M., Hu Y., Li Q.K., Zhang H. Mapping the O-glycoproteome using site-specific extraction of O-linked glycopeptides (EXoO) Mol. Syst. Biol. 2018;14 doi: 10.15252/msb.20188486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Ye Z., Mao Y., Clausen H., Vakhrushev S.Y. Glyco-DIA: A method for quantitative O-glycoproteomics with in silico- boosted glycopeptide libraries. Nat. Methods. 2019;16:902–910. doi: 10.1038/s41592-019-0504-x. [DOI] [PubMed] [Google Scholar]

Articles from Molecular & Cellular Proteomics : MCP are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES