Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2023 May 17:2023.05.17.541150. [Version 1] doi: 10.1101/2023.05.17.541150

“Ghost” fragment ions in structure and site-specific glycoproteomics analysis

Diana Campos a,*, Michael Girgis b, Qiang Yang c, Guanghui Zong d, Radoslav Goldman e,f, Lai-Xi Wang d, Miloslav Sanda a,e,f,*
PMCID: PMC10245710  PMID: 37292769

Abstract

Mass spectrometry (MS) can unlock crucial insights into the intricate world of glycosylation analysis. Despite its immense potential, the qualitative and quantitative analysis of isobaric glycopeptide structures remains one of the most daunting hurdles in the field of glycoproteomics. The ability to distinguish between these complex glycan structures poses a significant challenge, hindering our ability to accurately measure and understand the role of glycoproteins in biological systems. A few recent publications described the use of collision energy (CE) modulation to improve structural elucidation, especially for qualitative purposes. Different linkages of glycan units usually demonstrate different stabilities under CID/HCD fragmentation conditions. Fragmentation of the glycan moiety produces low molecular weight ions (oxonium ions) that can serve as a structure-specific signature for specific glycan moieties, however, specificity of these fragments has never been examined closely. Here, we investigated fragmentation specificity using synthetic stable isotope-labelled glycopeptide standards. These standards were isotopically labelled at the reducing terminal GlcNAc, which allowed us to resolve fragments produced by oligomannose core moiety and fragments generated from outer antennary structures. Our research identified the potential for false positive structure assignments due to the occurrence of “Ghost” fragments resulting from single glyco unit rearrangement or mannose core fragmentation within the collision cell. To mitigate this issue, we have established a minimal intensity threshold for these fragments to prevent the misidentification of structure-specific fragments in glycoproteomics analysis. Our findings provide a crucial step forward in the quest for more accurate and reliable glycoproteomics measurements.

Keywords: Glycoproteomics, stable isotope-labelled glycopeptides, Glycopeptide fragmentation, sialic acid rearrangement, Mass Spectrometry

Graphical abstract

graphic file with name nihpp-2023.05.17.541150v1-f0001.jpg

LETTER

Precise characterization of protein glycosylation, in particular glycan structure analysis at a peptide specific site, has always been hampered by the microheterogeneity and nonlinear structural complexity of glycans 1. In an attempt to overcome this challenge, mass spectrometry (MS) workflows and glycoproteomics software packages have been heavily implemented 2, 3. Partial characterization of glycan composition, intact precursors or fragment ions, were analysed by a variety of software tools such as Byonic 4, pGlyco 2.0 5, GPQuest 6, GPSeeker, MetaMorpheus 7, MSFragger-Glyco 8, and StrucGP 9. Although these types of software can serve as valuable tools for structural and site-specific glycan interpretation, this analysis still requires a good degree of manual data examination/curation, as the software readouts may lead to the assignment of false positive oxonium ion fragments.

Researchers have made noteworthy discoveries regarding glycan rearrangements during collision events at specific energies. These observations include rearrangements of hexose, as demonstrated by Wuhrer, Koeleman and colleagues in 2009 10, and fucose, as shown in studies by Wuhrer, Koeleman et al. in 2006 11 and Acs, Ozohanics et al. in 2018 12. The results of these studies emphasized the critical importance of setting precise thresholds for particular fragments. This practice is crucial to prevent the erroneous identification of motifs that are unique to a structure and could potentially cause significant problems.

Our team, along with other researchers, have developed a technique to determine the structural information from the tandem mass spectra of glycopeptides 13, 14. We used low collision energy beam type fragmentation to precisely assign outer antennary structural motifs and, in addition, we used structure specific fragments for their quantitation 15, 16. By modulating the collision energy, we are able to identify specific glycopeptide motifs with unparalleled accuracy (Figure 1). However, our work with synthetic standards has revealed a discrepancy between the structure-specific ions and their synthetic counterparts.

Figure 1.

Figure 1.

MS/MS spectra of the following structural motifs of G2FS IgG glycopeptide recorded under low collision energy NCE 10 (upper panel) and high collision energy NCE 45 (lower panel).

In this study, we utilized synthetic isotope-labelled standards (SIS) of glycopeptides and CE modulation to explore the existence of false positive signatures of specific outer antennary structures such as LacdiNAc, outer arm fucosylation, or outer arm GlcNAc sialylation and fucosylation, known as “Ghost “ fragment oxonium ions. To ensure the accuracy of our findings, we used IgG glycopeptide standards with 13C stable isotope incorporated into the reducing terminal GlcNAc, which creates a 6 Dalton difference compared to the natural IgG1 glycopeptides 17. This step was crucial in our research to avoid the contamination by natural isomers s that could compromise the reliability of our results.

Our LC-MS/MS-PRM workflow (detailed in the Supplementary information/methods) allowed us to record High Collision Dissociation (HCD) spectra of the tryptic digest of IgG1 glycopeptides and investigated fragmentation of five IgG glycopeptide standards (G0F, G1F, G2F, S1G2F, and S2G2F structures). We created separate PRM methods with transition lists with Normalized Collision Energy (NCE) range 10–60 using step 5 for each glycopeptide standard, and recorded spectra for charge state 3+ and 2+. This allowed us to estimate the relative abundance of “Ghost” oxonium ions to prevent false positive glyco-structure assignments.

For instance, fragment m/z 407.2 is commonly described and used as the signature for LacdiNAc (GalNAc-GlcNAc) outer antennae specific structure 9. However, isobaric diHexNAc fragment can theoretically be produced by fragmentation of the core of chitobiose (GlcNAc-GlcNA) (Figure 2). It is practically impossible to separate these isobaric ions in complex biological matrices without structure-specific separation of oxonium ions utilizing ion mobility mass spectrometry 18. Incorporating stable isotope 13C into GlcNAc produces a 6 Dalton difference between the “heavy” and “light” isotopic glycopeptides resulting in fragment ion m/z 413.2 in the diHexNAc fragment ion created by the chitobiose core fragmentation (Figure 2). Thus, with CE modulation, we were able to calculate a threshold for the false-positive LacdiNAc structure assignment (Figure 3 and Table 1). The area of the integrated peak was used for further data processing. We determined that the intensity of false positive LacdiNAc ion could be up to 3.58% of relative intensity at collision energy NCE 45. Fucose rearrangement was described previously by us and others 11, 12, 14. We evaluated the production of false-positive LacNAc-fucose ion (m/z 512.2), SialoLacNAc-fucose ion (m/z 803.3) and GlcNAc-fucose ion (m/z 350.1) and determined that the false outer arm LacNAc-fucose fragment could be up to 2.89%, the SialoLacNAc-fucose fragment up to 3.19%, and GlcNAc-fucose up to 1,81%, all at collision energy NCE 15. In case of the extended structures such as sialylated glycans the false production is whole fucosylated antenna (m/z 803.2) followed by fucosylated antenna fragments (m/z 512.2) and fucosylated GlcNAc (m/z 350.1) with the optimum slightly shifted to higher collision energies. Fragment m/z 350 is in this case specific for outer antennary structure because terminal GlcNAc is stable isotope labelled and fragment m/z 356 was not identified in any spectra. Therefore, false positive determination of outer antennary fucosylation is high under low collision energy commonly used for the glycan structure assignment.

Figure 2.

Figure 2.

Schematic depiction of LacdiNAc specific ions and fucose and sialic acid rearrangements.

Figure 3.

Figure 3.

Collision energy settings for PRM of labelled IgG1 FC glycopeptide standards. Absolute intensity of fragment ions m/z 350; 413.2; 512.2; 495.2 and 803.3 as function of collision energy for structures G0F (A), G1F (B), G2F (C), SG2F (D) and S2G2F €.

Table 1.

Threshold percentages of MS/MS relative abundance of “Ghost” oxonium ions for false positive glyco-structure assignment.

IgG santard Fragment ion m/z
413.2 (407.2) (%) 350,1 (%) 512.2 (%) 803.3 (%) 495.2 (%)
G0F 0,7 (NCE 45) 1,43 (NCE 15) 0,48 (NCE 15)
G1F 0,48 (NCE 45) 1,81 (NCE 15) 2,18 (NCE 15) - -
G2F 0,41 (NCE 45) 0,29 (NCE 15) 2,89 (NCE 15) - -
S1G2F 3,27 (NCE 45) 0,2 (NCE 20) 2,47 (NCE 15) 1,08 (NCE 15) 0,2 (NCE 15)
S2G2F 3,58 (NCE 45) 0,24 (NCE 20) 1,67 (NCE 20) 3,19 (NCE 15) 0,58 (NCE 15)

The sialic acid attached to the GlcNAc is an atypical sialylated glycan that was described in human samples in some previous studies 19, 20. These glycoproteomics studies used ion (SA-HexNAc) m/z 495.2 as the signature for this unusual (non-human) structure. The m/z 495.2 ion is commonly produced by fragmentation of a sialo-LacdiNAc structure (Figure 2) or Sialo-T antigen14.To our knowledge, this is the first-time sialic acid rearrangement was described, similar to fucose rearrangement using HCD fragmentation. Production of m/z 495.2 ion showed very similar characteristics compared to fucose rearrangement and could be falsely used for the determination of SA-GlcNAc linkage. We determined a threshold percentage of 0.58% of 495.2 at 15% collision energy relative intensity as the highest false positive signal of SA-HexNAc outer arm structure. As expected, CE dependent fragmentation, profiles for fucose and sialic acid rearrangement demonstrated similar pattern (Figure 3). In other words, if the percentage of this m/z 495.2 ion exceeds 0.58% under the specified conditions, it is likely to be a false positive result for the SA-GlcNAc linkage. Additionally, the fragmentation profiles for fucose and sialic acid rearrangement showed a similar pattern. This means that the sialic acid rearrangement can be distinguished from fucose rearrangement based on the different patterns observed in the fragmentation profiles. Overall, the discovery of sialic acid rearrangement is an important contribution to the field of glycobiology and has implications for the accurate identification of specific glycan structures in human samples.

Despite recent advances in glycopeptide analysis, there is still a lack of optimal fragmentation conditions to distinguish between false-positive fragments resulting from rearrangement and those produced by fragmentation of the mannose core. As a result, it is essential to establish threshold settings and conduct further investigations to minimize false assignments of glycopeptide structures. By doing so, we can improve the accuracy and reliability of glycopeptide analysis, ultimately leading to a better understanding of the roles of glycosylation in various biological processes.

Supplementary Material

Supplement 1
media-1.docx (16.5KB, docx)

ACKNOWLEDGMENTS:

We thank Ms. He Chen and Mr. Zhihao Zheng for technical assistance. This work was supported in part by the National Institutes of Health NIH grants U01CA230692 to MS

REFERENCES:

  • (1).Zhu Z.; Desaire H. Carbohydrates on Proteins: Site-Specific Glycosylation Analysis by Mass Spectrometry. Annu Rev Anal Chem (Palo Alto Calif) 2015, 8, 463–483. DOI: 10.1146/annurev-anchem-071114-040240. [DOI] [PubMed] [Google Scholar]
  • (2).Goldman R.; Sanda M. Targeted methods for quantitative analysis of protein glycosylation. Proteomics Clin Appl 2015, 9 (1–2), 17–32. DOI: 10.1002/prca.201400152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (3).Palaniappan K. K.; Bertozzi C. R. Chemical Glycoproteomics. Chem Rev 2016, 116 (23), 14277–14306. DOI: 10.1021/acs.chemrev.6b00023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (4).Bern M.; Kil Y. J.; Becker C. Byonic: advanced peptide and protein identification software. Curr Protoc Bioinformatics 2012, Chapter 13, 13 20 11-13 20 14. DOI: 10.1002/0471250953.bi1320s40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (5).Liu M. Q.; Zeng W. F.; Fang P.; Cao W. Q.; Liu C.; Yan G. Q.; Zhang Y.; Peng C.; Wu J. Q.; Zhang X. J.; et al. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat Commun 2017, 8 (1), 438. DOI: 10.1038/s41467-017-00535-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (6).Toghi Eshghi S.; Shah P.; Yang W.; Li X.; Zhang H. GPQuest: A Spectral Library Matching Algorithm for Site-Specific Assignment of Tandem Mass Spectra to Intact N-glycopeptides. Anal Chem 2015, 87 (10), 5181–5188. DOI: 10.1021/acs.analchem.5b00024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (7).Lu L.; Riley N. M.; Shortreed M. R.; Bertozzi C. R.; Smith L. M. O-Pair Search with MetaMorpheus for O-glycopeptide characterization. Nat Methods 2020, 17 (11), 1133–1138. DOI: 10.1038/s41592-020-00985-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (8).Polasky D. A.; Yu F.; Teo G. C.; Nesvizhskii A. I. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat Methods 2020, 17 (11), 1125–1132. DOI: 10.1038/s41592-020-0967-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (9).Shen J.; Jia L.; Dang L.; Su Y.; Zhang J.; Xu Y.; Zhu B.; Chen Z.; Wu J.; Lan R.; et al. StrucGP: de novo structural sequencing of site-specific N-glycan on glycoproteins using a modularization strategy. Nat Methods 2021, 18 (8), 921–929. DOI: 10.1038/s41592-021-01209-0. [DOI] [PubMed] [Google Scholar]
  • (10).Wuhrer M.; Koeleman C. A.; Deelder A. M. Hexose rearrangements upon fragmentation of N-glycopeptides and reductively aminated N-glycans. Anal Chem 2009, 81 (11), 4422–4432. DOI: 10.1021/ac900278q. [DOI] [PubMed] [Google Scholar]
  • (11).Wuhrer M.; Koeleman C. A.; Hokke C. H.; Deelder A. M. Mass spectrometry of proton adducts of fucosylated N-glycans: fucose transfer between antennae gives rise to misleading fragments. Rapid Commun Mass Spectrom 2006, 20 (11), 1747–1754. DOI: 10.1002/rcm.2509. [DOI] [PubMed] [Google Scholar]
  • (12).Acs A.; Ozohanics O.; Vekey K.; Drahos L.; Turiak L. Distinguishing Core and Antenna Fucosylated Glycopeptides Based on Low-Energy Tandem Mass Spectra. Anal Chem 2018, 90 (21), 12776–12782. DOI: 10.1021/acs.analchem.8b03140. [DOI] [PubMed] [Google Scholar]
  • (13).Maliepaard J. C. L.; Damen J. M. A.; Boons G.-J. P. H.; Reiding K. R. Glycoproteomics-compatible MS/MS-based quantification of glycopeptide isomers. bioRxiv 2023, 2023.2001.2031.526390. DOI: 10.1101/2023.01.31.526390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (14).Sanda M.; Benicky J.; Goldman R. Low Collision Energy Fragmentation in Structure-Specific Glycoproteomics Analysis. Anal Chem 2020, 92 (12), 8262–8267. DOI: 10.1021/acs.analchem.0c00519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (15).Sanda M.; Ahn J.; Kozlik P.; Goldman R. Analysis of site and structure specific core fucosylation in liver cirrhosis using exoglycosidase-assisted data-independent LC-MS/MS. Sci Rep 2021, 11 (1), 23273. DOI: 10.1038/s41598-021-02838-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (16).Sanda M.; Pompach P.; Brnakova Z.; Wu J.; Makambi K.; Goldman R. Quantitative liquid chromatography-mass spectrometry-multiple reaction monitoring (LC-MS-MRM) analysis of site-specific glycoforms of haptoglobin in liver disease. Mol Cell Proteomics 2013, 12 (5), 1294–1305. DOI: 10.1074/mcp.M112.023325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (17).Wang S.; Liu D.; Qu J.; Zhu H.; Chen C.; Gibbons C.; Greenway H.; Wang P.; Bollag R. J.; Liu K.; et al. Streamlined Subclass-Specific Absolute Quantification of Serum IgG Glycopeptides Using Synthetic Isotope-Labeled Standards. Anal Chem 2021, 93 (10), 4449–4455. DOI: 10.1021/acs.analchem.0c04462. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (18).Sanda M.; Morrison L.; Goldman R. N- and O-Glycosylation of the SARS-CoV-2 Spike Protein. Anal Chem 2021, 93 (4), 2003–2009. DOI: 10.1021/acs.analchem.0c03173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (19).Miyazaki K.; Ohmori K.; Izawa M.; Koike T.; Kumamoto K.; Furukawa K.; Ando T.; Kiso M.; Yamaji T.; Hashimoto Y.; et al. Loss of disialyl Lewis(a), the ligand for lymphocyte inhibitory receptor sialic acid-binding immunoglobulin-like lectin-7 (Siglec-7) associated with increased sialyl Lewis(a) expression on human colon cancers. Cancer Res 2004, 64 (13), 4498–4505. DOI: 10.1158/0008-5472.CAN-03-3614. [DOI] [PubMed] [Google Scholar]
  • (20).Tsuchida A.; Okajima T.; Furukawa K.; Ando T.; Ishida H.; Yoshida A.; Nakamura Y.; Kannagi R.; Kiso M.; Furukawa K. Synthesis of disialyl Lewis a (Le(a)) structure in colon cancer cell lines by a sialyltransferase, ST6GalNAc VI, responsible for the synthesis of alpha-series gangliosides. J Biol Chem 2003, 278 (25), 22787–22794. DOI: 10.1074/jbc.M211034200. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.docx (16.5KB, docx)

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES