Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2025 Jul 3;36(8):1686–1695. doi: 10.1021/jasms.5c00093

De Novo Glycan Annotation of Mass Spectrometry Data

Margot Bligh †,‡,*, Sebastian Silva-Solar , Linda Biehler †,, Christopher C J Fitzgerald †,, Conor J Crawford §, Mikkel Schultz-Johansen †,, Sofie Niggemeier †,, Peter H Seeberger §,, Manuel Liebeke †,, Jan-Hendrik Hehemann †,
PMCID: PMC12333352  PMID: 40605194

Abstract

Carbohydrates are fundamental molecules of life that are involved in virtually all biological processes. The chemical diversity of glycanscarbohydrate chainsenables diverse functions but also challenges analytics. Annotation of glycans in mass spectrometry (MS) data relies heavily on experimental databases or manual calculations, hindering the discovery of novel glycan compositions and structures. Here, we introduce GlycoAnnotateRa package in the open-source programming language Rfor de novo annotation of glycan compositions in MS data. GlycoAnnotateR calculates all possible monomer and modification combinations, which are then filtered against a defined set of chemical rules to provide biologically relevant compositions. The “glycoPredict” function can return compositions for oligosaccharides ranging from 1 to 22 monomers in length while accounting for four different modifications in under 10 min with less than 4 GB of random-access memory (RAM). Here, three case studies demonstrate the efficacy and versatility of GlycoAnnotateR: (1) accurate identification of mono- and oligosaccharide standards, (2) characterization of sulfated fucan oligosaccharides obtained by enzymatic digestion of fucoidan, a complex algal glycan, and (3) reproduction and expansion of glycan annotations for a published mouse lung MALDI-MS imaging data set previously annotated by NGlycDB. GlycoAnnotateR rapidly provides accurate annotations and complements existing R packages for MS data processing, enabling metabolomic and glycomic data integration. This combinatorial, rule-based approach enhances glycan annotation capabilities and supports hypothesis generation in glycoscience, expanding our ability to explore the chemical space of glycan diversity.

Keywords: glycans; carbohydrates; mass spectrometry; R, annotation tool


graphic file with name js5c00093_0006.jpg


graphic file with name js5c00093_0005.jpg

Introduction

Carbohydrates are one of the fundamental molecules of life. Myriad biological functionsincluding energy storage, cell structure, signaling, communication and defenserely on glycans, , which are carbohydrate polymers. The carbon fixed from carbon dioxide via photosynthesis is incorporated into carbohydrates, meaning most organic carbon on Earth is funneled through a carbohydrate step. Glycosylation, the attachment of carbohydrate chains to proteins, is the most complex and common post-translational modification. The importance of glycans in human health and disease has been increasingly recognized over the last two decades, with over 100 human disorders associated with major defects in glycosylation pathways. These diverse functions are enabled through the chemical diversity of carbohydrates, which far outweighs that of DNA or proteins. ,

Glycan diversity and other analytical challenges mean that high-throughput, accessible sequencing of carbohydrates remains unattainable to date. As in proteomics, mass spectrometry (MS) is the most advanced technique in glycomics, offering high sensitivity and accuracy. Glycans are typically ionized by electrospray ionization (ESI) or matrix assisted laser desorption ionization (MALDI). In combination with chromatography or orthogonal techniques such as ion mobility, or with certain fragmentation techniques, MS provides the ability to separate and identify isomeric carbohydrates. − ,,

Glycan annotation is the processing of assigning glycan identities to peaks in mass spectra. Glycan compositioni.e., the number of different monomers and modifications in a glycan structureis the most basic level of annotation, and can be achieved by comparison of observed and theoretical m/z values. For example, four desialylated N-glycans derived from bovine fibrinogen, a blood glycoprotein, were identified as constituting five hexose monomers and four N-acetylhexosamine monomers based on MS data by Fellenberg et al.; m/z values of 821.304–821.306 were observed, which deviate from the theoretical m/z value of the [M+2H]2+ adduct by 0.1–2.5 ppm. The four isomers were separated by porous graphitized carbon (PGC) liquid chromatography (LC). In the same study, MS/MS resolved the sequence of the monomers in each of these branched glycan structures, as well as the structures of five additional branched glycans. For example, a fragment ion with an m/z value of 366.129 was annotated as a dimer constituting one hexose and one N-acetylhexosamine (dehydrated, [M + H]+ ion). The intensity of this ion was diagnostic for a terminal Hex-HexNAc sequence on the glycan branches. Higher order structural information such as the stereochemistry of carbohydrate hydroxyl groups and glycosidic linkages, and the position of modifications and linkages may require inclusion of additional techniques beyond MS, such as NMR. The practical relevance of glycan annotation is manifest in the structure–function relationships of glycans; staying with the fibrinogen example, the level of sialylation of fibrinogen N-glycans impacts polymerization of blood clots, and therefore deciphering glycan structures by means of mass spectrometry has clinical implications.

Automated glycan annotation currently depends primarily on experimental databases. Database matching is proposed to be sufficient for the analysis of specific glycomes that have been intensively studied, such as the serum N-glycome and the human milk glycome. The term glycome is the glycan equivalent of the proteome, and refers to the complete set of glycan structures present in a defined biological sample. However, despite the many technological advances in glycobiology over the last decades, achieving complete annotated glycan libraries for the many other glycomes of interest to biomedical, industrial, environmental and other fields remains a monumental task. As glycan structures are not encoded directly in DNA, but rather processively synthesized by a series of glycosyltransferases, glycosidases and other transferase enzymes in reactions regulated by the metabolic state of the cell and other factors, it is not currently possible to predict a glycome based only on the genome or proteome of an organism. While the nonredundant (nr) nucleotide and protein databases hosted by NCBI currently contain 97 and 595 million sequences respectively, the three main glycan structure databases GlyConnect, KEGG GLYCAN and GlyTouCan currently comprise 5,609, 11,538 and 250,837 glycan structures, respectively. The differences in the sizes of these glycan databases reflect their level of curation (high to none). These databases are patently insufficient to cover glycan diversity; for context, 45 human glycosyltransferases could theoretically synthesize more than 1.1 million N-glycan structures of 15 monomers or less. Thus, annotating glycans relying on experimental databases means that many glycans are likely to be overlooked. Identification of novel glycan compositions and structures is crucial to progress not only in glycomics but life science research in general given the prevalence of glycosylation modifications across life. For example, in 2021 it was discovered using click chemistry that small noncoding RNAs carry sialylated glycans, despite no previous link between RNA and glycans in nature. Mass spectrometry enabled identification of the glycans on RNA in this study. Easy to use and free tools for annotation of novel glycan compositions in MS data will be of importance for the growth of glycomics in medical and other life sciences.

Current options for annotation of glycans in MS data are also difficult to incorporate into pipelines with open-source tools and provide relatively limited output, hindering untargeted glycan analyses. The semiautomatic GlycoWorkBench is a widely used tool to assist interpretation of glycan mass spectra, but it is a stand-alone application, and is no longer maintained. GlycoWorkBench was part of the EuroCarb initiative that is no longer funded. Therefore, other EuroCarb web-based tools such as Glyco-peakFinder are not available. GlycoMod is a tool to annotate a short list of glycan masses, but a limited number of options (e.g., adduct types) can be queried at one time and the output has low interoperability. Promising techniques based on deep learning are developing but currently remain challenged by the sparsity of glycobiology data sets. In practice, annotation of unknown and novel glycans, especially those not part of glycoproteins such as marine algal glycans, currently relies heavily on manual calculations, which quickly becomes a prohibitively lengthy task for complex mixtures. A broadly applicable, simple, open-source tool for annotation of glycan compositions that can be incorporated into existing metabolomics workflows, for example XCMS- or CARDINAL-based pipelines in R, , is needed.

To this end we created ‘GlycoAnnotateR’ (https://github.com/margotbligh/GlycoAnnotateR), an R package for de novo annotation of glycan compositions in MS data. It has three main modules: (i) calculation of possible compositions given constraining parameters (glycoPredict); (ii) annotation of putative oligosaccharide ions in MS data by comparison with theoretical m/z values of calculated compositions (glycoAnnotate); and (iii) retrieval of MS/MS peaks in XCMS objects with annotated precursor ions (glycoMS2Extract), and annotation of products ions based on the precursor annotation(s) (glycoMS2Annotate). Calculation of glycan compositions was enabled by derivation of a simple combinatorial equation to describe compositional possibilities, and construction of a set of chemical rules to filter output to meaningful possibilities. Our inclusive approach ensures signals from as many different glycans as possible are recruited, and therefore enables definition of a space nearing the potential glycome. Three use cases presented here show the utility and versatility of GlycoAnnotateR.

Experimental Section

Vendor-specific MS data files were converted to mzML format using the ProteoWizard tool “msconvert” v3.0.20239 with a “peakPicking” filter, unless specified otherwise. Detailed information on chemical rules, oligosaccharide synthesis, and sample and data processing for the three use cases is provided in the Supporting Information.

GlycoAnnotateR

The base code for GlycoAnnotateR was written in Python, which is called in the R package via reticulate. In the “glycoPredict” function, compositions are first “calculated” according to eq , then filtered by the number of modifications per monomer, a set of defined chemical rules (Table S1), and user input. The “glycoAnnotateR” and “glycoMS2Annotate” functions annotate user data by comparison of experimental and calculated m/z values, with “glycoPredict” parameters for the latter defined by precursor annotations. Detailed documentation on all parameters and three tutorials can be found on GitHub (https://margotbligh.github.io/GlycoAnnotateR/). Briefly, users can constrain the degree of polymerization (DP) range, modifications, maximum number of modifications per monomer on average, monomer types, scan range, polarity, ionization type (ESI or MALDI), adducts, labels from derivatization, and compositional rules. There is an additional option to allow for double sulfation. Hexose, pentose, deoxyhexose and sialic acid are referred to as monomers in the naming, while all other components are considered modifications. For example, N-acetylglucosamine is referred to as “Hex1 N-Acetyl1” in our approximation of IUPAC naming. If calculated compositions are in the GlycoConnect database they are linked by ID (database downloaded in tsv format in March 2024). Benchmarking of “glycoPredict” was performed with a range of parameters using Snakemake.

Use Case 1: Commercial and Synthetic Standards

Commercial mono- and oligosaccharide standards were dissolved in Milli-Q-water (Table S2). The mixture of 16 analytes was analyzed by LC-MS/MS using hydrophilic interaction liquid chromatography (HILIC) separation (Accucore 150 Amide, Thermo Fisher) and an Orbitrap mass spectrometer (Thermo Fisher Q-Exactive Plus). Standards were first analyzed in full MS mode (positive and negative polarity), and then with data-dependent MS/MS with an inclusion list (Table S3). Preprocessing (peak picking, merging, grouping and filling) was performed with XCMS v4.0.2 in R. Resulting features were grouped by retention time to enable isotope detection with CAMERA v1.56.0. Positive and negative features were annotated at 1.5 and 2 ppm, respectively. MS/MS spectra associated with annotated features were extracted, processed with MSnbase, and then fragment ions were annotated at 3 ppm.

Defined carbohydrate structures were synthesized: (1) fucan hexasaccharide; (2) glucan pentasaccharide; (3) sulfated fucan disaccharide; (4) mannan trisaccharide; (5) sulfated mannose monosaccharide. Structures 2 and 3 had an amino-pentyl linker (Figure S1). The automated glycan assembly of structures 1, 3 and 4 followed published protocols. , Structure 2 was prepared with dibutyl phosphate donors. Structure 5 was synthesized using solution-phase synthesis. Purification and analysis by HPLC were performed on an Agilent 1200 series equipped with an evaporative light scattering (ELS) detector (G4260B) and a single quadrupole mass spectrometer (G6135B). Mass spectra were obtained using an Agilent 6210 ESI-TOF mass spectrometer (QTOF). QTOF data were imported into R with MSnbase. Files were filtered by retention time to a 15 s window according to the total ion chromatogram peak (Figure S2). Spectra were then averaged, normalized, and filtered at a normalized intensity of 0.8 and annotated at 15 ppm. HPLC-ELS-MS files were parsed with the python package ‘rainbow’. Ion chromatograms were extracted based on the annotations assigned to the corresponding QTOF data (±0.1 Da).

Use Case 2: Fucoidan Enzyme Digest

fucoidan (Carbosynth; 3 mg mL–1) was digested overnight with the recombinantly expressed GH107 endo-fucoidanase Wv323. The negative control contained heat-inactivated enzyme. Digests were desalted using silica spin columns, with loading at 95% (v/v) ethanol and elution of oligosaccharides at 85% (v/v) ethanol. Crude digests and flow-through fractions were visualized by carbohydrate polyacrylamide gel electrophoresis (C-PAGE) with Stains-All dye. , Digests were analyzed in negative polarity by LC-MS with the same instrumentation as commercial standards. LC-MS data were processed with XCMS and GlycoAnnotateR to generate a timed inclusion list (Table S4) for LC-MS/MS with targeted selected ion monitoring and data-dependent MS/MS acquisition. MS/MS spectra were extracted and annotated with GlycoAnnotateR. Data files are available on MassIVE (doi:10.25345/C57D2QK2 V).

Use Case 3: Publicly Available MALDI-MSI Data Set

NGlycDB v1 annotations assigned at a false discovery rate (FDR) of 5% to a mouse lung MALDI-FTICR data set (“20210401_lung_p27_1”) were downloaded in csv format from METASPACE. The m/z values in the downloaded table were reannotated by GlycoAnnnotateR, using a custom database built with glycoPredict for mammalian N-glycans. The imzML file was also downloaded from METASPACE, and preprocessed (peak picking, aligning and binning) with Cardinal v3.2.1 in R. The same custom database was used for annotation of preprocessed data. Annotated features were then segmented using spatial shrunken centroids.

Results and Discussion

Theoretical Compositions Calculated by GlycoAnnotateR

GlycoAnnotateR calculates all possible oligosaccharide compositions, given a set of constraining parameters, to annotate ions (Figure A). Composition is defined here as the number and type of monomers and modifications in an oligosaccharide. For flexibility, we consider pentoses and hexoses as the fundamental monomer blocks. All functional groups therefore become ‘modifications’ (e.g., N-acetyl, amine, carboxylic acid groups). Certain modified monomers e.g., uronic acids are typically considered monomers in their own right. However, our deconvolution allows maximum flexibility in building compositions, as modifications are not assigned to any specific monomers. The maximum number of different compositions can be obtained following eq

N=i=dD(i+1)m+c1 1

where N is the number of different compositions, d is the lowest degree of polymerization (DP), D is the highest DP, m is the number of different modifications, and c is the number of different monomer types (defined by number of C in ring). Possible modifications include carboxylic acid, sialic acid, deoxy, phosphate, sulfate, N-acetyl, O-acetyl, O-methyl, anhydro-bridge and amino. GlycoAnnotateR also supports inclusion of labels or linkers added to glycans e.g., an amino-pentyl linker, and custom labels or modifications. If two types of monomers are considered, i.e., hexose and pentose (m = 0, c = 2), N corresponds to the summation of the number of multisubsets of size i from a set of size 2, as represented with a Pascal’s triangle (Figure B). N grows exponentially with the number of different modifications. With a DP of two, one type of monomer and modification (e.g., O-methyl; i = 2, c = 1, m = 1), three compositions are possible as (2 + 1)1 = 3: Hex2, Hex2 O-Methyl1, Hex2 O-Methyl2. There are nine possible compositions with two modifications (e.g., O-methyl and amino: i = 2, c = 1, m = 2) as (2 + 1)2 = 9 (Figure A, step 3). If pentose and hexose monomers are considered (i = 2, c = 2, m = 2), N increases to 27 ((2 + 1)3 = 27) (Table S5). The high compositional diversity of oligosaccharides (Figure D) demands the abstracted calculation approach taken here in order to include novel compositions.

1.

1

Calculations for the total number of possible compositions for modified hexose and/or pentose-based oligosaccharides. (A) Example of the basic workflow of GlycoAnnotateR. (1) Hexose monomers of the chosen degree of polymerization (DP) are “built” (here DP = 2). (2–3) Expansion to account for pentose monomers and modifications (here there are 2 possible modifications). (4) Compositions are filtered by the average number of modifications per monomer; here, the number must be ≤ 1. Grayed out compositions are removed. (5) Compositions are filtered by a set of defined chemical rules. Here, the “dot modification” cannot be on a hexose. (6) Expansion for adducts. Here, H, Na and K adducts are considered for negative MALDI ionization. The ‘triangle modification’ represents an anionic group, therefore [M-2H+Na] and [M-2H+K] are computed for compositions with a triangle modification. (7) Annotation of experimental m/z value(s) by comparison with calculated m/z values. (B) When c is 2 (i.e., hexose and pentose monomers are possible), the number of nonmodified compositions (N) equals DP plus one. Row number is equal to DP, the number of columns in each row is N for that row, and each number represents the total number of ordered combinations. (C) The numbers of possible isomers for linear oligosaccharides composed of only hexoses, or hexoses and pentoses, of DP1–6 (eq ). (D) The numbers of possible compositions for oligosaccharides of DP 1–6 summarized with c values of 1 (left, hexose or pentose) and 2 (right, hexose and pentose) and 0–2 modifications (m) (eq ).

Structural diversity is considered in eq

I=2d×2H+3d×d!H!(dH)!×(4H×3dH×(dH3d+H4d))×2d 2

where I is the number of isomers, d is the DP, and H is the number of hexose monomers. eq is a generalization of Laine’s 1994 calculations for linear structures of hexoses that account for linkage position and stereochemistry, and order, stereochemistry and ring form of monomers. Equation additionally considers pentose monomers. The number of linear structures possible for a hexasaccharide increases to 518,781,583,491,072 (519 trillion) (Figure C). We could not account for modifications in eq due to specific constraints of each functional group, which challenged summarization in combinatorial form. Equation is explained in detail in the Supporting Information.

Compositions calculated in eq are filtered to be meaningful. Equation assumes (i) each modification can occur once per monomer, and (ii) each monomer can have all modifications at once. These assumptions are however not always true. Monomers typically carry at most 3 modifications, for example, disulfated fucose in fucoidans, but generally this number is much lower. Computed compositions are filtered by the average number of modifications per monomer, with a maximum allowed value of 3. Further filtering is applied based on a set of rules for which modifications can occur on the same monomer, and on which monomers (Table S1), which was compiled from investigation of published structures and chemical rules (see Supporting Information for details). Users can also manually limit specific elements or include the limits previously described for N- and O-linked glycans.

The function “glycoPredict”, which calculates compositions as described, finishes in less than 10 min using less than 4 Gb of random-access memory (RAM) for a DP range of 1 to 22, accounting for four different modifications, and pentose and hexose monomers (Figure S3). Therefore, it will easily run on a standard computer.

GlycoAnnotateR Accurately Annotates Simple Glycans

The accuracy of GlycoAnnotateR was validated with LC-MS/MS analysis of 16 commercial carbohydrate standards ranging from DP 1–6 and featuring alditol, deoxy, carboxylic acid, O-methyl, amino, N-acetyl, sulfate, and anhydro-bridge modifications. All 15 detectable standards were correctly annotated in positive and/or negative polarities after XCMS and CAMERA preprocessing in R (Table S6, Figure A, Figure S4). No galacturonic acid peak was detected due to streaking and poor ionizability (Figure S5). Annotation of multiple adducts in different polarities and isotope detection helped to resolve ambiguous annotations in MS1 data (Table S6, Figure A, Figure S6). Negative polarity MS/MS spectra associated with annotations were extracted and annotated with GlycoAnnotateR to confirm MS1 annotations (e.g., Figure C). Fourteen ions deriving from impurities and in-source fragmentation, distinguishable by retention time, were also annotated (Table S6, Figure S4). The latter were frequently annotated as “AnhydroBridge”-containing compounds, likely resulting from dehydration reactions during ionization. “AnhydroBridge”, “Unsaturated”, and “Dehydrated” modifications all result from loss of a water, but refer to different structural features (see Supporting Information). “AnhydroBridge” was selected as a possible modification in this analysis for the carrageenan standards. The detection of 14 ions likely corresponding to impurities and in-source fragmentation products even in this simple analysis highlights the need for automated annotation. Annotation of impurities and in-source fragmentation products is important to control for contaminants and reduce the size of the “dark metabolome”. Thus, GlycoAnnotateR, in combination with existing tools, allows users to harness the full sensitivity and complexity of LC-MS data.

2.

2

Commercial and synthetic mono- and oligosaccharide standards accurately annotated by GlycoAnnotateR. Known structures are shown in each panel. Annotations (black text) show the output format of GlycoAnnotateR. (A) Average HILIC-ESI-Orbitrap mass spectrum (retention time 148–155 s) of N-acetylglucosamine with annotations of adducts and isotopes. (B) Averaged and normalized mass spectrum of a synthetic trisulfated fucose dimer directly infused into a QTOF mass spectrometer. Only peaks with normalized intensities >0.8 are shown. One sulfate group was lost during ionization. (C) Fragment ions generated by HILIC-ESI-Orbitrap MS/MS of k-carrageenan DP 4 (Hex4 Anhydrobridge2 Sulfate2 [M-2H]−2, m/z 394.049) and annotated by GlycoAnnotateR. Exact fragment identities are shown in red text.

Since commercial oligosaccharides are typically derived from natural sources, impurities can generally only be avoided by chemical synthesis. To unambiguously demonstrate the accuracy of GlycoAnnotateR, annotations were performed on QTOF data from five synthetic mono- and oligosaccharides (Figure S1). All compounds were assigned a single, correct annotation by GlycoAnnoteR in one 15 ppm annotation step (Figure B, Figure S7).

GlycoAnnotateR Maps Highly Sulfated Oligosaccharides from Fucoidan

To demonstrate its utility in LC-MS data exploration, GlycoAnnotateR was used to identify unknown oligosaccharides obtained by enzymatic hydrolysis of fucoidan. Fucoidans are sulfated fucose-rich polysaccharides from brown algae of growing interest due to their cosmetic, therapeutic and carbon sequestration potential. , Combining characterization of carbohydrate-active enzymes with MS analysis of the oligosaccharides generated by these enzymes is a promising approach for structural characterization of glycans, especially complex glycans such as fucoidan. Different brown algae species synthesize fucoidans with different compositions and structures. , Here, we examined fucoidan from M. pyrifera, a globally important kelp. The GH107 endo-fucoidanases P5AFcnA and Wv323 were recently shown to have similar but distinct activities on α-1,3 fucosyl linkages. , Since GH107_P5AFcnA is active on M. pyrifera fucoidan, , we hypothesized that GH107_Wv323 would also be active. We confirmed this new enzyme–substrate pair by C-PAGE (Figure A), and then proceeded to analyze the liberated oligosaccharides with HILIC separation and an FT-Orbitrap mass spectrometer.

3.

3

Enzymatic digestion of fucoidan with GH107_Wv323 yielded a series of sulfated oligosaccharides. (A) Sulfated oligosaccharide reaction products separated by C-PAGE according to size and charge and visualized with Stains-All dye. , Digest: fucoidan + GH107_Wv323 enzyme. Negative control: fucoidan + inactive GH107_Wv323 enzyme. From left to right the wells were loaded with the crude digest (‘initial’) and filtrates eluted from desalting columns (percentage indicates % ethanol (v/v) with Milli-Q-water). The ladder-like pattern of bands is indicative of endo-acting enzyme activity. (B) Extracted ion chromatograms for oligosaccharide ions that were annotated in LC-MS data by GlycoAnnotateR. SNFG symbols indicate hypothetical structures based on the annotated compositions. Note that complete structures were not resolved in this analysis. Mean m/z values from bottom to top: 243.018, 322.975, 234.013, 273.992, 548.991, 313.969, 628.948, 386.999, 426.977, 466.956, 460.028, 500.007, 539.986, 652.994, 692.972. Purple shaded areas indicate precursor peaks for C-E. (C–E) Normalized fragmentation spectra extracted and annotated by GlycoAnnotateR from selected ion monitoring of the digest. The inclusion list was generated using the information from B. Precursor m/z values and annotations (hypothetical SNFG) are shown in the top left and above each panel, respectively. Annotation formatting represents the output of GlycoAnnotateR. Annotations with n values indicate peaks assigned multiple compositions. Values of n: (C) 1, 2, 3, 4; (D) 1, 2, 3; (E) 1, 2. Spectra were averaged across retention times: (C) 753–815 s; (D) 580–650 s; (E) 400–414 s.

The digest products, first putatively annotated in LC-MS data and then confirmed by LC-MS/MS, were identified as a series of highly sulfated oligosaccharides ranging from DP 1 to 5. All 13 unique compositions assigned to products contained only deoxyhexoses most likely representing fucose, which constitutes 50–90% of fucoidan monomers ,, (Figure B). This analysis resolved only compositions, not structures, and therefore deoxyhexoses cannot be assigned identities. Two hexose-containing annotations were also assigned to peaks, but these peaks were also present in the negative control (Figure S9). After accounting for isomers, adducts and in-source fragmentation, we identified 18 distinct sulfated mono- and oligosaccharide structures in the digest that were absent from the negative control (Figure B, Figure S9, Table S7). We found an average deoxyhexose: sulfate ratio of 1:1.49, 3-fold lower than the reported fucose: sulfate ratio of 1:0.58. , This disparity could indicate differences between fucoidan extracts, or heterogeneous sulfation along the polysaccharide, both of which are known features of fucoidans. , On-the-fly analysis of LC-MS data with XCMS and GlycoAnnotateR generated an inclusion list for a targeted selected ion monitoring data-dependent MS/MS experiment (Table S4). MS/MS spectra were extracted and annotated for eight peaks with GlycoAnnotateR, which confirmed the MS1 annotations (Figure C-E, Table S8–15). Annotations showed fragmentation of labile sulfate, hydroxyl groups (resulting in dehydrations) and glycosidic bonds, consistent with chain shortening and charge losses from the precursor ions. This experiment exemplifies the practical usefulness of GlycoAnnotateR in first assigning putative annotations to unknown, novel oligosaccharides, and then guiding and analyzing MS/MS experiments to confirm those annotations.

GlycoAnnotateR Reproduced and Expanded MALDI-MSI Data Set Annotations

To demonstrate compatibility with MALDI-MS imaging and compare to an established tool, we reanalyzed a mouse lung imaging MALDI-FTICR data set available on METASPACE. This data set originates from the publication of NGlycDB, an N-glycan database derived from GlyConnect that is integrated into METASPACE. GlycoAnnotateR is not presented as a direct alternative to NGlycDB, but rather as a complementary tool for R-based workflows and hypothesis generation.

In a first comparison of only the annotation step, GlycoAnnotateR reproduced all NGlycDB annotations, and assigned additional annotations. The NGlycDB annotation table using the spatially informed false discovery rate (FDR) of 5% exported from METASPACE contained 88 peaks, which we annotated with GlycoAnnotateR. Chosen parameters were suitableaccording to rules defined previously for mammalian N-glycans, which do not typically contain pentose monomers. Except for four peaks that were assigned pentose-containing annotations by NGlycDB, and are therefore likely false annotations, all peaks were assigned the same annotation by NGlycDB and GlycoAnnotateR (Table S16). In addition, 25% of the peaks were assigned more than one annotation by GlycoAnnotateR, with 1.45 ± 1.03 (SD) annotations per peak on average. In some cases, the new annotations had lower mass deviations from the m/z values of the peaks than the NGlycDB annotations (Table S16). GlycoAnnotateR is therefore not only comparable to the established NGlycDB tool, but also generates novel, more diverse annotations potentially important for further investigation.

In an additional application case for GlycoAnnotateR the centroided MALDI-FTICR data were preprocessed in R with Cardinal. Here, GlycoAnnotateR assigned glycan annotations to approximately 15-fold more peaks than NGlycDB. 1,355 peaks were annotated by GlycoAnnotateR. 64 of the 88 NGlycDB annotations (5% FDR) were reproduced in the analysis, with the difference attributable to preprocessing differences between METASPACE and Cardinal. Annotated peaks were segmented into five classes based on their spatial distributions, with 78.8 peaks per class on average. Two distributions identified in the original analysis, corresponding to peaks annotated as N-glycans distributed throughout the lung tissue and at the main aerial vessel, were represented by two classes (Figure A–D, Figure S10). A third class contained peaks abundant at the outer edges of the tissue section, potentially the mesothelium, where sulfated glycans have been found (Figure F–I). The top-ranked peaks in this class were annotated as trisulfated glycans (Table S17, Figure S11), none of which are currently in GlyConnect. Many of the annotated ions in this class appear to represent fragments or adducts of each other. For example, three ions that colocalized to the outer edge have masses matching those of fragments of a putative, trisulfated hybrid N-glycan (Figure E–I; Table S18). Two ions in this group are annotated as adducts of the same N-glycan fragment (Figure G,H). While these annotated compositions require confirmation, for example by MS/MS, they suggest a potential biologically meaningful pattern of glycans with novel compositions.

4.

4

GlycoAnnotateR reveals a pattern of ions annotated as trisulfated glycans localized to the edges of a mouse lung tissue section. The mouse lung imaging MALDI-FTICR data (METASPACE: “20210401_lung_p27_1”) was preprocessed with Cardinal and annotated by GlycoAnnotateR. Example ion images are shown for peaks in different classes after segmentation analysis of annotated peaks. The m/z values are indicated below each panel (±0.01 Da). Colors indicate normalized ion intensities. Annotated compositions are represented with SNFG symbols. (A,B) Example images for a class distributed throughout the tissue: Hex8 N-Acetyl2: [M + Na]+ and Hex9 N-Acetyl3: [M + Na]+ respectively. Annotated compositions are represented with SNFG symbols. (C,D) Example images for a class colocalized with the main aerial vessel: Hex9 DeoxyHex1 N-Acetyl6: [M + Na]+ and Hex9 DeoxyHex2 N-Acetyl6: [M + Na]+ respectively. (E) Hypothetical structure of a trisulfated, hybrid N-glycan. The ion image for the theoretical m/z value of the [M-4H+5Na]+ ion of this structure is shown in (F). Bond cleavages shown by arrows could have generated the ions shown in (G–I), which are annotated as Hex7 N-Acetyl3 NeuAc1 O-Acetyl1 Sulfate3 [M-4H+5Na]+ and [M-3H+4Na]+, and Hex7 N-Acetyl3 Sulfate3 [M-2H+3Na]+ respectively.

Conclusion

Here, we demonstrate that GlycoAnnotateR facilitates the de novo annotation of glycan compositions in MS data. We show the flexibility of the tool across multiple MS platforms, including with ESI and MALDI ionization, and QTOF, quadrupole, Orbitrap and FTICR mass spectrometers. GlycoAnnotateR works across different R-based mass spectrometry pipelines with different packages, allowing for integration with metabolomics data sets. Such integration will be required for systems-level glycobiology. GlycoAnnotateR allows for a diversity of monomers, modifications, and labels added by derivatization, aiding the analysis of glycans from diverse systems and prepared in diverse workflows. Utility was demonstrated here for commercial and synthetic standards, a sulfated algal glycan, and mammalian glycans. The inclusion of diagnostic fragment ions in future iterations of the tool will further increase its utility. The calculation component driving the tool means that novel glycan structures can be identified for further analysis, for example in biomarker discovery. Once validated, these structures can then populate the currently sparse glycan databases, facilitating future structure–function studies to understand the biological roles of glycans.

Supplementary Material

js5c00093_si_001.pdf (2.6MB, pdf)

Acknowledgments

M.B., L.B., C.C.J.F., S.N. and J.-H.H. acknowledge support from the Cluster of Excellence initiative (EXC-2077–390741603), the DFG (HE 7217/1-1) and Heisenberg grant Glyco-Carbon Cycling in the Ocean, and the European Research Council grant C-Quest (47010790). Generous financial support of the Max-Planck Society is gratefully acknowledged (M.B., S.S.S., M.L., C.J.C, P.H.S., J.-H.H.). C.J.C. was funded by MSCA grant MARINEGLYCAN (101029842). M.S.-J. received funding from the BMBF SNAP BlueBio project (Grant 161B0943).

LC-MS data for commercial standards are available on Github (https://github.com/margotbligh/GlycoAnnotateR/tree/master/inst/extdata). LC-MS data for the fucoidan digest are available on MASSive.

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/jasms.5c00093.

  • Detailed information on chemical rules, oligosaccharide synthesis, sample preparation, data acquisition, and data processing including additional tables and plots (PDF)

M.B., M.L., and J.-H.H. designed the study. M.B. and S.S.-S. encoded the tool. C.C.J.F. and L.B. defined the set of chemical rules. M.B., C.C.J.F., S.N., M.S.-J., and C.J.C performed experiments. C.J.C. and P.H.S. provided synthetic standards and MS data. All authors edited and approved the final manuscript.

Open access funded by Max Planck Society.

The authors declare no competing financial interest.

References

  1. Bligh M., Nguyen N., Buck-Wiese H., Vidal-Melgosa S., Hehemann J. H.. Structures and Functions of Algal Glycans Shape Their Capacity to Sequester Carbon in the Ocean. Current Opinion in Chemical Biology. 2022;71:102204. doi: 10.1016/j.cbpa.2022.102204. [DOI] [PubMed] [Google Scholar]
  2. Varki A.. Biological Roles of Glycans. Glycobiology. 2017;27(1):3–49. doi: 10.1093/glycob/cww086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Nevins D. J.. Sugars: Their Origin in Photosynthesis and Subsequent Biological Interconversions. The American Journal of Clinical Nutrition. 1995;61:915S. doi: 10.1093/ajcn/61.4.915S. [DOI] [PubMed] [Google Scholar]
  4. Dell A., Morris H. R.. Glycoprotein Structure Determination by Mass Spectrometry. Science. 2001;291:2351. doi: 10.1126/science.1058890. [DOI] [PubMed] [Google Scholar]
  5. Laine R.. A Calculation of All Possible Oligosaccharide Isomers Both Branched and Linear Yields 1.05 × 1012 Structures for a Reducing Hexasaccharide: The Isomer Barrier to Development of Single-Method Saccharide Sequencing or Synthesis Systems. Glycobiology. 1994;4:759–767. doi: 10.1093/glycob/4.6.759. [DOI] [PubMed] [Google Scholar]
  6. Tabang D. N., Ford M., Li L.. Recent Advances in Mass Spectrometry-Based Glycomic and Glycoproteomic Studies of Pancreatic Diseases. Frontiers in Chemistry. 2021;9:1. doi: 10.3389/fchem.2021.707387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Han L., Costello C. E.. Mass Spectrometry of Glycans. Biochemistry (Moscow) 2013;78(7):710–720. doi: 10.1134/S0006297913070031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Ruhaak L. R., Xu G., Li Q., Goonatilleke E., Lebrilla C. B.. Mass Spectrometry Approaches to Glycomic and Glycoproteomic Analyses. Chem. Rev. 2018;118(17):7886–7930. doi: 10.1021/acs.chemrev.7b00732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Sun Y., Liu Y., Jiang K., Wang C., Wang Z., Huang L.. Electrospray ionization mass spectrometric analysis of κ-carrageenan oligosaccharides obtained by degradation with κ-carrageenase from Pedobacter hainanensis . J. Agric. Food Chem. 2014;62(11):2398–2405. doi: 10.1021/jf500429r. [DOI] [PubMed] [Google Scholar]
  10. Drake R. R., Powers T. W., Jones E. E., Bruner E., Mehta A. S., Angel P. M.. MALDI Mass Spectrometry Imaging of N-Linked Glycans in Cancer Tissues. Adv. Cancer Res. 2017;134:85–116. doi: 10.1016/bs.acr.2016.11.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Buck-Wiese H., Fanuel M., Liebeke M., Le Mai Hoang K., Pardo-Vargas A., Seeberger P. H., Hehemann J. H., Rogniaux H., Jackson G. P., Ropartz D.. Discrimination of β-1,4- and β-1,3-Linkages in Native Oligosaccharides via Charge Transfer Dissociation Mass Spectrometry. J. Am. Soc. Mass Spectrom. 2020;31(6):1249–1259. doi: 10.1021/jasms.0c00087. [DOI] [PubMed] [Google Scholar]
  12. Hofmann J., Hahm H. S., Seeberger P. H., Pagel K.. Identification of carbohydrate anomers using ion mobility-mass spectrometry. Nature. 2015;526(7572):241–244. doi: 10.1038/nature15388. [DOI] [PubMed] [Google Scholar]
  13. Fellenberg M., Behnken H., Nagel T., Wiegandt A., Baerenfaenger M., Meyer B.. Glycan analysis: Scope and limitations of different techniques - A case for integrated use of LC-MS­(/MS) and NMR techniques. Anal Bioanal Chem. 2013;405(23):7291–7305. doi: 10.1007/s00216-013-7164-y. [DOI] [PubMed] [Google Scholar]
  14. Nellenbach K., Kyu A., Guzzetta N., Brown A.. Differential sialic acid content in adult and neonatal fibrinogen mediates differences in clot polymerization dynamics. Blood Adv. 2021;5:5202–5214. doi: 10.1182/bloodadvances.2021004417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Varki, A. ; Cummings, R. D. ; Esko, J. D. ; Stanley, P. ; Hart, G. W. ; Aebi, M. ; Mohnen, D. ; Kinoshita, T. ; Packer, N. H. ; Prestegard, J. H. ; Schnaar, R. L. ; Seeberger, P. H. . Essentials of Glycobiology, 4th ed.; Cold Spring Harbor Laboratory Press: 2022. 10.1101/9781621824213. [DOI] [PubMed] [Google Scholar]
  16. Alocci D., Mariethoz J., Gastaldello A., Gasteiger E., Karlsson N. G., Kolarich D., Packer N. H., Lisacek F.. GlyConnect: Glycoproteomics Goes Visual, Interactive, and Analytical. J. Proteome Res. 2019;18(2):664–677. doi: 10.1021/acs.jproteome.8b00766. [DOI] [PubMed] [Google Scholar]
  17. Hashimoto, K. ; Kanehisa, M. . KEGG GLYCAN for Integrated Analysis of Pathways, Genes, and Glycan Structures. In Handbook of Glycomics; Cummings, R. D. , Pierce, J. M. , Eds.; Academic Press: 2010; pp 197–210. 10.1016/B978-0-12-373600-0.00009-3. [DOI] [Google Scholar]
  18. Akune Y., Abrahams J. L., Zhang H.. et al. Comprehensive Analysis of the N-Glycan Biosynthetic Pathway Using Bioinformatics to Generate UniCorn: A Theoretical N-Glycan Structure Database. Carbohydr. Res. 2016;431:56–63. doi: 10.1016/j.carres.2016.05.012. [DOI] [PubMed] [Google Scholar]
  19. Flynn R. A., Pedram K., Malaker S. A., Batista P. J., Smith B. A. H., Johnson A. G., George B. M., Majzoub K., Villalta P. W., Carette J. E., Bertozzi C. R.. Small RNAs Are Modified with N-Glycans and Displayed on the Surface of Living Cells. Cell. 2021;184(12):3109–3124.e22. doi: 10.1016/j.cell.2021.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ceroni A., Maass K., Geyer H., Geyer R., Dell A., Haslam S. M.. GlycoWorkbench: A Tool for the Computer-Assisted Annotation of Mass Spectra of Glycans. J. Proteome Res. 2008;7(4):1650–1659. doi: 10.1021/pr7008252. [DOI] [PubMed] [Google Scholar]
  21. Maass K., Ranzinger R., Geyer H., Von Der Lieth C. W., Geyer R.. “Glyco-Peakfinder” - De Novo Composition Analysis of Glycoconjugates. Proteomics. 2007;7(24):4435–4444. doi: 10.1002/pmic.200700253. [DOI] [PubMed] [Google Scholar]
  22. Cooper C. A., Gasteiger E., Packer N. H.. GlycoMod-A Software Tool for Determining Glycosylation Compositions from Mass Spectro-Metric Data. Proteomics. 2001;1(2):340–349. doi: 10.1002/1615-9861(200102)1:2<340::AID-PROT340>3.0.CO;2-B. [DOI] [PubMed] [Google Scholar]
  23. Yang Y., Fang Q.. Prediction of Glycopeptide Fragment Mass Spectra by Deep Learning. Nat. Commun. 2024;15(1):2448. doi: 10.1038/s41467-024-46771-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kawahara R., Chernykh A., Alagesan K., Bern M., Cao W., Chalkley R. J., Cheng K., Choo M. S., Edwards N., Goldman R., Hoffmann M., Hu Y., Huang Y., Kim J. Y., Kletter D., Liquet B., Liu M., Mechref Y., Meng B., Neelamegham S., Nguyen-Khuong T., Nilsson J., Pap A., Park G. W., Parker B. L., Pegg C. L., Penninger J. M., Phung T. K., Pioch M., Rapp E., Sakalli E., Sanda M., Schulz B. L., Scott N. E., Sofronov G., Stadlmann J., Vakhrushev S. Y., Woo C. M., Wu H. Y., Yang P., Ying W., Zhang H., Zhang Y., Zhao J., Zaia J., Haslam S. M., Palmisano G., Yoo J. S., Larson G., Khoo K. H., Medzihradszky K. F., Kolarich D., Packer N. H., Thaysen-Andersen M.. Community Evaluation of Glycoproteomics Informatics Solutions Reveals High-Performance Search Strategies for Serum Glycopeptide Analysis. Nat. Methods. 2021;18(11):1304–1316. doi: 10.1038/s41592-021-01309-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Li H., Chiang A. W. T., Lewis N. E.. Artificial Intelligence in the Analysis of Glycosylation Data. Biotechnology Advances. 2022;60:108008. doi: 10.1016/j.biotechadv.2022.108008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Smith C. A., Want E. J., O’Maille G., Abagyan R., Siuzdak G.. XCMS: Processing Mass Spectrometry Data for Metabolite Profiling Using Nonlinear Peak Alignment, Matching, and Identification. Anal. Chem. 2006;78(3):779–787. doi: 10.1021/ac051437y. [DOI] [PubMed] [Google Scholar]
  27. Bemis K. A., Föll M. C., Guo D., Lakkimsetty S. S., Vitek O.. Cardinal v.3: A Versatile Open-Source Software for Mass Spectrometry Imaging Analysis. Nat. Methods. 2023;20(12):1883–1886. doi: 10.1038/s41592-023-02070-z. [DOI] [PubMed] [Google Scholar]
  28. Chambers M. C., MacLean B., Burke R., Amodei D., Ruderman D. L., Neumann S., Gatto L., Fischer B., Pratt B., Egertson J., Hoff K., Kessner D., Tasman N., Shulman N., Frewen B., Baker T. A., Brusniak M. Y., Paulse C., Creasy D., Flashner L., Kani K., Moulding C., Seymour S. L., Nuwaysir L. M., Lefebvre B., Kuhlmann F., Roark J., Rainer P., Detlev S., Hemenway T., Huhmer A., Langridge J., Connolly B., Chadick T., Holly K., Eckels J., Deutsch E. W., Moritz R. L., Katz J. E., Agus D. B., MacCoss M., Tabb D. L., Mallick P.. A Cross-Platform Toolkit for Mass Spectrometry and Proteomics. Nat. Biotechnol. 2012;30(10):918–920. doi: 10.1038/nbt.2377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ushey, K. ; Allaire, J. ; Tang, Y. . Reticulate: Interface to ‘Python’. R package version 1.38.0, https://rstudio.github.io/reticulate/.
  30. McNaught A. D.. Nomenclature of Carbohydrates (IUPAC Recommendations 1996) 1996;68(10):1919–2008. doi: 10.1351/pac199668101919. [DOI] [PubMed] [Google Scholar]
  31. Mölder F., Jablonski K. P., Letcher B., Hall M. B., Tomkins-Tinch C. H., Sochat V., Forster J., Lee S., Twardziok S. O., Kanitz A., Wilm A., Holtgrewe M., Rahmann S., Nahnsen S., Köster J.. Sustainable Data Analysis with Snakemake. F1000Res. 2021;10:33. doi: 10.12688/f1000research.29032.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kuhl C., Tautenhahn R., Böttcher C., Larson T. R., Neumann S.. CAMERA: An Integrated Strategy for Compound Spectra Extraction and Annotation of Liquid Chromatography/Mass Spectrometry Data Sets. Anal. Chem. 2012;84(1):283–289. doi: 10.1021/ac202450g. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Gatto L., Gibb S., Rainer J.. MSnbase, Efficient and Elegant R-Based Processing and Visualization of Raw Mass Spectrometry Data. J. Proteome Res. 2021;20(1):1063–1069. doi: 10.1021/acs.jproteome.0c00313. [DOI] [PubMed] [Google Scholar]
  34. Crawford C. J., Schultz-Johansen M., Luong P., Vidal-Melgosa S., Hehemann J.-H., Seeberger P. H.. Automated Synthesis of Algal Fucoidan Oligosaccharides. J. Am. Chem. Soc. 2024;146(27):18320–18330. doi: 10.1021/jacs.4c02348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Danglad-Flores J., Leichnitz S., Sletten E. T., Abragam Joseph A., Bienert K., Le Mai Hoang K., Seeberger P. H.. Microwave-Assisted Automated Glycan Assembly. J. Am. Chem. Soc. 2021;143(23):8893–8901. doi: 10.1021/jacs.1c03851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Weishaupt M. W., Hahm H. S., Geissner A., Seeberger P. H.. Automated Glycan Assembly of Branched β-(1,3)-Glucans to Identify Antibody Epitopes. Chem. Commun. 2017;53(25):3591–3594. doi: 10.1039/C7CC00520B. [DOI] [PubMed] [Google Scholar]
  37. Krull, J. ; Crawford, C. J. ; Sidhu, C. ; Solanki, V. ; Bligh, M. ; Rößler, L. ; Singh, R. K. ; Huang, G. ; Robb, C. S. ; Teeling, H. ; Seeberger, P. H. ; Schweder, T. ; Hehemann, J.-H. . Polyelectrolyte Mannan from Diatoms Reshapes Sunlit Ocean Microbiome. bioRxiv, 2024. 10.1101/2024.07.03.601839 [DOI]
  38. Schultz-Johansen, M. ; Parnami, J. ; Hellige, I. ; Hehemann, J.-H. . Silica-based purification and desalting of fucoidan. ChemRxiv., 2025. 10.26434/chemrxiv-2025-fc6bq [DOI]
  39. Robb, M. ; Hobbs, J. K. ; Boraston, A. B. . Separation and Visualization of Glycans by Fluorophore-Assisted Carbohydrate Electrophoresis. In Protein-Carbohydrate Interactions. Methods in Molecular Biology; Abbot, D. , Lammerts van Bueren, A. , Eds.; Humana Press: 2017; Vol. 1588, pp 215–221. 10.1007/978-1-4939-6899-2_17. [DOI] [PubMed] [Google Scholar]
  40. Andrade J. P. S., Oliveira C. P., Tovar A. M. F., Mourão P. A. de S., Vilanova E.. A Color-Code for Glycosaminoglycans Identification by Means of Polyacrylamide Gel Electrophoresis Stained with the Cationic Carbocyanine Dye Stains-All. Electrophoresis. 2018;39(4):666–669. doi: 10.1002/elps.201700391. [DOI] [PubMed] [Google Scholar]
  41. Veličković D., Bečejac T., Mamedov S., Sharma K., Ambalavanan N., Alexandrov T., Anderton C. R.. Rapid Automated Annotation and Analysis of N-Glycan Mass Spectrometry Imaging Data Sets Using NGlycDB in METASPACE. Anal. Chem. 2021;93(40):13421–13425. doi: 10.1021/acs.analchem.1c02347. [DOI] [PubMed] [Google Scholar]
  42. Kopplin G., Rokstad A. M., Mélida H., Bulone V., Skjåk-Bræk G., Aachmann F. L.. Structural Characterization of Fucoidan from Laminaria Hyperborea: Assessment of Coagulation and Inflammatory Properties and Their Structure-Function Relationship. ACS Appl. Bio Mater. 2018;1(6):1880–1892. doi: 10.1021/acsabm.8b00436. [DOI] [PubMed] [Google Scholar]
  43. Chen J. L., Lee C., Lu I. C., Chien C. L., Lee Y. T., Hu W. P., Ni C. K.. Theoretical Investigation of Low Detection Sensitivity for Underivatized Carbohydrates in ESI and MALDI. Journal of Mass Spectrometry. 2016;51(12):1180–1186. doi: 10.1002/jms.3889. [DOI] [PubMed] [Google Scholar]
  44. Giera M., Aisporna A., Uritboonthai W., Siuzdak G.. The Hidden Impact of In-Source Fragmentation in Metabolic and Chemical Mass Spectrometry Data Interpretation. Nat. Metab. 2024;6:1647. doi: 10.1038/s42255-024-01076-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Ponce N. M. A., Stortz C. A.. A Comprehensive and Comparative Analysis of the Fucoidan Compositional Data Across the Phaeophyceae. Front Plant Sci. 2020 doi: 10.3389/fpls.2020.556312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Priyan Shanura Fernando I., Kim K. N., Kim D., Jeon Y. J.. Algal Polysaccharides: Potential Bioactive Substances for Cosmeceutical Applications. Crit Rev. Biotechno. 2019;39:99–113. doi: 10.1080/07388551.2018.1503995. [DOI] [PubMed] [Google Scholar]
  47. Luthuli S., Wu S., Cheng Y., Zheng X., Wu M., Tong H.. Therapeutic Effects of Fucoidan: A Review on Recent Studies. Mar Drugs. 2019;17:487. doi: 10.3390/md17090487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Vidal-Melgosa S., Sichert A., Francis T., Bartosik D., Niggemann J., Wichels A., Willats W. G. T., Fuchs B. M., Teeling H., Becher D., Schweder T., Amann R., Hehemann J. H.. Diatom Fucan Polysaccharide Precipitates Carbon during Algal Blooms. Nat. Commun. 2021;12(1):1–13. doi: 10.1038/s41467-021-21009-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Salmeán A. A., Willats W. G. T., Ribeiro S., Andersen T. J., Ellegaard M.. Over 100-Year Preservation and Temporal Fluctuations of Cell Wall Polysaccharides in Marine Sediments. Front Plant Sci. 2022;13:1. doi: 10.3389/fpls.2022.785902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Reyes-Weiss D. S., Bligh M., Rhein-Knudsen N., Hehemann J. H., Liebeke M., Westereng B., Horn S. J.. Application of MALDI-MS for Characterization of Fucoidan Hydrolysates and Screening of Endo-Fucoidanase Activity. Carbohydr. Polym. 2024;340:122317. doi: 10.1016/j.carbpol.2024.122317. [DOI] [PubMed] [Google Scholar]
  51. Kloareg B., Badis Y., Cock J. M., Michel G.. Role and Evolution of the Extracellular Matrix in the Acquisition of Complex Multicellularity in Eukaryotes: A Macroalgal Perspective. Genes (Basel) 2021;12(7):1059. doi: 10.3390/genes12071059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Eger A. M., Marzinelli E. M., Beas-Luna R., Blain C. O., Blamey L. K., Byrnes J. E. K., Carnell P. E., Choi C. G., Hessing-Lewis M., Kim K. Y., Kumagai N. H., Lorda J., Moore P., Nakamura Y., Pérez-Matus A., Pontier O., Smale D., Steinberg P. D., Vergés A.. The Value of Ecosystem Services in Global Marine Kelp Forests. Nat. Commun. 2023;14(1):1894. doi: 10.1038/s41467-023-37385-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Vickers C., Liu F., Abe K., Salama-Alber O., Jenkins M., Springate C. M. K., Burke J. E., Withers S. G., Boraston A. B.. Endo-Fucoidan Hydrolases from Glycoside Hydrolase Family 107 (GH107) Display Structural and Mechanistic Similarities to -L-Fucosidases from GH29. J. Biol. Chem. 2018;293(47):18296–18308. doi: 10.1074/jbc.RA118.005134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Sichert A., Corzett C. H., Schechter M. S., Unfried F., Markert S., Becher D., Fernandez-Guerra A., Liebeke M., Schweder T., Polz M. F., Hehemann J. H.. Verrucomicrobia Use Hundreds of Enzymes to Digest the Algal Polysaccharide Fucoidan. Nat. Microbiol. 2020;5(8):1026–1039. doi: 10.1038/s41564-020-0720-2. [DOI] [PubMed] [Google Scholar]
  55. Zou P., Yang X., Yuan Y., Jing C., Cao J., Wang Y., Zhang L., Zhang C., Li Y.. Purification and Characterization of a Fucoidan from the Brown Algae Macrocystis Pyrifera and the Activity of Enhancing Salt-Stress Tolerance of Wheat Seedlings. Int. J. Biol. Macromol. 2021;180:547–558. doi: 10.1016/j.ijbiomac.2021.03.039. [DOI] [PubMed] [Google Scholar]
  56. Thinh P. D., Menshova R. V., Ermakova S. P., Anastyuk S. D., Ly B. M., Zvyagintseva T. N.. Structural Characteristics and Anticancer Activity of Fucoidan from the Brown Alga Sargassum Mcclurei. Mar Drugs. 2013;11(5):1456. doi: 10.3390/md11051456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Fogarty C. A., Harbison A. M., Dugdale A. R., Fadda E.. How and Why Plants and Human N-Glycans Are Different: Insight from Molecular Dynamics into the “Glycoblocks” Architecture of Complex Carbohydrates. Beilstein Journal of Organic Chemistry. 2020;16:2046–2056. doi: 10.3762/bjoc.16.171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Takeda-Uchimura Y., Ikezaki M., Akama T. O., Ihara Y., Allain F., Nishitsuji K., Uchimura K.. GlcNAc6ST2/CHST4 Is Essential for the Synthesis of R-10G-Reactive Keratan Sulfate/Sulfated N-Acetyllactosamine Oligosaccharides in Mouse Pleural Mesothelium. Molecules. 2024;29(4):764. doi: 10.3390/molecules29040764. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Ushey, K. ; Allaire, J. ; Tang, Y. . Reticulate: Interface to ‘Python’. R package version 1.38.0, https://rstudio.github.io/reticulate/.

Supplementary Materials

js5c00093_si_001.pdf (2.6MB, pdf)

Data Availability Statement

LC-MS data for commercial standards are available on Github (https://github.com/margotbligh/GlycoAnnotateR/tree/master/inst/extdata). LC-MS data for the fucoidan digest are available on MASSive.


Articles from Journal of the American Society for Mass Spectrometry are provided here courtesy of American Chemical Society

RESOURCES