Skip to main content
JACS Au logoLink to JACS Au
. 2022 Dec 2;3(1):4–12. doi: 10.1021/jacsau.2c00477

Worldwide Glycoscience Informatics Infrastructure: The GlySpace Alliance

Frederique Lisacek †,, Michael Tiemeyer , Raja Mazumder §, Kiyoko F Aoki-Kinoshita ∥,*
PMCID: PMC9875223  PMID: 36711080

Abstract

graphic file with name au2c00477_0005.jpg

The GlySpace Alliance was formed in 2018 among the principal investigators of three major glycoscience portals: Glyco@Expasy, GlyCosmos, and GlyGen, representing Europe, Asia, and the United States, respectively. While each of these portals has its unique user interface, the aim is to provide the same basic data set of glycan-related omics data. These portals will be introduced with the aim to enable users to find their target information in the most efficient manner, in particular, in terms of the chemical structures of glycans and their functions.

Keywords: databases, web portals, glycoscience, carbohydrates, bioinformatics

Introduction

History of Glycan Databases

The first carbohydrate repository was developed by the Complex Carbohydrate Research Center (CCRC) at the University of Georgia. It was called the Complex Carbohydrate Structure Database (CCSD) but later known as CarbBank, after the name of the software used to access CCSD.1 The database contained over 40 000 entries upon the end of its development in the late 1990s. The data was made publicly available, however, and taken up by three major databases in the United States, Europe, and Japan: the Consortium for Functional Glycomics (CFG),2 Glycosciences.de,3 and KEGG GLYCAN,4 respectively. While the CFG has now been incorporated into the National Center for Functional Glycomics (NCFG), the data is still available on the original web site. Both Glycosciences.de and KEGG GLYCAN continue to update and make available the glycan data from CarbBank, integrating them with their respective resources. Glycosciences.de focuses on the 3D structures of glycans, extracting carbohydrates from the Protein Data Bank (PDB), while KEGG GLYCAN continues to populate their PATHWAY information with glycan-related genes and diseases.

Many other databases on carbohydrates have also been developed in the meantime. For example, the Bacterial Carbohydrate Structure Database (BCSDB) has combined with their plant and fungal carbohydrate structure database to form the Carbohydrate Structure Database (CSDB).5 The well-known CAZy database of carbohydrate active enzymes also continues to develop with new glycogene families being added continuously.6

Glycan-Centered Web Portals: The GlySpace Alliance

On the other hand, new web portals developed specifically for integrating omics data centered on glycans have also emerged. Glycomics@ExPASy,7 recently renamed Glyco@Expasy, is housed at the well-known Expasy Swiss Bioinformatics Resource Portal and provides user-friendly access to various databases and software tools. GlyCosmos8 was developed in Japan under their database integration program to integrate glycan data across all life science domains using Semantic Web technologies. GlyGen9 was developed under the U.S. NIH Common Fund program to integrate and disseminate carbohydrate and glycoconjugate data. Because of the overlap in these portals, these three projects have agreed to communicate and collaborate as the GlySpace Alliance to ensure the integrity of glycan-based data around the world.

Nomenclature

While comprehensive reviews of various textual and graphical representations of glycans are already available, here we describe the three major representations used across the portals of the GlySpace Alliance.10

SNFG

The Symbol Nomenclature For Glycans (SNFG)11 is the proposed standard for representing glycan structures graphically. Specific symbols are proposed for specific monosaccharides with defined RGB codes for coloring which can be distinguished even if printed in monochrome. The SNFG glycan pages at https://www.ncbi.nlm.nih.gov/glycans/snfg.html give examples of usage, including how glycans should be depicted with substituents and ambiguous linkages. Figure 1 illustrates a complex example of an N-linked glycan with possible attachment of three monosaccharides at various positions.

Figure 1.

Figure 1

Example of an N-linked glycan with various possible attachments of monosaccharides. (a) SNFG representation, drawn using SugarDrawer, where the symbols indicate the following: blue square = N-acetylglucosamine, green circle = mannose, red triangle = fucose. (b) WURCS representation, whereas (c) GlycoCT representation for the same structure. Yellow highlighted text represents the monosaccharide residues, green highlighted text represents the linkage information, and bold text represents the ambiguous linkages.

WURCS

WURCS (version 2)12 is the text representation of glycans used in GlyTouCan.13 It is a linear string format designed to uniquely represent any glycan composition, structure, or fragment. Rules are devised such that when the WURCS representation of a glycan is normalized, the same unique representation is obtained, ensuring that any glycan structure can be represented uniquely using the official WURCS string. Several tools for drawing and converting glycans into WURCS format are available, as listed in Table 1. The latest information can be obtained from the WURCS working group homepage at https://www.wurcs-wg.org/.

Table 1. List of Software Tools for Obtaining WURCS Formatted Strings for Glycans.

tool name description URL
GlycanFormat Converter Web various online tools to convert from glycan text formats such as GlycoCT and IUPAC into WURCS are available https://glyconavi.org/Tools/tool/gfc.php
GlyCosmos Web API user interface for various APIs provided by GlyCosmos, including conversion of WURCS to images in SNFG format https://api.glycosmos.org/
Glycan converters various glycan text format converters used by GlyCosmos, including tools to normalize WURCS, obtain image files, as well as GlyTouCan IDs https://glycosmos.org/glycans/converter
SugarDrawer and GlycanBuilder2 graphic user interfaces at GlyCosmos to search glycans by drawing them on a canvas and querying glycan databases such as GlyTouCan https://glycosmos.org/glycans/graphic

GlycoCT

GlycoCT is the oldest and currently most often used format for representing glycans.14 It is used mainly in Glyco@Expasy and software tools such as GlycanBuilder and GlycoWorkBench,15 which are tools used for analyzing glycomics mass spectra data. GlycoCT is a multiline format for representing glycan structures and compositions. The format is intended to be human readable and easily compressed and includes a canonicalization algorithm to ensure that there is only a single representation for a glycan structure. Software libraries such as glypy16 for Python users are available, and many of the software tools in Table 1 also support export into GlycoCT format.

Glyco@Expasy

Glyco@Expasy centralizes references to strictly web-based glycoinformatics resources, whether hosted or not on Expasy. The aim of this portal is to make glycobiology more accessible to scientists in both the glycosciences and the protein sciences. It strongly advocates for presenting glycans as mediators of protein–protein interactions. The major sections of Glyco@Expasy will be described next.

Main Portal

The landing page displays a series of clickable, colored circles (bubbles) representing web portals (red), tools (green), and databases (yellow) in the glycoscience domains (Figure 2). To help users select an appropriate database or software tool, the collected resources are categorized into a hierarchy representing the different fields of glycobiology, such as glycans, glycoconjugates, glycan binding, etc. The checkbox for each term triggers zooming in the bubble chart to focus on the theme of interest. Users can zoom out by clicking outside the circle in focus. Zooming in is also possible by clicking on a circle of interest. All zooming operations systematically check or uncheck boxes in the hierarchy of terms on the left. Information regarding the resource(s) in a selected circle is displayed in the middle of the page, along with a short description, a link to the resource, and a link to the related publication. A second tab displays an interactive dependency wheel with the same color scheme to highlight relationships between resources. Glyco@Expasy strives to update information at least yearly, as new web-based glycoinformatics resources are being released or old ones outdated.

Figure 2.

Figure 2

Screenshots on the left featuring successively zoomed in information (zoom enabled by mouse click on bubble) ending on the GlyTouCan database as an example of use of the Glyco@Expasy thematic classification. Screenshots on the bottom right show the effect of mousing over the dependency wheel with the example of GlyTouCan. Color code for resources is shown in the upper right corner.

Glycans and Glycomes

This section includes carbohydrate enzymes (CAZymes), gene expression, and glycan structures in 2D and 3D forms.

CAZymes

At the time of this writing, the CAZyme section includes the CAZy and KEGG Pathway databases as well as several tools that make use of such glycogene information, such as to predict glycan structures or to digest glycans.

Two-Dimensional Structures

The section on 2D structures contains several subgroups based on interest: mass spectrometry/liquid chromatography (MS/LC), structure databases, 2D drawing tools, and utility tools. MS/LC includes the databases UniCarb-DB17 and GlycoStore,18 which are databases known to store such information for glycans, and tools such as Glycoforest,19 GlycoWorkbench, and GRITS.20 Utility tools itemize in-house integrative tools that ease navigation across resources, such as substructure search.21

Three-Dimensional Structures

The 3D structure section focuses on the spatial conformations of glycans and includes two subsections: 3D viewers/predictors and NMR. The former further contains links to tools for predicting and displaying glycans in 3D, including SweetUnityMol,22 GLYCAN-web, and LiteMol viewer.23 Under NMR is the CASPER tool for assigning chemical shifts to NMR data.24 The GAG-DB is also available under this section as a resource for accessing the 3D structures of glycosaminoglycans.25

Glycan Binding

The glycan binding section consists of three subsections: glycan epitopes, glycan-binding proteins, and glycan–protein interactions. Glycan epitopes include the GlycoEpitope database26 and two tools: Glydin27 and GLAD.28 Glydin is an online tool for investigating various known carbohydrate epitopes accumulated from various sources including the literature and the GlycoEpitope database. GLAD is an online tool to investigate glycan microarray experiments and the glycan binding epitopes that can be obtained as a result.

The glycan-binding proteins subsection consists of two databases: the Database of Anti-Glycan Reagents (DAGR)29 and UniLectin,30 a curated and predicted lectin database based on a classification of protein folds. On the other hand, the glycan–protein interactions section consists of three databases: MatrixDB,31 SugarBindDB,32 and ViralZone.33 MatrixDB is a database of interactions in the extracellular matrix, much of which involves proteoglycans. SugarBindDB is a database of glycan and pathogen interactions, whereas ViralZone is a database of virus information in general, provided by Expasy.

Glycoconjugates

In Glyco@Expasy, the glycoconjugates section is divided into glycolipids, proteins, and proteome, glycoproteins (including intact glycopeptides), and glycosites. Without going into too much detail, the glycolipids, protein, and proteome sections consist of databases, whereas intact glycopeptides and glycosites include various tools for analyzing such data. Other than intact glycopeptides, glycoproteins also include two databases GlyGen and GlyConnect,34 the latter of which is the main curated glycoprotein database that is shared across the GlySpace Alliance.

GlyCosmos

GlyCosmos is the official portal of the Japanese Society for Carbohydrate Research (JSCR), but its scope is focused on integrating omics data centered on glycans at a worldwide scale. Semantic Web technologies are used for integration and supplementation of semantics to the data directly, such that not only users but also researchers can learn and glean new knowledge from the data integrated across domains. At the time of this writing, version 3.0 provides four sections: repositories, data resources, tools, and standards.

Repositories

The repositories section lists the various data repositories whereby users can submit their data. This includes glycan structures for GlyTouCan,13 glycomics mass spectrometry data for GlycoPOST35 (raw data) and UniCarb-DR36 (glycan structures), and GlyComb for glycoconjugate data (currently only glycopeptides).

Data Resources

The data resources section is organized based on the type of data being targeted: glycogenes, glycoproteins, lectins, glycolipids, glycans, pathways, diseases, and organisms. Each data resource list has a header indicating from what databases/resources the data has been integrated and the date when it was obtained. Filtering options are available on the left, where the different columns to be displayed can be selected and keywords can be entered to search for specific terms in different fields. AND and OR search can be chosen to either search for entries that satisfy all criteria or any of the entered criteria, respectively. The headers of the table in the center panel can be clicked to sort in decreasing or increasing order. Here, we describe the content that is common across the GlySpace Alliance, namely, the Data Resources on glycogenes, glycoproteins, and glycans will be described.

Glycogenes

The glycogenes section is an integrated list of glycan-related genes, including glycosyltransferases and hydrolases, accumulated from KEGG BRITE,37 the Glycogene Database (GGDB),38 and others. Each entry has its own detailed page identified by gene ID. This page contains the translated proteins, details about the KEGG BRITE annotations, details about any reactions in which it is involved, known diseases, and links to ChIP-Atlas39 and LIPID MAPS,40 where available.

Glycoproteins

The glycoproteins section lists the glycoproteins accumulated from UniProt,41 GlyConnect,34 and GlyGen.9 Each entry detail page has several sections listed in the contents on the upper right: a summary, annotations from UniProt, sequence information highlighting the N-glycosylation sequon, a viewer to browse the glycosylation features of the sequence, related pathways, expression information displayed graphically based on the Human Protein Atlas,42 disease information, 3D structures from the Protein Data Bank (PDB),43 and references from the literature. If the entry is a lectin (glycan-binding protein) then lectin information is also displayed, for example, from UniLectin,30 which provides 3D data, the Lectin frontier Database (LfDB),44 from which the top three binding glycans based on frontal affinity chromatography are listed, and MCAW-DB,45 which displays the glycans that are recognized by the given lectin as a glycan profile.

Glycans

Glycan data are automatically retrieved from GlyTouCan on a weekly basis. They are then supplemented with additional metainformation such as IUPAC Condensed46 and GlycoCT representations, monoisotopic mass, and subsumption level, indicating its level of structural detail. The entry detail page additionally displays the 3D atomic structure of the glycan, where possible (all atomic information needs to be known), along with external links to other relevant databases. If the glycan is annotated in GGDB as a substrate or product of one of its genes, the corresponding data is displayed on this page. Similarly, if it is a glycan that has been analyzed using LC/MS in GALAXY, a list of the degrading enzymes and other similar glycans in GALAXY are displayed. Core protein information from GlyGen and GlycoProtDB47 as well as epitope information from GlycoEpitope26 and LfDB and tissue expression as annotated by GlycomeAtlas48 are also listed.

Tools

Many users have needs to more easily analyze glycan-related information, and the GlySpace Alliance member portals provide various tools to aid users to do so. In GlyCosmos, GlycoMaple49 and the drawing tools SugarDrawer50 and GlycanBuilder in addition to the various glycan format conversion tools described in the Nomenclature section are available.

GlycoMaple

GlycoMaple is a web tool to aid users in understanding the complex relationships between glycogenes and glycans. All of the major biosynthetic pathways of various glycans are provided, as shown in Figure 3. Gene expression data from the Human Protein Atlas can be selected from the HPA(cell) and HPA(tissue) pull-down menus to select from cell lines and tissues, respectively. Users can also upload their own data by choosing their file in the upper left. The expression values will be displayed according the expression value ranges selected in the panel above the pathway diagram. All of the pathways will be updated to reflect the expression data, which allows users to assess the relationships between various glycan types affected by the same gene expression data.

Figure 3.

Figure 3

Screenshot of the GlycoMaple tool, which allows users to analyze glycogene expression on top of the various glycan biosynthesis pathways known.

Standards

The standards section simply provides links to the standard ontologies and nomenclature used in GlyCosmos. These were mainly described earlier in the Nomenclature section.

GlyGen

GlyGen was initially developed by CCRC (University of Georgia, USA) and George Washington University collaborators with the support of the NIH Glycoscience Common Fund.9 It facilitates the exploration of data related to glycans, proteins, and glycoproteins in Homo sapiens, Mus musculus, Rattus norvegicus, Hepatitis C viruses, SARS-CoV-2, and SARS-related coronavirus. Data is accessible through an intuitive web portal (Figure 4; https://www.glygen.org) or as resource-specific (CSV, RDF) format files (https://data.glygen.org). Glygen also provides a number of tools to explore these data sets.

Figure 4.

Figure 4

GlyGen home page provides users with search interfaces that access glycan, protein, glycosylation site, glycan motif, structural subsumption relationship, and glycan composition data. Access to data files, APIs, and a SPARQL end point are also provided for more advanced users.

Glycans

Glycan data can be searched using simple, advanced, composition, structure, and substructure searches, each providing different routes to search data at various levels of expertise or granularity. The results of any of these searches lead users to a glycan details page, which lists a variety of useful information, including GlyTouCan accession number, mass values, composition string and glycan type, organisms in which the glycan is found, any names given to the glycan, motif information, list of associated proteins, glycan binding proteins, enzymes involved in its biosynthesis, related glycans in terms of ambiguity (often used by mass spectrometrists when analyzing glycomics data), tissues and cell lines in which the glycan is expressed, and publication information.

Glycoproteins

Similar to glycans, glycoproteins can also be searched for by various categories, including disease, gene, glycan, organism, pathway, or protein name/ID. The resulting protein details page has a plethora of information gathered from various resources including relevant cross-links; detailed glycan-related information is also presented, including site-specific glycan modification at the composition and structural level. This section also divides the information into those that have been reported in the literature (with and without glycan data), those that have been predicted, and those that have been mined from the literature. Phosphorylation and glycation information is also provided when available.

Tools

The tools section of GlyGen provides a variety of ways to explore the data in GlyGen. ID mapping, glycan motifs, BLAST, Sand Box, and Structure Browser. These tools allow users to search the glycans and proteins in GlyGen based on different parameters. For example, glycans containing certain motifs such as sialyl-Lewis X can be searched via the glycan motifs, or glycans that are known to be involved in specific biosynthetic pathways can be searched using the Sand Box.

Integration of Glycan and Glycosylation Data with Other Data Types

GlyGen contributes its data and its experience in the glycoinformatics domain to recently expanding efforts that are integrating large data sets funded by the NIH, such as GTEx, HuBMAP, HMP, LINCS, SPARC, IDG, Metabolomics Workbench, and others (https://commonfund.nih.gov/dataecosystem). Under the shared umbrella of the Common Fund Data Ecosystem (CFDE), these groups provide access to harmonized data sets that include gene expression, disease phenotypes, druggable targets, metabolomics, microbiome dynamics, cell signatures, anatomic correlates of disease processes, and others to allow investigators to ask cross-cutting questions in novel ways. A key component of this effort is the development and adoption of shared metadata models and well-developed ontologies to ensure that all data types are findable, accessible, interoperable, and reusable (FAIR). The CFDE data portal (https://nih-cfde.org) juxtaposes glycoscience data with other data types, increasing the likelihood that users currently unaware of the importance of glycans and glycosylation may find themselves exploring the resources provided by GlyGen and, through our interconnections, the complementary resources available through all GlySpace Alliance members.

Summary

In summary, the members of the GlySpace Alliance have developed their respective resources from various perspectives, as described in more detail in ref (51). In short, Glyco@Expasy aims to provide data from the user’s perspective along the lines of Expasy’s approach, while GlyCosmos attempts to provide extensive coverage of glycan-related omics data, and GlyGen focuses on integration with genes, proteins, and other data types. Depending on the user, one portal may be easier to use or more appropriate for their research domain. The Alliance members meet periodically to discuss not only the sharing of their resources and data integrity but also the means to make their resources more easily accessible to users.

We make note that all GlySpace Alliance members are continuously working on improving their resources to adhere to the FAIR principles. All data is freely available under CC-BY-4.0 license. Users are free to share and adapt, transform, and build upon the material for any purpose, including commercially as long as they provide attribution. Moreover, all members of the Alliance use well-known standards and ontologies and provide APIs and SPARQL end points as additional access points, ensuring that the data is findable, accessible, interoperable, and reusable.52 The homepage at http://glyspace.org is updated to reflect this, and user feedback is always welcome to improve these resources.

As these resources continue to develop and expand the integration with other omics resources, we envision that users would be able to access the GlySpace Alliance homepage to easily navigate the plethora of glyco-related resources currently available. Whether the user is a student new to the field or a veteran in an orthogonal field to glycobiology, they should be able to easily find the data that they are targeting. This data should be annotated with provenance and references to the literature along with educational resources to learn more about the data of interest. Conversely, these data should become better integrated with major genomics, proteomics, lipidomics, and metabolomics databases, such that the importance of glycosylation can be understood from various contexts. In addition, model organism databases could have improved integrations with the functional aspects of glycosylation via the GlySpace Alliance. As a whole, we hope to introduce the concept of an ”Expanded Central Dogma” including glycosylation as a standard to enlighten the life sciences.

Acknowledgments

GlyCosmos was supported by JST NBDC Grant Number JPMJND2204, Glyco@ExPASy was supported by the Swiss Federal Government through the State Secretariat for Education, Research and Innovation and the Swiss National Science Foundation (grant number 31003A_179249), and GlyGen is supported by the National Institutes of Health (R24GM146616 and CFDE Project Number 1OT2OD032092).

The authors declare no competing financial interest.

References

  1. Doubet S.; Bock K.; Smith D.; Darvill A.; Albersheim P. The Complex Carbohydrate Structure Database. Trends Biochem. Sci. 1989, 14, 475–477. 10.1016/0968-0004(89)90175-8. [DOI] [PubMed] [Google Scholar]
  2. Raman R.; Venkataraman M.; Ramakrishnan S.; Lang W.; Raguram S.; Sasisekharan R. Advancing glycomics: implementation strategies at the consortium for functional glycomics. Glycobiology 2006, 16, 82R–90R. 10.1093/glycob/cwj080. [DOI] [PubMed] [Google Scholar]
  3. Lütteke T.A Practical Guide to Using Glycomics Databases; Springer Japan: Tokyo, 2017; pp 335–350. [Google Scholar]
  4. Kanehisa M.A Practical Guide to Using Glycomics Databases; Springer Japan: Tokyo, 2017; pp 177–193. [Google Scholar]
  5. Toukach P. V.; Egorova K. S. Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts. Nucleic acids research 2016, 44, D1229–D1236. 10.1093/nar/gkv840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Drula E.; Garron M.-L.; Dogan S.; Lombard V.; Henrissat B.; Terrapon N. The carbohydrate-active enzyme database: functions and literature. Nucleic Acids Res. 2022, 50, D571–D577. 10.1093/nar/gkab1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Mariethoz J.; Alocci D.; Gastaldello A.; Horlacher O.; Gasteiger E.; Rojas-Macias M.; Karlsson N. G.; Packer N. H.; Lisacek F. Glycomics@ExPASy: Bridging the gap. Mol. Cell. Proteomics 2018, 17, 2164. 10.1074/mcp.RA118.000799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Yamada I.; Shiota M.; Shinmachi D.; Ono T.; Tsuchiya S.; Hosoda M.; Fujita A.; Aoki N. P.; Watanabe Y.; Fujita N.; Angata K.; Kaji H.; Narimatsu H.; Okuda S.; Aoki-Kinoshita K. F. The GlyCosmos Portal: a unified and comprehensive web resource for the glycosciences. Nat. Methods 2020, 17, 649–650. 10.1038/s41592-020-0879-8. [DOI] [PubMed] [Google Scholar]
  9. York W. S; Mazumder R.; Ranzinger R.; Edwards N.; Kahsay R.; Aoki-Kinoshita K. F; Campbell M. P; Cummings R. D; Feizi T.; Martin M. GlyGen: Computational and Informatics Resources for Glycoscience. Glycobiology 2020, 30, 72. 10.1093/glycob/cwz080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Aoki-Kinoshita K.; Lisacek F.; Mazumder R.; York W.; Packer N. The GlySpace Alliance: Toward a collaborative global glycoinformatics community. Glycobiology 2020, 30, 70. 10.1093/glycob/cwz078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Neelamegham S.; Aoki-Kinoshita K.; Bolton E.; Frank M.; Lisacek F.; Lütteke T.; O’Boyle N.; Packer N.; Stanley P.; Toukach P. Updates to the Symbol Nomenclature for Glycans guidelines. Glycobiology 2019, 29, 620. 10.1093/glycob/cwz045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Matsubara M.; Aoki-Kinoshita K.; Aoki N.; Yamada I.; Narimatsu H. WURCS 2.0 Update to Encapsulate Ambiguous Carbohydrate Structures. J. Chem. Inf. Model. 2017, 57, 632. 10.1021/acs.jcim.6b00650. [DOI] [PubMed] [Google Scholar]
  13. Fujita A.; Aoki N. P.; Shinmachi D.; Matsubara M.; Tsuchiya S.; Shiota M.; Ono T.; Yamada I.; Aoki-Kinoshita K. F. The international glycan repository GlyTouCan version 3.0. Nucleic acids research 2021, 49, D1529–D1533. 10.1093/nar/gkaa947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Herget S.; Ranzinger R.; Maass K.; von der Lieth C.-W. GlycoCT – a unifying sequence format for carbohydrates. Carbohydr. Res. 2008, 343, 2162–2171. 10.1016/j.carres.2008.03.011. [DOI] [PubMed] [Google Scholar]
  15. Damerell D.; Ceroni A.; Maass K.; Ranzinger R.; Dell A.; Haslam S. M. The GlycanBuilder and GlycoWorkbench glycoinformatics tools: Updates and new developments. Biological Chemistry 2012, 393, 1357–1362. 10.1515/hsz-2012-0135. [DOI] [PubMed] [Google Scholar]
  16. Klein J.; Zaia J. glypy: An Open Source Glycoinformatics Library. J. Proteome Res. 2019, 18, 3532–3537. 10.1021/acs.jproteome.9b00367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hayes C. A.; Karlsson N. G.; Struwe W. B.; Lisacek F.; Rudd P. M.; Packer N. H.; Campbell M. P. UniCarb-DB: a database resource for glycomic discovery. Bioinformatics 2011, 27, 1343–1344. 10.1093/bioinformatics/btr137. [DOI] [PubMed] [Google Scholar]
  18. Zhao S.; Walsh I.; Abrahams J. L.; Royle L.; Nguyen-Khuong T.; Spencer D.; Fernandes D. L.; Packer N. H.; Rudd P. M.; Campbell M. P. GlycoStore: a database of retention properties for glycan analysis. Bioinformatics 2018, 34, 3231–3232. 10.1093/bioinformatics/bty319. [DOI] [PubMed] [Google Scholar]
  19. Horlacher O.; Jin C.; Alocci D.; Mariethoz J.; Müller M.; Karlsson N. G.; Lisacek F. Glycoforest 1.0. Anal. Chem. 2017, 89, 10932–10940. 10.1021/acs.analchem.7b02754. [DOI] [PubMed] [Google Scholar]
  20. Weatherly B.; Arpinar S.; Porterfield M.; Tiemeyer M.; York W. S.; Ranzinger R. GRITS Toolbox-a freely available software for processing, annotating and archiving glycomics mass spectrometry data. Glycobiology 2019, 29, 452–460. 10.1093/glycob/cwz023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Daponte V.; Hayes C.; Mariethoz J.; Lisacek F. Dealing with the Ambiguity of Glycan Substructure Search. Molecules 2022, 27, 65. 10.3390/molecules27010065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Perez S.; Tubiana T.; Imberty A.; Baaden M. Three-dimensional representations of complex carbohydrates and polysaccharides–SweetUnityMol: a video game-based computer graphic software. Glycobiology 2015, 25, 483–91. 10.1093/glycob/cwu133. [DOI] [PubMed] [Google Scholar]
  23. Sehnal D.; Svobodova R.; Berka K.; Pravda L.; Midlik A.; Koca J.. Visualization and Analysis of Protein Structures with LiteMol Suite. Structural Bioinformatics; Gáspári Z., Ed.; Methods in Molecular Biology; Springer US: New York, 2020; Vol. 2112; pp 1–13. [DOI] [PubMed] [Google Scholar]
  24. Furevi A.; Ruda A.; Angles d’Ortoli T.; Mobarak H.; Stahle J.; Hamark C.; Fontana C.; Engstrom O.; Apostolica P.; Widmalm G. Complete 1H and 13C NMR chemical shift assignments of mono-to tetrasaccharides as basis for NMR chemical shift predictions of oligo-and polysaccharides using the computer program CASPER. Carbohydr. Res. 2022, 513, 108528. 10.1016/j.carres.2022.108528. [DOI] [PubMed] [Google Scholar]
  25. Perez S.; Bonnardel F.; Lisacek F.; Imberty A.; Ricard Blum S.; Makshakova O. GAG-DB, the New Interface of the Three-Dimensional Landscape of Glycosaminoglycans. Biomolecules 2020, 10, 1660. 10.3390/biom10121660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kawasaki T.; Nakao H.; Takahashi E.; Tominaga T. GlycoEpitope: the integrated Database of Carbohydrate Antigens and Antibodies. Japan. Trend. Glycosci. and Glycotechnol. 2006, 18, 267–272. 10.4052/tigg.18.267. [DOI] [Google Scholar]
  27. Alocci D.; Ghraichy M.; Barletta E.; Gastaldello A.; Mariethoz J.; Lisacek F. Understanding the glycome: an interactive view of glycosylation from glycocompositions to glycoepitopes. Glycobiology 2018, 28, 349–362. 10.1093/glycob/cwy019. [DOI] [PubMed] [Google Scholar]
  28. Mehta A. Y.; Cummings R. D. GLAD: GLycan Array Dashboard, a visual analytics tool for glycan microarrays. Bioinformatics 2019, 35, 3536–3537. 10.1093/bioinformatics/btz075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Sterner E.; Flanagan N.; Gildersleeve J. C. Perspectives on Anti-Glycan Antibodies Gleaned from Development of a Community Resource Database. ACS Chem. Biol. 2016, 11, 1773–1783. 10.1021/acschembio.6b00244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Imberty A.; Bonnardel F.; Lisacek F. UniLectin, A One-Stop-Shop to Explore and Study Carbohydrate-Binding Proteins. Curr. Protoc. 2021, 1, e305 10.1002/cpz1.305. [DOI] [PubMed] [Google Scholar]
  31. Clerc O.; Deniaud M.; Vallet S. D.; Naba A.; Rivet A.; Perez S.; Thierry-Mieg N.; Ricard-Blum S. MatrixDB: Integration of new data with a focus on glycosaminoglycan interactions. Nucleic Acids Res. 2019, 47, D376–D381. 10.1093/nar/gky1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mariethoz J.; Khatib K.; Mannic T.; Alocci D.; Campbell M. P.; Packer N. H.; Mullen E. H.; Lisacek F.. A Practical Guide to Using Glycomics Databases; Springer Japan: Tokyo, 2017; pp 247–260. [Google Scholar]
  33. Masson P.; Hulo C.; De Castro E.; Bitter H.; Gruenbaum L.; Essioux L.; Bougueleret L.; Xenarios I.; Le Mercier P. ViralZone: recent updates to the virus knowledge resource. Nucleic Acids Res. 2012, 41, D579–D583. 10.1093/nar/gks1220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Alocci D.; Mariethoz J.; Gastaldello A.; Gasteiger E.; Karlsson N. G.; Kolarich D.; Packer N. H.; Lisacek F. GlyConnect: Glycoproteomics Goes Visual, Interactive, and Analytical. J. Proteome Res. 2019, 18, 664–677. 10.1021/acs.jproteome.8b00766. [DOI] [PubMed] [Google Scholar]
  35. Watanabe Y.; Aoki-Kinoshita K. F.; Ishihama Y.; Okuda S. GlycoPOST realizes FAIR principles for glycomics mass spectrometry data. Nucleic Acids Res. 2021, 49, D1523–D1528. 10.1093/nar/gkaa1012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Rojas-Macias M. A.; Mariethoz J.; Andersson P.; Jin C.; Venkatakrishnan V.; Aoki N. P.; Shinmachi D.; Ashwood C.; Madunic K.; Zhang T.; et al. Towards a standardized bioinformatics infrastructure for N- and O-glycomics. Nat. Commun. 2019, 10, 1–10. 10.1038/s41467-019-11131-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kanehisa M.; Sato Y.; Furumichi M.; Morishima K.; Tanabe M. New approach for understanding genome variations in KEGG. Nucleic Acids Res. 2019, 47, D590–D595. 10.1093/nar/gky962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Narimatsu H.; Suzuki Y.; Aoki-Kinoshita K. F.; Fujita N.; Sawaki H.; Shikanai T.; Sato T.; Togayachi A.; Yoko-o T.; Angata K.; Kubota T.; Noro E.. A Practical Guide to Using Glycomics Databases; Springer Japan: Tokyo, 2017; pp 163–175. [Google Scholar]
  39. Zou Z.; Ohta T.; Miura F.; Oki S. ChIP-Atlas 2021 update: a data-mining suite for exploring epigenomic landscapes by fully integrating ChIP-seq, ATAC-seq and Bisulfite-seq data. Nucleic Acids Res. 2022, 50, W175–W182. 10.1093/nar/gkac199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Sud M.; Fahy E.; Cotter D.; Brown A.; Dennis E. A.; Glass C. K.; Merrill A. H. Jr; Murphy R. C.; Raetz C. R.; Russell D. W.; Subramaniam S. LMSD: LIPID MAPS structure database. Nucleic Acids Res. 2007, 35, D527–D532. 10.1093/nar/gkl838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. The UniProt Consortium UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 2019, 47, D506–D515. 10.1093/nar/gky1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Karlsson M.; Zhang C.; Mear L.; Zhong W.; Digre A.; Katona B.; Sjostedt E.; Butler L.; Odeberg J.; Dusart P.; et al. A single–cell type transcriptomics map of human tissues. Sci. Adv. 2021, 7, eabh2169 10.1126/sciadv.abh2169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Feng Z.; Westbrook J. D.; Sala R.; Smart O. S.; Bricogne G.; Matsubara M.; Yamada I.; Tsuchiya S.; Aoki-Kinoshita K. F.; Hoch J. C.; Kurisu G.; Velankar S.; Burley S. K.; Young J. Y. Enhanced validation of small-molecule ligands and carbohydrates in the Protein Data Bank. Structure 2021, 29, 393–400. 10.1016/j.str.2021.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hirabayashi J.; Tateno H.; Shikanai T.; Aoki-Kinoshita K.; Narimatsu H. The lectin frontier database (LfDB), and data generation based on frontal affinity chromatography. Molecules 2015, 20, 951–973. 10.3390/molecules20010951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Hosoda M.; Takahashi Y.; Shiota M.; Shinmachi D.; Inomoto R.; Higashimoto S.; Aoki-Kinoshita K. F. MCAW-DB: A glycan profile database capturing the ambiguity of glycan recognition patterns. Carbohydr. Res. 2018, 464, 44–56. 10.1016/j.carres.2018.05.003. [DOI] [PubMed] [Google Scholar]
  46. McNaught A. D. International Union of Pure and Applied Chemistry and International Union of Biochemistry and Molecular Biology. Joint Commission on Biochemical Nomenclature. Nomenclature of carbohydrates. Carbohydr. Res. 1997, 297, 1–92. 10.1016/S0008-6215(97)83449-0. [DOI] [PubMed] [Google Scholar]
  47. Kaji H.; Shikanai T.; Sasaki-Sawa A.; Wen H.; Fujita M.; Suzuki Y.; Sugahara D.; Sawaki H.; Yamauchi Y.; Shinkawa T.; Taoka M.; Takahashi N.; Isobe T.; Narimatsu H. Large-scale identification of N-glycosylated proteins of mouse tissues and construction of a glycoprotein database, GlycoProtDB. J. Proteome Res. 2012, 11, 4553–66. 10.1021/pr300346c. [DOI] [PubMed] [Google Scholar]
  48. Konishi Y.; Aoki-Kinoshita K. The GlycomeAtlas tool for visualizing and querying glycome data. Bioinformatics 2012, 28, 2849–2850. 10.1093/bioinformatics/bts516. [DOI] [PubMed] [Google Scholar]
  49. Huang Y. F.; Aoki K.; Akase S.; Ishihara M.; Liu Y. S.; Yang G.; Kizuka Y.; Mizumoto S.; Tiemeyer M.; Gao X. D.; Aoki-Kinoshita K. F.; Fujita M. Global mapping of glycosylation pathways in human-derived cells. Developmental Cell 2021, 56, 1195–1209. 10.1016/j.devcel.2021.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tsuchiya S.; Matsubara M.; Aoki-Kinoshita K. F.; Yamada I. SugarDrawer: A Web-Based Database Search Tool with Editing Glycan Structures. Molecules 2021, 26, 7149. 10.3390/molecules26237149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lisacek F.; Aoki-Kinoshita K. F.; Vora J. K.; Mazumder R.; Tiemeyer M. In Comprehensive Glycoscience, 2nd ed.; Barchi J. J., Ed.; Elsevier: Oxford, 2021; pp 507–521. [Google Scholar]
  52. Wilkinson M. D.; Dumontier M.; Aalbersberg I. J.; Appleton G.; Axton M.; Baak A.; Blomberg N.; Boiten J.-W.; da Silva Santos L. B.; Bourne P. E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from JACS Au are provided here courtesy of American Chemical Society

RESOURCES