Abstract
Glycans are ubiquitous in biology, but their complex structure and biosynthesis have challenged research into their wide-ranging roles. Here, the authors comment on current trends in the role of chemical methodologies in the field of glycobiology.
At times in which science has a key role in society, we would do well to remember that interdisciplinarity is a vital ingredient of scientific progress. Science across boundaries has always played a key role in the glycosciences. Chemical methodologies in particular have been indispensable ever since the 1920s, when Avery generated antigenic carbohydrate-protein conjugates using diazonium salts of bacterial glycans1.
A considerable proportion of all proteins are glycosylated2, and carbohydrates (glycans) decorate the surface of every living cell. The chemical structure of monosaccharides allows glycosidic bonds to vary in their regio- and stereoisomerism. With the added complexity of branching, glycans display a great diversity of shape and thereby function. It is often surprising to those new to the field how much influence glycans can have in biological systems. For instance, many viruses, including HIV and the coronaviruses responsible for ongoing and past outbreaks, “shield” their surface proteins heavily by glycosylation, thereby influencing their vulnerability to the immune system3,4. Therefore, target glycosylation has to be a key consideration in vaccine development. There are countless other examples of the importance of glycans in many of the most pressing topics in biology from cancer to immunology, for which we direct the reader to more comprehensive resources to gain further insight2. Despite their now clear relevance in many fields in biomedicine, glycans have often been overlooked relative to the other major biopolymers, due to technical roadblocks unique to the glycosciences. These are summarised below.
Challenges in glycobiology
Glycans are not directly encoded in the genome; they are secondary gene products, built through the combinatorial interplay of glycosyltransferases (GTs) and glycosidases. The repertoire of these enzymes determines the glycome: the identity of glycan structures, free or as part of glycoconjugates, in a given cell/tissue/organism at a given point in time. Particular predictions about the glycome of a cell can be made by following biosynthetic principles and enzyme specificities. This is especially important in bacteria that have a limited set of carbohydrate-active enzymes which can serve as predictors of glycan structure based on genome mining5. In eukaryotes, biosynthetic considerations are generally complicated by a much larger enzyme repertoire that displays redundancy and functional compensation. The dynamics of processive glycosylation within the secretory pathway are thus poorly understood.
Glycans cannot easily be mutagenised. Biosynthetic processes of glycans are far more dynamic than those of nucleic acids and proteins, which follow a largely linear information flow of primary sequence. While the well-established and routine editing methods for the latter biopolymers have been key to the great leaps in our understanding of them, swapping one monosaccharide for another in the biosynthesis of a native glycan is often all but impossible. This makes many of the experimental paradigms commonly used to determine the function of proteins and nucleic acids infeasible, necessitating alternative approaches to the study of glycans.
Glycan sequencing is a longstanding bottleneck. Mass spectrometry (MS) is the most widely used method, but is complicated by biological (e.g. microheterogeneity), technical (e.g. difficulty in enrichment), and physical (e.g. ionizability) challenges. Other methods such as nuclear magnetic resonance (NMR) spectroscopy are powerful but typically require larger sample amounts. As the glycans produced by a cell can differ greatly based on cell type, environmental cues, and disease states such as cancer2, this difficulty in measuring the state of the glycome is a major hindrance.
The provision of amounts of glycans sufficient for their study is often difficult. While certain glycan structures can be isolated from natural sources in high purity and abundance, this is not the case for most structures of biological importance. The complex, branching structure of many glycans also makes them highly challenging to synthesise, due to the exquisite stereo- and chemoselectivity required, further impeding progress on their investigation (Fig. 1a).
Finally, the identification of glycan-binding partners is challenging. While long polysaccharides can be immobilised for plate-based assays, this is not the case for shorter oligosaccharides6. Glycans are thus often refractory to standard immunological techniques such as enzyme-linked immunosorbent assays (ELISAs). This often makes it difficult to assign specific biological functions to individual glycan structures in complex biological environments.
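As a rough illustration of the regio- and stereochemical diversity that underlies several of these challenges, the following minimal sketch enumerates the glycosidic linkages possible between just two glucose units. It assumes a D-glucopyranose donor linked through its anomeric carbon to a free hydroxyl of a D-glucopyranose acceptor, and it deliberately leaves out 1-1 linked (non-reducing) disaccharides, other ring forms and branching. The function and variable names are illustrative only and are not taken from any glycoinformatics library.

```python
from itertools import product

# Assumption: a D-glucopyranose donor is linked through its anomeric carbon (C1)
# to a free hydroxyl of a D-glucopyranose acceptor whose own anomeric centre
# stays free (reducing disaccharides only; 1-1 linked trehaloses are excluded).
ANOMERS = ("alpha", "beta")        # stereochemistry at the donor C1
ACCEPTOR_POSITIONS = (2, 3, 4, 6)  # hydroxyl positions available on the acceptor

# Trivial names of the eight resulting glucose-glucose disaccharides.
TRIVIAL_NAMES = {
    ("alpha", 2): "kojibiose",
    ("beta", 2): "sophorose",
    ("alpha", 3): "nigerose",
    ("beta", 3): "laminaribiose",
    ("alpha", 4): "maltose",
    ("beta", 4): "cellobiose",
    ("alpha", 6): "isomaltose",
    ("beta", 6): "gentiobiose",
}


def enumerate_linkages():
    """Return (linkage label, trivial name) for every anomer/position pair."""
    return [
        (f"Glc({anomer}1-{position})Glc", TRIVIAL_NAMES[(anomer, position)])
        for anomer, position in product(ANOMERS, ACCEPTOR_POSITIONS)
    ]


linkages = enumerate_linkages()
for label, name in linkages:
    print(f"{label:18s} {name}")
print(len(linkages), "distinct disaccharides from a single monosaccharide type")
```

Even this restricted case yields eight distinct compounds from one building block, whereas two identical amino acids or nucleotides can be joined in only one way. A chemical synthesis has to select exactly one of these outcomes at every glycosylation step.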
Solutions employing chemical biology
Challenges spark creativity. The limited ability of molecular biology methods to tackle the above technical roadblocks has provided an incentive for chemistry to rise to the challenge. Now a key part of the modern glycosciences, chemical glycobiology has become a fertile ground for optimising methods in general; due to the inherent difficulties in the field, methods that work in the glycosciences often prove to be state-of-the-art and transferable to other fields.
While chemical glycobiology is a tremendously creative area, a small number of methods have particularly pushed the field towards maturity as a discipline within modern biology.
Carbohydrate chemistry
While the synthesis of carbohydrates is a century-old discipline, the past two decades have seen a revolution in the throughput and accessibility of complex oligosaccharides. This development has been fuelled by innovations in automated glycan synthesis using both solid and solution phases7,8, as well as by innovations in chemoenzymatic syntheses9 (Fig. 1b). As a result, many more biologically important glycans are now synthetically accessible. Cutting-edge synthetic methods still work best in a chemistry-focused laboratory, due to the elaborate compound purification and characterisation techniques demanded. The efforts of the synthesis community are directed towards expanding the repertoire of accessible structures as well as improving the economy of chemical transformations. While these tasks are deeply rooted in classical synthetic methodology, automated approaches to scouting reaction conditions10 (Fig. 1c) and continued discovery of new carbohydrate-active enzymes (http://www.cazy.org) are sure to further advance this subfield.
Glycan microarrays and display techniques
A chemistry-based solution to the difficulty of immobilising natural glycans was the development of glycan microarrays. Conjugation to immobilisable linkers or to lipids11,12 has allowed for parallel robotic printing of glycan probe libraries in minuscule amounts (typically femtomoles) on a solid support. These probes are then screened for interaction with potential binding partners similarly to an ELISA (Fig. 1d). While glycan microarrays have existed for decades, they remain state-of-the-art and have become a key resource in understanding glycan recognition systems. The next years will likely see novel arraying methods come to maturity, such as multiplex bead arrays which aim to boost throughput13, cell-based arrays to present glycoconjugates in a near-native environment14, and liquid arrays of densely conjugated, DNA-barcoded virions15. It is imperative that the scientific community recognises glycans as a major determinant of biological function, and that the need to profile glycan binding is matched by the provision of screening facilities.
Metabolic oligosaccharide engineering
In the early 1990s, Reutter observed that certain GTs can accept chemically modified nucleotide-sugars as substrates16. Chemists soon set out to equip these sugars with chemically editable functional groups that can be traced after incorporation into glycans by the cellular biosynthetic machinery17. A number of these metabolic oligosaccharide engineering (MOE) probes have been developed since; many of these display bioorthogonal click handles to facilitate specific tagging. This technology offers an elegant solution for many of the technical roadblocks discussed above: incorporation of MOE probes allows for enrichment of glycans, and their identification by proteomics and imaging18 (Fig. 2a). In order to truly enter modern biology, next-generation MOE techniques must focus on specificity, enabling the development of reporters for subtypes of glycans. One strategy to achieve such specificity is the “bump-and-hole” tactic in which a GT is engineered to accommodate a chemically modified sugar that is ideally not used by wild-type enzymes. This approach yields a bioorthogonal reporter of glycans synthesised within cells by the selected enzyme. We have used this tactic to inform on the cellular substrate specificities of members of the human GalNAc-T GT family19–21 (Fig. 2b). Probes with high potency and specificity for cytoplasmic and nuclear O-GlcNAc glycosylation have also been developed, allowing these glycoconjugates to be studied by imaging and MS-proteomics22,23. The next years will see this and similar approaches render MOE more specific, so that it informs more reliably on the biological implications of glycans.
Analytical chemistry
The glycosciences have seen some of the most elaborate technological advances in MS. Thanks to technical innovations in all areas of the sequencing workflow, from sample preparation and enrichment to data analysis, glycan structure determination has become far more tractable in recent years (Fig. 2c). This is especially apparent in MS-glycoproteomics, where the readout of both glycan and peptide sequences has been achieved through orthogonal fragmentation techniques such as higher-energy collisional dissociation (HCD) and electron transfer dissociation (ETD)24. Glycopeptides typically require enrichment to increase the signal-to-background ratio, and bioorthogonal chemistry has provided a means to this end18. The use of specialised surfaces for the specific capture of glycans from complex biological mixtures has also shown promise for enrichment24. Distinguishing between isomeric monosaccharides as well as different anomeric linkages remains a particular challenge, especially in glycan structures that cannot be inferred from biosynthetic considerations. While great improvements have been made to liquid chromatography in recent years, additional innovations are required. Emerging approaches to dissect glycan structures in unprecedented detail include ion mobility mass spectrometry25 and laser-induced infrared spectroscopy26 to unravel the molecular identity of each monosaccharide and linkage. The next years will see further implementation of these techniques into routine glycan analysis, allowing the roles of glycans to be probed in greater structural detail.
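The isomer problem mentioned above can be made concrete with a short calculation. The sketch below computes monoisotopic masses from elemental composition for a few common free monosaccharides and groups those that share a mass. The element masses are standard monoisotopic values; the selection of sugars and all function names are illustrative assumptions rather than part of any established MS software.

```python
from collections import defaultdict

# Monoisotopic masses (Da) of the most abundant isotope of each element.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}

# Elemental compositions of some common free monosaccharides.
# Hexoses share C6H12O6; N-acetylhexosamines share C8H15NO6.
COMPOSITIONS = {
    "Glc": {"C": 6, "H": 12, "O": 6},
    "Gal": {"C": 6, "H": 12, "O": 6},
    "Man": {"C": 6, "H": 12, "O": 6},
    "GlcNAc": {"C": 8, "H": 15, "N": 1, "O": 6},
    "GalNAc": {"C": 8, "H": 15, "N": 1, "O": 6},
}


def monoisotopic_mass(composition):
    """Sum element masses weighted by their counts in the composition."""
    return sum(MONOISOTOPIC[element] * count for element, count in composition.items())


def group_by_mass(compositions, decimals=4):
    """Group sugar names by monoisotopic mass rounded to `decimals` places."""
    groups = defaultdict(list)
    for name, composition in compositions.items():
        groups[round(monoisotopic_mass(composition), decimals)].append(name)
    return dict(groups)


mass_groups = group_by_mass(COMPOSITIONS)
for mass, names in sorted(mass_groups.items()):
    print(f"{mass:.4f} Da: {', '.join(sorted(names))}")
```

Because glucose, galactose and mannose differ only in the orientation of hydroxyl groups, they fall into a single mass group, as do GlcNAc and GalNAc. A mass measurement alone therefore cannot tell them apart, which is why separation by ion mobility or gas-phase spectroscopy adds information.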
Outlook
Progress on chemical methodologies has greatly expanded our toolbox to dissect the wide-ranging roles of glycans. The coming years are sure to bring more innovative chemical biology-based solutions to the challenges posed by studying glycobiology, providing deeper insights and making the field more accessible.
Acknowledgements
This work was supported by a Crick-HEI studentship funded by the Department of Chemistry at Imperial College London and the Francis Crick Institute (to M.I.Z-H.). This work was supported by the Francis Crick Institute which receives its core funding from Cancer Research UK (FC001749), the UK Medical Research Council (FC001749) and the Wellcome Trust (FC001749). We would also like to thank the members of the Chemical Glycobiology Laboratory for proof-reading, and Prof. Ten Feizi (Imperial College London) for very helpful comments.
Author contributions
M. I. Z.-H. and B.S. both contributed to the writing and preparation of this manuscript.
Competing interests
The authors declare no competing interests.
References
- 1. Avery OT, Goebel WF. Chemo-immunological studies on conjugated carbohydrate-proteins. J Exp Med. 1929;50:533–550. doi: 10.1084/jem.50.4.533.
- 2. Varki A, et al., editors. Essentials of Glycobiology. 3rd ed. Cold Spring Harbor Laboratory Press; New York: 2017.
- 3. Watanabe Y, Allen JD, Wrapp D, McLellan JS, Crispin M. Site-specific glycan analysis of the SARS-CoV-2 spike. Science. 2020:eabb9983. doi: 10.1126/science.abb9983.
- 4. Casalino L, et al. Shielding and Beyond: The Roles of Glycans in SARS-CoV-2 Spike Protein. 2020 doi: 10.1101/2020.06.11.146522v1.
- 5. Bentley SD, et al. Genetic analysis of the capsular biosynthetic locus from all 90 pneumococcal serotypes. PLoS Genet. 2006;2:262–269. doi: 10.1371/journal.pgen.0020031.
- 6. Feizi T, Fazio F, Chai W, Wong C-H. Carbohydrate microarrays - a new set of technologies at the frontiers of glycomics. Curr Opin Struct Biol. 2003;13:637–645. doi: 10.1016/j.sbi.2003.09.002.
- 7. Guberman M, Seeberger PH. Automated glycan assembly: a perspective. J Am Chem Soc. 2019;141:5581–5592. doi: 10.1021/jacs.9b00638.
- 8. Wu C-Y, Wong C-H. In: Glycoscience: Biology and Medicine. Endo T, Seeberger PH, Hart GW, Wong C-H, Taniguchi N, editors. Springer; Japan: 2021. pp. 1–7.
- 9. Liu L, et al. Streamlining the chemoenzymatic synthesis of complex N-glycans by a stop and go strategy. Nat Chem. 2019;11:161–169. doi: 10.1038/s41557-018-0188-3.
- 10. Chatterjee S, Moon S, Hentschel F, Gilmore K, Seeberger PH. An empirical understanding of the glycosylation reaction. J Am Chem Soc. 2018;140:11942–11953. doi: 10.1021/jacs.8b04525.
- 11. Li Z, Feizi T. The neoglycolipid (NGL) technology-based microarrays and future prospects. FEBS Lett. 2018;592:3976–3991. doi: 10.1002/1873-3468.13217.
- 12. Geissner A, Seeberger PH. Glycan arrays: from basic biochemical research to bioanalytical and biomedical applications. Annu Rev Anal Chem. 2016;9:223–247. doi: 10.1146/annurev-anchem-071015-041641.
- 13. Purohit S, et al. Multiplex glycan bead array for high throughput and high content analyses of glycan binding proteins. Nat Commun. 2018;9:258. doi: 10.1038/s41467-017-02747-y.
- 14. Narimatsu Y, et al. An atlas of human glycosylation pathways enables display of the human glycome by gene engineered cells. Mol Cell. 2019;75:e5. doi: 10.1016/j.molcel.2019.05.017.
- 15. Sojitra M, et al. Genetically encoded, multivalent liquid glycan array (LiGA). 2020 doi: 10.1101/2020.03.24.997536v2.
- 16. Kayser H, et al. Biosynthesis of a nonphysiological sialic acid in different rat organs, using N-propanoyl-D-hexosamines as precursors. J Biol Chem. 1992;267:16934–16938.
- 17. Mahal LK. Engineering chemical reactivity on cell surfaces through oligosaccharide biosynthesis. Science. 1997;276:1125–1128. doi: 10.1126/science.276.5315.1125.
- 18. Parker CG, Pratt MR. Click chemistry in proteomic investigations. Cell. 2020;180:605–632. doi: 10.1016/j.cell.2020.01.025.
- 19. Choi J, et al. Engineering orthogonal polypeptide GalNAc-transferase and UDP-sugar pairs. J Am Chem Soc. 2019;141:13442–13453. doi: 10.1021/jacs.9b04695.
- 20. Schumann B, et al. Bump-and-Hole Engineering Identifies Specific Substrates of Glycosyltransferases in Living Cells. Mol Cell. 2020;78:824–834.e15. doi: 10.1016/j.molcel.2020.03.030.
- 21. Debets MF, et al. Metabolic precision labeling enables selective probing of O-linked N-acetylgalactosamine glycosylation. 2020 doi: 10.1101/2020.04.23.057208v1.
- 22. Chuh KN, Zaro BW, Piller F, Piller V, Pratt MR. Changes in metabolic chemical reporter structure yield a selective probe of O-GlcNAc modification. J Am Chem Soc. 2014;136:12283–12295. doi: 10.1021/ja504063c.
- 23. Li J, et al. An OGA-resistant probe allows specific visualization and accurate identification of O-GlcNAc-modified proteins in cells. ACS Chem Biol. 2016;11:3002–3006. doi: 10.1021/acschembio.6b00678.
- 24. Yu A, et al. Advances in mass spectrometry-based glycoproteomics. Electrophoresis. 2018;39:3104–3122. doi: 10.1002/elps.201800272.
- 25. Manz C, Pagel K. Glycan analysis by ion mobility-mass spectrometry and gas-phase spectroscopy. Curr Opin Chem Biol. 2018;42:16–24. doi: 10.1016/j.cbpa.2017.10.021.
- 26. Schindler B, et al. Anomeric memory of the glycosidic bond upon fragmentation and its consequences for carbohydrate sequencing. Nat Commun. 2017;8:973. doi: 10.1038/s41467-017-01179-y.