Abstract
Glycosylation is a ubiquitous post-translational modification read by glycan-binding proteins (GBP) that encodes important functions, but a robust understanding of these interactions and their consequences can be challenging to uncover. Glycan-GBP interactions are transient and weak, making them difficult to capture, and glycosylation is dynamic and heterogenous, necessitating study in native cellular environments to identify endogenous ligands. Proximity labeling, an experimental innovation that labels biomolecules close to a protein of interest, has recently emerged as a powerful strategy to overcome these limitations, allowing interactors to be tagged in cells for subsequent enrichment and identification by mass spectrometry-based proteomics. We will describe this nascent technique and discuss its applications in the last five years with different GBP classes, including Siglecs, galectins, and non-human lectins.
Graphical Abstract

Scheme of general proximity labeling approach for glycan-binding proteins.
A proximity labeling agent (PLA) is targeted to a glycan-binding protein interacting with its endogenous glycoprotein counter-receptor in live cells. Upon activation of the PLA, proteins near the glycan-binding protein are labeled with a tag. This tag is usually biotin, which can be used to enrich labeled proteins for identification via mass-spectrometry.
Introduction
The case for proximity labeling
Cell surfaces are coated by the glycocalyx, the dense collection of proteins and lipids molecules decorated with carbohydrate moieties called glycans. These glycoconjugates can serve as interactors for glycan-binding proteins (GBPs). As glycoconjugates and GBPs are often interchangeably referred to as ligands and receptors, for clarity we will refer to the protein carrier that has been post-translationally modified with glycans as the counter-receptor, and the proteins that bind to the glycan units as GBPs. GBPs serve important biological roles, including immunological regulation, cell adhesion, and differentiation [1, 2]. Thus, a thorough study of the binding events between GBPs and their glycoconjugate counter-receptors can prove useful in understanding a myriad of cellular processes and disease mechanisms.
Despite the ubiquity of glycosylation and its importance, the study of glycan-GBP interactions has been thwarted by the inherent challenges of studying these binding events in cells. Glycosylation is a dynamic post-translational modification that is not directly templated by the genetic code. While putative sites of N-linked glycosylation can be computationally predicted from consensus sequences, glycan occupancy is not guaranteed [3, 4]. Further, glycan modifications are highly heterogeneous. There are multiple carbohydrate monomers that can be linked to each other at many positions with different stereochemistry, and with varying degrees of branching. These variations are critical considerations in glycan-GBP interactions, as GBPs show distinct binding preferences for the identity and linkage of glycans [5]. Furthermore, glycan-GBP binding is often transient, occurring with koff < 0.01 s−1 [6], and they are thermodynamically weaker than protein-protein interactions (KD ~ mM) [7]. As such, multivalency (from either the GBP or glycan), where several binding interactions occur simultaneously, is often required to elicit biologically relevant functions [8]. This accumulation of multiple, non-covalent interactions results in high avidity events that enhance the overall apparent affinity of GBP-glycan interactions [9-11]. Whereas much of glycoscience has been occupied with understanding the glycan structural requirements involved in binding GBPs, there is reason to place the preservation of native multivalency at the forefront of designing binding assays. Standard biochemical approaches, like co-immunoprecipitation from disrupted cell lysates, results in identification of few, if any, GBP interactors [12]. Further, lysate-based methods like immunoprecipitation disrupt the natural organization of biomolecules and can result in the identification of false counter-receptors, as interactions usually inaccessible are artificially generated [13, 14].
The recapitulation of the endogenous, multivalent presentation and diversity of glycans in vitro is immensely challenging, but advances in chemical synthesis and new tools have emerged to systematically study the tolerances of GBPs towards various glycan architectures. For example, well-defined glycan structures can be printed onto arrays [15, 16] or biologically presented in genetically modified bacteriophages and mammalian cells to measure GBP binding [17-20]. The largest limitation of these formats, however, is the inability to study the glycoconjugate itself within a native cellular environment. Extracellular glycan-GBP interactions can elicit responses that vary with the identity of the interactor [21], and the presentation of a glycoconjugate, including its localization on the cell surface and the identity of neighboring proteins, can affect GBP binding by altering the clustering and multivalency of presented ligands [22]. Thus, there is a growing appreciation to study glycan-GBP interactions in endogenous environments to accurately assess binding.
Proximity labeling is a method that tags biomolecules (usually proteins) neighboring a protein of interest to identify putative interactors. This protein of interest is often termed the “bait,” as it delivers a proximity labeling agent (PLA) to its apprehended targets. Developed over a decade ago, and originally intended for the study of protein-protein interactions [23] or mapping the constituents of specific cellular compartments [24, 25], proximity labeling technologies have typically co-opted non-human enzymes to promiscuously biotinylate proteins within a short radius. These biotinylated proteins can then be enriched with streptavidin and identified via mass spectrometry (MS)-based proteomics.
Glycoscientists have recognized that this system is particularly apt as a tool to overcome the inherent challenges of studying glycan-GBP interactions. Proximity labeling can be used to convert the low affinity, transient interactions of glycans and GBPs to covalent biotinylation, effectively taking a snapshot of the dynamic interactions in the cellular milieu. Advantageously, native interactions are preserved with spatiotemporal accuracy. Importantly, proximity labeling can also be performed in live, non-genetically modified cells, where the native glycosylation of proteins and the endogenous presentation of glycoprotein counter-receptors are preserved. Thus, biologically relevant interactors can be probed in a cellular context with preservation of multivalency and cellular architecture. Glycan-GBP focused proximity labeling methods share a similar strategy: allow a GBP bait to bind to its glycoprotein counter-receptors and then employ a proximity labeling agent (PLA) to tag the interactome with a biotin moiety in situ for enrichment, followed by western blotting or mass spectrometry (MS)-based proteomic identification. In general, PLAs biotinylate proteins near a molecular bait, but they can vary in the catalytic moiety, which inherently affects the radius of biotinylation, duration of biotinylation, and the degree to which multivalency is preserved (Fig. 1).
Figure 1: Comparison of proximity labeling agents.
Proximity labeling agents (PLAs) vary in their size and labeling chemistries, inherently affecting the labeling radius, the reaction time for tag addition, and the identity of the tag deposited. These parameters should be carefully considered in proximity labeling experiments to preserve the native binding of glycan-binding proteins and to minimize promiscuity of labeling. (Abbreviations: horseradish peroxidase – HRP, engineered ascorbate peroxidase – APEX2, iron (S)-1-(p-bromoacetamidobenzyl) EDTA – Fe-BABE, adenosine monophosphate – AMP)
Proximity labeling approaches were first applied to illuminate glycan-GBP interactions in 2017, and in this Review, we will discuss such examples over the last five years, emphasizing the applicability of this approach towards various GBPs. We have organized our discussion based on unique classes of GBP, as they are known to engage glycoprotein counter-receptors in different ways, and proximity labeling strategies are usually adapted to minimize perturbations in these binding events. We also highlight the future of this field, as proximity labeling for glycan-GBP interactions is still in its infancy and promises to be a significant tool for discovering more precise glycan-mediated biology.
Sialic acid-binding immunoglobulin-type lectins (Siglecs)
Sialic acid-binding immunoglobulin-type lectins (Siglecs) are immunomodulatory proteins that preferentially bind to sialic acid glycans [26]. Most Siglecs possess an intracellular immunoreceptor tyrosine-based inhibitory motif (ITIM) that recruits phosphatases after ligand engagement to dampen immune responses. These interactions are known to contribute to cancer cell immune evasion and autoimmunity; however, a mechanistic understanding of Siglec-mediated immune suppression is lacking in many contexts, as the counter-receptors are still relatively unknown [26-28].
In 2017, two groups independently used horseradish peroxidase (HRP) as the PLA to investigate the binding of Siglecs in cells. HRP is a metalloenzyme with a heme active site that generates free radicals upon incubation with hydrogen peroxide (H2O2). Upon co-incubation with biotin tyramide and H2O2, HRP catalyzes the formation of biotin tyramide radicals. These are short-lived species that rapidly react with electron-rich residues with a labeling radius of approximately 20 nm. The catalytic efficiency of HRP allows for short labeling times (< 10 min) that are important consideration for achieving temporal resolution. Wu et al. examined the Sialoadhesion (Siglec-1) counter-receptors on human erythrocytes [29] using a tripartite fusion protein composed of Siglec-1, HRP, and a human Fc domain. This construct was pre-complexed with an anti-Fc antibody to induce protein oligomerization, mimicking the clustering of Siglecs to promote cell surface binding. The authors confirmed that a known interactor of Siglec-1, Glycophorin A, was biotinylated by western blotting [30], and they orthogonally confirmed their results with a second proximity labeling method where HRP was introduced as a conjugate to an anti-Fc antibody, directing the PLA to a Siglec-1-Fc construct lacking its own HRP. Biotinylation of Glycophorin A was still observed, suggesting that both the peroxidase chimera and the peroxidase-conjugated antibodies could be used for proximity labeling. This work established the foundation for proximity labeling for GBPs but was not extended to MS-based proteomics. Chang et al. concurrently employed a similar strategy for Siglec-2 interactors in B cells [31]. They used an HRP conjugated anti-FLAG antibody to localize the PLA to a FLAG-tagged Siglec-2-Fc protein. Again, an Fc domain was used to promote the multivalency required for Siglec binding. After proximity labeling, samples were enriched for MS-based proteomics, resulting in the identification of new and previously known interactors. In the same work, they also characterized the Siglec-15 interactome in macrophages via proximity labeling, identifying the first known interactors of Siglec-15, including CD44. Further functional validation was performed with CD44 knockdowns, which demonstrated decreased binding of Siglec-15 and hampered cell fusion, an essential feature of osteoclast differentiation. Importantly, this work demonstrated the potential of proximity labeling integrated with MS-based proteomics to uncover new counter-receptors for GBPs. An important consideration in proximity labeling approaches is the radius of labeling. Using a direct system where the bait GBP and HRP are fused together versus a secondary system where a subsequent HRP incubation step (e.g. using an HRP-conjugated antibody) is performed can yield different labeling radii, and the latter approach can extend the labeling radius to ~200 nm and increase the likelihood of labeling proteins that are not direct interactors or counter-receptors [32].
With these foundational data, other groups shortly adopted proximity labeling for Siglecs studies. HRP-antibody conjugates were used for proximity labeling of Siglec-7 counter-receptors on K562 lymphoblasts to understand natural killer (NK) cell immune responses [33]. As expected for glycan-GBP interactions, co-immunoprecipitations of Siglec-7 and putative interactors by Yoshimura et al. showed inconsistent pull-down of proteins of interest. Thus, proximity labeling was performed to capture these transient interactions, which confirmed that CD43 is a ligand of Siglec-7. Subsequent cellular studies showed that CD43 expressed on K562 cells bound to Siglec-7 on NK cells to attenuate NK cytotoxicity, again emphasizing the power of proximity labeling to identify functional GBP ligands. This finding was independently confirmed the same year through a CRISPR screen conducted by the Bertozzi group, providing confidence that proximity labeling can uncover relevant interactors of GBPs [34].
In the previous examples, exogenous Siglec proteins were added to live cells to identify interactors, but a critical consideration for the Siglec class of proteins is their ability to not only have interactions with external stimuli (trans interactors) but also ligands on their own cell surface (cis interactors). Indeed, previous work has shown distinct binding preferences for cis and trans ligands [14, 35]. To capture this nuance, a cis-interaction proximity labeling platform was generated using HRP conjugated antibodies directed to endogenously expressed Siglec-2 [36]. A 2,6-sialyltransferase knockout was used as a control to delineate glycan-mediated events and those that do not require sialic acid. Interestingly, Western blotting confirmed that CD45 and Siglec-2 itself was biotinylated regardless of sialic acid, a finding supported by previous work [37]. This finding emphasizes that proximity labeling tags neighboring proteins of bait GBPs, whether engaged in a glycan-mediated interaction or protein-protein interaction, necessitating the use of appropriate controls for discrimination.
Galectins
Our group applied proximity labeling to another class of GBPs called Galectins. Galectins are a class of soluble lectins that preferentially bind to β-galactoside motifs [38]. These GBPs are known regulators of immune responses [39], liver fibrosis [40], and myogenesis [41]; however, the molecular mechanisms underlying Galectin-mediated biology remain poorly understood. Unlike monomeric Siglecs with anchoring transmembrane domains, Galectins can oligomerize and can be secreted extracellularly [42]. Thus, some understanding of the effects of the PLA tag towards oligomerization and secretion is important for this class of GBPs. We generated fusion proteins of a peroxidase and Galectins of interest and showed that normal oligomerization was not perturbed by the addition of the PLA. Instead of HRP, we used another heme-containing metalloenzyme, the engineered ascorbate peroxidase APEX2, as it is uniquely tolerant to both the extracellular oxidative and the reducing intracellular environments [43], a feature that is important for studying Galectins which can exhibit both types of interactions [44]. Like HRP, APEX2 affords short labeling times (~ 1 min) for high temporal resolution, which we envisioned could be useful for examining how Galectin interactomes change as they are endocytosed. APEX2 chimeras can be overexpressed by transfection, allowing for a comparison of interactomes based on subcellular localization, simply by varying between exogenous incubation and transfection. This new method was first optimized with APEX2-fused Galectin-3 in hepatic stellate cells (Fig. 2A), [45] followed by APEX2-fused Galectin-1 in myoblasts [46]. In both instances, we used quantitative MS-based proteomics to confirm known interactors and discover new ones (Fig. 2B). During this work, we have found that it is applicable to many cell types. Importantly, however, we have found that the optimization of protein expression, protein concentrations, and competition during labeling are key towards achieving success, which can be optimized via microscopy (Fig. 2C).
Figure 2: Proximity labeling with APEX2 to capture the Galectin-3 interactome.
(A) A scheme of galectin-based proximity labeling strategy shows the incubation of the Galectin-3 APEX2 fusion protein on the cell surface of live cells, followed by incubation with biotin phenol and hydrogen peroxide. This method results in the biotinylation of cell surface proteins, which can be visualized with immunofluorescence microscopy or enriched for mass spectrometry-based proteomics. (B) The interactome of galectin-3 identified by mass spectrometry is represented as a plot of the proteins enriched by proximity labeling and those competed by addition of lactose. Putative galectin-3 interactors of interest are highlighted in orange. (C) Proximity labeling was performed and biotinylation was observed by microscopy using streptavidin-Cy5. Dose-dependent labeling is observed, as well as competition with lactose to show glycan-mediated interactions. (B) and (C) reprinted with permission from E. Joeh, T. O’Leary, W. Li, R. Hawkins, J.R. Hung, C.G. Parker, and M.L. Huang, Mapping glycan-mediated galectin-3 interactions by live cell proximity labeling, PNAS. 117 (2020) 44, 27329-27338. Copyright 2020 National Academy of Sciences.
Non-human lectins
Proximity labeling has recently been performed with non-human lectins that have been repurposed as scouting proteins. Unlike studies with Siglecs and galectins in which the primary goal is to elucidate unknown counter-receptors, proximity labeling with non-human lectins is being used to analyze native glycosylation patterns on glycoproteins. Herculean efforts have been made to thoroughly characterize the glycan binding preferences of lectins to glycans, often with computational studies, glycan arrays, or a combination of both [47-49]. Traditionally, these lectins are used to identify global changes in glycosylation, but their use in proximity labeling has proven useful to identify changes in glycosylation of individual proteins.
The first use of a non-human lectin for proximity labeling was performed with Maackia amurensis lectin (MAL II), a plant lectin known to bind α-2,3 sialic acids [31]. MAL II was biotinylated and incubated with erythrocytes followed by the addition of streptavidin-HRP to label nearby proteins with biotin upon introduction of H2O2 and biotin tyramide. This work laid the foundation for the application of non-human lectins to study other glycosylation motifs, including the plant lectin PHA, which recognizes the terminal galactose residues of complex N-glycans [50]. Biotinylated PHA was used in the SUGAR-seq method to elucidate changes in glycosylation of individual cell surface proteins [51]. Using streptavidin-HRP to perform proximity labeling with biotinylated PHA, Kearney et al. showed that T cells derived from healthy and tumor positive mice had distinct glycosylation profiles. Interestingly, cells could be sorted into groups of high and low PHA binding, which correlated with the degree of chromatin accessibility and differentiation state, underlining the importance of glycosylation in cell function and state. Further, glycoproteins implicated in adhesion, migration, and activation showed changes in their enrichment in high and low glycosylation groups, suggesting changes in glycosylation which could affect their ability to serve as receptors for GBPs.
Another method called GlycoID uses proximity labeling to detect O-GlcNAc modifications [52]. Unlike the peroxidase-based PLAs, an engineered biotin ligase called TurboID was used [53, 54]. This enzyme uses ATP to convert biotin into AMP-biotin, which is then appended to nearby, accessible lysine residues, covalently tagging proteins without H2O2. This facet renders TurboID-based methods less toxic than HRP- and APEX2-mediated approaches, with longer labeling times as a trade-off. TurboID requires minute to hours for labeling, as it does not use radical propagation like the peroxidase-based PLAs, but it is tolerant of multiple cellular compartments like APEX2. To detect O-GlcNAc sugars, the authors generated a fusion construct of GafD (a bacterial lectin that recognizes O-GlcNAc [55]) and TurboID, which was localized to the nucleus and cytosol to reveal different hubs of O-GlcNAc-ylation. Advantageously, this platform was used to assess physiological changes over time. For example, they subjected their cells to starvation or insulin stimulation and observed distinct changes associated with RNA splicing, metabolism, and signaling. The temporal and spatial resolution of this technique underscores the utility of proximity labeling to measure the dynamics of glycosylation.
Not all proximity labeling approaches use enzyme-based PLAs. Instead, the Lebrilla group developed Lectin PROXL, which uses click chemistry to functionalize lectins with a small molecule-based PLA. Lysine residues on lectins are functionalized with NHS-PEG-azide followed by conjugation to the PLA, a dibenzocyclooctyne (DBCO)-modified iron (S)-1-(p-bromoacetamidobenzyl) EDTA (Fe-BABE) moiety [56]. This motif mimics the heme active site of the metalloproteins HRP and APEX2 to oxide proteins in the presence of H2O2. The oxidation of proteins can then be detected by mass spectrometry, effectively labeling interactors for identification without depositing an enrichment tag like biotin. Because Fe-BABE is approximately 2 kilodaltons in size and catalyzes radical formation, the labeling time and radius are smaller than GlycoID. Lectin PROXL was used to probe the specificity of non-human lectins, but theoretically Lectin PROXL could be used to identify Galectin and Siglec interactors, as the lectin functionalization step only requires a free lysine residue. They found high correlation between oxidation site and putative glycosylation sites, bringing confidence to their identified targets and confirming that the smaller radius of Fe-BABE can yield higher resolution to functional hits. In general, they identified that sialic acid- and fucose-binding lectins had greater specificity and selectivity than other lectins, an important consideration for glycobiologists that commonly use these probes to analyze glycosylation patterns. One limitation of this method is that biotin is not used for enrichment, so cell membrane proteins are isolated by centrifugation to detection these relatively low abundance proteins compared to the whole proteome. It is possible that this separation introduces contaminating intracellular proteins and incomplete capture of the cell membrane, leading to less coverage than biotin-based enrichment strategies.
The future of glycan directed proximity labeling and considerations
In the past five years, the utility of proximity labeling has begun to be appreciated by the glycobiology community. GBP ligands once undetectable by traditional biochemical techniques due to weak, transient interactions unamenable to traditional biochemical approaches are now identifiable in their native cellular context with an array of proximity labeling agents (Figure 3). We can also use proximity labeling to characterize the heterogeneous glycocalyx, which could emerge as a powerful tool to map changes in disease contexts. For example, cancer cells are hypersialylated to evade immunological surveillance, but the proteins that gain this glycan modification are relatively unknown. A comparison of normal and cancerous cells using proximity labeling with sialic acid-binding non-human lectins could reveal what proteins are excessively sialylated, allowing for further study of their role in immune evasion. While the current strategies have been directed towards GBPs, there is also the potential to expand proximity labeling to glycoconjugates, like proteoglycans [57].
Figure 3: Representation of current proximity labeling approaches applied to live cells.
Glycan-binding proteins (GBP) of interest are allowed to bind to their glycoconjugate counterreceptors in live cells on the cell surface or intracellularly. A proximity labeling agent (PLA) is directed to this GBP through various strategies, including streptavidin-horseradish peroxidase (HRP), HRP-conjugated antibodies, and direct tethering of chimeric constructs with HRP, engineered ascorbate peroxidase (APEX2), TurboID, or iron (S)-1-(p-bromoacetamidobenzyl) EDTA (Fe-BABE). Activation of the PLA results in the tagging of proteins near the GBP, which can be identified by mass spectrometry-based proteomics.
We also note that, while not within the scope of this review, there is also a recent effort to capture GBP-glycoprotein interactions by derivatizing cell surface glycans. Such incorporation of derivatized sugars, called metabolic oligosaccharide engineering, is another method for identifying GBP interactions for which we will provide key examples but will not cover exhaustively, as there are other reviews focused on this topic [58, 59]. For example, the Paulson group has used azide-modified sialic acids to detect Siglec ligands by MS-based proteomics [14, 35]. Additionally, the Kohler group has used diazirine-modified N-acetylglucosamine (GlcNAc) to crosslink GBPs and their ligands for MS-based proteomic identification of galectin-1 ligands [13]. As a final example, the Lebrilla group used azide-modified monosaccharides to click on the Fe-BABE moiety to glycoprotein ligands for subsequent oxidation of neighboring putative GBP interactors [60]. As an extension of this work, GBPs were modified with cyclooctyne to crosslink GBPs directly to the azide-modified sugars on glycoproteins to identify counter-receptors [61]. These approaches depend on the successful incorporation of derivatized monosaccharides by the endogenous glycosylation machinery of cells, which varies by cell type. To overcome this limitation, the Kohler and Capicciotti groups concurrently published an exo-enzymatic strategy to introduce diazirine-modified sialic acids to the cell surface [62, 63]. Here, a neuraminidase cleaves sialic acids off the cell surface and diazirine-modified sugars are subsequently added with a sialyltransferase to photocrosslink GBPs and their counter-receptors.
There are important caveats to consider when performing proximity labeling experiments. First, iron and peroxidase-based labeling (while rapid and efficient) is toxic in vivo, restricting these approaches to cultured cells. The rise of enzyme evolution and engineering has yielded efficient biotin ligases (like TurboID used for GlycoID), but the true scope of glycan proximity labeling in live animals has yet to be explored. Second, a growing concern in the field of proximity labeling is the delineation between an interactor and a functional counter-receptor. Due to the labeling radius of PLAs, there is a significant probability that tagged proteins are not relevant binding partners. Shortening incubation time or changing the PLA can mitigate off-target labeling. The use of soluble glycan competitors, inhibitors of the glycosylation machinery, and lectins with active site mutations are standard controls for proximity labeling and are essential for parsing out non-glycan mediated interactors, but better controls (e.g. anti-epitope antibodies) can increase confidence that biotinylated proteins are indeed functional interactors. To further filter hits, we envision the coupling of glycan-GBP proximity labeling with peroxidase-based cell surface enrichment techniques to determine the specificity of a GBP interaction relative to the abundance of the protein on the cell surface [64]. Third, the method of proximity labeling can also affect the amenability of tagged proteins to various proteomics workflows. During peroxidase- and TurboID-based methods, biotin is covalently linked to proteins that are enriched via streptavidin beads. Elution of these proteins is usually performed by protease digestion, resulting in the release of peptides for MS identification. In this case, biotinylated portions of the protein still remain bound to the bead. Therefore, proteomics databases can be searched without needing to add the mass of the biotin tag to the expected masses of peptides. However, biotinylated portions of the protein are trapped to the beads, which theoretically includes the region of the protein closest to the PLA, likely including the glycosylation site. Thus, standard biotin tyramide is optimal for proteomics experiments, rather than glycoproteomics. For glycoproteomic experiments, cleavable biotin linkers (usually acid- or photo-labile) can be used to release whole proteins from the streptavidin beads before protease digestion, preserving glycan information. Database searches will have to use search parameters that include the mass of the tag remaining after cleavage. Alternatively, the Lectin PROXL method is amenable to proteomics and glycoproteomics because biotin is not used for subsequent bead-based enrichment. Here, the search parameters for raw MS interpretation are paramount to distinguish oxidized and non-oxidized proteins, as GBP interactors are identified by this oxidation. Finally, after producing a list of putative interactors via proximity labeling, a significant amount of biochemical leg work must be performed to validate this binding event and characterize its biological relevance. We posit that collaborations between glycobiologists and bioinformaticians can improve the identification of functionally relevant targets for validation studies.
Acknowledgments
A.E.R. is supported by funding from the National Science Foundation Graduate Research Fellowship Program (NSF GRFP) and the Skaggs Graduate Fellowship enabled by the Schimmel Family Foundation. M.L.H is supported by funding from the National Institute of General Medical Sciences (NIGMS R35GM142462) and the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK R56DK126895).
Footnotes
Declaration of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Citations
- [1].Schnaar RL: Glycans and glycan-binding proteins in immune regulation: A concise introduction to glycobiology for the allergist. J Allergy Clin Immunol 2015, 135:609–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Taylor ME and Drickamer K: Mammalian sugar-binding receptors: known functions and unexplored roles. FEBS J 2019, 286:1800–814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Petrescu AJ, Milac AL, Petrescu SM, Dwek RA and Wormald MR: Statistical analysis of the protein environment of N-glycosylation sites: implications for occupancy, structure, and folding. Glycobiology 2004, 14:103–14. [DOI] [PubMed] [Google Scholar]
- [4].Ben-Dor S, Esterman N, Rubin E and Sharon N: Biases and complex patterns in the residues flanking protein N-glycosylation sites. Glycobiology 2004, 14:95–101. [DOI] [PubMed] [Google Scholar]
- [5].Bojar D, Meche L, Meng G, Eng W, Smith DF, Cummings RD and Mahal LK: A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities. ACS Chemical Biology 2022, [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Reuel NF, Ahn JH, Kim JH, Zhang J, Boghossian AA, Mahal LK and Strano MS: Transduction of glycan-lectin binding using near-infrared fluorescent single-walled carbon nanotubes for glycan profiling. J Am Chem Soc 2011, 133:17923–33. [DOI] [PubMed] [Google Scholar]
- [7].Collins BE and Paulson JC: Cell surface biology mediated by low affinity multivalent protein-glycan interactions. Curr Opin Chem Biol 2004, 8:617–25. [DOI] [PubMed] [Google Scholar]
- [8].Mammen M, Choi S-K and Whitesides GM: Polyvalent Interactions in Biological Systems: Implications for Design and Use of Multivalent Ligands and Inhibitors. Angewandte Chemie International Edition 1998, 37:2754–2794. [DOI] [PubMed] [Google Scholar]
- [9].Lee RT and Lee YC: Affinity enhancement by multivalent lectin-carbohydrate interaction. Glycoconjugate Journal 2000, 17:543–551. [DOI] [PubMed] [Google Scholar]
- [10].Kiessling LL, Gestwicki JE and Strong LE: Synthetic multivalent ligands as probes of signal transduction. Angew Chem Int Ed Engl 2006, 45:2348–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].O'Reilly MK and Paulson JC, Multivalent Ligands for Siglecs, in: (Eds.), Methods in Enzymology, 2010, pp. 343–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Obermann J, Priglinger CS, Merl-Pham J, Geerlof A, Priglinger S, Gotz M and Hauck SM: Proteome-wide Identification of Glycosylation-dependent Interactors of Galectin-1 and Galectin-3 on Mesenchymal Retinal Pigment Epithelial (RPE) Cells. Mol Cell Proteomics 2017, 16:1528–1546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Wu H, Shajahan A, Yang J-Y, Capota E, Wands AM, Arthur CM, Stowell SR, Moremen KW, Azadi P and Kohler JJ: A photo-cross-linking GlcNAc analog enables covalent capture of N-linked glycoprotein-binding partners on the cell surface. Cell Chemical Biology 2022, 29:84–97.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Han S, Collins BE, Bengtson P and Paulson JC: Homomultimeric complexes of CD22 in B cells revealed by protein-glycan cross-linking. Nat Chem Biol 2005, 1:93–7. [DOI] [PubMed] [Google Scholar]
- [15].Blixt O, Head S, Mondala T, Scanlan C, Huflejt ME, Alvarez R, Bryan MC, Fazio F, Calarese D, Stevens J, Razi N, Stevens DJ, Skehel JJ, van Die I, Burton DR, Wilson IA, Cummings R, Bovin N, Wong CH and Paulson JC: Printed covalent glycan array for ligand profiling of diverse glycan binding proteins. Proc Natl Acad Sci U S A 2004, 101:17033–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Wang L, Cummings RD, Smith DF, Huflejt M, Campbell CT, Gildersleeve JC, Gerlach JQ, Kilcoyne M, Joshi L, Serna S, Reichardt NC, Parera Pera N, Pieters RJ, Eng W and Mahal LK: Cross-platform comparison of glycan microarray formats. Glycobiology 2014, 24:507–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Nielsen MI, Stegmayr J, Grant OC, Yang Z, Nilsson UJ, Boos I, Carlsson MC, Woods RJ, Unverzagt C, Leffler H and Wandall HH: Galectin binding to cells and glycoproteins with genetically modified glycosylation reveals galectin-glycan specificities in a natural context. J Biol Chem 2018, 293:20249–20262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Sojitra M, Sarkar S, Maghera J, Rodrigues E, Carpenter EJ, Seth S, Ferrer Vinals D, Bennett NJ, Reddy R, Khalil A, Xue X, Bell MR, Zheng RB, Zhang P, Nycholat C, Bailey JJ, Ling CC, Lowary TL, Paulson JC, Macauley MS and Derda R: Genetically encoded multivalent liquid glycan array displayed on M13 bacteriophage. Nat Chem Biol 2021, 17:806–816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Bull C, Nason R, Sun L, Van Coillie J, Madriz Sorensen D, Moons SJ, Yang Z, Arbitman S, Fernandes SM, Furukawa S, McBride R, Nycholat CM, Adema GJ, Paulson JC, Schnaar RL, Boltje TJ, Clausen H and Narimatsu Y: Probing the binding specificities of human Siglecs by cell-based glycan arrays. Proc Natl Acad Sci U S A 2021, 118: [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Briard JG, Jiang H, Moremen KW, Macauley MS and Wu P: Cell-based glycan arrays for probing glycan-glycan binding protein interactions. Nat Commun 2018, 9:880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Nakahara S and Raz A, On the Role of Galectins in Signal Transduction, in: (Eds.), Methods in Enzymology, 2006, pp. 273–289. [DOI] [PubMed] [Google Scholar]
- [22].Cohen M and Varki A, Modulation of glycan recognition by clustered saccharide patches, in: (Eds.), Int Rev Cell Mol Biol, 2014, pp. 75–125. [DOI] [PubMed] [Google Scholar]
- [23].Roux KJ, Kim DI, Raida M and Burke B: A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells. J Cell Biol 2012, 196:801–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Hung V, Zou P, Rhee HW, Udeshi ND, Cracan V, Svinkina T, Carr SA, Mootha VK and Ting AY: Proteomic mapping of the human mitochondrial intermembrane space in live cells via ratiometric APEX tagging. Mol Cell 2014, 55:332–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Rhee H-W, Zou P, Udeshi ND, Martell JD, Mootha VK, Carr SA and Ting AY: Proteomic Mapping of Mitochondria in Living Cells via Spatially Restricted Enzymatic Tagging. Science 2013, 339:1328–1331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Duan S and Paulson JC: Siglecs as Immune Cell Checkpoints in Disease. Annu Rev Immunol 2020, 38:365–395. [DOI] [PubMed] [Google Scholar]
- [27].Crocker PR, Paulson JC and Varki A: Siglecs and their roles in the immune system. Nature Reviews Immunology 2007, 7:255–266. [DOI] [PubMed] [Google Scholar]
- [28].Macauley MS, Crocker PR and Paulson JC: Siglec-mediated regulation of immune cell function in disease. Nat Rev Immunol 2014, 14:653–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ** [29]. Wu G, Nagala M and Crocker PR: Identification of lectin counter-receptors on cell membranes by proximity labeling. Glycobiology 2017, 27:800–805. This is a pioneering example of the development of proximity labeling for GBPs. Method optimization was performed with Siglec-1, and its interaction with Glycophorin A is probed. This was also the first example of using a non-human lectin for proximity labeling which was used as a proof of concept for the labeling of glycoconjuagtes with specific glycan motifs.
- [30].Crocker PR, Kelm S, Dubois C, Martin B, S.McWilliam A, Shotton DM, Paulson JC and Gordon S: Purification and properties of sialoadhesin, a sialic acid-binding receptor of murine tissue macrophages. The EMBO Journal 1991, 10:1661–1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ** [31]. Chang L, Chen YJ, Fan CY, Tang CJ, Chen YH, Low PY, Ventura A, Lin CC, Chen YJ and Angata T: Identification of Siglec Ligands Using a Proximity Labeling Method. J Proteome Res 2017, 16:3929–3941. This is the first example of proximity labeling used with a GBP to identify novel interactors by MS-based proteomics. Method optimization was performed with Siglec-2 to test if known interactors were biotinylated, while novel interactors were identified for Siglec-15. One interactor, CD44, was shown to affect osteoclast fusion through knockout, hinting that binding of Siglec-15 to CD44 could affect differentiation
- * [32]. Oakley JV, Buksh BF, Fernandez DF, Oblinsky DG, Seath CP, Geri JB, Scholes GD and MacMillan DWC: Radius measurement via super-resolution microscopy enables the development of a variable radii proximity labeling platform. Proc Natl Acad Sci U S A 2022, 119:e2203027119. This work compares proximity labeling platforms with an emphasis on the radius of labeling to assess the promiscuity of these methods. They present a new aryl-azide-based μMap platform, which they argue advantageoulsy has an intermediate labeling distance not previously accessible.
- ** [33]. Yoshimura A, Asahina Y, Chang LY, Angata T, Tanaka H, Kitajima K and Sato C: Identification and functional characterization of a Siglec-7 counter-receptor on K562 cells. J Biol Chem 2021, 296:100477. Here CD43 is identified as a ligand of Siglec-7 via proximity labeling coupled with MS-based proteomics. This interaction was shown to mitigate cytotoxicity of natural killer cells toward K652 cells. This is an excellent example of how proximity labeling and mass-spectrometry can be coupled to find specific counter-receptors of GBPs to uncover unknown biology.
- [34].Wisnovsky S, Mockl L, Malaker SA, Pedram K, Hess GT, Riley NM, Gray MA, Smith BAH, Bassik MC, Moerner WE and Bertozzi CR: Genome-wide CRISPR screens reveal a specific ligand for the glycan-binding immune checkpoint receptor Siglec-7. Proc Natl Acad Sci U S A 2021, 118: [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Ramya TN, Weerapana E, Liao L, Zeng Y, Tateno H, Liao L, Yates JR 3rd, Cravatt BF and Paulson JC: In situ trans ligands of CD22 identified by glycan-protein photocross-linking-enabled proteomics. Mol Cell Proteomics 2010, 9:1339–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- * [36]. Alborzian Deh Sheikh A, Akatsu C, Imamura A, Abdu-Allah HHM, Takematsu H, Ando H, Ishida H and Tsubata T: Proximity labeling of cis-ligands of CD22/Siglec-2 reveals stepwise alpha2,6 sialic acid-dependent and -independent interactions. Biochem Biophys Res Commun 2018, 495:854–859. Here HRP-conjugated antibodies directed towards endogenously expressed Siglec-2 were used for proximity labeling of cis interactors of this GBP. This is a new extension of previous Siglec-focused proximity labeling methods that use recombinant Siglecs incubated on cells to identify trans interactors.
- [37].Zhang M and Varki A: Cell surface sialic acids do not affect primary CD22 interactions with CD45 and surface IgM nor the rate of constitutive CD22 endocytosis. Glycobiology 2004, 14:939–49. [DOI] [PubMed] [Google Scholar]
- [38].Johannes L, Jacob R and Leffler H: Galectins at a glance. J Cell Sci 2018, 131: [DOI] [PubMed] [Google Scholar]
- [39].Rabinovich GA and Toscano MA: Turning 'sweet' on immunity: galectin-glycan interactions in immune tolerance and inflammation. Nat Rev Immunol 2009, 9:338–52. [DOI] [PubMed] [Google Scholar]
- [40].Henderson NC, Mackinnon AC, Farnworth SL, Kipari T, Haslett C, Iredale JP, Liu FT, Hughes J and Sethi T: Galectin-3 expression and secretion links macrophages to the promotion of renal fibrosis. Am J Pathol 2008, 172:288–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Blazev R, Ashwood C, Abrahams JL, Chung LH, Francis D, Yang P, Watt KI, Qian H, Quaife-Ryan GA, Hudson JE, Gregorevic P, Thaysen-Andersen M and Parker BL: Integrated Glycoproteomics Identifies a Role of N-Glycosylation and Galectin-1 on Myogenesis and Muscle Development. Molecular & Cellular Proteomics 2021, 20: [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Rabinovich GA, Toscano MA, Jackson SS and Vasta GR: Functions of cell surface galectin-glycoprotein lattices. Curr Opin Struct Biol 2007, 17:513–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Lam SS, Martell JD, Kamer KJ, Deerinck TJ, Ellisman MH, Mootha VK and Ting AY: Directed evolution of APEX2 for electron microscopy and proximity labeling. Nat. Methods 2015, 12:51–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Liu F-T, Patterson RJ and Wang JL: Intracellular functions of galectins. Biochimica et Biophysica Acta 2002, 1572:263–273. [DOI] [PubMed] [Google Scholar]
- ** [45]. Joeh E, O'Leary T, Li W, Hawkins R, Hung JR, Parker CG and Huang ML: Mapping glycan-mediated galectin-3 interactions by live cell proximity labeling. Proc Natl Acad Sci U S A 2020, 117:27329–27338. This is the first example of proximity labeling used to identify galectin interactors. Here, a fusion protein of Galectin-3 and APEX2 was used to identify GBP counter-receptors when exogenously incubated and transiently overexpressed in hepatic stellate cells. A list of putative interactors, including undiscovered and known interactors, was generated by MS-based proteomics; however, functional validation was not performed.
- * [46]. Vilen Z, Joeh E, Critcher M, Parker CG and Huang ML: Proximity Tagging Identifies the Glycan-Mediated Glycoprotein Interactors of Galectin-1 in Muscle Stem Cells. ACS Chem Biol 2021, 16:1994–2003. In this work, a fusion protein of Galectin-1 and APEX2 was used to identify GBP counter receptors in myoblasts via MS-based proteomics. New and known interactors were identified and validated by ELISA and on-blot labeling, but biological function was not explored.
- [47].Klamer Z and Haab B: Combined Analysis of Multiple Glycan-Array Datasets: New Explorations of Protein-Glycan Interactions. Anal Chem 2021, 93:10925–10933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Haab BB and Klamer Z: Advances in Tools to Determine the Glycan-Binding Specificities of Lectins and Antibodies. Mol Cell Proteomics 2020, 19:224–232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].Grant OC, Tessier MB, Meche L, Mahal LK, Foley BL and Woods RJ: Combining 3D structure with glycan array data provides insight into the origin of glycan specificity. Glycobiology 2016, 26:772–783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [50].Hammarstrom S, Hammarstrom M-L, Sundblad G, Arnarp J and Lönngren J: Mitogenic leukoagglutinin from Phaseolus vulgaris binds to a pentasaccharide unit in N-acetyllactosamine-type glycoprotein glycans. Proc. Natl. Acad. Sci. U S A 1982, 79:1611–1615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ** [51]. Kearney CJ, Vervoort SJ, Ramsbottom KM, Todorovski I, Lelliott EJ, Zethoven M, Pijpers L, Martin BP, Semple T, Martelotto L, Trapani JA, Parish IA, Scott NE, Oliaro J and Johnstone RW: SUGAR-seq enables simultaneous detection of glycans, epitopes, and the transcriptome in single cells. Sci. Adv 2021, 7: Biotinylated PHA was used as a bait for proximity labeling to sort populations of T cells into high and low glycosylation groups. Protien expression and transcriptome data were also collected to compare these cell populations, which were shown to have distinct activity profiles. This paper brings proximity labeling for glycans into a new age with the ability to overlay techniques for multi-omics study.
- ** [52]. Liu Y, Nelson ZM, Reda A and Fehl C: Spatiotemporal Proximity Labeling Tools to Track GlcNAc Sugar-Modified Functional Protein Hubs during Cellular Signaling. ACS Chem Biol 2022, 17:2153–2164. In this work, GlcNAcylation was explored in the nucleus and cytosol using a fusion protein of the GlcNAc-binding lectin GafD and the biotin ligase TurboID. This approach was used to study changes in GlcNAc-ylation as a result of insulin stimulation and starvation. This was the first use of TurboID to probe GBP interactions, which advantageously could be performed in vivo compared to peroxidase-based labeling.
- [53].Branon TC, Bosch JA, Sanchez AD, Udeshi ND, Svinkina T, Carr SA, Feldman JL, Perrimon N and Ting AY: Efficient proximity labeling in living cells and organisms with TurboID. Nat Biotechnol 2018, 36:880–887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Cho KF, Branon TC, Udeshi ND, Myers SA, Carr SA and Ting AY: Proximity labeling in mammalian cells with TurboID and split-TurboID. Nat Protoc 2020, 15:3971–3999. [DOI] [PubMed] [Google Scholar]
- [55].Saarela S, Taira S, Nurmiaho-Lassila E-L, Makkonen A and Rhen M: The Escherichia coli G-Fimbrial Lectin Protein Participates Both in Fimbrial Biogenesis and in Recognition of the Receptor N-Acetyl-D-Glucosamine. Journal of Bacteriology 1995, 177:1477–1484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Xie Y, Sheng Y, Li Q, Ju S, Reyes J and Lebrilla CB: Determination of the glycoprotein specificity of lectins on cell membranes through oxidative proteomics. Chem Sci 2020, 11:9501–9512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].O'Leary TR, Critcher M, Stephenson TN, Yang X, Hassan AA, Bartfield NM, Hawkins R and Huang ML: Chemical editing of proteoglycan architecture. Nat Chem Biol 2022, 18:634–642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Gilormini PA, Batt AR, Pratt MR and Biot C: Asking more from metabolic oligosaccharide engineering. Chem Sci 2018, 9:7585–7595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Wu H and Kohler J: Photocrosslinking probes for capture of carbohydrate interactions. Curr Opin Chem Biol 2019, 53:173–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Li Q, Xie Y, Xu G and Lebrilla CB: Identification of potential sialic acid binding proteins on cell membranes by proximity chemical labeling. Chem Sci 2019, 10:6199–6209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ** [61]. Xie Y, Chen S, Li Q, Sheng Y, Alvarez MR, Reyes J, Xu G, Solakyildirim K and Lebrilla CB: Glycan-protein cross-linking mass spectrometry reveals sialic acid-mediated protein networks on cell surfaces. Chem Sci 2021, 12:8767–8777. A panel of lectins was functionalized with iron to oxidize proximal proteins upon incubation with hydrogen peroxide. This oxidation can be detected by MS to identify proteins with the glycan motifs preferentially bound by each lectin. They found that sialic acid-binding lectins and the fucose-binding lectins had higher specificity and sensitivity compared to other lectins and could map site-specific binding due to the short radius of this labeling technique.
- [62].Babulic JL and Capicciotti CJ: Exo-Enzymatic Cell-Surface Glycan Labeling for Capturing Glycan–Protein Interactions through Photo-Cross-Linking. Bioconjugate Chemistry 2022, 33:773–780. [DOI] [PubMed] [Google Scholar]
- [63].Yarravarapu N, Konada RSR, Darabedian N, Pedowitz NJ, Krishnamurthy SN, Pratt MR and Kohler JJ: Exo-Enzymatic Addition of Diazirine-Modified Sialic Acid to Cell Surfaces Enables Photocrosslinking of Glycoproteins. Bioconjugate Chemistry 2022, 33:781–787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Vilen Z, Reeves AE, O'Leary TR, Joeh E, Kamasawa N and Huang ML: Cell Surface Engineering Enables Surfaceome Profiling. ACS Chem Biol 2022, [DOI] [PMC free article] [PubMed] [Google Scholar]



