Skip to main content
Molecular & Cellular Proteomics : MCP logoLink to Molecular & Cellular Proteomics : MCP
. 2025 Aug 28;24(10):101064. doi: 10.1016/j.mcpro.2025.101064

Comprehensive Evaluation of Cleavable Bioorthogonal Probes for Site-Specific O-GlcNAc Proteomics

Chunyan Hou 1, Hemeng Zhang 1, Jingtao Deng 1, Xiaoxin Wang 2, Stephen Byers 1, Moshe Levi 2, Daniel TS Pak 3, Kelley W Moremen 4, Huadong Pei 1, Gerald W Hart 4, Junfeng Ma 1,
PMCID: PMC12506516  PMID: 40885482

Abstract

O-linked β-N-acetylglucosamine (O-GlcNAc) modification (i.e., O-GlcNAcylation) on proteins is an essential modification in physiology and pathology. Although O-GlcNAcylation is functionally critical, its analysis has been challenging. Despite the existence of a number of methods developed in the past years, which one(s) might have the best performance is largely unclear. To that end, we conducted a rigorous comparison of several cleavable bioorthogonal biotin-alkyne probes which showed promise for sensitive O-GlcNAc proteomics. In brief, we developed chemoenzymatic labeling/click chemistry-based analytical workflows for O-GlcNAc proteomics by utilizing four cleavable bioorthogonal probes, including photocleavabe-biotin-alkyne (PC-biotin-alkyne), dialkoxydiphenylsilane-biotin-alkyne (DADPS-biotin-alkyne); 1-(4,4-dimethyl-2,6-dioxocyclohex-1-ylidene)ethyl-biotin-alkyne (Dde-biotin-alkyne), and diazobenzene-biotin-alkyne (Diazo-biotin-alkyne). The analytical performance of these probes was evaluated with synthetic O-GlcNAc peptides and then benchmarked by using mouse brain lysates for O-GlcNAc proteomics. Besides providing valuable technical insights into O-GlcNAc proteomics methods, our work yielded an unprecedented O-GlcNAc proteome depth in the mouse brain. In total, 2906 O-GlcNAc sites were unambiguously assigned on 878 proteins. Among them, 1611 sites were newly identified, including 138 O-GlcNAcylated tyrosine residues. Our work will help guide the selection/development of O-GlcNAc proteomics methods for future studies, provide an invaluable resource for functional elucidation of protein O-GlcNAcylation in brain biology, and yield critical insights into tyrosine O-GlcNAcylation.

Keywords: O-GlcNAc, proteomics, PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, Diazo-biotin-alkyne, chemoenzymatic labeling

Graphical Abstract

graphic file with name ga1.jpg

Highlights

  • Head-to-head comparison of cleavable bioorthogonal probes for O-GlcNAc proteomics.

  • >2900 unambiguous O-GlcNAc sites on 878 proteins identified from mouse brain.

  • Tyrosine O-GlcNAcylation on 138 residues of 90 proteins found in mouse brain.

In Brief

O-linked β-N-acetylglucosamine (O-GlcNAc) modification (i.e., O-GlcNAcylation), despite its functional importance, suffers from severe technical challenges to analyze. We performed systematic evaluation of several cleavable bioorthogonal probes for site-specific O-GlcNAc proteomics. Our work provides not only valuable technical insights to O-GlcNAc proteomics methods, but also a comprehensive O-GlcNAc proteome dataset of mouse brain (including tyrosine O-GlcNAcylation).


O-linked β-N-acetylglucosamine (O-GlcNAc) addition (O-GlcNAcylation) is a fundamentally important post-translational modification on proteins (1, 2). Distinct from the traditional N-glycosylation and mucin-type O-glycosylation, O-GlcNAcylation almost exclusively modifies intracellular proteins (3). O-GlcNAcylation is predominantly present in the nucleus and cytoplasm and is also found in mitochondria (4, 5, 6). Besides the classic serine (Ser) and threonine (Thr) residues, we recently identified tyrosine (Tyr) as a new target for protein O-GlcNAcylation (7). O-GlcNAc addition is catalyzed by O-GlcNAc-transferase (OGT) and removed by O-GlcNAcase (OGA). To date, >20% of the human proteome (>4000 proteins) are known to be O-GlcNAc modified with at least one site (8, 9). O-GlcNAcylation has been found in many species, ranging from viruses and bacteria to plants and mammals (10). It is now clear that O-GlcNAcylation plays critical roles in virtually all biochemical processes examined (including transcription, translation, the cell cycle, metabolism and signaling) (11, 12, 13, 14, 15, 16, 17).

The brain is one of the tissues that shows the highest expression of OGT/OGA and O-GlcNAcylation levels (10, 18, 19). Neuron-specific deletion of OGT leads to neonatal lethality, likely due to abnormal neuronal development (20). O-GlcNAcylation is directly linked to a broad range of brain functions, including learning, memory, satiety, movement, and others (21, 22, 23). Protein O-GlcNAcylation is not only important for brain physiology but is also associated with several neurodegenerative disorders (e.g., Alzheimer’s disease and Parkinson’s disease) (24, 25). For example, the presence of O-GlcNAcylation on proteins, such as microtubule-associated protein Tau and β-amyloid precursor protein APP, in brain samples were reported shortly after the discovery of protein O-GlcNAcylation (26, 27). In addition, overall compromised O-GlcNAcylation levels were observed in brain and pluripotent stem cell-derived neurons from patients with Alzheimer’s disease (28, 29). Increased O-GlcNAcylation on Tau and α-synuclein (α-Syn) can also influence the formation of toxic aggregates which are associated with neurodegeneration (30, 31, 32, 33). Thus, modulating O-GlcNAc signaling may represent a promising neuroprotective strategy to treat neurodegenerative diseases. To date, a number of inhibitors targeting OGA have been tested in clinical trials (34, 35, 36, 37).

Site-specific O-GlcNAc characterization is instrumental in the dissection of O-GlcNAc signaling and its functional consequences in different biomedical scenarios. Significant progress has been made with technological advances in high-throughput methods, especially tandem mass spectrometry (MS/MS)-based proteomics in recent years (38, 39, 40, 41). However, large-scale O-GlcNAc site mapping remains a daunting challenge—it is still far from routine to identify a few thousand or even several hundred O-GlcNAc proteins/sites in individual studies. Given that the substoichiometric nature of O-GlcNAcylation appears to be a key hurdle, selective and sensitive enrichment for O-GlcNAc proteins/peptides is indispensable. By leveraging the unique biochemical properties of O-GlcNAc, two categories of enrichment materials/methods have been developed: one is affinity enrichment, exemplified by using antibodies (42, 43), lectins (44, 45, 46, 47, 48, 49, 50, 51), and hydrophilic interaction chromatography (52, 53); another is chemical/biochemical derivatization, e.g., alkaline β-elimination followed by Michael addition (54, 55, 56), and chemoenzymatic-/metabolic-labeling followed by click chemistry (57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73). Although affinity-based enrichment materials/methods provide a convenient tool, their affinity and specificity towards the monosaccharide O-GlcNAc moiety have been notoriously weak, rendering poor enrichment efficiency (generally <5%) (74). In contrast, (bio)chemical derivatization serves as a promising alternative. The metabolic labeling approach (e.g., by using azido-derived analogs such as N-azidoacetylgalactosamine-tetraacylated (Ac4GalNAz) (75, 76, 77), utilizing the UDP-GlcNAc salvage pathways in cells) simplifies the enrichment workflow somewhat, but it is not applicable to tissue samples. Chemoenzymatic labeling, in conjunction with click chemistry, particularly by using cleavable biotin-alkyne probes, represents a promising enrichment method for sensitive O-GlcNAc proteomics, regardless of sample sources. A series of cleavable biotin-alkyne probes have been developed and applied for O-GlcNAc proteomics, including photocleavabe-biotin-alkyne (PC-biotin-alkyne) (7, 58, 60, 61, 65, 68, 70, 71), 1-(4,4-dimethyl-2,6-dioxocyclohex-1-ylidene)ethyl-biotin-alkyne (Dde-biotin-alkyne) (62), dialkoxydiphenylsilane-biotin-alkyne (DADPS-biotin-alkyne) (64, 71), disulfide-biotin-dibenzocyclooctyne (Disulfide Biotin DBCO) (59), and diazobenzene-biotin-alkyne (Diazo-biotin-alkyne) (72, 73). Although there are multiple cleavable biotin-alkyne probes, their performance for O-GlcNAc proteomics has not been investigated systematically.

The goal of the present study was to perform a rigorous comparison of several cleavable biotin-alkyne probes (including PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, and Diazo-biotin-alkyne) for chemoenzymatic labeling-based O-GlcNAc proteomics in a site-specific manner. After being evaluated with synthetic O-GlcNAc peptides, the probes were benchmarked for O-GlcNAc proteomics of mouse brain lysates. In total, 2906 O-GlcNAc sites were unambiguously identified, representing the biggest dataset of any biological samples (including brain) characterized in one study to date. Among the 1611 sites newly reported, 138 tyrosine sites were O-GlcNAcylated, suggesting that endogenous Tyr O-GlcNAcylation is widespread in murine brain (even without external stimulation). Collectively, our work not only provides technical insights into the O-GlcNAc proteomics methods but also serves as a comprehensive resource for site-specific functional studies of protein O-GlcNAcylation in brain physiology.

Experimental Procedures

Experimental Design and Rationale

The lack of sensitive and robust methods for O-GlcNAc proteomics has been a major hurdle for many years, largely due to the limited efficiency of enrichment methods. Despite the recent emergence of a panel of methods, their performance was almost exclusively illustrated by the analysis of selected specific samples. We reasoned that a systematic evaluation of current methods would facilitate further advances of O-GlcNAc proteomics and its biomedical applications.

Recently, we performed a head-to-head comparison of several affinity-based enrichment materials/methods (including the OGA mutant CpOGAD298N, lectin AANL6, and the PTMScan O-GlcNAc [GlcNAc-S/T] motif kit) for O-GlcNAc proteomics of human pancreatic cancer-derived PANC-1 cells (74). Moreover, we prepared nitro-oxide-grafted nanospheres with enhanced hydrogen bonding interaction for O-GlcNAc proteomics (53), yielding substantially more O-GlcNAc sites from the similar cell lysate samples after analysis with the similar set of nanoUPLC-MS/MS instruments and experimental settings (although this method was not directly included in the head-to-head comparison study). In addition, built upon enrichment by incorporating chemoenzymatic labeling with PC-biotin-alkyne-based click chemistry, we developed an integrated workflow and benchmarked its performance for ultradeep O-GlcNAc proteomics of PANC-1 cells (7). Our data showed that, in comparison to these affinity-based enrichment materials/methods, chemoenzymatic labeling followed by PC-biotin-alkyne-based click chemistry has some striking advantages, such as the use of much lower amounts of starting material, much higher enrichment selectivity, and more O-GlcNAc sites/proteins identified (Supplemental Table S1). Prompted by these findings, we proposed to further evaluate the performance of the chemoenzymatic labeling/click chemistry-based method. Specifically, we set out to compare four cleavable biotin-alkyne probes (including PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, and Diazo-biotin-alkyne), which are commercially available and have been exploited for O-GlcNAc proteomics. Given that different cleavable biotin-alkyne probes might have different performance, we aimed to conduct a head-to-head comparison of the four cleavable biotin-alkyne reagents for site-specific O-GlcNAc proteomics. We first set up the methods using synthetic O-GlcNAc peptides, then we compared the analytical performance of the four cleavable biotin-alkyne reagents for O-GlcNAc proteomics by analyzing mouse brain samples.

Labeling of Standard O-GlcNAc Peptides

Standard peptides TAPTS(O-GlcNAc)TIAPG (in which S5 is O-GlcNAcylated) and YSPT(O-GlcNAc)SPSK (in which T4 is O-GlcNAcylated) were custom-synthesized by AnaSpec, Inc. Another peptide AGY(O-GlcNAc)SQGATQYTQAQQTR (in which Y3 is O-GlcNAcylated) was synthesized in our lab, as described previously (7). A mixture of the three O-GlcNAcylated peptides was prepared in 50 μl of 20 mM HEPES buffer (pH 7.9). Subsequently, 11 μl of 100 mM MnCl2, 12.5 μl of 0.5 mM uridine 5′-diphospho-N-acetylazidogalactosamine disodium salt (UDP-GalNAz; BioChemSyn), and 4 μg of recombinant GFP-GalT1 (Y289L, prepared by following the procedure described previously (78)) were added. The mixture was incubated at 4 °C overnight, followed by desalting using a C18 spin column (BioPureSPN Mini, The Nest Group). The peptides were dried, dissolved in 275 μl of 50 mM HEPES buffer (pH 7.9), and split into equal parts for copper-catalysed azide–alkyne cycloaddition (CuAAC) with the four cleavable biotin alkynes (Cat# CCT-1454, Vector Laboratories) in parallel. Briefly, 1 μl of 10 mM PC-, DADPS-, Dde-, or Diazo-biotin-alkyne, 3 μl of 10 mM CuSO4-BTTAA (1:2, measured as CuSO4; Vector Laboratories), and 5 μl of 50 mM sodium ascorbate were added in order. The mixture was incubated at room temperature for 2 h. Next, 0.5 ml of PBS buffer was added along with 120 μl of high-capacity NeutrAvidin agarose slurry (Thermo Fisher Scientific) and incubated for 3 h, followed by sequential washing with cold PBS, water, 20% methanol, and 70% methanol. The enriched peptides conjugated with PC-biotin alkyne were released in 400 μl of 0.1% formic acid (FA) by irradiation at 365 nm (UVP XX-15L UV Bench Lamp, Analytik Jena) for 1 h. The peptides conjugated with DADPS-, Dde-, and Diazo-biotin-alkynes were released by incubation in 400 μl of 10% FA, 2% hydrazine, and freshly prepared 25 mM Na2S2O4 aqueous solution for 1 h, respectively (by referring to previous publications (62, 64, 73)). The released peptides were then cleaned up using a C18 spin column, dried, and kept at −20 °C before nanoUPLC-MS/MS analysis.

Protein Extraction From Mouse Brain

Mice were obtained from Jackson Laboratories and maintained according to the National Institute of Health Guide for Care and Use of Laboratory Animals, with the approval by the Institutional Animal Care and Use Committee of Georgetown University (with the protocol # 2017-0059), as described previously (79). In brief, C57BL/6 wild-type mice were raised in an animal care facility with 12/12-h light-dark cycle, with access to a regular chow diet and water ad libitum. Three-month old mice were euthanized by carbon dioxide inhalation, and the whole brain excised. Mouse brain was washed with cold PBS buffer before being cut into small pieces on ice. Subsequently, 0.5 ml of a lysis buffer (containing 5% sodium dodecyl sulfate (SDS), 50 mM triethylammonium bicarbonate (TEAB), 1 × cOmplete EDTA-free protease inhibitor cocktail, 75 mM NaCl, 1 mM EDTA and 50 μM PUGNAc (MilliporeSigma)) was added. Tissue samples were then homogenized and sonicated on ice. Next, 5 μl of 0.1 M MgCl2 and 5 μl of Benzonase nuclease (MilliporeSigma) were added followed by an incubation at room temperature for 20 min. The lysate was cleared by centrifugation at 14,000g at 4 °C for 20 min. The concentration of proteins in the supernatant was determined by BCA assay.

Enrichment of O-GlcNAc Peptides From Mouse Brain Extracts

Four milligrams of proteins extracted from the mouse brain were used for each enrichment with the biotin alkynes, using a procedure similar to what we described previously (7). Briefly, proteins were first reduced with 20 mM 1,4-dithiothreitol (DTT), then alkylated with 40 mM iodoacetamide (IAA). After methanol/chloroform precipitation, proteins were resuspended in 400 μl of 20 mM HEPES buffer (pH 7.9) containing 1% SDS. Next, 490 μl of water, 800 μl of labeling buffer (125 mM NaCl, 5% NP-40, 50 mM HEPES, pH 7.9), 105 μl of 100 mM MnCl2, 100 μl of 0.5 mM UDP-GalNAz, 10 μg GFP-GalT1 (Y289L), 0.5 μl of recombinant PNGase F (VWR), and 1 μl of thermosensitive alkaline phosphatase (TSAP, Cat# M9910, Promega) were added and incubated at 4 °C overnight. Then an additional 0.5 μl of TSAP was added, followed by incubation at 37 °C for two more hours. The proteins were precipitated again and resuspended in 1 ml of HEPES buffer (pH 7.9) containing 0.5% SDS. To the solution, 11 μl of 10 mM PC-, DADPS-, Dde-, or Diazo-biotin-alkyne, 33 μl of 10 mM CuSO4-BTTAA (1:2), and 55 μl of 50 mM sodium ascorbate were added and incubated at room temperature for 2 h. After precipitation, the proteins were dissolved in 0.8 ml of 50 mM TEAB buffer (pH 8) containing 8 M urea, diluted by 50 mM TEAB buffer, and digested with trypsin (w/w = ∼30:1) at 37 °C overnight. O-GlcNAc peptides in the digests were enriched and released following the same protocol described in the previous section for the labeling of standard O-GlcNAc peptides.

nanoUPLC-MS/MS

The nanoUPLC-MS/MS analysis was performed on a nanoAcquity UPLC system (Waters) coupled to an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher) with similar settings described previously (7) with slight modifications. Dried samples were dissolved in 0.1% FA and loaded onto a C18 Trap column (Waters Acquity UPLC M-Class Trap, Symmetry C18, 100 Å, 5 μm, 180 μm × 20 mm) at 10 μl/min for 4 min. Separation was carried out on an analytical column (Waters Acquity UPLC M-Class, Peptide BEH C18, 300 Å, 1.7 μm, 75 μm × 150 mm) at a column temperature of 45 °C and a flow rate of 400 nl/min, using 60- and 300-min gradients for standard peptides and mouse brain samples, respectively. The 60-min gradient of buffer A (2% ACN, 0.1% formic acid) and buffer B (0.1% formic acid in ACN) was as follows: 1% buffer B at 0 min, 5% buffer B at 1 min, 30% buffer B at 40 min, 50% buffer B at 50 min, 90% buffer B at 52 min, 90% buffer B at 60 min. The 300-min gradient was 1% buffer B at 0 min, 5% buffer B at 1 min, 30% buffer B at 210 min, 45% buffer B at 260 min, 90% buffer B at 270 min, 90% buffer B at 290 min, 1% buffer B at 290.1 min, and 1% buffer B at 300 min. The MS data were acquired in data-dependent acquisition (DDA) mode by an Orbitrap Fusion Lumos mass spectrometer with an ion spray voltage of 2.4 kV and an ion transfer temperature of 275 °C. The mass spectra were recorded with Xcalibur 4.7. The MS parameters were set as below: Detector Type: Orbitrap; Orbitrap Resolution: 120,000; Scan Range: 350-1800 m/z; RF Lens: 30%; AGC Target: Standard; Maximum Injection Time Mode: Auto; Microscans: 1. Charge state: 2 to 8; Cycle Time: 3 s.

EThcD without dynamic exclusion was used for standard peptides, whereas HCD product-dependent EThcD (HCD-pd-EThcD) with a dynamic exclusion duration of 40 s was applied for MS/MS acquisition of mouse brain samples. In HCD-pd-EThcD mode, EThcD was triggered by the oxonium ions of HexNAc (m/z 126.055, 138.055, 144.066, 168.065, 186.076, and 204.086), as well as the major fragments resulting from the tags—m/z 300.130 and 503.210 for both PC- and Dde-biotin-alkyne, m/z 329.146 and 532.225 for DADPS-biotin-alkyne, and m/z 507.220 and 710.299 for Diazo-biotin-alkyne—observed in HCD scans. MS/MS parameters were set as below: Isolation Mode: Quadrupole; Isolation Window: 1.6 m/z; Stepped collision energy of 22.5, 30, 37.5% for HCD was used. Detector Type: Orbitrap; Resolution: 30,000; Normalized AGC Target: 200%. Supplemental activation (SA) collision energy of EThcD was set as 30%.

Data Analysis

Mass spectrometric data files for synthetic peptides were processed with Proteome Discoverer (PD, version 2.4; Thermo Fisher Scientific) with Sequest HT, as described previously (53, 74, 80). Proteome Discoverer (PD, version 2.4; Thermo Fisher Scientific) with Sequest HT and MaxQuant (MQ, version 2.3.0.0, Max Planck Institute of Biochemistry) were used to process the raw data files of mouse brain samples using similar parameters as those described previously (7). UniProt Mus musculus database (TaxID: 10090, downloaded on February 2, 2023; 17137 sequences) was used for database searching. Raw files of each biotin alkyne were searched as a single batch. The false-discovery rate (FDR) was determined by using a target-decoy search strategy. The decoy-sequence database contains each sequence in reverse orientation, enabling FDR estimation. In PD, strict and relaxed target FDRs for highly and medium confident peptide/protein hits were lower than 1% and 5%, respectively. In MQ, FDR was set at 1% for both PSM and protein. Full digestion mode was selected for trypsin at Lys and Arg, allowing a maximum of two missed cleavages. Mass additions of 709.292 Da and 531.217 Da on Ser/Thr/Tyr/Asn were searched for Diazo- and DADPS-treated samples, respectively, with 502.202 Da applied to both Dde- and PC-treated samples. As many as five modifications were allowed per peptide. Other settings for PD and MQ were kept as the default unless otherwise specified. The precursor mass tolerance was set at 10 ppm, whereas the fragment-mass tolerance was set at 0.02 Da. Carbamidomethylation of cysteines (+57.0215 Da) was set as a fixed modification, and variable modifications of deamidation (N, +0.984 Da), methionine oxidation (M, +15.9949 Da), acetyl (N-terminus, +42.011 Da), and loss (N-terminus M, −131.040 Da), or loss + acylation (N-terminus M, −89.030 Da) were allowed. The modification sites identified by Sequest HT were further filtered by IMP-ptmRS with a site probability ≥0.75. When using MQ, carbamidomethylation of cysteines was set as a fixed modification, and the variable modifications included methionine oxidation, asparagine deamidation, and acetyl (N-terminus). A localization probability threshold of 0.75 was used to distinguish unambiguous and ambiguous sites. The raw files of standard peptide samples were searched against a customized FASTA file containing the corresponding peptide sequences without cleavage using PD with a fixed value PSM validator (delta Cn, which is the normalized score difference between the currently selected PSM and the highest-scoring PSM for that spectrum, better than or equal to 0.05). The MS level chromatograms of labeled standard peptides were extracted at the theoretical m/z with a mass tolerance of 5 ppm using Thermo Scientific FreeStyle 1.8 SP2.

Statistics

Labeling of standard O-GlcNAc peptides was performed in triplicate for each biotin alkyne reagent, with labeled intensity quantified at the MS1 level based on peak area. Mouse brain lysates pooled from six mice were used for four enrichments per biotin alkyne reagent, and each sample was analyzed in duplicate by nanoUPLC-MS/MS. All eight runs per biotin alkyne reagent were searched as a single batch. Statistical mean comparisons between box blots were assessed using a t test.

Results

Development of Workflows for O-GlcNAc Peptides

Three standard O-GlcNAc peptides, including YSPT(O-GlcNAc)SPSK, TAPTS(O-GlcNAc)TIAPG, and AGY(O-GlcNAc)SQGATQYTQAQQTR, were used to develop the workflows and assess their performance for O-GlcNAc analysis (Fig. 1A). In brief, peptides were subjected to GalT1(Y289L)-mediated chemoenzymatic labeling and then click chemistry with four cleavable biotin-alkyne probes, including PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, and Diazo-biotin-alkyne (Fig. 1B). Biotin-tagged O-GlcNAc peptides were captured on neutravidin beads and then released in specific cleavage conditions. The released peptides were then analyzed by nanoUPLC-MS/MS in electron-transfer/higher-energy collision dissociation (EThcD) mode.

Fig. 1.

Fig. 1

Analysis of O-GlcNAc peptides by using four cleavable biotin-alkyne probes.A, schematic diagram for O-GlcNAc analysis of peptides, which involves GalT1(Y289L)-mediated chemoenzymatic labeling, click chemistry with cleavable biotin-alkyne probes, capture with neutravidin beads, and release of tagged O-GlcNAc peptides. The released peptides are then subjected to tandem mass spectrometry in EThcD mode, with the data files analyzed by Proteome Discoverer. B, structures of four cleavable biotin-alkyne probes, i.e., PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, and Diazo-biotin-alkyne. Cleavage sites are indicated in dashed green lines.

Figure 2 shows the EThcD MS/MS spectra of the tagged O-GlcNAcylated peptide having the sequence AGY(O-GlcNAc)SQGATQYTQAQQTR in which Y3 is O-GlcNAcylated, after enrichment using each of the four cleavable biotin-alkyne probes. As seen, abundant fragmentation ions (particularly c3+, c4+, z14+, and z15+) clearly annotate the O-GlcNAcylated peptide and the modified Tyr residue. Moreover, dominant fragmentation at the glycosidic linkage can be observed, as illustrated by the presence of diagnostic ion pairs of the tags (at m/z 300.130 and 503.210 for both PC- and Dde-biotin-alkyne, m/z 329.146 and 532.225 for DADPS-biotin-alkyne, and m/z 507.220 and 710.299 for Diazo-biotin-alkyne), as expected. A similar fragmentation pattern was observed in the EThcD MS/MS spectra for the other two tagged O-GlcNAcylated peptides (supplemental Fig. S1). These data suggest that all the biotin-alkyne probes worked well not only for the Ser/Thr O-GlcNAcylated peptides but also for the Tyr O-GlcNAcylated peptide. This result is in agreement with our previous finding that the chemoenzymatic labeling/click chemistry approach can enrich Tyr O-GlcNAcylated peptides in an unbiased manner (for both synthetic peptides and peptides from complex samples) (7).

Fig. 2.

Fig. 2

Representative mass spectra of synthetic O-GlcNAc peptides after enrichment, with tyrosine O-GlcNAcylated peptide ‘AGY(O-GlcNAc)SQGATQYTQAQQTR’ as an example (in which Y3 is O-GlcNAcylated). Of note, major fragments of the tags, i.e., m/z 300.130 and 503.210 for both PC- and Dde-biotin-alkyne, m/z 329.146 and 532.225 for DADPS-biotin-alkyne, and m/z 507.220 and 710.299 for Diazo-biotin-alkyne, are highlighted in green. O-GlcNAc site in peptide sequence is shown in red.

We next compared the intensities of tagged O-GlcNAc peptides after enrichment using each biotin-alkyne probe (supplemental Fig. S2). Interestingly, in comparison to Dde-biotin-alkyne, the other three probes (i.e., PC-biotin-alkyne, DADPS-biotin-alkyne, and Diazo-biotin-alkyne) provided higher intensities of the tagged O-GlcNAc peptides. The relatively poor performance of the Dde-biotin-alkyne probe might be due to its relative instability during sample preparation (62, 81). Of note, each O-GlcNAc peptide showed different intensities. For example, compared with PC-biotin-alkyne, DADPS-biotin-alkyne yielded significantly higher signals for Ser/Thr O-GlcNAc peptides but similar abundances for the Tyr O-GlcNAc peptide. These data suggest that the biotin-alkyne probes can unbiasedly enrich all three types of O-GlcNAc peptides. In addition, the excellent reproducibility of technical triplicates demonstrates the reliability of the analytical workflows.

Comparison of Different Probes for O-GlcNAc Proteomics of Mouse Brain Samples

We then set out to further evaluate the performance of the cleavable bio-orthogonal probes-based methods by using complex samples. Given the relatively high O-GlcNAcylation levels and the critical physiological/pathological importance, brain samples were used to benchmark the application of the four cleavable bio-orthogonal probes for O-GlcNAc proteomics. Figure 3 illustrates the workflow, which is built on those developed for O-GlcNAc peptide samples, for brain O-GlcNAc proteomics. In brief, proteins extracted from mouse brain were subjected to GalT1(Y289L)-mediated chemoenzymatic labeling, click chemistry with cleavable biotin-alkyne probes, protein digestion with trypsin, capture with neutravidin beads, and release of tagged O-GlcNAc peptides. The released peptides were analyzed by nanoUPLC-MS/MS using 300-min gradients and HCD-pd-EThcD fragmentation, in which EThcD was triggered by the oxonium ions of HexNAc (m/z 126.055, 138.055, 144.066, 168.065, 186.076, and 204.086) and the major fragments of the tags (i.e., m/z 300.130 and 503.210 for both PC- and Dde-biotin-alkyne, m/z 329.146 and 532.225 for DADPS-biotin-alkyne, and m/z 507.220 and 710.299 for Diazo-biotin-alkyne). Four replicates were performed for each probe. The resulting data files were then processed by Proteome Discoverer.

Fig. 3.

Fig. 3

A schematic diagram of O-GlcNAc proteomics of mouse brain samples by using four cleavable biotin-alkyne probes. In this workflow, proteins extracted from mouse brain are subjected to GalT1(Y289L)-mediated chemoenzymatic labeling, click chemistry with cleavable biotin-alkyne probes, protein digestion with trypsin, capture with neutravidin beads, and release of tagged O-GlcNAc peptides. The released peptides are then analyzed by tandem mass spectrometry in HCD-pd-EThcD mode, with the data files analyzed by Proteome Discoverer or MaxQuant. (Four replicates were performed for each probe and the resulting peptides were injected twice for technical duplicates).

In total, more than 3 × 106 MS/MS spectra were acquired, resulting in 1963 unambiguous O-GlcNAc sites from 740 O-GlcNAc proteins. A complete list of all the identified O-GlcNAc peptides, sites (including confidence scores, residues localized, and related PSMs), and proteins with unambiguous sites is shown in Supplemental Data S1. Among the cleavable biotin-alkyne probes, PC and DADPS gave comparable numbers of O-GlcNAc peptide to spectrum matches (PSMs), modified peptide sequences, unambiguous O-GlcNAc sites, and O-GlcNAc proteins with unambiguous sites (Table 1). These numbers are twice those of Dde and 4.5- to 5-fold those of Diazo. Our data suggest that PC and DADPS outperform Dde and Diazo for O-GlcNAc proteomics in general.

Table 1.

Identifications of PSMs, unique O-GlcNAc peptides, unambiguous sites, and proteins in each dataset by PD

Identification PC DADPS Dde Diazo
MS/MS 813,230 792,185 733,768 690,778
Modified PSMs 37,609 42,048 18,746 8451
Success rate (%) 4.62 5.31 2.55 1.22
Modified PSMs activated by HCD 20,330 24,403 11,516 6186
Modified PSMs activated by EThcD 17,279T 17,645 7230 2265
HCD/EThcD ratio 1.18 1.38 1.59 2.73
Modified peptide sequences 1395 1426 878 356
Unambiguous sites 1155 1084 650 201
Proteins with unambiguous sites 566 538 308 115

All data reported is the combination of two injections from four replicates.

We next investigated the physiochemical properties of O-GlcNAc peptides identified from each biotin-alkyne probe (Fig. 4). Although a similar number of O-GlcNAc peptides (1395 versus 1426) was identified from PC and DADPS, the DADPS approach yielded far more non-modified peptides (Fig. 4A). We reasoned that the stronger hydrophobicity of the DADPS-biotin-alkyne probe might lead to non-specific binding of peptides during enrichment. An enrichment specificity of 47% was achieved for the PC approach, which is consistent with our previous work (7). Clearly, even though the four biotin-alkyne probes undergo the same processing procedures, a varied enrichment specificity (ranging from ∼22% for DADPS/Dde to ∼47% for PC) is obtained. Regarding identification reproducibility, 23% of O-GlcNAc peptides were identified by the PC approach in all eight runs, while 33% of O-GlcNAc peptides were identified by the DADPS approach (Fig. 4B). About 56% and 65% of the O-GlcNAc peptides were identified in at least 4 runs from the PC approach and the DADPS approach, respectively. A majority of O-GlcNAc peptides (ranging from 75% for Diazo to 85% for DADPS) were identified in at least 2 runs for all the approaches. These data further demonstrate an excellent reproducibility of the analytical workflows even for complex samples, in which DADPS and PC yield the highest reproducibility. An average of seven and eight PSMs per O-GlcNAc peptide were obtained from the PC approach and the DADPS approach, respectively (Fig. 4C). In contrast, five PSMs per O-GlcNAc peptide were obtained from the Dde approach and the Diazo approach. Although the PC approach and the DADPS approach contained comparative numbers of amino acids, the Dde approach and the Diazo approach yield longer peptides (up to 36 amino acids for Diazo) (Fig. 4D). Of note, >90% of the O-GlcNAc peptides identified from all the biotin-alkyne probes carry 3 to 5 positive charges (Fig. 4E). However, the Diazo approach produced a substantially higher percentage of O-GlcNAc peptides carrying more than or equal to five positive charges, which might be ascribed to the longer peptides on average. Interestingly, among O-GlcNAc peptides carrying multiple positive charges, PC yielded a slightly higher portion of peptides with five positive charges and a lower portion of peptides with three positive charges than those of DADPS (Fig. 4E). The slightly increased numbers of peptides carrying high positive charges could be a result of the additional positive charge of an amine group on the tag after UV cleavage. Furthermore, we investigated the overlap of O-GlcNAc peptides across four cleavable biotin probes (supplemental Fig. S3). Up to 84% (299/356) of the O-GlcNAc peptides identified from the Diazo approach were also identified by the other three probes. We reasoned that the non-overlapped peptides resulted from other probes might lead to a significant shift in the peptide length distribution. Thus, the length obtained with other probes become shorter than that of Diazo and that of the median of overlapped peptides. Lastly, we performed sequence motif analysis of all O-GlcNAc peptides identified from each probe, with largely similar patterns observed (supplemental Fig. S4). This data suggests that the four probes have comparable selectivity toward O-GlcNAc peptides/sites.

Fig. 4.

Fig. 4

O-GlcNAc peptides identified from mouse brain by using four cleavable biotin-alkyne probes-based O-GlcNAc proteomics.A, enrichment specificity of O-GlcNAc peptides. Comparison is performed in terms of stripped peptide sequence numbers with and without O-GlcNAcylation. B, identification frequency of O-GlcNAc peptides across eight runs for each cleavable biotin probe. C, distribution of total PSMs per O-GlcNAc peptide in all runs (outliers omitted). D, length distribution of O-GlcNAc peptides runs (outliers omitted). E, distribution of charge states of O-GlcNAc precursors. Statistical significance was determined using a t test. ∗∗p ≤ 0.01; ∗∗∗∗p ≤ 0.0001.

Notably, despite a comparable performance as PC and DADPS for synthetic O-GlcNAc peptides, the Diazo approach yielded fewer identifications for O-GlcNAc proteomics in brain samples, which might be due to the relatively low cleavage efficiency of diazo probes by Na2S2O4 as reported previously (81, 82). We then compared unambiguous O-GlcNAc sites identified using different biotin alkynes. As shown in Figure 5A, 1155 O-GlcNAc sites were unambiguously identified from the PC-biotin-alkyne method. Among them, 500 O-GlcNAc sites were not identified by any other biotin alkynes. The DADPS approach yielded 1084 O-GlcNAc sites, slightly less than that of PC but significantly more than that of Dde and DIAZO (which identified 650 and 201 sites, respectively). These results demonstrate that PC and DADPS are superior to Dde and Diazo for the identification of unambiguous O-GlcNAc sites. Consistent with the modified peptides identified, approximately one-fourth of the O-GlcNAc sites overlap between PC and DADPS. The representative mass spectra of a Tyr O-GlcNAcylated peptide identified using both PC- and DADPS-biotin-alkyne are shown in Figure 5B. The peptide ‘333HMLGEDDYTRPPEPVYSTVNK353’ from the protein discs large homolog 2 (Dlg2, also known as PSD-93/Chapsyn-110) is O-GlcNAcylated at Y340, with mass additions of 502.202 Da and 531.217 Da observed after tagging with PC and DADPS, respectively. The major fragments produced from these tags, together with the oxonium ions of the O-GlcNAc moiety (not highlighted due to space limitation), show that the peptide is O-GlcNAcylated. The fragment ions (particularly c8+, c9+, z13+, y13+, y14+) show that O-GlcNAc modifies Y340 rather than its neighboring residue T341 (Fig. 5B). Of note, O-GlcNAcylated Tyr sites have been detected using all biotin alkynes, with a ratio ranging from 3% to 7% among all the O-GlcNAcylated sites (supplemental Fig. S5).

Fig. 5.

Fig. 5

O-GlcNAc sites identified from mouse brain, with representative mass spectra of Tyr O-GlcNAcylation illustrated.A, Overlap of unambiguous O-GlcNAc sites identified from mouse brain by using four cleavable biotin-alkyne probes-based O-GlcNAc proteomics. B, representative mass spectra of a Tyr O-GlcNAcylated peptide 333HMLGEDDYTRPPEPVYSTVNK353 of protein discs large homolog 2 (Dlg2) identified by the PC-biotin-alkyne approach (top) and the DADPS-biotin-alkyne approach (bottom). Major fragments of the tags are highlighted in green. O-GlcNAc site in peptide sequence is shown in red.

The overlap of proteins corresponding to the O-GlcNAcylated sites identified across the four biotin alkynes is presented in Supplemental Fig. S6. Most proteins (61% of the 740 identified) were detected with more than one biotin alkyne, indicating that the identified O-GlcNAc modifications are concentrated in a subset of proteins of the entire mouse brain proteome.

Construction of a Comprehensive Dataset of Brain O-GlcNAc Proteomics

Considering that each search engine has its own distinct algorithm for peptide matching and modification localization (83), we set out to use different search engines to further improve O-GlcNAc proteome coverage. Besides PD, MaxQuant (MQ) was used to process the raw files of O-GlcNAc proteomics from all biotin alkyne probes, as we described previously (7). A list of the detailed information of unambiguous O-GlcNAc sites, including the fragmentation modes of best localized, best score, and best PEP scans, is provided in Supplemental Data S2. As shown in Figure 6A, besides 1188 O-GlcNAc sites unambiguously identified by both approaches, MaxQuant identified 775 additional O-GlcNAc sites. Supplemental Fig. S7 shows the representative mass spectrum of a Tyr O-GlcNAcylated peptide identified from MQ. As seen from Supplemental Fig. S7, major fragments of the tag (i.e., m/z 329.145 and 532.224) and key fragment ions (including c7+, c8+, and c9+) demonstrate the O-GlcNAcylation on Y340 of the peptide ‘333HMLGEDDYTRPPEPVYSTVNK353’ from the protein discs large homolog 2 (Dlg2) after profiling with the DADPS-biotin-alkyne-based enrichment method followed by MQ analysis.

Fig. 6.

Fig. 6

A comprehensive dataset of unambiguous O-GlcNAc sites from mouse brain.A, O-GlcNAc sites identified from PD and MQ. B, comparison with unambiguous O-GlcNAc sites reported in mouse previously. C, distribution of O-GlcNAcylated residues. D, O-GlcNAc motif analysis. E, conservation of O-GlcNAcylated Tyr sites between human and mouse. Protein sequences were aligned using the Clustal Omega tool (UniProt). Symbols indicate conservation levels: “∗” fully conserved residues, “:” conservation among residues with strongly similar properties (Gonnet PAM 250 score >0.5), “.” conservation among residues with weakly similar properties (Gonnet PAM 250 score ≤0.5), and “ ” non-conserved residues. O-GlcNAc site in peptide sequence is shown in red. F, overlap of O-GlcNAcylated Tyr sites and phosphorylated Tyr sites identified from mouse brain and from other samples.

The combination of PD and MQ yielded a list of 2906 O-GlcNAc sites unambiguously assigned with high confidence (localization probability ≥75%), with details shown in Supplemental Data S3. Of note, more than 99% of the O-GlcNAc sites identified using PD had a localization probability ≥90%, and the majority of O-GlcNAc sites from MQ also exceeded this threshold (supplemental Fig. S8), suggesting high stringency of the overall localization confidence. Among all the O-GlcNAc sites unambiguously mapped, 1611 O-GlcNAc sites (55%) were newly identified, according to O-GlcNAcAtlas 4.0, which contains 4766 unambiguous O-GlcNAc sites on proteins from all types of mouse samples studied previously (Fig. 6B) (9). In addition to O-GlcNAcylation on many Ser/Thr sites, 138 Tyr sites were found to be O-GlcNAcylated (corresponding to 96 proteins). Among all the Tyr O-GlcNAcylation sites, 42 were identified by both PD and MQ. In addition, 40 sites and 56 sites were uniquely assigned by PD and MQ, respectively. The ratio of all O-GlcNAcylated Ser, Thr, and Tyr residues is approximately 50.4%:44.9%:4.7% (Fig. 6C), which is similar to our previous findings from human cells (1).

Figure 6D shows the flanking amino acids around the target residues. A pattern P-P-V-S/T-S-S/A appears to be dominant, similar to those of modified Ser/Thr observed previously (10). In contrast to O-GlcNAcylated Ser/Thr, a slightly different pattern is observed for Tyr O-GlcNAcylation. P-X-Y-S-P is most common for O-GlcNAcylated Tyr sites, similar to that described in a previous report using cultured human cells (7).

We evaluated the potential conservation of O-GlcNAcylated Tyr sites between mice and humans (7) (Fig. 6E). Out of the 96 Tyr O-GlcNAcylated proteins, 17 were also identified to be Tyr O-GlcNAcylated in cultured human cells (7), including zinc finger RNA-binding protein and transcriptional repressor p66-beta. Despite the relatively small numbers, conservation of Tyr O-GlcNAcylation is observed on a number of sites. For example, O-GlcNAcylation was found on Y194 and Y201 of the Zinc finger RNA-binding protein (Zfr) in both mice and human samples. In addition, O-GlcNAcylation on Y299 on transcriptional repressor p66-beta (Gatad2b) and Y2145 on transcription factor HIVEP2 (Hivep2) in mice was observed in the corresponding ortholog proteins in humans. The relative conservation between species indicates that O-GlcNAcylated Tyr sites is of functional importance.

To explore the potential cross-talk between Tyr O-GlcNAcylation and Tyr phosphorylation, we compared the O-GlcNAcylated Tyr sites to a recently published mouse brain phosphoproteomics study, which identified 1429 phosphorylated Tyr sites (84). 12 sites overlapped, representing 9% of all O-GlcNAcylated Tyr sites (Fig. 6F). This ratio was increased to 13%, when compared to a comprehensive phosphorylation database which catalogued 9808 phosphorylated Tyr sites from all mouse samples studied previously (PhosphoSitePlus (85), accessed on February 27, 2025) (Fig. 6F). Taken together, 24 O-GlcNAcylated Tyr sites (17%) colocalize with reported phosphorylated Tyr sites. These ratios are substantially lower than those observed in human cells (∼30.6%) upon acute treatment with Thiamet G (an OGA inhibitor) (7). This difference might be partially ascribed to the different biological conditions used, given that the O-GlcNAcylated Tyr sites in the current study were identified at physiological conditions without experimental stimulation.

In total, we identified 878 proteins with unambiguous O-GlcNAc sites (Fig. 7A). Among them, 293 proteins (33%) had not been previously identified with unambiguous O-GlcNAc sites, according to O-GlcNAcAtlas 4.0 (Fig. 7B). By performing subcellular localization analysis, we found that >98% of the O-GlcNAcylated proteins are localized in intracellular domains (except endoplasmic reticulum and Golgi apparatus). A GO enrichment analysis demonstrates that the 878 O-GlcNAcylated proteins are enriched in biological processes such as the dendrite development, postsynapse organization, and the regulation of synapse structure, activity and organization (Fig. 7C). In addition, they are highly enriched in postsynaptic density, asymmetric synapse, and associated molecular functions (Fig. 7C). Interestingly, a further analysis of the 96 proteins with O-GlcNAcylated Tyr sites reveals that they are also enriched in similar GO terms as those with modified Ser/Thr (supplemental Fig. S9).Of the 558 mouse kinases in the protein kinase database (86), 27 kinases were found with unambiguous O-GlcNAc sites in the current study, including glycogen synthase kinase 3 beta (Gsk3b) and Calcium/calmodulin-dependent protein kinase type IV (Camk4). Among the 96 Tyr O-GlcNAcylated proteins, 30 were also found to be Tyr phosphorylated proteins in the mouse brain phosphoproteomics dataset (84). Clearly, exploration of the roles of Ser/Thr/Tyr O-GlcNAcylation in modulating phosphorylation and the functions of kinases themselves will be an important endeavor. The detailed cross-talk mechanisms between O-GlcNAcylation and phosphorylation (particularly Tyr O-GlcNAcylation and Tyr phosphorylation) at the enzyme- and substrate modification site-specific- levels await further elucidation.

Fig. 7.

Fig. 7

A comprehensive dataset of O-GlcNAcylated proteins from mouse brain.A, proteins with unambiguous O-GlcNAcylated sites identified from PD and MQ. B, comparison to O-GlcNAcylated proteins with unambiguous sites reported in mouse previously. C, GO enrichment analysis of O-GlcNAcylated proteins with unambiguous sites.

Last but not least, GlcNAcylation was identified on 342 Asn residues of 240 proteins (Supplemental Data S4). Amongst, 45% of the identified N-GlcNAc sites are annotated as known N-glycosylation sites, according to the UniProt annotation (https://www.uniprot.org/). For example, GlcNAcylation was observed on N197 of the palmitoyl-protein thioesterase 1 (PPT1) and N109 of leukocyte surface antigen CD47 (CD47) (supplemental Fig. S10). Supplemental Fig. S10B highlights an example of an N-GlcNAc peptide ‘104DAMVGNYTCEVTELSR119’ from CD47, a cell surface protein involved in cell-to-cell interactions. The presence of c6+ and z11+ confidently identifies the modification on N109, and the HCD fragmentation pattern (insert) shows the modification is GlcNAc rather than GalNAc. Further motif analysis of the N-GlcNAc peptides (supplemental Fig. S11) suggests strong enrichment for peptides containing the N-X-S/T sequence, similar as the classic N-glycosylation. The discovery of N-linked GlcNAc-modified peptides is somewhat unexpected, given that a single N-linked GlcNAc was identified on a number of proteins by using lectin weak-affinity chromatography-based enrichment previously (44, 45, 50). Our data suggests that the chemoenzymatic labeling-based workflow can enrich N-GlcNAc peptides, in addition to Ser/Thr/Tyr O-GlcNAc peptides. Although previous work indicated that N-GlcNAcylation can occur by cytosolic deglycosylating enzyme (endo-β-Nacetylglucosaminidase; ENGase) under normal conditions (50, 87), its functional importance in physiology and pathology remains largely unknown.

Discussion

Protein O-GlcNAcylation plays critical roles on many proteins across species. Despite its functional importance, large scale site-specific characterization of protein O-GlcNAcylation has been a bottleneck for many years. Efficient enrichment O-GlcNAc proteins/peptides from complex samples holds promise for deep O-GlcNAc proteomics. By leveraging previous efforts, we set out to evaluate and improve existing analytical methods, aiming to establish more sensitive and robust workflows. We had previously systematically evaluated the performance of several affinity enrichment methods/materials for O-GlcNAc proteomics (74). In another study, we utilized chemoenzymatic labeling coupled with click chemistry to analyze the same set of human cell samples using the same set of instruments (7). Very encouragingly, the resulting workflow yielded unprecedented O-GlcNAc proteome depth, enabling our discovery of Tyr O-GlcNAcylation. Prompted by that work, in the present study we proposed to assess and further optimize chemoenzymatic labeling/click chemistry-based methods. Specifically, we conducted a head-to-head comparison of workflows involving four cleavable biotin-alkyne reagents for site-specific O-GlcNAc proteomics. After being evaluated using standard O-GlcNAc peptides, the methods were then benchmarked for O-GlcNAc proteomics of mouse brain samples.

By using three standard O-GlcNAc peptides, we established analytical workflows integrating GalT1 (Y289L) chemoenzymatic labeling, click chemistry with four cleavable biotin-alkyne reagents (PC, DADPS, Dde, and Diazo), and EThcD mass spectrometry for O-GlcNAc analysis (Fig. 1). We found that all the biotin-alkyne probes could be used to enrich not only Ser/Thr O-GlcNAcylated peptides but also the Tyr O-GlcNAcylated peptide, providing further evidence that the chemoenzymatic labeling/click chemistry approach can enrich all types of O-GlcNAcylated peptides in an unbiased manner (7). However, differences were observed for the resulting O-GlcNAc peptides enriched with different biotin-alkyne probes (supplemental Fig. S1). Among the probes, Dde yielded the lowest intensity of all the tagged O-GlcNAc peptides, which is likely due to its degradation during sample processing (62). By analyzing the brain lysates, we revealed that all the biotin-alkyne probes could be used to profile the O-GlcNAc proteome of complex samples, despite significant differences in coverage. In comparison to the Dde probe, PC and DADPS yield many more identifications (including more O-GlcNAc PSMs, O-GlcNAc peptides, unambiguous O-GlcNAc sites, and O-GlcNAc proteins) (Table 1). Distinct from its performance for standard O-GlcNAc peptides, the Diazo approach gave the fewest O-GlcNAc identifications. These data suggest that the PC and DADPS approaches are better choices for sensitive O-GlcNAc proteomics. Moreover, although similar reproducibility of standard O-GlcNAc peptides was achieved from workflows of all probes (supplemental Fig. S2), PC and DADPS approaches demonstrated an overall higher reproducibility of O-GlcNAc peptides from brain samples (Fig. 4B). From the enrichment specificity perspective, a specificity up to 47% was achieved for the PC approach, which is much higher than the other three biotin-alkyne probes (Fig. 4A). Even though a low specificity down to 22% was obtained from the DADPS and Dde approaches, this is still substantially higher than that of the affinity enrichment approaches (74). Collectively, our data suggest that the PC-biotin-alkyne approach, followed by the DADPS-biotin-alkyne approach, provides the highest sensitivity, reproducibility, and specificity for O-GlcNAc proteomics. Besides presenting two analytical strategies, i.e., for peptide-level labeling (Fig. 1) and for protein-level labeling (Fig. 3), our work provides invaluable insights into the technical aspects of O-GlcNAc proteomics and will help guide the selection of appropriate methods for O-GlcNAc proteomics by others. Of note, we employed HCD-pd-EThcD in which oxonium ions from HCD were used to trigger EThcD fragmentation in our study, with the generated HCD and EThcD spectra searched together in a combined workflow during data processing. We noticed that although the vast majority of O-GlcNAc sites were identified by spectra based on EThcD only or both HCD and EThcD, a small portion (<10%) was exclusively contributed by HCD mass spectra. Despite a high localization threshold (≥0.75) being used, careful annotation of such sites should be taken. Efforts on the development of more advanced mass spectrometric fragmentation techniques and site-scoring algorithms would facilitate improved site localization in O-GlcNAc proteomics studies.

We exemplified the application of the O-GlcNAc proteomics workflows to the profiling of O-GlcNAcylated proteins in mouse brain lysates, as brain (61, 63, 68) and the synaptosome (43, 45) contain proteins heavily modified by O-GlcNAcylation. Using mouse brain lysates as a complex sample, we generated a comprehensive dataset of protein O-GlcNAcylation. In total, 2906 O-GlcNAc sites were unambiguously mapped on 878 proteins. Remarkably, this is several-fold higher than the numbers of O-GlcNAc sites identified from mouse brain samples previously (61, 63, 68). To our knowledge, this represents not only the biggest dataset of an individual project on brain samples, but also the biggest dataset of O-GlcNAc sites reported in an individual study of any samples to date. Besides mapping many known O-GlcNAc sites, 1611 O-GlcNAc sites were newly identified in this study. For example, microtubule-associated protein 2 (MAP-2), a dendritically enriched protein and marker of synaptic plasticity (88), was previously reported to be an O-GlcNAc protein, with a number of sites identified (45, 49, 68). In addition to mapping several known sites, we unambiguously identified several new O-GlcNAc sites, including S361 (which was reported previously as an ambiguous site (49)), S496, and S1057. Of note, although our dataset is of unprecedented depth, there is still room to further expand the catalog of O-GlcNAcylated proteins/sites. For example, O-GlcNAcylation on S705, S714, and T719 on the microtubule-associated protein tau was identified by our strategy, whereas two O-GlcNAcylated sites, S692 and T705 were identified from tau-enriched samples and mouse hippocampus samples (49, 89). Thus, instead of using the whole brain lysates, other sample preparation methods such as sub-region fractionation or immunoprecipitation will certainly help increase the identification of O-GlcNAcylated sites/proteins. We anticipate that the substantially expanded dataset of O-GlcNAcylated sites/proteins in the brain will spur more interest in probing the roles of O-GlcNAcylation in the brain. Arguably, further functional studies on protein O-GlcNAcylation will provide a deeper understanding of brain physiology and facilitate elucidation of the etiology of neurogenerative diseases such as Alzheimer’s disease and Parkinson’s disease.

Finally, our dataset of mouse brain O-GlcNAc proteomics provides invaluable insights into the biology of protein O-GlcNAcylation, particularly Tyr O-GlcNAcylation. Our previous work identified O-GlcNAcylation on 121 Tyr sites on proteins in cultured human cancer cells stimulated with thiamet G (an inhibitor of OGA) (7). However, several questions remained, e.g., 1) it was unclear whether Tyr O-GlcNAcylation existed on endogenous proteins without any stimulation, and 2) it was unknown whether Tyr O-GlcNAcylation would be a common phenomenon in other types of samples. Very encouragingly, our mouse brain O-GlcNAc proteomics datasets have offered some valuable hints. Firstly, 138 Tyr sites were found O-GlcNAcylated from mouse brain, suggesting that Tyr O-GlcNAcylation can be detected without external stimulation, and Tyr O-GlcNAcylation does exist in sample sources other than human cancer cells. Based upon all these findings, we argue that Tyr O-GlcNAcylation is a widespread modification on many proteins in multiple species. Secondly, comparing the identified Tyr O-GlcNAcylation sites between human and mouse samples allows us to explore the potential conservation of Tyr O-GlcNAcylation across species. Although the datasets are still relatively small and the sample types are not strictly comparable, we do observe conservation of some Tyr O-GlcNAcylation sites on proteins. The cross-species conservation of Tyr O-GlcNAcylation sites further indicates that Tyr O-GlcNAcylation has a regulatory role. Thirdly, our work enables a better understanding of other key features of Tyr O-GlcNAcylation, such as its percentage distribution and site-specific cross-talk with Tyr phosphorylation. Amongst the O-GlcNAc-modified residues, the ratio of O-GlcNAcylated Tyr appears to be relatively small (4–5%). Moreover, O-GlcNAcylated Tyr sites and phosphorylated Tyr sites do not seem to overlap overwhelmingly (<15%) in basal conditions, displaying certain agreement with that observed between O-GlcNAcylated Ser/Thr sites and phosphorylated Ser/Thr sites (45). It will be intriguing to see whether there is a higher degree of colocalization (i.e., reciprocal competition) between Tyr O-GlcNAcylation and Tyr phosphorylation in stimulated conditions, as shown in our previous report (7). In addition, besides many serine/threonine kinases, several tyrosine kinases, including tyrosine-protein kinase ABL2 (Abl2) and tyrosine kinase non-receptor protein 2 (activated CDC42 kinase 1, Tnk2), are themselves O-GlcNAcylated. Further exploration of the roles of O-GlcNAcylation on tyrosine kinases' function will offer new insights into the cross-talk at the enzyme level, as well as the intracellular network mediated by Ser/Thr/Tyr O-GlcNAcylation and Ser/Thr/Tyr phosphorylation.

In conclusion, we developed chemoenzymatic labeling/click chemistry-based analytical workflows for O-GlcNAc proteomics, focusing on assessing the performance of four cleavable biotin-alkyne reagents (PC-biotin-alkyne, DADPS-biotin-alkyne, Dde-biotin-alkyne, and Diazo-biotin-alkyne). Besides standard O-GlcNAc peptides, mouse brain samples were used to validate the analytical merits of these reagents. Our data suggest that workflows using PC-biotin-alkyne and DADPS-biotin-alkyne outperform the others regarding sensitivity, reproducibility, and specificity for O-GlcNAc proteomics. The analytical workflows established and the evaluations performed will help guide the selection of analytical workflows for site-specific and systems-scale O-GlcNAc proteomics studies. By benchmarking their applications to mouse brain samples, we have generated the largest dataset of brain O-GlcNAc proteomics to date, which provides not only a reference for further studying O-GlcNAcylation in the brain but also critical insight into Tyr O-GlcNAcylation. Taken together, our data provides both invaluable insights into the technical aspects of O-GlcNAc proteomics and additional understanding of O-GlcNAcylation biology (particularly Tyr O-GlcNAcylation). We anticipate that our work will likely contribute to the acceleration of O-GlcNAc proteomics research, creating instrumental tools to interrogate functional roles of protein O-GlcNAcylation in a wide range of biomedical settings.

Data Availability

This article contains supplemental data, including all validated data reported in tabular format. The mass spectrometry proteomics data, including raw, .msf, and annotated/indexed glycopeptide spectra, have been deposited to the ProteomeXchange Consortium via the PRIDE (90) partner repository with the dataset identifier PXD063995 and 10.6019/PXD063995. Annotated data used for the generation of figures, as well as all codes utilized for generating figures and processing data, are available upon request to the corresponding author.

Supplemental Data

This article contains supplemental data (7, 53, 74).

Conflict of interest

The authors declare that they have no conflicts of interest with the contents of this article.

Acknowledgments

We are indebted to previous members of the Dr Gerald W. Hart lab at the Johns Hopkins University and Dr Donald F. Hunt lab at University of Virginia for insightful communications in the early stage of developing bioorthogonal methods for site-specific O-GlcNAc proteomics.

Author Contributions

C. H. and J. M. conceptualization; C. H., J. D., H. Z., and X. W. investigation; S. B., M. L., D. T. S. P., K. W. M., H. P., G. W. H. resources; C. H. and J. M. writing–original draft; C. H., S. B., M. L., D. T. S. P., K. W. M., H.P., G. W. H., and J. M. writing–review and editing.

Funding and Additional Information

This work was in part supported by NIH/NCI grant P30 CA051008, the Mizutani Foundation for Glycoscience, and the CCB Program Pilot Fund. The Orbitrap Lumos Tribrid mass spectrometer was partially supported by Dekelbaum Foundation.

Supplementary data

Supplemental Data S1
mmc1.xlsx (32.3MB, xlsx)
Supplemental Data S2
mmc2.xlsx (1.4MB, xlsx)
Supplemental Data S3
mmc3.xlsx (140.1KB, xlsx)
Supplemental Data S4
mmc4.xlsx (41.8KB, xlsx)
Supplemental Figures and Table
mmc5.docx (55.9MB, docx)

References

  • 1.Torres C.R., Hart G.W. Topography and polypeptide distribution of terminal N-acetylglucosamine residues on the surfaces of intact lymphocytes. Evidence for O-linked GlcNAc. J. Biol. Chem. 1984;259:3308–3317. [PubMed] [Google Scholar]
  • 2.Holt G.D., Hart G.W. The subcellular distribution of terminal N-acetylglucosamine moieties. Localization of a novel protein-saccharide linkage, O-linked GlcNAc. J. Biol. Chem. 1986;261:8049–8057. [PubMed] [Google Scholar]
  • 3.Hart G.W., Housley M.P., Slawson C. Cycling of O-linked beta-N-acetylglucosamine on nucleocytoplasmic proteins. Nature. 2007;446:1017–1022. doi: 10.1038/nature05815. [DOI] [PubMed] [Google Scholar]
  • 4.Hanover J.A., Yu S., Lubas W.B., Shin S.H., Ragano-Caracciola M., Kochran J., et al. Mitochondrial and nucleocytoplasmic isoforms of O-linked GlcNAc transferase encoded by a single mammalian gene. Arch. Biochem. Biophys. 2003;409:287–297. doi: 10.1016/s0003-9861(02)00578-7. [DOI] [PubMed] [Google Scholar]
  • 5.Ma J., Liu T., Wei A.C., Banerjee P., O’Rourke B., Hart G.W. O-GlcNAcomic profiling identifies widespread O-Linked β-N-Acetylglucosamine modification (O-GlcNAcylation) in oxidative phosphorylation System regulating cardiac mitochondrial function. J. Biol. Chem. 2015;290:29141–29153. doi: 10.1074/jbc.M115.691741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Banerjee P.S., Ma J., Hart G.W. Diabetes-associated dysregulation of O-GlcNAcylation in rat cardiac mitochondria. Proc. Natl. Acad. Sci. U. S. A. 2015;112:6050–6055. doi: 10.1073/pnas.1424017112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hou C., Deng J., Wu C., Zhang J., Byers S., Moremen K.W., et al. Ultradeep O-GlcNAc proteomics reveals widespread O-GlcNAcylation on tyrosine residues of proteins. Proc. Natl. Acad. Sci. U. S. A. 2024;121 doi: 10.1073/pnas.2409501121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ma J., Li Y., Hou C., Wu C. O-GlcNAcAtlas: a database of experimentally identified O-GlcNAc sites and proteins. Glycobiology. 2021 doi: 10.1093/glycob/cwab003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hou C., Li W., Li Y., Ma J. O-GlcNAcAtlas 4.0: an updated protein O-GlcNAcylation database with site-specific quantification. J. Mol. Biol. 2025;21 doi: 10.1016/j.jmb.2025.169033. [DOI] [PubMed] [Google Scholar]
  • 10.Ma J., Hou C., Wu C. Demystifying the O-GlcNAc code: a systems view. Chem. Rev. 2022;122:15822–15864. doi: 10.1021/acs.chemrev.1c01006. [DOI] [PubMed] [Google Scholar]
  • 11.Hart G.W., Slawson C., Ramirez-Correa G., Lagerlof O. Cross talk between O-GlcNAcylation and phosphorylation: roles in signaling, transcription, and chronic disease. Annu. Rev. Biochem. 2011;80:825–858. doi: 10.1146/annurev-biochem-060608-102511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bond M.R., Hanover J.A. O-GlcNAc cycling: a link between metabolism and chronic disease. Annu. Rev. Nutr. 2013;33:205–229. doi: 10.1146/annurev-nutr-071812-161240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yang X., Qian K. Protein O-GlcNAcylation: emerging mechanisms and functions. Nat. Rev. Mol. Cell Biol. 2017;18:452–465. doi: 10.1038/nrm.2017.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chatham J.C., Zhang J., Wende A.R. Role of O-Linked N-Acetylglucosamine protein modification in cellular (Patho)Physiology. Physiol. Rev. 2021;101:427–493. doi: 10.1152/physrev.00043.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zachara N.E., Akimoto Y., Boyce M., Hart G.W. In: Essentials of Glycobiology [Internet] 4th ed. Varki A., Cummings R.D., Esko J.D., Stanley P., Hart G.W., Aebi M., et al., editors. Cold Spring Harbor Laboratory Press; Cold Spring Harbor (NY): 2022. The O-GlcNAc modification. [Google Scholar]
  • 16.Cheng S.S., Mody A.C., Woo C.M. Opportunities for therapeutic modulation of O-GlcNAc. Chem. Rev. 2024;124:12918–13019. doi: 10.1021/acs.chemrev.4c00417. [DOI] [PubMed] [Google Scholar]
  • 17.Aizezi Y., Yuan Y., Xu S.L., Wang Z.Y. A tale of two sugars: o-glcnac and O-fucose orchestrate growth, development, and acclimation in plants. Trends Biochem. Sci. 2025;50:332–343. doi: 10.1016/j.tibs.2025.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kreppel L.K., Blomberg M.A., Hart G.W. Dynamic glycosylation of nuclear and cytosolic proteins: CLONING AND CHARACTERIZATION OF A UNIQUE O-GlcNAc TRANSFERASE WITH MULTIPLE TETRATRICOPEPTIDE REPEATS. J. Biol. Chem. 1997;272:9308–9315. doi: 10.1074/jbc.272.14.9308. [DOI] [PubMed] [Google Scholar]
  • 19.Lubas W.A., Frank D.W., Krause M., Hanover J.A. O-Linked GlcNAc transferase is a conserved nucleocytoplasmic protein containing tetratricopeptide repeats. J. Biol. Chem. 1997;272:9316–9324. doi: 10.1074/jbc.272.14.9316. [DOI] [PubMed] [Google Scholar]
  • 20.N O., Ne Z., Gw H., Jd M. Ogt-dependent X-chromosome-linked protein glycosylation is a requisite modification in somatic cell function and embryo viability. Mol. Cell. Biol. 2004;24:1680–1690. doi: 10.1128/MCB.24.4.1680-1690.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Je R., Pm C., De M., Rl N., Ec P., Lc H.W. Dynamic O-GlcNAc modification regulates CREB-mediated gene expression and memory formation. Nat. Chem. Biol. 2012;8:253–261. doi: 10.1038/nchembio.770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lagerlöf O., Slocomb J.E., Hong I., Aponte Y., Blackshaw S., Hart G.W., et al. The nutrient sensor OGT in PVN neurons regulates feeding. Science. 2016;351:1293–1296. doi: 10.1126/science.aad5494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lagerlöf O. O-GlcNAc cycling in the developing, adult and geriatric brain. J. Bioenerg. Biomembr. 2018;50:241–261. doi: 10.1007/s10863-018-9760-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Gong C.X., Liu F., Iqbal K. O-GlcNAcylation: a regulator of tau pathology and neurodegeneration. Alzheimer’s Demen. 2016;12:1078–1089. doi: 10.1016/j.jalz.2016.02.011. [DOI] [PubMed] [Google Scholar]
  • 25.Pratt M.R., Vocadlo D.J. Understanding and exploiting the roles of O-GlcNAc in neurodegenerative diseases. J. Biol. Chem. 2023;299 doi: 10.1016/j.jbc.2023.105411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Griffith L.S., Mathes M., Schmitz B. Beta-amyloid precursor protein is modified with O-linked N-acetylglucosamine. J. Neurosci. Res. 1995;41:270–278. doi: 10.1002/jnr.490410214. [DOI] [PubMed] [Google Scholar]
  • 27.Arnold C.S., Johnson G.V., Cole R.N., Dong D.L., Lee M., Hart G.W. The microtubule-associated protein tau is extensively modified with O-linked N-acetylglucosamine. J. Biol. Chem. 1996;271:28741–28744. doi: 10.1074/jbc.271.46.28741. [DOI] [PubMed] [Google Scholar]
  • 28.Liu F., Iqbal K., Grundke-Iqbal I., Hart G.W., Gong C.X. O-GlcNAcylation regulates phosphorylation of tau: a mechanism involved in Alzheimer’s disease. Proc. Natl. Acad. Sci. U. S. A. 2004;101:10804–10809. doi: 10.1073/pnas.0400348101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Huang C.W., Rust N.C., Wu H.F., Yin A., Zeltner N., Yin H., et al. Low glucose induced Alzheimer’s disease-like biochemical changes in human induced pluripotent stem cell-derived neurons is due to dysregulated O-GlcNAcylation. Alzheimer’s Demen. 2023;19:4872–4885. doi: 10.1002/alz.13058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Yuzwa S.A., Macauley M.S., Heinonen J.E., Shan X., Dennis R.J., He Y., et al. A potent mechanism-inspired O-GlcNAcase inhibitor that blocks phosphorylation of tau in vivo. Nat. Chem. Biol. 2008;4:483–490. doi: 10.1038/nchembio.96. [DOI] [PubMed] [Google Scholar]
  • 31.Yuzwa S.A., Shan X., Macauley M.S., Clark T., Skorobogatko Y., Vosseller K., et al. Increasing O-GlcNAc slows neurodegeneration and stabilizes tau against aggregation. Nat. Chem. Biol. 2012;8:393–399. doi: 10.1038/nchembio.797. [DOI] [PubMed] [Google Scholar]
  • 32.Marotta N.P., Lin Y.H., Lewis Y.E., Ambroso M.R., Zaro B.W., Roth M.T., et al. O-GlcNAc modification blocks the aggregation and toxicity of the protein α-synuclein associated with Parkinson’s disease. Nat. Chem. 2015;7:913–920. doi: 10.1038/nchem.2361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Balana A.T., Mahul-Mellier A.L., Nguyen B.A., Horvath M., Javed A., Hard E.R., et al. O-GlcNAc forces an α-synuclein amyloid strain with notably diminished seeding and pathology. Nat. Chem. Biol. 2024;20:646–655. doi: 10.1038/s41589-024-01551-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Selnick H.G., Hess J.F., Tang C., Liu K., Schachter J.B., Ballard J.E., et al. Discovery of MK-8719, a potent O-GlcNAcase inhibitor as a potential treatment for tauopathies. J. Med. Chem. 2019;62:10062–10097. doi: 10.1021/acs.jmedchem.9b01090. [DOI] [PubMed] [Google Scholar]
  • 35.Permanne B., Sand A., Ousson S., Nény M., Hantson J., Schubert R., et al. O-GlcNAcase inhibitor ASN90 is a multimodal drug candidate for Tau and α-Synuclein proteinopathies. ACS Chem. Neurosci. 2022;13:1296–1314. doi: 10.1021/acschemneuro.2c00057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Cheng Z., Shang N., Wang X., Kang Y., Zhou J., Lan J., et al. Discovery of 4-(Arylethynyl)piperidine derivatives as Potent nonsaccharide O-GlcNAcase inhibitors for the treatment of alzheimer’s disease. J. Med. Chem. 2024;67:14292–14312. doi: 10.1021/acs.jmedchem.4c01132. [DOI] [PubMed] [Google Scholar]
  • 37.Kielbasa W., Goldsmith P., Donnelly K.B., Nuthall H.N., Shcherbinin S., Fleisher A.S., et al. Discovery and clinical translation of ceperognastat, an O-GlcNAcase (OGA) inhibitor, for the treatment of Alzheimer’s disease. Alzheimer’s Demen. 2024;10 doi: 10.1002/trc2.70020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ma J., Wu C., Hart G.W. Analytical and biochemical perspectives of protein O-GlcNAcylation. Chem. Rev. 2021;121:1513–1581. doi: 10.1021/acs.chemrev.0c00884. [DOI] [PubMed] [Google Scholar]
  • 39.Ma J., Hart G.W. O-GlcNAc profiling: from proteins to proteomes. Clin. Proteomics. 2014;11:8. doi: 10.1186/1559-0275-11-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Maynard J.C., Chalkley R.J. Methods for enrichment and assignment of N-Acetylglucosamine modification sites. Mol. Cell Proteomics. 2021;20 doi: 10.1074/mcp.R120.002206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Udeshi N.D., Hart G.W., Slawson C. From Fringe to the mainstream: how ETD MS brought O-GlcNAc to the masses. Mol. Cell Proteomics. 2024;23 doi: 10.1016/j.mcpro.2024.100859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zhao P., Viner R., Teo C.F., Boons G.J., Horn D., Wells L. Combining high-energy C-Trap dissociation and electron transfer dissociation for protein O-GlcNAc modification site assignment. J. Proteome Res. 2011;10:4088–4104. doi: 10.1021/pr2002726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Burt R.A., Dejanovic B., Peckham H.J., Lee K.A., Li X., Ounadjela J.R., et al. Novel antibodies for the simple and efficient enrichment of native O-GlcNAc modified peptides. Mol. Cell Proteomics. 2021;20 doi: 10.1016/j.mcpro.2021.100167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chalkley R.J., Thalhammer A., Schoepfer R., Burlingame A.L. Identification of protein O-GlcNAcylation sites using electron transfer dissociation mass spectrometry on native peptides. Proc. Natl. Acad. Sci. U. S. A. 2009;106:8894–8899. doi: 10.1073/pnas.0900288106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Trinidad J.C., Barkan D.T., Gulledge B.F., Thalhammer A., Sali A., Schoepfer R., et al. Global identification and characterization of both O-GlcNAcylation and phosphorylation at the murine synapse. Mol. Cell Proteomics. 2012;11:215–229. doi: 10.1074/mcp.O112.018366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Nagel A.K., Schilling M., Comte-Walters S., Berkaw M.N., Ball L.E. Identification of O-Linked N-Acetylglucosamine (O-GlcNAc)-modified osteoblast proteins by Electron transfer dissociation Tandem Mass spectrometry reveals proteins critical for bone Formation. Mol. Cell Proteomics. 2013;12:945–955. doi: 10.1074/mcp.M112.026633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Xu S.L., Chalkley R.J., Maynard J.C., Wang W., Ni W., Jiang X., et al. Proteomic analysis reveals O-GlcNAc modification on proteins with key regulatory functions in Arabidopsis. Proc. Natl. Acad. Sci. U. S. A. 2017;114:E1536–E1543. doi: 10.1073/pnas.1610452114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Liu W., Han G., Yin Y., Jiang S., Yu G., Yang Q., et al. AANL (Agrocybe aegerita lectin 2) is a new facile tool to probe for O-GlcNAcylation. Glycobiology. 2018;28:363–373. doi: 10.1093/glycob/cwy029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.White C.W., Fan X., Maynard J.C., Wheatley E.G., Bieri G., Couthouis J., et al. Age-related loss of neural stem cell O-GlcNAc promotes a glial fate switch through STAT3 activation. Proc. Natl. Acad. Sci. U. S. A. 2020;117:22214–22224. doi: 10.1073/pnas.2007439117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Maynard J.C., Fujihira H., Dolgonos G.E., Suzuki T., Burlingame A.L. Cytosolic N-GlcNAc proteins are formed by the action of endo-β-N-acetylglucosaminidase. Biochem. Biophys. Res. Commun. 2020;530:719–724. doi: 10.1016/j.bbrc.2020.06.127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Shrestha R., Karunadasa S., Grismer T.S., Reyes A.V., Xu S.L. SECRET AGENT O-GlcNAcylates hundreds of proteins involved in diverse cellular processes in arabidopsis. Mol. Cell Proteomics. 2024;23 doi: 10.1016/j.mcpro.2024.100732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Zhang W., Liu T., Dong H., Bai H., Tian F., Shi Z., et al. Synthesis of a highly azide-reactive and thermosensitive biofunctional reagent for efficient enrichment and large-scale identification of O-GlcNAc proteins by mass spectrometry. Anal. Chem. 2017;89:5810–5817. doi: 10.1021/acs.analchem.6b04960. [DOI] [PubMed] [Google Scholar]
  • 53.Wu C., Shi S., Hou C., Luo Y., Byers S., Ma J. Design and preparation of novel nitro-oxide-grafted nanospheres with enhanced hydrogen bonding interaction for O-GlcNAc analysis. ACS Appl. Mater. Inter. 2022;14:47482–47490. doi: 10.1021/acsami.2c15039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wells L., Vosseller K., Cole R.N., Cronshaw J.M., Matunis M.J., Hart G.W. Mapping sites of O-GlcNAc modification using affinity tags for serine and threonine post-translational modifications. Mol. Cell Proteomics. 2002;1:791–804. doi: 10.1074/mcp.m200048-mcp200. [DOI] [PubMed] [Google Scholar]
  • 55.Overath T., Kuckelkorn U., Henklein P., Strehl B., Bonar D., Kloss A., et al. Mapping of O-GlcNAc sites of 20 S proteasome subunits and Hsp90 by a novel biotin-cystamine tag. Mol. Cell Proteomics. 2012;11:467–477. doi: 10.1074/mcp.M111.015966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ma J., Banerjee P., Whelan S.A., Liu T., Wei A.C., Ramirez-Correa G., et al. Comparative proteomics reveals dysregulated mitochondrial O-GlcNAcylation in diabetic hearts. J. Proteome Res. 2016;15:2254–2264. doi: 10.1021/acs.jproteome.6b00250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Khidekel N., Ficarro S.B., Clark P.M., Bryan M.C., Swaney D.L., Rexach J.E., et al. Probing the dynamics of O-GlcNAc glycosylation in the brain using quantitative proteomics. Nat. Chem. Biol. 2007;3:339–348. doi: 10.1038/nchembio881. [DOI] [PubMed] [Google Scholar]
  • 58.Wang Z., Udeshi N.D., O’Malley M., Shabanowitz J., Hunt D.F., Hart G.W. Enrichment and site mapping of O-Linked N-Acetylglucosamine by a combination of Chemical/Enzymatic tagging, photochemical cleavage, and Electron transfer dissociation Mass spectrometry. Mol. Cell Proteomics. 2010;9:153–160. doi: 10.1074/mcp.M900268-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Ma J., Wang W.H., Li Z., Shabanowitz J., Hunt D.F., Hart G.W. O-GlcNAc site mapping by using a combination of chemoenzymatic labeling, copper-free click chemistry, reductive cleavage, and electron-transfer dissociation Mass spectrometry. Anal. Chem. 2019;91:2620–2625. doi: 10.1021/acs.analchem.8b05688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Wang Z., Udeshi N.D., Slawson C., Compton P.D., Sakabe K., Cheung W.D., et al. Extensive crosstalk between O-GlcNAcylation and phosphorylation regulates cytokinesis. Sci. Signal. 2010;3:ra2. doi: 10.1126/scisignal.2000526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Alfaro J.F., Gong C.X., Monroe M.E., Aldrich J.T., Clauss T.R.W., Purvine S.O., et al. Tandem mass spectrometry identifies many mouse brain O-GlcNAcylated proteins including EGF domain-specific O-GlcNAc transferase targets. PNAS. 2012;109:7280–7285. doi: 10.1073/pnas.1200425109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Griffin M.E., Jensen E.H., Mason D.E., Jenkins C.L., Stone S.E., Peters E.C., et al. Comprehensive mapping of O-GlcNAc modification sites using a chemically cleavable tag. Mol. Biosyst. 2016;12:1756–1759. doi: 10.1039/c6mb00138f. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Wang S., Yang F., Petyuk V.A., Shukla A.K., Monroe M.E., Gritsenko M.A., et al. Quantitative proteomics identifies altered O-GlcNAcylation of structural, synaptic and memory-associated proteins in Alzheimer’s disease. J. Pathol. 2017;243:78–88. doi: 10.1002/path.4929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Woo C.M., Lund P.J., Huang A.C., Davis M.M., Bertozzi C.R., Pitteri S.J. Mapping and quantification of over 2000 O-linked glycopeptides in activated human T cells with isotope-targeted glycoproteomics (Isotag) Mol. Cell Proteomics. 2018;17:764–775. doi: 10.1074/mcp.RA117.000261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Li J., Li Z., Duan X., Qin K., Dang L., Sun S., et al. An isotope-coded photocleavable probe for quantitative profiling of protein O-GlcNAcylation. ACS Chem. Biol. 2019;14:4–10. doi: 10.1021/acschembio.8b01052. [DOI] [PubMed] [Google Scholar]
  • 66.Xu S., Sun F., Wu R. A chemoenzymatic method based on easily accessible enzymes for profiling protein O-GlcNAcylation. Anal. Chem. 2020;92:9807–9814. doi: 10.1021/acs.analchem.0c01284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Liu J., Hao Y., He Y., Li X., Sun D., Zhang Y., et al. Quantitative and site-specific chemoproteomic profiling of protein O-GlcNAcylation in the cell cycle. ACS Chem. Biol. 2021;16:1917–1923. doi: 10.1021/acschembio.1c00301. [DOI] [PubMed] [Google Scholar]
  • 68.Huynh V.N., Wang S., Ouyang X., Wani W.Y., Johnson M.S., Chacko B.K., et al. Defining the dynamic regulation of O-GlcNAc proteome in the mouse Cortex---the O-GlcNAcylation of synaptic and trafficking proteins related to neurodegenerative diseases. Front Aging. 2021;2 doi: 10.3389/fragi.2021.757801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Chen Y., Tang F., Qin H., Yue X., Nie Y., Huang W., et al. Endo-M mediated chemoenzymatic approach enables reversible glycopeptide labeling for O-GlcNAcylation analysis. Angew. Chem. Int. Ed. Engl. 2022;61 doi: 10.1002/anie.202117849. [DOI] [PubMed] [Google Scholar]
  • 70.Xu S., Tong M., Suttapitugsakul S., Wu R. Spatial and temporal proteomics reveals the distinct distributions and dynamics of O-GlcNAcylated proteins. Cell Rep. 2022;39 doi: 10.1016/j.celrep.2022.110946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Liu J., Hao Y., Wang C., Jin Y., Yang Y., Gu J., et al. An optimized isotopic photocleavable tagging strategy for site-specific and quantitative profiling of protein O-GlcNAcylation in colorectal cancer metastasis. ACS Chem. Biol. 2022;17:513–520. doi: 10.1021/acschembio.1c00981. [DOI] [PubMed] [Google Scholar]
  • 72.Li S., Zhu H., Wang J., Wang X., Li X., Ma C., et al. Comparative analysis of Cu (I)-catalyzed alkyne-azide cycloaddition (CuAAC) and strain-promoted alkyne-azide cycloaddition (SPAAC) in O-GlcNAc proteomics. Electrophoresis. 2016;37:1431–1436. doi: 10.1002/elps.201500491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wang J., Dou B., Zheng L., Cao W., Zeng X., Wen Y., et al. Synthesis of Na2S2O4 mediated cleavable affinity tag for labeling of O-GlcNAc modified proteins via azide-alkyne cycloaddition. Bioorg. Med. Chem. Lett. 2021;48 doi: 10.1016/j.bmcl.2021.128244. [DOI] [PubMed] [Google Scholar]
  • 74.Hou C., Wu C., Wu Z., Cheng Y., Li W., Sun H., et al. Systematic evaluation of affinity enrichment methods for O-GlcNAc proteomics. J. Proteome Res. 2024;23:4422–4432. doi: 10.1021/acs.jproteome.4c00388. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Vocadlo D.J., Hang H.C., Kim E.J., Hanover J.A., Bertozzi C.R. A chemical approach for identifying O-GlcNAc-modified proteins in cells. PNAS. 2003;100:9116–9121. doi: 10.1073/pnas.1632821100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Nandi A., Sprung R., Barma D.K., Zhao Y., Kim S.C., Falck J.R., et al. Global identification of O-GlcNAc-Modified proteins. Anal. Chem. 2006;78:452–458. doi: 10.1021/ac051207j. [DOI] [PubMed] [Google Scholar]
  • 77.Boyce M., Carrico I.S., Ganguli A.S., Yu S.H., Hangauer M.J., Hubbard S.C., et al. Metabolic cross-talk allows labeling of O-linked β-N-acetylglucosamine-modified proteins via the N-acetylgalactosamine salvage pathway. PNAS. 2011;108:3141–3146. doi: 10.1073/pnas.1010045108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Moremen K.W., Ramiah A., Stuart M., Steel J., Meng L., Forouhar F., et al. Expression system for structural and functional studies of human glycosylation enzymes. Nat. Chem. Biol. 2018;14:156–162. doi: 10.1038/nchembio.2539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Myakala K., Wang X.X., Shults N.V., Krawczyk E., Jones B.A., Yang X., et al. NAD metabolism modulates inflammation and mitochondria function in diabetic kidney disease. J. Biol. Chem. 2023;299 doi: 10.1016/j.jbc.2023.104975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Li W., Hou C., Li Y., Wu C., Ma J. HexNAcQuest: a tool to distinguish O-GlcNAc and O-GalNAc. J. Am. Soc. Mass Spectrom. 2022;33:2008–2012. doi: 10.1021/jasms.2c00172. [DOI] [PubMed] [Google Scholar]
  • 81.Yang Y., Verhelst S.H.L. Cleavable trifunctional biotin reagents for protein labelling, capture and release. Chem. Commun. 2013;49:5366–5368. doi: 10.1039/c3cc42076k. [DOI] [PubMed] [Google Scholar]
  • 82.Szychowski J., Mahdavi A., Hodas J.J.L., Bagert J.D., Ngo J.T., Landgraf P., et al. Cleavable biotin probes for labeling of biomolecules via azide−alkyne cycloaddition. J. Am. Chem. Soc. 2010;132:18351–18360. doi: 10.1021/ja1083909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Hou C., Li W., Li Y., Ma J. O-GlcNAc informatics: advances and trends. Anal. Bioanal. Chem. 2024;417:895–905. doi: 10.1007/s00216-024-05531-2. [DOI] [PubMed] [Google Scholar]
  • 84.Yang S., Han Y., Li Y., Zhang L., Yan G., Yuan J., et al. Rapid and high-sensitive phosphoproteomics elucidated the spatial dynamics of the mouse brain. Anal. Chem. 2023;95:10703–10712. doi: 10.1021/acs.analchem.3c01486. [DOI] [PubMed] [Google Scholar]
  • 85.Hornbeck P.V., Kornhauser J.M., Latham V., Murray B., Nandhikonda V., Nord A., et al. 15 years of PhosphoSitePlus®: integrating post-translationally modified sites, disease variants and isoforms. Nucleic Acids Res. 2019;47:D433–D441. doi: 10.1093/nar/gky1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Caenepeel S., Charydczak G., Sudarsanam S., Hunter T., Manning G. The mouse kinome: discovery and comparative genomics of all mouse protein kinases. Proc. Natl. Acad. Sci. U. S. A. 2004;101:11707–11712. doi: 10.1073/pnas.0306880101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Huang C., Harada Y., Hosomi A., Masahara-Negishi Y., Seino J., Fujihira H., et al. Endo-β-N-acetylglucosaminidase forms N-GlcNAc protein aggregates during ER-associated degradation in Ngly1-defective cells. Proc. Natl. Acad. Sci. U. S. A. 2015;112:1398–1403. doi: 10.1073/pnas.1414593112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.DeGiosio R.A., Grubisha M.J., MacDonald M.L., McKinney B.C., Camacho C.J., Sweet R.A. More than a marker: potential pathogenic functions of MAP2. Front Mol. Neurosci. 2022;15 doi: 10.3389/fnmol.2022.974890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Morris M., Knudsen G.M., Maeda S., Trinidad J.C., Ioanoviciu A., Burlingame A.L., et al. Tau post-translational modifications in wildtype and human amyloid precursor protein transgenic mice. Nat. Neurosci. 2015;18:1183–1189. doi: 10.1038/nn.4067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Perez-Riverol Y., Bandla C., Kundu D.J., Kamatchinathan S., Bai J., Hewapathirana S., et al. The PRIDE database at 20 years: 2025 update. Nucleic Acids Res. 2025;53:D543–D553. doi: 10.1093/nar/gkae1011. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data S1
mmc1.xlsx (32.3MB, xlsx)
Supplemental Data S2
mmc2.xlsx (1.4MB, xlsx)
Supplemental Data S3
mmc3.xlsx (140.1KB, xlsx)
Supplemental Data S4
mmc4.xlsx (41.8KB, xlsx)
Supplemental Figures and Table
mmc5.docx (55.9MB, docx)

Data Availability Statement

This article contains supplemental data, including all validated data reported in tabular format. The mass spectrometry proteomics data, including raw, .msf, and annotated/indexed glycopeptide spectra, have been deposited to the ProteomeXchange Consortium via the PRIDE (90) partner repository with the dataset identifier PXD063995 and 10.6019/PXD063995. Annotated data used for the generation of figures, as well as all codes utilized for generating figures and processing data, are available upon request to the corresponding author.


Articles from Molecular & Cellular Proteomics : MCP are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES