Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Aug 29:2024.08.28.610074. [Version 1] doi: 10.1101/2024.08.28.610074

O-glycosylation contributes to mammalian glycoRNA biogenesis

Jennifer Porat 1,2,*, Christopher P Watkins 1,2,*, Chunsheng Jin 3,*, Xixuan Xie 4,5, Xiao Tan 6,7,8,9, Charlotta G Lebedenko 1,2, Helena Hemberger 1,2, Woojung Shin 6,10, Peiyuan Chai 1,2, James J Collins 6,9,11, Benjamin A Garcia 5, Daniel Bojar 12, Ryan A Flynn 1,2,13,#
PMCID: PMC11384000  PMID: 39257776

Abstract

There is an increasing appreciation for the role of cell surface glycans in modulating interactions with extracellular ligands and participating in intercellular communication. We recently reported the existence of sialoglycoRNAs, where mammalian small RNAs are covalently linked to N-glycans through the modified base acp3U and trafficked to the cell surface. However, little is currently known about the role for O-glycosylation, another major class of carbohydrate polymer modifications. Here, we use parallel genetic, enzymatic, and mass spectrometry approaches to demonstrate that O-linked glycan biosynthesis is responsible for the majority of sialoglycoRNA levels. By examining the O-glycans associated with RNA from cell lines and colon organoids we find known and previously unreported O-linked glycan structures. Further, we find that O-linked glycans released from small RNA from organoids derived from ulcerative colitis patients exhibit higher levels of sialylation than glycans from healthy organoids. Together, our work provides flexible tools to interrogate O-linked glycoRNAs (O-glycoRNA) and suggests that they may be modulated in human disease.

Introduction

Glycosylation of RNAs is an emerging topic but remains poorly understood in both scope and function. Initially, a metabolic reporter of sialic acid (peracetylated N-azidoacetylmannosamine, Ac4ManNAz) was used to discover that small noncoding RNAs from mammalian cells were modified with N-glycans1. These N-glycans have a variety of compositions, which include sialic acid and fucose modifications1,2. These “sialoglycoRNAs” are presented on the external surface of live cells and can interact with Sialic acid binding Ig-like lectin (Siglec) receptors1. Thereafter, sialoglycoRNAs were also found on neutrophils, and treatment of the neutrophils with RNase caused a reduction in the ability for neutrophils to be recruited to sites of inflammation3. In addition, RNase treatment of THP-1 monocytes and macrophages impaired their adhesion to epithelial cells, suggesting that cell surface RNAs may function in various cell surface processes4. More recently, we reported a covalent linkage between an N-glycan and RNA that occurs via the modified base 3-(3-amino-3-carboxypropyl)uridine (acp3U)5, providing evidence of a direct RNA-glycan connection.

In the context of the initial glycoRNA discovery, the Ac4ManNAz signal was robustly sensitive to PNGaseF cleavage1, directing focus towards characterizing N-glycans. Imaging technologies such as sialic acid aptamer and RNA in situ hybridization-mediated proximity ligation assay (ARPLA), which can visualize specific sialoglycoRNAs on the surface of cells, demonstrated loss of signal after PNGaseF cleavage4. Using a newly developed glycomics method leveraging Data Independent Acquisition (DIA) mass spectrometry (MS) methods (called glycanDIA) we recently reported the profile of N-glycans released from small RNAs across five primary murine tissues, finding many sialylated and fucosylated species6. Consistent with this, work using more traditional Data Dependent Analysis (DDA) MS found similar features of N-glycans released from twelve primary human tissues2. Curiously, during the course of our characterization of the acp3U-glycan linkage and using RNA optimized periodate oxidation and aldehyde labeling (rPAL), a non-metabolic reporter approach to detect native sialoglycoRNAs, we found that PNGaseF and loss of acp3U biosynthesis enzymes had a significant but minor reduction in the total levels of rPAL signal5. This suggested that the bulk of the rPAL-labeled sialoglycoRNA signal was not derived from N-glycans.

O-linked glycans are another major type of protein glycosylation. Much of protein O-glycan synthesis is initiated by addition of N-acetylgalactosamine (GalNAc) to serine or threonine residues by one of up to 20 Golgi-localized GalNAc transferases7. In contrast to N-glycans, where a single core glycan is linked to the amide nitrogen on asparagine by OligoSaccharyl Transferase (OST), there are 8 major core structures found in mammalian O-glycans. Depending on the available glycosyltransferase genes, the addition of galactose (Gal) to the initial GalNAc (Galβ1→3GalNAc) forms Core-1 structures and are generated (e.g. in HEK293 cells) in the Golgi by C1GALT1 (the core-1 synthase) and its obligate chaperone COSMC (C1GALT1C1)8. Core-1 structures (Figure 1A) are found across the body while the others are more tissue selective9,10. In line with this, previous glycomics of wild type HEK293 cells revealed that the majority of glycans were based on Core-1 and Core-2 structures11. The initiating GalNAc or Core-1 structures can be sialylated on the Gal or GalNAc by Golgi-localized sialyltransferases, or N-acetylglucosamine (GlcNAc) can be added to the Core-1 structure by Core-2 synthases to generate a Core-2 O-glycan (Figure 1A)12. The lack of a common core structure among O-glycans, as well as the lack of an enzyme capable of releasing intact O-glycans, analogous to PNGase F for N-glycan removal (reviewed in 13), have made structural studies of O-glycans difficult and to date, no O-glycans have been unambiguously detected on RNA.

Figure 1. SialoglycoRNAs are dependent O-glycan biosynthetic enzymes.

Figure 1.

A. Schematic of N-glycan biosynthesis (Endoplasmic Reticulum, Golgi inset) and O-glycan biosynthesis, with the associated glycosyltransferases responsible for each maturation step.

B. RNA periodate oxidation and aldehyde ligation (rPAL) northern blot of total RNA from wild type HEK293, STT3A Knockout (KO), STT3B KO, and COSMC KO. An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown, along with a quantification of sialoglycoRNA signal (n= 3 biological replicates, mean±SEM, unpaired student’s t-test).

C. rPAL northern as in (B), here with WT vs. COSMC KO clone #2 vs. C1GALT1 KO.

D. rPAL northern as in (B), here with WT vs. MGAT1 KO.

E. rPAL northern as in (B), here with WT vs. GCNT1 KO.

F. rPAL northern as in (B), here with WT vs. GALNT1/GALNT2/GALNT3 KO.

G. RNA proximity labeling (rPL) of live HEK293 cells using the O-glycan binding lectin VVL (binding GalNAc), followed by in vitro digestion of extracted total RNA with sialidase or RNase. *lower molecular weight (MW) signal, ** ultra-high molecular weight signal are noted.

H. rPL as in (G), here fractionating small and large RNAs.

I. rPL as in (G), here labeling in proximity of VVL (binding a terminal GalNAc) or the N-glycan binding lectin WGA (binding GlcNac and sialic acid) on WT, COSMC KO, or C1GALT1 KO HEK293 cells.

Here we investigate the hypothesis that some of the glycans on RNA may be of an O-linked nature. We use parallel technical approaches to interrogate the composition of O-glycans released from small RNAs isolated from mammalian cells. Using single-gene knockout cell lines and rPAL, we identify the glycosyltransferase genes required for cellular sialoglycoRNA biogenesis. We also present an optimized workflow to label glycoRNAs with galactose oxidase (GAO), expanding the pool of gel-detectable glycoRNAs beyond sialoglycoRNAs. We characterize sialylated and non-sialylated O-glycan structures on mammalian RNA using DIA and DDA mass spectrometry, revealing known and novel O-glycan structures. Finally, we demonstrate that O-linked glycans on RNA from colon organoids derived from patients with Ulcerative Colitis exhibit higher levels of sialic acid than healthy organoids, suggesting that O-linked glycoRNAs may become dysregulated in disease states. Together we uncover O-glycoRNA as a major contributor to the total levels of glycoRNAs in mammalian cells.

Results

Genetic dissection of human glycosyltransferases reveals O-glycans as the major component of sialoglycoRNAs

Recent large-scale efforts have been undertaken to examine how each glycosyltransferase (GTase) impacts various aspects of glycan production in human cells11,14, however these efforts were focused largely on glycoprotein phenotypes. Here we applied rPAL labeling to 21 individual GTase knockout (KO) HEK293 cells14. Importantly, rPAL labels native sialoglycoRNA5, thereby eliminating the need for metabolic reporters that may have different sialyltransferase preferences15. We initially profiled the rPAL signal generated from total RNA extracted from WT HEK293 cells, OST KOs (STT3A-KO or STT3B-KO, involved in N-glycan biogenesis), or COSMC-KO8. Neither STT3A-KO nor STT3B-KO led to a significant loss of overall rPAL signal, while the distribution of rPAL signal from STT3A-KO was slightly up-shifted (Figure 1B, Table S1). In contrast, COSMC-KO led to a ~75% loss in rPAL signal and a broadening of the remaining signal, resulting in a split smear (Figure 1B). Loss of COSMC leads to truncated O-glycans and an overall reduction in sialylation14, consistent with a loss of sialic acid containing O-glycans on RNA as read out in the rPAL blots. We confirmed here that the loss of C1GALT1 mirrored that of COSMC-KO as well as validated the COSMC phenotype with a second KO clone (Figure 1C). Loss of MGAT1, critical for the biogenesis of all known complex N-glycans amenable for sialic acid capping (Figure 1A), did not result in a statistically significant reduction in rPAL signal but was accompanied by a minor change in migration in the gel (Figure 1D, S1A, Table S1).

Given the apparent contribution of O-glycan biogenesis enzymes to rPAL signal, we examined how Core-1 and Core-2 synthases impacted rPAL signal. HEK293 cells have been shown be dominated by Core-2 structures, with no Core-3 or Core-4 structures on protein templates11. COSMC/C1GALT1 are important for generating Core-1 and -2 type O-glycans, so we evaluated the rPAL signal after GCNT1-KO (responsible for Core-2 type O-glycans) and while we found no obvious change in intensity compared to WT, there was a downshifting in the rPAL signal (Figure 1E, S1B), suggesting some contribution of Core-2 O-glycans to rPAL signal. Since we saw a loss of rPAL signal in COSMC and C1GALT1 KOs, we would predict that truncated (asialylated) O-glycans remain or are increased in their apparent abundance. Consistent with this, combinatorial KO of COSMC and over expression (OE) of the sialyltransferase ST6GALNAC1 (which can sialylate GalNAc) recovered the rPAL signal lost after KO of COSMC alone (Figure S1C). Finally, much of the biosynthetic flux of O-glycosylation occurs through the action of polypeptide N-acetylgalactosaminyltransferase (ppGalNAc-T, GALNT) 1 and 2 with additional contribution from ppGalNAc-T3, which glycosylate serine and threonine on proteins in the secretory pathway16,17. Combined KO of ppGALNT1,2,3 resulted in approximately 60% reduction in the rPAL signal (Figure 1F), confirming a dependency on cellular, secretory O-glycosylation.

There are 20 known membrane-bound sialyltransferase enzymes that transfer sialic acid to Gal in humans, some of which have specificity for various templates (i.e., lipids vs. proteins18). As orthogonal validation of the rPAL signal being due to sialic acid labeling, combined knockout of the sialyltransferases ST3GAL1/2/3/4/5/6 (α2-3 linked sialic acids) and ST6GAL1/2 (α2-6 linked sialic acids) resulted in robust loss of sialoglycoRNA labeling with rPAL (Figure S1D). In the background of a complete ST3-KO, knockdown of ST6GAL1/2 did not result in any further changes to rPAL signal (Figure S1D). More limited combinations of KO demonstrated partial effects on the rPAL signal, showing that in isolation, loss of ST3GAL4/6, ST3GAL1/2, and ST6GAL1/2 can cause partial reductions in rPAL signal (Figure S1A, S1B) and that combining ST3GAL3/4/6 and ST6GAL1/2 KOs can make the rPAL reduction more robust (Figure S1A). Consistent with this, single gene KOs of ST3GAL1, 2, or 3 had minimal effect on the total rPAL levels, although knockout of ST3GAL1 or 2 slowed the mobility of sialoglycoRNAs (Figure S1E). Taken together, when cells can mostly generate α2-3 linked sialic acids (ST6 loss) the rPAL signal shifts down (faster migration) while when the cells can mostly generate α2-6 linked sialic acid the rPAL signal shifts up (slower migration). Overexpression experiments confirmed this, as over-expression of ST3GAL4 (in a ST3GAL3/4/6 KO) resulted in increased rPAL signal and molecular weight shift down (Figure S1D) while over-expression of ST6GAL1 (in a ST3GAL3/4/6, ST6GAL1/2 KO) partially increased the rPAL signal but shifted the molecular weight up (Figure S1A, Table S1).

Small RNAs are in proximity to O-glycans on the cell surface

We previously demonstrated that terminal GlcNAcβ binding lectins like WGA19 are in proximity to sialoglycoRNAs on the surface of HeLa cells1; however we did not previously assess if O-glycan binding lectins are also in proximity to cell surface sialoglycoRNAs. We next investigated the ability of various biotinylated lectins to label any RNA on the surface of living HEK293 cells (Methods). Importantly, these “RNA proximity labeling” (rPL)20 experiments monitor the levels of cell surface RNA in proximity to specific lectin ligands, rather than the abundance of the lectin ligand directly. All lectins used were able to label proteins on the surface of WT and KO cells, confirming the binding and labeling activity (Figure S1F). Consistent with our previous observations on HeLa cells, the sialic acid-binding lectins WGA and MAA-II generated high molecular weight (MW) biotin signal (Figure S1G). We also examined the labeling pattern surrounding ligands of Jacalin (mainly binding Core-1 O-glycans in HEK293 cells19) and VVL (GalNAc-binding, Tn antigen19) and found that both lectins clearly produced high MW signal, with Jacalin being more similar to WGA and MAA-II. VVL and WGA labeling also produced an ultra-high MW signal (**) as well as signal closer to the 28S region (Figure S1G, lower MW, *).

Given that VVL has high specificity for the truncated Tn antigen (terminal GalNAc) O-glycan structure19 and displayed a different pattern for high MW biotin signal, we investigated if this biotin signal was related to sialoglycoRNAs. Notably, the rPL assay reports on RNA species in physical proximity to ligands of the utilized lectin (here VVL) on the cell surface. In vitro RNase digestion reduced the signal of all bands, including the high MW band and putative rRNA bands (Figure 1G). In vitro sialidase treatment of the VVL-rPL signal selectively reduced the signal near the 28S rRNA region while maintaining the ultra-high MW region, suggesting that some of the RNA species proximal to VVL substrates are likely sialoglycoRNA (Figure 1G). Further, the high MW and 28S-adjacent signal fractionated with the small RNAs and not the long RNAs, consistent with our previous findings that N-glycans are found on small RNA1 (Figure 1H). We repeated the proximity labeling using WGA and VVL on WT, COSMC-KO, and C1GALT1-KO14 HEK293 cells. C1GALT1-KO cells were included to assess if the catalytic activity of O-glycosylation is required or if this observation relates to a non-O-glycosylation activity of COSMC. Proximity labeling around WGA was minimally affected by the loss of either COSMC or C1GALT1 (Figure 1I). In contrast, RNA labeled in proximity to VVL was affected by the loss of COSMC and C1GALT1, specifically the disappearance of the lower-MW biotin signal (*) (Figure 1I). COSMC or C1GALT1 KO lead to O-glycan truncation and loss of some mature, sialylated O-glycans8, which is consistent with the loss of the sialidase-sensitive lower-MW biotin signal (*) in the COSMC and C1GALT1 KO. Truncated mature surface-presented O-glycans in the KO cell lines are still accessible to VVL, leading to the maintenance of higher MW, more poorly sialylated proximal RNA. Together, these data suggest that small RNAs are in close proximity to O-glycans on the cell surface.

Galactose oxidase labels glycoRNA

Although our findings indicate that the O-glycan biogenesis pathway contributes to sialoglycoRNA levels, many O-glycan structures lack sialic acid in HEK29311 (Figure 1A), so we sought out an alternative labeling method to expand our detection of glycoRNA beyond sialoglycoRNAs and allow derivatization of another monosaccharide. Recent studies have proposed the use of galactose oxidase (GAO) for labeling glycoRNA21,22, although biochemical evidence of GAO’s ability to selectively label glycoRNAs has not been demonstrated yet. GAO is a fungal enzyme that oxidizes galactose (Gal) and N-acetylgalactosamine (GalNAc) at the C6 position23. Reasoning that this would allow us to label Gal and GalNAc on O-glycans (Figure 2A), we optimized a set of conditions that would allow robust labeling while minimizing RNA degradation. Many sources of commercial GAO exist from multiple fungal species and with a wide variety of expression and purification methods; it was therefore necessary to investigate their ability to selectively oxidize glycoRNA in vitro. Indeed, when we tested three commercial sources of Dactylium dendroides GAO (from Worthington, MedChem Express, and Sigma, whole protein analysis of the commercial stocks in Figure S2A), we observed that Worthington GAO specifically generated a product in the presence of small RNA, while MedChem Express GAO generated the same product regardless of whether small RNA was present or absent in the reaction (Figure 2B). Although the signal generated by Sigma GAO was RNA-dependent, labeling was substantially weaker than that observed with Worthington, so all subsequent optimizations were performed using Worthington GAO.

Figure 2. Galactose oxidase (GAO) labels glycoRNA.

Figure 2.

A. Reaction schematic of the oxidation of galactose (Gal) or N-acetylgalactosamine (GalNAc), followed by ligation of biotin (ARP-Bt) to the newly generated aldehyde.

B. GAO northern with and without wild type HEK293 small RNA labeled with GAO purchased from Worthington, MedChem Express, or Sigma. *RNA-independent labeling using MedChem Express GAO, **RNA-dependent labeling using Sigma GAO. An in-gel Sybr stain of small RNA and detection of biotinylated glycoRNA on the membrane (Strep) are shown.

C. GAO northern as in (B), with wild type and C1GALT1 KO HEK293 small RNA pre-treated with sialidase. A quantification of GAO signal is shown (n= 3 biological replicates, mean±SEM, unpaired student’s t-test).

D. GAO northern as in (C), with native (non sialidase-treated) wild type and C1GALT1 KO HEK293 small RNA.

E. Heatmap of O-glycan structures released from K562 and HEK293 small RNA and identified by Data Independent acquisition (DIA). Glycan structures are indicated (n= 3 biological replicates).

GAO is a Cu(II) metalloenzyme and a potent oxidizer23, to the point that addition of GAO itself is sufficient to degrade small RNA (Figure S2B). The addition of RNaseOUT, EDTA, or heat inactivation of GAO failed to rescue RNA degradation, suggesting that degradation is likely not due to contaminating RNase in GAO or GAO catalytic activity (Figure S2B). To mitigate the propensity of GAO to degrade small RNA, we titrated the amount of GAO and used a reaction buffer consisting of 20% DMSO and 100 U of HRP per 1 U of GAO (Figure S2C), which enhance reaction conditions and rate through multiple mechanisms24. We optimized the reactivity of HRP by using a 50 mM phosphate buffer of pH 6, ensuring robust protection of small RNA integrity through high HRP activity (Figure S2C). Importantly, Gal and GalNAc are commonly present as terminal and penultimate carbohydrates in N- and O-glycans. When Gal and GalNAc are penultimate carbohydrates, they are linked to the terminal sialic acid at either carbon 3 or 6, with the frequency of which linkage being variable across cell type and cell status. The α2-6 sialic acid linkage directly prevents oxidation by GAO, requiring removal of sialic acid. We therefore introduced a sialidase step prior to the GAO reaction for all samples when establishing our optimized reaction conditions. Finally, in vitro RNase digestion prior to GAO labeling resulted in a loss of signal, confirming that our labeling conditions are specific for RNA (Figure S2D).

Next, we tested the ability of GAO to detect differences in glycoRNA resulting from O-glycan truncation by comparing GAO labeling of RNA from wild type and C1GALT1 KO cells (Figure 2C). We detected no significant difference when we included a sialidase digestion prior to GAO labeling, consistent with the idea that any changes in accessibility of the C6 position on Gal or GalNAc as a result of decreased sialylation in the C1GALT1 KO would be lost upon sialidase treatment of small RNA. In contrast, omitting the sialidase step to enable profiling of native, intact glycoRNA led to increased GAO labeling of the C1GALT1 KO relative to wild type RNA (Figure 2D). Taken together, our optimized GAO labeling reaction is suitable for profiling mammalian glycoRNA and yields additional biological insight into the sialylation state of glycoRNA upon comparison of signal with and without prior sialidase treatment.

O-glycomics reveal an expanded set of glycans that purify with RNA

As an orthogonal means to examine O-glycans on RNA, we purified small RNA from the HEK293 (kidney) and K562 (lymphoblast) immortalized cell lines and used O-glycomics to profile glycans released from small RNA. Because N-glycan profiles have now been established, we focused on possible O-glycan signals. O-glycans were released from small RNA material using standard reductive beta-elimination procedures and processed for MS using positive mode ionization and Data Independent Acquisition (DIA) scans (glycanDIA6). In general, the glycoRNAs of HEK293 and K562 cells were well separated and samples clearly clustered by cell line (Figure 2E, Table S2), with very few structures shared across all samples, namely sialylated Core-1 O-glycans. We identified a cluster of sialylated Core-2 structures that were characteristic of HEK293 cells, while K562 exclusively presented Core-1 O-glycans on their glycoRNAs, similar to what was seen in a genome-wide knockout screen of Siglec-7 ligands on K562 cells25. The finding of Core-1 and -2 structures on HEK293 cells matches those found in protein-linked O-glycans in that cell line26. Similarly, for K562 cells, lectin microarray studies27 support the prevalence of Core-1 O-glycans, where it should be noted that this measurement likely also inadvertently co-measured glycoRNAs on the surface of these cells.

O-glycomics on human colon organoids and primary colon tissue RNA

The diversity of N-glycans in glycoRNAs from primary tissues was greater than what we initially observed from cell lines1,6, To extend our analysis of glycoRNA beyond 2D cell culture, we set out to profile O-glycans on RNA isolated from 3D colon organoids derived from healthy colon as well as from Ulcerative Colitis (UC) patients28, which would also allow us to model a disease process. First, we examined the gel patterns of the sialoglycoRNAs between colon cancer cells (Caco2) and an organoid derived from non-inflamed colon tissue. Here we found a dramatic increase in intensity and a down-shifting of the rPAL signal, per mass of recovered total RNA (Figure 3A). The highly abundant and broader smearing rPAL signal observed from the organoid RNA was RNase sensitive (Figure S3A), as we would expect for sialoglycoRNAs. We then performed rPAL labeling and O-glycomics on RNA from colon organoids derived from 3 healthy patients (Figure 3B, C). Here we used negative mode ionization, given the higher abundance of rPAL signal and thus negatively charged sialoglycans, Data Dependent Acquisition (DDA), and glycosidase treatments to characterize O-glycan structures from organoid small RNA (Figure 3C). For DDA glycomics, sialidase treatment of released O-glycans prior to glycomics measurements further enhanced the structural elucidation of the neutral backbones of sialylated glycans, which otherwise only presented uninformative general sialic acid fragmentation products (e.g., B1 at m/z 290). Analysis of glycan compositions of O-glycans released from RNA revealed that many compositions have sialic acid (NeuAc) (Figure 3C), in agreement with the intense rPAL signal observed for organoid sialoglycoRNA.

Figure 3. Mass spectrometry defines the O-glycan profile of glycoRNAs.

Figure 3.

A. rPAL northern blot of total RNA from a colon organoid derived from healthy cells and colon cancer cells (Caco 2). An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown.

B. rPAL northern as in (A), here with total RNA from colon organoids derived from healthy patients (n= 3 biological replicates).

C. Glycan composition percent (for all compositions with a percent greater than 0) of O-glycans released from RNA from healthy organoids and identified by Data-Dependent Acquisition (DDA) (n= 3 biological replicates).

D. rPAL northern as in (A), here with total RNA from colon organoids derived from non-inflamed (n) or inflamed (i) cells (n= 3 biological replicates).

E. Heatmap of O-glycans identified by DDA in small RNA from organoids derived from inflamed Ulcerative Colitis (iUC) and non-inflamed Ulcerative Colitis (nUC) cells (two-tailed Welch’s t-test corrected by the Benjamini-Hochberg procedure, n= 3 biological replicates).

F. MS/MS spectrum of a novel sulfated O-fucose, SGalβ1-4(Fucα1-3)GlcNAcβ1-3Fucα (m/z 576).

To understand whether this pattern changed in the context of disease, separate organoids were generated using cells from inflamed (iUC) and non-inflamed (nUC) regions of the colon from the same patient. rPAL analysis of small RNA from these organoids demonstrated similar total levels but a small down-shifting in the glycoRNA signal in the inflamed samples as compared to the non-inflamed regions (Figure 3D). These samples were analyzed for their RNA-linked O-glycans using DDA-MS, detecting 105 unique O-glycans (Table S3, Figure S3B); for DDA we were able to structurally characterize 56 glycans (Table S3). Consolidating this analysis into glycan motifs29, we noticed that nearly all of the glycans were O-GalNAc and O-Fucose (Fuc) type glycans, as well as a few peeling-like structures with hexose at the reducing end (peeling products from beta-elimination reviewed in 13), terminating in galactose (Figure 3E, Table S3). We next evaluated if there were any statistical differences in the glycan motifs in the inflammatory environment of UC on glycoRNAs, similar to what has been reported for other types of glycans30. We performed a differential glycomics expression and identified a shift in the glycome, with a separate clustering of samples from healthy and inflamed tissue (Figure 3E, blue stars). This clustering was mainly driven by significantly higher levels of sialylated glycoRNAs in organoids from inflamed tissue (e.g., glycoRNAs exhibiting sialyl-Lewis X and VIM motifs), while the glycoRNAs from normal tissue predominantly exposed undecorated terminal LacNAc units that were sialylated in the inflamed case. This is in agreement with our rPAL results showing a slight downshifting of glycoRNA from inflamed organoids (Figure 3D), as sialyl-Lewis X motifs contain α2-3 sialic acid linkages, which migrate faster than α2-6 linkages (Figure S1B, ST6 KO). Overall, the identified structures predominantly comprised Core-2 and Core-1 structures, with very few Core-5 structures (Table S3). This is notable, as such a profile would instead be expected from salivary mucin glycans31, not the colon from which the tissues have been extracted, and which is typically rich in Core-3 and -4 structures on its protein-linked glycans32.

Examining the glycoforms in more detail, we noticed multiple structures derived from our glycoRNA samples that have, to our knowledge, never been described before (using a graph-based search in the glycowork-internal glycan database33, accounting for linkage ambiguities). Beyond GalNAc, we found examples of extended O-Fucosylated (Fuc) glycans that exhibited sulfated Lewis structures (e.g., SGalβ1-4(Fucα1-3)GlcNAcβ1-3Fucα) (Figure 3F, Figure S3C, Table S4), which has not yet been reported. These sulfated structures could be relevant in the context of the reported increase in Siglec binding that is mediated by sulfation34, as we have shown that Siglecs bind to glycoRNAs1. In total, we identified 6 O-fucosylated O-glycans (m/z 530, 610, 676, 756, 821, 967), of which 2, m/z 530 and 821, are known glycan structures that have been described in human coagulation Factor IX35,36 and Notch137 (Figure S3, Table S4). The Y ions at m/z 165 in both MS2 and MS3 (inset) suggests the presence of Fuc at the reducing end and the cross-ring cleavage at m/z 263 indicated a β1,4-linked terminal Gal residue (Figure S3D). Our observation of a Lewis X type O-fucose is new, where fragmentation ions at m/z 366 and 590 suggest that one Fuc is attached to an internal N-acetylglucosamine (GlcNAc) and another Fuc is at the reducing end (Figure 3F). The MS3 of fragment ions at m/z 610 confirms that the sulfated Gal is linked to the 4 position of GlcNAc (3.5A cleavage at m/z 315) (Figure 3F, Table S4).

Discussion

Here we have expanded upon our previous observations that cell surface glycoRNAs are modified with N-glycan structures1,5 to demonstrate the dependency of glycoRNA levels on O-glycan biosynthesis and presence of O-glycans on RNAs: O-glycoRNA.

We demonstrate the utility of previously developed tools for glycoRNA analysis, such as native sialoglycoRNA labeling with rPAL5, lectin proximity labeling1, GAO labeling21,22, and glycanDIA6, and DDA-MS for profiling O-glycans on glycoRNA. In addition to providing information on sialoglycoRNA abundance—where we have shown that the majority of rPAL signal from cancer cell lines comes from O-glycans—rPAL can also provide information on glycan composition through qualitative and quantitative analysis of gel migration. The nature of the sialic acid linkage influences the apparent molecular weight of the glycoRNA signal, with α2-6 sialic acid linkages migrating slower than α2-3 linkages (Figure 1). These gel-based insights, coupled with systematic analysis of the effects of knocking out human glycosyltransferases, have thus revealed that sialoglycoRNAs detected by rPAL are chemically diverse and heterogeneous in nature, consisting of N-glycans1,5 and O-glycans, which contain α2-3 and α2-6 sialic acid linkages. Importantly, this composition is specific to HEK293 cells, as our O-glycomics revealed unique glycoRNA profiles across cancer cell types and human colon organoids and tissues (Figure 3). It will therefore be interesting to assess the relative contribution of different glycosyltransferases to rPAL and GAO signals from different cell lines, cell types, and tissue samples to continue understanding how different glycan biosynthetic pathways result in cell type-specific glycoRNA profiles.

Previous work demonstrated GAO-mediated labeling of total RNA, hydrazide bead coupling, and PNGaseF release to analyze N-glycoRNA21 and more recently it was reported that O-glycosidase treatment of the hydrazide beads leads to release of RNA22, although the biochemical specificity of GAO in total RNA was not made clear in these works. We note that GAO could also label tRNA containing the hypermodification galactosyl-queosine (GalQ)38,39 or other currently known RNA modifications, so specific examination of the labeling patterns of new glycoRNA labeling methods is critical. Here, we have established an updated protocol using GAO to label mammalian glycoRNA and demonstrated its utility for profiling sialylated and asialylated glycoRNA (Figure 2). Our method has been optimized for maintaining RNA integrity, which is an important consideration for additional applications, such as sequencing GAO-labeled glycoRNA. Further, we show that truncated O-glycans can be labeled with GAO, and that truncation can be assessed by comparing GAO signal with and without sialidase treatment prior to oxidation. Thus, combined use of rPAL and GAO labeling will enable the characterization of an expanded set of mammalian glycoRNAs. Further, extending these methods for native glycoRNA isolation and coupling this to mass spectrometry could provide evidence of specific linker structures that connect O-glycans directly to mammalian small RNAs.

Lectin blotting is another informative tool for inferring the glycan composition of glycoconjugates including glycoRNA, and repurposing lectins for cell surface proximity labeling provides additional topological insight demonstrating the presence of small RNA proximal to O-glycans on the cell surface. As cell surface RNA has been linked to various immune functions such as neutrophil recruitment to sites of inflammation3 and monocyte binding to epithelial cells4, future work should address the relative contribution of N-glycosylation and O-glycosylation of RNA using the single gene KOs we have biochemically defined here or inhibitors of various glycosyltransferases.

When comparing the cell line glycans to the colon organoids, we found that the glycans released from the organoids comprise nearly only O-linked glycans (Figure 3). It cannot be excluded that small amounts of N-glycans may also be present on these samples, yet it seems clear that these samples have a much greater abundance of O-glycans, analogous to the rich protein-linked O-glycosylation of mucins in this tissue40. A possible explanation for this could be a partly shared biosynthetic machinery for glycoproteins and glycoRNA, which predisposes cells in these tissue niches to produce O-glycans on their proteins and RNA. We further note that, even in the case of the cell lines with robustly measurable N-glycans on their RNA, O-glycans still dominated the total signal.

The restructuring of the RNA O-glycome in the comparison of inflamed and uninflamed tissue regions, with statistically significant and interpretable differences even at very small sample sizes, indicates the potential of this new class of biomolecules as a source for diagnostic biomarkers and potential therapeutic intervention. We envision that diagnostic information derived from glycoRNAs could be synergistic in combination with glycomic information from other parts of the glycome, for instance enabling the future development of glycan-based risk scores that integrate glycoproteomics and glycanDIA and DDA for glycoRNA. Given the potentially related aspects of glycan biosynthesis between protein- and RNA-linked glycans, future work should elucidate which features of the respective glycome are regulated “globally” (all biopolymers) and which are subject to carrier-specific regulation (protein vs. RNA). This would both advance the research of these important molecules as well as identify opportunities for specifically modulating aspects of the glycome for therapeutic purposes.

Individual loss of GTase activity can lead to cell compensation which may alter the flux through the glycosylation pathway outside of the specific gene perturbed. For example, loss of early GTase activity like that with OST or GALNTs can change the expression of other GTases and chaperones that depend on N- and O-glycans for proper biological function. Therefore, the effect of an individual GTase may need to be examined in more depth and with other strategies outside simple KO to fully understand their impact on glycoRNA biosynthesis. More generally, our work here does not achieve a direct understanding of the linkage between the O-glycans and small RNA. While this will be important to fully characterize the biosynthesis and atomic nature of O-glycoRNAs, it is likely that significant technology development will be required to enable this discovery.

Methods

Data and code availability

  • Raw Mass Spectrometry data have been deposited in the GlycoPOST database and accession numbers are listed below

Mass Spectrometry Data for organoids (Data Dependent Acquisition) This Study GlycoPOST GPST000463
Mass Spectrometry Data for HEK293 and K562 (Data Independent Acquisition) This Study GlycoPOST, GPST000455

Cell culture

HEK293 and K562 (ATCC) cells were grown at 37°C and 5% CO2. HEK293 cells were cultured in DMEM media (ThermoFisher Scientific) supplemented with 10% fetal bovine serum (FBS, ThermoFisher Scientific) and 1% penicillin/streptomycin (P/S, ThermoFisher Scientific). K562 cells were cultured in RPMI-1640 media (ThermoFisher Scientific) supplemented with 10% FBS and 1% P/S. All HEK293 KO cells were a gift from the Copenhagen Center for Glycomics14,41. All cells were checked for and maintained as mycoplasma free.

Colon organoid culture

Human colon organoids were grown in 3D culture and harvested as detailed in 28. In brief, human colon organoids previously generated from de-identified patient specimens (gift of Dr. Hyun Jung Kim) were removed from liquid nitrogen and thawed at 37°C. 500 μL thawed organoids were resuspended into 9 mL IntestiCult Human Organoid Growth Medium (Stemcell technologies) and centrifuged at 150 g at 4°C for 5 minutes. After removal of supernatant, ice cold Matrigel (Corning) was used to resuspended the organoids (30 μL Matrigel for each 100-200 organoids). The organoids in Matrigel were then plated onto 24-well sterile tissue culture plates (Falcon 353047) on ice with ~30 μL per well. The plates were inverted to allow 3D Matrigel domes to set for 10 minutes at 37°C. Then, 500 μL of IntestiCult growth medium were added to each well, and organoids were continuously cultured in a humidified cell culture chamber (18% O2, 5% CO2). Media was changed every other day, and growth was monitored using a phase-contrast microscope.

When significant cellular debris accumulated in the lumen of the organoids (generally 7-10 days), the organoids were split into subcultures by removing media from each well, adding 500 μL ice cold Cell Recovery Solution (Corning), and incubating on ice for 30 minutes to dissolve Matrigel. Wells of the same experimental group were combined into a 15 mL conical tube and centrifuged at 100 g at 4 °C for 5 minutes. Supernatant was removed and 1 mL of room temperature TrypLE Express (GIBCO) was added to each conical tube. The solution was pipetted up and down 5 times before addition of 9 mL room temperature PBS. The conical tubes were then centrifuged at 100 g at 4°C for 5 minutes to pellet the organoids, and supernatant was removed. 30 μL of ice cold Matrigel per each desired new well was added to the pellet on ice with a goal split ratio of 1:3 to 1:5. Organoids were resuspended in Matrigel with gentle pipetting, and 30 uL was pipetted into each well of new 24-well sterile tissue culture plates on ice. The plates were then inverted and to allow 3D Matrigel domes to set for 10 minutes at 37°C. Then, 500 μL of IntestiCult growth medium were added to each well, and organoids were continuously cultured in a humidified cell culture chamber (18% O2, 5% CO2) as above.

Once sufficient quantities of organoids were obtained by continuous culture, organoids were harvested from Matrigel by incubation in Cell Recovery Solution as described above and pelleted. The pellets were then resuspended in 1 mL RNAzol RT (Molecular Research Center) and vortexed for 1 minute before being flash frozen in liquid nitrogen.

RNA isolation, Native sialoglycoRNA labeling (rPAL), and blotting

RNA was extracted with RNAzol RT (Molecular Research Center, Inc.) largely following the manufacturer's protocol and as we have previously reported42. For purification of the aqueous phase, it was transferred to clean tubes and 1.1X volumes of isopropanol was added. The RNA is then purified over a Zymo column (Zymo Research). First, 350 μL of pure water was added to each column and spun at 10,000x g for 30 seconds, and the flowthrough was discarded. Next, precipitated RNA from the RNAzol RT extraction (or binding buffer precipitated RNA, below) is added to the columns, spun at 10,000x g for 10-20 seconds, and the flowthrough is discarded. This step is repeated until all the precipitated RNA is passed over the column once. Next, the column is washed three times total: once using 400 μL RNA Prep Buffer (3M GuHCl in 80% EtOH), twice with 400 μL 80% ethanol. The first two spins are at 10,000x g for 20 seconds, the last for 30 sec. The RNA is then treated with Proteinase K (Ambion) on the column. Proteinase K is diluted 1:19 in water and added directly to the column matrix, and then allowed to incubate on the column at 37°C for 45 min. The column top is sealed with either a cap or parafilm to avoid evaporation. After the digestion, the columns are brought to room temperature for 5 min. Next, eluted RNA is spun out into fresh tubes and a second elution with water is performed. To the eluate, 1.5 μg of the mucinase StcE (Sigma-Aldrich) is added for every 50 μL of RNA, and placed at 37°C for 30 minutes to digest. The RNA is then cleaned up again using a Zymo column. Here, 2 volumes of RNA Binding Buffer (Zymo Research) was added and vortexed for 10 seconds, and then 2 volumes (samples + buffer) of 100% ethanol was added and vortexed for 10 sec. This is then bound to the column, cleaned up as described above, and eluted twice with water. The final enzymatically digested RNA is quantified using a Nanodrop under the manufacturer’s RNA settings. For small RNA isolation, 2 volumes of adjusted RNA binding buffer (1:1 Zymo RNA Binding Buffer and 100% ethanol) is added to total RNA and vortexed, followed by binding to a Zymo column. Column flowthrough, consisting of small RNA, is collected, added to 2 volumes of 100% ethanol, vortexed, and bound to a fresh Zymo column.The columns are washed twice with 400 μL 80% ethanol, as above, and eluted twice with water.

rPAL labeling was performed as previously described42. Briefly lyophilized, enzymatically treated RNA was blocked. To make the blocking buffer, 1μL 16 mM mPEG3-Ald (BroadPharm), 15 μL 1 M MgSO4 and 12 μL 1 M NH4OAc pH5 (with HCl) are mixed together (final buffer composition: 570 μM mPEG3-Ald + 500 mM MgSO4 + 450 mM NH4OAc pH5). 28μL of the blocking buffer is added to the lyophilized RNA, mixed completely by vortexing, and then incubated for 45 minutes at 35°C to block. The samples are briefly allowed to cool to room temperature (2-3 min), then working quickly, 1 μL 30 mM aldehyde reactive probe (ARP/aminooxy biotin, Cayman Chemicals, stock made in water) is added first, then 2μL of 7.5 mM NaIO4 is added. The periodate is allowed to perform oxidation for exactly 10 minutes at room temperature in the dark. The periodate is then quenched by adding 3 μL of 22 mM sodium sulfite (stock made in water). The quenching reaction is allowed to proceed for 5 minutes at 25°C. Both the sodium periodate and sodium sulfite stocks were made fresh within 20 minutes of use. Next, the reactions are moved back to the 35°C heat block, and the ligation reaction is allowed to occur for 90 min. The reaction is then cleaned up using a Zymo-I column. 19 μL of water is added to bring the reaction volume to 50 μL, and the Zymo protocol is followed as per the above details. If samples will be analyzed on an agarose gel for glycoRNA visualization, the RNA is then eluted from the column using 2X 6.2μL water (final volume approximately 12 μL).

To visualize the periodate labeled RNA, it is run on a denaturing agarose gel, transferred to a nitrocellulose (NC) membrane, and stained with streptavidin in a manner as per 42. After elution from the column as described above, the RNA is combined with 12 μL of Loading Buffer (95% formamide, 18 mM EDTA, 0.025% SDS) with a final concentration of 2X SybrGold (ThermoFisher Scientific) and denatured at 55°C for 10 minutes. Immediately after this incubation, the RNA is placed on ice for at least 2 minutes. The samples are then loaded into a 1% agarose, 0.75% formaldehyde, 1.5x MOPS Buffer (Lonza) denaturing gel. RNA is electrophoresed in 1x MOPS at 115V for between 34 or 45 min, depending on the length of the gel. Subsequently, the RNA is visualized on a UV gel imager, and excess gel is cut away; leaving ~0.75 cm of gel around the outer edges of sample lanes will improve transfer accuracy. The RNA is transferred as previously described with a buffer of 3M NaCl solution at pH 1, for 90 minutes at 25 °C42. Post transfer, the membrane is rinsed in 1x PBS and dried on Whatman Paper (GE Healthcare). Dried membranes are rehydrated in Intercept Protein-Free Blocking Buffer, TBS (Li-Cor Biosciences), for 30 minutes at 25 °C. After the blocking, the membranes are stained using Streptavidin-IR800 (Li-Cor Biosciences) diluted 1:5,000 in Intercept blocking buffer for 30 minutes at 25 °C. Excess Streptavidin-IR800 was washed from the membranes using three washes with 0.1% Tween-20 (Sigma) in 1x PBS for 3 minutes each at 25 °C. The membranes were then briefly rinsed with PBS to remove the Tween-20 before scanning. Membranes were scanned on a Li-Cor Odyssey CLx scanner (Li-Cor Biosciences).

Galactose oxidase (GAO) labeling and blotting

For the initial sialidase treatment, a 10 uL reaction is performed using 2.5 U of α2-3,6,8 neuraminidase (NEB; Cat. No. P0720L) and 1 U of α2-3,6,8,9 neuraminidase (NEB; Cat. No. P0722L), 50 mM phosphate buffer, pH 6.0, and small RNA (up to 10 ug) for 1 hour at 37 °C. During the final 10 minutes of incubation, both GAO (Worthington; Cat. No. LS004524) and HRP (Sigma-Aldrich; Cat. No. 516531) are reconstituted in UltraPure water (1 U/uL for GAO and 20 U/uL for HRP). Importantly, these reagents must be freshly reconstituted as they are both transition metal-dependent oxidases and lose activity during storage after reconstitution. Following this, 10 uL of the GAO label mix is added, consisting of 100 U of HRP, 1 U of GAO, and 4 uL of DMSO (final concentrations: 5 U/uL HRP, 0.05 U GAO/uL, 20% DMSO). This reaction proceeds at room temperature in the dark for 15 minutes, after which 1 uL of 30 mM ARP-Biotin (Cayman chemicals, Cat. No. 10009350) is added and the reaction incubated at 35 °C for 30 minutes in the dark. Samples were then cleaned using Zymo I columns and eluted using two subsequent volumes of 6.7 uL. For detection by Northern blot, GAO-labeled RNA is loaded into a 1% agarose, 0.75% formaldehyde, 0.8X MOPS Buffer (Lonza) denaturing gel and electrophoresed, transferred, and probed as above.

In vitro enzyme digestions

All digestions were performed on 30 μg of total RNA in a 20 μL at 37 °C for 60 min. To digest RNA the following was used: 3 μL of RNase cocktail (0.5U/mL RNaseA and 20U/mL RNase T1, Thermo Fisher Scientific) with 20 mM Tris-HCl (pH 8.0), 100 mM KCl and 0.1 mM MgCl2. To digest sialic acid: 2 μL of α2-3,6,8 Neuraminidase (50U/mL, New England Biolabs, NEB) with GlycoBuffer 1 (NEB) was added.

RNA proximity labeling (rPL)

This was performed largely as described previously1 with specific modifications noted here. HEK293 cells were cultured as above typically in 10 cm plates. Cells were rinsed once in ice-cold 1x PBS, discarding after each wash, and blocked in Lectin Blocking Buffer (LBB, 20 mM HEPES, 150 mM NaCl, 1 mM MgCl2, 1 mM MnCl2, 1 mM CaCl2, 0.5% BSA) for 15 min at 4 °C. Blocking buffer was then discarded and replaced with LBB+Lectin+Strep-HRP. Typically, 4 mL of this was prepared at a concentration of 5 μg/mL biotinylated lectin and 6 μg/mL Strep-HRP, these components were first mixed together on ice for 30 min prior to the addition of LBB. LBB+Lectin+Strep-HRP staining occurred for 45 min at 4 °C, after which the cells were rinsed twice in ice-cold 1x PBS + 1 mM CaCl2 + 1 mM MgCl2 (PBS++). Immediately after this 3 mL of PBS++ with 200 μM biotin-aniline (Iris Biotech GMBH) was added to each plate and incubated on ice for 1 min. Plates were then moved to the bench top and H2O2 was added to a final concentration of 1 mM. This reaction occurred for precisely 2 min, after which plates were brought back on ice, PBS++/biotin-aniline/ H2O2 was aspirated, and cells were quickly but gently rinsed twice in Quenching Buffer: 5 mM Trolox, 10 mM sodium ascorbate and 10 mM sodium azide in PBS++. After removing the Quenching Buffer, TRIzol was added directly to the plate and RNA extracted and processed as described above for enzymatic digestions as well as blotting.

DDA glycomics and analysis

For O-glycan analysis, lyophilized RNA samples were incubated with 50 μL 0.5 M NaBH4 and 50 mM NaOH at 50 °C overnight. The released O-glycans were desalted using Dowex 50W. After dryness, the sample was dissolved in 15 μL of water for liquid chromatography-electrospray ionization tandem mass spectrometry (LC-ESI/MS).

The oligosaccharides were separated on a column (10 cm × 250 μm) packed in-house with 5 μm porous graphite particles (Hypercarb, Thermo-Hypersil, Runcorn, UK). The oligosaccharides were injected on to the column and eluted with an acetonitrile gradient (Buffer A, 10 mM ammonium bicarbonate; Buffer B, 10 mM ammonium bicarbonate in 80% acetonitrile). The gradient (0-45% Buffer B) was eluted for 46 min, followed by a wash step with 100% Buffer B, and equilibrated with Buffer A in next 24 min. A 40 cm × 50 μm i.d. fused silica capillary was used as a transfer line to the ion source.

The samples were analyzed in negative ion mode on a LTQ linear ion trap mass spectrometer (Thermo Electron, San José, CA), with an IonMax standard ESI source equipped with a stainless steel needle kept at −3.5 kV. Compressed air was used as nebulizer gas. The heated capillary was kept at 270°C, and the capillary voltage was −50 kV. Full scan (m/z 380-2000, two microscan, maximum 100 ms, target value of 30,000) was performed, followed by data-dependent MS2 scans (two microscans, maximum 100 ms, target value of 10,000) with normalized collision energy of 35%, isolation window of 2.5 units, activation q=0.25 and activation time 30 ms). The threshold for MS2 was set to 300 counts. Data acquisition and processing were conducted with Xcalibur software (Version 2.0.7).

DIA glycomics (glycanDIA)

The extracted RNAs were resuspended with 200 μL of 100 mM HEPES buffer, and the mixture was heated using a thermomixer at 100 °C for 2 min. The removal of N-glycans was performed by adding 4 μL of PNGase F (500,000 units/mL, NEB), followed by incubation at 37 °C overnight. The O-glycosylated RNAs were purified by the RNA column (Zymo Research), as noted above. The O-glycans were released using beta-elimination as described in the established protocol43. Briefly, the eluted RNAs (45μL) were mixed with 5μL of 2M NaOH and 50μL of 2M NaBH4, and the mixture was incubated at 45 °C for 18 hrs. The reaction was quenched by acetic acid on ice, followed by desalting the O-glycans using the Hypercarb SPE 96-well plate (Thermo Scientific) with 0.1% (v/v) TFA in water washing step and elution with 40% (v/v) ACN and 0.05% (v/v) TFA in water. The eluted O-glycans were dried using the SpeedVac system (Thermo Scientific) and reconstituted with 40% (v/v) ACN and 1% (v/v) TFA in water for further iSPE-HILIC (HILICON) purification. The purified sample was dried using the SpeedVac system and reconstituted in water for LC-MS/MS analysis.

The GlycanDIA analysis was employed as previously described with some modifications for O-glycan analysis6. O-glycans were directly characterized using a Waters nanoAcquity UPLC System coupled with a SCIEX ZenoTOF 7600 mass spectrometer. Analytes were separated using Hypercarb Porous Graphitic Carbon HPLC Column (1 mm x 100 mm, 3 μm, Thermo Scientific) at a constant flow rate of 40 μL/min with MS buffer A (0.1% formic acid (v/v) and buffer B (ACN with 0.1% FA (v/v). The chromatography gradient consisted of 0–2 min, 0% B; 2–20 min, 0-16% B; 20–40 min, 16%-72% B; 40–42 min, 72%-100% B; 42-52 min, 100% B; 52-54 min 100%-0% B; 54-70 min 0% B. Ion source gas1, gas2, and curtain gas were at 40, 60, and 35 psi, respectively. O-Glycans were detected under SWATH positive mode, MS1 spectra were acquired over a mass range of m/z 300–1650 with 0.25 s accumulation time. The 40 SWATH mass windows were built based on the O-glycan library (Table S4), and collision-induced fragmentation was performed with nitrogen gas using dynamic collision energy. The product ions were monitored from the range m/z 120-1600 with 20 ms accumulation time.

Glycomics analysis

The DIA data was analyzed using GlycanDIA Finder6 and validated manually using Skyline44. All glycans with structural annotations were used to renormalize the relative abundance of each sample to a total sum of 100%. Missing data were imputed from a left-shifted Gaussian distribution (mean = 0.01, standard deviation = 0.01) if group means were below three, otherwise via a k-nearest neighbor imputer, with k = 3. Then, the quantify_motifs function from glycowork (version 0.8) was used with the keyword known, to convert relative abundances of sequences into those of motifs, followed by a renormalization. Variance-based filtering was used to remove motifs with a total variance lower than 1%, followed by variance-stabilizing normalization (i.e., log-transformation and scaling of samples to zero mean, unit variance). Differential expression of motifs then was tested via Welch’s t-tests, followed by a Benjamini-Hochberg correction for multiple testing. All the above is contained within the get_differential_expression function (glycowork version 0.8).

Heatmaps for cell line data:

Using the relative abundances of O-linked glycans from triplicates of HEK293 and K562 cells, we used the get_heatmap and annotate_figure functions from glycowork (version 0.8.133) to depict the hierarchical clustering of glycan structures rendered in the SNFG style.

Heatmaps for organoid data:

Heatmaps for organoid glycoRNAs from normal and inflamed tissue from ulcerative colitis patients were generated similar to the ones for cell lines, with the difference of setting “motifs = True” in get_heatmap, choosing the motif subset of “known” within glycowork (version 0.8.1). Differential expression was performed using the get_differential_expression function within glycowork, using the default settings as described in29, with the same motif choice as the heatmap. Significantly differentially expressed glycan motifs between organoid glycoRNA from normal and inflamed tissue are depicted on the heatmap, along with their adjusted p-values.

Supplementary Material

Supplement 1

Table S1. Summary of rPAL signal intensity and migration changes upon GTase KOs and OEs.

media-1.docx (12.9KB, docx)
Supplement 2

Table S2. Data independent acquisition Mass Spec O-glycomics from HEK293 and K562 cell lines.

media-2.xlsx (13.2KB, xlsx)
Supplement 3

Table S3. Data dependent acquisition Mass Spec O-glycomics from organoids.

media-3.xlsx (115.8KB, xlsx)
Supplement 4

Table S4. O-glycan library and GlycanDIA window setup for the analysis.

media-4.xlsx (56.8KB, xlsx)
Supplement 5

Figure S1. Dissection of O-glycan biosynthesis with sialoglycoRNA labeling.

A. rPAL northern with WT, MGAT1 KO, ST3GAL4/6 KO, ST3GAL3/4/6+ST6GAL1/2 KO, and ST3GAL3/4/6+ST6GAL1/2 KO + ST6GAL1 over expression (OE). An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown, along with a quantification of sialoglycoRNA signal (n= 3 biological replicates,mean±SEM, unpaired student’s t-test).

B. rPAL northern as in (A), here with WT, GCNT1 KO, ST3GAL1/2 KO, and ST6GAL1/2 KO.

C. rPAL northern as in (A), here with WT, COSMC KO + ST6GALNAC1 OE.

D. rPAL northern as in (A), here with WT, ST3GAL1/2/3/4/5/6, ST3GAL1/2/3/4/5/6+ST6GAL1/2, and ST3GAL3/4/6 KO + ST3GAL4 OE.

E. rPAL northern as in (A), here with WT, ST3GAL1 KO, ST3GAL2 KO, ST3GAL3 KO clone a, ST3GAL3 KO clone b.

F. Western blot of whole cell lysate from RNA proximity labeling (rPL) experiments. Total protein stains from each sample (left) demonstrate even loading while biotin signal (Strep) demonstrate Lectin-dependent patterns of biotinylation.

G. RNA proximity labeling (rPL) northern blot of live HEK293 cells using the following biotinylated lectins: control (no lectin), WGA, Jacalin, VVL, and MAA-I. *lower molecular weight (MW) signal, ** ultra-high molecular weight signal are noted.

Figure S2: Optimization of GAO for glycoRNA labeling

A. SDS-PAGE of Worthington, MedChemExpress, and Sigma GAO (65-68 kDa).

B. Urea PAGE of HEK293 small RNA incubated with 1 or 10 units of Worthington GAO supplemented with RNaseOUT or EDTA, or heat-inactivated GAO.

C. GAO northern of HEK293 small RNA treated with various amounts of Worthington GAO and HRP. An in-gel Sybr stain of small RNA and detection of biotinylated glycoRNA on the membrane (Strep) are shown.

D. GAO northern as in (B) of GAO-labeled HEK293 small RNA with and without RNase A/T1/III pre-treatment (n= 3 technical replicates, mean±SEM, unpaired student’s t-test).

Figure S3. rPAL and mass spectrometry characterization of O-glycans on RNA from Ulcerative Colitis tissue-derived organoids.

A. rPAL northern of total RNA from nUC organoids with or without RNase A/T1 treatment. An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown.

B. Heatmap of O-glycan sequences released from organoids derived from inflamed (iUC) and non-inflamed (nIC) Ulcerative Colitis tissue and identified by Data Dependent Acquisition (DDA) (n=3 biological replicates)

C. Previously unreported O-glycan structures that were discovered from the DDA-based MS analysis of O-glycans derived from human colon organoid small RNA. Proposed compositions and linkages are shown and annotated.

D. MS/MS spectrum of one conventional O-fucose, Galβ1-4GlcNAcβ1-3Fucα (m/z 530).

ACKNOWLEDGMENTS

We thank Henrik Clausen, Yoshiki Narimatsu, Sergey Vakhrushev, and Yang Zhang for their generous sharing of clonal knockout cells of the glycosyltransferases as well as extensive and thoughtful discussions of O-glycan biogenesis. We also thank Stacy Malaker for helpful discussions about O-glycan mass spectrometry and other members of the Flynn lab for helpful comments and discussions.

This work was supported by grants from Burroughs Wellcome Fund Career Award for Medical Scientists (R.A.F.), The Rita Allen Foundation (R.A.F.), National Institute of General Medical Sciences of the National Institutes of Health under award number GM151157 (R.A.F.), Branco Weiss Fellowship – Society in Science (D.B.), the Knut and Alice Wallenberg Foundation (D.B.), and the University of Gothenburg, Sweden (D.B.), the American Gastroenterological Association Research Scholar Award (X.T.), NIH NIDDK 1K08DK132516 (X.T.), and the Wyss Institute Clinical Fellowship (X.T.). We thank SciLifeLab and BioMS, funded by the Swedish research council, for providing financial support to the Proteomics Core Facility, Sahlgrenska Academy.

Footnotes

DECLARATION OF INTERESTS

R.A.F. is a stockholder of ORNA Therapeutics. R.A.F. is a board of directors member and stockholder of Chronus Health and Blue Planet Systems. The other authors declare no competing interests.

References

  • 1.Flynn R.A., Pedram K., Malaker S.A., Batista P.J., Smith B.A.H., Johnson A.G., George B.M., Majzoub K., Villalta P.W., Carette J.E., et al. (2021). Small RNAs are modified with N-glycans and displayed on the surface of living cells. Cell 184, 3109–3124.e22. 10.1016/j.cell.2021.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bi M., Zhang Z., Wang T., Liang H., and Tian Z. (2023). A draft of human N-glycans of glycoRNA. bioRxiv, 2023.09.18.558371. 10.1101/2023.09.18.558371. [DOI] [Google Scholar]
  • 3.Zhang N., Tang W., Torres L., Wang X., Ajaj Y., Zhu L., Luan Y., Zhou H., Wang Y., Zhang D., et al. (2024). Cell surface RNAs control neutrophil recruitment. Cell 187, 846–860.e17. 10.1016/j.cell.2023.12.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ma Y., Guo W., Mou Q., Shao X., Lyu M., Garcia V., Kong L., Lewis W., Ward C., Yang Z., et al. (2023). Spatial imaging of glycoRNA in single cells with ARPLA. Nature Biotechnology. 10.1038/s41587-023-01801-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Xie Y., Chai P., Till N.A., Hemberger H., Lebedenko C.G., Porat J., Watkins C.P., Caldwell R.M., George B.M., Perr J., et al. (2024). The modified RNA base acp3U is an attachment site for N-glycans in glycoRNA. Cell. 10.1016/j.cell.2024.07.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Xie Y., Liu X., Zhao C., Chen S., Wang S., Lin Z., Robison F.M., George B.M., Flynn R.A., Lebrilla C.B., et al. (2024). Development and application of GlycanDIA workflow for glycomic analysis. bioRxiv, 2024.03.12.584702. 10.1101/2024.03.12.584702. [DOI] [Google Scholar]
  • 7.de Las Rivas M., Lira-Navarrete E., Gerken T.A., and Hurtado-Guerrero R. (2019). Polypeptide GalNAc-Ts: from redundancy to specificity. Curr Opin Struct Biol 56, 87–96. 10.1016/j.sbi.2018.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ju T., and Cummings R.D. (2002). A unique molecular chaperone Cosmc required for activity of the mammalian core 1 beta 3-galactosyltransferase. Proc Natl Acad Sci U S A 99, 16613–16618. 10.1073/pnas.262438199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Brockhausen I., Wandall H.H., Hagen K.G.T., and Stanley P. (2022). O-GalNAc Glycans. In Essentials of Glycobiology, Varki A., Cummings R. D., Esko J. D., Stanley P., Hart G. W., Aebi M., Mohnen D., Kinoshita T., Packer N. H., Prestegard J. H., et al. , eds. (Cold Spring Harbor Laboratory Press; ), pp. 117–128. 10.1101/glycobiology.4e.10. [DOI] [Google Scholar]
  • 10.Brockhausen I. (2006). Mucin-type O-glycans in human colon and breast cancer: glycodynamics and functions. EMBO Rep 7, 599–604. 10.1038/sj.embor.7400705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Huang Y.-F., Aoki K., Akase S., Ishihara M., Liu Y.-S., Yang G., Kizuka Y., Mizumoto S., Tiemeyer M., Gao X.-D., et al. (2021). Global mapping of glycosylation pathways in human-derived cells. Dev Cell 56, 1195–1209.e7. 10.1016/j.devcel.2021.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wandall H.H., Nielsen M.A.I., King-Smith S., de Haan N., and Bagdonaite I. (2021). Global functions of O-glycosylation: promises and challenges in O-glycobiology. FEBS J 288, 7183–7212. 10.1111/febs.16148. [DOI] [PubMed] [Google Scholar]
  • 13.Wilkinson H., and Saldova R. (2020). Current Methods for the Characterization of O-Glycans. J Proteome Res 19, 3890–3905. 10.1021/acs.jproteome.0c00435. [DOI] [PubMed] [Google Scholar]
  • 14.Narimatsu Y., Joshi H.J., Nason R., Van Coillie J., Karlsson R., Sun L., Ye Z., Chen Y.-H., Schjoldager K.T., Steentoft C., et al. (2019). An Atlas of Human Glycosylation Pathways Enables Display of the Human Glycome by Gene Engineered Cells. Mol Cell 75, 394–407.e5. 10.1016/j.molcel.2019.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Chang P.V., Chen X., Smyrniotis C., Xenakis A., Hu T., Bertozzi C.R., and Wu P. (2009). Metabolic labeling of sialic acids in living animals with alkynyl sugars. Angew Chem Int Ed Engl 48, 4030–4033. 10.1002/anie.200806319. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bennett E.P., Mandel U., Clausen H., Gerken T.A., Fritz T.A., and Tabak L.A. (2012). Control of mucin-type O-glycosylation: a classification of the polypeptide GalNAc-transferase gene family. Glycobiology 22, 736–756. 10.1093/glycob/cwr182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shan A., Lu J., Xu Z., Li X., Xu Y., Li W., Liu F., Yang F., Sato T., Narimatsu H., et al. (2019). Polypeptide N-acetylgalactosaminyltransferase 18 non-catalytically regulates the ER homeostasis and O-glycosylation. Biochim Biophys Acta Gen Subj 1863, 870–882. 10.1016/j.bbagen.2019.01.009. [DOI] [PubMed] [Google Scholar]
  • 18.Varki A. (2008). Sialic acids in human health and disease. Trends Mol Med 14, 351–360. 10.1016/j.molmed.2008.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bojar D., Meche L., Meng G., Eng W., Smith D.F., Cummings R.D., and Mahal L.K. (2022). A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities. ACS Chem Biol 17, 2993–3012. 10.1021/acschembio.1c00689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kageler L., Perr J., and Flynn R.A. (2024). Tools to investigate the cell surface: Proximity as a central concept in glycoRNA biology. Cell Chem Biol 31, 1132–1144. 10.1016/j.chembiol.2024.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Li J., Yue S., Gao Z., Hu W., Liu Z., Xu G., Wu Z., Zhang X., Zhang G., Qian F., et al. (2023). Novel Approach to Enriching Glycosylated RNAs: Specific Capture of GlycoRNAs via Solid-Phase Chemistry. Anal Chem 95, 11969–11977. 10.1021/acs.analchem.3c01630. [DOI] [PubMed] [Google Scholar]
  • 22.Li J., Wang L., Chen Y., Zhang S., Wen Z., Zhen X., Zhang H., Zhou Y., Xu L., and Yang S. (2024). O-Glycosylated RNA Identification and Site-specific Prediction by Solid-phase Chemoenzymatic TnORNA method and PONglyRNA tool. bioRxiv, 2024.06.18.599663. 10.1101/2024.06.18.599663. [DOI] [Google Scholar]
  • 23.Whittaker J.W. (2002). Galactose oxidase. Adv Protein Chem 60, 1–49. 10.1016/s0065-3233(02)60050-6. [DOI] [PubMed] [Google Scholar]
  • 24.Asbjørn Toftgaard Pedersen William R. Birmingham, Rehn Gustav, Charnock Simon J., Turner Nicholas J., and Woodley John M. (2015). Process Requirements of Galactose Oxidase Catalyzed Oxidation of Alcohols. Org. Process Res. Dev. 2015, 19, 11, 1580–1589 19, 1580-1589. [Google Scholar]
  • 25.Wisnovsky S., Möckl L., Malaker S.A., Pedram K., Hess G.T., Riley N.M., Gray M.A., Smith B.A.H., Bassik M.C., Moerner W.E., et al. (2021). Genome-wide CRISPR screens reveal a specific ligand for the glycan-binding immune checkpoint receptor Siglec-7. Proc Natl Acad Sci U S A 118. 10.1073/pnas.2015024118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Fujitani N., Furukawa J., Araki K., Fujioka T., Takegawa Y., Piao J., Nishioka T., Tamura T., Nikaido T., Ito M., et al. (2013). Total cellular glycomics allows characterizing cells and streamlining the discovery process for cellular biomarkers. Proc Natl Acad Sci U S A 110, 2105–2110. 10.1073/pnas.1214233110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Tateno H., Uchiyama N., Kuno A., Togayachi A., Sato T., Narimatsu H., and Hirabayashi J. (2007). A novel strategy for mammalian cell surface glycome profiling using lectin microarray. Glycobiology 17, 1138–1146. 10.1093/glycob/cwm084. [DOI] [PubMed] [Google Scholar]
  • 28.Shin W., and Kim H.J. (2022). 3D in vitro morphogenesis of human intestinal epithelium in a gut-on-a-chip or a hybrid chip with a cell culture insert. Nat Protoc 17, 910–939. 10.1038/s41596-021-00674-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lundstrøm J., Urban J., and Bojar D. (2023). Decoding glycomics with a suite of methods for differential expression analysis. Cell Rep Methods 3, 100652. 10.1016/j.crmeth.2023.100652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kudelka M.R., Stowell S.R., Cummings R.D., and Neish A.S. (2020). Intestinal epithelial glycosylation in homeostasis and gut microbiota interactions in IBD. Nat Rev Gastroenterol Hepatol 17, 597–617. 10.1038/s41575-020-0331-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Chaudhury N.M.A., Proctor G.B., Karlsson N.G., Carpenter G.H., and Flowers S.A. (2016). Reduced Mucin-7 (Muc7) Sialylation and Altered Saliva Rheology in Sjögren’s Syndrome Associated Oral Dryness. Mol Cell Proteomics 15, 1048–1059. 10.1074/mcp.M115.052993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Thomsson K.A., Holmén-Larsson J.M., Angström J., Johansson M.E., Xia L., and Hansson G.C. (2012). Detailed O-glycomics of the Muc2 mucin from colon of wild-type, core 1- and core 3-transferase-deficient mice highlights differences compared with human MUC2. Glycobiology 22, 1128–1139. 10.1093/glycob/cws083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Thomès L., Burkholz R., and Bojar D. (2021). Glycowork: A Python package for glycan data science and machine learning. Glycobiology 31, 1240–1244. 10.1093/glycob/cwab067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Jung J., Enterina J.R., Bui D.T., Mozaneh F., Lin P.-H., Nitin, Kuo C.-W., Rodrigues E., Bhattacherjee A., Raeisimakiani P., et al. (2021). Carbohydrate Sulfation As a Mechanism for Fine-Tuning Siglec Ligands. ACS Chem Biol 16, 2673–2689. 10.1021/acschembio.1c00501. [DOI] [PubMed] [Google Scholar]
  • 35.Harris R., van Halbeek H., Glushka J., Basa L., Ling V., Smith K., and Spellman M. (1993). Identification and structural analysis of the tetrasaccharide NeuAc alpha(2–>6)Gal beta(1–>4)GlcNAc beta(1–>3)Fuc alpha 1->O-linked to serine 61 of human factor IX. Biochemistry 32, 6539–6547. 10.1021/bi00077a007. [DOI] [PubMed] [Google Scholar]
  • 36.Nishimura H., Takao T., Hase S., Shimonishi Y., and Iwanaga S. (1992). Human factor IX has a tetrasaccharide O-glycosidically linked to serine 61 through the fucose residue. Journal of Biological Chemistry 267, 17520–17525. 10.1016/S0021-9258(19)37073-5. [DOI] [PubMed] [Google Scholar]
  • 37.Moloney D.J., Shair L.H., Lu F.M., Xia J., Locke R., Matta K.L., and Haltiwanger R.S. (2000). Mammalian Notch1 Is Modified with Two Unusual Forms ofO-Linked Glycosylation Found on Epidermal Growth Factor-like Modules*. Journal of Biological Chemistry 275, 9604–9611. 10.1074/jbc.275.13.9604. [DOI] [PubMed] [Google Scholar]
  • 38.Zhang W., Xu R., Matuszek Ż., Cai Z., and Pan T. (2020). Detection and quantification of glycosylated queuosine modified tRNAs by acid denaturing and APB gels. RNA 26, 1291–1298. 10.1261/rna.075556.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zhao X., Ma D., Ishiguro K., Saito H., Akichika S., Matsuzawa I., Mito M., Irie T., Ishibashi K., Wakabayashi K., et al. (2023). Glycosylated queuosines in tRNAs optimize translational rate and post-embryonic growth. Cell 186, 5517–5535.e24. 10.1016/j.cell.2023.10.026. [DOI] [PubMed] [Google Scholar]
  • 40.Larsson J.M.H., Karlsson H., Sjövall H., and Hansson G.C. (2009). A complex, but uniform O-glycosylation of the human MUC2 mucin from colonic biopsies analyzed by nanoLC/MSn. Glycobiology 19, 756–766. 10.1093/glycob/cwp048. [DOI] [PubMed] [Google Scholar]
  • 41.Narimatsu Y., Joshi H.J., Yang Z., Gomes C., Chen Y.-H., Lorenzetti F.C., Furukawa S., Schjoldager K.T., Hansen L., Clausen H., et al. (2018). A validated gRNA library for CRISPR/Cas9 targeting of the human glycosyltransferase genome. Glycobiology 28, 295–305. 10.1093/glycob/cwx101. [DOI] [PubMed] [Google Scholar]
  • 42.Xie Y., Hemberger H., Till N.A., Chai P., Watkins C.P., Lebedenko C.G., Caldwell R.M., George B.M., Bertozzi C.R., Garcia B.A., et al. (2023). The modified RNA base acp3U is an attachment site for N-glycans in glycoRNA. bioRxiv. 10.1101/2023.11.06.565735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Li Q., Xie Y., Wong M., Barboza M., and Lebrilla C.B. (2020). Comprehensive structural glycomic characterization of the glycocalyxes of cells and tissues. Nat Protoc 15, 2668–2704. 10.1038/s41596-020-0350-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.MacLean B., Tomazela D.M., Shulman N., Chambers M., Finney G.L., Frewen B., Kern R., Tabb D.L., Liebler D.C., and MacCoss M.J. (2010). Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968. 10.1093/bioinformatics/btq054. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

Table S1. Summary of rPAL signal intensity and migration changes upon GTase KOs and OEs.

media-1.docx (12.9KB, docx)
Supplement 2

Table S2. Data independent acquisition Mass Spec O-glycomics from HEK293 and K562 cell lines.

media-2.xlsx (13.2KB, xlsx)
Supplement 3

Table S3. Data dependent acquisition Mass Spec O-glycomics from organoids.

media-3.xlsx (115.8KB, xlsx)
Supplement 4

Table S4. O-glycan library and GlycanDIA window setup for the analysis.

media-4.xlsx (56.8KB, xlsx)
Supplement 5

Figure S1. Dissection of O-glycan biosynthesis with sialoglycoRNA labeling.

A. rPAL northern with WT, MGAT1 KO, ST3GAL4/6 KO, ST3GAL3/4/6+ST6GAL1/2 KO, and ST3GAL3/4/6+ST6GAL1/2 KO + ST6GAL1 over expression (OE). An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown, along with a quantification of sialoglycoRNA signal (n= 3 biological replicates,mean±SEM, unpaired student’s t-test).

B. rPAL northern as in (A), here with WT, GCNT1 KO, ST3GAL1/2 KO, and ST6GAL1/2 KO.

C. rPAL northern as in (A), here with WT, COSMC KO + ST6GALNAC1 OE.

D. rPAL northern as in (A), here with WT, ST3GAL1/2/3/4/5/6, ST3GAL1/2/3/4/5/6+ST6GAL1/2, and ST3GAL3/4/6 KO + ST3GAL4 OE.

E. rPAL northern as in (A), here with WT, ST3GAL1 KO, ST3GAL2 KO, ST3GAL3 KO clone a, ST3GAL3 KO clone b.

F. Western blot of whole cell lysate from RNA proximity labeling (rPL) experiments. Total protein stains from each sample (left) demonstrate even loading while biotin signal (Strep) demonstrate Lectin-dependent patterns of biotinylation.

G. RNA proximity labeling (rPL) northern blot of live HEK293 cells using the following biotinylated lectins: control (no lectin), WGA, Jacalin, VVL, and MAA-I. *lower molecular weight (MW) signal, ** ultra-high molecular weight signal are noted.

Figure S2: Optimization of GAO for glycoRNA labeling

A. SDS-PAGE of Worthington, MedChemExpress, and Sigma GAO (65-68 kDa).

B. Urea PAGE of HEK293 small RNA incubated with 1 or 10 units of Worthington GAO supplemented with RNaseOUT or EDTA, or heat-inactivated GAO.

C. GAO northern of HEK293 small RNA treated with various amounts of Worthington GAO and HRP. An in-gel Sybr stain of small RNA and detection of biotinylated glycoRNA on the membrane (Strep) are shown.

D. GAO northern as in (B) of GAO-labeled HEK293 small RNA with and without RNase A/T1/III pre-treatment (n= 3 technical replicates, mean±SEM, unpaired student’s t-test).

Figure S3. rPAL and mass spectrometry characterization of O-glycans on RNA from Ulcerative Colitis tissue-derived organoids.

A. rPAL northern of total RNA from nUC organoids with or without RNase A/T1 treatment. An in-gel Sybr stain of total RNA and detection of biotinylated sialoglycoRNA on the membrane (Strep) are shown.

B. Heatmap of O-glycan sequences released from organoids derived from inflamed (iUC) and non-inflamed (nIC) Ulcerative Colitis tissue and identified by Data Dependent Acquisition (DDA) (n=3 biological replicates)

C. Previously unreported O-glycan structures that were discovered from the DDA-based MS analysis of O-glycans derived from human colon organoid small RNA. Proposed compositions and linkages are shown and annotated.

D. MS/MS spectrum of one conventional O-fucose, Galβ1-4GlcNAcβ1-3Fucα (m/z 530).

Data Availability Statement

  • Raw Mass Spectrometry data have been deposited in the GlycoPOST database and accession numbers are listed below

Mass Spectrometry Data for organoids (Data Dependent Acquisition) This Study GlycoPOST GPST000463
Mass Spectrometry Data for HEK293 and K562 (Data Independent Acquisition) This Study GlycoPOST, GPST000455

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES