Skip to main content
RNA Biology logoLink to RNA Biology
. 2020 Apr 21;17(7):1018–1039. doi: 10.1080/15476286.2020.1750842

The large repertoire of 2’-O-methylation guided by C/D snoRNAs on Trypanosoma brucei rRNA

K Shanmugha Rajan a, Yinzhou Zhu b,*, Katerina Adler a, Tirza Doniger a, Smadar Cohen-Chalamish a, Ankita Srivastava b, Moran Shalev-Benami c, Donna Matzov c, Ron Unger a, Christian Tschudi d, Arthur Günzl b, Gordon G Carmichael b, Shulamit Michaeli a,
PMCID: PMC7549662  PMID: 32250712

ABSTRACT

The parasite Trypanosoma brucei cycles between insect and mammalian hosts, and is the causative agent of sleeping sickness. Here, we performed genome-wide mapping of 2ʹ-O-methylations (Nms) on trypanosome rRNA using three high-throughput sequencing methods; RibOxi-seq, RiboMeth-seq and 2ʹ-OMe-seq. This is the first study using three genome-wide mapping approaches on rRNA from the same species showing the discrepancy among the methods. RibOxi-seq detects all the sites, but RiboMeth-seq is the only method to evaluate the level of a single Nm site. The sequencing revealed at least ninety-nine Nms guided by eighty-five snoRNAs among these thirty-eight Nms are trypanosome specific sites. We present the sequence and target of the C/D snoRNAs guiding on rRNA. This is the highest number of Nms detected to date on rRNA of a single cell parasite. Based on RiboMeth-seq, several Nm sites were found to be differentially regulated at the two stages of the parasite life cycle, the insect procyclic form (PCF) versus the bloodstream form (BSF) in the mammalian host.

KEYWORDS: Trypanosomes, RibOxi-seq, RiboMeth-seq, 2ʹ-OMe-seq, rRNA, snoRNA

Introduction

One of the most widespread modifications in the cell is the methylation of the ribose 2ʹ-OH moiety which is found not only on stable RNAs such as tRNAs, rRNAs, and snRNAs, but also on mRNAs [1,2]. It was initially suggested that this modification confers stability to RNA by reducing the reactivity of the oxygen thereby preventing a nucleophilic attack and subsequent cleavage of the adjacent phosphodiester bond [3].

Most of the 2ʹ-O-methylations (Nms) in eukaryotes are guided by a group of small nucleolar RNAs (snoRNAs), known as C/D snoRNAs, and are mediated by the RNA associated methyltransferase, NOP1/Fibrillarin. C/D snoRNAs are named after short sequence motifs, the C box (5ʹ-RUGAUGA-3ʹ) and the D box (5ʹ-CUGA-3ʹ). Many snoRNAs have an additional box (C’ and D’ boxes). Most box C/D snoRNAs have regions of complementarity to 10–22 nt-long sequences in mature rRNA. Recent studies suggest that snoRNAs with even 8 nt complementary to rRNA are able to properly guide the modification [4]. According to the +5 rule, the methylated nt on the rRNA is the complement to the fifth nucleotide upstream of box D or D’. Each C/D snoRNP consists of the snoRNA and a set of four binding proteins, NOP56, NOP58, SNU13 and NOP1/Fibrillarin [5,6]. It was shown that trypanosome snoRNAs are no exception and bind all four proteins [7]. The snoRNA repertoire and the number of Nm sites vary among eukaryotes ranging from 55 Nms in yeast to 112 Nms in humans [8]. A second group of snoRNAs guides pseudouridine (Ψ) on rRNA, and these are known as H/ACA snoRNAs [5]. Most of the Nm and Ψ sites were shown to be located in functional domains of the rRNA such as the peptidyl transferase centre (PTC), helix 69 (H69) and the A-site finger domain of the large subunit, as well as the decoding centre of the small subunit [9].

Trypanosoma brucei (T. brucei) is an infamous parasite that cycles between two hosts. The form propagating in the insect host is known as the procylic form (PCF), whereas the parasite that multiplies in mammals is the bloodstream form (BSF). Cycling between two hosts requires adaptation, especially to growth temperature but also to nutrient levels [10]. Trypanosomes are known for having unique molecular pathways, such as trans-splicing of nuclear pre-mRNA [11] and RNA editing of mitochondrial pre-mRNA [12].

Previous insights into the rich repertoire of trypanosome snoRNAs were obtained mainly through in silico analyses. The first described set of snoRNAs consisted of 91 members which comprised 57 C/D snoRNAs and 34 H/ACA RNAs with the potential to direct 84 Nms and 32 Ψ, respectively [13]. The first RNA-seq of T. brucei small RNAs identified 20 novel snoRNA candidates (14 H/ACA and 6 C/D snoRNAs) [14]. In trypanosomatids, most of the snoRNA genes exist in clusters, as was shown in T. brucei [13] and Leishmania major (L. major) [15,16]. Whereas C/D snoRNA genes occur exclusively in such clusters, which typically harbour interspersed genes of C/D and H/ACA snoRNAs, H/ACA snoRNAs also exist as solitary genes. All snoRNA genes are transcribed by RNA polymerase II, and individual RNAs are then processed from the primary transcript by an as yet unknown mechanism [17]. Pre-snoRNA transcripts are trans-spliced and polyadenylated [18]. We recently demonstrated that polyadenylation of pre-snoRNA transcripts is mediated by two poly (A) polymerases, PAP2 and PAP1 [19].

Mapping of Ψ on T. brucei rRNA in PCF and BSF trypanosomes identified 68 Ψs on rRNA, 21 of which were shown to be hypermodified in BSF as a result of increased levels of the guiding snoRNAs [20]. Overexpression of snoRNAs, which guide Ψ in the H69 domain of rRNA, accelerated the growth of PCF parasites at 30°C. Interestingly, these modifications are predicted to significantly alter the secondary structure of the LSU rRNA, suggesting that hypermodified positions may contribute to the required changes in ribosome function during cycling between the two hosts [20].

Recent Cryo-EM studies were able to visualize the Nm modification in ribosomes of these parasites. 105 Nms were visualized in the trypanosomatid Leishmania donovani (L. donovani) rRNA [21] and 112 Nms were reported for T. cruzi rRNA [22]. Genome-wide mapping of Nm sites was recently performed in yeast and mammals using three different approaches [2330]. Nm mapping traditionally relied on the property of reverse transcriptase to pause one nt before the modified Nm in the presence of low dNTP concentrations [31]. The high throughput application of this method, 2ʹ-OMe-seq, was recently utilized for Nm mapping [23] and found to be less reliable in Nm quantification compared to other high-throughput methods such as RiboMeth-seq and mass spectrometry-based methods [8,28]. We recently developed a new method, termed RibOxi-seq, that is independent of reverse transcription and is based on the resistance of 3ʹ terminal, methylated ribose moieties to periodate (IO4-) treatment, leaving only fragments containing the 2ʹ-O-methylated residue suitable for ligation to adaptors and sequencing [29]. A third high throughput method used for mapping and quantifying Nm is RiboMeth-seq, which is based on alkaline RNA fragmentation. Nm induces protection at the 3ʹ-adjacent phosphodiester RNA bond from cleavage at alkaline conditions [30]. Under such conditions, RNA fragments starting or ending at the +1-nucleotide relative to the modified residue, were shown to be excluded from libraries prepared from fragmented RNA [24].

In this study, we mapped sites of Nm modification at single base resolution in the two life stages of T. brucei grown in culture using RibOxi-seq, RiboMeth-seq and 2ʹ-OMe-seq. We detected 99 Nms guided by 85 snoRNAs in the two life stages similar to the number found in humans [29].

Materials and methods

Cell growth and transfection

Procyclic T. brucei 427 wild-type strain and the genetically modified 29–13 strains, which carries integrated genes for T7 polymerase and the tetracycline repressor, were grown in SDM-79 medium supplemented with 10% fetal calf serum. Medium for 29–13 cells contained 50 μg/ml hygromycin and 15 μg/ml G418. T. brucei 427 BSF wild-type cells and the 131–514 cell line were aerobically cultivated at 37°C under 5% CO2 in HMI-9 medium supplemented with 10% fetal calf serum. Cell line 131–514 was grown in the presence of 2 μg/ml G418, and 2.4 μg/ml phleomycin, as previously described [7,13,20,32].

Primer extension and Northern analyses

Primer extension was performed as previously described [13]. The extension products were analysed on 12% acrylamide denaturing gels. For northern analysis, total RNA was extracted, separated on either 10% acrylamide denaturing gel or agarose-formaldehyde gel, and analysed using RNA probes. RNA probes were prepared by in-vitro transcription using α-32P-UTP. Oligonucleotides used in this study are listed in Supplementary Table T1.

Preparation of the small RNome

Whole cell extracts were prepared from 109 cells; after extraction with 0.3 M KCl, 10 mM MgCl2, the ribosomes were sedimented by centrifugation for 3 hr at 35,000 rpm in a Beckman 70.1Ti rotor (150,000 x g). RNA extracted from the post-ribosomal supernatant (PRS) was used for library preparation, essentially as described [20]. Briefly, 800ng of PRS RNA was dephosphorylated with FastAP Thermosensitive Alkaline Phosphatase (Thermo Scientific), cleaned by Agencourt RNA clean XP beads (Beckman Coulter) and ligated to 3ʹ linker using high concentration T4 RNA Ligase 1 (NEB) in a buffer containing DMSO, ATP, PEG 8000 and RNase inhibitor (NEB). The ligated RNA was cleaned from excess linker using Dynabeads® MyOne™ SILANE beads (Thermo Scientific), and first strand cDNA was prepared using the AffinityScript Reverse Transcriptase enzyme (Agilent). The RNA was subsequently degraded using 2 μl of 1 M NaOH and the cDNA was cleaned using Dynabeads® MyOne™ SILANE beads (Thermo Scientific). The cDNA was further ligated to 3ʹ adapter using high concentration T4 RNA Ligase 1 (NEB) and cleaned of excess adapter by using Dynabeads® MyOne™ SILANE beads (Thermo Scientific). The adapter ligated cDNA was PCR enriched using NEBNext® High-Fidelity (NEB) polymerase (9 PCR cycles), separated on an E-Gel EX agarose gel (Invitrogen) and size selected in the range of 150–300 bp (containing ~30-180nt corresponding to RNA). The amplicons were gel purified using NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel) and sequenced in a Nextseq system (Illumina) in paired end mode (20–40 million reads for each sample).

RibOxi-seq

T. brucei PCF and BSF were grown to mid-log phase (8 and 2 × 106 cells/ml, respectively), and total RNA was prepared from 108 cells using the Trizol reagent (Thermo Scientific) according to the manufacturer’s specifications. The RNA was further purified by transferring the aqueous phase of the chloroform extraction to a PureLink RNA mini kit column (Thermo Scientific) and by applying a genomic DNA removal step using the PureLink on-column DNase set. The subsequent steps of the procedure have been published in detail [29]. Briefly, 8 µg of purified RNA was denatured for 3 min at 90°C and fragmented by Benzonase (final RNA concentration: 100ng/µL, enzyme 0.5 units/100 µL reaction) on ice for 90 min. The digestion products for each reaction were divided into a control sample (Ctrl: 1 µg) and an oxidation sample (Ox: 7 µg), and both samples underwent the same treatments except for oxidation and β-elimination steps of the Ox sample. The Ox sample was oxidized with sodium periodate (NaIO4) at a final concentration of 50 mM in oxidation buffer (3.28 mM sodium borate, 37.5 mM boric acid, pH = 8.6) for 45 minutes at room temperature, protected from light. The reaction mixture was precipitated and incubated with elimination buffer (33.75 mM sodium borate, 50 mM boric acid, pH = 9.5) for 90 minutes at 45°C to remove the 3ʹ-terminal nucleotides. To remove 3ʹ phosphates, RNA was purified using NucAway spin columns (ThermoFisher Scientific) and treated with 2 µl of T4 PNK (New England Biolabs [NEB]) for 4 hours in PNK buffer, with the pH adjusted to 6.0. A final round of NaIO4 treatment was carried out to oxidize all ends without Nms.

All subsequent steps were identical for both Ox and Ctrl samples: 50 pmol of NEB miRNA universal linker was ligated to RNA 3ʹ-ends using T4 RNA Ligase 1 (NEB) for overnight at 16°C according to the manufacturer’ instructions, omitting the addition of ATP. Next, 50 pmol of RT primer, containing unique molecular identifiers for bioinformatic PCR duplicate removal, were added and annealed in a thermal cycler at 95°C for 2 minutes, 65°C for 10 minutes, and 4°C for 1 min. After ligation of 100 pmol of 5ʹ-RNA linker with NEB T4 RNA ligase 1, reverse transcription was carried out using NEB Protoscript II, both steps following the manufacturer’s protocols. Subsequently, Ampure XP beads (Beckman Coulter) were used to remove empty adapters (<100 bp), and cDNA libraries were amplified by PCR using Illumina-compatible adapters that contained barcodes for demultiplexing sequence reads. Libraries were sequenced using the Illumina NextSeq500 platform with 150 cycle Mid Output Kits.

RiboMeth-seq

Total RNA from T. brucei PCF and BSF was denatured at 90°C for 2 min in a thermocycler. Then, an equal volume of buffer (NaHCO3/Na2CO3, pH 9.9) was added, and RNA samples were incubated at 90°C for 10, 15 or 20 minutes [30]. The hydrolysed RNA was then used for library preparation. Briefly, 800ng of the RNA was dephosphorylated with FastAP Thermosensitive Alkaline Phosphatase (Thermo Scientific) and cleaned by Agencourt RNA clean XP beads (Beckman Coulter). The RNA was then ligated to 3ʹ linker using high concentration T4 RNA Ligase 1 (NEB) in a buffer containing DMSO, ATP, PEG 8000 and RNase inhibitor (NEB) for 1.5 hr at 22°C. The ligated RNA was purified from excess linker using Dynabeads® MyOne™ SILANE beads (Thermo Scientific), and first strand cDNA was prepared using the AffinityScript Reverse Transcriptase enzyme (Agilent) at 55°C for 45 min. Next, the RNA was degraded using 2 μl of 1 M NaOH, and the cDNA was cleaned using Dynabeads® MyOne™ SILANE beads (Thermo Scientific). The cDNA was further ligated to 3ʹ adapter using a high concentration T4 RNA Ligase 1 (NEB) overnight at 22°C and cleaned of excess adapter by using Dynabeads® MyOne™ SILANE beads (Thermo Scientific). The adapter ligated cDNA was PCR enriched using NEBNext® High-Fidelity (NEB) polymerase (9 PCR cycles), separated on an E-Gel EX agarose gel (Invitrogen) and size selected for the range of 150–300 bp (containing ~30-180nt corresponding to RNA). The amplicons were gel purified using NucleoSpin Gel and PCR Clean-up kit (Macherey-Nagel), and sequenced in a Nextseq system (Illumina) in paired end mode (20–40 million reads for each sample).

2ʹ-OMe-seq

To perform 2ʹ-OMe-seq, an adaptor was ligated to the 3ʹ end of the total RNA (upon fragmentation) and cDNA was prepared using AffinityScript reverse transcriptase (Agilent) under low dNTP (0.004 mM) and high dNTP (1 mM) concentration [23]. The cDNA was then ligated to an adaptor, PCR amplified and cleaned by Ampure SPRI beads (Beckman Coulter); the samples were sequenced in an Illumina NextSeq machine in paired-end mode (20–40 million reads for each sample).

Bioinformatic analysis

RibOxi-seq reads were aligned to the T. brucei rRNA using SMALT v_0.7.5 (http://www.sanger.ac.uk/resources/software/SMALT/) with default parameters. For each sample, the resulting BAM file was sorted using samtools v1.9 and converted to a BED file using bamtobed from the BEDtools v2.26.0 Suite [33]. For each position in the rRNA, we counted the number of reads whose 3ʹ-end alignment terminated at that base. A table containing all samples with the per base counts for each position on the rRNA was imported into DESeq2 [34]. Oxidized samples versus control (non-oxidized) samples were compared for differential expression at each position on the rRNA. Positions with a log2 fold-change of 6 and a corrected p-value of less than 10e−11 were considered as methylated.

For stringent RibOxi-seq analysis, the forward reads were trimmed by cutadapt 1.18 to remove 3ʹ read-through adapters, and un-trimmed reads were discarded. Alignment to T. brucei rRNA was done using STAR 2.7.0a (ver 2.0.6) (https://github.com/alexdobin/STAR/releases)[35]. Aligned BAM files were then converted to BED files using BEDtools v2.26.033 [33]. Aligned 3ʹ-ends were counted to generate count tables that were subsequently fed through DESeq2 [34] for differential analysis. The resulting list was filtered using the following criteria: baseMean>100, log2fc >7, 0<(corrected P-value)<0.0005. A master shell script for read processing and alignment, a python script for end counting, and a R script for DESeq2 analysis were used in this study, and are provided in the Supplementary materials.

To analyse the RiboMeth-seq libraries, the reads were initially trimmed of adapter sequences using Trim Galore version 0.4.4 (https://github.com/FelixKrueger/TrimGalore) with the following parameters:–stringency 4 – length 30 – paired – retain_unpaired. The alignment to the reference rRNA sequence was done by STAR (ver 2.0.6) (https://github.com/alexdobin/STAR/releases)[35], mapped, and properly paired reads were converted to *.bed using BEDtools v2.26.0 Suite [33]. 5′- and 3′-ends counting was done directly on *.bed file using a dedicated Unix script (attached in the Supplement). Analysis was performed by calculation of MAX score for detection of Nm residues, and RMS score (score C) for their quantification. To calculate MAX score, the relative change of end coverage position by position was calculated in 5′→3′ and reverse direction. The relative change was normalized to average values spanning −6 and +6 nucleotides. The normalized relative change for 5′→3′ and reverse direction were averaged, and the maximal value between the average and normalized relative change was retained (score MAX). RMS score was calculated essentially as described previously for score C using the same relative impact of neighbouring nucleotides [30]. The scripts used in this study are provided in the Supplementary materials.

For 2ʹ-OMe-seq analysis, we counted for each position in the rRNA the number of reads whose 5ʹ-end alignment initiated at that location. We also calculated the total coverage of each base using genomecov from the BEDtools v2.26.0 Suite [33]. For each position, we calculated the 2ʹOMe score as described [23], comparing the ratio of the total coverage with the number of reads initiating at that position. Each predicted site was manually inspected for an increase compared to its neighbouring nucleotides.

Results

Mapping the Nms on T. brucei rRNA

The most recent analyses of the T. brucei small RNA repertoire identified 83 C/D snoRNAs which have the potential to guide 87 Nms on rRNA [14,15]. To validate the predicted sites and comprehensively determine Nm deposition on trypanosome rRNA, we applied three high-throughput approaches for Nm site identification and quantification, namely RibOxi-seq, a method which we recently developed and which is based on ribose oxidation [29], RiboMeth-seq, which relies on resistance of Nm sites to alkaline hydrolysis [30] and 2ʹ-OMe-seq, which depends on the enzymatic properties of reverse transcriptase [23]. In RibOxi-seq, the RNA is randomly fragmented by Benzonase, and sodium periodate is used to induce oxidation of ribose moieties of all RNA ends. Since Benzonase cannot cleave RNA immediately downstream of an Nm site, it is necessary to remove the 3ʹ-terminal nucleotides after this first oxidation step by β-elimination to expose Nms at 3ʹ-ends. Further removal of terminal phosphate groups by T4 PNK generates two kinds of phosphate-less RNA 3ʹ-ends: unmethylated and 2ʹ-O-methylated nucleotides. A second sodium periodate treatment then oxidizes unmethylated nucleotides into dialdehydes, preventing 3ʹ ligation of linker molecules to these RNA fragments. Conversely, 2ʹ-O-methylated ribose renders RNA 3ʹ-ends resistant to oxidation and capable of 3ʹ linker ligation. This strategy enriches sequences with 3ʹ-terminal Nm sites, leading to highly Nm-specific, low complexity cDNA libraries that are analysed by next-generation sequencing. Note, as a result of Benzonase treatment, the terminal 2ʹ-O-methylated bases possess 3ʹ-phosphates, and the non-methylated bases have a mixture of cyclic-phosphates, 3ʹ-phosphates and 2ʹ-phosphates and like 2ʹ-O-methylated bases, the 2ʹ-phosphates and cyclic-phosphates are also resistant to oxidation. To avoid ligation of the linkers to these mixtures in the subsequent steps, the RNA was further treated with T4 Polynucleotide Kinase (PNK) that removes all the three types of phosphates and thus avoiding their detection in our sequencing. Thus, despite the fact that 2ʹ,3ʹ-cyclic phosphate are present in rRNA [36] the RibOxi-seq protocol described above eliminated these false-positives and this method therefore is proved to be highly specific, sensitive and accurate in mapping Nm sites of human rRNA [29].

We performed RibOxi-seq on samples from both T. brucei life cycle stages (PCF and BSF), aligned the reads to the genome sequence using the SMALT program, and counted only the 3ʹ-ends of the aligned sequences for the identification of Nm sites. As controls, we analysed RNA fragments that were not subjected to sodium periodate treatment, and calculated the log2 fold change (log2fc) of treated over control samples for each rRNA position. The data are based on three independent biological replicates for each life cycle stage, which were each sequenced twice. By applying a cut-off value of log2fc of >6 and adjusted P-value of <10−11, we identified 96 Nm sites in trypanosome rRNAs (Fig. 1A) (Supplementary Table T2). Although our analysis accurately detected all predicted Nm sites, the method also yielded numerous inconclusive sites. Some of these sites are the two m62A (A2260 and A2261 in SSU) and m4 C (C2135 in SSU) that were also detected in L. donovani Cryo-EM [21] (Supplementary Table T2). In coherence with our detection, a recent pseudo-MS3 analysis also assigned this m4 C site as an Nm in L. donovani rRNA [37]. Using a stringent bioinformatics analysis (baseMean>100, log2fc >7, 0<(P-value)<0.0005), 83 Nms were detected in either PCF and BSF, 76 Nms were detected in both PCF and BSF, 22 inconclusive sites were detected in either PCF or BSF, and 9 inconclusive sites were detected in both PCF or BSF (Supplementary Table T2). These 22 inconclusive sites are mostly sites located one or two nt from a known Nm site. Interestingly, such sites were not found in human rRNA using the same method [29]. Since we did not identify snoRNAs to guide these 22 adjacent positions, these are likely guided by the snoRNAs guiding the adjacent known Nm site, suggesting that trypanosome snoRNAs are ‘slippery’. As an example, in SSU, a site was detected 2nt from Am125, Gm1676, and Um2154; in LSUα, a site was detected 2nt from Am927, one nt from Gm925. Indeed, primer extension using low dNTP verified the detection of three novel sites, Am997 (adjacent to Am996), Gm1621 (adjacent to Am1620) and Am1665 in LSUα (Fig. 1B). Interestingly, similar phenomenon was described in yeast for snR13, snR48 and U18 showing that the same snoRNA can guide adjacent Nms [38]. These snoRNAs have atypical C’/D’ boxes and hence bind differently the protein components of the snoRNAs, and can catalyse additional modifications that do not conform to the classical +5 rule. In trypanosomes, many the snoRNAs also have atypical boxes [13,15,16].

Figure 1.

Figure 1.

Detection of Nm positions in T. brucei rRNA using RibOxi-seq. A) Volcano plot illustrating the detection of Nm positions in T. brucei rRNA. Total RNA from T. brucei PCF was treated (oxidized) with NaIO4, adapters or left un-treated (non-oxidized) and ligated and sequenced as described in ‘Material and Methods’. The reads were mapped to the rRNA, and the positions having a cut-off value of log2FC >6 and adjusted P-value of <10−11 are presented. The X-axis represents the log2FC (oxidized/non-oxidized) and the Y-axis represents the -log10 (P-value) across each site detected in the rRNA. Positions meeting the criteria of log2FC>6 are indicated as orange dots, those meeting FDR < 5e-10 are indicated by red dots, and positions qualifying both log2FC and FDR are shown in green. The zoomed-in views indicate examples of the detected methylation sites. B) Validation of novel Nm sites detected by RibOxi-seq. Total RNA (10 μg) was subjected to primer extension using an oligonucleotide complementary to an indicated rRNA region at low and high-dNTP concentrations (0.004 and 1 mM, respectively). Extension products were separated on 10% polyacrylamide–7 M urea gels along with a dideoxynucleotide sequence ladder of the rRNA produced using the same primer. Partial RNA sequences are depicted, and the methylated nucleotide is highlighted in red. Locations of the Nms on rRNA are indicated.

The stringent criteria used in the analysis also did not detect thirteen predicted Nm sites [14,15]. Six of these sites are adjacent to known sites (one nt apart). All these sites can be detected with less stringent criteria, suggesting that these sites really exist.

Since the suitability of RibOxi-seq data for quantitative analyses has not been demonstrated and we were interested in assessing the level of Nm modifications in both life stages, we examined Nm modification of rRNA in PCF and BSF by RiboMeth-seq [30]. RNA that was subjected to alkaline hydrolysis was ligated with 3ʹ end linker, and libraries were prepared. Size selected amplicons of 60–140 bp were sequenced. Profiles of fragments obtained around the Nm sites that show a characteristic drop at the Nm residue are presented (Supplementary Fig. S1). The RMS (relative methylation score: score C) was calculated for each Nm site detected by RibOxi-seq, as previously described [30] (Supplementary Table T3). For instance, Um1630 had an RMS value of 0.65 in BSF and 0.04 in PCF, similarly Um1674 had an RMS value of 0.22 in BSF and 0.73 in PCF. The RiboMeth-seq data prepared from RNA extracted from separate replicates of both life stages were almost identical (Pearson’s correlation of 0.99) (Supplementary Fig. S2A). Four replicates of BSF and three replicates of PCF were used for quantification of Nm levels. The results indicate that not all of the Nm sites are equally modified along the rRNA (Fig. 2).

Figure 2.

Figure 2.

RiboMeth-seq reveals the stoichiometry of methylation at Nm sites in T. brucei rRNA. RMS score for individual Nms on rRNA is plotted as the mean ± SE. Four biological replicates were used to quantify BSF Nms, and three replicates were used to quantify PCF Nms. The RMS score of BSF Nm is presented as blue dots and PCF as red dots. Sites adjacent to other Nms exhibiting low RMS scores are not shown.

Three Nm sites were hypermodified in BSF: SSU Um1630 (BSF RMS = 0.65, PCF RMS = 0.04), SSU Cm1844 (BSF RMS = 0.42, PCF RMS = 0.13), and LSUβ Gm1247 (BSF RMS = 0.72, PCF RMS = 0.23). Five Nm sites were hypomodified in the BSF: LSUα Am1180 (BSF RMS = 0.01, PCF RMS = 0.77), SSU Um2154 (BSF RMS = 0.10, PCF RMS = 0.38), LSUβ Am544 (BSF RMS = 0.07, PCF RMS = 0.26), LSUβ Gm659 (BSF RMS = 0.12, PCF RMS = 0.42), and SSU Um1674 (BSF RMS = 0.23,PCF RMS = 0.73). The differential RMS along the SSU and LSU rRNA is schematically presented (Fig. 2), indicating that several Nm sites were differentially and reproducibly modified in T. brucei parasites during its life cycle (Table 1). However, the magnitude of fold-change in the Nm modifications, apart from the eight sites mentioned above, was marginal. In RiboMeth-seq, Nm sites present adjacent to another Nm were difficult to quantify. For example, SSU Um57 had no RMS value, whereas Am56 had an RMS value of 0.47 (Supplementary Table T3).

Table 1.

The complete stoichiometry of Nm sites in T. brucei rRNA. The identity and position of individual Nm sites in T. brucei rRNA. The RMS score of each individual Nm and its fold change are presented for both life stages. Data are shown as mean ± s.e.m. The fold change of 2ʹ-OMe score, identity and expression values of snoRNA guiding individual Nms are also presented.

rRNA Nm BSF (RMS) PCF (RMS) FC_RMS (BSF/PCF) FC _2ʹ-OMe (BSF/PCF) snoRNA snoRNA log2FC(BSF/PC)
SSU Um36 0.64 ± 0.02 0.83 ± 0.04 0.77 0.84 TB10Cs2C1 3.51
SSU Cm46 0.52 ± 0.01 0.55 ± 0.01 0.95 0.94 TB8Cs3 C3 0.15
SSU Am56 0.47 ± 0.03 0.75 ± 0.04 0.63 1.09 TB8Cs2C1 0.09
SSU Cm66 0.35 ± 0.01 0.38 ± 0.06 0.93 1.10 TB8Cs2C1 0.09
SSU Am125 0.83 ± 0.01 0.89 ± 0.00 0.94 0.98 TB8Cs1 C2 −0.58
SSU Um680 0.52 ± 0.01 0.55 ± 0.01 0.94 1.19 TB10Cs3 C3 −0.60
SSU Um714 0.78 ± 0.01 0.80 ± 0.05 0.98 1.34 TB9Cs2C6 1.41
SSU Am721 0.76 ± 0.01 0.81 ± 0.06 0.95 1.46 TB6Cs2C1 −0.03
SSU Gm1517 0.45 ± 0.01 0.48 ± 0.02 0.93 1.50 TB11 Cs3 C2 1.90
SSU Gm1531 0.76 ± 0.01 0.84 ± 0.00 0.91 1.21 TB3Cs1C1 −0.66
SSU Gm1603 0.89 ± 0.01 0.95 ± 0.00 0.93 0.88 TB8Cs3C2 1.35
SSU Um1630 0.65 ± 0.03 0.04 ± 0.02 15.38 0.72 TB10Cs2”C3 1.03
SSU Um1652 0.78 ± 0.01 0.72 ± 0.06 1.09 1.13 TB10Cs1C3 1.07
SSU Um1674 0.23 ± 0.06 0.73 ± 0.02 0.31 0.95 TB10Cs3C2 −0.12
SSU Gm1676 0.39 ± 0.08 0.14 ± 0.05 2.80 0.79 TB9Cs3C1 −0.50
SSU Gm1700 0.64 ± 0.00 0.55 ± 0.02 1.18 0.92 TB10Cs2”C1 −0.07
SSU Cm1844 0.42 ± 0.01 0.13 ± 0.07 3.12 1.25 TB7Cs1C1 1.01
SSU Gm1895 0.52 ± 0.02 0.69 ± 0.04 0.75 1.48 TB9Cs2C4 −2.01
SSU Um1899 0.76 ± 0.02 0.89 ± 0.03 0.86 1.20 TB11Cs2C1 −1.45
SSU Um2054 0.93 ± 0.00 0.90 ± 0.02 1.03 0.85 TB8Cs2C0 0.33
SSU Am2096 0.76 ± 0.01 0.87 ± 0.04 0.87 1.39 TB9Cs3C2 −0.01
SSU Um2123 0.65 ± 0.01 0.57 ± 0.03 1.14 0.88 TB10Cs4C3 −0.04
SSU Cm2134 0.72 ± 0.00 0.52 ± 0.01 1.38 1.29 TB10Cs4C3 −0.04
SSU Um2154 0.10 ± 0.01 0.38 ± 0.14 0.25 0.85 TB10Cs3C2 −0.12
SSU Gm2227 0.87 ± 0.01 0.92 ± 0.01 0.95 1.76 TB8Cs1C1 1.06
LSUβ Gm71 0.61 ± 0.01 0.64 ± 0.03 0.94 1.17 TB9Cs1C1 −0.86
LSUβ Um73 0.69 ± 0.01 0.57 ± 0.04 1.21 1.55 TB9Cs1C1 −0.86
LSUβ Am95 0.74 ± 0.01 0.80 ± 0.02 0.92 1.16 TB3Cs3C1 1.06
LSUβ Cm377 0.36 ± 0.02 0.31 ± 0.16 1.18 1.74 TB9Cs4C3 1.36
LSUβ Am400 0.90 ± 0.00 0.92 ± 0.01 0.97 1.17 TB6Cs1C2 −1.58
LSUβ Am520 0.73 ± 0.01 0.75 ± 0.05 0.98 0.57 TB10Cs7C2 1.61
LSUβ Am544 0.07 ± 0.03 0.26 ± 0.26 0.27 0.55 TB6Cs1C3 0.01
LSUβ Am545 0.75 ± 0.01 0.80 ± 0.03 0.94 0.55 TB6Cs1C3 0.01
LSUβ Gm552 0.63 ± 0.02 0.74 ± 0.07 0.85 0.53 TB10Cs1C4 −3.04
LSUβ Um578 0.90 ± 0.01 0.93 ± 0.01 0.97 1.19 TB11Cs3C1 0.88
LSUβ Am588 0.41 ± 0.03 0.52 ± 0.09 0.79 1.04 TB8Cs1C3 0.57
LSUβ Cm601 0.62 ± 0.01 0.33 ± 0.13 1.86 1.30 TB10Cs1C1 −0.72
LSUβ Am609 0.50 ± 0.01 0.54 ± 0.03 0.94 1.25 TB10Cs1C1 −0.72
LSUβ Am622 0.81 ± 0.01 0.81 ± 0.00 0.99 1.30 TB10Cs2ʹC1 0.47
LSUβ Am646 0.62 ± 0.01 0.55 ± 0.08 1.12 1.23 TB11Cs4ʹC1 0.92
LSUβ Gm659 0.12 ± 0.03 0.42 ± 0.12 0.29 1.13 TB9Cs2C7 −0.85
LSUβ Um672 0.35 ± 0.01 0.31 ± 0.19 1.12 2.92 TB11Cs4C3 2.01
LSUβ Gm673 0.60 ± 0.02 0.60 ± 0.02 1.01 2.60 TB11Cs4C3 2.01
LSUβ Um685 0.62 ± 0.01 0.68 ± 0.08 0.91 1.40 TB9Cs2C7 −0.85
LSUβ Um728 0.17 ± 0.02 0.36 ± 0.07 0.48 0.97 TB10Cs3C5 0.44
LSUβ Um1093 0.68 ± 0.02 0.78 ± 0.01 0.87 1.16 TB5Cs1C1 −0.57
LSUβ Gm1094 0.57 ± 0.02 0.70 ± 0.03 0.81 1.10 TB5Cs1C1 −0.57
LSUβ Cm1175 0.22 ± 0.03 0.38 ± 0.08 0.58 1.43 TB5Cs1C1 −0.57
LSUβ Am1201 0.77 ± 0.01 0.80 ± 0.04 0.97 1.67 TB10Cs2”C2 0.35
LSUβ Gm1245 0.60 ± 0.01 0.53 ± 0.14 1.14 1.14 TB3Cs1C-1 1.31
LSUβ Gm1247 0.72 ± 0.03 0.23 ± 0.13 3.07 1.02 TB11Cs1C2 0.66
LSUβ Cm1264 0.81 ± 0.00 0.62 ± 0.04 1.31 0.94 TB6Cs1C1 1.62
LSUβ Gm1269 0.37 ± 0.02 0.39 ± 0.16 0.96 0.69 TB11Cs1C2 0.66
LSUβ Um1375 0.36 ± 0.04 0.63 ± 0.03 0.58 0.71 TB10Cs3C1 0.05
LSUβ Am1388 0.29 ± 0.02 0.39 ± 0.16 0.75 0.86 TB11Cs4C1 0.72
LSUβ Am1400 0.72 ± 0.01 0.71 ± 0.05 1.02 1.07 TB11Cs4C1 0.72
LSUβ Cm1413 0.78 ± 0.01 0.74 ± 0.02 1.06 0.78 TB9Cs2C1 3.56
LSUβ Um1435 0.60 ± 0.02 0.85 ± 0.05 0.71 0.95 TB9Cs3C3 0.20
LSUα Am254 0.17 ± 0.03 0.38 ± 0.03 0.45 0.92 TB10Cs3C2 −0.12
LSUα Am742 0.39 ± 0.01 0.85 ± 0.07 0.46 1.00 TB9Cs2C2 −0.75
LSUα Am743 0.89 ± 0.00 0.95 ± 0.03 0.94 1.00 TB9Cs2C2 −0.75
LSUα Am746 0.91 ± 0.00 0.94 ± 0.02 0.97 0.87 TB10Cs2C2 −0.70
LSUα Cm747 0.61 ± 0.01 0.75 ± 0.02 0.80 0.85 TB10Cs2C2 −0.70
LSUα Cm760 0.82 ± 0.01 0.85 ± 0.04 0.96 1.10 TB9Cs2C2 −0.75
LSUα Um916 0.69 ± 0.01 0.63 ± 0.02 1.10 1.34 TB11Cs4C2 1.30
LSUα Gm925 0.82 ± 0.01 0.87 ± 0.04 0.94 1.62 TB9Cs2C3 1.60
LSUα Am927 0.69 ± 0.01 0.50 ± 0.17 1.39 1.35 TB11Cs4C2 1.30
LSUα Am996 0.60 ± 0.01 0.61 ± 0.08 0.98 1.34 TB9Cs5C2 2.24
LSUα Cm1006 0.60 ± 0.01 0.77 ± 0.06 0.78 1.27 TB9Cs5C1 −0.48
LSUα Am1024 0.75 ± 0.01 0.86 ± 0.04 0.87 1.44 TB11Cs1C3 1.43
LSUα Gm1028 0.50 ± 0.02 0.72 ± 0.08 0.70 1.14 TB9Cs2C5 −0.33
LSUα Um1145 0.58 ± 0.04 0.84 ± 0.00 0.69 0.99 TB10Cs3C4 −0.70
LSUα Am1180 0.01 ± 0.00 0.77 ± 0.05 0.01 1.01 TB9Cs5C1 −0.48
LSUα Um1181 0.84 ± 0.02 0.94 ± 0.02 0.90 0.97 TB10Cs3C4 −0.70
LSUα Gm1267 0.54 ± 0.04 0.83 ± 0.03 0.65 1.31 TB9Cs4C1 −0.83
LSUα Um1448 0.49 ± 0.01 0.69 ± 0.05 0.71 1.08 TB8Cs2C2 0.67
LSUα Gm1605 0.62 ± 0.02 0.79 ± 0.06 0.79 1.52 TB10Cs2”C3 1.03
LSUα Cm1608 0.53 ± 0.01 0.74 ± 0.07 0.72 1.24 TB8Cs3C1 0.83
LSUα Am1620 0.76 ± 0.02 0.77 ± 0.05 0.99 1.12 TB8Cs3C1 0.83
LSUα Gm1621 0.50 ± 0.01 0.26 ± 0.07 1.94 0.99 TB8Cs3C1 0.83
LSUα Am1665 0.64 ± 0.02 0.73 ± 0.01 0.88 0.98 TB6Cs1ʹC1 0.02
LSUα Gm1709 0.72 ± 0.01 0.83 ± 0.05 0.87 0.76 TB8Cs1C1 1.06
LSUα Um1742 0.80 ± 0.01 0.84 ± 0.03 0.96 1.02 TB7Cs2C1 −0.34
5.8 S Am41 0.80 ± 0.01 0.82 ± 0.05 0.97 1.22 TB6Cs1ʹC1 0.02
5.8 S Am43 0.64 ± 0.00 0.56 ± 0.06 1.14 1.44 TB6Cs1ʹC1 0.02
5.8 S Gm75 0.55 ± 0.01 0.64 ± 0.08 0.85 1.42 TB8Cs3C3 0.15
5.8 S Am163 0.95 ± 0.00 0.94 ± 0.02 1.01 1.15 TB9Cs4C2 0.70
5.8 S Um167 0.88 ± 0.00 0.92 ± 0.01 0.96 1.34 TB9Cs4C2 0.70

To validate RiboMeth-seq in our hands, we applied the method to cells in which NOP1 was silenced. The results indicate that, as anticipated, most of the sites were reduced upon silencing. However, several sites were not changed: Cm46, Um680 and Um1630 in SSU; Gm925 and Gm1605 in LSUα; Cm601, Am609, Gm1245, Gm1247, Cm1264, Gm1269, Am1400, and Cm1413 in LSUβ (Supplementary Fig. S3). Several of these sites are guided by snoRNAs with special functions. For instance, Gm925 is guided by snoRNA Tb9Cs2C3 which functions in srRNA1 processing [39]. It is not currently known why these sites were not reduced upon NOP1 silencing. However, studies in mammalian cells showed that the most essential methylation sites on rRNA were less affected in cells depleted of NOP1 [26,27]. However, we cannot rule out the possibility that other methyltransferases guide such modifications and hence are insensitive to NOP1 depletion. For example, in yeast, Gm2922 on LSU rRNA is mediated by the methyltransferase, Sbp1 [40], and trypanosomes possess a homologue to this enzyme, encoded by Tb927.2.4550.

The third method that we used for Nm mapping is 2ʹ-OMe-seq. RNA from both PCF and BSF was subjected to reverse transcriptase under 1 mM or 0.004 mM dNTPs and RNA libraries were prepared upon ligation of 3ʹ linker, as previously described [20,23]. Under limiting dNTP concentrations, the reverse transcriptase stops one nucleotide downstream of the methylated site. We performed the same experiment using RNA prepared from cells carrying the silencing construct of NOP1. We then examined the results by defining the fraction of reads terminating at a given position in the low dNTP sample as measured by 2ʹOMe score (Table 1, Supplementary Table T3, Supplementary Fig. S4) [23]. Although this method was reported to be less quantitative, we inspected the relative stops on the different sites and compared them to the methylation fraction by RMS in RiboMeth-seq. Indeed, both methods indicated that the level of modification varies along the rRNA molecules. However, a Pearson’s correlation of only ~0.45 was obtained between the two methods (Supplementary Fig. S2B).

RiboMeth-seq data, which is considered to be the most quantitative method, was used to mark the Nm positions that were elevated (>10%) in the BSF parasite on the secondary structure of the rRNA (Figs. 46).

Figure 4.

Figure 4.

Localization of the Nms and ψs in the secondary structure of T. brucei SSU rRNA. Boxes highlighted in blue show the Nm sites whose level is increased in BSF (>10% compared to PCF) and those in yellow are the hypermodified pseudouridines (ψ) [20]. T. brucei specific Nm (not detected in T. cruzi or L. donovani Cryo-EM studies) are indicated by an asterisk (*).

Figure 6.

Figure 6.

Localization of Nms and ψs on the secondary structure of T. brucei LSUβ rRNA. Boxes highlighted in blue show the Nm sites whose level is increased in BSF (>10% compared to PCF) and those in yellow show the hypermodified pseudouridines (ψ) [20]. T. brucei specific Nm (not detected in T. cruzi or L. donovani Cryo-EM studies) are indicated by an asterisk (*).

Overall, we were able to identify at least 99 Nm sites using all three methods. Of these, 97 sites were detected by RibOxi-seq and 2ʹ-OMe-seq, and an additional two sites were supported by RiboMeth-seq (Supplementary Fig. S5). The results of the three-mapping methods indicating the level of modification, the identity of guiding snoRNA(s) and the expression level of the snoRNAs are presented (Table 1 and Supplementary Table T3). The study also identified 22 novel Nm sites (Table 1, Supplementary Table T2-3). Based on RNA-seq of small RNA libraries, we further identified snoRNA candidates that guide these newly identified modifications (Supplementary Fig. S6). Note that two of these snoRNAs interact imperfectly with their target (one mismatch). Similar predictions were recently suggested for several mammalian snoRNAs [41]. Moreover, 10 predicted Nm sites could not be confirmed experimentally, suggesting that, despite that these snoRNAs can perfectly base-pair with the rRNA target, these snoRNAs do not guide modifications in cultured exponentially growing trypanosomes. Indeed, this phenomenon was reported previously in other systems [4244].

The combination of our mapping approaches accurately identified each modification site, and corrected several positions of previously predicted sites, which were based on the complementary D or D’ sites in snoRNAs. This inconsistency arises from atypical D or D’ boxes. The accurate mapping data presented in this study corrects these mistakes, which are offset by one nucleotide either downstream or upstream of the published predictions. The updated and corrected Nm sites in T. brucei rRNA are listed in Table 1 and depicted in an updated map of the rRNA secondary structure (Figs. 46). The sequences of snoRNAs guiding these Nm sites are presented in Table 2.

Table 2.

The complete sequence of C/D snoRNA guiding T. brucei rRNA Nm sites. The predicted C/C’ and D/D’ boxes are highlighted in bold. The corresponding L. major homologs of T. brucei snoRNA are indicated[15]. The domain of complementarity to the rRNA target is underlined.

T. brucei snoRNA snoRNA Sequence L. major homolog Size(nt) rRNA Target Reference
TB3Cs1C1 CUGAUGACUAUAACGUUUCACUCUUGUCGUAGGAAAUGUCUUGACGAAACUUAUACUUCGUUUGAUUCUGA LM25Cs1C2 71 SSU-Gm1531 Liang et al[13].
TB3Cs1C-1 UGAUGUGAAUCUAUCUGUGCAAAUGACACCUCUAGUCUGAAACACAACAUGACGCGAGCACAAAUACAAGUUAUCCCUGGCUGA   84 LSU3-Gm1245 Barth et al[7].
TB3Cs3C1 UAUGAUGUCCAAAUGCCAAAUUUUACGCAUCUAUUUUUCUGAUGUGAAACGAUGAAUCAAAGCUUAACACAACUCUGAUU LM29Cs2C1 80 LSU3-Am95 Michaeli et al[14].
TB5Cs1C1 GUGAUGAAUUUUAAGCUUAGGACACCUUUGGACGCAGGGACCCUGCGAUUGAAAACAGUGAAACUUGAUCGAUUCGUACACUGAUUU LM22Cs1C2 87 LSU3-Cm1175 LSU3-Gm1094 Liang et al[13].
TB5Cs-1C1 GCUUGAUGAAUAAAUGUGUUAUUGUAUGGGCUGAGAAAUGUGGAGGGAGAGAAAUAACUUACAUACAUUCUGGCAUUUGAUGAUUGAAAUGGGUGGAGAGCUGUUGGCAGCUGAC   115 LSU5-Um46 Current study
TB5Cs-1pC1 UUGAUCAUGAUUUUCUUGCUUAACCCGCUUCCUCCCUCCACCAUUAGUGACUUCCCGUUAUCUUUUCUACGCUGA   77 LSU5-Gm1621 Current study
TB6Cs1C1 AGCAUGAUGAUCAUACGUGCAAUUCCUGUGGUAUCUGAAAAAUGCAAUGACAGAAGAACUGCGACGAUGCCUUCAUACAAGGAUUUACUGACU LM30Cs1C1 93 LSU3-Cm1264 Liang et al[13].
TB6Cs1C2 CCAAUGAUGUUGUUAUUUAAUUAUACACCUGAUCAUGUUGUUGAUGAGAGGAAACGCUGAGGUG LM26Cs1C2 64 LSU3-Am400 Liang et al[13].
TB6Cs1C3 GAUGAUGCUAACAAUCGAGGCAUUUGUAUGAUUUUCAAAUGAAUUAGGCAUCCGUGCACUAAGCUACCUAUCAUAAGCACGGGACUGAU LM26Cs1C1 89 LSU3-Am544 LSU3-Am545 rRNA processing Liang et al[13]., Michaeli et al[14].
TB6Cs1ʹC1 UUCAUGAUGUAGAAAAAAUAACUCUCCAGUAUCUCUGAUAUGUUCGUGAAAAGAGAAAAGCUAUUUAAACGUUGCGUUCUUCAGCUGAUUC   91 5.8 S-Am43 Current study
TB6Cs2C1 CGCUGAUGAAUUAAUUUUUCUGAGUGUUUUCUUAGAGUUCCGUAACGGGCAUGAUAAGCACACAAAUUAUGAACCCUUUAACUCUGAGAG LM30Cs2C1
LM30Cs2C2
90 SSU-Am721 Liang et al[13].
TB7Cs1C1 UACGGUGAUGAAAAUACAUGUAUAUGCUAACUUUUCUGUACUCCAUGGAGUGAGCAUGAAGUGAAUUAUCCGCAGAGAAGUCUGAAA LM14Cs1C1 87 SSU-Cm1844 Barth et al[7].
TB7Cs2C1 UGAAUGAUGAACUUAAACAUACUUUUUUCCUAACAAAACGCCUGAAUGUGUUGAAUGAUGCACAACCAAUACGUGUGUUCUGAUA LM20Cs2C1 68 LSU5-Um1742 Current study
TB8Cs1C1 GUGUGUGAUGUAUAUACGACUAUGAACAACUCGUCAGAGUGCUAUCUUUGAUGAUCACAUACAUUUUGCUUCCUCACUGACAA LM20Cs1C3 83 LSU5-Gm1709 SSU-Gm2227 rRNA processing Liang et al[13]., Chikne et al[39].
TB8Cs1C2 CACGGUGAUGUUCAAUACAAUAACUGAUGUAAUGAGACCUAGUGGAAUACUGCGGACACUUCUUGUUCUGAGCG LM20Cs1C4 74 SSU-Am125 Liang et al[13].
TB8Cs1C3 ACACAUGAUGAGCAGCAAAGAAAGGGAAUCUCUUGUCGAGCUCAGGGAAUGAAUCCCAGUGGCGGUGACAUGACAAACUUUGAUAUUACUAUAAUCCAUUCUAUUCUGUUACUGAUGC LM20Cs1C1 118 LSU3-Am588 rRNA processing Liang et al[13]., Chikne et al[39].
TB8Cs1C4 UCCCUUGAUGAUUGUGGCAACUCUCCACGGAACUUAUCUGACAAAAUUUGCCUACGAACCUAUUACCAAGGCUGAGGU LM20Cs1C2 78   Liang et al[13].
TB8Cs2C0 GCGCGUGAUGAAUAAAUACAAACGACCAAUAAUCGGAAGCGUCAGUAACACCUCACGCAUGACGCCACUUUGAAUGCAAUACUAAUUAUCUGACUC LM36Cs2C2 96 SSU-Um2054 Liang et al[13].
TB8Cs2C1 UACCUCGAUGAUGUGUAUGAGAACAAGCAUAUGUCCGAGCUGACCACAAUUGUGGCACAAUGAGAGCAUUACUCGAGUCCUUGAAAGCUGAGUG LM36Cs2C1 94 SSU-Am56 SSU-Cm66 Liang et al[13].
TB8Cs2C2 GCAAAGCAAAUAAACAGCGGGGGAAACCUCGUCUCAAAAUGUGAUUUUGCGUGAACGAGCGUUUUCAAGCGUCAUCCACUGACUC LM36Cs2C3 85 LSU5-Um1448 Michaeli et al[14].
TB8Cs3C1 GCCAGUGAUUAUACGUAAUGUCUUUGCUACAGGUGAUUGUACGAUAUGACCAUACCGACUAAACCAACCGAGAUCAAUGAGGC LM23Cs1C3 83 LSU5-Am1620 LSU5-Cm1608  Liang et al[13].
TB8Cs3C2 CGGUGUGAUUACAGACAGGAUGUAAGUGAGUCAAUGUCAAUAUCUCCGUAUUACACCAUGAGGACUAUUGUCCCCCGUGUCUGACCG LM23Cs1C1 87 SSU-Gm1603 Liang et al[13].
TB8Cs3C3 ACACAUGAUGUCAUUUCGUAUUCUGCAAUACUGACAAUAACUUGAGCGAGACAAGACAUAUUUGACUACUGGCAACUGAAAC LM23Cs1C2 81 5.8 S-Gm75 SSU-Cm46 Liang et al[13].
TB9Cs1C1 CUUCGUGAUGAUCCCGCGAACUGAGUGUACCUUUUUUCAGCACUUUCGUGCAAUGGAAUGUAAUGGCACGGUGCCCUCUUGUUGGGUGUACUGAUA LM26Cs1C3 98 LSU3-Gm11 Liang et al[13].
TB9Cs2C1 GCCCAUGACGAUAAACCACUUACGACGGUCUUAUGACACACACCCGAACAUGGAUUGAGCACGAGUGUUACGAUAGUUUCUGGGGCGCCGCACAACAUUCCCGAGGC LM35Cs3C6 LM22Cs1C1 107 LSU3-Am70 LSU3-Cm1413 rRNA processing Liang et al[13]., Michaeli et al[14].
TB9Cs2C2 GUCAAUGAUGAGUCUGUCAAAUCCGUGUUUCAGCUGAAUUUAUUUGAUGCUGACAUCAGUUAAUUUUGUCUGUUUGACUUCUGAGUA LM35Cs3C3 87 LSU5-Am742 LSU5-Am743 LSU5-Cm760 LSU5-Am762 Liang et al[13].
TB9Cs2C3 GUGGCCGAUGAUGGAAACUAGUUGAGCGUCCAAACAUGUUCCGACGUCAUAUUCAUGAGGGAUCUAUACAACACAAAUCACCUUUCGGGUCUGAUGG LM35Cs3C4 97 LSU5-Gm925 rRNA processing Liang et al[13]., Chikne et al[39].
TB9Cs2C4 GGGUGACGAUGUACAAUAUGUUCAAAUUGCACCGAGAACCUGUGAGGACACCAUAACACAGACCUGCACUGAACCU LM35Cs3C5 76 SSU-Gm1895 Liang et al[13].
TB9Cs2C5 GCCACUGAUGAAAGAGCUUCCGAUACCGCGUAGGCGGAACGGAAACACACUAUGUCGAUGCAACUGUGAACUCUAUCUUUCGCUCCGAGCUGACGU LM34Cs1C1 96 LSU5-Gm1028 rRNA processing Liang et al[13]., Chikne et al[39].
TB9Cs2C6 GCGUGUGAUGAAUACCUAAUAUAACAAAUUAACAGCAACAUCUGAACAGAACCCGUGACGCUAAUAUUGUUUCUGACGC LM35Cs3C1 LM35Cs3C-1 79 SSU-Um714 Liang et al[13].
TB9Cs2C7 CUACUUGAUGACAUCAAUGGACUGGAGUCUCUGAGUGUAUUUGAAUGACAAUAACCCAUUUAAAGAAUAUUCUUCUUUCCCCGGCUGAUGG LM35Cs3C2 91 LSU3-Gm959, LSU3-Um685 Liang et al[13].
TB9Cs3C1 AUUACUUGAUGUAUAACACGAUAUUCAGGUAAAGAUUAUCAGGAGUAACUGACUGAGAUAACAUCAUGCACCACUCUGACCA LM5Cs1C1 87 SSU-Gm1676 Liang et al[13].
TB9Cs3C2 CUCCAUGAUGCCAUGACAAGACUAUAAGAGCACAGUUUGAACUGACUUCACAAGACGGACGAGAACGUCGCUGCAAUAUUCUGAUGA LM5Cs1C4 125 SSU-Am2096  Liang et al[13].
TB9Cs3C3 CUCUAUGAUGUUAAAAGAAGUUUUGUAGUAGGGUAAAAUCUGACAUCCGACCAUGAAGGUACGAAUUUAAUGUGCUUUCAUGUGCUUCUGCUAUUGUGGUUGCACAGAGGCGCUAUGUCUGAGAA LM35Cs2C1 81 LSU3-Um1435 rRNA processing Liang et al[13]., Chikne et al[39].
TB9Cs4C1 GAUCGUGAUGAUAUUAACCCUGCUCCGCUACUGAGUGUUGAAGCAUGAAACGAUAUCCUUCAGGGCUACUGAUGC LM35Cs3C7 75 LSU5-Gm1267 Liang et al[13].
TB9Cs4C2 CCACAUGAUGAUCCAUGUAUUCACCAUAUCGACACUGAGUCGGAAACUCCCCGUGACGCACAAGUGAUUGUGCAUGGAACCGCCCGCACACGCUGUAGGGCACUGACUA LM35Cs1C2 109 5.8 S-Am163 Liang et al[13].
TB9Cs4C3 UACUAUGAUUACAUCCAUAAUGCGUCAGGACACACGAGUGUGUACGUGACUGUUGGAUUCUAACGCGACGCCGUAAGCAAUAUGAUCA LM35Cs1C1 88 LSU3-Cm377 Liang et al[13].
TB9Cs5C1 GCGAGUGAUGAGAACAUGGAACUAUUGCACGUUUAUAUGAUAAGGCAACUUGAUGACUUACACACGCUUCACUAAAUAUCGUACGAGCGAUUACUGAUCA   100 LSU5-Cm1006 LSU5-Am1180 Liang et al[13].
TB9Cs5C2 GCAUGAUGACGAAACAAUUUUGCACGUCAGUUUGAAUUAGCAAAUGUGAAGAUGAAAUUGACACAGCUAUUUUAUGGGCUGUCCUGAUCU LM14Cs1C2 90 LSU5-Am996 Liang et al[13].
TB10Cs1C1 GUGUAUGAUGAGAAACCUAUUUUUAUGUAACUCGGGAGAACUGAGCAUAUUACCUGAUGAGUAAACAAUCAAUCGUUAGAUAGUAGCACUGAUGU LM36Cs1C3 96 LSU3-Cm601 LSU3-Am609 rRNA processing Liang et al[13]., Chikne et al[39].
TB10Cs1C2 (MRP) UAGUGUUGUGUGUCUGUGUCCGUCGUACUAGUAAAGGAUGAUGAGGAUGCGGCCACCCCACGCGACGUGAUGCGGUGUGUUGUUGACACUGCAGGCGUGUAGUUUCCCCUCCGCUGCCGUUGCAAAAGCAAACCCAUUUUGGGCAACAUACGCAGCCGCGCGCCACAAGACGCCUCCACCGCUCGACGGGGUGCCGCCAUGACGCGGGUCCCGAGCCAAAGUCCCUCGCCCGAAUUUCCGAAGGCAGUGAGUGCCUCGCGUGGGGAGACCUGCGCGCAGUAAAACCUCGUGCCGCUGGCGGUGCGGGGGCCACCGGCUGUGGUGGUUGAUCCCCUCUGGUGGAUCCCACAGCAGCAACUACGCGGGCCGUACUUUCGUCGGUGCGAGACUCCCCACCCGGUGGUAUGAAUGGCGGGCACCACUGAGUACACGUAUAGCCAUGAGCGGCAUGUGCAGAAGAGGGCUUACAAAGGAGGCAGCGCGAGCGAGCGAAUGGGUAACACUGGCGCGCUGCAGAGAAA   520 rRNA processing Liang et al[13].
TB10Cs1C3 UCACGUGAUGAGCAUUCAAUUCACAUCGCGACACACCGAUGAUUCGAAUGAGUGAGCUUCGAAUGGGAUAAAUUUAAUCGAGAAAGAACAGCUGAGAU LM36Cs1C1 98 SSU-Um1652 Liang et al[13].
TB10Cs1C4 GAGAAUGAUGAGAUUGCCAUCAUACUAUUGGAAGACGAGUCUGAACCCUGAUGCAUUUUAUCAUGCGGCACUGACGA LM36Cs1C2 78 LSU3-Gm552 rRNA processing Liang et al[13]., Chikne et al[39].
TB10Cs2C1 UAACAUGACGAGUGAGGAGCGCUAUAUCUUCUUCACCAAGUGCAGAAUUAACCGUCUGAGUACUUUAUCACUUUGAAGUGAAGCGCAACCUGAUUU LM33Cs2C1 96 SSU-Um36 Liang et al[13].
TB10Cs2C2 CGCUGAUGAAGUUGAUAUGGUCCGUGUUUCAGAUCGCUGAAUUGACGCACAAUAGCAUAUCUUCUGAGUU LM33Cs2C2 70 LSU5-Am746 Liang et al[13].
TB10Cs2ʹC1 GCACGUGAUGAGAGUAUGUUCUUUCUAUAUACUGAUUUGUCCAUUGAAGUACACAGAUACAUAUUCAUAGACUGUCGUUUCCCUGACC LM33Cs3C1 88 LSU3-Am622 Barth et al[7].
TB10Cs2”C1 UGAUGACAUCACUAAGCGUUUCUGACGGAAUUCGUAUGCUGAACACAUAACAAACGCAAAUCACUCCACACUGA LM36Cs-1ʹC2 74   Doniger et al. 2010
TB10Cs2”C2 UGAGGCAAACGAUGAAAAAAUAAAUGGAUCGCUGACUCUGAACACAUUAUCUGAUUGACAGGACAUUUCUUUGACUAUACCUCUGA   77   Doniger et al. 2010
TB10Cs2”C3 UGAUUACAAAGCAAUUUUAUCCUCACCCUGACGUGAGUGGUUCUGUGACGAGAAACCGAUUGACAAUAAUACCAAGAUCUGCUAUGA LM36Cs-1ʹC1 95 LSU5-Gm1605  Doniger et al. 2010
TB10Cs3C1 CGCGUGAUGAGGUGCAGAAGGCAUGUCGCCGCUACGGCGGUGGCUCGCGUAGCCGUCUGGCUGUGCGCGUACUGUGAGCUACUGUACUCCAUGGGUGAACAAUCUCUGAUG LM33Cs1C3 111 LSU3-Um1375 Liang et al[13].
TB10Cs3C2 CACAUGAUGCACACAAUUCAAUACUCACUCUGAACAUCACUUGUCAGGAGGAAUGUGAUAACAUGCACCACCAGCUGAUCA LM33Cs1C4 81 LSU5-Am254 SSU-Um1674 Liang et al[13].
TB10Cs3C3 GCGCAUGAUGUGCUCAACUGGAAUUACCAUCUGAACGCGGGAUACCGCAAGUCGAUGAAUUAAUGCUACGUGCAUUACCUCCGCUGUUACUCGUAUCACUGACAC   105 SSU-Um680 Liang et al[13].
TB10Cs3C4 CGCGUGAUGACAUACAAAGUUGUUUGCACAUUAUCCGACACACCGUGAGCGAGUUACAAUAUUACAAGAACACCAUCUGAAU LM33Cs1C2 82 LSU5-Um1145 LSU5-Um1181 Liang et al[13].
TB10Cs3C5 CGGUGAUUAGCAGUGCGUCUUCCACCUAACGACCCUUGAUGAUUAUGAUACGAUGCCUGGUCAACAGAACUAUACUACACCAAAUUUAGUAAAUGAGAC LM33Cs1C1 99 LSU3-Um728 Liang et al[13].
TB10Cs4C2 CUGCGUGAUGUGACUGCAGUUGUAGUGCGCACUGACGACCCAUCAUGAGCGAGAAACCAACUUUGCGUUUACCAAUCUAUCUGAUUC   87 SSU-Cm1932 Liang et al[13].
TB10Cs4C3 UGCGACGAUGAGAAACUGUCUAACGACAGGCGGACCGACACAUCCAAUGAGGACUCUUGAAUGUGUACAAAUGUUGAGCA LM18Cs1C1 80 SSU-Um2123, SSU-Gm2134 rRNA processing Liang et al[13]., Chikne et al[39].
TB10Cs4C4 GUGUAUGAGGACAGAAGUUGUAGUGCGCGACUGAGUGACAACUUAGUGCUGAUUGAUACCAACGCUUUCAGCGAGAGUGCUGUACUUGACUGACAC LM18Cs1C3 96 SSU-Gm1931 rRNA processing Liang et al[13]., Michaeli et al[14].
TB10Cs5C1 ACUUAUGAUGACAAUAUUAUUUCUUACUUUCUGAUCCGUGAUUUGAUUGACACAAGCACUGUCUGUGCGAACGUAUCGAGAACCUGAUU LM36Cs4C1a LM36Cs4C1b 89   Myslyuk et al[48].
TB10Cs7C1 GCUAUGAUGCCCUGUCAAUAAAUGCGUUCUGUUUGAACUGACGUUUAAAGUAGGAUGUCACUGAGACACAAGAAACUGA   82   Michaeli et al. 2012
TB10Cs7C2 UGAUUCCAACACGUAUUAUUUUUGGUGACCUUACCGUAUGAUUAUUUACGAAUACAACAUGUCAUAGUUACAAUGAGC LM36Cs-1C2 78 LSU3-Am520 Michaeli et al[14].
TB10Cs7C3a UGCCGUGACGAUUUGUCAUCUGCUGCGUUCUUUUUGAGUUUUUUGUAAAGUGAGAUGUUACAUGACUUACCGACUGAGCA LM26Cs2C1 80   Michaeli et al[14].
TB10Cs7C3b CCGUGACGAUUUGUCAUCUGCUGCGUUCUUUUUGAGUUUUUUGUAAAGUGAGAUGUCACUGUCUGGUUUACCAUCUGAGC LM26Cs2C1 80   Michaeli et al[14].
TB11Cs1C2 CUUAUGAUGAGAAGACACGUUUACCUGACACCUCUUCUGAUUUAACAUUGACGAGUAAAAACUGCUAACAGUUAUCCCUGUCUGAC LM25Cs1C1 86 LSU3-Gm1247 LSU3-Gm1269 Liang et al[13].
TB11Cs1C3 GUGCGUGAUGUUCAACAACCGCAAUCACUCCCAUACCUCUGAUAGUAUUGUUUGAUUGACACCAUUGCGUACUGAUGC LM25Cs1C3 78 LSU5-Am1024 Liang et al[13].
TB11Cs2C1 UGAAUGAUGACUGACAAAACAUCACAGACUUUGAUGACCCCAUGAACAAGAAAAAUUGUCGCCCCAGACUGAUU LM5Cs1C2 76 SSU-Um1899 LSU3-Gm1062 rRNA processing Liang et al[13]., Gupta et al[32].
TB11Cs2C2 GAAGUGAUUGACACCUAGGCCGAUGUAAAGCCGUCGCAGAUGGACGUCGAUAUCUUGUGAAAACAGUACUAUUUUAUGCCCUGACUGAUC LM5Cs1C3 92 rRNA processing Liang et al[13]., Gupta et al[32].
TB11Cs3C1 GGACGUGAUGAAGAAAAAUUAUUUACUUCUGUUUGGAGAGGGUUCAGGAACACUCUCCAUGACGUUACCAUAAUUAAUCCAUUCUCUGAUCA LM36Cs3C1 92 LSU3-Um578 Liang et al[13].
TB11Cs3C2 UUUUGAUGAAAAACCUUUCAUGCUGUGUGACGUACUCCCUUAUGAGGGCAGGCACAAGCUGCUUGCGGCCUAGUGUCAUGCAAUUGAUUAUAGACGGCAUUCUGAA   106 SSU-Gm1517 rRNA processing Liang et al[13]., Chikne et al[39].
TB11Cs4C1 UCUAAUGAUGACAGUCAAUAGUUUCCUGUCAGCCUGACGGCAGUAGAGCCAUUUUGAAGACAUAAUUUUUAACUCAGCUACACUGAAUC LM27Cs1C1 90 LSU3-Am1388 LSU3-Am1400 Liang et al[13].
TB11Cs4C2 GCCACUGAUGCUGUGAUGCAUAAUUGUUGUUCGAGGUCCAAACAGUUUGAGCGAUGCAUUGAUAACGGAACAUCAAAAAUCACCUUUCGGCUGAGCA LM27Cs1C2 97 LSU5-Um916 LSU5-Am927  Liang et al[13].
TB11Cs4C3 CAGUGUGAUGGAAACAACGAUUAUGUGUACGUGAAGGUCAAUAUGCCUUACUUUAUGAGCGCGCUUAUUGAAUACUAAAUCAAACUCAACAGGUCUGACUG LM27Cs1C3 101 LSU3-Um673 Liang et al[13].
TB11Cs4ʹC1 CGUGUGAUGAUAUUAGUGAUCUUUUGAAGUUAGCCCUUCAAUCCGAGAGUUUCGUGAUGGAAUAACCGUU LM4Cs1C1 70 LSU3-Am646 Michaeli et al[14].

In total, we mapped at least 31 Nms in SSU rRNA, 34 Nms in LSUα and 38 Nms in LSUβ. In comparison, Nm site distribution among trypanosomatid rRNAs is surprisingly variable. Up to 23 of the T. brucei sites seem to be specific to this species, and were not found in either T. cruzi [22] or Leishmania [21]. Similarly, 38 sites that were methylated in both T. brucei and T. cruzi, were not identified in Leishmania, and 52 Nm sites shared between T. brucei and L. donovani.

Each of our mapping methods has its shortcomings: RibOxi-seq and RiboMeth-seq cannot detect Nms close to the 5ʹ end of an RNA, and 2ʹ-OMe-seq is prone to effects originating from RNA secondary structure. Receiver Operating Characteristic (ROC) curves and Matthews Correlation Coefficient (MCC) plotted for RiboMeth-seq RMS score (supplementary Fig. S2B) suggest that this method detects large number of false positives in T. brucei rRNA (81 false positives) (Sensitivity = 0.99, Specificity = 0.24, AUC ~0.79, MCC = 0.365). However, the substantial overlap of identified sites among the three mapping approaches strongly indicates that, with the potential exception of adjacent modified sites, we have comprehensively determined the extent of Nm sites in T. brucei. Accordingly, the complete repertoire described here is supported by bioinformatics predictions.

Comparing Nms in the two life stages shows differential levels of modification

The results (Fig. 2 and Table 1) indicate the presence of at least 18 Nms that are increased in the BSF life stage and 36 Nms that are decreased in BSF, though only 8 of these were significantly altered (by >3-fold change) (Supplementary Table T5). To determine whether the abundance of the guiding snoRNAs for these differentially regulated sites were also elevated in the BSF, we performed northern analysis on two snoRNAs. The blots revealed the overexpression of snoRNAs in BSF compared to PCF (Fig. 3A). The elevation of the snoRNA levels was also examined using small RNA-library sequencing (Fig. 3B, Supplementary Table T3), and was noted in previously published transcriptomes of PCF and BSF [45] (Fig. 3C). The overexpression of snoRNA seems to be a specific rather than a general phenomenon, since not all the C/D snoRNAs were overexpressed in BSF (Supplementary Table T3). snoRNA levels were examined experimentally (Fig. 4A), and indeed, a similar degree of enhancement was found in our experiments and in the RNA-seq data. The increase of snoRNA abundances was not restricted to a specific snoRNA within a cluster, but rather, the regulation of expression encompassed entire clusters (Fig. 3C), which is expected since most of the C/D snoRNAs are encoded in clusters and are processed from polycistronic transcripts that undergo trans-splicing and polyadenylation [18].

Figure 3.

Figure 3.

snoRNA expression analysis. (A) Total RNA (10 μg) from PCF or BSF was separated on a 12% denaturing polyacrylamide gel and detected by northern blotting with complementary probes to the specified snoRNAs. Data are presented as mean ± S.E.M. Experiments were done in triplicate (n = 3). TB9Cs2C7 was used as loading control. (B) Coverage of selected snoRNA coding regions in T. brucei small RNome. The read distribution profile of the selected snoRNA coding sequence based on the PCF (blue) and BSF (red) small RNA libraries described in this study. (C) Coverage of selected snoRNA precursors. The read distribution profile of the snoRNA precursor sequence based on the PCF (blue) and BSF (red) libraries, as described in [45]. D) Correlation between Nm level and snoRNA expression. (I) Pairwise comparison of RMS score fold-change and snoRNA expression fold-change (log2) in both life stages from at least three independent libraries. Spearman’s correlation coefficient (R) and p-value are indicated. (ii) Pairwise comparison of RMS score fold change and snoRNA expression fold-change (log2) using a cut-off of baseMean >6000, in both life stages.

Interestingly, not all Nm sites that were guided by elevated snoRNAs were elevated in BSF parasites (Table 1), suggesting that a higher snoRNA level alone is not sufficient to increase the cognate modification (see Discussion). A low correlation (r = 0.34, p = 0.0013) was observed between the fold change of RMS value and log2FC of snoRNA levels in the two life stages (Fig 3Di). A better correlation (r = 0.554, p = 0.0041) was observed when a subset of abundant snoRNAs having baseMean score >6000 (calculated by DESeq2 [34]) (Supplementary Table T3) was used (Fig 3Dii).

To evaluate the functional significance of the positions that were found to be differentially modified in BSF parasites, their positions were depicted on the secondary structure (Fig. 46) of the rRNA, and on the tertiary model (Fig. 7). The hypermodified (FC>3 in BSF) and hypomodified Nm (FC>3 in PCF) sites are boxed and illustrated in the 3D structure. The data clearly indicate that these differentially modified positions are not randomly distributed, but are clustered mostly around the functional domains such as the A, P sites and PTC. Most of the variable Nm sites in BSF and PCF parasites are adjacent to each other, except for a small number of distinct sites such as Cm1777 or Cm1844 (in T. brucei) located next to the expansion segment ES9 S (Supplementary Fig. 4–6). Um1576 or Um1630 (in T. brucei) guided by Tb10Cs2”C3 is the most hypermodified Nm site in BSF parasites, and is trypanosome-specific. Interestingly, among the variable sites elevated in BSF, LSU Um916 and Am927 are guided by the same snoRNA (TB11Cs4C2).

Figure 5.

Figure 5.

Localization of Nms and ψs in the secondary structure of T. brucei LSUα rRNA. Boxes highlighted in blue show the Nm sites whose level is increased in BSF (>10% compared to PCF) and those in yellow show the hypermodified pseudouridines (ψ) [20]. T. brucei specific Nm (not detected in T. cruzi or L. donovani Cryo-EM studies) are indicated by an asterisk (*).

Figure 7.

Figure 7.

Variable Nm modifications in T. brucei are located around functional domains of the ribosome. Nm sites whose levels are reproducibly regulated (>10%) in T. brucei BSF are indicated. Nm sites whose level is increased in BSF (compared to PCF) are coloured in red and those whose level is decreased in BSF are coloured in blue, respectively. Hypermodified and hypomodified sites (FC>3) are indicated by a green box. A- P- and E- site tRNAs are represented as a gold surface. The peptidyl transferase centre (PTC) and protein exit tunnel (marked with a purple dotted line) are indicated. The 3D representation is based on the published L. donovani cryo-EM coordinates deposited as 6AZ3 and 6AZ1 for the large and small subunits, respectively [21]. L.donovani Nm sites corresponding to T. brucei SSU and LSU rRNA are shown.

Why three methods were necessary to determine the Nm landscape of T. brucei rRNA

The two critical issues regarding Nm mapping is the location of the modification and its level. The RibOxi-seq tuned out to faithfully detect 97 out of the 99Nms that most likely exist. However, in order to find the additional missing 2 Nms another detection method was necessary and these two missing Nms were verified by RiboMeth-seq and 2ʹ-OMe-seq. Note that RiboMeth-seq is the only method used in this study to quantitate the level of the Nm, failed to detect all the Nm site ab- initio using the stringent defined criteria [24,25,30]. Indeed, only 35 bona-fide sites were detected and 81 false-positives were observed using RiboMeth-seq. This result could reflect the relatively low level of modification on individual sites compared to what is reported for yeast and humans [24,25,27,30]. We therefore manually inspected all the sites detected by RibOxi-seq for their fragmentation profile in RiboMeth-seq data and we obtained evidence for their resistance to alkaline hydrolysis that was observed for all the Nm sites. The quality of 2ʹ-OMe-seq did not afford detection of all the Nms ab- initio and many false-positives also detected (AUC = 0.5). Note, that in this study we used this method only to verify the bone-fide sites detected by the other two methods. The failure to detect all the sites by this method may also stem from the low level of the modification on the individual sites that might be compensated in trypanosomes by having many adjacent sites that is interferes with the detection. Based on the experience of this study, RibOxi-seq detects most of the sites and should be the basis for de novo mapping. However, the two other methods and especially RiboMeth-seq is necessary for verification and quantitation. Our mapping and assignment of the Nms is not only based on the three methods but are also supported by the presence of the cognate C/D snoRNAs to guide the specific modification that we assigned for each of the Nm. In many cases, we also obtained support from the Cryo-EM data [21,22]

Discussion and conclusion

In this study, we mapped the Nms on rRNA of both life stages of T. brucei. The study identified at least 99 Nm sites with high confidence. Using RiboMeth-seq, we identified several Nm sites in T. brucei rRNA that are located near functional domains of the ribosome, and are differentially modified between the BSF and PCF stages.

Recent studies demonstrated that Nm levels are not homogeneous across rRNA in other systems. Variation in some Nm modifications was found in exponentially growing cultured cells and developing tissues of the mouse [2327,46]. This implies the presence of sub-populations of ribosomes that might assume specific functions. It was also demonstrated in yeast that certain Nms in rRNA are only partially methylated [24,30,47].

It is well accepted that in humans, no direct correlation exists between the level of the modification and the abundance of the guiding snoRNAs, although there is a tendency for sites targeted by multiple snoRNAs to have higher levels of modification [25]. These generalizations seem to be partially true in trypanosomes. Only ~0.55 correlation (Spearman’s correlation) was found for abundant snoRNAs and the Nm modifications that they guide. We have previously described a machine learning algorithm (SVM) that was able to predict the abundance of snoRNAs at an 85% rate of success [48].

In this study we compared our data with the data derived by Cryo-EM of both T. cruzi and Leishmania ribosomes [21,22]. Not all sites observed by Cryo-EM were observed by our mapping and the vice-versa. Indeed, also in the human system, several new Nms were revealed in the Cryo-EM of human ribosomes that were not detected by RiboMeth-seq and the vice-versa. It was argued that these sites might not be accessible in the rRNA, or perhaps their methylation rate was too low to be detected by the mapping method [23,25,26,49].

Our study revealed 22 novel Nm sites in T. brucei. Among these newly identified sites, 8 were detected near previously predicted sites. Importantly, these sites were also seen in the Cryo-EM studies. Although many of the trypanosome modifications are located in positions which are already rich in modifications and are near functional domains. We also observed 23 unique positions that were not mapped by Cryo-EM studies [21,22]. Indeed, 14 of these unique sites were also predicted to exist in L. major rRNA. However, five of these sites are conserved across different domains of life but were not mapped in the Cryo-EM of T.cruzi or L. donovani.

Our analysis also revealed 2 Nms (LSUa-Cm747 and LSUb-Am544) that are specific for T. brucei, yeast and E. gracilis, and 3 Nms (LSUa-Am743, LSUa-Am1180, SSU-Am2153 and SSU-Um2154) that exist only in T. brucei. Of note, RibOxi-seq revealed 14 additional sites that are located adjacent to known Nm sites, suggesting the presence of ~113 Nms in T.brucei, similar to the number suggested by cryo-EM of L. donovani and T. cruzi [21,22] and that of Crithidia fasciculata [50]. These neighbouring sites are likely to be guided by the same snoRNA, suggesting that more flexible rules for guiding Nms exist in these organisms, as in yeast [38].

The most striking observation from the RiboMeth-seq analysis in Fig. 2 is that most RMS scores are low, suggesting that not all rRNA molecules are modified in these positions. Such a low score could result from an experimental flaw related to the construction of RiboMeth-seq libraries as the over amplification of cDNA may diminish the differential signal between the modified and neighbouring positions, as previously observed [27]. However, our protocol uses a low number of amplification cycles in the library preparation (9 cycles compared to 12 cycles) [27]. We suggest that the low level of modification is due to the large number of sites available for Nm modification and insufficient core proteins to assemble a large number of each snoRNP, thereby limiting the modification at each position. Alternatively, the proximity of the Nm sites results in competition between snoRNAs for access to the relevant domain for guiding the modification. At the present time, we cannot determine whether a single rRNA can carry adjacent modifications, and this awaits the development of mapping technology at single molecule resolution. It will be interesting to examine if depletion of snoRNAs especially guiding hypermodified positions in BSF, can affect the ability of the parasites to transform from one stage to another and to establish infection.

In sum, this study highlights the need for more than one mapping approach to confidently determine the landscape of Nm modification. The methodology presented here suggests to first use RibOxi-seq for detection of the Nms that needs to be verified by additional mapping approaches together with data on snoRNAs guiding these modifications. Data analysis is also assisted by using RNA from cells depleted for the methyltransferase, NOP1. The need to have multiple approaches for mapping in trypanosomes may stem from the rich repertoire of Nms that also resulted in low level of modification on most of the individual sites.

Supplementary Material

Supplemental Material
Supplemental Material

Acknowledgments

This work was supported by a grant from the Israel-US Binational Science Foundation (BSF), by the I-core programme of the Planning and budgeting committee and the Israel Science Foundation grant 1796/12, NIH grants R01 AI028798 and AI110325 to C.T., R21 AI142149 to A.G. and R21 MH110955 to G.C. S.M. holds the David and Inez Myers Chair in RNA silencing of diseases.

Funding Statement

This work was supported by the National Institutes of Health [R01 AI028798 and AI110325]; National Institutes of Health [R21 MH110955]; National Institutes of Health [R21 AI142149]; United States - Israel Binational Science Foundation [2015219]; I-core program of the Planning and budgeting committee and the Israel Science Foundation grant 1796/12.

Disclosure statement

The authors declare no competing financial interests.

Data Accessibility

The RNA sequencing data related to this study have been deposited in the NCBI database under the accession number GSE102516 (https://www.ncbi.nlm.nih.gov/bioproject/?term=GSE102516) and PRJNA526606 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA526606) (https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP188173). The bioinformatics scripts used in this study is provided in the ‘Supplementary Files’.

Supplementary material

Supplemental data for this article can be accessed here.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material
Supplemental Material

Data Availability Statement

The RNA sequencing data related to this study have been deposited in the NCBI database under the accession number GSE102516 (https://www.ncbi.nlm.nih.gov/bioproject/?term=GSE102516) and PRJNA526606 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA526606) (https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP188173). The bioinformatics scripts used in this study is provided in the ‘Supplementary Files’.


Articles from RNA Biology are provided here courtesy of Taylor & Francis

RESOURCES