Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2022 Oct 19;50(19):11229–11242. doi: 10.1093/nar/gkac825

Structure-based screening for functional non-coding RNAs in fission yeast identifies a factor repressing untimely initiation of sexual differentiation

Yu Ono 1,3, Kenta Katayama 2,3,3, Tomoki Onuma 4, Kento Kubo 5,6, Hayato Tsuyuzaki 7,8, Michiaki Hamada 9,10,11,, Masamitsu Sato 12,13,14,
PMCID: PMC9638895  PMID: 36259651

Abstract

Non-coding RNAs (ncRNAs) ubiquitously exist in normal and cancer cells. Despite their prevalent distribution, the functions of most long ncRNAs remain uncharacterized. The fission yeast Schizosaccharomyces pombe expresses >1800 ncRNAs annotated to date, but most unconventional ncRNAs (excluding tRNA, rRNA, snRNA and snoRNA) remain uncharacterized. To discover the functional ncRNAs, here we performed a combinatory screening of computational and biological tests. First, all S. pombe ncRNAs were screened in silico for those showing conservation in sequence as well as in secondary structure with ncRNAs in closely related species. Almost a half of the 151 selected conserved ncRNA genes were uncharacterized. Twelve ncRNA genes that did not overlap with protein-coding sequences were next chosen for biological screening that examines defects in growth or sexual differentiation, as well as sensitivities to drugs and stresses. Finally, we highlighted an ncRNA transcribed from SPNCRNA.1669, which inhibited untimely initiation of sexual differentiation. A domain that was predicted as conserved secondary structure by the computational operations was essential for the ncRNA to function. Thus, this study demonstrates that in silico selection focusing on conservation of the secondary structure over species is a powerful method to pinpoint novel functional ncRNAs.

INTRODUCTION

Eukaryotic genomes are pervasively transcribed (1). Non-coding RNAs (ncRNAs) are transcripts which do not encode proteins. There are thousands of ncRNAs detected in the human genome (2–5). Although ncRNAs used to be recognized as junk molecules, recent studies reported some ncRNAs participating in cellular activities such as gene regulation (6). Nevertheless, there are few ncRNAs whose functions have been elucidated (7), and still a large portion of ncRNAs remain uncharacterized. Since those uncharacterized ncRNAs may play biological roles in cells, it is necessary to search for ncRNAs to identify those which are functional, to further elucidate the enigmatic landscape of ncRNA functions.

In general, ncRNAs are classified into two types by size: short ncRNAs (∼200 nt) and long ncRNAs (> 200 nt). Short ncRNAs include well-studied conventional ncRNAs such as tRNA, rRNA, snRNA, snoRNA and miRNA. The functions of long ncRNAs may be rather unconventional and diverse: for instance, XIST/Xist RNA, one of the earliest examples of functional ncRNAs, participates in X-chromosome inactivation (8). It covers one of the X-chromosomes in female cells and epigenetically induces gene silencing by recruiting chromatin-remodelling factors (9). In addition, several long ncRNAs are reported to participate in transcriptional or translational regulation by acting as a scaffold for protein complexes, or as an aptamer for specific proteins or mRNAs (10–12).

To identify functional ncRNAs further, a number of screenings were conducted. More than 16 000 human ncRNAs were systematically knocked-down in seven cell lines, to identify 499 that affect cell growth (13). However, knocking out each ncRNA for inspection is effective but laborious. Informatics-based screenings that rely on evolutionary conservation of ncRNAs over species have also been performed. Conservation of ncRNAs can be examined from the aspect of sequences, location and structures. First, nucleotide sequences of ncRNAs are conserved only weakly even in related species (14,15). Second, conservation of gene synteny on chromosomes would help to find functional ncRNAs. For example, physical co-localization of gene loci on the chromosome is occasionally conserved over species. Indeed, more ncRNAs are found conserved in location than in sequence (14,15). ncRNAs with a conserved location might participate in regulation of neighbouring genes (16). Third, ncRNAs form secondary structures via base pairing. Secondary structures could be considered as functional motifs of ncRNAs. In most cases, functional ncRNAs participate in biological processes through interaction with proteins, and this requires folding of the RNAs. For example, XIST RNA uses the AUCG tetraloop structure at the 5′ terminus to recruit polycomb repressive complex 2 (PRC2) (9).

Here, assuming that functional ncRNAs are structurally conserved, screenings that rely on the secondary structure may pick up potential functional ncRNAs. Computational programs that evaluate the structural conservation such as RNAz (17), EvoFold (18) and CMfinder (19) have been developed. Several studies have screened entire genomic DNA of bacteria, archaea, fungi and vertebrates for conserved RNA structures, although few of the predicted potential ncRNAs have been confirmed to play cellular roles (20–25).

The fission yeast Schizosaccharomyces pombe is widely used as a model to investigate cellular activities, taking advantage of easy genetic manipulation including gene knockout by use of polymerase chain reaction (PCR)-based gene targeting methods (26). Schizosaccharomyces pombe has a 12 Mb genome with >1800 annotated ncRNAs. There are some unconventional functional ncRNAs identified. For example, meiRNA, a meiosis-specific long RNA transcribed from the sme2 gene locus, is known to be essential for sexual differentiation, particularly for progression of meiosis, as well as chromosome pairing (27–30). Some mRNA-type long ncRNAs controlling stress-regulated gene expression have also been identified (31,32). In addition, characterization of long ncRNAs include rse1 (33), SPNCRNA.1343 (16), prt (34) and nam1 (35) has been reported (36).

Systematic deletion of long intergenic ncRNAs in fission yeast and their functional profiling have been recently conducted to present phenomics datasets for long ncRNAs (37). Screenings for functional ncRNAs have also been conducted previously based on sequence and location conservation, targeting annotated ncRNAs (15,16). For structure-based screening, an analysis targeting fungal genomes (>60 species) has predicted 15 structurally conserved ncRNA motifs, including one variant snoRNA motif which exists in S. pombe (22). However, no studies have conducted screening based on structural conservation for S. pombe ncRNAs, and only a few studies exist for the budding yeast Saccharomyces cerevisiae (20,25).

Here, we attempted to identify novel functional ncRNAs in S. pombe through combination of structure-based screening in silico and biological screening in vivo. In the first step, we designed in silico selection of structurally conserved ncRNAs among related species, based on comparative genomics among S. pombe and three other Schizosaccharomyces species: S. cryophilus, S. japonicus and S. octosporus (Figure 1A), as previously performed for budding yeast (25). Candidates were then subjected to the in vivo screening, in which we constructed S. pombe knockout mutants of the candidate ncRNAs and tested their phenotypic traits under several environmental and stress conditions. In this research, we screened a pool of 1857 annotated ncRNAs of S. pombe and nominated 151 ncRNAs through informatics. After their re-assessment, 14 of them were subjected to biological tests. Through the final assays, we found that an ncRNA named ‘nc1669’ is required for continuation of vegetative growth in S. pombe cells.

Figure 1.

Figure 1.

Structure-based analyses identify 151 ncRNAs conserved in secondary structure. (A) The phylogenetic tree of four Schizosaccharomyces species based on comparative genomics, adopted from a previous study (15). (B) The outline of the computational screening. Black, S. pombe; magenta, S. cryophilus; green, S. japonicus; blue, S. octosporus. [1] S. pombe 1857 ncRNAs were assessed for sequence similarity, and 1053 ncRNAs conserved in sequences with any of three Schizosaccharomyces species are selected as pairwise alignments. [2] Those were subjected to MAFFT assays to produce multiple alignments based on secondary structure conservation. [3] The alignments were digested into 10 245 fragments. [4] Conserved secondary structure motifs (CSMs, 180 motifs in total) were detected by RNAz. [5] CSMs were mapped onto the S. pombe genome to identify conserved secondary structure genes (CSGs; 151 genes in total). (C) Details of 1857 ncRNA genes in S. pombe used as input of the analyses. Numbers of genes in each category are shown in parentheses. Percentages of conventional ncRNAs (tRNA, rRNA, snRNA and snoRNA) and unconventional ncRNAs are shown below. (D) Details of 151 CSGs identified in (A). (E) Examples of CSMs in SPNCRNA.1357 (a), SPNCRNA.1145 (b) and SPNCRNA.933 (c), detected in step [4] in (B). Predicted structures were drawn using the RNAalifold algorithm (72).

MATERIALS AND METHODS

Yeast strains and genetics

Schizosaccharomyces pombe strains used in this study are listed in Supplementary Table S1. The homothallic (h90) prototrophic strain JY3 was used as a prototrophic wild-type (WT) strain for all the experiments. KEN0040 and KEN0041 (nc1669Δ), as well as KEN0187 and KEN0188 (nc1669FLΔ), were constructed from JY3. MJ0006, MJ1551, SY50 and KEN0137 were used as controls in assays for drug sensitivity and cell viability.

Standard methods for S. pombe genetics were used (38). YE5S medium was used for growing cells. YE5S medium containing G418 (100 μg/ml) was used for selection of a strain having the kan gene that confers G418 resistance. SD–3S medium (SD medium containing yeast nitrogen base and glucose supplemented with adenine sulphate and leucine) was used for assays for the hyper-mating phenotype.

Gene deletion was performed using standard methods for PCR-based gene targeting (26). Oligonucleotide sequences for gene deletion and colony PCR are listed in Supplementary Table S2.

Plate-based screening

For in vivo screening of deletion mutants of candidate genes, cells were grown overnight on YE5S agar plates, and then suspended in ddH2O to the concentration of 0.2 × 107 cells/ml. Ten-fold serial dilutions of 107–102 cells were spotted on each assay plate. The assay plates are based on YE5S without or with thiabendazole (TBZ; 10–20 μg/ml; Sigma), hydroxyurea (HU; 5–10 mM; Sigma), H2O2 (1–5 mM; Wako), caffeine (7.5–12.5 mM; Sigma), CdSO4 (10–20 μg/ml, Sigma) or sorbitol (1–2 M; nacalai tesque). To test the UV sensitivity, 10-fold dilutions of cells were first spotted on YE5S agar plates, and then cells were irradiated with UV at 200 or 400 J/m2 with Stratalinker® UV Crosslinker (Stratagene). The plates were then incubated at 26.5°C. Temperature sensitivity of deletion mutants was tested on YE5S agar plates incubated at 22, 26.5 and 36°C.

Sexual differentiation assays

Sexual differentiation (mating, meiosis and sporulation) can be induced on SPA (sporulation agar), which does not contain nitrogen sources. Cells grown in YE5S were collected and spotted on SPA and incubated at 26.5°C for 10 or 24 h.

Mating in the presence of nitrogen (hyper-mating) was assayed on SD–3S agar plates: cells grown in YE5S were collected and suspended in a mixed solution of leucine and uracil to the concentration of 0.4 × 107 cells/ml. The cell suspension (5 μl) was spotted onto SD–3S plates.

For observation of cells, cells were suspended to 1.6% formaldehyde (Thermo Fisher Scientific) and left to stand for 2 h. Samples were centrifuged to remove supernatants, and cells were washed with and resuspended in phosphate-buffered saline (PBS). Prior to observation, samples were subjected to mild sonication using an ultrasonic homogenizer VP-050N (TAITEC) for 45 s to dissolve cell aggregates. An aliquot (1 μl) of the cell suspension was mounted on a slide glass together with 1 μl of VECTASHIELD® mounting medium with 4′,6-diamidino-2-phenylindole (DAPI; VECTOR laboratories) and covered with a coverslip.

Mating efficiency was calculated by the following formula: mating efficiency = 2 × number of zygotes/(2 × number of zygotes + number of unmated cells).

Microscopy

Images of DIC (differential interference contrast) and fluorescence were acquired using the DeltaVision-SoftWoRx system (Applied Precision) (39). For fluorescence images, 12 sections with 0.4 μm intervals were taken along the z-axis, deconvolved and projected using the Quick Projection algorithm of the SoftWoRx software.

RNA preparation and gene expression analyses

Cells grown in YE5S were dropped onto SD–3S plates as described above and incubated at 26.5°C for 48 h. Spots were collected and suspended in 1 ml of ice-cold diethylpyrocarbonate (DEPC)-treated water, spun down at 2000 rpm for 5 min at 4°C and supernatants were removed. Alternatively, for comparison of RNA levels depending on nitrogen availability in the medium (Figure 4), cells were cultivated in the presence (EMM + N) or absence (EMM–N) of the nitrogen source for 4 h and collected similarly.

Figure 4.

Figure 4.

nc1669 represses expression of ste11 in the presence of nitrogen. (A) Expression levels of ste11 (i), nc1669 (ii) and nc1670 (iii) in the indicated strains, cultivated in the presence (+N) or absence (–N, 4 h) of the nitrogen source, were quantified and normalized with the act1 mRNA level. Average ± SEM (three trials). *Pp = 0.0149, **P = 0.00934 (i), *P = 0.0282 (ii), n.s., not significant, P = 0.0685 (iii) (Student's t-test). (B) Cascade of sexual differentiation in nitrogen-rich conditions and under nitrogen starvation.

The pellets were snap-frozen with liquid nitrogen and stored at –80°C overnight. The pellets were suspended in 520 μl of TES and 520 μl of phenol:chloroform:isoamyl alcohol 25:24:1 (pH 5.2) (nacalai tesque), incubated for 1 h at 65°C and placed on ice for 5 min. Suspensions were rotated at 14 000 rpm for 15 min at 4°C. Aqueous phases were transferred to new tubes with 400 μl of phenol:chloroform:isoamyl alcohol 25:24:1 (pH5.2). Aqueous phases were transferred to new tubes with 400 μl of chloroform and spun down at 14 000 rpm for 5 min at 4°C. Aqueous phases (∼300 μl) were transferred to new tubes, and 60 μl of 1.5 M sodium acetate (pH 5.3) and 750 μl of ice-cold 100% ethanol were added and stored at –20°C for 16 h. Samples were then rotated at 14 000 rpm for 20 min at 4°C, and supernatants were removed. After washing with 100 μl of 70% ethanol, samples were placed for 5 min at room temperature. Supernatants were then removed, and RNA pellets were dissolved in 50 μl of DEPC-treated water through repetitive freeze and thaw three times. Finally, the RNA samples were stored at –80°C.

DNAs contaminating RNA samples were eliminated by the TURBO DNA-free Kit (Invitrogen). A 5 μl aliquot of 10× Turbo DNase buffer, 1 μl of Turbo DNase, RNA extracts containing 8 μg of RNA and up to 50 μl of DEPC-treated water were mixed and incubated for 30 min at 37°C. DNase inactivation reagent (5 μl) was then added and incubated for 5 min at 25°C. Finally, the samples were centrifuged at 14 000 rpm for 5 min and ∼40 μl of supernatants were transferred to new tubes.

For reverse transcription, DEPC-treated water (1.2 μl), 10× Reverse transcription buffer (2 μl; Applied Biosystems), dNTPs (0.8 μl), 10× random primer or 1 μM strand-specific primer (Supplementary Table S2; 2 μl), reverse transcriptase (1 μl) and extracted RNA (2 μg) were mixed together and placed in the thermal cycler as follows: 25°C for 10 min, 37°C for 120 min and 85°C for 5 min, followed by cooling down at 4°C. For subsequent quantitative PCR, 5 μl of the samples were mixed with H2O (0.6 μl), THUNDERBIRD® SYBR qPCR Mix (10 μl; Applied Biosystems), 3 μM forward primer (2 μl) (Supplementary Table S2), 3 μM reverse primer (2 μl) (Supplementary Table S2) and 50× ROX reference dye (0.4 μl) in a reaction plate. The RNA level was analysed with the StepOne Real Time PCR system (Applied Biosystems). Data were normalized with the expression level of the act1 mRNA encoding S. pombe G-actin.

Construction of plasmids

For expression of genes from plasmids, we modified the plasmid pREP1 as follows: the selection marker gene LEU2 of pREP1 was replaced with the natMX6 marker cassette, to create pREP1-NatR. This was used because the host strain does not need to possess a leucine auxotroph in the genetic background that may affect mating ratios. For expression of full-length or truncated nc1669 genes and tRNAGlu08, the nmt1 promoter of pREP1-NatR was removed, and the genomic sequence of each ncRNA gene including the promoter and terminator regions was inserted, so that the inserted RNA genes can be natively expressed from their own promoters and terminators. For expression of the nc1669 gene, the annotated gene region (2258 bp) flanked by the upstream 1153 bp and the downstream 1086 bp was cloned into pREP1-NatR without the nmt1 promoter/terminator. The product is termed pREP1-NatR-nc1669FL (without Pnmt1), from which the following truncated plasmids were derived: pREP1-NatR-nc1669woCSR and pREP1-NatR-nc1669FL-wo-nc1670. For expression of tRNAGlu08, the annotated gene region (72 bp) flanked by 1152 bp upstream and 919 bp downstream was inserted to replace Pnmt1. This plasmid was termed pREP1-NatR-SPBTRNAGLU.08.

Construction of a mutant with a shuffled sequence

The conserved secondary structure motif (CSM) sequence (117 nt) of nc1669 was shuffled as follows. The sequence was randomly shuffled using uShuffle (40) to maintain the nucleotide composition. After randomization, an increase of the loop-forming probability was evaluated using ParasoR (41). The operations of uShuffle and ParasoR were repeated 50 650 times to compose the pipeline to nominate the most probable candidate sequence termed CSM-shf1. The loop formation probability of the original CSM sequence was 0.4496, and that of CSM-shf1 was 0.5819, the highest possibility of loop formation of all the candidates. The sequence of CSM-shf1 is shown in Supplementary Table S2.

The CSM-shf1 DNA fragment was chemically synthesized, and was then introduced into JY3 (WT) cells to create the nc1669 mutant harbouring the shuffled sequence using CRISPR/Cas9 [clustered regularly interspaced palindromic repeats (CRISPR)/CRISPR-associated peptide 9] methods. The pSR6 plasmid containing the crRNA sequence originated from the internal sequence of nc1669 (Supplementary Table S2), and Cas9 (42) was constructed and introduced into JY3 cells together with the donor DNAs containing the CSM-shf1 sequence. The genotype of transformants was confirmed by PCR. A pair of primers (Supplementary Table S2) was used to detect the CSM-shf1 mutant sequence.

Computational analyses

Extraction for ncRNAs and gene annotation

The S. pombe genome sequence and annotations were downloaded in the GenBank format from Pombase (https://www.pombase.org/data/genome_sequence_and_features/OLD/20170906/genbank/) (43,44) in June, 2016. All genes annotated as ‘misc_RNA’ were extracted with their gene names, their exonic DNA sequences and information on their genomic location. We defined ncRNAs as the sequences annotated in the ‘misc_RNA’ category, which is composed of conventional ncRNAs (tRNAs, rRNAs, snRNAs and snoRNAs), pseudogenes and unconventional ncRNAs that have not been characterized. The number of extracted ncRNAs was 1857 in total. Information on gene annotation was also downloaded from the Ensembl Fungi database, release 39 (https://fungi.ensembl.org/info/data/ftp/index.html) in the GFF3 format.

Pairwise alignments

Pairwise alignments of S. pombe sequences in comparison with each of three close Schizosaccharomyces species, namely ‘Schizosaccharomyces cryophilus versus Schizosaccharomyces pombe’, ‘Schizosaccharomyces japonicus versus Schizosaccharomyces pombe’ and ‘Schizosaccharomyces octosporus versus Schizosaccharomyces pombe’, were downloaded from the Ensembl Genomes database, release 86 (https://fungi.ensembl.org/info/data/ftp/index.html) (45), and the complete data were used without introducing any further cut-off by ourselves in the following procedures. All three alignments were constructed by the LastZ alignment algorithm (46). Using the pairwise alignments produced from the sequences of 1857 S. pombe non-coding genes as input, sequences that aligned to any of the related species were extracted. Genes that were not aligned to any sequences of the other three species, or genes aligned with sequences shorter than 50 bp were excluded in this process.

Structure-based screening

Structural analyses shown below were partly performed on the NIG supercomputer at ROIS National Institute of Genetics. RNAz software version 2.1 (17,47) was used to evaluate structural conservation of RNAs. RNAz requires alignment as input. It uses a support vector machine (SVM) learning algorithm to judge whether the input alignment is ‘structural RNA’ through comprehensive evaluation of the sequence similarity, structural similarity, structural stability and covariance base substitution. ‘Structural RNA’ has a defined secondary structure which is of functional importance and is conversed in the species used for comparison (47). For each input alignment, RNAz returns a P-value (range 0–1), which represents the probability of being a ‘structural RNA’.

The genes that were paired with genes in any of closely related species were defined as ‘Conserved Primary-sequence Gene’ in Figure 1B (1053 genes). Those were then applied to construction of structure-based multiple alignments with the MAFFT software (MAFFT v7.310) using the ‘Q-INS-i’ option (48). Then, alignments were digested into 120 nt fragments with a sequential 80 nt overlap with the rnazWindow.pl algorithm provided with the RNAz software. As represented in Figure 1B, 10 245 fragments were extracted by this procedure. Fragments of alignments were subjected to analysis with the RNAz software, using options ‘–forward, –no-shuffle, –cutoff = 0.5’. Note that only the forward direction of the alignments was used for evaluation. Alignments with a P-value >0.5 were extracted. Then, when applicable, neighbouring fragments were fused into one with the rnazCluster.pl algorithm, and fused regions were defined as CSMs (180 motifs were detected in total). Finally, S. pombe genes conserved within more than one of the species were defined as conserved secondary structure genes (CSGs).

Estimation of the false discovery rate

To estimate the number of false positives selected by RNAz, the following analyses were performed. After creating structure-based multiple alignments, the nucleotide sequence of an extracted S. pombe fragment in each alignment was randomly shuffled to destroy the secondary structure, using the random shuffling algorithm rnazRandomizeAln.pl provided by RNAz. Thereafter, shuffled alignments were subjected to similar RNAz analyses to evaluate their structure conservation. Consequently, 6.1% (64 out of 1053 input genes) of CSGs were found to be conserved even when their sequences were randomized. This indicates that those genes could be selected in a non-specific manner by RNAz, and the ratio (6.1%) has been defined as the false discovery rate (FDR).

RESULTS

Structure-based screening identified 151 ncRNAs evolutionarily conserved in closely related species

At the commencement of the in silico screening, 1857 ncRNAs have been annotated in the S. pombe genome in GenBank (June, 2016) (43). To identify functional ncRNAs, we selected ncRNAs which are structurally conserved in other Schizosaccharomyces species using computational programmes (Figure 1B).

The collection of 1857 non-coding gene sequences comprises tRNAs, rRNAs, snRNAs, snoRNAs, pseudogenes and uncharacterized ncRNAs (Figure 1C). The in silico part of the screen was carried out in the following steps (Figure 1B). See the Materials and Methods for details.

  • Selection of ncRNAs with conserved primary sequences in closely related species ([1], Figure 1B). To identify conserved loci between S. pombe and at least another related species, pairwise alignments provided in the Ensembl Genome database (45) were referred to. This procedure selected 1053 S. pombe ncRNA genes.

  • Structure-based alignment of the selected ncRNAs ([2], Figure 1B). The MAFFT software creates multiple alignments, considering secondary structures in addition to primary sequences of the input ncRNA sequences (48).

  • Fragmentation ([3], Figure 1B). The multiple alignments obtained above were digested into 120 bases with a sequential 80 nt overlap, to produce 10 245 fragments.

  • Selection of RNA motifs with conserved secondary structures in the species using the RNAz software (17,47) ([4], Figure 1B). As a result, 180 motifs (mean length = 147 nt) were extracted as conserved regions, which were then termed CSMs.

  • Identification of conserved ncRNA genes in the species ([5], Figure 1B). The extracted CSMs were mapped onto the S. pombe genome, whose genes were defined as CSGs. The FDR, evaluating the accuracy of RNAz operations, was estimated as 6.1%, indicating that ∼64 out of 1053 genes could correspond to false positives.

Through the whole procedures, 151 genes were extracted as CSGs: ncRNAs with secondary structures that are conserved in S. pombe and at least another Schizosaccharomyces species (Figure 1D). These CSGs include 71 uncharacterized ncRNAs and 1 pseudogene (in total, 47.7% of CSGs), which are candidates for novel functional ncRNAs.

In the screen, we also included well-known functional ncRNAs (tRNAs, rRNAs, snRNAs and snoRNAs) to evaluate the efficiency and validity of the screening procedures. Our screening procedures highlighted 25.7% (79 out of 307) of those canonical ncRNAs (Figure 1C, D). Assuming that those well-known ncRNAs are generally conserved in sequence and structure over species, our computational screening could detect ∼25% of total functional ncRNAs. The ratio was defined as the DS (detection sensitivity). Well-known functional RNAs account for 16.4% of 1857 total non-coding genes in S. pombe, whereas they account for 52.3% of the total CSGs after the screen (Figure 1C, D). This finding indicates that our structure-based in silico screening has successfully pinpointed uncharacterized RNAs with conserved structures and sequences from the entire RNA pool.

Through the analyses, we identified many types of CSMs and found that those estimated structures varied among molecules. As can be seen in the examples shown in Figure 1E, some have simple linear structures (a and b), while some have complicated structures with multiple branching loops (c).

Biological screening was performed to identify the functional ncRNA

To reveal the functions of 71 uncharacterized CSGs, those genes were listed for biological screening. Among them, 57 were antisense RNAs that overlap with protein-coding genes on their complementary strand, and 14 were ncRNAs that did not overlap with protein-coding sequences on either strand (Figure 2AC). We focused on the ncRNAs without protein CDS overlaps. This is mainly for a practical reason, i.e. to achieve an efficient high-throughput in vivo screen. Knockout of ‘antisense RNA’ genes inevitably removes protein-coding genes located on the complementary strand simultaneously. This obscures which gene is responsible for the observed phenotype. In addition, antisense RNAs might often be predicted to regulate expression of the genes located on the sense strand.

Figure 2.

Figure 2.

Plate-based knockout assays to identify functional ncRNAs. (A) Schematic diagrams of SPNCRNA.254 and SPNCRNA.1032 (a) and SPNCRNA.507 and SPNCRNA.808 (b). (B) The flow diagram illustrating the strategy from in silico to in vivo procedures. (C) List of structurally conserved RNAs that do not overlap with protein-coding sequences shown with P-values computed by RNAz and the number of species in which each RNA is structurally conserved. (D) Examples of biological screening. For viability assays (i), a 1/10 serial dilution of cells of the indicated knockout mutants was spotted on YE5S plates (26.5°C) without or with HU, TBZ or UV irradiation. The images are excerpts of the whole assays shown in Supplementary Figure S3. For observation of sexual differentiation (ii), cells were spotted on SPA, and ratios of mating, meiotic progression and spore formation were monitored. Nuclei were stained with DAPI and are shown merged with DIC. Scale bar, 10 μm.

Many of 14 candidates overlapped with other ncRNA(s) in either strand, and some overlapped with untranslated regions (UTRs) of protein-coding genes. Candidate genes SPNCRNA.254 and SPNCRNA.507 overlap with other candidates SPNCRNA.1032 and SPNCRNA.808, respectively, and only one mutant lacking overlapping two genes was constructed for each case (Figure 2A). In total, 12 knockout mutants were constructed and subjected to the biological screening.

In the biological screen, mutant cells were placed under a number of environmental conditions to examine phenotypes during mitotic and meiotic cycles and in reaction to external stresses (Figure 2B). These include growth assays at high or low temperature, a reaction to UV irradiation that induces DNA damage, in the presence of HU as an inhibitor of DNA synthesis, thiabendazole (TBZ) as an anti-mitotic drug destroying microtubules, caffeine that affects cell cycle progression, H2O2 as an oxidative stress, CdSO4 for a heavy metal stress and sorbitol for osmotic stress (Figure 2B, D; Supplementary Figures S1–S7). In addition, mutant cells were observed under nitrogen starvation conditions. Schizosaccharomyces pombe undergoes sexual differentiation (mating followed by meiosis and sporulation) upon nitrogen starvation. To monitor phenotypes there, cells were cultured in media lacking nitrogen sources and the cell morphology was observed under a microscope (Figure 2D). Mating efficiencies of the knockout mutants were quantified (Supplementary Figure S8).

In the plate-based assays, there were some knockout mutants that showed phenotype, albeit partially. The deletion mutant of SPNCRNA.491Δ was moderately sensitive to 10 μg/ml TBZ (Supplementary Figures S2–S4), implying that ncRNA491 may be related to microtubule functions or regulation.

nc1669 regulates untimely initiation of sexual differentiation

The plate-based screening highlighted the knockout mutant of SPNCRNA.1669 (nc1669Δ), which showed higher mating efficiency than WT cells under nitrogen starvation (Figure 3A). Assuming that nc1669Δ cells are competent to easily undergo sexual differentiation, we hypothesized that the mutant cells may conjugate even in the presence of nitrogen sources. When cells were grown in media containing nitrogen sources, some of the nc1669Δ cells underwent mating (followed by meiosis), whereas WT cells never initiated mating (Figure 3B, C). Multiple clones, denoted as #1 and #2, of the same genotype were subjected to phenotype analyses. The penetrance of the phenotype was different between clones (Figure 3A, B). This tendency remained after genetic backcrossing, excluding the possibility that additional mutations unexpectedly occurred in some clones.

Figure 3.

Figure 3.

nc1669 is required for timing of sexual differentiation. (A) Mating efficiencies of WT and nc1669Δ cells under nitrogen starvation (on SPA plates, 10 h). Two independent colonies, denoted as #1 and #2, of the identical genotype were tested. (B, C) Mating efficiencies in the nitrogen-rich synthetic medium (SD–3S) were counted for WT, nc1669Δ and nc1669FLΔ cells (B). In (B), two independent colonies were chosen. Merged images of DIC and DAPI (nuclei) for WT and nc1669Δ cells (C). Scale bars, 10 μm. (D) Schematic diagrams of the nc1669 gene locus in WT, nc1669Δ and nc1669FLΔ strains. The black boxed regions of the gene were replaced with the marker gene conferring G418 resistance (kanR). The position of the CSM identified in the screen is shown. Arrows, direction of transcription. Dotted arrow, not clear if transcription occurs therefrom.

Untimely sexual differentiation in the presence of nitrogen sources is commonly called ‘hyper-mating’. The phenotype was seen in the nc1669Δ mutant originally created for the initial screen. It is of note that the nc1669Δ mutant still harbours the CSM sequence in the locus, which has been identified as a conserved motif over species by the bioinformatic screen as shown above (Figure 3D). The CSM overlaps with tRNAGlu08 in the antisense strand.

To examine the relationship between nc1669 and sexual differentiation, the expression level of nc1669 was monitored in response to nitrogen starvation (Figure 4A). For reference, expression of ste11 was examined, which encodes a master transcription factor for sexual differentiation and is transcriptionally up-regulated in response to nitrogen starvation (49) (i, Figure 4A). An almost comparable high level of ste11 was detected, even in the presence of nitrogen, in the nc1669Δ mutant (i, Figure 4A). Up-regulation of ste11 expression in nc1669Δ indicates that the ncRNA nc1669 negatively regulates expression of ste11 in vegetative WT cells (Figure 4B). In line with this, the expression of nc1669 was reduced in response to nitrogen starvation (ii, Figure 4A).

Although the nc1669Δ mutant lacks most of the gene, the remaining portion appeared to be highly expressed, possibly due to artificial effects due to insertion of the marker gene kanR (ii, Figure 4A). The remaining transcripts appeared non-functional (loss of function), because the null (complete deletion) mutant nc1669FLΔ (Figure 3D) exhibited the hyper-mating phenotype similarly to the nc1669Δ mutant (Figure 3B). Both nc1669Δ and nc1669FLΔ mutants lack another ncRNA, SPNCRNA.1670 (nc1670), that overlaps with nc1669 in its antisense strand (Figure 3D). First, the expression level of nc1670 did not significantly change in response to nitrogen starvation (iii, Figure 4A). In order to determine deletion of which ncRNA is responsible for the hyper-mating phenotype, two plasmids were constructed: the plasmid p[FL] contains and expresses the full length of the nc1669 gene, whilst the other one p[Δnc1670] expresses the partial nc1669 lacking the nc1670 region (Figure 5A). Those plasmids were introduced into the nc1669FLΔ strain, and we found that both transformants complemented the hyper-mating phenotype to a similar degree (Figure 5B, C). These results demonstrated that the lack of nc1669, but not of nc1670, was the cause of the hyper-mating phenotype.

Figure 5.

Figure 5.

nc1670 is not responsible for the hyper-mating phenotype. (A) Schematic diagrams of the insert regions expressed in plasmids. (B) Mating efficiency in the nitrogen-rich medium SD–3S + Nat. Average ± SEM (three trials). n >1400 cells. (C) Expression levels of nc1669 in the indicated strains were quantified and normalized with the act1 mRNA level. Average ± SEM (three trials).

In summary, we identified the ncRNA nc1669 as an uncharacterized functional ncRNA that possesses a CSM, which is required to negatively regulate initiation of untimely sexual differentiation.

Investigation of molecular function of nc1669

Those experiments demonstrate that the ncRNA nc1669 represses untimely mating in nutrition-rich conditions. We further examined how the ncRNA nc1669 represses hyper-mating. From the antisense strand of nc1669, tRNA encoding glutamic acid (tRNAGlu08, SPBTRNAGLU.08) is transcribed (Figure 3D). In the chromosome, the CSM of nc1669 overlaps with the tRNA gene on the antisense strand, and the CSM also displayed a tRNA-like structure comprising three hairpin loops (Figure 6A).

Figure 6.

Figure 6.

The CSM, but not its antisense tRNA, is required for the function of nc1669. (A) Secondary structure of the CSM in nc1669 predicted by RNAalifold. (B) Expression levels of tRNAGlu08 in the indicated strains were quantified and normalized with the act1 mRNA level. Average ± SEM (three trials). *p = 0.0491 (between the WT and nc1669Δ), 0.0378 (between the WT and nc1669FLΔ) (Student's t-test). (C) Relative expression levels of tRNAGlu08 in the indicated strains. Average ± SEM (three trials). (D) Mating efficiencies of WT, nc1669FLΔ and nc1669FLΔ+p[tRNA] cells in nitrogen-rich medium (SD–3S + Nat). Average ± SEM (three trials). (E) Regions cloned in expression plasmids are shown. The plasmid p[FL] contains the full length of nc1669, whereas p[ΔCSM] contains nc1669 lacking the CSM. (F) Relative expression levels of nc1669 in the indicated strains. Average ± SEM (three trials). (G) Mating efficiencies of the indicated strains on nitrogen-rich medium (SD–3S + Nat). Average ± SEM (three trials). n >1000 cells. (H) Mating efficiencies of WT, nc1669-shf1, nc1669Δ and nc1669FLΔ cells in nitrogen-rich medium (SD–3S). Average ± SEM (four trials). n >2500 cells. *p = 0.0252 (χ2 test).

This raises the question of whether the CSM of nc1669 may just reflect an inversion of the structure of overlapping tRNAGlu08. However, our screen did not highlight tRNAGlu08 as a conserved ncRNA. Even if an RNA forms a secondary structure, it does not necessarily mean that the antisense RNA also forms a secondary structure. This is partly due to U–G forming a base pair that may constitute a secondary structure, whereas its antisense bases C–A do not form a pair in RNAs. Indeed, the RNAz operation evaluated that the degree of structure conservation was extremely low in tRNAGlu08 (the RNA class probability P = 0.07), whereas it was quite high in the nc1669 coding strand (P = 0.94). This indicates that the CSM of nc1669 is not just a reflected structure of tRNAGlu08.

Several studies suggest that tRNAs and related factors regulate sexual differentiation in yeast (50,51). We therefore postulated that nc1669 may cooperate with tRNAGlu08 to repress untimely mating of cells. To test this possibility, we first monitored the amount of tRNAGlu08 in nc1669Δ and in nc1669FLΔ cells. In the partial deletion of nc1669 that still retains the sequence for the tRNAGlu08 gene, the expression level of tRNAGlu08 was reduced to a similar extent to that in nc1669FLΔ cells lacking the tRNA gene (Figure 6B). This indicates that the partial deletion of the nc1669 gene affected expression of the neighbouring tRNAGlu08 gene. It is possible that unexpected up-regulation of partial nc1669 RNAs (ii, Figure 4A) affected silencing of the antisense tRNAGlu08.

To further investigate whether the untimely mating phenotype of nc1669Δ is due to the loss of tRNAGlu08 expression, we prepared the nc1669FLΔ strain harbouring the plasmid p[tRNA] that contains the tRNAGlu08 gene. Although expression of tRNAGlu08 was complemented by plasmids in the nc1669FLΔ mutant (nc1669FLΔ+p[tRNA], Figure 6C), untimely mating was still observed (Figure 6D). Through these results, we conclude that the hyper-mating phenotype was due to the loss of nc1669per se, and not through regulation of tRNAGlu08. Having clarified that tRNAGlu08 is unrelated to the hyper-mating phenotype, we next investigated whether the CSM in nc1669 is essential for the function. When nc1669 without nc1670 was expressed in the nc1669FLΔ strain, untimely mating was complemented in the same way as with the full-length nc1669 (Figure 5). This indicates that the remaining region containing the CSM in nc1669 retains the ability to repress untimely mating. Finally, when we induced expression of nc1669 without the CSM in the nc1669FLΔ strain (nc1669FLΔ+p[ΔCSM], Figure 6E), we found that the exogenous construct did not retain the ability to repress the hyper-mating phenotype (Figure 6F, G).

To further strengthen this result, we also prepared the nc1669-shf1 mutant, in which the nucleotide sequence of the CSM region was randomly shuffled so that the original secondary structure of nc1669 was substantially altered by maximizing the possibility of loop formation among all the candidates (Supplementary Figure S9A). The nc1669-shf1 mutant showed the phenotype of untimely mating (Figure 6H), which may be ascribed to a moderate increase in ste11 expression (i, Supplementary Figure S9B). The phenotype is not due to RNA stability, as the level of the nc1669-shf1 RNA was not reduced by the mutation (ii, Supplementary Figure S9B). Therefore, we concluded that the CSM is required for the full function of nc1669 to inhibit untimely mating during vegetative growth, through repression of premature expression of ste11.

DISCUSSION

ncRNA screening based on sequence, structure and biological tests

This study presents screening methods employing informatics focusing on conservation of the RNA structure, which was confirmed to be effective to identify functional ncRNAs through the biological screening. We screened for structurally conserved RNAs from 1857 annotated (including uncharacterized) ncRNAs, to find 151 structurally conserved ncRNAs, 14 of which were chosen for investigating their functions in vivo. As a result, we succeeded in identifying an ncRNA nc1669 functional in repression of untimely sexual differentiation. On the other hand, meiRNA, a well-characterized ncRNA required for progression of meiosis, was not detected in the screen. This is probably due to the screening strategy: ncRNAs well conserved in sequence and in secondary structure among relatives are selected in silico, and ncRNAs that are not highly conserved such as meiRNA may not be hit in the screen.

Prior to searching for conserved secondary structures, selection based on sequence conservation of ncRNAs in S. pombe and related species was performed at the initial step (see Figure 1B). This skips a huge amount of computational calculation which would be necessary if all the ncRNAs were searched for structural conservation. We compacted the screen to avoid the complexity in the computational operations by extracting only the conserved regions of S. pombe ncRNAs. This sequence-based screen in practice discarded 804 out of 1857 ncRNAs (see Figure 1B), although some ncRNAs that are conserved in structure but not in sequence could have been dismissed.

The detection sensitivity of structurally conserved ncRNAs estimated from well-known ncRNAs was ∼25% (see Figure 1C, D). There are two possible explanations for the reason why ∼75% were not detected as structurally conserved ncRNAs. First, even though a group of RNAs share functions over species, their structures are not necessarily unique in exerting these functions. The other possibility is that the RNAz software recognized that their structures are not sufficiently conserved. RNAz comprehensively assesses sequence similarity, structural similarity, covariance base substitution and thermodynamic stability to evaluate structural conservation (47). For example, RNAs with low thermodynamic stability are not selected even if their overall structures are similar over species, even though those might be structurally stabilized possibly through chemical modifications in cells, as seen in tRNAs (52). Such ncRNAs might be dismissed in our screen.

For ncRNA screening, the FDR was estimated to be ∼6% (Figure 1B). This was calculated through random shuffling of the fragments extracted by structure-based multiple alignment. Although FDR values calculated in this way may be underestimates, this accuracy is sufficient for our screening because, in any case, candidates next proceed to biological screening to test their phenotypes. Besides that, the FDR could be suppressed by increasing the selection threshold of RNAz (the cut-off of the RNA class probability P) to ≥0.9, which would exclude many candidates. When the cut-off of P was raised to ≥0.9 in our analysis, the FDR indeed decreased to 1.3%, but the detection sensitivity also decreased to 16% (Table 1), demonstrating that there is a trade-off between the FDR and detection sensitivity.

Table 1.

FDR and detection sensitivity at different selection thresholds

P ≥0.5 P ≥0.9
FDR 6.1% 1.3%
Detection sensitivity 25.7% 16.0%

Although many types of CSMs were found in our screen, it appears hard to find a consensus tendency in their structures (Figure 1E). Further computational analyses would be effective to find the structural tendency. For example, it is possible to classify CSMs based on their structural similarity (clustering analysis). For instance, software such as GraphClust (53), DotAligner (24) and RNAscClust (54) could be applicable for structure-based clustering of RNAs.

nc1669 regulates initiation of sexual differentiation

In general, unconventional ncRNAs (excluding tRNA, rRNA, snRNA, snoRNA and miRNA) are classified into cis-acting and trans-acting, in which they function at proximal or distal gene loci, respectively. The majority of unconventional ncRNAs have been reported to function as cis-acting ncRNAs; in particular, they tend to regulate expression of nearby genes or overlapping antisense genes. nc1669 could regulate expression of SPBTRNAGLU.08 and SPNCRNA.1670 on the overlapping antisense strand, which respectively encode a tRNA and an ncRNA (Figure 3D). Phenotypic analyses may provide a hint to distinguish the way in which nc1669 acts.

The hyper-mating phenotype of nc1669FLΔ cells indicated that the ncRNA negatively regulates untimely initiation of sexual differentiation in nutrition-rich conditions (Figure 3AC). In fission yeast, initiation of sexual differentiation during vegetative growth is negatively regulated by Pat1 kinase (55–57). The homothallic (h90) pat1-114 temperature-sensitive mutant in rich media undergoes haploid meiosis at high temperature (34°C), whilst the mutant undergoes hyper-mating (but not haploid meiosis) at the semi-restrictive temperature of 29.5°C (55,56).

The cell fate of the pat1-114 mutant is known to be determined by Ste7 protein. The ste7Δ mutant cannot initiate mating under nitrogen starvation, meaning that Ste7 promotes mating. At 29.5°C, pat1-114 ste7Δ cells initiate haploid meiosis, in contrast to hyper-mating of pat1-114 cells (58). This suggests that Ste7 protein navigates pat1-114 cells towards hyper-mating by blocking initiation of haploid meiosis at the semi-restrictive temperature. Although the nc1669Δ mutant exhibited hyper-mating, it never initiated haploid meiosis unlike pat1-114. It is therefore possible that nc1669 is involved in cell fate determination together with Ste7 to modulate the function of Pat1.

Two other major factors are known to inhibit untimely initiation of sexual differentiation in the presence of nutrients: TORC1 (target of rapamycin complex 1) and PKA (cAMP-dependent protein kinase) (59). TORC1 is a protein complex that contains the conserved TOR kinase Tor2. TORC1 is essential for cell growth and repression of sexual differentiation, as the tor2 mutant undergoes mating in rich media (60–64). On the other hand, the cellular cAMP level varies depending on environmental conditions, and thus modulates the activity of its target, PKA (65,66). TORC1 and PKA signal transduction pathways synergistically repress the level of Ste11, the key transcription factor for sexual differentiation (49,59). In line with this, expression of ste11 in vegetative conditions was up-regulated in the nc1669Δ mutant (i, Figure 4A). It is therefore possible that nc1669 somehow represses expression of ste11 in rich media through the CSM, to block untimely initiation of sexual differentiation. The genetic loci of nc1669 and ste11 are close to each other on chromosome II (4109819–4107562 and 3983404–3987400, respectively), therefore it is possible that the nc1669 ncRNA may act in cis for repression of ste11 transcription. This might account for the fact that expression of nc1669 from plasmids did not fully complement the null mutant nc1669FLΔ (Figure 5AC). It is alternatively possible that nc1669 may function together with the rse1 ncRNA, which has been identified as a cis-acting long ncRNA that represses ste11 expression (33).

It is also suggested that tRNAs or their related factors regulate sexual differentiation in budding and fission yeast (50,51). It was therefore possible that nc1669 acts in cis, to promote expression or subsequent modification of the encompassing tRNA gene SPBTRNAGLU.08. The tRNAGlu08 gene on its own, however, appeared to be unrelated directly to sexual differentiation, because artificial expression of tRNAGlu08 in nc1669Δ cells failed to suppress the hyper-mating phenotype (Figure 6D).

The target genes of nc1669 could be limited to factors related to sexual differentiation, as the nc1669 knockout mutant did not display any other phenotypes in the biological screen. It remains elusive why each colony of nc1669 mutants displayed a variable degree of the hyper-mating phenotype (Figure 3A, B). As we excluded possibilities of additional mutations through backcrossing, we interpret this as follows: the degree of the hyper-mating phenotype might be variable when the nc1669 gene was first deleted in each cell. Since then, the degree of the phenotype in each cell could be maintained over generations inside single colonies in an epigenetic manner.

Perspective

We have constructed gene deletion mutants for each of 12 ncRNAs that do not overlap with protein-coding regions, and systematically examined their phenotype. Although only the mutant of SPNCRNA.1669 exhibited a remarkable phenotype, some other ncRNAs would be worth noticing. For instance, knockout of SPNCRNA.491 resulted in moderate sensitivity to the microtubule poison TBZ. Besides that, RNA-seq profiling indicated that SPNCRNA.510 is highly expressed (120 copies/cell) during vegetative growth, in comparison with the average transcript level (∼0.2–1 copy/cell) (67). Raman spectroscopy in combination with RNA-seq identified SPNCRNA.510 as a gene whose expression level varied widely depending on the cultivation conditions (68).

Systematic deletions of 141 S. pombe long intergenic ncRNAs have been conducted to present functional profiling of those ncRNAs under ∼140 environmental conditions (37). ncRNA475, 507, 515, 808 and 965 in our list overlapped with theirs regarding phenotypes in stress conditions and in cell cycle progression. This may suggest that ncRNAs conserved in sequence and in structure are often functional under some environmental conditions.

We propose that such existing data for functional profiling and expression levels in various conditions (37,69), as well as in the mutants of RNA metabolic pathways (70), can be combined with our in silico/in vivo screening, so that functional ncRNAs that react to environmental changes can be efficiently highlighted. It would also be informative as a next step to determine the in vitro or in vivo secondary structure of the elicited ncRNAs using methods such as DMS-MaPseq (71).

During the plate-based in vivo screen, only a few genes displayed phenotypes in the conditions we tested. Parameters and thresholds in the in silico procedures may be further modulated to broaden the scope of candidates for the next in vivo step. It is also possible to flexibly design experimental conditions of the in vivo screening, based on the cellular events of interest, or on the expression patterns of individual candidates. These combinatory multi-omics studies might efficiently pinpoint their phenotypes to reveal elusive functions of uncharacterized ncRNAs.

DATA AVAILABILITY

The datasets generated in this study can be downloaded as a supplementary datafile.

Supplementary Material

gkac825_Supplemental_Files

ACKNOWLEDGEMENTS

We thank Soichiro Yamanaka for advice on experimental design at the initial stage of this study. We also thank Mika Toya for advice on experimental design and Keigo Ide for advice on computational analyses. Computational analyses were performed on the NIG supercomputer at ROIS National Institute of Genetics.

Contributor Information

Yu Ono, Laboratory of Cytoskeletal Logistics, Department of Life Science and Medical Bioscience, School of Advanced Science and Engineering, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan.

Kenta Katayama, Laboratory of Cytoskeletal Logistics, Department of Life Science and Medical Bioscience, School of Advanced Science and Engineering, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan; Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan.

Tomoki Onuma, Laboratory of Cytoskeletal Logistics, Department of Life Science and Medical Bioscience, School of Advanced Science and Engineering, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan.

Kento Kubo, Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan; Bioinformatics Laboratory, Department of Electrical Engineering and Bioscience, School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo Shinjuku-ku, Tokyo 169-8555, Japan.

Hayato Tsuyuzaki, Laboratory of Cytoskeletal Logistics, Department of Life Science and Medical Bioscience, School of Advanced Science and Engineering, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan; Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan.

Michiaki Hamada, Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan; Bioinformatics Laboratory, Department of Electrical Engineering and Bioscience, School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo Shinjuku-ku, Tokyo 169-8555, Japan; Institute for Medical-oriented Structural Biology, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan.

Masamitsu Sato, Laboratory of Cytoskeletal Logistics, Department of Life Science and Medical Bioscience, School of Advanced Science and Engineering, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan; Institute for Medical-oriented Structural Biology, Waseda University, 2-2 Wakamatsucho, Shinjuku-ku, Tokyo 162-8480, Japan; Institute for Advanced Research of Biosystem Dynamics, Waseda Research Institute for Science and Engineering, Graduate School of Advanced Science and Engineering, Waseda University, 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

JSPS KAKENHI [JP25291041, JP15H01359, JP16H04787, JP16H01317, JP18K19347 and JP21H00261 to M.S.]; The Uehara Memorial Foundation; Ohsumi Frontier Science Foundation; Waseda University grants for Special Research Projects [2017B-242, 2017B-243, 2018B-222, 2019C-570, 2020R-038 and 2022C-164 to M.S.]. Y.O. is supported by JST SPRING [JPMJSP2128]. Funding for open access charge: Waseda University.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Dunham I., Kundaje A., Aldred S.F., Collins P.J., Davis C.A., Doyle F., Epstein C.B., Frietze S., Harrow J., Kaul R.et al.. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012; 489:57–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Frankish A., Diekhans M., Ferreira A.-M., Johnson R., Jungreis I., Loveland J., Mudge J.M., Sisu C., Wright J., Armstrong J.et al.. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019; 47:D766–D773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Iyer M.K., Niknafs Y.S., Malik R., Singhal U., Sahu A., Hosono Y., Barrette T.R., Prensner J.R., Evans J.R., Zhao S.et al.. The landscape of long noncoding RNAs in the human transcriptome. Nat. Genet. 2015; 47:199–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. FANTOM Consortium and the RIKEN PMI and CLST (DGT) A promoter-level mammalian expression atlas. Nature. 2014; 507:462–470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Peciña A., Smith K.N., Mézard C., Murakami H., Ohta K., Nicolas A.. Targeted stimulation of meiotic recombination. Cell. 2002; 111:173–184. [DOI] [PubMed] [Google Scholar]
  • 6. Hirose T., Mishima Y., Tomari Y.. Elements and machinery of non-coding RNAs: toward their taxonomy. EMBO Rep. 2014; 15:489–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Quek X.C., Thomson D.W., Maag J.L.V., Bartonicek N., Signal B., Clark M.B., Gloss B.S., Dinger M.E.. lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs. Nucleic Acids Res. 2015; 43:D168–D173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Brown C.J., Ballabio A., Rupert J.L., Lafreniere R.G., Grompe M., Tonlorenzi R., Willard H.F.. A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome. Nature. 1991; 349:38–44. [DOI] [PubMed] [Google Scholar]
  • 9. Zhao J., Sun B.K., Erwin J.A., Song J.-J., Lee J.T.. Polycomb proteins targeted by a short repeat RNA to the mouse X chromosome. Science. 2008; 322:750–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Rinn J.L., Kertesz M., Wang J.K., Squazzo S.L., Xu X., Brugmann S.A., Goodnough L.H., Helms J.A., Farnham P.J., Segal E.et al.. Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell. 2007; 129:1311–1323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Wang X., Arai S., Song X., Reichart D., Du K., Pascual G., Tempst P., Rosenfeld M.G., Glass C.K., Kurokawa R.. Induced ncRNAs allosterically modify RNA binding proteins in cis to inhibit transcription. Nature. 2008; 454:126–130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Gong S., Wang Y., Wang Z., Zhang W.. Computational methods for modeling aptamers and designing riboswitches. Int. J. Mol. Sci. 2017; 18:2442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Liu S.J., Horlbeck M.A., Cho S.W., Birk H.S., Malatesta M., He D., Attenello F.J., Villalta J.E., Cho M.Y., Chen Y.et al.. CRISPRi-based genome-scale identification of functional long noncoding RNA loci in human cells. Science. 2017; 355:eaah7111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Ulitsky I., Shkumatava A., Jan C.H., Sive H., Bartel D.P.. Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell. 2011; 147:1537–1550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Rhind N., Chen Z., Yassour M., Thompson D.A., Haas B.J., Habib N., Wapinski I., Roy S., Lin M.F., Heiman D.I.et al.. Comparative functional genomics of the fission yeasts. Science. 2011; 332:930–936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Ard R., Tong P., Allshire R.C.. Long non-coding RNA-mediated transcriptional interference of a permease gene confers drug tolerance in fission yeast. Nat. Commun. 2014; 5:5576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Gruber A.R., Findeiß S., Washietl S., Hofacker I.L., Stadler P.F.. RNAz 2.0: improved noncoding RNA detection. Pac. Symp. Biocomput. 2010; 2010:69–79. [PubMed] [Google Scholar]
  • 18. Pedersen J.S., Bejerano G., Siepel A., Rosenbloom K., Lindblad-Toh K., Lander E.S., Kent J., Miller W., Haussler D.. Identification and classification of conserved RNA secondary structures in the human genome. PLoS Comput. Biol. 2006; 2:e33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Yao Z., Weinberg Z., Ruzzo W.L.. CMfinder—a covariance model based RNA motif finding algorithm. Bioinformatics. 2006; 22:445–452. [DOI] [PubMed] [Google Scholar]
  • 20. Steigele S., Huber W., Stocsits C., Stadler P.F., Nieselt K.. Comparative analysis of structured RNAs in S. cerevisiae indicates a multitude of different functions. BMC Biol. 2007; 5:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Weinberg Z., Lünse C.E., Corbino K.A., Ames T.D., Nelson J.W., Roth A., Perkins K.R., Sherlock M.E., Breaker R.R.. Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions. Nucleic Acids Res. 2017; 45:10811–10823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Li S., Breaker R.R.. Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics. BMC Genomics. 2017; 18:785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Weinberg Z., Wang J.X., Bogue J., Yang J., Corbino K., Moy R.H., Breaker R.R.. Comparative genomics reveals 104 candidate structured RNAs from bacteria, archaea, and their metagenomes. Genome Biol. 2010; 11:R31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Smith M.A., Seemann S.E., Quek X.C., Mattick J.S.. DotAligner: identification and clustering of RNA structure motifs. Genome Biol. 2017; 18:244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. McCutcheon J.P., Eddy S.R.. Computational identification of non-coding RNAs in Saccharomyces cerevisiae by comparative genomics. Nucleic Acids Res. 2003; 31:4119–4128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Bähler J., Wu J., Longtine M.S., Shah N.G., McKenzie A., Steever A.B., Wach A., Philippsen P., Pringle J.R.. Heterologous modules for efficient and versatile PCR-based gene targeting in Schizosaccharomyces pombe. Yeast. 1998; 14:943–951. [DOI] [PubMed] [Google Scholar]
  • 27. Watanabe Y., Yamamoto M.. S. pombe mei2 + encodes an RNA-binding protein essential for premeiotic DNA synthesis and meiosis I, which cooperates with a novel RNA species meiRNA. Cell. 1994; 78:487–498. [DOI] [PubMed] [Google Scholar]
  • 28. Ding D.-Q., Okamasa K., Yamane M., Tsutsumi C., Haraguchi T., Yamamoto M., Hiraoka Y.. Meiosis-specific noncoding RNA mediates robust pairing of homologous chromosomes in meiosis. Science. 2012; 336:732–736. [DOI] [PubMed] [Google Scholar]
  • 29. Ding D.-Q., Okamasa K., Katou Y., Oya E., Nakayama J., Chikashige Y., Shirahige K., Haraguchi T., Hiraoka Y.. Chromosome-associated RNA–protein complexes promote pairing of homologous chromosomes during meiosis in Schizosaccharomyces pombe. Nat. Commun. 2019; 10:5598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Andric V., Nevers A., Hazra D., Auxilien S., Menant A., Graille M., Palancade B., Rougemaille M.. A scaffold lncRNA shapes the mitosis to meiosis switch. Nat. Commun. 2021; 12:770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Hirota K., Miyoshi T., Kugou K., Hoffman C.S., Shibata T., Ohta K.. Stepwise chromatin remodelling by a cascade of transcription initiation of non-coding RNAs. Nature. 2008; 456:130–134. [DOI] [PubMed] [Google Scholar]
  • 32. Oda A., Takemata N., Hirata Y., Miyoshi T., Suzuki Y., Sugano S., Ohta K.. Dynamic transition of transcription and chromatin landscape during fission yeast adaptation to glucose starvation. Genes Cells. 2015; 20:392–407. [DOI] [PubMed] [Google Scholar]
  • 33. Fauquenoy S., Migeot V., Finet O., Yague-Sanz C., Khorosjutina O., Ekwall K., Hermand D.. Repression of cell differentiation by a cis-acting lincRNA in fission yeast. Curr. Biol. 2018; 28:383–391. [DOI] [PubMed] [Google Scholar]
  • 34. Shah S., Wittmann S., Kilchert C., Vasiljeva L.. lncRNA recruits RNAi and the exosome to dynamically regulate pho1 expression in response to phosphate levels in fission yeast. Genes Dev. 2014; 28:231–244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Touat-Todeschini L., Shichino Y., Dangin M., Thierry-Mieg N., Gilquin B., Hiriart E., Sachidanandam R., Lambert E., Brettschneider J., Reuter M.et al.. Selective termination of lncRNA transcription promotes heterochromatin silencing and cell differentiation. EMBO J. 2017; 36:2626–2641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Yamashita A., Shichino Y., Yamamoto M.. The long non-coding RNA world in yeasts. Biochim. Biophys. Acta. 2016; 1859:147–154. [DOI] [PubMed] [Google Scholar]
  • 37. Rodriguez-Lopez M., Anver S., Cotobal C., Kamrad S., Malecki M., Correia-Melo C., Hoti M., Townsend S., Marguerat S., Pong S.K.et al.. Functional profiling of long intergenic non-coding RNAs in fission yeast. Elife. 2022; 11:e76000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Moreno S., Klar A., Nurse P.. Molecular genetic analysis of fission yeast Schizosaccharomyces pombe. Methods Enzymol. 1991; 194:795–823. [DOI] [PubMed] [Google Scholar]
  • 39. Sato M., Toya M., Toda T.. Visualization of fluorescence-tagged proteins in fission yeast: the analysis of mitotic spindle dynamics using GFP–tubulin under the native promoter. Methods Mol. Biol. 2009; 545:185–203. [DOI] [PubMed] [Google Scholar]
  • 40. Jiang M., Anderson J., Gillespie J., Mayne M.. uShuffle: a useful tool for shuffling biological sequences while preserving the k-let counts. BMC Bioinformatics. 2008; 9:192–192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Kawaguchi R., Kiryu H.. Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome. BMC Bioinformatics. 2016; 17:203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Li S., Toya M., Sato M.. Simplification of nutritional conditions in transformation procedures for genome editing with the CRISPR/Cas9 system for fission yeast. Gene. 2021; 784:145595. [DOI] [PubMed] [Google Scholar]
  • 43. Benson D.A., Cavanaugh M., Clark K., Karsch-Mizrachi I., Lipman D.J., Ostell J., Sayers E.W.. GenBank. Nucleic Acids Res. 2013; 41:D36–D42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Lock A., Rutherford K., Harris M.A., Hayles J., Oliver S.G., Bähler J., Wood V.. PomBase 2018: user-driven reimplementation of the fission yeast database provides rapid and intuitive access to diverse, interconnected information. Nucleic Acids Res. 2019; 47:D821–D827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Kersey P.J., Allen J.E., Allot A., Barba M., Boddu S., Bolt B.J., Carvalho-Silva D., Christensen M., Davis P., Grabmueller C.et al.. Ensembl genomes 2018: an integrated omics infrastructure for non-vertebrate species. Nucleic Acids Res. 2018; 46:D802–D808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Harris R.S. Improved pairwise alignment of genomic DNA. 2007; Pennsylvania State University; PhD Thesis. [Google Scholar]
  • 47. Washietl S., Hofacker I.L., Stadler P.F.. Fast and reliable prediction of noncoding RNAs. Proc. Natl Acad. Sci. USA. 2005; 102:2454–2459. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Katoh K., Misawa K., Kuma K., Miyata T.. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002; 30:3059–3066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Sugimoto A., Iino Y., Maeda T., Watanabe Y., Yamamoto M.. Schizosaccharomyces pombe ste11 + encodes a transcription factor with an HMG motif that is a critical regulator of sexual development. Genes Dev. 1991; 5:1990–1999. [DOI] [PubMed] [Google Scholar]
  • 50. Kamada Y. Novel tRNA function in amino acid sensing of yeast Tor complex1. Genes Cells. 2017; 22:135–147. [DOI] [PubMed] [Google Scholar]
  • 51. Otsubo Y., Matsuo T., Nishimura A., Yamamoto M., Yamashita A.. tRNA production links nutrient conditions to the onset of sexual differentiation through the TORC1 pathway. EMBO Rep. 2018; 19:e44867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Motorin Y., Helm M.. tRNA stabilization by modified nucleotides. Biochemistry. 2010; 49:4934–4944. [DOI] [PubMed] [Google Scholar]
  • 53. Heyne S., Costa F., Rose D., Backofen R.. GraphClust: alignment-free structural clustering of local RNA secondary structures. Bioinformatics. 2012; 28:i224–i232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Miladi M., Junge A., Costa F., Seemann S.E., Havgaard J.H., Gorodkin J., Backofen R.. RNAscClust: clustering RNA sequences using structure conservation and graph based motifs. Bioinformatics. 2017; 33:2089–2096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Iino Y., Yamamoto M.. Mutants of Schizosaccharomyces pombe which sporulate in the haploid state. Mol. Gen. Genet. 1997; 198:416–421. [DOI] [PubMed] [Google Scholar]
  • 56. Nurse P. Mutants of the fission yeast Schizosaccharomyces pombe which alter the shift between cell proliferation and sporulation. Mol. Gen. Genet. 1985; 198:497–502. [Google Scholar]
  • 57. Beach D., Rodgers L., Gould J.. RAN1 + controls the transition from mitotic division to meiosis in fission yeast. Curr. Genet. 1985; 10:297–311. [DOI] [PubMed] [Google Scholar]
  • 58. Matsuyama A., Yabana N., Watanabe Y., Yamamoto M.. Schizosaccharomyces pombe Ste7p is required for both promotion and withholding of the entry to meiosis. Genetics. 2000; 155:539–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Valbuena N., Moreno S.. TOR and PKA pathways synergize at the level of the Ste11 transcription factor to prevent mating and meiosis in fission yeast. PLoS One. 2010; 5:e11514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Álvarez B., Moreno S.. Fission yeast Tor2 promotes cell growth and represses cell differentiation. J. Cell Sci. 2006; 119:4475–4485. [DOI] [PubMed] [Google Scholar]
  • 61. Uritani M., Hidaka H., Hotta Y., Ueno M., Ushimaru T., Toda T.. Fission yeast Tor2 links nitrogen signals to cell proliferation and acts downstream of the Rheb GTPase. Genes Cells. 2006; 11:1367–1379. [DOI] [PubMed] [Google Scholar]
  • 62. Matsuo T., Otsubo Y., Urano J., Tamanoi F., Yamamoto M.. Loss of the TOR kinase Tor2 mimics nitrogen starvation and activates the sexual development pathway in fission yeast. Mol. Cell. Biol. 2007; 27:3154–3164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Weisman R., Roitburg I., Schonbrun M., Harari R., Kupiec M.. Opposite effects of Tor1 and Tor2 on nitrogen starvation responses in fission yeast. Genetics. 2007; 175:1153–1162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Rodríguez-López M., Gonzalez S., Hillson O., Tunnacliffe E., Codlin S., Tallada V.A., Bähler J., Rallis C.. The GATA transcription factor Gaf1 represses tRNAs, inhibits growth, and extends chronological lifespan downstream of fission yeast TORC1. Cell Rep. 2020; 30:3240–3249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Byrne S.M., Hoffman C.S.. Six git genes encode a glucose-induced adenylate cyclase activation pathway in the fission yeast Schizosaccharomyces pombe. J. Cell Sci. 1993; 105:1095–1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Maeda T., Watanabe Y., Kunitomo H., Yamamoto M.. Cloning of the pka1 gene encoding the catalytic subunit of the cAMP-dependent protein kinase in Schizosaccharomyces pombe. J. Biol. Chem. 1994; 269:9632–9637. [PubMed] [Google Scholar]
  • 67. Marguerat S., Schmidt A., Codlin S., Chen W., Aebersold R., Bähler J.. Quantitative analysis of fission yeast transcriptomes and proteomes in proliferating and quiescent cells. Cell. 2012; 151:671–683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Kobayashi-Kirschvink K.J., Nakaoka H., Oda A., Kamei K.F., Nosho K., Fukushima H., Kanesaki Y., Yajima S., Masaki H., Ohta K.et al.. Linear regression links transcriptomic data and cellular Raman spectra. Cell Syst. 2018; 7:104–117. [DOI] [PubMed] [Google Scholar]
  • 69. Leong H.S., Dawson K., Wirth C., Li Y., Connolly Y., Smith D.L., Wilkinson C.R.M., Miller C.J.. A global non-coding RNA system modulates fission yeast protein levels in response to stress. Nat. Commun. 2014; 5:3947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Atkinson S.R., Marguerat S., Bitton D.A., Rodríguez-López M., Rallis C., Lemay J.-F., Cotobal C., Malecki M., Smialowski P., Mata J.et al.. Long noncoding RNA repertoire and targeting by nuclear exosome, cytoplasmic exonuclease, and RNAi in fission yeast. RNA. 2018; 24:1195–1213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Zubradt M., Gupta P., Persad S., Lambowitz A.M., Weissman J.S., Rouskin S.. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo. Nat. Methods. 2017; 14:75–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Bernhart S.H., Hofacker I.L., Will S., Gruber A.R., Stadler P.F.. RNAalifold: improved consensus structure prediction for RNA alignments. BMC Bioinformatics. 2008; 9:474–474. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkac825_Supplemental_Files

Data Availability Statement

The datasets generated in this study can be downloaded as a supplementary datafile.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES