Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2006 Dec 12;103(51):19490–19495. doi: 10.1073/pnas.0607493103

Identification of a large noncoding RNA in extremophilic eubacteria

Elena Puerta-Fernandez *, Jeffrey E Barrick †,, Adam Roth , Ronald R Breaker *,†,‡,§
PMCID: PMC1748253  PMID: 17164334

Abstract

We have discovered a large and highly conserved RNA motif that typically resides in a noncoding section of a multigene messenger RNA in extremophilic Gram-positive eubacteria. RNAs of this class adopt an ornate secondary structure, are large compared with most other noncoding RNAs, and have been identified only in certain extremophilic bacteria. These ornate, large, extremophilic (OLE) RNAs have a length of ≈610 nucleotides, and the 35 representatives examined exhibit extraordinary conservation of nucleotide sequence and base pairing. Structural probing of the OLE RNA from Bacillus halodurans corroborates a complex secondary structure model predicted from comparative sequence analysis. The patterns of structural conservation, and its unique phylogenetic distribution, suggest that OLE RNA carries out a complex and critical function only in certain extremophilic bacteria.

Keywords: isoprenoid, riboswitch, ribozyme, superoperon


In recent years, a large number of small noncoding RNAs (13) have been identified in eubacteria (4), archaea (5), and eukaryotes (6). In many instances, novel noncoding RNAs in bacteria have been discovered by using bioinformatics search strategies (e.g., refs. 713) that take advantage of the conserved nucleotide sequences and secondary structures of these functional RNAs. Another productive way to identify numerous noncoding RNAs from bacteria is by cloning and sequencing small RNAs isolated from cell extracts (e.g., refs. 1316), and this approach is useful particularly for those examples that are not well conserved through evolution.

RNAs such as tRNAs, rRNAs, and some ribozymes have noncoding functions that have long been known to be central to RNA processing and protein synthesis mechanisms. However, it seems possible that additional noncoding RNAs will be found that perform fundamental biochemical tasks that to date have been assumed to be the exclusive province of protein factors. A number of noncoding RNAs discovered recently have proven to participate as the key components of gene regulation systems (e.g., refs. 1719). Additional examples of widespread noncoding RNAs in bacteria, such as 6S RNA (2022), the dual-function tmRNA (23), and noncoding portions of messenger RNAs such as T-Box elements (24) and riboswitches (2527) perform important gene control and molecular sensing tasks that are critical for cells to function normally. The existence of so many RNAs with atypical functions implies that some newly discovered noncoding RNAs might perform surprising and important roles in fundamental cellular processes. In this article, we describe features of a noncoding RNA element whose size, structural sophistication, and unique phylogenetic distribution are suggestive of a complex biological function.

Results

Identifying Ornate, Large, Extremophilic (OLE) RNAs by Using Bioinformatics.

We have previously used computer-aided search strategies that employ comparative sequence analysis (e.g., refs. 11 and 2830) to identify noncoding RNAs whose nucleotide sequences and secondary structures are conserved in bacteria (11, 28). These bioinformatics searches frequently uncover new classes of riboswitches that are noncoding portions of mRNAs that sense specific metabolites and regulate gene expression (2527).

However, these lists of structured RNA motifs routinely include elements that do not appear to be riboswitches (e.g., ref. 21). Many of these predicted RNA motifs carry only a few conserved nucleotides and have relatively simple secondary structure architectures compared with riboswitches (data not shown). Some of these unclassified motifs are likely to represent noncoding RNAs with functions that do not demand highly specific nucleotide sequences or structures to carry out their functions or otherwise are not well conserved among distantly related bacterial species. In addition, the genomic positions of the DNAs that code for these elements frequently are inconsistent with riboswitches, because the latter are almost always located in the 5′ untranslated regions of the genes they control.

One of the most striking examples of a new-found structured RNA element was encountered during a recent bioinformatics search of 116 bacterial genomes. Although the search revealed a number of new riboswitch candidates in α-proteobacteria (28), one RNA motif “hit” was not in this bacterial lineage and therefore was not described in the previous report. However, evidence for sequence conservation corresponding to this RNA was made available on the Breaker Laboratory Intergenic Sequence Server (BLISS; http://bliss.biology.yale.edu) (11, 28). Subsequently, we found that this RNA is listed as a candidate gene control element amongst a large number of hits provided as supplementary data for a bioinformatics report published by another laboratory (12).

We call members of this class “OLE RNAs” because they have a particularly ornate secondary structure, are distinctively large compared with the sizes of other well conserved noncoding RNAs, and have been found almost exclusively in species of eubacteria that are characterized as extremophilic. Initially, only the region of OLE RNA encompassed by P12.2 through P14.1 (Fig. 2) was identified. We iteratively used a combination of manual sequence comparison and computational sequence analysis (see Materials and Methods), which revealed the full extent of its size and increased the number of OLE RNA examples identified. This effort yielded 15 representatives (Fig. 1) that are ≈600 nt in length and are almost exclusively present in Firmicutes.

Fig. 2.

Fig. 2.

Conserved nucleotide sequence and secondary structure model for OLE RNAs. Each circle represents a nucleotide whose base identity is not conserved, and thick lines represent regions of variable nucleotide number and identity. Dashed lines indicate continuations of the RNA chain. Three of the 35 representatives carry a short hairpin loop in place of P9.3 (boxed). Other details are as defined in Fig. 1 legend.

Fig. 1.

Fig. 1.

Sequence alignment and predicted base-pairing interactions of 15 OLE RNA representatives. Nucleotides shaded in the same color identify stretches of complementary base pairs labeled P1 through P14. Black and gray labels designate 5′ and 3′ portions of the base-paired elements, respectively. The “Consensus” line was determined by including 20 OLE RNA representatives isolated by PCR amplification of environmental sequences [see supporting information (SI) Fig. 5 for a complete alignment]. Nucleotides that exhibit 97% or greater conservation (red) or that exhibit 75% or greater conservation (black) are depicted. R is G or A; Y is C or U; M is A or C; W is A or U; S is C or G; K is G or U; H is A, C, or U; D is A, G, or U; B is C, G, or U.

Expanding the Number of OLE RNA Representatives by Community Genomic PCR.

To further expand the collection of OLE RNA variants, we used primers corresponding to highly conserved portions of OLE RNA elements to PCR-amplify similar genomic regions from a diverse population of bacteria that were present in a hypersaline microbial mat (31). We chose this environmental sample as a potential source of OLE RNA variants because the sample presumably contains hundreds of different bacterial species that live in an environment that exhibits wide variations in pH and concentrations of salt, oxygen, and hydrogen sulfide (32, 33). The specific genomic DNA used in this study was isolated from the deepest layers of the microbial mat that are expected to be low in oxygen and high in hydrogen sulfide.

Primers for population PCR were designed to hybridize near the 5′ and 3′ termini of the OLE element (see SI Fig. 5), corresponding to two conserved regions of the gene that were identified upon examining the 15 representatives present in sequenced bacterial genomes (Fig. 1). The majority of the products generated by population PCR corresponded in length (≈580 nt) to that expected if known OLE RNA sequences were amplified with the same primers. Although cloning of the PCR products sometimes yielded individual constructs that were substantially larger or smaller than typical OLE RNA templates (data not shown), only those of similar size produced new representatives of the motif. Of 48 clones sequenced, 35 yielded OLE RNA sequences, of which 20 were unique (SI Fig. 5).

Elaborate Sequence and Structural Features of OLE RNAs.

The alignment of all OLE RNAs identified (SI Fig. 5) was used to define a consensus sequence (Fig. 1) and a secondary structure model (Fig. 2) for this class of RNAs. The alignment of these sequences from diverse organisms reveals that the RNA exhibits a strikingly high level of sequence and secondary structure conservation. Of the ≈610 nucleotides in OLE RNAs spanning from the start of P1 to its completion, nearly half (294 nucleotides) exhibit only one or no mutations relative to the defined consensus sequence. This exceptionally high level of sequence conservation, much of it present within bulges and loops of a complex secondary structure, indicates that OLE RNA forms a complex tertiary structure that is critical for its biological function.

Some of the conserved nucleotides also occur in segments of the RNA that are predicted to form base-paired stems. However, many of the nucleotides within predicted stems are not restricted in sequence identity. These relatively less conserved nucleotides almost always retain the ability to base-pair with their predicted partners. In total, 23 base-paired elements were defined, and the importance of all these stems for the function of the RNA is supported by frequent evidence of nucleotide covariation. This pattern of sequence conservation also suggests that OLE RNA is not a transposable element or a recently acquired genetic element. Unlike OLE RNAs, such elements exhibit far greater sequence similarity between representatives and they lack substantive structural elements with sequence covariation.

We performed a series of in-line probing assays (34) to provide further support for the secondary structure model presented (SI Figs. 6 and 7, and data not shown). In-line probing assays take advantage of the conformation-dependent rates of spontaneous RNA degradation. The resulting data can be used to identify RNA linkages that reside in unstructured regions because their greater flexibility typically permits greater levels of RNA cleavage compared with linkages within structured regions. The sequence from B. halodurans exhibits a pattern of spontaneous RNA cleavage (SI Fig. 7) that largely is consistent with the model derived from comparative sequence analysis (Fig. 2), again indicating that OLE RNAs form an exceptionally complex secondary structure.

OLE RNAs Are Embedded in a Large Operon.

In 14 of the 15 organisms known to carry OLE RNA, the gene encoding the RNA is located immediately 5′ relative to an ORF for a protein of unknown function. A second ORF further downstream is the dxs gene, which codes for 1-deoxy-d-xylulose-5-phosphate synthase. This compound is a metabolic intermediate for the biosynthesis of a number of fundamental metabolites including some coenzymes and lipids. With riboswitches, the proximity of genes for metabolic enzymes frequently can be used to rationalize the ligand specificity and the gene-control function of the RNA element. Therefore, we sought to further clarify the composition of the transcriptional unit that carries OLE RNA in B. halodurans.

To verify the expression of OLE RNA in B. halodurans, and to establish the length of the RNA transcript, we isolated total RNA from cells and performed a series of RT-PCRs (Fig. 3 A and B). DNA primer pairs also were designed to determine which of the adjacent genes (Fig. 3A) were expressed in a transcriptional unit that is contiguous with OLE RNA. For example, the presence of RT-PCR products in lanes 3–3′ and 4–4′ (Fig. 3B) confirms that at least some OLE RNA is expressed by B. halodurans grown under aerobic conditions in peptone-starch-carbonate medium (pH 10, 37°C). Furthermore, only DNA primer pairs 6–6′ and 10–10′ failed to amplify a product of the expected sizes, indicating that the RNA is expressed as part of a large polycistronic mRNA containing 10 other genes. This finding is consistent with the architecture of the corresponding genomic region in Bacillus subtilis, which is predicted to carry transcription terminators preceding and following the analogous operon (but lacking OLE RNA).

Fig. 3.

Fig. 3.

Expression of OLE RNA in B. halodurans. (A) Schematic representation of the OLE RNA gene and its surroundings in B. halodurans. Box identifies genes present in the same transcriptional unit. Arrows represent the primers used for RT-PCR, where primer pairs are represented as X and X′. Gene designations are as follows: a, BH2786, hypothetical protein; b, nusB, transcriptional antiterminator; c, folD, methylenetetrahydrofolate dehydrogenase; d, BH2783, exodeoxyribonuclease VII (large subunit); e, BH2782, exodeoxyribonuclease VII (small subunit); f, BH2781, geranyltranstransferase; g, BH2780, hypothetical protein; h, dxs, 1-deoxyxylulose-5-phosphate synthase; i, BH2778, hemolysin-like protein; j, ahrC, arginine repressor; k, recN, DNA repair and genetic recombination; l, spoIVB, intercompartmental signaling of pro-σK processing/activation. (B) Products of RT-PCR and PCR assays for various primer pairs separated by 1% agarose gel and stained with ethidium bromide. Lanes designated by + and − were PCR-amplified with or without reverse transcription, respectively. M, lanes with DNA markers at 100-bp size increments. (C) 5′ RACE mapping of termini produced by processing of OLE RNAs. Multiple asterisks reflect the number of clones examined that are terminated at the nucleotide indicated.

Two of the coexpressed genes near OLE RNA are involved in the biosynthesis of isoprenoids (1-deoxy-d-xylulose-5-phosphate synthase and geranyltranstransferase). However, the remaining genes have functions that appear to be unrelated to lipid metabolism. One gene (folD, encoding methylenetetrahydrofolate dehydrogenase) is involved in the metabolism of a coenzyme, another ORF encodes a hemolysin-like protein, three genes participate in DNA recombination (large and small subunits of exodeoxyribonuclease VII and the DNA repair and recombination protein RecN), and two genes are involved in transcriptional regulation (ahrC, encoding an arginine repressor, and NusB, encoding a transcription antiterminator protein).

In other organisms, the ORFs flanking the gene for OLE RNA are homologous, and their genomic arrangements are the same. The only known exception occurs in Desulfitobacterium hafniense, in which the OLE RNA gene resides at a completely different locus, although the remainder of the operon lacking OLE RNA is still present elsewhere in the genome. This suggests that OLE RNA is not exclusively a riboswitch control element that is needed to regulate genes within the large operon. Furthermore, the D. hafniense OLE RNA gene is flanked by intergenic regions that are >200 nt, and therefore the RNA might be expressed as a monocistronic transcript. These findings suggest that the biological function of the RNA does not require that it be coexpressed within the larger polycistronic mRNA.

If OLE RNA is functional as a separate RNA transcript, then it also might be processed from the larger operon transcript produced by most organisms. We examined this possibility by conducting 5′ RACE (rapid amplification of cDNA ends) (35, 36), which can be used to identify the 5′ termini of processed RNAs. We identified a dominant PCR product and determined the termini of these molecules by cloning and sequencing the resulting amplicons. Of 17 clones examined, 13 had 5′ termini that mapped to the P1 region of OLE RNA (Fig. 3C). Therefore, OLE RNAs appear to be processed to yield a dominant RNA product whose nucleotide sequence initiates at the same location as the highly conserved sequences and structures that are characteristic of OLE RNAs. Of the remaining clones, two terminate in the junction between P4 and P5, and two terminate near the loop of P5.

Co-occurrence of OLE RNA and a Predicted Membrane Protein.

The gene immediately downstream of the OLE RNA gene in all of the organisms but D. hafniense encodes a putative membrane protein whose function remains unknown (Fig. 3A; BH2780 in B. halodurans). In related organisms that lack OLE RNA (such as B. subtilis or Clostridium perfringens), this predicted membrane protein also is absent, even though the operon otherwise retains the same organization. In addition, we conducted a BLAST search with the BH2780 protein sequence and found that the only significant hits corresponded to organisms that also carry an OLE RNA gene. These findings suggest that a functional link might exist between OLE RNAs and BH2780-like proteins.

Comparative amino acid sequence analysis of the 15 BH2780 homologs from sequenced bacterial genomes reveals that considerable conservation occurs in several regions of the protein (SI Fig. 8). Some of these conserved amino acids reside within four predicted transmembrane domains (SI Figs. 8 and 9), although most of the conserved residues are present in two segments of the polypeptide that are predicted to be positioned on the cytoplasmic side of the membrane. Moreover, preliminary efforts to purify BH2780 protein from a transgenic Escherichia coli strain support cell membrane localization (data not shown).

Mutations in OLE RNA Do Not Alter Gene Expression.

OLE RNAs are distinctive in that they carry many more highly conserved nucleotides compared with known riboswitch classes (2527). OLE RNAs also are at least three times longer than the largest known riboswitch aptamers (Fig. 4A), again suggesting that OLE RNAs are not likely to function exclusively as gene control elements. However, the juxtaposition of OLE RNA elements with the same ORFs is similar to the arrangements commonly seen with riboswitches, and this caused us to consider the possibility that OLE RNAs were a new class of metabolite-sensing gene control element.

Fig. 4.

Fig. 4.

OLE RNA is large and complex, and has an unusual phylogenetic distribution compared with other noncoding RNAs that are widespread in bacteria. (A) Sizes and functions of some noncoding RNAs in B. halodurans. Bars represent size ranges when multiple representatives of an RNA class are known. The size of the group II ribozyme without its internal ORF is presented. Although tmRNAs are translated, their complex structure and multiple functions were reasons for comparison with noncoding RNAs. Note that 16S (≈1,500 nt) and 23S (≈2,900 nt) rRNAs (shaded) are too large for the length scale. (B) Some characteristics of bacteria that carry OLE RNA. Identities of bacterial species 1–15 are the organisms in the same order as they appear in Fig. 1. Solvent tolerance was assigned to those organisms that are known to use or produce high concentrations of organic solvents such as ethanol or acetate. All organisms listed are able to grow under anoxic conditions, although some organisms (e.g., B. halodurans) are facultative anaerobes. (C) The P12.2 through P14.1 region of OLE RNAs exhibits dual symmetry in conserved sequence and secondary structure as indicated by colored shading and boxes.

We performed a series of experiments to assess whether OLE RNA or its fragments could bind several candidate metabolites (e.g., 1-deoxy-d-xylulose-5-phosphate and compounds in related pathways) without success. Specifically, the results of in-line probing assays (SI Fig. 6 and data not shown) reflect the complex structure of OLE RNA but do not reveal any modulation of RNA structure when candidate metabolites are added. The ability of OLE RNA to influence the expression of a β-galactosidase reporter gene in transformed B. subtilis cells also was examined by using methods similar to those reported previously for assessing riboswitch function (37, 38).

Fusion of the entire noncoding portion of the intergenic region between the B. halodurans geranyltranstransferase and BH2780 coding regions (including OLE RNA) to a β-galactosidase gene carrying its own translation start site yields ≈100 Miller units when transformed B. subtilis cells are grown in either minimal or rich medium (data not shown). In contrast, fusions of this same noncoding region in each of three reading frames to a β-galactosidase gene lacking its own translation start site does not yield any reporter gene expression in rich medium, suggesting that OLE RNA lacks signals for translation initiation.

Most importantly, no changes were observed in gene expression between the wild-type transcriptional fusion and OLE RNA mutants that disrupt stems P4, P6.1, P12.1, and P12.2 (data not shown). Our results are consistent with the hypothesis that OLE RNA is unlikely to be a gene-control element. It is important to note, however, that the growth conditions examined might not permit the normal function of some riboswitch types, and that the transgenic organism used does not naturally carry OLE RNA.

Discussion

The data presented herein demonstrate that a large and elaborate noncoding RNA exists in B. halodurans (Fig. 4A) and a number of other Firmicutes that have several characteristics in common (Fig. 4B). Many Firmicutes form spores, so this characteristic of OLE-carrying bacteria is not as striking as the observation that these organisms almost uniformly exhibit anaerobic growth and extremophilic character. Although only 15 sequences were identified by searching >850 genomes that have been or are in the process of being sequenced, these genomes do not include many from extremophilic bacteria, which might have limited our bioinformatics findings.

Bioinformatics searches for OLE RNAs from published sequences derived from environmental samples isolated from the Sargasso sea (39), Minnesota soil and whale fall (40), and an acid mine drainage biofilm (41) also did not provide any additional hits. In contrast, cloning of OLE RNA sequences from genomic DNA isolated from a hypersaline mat produced numerous sequences. Although the bacterial mat is estimated to contain at least 750 different species, the apparent enrichment for OLE RNAs might be due to the presence of this RNA primarily in organisms that survive in extreme growth conditions (Fig. 4B).

Of the 15 OLE RNA representatives, 14 are present adjacent to ORFs that encode proteins important for isoprenoid biosynthesis. Changes in isoprenoid type and amount can alter membrane stability, and chemical variations of isoprenoids are favored by some anaerobic bacteria (42, 43). This characteristic localization of OLE RNA genes suggests that the RNA might contribute to biochemical processes that produce unique features of membranes in some extremophilic or anaerobic bacteria. Unfortunately, the operon carrying OLE RNA is populated with other ORFs whose protein products cannot easily be assigned to a single biological process (see legend of Fig. 3A). This operon could be considered a “superoperon,” which is a term that is sometimes used to denote long mRNAs that code for an eclectic mixture of proteins (4446). Therefore, the precise role of OLE RNA cannot easily be deduced from the functions of its coexpressed neighbors.

OLE RNAs are distinctly greater in length and complexity than almost all other noncoding RNAs that are known to be widespread in bacteria. All well conserved noncoding RNAs with sizes similar to OLE RNAs (Fig. 4A) are known to be ribozymes that carry out sophisticated chemical reactions (RNA hydrolysis, RNA splicing, protein synthesis). One structural element in particular, the P12.2 through P14.1 region (Fig. 4C), exhibits a remarkable symmetry in sequence and structure that includes two GNRA tetraloops. GNRA tetraloops are commonly occurring structural elements in structured RNAs that are intrinsically stable and often interact with proteins or other RNA elements (4749). The closest structural precedent to the P12.2 through P14.1 region is the tandem aptamer arrangement found in cooperative glycine-binding riboswitches (38), and perhaps this region of OLE RNA also binds two identical ligands.

The co-occurrence of a putative membrane protein of unknown function (BH2780) is suggestive of a functional link between the RNA and the protein. The protein sequence alignment reveals conserved stretches of basic residues (lysine and arginine) located at either the amino or carboxy termini of the proteins (SI Fig. 8), which is a characteristic of some RNA-binding proteins. Therefore, we are currently investigating the possibility that OLE RNA and the OLE-associated protein BH2780 form a stable functional complex. Although the true function of OLE RNA remains a mystery, the characteristics of this RNA suggest that it might have a complex function that is critical for many species of eubacteria that live in extreme environments.

Materials and Methods

Bioinformatics.

OLE RNA was initially discovered by examining the output of a computer-assisted search of 116 bacterial genomes for highly conserved RNAs (28). Conserved sequences residing in intergenic regions were identified by applying BLASTN analysis (50), and similar intergenic regions were pairwise aligned by using the FASTA package as reported (11, 28). Additional OLE RNAs were found by using BLASTN and filtered covariance model searches (29) of version 14 of the RefSeq database (51) and environmental sequences (3941).

PCR Amplification and Cloning of OLE RNA Sequences from Environmental Samples.

Community genomic DNA was obtained as a gift from Norman Pace (University of Colorado, Boulder), who extracted samples from a hypersaline microbial mat from Guerrero Negro, Baja California Sur, Mexico (31). DNAs used in this study were isolated from layers 8, 9, and 10 (other layers were not examined), which sample a microbial community located at a depth of 13–60 mm from the surface in a zone that is rich in hydrogen sulfide. One hundred to 200 ng of DNA from each layer was PCR-amplified with degenerate synthetic DNA primers 5′-GAAGRCGGGSCTAAAAATCCG and 5′-GGCWTAGGTCCAGYAGGWTTTCCC, where R represents A or G; Y represents T or C; S represents C or G; and W represents A or T (1:1 mixtures). The DNA primers (prepared by the Keck Foundation Biotechnology Resource Center at Yale University) correspond to highly conserved P4 and P12.2 regions of OLE RNA and were expected to produce ≈580-bp amplification products.

Each PCR contained 10 mM Tris·HCl (pH 8.3 at 23°C), 50 mM KCl, 2 mM MgSO4, 250 μM each dNTP, 2 μM each DNA primer, 100 mM betaine, 80 μg/ml BSA, and 1 unit of Taq DNA polymerase (Promega). Amplicons of ≈580 bp dominated the PCR products generated from layer 9 DNA, and these were isolated by agarose gel electrophoresis, purified from the gel by using a QIAquick kit (Qiagen), cloned into TOPO TA pCR2.1 vector, and transformed into E. coli TOP10 cells by using a protocol supplied by the vendor (Invitrogen). Plasmids from 48 colonies that carried an insert of ≈580 bp were purified, and the appropriate DNA segments were sequenced by the Keck Foundation Biotechnology Resource Center at Yale University.

RNA Expression Analysis by RT-PCR.

B. halodurans cells (ATCC 21591) were grown at 37°C in peptone-starch-carbonate medium (ATCC medium 542, which contains the following constituents: 5.0 g/liter peptone, 5.0 g/liter yeast extract, 1.0 g/liter K2HPO4, 0.2 g/liter MgSO4·7H2O, and 10 g/liter soluble starch). The medium was autoclaved at 121°C for 20 min, and subsequently 100 ml of sterile 10% Na2CO3 was added to yield a pH of 10.

Total RNA was isolated from cells grown overnight. Cells (3 OD600) were pelleted and resuspended in 100 μl of 3 mg/ml lysozyme in TE buffer [10 mM Tris·HCl (pH 7.5 at 25°C)/1 mM EDTA]. Cell lysis was facilitated by a freeze–thaw cycle before isolating RNA with 1 ml of TRIzol reagent (Invitrogen) by using the protocol supplied by the vendor. Five micrograms of total RNA was used as template for each 20-μl reverse transcription reaction with SuperScript II reverse transcriptase (Invitrogen) and an appropriate DNA primer, using the buffer and the protocol supplied by the vendor. Negative controls without enzyme were included for each sample or DNA primer pair used. Five microliters of each reverse transcription reaction was used as template for PCR with the DNA primers indicated for each experiment.

5′ RACE.

RNA was isolated as described above from B. halodurans cells grown overnight in peptone-starch-carbonate medium. Fifteen micrograms of RNA was used to determine cDNA ends using a protocol similar to that described previously (35, 36). The RNAs were prepared either with or without tobacco acid pyrophosphatase to distinguish primary transcript 5′ ends from internal 5′ processing sites. The DNA primer for cDNA synthesis, 5′-ACATGCCTTTGTGGATCGCCCTACTA, is complementary to the P6 region of OLE RNA. A second DNA primer for subsequent PCR amplification of cDNAs, 5′-GGACACTGACATGGACTGAAGGAGTA, is homologous to the adaptor RNA, 5′-GGACACUGACAUGGACUGAAGGAGUAGAAAC, used for 5′ RACE. PCR products that were detected both with and without tobacco acid pyrophosphatase treatment were purified by using a QIAquick kit and cloned by using a TOPO TA kit, and 17 clones were sequenced.

Supplementary Material

Supporting Figures

Acknowledgments

We are very grateful to Dr. Norman Pace for his generous gift of community genomic DNA from a hypersaline microbial mat. We thank members of the R.R.B. laboratory for helpful discussions, Wade C. Winkler and Izabela Puskarz for conducting preliminary in-line probing experiments, and Keith Corbino for assistance with comparative sequence analyses. R.R.B. was supported by National Institutes of Health Grant GM068819 and the Howard Hughes Medical Institute. J.E.B. was supported by a predoctoral fellowship from the Howard Hughes Medical Institute. E.P.-F. is a recipient of a postdoctoral fellowship from the Spanish Ministerio de Educación y Ciencia.

Glossary

Abbreviation

OLE

ornate, large, extremophilic

Footnotes

The authors declare no conflict of interest.

This article is a PNAS direct submission.

This article contains supporting information online at www.pnas.org/cgi/content/full/0607493103/DC1.

References

  • 1.Morey C, Avner P. FEBS Lett. 2004;567:27–34. doi: 10.1016/j.febslet.2004.03.117. [DOI] [PubMed] [Google Scholar]
  • 2.Hüttenhofer A, Schattner P, Polacek N. Trends Genet. 2005;21:289–297. doi: 10.1016/j.tig.2005.03.007. [DOI] [PubMed] [Google Scholar]
  • 3.Mattick JS, Makunin IV. Hum Mol Genet. 2006;15:R17–R29. doi: 10.1093/hmg/ddl046. [DOI] [PubMed] [Google Scholar]
  • 4.Gottesman S. Trends Genet. 2005;21:399–404. doi: 10.1016/j.tig.2005.05.008. [DOI] [PubMed] [Google Scholar]
  • 5.Dennis PP, Omer A. Curr Opin Microbiol. 2005;8:685–694. doi: 10.1016/j.mib.2005.10.013. [DOI] [PubMed] [Google Scholar]
  • 6.Costa FF. Gene. 2005;357:83–94. doi: 10.1016/j.gene.2005.06.019. [DOI] [PubMed] [Google Scholar]
  • 7.Rivas E, Klein RJ, Jones TA, Eddy SR. Curr Biol. 2001;11:1369–1373. doi: 10.1016/s0960-9822(01)00401-8. [DOI] [PubMed] [Google Scholar]
  • 8.Wassarman KM, Repoila F, Rosenow C, Storz G, Gottesman S. Genes Dev. 2001;15:1637–1651. doi: 10.1101/gad.901001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Klein RJ, Misulovin Z, Eddy SR. Proc Natl Acad Sci USA. 2002;99:7542–7547. doi: 10.1073/pnas.112063799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Rodionov DA, Vitreschak AG, Mironov AA, Gelfand MS. Nucleic Acids Res. 2003;31:6748–6757. doi: 10.1093/nar/gkg900. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Barrick JE, Corbino KA, Winkler WC, Nahvi A, Mandal M, Collins J, Lee M, Roth A, Sudarsan N, Jona I, et al. Proc Natl Acad Sci USA. 2004;101:6421–6426. doi: 10.1073/pnas.0308014101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Abreu-Goodger C, Ontiveros-Palacios N, Ciria R, Merino E. Trends Genet. 2004;20:475–479. doi: 10.1016/j.tig.2004.08.003. [DOI] [PubMed] [Google Scholar]
  • 13.Vogel J, Bartels V, Tang TH, Churakov G, Slagter-Jäger JG, Hüttenhofer A, Wagner EGH. Nucleic Acids Res. 2003;31:6435–6443. doi: 10.1093/nar/gkg867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Vogel J, Sharma CM. Biol Chem. 2005;386:1219–1238. doi: 10.1515/BC.2005.140. [DOI] [PubMed] [Google Scholar]
  • 15.Kawano M, Reynolds AA, Miranda-Rios J, Storz G. Nucleic Acids Res. 2005;33:1040–1050. doi: 10.1093/nar/gki256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hüttenhofer A, Vogel J. Nucleic Acids Res. 2006;34:636–646. doi: 10.1093/nar/gkj469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.McManus MT, Sharp PA. Nat Rev Genet. 2002;3:737–747. doi: 10.1038/nrg908. [DOI] [PubMed] [Google Scholar]
  • 18.Storz G, Altuvia S, Wassarman KM. Annu Rev Biochem. 2005;74:199–217. doi: 10.1146/annurev.biochem.74.082803.133136. [DOI] [PubMed] [Google Scholar]
  • 19.Gottesman S. Annu Rev Microbiol. 2004;58:303–328. doi: 10.1146/annurev.micro.58.030603.123841. [DOI] [PubMed] [Google Scholar]
  • 20.Trotochaud AE, Wassarman KM. Nat Struct Mol Biol. 2005;12:313–319. doi: 10.1038/nsmb917. [DOI] [PubMed] [Google Scholar]
  • 21.Barrick JE, Sudarsan N, Weinberg Z, Ruzzo WL, Breaker RR. RNA. 2005;11:774–784. doi: 10.1261/rna.7286705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Willkomm DK, Hartmann RK. Biol Chem. 2005;386:1273–1277. doi: 10.1515/BC.2005.144. [DOI] [PubMed] [Google Scholar]
  • 23.Zwieb C, Wower I, Wower J. Nucleic Acids Res. 1999;27:2063–2071. doi: 10.1093/nar/27.10.2063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Grundy FJ, Henkin TM. Front Biosci. 2003;8:d20–d31. doi: 10.2741/908. [DOI] [PubMed] [Google Scholar]
  • 25.Mandal M, Breaker RR. Nat Rev Mol Cell Biol. 2004;5:451–463. doi: 10.1038/nrm1403. [DOI] [PubMed] [Google Scholar]
  • 26.Soukup JK, Soukup GA. Curr Opin Struct Biol. 2004;14:344–349. doi: 10.1016/j.sbi.2004.04.007. [DOI] [PubMed] [Google Scholar]
  • 27.Winkler WC, Breaker RR. Annu Rev Microbiol. 2005;59:487–517. doi: 10.1146/annurev.micro.59.030804.121336. [DOI] [PubMed] [Google Scholar]
  • 28.Corbino KA, Barrick JE, Lim J, Welz R, Tucker BJ, Puskarz I, Mandal M, Rudnick ND, Breaker RR. Genome Biol. 2005;6:R70. doi: 10.1186/gb-2005-6-8-r70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yao Z, Weinberg Z, Ruzzo WL. Bioinformatics. 2006;22:445–452. doi: 10.1093/bioinformatics/btk008. [DOI] [PubMed] [Google Scholar]
  • 30.Weinberg Z, Ruzzo WL. Bioinformatics. 2006;22:35–39. doi: 10.1093/bioinformatics/bti743. [DOI] [PubMed] [Google Scholar]
  • 31.Ley RE, Harris JK, Wilcox J, Spear JR, Miller SR, Bebout BM, Maresca JA, Bryant DA, Sogin ML, Pace NR. Appl Environ Microbiol. 2006;72:3685–3695. doi: 10.1128/AEM.72.5.3685-3695.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.van Gemerden H. Mar Geol. 1993;113:3–25. [Google Scholar]
  • 33.Dupraz C, Visscher PT. Trends Microbiol. 2005;13:429–438. doi: 10.1016/j.tim.2005.07.008. [DOI] [PubMed] [Google Scholar]
  • 34.Soukup GA, Breaker RR. RNA. 1999;5:1308–1325. doi: 10.1017/s1355838299990891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Bensing BA, Meyer BJ, Dunny GM. Proc Natl Acad Sci USA. 1996;93:7794–7799. doi: 10.1073/pnas.93.15.7794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Argaman L, Hershberg R, Vogel J, Bejerano G, Wagner EG, Margalit H, Altuvia S. Curr Biol. 2001;11:941–950. doi: 10.1016/s0960-9822(01)00270-6. [DOI] [PubMed] [Google Scholar]
  • 37.Mandal M, Boese B, Barrick JE, Winkler WC, Breaker RR. Cell. 2003;113:577–586. doi: 10.1016/s0092-8674(03)00391-x. [DOI] [PubMed] [Google Scholar]
  • 38.Mandal M, Lee M, Barrick JE, Weinberg Z, Emilsson GM, Ruzzo WL, Breaker RR. Science. 2004;306:275–279. doi: 10.1126/science.1100829. [DOI] [PubMed] [Google Scholar]
  • 39.Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, et al. Science. 2004;304:66–74. doi: 10.1126/science.1093857. [DOI] [PubMed] [Google Scholar]
  • 40.Tringe SG, von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, Podar M, Short JM, Mathur EJ, Detter JC, et al. Science. 2005;308:554–557. doi: 10.1126/science.1107851. [DOI] [PubMed] [Google Scholar]
  • 41.Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, Richardson PM, Solovyev VV, Rubin EM, Rokhsar DS, Banfield JF. Nature. 2004;428:37–43. doi: 10.1038/nature02340. [DOI] [PubMed] [Google Scholar]
  • 42.Ourisson G, Rohmer M, Poralla K. Annu Rev Microbiol. 1987;41:301–333. doi: 10.1146/annurev.mi.41.100187.001505. [DOI] [PubMed] [Google Scholar]
  • 43.Weijers JWH, Schouten S, Hopmans EC, Geenevasen JAJ, David ORP, Coleman JM, Pancost RD, Sinninghe Damsté JS. Environ Microbiol. 2006;8:648–657. doi: 10.1111/j.1462-2920.2005.00941.x. [DOI] [PubMed] [Google Scholar]
  • 44.Tsui HC, Zhao G, Feng G, Leung HC, Winkler ME. Mol Microbiol. 1994;11:189–202. doi: 10.1111/j.1365-2958.1994.tb00300.x. [DOI] [PubMed] [Google Scholar]
  • 45.Lyngstaddas A, Løbner-Olesen A, Grelland E, Boye E. Biochim Biophys Acta. 1999;1472:376–384. doi: 10.1016/s0304-4165(99)00146-4. [DOI] [PubMed] [Google Scholar]
  • 46.Goodchild A, Raftery M, Saunders NFW, Guilhaus M, Cavicchioli R. J Proteome Res. 2004;3:1164–1176. doi: 10.1021/pr0498988. [DOI] [PubMed] [Google Scholar]
  • 47.Wool IG, Gluck A, Endo Y. Trends Biochem Sci. 1992;17:266–269. doi: 10.1016/0968-0004(92)90407-z. [DOI] [PubMed] [Google Scholar]
  • 48.Doherty EA, Batey RT, Masquida B, Doudna JA. Nat Struct Biol. 2001;8:339–343. doi: 10.1038/86221. [DOI] [PubMed] [Google Scholar]
  • 49.Nissen P, Ippolito JA, Ban N, Moore PB, Steitz TA. Proc Natl Acad Sci USA. 2001;98:4899–4903. doi: 10.1073/pnas.081082398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Nucleic Acids Res. 1997;25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Pruitt KD, Tatusova T, Maglott DR. Nucleic Acids Res. 2005;33:D501–D504. doi: 10.1093/nar/gki025. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Figures
pnas_0607493103_1.pdf (1,023.1KB, pdf)
pnas_0607493103_3.pdf (212.8KB, pdf)
pnas_0607493103_4.pdf (64.3KB, pdf)
pnas_0607493103_5.pdf (177KB, pdf)
pnas_0607493103_6.pdf (26.3KB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES