Skip to main content
RNA logoLink to RNA
. 2021 Jul;27(7):791–804. doi: 10.1261/rna.078747.121

Differential translation of mRNA isoforms transcribed with distinct sigma factors

Dylan M McCormick 1,5,#, Jean-Benoît Lalanne 1,2,4,5, Tammy CT Lan 3, Silvi Rouskin 3, Gene-Wei Li 1
PMCID: PMC8208050  PMID: 33927010

Abstract

Sigma factors are an important class of bacterial transcription factors that lend specificity to RNA polymerases by binding to distinct promoter elements for genes in their regulons. Here we show that activation of the general stress sigma factor, σB, in Bacillus subtilis paradoxically leads to dramatic induction of translation for a subset of its regulon genes. These genes are translationally repressed when transcribed by the housekeeping sigma factor, σA, owing to extended RNA secondary structures as determined in vivo using DMS-MaPseq. Transcription from σB-dependent promoters excludes the secondary structures and activates translation, leading to dual induction. Translation efficiencies between σB- and σA-dependent RNA isoforms can vary by up to 100-fold, which in multiple cases exceeds the magnitude of transcriptional induction. These results highlight the role of long-range RNA folding in modulating translation and demonstrate that a transcription factor can regulate protein synthesis beyond its effects on transcript levels.

Keywords: sigma factor, RNA structure, translation efficiency, B. subtilis, dual induction

INTRODUCTION

Transcriptional regulation by sigma factors is a hallmark of bacterial gene expression. Sigma factors bind to the core RNA polymerases, forming holoenzymes that can initiate transcription at sites with well-defined sequences. In Bacillus subtilis, most genes are transcribed by the housekeeping sigma factor σA, and some are additionally or exclusively transcribed by alternative sigma factors that control specific processes such as sporulation and motility (Haldenwang 1995; Helmann 2019). The alternative sigma factor σB is involved in the general stress response (Haldenwang and Losick 1979; Haldenwang 1995; Hecker et al. 2007; Price 2014) and initiates transcription for over two hundred genes with well-defined promoter sequences (Petersohn et al. 1999; Nicolas et al. 2012; Zhu and Stülke 2018). Induction of transcription leads to corresponding increases in RNA levels (Fig. 1A).

FIGURE 1.

FIGURE 1.

σB can activate both transcription and translation. (A) Models of transcriptional and translational induction for a transcriptional unit consisting of a promoter, coding sequence, and terminator. Stimuli are indicated with lightning bolts and ribosomes are colored in yellow. (B) RNA-seq, (C) ribosome profiling, and (D) apparent translation efficiency measurements from σB active and inactive conditions. σB regulon genes are indicated with black crosses (+), and subsets that are translationally activated or translationally repressed are highlighted in red and yellow, respectively (Rend-seq/ribosome profiling traces shown in Supplemental Fig. S4). Induced σB regulon genes without complex isoform architecture (Materials and Methods) are highlighted in cyan (Rend-seq/ribosome profiling traces for a subset shown in Supplemental Fig. S5). The dashed blue lines mark a 3.7-fold change in expression for visual reference. The dashed red line is an approximate threshold (2.7-fold) separating the population of translationally activated genes from those whose apparent TE does not markedly change. The insets show the cumulative distribution function (CDF) of fold change (FC) across the two conditions in each measurement, with separate CDFs for all genes (gray) and σB regulon genes (black). The percentage of genes in each group exceeding the chosen thresholds are listed on the right. Contributions of mRNA levels and translation to changes in protein synthesis rate among (E) translationally activated σB regulon genes and (F) a representative subset of induced σB regulon genes without complex isoform architecture. The fold change in protein synthesis rate is indicated by the height of the bars up to the arrows (arrows pointing down correspond to decreased translation efficiency). The light and dark gray regions denote the respective contributions of mRNA levels and translation, that is, fold-change in protein synthesis = (fold-change in mRNA level) × (fold-change in translation efficiency).

Translational regulation is also widespread in B. subtilis, although it is not typically thought to be controlled by transcription factors. Differential translation among genes in the same operon is largely driven by differences in mRNA secondary structure (Burkhardt et al. 2017) and is important for stoichiometric production of proteins in the same complex or metabolic pathway (Li et al. 2014; Lalanne et al. 2018). Translation can be additionally regulated by RNA-binding proteins or riboswitches that modulate the accessibility of the ribosome binding sites on the mRNA (Yakhnin et al. 2004, 2007; Breaker 2018). Operons are often controlled both transcriptionally and translationally (Fig. 1A), but seldomly by the same regulator (Hollands et al. 2012; Chauvier et al. 2017; Bastet et al. 2018).

Here we show that the transcription factor σB not only activates transcription, but also derepresses translation for a subset of its regulon genes. Using Rend-seq (end-enriched RNA-seq) (Lalanne et al. 2018) and ribosome profiling, we identified 12 genes whose apparent translation efficiency is increased substantially during σB activation. Most of them are transcribed from a σB-dependent promoter as well as at least one σA-dependent promoter, generating multiple transcript isoforms. By modulating σB activities, we found that each transcript isoform is associated with a distinct translation efficiency, with strongly repressed translation for σA-driven isoforms and elevated translation for σB-driven isoforms. These were orthogonally confirmed using a fluorescent reporter in a subset of examples. Both computational RNA folding and in vivo structural probing by DMS-MaPseq (Zubradt et al. 2016) indicate that the repressed σA-driven isoforms possess extended RNA secondary structures that sequester the ribosome binding sites. On the other hand, σB-driven isoforms have shorter 5′ UTRs that only include the regions corresponding to the second halves of the extended stem–loops in the longer σA-driven isoforms. Therefore, σB can simultaneously activate both transcription and translation by modulating isoform-specific secondary structures.

RESULTS

σB activates translation for a subset of its regulon

We first observed translational activation of σB regulon genes while profiling gene expression for a B. subtilis strain with an elevated general stress response during steady-state growth due to a genetic modification (Materials and Methods). Rend-seq and ribosome profiling data were generated to quantify the mRNA levels and protein synthesis rates, respectively, for both the wild type (“σB inactive”) and the genetically modified strain (“σB active”). The density of ribosome footprints for a gene provides an estimate for the relative rate of protein synthesis, provided that most ribosomes complete translation to yield full-length polypeptides and that the elongation time averaged across the entire transcript is constant (Ingolia et al. 2009; Li 2015; Li et al. 2014; Lalanne et al. 2018). Translation efficiency (TE), defined as the rate of protein production per mRNA molecule, can then be estimated from Rend-seq and ribosome profiling data by calculating the per-gene ribosome profiling coverage over Rend-seq coverage, that is, the ribosome density along a transcript (Li 2015; Li et al. 2014). Given σB’s well-understood role in transcription initiation, we expected its regulon members to change in mRNA levels and not TE.

Surprisingly, we found that several genes in the σB regulon showed far greater increases in protein synthesis rate (ribosome profiling) than in mRNA levels (Rend-seq). Between the two conditions, 25% of the annotated σB regulon genes (Zhu and Stülke 2018) had substantially different expression levels (56/225 with >3.7-fold change, Fig. 1B,C). Although most genes showed concordant changes in mRNA levels and protein synthesis rates, a notable population (21%, 12/56) exhibited a considerably greater increase in protein synthesis rates than mRNA levels (>2.7-fold), suggesting an increase in apparent translation efficiency (Fig. 1D). Among these translationally activated σB regulon genes, the magnitude of TE increases often exceeded the rise in mRNA levels, as most genes (75%, 9/12) exhibited a fold change in apparent TE accounting for >50% of the observed fold change in protein synthesis rate (Fig. 1E, compared to purely transcriptionally activated genes in Fig. 1F, Materials and Methods). Hence, translational induction contributes to the majority of the increase in expression of a subset of the σB regulon, suggesting a yet-unknown strategy for activating translation following σB induction.

σB-dependent alternative mRNA isoforms drive translational up-regulation

To identify the regulatory features that could drive translational up-regulation, we examined the transcript architecture of translationally activated σB regulon genes using Rend-seq. Through sparse fragmentation of input RNAs, Rend-seq enriches for the 5′ and 3′ boundaries of transcripts, enabling the detection and quantification of mRNA isoforms within operons (Lalanne et al. 2018). We observed that the translationally activated σB regulon genes were found in two or more different RNA isoforms (Fig. 2; Supplemental Figs. S1, S2). In particular, eight of the 12 genes shared a common operon architecture (Fig. 2; Supplemental Fig. S1): They were each transcribed both as a part of a polycistronic mRNA from a vegetative (σA-dependent) promoter, as well as from their own σB-dependent promoter. As illustrated by the representative genes ctc and yvrE, in the absence of stress, the primary isoform was the long, σA-dependent polycistronic mRNA (Fig. 2). In these transcripts, the ribosome footprint density for ctc and yvrE was much lower compared to their cotranscribed upstream genes. Under σB induction, additional 5′ ends appeared directly upstream of their coding sequences (Fig. 2, red arrows), consistent with the creation of alternative mRNA isoforms from σB-dependent transcription start sites (TSSs, Fig. 2 inset). Furthermore, these additional 5′ ends coincide with a sharp increase in ribosome footprint density over the gene bodies.

FIGURE 2.

FIGURE 2.

Translationally activated σB regulon genes display alternative mRNA isoforms. Rend-seq and ribosome profiling data from conditions with inactive/active σB for the operons containing (A) ctc and (B) yvrE (σB regulon genes are highlighted in red). Orange and blue bars represent 5′- and 3′-mapped read counts, respectively, and the black scale bars correspond to 0.5 kb. Fold changes (FC) for Rend-seq and ribosome profiling between σB active and σB inactive conditions are shown. Rend-seq 5′ ends corresponding to the σB-dependent transcription start sites are marked by red arrows. Putative σB-dependent promoter sequences are listed for each gene (+1 corresponds to the 5′ end of the σB-dependent isoform mapped by Rend-seq). The consensus sequences for the −10 and −35 regions of σB-dependent promoters are GTTTaa and GGG(A/T)A(A/T) (Petersohn et al. 1999). For ctc specifically, the additional 5′/3′ peak pair (*) in the σB active condition corresponds to a spurious RNase A cleavage site that likely occurred post-lysis. See also Supplemental Figures S1, S2.

We found that the short, σB-dependent isoforms of the translationally activated genes had significantly elevated translation efficiency compared to the corresponding long, σA-dependent isoforms. By estimating the relative prevalence of short and long isoforms across Rend-seq and ribosome profiling data sets with different levels of σB induction, we could infer the individual translation efficiency for each isoform (Fig. 3A; Supplemental Fig. S3, Materials and Methods), hereafter referred to as the isoform-specific translation efficiency. Compared to the σA-dependent isoforms, we found that the TE for the σB isoform was three- to 100-fold larger (median = 8.4, Fig. 3C). The σA isoform-specific TEs were all below the median TE across the transcriptome (5/8 in the bottom quartile, Fig. 3B), whereas the σB isoform-specific TEs were all above the median (7/8 in the top quartile). These results indicate that these σA-dependent isoforms are translationally repressed compared to most genes, whereas the σB-dependent isoforms are translationally activated.

FIGURE 3.

FIGURE 3.

σB-dependent mRNA isoforms have elevated TE. (A) Estimation of the isoform-specific TE for the short, σB-dependent and long, σA-dependent isoforms of ctc and yvrE. Each point is an experimental condition which has a different short isoform fraction and correspondingly different apparent TE (conditions shown in Fig. 2 are distinctly marked by a triangle and a square for σB inactive and active, respectively). Error bars correspond to standard deviations from subsampling bootstraps. The gray lines are linear regressions, whereas the dashed lines indicate estimates of isoform-specific TE calculated from the fits (Materials and Methods). Estimated isoform-specific TEs and errors (standard deviations) from a bootstrapped linear fit (Materials and Methods) are shown. (B) Distribution (beeswarm and boxplot, whiskers corresponding to 10th and 90th percentile) of apparent TE in σB inactive conditions. Translationally activated σB regulon genes (subset from Fig. 1 for which isoform-specific TE could be estimated, Materials and Methods) are marked (red). (C) Isoform-specific TE values inferred, with error bars as in A. (D) Fluorescent reporter assay for validating differential TE between isoforms. Protein expression (from fluorescence) and mRNA levels (from RT-qPCR) were measured for synthetic constructs (left) representing σA-dependent (L) and σB-dependent (S) isoforms. Relative (to S reporters) isoform-specific TE (right) was calculated by dividing relative protein expression by relative mRNA levels. Errors bars represent the standard deviation for technical replicates (n = 3 for fluorescence, n = 4 for RT-qPCR). See also Supplemental Figure S3.

In contrast to the σB regulon genes that display complex isoform architectures, genes with predominantly simple isoforms (highlighted in cyan in Fig. 1B–D, Materials and Methods, Rend-seq and ribosome profiling traces for a subset shown in Supplemental Fig. S5) showed largely unchanged translation efficiency (Fig. 1D), consistent with pure transcriptional activation (Fig. 1F). Interestingly, we also found three genes with robust transcriptional activation but little increase in protein synthesis rate (highlighted in yellow in Fig. 1B–D), corresponding to a large decrease in apparent translation efficiency in the σB active condition. Two of them (csbA and ywjA) exhibit an isoform arrangement that is converse to the translationally activated ones, with long, σB-dependent and short, σA-dependent isoforms (Supplemental Fig. S4). In the remaining case (yfkJ), the σB-dependent isoform has a truncated Shine–Dalgarno sequence, explaining a nearly 10-fold reduction in translation.

Focusing on the translationally activated σB regulon genes, we confirmed that TE was isoform-specific using fluorescent reporter constructs for ctc and yvrE (Fig. 3D). Specifically, we fused the fluorescent protein mNeonGreen to the carboxy-terminal end of each gene. For each fusion protein (ctc-mNeon, yvrE-mNeon), two distinct isoform-specific 5′ untranslated region (5′ UTR) variants were placed under the control of an ectopic promoter: (i) a short-isoform variant (S) that included each gene's native 5′ UTR corresponding to the σB-dependent isoform (as identified by Rend-seq), and (ii) a long-isoform variant (L) that included ∼100 additional nucleotides in the upstream region, which covers a portion of the coding sequence (CDS) of the upstream gene in the operon. Additionally, a start codon and nonnative ribosome binding site (RBS) were inserted directly upstream to enable translation of the truncated upstream CDS in the long-isoform variant. We then quantified the isoform-specific TE for each construct by normalizing relative protein expression (from fluorescence, Materials and Methods) to relative mRNA levels (from RT-qPCR, Materials and Methods). We found that these isoform-specific TEs qualitatively recapitulated our sequencing-based measurements (Fig. 3D). Specifically, the isoform-specific TE of the long-isoform constructs was roughly four- to sixfold lower than that of the short-isoform constructs, although any further decreases were difficult to quantify due to high background fluorescence. Nevertheless, inclusion of upstream sequence elements was sufficient to produce a large reduction in TE in the absence of the general stress response, which suggests that features in the σA-dependent isoforms can repress translation of the downstream σB regulon gene. Given the many functions that RNA secondary structure plays in shaping translation in bacteria (Lodish 1968; Kudla et al. 2009; Goodman et al. 2013; Li et al. 2014; Boël et al. 2016; Espah Borujeni and Salis 2016; Borujeni et al. 2017; Bhattacharyya et al. 2018; Cambray et al. 2018; Chiaruttini and Guillier 2020), we aimed to determine if structures in the σA-dependent isoforms could explain the observed impact on translation.

Extensive secondary structure is associated with translationally repressed, σA-dependent isoforms

To understand the possible role of mRNA secondary structures in setting isoform-specific translation efficiency, we computationally folded for the σA-dependent isoforms of ctc and yvrE. By mapping the putative Shine–Dalgarno (SD) sequences that recruit ribosome binding (Shine and Dalgarno 1974) onto minimum free energy (MFE) structures (Materials and Methods), we found that the majority of bases in the SD sequences were sequestered deep in stable, long-range structures (Fig. 4A). Strikingly, in both cases the σB-dependent 5′ ends were located inside the loop of the long RNA stems, such that the short, σB-generated isoforms have their 5′ UTRs entirely liberated from these extended secondary structures. The likelihood of SD sequestration was further supported by calculating the base-pairing probability for each position in the SD sequences, which revealed that the majority of positions were predicted to be paired across the thermodynamic ensemble (base-pairing probability ≈1). Given that SD sequences facilitate ribosome recruitment to mRNA to initiate translation, we expected that the presence of extensive secondary structure at and around these elements in the σA-dependent isoforms could plausibly repress translation of the downstream σB regulon gene. However, numerous factors in the cellular microenvironment affect the folding dynamics of RNAs, yielding in vivo structures that can differ substantially from their in silico counterparts (Rouskin et al. 2014; Spitale et al. 2015; Burkhardt et al. 2017; Mustoe et al. 2018). Accordingly, we decided to experimentally validate these computationally predicted structures for the σA-dependent isoforms of ctc and yvrE.

FIGURE 4.

FIGURE 4.

σA-dependent mRNA isoforms have extended secondary structures in vivo. (A) Minimum free energy (MFE) structures of the σA-dependent isoforms of ctc and yvrE near the ribosome binding site. The transcription start sites of σB-dependent isoforms (indicated with arrows), putative Shine–Dalgarno (SD) sequences, and start codons are highlighted in magenta, blue, and green, respectively. The stop codon of the upstream gene in the operon is indicated with an orange box. Computationally determined base-pairing probabilities for individual bases in the SD sequences are shown beside each structure. (B) DMS-MaPseq workflow for in vivo RNA structure determination of σA-dependent isoforms. (C) Cumulative distributions of the per-base mutational fractions for the σA-dependent isoforms of ctc and yvrE. Solid and dashed lines indicate conditions with and without DMS treatment. (D) DMS-constrained MFE structures of representative transcripts for σA-dependent isoforms of ctc and yvrE colored by normalized DMS-MaPseq mutation rate (DMS signal), where values correspond to increased base accessibility. Structured regions containing putative SD sequences are magnified.

We used the RNA structure probing method DMS-MaPseq to quantify mRNA structures in vivo. This technique involves treating RNA with the methylating agent dimethyl sulfate (DMS) to modify the base-pairing faces of accessible adenine and cytosine nucleobases. These modifications are subsequently encoded as mutations during reverse transcription using a specialized thermostable group II intron reverse transcriptase, generating a mutational signal that is detectable using high-throughput sequencing and has been shown to correlate with base accessibility (Zubradt et al. 2016; Tomezsko et al. 2020). We used a targeted version of DMS-MaPseq to specifically reverse transcribe and amplify the predicted structural region in the σA-dependent isoforms of ctc and yvrE following DMS treatment in vivo (Fig. 4B). After sequencing these amplicons, we examined the per-base mutational fractions against a control without DMS treatment and confirmed that DMS induced a characteristic signal at amino bases (Fig. 4C).

We refolded the σA-dependent isoforms of ctc and yvrE using DMS signal as a constraint (Materials and Methods) and found strong agreement with the earlier MFE structures (Fig. 4D). In particular, the regions containing the SD sequences were indeed highly structured in vivo and thus less accessible to the translation machinery. Additionally, the inferred RNA secondary structure in these regions was robust to the choice of the folding window size (Materials and Methods). These extended structures that occlude the ribosome binding sites are consistent with the repressed translation of the long, σA-dependent isoforms.

After validating the computationally predicted secondary structures by DMS-MaPseq, we extended our computational analysis to additional translationally activated σB regulon genes and found a consistent pattern of characteristic structures in the σA-dependent isoforms that sequester the sequence elements required for translation initiation (Fig. 5). Similar to ctc and yvrE, the remaining six genes for which we estimated isoform-specific TE all displayed MFE structures with the SD sequences located in extended stem–loops, and base-pairing probabilities indicated that the SD sequences were predominantly paired. These results suggest that these other σA-dependent long isoforms are also translationally repressed by extensive secondary structures, like the orthogonally validated instances of ctc and yvrE.

FIGURE 5.

FIGURE 5.

Long-range mRNA secondary structures in σA-dependent isoforms sequester sequence elements necessary for translation. MFE structures of transcripts for σA-dependent isoforms of other translationally activated σB regulon genes. The transcription start sites of σB-dependent isoforms (indicated with arrows), putative Shine–Dalgarno (SD) sequences, and start codons are highlighted in magenta, blue, and green, respectively. The stop codon of the upstream gene in the operon is indicated with an orange box. Computationally determined base-pairing probabilities for individual bases in the SD sequences are shown beside each structure.

Internal σB promoters liberate mRNA secondary structure and activate translation

In contrast to being repressed in the σA-dependent isoforms, genes in the short, σB-dependent isoforms had above-normal levels of translation (Fig. 3C). The single-nucleotide resolution afforded by Rend-seq data revealed a common feature among this group of genes: the TSSs of the σB-dependent isoforms were located within the extended secondary structure, often inside the loop region or in the downstream stem (Figs. 4A, 5, magenta and arrow). Therefore, σB-driven transcription generates isoforms with 5′ UTRs that lack the upstream portion of the stem sequestering the SD sequence in the long, σA-dependent isoforms, thereby freeing up the ribosome binding site for efficient translation initiation.

The prevalence of this regulation suggests an alternative configuration for σB-dependent gene expression that does not entirely rely on its canonical role as acting at the transcriptional level. In this operonic architecture, σA-driven promoters produce long, polycistronic mRNAs containing stable structures that impede translation initiation for σB regulon genes located at the ends of these transcripts (Fig. 6). When activated by stress, however, σB initiates transcription from alternative promoters directly upstream of its regulon genes, bypassing the inhibitory secondary structures and thereby promoting ribosome binding on these shorter mRNAs. The resulting increase in protein expression predominantly arises from a greater ribosome flux on these transcripts, demonstrating a novel function for σB in regulating gene expression in a simultaneous transcriptional-translational induction.

FIGURE 6.

FIGURE 6.

Model for σB-dependent translational activation. Schematic of a polycistronic operon containing a σA-dependent promoter (PA), σB-dependent promoter (PB), coding sequences, and a terminator. (A) In the absence of σB, transcription from PA produces a polycistronic mRNA molecule containing secondary structures that translationally repress the σB-dependent open reading frame (red) by sequestering its Shine–Dalgarno sequence (blue) and start codon (green). (B) PB becomes transcriptionally active upon σB induction, generating an mRNA isoform with an alternative transcription start site (magenta). Without the sequences necessary to form stable secondary structures, these transcripts can recruit ribosomes more efficiently to facilitate greater protein expression.

DISCUSSION

Bacterial sigma factors have long been studied as quintessential examples of gene regulation. Mechanistically, their direct effects on transcription initiation are well-understood (Paget 2015). We expand this view by demonstrating that the alternative sigma factor σB in B. subtilis can also influence translation initiation for several of its regulon genes. Translation activation is accomplished by modulating isoform-specific RNA secondary structures that normally impede translation initiation. mRNA isoform-specific modulation of translation efficiency has been noted before in other species such as the classic example of the galactose operon in Escherichia coli (Queen and Rosenberg 1981), as well as in eukaryotic systems (Floor and Doudna 2016), but not previously in the mechanistic context of alternative sigma factor induction. This multifunctional control of transcription and translation by a single trans-acting factor serves as a strategy to enable massive up-regulation of gene expression under specific cellular conditions.

The RNA secondary structures that impede translation in the long, σA-dependent isoforms often include regions of the upstream open reading frames (ORFs), raising questions about whether ribosomes translating the upstream ORFs may perturb the formation of the inhibitory secondary structures. Ribosomes are known to unwind structured regions of RNA as they elongate over coding sequences (Takyar et al. 2005; Wen et al. 2008). We observed that the stop codon of the upstream gene in the operon was typically located within the large stem–loop (Figs. 4, 5). This places ribosomes in proximity to the critical structural elements if the upstream message is actively translated. However, the results from our fluorescent reporter assay show that this configuration is not capable of fully restoring translation for either ctc or yvrE, despite the upstream gene being driven by an exogenous ribosome binding site with the consensus SD sequence. These data suggest that translation of the upstream gene is insufficient to fully derepress downstream genes, presumably because the ribosome footprint does not extend sufficiently downstream to disrupt RNA structure, or possibly due to rapid refolding of secondary structures after ribosomes pass through.

What is the utility of this regulatory strategy? From an evolutionary perspective, it seems counterintuitive for these genes to be found within larger operons despite being lowly translated. We could instead imagine a transcription terminator evolving in the region between the upstream genes in the operon and the σB-dependent TSS, which would ensure that the σB regulon gene is only induced upon activation of the general stress response. One potential explanation for multifunctional regulation is to allow fine-tuned expression of some σB regulon genes during nonstress conditions. On the one hand, this transcript architecture enables these genes to be transcribed during exponential growth. On the other hand, translation may have been selected against in the same condition to avoid fitness defects from overexpression. In this case, the observed basal expression from the σA-driven isoforms would be sufficient for their functions during nonstress conditions.

Another possible explanation for this regulatory strategy could be that small amounts of these proteins are necessary for coping with general stress during transitional periods where σB has already been activated but synthesis of general stress proteins is still ongoing. A fitness benefit would be challenging to identify except in specific conditions where the cell relies on one of these particular σB regulon genes for survival. Indeed, extensive phenotyping of σB-regulon member deletions under varied stresses has demonstrated the limited impact of individual proteins on cell fitness (Höper et al. 2005). Given a lack of characterization for most of these genes, we did not find functional commonalities among them beyond their association with general stress. Identifying the exact stress conditions in which this regulatory strategy confers a fitness advantage constitutes an interesting future direction.

Regardless of the function of σB-dependent translational activation, our characterization of sigma factor-mediated dual induction (Fig. 6) expands our view of the regulatory roles of sigma factors and reveals an intriguing principle of bacterial genome organization that could be further investigated in similar organisms. Indeed, inspection of the intergenic regions for the operons considered above among other Bacilli revealed evidence of conserved RBS sequestration in long isoforms which were liberated in short isoforms originating from putative σB promoters (Supplemental Data 1; Supplemental Table S2; Materials and Methods), suggesting functional roles of dual transcription-translation activation. Beyond the general stress regulon, this observed principle could be at play for other alternative sigma factors, as many of their regulon genes (33%, excluding σB-dependent genes) have at least half of their RNA levels derived from longer isoforms with upstream transcription start sites (Supplemental Data 2; Materials and Methods).

MATERIALS AND METHODS

Strains and strain construction

Strains used to generate new data in this study are listed in Table 1. Strains pertaining to matched Rend-seq and ribosome profiling data sets retrieved from GEO accession GSE162169 (Lalanne et al. 2021) are listed in Supplemental Table S1.

TABLE 1.

Strains and plasmids used in this study

graphic file with name 791tb01.jpg

To construct the strains for the fluorescent reporter assay, the genes ctc and yvrE (with variable upstream regions) were fused to the fluorescent protein mNeonGreen with a carboxy-terminal linker and cloned into pJBL044 under the constitutive promoter Pveg using Gibson assembly (New England Biolabs). The original pJBL044 plasmid was constructed using isothermal assembly from a fragment of pDR160 (Bose and Grossman 2011), a kanR cassette (Guérout-Fleury et al. 1995), levB homology regions, the Pveg promoter, and the strong efp terminator. The assembled plasmids were transformed into Mix and Go! E. coli DH5 Alpha Competent Cells (Zymo Research) per the manufacturer's instructions and isolated using a QIAprep Spin Miniprep Kit (QIAGEN). The fusion constructs were then integrated into BS168 at the levB locus using standard cloning techniques (Harwood and Cutting 1990), and successful recombinants were verified by colony PCR. All plasmids and recombinants (see Table 1) were further validated by Sanger sequencing (Quintara Biosciences).

Growth conditions

Unless indicated otherwise, all strains were grown at 37°C with shaking (250 rpm) in LB supplemented with carbenicillin (100 µg/mL for E. coli) and/or kanamycin (50 µg/mL for E. coli, 5 µg/mL for B. subtilis) when appropriate. For overnight cultures, LB liquid media was inoculated with single colonies from LB agar plates.

For matched Rend-seq/ribosome profiling data sets, strains were grown in LB or conditioned MCC medium (Parker et al. 2020; Lalanne et al. 2021) with various inducer (xylose, IPTG) concentrations (see Supplemental Table S1). For these data sets, cells were grown in exponential phase for at least 10 doublings before harvesting at OD600 ≈ 0.3.

Existing Rend-seq and ribosome profiling data sets

Matched Rend-seq and ribosome profiling data sets used to identify genes with increased TE (Fig. 1) and to estimate the short isoform fraction and corresponding apparent TE (Fig. 3; Supplemental Fig. S3) were obtained from GEO accession GSE162169 (Lalanne et al. 2021). These data sets display a range of σB activation due to a diverse set of genetic modifications and growth media. In particular, we previously identified that tuning the expression of translation termination factors RF2 and PrmC activates σB to varying degrees (Lalanne et al. 2021). For example, the σB active data presented in Figures 1 and 2 correspond to a CRISPRi knockdown of RF2, while σB inactive corresponds to wild-type. Importantly, although it is possible that different RF2 levels could affect translation initiation (and therefore TE) of genes (Lalanne et al. 2021), none of the genes that show a substantial increase in TE (Fig. 1) have a UGA stop codon or are cotranscribed with a gene ending with UGA stop (UGA being the stop codon cognate to RF2). Hence, the molecular causes of σB activation are distinct and independent from the mechanisms leading to translational activation characterized here.

Quantification of mRNA level, ribosome footprint density, and translation efficiency

From pile-up files (.wig format), the mRNA level corresponding to a gene was quantified as the 1% winsorized average read density for 3′-end mapped Rend-seq reads across the body of the gene, excluding a 40 nt region the start and end of the gene (start+40 nt to end−40 nt for averaging). Ribosome footprint read density was similarly calculated (1% winsorized density from start+40 nt to end−40 nt). Read densities were then normalized to rpkm (reads per kilobase per million reads mapped) using the total number of reads mapping to non-rRNA or tRNAs. For all genes, bootstrap (randomly sampling from the distribution of read counts per position across the body of the gene and calculating the corresponding resampled density and downstream quantities) was used as a measure of technical and read count variability. Error bars in Figure 3A and Supplemental Figure S3 correspond to the standard deviation across bootstrap subsamplings. Large error bars correspond to large counting noise (regions with few reads mapped). The translation efficiency of each gene was calculated as the ribosome profiling rpkm divided by the Rend-seq rpkm. Only genes with >50 reads mapped were considered to identify candidates with substantially elevated TE (Fig. 1).

Changes in translation efficiency for induced σB regulon genes

To identify genes with increased translation efficiency, we used a threshold of a >2.7-fold increase in apparent translation efficiency in the σB active vs. inactive conditions.

A >2.7-fold decrease in apparent TE was used to mark genes with repressed translation upon σB induction. Among the four genes with repressed translation (csbA, yfkJ, ywjA, sigB), the gene for sigB itself was excluded from further consideration because of the large overlap of its open reading frame with the upstream gene and to avoid interpretation difficulty arising from the translation termination defect in the RF2 knockdown condition. The three remaining genes are highlighted in yellow in Figure 1B–D, with Rend-seq and ribosome profiling traces shown in Supplemental Figure S4.

To identify induced σB regulon genes (>3.7-fold increase in Rend-seq and/or ribosome profiling read density and classification as a member of the annotated σB regulon, n = 56) with simple mRNA isoforms, we leveraged our deeply sequenced Rend-seq data set in LB (Lalanne et al. 2018) to exclude genes which were not the first gene of their mRNA (e.g., second gene in a polycistronic transcript), displayed multiple upstream transcription start sites in addition to the σB-dependent start site, or had substantial transcription from long isoforms. The resulting “simple isoform” σB regulon genes (highlighted in cyan in Fig. 1B–D) displayed a much more restricted range in fold-change in apparent TE across the σB active vs. inactive conditions. A subset have their transcriptional and translational responses separately displayed in Figure 1F, and Rend-seq/ribosome profiling traces are shown for some examples in Supplemental Figure S5.

The above analyses are summarized in Supplemental Table S3.

Determination of isoform-specific TE

To estimate the isoform-specific TE for particular genes, we assume that each individual mRNA isoform has a distinct TE, and that the total ribosome footprint density for a gene with multiple mRNA isoforms is equal to the sum of the isoform-specific TEs weighted by the mRNA abundance of each isoform.

Specifically, consider a two-gene operon with a long isoform that includes both gene 1 and gene 2 as well as a short isoform that contains gene 2 exclusively (schematically illustrated in Supplemental Fig. S3A). Denote overall mRNA level for genes 1 and 2 by m1 and m2, and overall ribosome footprint density r1 and r2 for the two genes, respectively. Further, let mshort, mlong be the level of the short and long isoform, respectively, and TE2,short, TE2,long the corresponding isoform-specific TE. Note that the overall mRNA level for genes 1 and 2 are related to isoform mRNA levels by: m1 = mlong and m2 = mshort + mlong. Hence, from the total mRNA level for both genes, we can infer the isoform mRNA levels: mlong = m1, and mshort = m2m1.

By assumption, for the ribosome density on gene 2: r2 = mshort TE2,short + mlong TE2,long. For the apparent TE of gene 2, we thus have:

TE2,apparent=r2m2=mshortTE2,short+mlongTE2,longm2. (1)

Reorganizing the equation leads to: TE2,apparent = fshort TE2,short + (1 − fshort)TE2,long, where we have defined the short isoform mRNA fraction for gene 2 as fshort:=mshortm2=m2m1m2. We note that for genes in conditions with little to no short isoform expression, the estimated short isoform fraction may be negative as a result of the technical variability in coverage.

Using RNA-seq data, fshort can be estimated from the mRNA levels on both genes as shown above as m2m1m2. Using ribosome profiling data from a matched sample, the apparent TE on gene 2, TE2,apparent, can be estimated as r2/m2. If our assumption of isoform-specific TE linearly contributing to overall ribosome density on gene 2 is valid, then a plot of TE2,apparent vs. fshort across samples with variable induction of the short isoform should display a linear relationship, with a y-intercept at fshort = 0 of TE2,long and a y-intercept at fshort = 1 of TE2,short as seen in Figure 3A and Supplemental Figure S3B.

To increase the precision of the determination of the short and long isoform mRNA levels, genomic regions used to quantify mRNA levels were extended beyond gene bodies using manually curated transcript boundaries determined by Rend-seq. mRNA levels and ribosome footprint densities were calculated as the average read densities across these regions in Rend-seq and ribosome profiling data, respectively.

To determine the uncertainty on estimated isoform-specific TEs, linear regressions were performed on bootstrap resampling estimates for the short isoform fractions and apparent TEs. Each bootstrap regression provided an estimated TElong and TEshort. The error bars for these quantities (Fig. 3A,C; Supplemental Fig. S3B) were taken as the standard deviations of these bootstrap estimates.

For the genes that do not belong to the group with the characteristic long, σA-dependent isoforms and short, σB-dependent isoforms (Supplemental Fig. S2), their alternative promoters are too close to allow proper quantification of isoform-specific abundances. These were thus excluded from the above analyses.

Fluorescent reporter assay

For the fluorescence reporter assay, the strains GLB115, GLB572, GLB573, GLB574, and GLB575 were grown to OD600 ≈ 1–2 and then back-diluted 200-fold into fresh media. Three technical replicates per culture were grown at 37°C for 12 h in a BioTek Synergy H1 microplate reader, and absorbance (600 nm) and fluorescence intensity (EX 485/20 nm, EM 520/20 nm) were measured every 5 min. Fluorescence was normalized by absorbance at each time point, and any background signal from cellular/media autofluorescence was removed by subtracting the mean normalized fluorescence values of the wild-type BS168 replicates. These quantities were then converted to relative values by normalizing proportionally to the signal for the S reporters.

For reverse transcription-qPCR (RT-qPCR), overnight cultures of the same strains were back-diluted to OD600 ≈ 2 × 10−4 and regrown for roughly 10 generations. At OD600 ≈ 0.3, 5 mL of cells were harvested and mixed with 5 mL of chilled methanol, spun down at 4°C for 10 min, and frozen at −80°C after removing the supernatant. Thawed cell pellets were treated with 100 µL of 10 mg/mL lysozyme in TE, and total RNA was extracted using an RNeasy Mini Kit (QIAGEN). DNA was removed using TURBO DNase (Thermo Fisher Scientific), and RNA was purified using isopropanol precipitation. Reverse transcription was performed using Random Hexamer Primer (Thermo Fisher Scientific) and M-MuLV Reverse Transcriptase (New England Biolabs) per the manufacturer's instructions. RNA levels were measured on a Roche LightCycler 480 Real-Time PCR system using two primer sets for mNeon and one primer set each for the loading controls gyrA and sigA (mNeon F1, mNeon R1, mNeon F2, mNeon R2, gyrA F, gyrA R, sigA F, sigA R, see Table 2). The fold change in mNeon RNA levels relative to the S reporters was calculated by taking the average of three technical replicates across each combination of primer sets (mNeon1/gyrA, mNeon1/sigA, mNeon2/gyrA, mNeon2/sigA).

TABLE 2.

Oligos used in this study

graphic file with name 791tb02.jpg

Isoform-specific TE was subsequently calculated by normalizing mean relative fluorescence by mean fold change in mNeon RNA levels, and the standard deviation was propagated from each measurement type.

RNA secondary structure prediction

Minimum free energy (MFE) structures were predicted using the RNAfold program of the ViennaRNA Package (Lorenz et al. 2011) with default parameters. Base-pairing probabilities were determined by constraining each position in a sequence individually as unpaired and then calculating the partition function from the ensemble free energy computed by RNAfold. The probability of each position being unpaired was calculated by dividing the partition function for the constrained sequence by the partition function for an unconstrained sequence, and the base-pairing probabilities were simply the probabilities of the complements. Putative Shine–Dalgarno (SD) sequences were identified as the region upstream of the start codon that forms the strongest duplex with the anti-Shine–Dalgarno (aSD, 5′-TCACCTCCT-3′) sequence in the 16S ribosomal RNA. RNA secondary structures determined using RNAfold were visualized using VARNA v3.93 (Visualization Applet for RNA) (Darty et al. 2009). The structures sequestering the ribosome binding sites shown in Figures 4 and 5 were confirmed to be robust to the specific regions computationally folded, both at the level of secondary structure and base-pairing probabilities of the SD sequences.

DMS-MaPseq

In vivo DMS treatment was performed as previously described (Zubradt et al. 2016; Burkhardt et al. 2017). Specifically, an overnight culture of BS168 was split two ways and back-diluted to OD600 ≈ 2 × 10−4. Following regrowth to OD600 ≈ 0.2, 15 mL of each culture was incubated at 37°C for 2 min with shaking (1000 rpm) after treating one with 750 µL of dimethyl sulfate (DMS, ∼5% final concentration). The reaction was stopped by adding 30 mL of chilled stop solution (30% β-mercaptoethanol, 25% isoamyl alcohol) to each sample, after which they were immediately transferred to ice and spun down at 4°C for 8 min. The cell pellets were washed with 8 mL of chilled wash solution (30% β-mercaptoethanol), resuspended in residual wash solution, and frozen at −80°C. Thawed cell pellets were treated with 100 µL of 10 mg/mL lysozyme in TE, and total RNA lysis buffer (10 mM EDTA, 50 mM sodium acetate) was added to 650 µL. Total RNA was extracted using hot acid-phenol:chloroform and isopropanol precipitation.

For library preparation, established protocols (Zubradt et al. 2016; Tomezsko et al. 2020) were followed. DNA was removed using TURBO DNase, and RNA >200 nt was purified using an RNA Clean & Concentrator-5 Kit per the manufacturer's instructions (Zymo Research). Ribosomal RNA was depleted using a MICROBExpress Bacterial mRNA Enrichment Kit (Thermo Fisher Scientific), and RNA >200 nt was again purified using an RNA Clean & Concentrator-5 Kit. Reverse transcription was performed at 64°C for 90 min using 70 ng of RNA from each sample and TGIRT-III (Ingex). The RT primers were specific to each gene (ctc R, yvrE R, see Table 2). The RT reaction was treated with 1 µL RNase H (New England Biolabs) and incubated at 37°C for 20 min to remove RNA. Roughly 1/10 of the resulting volume was used as template for a two-step PCR amplification with Phusion High-Fidelity DNA Polymerase (New England Biolabs) per the manufacturer's specifications, which was run for 15–25 cycles with the RT primer serving as the reverse primer (ctc F, yvrE F, see Table 2). PCR products (∼240–290 bp) were purified by gel extraction on an 8% TBE polyacrylamide gel (Thermo Fisher Scientific) and isopropanol precipitation. Samples with particularly low dsDNA concentrations (as measured on an Invitrogen Qubit 4 Fluorometer) were reamplified for 7–20 additional cycles and purified in the same manner. After adding adapters via PCR, the libraries were sequenced on an Illumina MiSeq (2 × 250 nt reads).

To determine the DMS signal, FASTQ files were processed and analyzed using the DREEM (Detection of RNA folding Ensembles using Expectation-Maximization clustering) pipeline with the “‐‐fastq” and “‐‐struct” options (Tomezsko et al. 2020). In brief, paired-end reads were filtered for quality and trimmed using FASTQC v.0.11.8 and TrimGalore 0.4.1, respectively. Reads were aligned to target sequences in the reference genome NC_000964.3 from the NCBI RefSeq database (O'Leary et al. 2016) using Bowtie2 2.3.4.1 with the options “‐‐local –no-unal ‐‐no-discordant ‐‐no-mixed –X 1000 -L 12.” Mapped reads were represented as bit vectors and clustered by their mutational signatures using the DREEM algorithm with standard parameters (Tomezsko et al. 2020). Per-base mutational fractions were initially quantified using the population-average fraction of mismatches and deletions. Following expectation-maximization (EM) clustering, the DMS reactivity was taken as the mutation rates of the bases in the cluster K = 1. After normalizing to the median of the top 5% of positions (with the upper limit set to 1.0), the DMS signal was used as a folding constraint for predicting RNA secondary structures with the program RNAstructure v.6.0.1 (Reuter and Mathews 2010). Additionally, the folding windows were expanded symmetrically by 50, 100, 150, and 200-nt in either direction to assess the robustness of the predicted folds. RNA secondary structures were visualized using VARNA v3.93 (Darty et al. 2009). The sequencing data sets for DMS-MaPseq are available online using the GEO accession GSE168393.

Conservation analysis

To assess whether σAB isoform configurations and RNA secondary structures in the long isoforms were conserved in other species from the Bacillus genus, we extracted and annotated intergenic regions for the σB regulon genes with marked TE induction displaying both short and long isoforms (Fig. 2; Supplemental Fig. S1; ctc, yvrE, yhdF, yocK, ydbD, yflH, yxaB, csbX). Analysis was restricted to Bacillus species (genus taken from the GTDB taxonomy [Parks et al. 2018]) within the reference and representative bacterial genomes from RefSeq (O'Leary et al. 2016) with an identified homolog of the rsbV-rsbW-sigB operon (Lalanne et al. 2021), leading to 26 species analyzed (listed in Supplemental Table S2). For all these Bacillus species, homologs of pairs of genes involving the σA–σB isoform configurations from B. subtilis (RefSeq protein accession listed in Supplemental Table S2) were taken as query for a blastp search (Ye et al. 2006) with an E-value cutoff of 1 × 10−7. Bacillus species in which the two genes were conserved, found in the same order, and separated by <400 bp were retained for further analysis.

Most operons considered were not widely conserved in the other Bacillus species, with all but the prs-ctc operon (conserved in 18/25 Bacillus) conserved in up to two more species (Bacillus atropheus and Bacillus amyloliquefaciens, see Supplemental Table S2). The sequences of the intergenic regions for conserved operons were extracted, folded for minimum free energy RNA secondary structures using RNAfold (Lorenz et al. 2011), and annotated for putative Shine–Dalgarno sequences (as described above), σB-dependent promoters (region of maximum local alignment using the Smith–Waterman algorithm to the consensus motif GTTTAA(13X)GGGWAW or GTTTAA(14X)GGGWAW using the nuc44 scoring matrix), and possible intervening terminators (from the list of high-confidence, bioinformatically identified terminators in Johnson et al. 2020). Supplemental Data 1 summarizes the analysis. We found evidence for RNA secondary structures sequestering Shine–Dalgarno sequences in the σA-dependent isoforms, and freed in the σB-dependent isoforms, for the overwhelming majority of intergenic regions in other Bacillus species (Supplemental Table S2; Supplemental Data 1) despite changes in sequence. Interestingly, most of the examples with weaker structures had evidence for a strong intrinsic terminator upstream of the σB-dependent promoter.

Analysis of isoform architecture for genes associated with other alternative sigma factors

To assess whether genes under the control of other alternative sigma factors also pervasively displayed transcription from longer upstream mRNA isoforms, we analyzed a deep Rend-seq data set from B. subtilis in LB (Lalanne et al. 2018). For all annotated promoters in DBTBS (Sierro et al. 2008) associated with alternative sigma factors with positional information (n = 319, excluding σB-dependent genes), we computed the Rend-seq read density in windows −115 to −15 (TSS-upstream) and +15 to +115 (TSS-downstream) positions relative to the annotated transcription start site of the promoter. Promoters with downstream read density lower than 0.1 reads/nt were not considered further (below expression cutoff, n = 122). For the remaining 197 promoters, we calculated the ratio of TSS-downstream to TSS-upstream read densities and retained instances in which the ratio was larger than 0.5 (i.e., 50% of the expression coming from a putative long isoform). To exclude cases where the signal arose from a separate upstream transcript as opposed to a bona fide long isoform, instances with a mapped 3′ end (3′ peak z-score >12) in the region −115 to +115 were further excluded. In fine, 33% (65/197) of expressed genes downstream from annotated alternative sigma factor promoters had evidence for most of their transcription coming from a long upstream isoform in LB (summarized in Supplemental Data 2). This suggests that the isoform-specific translational activation described in the present work could be applicable to other sigma factors in B. subtilis.

SUPPLEMENTAL MATERIAL

Supplemental material is available for this article.

Supplementary Material

Supplemental Material
supp_27_7_791__DC1.html (1.2KB, html)

ACKNOWLEDGMENTS

We thank members of the Li laboratory and the Rouskin laboratory for critical discussions. We thank S. McKeithen-Mead for providing the mNeon template. This research was supported by National Institutes of Health (NIH) grant R35GM124732, the National Science Foundation (NSF) CAREER Award, the Smith Odyssey Award, the Pew Biomedical Scholars Program, a Sloan Research Fellowship, the Searle Scholars Program, the Smith Family Award for Excellence in Biomedical Research, a Natural Sciences and Engineering Research Council of Canada (NSERC) doctoral Fellowship, and a Howard Hughes Medical Institute (HHMI) International Student Research Fellowship (to J.-B.L.).

Footnotes

REFERENCES

  1. Bastet L, Turcotte P, Wade JT, Lafontaine DA. 2018. Maestro of regulation: riboswitches orchestrate gene expression at the levels of translation, transcription and mRNA decay. RNA Biol 15: 679–682. 10.1080/15476286.2018.1451721 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bhattacharyya S, Jacobs WM, Adkar B V, Yan J, Zhang W, Shakhnovich EI. 2018. Accessibility of the Shine-Dalgarno sequence dictates N-terminal codon bias in E. coli. Mol Cell 70: 894–905.e5. 10.1016/j.molcel.2018.05.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Boël G, Letso R, Neely H, Price WN, Wong KH, Su M, Luff JD, Valecha M, Everett JK, Acton TB, et al. 2016. Codon influence on protein expression in E. coli correlates with mRNA levels. Nature 529: 358–363. 10.1038/nature16509 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Borujeni AE, Cetnar D, Farasat I, Smith A, Lundgren N, Salis HM. 2017. Precise quantification of translation inhibition by mRNA structures that overlap with the ribosomal footprint in N-terminal coding sequences. Nucleic Acids Res 45: 5437–5448. 10.1093/nar/gkx061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bose B, Grossman AD. 2011. Regulation of horizontal gene transfer in Bacillus subtilis by activation of a conserved site-specific protease. J Bacteriol 193: 22–29. 10.1128/JB.01143-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Breaker RR. 2018. Riboswitches and translation control. Cold Spring Harb Perspect Biol 10: a032797. 10.1101/cshperspect.a032797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Burkhardt DH, Rouskin S, Zhang Y, Li GW, Weissman JS, Gross CA. 2017. Operon mRNAs are organized into ORF-centric structures that predict translation efficiency. Elife 6: e22037. 10.7554/eLife.22037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cambray G, Guimaraes JC, Arkin AP. 2018. Evaluation of 244,000 synthetic sequences reveals design principles to optimize translation in Escherichia coli. Nat Biotechnol 36: 1005. 10.1038/nbt.4238 [DOI] [PubMed] [Google Scholar]
  9. Chauvier A, Picard-Jean F, Berger-Dancause JC, Bastet L, Naghdi MR, Dubé A, Turcotte P, Perreault J, Lafontaine DA. 2017. Transcriptional pausing at the translation start site operates as a critical checkpoint for riboswitch regulation. Nat Commun 8: 13892. 10.1038/ncomms13892 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chiaruttini C, Guillier M. 2020. On the role of mRNA secondary structure in bacterial translation. Wiley Interdiscip Rev RNA 11: e1579. 10.1002/wrna.1579 [DOI] [PubMed] [Google Scholar]
  11. Darty K, Denise A, Ponty Y. 2009. VARNA: interactive drawing and editing of the RNA secondary structure. Bioinformatics 25: 1974–1975. 10.1093/bioinformatics/btp250 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Espah Borujeni A, Salis HM. 2016. Translation initiation is controlled by RNA folding kinetics via a ribosome drafting mechanism. J Am Chem Soc 138: 7016–7023. 10.1021/jacs.6b01453 [DOI] [PubMed] [Google Scholar]
  13. Floor SN, Doudna JA. 2016. Tunable protein synthesis by transcript isoforms in human cells. Elife 5: e10921. 10.7554/eLife.10921 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Goodman DB, Church GM, Kosuri S. 2013. Causes and effects of N-terminal codon bias in bacterial genes. Science 342: 475–479. 10.1126/science.1241934 [DOI] [PubMed] [Google Scholar]
  15. Guérout-Fleury AM, Shazand K, Frandsen N, Stragier P. 1995. Antibiotic-resistance cassettes for Bacillus subtilis. Gene 167: 335–336. 10.1016/0378-1119(95)00652-4 [DOI] [PubMed] [Google Scholar]
  16. Haldenwang WG. 1995. The sigma factors of Bacillus subtilis. Microbiol Rev 59: 1–30. 10.1128/MR.59.1.1-30.1995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Haldenwang WG, Losick R. 1979. A modified RNA polymerase transcribes a cloned gene under sporulation control in Bacillus subtilis. Nature 282: 256–260. 10.1038/282256a0 [DOI] [PubMed] [Google Scholar]
  18. Harwood CR, Cutting SM. 1990. Molecular biological methods for Bacillus. Wiley, Chichester, NY. [Google Scholar]
  19. Hecker M, Pané-Farré J, Völker U. 2007. SigB-dependent general stress response in Bacillus subtilis and related gram-positive bacteria. Annu Rev Microbiol 61: 215–236. 10.1146/annurev.micro.61.080706.093445 [DOI] [PubMed] [Google Scholar]
  20. Helmann JD. 2019. Where to begin? Sigma factors and the selectivity of transcription initiation in bacteria. Mol Microbiol 112: 335–347. 10.1111/mmi.14309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hollands K, Proshkin S, Sklyarova S, Epshtein V, Mironov A, Nudler E, Groisman EA. 2012. Riboswitch control of Rho-dependent transcription termination. Proc Natl Acad Sci 109: 5376–5381. 10.1073/pnas.1112211109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Höper D, Völker U, Hecker M. 2005. Comprehensive characterization of the contribution of individual SigB-dependent general stress genes to stress resistance of Bacillus subtilis. J Bacteriol 187: 2810–2826. 10.1128/JB.187.8.2810-2826.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. 2009. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324: 218–223. 10.1126/science.1168978 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Johnson GE, Lalanne J-B, Peters ML, Li G-W. 2020. Functionally uncoupled transcription–translation in Bacillus subtilis. Nature 585: 124–128. 10.1038/s41586-020-2638-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kudla G, Murray AW, Tollervey D, Plotkin JB. 2009. Coding-sequence determinants of expression in Escherichia coli. Science 324: 255–258. 10.1126/science.1170160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lalanne JB, Taggart JC, Guo MS, Herzel L, Schieler A, Li GW. 2018. Evolutionary convergence of pathway-specific enzyme expression stoichiometry. Cell 173: 749–761.e38. 10.1016/j.cell.2018.03.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lalanne JB, Parker DJ, Li GW. 2021. Spurious regulatory connections dictate the expression-fitness landscape of translation factors. Mol Syst Biol 17: e10302. 10.15252/msb.202110302 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Li GW. 2015. How do bacteria tune translation efficiency? Curr Opin Microbiol 24: 66–71. 10.1016/j.mib.2015.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Li GW, Burkhardt D, Gross C, Weissman JS. 2014. Quantifying absolute protein synthesis rates reveals principles underlying allocation of cellular resources. Cell 157: 624–635. 10.1016/j.cell.2014.02.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lodish HF. 1968. Bacteriophage f2 RNA: control of translation and gene order. Nature 220: 345–350. 10.1038/220345a0 [DOI] [PubMed] [Google Scholar]
  31. Lorenz R, Bernhart SH, Höner zu Siederdissen C, Tafer H, Flamm C, Stadler PF, Hofacker IL. 2011. ViennaRNA Package 2.0. Algorithms Mol Biol 6: 26. 10.1186/1748-7188-6-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Mustoe AM, Busan S, Rice GM, Hajdin CE, Peterson BK, Ruda VM, Kubica N, Nutiu R, Baryza JL, Weeks KM. 2018. Pervasive regulatory functions of mRNA structure revealed by high-resolution SHAPE probing. Cell 173: 181–195.e18. 10.1016/j.cell.2018.02.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Nicolas P, Mäder U, Dervyn E, Rochat T, Leduc A, Pigeonneau N, Bidnenko E, Marchadier E, Hoebeke M, Aymerich S, et al. 2012. Condition-dependent transcriptome reveals high-level regulatory architecture in Bacillus subtilis. Science 335: 1103–1106. 10.1126/science.1206848 [DOI] [PubMed] [Google Scholar]
  34. O'Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, Rajput B, Robbertse B, Smith-White B, Ako-Adjei D, et al. 2016. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res 44: D733–D745. 10.1093/nar/gkv1189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Paget MS. 2015. Bacterial sigma factors and anti-sigma factors: structure, function and distribution. Biomolecules 5: 1245–1265. 10.3390/biom5031245 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Parker DJ, Lalanne JB, Kimura S, Johnson GE, Waldor MK, Li GW. 2020. Growth-optimized aminoacyl-tRNA synthetase levels prevent maximal tRNA charging. Cell Syst 11: 121–130.e6. 10.1016/j.cels.2020.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Parks DH, Chuvochina M, Waite DW, Rinke C, Skarshewski A, Chaumeil PA, Hugenholtz P. 2018. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol 36: 996. 10.1038/nbt.4229 [DOI] [PubMed] [Google Scholar]
  38. Petersohn A, Bernhardt J, Gerth U, Höper D, Koburger T, Völker U, Hecker M. 1999. Identification of σ(B)-dependent genes in Bacillus subtilis using a promoter consensus-directed search and oligonucleotide hybridization. J Bacteriol 181: 5718–5724. 10.1128/JB.181.18.5718-5724.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Price CW. 2014. General stress response in Bacillus subtilis and related gram-positive bacteria. In Bacterial stress responses, pp. 301–318. ASM Press, Washington, DC: http://doi.wiley.com/10.1128/9781555816841.ch17 [Google Scholar]
  40. Queen C, Rosenberg M. 1981. Differential translation efficiency explains discoordinate expression of the galactose operon. Cell 25: 241–249. 10.1016/0092-8674(81)90249-X [DOI] [PubMed] [Google Scholar]
  41. Reuter JS, Mathews DH. 2010. RNAstructure: software for RNA secondary structure prediction and analysis. BMC Bioinformatics 11: 129. 10.1186/1471-2105-11-129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Rouskin S, Zubradt M, Washietl S, Kellis M, Weissman JS. 2014. Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo. Nature 505: 701–705. 10.1038/nature12894 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Shine J, Dalgarno L. 1974. The 3′ terminal sequence of Escherichia coli 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites. Proc Natl Acad Sci 71: 1342–1346. 10.1073/pnas.71.4.1342 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sierro N, Makita Y, de Hoon M, Nakai K. 2008. DBTBS: a database of transcriptional regulation in Bacillus subtilis containing upstream intergenic conservation information. Nucleic Acids Res 36: D93–D96. 10.1093/nar/gkm910 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Spitale RC, Flynn RA, Zhang QC, Crisalli P, Lee B, Jung JW, Kuchelmeister HY, Batista PJ, Torre EA, Kool ET, et al. 2015. Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519: 486–490. 10.1038/nature14263 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Takyar S, Hickerson RP, Noller HF. 2005. mRNA helicase activity of the ribosome. Cell 120: 49–58. 10.1016/j.cell.2004.11.042 [DOI] [PubMed] [Google Scholar]
  47. Tomezsko PJ, Corbin VDA, Gupta P, Swaminathan H, Glasgow M, Persad S, Edwards MD, Mcintosh L, Papenfuss AT, Emery A, et al. 2020. Determination of RNA structural diversity and its role in HIV-1 RNA splicing. Nature 582: 438–442. 10.1038/s41586-020-2253-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wen JD, Lancaster L, Hodges C, Zeri AC, Yoshimura SH, Noller HF, Bustamante C, Tinoco I. 2008. Following translation by single ribosomes one codon at a time. Nature 452: 598–603. 10.1038/nature06716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Yakhnin H, Zhang H, Yakhnin A V, Babitzke P. 2004. The trp RNA-binding attenuation protein of Bacillus subtilis regulates translation of the tryptophan transport gene trpP (yhaG) by blocking ribosome binding. J Bacteriol 186: 278–286. 10.1128/JB.186.2.278-286.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Yakhnin H, Pandit P, Petty TJ, Baker CS, Romeo T, Babitzke P. 2007. CsrA of Bacillus subtilis regulates translation initiation of the gene encoding the flagellin protein (hag) by blocking ribosome binding. Mol Microbiol 64: 1605–1620. 10.1111/j.1365-2958.2007.05765.x [DOI] [PubMed] [Google Scholar]
  51. Ye J, McGinnis S, Madden TL. 2006. BLAST: improvements for better sequence analysis. Nucleic Acids Res 34: W6–W9. 10.1093/nar/gkl164 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Zhu B, Stülke J. 2018. SubtiWiki in 2018: from genes and proteins to functional network annotation of the model organism Bacillus subtilis. Nucleic Acids Res 46: D743–D748. 10.1093/nar/gkx908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zubradt M, Gupta P, Persad S, Lambowitz AM, Weissman JS, Rouskin S. 2016. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo. Nat Methods 14: 75–82. 10.1038/nmeth.4057 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material
supp_27_7_791__DC1.html (1.2KB, html)

Articles from RNA are provided here courtesy of The RNA Society

RESOURCES