Skip to main content
mSystems logoLink to mSystems
. 2021 Jul 13;6(4):e00526-21. doi: 10.1128/mSystems.00526-21

Promoter Architecture Differences among Alphaproteobacteria and Other Bacterial Taxa

Kevin S Myers a, Daniel R Noguera a,b, Timothy J Donohue a,c,
Editor: Karoline Faustd
PMCID: PMC8407463  PMID: 34254822

ABSTRACT

Much of our knowledge of bacterial transcription initiation has been derived from studying the promoters of Escherichia coli and Bacillus subtilis. Given the expansive diversity across the bacterial phylogeny, it is unclear how much of this knowledge can be applied to other organisms. Here, we report on bioinformatic analyses of promoter sequences of the primary σ factor (σ70) by leveraging publicly available transcription start site (TSS) sequencing data sets for nine bacterial species spanning five phyla. This analysis identifies previously unreported differences in the −35 and −10 elements of σ70-dependent promoters in several groups of bacteria. We found that Actinobacteria and Betaproteobacteria σ70-dependent promoters lack the TTG triad in their −35 element, which is predicted to be conserved across the bacterial phyla. In addition, the majority of the Alphaproteobacteria σ70-dependent promoters analyzed lacked the thymine at position −7 that is highly conserved in other phyla. Bioinformatic examination of the Alphaproteobacteria σ70-dependent promoters identifies a significant overrepresentation of essential genes and ones encoding proteins with common cellular functions downstream of promoters containing an A, C, or G at position −7. We propose that transcription of many σ70-dependent promoters in Alphaproteobacteria depends on the transcription factor CarD, which is an essential protein in several members of this phylum. Our analysis expands the knowledge of promoter architecture across the bacterial phylogeny and provides new information that can be used to engineer bacteria for use in medical, environmental, agricultural, and biotechnological processes.

IMPORTANCE Transcription of DNA to RNA by RNA polymerase is essential for cells to grow, develop, and respond to stress. Understanding the process and control of transcription is important for health, disease, the environment, and biotechnology. Decades of research on a few bacteria have identified promoter DNA sequences that are recognized by the σ subunit of RNA polymerase. We used bioinformatic analyses to reveal previously unreported differences in promoter DNA sequences across the bacterial phylogeny. We found that many Actinobacteria and Betaproteobacteria promoters lack a sequence in their −35 DNA recognition element that was previously assumed to be conserved and that Alphaproteobacteria lack a thymine residue at position −7, also previously assumed to be conserved. Our work reports important new information about bacterial transcription, illustrates the benefits of studying bacteria across the phylogenetic tree, and proposes new lines of future investigation.

KEYWORDS: bioinformatics, motif prediction, promoters, transcription

INTRODUCTION

The transcription of DNA to RNA is vital for cells to grow, develop, and respond to environmental stimuli. In addition, transcription can be harnessed to engineer microbial strains that sequester carbon dioxide or nitrogen gas or produce fuels, chemicals, pharmaceuticals, and materials (1, 2). Despite the central role of transcription, most of our understanding of this process in bacteria has come from analysis of a few species. We are interested in understanding transcription across bacteria, especially those with activities of agricultural, environmental, medical, and industrial importance.

Transcription depends on RNA polymerase (RNAP) binding in a site-specific manner to promoter DNA sequences, and it can be regulated by protein factors and small-molecule ligands (3, 4). Biochemical, structural, and genetic studies have shown that bacterial RNAP is typically composed of core subunits (β, β′, ω, and α2) and a specificity subunit (σ) that allows a holoenzyme to recognize promoter DNA sequences (5, 6). Bacteria often contain multiple σ factors, each directing a different RNAP holoenzyme to recognize specific DNA sequences in promoters (710), thereby allowing individual RNAP holoenzymes to transcribe different gene sets (1116). It is also well known that differences in promoter sequences can influence either RNAP holoenzyme binding or subsequent steps in transcription initiation (79, 1720).

While bacteria often contain multiple σ factors (21, 22), a housekeeping σ factor (often called σ70 in reference to the molecular weight of the Escherichia coli protein) is known or predicted to recognize promoters containing a −35 and a −10 motif in bacteria (numbers indicate the number of bases upstream of the transcription start site [TSS]) (2127). Housekeeping σ factors are typically essential proteins because of the number and diversity of genes that require σ70 for transcription (25). In E. coli, highly conserved nucleotides in the −35 (−35TTG−33) and -10 (−12TATAAT−7) elements of the promoter carry out base-specific interactions with amino acids in the σ70 subunit (5, 22), suggesting that these or similar protein-DNA contacts are important in the process of transcription across the bacterial phylogeny (25).

Despite this, there is emerging evidence that some aspects of the paradigm for σ70-dependent transcription developed in the gammaproteobacterium E. coli may not apply across the bacterial phylogeny. For example, in several groups of bacteria, maximal transcription from σ70-dependent ribosomal (rRNA) operons in vitro requires CarD, an RNAP-binding protein that interacts with promoter DNA just upstream of the −10 hexamer (2831). CarD family members are found in several bacterial phyla or classes, including Actinomycetes, Aquificae, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Spirochaetes, and Thermodesulfobacteria, as well as Alphaproteobacteria and Deltaproteobacteria (30, 32). When tested, carD is often an essential gene and its loss or depletion reduces expression of many genes, supporting a hypothesis that CarD has a role in controlling expression of numerous cellular processes and pathways (3336).

Indeed, it was recently shown that promoters for the σ70-dependent rRNA operons and other genes from the alphaproteobacterium Rhodobacter sphaeroides were activated by CarDRsp in vitro (96). In addition, the requirement for CarD for maximal transcription was found to be due, in part, to the absence of a thymine at the last position of the −10 motif in these promoters (96). Further, analyses of published genome-scale transcription start site (TSS) data sets suggest differences in the −10 motif recognized by the housekeeping σ factor between R. sphaeroides, a few other Alphaproteobacteria, and a limited number of well-studied bacteria (37, 96). In this work, we leveraged published genome-wide TSS data from organisms across the bacterial phylogeny to predict the sequences of −35 and −10 elements of their σ70-dependent promoters (3744). We also used the lack of a thymine at position −7 to predict the cellular functions encoded by genes that are known or predicted to require CarD to activate transcription across the bacterial phyla. Based on this, we propose that differences in promoter architecture and the presence of CarD play a previously unrecognized role in transcription of essential and other genes in Alphaproteobacteria and possibly other bacterial phyla.

RESULTS

TSS-based bioinformatic predictions of promoter sequences across the phylogeny.

Past genome-scale predictions of bacterial promoter sequences have often been made without knowing the precise site of transcription initiation by RNA polymerase that is provided by transcription start site (TSS) data sets. High-throughput TSS sequencing (TSS-seq) has recently been used to provide genome-scale coordinates for the initiation sites for bacterial transcription units at the nucleotide level (37, 38). We gathered publicly available genome-scale TSS-seq data sets from a variety of bacterial species in order to predict bacterial promoter sequences and to ask if they were similar across the phylogeny. The published bacterial TSS-seq data sets that we analyzed included several thousand experimentally determined TSSs from Actinobacteria (40, 41), Alphaproteobacteria (37, 38, 43), Betaproteobacteria (39), Firmicutes (44), and Gammaproteobacteria (42) (Table 1).

TABLE 1.

Summary of TSS data analyzed and annotated σ factors for organisms studied

Species Phylum or class No. of:
Genome accession no.
TSS analyzed TSS conditions Annotated σ factors
Mycobacterium smegmatis Actinobacteria 2,139 3 26 NC_008596.1
Streptomyces coelicolor Actinobacteria 3,570 44 63 NC_003888.3
Caulobacter crescentus Alphaproteobacteria 2,726 8 16 NC_011916.1
Novosphingobium aromaticivorans Alphaproteobacteria 2,301 2 12 NC_007794.1
Rhodobacter sphaeroides Alphaproteobacteria 3,015 2 17 NC_007493.2
Zymomonas mobilis Alphaproteobacteria 3,940 3 5 NZ_CP023715.1
Burkholderia cenocepacia Betaproteobacteria 6,598 1 20 AM747720.1
Bacillus subtilis Firmicutes 5,601 1 18 NC_000964.3
Escherichia coli Gammaproteobacteria 2,702 3 7 NC_000913.3

Since σ70 homologs are needed for transcription of most genes (25, 27), the majority of the TSSs in these genome-scale data sets are predicted to be derived from σ70-dependent promoters. Thus, we hypothesized that including all identified TSSs in each bacterium in our analysis would allow the discovery of overrepresented DNA sequences that corresponded to the σ70 promoter sequence. Further, the large number of TSSs analyzed predicts that any activity of alternative σ factors on genome-wide transcription would be minimal. Additionally, most of the TSS data sets were generated under environmental conditions where the activity of many alternative σ factors was predicted to be low, thereby limiting the influence of other promoter motifs on our analysis (Table 1). Using MEME as a motif discovery tool (45, 46), we were able to identify upstream motifs with DNA sequence similarity to −35 and −10 promoter elements that would be predicted to be recognized by the respective housekeeping σ factor based on what is known about σ70-promoter interactions in other well-studied bacterial species (Fig. 1; Table S1). Upon examining the overrepresented sequences, we found that motifs identified by MEME are generally conserved across this diverse set of bacterial species and have significant sequence identity to −35 and −10 promoter elements that are known to represent binding sites for σ70-containing RNAP in well-studied organisms.

FIG 1.

FIG 1

Sequences of MEME-predicted σ70-dependent −35 and −10 promoter elements for individual bacterial species. Indicated are the organism’s name, the taxonomic group it belongs to, and the most likely sequences of −35 and −10 elements predicted by the MEME motif finder upstream of the published TSS-seq data for this organism. The last column on the right indicates the percentage of the predicted σ70-dependent −10 promoter elements that contain a thymine at position −7 (−7T) relative to the published TSS.

TABLE S1

Summary of MEME-identified predicted σ70-dependent promoters upstream of TSSs. Download Table S1, XLSX file, 0.7 MB (732.2KB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

However, the motifs identified by MEME also make predictions about some base-specific differences in promoter elements that are recognized by σ70 RNAP holoenzyme across the bacterial phylogeny. For example, TSS-seq data sets show that over 60% of the −10 elements of the Alphaproteobacteria R. sphaeroides, Zymomonas mobilis (37, 96), and Caulobacter crescentus (37, 43, 96) lack the thymine at position −7 that is found in over 95% of the E. coli and Bacillus subtilis σ70-dependent promoters (Fig. 1). In an extreme case, the MEME motif finder indicates that >80% of the σ70-dependent promoters in Novosphingobium aromaticivorans, another member of the Alphaproteobacteria, lack a conserved thymine at position −7 (Fig. 1). The low frequency of a thymine at position −7 in Alphaproteobacteria σ70-dependent promoters contrasts with the prediction made by the MEME motif finders that >90% of the analogous transcription units contain a thymine at this position in Actinobacteria, Betaproteobacteria, Firmicutes, and Gammaproteobacteria for which genome-scale TSS data sets are publicly available (Fig. 1).

The MEME motif finder also identified the overrepresented TTG DNA sequences within potential −35 elements that were well conserved across most of the same bacterial species (Fig. 1; Table S1). However, this DNA sequence was not found to be overrepresented in Betaproteobacteria and Actinobacteria (Fig. 1).

To analyze additional features of these putative σ70-dependent promoters across the bacterial phylogeny, we also used the predictions of the MEME motif finder to determine the distance between the −10 and −35 elements and calculate the number of bases between the downstream end of the −10 element and the experimentally determined TSS. This analysis resulted in the same most frequent distance between −35 and −10 elements of the σ70-dependent promoters (17 bp) and between the TSS and the −10 element (6 bp) across the species for which genome-scale TSS data are available, suggesting that these features of promoters are conserved across the bacterial phylogeny (Table S1).

To test if the prediction of sequences of these promoter elements was influenced by the motif-finding algorithm, we analyzed the same genome-wide TSS-seq data sets using Delila-PY (47), a Python-based pipeline interface with the Delila software suite (48) that uses a different motif-finding method than MEME. The use of Delila-PY to identify DNA sequence motifs upstream of the experimentally mapped TSSs predicted that >60% of the −10 elements of Alphaproteobacteria σ70-dependent promoters lacked a T at position −7 in all four species examined (Fig. 2; Table S2). This analysis also revealed that, while there are some species-specific differences in the base distribution at position −7, when averaged across all Alphaproteobacteria, there is a roughly equal percentage of each base at position −7 (Fig. 3A). In agreement with MEME, Delila-PY predicted that >80% of the non-Alphaproteobacteria −10 elements for σ70-dependent promoters contained a thymine at position −7 (Fig. 2; Table S2). Further, Delila-PY predicted a lack of conservation for the −35 element for Mycobacterium smegmatis, Streptomyces coelicolor, and Burkholderia cenocepacia (Fig. 2). In total, there was a 60% to 95% agreement between the predicted σ70 −35 and −10 promoter elements identified by both the MEME motif finder and Delila-PY across the data sets we analyzed. The fact that similar predictions about the DNA sequences of σ70-dependent promoters are made when using either MEME or Delila-PY suggest that the observed differences were not due to a specific algorithm but are likely to be biologically relevant.

FIG 2.

FIG 2

Sequences of Delila-PY-predicted σ70-dependent −35 and −10 promoter elements for individual bacterial species. Indicated are the organism’s name, the taxonomic group it belongs to, and the most likely sequences of −35 and −10 elements predicted by Delila-PY upstream of the published TSS-seq data for this organism. The last column on the right indicates the percentage of the predicted σ70-dependent −10 promoter elements that contain a thymine at position −7 (−7T) relative to the published TSS.

FIG 3.

FIG 3

Distribution of −7A (green), −7C (blue), −7G (yellow), and −7T (red) bases within −10 elements upstream of all TSSs as identified by Delila-PY (A), upstream of genes with at least one homolog in the Database of Essential Genes (DEG) (B), and upstream of genes identified as essential using transposon insertion data sets (C) for the bacterial species indicated. The average distribution across all Alphaproteobacteria is indicated as “All Alphaproteobacteria.” The number of TSSs identified for each data set is listed below each group of bars.

TABLE S2

Summary of Delila-PY-identified predicted σ70-dependent promoters upstream of TSSs. Download Table S2, XLSX file, 1.1 MB (1.1MB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

In sum, this analysis illustrates that new insights can be obtained from a comparative analysis of genome-scale TSS-seq data across bacterial species. Specifically, it makes a prediction that the −35 and −10 elements of σ70-dependent promoters vary significantly across the bacterial phylogeny. In addition, since the MEME motif finder and Delila-PY identified similar overrepresented sequences proximal to the TSS, we conclude that the nucleotide present at position −7 relative to the TSS of σ70-dependent promoters is a previously unrecognized feature across many Alphaproteobacteria. Below, we focus on predictions about the nature and consequences of the difference in the base at position −7 of the −10 element for predicted σ70-dependent promoters within Alphaproteobacteria.

Functional groups associated with the products of genes that are downstream of −7T σ70-dependent promoters among Alphaproteobacteria.

Because a thymine at position −7 (−7T) relative to the TSS of σ70-dependent promoters is present in only a minority of the predicted promoters in Alphaproteobacteria (Fig. 1 and 2), we investigated if this small set of transcription units is enriched for gene products that have specific functions across these bacteria. When we analyzed the genes downstream of −7T σ70-dependent promoters in Alphaproteobacteria, we found that <20% of these encoded homologs of proteins contained in the bacterial Database of Essential Genes (49, 50) (Fig. 3B; Table S2). There was also a roughly equal distribution of all 4 bases at position −7 within promoters upstream of σ70-dependent transcription units that encode homologs of these essential proteins (Fig. 3B). We also analyzed the predicted −7T σ70-dependent promoters upstream of R. sphaeroides and C. crescentus genes that have been identified as essential in transposon insertion sequencing (Tn-seq) mutant libraries (51, 52) and genes identified as essential in Z. mobilis via microarrays (53). This analysis predicts that <15% of the genes containing −7T σ70-dependent promoters in these three species are essential and found that there was no statistical enrichment in these gene products (hypergeometric test, P ≤ 0.05) (Fig. 3C; Table S2). Indeed, there is no significant enrichment for any base at position −7 in the σ70-dependent promoters that are found upstream of these essential genes (Fig. 3C). Taken together, these data suggest that the genomes of Alphaproteobacteria have no significant enrichment for known essential genes downstream of −7T σ70-dependent promoters.

We also tested for functional enrichment of the products transcribed from genes downstream of predicted Alphaproteobacteria −7T σ70-dependent promoters. To do this, we analyzed the predicted cellular role of gene products transcribed from Alphaproteobacteria transcription units with a −7T σ70-dependent promoter with functional groups compiled from the KEGG Brite ontology, the KEGG pathway lists, and GO terms from each organism (54, 55) using a hypergeometric test (an adjusted P value of ≤0.1 indicates significant enrichment) (Fig. 4; Table 2; Table S3). This analysis revealed that the annotated functions of gene products downstream of predicted −7T σ70-dependent promoters were highly variable among the Alphaproteobacteria species for which genome-scale TSS data were available. Indeed, the only functions enriched in more than one Alphaproteobacteria were annotated as having roles in cell envelope function and protein degradation (Fig. 4; Table 2; Table S3). Moreover, this analysis revealed that Z. mobilis had no statistically significant enrichment for any functional groups for products encoded by genes downstream of predicted −7T σ70-dependent promoters, perhaps reflecting the low number of genes that fall in this category in this bacterium.

FIG 4.

FIG 4

Functional enrichment of genes downstream of predicted −7T σ70-dependent promoters (left) or −7A/C/G σ70-dependent promoters (right). Colors indicate percentage of all enriched genes within each species present in each cluster (rows) in each organism (columns) using the code shown at the lower left. Darker purple indicates that more enriched genes were present within that individual category. Gray boxes indicate gene sets which show no functional enrichment (NF) for a specific group in the indicated bacterial species.

TABLE 2.

Functional enrichment of genes downstream of predicted −7T σ70-dependent promoters

Functional group Species Subgroup No. of genes
Cell Cycle C. crescentus Cell Cycle (KEGG Brite ccs04112) 12
Cell Wall / Cell Membrane C. crescentus Peptidoglycan Metabolic Process (GO:0000270) 4
N. aromaticivorans Lipid Biosynthesis Proteins (KEGG Brite nar01004) 2
R. sphaeroides Peptidoglycan Biosynthetic Process (GO:0009252) 10
DNA Organization N. aromaticivorans Chromosome and Associated Proteins (KEGG Brite nar03036) 4
DNA Repair N. aromaticivorans DNA Repair and Recombination Proteins (KEGG Brite nar03400) 5
DNA Replication N. aromaticivorans DNA Replication Proteins (KEGG Brite nar03032) 3
Phospho-Group Transferase R. sphaeroides Transferase Activity Transferring Phosphorus-Containing Groups (GO:0016772) 10
Protein Degradation N. aromaticivorans Peptidases and Inhibitors (KEGG Brite nar01002) 4
Protein Catabolic Process (GO:0030163) 3
R. sphaeroides Peptidases and Inhibitors (KEGG Brite rsp01002) 14
Protein Catabolic Process (GO:0030163) 4
Protein Folding N. aromaticivorans Chaperones and Folding Catalysts (KEGG Brite nar03110) 5
Transcription N. aromaticivorans Transcription Factors (KEGG Brite nar03000) 5
Two-Component System (KEGG Brite nar02022) 2
Transcription Machinery (KEGG Brite nar03021) 2
Translation N. aromaticivorans Mitochondrial Biogenesis (KEGG Brite nar03029) 4
Exosome (KEGG Brite nar04147) 3
Ribosome Biogenesis (KEGG Brite nar03009) 3
Transport N. aromaticivorans Transporters (KEGG Brite nar02000) 8
TABLE S3

Functional group enrichment for products of genes downstream of predicted −7T σ70-dependent promoters and −7A/C/G σ70-dependent promoters. Download Table S3, XLSX file, 0.07 MB (74.1KB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

In the three Alphaproteobacteria for which genome-wide TSS data are available, we found evidence for functional enrichments unique to the individual organisms. For example, in N. aromaticivorans, the largest number of enriched functional groups of proteins encoded by genes downstream of −7T σ70-dependent promoters were predicted to function in translation, transport, transcription, protein folding, DNA organization, DNA repair, and DNA replication (Fig. 4; Table 2; Table S3), as well as some proteins being predicted to allow N. aromaticivorans to metabolize aromatic compounds (5660). In R. sphaeroides, the products encoded by genes downstream of predicted −7T σ70-dependent promoters were enriched for phosphor-group transfer, including sensor histidine kinases of two-component regulatory systems (Fig. 4; Table 2; Table S3). In C. crescentus, the products of genes downstream of predicted −7T σ70-dependent promoters were enriched in those involved in the cell cycle changes of this alphaproteobacterium (Fig. 4; Table 2; Table S3) (61). Because of this predicted enrichment, we asked if the genes in C. crescentus downstream of a predicted −7T σ70-dependent promoter exhibited any cell cycle changes in transcription in published RNA-seq data set (62). This analysis revealed an increase of the average abundance of transcripts derived from the cell cycle genes downstream of predicted −7T σ70-dependent promoters over the cell cycle (Fig. 5A). We did not find a similar pattern in cell cycle-specific transcript abundance when we analyzed the same number of randomized genes downstream of predicted −7T σ70-dependent promoters for genes that encode proteins with different functions (Fig. 5A). Taken together, these results indicate that genes downstream of predicted −7T σ70-dependent promoters do not encode common cellular functions across the Alphaproteobacteria. Instead, they predict that −7T σ70-dependent promoters are found upstream of transcription units that encode proteins responsible for diverse and possibly lifestyle-specific sets of cellular functions.

FIG 5.

FIG 5

Average transcript abundance values for selected functional groups in C. crescentus (A) and R. sphaeroides (B). (A) Average transcript abundance values for genes downstream of predicted −7T σ70-dependent promoters involved in the cell cycle (blue) and the same number of randomly selected genes downstream of predicted −7T σ70-dependent promoters (orange) in C. crescentus during the cell cycle. (B) Average transcript abundance values for genes downstream of predicted −7A/C/G σ70-dependent promoters involved in photosynthesis (blue) and translation (orange) in R. sphaeroides as a function of time after exposing an anaerobic culture to oxygen.

Alphaproteobacteria proteins transcribed from genes downstream of σ70-dependent −7A/C/G promoters perform essential and core cellular functions.

We also examined the predicted functions of Alphaproteobacteria proteins transcribed from genes downstream of predicted σ70-dependent promoters containing an adenine, cytosine, or guanine at position −7 (−7A/C/G promoters). In this case, we found that there was a high percentage of homologous proteins in the bacterial Database of Essential Genes (DEG) (49, 50) that were encoded by transcription units that contained predicted −7A/C/G σ70-dependent promoters (Fig. 3B; Table S2) but no significant difference in distribution of bases at this position within these promoters. Further, we found statistically significant enrichment of transposon-identified essential genes located downstream of predicted −7A/C/G σ70-dependent promoters (5153) (Fig. 3C; Table S2) (hypergeometric test, P ≤ 0.05). Examining the individual base distributions at position −7 again showed no significant enrichment (Fig. 3C). These analyses suggest that Alphaproteobacteria have a higher percentage of −7A/C/G σ70-dependent promoters upstream of genes encoding known essential functions than those containing a thymine at this position and that an A, C, and G at position −7 are roughly equally distributed in all the Alphaproteobacteria included in our analysis (Fig. 3).

We also tested for functional enrichments in the proteins transcribed from genes that are downstream of predicted −7A/C/G σ70-dependent promoters. This analysis identified several functional groups enriched in multiple Alphaproteobacteria species for products encoded by genes that are downstream of predicted −7A/C/G σ70-dependent promoters (Fig. 4; Table 3; Table S3). The enriched functional groups shared among all 4 Alphaproteobacteria species were gene products involved in translation, central carbon metabolism, and the biosynthesis of secondary metabolites (Fig. 4; Table 3; Table S3). The biosynthesis of purines (three species), amino acid biosynthesis gene products (three species), and transporters (two species) was enriched in the products of genes downstream of predicted −7A/C/G σ70-dependent promoters in a subset of the Alphaproteobacteria for which genome-wide TSS data sets were available (Fig. 4; Table 3; Table S3).

TABLE 3.

Functional enrichment of genes downstream of predicted −7A/C/G σ70-dependent promoters

Functional group Species Subgroup No. of genes
4Fe-4S Cluster Binding N. aromaticivorans 4Fe-4S Cluster Binding (GO:0051539) 20
Biosynthesis of Amino Acids N. aromaticivorans Biosynthesis of Amino Acids (KEGG Pathways nar01230) 64
Cellular Amino Acid Biosynthetic Process (GO:0008652) 38
Valine, Leucine, and Isoleucine Biosynthesis (KEGG Pathways nar00290) 10
Branched-Chain Amino Acid Biosynthetic Process (GO:0009082) 9
C5-Branched Dibasic Acid Metabolism (KEGG Pathways nar00660) 9
Cysteine and Methionine Metabolism (KEGG Pathways nar00270) 17
Lysine Biosynthetic Process via Diaminopimelate (GO:0009089) 7
Lysine Biosynthetic Process (GO:0009085) 7
Leucine Biosynthetic Process (GO:0009098) 5
Isoleucine Biosynthetic Process (GO:0009097) 6
Lysine Biosynthesis (KEGG Pathways nar00300) 9
Isoprenoid Biosynthetic Process (GO:0008299) 6
R. sphaeroides Biosynthesis of Amino Acids (KEGG Pathways rsp01230) 50
Cellular Amino Acid Biosynthetic Process (GO:0008652) 29
Histidine Biosynthetic Process (GO:0000105) 7
Threonine Biosynthetic Process (GO:0009088) 4
Lysine Biosynthetic Process (GO:0009085) 6
Lysine Biosynthetic Process via Diaminopimelate (GO:0009089) 6
Lysine Biosynthesis (KEGG Pathways rsp00300) 9
Cysteine and Methionine Metabolism (KEGG Pathways rsp00270) 15
Methionine Biosynthetic Process (GO:0009086) 7
Z. mobilis Biosynthesis of Amino Acids (KEGG Pathways zmo01230) 29
Biosynthesis of Lipids R. sphaeroides DNA Replication (KEGG Pathways rsp03030) 8
Biosynthesis of Purines N. aromaticivorans Purine Nucleotide Biosynthetic Process (GO:0006164) 11
Purine Metabolism (KEGG Pathways nar00230) 18
‘De Novo’ IMP Biosynthetic Process (GO:0006189) 8
R. sphaeroides Purine Nucleotide Biosynthetic Process (GO:0006164) 10
Z. mobilis Exosome (KEGG Brite zmo04147) 20
Biosynthesis of Secondary Metabolites C. crescentus Biosynthesis of Secondary Metabolites (KEGG Pathways ccs01110) 26
Biosynthesis of Cofactors (KEGG Pathways ccs01240) 7
N. aromaticivorans Biosynthesis of Secondary Metabolites (KEGG Pathways nar01110) 141
Heme Biosynthetic Process (GO:0006783) 5
R. sphaeroides Biosynthesis of Secondary Metabolites (KEGG Pathways rsp01110) 119
Biosynthesis of Cofactors (KEGG Pathways rsp01240) 61
Z. mobilis Biosynthesis of Secondary Metabolites (KEGG Pathways zmo01110) 58
Biosynthesis of Purines N. aromaticivorans Purine Nucleotide Biosynthetic Process (GO:0006164) 11
Purine Metabolism (KEGG Pathways nar00230) 18
‘De Novo’ IMP Biosynthetic Process (GO:0006189) 8
R. sphaeroides Purine Nucleotide Biosynthetic Process (GO:0006164) 10
Z. mobilis Exosome (KEGG Brite zmo04147) 20
Carbon Metabolism C. crescentus One-Carbon Metabolic Process (GO:0006730) 7
N. aromaticivorans 2-Oxocarboxylic Acid Metabolism (KEGG Pathways nar01210) 16
Carbon Metabolism (KEGG Pathways nar01200) 47
Glycolytic Process (GO:0006096) 7
Citrate Cycle (TCA Cycle (KEGG Pathways nar00020) 14
Gluconeogenesis (GO:0006094) 6
TCA Cycle (GO:0006099) 9
Pentose Phosphate Pathway (KEGG Pathways nar00030) 10
R. sphaeroides Carbon Metabolism (KEGG Pathways rps01200) 43
2-Oxocarboxylic Acid Metabolism (KEGG Pathways rsp01210) 12
TCA Cycle (GO:0006099) 8
Z. mobilis Carbon Metabolism (KEGG Pathways zmo01200) 18
Glycolysis/Gluconeogenesis (KEGG Pathways – zmo00010) 11
Pentose Phosphate Pathway (KEGG Pathways zmo00030) 5
Carbohydrate Metabolic Process (GO:0005975) 10
Cell Wall / Membrane N. aromaticivorans Polysaccharide Biosynthetic Process (GO:0000271) 7
Phospholipid Biosynthetic Process (GO:0008654) 7
DNA Repair N. aromaticivorans DNA Repair (GO:0006281) 19
Cellular Response to DNA Damage Stimulus (GO:0006974) 15
DNA Topoisomerase Type II (Double Strand Cut) (GO:0003918) 4
Base Excision Repair (KEGG Pathways nar03410) 8
DNA Replication R. sphaeroides DNA Replication (KEGG Pathways rsp03030) 8
Photosynthesis R. sphaeroides Porphyrin-Containing Compound Biosynthetic Process (GO:0006779) 8
Porphyrin and Chlorophyll Metabolism (KEGG Pathways rsp00860) 22
Protoporphyrinogen IX Biosynthetic Process (GO:0006782) 7
Chlorophyll Biosynthetic Process (GO:0015995) 12
Coproporphyrinogen Oxidase Activity (GO:0004109) 4
Carbon Fixation in Photosynthetic Organisms (KEGG Pathways rsp00710) 11
Protein Degradation N. aromaticivorans Proteolysis (GO:0006508) 28
Serine-Type Endopeptidase Activity (GO:0004252) 10
Protein Folding N. aromaticivorans Peptidyl-Prolyl Cis-Trans Isomerase Activity (GO:0003755) 7
Protein Peptidyl-Prolyl Isomerization (GO:0000413) 7
Protein Folding (GO:0006457) 8
RNA Processing R. sphaeroides RNA Phosphodiester Bond Hydrolysis Exonucleolytic (GO:0090503) 4
Transcription R. sphaeroides Transcription Machinery (KEGG Brite rsp03021) 12
Translation C. crescentus Translation (GO:0006412) 3
N. aromaticivorans Translation (GO:0006412) 50
Aminoacyl-tRNA Ligase Activity (GO:0004812) 19
tRNA Aminoacylation for Protein Translation (GO:0006418) 14
Aminoacyl-tRNA Biosynthesis (KEGG Pathways nar00970) 19
tRNA Processing (GO:0008033) 14
tRNA Binding (GO:0000049) 12
tRNA Aminoacylation (GO:0043039) 5
R. sphaeroides Translation (GO:0006412) 54
Aminoacyl-tRNA Biosynthesis (KEGG Pathways rsp00970) 44
Aminoacyl-tRNA Ligase Activity (GO:0004812) 18
Transfer RNA Biogenesis (KEGG Brite rsp03016) 33
tRNA Aminoacylation for Protein Translation (GO:0006418) 14
Translation Factors (KEGG Brite rsp03012) 13
Mitochondrial Biogenesis (KEGG Brite rsp03029) 20
tRNA Binding (GO:0000049) 15
RNA Binding (GO:0003723) 31
Non-Coding RNAs (KEGG Brite rsp03100) 25
Translational Termination (GO:0006415) 5
Ribosome (GO:0005840) 21
tRNA Processing (GO:0008033) 12
Structural Constituent of Ribosome (GO:0003735) 20
Ribosome Biogenesis (KEGG Brite rsp03009) 17
Translation Elongation Factor Activity (GO:0003746) 6
Translation Elongation (GO:0006414) 6
Z. mobilis Translation (GO:0006412) 29
Ribosome (KEGG Brite zmo03011) 38
Ribosome (KEGG Brite zmo03010) 19
Structural Component of Ribosome (GO:0003735) 19
Ribosome (GO:0005840) 20
Transport N. aromaticivorans Protein Export (KEGG Pathways nar03060) 10
R. sphaeroides Protein Export (KEGG Pathways rsp03060) 11
Bacterial Secretion System (KEGG Pathways rsp03070) 10

In addition, this analysis showed enrichment of several other groups of gene products that are transcribed from predicted −7A/C/G σ70-dependent promoters in single bacterial species (Fig. 4; Table 3; Table S3). One example of this is the enrichment of genes whose products are involved in photosynthesis in R. sphaeroides, the only phototrophic alphaproteobacterium for which genome-wide TSS data sets are available (Fig. 4; Table 3; Table S3). Consistent with the bioinformatically predicted function of the −7A/C/G σ70-dependent promoters that are upstream of genes encoding proteins involved in photosynthesis, there is a significant reduction in abundance of transcripts encoding proteins involved in photosynthesis and only a slight increase in abundance of those encoding translation functions after photosynthetic cells are shifted to nonphotosynthetic conditions (Fig. 5B) (63). We also found that products encoded by genes transcribed from predicted −7A/C/G σ70-dependent promoters that function in lipid biosynthesis and transcription were among those enriched only in R. sphaeroides. The genes transcribed from predicted −7A/C/G σ70-dependent promoters whose products are involved in lipid biosynthesis may play a role in forming the membrane invaginations that house the photosynthetic apparatus of this organism (64), while the genes encoding alternative σ factors in this group (rpoE, rpoH1, and rpoH2) may play a role in the R. sphaeroides response to singlet oxygen and heat or envelope stress (Fig. 4; Table 3; Table S3) (6570).

In another example, predicted −7A/C/G σ70-dependent promoters in N. aromaticivorans were overrepresented upstream of transcription units that encode iron-sulfur proteins, enzymes in cell wall/cell membrane biosynthesis, DNA repair, protein degradation, and protein folding (Fig. 4; Table 3; Table S3). Phenolic compounds metabolized by N. aromaticivorans are known to damage bacterial cell membranes and other macromolecules, suggesting that this gene is associated with a lifestyle of this alphaproteobacterium (71, 72). Further, several of the iron-sulfur proteins transcribed from genes downstream of predicted −7A/C/G σ70-dependent promoters function in the tricarboxylic acid (TCA) cycle, which assimilates the products of aromatic metabolism into N. aromaticivorans cellular biomass (58, 73). Below, we discuss the importance of finding that a majority of Alphaproteobacteria genes that are transcribed from predicted −7A/C/G σ70-dependent promoters include those involved in many cellular functions.

DISCUSSION

The initiation of transcription requires the recognition and binding of RNAP to specific promoter DNA sequences, an event that requires a σ factor, and can depend on other proteins and small molecule ligands (3, 4). A variety of studies have helped predict the promoter sequence in some well-studied bacterial species, but access to genome-scale maps of TSSs at the nucleotide level provides an opportunity to catalog and compare promoter sequences across the bacterial phylogeny. Here, we predicted promoter sequences using bioinformatic analysis of published genome-scale TSS-seq data sets. Using the MEME and Delila-PY motif-finding algorithms (4547), we predicted differences in the −35 (Actinobacteria and Betaproteobacteria) and −10 (Alphaproteobacteria) promoter elements that are recognized by the housekeeping σ factor (σ70). Below, we discuss the biochemical and functional consequences of the differences in predicted σ70-dependent promoters across these taxa.

Features of σ70-dependent promoter recognition that are conserved across phyla.

The −10 and −35 elements of bacterial promoters make specific contacts with separate regions of σ factors. From mechanistic studies, amino acids in σ70 region 2.4 make specific contacts with the −10 element in cognate promoters (27, 74). Comparison of the sequence of σ70 region 2 in the phyla we examined shows a high level of amino acid conservation, including Q437, T440, and R441 (using residue numbers of E. coli σ70) (Fig. 6A), which recognize the −10 region of the promoter (10). These residues are also conserved in the Alphaproteobacteria, where the sequences of the −10 elements in the majority of the predicted σ70 promoters lack a thymine at position −7 that is highly conserved in many other promoters that are recognized by this σ factor. Similarly, the −35 promoter elements interact specifically with σ70 region 4 (10). Comparison of σ70 region 4 among the bacteria studied here also shows a high degree of amino acid conservation, including residues that recognize the −35 sequence: R584, E585, and Q589 (using residues numbers of E. coli σ70) (Fig. 6B). Indeed, these σ70 region 4 amino acids are conserved in M. smegmatis, S. coelicolor, and B. cenocepacia, bacteria that lack a −35 TTG sequence that is conserved across the phyla (Fig. 1 and 2). This suggests that few, if any, differences in σ70 exist to account for the observed differences in −35 elements of M. smegmatis, S. coelicolor, and B. cenocepacia and the −10 elements of Alphaproteobacteria.

FIG 6.

FIG 6

Amino acid alignment of region 2 (A) and region 4 (B) of the housekeeping σ factor and portion of CarD homologs (C) from the indicated bacterial species. Asterisks and highlighting indicate fully conserved residues, colons indicate conservation between residues with strongly similar properties, and periods indicate conservation between residues with weakly similar properties (94, 95). Conserved residues involved in predicted σ70-dependent −10 promoter binding (A), −35 promoter binding (B), or key functional residue in CarD (C) are indicated by red arrows.

Alphaproteobacteria genes for essential and core metabolic functions contain −7A/C/G σ70-dependent promoters.

It was previously proposed that there are differences in the −10 promoter elements of Alphaproteobacteria, based on examination of a small number of transcription units (75, 76) or organisms (37, 96). In this study, we used published genome-scale TSS data sets to show that −7T is widely conserved across the bacterial phylogeny, except for the Alphaproteobacteria.

We also examined the biological implications of this variance in the sequence of the −10 element of σ70-dependent promoters in Alphaproteobacteria. We found that essential genes were more likely to be transcribed from predicted −7A/C/G σ70-dependent promoters in several Alphaproteobacteria. This finding was unexpected given the conservation of amino acid residues in σ70 region 2 that recognize the −10 element sequence, and it suggests that there are different requirements for transcription initiation in Alphaproteobacteria (see below). We also found that the functions of proteins transcribed from genes downstream of predicted −7A/C/G σ70-dependent promoters were often shared among the Alphaproteobacteria. This so-called core regulon of genes that are downstream of −7A/C/G σ70-dependent promoters includes proteins that function in translation, carbon metabolism, and biosynthesis of amino acids, purines, and secondary metabolites. These findings suggest that there has been a reprogramming of promoter architecture within the Alphaproteobacteria to place −7A/C/G σ70-dependent promoters upstream of both essential genes and ones that encode numerous cellular functions. The presence of a core Alphaproteobacteria regulon that contains predicted −7A/C/G σ70-dependent promoters makes it tempting to propose that this reprogramming occurred after the Alphaproteobacteria diverged. Analysis of TSS data sets from other members of the bacterial phylogeny is needed to test this hypothesis.

However, we also found Alphaproteobacteria −7T σ70-dependent promoters upstream of genes that encode proteins with critical functions, including cell cycle genes in C. crescentus and cell wall/cell membrane biosynthesis genes in multiple members of this phylum. In addition, some Alphaproteobacteria showed enrichment for different bases at position −7 of σ70-dependent promoters upstream of genes that were linked to individual lifestyles. For example, R. sphaeroides showed enrichment for predicted −7A/C/G σ70-dependent promoters upstream of genes encoding proteins involved in photosynthesis, while C. crescentus showed enrichment for predicted −7T σ70-dependent promoters upstream of genes encoding products involved in their cell cycle developmental program. In contrast, N. aromaticivorans contained a large number of enriched functional groups transcribed from genes downstream of both −7T and −7A/C/G σ70-dependent promoters. If the latter finding reflects the acquisition by N. aromaticivorans of transcription units from a variety of bacteria which allow it to metabolize aromatic compounds, then future analysis of other aromatic-metabolizing Alphaproteobacteria (77) might shed light on core or extended regulons for this metabolic capacity. We could not identify functional enrichments for Z. mobilis proteins transcribed from genes downstream of either −7T promoters or −7A/C/G σ70-dependent promoters, but this might reflect the number of genes annotated with unknown function and the lack of metabolic analyses of this alphaproteobacterium. Further studies of Alphaproteobacteria are needed to better understand the roles of proteins encoded by genes downstream of −7T and −7A/C/G σ70-dependent promoters in their lifestyles.

The potential role of CarD at Alphaproteobacteria σ70-dependent promoters.

In organisms in which the majority of −10 elements contain a −7T, the presence of other bases at this position often reduces their activity, creates a promoter which requires different base patterns to compensate, or requires another protein to stimulate transcription (7880, 96). The transcription factor CarD may play such a stimulatory role in Alphaproteobacteria, since it increased transcription from several R. sphaeroides σ70-dependent −7A/C/G promoters in vitro (96), possibly by stabilizing open complex formation by RNAP (28, 29). CarD is essential in several Alphaproteobacteria, and a residue required for CarD function (W86) (29, 30) (red arrow in Fig. 6C) is conserved across the Alphaproteobacteria we studied, suggesting that this protein also activates transcription in these species (Fig. 6C). Together, these data suggest that CarD homologs enhance transcription by σ70-containing RNAP, perhaps by compensating for the lack of a thymine at position −7 in the −10 element of Alphaproteobacteria.

Lateral gene transfer (LGT) is common within Alphaproteobacteria and with other phyla (81, 82), and it is proposed to be a key component of proteobacterial evolution (83). This raises the possibility that LGT of CarD and transcription units containing −7A/C/G σ70-dependent promoters played an important role in both the evolution of the Alphaproteobacteria and their branching from other taxa. Additional analysis of alphaproteobacterial species could lead to a better understanding of any evolutionary link between CarD and transcription initiation.

Potential impacts of promoter differences on biotechnology.

The unique features of Alphaproteobacteria σ70-dependent promoters described here and elsewhere (37, 96) highlight the importance of analyzing multiple phyla to gain a more complete picture of transcription initiation. For example, the paradigms for σ70-dependent sequence motifs developed in other bacteria might not accurately predict the presence of alphaproteobacterial promoters. The ability to control activity of alphaproteobacterial promoters has practical applications, since they have biochemical and metabolic pathways that would be beneficial to harness for various biotechnology applications. These include the conversion of lignin-derived and other aromatic compounds into valuable products by N. aromaticivorans (56, 5860) and the ability of R. sphaeroides to harvest solar energy, fix atmospheric nitrogen and CO2, and produce hydrogen and other valuable chemicals (8488). Future efforts to engineer these and other bacteria will be enhanced by a better understanding of promoter architecture and the role of proteins like CarD in transcription initiation.

Conclusion.

By analyzing published genome-scale TSS data sets from species across the bacterial phylogeny, we found that Actinobacteria and Betaproteobacteria σ70-dependent promoters lack conserved bases in their predicted −35 elements. We further found that the base at the −7 position of the −10 elements in over 60% of the σ70-dependent promoters in several diverse Alphaproteobacteria differs from that found in many other phyla, and we propose that CarD plays a role in activating transcription from Alphaproteobacteria σ70-dependent promoters. Our findings highlight the importance of studying numerous bacterial species to increase our understanding of transcription and engineer members of the Alphaproteobacteria as well as other bacterial phyla for processes of medical, agricultural, environmental, and biotechnological importance.

MATERIALS AND METHODS

Data sets.

We used published TSS data from M. smegmatis and S. coelicolor (Actinobacteria) (40, 41), C. crescentus, N. aromaticivorans, R. sphaeroides, and Z. mobilis (Alphaproteobacteria) (37, 38, 43), B. cenocepacia (Betaproteobacteria) (39), B. subtilis (Betaproteobacteria) (44), and E. coli (Gammaproteobacteria) (42). All identified TSSs were used in our analysis; there was no attempt to remove any TSSs potentially downstream of alternative σ factors. Genome sequence files (e.g., GFF, GenBank, and genome FASTA) for each species were obtained from NCBI (89) using the following accession numbers for each species: M. smegmatis, NC_008596.1; S. coelicolor, NC_003888.3; C. crescentus, NC_011916.1; N. aromaticivorans, NC_007794.1; R. sphaeroides, NC_007493.2; Z. mobilis, NZ_CP023715.1; B. cenocepacia, AM747720.1; B. subtilis, NC_000964.3; and E. coli, NC_000913.3.

Promoter motif prediction.

The MEME motif finder (version 5.1.0) was used to predict σ70-dependent promoter elements upstream of each TSS (45, 46). To identify the −10 promoter element, the nucleotide sequence from −19 to −5 relative to each identified TSS was analyzed using the “zoops” method (minimum width of 9 bp, maximum width of 10 bp, no palindromic motifs) in MEME. The motif with the most hits and lowest P value was chosen for each species. The predicted −10 elements all had overrepresented TA nucleotides at positions −12 and −11 relative to the TSS. The percentage of thymine at position −7 was calculated relative to the TA dinucleotide within each predicted −10 promoter element. If identical sequences were used from multiple closely spaced TSSs upstream of the same gene, the duplications were not analyzed to determine the position −7 base percentages. To identify DNA sequences of potential −35 elements in each data set, the positions −40 to −28 relative to each identified TSS with an identified −10 element were analyzed using MEME and the settings described above. Motifs for publication were constructed using WebLogo (90). Distances between the −35 and −10 elements and between the −10 element and position +1 were determined using custom Python scripts.

Delila-PY (47) was also used to predict σ70-dependent promoter motifs (48, 91). The DNA sequences in predicted −10 elements were identified by searching −15 to −5 relative to each TSS using default parameters for Delila-PY. The motif with the highest information content is, by default, reported by Delila-PY and was used in our analyses. The percentage of a thymine base (T) at position −7 was calculated as described above. To identify the DNA sequences in −35 elements for each species, the positions −37 to −27 relative to each identified TSS with an identified −10 element were analyzed using the default Delila-PY settings, and the predicted logos are reported in our analyses. We used the Delila-PY −10 promoter element predictions for subsequent analysis due to the larger number of predicted σ70-dependent promoters generated by this tool.

Determining gene essentiality.

Predicted σ70-dependent promoters identified by Delila-PY were split into two categories based on the identity of the base at position −7: −7T and −7A/C/G. For each group, the genes downstream of each predicted σ70 promoter were searched against the Database of Essential Genes (DEG), version 15 (49). The amino acid sequence of each gene downstream of a predicted σ70-dependent promoter was searched against the protein sequences of all bacterial essential genes in DEG 15 using BLAST (version 2.9.0) (92) with a E value threshold of 1 × 10−10. Genes with at least one match to a protein sequence within the DEG list in any bacterial species were considered essential for this analysis.

The genes within the −7T and the −7A/C/G groups were also compared to essential genes identified by transposon insertion sequencing (Tn-seq) for R. sphaeroides and C. crescentus (51, 52) or transposon insertion identification via microarrays for Z. mobilis (53). Statistical significance of the number of essential genes within each group and species was determined using a hypergeometric test, with an adjusted P value of ≤0.05 being considered significant.

Functional enrichment.

Functions of gene products downstream of predicted σ70-dependent promoters in the −7T and −7A/C/G groups were obtained from the NCBI GenBank file for each species. These predicted functions were mapped to organized protein functional groups from the KEGG Brite ontology, KEGG Pathways, and GO terms (54, 55). The comparison was performed using a hypergeometric test with an adjusted P value of ≤0.1 as a threshold for significant enrichment. Subgroups were combined into supergroups by binning similar cellular functions.

Protein sequence alignments.

Protein sequences of RNA polymerase σ70 and CarD homologs were obtained from the GenBank files mentioned above. Clustal Omega (9395) was used to align the sequences using default parameters.

ACKNOWLEDGMENTS

We thank Michael Place for programming assistance and Kemardo Henry, Wilma Ross, and Richard Gourse, as well as members of the Donohue and Noguera labs for helpful discussion.

This material is based upon work supported by the Great Lakes Bioenergy Research Center, U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research under award DE-SC0018409 and by the National Science Foundation under award CBET-1803055.

Contributor Information

Timothy J. Donohue, Email: tdonohue@bact.wisc.edu.

Karoline Faust, KU Leuven.

REFERENCES

  • 1.d'Espaux L, Mendez-Perez D, Li R, Keasling JD. 2015. Synthetic biology for microbial production of lipid-based biofuels. Curr Opin Chem Biol 29:58–65. doi: 10.1016/j.cbpa.2015.09.009. [DOI] [PubMed] [Google Scholar]
  • 2.Liu Y, Nielsen J. 2019. Recent trends in metabolic engineering of microbial chemical factories. Curr Opin Biotechnol 60:188–197. doi: 10.1016/j.copbio.2019.05.010. [DOI] [PubMed] [Google Scholar]
  • 3.Lee DJ, Minchin SD, Busby SJW. 2012. Activating transcription in bacteria. Annu Rev Microbiol 66:125–152. doi: 10.1146/annurev-micro-092611-150012. [DOI] [PubMed] [Google Scholar]
  • 4.Minchin SD, Busby SJW. 2009. Analysis of mechanisms of activation and repression at bacterial promoters. Methods 47:6–12. doi: 10.1016/j.ymeth.2008.10.012. [DOI] [PubMed] [Google Scholar]
  • 5.Chen J, Boyaci H, Campbell EA. 2021. Diverse and unified mechanisms of transcription initiation in bacteria. Nat Rev Microbiol 19:95–109. doi: 10.1038/s41579-020-00450-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Burgess RR, Travers AA, Dunn JJ, Bautz EK. 1969. Factor stimulating transcription by RNA polymerase. Nature 221:43–46. doi: 10.1038/221043a0. [DOI] [PubMed] [Google Scholar]
  • 7.Auble DT, Allen TL, deHaseth PL. 1986. Promoter recognition by Escherichia coli RNA polymerase. Effects of substitutions in the spacer DNA separating the -10 and -35 regions. J Biol Chem 261:11202–11206. doi: 10.1016/S0021-9258(18)67368-5. [DOI] [PubMed] [Google Scholar]
  • 8.Gaal T, Barkei J, Dickson RR, deBoer HA, deHaseth PL, Alavi H, Gourse RL. 1989. Saturation mutagenesis of an Escherichia coli rRNA promoter and initial characterization of promoter variants. J Bacteriol 171:4852–4861. doi: 10.1128/jb.171.9.4852-4861.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kobayashi M, Nagata K, Ishihama A. 1990. Promoter selectivity of Escherichia coli RNA polymerase: effect of base substitutions in the promoter -35 region on promoter strength. Nucleic Acids Res 18:7367–7372. doi: 10.1093/nar/18.24.7367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Campbell EA, Muzzin O, Chlenov M, Sun JL, Olson CA, Weinman O, Trester-Zedlitz ML, Darst SA. 2002. Structure of the bacterial RNA polymerase promoter specificity σ subunit. Mol Cell 9:527–539. doi: 10.1016/s1097-2765(02)00470-7. [DOI] [PubMed] [Google Scholar]
  • 11.Lin W, Mandal S, Degen D, Cho MS, Feng Y, Das K, Ebright RH. 2019. Structural basis of ECF-sigma-factor-dependent transcription initiation. Nat Commun 10:710. doi: 10.1038/s41467-019-08443-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dall'Alba G, Casa PL, Notari DL, Adami AG, Echeverrigaray S, de Avila ESS. 2019. Analysis of the nucleotide content of Escherichia coli promoter sequences related to the alternative sigma factors. J Mol Recognit 32:e2770. doi: 10.1002/jmr.2770. [DOI] [PubMed] [Google Scholar]
  • 13.Zhao K, Liu M, Burgess RR. 2010. Promoter and regulon analysis of nitrogen assimilation factor, σ54, reveal alternative strategy for E. coli MG1655 flagellar biosynthesis. Nucleic Acids Res 38:1273–1283. doi: 10.1093/nar/gkp1123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Campbell EA, Kamath S, Rajashankar KR, Wu M, Darst SA. 2017. Crystal structure of Aquifex aeolicus sigma(N) bound to promoter DNA and the structure of sigma(N)-holoenzyme. Proc Natl Acad Sci U S A 114:E1805–E1814. doi: 10.1073/pnas.1619464114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rhodius VA, Mutalik VK. 2010. Predicting strength and function for promoters of the Escherichia coli alternative sigma factor, sigmaE. Proc Natl Acad Sci U S A 107:2854–2859. doi: 10.1073/pnas.0915066107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Campagne S, Allain FH, Vorholt JA. 2015. Extracytoplasmic function sigma factors, recent structural insights into promoter recognition and regulation. Curr Opin Struct Biol 30:71–78. doi: 10.1016/j.sbi.2015.01.006. [DOI] [PubMed] [Google Scholar]
  • 17.Shultzaberger RK, Chen Z, Lewis KA, Schneider TD. 2007. Anatomy of Escherichia coli σ 70 promoters. Nucleic Acids Res 35:771–788. doi: 10.1093/nar/gkl956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Brewster RC, Jones DL, Phillips R. 2012. Tuning promoter strength through RNA polymerase binding site design in Escherichia coli. PLoS Comput Biol 8:e1002811. doi: 10.1371/journal.pcbi.1002811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Brunner M, Bujard H. 1987. Promoter recognition and promoter strength in the Escherichia coli system. EMBO J 6:3139–3144. doi: 10.1002/j.1460-2075.1987.tb02624.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bienick MS, Young KW, Klesmith JR, Detwiler EE, Tomek KJ, Whitehead TA. 2014. The interrelationship between promoter strength, gene expression, and growth rate. PLoS One 9:e109105. doi: 10.1371/journal.pone.0109105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Paget MS. 2015. Bacterial sigma factors and anti-sigma factors: structure, function and distribution. Biomolecules 5:1245–1265. doi: 10.3390/biom5031245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Browning DF, Busby SJ. 2004. The regulation of bacterial transcription initiation. Nat Rev Microbiol 2:57–65. doi: 10.1038/nrmicro787. [DOI] [PubMed] [Google Scholar]
  • 23.Feklistov A, Darst SA. 2011. Structural basis for promoter-10 element recognition by the bacterial RNA polymerase sigma subunit. Cell 147:1257–1269. doi: 10.1016/j.cell.2011.10.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Huerta AM, Collado-Vides J. 2003. Sigma70 promoters in Escherichia coli: specific transcription in dense regions of overlapping promoter-like signals. J Mol Biol 333:261–278. doi: 10.1016/j.jmb.2003.07.017. [DOI] [PubMed] [Google Scholar]
  • 25.Paget MS, Helmann JD. 2003. The sigma70 family of sigma factors. Genome Biol 4:203. doi: 10.1186/gb-2003-4-1-203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kumar A, Grimes B, Fujita N, Makino K, Malloch RA, Hayward RS, Ishihama A. 1994. Role of the sigma 70 subunit of Escherichia coli RNA polymerase in transcription activation. J Mol Biol 235:405–413. doi: 10.1006/jmbi.1994.1001. [DOI] [PubMed] [Google Scholar]
  • 27.Lonetto M, Gribskov M, Gross CA. 1992. The σ70 family: sequence conservation and evolutionary relationships. J Bacteriol 174:3843–3849. doi: 10.1128/jb.174.12.3843-3849.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Stallings CL, Glickman MS. 2011. CarD: a new RNA polymerase modulator in mycobacteria. Transcription 2:15–18. doi: 10.4161/trns.2.1.13628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Srivastava DB, Leon K, Osmundson J, Garner AL, Weiss LA, Westblade LF, Glickman MS, Landick R, Darst SA, Stallings CL, Campbell EA. 2013. Structure and function of CarD, an essential mycobacterial transcription factor. Proc Natl Acad Sci U S A 110:12619–12624. doi: 10.1073/pnas.1308270110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bae B, Chen J, Davis E, Leon K, Darst SA, Campbell EA. 2015. CarD uses a minor groove wedge mechanism to stabilize the RNA polymerase open promoter complex. Elife 4:e08505. doi: 10.7554/eLife.08505. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Weiss LA, Harrison PG, Nickels BE, Glickman MS, Campbell EA, Darst SA, Stallings CL. 2012. Interaction of CarD with RNA polymerase mediates Mycobacterium tuberculosis viability, rifampin resistance, and pathogenesis. J Bacteriol 194:5621–5631. doi: 10.1128/JB.00879-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Garcia-Moreno D, Abellon-Ruiz J, Garcia-Heras F, Murillo FJ, Padmanabhan S, Elias-Arnanz M. 2010. CdnL, a member of the large CarD-like family of bacterial proteins, is vital for Myxococcus xanthus and differs functionally from the global transcriptional regulator CarD. Nucleic Acids Res 38:4586–4598. doi: 10.1093/nar/gkq214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Stallings CL, Stephanou NC, Chu L, Hochschild A, Nickels BE, Glickman MS. 2009. CarD is an essential regulator of rRNA transcription required for Mycobacterium tuberculosis persistence. Cell 138:146–159. doi: 10.1016/j.cell.2009.04.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gallego-Garcia A, Iniesta AA, Gonzalez D, Collier J, Padmanabhan S, Elias-Arnanz M. 2017. Caulobacter crescentus CdnL is a non-essential RNA polymerase-binding protein whose depletion impairs normal growth and rRNA transcription. Sci Rep 7:43240. doi: 10.1038/srep43240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhu DX, Garner AL, Galburt EA, Stallings CL. 2019. CarD contributes to diverse gene expression outcomes throughout the genome of Mycobacterium tuberculosis. Proc Natl Acad Sci U S A 116:13573–13581. doi: 10.1073/pnas.1900176116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Woldemeskel SA, Daitch AK, Alvarez L, Panis G, Zeinert R, Gonzalez D, Smith E, Collier J, Chien P, Cava F, Viollier PH, Goley ED. 2020. The conserved transcriptional regulator CdnL is required for metabolic homeostasis and morphogenesis in Caulobacter. PLoS Genet 16:e1008591. doi: 10.1371/journal.pgen.1008591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Vera JM, Ghosh IN, Zhang Y, Hebert AS, Coon JJ, Landick R. 2020. Genome-scale transcription-translation mapping reveals features of Zymomonas mobilis transcription units and promoters. mSystems 5:e00250-20. doi: 10.1128/mSystems.00250-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Myers KS, Vera JM, Lemmer KC, Linz AM, Landick R, Noguera DR, Donohue TJ. 2020. Genome-wide identification of transcription start sites in two Alphaproteobacteria, Rhodobacter sphaeroides 2.4.1 and Novosphingobium aromaticivorans DSM 12444. Microbiol Resour Announc 9:e00880-20. doi: 10.1128/MRA.00880-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Sass AM, Van Acker H, Forstner KU, Van Nieuwerburgh F, Deforce D, Vogel J, Coenye T. 2015. Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315. BMC Genomics 16:775. doi: 10.1186/s12864-015-1993-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Li X, Mei H, Chen F, Tang Q, Yu Z, Cao X, Andongma BT, Chou SH, He J. 2017. Transcriptome landscape of Mycobacterium smegmatis. Front Microbiol 8:2505. doi: 10.3389/fmicb.2017.02505. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Jeong Y, Kim JN, Kim MW, Bucca G, Cho S, Yoon YJ, Kim BG, Roe JH, Kim SC, Smith CP, Cho BK. 2016. The dynamic transcriptional and translational landscape of the model antibiotic producer Streptomyces coelicolor A3(2). Nat Commun 7:11605. doi: 10.1038/ncomms11605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Thomason MK, Bischler T, Eisenbart SK, Förstner KU, Zhang A, Herbig A, Nieselt K, Sharma CM, Storz G. 2015. Global transcriptional start site mapping using differential RNA sequencing reveals novel antisense RNAs in Escherichia coli. J Bacteriol 197:18–28. doi: 10.1128/JB.02096-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhou B, Schrader JM, Kalogeraki VS, Abeliuk E, Dinh CB, Pham JQ, Cui ZZ, Dill DL, McAdams HH, Shapiro L. 2015. The global regulatory architecture of transcription during the Caulobacter cell cycle. PLoS Genet 11:e1004831. doi: 10.1371/journal.pgen.1004831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Warman E, Forrest D, Wade JT, Grainger DC. 2020. Widespread divergent transcription from prokaryotic promoters. bioRxiv doi: 10.1101/2020.01.31.928960. [DOI] [PMC free article] [PubMed]
  • 45.Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. 2009. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208. doi: 10.1093/nar/gkp335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Bailey TL, Williams N, Misleh C, Li WW. 2006. MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res 34:W369–W373. doi: 10.1093/nar/gkl198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Myers KS, Place M, Kominek J, Noguera DR, Donohue TJ, Newton ILG. 2021. Delila-PY, a pipeline for utilizing the Delila suite of software to identify potential DNA binding motifs. Microbiol Resour Announc 10:e00160-21. doi: 10.1128/MRA.00160-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Schneider TD, Stormo GD, Yarus MA, Gold L. 1984. Delila system tools. Nucleic Acids Res 12:129–140. doi: 10.1093/nar/12.1part1.129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Luo H, Lin Y, Liu T, Lai FL, Zhang CT, Gao F, Zhang R. 2020. DEG 15, an update of the Database of Essential Genes that includes built-in analysis tools. Nucleic Acids Res 49:D677–D686. doi: 10.1093/nar/gkaa917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Zhang R, Ou HY, Zhang CT. 2004. DEG: a database of essential genes. Nucleic Acids Res 32:D271–D272. doi: 10.1093/nar/gkh024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Burger BT, Imam S, Scarborough MJ, Noguera DR, Donohue TJ. 2017. Combining genome-scale experimental and computational methods to identify essential genes in Rhodobacter sphaeroides. mSystems 2:e00015-17. doi: 10.1128/mSystems.00015-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Christen B, Abeliuk E, Collier JM, Kalogeraki VS, Passarelli B, Coller JA, Fero MJ, McAdams HH, Shapiro L. 2011. The essential genome of a bacterium. Mol Syst Biol 7:528. doi: 10.1038/msb.2011.58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Deutschbauer A, Price MN, Wetmore KM, Tarjan DR, Xu Z, Shao W, Leon D, Arkin AP, Skerker JM. 2014. Towards an informative mutant phenotype for every bacterial gene. J Bacteriol 196:3643–3655. doi: 10.1128/JB.01836-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Karp PD, Paley SM, Krummenacker M, Latendresse M, Dale JM, Lee TJ, Kaipa P, Gilham F, Spaulding A, Popescu L, Altman T, Paulsen I, Keseler IM, Caspi R. 2010. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology. Brief Bioinform 11:40–79. doi: 10.1093/bib/bbp043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Keseler IM, Collado-Vides J, Santos-Zavaleta A, Peralta-Gil M, Gama-Castro S, Muñiz-Rascado L, Bonavides-Martinez C, Paley S, Krummenacker M, Altman T, Kaipa P, Spaulding A, Pacheco J, Latendresse M, Fulcher C, Sarker M, Shearer AG, Mackie A, Paulsen I, Gunsalus RP, Karp PD. 2011. EcoCyc: a comprehensive database of Escherichia coli biology. Nucleic Acids Res 39:D583–D590. doi: 10.1093/nar/gkq1143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Cecil JH, Garcia DC, Giannone RJ, Michener JK. 2018. Rapid, parallel identification of catabolism pathways of lignin-derived aromatic compounds in Novosphingobium aromaticivorans. Appl Environ Microbiol 84:e01185-18. doi: 10.1128/AEM.01185-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kontur WS, Bingman CA, Olmsted CN, Wassarman DR, Ulbrich A, Gall DL, Smith RW, Yusko LM, Fox BG, Noguera DR, Coon JJ, Donohue TJ. 2018. Novosphingobium aromaticivorans uses a Nu-class glutathione S-transferase as a glutathione lyase in breaking the beta-aryl ether bond of lignin. J Biol Chem 293:4955–4968. doi: 10.1074/jbc.RA117.001268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Perez JM, Kontur WS, Gehl C, Gille DM, Ma Y, Niles AV, Umana G, Donohue TJ, Noguera DR. 2021. Redundancy in aromatic O-demethylation and ring opening reactions in Novosphingobium aromaticivorans and their impact in the metabolism of plant derived phenolics. Appl Environ Microbiol 87:e02794-20. doi: 10.1128/AEM.02794-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kontur WS, Olmsted CN, Yusko LM, Niles AV, Walters KA, Beebe ET, Vander Meulen KA, Karlen SD, Gall DL, Noguera DR, Donohue TJ. 2019. A heterodimeric glutathione S-transferase that stereospecifically breaks lignin's beta(R)-aryl ether bond reveals the diversity of bacterial beta-etherases. J Biol Chem 294:1877–1890. doi: 10.1074/jbc.RA118.006548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Gall DL, Ralph J, Donohue TJ, Noguera DR. 2014. A group of sequence-related sphingomonad enzymes catalyzes cleavage of beta-aryl ether linkages in lignin beta-guaiacyl and beta-syringyl ether dimers. Environ Sci Technol 48:12454–12463. doi: 10.1021/es503886d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Skerker JM, Laub MT. 2004. Cell-cycle progression and the generation of asymmetry in Caulobacter crescentus. Nat Rev Microbiol 2:325–337. doi: 10.1038/nrmicro864. [DOI] [PubMed] [Google Scholar]
  • 62.Fang G, Passalacqua KD, Hocking J, Llopis PM, Gerstein M, Bergman NH, Jacobs-Wagner C. 2013. Transcriptomic and phylogenetic analysis of a bacterial cell cycle reveals strong associations between gene co-expression and evolution. BMC Genomics 14:450. doi: 10.1186/1471-2164-14-450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Arai H, Roh JH, Kaplan S. 2008. Transcriptome dynamics during the transition from anaerobic photosynthesis to aerobic respiration in Rhodobacter sphaeroides 2.4.1. J Bacteriol 190:286–299. doi: 10.1128/JB.01375-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Lemmer KC, Alberge F, Myers KS, Dohnalkova AC, Schaub RE, Lenz JD, Imam S, Dillard JP, Noguera DR, Donohue TJ. 2020. The NtrYX two-component system regulates the bacterial cell envelope. mBio 11:e00957-20. doi: 10.1128/mBio.00957-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.MacGregor BJ, Karls RK, Donohue TJ. 1998. Transcription of the Rhodobacter sphaeroides cycA P1 promoter by alternate RNA polymerase holoenzymes. J Bacteriol 180:1–9. doi: 10.1128/JB.180.1.1-9.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Karls RK, Brooks J, Rossmeissl P, Luedke J, Donohue TJ. 1998. Metabolic roles of a Rhodobacter sphaeroides member of the sigma32 family. J Bacteriol 180:10–19. doi: 10.1128/JB.180.1.10-19.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Green HA, Donohue TJ. 2006. Activity of Rhodobacter sphaeroides RpoHII, a second member of the heat shock sigma factor family. J Bacteriol 188:5712–5721. doi: 10.1128/JB.00405-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Glaeser J, Nuss AM, Berghoff BA, Klug G. 2011. Singlet oxygen stress in microorganisms. Adv Microb Physiol 58:141–173. doi: 10.1016/B978-0-12-381043-4.00004-0. [DOI] [PubMed] [Google Scholar]
  • 69.Nuss AM, Glaeser J, Klug G. 2009. RpoH(II) activates oxidative-stress defense systems and is controlled by RpoE in the singlet oxygen-dependent response in Rhodobacter sphaeroides. J Bacteriol 191:220–230. doi: 10.1128/JB.00925-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Dufour YS, Imam S, Koo BM, Green HA, Donohue TJ. 2012. Convergence of the transcriptional responses to heat shock and singlet oxygen stresses. PLoS Genet 8:e1002929. doi: 10.1371/journal.pgen.1002929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Dominguez-Cuevas P, Gonzalez-Pastor JE, Marques S, Ramos JL, de Lorenzo V. 2006. Transcriptional tradeoff between metabolic and stress-response programs in Pseudomonas putida KT2440 cells exposed to toluene. J Biol Chem 281:11981–11991. doi: 10.1074/jbc.M509848200. [DOI] [PubMed] [Google Scholar]
  • 72.Fitzgerald DJ, Stratford M, Gasson MJ, Ueckert J, Bos A, Narbad A. 2004. Mode of antimicrobial action of vanillin against Escherichia coli, Lactobacillus plantarum and Listeria innocua. J Appl Microbiol 97:104–113. doi: 10.1111/j.1365-2672.2004.02275.x. [DOI] [PubMed] [Google Scholar]
  • 73.Seo JS, Keum YS, Li QX. 2009. Bacterial degradation of aromatic compounds. Int J Environ Res Public Health 6:278–309. doi: 10.3390/ijerph6010278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Gruber TM, Bryant DA. 1997. Molecular systematic studies of eubacteria, using sigma70-type sigma factors of group 1 and group 2. J Bacteriol 179:1734–1747. doi: 10.1128/jb.179.5.1734-1747.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Malakooti J, Ely B. 1995. Principal sigma subunit of the Caulobacter crescentus RNA polymerase. J Bacteriol 177:6854–6860. doi: 10.1128/jb.177.23.6854-6860.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Malakooti J, Wang SP, Ely B. 1995. A consensus promoter sequence for Caulobacter crescentus genes involved in biosynthetic and housekeeping functions. J Bacteriol 177:4372–4376. doi: 10.1128/jb.177.15.4372-4376.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Egland PG, Pelletier DA, Dispensa M, Gibson J, Harwood CS. 1997. A cluster of bacterial genes for anaerobic benzene ring biodegradation. Proc Natl Acad Sci U S A 94:6484–6489. doi: 10.1073/pnas.94.12.6484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Heyduk E, Heyduk T. 2014. Next generation sequencing-based parallel analysis of melting kinetics of 4096 variants of a bacterial promoter. Biochemistry 53:282–292. doi: 10.1021/bi401277w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Hook-Barnard IG, Hinton DM. 2007. Transcription initiation by mix and match elements: flexibility for polymerase binding to bacterial promoters. Gene Regul Syst Bio 1:275–293. [PMC free article] [PubMed] [Google Scholar]
  • 80.Moyle H, Waldburger C, Susskind MM. 1991. Hierarchies of base pair preferences in the P22 ant promoter. J Bacteriol 173:1944–1950. doi: 10.1128/jb.173.6.1944-1950.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Le PT, Pontarotti P, Raoult D. 2014. Alphaproteobacteria species as a source and target of lateral sequence transfers. Trends Microbiol 22:147–156. doi: 10.1016/j.tim.2013.12.006. [DOI] [PubMed] [Google Scholar]
  • 82.Kloesges T, Popa O, Martin W, Dagan T. 2011. Networks of gene sharing among 329 proteobacterial genomes reveal differences in lateral gene transfer frequency at different phylogenetic depths. Mol Biol Evol 28:1057–1074. doi: 10.1093/molbev/msq297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Dagan T, Artzy-Randrup Y, Martin W. 2008. Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution. Proc Natl Acad Sci U S A 105:10039–10044. doi: 10.1073/pnas.0800679105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Atsumi S, Higashide W, Liao JC. 2009. Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde. Nat Biotechnol 27:1177–1180. doi: 10.1038/nbt.1586. [DOI] [PubMed] [Google Scholar]
  • 85.Imam S, Noguera DR, Donohue TJ. 2013. Global insights into energetic and metabolic networks in Rhodobacter sphaeroides. BMC Syst Biol 7:89. doi: 10.1186/1752-0509-7-89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Kontur WS, Ziegelhoffer EC, Spero MA, Imam S, Noguera DR, Donohue TJ. 2011. Pathways involved in reductant distribution during photobiological H(2) production by Rhodobacter sphaeroides. Appl Environ Microbiol 77:7425–7429. doi: 10.1128/AEM.05273-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Yilmaz LS, Kontur WS, Sanders AP, Sohmen U, Donohue TJ, Noguera DR. 2010. Electron partitioning during light- and nutrient-powered hydrogen production by Rhodobacter sphaeroides. Bioenerg Res 3:55–66. doi: 10.1007/s12155-009-9072-8. [DOI] [Google Scholar]
  • 88.Khatipov E, Miyake M, Miyake J, Asada Y. 1999. Polyhydroxybutyrate accumulation and hydrogen evolution by Rhodobacter sphaeroides as a function of nitrogen availability, p 157–161. In Zaborsky OR, Benemann JR, Matsunaga T, Miyake J, San Pietro A (ed), BioHydrogen. Springer Nature, Cham, Switzerland. [Google Scholar]
  • 89.Sayers EW, Beck J, Brister JR, Bolton EE, Canese K, Comeau DC, Funk K, Ketter A, Kim S, Kimchi A, Kitts PA, Kuznetsov A, Lathrop S, Lu Z, McGarvey K, Madden TL, Murphy TD, O'Leary N, Phan L, Schneider VA, Thibaud-Nissen F, Trawick BW, Pruitt KD, Ostell J. 2020. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 48:D9–D16. doi: 10.1093/nar/gkz899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Crooks GE, Hon G, Chandonia JM, Brenner SE. 2004. WebLogo: a sequence logo generator. Genome Res 14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Schneider TD. 2006. Twenty years of Delila and molecular information theory: the Altenberg-Austin Workshop in Theoretical Biology Biological Information, Beyond Metaphor: Causality, Explanation, and Unification Altenberg, Austria, 11–14 July 2002. Biol Theory 1:250–260. doi: 10.1162/biot.2006.1.3.250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Madeira F, Park YM, Lee J, Buso N, Gur T, Madhusoodanan N, Basutkar P, Tivey ARN, Potter SC, Finn RD, Lopez R. 2019. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res 47:W636–W641. doi: 10.1093/nar/gkz268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Sievers F, Higgins DG. 2014. Clustal omega. Curr Protoc Bioinformatics 48:3.13.1–3.13.16. [DOI] [PubMed] [Google Scholar]
  • 95.Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Soding J, Thompson JD, Higgins DG. 2011. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Henry KK, Ross W, Myers KS, Lemmer KC, Vera JM, Landick R, Donohue TJ, Gourse RL. 2020. A majority of Rhodobacter sphaeroides promoters lack a crucial RNA polymerase recognition feature, enabling coordinated transcription activation. Proc Natl Acad Sci U S A 117:29658–29668. doi: 10.1073/pnas.2010087117. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

TABLE S1

Summary of MEME-identified predicted σ70-dependent promoters upstream of TSSs. Download Table S1, XLSX file, 0.7 MB (732.2KB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S2

Summary of Delila-PY-identified predicted σ70-dependent promoters upstream of TSSs. Download Table S2, XLSX file, 1.1 MB (1.1MB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S3

Functional group enrichment for products of genes downstream of predicted −7T σ70-dependent promoters and −7A/C/G σ70-dependent promoters. Download Table S3, XLSX file, 0.07 MB (74.1KB, xlsx) .

Copyright © 2021 Myers et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES