Transcription closed and open complex formation coordinate expression of genes with a shared promoter region

Antti Häkkinen; Samuel M D Oliveira; Ramakanth Neeli-Venkata; Andre S Ribeiro

doi:10.1098/rsif.2019.0507

. 2019 Dec 11;16(161):20190507. doi: 10.1098/rsif.2019.0507

Transcription closed and open complex formation coordinate expression of genes with a shared promoter region

Antti Häkkinen ^1,^✉, Samuel M D Oliveira ¹, Ramakanth Neeli-Venkata ¹, Andre S Ribeiro ¹

PMCID: PMC6936044 PMID: 31822223

Abstract

Many genes are spaced closely, allowing coordination without explicit control through shared regulatory elements and molecular interactions. We study the dynamics of a stochastic model of a gene-pair in a head-to-head configuration, sharing promoter elements, which accounts for the rate-limiting steps in transcription initiation. We find that only in specific regions of the parameter space of the rate-limiting steps is orderly coexpression exhibited, suggesting that successful cooperation between closely spaced genes requires the coevolution of compatible rate-limiting step configuration. The model predictions are validated using in vivo single-cell, single-RNA measurements of the dynamics of pairs of genes sharing promoter elements. Our results suggest that, in E. coli, the kinetics of the rate-limiting steps in active transcription can play a central role in shaping the dynamics of gene-pairs sharing promoter elements.

Keywords: transcription, gene expression noise, bidirectional promoter

1. Introduction

Closely spaced gene-pairs abound in genomes of all life forms, from human [1,2] to prokaryotes [3,4]. Furthermore, they are highly conserved [2,5], suggesting that they provide functionality that is selectively advantageous.

Gene-pairs can be arranged head-to-head (transcriptionally divergent), with their transcription start sites (TSS) closely located, sharing promoter elements such as transcription factor binding sites [1,2,4]. Head-to-tail (tandem) and tail-to-tail (convergent) overlapping gene-pairs are also found, allowing interference between RNA polymerases (RNAP) [6] and/or with transcription factors [7,8]. Each configuration can vary in several parameters, such as distance between TSSs, which affect transcription of the component genes [4,9–11], allowing co-regulation without explicit control mechanisms. The multitude of naturally occurring configurations suggests that each configuration possesses distinct selective advantages.

While some configurations have been identified and their ubiquity established by models and measurements [2,11–13], the range of possible behaviours and advantages as a gene regulation mechanism remain largely uncharacterized. Such characterization would benefit understanding the array of tasks that organisms such as Escherichia coli perform using closely spaced promoters, as opposed to using individual genes or genes connected by transcription factors.

Much research has been conducted on the global co-expression patterns of closely located genes, particularly in eukaryotes. This has resulted in the accumulation of evidence for the existence of various, complementary mechanisms at play [14–18]. For instance, Limi et al. reported that two highly expressed genes, Cryba4 and Crybb1, can be simultaneously transcribed from adjacent bidirectional promoters in humans, despite their close proximity [14]. Meanwhile, Behjati et al. found that bidirectional promoters employ three major architectures across the human genome, varying in their DNA accessibility, histone modifications, DNA methylation and transcription factor binding profiles [15]. Direct regulatory interference between spatially close genes has also been reported in humans, and it has been associated with reduced gene expression noise [16]. Finally, Scruggs et al. established that the distance between the promoters can be used as a regulatory mechanism of the degree of interaction between their dynamics [17].

However, some of these findings may not apply to bacterial genomes, due to various structural differences. For example, eukaryotic genomes are an order of magnitude larger and they are partitioned into linear chromosomes and confined to the nucleus. Meanwhile, bacterial genomes are singular, circular and bacteria lack membrane-bound nuclei [19,20]. Furthermore, eukaryotic DNA is packed around histones, while prokaryotic DNA is compressed by supercoiling [20–22], which is expected to cause major differences into how the cells of these two domains access their DNA, for both replication as well as transcription, which may explain why the dynamics of these processes differ so widely between them (e.g. DNA replication is two orders of magnitude slower in prokaryotes [19]). Consequently, we expect the regulatory mechanisms and the dynamics of closely spaced promoters to differ significantly between eukaryotes and prokaryotes, which may alter which mechanism and whether co-expression or interference has a dominant role in each domain.

One aspect that remains unexplored with respect to the bidirectional promoters is the existence of multiple rate-limiting steps during transcription initiation [23,24]. As only some of these steps are physically involved in the gene-pair interactions, we expect the nature of the rate-limiting steps of each promoter to affect the dynamics of closely spaced configurations. Importantly, the durations of the open complex formation of a strong and a weak promoter can differ from little to up to two orders of magnitude [25] and live cell single-RNA measurements suggest that different promoters are rate-limited at different stages of transcription initiation [24,26–29]. As such, it is plausible that promoters whose initiation kinetics are similar in mean duration but whose rate-limiting step structures differ will feature different dynamics in the bidirectional configuration.

Here, we study the dynamics of a stochastic model of a gene-pair in a head-to-head configuration sharing promoter elements (the most common closely spaced gene-pair configuration [2,5]) as a function of the rate-limiting step configuration of each gene. We analyse the models using analytical stochastic methods. Next, we validate the main findings by performing time-lapse microscopy measurements of individual genes and in pairs of genes sharing promoter elements, at the single-RNA level, in live E. coli.

2. Methods

2.1. Models

Transcription in E. coli starts when an RNAP, recruiting the appropriate σ-factor, specifically binds to a promoter. This creates a closed complex of the RNAP and DNA, which can require several trials before stabilizing [30]. In strong promoters, this step is nearly irreversible [31]. The virtually irreversible open complex formation follows, consisting, of e.g. DNA unwinding and compaction [32] and the RNAP clamp assembly [33].

We assume a variant of a model of transcription initiation of the overlapping promoters of the galactose operon in the absence of cAMP-CRP [3]. The transcribed promoter is stochastically selected based on the relative affinities between the two promoters and the RNAP, encoded in the forward rates of the closed complex formation of each promoter. After the selection, the remaining steps of transcription initiation occur at the promoter region [23]. The following stochastic chemical [34,35] reactions are used to model this:

\begin{matrix} P_{0} \overset{k_{1}}{⟶} I_{1} \overset{k_{3}}{⟶} P_{0} + X \\ and & P_{0} \overset{k_{2}}{⟶} I_{2} \overset{k_{4}}{⟶} P_{0} + Y, \end{matrix}}

2.1

where P₀ represents a free promoter (unoccupied by an RNAP), I₁ and I₂ represent intermediate transcriptional complexes committed to transcribing genes 1 and 2, respectively, and X and Y represent the messenger RNA products (or, if they closely follow [36], proteins) of genes 1 and 2, respectively. A schematic is provided in figure 2a, and a thorough analysis in the electronic supplementary material.

Figure 2. — Model schematic and simulated examples. (a) Schematic of gene-pair in a head-to-head configuration: genes 1 and 2 produce RNAs X and Y, respectively. The shared promoter can be in three-states: free, or occupied for transcription initiation of gene 1 or 2. (b) Produced RNA numbers over time in a single Monte Carlo simulation. The dots denote the moments when RNAs were produced. (c) Distribution of intervals between consecutive productions of X in 10 000 simulations. The parameter values are (k₁, k₂, k₃, k₄) = (1, 1, 1, 1). Here, τ_X and ${τ_{X}}^{(1)}$ have a mean (variance) of 3 (7) and 2 (2), respectively. (Online version in colour.)

If the genes did not share promoter elements, the intervals between productions of X (gene 1) would be [29]:

\begin{matrix} τ_{X}^{(1)} E^{-} (k_{1}, k_{3}) \\ with & E [τ_{X}^{(1)}] = k_{1}^{- 1} + k_{3}^{- 1} \\ V ar [τ_{X}^{(1)}] = k_{1}^{- 2} + k_{3}^{- 2}, \end{matrix}}

2.2

where $E^{-} (λ_{1}, \dots, λ_{n})$ represents a hypoexponential distribution (i.e. a sum of exponential distributions) with rates λ₁, …, λ_n. Similarly, the production intervals for gene 2 would be ${τ_{Y}}^{(1)} \sim E^{-} (k_{2}, k_{4})$ .

These distributions are of low noise, as measured by the coefficient of variation (standard deviation over the mean), as this quantity equals unity for Poissonian production (exponential production intervals). More specifically, the noise is determined by the ratio of k₁ and k₃. Regardless of the mean, it is minimized for steps of equal duration and maximized when a single step is rate-limiting. The dynamics of an individual gene is unaffected by the step order (i.e. interchanging k₁ and k₃ has no effect on ${τ_{X}}^{(1)}$ ).

Regardless of the configuration, the mean and variance of the production intervals are linked to that of the produced RNAs. In the long-term (infinite time), the mean and variance of produced RNA per unit time are [37]:

\begin{aligned} μ_{Z} & ≐ lim_{t \to \infty} \frac{E [Z (t)]}{t} = {E [τ_{Z}]}^{- 1} \\ and η_{Z} μ_{Z} & ≐ lim_{t \to \infty} \frac{V ar [Z (t)]}{t} = V ar [τ_{Z}] {E [τ_{Z}]}^{- 3}, \end{aligned}}

2.3

i.e. the mean number of RNAs produced per unit time (μ_Z) equals the inverse interval mean, while the Fano factor (variance over the mean) of the RNA numbers (η_Z) equals the squared coefficient of variation of the production intervals. The cell phenotype is also affected by other processes, such as RNA degradation and dilution due to cell division. Regardless, the mean and noise of the produced RNA numbers are directly linked to the phenotype (details in electronic supplementary material) [38], so we expect our results to hold qualitatively in the presence of other processes.

2.2. Cells, plasmids, chemicals and growth conditions

We used E. coli strain BW25113 (lacI⁺ rrnB_T14 ΔlacZ_WJ16 hsdR514 ΔaraBAD_AH33 ΔrhaBAD_LD78) [39], which contains the constitutive promoters P_lacI+ and P_araC producing, respectively, LacI repressors [40] and AraC repressors. As this strain does not contain the tetR gene responsible for encoding TetR repressors, any gene downstream to a P_tetA promoter is expressed constitutively.

We constructed five target systems on a single-copy pBELO plasmid. The first plasmid features the P_lacO3O1 promoter controlling the production of an RNA molecule coding for a red fluorescent mCherry protein followed by 48 binding sites for the MS2-GFP protein (mCherry-48BS). The other four systems are modified versions of the first, with the P_lacO3O1 promoter being replaced by the following promoters: (i) P_BAD promoter; (ii) P_lacO3O1-tetA dual-tandem promoter; (iii) P_lacO3O1-BAD dual-tandem promoter; and (iv) P_{lacO3O1-lacO3O1} dual-bidirectional promoter. All strains aside from its target system also contain either a medium-copy plasmid pZA25 with the reporter gene P_ara-MS2-GFP or a low-copy plasmid pZS12 with the reporter gene P_lac-MS2-GFP. These plasmids are responsible for producing the fusion protein MS2-GFP, both producing an abundance of MS2-GFP when activated as detailed below. The reporter plasmids were generously provided by Orna Amster-Choder (Hebrew University of Jerusalem, Israel) [41], and Philippe Cluzel (Harvard University, USA) [42], respectively. The activity of the promoters P_lacO3O1, P_lacO3O1-tetA and P_{lacO3O1-lacO3O1} is regulated by the repressor LacI and the inducer isopropyl β-d-1-thiogalactopyranoside (IPTG). Meanwhile, the activity of P_BAD is regulated by the repressor AraC and the inducer l-arabinose. Finally, the activity of P_lacO3O1-BAD is regulated by both repressors (LacI and AraC) and both inducers (IPTG and l-arabinose).

Cells were grown overnight in lysogeny broth (LB) medium supplemented with appropriate antibiotics (34 μg ml⁻¹ of chloramphenicol, 50 μg ml⁻¹ of ampicillin, and 50 μg ml⁻¹ of kanamycin) with shaking at 250 r.p.m. We made subcultures, by diluting the stationary-phase culture into fresh M9 medium supplemented with glycerol (0.4% final concentration) and the appropriate antibiotics. Cells were left in the incubator until reaching OD₆₀₀ of about 0.25. For the pZA25-P_ara-MS2-GFP reporter plasmid activation, 0.4% of l-arabinose was added to the culture, which was then incubated at 37°C for 60 min. Cells containing the pZS12-P_lac-MS2-GFP reporter plasmid were incubated in the same way and were activated with 1 mM IPTG. Next, for the activation of P_lacO3O1, P_lacO3O1-tetA and P_{lacO3O1-lacO3O1} target plasmids, specific concentrations of IPTG (either 5 μM or 1 mM) were added to the culture. For activating the P_BAD or P_lacO3O1-BAD target plasmids, 0.1% of l-arabinose was added. For the latter, similar concentrations of IPTG (5 μM or 1 mM) were added as well. Inducer-activated cells were then left in the incubator for 90 min, prior to microscopy observation.

2.3. Microscopy and image analysis

Cells were visualized using a Nikon Eclipse (Ti-E, Nikon) inverted microscope equipped with a 100 × Apo TIRF (1.49 NA, oil) objective. Cells and fluorescent spots within were imaged by highly inclined and laminated optical sheet (HILO) microscopy, using an EMCCD camera (iXon3 897, Andor Technology), a 488 nm argon laser (Melles-Griot), and an emission filter (HQ514/30, Nikon). Phase-contrast images were acquired by a CCD camera (DS-Fi2, Nikon). The software for image acquisition was NIS-Elements (Nikon, Japan). An example of each channel is shown in figure 1.

Figure 1. — Example images of live *E. coli* expressing GFP-tagged RNAs. (a) Phase contrast image of the live *E. coli* with the P_lacO3O1-tetA construct taken after 1 h of induction with 1 mM IPTG induction at 37°C. (b) HILO image visualizing the abundant GFP inside the same *E. coli* cells and the target RNA bound by an array of GFPs appearing as bright spots.

We performed time-lapse fluorescence and phase-contrast imaging of the cells (the latter for cell segmentation and lineage construction). For this, 8 μl of cells were placed on a microscope slide between a coverslip and a M9 glycerol agarose gel pad. During image acquisition, cells were constantly supplied with fresh media containing IPTG and l-arabinose, at the same concentration as when in liquid culture, by a micro-perfusion peristaltic pump (Bioptechs) at 0.3 ml min⁻¹. Images were captured for 5 h, once per minute in the case of fluorescence and once per 5 min in the case of phase-contrast. During image acquisition, cells were kept in a temperature-controlled chamber (FCS2, Bioptechs) at optimal temperature (37°C).

Time-series microscopy images were processed as in [43] by, first, aligning consecutive images so as to maximize the cross-correlation of fluorescence intensities. Next, we annotated manually the region occupied by each cell in the time series. Afterwards, the location, dimension and orientation of each cell in each frame is obtained by principal component analysis, assuming that fluorescence inside the cell is uniform [44]. Cell lineages were then extracted using CellAging, based on overlapping areas in consecutive frames [44]. Next, the intensity of each cell is fit with a surface (quadratic polynomial of the distance from the cell border) in least-deviations sense [45]. This surface represents the cellular background intensity which is subtracted to obtain the foreground intensity. Next, the foreground intensity is fit with a set of Gaussian surfaces, in least-deviations sense, with decreasing heights until the heights are in the 99% confidence interval of the background noise (estimated assuming a normal distribution and using median absolute deviation) [45]. The Gaussians represent fluorescent RNA spots, and the volume under each represent the total spot intensity. Finally, as MS2-GFP-tagged RNA lifetimes are much longer than cell division times [46], the cellular foreground intensity will be an increasing curve, with each jump corresponding to the appearance of a novel tagged RNA. The moments when a jump occurs are estimated using a specialized curve fitting algorithm [27]. The intervals between jumps in individual cells correspond to time intervals between consecutive RNA production events.

3. Results and discussion

3.1. Analytical distributions of production time intervals

From the perspective of the production kinetics of X alone, the reaction system of equation (2.1) is equivalent to:

I_{2} ⇌_{k_{2}}^{k_{4}} P_{0} \overset{k_{1}}{⟶} I_{1} \overset{k_{3}}{⟶} P_{0} + X

3.1

which is potentially a highly noisy process [29,47]. While the expression of gene 1 might not be noisy on its own, its expression is perturbed by the transcription machinery occupying the shared promoter region for expression of gene 2, introducing (random) temporal gaps in the expression.

Let $G (\cdot)$ denote the distribution of consecutive productions of X in equation (3.1). The mean and variance of the time intervals between the productions of X are given in [29]:

\begin{aligned} τ_{X} & \sim G (k_{4}, k_{2}, k_{1}, k_{3}), \\ E [τ_{X}] & = (1 + \frac{k_{2}}{k_{4}}) {k_{1}}^{- 1} + {k_{3}}^{- 1} \\ and V ar [τ_{X}] & = ({(1 + \frac{k_{2}}{k_{4}})}^{2} + 2 \frac{k_{1}}{k_{4}} \frac{k_{2}}{k_{4}}) {k_{1}}^{- 2} + {k_{3}}^{- 2}, \end{aligned}}

3.2

while, due to the symmetry of the model, the production intervals of Y are $τ_{Y} \sim G (k_{3}, k_{1}, k_{2}, k_{4})$ .

By comparing equation (2.2) with equation (3.2) we find that, regardless of the parameters, in a bidirectional configuration, the mean and variance of the time intervals between RNA productions of each gene are increased. Consequently, while ${τ_{X}}^{(1)}$ is always sub-Poissonian, τ_X can exhibit either sub- or super-Poissonian behaviour.

RNA production according to the model is exemplified in figure 2b, and the expected interval distribution in figure 2c. While the production intervals of each gene are often somewhat regular, as indicated by the bulk of the distribution, large outliers are present due to the temporal gaps, which coincide with the transcriptional activity of the other gene (figure 2b).

As the marginals shown in equation (3.2) fail to capture the co-expression of the two genes, further analysis is necessary. The time between consecutive productions by either gene, i.e. a jump in X(t) + Y(t), is (detailed in the electronic supplementary material):

\begin{aligned} τ_{X + Y} & \sim E (k_{1} + k_{2}) + E^{+} (\frac{k_{1}}{k_{1} + k_{2}}, k_{3}, k_{4}), \\ E [τ_{X + Y}] & = (1 + \frac{k_{1}}{k_{3}} + \frac{k_{2}}{k_{4}}) {(k_{1} + k_{2})}^{- 1} \\ and V ar [τ_{X + Y}] & = (1 + {(\frac{k_{1}}{k_{3}} - \frac{k_{2}}{k_{4}})}^{2} + 2 \frac{k_{1}}{k_{3}} \frac{k_{2}}{k_{3}} + 2 \frac{k_{1}}{k_{4}} \frac{k_{2}}{k_{4}}) {(k_{1} + k_{2})}^{- 2}, \end{aligned}}

3.3

where $E (λ)$ is an exponential distribution with rate λ and $E^{+} (p_{1}, \dots, p_{n - 1}, λ_{1}, \dots, λ_{n})$ is a hyperexponential distribution with mixing probabilities p₁, …, p_n and rates λ₁, …, λ_n. Again, this distribution can feature either sub- or super-poissonian behaviour, depending on its parameter values. By combining equations (2.3), (3.2), and (3.3), one can determine the asymptotic covariance and the (Pearson) correlation ρ_XY between the produced RNA numbers X(t) and Y(t) (detailed in the electronic supplementary material).

3.2. Noise and correlation in the transcription kinetics of genes in a head-to-head configuration

Based on the above, we first analysed how the noise and correlation in the transcription kinetics of a head-to-head configuration depends on the dynamics of the individual genes. For this, the parameterization λ, q₁₂, q₁₃, q₂₄ was found to be insightful. Here, $λ ≐ μ_{X + Y}$ is a timescale parameter (mean total production rate) and $q_{i j} ≐ k_{i} / k_{j}$ denote ratios of rates of two reactions. Furthermore, q₁₂ controls the bias, i.e. the expression ratio of each gene: for large (small) q₁₂, gene 1 (gene 2) is expressed more frequently. Finally, q₁₃ and q₂₄ control the relative durations of closed and open complex formation, which equal 1/(1 + q₁₃) and q₁₃/(1 + q₁₃), respectively, for gene 1. Specifically, if q₁₃ > 1 (q₁₃ < 1), then k₁ > k₃ and the gene is limited at the open (closed) complex formation.

The mean RNA numbers are controlled by the bias and the scale: μ_X = λ⁻¹ q₁₂/(1 + q₁₂) and μ_Y = λ⁻¹/(1 + q₁₂). As such, the stage at which the transcription kinetics of each gene is rate-limited does not affect the mean number of produced RNAs. Meanwhile, the noise and correlation exhibit complex behaviour, which can be divided into a few regions. The regions and their properties are shown in table 1 and electronic supplementary material, table S1. The noise of each gene and the correlation coefficient are shown in figures 3 and 4a, and their analytical forms in the electronic supplementary material.

Table 1.

Noise and correlation in RNA production kinetics in the different regions of the parameter space of head-to-head configuration. Here, ∼1⁻ (∼1⁺) denotes weakly sub- (super-) Poissonian behaviour (noise of about 1), while ∼1 denotes that both behaviours are possible. Finally, <1* indicates that <1 holds at least for one of the genes, possibly for both.

region	condition	noise η_X	noise η_Y	correlation ρ_XY
A	q₂₄ > 1, q₂₄ > q₁₃	>1	∼1⁻	>0
A	q₁₃ > q₂₄, q₁₃ > 1	∼1⁻	>1	>0
B	q₁₃, q₂₄ < 1	∼1	∼1	∼0⁻
C	q₁₃ ∼ q₂₄ > 1	<1*	<1*	<0
D	q₁₃ ∼ 1, q₂₄ < 1	<1	∼1⁺	<0
D	q₁₃ < 1, q₂₄ ∼ 1	∼1⁺	<1	<0
E	q₁₃ ∼ q₂₄ ∼ 1	<1*	<1*	<0

Open in a new tab

Figure 3. — Noise in the RNA production in a head-to-head configuration as a function of their relative durations of closed and open complex formations. (a) Gene 1 and (b) gene 2. The black curves denote unity, and q₁₂ = 2. (Online version in colour.)

Figure 4. — Correlation and total noise (tandem configuration) as a function of the relative durations of closed and open complex formation. (a) Correlation between the RNA production kinetics of two head-to-head genes. (b) Noise of RNA production of a gene with two initiation sites. The black curves denote zero or unity, and q₁₂ = 2. (Online version in colour.)

Region A: for q₂₄ > 1, q₂₄ > q₁₃, the expression of gene 2 is most limited at the open complex formation, while that of gene 1 is more symmetric. As such, the promoter region is mostly occupied, and gene 1 must express either fast or rarely. In the former case, there is a burst of production of proteins X after each Y, so the expression of the two genes is positively correlated, and while gene 2 is Poissonian, solely controlled by its open complex formation process, gene 1 is highly noisy as the geometric burst of RNA is separated by the gaps created by the other gene. In the latter case, the expression of gene 1 is controlled by uniform random productions and the correlation vanishes. Specifically, in the latter case, the noise of gene 1 is 1 + 2 q₁₂ (super-Poissonian) and gene 2 is Poissonian. The correlation for large q₁₂ is $\sqrt{1 / 2}$ , which is maximal for the configuration, while for small q₁₂ the correlation vanishes. The part q₁₃ > q₂₄, q₁₃ > 1 is symmetric. Note that the bias q₁₂ controls the upper bounds for noise and correlation.

Region B: here, both genes are limited at the closed complex formation. Thus, the promoter region is rarely occupied, as the expression is limited by an RNAP finding the gene and initiating transcription. This causes the expression of both genes to be Poissonian, as each is limited by a single step, and uncorrelated, as their activities do not interfere at the promoter region.

Region C: both genes are limited at the open complex formation, which makes them to alternate in occupying the shared promoter region. The noise is set by the bias q₁₂, which determines the gene more disturbed by the activity of the other. Specifically, the noise of gene 1 equals 1/2 + q₁₂/2 and the noise of gene 2 equals $1 / 2 + {q_{12}}^{- 1} / 2$ . As the genes inhibit each other by competing for the shared promoter region, the expression patterns are anticorrelated.

Region D: for q₁₃ ∼ 1, q₂₄ < 1, gene 2 is limited during the closed complex formation, so it does not block the shared promoter area. Meanwhile, gene 1 is limited at both stages, making its RNA production to be sub-Poissonian. The expression of gene 2, originally Poissonian, becomes affected by periods of inactivity as gene 1 employs the promoter, increasing the noise, as controlled by the bias, yielding noise of $1 + {q_{12}}^{- 1} / 2$ . The correlation is negative, as gene 1 inhibits the expression of gene 2. The part q₁₃ < 1, q₂₄ ∼ 1 is symmetric.

Region E: both genes have similar closed and open complex formation durations, resulting in low noise in a non-bidirectional configuration. If their closed complex formation durations are similar (i.e. q₁₂ ∼ 1), both genes are of low noise (∼7/9) and their expression is anticorrelated (∼ − 2/7), as they alternate in activity. Otherwise, one is of low noise (∼5/9), unaffected by the configuration, while the other is of high noise, with its expression being disturbed by the frequent gaps caused by the other. Specifically, the noise is 5/9 + 2 q₁₂/9 for gene 1 and $5 / 9 + 2 {q_{12}}^{- 1} / 9$ for gene 2. The correlation is negative, with a maximum of − 2/7 at q₁₂ = 1, and minima of $- 1 / \sqrt{10}$ at q₁₂ → 0 and q₁₂ → ∞.

In summary, for coupled gene activity, one (or both) genes must not be limited at the closed complex formation alone. When coupled, both genes are low noise only if both feature similar relative closed-to-open complex durations. In this case, their expression is likely anticorrelated, which may have further implications if the genes are involved in regulating the same or complementary processes. If the relative closed-to-open complex durations differ, one is of high noise and the other of low noise, while their expression is, surprisingly, positively correlated. While our analysis lacks processes following the transcription initiation, the presence of e.g. first-order degradation pulls the noise toward unity and the correlation toward zero, leaving the conclusions qualitative useful.

3.3. Noise in a gene with two initiation sites

Next, we consider the dynamics of a common RNA product controlled by a promoter with two TSSs (see electronic supplementary material, figure S1c). This is common in E. coli [48] and more so in, e.g. plant mitochondria [49]. The configuration is readily accommodated by our model, by considering the dynamics of X + Y. As the mean and variance of X + Y follow the mean and covariance of (X, Y), the results can be derived from those obtained in the previous section.

Figure 4b shows the noise for X + Y, representing the RNAs produced through either TSS. The noise is low only if both TSSs exhibit production dynamics with low noise, i.e. in the regions C, D or E. Compared to individual TSSs, the RNA number fluctuations are lower, being suppressed by the negative correlation. If one TSS exhibits highly noisy production (region A), the RNA numbers become highly noisy, regardless of the dynamics of the other TSS. Finally, in region B, the production is exponential-like, as multiple TSSs only increase the RNAP to promoter binding affinity, which makes their dynamics indiscernible from that of a single TSS. This suggests that low noise tandem architectures exert selection pressure on both promoter components.

3.4. Model predictions for empirical validation

To validate our predictions, we observed transcription in live E. coli at the single-RNA level in various constructs. Three of the constructs feature synthetic genes whose production is controlled by a single promoter (specifically P_lacO3O1, P_tetA and P_BAD (see electronic supplementary material, figure S1a). The remaining constructs feature pairs of genes sharing promoter elements. One of these constructs is P_{lacO3O1-lacO3O1}, with overlapping lacO3O1 promoters in the opposite strands (see electronic supplementary material, figure S1b), with the reporter being on a single side. In the other two constructs, the expression is controlled by a P_lacO3O1-tetA or a P_lacO3O1-BAD dual-tandem promoters (see electronic supplementary material, figure S1c). In all these, the expression of the lacO3O1 promoter is modulated by the IPTG concentration, an inducer for the lac promoter [50]. Meanwhile, aTc concentration is held constant at 15 ng ml^{− 1}, in order to trigger full expression of the tetA promoter. Similarly, l-arabinose concentration is held constant at 0.1%, to trigger full expression of the BAD promoter. In all cases, RNA production dynamics was measured by time-lapse microscopy imaging using MS2-GFP tagging (see methods).

Using our models, we aim to predict the behaviour of the gene pairs sharing promoter elements, given knowledge of the behaviour of the constituent genes not involved in gene pair interactions (i.e. operating as isolated promoters). More specifically, we test whether, from the measured dynamics of RNA production of P_lacO3O1, P_tetA [26] and P_BAD, one can predict the kinetics of P_{lacO3O1-lacO3O1}, P_lacO3O1-tetA and P_lacO3O1-BAD.

For this, we first extracted the number of RNAs in each cell in the first and the last frame of the time series for all the constructs in each condition (figure 5b,c, respectively). These data were used to estimate the mean and standard deviation the production intervals, and the most likely (maximum-likelihood fit) model of electronic supplementary material, equation (S1) for the single promoters, through the relations of electronic supplementary material, equation (S6). The estimated intervals are shown in table 2, along with the model parameters of electronic supplementary material, equation (S1) where applicable. A Wald test testing for a specific mean and standard deviation was used to compute a p-value to confirm that the model predicts the mean and variance of the RNA distributions. The measurement data and the resulting model fits are exemplified for the P_lacO3O1-tetA construct induced with 1000 μM of IPTG in figure 5. We also extracted the intervals from the full time series for several of the constructs (about 120 frames, one every minute) to verify that the production time intervals can be correctly estimated from the RNA distributions (see electronic supplementary material, table S2 and figure 5a). The RNA counts and the time intervals are given in electronic supplementary material, file S2 and S3, respectively, and constitute the raw measurement data after the image analysis, as opposed to the model fits shown in tables 2–4.

Figure 5. — Measured production time intervals and RNA distributions in the first and last frame of the time series along with model predictions for the P_{lacO3O1 - tetA} construct induced with 1000 μM of IPTG. (a) Production time interval distribution in 39 cells and the corresponding model predictions using a model with interactions during transcription initiation (model; see table 3) or a model with no interactions (null model; see table 4), (b) RNA distribution in 65 cells 1 min after induction (these data are used only for predicting the subsequent RNA distribution in (c)), and (c) RNA distribution in 115 cells 146 min after induction, along with the model predictions. (Online version in colour.)

Table 2.

Estimated RNA production intervals for each of the promoter constructs. The table shows the promoter, induction, estimated paramater of model electronic supplementary material, equation (S1) for the single promoters, the estimated mean, standard deviation (s.d.), and noise (coefficient of variation) of the RNA production intervals, and the p-value of the test of model versus data.

promoter	IPTG (μM)	${k_{1}}^{- 1}$ (s)	${k_{3}}^{- 1}$ (s)	mean (s)	s.d. (s)	noise	p-value
lacO3O1	1000	362.3	737.3	1099.6	821.5	0.558	0.446
lacO3O1	5	25.8	1236.8	1262.6	1237.0	0.960	0.273
tetA	—	287.4	385.5	672.9	480.9	0.511	0.075
BAD	—	1036.7	333.7	1370.4	1089.1	0.632	0.059
lacO3O1-tetA	1000	—	—	702.2	638.8	0.828	0.604
lacO3O1-tetA	5	—	—	1111.6	1089.1	0.960	0.164
lacO3O1-lacO3O1	1000	—	—	1659.3	1437.0	0.750	0.112
lacO3O1-lacO3O1	5	—	—	2205.9	2119.3	0.923	0.971
lacO3O1-BAD	1000	—	—	866.8	612.9	0.500	0.099
lacO3O1-BAD	5	—	—	1274.7	1248.9	0.960	0.698

Open in a new tab

Table 4.

Null models derived for the dual independent promoters from the individual promoter fits of table 2. The table shows the promoter/induction scheme, and the mean and standard deviation (s.d.) of the RNA production intervals assuming the null models, and the p-value of the test of model versus data for maximally (R = 2) and minimally (R = ∞) RNA polymerase starved null models.

promoter	IPTG (μM)	mean (s)	s.d. (s)	noise	p-val. R = 2	p-val. R = ∞
lacO3O1-lacO3O1	1000	1099.6	821.5	0.558	4.116 × 10⁻⁴	6.601 × 10⁻⁵
lacO3O1-lacO3O1	5	1262.6	1237.0	0.960	9.981 × 10⁻³	8.386 × 10⁻³
lacO3O1-tetA	1000	417.5	303.5	0.529	2.127 × 10⁻³	4.267 × 10⁻⁴
lacO3O1-tetA	5	439.0	358.5	0.667	7.289 × 10⁻⁵	5.049 × 10⁻⁶
lacO3O1-BAD	1000	610.1	468.9	0.591	6.325 × 10⁻⁶	1.456 × 10⁻⁷
lacO3O1-BAD	5	657.1	588.7	0.803	3.464 × 10^{− 3}	5.124 × 10^{− 4}

Open in a new tab

The results in table 2 indicate that changing IPTG concentration alters the noise of the lacO3O1 promoter in addition to changing its mean expression rate, which is expected to be due to changes in the open-to-closed complex duration ratio, and is in agreement with previous reports [28]. The p-values indicate that there is no evidence that any of the models fit the measurements poorly.

Next, using the above parameters (i.e. k₁ and k₃ in table 2), we constructed the models for the dual promoters through equations (3.2) and (3.3). The obtained models are shown in table 3. The predicted mean and standard deviation show an agreement with the measured behaviour of the dual promoter constructs, while the noise and correlation indicate that promoters operate at different regions of the open-to-closed complex ratio space (these values cannot be directly measured with our system, but can be inferred from the model fit). The results indicate that the model predicts the behaviour of the dual-promoter measurements well, and that the noise is modulated by the change in the coordination between the two promoters in the dual promoter construct.

Table 3.

Models derived for the dual promoters from the individual promoter fits of table 2 using the model with interactions during transcription initiation. The table shows the promoter/induction scheme, the mean and standard deviation (s.d.) of the RNA production intervals and the correlation between the RNA numbers assuming the derived models, and the p-value of the test of model versus data.

promoter	IPTG (μM)	mean (s)	s.d. (s)	noise	correlation	p-value
lacO3O1-lacO3O1	1000	1836.9	1685.2	0.842	−0.188	0.168
lacO3O1-lacO3O1	5	2499.3	2486.5	0.990	−0.010	0.931
lacO3O1-tetA	1000	701.4	616.1	0.772	−0.166	0.603
lacO3O1-tetA	5	1190.4	1212.9	1.038	+0.172	0.123
lacO3O1-BAD	1000	901.3	731.4	0.659	−0.070	0.141
lacO3O1-BAD	5	1240.0	1230.9	0.985	+0.105	0.620

Open in a new tab

As our methodology cannot identify which of the steps correspond to ${k_{1}}^{- 1}$ and ${k_{3}}^{- 1}$ in table 2, we also considered the alternative step ordering. The dual-promoter model fits had a p-values less than 3.964 × 10^{− 3} in for the lacO3O1-tetA construct at 5 μM IPTG, and p-values less than 4.614 × 10^{− 3} for the lacO3O1-BAD construct at 1000 μM IPTG, indicating that the alternatives are not likely for lacO3O1 at 5 μM and BAD. The step order for lacO3O1 at 1000 μM IPTG and tetA cannot be resolved from these data, but the alternatives result in a qualitatively similar dual-promoter models and p-values greater than 0.116. For the constructs containing these two promoters, we report the most likely models, all suggesting the order specified in table 2. These findings are also supported by prior evidence using a different methodology [28].

The fact that the measurements fall into the different regions of operation (figure 4 and table 1) is apparent in figures 6–8. Namely, the high IPTG condition falls into region E for the lacO3O1-lacO3O1 and lacO3O1-tetA, and into region D for the lacO3O1-BAD construct. At low IPTG, the lacO3O1-lacO3O1 transits into region C, as both promoters are modulated by the changes in the inducer concentration, while the lacO3O1-tetA and lacO3O1-BAD transit into (opposite directions) of region A. This explains the widely different noise levels in the estimated and measured intervals (table 2; electronic supplementary material, table S2), which are well predicted by our models in each case (compare with table 3).

Figure 7. — Predicted correlation between the RNA productions initiated by each start site of the dual promoter as a function of the relative durations of closed and open complex formation and the expression ratio of the promoters. The black curves denote zero and q₁₂ = 2 (figure 4a), and the markers the predictions for the measurements. (Online version in colour.)

Figure 6. — Noise of RNA production of one side of a dual promoter as a function of the relative durations of closed and open complex formation and the expression ratio of the promoters. The black curves denote unity and q₁₂ = 2 (figure 3), and the markers the predictions for the measurements, circles representing the validated ones (table 3). (Online version in colour.)

Figure 8. — Noise of RNA production of the dual promoter as a function of the relative durations of closed and open complex formation and the expression ratio of the promoters. The black curves denote unity and q₁₂ = 2 (figure 4b), and the markers the predictions for the measurements, circles representing the validated ones (tables 2 and 3). (Online version in colour.)

Finally, we verified that a model with no interactions between the two promoters would not explain the measurements. For this, we attempted to predict the mean, noise, and intervals in a dual-promoter measurement using independent expression from the constituent promoters (i.e. electronic supplementary material, equation (S4)) as predicted from the single-promoter measurements. The results in table 4 show that the associated model fails to explain the observed dual-promoter behaviour, and the apparent mismatch is exemplified in figure 5. Note that the models are also unaffected by the (k₁, k₃) identifiability problem. While the mean and noise of the system consisting of two independent promoters trivially follow from their independent components, the time intervals of the combined production do not. In particular, the intervals are not independent. We also considered the possibility that while the promoters might have interactions, their expression levels may be altered by the other promoter using the same finite pool of RNAP. For this, we assumed that the number of RNAP modulate the closed complex formation rate (i.e. $k_{1} = R {\tilde{k}}_{1}$ where ${\tilde{k}}_{1}$ is the per-polymerase closed complex formation rate, and R represents an RNA polymerase), which will cause a slight reduction of the closed complex formation rate, as determined by the closed to open-to-closed complex duration ratio of the other promoter. Any of these models (all R and all step orders) failed to explain the behaviour of our dual-promoter measurements as well. The effects are most extreme for R = 2, but we verified that models for other R have no better fit. Our model is recovered at R = 1 and the independent model without polymerases is recovered at R = ∞.

We conclude that our model of closely spaced promoters that assumes interactions between the promoters is the one that well predicts the measurements in each setting, for both the head-to-head and tandem constructs, while a model with no interactions cannot explain the observed measurements. Relevantly, our models reveal that the observed changes arise from changes in the coordination between the two coupled TSS of our synthetic constructs.

4. Conclusion

We analysed a stochastic model of two genes in a head-to-head configuration as a function of whether each gene is rate-limited during the closed and/or open complex formation. Compared to individual genes, in the bidirectional configuration, the transcription activity is slower and noisier in both genes, as each gene interferes with the activity of the other, allowing two genes with sub-Poissonian dynamics to exhibit super-Poissonian dynamics when coupled. Importantly, provided information on the kinetics of the constituent promoters when not sharing promoter elements, the models were shown to be able to predict well the behaviour of the pairs of the same genes when sharing promoter elements, implying that they capture accurately the effects of the complex interference caused by the shared promoter region.

We found that for such prediction to be accurate, the models have to account for the two-rate-limiting step kinetics of active transcription in E. coli. In particular, the time-length of such rate-limiting steps, namely the closed and open complex formations, controls not only the expression rate and noise of each gene (as in isolated genes, e.g. [24]), but also the kinetics of the temporal gaps caused by the transcription events of the opposite gene. This programs the behaviour intricately: a similar rate-limiting step structure combined with a rate-limiting open complex formation is required for both genes to feature low noise; otherwise, one tends to be highly noisy. Also, orderly systems tend to exhibit strong negative correlation, while the genes alternate expression, but the correlation can be lost or become positive if the open-to-closed complex formation time-lengths are incompatible. As such, not only the mean and variance of the durations of each stage but also the mechanistic underpinnings, affect the dynamics of closely-spaced gene-pairs, implying that promoters with seemingly identical dynamics in isolation may differ widely in their dynamics in a closely spaced configuration. Relevantly, as shown, the results generalize to the behaviour of individual genes with multiple transcription initiation sites.

Overall, these results suggest that, in E. coli, the kinetics of the rate-limiting steps in active transcription needs to be considered for dissecting the dynamics of pairs of genes sharing promoter elements. In this regard, we find it to be striking that pairs of closely spaced promoters, by tuning the kinetics of their closed and open complex formation (which are sequence dependent and, thus, evolvable) tunes the orderliness of the whole gene-pair. This new knowledge provides an important route to follow in the engineering of pairs of closely spaced promoters with desired dynamics and contributes to a better understanding of the dynamics of natural pairs of closely spaced genes and their potential role in the gene expression programs of E. coli.

Supplementary Material

Supplementary material

rsif20190507supp1.pdf^{(87.9KB, pdf)}

Supplementary Material

RNA number measurements

rsif20190507supp2.tsv^{(79.1KB, tsv)}

Supplementary Material

Production interval measurements

rsif20190507supp3.tsv^{(6.8KB, tsv)}

Data accessibility

The datasets supporting this article have been uploaded as part of the electronic supplementary material.

Authors' contributions

A.H. and A.S.R. conceived the study; A.H. devised the models and analysis; S.O. and R.N.-V. performed the experiments; A.H. and S.O. analysed the data; A.H. and A.S.R. drafted the manuscript. All authors contributed in writing the manuscript and have approved the final version.

Competing interests

The authors declare that they have no conflict of interest.

Funding

This work was supported by the Alfred Kordelin Foundation (A.H.); the Vilho, Yrjo and Kalle Vaisala Foundation (S.M.D.O.); the Tampere University of Technology President’s Graduate Programme (R.N.-V.); the Jane and Aatos Erkko Foundation (grant no. 610536) (A.S.R.); and Academy of Finland (grant nos. 295027 and 305342) (A.S.R.). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

References

1.Adachi N, Lieber MR. 2002. Bidirectional gene organization: a common architectural feature of the human genome. Cell 109, 807–809. ( 10.1016/S0092-8674(02)00758-4) [DOI] [PubMed] [Google Scholar]
2.Trinklein ND, Force Aldred S, Hartman SJ, Schroeder DI, Otillar RP, Myers RM. 2004. An abundance of bidirectional promoters in the human genome. Genome Res. 14, 62–66. ( 10.1101/gr.1982804) [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Herbert M, Kolb A, Buc H. 1986. Overlapping promoters and their control in Escherichia coli: the gal case. Proc. Natl Acad. Sci. USA 83, 2807–2811. ( 10.1073/pnas.83.9.2807) [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Moss Bendtsen K, Erdossy J, Csiszovszki Z, Lo Svenningsen S, Sneppen K, Krishna S, Semsey S. 2011. Direct and indirect effects in the regulation of overlapping promoters. Nucl. Acids Res. 39, 6879–6885. ( 10.1093/nar/gkr390) [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Korbel JO, Jensen LJ, von Mering C, Bork P. 2004. Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs. Nat. Biotechnol. 22, 911–917. ( 10.1038/nbt988) [DOI] [PubMed] [Google Scholar]
6.Prescott EM, Proudfoot NJ. 2002. Transcriptional collision between convergent genes in budding yeast. Proc. Natl Acad. Sci. USA 99, 8796–8801. ( 10.1073/pnas.132270899) [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Callen BP, Shearwin KE, Egan JB. 2004. Transcriptional interference between convergent promoters caused by elongation over the promoter. Mol. Cell 14, 647–656. ( 10.1016/j.molcel.2004.05.010) [DOI] [PubMed] [Google Scholar]
8.Palmer AC, Ahlgren-Berg A, Egan JB, Dodd IB, Shearwin KE. 2009. Potent transcriptional interference by pausing of RNA polymerases over a downstream promoter. Mol. Cell 34, 545–555. ( 10.1016/j.molcel.2009.04.018) [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Hakkinen A, Healy S, Jacobs HT, Ribeiro AS. 2011. Genome wide study of NF-Y type CCAAT boxes in unidirectional and bidirectional promoters in human and mouse. J. Theor. Biol. 281, 74–83. ( 10.1016/j.jtbi.2011.04.027) [DOI] [PubMed] [Google Scholar]
10.Zanotto E, Hakkinen A, Teku G, Shen B, Ribeiro AS, Jacobs HT. 2009. NF-Y influences directionality of transcription from the bidirectional Mrps12/Sarsm promoter in both mouse and human cells. BBA Gene Regul. Mech. 1789, 432–442. ( 10.1016/j.bbagrm.2009.05.001) [DOI] [PubMed] [Google Scholar]
11.Martins L, Makela J, Hakkinen A, Kandhavelu M, Yli-Harja O, Fonseca JM, Ribeiro AS. 2012. Dynamics of transcription of closely spaced promoters in Escherichia coli, one event at a time. J. Theor. Biol. 301, 83–94. ( 10.1016/j.jtbi.2012.02.015) [DOI] [PubMed] [Google Scholar]
12.Sneppen K, Dodd IB, Shearwin KE, Palmer AC, Schubert RA, Callen BP, Egan JB. 2005. A mathematical model for transcriptional interference by RNA polymerase traffic in Escherichia coli. J. Mol. Biol. 346, 399–409. ( 10.1016/j.jmb.2004.11.075) [DOI] [PubMed] [Google Scholar]
13.Yan C, Wu S, Pocetti C, Bai L. 2015. Regulation of cell-to-cell variability in divergent gene expression. Nat. Commun. 7, 11099 ( 10.1038/ncomms11099) [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Limi S, Zhao Y, Guo P, Lopez-Jones M, Zheng D, Singer RH, Skoultchi AI, Cvekl A. 2019. Bidirectional analysis of Cryba4-Crybb1 nascent transcription and nuclear accumulation of Crybb3 mRNAs in lens fibers. Invest. Ophthalmol. Vis. Sci. 60, 234–244. ( 10.1167/iovs.18-25921) [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Behjati Ardakani F. et al. 2018. Integrative analysis of single-cell expression data reveals distinct regulatory states in bidirectional promoters. Epigenet. Chromatin 11, 66 ( 10.1186/s13072-018-0236-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Kustatscher G, Grabowski P, Rappsilber J. 2017. Pervasive coexpression of spatially proximal genes is buffered at the protein level. Mol. Syst. Biol. 13, 937 ( 10.15252/msb.20177548) [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Scruggs BS, Gilchrist DA, Nechaev S, Muse GW, Burkholder A, Fargo DC, Adelman K. 2015. Bidirectional transcription arises from two distinct hubs of transcription factor binding and active chromatin. Mol. Cell 58, 1101–1112. ( 10.1016/j.molcel.2015.04.006) [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Wei W, Pelechano V, Jarvelin AI, Steinmetz LM. 2011. Functional consequences of bidirectional promoters. Trends Genet. 27, 267–276. ( 10.1016/j.tig.2011.04.002) [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Lewin B. 2007. Genes IX. Sudbury, MA: Jones and Bartlett. [Google Scholar]
20.Griswold A. 2008. Genome packaging in prokaryotes: the circular chromosome of E. coli. Nat. Education 1, 57. [Google Scholar]
21.Moshkin YM. 2015. Chromatin—a global buffer for eukaryotic gene control. AIMS Biophys. 2, 531–554. ( 10.3934/biophy.2015.4.531) [DOI] [Google Scholar]
22.Ammar R, Torti D, Tsui K, Gebbia M, Durbic T, Bader GD, Giaever G, Nislow C, Reinberg D. 2012. Chromatin is an ancient innovation conserved between Archaea and Eukarya. eLife 1, e00078 ( 10.7554/eLife.00078) [DOI] [PMC free article] [PubMed] [Google Scholar]
23.McClure WR. 1980. Rate-limiting steps in RNA chain initiation. Proc. Natl Acad. Sci. USA 77, 5634–5638. ( 10.1073/pnas.77.10.5634) [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Lloyd-Price J, Startceva S, Kandavalli V, Chandraseelan J, Goncalves N, Oliveira SMD, Hakkinen A, Ribeiro AS. 2016. Dissecting the stochastic transcription initiation process in live Escherichia coli. DNA Res. 23, 203–214. ( 10.1093/dnares/dsw009) [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Revyakin A, Ebright RH, Strick TR. 2004. Promoter unwinding and promoter clearance by RNA polymerase: detection by single-molecule DNA nanomanipulation. Proc. Natl Acad. Sci. USA 101, 4776–4780. ( 10.1073/pnas.0307241101) [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Muthukrishnan A-B, Kandhavelu M, Lloyd-Price J, Kudasov F, Chowdhury S, Yli-Harja O, Ribeiro AS. 2012. Dynamics of transcription driven by the tetA promoter, one event at a time, in live Escherichia coli cells. Nucl. Acids Res. 40, 8472–8483. ( 10.1093/nar/gks583) [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Hakkinen A, Ribeiro AS. 2015. Estimation of GFP-tagged RNA numbers from temporal fluorescence intensity data. Bioinformatics 31, 69–75. ( 10.1093/bioinformatics/btu592) [DOI] [PubMed] [Google Scholar]
28.Kandavalli VK, Tran H, Ribeiro AS. 2016. Effects of σ factor competition are promoter initiation kinetics dependent. BBA Gene Regul. Mech. 1859, 1281–1288. ( 10.1016/j.bbagrm.2016.07.011) [DOI] [PubMed] [Google Scholar]
29.Hakkinen A, Ribeiro AS. 2016. Characterizing rate limiting steps in transcription from RNA production times in live cells. Bioinformatics 32, 1346–1352. ( 10.1093/bioinformatics/btv744) [DOI] [PubMed] [Google Scholar]
30.Vvedenskaya IO, Vahedian-Movahed H, Zhang Y, Taylor DM, Ebright RH, Nickels BE. 2016. Interactions between RNA polymerase and the core recognition element are a determinant of transcription start site selection. Proc. Natl Acad. Sci. USA 113, E2899–E2905. ( 10.1073/pnas.1603271113) [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Record MT Jr., Reznikoff WS, Craig ML, McQuade KL, Schlax PJ. 1996. Escherichia coli RNA polymerase (Eσ⁷0), promoters and the kinetics of the steps of transcription initiation. In Escherichia coli and Salmonella typhimurium:cellular and molecular biology, 2nd edn (eds FC Neidhart, JL Ingraham, KB Low, B Magasanik, M Schaechter, HE Umbarger), pp. 792–820. Washington, DC: ASM Press.
32.Wang F, Greene EC. 2011. Single-molecule studies of transcription: from one RNA polymerase at a time to the gene expression profile of a cell. J. Mol. Biol. 412, 814–831. ( 10.1016/j.jmb.2011.01.024) [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Patrick M, Dennis PP, Ehrenberg M, Bremer H. 2015. Free RNA polymerase in Escherichia coli. Biochimie 119, 80–91. ( 10.1016/j.biochi.2015.10.015) [DOI] [PubMed] [Google Scholar]
34.McQuarrie DA. 1967. Stochastic approach to chemical kinetics. J. Appl. Probab. 4, 413–478. ( 10.2307/3212214) [DOI] [Google Scholar]
35.Gillespie DT. 2009. Stochastic simulation of chemical kinetics. Annu. Rev. Phys. Chem. 58, 35–55. ( 10.1146/annurev.physchem.58.032806.104637) [DOI] [PubMed] [Google Scholar]
36.Kaern M, Elston TC, Blake WJ, Collins JJ. 2005. Stochasticity in gene expression: from theories to phenotypes. Nat. Rev. Genet. 6, 451–464. ( 10.1038/nrg1615) [DOI] [PubMed] [Google Scholar]
37.Cox DR. 1962. Renewal theory. London, UK: Methuen. [Google Scholar]
38.Pedraza JM, Paulsson J. 2008. Effects of molecular memory and bursting on fluctuations in gene expression. Science 319, 339–343. ( 10.1126/science.1144331) [DOI] [PubMed] [Google Scholar]
39.Baba T. et al. 2006. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 1–11. ( 10.1038/msb4100050) [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Glascock CB, Weickert MJ. 1998. Using chromosomal lacIQ1 to control expression of genes on high-copy-number plasmids in Escherichia coli. Gene 223, 221–231. ( 10.1016/S0378-1119(98)00240-6) [DOI] [PubMed] [Google Scholar]
41.Nevo-Dinur K, Nussbaum-Shochat A, Ben-Yehuda S, Amster-Choder O. 2011. Translation-independent localization of mRNA in E. coli. Science 331, 1081–1084. ( 10.1126/science.1195691) [DOI] [PubMed] [Google Scholar]
42.Le TT, Harlepp S, Guet CC, Dittmar K, Emonet T, Pan T, Cluzel P. 2005. Real-time RNA profiling within a single bacterium. Proc. Natl Acad. Sci. USA 102, 9160–9164. ( 10.1073/pnas.0503311102) [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Oliveira SMD, Hakkinen A, Lloyd-Price J, Tran H, Kandavalli V, Ribeiro AS. 2016. Temperature-dependent model of multi-step transcription initiation in Escherichia coli based on live single-cell measurements. PLoS Comput. Biol. 12, e1005174 ( 10.1371/journal.pcbi.1005174) [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Hakkinen A, Muthukrishnan A-B, Mora A, Fonseca JM, Ribeiro AS. 2013. CellAging: a tool to study segregation and partitioning in division in cell lineages of Escherichia coli. Bioinformatics 29, 1708–1709. ( 10.1093/bioinformatics/btt194) [DOI] [PubMed] [Google Scholar]
45.Hakkinen A, Kandhavelu M, Garasto S, Ribeiro AS. 2014. Estimation of fluorescence-tagged RNA numbers from spot intensities. Bioinformatics 30, 1146–1153. ( 10.1093/bioinformatics/btt766) [DOI] [PubMed] [Google Scholar]
46.Golding I, Cox EC. 2004. RNA dynamics in live Escherichia coli cells. Proc. Natl Acad. Sci. USA 101, 11 310–11 315. ( 10.1073/pnas.0404443101) [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Peccoud J, Ycart B. 1995. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 48, 222–234. ( 10.1006/tpbi.1995.1027) [DOI] [Google Scholar]
48.Mendoza-Vargas A. et al. 2009. Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli. PLoS ONE 4, e7526 ( 10.1371/journal.pone.0007526) [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Tracy RL, Stern DB. 1995. Mitochondrial transcription initiation: promoter structures and RNA polymerases. Curr. Genet. 28, 205–216. ( 10.1007/BF00309779) [DOI] [PubMed] [Google Scholar]
50.Lutz R, Lozinski T, Ellinger T, Bujard H. 2001. Dissecting the functional program of Escherichia coli promoters: the combined mode of action of Lac repressor and AraC activator. Nucl. Acids Res. 29, 3873–3881. ( 10.1093/nar/29.18.3873) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

rsif20190507supp1.pdf^{(87.9KB, pdf)}

RNA number measurements

rsif20190507supp2.tsv^{(79.1KB, tsv)}

Production interval measurements

rsif20190507supp3.tsv^{(6.8KB, tsv)}

Data Availability Statement

The datasets supporting this article have been uploaded as part of the electronic supplementary material.

[RSIF20190507C1] 1.Adachi N, Lieber MR. 2002. Bidirectional gene organization: a common architectural feature of the human genome. Cell 109, 807–809. ( 10.1016/S0092-8674(02)00758-4) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C2] 2.Trinklein ND, Force Aldred S, Hartman SJ, Schroeder DI, Otillar RP, Myers RM. 2004. An abundance of bidirectional promoters in the human genome. Genome Res. 14, 62–66. ( 10.1101/gr.1982804) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C3] 3.Herbert M, Kolb A, Buc H. 1986. Overlapping promoters and their control in Escherichia coli: the gal case. Proc. Natl Acad. Sci. USA 83, 2807–2811. ( 10.1073/pnas.83.9.2807) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C4] 4.Moss Bendtsen K, Erdossy J, Csiszovszki Z, Lo Svenningsen S, Sneppen K, Krishna S, Semsey S. 2011. Direct and indirect effects in the regulation of overlapping promoters. Nucl. Acids Res. 39, 6879–6885. ( 10.1093/nar/gkr390) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C5] 5.Korbel JO, Jensen LJ, von Mering C, Bork P. 2004. Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs. Nat. Biotechnol. 22, 911–917. ( 10.1038/nbt988) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C6] 6.Prescott EM, Proudfoot NJ. 2002. Transcriptional collision between convergent genes in budding yeast. Proc. Natl Acad. Sci. USA 99, 8796–8801. ( 10.1073/pnas.132270899) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C7] 7.Callen BP, Shearwin KE, Egan JB. 2004. Transcriptional interference between convergent promoters caused by elongation over the promoter. Mol. Cell 14, 647–656. ( 10.1016/j.molcel.2004.05.010) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C8] 8.Palmer AC, Ahlgren-Berg A, Egan JB, Dodd IB, Shearwin KE. 2009. Potent transcriptional interference by pausing of RNA polymerases over a downstream promoter. Mol. Cell 34, 545–555. ( 10.1016/j.molcel.2009.04.018) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C9] 9.Hakkinen A, Healy S, Jacobs HT, Ribeiro AS. 2011. Genome wide study of NF-Y type CCAAT boxes in unidirectional and bidirectional promoters in human and mouse. J. Theor. Biol. 281, 74–83. ( 10.1016/j.jtbi.2011.04.027) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C10] 10.Zanotto E, Hakkinen A, Teku G, Shen B, Ribeiro AS, Jacobs HT. 2009. NF-Y influences directionality of transcription from the bidirectional Mrps12/Sarsm promoter in both mouse and human cells. BBA Gene Regul. Mech. 1789, 432–442. ( 10.1016/j.bbagrm.2009.05.001) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C11] 11.Martins L, Makela J, Hakkinen A, Kandhavelu M, Yli-Harja O, Fonseca JM, Ribeiro AS. 2012. Dynamics of transcription of closely spaced promoters in Escherichia coli, one event at a time. J. Theor. Biol. 301, 83–94. ( 10.1016/j.jtbi.2012.02.015) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C12] 12.Sneppen K, Dodd IB, Shearwin KE, Palmer AC, Schubert RA, Callen BP, Egan JB. 2005. A mathematical model for transcriptional interference by RNA polymerase traffic in Escherichia coli. J. Mol. Biol. 346, 399–409. ( 10.1016/j.jmb.2004.11.075) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C13] 13.Yan C, Wu S, Pocetti C, Bai L. 2015. Regulation of cell-to-cell variability in divergent gene expression. Nat. Commun. 7, 11099 ( 10.1038/ncomms11099) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C14] 14.Limi S, Zhao Y, Guo P, Lopez-Jones M, Zheng D, Singer RH, Skoultchi AI, Cvekl A. 2019. Bidirectional analysis of Cryba4-Crybb1 nascent transcription and nuclear accumulation of Crybb3 mRNAs in lens fibers. Invest. Ophthalmol. Vis. Sci. 60, 234–244. ( 10.1167/iovs.18-25921) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C15] 15.Behjati Ardakani F. et al. 2018. Integrative analysis of single-cell expression data reveals distinct regulatory states in bidirectional promoters. Epigenet. Chromatin 11, 66 ( 10.1186/s13072-018-0236-7) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C16] 16.Kustatscher G, Grabowski P, Rappsilber J. 2017. Pervasive coexpression of spatially proximal genes is buffered at the protein level. Mol. Syst. Biol. 13, 937 ( 10.15252/msb.20177548) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C17] 17.Scruggs BS, Gilchrist DA, Nechaev S, Muse GW, Burkholder A, Fargo DC, Adelman K. 2015. Bidirectional transcription arises from two distinct hubs of transcription factor binding and active chromatin. Mol. Cell 58, 1101–1112. ( 10.1016/j.molcel.2015.04.006) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C18] 18.Wei W, Pelechano V, Jarvelin AI, Steinmetz LM. 2011. Functional consequences of bidirectional promoters. Trends Genet. 27, 267–276. ( 10.1016/j.tig.2011.04.002) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C19] 19.Lewin B. 2007. Genes IX. Sudbury, MA: Jones and Bartlett. [Google Scholar]

[RSIF20190507C20] 20.Griswold A. 2008. Genome packaging in prokaryotes: the circular chromosome of E. coli. Nat. Education 1, 57. [Google Scholar]

[RSIF20190507C21] 21.Moshkin YM. 2015. Chromatin—a global buffer for eukaryotic gene control. AIMS Biophys. 2, 531–554. ( 10.3934/biophy.2015.4.531) [DOI] [Google Scholar]

[RSIF20190507C22] 22.Ammar R, Torti D, Tsui K, Gebbia M, Durbic T, Bader GD, Giaever G, Nislow C, Reinberg D. 2012. Chromatin is an ancient innovation conserved between Archaea and Eukarya. eLife 1, e00078 ( 10.7554/eLife.00078) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C23] 23.McClure WR. 1980. Rate-limiting steps in RNA chain initiation. Proc. Natl Acad. Sci. USA 77, 5634–5638. ( 10.1073/pnas.77.10.5634) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C24] 24.Lloyd-Price J, Startceva S, Kandavalli V, Chandraseelan J, Goncalves N, Oliveira SMD, Hakkinen A, Ribeiro AS. 2016. Dissecting the stochastic transcription initiation process in live Escherichia coli. DNA Res. 23, 203–214. ( 10.1093/dnares/dsw009) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C25] 25.Revyakin A, Ebright RH, Strick TR. 2004. Promoter unwinding and promoter clearance by RNA polymerase: detection by single-molecule DNA nanomanipulation. Proc. Natl Acad. Sci. USA 101, 4776–4780. ( 10.1073/pnas.0307241101) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C26] 26.Muthukrishnan A-B, Kandhavelu M, Lloyd-Price J, Kudasov F, Chowdhury S, Yli-Harja O, Ribeiro AS. 2012. Dynamics of transcription driven by the tetA promoter, one event at a time, in live Escherichia coli cells. Nucl. Acids Res. 40, 8472–8483. ( 10.1093/nar/gks583) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C27] 27.Hakkinen A, Ribeiro AS. 2015. Estimation of GFP-tagged RNA numbers from temporal fluorescence intensity data. Bioinformatics 31, 69–75. ( 10.1093/bioinformatics/btu592) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C28] 28.Kandavalli VK, Tran H, Ribeiro AS. 2016. Effects of σ factor competition are promoter initiation kinetics dependent. BBA Gene Regul. Mech. 1859, 1281–1288. ( 10.1016/j.bbagrm.2016.07.011) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C29] 29.Hakkinen A, Ribeiro AS. 2016. Characterizing rate limiting steps in transcription from RNA production times in live cells. Bioinformatics 32, 1346–1352. ( 10.1093/bioinformatics/btv744) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C30] 30.Vvedenskaya IO, Vahedian-Movahed H, Zhang Y, Taylor DM, Ebright RH, Nickels BE. 2016. Interactions between RNA polymerase and the core recognition element are a determinant of transcription start site selection. Proc. Natl Acad. Sci. USA 113, E2899–E2905. ( 10.1073/pnas.1603271113) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C31] 31.Record MT Jr., Reznikoff WS, Craig ML, McQuade KL, Schlax PJ. 1996. Escherichia coli RNA polymerase (Eσ⁷0), promoters and the kinetics of the steps of transcription initiation. In Escherichia coli and Salmonella typhimurium:cellular and molecular biology, 2nd edn (eds FC Neidhart, JL Ingraham, KB Low, B Magasanik, M Schaechter, HE Umbarger), pp. 792–820. Washington, DC: ASM Press.

[RSIF20190507C32] 32.Wang F, Greene EC. 2011. Single-molecule studies of transcription: from one RNA polymerase at a time to the gene expression profile of a cell. J. Mol. Biol. 412, 814–831. ( 10.1016/j.jmb.2011.01.024) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C33] 33.Patrick M, Dennis PP, Ehrenberg M, Bremer H. 2015. Free RNA polymerase in Escherichia coli. Biochimie 119, 80–91. ( 10.1016/j.biochi.2015.10.015) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C34] 34.McQuarrie DA. 1967. Stochastic approach to chemical kinetics. J. Appl. Probab. 4, 413–478. ( 10.2307/3212214) [DOI] [Google Scholar]

[RSIF20190507C35] 35.Gillespie DT. 2009. Stochastic simulation of chemical kinetics. Annu. Rev. Phys. Chem. 58, 35–55. ( 10.1146/annurev.physchem.58.032806.104637) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C36] 36.Kaern M, Elston TC, Blake WJ, Collins JJ. 2005. Stochasticity in gene expression: from theories to phenotypes. Nat. Rev. Genet. 6, 451–464. ( 10.1038/nrg1615) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C37] 37.Cox DR. 1962. Renewal theory. London, UK: Methuen. [Google Scholar]

[RSIF20190507C38] 38.Pedraza JM, Paulsson J. 2008. Effects of molecular memory and bursting on fluctuations in gene expression. Science 319, 339–343. ( 10.1126/science.1144331) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C39] 39.Baba T. et al. 2006. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 1–11. ( 10.1038/msb4100050) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C40] 40.Glascock CB, Weickert MJ. 1998. Using chromosomal lacIQ1 to control expression of genes on high-copy-number plasmids in Escherichia coli. Gene 223, 221–231. ( 10.1016/S0378-1119(98)00240-6) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C41] 41.Nevo-Dinur K, Nussbaum-Shochat A, Ben-Yehuda S, Amster-Choder O. 2011. Translation-independent localization of mRNA in E. coli. Science 331, 1081–1084. ( 10.1126/science.1195691) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C42] 42.Le TT, Harlepp S, Guet CC, Dittmar K, Emonet T, Pan T, Cluzel P. 2005. Real-time RNA profiling within a single bacterium. Proc. Natl Acad. Sci. USA 102, 9160–9164. ( 10.1073/pnas.0503311102) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C43] 43.Oliveira SMD, Hakkinen A, Lloyd-Price J, Tran H, Kandavalli V, Ribeiro AS. 2016. Temperature-dependent model of multi-step transcription initiation in Escherichia coli based on live single-cell measurements. PLoS Comput. Biol. 12, e1005174 ( 10.1371/journal.pcbi.1005174) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C44] 44.Hakkinen A, Muthukrishnan A-B, Mora A, Fonseca JM, Ribeiro AS. 2013. CellAging: a tool to study segregation and partitioning in division in cell lineages of Escherichia coli. Bioinformatics 29, 1708–1709. ( 10.1093/bioinformatics/btt194) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C45] 45.Hakkinen A, Kandhavelu M, Garasto S, Ribeiro AS. 2014. Estimation of fluorescence-tagged RNA numbers from spot intensities. Bioinformatics 30, 1146–1153. ( 10.1093/bioinformatics/btt766) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C46] 46.Golding I, Cox EC. 2004. RNA dynamics in live Escherichia coli cells. Proc. Natl Acad. Sci. USA 101, 11 310–11 315. ( 10.1073/pnas.0404443101) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C47] 47.Peccoud J, Ycart B. 1995. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 48, 222–234. ( 10.1006/tpbi.1995.1027) [DOI] [Google Scholar]

[RSIF20190507C48] 48.Mendoza-Vargas A. et al. 2009. Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli. PLoS ONE 4, e7526 ( 10.1371/journal.pone.0007526) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20190507C49] 49.Tracy RL, Stern DB. 1995. Mitochondrial transcription initiation: promoter structures and RNA polymerases. Curr. Genet. 28, 205–216. ( 10.1007/BF00309779) [DOI] [PubMed] [Google Scholar]

[RSIF20190507C50] 50.Lutz R, Lozinski T, Ellinger T, Bujard H. 2001. Dissecting the functional program of Escherichia coli promoters: the combined mode of action of Lac repressor and AraC activator. Nucl. Acids Res. 29, 3873–3881. ( 10.1093/nar/29.18.3873) [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Transcription closed and open complex formation coordinate expression of genes with a shared promoter region

Antti Häkkinen

Samuel M D Oliveira

Ramakanth Neeli-Venkata

Andre S Ribeiro

Abstract

1. Introduction

2. Methods

2.1. Models

Figure 2.

2.2. Cells, plasmids, chemicals and growth conditions

2.3. Microscopy and image analysis

Figure 1.

3. Results and discussion

3.1. Analytical distributions of production time intervals

3.2. Noise and correlation in the transcription kinetics of genes in a head-to-head configuration

Table 1.

Figure 3.

Figure 4.

3.3. Noise in a gene with two initiation sites

3.4. Model predictions for empirical validation

Figure 5.

Table 2.

Table 4.

Table 3.

Figure 7.

Figure 6.

Figure 8.

4. Conclusion

Supplementary Material

Supplementary Material

Supplementary Material

Data accessibility

Authors' contributions

Competing interests

Funding

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases