Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2020 Oct 12;117(43):26608–26615. doi: 10.1073/pnas.2010849117

Filtering input fluctuations in intensity and in time underlies stochastic transcriptional pulses without feedback

Alberto Stefano Sassi a, Mayra Garcia-Alcala b,c,d, Mark J Kim b, Philippe Cluzel b,1, Yuhai Tu a,1,2
PMCID: PMC7604450  PMID: 33046652

Significance

Stochastic pulsatile dynamics are ubiquitous in organisms ranging from Escherichia coli to mammalian cells. The molecular mechanism for these stochastic pulses has long been thought to depend on feedback control that leads to bistability in the underlying system. Here, by quantitative modeling constrained by recent single-cell data in E. coli flagellar synthesis, we find that an inhibitor of the transcription factor creates a monotonic ultrasensitive switch that serves as a digital filter to eliminate small-amplitude input fluctuations. Additionally, the high-frequency input fluctuations are filtered by integration over time. Together, our results reveal that the filter-and-integrate design can generate stochastic pulses without feedback. This filter-and-integrate strategy allows cells to turn on gene expression only for persistently strong input signals.

Keywords: transcriptional pulses, gene regulation, stochastic modeling, filtering, ultrasensitivity

Abstract

Stochastic pulsatile dynamics have been observed in an increasing number of biological circuits with known mechanism involving feedback control and bistability. Surprisingly, recent single-cell experiments in Escherichia coli flagellar synthesis showed that flagellar genes are activated in stochastic pulses without the means of feedback. However, the mechanism for pulse generation in these feedbackless circuits has remained unclear. Here, by developing a system-level stochastic model constrained by a large set of single-cell E. coli flagellar synthesis data from different strains and mutants, we identify the general underlying design principles for generating stochastic transcriptional pulses without feedback. Our study shows that an inhibitor (YdiV) of the transcription factor (FlhDC) creates a monotonic ultrasensitive switch that serves as a digital filter to eliminate small-amplitude FlhDC fluctuations. Furthermore, we find that the high-frequency (fast) fluctuations of FlhDC are filtered out by integration over a timescale longer than the timescale of the input fluctuations. Together, our results reveal a filter-and-integrate design for generating stochastic pulses without feedback. This filter-and-integrate mechanism enables a general strategy for cells to avoid premature activation of the expensive downstream gene expression by filtering input fluctuations in both intensity and time so that the system only responds to input signals that are both strong and persistent.


The gene regulatory network for controlling the bacterial flagellar synthesis and assembly has long served as a canonical example of transcriptional cascade (18), in which a particular gene product (protein) can serve as the transcription factor (TF) for other genes, which encode proteins that serve as TFs for other downstream genes, and so on. The bacterial flagellar promoters are organized in three classes, each of which underlies specific stages of the complex flagellar assembly and chemosensory signaling processes (911). The class 1 promoter controls the transcription of flhDC, the operon encoding for the proteins FlhD and FlhC, that assemble into the heterohexamer FlhD4C2 (refer to as FlhDC hereafter), which is the master regulator of flagellar synthesis. The master regulator serves as the TF for class 2 promoters and initiates the transcription of seven operons necessary for the assembly of the basal body and the hook of the flagellum (12, 13). A specific class 2 protein, FliA, activates class 3 promoters to express class 3 genes coding proteins for the flagellar filament and the chemotaxis signaling pathway (14). The bacterial flagellar synthesis represents a remarkable process in which over 50 different genes are expressed in a coherent fashion over several generations to build a large functional protein complex. Over the past decades, experimental studies have yielded many valuable insights about the transcriptional network that underlies flagellar synthesis in Escherichia coli and Salmonella (9, 10, 1518). In particular, population measurements suggested that the transcriptional cascade follows a deterministic temporal order where genes in subsequent classes are activated sequentially and, once turned on, promoters remain active continuously during exponential growth (10).

Recently, Kim et al. (19) measured gene expression dynamics of single E. coli cells over many generations by using a microfluidic device called the “mother machine” (20, 21). Surprisingly, the new single-cell data showed that flagellar promoters are stochastically activated in intermittent pulses, that is, bursts of strong activity amid a quiet background. Due to their stochastic nature, transcriptional pulses can only be detected by single-cell measurements, and they were missed in previous cell population study (10).

Indeed, rapid developments in quantitative single-cell measurements have revealed that quasi-periodic or stochastic pulsating dynamics is a common dynamic activity pattern in biological circuits in a wide range of organisms, for example, TF Msn2 in yeast (22, 23), TF p53 (24) and NF-κB (25) in mammalian cells, bacterial TF σb in the bacterium Bacillus (26), and stochastic cell fate switching in microbes (27). Typically, the underlying mechanism involves bistability caused by a combination of positive and negative feedback loops (28); a recent theoretical study showed that stochastic pulses may also be caused by a negative feedback loop alone (29). Suffice to say, most known mechanisms for pulses involve some form of feedback control. However, in Kim et al. (19), pulses were present even when the promoter of FlhDC was deleted and replaced by a synthetic/constitutive promoter that does not respond to flagellar endogenous regulator and thus rules out the involvement of any feedback control. Additionally, in a different system that controls cell fate, stochastic pulses were also successfully generated by transferring, in E. coli, a reconstituted gene circuit without feedback (30).

Prompted by these recent observations, we ask the question, what are the underlying design principles that govern the stochastic transcriptional pulses without feedback? To answer this question, we use the large single-cell expression time series dataset for different strains and mutants from Kim et al. (19) to quantitatively and consistently constrain a system-level stochastic model. Using this approach, we aim to identify the minimum design principles necessary to generate stochastic transcriptional pulses without feedback in general, and specifically during E. coli flagellar synthesis.

Results

We first present a system-level (coarse-grained) stochastic model for the regulation of class 2 genes by FlhDC. We then use the model to analyze and explain the steady-state distributions of class 2 activity and its dynamics observed in single-cell experiments (19). Next, we propose possible molecular mechanisms underlying the main findings from our analysis and describe a filter-and-integrate mechanism for controlling noisy FlhDC signal in E. coli. Finally, we use our model to make testable predictions for responses to realistic time-varying signals.

A System-Level Stochastic Model for Class 1 and Class 2 Gene Expression Dynamics.

In the experiments by Kim et al. (19), they used strains whose expression of class 1 genes was controlled by promoters of different strengths (see SI Appendix, Supplementary Information A for details of the experimental data). For each strain, long time series (70 generations) of class 1 and class 2 promoter activity were simultaneously measured within the same cells using fluorescent proteins as proxy. They found that distributions of class 2 promoter activity depend on the class 1 promoter strength as well as the presence or absence of the inhibitor YdiV (3133) [also known as RflP (34)].

Here, we use the class 1 reporter fluorescence signal as a proxy to represent the FlhDC level C1 as the input to the class 2 gene transcription process. Following Kim et al. (19), we define the class 2 gene expression activity (output), A2, as the time derivative of the fluorescence time series of the class 2 reporter gene, which is assumed to be proportional to the degree of activation of class 2 promoters (see SI Appendix, Supplementary Information A for details of computing and denoising A2 from experimental data obtained in the “mother machine”). We propose a system-level stochastic model to describe dynamics of these two coarse-grained observables, A2 and C1, without considering molecular details. In particular, the A2 dynamics in a single cell is described by a stochastic differential equation (SDE) that has three basic terms: decay (degradation), activation (production), and noise,

dA2dt=A2fs(C1)τ2+θs=A2τ2+fs(C1)τ2+θs, [1]

where s=(+,) represents cells with and without YdiV, respectively, and τ21 is the decay rate with τ2 as the characteristic timescale associated with class 2 expression. For convenience, we write activation rate of the class 2 promoter as fs(C1)/τ2 so that A2=fs(C1). We call fs(C1) the response function, as it characterizes how class 2 activity responds to class 1 protein level C1. The white noise θs has zero mean θs(t)=0 and correlation θs(t)θs(t)=2Δs(C1)δ(tt) with noise strength given by Δs(C1). Here, we assume that the noise strength Δs(C1) depends only on the average C1 concentration C1, which is valid due to the timescale difference for A2 and C1, as will become clear later in this work.

The activity of class 2 genes A2 depends on the class 1 protein level C1, whose dynamics can be modeled by a similar SDE,

dC1dt=(C1μ)τ1+C1η=C1τ1+μτ1+C1η, [2]

where τ1 is the decay time for class 1 protein. We define μ/τ1 as the C1 production rate, so that the steady-state average is C1=μ with μ depending on the promoter strength of a specific strain. The white noise η has zero mean η(t)=0 and η(t)η(t)=2Δ1δ(tt), with the noise strength Δ1 as a constant for all promoter strengths. The noise η is multiplied by C1 in Eq. 2. This choice of a linear multiplicative noise is due to the fact that it provides a better fit to experimental data in comparison with other forms of noise such as additive and square root multiplicative noise (see SI Appendix, Supplementary Information B and Fig. S2 for a detailed comparison). Mechanistically, the multiplicative noise may originate from random partitioning of protein molecules during cell division and fluctuations in protein degradation rate.

An important assumption of our proposed model is that the observed class 2 pulses do not require transcriptional feedback on YdiV. Indeed, this assumption is fully supported by mother machine single-cell data published in Kim et al. (ref. 19, figure S(11)), which shows that, when the expression of YdiV is controlled by a synthetic constitutive promoter, the pulses are still present. Overall, our “minimal” system-level (coarse-grained) model consists of two SDEs (Eqs. 1 and 2) with a few physiologically meaningful parameters and functions: two timescales τ1 and τ2, a constant multiplicative noise strength Δ1, a strain-dependent noise strength Δs(μ), and two response functions fs(C1). In the following, by fitting all of the single-cell data for different strains (19) with our model quantitatively, we determine the parameters and response functions, which help us identify the underlying mechanism for transcriptional pulses.

YdiV Creates an Ultrasensitive Switch.

We first use our model to study all of the steady-state class 2 activity distributions from experiments (see Methods for details on simulations of our model). In Fig. 1A, experimentally observed distributions of the normalized class 2 activity A2/A2 for cells with and without YdiV for three class 1 promoter strengths—low (P1, red), medium (P4, green), and high (P7, blue)—are shown (see SI Appendix, Supplementary Information A for details of the experimental dataset). The most drastic difference between cells with and without YdiV is for strain P4 that has a medium promoter strength similar to that of the wild-type cells. The A2/A2 distribution for the P4 strain in the presence of YdiV (green line in Fig. 1 A, Left) is strongly asymmetric with a peak that is significantly shifted with respect to the mean, whereas the distributions for lower and higher promoter strengths are more symmetric. This asymmetry is largely absent in strains without YdiV (Fig. 1 A, Right). For simplicity, we fit the experimental data by using Hill functions for the response functions fs and tuning the shape of the Hill functions to minimize the difference between the distributions of C1 and A2 from the model and those from experiments for different promoter strengths (see SI Appendix, Supplementary Information C for details). Our model can explain all of the steady-state A2 distributions from different strains together as shown by the direct comparison between the experimentally measured distributions (Fig. 1A) and those obtained from our model (Fig. 1B).

Fig. 1.

Fig. 1.

Comparison between experimental data and model results. (A) Distributions of the normalized class 2 activity A2/A2 in the presence (Left) and absence (Right) of YdiV for small (P1, red), medium (P4, green), and large (P7, blue) class 1 promoter strengths. Data are taken from ref. 19. We use the black dashed lines to indicate the positions of its mean value and zero for A2. (B) The corresponding distributions obtained from our model. (C) Comparison of typical time series of C1(t) and A2(t) from experiments and model for the P4 strain with and without YdiV. (D) Response functions obtained by fitting our model to experimental data (19) with (f+, solid line) and without (f, dashed line) YdiV. The circles (+YdiV) and crosses (−YdiV) indicate the mean value from different mother cells in experiments. Note that fs does not pass through the center of the data points, due to its high nonlinearity fs(C1)fs(C1). The color code for each promoter is the same as in A and B.

The most significant difference in the dynamics of class 2 activity A2(t) between cells with and without YdiV also occurs in strain P4 with a medium class 1 promoter strength similar to that of wild-type cells. As shown in Fig. 1C, from the experiments, the A2 dynamics contains well-defined pulses in the presence of YdiV, while pulses are not well separated and the presence of higher-frequency noise appears to be unfiltered in the absence of YdiV. These different activity dynamics are reproduced in our model.

By fitting our model to the single-cell data (see SI Appendix, Supplementary Information C for details of the fitting and values of all of the fitted model parameters), we obtain the response functions, f+(C1) and f(C1), which cannot be directly determined from experimental data because A2=fs(C1)fs(C1), due to their nonlinearity. As shown in Fig. 1D (solid line), the response function f+(C1) in the presence of YdiV behaves like a monotonic ultrasensitive switch, that is, it has a much larger Hill coefficient (H+12) than the Hill coefficient (H3) for f(C1) (see SI Appendix, Supplementary Information C for details). Due to this ultrasensitive (steep) response function, when the mean FlhDC concentration is close to the threshold (e.g., for strain P4), the distribution of A2/A2 has the largest asymmetry, and there is a significant shift in the peak of the distribution (Fig. 1 A, Left and B, Left). In the absence of YdiV, the response function f(C1) is found to be less steep (dotted line in Fig. 1D), and the probability distributions are more symmetric (Fig. 1 A, Right and B, Right). Since we only used experimental data from three strains (P1,P4,P7) to fit (calibrate) our model to determine the response functions and other parameters (such as τ1 and τ2), we can test the consistency of our model by applying it to other strains that are not used in the fitting. In particular, we ran simulations of our model for two other strains, P2 and P5, with different promoter strengths but with all other parameters and the response functions fixed. As shown in SI Appendix, Fig. S5, the A2 distributions for P2 and P5 obtained from our model agree well with experiments, and the average values of C1 are also consistent with the experiments (SI Appendix, Fig. S4).

The agreements between our model and the experiments not only validate our model, but, more importantly, they reveal that the role of YdiV is to create a ultrasensitive (steep) response of the class 2 activity to the class 1 protein (FlhDC) concentration.

Memory Effect and Integration over Time.

Next, we use our model to analyze the stochastic gene expression dynamics in the flagellum system. As shown in Eq. 1, there are two noise sources for the class 2 activity A2: the intrinsic noise θs in the gene expression process and the extrinsic noise due to fluctuations of the FlhDC concentration C1. From our model, the mean class 2 promoter activity averaged over the intrinsic noise can be written in the following integral form:

A2(t)=tG(tt)fs(C1(t))dt, [3]

where G(tt)=τ21exp[(tt)/τ2] is the integral kernel (Green’s function) that decays with the timescale τ2. Eq. 3 means that A2(t) depends on the values of C1(t) in a time window tτ2tt; that is, there is a memory effect. Given that C1(t) varies with a timescale τ1, the strength of this memory effect depends on the ratio of the two timescales τ2/τ1. For τ2/τ11, the kernel G(tt)δ(tt) and A2(t)fs(C1(t)); that is, the class 2 activity depends on the instantaneous level of class 1 protein C1(t) with negligible memory effect. However, if τ2/τ1>1, the memory effect becomes significant. In the limit τ2/τ11, A2(t)fs(C1)C1fs(C1)P(C1)dC1 becomes a constant independent of the instantaneous FlhDC level C1(t) (P(C1) is the steady-state C1 distribution function).

To demonstrate the memory effect, we divide the FlhDC concentration in equally spaced bins (in log-scale) and then compute the class 2 activity and FlhDC concentration averaged within each bin to form the so-called “bin-averaged response curve” (BARC) for each strain with a different class 1 promoter strength (see Methods for details on BARC). As shown in Fig. 2A, when τ2>τ1, the BARCs for different strains can be shifted from each other, consistent with experiments. This means that the class 2 activity is not determined just by the instantaneous FlhDC concentration; it depends also on its average over previous times, due to the memory effect. Indeed, as shown in Fig. 2B, in the absence of memory effect when τ2τ1, all BARCs collapse.

Fig. 2.

Fig. 2.

The bin-averaged response curves and the ratio of two timescales τ2/τ1. (A) Model results for τ2=3.5τ1. Inset shows the same behavior from experimental data. (B) Model results for τ2=0.1τ1. We used −YdiV strains here; see SI Appendix, Fig. S7 for similar results for +YdiV strains. (C and D) The normalized power spectra (green lines) from our model for (C) τ2=3.5τ1 and (D) τ2=0.1τ1 for the P4 strain. The normalized power spectrum (black line) from experiments (P4) is also shown for comparison. We used τ1=45min within the range of C1 correlation time estimated from experiments.

When τ2>τ1, A2 depends on an integral of the transformed input signal f+(C1) over a time window τ2 as shown in Eq. 3. This integration effectively averages the input signal in time and eliminates high-frequency fluctuations as demonstrated in Fig. 2 C and D, where power spectra for the two cases with and without integration are shown. The power spectrum from experimental data is consistent with our model, with τ2=3.5τ1 where the power of the spectrum of A2(t) is concentrated in the low-frequency regime with a similar half-maximum-power frequency f1/20.002/min as shown in Fig. 2C. For τ2τ1 (Fig. 2D), the power spectrum is much broader, with significant high-frequency fluctuations and f1/20.008/min that is much higher than that from experiments. Quantitatively, by comparing BARCs and autocorrelation functions from the model and those from experiments, we estimate the timescale ratio to be in the range 2.1τ2/τ13.6 (see SI Appendix, Supplementary Information F for details).

A YdiV-Mediated Sequestration Mechanism for Filtering.

So far, we have used our model to explain the steady-state distributions and dynamics of the noisy single-cell data. In this section, we propose possible molecular mechanisms underlying our findings, with a focus on the role of YdiV in regulating FlhDC signal.

We first briefly discuss possible molecular origins of separation of timescales. The shorter timescale τ1 is most likely related to the degradation time of the FlhC, and FlhD monomers and the longer timescale τ2 are related to the decay time of the FlhDC-dependent class 2 promoter activation (see Fig. 3A for an illustration). Since FlhD and FlhC can only serve as functional TF when they form heterohexamer FlhDC, τ2 is affected by the binding affinity of FlhDC to class 2 promoters as well as stability of the FlhDC complex. The heterohexamer is more stable than the monomers (35), which is consistent with τ2>τ1 as observed in experiments and our model. It would be interesting to test the effects of changing τ2 experimentally (see Summary and Discussion for more details).

Fig. 3.

Fig. 3.

The YdiV-mediated sequestration mechanism for switch like behavior. (A) Schematic representation of the mechanism of transcriptional regulation of class 2 promoters by FlhDC. The class 1 promoter (purple) initiates the expression of the monomers FlhC and FlhD (yellow and orange, respectively) that forms the heterohexamer FlhD4C2 (FlhDC). The timescale τ1 may be related to the degradation rate of the monomers (FlhD and FlhC), while τ2 may be related to the degradation rate of FlhDC or its unbinding rate to the promoters, both labeled by red arrows. The activation of class 2 genes, which depends on binding of the FlhDC complex to the class 2 promoter (green), is inhibited when FlhDC is sequestered by binding to YdiV (blue) instead (gray box). (B) Response curves obtained from the YdiV sequestration model for different values of YdiV concentrations (Pt is the total class 2 promoter concentration); see Methods for details. (Inset) The response functions, f+(C1) (blue) and f(C1) (red), in the presence and absence of YdiV, respectively, from fitting experimental data (the same as in Fig. 1D). (C) Hill coefficient (H) and the half-maximum concentration (K1/2) as a function of the YdiV concentration.

We now study the role of YdiV in creating an ultrasensitive switch-like response of class 2 gene expression as shown in Fig. 1D. YdiV is known to have two main effects on FlhDC. First, by occupying the binding sites of FlhDC, YdiV prevents the binding of FlhDC to free class 2 promoters (32). Second, YdiV mediates the interaction between FlhDC and the degradation complex ClpXP (31, 32). Here, we only consider the effect of competitive binding on the response function; similar effect may be achieved by enhancing the degradation of FlhDC, which is not considered in this study. Intuitively, due to competitive binding, YdiV can introduce a threshold for FlhDC by sequestrating FlhDC to prevent it from binding to and activating the class 2 promoters.

To show the qualitative effects of this YdiV-mediated sequestration mechanism, we developed a simple model based on competitive binding of FlhDC by YdiV and the class 2 promoter as illustrated in Fig. 3A (see Methods for details of the sequestration model). In Fig. 3B, the fraction of promoters bound to FlhDC, which serves as a measure of the class 2 gene expression activity, is shown as a function of the scaled FlhDC concentration C1/Pt, with Pt as the total promoter concentration, for different values of total YdiV concentration. We can see that higher YdiV concentrations lead to delayed but steeper response curves, which agrees qualitatively with the response curves f+ and f shown in Fig. 3 B, Inset for comparison. A response curve can be described by its effective Hill coefficient H and its half-maximum concentration K1/2. From the experimental data and our model, both the Hill coefficient and K1/2 are larger in wild-type cells than those in the ΔydiV mutant. As shown in Fig. 3C, the YdiV-mediated sequestration model reproduces this general trend, with both H and K1/2 increasing with YdiV concentration.

The steep response function f+(C1) suggests that YdiV serves (approximately) as a digital filter. When the input signal, that is, FlhDC concentration (C1), is lower than a threshold C*K1/2(Yt) set by the YdiV concentration Yt, the normalized output f+(C1)/max(f+) is close to 0. When C1>C*, the normalized output is close to 1.

One important ingredient in our model is the inhibitory effect of YdiV on class 2 promoter activity. Therefore, changing YdiV at a fixed average FlhDC level can modulate the class 2 promoter activity. In particular, the model predicts that the increase in YdiV levels raises the threshold of class 1 concentration that needs to be reached to have a nonnegligible class 2 response. More technically, the constant K1/2,+ increases with the concentration of YdiV. This results in a shift in the class 2 distributions toward lower mean values for larger YdiV levels. We have tested this prediction by using flow cytometry to quantify the distributions of class 2 expression in two strains, ProBY and ProDY (36), which have higher levels of YdiV expression than in wild type (see SI Appendix, Supplementary Information E for details of the experiments). As shown in SI Appendix, Fig. S6, we find that, when the YdiV level is increased, there is a shift in the class 2 gene expression distributions toward lower values consistent with our model results.

A Filter-and-Integrate Mechanism for Stochastic Pulses without Feedback.

Put together, our results show that E. coli uses a combined filter-and-integrate strategy (mechanism) to control its class 2 gene expression. As shown in Fig. 4, in the presence of YdiV (left side of Fig. 4), the input signal, that is, the FlhDC concentration C1(t) (blue line), which fluctuates relatively fast at a short timescale τ1, first goes through a highly nonlinear ultrasensitive response function f+(C1), which serves approximately as a digital filter that eliminates the FlhDC fluctuations lower than certain threshold C* set by the YdiV concentration. The filtered signal f+(C1(t)) (green line) has a nearly digital stochastic pulse-like pattern.

Fig. 4.

Fig. 4.

The filter-and-integrate mechanism for pulse generation. In the presence of YdiV, the FlhDC concentration C1(t) (blue line), which fluctuates with a fast timescale τ1, is filtered by the inhibitory effect of YdiV. The filtered signal f+(C1(t)) (black line), which has a pulse-like pattern, is then integrated over a longer timescale τ2 to determine the class 2 activity A2(t) (red line). In the absence of YdiV, there is no filtering effect, and the resulting A2 dynamics is noisier, with no pulse. Without integration (τ2τ1), A2 would have many short spikes when the signal is filtered, and it resembles the original noisy signal (C1) in the absence of filtering. Results presented here are obtained from simulations of our model using parameters fitting the P4 strain.

The second part of the two-pronged process is to integrate the filtered signal over a timescale τ2(>τ1), which gives rise to the intermittent “strong” pulses in A2 (red line). Without the integration process τ2τ1, the A2 activity would have many short spikes, too short to complete the flagellar synthesis process. The integration process acts as a filter in frequency space, as demonstrated in Fig. 2 C and D, to suppress those FlhDC fluctuations that are strong in intensity but short in time duration. As a result, class 2 gene expression is turned on only when there are persistently high levels of FlhDC signal. Due to the same integration (memory) effect, once turned on, the class 2 expression remains active for a time duration of τ2 even without strong instantaneous FlhDC signal. This long duration of on time is necessary for the cell to finish the flagellar synthesis process that lasts for several generations.

Both filtering and integration in time are critical to generate the stochastic pulses observed in experiments (19) where A2 alternates between having near-zero activity (quiet) for a long period and being highly active during a strong pulse that lasts for a long timescale set by τ2 (bottom left in Fig. 4). In the absence of YdiV, the noisy C1(t) signal is not filtered, and, as a result, there is no pulse in A2(t), which fluctuates continuously (bottom right in Fig. 4). In the absence of integration, A2 follows the same dynamics as the filtered signal f+, which has many short spikes (bottom middle in Fig. 4).

Predicted Gene Expression Responses to Time-Varying Signals.

Now that our model is fully developed with all its parameters determined by existing single-cell experiments (19), it provides a powerful tool to predict class 2 gene expression responses to more realistic time-dependent signals without any tuning parameter. As an example, we use our model to study the responses of class 2 activity to an increase of the class 1 promoter strength from a low value μ0(=μ(P1)2.6) to a series of higher values μ1 at time t=0. We find that a cell will turn on the class 2 gene expression after a delay time τr, as shown in Fig. 5A. The delay time τr varies from cell to cell and follows a distribution P(τr|μ1) that depends on μ1. In Fig. 5B, the delay time distributions for different values of μ1 are shown. Both the average delay time and its variance decrease with μ1 or, equivalently, the average FlhDC level as shown in Fig. 5C. Responses to other time-varying stimuli such as ramps and oscillatory signals can be studied by our model similarly.

Fig. 5.

Fig. 5.

The predicted responses to step stimuli. (A) Predicted dynamics of C1 and A2 in response to a time-dependent (step function) stimulus in which the class 1 promoter strength μ is changed at t=0 from a low value μ0=μ(P1)=2.6 to a higher value μ1. The delay time τr is the time duration from the stimulus (t=0) to the onset of class 2 expression, which is defined as the time when A2 crosses a threshold A2*=max(f+)/4 (qualitative results do not depend on the choice of A2*). (B) Distributions of the delay time τr (normalized by τ1) for different values of μ1. (C) Both the average and variance of τr decreases with μ1 or the average FlhDC concentration. The values of μ for strains P4 and P7 used in our study are marked for reference.

Our results suggest that, even though FlhDC concentration does not control the class 2 gene expression deterministically, the filter-and-integrate mechanism allows the cell to control the timing statistics of the class 2 expression based on the average FlhDC signal intensity. As shown in Fig. 5C, a higher FlhDC concentration induces (on average) a shorter and more accurate (smaller coefficient of variation) onset time to turn on the class 2 genes. Similarly, an elevated average FlhDC level leads to more frequent occurrence of the transcriptional pulses. These model predictions can be tested in future single-cell experiments by using inducible class 1 promoters.

Summary and Discussion

In this paper, dynamics of the class 1 and class 2 promoters in E. coli flagellar synthesis are studied by using a minimal stochastic model that captures the essential characteristics of the underlying system. Applying our stochastic model to a large single-cell dataset quantitatively shows that the transcriptional pulses are generated by a filter-and-integrate mechanism without the need of feedback control. Both the YdiV-enabled filtering and the memory-enabled time integration are crucial to average out FlhDC fluctuations in intensity and in time so that an individual cell can make a “calculated” (informed) decision on whether to turn on the expensive class 2 gene expression. This mechanism or a variation of it may be at work in other pulse-generating systems without feedback. For example, in the reconstituted SinI−SinR circuit (30), the antagonist SinI may play the critical role of the filter (analogous to YdiV) by binding to and inhibiting the transcriptional regulator SinR (analogous to FlhDC). On the other hand, while the timescale of the class 2 activity (A2) pulses in flagellar synthesis is provided by the integration time τ2, the pulse timescale in the SinI−SinR system may be controlled directly by the timescale of the SinI fluctuations (30).

To better understand the feedbackless filter-and-integrate strategy in E. coli, we compare it with a closely related bacterium Salmonella, which does have an additional positive feedback loop in controlling its class 2 promoter activity. As illustrated in Fig. 6, FliZ, a class 2 protein, can inhibit YdiV in Salmonella but not in E. coli. This positive feedback loop in Salmonella leads to bistability in class 2 gene response (37). As a result, the hysteretic response function in Salmonella fSal(C1) has two critical points Cl* and Ch*(>Cl*), as illustrated in Fig. 6B. Due to the bistability, once the FlhDC level goes over the upper critical level C1>Ch* even for a short time, the response signal fSal(C1) will stay on (green) for a much longer time, as long as FlhDC level is higher than the lower critical level C1>Cl*. This amplification of the “on” time duration allows Salmonella to activate class 2 gene expression upon detecting a large-but-short signal. In contrast, as shown in Fig. 6A, the response function for E. coli, though steep, does not have bistability, because there is no feedback. As a result, an elevated FlhDC level over the threshold (C*) for a short time is not enough to trigger significant class 2 activity, given the large integration time τ2.

Fig. 6.

Fig. 6.

The different gene regulation strategies for E. coli and Salmonella. (A) The E. coli network has the YdiV-mediated inhibition of the class 1 master regulator (FlhDC) but not the feedback mechanism from class 2 gene. As a result, E. coli has a steep but monotonic response function f+ with a threshold C*; that is, class 2 gene expression is activated (green) when C1>C* or inhibited (red) when C1<C*. For an FlhDC signal that passes the threshold C* briefly, the class 2 activity is only turned on for a short time, which is not enough to start the flagella synthesis, given the long integration time τ2 for class 2 gene activation. (B) In Salmonella, there is a feedback (blue line) from a class 2 protein (FliZ) to suppress the inhibitor YdiV. This feedback mechanism leads to a bistable (hysteretic) response of class 2 genes in the range Cl*<C1<Ch*. As a result, a brief period of elevated FlhDC concentration over the (upper) threshold Ch* can cause a prolonged class 2 gene activation as long as C1>Cl*, which can lead to flagella synthesis.

Their different strategies may reflect the different purposes of motility for these two bacteria. Salmonella is a pathogen and activates the flagellar gene expression cascade in nutrient-rich conditions. A short pulse of strong stimulus from the host may trigger the flagellar synthesis process. This aggressive strategy may be optimized for host colonization and infection. In E. coli, however, flagellar synthesis is activated in nutrient-poor conditions for the purpose of foraging and searching for nutrients. Therefore, E. coli may adopt a more prudent (filter-and-integrate) strategy that only triggers the expensive flagellar synthesis process when the cell experiences persistent poor nutrient conditions.

Our current analysis opens up a few directions for future investigation. Experimentally, effects of the filter-and-integration mechanism may be tested by perturbing the two key factors, for example, by tuning the strength of the interaction between FlhDC and class 2 promoters (32, 38) or by reducing the effective concentration of YdiV by adding Salmonella FlhC known to have weaker affinity for E. coli promoters but the same affinity for YdiV as E. coli FlhC (39). It would be highly informative to measure single-cell gene expression dynamics of Salmonella in the mother machine as done for E. coli (19), to understand its gene regulation strategy quantitatively. In our current model, YdiV level is assumed to be constant as we focus on the long timescale (low frequency) dynamics of the system. However, in the high-frequency regime, the A2 power spectrum from experiments follows P(f)f1 (f is the frequency), which decays slower than the power spectrum from our current model with a fixed YdiV level (SI Appendix, Fig. S10). A similar 1/f noise spectrum was observed in flagellar motor switching dynamics (40) and was subsequently explained by considering fluctuations of the response regulator CheY-P in a single cell (41). It would be interesting to include dynamics of YdiV in our model to explore whether it also leads to the observed 1/f spectrum in the high-frequency regime. Furthermore, to fully understand how the class 2 gene expression response depends on YdiV, molecular details of the FlhDC heterohexamer complex formation, its binding (unbinding) to DNA, and its degradation mediated by YdiV need to be considered in addition to the YdiV-mediated sequestration studied here.

Finally, our work here demonstrates that system-level stochastic modeling augmented by quantitative single-cell measurements for different strains and mutants provides a powerful tool to decipher principles of gene regulation from inherently noisy single-cell data. This tool becomes necessary when dealing with complex gene regulatory circuits such as the full bacterial flagellar gene expression cascade with multiple feedback controls. The general system-level stochastic modeling approach with coarse-grained variables that can be directly compared with experiments should be applicable to study stochastic expression dynamics in other gene regulatory systems with or without feedback (2326, 30, 42).

Methods

Simulation of the Stochastic Differential Equations.

For the numerical simulation of the stochastic equations, we write Eq. 2 in discrete form,

ΔC1=(C1μ)Δtτ1+γηC12Δ1Δt, [4]

where γη is a stochastic variable following a Gaussian distribution N[0,1]. We let τ1=1 to set the timescale, and the time step Δt=0.004τ1. The discrete form of Eq. 1 is

ΔA2=(A2fs(C1))τ1τ2Δt+γθ2Δsτ1Δt, [5]

where γθ is a Gaussian random number as γη.

Bin-Averaged Response Curves and Flatness.

To obtain the BARCs, we divide the full interval 101C1102 into 20 logarithmically spaced bins and calculate the average of class 1 concentration and class 2 activity for all of the data points in each bin. This bin-averaged class 2 activity versus the bin-averaged class 1 concentration is the BARC. The BARCs in Fig. 2 are only shown at those bins that contain more than 5,000 data points, for better statistics.

Steady-State Solution of the Sequestration Model.

Let Pt be the concentration of promoters, C1 be the total concentration of FlhDC, and Yt be the YdiV concentration. We denote C1p and C1y as the concentrations of FlhDC that are bound to the promoters and YdiV, respectively. In steady state, the balance of binding and unbinding reactions leads to C1p=C1fPt/C1f+Kp, and C1y=C1fYt/C1f+Ky, where Kp and Ky are the dissociation constants of FlhDC to the promoters and YdiV, respectively. C1f(=C1C1pC1y) is the free FlhDC concentration, and it satisfies the following equation (conservation of FlhDC):

C1f+C1fPtC1f+Kp+C1fYtC1f+Ky=C1, [6]

which can be solved to determine C1f as a function of Pt, C1, Yt, Ky, and Kp. The fraction of FlhDC bound promoters is C1p/Pt=C1f/(Kp+C1f), which is used as a measure of the class 2 activity. The dependence of C1p/Pt on C1/Pt is plotted in Fig. 3B. Parameters Ky/Pt=102 and Kp/Pt=101 are used in Fig. 3 B and C.

Supplementary Material

Supplementary File

Acknowledgments

We thank Drs. Y. Lin and M. Erhardt for discussions. The work by A.S.S. and Y.T. is supported by NIH Grant R35GM131734 to Y.T. The work by M.G.-A., M.J.K., and P.C. is supported by NIH Grant R01GM134275 to P.C. and NSF Grant 1615487 to P.C. M.G.-A. acknowledges support from Consejo Nacional de Ciencia y Tecnología (CONACYT) through a PhD scholarship and Dirección General de Asuntos del Personal Académico - Universidad Nacional Autónoma de México (DGAPA-UNAM) for support through Grant PAPIIT-IN226917.

Footnotes

The authors declare no competing interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2010849117/-/DCSupplemental.

Data Availability.

All study data are included in the article and SI Appendix.

References

  • 1.Martinez-Antonio A., Janga S., Thieffry D., Functional organisation of Escherichia coli transcriptional regulatory network. J. Mol. Biol. 381, 238–247 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bähler J., A transcriptional pathway for cell separation in fission yeast. Cell Cycle 4, 39–41 (2005). [DOI] [PubMed] [Google Scholar]
  • 3.Allan D. W., Thor S., Transcriptional selectors, masters, and combinatorial codes: Regulatory principles of neural subtype specification. Wiley Interdiscip. Rev. Dev. Biol. 4, 505–528 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Spöring I., et al. , Regulation of flagellum biosynthesis in response to cell envelope stress in Salmonella enterica serovar typhimurium. mBio 9, e00736-17 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Soutourina O. A., Bertin P. N., Regulation cascade of flagellar expression in Gram-negative bacteria. FEMS Microbiol. Rev. 27, 505–523 (2003). [DOI] [PubMed] [Google Scholar]
  • 6.Jacob F., Monod J., Genetic regulatory mechanisms in the synthesis of proteins. J. Mol. Biol. 3, 318–56 (1961). [DOI] [PubMed] [Google Scholar]
  • 7.Shen-Orr S. S., Milo R., Mangan S., Alon U., Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002). [DOI] [PubMed] [Google Scholar]
  • 8.Ghafari M., Mashaghi A., On the role of topology in regulating transcriptional cascades. Phys. Chem. Chem. Phys. 19, 25168–25179 (2017). [DOI] [PubMed] [Google Scholar]
  • 9.Chilcott G. S., Hughes K. T., Coupling of flagellar gene expression to flagellar assembly in Salmonella enterica serovar typhimurium and Escherichia coli. Microbiol. Mol. Biol. Rev. 64, 694–708 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kalir S., et al. , Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science 292, 2080–2083 (2001). [DOI] [PubMed] [Google Scholar]
  • 11.Apel D., Surette M. G., Bringing order to a complex molecular machine: The assembly of the bacterial flagella. Biochim. Biophys. Acta 1778, 1851–1858 (2008). [DOI] [PubMed] [Google Scholar]
  • 12.Ikebe T., Iyoda S., Kutsukake K., Promoter analysis of the class 2 flagellar operons of Salmonella. Genes Genet. Syst. 74, 179–183 (1999). [DOI] [PubMed] [Google Scholar]
  • 13.Liu X., Matsumura P., Differential regulation of multiple overlapping promoters in flagellar class II operons in Escherichia coli. Mol. Microbiol. 21, 613–620 (1996). [DOI] [PubMed] [Google Scholar]
  • 14.Fitzgerald D. M., Bonocora R. P., Wade J. T., Comprehensive mapping of the Escherichia coli flagellar regulatory network. PLoS Genet. 10, e1004649 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Komeda Y., Transcriptional control of flagellar genes in Escherichia coli K-12. J. Bacteriol. 168, 1315–1318 (1986). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Kutsukake K., Ohya Y., Iino T., Transcriptional analysis of the flagellar regulon of Salmonella typhimurium. J. Bacteriol. 172, 741–747 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Brown J. D., et al. , The rate of protein secretion dictates the temporal dynamics of flagellar gene expression. Mol. Microbiol. 70, 924–937 (2008). [DOI] [PubMed] [Google Scholar]
  • 18.Saini S., Brown J. D., Aldridge P. D., Rao C. V., FliZ Is a posttranslational activator of FlhD4C2-dependent flagellar gene expression. J. Bacteriol. 190, 4979–4988 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kim J. M., Garcia-Alcala M., Balleza E., Cluzel P., Stochastic transcriptional pulses orchestrate flagellum biosynthesis in E. coli. Sci. Adv. 6, eaax0947 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Moffitt J. R., Lee J. B., Cluzel P., The single-cell chemostat: An agarose-based, microfluidic device for high-throughput, single-cell studies of bacteria and bacterial communities. Lab Chip 12, 1487–1494 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang P., et al. , Robust growth of Escherichia coli. Curr. Biol. 20, 1099–1103 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cai L., Dalal C. K., Elowitz M. B., Frequency-modulated nuclear localization bursts coordinate gene regulation. Nature, 455, 485–490 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Tay S., et al. , Single-cell nf-κb dynamics reveal digital activation and analogue information processing. Nature 466, 267–271 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Batchelor E., Alexander L., Mock C., Lahav G., Stimulus-dependent dynamics of p53 in single cells. Mol. Syst. Biol. 7, 488 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Hao N., O’Shea E. K., Signal-dependent dynamics of transcription factor translocation controls gene expression. Nat. Struct. Mol. Biol. 19, 31–39 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Locke J. C. W., Young J. W., Fontes M., Hernández Jiménez M. J., Elowitz M. B., Stochastic pulse regulation in bacterial stress response. Science 334, 366–369 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Norman T. M., Lord N. D., Paulsson J., Losick R., Stochastic switching of cell fate in microbes. Annu. Rev. Microbiol. 69, 381–403 (2015). [DOI] [PubMed] [Google Scholar]
  • 28.Levine J. H., Lin Y., Elowitz M. B., Functional roles of pulsing in genetic circuits. Science 342, 1193–1200 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Martinez-Corral R., Raimundez E., Lin Y., Elowitz M., Garcia-Ojalvo J., Self-amplifying pulsatile protein dynamics without positive feedback. Cell Syst. 7, 453–462.e1 (2018). [DOI] [PubMed] [Google Scholar]
  • 30.Lord N. D., et al. , Stochastic antagonism between two proteins governs a bacterial cell fate switch. Science 366, 116–120 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wada T., Hatamoto Y., Kutsukake K., Functional and expressional analyses of the anti-FlhD4C4 factor gene ydiV in Escherichia coli. Microbiology 158, 1533–1542 (2012). [DOI] [PubMed] [Google Scholar]
  • 32.Takaya A., et al. , YdiV: A dual function protein that targets FlhDC for ClpXP-dependent degradation by promoting release of DNA-bound FlhDC complex. Mol. Microbiol. 83, 1268–1284 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wada T., Tanabe Y., Kutsukake K., FliZ acts as a repressor of the ydiV gene, which encodes an anti-FlhD4C2 factor of the flagellar regulon in Salmonella enterica serovar typhimurium. J. Bacteriol. 193, 5191–5198 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hengge R., et al. , Systematic nomenclature for GGDEF and EAL domain-containing cyclic di-GMP turnover proteins of Escherichia coli. J. Bacteriol. 198, 7–11 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sato Y., Takaya A., Mouslim C., Hughes K. T., Yamamoto T., FliT selectively enhances proteolysis of FlhC subunit in FlhD4C2 complex by an ATP-dependent protease, ClpXP. J. Biol. Chem. 289, 33001–33011 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Davis J. H., Rubin A. J., Sauer R. T., Design, construction and characterization of a set of insulated bacterial promoters. Nucleic Acids Res. 39, 1131–1141 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Koirala S., et al. , A nutrient-tunable bistable switch controls motility in Salmonella enterica serovar typhimurium. mBio 5, e01611-14 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Stafford G. P., Ogi T., Hughes C., Binding and transcriptional activation of non-flagellar genes by the Escherichia coli flagellar master regulator FlhD2C2. Microbiology 151, 1779–1788 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Albanna A., et al. , Driving the expression of the Salmonella enterica sv Typhimurium flagellum using flhDC from Escherichia coli results in key regulatory and cellular differences. Sci. Rep. 8, 16705 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Korobkova E., Emonet T., Vilar J. M. G., Shimizu T. S., Cluzel P., From molecular noise to behavioral variability in a single bacterium. Nature 428, 574–578 (2004) [DOI] [PubMed] [Google Scholar]
  • 41.Tu Y., Grinstein G., How white noise generates power-law switching in bacterial flagellar motors. Phys. Rev. Lett. 94, 208101 (2005). [DOI] [PubMed] [Google Scholar]
  • 42.Susman L., et al. , Individuality and slow dynamics in bacterial growth homeostasis. Proc. Natl. Acad. Sci. U.S.A. 115, E5679–E5687 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Data Availability Statement

All study data are included in the article and SI Appendix.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES