Significance
Stochastic pulsatile dynamics are ubiquitous in organisms ranging from Escherichia coli to mammalian cells. The molecular mechanism for these stochastic pulses has long been thought to depend on feedback control that leads to bistability in the underlying system. Here, by quantitative modeling constrained by recent single-cell data in E. coli flagellar synthesis, we find that an inhibitor of the transcription factor creates a monotonic ultrasensitive switch that serves as a digital filter to eliminate small-amplitude input fluctuations. Additionally, the high-frequency input fluctuations are filtered by integration over time. Together, our results reveal that the filter-and-integrate design can generate stochastic pulses without feedback. This filter-and-integrate strategy allows cells to turn on gene expression only for persistently strong input signals.
Keywords: transcriptional pulses, gene regulation, stochastic modeling, filtering, ultrasensitivity
Abstract
Stochastic pulsatile dynamics have been observed in an increasing number of biological circuits with known mechanism involving feedback control and bistability. Surprisingly, recent single-cell experiments in Escherichia coli flagellar synthesis showed that flagellar genes are activated in stochastic pulses without the means of feedback. However, the mechanism for pulse generation in these feedbackless circuits has remained unclear. Here, by developing a system-level stochastic model constrained by a large set of single-cell E. coli flagellar synthesis data from different strains and mutants, we identify the general underlying design principles for generating stochastic transcriptional pulses without feedback. Our study shows that an inhibitor (YdiV) of the transcription factor (FlhDC) creates a monotonic ultrasensitive switch that serves as a digital filter to eliminate small-amplitude FlhDC fluctuations. Furthermore, we find that the high-frequency (fast) fluctuations of FlhDC are filtered out by integration over a timescale longer than the timescale of the input fluctuations. Together, our results reveal a filter-and-integrate design for generating stochastic pulses without feedback. This filter-and-integrate mechanism enables a general strategy for cells to avoid premature activation of the expensive downstream gene expression by filtering input fluctuations in both intensity and time so that the system only responds to input signals that are both strong and persistent.
The gene regulatory network for controlling the bacterial flagellar synthesis and assembly has long served as a canonical example of transcriptional cascade (1–8), in which a particular gene product (protein) can serve as the transcription factor (TF) for other genes, which encode proteins that serve as TFs for other downstream genes, and so on. The bacterial flagellar promoters are organized in three classes, each of which underlies specific stages of the complex flagellar assembly and chemosensory signaling processes (9–11). The class 1 promoter controls the transcription of flhDC, the operon encoding for the proteins FlhD and FlhC, that assemble into the heterohexamer (refer to as FlhDC hereafter), which is the master regulator of flagellar synthesis. The master regulator serves as the TF for class 2 promoters and initiates the transcription of seven operons necessary for the assembly of the basal body and the hook of the flagellum (12, 13). A specific class 2 protein, FliA, activates class 3 promoters to express class 3 genes coding proteins for the flagellar filament and the chemotaxis signaling pathway (14). The bacterial flagellar synthesis represents a remarkable process in which over 50 different genes are expressed in a coherent fashion over several generations to build a large functional protein complex. Over the past decades, experimental studies have yielded many valuable insights about the transcriptional network that underlies flagellar synthesis in Escherichia coli and Salmonella (9, 10, 15–18). In particular, population measurements suggested that the transcriptional cascade follows a deterministic temporal order where genes in subsequent classes are activated sequentially and, once turned on, promoters remain active continuously during exponential growth (10).
Recently, Kim et al. (19) measured gene expression dynamics of single E. coli cells over many generations by using a microfluidic device called the “mother machine” (20, 21). Surprisingly, the new single-cell data showed that flagellar promoters are stochastically activated in intermittent pulses, that is, bursts of strong activity amid a quiet background. Due to their stochastic nature, transcriptional pulses can only be detected by single-cell measurements, and they were missed in previous cell population study (10).
Indeed, rapid developments in quantitative single-cell measurements have revealed that quasi-periodic or stochastic pulsating dynamics is a common dynamic activity pattern in biological circuits in a wide range of organisms, for example, TF Msn2 in yeast (22, 23), TF p53 (24) and NF-B (25) in mammalian cells, bacterial TF in the bacterium Bacillus (26), and stochastic cell fate switching in microbes (27). Typically, the underlying mechanism involves bistability caused by a combination of positive and negative feedback loops (28); a recent theoretical study showed that stochastic pulses may also be caused by a negative feedback loop alone (29). Suffice to say, most known mechanisms for pulses involve some form of feedback control. However, in Kim et al. (19), pulses were present even when the promoter of FlhDC was deleted and replaced by a synthetic/constitutive promoter that does not respond to flagellar endogenous regulator and thus rules out the involvement of any feedback control. Additionally, in a different system that controls cell fate, stochastic pulses were also successfully generated by transferring, in E. coli, a reconstituted gene circuit without feedback (30).
Prompted by these recent observations, we ask the question, what are the underlying design principles that govern the stochastic transcriptional pulses without feedback? To answer this question, we use the large single-cell expression time series dataset for different strains and mutants from Kim et al. (19) to quantitatively and consistently constrain a system-level stochastic model. Using this approach, we aim to identify the minimum design principles necessary to generate stochastic transcriptional pulses without feedback in general, and specifically during E. coli flagellar synthesis.
Results
We first present a system-level (coarse-grained) stochastic model for the regulation of class 2 genes by FlhDC. We then use the model to analyze and explain the steady-state distributions of class 2 activity and its dynamics observed in single-cell experiments (19). Next, we propose possible molecular mechanisms underlying the main findings from our analysis and describe a filter-and-integrate mechanism for controlling noisy FlhDC signal in E. coli. Finally, we use our model to make testable predictions for responses to realistic time-varying signals.
A System-Level Stochastic Model for Class 1 and Class 2 Gene Expression Dynamics.
In the experiments by Kim et al. (19), they used strains whose expression of class 1 genes was controlled by promoters of different strengths (see SI Appendix, Supplementary Information A for details of the experimental data). For each strain, long time series ( generations) of class 1 and class 2 promoter activity were simultaneously measured within the same cells using fluorescent proteins as proxy. They found that distributions of class 2 promoter activity depend on the class 1 promoter strength as well as the presence or absence of the inhibitor YdiV (31–33) [also known as RflP (34)].
Here, we use the class 1 reporter fluorescence signal as a proxy to represent the FlhDC level as the input to the class 2 gene transcription process. Following Kim et al. (19), we define the class 2 gene expression activity (output), , as the time derivative of the fluorescence time series of the class 2 reporter gene, which is assumed to be proportional to the degree of activation of class 2 promoters (see SI Appendix, Supplementary Information A for details of computing and denoising from experimental data obtained in the “mother machine”). We propose a system-level stochastic model to describe dynamics of these two coarse-grained observables, and , without considering molecular details. In particular, the dynamics in a single cell is described by a stochastic differential equation (SDE) that has three basic terms: decay (degradation), activation (production), and noise,
| [1] |
where represents cells with and without YdiV, respectively, and is the decay rate with as the characteristic timescale associated with class 2 expression. For convenience, we write activation rate of the class 2 promoter as so that . We call the response function, as it characterizes how class 2 activity responds to class 1 protein level . The white noise has zero mean and correlation with noise strength given by . Here, we assume that the noise strength depends only on the average concentration , which is valid due to the timescale difference for and , as will become clear later in this work.
The activity of class 2 genes depends on the class 1 protein level , whose dynamics can be modeled by a similar SDE,
| [2] |
where is the decay time for class 1 protein. We define as the production rate, so that the steady-state average is with depending on the promoter strength of a specific strain. The white noise has zero mean and , with the noise strength as a constant for all promoter strengths. The noise is multiplied by in Eq. 2. This choice of a linear multiplicative noise is due to the fact that it provides a better fit to experimental data in comparison with other forms of noise such as additive and square root multiplicative noise (see SI Appendix, Supplementary Information B and Fig. S2 for a detailed comparison). Mechanistically, the multiplicative noise may originate from random partitioning of protein molecules during cell division and fluctuations in protein degradation rate.
An important assumption of our proposed model is that the observed class 2 pulses do not require transcriptional feedback on YdiV. Indeed, this assumption is fully supported by mother machine single-cell data published in Kim et al. (ref. 19, figure S(11)), which shows that, when the expression of YdiV is controlled by a synthetic constitutive promoter, the pulses are still present. Overall, our “minimal” system-level (coarse-grained) model consists of two SDEs (Eqs. 1 and 2) with a few physiologically meaningful parameters and functions: two timescales and , a constant multiplicative noise strength , a strain-dependent noise strength , and two response functions . In the following, by fitting all of the single-cell data for different strains (19) with our model quantitatively, we determine the parameters and response functions, which help us identify the underlying mechanism for transcriptional pulses.
YdiV Creates an Ultrasensitive Switch.
We first use our model to study all of the steady-state class 2 activity distributions from experiments (see Methods for details on simulations of our model). In Fig. 1A, experimentally observed distributions of the normalized class 2 activity for cells with and without YdiV for three class 1 promoter strengths—low (, red), medium (, green), and high (, blue)—are shown (see SI Appendix, Supplementary Information A for details of the experimental dataset). The most drastic difference between cells with and without YdiV is for strain that has a medium promoter strength similar to that of the wild-type cells. The distribution for the strain in the presence of YdiV (green line in Fig. 1 A, Left) is strongly asymmetric with a peak that is significantly shifted with respect to the mean, whereas the distributions for lower and higher promoter strengths are more symmetric. This asymmetry is largely absent in strains without YdiV (Fig. 1 A, Right). For simplicity, we fit the experimental data by using Hill functions for the response functions and tuning the shape of the Hill functions to minimize the difference between the distributions of and from the model and those from experiments for different promoter strengths (see SI Appendix, Supplementary Information C for details). Our model can explain all of the steady-state distributions from different strains together as shown by the direct comparison between the experimentally measured distributions (Fig. 1A) and those obtained from our model (Fig. 1B).
Fig. 1.
Comparison between experimental data and model results. (A) Distributions of the normalized class 2 activity in the presence (Left) and absence (Right) of YdiV for small (, red), medium (, green), and large (, blue) class 1 promoter strengths. Data are taken from ref. 19. We use the black dashed lines to indicate the positions of its mean value and zero for . (B) The corresponding distributions obtained from our model. (C) Comparison of typical time series of and from experiments and model for the strain with and without YdiV. (D) Response functions obtained by fitting our model to experimental data (19) with (, solid line) and without (, dashed line) YdiV. The circles (+YdiV) and crosses (−YdiV) indicate the mean value from different mother cells in experiments. Note that does not pass through the center of the data points, due to its high nonlinearity . The color code for each promoter is the same as in A and B.
The most significant difference in the dynamics of class 2 activity between cells with and without YdiV also occurs in strain with a medium class 1 promoter strength similar to that of wild-type cells. As shown in Fig. 1C, from the experiments, the dynamics contains well-defined pulses in the presence of YdiV, while pulses are not well separated and the presence of higher-frequency noise appears to be unfiltered in the absence of YdiV. These different activity dynamics are reproduced in our model.
By fitting our model to the single-cell data (see SI Appendix, Supplementary Information C for details of the fitting and values of all of the fitted model parameters), we obtain the response functions, and , which cannot be directly determined from experimental data because , due to their nonlinearity. As shown in Fig. 1D (solid line), the response function in the presence of YdiV behaves like a monotonic ultrasensitive switch, that is, it has a much larger Hill coefficient () than the Hill coefficient () for (see SI Appendix, Supplementary Information C for details). Due to this ultrasensitive (steep) response function, when the mean FlhDC concentration is close to the threshold (e.g., for strain ), the distribution of has the largest asymmetry, and there is a significant shift in the peak of the distribution (Fig. 1 A, Left and B, Left). In the absence of YdiV, the response function is found to be less steep (dotted line in Fig. 1D), and the probability distributions are more symmetric (Fig. 1 A, Right and B, Right). Since we only used experimental data from three strains () to fit (calibrate) our model to determine the response functions and other parameters (such as and ), we can test the consistency of our model by applying it to other strains that are not used in the fitting. In particular, we ran simulations of our model for two other strains, and , with different promoter strengths but with all other parameters and the response functions fixed. As shown in SI Appendix, Fig. S5, the distributions for and obtained from our model agree well with experiments, and the average values of are also consistent with the experiments (SI Appendix, Fig. S4).
The agreements between our model and the experiments not only validate our model, but, more importantly, they reveal that the role of YdiV is to create a ultrasensitive (steep) response of the class 2 activity to the class 1 protein (FlhDC) concentration.
Memory Effect and Integration over Time.
Next, we use our model to analyze the stochastic gene expression dynamics in the flagellum system. As shown in Eq. 1, there are two noise sources for the class 2 activity : the intrinsic noise in the gene expression process and the extrinsic noise due to fluctuations of the FlhDC concentration . From our model, the mean class 2 promoter activity averaged over the intrinsic noise can be written in the following integral form:
| [3] |
where is the integral kernel (Green’s function) that decays with the timescale . Eq. 3 means that depends on the values of in a time window ; that is, there is a memory effect. Given that varies with a timescale , the strength of this memory effect depends on the ratio of the two timescales . For , the kernel and ; that is, the class 2 activity depends on the instantaneous level of class 1 protein with negligible memory effect. However, if , the memory effect becomes significant. In the limit , becomes a constant independent of the instantaneous FlhDC level ( is the steady-state distribution function).
To demonstrate the memory effect, we divide the FlhDC concentration in equally spaced bins (in log-scale) and then compute the class 2 activity and FlhDC concentration averaged within each bin to form the so-called “bin-averaged response curve” (BARC) for each strain with a different class 1 promoter strength (see Methods for details on BARC). As shown in Fig. 2A, when , the BARCs for different strains can be shifted from each other, consistent with experiments. This means that the class 2 activity is not determined just by the instantaneous FlhDC concentration; it depends also on its average over previous times, due to the memory effect. Indeed, as shown in Fig. 2B, in the absence of memory effect when , all BARCs collapse.
Fig. 2.
The bin-averaged response curves and the ratio of two timescales . (A) Model results for . Inset shows the same behavior from experimental data. (B) Model results for . We used −YdiV strains here; see SI Appendix, Fig. S7 for similar results for +YdiV strains. (C and D) The normalized power spectra (green lines) from our model for (C) and (D) for the strain. The normalized power spectrum (black line) from experiments () is also shown for comparison. We used within the range of correlation time estimated from experiments.
When , depends on an integral of the transformed input signal over a time window as shown in Eq. 3. This integration effectively averages the input signal in time and eliminates high-frequency fluctuations as demonstrated in Fig. 2 C and D, where power spectra for the two cases with and without integration are shown. The power spectrum from experimental data is consistent with our model, with where the power of the spectrum of is concentrated in the low-frequency regime with a similar half-maximum-power frequency as shown in Fig. 2C. For (Fig. 2D), the power spectrum is much broader, with significant high-frequency fluctuations and that is much higher than that from experiments. Quantitatively, by comparing BARCs and autocorrelation functions from the model and those from experiments, we estimate the timescale ratio to be in the range (see SI Appendix, Supplementary Information F for details).
A YdiV-Mediated Sequestration Mechanism for Filtering.
So far, we have used our model to explain the steady-state distributions and dynamics of the noisy single-cell data. In this section, we propose possible molecular mechanisms underlying our findings, with a focus on the role of YdiV in regulating FlhDC signal.
We first briefly discuss possible molecular origins of separation of timescales. The shorter timescale is most likely related to the degradation time of the FlhC, and FlhD monomers and the longer timescale are related to the decay time of the FlhDC-dependent class 2 promoter activation (see Fig. 3A for an illustration). Since FlhD and FlhC can only serve as functional TF when they form heterohexamer FlhDC, is affected by the binding affinity of FlhDC to class 2 promoters as well as stability of the FlhDC complex. The heterohexamer is more stable than the monomers (35), which is consistent with as observed in experiments and our model. It would be interesting to test the effects of changing experimentally (see Summary and Discussion for more details).
Fig. 3.
The YdiV-mediated sequestration mechanism for switch like behavior. (A) Schematic representation of the mechanism of transcriptional regulation of class 2 promoters by FlhDC. The class 1 promoter (purple) initiates the expression of the monomers FlhC and FlhD (yellow and orange, respectively) that forms the heterohexamer (FlhDC). The timescale may be related to the degradation rate of the monomers (FlhD and FlhC), while may be related to the degradation rate of FlhDC or its unbinding rate to the promoters, both labeled by red arrows. The activation of class 2 genes, which depends on binding of the FlhDC complex to the class 2 promoter (green), is inhibited when FlhDC is sequestered by binding to YdiV (blue) instead (gray box). (B) Response curves obtained from the YdiV sequestration model for different values of YdiV concentrations ( is the total class 2 promoter concentration); see Methods for details. (Inset) The response functions, (blue) and (red), in the presence and absence of YdiV, respectively, from fitting experimental data (the same as in Fig. 1D). (C) Hill coefficient () and the half-maximum concentration () as a function of the YdiV concentration.
We now study the role of YdiV in creating an ultrasensitive switch-like response of class 2 gene expression as shown in Fig. 1D. YdiV is known to have two main effects on FlhDC. First, by occupying the binding sites of FlhDC, YdiV prevents the binding of FlhDC to free class 2 promoters (32). Second, YdiV mediates the interaction between FlhDC and the degradation complex ClpXP (31, 32). Here, we only consider the effect of competitive binding on the response function; similar effect may be achieved by enhancing the degradation of FlhDC, which is not considered in this study. Intuitively, due to competitive binding, YdiV can introduce a threshold for FlhDC by sequestrating FlhDC to prevent it from binding to and activating the class 2 promoters.
To show the qualitative effects of this YdiV-mediated sequestration mechanism, we developed a simple model based on competitive binding of FlhDC by YdiV and the class 2 promoter as illustrated in Fig. 3A (see Methods for details of the sequestration model). In Fig. 3B, the fraction of promoters bound to FlhDC, which serves as a measure of the class 2 gene expression activity, is shown as a function of the scaled FlhDC concentration , with as the total promoter concentration, for different values of total YdiV concentration. We can see that higher YdiV concentrations lead to delayed but steeper response curves, which agrees qualitatively with the response curves and shown in Fig. 3 B, Inset for comparison. A response curve can be described by its effective Hill coefficient and its half-maximum concentration . From the experimental data and our model, both the Hill coefficient and are larger in wild-type cells than those in the mutant. As shown in Fig. 3C, the YdiV-mediated sequestration model reproduces this general trend, with both and increasing with YdiV concentration.
The steep response function suggests that YdiV serves (approximately) as a digital filter. When the input signal, that is, FlhDC concentration (), is lower than a threshold set by the YdiV concentration , the normalized output is close to 0. When , the normalized output is close to 1.
One important ingredient in our model is the inhibitory effect of YdiV on class 2 promoter activity. Therefore, changing YdiV at a fixed average FlhDC level can modulate the class 2 promoter activity. In particular, the model predicts that the increase in YdiV levels raises the threshold of class 1 concentration that needs to be reached to have a nonnegligible class 2 response. More technically, the constant increases with the concentration of YdiV. This results in a shift in the class 2 distributions toward lower mean values for larger YdiV levels. We have tested this prediction by using flow cytometry to quantify the distributions of class 2 expression in two strains, ProBY and ProDY (36), which have higher levels of YdiV expression than in wild type (see SI Appendix, Supplementary Information E for details of the experiments). As shown in SI Appendix, Fig. S6, we find that, when the YdiV level is increased, there is a shift in the class 2 gene expression distributions toward lower values consistent with our model results.
A Filter-and-Integrate Mechanism for Stochastic Pulses without Feedback.
Put together, our results show that E. coli uses a combined filter-and-integrate strategy (mechanism) to control its class 2 gene expression. As shown in Fig. 4, in the presence of YdiV (left side of Fig. 4), the input signal, that is, the FlhDC concentration (blue line), which fluctuates relatively fast at a short timescale , first goes through a highly nonlinear ultrasensitive response function , which serves approximately as a digital filter that eliminates the FlhDC fluctuations lower than certain threshold set by the YdiV concentration. The filtered signal (green line) has a nearly digital stochastic pulse-like pattern.
Fig. 4.
The filter-and-integrate mechanism for pulse generation. In the presence of YdiV, the FlhDC concentration (blue line), which fluctuates with a fast timescale , is filtered by the inhibitory effect of YdiV. The filtered signal (black line), which has a pulse-like pattern, is then integrated over a longer timescale to determine the class 2 activity (red line). In the absence of YdiV, there is no filtering effect, and the resulting dynamics is noisier, with no pulse. Without integration (), would have many short spikes when the signal is filtered, and it resembles the original noisy signal () in the absence of filtering. Results presented here are obtained from simulations of our model using parameters fitting the strain.
The second part of the two-pronged process is to integrate the filtered signal over a timescale , which gives rise to the intermittent “strong” pulses in (red line). Without the integration process , the activity would have many short spikes, too short to complete the flagellar synthesis process. The integration process acts as a filter in frequency space, as demonstrated in Fig. 2 C and D, to suppress those FlhDC fluctuations that are strong in intensity but short in time duration. As a result, class 2 gene expression is turned on only when there are persistently high levels of FlhDC signal. Due to the same integration (memory) effect, once turned on, the class 2 expression remains active for a time duration of even without strong instantaneous FlhDC signal. This long duration of on time is necessary for the cell to finish the flagellar synthesis process that lasts for several generations.
Both filtering and integration in time are critical to generate the stochastic pulses observed in experiments (19) where alternates between having near-zero activity (quiet) for a long period and being highly active during a strong pulse that lasts for a long timescale set by (bottom left in Fig. 4). In the absence of YdiV, the noisy signal is not filtered, and, as a result, there is no pulse in , which fluctuates continuously (bottom right in Fig. 4). In the absence of integration, follows the same dynamics as the filtered signal , which has many short spikes (bottom middle in Fig. 4).
Predicted Gene Expression Responses to Time-Varying Signals.
Now that our model is fully developed with all its parameters determined by existing single-cell experiments (19), it provides a powerful tool to predict class 2 gene expression responses to more realistic time-dependent signals without any tuning parameter. As an example, we use our model to study the responses of class 2 activity to an increase of the class 1 promoter strength from a low value to a series of higher values at time . We find that a cell will turn on the class 2 gene expression after a delay time , as shown in Fig. 5A. The delay time varies from cell to cell and follows a distribution that depends on . In Fig. 5B, the delay time distributions for different values of are shown. Both the average delay time and its variance decrease with or, equivalently, the average FlhDC level as shown in Fig. 5C. Responses to other time-varying stimuli such as ramps and oscillatory signals can be studied by our model similarly.
Fig. 5.
The predicted responses to step stimuli. (A) Predicted dynamics of and in response to a time-dependent (step function) stimulus in which the class 1 promoter strength is changed at from a low value to a higher value . The delay time is the time duration from the stimulus () to the onset of class 2 expression, which is defined as the time when crosses a threshold (qualitative results do not depend on the choice of ). (B) Distributions of the delay time (normalized by ) for different values of . (C) Both the average and variance of decreases with or the average FlhDC concentration. The values of for strains and used in our study are marked for reference.
Our results suggest that, even though FlhDC concentration does not control the class 2 gene expression deterministically, the filter-and-integrate mechanism allows the cell to control the timing statistics of the class 2 expression based on the average FlhDC signal intensity. As shown in Fig. 5C, a higher FlhDC concentration induces (on average) a shorter and more accurate (smaller coefficient of variation) onset time to turn on the class 2 genes. Similarly, an elevated average FlhDC level leads to more frequent occurrence of the transcriptional pulses. These model predictions can be tested in future single-cell experiments by using inducible class 1 promoters.
Summary and Discussion
In this paper, dynamics of the class 1 and class 2 promoters in E. coli flagellar synthesis are studied by using a minimal stochastic model that captures the essential characteristics of the underlying system. Applying our stochastic model to a large single-cell dataset quantitatively shows that the transcriptional pulses are generated by a filter-and-integrate mechanism without the need of feedback control. Both the YdiV-enabled filtering and the memory-enabled time integration are crucial to average out FlhDC fluctuations in intensity and in time so that an individual cell can make a “calculated” (informed) decision on whether to turn on the expensive class 2 gene expression. This mechanism or a variation of it may be at work in other pulse-generating systems without feedback. For example, in the reconstituted SinI−SinR circuit (30), the antagonist SinI may play the critical role of the filter (analogous to YdiV) by binding to and inhibiting the transcriptional regulator SinR (analogous to FlhDC). On the other hand, while the timescale of the class 2 activity () pulses in flagellar synthesis is provided by the integration time , the pulse timescale in the SinI−SinR system may be controlled directly by the timescale of the SinI fluctuations (30).
To better understand the feedbackless filter-and-integrate strategy in E. coli, we compare it with a closely related bacterium Salmonella, which does have an additional positive feedback loop in controlling its class 2 promoter activity. As illustrated in Fig. 6, FliZ, a class 2 protein, can inhibit YdiV in Salmonella but not in E. coli. This positive feedback loop in Salmonella leads to bistability in class 2 gene response (37). As a result, the hysteretic response function in Salmonella has two critical points and , as illustrated in Fig. 6B. Due to the bistability, once the FlhDC level goes over the upper critical level even for a short time, the response signal will stay on (green) for a much longer time, as long as FlhDC level is higher than the lower critical level . This amplification of the “on” time duration allows Salmonella to activate class 2 gene expression upon detecting a large-but-short signal. In contrast, as shown in Fig. 6A, the response function for E. coli, though steep, does not have bistability, because there is no feedback. As a result, an elevated FlhDC level over the threshold () for a short time is not enough to trigger significant class 2 activity, given the large integration time .
Fig. 6.
The different gene regulation strategies for E. coli and Salmonella. (A) The E. coli network has the YdiV-mediated inhibition of the class 1 master regulator (FlhDC) but not the feedback mechanism from class 2 gene. As a result, E. coli has a steep but monotonic response function with a threshold ; that is, class 2 gene expression is activated (green) when or inhibited (red) when . For an FlhDC signal that passes the threshold briefly, the class 2 activity is only turned on for a short time, which is not enough to start the flagella synthesis, given the long integration time for class 2 gene activation. (B) In Salmonella, there is a feedback (blue line) from a class 2 protein (FliZ) to suppress the inhibitor YdiV. This feedback mechanism leads to a bistable (hysteretic) response of class 2 genes in the range . As a result, a brief period of elevated FlhDC concentration over the (upper) threshold can cause a prolonged class 2 gene activation as long as , which can lead to flagella synthesis.
Their different strategies may reflect the different purposes of motility for these two bacteria. Salmonella is a pathogen and activates the flagellar gene expression cascade in nutrient-rich conditions. A short pulse of strong stimulus from the host may trigger the flagellar synthesis process. This aggressive strategy may be optimized for host colonization and infection. In E. coli, however, flagellar synthesis is activated in nutrient-poor conditions for the purpose of foraging and searching for nutrients. Therefore, E. coli may adopt a more prudent (filter-and-integrate) strategy that only triggers the expensive flagellar synthesis process when the cell experiences persistent poor nutrient conditions.
Our current analysis opens up a few directions for future investigation. Experimentally, effects of the filter-and-integration mechanism may be tested by perturbing the two key factors, for example, by tuning the strength of the interaction between FlhDC and class 2 promoters (32, 38) or by reducing the effective concentration of YdiV by adding Salmonella FlhC known to have weaker affinity for E. coli promoters but the same affinity for YdiV as E. coli FlhC (39). It would be highly informative to measure single-cell gene expression dynamics of Salmonella in the mother machine as done for E. coli (19), to understand its gene regulation strategy quantitatively. In our current model, YdiV level is assumed to be constant as we focus on the long timescale (low frequency) dynamics of the system. However, in the high-frequency regime, the power spectrum from experiments follows ( is the frequency), which decays slower than the power spectrum from our current model with a fixed YdiV level (SI Appendix, Fig. S10). A similar noise spectrum was observed in flagellar motor switching dynamics (40) and was subsequently explained by considering fluctuations of the response regulator CheY-P in a single cell (41). It would be interesting to include dynamics of YdiV in our model to explore whether it also leads to the observed spectrum in the high-frequency regime. Furthermore, to fully understand how the class 2 gene expression response depends on YdiV, molecular details of the FlhDC heterohexamer complex formation, its binding (unbinding) to DNA, and its degradation mediated by YdiV need to be considered in addition to the YdiV-mediated sequestration studied here.
Finally, our work here demonstrates that system-level stochastic modeling augmented by quantitative single-cell measurements for different strains and mutants provides a powerful tool to decipher principles of gene regulation from inherently noisy single-cell data. This tool becomes necessary when dealing with complex gene regulatory circuits such as the full bacterial flagellar gene expression cascade with multiple feedback controls. The general system-level stochastic modeling approach with coarse-grained variables that can be directly compared with experiments should be applicable to study stochastic expression dynamics in other gene regulatory systems with or without feedback (23–26, 30, 42).
Methods
Simulation of the Stochastic Differential Equations.
For the numerical simulation of the stochastic equations, we write Eq. 2 in discrete form,
| [4] |
where is a stochastic variable following a Gaussian distribution . We let to set the timescale, and the time step . The discrete form of Eq. 1 is
| [5] |
where is a Gaussian random number as .
Bin-Averaged Response Curves and Flatness.
To obtain the BARCs, we divide the full interval into 20 logarithmically spaced bins and calculate the average of class 1 concentration and class 2 activity for all of the data points in each bin. This bin-averaged class 2 activity versus the bin-averaged class 1 concentration is the BARC. The BARCs in Fig. 2 are only shown at those bins that contain more than 5,000 data points, for better statistics.
Steady-State Solution of the Sequestration Model.
Let be the concentration of promoters, be the total concentration of FlhDC, and be the YdiV concentration. We denote and as the concentrations of FlhDC that are bound to the promoters and YdiV, respectively. In steady state, the balance of binding and unbinding reactions leads to , and , where and are the dissociation constants of FlhDC to the promoters and YdiV, respectively. is the free FlhDC concentration, and it satisfies the following equation (conservation of FlhDC):
| [6] |
which can be solved to determine as a function of , , , , and . The fraction of FlhDC bound promoters is , which is used as a measure of the class 2 activity. The dependence of on is plotted in Fig. 3B. Parameters and are used in Fig. 3 B and C.
Supplementary Material
Acknowledgments
We thank Drs. Y. Lin and M. Erhardt for discussions. The work by A.S.S. and Y.T. is supported by NIH Grant R35GM131734 to Y.T. The work by M.G.-A., M.J.K., and P.C. is supported by NIH Grant R01GM134275 to P.C. and NSF Grant 1615487 to P.C. M.G.-A. acknowledges support from Consejo Nacional de Ciencia y Tecnología (CONACYT) through a PhD scholarship and Dirección General de Asuntos del Personal Académico - Universidad Nacional Autónoma de México (DGAPA-UNAM) for support through Grant PAPIIT-IN226917.
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2010849117/-/DCSupplemental.
Data Availability.
All study data are included in the article and SI Appendix.
References
- 1.Martinez-Antonio A., Janga S., Thieffry D., Functional organisation of Escherichia coli transcriptional regulatory network. J. Mol. Biol. 381, 238–247 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bähler J., A transcriptional pathway for cell separation in fission yeast. Cell Cycle 4, 39–41 (2005). [DOI] [PubMed] [Google Scholar]
- 3.Allan D. W., Thor S., Transcriptional selectors, masters, and combinatorial codes: Regulatory principles of neural subtype specification. Wiley Interdiscip. Rev. Dev. Biol. 4, 505–528 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Spöring I., et al. , Regulation of flagellum biosynthesis in response to cell envelope stress in Salmonella enterica serovar typhimurium. mBio 9, e00736-17 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Soutourina O. A., Bertin P. N., Regulation cascade of flagellar expression in Gram-negative bacteria. FEMS Microbiol. Rev. 27, 505–523 (2003). [DOI] [PubMed] [Google Scholar]
- 6.Jacob F., Monod J., Genetic regulatory mechanisms in the synthesis of proteins. J. Mol. Biol. 3, 318–56 (1961). [DOI] [PubMed] [Google Scholar]
- 7.Shen-Orr S. S., Milo R., Mangan S., Alon U., Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002). [DOI] [PubMed] [Google Scholar]
- 8.Ghafari M., Mashaghi A., On the role of topology in regulating transcriptional cascades. Phys. Chem. Chem. Phys. 19, 25168–25179 (2017). [DOI] [PubMed] [Google Scholar]
- 9.Chilcott G. S., Hughes K. T., Coupling of flagellar gene expression to flagellar assembly in Salmonella enterica serovar typhimurium and Escherichia coli. Microbiol. Mol. Biol. Rev. 64, 694–708 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kalir S., et al. , Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science 292, 2080–2083 (2001). [DOI] [PubMed] [Google Scholar]
- 11.Apel D., Surette M. G., Bringing order to a complex molecular machine: The assembly of the bacterial flagella. Biochim. Biophys. Acta 1778, 1851–1858 (2008). [DOI] [PubMed] [Google Scholar]
- 12.Ikebe T., Iyoda S., Kutsukake K., Promoter analysis of the class 2 flagellar operons of Salmonella. Genes Genet. Syst. 74, 179–183 (1999). [DOI] [PubMed] [Google Scholar]
- 13.Liu X., Matsumura P., Differential regulation of multiple overlapping promoters in flagellar class II operons in Escherichia coli. Mol. Microbiol. 21, 613–620 (1996). [DOI] [PubMed] [Google Scholar]
- 14.Fitzgerald D. M., Bonocora R. P., Wade J. T., Comprehensive mapping of the Escherichia coli flagellar regulatory network. PLoS Genet. 10, e1004649 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Komeda Y., Transcriptional control of flagellar genes in Escherichia coli K-12. J. Bacteriol. 168, 1315–1318 (1986). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kutsukake K., Ohya Y., Iino T., Transcriptional analysis of the flagellar regulon of Salmonella typhimurium. J. Bacteriol. 172, 741–747 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Brown J. D., et al. , The rate of protein secretion dictates the temporal dynamics of flagellar gene expression. Mol. Microbiol. 70, 924–937 (2008). [DOI] [PubMed] [Google Scholar]
- 18.Saini S., Brown J. D., Aldridge P. D., Rao C. V., FliZ Is a posttranslational activator of FlhD4C2-dependent flagellar gene expression. J. Bacteriol. 190, 4979–4988 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kim J. M., Garcia-Alcala M., Balleza E., Cluzel P., Stochastic transcriptional pulses orchestrate flagellum biosynthesis in E. coli. Sci. Adv. 6, eaax0947 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Moffitt J. R., Lee J. B., Cluzel P., The single-cell chemostat: An agarose-based, microfluidic device for high-throughput, single-cell studies of bacteria and bacterial communities. Lab Chip 12, 1487–1494 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang P., et al. , Robust growth of Escherichia coli. Curr. Biol. 20, 1099–1103 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cai L., Dalal C. K., Elowitz M. B., Frequency-modulated nuclear localization bursts coordinate gene regulation. Nature, 455, 485–490 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tay S., et al. , Single-cell nf-b dynamics reveal digital activation and analogue information processing. Nature 466, 267–271 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Batchelor E., Alexander L., Mock C., Lahav G., Stimulus-dependent dynamics of p53 in single cells. Mol. Syst. Biol. 7, 488 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hao N., O’Shea E. K., Signal-dependent dynamics of transcription factor translocation controls gene expression. Nat. Struct. Mol. Biol. 19, 31–39 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Locke J. C. W., Young J. W., Fontes M., Hernández Jiménez M. J., Elowitz M. B., Stochastic pulse regulation in bacterial stress response. Science 334, 366–369 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Norman T. M., Lord N. D., Paulsson J., Losick R., Stochastic switching of cell fate in microbes. Annu. Rev. Microbiol. 69, 381–403 (2015). [DOI] [PubMed] [Google Scholar]
- 28.Levine J. H., Lin Y., Elowitz M. B., Functional roles of pulsing in genetic circuits. Science 342, 1193–1200 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Martinez-Corral R., Raimundez E., Lin Y., Elowitz M., Garcia-Ojalvo J., Self-amplifying pulsatile protein dynamics without positive feedback. Cell Syst. 7, 453–462.e1 (2018). [DOI] [PubMed] [Google Scholar]
- 30.Lord N. D., et al. , Stochastic antagonism between two proteins governs a bacterial cell fate switch. Science 366, 116–120 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wada T., Hatamoto Y., Kutsukake K., Functional and expressional analyses of the anti-FlhD4C4 factor gene ydiV in Escherichia coli. Microbiology 158, 1533–1542 (2012). [DOI] [PubMed] [Google Scholar]
- 32.Takaya A., et al. , YdiV: A dual function protein that targets FlhDC for ClpXP-dependent degradation by promoting release of DNA-bound FlhDC complex. Mol. Microbiol. 83, 1268–1284 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wada T., Tanabe Y., Kutsukake K., FliZ acts as a repressor of the ydiV gene, which encodes an anti-FlhD4C2 factor of the flagellar regulon in Salmonella enterica serovar typhimurium. J. Bacteriol. 193, 5191–5198 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hengge R., et al. , Systematic nomenclature for GGDEF and EAL domain-containing cyclic di-GMP turnover proteins of Escherichia coli. J. Bacteriol. 198, 7–11 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sato Y., Takaya A., Mouslim C., Hughes K. T., Yamamoto T., FliT selectively enhances proteolysis of FlhC subunit in FlhD4C2 complex by an ATP-dependent protease, ClpXP. J. Biol. Chem. 289, 33001–33011 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Davis J. H., Rubin A. J., Sauer R. T., Design, construction and characterization of a set of insulated bacterial promoters. Nucleic Acids Res. 39, 1131–1141 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Koirala S., et al. , A nutrient-tunable bistable switch controls motility in Salmonella enterica serovar typhimurium. mBio 5, e01611-14 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Stafford G. P., Ogi T., Hughes C., Binding and transcriptional activation of non-flagellar genes by the Escherichia coli flagellar master regulator FlhD2C2. Microbiology 151, 1779–1788 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Albanna A., et al. , Driving the expression of the Salmonella enterica sv Typhimurium flagellum using flhDC from Escherichia coli results in key regulatory and cellular differences. Sci. Rep. 8, 16705 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Korobkova E., Emonet T., Vilar J. M. G., Shimizu T. S., Cluzel P., From molecular noise to behavioral variability in a single bacterium. Nature 428, 574–578 (2004) [DOI] [PubMed] [Google Scholar]
- 41.Tu Y., Grinstein G., How white noise generates power-law switching in bacterial flagellar motors. Phys. Rev. Lett. 94, 208101 (2005). [DOI] [PubMed] [Google Scholar]
- 42.Susman L., et al. , Individuality and slow dynamics in bacterial growth homeostasis. Proc. Natl. Acad. Sci. U.S.A. 115, E5679–E5687 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All study data are included in the article and SI Appendix.






