Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2019 Nov 15;15(11):e1007497. doi: 10.1371/journal.pcbi.1007497

An in silico analysis of robust but fragile gene regulation links enhancer length to robustness

Kenneth Barr 1,*, John Reinitz 2, Ovidiu Radulescu 3,*
Editor: Ilya Ioshikhes4
PMCID: PMC6881076  PMID: 31730659

Abstract

Organisms must ensure that expression of genes is directed to the appropriate tissues at the correct times, while simultaneously ensuring that these gene regulatory systems are robust to perturbation. This idea is captured by a mathematical concept called r-robustness, which says that a system is robust to a perturbation in up to r − 1 randomly chosen parameters. r-robustness implies that the biological system has a small number of sensitive parameters and that this number can be used as a robustness measure. In this work we use this idea to investigate the robustness of gene regulation using a sequence level model of the Drosophila melanogaster gene even-skipped. We consider robustness with respect to mutations of the enhancer sequence and with respect to changes of the transcription factor concentrations. We find that gene regulation is r-robust with respect to mutations in the enhancer sequence and identify a number of sensitive nucleotides. In both natural and in silico predicted enhancers, the number of nucleotides that are sensitive to mutation correlates negatively with the length of the sequence, meaning that longer sequences are more robust. The exact degree of robustness obtained is dependent not only on DNA sequence, but also on the local concentration of regulatory factors. We find that gene regulation can be remarkably sensitive to changes in transcription factor concentrations at the boundaries of expression features, while it is robust to perturbation elsewhere.

Author summary

Robustness assures that organisms can survive when faced with unpredictable environments or genetic mutations. In this work, we characterize the robustness of gene regulation using an experimentally validated model of the regulation of the Drosophila gene even-skipped. We use a mathematically precise definition of robustness that allows us to make quantitative comparisons of robustness between different genetic sequences or between different nuclei. From this analysis, we found that genetic sequences that were not previously known to be important for gene regulation reduce sensitivity to genetic perturbation. In contrast, we found that gene regulation can be very sensitive to the concentrations of regulators. This extreme sensitivity was only observed at the boundaries of expression features, where switch-like behavior is desirable. This highlights the importance of considering context when assessing robustness.

Introduction

Biological systems must be robust to perturbations, both environmental and genetic, in order to maintain their functions in fluctuating circumstances [1]. Some of their robustness properties, such as noise reduction, are shared with general cybernetic systems. Others, such as the relationship between robustness and evolvability are specific to living systems. Robustness of general cybernetic systems was studied mathematically by von Neumann [2]. von Neumann employed multiplexing and majority rule with Boolean automata, an approach that captured buffering by redundancy of the error control, but failed to treat flexibility and the possibility of adaptation underlying biological robustness. Many authors tried to describe the peculiarities of biological robustness using metaphors such as robust-yet-fragile [3], René’s Thom’s theory of catastrophes [46] or control theory [7]. Although, like von Neumann’s automata, biological organisms are remarkably robust to uncertainty in their components, they can be strongly influenced by small perturbations that act on sensitive elements of regulatory control networks [1]. As shown by large scale multiple knock-out analysis of metabolic networks in yeast, regulatory networks are robust with respect to single gene mutations, but become sensitive when multiple genes are mutated [8]. It has also been shown using mathematical modeling that segmentation patterns in the Drosophila embryo are robust to changes of single parameters [9]. Similarly, the accuracy of the gap gene response to a maternal gradient is unaffected by mutations of a single gene [10], but is, as predicted by theory, sensitive to a double mutation [11, 12]. Furthermore, the relation between robustness, redundancy and complexity is particularly relevant to biological systems. It has been suggested, that, contrary to the common belief that simple systems are robust, a certain degree of complexity can also lead to stable behavior [13, 14]. This property follows from the very general mathematical principle of measure concentration in high dimension [15, 16], as first discussed in Gorban and Radulescu [13]. The law of large numbers is an instance of this principle, ensuring that non-correlated variation of additive effects is buffered and vanishes when the number of elements increases. Within the same formalism, the concept “robust yet fragile” is made precise by the idea of r-robustness: a functional property can be stable with respect to perturbation of up to r − 1 randomly chosen parameters and sensitive when r parameters are varied simultaneously [13]. Robust biological systems include organismal development, where a form of robustness called canalization assures that all individuals arrive at the same phenotype despite individual genetic variation [1719]. Genetic and signal transduction network models have provided mechanistic explanations of developmental robustness and canalization in Drosophila [11, 12], C. elegans [20], and S. purpuratus [21].

In gene networks, the connections between genes represent regulatory interactions which control levels of expression of genes through cis-regulatory elements, typically 500 to 1000 basepairs (bp) in length, called enhancers [22]. These sequences contain clusters [23, 24] of binding sites for transcription factors (TFs) that act in combination to direct gene expression in specific spatial domains or tissues. While it is well understood how the dynamics of developmental networks confer robustness to the system, it is poorly understood how or if the enhancers that control these networks contribute to robustness through their organization. An important property of enhancers is their redundancy, which is seen at two levels. The clusters of binding sites in an enhancer typically include multiple binding sites for the same TF, conferring a many-to-one relationship between TF binding sites and gene expression [25]. At a higher level, multiple enhancers can control expression in the same expression domain or tissue type [26], and such “shadow” enhancers are known to increase robustness [2732]. These specific experimental findings have remained largely unaddressed at the theoretical level.

Previously described data driven and experimentally well tested models of Drosophila development are an ideal system for the theoretical study of robustness. Confocal microscopy has been used to generate spatial and temporal atlases of protein and mRNA levels at single nucleus resolution during the first 4 hours of development [3338]. These data provide the basis of sequence level models of gene regulation, which predict gene expression levels as a function of protein levels and DNA sequence [3948]. Using such models, it is possible to address how the general principles of gene regulation can confer robustness to mutations in enhancers.

In this work we use a previously reported model of gene regulation [47] to model the robustness of the Drosophila even-skipped (eve) locus with respect to variation in both TF concentration and DNA sequence. This model is described fully in the Appendix to this work and its main features are listed in the Results section. eve codes for the homeodomain protein Eve, whose expression forms seven sharply located stripes, necessary for the formation of parasegments during embryonic development [49]. We specifically assess two types of robustness: distributed robustness and r-robustness [13]. The robustness of this gene regulation model has never been investigated. The previously introduced robustness concepts have only been tested on a signalling network model and never in developmental biology. We find that the regulation of eve can be extraordinarily sensitive to certain changes in TF concentrations, a property that may help form sharp borders in expression domains. We also find that this regulation is r-robust with respect to sequence mutation. Expression of eve is sensitive only to changes in a few nucleotides of the enhancer. Finally, we show that the number of sensitive nucleotides decreases in longer enhancer sequences from both natural and in silico generated enhancers, indicating that enhancer length confers robustness to genetic perturbation. We thus provide a computational proof of the importance of enhancer length for the robustness of the gene regulation.

Results

Distinguishing types of robustness

Distributed and r-robustness arise in complex systems whose properties depend on a large number of parameters. In the former case, the effect of a single perturbation is small and grows very slowly with the number and size of perturbations. In the latter, weaker case, the system is insensitive to the majority of perturbations, excepting the perturbation of a few sensitive parameters. Gorban and Radulescu [13] formalized these types of robustness and investigated the robustness of a well described signaling pathway. In this work we follow the definitions laid out in Gorban and Radulescu [13, Eqs 1 and 2]. We consider the robustness of a positive quantitative property M that depends on n positive parameters K = (K1, K2, ⋯, Kn), namely M = f(K1, K2, ⋯, Kn). The property M is robust in a distributed manner with respect to changes in these parameters if the variance in M is reduced compared to the variance in the parameters K, when the parameters are subjected to independent perturbations. That is, considering variance in all independent parameters Var(log Ki) = Var(log K), i ∈ {1, ⋯, n}, then we consider M to be robust in a distributed manner if

Var(logM)Var(logK). (1)

Similarly, if we consider a subset of r parameters Ir = {i1, i2, ⋯, ir} ⊂ 2{1,⋯,n}, which we multiply by positive, independent, identically distributed, random scales (s1, s2, ⋯, sr), we define M as r-robust if, there is an r* such that, for any r < r* and randomly chosen Ir,

Var(logM)Var(logs), (2)

where Var(log s) is the variance of each log si, 1 ≤ ir. According to this definition, an r-robust property M can have large variance if the number of perturbed parameters is larger or equal to r*.

To distinguish between these types of robustness, it is useful to study the relationship between the variance of parameters and the variance in the output, or similarly to observe the variance in the output given the number of parameters perturbed.

For example, consider a system and a property M that is r-robust. In this system, there are n parameters, of which n0 parameters are individually sensitive to perturbation, meaning that M is sensibly affected by the perturbation of each of these parameters. For a formal definition of sensitivity with respect to individiual parameters one can use either the ratio Var(log M)/Var(log Ki) or the usual local sensitivity measure |logMlogKi| averaged over the domain of interest in the parameter space. If we select r of n parameters at random, the probability we did not select a sensitive parameter is (1 − n0/n)r. Then the probability that at least one sensitive parameter was selected is 1 − (1 − n0/n)r. If changes in a sensitive parameter contribute V0 to the log-variance, and the effect is not cumulative, then the log-variance in M with respect to r mutations is given by

Var(logM)=(1-(1-n0/n)r)V0. (3)

Although V0 is large, Var(log M) is small for small r and becomes V0 only when r is large enough. The cross-over value of r characterizing the loss of robustness is smaller for a large number of sensitive parameters n0.

The definition of n0 uses two implicit assumptions: i) that parameters have a well defined structural or biochemical meaning (nucleotide, concentrations of transcription factors, binding affinities, etc.) and ii) that among the identified parameters some are highly sensitive and the others have negligible sensitivity. These two assumptions are not always satisfied. For instance, the total number of parameters and the number of sensitive parameters can be different in models with different levels of abstraction. As sensitive parameters are related hierarchically across levels of abstraction we expect that findings for one type of model apply to other models in a hierarchy of abstractions (a full discussion of hierarchies of models related by model reduction can be found elsewhere [50]). Rather generally, we expect that a number of parameters have sensitivity much higher than the others. This phenomenon was explained from first principles in models with time scales distributed over many orders of magnitude. As discussed elsewhere [13], the heterogeneity of the sensitivity of parameters of biochemical systems results from the existence of widely distributed time and concentration scales, a property called “multiscaleness.” In multiscale systems some parameters are important and dominate the others, while a majority of parameters have small effect and play a more static role. The term “sloppy-sensitivity” is sometimes used to designate this situation. However, in practice, a threshold must be chosen to separate sensitive from non-sensitive parameters. This threshold can be for instance chosen inside the largest gap in the distribution of sensitivities. Interestingly, the relation (3) does not need knowledge of the threshold to identify the number of sensitive parameters n0. It is enough to perform a r-robustness test and the value of n0 is found by fitting Eq (3). Furthermore, let us define r1/2 as the number of simultaneously perturbed parameters producing a variance Var(M) that is half of the maximal variance V0. It follows straightforwardy from (3) that

n0=n(1-2-1r1/2). (4)

Eq (4) implies that n0 and r1/2 are negatively correlated, the decrease of the first and the increase of the latter equivalently meaning higher r-robustness.

To summarize, the term r-robustness means nonlinear dependence of the variance on the number r of randomly chosen perturbed targets according to a saturation curve described by Eq (3). r1/2 is the characteristic value of r separating robust (low variance) and non-robust (high variance) situations. In models relating DNA sequence to gene expression each nucleotide in the sequence is a parameter and point mutations act on a single parameter, thus the number of sensitive parameters represents the number of sensitive nucleotides.

In contrast to the r-robustness situation, consider the function M = (K1K2Kn)1/n. This function has distributed robustness. In this case, if all parameters have a log-variance of VK, then the log-variance of M with respect to r will be

Var(logM)=VKrn2, (5)

where r is the number of parameters that have been independently perturbed. In contrast to r-robustness, the signature of distributed robustness is a linear increase of the variance with respect to r. Another signature of distributed robustness is the dependence of the variance on the total number of parameters when all the parameters are perturbed. If all parameters are perturbed, the log-variance of M is simply

Var(logM)=VK/n, (6)

and the log-variance is a vanishingly small fraction of VK when n → ∞. While the log-variance of the geometric mean scales like 1/n, some other robust functions have faster variance decrease with n. General distributed robustness is related to concentration of measure in high-dimensional spaces, a phenomenon well known in mathematics. The distribution of a function f of n variables “concentrates,” meaning that it has vanishing variance when n is very large if the function depends on all of the variables but not particularly strongly on some of them (i.e., the function should have the Lipshitz property, discussed elsewhere [13]).

In the cases of r-robustness or distributed robustness we can assign a number that describes the robustness of the system. Rather generally, robustness is described by the ratio of variance of the input to variance of the output, a parameter we call ρ. For much of this work we will be working with variables of unknown scale. In light of this, we observe the variance in the fold-change of the input and the fold-change of the output. The fold-change variations are well captured by the variance of the logarithm. More precisely,

Var(log(K))=Var(log(sK)),Var(log(M))=Var(log(sM)), (7)

where sK = K/K′, sM = M/M′ are the fold changes of K and M with respect to the reference values K′ and M′, respectively. Thus, the robustness ratio ρ is given by

ρ=Var(log(M))Var(log(K)). (8)

For r-robust properties we expect that ρ depends on the number r of perturbed targets according to

ρ(r)=1-(1-n0/n)r, (9)

where n0 is the number of sensitive parameters. Thus ρ is small only for small r and a more appropriate measure of robustness is in this case n0.

Robustness of mRNA levels with respect to transcription factor concentration

Enhancers interpret the local concentration of transcription factors in order to specify appropriate production of mRNA. In order to determine the degree to which known regulatory mechanisms acting on enhancers contribute to robustness with respect to fluctuations in TF concentrations, we utilized a model that simulates the regulation of Drosophila eve [47], which is expressed in seven transverse stripes across developing Drosophila embryos. This model incorporates several mechanisms. The binding of TFs to DNA, including the effects of steric competition and cooperativity, is treated by thermodynamics [51]. Because chromatin state is an important predictor of TF binding [5255], we exclude TF binding within closed chromatin. Other mechanisms, described phenomenologically, are short-range quenching of transcriptional activators [5658], and coactivation of repressors [44, 59, 60]. The functional roles of the TFs used in the model are known from independent experiments, and expression is calculated by summing the bound activators after accounting for the effects of the mechanisms listed above and passing the resulting net activation N through a diffusion limited Arrhenius rate law, taking into account competition for interaction with the basal transcriptional machinery. In the study cited above, the model is able to accurately treat the expression pattern of stripes between 35.5% and 92.5% embryo length (Fig 1A), identify the enhancers within the eve regulatory locus, and simulate the effect of ectopic Hb expression. A detailed description of the model is provided in the Supplementary information, S1 Text.

Fig 1. Robustness of eve expression to variation in TF concentration.

Fig 1

(A) The relative expression of eve mRNA along a 10% wide strip along the anterior posterior axis (gray shading) and the model fit to the same data (black line). eve stripe number is indicated. (B) The relationship between variation in fold-change TF concentration and fold-change mRNA levels (Eq (7)) for the TF Gt at the percent embryo length indicated. The line representing ρ = 1 (Eq (8)) is indicated with a gray line. Points below this line are robust, while points above are sensitive. (C) The ratio of the variance in fold-change mRNA to the variance of fold change TF concentration ρ (Eq (8)) for the indicated TF at each position in the embryo (solid line). The role of each factor (activator or repressor) is indicated in parentheses. ρ values have been square root transformed for better visual presentation. The dashed line indicates the value ρ = 1. The perturbation size A in Eq (10) was set to 0.1. The expression pattern of eve (gray shading) is included for visual orientation.

To test how this system responds to fluctuations in TF levels, we simulated changes in TF levels by multiplying them by the fold ratio

sT=exp(AX), (10)

where A is a parameter that sets the size of fluctuations, and X is a random number drawn from a uniform distribution between -1 and 1. Because TF levels are in arbitrary units, we observe the fold change in TF levels and the fold change in resulting mRNA and computed the ratio ρ (Eq (8)) after simulating 10,000 fluctuations.

We find that sensitivity to fluctuations in individual TFs varies with respect to position in the embryo. For instance, if we observe sensitivity to fluctuations in the TF Giant (Gt) at the interstripes, borders, and peak of the second eve stripe (positions indicated in Fig 1A), we find that at the anterior interstripe and border, expression is not robust to changes in Gt regardless of the magnitude of fluctuation (Fig 1B).

This embodies the well established fact that Gt controls the anterior border of eve stripe 2 [25, 61, 62]. In contrast, the posterior interstripe of stripe 2 is insensitive to fluctuations in Gt. Notably, at the peak of stripe 2, expression is robust against small fluctuations and sensitive to large ones (Fig 1B).

In general, we note that expression at interstripes is more sensitive to fluctuating TF levels than expression at stripe peaks (Fig 1C, S1 Fig). We note sensitivity to Hb at the anterior borders of stripes 3 and 4 and the posterior border of stripe 6. The posterior border of stripe 2 is sensitive to Kr. The posterior borders of stripes 3 and 4 and anterior borders of stripes 6 and 7 are sensitive to Kni. The anterior border of stripe 2 and the posterior border of stripe 5 are sensitive to Gt. Finally, the posterior border of stripe 7 is sensitive to Tll. In each case, the border is sensitive to the factors that set the border of that stripe within the embryo [25, 6167].

The eve locus is r-robust with respect to nucleotide changes

Genetic systems may also be robust with respect to changes in DNA sequence. In order to investigate the robustness of the eve locus with respect to sequence perturbation, we simulated random mutations to r nucleotides 10,000 times, with r spanning 1 to 10% of all nucleotides (see Materials and Methods Section 5.3). If eve expression is robust in a distributed manner, variance will increase linearly with r (Eq (5)). In contrast, if eve expression is r-robust, variance will saturate with increasing r (Eq (3)). When we examine the relationship between variance in eve expression and r (illustrated at 35.5% embryo length in (Fig 2A), we find that variance saturates with r, along a curve well described by Eq (3). According to the first part of the Results section, this shows that eve expression is r-robust with respect to nucleotide changes. For r-robust systems, robustness is captured by the parameter n0 (Eq (3)), which indicates the number of sensitive parameters. To find n0, we fit Eq (3) at every position along the anterior-posterior axis. We find that stripe peaks are more robust to perturbation than interstripes in that they have fewer sensitive parameters (Fig 2B).

Fig 2. Robustness of eve expression to mutation of DNA sequence.

Fig 2

(A) The variation of eve expression at 35.5% embryo length at various values of r nucleotides that are mutated. The fit to data (Materials and Methods) is shown as a grey line, and the estimated number of sensitive nucleotides, n0, is indicated (Eq (3)). (B) The number of sensitive nucleotides, n0 at every position along the anterior-posterior axis.

Longer eve enhancers are more robust to perturbation

For the second stripe of eve, four sequences of different length are known to drive expression: the intact locus, the proximal 1700 bp, S2E, and MSE2. Each larger sequence contains the sequence of all smaller enhancers (Fig 3A). The gene regulatory model successfully predicts that each of these sequences drives expression at the position of eve stripe 2. The model prediction and experimental data for each enhancer is shown in Fig 3B–3E. In order to investigate whether additional sequence contributes additional robustness, we investigated the number of sensitive nucleotides n0 in each of these four sequences at the peak of stripe 2 expression (40.5% embryo length). We find that as sequence length grows, not only does the ratio of sensitive nucleotides decrease, the absolute number of sensitive nucleotides decreases from about 26 to 12 (Fig 4A–4D).

Fig 3. Predicted expression driven by successively smaller enhancers.

Fig 3

(A) Diagram showing the entire eve locus and successively smaller sequences that all drive stripe 2. Where each sequence aligns within the locus is indicated with dashed lines. Position with respect to TSS is indicated. (B-E) The model predicted expression and actual expression for each of the sequences in (A) along the anterior-posterior axis. Model output is in black lines and expression data is in gray shading. Relative expression on a 0 to 1 scale is reported.

Fig 4. Mutational robustness of natural S2Es.

Fig 4

(A-D) The variation in mRNA expression for increasing numbers of perturbed nucleotides at 40.5% embryo length (peak of stripe 2) is shown for sequences of different length that drive eve stripe 2. The best fit curve and estimated number of sensitive nucleotides n0 is indicated. (E) The ratio of variation in mRNA to variation in TF concentration ρ (Eq (8)) at 40.5% embryo length for each TF and each sequence. A was set to A = 0.1 in Eq (10) for simulations.

We also investigated the robustness (ρ) to changes in TF concentration for each of the stripe 2 enhancers. For the majority of TFs, there was a relationship between the size of the enhancer and robustness to TF concentration (Fig 4E). For the factors Cad, Dst and Kni, longer enhancers were more robust to changes in TF concentration. For Bcd, Hb, Gt and Tll MSE2 was more robust than S2E and the proximal 1700bp but less robust than the intact locus. Finally, MSE2 was the most robust to changes in Kr, followed by the intact locus, then S2E and the proximal 1700bp.

Robustness is a function of enhancer length

In order to determine whether the relationship between sequence length and the robustness measure n0 to mutation is inherent to the system, we generated 8010 putative stripe 2 elements in silico. We generated 10 putative S2Es with each length from 200 bp to 1000 bp. All 8010 S2Es are predicted to drive the correct expression pattern (Fig 5A). We estimated the number of sensitive nucleotides n0 for each of these S2Es by simulating 10,000 sets of r sequence mutation with r spanning from 1 bp to 10% of the sequence length. We estimated n0 at the peak of stripe 2 expression (40.5% embryo length). We found that n0 decreases with enhancer length (Fig 5B).

Fig 5. Sequence robustness of putative S2Es.

Fig 5

(A) Predicted expression of 8010 putative S2Es with lengths from 200 bp to 1000 bp (10 each, gray lines). Each putative S2E is predicted to drive expression similar to the natural D. melanogaster S2E. (B) Boxplots of the number of sensitive nucleotides n0 vs. sequence length for each of the 8010 putative S2Es in bins of 50 bp. n0 is significantly correlated with sequence length (Spearman ρ = −0.25, p < 2.2 × 10−16). (C) The correlation (Spearman ρ) between the sensitivity to changes in transcription factor concentration (ρ) and the length of putative S2Es (* p < 10−4; ** p < 10−9; *** p < 10−16). The ratio of the variance in mRNA to the variance in TF levels, ρ (Eq (8)) was estimated with A = 0.1 at 40.5% embryo length. (D) Boxplots of the number of modeled binding sites in each putative S2E vs. sequence length for each of the 8010 putative S2Es in 50 bp bins. Spearman ρ = 0.96, p < 2.2 × 10−16.

We also investigated robustness, as described by ρ, to changes in TF concentration for each of these 8010 enhancers. We measured ρ from 10,000 simulations with A = 0.1 at 40.5% embryo length. For most TFs, there is a relationship between enhancer length and robustness to changes in TF concentration, but the direction of this relationship was not consistent between factors (Fig 5C). As enhancers increase in length, they become more robust to changes in Bcd, Hb, Gt, and Kr and less robust to changes in Cad, Dst, and Kni.

The location and mechanism of sensitive nucleotides

In order to identify the location of sensitive nucleotides we tested all possible single nucleotide sequence perturbations and selected the nucleotides that led to the highest log variance of mRNA expression (Fig 6A and S2 Fig). For all sequences except MSE2, the sensitive nucleotides occurred in a tight cluster at the 3’ end of the stripe 2 enhancer.

Fig 6. Location of sensitive nucleotides.

Fig 6

(A) Log variance of mRNA expression when each nucleotide is perturbed one at a time, for each of the four sequences tested. The n0 most sensitive nucleotides are indicated in red. (B) The mean change in PWM score for the indicated factors when each nucleotide in S2E is perturbed. The n0 most sensitive nucleotides are indicated in red. (C) The percent of total strength of transcriptional activation N (see Eq. 15 S1 Text) along the DNA for the locus (top) and S2E (bottom). Sensitive nucleotides represent a smaller percent of total activation in the locus compared to S2E.

To identify which factors drive sensitivity, we tested the mean change in PWM score for the eight TFs considered here. Sensitive nucleotides tend to lead to a reduced PWM score for at least one of Bcd, Kr, and Hb, but not all binding site losses correspond to sensitive nucleotides (Fig 6B). This effect was especially strong for Hb, where sequence mutations that lead to reduced PWM scores at three Hb sites represent 6 of 11 sensitive nucleotides in the intact locus.

The degree to which any DNA sequence activates eve transcription is captured by the model parameter N (Eq. 15 S1 Text). N represents the sum of transcriptional activators bound to a particular DNA sequences, weighted by their strength of transcriptional activation. When we examine N for the enhancer S2E (Fig 6C, top), we find that a large percentage of transcriptional activation comes from three segments at the 3 prime end of S2E. These correspond to the three Hb sites that contain sensitive nucleotides (Fig 6B). These three Hb sites are responsible for a considerable amount of the total activation in S2E (Fig 6C), but the percent of total activation from these sequences is much smaller in the intact eve locus (Fig 6C). This distributed activation makes the eve locus more robust than smaller enhancers to perturbation of these sequences, which are contained in both.

Sensitive nucleotides are more conserved

Mutations of sensitive nucleotides, which lead to a greater change in transcription rates, are likely to have a negative effect on organismal fitness. Thus, we expect sensitive nucleotides to be more conserved on average than insensitive nucleotides. To test this hypothesis, we performed an alignment of S2Es from 12 Drosophilids to assess conservation at every nucleotide. Using the sensitivity values from the intact eve locus we found that the sensitive nucleotides were conserved in 79.5% of species, on average, compared to a background conservation rate of 68%. The correlation between sensitivity and conservation was marginally significant (Spearman ρ; p = 0.042). This correlation supports the notion that mutations in sensitive nucleotides negatively impact fitness, but is limited by the fact that nucleotides that are not important for stripe 2 expression might be conserved for other reasons. For example, they could be important for expression of other eve stripes, or for expression of eve in additional tissues. Nucleotides may also be conserved due to mechanisms not treated by our model, such as chromatin state [52, 53]. These processes will dilute the statistical signal observed

Human enhancers are r-robust with respect to nucleotide changes

Recently, Kircher et al [68] reported saturation mutagenesis of several human regulatory elements using a high- throughput reporter assay. The mutagenesis strategy incorporates a random number of insertions, deletions, or substitutions uniformly across the length of the enhancer. Using this data, we are able to explicitly test how variance in expression changes with the number of perturbed nucleotides. We restricted our analysis to the two enhancers that showed high reproducibility across biological replicates (IRF4 and SORT1). We found that the variation in expression saturates with increasing number of nucleotide changes, along curves well described by Eq (3), for both of these enhancers (S3 Fig).

Discussion

In this work we assessed the robustness of enhancers with respect to changes in TF levels or sequence mutation in the context of a sequence level model of gene regulation. We found that enhancers are r-robust to single nucleotide sequence changes, and that this robustness increases with the length of the sequence. The precise level of r-robustness seen, however, is not solely dependent on DNA sequence. It depends on the state of bound TFs, and thus manifests itself experimentally as dependence on position within the embryo. Sensitivity, when observed, is coupled to biological function. This point is most clearly seen in the dependence of domain border positions on TF concentration, but is also observable in the functional role of sensitive nucleotides. We discuss each of these points below.

Our major finding is that robustness of eve to sequence mutation is well-described by r-robustness. That is, the eve regulatory DNA is a system with a small number of sensitive parameters. This is understandable in terms of the classic experiments that elucidated this regulation. These experiments showed that enhancer function resided in multiple binding sites for each TF, each of which could be disrupted by site-directed mutation [25, 61]. Such a picture is fully compatible with r-robustness with n0 equal to about 26 for MSE2, which is on the order of the number of the number of base changes required to mutate all of the binding sites for a single TF.

Our finding that n0 decreases to only 12 for the whole locus indicates that robustness increases with increasing length of regulatory DNA. This might appear to contradict the modular structure of enhancers, but it too is compatible with the experimental literature. Early experiments with enhancers sought to find the minimum fragments that could recapitulate an expression feature using non-quantitative assays [25], MSE2 is quantitatively not equivalent to the full S2E, expressing at a level 5 times lower [47]. Moreover, MSE2 provides a lower rate of rescue of lethality than does the full S2E [69]. S2E, in turn, was first identified by the presence of two conserved sequences at either end [70], but this structural feature says nothing about the actual functional limits of S2E, which are known to be larger in the closely related species D. erecta [71]. Moreover there is evidence from the sea squirt Ciona that redundancy built into enhancers ensures robust expression in appropriate tissues without disrupting specificity [72]. Redundancy also buffers environmental perturbations, which can disrupt minimal enhancers [69, 73]. We have already alluded to the existence of shadow enhancers, redundant enhancers controlling the same expression domain and thereby increasing robustness [26, 27, 2932, 7476].

We also compared the level of robustness of putative in silico stripe 2 enhancers of different lengths, and also found that longer enhancers were more robust. These results also suggest that selective forces that are not explicitly modeled here drive the evolution of robustness. These in silico enhancers were selected for their pattern generating capability, but not explicitly for robustness. Thus, it is interesting that both MSE2 and S2E have fewer sensitive nucleotides (26.3 and 21.6 respectively) than in silico enhancers of the same length (average of 49.5 and 37.1), indicating that robustness to sequence perturbation may be selected for in natural populations, a conclusion reinforced by our finding that sensitive nucleotides are better conserved across species. The high correlation between length and number of sites (Fig 5D) indicates that redundancy in binding sites combined with a more distributed contribution of all the nucleotides, with sensitive nucleotides being responsible of a lower percentage of the total activation, is probably the cause of increased robustness. Supporting this, we find that MSE2 and S2E have a higher density of sites (114 and 204) than the in silico enhancers of the same length (average of 100.6 and 141).

It is important to note that all these measures of sequence robustness varied according to position in the embryo. This is natural and expected because the transcriptional state is dependent not only on position, but also on bound TFs, which vary by position. As a consequence, assays for sequence and gene product concentrations provide a very incomplete picture in the absence of knowledge of state information about regulators.

In contrast with the robustness that we see in cis-regulatory sequence, we find marked sensitivity to TF concentrations. For example, Fig 1 shows levels of ρ exceeding 100. Specifically, the transcription rate at stripe borders was drastically sensitive to changes in the concentration of TFs. For instance, the anterior border of eve stripe 2 is controlled by Gt [25]. When Gt levels fluctuate at this embryo position, transcription levels fluctuate to a greater degree. Such sensitivity may be a necessary feature of the circuit, where high sensitivity to individual repressors allows the formation of extremely sharp borders. Fig 1 also shows areas of low sensitivity. Such areas of apparent robustness fall into two classes. First, in Fig 1C there are areas of high robustness in areas where a transcription factor is not expressed. These areas include the region posterior to 50% embryo length for Bcd, Hb between 60% and 70% embryo length, Kr posterior to 70% embryo length, Kni anterior to 45% embryo length, Gt from 45% to 60% embryo length, and Tll anterior to 80%. Here the insensitivity to perturbation arises from the trivial reason that any multiplicative factor applied to zero gives zero. Second, we see reduced sensitivity at stripe peaks. We believe this arises because of the dependence of the transcription activation on the individual transcription factors concentration is a strongly nonlinear sigmoidal function whose derivative is very large close to a threshold (domain borders) and low far away. This excludes robustness at the domain borders and allows it in between.

The results reported here are a precise and quantitative characterization of what it means for a specific biological system to be “robust but fragile.” The concept of r-robustness makes this idea precise, and in the case of robustness against changes of sequence, the values of n0 found have a clear relationship to well known experimental results. We see no evidence of the 1/n concentration characteristic of the central limit theorem as illustrated in Eqs 5 and 6. This is to be expected in biological systems, where the necessity of precise control confronts the need for stability and resilience against perturbation, a concept well captured by the idea of r-robustness.

The r-robustness of cis-regulatory sequences is achieved by a principle of “hierarchical heterogeneity” according to which nucleotide mutations have widely distributed impacts. The total rate of transcription of a gene is a weighted sum of contributions of different DNA segments. The contribution to the total transcription of each segment is itself a sum over many interacting sites. At each level of this hierarchy, sensitive functions such as exponential binding affinities or the diffusion-limited Arrhenius rate law produce a disparity among parameters, in which sensitive parameters stand out. Sensitive nucleotides dominate the others and their impact cannot be easily flattened by addition, thus excluding distributed concentration effects. However, in longer sequences the combined effects of many nucleotides dilutes the impact of all the mutations. Therefore, some nucleotides, sensitive in short sequences become insensitive in the long ones.

Materials and methods

Model selection

The model used in this work is the same as reported in Barr and Reinitz [47]. The parameter set used was the best model including chromatin state information, called ‘Repeat Chromatin #2’ in that work.

Simulations of TF concentration perturbation

We perturbed TF concentration by selecting a random number X distributed uniformly between -1 and 1. Then we multiplied the TF concentration at every embryonic position by exp(AX), where A is a set parameter that scales the size of perturbation, and we observed the predicted change in mRNA synthesis rate at all positions. We repeated this calculation 10,000 times for each value of A between 0 and 3 in increments of 0.1, for a total of 310,000 simulations. This was repeated for each of the 8 TFs included in the model.

Simulations of sequence mutation

In the intact locus model, some nucleotides are not accessible for TF binding because of the chromatin state. We perturbed DNA sequence by selecting sets of r nucleotides only from open chromatin regions. The r nucleotides were then substituted by one of the remaining three possible nucleotides with equal probability. We then assessed model output at each position. This was repeated for every set r from 1 10% of the total accessible nucleotides.

Estimation of sensitive nucleotides

To fit Eq (3) to data, we used simulated annealing from the R package GenSA, using default parameters. n was set to the total length of accessible nucleotides, or 8765 for the locus, 1726 for p1700, 804 for S2E and 484 for MSE2.

Enhancer-reporter assays

The locus expression, as well as the expression of the S2E and MSE2 construct are reported in Barr and Reinitz [47]. The p1700 data is from Janssens et al. [39].

Generation of putative S2Es

To generate putative S2Es of different lengths, we fixed the kinetic parameters and optimized DNA sequence using previously described methods [77]. We used the expression of S2E as the target. We started each optimization with a random sequence of the desired final length.

12 species alignment

To identify conservation at sensitive nucleotides we first obtained putative S2E sequences by using the BLAST tool at FlyBase [78]. We identified significant contiguous alignments for the species sim, sec, ere, yak, rho, ele, tak, eug, bia, kik, and pse. We performed an alignment using Clustal Omega [79] using default parameters. To get a conservation score at every base in the melanogaster sequence, we calculated the percent of species containing the same nucleotide as melanogaster at that position.

Analysis of human enhancers

Data from Kircher et al. [68] was obtained from the NCBI Gene Expression Omnibus under accession number GSE126550. We restricted our analysis to enhancers of SORT1 and IRF4, which had high reproducibility across biological replicates. This was necessary because our analysis of variance would otherwise be dominated by variance due to measurement error. For every barcode, we counted the number of unique molecular identifiers (UMIs) observed in the DNA and RNA in each of three experimental replicates. Additionally, for each barcode we counted the number of mutations that had been introduced to the original sequence. We excluded mutations that were not observed in the DNA pool in all any of the experimental replicates. The ratio of UMI counts in RNA to UMI counts in DNA serves as a measure of expression. We examined the variance in log-expression for each number r perturbed nucleotides. We reported this number for all values of r that represent at least 100 barcodes. When we fit the values of 3 we also include a linear adjustment for experimental measurement error.

Supporting information

S1 Text. Sequence level model of the gene regulation.

A text describing the sequence level model of gene regulation is provided.

(PDF)

S1 Fig. Heatmap of robustness of eve expression to variation in TF concentration.

A heatmap comparing the variance in fold-change input to fold-change output (Eqs 7 and 8) Var(log(Δ[mRNA])) at different positions within the embryo as well as different sizes of perturbation to TF concentration, indicated by Var(log(Δ[TF])). Darker shading represents increasing variation in mRNA levels.

(TIF)

S2 Fig

Sensitive nucleotides change with r (Top) The log variance in S2E mRNA expression when each nucleotide is perturbed one at a time (r = 1). The top 26 most sensitive nucleotides are indicated in red. (Bottom) The log variance in S2E mRNA expression when each nucleotide is perturbed in a pairwise fashion with all other nucleotides (r = 2). The 26 most sensitive nucleotides from the r = 1 are indicated in red.

(TIF)

S3 Fig. r-robustness in human enhancers.

The log variance in expression with respect to the number of mutated nucleotides is reported for each of three experimental replicates for enhancers for the human genes SORT1 and IRF4. The best fit to Eq (3) (red) and the corresponding number of sensitive nucleotides n0 is shown. For details on this analysis see Materials and Methods.

(TIF)

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

JR received funding from the U.S. National Institutes of Health, grant R01 OD010936 (https://www.nih.gov/). JR also received funding from grant FACCTS from the University of Chicago, https://fcc.uchicago.edu/faccts. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Kitano M. Biological Robustness. Nature Reviews Genetics. 2004;5:826–837. 10.1038/nrg1471 [DOI] [PubMed] [Google Scholar]
  • 2. Von Neumann J. Probabilistic logics and the synthesis of reliable organisms from unreliable components. Automata studies. 1956;34:43–98. [Google Scholar]
  • 3. Stelling J, Sauer U, Szallasi Z, Doyle FJ III, Doyle J. Robustness of cellular functions. Cell. 2004;118(6):675–685. 10.1016/j.cell.2004.09.008 [DOI] [PubMed] [Google Scholar]
  • 4. Thom R. Topological Models in Biology. Topology. 1969;8:313–335. 10.1016/0040-9383(69)90018-4 [DOI] [Google Scholar]
  • 5. Corson F, Siggia ED. Geometry, epistasis, and developmental patterning. Proceedings of the National Academy of Sciences. 2012;109(15):5568–5575. 10.1073/pnas.1201505109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Golubitsky M, Stewart I. Homeostasis, singularities, and networks. Journal of mathematical biology. 2017;74(1-2):387–407. 10.1007/s00285-016-1024-2 [DOI] [PubMed] [Google Scholar]
  • 7. Kitano H. Towards a theory of biological robustness. Molecular systems biology. 2007;3(1):137 10.1038/msb4100179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Deutscher D, Meilijson I, Kupiec M, Ruppin E. Multiple knockout analysis of genetic robustness in the yeast metabolic network. Nature genetics. 2006;38(9):993 10.1038/ng1856 [DOI] [PubMed] [Google Scholar]
  • 9. von Dassow G, Meir E, Munro EM, Odell GM. The segment polarity network is a robust developmental module. Nature. 2000;406:188–192. 10.1038/35018085 [DOI] [PubMed] [Google Scholar]
  • 10. Houchmandzadeh B, Wieschaus E, Leibler S. Establishment of developmental precision and proportions in the early Drosophila embryo. Nature. 2002;415:798–802. 10.1038/415798a [DOI] [PubMed] [Google Scholar]
  • 11. Manu, Surkova S, Spirov AV, Gursky V, Janssens H, Kim A, et al. Canalization of gene expression in the Drosophila blastoderm by gap gene cross regulation. PLoS Biology. 2009;7:e1000049 10.1371/journal.pbio.1000049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Manu, Surkova S, Spirov AV, Gursky V, Janssens H, Kim A, et al. Canalization of gene expression and domain shifts in the Drosophila blastoderm by dynamical attractors. PLoS Computational Biology. 2009;5:e1000303 10.1371/journal.pcbi.1000303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Gorban AN, Radulescu O. Dynamical robustness of biological networks with hierarchical distribution of time scales. IET Systems Biology. 2007;1:238–246. 10.1049/iet-syb:20060083 [DOI] [PubMed] [Google Scholar]
  • 14. Cardelli L, Csikász-Nagy A, Dalchau N, Tribastone M, Tschaikowski M. Noise reduction in complex biological switches. Scientific reports. 2016;6:20214 10.1038/srep20214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Talagrand M. Concentration of measure and isoperimetric inequalities in product spaces. Publications Mathématiques de l’Institut des Hautes Etudes Scientifiques. 1995;81(1):73–205. 10.1007/BF02699376 [DOI] [Google Scholar]
  • 16. Gromov M. Metric structures for Riemannian and non-Riemannian spaces. Springer Science & Business Media; 2007. [Google Scholar]
  • 17. Waddington CH. Canalization of development and the inheritance of acquired characters. Nature. 1942;150:563–565. 10.1038/150563a0 [DOI] [PubMed] [Google Scholar]
  • 18. Waddington CH. The Strategy of Genes. London: George Allen & Unwin; 1957. [Google Scholar]
  • 19. Siegal ML, Bergman A. Waddington’s canalization revisited: Developmental stability and evolution. Proceedings of the National Academy of Sciences USA. 2002;99:10528–10532. 10.1073/pnas.102303999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Hoyos E, Kim K, Milloz J, Barkoulas M, Pénigault JB, Munro E, et al. Quantitative variation in autocrine signaling and pathway crosstalk in the Caenorhabditis vulval network. Current Biology. 2011;21(7):527–538. 10.1016/j.cub.2011.02.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Garfield DA, Runcie DE, Babbitt CC, Haygood R, Nielsen WJ, Wray GA. The impact of gene expression variation on the robustness and evolvability of a developmental gene regulatory network. PLoS biology. 2013;11(10):e1001696 10.1371/journal.pbio.1001696 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Long HK, Prescott SL, Wysocka J. Ever-changing landscapes: transcriptional enhancers in development and evolution. Cell. 2016;167:1170–1187. 10.1016/j.cell.2016.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Berman BP, Nibu Y, Pfeiffer BD, Tomancak P, Celniker SE, Levine M, et al. Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome. Proceedings of the National Academy of Sciences USA. 2002;99:757–762. 10.1073/pnas.231608898 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Ochoa-Espinosa A, Yucel G, Kaplan L, Pare A, Pura N, Oberstein A, et al. The role of binding site cluster strength in Bicoid-dependent patterning in Drosophila. Proceedings of the National Academy of Sciences USA. 2005;102:4960–4965. 10.1073/pnas.0500373102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Small S, Blair A, Levine M. Regulation of even-skipped stripe 2 in the Drosophila embryo. The EMBO Journal. 1992;11:4047–4057. 10.1002/j.1460-2075.1992.tb05498.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Hong JW, Hendrix D, Levine M. Shadow Enhancers as a Source of Evolutionary Novelty. Science. 2008;321:1314 10.1126/science.1160631 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Werner T, Hammer A, Wahlbuhl M, Bösl MR, Wegner M. Multiple conserved regulatory elements with overlapping functions determine Sox10 expression in mouse embryogenesis. Nucleic acids research. 2007;35(19):6526–6538. 10.1093/nar/gkm727 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Frankel N, Davis GK, Vargas D, Wang S, Payre F, Stern DL. Phenotypic robustness conferred by apparently redundant transcriptional enhancers. Nature. 2010;466(7305):490 10.1038/nature09158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Perry M, Boettiger AN, Bothma JP, Levine M. Shadow Enhancers Foster Robustness of Drosophila Gastrulation. Current Biology. 2010;20:1562–1567. 10.1016/j.cub.2010.07.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Perry M, Boettiger AN, Levine M. Multiple enhancers ensure precision of gap gene-expression patterns in the Drosophila embryo. Proceedings of the National Academy of Sciences of the United States of America. 2011;108:13570–13575. 10.1073/pnas.1109873108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Staller MV, Vincent BJ, Meghan DJ, Lydiard-Martin T, Wunderlich Z, Estrada J, et al. Shadow enhancers enable hunchback bifunctionality in the Drosophila embryo. Proceedings of the National Academy of Sciences USA. 2015;112:785–790. 10.1073/pnas.1413877112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Wunderlich Z, Bragdon MDJ, Vincent BJ, White JA, Estrada J, DePace AH. Krüppel expression levels are maintained through compensatory evolution of shadow enhancers. Cell Reports. 2015;12:1740–1747. 10.1016/j.celrep.2015.08.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Jaeger J, Surkova S, Blagov M, Janssens H, Kosman D, Kozlov KN, et al. Dynamic control of positional information in the early Drosophila embryo. Nature. 2004;430:368–371. 10.1038/nature02678 [DOI] [PubMed] [Google Scholar]
  • 34. Poustelnikova E, Pisarev A, Blagov M, Samsonova M, Reinitz J. A database for management of gene expression data in situ. Bioinformatics. 2004;20:2212–2221. 10.1093/bioinformatics/bth222 [DOI] [PubMed] [Google Scholar]
  • 35. Luengo-Hendriks CL, Keranen SVE, Fowlkes CC, Simirenko L, Weber GH, Henriquez C, et al. 3D Morphology and Gene Expression in the Drosophila Blastoderm at Cellular Resolution I: Data Acquisition Pipeline. Genome Biology. 2006;7:R123 10.1186/gb-2006-7-12-r123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Surkova S, Myasnikova E, Janssens H, Kozlov KN, Samsonova A, Reinitz J, et al. Pipeline for acquisition of quantitative data on segmentation gene expression from confocal images. Fly. 2008;2:58–66. 10.4161/fly.6060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Surkova S, Kosman D, Kozlov K, Manu, Myasnikova E, Samsonova A, et al. Characterization of the Drosophila Segment Determination Morphome. Developmental Biology. 2008;313(2):844–862. 10.1016/j.ydbio.2007.10.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Fowlkes CC, Hendricks CLL, Keränen SVE, Rübel GHWO, Huang M, Chatoor S, et al. A quantitative spatiotemporal atlas of gene expression in the Drosophila blastoderm. Cell. 2008;133:364–374. 10.1016/j.cell.2008.01.053 [DOI] [PubMed] [Google Scholar]
  • 39. Janssens H, Hou S, Jaeger J, Kim AR, Myasnikova E, Sharp D, et al. Quantitative and predictive model of transcriptional control of the Drosophila melanogaster even skipped gene. Nature Genetics. 2006;38:1159–1165. 10.1038/ng1886 [DOI] [PubMed] [Google Scholar]
  • 40. Segal E, Raveh-Sadka T, Schroeder M, Unnerstall U, Gaul U. Predicting expression patterns from regulatory sequence in Drosophila segmentation. Nature. 2008;451:535–540. 10.1038/nature06496 [DOI] [PubMed] [Google Scholar]
  • 41. Samee MAH, Sinha S. Quantitative modeling of a gene’s expression from its intergenic sequence. PLoS Computational Biology. 2014;10:1–21. 10.1371/journal.pcbi.1003467 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Kazemian M, Blatti C, Richards A, McCutchan M, Wakabayashi-Ito N, Hammonds AS, et al. Quantitative Analysis of the Drosophila Segmentation Regulatory Network Using Pattern Generating Potentials. PLoS Biology. 2010;8:e1000456 10.1371/journal.pbio.1000456 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. He X, Samee MAH, Blatti C, Sinha S. Thermodynamics-Based Models of Transcriptional Regulation by Enhancers: The Roles of Synergistic Activation, Cooperative Binding and Short-Range Repression. PLoS Computational Biology. 2010;6:e1000935 10.1371/journal.pcbi.1000935 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Kim AR, Martinez C, Ionides J, Ramos AF, Ludwig MZ, Ogawa N, et al. Rearrangements of 2.5 Kilobases of Noncoding DNA from the Drosophila even-skipped Locus Define Predictive Rules of Genomic cis-Regulatory Logic. PLoS Genetics. 2013;9:e1003243 10.1371/journal.pgen.1003243 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Martinez C, Kim AR, Rest JS, Ludwig M, Kreitman M, White K, et al. Ancestral resurrection of the Drosophila S2E enhancer reveals accessible evolutionary paths through compensatory change. Molecular Biology and Evolution. 2014;31:903–916. 10.1093/molbev/msu042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Sayal R, Dresch JM, Pushel I, Taylor BR, Arnosti D. Quantitative perturbation-based analysis of gene expression predicts enhancer activity in early Drosophila embryo. eLife. 2016;5:e08445 10.7554/eLife.08445 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Barr KA, Reinitz J. A sequence level model of an intact locus predicts the location and function of nonadditive enhancers. PLoS One. 2017;12:1–26. 10.1371/journal.pone.0180861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Barr KA, Martinez C, Moran JR, Kim AR, Ramos AF, Reinitz J. Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation. BMC Systems Biology. 2017;11:116 10.1186/s12918-017-0485-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Lawrence PA, Johnston P, MacDonald P, Struhl G. Borders of parasegments in Drosophila embryos are delimited by the fushi-tarazu and even-skipped genes. Nature. 1987;328:440–442. 10.1038/328440a0 [DOI] [PubMed] [Google Scholar]
  • 50. Radulescu O, Gorban AN, Zinovyev A, Lilienbaum A. Robust simplifications of multiscale biochemical networks. BMC systems biology. 2008;2(1):86 10.1186/1752-0509-2-86 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Ackers GK, Johnson AD, Shea MA. Quantitative model for gene-regulation by lambda-phage repressor. Proceedings of the National Academy of Sciences USA. 1982;79:1129–1133. 10.1073/pnas.79.4.1129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Harrison MM, Li XY, Kaplan T, Botchan MR, Eisen MB. Zelda binding in the early Drosophila melanogaster embryo marks regions subsequently activated at the Maternal-to-Zygotic Transition. PLoS Genetics. 2011;7:e1002266 10.1371/journal.pgen.1002266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Li XY, Thomas S, Sabo PJ, Eisen MB, Stamatoyannopoulos JA, Biggin MD. The role of chromatin accessibility in directing the widespread, overlapping patterns of Drosophila transcription factor binding. Genome Biology. 2011;12:1–17. 10.1186/gb-2011-12-4-r34 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Zabet NR, Adryan B. Estimating binding properties of transcription factors from genome-wide binding profiles. Nucleic Acids Research. 2015;43:84–94. 10.1093/nar/gku1269 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Martin PCN, Zabet NR. Dissecting the binding mechanisms of transcription factors to DNA using a statistical thermodynamics framework. bioRxiv. 2019;666446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Gray S, Szymanski P, Levine M. Short-range repression permits multiple enhancers to function autonomously within a complex promoter. Genes and Development. 1994;8:1829–1838. 10.1101/gad.8.15.1829 [DOI] [PubMed] [Google Scholar]
  • 57. Gray S, Levine M. Short-range transcriptional repressors mediate both quenching and direct repression within complex loci in Drosophila. Genes and Development. 1996;10:700–710. 10.1101/gad.10.6.700 [DOI] [PubMed] [Google Scholar]
  • 58. Fakhouri WD, Ay A, Sayal R, Dresch J, Dayringer E, Arnosti DN. Deciphering a transcriptional regulatory code: modeling short-range repression in the Drosophila embryo. Molecular Systems Biology. 2010;6:341 10.1038/msb.2009.97 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Han K, Levine M, Manley JL. Synergistic activation and repression of transcription by Drosophila homeobox proteins. Cell. 1989;56:573–583. 10.1016/0092-8674(89)90580-1 [DOI] [PubMed] [Google Scholar]
  • 60. Small S, Arnosti DN, Levine M. Spacing ensures autonomous expression of different stripe enhancers in the even-skipped promoter. Development. 1993;119:767–772. [PubMed] [Google Scholar]
  • 61. Stanojevic D, Small S, Levine M. Regulation of a segmentation stripe by overlapping activators and repressors in the Drosophila embryo. Science. 1991;254:1385–1387. [DOI] [PubMed] [Google Scholar]
  • 62. Reinitz J, Sharp DH. Mechanism of eve stripe formation. Mechanisms of Development. 1995;49:133–158. 10.1016/0925-4773(94)00310-j [DOI] [PubMed] [Google Scholar]
  • 63. Frasch M, Levine M. Complementary patterns of even-skipped and fushi-tarazu expression involve their differential regulation by a common set of segmentation genes in Drosophila. Genes and Development. 1987;1:981–995. 10.1101/gad.1.9.981 [DOI] [PubMed] [Google Scholar]
  • 64. Small S, Kraut R, Hoey T, Warrior R, Levine M. Transcriptional regulation of a pair-rule stripe in Drosophila. Genes and Development. 1991;5:827–839. 10.1101/gad.5.5.827 [DOI] [PubMed] [Google Scholar]
  • 65. Small S, Blair A, Levine M. Regulation of two pair-rule stripes by a single enhancer in the Drosophila embryo. Developmental Biology. 1996;175:314–324. 10.1006/dbio.1996.0117 [DOI] [PubMed] [Google Scholar]
  • 66. Fujioka M, Emi-Sarker Y, Yusibova GL, Goto T, Jaynes JB. Analysis of an even-skipped rescue transgene reveals both composite and discrete neuronal and early blastoderm enhancers, and multi-stripe positioning by gap gene repressor gradients. Development. 1999;126:2527–2538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Clyde DE, Corado MS, Wu X, Pare A, Papatsenko D, Small S. A self-organizing system of repressor gradients establishes segmental complexity in Drosophila. Nature. 2003;426:849–853. 10.1038/nature02189 [DOI] [PubMed] [Google Scholar]
  • 68. Kircher M, Xiong C, Martin B, Schubach M, Inoue F, Bell RJA, et al. Saturation mutagenesis of twenty disease-associated regulatory elements at single base-pair resolution. Nature Communications. 2019;10:3583 10.1038/s41467-019-11526-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Ludwig MZ, Manu, Kittler R, White KP, Kreitman M. Consequences of eukaryotic enhancer architecture for gene expression dynamics, development, and fitness. PLoS Genetics. 2011;7:e1002364 10.1371/journal.pgen.1002364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Ludwig MZ, Patel NH, Kreitman M. Functional analysis of eve stripe 2 enhancer evolution in Drosophila: rules governing conservation and change. Development. 1998;125:949–958. [DOI] [PubMed] [Google Scholar]
  • 71. Crocker J, Stern D. Functional regulatory evolution outside of the minimal even-skipped enhancer. Development. 2017;144:3095–3101. 10.1242/dev.149427 [DOI] [PubMed] [Google Scholar]
  • 72. Farley EK, Olson KM, Zhang W, Brandt AJ, Rokhsar DS, Levine MS. Suboptimization of developmental enhancers. Science. 2015;350:325–328. 10.1126/science.aac6948 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Crocker J, Abe N, Rinaldi L, McGregor AP, Frankel N, Wang S, et al. Low affinity binding site clusters confer hox specificity and regulatory robustness. Cell. 2015;160:191–203. 10.1016/j.cell.2014.11.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Frankel N, Davis GK, Vargas D, Wang S, Payre F, Stern DL. Phenotypic robustness conferred by apparently redundant transcriptional enhancers. Nature. 2010;466:490–493. 10.1038/nature09158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Dunipace L, Ozdemir A, Stathopoulos A. Complex interactions between cis-regulatory modules in native conformation are critical for Drosophila snail expression. Development. 2011;138:4075–4084. 10.1242/dev.069146 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Cannavò E, Khoueiry P, Garfield DA, Geeleher P, Zichner T, Gustafson HE, et al. Shadow enhancers are pervasive features of developmental regulatory networks. Current Biology. 2016;26:38–51. 10.1016/j.cub.2015.11.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Martinez CA, Barr KA, Kim AR, Reinitz J. A synthetic biology approach to the development of transcriptional regulatory models and custom enhancer design. Methods. 2013;62:91–98. 10.1016/j.ymeth.2013.05.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Crosby MA, Goodman JL, Strelets VB, Zhang P, Gelbart WM, the FlyBase Consortium. FlyBase: genomes by the dozen. Nucleic Acids Research. 2007;35:D486–D491. 10.1093/nar/gkl827 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. 10.1093/bioinformatics/btm404 [DOI] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007497.r001

Decision Letter 0

Douglas A Lauffenburger, Ilya Ioshikhes

11 Aug 2019

Dear Dr Radulescu,

Thank you very much for submitting your manuscript 'An in silico analysis of robust but fragile

gene regulation links enhancer length to robustness' for review by PLOS Computational Biology. Your manuscript has been fully evaluated by the PLOS Computational Biology editorial team and in this case also by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the manuscript as it currently stands. While your manuscript cannot be accepted in its present form, we are willing to consider a revised version in which the issues raised by the reviewers have been adequately addressed. We cannot, of course, promise publication at that time.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

Your revisions should address the specific points made by each reviewer. Please return the revised version within the next 60 days. If you anticipate any delay in its return, we ask that you let us know the expected resubmission date by email at ploscompbiol@plos.org. Revised manuscripts received beyond 60 days may require evaluation and peer review similar to that applied to newly submitted manuscripts.

In addition, when you are ready to resubmit, please be prepared to provide the following:

(1) A detailed list of your responses to the review comments and the changes you have made in the manuscript. We require a file of this nature before your manuscript is passed back to the editors.

(2) A copy of your manuscript with the changes highlighted (encouraged). We encourage authors, if possible to show clearly where changes have been made to their manuscript e.g. by highlighting text.

(3) A striking still image to accompany your article (optional). If the image is judged to be suitable by the editors, it may be featured on our website and might be chosen as the issue image for that month. These square, high-quality images should be accompanied by a short caption. Please note as well that there should be no copyright restrictions on the use of the image, so that it can be published under the Open-Access license and be subject only to appropriate attribution.

Before you resubmit your manuscript, please consult our Submission Checklist to ensure your manuscript is formatted correctly for PLOS Computational Biology: http://www.ploscompbiol.org/static/checklist.action. Some key points to remember are:

- Figures uploaded separately as TIFF or EPS files (if you wish, your figures may remain in your main manuscript file in addition).

- Supporting Information uploaded as separate files, titled Dataset, Figure, Table, Text, Protocol, Audio, or Video.

- Funding information in the 'Financial Disclosure' box in the online system.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see here

We are sorry that we cannot be more positive about your manuscript at this stage, but if you have any concerns or questions, please do not hesitate to contact us.

Sincerely,

Ilya Ioshikhes

Associate Editor

PLOS Computational Biology

Douglas Lauffenburger

Deputy Editor

PLOS Computational Biology

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: In this manuscript, Barr, Reinitz and Radulescu dissect the role of mutations, enhancer length and TF concentration on the robustness of gene regulation. They use the classical example of even-skipped gene in Drosophila to show that the enhancers are robust to single nucleotide sequence changes and that this robustness increases with the length of the enhancer. They identified a set of “sensitive” nucleotides that are conserved across several Drosophila species. Finally, the authors show that the transcription rate is highly sensitive to TF concentrations, but this sensitivity is dependent on the A-P axis position. The paper is well written and presents interesting results. The way how definitions are given is really good - first in informal way, then in formal way. The examples makes the technical part easy to understand and easy to read without any prior knowledge of robustness. There are some major points that authors would need to address before I could recommend this paper for publication:

1. The description of the results in the abstract is not so clear. Authors should give a short description how they understand the robustness (as less nucleotides sensitive to mutations, for example).

2. [lines 5-10] Very general description of the robustness in biological systems. It is not clear what actually the problem of the mathematical measures of robustness in biology. Maybe, giving more words on the written examples will give us a better picture.

3. [before line 25] There are various examples on robustness, but at the end it makes it more confusing what actually authors are going to investigate. I think, giving more studies related to the technical side of the robustness that authors investigated (TF concentrations and gene expression) would be better.

4. [after line 25] It feels that it is absolutely different style of writing - very clear descriptions, motivations and problem statements, I really like the «story-telling» style at this part.

5. [line 88] Authors did not formally introduce n_0. It is, not actually obvious. The authors talked before about the insensitive parameters, and should not switch to the sensitive ones without formal introduction.

6. Figure 1C has very few comments in the paragraph. Authors talk about transcriptional repressor but where it is on the plot is not clear.

7. In Figure 2, the relationship described by the line on plot 2A is clear only from the figure description, it is not understandable from the paragraph.

8. Figure 3B-E have no comments at all. Also, in the results section authors introduced two different robustness measures, but during the reading it is not clear where they talk about the first definition and where about the second one. (It is not critical for understanding at all, but it becomes interesting what is the robustness they are talking about at each particular moment.)

9. In Figure 5, it would be better to give more comments on different TF concentrations - what exactly is the direction of the relationship in any particular case?

10. It would be better to give more comments on figures because the description of the results is not so full in some paragraphs.

11. Figure 6C is difficult to understand. The authors need to add better explanations for that figure.

12. We previously explored the relationship between TF concentration and observed binding, including for 5 gap TFs at eve-stripped locus, and showed that there is a sensitivity of TF binding to TF concentration (https://academic.oup.com/nar/article/43/1/84/2903035), but this is not generally true for all TFs (https://www.biorxiv.org/content/10.1101/666446v1.abstract). The authors should discuss these results in the context of their findings.

Reviewer #2: The manuscript by Barr K.A., Reinitz J. and Radulescu O. "An in silico analysis of robust but fragile gene regulation links enhancer length to robustness" presents a computational study of recently developed and previously published model of Drosophilla embryo patterning relating the nucleotide composition of known enhancer regions

to gene expression of major genes determining the patterning. The authors also exploits previously introduced notions of distributed robustness and r-robustness.

I think this is an interesting study which deserves to appear in PLoS Computational Biology.

I have a couple of important, in my point of view, remarks.

The possibility to connect nucleotide sequences to quantitative properties of phenotypes looks very exciting.

However, current manuscript exploits already existing model and already introduced concept.

I think it is necessary to highlight more explicitly the scientific novelty of the study.

Focusing on one particular gene regulation seems to me contradicting to rather general title of the manuscript.

Why it can not be extended other genes known to be involved in patterning?

Otherwise, in the absense of novel experimental data, this seems to be a limitation of the study.

Also, the principal conclusion about the character of robustness with respect to mutations (r-robustness)

is interesting but it can be rather a consequence of model properties rather than biological reality.

I think it would be advantageous to look for the use of independent data to directly or indirectly

validate this conclusion.

The sequence conservation study used by the authors is an example of using the data independent on

the model. However, the results are relatively weak. I would even avoid stating that the results

are statistically significant having the p-value just a little below 0.05 threshold.

I think this weakness should be clearly accepted and discussed.

I am also confused by the use of r symbol in the manuscript.

For example, the authors use r to designate the number of perturbed parameters.

At the same time, r participates in the definition of r-robustness,

meaning that r is the maximum number of randomly perturbed parameters,

after which the system looses robustness. Are they the same "r"?

"This saturating curve is well described by a system with a limited number of sensitive parameters".

It seems that the authors use this sentence to prove the fact of r-robustness.

From what it follows? Can one compare two fits, for r-robustness and distributed

robustness? The major conclusions of the paper are based on this fit, so more formal

approach to prove it is the best fit among some alternative models is needed.

Less important remarks:

The following sentese is confusing to me : "According to this definition, a

property M can be r-robust only for some values of r; typically, it can be robust for

small values of r and lose this property for large r."

Can it be at all otherwise (not typically?)

Also, in line 92, the authors suddenly introduce the effect of mutations, without

explaining how mutations are connected to model parameters. Does each mutation affect

one and only one parameter? It is getting more clear after further reading but

it would be better to clearly explain it in the introductory part.

Formula (4) is supposed to notify "variance of M" while it shows Var(r)

Why for Figure 2 one uses 35.5% embryo length position? If this is just an illustration,

then what about other positions, can one get a summary?

Minor remarks:

In author summary "sensitive to the levels of regulators" what does it mean? Expression levels?

Line 26: The sentence sounds not clear to me. What represents what and through what?

Line 47: "confer robustness to enhancers" sounds unclear. Should be "confer robustness to mutations in enhancers"?

Line 51: "it main features" -> "its main features"

Line 73: "distributedly robust" does not sound a good term for me, in terms of language use

Line 78: I do not quite get the meaning of $\\in 2^\\{1,...,n\\}$ in the expression

Line 112: "concentration of measure in high-dimensional spaces, a phenomenon well known in mathematics", a reference would be suitable here,

and not only from Gorban and Radulescu.

Line 135: "contribute robustness", contribute to robustness?

Line 149: "effects of mutations to the environment in trans", what environment? Should be better explained.

Line 320: "ousting distributed concentration effects." what does it mean?

Line 380: There is a misprint in the name of the journal

Line 526: missing space in the title

Line 535: extra comma in the author list

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Nicolae Radu Zabet and Liudmila Mikheeva

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007497.r003

Decision Letter 1

Douglas A Lauffenburger, Ilya Ioshikhes

22 Oct 2019

Dear Dr Radulescu,

We are pleased to inform you that your manuscript 'An in silico analysis of robust but fragile gene regulation links enhancer length to robustness' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Once you have received these formatting requests, please note that your manuscript will not be scheduled for publication until you have made the required changes.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pcompbiol/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process.

One of the goals of PLOS is to make science accessible to educators and the public. PLOS staff issue occasional press releases and make early versions of PLOS Computational Biology articles available to science writers and journalists. PLOS staff also collaborate with Communication and Public Information Offices and would be happy to work with the relevant people at your institution or funding agency. If your institution or funding agency is interested in promoting your findings, please ask them to coordinate their releases with PLOS (contact ploscompbiol@plos.org).

Thank you again for supporting Open Access publishing. We look forward to publishing your paper in PLOS Computational Biology.

Sincerely,

Ilya Ioshikhes

Associate Editor

PLOS Computational Biology

Douglas Lauffenburger

Deputy Editor

PLOS Computational Biology

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors addressed all the comments.

Reviewer #2: The authors addressed all my remarks, I think the paper can be accepted now.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Nicolae Radu Zabet

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007497.r004

Acceptance letter

Douglas A Lauffenburger, Ilya Ioshikhes

7 Nov 2019

PCOMPBIOL-D-19-01037R1

An in silico analysis of robust but fragile gene regulation links enhancer length to robustness

Dear Dr Radulescu,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Laura Mallard

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Sequence level model of the gene regulation.

    A text describing the sequence level model of gene regulation is provided.

    (PDF)

    S1 Fig. Heatmap of robustness of eve expression to variation in TF concentration.

    A heatmap comparing the variance in fold-change input to fold-change output (Eqs 7 and 8) Var(log(Δ[mRNA])) at different positions within the embryo as well as different sizes of perturbation to TF concentration, indicated by Var(log(Δ[TF])). Darker shading represents increasing variation in mRNA levels.

    (TIF)

    S2 Fig

    Sensitive nucleotides change with r (Top) The log variance in S2E mRNA expression when each nucleotide is perturbed one at a time (r = 1). The top 26 most sensitive nucleotides are indicated in red. (Bottom) The log variance in S2E mRNA expression when each nucleotide is perturbed in a pairwise fashion with all other nucleotides (r = 2). The 26 most sensitive nucleotides from the r = 1 are indicated in red.

    (TIF)

    S3 Fig. r-robustness in human enhancers.

    The log variance in expression with respect to the number of mutated nucleotides is reported for each of three experimental replicates for enhancers for the human genes SORT1 and IRF4. The best fit to Eq (3) (red) and the corresponding number of sensitive nucleotides n0 is shown. For details on this analysis see Materials and Methods.

    (TIF)

    Attachment

    Submitted filename: resub_letter.pdf

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES