Abstract
Robust positioning of cells in a tissue against unavoidable noises is important for achieving normal and reproducible morphogenesis. The position in a tissue is represented by morphogen concentrations, and cells read them to recognize their spatial coordinates. From the engineering viewpoint, these positioning processes can be regarded as an information coding. Organisms are conjectured to adopt good coding designs with high reliability for a given number of available morphogen species and their chemical properties. To answer, quantitatively, the questions of how good coding is adopted, and subsequently when, where, and to what extent each morphogen contributes to positioning, we need a way to evaluate the goodness of coding. In this article, by introducing basic concepts of computer science, we mathematically formulate coding processes in morphogen-dependent positioning, and define some key concepts such as encoding, decoding, and positional information and its precision. We demonstrate the best designs for pairs of encoding and decoding rules, and show how those designs can be biologically implemented by using some examples. We also propose a possible procedure of data analysis to validate the coding optimality formulated here.
Introduction
Providing cells with precise information on their position is an important step in morphogenesis, and it is followed by cell differentiation and formation of morphological structures, both of which depend on the position. The positional information is provided in the form of concentrations of diffusive chemicals called morphogens. Perturbation of the spatial profiles of morphogen concentrations often causes critical morphological anomalies (1, 2).
A main focus of studies on positioning in tissues is the formation and read-out of morphogen gradients. The molecular mechanisms have been revealed by experiments, and mathematical models describing them have been proposed to reproduce and predict outcomes of experiments. Moreover, recent advances in imaging techniques have enabled us to visualize the spatial profiles of morphogens and to quantitatively measure their variability among embryos (3, 4, 5, 6, 7, 8, 9, 10). The design of robust positioning by morphogens in the face of this variability or noise has been explored by a systems approach, and possible noise-reducing mechanisms in the formation and interpretation of gradients have been discussed in specific systems (11, 12, 13, 14, 15, 16, 17).
We recently asked a new question, one about robustness (18, 19): What are the best spatial profiles of morphogen concentrations for achieving the most efficient and reliable transfer of positional information to cells in the presence of noise? Fig. 1 illustrates the problem. Let us consider the following in silico experiment. In a two-dimensional tissue, cells in a region with an arbitrary shape (labeled T) need to differentiate from the rest of the tissue according to the concentrations of three morphogens. For given spatial profiles of these morphogens, the correspondence between position and a set of concentrations of the three morphogens is determined. It maps region T in real space to region T′ in chemical space where the coordinates are given by morphogen concentrations (see T1′ and T2′). If each cell on the plane can recognize its own position only through the morphogen concentrations (i.e., concentration coordinates) that it detects, and if the cell differentiates in accordance with those concentrations in an appropriate manner, then the target differentiation pattern T is realized in the tissue.
Our interest here is in the reproducibility of the differentiation pattern T when the spatial profiles of the morphogens are perturbed by noise. The right panels in Fig. 1 illustrate the reproducibility of the produced pattern when different spatial profiles are adopted in the presence of noise of the same magnitude (see subsection SA in the Supporting Material for a mathematical description of the problem). For spatial profile A, focal shape T is reproduced with reasonably high accuracy, whereas for profile B, it is not reproduced well. Note that although the magnitude of the noise is exactly the same in these two cases, the directions of the three morphogen gradients are different. This implies that the spatial profile of morphogen concentrations strongly affects the reliability of positioning based on those concentrations. It also suggests that there might exist a best spatial profile of morphogens that achieves the highest reliability.
From an engineering viewpoint, the above question about robustness is interpreted as an information coding problem: spatial coordinates in a tissue, that is, original information that should be transferred to cells, are converted into morphogen concentrations, and then the cells read-out the coordinates from the concentrations that they detect. In previous works, we discussed coding designs in a few simple situations. In the case without informational redundancy where two-dimensional positioning is achieved by two morphogens, we proved mathematically that orthogonal morphogen gradient vectors provide the highest precision of positional information at a focal region (orthogonality principle) (18). In the case with informational redundancy where one-dimensional positioning is achieved by two morphogens, we showed that the precision depends strongly on the relative directions of their gradients (i.e., opposite or identical), and that which is better is determined by the sign of the correlation of the noise associated with the two morphogens (19). Using these theoretical results, we identified the optimality of the mode of encoding adopted and evaluated the relative contribution of each morphogen to cell positioning in vertebrate limb bud development and in early patterning of the Drosophila embryo. However, the above coding designs were discussed under different mathematical formalisms restricted to specific situations.
In this article, we construct a general mathematical framework for the coding processes in multidimensional positioning by multiple morphogens. This framework includes our previous results as special cases. We clearly define some key concepts such as encoding, decoding, and positional information and its precision in terms of information and statistical theories. Based on this framework, we mathematically derive optimal coding designs that achieve the most reliable positioning during morphogenesis. We also show how those optimal designs can be biologically implemented by using some examples. We expect this study to provide important criteria in analyzing quantitative data on the spatial profiles of morphogens, which is becoming increasingly available as a result of recent advances in optical systems and imaging techniques.
Model
Information coding in computer science and developmental biology
The concept of information coding originates from the field of computer science (Fig. 2 A). The coding process consists of two steps: encoding and decoding. Original information (e.g., texts, images, movies) that a sender would like to transfer is converted into binary sequences by the sender's computer based on a rule. This conversion process is called encoding. The binary sequences are transferred to the receiver's computer through a channel, which is followed by their conversion into the information that the sender transferred (i.e., texts, images, or movies) based on another rule. The latter process is called decoding. By designing appropriate rules for encoding and decoding, the information transfer becomes efficient and reliable against noise, such as stochastic flips between 0 and 1, which may arise when information is transferred through channels (20).
Using the concept of coding in computer science, we can interpret in a similar manner the positioning by morphogens in developmental biology (Fig. 2 B). The original information to be transferred here is the spatial coordinates in a tissue (denoted by a vector x = (x1, …, xN) (N = 1, 2, or 3)), and it is converted into a set of morphogen concentrations (u = (u1, …, uM) (N ≤ M)) by an encoding rule u(x) (i.e., spatial profiles of morphogens). Biologically, the encoding rule is determined by various factors such as the configuration of morphogen sources, morphogen diffusivity, and organ geometry that specifies the boundary condition (21, 22, 23, 24, 25).
Decoding is the process to read-out morphogen gradients, which is to determine the correspondence between an observed set of morphogen concentrations u′ (the prime symbol indicates the concentration observed by cells) and position or its function (the caret symbol is used to distinguish from the original information x). If all developmental processes occurred in a deterministic manner without any noises, each cell would detect a unique set of morphogen concentrations depending on the position. The inverse map of the encoding rule u−1(x) is the natural decoding rule, by which each cell would exactly recognize its true position from the observed concentrations regardless of the encoding rule u(x) used.
In the presence of noise, however, the situation is not so simple, because the morphogen concentrations detected by a cell are not unique but fuzzy (see subsection SB in the Supporting Material for assumed origins of the noise). For a fuzzy input, a cell must decide its response. In other words, it must choose a decoding rule, the correspondence between u′ and . In terms of statistics, this is an estimation problem, and is the estimated position. We can regard as the mathematical definition of positional information. Because is a random variable, we can measure its precision or ambiguity by its variance or other statistics. Biologically, each decoding rule or an equivalent response is realized by appropriately choosing the structures of signaling networks and parameters for biochemical reactions. We will discuss the biochemical implementation of the decoding rule later.
The reliability of the information transfer depends strongly on the encoding and decoding rules adopted (Fig. 1). In the following section, we mathematically formulate the coding processes for the positioning by morphogens, and derive the best coding rules to achieve the maximum precision of positional information.
Mathematical formulation of positional information coding
Static variability in morphogen spatial profiles is one noise source. The probability that a cell located at x detects morphogen concentration u′ is modeled by a multivariate Gaussian probability distribution P(u′; u(x)), where u(x) is the population average over all embryos. We assume that the variance of the noise is not very large, and that u(x) is time-invariant. The latter is plausible because chemical reactions and diffusion processes are generally much faster than organ growth, and also because changes in morphogen source levels are slow (see subsection SB in the Supporting Material for a more general treatment of noise).
The goodness of coding, which is equivalent to the goodness of estimation or the precision of positional information, is measured in terms of the inverse of the generalized variance of , , where is the variance-covariance matrix of . Intuitively speaking, represents how the estimated position concentrates around the true position (see Fig. 3 A). Under plausible statistical assumptions (see subsection SB in the Supporting Material), Cramer-Rao's inequality gives us the upper limit of the precision of positional information for a given encoding rule u(x), as
(1) |
where I(x) is Fisher information matrix in which component (i,j) is defined as
(2) |
The upper limit in Eq. 1 is realized when the maximum likelihood (ML) estimate () is adopted as (i.e., ML decoding); thus, ML decoding is the best decoding rule. In the following analysis, we assume ML decoding, and then det[I (x)] indicates the precision of the positional information at position x. When the noise is not large, the precision is approximately given by
(3) |
where D is a M × N morphogen gradient matrix (i.e., Dij = ∂ui/∂xj) and ∑ is the variance-covariance matrix of noises added to u(x). This value strongly depends on the choice of encoding rules, that is, on the spatial profiles of morphogens u(x). Thus, our main purpose in the following analysis is to identify the best encoding rule that maximizes the precision of positional information (det[I (x)]) when the best decoding rule (ML decoding) is adopted (see subsection SC in the Supporting Material for a geometrical interpretation of det[I (x)] maximization).
Positioning performance vector
To aid in our understanding of the coding designs discussed in the following sections, we introduce a quantity that we call the positioning performance (PP) vector. The PP vector of morphogen i (denoted by ηi (x)) is defined as the morphogen's gradient vector divided by the magnitude of the noise at a focal location x (see Fig. 3 B):
(4) |
In one-dimensional positioning, the PP vector becomes a scalar whose sign indicates the gradient direction (we called this scalar “directional-PP” in the previous study (19)). We call the magnitude of PP vector ‖ηi (x)‖ the PP of the morphogen i. The PP of a morphogen is interpreted as the precision of positioning along direction ηi (x) provided by morphogen i. This can be understood easily by considering the one-dimensional positioning by a single morphogen. In that case, det[I(x)] = I(x) ≅ (du/dx)2/σ2 = η(x)2 holds, and thus PP is a natural measure of the precision of positioning along a single axis as has been adopted by some studies (4, 19). The PP (and the PP vector) is the generalized quantity necessary for evaluating multidimensional positioning. Note that the PP becomes larger as the morphogen gradient becomes steeper and the noise smaller.
Using PP vectors, the Fisher information matrix is rewritten as I(x) ≅ ET C−1 E and thus the precision of positional information is given by
(5) |
where C is the correlation coefficient matrix for the concentration noises ∑ (i.e., Cii = 1 and Cij = ρij (i ≠ j)), and E is a matrix whose component is ηij ≡ (∂ui/∂xj)/σi.
PP may, in general, depend on the position, but the dependence may not be very large. For example, PP of Bicoid, a major morphogen in Drosophila development, is almost constant within the spatial range in which Bicoid effectively works (see (4, 19) and see subsection SD in the Supporting Material).
Biochemical implementation of ML decoding
We here explain how to implement ML decoding biochemically. As an example, let us think of one-dimensional positioning by two morphogens (Fig. 3 C). Let us suppose that the spatial profiles of morphogens vary among embryos due to the variability of the morphogen source levels. Then the observed concentrations (u′,v′) by the cell located at x0 is distributed around its average (u(x0), v(x0)). When the variability of the source levels is not very large, the ML decoding (i.e., a correspondence between (u′,v′) and estimated position ) for cells located around x0 is approximately given by
(6) |
where wi = (ηi2 – ρη1η2)/(η12 + η22 – 2ρη1η2) (19). The values ηi and ρ are PP vectors (or directional-PP) and the correlation coefficient of the variability of the two morphogen source levels, respectively. In the following analysis, we assume that those values are spatially invariant constants (see subsection SD in the Supporting Material).
Equation 6 indicates that any sets of observed concentrations (u′,v′) satisfying define the contour of the positional value equivalent to x0 in the chemical space. Its direction at (u(x0), v(x0)) is given by T1 = (−w2/(dv/dx), w1/(du/dx)) (Fig. 3 D).
On the other hand, cellular responses such as the expression levels of target genes are given as functions of (u′,v′). For simplicity, we here consider a scalar function, F(u′,v′). Along the direction T2 = (−∂F/∂v, ∂F/∂u), which is perpendicular to gradF = (∂F/∂u, ∂F/∂v), there is no change of F (i.e., dF = 0). Therefore, if T2 is parallel to T1, F(u′,v′) can be regarded as an ML response:
(7) |
A class of functions, F(u,v) = F(v1/S/u) with S = w2β/w1α, satisfies Eq. 7. For example, the following generalized-Hill equation is one of the functions included in that class (Fig. 3 E),
(8a) |
where h and K are a Hill coefficient and a constant, respectively. Compared with the ordinary Hill equation, G(u) = uh/(uh + K0h), the dissociation constant in Eq. 8a, Kv1/S, depends on v. Because Eq. 8a is not defined at (u,v) = (0,0), we may consider the following function instead of Eq. 8a:
(8b) |
A biochemical interpretation of Eq. 8a is the promoter regulation of a target gene by the signal of u, in which the binding (or unbinding) between the signal and the promoter is repressed (or enhanced) by the signal of v (Fig. 3 F). An interpretation of Eq. 8b is the competitive regulation of a promoter in which the signals of u and v work as an activator and a repressor, respectively.
Fig. 3G shows an example of cellular responses. In this example, the purpose is the precise recognition of position around the center of an embryo. The thin red and blue curves show embryo-to-embryo variability of the gradients of two morphogens. The green curves are the outputs over embryos calculated by using Eq. 8a. The variability of the output is clearly smaller than that of the output calculated by the ordinary Hill equation G(u), shown by the gray curves. The orange curves with larger variability shows the output calculated by Eq. 8a but with parameter S ≠ w2β/w1α, which gives an example of non-ML decoding. This shows that the precision of spatial recognition is not improved even though multiple information sources are available if appropriate response functions and parameters are not chosen.
Similar discussions are possible for multidimensional positioning by multiple morphogens by replacing Eq. 6 with the following Eq. 9:
(9) |
Local and global precision of positional information
Our goal is to elucidate the optimal encoding rule (i.e., the optimal spatial profile of morphogen concentrations u(x) to maximize the precision of positioning). The precision is measured by det[I (x)] at each location x. Its value depends on x for a given u(x), and thus det[I (x)] is a local quantity. Eq. 5 indicates that det[I (x)] is determined by the relative orientation of different morphogen gradient vectors at a focal location for a given set of PPs (‖ηi (x)‖) and noise correlation (C) (see also Results, below). Because the curvature radius of contours of morphogen concentrations is generally much larger than the scale of each cell size, the relative orientation of the gradient vectors hardly changes around the location. Therefore, the local optimum for a location is also approximately optimum for its neighboring locations.
On the other hand, when we consider more-global situations (i.e., the scale much larger than the curvature radius of contours of morphogen concentrations), we need a global quantity to evaluate the precision of positional information in the whole region of a target tissue. The weighted average of det I[(x)] over a focal region Ω (denoted by ΨΩ) is a candidate of such a quantity:
(10) |
where |Ω| is the volume of the focal region, and W(x) is a weighting function. To achieve normal development, higher precision may be required in some parts (e.g., an undifferentiated region) than in other parts (e.g., an already differentiated region) of the tissue. As an example of the weighing function, we may set the weight as W(x) = 1 for the undifferentiated region and W(x) = 0 for the already differentiated region (see Morishita and Iwasa (18) as a biological application).
In the following analysis, we mainly focus on encoding rules that maximize the local precision det[I (x)]. But the local rules provide a basis for the calculation of global precision ΨΩ. We will discuss how the local rules relate to conditions for maximizing ΨΩ. Especially, in one-dimensional positioning, we will see that optimal encoding for a local region is consistent with that in a global sense if the spatial dependences of ηi (x) and ρij are not large. On the other hand, in two-dimensional or three-dimensional positioning, to derive general rules for maximizing the global precision is much more difficult because we need much information about geometries to calculate ΨΩ such as organ morphologies, shapes and spatial arrangement of morphogen sources. But once we fix the information we will be able to evaluate the global optimality of encoding by calculating the spatial dependence of det I[(x)]. We will illustrate this procedure by a biological example later (see Orthogonal Morphogen Gradient Vectors is the Best Encoding in the Absence of Informational Redundancy, below, and Fig. 6).
Results
Optimal encoding in one-dimensional positioning (N = 1)
We first consider encoding rules for one-dimensional positioning by multiple morphogens. We here assume that morphogen sources are located at either end of a line segment, representing a one-dimensional organ (or embryo, egg, etc.). Then, each encoding rule is defined by the combination of the source locations. For M morphogen species, there are 2M possible source location combinations in total.
If the noises associated with the different morphogens are uncorrelated with each other, the situation is quite simple: all of the encoding rules have the same performance, because the precision is just the sum of the squared PP of each morphogen (). If the different morphogens have the same PP, then the precision is proportional to the number of morphogen species, I(x) ∝ M (see subsection SE in the Supporting Material).
In contrast, when morphogens share signaling pathways relating to, for example, their synthesis, degradation, or modification, the noise associated with different morphogens are likely to be correlated (19). Then, the precision can be improved by choosing appropriate source arrangements for a given set of correlation coefficients. In the following analysis, we summarize the results on the best encoding for given correlations ρij and PPs |ηi| for M species of morphogens.
Two morphogens
In the case of two morphogens, I(x) = (η12 + η22 – 2ρη1η2)/(1 − ρ122). Thus, the better source arrangement to improve the precision at x in an embryo is determined only by the sign of the noise correlation ρ12 (x). For positive correlation (ρ12 > 0), oppositely directed gradients (i.e., η1η2 < 0) are better than the alternative, and vice versa for negative correlated noise (Fig. 4 A). The probability distribution of the estimated position, , becomes sharper (i.e., a more reliable information transfer is achieved) when the relative directions of the gradients is appropriately chosen.
We note that this encoding rule is for maximizing the precision of positional information at a focal location x. But it is also the best for maximizing the precision at any locations (thus, for maximizing global precision, ΨΩ (see Eq. 10)) if ρ12 does not depend on x (see subsection SD in the Supporting Material and (19)).
Three or more morphogens
The best source arrangement in the case of three or more morphogens depends not only on the signs of the noise correlations but also on the PP of each morphogen. For each morphogen pair (i, j), we can calculate the impact of the relative directions of the gradients of morphogens i and j on the precision (denoted by |ζij|). For example, in the case of three morphogens (M = 3), ζij = (ρjkρki −ρij)|ηiηj| (k ≠ i, j) hold (see subsection SF in the Supporting Material). The best encoding to maximize the precision of positional information at a location is uniquely determined by the combination of the signs (sgn) of ζ for the two pairs with large impact at the location (e.g., sgn(ζ12) and sgn(ζ23); Fig. 4 B). We should note again that this encoding design is also the best for maximizing global precision if the impact |ζij| (i.e., ρ and |η|) can be regarded as constant. Similarly, better encoding for M > 3 is realized by considering the relative source locations of morphogen pairs with larger impact (see subsection SF in the Supporting Material).
Optimal encoding in multidimensional positioning (N = 2 or 3)
In multidimensional positioning, the spatial pattern of morphogen concentrations can be characterized by the relative directions of morphogen gradient vectors at each location (Fig. 5). Thus, infinitely many ways of encoding are possible, which is a major difference from one-dimensional positioning.
Orthogonal morphogen gradient vectors is the best encoding in the absence of informational redundancy
When the number of morphogens (M) is the minimum needed for N-dimensional positioning (i.e., M = N), there is no informational redundancy. In this case, the precision of positional information at each location is proportional to det [E]2, i.e., the squared area spanned by two PP vectors in two-dimensional positioning, or the squared volume by three PP vectors in three-dimensional positioning (Fig. 5 A). In both cases, the precision is maximized when the gradient vectors are orthogonal, regardless of the PP values and the noise correlations: becomes sharper as the angle made by the gradient vectors approaches 90° (Fig. 5 B).
In our previous study, in which the ambiguity of positional information was measured by the information entropy for (18), we also found orthogonal gradient vectors to be the best encoding. (See subsection SG in the Supporting Material for the relationship between information entropy and the Fisher information matrix.)
Intuitively, it may seem trivial that the orthogonal crossing of the gradient vectors gives the maximum precision at a focal point (i.e., local precision det I[(x)]) in a tissue. In many situations, however, achieving the orthogonal crossing globally (i.e., in a wide region in the tissue) may not be easy. A straight way to achieve it is to adopt linear sources of morphogens crossing orthogonally each other (Fig. 6 A). Then orthogonal crossing of gradient vectors is achieved everywhere. In the wing disk during Drosophila development, this idea appears to be adopted. The morphogen sources of two typical morphogens, Decapenta-plegic (Dpp) and Wingless (Wg), is crossing orthogonally (Fig. 6 A). Because quantitative analysis of spatial profiles of Dpp and Wg is advanced (21, 26), it will be possible in the near future to evaluate the goodness of encoding by examining the spatial dependences of ηi (x) and det I[(x)].
In contrast, when the shapes of morphogen sources and tissue boundaries have large curvature, the contours of morphogen concentrations also become curved. How much and where the orthogonality between morphogen gradient vectors is achieved strongly depends on those shapes and the arrangement of locations of morphogen sources. For example, in the limb development of vertebrates such as chicks and mice, typical information sources—such as the zone of polarizing activity (ZPA) whose molecular substance is Sonic hedgehog (Shh), and the apical ectodermal ridge (AER) from which Fibroblast growth factor (Fgf) and Wnt are released—have curved shapes (Fig. 6 B). We calculated the spatial profiles of ZPA and AER signals by assuming simple diffusion and linear degradation of the diffusive chemicals.
For the arrangement of morphogen sources in the wild-type, the angle between the two gradients is nearly orthogonal in a wide range of tissues, leading to higher precision of positional information globally. On the other hand, when the location of one morphogen source (ZPA) is perturbed from the original position, the orthogonality is lost remarkably in a wide range (bottom panels in Fig. 6 B). Because the anterior-posterior information provided by the ZPA signal during early phases of limb development is critical for the establishment of different digit identities (27), this result implies that better coding design is adopted to realize robust positioning—done by improving the global precision ΨΩ in Eq. 10 against the variability of spatial profiles of morphogen concentrations (see also our previous study (18)).
The best encoding in the presence of informational redundancy (M > N)
As with one-dimensional positioning, informational redundancy can improve the precision of positional information if encoding rules appropriate for the given noise are chosen. In this case, the precision of two-dimensional (or three-dimensional) positioning is the sum of squared areas (or volumes) spanned by all possible MC2 pairs of {η′i, η′j} (i < j) (or MC3 triplets of {η′i, η′j, η′k} (i < j < k)), where η′α is the weighted linear sum of the PP vectors, , and C−1/2 is a matrix defined as C−1/2C−1/2 = C−1. This is a natural extension of the case without informational redundancy (see Fig. 5, C and H). However, orthogonal gradient vectors may not represent the best encoding to maximize the precision.
We mainly examine two-dimensional positioning in detail because the visual understanding of results is easier. After that, the three-dimensional case is discussed. The two-dimensional results can help us understand intuitively the best encoding rule in three-dimensional positioning.
Symmetrically crossing of gradient vectors is the best encoding when different morphogens have the same PP and their noises are independent of each other
For uncorrelated noises, η′i = ηi holds. Thus, the precision is the sum of the squared areas spanned by all possible pairs of PP vectors (Fig. 5 C). Further, if different morphogens have the same PP, the precision is maximized when the relative directions of the morphogen gradients are arranged symmetrically. For example, in the case of M = 3, the best encoding is to arrange the gradient vectors so that the angles between each pair of adjacent vectors is 120° (see Fig. 5, D–F, and see subsection SH in the Supporting Material for details). In general, when the best encoding is adopted, the precision is proportional to the square of the number of morphogen species, that is, det[I(x)] ∝ M2 (see subsection SE in the Supporting Material).
Asymmetrically crossing gradient vectors
When different morphogens have different PPs or when the noises associated with them are correlated, the best encoding may shift from the symmetric arrangement. Here, we show the two simplest examples with M = 3:
In the first case, the PP of one of the three morphogens is larger or smaller than the PPs of the other two (e.g., ‖η1‖ = γ ‖η2‖ = γ ‖η3‖), and the noises associated with the three morphogens are uncorrelated (Fig. 5 G). When the PP of morphogen 1 is much larger (γ >> 1), the maximum precision is achieved when the angle between the gradient vectors of morphogens 1 and 2 (or 3) is 90°. When the PP of morphogen 1 is much smaller (γ << 1), the best encoding is when the angle of gradient vectors of morphogens 2 and 3 approaches 90°.
In the second case, the three morphogens have the same PP and only ρ23 is nonzero (Fig. 5, I and J). As the magnitude of correlation |ρ23| increases, the optimal angle between the gradient vectors of the morphogens with uncorrelated noises (i.e., 1 and 2 (or 3)) approaches 90°. A typical characteristic of this case is that the preferred relative angle between the gradients of a pair of morphogens with correlated noises (i.e., 2 and 3) depends on the sign of the correlation: when the correlation is positive an obtuse angle (i.e., oppositely directed) is preferred, and when it is negative an acute angle (identically directed) is preferred. This result is consistent with one-dimensional positioning with correlated noises (see the results of one-dimensional case, above). Tendencies similar to these results were also observed for more general situations.
The upper limit of the precision increases as the magnitude of correlation increases, indicating that the precision can be improved by increasing the correlation of noises, as well as by increasing the informational redundancy (see subsection SI in the Supporting Material).
Three-dimensional positioning
Analytical solution of the best encoding rule for three-dimensional positioning is much more complicated than the two-dimensional solution. Therefore, we numerically examined some specific cases, and confirmed that conditions to improve the precision of positional information in three-dimensional positioning are similar to those in the two-dimensional case:
-
1.
A symmetric arrangement of morphogen gradients gives the best encoding when the different morphogens have the same PP and their noises are independent of each other, where det[I(x)] ∝ M3 holds (see subsection SE in the Supporting Material).
-
2.
Increasing the magnitude of the correlation between noises associated with morphogens and increasing informational redundancy are both possible ways to improve the precision of positional information.
Summary and Discussion
Summary of this study
In this study, we mathematically formulated coding processes of positional information and defined key concepts such as encoding and decoding. The goodness of coding, i.e., the reliability of positioning, is determined by the choice of encoding and decoding rules. According to Cramer-Rao's inequality, the best decoding rule is ML decoding. Then the precision of positional information at a focal location x can be measured by det[I(x)]. We determined mathematically the best encoding rules to maximize det[I(x)] for several different situations. We also discuss the relationship between local (det[I(x)]) and global (ΨΩ) precision, and present the biochemical implementation of ML decoding by showing some examples.
Suggestions for data analysis: a possible procedure to validate coding optimality
Deriving the best coding design is useful because it provides a criterion by which to evaluate experimental data. Fig. 7 shows a possible procedure of data analysis to validate coding optimality formulated in this study (see subsection SJ in the Supporting Material for details). Through the procedure, we will be able to quantitatively answer some parts of the following questions: How is good coding adopted in real biological systems, and when, where, and to what extent does each morphogen contribute to positioning?
Future works
In our analysis, ML decoding was assumed in discussing the best encoding rules. However, if there are constraints in realizing decoding mechanisms by biochemical reactions, the best encoding may change according to the choice of decoding rule. Considering such constrained situations, and discussing the evolution of a pair of encoding and decoding mechanisms, are interesting and important topics for future work.
Further, in this study, optimal coding rules are derived for static situations, i.e., the tissue growth and the changes of average spatial profiles of morphogen concentrations are neglected. This quasiequilibrium assumption may be a good approximation in some situations. However, to understand more dynamic aspects of organ morphogenesis, we need to extend the mathematical framework developed in this study to a nonsteady version, i.e., a design of space-time and cross-stage information coding. It is, of course, a big challenge but is a very important issue not only for theoretical interest but also for interpreting the large amount of spatiotemporal information about molecules, geometries, and even mechanics that will be available in the near future.
Understanding robust coding of space-time information will also be important in the field of synthetic developmental biology. The main focus of research in current synthetic biology is the construction of genetic circuits within cells (28). However, more-advanced applications will require the regulation of the spatial structures created by multicellular systems. Proposing the design of space-time coding will provide useful insights into this problem.
Acknowledgments
We thank Dr. Takayuki Suzuki for providing pictures of chick limb development.
This work was supported by a PRESTO grant from Japan Science and Technology Corporation to Y.M.
Editor: Andre Levchenko.
Footnotes
Ten subsections, three figures, and supporting equations are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(11)01179-9.
Supporting Material
References
- 1.Wolpert L. 3rd Ed. Oxford University Press; New York, NY: 2007. Principles of Development. [Google Scholar]
- 2.Gilbert S.F. 9th Ed. Sinauer Associates; Sunderland, MA: 2006. Developmental Biology. [Google Scholar]
- 3.Houchmandzadeh B., Wieschaus E., Leibler S. Establishment of developmental precision and proportions in the early Drosophila embryo. Nature. 2002;415:798–802. doi: 10.1038/415798a. [DOI] [PubMed] [Google Scholar]
- 4.Gregor T., Tank D.W., et al. Bialek W. Probing the limits to positional information. Cell. 2007;130:153–164. doi: 10.1016/j.cell.2007.05.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Megason S.G., Fraser S.E. Imaging in systems biology. Cell. 2007;130:784–795. doi: 10.1016/j.cell.2007.08.031. [DOI] [PubMed] [Google Scholar]
- 6.Bollenbach T., Pantazis P., et al. Jülicher F. Precision of the Dpp gradient. Development. 2008;135:1137–1146. doi: 10.1242/dev.012062. [DOI] [PubMed] [Google Scholar]
- 7.Eldar A., Dorfman R., et al. Barkai N. Robustness of the BMP morphogen gradient in Drosophila embryonic patterning. Nature. 2002;419:304–308. doi: 10.1038/nature01061. [DOI] [PubMed] [Google Scholar]
- 8.Kicheva A., Pantazis P., et al. González-Gaitán M. Kinetics of morphogen gradient formation. Science. 2007;315:521–525. doi: 10.1126/science.1135774. [DOI] [PubMed] [Google Scholar]
- 9.Kanodia J.S., Rikhy R., et al. Shvartsman S.Y. Dynamics of the dorsal morphogen gradient. Proc. Natl. Acad. Sci. USA. 2009;106:21707–21712. doi: 10.1073/pnas.0912395106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.He F., Wen Y., et al. Ma J. Probing intrinsic properties of a robust morphogen gradient in Drosophila. Dev. Cell. 2008;15:558–567. doi: 10.1016/j.devcel.2008.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bollenbach T., Kruse K., et al. Jülicher F. Robust formation of morphogen gradients. Phys. Rev. Lett. 2005;94:018103. doi: 10.1103/PhysRevLett.94.018103. [DOI] [PubMed] [Google Scholar]
- 12.Howard M., ten Wolde P.R. Finding the center reliably: robust patterns of developmental gene expression. Phys. Rev. Lett. 2005;95:208103. doi: 10.1103/PhysRevLett.95.208103. [DOI] [PubMed] [Google Scholar]
- 13.Eldar A., Rosin D., et al. Barkai N. Self-enhanced ligand degradation underlies robustness of morphogen gradients. Dev. Cell. 2003;5:635–646. doi: 10.1016/s1534-5807(03)00292-2. [DOI] [PubMed] [Google Scholar]
- 14.Veitia R.A., Nijhout H.F. The robustness of the transcriptional response to alterations in morphogenetic gradients. Bioessays. 2006;28:282–289. doi: 10.1002/bies.20377. [DOI] [PubMed] [Google Scholar]
- 15.Tostevin F., ten Wolde P.R., Howard M. Fundamental limits to position determination by concentration gradients. PLoS Comput. Biol. 2007;3:e78. doi: 10.1371/journal.pcbi.0030078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ashe H.L., Briscoe J. The interpretation of morphogen gradients. Development. 2006;133:385–394. doi: 10.1242/dev.02238. [DOI] [PubMed] [Google Scholar]
- 17.Hirashima T., Iwasa Y., Morishita Y. Distance between AER and ZPA is defined by feed-forward loop and is stabilized by their feedback loop in vertebrate limb bud. Bull. Math. Biol. 2008;70:438–459. doi: 10.1007/s11538-007-9263-4. [DOI] [PubMed] [Google Scholar]
- 18.Morishita Y., Iwasa Y. Optimal placement of multiple morphogen sources. Phys. Rev. E. 2008;77:041909. doi: 10.1103/PhysRevE.77.041909. [DOI] [PubMed] [Google Scholar]
- 19.Morishita Y., Iwasa Y. Accuracy of positional information provided by multiple morphogen gradients with correlated noise. Phys. Rev. E. 2009;79:061905. doi: 10.1103/PhysRevE.79.061905. [DOI] [PubMed] [Google Scholar]
- 20.Cover T.M., Thomas J.A. 2nd Ed. Wiley-Interscience; Hoboken, NJ: 2006. Elements of Information Theory. [Google Scholar]
- 21.Teleman A.A., Cohen S.M. Dpp gradient formation in the Drosophila wing imaginal disc. Cell. 2000;103:971–980. doi: 10.1016/s0092-8674(00)00199-9. [DOI] [PubMed] [Google Scholar]
- 22.Strigini M., Cohen S.M. Wingless gradient formation in the Drosophila wing. Curr. Biol. 2000;10:293–300. doi: 10.1016/s0960-9822(00)00378-x. [DOI] [PubMed] [Google Scholar]
- 23.Lander A.D., Nie Q., Wan F.Y.M. Do morphogen gradients arise by diffusion? Dev. Cell. 2002;2:785–796. doi: 10.1016/s1534-5807(02)00179-x. [DOI] [PubMed] [Google Scholar]
- 24.Kornberg T.B., Guha A. Understanding morphogen gradients: a problem of dispersion and containment. Curr. Opin. Genet. Dev. 2007;17:264–271. doi: 10.1016/j.gde.2007.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Gonzales-Gaitan M. Signal dispersal and transduction through the endocytic pathway. Natl. Rev. 2003;4:213–224. doi: 10.1038/nrm1053. [DOI] [PubMed] [Google Scholar]
- 26.Marois E., Mahmoud A., Eaton S. The endocytic pathway and formation of the Wingless morphogen gradient. Development. 2006;133:307–317. doi: 10.1242/dev.02197. [DOI] [PubMed] [Google Scholar]
- 27.McGlinn E., Tabin C.J. Mechanistic insight into how Shh patterns the vertebrate limb. Curr. Opin. Genet. Dev. 2006;16:426–432. doi: 10.1016/j.gde.2006.06.013. [DOI] [PubMed] [Google Scholar]
- 28.Andrianantoandro E., Basu S., et al. Weiss R. Synthetic biology: new engineering rules for an emerging discipline. Mol. Sys. Biol. 2006 doi: 10.1038/msb4100073. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.