Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2024 Feb 29;20(2):e1011867. doi: 10.1371/journal.pcbi.1011867

Developmental hourglass: Verification by numerical evolution and elucidation by dynamical-systems theory

Takahiro Kohsokabe 1, Shigeru Kuratanai 1, Kunihiko Kaneko 2,*
Editor: Philip K Maini3
PMCID: PMC10903806  PMID: 38422161

Abstract

Determining the general laws between evolution and development is a fundamental biological challenge. Developmental hourglasses have attracted increased attention as candidates for such laws, but the necessity of their emergence remains elusive. We conducted evolutionary simulations of developmental processes to confirm the emergence of the developmental hourglass and unveiled its establishment. We considered organisms consisting of cells containing identical gene networks that control morphogenesis and evolved them under selection pressure to induce more cell types. By computing the similarity between the spatial patterns of gene expression of two species that evolved from a common ancestor, a developmental hourglass was observed, that is, there was a correlation peak in the intermediate stage of development. The fraction of pleiotropic genes increased, whereas the variance in individuals decreased, consistent with previous experimental reports. Reduction of the unavoidable variance by initial or developmental noise, essential for survival, was achieved up to the hourglass bottleneck stage, followed by diversification in developmental processes, whose timing is controlled by the slow expression dynamics conserved among organisms sharing the hourglass. This study suggests why developmental hourglasses are observed within a certain phylogenetic range of species.

Author summary

Understanding the intriguing relationship between development and evolution in multicellular organisms has long been a challenge in biology. A recent hypothesis called the developmental hourglass proposes that there is a conserved middle stage during development across species of the same animal group. Despite growing evidence supporting this hypothesis, the underlying mechanisms and reasons for its emergence have remained elusive due to limited experimental data.

To address this gap, we employed numerical evolution of gene regulation networks controlling pattern formation. Remarkably, our simulations revealed that species that diverged relatively recently in phylogeny displayed the highest similarity during the middle stage of development, which gradually diminished as they diverged further phylogenetically. Our findings satisfied not only the criteria of the developmental hourglass but also confirmed several essential characteristics of the developmental hourglass reported in recent experiments. Through theoretical analysis, we further demonstrated that the emergence of the developmental hourglass could be attributed to the acquisition of genes that change slowly and govern developmental processes, which also foster the robustness of development.

By integrating computational simulations, theoretical insights, and previous experimental evidence, our study thus provides a comprehensive understanding of the developmental hourglass, which will unravel the intricate relationship between development and evolution.

Introduction

Elucidating the possible relationships between evolution and developmental processes is a fundamentally unsolved issue in biology. Nothing to speak of those early works of 19th-century embryologists and evolutionists such as von Baer [1], Darwin [2], and Haeckel [3]; even today, the search for the general laws of the evolution of developmental processes remains one of the most important challenges in the field of evolutionary developmental biology. As a candidate for such a general law, the “developmental hourglass” model has gathered increased attention. According to the developmental hourglass model, multicellular species belonging to the same phylum have the highest embryonic similarity at a particular intermediate period of their development, when the morphological elements characterizing the phylum are formed.

The validity of the developmental hourglass model was first noted based on the coincidence between the phylotypic stage of vertebrates and the period of temporal and spatial progression of the hox genes [46] and has been quantitatively verified for various taxa by bioinformatics analyses over the past two decades (for chordates [712]; for echinoderms [1315]; for arthropods [16,17]; for nematodes [18,19]; for mollusks [20] and also suggested beyond the animal kingdom, for flowering plants [21]; for fungus [22]). One may state that the observed data support a developmental hourglass, at least in some phyla.

Despite the growing evidence for the developmental hourglass model, whether it is just a contingency in the evolution of multicellular organisms or a necessity therein, is still in question. In other words, the possible mechanisms underlying the evolutionary acquisition of mid-embryonic conservation remain elusive. Recent experimental studies have concluded that the developmental hourglass is not sufficiently explained by fragility in the bottleneck stage [23,24], but rather, developmental constraints or limitations on phenotypic variability are suggested to be the main driving force of the emergence of the developmental hourglass [23,25]. The relevance of the self-organization processes to the formation of the hourglass was discussed (as well as the origin of the life cycle of multicellular organisms [26]), whereas there is no consensus on the mechanism for the developmental hourglass.

Generally, it is challenging to solve such fundamental issues in the evolution of developmental processes by means of experimentally available data, and the search for mechanisms of the developmental hourglass is no exception. As another approach to this challenge, simulation studies on the evolution of developmental processes have been conducted over the past 20 years. In this case, one can repeat virtual evolution while retaining all the information, such as gene regulatory networks and developmental dynamics. Hypotheses about the possible relationships between development and evolution can then be tested. Theoretical hypotheses have been proposed using this approach regarding the structure of gene regulatory networks that induce segmentation [2729], testing plausible evolutionary scenarios of segmentation mechanisms [30,31], the relationship between developmental and mutational robustness [32], and congruence in the developmental and evolutionary processes in both RNA [33] and gene expression patterns [34].

In this study, we conducted an evolutionary simulation to investigate the mechanism behind the establishment of a developmental hourglass. For this purpose, we adopted a dynamical-system scheme to generate spatial patterns, which is commonly adopted in theoretical studies [27,2931,34]. The model for the simulation can represent cellular states that are determined by gene expression patterns governed by gene regulatory networks and spatial cell-cell interactions, leading to the differentiation of cell types and pattern formation. Individuals must be selected according to a biologically relevant fitness function. We assumed that if there are more cell types within an individual, they attain more functions such that the fitness is higher. This fitness function reflects the idea that different cell types have different cellular functions so that species with many cell types can deal with multiple tasks; hence, they are more adaptive than those with fewer cell types. Each cell type was defined according to the on or off state of the given output genes (see Fig 1B for a schematic representation). In other words, our simulation mimics competition for survival, where individuals acquiring as many cell types as possible are the most adaptive and would leave more offspring.

Fig 1. Schematic of the model used in this simulation.

Fig 1

(A) Each individual, which is a unit of selection, contains cells aligned in a one-dimensional space. These contained the same gene network. The state of each cell, given by the gene expression pattern, changes according to identical gene networks. There were 100 individuals in the evolving population in each generation. (B) Starting from the initial states, the gene expression patterns change according to the gene regulation network. After sufficient time, the expression of each cell reaches the terminal state, on or off. Cell type was determined according to the on/off patterns of the target genes. For the four target genes, there were 24 possible states, as shown in the figure. The fitness of an individual is determined by the number of different cell types in the terminal state. (In the figure, for instance, it is 6). (C) The selection procedure of species diverged from a common ancestor. The left chain represents the result of the first step of evolutionary simulation to select one species. Each circle represents the ancestor at each generation (although significantly abbreviated). As the second step, an ancestor at a certain generation was cloned, and additional evolutionary simulations were carried from the ancestor up to the same generations of the original (1000).

In the present paper, we use the term "species", not rigorously following its biological definition. It just refers to organisms with different networks (i.e., distinct gene sets), which are branched by earlier generations in the evolution. This is a bit sloppy in terms of the definition of species in biology. Still, we need to distinguish individual organisms developed by different networks from those developed by the identical gene network (i.e. clones under noise) for the following analysis. Accordingly, we need a term distinguishable from just the individuals. Hence, we use the term "species" in the above sense, when needed.

To study the developmental hourglass, the developmental processes of extant species that diverged from a common ancestor must be compared. Thus, the model must represent not only a mother-to-daughter single-chain pedigree but also be able to examine a phylogenetic tree that contains multiple pedigrees as branches. To achieve this, a two-stage evolutionary simulation was performed. In the first stage, an evolutionary simulation of one lineage was performed to obtain one species by selection. In the second stage, evolutionary simulations were executed over the same number of generations, using an ancestor in the lineage of the first-step simulation. Using different random numbers for mutation, extant species branching from the common ancestor were obtained (see Fig 1C for a schematic representation).

Using the above setup, we aimed to examine the correlation between gene expression patterns and the development of multiple species that diverged from a common ancestor. The results showed a developmental hourglass in the sense that the peak of such correlations lies in the middle of development. We also confirmed other reported characteristics, including accumulation of expressed pleiotropic genes and reduction of intra-species gene expression variance at the developmental bottleneck stage. The developmental period of the hourglass bottleneck obtained in the simulations was determined by the genes whose expression changed slowly during development, acting as genes to schedule development. The bottleneck serves as a developmental foundation that sufficiently attenuates the effects of the initial noise to prepare for subsequent developmental processes. Based on these results, the validity of the developmental hourglass and possible mechanisms for its emergence were discussed.

Results

Herein, we report the results of the simulation of a model based on the three policies described in the introduction. We assumed that the expression dynamics of each gene at each cell location are governed by the gene regulation network (GRN), while some gene products diffuse through the space. There are two specific morphogens with spatial gradients. These spatial gradients give input to morphogenetic genes. Under those morphogen inputs, the gene expression pattern reached a terminal steady state after sufficient ‘developmental’ time according to gene expression dynamics governed by the GRN, where the expression of each gene reached an on or off state. Depending on the combination of on/off expression patterns of the given output genes, a cell type is determined (here, we choose four output genes so that a maximum of 16 cell types could exist). Each cell type carries on some functions such that the fitness is given by the number of cell types that coexist over the space in the terminal state. GRNs with higher fitness than others will be selected with a higher probability, whereas slight random changes are introduced as “mutations” for replication to generate offspring. This simulation mimics evolutionary competition for survival, in which individuals that acquire as many cell types as possible are the most adaptive and leave more offspring. To represent multiple lineages that diverged from a common ancestor, additional evolutionary simulations of ancestral species were performed (see Methods for details).

Fig 2A shows specific examples of the developmental dynamics of two individuals acquired from evolutionary simulation after 1000 generations. The horizontal axis is the developmental time, and the vertical axis is the index indicating the cell location in a one-dimensional line. Different colors indicate different cellular states. The two individuals diverged from a common ancestral individual in the 800th generation. For precision, in this case, after selecting individual A1 in the primary simulation of 1000 generations, the ancestral individual of the 800th generation was chosen, and an additional simulation was performed over 200 generations (i.e., 1000 generations in total, equal to the generations in the primary simulation). Thus, owing to the secondary simulation, individual A2 was selected to represent different species diverged from a common ancestor.

Fig 2. Examples of the evolutionary simulation that showed hourglass-like developmental correlation.

Fig 2

(A) Two examples of gene expression dynamics of evolved GRNs (A1, A2). The color represents the cellular state, i.e., the expression (On/Off) pattern of four output genes. The vertical axis is the spatial index of cells and the horizontal axis is developmental time. Spatial pattern transitions were more frequent in the early developmental time, and a horizontal log plot was used to adequately depict them. This is consistent with experimental observations. Thereafter, the developmental time was plotted in logs. Following evolving 800 generations, the two GRNs evolved from a common ancestor of the 800th generation and then simulated further over 200 more generations, while suffering from different genetic changes, such that the two have diverged from the common ancestor. (B) The heat map of the similarity indicator, comparing the developmental dynamics of the two individuals given in A. The horizontal axis is the developmental time of the individual A1, and the vertical axis is that of A2. Both axes are on a logarithmic scale. Each point indicates the similarity of gene expression states between the two examples at specific time points. (As the timescale of development can be different between the two in general, the comparison at different time points is included.) The green line indicates the locations where the similarity indicator takes maximum for each developmental time step of individual A1 (horizontal axis). Note that the high similarity region is approximately 100–200 in either individual developmental time, which is neither the beginning nor the end of the developmental dynamics. (C) Projection of the similarity indicator on the green line in B against the developmental time of the first species (i.e., the horizontal axis in B).

By examining the ontogenetic change of spatial patterns with the differentiation of cell types, it was noted that there are some phases within which patterns change only slightly in ontogeny, and between them, drastic changes occur. Considering this case, there were three phases: early, intermediate, and late. By comparing the development of the two species (i.e., individuals with distinct GRNs), it may be difficult to judge at a glance in which phase the two are most similar to each other (Fig 2A). Thus, we quantitatively compared the developmental processes of the two individuals to examine the similarity in patterns between them throughout the course of development. The differences in gene expression between the species were computed across each cell of the same spatial location, and the inverse of the sum of the differences was used as an indicator of the similarity in gene expression patterns between the two individuals. This indicator is similar to those used in experimental studies [915,18,19], in which the average expression over the whole cell is adopted by transcriptome analysis whereas it is superior in that it includes spatially local information. (Note, however, that the same result was obtained even if we adopted the spatially averaged indicator as adopted in the experiments, see also S1 Fig).

Fig 2B displays the similarity between the gene expression states of individuals A1 and A2 at each developmental time. The indicator of similarity explained above was computed at each time point, as shown in the heat map. The horizontal axis represents the developmental time of individual A1, and the vertical axis represents that of individual A2. Fig 2C displays the time course of similarity, which indicates that the peak of the similarity is in the middle of the developmental dynamics (see also S2 Fig).

The similarity in gene expression patterns, as shown in Fig 2B and 2C, can be calculated for any two species obtained in this simulation. Additional examples are shown in the SF2. As long as the compared species were phylogenetically close, we observed the peak of the similarity between two species that diverged from a common ancestor, always in the middle of developmental time, which is consistent with the developmental hourglass [915,18,19].

Next, we examined the phylogenetic range within which the above similarity peak was observed by comparing the two species that diverged from earlier common ancestors. Fig 3 presents how the time course of the above similarity changes as the generations that the two species diverged earlier in the phylogenetic tree, where the similarity was computed between the original species and that departed at different branching points in the left phylogenetic tree (A, B, C). It was plotted against the developmental time of the original species, as shown in Fig 2C.

Fig 3. The similarity between two species that departed from a common ancestor at given earlier generations in the phylogenetic tree.

Fig 3

The similarity is computed as in Fig 1B and 1C, the average over pairs of individuals. The horizontal axis represents the developmental time of the original species, and the vertical axis represents the similarity between the spatial patterns of gene expression. The line color denotes how far the two species are, in comparison, branched in the left phylogenetic tree, diverged from the 800th generation A(blue), 500th generation B(green), and 200th generation C(red). The horizontal axis was plotted on a logarithmic scale. At the intermediate developmental time, the similarity indicator takes a maximum when the pair branched at the later stage (in this example the peak exists up to ~500, whereas including other examples, the peaks exist for the pairs branched at the recent 100–300 generations). Typically, the closer the phylogenetic pair of individuals, the more prominent the peak in the similarity indicator.

The peak of similarity in the intermediate developmental stage was most prominent in comparison with the species departed at branch A (i.e. separated at the most recent 200th generation), and observed for B, albeit a bit weaker (separated at the 500 generations before), whereas in C, that is, the comparison between species separated much earlier in the tree (at the 800th generation), the middle peak disappears (see S3 Fig, how the intermediate peak diminishes, as the point where the two species were increased per 100 generations). Hence, the similarity peak in the intermediate developmental time is preserved for comparison with the species that have diverged up to a certain depth range in the phylogenetic tree, but if they departed from a much farther ancestor, the peak disappeared.

Note that the common similarity peak is also observed even between the evolved individuals and their ancestors, for instance, as seen in S4 Fig, where we compared the species at the 1000th generation and its ancestor at the 500th generation. This will be because the mechanism leading to the similarity in the intermediate developmental stage, as will be discussed later, has been conserved after the acquisition and will be preserved after maximal fitness is achieved by evolution.

Next, we examined whether the present simulation results also contain other properties reported in the development of the hourglass-like conservation pattern.

In [11], it was observed that genes expressed in multiple stages during development (pleiotropic genes) were predominantly expressed at the bottleneck middle stage. Thus, we defined pleiotropic genes in our simulation and computed the ratio of the number of expressed pleiotropic genes to the total expressed genes. To compute it, we split developmental time into 12 stages and defined the “pleiotropic genes” as those that are expressed for more than half of the stages. Then, the fraction of expressed pleiotropic genes at each stage was plotted over the developmental stages. As shown in Fig 4, the fraction had a peak at the bottleneck time shown in the blue band. This increase in expressed pleiotropic genes is consistent with the experimental data (even though one might see arbitrariness in the above criterion for pleiotropic genes, the maximal fraction of expressed pleiotropic genes at the bottleneck time is insensitive to the details of the criterion, such as the threshold to be judged as expressed, and the way of partition of developmental time to stages).

Fig 4. The time course of the fraction of the expressed pleiotropic genes.

Fig 4

The pleiotropic genes are defined as those that are expressed for more than half of the developmental stages, whereas the stage was defined as splitting developmental time before reaching the stationary state into equal 12 spans, either by linear scale (the plot of purple (+) points) or by the logarithmic scale (the green (×) points). The fraction is obtained by averaging over 1000 individuals of different initial conditions, whereas the variance from the average was quite small. The area colored by blue indicates the typical range of developmental bottleneck (average ± standard deviation), which is given by comparison over other individuals diverted 200 generations earlier.

The recent experimental report shows that the difference in gene expression patterns between clones is minimal at the developmental hourglass’s bottleneck [35,36]. We then tried to confirm if this characteristic is available from the simulation results and calculated the difference in the phenotypes (gene expression patterns) of clones of the same species, due to variation in initial conditions. This phenotypic variation of clones (i.e., the averaged gene expression difference between individuals from the same species), was computed in our model by taking different initial conditions (see Methods) is shown in Fig 5, against the developmental time in the logarithmic scale. The difference between individuals decreased up to a certain developmental time and then increased. The time that the variation took a minimum agrees with the time that the similarity in gene expression takes a maximum, i.e., the bottleneck of the developmental hourglass. (In other words, the average similarity between clone individuals takes maximal, statistically). After the variance takes minimal at the hourglass bottleneck time, it increases, as shown in Fig 5. In most cases, the increase is rather gradual towards the final stage of development, whereas a small peak is observed in some samples. As will be discussed below, due to a slight difference in the timing of the onset of the rise in the slow gene expression), which causes enhancement of the variance in some gene expression.

Fig 5. The variance of gene expression patterns over 1000 clones, i.e., those generated from the same genotype but with different initial conditions.

Fig 5

The horizontal axis represents developmental time, and the vertical axis represents the total variance in gene expression over all cells, i.e., the summation of variance over cells, over genes, and over clones (see also Methods). As specific examples, the data of three different individuals are indicated by different colors. The blue stripe indicates the typical range of observed developmental bottlenecks (average ± standard deviation), as obtained by comparing individuals belonging to blue triangle A in the phylogenetic tree in Fig 3, which exhibits clear hourglass-like developmental correlations. Here, the initial conditions are chosen by adding Gaussian distribution noise with a variance σ = 0.1. (The details of the computation scheme are presented in the Methods section).

Note that the strength of the noise in the initial condition facilitates the emergence of a developmental hourglass. In this evolutionary simulation, the fitness is set up such that multiple cell types are generated in the final stage under the variation in the initial state. The developmental process must reduce the initial variation to achieve robustness. Up to a certain developmental time, attenuation of the initial phenotypic variances by noise progresses. According to recent studies, this robustness to noise implies robustness to genetic changes [32]. Hence, while the variance within the same species is reduced during the developmental period, the variation in expression among genetically different species is also reduced (as long as they are not too far apart). This timing, in which the variance is minimized, agrees with the bottleneck in the hourglass (Fig 5). Furthermore, if we evolve the system by reducing the variance of the initial conditions (or noise), then the developmental hourglass is no longer observed (see S5 Fig).

In addition, we also computed the variance among individuals of the same network, due to continuously added noise throughout the developmental process. In this case, again, we observed that the variance due to the noise is minimal at around the bottleneck time.

Furthermore, to examine the universality of our result, we have carried out 21 runs for a set of evolution simulations of the present model, where different spatial patterns are observed after reaching the fittest state. Among 18 of 21, we have observed the similarity in the intermediate stage, i.e., the developmental hourglass. (In fact, the exceptional 3/21 cases in which a clear peak was not observed contain oscillatory gene expression in the earlier developmental stage, which masks the peak). Also, it should be noted that in the sample runs showing the developmental hourglass, common peaks are observed for any pairs of species diverted from the common ancestor, as far as we have examined.

In summary, our simulation reproduces the hourglass properties reported in real evolutionary developmental biology (evo-devo) data thus far, despite the simplification adopted in the model.

The developmental bottleneck and slow-gene control

Here, we note that the developmental similarity diagram (Fig 2) drastically changes in the middle developmental period, where the pattern itself changes in a discrete manner rather than continuously. Below, we explore how these drastic/discrete changes in development shape the developmental hourglasses.

Fig 6A illustrates examples of the changes in the expression of each gene in one individual over the entire space. This figure also confirms the stepwise change (Here, the gene expression remains at an intermediate level between on and off only if the input to it is close to the threshold for its expression. Switching between on and off is quite fast unless the gene’s timescale parameter γi is very small (see also Methods). Hence, for most genes, the expression changes are stepwise). Furthermore, there is a stage in which the expression of many genes exhibits discrete changes (switches). This stage coincides with the bottleneck time of the hourglass. Before the bottleneck time, gene expression exhibited little spatial heterogeneity as shown in Fig 6A. After the bottleneck, spatial subregions are generated in which specific genes are expressed and spatial heterogeneity is increased.

Fig 6. Control of the developmental schedule by slow genes.

Fig 6

(A) Examples of spatial gene expression dynamics. The gray vertical line in each figure is the timescale of the genes whose expression changes are slowest (see C and Methods). The left upper corner is the expression dynamics of the slowest gene. (B) Time series of expression levels of all genes at cell index 32 in A. Different colors correspond to the expression levels of different genes. (C) Time scale of expression changes of each gene, which is plotted in order as a function of its rank in magnitude. Each circle denotes the average value of the timescale of genes over cells. Error bars are the standard deviation. (For the computation of the timescale of genes, see the Methods section). (D) Correlation between the timescale of the slowest gene and the timing of the bottleneck of the detected developmental hourglass. Magenta circles represent data from the individuals acquired through evolutionary simulation, whereas green squares and orange triangles represent data obtained by adopting a parametric change to the slowest gene in the original (blue dots). Green squares represent the cases in which the timescale of the slowest genes was changed to five times faster, and the orange triangles represent those changed with ⅕ times slower (See also [37] and S6 Fig for the heterochronic shift induced by manipulation of the slow gene).

The time course of all gene expression levels for a given cell is shown in Fig 6B, which also demonstrates that the expression of various genes exhibits on/off switch changes around the bottleneck of the developmental hourglass stage. The expression levels of many initially expressed genes slowly decreased up to this bottleneck time, when fewer genes started to be expressed, depending on the spatial position. (The timing of the expression of specific gene sets in varies by individual, which causes a drastic increase in phenotypic variance in Fig 5).

The data also suggest the existence of a few genes whose expression levels change slower than those of the other genes. We computed the time scale of the expression changes of each gene (identical to the time for each gene that needs to change its expression from zero to fully expressed, or vice versa; see Methods for details) and plotted them in the order of their magnitude. As shown in Fig 6C, there are a few (four in the figure) genes whose expression levels change much more slowly than others. In the present paper, they are called ‘slow genes’. Such slow genes have also been observed in previous studies, which are key to relating the developmental timetable with evolutionary change, as they tend to control the expression of other genes whose expression levels change at a much faster time scale [34,37]. Furthermore, Fig 6B suggests that the time scale of the slow gene is of the same order as the bottleneck time. This is reasonable because, with the time scale when expression levels of slow genes change, the expression of many other genes that are controlled by the slow ones are also switched between on and off.

In Fig 6D, the correlation between the timescale of these slowly changing genes and the average bottleneck time of the developmental hourglass is plotted. These variables are linearly correlated. To confirm the linear relationship, we also created mutants in which the timescale parameter γi of slow genes was altered and examined if the bottleneck time changed accordingly. Even when such external change in the parameter was imposed, the common linear relationship between the timescale of slow genes and the bottleneck time was still observed (see Methods for the computation of the timescale of genes). This may suggest a possible causality from slow genes that control the developmental time schedule to the timing of the bottleneck of the developmental hourglass. Most individuals that shared a bottleneck in the developmental hourglass shared common slow genes. In contrast, the probability that two (distant) species that do not share the same developmental hourglass have common slow genes is much lower, and at a chance level, that is, the level for two randomly generated networks.

To unveil if and how a slow gene controls the developmental time schedule, we explored how the gene works in the GRN. In Fig 7, we plotted the essential network structure extracted by the procedure described in the Method section. As shown, the slow gene controls the on/off pattern of target genes which lie downstream of the slow genes. Here, in contrast to most gene expressions that saturate with a faster time scale (~ 10 time steps), the expression of the slow genes changes with a much slower time scale. Following its change, on/off expressions of other (including target) genes change quickly, as shown in the time series of Fig 7.

Fig 7. An example of evolved gene network and expression control by the slow gene.

Fig 7

Left: The essential structure of the gene network of the individual of Fig 2A-1. Nodes filled with color are marker genes to determine fitness. Each color corresponds to the code in Fig 2A, i.e., red, green, blue, and brightness (which is represented as gray). The node with the light blue circumference is the slow gene. The path from the slow gene to the gray marker gene is shown as thick arrows. Note that paths from the slow gene to other marker genes exist. The procedure to extract essential gene network structure is explained in the Methods section. Right: Example of gene expression control of the slow gene. Expression dynamics of the genes in the thick arrowed path on the left, namely 9 (top), 12 (middle), and 5 (bottom), are shown for the cellular index 35. In each of these subfigures, the horizontal dashed line indicates the gene expression threshold of the arrowhead gene, and the vertical dashed arrows indicate the developmental time when the gene expression dynamics and the arrowhead gene expression threshold crossed, which basically corresponds to the expression change in the arrowhead gene.

Discussion

In summary, our evolutionary simulation exhibited the developmental processes consistent with the developmental hourglass. When we compared species diversified from the common ancestor, we observed (i) amongst phylogenetically close species, the similarity in the gene expression patterns is maximal at the intermediate developmental period. Even though neither the initial population nor the fitness condition favors the developmental hourglass, it was spontaneously shaped through evolution. (ii) at this bottleneck period, the number of pleiotropic genes was at the maximum, and (iii) the individual variances in gene expression levels over clones were at the minimum. These three features agree quite well with those experimentally reported as the developmental hourglass. We then uncovered that, at that stage, expressions of many genes are switched on or off, driven by a few genes whose expressions change slowly (“slow genes”). The timescale of the expression dynamics of these slow genes agrees with that of the bottleneck period.

To gain robustness in the developed pattern, unavoidable variations that existed in the initial stage and during later developmental dynamics should be reduced through the developmental course. In our model, such reduction progresses up to the bottleneck stage, where the variation among individuals is minimized. In fact, when there is a sufficient level of variance in the initial condition, as adopted in our simulation, the robustness to the variation evolves, and in this case, the developmental hourglass is observed, whereas if the initial variance was set to be too small, the developmental hourglass was not observed as in S5 Fig.

In recent theoretical and experimental studies, the correlation between phenotypic variance by noise and that by genetic changes has been observed [32,38,39]. There, robustness to noise fosters the robustness to mutation, i.e., genetic assimilation of robustness was observed. By extending the argument of such a correlation to the developmental time course, one may expect that the reduction of the initial variance by noise leads to a reduction in variance across genetically altered individuals. This explains the coincidence between the time of minimal variance over clones and the developmental bottleneck over genetically different species, which was recently suggested [35,36]. The underlying mechanism for obtaining the robustness to noise in the initial state needs to be further elucidated, while the idea of the slow gene control found and discussed in this paper may provide a clue to uncover it.

At the bottleneck stage, the expression of slow genes sets in, and most genes change their expression levels. Following the expression of slow genes, other genes are diversified by species, leading to a developmental hourglass. The later developmental processes are contingent on the species. However, the mechanism driven by slow genes to absorb variations in the initial variation was preserved through evolution. Thus, the developmental hourglass is expected to be observed for species with the shared control mechanism mediated by slow genes.

In summary, the developmental hourglass obtained in our simulation results was caused by an attenuation of the variation in the initial state, accompanied by the expression of a few slow genes that control the developmental process. If these slow genes are conserved among species that diverged from the common ancestor, they share the common developmental bottleneck time determined by the slow genes.

If the two species diverted far earlier, the similarity peak would be diminished. This decrease might look gradual at first glance, but indeed, at a certain point, the peak disappears distinctively. This point roughly agrees with the point at which the commonality of slow genes is lost as the two species branched earlier. Here, the attenuation of initial variation should always be present, but the species separated earlier would use different gene sets for the slow process, and then the gene expression patterns of the two species would no longer show similarity in the intermediate developmental stage, and the similarity peak would no longer exist. In this case, the similarity can be maximal either at the very beginning or at the last developmental time (Fig 3). This result may correspond to the cross-phylum comparison of gene expression patterns, as reported previously [40] (whereas we note the criticism to it [41] as discussed below). It will be important to examine closely within which range the common hourglass behavior is observed.

The slow genes regulate the expression of many other genes, as was shown previously [34]; thus, with this timescale of slow genes, many genes change their expression levels, including pleiotropic genes. This leads to an increase in the fraction of expressed pleiotropic genes in the developmental bottleneck period, which is consistent with the present simulation and experimental data [11]. The phylogenetically conserved time scheduling by slow genes can explain the range of species in which the hourglass was observed, which might correspond to the range within the same phylum. In the present simple model, morphogenesis is determined only by the expression of genes; thus, the control of development is regulated solely by gene expression, whereas current experimental studies revealed that mechanical interactions and subsequent cell motions are important in the development of real multicellular organisms [42]. When such processes are included, the commonality of the slow control mechanisms (not restricted to gene expression dynamics) may explain the hourglass.

The idea of the developmental hourglass stems from the discovery of the conservation of specific gene expression patterns in animals [4]. Conceptually, the bottleneck of the developmental hourglass is thought to correspond to the stage at which morphological characters that define the phylum of the species arise, the so-called phylotypic stage [5,6].

After the finding of the developmental similarity was made, the reverse question was posed, i.e., whether a group of animals that share the developmental hourglass agrees with a common phylum [43]. This question remains unanswered, which will be important in uncovering common developmental mechanisms that provide a basis for the developmental hourglass. Our results may suggest that species that shape the developmental hourglass could be defined as those that share a common slow control mechanism to reduce noise in the early developmental period. Exploring such common slow mechanisms would be important, even though the species that form a developmental hourglass may not correspond to a phylum.

The study’s primary limitation is that the model employed in this simulation is extremely simplified in terms of the number of genes and spatial degrees of freedom compared to real organisms. In particular, only gene regulations and diffusive cellular interactions are considered. Other layers of development, such as the collective movement of cells under mechanical forces, reported as important in tissue morphogenesis and responsible for making shape, are not considered [42]. However, the synergy between gene expression dynamics and cell-to-cell diffusion is common in the development of real organisms [44] and when represented mathematically, such mechanical processes can also be represented by a combination of gene expression dynamics and diffusive interactions among cells [45]. The basic results obtained in this study will be valid under the inclusion of such processes.

Methods

Model

The model in this study considered the evolution of a finite-sized population of 100 individuals. An individual is defined as 128 cells allied in a one-dimensional space without cell proliferation during gene expression dynamics. In each generation, individuals exhibit specific gene expression patterns according to gene expression dynamics with cell-cell interactions provided by their own gene regulation networks. For the gene expression patterns of each individual, fitness is computed, and those with higher fitness can produce multiple offspring, and those with lower fitness cannot. Therefore, 100 descendants are generated in total. Evolutionary simulations were conducted by iterating the scheme. This section provides a detailed description of our proposed model, including the set of parameters adopted in the simulations (Table 1), for the reproduction of the results.

Table 1. List of the parameters adopted in the simulation.

Description Change in each generation by mutations
J i,j Gene network structure matrix Randomly chosen 4 (i, j) pairs are subject to the mutation with the probability 1/8 (for the change to 1 or -1) and 3/4 (for the change to 0).
γ i The timescale of the expression dynamics of gene i Gaussian noise with which the average is 0 and the standard deviation is 0.01 is added to the value.
θ i The expression threshold for gene i
D i The diffusion constant of gene i product
xiinit Initial value of gene i expression
description value
N The number of genes in each gene network 24
L The number of cells in each individual 128
β The nonlinearity of the reaction near the threshold 40
P The number of individuals in each population 100
t term The number of timesteps for development 1000
G term The number of generations for the evolutionary simulation 1000
σ The strength of the epigenetic perturbation on initial condition 0.1*

*This value is changed in SF4

Gene regulation networks

In this study, the total number of genes in gene networks was fixed at N = 24, and gene regulation networks were described as an N × N matrix Ji,j, where the interaction between gene j and gene i is given by its (i, j) element. Interaction was represented as a discrete value of 1 (activation), -1 (inhibition), or 0 (no interaction between the two genes).

Gene expression dynamics

A cell’s state is represented by the expression levels of N genes, that is, the number of expressed proteins. Here, let xi(l, t) represents the protein expression level of the i-th gene in the l-th cell at time t. Cells are aligned in a 1-dimensional space. Since this model does not employ cellular proliferation, cellular index r assigned to each cell is fixed throughout the simulation, which takes an integer value ranging from 1 to the total number of cells L (= 128). Proteins expressed from each gene either activate, inhibit, or do not influence the expression of other genes or their own expression. This protein level is als termed as gene expression level. We assumed that the change in the i-th protein expression level is given by the following discrete-space reaction-diffusion equation:

Δxi(l,t)Δt=γi(F(i,x(l,t))xi(l,t))+Di(xi(l+1,t)2xi(l,t)+xi(l1,t)) (1)

The first term of Eq (1), γi(F(i,x(l,t))xi(l,t)) denotes the change in protein expression by reactions within a cell. This term consists of two parts, i.e., F(i, x(l, t)) and −xi(l, t). The former represents the protein synthesis under regulations of other proteins and the latter represents the degradation of the protein. The expression of protein i is regulated by other proteins j. As are often adopted in simplified gene regulation dynamics models, F(i, x(l, t)) is a result of activation or suppression from other proteins. If the input from others is larger than given threshold, it is tended to be expressed, and otherwise not expressed. The specific form we adopted is given by the following equation.

F(i,x(l,t))=f(jJi,jxj(l,t)θi)=11+eβ(jJi,jxj(l,t)θi) (2)

where x(l, t) denotes the vector of the expression of each gene xi(l, t) (i = 1,2,..N).

In the above equation, Ji.j changes by generation k through evolution, and thus it has the index k, but putting the index k in Ji.j makes the expression too complicated, so it is omitted here. The function f(z)=1/(1+eβz) is similar to a step function, where the function approaches 1 as z is increased to the positive side and approaches 0 as z decreases to the negative side; in other words, in Eq (2), if the total sum of gene interactions jJi,jxj(l,t) is sufficiently larger than the expression threshold θi, then F(i, x(l, t))~1, which indicates that the gene is going to be fully expressed; if it is smaller than θi, then F(i, x(l, t))<1, which indicates that the gene expression is suppressed. We choose β, the sensitivity of the function f(z) at z = 0, as 40. In general, β is proportional to the Hill coefficient.

γi determines the rate of the degradation of the i-th protein, whereas the value of xi(l, t) was scaled such that the maximum level was equal to 1 [27,46,47].

The second term in Eq (1), Di(xi(l+1,t)2xi(l,t)+xi(l1,t)) is a discrete-space version of the diffusion, which denotes cell-to-cell interaction via diffusion where Di is the diffusion constant of the product of the gene i, that determines the rate.

The numerical timestep interval Δt is set to 0.1.

Generation and density of gene regulation networks

In this study, gene regulation networks were initiated using a semi-random generation process. For each element of the network matrix, interaction values of-1, 0, and 1 are assigned probabilities of 1/16, 7/8, and 1/16, respectively. The generated networks have network density, that is, the number of total connections divided by the total number of genes ~ 2, which is comparable to the real data [4850].

Initial/boundary conditions

Morphogen gradients, which were introduced as the expression of genes #0 and #1, were supplied externally. Fixed linear morphogens were induced from both sides for cellular use, such that x0(r,t)=x0(t)=(Ll)/(L1) and x1(l,t)=l1/(L1), where L is the total number of cells that is set to 128 in the simulations and l is the cellular index from 1 to L. These gradients are not only present in the initial state but are invariant in time. We also evaluated a case involving a gradient with an exponential dependence in space as elξ, but this condition did not alter the conclusions presented in this study.

For genes other than the input genes #0 and #1 (i.e. genes #2, #3,…,#N), the initial expression levels of each gene (at t = 0) were given spatially uniform. The initial value is chosen as

xi(l,0)=xiinit+δi

where xiinit is a value inherited from the ancestor slightly altered by mutations (as described later), whereas δi is the deviation due to the epigenetic perturbation, given by a random value from the Gaussian distribution with a mean 0 and the variance σ, independently generated for each gene, but taken uniformly over all cells. This noise term is due to environmental, behavioral, and other non-genetic effects, so that the initial condition is deviated from the inherited value xiinit. Except for the SF4, σ is set to 0.1.

Discrete Neumann boundary conditions were adopted at both ends for this study, that is xi(1) = xi(2) and xi(M−1) = xi(M) for i = 2,3,4,… .N.

Fitness function

We defined cell types according to the ON/OFF states of the designated four genes (genes #2 to #5 in the model). Specifically, if one gene’s expression is over 0.99 (with the value range [0,1]), the gene is ON, whereas if it is less than 0.01, the gene is OFF. The number of existing cell types, Nt, was computed by sampling over cells. The fitness function is set to become higher as more cell types coexist.

The model can maximally generate 24 = 16 cell types such that the fitness of individual k is defined as fk = Nt. Hence, the fitness is maximal if 16 different cellular states exist in cells aligned in a one-dimensional space.

With this setup, the expression level of each cell can settle to an intermediate value, between 0 and 1, and thus the expression levels of the target four genes can take intermediate values, between ON and OFF. However, since gene expression sensitivity β is set to 40, which corresponds to a quite high Hill coefficient, xi(l, t) of the four target genes takes either ~1 (ON) or ~0 OFF for the final, stationary state of the developmental dynamics. (The exceptions are the case when the input to the corresponding gene is close to the expression threshold θi (1β), and such case is not favored under the evolution of the present fitness, because the intermediate expression levels of the target genes just reduce the number of possible cell types. In fact, after evolutionary generations by the selection, such exceptions are hardly observed, except few cases with oscillatory gene expression localized at some cells)

Selection

Whether an individual in a population is selected for the next generation is determined probabilistically. The probability that one individual would be selected is given by its relative fitness to others, i.e., efi/jefj, for all j, where i and j are individual indexes. We adopt this exponential form to amplify the selection by fitness, i.e., to amplify the advantage of having one more cell type than other individuals (By simply adopting fi/∑jfj as the probability, the advantage of having one more cell type over other individuals was too small and the individual with one more cell type was hardly selected.

In fact, such exponential form is often adopted in genetic algorithm or numerical evolution.

Mutation

During the replication process, the gene network structure and parameters in the reaction-diffusion equation were slightly changed by the random mutation. Here is the list of evolvable parameters and their degrees of change during one replication.

Population cloning

Each evolutionary simulation was executed for up to 1000 generations, and we sampled cases in which the fitness of the fittest individual at the 1000th generation was above 15 (the maximum was 16). For these fitted individuals, we further cloned the ancestral populations. We selected ancestral populations from each 100th generation and evolved them up to 1000th generation. The parameters and mutational setups for the evolutionary simulations were identical to those of the original simulation.

Individuals obtained by these evolutionary simulations of the cloned population represent offspring that diverged from the same pedigree with different branching points.

Calculation of the similarity indicator of gene expression patterns

The similarity of gene expression patterns between two individuals was calculated using two indicators: Pearson’s correlation coefficient and the inverse of the total gene expression difference. The definitions of the two indicators are as follows:

  1. the inverse of the total gene expression difference
    Δ(t1,t2)=1ir|xi1(l,t1)xi2(l,t2)|
    where xik(r,t) indicates gene i’s expression level at cell l, time t, of the individual k.
  2. Pearson’s correlation coefficient

    For each developmental time t1, t2, the correlation coefficient ρ(t1, t2) is given as
    ρ(t1,t2)=cov(X1(t1),X2(t2))σX1(t1)σX2(t2)
    where Xk(t) is the vector in which elements Xik(t) are the summation of gene i’s expression over space in individual k at a given time t, that is, Xik(t)=rxik(r,t),σXk(t) is the standard deviation of Xk at time t, and cov denotes the covariance.

Identifying pleiotropic genes

Following the experimental report in ref [11], pleiotropic genes in this paper refer to the genes expressed in multiple developmental stages. The numerical procedure to identifying them is as follows:

First, we defined developmental time points in which gene expressions are compared. In ref [11], these time points are 16 embryonic stages determined by staging table of development. We defined stage time points as arithmetically or logarithmically even intervals of 16 time points through [10,1000].

For each developmental stage time point sj, the average expression in space was calculated for each gene.

xi(sj)¯=1Ll=0L1xi(l,sj)

where xi(sj)¯ is the average expression of gene i over space at stage sj.

Then we set a threshold to determine if a gene is expressed or not, which is supposed to corresponds to the detection limit in experiments so that we set it small as 0.1.

If xi(sj)¯ is larger than the above threshold 0.1, for more than the half of the number of total stages (i.e., 8 = 16/2), the gene is determined as a pleiotropic gene.

The ratio of the pleiotropic genes plotted in Fig 4 was defined for each stage time point as the total number of expressed pleiotropic genes divided by the total number of expressed genes. Even if the number of stage time points or the threshold value was changed, the result presented does not change, as long as the change was not too large.

Calculation of variance in the pattern dynamics over clones

For a given individual, i.e., a set of GRN and parameters, N genetic clones are made while adding Gaussian noise to the original initial gene expression state of the development. The average and the standard deviation of the noise are 0 and 0.01, respectively. The developmental dynamics of those clones are calculated according to the same equation for gene expression dynamics. Then, the variance in the pattern dynamics over clones, which indicates how the initial difference propagates during development is calculated as

var(t)=1Nkli(xik(l,t)xi(l,t)¯)2

Where Var(t) indicates the developmental variance over clones at time t, xik(l,t) is the gene i’s expression in an individual k, at cell l and time t. xi(l,t)¯ is the average of xi(r, t) over clones, i.e.,

xi(l,t)¯=1Nkxik(l,t)

Estimation of the timescale of slow gene

To estimate the timescale of genes, first, we defined a unit of time step. The unit is set to be small enough so that the change within the unit is small. In this study, we adopted the unit as the interval for the increment of time during the numerical simulation of gene expression dynamics. Similar calculations could be performed in actual experimental systems if the interval was chosen sufficiently small compared to the time needed for gene expression to saturate.

By using this unit, we can estimate the partial derivative of gene expressions as xi(r,t)t|t=tkxi(r,tk)xi(r,tk1)Δt, where xi(r, t) is the gene expression level of gene i at location r, time t. Δt is the timestep and tk indicates the time after the k timesteps, i.e., tk = kΔt (k = 1,2,…).

The timescale of genes can be estimated by counting timesteps according to the following procedure.

Count up k by 1 if the following two conditions are not violated.

  1. The sign of the derivative at t = tk is the same as that of the time at tk-1. i.e., xi(r,tk)xi(r,tk1)Δtxi(r,tk+1)xi(r,tk)Δt>0

  2. The absolute value of the derivative at time tk is larger than a threshold; in this study we adopted the threshold as 0.001.

The condition 1 is for discarding oscillatory behavior and the condition 2 is for discarding time steps after saturation.

Extraction of essential gene network structure

In our model, the gene network is identical for each individual and over cells. However, due to the heterogeneity of input morphogen gradients, different genes are activated, and different parts of the whole network are activated for different cells (leading to effectively different gene expressions dynamics by cells). Therefore, although the whole gene network contains all the information needed to regulate the dynamics, it is unsuitable for analyzing the gene regulations responsible for a cell. The whole network has too many ’decoy’ edges that may work in other cells, but do nothing in the cell under consideration. We extracted an essential network from the whole network using the following method to identify which regulations are essential in a given cell.

  • 1. Trace the dynamics of the inputs to the marker genes, i.e., jJi,jxj(t)θi(fori=2,3,4,5) in that cell and detect the periods during at least one of the input is in the range of [−2/β: 2/β], which is the dynamic range for the gene expression dynamics and correspond to the timing of gene expression changes of marker (see Eqs (1) and (2)).

  • 2. Trace the expression dynamics of genes. If a gene is not expressed at all throughout the periods, the regulation edges from and to the gene are not responsible for the expression change in marker genes. Hence, edges from/to such genes are eliminated from the whole network. Also, if the expression level of gene stays constant throughout the periods, regulation edges from that gene just contribute to increase/decrease of the expression threshold of the arrowhead gene. Thus, edges from such genes are eliminated from the whole network.

  • 3. For each remaining edge, check whether input from the root gene’s expression contributes to the expression of the arrowhead gene. If the contribution, defined and computed below, is larger than a certain threshold value, then the edge is assumed to be responsible for the change. Otherwise, the edge is eliminated from the whole network.

    Here the contribution is estimated as follows. For a given cell, let Δxk,l be the expression change of gene k during the l-th period. Then the contribution of the edge ji for the gene expression change in the l-th period is defined as
    contribution:Ji.jΔxk,lkJi.kΔxk,l

    If the contribution is smaller than a threshold, which was set to 0.01, delete the edge for input from the edge does not contribute to the total change in the input.

  • 5. Last, delete all the edges on genes that have a route neither from morphogens nor to the marker genes.

Supporting information

S1 Fig. Developmental similarity of the examples in the main Fig 1 measured by different similarity indicators.

The similarity between the developmental dynamics of the two individuals shown in Fig 1A is represented as a heat map. The axes are the same as in Fig 1B, and the similarity is measured by Pearson’s coefficient on the left and Spearman’s rank coefficient on the right.

(TIF)

pcbi.1011867.s001.tif (1.6MB, tif)
S2 Fig. Additional examples that show developmental-hourglass-like similarity profile.

Four additional examples that show developmental hourglass are represented with developmental similarity. (Left) Two examples of gene expression pattern dynamics of evolved GRNs. The horizontal axis represents the developmental time, and the vertical axis represents the cellular index, i.e., space. (Right) The dependence of similarity is plotted against the developmental time, for species with different phylogenetic distance. The horizontal axis represents the developmental time, and the vertical axis represents the average of the similarity between the spatial patterns of gene expressions, in the same way as in Fig 2. The similarity is computed as in Fig 1C and averaged over pairs of individuals. The line colour denotes the phylogenetic distance of compared species departed from 900, 800,…, 100, 0th generations. The similarity calculated among the species that diverged every 100 generations, from the 900th to the 0th generation is plotted.

(TIF)

pcbi.1011867.s002.tif (4.8MB, tif)
S3 Fig. The time course of the similarity indicator and its dependence on the phylogenetic distance.

Similarities among the species that diverged from a common ancestor are plotted, using the data from the same simulation run as in Fig 2, but plotted with phylogenic distance per 100 generations. Axes are the same as in Fig 2(left). Each curve represents the similarity among species that have different length of shared generations in the evolutionary path. The top blue curve represents the similarity of species that share 900 generations out of total 1000 generations, and with each 100 generations decrease of the length of shared generations, results are plotted as different colours, from blue to green (see also Figs 1 and 2 for schematic representation).

(TIF)

S4 Fig. Similarity between an evolved species and its ancestors.

Similarities are plotted for the individual of Fig 2A1 with each 100-generation distant ancestor from the 900th to 0th generation. The horizontal axis is the developmental time of the evolved individual, and the vertical axis is the similarity indicator. The similarity is plotted in the same manner as in Fig 2C. The intermediate similarity peak is conserved from the 700th generation ancestor (i.e., 300 generations distant ancestor), which roughly corresponds with the slow gene control acquisition generation.

(TIF)

pcbi.1011867.s004.tif (415.2KB, tif)
S5 Fig. Examples of the similarity profiles with smaller noise levels for the initial condition.

The relationship between phylogenetic distance and the value of the similarity indicator is represented as the same as Fig 2, SF2, and SF3 but with different noise levels on the initial condition. (A) An example of the case with the noise level on the initial condition σ = 0.01, which is 0.1 in the other figures. (B) An example of the case with the noise level on the initial condition σ = 0.001.

(TIF)

pcbi.1011867.s005.tif (1.1MB, tif)
S6 Fig. Slow gene control and heterochrony.

Subfigures A)–C) display pattern formation dynamics of the same gene network except for the change in the time-scale parameter of the slowest gene. A: Pattern formation dynamics originally obtained from evolutionary simulation. The value of the timescale parameter of the slowest gene is c5.17×10−2. B: Pattern formation dynamics with a larger timescale of the slowest gene; 10.3×10−2, 2 times larger than that of A. C: Pattern formation dynamics with a smaller timescale of the slowest gene; 2.58×10−2, 0.5 times smaller than that of A.

(TIF)

pcbi.1011867.s006.tif (3.8MB, tif)

Acknowledgments

The authors would thank Masahiro Uesaka and Naoki Irie for their stimulating discussions and for critical reading of the manuscript.

Data Availability

All programs that are needed to reproduce the results in this manuscript are available on Zenodo at https://doi.org/10.5281/zenodo.8049347

Funding Statement

SK and KK are supported by a Grant-in-Aid for Scientific Research on Innovative Areas (17H06386) from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan. https://kaken.nii.ac.jp/ja/grant/KAKENHI-PLANNED-17H06386/ " KK is also supported by a Grant-in-Aid for Scientific Research (A)(20H00123) from the Ministry of Education, Culture, Sports, Science and Technology (MEXT) of Japan. https://kaken.nii.ac.jp/ja/grant/KAKENHI-PROJECT-20H00123/ and Novo Nordisk Foundation (0065542). https://novonordiskfonden.dk/en/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.von Baer KE. Über entwickelungsgeschichte der thiere. Beobachtung und Reflexion, 1, Königsberg: Bei den Gebrüdern Bornträger; 1828. 10.5962/bhl.title.6303 [DOI] [Google Scholar]
  • 2.Darwin C. On the Origin of Species by Means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life. London: John Murray; 1859. [PMC free article] [PubMed] [Google Scholar]
  • 3.Haeckel E. Generelle morphologie der organismen. Allgemeine grundzüge der organischen formen-wissenschaft, mechanisch begründet durch die von Charles Darwin reformirte descendenztheorie. Berlin: G. Reimer; 1866. 10.5962/bhl.title.3953 [DOI] [Google Scholar]
  • 4.Slack JM, Holland PW, Graham CF. The zootype and the phylotypic stage. Nature. 1993. Feb 11;361(6412):490–2. doi: 10.1038/361490a0 . [DOI] [PubMed] [Google Scholar]
  • 5.Duboule D. Temporal colinearity and the phylotypic progression: a basis for the stability of a vertebrate Bauplan and the evolution of morphologies through heterochrony. Dev Suppl. 1994:135–42. . [PubMed] [Google Scholar]
  • 6.Rudolf RA. The Shape of Life: Genes, Development, and the Evolution of Animal Form. Chicago: University of Chicago Press; 1996. [Google Scholar]
  • 7.Hazkani-Covo E, Wool D, Graur D. In search of the vertebrate phylotypic stage: a molecular examination of the developmental hourglass model and von Baer’s third law. J Exp Zool B Mol Dev Evol. 2005. Mar 15;304(2):150–8. doi: 10.1002/jez.b.21033 Erratum in: J Exp Zoolog B Mol Dev Evol. 2005 May 15;304(3):268. . [DOI] [PubMed] [Google Scholar]
  • 8.Irie N, Sehara-Fujisawa A. The vertebrate phylotypic stage and an early bilaterian-related stage in mouse embryogenesis defined by genomic information. BMC Biol. 2007. Jan 12;5:1. doi: 10.1186/1741-7007-5-1 ; PMCID: PMC1797197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Irie N, Kuratani S. Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis. Nat Commun. 2011;2:248. doi: 10.1038/ncomms1248 Erratum in: Nat Commun. 2014;5:3366. ; PMCID: PMC3109953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wang Z, Pascual-Anaya J, Zadissa A, Li W, Niimura Y, Huang Z, et al. The draft genomes of soft-shell turtle and green sea turtle yield insights into the development and evolution of the turtle-specific body plan. Nat Genet. 2013. Jun;45(6):701–706. doi: 10.1038/ng.2615 Epub 2013 Apr 28. Erratum in: Nat Genet. 2014 Jun;46(6):657. ; PMCID: PMC4000948. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hu H, Uesaka M, Guo S, Shimai K, Lu TM, Li F, et al. Constrained vertebrate evolution by pleiotropic genes. Nat Ecol Evol. 2017. Nov;1(11):1722–1730. doi: 10.1038/s41559-017-0318-0 Epub 2017 Sep 25. . [DOI] [PubMed] [Google Scholar]
  • 12.Marlétaz F, Firbas PN, Maeso I, Tena JJ, Bogdanovic O, Perry M, et al. Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature. 2018. Dec;564(7734):64–70. doi: 10.1038/s41586-018-0734-6 Epub 2018 Nov 21. ; PMCID: PMC6292497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Malik A, Gildor T, Sher N, Layous M, Ben-Tabou de-Leon S. Parallel embryonic transcriptional programs evolve under distinct constraints and may enable morphological conservation amidst adaptation. Dev Biol. 2017. Oct 1;430(1):202–213. doi: 10.1016/j.ydbio.2017.07.019 Epub 2017 Aug 2. . [DOI] [PubMed] [Google Scholar]
  • 14.Gildor T, Cary GA, Lalzar M, Hinman VF, Ben-Tabou de-Leon S. Developmental transcriptomes of the sea star, Patiria miniata, illuminate how gene expression changes with evolutionary distance. Sci Rep. 2019. Nov 7;9(1):16201. doi: 10.1038/s41598-019-52577-9 ; PMCID: PMC6838185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Li Y, Omori A, Flores RL, Satterfield S, Nguyen C, Ota T, et al. Genomic insights of body plan transitions from bilateral to pentameral symmetry in Echinoderms. Commun Biol. 2020. Jul 10;3(1):371. doi: 10.1038/s42003-020-1091-1 Erratum in: Commun Biol. 2021 Apr 6;4(1):459. ; PMCID: PMC7351957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Domazet-Lošo T, Tautz D. A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature. 2010. Dec 9;468(7325):815–8. doi: 10.1038/nature09632 . [DOI] [PubMed] [Google Scholar]
  • 17.Kalinka AT, Varga KM, Gerrard DT, Preibisch S, Corcoran DL, Jarrells J, Ohler U, Bergman CM, Tomancak P. Gene expression divergence recapitulates the developmental hourglass model. Nature. 2010. Dec 9;468(7325):811–4. doi: 10.1038/nature09634 . [DOI] [PubMed] [Google Scholar]
  • 18.Levin M, Hashimshony T, Wagner F, Yanai I. Developmental milestones punctuate gene expression in the Caenorhabditis embryo. Dev Cell. 2012. May 15;22(5):1101–8. doi: 10.1016/j.devcel.2012.04.004 Epub 2012 May 3. . [DOI] [PubMed] [Google Scholar]
  • 19.Schiffer PH, Polsky AL, Cole AG, Camps JIR, Kroiher M, Silver DH, et al. The gene regulatory program of Acrobeloides nanus reveals conservation of phylum-specific expression. Proc Natl Acad Sci U S A. 2018. Apr 24;115(17):4459–4464. doi: 10.1073/pnas.1720817115 Epub 2018 Apr 6. ; PMCID: PMC5924915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Xu F, Domazet-Lošo T, Fan D, Dunwell TL, Li L, Fang X, Zhang G. High expression of new genes in trochophore enlightening the ontogeny and evolution of trochozoans. Sci Rep. 2016. Oct 4;6:34664. doi: 10.1038/srep34664 ; PMCID: PMC5048140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Quint M, Drost HG, Gabel A, Ullrich KK, Bönn M, Grosse I. A transcriptomic hourglass in plant embryogenesis. Nature. 2012. Oct 4;490(7418):98–101. doi: 10.1038/nature11394 Epub 2012 Sep 5. . [DOI] [PubMed] [Google Scholar]
  • 22.Cheng X, Hui JH, Lee YY, Wan Law PT, Kwan HS. A "developmental hourglass" in fungi. Mol Biol Evol. 2015. Jun;32(6):1556–66. doi: 10.1093/molbev/msv047 Epub 2015 Feb 27. . [DOI] [PubMed] [Google Scholar]
  • 23.Zalts H, Yanai I. Developmental constraints shape the evolution of the nematode mid-developmental transition. Nat Ecol Evol. 2017. Mar 27;1(5):113. doi: 10.1038/s41559-017-0113 . [DOI] [PubMed] [Google Scholar]
  • 24.Uchida Y, Uesaka M, Yamamoto T, Takeda H, Irie N. Embryonic lethality is not sufficient to explain hourglass-like conservation of vertebrate embryos. Evodevo. 2018. Mar 16;9:7. doi: 10.1186/s13227-018-0095-0 ; PMCID: PMC5855935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schmidt K, Starck JM. Testing evolutionary hypotheses about the phylotypic period of zebrafish. J Exp Zool B Mol Dev Evol. 2011. Jul 15;316(5):319–29. doi: 10.1002/jez.b.21404 Epub 2011 Feb 15. . [DOI] [PubMed] [Google Scholar]
  • 26.Newman SA. Animal egg as evolutionary innovation: a solution to the "embryonic hourglass" puzzle. J Exp Zool B Mol Dev Evol. 2011. Nov 15;316(7):467–83. doi: 10.1002/jez.b.21417 Epub 2011 May 9. . [DOI] [PubMed] [Google Scholar]
  • 27.Salazar-Ciudad I, Newman SA, Solé RV. Phenotypic and dynamical transitions in model genetic networks. I. Emergence of patterns and genotype-phenotype relationships. Evol Dev. 2001. Mar-Apr;3(2):84–94. doi: 10.1046/j.1525-142x.2001.003002084.x . [DOI] [PubMed] [Google Scholar]
  • 28.Salazar-Ciudad I, Solé RV, Newman SA. Phenotypic and dynamical transitions in model genetic networks. II. Application to the evolution of segmentation mechanisms. Evol Dev. 2001. Mar-Apr;3(2):95–103. doi: 10.1046/j.1525-142x.2001.003002095.x Erratum in: Evol Dev 2001 Sep-Oct;3(5):371. . [DOI] [PubMed] [Google Scholar]
  • 29.François P, Hakim V, Siggia ED. Deriving structure from evolution: metazoan segmentation. Mol Syst Biol. 2007;3:154. doi: 10.1038/msb4100192 Epub 2007 Dec 18. ; PMCID: PMC2174625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Fujimoto K, Ishihara S, Kaneko K. Network evolution of body plans. PLoS One. 2008. Jul 23;3(7):e2772. doi: 10.1371/journal.pone.0002772 ; PMCID: PMC2464711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ten Tusscher KH, Hogeweg P. Evolution of networks for body plan patterning; interplay of modularity, robustness and evolvability. PLoS Comput Biol. 2011. Oct;7(10):e1002208. doi: 10.1371/journal.pcbi.1002208 Epub 2011 Oct 6. ; PMCID: PMC3188509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kaneko K. Evolution of robustness to noise and mutation in gene expression dynamics. PLoS One. 2007. May 9;2(5):e434. doi: 10.1371/journal.pone.0000434 ; PMCID: PMC1855988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ancel LW, Fontana W. Plasticity, evolvability, and modularity in RNA. J Exp Zool. 2000. Oct 15;288(3):242–83. doi: . [DOI] [PubMed] [Google Scholar]
  • 34.Kohsokabe T, Kaneko K. Evolution-development congruence in pattern formation dynamics: Bifurcations in gene expression and regulation of networks structures. J Exp Zool B Mol Dev Evol. 2016. Jan;326(1):61–84. doi: 10.1002/jez.b.22666 ; PMCID: PMC5064737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Liu J, Frochaux M, Gardeux V, Deplancke B, Robinson-Rechavi M. Inter-embryo gene expression variability recapitulates the hourglass pattern of evo-devo. BMC Biol. 2020. Sep 19;18(1):129. doi: 10.1186/s12915-020-00842-z ; PMCID: PMC7502200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Uchida Y, Shigenobu S, Takeda H, Furusawa C, Irie N. Potential contribution of intrinsic developmental stability toward body plan conservation. BMC Biol. 2022. Apr 11;20(1):82. doi: 10.1186/s12915-022-01276-5 ; PMCID: PMC8996622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kohsokabe T, Kaneko K. Dynamical systems approach to evolution-development congruence: Revisiting Haeckel’s recapitulation theory. J Exp Zool B Mol Dev Evol. 2022. Jan;338(1–2):62–75. doi: 10.1002/jez.b.23031 Epub 2021 Feb 18. ; PMCID: PMC9291011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Landry CR, Lemos B, Rifkin SA, Dickinson WJ, Hartl DL. Genetic properties influencing the evolvability of gene expression. Science. 2007. Jul 6;317(5834):118–21. doi: 10.1126/science.1140247 Epub 2007 May 24. . [DOI] [PubMed] [Google Scholar]
  • 39.Tang QY, Kaneko K. Dynamics-Evolution Correspondence in Protein Structures. Phys Rev Lett. 2021. Aug 27;127(9):098103. doi: 10.1103/PhysRevLett.127.098103 . [DOI] [PubMed] [Google Scholar]
  • 40.Levin M, Anavy L, Cole AG, Winter E, Mostov N, Khair S, et al. The mid-developmental transition and the evolution of animal body plans. Nature. 2016. Mar 31;531(7596):637–641. doi: 10.1038/nature16994 Epub 2016 Feb 17. Erratum in: Nature. 2019 Nov;575(7782):E3. ; PMCID: PMC4817236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hejnol A, Dunn CW. Animal Evolution: Are Phyla Real? Curr Biol. 2016. May 23;26(10):R424–6. doi: 10.1016/j.cub.2016.03.058 . [DOI] [PubMed] [Google Scholar]
  • 42.Heisenberg CP, Bellaïche Y. Forces in tissue morphogenesis and patterning. Cell. 2013. May 23;153(5):948–62. doi: 10.1016/j.cell.2013.05.008 . [DOI] [PubMed] [Google Scholar]
  • 43.Irie N, Kuratani S. The developmental hourglass model: a predictor of the basic body plan? Development. 2014. Dec;141(24):4649–55. doi: 10.1242/dev.107318 . [DOI] [PubMed] [Google Scholar]
  • 44.Green JB, Sharpe J. Positional information and reaction-diffusion: two big ideas in developmental biology combine. Development. 2015. Apr 1;142(7):1203–11. doi: 10.1242/dev.114991 . [DOI] [PubMed] [Google Scholar]
  • 45.Murray JD. Mathematical Biology II Spatial Models and Biomedical Applications. New York: Springer; 2003. 10.1007/b98869 [DOI] [Google Scholar]
  • 46.Glass L, Kauffman SA. The logical analysis of continuous, non-linear biochemical control networks. J Theor Biol. 1973. Apr;39(1):103–29. doi: 10.1016/0022-5193(73)90208-7 . [DOI] [PubMed] [Google Scholar]
  • 47.Mjolsness E, Sharp DH, Reinitz J. A connectionist model of development. J Theor Biol. 1991. Oct 21;152(4):429–53. doi: 10.1016/s0022-5193(05)80391-1 . [DOI] [PubMed] [Google Scholar]
  • 48.Serov VN, Spirov AV, Samsonova MG. Graphical interface to the genetic network database GeNet. Bioinformatics. 1998;14(6):546–7. doi: 10.1093/bioinformatics/14.6.546 . [DOI] [PubMed] [Google Scholar]
  • 49.Shen-Orr SS, Milo R, Mangan S, Alon U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat Genet. 2002. May;31(1):64–8. doi: 10.1038/ng881 Epub 2002 Apr 22. . [DOI] [PubMed] [Google Scholar]
  • 50.Rosenfeld N, Alon U. Response delays and the structure of transcription networks. J Mol Biol. 2003. Jun 13;329(4):645–54. doi: 10.1016/s0022-2836(03)00506-0 . [DOI] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011867.r001

Decision Letter 0

Philip K Maini, James O'Dwyer

13 Aug 2023

Dear Prof. Kaneko,

Thank you very much for submitting your manuscript "Developmental hourglass: Confirmation by numerical evolution and elucidation by dynamical-systems theory" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Philip K Maini

Academic Editor

PLOS Computational Biology

James O'Dwyer

Section Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Kohsokabe et al. present an unusual approach to provide some mechanistic insight into the hourglass model of morphological evolution through simulation. They present an admittedly simple simulation framework that implements simple gene regulatory networks in 1D organisms with cell intrinsic and cell signalling effects. They simulate their ontogeny and evolve them over 1000 generations. Species are periodically split off from the simulated organisms to provide comparative data. Noisy initial conditions and mutations in GRN matrices at species transitions are implemented. Cell type diversity (under simplified, but strict definition) is used for selection of individuals between generation. They then develop a framework to compare spatial gene expression patterns that evolve in this simulated system across developmental time. This provides the ingredients for testing the predictions of the hourglass model. They find that indeed, in this system, mid stages of the developmental trajectory are more conserved between species compared to earlier and later stages, mid stages are enriched for pleiotropic genes (under the definition of the toy model) and show less divergence within species evolved from the same starting conditions (with noise). All this seems to mimic the general predictions of the hourglass model. Detailed analysis of the evolving GRNs indicate that the pattern could be driven by relatively few genes that change their expression slower relative to other genes. This is interpreted in the context of the theoretical work previously published by the authors on the subject.

The assumptions of the simulation seem reasonable. It is fairly difficult to understand the Figure 2 visualisation of the two species and how they arose, however, once the distinction between ontogeny (developmental time, plotted) and phylogeny (the number of generations that led to this individual and potential split along the path) sinks in, the paper is relatively easy to follow. The hourglass pattern is much stronger for more related species and all but disappears when the bifurcation appears early in the simulation. This is not necessarily the prediction of the hourglass model. The authors should discuss this. Also, given the obvious distance between this simple toy model and real evolving organisms, the authors should think about other possible, perhaps more trivial reasons for why the mid-simulation pattern convergence occurs. The gradual decrease in gene expression noise in the system is intriguing. The sudden rise of complexity in the system after that would require more thorough investigation. The authors make the connection with slow genes. However, this idea is not sufficiently developed. Would there be a way to investigate why does this dynamics coupling occur in this system? And if yes, could there be drawn some analogy with GRNs in real biological systems?

Overall, this is an unorthodox view of the hourglass model. Understanding the toy model and looking at its behaviour could be of interest to scientists who are in general struggling with the meaning, significance and mechanisms of this peculiar embryological observation.

Minor comments: The paper contains several typos. Figure 1: developmetnal time 2x, Figure 4 is not referred to at all, the relevant call out goes to Figure 3 again. Fonts in Figures are very diversified, different fonts and sizes.

Reviewer #2: Phenotypic evolution has been driven not only external factors such as environmental conditions, but also by internal ones, i.e., developmental processes. One of the key interests in evolutionary developmental biology lies in deciphering the evolutionary constraints and principles imposed by embryogenesis. The paper under review tackles this intricate issue by utilizing evolutionary simulations, a noteworthy methodological choice that distinguishes this study within the evo-devo field. This approach allows the authors to gain a new valuable insight into a problem that has been traditionally addressed by speculation based on observations of extant species and fossil records.

However, the paper presents several significant concerns. The most prominent point is the ambiguity in the presentation of the methods, data interpretation, and the validation of their arguments. These shortcomings necessitate a major revision of the manuscript. The severity of these issues is compounded by errors in crucial equations explaining the model. Coupled with the lack of explanation for some variables and functions, the paper in its current state poses challenges for precise interpretation. The paper would be greatly improved by revising its data presentation and argument validation, which ensure credible conclusion.

<comments methods="" on="" section="" the="">

For “Model”:

The manuscript currently lacks information for cell count and cell proliferation. It will help understanding if the authors incorporate explanations about these factors.

For “Gene expression dynamics”:

1. Equation (1) appears to be missing a ")" which compromises its correctness.

2. To enhance comprehension of the model, the authors should provide precise information about several key elements.

2-1. Equation (1) seems to describe changes in protein expression, but it remains unclear how generation is accounted for.

2-2. D_i is assumed to be a value related to diffusion, but without explanation, its exact role and method of determination remain ambiguous. This lack of clarity extends to the setting of cell-cell interactions.

2-3. The functions f(x) and F(i,x(r,t)) need proper descriptions, as their roles are currently unexplained.

2-4. The correspondence between "as x is increased" and "as x_i (l,t) decreases" needs to be clarified.

For “Initial/boundary conditions”:

1. Explanations are required to better understand this part of the manuscript

1-1. It is assumed that M represents cell numbers, but without explicit clarification, this remains uncertain.

1-2. The authors claim that gene expression levels are "spatially uniform" for genes other than gene #0 and 1. It is unclear whether this signifies identical gene expression levels across all cells.

1-3. The inheritance of gene expression levels from ancestors needs further explanation. Does it imply that the clone inherits the initial state used by the progenitor cell?

For “Selection”:

More information should be given about the variable e^(f_i ) to clarify its role and purpose.

For “Mutation”:

There appears to be no list detailing the evolvable parameters and their potential changes. Provision of this information would assist readers in comprehending the model.

For “Calculation of the similarity indicator of gene expression patterns”:

The variable σ seems to be absent in the final sentence.

For “Calibration of the timescale of slow gene”:

1. There is ambiguity on the definition of terms and the purpose of calculations, which should be resolved.

1-1. The concept of calibrating the timescale needs an explicit explanation. The authors should provide a detailed definition of timescale, clarify the purpose of its calibration, and justify the employed manipulations.

1-2. The relationship between the introduced value k and the term "timescale" remains obscure.

<comments on="" results="" section="" the="">

For paragraph 1:

1. The term "fixed terminal state" requires explicit definition to avoid ambiguity. The Methods section mentions that fitness is determined by the number of final differentiation states, which led me to an interpretation of no specific target expression pattern. clear definitions are necessary to grasp the simulation setup accurately, namely whether this understanding is incorrect (there is some target pattern), or "fixed state" refers to a "steady state".

2. Gene expression patterns appear to be changing even at the 1000th timestep, as inferred from Fig 2A. The reason behind the choice of ending "embryogenesis" at the 1000th step requires clarification.

For paragraph 2:

1. The legend for Fig. 2A states that "Spatial pattern transitions were more frequent in the early developmental time." However, spatial differentiation seems minimal until around the 100th step. It would be beneficial if the term "spatial pattern transition" is clearly defined, quantified, and compared across developmental periods.

For paragraph 3:

1. The statement "there are some phases within which patterns change only slightly" needs clarification whether it refers to subtle changes in ontogeny or phylogeny.

2. In the legend of Fig. 2B, the "middle" period is suggested as the stage of highest similarity among lineages. If it refers to the 100th step out of 1000, it seems to fall within the "early" period, thus requiring a precise definition of the "middle" period. While I comprehend that the term "middle" does not necessarily imply a literal midpoint, and the primary focus here is the hourglass-like pattern, the current manuscript would lead to confusion for readers.

3. The previous observation of the phylotypic period suggests that the conservation of this period is persistent, regardless of the evolutionary scale. Thus, if the authors claim to recapitulate an hourglass pattern in their simulation, persistent conservation at a specific period must be demonstrated.

4. Regarding Fig S1, there seems to be a mislabel on the y-axis.

5. For Fig S2, details on the number of evolutionary trials conducted, and on the probability of the hourglass-like evolutionary outcomes would enhance the paper's value.

For paragraph 4:

1. The authors claim that the “middle” period was consistently conserved between two species that diverged at the 800th generation based on one example. This assertion would be greatly strengthened by the addition of statistical data, such as the distribution of the most conserved period. This information would be particularly beneficial when discussing whether the “middle” conservation observed in this simulation persists like the phylotypic period.

For paragraph 6:

1. In Fig. 3, the authors compared cellular expression patterns between lineages representing extant species. It would be helpful to understand the similarity of developmental patterns between ancestral species (e.g., species of the 400th generation that diverged at Branch C). If the mid-conserved period in this simulation indeed represents the phylotypic period, it is expected that ancestral species also exhibit mid-phase conservation. Comparing ancestral species would also shed light on the process of establishing constraints on the conserved period.

2. The findings from Fig S3 reveal diminished conservation of the mid-phase between “distantly” related species (inculuding those 300 generations apart within the 1000-generation simulation). This might indicate that the simulation's scope extends beyond the within-phyla evolutionary scale, which is traditionally where the hourglass model is applicable. The authors should address the appropriateness of the simulation's scale.

3. The observed pattern that developmental similarity peaks at the final developmental stage among species separated by over 300 generations may imply developmental drift, as opposed to developmental constraints. The authors are recommended to share their interpretation of this observation.

For paragraph 7:

1. I encourage the authors to clarify how many evolutionary trials were executed, and within those, how frequently the results exhibited an hourglass-like pattern, and how often they demonstrated enrichment of pleiotropic genes during the conserved middle period. Statistical data on these aspects would be useful in evaluating the validity of the findings.

For paragraph 8:

1. During actual embryogenesis, developmental noise is continuously generated. However, in this simulation, noise is only introduced at the initial step. It's unclear if this methodology accurately represents real-embryogenesis and requires an additional discussion.

2. The manuscript says Fig 4 displays the variation among clones, but it appears this is represented in Fig 5.

3. The indicator for gene expression pattern similarity in this analysis is different from those utilized in other assessments of similarity. An explanation for this deviation would be helpful.

For paragraph 9:

1. The statement referring to the necessity for developmental noise to achieve robustness requires further specificity. It should clearly define the context of robustness - against what and of what? Furthermore, the corresponding setup within the simulation that reflects this requirement needs explicit clarification. There seems to be a disconnect with the setup that aims an increase in the terminal differentiated state throughout the simulation. Given this ambiguity, the rationale behind the attenuation of initial noise is not clear.

For paragraph 12:

1. A clear description of what specifically is changing in a stepwise manner in gene expression patterns would enhance understanding. Providing a quantitative evaluation, or conversely, an example of gradual change, would be beneficial.

Paragraph 13:

1. To effectively argue that a switch occurs between initially expressed genes and late-expressed genes around the bottleneck period, the authors should support their findings with statistical analysis, as opposed to merely providing an example.

2. Ref [23] investigated developmental conservation in terms of temporal variability of gene expression patterns. Their analysis highlighted larger temporal expression divergence before, rather than after, the bottleneck period. A discussion on the compatibility between the authors' findings and those from this previous study would be appreciated.

Paragraph 14:

1. In Fig 6b, the rate of gene expression change is gradual, making it difficult to classify genes as “slow”, “normal” or “fast”. The authors should define a clear and quantitative indicator.

2. The authors argue that numerous genes are regulated by slow genes. They should clarify whether this is based on solid evidence or if it is speculative.

Paragraph 15:

1. Fig 6d illustrates the correlation between the timescale of slow genes and the similarity peak. However, in substantial instances, the slow genes reach their expression change peaks after the similarity peak (provided my understanding of the “timescale” of genes is accurate). This may suggest that the data presented here doesn't fully support the causality between slow genes and developmental conservation. The authors should discuss this point.

2. The color codes in the figure legend appear to be misaligned.

<comments discussion="" on="" section="" the="">

For paragraph 2:

1. The simulation implies that substantial developmental noise is essential during evolution. Is there empirical evidence that can corroborate this assumption?

For paragraph 3:

2. The authors state that a decrease in variance due to developmental noise will lead to a reduction in variance due to genetic difference. The underlying mechanisms of this assumption remain unclear to me (Intuitively, it seems to be natural to conclude that robustness against noise would be a byproduct of robustness against genetic variations). The readability of this section could be significantly enhanced by providing further clarification on the authors' assumptions regarding "robustness."

For paragraph 4:

1. Once again, it is essential that the authors provide explicit and objective criteria for distinguishing "slow" genes.

2. The link between the expression of slow genes and the buffering of initial variations remains ambiguous. While Figure 6 indicates a correlation between the expression change in certain genes and the peak of similarity, it doesn't necessarily demonstrate a causal relationship. The authors should support their argument with analytical results, or alternatively, they should refine the manuscript to indicate that the connection is hypotherical.

For paragraph 6:

1. The discussion is intriguing on the diversification of the mid-embryonic period among lineages and the concurrent buffering of variation within a lineage. Apart from the authors' speculation, are there any additional analytical results that shed light on the process of gene repertoire alteration while maintaining slow gene sets? One of the advantages of simulation approach is its ability to trace the evolutionary process, so an analysis of this kind could add significant value to this study.

2. I would like to ascertain whether the authors are aware of the criticisms towards the methodology and conclusions of ref [41]. (Hejnol A, Dunn CW. Animal Evolution: Are Phyla Real? Curr Biol. 2016 May 23;26(10):R424-6.)

For paragraph 7:

1. In justifying their assertion that slow genes regulate many other genes, the authors cite their own previous research. I believe presenting data fromn the simulation conducted in this study would strengthen their argument.

2. The authors' proposal is intriguing to define higher taxa based on the emergence of developmental systems (gene sets) that impose constraints. I suggest that incorporating insights from the simulations about the process of emergence of such gene sets will make this paper more insightful and impactful.</comments></comments></comments>

Reviewer #3: The review is uploaded as an attachment

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Attachment

Submitted filename: referees_report.pdf

pcbi.1011867.s007.pdf (29.2KB, pdf)
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011867.r003

Decision Letter 1

Philip K Maini, James O'Dwyer

29 Jan 2024

Dear Prof. Kaneko,

We are pleased to inform you that your manuscript 'Developmental hourglass: Verification by numerical evolution and elucidation by dynamical-systems theory' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Philip K Maini

Academic Editor

PLOS Computational Biology

James O'Dwyer

Section Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: No more comments.

Reviewer #2: The authors have clearly addressed all of my questions and made the necessary revisions. I’m now convinced that this paper is sufficiently qualified for publication. This study, which dissects the relationship between developmental systems and phenotypic evolution by taking advantage of numerical simulation, will provide valuable insights to the field of evolutionary biology.

Reviewer #3: Reading the revised manuscript, I found that all my concerns as well as other reviewers' were appropriately taken care of.

Minor points

1. A left and right parenthesis do not match in the marked paragraph in P.106

2. P.121 als -> also

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No: Since I could not find the codes and the complete data set, the authors should put them on a public repository following the Journal policy before publication.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1011867.r004

Acceptance letter

Philip K Maini, James O'Dwyer

15 Feb 2024

PCOMPBIOL-D-23-00952R1

Developmental hourglass: Verification by numerical evolution and elucidation by dynamical-systems theory

Dear Dr Kaneko,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Developmental similarity of the examples in the main Fig 1 measured by different similarity indicators.

    The similarity between the developmental dynamics of the two individuals shown in Fig 1A is represented as a heat map. The axes are the same as in Fig 1B, and the similarity is measured by Pearson’s coefficient on the left and Spearman’s rank coefficient on the right.

    (TIF)

    pcbi.1011867.s001.tif (1.6MB, tif)
    S2 Fig. Additional examples that show developmental-hourglass-like similarity profile.

    Four additional examples that show developmental hourglass are represented with developmental similarity. (Left) Two examples of gene expression pattern dynamics of evolved GRNs. The horizontal axis represents the developmental time, and the vertical axis represents the cellular index, i.e., space. (Right) The dependence of similarity is plotted against the developmental time, for species with different phylogenetic distance. The horizontal axis represents the developmental time, and the vertical axis represents the average of the similarity between the spatial patterns of gene expressions, in the same way as in Fig 2. The similarity is computed as in Fig 1C and averaged over pairs of individuals. The line colour denotes the phylogenetic distance of compared species departed from 900, 800,…, 100, 0th generations. The similarity calculated among the species that diverged every 100 generations, from the 900th to the 0th generation is plotted.

    (TIF)

    pcbi.1011867.s002.tif (4.8MB, tif)
    S3 Fig. The time course of the similarity indicator and its dependence on the phylogenetic distance.

    Similarities among the species that diverged from a common ancestor are plotted, using the data from the same simulation run as in Fig 2, but plotted with phylogenic distance per 100 generations. Axes are the same as in Fig 2(left). Each curve represents the similarity among species that have different length of shared generations in the evolutionary path. The top blue curve represents the similarity of species that share 900 generations out of total 1000 generations, and with each 100 generations decrease of the length of shared generations, results are plotted as different colours, from blue to green (see also Figs 1 and 2 for schematic representation).

    (TIF)

    S4 Fig. Similarity between an evolved species and its ancestors.

    Similarities are plotted for the individual of Fig 2A1 with each 100-generation distant ancestor from the 900th to 0th generation. The horizontal axis is the developmental time of the evolved individual, and the vertical axis is the similarity indicator. The similarity is plotted in the same manner as in Fig 2C. The intermediate similarity peak is conserved from the 700th generation ancestor (i.e., 300 generations distant ancestor), which roughly corresponds with the slow gene control acquisition generation.

    (TIF)

    pcbi.1011867.s004.tif (415.2KB, tif)
    S5 Fig. Examples of the similarity profiles with smaller noise levels for the initial condition.

    The relationship between phylogenetic distance and the value of the similarity indicator is represented as the same as Fig 2, SF2, and SF3 but with different noise levels on the initial condition. (A) An example of the case with the noise level on the initial condition σ = 0.01, which is 0.1 in the other figures. (B) An example of the case with the noise level on the initial condition σ = 0.001.

    (TIF)

    pcbi.1011867.s005.tif (1.1MB, tif)
    S6 Fig. Slow gene control and heterochrony.

    Subfigures A)–C) display pattern formation dynamics of the same gene network except for the change in the time-scale parameter of the slowest gene. A: Pattern formation dynamics originally obtained from evolutionary simulation. The value of the timescale parameter of the slowest gene is c5.17×10−2. B: Pattern formation dynamics with a larger timescale of the slowest gene; 10.3×10−2, 2 times larger than that of A. C: Pattern formation dynamics with a smaller timescale of the slowest gene; 2.58×10−2, 0.5 times smaller than that of A.

    (TIF)

    pcbi.1011867.s006.tif (3.8MB, tif)
    Attachment

    Submitted filename: referees_report.pdf

    pcbi.1011867.s007.pdf (29.2KB, pdf)
    Attachment

    Submitted filename: Response_to_Reviewers.pdf

    pcbi.1011867.s008.pdf (304.8KB, pdf)

    Data Availability Statement

    All programs that are needed to reproduce the results in this manuscript are available on Zenodo at https://doi.org/10.5281/zenodo.8049347


    Articles from PLOS Computational Biology are provided here courtesy of PLOS

    RESOURCES