Skip to main content
eLife logoLink to eLife
. 2021 Oct 13;10:e66711. doi: 10.7554/eLife.66711

Evolution of irreversible somatic differentiation

Yuanxiao Gao 1, Hye Jin Park 1,2,3, Arne Traulsen 1, Yuriy Pichugin 1,†,
Editors: Aleksandra M Walczak4, Aleksandra M Walczak5
PMCID: PMC8514237  PMID: 34643506

Abstract

A key innovation emerging in complex animals is irreversible somatic differentiation: daughters of a vegetative cell perform a vegetative function as well, thus, forming a somatic lineage that can no longer be directly involved in reproduction. Primitive species use a different strategy: vegetative and reproductive tasks are separated in time rather than in space. Starting from such a strategy, how is it possible to evolve life forms which use some of their cells exclusively for vegetative functions? Here, we develop an evolutionary model of development of a simple multicellular organism and find that three components are necessary for the evolution of irreversible somatic differentiation: (i) costly cell differentiation, (ii) vegetative cells that significantly improve the organism’s performance even if present in small numbers, and (iii) large enough organism size. Our findings demonstrate how an egalitarian development typical for loose cell colonies can evolve into germ-soma differentiation dominating metazoans.

Research organism: None

Introduction

In complex multicellular organisms, different cells specialise to execute different functions. These functions can be generally classified into two kinds: reproductive and vegetative. Cells performing reproductive functions contribute to the next generation of organisms, while cells performing vegetative function contribute to sustaining the organism itself. In unicellular species and simple multicellular colonies, these two kinds of functions are performed at different times by the same cells – specialization is temporal. In more complex multicellular organisms, specialization transforms from temporal to spatial (Mikhailov et al., 2009), where groups of cells focused on different tasks emerge in the course of organism development.

Typically, cell functions are changed via differentiation, such that a daughter cell performs a different function than the maternal cell. The vast majority of metazoans feature a very specific and extreme pattern of cell differentiation: any cell performing vegetative functions forms a somatic lineage, that is, producing cells performing the same vegetative function – somatic differentiation is irreversible. Since such somatic cells cannot give rise to reproductive cells, somatic cells do not have a chance to pass their offspring to the next generation of organisms. Such a mode of organism development opened a way for deeper specialization of somatic cells and consequently to the astonishing complexity of multicellular animals. Outside of the metazoans – in a group of green algae Volvocales serving as a model species for evolution of multicellularity – the emergence of irreversibly differentiated somatic cells is the hallmark innovation marking the transition from colonial life forms to multicellular species (Kirk, 2005).

While the production of individual cells specialized in vegetative functions comes with a number of benefits (Grosberg and Strathmann, 2007), the development of a dedicated vegetative cell lineage that is lost for organism reproduction is not obviously a beneficial adaptation. From the perspective of a cell in an organism, the guaranteed termination of its lineage seems the worst possible evolutionary outcome for itself. From the perspective of an entire organism, the death of somatic cells at the end of the life cycle is a waste of resources, as these cells could in principle become parts of the next generation of organisms. For example, exceptions from irreversible somatic differentiation are widespread in plants (Lanfear, 2018) and are even known in simpler metazoans among cnidarians (DuBuc et al., 2020) for which differentiation from vegetative to reproductive functions has been reported. Therefore, the irreversibility of somatic differentiation cannot be taken for granted in the course of the evolution of complex multicellularity.

Terminal differentiation is a type of cell differentiation different from irreversible cell differentiation. Unlike irreversibly differentiated cells who are capable of cell division, terminally differentiated cells lose the ability to divide. Terminally differentiated cells often perform tasks too demanding to be compatible with cell division. For example heterocysts of cyanobacteria perform nitrogen fixation, which requires anaerobic conditions, therefore these cells are very limited in resources and do not divide. In the scope of this study, we do not consider terminal differentiation but focus on somatic cells that are able to divide while being part of an organism (or cell colony) but not able to grow into a new organism, that is, irreversible somatic differentiation.

The majority of the theoretical models addressing the evolution of somatic cells focuses on the evolution of cell specialization, abstracting from the developmental process how germ (reproductive specialists) and soma are produced in the course of the organism growth. For example, a large amount of work focuses on the optimal distribution of reproductive and vegetative functions in the adult organism (Michod, 2007; Willensdorfer, 2009; Rossetti et al., 2010; Rueffler et al., 2012; Ispolatov et al., 2012; Goldsby et al., 2012; Solari et al., 2013; Goldsby et al., 2014; Amado et al., 2018; Tverskoi et al., 2018). However, these models do not consider the process of organism development. Other work takes the development of an organism into account to some extent: In Gavrilets, 2010, the organism development is considered, but the fraction of cells capable of becoming somatic is fixed and does not evolve. In Erten and Kokko, 2020, the strategy of germ-to-soma differentiation is an evolvable trait, but the irreversibility of somatic differentiation is taken for granted. In Rodrigues et al., 2012, irreversible differentiation was found, but both considered cell types pass to the next generation of organisms, such that the irreversible specialists are not truly somatic cells in the sense of evolutionary dead ends. Finally, in Cooper and West, 2018 a broad scope of cell differentiation patterns has been investigated in the context of evolution of cooperation. However, irreversible somatic differentiation was not considered in the study. Hence, the theoretical understanding of the evolution of irreversibly differentiated somatic cell lines is limited so far.

In the present work, we developed a theoretical model to investigate conditions for the evolution of the irreversible somatic differentiation. In the model, we suppose there are two cell types: germ-role and soma-role, where only germ-role cells pass to the next generation of organisms while soma-role cells are responsible for vegetative functions. Both germ-role cells and soma-role cells can divide and they may switch to each other during growth. In our model, we incorporate factors including (i) costs of cell differentiation, (ii) benefits provided by presence of soma-role cells, (iii) maturity size of the organism. We ask under which circumstances irreversible somatic differentiation is a strategy that can maximize the population growth rate compared to strategies in which differentiation does not occur or somatic differentiation is reversible.

Model

We consider a large population of clonally developing organisms composed of two types of cells: germ-role and soma-role. The roles differ in the ability to survive beyond the end of the organism life cycle: soma-role cells die at the end, while germ-role cells continue to live. Each organism is initiated as a single germ-role cell. In the course of the organism growth, germ-role cells may differentiate to give rise to soma-role cells and vice versa, see Figure 1A,B. After n rounds of synchronous cell divisions, the organism reaches its maturity size of 2n cells. Immediately upon reaching maturity, the organism reproduces: germ-role cells disperse and each becomes a newborn organism, while all soma-role cells die and are thus lost, see Figure 1A. We assume that soma-role cells are capable to accelerate growth: an organism containing more somatic cells grows faster, so having soma-role cells during the life cycle is beneficial for the organism.

Figure 1. Model overview.

Figure 1.

(A) The life cycle of an organism starts with a single germ-role cell. In each round, all cells divide and daughter cells can differentiate into a role different from the maternal cell’s role. When the organism reaches maturity, it reproduces: each germ-role cell becomes a newborn organism and each soma-role cell dies. (B) Change of cell roles is controlled by a stochastic developmental strategy defined by probabilities of each possible outcomes of a cell division. (C) Differentiation of cells requires an investment of resources and, thus, slows down the organism growth. Each cell differentiation event incurs a cost (csg or cgs). (D) The growth contribution of somatic cells is controlled by a function that decreases the doubling time with the fraction of somatic cells. The form of this function is controlled by four parameters, x0, x1, α, and b.

To investigate the evolution of irreversible somatic differentiation, we consider organisms in which the functional role of the cell (germ-role or soma-role) is not necessarily inherited. When a cell divides, the two daughter cells can change their role, leading to three possible combinations: two germ-role cells, one germ-role cell plus one soma-role cell, or two soma-role cells. We allow all these outcomes to occur with different probabilities, which also depend on the parental type, see Figure 1B. If the parental cell had the germ-role, the probabilities of each outcome are denoted by ggg, ggs, and gss respectively. If the parental cell had the soma-role, these probabilities are sgg, sgs, and sss. Altogether, six probabilities define a stochastic developmental strategy D=(ggg,ggs,gss;sgg,sgs,sss). In our model, it is the stochastic developmental strategy that is inherited by offspring cells rather than the functional role of the parental cell.

To feature irreversible somatic differentiation, the developmental strategy must allow germ-role cells to give rise to soma-role cells (ggg<1) and must forbid soma-role cells to give rise to germ-role cells (sss=1). All other developmental strategies can be broadly classified into two classes. Reversible somatic differentiation describes strategies where cells of both roles can give rise to each other: ggg<1 and sss<1. In the strategy with no somatic differentiation, soma-role cells are not produced in the first place: ggg=1, see Table 1.

Table 1. Classification of developmental strategies.

Class ggg sss
Irreversible somatic differentiation <1 = 1
Reversible somatic differentiation <1 <1
No somatic differentiation = 1 irrelevant

In our model, evolution of the developmental strategy is driven by the growth competition between populations executing different strategies – these populations able to produce more offspring and/or complete their life cycle faster gain a selective advantage. Specifically, we measure the fitness in the growth competition by the population growth rate in a stationary regime of exponential growth (Pichugin et al., 2017; Gao et al., 2019). The rate of population growth is determined by the number of offspring produced by an organism (equal to the number of germ-role cells at the end of life cycle) and the time needed for an organism to develop from a single cell to maturity (improved with the number of soma-role cells during the life cycle).

To obtain these growth rates, we simulate the process of the organism growth. Here, we assume that resource distribution among cells is coordinated at the level of the organism: Cells which need more resources will get more, such that cell division is synchronous. In our model, we consider synchronous cell division of organisms and our main results are dependent on this assumption. However, we shortly explore the effects of asynchronous cell division in Appendix G. Any organism is born as a single germ-role cell and passes through n rounds of simultaneous cell divisions. Each round starts with every cell independently choosing the outcome of its division with probability of each outcome given by the developmental strategy (D). This step determines what composition will the organism have at the next round of cell division. Then, the length of the cell doubling round (t) is computed as a product of two independent effects: the differentiation effect Fdiff representing costs of changing cell roles (Gallon, 1992) and the organism composition effect Fcomp representing benefits from having soma-role cells (Grosberg and Strathmann, 1998; Shelton et al., 2012; Matt and Umen, 2016),

t=Fdiff×Fcomp. (1)

Both Fdiff and Fcomp are re-calculated at every round of cell division.

The cell differentiation effect Fdiff represents the costs of cell differentiation. The differentiation of a cell requires efforts to modify epigenetic marks in the genome, recalibration of regulatory networks, synthesis of additional and utilization of no longer necessary proteins. This requires an investment of resources and therefore an additional time to perform cell division. Hence, any cell, which is about to give rise to a cell of a different role, incurs a differentiation cost cgs for germ-to-soma and csg for soma-to-germ transitions (and double of these if both offspring take a role different from the parent), see Figure 1C. The differentiation cost is the averaged differentiation cost among all cells in an organism

Fdiff=1+c=1+csg(Nsgs+2Nsgg)+cgs(Nggs+2Ngss)N, (2)

where Nsgs is the number of soma-roll cells that produce a germ-role cell and a soma-role cell in a cell division step. Nsgg, Nggs and Ngss are defined in the analogous way. N is the number of total cells. As organisms undergo synchronous cell division, we have N=2n cells after the n th cell division.

The composition effect profile Fcomp(x) captures how the cell division time depends on the proportion of soma-role cells x=s/(s+g) present in an organism (s and g are the numbers of soma-role and germ-role cells). In this study, we use a functional form illustrated in Figure 1D and given by

Fcomp(x)={1for0xx01b+b(x1xx1x0)forx0<x<x11bforx1x1 (3)

With the functional form (3), soma-role cells can benefit to the organism growth, only if their proportion in the organism exceeds the contribution threshold x0. Interactions between soma-role cells may lead to the synergistic (increase in the number of soma-role cells improves their efficiency), or discounting benefits (increase in the number of soma-role cells reduces their efficiency) to the organism growth, controlled by the contribution synergy parameter α. The maximal achievable reduction in the cell division time is given by the maximal benefit b, realized beyond the saturation threshold x1 of the soma-role cell proportion. A further increase in the proportion of soma-role cells does not provide any additional benefits. With the right combination of parameters, (3) is able to recover various characters of soma-role cells contribution to the organism growth: linear (x0=0,x1=1,α=1), power-law (x0=0,x1=1,α1), step-functions (x0=x1), and a huge range of other scenarios. Previous works have shown that convex (accelerating) performance functions favour cell differentiation (Michod, 2006; Rueffler et al., 2012; Cooper and West, 2018). The performance functions measure the performance of organisms with respect to different traits, such as fertility and viability. Lately, the form of functions favoring cell differentiation has been extended to be concave (decelerating) by including topological constraints in organisms (Yanni et al., 2020). Our model extends the form of performance functions by allowing it has a contribution threshold and saturation threshold.

Once the outcome of all cell divisions is known and the time needed to complete the current cell doubling round is computed, the current round ends and the next starts. The development completes after n rounds. At this stage, the number of germ-role cells (organism offspring number) and the cumulative length of the life cycle are obtained.

In Gao et al., 2019, we have shown that the growth rate (λ) of a population, in which organisms undergo a stochastic development and fragmentation, is given by the solution of

iGiPie-λTi=1. (4)

Here, i is the developmental trajectory – in our case, the specific combination of all cell division outcomes; Gi is the number of offspring organisms produced at the end of developmental trajectory i, equal to the number of germ-role cells at the moment of maturity; Pi is the probability that an organism development will follow the trajectory i; Ti is the time necessary to complete the trajectory i – from a single cell to the maturity size of 2n cells.

For a given combination of differentiation costs (cgs, csg) and a composition effect profile (determined by four parameters: x0, x1, b, and α), we screen through a number of stochastic developmental strategies D and identify the one providing the largest growth rate (λ) to the population. In this study, we searched for those parameters under which irreversible strategies lead to the fastest growth and are thus evolutionary optimal, see model details in Appendix A.

Results

For irreversible somatic differentiation to evolve, cell differentiation must be costly

We found that irreversible somatic differentiation does not evolve when cell differentiation is not associated with any costs (csg=cgs=0), see Figure 2A. Only reversible differentiation evolves there, see Figure 2B. This finding comes from the fact that when somatic differentiation is irreversible, the fraction of germ-role cells can only decrease in the course of life cycle. As a result, irreversible strategies deal with the tradeoff between producing more soma-role cells at the beginning of the life cycle, and having more germ-role cells by the end of it. On the one hand, irreversible strategies which produce a lot of soma-role cells early on, complete the life cycle quickly but preserve only a few germ-role cells by the time of reproduction. On the other hand, irreversible strategies which generate a lot of offspring, can deploy only a few soma-role cells at the beginning of it and thus their developmental time is inevitably longer. By contrast, reversible somatic differentiation strategies do not experience a similar tradeoff, as germ-role cells can be generated from soma-role cells. As a result, reversible strategy allows higher differentiation rates and can develop a high soma-role cell fraction in the course of the organism growth and at the same time have a large number of germ-role cells by the moment of reproduction. Under costless cell differentiation, for any irreversible strategy, we can find a reversible differentiation counterpart, which leads to faster growth: the development proceeds faster, while the expected number of produced offspring is the same, see Appendix 2 for details. As a result, costless cell differentiation cannot lead to irreversible somatic differentiation.

Figure 2. Impact of cell differentiation costs on the evolution of development strategies.

Figure 2.

The fractions of 200 random composition effect profiles promoting irreversible (A), reversible (B), and no differentiation (C) strategies at various cell differentiation costs (csg, cgs). In the absence of costs (cgs=csg=0), only reversible strategies were observed. Reversible strategies are prevalent at smaller cell differentiation costs. No differentiation strategies are the most abundant at large costs for germ-role cells (cgs). Irreversible strategies are the most abundant at large costs for soma-role cells (csg). (D) Cumulative cell differentiation rate (gss+12ggs+sgg+12sgs) in developmental strategies evolutionarily optimal at various differentiation costs (csg=cgs), separated by class (irreversible somatic differentiation, reversible somatic differentiation, or no somatic differentiation). Thick lines represent median values within each class, shaded areas show 90% confidence intervals. For each cost value, 3000 random profiles are used in this panel. Evolutionary optimal reversible strategies (orange) have much higher rates of cell differentiation than irreversible strategies (green). Consequently, reversible strategies are penalized more under costly differentiation. (E–H) Shapes of composition effect profiles (compare Figure 1D) promoting irreversible (green lines), reversible (orange lines), and no differentiation (black lines) strategies at four parameter sets indicated in panel A. The maturity size used in the calculation is 210 cells.

To confirm the reasoning that reversible strategies gain an edge over irreversible strategies by having larger differentiation rates, we asked which reversible and irreversible strategies become optimal at various cell differentiation costs (c=csg=cgs). At each value of costs, we found evolutionarily optimal developmental strategy for 3000 different randomly sampled composition effect profiles Fcomp(x). We found that evolutionarily optimal reversible strategies feature much larger rates of cell differentiation than evolutionarily optimal irreversible strategies, see Figure 2D. Even at large costs, where frequent differentiation is heavily penalized, the distinction between differentiation rates of reversible and irreversible strategies remains apparent.

We screened through a spectrum of germ-to-soma (cgs) and soma-to-germ (csg) differentiation costs, see Figure 2A–C. Irreversible somatic differentiation is most likely to evolve when it is cheap to differentiate from germ-role to soma-role (low cgs) but it is expensive to differentiate back (high csg), see Figure 2A. Irreversible strategies are insensitive to high soma-to-germ costs, since soma-role cells never differentiate. At the same time, reversible strategies are heavily punished by high costs of soma-role differentiation.

It is not very surprising to find irreversible differentiation where the differentiation costs are highly asymmetric. However, irreversible strategies are consistently observed in other regions of the costs space, even including these, where the asymmetry is opposite (it is hard to go from germ to soma but easy to return back), see Figure 2A,H. To identify what other factors, beyond asymmetric costs, can lead to evolution of irreversible somatic differentiation, below we focus on the scenario of equal differentiation costs csg=cgs=c.

Evolution of irreversible somatic differentiation is promoted when even a small number of somatic cells provides benefits to the organism

The composition effect profiles Fcomp(x) that promote the evolution of irreversible somatic differentiation have certain characteristic shapes, see Figure 2E–H. We investigated what kind of composition effect profiles can make irreversible somatic differentiation become an evolutionary optimum. We sampled a number of random composition effect profiles with independently drawn parameter values and found optimal developmental strategies for each profile for a number of differentiation costs (c) and maturity size (2n) values. We took a closer look at the instances of Fcomp(x) which resulted in irreversible somatic differentiation being evolutionarily optimal.

We found that irreversible strategies are only able to evolve when the soma-role cells contribute to the organism cell doubling time even if present in small proportions, see Figure 3A,B. Analysing parameters of the composition factors promoting irreversible differentiation, we found that this effect manifests in two patterns. First, the contribution threshold value (x0) has to be small, see Figure 3D – irreversible differentiation is promoted when soma-role cells begin to contribute to the organism growth even in low numbers. Second, the contribution synergy was found to be large (α>1) or, alternatively, the saturation threshold (x1) was small, see Figure 3C.

Figure 3. Irreversible soma evolves when substantial benefits arise at small concentrations of soma-role cells.

Figure 3.

In all panels, the data representing the entire set of composition effect profiles Fcomp(x) is presented in grey, while the subset promoting irreversible strategies is coloured. (A, B) Median and 90% confidence intervals of composition effect profiles at different differentiation costs (A, number of cell division n=10) and maturity sizes (B, differentiation costs c=5). (C, D) The set of composition effect profiles in the parameter space. Each point represents a single profile (c=5 and n=10). (C) The co-distribution of the saturation threshold (x1) and the contribution synergy (α) reveals that either x1 must be small or α must be large. (D) Co-distribution of the contribution threshold (x0) and the maximal benefit (b) shows that x0 must be small, while b must be intermediate to promote irreversible differentiation. A total of 3000 profiles are used for panels A, C, D and 1000 profiles for panel B.

Both the contribution threshold x0 and the contribution synergy α control the shape of the composition effect profile at intermediary abundances of soma-role cells. If the contribution synergy α exceeds 1, the profile is convex, so the contribution of soma-role cells quickly becomes close to maximum benefit (b). A small saturation threshold (x1) means that the maximal benefit of soma is achieved already at low concentrations of soma-role cells (and then the shape of composition effect profile between two close thresholds has no significance). Together, these patterns give an evidence that the most crucial factor promoting irreversible somatic differentiation is the effectiveness of soma-role cells at small numbers, see Appendix 4 for more detailed data presentation.

These patterns are driven by the static character of differentiation strategies we use: the chances for a cell to differentiate are the same at the first and the last round of cell division. Therefore, the optimal germ-to-soma differentiation rate is found as a balance between the needs to deploy soma-role cells early on and to keep the high number of germ-role by the end of the life cycle. This implies that irreversible somatic differentiation strategies produce soma-role cells at lower rate than reversible strategies, see Figure 2D. With irreversible differentiation, an organism spends a significant amount of time having only a few soma-role cells. Hence, the irreversible strategy can only be evolutionarily successful, if the few soma-role cells have a notable contribution to the organism growth time.

We also found that profiles featuring irreversible differentiation do not possess neither extremely large, nor extremely small maximal benefit values b, see Figure 3D. When the maximal benefit is too small, the cell differentiation just does not provide enough benefits to be selected for and the evolutionarily optimal strategy is no differentiation. In the opposite case, when the maximal benefit is very close to one, the cell doubling time approaches zero, see Equation (3). Then, the benefits of having many soma-role cells outweighs the costs of differentiation and the optimal strategy is reversible, see Appendix 4.

For irreversible somatic differentiation to evolve, the organism size must be large enough

By screening through the maturity size (2n) and differentiation costs (c), we found that the evolution of irreversible somatic differentiation is heavily suppressed at small maturity sizes, Figure 4A. We found that either reversible strategies or the no differentiation strategy evolve in small organisms. Since reversible strategies can quickly reach a fixed fraction of soma-role cells, thus they can obtain maximised benefits from soma-role cells with small maturity sizes (Appendix 2—figure 1). Since the no differentiation strategy does not involve cell differentiation, they do not have cell differentiation costs. In contrast, irreversible strategies increase the fraction of soma-roles and increase the benefits of soma-role cells gradually as maturity size increases. Meanwhile, the cell differentiation costs for irreversible strategies decrease as maturity size increases as the fraction of germ-role cells decreases. Thus compared with other strategies, the irreversible strategies have advantages in large organisms. We found that under csg=cgs, the minimal maturity size allowing irreversible somatic differentiation to evolve is 2n=64 cells. At the same time, organisms performing just a few more rounds of cell divisions are able to evolve irreversible differentiation at a wide range of cell differentiation costs, see also Appendix 5. This indicates that the evolution of irreversible somatic differentiation is strongly tied to the size of the organism.

Figure 4. Irreversible differentiation can evolve if organism grows to a large enough size in the course of its life cycle.

Figure 4.

(A) The fraction of composition effect profiles promoting irreversible strategies at various cell differentiation costs (c=csg=cgs) and maturity sizes (2n). Irreversible strategies were only found for maturity size 26 = 64 cells and larger. (B) The fraction of composition effect profiles promoting irreversible strategies at unequal differentiation costs csg=2cgs. A rare occurrences of irreversible strategies (∼1%) was detected at the maturity size 25=32 cells in a narrow range of cell differentiation costs but not at the smaller sizes. (C) The range of cell differentiation costs allowing evolution of irreversible strategies at at the maturity size 2n=32 (n=5) cells. For irreversible strategies to evolve at such a small size, the differentiation from soma-role to germ-role must be much more costly than the opposite transition (csgcgs).

Evolution of irreversible strategies at sizes smaller than 64 cells is possible for csg>cgs. For instance, at csg=2cgs some irreversible strategies were found to be optimal at the maturity size 25 = 32 cells, Figure 4B. However, irreversible strategies were found in a narrow range of cell differentiation costs and the fraction of composition effect profiles that allow evolution of irreversible differentiation there was quite low – about 1%. The evolution of irreversible strategies at such small maturity sizes becomes likely only at extremely unequal costs of transition between germ and some roles csgcgs, see Figure 4C. Hence, for irreversible somatic differentiation to evolve, the organism size should exceed a threshold of roughly 64 cells.

Irreversible somatic differentiation can also evolve when cell differentiation is risky

In our main model, we considered differentiation costs in a specific form of cell division delay. However, the process of cell differentiation may impact the organism development in another way. Differentiation requires modifications in DNA regulation, which in turn poses a risk of dysregulation resulting in an emergence of selfish mutants that could for example cause cancer. The disposable soma theory suggests that cells performing vegetative functions form separate lineages to contain emerging mutations and prevent them from passing to the next generations of organisms. In line with this hypothesis, we also considered a model of risky cell differentiation, where the transition between germ and soma roles incurs a risk of getting cancer that kills the entire organism, see Appendix 6.

The results obtained with a model of risky differentiation are very similar to the outcomes of our main model, where cell differentiation cause delay, see Figure 5. In both models, irreversible differentiation only evolves if cell differentiation does not come for free but brings costly side-effects (delay or risk). Also, in both models irreversible differentiation is prevalent when costs of soma-to-germ transitions are intense; reversible differentiation is prevalent when costs of both transitions are low; and no differentiation is prevalent when costs of germ-to-soma transitions are intense Figure 2A–C.

Figure 5. Irreversible differentiation can evolve if cell differentiation incurs a risk of organism death instead of division delay.

Figure 5.

The fractions of 200 random composition effect profiles promoting irreversible (A), reversible (B), and no differentiation (C) strategies at various risks of cell differentiation (δsg, δgs). The qualitative pattern is very similar to the results obtained with the model of cell differentiation causing delay, see Figure 2A–C. In the absence of risk (δgs=δsg=0), only reversible strategies were observed. Reversible strategies are prevalent at smaller risk values. No differentiation strategies are the most abundant at large risk of germ-to-soma transition (δgs). Irreversible strategies are the most abundant at large risks of soma-to-germ transition (δsg). The maturity size used in the calculation is 210 cells. The legend is the same as that in Figure 2A–C.

Discussion

The vast majority of cells in a body of any multicellular being contains enough genetic information to build an entire new organism. However, in a typical metazoan species, very few cells actually participate in the organism reproduction – only a limited number of germ cells are capable of doing it. The other cells, called somatic cells, perform vegetative functions but do not contribute to reproduction – somatic differentiation is irreversible. We asked for the reason for the success of such a specific mode of organism development. We theoretically investigated the evolution of irreversible somatic differentiation with a model of clonally developing organisms taking into account benefits provided by soma-role cells, costs arising from cell differentiation, and the effect of the raw organism size.

Our key findings are:

  • The evolution of irreversible somatic differentiation is inseparable from costly cell differentiation or risky cell differentiation.

  • For irreversible somatic differentiation to evolve in organisms with synchronous cell division, somatic cells should be able to contribute to the organism performance already when their numbers are small.

  • Only large enough organisms tend to develop irreversible somatic differentiation.

According to our results, cell differentiation costs are essential for the emergence of irreversible somatic differentiation, see Figure 2A. The costs punish strategies with high rate of cell differentiation. As a result, irreversible strategies gain an advantage because their overall differentiation rate is low, see Figure 2D, and soma-role cells do not differentiate at all. Most models focus on traits that lead to benefits for the organism, while the cost of cell differentiation are rarely considered. For cells in a multicellular organism, differentiation costs arise from the material needs, energy, and time it takes to produce components necessary for the performance of the differentiated cell, which were absent in the parent cell. For instance, in filamentous cyanobacteria nitrogen-fixating heterocysts develop much thicker cell wall than parent photosynthetic cells had. Also, reports indicate between 23% (Ow et al., 2008) and 74% (Sandh et al., 2014) of the proteome changes its abundance in heterocysts compared against photosynthetic cells. Similarly, the changes in the protein composition in the course of cell differentiation was found during the development of stalk and fruiting bodies of Dictyostelium discoideum (Bakthavatsalam and Gomer, 2010; Czarna et al., 2010).

An alternative to differentiation costs in terms of slower growth is a model with a risky differentiation, where we found similar patterns, see Figure 5. These results indicate that the exact mechanism of the differentiation costs does not play a major role in the evolution of irreversible somatic differentiation.

Our model demonstrates that irreversible somatic differentiation is more likely to evolve when a few soma-role cells are able to provide a substantial benefit to the organism, see Figure 3. Volvocales algae demonstrate that a significant contribution by small numbers of somatic cells might indeed be found in a natural population: In Eudorina illinoiensis, only four out of thirty-two cells are vegetative (Sambamurty AVSS, 2005) (soma-role in our terms). This species has developed some reproductive division of labour and a fraction of only 1/8 of vegetative cells is sufficient for colony success. Thus, it seems possible that highly-efficient soma-role cells open the way to the evolution of irreversible somatic differentiation. Several patterns of how cells proved the benefit to an organism have been previously considered (Michod, 2007; Willensdorfer, 2009; Rossetti et al., 2010; Rueffler et al., 2012; Cooper and West, 2018; Yanni et al., 2020). The majority of papers focuses on the resource allocation toward different tasks in each cell in an organism and how divergent different cells can be. In our model, we assume that the germ-role and soma-role cell are different in function and focus on the relationship between the number of soma-role cells and their impact, e.g. the character of their interactions. While the found Fcomp curves exhibit convex-like shape, see Figure 3A,B, this finding has a different nature from the convex trade-off between fertility and viability found in the models of cell differentiation (Michod, 2007).

Our model shows that irreversible somatic differentiation does not evolve if the organism size is small, see Figure 4A. The maturity size plays an important role in an organism’s life cycle (Amado et al., 2018; Erten and Kokko, 2020): Large organisms have potential advantages to optimize themselves in multiple ways, such as to improve growth efficiency (Waters et al., 2010), to avoid predators (Matz and Kjelleberg, 2005; Fisher et al., 2016; Hiltunen and Becks, 2014), to increase problem-solving efficiency (Morand-Ferron and Quinn, 2011), and to exploit the division of labour in organisms (Carroll, 2001; Matt and Umen, 2016). Moreover, the maximum size has been related to the reproduction of the organism from the onset of multicellularity in Earth’s history (Ratcliff et al., 2012). Our results suggest that the smallest organism able to evolve irreversible somatic differentiation should typically be about 32–64 cells (unless the cost of soma-to-germ differentiation is extremely large and the cost of the reverse is low). This is in line with the pattern of development observed in Volvocales green algae. In Volvocales, cells are unable to move (vegetative function) and divide (reproductive function) simultaneously, as a unique set of centrioles are involved in both tasks (Wynne and Bold, 1985; Koufopanou, 1994). Chlamydomonas reinhardtii (unicellular) and Gonium pectorale (small colonies up to 16 cells) perform these tasks at different times. They move towards the top layers of water during the day to get more sunlight. At night, however, these species perform cell division and/or colony reproduction, slowly sinking down in the process. However, among larger Volvocales, a division of labour begins to develop. In Eudorina elegans colonies, containing 16–32 cells, a few cells at the pole have their chances to give rise to an offspring colony reduced (Marchant, 1977; Hallmann, 2011). In P. californica, half of the 128-celled colony is formed of smaller cells, which are totally dedicated to the colony movement and die at the end of colony life cycle (Kikuchi, 1978; Hallmann, 2011). In Volvox carteri, most of a 10,000 cell colony is formed by somatic cells, which die upon the release of offspring groups (Hallmann, 2011).

In a majority of our tests, we used the maturity size of 210 = 1024 cells. This is significantly larger than the minimal necessary size for evolution of irreversible somatic differentiation. However, the body size of the order of 1000 cell attracts attention because at this scale organisms of very diverse degrees of complexity are observed: from undifferentiated colonies (ocean algae Phaeocystis antarctica), to intermediary life forms (slime molds slugs), to paradigm multicellular organisms (higher Volvocales and nematode Caenorhabditis elegans).

The model presented in our study focuses on the transition from colonial life forms to multicellular beings. Further development of complexity opens multiple new ways for optimization of life cycle. For example, a maternal organism can provide protection and nurture for offspring at their early stages of growth, like in V. carteri (10,000 cells) in which offspring colonies develop inside the parental organism. There, the rate of offspring growth depends mostly on the performance of the maternal organism and much less on the differentiation strategy of offspring. Having maternal protection allows to relax the conditions for evolution of irreversible differentiation indicated in our study. How much these conditions can be relaxed is a very interesting question.

One of the most significant assumptions we took is the synchronicity of cell divisions even if division outcomes are different. This is only possible if cell actions are coordinated at the level of organism – otherwise, cells that do not differentiate may complete their divisions before differentiating cells. When in the history of multicellularity such a coordination emerges is an open question. However, in a number of rather simple species, a synchronicity of cell divisions paired with cell differentiation is observed. One example is the green algae Eudorina illinoiensis – one of the simplest species demonstrating the first signs of reproductive division of labour, in which four out of 32 cells are differentiated (Sambamurty AVSS, 2005). Another example is 128-celled algae Pleodorina californica, half of the cells are differentiated. And still, the cell divisions are synchronous (Kikuchi, 1978). Even the size of the mature organism being a power of two indicates that cells do not divide independently, but their actions are controlled at the level of the organism.

To peek at the impact of the cell division synchronicity, we developed a model with asynchronous cell division, where cell differentiation costs are paid individually by each differentiating cell, see Appendix. G. We found that the evolution of irreversible differentiation is significantly suppressed even under the most favourable conditions (csgcgs) – the frequency of composition profiles promoting irreversible somatic differentiation is much smaller and the maturity size restriction is higher.

Another assumption, which shapes the results of our study, is the static differentiation strategy the probability of each division outcome does not depend on the stage of life cycle. On the one hand, the static nature of differentiation strategy puts irreversible differentiation in disadvantage, as it creates a trade-off between the fraction of soma-role cells at the early stage of life cycle and the number of germ-role cells at the end of life cycle. On the other hand, a set of fully flexible dynamic differentiation strategies present an efficient but hardly realistic solution to the life cycle optimization problem: at the first round of cell divisions organism converts to all-soma state and remains so until the last round, when all cells convert back to germ-state. Theoretically, this strategy provides simultaneously the fastest possible development rate (100% soma-role cells during life cycle) and the largest possible number of offspring (100% germ-role cells at the end of life cycle). Still, we cannot provide an example of such a developmental program in nature. Nevertheless, the differentiation strategy of higher Volvocales is not static Kirk, 2005 and the exploration of a vast space of dynamic differentiation strategies warrants further investigation.

We acknowledge that our discussion of natural examples of germ-soma differentiation relies heavily on Volvocales algae. This merely reflects the bias in the empirical literature about evolution of germ/soma differentiation towards this group. We should note that our model is not a model of Volvocales life cycle. Instead, we aim to answer the question about emergence of irreversible somatic differentiation in a broad context without tailoring it to the features of a single group.

Our study originated from curiosity about driving factors in the evolution of irreversible somatic differentiation: Why does the green algae Volvox from the kingdom Plantae shed most of its biomass in a single act of reproduction? And why, in another kingdom, Animalia, in most of the species the majority of body cells is outright forbidden to contribute to the next generation? Our results show which factors makes a difference between the evolution of an irreversible somatic differentiation and other strategies of development. One of these factors, the maturity size, is known in the context of the evolution of reproductive division of labour (Kirk, 2005). Another factor, the costs of cell differentiation, is, in general, discussed in a greater biological scope but is hardly acknowledged as a factor contributing to the evolution of organism development. Finally, the early contribution of soma-role cells to the organism growth, even if they are small in numbers, is an unexpected outcome of our investigation, overlooked so far as well. Despite the simplistic nature of our model (we did not aim to model any specific organism), all our results find a confirmation among the Volvocales clade. Hence, we expect that the findings of this study reveal general properties of the evolution of irreversible somatic differentiation, independently of the clade where it evolves.

Acknowledgements

YG, HJP, AT, and YP thank the Max Planck Society for generous funding. HJP was supported by the NRF grant funded by the Korea government (MSIT) Grant No.2020R1A2C1101894 and by an appointment to the JRG Program at the APCTP through the Science and Technology Promotion Fund and Lottery Fund of the Korean Government. This was also supported by the Korean Local Governments - Gyeongsangbuk-do Province and Pohang City.

Appendix 1

Search for the evolutionarily optimal developmental program

Finding the population growth rate for a given developmental program

In Gao et al., 2019, we have shown that a population of organisms, which begin their life cycle from the same state but have a stochastic development, eventually grows exponentially with the rate λ given by the solution of

ie-λTiGiPi=1. (5)

Here, i is the developmental trajectory – in our case, the specific combination of all cell division outcomes; Pi is the probability that an organism development will follow the trajectory i; Ti is the time necessary to complete the trajectory i – from a single cell to the maturity size of 2n cells; Gi is the number of offspring organisms produced at the end of developmental trajectory i, equal to the number of germ-role cells at the moment of maturity.

In order to find the population growth rate, we need to know Gi, Ti, and Pi (how many offspring are produced, how long did it take to mature, and how likely is this developmental trajectory, respectively). The complete set of developmental trajectories is huge as it scales exponentially with the number of divisions n.

In our study, for each developmental strategy, we sampled M=300 developmental trajectories at random. To get each trajectory, we simulated the growth of the single organism according to the rules of our model. For each trajectory, the developmental time Ti was computed as a sum of cell doubling times at each of the n synchronous cell divisions, the number of offspring Gi was given by the count of germ-role cells at the end of development. The resulting ensemble of trajectories (with Pi=1/M) was plugged into (5) to compute the population growth rate λ.

Finding the developmental program with the largest population growth rate

We assume that evolution occurs by growth competition between populations executing different developmental strategies. These strategies, which provide larger population growth rate will outgrow others. To find evolutionarily optimal strategies under given conditions, we screened through a large set of developmental strategies and identified the one with the maximal population growth rate λ. Since the probabilities of cell division outcomes sum into one (ggg+ggs+gss=1 and sgg+sgs+sss=1), these probabilities can be represented as a point on two simplexes, one for the division of germ-role cells, and one for the division of soma-role cells. Consequently, we choose the set of developmental strategies as a Cartesian product of two triangular lattices – one for division probabilities of germ-role cells (ggg,ggs,gss) and one for soma-role cells (sgg,sgs,sss). The lattice space was set to 0.1, so each of two independent lattices contained 11×12/2=66 nodes, and the whole set of developmental strategies comprised 66 × 66 = 4356 different strategies. For each of these strategies, the population growth rate λ was calculated and the strategy with the largest growth rate was identified as evolutionarily optimal.

In our investigation, parameters such as differentiation costs (csg, cgs) and maturity size (2n) were used as control parameters. In other words, we either fix them at the specific values, or screened through a range of values to obtain a map (see Figures 2 and 3 in the main text). However, the parameters that controlled the shape of composition effect profile (x0, x1, α, and b) were treated differently. For each combination of control parameters, we randomly sampled a number (between 200 and 3000) of combinations of these parameters. The thresholds (0x0x11) were sampled as a pair of independent distributed random values from the uniform distribution U(0,1). The contribution threshold x0 was set to the minimum of the pair, and the saturation threshold x1 was set to the maximum. The contribution synergy (α>0) corresponds to the concave shape of the profile at α<1 and to the convex shape at α>1. Therefore, log10(α) was sampled from the uniform distribution U(-2,+2), so the profile has an equal probability to demonstrate concave and convex shape. Finally, the maximum benefit (0b<1) was sampled from a uniform distribution, U(0,1). For each tested combination of control parameters, we found the optimal developmental strategy for every sampled profile. We then classified these as irreversible somatic differentiation, reversible somatic differentiation, or no somatic differentiation.

Appendix 2

Under costless cell differentiation, irreversible soma strategy cannot be evolutionarily optimal

In this section, we will show that an irreversible strategy can never be an evolutionary optimum without cell differentiation being costly. To do that, we first consider the deterministic dynamics of the expected composition of the organism. Then, for an arbitrary irreversible strategy, we identify a more advantageous reversible strategy which gives the same organism composition at the end of life cycle but higher number of soma-role cells during the life cycle.

In our model, the composition of the organism is governed by the stochastic developmental strategy and differs between different organisms. Here, as a proxy for this complex stochastic dynamics, we consider the mathematical expectation of the composition. Assume that after j cell divisions the fraction of soma-role cells is rs(j) and the fraction of germ-role cells is rg(j)=1rs(j) , j=1,,n, where n is the maximal number of divisions. Then, the expected fractions of cells of the two types after the next cell division is 

rs(j+1)=(sss+sgs2)rs(j)+(ggs2+gss)rg(j)=(1ms)rs(j)+mgrg(j),
rg(j+1)=(ggg+ggs2)rg(j)+(sgs2+sgg)rs(j)=(1mg)rg(j)+msrs(j) (6)

where we introduced ms=sgg+sgs2 and mg=gss+ggs2 – the probabilities that the offspring of a cell will have a different role. Naturally, for irreversible somatic differentiation ms=0 and mg>0 , for no somatic differentiation strategies mg=0 and ms being irrelevant, while the reversible differentiation class covers the rest. (6) can be written in matrix form

(rs(j+1)rg(j+1))=(1msmgms1mg)(rs(j)rg(j)) (7)

A newborn organism contains a single germ-role cell (rs(0)=0,rg(0)=1) , therefore, the expected composition of an organism after j divisions is

(rs(j)rg(j))=(1msmgms1mg)j(01) (8)

The matrix has two eigenvalues: 1 and 1-mg-ms, with associated right eigenvectors (mg,ms)T and (1,-1)T, respectively. Hence, the expected composition after j divisions can be obtained in the explicit form

rs(j)=1mg+ms[mgmg(1mgms)j],
rg(j)=1mg+ms[ms+mg(1mgms)j]. (9)

For an arbitrary irreversible somatic differentiation strategy D, ms=0, the expected number of soma-role cells changes as

rs,D(j)=1(1mg)j, (10)

which is a monotonically increasing function of the number of cell divisions t, see the green line in Fig. B. In the life cycle involving j cell divisions, the fraction of soma-role cells at the end of life cycle is rs,D(j)=1(1mg)j.

Now, we consider another developmental strategy D with reversible somatic differentiation in which mg=rs,D(n) and ms=1rs,D(n). Using mg+ms=1 in (9), it can be shown that the expected fraction of soma-role cells in D after the very first cell division is exactly rs,D(n) and stays constant thereafter, see the orange line in Fig. B. Thus, the number of offspring produced is the same for both development strategies.

If cell differentiation is costless (ds=dg=0), then the cell doubling time depends only on the fraction of soma-role cells. As all soma-role cells are then present already after the first cell division, organisms following the reversible strategy D will grow faster than organisms using the irreversible strategy D at any stage of organism development, independently of the choice of the composition effect profile (Fcomp). At the end of the life cycle, both strategies have the same expected number of offspring. Therefore, under costless cell differentiation, for any irreversible strategy, we can find a reversible strategy that leads to a larger population growth rate.

Appendix 2—figure 1. Under costless differentiation, for any irreversible somatic differentiation strategy, exists a reversible somatic differentiation strategy dominating it.

Appendix 2—figure 1.

The green curve shows the dynamics of the expected fraction of soma-role cells in an organism using an irreversible strategy (mg=0.1, ms=0.0, n=12). The orange curve shows the dynamics of the expected fraction of soma-role cells in an organism using the specific reversible strategy [mg=1-(1-mg)120.72, ms=1-mg0.28]. In this strategy, the number of offspring produced at the end of the life cycle is the same as in the considered irreversible strategy. At the same time, the fraction of soma-role cells during the life cycle is larger. Therefore, under costless differentiation, the presented reversible strategy is more effective than the considered irreversible strategy.

Appendix 3

Conditions promoting the evolution of reversible, irreversible, and no differentiation strategies

Appendix 3—figure 1. Impact of maturity size on the evolution of development strategies.

Appendix 3—figure 1.

The fractions of 200 random composition effect profiles promoting irreversible (A), reversible (B), and no differentiation (C) strategies at various cell differentiation costs (c=csg=cgs) and maturity size 2n. Irreversible strategies are most abundant at large maturity sizes and intermediary cell differentiation costs. Reversible strategies are most abundant at small cell differentiation costs. No differentiation strategies are most abundant at small maturity sizes and cell differentiation costs. The legend is the same as that in Figure 2A–C.

Appendix 4

Parameters of composition effect profiles promoting reversible, irreversible, and no differentiation strategies

Appendix 4—figure 1. Impact of composition effect parameters on the evolution of development strategies.

Appendix 4—figure 1.

Each diagonal panel represents individual distribution of each of four parameters among composition effect profiles promoting irreversible (green), reversible (orange), and no differentiation (black) strategies. Each non-diagonal panel represents a pairwise co-distribution of these parameters. Irreversible (green) strategies are promoted at small contribution thresholds x0 and intermediary maximal benefit b. Also, either the contribution synergy α must be large, or the saturation threshold x1 should be small – see main text for detailed discussion. Reversible (orange) strategies require large b– there the benefits of having a large number of soma-role cells outweighs costs paid by frequent differentiation. Due to the fast accumulation of soma-role cells, reversible strategies tolerate larger x0 than irreversible. Reversible exhibit the same restrictions with respect to x1 as irreversible and are insensitive to α. For this figure, 3000 composition effect profiles were investigated with costs c=csg=cgs=5 and n=10.

Appendix 5

Evolution of irreversible somatic differentiation under various maturity sizes and unequal cell differentiation costs

Appendix 5—figure 1. Evolution of irreversible somatic differentiation at unequal cell differentiation costs.

Appendix 5—figure 1.

(A-C) The fraction of 200 random composition effect profiles promoting irreversible strategy at various cell differentiation costs (csg, cgs) at fixed maturity size n=5 (panel A), 10 (B), and 15 (C). Larger maturity sizes promote the evolution of irreversible differentiation across all cell differentiation costs. (D-F) The fraction of composition effect profiles promoting irreversible strategy at unequal cell differentiation costs csg/cgs=2 (panel D), csg/cgs=1 (E), and csg/cgs=0.5 (F). Even with unequal differentiation costs, the minimal maturity size allowing the evolution of irreversible differentiation stays roughly the same — 25-26 cells. Dashed lines indicate overlap between panels. The legend is the same as that in Figure 2A–C.

Appendix 6

Model of risky cell differentiation

In the risky differentiation model, we assume that cell differentiation implies a risk of errors leading to defective cells (Aktipis et al., 2015). These cells act in their selfish interests, compromising the integrity of an organism. This leads to the organism death, very similar to outcomes of cancer in complex multicellular species.

The impact of the defective cell depends on which stage of life cycle it appears. A defective cell emerged during the first cell division will likely result in a non-viable organism. At the same time, a defective cell emerged in the very last round of cell divisions is unlikely to affect the organism because its life cycle is about to end. To reflect this effect, we scaled the impact of a newly emerged defective cell by the number of cells already present in an organism. This way, the probability to get cancer is proportional to the frequency of cell differentiation events. The proportions of soma-role cells and germ-role cells that differentiate upon division in the total number of cell divisions are

fsg=Nsgg+12NsgsZ,
fgs=Ngss+12NggsZ (11)

where Nxyz is the number of cell divisions at which cell of role x gives rise to a y cell and a z cell, and Z=2n-1 is the total number of cell divisions during the organism growth with maturity size 2n.

We define the probabilities of death caused by defective cells emerged in germ to soma and soma to germ transitions as 

dgs=tanh(δgsfgs),
dsg=tanh(δsgfsg) (12)

where δgs and δsg characterize the risk of cancer from a germ to soma and from a soma to germ transition. The transformation function tanh(x) is chosen to grow linearly at a small number of differentiation events but exponentially saturates to one if these events are numerous, see Fig. F.

We assume that an organism successfully completes its life cycle and produces offspring only if no cancer emerges in the course of its growth. The probability of this at each round of cell division is

Psuccess=(1-dgs)(1-dsg). (13)

Otherwise, the organism dies and does not produce any offspring. There are no delay differentiation costs in this model (csg=cgs=0).

A typical feature of the cancer cells in complex organisms is a high cell division rate. This has a large impact on organisms of complex animals, in which the division rate of regular cells is low and the life cycle are long. However, organisms in the focus of our study have very short life cycles (few rounds of cell divisions) and even the regular cells actively proliferate. Hence, the growth advantage of defective cells should have much smaller impact on simple species. Therefore, in this model, we neglect the difference in division rates between defective and regular cells and keep cell divisions synchronous.

The probability of getting cancer depends on the frequency of cell differentiation events. An organism with a higher cell differentiation rate has a higher death probability, which leads to slower population growth.

Appendix 6—figure 1. Probability of death from cancer as a function of the differentiation frequency.

Appendix 6—figure 1.

If the frequency of differentiation is small, the death probability grows linearly. If the frequency of differentiation is large, the death probability saturates at 1.

Appendix 7

Evolution of irreversible somatic differentiation in a model with asynchronous cell division

Our original model features synchronous cell division. This comes from the assumption that differentiation costs are paid collectively by the whole organism. Here, we consider another option, where differentiation costs are paid individually by each cell. An immediate consequence is that cell division in such a model is asynchronous because differentiating cells take more time to divide.

In the asynchronous model, we model cell division as a random process occurring with the reaction rates

rggg=g×ggg×(Fcomp×1)1,
rggs=g×ggs×(Fcomp×(1+cgs))1,
rgss=g×gss×(Fcomp×(1+2cgs))1,
rsgg=s×sgg×(Fcomp×(1+2csg))1,
rsgs=s×sgs×(Fcomp×(1+csg))1,
rsss=s×sss×(Fcomp×1)1, (14)

where s,g are the number of germ and soma cells in the organism, sxy,gxy are elements of the differentiation program D, Fcomp is the composition effect profile computed identically to the synchronous model, see Equation 3, and csg and cgs are differentiation costs.

We use the Gillespie algorithm to find which kind of cell division occurs next and how much time does it take. Then a chosen cell division occurs once (organism grows by a single cell). After that Fcomp value is updated to reflect the changed composition. Then the next cell division is sampled and the process continues until the organism reaches the maturity size. This model is designed to be the asynchronous implementation of our ideas, which remains close to our original model presented in the main text. Therefore, the rest of simulation protocol remains the same.

Computation time of the asynchronous model scales linearly with the number of cell divisions: it takes 1023 simulation steps to simulate the growth from 1 to 1024 cells. Therefore, it is computationally much more demanding than the synchronous model. The synchronous model scales linearly with the number of cell generations: the same growth to 1024 cells needs only 10 steps there. Maps similar to Figure 2A–C are unavailable with asynchronous model for computational reasons. Still, we calculated optimal differentiation strategies for a single combination of costs: cgs=0, csg=10. Under these conditions, which favour evolution of irreversible differentiation in the synchronous model, it is significantly suppressed in the asynchronous model, see Fig. G.

Appendix 7—figure 1. The model with asynchronous cell division suppresses evolution of irreversible somatic differentiation even at the most favourable conditions (cgs=0, csg=10).

Appendix 7—figure 1.

Panels show composition profiles Fcomp coloured according to the class of differentiation strategy: irreversible (green), reversible (orange), and no differentiation (black). In the synchronous model (top row), due to very asymmetric costs, irreversible differentiation is capable of evolving at small maturity sizes (eight cells). In the asynchronous model (bottom row), irreversible differentiation is not observed below 32 cells. Even at large sizes, the fraction of profiles promoting irreversible differentiation remains small.

The reason behind the difference between results for synchronous and asynchronous models is the different performance of reversible strategies in these models. If costs of soma differentiation are large enough, the expected period of cell division for a differentiating soma-role cell is longer than the length of life cycle. As a result, instead of re-differentiation, soma-role cells become effectively terminally differentiated. In such a situation, the growth of the organism is determined by propagation of germ-role cells and does not depend on the value of soma differentiation costs.

The key to success of reversible strategies in the synchronous model was an ability to develop large fractions of soma-role cells early on and to keep this fraction in the course of life cycle. There, the fraction of soma-role cells is preserved by a dynamic equilibrium between differentiation in both directions, see Appendix 2. In the asynchronous model with high soma differentiation costs, soma-role cells do not divide and such a dynamic equilibrium does not exist. The fraction of soma-role cells is maintained differently here. If we denote the number of germ-role and soma-role cells at time j (unlike Appendix 2, it is a continuous parameter here) as g(j) and s(j), respectively, then in the case of non-dividing soma-role cells (cgs=0csg1sss=0), the dynamics of the organism is given by

ddjg(j)=2(1mg)g(j),
ddjs(j)=2mgg(j), (15)

The solution of this system of equations with initial condition of one germ-role and no soma-role cells is

g(j)=e2(1mg)j,
s(j)=mg1mg(e2(1mg)j1) (16)

Hence, the fraction rs(j) of soma-role cells is

rs(j)=mg(1e2(1mg)j)1mge2(1mg)jjmg. (17)

The differentiation strategy considered above (sss=0) is an extreme case where a dynamic equilibrium between cell differentiations is not possible. Still, Equation 17 demonstrates that a balance between germ-role and soma-role cells is still achieved here. Therefore, in the asynchronous model with highly asymmetric differentiation costs, the reversible strategies keep all components that make them successful in the no costs scenario: the early production of soma-role cells due to high differentiation rates, the necessary fraction of soma-role cells during the life cycle (Equation 17), and the overall fast growth of the whole organism, despite having non-dividing soma-role cells (Equation 16).

Note that in irreversible strategies, soma-role cells do not differentiate and therefore divide at a normal rate. Therefore, the characteristic trade-off of irreversible strategies between having more soma-role cells early and more germ-role cells later in life cycle remains in place even in the asynchronous model. As a result, in this model, reversible strategies are not punished by asymmetric costs and outcompete irreversible ones.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Yuriy Pichugin, Email: pichugin@evolbio.mpg.de.

Aleksandra M Walczak, École Normale Supérieure, France.

Aleksandra M Walczak, École Normale Supérieure, France.

Funding Information

This paper was supported by the following grants:

  • Ministry of Science and ICT, South Korea 2020R1A2C1101894 to Hye Jin Park.

  • JRG Program to Hye Jin Park.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Software, Formal analysis, Investigation, Visualization.

Conceptualization, Software, Methodology.

Conceptualization, Supervision.

Conceptualization, Methodology, Project administration.

Additional files

Transparent reporting form

Data availability

The code implementing our model is deposited at https://github.com/YuanxiaoGao/Evolution-of-irreversible-somatic-differentiation (copy archived at https://archive.softwareheritage.org/swh:1:rev:9a1ea7c84f3041ebe3720e7837b28182912b5e00).

References

  1. Aktipis CA, Boddy AM, Jansen G, Hibner U, Hochberg ME, Maley CC, Wilkinson GS. Cancer across the tree of life: cooperation and cheating in multicellularity. Philosophical Transactions of the Royal Society B: Biological Sciences. 2015;370:20140219. doi: 10.1098/rstb.2014.0219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Amado A, Batista C, Campos PRA. A mechanistic model for the evolution of multicellularity. Physica A: Statistical Mechanics and Its Applications. 2018;492:1543–1554. doi: 10.1016/j.physa.2017.11.080. [DOI] [Google Scholar]
  3. Bakthavatsalam D, Gomer RH. The secreted proteome profile of developing Dictyostelium discoideum cells. Proteomics. 2010;10:2556–2559. doi: 10.1002/pmic.200900516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Carroll SB. Chance and necessity: the evolution of morphological complexity and diversity. Nature. 2001;409:1102–1109. doi: 10.1038/35059227. [DOI] [PubMed] [Google Scholar]
  5. Cooper GA, West SA. Division of labour and the evolution of extreme specialization. Nature Ecology & Evolution. 2018;2:1161–1167. doi: 10.1038/s41559-018-0564-9. [DOI] [PubMed] [Google Scholar]
  6. Czarna M, Mathy G, Mac'Cord A, Dobson R, Jarmuszkiewicz W, Sluse-Goffart CM, Leprince P, De Pauw E, Sluse FE. Dynamics of the Dictyostelium discoideum mitochondrial proteome during vegetative growth, starvation and early stages of development. Proteomics. 2010;10:6–22. doi: 10.1002/pmic.200900352. [DOI] [PubMed] [Google Scholar]
  7. DuBuc TQ, Schnitzler CE, Chrysostomou E, McMahon ET, Febrimarsa, Gahan JM, Buggie T, Gornik SG, Hanley S, Barreira SN, Gonzalez P, Baxevanis AD, Frank U. Transcription factor AP2 controls cnidarian germ cell induction. Science. 2020;367:757–762. doi: 10.1126/science.aay6782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Erten EY, Kokko H. From zygote to a multicellular soma: Body size affects optimal growth strategies under cancer risk. Evolutionary Applications. 2020;13:1593–1604. doi: 10.1111/eva.12969. [DOI] [Google Scholar]
  9. Fisher RM, Bell T, West SA. Multicellular group formation in response to predators in the alga Chlorella vulgaris. Journal of Evolutionary Biology. 2016;29:551–559. doi: 10.1111/jeb.12804. [DOI] [PubMed] [Google Scholar]
  10. Gallon JR. Reconciling the incompatible: N2 fixation and O2. The New Phytologist. 1992;129:571–609. doi: 10.1111/J.1469-8137.1992.TB00087.X. [DOI] [Google Scholar]
  11. Gao Y, Traulsen A, Pichugin Y. Interacting cells driving the evolution of multicellular life cycles. PLOS Computational Biology. 2019;15:e1006987. doi: 10.1371/journal.pcbi.1006987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Gavrilets S. Rapid transition towards the division of labor via evolution of developmental plasticity. PLOS Computational Biology. 2010;6:e1000805. doi: 10.1371/journal.pcbi.1000805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Goldsby HJ, Dornhaus A, Kerr B, Ofria C. Task-switching costs promote the evolution of division of labor and shifts in individuality. PNAS. 2012;109:13686–13691. doi: 10.1073/pnas.1202233109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Goldsby HJ, Knoester DB, Kerr B, Ofria C. The effect of conflicting pressures on the evolution of division of labor. PLOS ONE. 2014;9:e102713. doi: 10.1371/journal.pone.0102713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Grosberg RK, Strathmann RR. One cell, two cell, red cell, blue cell: the persistence of a unicellular stage in multicellular life histories. Trends in Ecology & Evolution. 1998;13:112–116. doi: 10.1016/S0169-5347(97)01313-X. [DOI] [PubMed] [Google Scholar]
  16. Grosberg RK, Strathmann RR. The Evolution of Multicellularity: A Minor Major Transition? Annual Review of Ecology, Evolution, and Systematics. 2007;38:621–654. doi: 10.1146/annurev.ecolsys.36.102403.114735. [DOI] [Google Scholar]
  17. Hallmann A. Evolution of reproductive development in the volvocine algae. Sexual Plant Reproduction. 2011;24:97–112. doi: 10.1007/s00497-010-0158-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hiltunen T, Becks L. Consumer co-evolution as an important component of the eco-evolutionary feedback. Nature Communications. 2014;5:5226. doi: 10.1038/ncomms6226. [DOI] [PubMed] [Google Scholar]
  19. Ispolatov I, Ackermann M, Doebeli M. Division of labour and the evolution of multicellularity. Proceedings of the Royal Society B: Biological Sciences. 2012;279:1768–1776. doi: 10.1098/rspb.2011.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kikuchi K. Cellular differentiation in Pleodorina californica. Cytologia. 1978;43:153–160. doi: 10.1508/cytologia.43.153. [DOI] [Google Scholar]
  21. Kirk DL. A twelve-step program for evolving multicellularity and a division of labor. BioEssays. 2005;27:299–310. doi: 10.1002/bies.20197. [DOI] [PubMed] [Google Scholar]
  22. Koufopanou V. The evolution of soma in the volvocales. The American Naturalist. 1994;143:907–931. doi: 10.1086/285639. [DOI] [Google Scholar]
  23. Lanfear R. Do plants have a segregated germline? PLOS Biology. 2018;16:e2005439. doi: 10.1371/journal.pbio.2005439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Marchant HJ. Colony formation and inversion in the green alga Eudorina elegans. Protoplasma. 1977;93:325–339. doi: 10.1007/BF01275663. [DOI] [Google Scholar]
  25. Matt G, Umen J. Volvox: a simple algal model for embryogenesis, morphogenesis and cellular differentiation. Developmental Biology. 2016;419:99–113. doi: 10.1016/j.ydbio.2016.07.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Matz C, Kjelleberg S. Off the hook--how Bacteria survive protozoan grazing. Trends in Microbiology. 2005;13:302–307. doi: 10.1016/j.tim.2005.05.009. [DOI] [PubMed] [Google Scholar]
  27. Michod RE. The group covariance effect and fitness trade-offs during evolutionary transitions in individuality. PNAS. 2006;103:9113–9117. doi: 10.1073/pnas.0601080103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Michod RE. Evolution of individuality during the transition from unicellular to multicellular life. PNAS. 2007;104 Suppl 1:8613–8618. doi: 10.1073/pnas.0701489104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Mikhailov KV, Konstantinova AV, Nikitin MA, Troshin PV, Rusin LY, Lyubetsky VA, Panchin YV, Mylnikov AP, Moroz LL, Kumar S, Aleoshin VV. The origin of metazoa: a transition from temporal to spatial cell differentiation. BioEssays. 2009;31:758–768. doi: 10.1002/bies.200800214. [DOI] [PubMed] [Google Scholar]
  30. Morand-Ferron J, Quinn JL. Larger groups of passerines are more efficient problem solvers in the wild. PNAS. 2011;108:15898–15903. doi: 10.1073/pnas.1111560108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ow SY, Cardona T, Taton A, Magnuson A, Lindblad P, Stensjö K, Wright PC. Quantitative shotgun proteomics of enriched heterocysts from Nostoc sp. PCC 7120 using 8-plex isobaric peptide tags. Journal of Proteome Research. 2008;7:1615–1628. doi: 10.1021/pr700604v. [DOI] [PubMed] [Google Scholar]
  32. Pichugin Y, Peña J, Rainey PB, Traulsen A. Fragmentation modes and the evolution of life cycles. PLOS Computational Biology. 2017;13:e1005860. doi: 10.1371/journal.pcbi.1005860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Ratcliff WC, Denison RF, Borrello M, Travisano M. Experimental evolution of multicellularity. PNAS. 2012;109:1595–1600. doi: 10.1073/pnas.1115323109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rodrigues JFM, Rankin DJ, Rossetti V, Wagner A, Bagheri HC. Differences in cell division rates drive the evolution of terminal differentiation in microbes. PLOS Computational Biology. 2012;8:e1002468. doi: 10.1371/journal.pcbi.1002468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Rossetti V, Schirrmeister BE, Bernasconi MV, Bagheri HC. The evolutionary path to terminal differentiation and division of labor in cyanobacteria. Journal of Theoretical Biology. 2010;262:23–34. doi: 10.1016/j.jtbi.2009.09.009. [DOI] [PubMed] [Google Scholar]
  36. Rueffler C, Hermisson J, Wagner GP. Evolution of functional specialization and division of labor. PNAS. 2012;109:E326–E335. doi: 10.1073/pnas.1110521109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Sambamurty AVSS . A Textbook of Algae. IK International Pvt Limited; 2005. [Google Scholar]
  38. Sandh G, Ramström M, Stensjö K. Analysis of the early heterocyst Cys-proteome in the multicellular Cyanobacterium Nostoc punctiforme reveals novel insights into the division of labor within diazotrophic filaments. BMC Genomics. 2014;15:1064. doi: 10.1186/1471-2164-15-1064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Shelton DE, Desnitskiy AG, Michod RE. Distributions of reproductive and somatic cell numbers in diverse Volvox (Chlorophyta) species. Evolutionary Ecology Research. 2012;14:707. [PMC free article] [PubMed] [Google Scholar]
  40. Solari CA, Kessler JO, Goldstein RE. A general allometric and life-history model for cellular differentiation in the transition to multicellularity. The American Naturalist. 2013;181:369–380. doi: 10.1086/669151. [DOI] [PubMed] [Google Scholar]
  41. Tverskoi D, Makarenkov V, Aleskerov F. Modeling functional specialization of a cell colony under different fecundity and viability rates and resource constraint. PLOS ONE. 2018;13:e0201446. doi: 10.1371/journal.pone.0201446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Waters JS, Holbrook CT, Fewell JH, Harrison JF. Allometric scaling of metabolism, growth, and activity in whole colonies of the seed-harvester ant Pogonomyrmex californicus. The American Naturalist. 2010;176:501–510. doi: 10.1086/656266. [DOI] [PubMed] [Google Scholar]
  43. Willensdorfer M. On the evolution of differentiated multicellularity. Evolution. 2009;63:306–323. doi: 10.1111/j.1558-5646.2008.00541.x. [DOI] [PubMed] [Google Scholar]
  44. Wynne MJ, Bold HC. Introduction to the Algae: Structure and Reproduction. Prentice-Hall, Incorporated; 1985. [Google Scholar]
  45. Yanni D, Jacobeen S, Márquez-Zacarías P, Weitz JS, Ratcliff WC, Yunker PJ. Topological constraints in early multicellularity favor reproductive division of labor. eLife. 2020;9:e54348. doi: 10.7554/eLife.54348. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Aleksandra M Walczak1
Reviewed by: E Yagmur Erten2, Guy Cooper3, George Constable

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Acceptance summary:

The paper proposes a model, which studies an often-neglected aspect of cellular differentiation and division of labour. While the model is relatively simple, the premise and the findings are thought-provoking and this study can potentially provide the groundwork for further investigation.

Decision letter after peer review:

Thank you for submitting your article "Evolution of irreversible somatic differentiation" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Aleksandra Walczak as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: E. Yagmur Erten (Reviewer #1); Guy Cooper (Reviewer #2); George Constable (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

All reviewers found value in your work, stressed the simplicity and elegance of your model, and appreciated the insights and intuitions it provides. This elegance, however, comes at a cost. Your model relies on a key assumption, namely that cell divisions within an individual are synchronous, and that there does not seem to be within-host fitness differences between the different cell types. Reviewers questioned the biological relevance of the assumption, and we think it would be profitable to investigate further its impact on the model's results.

The discussion among reviewers also highlighted that the presentation of the model lacks details, and would need to be more precise and formalized. Reviewer #3, in particular, provides specific suggestions for the formalization of the model and leads for its analysis. I would like to encourage you to try and analyse the model, and at least to provide in the paper all elements necessary to fully understand it (in the form of equations, but also links to code). Notation should also be clarified, especially for the difference between time and cell generations, and add information about time in the notation (especially in the definition of c). The results are appealing and make intuitive sense, but careful readers need to be able to fully understand the model, and this is not the case with the current presentation of the manuscript. As the model is clarified, new questions may arise, which is why I am not making a recommendation for acceptance at this stage.

I encourage the authors to address all the comments made by the reviewers.

Reviewer #1 (Recommendations for the authors):

1. Lines 64-67: In what way the current study (model, assumptions etc.) differs from Cooper and West (2018), such that irreversible somatic differentiation is observed in this study but not in Cooper and West (2018)?

2. Lines 80-81: It is unclear at this point through what mechanism somatic cells accelerate growth. Do the organisms grow faster because somatic cells themselves divide at a faster rate, so having more of them means shorter development time? Or do the somatic cells contribute to overall resources available to all cells and every cell (including germ-role ones) divides faster? It becomes clearer later on and I think in their particular model it would not make a difference. But it would help to at least indicate that more explanation will come later.

3. Lines 125-128: The authors use a functional form (Equation 2) to determine soma cells' contribution to the growth rate. As their results depend on the shape of this function, I am wondering if there are empirical studies that support one type of form or the other. For instance, under what conditions would soma cells work better alone (Line 128)? In other words, which of these functional forms we are more likely to encounter in nature? This is later discussed to some extent, but references to the relevant literature (e.g. other models) could be useful in the Methods section as well, if a reader wanted to check other related approaches.

4. The authors refer to Appendix 3 for the first time at line 177, whereas while reading the results up to this point, I kept wondering what the fractions of the other strategies (RSD and NSD) were. In case adding the figures for RSD and NSD to the main text distracts from the main message, I think at least mentioning that they are at Appendix 3 much earlier in the Results section would help the readers.

5. Line 565: Here the authors say that large b favours ISD and a very large one promotes RSD, whereas in the main text they say "neither extremely large, nor extremely small" b favours ISD (Lines 208-209), which I found somewhat inconsistent.

6. It is not clear to me why the evolution of irreversible somatic differentiation requires a large enough organismal size. Also, in the main text, the authors do not mention what instead evolves in smaller organisms (RSD or NSD? This is later found in Appendix 3, but is not referred to or discussed in the main text). The authors later link their results about body size to some empirical examples in the Discussion section, but again, they do not discuss what might underlie these empirical observations or their findings about body size.

7. The second paragraph of the Discussion seems out-of-place as it is. I also cannot follow the logic; why do these cell numbers indicate organismal synchronicity? And what about cell death?

Reviewer #2 (Recommendations for the authors):

I like the model, it is simple and easy to interpret, providing predictions that make sense. However, it is not as general a model as the discussion implies in some cases. The predictions of the model are likely to depend on modelling assumptions that may be unrealistic in different systems, including the examples often cited in the paper.

My biggest request is that I would like more of a discussion of the limits that arise due to the these assumptions. In particular, to what extent are the predictions contingent on the fact that soma provide benefits continuously as the group grows? This is not the case for many of the systems cited in the work, such as in the Volvocine algae and in fruiting body formations such as in Dictyostelium. Furthermore, one could also imagine that differentiation probabilities are density dependent, or that germ cell fecundity depends on the number of soma cells in the last generation. I suspect that predictions 2 and 3 would not necessarily hold in these scenarios, which could explain for instance why many Volvocine species have a very large number of somatic cells. Acknowledging and discussing exactly how the predictions hinge on these assumptions would make the analysis much stronger.

Secondly, I think some definitions could be clearer in the introduction. For instance, if soma do not replicate at all, does it even make sense to speak of irreversible soma vs reversible soma? Many of the models cited have sterile soma that do not replicate (most Michod models, and Cooper and West model at least). Similarly, what if separation between germ and soma only occurs in one-generation of the group life cycle? What does the distinction between irreversible vs reversible soma mean in this case? Is irreversible soma just the same as soma sterility? How does all of this compare to the germline sequestration question, which readers may be more familiar with? These distinctions could be much clearer, which would help to set up the key question of the paper and make its scope more obvious.

Finally, I think some aspects of the presentation of the results could be improved. I found Figure 2A in isolation difficult to fully interpret. There are three outcomes in this model, ISD, RSD, and NSD, and the frequency of each outcome is only shown in Appendix 3. I would suggest including the frequency of the two other strategies in the main text. The same applies to Figure 4. You can't infer from just looking at the frequency of ISD alone to what extent the patterns are driven by irreversible soma being favoured over reversible soma vs no soma being favoured at all.

Reviewer #3 (Recommendations for the authors):

I very much enjoyed this paper, and only have a few suggestions with respect to the model.

I think potential conceptual limitations of the model lie in the assumptions of synchronous cell division and constant development strategy.

It may be possible to address the first of these issues (and thus the initial concern of Dr. Walczak) with some illustrative supplementary simulations (e.g. preliminary results to demonstrate the extent to which maturation time is affected by such asynchronicity). These might even take the form of some simple continuous time ODE models.

However the second of these issues would be a highly difficult task, and lies well outside the scope of the current paper. While exploring this question might certainly serve as a nice extension to the current work, I would not expect the authors to tackle this in the current context, where it would merely muddle the story presented.

Finally, while I like the model in general, there are some points of clarification I think could be made. Although I feel I have understood the core elements, there are some points of ambiguity where it is possible that I may be mistaken, and ironing out these potential misconceptions in the appendices would be beneficial for readers.

As I understand it:

The fraction of soma and germ cells in an organism are given by g(t) and s(t) in Equation 7, with s(t)=x in the main text (see Equation 2 – this should be made consistent?).

Note that 't' here refers to the generation t=1,2,…,n

These dynamics are independent of the costs of differentiation, cg and cs.

However, the division time for cells in the organism during growth is dependent on these costs (see 't' in Equation 1 – note that 't' here is the continuous doubling time, which has an inconsistent notation with Equations 4-8).

Writing Tgen(t) for the doubling time at generation t, we have

Tgen(t)=Fdiff * Fcomp

= (1+<c> )*( 1 – b + b ( ( x1 – s(t) )/( x1 – x0 ) ))

(when x0<s(t)<x1 – for simplicity I won't write out the other conditions)

At this point I'm not 100% sure how <c> is defined. I'd assume the following:

<c> = 1+s(t)*cs*(sgs + 2 sgg)+g(t)*cg*(ggs + 2 gss)

(Is this correct?)

At this point I have Tgen(t) as a function of t (having substituted for g(t) and s(t) from Equation 7). This allows me to write the maturation time (time to reach a size 2n) as

Tmat = \sumt=1n Tgen(t)

Finally, the fitness of an organism with a particular developmental strategy is given by the rate of gamete production (i.e. the number of gametes at maturity divided by the time taken to reach maturity)

W = g(n) / Tmat

Working out the evolutionary optimal strategy is then a matter of maximising W with respect to sgs, sss, ggs and gss.

Is this all correct?

If so, it may be possible to make analytical progress on this problem by replacing the discontinuous function in Equation 1 with a continuous approximation, e.g.

1 – (1 – b) x\β / ( x\β + (1 – x)\β )

I mainly mention this latter point as a potential area of future investigation. However with respect to the model details, I would recommend the authors clarify the points above in one of the appendices.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for submitting your article "Evolution of irreversible somatic differentiation" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by Aleksandra Walczak as the Senior and Reviewing Editor. The following individuals involved in review of your submission have agreed to reveal their identity: E. Yagmur Erten (Reviewer #1); Guy Cooper (Reviewer #2); George Constable (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential Revisions:

All reviewers note that the paper has greatly improved. However they still would like to see some small changes made to improve clarity (no new research is required). Here is a summary but please go through the reviews attached below for concrete points:

– Clarify the details of the new models, as suggested by reviewer 1;

– Define soma and germ cells clearly, as noted by reviewer 2;

– Provide more discussion on the assumption of fixed strategies, as noted by reviewer 2;

– Add clarifications as suggested by reviewer 3.

Reviewer #1 (Recommendations for the authors):

I thank the authors for their responses and clarifications, as well as for extending their model to include risky differentiation and asynchronous cell division. I very much enjoyed reading the revised version of their manuscript, but I have the following questions about the new models they included.

Risky cell differentiation (Appendix 6): I don't exactly follow why the authors multiply the frequency and not the number of cell differentiations with per differentiation death risk. Maybe I am misunderstanding something, but isn't this implicitly assuming that deadly differentiation errors, if they happen in larger organisms, will have less probability to have an impact than in smaller organisms? Or in other words, the effect of deadly differentiation will be larger in smaller organisms? If true, this assumption might still be realistic, e.g. one deadly cell within 32 cells compared to 1024 can plausibly have a larger organismal level effect. But one can also argue that if one cell becomes a mutant and acquires a growth advantage, the overall size of the organism might not matter, especially e.g. if the mutant occurs early in the development or has a very high cell proliferation/resource uptake rate. Although this might not change the results in the main text qualitatively, as there the authors use one maturity size (210) in their calculations.

Cell division asynchrony (Appendix 7): seeing the results of the cell division asynchrony, it seems like synchrony is almost a necessary condition for the evolution of irreversible differentiation in their model, just like those conditions summarized in Lines 335-339 albeit perhaps not as strict as them (since, although rarely, irreversible strategies still evolve). Perhaps it should be acknowledged as such earlier and more explicitly, rather than at the end of Discussion? Particularly given the fact that the authors looked at one of the most favourable conditions for irreversible strategies (c (s-> g) >> c(g->s)) and found that the evolution of irreversible differentiation is very rare.

Reviewer #2 (Recommendations for the authors):

I think this model and analyses are very good and so I won't comment too much on these (with the exception of a thought on the asynchrony model). I think a few things need to be clarified but I am otherwise happy to endorse the paper for publication.

My main comments will concern the introduction and framing of this study as I think that there are still things that need to be made clear, which will really help the reader right from the start.

I think some very clear definitions of different terms are needed and as early as possible in the introduction. From what I can gather across different sections of the introduction: the soma cells are those that contribute to vegetative functions (sustaining the overall organism) but cannot act as a seed/propagule/spore for founding of a new organism. In contrast, germ cells are those that do not contribute to vegetative functions but can act as such a spore. The authors distinguish between terminally differentiated soma that do not divide such as in cyanobacteria and non-terminally differentiated soma that can divide such as in the Volvocales. They then ask the question in what conditions do the latter kind of soma (non-terminally differentiated) become irreversibly differentiated (that is when they can only divide and produce more soma). If the above is correct, I would suggest making these definitions and distinctions clearer and more localised rather than having bits of each definition spread across the introduction in a way that needs piecing together (perhaps a glossary type table could also help?)

If the above is correct, then the definitions need to be applied more consistently throughout the manuscript. For instance, the "somatic" cells that exist during the growth of the "higher" Volvocales do not qualify as somatic cells per the author's definition as they do not contribute to vegetative functions. In this case only the last generation of flagella beaters are "soma", none of which divide and so the distinction between reversibly and irreversibly differentiated does not apply here. The authors have added a paragraph about this in the discussion but they lean on the Volvocales so much in the introduction and discussion of their work that this mismatch needs to be flagged much sooner in the paper.

A similar issue applies to the discussion of Cooper and West 2018. Much as I would like to pretend that this paper could potentially cover all possibilities, group growth is not explicitly modelled here and so the distinction between reversible and irreversibly differentiated soma does not apply here (one can imagine that in this model there are no cooperative interactions as the group grows and that division of labour then may occur but only in the last generation of the group life cycle before spore dispersal much like for the Volvocales). If non-sterile helpers count as soma then this might be a different issue as sterile cells may be considered irreversible soma and non-sterile helpers as reversible soma, but then these non-sterile helpers can "seed" the next generation so I don't think they really qualify as soma per the author's definition? Having clearer definitions will help resolve these confusions.

Otherwise, I feel that the authors have sidestepped the potential impact of non-static traits in their model by saying in their response to reviewers that they have plans for a future paper on this. That is great and I am very much looking forward to what they find but this issue still needs to be discussed in this paper as many of their results here could be explained as arising directly from the assumption of static strategies as the group grows. I would suggest mentioning this at least once or twice as they go through the results (around lines 238-242 would be good) and then a whole paragraph on this in the discussion is warranted (can also mention plans for future work on this here). For instance, the need for just a few somatic cells that provide large benefits seems to arise directly from the fact that germ cells can't modulate the number of soma cells they spawn once these become too numerous, or that they can't have a time based strategy that produces many soma earlier but fewer later as the group grows.

I think the results they have found in the asynchronous model is really good but needs more explanation/discussion of why ISD can't seem to work here. They have modelled a time to replication cost as arising from the different differentiation costs. I find it strange in that case that RSD is not the worst affected strategy as the authors have established that this is the strategy with the most differentiation. I otherwise would have thought that having soma cells that divert their energy to vegetative functions as the slower replicator might have been a natural way to introduce asynchrony.

Finally, I think a word of caution on the discussion of "convex" shapes and how this favours division of labour/terminal differentiation/irreversible differentiation (lines 323-332). In several of the models cited (if not all), the convexity at issue is the relationship between an individual's investment in a public good/vegetative function and the fitness return to the group. In the authors' paper, the convexity is between the number/proportion of "helpers/soma" in the group and the fitness return to the group. These are very different things (one has to do with synergy from internal efficiencies whereas as the other comes from synergies from between individual interactions) and so should not be treated as the same prediction.

Reviewer #3 (Recommendations for the authors):

The authors have made substantial changes to the manuscript that appear to have addressed many of the concerns of the reviewers.

In their response, the reviewers clarified some details of the model, and I now feel I have a better understanding as to how it works. However that has led to another couple of small suggestions on my part that I believe would help readers.

In my original review I stated:

"Note that 't' here refers to the generation t=1,2,…,n

However, the division time for cells in the organism during growth is dependent on these costs (see 't' in Equation 1 – note that 't' here is the continuous doubling time, which has an inconsistent notation with Equations 4-8)"

I now see that under the costless differentiation assumed in Appendix 2 , t becomes an integer which helps simplify the subsequent analysis. It's worthwhile to make a note of this fact (before the sentence "Then, the expected fractions.…" would be an obvious potential place to mention this).

The authors response also makes clear at multiple points that cell divisions are stochastic:

"Since the differentiation program is stochastic, the costs of differentiation depend on the actual number of differentiation events happened in the course of growth, rather than probabilities like ggs.",

"Since the differentiation strategy is stochastic, the time to reach maturity (Tmat) and the number of offspring at the last stage (g(n)) are random values, which we sample by repeatedly simulating the process of growth."

"However, since the outcomes of cell divisions are stochastic, the sampling of developmental trajectories has to reflect that and in our case it is done numerically."

I understand this. My comments, which I may not have articulated clearly in my initial review, were more aimed at asking how much understanding could be gained from alternatively taking a mean field approach. Indeed, this is precisely the approach the authors themselves take in Appendix 2, where they "consider the mathematical expectation of the composition". This leads to the obvious question – why can't a similar approach be used when differentiation is not costless?

Of course, I completely understand that stochasticity could be very important in a model such as this (where initial cell numbers are low), and it may be that such a mean-field approach leads to misleading results with respect to the prediction of the mean population growth rate. If this is the case, I think the authors should make a statement of this fact somewhere, perhaps with a reference to results in Gao et al., 2019 with respect to the differences between mean field and stochastic predictions.

Otherwise the authors have done a good job of clarifying my questions and addressing my concerns.

eLife. 2021 Oct 13;10:e66711. doi: 10.7554/eLife.66711.sa2

Author response


Essential revisions:

All reviewers found value in your work, stressed the simplicity and elegance of your model, and appreciated the insights and intuitions it provides. This elegance, however, comes at a cost. Your model relies on a key assumption, namely that cell divisions within an individual are synchronous, and that there does not seem to be within-host fitness differences between the different cell types. Reviewers questioned the biological relevance of the assumption, and we think it would be profitable to investigate further its impact on the model's results.

The discussion among reviewers also highlighted that the presentation of the model lacks details, and would need to be more precise and formalized. Reviewer #3, in particular, provides specific suggestions for the formalization of the model and leads for its analysis. I would like to encourage you to try and analyse the model, and at least to provide in the paper all elements necessary to fully understand it (in the form of equations, but also links to code). Notation should also be clarified, especially for the difference between time and cell generations, and add information about time in the notation (especially in the definition of c). The results are appealing and make intuitive sense, but careful readers need to be able to fully understand the model, and this is not the case with the current presentation of the manuscript. As the model is clarified, new questions may arise, which is why I am not making a recommendation for acceptance at this stage.

I encourage the authors to address all the comments made by the reviewers.

Thank you and the reviewers for so well thought and constructive reviews. In the revised version of the manuscript, we additionally test our findings with a model featuring the risky differentiation (inspired by cancer) and an asynchronous cell division. We also clarified the model presentation. Later, next to the reviewers’ comments, we describe the specific changes in details.

Reviewer #1 (Recommendations for the authors):

1. Lines 64-67: In what way the current study (model, assumptions etc.) differs from Cooper and West (2018), such that irreversible somatic differentiation is observed in this study but not in Cooper and West (2018)?

We discuss this in the updated manuscript. In principle, the ingredients we consider to be necessary for irreversible somatic differentiation are also included in that study, but the model setup is very different. For example, the sterile cells in Cooper and West do not divide further.

2. Lines 80-81: It is unclear at this point through what mechanism somatic cells accelerate growth. Do the organisms grow faster because somatic cells themselves divide at a faster rate, so having more of them means shorter development time? Or do the somatic cells contribute to overall resources available to all cells and every cell (including germ-role ones) divides faster? It becomes clearer later on and I think in their particular model it would not make a difference. But it would help to at least indicate that more explanation will come later.

Thank you for highlighting this issue and others below. We reworked the presentation of the model to make it clear. Regarding the specific question here, having soma-role cells allows for higher resource uptake by the organism, so having more soma-role cells generally results in faster developmental time. Since our main model is based on synchronous division, we do not allow that only somatic cells would divide faster.

3. Lines 125-128: The authors use a functional form (Equation 2) to determine soma cells' contribution to the growth rate. As their results depend on the shape of this function, I am wondering if there are empirical studies that support one type of form or the other. For instance, under what conditions would soma cells work better alone (Line 128)? In other words, which of these functional forms we are more likely to encounter in nature? This is later discussed to some extent, but references to the relevant literature (e.g. other models) could be useful in the Methods section as well, if a reader wanted to check other related approaches.

The choice of the functional form is motivated by its flexibility – with a right combination of parameters, a diverse scenarios can be covered: linear, concave, convex, a step threshold and so on. For instance, the prominent theoretical finding is that germ-soma differentiation arises when the tradeoff between viability and fertility is convex. Our functional form covers this set of models as a special case with x0 = 0 and x1=1. In the revised manuscript, we also discuss equivalent expressions used by other models.

Our statement of “soma cells working better alone” implied that there is no synergy between soma-role cells and increasing in the number of soma-role cells brings diminishing benefits. We now see that this wording causes confusion and rewrote that sentence.

4. The authors refer to Appendix 3 for the first time at line 177, whereas while reading the results up to this point, I kept wondering what the fractions of the other strategies (RSD and NSD) were. In case adding the figures for RSD and NSD to the main text distracts from the main message, I think at least mentioning that they are at Appendix 3 much earlier in the Results section would help the readers.

We also find figures for RSD and NSD very insightful but decided to put them into appendix to keep the focus of the paper on ISD. With great pleasure, we bring Figure 1 from appendix 3 to the main text.

5. Line 565: Here the authors say that large b favours ISD and a very large one promotes RSD, whereas in the main text they say "neither extremely large, nor extremely small" b favours ISD (Lines 208-209), which I found somewhat inconsistent.

Thank you for highlighting this. All fixed.

6. It is not clear to me why the evolution of irreversible somatic differentiation requires a large enough organismal size. Also, in the main text, the authors do not mention what instead evolves in smaller organisms (RSD or NSD? This is later found in Appendix 3, but is not referred to or discussed in the main text). The authors later link their results about body size to some empirical examples in the Discussion section, but again, they do not discuss what might underlie these empirical observations or their findings about body size.

Thank you for pointing on the lack of the discussion on this topic. In the revised manuscript, we discuss what causes the limitation of the organism size. In a nutshell, it is related to the monotonic increase in the fraction of soma cells in irreversible differentiation strategies and the decreased differentiation costs as the organism size increases.

7. The second paragraph of the Discussion seems out-of-place as it is. I also cannot follow the logic; why do these cell numbers indicate organismal synchronicity? And what about cell death?

There, we list two organisms, one with 32 and one with 128 cells. Both numbers are powers of two. If the cell divisions are synchronized at the level of organism, then at any moment, the number of cells should be power of two (cell death is not reported for these species). By contrast, in a bacterial colony founded by a single cell, cell divisions eventually desynchronize – at 32 cells this effect is already notable and by 128 cells it is apparent. Thus, we may suggest that some organism-level events synchronize cell division in the listed Volvocales. We polished this paragraph to be more clear and better integrated into the discussion.

Reviewer #2 (Recommendations for the authors):

I like the model, it is simple and easy to interpret, providing predictions that make sense. However, it is not as general a model as the discussion implies in some cases. The predictions of the model are likely to depend on modelling assumptions that may be unrealistic in different systems, including the examples often cited in the paper.

My biggest request is that I would like more of a discussion of the limits that arise due to the these assumptions. In particular, to what extent are the predictions contingent on the fact that soma provide benefits continuously as the group grows? This is not the case for many of the systems cited in the work, such as in the Volvocine algae and in fruiting body formations such as in Dictyostelium. Furthermore, one could also imagine that differentiation probabilities are density dependent, or that germ cell fecundity depends on the number of soma cells in the last generation. I suspect that predictions 2 and 3 would not necessarily hold in these scenarios, which could explain for instance why many Volvocine species have a very large number of somatic cells. Acknowledging and discussing exactly how the predictions hinge on these assumptions would make the analysis much stronger.

Thank you for raising this important topic. In the revised manuscript, we extend the discussion of our assumptions. We agree that our model is simplified comparing with the processes occurring in nature.

We assume that the soma-role cells provide benefits to the group growth during the whole ontogenesis. This may not accurately represent more complex species like Volvox, where daughter colonies grow within the maternal organism. Taking this into account can relax our last finding: “soma-role cells should bring benefits even in small numbers”, as only the final composition of the organism will matter in this case. However, a life cycle, where juvenile organisms are protected and nurtured by maternal body, represents quite an advanced degree of evolved complexity, which may not be directly relevant to the earliest stage of the evolution of multicellularity.

Secondly, I think some definitions could be clearer in the introduction. For instance, if soma do not replicate at all, does it even make sense to speak of irreversible soma vs reversible soma? Many of the models cited have sterile soma that do not replicate (most Michod models, and Cooper and West model at least). Similarly, what if separation between germ and soma only occurs in one-generation of the group life cycle? What does the distinction between irreversible vs reversible soma mean in this case? Is irreversible soma just the same as soma sterility? How does all of this compare to the germline sequestration question, which readers may be more familiar with? These distinctions could be much clearer, which would help to set up the key question of the paper and make its scope more obvious.

Thank you for highlighting this overlook. Indeed, the range of cell specializations that are called “soma cells” is very wide and we are looking in a particular type of these. By soma-role cells, we consider cells, which divide in the course of organism growth (unlike sterile/terminally differentiated cells), but do not contribute to the organism reproduction (unlike temporal division of labor).

We also do not consider the sequestration of the germ line and one-off separation between germ and soma, as these require considering a dynamical differentiation program, in which probabilities of differentiation change with time. We even considered such a model at the conceptualization stage but found an efficient but degenerate differentiation program – turn all cells into soma-role at the beginning of the life cycle and keep them until the end, where all cells turn to germ-role. This way, the developmental speed is extremely fast and the number of produced offspring is high as well. Obviously, this means that the space of dynamic differentiation programs is constrained but investigation of this space and its constraints is too far from the scope of the current work. We are planning to investigate this topic in later work.

Finally, I think some aspects of the presentation of the results could be improved. I found Figure 2A in isolation difficult to fully interpret. There are three outcomes in this model, ISD, RSD, and NSD, and the frequency of each outcome is only shown in Appendix 3. I would suggest including the frequency of the two other strategies in the main text. The same applies to Figure 4. You can't infer from just looking at the frequency of ISD alone to what extent the patterns are driven by irreversible soma being favoured over reversible soma vs no soma being favoured at all.

Thank you for the suggestion. Figure 1 from appendix 3 is elevated to the main text.

Reviewer #3 (Recommendations for the authors):

I very much enjoyed this paper, and only have a few suggestions with respect to the model.

I think potential conceptual limitations of the model lie in the assumptions of synchronous cell division and constant development strategy.

It may be possible to address the first of these issues (and thus the initial concern of Dr. Walczak) with some illustrative supplementary simulations (e.g. preliminary results to demonstrate the extent to which maturation time is affected by such asynchronicity). These might even take the form of some simple continuous time ODE models.

However the second of these issues would be a highly difficult task, and lies well outside the scope of the current paper. While exploring this question might certainly serve as a nice extension to the current work, I would not expect the authors to tackle this in the current context, where it would merely muddle the story presented.

Finally, while I like the model in general, there are some points of clarification I think could be made. Although I feel I have understood the core elements, there are some points of ambiguity where it is possible that I may be mistaken, and ironing out these potential misconceptions in the appendices would be beneficial for readers.

Thank you for this feedback. With the help from your presentation, we rewrote the model section in the updated version of the manuscript. Below, there is our clarification of the model steps.

As I understand it:

The fraction of soma and germ cells in an organism are given by g(t) and s(t) in Equation 7, with s(t)=x in the main text (see Equation 2 – this should be made consistent?).

Note that 't' here refers to the generation t=1,2,…,n

These dynamics are independent of the costs of differentiation, cg and cs.

However, the division time for cells in the organism during growth is dependent on these costs (see 't' in Equation 1 – note that 't' here is the continuous doubling time, which has an inconsistent notation with Equation 4-8).

Writing Tgen(t) for the doubling time at generation t, we have

Tgen(t)=Fdiff * Fcomp

= (1+<c> )*( 1 – b + b ( ( x1 – s(t) )/( x1 – x0 ) ))

(when x0<s(t)<x1 – for simplicity I won't write out the other conditions)

Up to this point, all is correct.

At this point I'm not 100% sure how <c> is defined. I'd assume the following: <c> = 1+s(t)*cs*(sgs + 2 sgg)+g(t)*cg*(ggs + 2 gss)

(Is this correct?)

Not exactly. Since the differentiation program is stochastic, the costs of differentiation depend on the actual number of differentiation events happened in the course of growth, rather than probabilities like ggs. In each simulation, at each cell-generation step, we sample how many differentiation events (i.e. offspring is of a different type than the parent) will occur from a multinomial distribution. Each event brings cs and cg costs. The factor <c> used to compute the generation time is the average of these costs: the ratio between cumulative cost and the total number of cells at the beginning of the simulation step.

This way, the costs and the organism composition will differ among different runs of the same simulation. Hence, for each combination of control parameters, we run 300 independent realizations of organism growth and compute the expected population growth from this whole dataset (see below).

At this point I have Tgen(t) as a function of t (having substituted for g(t) and s(t) from Equation 7). This allows me to write the maturation time (time to reach a size 2n) as

Tmat = \sumt=1n Tgen(t)

Tmat is a sum of individual cell doubling times Tgen (t) but it is a random value from rather complex distribution and not a function.

Finally, the fitness of an organism with a particular developmental strategy is given by the rate of gamete production (i.e. the number of gametes at maturity divided by the time taken to reach maturity)

W = g(n) / Tmat

Since the differentiation strategy is stochastic, the time to reach maturity (Tmat) and the number of offspring at the last stage (g(n)) are random values, which we sample by repeatedly simulating the process of growth. In our previous work (Interacting cells driving the evolution of multicellular life cycles, PloS CB, 2019), we have shown that a population with stochastic development will grow exponentially with the rate (W) given by the solution of equation

\sumi Pi Gi e-W Ti = 1

where the sum is over possible trajectories i, Pi is the probability of each trajectory, Gi is the number of offspring produced at the maturity, Ti is the time to maturity. In our work, we numerically solve this equation using a sampled distribution of trajectories. In deterministic scenarios, like the absence of differentiation, the solution of this equation yields

W = log(g(n)) / Tmat,

which has a similar form as your expression.

Working out the evolutionary optimal strategy is then a matter of maximising W with respect to sgs, sss, ggs and gss.

Is this all correct?

This is correct.

If so, it may be possible to make analytical progress on this problem by replacing the discontinuous function in Equation 1 with a continuous approximation, e.g.

1 – (1 – b) x\β / ( x\β + (1 – x)\β )

Thank you for a suggestion. However, since the outcomes of cell divisions are stochastic, the sampling of developmental trajectories has to reflect that and in our case it is done numerically. Unfortunately, a replacement of Fcomp function with a continuous approximation does not lead to deeper analytical insights.

I mainly mention this latter point as a potential area of future investigation. However with respect to the model details, I would recommend the authors clarify the points above in one of the appendices.

Thank you again for careful consideration of our model. Now, we see that our initial presentation was confusing. In the updated text, we extended the presentation of the model.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Reviewer #1 (Recommendations for the authors):

I thank the authors for their responses and clarifications, as well as for extending their model to include risky differentiation and asynchronous cell division. I very much enjoyed reading the revised version of their manuscript, but I have the following questions about the new models they included.

Risky cell differentiation (Appendix 6): I don't exactly follow why the authors multiply the frequency and not the number of cell differentiations with per differentiation death risk. Maybe I am misunderstanding something, but isn't this implicitly assuming that deadly differentiation errors, if they happen in larger organisms, will have less probability to have an impact than in smaller organisms? Or in other words, the effect of deadly differentiation will be larger in smaller organisms? If true, this assumption might still be realistic, e.g. one deadly cell within 32 cells compared to 1024 can plausibly have a larger organismal level effect. But one can also argue that if one cell becomes a mutant and acquires a growth advantage, the overall size of the organism might not matter, especially e.g. if the mutant occurs early in the development or has a very high cell proliferation/resource uptake rate. Although this might not change the results in the main text qualitatively, as there the authors use one maturity size (210) in their calculations.

We appreciate your attention to details of the model design. During the manuscript revision, we discussed a lot how to implement the cell differentiation risk. The assumption, which you suggested – each individual defective cell brings the same risk to the organism, has also been closely considered. However, taking into account the context of our study, we settled on a different design. The key factor in our decision is that the organisms in our model constantly grow in size. As a result, the moment of the emergence of a defective cell plays a large role in the defect’s impact. If a defective cell emerges at the last round of cell divisions, it cannot harm the organism much, because the life cycle is about to end. However, a defective cell emerged in the very first cell division will have a large impact on the organism performance during the life cycle. To take this effect into account and make later defects less dangerous, we scaled the number of differentiation events by the organism size, thus attributing the risk to the frequency of differentiated cells.

We agree that a growth advantage of defective (defecting) cells may also play a role here. However, such an advantage should have much smaller impact on ever-growing simple organisms than on large animals. For animals, the division rate of the typical cell is close to zero, hence even a humble growth advantage of a defective cell brings a lot of difference to the cell dynamics over the span of long life cycle. In our case, every cell in the organism actively proliferates, while the life cycle is very short. Thus, the growth advantage must be large in order to have an impact. Hence, emergence of defects leading to a change in relative cell growth must be less likely in our model than in complex animals. On top of that, the rapid propagation of malignant tumors in human organism is often the end-result of a silent process of accumulation of multiple mutations, which can take years to complete. For the organisms we study, the defective cells just don’t have enough time to turn into proper cancer cells before the life cycle ends. Hence, we are convinced that the growth advantage of defective cells can be ignored in the context of our study.

We acknowledge that this design choice is tailored to simple organisms and may not be optimal for complex animals. In the revised manuscript, we added an exposition of our reasoning to the risky model section in Appendix 6.

Cell division asynchrony (Appendix 7): seeing the results of the cell division asynchrony, it seems like synchrony is almost a necessary condition for the evolution of irreversible differentiation in their model, just like those conditions summarized in Lines 335-339 albeit perhaps not as strict as them (since, although rarely, irreversible strategies still evolve). Perhaps it should be acknowledged as such earlier and more explicitly, rather than at the end of Discussion? Particularly given the fact that the authors looked at one of the most favourable conditions for irreversible strategies (c (s-> g) >> c(g->s)) and found that the evolution of irreversible differentiation is very rare.

We agree that there is a need to mention the assumption of synchrony as early as possible. We now explicitly state that the model and the results are based on synchronous cell division in the Model section. And the asynchronous cell division is explored additionally in the Appendix 7.

Reviewer #2 (Recommendations for the authors):

I think this model and analyses are very good and so I won't comment too much on these (with the exception of a thought on the asynchrony model). I think a few things need to be clarified but I am otherwise happy to endorse the paper for publication.

My main comments will concern the introduction and framing of this study as I think that there are still things that need to be made clear, which will really help the reader right from the start.

I think some very clear definitions of different terms are needed and as early as possible in the introduction. From what I can gather across different sections of the introduction: the soma cells are those that contribute to vegetative functions (sustaining the overall organism) but cannot act as a seed/propagule/spore for founding of a new organism. In contrast, germ cells are those that do not contribute to vegetative functions but can act as such a spore. The authors distinguish between terminally differentiated soma that do not divide such as in cyanobacteria and non-terminally differentiated soma that can divide such as in the Volvocales. They then ask the question in what conditions do the latter kind of soma (non-terminally differentiated) become irreversibly differentiated (that is when they can only divide and produce more soma). If the above is correct, I would suggest making these definitions and distinctions clearer and more localised rather than having bits of each definition spread across the introduction in a way that needs piecing together (perhaps a glossary type table could also help?)

We adopted the suggestion to clarify the differences between terminal differentiation and irreversible differentiation. We also explained the irreversible differentiation that is of most interest for us further in the Model section.

If the above is correct, then the definitions need to be applied more consistently throughout the manuscript. For instance, the "somatic" cells that exist during the growth of the "higher" Volvocales do not qualify as somatic cells per the author's definition as they do not contribute to vegetative functions. In this case only the last generation of flagella beaters are "soma", none of which divide and so the distinction between reversibly and irreversibly differentiated does not apply here. The authors have added a paragraph about this in the discussion but they lean on the Volvocales so much in the introduction and discussion of their work that this mismatch needs to be flagged much sooner in the paper.

Thank you for your thorough treatment of definitions! We define the germ-role as a state of a cell in which it continues to live after the organism fragments at the end of life cycle, and the soma-role as a state of a cell in which it dies upon fragmentation. We additionally assume that soma-role cells are capable to provide a benefit to the organism. However, this benefit is separate from the definition of soma-role itself. For example, according to our model, if the fraction of soma-role cells is below the contribution threshold x0, there is no benefit to the organism. Still, these cells are considered as soma-role. Hence, in your example, we see no conceptual problem with progenitors of flagella beaters not contributing to the vegetative functions, as contribution of soma-role cells is still possible under the right conditions (after the last round of cell divisions here) but not mandatory. In the revised manuscript we make our definitions more clear.

Our attention to Volvocales in the introduction and discussion reflects the overwhelming bias in the empirical literature about evolution of germ/soma differentiation towards this group (and also cellular slime molds but their multicellularity is aggregative). Without this group, simply, there is a little to discuss. Nevertheless, our model is not about Volvocales, we aim to answer the question about emergence of irreversible somatic differentiation in a broad context without tailoring it to the features of a single group. As a result, the design of our model does not fully reflect the features of Volvocales life cycle. We overlooked that our introduction made an impression that the manuscript is about a single prominent group. In the updated version, we modified the exposition of our work. We also explicitly acknowledge the bias of empirical examples in our discussion.

A similar issue applies to the discussion of Cooper and West 2018. Much as I would like to pretend that this paper could potentially cover all possibilities, group growth is not explicitly modelled here and so the distinction between reversible and irreversibly differentiated soma does not apply here (one can imagine that in this model there are no cooperative interactions as the group grows and that division of labour then may occur but only in the last generation of the group life cycle before spore dispersal much like for the Volvocales). If non-sterile helpers count as soma then this might be a different issue as sterile cells may be considered irreversible soma and non-sterile helpers as reversible soma, but then these non-sterile helpers can "seed" the next generation so I don't think they really qualify as soma per the author's definition? Having clearer definitions will help resolve these confusions.

We appreciate your suggestions. We have modified the comparison to this paper in the Introduction section. Meanwhile, as you mentioned in the above comment, we have elaborated the definition of germ-role and soma-role in the model section and stressed the differences between your and our models in discussion.

Otherwise, I feel that the authors have sidestepped the potential impact of non-static traits in their model by saying in their response to reviewers that they have plans for a future paper on this. That is great and I am very much looking forward to what they find but this issue still needs to be discussed in this paper as many of their results here could be explained as arising directly from the assumption of static strategies as the group grows. I would suggest mentioning this at least once or twice as they go through the results (around lines 238-242 would be good) and then a whole paragraph on this in the discussion is warranted (can also mention plans for future work on this here). For instance, the need for just a few somatic cells that provide large benefits seems to arise directly from the fact that germ cells can't modulate the number of soma cells they spawn once these become too numerous, or that they can't have a time based strategy that produces many soma earlier but fewer later as the group grows.

As you suggested, in the updated Results section, we acknowledge that our findings are obtained under assumption of static strategies. We also added a new paragraph to discussion about non-static differentiation strategies: their existence in nature and a possible impact on the model results.

I think the results they have found in the asynchronous model is really good but needs more explanation/discussion of why ISD can't seem to work here. They have modelled a time to replication cost as arising from the different differentiation costs. I find it strange in that case that RSD is not the worst affected strategy as the authors have established that this is the strategy with the most differentiation. I otherwise would have thought that having soma cells that divert their energy to vegetative functions as the slower replicator might have been a natural way to introduce asynchrony.

Irreversible differentiation is suppressed in the asynchronous model because there, the reversible strategies are capable to develop large number of soma-role cells early on and keep large fraction of germ-role cells throughout the life cycle even at the high soma differentiation costs (cs->g=10). Moreover, we would like to highlight that unlike in the synchronous model, here, high costs of soma differentiation bring an advantage to reversible differentiation strategies. The key effect here is that if costs of soma differentiation is large enough, the expected period of cell division for a differentiating soma-role cell is longer than the length of life cycle. As a result, instead of re-differentiation, soma-role cells become effectively terminally differentiated. In such a situation, the growth of the organism is determined by propagation of germ-role cells and is not affected by soma differentiation costs. Hence, the development of organisms using reversible strategies remains fast and irreversible strategies are outcompeted. In the updated manuscript, we discuss this effect in the Appendix 7.

We appreciate the idea of soma-role cells being slower replicators. We believe it may lead to a more elegant model. However, in this appendix, we test the impact of the assumption of the synchronous cell division used in the baseline model. Hence, it is more appropriate to present the asynchronous implementation of the base model keeping the remaining design intact. This immediately dictates that the costs must be associated with differentiation events and not with the cell state itself. Otherwise, it would be difficult to compare results from two models. We emphasize this motivation in the updated Appendix 7.

Finally, I think a word of caution on the discussion of "convex" shapes and how this favours division of labour/terminal differentiation/irreversible differentiation (lines 323-332). In several of the models cited (if not all), the convexity at issue is the relationship between an individual's investment in a public good/vegetative function and the fitness return to the group. In the authors' paper, the convexity is between the number/proportion of "helpers/soma" in the group and the fitness return to the group. These are very different things (one has to do with synergy from internal efficiencies whereas as the other comes from synergies from between individual interactions) and so should not be treated as the same prediction.

We agree with you on the analyses of the model differences between ours and the previous works. In the revised manuscript, we have stressed the model differences in the curvature of the shapes.

Reviewer #3 (Recommendations for the authors):

The authors have made substantial changes to the manuscript that appear to have addressed many of the concerns of the reviewers.

In their response, the reviewers clarified some details of the model, and I now feel I have a better understanding as to how it works. However that has led to another couple of small suggestions on my part that I believe would help readers.

In my original review I stated:

"Note that 't' here refers to the generation t=1,2,…,n

However, the division time for cells in the organism during growth is dependent on these costs (see 't' in Equation 1 – note that 't' here is the continuous doubling time, which has an inconsistent notation with Equations 4-8)"

I now see that under the costless differentiation assumed in Appendix 2 , t becomes an integer which helps simplify the subsequent analysis. It's worthwhile to make a note of this fact (before the sentence "Then, the expected fractions.…" would be an obvious potential place to mention this).

Thanks – this was obviously a flaw in our notation. We have taken a new letter “j” to represent the previous “t” in the revised manuscript in Appendix 2 to clarify the number of cell divisions. We have further explained the variable “n”, which describes the maximal number of divisions. Thus j=1,⋯,n.

The authors response also makes clear at multiple points that cell divisions are stochastic:

"Since the differentiation program is stochastic, the costs of differentiation depend on the actual number of differentiation events happened in the course of growth, rather than probabilities like ggs.",

"Since the differentiation strategy is stochastic, the time to reach maturity (Tmat) and the number of offspring at the last stage (g(n)) are random values, which we sample by repeatedly simulating the process of growth."

"However, since the outcomes of cell divisions are stochastic, the sampling of developmental trajectories has to reflect that and in our case it is done numerically."

I understand this. My comments, which I may not have articulated clearly in my initial review, were more aimed at asking how much understanding could be gained from alternatively taking a mean field approach. Indeed, this is precisely the approach the authors themselves take in Appendix 2, where they "consider the mathematical expectation of the composition". This leads to the obvious question – why can't a similar approach be used when differentiation is not costless?

Of course, I completely understand that stochasticity could be very important in a model such as this (where initial cell numbers are low), and it may be that such a mean-field approach leads to misleading results with respect to the prediction of the mean population growth rate. If this is the case, I think the authors should make a statement of this fact somewhere, perhaps with a reference to results in Gao et al., 2019 with respect to the differences between mean field and stochastic predictions.

Thank you for the clarification. While we completely agree that a mean field model can potentially provide a lot of insights about our findings, we have to admit that it does not do so. The expressions in the Appendix 2 are short and it seems promising to develop such a model further. However, once we put differentiation costs into play, things quickly get complicated. To find the population growth rate λ, we also need to compute the length of life cycle, which is not always possible. For the special case of step-function Fcomp, (e.g. x0=x1), ignoring the discreteness of the organism composition, and relying on Mathematica software, we managed to find the population growth rate λ explicitly. Shortening the notation with shortcut variables, it is given by:

λ = {bM2(n+log[1xf])aM(1b)x0xf+β(Mn(1b)log[mgsmgsMx0]), x0<xfM(n+log[1xf])βnαxf,x0>xf

where

M=mgs+msg
xf=mgsmgs+msg(1e(msg+mgs)n)
α=csmsgcgmgs
β=mgs+msg+mgsmsg(cg+cs)

The condition x0<xf checks whether the fraction of soma-role cells ever reaches the threshold value x0 (xf is the fraction of soma-role cells at the end of life cycle, c.f. Equation 9 in Appendix 2).

Finding the optimal developmental strategies means finding the maximum of the expression above. This can be only performed numerically (and we already have a more detailed numerical model). In the absence of insightful results from this analytical approach, we just do not see the reason to include this result to the paper.

In our previous paper (Gao 2019) we did not use a mean field approximation (replacement of stochastic trajectories with the most likely one). Instead, our analytical results were obtained with the weak selection approximation (all stochastic trajectories have approximately similar duration Ti). We do not use this approximation in our current work because all the interesting strategies are found under such conditions where trajectories with many soma-role cells complete life cycle much faster, and thus, times to complete different trajectories differ a lot.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Transparent reporting form

    Data Availability Statement

    The code implementing our model is deposited at https://github.com/YuanxiaoGao/Evolution-of-irreversible-somatic-differentiation (copy archived at https://archive.softwareheritage.org/swh:1:rev:9a1ea7c84f3041ebe3720e7837b28182912b5e00).


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES