Abstract
One of the last uncharted territories in evolutionary biology concerns the link with cell biology. Because all phenotypes ultimately derive from events at the cellular level, this connection is essential to building a mechanism-based theory of evolution. Given the impressive developments in cell biological methodologies at the structural and functional levels, the potential for rapid progress is great. The primary challenge for theory development is the establishment of a quantitative framework that transcends species boundaries. Two approaches to the problem are presented here: establishing the long-term steady-state distribution of mean phenotypes under specific regimes of mutation, selection, and drift and evaluating the energetic costs of cellular structures and functions. Although not meant to be the final word, these theoretical platforms harbor potential for generating insight into a diversity of unsolved problems, ranging from genome structure to cellular architecture to aspects of motility in organisms across the Tree of Life.
Keywords: evolutionary cell biology, population genetics, bioenergetic costs, bacterial envelope, cell motility
All organismal features are functions of structures and processes that develop at the cellular level. Thus, given that most of the prominent advances in the life sciences over the past 50 years have emerged from the field of cell biology, the incorporation of cell biological issues into the mainstream of evolutionary science is long overdue [1]. Establishing a formal field of evolutionary cell biology will have to integrate the cataloging of diversity via comparative studies into an explanatory theoretical framework. The goal is not to simply document what has evolved but to understand the mechanisms by which change has come about and might be encouraged (or discouraged) from proceeding in certain practical situations.
Developing such theory is challenging, as there is a need for a conceptual framework for testing alternative evolutionary hypotheses. Unlike the situation in the well-trodden fields of molecular and genomic evolution, the subject matter is not a linear readout of nucleotides or amino acids. One cannot construct a neutral (null) evolutionary hypothesis for variation in a cellular trait based on the expected behavior in the absence of selection and then contrast this with observed trait values (a strategy widely embraced in molecular-evolution studies that compare, for example, silent and amino acid replacement sites in protein coding genes). The problem here is that most cellular features involve complexes of multiple types of organic molecules (nucleic acids, proteins, lipids, and carbohydrate derivatives) and/or dynamic reaction rates that depend on the cellular environment, sometimes with the key components changing among phylogenetic lineages.
In exploring some potential paths forward, we start with the general assumption of uniformitarianism. That is, as with the laws of physics and chemistry, we assume that the general mechanisms of evolution observed today have applied across the Tree of Life for the past 3 billion or so years in which cells have occupied the planet. By extension, we assume that these mechanisms are consistent with the basic principles of evolutionary population genetics that have been laid down over the past century of research [2,3]. Based on a myriad of empirical observations on the genetics and stochastic features of inheritance, mutation, and recombination, combined with the ubiquitous force of natural selection, the field of population genetics is now so well established that any credible evolutionary framework needs to either be fully consistent with evolutionary-genetic principles or to explain why a perceived deviation from such principles remains in the realm of rational discourse.
As an entrée to developing a theoretical foundation for evolutionary cell biology, we propose two potential approaches that can be readily integrated into a hypothesis-testing framework. As these are early days for this field, alternative points of view are desirable. One of the most significant problems in the broader body of biological thinking is the common assumption that all observed aspects of biodiversity are products of natural selection [4,5]. With this mind set, evolutionary biology becomes little more than a (sometimes endless) exercise in dreaming up the supposed agents of selection molding one’s favorite aspect of phenotypic diversity. However, we now know that this unwavering belief in the limitless power of natural selection is untenable. For example, much of the substantial molecular/genomic variation across the Tree of Life can only be understood by appreciating the inability of natural selection to oppose certain kinds of mutational changes [6–8].
One might be inclined to believe that as the focus expands to higher-level aspects of cellular organization, the likelihood of anything other than natural selection dictating the course of evolution will become diminishingly small. As will be demonstrated below, however, there is now substantial evidence that stochastic forces due to finite population size and chromosome structure always dictate what natural selection can and cannot promote/oppose in alternative phylogenetic contexts. This means that a rational field of evolutionary cell biology will require some resistance to the seduction of adaptive story telling. It remains true that evolution is a process of descent with modification, and few traits are overtly maladaptive, but the processes that enabled the emergence of certain attributes millennia ago need not reflect current-day functions.
As we hope to reach a broad audience, we start with an outline of the basic principles of population genetics essential to understanding the limits to what natural selection can accomplish, with particular emphasis on how the forces of mutation and random genetic drift conspire to define the bounds within which mean phenotypes of traits are likely to wander over evolutionary time. We next explore how results from the fields of bioenergetics and biochemistry provide one particular approach to integrating the vast array of results from molecular cell biology with evolutionary theory. After considering applications of these ideas to an array of issues in cell biology, we close with a discussion of the plausibility of the idea that many aspects of cellular diversity, particularly in eukaryotes, have been channeled by nonadaptive forces of evolution, calling into question the common view that eukaryotes are on a progressive evolutionary march toward adaptive perfection.
The Population-genetic Environment and the Limits to Natural Selection
Evolution is a dynamical process that sometimes reaches equilibrium states when there is a balance between opposing forces of allele-frequency change (e.g., mutation and selection biases operating in opposite directions). One of the lessons learned from decades of genetic surveys at the DNA level is that, at any one time, there are typically no more than one or two nucleotide variants simultaneously present at the same genomic site within a population. This simplifies many aspects of theoretical population genetics [3]. If p is the frequency of a particular variant at a nucleotide site and s is its selective advantage (or disadvantage), the expected change in allele frequency across generations is sp(1 − p), where p(1 − p) is a measure of variation at this site (equivalent to half the heterozygosity under random mating). Variants of this expression exist if there are nonadditive interactions among alleles, but this will not qualitatively alter the general points to be made below.
Of special relevance is the fact that sp(1 − p) is the expected rate of allele-frequency change in the absence of opposing forces. If the latter conditions are met, a beneficial allele will simply march to fixation (frequency). However, except in rare cases of extraordinarily strong selection, one or more opposing forces almost always influence the fates of mutant alleles. One of the key determinants of allele-frequency change is random genetic drift, which we define here as the cumulative effect of various sources of noise in the gene-transmission process across generations, subsumed into a parameter called the genetic effective population size Ne [9]. Notably, genetic drift operates like a diffusion process, with the variance in allele-frequency change across generations being equal to p(1 − p)/Ne. (The denominator must be multiplied by two in a diploid population, but we will adhere to a haploid situation throughout.)
Because both the change in allele-frequency via selection and the variance in allele-frequency change via drift are proportional to p(1 − p), the ratio s/(1/Ne) = Nes is p = 1 a general indicator of the ratio of the power of selection to drift, independent of allele frequency. This benchmark proves to be central to many aspects of molecular, genomic, and cellular evolution [3,10]. For situations in which Ne|s| > 1, selection is the primary determinant of the fates of common alleles (although drift always plays a role at low frequencies). If, however, Ne|s| < 1, natural selection is rendered powerless, as the stochastic forces of drift always overwhelm the deterministic force of selection, a condition called effective neutrality. The key point is that as Ne declines, the fraction of mutations (beneficial and deleterious) with fates largely determined by drift necessarily expands.
Many factors influence the genetic effective size of a population [3,9,11]. In the ideal case in which family sizes are random, Ne is simply equal to the number of reproductive adults, Na. However, most demographic factors conspire to make Ne smaller than Na, often by orders of magnitude—due to environmental heterogeneity, family sizes are generally not random; sex ratios can deviate from 1:1; and temporal bottlenecks in Na have much more long-lasting effects than do population-size expansions. More importantly, however, many aspects of the stochastic transmission of allele frequencies across generations are due to intrinsic genetic factors rather than to demographic forces.
Most mutations are detrimental to fitness, creating a background of low-frequency deleterious alleles distributed across chromosomes and individuals in all populations. As a consequence, a new beneficial mutation will frequently arise at a location closely linked to deleterious variants at other sites, reducing its net selective advantage, in some cases to the point of being dragged out of the population by negative selection against the linked variants, a process known as background selection [9]. The magnitude of this force depends on the rate of recombination, which can liberate a beneficial allele from its deleterious background. However, even when a beneficial mutation appears on a clean genetic background, it can become a victim of newly arising linked deleterious background mutations. Moreover, selective interference also occurs between beneficial mutations at different genomic locations, as these simultaneously compete for fixation. The key point is that even extremely large populations can be subject to relatively strong stochastic effects, with reduced Ne being primarily a consequence of genes being linked together on chromosomes. In very large populations, Ne is expected to scale very slowly (approximately logarithmically) with Na [12].
Negative scaling of the effective population size with organism size
One of the central empirical challenges in evolutionary biology is acquiring estimates of Ne. Ideally, Ne would be measured directly from temporal fluctuations in allele frequencies, but this is essentially impossible for very large populations, as the true fluctuations are so small as to be unobservable with reasonable sample sizes and time scales. An alternative approach relies on observed population levels of variation at nucleotide sites assumed to be largely free of the forces of selection (e.g., silent sites at the third positions of codons). Under long-term drift-mutation equilibrium, the average level of nucleotide variation at such sites (here defined to be the mean heterozygosity between randomly paired gametes or haploid genomes) is equal to the ratio of the power of mutation to that of drift, 2Neu, where u is the mutation rate per nucleotide site per generation (the 2 being replaced by 4 in diploids). When a measure of such variation is available along with an estimate of the mutation rate, Ne can be estimated by factoring out. 2u.
As a large number of direct estimates of u are now available to a high degree of precision (below), some general statements can be made about estimates of Ne obtained in this way. First, over a 17 order-of-magnitude gradient of organism size (mass per adult) across the Tree of Life, there is a progressive, nearly three order-of-magnitude decline in Ne (Lynch et al., submitted) (Fig. 1). Large vertebrates typically have Ne in the range of 104 to 4 × 105, invertebrates in the range of 3 × 105 to 3 × 106, and vascular plants have Ne intermediate to these two groups. On the other hand, Ne in microbial eukaryotes ranges from 4 × 106 to 2 × 108. Bacteria have still higher Ne, but in no case does this exceed 5 × 108. As a pile of 109 bacterial cells occupies ~1mm3, these results make clear that Ne is typically many orders of magnitude smaller than the absolute population size.
Fig. 1.

The negative scaling of effective population size (Ne) with organism size across the Tree of Life. The shaded squares envelope the full range of estimated variation for each major group, and it is plausible that over evolutionary time the mutation rates of individual taxa wander between the upper and lower limits of each group, as there is drift around the drift barrier [35]. The line is the least-squares fit to the overall data Ne = (5.1 × 106)B−0.21, r2 = 0.87.where B is adult dry weight (in μg). See Supplemental Table 1 for further information on sources of data.
These observations imply that the pool of mutations available to natural selection is potentially much broader in microbes than in multicellular species. Recalling the approximate breakpoint for effective neutrality, |s| < 1/Ne, natural selection in bacteria can perceive mutations with absolute effects as small as 10−9, whereas the critical cutoff in vertebrates is more on the order of s = 10−5. This four-order-of-magnitude gradient of critical s implies that natural selection has the capacity to exploit much more fine-scale variation in bacteria than is the case in eukaryotes, particularly in multicellular species.
The molecular resource base for natural selection
The degree to which this gradient in the drift barrier translates into differential susceptibility to adaptive evolution ultimately depends on the distribution of fitness effects of new mutations. For random genetic drift to impose a significant barrier to the evolutionary refinement of a trait, there must be a substantial pool of deleterious mutations with small enough effects that they can drift to fixation in species with small Ne. Although it is difficult to make statements about specific cellular traits, numerous lines of evidence suggest that the vast majority of mutations have deleterious effects, with the distributions of fitness effects associated with both beneficial and deleterious mutations being biased to near-zero values.
First, serially bottlenecked mutation-accumulation (MA) lines across diverse species, designed to specifically prevent the influence of selection on the pool of de novo mutations, consistently reveal a slow per-generation decline in growth rate and other fitness-related traits, as expected with a strong predominance of mutations with deleterious effects [13,14]. Second, statistical inferences based on the performance of individual MA lines imply highly skewed distributions of fitness effects, with the modes for both deleterious and beneficial effects indistinguishable from zero, and the bulk of the distributions having absolute effects below 1% [15–17]. Third, more indirect details on the mutational distribution of fitness effects, derived from studies on the site-frequency spectra of segregating alleles, commonly suggest that 10—40% of mutations in diverse organisms have deleterious effects smaller than 10−5, with the mode of this pool again being near (if not at) 0.0 [18–22]. Fourth, bioenergetic considerations of the costs of small nucleotide insertions, which typically comprise ~10% of de novo mutations [23], imply fractional reductions in fitness typically far below 10−4 per mutation [8]. Thus, the existence of large pools of mutations with effects small enough to be immune to selection in some lineages but large enough to ensure removal/promotion by selection in others is not in doubt.
The Drift Barrier to Cellular Perfection
The substantial contributions of mutations of small effect to phenotypic variation and fitness raise questions about the common view that all aspects of cell biology are fine-tuned by the unlimited power of natural selection. To further clarify several points, we introduce a simple model to illustrate how the mean phenotype of a complex trait is expected to respond to the joint forces of selection, mutation, and random genetic drift. The starting point is the assumption that the intensities and patterns of these forces remain relatively constant over a long period. Under such a scenario, stochastically arising mutations have a certain probability of being promoted by selection and/or simply drifting to fixation, and back mutations ensure that movement in the opposite direction can also occur. Over a long evolutionary time period, the mean phenotype will then wander within a range dictated by the prevailing forces of selection and the diffusive properties of mutation and random genetic drift. As a result, there is not a single point equilibrium, but a quasi-steady-state distribution of mean phenotypes, which can be viewed as the temporal distribution of mean phenotypes over time for a particular lineage or as the dispersion of isolated lineages experiencing the same population-genetic environment. Although this type of model may be viewed as unrealistic for evolutionary responses to temporally varying ecological changes, it serves as a reasonable heuristic guide to the evolution of intracellular features carrying out conserved functions for eons in settings buffered by internal homeostatic mechanisms.
Consider a trait for which the genotypic value is determined by ℓ genetic loci (or nucleotide sites) each having two possible allelic states (+ or −) with additive effects, and with the rate of mutation from + to −, and the reverse, being u10 and u01, respectively. Although there are 2ℓ possible haplotypes, because of the equivalence of effects across all loci, there are only (ℓ+1) genotypic values (defined by the number of + alleles) (Fig. 2). If the population size is suitably small (such that the expected number of mutations arising in the population per generation for the trait is < 1), evolution will proceed in an essentially stepwise manner between adjacent classes. With paths connecting all adjacent states, complete exploration of genotypic space is possible.
Fig. 2.

Schematic for the evolutionary transition rates between adjacent genotypic classes under the sequential-fixation model for the case of ℓ = 4 contributing genetic loci. Note that for all but the end states, there are multiple phenotypically equivalent haplotypes (with the red boxes), the number of which (the multiplicity) is defined by the coefficient of the binomial distribution. The net mutation rates between genotypic states are functions of the forward and reverse mutation rates (μ01 and μ10) and the number of parental + and − alleles; the population size also influences the absolute flux rate, but as this is assumed to be constant, it influences all rates equally and has no influence on the final equilibrium distribution, and so is dropped from the figure. The quantities φ01and φ10are the probabilities of fixation of newly arising mutations, which are functions of the selective advantage of mutant relative to ancestral allele and of the absolute and genetic effective population sizes. This schematic readily generalizes to any value of ℓ.
Under this model, the rates of transition between states depend on the products of mutation rates and probabilities of fixation of newly arising mutations. The former depends on the genotypic state (number of +/− alleles) and the population size, whereas the latter is described by Kimura’s equation [24], which depends on Na, Ne, and s. This model has already been applied to cell biological features such as transcription factors and their binding sites [1] and the alternative states of multimeric proteins in Ref. [25]. The goal here is simply to make the interpretation of the theoretical results more transparent.
A useful feature of the linear mutation model is that the equilibrium probability of each genotypic state m is simply equal to the product of the transition rates on the arrows pointing toward the state from above and below, with the final result condensing to
| (1a) |
C is the normalization constant, equal to the reciprocal of the sum of the terms to the right of C for m = 0 to ℓ, which scales the sum of probabilities to 1 [26]. The coefficient sm is a measure of the selective advantage of genotypic class m relative to some reference point (usually scaling the most fit class to have fitness 1.0 and all other classes to have relative selective disadvantages sm < 0). The quantity Nesm is then equivalent to the now familiar ratio of the strength of selection relative to drift for genotypic class m. Finally,
| (1b) |
is the expected distribution of genotypic values under neutrality (were drift and mutation to be the only evolutionary forces), equivalent to the binomial distribution, with η = u01/(u01 + u10) being the expected frequency of + alleles at each site in this case. Taken together, Equations (1a, b) show that the steady-state probability distribution under selection is a simple multiplicative transformation of the neutral expectation, with the weight for each class being the exponential function of the scaled strength of selection, e2NeSm. Under neutrality, sm = 0 for all m, reducing Equation (1a) to (1b). The mean equilibrium value of m, , is the drift barrier—mean phenotypes that are closer to a local optimum than μD might transiently arise by mutation but cannot be sustained, owing to the inefficiency of selection beyond such a point.
The evolutionary implications of this model can be summarized as follows. First, Equations (1a,b) make clear the role of mutation bias in evolution, a subject of recent debate [27–30]. The steady-state genotype distribution depends only on the distribution of mutational effects, that is, on the ratio of mutation rates, not on the mutation rate itself. Moreover, mutation bias alone does not simply push the genotype distribution to a particular location in the absence of selection. Rather, the multiplicity of haplotypes within genotypic classes is a strong determinant of the mean of the distribution. In the absence of selection, the genotypic mean is ℓη (the expected number of + alleles per genome), and with increasing Ne natural selection progressively pushes the distribution toward the optimum value. This means that depending on the number of loci involved (ℓ) and the position of the optimum, the location of the mean phenotype (genotypic value) can either increase or decrease with increasing Ne. The central point is that, even in the absence of mutation bias, the multiplicity of functionally equivalent haplotypic states acts as an evolutionary attractor, ensuring that the mean phenotypes of complex traits will almost never perfectly coincide with optima defined by natural selection.
Second, even though natural selection can efficiently move the expected genotypic means toward the optimum with increasing Ne, there can still be considerable variance in the distribution. This can be seen in the example given in Fig. 3, where there is persistent directional selection toward the highest genotypic value; other examples with alternative fitness functions can be found in Refs. [1,26]. The exact scaling of the overall mean phenotype with Ne will depend on the form of the fitness function, but the standard deviations of the steady-state distributions are often on the order of 20% or so of the mean. The central point here is that the stochastic forces of drift and mutation cause population mean phenotypes to wander around the drift barrier. This behavior should be of concern to those who wish to interpret all of interspecific variation in mean phenotypes as inevitable outcomes of natural selection.
Fig. 3.

An example of the response of the steady-state mean-phenotype distribution to effective population size differences with ℓ = 100. Here fitness of genotypic class m is equal to exp(−0.5[(m − 100)/20000]2), a half Gaussian such that the optimal fitness occurs for m = 100, and mutation is biased in the downward direction such that u01 = u10/25. Selection here is weak, and distributed over ℓ = 100loci, with m = 91 having a reduction in fitness of 10−7, and m = 0 having a reduction of 1.25 × 10−5, so that the distribution with Ne = 105 is close to the neutral expectation (given by the left-most dashed distribution). Results are given for Ne ranging from 105.5 to 109.
We close this section with two examples that seem to be in accordance with the drift-barrier hypothesis. First, one of the only cellular traits for which there is a broad set of estimates across the Tree of Life is the mutation rate per nucleotide site per generation (u) [31,32]. Because the vast majority of mutations are deleterious, theory suggests that mutator alleles are generally selected against as an indirect consequence of the linked load of deleterious mutations that they promote [33,34]. This is a relatively weak form of selection because the excess load associated with a mutator allele is equal to the genome-wide deleterious mutation rate itself, some of which is removed by recombination. Such a load will become progressively smaller as the mutation rate is reduced by selection in populations with larger Ne, as the maximum improvement cannot exceed that which would be accomplished by complete replication fidelity.
Consistent with a drift-imposed gradient, there is a negative scaling of the mutation rate over a five order-of-magnitude increase in Ne (Fig. 4). Although the quantitative scaling varies among groups of organisms, this too is consistent with the theory. For example, the five- to ten-fold reduction in u in unicellular eukaryotes relative to that in bacteria is in accordance with the idea that the magnitude of selection operating on the mutation rate is a function of the genome-wide deleterious mutation rate summed over all functional sites [31], as eukaryotes have substantially more genomic material under selection than do bacteria (owing to the increase in gene number and regulatory DNA). Also consistent with the drift-barrier hypothesis for replication fidelity is the high mutation rate associated with “error-prone” polymerases involved in relatively small numbers of nucleotide transactions per cell cycle [35], and the substantial increase in transcript error rates, as their effects are transient over the life span of a cell [36,37].
Fig. 4.

Negative scaling of the base-substitution mutation rate per nucleotide site with the effective population size. Individual regression lines, and shaded squares encompassing the data, are given for three groups of organisms. The data are primarily drawn from the compilations in Refs. [31,32], with a few additions that have emerged since those publications (Supplemental Table 1).
Second, as an illustration of the notion of multiplicity of alternative, functionally equivalent sites, consider the observations that have been made on post-translational modification in eukaryotes. Phosphorylation of amino acid residues commonly plays a role in modifying protein activity, and comparative studies in yeasts and mammals indicate that many phosphorylated serines and threonines are under purifying selection to retain their phosphosite status [38–41]. However, large fractions of phosphosites appear free to vary among species in terms of status and location [42–45]. For example, only ~ 5% of all Saccharomyces cerevisiae phosphorylation sites are conserved across the entire yeast lineage (dating back ~700 million years), and even when the same phosphorylatable residue is present in two moderately related species, their phosphorylation status may differ. In addition, Asp and Glu residues, which are naturally charged negatively, serve as potential replacements for their phosphorylatable counterparts, that is, phosphosites often evolve from phosphomimetic Asp and Glu sites and vice versa [46–48]. Taken together, these observations suggest a scenario whereby the number of phosphorylated residues on individual proteins behaves like a quantitative trait under stabilizing selection for an appropriate negative charge (with site-specific digital properties, such as those represented in Fig. 2), but with enough degrees of freedom that the specific locations of the affected residues are free to wander in an effectively neutral fashion [49,50].
For purposes of presentation, the model presented above has been idealized in a number of ways. For example, mutations with a distribution of effects have not been included, and the model assumes that evolutionary transitions move from one essentially fixed state to another adjacent state, ignoring the possibility of vaulting multiple states in populations with very large Ne. Although implementation of these modifications into the theory is desirable, violations of these assumptions are unlikely to alter the qualitative conclusion that the level of achievable cellular perfection will scale with Ne. Allowing for a distribution of mutational effects should strengthen this conclusion, as it will extend the range of effective neutrality for a subset of mutations into a larger range of Ne. For traits involving a very large number of loci, such as growth rate, the elevated gradient in multiplicity is expected to play a more substantial role in causing a trait to stabilize at positions below the optimum.
The Bioenergetic Constraints on Cellular Evolution
The proposed explanation for the negative scaling of the mutation rate with Ne is based on population-genetic theory defining the average-linked fitness burden of mutator genotypes that builds up over hundreds of generations of mutation accumulation. However, most cellular modifications have immediate impacts via impacts on the cell’s energy budget resulting from the synthesis of the basic building blocks for trait construction. Unless the fitness advantages provided by the trait modification outweigh the investment costs by an amount exceeding the power of drift, promotion by natural selection is not possible, although if effectively neutral, the variant may drift to fixation and may even be driven toward that end by mutation bias.
At the heart of this subject are two central concepts in biology: 1) the costs of building and maintaining cellular features and any resultant benefits in terms of biomass production and 2) the degree to which such alterations translate into fitness differences influencing long-term evolutionary trajectories. Here, we attempt to sketch out a quantitative framework for addressing these issues, following the procedures outlined in Refs. [8,51].
The bioenergetic cost of a cellular feature
As with others before us [52–56], for practical reasons, we will quantify costs in units of ATP hydrolyses (or their equivalent), the universal currency of bioenergetics across the Tree of Life. Situations do exist in which energy is not the limiting resource, for example, phytoplankton populations with some key elements such as carbon, nitrogen, phosphorus, or a trace metal constraining the rate of cell growth. For microbes growing on a defined medium with a single compound providing carbon and energy (and all other nutrients in excess supply), the growth yield per carbon consumed increases linearly with the substrate heat of combustion (which is linearly related to the number of C–H bonds available for oxidation [57]; up until a threshold value, thereafter leveling off (Fig. 5). This suggests that below a critical value of ≃ 10 kcal/g resource carbon, growth for most heterotrophs is limited by energy, whereas above this threshold the food supply contains excess energy relative to carbon content required for growth. Notably, the most common substrate used in microbial growth experiments, glucose, has a heat of combustion of 9.3 kcal/g carbon, close to the threshold at which growth is equally limited by carbon and energy.
Fig. 5.

Growth yields per unit carbon consumed as a function of the heats of combustion of the carbon substrate (which also serves as the source of energy). The scaling follows a power-law relationship (r2 = 0.81) up to HC ≃ 10, and thereafter levels off at a constant value ≃ 1.3 (units are defined in the axis labels). Results are taken from various sources in the literature (Supplemental Table 2); all results for eukaryotes involve fungi (mostly yeasts). The dashed-line threshold is fitted by eye.
Although cases involving nonenergy limitation may require a shift in the currency to be used, the conceptual framework outlined below should still apply. Regardless of the substrates being consumed, all aspects of cellular maintenance and growth require energy, and although the yields of microbes grown on alternative substrates (per unit carbon consumed) vary with the nature of the substrate (Fig. 5), few commonly used substrates have yields more than 50% below the threshold value, and the estimated numbers of ATP hydrolyses necessary to build an offspring cell are relatively independent of the substrate used [52,55,56]. This provides at least partial justification for using ATP as the currency in the kinds of computations outlined below.
In evaluating the total cost of a cellular feature, four categories of investment must be considered (Fig. 6). First, nearly all cellular features are assembled from four types of monomeric building blocks—amino acids, nucleotides, lipid molecules, and/or carbohydrates, which if not acquired externally, must be synthesized by processes requiring carbon skeletons and energy investments, for example, from transformations of glucose or acetate to precursor metabolites that are then further processed. Second, the construction of a higher-order structure from its basic building blocks imposes assembly costs, such as expenditures in polymerizing nucleic acid and amino acid chains, adding posttranslational modifications. Third, the maintenance of biomolecules generally imposes still additional costs such as accommodation of molecular turnover. Although there are acquisition costs for obtaining external resources, for example, use membrane-bound channels, phagocytosis, and mobility, these occur upstream of the steps used in trait construction/maintenance and are not considered in the following analyses.
Fig. 6.

The distinction between direct and opportunity costs associated with synthesizing monomeric building blocks. These costs are only evaluated at the point where a precursor molecule is first exploited in the biosynthetic pathway, as any earlier processing (and either gain or loss of energy) would occur regardless of the downstream modes of precursor utilization. The direct cost represents the basic biochemical expenditures involved in producing the monomeric building block, which must be taken out of the cell’s total energy budget. When a precursor molecule has been utilized, the opportunity cost represents the degree to which this loss robs the cell of energy that could be used in other processes. The cost of an entire trait further entails costs of processing, assembling, and maintaining the structure derived from the underlying monomeric components.
Finally, in addition to the three sets of direct costs of constructing and maintaining a trait, there will usually be indirect costs associated with diverting resources that would otherwise be available for other cellular functions (Fig. 6). Such costs are incurred because metabolic precursors used in the biosynthesis of the building blocks of the trait under consideration can no longer be used for ATP production or as carbon skeletons for other structures. Atkinson defined these secondary, opportunity costs as the “prices of metabolites,” [53] and seemingly independent of him, Craig and Weber and Akashi and Gojobori used this approach to partition the costs of amino acids into components associated with the investment in synthesis and the diversion of metabolic precursors [58,59].
To sum up, the total cost of constructing and maintaining a trait and diverting components from alternative usage to do so is the sum of the direct and opportunity costs over the entire life span of the cell,
| (2) |
whereas cD will be reflected in ATP hydrolysis reactions leading to heat dissipation (owing to imperfect exploitation of energy release), opportunity costs do not generate heat, as ATP is not consumed.
Given the near universality of many biosynthetic pathways and enzyme-reaction mechanisms across the Tree of Life, this general approach holds promise, as the steps for calculating most costs can often be gleaned from information in the literature [8,51]. The biosynthetic costs of the basic monomeric building blocks are on most solid footing. For example, although there is variation among residues, the average direct and opportunity costs per amino acid are ~2 and 25 ATP hydrolyses [51]. There is little variation in costs among nucleotides, the average being ~11 and 34 ATPs for DNA and 10 and 31 ATPs for RNA for the direct and opportunity costs, respectively. The costs of lipids are much higher—although there is variation among lipid types depending on the fatty acid chain length and the number of unsaturated carbons, as a first-order approximation the average total cost per lipid molecule is ~425 ATP equivalents, approximately one-third of which involves direct cost [22].
The evolutionary cost of a cellular feature
Given a measure of the total energetic cost of a cellular feature (or modification thereof), the evolutionary implications revolve around three issues: 1) use of an appropriate definition of the cost from the perspective of organismal fitness; 2) scaling of this cost relative to the total cost of building and maintaining the cell; and 3) conversion of the appropriately scaled measure to a fitness differential.
First, although we have defined cT in terms of direct and opportunity components, the latter will not always influence cellular performance. Consider, for example, an organism experiencing a saturating carbon/energy source but limited by a micronutrient such as iron. If the energy extracted from the food source is in excess supply and therefore underutilized, the appropriate energetic cost from a fitness perspective is simply cT ≃ cD.
Second, supposing the cell has a baseline total energy budget per cell cycle of CT (including the costs of both growth and maintenance; [8], the presence of the trait under consideration alters the lifetime energy budget to . Note that our view here is that CT is the sum of direct and opportunity costs, which includes the direct energetic expenditure of ATP on biosynthesis and the indirect diversion of energy-containing carbon skeletons (opportunity loss), as would be reflected in the consumption of glucose molecules consumed in a chemostat and their conversion to ATP equivalents. If the cell-division time is τ in the absence of the trait, this additional investment in the trait will alter the cell-division time to τ′ > τ (for the time being ignoring any downstream fitness benefits, as the goal is to understand the baseline investment costs).
Finally, to understand the total bioenergetic investment in a trait from a fitness perspective, we need to define its effects on the cell’s reproductive output relative to that in the absence of such an excess investment. Previous work intuited that the intrinsic selective disadvantage of any cellular feature is proportional to the fraction of the total energy pool per cell lifetime used up by the putative structure [8]. More precisely, the selective disadvantage sc of a cellular feature is defined as the rate of offspring production by a mutant containing the feature (with the total cost cT) relative to that of the “wild-type” without such a feature. Thus, for every wild-type offspring, there will be on average 1 – sc mutant offspring. Recent theoretical work justified such intuition [60], except that a correcting factor of In(R) is required, where R is the mean number of offspring produced per individual per generation. In the case of binary fission, R = 2, yielding
| (3) |
For organisms experiencing variable environments, sc might change substantially across generations owing to shifts in the numerator and/or denominator of this expression, and for a clonally reproducing organism this might be accommodated by evaluating geometric mean fitness.
The key parameter in this expression is the total energy budget of a cell, CT. There are a number of ways to obtain this, one of the most powerful being the method of Pirt [61], which requires growth of the study organism at different dilution rates in a chemostat—the slope of a plot of the rate of resource consumption vs. the dilution rate (equivalent to the cell-division rate) provides a direct estimate of the amount of resource consumption per cell division, whereas the intercept estimates the basal (time-dependent) rate of investment in maintenance. Generally, in applications of this method a single compounds, such as glucose, as the sole source of carbon skeletons and energy. Given a knowledge of fraction of carbon incorporated into biomass, the metabolic pathway(s) utilized, and known conversion factors, these measures can then be transformed into units of ATP utilization. Summarizing results from a wide variety of organisms, Lynch and Marinov (2015) [8] found that the total energy required to build a cell scales isometrically with cell volume (across both bacteria and eukaryotes). These results can be applied to Equation (5) as a first-order approximation in the absence of direct observations of CT on a study organism.
One caveat with respect to the preceding derivation is that it focuses entirely on the bioenergetic costs of producing a trait. In some cases, there may be additional side effects, as when a novel protein promiscuously interacts with inappropriate substrates and/or excessively occupies cellular volume or membrane real estate in ways that negatively impinge on r. Because the total fitness cost of a cellular addition is in most cases likely to be so small as to be beyond the reach of experimentation, separating these indirect effects from the more immediate bioenergetic costs may only be possible by computational analysis.
Finally, we note that an alternative route to estimating evolutionary costs may reside in the more systems-biology approach introduced by Molenaar and collaborators [62] and elaborated by others [63–65]. Here, the idea is that the internal biochemistry of the cell can be represented by a small number of mutually constrained molecular categories (e.g., ribosomes, nutrient acquisition, and all others), with the growth rate emerging as a function of the allocation of proteome to these different pools. In this context, a novel cellular feature could be viewed as competing energetically for resources allocated the pre-existing pools, and indeed this framework has already been used to explore the growth impediment owing to expression of a useless protein [63] or the insertion of a synthetic circuit [64]. It remains to be seen whether this conceptual framework, developed for Escherichia coli, is generalizable to other species, and as the tradeoff constraint employed is conceptually similar to the opportunity costs invoked above, it is possible that both approaches will yield qualitatively similar results.
Applications to Cell Biological Traits
With the basic theory in place, the selective consequences of modifying, eliminating, or recruiting in toto (by, e.g., horizontal transfer) any cellular feature can be evaluated provided certain information is available: 1) the energetic (or elemental) cost of the alteration; 2) the entire cost of building and maintaining the cell, which serves as a scaling factor for the former; and 3) the effective population size, which dictates the minimum relative cost that can be perceived by natural selection.
The general conceptual framework is outlined in Fig. 7. Ignoring the bioenergetic cost, a particular cellular feature may appear to endow an organism with increased fitness owing to its affects on performance in response to a range of ecological challenges. However, in order for the feature to be promoted by selection, the benefits accrued must sufficiently offset the costs of producing and maintaining the trait to a significant enough extent that the net selective advantage exceeds the power of random genetic drift. If this is not the case, the feature will be subject to loss by deactivating mutations.
Fig. 7.

The net selective advantage (disadvantage) of a cellular feature is a function of the organism’s ecological context, which defines sdirect, and the investment cost in constructing and maintaining the trait scost. The sign and magnitude of snet = sdirect − scost, relative to the drift barrier, determines whether the feature will be promoted by positive selection or subject to removal by deactivating mutations.
We now consider applications of the theory to some central issues in evolutionary cell biology, ranging from the genomic to the whole-organism level. With a goal of motivating further research, these examples merely outline the general issues, and there is ample room for further attention to technical details and extension to a diversity of organisms.
Gene and genome architecture
One of the clearest findings of the genomics era is the dramatic expansion of genome size and complexity from prokaryotes to unicellular eukaryotes to multicellular species [7]. Most of this inflation involves the addition of noncoding intergenic and intronic DNA. As natural selection places a premium on efficient resource utilization and reproduction in all species, how can this phylogenetic gradient in the expansion of nonfunctional DNA be explained? Accounting for the various biosynthetic, assembly, and maintenance costs, Lynch and Marinov found that most genes in E. coli have costs (summed over the DNA, mRNA, and protein levels) well above the drift barrier for this species (i.e., sc < 1/Ne ≃ 10−9) [8]. If, for example, one considers an insertion of just ten unexpressed nucleotides of double-stranded DNA, the total cost would be ~900 ATP hydrolyses (not including small additional processing costs associated with polymerization). Given that the total cost of building an E. coli cell is ~3 × 1010 ATPs, this means that the selective disadvantage of a functionally irrelevant 10-bp insertion of DNA is ~3 × 10−8, enough to be perceived by natural selection in this species.
Proceeding to yeast and then to multicellular species (Arabidopsis and humans), the absolute costs per gene (cT) increase, owing to the increased investment in introns, untranslated flanking regions, histones, and substantially higher numbers of mRNAs and proteins expressed per cell. However, with an increase in cell volume, the total cost of building cells (CT) increases 100- to 1000-fold up this gradient of organismal complexity, while the effective population size declines, lowering the drift barrier by factors of 10–1000 (Fig. 1). This results in the progressive repositioning of the costs of most genes relative to the drift barrier, such that insertions of noncoding DNA as long as 10–100 kb are nearly invisible to the eyes of natural selection. These observations provide a more formal view of the commonly invoked “genomic streamlining” hypothesis. Because of their enormous Ne, bacteria are subject to the efficient removal of DNA that does not pay its way in terms of a significant selective advantage. In contrast, although selection still opposes genomic bloating with nonfunctional DNA in eukaryotes, drift is powerful enough to render such selection ineffective.
These same general principles can be applied to another aspect of genome and proteome structure that has attracted considerable attention—the idea that the elemental composition of nucleic acids and proteins (e.g., C, N, P, and S content) is adaptively tuned to environmental conditions. Under this view, organisms living in environments depleted for a particular element are expected to evolve toward underutilization of the same element in their nucleotide and amino acid repertoires, and pathways involved in the metabolism of a scarce element are expected to evolve to avoid the inclusion of the same element in their infrastructure for acquiring said nutrient [66–68]. The advantages of such shifts at the level of single nucleotides or amino acids cannot be large, raising the question as to whether the population-genetic conditions necessary for the promotion of such modifications by natural selection are likely to be met. Here, it becomes necessary to measure costs in units of elemental composition rather than ATP equivalents.
Consider the use of carbon and nitrogen in nucleotides and amino acids. Over a range of growth conditions, the dry weight of E. coli cells consists of 45–47% carbon and 10–13% nitrogen [69]. The dry weight of an average cell in this species is ~0.00038 ng, implying ~9 × 109 C and ~2 × 109 N atoms per cell. An A:T pair contains 10 C and 7 N atoms, whereas a G:C pair contains 9 and 8, respectively. Thus, for both elements, an A:T vs. G:C exchange adds/subtracts just a single atom, changing the elemental usages of C and N by fractions of ~10−10 and 0.5 × 10−9, respectively, both of which are below the drift barrier for this species. As E. coli is a very high Ne species, this suggests that selection operating on elemental composition of single nucleotides is unlikely to be effective in cellular organisms.
In contrast, the maximum difference for C content among the 20 amino acids is 9 for tryptophan vs. glycine, and for N is 3 (arginine vs. several other residues). The maximum fractional impact of a single amino-acid substitution in a single-copy protein is then ~10−9 for C and 5 × 10−9 for N, close to the most generous drift barrier for bacteria (Fig. 1). However, most proteins are present in multiple copies per cell, so these numbers need to be multiplied accordingly. For E. coli, the mean number of proteins per cell is ≃2000 with a very large standard deviation (≃5000) [8], seemingly placing the total elemental costs of some amino-acid substitutions in the realm of selective sensitivity, as the maximum impact (sc) will commonly be in the range of 10−6 to 10−5.
It is less clear that this will be true in eukaryotes. The number of atoms of a particular type per cell can be expected to increase roughly linearly with cell volume, but the number of proteins percell increases with the 2/3rds power of cell volume [8]. Given these scalings, the cost/benefit of an amino acid substitution would be expected to decline with the 2/3rds power of cell volume. Because Ne declines by two to five orders of magnitude from bacteria to eukaryotes, it appears questionable as to whether cellular elemental composition can be driven by natural selection in eukaryotes, especially in multicellular species, except perhaps in the most highly expressed genes.
Membranes and cell walls
Owing to the relatively high biosynthetic costs of lipids, membranes constitute one of the predominant costs of building cells. All cells are surrounded by a lipid bilayer, and Gram-negative bacteria have two of these sandwiching a carbohydrate-derived cell wall. Plants and some fungi also have cell walls. Because external membranes form the surface of the cell, their area scales with the ~2/3 rds power of cell volume, thereby becoming proportionally less of an energy drain in larger cells. However, the emergence of eukaryotes was accompanied by the establishment of an internal-membrane system, including the nucleus, endoplasmic reticulum, Golgi, and vesicle-transport systems. Thus, it is of interest to know the degree to which the investment in internal membranes exceeds the reduction in cell-surface investment. Although a full census of internal membranes only exists for a few eukaryotes, the data suggest that these two opposing changes are roughly balanced, rendering a total contribution of lipids (internal plus external membranes) to the cell’s energy budget in the range of ~10 to 30% across eukaryotes, which is comparable to the ~ 20% investment in cell-membrane lipids in the moderate-sized bacteria Bacillus subtilis and E. coli ([22], see also the Supplemental Text).
These kinds of observations provide a context in which to evaluate Lane and Martin’s suggestion that the origin of the mitochondrion precipitated a bioenergetics revolution necessary for the origin of eukaryotes [70]. The crux of their argument is that movement of the power-generating machine of the cell, ATP synthase, to internal membranes bypassed surface:volume ratio limitations in bacteria (where ATP synthase resides in the cell membrane). These ideas need to be tempered by the fact that mitochondrial membranes impose an investment cost equivalent to increasing the energy requirements for building a eukaryotic cell by ~5%, and that when scaled to cell volume, the number of ribosomes and ATP-synthase complexes per eukaryotic cell are no greater than what is found in bacteria [8,22]. These observations and others (including the progressive reduction in cell-division rate with cell volume in eukaryotes but not prokaryotes [8]) are inconsistent with the idea that the establishment of the mitochondrion spawned a bioenergetic revolution.
We now turn to the previously unexplored issue of cell walls. Although such structures provide obvious advantages in certain ecological settings, cell walls have been lost in most eukaryotic and a few bacterial lineages, presumably during periods in which the investment in biosynthesis outweighed any fitness advantages. Indirect evidence suggests that cell walls comprise a substantial fraction of a cell’s energy budget. For example [71], estimate that ~21% of the biomass of a S. cerevisiae cell consists of the cell wall (~4% protein, 30% mannose, 60% glycan, and 1% chitin). The glycan subunits are derived from ~7.7 × 109 glucose molecules, which would otherwise be available for cellular energy or as carbon skeletons for other biosynthetic pathways ([72]. Similar numbers were derived for another yeast species, Candida albicans [72].
A more explicit statement can be made on the cost of surface layers in bacteria. Although their cell envelopes have diverse kinds of organization, bacteria can be roughly categorized into Gram-negative vs. Gram-positive classes. The Gram-negative envelope consists of an internal cell membrane, overlain by a peptidoglycan layer (the cell wall), and then by an outer membrane. In contrast, Gram-positive bacteria have thicker peptidoglycan layers and lack outer membranes. Bacterial cell envelopes have two fairly obvious roles—maintaining turgor pressure and preventing hazardous hydrophobic molecules from entering the cell, but other likely functions include attachment to environmental surfaces, and in the case of pathogens, evasion of the host’s immune response by masking the surface of the cell [73]. Regardless of the perceived benefits of the various layers, to be maintained by natural selection, the advantages must at least outweigh the costs of producing the putative structure.
Owing to the complex set of constituents in the bacterial cell wall and outer membrane, the computations of costs of these layers are quite involved, but using the known biosynthetic pathways for all subunits, we outline the basic steps in the Supplemental Text. Here we focus on just the main findings. The basic carbohydrate subunit of peptidoglycan consists of two fused molecules, NAG (N-acetylglucosamine) and NAM (N-acetylmuramic acid), with short peptide chains fused to the latter for cross-bridging. Applying the methods outlined in Ref. [51] to the known pathway for peptidoglycan synthesis, the direct cost of synthesis of each peptidoglycan unit is equivalent to ~15 ATP hydrolyses, whereas the opportunity cost is equivalent to ~208 ATPs. There are ~3.5 × 106 disaccharides in the entire cell wall of the Gram-negative E. coli [74,75], suggesting a direct cost of ~5.25 × 107 ATP hydrolyses for cell-wall biosynthesis. To put this in context, the direct cost of lipids in the cell membrane of E. coli is ~ 2.5× 108 ATPs. Although these numbers are approximate, they suggest that the fraction of a Gram-negative bacterium’s energy budget associated with the cell wall is ~10× less than the cost of the cell membrane lipids.
Expanding this approach to the outer membrane components in E. coli and to the cell envelope of the Gram-positive bacterium B. subtilis (Supplemental Text), allows some general statements about the overall envelope costs in these two species (Fig. 8). First, despite the large organizational differences and their presumed ecological consequences, both types of envelopes constitute ~30% of the cell’s total ATP utilization, with the opportunity costs greatly exceeding the direct costs. Second, membranes are the most expensive components of the envelope in both types of bacteria. Contrary to previously reported results [22], the membrane of B. subtilis is costlier than that of E. coli owing to the former’s high content of branched fatty acids (see Supplemental Text). Third, the most expensive molecular constituent of bacterial envelopes is the sugar decorations—lipopolysaccharide (2243 ATPs) in Gram-negative, and wall teichoic acid (1916 ATPs) and lipoteichoic acid (506 ATPs) in Gram-positive bacteria (Supplement Fig. 2). For comparison, the average lipid costs are ~300 ATPs (271 in E. coli and 344 in B. subtilis). Fourth, peptidoglycan of B. subtilis costs more than that of E. coli, and this estimate is conservative given that it assumed that B. subtilis has only five layers.
Fig. 8.

Percentages of the total energy pool invested in the production of a particular envelope component. Each color signifies the total cost of a particular envelope component. Each slice with the lighter coloration represent the direct cost fraction of the correspondingly colored envelope component. “Cell budget remainder” refers to all remaining cellular processes and structures not involved in envelope production. See Supplemental Text for calculation details.
Motility
Many cells can swim, and it is easy to imagine adaptive reasons for doing so, e.g., directive movement toward patchy resources and avoidance of predators, but not all cells living in aquatic environments can self-propel. The latter include many planktonic bacteria, diatoms, and green algae. This suggests that the energetic cost of swimming and building the mobility apparatus may exceed the benefits in certain settings.
The power requirement for swimming can be estimated as the work necessary to move an object at a particular velocity. However, the estimated costs of moving flagella indicate that the cellular conversion of chemical energy into directed swimming is universally low, averaging ~2% in diverse unicellular species (bacteria, archaea, ciliates, euglenoids, and green algae) [76–84]. Brownian-motion jostling of cells, rotational diffusion, helical swimming patterns, and flexibility of the flagellum are among the reasons for this low efficiency. Nonetheless, it has often been suggested that the total cost of swimming is trivially small in an absolute sense, e.g., often ≤ 1% of a cell’s baseline energy budget [77,85–87]. As has already been emphasized, however, what may seem small in a fractional energetic sense can be quite visible to the eyes of natural selection.
An understanding of what the cost of swimming means in evolutionary terms can be obtained from the scaling relationship between average swimming speed (v in units of μm/sec) and cell volume (V in units of μm3) observed for a wide variety of unicellular eukaryotic species, v ≃ 10.5V0.30 (Fig. 9). Noting that the radius of a sphere is (3V/4π)1/3, this implies an average swimming speed for unicellular eukaryotes of ~8.5 /sec in units of effective cell diameters, independent of cell volume. By comparison, the peak speeds of the fastest fish (marlins and sailfish) are body lengths/sec.
Fig. 9.

Scaling relationships between swimming velocities (v) and cell volume (V) in unicellular species. For eukaryotes, v = 10.5V0.30(r2 = 0.49); and for bacteria, v = 27.8V0:19(r2 = 0.21). The measured values are corrected for temperature assuming a Q10 of 2.0. See Supplemental Table 3 for data and their sources.
How much energy does such activity demand? Applying Stokes’ Law to the low Reynolds number situations in which microbes live, fora sphere with ra dius r, the power required for swimming,
| (4) |
scales with the product of the viscosity of the medium (η) and the squared swimming velocity (v), and to account for the inefficient conversion from cellular chemical energy to motion, this must further be divided by ~0.02 (noted above). Assuming spherical cells, the scaling relationship for eukaryotes in Fig. 9 implies rv2 ≃ 68V μm3/sec2, suggesting a power scaling that is essentially isometric with respect to cell volume (because v2 scales as ~ V0.60 and r scales as ~V0.33). Noting that the viscosity of water is 10−2 g·cm−1 ·sec−1 (assuming 20° C), after appropriate changes of units, the average power requirement of swimming is estimated to be 640 × 10−19)V kg • m2 • sec−3, which has equivalent units of joules/sec (here, cell volume V is in units of μm3).
To put this in more familiar terms, note that the energy associated with the hydrolysis of 1 mol of ATP at physiological conditions is ~50 kilojoules ([88], Supplementary). Recalling Avogadro’s number of molecules per mole, the power requirement for swimming in units of ATP molecules hydrolyzed/hour is then ~(2.5 × 105)V. Because average basal metabolic rates also scale isometrically with cell volume, as ~(0.4 × 109)V ATP hydrolyses/hour [8], this suggests a fractional energetic investment (relative to baseline metabolism) in swimming of ~0.06% (assuming the cell is constantly swimming), independent of cell volume. This estimate does not include the biosynthetic cost of the proteins comprising a flagellum and its basal protein parts, which in the case of an E. coli flagellum amounts to ~3.5 × 108 ATP hydrolyses in terms of direct costs alone (Supplemental Text).
These rough calculations allow some preliminary conclusions with respect to swimming in bacteria. First, recalling from above that the ATP cost of swimming is ~(2.5 × 105)V per hour for eukaryotes (and not much higher for bacteria), because the average volume of an E. coli cell is μm3, and the cell-division time is typically no more than a few hours, almost all of the cost of swimming is associated with building rather than operating the apparatus. This conclusion is strengthened by the fact that E. coli cells generally harbor several flagella. Assuming a moderate cell division time of 1 h and a maximum of 10 flagella per cell, the lifetime cost of swimming is ~(2.5 × 105), whereas the investment in the flagellar infrastructure is ~4000× higher. Second, from Ref. [8], the total cost of building an E. coli cell is ~3 × 1010 ATP hydrolyses, implying that the bioenergetic costs of operating and building each flagellum are on the order of 0.003 and 1.2%, respectively, of the total cellular energy budget allocated to growth.
Although the structures and modes of operation of eukaryotic flagella are completely different from those in bacteria, similar calculations for eukaryotes (Supplemental Text) suggest somewhat lower total costs of swimming but roughly comparable costs of flagellar construction. For example, noting that the cellular volume of the biflagellated green alga Chlamydomonas reinhardtii is ~200 μm3, and allowing for a 24-h cell division time implies an energetic cost of swimming of ~1 × 109 ATP per cell life time, whereas the total protein biosynthesis costs of flagellar construction is ~21 × 109 ATP hydrolyses (for each of the two flagella). In contrast to the situation in most bacteria, eukaryotic flagella are surrounded by lipid bilayers, which in the case of each Chlamydomonas flagellum requires another ~ 13 × 109 ATPs in direct biosynthetic costs. Accounting for the two flagella, the total cost of building the apparatus is then on the order of 70× that of actual swimming. With the total cost of building a C. reinhardtii cell being equivalent to ~5.4 × 1012 ATP hydrolyses [8], the relative costs of operating and constructing the flagella are 0.02 (again, assuming swimming at a constant rate) and 1.2%, respectively.
Thus, from an evolutionary perspective, the cost of swimming constitutes a significant fraction of a cell’s total energy budget. It is, therefore, no surprise that many bacteria have evolved mechanisms for selective use of flagella, expressing the necessary genes only in environments providing significant benefits [89]. Prolonged periods of silencing of flagellar genes would be expected to eventually lead to the accumulation of deactivating mutations (and complete loss of motility), and this presumably explains the absence of flagella/cilia from numerous branches in the Tree of Life.
Closing Comments
Our primary goal has been to seed the development of a formal field of evolutionary cell biology based on well-established principles in evolutionary genetics. One of the great problems in biology is the common adherence to the belief that all aspects of biodiversity have been optimized by processes of natural selection. Compelling empirical evidence suggests otherwise at the molecular and cellular levels, and the theory presented above provides a formal path to understanding the limits to what selection can accomplish in alternative population-genetic environments.
Natural selection is ubiquitous, and we are not advocating that overtly maladaptive evolution is rampant, but simply that there is ample room in the domain of effective neutrality in most species, particularly eukaryotes, that significant changes at the molecular and cellular levels are likely to accrue over long evolutionary time periods in the absence of diversifying selection. Once established, the cumulative changes in gene structure and genome architecture dictated by drift and biased mutation results in a fundamental shift in the types of real estate on which natural selection operates via descent with modification.
The steady-state distribution approach that we present is meant to yield insight into the degree to which mean phenotypes might exhibit scaling relationships with organism size, as a consequence of the latter having side effects on the power of random genetic drift. This approach also provides a basis for understanding how mutation, even if unbiased, can push a mean phenotype away from its optimum, the more so the smaller the effective population size. Finally, the concept of haplotypic multiplicity provides a conceptual basis for understanding how species exposed to identical pressures of mutation, selection, and random genetic drift can come to harbor alternative molecular/cellular features.
Consistent with this view, accumulating evidence implies that there are often multiple degrees of freedom for accomplishing the same cellular function, post-translational modification being one such example. Others examples include 1) the multiple layers of surveillance in replication fidelity, which are free to wander so long as the joint level of fidelity consistent with the drift barrier is maintained [90]; 2) the alternative multimeric states of homologous proteins across the Tree of Life [25]; and 3) shifts in transcription-factor binding motifs [1]. Most notable is the emergence of many examples of the evolutionary rewiring of the components of key pathways with no known change in function, including: meiosis and the cell cycle [91–93]; biosynthesis [94–98]; motility [99]; mating-type specification [100,101]; and ribosome production [102,103]. These diverse observations suggest that redundancies are common even at higher levels of organization, at least in eukaryotes, allowing for effectively neutral divergence.
The evolutionary-cost approach that we present provides a more explicit basis for understanding the granularity of the types of mutational changes that can be perceived by natural selection in different population-genetic contexts. Such analyses raise the possibility of evaluating the degree to which mildly deleterious alterations can passively accumulate by mutation pressure alone in small- Ne lineages, while being efficiently purged from species with large Ne. Likewise, it becomes possible to understand the elevated degree of evolutionary fine-tuning that can be accomplished by selection in small- Ne populations, with large- Ne populations being biased toward restricted use of large-effect adaptive mutations, which are almost certainly less frequent than the former.
The evolutionary-cost framework may also serve as a useful guide to understanding the conditions necessary for the passive emergence of cellular complexity by the process of constructive neutral evolution (CNE [104–106]). Under this model, an ancestral cellular function is carried out by the product of a single gene (A). A fortuitous binding interaction then develops with another protein B, which initially has negligible negative effects on both A and B’s functionality and on cell fitness. However, by binding to part of A’s surface, B may suppress the effects of future mutations arising at the A:B interface that would otherwise compromise the stability of A. Over time, this permissive interfacial environment might lead to enough mutational buildup that A would no longer be functional in the absence of B. In principle, this evolved functional dependence of A on B could be followed by a similar scenario involving a third protein, C, and so on, imposing a sort of neutral ratchet-like mechanism toward complexity. Under this scenario, the intricate interdependencies of the large numbers of subcomponents of complexes like the ribosome and the spliceosome need not have been selected for on the basis of their intrinsic advantages, but instead could be at least in part the result of a long series of fortuitous and effectively neutral steps accompanied by the accumulation of previously forbidden mutations, which then enforces collaboration.
This verbal model for CNE illustrates the need for the kinds of quantitative analyses made possible by evolutionary-cost evaluation. One of the key assumptions of the CNE hypothesis is the existence of excess capacity within cells. This assumption is critical because the successful evolutionary diversion of B molecules to A requires that any preexisting benefits of B are not compromised to a level that could be opposed by natural selection. As excess capacity implies a superfluous energetic drain on the ancestral cell, this raises the question as to why such a condition would exist in the first place. Based on the theory outlined above, such a scenario would be most likely if the initial condition of excess capacity of B were effectively neutral, which is more likely with a small effective population size, consistent with the idea that small Ne facilitates the passive emergence of cellular complexity.
Finally, we note that in principle, there is no reason the general approach to estimating evolutionary costs cannot be applied to multicellular organisms, although owing to uncertainties regarding tissue-specific expression, the necessary computations will be more involved. An interesting start in this direction has been made by Ref. [107] in an analysis of the cost of signal transduction in the cell cycle of developing embryos.
Supplementary Material
Acknowledgments
This research was supported by the Multidisciplinary University Research Initiative awards W911NF-09-1-0444 and W911NF-14-1-0411 from the US Army Research Office, National Institutes of Health award R35-GM122566-01, and National Science Foundation award MCB-1518060 to ML. We thank Paul Schavemaker, Guillaume Le Treut, Ron Milo and two anonymous reviewers for helpful comments.
Footnotes
Appendix A.: Supplementary data
Supplementary data to this article can be found online at https://doi.org/10.1016/j.jmb.2020.02.006.
References
- [1].Lynch M, Field MC, Goodson H, Malik HS, Pereira-Leal JB, Roos DS, Turkewitz A, Sazer S, Evolutionary cell biology: two origins, one objective, Proc. Natl. Acad. Sci. U.S.A 111 (2014) 16990–16994, 10.1073/pnas.1415861111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Charlesworth B, Charlesworth D, Elements of Evolutionary Genetics, W. H. Freeman, New York, NY, 2010. [Google Scholar]
- [3].Walsh JB, Lynch M, Evolution and Selection of Quantitative Traits, Oxford Univ. Press, Oxford, UK, 2018. [Google Scholar]
- [4].Gould SJ, Lewontin RC, The spandrels of san marco and the panglossian paradigm: a critique of the adaptationist programme, Proc. R. Soc. London, Ser. A or B 205 (1979) 581–598, 10.1098/rspb.1979.0086. [DOI] [PubMed] [Google Scholar]
- [5].Lynch M, The frailty of adaptive hypotheses for the origins of organismal complexity, Proc. Natl. Acad. Sci. U.S.A 104 (Suppl.) (2011) 8597–8604, 10.1002/iub.489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Kimura M, The Neutral Theory of Molecular Evolution, Cambridge Univ. Press, Cambridge, UK, 1983. [Google Scholar]
- [7].Lynch M, The Origins of Genome Architecture, Sinauer Assocs., Inc., Sunderland, MA, 2007. [Google Scholar]
- [8].Lynch M, Marinov GK, The bioenergetic costs of a gene, Proc. Natl. Acad. Sci. U.S.A 112 (2015) 15690–15695, 10.1073/pnas.1514974112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Charlesworth B, Effective population size and patterns of molecular evolution and variation, Genetics 10 (2009) 195–205, 10.1038/nrg2526. [DOI] [PubMed] [Google Scholar]
- [10].Ohta T, Slightly deleterious mutant substitutions in evolution, Nature 246 (1973) 96–98, 10.1038/246096a0. [DOI] [PubMed] [Google Scholar]
- [11].Charlesworth B, The effects of deleterious mutations on evolution at linked sites, Genetics 190 (2012) 5–22, 10.1534/genetics.111.134288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Neher RA, Genetic draft, selective interference, and population genetics of rapid adaptation, Annu. Rev. Ecol. Evol. Syst 44 (2013) 195–215, 10.1146/annurev-ecolsys-110512-135920. [DOI] [Google Scholar]
- [13].Lynch M, Blanchard J, Houle D, Kibota T, Schultz S, Vassilieva L, Willis J, Spontaneous deleterious mutation, Evolution 53 (1999) 645–663, 10.2307/2640707. [DOI] [PubMed] [Google Scholar]
- [14].Baer CF, Miyamoto MM, Denver DR, Mutation rate variation in multicellular eukaryotes: causes and consequences, Nat. Rev. Genet 8 (2007) 619–631, 10.1038/nrg2158. [DOI] [PubMed] [Google Scholar]
- [15].Keightley PD, The distribution of mutation effects on viability in Drosophila melanogaster, Genetics 138 (1994) 1315–1322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Robert L, Ollion J, Robert J, Song X, Matic I, Elez M, Mutation dynamics and fitness effects followed in single cells, Science 359 (2018) 1283–1286, 10.1126/science.aan0797. [DOI] [PubMed] [Google Scholar]
- [17].Böndel KB, Kraemer SA, Samuels T, McClean D, Lachapelle J, Ness RW, Colegrave N, Keightley PD, Inferring the distribution of fitness effects of spontaneous mutations in Chlamydomonas reinhardtii, PLoS Biol. 17 (2019), 10.1371/\text{journal}.pbio.3000192 e3000192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Keightley PD, Eyre-Walker A, Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies, Genetics 177 (2007) 2251–2261, 10.1534/genetics.107.080663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Bataillon T, Bailey SF, Effects of new mutations on fitness: insights from models and data, Ann. N. Y. Acad. Sci 1320 (2014) 76–92, 10.1111/nyas.12460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Huber CD, Kim BY, Marsden CD, Lohmueller KE, Determining the factors driving selective effects of new nonsynonymous mutations, Proc. Natl. Acad. Sci. U.S.A 114 (2015) 4465–4470, 10.1073/pnas.1619508114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Kim BY, Huber CD, Lohmueller KE, Inference of the distribution of selection coefficients for new nonsynonymous mutations using large samples, Genetics 206 (2017) 345–361, 10.1534/genetics.116.197145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Lynch M, Ackerman M, Spitze K, Ye Z, Maruki T, Population genomics of Daphnia pulex, Genetics 206 (2016) 315–332, 10.1534/genetics.116.190611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Sung W, Ackerman MS, Dillon M, Platt T, Fuqua C, Cooper V, Lynch M, Evolution of the insertion-deletion mutation rate across the tree of life, G3: Genes Genomes Genetics 6 (2016) 2583–2591, 10.1534/g3.116.030890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Kimura M, On the probability of fixation of mutant genes in a population, Genetics 47 (1962) 713–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Lynch M, Evolutionary diversification of the multimeric states of proteins, Proc. Natl. Acad. Sci. U.S.A 110(2013) E2821–E2828, 10.1073/pnas.1310980110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Lynch M, Phylogenetic diversification of cell biological features, Elife 7 (2018), 10.7554/eLife.34820 e34820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Yampolsky LY, Stoltzfus A, Bias in the introduction of variation as an orienting factor in evolution, Evol. Dev 3 (2001) 73–83, 10.1046/j.1525-142x.2001.003002073.x. [DOI] [PubMed] [Google Scholar]
- [28].Stoltzfus A, Cable K, Mendelian-mutationism: the forgotten evolutionary synthesis, J. Hist. Biol 47 (2014) 501–546, 10.1007/s10739-014-9383-2. [DOI] [PubMed] [Google Scholar]
- [29].Storz JF, Natarajan C, Signore AV, Witt CC, McCandlish DM, Stoltzfus A, The role of mutation bias in adaptive molecular evolution: insights from convergent changes in protein function, Philos. Trans. R. Soc. Lond. B Biol. Sci 374 (2019) 20180238, 10.1098/rstb.2018.0238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Svensson EI, Berger D, The role of mutation bias in adaptive evolution, Trends Ecol. Evol 34 (2019) 422–434, 10.1016/j.tree.2019.01.015. [DOI] [PubMed] [Google Scholar]
- [31].Lynch M, Ackerman M, Gout J-F, Long H, Sung W, Thomas WK, Foster PL, Genetic drift, selection, and evolution of the mutation rate, Nat. Rev. Genet 17 (2016) 704–714, 10.1038/nrg.2016.104. [DOI] [PubMed] [Google Scholar]
- [32].Long H, Sung W, Kucukyildirim S, Williams E, Guo W, Patterson C, Gregory C, Strauss C, Stone C, Berne C, Kysela D, Shoemaker WR, Muscarella M, Luo H, Lennon JT, Brun YV, Lynch M, Evolutionary determinants of genome-wide nucleotide composition, Nature Ecol. Evol 2 (2017) 237–240, 10.1038/s41559-017-0425-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Kimura M, On the evolutionary adjustment of spontaneous mutation rates, Genet. Res 9 (2017) 23–34, 10.1017/S0016672300010284. [DOI] [Google Scholar]
- [34].Lynch M, The cellular, developmental, and population-genetic determinants of mutation-rate evolution, Genetics 180 (2008) 933–943, 10.1534/genetics.108.090456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Lynch M, The lower bound to the evolution of mutation rates, Genome Biol. Evol 3 (2011) 1107–1118, 10.1093/gbe/evr066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Gout JF, Thomas WK, Smith Z, Okamoto K, Lynch M, Large-scale detection of in vivo transcription errors, Proc. Natl. Acad. Sci. U.S.A 110 (2013) 18584–18589, 10.1073/pnas.1309843110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Gout J-F, Li W, Fritsch C, Li A, Haroon S, Singh L, Hua D, Fazelinia H, Seeholzer S, Lynch M, Vermulst M, The landscape of transcription errors in eukaryotic cells, Proc. Natl. Acad. Sci. U.S.A 3 (2017), 10.1126/sciadv.1701484 e1701484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Landry CR, Levy ED, Michnick SW, Weak functional constraints on phosphoproteomes, Trends Genet. 25 (5) (2009) 193–197, 10.1016/j.tig.2009.03.003. [DOI] [PubMed] [Google Scholar]
- [39].Nguyen Ba AN, Moses AM, Evolution of characterized phosphorylation sites in budding yeast, Mol. Biol. Evol 27 (2010) 2027–2037, 10.1093/molbev/msq090. [DOI] [PubMed] [Google Scholar]
- [40].Gray VE, Kumar S, Rampant purifying selection conserves positions with posttranslational modifications in human proteins, Mol. Biol. Evol 28 (2011) 1565–1568, 10.1093/molbev/msr013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Levy ED, Michnick SW, Landry CR, Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information, Philos. Trans. R. Soc. Lond. B Biol. Sci 367 (2012) 2594–2606, 10.1098/rstb.2012.0078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Moses AM, Liku ME, Li JJ, Durbin R, Regulatory evolution in proteins by turnover and lineage-specific changes of cyclin-dependent kinase consensus sites, Proc. Natl. Acad. Sci. U.S.A 104 (2007) 17713–17718, 10.1073/pnas.0700997104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Holt LJ, Tuch BB, Villén J, Johnson AD, Gygi SP, Morgan DO, Global analysis of cdk1 substrate phosphorylation sites provides insights into evolution, Science 325 (2009) 1682–1686, 10.1126/science.1172867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Freschi L, Osseni M, Landry CR, Functional divergence and evolutionary turnover in mammalian phosphoproteomes, PLoS Genet. 10 (1) (2014), 10.1371/journal.pgen.1004062 e1004062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Studer RA, Evolution of protein phosphorylation across 18 fungal species, Science 354 (2016) 229–232, 10.1126/science.aaf2144. [DOI] [PubMed] [Google Scholar]
- [46].Kurmangaliyev YZ, Goland A, Gelfand MS, Evolutionary patterns of phosphorylated serines, Biol. Direct 6 (8) (2011), 10.1186/1745-6150-6-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [47].Pearlman SM, Serber Z, Ferrell JE Jr., A mechanism for the evolution of phosphorylation sites, Cell 147 (2011) 934–946, 10.1016/j.cell.2011.08.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Diss G, Freschi L, Landry CR, Where do phosphosites come from and where do they go after gene duplication? Int. J. Evol. Biol 2012 (2012) 843167, 10.1155/2012/843167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].Lienhard GE, Non-functional phosphorylations? Trends Biochem. Sci 33 (2008) 351–352, 10.1016/j.tibs.2008.05.004. [DOI] [PubMed] [Google Scholar]
- [50].Landry CR, Freschi L, Zarin T, Moses AM, Turnover of protein phosphorylation evolving under stabilizing selection, Front. Genet 5 (2014) 245, 10.3389/fgene.2014.00245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Mahmoudabadi G, Phillips M, Lynch R, Milo R, Defining the energetic costs of cellular structures, bioRxiv 6 (2019). www.biorxiv.org/content/10.1101/666040v1. [Google Scholar]
- [52].Bauchop T, Elsden SR, The growth of micro-organisms in relation to their energy supply, J. Gen. Microbiol 23 (1960) 457–469, 10.1099/00221287-23-3-457. [DOI] [PubMed] [Google Scholar]
- [53].Atkinson DE, Adenine nucleotides as universal stoichiometric metabolic coupling agents, Adv. Enzym. Regul 9 (1970) 207–219, 10.1016/S0065-2571(71)80045-6. [DOI] [PubMed] [Google Scholar]
- [54].Stouthamer AH, A theoretical study on the amount of atp required for synthesis of microbial cell material, Antonie Leeuwenhoek 39 (1973) 545–565, 10.1007/BF02578899. [DOI] [PubMed] [Google Scholar]
- [55].Tempest DW, Neijssel OM, The status of yatp and maintenance energy as biologically interpretable phenomena, Annu. Rev. Microbiol 38 (1984) 459–486, 10.1146/annurev.mi.38.100184.002331. [DOI] [PubMed] [Google Scholar]
- [56].Russell JB, Cook GM, Energetics of bacterial growth: balance of anabolic and catabolic reactions, Microbiol. Rev 59 (1995) 48–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].Kharasch MS, Sher B, The electronic conception of valence and heats of combustion of organic compounds, J. Phys. Chem 29 (1925) 625–658, 10.1021/j150252a001UT. [DOI] [Google Scholar]
- [58].Craig CL, Weber RS, Selection costs of amino acid substitutions in cole 1 and colia gene clusters harbored by Escherichia coli, Mol. Biol. Evol 15 (1998) 774–776, 10.1093/oxfordjournals.molbev.a025981. [DOI] [PubMed] [Google Scholar]
- [59].Akashi H, Gojobori T, Metabolic efficiency and amino acid composition in the proteomes of Escherichia coli and Bacillus subtilis, Proc. Natl. Acad. Sci. U.S.A 99 (2002) 3695–3700, 10.1073/pnas.062526999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Ilker E, Hinczewski M, Modeling the growth of organisms validates a general relation between metabolic costs and natural selection, Phys. Rev. Lett 122 (2019) 238101, 10.1103/PhysRevLett.122.238101. [DOI] [PubMed] [Google Scholar]
- [61].Pirt SJ, Maintenance energy: a general model for energy-limited and energy-sufficient growth, Arch. Microbiol 133 (1982) 300–302, 10.1007/BF00521294. [DOI] [PubMed] [Google Scholar]
- [62].Molenaar D, Van Berlo R, De Ridder D, Teusink B, Shifts in growth strategies reflect tradeoffs in cellular economics, Mol. Syst. Biol 5 (1) (2009) 1099–1102, 10.1126/science.1192588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T, Interdependence of cell growth and gene expression: origins and consequences, Science 330 (6007) (2010) 1099–1102, 10.1126/science.1192588. [DOI] [PubMed] [Google Scholar]
- [64].Weiße AY, Oyarzú DA, Danos V, Swain PS, Mechanistic links between cellular trade-offs, gene expression, and growth, Proc. Natl. Acad. Sci. U.S.A 112 (9) (2015) E1038–E1047, 10.1073/pnas.1416533112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [65].Pandey PP, Jain S, Analytic derivation of bacterial growth laws from a simple model of intracellular chemical dynamics, Theor. Biosci 135 (3) (2016) 121–130, 10.1007/s12064-016-0227-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [66].Baudouin-Cornu P, Surdin-Kerjan Y, Marliere P, Thomas D, Molecular evolution of protein atomic composition, Science 293 (5528) (2001) 297–300, 10.1126/science.1061052. [DOI] [PubMed] [Google Scholar]
- [67].Alves R, Savageau MA, Evidence of selection for low cognate amino acid bias in amino acid biosynthetic enzymes, Mol. Microbiol 56 (4) (2005) 1017–1034, 10.1111/j.1365-2958.2005.04566.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [68].Acquisti C, Kumar S, Elser JJ, Signatures of nitrogen limitation in the elemental composition of the proteins involved in the metabolic apparatus, Proc. Biol. Sci 276 (1667) (2009) 2605–2610, 10.1126/science.1061052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [69].Folsom JP, Carlson RP, Physiological, biomass elemental composition and proteomic analyses of Escherichia coli ammonium-limited chemostat growth, and comparison with iron- and glucose-limited chemostat growth, Microbiology 161 (2015) 1659–1670, 10.1099/mic.0.000118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [70].Lane N, Martin W, The energetics of genome complexity, Nature 467 (7318) (2010) 929–934, 10.1038/nature09486. [DOI] [PubMed] [Google Scholar]
- [71].Yin QY, de Groot PW, de Jong L, Klis FM, De Koster CG, Mass spectrometric quantitation of covalently bound cell wall proteins in Saccharomyces cerevisiae, FEMS Yeast Res. 7 (2007) 887–896, 10.1111/j.1567-1364.2007.00272.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [72].Klis FM, de Koster CG, Brul S, Cell wall-related bionumbers and bioestimates of Saccharomyces cerevisiae and Candida albicans, Eukaryot. Cell 13 (2014) 2–9, 10.1128/EC.00250-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [73].Neidhardt FC, Ingraham JL, Schaechter M, Physiology of the Bacterial Cell: a Molecular Approach, Sinauer Associates, Sunderland, MA, 1990. [Google Scholar]
- [74].Wientjes FB, Woldringh CL, Nanninga N, Amount of peptidoglycan in cell walls of gram-negative bacteria, J. Bacteriol 173 (1991) 7684–7691, 10.1128/jb.173.23.7684-7691.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [75].Gumbart JC, Beeby M, Jensen GJ, Roux B, Escherichia coli peptidoglycan structure and mechanics as predicted by atomic-scale simulations, PLoS Comput. Biol 10 (2014), 10.1371/journal.pcbi.1003475 e1003475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [76].Gittleson SM, Noble RM, Locomotion in Polytomella agilis and Polytoma uvella, Trans. Am. Microsc. Soc 92 (1973) 122–128, 10.2307/3225176. [DOI] [Google Scholar]
- [77].Purcell EM, Life at low Reynolds number, Am. J. Phys 45 (1977) 3–11, 10.1119/1.10903. [DOI] [Google Scholar]
- [78].Meister M, Lowe G, Berg HC, The proton flux through the bacterial flagellar motor, Cell 49 (1987) 643–650, 10.1016/0092-8674(87)90540-x. [DOI] [PubMed] [Google Scholar]
- [79].Chattopadhyay S, Moldovan R, Yeung C, Wu XL, Swimming efficiency of bacterium Escherichia coli, Proc. Natl. Acad. Sci. U.S.A 103 (2006) 13712–13717, 10.1073/pnas.0602043103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [80].Katsu-Kimura Y, Nakaya F, Baba SA, Mogami Y, Substantial energy expenditure for locomotion in ciliates verified by means of simultaneous measurement of oxygen consumption rate and swimming speed, J. Exp. Biol 212 (2009) 1819–1824, 10.1242/jeb.028894. [DOI] [PubMed] [Google Scholar]
- [81].Tam D, Hosoi AE, Optimal feeding and swimming gaits of biflagellated organisms, Proc. Natl. Acad. Sci. U.S.A 108 (2011) 1001–1006, 10.1073/pnas.1011185108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [82].Arroyo M, Heltai L, Millán D, DeSimone A, Reverse engineering the euglenoid movement, Proc. Natl. Acad. Sci. U.S.A 109 (2012) 17874–17879, 10.1073/pnas.1213977109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [83].Barsanti L, Coltelli P, Evangelista V, Frassanito AM, Gualtieri P, Swimming patterns of the quadriflagellate Tetraflagellochloris mauritanica (chlamydomonadales, chlorophyceae), J. Phycol 52 (2016) 209–218, 10.1111/jpy.12384. [DOI] [PubMed] [Google Scholar]
- [84].Kinosita Y, Uchida N, Nakane D, Nishizaka T, Direct observation of rotation and steps of the archaellum in the swimming halophilic archaeon Halobacterium salinarum, Nat. Microbiol 1 (2016) 16148, 10.1038/nmicrobiol.2016.148. [DOI] [PubMed] [Google Scholar]
- [85].Fenchel T, Finlay BJ, Respiration rates in heterotrophic, free-living protozoa, Microb. Ecol 9 (1983) 99–122, 10.1007/BF02015125. [DOI] [PubMed] [Google Scholar]
- [86].Raven JA, Richardson K, Dinophyte flagella: a cost-benefit analysis, New Phytol. 98 (1984) 259–276, 10.1111/j.1469-8137.1984.tb02736.x. [DOI] [Google Scholar]
- [87].Crawford DW, Metabolic cost of motility in planktonic protists: theoretical considerations on size scaling and swimming speed, Microb. Ecol 24 (1–10) (1992), 10.1007/BF00171966. [DOI] [PubMed] [Google Scholar]
- [88].Milo R, Phillips R, Cell Biology by the Numbers, Garland Science, New York, NY, 2016. [Google Scholar]
- [89].Ferreira JL, Gao FZ, Rossmann FM, Nans A, Brenzinger S, Hosseini R, Beeby M, Gamma-proteobacteria eject their polar flagella under nutrient depletion, retaining flagellar motor relic structures, PLoS Biol. 17 (3) (2019), 10.1371/\journal.pbio.3000165 e3000165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [90].Lynch M, Evolutionary layering and the limits to cellular perfection, Proc. Natl. Acad. Sci. U.S.A 109 (2011) 18851–18856, 10.1073/pnas.1216130109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [91].Cross FR, Buchler NE, Skotheim JM, Evolution of networks and sequences in eukaryotic cell cycle control, Philos. Trans. R. Soc. Lond. B Biol. Sci 366 (2011) 3532–3544, 10.1098/rstb.2011.0078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [92].Harashima H, Dissmeyer N, Schnittger A, Cell cycle control across the eukaryotic kingdom, Trends Cell Biol. 23 (2013) 345–356, 10.1016/j.tcb.2013.03.002. [DOI] [PubMed] [Google Scholar]
- [93].Kohl KP, Sekelsky J, Meiotic and mitotic recombination in meiosis, Genetics 194 (2013) 327–334, 10.1534/genetics.113.150581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [94].Jensen RA, Pierson DL, Evolutionary implications of different types of microbial enzymology for l-tyrosine biosynthesis, Nature 254 (2019) 667–671, 10.1038/254667a0. [DOI] [PubMed] [Google Scholar]
- [95].Koonin EV, Mushegian AR, Bork P, Non-orthologous gene displacement, Trends Genet. 12 (1996) 334–336, 10.1016/S0168-9525(96)80003-5. [DOI] [PubMed] [Google Scholar]
- [96].Barreteau H, Kovac A, Boniface A, Sova M, Gobec S, Blanot D, Cytoplasmic steps of peptidoglycan biosynthesis, FEMS Microbiol. Rev 32 (2008) 168–207, 10.1111/j.1574-6976.2008.00104.x. [DOI] [PubMed] [Google Scholar]
- [97].Omelchenko MV, Galperin MY, Wolf YI, Koonin EV, Non-homologous isofunctional enzymes: a systematic analysis of alternative solutions in enzyme evolution, Biol. Direct 5 (2010) 51, 10.1186/1745-6150-5-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [98].Liapounova NA, Hampl V, Gordon PMK, Sensen CW, Gedamu L, Dacks JB, Reconstructing the mosaic glycolytic pathway of the anaerobic eukaryote monocerco-monoides, Eukaryot. Cell 5 (2012) 2138–2146, 10.1128/EC.00258-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [99].Taylor TB, Mulley G, Dills AH, Alsohim AS, McGuffin LJ, Studholme DJ, Silby MW, Brockhurst MA, Johnson LJ, Jackson RW, Evolutionary resurrection of flagellar motility via rewiring of the nitrogen regulation system, Science 347 (2015) 1014–1017, 10.1126/science.1259145. [DOI] [PubMed] [Google Scholar]
- [100].Baker CR, Tuch BB, Johnson AD, Extensive dna-binding specificity divergence of a conserved transcription regulator, Proc. Natl. Acad. Sci. U.S.A 108 (2011) 7493–7498, 10.1073/pnas.1019177108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [101].Singh DP, et al. , Genome-defence small rnas exapted for epigenetic mating-type inheritance, Nature 509 (2014) 447–452, 10.1038/nature13318. [DOI] [PubMed] [Google Scholar]
- [102].Tanay A, Regev A, Shamir R, Conservation and evolvability in regulatory networks: the evolution of ribosomal regulation in yeast, Proc. Natl. Acad. Sci. U.S.A 102 (2005) 7203–7208, 10.1073/pnas.0502521102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [103].Lavoie H, Hogues H, Mallick J, Sellam A, Nantel A, Whiteway M, Evolutionary tinkering with conserved components of a transcriptional regulatory network, PLoS Biol. 8 (2010), 10.1371/journal.pbio.1000329 e1000329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [104].Stoltzfus A, On the possibility of constructive neutral evolution, J. Mol. Evol 49 (1999) 169–181, 10.1007/PL00006540. [DOI] [PubMed] [Google Scholar]
- [105].Gray MW, Lukes J, Archibald JM, Keeling PJ, Doolittle WF, Cell biology. irremediable complexity? Science 330 (2010) 920–921, 10.1126/science.1198594. [DOI] [PubMed] [Google Scholar]
- [106].Lukes J, Archibald JM, Keeling PJ, Doolittle WF, Gray MW, How a neutral evolutionary ratchet can build cellular complexity, IUBMB Life 63 (2011) 528–537, 10.1002/iub.489. [DOI] [PubMed] [Google Scholar]
- [107].Rodenfels J, Neugebauer KM, Howard J, Heat oscillations driven by the embryonic cell cycle reveal the energetic costs of signaling, Dev. Cell 48 (5) (2019) 646–658, 10.1016/j.devcel.2018.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
