Skip to main content
eLife logoLink to eLife
. 2024 Sep 6;13:e93146. doi: 10.7554/eLife.93146

Reversions mask the contribution of adaptive evolution in microbiomes

Paul A Torrillo 1,2, Tami D Lieberman 1,2,3,4,
Editors: Vaughn S Cooper5, Detlef Weigel6
PMCID: PMC11379459  PMID: 39240756

Abstract

When examining bacterial genomes for evidence of past selection, the results depend heavily on the mutational distance between chosen genomes. Even within a bacterial species, genomes separated by larger mutational distances exhibit stronger evidence of purifying selection as assessed by dN/dS, the normalized ratio of nonsynonymous to synonymous mutations. Here, we show that the classical interpretation of this scale dependence, weak purifying selection, leads to problematic mutation accumulation when applied to available gut microbiome data. We propose an alternative, adaptive reversion model with opposite implications for dynamical intuition and applications of dN/dS. Reversions that occur and sweep within-host populations are nearly guaranteed in microbiomes due to large population sizes, short generation times, and variable environments. Using analytical and simulation approaches, we show that adaptive reversion can explain the dN/dS decay given only dozens of locally fluctuating selective pressures, which is realistic in the context of Bacteroides genomes. The success of the adaptive reversion model argues for interpreting low values of dN/dS obtained from long timescales with caution as they may emerge even when adaptive sweeps are frequent. Our work thus inverts the interpretation of an old observation in bacterial evolution, illustrates the potential of mutational reversions to shape genomic landscapes over time, and highlights the importance of studying bacterial genomic evolution on short timescales.

Research organism: Other

Introduction

Understanding evolutionary pressures acting upon bacterial populations is crucial for predicting the emergence and future virulence of pathogens (Culyba and Van Tyne, 2021), modeling strategies to combat antimicrobial resistance (Davies and Davies, 2010), and designing genetically modified organisms (Castle et al., 2021). Bacteria can adapt at rapid rates due to their short generation times and large population sizes. Indeed, the rapid evolutionary potential of the microbiome has been proposed to assist in the dietary transitions of mammals (Kolodny and Schulenburg, 2020). However, the vast majority of possible mutations do not increase bacterial fitness and instead result in a neutral or deleterious effect (Kimura, 1977; Davies et al., 1999; Jolley et al., 2000; Dingle et al., 2001). Metrics that estimate the directionality and intensity of past selection at genomic loci of interest have thus become critical tools in modern microbiology and biology more generally.

The normalized ratio of nonsynonymous (N) to synonymous (S) substitutions, known as dN/dS or the KA/KS ratio, is a widely used indicator of past selection (Jukes and Cantor, 1969; Kryazhimskiy and Plotkin, 2008; Barber and Elde, 2014). Nonsynonymous substitutions change the encoded amino acid and thus are considered likely to impact a protein’s function, while synonymous substitutions do not affect the encoded amino acid and are therefore considered effectively neutral, with limited exceptions (Nowick et al., 2019). To account for the fact that nonsynonymous mutations are more likely than synonymous mutations based on the genomic code (~3× on average; Yang and Nielsen, 2000), the values ‘dN’ and ‘dS’ normalize mutation counts to available sites on the genome. The dN/dS ratio therefore summarizes past selection on a genetic sequence, which could be a whole genome, pathway, gene, functional domain, or nucleotide; notably values of dN/dS averaged genome-wide can obscure signatures of adaptive evolution on other portions of the genome (Loo et al., 2020; Ho et al., 2005; Peterson and Masel, 2009). A dN/dS ratio of >1 indicates the dominance of past adaptive evolution (i.e., directional selection) while a ratio of <1 traditionally implies past selection against amino acid change (purifying selection).

Early sequencing work comparing bacterial genomes of the same species reported relatively low dN/dS values across the whole genome (<0.15) (Jolley et al., 2000; Dingle et al., 2001). These observations, obtained from comparing distant bacteria within each species, indicated a strong predominance of purifying selection. However, as it became economically feasible to sequence organisms separated by fewer mutations and therefore less evolutionary time, a contrasting pattern emerged in which high dN/dS values (~1) were found between closely related strains (Feil et al., 2003; Baker et al., 2004). Recent work in the human microbiome has confirmed such results and furthered the contrast between timescales by finding values of dN/dS > 1 (Garud et al., 2019; Lieberman et al., 2011; Shoemaker et al., 2022). The timescale dependence of dN/dS has been mainly attributed to the ongoing action of purifying selection (Garud et al., 2019), a model first proposed by Rocha et al., 2006. According to this model, weak purifying selection (or locally inactive purifying selection; Loo et al., 2020) allows for an initially inflated dN/dS ratio as deleterious mutations that will eventually be purged remain in the population. As time progresses and purifying selection continuously operates, the dN/dS ratio decreases (Loo et al., 2020; Garud et al., 2019; Rocha et al., 2006). However, multiple studies have observed genome-wide values of dN/dS > 1 in these same microbial systems, with values substantially >1 in key genes, which are simply unaccounted for in the purifying model (Garud et al., 2019; Lieberman et al., 2011; Marvig et al., 2015; Zhao et al., 2019; Zhao et al., 2020).

Here, we demonstrate fundamental flaws in the purifying selection model in the context of the large within-person population sizes typical to the human microbiome and many bacterial infections (>1012 bacteria/person). We use analytical, simulation-based, and genomic approaches to support a contrasting model for the timescale dependence of dN/dS, in which adaptive evolution predominates but is not apparent on long-timescales due to adaptive reversion. The comparative success of the reversion model suggests that the study of closely related bacteria is needed to fully understand evolutionary dynamics.

Results

A model of purifying selection that fits the data reveals unrealistic parameters

Explaining the timescale dependence of dN/dS through an exclusively purifying selection model poses several challenges. Firstly, fitting observed data with purifying selection requires a preponderance of mutations with extraordinarily small effects on fitness (selective coefficients, s), which are challenging to eliminate effectively (Haigh, 1978). Secondly, the occurrence of an adaptive event during the extensive time required to purge weakly deleterious mutations interrupts the purification of such mutations. Lastly, neutral bottlenecking processes, such as those observed during host-to-host transmission, exacerbate the accumulation of deleterious mutations. For most of this section, we will disregard these last two complications and focus on the problem of small s. To provide clarity, we first detail the classic purifying selection model.

Mutations can be divided into three classes, the first two of which accumulate at a constant rate per unit of time: synonymous mutations (S), neutral nonsynonymous mutations (Nneut), and non-neutral, transient, nonsynonymous mutations (Ntransient). We restate the timescale dependence of dN/dS as the observation that, in a population starting from a single wild type (WT) cell, the average number of non-neutral nonsynonymous mutations per cell in the population (N¯transient) increases and then asymptotes. Assuming an infinitely large population size and an infinite genome size (to circumvent saturation of mutations), the exclusive purifying selection model (Garud et al., 2019; Rocha et al., 2006) can thus be written as

dNdS=N¯neut+N¯transient3S¯ (1)

and

dN¯transient=UNdtsN¯transientdt. (2)

Here, UN is the non-neutral mutation rate per core genome per generation, s is the selective disadvantage of a non-neutral nonsynonymous mutation (or the harmonic mean of such mutations; see Appendix 1, Section 1.1), and t is the number of generations. The 3 in the denominator of Equation 1 accounts for the discrepancy in the number of potential nonsynonymous and synonymous sites (Yang and Nielsen, 2000). We solve for N¯transient by assuming N¯transient(t=0)=0 to obtain:

N¯transient(t)=UN(1est)s. (3)

We further simplify and combine these equations to create an equation for dN/dS with only two parameters as previously done (Garud et al., 2019). First, since dN/dS plateaus with time (Figure 1a), we have limtN¯neut+N¯transient3S¯=N¯neut3S¯=α. Conveniently, α represents both the asymptote of dN/dS and the proportion of nonsynonymous mutations that are neutral. This allows us to leave only s as the other free parameter, obtaining (see Appendix 1, Section 1.1)

dNdS=α+(1α)1estst. (4)

Figure 1. The previously proposed explanation for the time dependence of dN/dS is weak purifying selection.

(a) Time signature of dN/dS as depicted by data points derived from the studies by Garud et al., 2019 and Zhao et al., 2019. Each dot represents a pairwise comparison between the consensus sequence from two gut microbiomes as computed by Garud et al., 2019, using only the top 10 species based on the quality of data points (see ‘Methods’). Where the high initial value of dN/dS begins to become the low asymptotic value of dN/dS occurs at approximately dS=μSs. Fit lines were derived from these points using Equation 4 to depict the trend. The median R2 is 0.81 (range 0.54–0.94). Corresponding data from Zhao et al., 2019 confirms these observed trends, demonstrating high levels of dN/dS at short timescales and low levels at longer timescales. Adaptive genes are from Zhao et al., 2019 and are defined as those that have high dN/dS values in multiple lineages. Insets: dN vs. dS on a linear scale. Note that the data was fit to minimize variance in the logarithmic scale, not the linear scale, so the fit is not expected to be as good for the inset. See Figure 1—figure supplement 1 for minimizing variance on a linear scale. See Figure 1—figure supplement 2 for all species on separate panels. (b) Values of s from the output of 999 standard bootstrap iterations of curve fitting, conducted with replacement, demonstrate that only small values of the average selective coefficient can fit the data.

Figure 1.

Figure 1—figure supplement 1. The purifying selection model predicts even weaker purifying selection when fitting to nonlogarithmic dN/dS.

Figure 1—figure supplement 1.

Equivalent to Figure 1, except the fit is determined by minimizing variance in dN/dS rather than logarithmic dN/dS. In this fit, s values tend to be even smaller, with a median of 2.0 × 10-5. The median R2 = 0.64 (range 0.07–0.88).
Figure 1—figure supplement 2. Purifying selection model fits by individual species.

Figure 1—figure supplement 2.

Content of Figure 1a, with each species given its own panel. One species of note is B. caccae, which appears to have substantial initial dN/dS values >1, at odds with a purifying selection model. Fits for s, with uncertainty, are given in Figure 1b.
Figure 1—figure supplement 3. Larger selective coefficients rapidly lose explanatory power.

Figure 1—figure supplement 3.

Here, we use Equation 4 to once again fit the data. We still use α=0.1 but use alternate values of s. We see that significantly larger, though still relatively small, levels of purifying selection have essentially no ability to explain the data. R2 is for minimizing the logarithmic variance.

As sequence analysis is not privy to the actual number of generations, we approximate t assuming that synonymous mutations accumulate according to a molecular clock (t=dS2(1/4)μ), where μ is the mutation rate per generation per base pair, ¼ represents the proportion of random mutations that are synonymous (Yang and Nielsen, 2000), and 2 accounts for the fact that divergence is a measure between a pair of genomes. As selection and mutation are both in units per time, any change in μ results in a corresponding change in s. Both model fits and consequences are largely dependent on the ratio of these two variables (more on this below), and thus are not sensitive to the choice of μ. We use a relatively high mutation rate of 10-9 per base pair per generation (Drake, 1991; Barrick and Lenski, 2013) as lower rate would imply even weaker purifying selection.

Fitting the data from Garud et al., 2019, we infer median values of α ≈ 0.10 (0.09–0.14) and s ≈ 3.5 × 10-5 (2.6 × 10-5-6.5 × 10–5) across all species (‘Methods’, Figure 1a). Aggregating all of the data at once results in a similar optimal fit of α ≈ 0.11 and s ≈ 2.8 × 10–5. The similarity across the 10 species is perhaps not surprising, given that all are human gut residents of the order Bacteroidales; these values are also in line with the values obtained previously from aggregating across all species (Garud et al., 2019). These values indicate a model in which only ~10% of nonsynonymous mutations are neutral and the remaining ~90% are so weakly deleterious that they are beyond the limit of detection of any experimental method to date (s ≳ 10–3) (Gallet et al., 2012). Higher values of s that better reflect experimental observations (Kibota and Lynch, 1996; Trindade et al., 2010; Robert et al., 2018) result in poor fits to the data (Figure 1—figure supplement 3). While the implied proportion of deleterious mutations may seem high, deep mutational scanning experiments have revealed that most amino acid-changing mutations in essential genes are deleterious enough to be measured in the lab (Kelsic et al., 2016; Dewachter et al., 2023); complex real-world environments are expected to constrain an even larger fraction of the genome.

In finite populations, the presence of so many weakly deleterious mutations becomes quickly problematic. When s is smaller than UN, organisms without any deleterious mutations (or with the fewest number of deleterious mutations, the ‘least-loaded class’; Haigh, 1978) can be easily lost from a finite population before they outcompete less fit organisms and fitness decay begins to occur. The likelihood of loss depends on the population size and mutation-selection balance (UN/s), a parameter that estimates the average number of deleterious mutations per cell relative to the least-loaded class. Given a core genome of L = 1.5 ×106 bp that can acquire deleterious mutations, we then expect 0.001 new deleterious mutations per genome per generation (UN=34(1-α)Lμ). Thus, the value of UN/s for the above fits is ~29, indicating that most cells in the population contain dozens of deleterious mutations (see Appendix 1, Section 1.2). With this value of the mutation-selection balance parameter, the frequency of mutation-free organisms in a population is extremely small, even for a population that starts without any deleterious mutations (<10–12 after 100,000 generations). If the flexible genome also contains deleterious mutations, the least-loaded class is pushed down even further. Simulations substantiate this prediction of mutation accumulation and decrease in frequency of the wild type (Figure 2a, ‘Methods’).

Figure 2. Models of extremely weak purifying selection that can fit the data suffer from mutation accumulation and fitness decay.

(a) The temporal dynamics of the least-loaded class in a large population under the purifying selection model. The black line represents the predicted frequency of the wild-type (mutation-free) class over time. The simulation curve shows simulation results assuming constant purifying selection in an exceptionally large effective population size (Ne = 1018; see text for a discussion of population size) under a slightly modified Wright–Fisher model (‘Methods’). (b) As a consequence of the loss of the least-loaded class, fitness declines in finite populations over time. Colored lines indicate simulations from various effective population sizes with mutations of constant selective effect. The deleterious mutation rate in the simulation is 1.01 × 10−3 per genome per generation. (c) Using the same simulations as in panel (b), we see that realistic global effective population sizes fail to fit the dN/dS curve, with different asymptotes. The black line denotes the infinite population theoretical model, and the colored lines indicate increasing effective population sizes, which change the strength of genetic drift in the simulations. Larger values of s and models in which all mutations are deleterious cannot fit the data (Figure 2—figure supplement 1, Figure 2—figure supplement 2). Generations are assumed to occur once every day.

Figure 2.

Figure 2—figure supplement 1. Larger selective coefficients that can prevent mutation accumulation in simulations lead to less optimal data fit.

Figure 2—figure supplement 1.

We first calculate the smallest possible s values (as estimated from Equation 5) that will not lead to mutation accumulation for a given effective population size and the corresponding fits. The core genome size is assumed to be 1,500,000 nucleotides. Once again, we use Equation 4 to fit the data and use α=0.1.
Figure 2—figure supplement 2. Even assuming all mutations are deleterious suggests a higher asymptotic dN/dS.

Figure 2—figure supplement 2.

A purifying selection simulation as in Figure 2, except that all mutations are assumed to be weakly deleterious (s = 3.5 × 10-5) rather than just 90%. In other words, rather than the standard deleterious mutation rate of 1.01 × 10-3 mutations per genome per generation, the mutation rate is now 1.13 × 10-3 mutations per genome per generation. With an effective population size of 109, mutations still occur so quickly as to raise the asymptote above the data. Furthermore, all of these mutations contributing to the asymptotic dN/dS are deleterious rather than neutral, furthering the problem of Muller’s ratchet.
Figure 2—figure supplement 3. High rates of recombination are unlikely to rescue a model of weak purifying selection.

Figure 2—figure supplement 3.

A purifying selection simulation with a population of 109 with and without recombination. To simulate the potential for recombination to purge deleterious mutations, we assume a 2.5 × 10-7 chance per codon per generation of return to the ancestral state (500 times the basal mutation rate, dotted blue line), which brings alongside it a linked synonymous mutation. This would represent recombination bringing in fragments of 100 bp in length from a genome with dS = 0.01. All other assumptions are the same as the continuous selection simulations in Figure 2. While recombination does allow more fit genotypes to arise, their selective advantage is quite small, and time to sweep is still slower than mutation accumulation. Recombination is more effective at purging deleterious alleles when many deleterious mutations are present, but this occurs at a longer timescale than the actual drop off of dN/dS values. We also include a variant where recombination events revert deleterious mutations without linked synonymous mutations. In this model, we have only included recombination events that revert deleterious mutations and chosen not to explicitly model cases of neutral recombination. High levels of neutral recombination with distant organisms would result in closely related organisms having dN/dS significantly less than 1 and would not fit the data without significant adaptation.
Figure 2—figure supplement 4. Transmission bottlenecks and adaptation further complicate purifying selection.

Figure 2—figure supplement 4.

Results of a simulation with purifying selection that takes into account transmission bottlenecks and the possibility of adaptation. The simulation is meant to represent a potential transmission chain from gut to gut. This simulation is a modified version of the simulation used for the reversion model (see ‘Methods’). Only forward mutations are available when beneficial mutations arrive (sben=0.03) and deleterious mutations arrive at a rate of 1.01 × 10-3 mutations per genome with a selective disadvantage of s = 3.5 × 10-5. The census size in an individual gut is 1010. We choose very conservative values for adaptation rate, bottleneck size, and bottleneck frequency to show how even small levels of these dynamics strongly influence results. Transmission bottlenecks happen on average every 100,000 generations. The population is bottlenecked to 1000 members upon transmission. Beneficial selective pressures are released on average every 8400 generations. Multiple beneficial pressures can occur together (the number released at a time is a Poisson random variable of mean 1). This rate of beneficial mutation does not contribute much to dN/dS on its own as seen from the dotted line where adaptations are excluded in the dN/dS calculation. However, both of these scenarios drastically increase the rate of mutation accumulation and the asymptotic dN/dS. The reversion model can prevent this from occurring with 10 times the amount of adaptation and transmission.

The time until the least-loaded class is completely lost from the population depends on the strength of genetic drift. The strength of genetic drift is inversely proportional to population size in well-mixed populations (Gillespie, 2004), and in less well-mixed or otherwise nonideal populations, is inversely proportional to a smaller parameter, the effective population size, Ne. Ne is often estimated by assessing polymorphisms in a population (Gillespie, 2004) but is hard to estimate from data following a recent bottleneck. Because each individual’s gut microbiome is thought to be well mixed (census size = 1013) (Sender et al., 2016), it has been recently argued that Ne ≈ 1011 reflects drift processes for dominant gut species (Ghosh and Good, 2022; Labavić et al., 2022). On the other hand, lower values of Ne ≈ 109 or less have been estimated for global populations of bacteria (Bobay and Ochman, 2018) because of the slow rates of bacterial transmission across people. While this decrease in Ne when increasing scales may seem paradoxical, we note this use of Ne only reflects the magnitude of the force of drift; for other calculations in nonideal populations, census population size or other parameters should be used.

Without extremely large values of Ne, the least-loaded class will be lost recurrently, rapidly lowering the fitness of the population (i.e., Muller’s ratchet; Haigh, 1978). Assuming s is small and thus approximately additive, this recurrent process of fitness decay occurs roughly when the following inequality is satisfied (see Appendix 1, Section 1.2; Neher and Shraiman, 2012):

2sNeeUNs<<1. (5)

Given UN/s = 29 as derived above, Ne > 1015 is required to avoid continual deleterious mutation accumulation and fitness decline (Figure 2b). Thus, the purifying model requires levels of drift unrealistic at the within-person or across-globe scales. Simulations confirm that deleterious mutations accumulate and compromise the ability of the purifying model to explain empirical dN/dS decay in reasonably finite populations (Figure 2c). Moreover, continuous accumulation of mutations in such populations decreases fitness so much that the average genome contains a sizable fraction (~10%) of deleterious alleles after 1 million years (Figure 2c), assuming Ne = 109 and one generation a day (Korem et al., 2015). Even if this decreased fitness was biologically maintainable, the accumulation of so many deleterious mutations would lead to many potential adaptive back mutations, complicating the efficiency of purifying selection. Consequently, this value of UN/s is simply incompatible with a model where a vast majority of alleles are already optimal.

Lastly, the intolerance of the purifying model to adaptation and transmission is particularly problematic. Within-host adaptive sweeps have been observed in Bacteroides fragilis (Zhao et al., 2019) and other Bacteroides (Garud et al., 2019). Such adaptation interferes with inefficient purifying selection; deleterious mutations are likely to hitchhike (Desai et al., 2013) to fixation on the genomic background of adaptive mutations. Any given weakly deleterious mutation with s = 3.5 × 10–5 cannot be purged from a within-host population on the timescale of human lifetime (assuming ~1 generation per day), and thus if any adaptive sweep occurred within that host, it would either hitchhike to fixation or be completely removed from the population. Similarly, deleterious mutations can also hitchhike to fixation during neutral transmission bottlenecks, thereby raising the average number of deleterious mutations per cell in the population, furthering mutation accumulation, and hampering the efficiency of purifying selection. Simulations confirm that even infrequent adaptive sweeps and bottlenecks have tangible impacts on dN/dS, including raising the asymptote (Figure 2—figure supplement 3).

Neither recombination nor differential selection at transmission can easily rescue a model of weak purifying selection

Homologous recombination, which occurs at detectable rates within human gut microbiomes and within the Bacteroidales order (Liu and Good, 2024), cannot rescue a population from Muller’s ratchet when such weakly deleterious mutations are so frequent. If we assume a generously high rate of recombination, such that a mutated nucleotide is 500 times more likely to be reverted via recombination than mutation (r/m = 500) (Torrance et al., 2024; Liu and Good, 2024) and brings along a single linked synonymous mutation during each recombination event, the decay of dN/dS still cannot be recreated in a population of size 109 and fitness will still decay (Figure 2—figure supplement 4). The inability of recombination to suppress mutation accumulation in this regime arises because the selective advantages themselves are still too small to sweep faster than the rate at which mutations accumulate. While recombination does allow dN/dS to eventually decay, the rate of decay is much slower than observed, resulting in a poor fit to the data (Figure 2—figure supplement 4). While higher values of Ne or a higher recombination rate could theoretically approximate the absence of linkage and escape of Muller’s ratchet, we note that the maximum r/m across bacteria is estimated to be <50 (Torrance et al., 2024) and our simulations are therefore conservative.

Our presentation so far has implicitly assumed that weak purifying selection has been acting continuously and that values of s are constant for any given allele over time. However, apparently weak purifying selection might theoretically emerge from mutations that spend periods under neutral selection (or even local positive selection) and larger periods under strong negative selection, with the estimated value of s reflecting the harmonic mean (Culyba and Van Tyne, 2021; Loo et al., 2020). However, such models will have a hard time overcoming mutation accumulation. For example, a model in which purifying selection acts only during transmission still cannot prevent mutation accumulation without unrealistic assumptions. In particular, the selection-at-transmission model would still require ~29 non-neutral mutations in the average adult population, which implies a very low frequency of the least-loaded class. Assuming each host’s population gets replaced once every 10,000 bacterial generations (~26 years), such a model would require the least-loaded class to be 6000× more likely to colonize them than the average genotype in the population ((1+10,000s)29). The presence of rare cells with strong selective advantages would suggest super-spreading across human microbiomes, which has yet to be reported in the human microbiome (Faith et al., 2013). More importantly, Muller’s ratchet would still click because of the low frequency of this least-loaded class.

Adaptive reversions can explain the decay of dN/dS

If purifying selection cannot explain the decay in neutral mutations, what can? One particularly attractive process that removes nonsynonymous mutations over time is strong adaptive mutation and subsequent strong adaptive reversion of the same nucleotide when conditions change. Such reversions are likely to sweep in large populations when mutations are adaptive locally but deleterious in other environments (Ascensao et al., 2023). In the gut microbiome, these alternative environments could represent different hosts (Figure 3a) or environmental changes within a single host (e.g., diet, medication, other microbes). As an illustrative example, the presence of a bacteriophage in one gut microbiome might select for a loss-of-function mutation (premature stop codon or otherwise) in a phage receptor, driving this mutation to fixation in its host, but reverting to the wild-type receptor when transmitted to a phage-free host. Reversions are most likely when compensatory mutations that counteract a mutation’s deleterious effects are either scarce or not as beneficial as direct reversion (Levin et al., 2000) (i.e., provided a premature stop codon); we discuss models that include compensatory mutations later in this section.

Figure 3. Locally adaptive mutations and subsequent reversions can explain the decay of nonsynonymous mutations.

(a) Cartoon schematic depicting a potential reversion event within a single transmitted lineage of bacteria. The color of each individual indicates a different local adaptive pressure. Closed arrows represent mutation while open arrows indicate transmission. (b) Reversions become increasingly likely at larger population sizes and are nearly guaranteed to occur and fix within 1–10 years when strongly beneficial in gut microbiomes. The probability of revertant arising and fixing (top panel) is calculated as 1(12sbenUrev)Ne, and the expected time to fixation of reversion (bottom panel) is calculated as 12NesbenUrev+ln(Ne)sben (Urev=4.5×1010 per generation). Generation times are assumed to be 1 day. Note that the mutation rate does not affect time to fixation much when Ne is large. Here, we assume no clonal interference or bottlenecks, though simulations do take these processes into account. See Appendix 1, Section 2.2 for derivation. Each line type displays a different selective advantage coefficient. (c) The adaptive reversion model can fit the data. Each colored line shows the fit for a different species. The median R2 = 0.82 (range 0.54–0.94). Fit minimizes logarithmic variance. See Figure 3—figure supplement 1 for alternative fitting linear variance. See Figure 3—figure supplement 2 for species individually. Insets: fit parameters for τflip, the average number of generations for a given environmental pressure to switch directions and nloci, the average number of sites under different fluctuating environmental pressures. The scale of the y-axis is linear. Confidence intervals are from 999 bootstrapped resamples.

Figure 3.

Figure 3—figure supplement 1. Potential for even more adaptation if fitting nonlogarithmic dN/dS.

Figure 3—figure supplement 1.

Results of fitting the reversion model (Equation 8) to minimize dN/dS rather than logarithmic dN/dS. This implies a potential longer time to switch for a given selective pressure (median τflip=53,340) and an increased number of loci undergoing fluctuating selection (median nloci=84). The median R2 = 0.72 (range 0.11–0.88).
Figure 3—figure supplement 2. Reversion model fits by individual species.

Figure 3—figure supplement 2.

Content of Figure 3b, with each species now given its own panel. One species of note is B. caccae, which appears to have initial dN/dS values >1, which can now be fit with the reversion model (see Figure 1—figure supplement 2 for comparison with purifying model).

Adaptive nonsense mutations have been observed to emerge frequently within individual people in both pathogens (Culyba and Van Tyne, 2021; Lieberman et al., 2011; Key et al., 2023; Shopsin et al., 2008) and commensals (Zhao et al., 2019; Barreto et al., 2023). Identifying reversions in vivo requires both high temporal resolution and deep surveillance such that the probability of persistence of ancestral genotype is removed (Snitkin et al., 2013) despite this difficulty, reversions of stop codons have been observed in mouse models (Sousa et al., 2017) and during an outbreak of a pathogen infecting the lungs of people with cystic fibrosis (Poret et al., 2024). While direct reversion has not yet been observed in gut microbiomes, premature stop codons are frequently observed. Among the 325 observed nonsynonymous de novo mutations in a study of within-host B. fragilis adaptation (Zhao et al., 2019), 28 were premature stop codons. This frequency is significantly higher than expected by chance (p=0.015; ‘Methods’). Moreover, 4 of the 44 mutations in 16 genes shown to be under adaptive evolution on this short timescale were stop codons. These same 16 genes show a signature of purifying selection on long timescales (Figure 1a).

Traditionally, mutational reversions of stop codons and other mutations have been considered exceedingly unlikely and have been ignored in population genetics (Tajima, 1996), with a few exceptions (Charlesworth and Eyre-Walker, 2007). However, for a bacterial population within a human gut microbiome, the likelihood of a mutational reversion is quite high. A single species within the gut microbiome can have a census population size of 1013, with generation rates ranging from 1 to 10 per day (Sender et al., 2016; Korem et al., 2015). Taking a conservative estimate of one generation per day and a within-person Ne of 1010 (e.g., bacteria at the end of the colon may not contribute much to the next generation; Labavić et al., 2022), reversions become highly probable (Figure 3b; see Appendix 1, Section 2.2). Given a mutation rate of 10–9 per site per generation, we anticipate 10 mutants at any given site each generation. In the large population sizes relevant for the gut microbiome, a beneficial mutation will then take substantially longer to sweep the population than occur, with values of sben > 1% generally sweeping within 10 years (Figure 3b). Consequently, if selection strongly benefits a reverting mutation, a genotype with a beneficial mutation is essentially guaranteed to emerge within days to weeks and replace its ancestors within the host within months to years.

Given its plausibility, we now consider if the reversion model can explain the observed decay of dN/dS. The dynamics of the reversion model can be given by

.dN¯transientdt=1τflip(nlociN¯transient)1τflipN¯transient (6)

With the corresponding solution for N¯transient being (see Appendix 1, Section 3.1)

N¯transient(t)=nloci2(1e2tτflip). (7)

Here, nloci denotes the number of loci that experience distinct sources of fluctuating selection. The parameter τflip represents the average number of generations required for the sign of selection at a chosen locus to flip and determines the key point in the dN/dS decay curve where N¯transient(t) begins to drop. We note that a locus here could be a nucleotide, gene, or gene set – any contiguous or noncontiguous stretch of DNA in which two knockout mutations would be just as beneficial or harmful as one mutation. We again use α to represent the proportion of nonsynonymous mutations that are neutral. Using Equation 7, we obtain a formula for dN/dS that has only three free parameters when a single value for μS is chosen:

dNdS=α+nloci6μSt(1e2tτflip). (8)

When fitting the dN/dS curve, the values obtained are reasonable in the context of bacterial genomics, with median best-fit values across species of τflip=46,000 bacterial generations (range 25,000–105,000) and nloci=55 (range 34–80). Given daily bacterial generations, this value of τflip suggests the sign of selection on a given allele would flip approximately every 110 years. The average time for any pressure to flip would thus be approximately every 2 years, or less frequently if adaptive events occur in bursts (e.g., upon transmission to a new host). While 55 loci under distinct selective pressures may seem high, Bacteroidetes genomes are known to have dozens of invertible promoters (up to 47 in B. fragilis; Jiang et al., 2019). Invertible promoters are restricted out of the genome and re-ligated in the opposite direction to turn gene expression on or off. The number of invertible promoters in a given genome approximates a lower bound on the number of fluctuating selective pressures that these genomes frequently experience. Interestingly, adaptive loss-of-function mutations reported in B. fragilis affect the same genes regulated by invertible promoters (Zhao et al., 2019). The plausibility of these fit parameters lends support to a model in which dN/dS decays solely based on strong and recurrent local adaptations.

To ensure a reversion model is robust to finite populations, we performed simulations using fit parameters. These simulations capture the dynamics of a single population evolving as it transmits across a series of hosts through random bottlenecks (Figure 4a; ‘Methods’); these simulations allow for clonal interference between adaptive mutations. We allow new pressures to arise independent of bottlenecks as new selective forces (phage migration [Koskella and Brockhurst, 2014]; immune pressures [Barroso-Batista et al., 2015]; dietary changes [Carmody et al., 2019]) can emerge throughout the lifespan and independent of migration; forcing transmission and bottlenecks to coincide gives similar results (Figure 4—figure supplement 1). As in the purifying selection simulations, the per base pair mutation rate is 10–9, and 90% of nonsynonymous substitutions are deleterious, but this time they have a larger s of 0.003 (Robert et al., 2018) and are thus purged more quickly from the population. Notably, while some of these deleterious mutations hitchhike to fixation during bottlenecks and adaptive sweeps, fitness does not decay because these mutations are subsequently reverted with adaptive sweeps (Figure 4—figure supplement 2). If deleterious mutations had significantly smaller s, they would be unable to be reverted due to the long time needed to reach fixation, even if bottlenecks and adaptive events are less frequent (Figure 2—figure supplement 3).

Figure 4. Under a model of reversion, the apparent dN/dS on long timescales underestimates the extent of adaptive evolution.

(a) The reversion model successfully fits the data in simulations. We simulate a population of size 1010 that has a bottleneck to size 10 on average every 10,000 generations (~27 years or a human generation [Wang et al., 2023] given a bacterial generation a day), with one adaptive pressure (sben=0.03) occurring on average every 840 generations independently of bottlenecks (see Figure 4—figure supplement 1 for an alternative where bottlenecks and selection are correlated). New pressures either require forward mutations (which can be acquired at a rate of 1.1 × 10-8 per available locus per generation) or reversions (which can be acquired at a rate of 4.5 × 10-10 per available locus per generation), the balance of which depends on the history of pressures on the tracked genome (i.e., more past forward pressures implies more potential future reverse pressures). Based on the best fit to the data, we use nloci=55. Deleterious mutations occur at a rate of 1.01 × 10-3 mutations per genome and have s=0.003 and can themselves be reverted. More details on the simulation can be found in ‘Methods’. Each curve represents the average of 10 runs; the blue line shows the observed pairwise dN/dS while the red line includes adaptive mutations and reversions. The theory line is the result of Equation 8. Observable dN/dS decays because of reversion, while the actual dN/dS of mutations that occurred is >1 when taking into account both forward and reverse mutations. (b) PAML (Yang, 2007) cannot detect true dN/dS in a given gene in the presence of adaptive reversions. Both lines are dN/dS as calculated by PAML on a simulated gene phylogeny. In the permanent adaptations simulation (blue), adaptive mutations are acquired simply and permanently. In the transient adaptations simulation (red), only more recent mutations will be visible while older mutations are obscured (‘Methods’). Line is the average of 10 simulated phylogenies and shaded regions show the range. (c) Categories of genes in the Bacteroides fragilis genome (NCTC_9343) enriched for stop-codon adjacent codons (TTA and TCA) relative to the expectation from the rest of the genome (‘Methods’). The use of these codons suggests these sequences may have recently had premature stop codon mutations. p-Values are displayed above bars and were calculated using a one-proportion Z-test with Bonferroni correction. See ‘Methods’, Supplementary file 1, and Supplementary file 2 for more details.

Figure 4.

Figure 4—figure supplement 1. Effect of correlating mutations with bottleneck has little impact on the fit.

Figure 4—figure supplement 1.

Same parameters as simulation from Figure 4a but rather than have new pressures occur independently of bottlenecks, they occur concurrently. To have an overall average of one beneficial mutation every 840 generations, an average of 11.9 mutations (Poisson with mean 11.9) occur every bottleneck, which occurs on average every 10,000 generations. The most noticeable difference is greater variability at low synonymous divergence values. The first 100,000 generations is the average of 50 simulations while subsequent parts of the curve consist of only 5 simulations to save on computing time (hence the slight spike).
Figure 4—figure supplement 2. Deleterious hitchhikers are reverted over time, preventing fitness decay.

Figure 4—figure supplement 2.

Besides for reversions potentially occurring due to strong environmental pressures, reversions can also occur because of the fixation of deleterious hitchhikers. These results are from the simulations of Figure 4a, where deleterious mutations occur at a rate of 1.01 × 10-3 per genome per generation and with a selective disadvantage of s = 0.003. Deleterious mutations are allowed to hitchhike, and reversions of these deleterious mutations are also permitted. Adaptive mutations and reversions are not counted in relative fitness. The curve is the average of 10 simulations.
Figure 4—figure supplement 3. Inclusion of compensatory mutation in the reversion model shows that dN/dS still decays provided reversion occurs at reasonable rates.

Figure 4—figure supplement 3.

(a) Using a Markov chain (see Appendix 1, Section 3.2), we calculate the theoretical values of N¯transient, and thus dN/dS decay, under varying assumptions about the rate at which compensatory mutations win over true reversions. While compensatory mutations do slow the rate at which dN/dS decays (relative to no compensatory mutations) if reversions are more likely than compensatory mutations, a decay will be observed. Reducing the number of loci can correct for delayed decay from compensatory mutations. (b) The likelihood of true reversion vs. compensatory mutations (treated as mutually exclusive) under different parameter regimes as determined from simulations. Simulations are performed the same as those displayed in Figure 4 and described in the ‘Methods’ section for the reversion model (same population size, bottlenecks, loci, arrival of pressures, mutation rates), except now there is an additional compensatory category that can be used to satisfy an adaptation to a reverse pressure and has a separate rate and selective advantage. Results are obtained by comparing accumulated true reversions to compensatory mutations after 500,000 generations.

We note that other complex models that include reversion and other processes are also possible. For example, a model with a very large number of loci with selective tradeoffs and pressures that act only transiently (nonfluctuating) could potentially fit the data. However, the agreement between nloci and the number of invertible promoters, and the finding of parallel evolution in vivo, suggests the fluctuating selection model is more realistic than a very many-sites model.

So far, we have assumed that only exact reversions are selected upon when the sign of selection returns to its original state. However, the reversion model can also accommodate compensatory mutations that exclude any selective advantage for reversion; these compensatory mutations can also be subject to reversion themselves. We conceptualize this as a random walk, in which a locus at a nonancestral state acquires a compensatory mutation with probability p or obtains a true reversion with probability 1 – p (see Appendix 1, Section 3.2). As long as p0.5, dN/dS will decay to the same asymptote despite adaptive dynamics occurring. While compensatory mutations shift the timing of dN/dS decay to the right, it can be shifted backward by decreasing nloci (Figure 4—figure supplement 3a). The condition p0.5 is easily met when sben = 0.03, until excluding compensatory mutations are 10 times more likely than true reversion and provide 95% of the selective advantage of the true reversion (Figure 4—figure supplement 3b). If selective pressures are stronger (as they might be in the presence of phage), true reversions will outcompete compensatory mutations even if the supply of compensatory mutations is greater or such mutations provide better relative compensation.

A critical consequence of the reversion model is that apparent and actual dN/dS values diverge quickly. Even when the true genome-wide dN/dS exceeds 1 – meaning that adaptive sweeps have been a dominant force in shaping genomes – the observed value can be close to 0.1 on long timescales. This disparity complicates the interpretation of dN/dS as it becomes challenging to determine whether a genome or gene lacks nonsynonymous mutations due to reversions or negative selection. We confirmed the inability to detect adaptive selection on a gene when reversion is rampant by simulating protein phylogenies; even the advanced software PAML (Phylogenetic Analysis by Maximum Likelihood) (Yang, 2007) significantly underestimates actual dN/dS (Figure 4b; ‘Methods’). Without sufficient temporal sampling, no software can realistically estimate these hidden, adaptively driven nonsynonymous mutations.

Lastly, we sought to find evidence of past reversions of stop codons in certain genes by analyzing codon usage. Both leucine and serine have the property that they can be encoded by six codons, only one of which is highly stop codon adjacent (TTA for leucine and TCA for serine). Across the B. fragilis genome, these codons are depleted overall (13.48% usage rather than the neutral expectation of 16.67%). However, specific Clusters of Orthologous Genes (COG) categories are enriched in TTA and TCA codons relative to this baseline, including genes associated with transcription and cell envelope biogenesis (Figure 4c, ‘Methods’, Supplementary file 1). Further, when functionally annotated genes are further categorized by cellular localization, more gene categories exhibit enrichment (Figure 4c, ‘Methods’, Supplementary file 1), most notably genes involved in inorganic ion transport and metabolism that are localized to the outer membrane. Genes implicated in within-host B. fragilis adaptation (Zhao et al., 2019) are also found disproportionately in this category of outer membrane transporters (p=1.22 × 10–4; ‘Methods’, Supplementary file 1). Both the cell envelope and membrane-bound transporters are known to mediate interactions with the immune system and phage (Sukhithasri et al., 2013; Ongenae et al., 2022), and are therefore expected to experience fluctuating selective pressures. The enrichment of stop-codon adjacent codons in pathways associated with environment-dependent costs further supports a model in which adaptive mutational reversions are frequent.

Discussion

In this study, we present a new interpretation of the time-dependent changes in dN/dS for bacterial populations. We show that the traditional weak purifying selection model struggles to replicate theoretical results in realistic population sizes and propose an alternative model with opposite implications that are supported by analytical, simulation, and genomic results. Together, these results challenge the conventional view that high dN/dS values on short timescales are an artifact and should not be trusted. Instead, the success of the reversion model suggests that adaptive dynamics are underestimated on long timescales because of the saturation of dN.

It is perhaps not surprising that reversions have been relatively overlooked in previous literature. First, most population genetics theory focuses on eukaryotic organisms with smaller population sizes and longer generation times, for which reversion is less likely. The low likelihood of reversion in these populations has inspired the use of the convenient infinite-site model (Tajima, 1996), which assumes that reversions never occur and simplifies derivations. While smaller values of Ne can be appropriate for modeling global bacterial dynamics – because bottlenecks and geography limit how many organisms effectively compete – they are inappropriate for within-gut populations, which are less structured. While gut microbiomes do have a spatial structure that reduces competition, theoretical work modeling this biogeography suggests that the census and active population sizes differ only approximately tenfold (Ghosh and Good, 2022; Labavić et al., 2022). This brings the within-gut microbiome Ne to substantially larger than the per-nucleotide mutation rate, invalidating the infinite sites model. Secondly, while bacterial geneticists have long observed adaptive loss-of-function mutations, two common misinterpretations of population genetic parameters can underestimate the probability of reversion: molecular clock rates (μ), which are generally low, can easily be confused with the supply of potential mutations (μNe) (Lieberman, 2022); and classical approaches that assess Ne from genetic diversity vastly underestimate the currently active population size, particularly if a bottleneck recently occurred (e.g., during transmission). Lastly, simulating large populations, even when appropriate, is computationally difficult. As a consequence, population genetics simulations, including those of bacteria, have used relatively small population sizes (≤106 organisms). We overcome computational limitations by tracking genetic classes rather than individual genotypes (‘Methods’). While our approach does not allow explicit comparison between individuals within a population, we believe this framework represents a powerful method to simulate large population sizes when applicable.

Whether or not a reversion model can be applied beyond host-associated microbial populations remains to be explored. We only analyze microbiome data here, but we anticipate that analyses of highly curated dN/dS decay curves from microbial pathogens could yield similarly plausible parameter fits for the reversion model given past observations of dN/dS decay (Rocha et al., 2006). When effective population sizes are smaller than 109, reversions are relatively unlikely. For example, while adaptive reversions can sweep individual gut microbiomes, we do not propose that reversions sweep the global bacterial population. Regardless, theoretical work on animal populations has shown that adaptive reversions are possible after local population bottlenecks (Charlesworth and Eyre-Walker, 2007). Similarly, environmental variations that change more rapidly than the timescale required for a local selective sweep (e.g., those imposed by daily dietary changes in the gut; or imposed by light-dark cycles in the environment) would be less likely to drive fixation and subsequent reversion than the less rapid changes considered here (e.g., phage migration) (Cvijović et al., 2015). On the other hand, adaptive reversions may be particularly relevant for viral populations, which are known to undergo within-host adaptation, have very large population sizes, and experience frequent bottlenecks (Feder et al., 2017). Reversions have commonly been observed in certain regions of the HIV genome and have been postulated to diminish measured substitution rates in those regions (Druelle and Neher, 2023).

Despite the success of a model of reversion alone in explaining dN/dS decay, it remains possible that other forces could additionally contribute. While we have shown that purifying selection alone, either continuously or during transmission, cannot explain dN/dS decay alone, it is possible that some degree of purifying selection could act alongside a reversion model. Similarly, directional selection could be incorporated into the reversion model by adjusting the parameter α. While the true contribution of adaptive evolution to α is likely nonzero, it is difficult to fit with available data and it is therefore left for future work.

While we have presented evidence that recombination alone is unlikely to rescue a model of weak purifying selection, it remains possible that recombination could be included in a model that includes adaptation and, notably, could drive adaptive reversions. Microbial geneticists have frequently observed that recombined regions exhibit lower dN/dS values compared to non-recombined regions (Castillo-Ramírez et al., 2011), a signature consistent with having already experienced reversion or purifying selection. Recombination could potentially revert multiple mutations at specific loci simultaneously, which might be particularly beneficial in the presence of genomic epistasis. Thus, despite the success of the mutation-driven model, it is likely that recombination plays some role in the decay of dN/dS.

While more direct observation of adaptive reversions is currently lacking, we propose that this paucity is simply an artifact of lacking samples along a line of descent with sufficient genomic resolution. Despite this challenge in observation, a recent study tracking de novo mutations between mothers and infants revealed several cases of apparent reversion, with elevated values of dN/dS > 1, though not significantly so (Chen and Garud, 2022). Moreover, many short-term studies in the gut microbiome and beyond have revealed strong evidence of within-person adaptation, including parallel evolution (Lieberman et al., 2011; Marvig et al., 2015; Zhao et al., 2019; Cooper and Lenski, 2000) and loss-of-function changes like premature stop codons (Key et al., 2023) – with low long-term dN/dS values in these same short-term genes (Vigué and Tenaillon, 2023). We note that adaptation and reversion do not result in parallel evolution in the genomic record if various initial mutations result in the same phenotype (i.e., loss-of-function mutations); however, it would result in changes in codon usage bias we have shown (Figure 4c).

The shortcomings of the purifying model and the success of the reversion model under realistic assumptions highlight the importance of studying evolution in real time for understanding evolutionary dynamics. In addition, our results emphasize the importance of simulating large population sizes for explaining observations in bacterial population genomics, spotlight the potential for strong adaptation in bacterial populations, and underscore the need for continued development of population genetics theory for microbial populations.

Methods

Data and parameter estimation

Data was obtained from Shoemaker et al., 2022 and was initially generated by Garud et al., 2019. Pairwise dN/dS values can be found in the GitHub repository. The parameters are estimated using scipy.optimize.curve_fit. The fit minimizes the RMSD of the logarithmic dN/dS. If we fit the data by minimizing just dN/dS on a linear scale, we get s ≈ 2.0 × 10–5, which suggests an even weaker purifying selection (Figure 1—figure supplement 1). We analyzed the 10 species with the most data points reflecting short divergence times (dS < 0.0005), which is critical for data fit.

Population simulations overview

The majority of the computational simulations performed are built upon the idea of the Wright–Fisher model with selection (Tataru et al., 2017) that population generations can be determined from a multinomial distribution. However, we have made some changes to generalize this model for our purposes.

First, the simulations do not necessarily assume a constant population size but rather assume the population grows via a logistic growth model with a capacity to allow for the implementation of bottlenecks. Specifically, if P[t] is the population on generation t and K is the population capacity, then

P[t+1]¯=P[t]+P[t](1P[t]K)

And

P[t+1]=Poiss(P[t+1]¯).

The population size is a Poisson random variable as we choose to determine the offspring of individual genetic classes as a Poisson random variable. We note that except for the very first few generations and after bottlenecks, the population size only has small fluctuations around a fixed capacity.

To speed the simulation up and enable the simulation of very large population sizes, we implemented a variety of genotype classes, rather than tracking each genotype individually. Genotype classes are similar to the practice of simulating fitness classes (Desai and Fisher, 2007), though we manage the number of unique classes via Poisson merging and splitting.

For all simulations, we start from a single organism that begins with 500,000 neutral alleles, representing a core genome size of this many codons that has yet to receive any mutations. When a mutation occurs, one allele may change types or stay the same, depending on the mutation received and the state of the randomly chosen codon. For example, a deleterious mutation occurring at a codon already in a deleterious state does not change the genotype class of the organism.

The specific implementations and additional parameters used for this model are provided in the following sections. Here, we outline the theory that ensures that genotype classes accurately represent such a population and enable the calculation of fitness. Consider the total population of size, P[t], at generation t, as a composite of multiple different classes. The number of individuals in class j on generation t will be Aj[t]. We have

P[t]=j0Aj[t].

Within class j, we store several variables that provide information about the genotype of members of Aj[t]. Specifically, we store a number jk that specifies the number of alleles of type k in the class j. Examples of potential types that are used in our work include deleterious alleles, adaptive alleles, and alleles that result from reversion. Each type of allele is associated with a specific selective advantage sk. We can now write a formula to calculate the absolute fitness Fj of class j:

Fj=k0(1+sk)jk.

From the absolute fitness Fj, we calculate the average absolute fitness of the population on generation t via

E[F]=j0FjAj[t]P[t].

We now calculate the relative fitness of class j on generation t as

fj=FjE[F].

Next, we calculate the expected size of class j in the next generation with

Aj[t+1]=fj(Aj[t]+Aj[t](1P[t]K)).

Note that

P[t+1]=j0Aj[t+1]=j0fj(Aj[t]+Aj[t](1-P[t]K))=
j0FjE[F](Aj[t]+Aj[t](1P[t]K))=P[t]+P[t](1P[t]K).

This allows us to use a logistic model of growth to represent population size rather than being constrained to fixing it, which is useful for simulating bottlenecks.

To account for genetic drift through random fluctuations, we rewrite the above equations to be

Aj[t+1]=Poiss(fj(Aj[t]+Aj[t](1P[t]K))).

which also implies

P[t+1]=Poiss(P[t]+P[t](1P[t]K)).

Note that this simulation still has equivalent dynamics of the frequencies of classes as a Wright–Fisher model with selection in the case of a fixed population size due to the ability to split Poisson processes, that is,

Pr(Aj[t+1]=X|P[t+1])=Pr(Bin(P[t+1],fjAj[t+1]P[t])=X).

Mutations are added in every generation depending on the mutation rate. Only single mutants are generated per generation, and an organism cannot get more than one mutation per generation. The number of new mutants is determined by the binomial distribution. New mutants are added then to their appropriate class j. For example, if a deleterious mutation is gained in a class with 10 deleterious alleles (and nothing else), this new mutant will increase the population size of the class with 11 deleterious alleles (and nothing else) while decreasing the population size of the class with 10 deleterious alleles (and nothing else).

By grouping individuals in classes rather than by genotype, computational costs can be greatly cut down. Grouping individuals does not affect the dynamics of the simulation because the merging of Poisson processes is still Poisson. The downside to this approach is information loss, though by designing custom alleles, we can track specific mutational histories like reversions.

Purifying selection simulations

The purifying selection simulations (Figure 2) utilized the base framework as mentioned above. Effective population sizes (Ne) varied from 106 to 1018 depending on the simulation. The simulation begins with an initial organism with 500,000 neutral alleles (representing a WT core genome). The population quickly grows to the carrying capacity (Ne) and follows logistic growth (see ‘Population simulations overview’). The deleterious nonsynonymous mutation rate per genome per generation is 1.01 × 10–3. These deleterious mutations have a selective disadvantage of s ≈ 3.5 × 10–5. Synonymous and neutral nonsynonymous mutations are not simulated directly as they are neutral and are instead assumed to accumulate in the population with an average rate of 3.75 × 10–4 and 1.12 × 10–4 per genome per generation, respectively.

We estimate the average dN/dS of the population by taking the average number of codon differences between two individuals in the population to be twice the average number of mutations in the population. This approximation is valid due to the lack of selective sweeps, bottlenecks, and large effective population size, which results in expected coalescent time between random individuals being 106–1018 generations (far longer than our simulations).

Variations of this basic purifying selection model are performed as described in the article, including increasing the mutation rate to 1.13 × 10–3 mutations per genome per generation (Figure 2—figure supplement 2) and the simulation of recombination (Figure 2—figure supplement 3). For simulations of recombination, we assume that transitions to the ancestral state (purging the deleterious allele) occur at a rate of 2.5 × 10–7 per codon per genome per generation and bring along a synonymous mutation (tracked via the number of recombinations to the ancestral state). This procedure does not allow for recombination to purge multiple deleterious alleles at a time; such events are unlikely given that deleterious alleles are rare and randomly distributed. We also include a model in which synonymous mutations are not included in this reversion event.

The simulation of purifying selection through bottlenecks and infrequent adaptive sweeps was performed as in the modified version of the reversion model (see ‘Reversion simulations’), though with less frequent adaptation and larger and less frequent bottlenecks.

Stop codon enrichment in the Zhao and Lieberman et al. dataset

Table S7 in Zhao et al., 2019 provides an Excel sheet detailing all observed mutations. There were 325 observed nonsynonymous mutations of which 28 were stop codons. Under a null model, there are 415 possible permutations of initial codon and codon one mutation (see ‘Code availability’) away that result in a nonsynonymous substitution of which 23 lead to a stop codon. Assuming no preference for specific mutation or initial codon, we would expect roughly 18 stop codons in this data. Under a null binomial distribution, the p-value for obtaining 28 or more is 0.015.

Reversion simulations

We simulated gut bacterial populations using a modified Wright–Fisher model (see ‘Population simulations overview’) to monitor mutation acquisition over time compared to an ancestor. Like all simulations, we begin with a single organism with 500,000 neutral alleles to represent the WT core genome. The population can grow to a capacity of 1010 via a logistic growth model. Environmental changes occur with a probability of nlociτflip per generation, triggering an average of one selective pressure per environmental change, modeled by a Poisson distribution. Population bottlenecks to 10 individuals occur independently of environmental changes with a probability of 10–4 per generation (see Figure 4—figure supplement 1 for an alternative in which bottlenecks and environmental change are correlated).

Both adaptive selective pressures and adaptive mutations are categorized into two allele types: forward and reverse. These classes are designed to enable tracking of complete mutational history and therefore recorded relative to the ancestral state rather than the current state. Thus, actual dN/dS is calculated as the sum of these mutations and observed dN/dS using their difference (plus asymptomatic dN/dS). When releasing beneficial selective pressures, their classification as forward or reverse is based on the balance of previously released selective pressures: the probability of an adaptive pressure being classified as a reverse adaptation increases as the number of forward pressures increases and is equal to the difference between the forward pressures previously released (qF) and reverse pressures previously released (qR) divided by the number of loci (i.e., qFqRnloci). All beneficial mutations have a selective advantage of sben = 0.03 (for forward or reverse). The rate at which mutations occur given an available pressure depends on whether the mutation is adapting to a forward or reverse pressure: the reversion rate is set at one-fifth the rate of the nonsynonymous per codon mutation rate (4.5 × 10–10 per generation per cell), while forward mutations are set at a rate five times higher than the nonsynonymous mutation rate because they can happen at multiple sites (25× the reversion rate; 1.1 × 10–8 per generation per cell).

This simulation treats each adaptive mutation as occupying a unique codon in the core genome for simplicity. This assumes that the ancestral allele at a given locus has been purged before the next environmental change affecting that locus (or selective pressure); as theory suggests that a beneficial mutation takes 768 generations to fix (see Appendix 1, Section 2.2), compared to 46,200 generations for pressure shifts at any locus we believe this assumption is reasonable. To confirm this theory still holds in the presence of bottlenecks and clonal interference, we tracked the average number of beneficial mutations in the population relative to the number of selective pressures released at any generation in simulations; we found only a 2% deviation between the average and expected total beneficial mutations over 2 × 106 generations.

Throughout the simulation, deleterious mutations occur at a rate of 1.01 × 10–3 mutations per genome per generation, with a selective disadvantage of s = 0.003, and can be reverted to a deleterious reversion allele class (separate from the adaptive reversion allele class).

To calculate dN/dS, we assume the simulated population could be compared to an equivalent population but with distinct mutations, allowing us to calculate the dN/dS as using double the current observed substitutions.

We assume during the reversion simulations that the ancestor has no initial transient mutations. We make this assumption for computational simplicity but the theoretical curve is equivalent whether starting from no revertible mutations or the equilibrium where half of the loci currently have forward mutations (assuming forward and reverse mutations occur at equal rates; see Appendix 1, Section 3.3).

Testing standard dN/dS software

We simulated gene sequences with selective pressures acting at specific sites for Figure 4b. For each genomic distance investigated (every 500,000 bacterial generations), we ran 10 simulations as described below, with each simulation resulting in 10 sequences derived from a branching process. In the permanent adaptations simulation (blue), adaptive mutations in the phylogeny are acquired simply and permanently. In the transient adaptations simulation (red), only more recent mutations in the same phylogeny will be visible while older mutations are obscured by reversion. Both sets of sequences were then fed to PAML v4.8 (Yang, 2007) for the estimation of dN/dS values. PAML uses maximum likelihood analysis to estimate the rate of substitution that best explains a given phylogenetic tree.

For each simulation, we generated a random 1500 bp open-reading frame and designated 10% of codons as neutral, 10% under positive selection, and 80% under purifying selection. We introduced mutations and branches across several cycles, with each cycle representing 100,000 generations. For each cycle, we assigned mutations at random according to the following probabilities: 67.5% that a nonsynonymous mutation occurred at a codon under positive selection, 12.5% for synonymous mutation at any codon, 3.75% for nonsynonymous mutation at a nonselected site (neutral), and 16.25% for no mutation. These values were selected to give a dN/dS of around 2 and to match the general ratios in the reversion model.

Two phylogenies were constructed from each simulation: both received identical mutations, but they differed in how nonsynonymous mutations at selective codon sites were visible at the end of the simulation. In the transient adaptations version of the phylogeny, nonsynonymous mutations at selective codon sites were reverted at the end of the simulation, except those that occurred within the last 500,000 generations. Reverted sites were converted to a synonymous substitution at a frequency based on the codon table (assuming an equal probability of all nucleotide mutations). Both versions of the sequences underwent multiple sequence alignment and neighbor-joining tree construction (Biopython; Cock et al., 2009). We calculated treewide dN/dS ratios using PAML v4.8’s codeML feature (Álvarez-Carretero et al., 2023), employing the M2a model to analyze site-specific selection.

Closeness to stop codons

To evaluate possible enrichment for stop codon adjacency, we focused on TTA and TCA codons. TTA and TCA are ideal for measuring the likelihood of nonsense mutations because each has two point mutations that yield a stop codon, unlike the five other redundant codons encoding for the same amino acids (for both leucine and serine, one codon is singly stop codon adjacent and the last four are not stop codon adjacent). In B. fragilis, these codons have a codon usage rate of 13% for leucine and 14% for serine.

We annotated the reference genome NCTC_9343 with Bakta v1.9 (Schwengers et al., 2021) and obtained COG categories for each gene using eggNOG v5.0 (Cantalapiedra et al., 2021). Genes that did not have a functional COG category (35%) were removed. To control for unusual outlier genes skewing results, only the 15 COG groups that had at least 50 genes were considered for enrichment analyses. For each COG category, we calculated a null codon usage proportion based on the proportion of leucine and serine codons and compared this to the actual proportion using a one-proportion Z test. To address the fact that genes in the same functional category but localized to different parts of the cell may be under different selective pressures, we analyzed cellular location classifications from PSORTb v3.02 (Yu et al., 2010) and categorized genes by the combination of function and localization. We analyzed the 15 function-location combinations with more than 50 genes. After identifying those categories that were significantly enriched, we cross-referenced which categories of genes shown to be under adaptive within-person evolution in a previous study of B. fragilis within-person evolution were in Zhao et al., 2019. Of the 16 genes reported in that paper, 8 were assigned a functional COG category/cellular location and in the reference genome (NCTC_9343). Four of these were in outer membrane inorganic ion transport and metabolism, a significant enrichment (p=1.22×104; binomial test) (Supplementary file 1, Supplementary file 2).

Code availability

Code and simulation results are available at https://github.com/PaulTorrillo/Microbiome_Reversions (copy archived at Torrillo, 2023).

Acknowledgements

We thank Daniel Fisher, two anonymous reviewers, Benjamin Good, Erik van Nimwegen, and all members of the Lieberman Lab for their thoughtful feedback on this manuscript. We also thank William Shoemaker for making the data used in this work easily accessible and for his feedback on the manuscript. This work was funded by a grant from the National Institutes of Health (1DP2GM140922-01 to TDL) and a fellowship for the National Sciences Foundation (to PAT).

Appendix 1

Supporting information

1.1 dN/dS theory

The following is meant to provide a more in-depth walkthrough of how one can build up and interpret the purifying selection model and its effect of time dependence on dN/dS. To be accessible to a wide audience and self-contained, we have included enough detail that most sections should be followable with basic knowledge of calculus.

Assume an infinite population of organisms. Consider the existence of m classes of nonsynonymous mutations. The number of mutations of the ith class in the population is represented by the variable N¯i. Each class N¯i has an associated mutation rate Ui (mutations per genome per generation per unit time) and an associated selective disadvantage si (mutation purification per unit time). In the purifying selection model, we assume that s0=0 and si>0>0. We assume that both the mutation rate and the selective disadvantage of each class remain constant throughout time. We assume this is the global population and hence no migration. We then have

dN¯i=UidtsiN¯idt (S1)

Assuming i>0, we can integrate

dNiUisiN¯i=dt.

Using u-substitution, we let u=μisiN¯ii so that dudN¯i=si which implies dN¯i=dusi so that

dusiu=dt.

Integrating both sides, we get

ln(u)si=t+C.

where C is the constant of integration. Continuing we have

u=esitsiC
μisiN¯i=esitsiC
N¯i=μisiesitsiCsi.

We can remove the constant of integration and instead replace it with the initial condition N¯i(0)

N¯i(0)=UisiesiCsi.

So, then we have that

C=ln(UisiN¯i(0))si.

So that we get

N¯i=Uisi(1esit)+N¯i(0)esit.

Since N¯i(0) should occur in a homogeneous population that is the most recent common ancestor of the individuals in the population we are observing, we assume N¯i(0) = 0, so we have

N¯i=Uisi(1esit). (S2)

If i=0, we have

N¯0=U0t.

We also assume that there are synonymous mutations S¯ that are neutral and occur with new mutations per unit time US so that

S¯=USt.

So, if we want to find N¯ (the total number of nonsynonymous mutations), that will be given by

N¯ =U0t+i1Uisi(1esit). (S3)

Now observe the following with G as the number of base pairs in the core genome, then dN=2N¯/(G×3/4) and dS=2S¯/(G×1/4). The 2 comes from the fact that there are two diverged lineages when calculating dN/dS.

dNdS=13N¯/GS¯/G=U0t+i1Uisi(1esit)USt=13(U0US+i1Ui/USsit(1esit)). (S4)

The ⅓ is for normalization. While the above form is easier to analyze in terms of actual time, it should also be noted that data is given in terms of dS rather than t so the following equivalent form can also be helpful when discussing fitting of the timescale dependence of dN/dS:

dNdS=13(U0US)+i12UiGsidS(1eGsi2USdS).

Furthermore, we can rewrite Ui=3αiUS so that we have

dNdS=α0+i12αiUSGsidS(1eGsi2USdS).

Note α0 is equivalent to α in the main text. Next, we can simply rewrite βi=Gsi2US so that we have

dNdS=α0+i1αiβidS(1eβidS). (S5)

To begin analyzing (S5), we consider the asymptotic behavior. First, we note that

limdS+α0+i1αiβidS(1eβidS)=α0=U0US. (S6)

For initial behavior, we can use L’Hopital’s rule

limdS+0α0+i1αiβidS(1eβidS)=α0+i1αi=U0+i1UiUS. (S7)

Finally, we are interested in when dN/dS ends up being in between these initial and final values. Each class i has a different midpoint at which its contribution to dN /dS is a half. In mathematical terms, this can be summarized as

αi2=αiβidS(1eβidS).

which implies

βidS2=(1eβidS).

Thus, we see that for any class of mutation, 1si will determine the midpoint (assuming some constant US and G). We can then imagine the curve as similar to a step function where the location of each step is determined by the corresponding 1si and how far the function steps down will be determined by the size of the mutation rate Ui. Finally, we can surmise that the average step-down occurs at approximately the harmonic average

s1=i1UiUNsi.

where UN=i1Ui, which represents the total non-neutral nonsynonymous mutation rate. Taking this all into account, here is exactly what is being fit in the dN/dS curve. We are fitting the equation

dNdS=α+1αβdS(1eβdS). (S8)

α is the fraction of nonsynonymous mutations that are neutral. Now β is a compound parameter and we can fit β=Gs2US which is essentially half of the harmonic average of selective disadvantages in ratio with the synonymous mutation rate per site per generation. Now we can estimate the synonymous mutation per site per generation with the following:

109mutationsbasepair×generation×3basepairssite×14synonymousmutationmutations=34×109×synonymousmutationssite×generation.

Thus, we are truly fitting

dNdS=α+34×(1α)×1092sdS(1es32×109dS). (S9)

1.2 Mutation accumulation

To analyze genetic drift and Muller’s ratchet (Haigh, 1978; Neher and Shraiman, 2012), we will provide a brief overview of the approach well suited for our work. For any nonsynonymous mutation class Ni with i>0, we can track the size of a population with 0 mutations of class Ni, denoted by variable Wi, via

dWi=siN¯i(t)WidtUiWidt. (S10)

Here, N¯i(t) is the average number of mutations of class N¯i in the population and siN¯i(t)Wi is equivalent to mean fitness. This equation reflects how every unit of time, the mutation-free class should increase by its selective advantage relative to the population though also loses members of the population to the mutation rate.

If we substitute in N¯i(t), we get

dWi=siUisi(1esit)dtUiWidt.
dWi=Ui(1esit)WidtUiWidt.
dWi=UiesitWidt.

So then

ln(Wi)=Uisiesit+C.

Assuming that Wi at time 0 is given by W0 (representing the initial population size which under our assumptions is always free of mutations and hence wild type), then

ln(W0)=Uisi+C.
C=ln(W0)Uisi.
Wi(t)=W0eUi(1esit)si. (S11)

We see that if we set W0=1, then everything can be given in terms of frequencies within the population. Asymptotically, we have that

limtW0eUi(1esit)si=W0eUisi. (S12)

Importantly, we also note the following: let W be the frequency of the wild type (mutation-free class) and W0=1. Then,

W(t)=i>0Wi(t)=ei>0Ui(1esit)si. (S13)

which implies

limtei>0Ui(1esit)si=ei>0Uisi=eUNs. (S14)

Once again using

UN=i1Ui.

And

s1=i1UiUNsi.

Furthermore, the average time for the frequency in the population without mutations of class i to reach one-half of the logarithm of the asymptotic frequency is the same when using the above simplifications. In other words, the difference in how much and how fast the wild type is lost should not depend too heavily on the distribution of selective coefficients.

Finally, we can predict if the least-loaded class will be lost to drift. We can form this prediction via the following. We assume that if the least-loaded class W drops below its steady-state frequency, eUNs, it will have advantage s every generation. If the least-loaded class has advantage s, then it has a 2s probability of extinction (see Appendix 1, Section 2.1). Hence, if we expect there to be NeeUNs individuals, we can estimate that mutation accumulation will occur when

2sNeeUNs<<1. (S15)

2.1. Extinction and fixation probability

Here, we will derive how the fixation probability of a mutation with selective advantage sben is approximately 2sben. This is a standard result that can be found in classic population genetics textbooks. It is usually derived via differential equations but can also be obtained more classically from the study of branching processes (Haldane, 1927) ,which we will use here. First, we assume that individuals reproduce via a Galton–Watson branching process with mean of 1 + sben. We also assume there are no other mutations in the population that can interfere with fixation or extinction. A fundamental result from the study of branching processes is that the extinction probability is given by the smallest non-negative root of the branching processes corresponding probability generating function, f(z). Being a Galton–Watson branching process, the probability generating function is the probability generating function of a Poisson process so

f(z)=e(1+sben)(1z).

Thus, we need to find the smallest non-negative solution, z, to

e(1+sben)(1z)z=0.

We can use a second-order Taylor approximation to approximate the exponential so we have

1+(1+sben)(z1)+((1+sben)(z1))22z=0.

which has the smallest non-negative solution of

z=sben2+1(sben+1)2.

If sben is small, then we have

z11+2sben.

which we can further approximate by doing

z11+2sben12sben12sben12sben. (S16)

If 1-2sben is the extinction probability, then the fixation probability will be 2sben.

2.2. Likelihood of reversion

We calculate the time for mutations to occur and fix in the population for demonstration in Figure 3b. For the sake of simplicity, we consider the weak-mutation strong selection regime (no clonal interference) in our theory but do include such dynamics in our simulations. Regardless, clonal interference will only minorly change the frequency of a revertant with high sben in the population as multiple backgrounds will find the same reversion when Ne is sufficiently high. The expected time to fixation given a mutation with selective coefficient sben is estimated using

(1+sben)t=Ne.

where Ne is the census population size. The above implies

tln(1+sben)=ln(Ne).

And if sben is small, then we have

t=ln(Ne)sben. (S17)

Now we need to know the expected time for a fixing mutant to arise. First, we want the probability the mutation arises and fixes, which will be given by

1(12sbenUben)Ne2NesbenUben.

This implies the expected time to arrive is approximately

12NesbenUben. (S18)

So, therefore, in the absence of clonal interference, the time for reversion to fix in the population depends more on the time for it to fix over time than the time for the reversion to arise (and is, therefore, less dependent on the mutation rate) if

12NesbenUben<ln(Ne)sben.

Alternatively written as

1<2NeUbenln(Ne). (S19)

Finally, note that the expected time to fixation is given by

12NesbenUben+ln(Ne)sben. (S20)

The second term can be larger because of bottlenecks and clonal interference.

3.1 Reversion model with fluctuating loci

The reversion model can be derived in the following way. First, one can rewrite Equation 2 as a generic negative feedback model in which mutations emerge and are purged from the population at a rate proportional to how many mutations have accumulated:

dN¯transient=RindtPoutN¯transientdt. (S21)

In this formulation, Rin is simply the rate of accumulation of transient non-neutral nonsynonymous mutations per genome per unit time and Pout is the loss rate of these mutations per unit time.

From this more general form, we can develop the reversion model. Similar to the original purifying selection model, it is possible to assume a variety of classes of mutations, but for simplicity, we only assume 1. If we make the following definitions for

Rin=nlociτflip.
Pout=2τflip.

we can link Equation S21 to a fluctuating loci model via

dN¯transientdt=RinPoutN¯transient=nlociτflip2τflipN¯transient=1τflip(nlociN¯transient)1τflipN¯transient. (S22)

Here, τflip represents the average number of generations it takes for a given selective pressure on a locus to switch direction, and nloci, the number of loci under fluctuating selective pressures. Equation S22 can be more directly linked to a fluctuating loci model as we see the rate of nonsynonymous mutations is proportional to nlociN¯transient (the number of loci unmutated) and the rate out is proportional to N¯transient the number of loci mutated.

Solving similar to the purifying selection model, we have

N¯transient(t)=nloci2(1e2tτflip). (S23)

This can then be used to obtain dN/dS

dNdS=13(U0US+nloci2USt(1e2tτflip)).

Or equivalently

dNdS=13(U0US+nlociGdS(1eGdSUSτflip)).

If we set α=U03US,β=GUSτflip, and γ=nloci3USτflip so that

dNdS=α+γβdS(1eβdS). (S24)

which is equivalent to Equation S9 with one more free parameter.

3.2 Effect of compensatory mutations

We can further extend the theory to include compensatory mutations by calculating the expected value of N¯transient as a Markov chain. First, let v be the state vector for a locus under selection where the index of each row corresponds to the number of observed mutations currently at that locus and A be the corresponding stochastic matrix with rows i and columns j. First, we need to consider the probability the state does not change on a given generation (i.e., the diagonal of A). This will be 1τflip. For every element on the diagonal of A, we thus have Ai=j=1-1τflip.

Supposing there is a state change, let us define p as the probability there is an increase in observed mutations (a forward or compensatory mutation) and 1-p, the probability there is a decrease in observed mutations (a reversion). Then, Aj=i-1=1τflipp and Aj=i+1=1τflip(1-p) (with the exception of Ai=1,j=0=1τflip). Finally, with Δt being the number of time steps and assuming v0=1 and vi>0=0, then

N¯transient(Δt)=nlocii0(AΔtv)ii. (S25)

If p>1p, compensatory mutations are gained at a linear rate and the compensatory mutation rate will factor into the asymptotic dN/dS value. Conversely, if p<1p, the number of compensatory mutations is almost surely finite by the central limit theorem and hence will not factor into the asymptotic dN/dS. Finally, if p=1-p, this is the classic elementary random walk well known to deviate from the origin with O(n). Interestingly, this is also sublinear and will not factor into asymptotic dN/dS.

3.3 Reversion model starting from equilibrium conditions

Our simulations and theory assume that the initial population starts with no forward mutations (i.e., WT) for simplicity. However, starting at equilibrium conditions does not impact the shape of the curve. The intuition here is that while starting from equilibrium enables the identification of reversions of initially transient mutations, these will be subsequently hidden by parallel evolution. Noting Equation S23, we have that for our simulations and base theory:

dN=nloci(1e2tτflip). (S26)

We can show the same result occurs starting at equilibrium conditions. A Python script confirming the algebra is available on the GitHub repository. First, let there be three states a given locus can be in. The first state (i = 1) will be initial transient mutations, the second state (i = 2) will be ancestral alleles, and the third state (i = 3) will be subsequent transient mutations. We can then build a transition matrix:

A=(1τflip001τflip1τflip1τflip01τflip1τflip). (S27)

We can then use the transition matrix to find the probability of being in a state at any given time via the matrix exponential, which is

eAt=(etτflip001e2tτflip21+e2tτflip21e2tτflip21+e2tτflip2etτflip1e2tτflip21+e2tτflip2). (S28)

Now note that we can use eAt(j,i) to find the probability of being in a given state j after starting at initial state i. First, let us calculate expected dN at a locus that started from the ancestral allele and has subsequently diverged for time t. Take note that subsequent transient mutations are assumed to be distinct (i.e., they will always lead to at least one difference when compared to a different lineage).

dN(initalancestral)=eAt(2,2)eAt(3,2)+eAt(3,2)eAt(2,2)+2eAt(3,2)eAt(3,2). (S29)

Now let us calculate dN assuming the allele was initially a transient mutation.

+eAt(1,1)eAt(2,1)+eAt(2,1)eAt(1,1)+2eAt(1,1)eAt(3,1)+2eAt(3,1)eAt(1,1). (S30)

Noting that we have assumed equal rates of forward and reverse adaptations throughout, the equilibrium would be composed of nloci/2 initially ancestral loci and of nloci/2 initially transient loci. Working through all of the algebra, we find

dN=nloci(dN(initalancestral)+dN(initaltransient))2.
dN=nloci(1e2tτflip)+nloci(1e2tτflip)2.
dN=nloci(1e2tτflip).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Tami D Lieberman, Email: tami@mit.edu.

Vaughn S Cooper, University of Pittsburgh, United States.

Detlef Weigel, Max Planck Institute for Biology Tübingen, Germany.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health 1DP2GM140922-01 to Tami D Lieberman.

  • National Science Foundation Graduate Research Fellowship Program to Paul A Torrillo.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Software, Formal analysis, Investigation, Visualization, Methodology, Writing – original draft.

Conceptualization, Supervision, Funding acquisition, Investigation, Writing – original draft, Project administration.

Additional files

Supplementary file 1. Enrichment of stop adjacent codon usage in specific gene categories of Bacteroides fragilis NCTC_9343, related to Figure 4c.

Results display TTA/TCA enrichment for statistically significant categories of genes in B. fragilis NCTC_9343. Loci from these categories and mutated in Zhao et al., 2019 (Table S7) are noted in column J.

elife-93146-supp1.xlsx (11.9KB, xlsx)
Supplementary file 2. Genes assigned to COG categories in B. fragilis NCTC_9343 and their enrichment of stop-adjacent codons, related to Figure 4c.

List of genes assigned COG categories (via eggNOG) to be used to evaluate for closeness to stop codons. Cellular location is also given (if predicted) by PSORTb. Annotations are from Bakta.

elife-93146-supp2.xlsx (213.9KB, xlsx)
MDAR checklist

Data availability

Code and results of simulations are available at Github repository https://github.com/PaulTorrillo/Microbiome_Reversions (copy archived at Torrillo, 2023).

References

  1. Álvarez-Carretero S, Kapli P, Yang Z. Beginner’s guide on the use of paml to detect positive selection. Molecular Biology and Evolution. 2023;40:msad041. doi: 10.1093/molbev/msad041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ascensao JA, Wetmore KM, Good BH, Arkin AP, Hallatschek O. Quantifying the local adaptive landscape of a nascent bacterial community. Nature Communications. 2023;14:248. doi: 10.1038/s41467-022-35677-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baker L, Brown T, Maiden MC, Drobniewski F. Silent nucleotide polymorphisms and a phylogeny for Mycobacterium tuberculosis. Emerging Infectious Diseases. 2004;10:1568–1577. doi: 10.3201/eid1009.040046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barber MF, Elde NC. Escape from bacterial iron piracy through rapid evolution of transferrin. Science. 2014;346:1362–1366. doi: 10.1126/science.1259329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Barreto HC, Sousa A, Gordo I. The landscape of adaptive evolution of a gut commensal bacteria in aging mice. Current Biology. 2023;33:5272. doi: 10.1016/j.cub.2023.11.021. [DOI] [PubMed] [Google Scholar]
  6. Barrick JE, Lenski RE. Genome dynamics during experimental evolution. Nature Reviews. Genetics. 2013;14:827–839. doi: 10.1038/nrg3564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Barroso-Batista J, Demengeot J, Gordo I. Adaptive immunity increases the pace and predictability of evolutionary change in commensal gut bacteria. Nature Communications. 2015;6:8945. doi: 10.1038/ncomms9945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bobay LM, Ochman H. Factors driving effective population size and pan-genome evolution in bacteria. BMC Evolutionary Biology. 2018;18:153. doi: 10.1186/s12862-018-1272-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Molecular Biology and Evolution. 2021;38:5825–5829. doi: 10.1093/molbev/msab293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Carmody RN, Bisanz JE, Bowen BP, Maurice CF, Lyalina S, Louie KB, Treen D, Chadaideh KS, Maini Rekdal V, Bess EN, Spanogiannopoulos P, Ang QY, Bauer KC, Balon TW, Pollard KS, Northen TR, Turnbaugh PJ. Cooking shapes the structure and function of the gut microbiome. Nature Microbiology. 2019;4:2052–2063. doi: 10.1038/s41564-019-0569-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Castillo-Ramírez S, Harris SR, Holden MTG, He M, Parkhill J, Bentley SD, Feil EJ. The impact of recombination on dN/dS within recently emerged bacterial clones. PLOS Pathogens. 2011;7:e1002129. doi: 10.1371/journal.ppat.1002129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Castle SD, Grierson CS, Gorochowski TE. Towards an engineering theory of evolution. Nature Communications. 2021;12:3326. doi: 10.1038/s41467-021-23573-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Charlesworth J, Eyre-Walker A. The other side of the nearly neutral theory, evidence of slightly advantageous back-mutations. PNAS. 2007;104:16992–16997. doi: 10.1073/pnas.0705456104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chen DW, Garud NR. Rapid evolution and strain turnover in the infant gut microbiome. Genome Research. 2022;32:1124–1136. doi: 10.1101/gr.276306.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B, de Hoon MJL. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25:1422–1423. doi: 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cooper VS, Lenski RE. The population genetics of ecological specialization in evolving Escherichia coli populations. Nature. 2000;407:736–739. doi: 10.1038/35037572. [DOI] [PubMed] [Google Scholar]
  17. Culyba MJ, Van Tyne D. Bacterial evolution during human infection: Adapt and live or adapt and die. PLOS Pathogens. 2021;17:e1009872. doi: 10.1371/journal.ppat.1009872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cvijović I, Good BH, Jerison ER, Desai MM. Fate of a mutation in a fluctuating environment. PNAS. 2015;112:E5021–E5028. doi: 10.1073/pnas.1505406112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Davies EK, Peters AD, Keightley PD. High frequency of cryptic deleterious mutations in Caenorhabditis elegans. Science. 1999;285:1748–1751. doi: 10.1126/science.285.5434.1748. [DOI] [PubMed] [Google Scholar]
  20. Davies J, Davies D. Origins and evolution of antibiotic resistance. Microbiology and Molecular Biology Reviews. 2010;74:417–433. doi: 10.1128/MMBR.00016-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Desai MM, Fisher DS. Beneficial mutation selection balance and the effect of linkage on positive selection. Genetics. 2007;176:1759–1798. doi: 10.1534/genetics.106.067678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Desai MM, Walczak AM, Fisher DS. Genetic diversity and the structure of genealogies in rapidly adapting populations. Genetics. 2013;193:565–585. doi: 10.1534/genetics.112.147157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Dewachter L, Brooks AN, Noon K, Cialek C, Clark-ElSayed A, Schalck T, Krishnamurthy N, Versées W, Vranken W, Michiels J. Deep mutational scanning of essential bacterial proteins can guide antibiotic development. Nature Communications. 2023;14:241. doi: 10.1038/s41467-023-35940-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Dingle KE, Colles FM, Wareing DR, Ure R, Fox AJ, Bolton FE, Bootsma HJ, Willems RJ, Urwin R, Maiden MC. Multilocus sequence typing system for Campylobacter jejuni. Journal of Clinical Microbiology. 2001;39:14–23. doi: 10.1128/JCM.39.1.14-23.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Drake JW. A constant rate of spontaneous mutation in DNA-based microbes. PNAS. 1991;88:7160–7164. doi: 10.1073/pnas.88.16.7160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Druelle V, Neher RA. Reversions to consensus are positively selected in HIV-1 and bias substitution rate estimates. Virus Evolution. 2023;9:veac118. doi: 10.1093/ve/veac118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Faith JJ, Guruge JL, Charbonneau M, Subramanian S, Seedorf H, Goodman AL, Clemente JC, Knight R, Heath AC, Leibel RL, Rosenbaum M, Gordon JI. The long-term stability of the human gut microbiota. Science. 2013;341:1237439. doi: 10.1126/science.1237439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Feder AF, Kline C, Polacino P, Cottrell M, Kashuba ADM, Keele BF, Hu SL, Petrov DA, Pennings PS, Ambrose Z. A spatio-temporal assessment of simian/human immunodeficiency virus (SHIV) evolution reveals A highly dynamic process within the host. PLOS Pathogens. 2017;13:e1006358. doi: 10.1371/journal.ppat.1006358. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Feil EJ, Cooper JE, Grundmann H, Robinson DA, Enright MC, Berendt T, Peacock SJ, Smith JM, Murphy M, Spratt BG, Moore CE, Day NPJ. How clonal is Staphylococcus aureus? Journal of Bacteriology. 2003;185:3307–3316. doi: 10.1128/JB.185.11.3307-3316.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Gallet R, Cooper TF, Elena SF, Lenormand T. Measuring selection coefficients below 10(-3): method, questions, and prospects. Genetics. 2012;190:175–186. doi: 10.1534/genetics.111.133454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Garud NR, Good BH, Hallatschek O, Pollard KS. Evolutionary dynamics of bacteria in the gut microbiome within and across hosts. PLOS Biology. 2019;17:e3000102. doi: 10.1371/journal.pbio.3000102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ghosh OM, Good BH. Emergent evolutionary forces in spatial models of luminal growth and their application to the human gut microbiota. PNAS. 2022;119:e2114931119. doi: 10.1073/pnas.2114931119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Gillespie JH. Population Genetics: A Concise Guide. JHU Press; 2004. [Google Scholar]
  34. Haigh J. The accumulation of deleterious genes in a population--Muller’s Ratchet. Theoretical Population Biology. 1978;14:251–267. doi: 10.1016/0040-5809(78)90027-8. [DOI] [PubMed] [Google Scholar]
  35. Haldane JBS. A mathematical theory of natural and artificial selection, part v: Selection and mutation. Mathematical Proceedings of the Cambridge Philosophical Society. 1927;23:838–844. doi: 10.1017/S0305004100015644. [DOI] [Google Scholar]
  36. Ho SYW, Phillips MJ, Cooper A, Drummond AJ. Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Molecular Biology and Evolution. 2005;22:1561–1568. doi: 10.1093/molbev/msi145. [DOI] [PubMed] [Google Scholar]
  37. Jiang X, Hall AB, Arthur TD, Plichta DR, Covington CT, Poyet M, Crothers J, Moses PL, Tolonen AC, Vlamakis H, Alm EJ, Xavier RJ. Invertible promoters mediate bacterial phase variation, antibiotic resistance, and host adaptation in the gut. Science. 2019;363:181–187. doi: 10.1126/science.aau5238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Jolley KA, Kalmusova J, Feil EJ, Gupta S, Musilek M, Kriz P, Maiden MC. Carried meningococci in the Czech Republic: a diverse recombining population. Journal of Clinical Microbiology. 2000;38:4492–4498. doi: 10.1128/JCM.38.12.4492-4498.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Jukes TH, Cantor CR. Evolution of protein molecules. Mammalian Protein Metabolism. 1969;3:21–132. doi: 10.1016/B978-1-4832-3211-9.50009-7. [DOI] [Google Scholar]
  40. Kelsic ED, Chung H, Cohen N, Park J, Wang HH, Kishony R. RNA structural determinants of optimal codons revealed by MAGE-Seq. Cell Systems. 2016;3:563–571. doi: 10.1016/j.cels.2016.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Key FM, Khadka VD, Romo-González C, Blake KJ, Deng L, Lynn TC, Lee JC, Chiu IM, García-Romero MT, Lieberman TD. On-person adaptive evolution of Staphylococcus aureus during treatment for atopic dermatitis. Cell Host & Microbe. 2023;31:593–603. doi: 10.1016/j.chom.2023.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kibota TT, Lynch M. Estimate of the genomic mutation rate deleterious to overall fitness in E. coli. Nature. 1996;381:694–696. doi: 10.1038/381694a0. [DOI] [PubMed] [Google Scholar]
  43. Kimura M. Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution. Nature. 1977;267:275–276. doi: 10.1038/267275a0. [DOI] [PubMed] [Google Scholar]
  44. Kolodny O, Schulenburg H. Microbiome-mediated plasticity directs host evolution along several distinct time scales. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2020;375:20190589. doi: 10.1098/rstb.2019.0589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Korem T, Zeevi D, Suez J, Weinberger A, Avnit-Sagi T, Pompan-Lotan M, Matot E, Jona G, Harmelin A, Cohen N, Sirota-Madi A, Thaiss CA, Pevsner-Fischer M, Sorek R, Xavier R, Elinav E, Segal E. Growth dynamics of gut microbiota in health and disease inferred from single metagenomic samples. Science. 2015;349:1101–1106. doi: 10.1126/science.aac4812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Koskella B, Brockhurst MA. Bacteria-phage coevolution as a driver of ecological and evolutionary processes in microbial communities. FEMS Microbiology Reviews. 2014;38:916–931. doi: 10.1111/1574-6976.12072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kryazhimskiy S, Plotkin JB. The population genetics of dN/dS. PLOS Genetics. 2008;4:e1000304. doi: 10.1371/journal.pgen.1000304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Labavić D, Loverdo C, Bitbol AF. Hydrodynamic flow and concentration gradients in the gut enhance neutral bacterial diversity. PNAS. 2022;119:e2108671119. doi: 10.1073/pnas.2108671119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Levin BR, Perrot V, Walker N. Compensatory mutations, antibiotic resistance and the population genetics of adaptive evolution in bacteria. Genetics. 2000;154:985–997. doi: 10.1093/genetics/154.3.985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lieberman TD, Michel J-B, Aingaran M, Potter-Bynoe G, Roux D, Davis MR, Jr, Skurnik D, Leiby N, LiPuma JJ, Goldberg JB, McAdam AJ, Priebe GP, Kishony R. Parallel bacterial evolution within multiple patients identifies candidate pathogenicity genes. Nature Genetics. 2011;43:1275–1280. doi: 10.1038/ng.997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lieberman TD. Detecting bacterial adaptation within individual microbiomes. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2022;377:20210243. doi: 10.1098/rstb.2021.0243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Liu Z, Good BH. Dynamics of bacterial recombination in the human gut microbiome. PLOS Biology. 2024;22:e3002472. doi: 10.1371/journal.pbio.3002472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Loo SL, Ong A, Kyaw W, Thibaut LM, Lan R, Tanaka MM. Nonsynonymous polymorphism counts in bacterial genomes: A comparative examination. Applied and Environmental Microbiology. 2020;87:e02002-20. doi: 10.1128/AEM.02002-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nature Genetics. 2015;47:57–64. doi: 10.1038/ng.3148. [DOI] [PubMed] [Google Scholar]
  55. Neher RA, Shraiman BI. Fluctuations of fitness distributions and the rate of Muller’s ratchet. Genetics. 2012;191:1283–1293. doi: 10.1534/genetics.112.141325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Nowick K, Walter Costa MB, Höner Zu Siederdissen C, Stadler PF. Selection pressures on RNA sequences and structures. Evolutionary Bioinformatics Online. 2019;15:1176934319871919. doi: 10.1177/1176934319871919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Ongenae V, Mabrouk AS, Crooijmans M, Rozen D, Briegel A, Claessen D. Reversible bacteriophage resistance by shedding the bacterial cell wall. Open Biology. 2022;12:210379. doi: 10.1098/rsob.210379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Peterson GI, Masel J. Quantitative prediction of molecular clock and ka/ks at short timescales. Molecular Biology and Evolution. 2009;26:2595–2603. doi: 10.1093/molbev/msp175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Poret AJ, Schaefers M, Merakou C, Mansour KE, Lagoudas GK, Cross AR, Goldberg JB, Kishony R, Uluer AZ, McAdam AJ, Blainey PC, Vargas SO, Lieberman TD, Priebe GP. De Novo Mutations Mediate Phenotypic Switching in an Opportunistic Human Lung Pathogen. bioRxiv. 2024 doi: 10.1101/2024.02.06.579193. [DOI]
  60. Robert L, Ollion J, Robert J, Song X, Matic I, Elez M. Mutation dynamics and fitness effects followed in single cells. Science. 2018;359:1283–1286. doi: 10.1126/science.aan0797. [DOI] [PubMed] [Google Scholar]
  61. Rocha EPC, Smith JM, Hurst LD, Holden MTG, Cooper JE, Smith NH, Feil EJ. Comparisons of dN/dS are time dependent for closely related bacterial genomes. Journal of Theoretical Biology. 2006;239:226–235. doi: 10.1016/j.jtbi.2005.08.037. [DOI] [PubMed] [Google Scholar]
  62. Schwengers O, Jelonek L, Dieckmann MA, Beyvers S, Blom J, Goesmann A. Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification. Microbial Genomics. 2021;7:000685. doi: 10.1099/mgen.0.000685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Sender R, Fuchs S, Milo R. Revised estimates for the number of human and bacteria cells in the body. PLOS Biology. 2016;14:e1002533. doi: 10.1371/journal.pbio.1002533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Shoemaker WR, Chen D, Garud NR. Comparative population genetics in the human gut microbiome. Genome Biology and Evolution. 2022;14:evab116. doi: 10.1093/gbe/evab116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Shopsin B, Drlica-Wagner A, Mathema B, Adhikari RP, Kreiswirth BN, Novick RP. Prevalence of agr dysfunction among colonizing Staphylococcus aureus strains. The Journal of Infectious Diseases. 2008;198:1171–1174. doi: 10.1086/592051. [DOI] [PubMed] [Google Scholar]
  66. Snitkin ES, Zelazny AM, Gupta J, NISC Comparative Sequencing Program. Palmore TN, Murray PR, Segre JA. Genomic insights into the fate of colistin resistance and Acinetobacter baumannii during patient treatment. Genome Research. 2013;23:1155–1162. doi: 10.1101/gr.154328.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Sousa A, Ramiro RS, Barroso-Batista J, Güleresi D, Lourenço M, Gordo I. Recurrent reverse evolution maintains polymorphism after strong bottlenecks in commensal gut bacteria. Molecular Biology and Evolution. 2017;34:2879–2892. doi: 10.1093/molbev/msx221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Sukhithasri V, Nisha N, Biswas L, Anil Kumar V, Biswas R. Innate immune recognition of microbial cell wall components and microbial strategies to evade such recognitions. Microbiological Research. 2013;168:396–406. doi: 10.1016/j.micres.2013.02.005. [DOI] [PubMed] [Google Scholar]
  69. Tajima F. Infinite-allele model and infinite-site model in population genetics. Journal of Genetics. 1996;75:27–31. doi: 10.1007/BF02931749. [DOI] [Google Scholar]
  70. Tataru P, Simonsen M, Bataillon T, Hobolth A. Statistical inference in the wright-fisher model using allele frequency data. Systematic Biology. 2017;66:e30–e46. doi: 10.1093/sysbio/syw056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Torrance EL, Burton C, Diop A, Bobay LM. Evolution of homologous recombination rates across bacteria. PNAS. 2024;121:e2316302121. doi: 10.1073/pnas.2316302121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Torrillo P. Microbiome reversions. swh:1:rev:70cb1d1484e57a0aa5cfdd719c705515d21e818eSoftware Heritage. 2023 https://archive.softwareheritage.org/swh:1:dir:cc5db861b46ab1ead5c8a80bf8a29598a614021a;origin=https://github.com/PaulTorrillo/Microbiome_Reversions;visit=swh:1:snp:c07f8832389fa1e5285bde66070bc62e19c69633;anchor=swh:1:rev:70cb1d1484e57a0aa5cfdd719c705515d21e818e
  73. Trindade S, Perfeito L, Gordo I. Rate and effects of spontaneous mutations that affect fitness in mutator Escherichia coli. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2010;365:1177–1186. doi: 10.1098/rstb.2009.0287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Vigué L, Tenaillon O. Predicting the effect of mutations to investigate recent events of selection across 60,472 Escherichia coli strains. PNAS. 2023;120:e2304177120. doi: 10.1073/pnas.2304177120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Wang RJ, Al-Saffar SI, Rogers J, Hahn MW. Human generation times across the past 250,000 years. Science Advances. 2023;9:eabm7047. doi: 10.1126/sciadv.abm7047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Yang Z, Nielsen R. Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Molecular Biology and Evolution. 2000;17:32–43. doi: 10.1093/oxfordjournals.molbev.a026236. [DOI] [PubMed] [Google Scholar]
  77. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Molecular Biology and Evolution. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
  78. Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Sahinalp SC, Ester M, Foster LJ, Brinkman FSL. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics. 2010;26:1608–1615. doi: 10.1093/bioinformatics/btq249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Zhao S, Lieberman TD, Poyet M, Kauffman KM, Gibbons SM, Groussin M, Xavier RJ, Alm EJ. Adaptive evolution within gut microbiomes of healthy people. Cell Host & Microbe. 2019;25:656–667. doi: 10.1016/j.chom.2019.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Zhao S, Dai CL, Evans ED, Lu Z, Alm EJ. Tracking Strains Predicts Personal Microbiomes and Reveals Recent Adaptive Evolution. bioRxiv. 2020 doi: 10.1101/2020.09.14.296970. [DOI]

Editor's evaluation

Vaughn S Cooper 1

This valuable study addresses the interpretation of patterns of synonymous and nonsynonymous diversity in microbial genomes. The authors present solid theoretical and computational evidence that adaptive mutations that revert the amino acids to an earlier state can significantly impact the observed ratios of synonymous and nonsynonymous mutations in human commensal bacteria. This article will be of interest to microbiologists with a background in evolution and to researchers studying the human microbiome.

Decision letter

Editor: Vaughn S Cooper1
Reviewed by: Daniel S Fisher2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Decision letter after peer review:

Thank you for submitting your article "Reversions mask the contribution of adaptive evolution in microbiomes" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Detlef Weigel as the Senior Editor.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions (for the authors):

1) Address how recombination affects assumptions of your model, which invokes Muller's Ratchet to necessitate back–mutation. In effect, recombination of the wild–type allele is equivalent to back–mutation, but recombination can also revert or incorporate multiple linked variants at once and alter the assumptions of your model and analyses. As recombination tends to bring in many more mutations overall than occur in regions of a pair of genomes with asexual ancestry, the effects cannot be neglected. To what extent can this give rise to a similar dependence of dN/dS on dS as seen in the data?

2) Clarify the underlying assumptions of the model (e.g., the fitting parameters nloci and Tadapt), its theoretical results, and experimental applications. In addition, it clarifies the environmental conditions used for simulations, which could involve colonization of new hosts or environments, affect selection at many sites at once, and lead to clonal interference.

3) Evaluate the alternative that many compensatory mutations that could phenotypically revert an earlier mutation might overcome the exact specific reversion mutation in greater detail, including whether clonal interference between compensatory and reversion mutations would result in the mutations with the largest s – eg, as mentioned, reversion of a stop codon – being much more likely to sweep. Additional discussion of this important issue would be valuable.

Reviewer #1 (Recommendations for the authors):

This study makes a substantial contribution to our understanding of the molecular evolutionary dynamics of microbial genomes by proposing a model that incorporates relatively frequent adaptive reversion mutations. In many ways, this makes sense from my own experience with evolutionary genomic data of microbes, where reversions are surprisingly familiar as evidence of the immense power of selection in large populations.

One criticism is the reliance on one major data set of B. fragilis to test fits of these models, but this is relatively minor in my opinion and can be caveated by discussion of other relevant datasets for parallel investigation.

Another point is that this problem isn't as new as the manuscript indicates, see for example https://journals.asm.org/doi/10.1128/aem.02002–20.

Nonetheless, the paper succeeds by both developing theory and offering concrete parameters to illustrate the magnitudes of the problems that distinguish competing ideas, for example, the risk of mutational load posed in the absence of frequent back mutation.

Please expand the discussion of the novelty of incorporating (or overlooking) reversion mutations as well as the broader problems of dN/dS metrics for population–wide data where fixation is rare.

I'm otherwise enthusiastic about this study.

Reviewer #2 (Recommendations for the authors):

This manuscript asks how different forms of selection affect the patterns of genetic diversity in microbial populations. One popular metric used to infer signatures of selection is dN/dS, the ratio of nonsynonymous to synonymous distances between two genomes. Previous observations across many bacterial species have found dN/dS decreases with dS, which is a proxy for the divergence time. The most common interpretation of this pattern was proposed by Rocha et al. (2006), who suggested the excess in nonsynonymous mutations on short divergence times represent transient deleterious mutations that have not yet been purged by selection.

In this study, the authors propose an alternative model based on the population structure of human gut bacteria, in which dN is dominated by selective sweeps of SNPs that revert previous mutations within local populations. The authors argue that contrary to standard population genetics models, which are based on the population dynamics of large eukaryotes, the large populations in the human gut mean that reversions may be quite common and may have a large impact on evolutionary dynamics. They show that such a model can fit the decrease of dN/dS in time at least as well as the purifying selection model.

Strengths

The main strength of the manuscript is to show that adaptive sweeps in gut microbial populations can lead to small dN/dS. While previous work has shown that using dN/dS to infer the strength of selection within a population is problematic (see Kryazhimskiy and Plotkin, 2008, cited in the paper) the particular mechanism proposed by the authors is new to my knowledge. In addition, despite the known caveats, dN/dS values are still routinely reported in studies of microbial evolution, and so their interpretation should be of considerable interest to the community.

The authors provide compelling justification for the importance of adaptive reversions and make a good case that these need to be carefully considered by future studies of microbial evolution. The authors show that their model can fit the data as well as the standard model based on purifying selection and the parameters they infer appear to be plausible given known data. More generally, I found the discussion on the implications of traditional population genetics models in the context of human gut bacteria to be a valuable contribution of the paper.

Weaknesses

The authors argue that the purifying selection model would predict a gradual loss in fitness via Muller's ratchet. This is true if recombination is ignored, but this assumption is inconsistent with the data from Garud, et al. (2019) cited in the manuscript, who showed a significant linkage decrease in the bacteria also used in this study.

I also found that the data analysis part of the paper added little new to what was previously known. Most of the data comes directly from the Garud et al. study and the analysis is very similar as well. Even if other appropriate data may not currently be available, I feel that more could be done to test specific predictions of the model with more careful analysis.

Finally, I found the description of the underlying assumptions of the model and the theoretical results difficult to understand. I could not, for example, relate the fitting parameters nloci and Tadapt to the simulations after reading the main text and the supplement. In addition, it was not clear to me if simulations involved actual hosts or how the changes in selection coefficients for different sites was implemented. Note that these are not simply issues of exposition since the specific implementation of the model could conceivably lead to different results. For example, if the environmental change is due to the colonization of a different host, it would presumably affect the selection coefficients at many sites at once and lead to clonal interference. Related to this point, it was also not clear that the weak mutation strong selection assumption is consistent with the microscopic parameters of the model. The authors also mention that "superspreading" may somehow make a difference to the probability of maintaining the least loaded class in the purifying selection model, but what they mean by this was not adequately explained.

I see three main issues that would significantly improve the manuscript if addressed. The first is the issue of recombination, which undermines using Muller's ratchet to motivate the proposed model. Including the effects of recombination in the model would be a significant project in itself so I do not expect the authors to do that, but I do believe it is important that the issue be seriously addressed. In any case, I think the fact that the values of s implied by the purifying selection model are so small is already problematic, for many of the reasons discussed in the paper, so I would suggest focusing more on that. It might be more useful perhaps to demonstrate in simulations how even rare adaptive sweeps would affect the purging of deleterious mutations with small s, which is mentioned in the manuscript but not developed.

The second issue I found is the data analysis. It would be useful to check some of the hypotheses raised in the text directly from the data. For example, the authors mention that premature stop codons may preferentially be subject to reversions, which could be verified. Even if these analyses do not directly support the model, the authors can still comment on them and I believe it would significantly add to the paper.

Finally, I found the presentation of the model and simulations to be confusing and feel that it needs to be made significantly clearer for readers to be able to understand. In general, the supplement had considerable detail on standard results (Sec. 2.1 and 2.2 for example), but was very vague on the actual model itself (Sec. 3.1 and 3.2). If anything, this emphasis should be reversed. I've added some more specific comments below which might help with this issue. It should also be noted that the inferred values of s from any simple model will be effective parameters that may not be directly related to measurable parameters from experiments or observations. To give a very simple example, if selection coefficients are varying quickly in time, the relevant s for the accumulation of mutations would be the time average s, which may be considerably smaller than the typical value.

Reviewer #3 (Recommendations for the authors):

The diversity of bacterial species in the human gut microbiome is widely known, but the extensive diversity within each species is far less appreciated. Strains found in individuals on opposite sides of the globe can differ by as little as handfuls of mutations, while strains found in an individual's gut, or in the same household, might have a common ancestor tens of thousands of years ago. What are the evolutionary, ecological, and transmission dynamics that established and maintain this diversity?

The time, T, since the common ancestor of two strains, can be directly inferred by comparing their core genomes and finding the fraction of synonymous (non–amino acid changing) sites at which they differ: dS. With the per–site per–generation mutation rate, μ, and the mean generation times roughly known, this directly yields T (albeit with substantial uncertainty of the generation time.) A traditional way to probe the extent to which selection plays a role is to study pairs of strains and compare the fraction of non–synonymous (amino acid or stop–codon changing) sites, dN, at which the strains differ with their dS. Small dN/dS, as found between distantly related strains, is attributed to purifying selection against deleterious mutations dominating over mutations that have driven adaptive evolution. Large dN/dS as found in laboratory evolution experiments, is caused by beneficial mutations that quickly arise in large bacterial populations, and, with substantial selective advantages, per generation, can rise to high abundance fast enough that very few synonymous mutations arise in the lineages that take over the population.

A number of studies (including by Lieberman's group) have analyzed large numbers of strains of various dominant human gut species and studied how dN/dS varies. Although between closely related strains the variations are large – often much larger than attributable to just statistical variations – a systematic trend from dN/dS around unity or larger for close relatives to dN/dS ~ 0.1 for more distant relatives has been found in enough species that it is natural to conjecture a general explanation.

The conventional explanation is that, for close relatives, the effects of selection over the time since they diverged has not yet purged weakly deleterious mutations that arose by chance – roughly mutations with sT<1 – while since the common ancestor of more distantly related strains, there is plenty of time for most of those that arose to have been purged.

Torrillo and Lieberman have carried out an in–depth – sophisticated and quantitative – analysis of models of some of the evolutionary processes that shape the dependence of dN/dS on dS – and hence on their divergence time, T. They first review the purifying selection model and show that – even ignoring its inability to explain dN/dS > 1 for many closely related pairs – the model has major problems explaining the crossover from dN/dS somewhat less than unity to much smaller values as dS goes through – on a logarithmic scale – the 10^–4 range. The first problem, already seen in the infinite–population–size deterministic model, is that a very large fraction of non–synonymous mutations would have to have deleterious s's in the 10^–5 per generation range to fit the data (and a small fraction effectively neutral). As the s's are naturally expected (at least in the absence of quantitative analysis to the contrary) to be spread out over a wide range on a logarithmic scale of s, this seems implausible. But the authors go further and analyze the effects of fluctuations that occur even in the very large populations: ~ >10^12 bacteria per species in one gut, and 10^10 human guts globally. They show that Muller's ratchet – the gradual accumulation of weakly deleterious mutations that are not purged by selection – leads to a mutational meltdown with the parameters needed to fit the purifying selection model. In particular, with N_e the "effective population size" that roughly parametrizes the magnitude of stochastic birth–death and transition fluctuations, and U the total mutation rate to such deleterious mutations this occurs for U/s > log(sN_e) which they show would obtain with the fitted parameters.

Torrillo and Lieberman promise an alternate model: that there are a modest number of "loci" at which conditionally beneficial mutations can occur that are beneficial in some individual guts (or other environmental conditions) at some times, but deleterious in other (or the same) gut at other times. With the ancestors of a pair of strains having passed through one too many individuals and transmissions, it is possible for a beneficial mutation to occur and rise in the population, only later to be reverted by the beneficial inverse mutation. With tens of loci at which this can occur, they show that this process could explain the drop of dN/dS from short times – in which very few such mutations have occurred – to very long times by which most have flipped back and forth so that a random pair of strains will have the same nucleotide at such sites with 50% probability. Their qualitative analysis of a minimally simple model of this process shows that the bacterial populations are plenty big enough for such specific mutations to occur many times in each individual's gut, and with modest beneficials, to takeover. With a few of these conditionally beneficial mutations or reversions occurring during an individuals lifetime, they get a reasonably quantitative agreement with the dN/dS vs dS data with very few parameters. A key assumption of their model is that genetically exact reversion mutations are far more likely to takeover a gut population – and spread – than compensatory mutations which have a similar phenotypic–reversion effect: a mutation that is reverted does not show up in dN, while one that is compensated by another shows up as a two–mutation difference after the environment has changed twice.

Strengths:

The quantitative arguments made against the conventional purifying selection model are highly compelling, especially the consideration of multiple aspects that are usually ignored, including – crucially – how Muller's ratchet arises and depends on the realistic and needed–to–fit parameters; the effects of bottlenecks in transmission and the possibility that purifying selection mainly occurs then; and complications of the model of a single deleterious s, to include a distribution of selective disadvantages. Generally, the author's approach of focusing on the simplest models with as few as possible parameters (some roughly known), and then adding in various effects one–by–one, is outstanding and, in being used to analyze environmental microbial data, exceptional.

The reversion model the authors propose and study is a simple general one and they again explore carefully various aspects of it – including dynamics within and between hosts – and the consequent qualitative and quantitative effects. Again, the quantitive analysis of almost all aspects is exemplary. Although it is hard to make a compelling guess of the number of loci that are subject to alternating selection on the needed time–scales (years to centuries) they make a reasonable argument for a lower bound in terms of the number of known invertible promoters (that can genetically switch gene expression on and off).

Weaknesses:

The primary weakness of this paper is one that the author's are completely open about: the assumption that, collectively, any of possibly–many compensatory mutations that could phenotypically revert an earlier mutation, are less likely to arise and takeover local populations than the exact specific reversion mutation. While detailed analysis of this is, reasonably enough, beyond the scope of the present paper, more discussion of this issue would add substantially to this work. Quantitatively, the problem is that even a modest number of compensatory mutations occurring as the environmental pressures change could lead to enough accumulation of non–synonymous mutations that they could cause dN/dS to stay large – easily >1 – to much larger dS than is observed. If, say, the appropriate locus is a gene, the number of combinations of mutations that are better in each environment would play a role in how large dN would saturate to in the steady state (1/2 of n_loci in the author's model). It is possible that clonal interference between compensatory and reversion mutations would result in the mutations with the largest s – eg, as mentioned, reversion of a stop codon – being much more likely to take over, and this could limit the typical number of differences between quite well–diverged strains. However, the reversion and subsequent re–reversion would have to both beat out other possible compensatory mutations – naively less likely. I recommend that a few sentences in the Discussion be added on this important issue along with comments on the more general puzzle – at least to this reader! – as to why there appear to be so little adaptive genetic changes in core genomes on time scales of human lifetimes and civilization.

An important feature of gut bacterial evolution that is now being intensely studied is only mentioned in passing at the end of this paper: horizontal transfer and recombination of core genetic material. As this tends to bring in many more mutations overall than occur in regions of a pair of genomes with asexual ancestry, the effects cannot be neglected. To what extent can this give rise to a similar dependence of dN/dS on dS as seen in the data? Of course, such a picture begs the question as to what sets the low dN/dS of segments that are recombined –– often from genetic distances comparable to the diameter of the species.

The main substantive criticisms are summarized in "Weaknesses" section. Here are a few more minor points, and then presentational and pedagogical comments.

1) The analysis of the effects of a distribution of values of s is not convincing. A simple argument that for each s there is a crossover dS ~ mu/s, and approximating the corresponding portion of dN/dS vs dS as a step function at that crossover, gives a better approximation that using the harmonic mean: the rough distribution of s needed to fit, can then be eyeballed from the data – roughly by derivative d[dN/dS]/dS. (This point is roughly made – although hard to parse and confounded by uninformative details – in the Supplement).

2) For Muller's ratchet and deleterious loading: The time dependence computed from the deterministic analysis is rather misleading as it assumes – unreasonably – that the common ancestor had no deleterious mutations. The main point is the small fraction of the least–loaded subpopulation, W(t=infinity), being very small. The comparison of this with the appropriate N_e – whether N_e s exp(–U/s) is small or large – determining whether Muller's ratchet operates is the important point which needs emphasizing more: and this should be in the main text. [Note that for Muller's ratchet, using the harmonic mean s is roughly correct.] I would replace existing Eq(5) with an equation with this comparison in it (now just loosely in words around L 163) and refer to Ref(29) for analyses. Explain that if N is large enough or U small enough, reversions of the deleterious mutations will prevent Muller's ratchet.

Additionally, make clear that the model assumes additive effects of the deleterious mutations, and note that the average burden is small – and thus additive probably reasonable – even when large enough that Muller's ratchet is effective.

3) Some more discussion of "effective population size" is needed when introduced, in addition to that in the Discussion. This is especially true here, since – as noted in Discussion – it is often taken from the dS of a species via N_e = /mu: simply the overall T_MRCA in units of generations. The way the present paper uses N_e, is how it appears in modeling and thus controls several actual biological quantities: how much mutations can change abundance by drift, and how often beneficial mutations occur. This is thus a "real" effective population size. It is important to note, however, that with clonal interference (both in individuals and across human populations) different "N_e"'s are involved for different processes. A detailed discussion is not needed, but a couple of sentences (and a reference) in the main text is – around L165 – and a note in caption of Figure 2.

4) What is being assumed in various places (eg Figure 2) for the generation time is unclear.

5) The heuristic explanation for the time dependence of the reversion model is missing. T_adapt is rather confusing. A basic quantity – call it eg tau_change – is how often the conditions for a particular transient mutation change from beneficial to deleterious. This is the time that corresponds to the dS at which the crossover in the dN/dS data occurs – and indeed what is in the equations. T_adapt = tau_change/n_loci is the typical time between when any of these mutations switch. This determines dN. The distinction needs explaining.

6) The meaning of "loci" in counting the number of possible reverting mutations is unclear. Presumably what is meant is that at each locus there may be a number of roughly equivalent mutations – with each changing the phenotype so the others would no longer be beneficial – but to revert the phenotype needs to revert the specific one that occurred. This needs to be made clearer. Also, note that this would change the numbers somewhat as the beneficial mutation rate would be higher, but it has shown that analysis is not sensitive to that factor because of the large populations. Generally, a clearer statement that the beneficial s is the important quantity setting the time scales – log(Ns)/s – rather than the time for mutation to arise, is needed in the main text.

7) Effects of recombination: It would be good to add some more discussion on whether the effects of recombination alone (with an assumption about the effects of purifying selection on long time scales) could fit the data – as pointed out in "Weaknesses" section.

Is reversion by recombination likely to have a substantial role in dN/dS? I do not find the suggestion that multiple mutations could be simultaneously reverted by recombination at all believable. The further away the segment is, the more implausible is the scenario that compensatory mutations – let all other beneficial mutations with epistasis between them – have not occurred.

8) Generally, for each of the models and extensions used, make clear the number and set of parameters, and the assumptions. (Eg Figure 3)

Presentation:

Although the overall scenario, points made, and many of the explanations are quite clear, aspects of the presentation, figures, and captions are unclear or sloppy and detract from the paper's readability and potential impact. More specific and clear statements of the assumptions and model features are needed. And which of the mathematics to include or not in the main text, the supplement, or not at all, could be improved.

Notation:

– Mutation rates: mu_S etc are more often used per site: Here they are used for genome–wide instead of the more conventional – and better – U_S etc

– Use some notation for s of beneficial mutants – eg s_adapt.

– T_adapt see above

– N is used both for population size and numbers. With the use of n_loci, might be better to use n(t)'s – perhaps best with an overbar – for the average number of mutations that accumulate with time.

In many figures, notations such as "1.8e+01" or, in one, "quadrillion" are used: Use standard notation and, eg 10^1 mark instead of 1.8 times this.

Figure 1:

It is not clear what is meant by "adaptive genes" in the Zhao et al. data shown.

In the caption, say that crossover occurs at roughly dS ~ mu/s : this makes the role of s clear. In general, this kind of heuristic note is useful – independently of whether or not a reader is mathematically inclined!

L98: Make clear N_transient etc are averages. Spell out that for dS << 1, dS ~ 2 U_S T_MRCA, and this regime obtains for all the data.

L127: Spell out mu_S (or better U_S) in terms of the number of sites and generation time.

L150: the minus sign in the exponent is not visible (similarly in some other equations)

L187–194: It is unclear what is meant by "superspreaders" and what the 30,000X factor means. Is it that strains from one individual would have to spread to 30,000 others before reversions etc? The argument against superspreaders could be expanded beyond "seems unlikely".

L204–213: Equation (6) and surrounding text on general process, is not needed and confusing. Instead, the left side and last equality in Eq S23 should be pulled into the main text. Then can go directly to Eq(7) with an explanation that the n_loci/2 factor is simply the average number of differences between distantly related bacteria.

L259: Is the fitted T_adapt in generations? Also, give in years. And don't pretend to precision as "840", and later "110 years": give ranges.

Figure 4: (a): note in the caption that actual dN/dS > 1 and why.

Specify n_loci used.

(b) "discounting beneficial mutations" is unclear: are or aren't the reversions included in this statement?

L287: Explain what PAML is/assumes.

L321: Note that in the simulations with genetic classes, the dN, etc between two individuals cannot be directly kept track of. Correct?

L348: Meaning of "negative feedback" is unclear.

L353: As per main comments: more is needed on how tiny the rate of compensatory or other beneficial mutations would have to be to not raise α hugely.

L365: Explain "pseudogenized".

Methods and Theory:

Generally, much more than needed deriving standard results.

Wright–Fisher simulations: The notation here is very bad, with nearby letters being used for very different things – eg why not t instead of i which is in any case elsewhere used to mean something different? – and j_k used as it is but with j being an index. And generally, it is overly long and unclear.

L 673: what is RMSD?

L678: Explain PAML

dN/dS Theory: Much too much detail as most very standard Eg: certainly don't need all the derivation leading up to L742 equation.

L777: Make clear that α used in the main text is that with 0 subscript here.

L791–815 is confusing, not needed, and probably not valid. The point made in L817 is all that is needed – see comment (1) above.

L860: Say that sNW factor is mean fitness.

L895 is a classic result and not all derivation is needed.

L912: "has 2s probability of extinction" is unclear what it means: I presume something about the rate of ratchet. L915 equations, as noted in (2) should be in the main text.

Extinction and Fixation Probability:

This is all standard and not needed beyond quoting results.

L980: "Mutation is accessible if… " is very misleading. If L982 is satisfied, then there will be clonal interference between multiple mutations, and much of the analysis changes. But whether satisfied or not, the mutations can arise and fix: what changes is which will dominate the time to fixation as in L989

L1005: As noted, should pull part of this into the main text.

Implied Amount of Superspreading: extremely unclear. (and S26 is not even an equation)

SI Figures: There are probably more than needed and captions are often insufficient. Would be better to have a few paragraphs of text explaining them collectively, and the figures interspersed in the text.

Figure S4: What is "theory" curve?

Figure S5: Very unclear what was done or what modeled. Why is every 100,000 generations underestimated of bottleneck? What does "beneficial mutations are released" mean.

eLife. 2024 Sep 6;13:e93146. doi: 10.7554/eLife.93146.sa2

Author response


Essential revisions (for the authors):

1) Address how recombination affects assumptions of your model, which invokes Muller's Ratchet to necessitate back–mutation. In effect, recombination of the wild–type allele is equivalent to back–mutation, but recombination can also revert or incorporate multiple linked variants at once and alter the assumptions of your model and analyses. As recombination tends to bring in many more mutations overall than occur in regions of a pair of genomes with asexual ancestry, the effects cannot be neglected. To what extent can this give rise to a similar dependence of dN/dS on dS as seen in the data?

We now discuss the effect of recombination on the purifying selection model on page 6 and in Figure 2—figure supplement 3. In short, we now show that reasonable levels of recombination cannot rescue the purifying selection model from Muller’s ratchet when s is so low and the influx of new deleterious mutations is so high. We thank the reviewers for prompting this improvement.

2) Clarify the underlying assumptions of the model (e.g., the fitting parameters nloci and Tadapt), its theoretical results, and experimental applications. In addition, it clarifies the environmental conditions used for simulations, which could involve colonization of new hosts or environments, affect selection at many sites at once, and lead to clonal interference.

More clarification has been given in the text and the methods regarding the reversion model simulations and fitting. We initially left too much of the implementation to the reader reviewing the Github code, and we apologize for this oversight.

3) Evaluate the alternative that many compensatory mutations that could phenotypically revert an earlier mutation might overcome the exact specific reversion mutation in greater detail, including whether clonal interference between compensatory and reversion mutations would result in the mutations with the largest s – eg, as mentioned, reversion of a stop codon – being much more likely to sweep. Additional discussion of this important issue would be valuable.

We have now added simulations that include the possibility of compensatory mutations, and we have updated both the text and added an additional Figure 4—figure supplement 3. In short, we show that adding compensatory mutations that do not completely restore original fitness (as would be expected following a loss-of-function mutation) leads to a strengthening of the reversion model and a lower required number of locally adaptive loci. We thank the reviewers for prompting this improvement.

Reviewer #1 (Recommendations for the authors):

This study makes a substantial contribution to our understanding of the molecular evolutionary dynamics of microbial genomes by proposing a model that incorporates relatively frequent adaptive reversion mutations. In many ways, this makes sense from my own experience with evolutionary genomic data of microbes, where reversions are surprisingly familiar as evidence of the immense power of selection in large populations.

One criticism is the reliance on one major data set of B. fragilis to test fits of these models, but this is relatively minor in my opinion and can be caveated by discussion of other relevant datasets for parallel investigation.

We analyze data from 10 species of the Bacteroidales family, and we compare it to a dataset of Bacteroides fragilis. We have now added a reference to a recent manuscript from our group showing phenotypic alteration by reversion of a stop codon and further breaking of the same pathway through stop codons in other genes in Burkholderia dolosa on page 7, and have added a new analysis of codon usage in support of the reversion model on page 9-10.

We have chosen not to analyze other species as there are no large data sets with rigorous and evenly-applied quality control across scales. We anticipate the reversion model would be able to fit the data in these cases. We now note that this work remains to be done in the discussion.

Another point is that this problem isn't as new as the manuscript indicates, see for example https://journals.asm.org/doi/10.1128/aem.02002–20.

Loo et al. puts forward an explanation similar to the purifying model proposed by Rocha et al, which we refute here. Quoting from Loo et al: “Our results confirm the observation that nonsynonymous SNPs are relatively elevated under shorter time periods and that purifying selection is more apparent over longer periods or during transmission.” While there is some linguistic similarity between the weak purifying model and our model of strong local adaptation model and strong adaptive reversion, we believe that the dynamical and predictive implications suggested by the reversion model are an important conceptual leap and correction to the literature. We now cite Loo et al. and additional works cited therein. We have updated the abstract, introduction, and discussion to further emphasize the distinction of the reversion model from previous models: namely the implication of the reversion model that long-time scale dN/dS hides dynamics.

Nonetheless, the paper succeeds by both developing theory and offering concrete parameters to illustrate the magnitudes of the problems that distinguish competing ideas, for example, the risk of mutational load posed in the absence of frequent back mutation.

Please expand the discussion of the novelty of incorporating (or overlooking) reversion mutations as well as the broader problems of dN/dS metrics for population–wide data where fixation is rare.

We now better emphasize the distinction between the weak purifying selection model and reversion model in both the abstract, results, and discussion. We further emphasize that this work inverts the conclusion from prior works -- that dN/dS gives misleading interpretations of dynamics on long time scales, and that short-term measures (intra-population) of dN/dS may be more informative than long-term (inter-population) measures of dN/dS for dynamical inferences. Thank you for instigating this improvement.

I'm otherwise enthusiastic about this study.

Reviewer #2 (Recommendations for the authors):

This manuscript asks how different forms of selection affect the patterns of genetic diversity in microbial populations. One popular metric used to infer signatures of selection is dN/dS, the ratio of nonsynonymous to synonymous distances between two genomes. Previous observations across many bacterial species have found dN/dS decreases with dS, which is a proxy for the divergence time. The most common interpretation of this pattern was proposed by Rocha et al. (2006), who suggested the excess in nonsynonymous mutations on short divergence times represent transient deleterious mutations that have not yet been purged by selection.

In this study, the authors propose an alternative model based on the population structure of human gut bacteria, in which dN is dominated by selective sweeps of SNPs that revert previous mutations within local populations. The authors argue that contrary to standard population genetics models, which are based on the population dynamics of large eukaryotes, the large populations in the human gut mean that reversions may be quite common and may have a large impact on evolutionary dynamics. They show that such a model can fit the decrease of dN/dS in time at least as well as the purifying selection model.

Strengths

The main strength of the manuscript is to show that adaptive sweeps in gut microbial populations can lead to small dN/dS. While previous work has shown that using dN/dS to infer the strength of selection within a population is problematic (see Kryazhimskiy and Plotkin, 2008, cited in the paper) the particular mechanism proposed by the authors is new to my knowledge. In addition, despite the known caveats, dN/dS values are still routinely reported in studies of microbial evolution, and so their interpretation should be of considerable interest to the community.

The authors provide compelling justification for the importance of adaptive reversions and make a good case that these need to be carefully considered by future studies of microbial evolution. The authors show that their model can fit the data as well as the standard model based on purifying selection and the parameters they infer appear to be plausible given known data. More generally, I found the discussion on the implications of traditional population genetics models in the context of human gut bacteria to be a valuable contribution of the paper.

Thank you for the kind words and appreciation of the manuscript.

Weaknesses

The authors argue that the purifying selection model would predict a gradual loss in fitness via Muller's ratchet. This is true if recombination is ignored, but this assumption is inconsistent with the data from Garud, et al. (2019) cited in the manuscript, who showed a significant linkage decrease in the bacteria also used in this study.

We now investigate the effect of recombination on the purifying selection model on page 6 and in Figure 2—figure supplement 3. In short, we show that reasonable levels of recombination (obtained from literature r/m values) cannot rescue the purifying selection model from Muller’s ratchet when s is so low and the influx of new deleterious mutations is so high. We thank the reviewers for prompting this improvement.

I also found that the data analysis part of the paper added little new to what was previously known. Most of the data comes directly from the Garud et al. study and the analysis is very similar as well. Even if other appropriate data may not currently be available, I feel that more could be done to test specific predictions of the model with more careful analysis.

In addition to new analyses regarding recombination and compensatory mutations using the Garud et al. data set, we have now added two new analyses, both using Bacteroides fragilis. First, we show that de novo mutations in Zhao & Lieberman et al. dataset include an enrichment of premature stop codons (page 7). Second we show that genes expected to be under fluctuating selection in B. fragilis displays a significant closeness to stop codons, consistent with recent stop codons and reversions. We thank the reviewer for prompting the improvement.

Finally, I found the description of the underlying assumptions of the model and the theoretical results difficult to understand. I could not, for example, relate the fitting parameters nloci and Tadapt to the simulations after reading the main text and the supplement. In addition, it was not clear to me if simulations involved actual hosts or how the changes in selection coefficients for different sites was implemented. Note that these are not simply issues of exposition since the specific implementation of the model could conceivably lead to different results. For example, if the environmental change is due to the colonization of a different host, it would presumably affect the selection coefficients at many sites at once and lead to clonal interference. Related to this point, it was also not clear that the weak mutation strong selection assumption is consistent with the microscopic parameters of the model. The authors also mention that "superspreading" may somehow make a difference to the probability of maintaining the least loaded class in the purifying selection model, but what they mean by this was not adequately explained.

We apologize for leaving the specifics of the implementation from the paper and only accessible through the Github page and have corrected this. We have added a new section in the methods further detailing the reversion model and the specifics of how nloci and Tadapt (now tau_switch as of the edits) are implemented in the code.

The possibility for clonal interference is indeed included in the simulation. Switching is not correlated with transmissions in our main figure simulations (Figure 4a). When we run simulations in which transmission and selection are correlated, the results remain essentially the same, barring higher variance at lower divergences Figure 4-figure supplement 1. We have now clarified these points in the results, and have also better clarified the selection only at transmission model in the main results.

I see three main issues that would significantly improve the manuscript if addressed. The first is the issue of recombination, which undermines using Muller's ratchet to motivate the proposed model. Including the effects of recombination in the model would be a significant project in itself so I do not expect the authors to do that, but I do believe it is important that the issue be seriously addressed. In any case, I think the fact that the values of s implied by the purifying selection model are so small is already problematic, for many of the reasons discussed in the paper, so I would suggest focusing more on that. It might be more useful perhaps to demonstrate in simulations how even rare adaptive sweeps would affect the purging of deleterious mutations with small s, which is mentioned in the manuscript but not developed.

As mentioned above, we have updated the manuscript to include a section and SI figure detailing the effect of recombination on the purifying selection model.

We demonstrate the problem of rare adaptive sweeps on the purifying model in Figure 2—figure supplement 4. We have now expanded this paragraph to further emphasize this point.

The second issue I found is the data analysis. It would be useful to check some of the hypotheses raised in the text directly from the data. For example, the authors mention that premature stop codons may preferentially be subject to reversions, which could be verified. Even if these analyses do not directly support the model, the authors can still comment on them and I believe it would significantly add to the paper.

As mentioned above, we have added two new analyses regarding B. fragilis, frequency of stop codons, and codon usage which support the reversion model.

Finally, I found the presentation of the model and simulations to be confusing and feel that it needs to be made significantly clearer for readers to be able to understand. In general, the supplement had considerable detail on standard results (Sec. 2.1 and 2.2 for example), but was very vague on the actual model itself (Sec. 3.1 and 3.2). If anything, this emphasis should be reversed. I've added some more specific comments below which might help with this issue. It should also be noted that the inferred values of s from any simple model will be effective parameters that may not be directly related to measurable parameters from experiments or observations. To give a very simple example, if selection coefficients are varying quickly in time, the relevant s for the accumulation of mutations would be the time average s, which may be considerably smaller than the typical value.

As mentioned above, we now have fixed this oversight and added much more detail about the model.

Reviewer #3 (Recommendations for the authors):

The diversity of bacterial species in the human gut microbiome is widely known, but the extensive diversity within each species is far less appreciated. Strains found in individuals on opposite sides of the globe can differ by as little as handfuls of mutations, while strains found in an individual's gut, or in the same household, might have a common ancestor tens of thousands of years ago. What are the evolutionary, ecological, and transmission dynamics that established and maintain this diversity?

The time, T, since the common ancestor of two strains, can be directly inferred by comparing their core genomes and finding the fraction of synonymous (non–amino acid changing) sites at which they differ: dS. With the per–site per–generation mutation rate, μ, and the mean generation times roughly known, this directly yields T (albeit with substantial uncertainty of the generation time.) A traditional way to probe the extent to which selection plays a role is to study pairs of strains and compare the fraction of non–synonymous (amino acid or stop–codon changing) sites, dN, at which the strains differ with their dS. Small dN/dS, as found between distantly related strains, is attributed to purifying selection against deleterious mutations dominating over mutations that have driven adaptive evolution. Large dN/dS as found in laboratory evolution experiments, is caused by beneficial mutations that quickly arise in large bacterial populations, and, with substantial selective advantages, per generation, can rise to high abundance fast enough that very few synonymous mutations arise in the lineages that take over the population.

A number of studies (including by Lieberman's group) have analyzed large numbers of strains of various dominant human gut species and studied how dN/dS varies. Although between closely related strains the variations are large – often much larger than attributable to just statistical variations – a systematic trend from dN/dS around unity or larger for close relatives to dN/dS ~ 0.1 for more distant relatives has been found in enough species that it is natural to conjecture a general explanation.

The conventional explanation is that, for close relatives, the effects of selection over the time since they diverged has not yet purged weakly deleterious mutations that arose by chance – roughly mutations with sT<1 – while since the common ancestor of more distantly related strains, there is plenty of time for most of those that arose to have been purged.

Torrillo and Lieberman have carried out an in–depth – sophisticated and quantitative – analysis of models of some of the evolutionary processes that shape the dependence of dN/dS on dS – and hence on their divergence time, T. They first review the purifying selection model and show that – even ignoring its inability to explain dN/dS > 1 for many closely related pairs – the model has major problems explaining the crossover from dN/dS somewhat less than unity to much smaller values as dS goes through – on a logarithmic scale – the 10^–4 range. The first problem, already seen in the infinite–population–size deterministic model, is that a very large fraction of non–synonymous mutations would have to have deleterious s's in the 10^–5 per generation range to fit the data (and a small fraction effectively neutral). As the s's are naturally expected (at least in the absence of quantitative analysis to the contrary) to be spread out over a wide range on a logarithmic scale of s, this seems implausible. But the authors go further and analyze the effects of fluctuations that occur even in the very large populations: ~ >10^12 bacteria per species in one gut, and 10^10 human guts globally. They show that Muller's ratchet – the gradual accumulation of weakly deleterious mutations that are not purged by selection – leads to a mutational meltdown with the parameters needed to fit the purifying selection model. In particular, with N_e the "effective population size" that roughly parametrizes the magnitude of stochastic birth–death and transition fluctuations, and U the total mutation rate to such deleterious mutations this occurs for U/s > log(sN_e) which they show would obtain with the fitted parameters.

Torrillo and Lieberman promise an alternate model: that there are a modest number of "loci" at which conditionally beneficial mutations can occur that are beneficial in some individual guts (or other environmental conditions) at some times, but deleterious in other (or the same) gut at other times. With the ancestors of a pair of strains having passed through one too many individuals and transmissions, it is possible for a beneficial mutation to occur and rise in the population, only later to be reverted by the beneficial inverse mutation. With tens of loci at which this can occur, they show that this process could explain the drop of dN/dS from short times – in which very few such mutations have occurred – to very long times by which most have flipped back and forth so that a random pair of strains will have the same nucleotide at such sites with 50% probability. Their qualitative analysis of a minimally simple model of this process shows that the bacterial populations are plenty big enough for such specific mutations to occur many times in each individual's gut, and with modest beneficials, to takeover. With a few of these conditionally beneficial mutations or reversions occurring during an individuals lifetime, they get a reasonably quantitative agreement with the dN/dS vs dS data with very few parameters. A key assumption of their model is that genetically exact reversion mutations are far more likely to takeover a gut population – and spread – than compensatory mutations which have a similar phenotypic–reversion effect: a mutation that is reverted does not show up in dN, while one that is compensated by another shows up as a two–mutation difference after the environment has changed twice.

Strengths:

The quantitative arguments made against the conventional purifying selection model are highly compelling, especially the consideration of multiple aspects that are usually ignored, including – crucially – how Muller's ratchet arises and depends on the realistic and needed–to–fit parameters; the effects of bottlenecks in transmission and the possibility that purifying selection mainly occurs then; and complications of the model of a single deleterious s, to include a distribution of selective disadvantages. Generally, the author's approach of focusing on the simplest models with as few as possible parameters (some roughly known), and then adding in various effects one–by–one, is outstanding and, in being used to analyze environmental microbial data, exceptional.

The reversion model the authors propose and study is a simple general one and they again explore carefully various aspects of it – including dynamics within and between hosts – and the consequent qualitative and quantitative effects. Again, the quantitive analysis of almost all aspects is exemplary. Although it is hard to make a compelling guess of the number of loci that are subject to alternating selection on the needed time–scales (years to centuries) they make a reasonable argument for a lower bound in terms of the number of known invertible promoters (that can genetically switch gene expression on and off).

We are very grateful for the reviewer’s kind words and careful reading.

Weaknesses:

The primary weakness of this paper is one that the author's are completely open about: the assumption that, collectively, any of possibly–many compensatory mutations that could phenotypically revert an earlier mutation, are less likely to arise and takeover local populations than the exact specific reversion mutation. While detailed analysis of this is, reasonably enough, beyond the scope of the present paper, more discussion of this issue would add substantially to this work. Quantitatively, the problem is that even a modest number of compensatory mutations occurring as the environmental pressures change could lead to enough accumulation of non–synonymous mutations that they could cause dN/dS to stay large – easily >1 – to much larger dS than is observed. If, say, the appropriate locus is a gene, the number of combinations of mutations that are better in each environment would play a role in how large dN would saturate to in the steady state (1/2 of n_loci in the author's model). It is possible that clonal interference between compensatory and reversion mutations would result in the mutations with the largest s – eg, as mentioned, reversion of a stop codon – being much more likely to take over, and this could limit the typical number of differences between quite well–diverged strains. However, the reversion and subsequent re–reversion would have to both beat out other possible compensatory mutations – naively less likely. I recommend that a few sentences in the Discussion be added on this important issue along with comments on the more general puzzle – at least to this reader! – as to why there appear to be so little adaptive genetic changes in core genomes on time scales of human lifetimes and civilization.

We now directly consider compensatory mutations (page 9, SI text 3.2, and Figure 4-figure supplement 3). We show that as long as true reversions are more likely than compensatory mutations overall, (adaptive) nonsynonymous mutations will still tend to revert towards their initial state and not contribute to asymptotic dN/dS, and show that true reversions are expected in a large swath of parameter space. Thank you for motivating this improvement!

We note in the discussion that directional selection could be incorporated into the parameter α (assuming even more of the genome is deleterious) on page 11.

An important feature of gut bacterial evolution that is now being intensely studied is only mentioned in passing at the end of this paper: horizontal transfer and recombination of core genetic material. As this tends to bring in many more mutations overall than occur in regions of a pair of genomes with asexual ancestry, the effects cannot be neglected. To what extent can this give rise to a similar dependence of dN/dS on dS as seen in the data? Of course, such a picture begs the question as to what sets the low dN/dS of segments that are recombined –– often from genetic distances comparable to the diameter of the species.

We now discuss the effect of recombination on the purifying selection model on page 6 and in Figure 2—figure supplement 3. In short, we now show that reasonable levels of recombination cannot rescue the purifying selection model from Muller’s ratchet when s is so low and the influx of new deleterious mutations is so high. We thank the reviewers for prompting this improvement

The main substantive criticisms are summarized in "Weaknesses" section. Here are a few more minor points, and then presentational and pedagogical comments.

1) The analysis of the effects of a distribution of values of s is not convincing. A simple argument that for each s there is a crossover dS ~ mu/s, and approximating the corresponding portion of dN/dS vs dS as a step function at that crossover, gives a better approximation that using the harmonic mean: the rough distribution of s needed to fit, can then be eyeballed from the data – roughly by derivative d[dN/dS]/dS. (This point is roughly made – although hard to parse and confounded by uninformative details – in the Supplement).

We agree that an argument involving a step function is more straightforward. We have changed the analysis to now use this logic for explaining the effects of a distribution of s and have streamlined this section more generally. Thank you for the improvement.

2) For Muller's ratchet and deleterious loading: The time dependence computed from the deterministic analysis is rather misleading as it assumes – unreasonably – that the common ancestor had no deleterious mutations. The main point is the small fraction of the least–loaded subpopulation, W(t=infinity), being very small. The comparison of this with the appropriate N_e – whether N_e s exp(–U/s) is small or large – determining whether Muller's ratchet operates is the important point which needs emphasizing more: and this should be in the main text. [Note that for Muller's ratchet, using the harmonic mean s is roughly correct.] I would replace existing Eq(5) with an equation with this comparison in it (now just loosely in words around L 163) and refer to Ref(29) for analyses. Explain that if N is large enough or U small enough, reversions of the deleterious mutations will prevent Muller's ratchet.

Additionally, make clear that the model assumes additive effects of the deleterious mutations, and note that the average burden is small – and thus additive probably reasonable – even when large enough that Muller's ratchet is effective.

We replaced equation 5 with S17 which puts into context the size needed for Muller’s Ratchet to be prevented and suggested ref(29) (now ref (40)) for further analysis. We also noted the assumption of additive mutation effects. Thank you for the improvement.

3) Some more discussion of "effective population size" is needed when introduced, in addition to that in the Discussion. This is especially true here, since – as noted in Discussion – it is often taken from the dS of a species via N_e = /mu: simply the overall T_MRCA in units of generations. The way the present paper uses N_e, is how it appears in modeling and thus controls several actual biological quantities: how much mutations can change abundance by drift, and how often beneficial mutations occur. This is thus a "real" effective population size. It is important to note, however, that with clonal interference (both in individuals and across human populations) different "N_e"'s are involved for different processes. A detailed discussion is not needed, but a couple of sentences (and a reference) in the main text is – around L165 – and a note in caption of Figure 2.

We now more thoroughly discuss effective population size in the main text when introduced (page 4-5) and refer to it in the caption of Figure 2. Thank you for the improvement.

4) What is being assumed in various places (eg Figure 2) for the generation time is unclear.

We assume about one generation a day. We added this in the text and apologize for the confusion.

5) The heuristic explanation for the time dependence of the reversion model is missing. T_adapt is rather confusing. A basic quantity – call it eg tau_change – is how often the conditions for a particular transient mutation change from beneficial to deleterious. This is the time that corresponds to the dS at which the crossover in the dN/dS data occurs – and indeed what is in the equations. T_adapt = tau_change/n_loci is the typical time between when any of these mutations switch. This determines dN. The distinction needs explaining.

We agree that using tau_change (which we call Tau_flip) over T_adapt provides a more intuitive understanding of the time dependence and have changed text and equations accordingly. We thank the reviewer for prompting this improvement.

6) The meaning of "loci" in counting the number of possible reverting mutations is unclear. Presumably what is meant is that at each locus there may be a number of roughly equivalent mutations – with each changing the phenotype so the others would no longer be beneficial – but to revert the phenotype needs to revert the specific one that occurred. This needs to be made clearer. Also, note that this would change the numbers somewhat as the beneficial mutation rate would be higher, but it has shown that analysis is not sensitive to that factor because of the large populations. Generally, a clearer statement that the beneficial s is the important quantity setting the time scales – log(Ns)/s – rather than the time for mutation to arise, is needed in the main text.

Yes, this is the correct interpretation. We have clarified this in the text and figure legend. In the simulations, the beneficial forward mutation rate is higher than the reversion rate to take this into account, and this has been made clear in the new methods section detailing how the reversion mutations function.

We have also adjusted the text and Figure 3b to emphasize the importance of s vs. mutation rate in setting time to reversion. Thanks for prompting the improvement.

7) Effects of recombination: It would be good to add some more discussion on whether the effects of recombination alone (with an assumption about the effects of purifying selection on long time scales) could fit the data – as pointed out in "Weaknesses" section.

We now discuss the effect of recombination on the purifying selection model on page 8 and in Figure 2—figure supplement 3. In short, we now show that reasonable levels of recombination cannot rescue the purifying selection model from Muller’s ratchet when s is so low and the influx of new deleterious mutations is so high.

Is reversion by recombination likely to have a substantial role in dN/dS? I do not find the suggestion that multiple mutations could be simultaneously reverted by recombination at all believable. The further away the segment is, the more implausible is the scenario that compensatory mutations – let all other beneficial mutations with epistasis between them – have not occurred.

We have now clarified this part of the discussion. Recombination would be helpful in the reversion model if multiple mutations had occurred in the same gene over longer periods of time, and the gene being completely replaced with the ancestral version would be beneficial upon environmental change. This is particularly likely if epistasis enables only 2+ step mutants to be beneficial upon environmental change. We hope the revised version is more convincing.

8) Generally, for each of the models and extensions used, make clear the number and set of parameters, and the assumptions. (Eg Figure 3)

We apologize for the confusion and have added more detail to the figure captions.

Presentation:

Although the overall scenario, points made, and many of the explanations are quite clear, aspects of the presentation, figures, and captions are unclear or sloppy and detract from the paper's readability and potential impact. More specific and clear statements of the assumptions and model features are needed. And which of the mathematics to include or not in the main text, the supplement, or not at all, could be improved.

Notation:

– Mutation rates: mu_S etc are more often used per site: Here they are used for genome–wide instead of the more conventional – and better – U_S etc

Done.

– Use some notation for s of beneficial mutants – eg s_adapt.

Done.

– T_adapt see above

Done.

– N is used both for population size and numbers. With the use of n_loci, might be better to use n(t)'s – perhaps best with an overbar – for the average number of mutations that accumulate with time.

We added in an overbar (thank you for the suggestion!), but we continue to use capitalize N for nonsynonymous mutations for the sake of matching to dN.

In many figures, notations such as "1.8e+01" or, in one, "quadrillion" are used: Use standard notation and, eg 10^1 mark instead of 1.8 times this.

Done.

Figure 1:

It is not clear what is meant by "adaptive genes" in the Zhao et al. data shown.

Clarification has been added. We apologize for the confusion.

In the caption, say that crossover occurs at roughly dS ~ mu/s : this makes the role of s clear. In general, this kind of heuristic note is useful – independently of whether or not a reader is mathematically inclined!

We have made the changes. Thank you for the suggestion.

L98: Make clear N_transient etc are averages. Spell out that for dS << 1, dS ~ 2 U_S T_MRCA, and this regime obtains for all the data.

We have made the changes. Thank you for the suggestion.

L127: Spell out mu_S (or better U_S) in terms of the number of sites and generation time.

For fitting the data, since dS=#S / number of S sites, only the mutation rate per base pair matters. We now further discuss the molecular clock on page 3 to provide better clarity. U_N does depend on the number of sites and we have now made this more explicit on page 4.

L150: the minus sign in the exponent is not visible (similarly in some other equations)

We are unsure how to remedy this as they do appear on our version of the document.

L187–194: It is unclear what is meant by "superspreaders" and what the 30,000X factor means. Is it that strains from one individual would have to spread to 30,000 others before reversions etc? The argument against superspreaders could be expanded beyond "seems unlikely".

We apologize for the confusion and have added more clarification. We now only discuss superspreaders more directly in the Results section relating to the purifying selection model for clarity.

L204–213: Equation (6) and surrounding text on general process, is not needed and confusing. Instead, the left side and last equality in Eq S23 should be pulled into the main text. Then can go directly to Eq(7) with an explanation that the n_loci/2 factor is simply the average number of differences between distantly related bacteria.

We agree and have made the changes accordingly and left some of the notes on the more general process to the SI. Thank you for the improvement.

L259: Is the fitted T_adapt in generations? Also, give in years. And don't pretend to precision as "840", and later "110 years": give ranges.

We have clarified and given ranges. Thank you for the suggestion.

Figure 4: (a): note in the caption that actual dN/dS > 1 and why.

Done.

Specify n_loci used.

Done.

(b) "discounting beneficial mutations" is unclear: are or aren't the reversions included in this statement?

Reworded and clarified.

L287: Explain what PAML is/assumes.

Done.

L321: Note that in the simulations with genetic classes, the dN, etc between two individuals cannot be directly kept track of. Correct?

Correct, this is now noted.

L348: Meaning of "negative feedback" is unclear.

Reworded.

L353: As per main comments: more is needed on how tiny the rate of compensatory or other beneficial mutations would have to be to not raise α hugely.

See new section on compensatory mutations.

L365: Explain "pseudogenized".

Changed word to nonfunctional.

Methods and Theory:

Generally, much more than needed deriving standard results.

Wright–Fisher simulations: The notation here is very bad, with nearby letters being used for very different things – eg why not t instead of i which is in any case elsewhere used to mean something different? – and j_k used as it is but with j being an index. And generally, it is overly long and unclear.

We have reworked the indices to make it more clear. Thank you for the suggestion.

L 673: what is RMSD?

Root mean square deviation. This has been added in the text.

L678: Explain PAML

Clarification has been added.

dN/dS Theory: Much too much detail as most very standard Eg: certainly don't need all the derivation leading up to L742 equation.

Yes, the theory is purposely very highly detailed. The intent is to make it available to a wide audience of readers including those who may not know mathematics beyond basic calculus, as the majority of the theory doesn’t use any very advanced techniques. We hope that this helps interest more readers in doing theory. We now clarify this intent.

L777: Make clear that α used in the main text is that with 0 subscript here.

Done.

L791–815 is confusing, not needed, and probably not valid. The point made in L817 is all that is needed – see comment (1) above.

We do agree it is probably not needed and now make the point in a quicker more straightforward way that depends upon the step function approximation as suggested.

L860: Say that sNW factor is mean fitness.

Done.

L895 is a classic result and not all derivation is needed.

Yes, this was included in an attempt to make the work self contained for readers without a background in population genetics. We hope that this is useful to readers. We now add the word classic result to make sure the historical context of this is understood.

L912: "has 2s probability of extinction" is unclear what it means: I presume something about the rate of ratchet. L915 equations, as noted in (2) should be in the main text.

Yes, clarified. The equation has been moved.

Extinction and Fixation Probability:

This is all standard and not needed beyond quoting results.

Yes, this was included in an attempt to make the work self contained for readers without a background in population genetics.

L980: "Mutation is accessible if… " is very misleading. If L982 is satisfied, then there will be clonal interference between multiple mutations, and much of the analysis changes. But whether satisfied or not, the mutations can arise and fix: what changes is which will dominate the time to fixation as in L989

Changed language used, thank you.

L1005: As noted, should pull part of this into the main text.

Done.

Implied Amount of Superspreading: extremely unclear. (and S26 is not even an equation)

We decided to remove this part of the SI as it was not adding much and clarify more in the main text.

SI Figures: There are probably more than needed and captions are often insufficient. Would be better to have a few paragraphs of text explaining them collectively, and the figures interspersed in the text.

Most of the SI figures are more for checks of robustness than major points about theory and are more meant to be looked at in context of the main text. Thus, we kept them as are though we expanded upon the captions to add more detail.

Figure S4: What is "theory" curve?

Changed label to infinite population in top panel. Removed theory line in the second panel upon review as it is confusing since the asymptote occurs because of mutation accumulation which is not modeled by theory.

Figure S5: Very unclear what was done or what modeled. Why is every 100,000 generations underestimated of bottleneck? What does "beneficial mutations are released" mean.

We apologize for the confusion. This is just running the simulation for the reversion model, which assumes a population transmitting through bottlenecks and adaptations under the assumptions of the purifying selection model (ie. so remove the reversions part, have less frequent forward mutations, and make deleterious mutations have a smaller deleterious effect). We have added more to the figure captions to explain.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Enrichment of stop adjacent codon usage in specific gene categories of Bacteroides fragilis NCTC_9343, related to Figure 4c.

    Results display TTA/TCA enrichment for statistically significant categories of genes in B. fragilis NCTC_9343. Loci from these categories and mutated in Zhao et al., 2019 (Table S7) are noted in column J.

    elife-93146-supp1.xlsx (11.9KB, xlsx)
    Supplementary file 2. Genes assigned to COG categories in B. fragilis NCTC_9343 and their enrichment of stop-adjacent codons, related to Figure 4c.

    List of genes assigned COG categories (via eggNOG) to be used to evaluate for closeness to stop codons. Cellular location is also given (if predicted) by PSORTb. Annotations are from Bakta.

    elife-93146-supp2.xlsx (213.9KB, xlsx)
    MDAR checklist

    Data Availability Statement

    Code and results of simulations are available at Github repository https://github.com/PaulTorrillo/Microbiome_Reversions (copy archived at Torrillo, 2023).


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES