Abstract
Deep-sea hydrothermal vents resemble the early Earth, and thus the dominant Thermococcaceae inhabitants, which occupy an evolutionarily basal position of the archaeal tree and take an obligate anaerobic hyperthermophilic free-living lifestyle, are likely excellent models to study the evolution of early life. Here, we determined that unbiased mutation rate of a representative species, Thermococcus eurythermalis, exceeded that of all known free-living prokaryotes by 1-2 orders of magnitude, and thus rejected the long-standing hypothesis that low mutation rates were selectively favored in hyperthermophiles. We further sequenced multiple and diverse isolates of this species and calculated that T. eurythermalis has a lower effective population size than other free-living prokaryotes by 1-2 orders of magnitude. These data collectively indicate that the high mutation rate of this species is not selectively favored but instead driven by random genetic drift. The availability of these unusual data also helps explore mechanisms underlying microbial genome size evolution. We showed that genome size is negatively correlated with mutation rate and positively correlated with effective population size across 30 bacterial and archaeal lineages, suggesting that increased mutation rate and random genetic drift are likely two important mechanisms driving microbial genome reduction. Future determinations of the unbiased mutation rate of more representative lineages with highly reduced genomes such as Prochlorococcus and Pelagibacterales that dominate marine microbial communities are essential to test these hypotheses.
Subject terms: Archaea, Population genetics
One theory for the origin of life is that the last universal common ancestor was an anaerobic hyperthermophilic organism inhabiting the deep-sea hydrothermal vents, as these environments display a few characteristics paralleling the early Earth [1]. While hydrothermal vents vary with chemical parameters, they all share a high temperature zone near the black chimney with anaerobic fluid from it. In the past decades, great efforts were made to understand the metabolic strategies deep-sea hyperthermophiles use to conserve energy and cope with physicochemical stresses, and to appreciate the molecular mechanisms leading to the stabilization of nucleic acids and proteins at exceedingly high temperatures [2, 3]. However, little is known whether they have a low or high intrinsic (i.e., not selected by environmental pressure) rate to change their genetic background information and whether this intrinsic potential itself is a result of selection shaped by these unique habitats.
A previous population genomic analysis showed that protein sequences are under greater functional constraints in thermophiles than in mesophiles, suggesting that mutations are functionally more deleterious in thermophiles than in mesophiles [4]. This explanation is also supported by experimental assays showing nearly neutral mutations in temperate conditions become strongly deleterious at high temperature [5]. Furthermore, fluctuation tests on a hyperthermophilic archeaon Sulfolobus acidocaldarius [6] and a hyperthermophilic bacterium Thermus thermophilus [7] consistently showed that hyperthermophiles have much lower mutation rate compared to mesophiles. This appears to support the hypothesis that selection favors high replication fidelity at high temperature [5].
Nevertheless, mutation rates measured using fluctuation experiments based on reporter loci are known to be biased, since the mutation rate of the organism is extrapolated from a few specific nonsynonymous mutations enabling survival in an appropriate selective medium, which renders the results susceptible to uncertainties associated with the representativeness of these loci and to inaccuracies of the assumptions made in extrapolation methods [8–10]. These limitations are avoided by the mutation accumulation (MA) experiment followed by whole-genome sequencing (WGS) of derived lines. In the MA part, multiple independent MA lines initiated from a single progenitor cell each regularly pass through a single-cell bottleneck, usually by transferring on solid medium. As the effective population size (Ne) becomes one, selection is unable to eliminate all but the lethal mutations, rendering the MA/WGS an approximately unbiased method to measure the spontaneous mutation rate [11].
Members of the free-living anaerobic hyperthermophilic archaeal family Thermococcaceae are among the dominant microbial lineages in the black-smoker chimney at Guaymas Basin [12] and other deep-sea hydrothermal vents [13, 14]. This family only contains three genera: Thermococcus, Pyrococcus and Palaeococcus. In this study, the MA/WGS procedure was applied to determine the unbiased spontaneous mutation rate of a representative member Thermococcus eurythermalis A501, a conditional pizeophilic archaeon which can grow equally well from 0.1 MPa to 30 MPa at 85 °C [15, 16]. The MA lines were propagated at this optimal temperature on plates with gelrite which tolerates high temperature, and the experiment was performed under normal air pressure and in strictly anaerobic condition (Fig. 1A–D). To the best of our knowledge, this is the first report of unbiased mutation rate of a hyperthermophile and an obligate anaerobe.
Our MA experiment allowed accumulation of mutations over 314 cell divisions (after correcting the death rate (Table S1) [17]) in 100 independent lines initiated from a single founder colony and passed through a single cell bottleneck every day. By sequencing genomes of 96 survived lines at the end of the MA experiment, we identified 544 base-substitution mutations over these lines (Table S2), which translates to an average mutation rate (µ) of 85.01 × 10−10 per cell division per nucleotide site (see Supplementary information). The ratio of accumulated nonsynonymous to synonymous mutations (371 vs 107) did not differ from the ratio of nonsynonymous to synonymous sites (1,485,280 vs 403,070) in the A501 genome (χ2 test; p > 0.05). Likewise, there was no difference of the accumulated mutations between intergenic (65) and protein-coding sites (478) (χ2 test; p > 0.05). These are evidence for minimal selective elimination of deleterious mutations during the MA process. In general, the mutations were randomly distributed along the chromosome and the plasmid, though 86 base-substitution mutations fell into 14 genes which showed significant enrichment of mutations (bootstrap test; p < 0.05 for each gene) and 52 out of the 86 base-substitution mutations were found in five genes (TEU_RS04685 and TEU_RS08625-08640 gene cluster) (Fig. 1E, Table S3). A majority of mutations in these five genes may have inactivated these genes (38 out of 71 in the former gene and 33 out of 43 in the latter gene cluster) either by nonsense mutation or insertion-deletion (INDEL) mutation. The phenomenon of mutation clustering is not unique to this organism; it was reported in another MA study with the yeast Schizosaccharomyces pombe, and these genomic regions may represent either mutational hotspots or that mutations confer selective advantages under experimental conditions [18]. The TEU_RS04685 encodes the beta subunit of the sodium ion-translocating decarboxylase which is an auxiliary pathway for ATP synthesis by generating sodium motive force via decarboxylation [19], and the TEU_RS08625-08640 encodes a putative nucleoside ABC transporter. These genes appear to be important for energy conservation in the highly fluctuating deep-sea hydrothermal fluids. Under the culture conditions in which peptides and amino acids were stably and sufficiently supplied (see the TRM medium recipe in Supplementary information), however, these genes may be dispensable because peptides and amino acids are the preferred carbon and energy sources for T. eurythermalis [15]. On the other hand, some of these genes (e.g., TEU_RS08625) were shown to be upregulated under alkaline stress [16], and thus may be similarly induced under the culture condition in which pH is elevated compared to the vents. Besides, the laboratory condition differed from the vents in a number of other physicochemical features including hydrostatic pressure (0.1 MPa during the MA process versus 20 MPa in situ), temperature and salinity, which likely imposed additional selective pressures on the mutation accumulation processes. Taken together, deleting these genes were likely translated to a net fitness gain and were thus driven by selection. Removing these mutations led to a spontaneous mutation rate of 71.57 × 10−10 per cell division per site for T. eurythermalis A501. After removing the mutations in these 14 genes, both the accumulated mutations at nonsynonymous sites (288) relative to those (104) at synonymous sites (χ2 test; p = 0.014) and the accumulated mutations at intergenic regions (65) relative to protein-coding regions (392) (χ2 test; p = 0.013) showed marginally significant differences.
To date, over 20 phylogenetically diverse free-living bacterial species and two archaeal species isolated from various environments have been assayed with MA/WGS, and their mutation rates vary from 0.79 × 10−10 to 97.80 × 10−10 per cell division per site [20]. The only prokaryote that displays a mutation rate (97.80 × 10−10 per cell division per site) comparable to A501 is Mesoplasma florum L1 [21], a host-dependent wall-less bacterium with highly reduced genome (~700 genes). Our PCR validation of randomly chosen 20 base-substitution mutations from two MA lines displaying highest mutation rates and of all nine INDEL mutations involving >10 bp changes across all lines (Table S2) indicates that the calculated high mutation rate did not result from false bioinformatics predictions.
The extremely high mutation rate of T. eurythermalis is unexpected. One potential explanation in line with the “mutator theory” [22–24] is that high mutation rate may allow the organisms to gain beneficial mutations more rapidly and thus is selectively favored in deep-sea hydrothermal vents where physicochemical parameters are highly fluctuating. Alternatively, high mutation rate is the result of random genetic drift according to the “drift-barrier model” [21]. In this model, increased mutation rates are associated with increased load of deleterious mutations, so natural selection favors lower mutation rates. On the other hand, increased improvements of replication fidelity come at an increased cost of investments in DNA repair activities. Therefore, natural selection pushes the replication fidelity to a level that is set by genetic drift, and further improvements are expected to reduce the fitness advantages [11, 21]. These two explanations for the high mutation rate of T. eurythermalis are mutually exclusive, and resolving them requires the calculation of the power of genetic drift, which is inversely proportional to Ne.
A common way to calculate Ne for a prokaryotic population is derived from the equation πS = 2 × Ne × µ, where πS represents the nucleotide diversity at synonymous (silent) sites among randomly sampled members of a panmictic population [25]. We therefore sequenced genomes of another eight T. eurythermalis isolates available in our culture collections. Like T. eurythermalis A501, these additional isolates were collected from the same cruise but varying at the water depth from 1987 m to 2009 m at Guaymas Basin. They differ by only up to 0.135% in the 16S rRNA gene sequence and share a minimum whole-genome average nucleotide identity (ANI) of 95.39% (Table S4), and thus fall within an operationally defined prokaryotic species typically delineated at 95% ANI [26]. Population structure analysis with PopCOGenT [27] showed that these isolates formed a panmictic population and that two of them were repetitive as a result of clonal descent (see Supplementary information). Using the median value of πS = 0.083 across 1628 single-copy orthologous genes shared by the seven non-repetitive genomes, we calculated the Ne of T. eurythermalis to be 5.83 × 106.
Next, we collected the unbiased mutation rate of other prokaryotic species determined with the MA/WGS strategy from the literature [11, 28–30]. While the Ne data were also provided from those studies, the isolates used to calculate the Ne were identified based on their membership of either an operationally defined species (e.g., ANI at 95% cutoff) or a phenotypically characterized species (e.g., many pathogens), which often create a bias in calculating Ne [25]. We therefore again employed PopCOGenT to delineate panmictic populations from those datasets and re-calculated Ne accordingly. There was a significant negative linear relationship between µ and Ne on a logarithmic scale (dashed gray line in Fig. 2A [r2 = 0.83, slope = −0.85, s.e.m. = 0.09, p < 0.001]) according to a generalized linear model (GLM) regression. This relationship cannot be explained by shared ancestry, as confirmed by phylogenetic generalized least square (PGLS) regression analysis (solid blue line in Fig. 2A [r2 = 0.81, slope = −0.81, s.e.m. = 0.09, p < 0.001]). The nice fit of T. eurythermalis to the regression line validated the drift-barrier hypothesis. This is evidence that the high mutation rate of T. eurythermalis is driven by genetic drift rather than by natural selection.
As stated in the drift-barrier theory, high mutation rate is associated with a high load of deleterious mutations. In the absence of back mutations, recombination becomes an essential mechanism in eliminating deleterious mutations [31]. In support of this argument, the ClonalFrameML analysis [32] shows that members of the T. eurythermalis population recombine frequently, with a high ratio of the frequency of recombination to mutation (ρ/θ = 0.59) and a high ratio of the effect of recombination to mutation (r/m = 5.76). In fact, efficient DNA incorporation to Thermococcaceae genomes from external sources has been well documented experimentally [33, 34]. A second potentially important mechanism facilitating T. eurythermalis adaptation at high temperature is strong purifying selection at the protein sequence level, as protein sequences in thermophiles are generally subjected to stronger functional constraints compared to those in mesophiles [4, 35].
Our result of the exceptionally high mutation rate of a free-living archaeon is a significant addition to the available collection of the MA/WGS data (Table S5), in which prokaryotic organisms with very high mutation rate have only been known for a host-dependent bacterium (Mesoplasma florum L1) with unusual biology (e.g., cell wall lacking). The availability of these two deeply branching (one archaeal versus the other bacterial) organisms adopting opposite lifestyles (one free-living versus the other host-restricted; one hyperthermophilic versus the other mesophilic; one obligate anaerobic versus the other facultative anaerobe), along with other phylogenetically and ecologically diverse prokaryotic organisms displaying low and intermediate mutation rates, provides an opportunity to help illustrate mechanisms potentially driving genome size evolution across prokaryotes. We found a negative linear relationship (dashed gray line in Fig. 2B [r2 = 0.49, slope = −1.66, s.e.m. = 0.32, p < 0.001]) between genome size and base-substitution mutation rate, which is consistent with the hypothesis that increased mutation rate drives microbial genome reduction. We also showed a positive linear relationship (dashed gray line in Fig. 2C [r2 = 0.47, slope = 0.24, s.e.m. = 0.06, p < 0.001]) between genome size and Ne, which suggests that random genetic drift drives genome reduction across prokaryotes. These correlations remain robust when the data were analyzed as phylogenetically independent contrasts (blue solid lines in Fig. 2B [r2 = 0.47, slope = −1.75, s.e.m. = 0.34, p < 0.001] and in Fig. 2C [r2 = 0.45, slope = 0.25, s.e.m. = 0.06, p < 0.001]). Our results are consistent with recent studies which employed mathematical modeling and/or comparative sequence analyses to show random genetic drift [36] and increased mutation rate [37] driving genome reduction across diverse bacterial lineages including both free-living and host-dependent bacteria. One benefit of the present study is that it directly measures µ and Ne, as compared to those recent advances which relied on proxies for these metrics (e.g., using the ratio of nonsynonymous substitution rate to synonymous substitution rate to represent Ne) to infer mechanisms of genome reduction.
Despite this advantage, there are important caveats to our conclusions related to the mechanisms of genome reduction. The correlation analyses performed here are inspired by Lynch and colleagues’ work, who had great success explaining eukaryotic genome expansion with genetic drift [11, 38]. However, there are a few key differences of genomic features between prokaryotes and eukaryotes, which makes it more difficult to explain the correlation observed in prokaryotes. Importantly, genome sizes of eukaryotes can vary over several orders of magnitude, whereas those of free-living prokaryotes differ by only an order of magnitude [11], so there is much less variability to explain in prokaryotes. Moreover, eukaryotic genomes experience dramatic expansions of transposable elements which are often considered as genomic parasites, whereas prokaryotic genomes including those large ones are usually depleted with transposable elements and their genome size variations are largely driven by gene content [39]. Aside from these conceptual difficulties, the plots (Fig. 2B, C) are poorly populated with typical free-living species carrying small genomes such as the Prochlorococcus (mostly 1.6–8 Mb) and Pelagibacterales (1.3–1.5 Mb), which dominate the photosynthetic and heterotrophic microbial communities, respectively, in the ocean [40]. It has been generally postulated that bacterial species in these lineages have very large Ne [39–41], though there has been little direct evidence for it [42, 43]. If confirmed through the measurement of the unbiased mutation rate (µ) followed by the calculation of Ne based on µ, it might compromise the linear relationship between genome size and Ne observed here (Fig. 2C). It is also not necessarily appropriate to translate correlations to causal relationships. For example, the correlation between increased mutation rates and decreased genome sizes (Fig. 2B) does not necessarily mean that increased mutation rate drives genome reduction. This is because high mutation rates are observed in species with small Ne. Given that deletion bias is commonly found in prokaryotes [44, 45], genome reduction can be easily explained by increased fixation of deletional mutations in species with smaller Ne. High mutation rates in these species are simply the result of random genetic drift as explained by the drift-barrier theory, and they may have a limited role in driving genome reduction.
Whereas our analysis based on the available data did not support natural selection as a universal mechanism driving genome reduction across prokaryotes (Fig. 2B, C), it does not mean that selection has no role in genome reduction of a particular taxon. In the case of thermophiles, proponents for selection acting to reduce genomes explained that genome size, due to its positive correlation with cell volume, may be an indirect target of selection which strongly favors smaller cell volume [35]. The underlying principle is that high temperature requires cells to increase the lipid content and change the lipid composition of the cell membranes, which consumes a large part of the cellular energy, and thus lower cell volume is selectively favored at high temperature [35]. Our calculations of a relatively small Ne in T. eurythermalis does not necessarily contradict with this selective argument, given that the fitness gained by decreasing cell volume and thus reducing genome size is large enough to overcome the power of random genetic drift. On the other hand, our data strongly indicate that neutral forces dictate the genome evolution of T. eurythermalis, and they are not negligible with regard to its genome reduction process. The significantly more deletion over insertion events (t test; 95 versus 37 events with p < 0.001 and 48 versus 20 events with p < 0.05 before and after removing the 14 genes enriched in mutations, respectively) and the significantly more nucleotides involved in deletions over insertions (t test; 433 versus 138 bp with p < 0.05 and 386 versus 121 bp with p < 0.001 before and after removing the 14 genes enriched in mutations, respectively) suggest that the deletion bias, combined with increased chance fixation of deletion mutants due to low Ne, is a potentially important neutral mechanism giving rise to the small genomes of T. eurythermalis (2.12 Mbp).
The globally distributed deep-sea hydrothermal vents are microbe-driven ecosystems, with no known macroorganisms surviving at the vent fluids. Sample collections, microbial isolations, and laboratory propagations of mutation lines at hot and anoxic conditions are challenging. In the present study, we determined that T. eurythermalis, and perhaps Thermococcaceae in general, has a highly increased mutation rate and a highly decreased effective population size compared to all other known free-living prokaryotic lineages. While it remains to be tested whether this is a common feature among the vents’ microbes, the present study nevertheless opens a new avenue for investigating the hyperthemophile ecology and evolution in the deep sea.
Supplementary information
Acknowledgements
This research is supported by the National Key R&D Program of China (2018YFC0309800), National Nature of Science China (NSFC 41530967), China Ocean Mineral Resources R & D Association DY125-22-04, the Hong Kong Branch of Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou) (SMSEGL20SC02), and the Hong Kong Research Grants Council Area of Excellence Scheme (AoE/M-403/16).
Author contributions
HL conceptualized the work and strategy, directed the bioinformatics analyses, interpreted the data, and wrote the main manuscript. XX set up the experimental platform for deep-sea hyperthermophile studies, directed the experimental analyses and related writing, co-interpreted the data, provided comments to the main manuscript, and acquired the strains. JG performed all the experiments with contributions from XM, drafted the related methods in supplementary information and Fig. 1. XW performed all the bioinformatics analyses, co-interpreted the related results, drafted the related methods in supplementary information, Fig. 2 and all supplemental tables. YS contributed the bioinformatics tools for mutation detection and mutation rate calculation.
Data availability
All the datasets generated, analyzed, and presented in the current study are available in the Supplementary Information. Genomic sequences of the eight Thermococcus eurythermalis strains are available at the JGI IMG under the GOLD study id Gs0142375. Raw reads of the eight strains are available at the NCBI SRA under the accession number PRJNA679699.
Code availability
The custom scripts used in this study have been deposited in the online repository (https://github.com/luolab-cuhk/Thermococcus-mut-genome-size).
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Jiahao Gu, Xiaojun Wang
Contributor Information
Xiang Xiao, Email: zjxiao2018@sjtu.edu.cn.
Haiwei Luo, Email: hluo2006@gmail.com.
Supplementary information
The online version of this article (10.1038/s41396-020-00888-5) contains supplementary material, which is available to authorized users.
References
- 1.Martin W, Baross J, Kelley D, Russell MJ. Hydrothermal vents and the origin of life. Nat Rev Microbiol. 2008;6:805–14. doi: 10.1038/nrmicro1991. [DOI] [PubMed] [Google Scholar]
- 2.Dick GJ. The microbiomes of deep-sea hydrothermal vents: distributed globally, shaped locally. Nat Rev Microbiol. 2019;17:271–83. doi: 10.1038/s41579-019-0160-2. [DOI] [PubMed] [Google Scholar]
- 3.Zierenberg RA, Adams MWW, Arp AJ. Life in extreme environments: hydrothermal vents. Proc Natl Acad Sci USA. 2000;97:12961–2. doi: 10.1073/pnas.210395997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Friedman R, Drake JW, Hughes AL. Genome-wide patterns of nucleotide substitution reveal stringent functional constraints on the protein sequences of thermophiles. Genetics. 2004;167:1507–12. doi: 10.1534/genetics.104.026344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Drake JW. Avoiding dangerous missense: thermophiles display especially low mutation rates. PLoS Genet. 2009;5:e1000520. doi: 10.1371/journal.pgen.1000520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Grogan DW, Carver GT, Drake JW. Genetic fidelity under harsh conditions: analysis of spontaneous mutation in the thermoacidophilic archaeon Sulfolobus acidocaldarius. Proc Natl Acad Sci USA. 2001;98:7928–33. doi: 10.1073/pnas.141113098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mackwan RR, Carver GT, Kissling GE, Drake JW, Grogan DW. The rate and character of spontaneous mutation in Thermus thermophilus. Genetics. 2008;180:17–25. doi: 10.1534/genetics.108.089086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Long H, Miller SF, Strauss C, Zhao C, Cheng L, Ye Z. Antibiotic treatment enhances the genome-wide mutation rate of target cells. Proc Natl Acad Sci USA. 2016;113:E2498–505. doi: 10.1073/pnas.1601208113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Williams AB. Spontaneous mutation rates come into focus in Escherichia coli. DNA Repair. 2014;24:73–79. doi: 10.1016/j.dnarep.2014.09.009. [DOI] [PubMed] [Google Scholar]
- 10.Lee H, Popodi E, Tang H, Foster PL. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc Natl Acad Sci USA. 2012;109:E2774–E2783. doi: 10.1073/pnas.1210309109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lynch M, Ackerman MS, Gout J-F, Long H, Sung W, Thomas WK, et al. Genetic drift, selection and the evolution of the mutation rate. Nat Rev Genet. 2016;17:704–14. doi: 10.1038/nrg.2016.104. [DOI] [PubMed] [Google Scholar]
- 12.He Y, Xiao X, Wang F. Metagenome reveals potential microbial degradation of hydrocarbon coupled with sulfate reduction in an oil-immersed chimney from Guaymas Basin. Front Microbiol. 2013;4:148. doi: 10.3389/fmicb.2013.00148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Roussel EG, Bonavita M-AC, Querellou J, Cragg BA, Webster G, Prieur D, et al. Extending the sub-sea-floor biosphere. Science. 2008;320:1046. doi: 10.1126/science.1154545. [DOI] [PubMed] [Google Scholar]
- 14.Takai K, Nakamura K. Archaeal diversity and community development in deep-sea hydrothermal vents. Curr Opin Microbiol. 2011;14:282–91. doi: 10.1016/j.mib.2011.04.013. [DOI] [PubMed] [Google Scholar]
- 15.Zhao W, Zeng X, Xiao X. Thermococcus eurythermalis sp. nov., a conditional piezophilic, hyperthermophilic archaeon with a wide temperature range for growth, isolated from an oil-immersed chimney in the Guaymas Basin. Int J Syst Evol Microbiol. 2015;65:30–35. doi: 10.1099/ijs.0.067942-0. [DOI] [PubMed] [Google Scholar]
- 16.Zhao W, Liu X, Jian H, Zhang Y, Xiao X. Cross-stress adaptation in a piezophilic and hyperthermophilic archaeon from deep sea hydrothermal vent. Front Microbiol. 2020;11:2081. doi: 10.3389/fmicb.2020.02081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Frenoy A, Bonhoeffer S. Death and population dynamics affect mutation rate estimates and evolvability under stress in bacteria. PLoS Biol. 2018;16:e2005056. doi: 10.1371/journal.pbio.2005056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Farlow A, Long H, Arnoux S, Sung W, Doak TG, Nordborg M, et al. The spontaneous mutation rate in the fission yeast Schizosaccharomyces pombe. Genetics. 2015;201:737–44. doi: 10.1534/genetics.115.177329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Buckel W. Sodium ion-translocating decarboxylases. Biochim Biophys Acta Bioenerg. 2001;1505:15–27. doi: 10.1016/s0005-2728(00)00273-5. [DOI] [PubMed] [Google Scholar]
- 20.Long H, Sung W, Kucukyildirim S, Williams E, Miller SF, Guo W, et al. Evolutionary determinants of genome-wide nucleotide composition. Nat Ecol Evol. 2018;2:237–40. doi: 10.1038/s41559-017-0425-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Sung W, Ackerman MS, Miller SF, Doak TG, Lynch M. Drift-barrier hypothesis and mutation-rate evolution. Proc Natl Acad Sci USA. 2012;109:18488–92. doi: 10.1073/pnas.1216223109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tenaillon O, Toupance B, Nagard HL, Taddei F. Godelle B. Mutators, population size, adaptive landscape and the adaptation of asexual populations of bacteria. Genetics. 1999;152:485–93. doi: 10.1093/genetics/152.2.485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Giraud A, Radman M, Matic I, Taddei F. The rise and fall of mutator bacteria. Curr Opin Microbiol. 2001;4:582–5. doi: 10.1016/s1369-5274(00)00254-x. [DOI] [PubMed] [Google Scholar]
- 24.Taddei F, Radman M, Maynard-Smith J, Toupance B, Gouyon PH, Godelle B. Role of mutator alleles in adaptive evolution. Nature. 1997;387:700–2. doi: 10.1038/42696. [DOI] [PubMed] [Google Scholar]
- 25.Rocha EPC. Neutral theory, microbial practice: challenges in bacterial population genetics. Mol Biol Evol. 2018;35:1338–47. doi: 10.1093/molbev/msy078. [DOI] [PubMed] [Google Scholar]
- 26.Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S, High-throughput ANI. analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun. 2018;9:5114. doi: 10.1038/s41467-018-07641-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Arevalo P, VanInsberghe D, Elsherbini J, Gore J, Polz MF. A reverse ecology approach based on a biological definition of microbial populations. Cell. 2019;178:820–34. doi: 10.1016/j.cell.2019.06.033. [DOI] [PubMed] [Google Scholar]
- 28.Sun Y, Powell KE, Sung W, Lynch M, Moran MA, Luo H. Spontaneous mutations of a model heterotrophic marine bacterium. ISME J. 2017;11:1713–8. doi: 10.1038/ismej.2017.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Senra MVX, Sung W, Ackerman M, Miller SF, Lynch M, Soares CAG. An unbiased genome-wide view of the mutation rate and spectrum of the endosymbiotic bacterium Teredinibacter turnerae. Genome Biol Evol. 2018;10:723–30. doi: 10.1093/gbe/evy027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dillon MM, Sung W, Sebra R, Lynch M, Cooper VS. Genome-wide biases in the rate and molecular spectrum of spontaneous mutations in Vibrio cholerae and Vibrio fischeri. Mol Biol Evol. 2017;34:93–109. doi: 10.1093/molbev/msw224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hughes AL. Near neutrality: leading edge of the neutral theory of molecular evolution. Ann N. Y Acad Sci. 2008;1133:162–79. doi: 10.1196/annals.1438.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Didelot X, Wilson DJ. ClonalFrameML: efficient inference of recombination in whole bacterial genomes. PLoS Comput Biol. 2015;11:e1004041. doi: 10.1371/journal.pcbi.1004041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Song Q, Li Z, Chen R, Ma X, Xiao X, Xu J. Induction of a toxin-antitoxin gene cassette under high hydrostatic pressure enables markerless gene disruption in the hyperthermophilic archaeon Pyrococcus yayanosii. Appl Environ Microbiol. 2019;85:e02662–18. doi: 10.1128/AEM.02662-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sato T, Fukui T, Atomi H, Imanaka T. Improved and versatile transformation system allowing multiple genetic manipulations of the hyperthermophilic archaeon Thermococcus kodakaraensis. Appl Environ Microbiol. 2005;71:3889–99. doi: 10.1128/AEM.71.7.3889-3899.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sabath N, Ferrada E, Barve A, Wagner A. Growth temperature and genome size in bacteria are negatively correlated, suggesting genomic streamlining during thermal adaptation. Genome Biol Evol. 2013;5:966–77. doi: 10.1093/gbe/evt050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sela I, Wolf YI, Koonin EV. Theory of prokaryotic genome evolution. Proc Natl Acad Sci USA. 2016;113:11399–407. doi: 10.1073/pnas.1614083113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Bourguignon T, Kinjo Y, Villa-Martín P, Coleman NV, Tang Q, Arab DA, et al. Increased mutation rate is linked to genome reduction in prokaryotes. Curr Biol. 2020;30:3848–.e4. doi: 10.1016/j.cub.2020.07.034. [DOI] [PubMed] [Google Scholar]
- 38.Lynch M. Evolution of the mutation rate. Trends Genet. 2010;26:345–52. doi: 10.1016/j.tig.2010.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Marais GAB, Batut B, Daubin V. Genome evolution: mutation is the main driver of genome size in prokaryotes. Curr Biol. 2020;30:R1083–5. doi: 10.1016/j.cub.2020.07.093. [DOI] [PubMed] [Google Scholar]
- 40.Giovannoni SJ, Cameron Thrash J, Temperton B. Implications of streamlining theory for microbial ecology. ISME J. 2014;8:1553–65. doi: 10.1038/ismej.2014.60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kashtan N, Roggensack SE, Rodrigue S, Thompson JW, Biller SJ, Coe A, et al. Single-cell genomics reveals hundreds of coexisting subpopulations in wild. Prochlorococcus Sci. 2014;344:416–20. doi: 10.1126/science.1248575. [DOI] [PubMed] [Google Scholar]
- 42.Luo H, Huang Y, Stepanauskas R, Tang J. Excess of non-conservative amino acid changes in marine bacterioplankton lineages with reduced genomes. Nat Microbiol. 2017;2:1–9. doi: 10.1038/nmicrobiol.2017.91. [DOI] [PubMed] [Google Scholar]
- 43.Luo H, Friedman R, Tang J, Hughes AL. Genome reduction by deletion of paralogs in the marine cyanobacterium Prochlorococcus. Mol Biol Evol. 2011;28:2751–60. doi: 10.1093/molbev/msr081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Mira A, Ochman H, Moran NA. Deletional bias and the evolution of bacterial genomes. Trends Genet. 2001;17:589–96. doi: 10.1016/s0168-9525(01)02447-7. [DOI] [PubMed] [Google Scholar]
- 45.Kuo C-H, Ochman H. Deletional bias across the three domains of life. Genome Biol Evol. 2009;1:145–52. doi: 10.1093/gbe/evp016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the datasets generated, analyzed, and presented in the current study are available in the Supplementary Information. Genomic sequences of the eight Thermococcus eurythermalis strains are available at the JGI IMG under the GOLD study id Gs0142375. Raw reads of the eight strains are available at the NCBI SRA under the accession number PRJNA679699.
The custom scripts used in this study have been deposited in the online repository (https://github.com/luolab-cuhk/Thermococcus-mut-genome-size).