Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
letter
. 2013 Apr 4;5(5):819–821. doi: 10.1093/gbe/evt048

Rewriting Evolution—“Been There, Done That”

David Penny 1,*
PMCID: PMC3673624  PMID: 23558594

Abstract

A recent paper by a science journalist in Nature shows major errors in understanding phylogenies, in this case of placental mammals. The underlying unrooted tree is probably correct, but the placement of the root just reflects a well-known error from the acceleration in the rate of evolution among some myomorph rodents.

Keywords: placental mammals, phylogeny errors, rates of evolution


There is a rather famous local T-shirt from the 1970s that was a take-off of those who had bought theirs at exotic locations around the world—it was simply “Been there, done that.“ However, the message applies very well to claims by a science writer in a three-page Nature article last year that untested methods were supposedly “rewriting“ the tree of placental mammal relationships (Dolgin 2012). The supposedly “new” phylogeny placed the root of the placental tree within the rodents—contradicting both established molecular and fossil phylogenies! However, the article as a whole is a major attack on the whole field of molecular phylogenetics. Several problems that have been solved in the past are made afresh in that report, and it is instructive/helpful to analyze these difficulties to help avoid similar problems in the future. Unfortunately, the author makes at least four classical mistakes about evolutionary trees: not separating out the problem of rooting the tree (Penny 1976); ignoring the mathematical studies about problems when there are differences in the mutational process, such as in mutation rates (e.g., Hendy and Penny 1989); not specifying the model/mechanism of evolution; and therefore not being able to determine an appropriate optimality criterion (Steel and Penny 2000), and thus cannot evaluate the time scale over which their characters are relevant for phylogeny (Mossel and Steel 2004).

Overall, the author is certainly in no position to challenge well-established basic phylogenies of placental mammals (e.g., Meredith et al. 2011; dos Reis et al. 2012)—even though there are still some well-known uncertainties about a few aspects of the placental tree (such as the precise point of rooting the tree) (Meredith et al. 2011; McCormack et al. 2012), see later. We tried to take the issue up with Nature itself, but because of the length of the response, they did not wish any discussion of these fundamental issues. However, it is almost certainly an issue that all journals should have a procedure for handling, at least in a general way, questions that the journals themselves have raised, even if they are by science reporters.

First, there is no outgroup for the claimed rooted tree (Figure 1 of Dolgin [2012] is shown unrooted as fig. 1A). Their unrooted tree is standard, but their rooted tree is not (their tree is shown in fig. 1C). The current accepted tree (fig. 1B) is also consistent with the unrooted tree (fig. 1A), so the apparent controversy is not about the unrooted tree; it is about the position of the root of the tree—point 1. This is a very important point, and it means that there is really no problem with the type of data used, it is simply that the position of the root is critical to interpreting the tree. It is unclear how Dolgin (2012) roots the tree, but it appears to be midpoint rooting (see Penny et al. [1995], which is known to be sensitive to rate differences, see the next point.

Fig. 1.—

Fig. 1.—

The mammalian tree in question. The same unrooted tree (A) can be rooted at the arrow “r1” to give the standard tree for placental mammals (B) or at position “r2” to give the first divergence within rodents (as in C). It is therefore essential to specify the position of the root, and this is usually done with an “outgroup”—for example, the marsupials for the placental mammals. C, cat; D, dog; E, elephant; G, guinea pig; H, human; M, mouse; Rb, rabbit; Rt, rat.

Second, we have seen the tree in figure 1C before (D'Erchia et al. 1996; see also Lin, Waddell, et al. 2002), it was long ago falsified because it is quite well known that there have been significant acceleration in the mutational processes in at least some myomorph rodents. Compared with humans, they have more than a 3-fold increase in mutation rate per cell division (Drake 1999). This rate difference initially led to some difficulties reconciling nuclear and mitochondrial trees (e.g., Lin et al. 2002). Improving the number and distribution of eutherian samples meant, there was the expected basic consilience between nuclear and mitochondrial sequences and that has been maintained as more and more sequences became available. A similar problem has been found, and solved, in bird evolution—Some early trees gave the root of the avian tree in the faster evolving passerine song birds (see Harrison et al. 2004). Break up the long branches in either the placental or the avian case, and the traditional tree (for placentals) or the paleognath/neognath tree (for birds) reappears. So point 2 is that there are some well-known problems with establishing the root of the tree when there are differences in either mutation rates or nucleotide composition (Lockhart et al. 1992). In a statistical sense, these are “systematic” errors, rather than “sampling“ errors.

Although the two issues given above are sufficient to account for the unusual tree in Dolgin (2012), there are other problems. The third point is that it is essential to understand as much as possible about the processes of evolution for the data used. It is certainly not sufficient to claim that a particular analysis is “simpler” (as in Dolgin 2012) because it is known, for example, that some “simple” optimality criteria (such as parsimony) might be the maximum-likelihood estimator for a “no common mechanism" model. However, in principle, this allows a very large increase in the number of parameters and is subject to the long-branch attraction effect mentioned earlier (Hendy and Penny 1989). So under many circumstances, it cannot guarantee to produce the correct tree—even with an unlimited number of characters. So in this sense, it is not “statistically consistent” (see Felsenstein 2003). Thus, an apparently “simple” model does not guarantee consistency of the method. In practice, it is necessary to go well beyond whether a character is “present/absent” and formally evaluate the time scale over which a character will be informative for a given mutation rate (Mossel and Steel 2004). These latter authors show an important limit for Markov models, and their mathematical results certainly help explain the limitation at deeper times. These are the third and fourth points.

We must always be ready to revise our scientific knowledge, and we should welcome new tests of any aspect of science. There is a very interesting aspect of evolutionary theory that it might be difficult to finalize the last details of the placental tree (see Song et al. 2012). Although there are about 3 × 1020 rooted trees for 19 orders of mammals ((2n − 3)!!, where n is the number of taxa, and the !! notation is multiplying by every second number—1 × 3 × 5 …). We are about down to no more than 101 or 102 trees left to consider. It has been known for about 30 years that although individual gene trees are highly similar, they are not usually identical. This happens because of coalescence/lineage sorting (Pamilo and Nei 1988), hybridization, differential selection on sequences (at either the primary sequence or tertiary structural level), ambiguous homologies (including from polyploidy and/or copy number variation), gene conversion, lateral gene transfer, etc. So we still have more work to do!

Perhaps the bad news is that unfortunately evolutionary relationships cannot be done seriously by amateurs? However, the good news is that there is a well-studied mathematical basis for tree building (e.g., Felsenstein 2003; Semple and Steel 2003). Certainly, the small nonprotein coding RNAs (ncRNAs) are an important addition to our knowledge of gene regulation and help inform us about important dynamic aspects of evolution (Hoeppner et al. 2012). However, there is no evidence, yet, that simple “presence/absence” of ncRNAs evolves at a time scale that makes them useful for deeper phylogeny—that has to be established separately. There has been some careful work that showed, for example, the occurrence of a vault RNA throughout vertebrates, but that study used gene location data (synteny) to help show homology (Stadler et al. 2009). We certainly require full testing/evaluation of new and/or untried methods of analysis. However, the present attack on evolutionary study is just full of classical mistakes. We must always determine the time scale over which characters are relevant, it is essential to solve the mathematical problems, and it is also necessary to determine the correct placement of the root of the tree. Only then can we start to move beyond what we know already. As far as we can tell, the current difficulty is not any property of microRNAs, but the mutational processes in myomorph rodents. The unrooted tree of Dolgin (2012) is standard; it is just the position of the root of the tree that is the problem. Although we should all welcome the testing of our current knowledge by new data and new approaches, there are also known standards that must be met.

Literature Cited

  1. D'Erchia AM, Gissi C, Pesole G, Saccone C, Arnason U. The guinea-pig is not a rodent. Nature. 1996;381:597–600. doi: 10.1038/381597a0. [DOI] [PubMed] [Google Scholar]
  2. Dolgin E. Rewriting evolution. Nature. 2012;486:460–462. doi: 10.1038/486460a. [DOI] [PubMed] [Google Scholar]
  3. dos Reis M, et al. Phylogenomic datasets provide both precision and accuracy in estimating the timescale of placental mammal phylogeny. Proc Biol Sci. 2012;279:3491–3500. doi: 10.1098/rspb.2012.0683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Drake JW. The distribution of rates of spontaneous mutation over viruses, prokaryotes, and eukaryotes. Ann N Y Acad Sci. 1999;870:100–107. doi: 10.1111/j.1749-6632.1999.tb08870.x. [DOI] [PubMed] [Google Scholar]
  5. Felsenstein J. Inferring phylogenies. Sunderland (MA): Sinauer; 2003. [Google Scholar]
  6. Harrison GL, et al. Four new avian mitochondrial genomes help get to basic evolutionary questions in the late cretaceous. Mol Biol Evol. 2004;21:974–983. doi: 10.1093/molbev/msh065. [DOI] [PubMed] [Google Scholar]
  7. Hendy MD, Penny D. A framework for the quantitative study of evolutionary trees. Syst Zool. 1989;38:297–309. [Google Scholar]
  8. Hoeppner MP, Gardner PP, Poole AM. Comparative analysis of RNA families reveals distinct repertoires for each domain of life. PLoS Comput Biol. 2012;8(11):e1002752. doi: 10.1371/journal.pcbi.1002752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Lin Y-H, et al. Four new mitochondrial genomes and the increased stability of evolutionary trees of mammals from improved taxon sampling. Mol Biol Evol. 2002;19:2060–2070. doi: 10.1093/oxfordjournals.molbev.a004031. [DOI] [PubMed] [Google Scholar]
  10. Lin Y-H, Waddell PJ, Penny D. Pika and vole mitochondrial genomes add support to both rodent monophyly and glires. Gene. 2002;294:119–129. doi: 10.1016/s0378-1119(02)00695-9. [DOI] [PubMed] [Google Scholar]
  11. Lockhart PJ, et al. Controversy on chloroplast origins. FEBS Lett. 1992;301:127–131. doi: 10.1016/0014-5793(92)81231-a. [DOI] [PubMed] [Google Scholar]
  12. McCormack JE, et al. Ultraconserved elements are novel phylogenomic markers that resolve placental mammal phylogeny when combined with species-tree analysis. Genome Res. 2012;22:746–754. doi: 10.1101/gr.125864.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Meredith RW, et al. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science. 2011;334:521–524. doi: 10.1126/science.1211028. [DOI] [PubMed] [Google Scholar]
  14. Mossel E, Steel M. A phase transition for a random cluster model on phylogenetic trees. Math Biosci. 2004;187:189–203. doi: 10.1016/j.mbs.2003.10.004. [DOI] [PubMed] [Google Scholar]
  15. Pamilo P, Nei M. Relationships between gene trees and species trees. Mol Biol Evol. 1988;5:568–583. doi: 10.1093/oxfordjournals.molbev.a040517. [DOI] [PubMed] [Google Scholar]
  16. Penny D. Criteria for optimising phylogenetic trees and the problem of finding the root of a tree. J Mol Evol. 1976;8:95–116. doi: 10.1007/BF01739097. [DOI] [PubMed] [Google Scholar]
  17. Penny D, et al. Improved analyses of human mtDNA sequences support a recent African origin for Homo sapiens. Mol Biol Evol. 1995;12:863–882. doi: 10.1093/oxfordjournals.molbev.a040263. [DOI] [PubMed] [Google Scholar]
  18. Semple C, Steel M. Phylogenetics. Oxford: Oxford University Press; 2003. [Google Scholar]
  19. Song S, Liu L, Edwards SV, Wu S. Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc Natl Acad Sci U S A. 2012;109:14942–14947. doi: 10.1073/pnas.1211733109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Stadler PF, et al. Evolution of vault RNAs. Mol Biol Evol. 2009;26:1975–1991. doi: 10.1093/molbev/msp112. [DOI] [PubMed] [Google Scholar]
  21. Steel MA, Penny D. Parsimony, likelihood and the role of models in molecular phylogenetics. Mol Biol Evol. 2000;17:839–850. doi: 10.1093/oxfordjournals.molbev.a026364. [DOI] [PubMed] [Google Scholar]

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES