Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2013 Mar 4;110(13):E1180. doi: 10.1073/pnas.1300129110

Reply to Gatesy and Springer: The multispecies coalescent model can effectively handle recombination and gene tree heterogeneity

Shaoyuan Wu a,1, Sen Song b, Liang Liu c, Scott V Edwards d,1
PMCID: PMC3612631  PMID: 23650651

Gatesy and Springer (1) suggest that coalescent methods cannot build a reliable phylogeny from gene trees, particularly from exonic sequences concatenated in silico. However, the authors confuse several key issues in coalescent theory and phylogenetic reconstruction, and their criticisms are unfounded.

Gatesy and Springer cite two papers to claim that results of recent mammal coalescent analyses differ from ours because of how we sampled genes (2). However, differences with these studies are more likely a result of their use of Bayesian methods unsuitable for phyologenomic data, or of genomic markers (ultraconserved elements) different from ours. Different genomic markers likely carry different signals (3), but our use of exonic data has proven effective in many contexts.

Gatesy and Springer (1) claim that the existence of recombination in our dataset might mislead coalescent analysis. However, a recent study (4) shows that recombination is not a primary factor influencing the accuracy of coalescent models. It is understandable that systematists might think that recombination would ruin a species tree or even a concatenation analysis, but in fact, because recombination only occurs within species, it is unlikely to be problematic for any but the most extreme cases of incomplete lineage sorting (4).

Gatesy and Springer (1) claim that the use of in silico concatenated exonic data in our analysis violated a basic assumption of coalescent methods. We did in fact “concatenate” exonic sequences separated in the genome, but in calling that problematic, Gatesy and Springer overlooked the wealth of phylogenomic studies that do the same thing when using transcriptome data. Our use of concatenated exonic data is actually the same as the use of cDNA sequences from transcriptomes, which has been widely and successfully used in phylogenomic studies (5). It is Gatesy and Springer, and the practice of concatenation in general, that “ignore a fundamental tenet of molecular biology” when, by using concatenation methods, which typically not only combine sequences from the same gene but also from genomically dispersed genes together, they introduce a process that never occurs in nature. Thus, our concatenation in constructing exonic sequences is in fact widely used and is not the same as the concatenation typically practiced in phylogenetics.

Finally, in suggesting that the data of Meredith et al. (6) was insufficient for their analysis, we do not imply that the many systematic studies that have sampled fewer base pairs are invalid. Indeed, we have shown extensively elsewhere that coalescent methods can work well with as few as two loci, depending on the degree of hemiplasy. The number of loci or base pairs required scales with the number of taxa sampled; in Gatesy and Springer’s work (1), loci have been undersampled at the expense of more taxa, which can lead to problems in analysis.

Overall, Gatesy and Springer (1) ignore the basic difference between concatenation and coalescent methods, namely in how they handled gene tree heterogeneity. As shown by our subsampling analysis (2) and many other studies, violation of the assumption of identical gene trees can lead concatenation into inconsistency. When analyzing phylogenomic datasets with many taxa, gene tree heterogeneity will be substantial and coalescent methods will be more consistent.

Acknowledgments

This work was supported in part by the startup funds of L.L. provided by the University of Georgia; the Collaboration Grant from Tsinghua National Laboratory for Computation; and National Science Foundation Grant DEB-0743616 (to S.V.E).

Footnotes

The authors declare no conflict of interest.

References

  • 1.Gatesy J, Springer MS. Concatenation versus coalescence versus “concatalescence”. Proc Natl Acad Sci USA. 2013;110:E1179. doi: 10.1073/pnas.1221121110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Song S, Liu L, Edwards SV, Wu S. Resolving conflict in eutherian mammal phylogeny using phylogenomics and the multispecies coalescent model. Proc Natl Acad Sci USA. 2012;109(37):14942–14947. doi: 10.1073/pnas.1211733109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Dolgin E. Phylogeny: Rewriting evolution. Nature. 2012;486(7404):460–462. doi: 10.1038/486460a. [DOI] [PubMed] [Google Scholar]
  • 4.Lanier HC, Knowles LL. Is recombination a problem for species-tree analyses? Syst Biol. 2012;61(4):691–701. doi: 10.1093/sysbio/syr128. [DOI] [PubMed] [Google Scholar]
  • 5.Chiari Y, Cahais V, Galtier N, Delsuc F. Phylogenomic analyses support the position of turtles as the sister group of birds and crocodiles (Archosauria) BMC Biol. 2012;10:65. doi: 10.1186/1741-7007-10-65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Meredith RW, et al. Impacts of the Cretaceous Terrestrial Revolution and KPg extinction on mammal diversification. Science. 2011;334(6055):521–524. doi: 10.1126/science.1211028. [DOI] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES