This special feature has its origins with a 2014 symposium organized by myself and my co-editors David W. Bapst, Graeme T. Lloyd and Nicholas J. Matzke at the Society for Vertebrate Paleontology meetings in Berlin, Germany. In the years prior to 2014, there had been several interesting and important papers published [1–4] about estimating phylogenetic trees incorporating fossil taxa, and particularly involving Bayesian analyses. Some of these papers were written from a neontological perspective, and appeared in journals not often read by palaeontologists. We convened our session with a simple idea: to get different types of researchers from both neontology and palaeontology together to share work and ideas concerning phylogenetic analyses in palaeontology. The session's success inspired my co-editors and I to propose a special feature for Biology Letters, to which we invited the session participants and other contributors. The Biology Letters short format is perfect for focused vignettes about the actual practice of phylogenetics with fossils.
For years, those working primarily with fossil data, and those working primarily with molecular data, took different approaches to both building phylogenetic trees, and scaling them to absolute time. Palaeontologists have often applied a maximum parsimony paradigm under which potential trees are scored according to how many character changes are required in the dataset to produce that tree. The tree implying the fewest changes being ultimately preferred, returning either the single most parsimonious tree or a sample of equally parsimonious trees. Neontologists, working primarily with molecular data, more often apply likelihood-based methods, including Bayesian methods, which evaluate phylogenetic trees by applying a model of evolution to the data, and calculating the likelihood of the data given the tree and the model. Maximum likelihood methods return a point estimate of the phylogeny with branch lengths equal to the expected number of substitutions per site along that branch. Bayesian methods allow for researchers to quantify and incorporate their existing knowledge about phylogenetic relationships and model parameters using probability distributions called priors. Under Bayesian methods, a sample of trees is returned, allowing a more natural incorporation of uncertainty in phylogenetic estimation.
Communication between palaeontological and neontological workers has been limited. No matter the methodology for estimating them, phylogenetic trees have become crucial to comparative biology. Phylogenetic trees have been used to formulate, give evidence in favour of or against hypotheses about the evolutionary history of groups and traits. Because the field of phylogenetics is both crucial and rapidly changing, we sought to incorporate a diverse cross-section of the approaches biologists use to incorporate fossils in trees, and how those trees are eventually used to inform the study of evolution.
The bulk of these papers have focused on the estimation of phylogenetic trees. Building a quality phylogenetic dataset remains a challenge for researchers; Guillerme & Cooper [5] address this issue in their study on availability of both fossil and extant data in mammals. This paper takes a new look at the very old question of if and how missing data are injurious to phylogenetic inference, concluding that the distribution of missing data is more important than the ultimate amount. O'Reilly & Donoghue [6] take a broad look at different paradigms for dating phylogenetic trees, using fossils either as tips in the analysis or as node calibrations, ultimately concluding that these approaches can be used concurrently. Matzke & Wright [7] evaluate the importance of the assumptions made in Bayesian divergence dating analyses, and demonstrate how violations of these assumptions can result in poor inferences. Together, these papers address some of the aspects of building and dating phylogenetic trees incorporating fossils that may be opaque to researchers.
Some papers built trees as a way to understand the biology of a specific clade, or to address a question about how traits have evolved, and how methodological advances can allow us to explore these questions more thoroughly. Researchers in this issue tackled two phylogenetic questions that hold not just the interest of scientists, but the general public. Lloyd et al. [8] use the fossil record of dinosaurs in conjunction with model-based methods for a posteriori time-scaling to evaluate when the bird clade, and flight, originated. Lee [9], seeking to understand the timing of mammal evolution, implemented automated approaches for discovering which parts of the dataset evolve under potentially different evolutionary rates. He concludes that automated approaches can be useful for understanding the biology that underlies phylogeny.
Lastly, some of the papers in this issue explored the downstream use of phylogenetic trees in trait-based or biogeographic analysis. Halliday & Goswami [10] and Bapst et al. [11] both evaluate multiple phylogenetic trees in phylogenetic comparative analyses. Halliday & Goswami built both parsimony and Bayesian phylogenies of placental mammals to look at body-size evolution. Bapst et al. perform a similar analysis in birds. Both papers ultimately conclude that careful attention to the assumptions underlying the production of dated phylogenies is crucial, and that the inferential method chosen can greatly affect the results of comparative analyses. Gorscak & O'Connor [12] build a Bayesian time-scaled phylogeny to understand the palaeobiogeography of titanosaur sauropod dinosaurs, in the process posing several large taxonomic rearrangements. They then use a model selection approach to conclude that titanosaurs were globally distributed, but experienced regionalization as the continents drifted apart in the Cretaceous. Together, these three papers suggest that to learn more about evolutionary patterns via phylogenies with fossil tips, we must closely examine how we reconstruct our dated trees.
It has been a pleasure to assemble this collection. I am proud that seven of the papers in this special feature come from early-career researchers, including graduate students. I am particularly thankful to Biology Letters, and editors Surayya Johar and Raminder Shergill for assisting and allowing us to gather a diverse and talented group of researchers to make this special feature. This field has moved faster than anyone could have expected. In a few short years, the models and methods available to analyse combined datasets have expanded greatly. My co-editors and I look forward to seeing how methods for putting fossils in trees will continue to develop into the future.
Competing interests
I declare I have no competing interests.
Funding
This article was funded by National Science Foundation (grant no. 1612858).
References
- 1.Stadler T. 2010. Sampling-through-time in birth–death trees. J. Theor. Biol. 267, 396–404. ( 10.1016/j.jtbi.2010.09.010) [DOI] [PubMed] [Google Scholar]
- 2.Ronquist F, Klopfstein S, Vilhelmsen L, Schulmeister S, Murray DL, Rasnitsyn AP. 2012. A total-evidence approach to dating with fossils, applied to the early radiation of the Hymenoptera. Syst. Biol. 61, 973–999. ( 10.1093/sysbio/sys058) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bapst DW. 2013. A stochastic rate-calibrated method for time-scaling phylogenies of fossil taxa. Methods Ecol. Evol. 4, 724–733. ( 10.1111/2041-210X.12081) [DOI] [Google Scholar]
- 4.Heath TA, Huelsenbeck JP, Stadler T. 2014. The fossilized birth-death process for coherent calibration of divergence-time estimates. Proc. Natl Acad. Sci. USA 111, E2957–E2966. ( 10.1073/pnas.1319091111) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Guillerme T, Cooper N. 2016. Assessment of available anatomical characters for linking living mammals to fossil taxa in phylogenetic analyses. Biol. Lett. 12, 20151003 ( 10.1098/rsbl.2015.1003) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.O'Reilly JE, Donoghue PCJ. 2016. Tips and nodes are complementary not competing approaches to the calibration of molecular clocks. Biol. Lett. 12, 20150975 ( 10.1098/rsbl.2015.0975) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Matzke NJ, Wright A. 2016. Inferring node dates from tip dates in fossil Canidae: the importance of tree priors. Biol. Lett. 12, 20160328 ( 10.1098/rsbl.2016.0328) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lloyd GT, Bapst DW, Friedman M, Davis KE. 2016. Probabilistic divergence time estimation without branch lengths: dating the origins of dinosaurs, avian flight and crown birds. Biol. Lett. 12, 20160609 ( 10.1098/rsbl.2016.0609) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lee MSY. 2016. Multiple morphological clocks and total-evidence tip-dating in mammals. Biol. Lett. 12, 20160033 ( 10.1098/rsbl.2016.0033) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Halliday TJD, Goswami A. 2016. The impact of phylogenetic dating method on interpreting trait evolution: a case study of Cretaceous–Palaeogene eutherian body-size evolution. Biol. Lett. 12, 20160051 ( 10.1098/rsbl.2016.0051) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bapst DW, Wright AM, Matzke NJ, Lloyd GT. 2016. Topology, divergence dates, and macroevolutionary inferences vary between different tip-dating approaches applied to fossil theropods (Dinosauria). Biol. Lett. 12, 20160237 ( 10.1098/rsbl.2016.0237) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gorscak E, O'Connor PM. 2016. Time-calibrated models support congruency between Cretaceous continental rifting and titanosaurian evolutionary history. Biol. Lett. 12, 20151047 ( 10.1098/rsbl.2015.1047) [DOI] [PMC free article] [PubMed] [Google Scholar]
