Arising from: H. Lee et al.; Scientific Reports 10.1038/s41598-023-36848-0 (2023).
The classification of sand dollars was recently reassessed by Lee et al.1 based on a four-locus molecular dataset. While expanding the taxon sampling relative to previous studies, and providing a novel hypothesis of relationship for some sand dollar lineages, the authors also favor a topology that incorporates several deep splits that are incongruent with previous morphological and molecular efforts, including several genome-scale studies. Here, I reevaluate their dataset and find that it does not harbor the necessary signal to resolve deep branching patterns. On the contrary, available phylogenomic data reject their tree as a plausible phylogenetic hypothesis. Conflicting phylogenetic trees should not be considered for taxonomic or macroevolutionary purposes without first evaluating the adequacy of the data at hand, especially when publicly available, genome-scale datasets for the intervening taxa already exist.
Phylogenetic trees serve two major purposes: (1) they provide a means for classifying organisms into taxonomic groups, and (2) they are the substrate on which evolutionary patterns and processes are explored. In a recent publication, Lee et al.1 presented a novel hypothesis of relationships for sand dollars, sea biscuits, and close relatives, a clade of echinoids known as Luminacea. Their phylogenetic tree was then used as mentioned above to introduce a new classification of sand dollars, suggest potential morphological homoplasies, and infer the biogeographic history of the clade. The efforts of Lee et al.1, however, support a topology that is at odds with our current understanding of the phylogeny of irregular echinoids, while disregarding the wealth of molecular resources already available for the clade2.
As noted by the authors, the classification of luminacean lineages has been heavily revised recently, suggesting a much more dynamic morphological evolution than once considered3,4 (Fig. 1A–C). In fact, the name Luminacea was only proposed two years ago. Nonetheless, these taxonomic changes were supported by genome-scale datasets2,5, congruent with previous small-scale molecular studies6, and adopted only after a thorough assessment of levels of phylogenetic signal, noise, and biases in the data. Lee et al.1 gathered a novel molecular dataset for Luminacea composed of four loci (16 s, 18 s, cox1, and H3) and including the broadest sample of sand dollar (Scutelloida) family-level clades. Their efforts help clarify relationships among Scutelliformes, supporting the erection of three new superfamilies, and posing valuable hypotheses regarding the independent origins of lunules (the unique perforations in the tests of some sand dollars). Nonetheless, their topology (shown in Fig. 1C) markedly departs from those of recent studies (shown in Fig. 1B), rejecting the monophyly of sand dollars and finding a novel placement for Cassiduloida. This tree has no precedent in the literature, conflicting with Sanger-sequenced6, mitogenomic7, transcriptomic2,5, and total-evidence8 datasets, and further widening the discrepancy between molecular and morphological evidence (see Figs. 1 and S1; increased morphological tree lengths reflect elevated homoplasy). Despite these challenges, Lee et al.1 embrace their novel results, citing the benefits of a wider taxon sampling, and proposing that morphological similarities among sand dollars arose through convergent evolution.
Luminacea is a relatively ancient clade of irregular echinoids that originated sometime between the Middle Jurassic8 and the Early Cretaceous1,2, and which displays elevated heterogeneity in evolutionary rates5. These conditions pose substantial obstacles for phylogenetic reconstruction, especially when relying on small-scale datasets. In line with this, deep nodes in the phylogeny of Lee et al.1 show low support values, including those relating to the novel rearrangements proposed. In order to assess the robustness of their results, I obtained the data used by the authors from GenBank (72 nucleotide sequences for 28 taxa, encompassing four loci with occupancies ranging between 25 and 96%). Given the lack of H3 sequences among outgroup taxa, sequences of Strongylocentrotus purpuratus were further incorporated. Loci were aligned using the L-INS-i algorithm in MAFFT v7.5059, and concatenated into a matrix both with and without sequences for S. purpuratus (the latter replicating the original dataset, although the authors manually aligned some loci). The effect of trimming positions with over 50% missing data was also explored, determining four concatenated matrices. Inference from these datasets, as well as others mentioned below, was performed under maximum likelihood using optimal partitioned models in IQ-TREE v1.6.1210,11. Despite differing minimally with respect to the original alignment, none of these datasets recovered the topology supported by Lee et al.1 (Fig. S2), an instability that could indicate low levels of phylogenetic signal.
To assess this hypothesis, and establish whether the dataset gathered by Lee et al.1 is able to confidently resolve relationships among the major lineages within Luminacea (i.e., Cassiduloida, Clypeasteroida, Laganiformes, and Scutelliformes; see Fig. 1) I relied on approximately unbiased (AU) topological tests12. I performed a set of constrained tree searches enforcing the monophyly of the four aforementioned clades as well as of the nodes connecting them, exploring all fifteen possible patterns of relationships among them. All other nodes were left unconstrained. Comparison of the likelihood scores of these candidate trees reveals that five topological alternatives, including those shown in Fig. 1B and C, form part of the confidence set of trees (Table S1). These results prove that the data of Lee et al.1 cannot be used to support (or reject) deep relationships within Luminacea, and that their results do not truly deviate from those of previous studies.
While phylogenomic datasets sometimes fail to uncover true phylogenetic histories, the amount of information they contain can be much more thoroughly explored, allowing for competing signals to be quantified and diagnosed. I reanalyzed the latest echinoid phylotranscriptomic dataset2, estimating site-wise log likelihood scores for the trees depicted in Fig. 1B and C. These were turned into gene-wise scores whose difference, known as ΔGLS13, represent amounts of signal for/against topological alternatives. As shown in Fig. 1D, phylogenomic data strongly support the monophyly of Scutelloida, as well as the placement of Cassiduloida as their sister clade. On the other hand, the tree put forth by Lee et al.1 finds only minimal support, being the preferred topology for only 8% of loci.
Our understanding of the phylogeny and evolutionary history of sand dollars, sea biscuits, and close relatives, has changed dramatically in recent years2,5,8, prompting a taxonomic restructuring. Change, however, does not necessarily reflect uncertainty. The available molecular data for Luminacea places us, for the first time, in a position in which we can build a stable taxonomic classification for its living members. Improved taxon sampling will continue to provide novel phylogenetic insights; however, favoring phylogenies that conflict with those built using thousands of loci amounts to an extraordinary claim, one that, as shown here, is not based on sufficient evidence. While the phylogenetic and evolutionary hypotheses put forth by the authors for scutelliforms are valuable, their data do not substantiate a phylogenetic reassessment of Luminacea. Doing so threatens to perpetuate a state of taxonomic instability that is unwarranted in light of the data at hand, as well as potentially lead to inaccurate inferences of morphological, biogeographical, and macroevolutionary history.
Additional information
Supplementary information for this paper, including figures and tables, is available online. Data, results, and code to replicate all aspects of the analysis is deposited in the Zenodo repository: 10.5281/zenodo.10207759.
Supplementary Information
Acknowledgements
This manuscript was improved by comments from Rich Mooi, Greg W. Rouse, Jeffrey R. Thompson, and three anonymous reviewers. This work was supported by NSF grant DEB-2036186 (awarded to Greg W. Rouse).
Competing interests
The author declares no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-54208-4.
References
- 1.Lee H, et al. Phylogeny, ancestral ranges and reclassification of sand dollars. Sci. Rep. 2023;13:10199. doi: 10.1038/s41598-023-36848-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Mongiardino Koch N, et al. Phylogenomic analyses of echinoid diversification prompt a re-evaluation of their fossil record. eLife. 2022;11:e72460. doi: 10.7554/eLife.72460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kroh A, Smith AB. The phylogeny and classification of post-Palaeozoic echinoids. J. Syst. Palaeontol. 2010;8:147–212. doi: 10.1080/14772011003603556. [DOI] [Google Scholar]
- 4.Mooi R. Paedomorphosis, Aristotle's lantern, and the origin of the sand dollars (Echinodermata: Clypeasteroida) Paleobiology. 1990;16:25–48. doi: 10.1017/S0094837300009714. [DOI] [Google Scholar]
- 5.Mongiardino Koch N, et al. A phylogenomic resolution of the sea urchin tree of life. BMC Evol. Biol. 2018;18:189. doi: 10.1186/s12862-018-1300-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Smith AB, et al. Testing the molecular clock: Molecular and paleontological estimates of divergence times in the Echinoidea (Echinodermata) Mol. Biol. Evol. 2006;23:1832–1851. doi: 10.1093/molbev/msl039. [DOI] [PubMed] [Google Scholar]
- 7.Sun S, Xiao N, Sha Z. Complete mitochondrial genomes of four deep-sea echinoids: Conserved mitogenome organization and new insights into the phylogeny and evolution of Echinoidea. PeerJ. 2022;10:e13730. doi: 10.7717/peerj.13730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mongiardino Koch N, Thompson JR. A total-evidence dated phylogeny of Echinoidea combining phylogenomic and paleontological data. Syst. Biol. 2021;70:421–439. doi: 10.1093/sysbio/syaa069. [DOI] [PubMed] [Google Scholar]
- 9.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nguyen L-T, et al. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kalyaanamoorthy S, et al. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Shimodaira H. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. 2002;51:492–508. doi: 10.1080/10635150290069913. [DOI] [PubMed] [Google Scholar]
- 13.Kroh, A. & Mooi, R. World Echinoidea Database. Accessed at https://www.marinespecies.org/echinoidea on 2023–07–11. 10.14284/355 (2023).
- 14.Goloboff PA, Catalano SA. TNT version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics. 2016;32:221–238. doi: 10.1111/cla.12160. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.