Abstract
How much of evolutionary history is lost because of the unevenness of the fossil record? Lagerstätten, sites which have historically yielded exceptionally preserved fossils, provide remarkable, yet distorting insights into past life. When examining macroevolutionary trends in the fossil record, they can generate an uneven sampling signal for taxonomic diversity; by comparison, their effect on morphological variety (disparity) is poorly understood. We show here that lagerstätten impact the disparity of ichthyosaurs, Mesozoic marine reptiles, by preserving higher diversity and more complete specimens. Elsewhere in the fossil record, undersampled diversity and more fragmentary specimens produce spurious results. We identify a novel effect, that a taxon moves towards the centroid of a Generalized Euclidean dataset as its proportion of missing data increases. We term this effect ‘centroid slippage’, as a disparity-based analogue of phylogenetic stemward slippage. Our results suggest that uneven sampling presents issues for our view of disparity in the fossil record, but that this is also dependent on the methodology used, especially true with widely used Generalized Euclidean distances. Mitigation of missing cladistic data is possible by phylogenetic gap filling, and heterogeneous effects of lagerstätten on disparity may be accounted for by understanding the factors affecting their spatio-temporal distribution.
Keywords: lagerstätten, disparity, missing data, centroid slippage, generalized Euclidean distance
1. Introduction
Palaeontologists express different attitudes to lagerstätten, deposits which have historically yielded exceptionally preserved fossils. Most see them as opportunities to encounter the true richness of past life, others as statistical outliers that distort the story presented by the fossil record [1]. The distortion is potentially huge; for example, only 10% of the Burgess Shale fauna would be seen under ‘normal’ conditions of fossilization [2]. Alternatively, younger lagerstätten like Holzmaden or Jehol do not exhibit such high levels of biological novelty and many lagerstätte taxa co-occur in deposits lacking exceptional preservation. Understanding how lagerstätten bias the fossil record is therefore crucial to accurately reconstructing biodiversity through deep time.
The effects of lagerstätten sampling on taxonomic diversity are well documented [1,3,4], but do not necessarily extend to morphological variety (disparity), as these measures of biodiversity are generally decoupled [5–7]. Morphometric disparity analyses directly compare physical shapes between taxa, but their application in palaeontology is limited by the pervasive issue of missing anatomical data [8]. Cladistic disparity analyses use differential character states between taxa as a proxy for disparity, producing similar results to the morphometric approach [9] and readily accommodating missing character data. While that same missing data negatively affects measurement of disparity, lagerstätten positively affect character completeness [10–12]. Lagerstätte biases may therefore relate to the problem of missing data, introducing methodological issues to cladistic disparity analyses that are linked to biases in the geological record. Ciampaglio et al. [13] examined the effects of missing data on disparity by using artificial cladistic datasets that were up to 25% incomplete. Smith et al. [14] address the impacts of missing data at greater percentages, finding that disparity for a variety of metrics decreases as the proportion of missing cladistic data increases; this analysis, however, was performed on fossil datasets rather than artificial datasets where the true (complete data) disparity value can be measured. Consequently, the theoretical impacts of missing data at the highest percentages remain unexplored.
Several morphometric disparity studies have excluded lagerstätte taxa from their analyses [15–18]. Each found little evidence for lagerstätten biases, but these conclusions cannot necessarily be extended to cladistic disparity analyses. Lagerstätten bias the cladistic disparity of pterosaurs, flying reptiles with delicate, hollow bones [4]. While diversity and disparity may be decoupled, their fossil record is strongly dependent on lagerstätten [19]; such deposits undoubtedly record a sizeable proportion of their disparity. Butler et al. [4] also used phylogenetic correction to infill missing character states in their cladistic matrix, preventing investigation of the potential interrelationship between lagerstätten, missing data, and disparity. Brusatte et al. [20] found that phylogenetic correction for missing data in pterosaurs alters the results of disparity analysis in diverse ways depending on the datasets, phylogenetic trees, and disparity comparisons used, and does not simply inflate disparity estimates. However, they caution against phylogenetic correction, except for clades with a large amount of missing data or patchy fossil records. It is, therefore, worth exploring the fossil records of clades not composed predominantly of lagerstätten to understand the interlinked effects of methodology, missing data, and exceptional preservation on disparity.
Ichthyosaurs, swimming reptiles with large, robust skeletons, occupied shallow and deep marine environments with high preservation potentials, giving them a rich fossil record. Ichthyosaurs were among the most significant groups to radiate in the oceans after the Permian–Triassic mass extinction. New taxa described from the Panxian, Luoping, Xingyi, and Guanling lagerstätten of China [21], alongside longer-known materials from the Besano lagerstätte of Italy and Switzerland and the well-established global record of Triassic ichthyosaurs [22], document rapid accumulation of disparity and anatomical innovations through the Triassic [23]. Despite reduction in ecomorphospace by the Triassic–Jurassic extinction [24], ichthyosaurs remained ecologically prominent throughout most of the Mesozoic [25]. With their rich fossil record, comprising lagerstätten and non-lagerstätten deposits, ichthyosaurs make a suitable case study to investigate the impacts of exceptional preservation on our view of disparity.
We also examine disparity in artificial cladistic datasets, allowing for measurement of the true disparity signal, followed by the random removal of 0–100% of characters to ascertain the impact of higher levels of missing data. Ichthyosaur disparity is calculated here from a moderately incomplete dataset (52% missing characters) without any phylogenetic correction method, and so our analysis of the effects of missing data provides a means of critiquing the signal that emerges.
2. Material and methods
(a). Character and distance matrices
In cladistic disparity analyses, character-taxon cladistic matrices are converted to taxon–taxon distance matrices and then ordinated, allowing disparity metrics to be calculated as summary statistics from the ordination space. Most studies use either Generalized Euclidean Distance (GED) or Gower Coefficient Distance (GCD) matrices. The former has proved popular because incomparable distances are mathematically infilled; complete datasets allow for a greater range of subsequent analytical techniques. Lloyd [26], however, cautioned that this approach may result in illogical data that could negatively impact a disparity study. GCD has become increasingly popular as it has the advantage of accounting for the effects of missing data at the cost of removing taxa which do not share any characters and so are incomparable. Maximum Observable Rescaled Distance (MORD) operates in a similar way to GCD [26]; however, this has not yet been widely used as it has only recently been developed. Here, we use disparity metrics calculated from all three distances to ascertain how the choice of distance matrix may affect measurements of disparity as GCD and MORD matrices are designed to more readily accommodate missing data.
Our chosen cladistic matrix is modified from Moon [27] and comprises 112 ichthyosauriforms scored for 287 discrete characters, where characters 100, 204, 205, and 218 are ordered. Phalarodon major was removed as it is a nomen dubium [28] and the character states for Suevoleviathan integer and S. disinteger were combined as the latter is a junior synonym of the former [29]. GED, GCD, and MORD distance matrices were generated from this using the MorphDistMatrix function of the Claddis package [26] in R (v3.2.2) [30]. In the GCD and MORD matrices, 14 taxa had incomparable distances and were removed prior to ordination. While GED matrices are commonly employed for the specific reason that they do not require taxon removal, those same taxa were then removed from the GED matrix to give taxon parity in subsequent statistical tests.
(b). Ordination
Each distance matrix was subjected to principal coordinates analysis (PCO) in R (v3.2.2) [30]. PCO may introduce negative eigenvalues into the resultant data; thus, their corresponding axis lengths, as the square roots of those eigenvalues, are imaginary. Negative eigenvalue correction, using either the Lingoes [31] or Cailliez [32] methods, are used to avoid this issue and are commonly employed in PCO analyses. Such methods, however, reduce the amount of variation represented by lower ordination axes and so erode their disparity signal [33]. As there is no consensus on the number of PCO axes to use in subsequent analyses [26], this may be problematic for studies that focus on few axes, such as the first three. To ascertain the impacts of negative eigenvalue correction, the squared Pearson correlation between the pairwise distances in PCO space and the original distance matrix was plotted, following the method of Kotrc & Knoll [33], along with the percentage variance summarized by increasing numbers of PCO axes. This was conducted for each type of distance matrix, with and without Cailliez negative eigenvalue correction.
Largely, as the number of uncorrected PCO axes increases, the degree of correlation with the original distance matrix increases smoothly, plateauing as higher axes are included (electronic supplementary material, figure S1). By contrast, while inclusion of all corrected PCO axes produces a perfect correlation with the original distance matrix, it fluctuates for lower numbers of axes. Even for uncorrected GCD and MORD matrices, the correlation declines sharply over the first three axes, those that are plottable and commonly used in the calculation of disparity metrics.
On the basis of these results, PCO without negative eigenvalue correction was performed on each type of distance matrix. We used all positive PCO axes in subsequent analyses to maximize the correlation with its corresponding distance matrix and summarizing the greatest possible amount of variance on the lower axes, while excluding axes with negative eigenvalues: 107 for GED, 62 for GCD, and 62 for MORD.
The different degrees of correlation between the original distance matrices, and corrected and uncorrected PCO matrices, warn against arbitrary selection of axes or selection based on certain criteria, such as summarizing a particular percentage of variance, as the chosen number of axes may be poorly correlated with the original distance matrix. The sharp drop in correlation over the first few axes even in the uncorrected PCO matrices further warns against selection of axes without first determining how well they represent the original data.
(c). Disparity calculations
The selected taxa were sorted into 10 Ma time bins, spanning 250 Ma to 90 Ma, and the PCO scores for each distance matrix subsampled for the taxa in each bin. Following the recommendations of previous studies [34,35], a 10 Ma time bin length was selected as a suitable trade-off between resolution and sample size. This approach was also selected, rather than division into epochs, to avoid confounding effects associated with uneven time bin lengths. Disparity metrics were then calculated using the disparity function of the R package dispRity [36]. The chosen metrics were the sums of ranges and variances, and the mean distance from the centroid; 95% confidence intervals were calculated from 1000 bootstrapped pseudoreplicates with replacement. PCO plots of morphospace for each bin were created using the first two PCO axes. The same metrics and confidence intervals were then calculated for a second dataset pruned of taxa that occur in lagerstätte deposits.
(d). Lagerstätte taxa
The definition of a lagerstätte remains controversial. Sedimentological, geochemical, and taphonomic criteria have been described [37], but no consensus exists. The definition of a lagerstätte may also include a historical component. Many classical lagerstätten have been repeatedly sampled for decades or even centuries, partly due to the exceptional scientific and aesthetic quality of their specimens, partly due to high specimen availability resulting from systematic excavation in quarries or rapid erosional exposure (e.g. coastal settings). Other factors may compound this historical component, for example, the initial accessibility of a locality or the effort invested in the subsequent preparation and description of any collected specimens; all these elements are likely linked to some degree by the ‘bonanza effect’ where palaeontologists preferentially sample and describe fossiliferous units of high scientific potential over others [38], leading to sampling spikes for those units. Furthermore, as lagerstätten show heterogeneous fidelity of preservation between clades, localities that yield taxa used in our analysis may be considered lagerstätten for ichthyosaurs, but not for other clades. Maisch [22] identified the Triassic Pardonet Formation and Hosselkus Limestone, and the Middle Jurassic Los Molles Formation, among others, as ichthyosaur lagerstätten, based on their relatively complete, well-preserved, and well-sampled skeletons despite the lack of classical indicators such as soft tissue preservation. Other ichthyosaur-bearing horizons, particularly the Oxford Clay and Blue Lias formations, contain beds that show exceptional preservation, and so are referred to as lagerstätte-grade by some authors; however, other authors do not class them as such.
Ultimately, the issue under consideration is how lagerstätten impact our picture of ichthyosaur disparity in terms of their contribution to character state codings. Thus, we define a conservative list of lagerstätten which may have induced spikes in the availability of highly codable specimens due to the balance of the historical (well-sampled and documented) and taphonomic (exceptionally preserved) factors outlined above. Any significant differences in disparity metrics should therefore remain valid even with the elevation of existing deposits to lagerstätte grade, the future discovery of new ichthyosaur taxa from lagerstätten, or the discovery of new lagerstätten themselves. The chosen lagerstätten are the Early Triassic Nanlinghu and Jialingjiang formations of China, the Middle and Late Triassic Guanling and Xiaowa formations of China, the Middle Triassic Zhuganpo Member of the Falang Formation of China, the Middle Triassic Besano Formation of Switzerland and Italy, the Early Jurassic Posidonia Shale of Germany, the Early Jurassic Strawberry Bank Limestone, and Blue Lias and Charmouth Mudstone formations of the UK, the Middle to Late Jurassic Oxford Clay of the UK, the Late Jurassic Solnhofen Limestone of Germany, the Late Jurassic Kimmeridge Clay Formation of the UK, the Early Cretaceous Vaca Muerta Formation of Argentina, and the Early Cretaceous Toolebuc Formation of Australia. Out of the 112 taxa in this analysis, 37 are exclusively from lagerstätten (33%) and occur in bins 2–4, 6–8, 11–13, 15–16. As there was no way to assess whether lagerstätte or non-lagerstätte specimens of the same taxon provided particular character codings in the dataset, we included co-occurring taxa in both partitions of the dataset. This represents a compromise reflecting the certain (lagerstätte-exclusive taxa) and potential (co-occurring taxa) influences of lagerstätten on ichthyosaur disparity, rather than the worst-case scenario where lagerstätten also dominate the proportion of known character states for co-occurring taxa. We also ran the following analyses where only lagerstätten-exclusive taxa were removed from disparity calculations, thus representing the best-case scenario.
(e). Lagerstätten effects
Statistically significant differences between time bins with and without lagerstätte taxa were assessed in two ways. Firstly, disparity metrics from bins which contain lagerstätten and those from the corresponding bins with lagerstätten taxa removed were extracted from the overall time series and tested for significant differences using paired samples two-tailed t-tests and Wilcoxon signed-rank tests. Secondly, two-tailed Pearson and Spearman tests of correlation with Holm correction [39] for false discovery rates were used to test whether the removal of lagerstätte taxa affected the trend in each metric through time. Generalized differencing [40] using Graeme Lloyd's gen.diff function [41], was performed on the raw disparity metrics to detrend and remove short-term autocorrelation from the data. Correlations were then tested for in the same manner. To relate missing data (see below) and sample size biases to possible lagerstätte effects, the same tests as above were conducted on diversity and mean character completeness time series, and an independent samples, two-tailed t-test was used to test for a significant difference in completeness between lagerstätte and non-lagerstätte taxa.
(f). Missing data
To assess how the proportion of missing data for a taxon in the original cladistic matrix might affect its position in morphospace, Pearson and Spearman tests of correlation were used to test for a relationship between the percentage of missing data from each taxon and their Pythagorean distance from the origin of the PCO plots in n dimensions, where n is the number of PCO axes. A linear model was then fitted to the data, showing that as the percentage of missing data increases, the Pythagorean distance to the origin decreases (see below). The procedure was repeated using just the first two axes to provide a means of critiquing the PCO morphospace plots. Disparity metrics for each level of missing data were also calculated, using the methodology described above (electronic supplementary material, figure S2).
To ascertain whether the relationship is simply a quirk of the dataset, the same procedure was conducted on two other cladistic datasets taken from the literature: 81 non-avian theropod dinosaurs coded for 413 characters taken from Nesbitt & Ezcurra [42], and 80 sauropterygians coded for 270 characters taken from Benson & Bowdler [43], as well as the ichthyosaur cladistic matrix used in this study. Both datasets were downloaded from Graeme Lloyd's website (http://www.graemetlloyd.com/matr.html) on 22 July 2017, converted to distance matrices, and subjected to PCO without negative eigenvalue correction in R (v3.2.2) [30] prior to the calculation of Pythagorean distances for each taxon and correlation with the percentages of missing data (electronic supplementary material, figures S3–S6).
The relationship between missing data and disparity was investigated further using simulations. A single dataset comprising 20 operational taxonomic units coded for 300 unordered, binary, equally weighted characters with randomly assigned states generated in R (v3.2.2) [30]. Within the same cladistic matrix, the 20 OTUs were duplicated 10 times and 10% through to 90% of character states were deleted for successive groups of 10 OTUs. The cladistic matrix was converted to a GED matrix and subjected to PCO without negative eigenvalue correction. The same relationship between missing data and Pythagorean distance was tested for using the method above.
Additional simulations were conducted to specifically focus on the relationship between missing data and distance from the centroid. To ensure the accuracy of these simulations, 20 empirical cladistic matrices were downloaded from Graeme Lloyd's website on 01 May 2018 and the proportion of different character states assessed for each; the proportion of higher character states was seen to decline in an approximate geometric relationship (electronic supplementary material, figure S7). Normal, Poisson, and negative binomial distributions were then fitted to the distribution of missing data in each matrix (electronic supplementary material, figure S8) and the model fit assessed using the Akaike Information Criterion. Negative binomial distributions performed the best, but only marginally compared to normal distributions (electronic supplementary material, table S1). Simulated cladistic matrices covering a range of dimensions were generated in R (v3.2.2). Each matrix was then assigned a different normal distribution of missing data (this strategy was chosen for simplicity as a negative binomial distribution tends towards normality for large sample sizes), and equal or geometric distributions of unordered, randomly assigned character states for binary, ternary, quaternary, quinary, and senary coding strategies. Each matrix was transformed, ordinated, and analysed using the methodology above, and with negative eigenvalue correction during ordination.
3. Results
(a). Lagerstätten biases
Time bins with lagerstätte taxa show significantly lower values for each disparity metric when lagerstätte taxa are excluded (table 1 and figure 1). Morphospace occupation is similarly reduced when lagerstätte taxa are excluded (figure 2). After generalized differencing, centroid distances from GCD and MORD fail to correlate between lagerstätte and non-lagerstätte time series using Pearson correlation; centroid distance from all three distances, and the sums of variances from GED also fail using Spearman correlation (tables 2 and 3). This shows that the preservational biases of lagerstätten impact the long-term trend in disparity observed over hundreds of millions of years. It should be noted, however, that while there is a statistical difference between the time series, they broadly recover the same peaks and troughs in disparity, aside from the late Early to Late Jurassic where lagerstätten dominate their fossil record. An independent samples, two-tailed t-test showed a statistically significant difference (T = −3.47, d.f. = 54.69, p < 0.005, n = 25, 40) in the percentage completeness of cladistic characters between the lagerstätte (57.4% complete) and non-lagerstätte taxa (37.2% complete), and taxonomic diversity-per-bin shows significant differences within lagerstätte bins when lagerstätte taxa are excluded (table 1). Together, these results suggest that lagerstätten affect disparity in two ways: through the preservation of more taxa; and through the preservation of more complete specimens. Biases in individual disparity metrics and morphospace plots, however, are partially dependent on the choice of distance matrix.
Table 1.
Paired samples two-tailed t-tests and Wilcoxon signed-rank (W) tests between disparity metrics (Sum V = sum of variances, Sum R = sum of ranges, Cent = mean distance from the centroid), and character completeness (Compl) and taxonomic diversity (Div), from time bins with lagerstätte taxa, and with those taxa removed (d.f. = 9, p-values are italicized, T and W statistics are plain text).
| Sum V | Sum R | Cent | Compl | Div | |
|---|---|---|---|---|---|
| T | |||||
| GED | 4.67, <0.005 | 11.01, <0.0001 | 5.80, <0.0005 | −1.25, 0.24 | 4.11, <0.005 |
| GCD | 5.08, <0.001 | 10.42, <0.0001 | 6.94, <0.0001 | — | — |
| MORD | 4.94, <0.001 | 10.15, <0.0001 | 6.56, <0.0005 | — | — |
| W | |||||
| GED | 55, <0.005 | 55, <0.005 | 55, <0.005 | 10, 0.16 | 45, <0.01 |
| GCD | 55, <0.005 | 55, <0.005 | 55, <0.005 | — | — |
| MORD | 55, <0.005 | 55, <0.005 | 55, <0.005 | — | — |
Figure 1.
Comparisons of ichthyosaur diversity disparity through time, with and without lagerstätte taxa. Time series of disparity metrics, along with taxonomic diversity, for each time bin from 250 to 90 Ma. (a) Taxonomic diversity, (b) sum of variances, (c) sum of ranges, and (d) mean distance from the centroid. Error bars represent the bootstrapped 95% confidence interval. (Online version in colour.)
Figure 2.
Changing morphospace occupation of ichthyosaurs through geological time. Ten Myr morphospace plots of the first two PCO axes of the ichthyosaur dataset from 250 to 90 Ma. (Online version in colour.)
Table 2.
Two-tailed tests of Pearson correlation between raw and generalized differenced (GD) disparity metric time series, and character completeness and diversity, with lagerstätte taxa present and removed (p-values are italicized). See table 1 for column title abbreviations.
| Sum V | Sum R | Cent | Compl | Div | |
|---|---|---|---|---|---|
| raw | |||||
| GED | 0.79, <0.005 | 0.80, <0.005 | 0.78, <0.005 | 0.94, <0.0001 | 0.90, <0.0001 |
| GCD | 0.92, <0.0001 | 0.82, <0.001 | 0.767, <0.01 | — | — |
| MORD | 0.90, <0.0001 | 0.81, <0.001 | 0.65 <0.05 | — | — |
| GD | |||||
| GED | 0.67, <0.05 | 0.74, <0.05 | 0.67, <0.05 | 0.93, <0.0001 | 0.92, <0.0001 |
| GCD | 0.92, <0.0001 | 0.76, <0.01 | 0.45, 0.12 | — | — |
| MORD | 0.89, <0.0005 | 0.76, <0.005 | 0.44, 0.13 | — | — |
Table 3.
Two-tailed tests of Spearman correlation between raw and GD disparity metric time series, and character completeness and diversity, with lagerstätte taxa present and removed (p-values are italicized). See table 1 for column title abbreviations.
| Sum V | Sum R | Cent | Compl | Div | |
|---|---|---|---|---|---|
| Raw | |||||
| GED | 0.85, <0.005 | 0.81, <0.001 | 0.781 <0.005 | 0.94, <0.0001 | 0.85, <0.0005 |
| GCD | 0.80, <0.005 | 0.75, <0.005 | 0.36, 0.20 | — | — |
| MORD | 0.87, <0.0005 | 0.77, <0.005 | 0.32, 0.26 | — | — |
| GD | |||||
| GED | 0.42, 0.14 | 0.66, <0.05 | 0.54, 0.12 | 0.98, <0.0001 | 0.90, <0.0005 |
| GCD | 0.71, <0.05 | 0.65, <0.05 | 0.52, 0.07 | — | — |
| MORD | 0.75, <0.01 | 0.64, <0.05 | 0.42, 0.16 | — | — |
Reanalysis, excluding only lagerstätte-exclusive taxa from disparity calculations, had no effect on the statistical significance or otherwise of our results, aside from significant Spearman correlations for the GED-derived centroid distance after generalized differencing. This demonstrates that the impacts of lagerstätten remain prevalent even for the best-case scenario. The greater completeness of lagerstätte specimens, however, suggests that the true impacts of lagerstätten on ichthyosaur disparity may lie somewhere between our compromise and the worst-case scenario.
(b). Missing data and centroid slippage
A highly significant, negative relationship was found between the percentage of missing characters for a taxon in a cladistic matrix and its Pythagorean distance to the origin of n-dimensional PCO space-derived from GED, where n is the number of PCO axes examined (table 4 and figure S9). This trend holds true across simulated cladistic matrices spanning a wide variety of dimensions, coding strategies, and levels of missing data, and is unaffected by the use, or otherwise, of negative eigenvalue correction during ordination (figure 3). For convenience, we term this effect ‘centroid slippage’ (PCO performed on a GED matrix is mathematically equivalent to principal components analysis, where the centroid of the dataset lies at zero on all axes). The effect of centroid slippage is nullified when GCD and MORD are used. Together with the reduced impact of lagerstätten effects when alternative distances are used, this highlights a key methodological issue for disparity analyses. In other words, taxa move from their true positions in morphospace towards the centroid.
Table 4.
Two-tailed tests of Pearson and Spearman correlation between a taxon's percentage missing data and its Pythagorean distance from the origin of PCO space in n dimensions where n is the total number of PCO axes for each dataset (p-values are italicized).
| Ichthyosauria | Sauropterygia | Dinosauria | artificial | |
|---|---|---|---|---|
| Pearson | −0.87, <0.0001 | −0.91, <0.0001 | −0.84, <0.0001 | −0.98, <0.0001 |
| Spearman | −0.89, <0.0001 | −0.90, <0.0001 | −0.88, <0.0001 | −0.97, <0.0001 |
Figure 3.
Simulation of the proportion of missing data versus distance from the morphospace origin. Heatmaps of the relationship between a taxon's percentage missing data and its Pythagorean distance from the origin of PCO space on the first two PCO axes, derived from 1100 matrices and 100 000 taxa. The shift from blue to red indicates an increasing density of points at a given coordinate.
GED produces the most severe biases in disparity metrics and is especially subject to centroid slippage, as gaps in the distance matrix are infilled with a weighted mean fractional univariate distance based on the calculable distances [26]. The limits of morphospace size are affected by the proportion of missing data for taxa comprising the hull of the morphospace (figure 4), and trends in morphospace occupation by the dispersal of taxa within that morphospace. Lagerstätten effects are part of this, with the higher character completeness of their taxa often determining the bounds of morphospace in the bins in which they occur, while their position relative to more incomplete taxa is warped as the latter slip towards the centroid (figure 2).
Figure 5.
Impact of missing data on morphospace. Morphospace plots of the first two PCO axes of a simulated dataset, artificially degraded for varying levels of missing data, compared to the true morphospace for the complete simulation dataset.
Figure 4.
Impact of missing data on morphospace. Morphospace plots of the first two PCO axes of a simulated dataset, artificially degraded for varying levels of missing data, compared to the true morphospace for the complete simulation dataset.
(c). Ichthyosaur disparity through time
Despite their different responses to geological sampling biases, bivariate morphospace plots and disparity metric time series from all three distance types converge upon trends in disparity concordant with the findings of previous authors. Morphospace occupation was greatest in the first 8.7 Myr of the Triassic (figures 1 and 2), during which time early ichthyosaurs gained great ecomorphological diversity by radiating into ecological niches left vacant by the Permian–Triassic mass extinction, plus possibly new, previously unoccupied ecospace [23]. Disparity rapidly decreased during the loss of circa-littoral and shallow water forms during the Late Triassic and remained reduced in accordance with the conservative morphology of Early Jurassic ichthyosaurs, despite their high diversity [24]. Overall, there was a shift in morphospace occupation throughout the Mesozoic (figure 2), corresponding to the shift from early elongate taxa to later thunniform taxa [41].
4. Discussion
Compared to the morphometric approach to disparity, the cladistic approach is an excellent way of examining disparity in an incomplete fossil record, yet it is sensitive to the choice of distance matrix and disparity metric. GCD is increasingly recommended as it can account for the effect of missing data [44]; Lloyd [26] introduced MORD which may perform even more strongly. While these latter distances will likely prove preferable in future studies, two problems associated with GED may yet cause problems with our current view of disparity in the fossil record, as this distance type is still commonly employed in studies using the cladistic approach.
Firstly, centroid slippage makes a taxon appear morphologically average, analogous to the way in which missing data in phylogenetic studies causes stemward slippage [45], where the incomplete taxon appears more plesiomorphic than is actually the case. By extension, just as stemward slippage may confound the determination of phylogenetic relationships among a set of taxa, centroid slippage will confound the morphological relationships among taxa, affecting their relative positions in morphospace. Lagerstätte biases relate to this first issue due to the greater completeness of their taxa.
The second issue is that, as PCO is a method used to visualize variance in a dataset using similarities and differences between groups of objects, centroid slippage may be problematic for studies that attempt to use separation in morphospace as a proxy for ecomorphological separation between clades. Highly incomplete taxa may be removed from the clusters to which they are truly affiliated and pushed into another, or groups that are truly morphologically separate may end up overlapping due to centroid slippage. More broadly, the changing positions of taxa in all dimensions will alter the size and position of occupied morphospace, strongly altering the disparity signal.
Focusing on the complete signal contained within the fossil record, the high number of Triassic and Early Jurassic lagerstätten suggests that this part of ichthyosaur evolutionary history can be interpreted with some confidence. In particular, their Jurassic fossil record is composed predominantly of lagerstätten, giving high fidelity, yet also indicating the potential severity of lagerstätte biases were these deposits absent. Ichthyosaur lagerstätten are fewer in number from the Cretaceous onwards, and correspondingly interpretation of their evolutionary history is less confident [24]. This is particularly problematic in the few million years prior to their extinction during the Mid-Cretaceous, where their fossil record is highly fragmentary [25].
Butler et al. [4] demonstrated that the peaks and troughs in the record of pterosaur disparity are largely artificial due to lagerstätte sampling. While the ichthyosaur fossil record is less dependent on lagerstätten deposits, there is nevertheless a biased signal between bins with and without lagerstätten. Lagerstätten vary in their depositional environments and are geographically and temporally localized; thus, they only give a truer record of particular ecosystems, but not necessarily a more complete global signal. Despite this, the disparity and diversity signals they preserve are more complete and so are inherently more faithful than bins without lagerstätte deposits; these latter must therefore be interpreted with caution.
Lagerstätten improve the fidelity of the disparity signal for a time bin by preserving a greater range of species, and by preserving more complete specimens. Thus, while the bias of missing species and missing data may affect lagerstätte-containing bins less, the relative increase in disparity in lagerstätte-containing bins will affect attempts to draw broad conclusions about patterns of disparity through time; for example, if linear models are fitted to time series to try to discern an overall trend. An important point to consider is that while lagerstätten provide more faithful preservation, they have differing impacts on different groups. To an extent, this is dependent on the depositional environment of the ecosystem and the ecological biases of the organisms within that ecosystem; for example, infaunal organisms are typically more likely to be preserved than nektonic or pelagic species, and shelf environments more likely to receive deposition than forests. As such, the distribution and type of lagerstätten in time and space may have heterogeneous effects on the disparity of different clades. Here, we document a clear impact of lagerstätten on ichthyosaur disparity, whereas a study of Palaeozoic echinoderms [46] found that lagerstätten had virtually no effect on their overall disparity (strong effects were noted because of taphonomic degradation, however, highlighting the problem of missing data). In turn, this poses difficulties for associating changes in disparity within distinct groups to external causes or to evolutionary interactions between those groups.
Together, these two points highlight the need to find ways to correct the biases imposed by lagerstätten. The removal of highly incomplete taxa prior to analysis does not solve the problem of missing data, merely sidestepping it at the cost of a straight reduction in disparity. Phylogenetic correction is an alternative, but its utility is dependent on how well resolved the chosen phylogenetic hypothesis is. As this is itself affected by the amount of missing data in a cladistic matrix, the instability of phylogenetic and disparity analyses may trend together; thus, the utility of phylogenetic correction will decline even as the disparity analysis calls more strongly for correction. This is also the case for the use of gap weighting to infill missing data. Smith et al. [14] note that these methods all affect the disparity signal differently. They present a novel method of correction, which identifies linkage in the presence or absence of characters in a dataset, and then selectively removes additional data to equalize the distribution of missing characters between the taxa, minimizing the degree of disparity reduction at varying levels of missing data. In turn, while the true disparity values cannot be measured, relative changes in the size and occupation of morphospace through time can be identified with greater confidence. Application of this method to future disparity studies may help reduce the impact of the greater completeness of lagerstätte specimens.
The bias from preservation of a greater number of taxa is harder to mitigate, but may be tackled through a better understanding of the factors that control the spatio-temporal distribution of lagerstätten. Lagerstätten of similar ages, representing similar facies environments, may occur in clusters resulting from deposition in a broad geographical region where conditions were generally conducive to exceptional preservation [47]. Trends in the size and distribution of these clusters through time result from fluctuations in the environmental constraints required for their formation at local, regional, and global scales [47]. Preservational biases resulting in lagerstätten dominate in the marine environment, while terrestrial lagerstätten are more strongly controlled by sampling biases [47]. Thus, it may be possible to demonstrate more general links between these biases and the disparity signal from the fossil record. The spatio-temporal distribution of lagerstätten may also provide useful insight into the problem of missing data. Tutin & Butler [48] found a significant relationship between skeletal completeness and character completeness metrics for plesiosaurs, and between plesiosaur and ichthyosaur completeness, suggesting common controls on skeletal preservation in both groups; this could feasibly include a lagerstätten bias. Lagerstätte biases in completeness may not be present for all groups; however, Verriere et al. [49] found no evidence for lagerstätte effects on parareptile diversity and in turn no correlation between their diversity and skeletal or character completeness metrics.
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Supplementary Material
Acknowledgements
We thank Dr Steve Donovan, and two other anonymous referees, whose comments helped us to greatly improve the quality of this manuscript.
Data accessibility
All additional data for this article are provided in the electronic supplementary material, table S1, figures S1–S9. All R scripts and data used in the analyses are available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.2b6n12q [50].
Authors' contributions
J.F.S., T.L.S., and B.C.M. designed the analysis. J.F.S. carried out the analysis. B.C.M. provided the cladistic dataset. J.F.S. and M.J.B. wrote the paper, and all authors commented on the paper.
Competing interests
We declare we have no competing interests.
Funding
B.C.M. is funded by Leverhulme Research Project Grant no. RPG-2015-126 to M.J.B.
Disclaimer
This manuscript has not been published in whole or in part elsewhere and is not currently being considered for publication in another journal.
References
- 1.Benson R, Butler R, Lindgren J, Smith A. 2010. Mesozoic marine tetrapod diversity: mass extinctions and temporal heterogeneity in geological megabiases affecting vertebrates. Proc. R. Soc. B 277, 829–834. ( 10.1098/rspb.2009.1845) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Whittington H, Conway Morris S. 1985. Illustration. In Fossils of the burgess shale (eds Briggs D, Erwin D, Collier F, Clark C), p. 418 Washington, DC: Smithsonian Institution Press. [Google Scholar]
- 3.Benson R, Butler R. 2011. Uncovering the diversification history of marine tetrapods: ecology influences the effect of geological sampling biases. Geol. Soc. Spec. Publ. 358, 191–208. ( 10.1144/SP358.13) [DOI] [Google Scholar]
- 4.Butler R, Brusatte S, Andres B, Benson R. 2011. How do geological sampling biases affect studies of morphological evolution in deep time? A case study of pterosaur (Reptilia: Archosauria) disparity. Evolution 66, 147–162. ( 10.1111/j.1558-5646.2011.01415.x) [DOI] [PubMed] [Google Scholar]
- 5.Erwin D. 2007. Disparity: morphological pattern and developmental context. Palaeontology 50, 57–73. ( 10.1111/j.1475-4983.2006.00614.x) [DOI] [Google Scholar]
- 6.Ruta M, Angielczyk D, Fröbisch J, Benton M. 2013. Decoupling of morphological disparity and taxic diversity during the adaptive radiation of anomodont therapsids. Proc. R. Soc. B 280, 20131071 ( 10.1098/rspb.2013.1071) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Benton M, Forth J, Langer M. 2014. Models for the rise of the dinosaurs. Curr. Biol. 24, 87–95. ( 10.1016/j.cub.2013.11.063) [DOI] [PubMed] [Google Scholar]
- 8.Brown C, Arbour J, Jackon D. 2012. Testing of the effect of missing data estimation and distribution in morphometric multivariate data analyses. Syst. Biol. 61, 941–954. ( 10.1093/sysbio/sys047) [DOI] [PubMed] [Google Scholar]
- 9.Hetherington A, Sherratt E, Ruta M, Wilkinson M, Deline B, Donoghue P. 2015. Do cladistic and morphometric data capture common patterns of morphological disparity? Palaeontology 58, 393–399. ( 10.1111/pala.12159) [DOI] [Google Scholar]
- 10.Brocklehurst N, Upchurch P, Mannion P, O'Connor J. 2012. The completeness of the fossil record of Mesozoic birds: implications for early avian evolution. PLoS ONE 7, e39056 ( 10.1371/journal.pone.0039056) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cleary T, Moon B, Dunhill D, Benton M. 2015. The fossil record of ichthyosaurs, completeness metrics and sampling biases. Palaeontology 58, 521–536. ( 10.1111/pala.12158) [DOI] [Google Scholar]
- 12.Dean C, Mannion P, Butler R. 2016. Preservational bias controls the fossil record of pterosaurs. Palaeontology 59, 225–247. ( 10.1111/pala.12225) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ciampaglio C, Kemp M, McShea D. 2001. Detecting changes in morphospace occupation patterns in the fossil record: characterization and analysis of measures of disparity. Paleobiology 27, 695–715. () [DOI] [Google Scholar]
- 14.Smith A, Rosario M, Eiting T, Dumont E. 2014. Joined at the hip: linked characters and the problem of missing data in studies of disparity. Evolution 68, 2386–2400. ( 10.1111/evo.12435) [DOI] [PubMed] [Google Scholar]
- 15.Friedman M. 2010. Explosive morphological diversification of spiny-finned teleost fishes in the aftermath of the end-Cretaceous extinction. Proc. R. Soc. B 277, 1675–1683. ( 10.1098/rspb.2009.2177) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Anderson P, Friedman M, Brazeau M, Rayfield R. 2011. Initial radiation of jaws demonstrated stability despite faunal and environmental change. Nature 476, 206–209. ( 10.1038/nature10207) [DOI] [PubMed] [Google Scholar]
- 17.Smithwick F, Stubbs T. 2018. Phanerozoic survivors: Actinopterygian evolution through the Permo-Triassic and Triassic-Jurassic mass extinction events. Evolution 72, 348–362. ( 10.1111/evo.13421) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Clarke J, Friedman M. 2018. Body-shape diversity in Triassic–Early Cretaceous neopterygian fishes: sustained holostean disparity and predominantly gradual increases in teleost phenotypic variety. Paleobiology 94, 1–32. ( 10.1017/pab.2018.8) [DOI] [Google Scholar]
- 19.Benton M, Dunhill A, Lloyd G, Marx F. 2011. Assessing the quality of the fossil record: insights from vertebrates. In Comparing the geological and fossil records: implications for biodiversity studies (eds A McGowan, A Smith). Geol. Soc. Spec. Publ. 358, 63–94. ( 10.1144/SP358.6) [DOI] [Google Scholar]
- 20.Brusatte S, Montanari S, Yi H, Norell M. 2011. Phylogenetic corrections for morphological disparity analysis: new methodology and case studies. Paleobiology 37, 1–22. ( 10.1666/09057.1) [DOI] [Google Scholar]
- 21.Benton M, et al. 2013. Exceptional vertebrate biotas from the Triassic of China, and the expansion of marine ecosystems after the Permo–Triassic mass extinction. Earth Sci. Rev. 137, 85–128. ( 10.1016/j.earscirev.2013.05.014) [DOI] [Google Scholar]
- 22.Maisch M. 2010. Phylogeny, systematics and origin of the Ichthyosauria – the state of the art. Palaeodiversity 3, 151–214. [Google Scholar]
- 23.Stubbs T, Benton M. 2016. Ecomorphological diversifications of Mesozoic marine reptiles: the roles of ecological opportunity and extinction. Paleobiology 42, 547–573. ( 10.1017/pab.2016.15) [DOI] [Google Scholar]
- 24.Dick D, Maxwell E. 2015. The evolution and extinction of the ichthyosaurs from the perspective of quantitative ecospace modelling. Biol. Lett. 11, 20150339 ( 10.1098/rsbl.2015.0339) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fischer V, Bardet N, Guiomar M, Godefroit P. 2014. High diversity in Cretaceous ichthyosaurs from Europe prior to their extinction. PLoS ONE 9, e84709 ( 10.1371/journal.pone.0084709) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lloyd G. 2016. Estimating morphological diversity and tempo with discrete character-taxon matrices: implementation, challenges, progress, and future directions. Biol. J. Linn. Soc. 118, 131–151. ( 10.1111/bij.12746) [DOI] [Google Scholar]
- 27.Moon B. 2019. A new phylogeny of ichthyosaurs (Reptilia: Diapsida). J. Syst. Palaeontol. 17, 129–155. ( 10.1080/14772019.2017.1394922) [DOI] [Google Scholar]
- 28.MacGowan C, Motani R. 2003. Ichthyopterygia, p. 175 München, Germany: Verlag Dr. Friedrich Pfeil. [Google Scholar]
- 29.Maxwell E. 2018. Redescription of the 'lost' holotype of Suevoleviathan integer (Bronn, 1844) (Reptilia: Ichthyosauria). J. Vertebr. Paleontol. 38, e1439833 ( 10.1080/02724634.2018.1439833) [DOI] [Google Scholar]
- 30.R Core Team. 2013. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; (www.R-project.org) [Google Scholar]
- 31.Lingoes J. 1971. Some boundary conditions for a monotone analysis of symmetric matrices. Psychometrika 36, 195–203. ( 10.1007/BF02291398) [DOI] [Google Scholar]
- 32.Cailliez F. 1983. The analytical solution of the additive constant problem. Psychometrika 48, 343–349. ( 10.1007/BF02294026) [DOI] [Google Scholar]
- 33.Kotrc B, Knoll A. 2015. A morphospace of planktonic marine diatoms. I. Two views of disparity through time. Paleobiology 41, 45–67. ( 10.1017/pab.2014.4) [DOI] [Google Scholar]
- 34.Foote M. 1994. Morphological disparity in Ordovician–Devonian crinoids and the early saturation of morphological space. Paleobiology 20, 320–344. ( 10.1017/S009483730001280X) [DOI] [Google Scholar]
- 35.Brusatte S, Benton M, Ruta M, Lloyd G. 2008. The first 50 Myr of dinosaur evolution: macroevolutionary pattern and morphological disparity. Biol. Lett. 4, 733–736. ( 10.1098/rsbl.2008.0441) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Guillerme T. 2018. dispRity: a modular R package for measuring disparity. Methods Ecol. Evol. 9, 1755–1763. ( 10.1111/2041-210X.13022) [DOI] [Google Scholar]
- 37.Allison P. 1998. Konservat-Lagerstätten: cause and classification. Paleobiology 14, 331–344. ( 10.1017/S0094837300012082) [DOI] [Google Scholar]
- 38.Raup D. 1977. Systematists follow the fossils. Paleobiology 3, 328–329. ( 10.1017/S0094837300005431) [DOI] [Google Scholar]
- 39.Holm S. 1979. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 6, 65–70. ( 10.2307/4615733) [DOI] [Google Scholar]
- 40.McKinney M. 1990. Classifying and analyzing evolutionary trends. In Evolutionary trends (ed. McNamara K.), pp. 28–58. London, UK: Belhaven. [Google Scholar]
- 41.Lloyd G. 2008. Generalized differencing of time series See www.graemetlloyd.com/methgd.html.
- 42.Nesbitt S, Ezcurra M. 2015. The early fossil record of dinosaurs in North America: a new neotheropod from the base of the Upper Triassic Dockum Group of Texas. Acta Palaeontol. Pol. 60, 513–526. ( 10.4202/app.00143.2014) [DOI] [Google Scholar]
- 43.Benson R, Bowdler T. 2014. Anatomy of Colymbosaurus megadeirus (Reptilia, Plesiosauria) from the Kimmeridge Clay Formation of the U.K., and high diversity among Late Jurassic plesiosauroids. J. Vertebr. Paleontol. 34, 1053–1071. ( 10.1080/02724634.2014.850087) [DOI] [Google Scholar]
- 44.Thorne P, Ruta M, Benton M. 2011. Resetting the evolution of marine reptiles at the Triassic-Jurassic boundary. Proc. Natl Acad. Sci. USA 108, 8339–8344. ( 10.1073/pnas.1018959108) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sansom R, Wills M. 2013. Fossilization causes organisms to appear erroneously primitive by distorting evolutionary trees. Sci. Rep. UK 3, 2545 ( 10.1038/srep02545) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Deline B, Thomka J. 2017. The role of preservation on the quantification of morphology and patterns of disparity within Paleozoic echinoderms. J. Paleontol. 91, 618–632. ( 10.1017/jpa.2016.129) [DOI] [Google Scholar]
- 47.Muscente A, et al. 2017. Exceptionally preserved fossil assemblages through geologic time and space. Gondwana Res. 48, 164–188. ( 10.1016/j.gr.2017.04.020) [DOI] [Google Scholar]
- 48.Tutin S, Butler R. 2017. The completeness of the fossil record of plesiosaurs, marine reptiles from the Mesozoic. Acta Palaeontol. Pol. 62, 563–573. ( 10.4202/app.00355.2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Verriere A, Brocklehurst N, Frobisch J. 2016. Assessing the completeness of the fossil record: comparison of different methods applied to parareptilian tetrapods (Vertebrata: Sauropsida). Paleobiology 42, 680–695. ( 10.1017/pab.2016.26) [DOI] [Google Scholar]
- 50.Flannery Sutherland JT, Moon BC, Stubbs TL, Benton MJ. 2019. Data from: Does exceptional preservation distort our view of disparity in the fossil record? Dryad Digital Repository. ( 10.5061/dryad.2b6n12q) [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Flannery Sutherland JT, Moon BC, Stubbs TL, Benton MJ. 2019. Data from: Does exceptional preservation distort our view of disparity in the fossil record? Dryad Digital Repository. ( 10.5061/dryad.2b6n12q) [DOI] [PMC free article] [PubMed]
Supplementary Materials
Data Availability Statement
All additional data for this article are provided in the electronic supplementary material, table S1, figures S1–S9. All R scripts and data used in the analyses are available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.2b6n12q [50].





