Abstract
Estimates of deep-time biodiversity typically rely on statistical methods to mitigate the impacts of sampling biases in the fossil record. However, these methods are limited by the spatial and temporal scale of the underlying data. Here we use a spatially explicit mechanistic model, based on neutral theory, to test hypotheses of early tetrapod diversity change during the late Carboniferous and early Permian, critical intervals for the diversification of vertebrate life on land. Our simulations suggest that apparent increases in early tetrapod diversity were not driven by local endemism following the ‘Carboniferous rainforest collapse’. Instead, changes in face-value diversity can be explained by variation in sampling intensity through time. Our results further demonstrate the importance of accounting for sampling biases in analyses of the fossil record and highlight the vast potential of mechanistic models, including neutral models, for testing hypotheses in palaeobiology.
Subject terms: Palaeontology, Theoretical ecology
A spatially explicit mechanistic model, based on neutral theory, is used to show that changes in apparent diversity in the fossil record of the late Carboniferous and early Permian are explained by variation in sampling intensity through time.
Main
The establishment of terrestrial ecosystems and diversification of early tetrapods during the late Carboniferous and early Permian (323–272 million years ago (Ma)) was a key event in vertebrate evolution. This interval was punctuated by a climate change-driven floral turnover at the end of the Carboniferous, referred to as the ‘Carboniferous rainforest collapse’ (CRC)1,2. In the past decade, several studies have attempted to estimate the impact of the CRC on the diversity of early tetrapods (early representatives of amphibians and amniotes). The first investigation into the impact of the CRC on early tetrapod diversification, by Sahney et al.3, hypothesized that habitat fragmentation caused by the CRC drove increased endemism in early tetrapod communities via the ‘island-biogeography effect’, causing allopatric speciation in newly isolated patches of forest4. Such increases in local endemism would, in turn, be expected to lead to a rise in beta diversity and global species richness, coupled with a decline in local richness (alpha diversity)3. This interpretation has been challenged, however, because it took the fossil record at face value and thus did not compensate for pervasive biases caused by various interconnected geological, taphonomic, anthropogenic and historical factors, which result in an uneven spatial and temporal distribution of fossil occurrences3–7.
More recent investigations have pointed to sampling biases as a possible alternative explanation, and find no evidence of increases in endemism during or after the CRC5,6. In particular, Dunne et al.6, after correcting for sampling, found evidence of increased connectedness between early tetrapod communities (for both amphibians and amniotes) and lower ‘global’ diversity following the CRC—the opposite of that reported by Sahney et al.3. The same study6 also suggested that fragmentation of the rainforest probably promoted the recovery and subsequent diversification of amniotes, a clade that today comprises reptiles, birds and mammals6. Despite these advances, the early tetrapod fossil record remains fragmentary as well as unevenly and incompletely sampled (particularly in the late Carboniferous)7, which obscures patterns of diversity and biogeography during a critical time in vertebrate evolution6. The true joint effects of sampling and environmental change on the diversification of tetrapods following the CRC are yet to be unravelled.
Quantitative approaches to correcting for the effects of sampling biases on estimates of past biodiversity typically rely on statistical or phylogenetic techniques8,9. These approaches have led to substantial revisions of diversity patterns in many fossil groups, including early tetrapods10–15.
Mechanistic neutral models provide an alternative and complementary approach that has not yet been used widely in palaeobiological studies (but see ref. 16). Neutral models assume individual dynamics are independent of species identity. Making this strong assumption puts the focus on sampling, habitat structure and dispersal in isolation from other potential complicating factors. It also permits use of particularly efficient simulation algorithms17, enabling us to study spatially explicit samples of individuals from a very large spatially explicit landscape that would be impractical to simulate mechanistically under alternative models. Crucially, such neutral simulations can specify landscapes with realistic size and structure, and enable features such as palaeogeography, habitat loss and habitat fragmentation to be manipulated experimentally18. Diversity can then be sampled from the simulations in the same locations and with the same intensity as the empirical data, thus providing a new way to test how real-world patterns of fossil record sampling impact inferred patterns of face-value (directly observed, ‘raw’ or uncorrected) diversity, under a range of hypothetical palaeogeographic or ecological scenarios. The mechanistic nature of neutral models also enables us to run the models with samples much larger than the empirical sample sizes. This allows us to predict how observed diversity patterns might change if the intensity of fossil sampling were increased by an order of magnitude, and to understand what patterns could be detected within the currently available fossil data19. Studies involving neutral simulations can even be used to test theories of diversity dynamics at global scales, much larger than could be directly quantified in the fossil record because of incomplete spatial sampling16,20,21.
In this study, we apply a spatially explicit version of neutral theory to test the hypothesis that the CRC impacted early tetrapod diversity through habitat fragmentation. Our neutral simulations mimic the empirical structure of the fossil record by sampling at the same locations and to approximately the same intensity as recorded in the empirical or ‘known’ early tetrapod data. We investigate three scenarios related to the CRC. The first (scenario A; Fig. 1a) performs simulations on a ‘pristine’ landscape with no habitat fragmentation (that is, the CRC was absent from this scenario). The second (scenario B; Fig. 1a) models the effect of the CRC as random habitat loss across the landscape. The final scenario (C; Fig. 1a) models the effect of the CRC as a loss in habitat in which ‘habitat islands’ remained around the localities where early tetrapods occurred. We use these spatially explicit neutral simulations to examine the extent to which the empirical fossil record, given its sampling bias, can infer global patterns of diversity change. We estimate trends in tetrapod diversity over time, under a neutral scenario, by simulating our best-fitting models again with constant temporal sampling. This study is, to the best of our knowledge, the first to apply a fully spatially explicit neutral model to empirical fossil data.
Results
Neutral models incorporating temporal and spatial biases
In the simplest scenario, we performed simulations on a pristine global landscape with no habitat fragmentation (scenario A; Fig. 1). We sampled diversity patterns from the simulations at the same palaeo-locations and to approximately the same intensity as the real fossil record. Results from the simulations with optimal fixed parameters matched the empirical fossil record well overall, with 80–85% mean accuracy across four diversity metrics: alpha diversity across all localities, mean alpha diversity, beta diversity and gamma diversity (Methods). The simulations could not, however, reproduce alpha, beta and gamma diversity well with the same fixed set of parameters. When compared with the empirical fossil record, the neutral models of early Permian communities with optimized fixed parameters predicted more species (higher gamma diversity), and higher alpha diversity than seen empirically (Fig. 2). Despite these differences, the majority of the empirical values are within the range of variation between simulations, with a mean accuracy across all metrics of 81.3% for amniotes and 82.1% for amphibians.
Neutral models under habitat fragmentation
To test the hypothesis that fragmentation of the rainforest at the end of the Carboniferous promoted the development of endemism among early tetrapod communities, we modelled two scenarios of habitat loss and fragmentation occurring from 307 Ma onwards: first, a random pattern of habitat loss (scenario B; Fig. 1) and second, a clustered pattern of habitat loss (scenario C; Fig. 1). The random habitat loss scenario (B) maintains connectivity across the landscape as habitat is lost. The clustered habitat scenario (C) leaves isolated habitat ‘islands’ that may promote endemism over geological timescales. These habitat ‘islands’ are conceptually analogous to the oceanic islands in MacArthur and Wilson’s theory of island biogeography4. Endemic species may thus arise naturally on such islands within neutral simulations. Scenario C directly tests the mechanistic assumption of Sahney et al.3 (that endemism, driven by fragmentation and manifesting as increasing beta diversity, is the cause of tetrapod diversity increases post-CRC).
Our models of random habitat loss (scenario B) demonstrate that increasing the amount of habitat loss, while keeping all other parameters the same, causes ‘global’ species richness and beta diversity to decline (Fig. 3 and Extended Data Fig. 1). Species richness decreases relatively linearly across all time periods. However, alpha and beta diversity demonstrate a more variable pattern across time for different levels of habitat loss. In particular, the interval between 307 and 297 Ma has very similar alpha diversity for all levels of habitat loss. This is potentially caused by the lower numbers of sampled fossils found at this time (Extended Data Fig. 2), because under poor sampling, inferred alpha diversity will be impacted primarily by the number of sampled individuals, rather than by other factors such as the quantity of surrounding habitat.
Our clustered habitat (scenario C) tested whether neutral theory supports the hypothesis that habitat loss results in highly disconnected habitat islands that promote endemism. Under these circumstances, unless the fossil localities were close, dispersal between distinct fossil localities was restricted almost entirely, meaning that the number of shared species between localities was likely to be very low. The neutral simulations of the clustered habitat scenario generated diversity patterns that did not closely fit the empirical fossil data (Fig. 4). Although the overall trend matches to some extent, the simulations had a high level of variability between intervals, primarily dictated by the number of fossil localities known for each interval. Furthermore, loss of habitat, and the resulting decrease in the size of the metacommunity supplying individuals to the fossil sites, caused a reduction in species richness. Similarly, there was also a reduction in alpha diversity, particularly for amniotes.
By simulating this same best-fitting scenario (20% random habitat loss for amniotes and a pristine landscape for amphibians), but sampling more individuals at each locality, it is possible to predict the broader diversity changes under the same model beyond the empirical sample size. When ten times more individuals are sampled from each fossil locality, differences emerge compared with simulations in which sampling of the fossil record is exactly matched (Fig. 5). The general trends in species richness over time for both amniotes and amphibians are roughly similar to the trends observed in the fossil record (Fig. 5). However, there is no longer a significant increase in beta diversity post-CRC, especially for amphibians. Likewise, alpha diversity is relatively consistent over time. There is also a broader range in the simulation outcomes where many more individuals are sampled.
To remove temporal variation in sampling intensity (but retain spatial sampling structure), we also simulated a model version with constant sampling effort over time. When 100 individuals are randomly selected from each time slice, in the same spatial arrangement as the empirically sampled localities, the trend in species richness (gamma diversity) over time tracks the changes in global diversity (Fig. 6). The simulated patterns in diversity where sampling effort is standardized bears only limited resemblance to the real fossil record together with its sampling biases; it matches the general trend only for beta diversity.
Discussion
This work shows that the apparent increases in face-value diversity observed in the fossil record of early tetrapods across the late Carboniferous/early Permian can be explained by a simple mechanistic neutral model that accounts for biases in sampling. However, there does appear to be a small but observable change in the characteristics of early tetrapod diversity around 307 Ma, the approximate timing of the CRC. This can be explained by either changes in dispersal, changes in density of individual organisms (Extended Data Figs. 3 and 4) or fragmentation of habitats (which is theoretically similar to a reduction in species diversity; Fig. 3). These findings support the previous assessment that patterns of diversity in the early tetrapod fossil record should not be interpreted at face value6.
The model scenario of rainforest fragmentation that is most consistent with the empirical (face-value) fossil data is one in which the global density of individual early tetrapod organisms decreases by a small amount at 307 Ma (Fig. 3). When sampling the simulations in a realistic manner, this results in a temporary dip in the face-value gamma diversity and beta diversity of amphibians around the time of the CRC. By contrast, amniotes show an increase in both face-value beta and gamma diversity, suggesting a potential role of endemism, although this does not have much effect until 10 million years after the CRC. Under this scenario, simulated face-value gamma diversity losses during the CRC are even greater than those observed at face value in the fossil record (after accounting for the changes in sampling effort over time).
When many more individuals were sampled from the same simulation models, the emergent diversity patterns changed considerably because so much more of the underlying system is revealed (Fig. 5). For example, a larger sample may not uncover much more species-level diversity, suggesting that the already present species dominate with large abundances. This sensitivity to sampling suggests that the temporal changes in alpha and beta diversity found in the fossil record may disappear as more fossils are found. This shows that the effect of sampling bias can be mitigated to an extent by more intense sampling, even if the additional sampling is equally biased. When the same number of individuals are sampled from each point in time within our simulations, the trends in species richness and alpha diversity disappear to an extent (Fig. 6). This, again, suggests that the face-value patterns in the fossil record are an artefact of changes in the number of locations sampled within each time interval. The development of endemism does happen, as can be seen from increasing beta diversity following the CRC, for both amniotes and amphibians (Fig. 6). However, it is not enough to offset the alpha diversity decrease from habitat loss, suggesting that the effects of endemism often do not increase gamma diversity22 and in fact there is a small decrease in gamma diversity after the CRC, probably in response to habitat loss.
Taken together, our results suggest that endemism from habitat loss at the CRC would have probably led to a net decrease in gamma species richness, and not an increase as has been claimed previously by Sahney et al.3. After accounting for sampling bias, the limited changes to global richness are primarily driven by a modest reduction in global tetrapod population density over time, which is consistent with the expected ecological impact of the collapse of the rainforests and drying of the climate. The simulated scenario that aligns best with the empirical, face-value patterns is that of random habitat loss of between 0% and 20%, a scenario that is dynamically identical under neutral theory to an equivalent reduction in density17.
Our models used relatively abstract patterns of habitat loss, because the real patterns are not known. Future research could attempt to produce more realistic patterns of rainforest habitat loss, based on either palaeoclimate reconstructions or comprehensive occurrence data for fossil plants. Integrating more accurate maps of tropical rainforest coverage over time with the mechanistic basis of neutral theory would be more informative for exploring theories of diversity generation following the CRC. This is not currently possible because of the absence of readily available palaeoclimate reconstructions for this particular time interval, and the challenges associated with building comprehensive, spatially explicit, occurrence-based dataset for fossil plants. It is not immediately clear how one would relate forest patterns to the dynamics of early tetrapod diversity because amphibians and reptiles (both modern and extinct) exhibit broad variability in their dependency on forest cover. One immediate pattern of rainforest loss that could be incorporated into future related work, with the addition of empirical data, is the hypothesis that the rainforest disappearance began in western Pangaea before moving eastwards23. Another key consideration for future research is deciphering the influence of hierarchical spatial scaling on the patterns recovered here; alpha, beta, and gamma diversity are ultimately nested and changes at the community scale can be reflected at larger scales24,25.
The neutral models explored here assumed that abundances (population densities) were consistent over time (that is, the same number of individual organisms exist in each unit of habitat), except in the case of habitat loss at the CRC. However, the abundance of early tetrapods would also have a significant effect on the numbers of specimens preserved in the fossil record. Consequently, lower numbers of fossil specimens could be indicative of smaller populations and lower species richness. However, it is difficult to satisfactorily resolve the relationship between density and sampling rate because the nature of fossil preservation varies substantially over time, space and environments. Across our dataset of late Carboniferous and early Permian tetrapods, quality of preservation (and thus the size of the ‘taphonomic window’) varies substantially, which in turn influences sampling intensity (Extended Data Fig. 5). Fossil localities of late Carboniferous age that have yielded particularly well-preserved or abundant specimens are typically coal deposits26,27 (for example, coal mines at Nyrañy in the Czech Republic and Linton Diamond Mine in Ohio, United States). In the early Permian, owing to the combination of orogenic activity (mountain building) and drier climatic conditions, fossils are much less likely to be preserved in coal deposits. Instead, many richly diverse localities in the early Permian are the remains of terrestrial environments such as floodplains, river systems and even caves28, many of which have been quarried and excavated extensively over many decades (for example, various localities in the Red Beds of Texas and Oklahoma, United States). This lack of coal deposits in the early Permian also reflects the contraction of rainforest habitats across this interval, invoking the common-cause hypothesis, which states that the covariation of fossil and rock records is due to an external factor29,30. Similarly, the disappearance of coal deposits might simultaneously affect taphonomic windows and true underlying biodiversity driven by the loss of rainforest habitat. Because of these temporal changes in preservation, it is impossible to infer true densities of early tetrapods during this interval (and probably any interval in the geological past). This limitation motivated keeping density as a free parameter within the neutral simulations, but precludes understanding of how both early tetrapod densities and preservation rates varied. Unravelling the true historical changes would require a better understanding of both the true densities of early tetrapods over time and changes to the preservation rates over time (one of the measures that is possible to estimate for species within assemblages).
Our explanations of the changes in early tetrapod diversity through time have all been based on ecological neutral theory. Alternative explanations could come from changes in non-neutral dynamics, such as species niche structure, competition between species or wider ecosystem-level shifts. These explanations cannot yet be tested from a mechanistic basis but represent an exciting avenue of future research, as do investigations of the minimal requirements for these models to have stronger predictive power.
Conclusions
Statistical approaches to estimating past biodiversity patterns can provide important insights into patterns of diversity6,22,31. However, they are generally limited by the geographical and temporal extent of the available fossil occurrence data. In our study, spatially explicit neutral models have proven to be a valuable tool for directly testing established hypotheses of diversity change in the first vertebrates to emerge onto land, and illuminating the impacts of spatial and temporal sampling biases on their face-value diversity patterns.
Interdisciplinary studies integrating modern ecological theory with palaeontological data have been identified as crucial for informing predictions for future diversity32,33 as well as more accurately understanding past biodiversity patterns34–37. Our results shed new light on the impact of the CRC on early tetrapod diversity, by showing that increased endemism resulting from habitat loss at the CRC is unlikely to have produced an increase in biodiversity. Our study also offers new insights into the effects of sampling bias on fossil diversity estimates, and demonstrates the huge untapped potential that mechanistic models, such as those founded on neutral theory, have for testing hypotheses of deep-time biodiversity change.
Methods
Neutral models
Assessment of spatial and temporal biases on a mechanistic basis requires a model that is spatially and temporally explicit. In addition, to study the impact of habitat loss and fragmentation on biodiversity requires a model that can directly incorporate these dynamics within the biodiversity-generating process. Neutral models fulfil all these requirements and are also tractable at large scales. Neutral theory38 assumes that the properties of an individual are independent of its species identity. The dynamics of neutral models are thus dictated by some combination of dispersal, ecological drift and speciation. The output of neutral models is a simulated ecological community, where each individual has an assigned species identity. These simulated communities are equivalent to a complete census of the simulated area. The communities provide a baseline for expected biodiversity under ‘idealized’ conditions39, against which the biodiversity from real communities can be compared. Neutral theory has, however, only rarely been applied in analyses of fossil data. A few palaeoecological studies have used spatially implicit neutral theory16,20,21 where populations (for example, within separate continents) are divided to roughly represent spatial barriers. To the best of our knowledge, however, no previous study has applied a fully spatially explicit neutral model to fossil data.
The classic, spatially implicit model38 conceives of a local community connected to a metacommunity by immigration at a given rate; other models incorporate more explicit dispersal between parts of the landscape. Being based on fundamental biological mechanisms, neutral theory has utility for identifying underlying dynamics40, acting as a null or ‘ideal’ model39, or making predictions at broader spatial or temporal scales than are possible with field experiments41. We use a spatially explicit neutral model42 that incorporates the exact locations of each individual in space and incorporates a dispersal kernel to describe the distance moved by offspring from their parents. Such a fully spatially explicit model is essential to account for spatial sampling bias. The metacommunity concept of the spatially implicit model is replaced by movement around a broad spatially explicit landscape.
The mechanism of our model proceeds as follows: an individual is first chosen to die leaving a ‘space’ that will be filled by a newborn individual. The parent of the newborn individual is chosen from other nearby individuals according to a dispersal kernel, which we modelled as a two-dimensional normal distribution. The newborn is normally conspecific to its parent, but occasionally, with probability ν at each birth, it becomes a new species. Over many generations, nearby individuals are more likely to be the same species, whereas distant individuals will be more likely to be different. We use these models to generate communities of species across the landscape.
A major development for neutral theory was backwards-time coalescence methods43, which produce equivalent results to a naïve (forwards-time) implementation of the mechanisms described above but are many orders of magnitude faster in computational performance. Furthermore, many scenarios are made possible with coalescence that are not possible otherwise, such as exceedingly large or infinite landscapes42 or sampling a small subset of individuals from the landscape without having to simulate the entire landscape first. The latter feature means that our models can simulate observations at just the precise locations observed in the fossil record, while accounting mechanistically for the whole community alive at the time with a full spatial structure from the relevant period in history. An equivalent model using forwards-time techniques would require simulating every tetrapod that existed across the entire time frame and continent of interest, a feat not remotely feasible with current computational power. Unfortunately, most non-neutral models cannot benefit from the use of coalescence and associated abilities to account for sampling in huge spatially explicit systems. We use the pycoalescence package available for Python and R17, which uses coalescence methods implemented in C++ for high-performance spatially explicit neutral simulations.
Preparation of fossil occurrence data
Data detailing the global occurrences of early tetrapod species from the late Carboniferous (Bashkirian) to early Permian (Kungurian) were downloaded from the Paleobiology Database (www.paleobiodb.org). These data represent the published knowledge on the global occurrences of early tetrapod species alongside taxonomic opinions; it is the result of a concerted effort to document the Palaeozoic terrestrial tetrapod fossil record. The dataset was cleaned by removing marine taxa, ichnotaxa and taxa with uncertain taxonomic identifications. The total number of amniote (including Reptiliomorpha (Table 1)) and amphibian (non-amniotes and early tetrapodomorphs (Table 1)) species per locality was ascertained and recorded (Extended Data Fig. 5). The resulting dataset (Supplementary information) details the number of amniote and amphibian species found at each locality (a ‘collection’ in Paleobiology Database terms) during each of the eight stratigraphic intervals from the Bashkirian to the Kungurian.
Table 1.
Term | Definition |
---|---|
Neutral theory | Related to the study of neutral models, these are individual-based models where the fate of individuals (chances of survival, movement and reproduction) is unconnected with species identity. |
Alpha diversity | The richness (number of taxonomic groups) at different sites or habitats within a local scale. Also referred to as ‘local richness’. In this study, we look at species richness. Alpha diversity at a given time point is calculated here as the number of species per locality, as a mean across all localities at that time point. |
Gamma diversity | The total diversity (number of taxonomic groups) across all communities within a larger region. Also referred to as ‘global diversity’. In this study, we consider species-level diversity and calculate gamma diversity as the total number of species across all localities at a given time point. |
Beta diversity | The ratio between alpha (local) and gamma (global) diversity given by the equation . Beta diversity quantifies the difference between communities in the region. Increased beta diversity means increased turnover between localities. Beta diversity can sometimes be defined differently from this, but it always aims to capture turnover of diversity. |
Face value | Face-value (or ‘raw’ or ‘uncorrected’) alpha, beta or gamma diversity is the measured diversity seen in a possibly biased sample. By contrast, the true alpha, beta or gamma diversity corresponds to what is really present or what is seen from an unbiased sample. |
Amniote | Tetrapod species belonging to the clade Reptiliomorpha, which includes the crown group Amniota and those species more closely related to them than to modern amphibians. |
Amphibian | Non-amniote tetrapod species including early Tetrapodpodomorpha, non-amniote tetrapodomorph species including Lepospondyli and Temnospondyli. |
Neutral simulations of early tetrapods
We split the tetrapods into amphibians and amniotes to reflect their differing physiologies and environmental preferences, treating each with an independent neutral model. Our simulations required maps of the relative density of individuals across the globe. These were determined separately for each interval (Bashkirian to Kungurian) from the continental boundaries of the time. Global rasterized maps of individual relative densities were produced at 0.01-degree resolution using the continental extents provided by the Paleobiology Database based on GPlates palaeogeographical reconstructions44. This corresponds to pixels of around 1 km2 each representing a cell for our model. The palaeocoordinates of each fossil locality were calculated and localities were then aggregated within each 1 km2 cell. Specimen counts per locality were estimated using the ‘occurrences-squared’ heuristic45, calculated simply as the square of the number of unique fossil occurrences. This metric provides a basic way of accounting for the fact that most localities lack information about counts of specimens, and because it is rarely obvious how many distinct individuals contributed to a set of fossil fragments. Using this metric in our models approximates the total number of individuals that contributed to the observed fossil record and therefore the number of individuals that should be sampled in the neutral simulations. This generated a ‘sample map’ defining the number of individuals to be sampled at each position in space. Because the majority of the globe was not sampled, most cells in this sample map were set to 0. The relative density and the sample maps together contain the spatial information of the entire global community of amphibians and amniotes for the simulation and define which individuals from each global community were sampled.
The second parameter critical for the simulations is the dispersal rate (σ), which controls the distance that individuals disperse across the landscape in a given generation. σ is used as the variance in a Rayleigh distribution determining the radius of dispersal, with a separate uniform random number determining the direction of dispersal. This means that larger values of σ correspond with longer dispersal distances, on average.
The eight stratigraphic intervals sampled from the fossil record were sufficiently far apart in time that we reasonably assumed no shared species between the different time intervals within the model. Consequently, we ran simulations for each time interval as separate neutral models in parallel, and aggregated the communities post-simulation.
We performed simulations with parameters encompassing a broad range of biologically feasible values: density values for habitat cells ranged from 25 to 1,000 individuals per km for ‘habitat’ regions (non-habitat regions have a density of 0 individuals), the parameter of dispersal (σ) varied to give mean distances of 0.1–14 km, and speciation rates varied from 10−8 to 10−1. We explored 5 density and 5 dispersal parameters giving 25 combinations using Latin hypercube sampling46 to evenly sample from arithmetic parameter space. Under coalescence methods, higher speciation rates can be applied post-simulation for generating communities17,43. We performed simulations using a minimum speciation rate of 10–8 and applied all other speciation rates afterwards to generate additional communities.
Three broad scenarios of tetrapod diversity were simulated (Fig. 1). In all models, the global landscape was restricted by continental boundaries. Our simplest model (scenario A) contained pristine habitat with no habitat loss (that is, uniform, with no habitat fragmentation) Two scenarios (B and C) exhibited habitat loss of different forms following the CRC. The landscape was fragmented according to a random spatial pattern, so that land areas contained habitat on a percentage of their area (either 20%, 40% or 80% of habitat remaining). The random pattern was generated by randomly removing pixels from the landscape until the desired percentage of habitat remains.
Model parameterization
To determine how well the simulations fit the patterns in the fossil record, four biodiversity metrics were used for each interval: the alpha diversity (ɑ) for each fossil locality (that is, the local species richness), the mean alpha diversity across all localities, the total species richness across all localities (γ) and the mean beta diversity (calculated as ) across all localities. The mean actual percentage error between the real and simulated fossil records in alpha diversity for each locality was averaged to get a mean alpha accuracy μα. The mean actual percentage error between the real and simulated fossil records was calculated for each other metric (α, β and γ). Averaging the mean actual percentage errors for the four metrics (μα, α, β and γ) gives an indication of the goodness of fit for one simulation—we refer to this percentage as the accuracy of a single simulation. There is some redundancy between the values because the parameters are not independent, but the approach should still result in the simulation that most closely matches the real fossil record.
Because each interval was run as a separate neutral simulation, the parameters of speciation rate, density and dispersal could be allowed to vary over time. However, because combinations of parameters can be aggregated in any number of ways, we considered just two possibilities that reflected our assumptions of the possible ecological changes over time: either there was no change in these parameters (we use a single parameter set for all intervals); or the parameters could change at the time of the CRC (we use two parameter sets, one for pre-CRC (323–307 Ma) and one for post-CRC (307–372 Ma)). The first scenario represents a neutral ecosystem with no changes in fundamental ecological dynamics. The second presents a neutral scenario that assumes ecological changes were generated by the CRC and may be reflected in neutral dynamics. In some tests a single set of parameters (speciation rate, dispersal and density) was used for all time intervals, whereas in others this requirement was relaxed to investigate how the parameters themselves may change over time.
Upscaling and downscaling simulated communities
To explore potential biodiversity patterns that would emerge if the fossil record included a larger number of individuals, we ran simulations with the same model parameters as the best-fitting simulations, including the same number of simulated individuals, but reporting back on ten times more sampled individuals (sampled with replacement). This scenario demonstrates how the emergent biodiversity patterns change with the overall intensity of sampling effort in isolation from other factors. We also explored the effect of sampling a fixed number of individuals from each time interval, for comparison with sampling different numbers of individuals from each time interval in line with the temporal changes in sampling intensity present in the fossil record. This samples from the simulation without sampling-intensity biases over time but retains the sampling-intensity biases over space matching the real-world spatial sampling pattern. It enables us to demonstrate the effect of temporal sampling biases in isolation from other factors.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank all contributors to the Paleobiology Database, in particular T. Liebrecht, R. Whatley, J. Dummasch, J. Alroy and M. Carrano. E.M.D. thanks T. Dunkley-Jones and P. Mannion for helpful comments and discussion. E.M.D., R.A.C. and R.J.B. were funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement 637483 (European Research Council Starting Grant TERRA to R.J.B.), and also through a Leverhulme Research Project Grant (RPG-2019-365 to R.J.B.). R.A.C. was also funded by a Royal Society University Research Fellowship (RF\ERE\210396). S.E.D.T. was funded by the Imperial–National University of Singapore Joint PhD Scholarship. J.R. was funded by a Natural Environment Research Council fellowship (NE/L011611/1) and a Leverhulme Trust Research Fellowship (RF-2022-497). Through J.R. and S.E.D.T., this study is an output of the Georgina Mace Centre for the Living Planet at Imperial College London. All simulations were performed on high-throughput computing systems at Imperial College London. For the purpose of open access, the authors have applied a ‘Creative Commons Attribution' (CC BY) licence to any author accepted manuscript version arising. This is Paleobiology Database official publication number 454.
Extended data
Author contributions
R.A.C. and J.R. conceived the project and all authors input into the design. E.M.D. and S.E.D.T. curated the data, conducted the analyses, prepared the figures and led the writing of the paper. All authors contributed to collating material for the supplementary information, and to the writing and approval of the final paper.
Peer review
Peer review information
Nature Ecology & Evolution thanks Lauren Sallan and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
All relevant data supporting our analyses are available in the OSF repository: 10.17605/OSF.IO/ZGHWB.
Code availability
Code for downloading and cleaning fossil occurrence data from the Paleobiology Database is available at: https://github.com/emmadunne/neutral_theory_tetrapods. Code for running simulations using pycoalescence is available at: https://github.com/thompsonsed/palaeo_neutral_sims. Both are accessible through the OSF repository: 10.17605/OSF.IO/ZGHWB
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Emma M. Dunne, Samuel E. D. Thompson.
These authors jointly supervised this work: Richard J. Butler, James Rosindell, Roger A. Close.
Extended data
is available for this paper at 10.1038/s41559-023-02128-3.
Supplementary information
The online version contains supplementary material available at 10.1038/s41559-023-02128-3.
References
- 1.Uhl D, Cleal CJ. Late Carboniferous vegetation change in lowland and intramontane basins in Germany. Int. J. Coal Geol. 2010;83:318–328. doi: 10.1016/j.coal.2009.07.007. [DOI] [Google Scholar]
- 2.Cleal CJ, et al. Plant biodiversity changes in Carboniferous tropical wetlands. Earth Sci. Rev. 2012;114:124–155. doi: 10.1016/j.earscirev.2012.05.004. [DOI] [Google Scholar]
- 3.Sahney S, Benton MJ, Falcon-Lang HJ. Rainforest collapse triggered Carboniferous tetrapod diversification in Euramerica. Geology. 2010;38:1079–1082. doi: 10.1130/G31182.1. [DOI] [Google Scholar]
- 4.Macarthur, R. H. & Wilson, E. O. The Theory of Island Biogeography (Princeton Univ. Press, 1967).
- 5.Brocklehurst N, Dunne EM, Cashmore DD, Frӧbisch J. Physical and environmental drivers of Paleozoic tetrapod dispersal across Pangaea. Nat. Commun. 2018;9:5216. doi: 10.1038/s41467-018-07623-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Dunne EM, et al. Diversity change during the rise of tetrapods and the impact of the ‘Carboniferous rainforest collapse’. Proc. R. Soc. B. 2018;285:20172730. doi: 10.1098/rspb.2017.2730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Clack JA, et al. Phylogenetic and environmental context of a Tournaisian tetrapod fauna. Nat. Ecol. Evol. 2016;1:2. doi: 10.1038/s41559-016-0002. [DOI] [PubMed] [Google Scholar]
- 8.Alroy J. On four measures of taxonomic richness. Paleobiology. 2020;46:158–175. doi: 10.1017/pab.2019.40. [DOI] [Google Scholar]
- 9.Close RA, Evers SW, Alroy J, Butler RJ. How should we estimate diversity in the fossil record? Testing richness estimators using sampling-standardised discovery curves. Methods Ecol. Evol. 2018;9:1386–1400. doi: 10.1111/2041-210X.12987. [DOI] [Google Scholar]
- 10.Benson RBJ, Upchurch P. Diversity trends in the establishment of terrestrial vertebrate ecosystems: interactions between spatial and temporal sampling biases. Geology. 2013;41:43–46. doi: 10.1130/G33543.1. [DOI] [Google Scholar]
- 11.Benton MJ, Ruta M, Dunhill AM, Sakamoto M. The first half of tetrapod evolution, sampling proxies, and fossil record quality. Palaeogeogr. Palaeoclimatol. Palaeoecol. 2013;372:18–41. doi: 10.1016/j.palaeo.2012.09.005. [DOI] [Google Scholar]
- 12.Pearson MR, Benson RBJ, Upchurch P, Fröbisch J, Kammerer CF. Reconstructing the diversity of early terrestrial herbivorous tetrapods. Palaeogeogr. Palaeoclimatol. Palaeoecol. 2013;372:42–49. doi: 10.1016/j.palaeo.2012.11.008. [DOI] [Google Scholar]
- 13.Brocklehurst N, Day MO, Rubidge BS, Fröbisch J. Olson’s Extinction and the latitudinal biodiversity gradient of tetrapods in the Permian. Proc. R. Soc. B. 2017;284:20170231. doi: 10.1098/rspb.2017.0231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Pardo JD, Small BJ, Milner AR, Huttenlocker AK. Carboniferous–Permian climate change constrained early land vertebrate radiations. Nat. Ecol. Evol. 2019;3:200–206. doi: 10.1038/s41559-018-0776-z. [DOI] [PubMed] [Google Scholar]
- 15.Brocklehurst N. Olson’s Gap or Olson’s Extinction? A Bayesian tip-dating approach to resolving stratigraphic uncertainty. Proc. R. Soc. B. 2020;287:20200154. doi: 10.1098/rspb.2020.0154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Holland SM. Diversity and tectonics: predictions from neutral theory. Paleobiology. 2018;44:219–236. doi: 10.1017/pab.2018.2. [DOI] [Google Scholar]
- 17.Thompson SED, Chisholm RA, Rosindell J. pycoalescence and rcoalescence: packages for simulating spatially explicit neutral models of biodiversity. Methods Ecol. Evol. 2020;11:1237–1246. doi: 10.1111/2041-210X.13451. [DOI] [Google Scholar]
- 18.Thompson SED, Chisholm RA, Rosindell J. Characterising extinction debt following habitat fragmentation using neutral theory. Ecol. Lett. 2019;22:2087–2096. doi: 10.1111/ele.13398. [DOI] [PubMed] [Google Scholar]
- 19.Brocklehurst N. A simulation-based examination of residual diversity estimates as a method of correcting for sampling bias. Palaeontol. Electronica. 2015;18:1–15. [Google Scholar]
- 20.Holland SM, Sclafani JA. Phanerozoic diversity and neutral theory. Paleobiology. 2015;41:369–376. doi: 10.1017/pab.2015.10. [DOI] [Google Scholar]
- 21.Jordan SMR, Barraclough TG, Rosindell J. Quantifying the effects of the break up of Pangaea on global terrestrial diversification with neutral theory. Phil. Trans. R. Soc. B. 2016;371:20150221. doi: 10.1098/rstb.2015.0221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Close RA, et al. The apparent exponential radiation of Phanerozoic land vertebrates is an artefact of spatial sampling biases. Proc. R. Soc. B. 2020;287:20200372. doi: 10.1098/rspb.2020.0372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Cleal CJ, Thomas BA. Palaeozoic tropical rainforests and their effect on global climates: is the past the key to the present? Geobiology. 2005;3:13–31. doi: 10.1111/j.1472-4669.2005.00043.x. [DOI] [Google Scholar]
- 24.Sepkoski JJ. Alpha, beta, or gamma: where does all the diversity go? Paleobiology. 1988;14:221–234. doi: 10.1017/S0094837300011969. [DOI] [PubMed] [Google Scholar]
- 25.Patzkowsky ME. Origin and evolution of regional biotas: a deep-time perspective. Annu. Rev. Earth Planet. Sci. 2017;45:471–495. doi: 10.1146/annurev-earth-060115-012317. [DOI] [Google Scholar]
- 26.Hook RW, Ferm JC, Whittington HB, Morris SC. A depositional model for the Linton tetrapod assemblage (Westphalian D, Upper Carboniferous) and its palaeoenvironmental significance. Phil. Trans. R. Soc. B. 1997;311:101–109. [Google Scholar]
- 27.Ó Gogáin A, et al. Metamorphism as the cause of bone alteration in the Jarrow assemblage (Langsettian, Pennsylvanian) of Ireland. Palaeontology. 2022;65:e12628. doi: 10.1111/pala.12628. [DOI] [Google Scholar]
- 28.MacDougall MJ, Tabor NJ, Woodhead J, Daoust AR, Reisz RR. The unique preservational environment of the Early Permian (Cisuralian) fossiliferous cave deposits of the Richards Spur locality, Oklahoma. Palaeogeogr. Palaeoclimatol. Palaeoecol. 2017;475:1–11. doi: 10.1016/j.palaeo.2017.02.019. [DOI] [Google Scholar]
- 29.Dunhill AM, Hannisdal B, Benton MJ. Disentangling rock record bias and common-cause from redundancy in the British fossil record. Nat. Commun. 2014;5:4818. doi: 10.1038/ncomms5818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Peters SE, Heim NA. Macrostratigraphy and macroevolution in marine environments: testing the common-cause hypothesis. Geol. Soc. Spec. Publ. 2011;358:95–104. doi: 10.1144/SP358.7. [DOI] [Google Scholar]
- 31.Mannion PD, et al. A temperate palaeodiversity peak in Mesozoic dinosaurs and evidence for Late Cretaceous geographical partitioning. Glob. Ecol. Biogeogr. 2012;21:898–908. doi: 10.1111/j.1466-8238.2011.00735.x. [DOI] [Google Scholar]
- 32.Benton MJ. Origins of biodiversity. PLoS Biol. 2016;14:e2000724. doi: 10.1371/journal.pbio.2000724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Barnosky AD, et al. Merging paleobiology with conservation biology to guide the future of terrestrial ecosystems. Science. 2017;355:eaah4787. doi: 10.1126/science.aah4787. [DOI] [PubMed] [Google Scholar]
- 34.Jackson JBC, Johnson KG. Measuring past biodiversity. Science. 2001;293:2401–2404. doi: 10.1126/science.1063789. [DOI] [PubMed] [Google Scholar]
- 35.Willis KJ, Birks HJB. What is natural? The need for a long-term perspective in biodiversity conservation. Science. 2006;314:1261–1265. doi: 10.1126/science.1122667. [DOI] [PubMed] [Google Scholar]
- 36.Bonuso N. Shortening the gap between modern community ecology and evolutionary paleoecology. PALAIOS. 2007;22:455–456. doi: 10.2110/palo.2007.S05. [DOI] [Google Scholar]
- 37.Mayhew PJ, Jenkins GB, Benton TG. A long-term association between global temperature and biodiversity, origination and extinction in the fossil record. Proc. R. Soc. B. 2007;275:47–53. doi: 10.1098/rspb.2007.1302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hubbell, S. P. The Unified Neutral Theory of Biodiversity and Biogeography (MPB-32) (Princeton Univ. Press, 2001).
- 39.Alonso D, Etienne RS, McKane AJ. The merits of neutral theory. Trends Ecol. Evol. 2006;21:451–457. doi: 10.1016/j.tree.2006.03.019. [DOI] [PubMed] [Google Scholar]
- 40.Vergnon R, Dulvy NK, Freckleton RP. Niches versus neutrality: uncovering the drivers of diversity in a species-rich community. Ecol. Lett. 2009;12:1079–1090. doi: 10.1111/j.1461-0248.2009.01364.x. [DOI] [PubMed] [Google Scholar]
- 41.Rahbek C, et al. Predicting continental-scale patterns of bird species richness with spatially explicit models. Proc. R. Soc. B. 2006;274:165–174. doi: 10.1098/rspb.2006.3700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rosindell J, Cornell SJ. Species–area relationships from a spatially explicit neutral model in an infinite landscape. Ecol. Lett. 2007;10:586–595. doi: 10.1111/j.1461-0248.2007.01050.x. [DOI] [PubMed] [Google Scholar]
- 43.Rosindell J, Wong Y, Etienne RS. A coalescence approach to spatial neutral ecology. Ecol. Inform. 2008;3:259–271. doi: 10.1016/j.ecoinf.2008.05.001. [DOI] [Google Scholar]
- 44.Seton M, et al. Global continental and ocean basin reconstructions since 200Ma. Earth Sci. Rev. 2012;113:212–270. doi: 10.1016/j.earscirev.2012.03.002. [DOI] [Google Scholar]
- 45.Alroy J. New methods for quantifying macroevolutionary patterns and processes. Paleobiology. 2000;26:707–733. doi: 10.1666/0094-8373(2000)026<0707:NMFQMP>2.0.CO;2. [DOI] [Google Scholar]
- 46.McKay MD, Beckman RJ, Conover WJ. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics. 1979;21:239–245. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data supporting our analyses are available in the OSF repository: 10.17605/OSF.IO/ZGHWB.
Code for downloading and cleaning fossil occurrence data from the Paleobiology Database is available at: https://github.com/emmadunne/neutral_theory_tetrapods. Code for running simulations using pycoalescence is available at: https://github.com/thompsonsed/palaeo_neutral_sims. Both are accessible through the OSF repository: 10.17605/OSF.IO/ZGHWB