Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2021 Jul 12;19(7):e3001340. doi: 10.1371/journal.pbio.3001340

gen3sis: A general engine for eco-evolutionary simulations of the processes that shape Earth’s biodiversity

Oskar Hagen 1,2,*, Benjamin Flück 1,2, Fabian Fopp 1,2, Juliano S Cabral 3, Florian Hartig 4, Mikael Pontarp 5, Thiago F Rangel 6, Loïc Pellissier 1,2
Editor: Andrew J Tanentzap7
PMCID: PMC8384074  PMID: 34252071

Abstract

Understanding the origins of biodiversity has been an aspiration since the days of early naturalists. The immense complexity of ecological, evolutionary, and spatial processes, however, has made this goal elusive to this day. Computer models serve progress in many scientific fields, but in the fields of macroecology and macroevolution, eco-evolutionary models are comparatively less developed. We present a general, spatially explicit, eco-evolutionary engine with a modular implementation that enables the modeling of multiple macroecological and macroevolutionary processes and feedbacks across representative spatiotemporally dynamic landscapes. Modeled processes can include species’ abiotic tolerances, biotic interactions, dispersal, speciation, and evolution of ecological traits. Commonly observed biodiversity patterns, such as α, β, and γ diversity, species ranges, ecological traits, and phylogenies, emerge as simulations proceed. As an illustration, we examine alternative hypotheses expected to have shaped the latitudinal diversity gradient (LDG) during the Earth’s Cenozoic era. Our exploratory simulations simultaneously produce multiple realistic biodiversity patterns, such as the LDG, current species richness, and range size frequencies, as well as phylogenetic metrics. The model engine is open source and available as an R package, enabling future exploration of various landscapes and biological processes, while outputs can be linked with a variety of empirical biodiversity patterns. This work represents a key toward a numeric, interdisciplinary, and mechanistic understanding of the physical and biological processes that shape Earth’s biodiversity.


This study describes a novel mechanistic engine that predicts a realistic global latitudinal diversity gradient, species richness distribution and phylogenies. This approach is a step towards the interdisciplinary numeric understanding of the physical and biological processes that have shaped Earth’s biodiversity.

Introduction

Ecological and evolutionary processes have created various patterns of diversity in living organisms across the globe [1]. Species richness varies across regions, such as continents [2,3], and along spatial and environmental gradients [4,5], such as latitude [6,7]. These well-known patterns, derived from the observed multitude of life forms on Earth, have intrigued naturalists for centuries [1,8,9] and stimulated the formulation of numerous hypotheses to explain their origin (e.g., [1,6,7,10,1115]). Ecologists and evolutionary biologists have attempted to test and disentangle these hypotheses [16], for example, via models of cladogenesis [17] or correlative spatial analyses [18,19]. However, we are only at the beginning of a mechanistic understanding of the ecological and evolutionary dynamics driving diversity patterns [2023].

The complexity of interacting ecological, evolutionary, and spatial processes limits our ability to formulate, test, and apply the mechanisms forming biodiversity patterns [24,25]. Additionally, multiple processes act and interact with different relative strengths across spatiotemporal scales [20]. Current research suggests that allopatric [22,23,26] and ecological [24] speciation, dispersal [27], and adaptation [28] all act conjointly in interaction with the environment [29,30], producing observed biodiversity patterns [31]. Comprehensive explanations of the origin and dynamics of biodiversity must therefore consider a large number of biological processes and feedbacks [32], including species’ ecological and evolutionary responses to their dynamic abiotic environment, acting on both ecological and evolutionary time scales [20,33]. Consequently, biodiversity patterns can rarely be explained by a single hypothesis, as the expectations of the various contending mechanisms are not clearly asserted [20,34].

A decade ago, a seminal paper by Gotelli and colleagues [35] formulated the goal of developing a “general simulation model for macroecology and macroevolution” (hereafter computer models). Since then, many authors have reiterated this call for a broader use of computer models in biodiversity research [20,36,37], prompting the implementation of several models to explore the emergence of patterns [22,23,38,39]. With computer models, researchers can explore the implications of implemented hypotheses and mechanisms and evaluate whether emerging model outputs are compatible with observations. Several case studies have illustrated the feasibility and usefulness of eco-evolutionary computer models in guiding the interpretation of empirical data [23,26,38,4046]. Moreover, models have reproduced realistic large-scale biodiversity patterns, such as those along latitude [22,39,47], by considering climate and geological dynamics [23,26,38,44] and those related to population isolation, by considering dispersal ability and geographic distance [22,23,26,38,4044]. For example, computer models have been used to examine how oceans’ paleogeography has influenced biodiversity dynamics in marine ecosystems [26,38,43,47]. Despite these recent studies, there is still scope for developing advanced computer models to shed light on the mechanisms underlying biodiversity patterns. In particular, general models that can accommodate and thus contrast several of the hypotheses listed above have utility in our endeavor to better understand and infer the underpinnings of outstanding biodiversity patterns on Earth.

Macroevolutionary studies have highlighted that patterns emerging from simulations are generally sensitive to the mechanisms implemented and to the landscapes upon which mechanisms act [22,26,38,47]. Systematically comparing and exploring the effects of mechanisms and landscapes, however, is often hindered by the lack of flexibility and idiosyncrasies of existing computer models. Most models implement, and thus test, only a limited set of evolutionary processes and hypotheses. Many models are designed for specific and therefore fixed purposes, with spatial and temporal boundaries, ranging from the global [22,26,44] to continental [23] or regional scale [41,42] and from millions of years [38,41,42,47] to thousands of years [22,23]. Moreover, previous eco-evolutionary population models were developed to test a fixed number of mechanisms [22,26,35,38,39,42,44,46,4852] without having generality build in by design. The diverse input and output formats and limited code availability [53], as well as the different algorithmic implementations, have reduced generality, accessibility, and compatibility between hitherto available models.

Biological hypotheses and landscapes should be compared within a common and standardized platform with the modularity required for flexible explorations of multiple landscapes and processes [35]. Increased generality is thus a desirable feature of computer models that aim to explore the mechanisms and landscapes that shape biodiversity in dynamic systems such as rivers [54], oceans [38,43], islands [41,42,55], and mountains [56,57], or across gradients such as latitude [20,22,47]. Inspired by the mechanistic implementation in existing models used to understand the formation of biodiversity gradients [22,23,26,3846], we created a simulation engine that can approximate a variety of biological processes over dynamic landscapes. The model integrates mechanisms including (i) allopatric speciation [26,43,44]; (ii) niche evolution [38,39]; and (iii) competitive interactions [23]. This modeling engine offers the possibility to explore the eco-evolutionary dynamics of lineages under a broad range of biological processes and landscapes within a common framework. Simulated species populations occupy a spatial domain (hereafter site) bounded by a combination of geological, climatic, and ecological factors. The sites occupied by a species define the species’ realized geographic range (hereafter species range) [58]. The engine tracks species populations over time, which can change as a result of dynamic environments, as well as species dispersal ability, ecological interactions, local adaptation, and speciation. The initial species range and the criteria for speciation, dispersal, ecological interactions, and trait evolution are adjustable mechanisms, allowing the integration of a wide range of hypotheses within the model. Given the flexibility of modifying both mechanisms and landscapes, the engine offers a general tool and is thus named the “general engine for eco-evolutionary simulations” (hereafter gen3sis). We highlight the potential of gen3sis to infer the underlying processes behind biodiversity patterns, thus tackling a long-standing topic in evolutionary ecology.

As an illustration of the flexibility and utility of gen3sis, we implement 5 biological hypotheses with dynamic and static landscapes proposed to explain the latitudinal diversity gradient (LDG) [20] and other biodiversity patterns. More specifically, we implement and analyze (i) a null model without ecological interactions, where all terrestrial sites are suitable for all species; (ii) time for species accumulation [5962]; (iii) diversification rates, i.e., depending on temperature [63, 64]; (iv) ecological limits independent of temperature and aridity; and (v) ecological limits dependent on energetic carrying capacity [65,66]. We use this case study to illustrate how simulation results can be compared with multiple empirical biodiversity data, including empirical distribution and phylogenetic patterns of major tetrapod clades (i.e., mammals, birds, amphibians, and reptiles), to inform us about potential mechanisms underlying patterns.

Engine principles and scope

Gen3sis is a modeling engine, developed for formalizing and testing multiple hypotheses about the emergence of biodiversity patterns. The engine simulates the consequences of multiple customizable processes and landscapes responsible for the appearance (speciation) and disappearance (extinction) of species over evolutionary time scales. Speciation and extinction emerge from ecological and evolutionary mechanisms dependent on dispersal, species interactions, trait evolution, and geographic isolation processes. Customizable eco-evolutionary processes, which interact with dynamic landscapes, make it possible to adjust for various macro-eco-evolutionary hypotheses about specific taxonomic groups, ecosystem types, or processes. We made the engine openly available to the research community in an R package to catalyze an interdisciplinary exploration, application, and quantification of the mechanisms behind biodiversity dynamics. The R statistical programming language and environment [67] is widely used for reproducible and open-source research [68,69], and since its origins, it has been used for handling and analyzing spatial data [70]. Gen3sis follows best practices for scientific computing [71], including high modularization; consistent naming, style, and formatting; single and meaningful authoritative representation; automated workflows; version control; continuous integration; and extensive documentation.

Gen3sis operates over a grid-based landscape, either the entire globe or a specific region. The landscape used as input is defined by the shape of the colonizable habitat (e.g., land masses for terrestrial organisms), its environmental properties (e.g., temperature and aridity), and its connectivity to dispersal (e.g., the influence of barriers, such as rivers and oceans for terrestrial organisms). Gen3sis simulates species’ population range dynamics, traits, diversification, and spatial biodiversity patterns in response to geological, biological, and environmental drivers. Using a combined trait-based and biological species concept, gen3sis tracks the creation, dynamics, and extinction of species ranges, which are composed of a set of sites occupied by species populations. Eco-evolutionary dynamics are driven by user-specified landscapes and processes, including ecology, dispersal, speciation, and evolution (Fig 1). Below, we explain the gen3sis inputs, the configurations (including eco-evolutionary processes), and the landscapes defining the computer model, as well as user-defined outputs (Fig 1C–1F).

Fig 1.

Fig 1

Schematic of the main components of the computer model: (A, B) model inputs, including the spatiotemporal landscape objects and the configuration file; (C–F) model outputs, including present and past species ranges, phylogenetic relationships among species, and the ecological traits of species; (G) model engine containing the mechanics; and (H) empirical data applicable for model validation.

Inputs and initialization

Gen3sis has 2 input objects, which define a particular model (Fig 1). These inputs are (i) a dynamic landscape (Fig 1A), which is further divided into environmental variables and distance matrices; and (ii) a configuration (Fig 1B), in which the user can define initial conditions, biological functions, and their parameter values, as well as technical settings for the model core.

Landscape

The landscape objects (Fig 1A) form the spatiotemporal context in which the processes of speciation, dispersal, evolution, and ecology take place. Landscape objects are generated based on temporal sequences of landscapes in the form of raster files, which are summarized in 2 classes. The first landscape class contains (i) the geographic coordinates of the landscape sites; (ii) the corresponding information on which sites are generally suitable for a clade (e.g., land or ocean); and (iii) the environmental conditions (e.g., temperature and aridity). The landscape may be simplified into a single geographic axis (e.g., [72]) for theoretical experiments, or it may consider realistic configurations aimed at reproducing real local or global landscapes [26,73,74]. The second landscape class defines the connectivity of the landscape, used for computing dispersal and consequently isolation of populations. By default, the connection cost between occupied sites is computed for each time step from the gridded landscape data based on haversine geographic distances. This can be modified by a user-defined cost function in order to account for barriers with different strengths (e.g., based on elevation [73], water, or land) or even to facilitate dispersal in specific directions (e.g., to account for currents and river flow directions). The final connection costs are stored as sparse distance matrices [75]. Distance matrices, containing the connection costs, are provided at every time step as either (i) a precomputed full distance matrix, containing all habitable sites in the landscape (faster simulations but more storage required); or (ii) a local distance matrix, computed from neighboring site distances up to a user-defined range limit (slower simulation runs but less storage required).

Configuration

The configuration object (Fig 1B) includes the customizable initialization, observer, speciation, dispersal, evolution, and ecology functions. These 6 functions define a configuration applied in the simulation engine (Table 1). The possibility to customize these functions confers the high flexibility of gen3sis by including a wide range of mechanisms, as illustrated by 5 configurations presented in a case study (Table A in S1 Note). Additionally, the configuration object lists the model settings, including (i) whether a random seed is used, allowing simulation reproducibility; (ii) start and end times of the simulation; (iii) rules about aborting the simulation, including the maximum global or local species number allowed; and (iv) the list of ecological traits considered in the simulation. One or multiple traits can be defined, which should correspond to those used in the ecology function. Moreover, the initialization function creates the ancestor species at the start of the simulation. Users can define the number of ancestor species, their distribution within the ancient landscape and their initial trait values. With the observer function, changes over time in any abiotic or biotic information of the virtual world can be recorded by defining the outputs that are saved at specified time steps. Outputs can be saved and plotted in real time as the model runs. The core biological functions (i.e. speciation, dispersal, evolution, and ecology) are presented below.

Table 1. Presentation of the core functions of speciation, dispersal, ecology, and evolution implemented in gen3sis.

The computation of core functions is customizable in the configuration object. Shown are input objects that are combined to generate updated outputs. The table corresponds to the mechanisms presented in Fig 2B.

Objective Input Computation Output
Speciation
Determines the divergence between geographic clusters of populations within a species; determines cladogenesis. Species divergence matrix; species trait matrix; species abundance matrix; landscape values; distance matrix. Divergence between geographically isolated clusters of populations increases over time, while (re)connected clusters decrease down to zero; speciation happens when the divergence between 2 clusters is above the speciation threshold, but it can also consider trait differences. Updated species divergence matrix; new species if speciation occurred; updated genealogy table.
Dispersal
Determines the colonization of vacant sites. Species trait matrix; species abundance matrix; landscape values; distance matrix. Species disperse according to a unique value or a distribution of dispersal values. Updated species abundance matrix.
Evolution
Determines the change in species traits in each site, anagenesis. Species trait matrix; species abundance matrix; landscape values; geographic clusters; distance matrix. Traits might change for each species in the populations of occupied sites. Updated species trait matrix.
Ecology
Determines the species abundance in each site. Species trait matrix; species abundance matrix; landscape values; genealogy. Change the species abundance, based on landscape environmental values and species co-occurrences; changes species trait values. Updated species abundance matrix.

Core functions and objects

The states of the model runs are updated in discrete time steps. At each time step, the speciation, dispersal, evolution, and ecology functions are executed sequentially (Fig 2). Speciation and extinction emerge from interactions across core functions. For example, speciation events are influenced by the speciation function, as well as by the ecology and dispersal functions that interact in a dynamic landscape, ultimately dictating populations’ geographic isolation. Likewise, global extinctions depend on local extinctions, which decrease the number of inhabited sites until no sites remain inhabited by a species, rendering it extinct. Extinction happens when the occupied sites become uninhabitable and no other suitable sites are within dispersal distance or according to the change in species traits, rendering the species unfit for the environment. Internally, the computer model defines core objects of the simulations: species abundances; species trait values; the species divergence matrix between all populations for each species; and the phylogeny of all species created during the simulation. In the following sections, we describe the core processes in gen3sis, as well as their inputs and outputs. For a summary, see Table 1.

Fig 2. Schematic example of the gen3sis engine simulation cycle of one species’ populations over a landscape evolution example containing highlands (yellow), lowlands (green), and a river acting as a barrier (blue).

Fig 2

(A) Landscape. A time series of landscapes is used as input, with the landscape being updated after every time step of the simulation cycle, i.e., after the ecology process. (B) Model core processes. First, the speciation process determines the divergence between geographic clusters of populations that are not connected and splits the clusters into new species if a threshold is reached. In this illustration, divergence between clusters of the species’ populations was not sufficient to trigger speciation. Second, in the dispersal process, the species spreads within a landscape to reachable new sites. In this illustration, the river limits dispersal. Third, the evolution process can modify the value of the traits in the populations. In this illustration, 2 populations show trait evolution in their ability to cope with the local environment (i.e., red and white populations). Fourth, the ecology process recalculates the abundance of the species in each site based on the abiotic condition and co-occurring species, possibly resulting in local extinctions. In this illustration, the red population was unsuited to the lowlands, while the white population survived in the highlands. Speciation and extinction events emerge from multiple simulation cycles of customizable processes.

Running a simulation in gen3sis consists of the following steps: (i) Read in the configuration object, prepare the output directories, load the initial landscape (Fig 2A), and create the ancestor specie(s) (using the initialization function create_ancestor_species). (ii) Run the main loop over the landscape time steps. At every time step, the engine loads the appropriate landscape, removes all sites that became uninhabitable in the new time step, and executes the core functions as defined by the configuration object (Fig 2B). (iii) At the end of every time step, gen3sis saves the species richness, genealogy, and, if desired, the species, landscape, and other customized observations that are defined in the observer function (e.g., summary statistics and species pattern plots). Core functions are modifiable and can account for a wide range of mechanisms, as illustrated in the case study (S1 and S2 Notes). Conversely, functions can be turned off, for example, in an ecologically neutral model. For a pseudo-code of gen3sis, see S3 Note.

Speciation

Core. The speciation function iterates over every species separately, registers populations’ geographic occupancy (species range), and determines when geographic isolation between population clusters is sufficient to trigger a lineage-splitting event of cladogenesis. A species’ range can be segregated into spatially discontinuous geographic clusters of sites and is determined by multiple other processes. The clustering of occupied sites is based on the species’ dispersal capacity and the landscape connection costs. Over time, disconnected clusters gradually accumulate incompatibility (divergence), analogous to genetic differentiation. Disconnected species population clusters that maintain geographic isolation for a prolonged period of time will result in different species after the differentiation threshold Ϟ is reached (modeling Dobzhansky–Muller incompatibilities [76]). These clusters become 2 or more distinct species, and a divergence matrix reset follows. On the other hand, if geographic clusters come into secondary contact before the speciation occurs, they coalesce and incompatibilities are gradually reduced to zero.

Nonexhaustive modification possibilities. A customizable speciation function can further embrace processes that modulate speciation. Increased divergence values per time step can be constant for all species or change depending on biotic and abiotic conditions, such as faster divergence between species occupying higher temperature sites [64], or they can be dependent on population size [77] or other attributes [78]. The function also takes the ecological traits as input, thus allowing for ecological speciation [24], where speciation depends on the divergence of ecological traits between—but not within—clusters [79].

Dispersal

Core. The dispersal function iterates over all species populations and determines the connectivity between sites and the colonization of new sites in the landscape. Dispersal distances are drawn following a user-defined dispersal function and then compared with the distance between pairs of occupied and unoccupied sites accounting for landscape costs. A unique dispersal value can be used (deterministic connection of sites) or dispersal values can be selected from a specified distribution (stochastic connection of sites). If the dispersal cost between the sites is lower than the dispersal value, the dispersal is successful. If populations from multiple sites of origin manage to reach an unoccupied site, the final colonizer is selected randomly to seed the newly occupied site.

Nonexhaustive modification possibilities. A customizable dispersal function enables the modeling of different dispersal kernels depending on the type of organism considered. Dispersal values can be further linked with the ecology function, e.g., a trade-off with other traits [80] and dispersal versus competitive ability [81], and the evolution function allowing dispersal to evolve, resulting in species with different dispersal abilities [82].

Evolution

Core. The evolution function determines the change in the traits of each population in occupied sites of each species. Traits are defined in the configuration object and can evolve over time for each species’ populations. The function iterates over every population of a species and modifies the trait(s) according to the specified function (e.g., traits related to dispersal, niche, or competition).

Nonexhaustive modification possibilities. A customizable evolution function takes as input the species abundance, species trait, species divergence clusters, and landscape values. In the function, it is possible to define which traits evolve and how they change at each time step. In particular, the frequency and/or amount of change can be made dependent on temperature [83], ecological traits [84], or abundances [85], while the directions of change can follow local optima or various evolutionary models, including Brownian motion [86] and Ornstein–Uhlenbeck [87].

Ecology

Core. The ecology function determines the abundance or presence of populations in occupied sites of each species. Thus, extinction processes derive from ecology function interactions with other processes and landscape dynamics. The function iterates over all occupied sites and updates the species population abundances or presences on the basis of local environmental values, updated co-occurrence patterns, and species traits.

Nonexhaustive modification possibilities. A customizable ecology function takes as input the species abundance, species trait, species divergence and clusters, and the landscape values. Inspired by classic niche theory [10,15,88], the function can account for various niche mechanisms, from simple environmental limits to complex multispecies interactions. It is possible, for example, to include a carrying capacity for the total number of individuals or species [21] or competition between species based on phylogenetic or trait distances [23], based on an interaction currency [89], or further constrained by a functional trade-off [80].

Outputs and comparisons with empirical data

The computer model delivers a wide range of outputs that can be compared with empirical data (Fig 1, Table 2). Gen3sis is therefore suitable for analyzing the links between interacting processes and their multidimensional emergent patterns. By recording the time and origin of all speciation events, as well as trait distributions and abundance throughout evolutionary history, the simulation model records the information required to track the dynamics of species diversity and the shaping of phylogenetic trees. The most common patterns observed and studied by ecologists and evolutionary biologists, including species ranges, abundances, richness, and genealogies, are emergent properties of the modeled processes (Table 2). All internal objects are accessible to the observer function, which is configurable and executed during simulation runs. Gen3sis provides direct simulation outputs in a format ready to be stored, analyzed, and compared with empirical data. Given the flexibility of gen3sis, it is possible to explore not only parameter ranges guided by prior knowledge available for a given taxonomic group, but also a variety of landscape scenarios and mechanisms (Fig 3). Furthermore, to attain generality, validating modeled outputs with multiple empirical patterns is recommended [20,25,35]. Gen3sis generates multiple outputs, which can be compared with empirical data using simulation rankings or acceptance criteria [25,35,90].

Table 2. List of outputs from the gen3sis computer model, both direct and indirect, that can be compared with empirical data.

Direct outputs are the species abundance matrix, species trait matrix, and phylogeny, while indirect outputs result from various combinations of the direct outputs. The computations of indirect outputs rely on other packages available in the R environment [67].

Scale
Pattern Spatial Temporal
I + I +
Metric Example local regional global present past deep past
Alpha diversity (α) Local species richness follows marked spatial gradients, such as along latitude (LDG [91]). Species richness is further correlated across scales when the regional species pool size is positively associated with local species richness (e.g., [4,92]). * * * * * *
Beta diversity (β) Species turnover is marked along both spatial and environmental gradients [93,94] and can display sharp boundaries forming biogeographic domains [95]. * * * * *
Gamma diversity (γ) Regional difference in species richness, e.g., across biogeographic regions with comparable climates, such as the continental temperate region of North America versus Asia [96]. * * * *
Species abundance, frequency, and range Assemblages are generally composed of a few very abundant species and many rare species [97,98]. A few species tend to occupy many sites, while most are very rare and have a restricted range size [99]. * * * * * *
Species ecological niche width distribution Niche width is heterogeneous across species [100,101], and narrow niche width leads to higher speciation [102]. * * * * * *
Trait evolutionary rates Ecological traits and niches generally evolve slowly so that closely related lineages have similar traits and niches, coined as niche conservatism [60]. * * * * * *
Species diversification rates Species diversification rate varies over time and across clades [103105]. * * * * *
Topological and temporal phylogenetic properties Empirical phylogenetic trees typically display a topological signature [106] and have more divided branching over time, with marked prevalence of a recent branching distribution [107]. * * * * *
Phylogenetic alpha (α) and beta (β) diversity Local communities can show either phylogenetic overdispersion or clustering compared with the regional pool [108]; greater geographic distances correspond to increased phylogenetic β diversity across biogeographic barriers [109]; decay in phylogenetic similarity with increasing geographic distance [110]. * * * * *
Functional alpha (α) and beta (β) diversity Local assemblages represent a subset of the regional functional diversity; functional traits show a typical turnover spatially, often along environmental gradients [111]. * * * * * *

LDG, latitudinal diversity gradient.

Fig 3. Schematic representation of the case study showing the model design with 2 landscapes (i.e., L1 and L2, with and without temporal environmental variability) and configurations of 5 models (i.e., M1, M2, M3, M4, and M5; Table A in S1 Note) and model evaluation and testing, based on multiple patterns including LDG, spatial α-diversity, range size distributions, phylogenetic balance, and temporal dynamics of species diversification (nLTT).

Fig 3

Selection criteria were based on empirical data from major tetrapod groups, i.e. mammals, birds, amphibians and reptiles (Table 3). LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

Case study: The emergence of the LDG in the Cenozoic

Context

Expanding from previous studies using mechanistic simulation models [22,23,39], we use a case study to illustrate the flexibility of gen3sis to implement mechanisms that derive from a variety of biological hypotheses, as well as null models to serve as contrast. We present how gen3sis can be used to explore a variety of proposed mechanisms and how to explore parameters, but the case study is not a comprehensive exploration of existing hypotheses and their associated parameters [20].

The LDG is one of Earth’s most iconic biodiversity patterns, but the underlying mechanisms remain largely debated [20,63,64,101,102,112114]. Many hypotheses have been proposed to explain the formation of the LDG [20], and these generally agree that a combination of biological processes and landscape dynamics has shaped the emergence of the LDG [20]. Among the proposed hypotheses, it has been postulated that older and more stable tropical environments have more time for cumulating species and have reduced extinctions, while niche conservatism limits the spread of lineages to more recent colder environments [5962]. Second, higher temperatures in the tropics increase metabolic and mutation rates, which could lead to faster reproductive incompatibilities among populations and higher speciation rates compared with colder environments [63,64]. Third, the tropics are generally more productive than colder environments and greater resource availability can sustain higher abundances, and, therefore, a larger number of species can coexist there [65,66,115,116]. From these hypotheses, we illustrate 5 derivative models in gen3sis: a null model without ecological filtering or trait evolution (M1); a model of trait evolution only considering niche conservatism where trait evolutionary rates are limited (M2); a model where evolutionary rates are proportional to the occupied site temperatures (M3); a model with uniform carrying capacity (M4); and a model where carrying capacity depends on temperature and aridity (M5). We simulated the spread, speciation, dispersal, and extinction of terrestrial organisms over the Cenozoic marked by various shifts in diversification. Finally, we evaluated whether the emerging patterns from these simulated mechanisms correspond to the empirical LDG, phylogenetic tree imbalance, and range size frequencies computed from data of major tetrapod groups, including mammals, birds, amphibians, and reptiles (Fig 3).

Input landscapes

The quality of the outputs of simulation models such as gen3sis hinges on accurate and relevant reconstructions of past environmental conditions [117]. Conditions during the Cenozoic (i.e., 65 Ma until the present) are considered key for the diversification of the current biota [118], and the Cenozoic is the period during which the modern LDG is expected to have been formed [119]. In the Cenozoic, the continents assumed their modern geographic configuration [26]. Climatically, this period was characterized by a general cooling, especially in the Miocene, and ended with the climatic oscillations of the Quaternary [120].

We compiled 2 global paleoenvironmental landscapes (i.e., L1 and L2) for the Cenozoic at 1° and approximately 170 kyr of spatial and temporal resolution, respectively (S1 Note, S1 and S2 Animations). To account for uncertainties in paleoreconstructions on the emerging large-scale biodiversity patterns, we used 2 paleoelevation reconstructions [121,122] associated with 2 approaches for estimating the paleotemperature of sites (S1 Note). L1 had temperatures defined by Köppen bands based on the geographic distribution of lithologic indicators of climate [56]. L2 had temperature defined by a composite of benthic foraminifer isotope records over time [123] and along latitude for specific time periods [124130]. An aridity index ranging from 0 to 1 was computed based on the subtropical arid Köppen zone for both landscapes [56]. Finally, in order to test for the effects of deep-time environmental dynamics, we also ran simulations (i.e., L1.0 and L2.0) in which a constant contemporary landscape was set for the same number of time steps as in L1 and L2 (S1 Note).

We used available paleoelevation models [121,122] and paleoclimate indicators [56,123133] to generate input landscapes to explore the formation of the LDG and account for uncertainties and limitations. Hence, the case study represents an illustration of how gen3sis can handle multiple reconstructions that interact with eco-evolutionary processes in complex ways. Further research in geology and climatology is required to generate more accurate paleolandscapes than those presented here.

Model configurations

We implemented 5 illustrative gen3sis models derived from hypotheses on the emergence of the LDG. The models (i.e., M1, M2, M3, M4, and M5) had distinct speciation and ecological processes and contrast the common idea that time, diversification rates, and ecological limits underpin the LDG (Fig 3, Table A in S1 Note). As a simplified approach for this illustration, all simulations were initiated with one single ancestor species spread over the entire terrestrial surface of the Earth at 65 Ma [134], but initial conditions could also match the ancestral range informed by fossil records [47]. The temperature optimum of each population was initiated to match local site conditions. Since we focused on terrestrial organisms, aquatic sites were considered inhabitable and twice as difficult to cross as terrestrial sites. This approximates the different dispersal limitations imposed by aquatic and terrestrial sites. To compute the full distance matrix, we used haversine geodesic distances.

M1

We applied a null model where all the terrestrial sites were ecologically equivalent. Temperature and aridity thus did not determine the niche of the species. The divergence rate between isolated clusters was kept constant (i.e., +1 for every 170 kyr of isolation). Clusters of populations that accumulated differentiation over time speciated according to a speciation threshold Ϟ. Ecology and trait evolution were turned off, making this null model a baseline with which all the other more complex models can be contrasted.

M2

In the implementation of the niche conservatism, the ecology function defined the species population abundance, where the abundance increased proportionally to the distance between the population temperature niche optimum and the site temperature (S1 Note). The temperature optimum of each population was set to evolve randomly, with a normal distribution following Brownian motion with standard deviation σ2.

M3

In the implementation of the diversification rates, the speciation function applied a temperature-dependent divergence between population clusters [63,64]. Species in warmer environments accumulated divergence between disconnected clusters of populations at a higher rate (S1 Note). The accumulation of divergence was set to be 3 times faster at the warmest sites. The rate of differentiation increase was shaped by the average site temperature of the species clusters to the power of dpower plus a constant. Overall, this created higher speciation rates at warmer than at colder sites (S1 Note, S1 Fig).

M4

In the implementation of the carrying capacity, we applied a model where the total number of individuals was equally limited in each site, as an overall constraint on biotic interactions [135]. Because of resource and space limitations, only a limited number of individuals (k) could coexist within the site. If the sum of all species abundances in a site was above k (modulated by kpower), species abundances were randomly reduced across species until k was reached. This contrasts with the next model M5, where the carrying capacity varied with temperature and aridity. Locally extinct species were the ones with zero individuals after the limit k was applied (S1 Note).

M5

In the implementation of the ecological limits, the ecology function included a carrying capacity k of each site that scaled with area energy (i.e., temperature and aridity) [116,136]. In this model, we assumed that the carrying capacity of the number of individuals at sites scaled with energy, which indirectly also constrained the number of species that could coexist in a given place [21,116]. If the sum of all species abundances in a site was above k (modulated by kpower), species abundances were randomly reduced across species until k was reached, as in M4 (S1 Note).

Exploration of model parameters

We explored the parameter space of each model using Sobol sequences, a quasi-random number generator that samples parameters evenly across the parameter space [137]. We explored parameter ranges by basing upper and lower parameter boundaries on the literature and interactive modeling explorations. In all models, species dispersed following a Weibull distribution with shape ɸ = [2 to 5] and a scale of Ψ = [550 to 850], resulting in most values being around 500 to 1,500 km, with rare large dispersal events above 2,000 km. The explored dispersal distribution parameters ranged in realized mean and 95% quantiles between less than a single cell, i.e., approximately 50 km for a landscape at 4°, and more than the Earth’s diameter, i.e., approximately 12,742 km (S2 Fig). In all models except M1, the evolution function defined the temperature niche optimum to evolve following Brownian motion. The temperature optimum of each population was set to evolve randomly, following a normal distribution in a Brownian motion fashion with standard deviation σ2 = [0.001 to 0.010], corresponding to [±0.1°C to ±1°C] per time step. In all models except M3, species emerged after Ϟ = [6 to 60], corresponding to events occurring after [1 to 10] myr of isolation in the cases where the divergence rate was kept constant. For M3, the differentiation increase with temperature (i.e., 3 times faster at the hottest sites) changed to the power of dpower = [2 to 6] plus a constant (S1 Fig). Temperature niche optima were homogenized per geographic cluster by an abundance-weighted mean after ecological processes happened. Carrying capacities had k values ranging from low to high with a power-law scaling kpower = [1 to 4]. For further details on the simulation model framework, model parameters, initial conditions, paleoenvironmental reconstructions, and landscape modification experiment, see S1 Note and Table A in S1 Note.

For each model (i.e., M1, M2, M3, M4, and M5) in combination with each landscape (i.e., L1 and L2) with and without deep-time environmental dynamics, we ran a full factorial exploration of these parameter ranges at a coarse resolution of 4° (i.e., M1 n = 300, M2 n = 780, M3 n = 1,020, M4 n = 300, M5 n = 780) and compared these to empirical data. Simulations considered further (i) had at least 20 species at the present; (ii) had fewer than 50,000 species; or (iii) had fewer than 10,000 species cohabiting the same site at any point in time (S1 Note). After parameter range exploration, we identified realistic parameters and ran a subset at 1° for high-resolution outputs for illustration (Fig 4). Parameter exploration is illustrative and could be expanded in future research applications.

Fig 4. Illustration of one global simulation of the speciation, dispersal, and extinction of lineages over the Cenozoic, starting with a single ancestor species and imposed energetic carrying capacity (M5 in L1).

Fig 4

We selected the best matching simulation of M5 in L1 at 1° (n = 12) that predicted realistic biodiversity patterns. (A) Images of the Earth land masses through time, used as input for the simulation. (B) Selected emerging patterns: evolutionary dynamics, phylogeny, and present richness. (B.1) Evolutionary dynamics: γ richness (log10 scale) through time (blue line) and diversification rate. (B.2) Phylogeny showing the distribution of the temperature optima for all extant species. (B.3) Present distribution of simulated α biodiversity globally, which indicates locations of biodiversity hotspots. For the empirical match, see S8 Fig. (C) Model correspondence with empirical data on terrestrial mammals, birds, amphibians, and reptiles for the LDG, measured as the standardized and area-scaled mean species number per latitudinal degree. The emerging LDG%loss (i.e., 4.6% of species loss per latitudinal degree) closely matched empirical curves, with good agreement for mammals (Pearson r = 0.6), birds (r = 0.57), amphibians (r = 0.57), and reptiles (r = 0.38) (S1 Note, Figs 4C and S9). Data presented here are available in S1 Data at https://zenodo.org/record/5006413, including selected simulation summary output (phylogeny and richness) and empirical richness used to derive LDG curves. LDG, latitudinal diversity gradient.

Correspondence with empirical data

We compared simulation ability to produce the observed biodiversity patterns using a pattern-oriented modeling (POM) approach [25,90]. POM compares the predictions of each model and parameter combination with a number of diagnostic patterns from empirical observations. In our case, we used the LDG slope and curve, spatial α-biodiversity, range size frequencies, tree imbalance, and macroevolutionary temporal dynamics as diagnostic patterns (Fig 3, S1 Note). The POM approach allows a calibration and model comparison based on high-level diagnostic patterns, avoiding the hurdles of defining explicit (approximate) likelihood functions [138]. The POM approach requires the specification of a range for each pattern under which observation and prediction are accepted, hence when a simulation satisfactorily reproduces empirical observations. Unless POM is coupled with an explicit probabilistic model [138], the limits for acceptance must be decided based on the empirical data distribution [25,90]. In complement to POM, we computed the Bayesian information criteria (BIC), balancing the fit of the model to α-diversity with a penalization for model complexity (S1 Note).

To generate the empirical values for these patterns, we obtained distribution data on 25,941 species [139141], following [142], and phylogenetic data on 18,978 species [5,143146], following [147] for major tetrapod groups, i.e., terrestrial mammals, birds, amphibians, and reptiles (S1 Note). LDG%loss was defined as the percentage of species loss per latitudinal degree and was measured as the slope of a linear regression of normalized species richness against absolute latitude. The β-statistics [31] was used for phylogenetic tree imbalance in ultrametric trees, following [106]. Species range decrease (SRD) in km2 was defined as the percentage of species loss per species range and was measured as the slope of a linear regression of range size distributions. We further compared the mean species number per latitude curve (LDGcurve), normalized lineage though time (nLTT) curves, and α-biodiversity spatial distribution (S1 Note). Empirical values of LDG, β, and SRD were as follows: mammals (LDG = 5.1%, β = −0.4, SRD = 2.3*103%), birds (LDG = 1.5%, β = −1.3, SRD = 6.5*107%), amphibians (LDG = 3.9%, β = −0.7, SRD = 0.11%), and reptiles (LDG = 1.5%, β = −0.8, SRD = 5.3*103%). Based on these values, we used the following acceptance criteria: (i) LDG between 5.4% and 1.1%; (ii) tree shape statistic β between −1.4 and −0.3; (iii) range size frequencies with a decrease in the number of large-range species with a tolerance of 5% [9799]; (iv) correlation of mean species number per latitude with r > 0.4; and (iv) nLTT curve difference < 0.15.

Simulations results

We illustrate how integrating deep-time environmental dynamics and biological processes can help us understand the origin of biodiversity patterns. Simulations including deep-time temporal dynamics systematically showed a best fit to empirical data (S4 Fig). Models M4 and M5 resulted in the best match for most of the empirical patterns individually, and M5 was the only model able to pass all acceptance criteria (Table 3). Although all 5 models were able to reproduce the LDG%loss, M5 was superior in explaining the LDGcurve (S5 and S6 Figs), α-diversity, phylogenetic tree imbalance, and species range size frequencies simultaneously (Table 3). Most simulations of model M5 (67%) resulted in a decrease in species richness at higher latitudes, indicating that the LDG emerged systematically under M5 mechanisms (S8 Fig, Tables B, C, and D in S1 Note). Using the BIC approach, and accounting for model complexity, we found that the models implementing carrying capacities (i.e., M4 and M5) were the only ones significantly superior to the null model M1 when considering α-diversity spatial patterns (S3, S5, and S6 Figs). Finally, we found that the support for M4 and M5 over M1, M2, and M3 was consistent across the 2 alternative landscapes L1 and L2 (S6 and S8 Figs, Table D in S1 Note). We further illustrate the capacity to run high-resolution simulations for a subset of the explored parameters. Increasing the spatial resolution of the simulations (n = 12) resulted in an increase in γ richness and computation time and a slight decrease in the LDG%loss (S7 Fig), which was associated with a disproportionally larger number of sites toward higher latitudes, which, in turn, affected population connectivity and, therefore, speciation rates [148].

Table 3. Model acceptance table with pattern descriptions and details of acceptance derived from empirical data.

Percentages of accepted simulations (for both landscapes) are shown for each model and acceptance parameter and the combination of all acceptance patterns. For details, see S1 Note.

Acceptance M1 M2 M3 M4 M5
Pattern Description and empirical acceptance n = 300 n = 780 n = 1,020 n = 300 n = 780
LDG %loss Percentage of species loss per latitudinal degree from linear regression slope.
Accept LDGs between 5% and 1%
26% 28% 41% 34% 36%
LDG curve Standardized mean species number per latitude correlation between simulated and empirical maximal Pearson correlation.
Accept r > 0.4
21% 28% 43% 38% 60%
α biodiversity Spatial distribution correlation between simulated and empirical maximal Pearson correlation.
Accept r > 0.4
18% 24% 37% 20% 36%
Range Range size distributions.
Accept only distributions that show a consistent frequency decrease toward large-ranged species with a tolerance of 5%
23% 8% 4% 31% 16%
Phytogenic balance The imbalance of a phylogenetic tree is measured by the value that maximizes the likelihood in the β-splitting model [152].
Accept phylogenies with β between −1.4 and −0.3
56% 56% 55% 73% 64%
nLTT Temporal dynamics of species diversification, measured by the differences between empirical and simulated nLTT curves [153].
Accept nLTT differences < 0.15
66% 65% 70% 62% 59%
Combined Simulations passing all criteria above with at least 20 species alive at present time 0% 0% 0% 0% 1%

LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

In order to gain insight into the eco-evolutionary processes leading to the simulated patterns, we quantified the speciation, extinction, and migration rates within and between low (23° 27′ N- 23° 27′ S) and high (66° 33′ N -23° 27′ N; 23° 27′ S-66° 33′ S] latitudes from the best ranking simulations. Speciation and extinction rates were consistently higher at low compared with high latitude (S10 Fig), but speciation was systematically superior to extinction in contributing to the LDG. In contrast, dispersal from low to high latitude was always more frequent than from high to low latitude (Table F in S1 Note), which contributed to attenuation of the LDG. Because diversity was higher in the tropics, species were more likely to move from low to high latitude, corroborating empirical observations [149]. Moreover, our results indicate that an increase in the scaling factor of carrying capacity with energy k resulted in a steeper LDG%loss (Tables B and C in S1 Note), which is in agreement with findings from previous studies [21,63,116,136]. Similarly, increasing the time for divergence consistently led to lower species richness and flattened the LDG slope so that the tropics accumulated diversity more slowly, but changes in speciation rates were less likely to drive large-scale biodiversity patterns [114]. Including a carrying capacity led to a characteristic increase in speciation and extinction rates toward the present, which intensified when temperature and aridity were considered as limiting factors (S10S12 Figs), matching the recent diversification found in empirical data [150].

Synthesis

In accordance with Rangel and colleagues [23], we found that realistic LDG patterns are dependent on species evolutionary responses to environmental dynamics. Rangel and colleagues [23] concluded that LDG patterns in South America are sensitive to the rate of evolutionary adaptation to climatic factors, which are dynamic in time (climate oscillations) and space (topography). However, while in that study [23], the intensity of competition was assumed to be an inverse function of phylogenetic distance; in gen3sis, competition can be modeled directly through traits and carrying capacity, opening up a new pathway for future investigations. In addition, Saupe and colleagues [22] showed that simulations with poor dispersal are better at representing the observed strong LDG in tetrapods. In agreement with their results, our parameter explorations indicated that dispersal correlated negatively with LDG [22], and simulations with lower dispersal parameters agreed better with the data (S1 Note). While previous case studies using computer models have conveyed information on the formation of the LDG [22,23,39], they used a shorter timeframe (i.e., below 1 Ma) and/or explored few mechanisms, i.e., a simplified landscape or a single acceptance criterion [26,38,43,114]. Although our case study was illustrative and we implemented only a small representative fraction of the candidate processes and parameters expected to shape biodiversity patterns, it illustrates how gen3sis can handle multiple interacting eco-evolutionary processes proposed in the literature. Still, it is imperative that future research explore further mechanisms and parameters combinations in order to advance our understanding of the processes behind the emergence of biodiversity.

Although recent studies using realistic landscapes and computer models reproduced biodiversity patterns over a time scale spanning the Quaternary [22,23,39], many speciation and extinction events shaping present diversity patterns date back before the glaciation, and few studies have covered deep-time dynamics [26,38,43,142]. Deep-time landscape reconstructions are still generally lacking but are increasingly becoming available [121,123]. For example, we represented Quaternary climatic oscillation using approximately 170 kyr time steps, which correspond to a coarser temporal scale compared with the frequency of oscillations, and thus do not account for shorter climatic variation effects on diversity patterns [22,23,39]. We also did not consider ice cover, which can mask species’ habitable sites, which might explain mismatches between simulated and empirical LDG patterns below 50° (Fig 4C). Moreover, paleoindicators of climate from Köppen bands have major limitations, and the temperature estimation derived in our case study might suffer from large inaccuracies. Lastly, extrapolation of the current temperature lapse rate along elevation might lead to erroneous estimates, especially in terms of the interaction with air moisture [151], which was not further investigated here. Through interdisciplinary research across the fields of geology, climatology, and biology, we expect that gen3sis will improve our understanding of the shaping of biodiversity across space, time, and complexity.

Discussion

Understanding the emergence of biodiversity patterns requires the consideration of multiple biological processes and abiotic forces that potentially underpin them [20,23,35,36]. We have introduced gen3sis, a modular, spatially explicit, eco-evolutionary simulation engine implemented as an R package, which offers the possibility to explore ecological and macroevolutionary dynamics over changing landscapes. Gen3sis generates commonly observed diversity patterns and, thanks to its flexibility, enables the testing of a broad range of hypotheses (Table 4). It follows the principle of computer models from other fields [154156], where mechanisms are implemented in a controlled numeric environment and emerging patterns can be compared with empirical data [25]. The combination of exploring patterns emerging from models and qualitatively and quantitatively matching their outputs to empirical data should increase our understanding of the processes underlying global biodiversity patterns.

Table 4. A nonexhaustive list of expected applications of gen3sis.

Given the flexibility and the range of outputs produced by the engine, we expect that gen3sis will serve a large range of purposes, from testing a variety of theories and hypotheses to evaluating phylogenetic diversification methods.

Use Examples from Fig 1
Testing phylogenetic inference methods, including diversification rates in phylogeographic reconstructions. Infer diversification rate in gen3sis simulated phylogenies (E) and compare with a known diversification in gen3sis (A, B, and G).
Providing biotic scenarios for past responses to geodynamics. Based on model outputs (C–F) and comparisons with empirical data (H), select plausible models (B).
Testing paleoclimatic and paleotopographic reconstructions using biodiversity data. Based on model outputs (C–F) and comparisons with empirical data (H), select plausible landscape(s) (A).
Comparing expectations of different processes relating to the origin of biodiversity; generating and testing hypotheses. Compare models (A, B, and G) with outputs (C–F) and possibly how well outputs match empirical data (H).
Comparing simulated intraspecific population structure with empirical genetic data. Compare simulated divergence matrices with population genetic data.
Forecasting the response of biodiversity to global changes (e.g., climate or fragmentation). Extrapolate plausible and validated models (A, B, and G) to landscapes under climate change scenarios (A).
Investigating trait evolution through space and time. Combine past and present simulated species traits (F) and distributions (C, D) with fossil and trait data (H).
Modeling complex systems in space and time in unconventional biological contexts in order to investigate eco-evolutionary processes in fields traditionally not relying on biological principles. Model eco-evolutionary mechanisms (A, B, and G) in an unconventional eco-evolutionary context.

Verbal explanations of the main principles underlying the emergence of biodiversity are frequently proposed but are rarely quantified or readily generalized across study systems [20]. We anticipate that gen3sis will be particularly useful for exploring the consequences of mechanisms that so far have mostly been verbally defined. For example, the origins of biodiversity gradients have been associated with a variety of mechanisms [7], but these represent verbal abstractions of biological processes that are difficult to evaluate [20]. Whereas simulation models can always be improved, their formulation implies formalizing process-based abstractions via mechanisms expected to shape the emergent properties of a system [157]. Specifically, when conveying models with gen3sis, decisions regarding the biological processes and landscapes must be formalized in a reproducible fashion. By introducing gen3sis, we encourage a standardization of configuration and landscape objects, which will facilitate future model comparisons. This standardization offers a robust framework for developing, testing, comparing, and applying the mechanisms relevant to biodiversity research. Moreover, modeling eco-evolutionary processes in a flexible platform enables the exploration of how biodiversity statistics may depend on a multitude of different model assumptions and parameter values. This approximates how biodiversity patterns relate to eco-evolutionary processes. Further studies exploring the dependency of summary statistics on model assumptions or parameters are necessary and could be readily assisted by gen3sis.

Studying multiple patterns is a promising approach for disentangling competing hypotheses [20,90]. A wide range of biodiversity dimensions can be simulated with gen3sis (Table 2), which—after appropriate sampling [158]—can be used in a multidimensional comparison with empirical data, i.e., a time series of species abundance matrices and trait matrices, as well as a phylogeny. These output objects are compatible with most R packages used for community or phylogenetic analyses. Hence, the model outputs can be linked to packages computing diversification rates [159], community phylogenetics [160], or functional diversity [161]. The comparison of simulation outputs with empirical data requires a systematic exploration of processes and parameter values when formulating models (e.g., [162]). First, a set of mechanisms and/or a range of reasonable parameter values are explored, e.g., dispersal distances from measurements in a specific clade [163] and/or evolutionary rates [164]. A range of simulation outputs can then be evaluated quantitatively by studying the range of models and parameter values that produce the highest level of agreement with multiple types of empirical data, using, for example, a POM approach [90]. For each model, patterns are evaluated given acceptance criteria (e.g., [42]). A multiscale and multipattern comparison of simulations with empirical data can be completed to evaluate a model’s ability to simultaneously reproduce not only one, but a diverse set of empirical patterns across multiple biodiversity dimensions.

Using an illustrative case study, we have demonstrated the flexibility and utility of gen3sis in modeling multiple eco-evolutionary hypotheses in global paleoenvironmental reconstructions (Figs 3 and 4). Our case study indicates that global biodiversity patterns can be modeled realistically by combining paleoenvironmental reconstructions with eco-evolutionary processes, thus moving beyond pattern description to pattern reproduction [35]. Nevertheless, in our case study, we only implemented a few of the standing LDG hypotheses [20,34]. Multiple macroecological and macroevolutionary hypotheses still have to be tested, including the role of stronger biotic interactions in the tropics than in other regions [165], and compared with more empirical biodiversity patterns [20]. Considering multiple additional biodiversity patterns will allow a more robust selection of models. Apart from the global LDG case study, we propose an additional case study (S2 Note, S15 Fig) illustrating how gen3sis can be used for regional and theoretical studies, such as investigations of the effect of island ontology on the temporal dynamics of biodiversity [41,166]. Further, illustrations associated with the programming code are offered as a vignette of the R package, which will support broad application of gen3sis. Altogether, our examples illustrate the great potential for exploration provided by gen3sis, promising future advances in our understanding of empirical biodiversity patterns.

Gen3sis could be a valuable tool for exploring iconic biodiversity patterns whose underlying mechanisms remain largely under investigation [167]. For example, although we know that mountains are hotspots of biodiversity [56,168], a causal link between mountain dynamics and biodiversity remains poorly understood [169]. Coupling gen3sis with orogenic and erosion models could shed new light on the role of mountain building and associated surface processes in the formation of biodiversity. More generally, the potential role of plate tectonics and surface processes in generating topographic complexity in biodiversity is becoming a hot research topic in the Earth sciences [73]. Similarly, there are many more species associated with coastal reefs (especially coral reefs) in marine ecosystems than in pelagic environments [170]. While it is expected that a combination of geographic features, including plate tectonics [171], and ecological processes interact to form marine diversity, process strengths and interactions are still under investigation [172]. Using gen3sis with paleoenvironmental reconstructions, it is possible to study the interactive effects of ecological and evolutionary processes in shaping global marine biodiversity, with results increasing in precision as more dense and accurate data on paleoenvironmental reconstructions become available [121]. Gen3sis can further support the study of biological processes and can be used to improve our understanding of the links between temperature and biodiversity. For example, it has been hypothesized that temperature influences diversification [63], but the mechanisms and their consequences are still under discussion [173]. Using gen3sis, it is possible to explore the multiple causal pathways between temperature and biodiversity, with the study of the past providing insight into species responses to ongoing climate change [174]. Finally, gen3sis can be used to explore not only species diversity, but also intraspecific genetic structure and thus the correspondence between these diversity levels [175].

Conclusions

Here, we have introduced gen3sis, a modular simulation engine that enables exploration of the consequences of ecological and evolutionary processes and feedbacks on the emergence of spatiotemporal macro-eco-evolutionary biodiversity dynamics. This modeling approach bears similarity to other computer models that have led to significant progress in other fields, such as climatology [154], cosmology [155], and conservation [156]. We have showcased the versatility and utility of gen3sis by comparing the ability of 3 alternative mechanisms in 2 landscapes to generate the LDG while accounting for other global biodiversity patterns. Besides the LDG, frontiers on the origins of biodiversity involve [16] (i) quantifying speciation, extinction, and dispersal events [119]; (ii) exploring adaptive niche evolution [23,39]; and (iii) investigating multiple diversity dependence and carrying capacity mechanisms [21,115,116]. Further possibilities may include (iv) investigating the mechanisms behind age-dependent speciation and extinction patterns [106,112,176]; (v) exploring contrasts between terrestrial and aquatic ecosystems [16]; and (vi) calculating the uncertainty resulting from climatic and geological dynamics (e.g., [22,23,26,38,43]). Gen3sis can support these research frontiers as a general tool for formalizing and studying existing theories associated with the origin of biodiversity, for testing new hypotheses against data, and for making predictions about future biodiversity trajectories (Table 4). Openly available as an R package, gen3sis has the potential to catalyze interdisciplinary biodiversity research and advance our numerical understanding of biodiversity. We call for the formation of a community of ecologists, biologists, mathematicians, geologists, climatologists, and scientists from other fields around this class of eco-evolutionary simulation models in order to unravel the processes that have shaped Earth’s biodiversity.

Supporting information

S1 Animation. Reconstructed dynamic landscape L1 (i.e., world 65 Ma) with the environmental values used for the main case study.

(MP4)

S2 Animation. Reconstructed dynamic landscape L2 (i.e., world 65 Ma) with the environmental values used for the main case study.

(MP4)

S3 Animation. Theoretical dynamic landscape (i.e., theoretical island) with the environmental values used for the supplementary case study.

(MP4)

S4 Animation. Dynamic simulated biodiversity patterns (i.e., M5 L1 world from 65 Ma to the present).

The map shows the α diversity and the top and right graphs indicate the richness profile of longitude and latitude, respectively.

(MP4)

S1 Fig

Divergence increase per time step di against the normalized occupied niche of isolated populations for models (A) M1, M2, M4, and M5, which assume temperature-independent divergence, and (B) M3, which assumes temperature-dependent divergence, where divergence relates to the mean of the realized temperature with 3 different dpower values.

(PDF)

S2 Fig. Nonexhaustive probability density functions of the explored dispersal parameters in a Weibull distribution with shape ɸ of 1, 2, and 5 and Ψ of 550, 650, 750, and 850.

Data presented available in S2 Data at https://zenodo.org/record/5006413.

(PDF)

S3 Fig

Models (i.e., M1, M2, M3, M4, and M5) (A) Kernel density estimate of the same explored parameters (i.e., divergence threshold and dispersal scale) for selected simulations based on a Pearson correlation of simulated versus best observed (i.e., cor > 0.4) and (B) performance quantified with the BIC. Omitted values from the parameter space were simulations generating an unacceptable best Pearson correlation to the empirical data (r ≤ 0.4), too many species (>35,000) or a weak richness gradient (<20 species between minimal and maximal α-richness). Data presented available in S3 Data at https://zenodo.org/record/5006413. BIC, Bayesian information criteria.

(PDF)

S4 Fig

Summary statistics of the model fit to empirical data with and without environmental dynamics for (A) a Pearson correlation of standardized mean species number per latitude (LDGcurve), (B) a Pearson correlation of spatial α-diversity, and (C) the exact difference between lineage through time curves (nLTT). Data presented available in S2 Data at https://zenodo.org/record/5006413. nLTT, normalized lineage though time.

(PDF)

S5 Fig

Standardized mean species number per latitude (LDGcurve) for empirical data (i.e., terrestrial mammals, birds, amphibians, and reptiles) and best matching simulation from models (A) M1, (B) M2, (C) M3, (D) M4, and (E) M5. Data presented available in S4 Data at https://zenodo.org/record/5006413.

(PDF)

S6 Fig

Frequencies of Pearson correlation between simulated standardized mean species number per latitude (LDGcurve) against best matching empirical LDGcurve for each dynamic landscape L1 (in blue) and L2 (in pink) for models (A) M1, (B) M2, (C) M3, (D) M4, and (E) M5. Models M4 and M5 are the only ones producing correlations >0.5. Data presented available in S3 Data at https://zenodo.org/record/5006413.

(PDF)

S7 Fig. Effects of grid cell size on simulations of M2 L1.

(A) Correlation of grid cell, LDG slope, and other summary statistics. (B) Simulated LDG slope and grid cell size, showing a significant effect of spatial resolution on LDG slope. Data presented available in S5 Data at https://zenodo.org/record/5006413. CPU, central processing unit; LDG, latitudinal diversity gradient.

(PDF)

S8 Fig. Frequencies of simulated normalized LDG slope (histogram) with empirical LDG for 4 main groups (dashed gray line) and acceptance range (red line).

Frequencies for models (A) M1, (B) M2, (C) M3, (D) M4, and (E)M5 with total frequency and frequency discriminated for each landscape, i.e., L1 and L2. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient.

(PDF)

S9 Fig

Normalized richness of (A) selected simulation, (B) terrestrial mammals, (C) birds, (D) amphibians, and (E) reptiles, with Pearson correlation values for comparisons between simulated and empirical data.

(PDF)

S10 Fig. Mean absolute evolutionary events (i.e., speciation and extinction) for every 1 myr for the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

Data presented available in S6 Data at https://zenodo.org/record/5006413.

(PDF)

S11 Fig. Standardized speciation events for every 1 myr of the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

Data presented available in S6 Data at https://zenodo.org/record/5006413.

(PDF)

S12 Fig. Standardized extinction events for every 1 myr of the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

Data presented available in S6 Data at https://zenodo.org/record/5006413.

(PDF)

S13 Fig

Correlation of model parameters and emerging patterns for all models and landscapes without deep-time environmental dynamics (A) M0 L1.0, (B) M0 L2.0, (C) M1 L1.0, (D) M1 L2.0, (E) M2 L1.0, and (F) M2 L2.0. Emerging patterns: (i) phylogeny beta is the phylogenetic tree imbalance statistic measured as the value that maximizes the likelihood in the β-splitting model; (ii) range quant 0.95% is the value of the 95% quantile of the species range area distribution; (iii) LDG % loss is the slope of the linear regression of species richness; (iv) richness r is the highest Pearson correlation between simulated and empirical α-diversity; (v) nLTT diff is the lowest difference between simulated and empirical nLTT curves; and (vi) LDG curve r is the highest Pearson correlation between simulated and empirical standardized mean species number per latitude. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

(PDF)

S14 Fig

Correlation of model parameters and 3 emerging patterns for all models and landscapes considering deep-time environmental dynamics (A) M0 L1, (B) M0 L2, (C) M1 L1, (D) M1 L2, (E) M2 L1, and (F) M2 L2. Emerging patterns: (i) phylogeny beta is the phylogenetic tree imbalance statistic measured as the value that maximizes the likelihood in the β-splitting model; (ii) range quant 0.95% is the value of the 95% quantile of the species range area distribution; (iii) LDG % loss is the slope of the linear regression of species richness; (iv) richness r is the highest Pearson correlation between simulated and empirical α-diversity; (v) nLTT diff is the lowest difference between simulated and empirical nLTT curves; and (vi) LDG curve r is the highest Pearson correlation between simulated and empirical standardized mean species number per latitude. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

(PDF)

S15 Fig

Results of the island case study showing (A) landscape size and environmental dynamics and (B) results of 3 experiments (i.e., lower, equal, and higher trait evolution compared with the temporal environmental variation). The time series in (B) shows γ richness (log10 scale) on theoretical oceanic islands, following the geomorphological dynamics of islands. Thick lines indicate the average of the replicates, whereas thin lines indicate SD envelopes (n = 30 for each trait evolutionary rate scenario). The dashed gray vertical bar crossing the entire plot indicates the period in which the island reaches its maximum size. Data presented available in S7 Data at https://zenodo.org/record/5006413.

(PDF)

S1 Note. Global case study: The emergence of the LDG in the Cenozoic.

(DOCX)

S2 Note. Island case study: Does trait evolution impact biodiversity dynamics?

(DOCX)

S3 Note. Gen3sis pseudo-code.

(DOCX)

Acknowledgments

We thank Samuel Bickel and Alex Skeels for thorough comments on this manuscript and package. We thank Camille Albouy, Charles N.D. Santana, Lydian Boschman, Wilhelmine Bach, Thomas Keggin, Flurin Leugger, Victor L.J. Boussange, Conor Waldock, and all sELDiG working group participants for insightful feedback during the model development. We thank the WSL and ETH Zürich for support and infrastructure, including access to High Performance Computing facilities.

Abbreviations

BIC

Bayesian information criteria

gen3sis

general engine for eco-evolutionary simulations

LDG

latitudinal diversity gradient

nLTT

normalized lineage though time

POM

pattern-oriented modeling

SRD

species range decrease

Data Availability

Gen3sis is implemented in a mix of R and C++ code, and wrapped into an R-package. All high-level functions that the user may interact with are written in R, and are documented via the standard R / Roxygen help files for R-packages. Runtime-critical functions are implemented in C++ and coupled to R via the Rcpp framework. Additionally, the package provides several convenience functions to generate input data, configuration files and plots, as well as tutorials in the form of vignettes that illustrate how to declare models and run simulations. The software, under an open and free GPL3 license, can be downloaded from CRAN at https://CRAN.R-project.org/package=gen3sis. The development version, open to issue reporting and feature suggestions, is available at https://github.com/project-Gen3sis/R-package. Supporting information, such as notes, scripts, data, figures and animations, are available at https://zenodo.org/record/5006413, facilitating full reproducibility.

Funding Statement

OH, JC, FH, MP, TR and LP were part of the sELDiG working group, which was supported by the Synthesis Centre of the German Centre for Integrative Biodiversity Research, Halle-Jena-Leipzig (DFG FZT 118). LP was supported by the Swiss National Science Foundation grant (N° 310030_188550). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Darwin C. On the origin of species by means of natural selection, or preservation of favoured races in the struggle for life. London: John Murray; 1859. [PMC free article] [PubMed] [Google Scholar]
  • 2.Couvreur TLP, Dauby G, Blach-Overgaard A, Deblauwe V, Dessein S, Droissart V, et al. Tectonics, climate and the diversification of the tropical African terrestrial flora and fauna. Biol Rev Camb Philos Soc. 2020. Epub 2020/09/15. doi: 10.1111/brv.12644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Pellissier L, Heine C, Rosauer DF, Albouy C. Are global hotspots of endemic richness shaped by plate tectonics? Biol J Linn Soc. 2017;123(1):247–61. doi: 10.1093/biolinnean/blx125 [DOI] [Google Scholar]
  • 4.Gaston KJ. Global patterns in biodiversity. Nature. 2000;405(6783):220–7. doi: 10.1038/35012228 [DOI] [PubMed] [Google Scholar]
  • 5.Jetz W, Thomas GH, Joy JB, Hartmann K, Mooers AO. The global diversity of birds in space and time. Nature. 2012;491(7424):444–8. Epub 2012/11/06. doi: 10.1038/nature11631 [DOI] [PubMed] [Google Scholar]
  • 6.Mittelbach GG, Schemske DW, Cornell HV, Allen AP, Brown JM, Bush MB, et al. Evolution and the latitudinal diversity gradient: speciation, extinction and biogeography. Ecol Lett. 2007;10(4):315–31. doi: 10.1111/j.1461-0248.2007.01020.x [DOI] [PubMed] [Google Scholar]
  • 7.Fine PVA. Ecological and evolutionary drivers of geographic variation in species diversity. Annu Rev Ecol Evol Syst. 2015;46(1):369–92. doi: 10.1146/annurev-ecolsys-112414-054102 [DOI] [Google Scholar]
  • 8.Av Humboldt, Bonpland A. Ideennzu einer geographie der pflanzen nebst einem naturgemälde der tropenländer, auf beobachtungen und messungen gegründet. Tübingen,: Cotta F. G.; etc.; 1807. 2 p.l., xii, 182 p. p. [Google Scholar]
  • 9.Wallace AR. Tropical nature, and other essays. London: Macmillan and Company; 1878. [Google Scholar]
  • 10.Hutchinson GE. Homage to Santa Rosalia or why are there so many kinds of animals? Am Nat. 1959;93(870):145–59. doi: 10.1086/282070 [DOI] [Google Scholar]
  • 11.MacArthur R, Wilson E. The theory of island biogeography. Princeton, NJ: Princeton Univ. Press; 1967. [Google Scholar]
  • 12.Pianka ER. Convexity, desert lizards, and spatial heterogeneity. Ecology. 1966;47(6):1055–9. doi: 10.2307/1935656 [DOI] [Google Scholar]
  • 13.Willig MR, Kaufman DM, Stevens RD. Latitudinal gradients of biodiversity: Pattern, process, scale, and synthesis. Annu Rev Ecol Evol Syst. 2003;34(1):273–309. doi: 10.1146/annurev.ecolsys.34.012103.144032 [DOI] [Google Scholar]
  • 14.Schemske DW, Mittelbach GG. "Latitudinal gradients in species diversity": Reflections on Pianka’s 1966 article and a look forward. Am Nat. 2017;189(6):599–603. Epub 2017/05/18. doi: 10.1086/691719 [DOI] [PubMed] [Google Scholar]
  • 15.MacArthur RH. Patterns of species diversity. Biol Rev. 1965;40(4):510–33. doi: 10.1111/j.1469-185X.1965.tb00815.x [DOI] [Google Scholar]
  • 16.Benton MJ. Origins of Biodiversity. PLoS Biol. 2016;14(11):e2000724. doi: 10.1371/journal.pbio.2000724 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Morlon H, Parsons TL, Plotkin JB. Reconciling molecular phylogenies with the fossil record. Proc Natl Acad Sci. 2011;108(39):16327–32. Epub 2011/09/21. doi: 10.1073/pnas.1102543108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Field R, Hawkins BA, Cornell HV, Currie DJ, Diniz-Filho JAF, Guégan J-F, et al. Spatial species-richness gradients across scales: a meta-analysis. J Biogeogr. 2009;36(1):132–47. doi: 10.1111/j.1365-2699.2008.01963.x [DOI] [Google Scholar]
  • 19.Sandel B, Arge L, Dalsgaard B, Davies RG, Gaston KJ, Sutherland WJ, et al. The influence of Late Quaternary climate-change velocity on species endemism. Science. 2011;334(6056):660–4. doi: 10.1126/science.1210173 [DOI] [PubMed] [Google Scholar]
  • 20.Pontarp M, Bunnefeld L, Cabral JS, Etienne RS, Fritz SA, Gillespie R, et al. The latitudinal diversity gradient: Novel understanding through mechanistic eco-evolutionary models. Trends Ecol Evol. 2019;34(3):211–23. doi: 10.1016/j.tree.2018.11.009 [DOI] [PubMed] [Google Scholar]
  • 21.Etienne RS, Cabral JS, Hagen O, Hartig F, Hurlbert AH, Pellissier L, et al. A minimal model for the latitudinal diversity gradient suggests a dominant role for ecological limits. Am Nat. 2019;194(5):E122–E33. Epub 2019/10/16. doi: 10.1086/705243 [DOI] [PubMed] [Google Scholar]
  • 22.Saupe EE, Myers CE, Townsend Peterson A, Soberon J, Singarayer J, Valdes P, et al. Spatio-temporal climate change contributes to latitudinal diversity gradients. Nat Ecol Evol. 2019;3(10):1419–29. Epub 2019/09/11. doi: 10.1038/s41559-019-0962-7 [DOI] [PubMed] [Google Scholar]
  • 23.Rangel TF, Edwards NR, Holden PB, Diniz-Filho JAF, Gosling WD, Coelho MTP, et al. Modeling the ecology and evolution of biodiversity: Biogeographical cradles, museums, and graves. Science. 2018;361(6399):eaar5452. doi: 10.1126/science.aar5452 [DOI] [PubMed] [Google Scholar]
  • 24.Doebeli M, Dieckmann U. Speciation along environmental gradients. Nature. 2003;421(6920):259–64. doi: 10.1038/nature01274 [DOI] [PubMed] [Google Scholar]
  • 25.Grimm V, Revilla E, Berger U, Jeltsch F, Mooij WM, Railsback SF, et al. Pattern-oriented modeling of agent-based complex systems: Lessons from ecology. Science. 2005;310(5750):987–91. doi: 10.1126/science.1116681 [DOI] [PubMed] [Google Scholar]
  • 26.Leprieur F, Descombes P, Gaboriau T, Cowman PF, Parravicini V, Kulbicki M, et al. Plate tectonics drive tropical reef biodiversity dynamics. Nat Commun. 2016;7:11461. doi: 10.1038/ncomms11461 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Duputie A, Massol F. An empiricist’s guide to theoretical predictions on the evolution of dispersal. Interface Focus. 2013;3(6):20130028. Epub 2014/02/12. doi: 10.1098/rsfs.2013.0028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Keller I, Seehausen O. Thermal adaptation and ecological speciation. Mol Ecol. 2012;21(4):782–99. Epub 2011/12/21. doi: 10.1111/j.1365-294X.2011.05397.x [DOI] [PubMed] [Google Scholar]
  • 29.Urban MC. The evolution of species interactions across natural landscapes. Ecol Lett. 2011;14(7):723–32. Epub 2011/05/28. doi: 10.1111/j.1461-0248.2011.01632.x [DOI] [PubMed] [Google Scholar]
  • 30.Mittelbach GG, Schemske DW. Ecological and evolutionary perspectives on community assembly. Trend Ecol Evol. 2015;30(5):241–7. Epub 2015/03/26. doi: 10.1016/j.tree.2015.02.008 [DOI] [PubMed] [Google Scholar]
  • 31.Schluter D, Pennell MW. Speciation gradients and the distribution of biodiversity. Nature. 2017;546(7656):48–55. doi: 10.1038/nature22897 [DOI] [PubMed] [Google Scholar]
  • 32.Villa Martin P, Hidalgo J, Rubio de Casas R, Munoz MA. Eco-evolutionary Model of Rapid Phenotypic Diversification in Species-Rich Communities. PLoS Comp Biol. 2016;12(10):e1005139. doi: 10.1371/journal.pcbi.1005139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Leidinger L, Cabral J. Biodiversity dynamics on islands: Explicitly accounting for causality in mechanistic models. Diversity. 2017;9(3). doi: 10.3390/d9030031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Pontarp M, Wiens JJ. The origin of species richness patterns along environmental gradients: uniting explanations based on time, diversification rate and carrying capacity. J Biogeogr. 2017;44(4):722–35. doi: 10.1111/jbi.12896 [DOI] [Google Scholar]
  • 35.Gotelli NJ, Anderson MJ, Arita HT, Chao A, Colwell RK, Connolly SR, et al. Patterns and causes of species richness: a general simulation model for macroecology. Ecol Lett. 2009;12(9):873–86. doi: 10.1111/j.1461-0248.2009.01353.x [DOI] [PubMed] [Google Scholar]
  • 36.Cabral JS, Valente L, Hartig F. Mechanistic simulation models in macroecology and biogeography: state-of-art and prospects. Ecography. 2017;40(2):267–80. doi: 10.1111/ecog.02480 [DOI] [Google Scholar]
  • 37.Connolly SR, Keith SA, Colwell RK, Rahbek C. Process, mechanism, and modeling in macroecology. Trends Ecol Evol. 2017;32(11):835–44. doi: 10.1016/j.tree.2017.08.011 [DOI] [PubMed] [Google Scholar]
  • 38.Saupe EE, Qiao H, Donnadieu Y, Farnsworth A, Kennedy-Asser AT, Ladant J-B, et al. Extinction intensity during Ordovician and Cenozoic glaciations explained by cooling and palaeogeography. Nat Geosci. 2020;13(1):65–70. doi: 10.1038/s41561-019-0504-6 [DOI] [Google Scholar]
  • 39.Saupe EE, Myers CE, Peterson AT, Soberón J, Singarayer J, Valdes P, et al. Non-random latitudinal gradients in range size and niche breadth predicted by spatial patterns of climate. Global Ecol Biogeogr. 2019;28(7):928–42. doi: 10.1111/geb.12904 [DOI] [Google Scholar]
  • 40.Skeels A, Cardillo M. Reconstructing the geography of speciation from contemporary biodiversity data. Am Nat. 2019;193(2):240–55. Epub 2019/02/06. doi: 10.1086/701125 [DOI] [PubMed] [Google Scholar]
  • 41.Cabral JS, Wiegand K, Kreft H. Interactions between ecological, evolutionary and environmental processes unveil complex dynamics of insular plant diversity. J Biogeogr. 2019. doi: 10.1111/jbi.13606 [DOI] [Google Scholar]
  • 42.Cabral JS, Whittaker RJ, Wiegand K, Kreft H, Emerson B. Assessing predicted isolation effects from the general dynamic model of island biogeography with an eco-evolutionary model for plants. J Biogeogr. 2019. doi: 10.1111/jbi.13603 [DOI] [Google Scholar]
  • 43.Donati GFA, Parravicini V, Leprieur F, Hagen O, Gaboriau T, Heine C, et al. A process-based model supports an association between dispersal and the prevalence of species traits in tropical reef fish assemblages. Ecography. 2019;42(12):2095–106. doi: 10.1111/ecog.04537 [DOI] [Google Scholar]
  • 44.Descombes P, Gaboriau T, Albouy C, Heine C, Leprieur F, Pellissier L. Linking species diversification to palaeo-environmental changes: A process-based modelling approach. Global Ecol Biogeogr. 2018;27(2):233–44. doi: 10.1111/geb.12683 [DOI] [Google Scholar]
  • 45.Qiao H, Saupe EE, Soberon J, Peterson AT, Myers CE. Impacts of Niche Breadth and Dispersal Ability on Macroevolutionary Patterns. Am Nat. 2016;188(2):149–62. Epub 2016/07/16. doi: 10.1086/687201 [DOI] [PubMed] [Google Scholar]
  • 46.Pontarp M, Brännström Å, Petchey OL, Poisot T. Inferring community assembly processes from macroscopic patterns using dynamic eco-evolutionary models and Approximate Bayesian Computation (ABC). Methods Ecol Evol. 2019;10(4):450–60. doi: 10.1111/2041-210x.13129 [DOI] [Google Scholar]
  • 47.Gaboriau T, Albouy C, Descombes P, Mouillot D, Pellissier L, Leprieur F. Ecological constraints coupled with deep-time habitat dynamics predict the latitudinal diversity gradient in reef fishes. Proc R Soc B Biol Sci. 2019;286(1911):20191506. Epub 2019/09/19. doi: 10.1098/rspb.2019.1506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Sukumaran J, Economo EP, Lacey Knowles L. Machine Learning Biogeographic Processes from Biotic Patterns: A New Trait-Dependent Dispersal and Diversification Model with Model Choice By Simulation-Trained Discriminant Analysis. Syst Biol. 2016;65(3):525–45. Epub 2015/12/31. doi: 10.1093/sysbio/syv121 [DOI] [PubMed] [Google Scholar]
  • 49.Kubisch A, Holt RD, Poethke H-J, Fronhofer EA. Where am I and why? Synthesizing range biology and the eco-evolutionary dynamics of dispersal. Oikos. 2014;123(1):5–22. doi: 10.1111/j.1600-0706.2013.00706.x [DOI] [Google Scholar]
  • 50.Pigot AL, Phillimore AB, Owens IP, Orme CD. The shape and temporal dynamics of phylogenetic trees arising from geographic speciation. Syst Biol. 2010;59(6):660–73. Epub 2010/10/19. doi: 10.1093/sysbio/syq058 [DOI] [PubMed] [Google Scholar]
  • 51.Rangel Thiago Fernando LVB, Diniz-Filho José Alexandre F, Colwell Robert K. Species Richness and Evolutionary Niche Dynamics: A Spatial Pattern–Oriented Simulation Experiment. Am Nat. 2007;170(4):602–16. doi: 10.1086/521315 [DOI] [PubMed] [Google Scholar]
  • 52.Rangel TFLVB Diniz-Filho JAF. An evolutionary tolerance model explaining spatial patterns in species richness under environmental gradients and geometric constraints. Ecography. 2005;28(2):253–63. doi: 10.1111/j.0906-7590.2005.04038.x [DOI] [Google Scholar]
  • 53.Culina A, van den Berg I, Evans S, Sanchez-Tojar A. Low availability of code in ecology: A call for urgent action. PLoS Biol. 2020;18(7):e3000763. Epub 2020/07/30. doi: 10.1371/journal.pbio.3000763 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Muneepeerakul R, Bertuzzo E, Rinaldo A, Rodriguez-Iturbe I. Evolving biodiversity patterns in changing river networks. J Theor Biol. 2019;462:418–24. Epub 2018/11/30. doi: 10.1016/j.jtbi.2018.11.021 [DOI] [PubMed] [Google Scholar]
  • 55.Jõks M, Pärtel M. Plant diversity in Oceanic archipelagos: realistic patterns emulated by an agent-based computer simulation. Ecography. 2018;42(4):740–54. doi: 10.1111/ecog.03985 [DOI] [Google Scholar]
  • 56.Hagen O, Vaterlaus L, Albouy C, Brown A, Leugger F, Onstein RE, et al. Mountain building, climate cooling and the richness of cold-adapted plants in the Northern Hemisphere. J Biogeogr. 2019. doi: 10.1111/jbi.13653 [DOI] [Google Scholar]
  • 57.Xu X, Kuntner M, Liu F, Chen J, Li D. Formation of rivers and mountains drives diversification of primitively segmented spiders in continental East Asia. J Biogeogr. 2018. doi: 10.1111/jbi.13403 [DOI] [Google Scholar]
  • 58.Ficetola GF, Mazel F, Thuiller W. Global determinants of zoogeographical boundaries. Nat Ecol Evol. 2017;1:0089. doi: 10.1038/s41559-017-0089 [DOI] [PubMed] [Google Scholar]
  • 59.Wiens JJ, Donoghue MJ. Historical biogeography, ecology and species richness. Trends Ecol Evol. 2004;19(12):639–44. doi: 10.1016/j.tree.2004.09.011 [DOI] [PubMed] [Google Scholar]
  • 60.Wiens JJ, Graham CH. Niche conservatism: Integrating evolution, ecology, and conservation biology. Annu Rev Ecol Evol Syst. 2005;36(1):519–39. doi: 10.1146/annurev.ecolsys.36.102803.095431 [DOI] [Google Scholar]
  • 61.Fischer AG. Latitudinal Variations in Organic Diversity. Evolution. 1960;14(1):64–81. doi: 10.2307/2405923 [DOI] [Google Scholar]
  • 62.Stephens Patrick R., Wiens John J. Explaining Species Richness from Continents to Communities: The Time-for-Speciation Effect in Emydid Turtles. Am Nat. 2003;161(1):112–28. doi: 10.1086/345091 [DOI] [PubMed] [Google Scholar]
  • 63.Allen AP, Gillooly JF, Savage VM, Brown JH. Kinetic effects of temperature on rates of genetic divergence and speciation. Proc Natl Acad Sci. 2006;103(24):9130–5. doi: 10.1073/pnas.0603587103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Rohde K. Latitudinal gradients in species diversity: the search for the primary cause. Oikos. 1992:514–27. [Google Scholar]
  • 65.MacArthur RH, Wilson EO. An Equilibrium Theory of Insular Zoogeography. Evolution. 1963;17(4):373–87. doi: 10.2307/2407089 [DOI] [Google Scholar]
  • 66.Rosenzweig ML. Species diversity in space and time: Cambridge University Press; 1995. [Google Scholar]
  • 67.R Core Team. R: A Language and Environment for Statistical Computing. In: Computing RFfS, editor. Vienna, Austria; 2020. [Google Scholar]
  • 68.Bocedi G, Palmer SCF, Malchow A-K, Zurell D, Watts K, Travis JMJ. RangeShifter 2.0: An extended and enhanced platform for modelling spatial eco-evolutionary dynamics and species’ responses to environmental changes. biorXiv. 2020. doi: 10.1101/2020.11.26.400119 [DOI] [Google Scholar]
  • 69.Tello JS, Stevens RD. Can stochastic geographical evolution re-create macroecological richness-environment correlations? Global Ecol Biogeogr. 2012;21(2):212–23. doi: 10.1111/j.1466-8238.2011.00661.x [DOI] [Google Scholar]
  • 70.Bivand RS. Progress in the R ecosystem for representing and handling spatial data. J Geogr Syst. 2020. doi: 10.1007/s10109-020-00336-0 [DOI] [Google Scholar]
  • 71.Wilson G, Aruliah DA, Brown CT, Chue Hong NP, Davis M, Guy RT, et al. Best practices for scientific computing. PLoS Biol. 2014;12(1):e1001745. doi: 10.1371/journal.pbio.1001745 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Alzate A, Janzen T, Bonte D, Rosindell J, Etienne RS, Belmaker J. A simple spatially explicit neutral model explains the range size distribution of reef fishes. Global Ecol Biogeogr. 2019;28(7):875–90. doi: 10.1111/geb.12899 [DOI] [Google Scholar]
  • 73.Salles T, Rey P, Bertuzzo E. Mapping landscape connectivity as a driver of species richness under tectonic and climatic forcing. Earth Surf Dyn. 2019;7(4):895–910. doi: 10.5194/esurf-7-895-2019 [DOI] [Google Scholar]
  • 74.Giezendanner J, Bertuzzo E, Pasetto D, Guisan A, Rinaldo A. A minimalist model of extinction and range dynamics of virtual mountain species driven by warming temperatures. PLoS ONE. 2019;14(3):e0213775. Epub 2019/03/19. doi: 10.1371/journal.pone.0213775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.van Etten J. R package gdistance: Distances and routes on geographical grids. J Stat Softw. 2017;76(13). doi: 10.18637/jss.v076.i13 [DOI] [Google Scholar]
  • 76.Dobzhansky T. Genetics and the origin of species. Repr. ed. New York: Columbia University Press; 1982. 364 S. p. doi: 10.1126/science.216.4548.844 [DOI] [Google Scholar]
  • 77.Stanley SM. Population Size, Extinction, and Speciation: The Fission Effect in Neogene Bivalvia. Paleobiology. 1986;12(1):89–110. [Google Scholar]
  • 78.Wisz MS, Pottier J, Kissling WD, Pellissier L, Lenoir J, Damgaard CF, et al. The role of biotic interactions in shaping distributions and realised assemblages of species: implications for species distribution modelling. Biol Rev Camb Philos Soc. 2013;88(1):15–30. doi: 10.1111/j.1469-185X.2012.00235.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Pontarp M, Petchey OL. Ecological opportunity and predator-prey interactions: linking eco-evolutionary processes and diversification in adaptive radiations. Prc R Soc B Biol Sci. 2018;285(1874). Epub 2018/03/09. doi: 10.1098/rspb.2017.2550 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Pellissier L, Descombes P, Hagen O, Chalmandrier L, Glauser G, Kergunteuil A, et al. Growth-competition-herbivore resistance trade-offs and the responses of alpine plant communities to climate change. Funct Ecol. 2018;32(7):1693–703. doi: 10.1111/1365-2435.13075 [DOI] [Google Scholar]
  • 81.Pellissier L. Stability and the competition-dispersal trade-off as drivers of speciation and biodiversity gradients. Front Ecol Evol. 2015;3. doi: 10.3389/fevo.2015.00052 [DOI] [Google Scholar]
  • 82.Ronce O. How Does It Feel to Be Like a Rolling Stone? Ten Questions About Dispersal Evolution. Annu Rev Ecol Evol Syst. 2007;38(1):231–53. doi: 10.1146/annurev.ecolsys.38.091206.095611 [DOI] [Google Scholar]
  • 83.Stegen JC, Enquist BJ, Ferriere R. Advancing the metabolic theory of biodiversity. Ecol Lett. 2009;12(10):1001–15. doi: 10.1111/j.1461-0248.2009.01358.x [DOI] [PubMed] [Google Scholar]
  • 84.Gillooly JF, Allen AP, West GB, Brown JH. The rate of DNA evolution: Effects of body size and temperature on the molecular clock. Proc Natl Acad Sci U S A. 2005;102(1):140–5. doi: 10.1073/pnas.0407735101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Ohta T. The nearly neutral theory of molecular evolution. Annu Rev Ecol Syst. 1992;23(1):263–86. doi: 10.1146/annurev.es.23.110192.001403 [DOI] [Google Scholar]
  • 86.Felsenstein J. Maximum-likelihood estimation of evolutionary trees from continuous characters. Am J Hum Genet. 1973;25(5):471. [PMC free article] [PubMed] [Google Scholar]
  • 87.Hansen TF. Stabilizing selection and the comparative analysis of adaptation. Evolution. 1997;51(5):1341–51. doi: 10.1111/j.1558-5646.1997.tb01457.x [DOI] [PubMed] [Google Scholar]
  • 88.Macarthur R, Levins R. The limiting similarity, convergence, and divergence of coexisting species. Am Nat. 1967;101(921):377–85. doi: 10.1086/282505 [DOI] [Google Scholar]
  • 89.Cabral JS, Kreft H, Higgins S. Linking ecological niche, community ecology and biogeography: insights from a mechanistic niche model. J Biogeogr. 2012;39(12):2212–24. doi: 10.1111/jbi.12010 [DOI] [Google Scholar]
  • 90.Grimm V, Railsback SF. Pattern-oriented modelling: a ’multi-scope’ for predictive systems ecology. Philos T R Soc B. 2012;367(1586):298–310. Epub 2011/12/07. doi: 10.1098/rstb.2011.0180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Lewin R. Biologists disagree over bold signature of nature. Science. 1989;244(4904):527–9. doi: 10.1126/science.244.4904.527 [DOI] [PubMed] [Google Scholar]
  • 92.Lawton JH. Are there general laws in ecology? Oikos. 1999;84(2):177–92. doi: 10.2307/3546712 [DOI] [Google Scholar]
  • 93.Swenson NG, Anglada-Cordero P, Barone JA. Deterministic tropical tree community turnover: evidence from patterns of functional beta diversity along an elevational gradient. Proc R Soc B Biol Sci. 2011;278(1707):877–84. Epub 2010/09/24. doi: 10.1098/rspb.2010.1369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Anderson MJ, Crist TO, Chase JM, Vellend M, Inouye BD, Freestone AL, et al. Navigating the multiple meanings of beta diversity: a roadmap for the practicing ecologist. Ecol Lett. 2011;14(1):19–28. Epub 2010/11/13. doi: 10.1111/j.1461-0248.2010.01552.x [DOI] [PubMed] [Google Scholar]
  • 95.Holt BG, Lessard J-P, Borregaard MK, Fritz SA, Araújo MB, Dimitrov D, et al. An update of wallace’s zoogeographic regions of the world. Science. 2013;339(6115):74–8. doi: 10.1126/science.1228282 [DOI] [PubMed] [Google Scholar]
  • 96.Qian H, Ricklefs RE, White PS. Beta diversity of angiosperms in temperate floras of eastern Asia and eastern North America. Ecol Lett. 2005;8(1):15–22. doi: 10.1111/j.1461-0248.2004.00682.x [DOI] [Google Scholar]
  • 97.Bar-On YM, Phillips R, Milo R. The biomass distribution on Earth. Proc Natl Acad Sci. 2018;115(25):6506–11. Epub 2018/05/23. doi: 10.1073/pnas.1711842115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Brown JH. On the relationship between abundance and distribution of species. Am Nat. 1984;124(2):255–79. doi: 10.1086/284267 [DOI] [Google Scholar]
  • 99.Gaston KJ. Species-range-size distributions: patterns, mechanisms and implications. Trends Ecol Evol. 1996;11(5):197–201. doi: 10.1016/0169-5347(96)10027-6 [DOI] [PubMed] [Google Scholar]
  • 100.Gómez-Rodríguez C, Baselga A, Wiens JJ. Is diversification rate related to climatic niche width? Global Ecol Biogeogr. 2015;24(4):383–95. doi: 10.1111/geb.12229 [DOI] [Google Scholar]
  • 101.Lamanna C, Blonder B, Violle C, Kraft NJ, Sandel B, Simova I, et al. Functional trait space and the latitudinal diversity gradient. Proc Natl Acad Sci. 2014;111(38):13745–50. Epub 2014/09/17. doi: 10.1073/pnas.1317722111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Rolland J, Salamin N. Niche width impacts vertebrate diversification. Global Ecol Biogeogr. 2016;25(10):1252–63. doi: 10.1111/geb.12482 [DOI] [Google Scholar]
  • 103.Morlon H. Phylogenetic approaches for studying diversification. Ecol Lett. 2014;17(4):508–25. Epub 2014/02/19. doi: 10.1111/ele.12251 [DOI] [PubMed] [Google Scholar]
  • 104.Silvestro D, Warnock RCM, Gavryushkina A, Stadler T. Closing the gap between palaeontological and neontological speciation and extinction rate estimates. Nat Commun. 2018;9(1):5237. Epub 2018/12/12. doi: 10.1038/s41467-018-07622-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Stadler T. Mammalian phylogeny reveals recent diversification rate shifts. Proc Natl Acad Sci. 2011;108(15):6187–92. Epub 2011/03/30. doi: 10.1073/pnas.1016876108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Hagen O, Hartmann K, Steel M, Stadler T. Age-dependent speciation can explain the shape of empirical phylogenies. Syst Biol. 2015;64(3):432–40. doi: 10.1093/sysbio/syv001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Manceau M, Lambert A, Morlon H. Phylogenies support out-of-equilibrium models of biodiversity. Ecol Lett. 2015;18(4):347–56. Epub 2015/02/26. doi: 10.1111/ele.12415 [DOI] [PubMed] [Google Scholar]
  • 108.Swenson NG, Erickson DL, Mi X, Bourg NA, Forero-Montaña J, Ge X, et al. Phylogenetic and functional alpha and beta diversity in temperate and tropical tree communities. Ecology. 2012;93(sp8):S112–S25. doi: 10.1890/11-0402.1 [DOI] [Google Scholar]
  • 109.Graham CH, Fine PV. Phylogenetic beta diversity: linking ecological and evolutionary processes across space in time. Ecol Lett. 2008;11(12):1265–77. Epub 2008/12/03. doi: 10.1111/j.1461-0248.2008.01256.x [DOI] [PubMed] [Google Scholar]
  • 110.Morlon H, Schwilk DW, Bryant JA, Marquet PA, Rebelo AG, Tauss C, et al. Spatial patterns of phylogenetic diversity. Ecol Lett. 2011;14(2):141–9. Epub 2010/12/21. doi: 10.1111/j.1461-0248.2010.01563.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Petchey OL, Gaston KJ. Functional diversity: back to basics and looking forward. Ecol Lett. 2006;9(6):741–58. Epub 2006/05/19. doi: 10.1111/j.1461-0248.2006.00924.x [DOI] [PubMed] [Google Scholar]
  • 112.Silvestro D, Castiglione S, Mondanaro A, Serio C, Melchionna M, Piras P, et al. A 450 million years long latitudinal gradient in age-dependent extinction. Ecol Lett. 2020;23(3):439–46. Epub 2019/12/20. doi: 10.1111/ele.13441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Igea J, Tanentzap AJ. Angiosperm speciation cools down in the tropics. Ecol Lett. 2020;23(4):692–700. Epub 2020/02/12. doi: 10.1111/ele.13476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Tittensor DP, Worm B. A neutral-metabolic theory of latitudinal biodiversity. Global Ecol Biogeogr. 2016;25(6):630–41. doi: 10.1111/geb.12451 [DOI] [Google Scholar]
  • 115.Storch D, Bohdalkova E, Okie J. The more-individuals hypothesis revisited: the role of community abundance in species richness regulation and the productivity-diversity relationship. Ecol Lett. 2018;21(6):920–37. Epub 2018/04/17. doi: 10.1111/ele.12941 [DOI] [PubMed] [Google Scholar]
  • 116.Hurlbert AH, Stegen JC. When should species richness be energy limited, and how would we know? Ecol Lett. 2014;17(4):401–13. Epub 2014/01/08. doi: 10.1111/ele.12240 [DOI] [PubMed] [Google Scholar]
  • 117.Eyres A, Eronen JT, Hagen O, Bohning-Gaese K, Fritz SA. Climatic effects on niche evolution in a passerine bird clade depend on paleoclimate reconstruction method. Evolution. 2021;75(5):1046–60. Epub 2021/03/17. doi: 10.1111/evo.14209 [DOI] [PubMed] [Google Scholar]
  • 118.Kissling WD, Eiserhardt WL, Baker WJ, Borchsenius F, Couvreur TL, Balslev H, et al. Cenozoic imprints on the phylogenetic structure of palm species assemblages worldwide. Proc Natl Acad Sci. 2012;109(19):7379–84. Epub 2012/04/25. doi: 10.1073/pnas.1120467109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Meseguer AS, Condamine FL. Ancient tropical extinctions at high latitudes contributed to the latitudinal diversity gradient. Evolution. 2020. Epub 2020/04/05. doi: 10.1111/evo.13967 [DOI] [PubMed] [Google Scholar]
  • 120.Zachos J, Pagani M, Sloan L, Thomas E, Billups K. Trends, rhythms, and aberrations in global climate 65Ma to present. Science. 2001;292(5517):686–93. doi: 10.1126/science.1059412 [DOI] [PubMed] [Google Scholar]
  • 121.Straume EO, Gaina C, Medvedev S, Nisancioglu KH. Global Cenozoic Paleobathymetry with a focus on the Northern Hemisphere Oceanic Gateways. Gondwana Res. 2020;86:126–43. doi: 10.1016/j.gr.2020.05.011 [DOI] [Google Scholar]
  • 122.Scotese CR, Wright N. PALEOMAP Paleodigital Elevation Models (PaleoDEMS) for the Phanerozoic. 2018. Available from: https://www.earthbyte.org/paleodem-resource-scotese-and-wright-2018.
  • 123.Westerhold T, Marwan N, Drury AJ, Liebrand D, Agnini C, Anagnostou E, et al. An astronomically dated record of Earth’s climate and its predictability over the last 66 million years. Science. 2020;369(6509):1383–7. doi: 10.1126/science.aba6853 [DOI] [PubMed] [Google Scholar]
  • 124.Zhang L, Hay WW, Wang C, Gu X. The evolution of latitudinal temperature gradients from the latest Cretaceous through the Present. Earth-Sci Rev. 2019;189:147–58. doi: 10.1016/j.earscirev.2019.01.025 [DOI] [Google Scholar]
  • 125.Hollis CJ, Dunkley Jones T, Anagnostou E, Bijl PK, Cramwinckel MJ, Cui Y, et al. The DeepMIP contribution to PMIP4: methodologies for selection, compilation and analysis of latest Paleocene and early Eocene climate proxy data, incorporating version 0.1 of the DeepMIP database. Geosci Model Dev. 2019;12(7):3149–206. doi: 10.5194/gmd-12-3149-2019 [DOI] [Google Scholar]
  • 126.Hutchinson DK, de Boer AM, Coxall HK, Caballero R, Nilsson J, Baatsen M. Climate sensitivity and meridional overturning circulation in the late Eocene using GFDL CM2.1. Clim Past. 2018;14(6):789–810. doi: 10.5194/cp-14-789-2018 [DOI] [Google Scholar]
  • 127.Evans D, Sagoo N, Renema W, Cotton LJ, Muller W, Todd JA, et al. Eocene greenhouse climate revealed by coupled clumped isotope-Mg/Ca thermometry. Proc Natl Acad Sci. 2018;115(6):1174–9. Epub 2018/01/24. doi: 10.1073/pnas.1714744115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Cramwinckel MJ, Huber M, Kocken IJ, Agnini C, Bijl PK, Bohaty SM, et al. Synchronous tropical and polar temperature evolution in the Eocene. Nature. 2018;559(7714):382–6. Epub 2018/07/04. doi: 10.1038/s41586-018-0272-2 [DOI] [PubMed] [Google Scholar]
  • 129.Sijp WP, von der Heydt AS, Dijkstra HA, Flögel S, Douglas PMJ, Bijl PK. The role of ocean gateways on cooling climate on long time scales. Glob Planet Change. 2014;119:1–22. doi: 10.1016/j.gloplacha.2014.04.004 [DOI] [Google Scholar]
  • 130.Keating-Bitonti CR, Ivany LC, Affek HP, Douglas P, Samson SD. Warm, not super-hot, temperatures in the early Eocene subtropics. Geology. 2011;39(8):771–4. doi: 10.1130/g32054.1 [DOI] [Google Scholar]
  • 131.Annan JD, Hargreaves JC. A new global reconstruction of temperature changes at the Last Glacial Maximum. Clim Past. 2012;9(1):367–76. doi: 10.5194/cp-9-367-2013 [DOI] [Google Scholar]
  • 132.Scotese CR. Some thoughts on global climate change: the transition from icehouse to hothouse. Paleomap Project. 2015;21:1(2). [Google Scholar]
  • 133.Boucot AJ, Xu C, Scotese CR, Morley RJ. Phanerozoic paleoclimate: an atlas of lithologic indicators of climate. Tulsa, U.S.A.: Society of Economic Paleontologists and Mineralogists (Society for Sedimentary Geology); 2013. [Google Scholar]
  • 134.Marcot JD, Fox DL, Niebuhr SR. Late Cenozoic onset of the latitudinal diversity gradient of North American mammals. Proc Natl Acad Sci U S A. 2016;113(26):7189–94. Epub 2016/06/15. doi: 10.1073/pnas.1524750113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Levine SH. Competitive Interactions in Ecosystems. Am Nat. 1976;110(976):903–10. doi: 10.1086/283116 [DOI] [Google Scholar]
  • 136.Storch D, Okie JG, Field R. The carrying capacity for species richness. Global Ecol Biogeogr. 2019;28(10):1519–32. doi: 10.1111/geb.12987 [DOI] [Google Scholar]
  • 137.Kucherenko S, Albrecht D, Saltelli A. Exploring multi-dimensional spaces: A comparison of Latin hypercube and quasi Monte Carlo sampling techniques. arXiv preprint arXiv:150502350. 2015. [Google Scholar]
  • 138.Hartig F, Calabrese JM, Reineking B, Wiegand T, Huth A. Statistical inference for stochastic simulation models—theory and application. Ecol Lett. 2011;14(8):816–27. Epub 2011/06/18. doi: 10.1111/j.1461-0248.2011.01640.x [DOI] [PubMed] [Google Scholar]
  • 139.Jenkins CN, Pimm SL, Joppa LN. Global patterns of terrestrial vertebrate diversity and conservation. Proc Natl Acad Sci. 2013;110(28):E2602–10. Epub 2013/06/28. doi: 10.1073/pnas.1302251110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.IUCN. IUCN Red List of threatened species. 2020 [cited 2020 Jan 1]. Available from: http://www.iucnredlist.org
  • 141.BirdLife International. Data zone BirdLife internatinal 2020 [cited 2020 Jan 1]. Available from: http://datazone.birdlife.org
  • 142.Descombes P, Leprieur F, Albouy C, Heine C, Pellissier L. Spatial imprints of plate tectonics on extant richness of terrestrial vertebrates. J Biogeogr. 2017;44(5):1185–97. doi: 10.1111/jbi.12959 [DOI] [Google Scholar]
  • 143.Bininda-Emonds OR, Cardillo M, Jones KE, MacPhee RD, Beck RM, Grenyer R, et al. The delayed rise of present-day mammals. Nature. 2007;446(7135):507–12. Epub 2007/03/30. doi: 10.1038/nature05634 [DOI] [PubMed] [Google Scholar]
  • 144.Kuhn TS, Mooers AØ, Thomas GH. A simple polytomy resolver for dated phylogenies. Methods Ecol Evol. 2011;2(5):427–36. doi: 10.1111/j.2041-210X.2011.00103.x [DOI] [Google Scholar]
  • 145.Pyron RA, Burbrink FT. Early origin of viviparity and multiple reversions to oviparity in squamate reptiles. Ecol Lett. 2014;17(1):13–21. Epub 2013/08/21. doi: 10.1111/ele.12168 [DOI] [PubMed] [Google Scholar]
  • 146.Pyron RA, Wiens JJ. Large-scale phylogenetic analyses reveal the causes of high tropical amphibian diversity. Proc R Soc B Biol Sci. 2013;280(1770):20131622. Epub 2013/09/13. doi: 10.1098/rspb.2013.1622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Meseguer AS, Antoine PO, Fouquet A, Delsuc F, Condamine FL, McGill B. The role of the Neotropics as a source of world tetrapod biodiversity. Global Ecol Biogeogr. 2020;29(9):1565–78. doi: 10.1111/geb.13141 [DOI] [Google Scholar]
  • 148.Rahbek C, Graves GR. Multiscale assessment of patterns of avian species richness. Proc Natl Acad Sci. 2001;98(8):4534–9. doi: 10.1073/pnas.071034898 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 149.Jansson R, Rodriguez-Castaneda G, Harding LE. What can multiple phylogenies say about the latitudinal diversity gradient? A new look at the tropical conservatism, out of the tropics, and diversification rate hypotheses. Evolution. 2013;67(6):1741–55. Epub 2013/06/05. doi: 10.1111/evo.12089 [DOI] [PubMed] [Google Scholar]
  • 150.Weir JT, Schluter D. The latitudinal gradient in recent speciation and extinction rates of birds and mammals. Science. 2007;315(5818):1574–6. doi: 10.1126/science.1135590 [DOI] [PubMed] [Google Scholar]
  • 151.Spicer RA. Phytopaleoaltimetry: using plant fossils to measure past land surface elevation. In: Carina Hoorn, Allison Perrigo, Antonelli A, editors. Mountains, Climate and Biodiversity Oxford: Wiley. Oxford: Wiley; 2018. p. 95–109. [Google Scholar]
  • 152.Aldous DJ. Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today. Stat Sci. 2001;16(1):23–34. doi: 10.1214/ss/998929474 [DOI] [Google Scholar]
  • 153.Janzen T, Höhna S, Etienne RS, Paradis E. Approximate Bayesian Computation of diversification rates from molecular phylogenies: introducing a new efficient summary statistic, the nLTT. Methods Ecol Evol. 2015;6(5):566–75. doi: 10.1111/2041-210x.12350 [DOI] [Google Scholar]
  • 154.Simpkins G. Progress in climate modelling. Nat Clim Change. 2017;7(10):684–5. doi: 10.1038/nclimate3398 [DOI] [Google Scholar]
  • 155.Vogelsberger M, Marinacci F, Torrey P, Puchwein E. Cosmological simulations of galaxy formation. Nat Rev Phys. 2020;2(1):42–66. doi: 10.1038/s42254-019-0127-2 [DOI] [Google Scholar]
  • 156.Gaucherel C, Carpentier C, Geijzendorffer IR, Noûs C, Pommereau F. Discrete-event models for conservation assessment of integrated ecosystems. Ecol Inform. 2020. doi: 10.1016/j.ecoinf.2020.101205 [DOI] [Google Scholar]
  • 157.Levins R. The strategy of model building in population biology. Am Sci. 1966;54(4):421–31. [Google Scholar]
  • 158.Zurell D, Berger U, Cabral JS, Jeltsch F, Meynard CN, Münkemüller T, et al. The virtual ecologist approach: simulating data and observers. Oikos. 2010;119(4):622–35. doi: 10.1111/j.1600-0706.2009.18284.x [DOI] [Google Scholar]
  • 159.Morlon H, Lewitus E, Condamine FL, Manceau M, Clavel J, Drury J, et al. RPANDA: an R package for macroevolutionary analyses on phylogenetic trees. Methods Ecol Evol. 2016;7(5):589–97. doi: 10.1111/2041-210x.12526 [DOI] [Google Scholar]
  • 160.Tsirogiannis C, Sandel B. PhyloMeasures: a package for computing phylogenetic biodiversity measures and their statistical moments. Ecography. 2016;39(7):709–14. doi: 10.1111/ecog.01814 [DOI] [Google Scholar]
  • 161.Laliberté E, Legendre P, Shipley B, Laliberté ME. Package ‘FD’: Measuring functional diversity from multiple traits, and other tools for functional ecology. 2014. [DOI] [PubMed] [Google Scholar]
  • 162.Cabral JS, Schurr FM. Estimating demographic models for the range dynamics of plant species. Global Ecol Biogeogr. 2010;19(1):85–97. doi: 10.1111/j.1466-8238.2009.00492.x [DOI] [Google Scholar]
  • 163.Clark James S. Why trees migrate so fast: Confronting theory with dispersal biology and the paleorecord. Am Nat. 1998;152(2):204–24. doi: 10.1086/286162 [DOI] [PubMed] [Google Scholar]
  • 164.Quintero I, Wiens JJ. Rates of projected climate change dramatically exceed past rates of climatic niche evolution among vertebrate species. Ecol Lett. 2013;16(8):1095–103. Epub 2013/06/27. doi: 10.1111/ele.12144 [DOI] [PubMed] [Google Scholar]
  • 165.Schemske DW, Mittelbach GG, Cornell HV, Sobel JM, Roy K. Is There a Latitudinal Gradient in the Importance of Biotic Interactions? Annu Rev Ecol Evol Syst. 2009;40(1):245–69. doi: 10.1146/annurev.ecolsys.39.110707.173430 [DOI] [Google Scholar]
  • 166.Borregaard MK, Matthews TJ, Whittaker RJ, Field R. The general dynamic model: towards a unified theory of island biogeography? Global Ecol Biogeogr. 2016;25(7):805–16. doi: 10.1111/geb.12348 [DOI] [Google Scholar]
  • 167.Rahbek C, Borregaard MK, Colwell RK, Dalsgaard B, Holt BG, Morueta-Holme N, et al. Humboldt’s enigma: What causes global patterns of mountain biodiversity? Science. 2019;365(6458):1108–13. doi: 10.1126/science.aax0149 [DOI] [PubMed] [Google Scholar]
  • 168.Zizka A, Antonelli A. Mountains of diversity. Nature. 2018;555(7695):173–4. doi: 10.1038/d41586-018-02062-6 [DOI] [PubMed] [Google Scholar]
  • 169.Rahbek C, Borregaard MK, Antonelli A, Colwell RK, Holt BG, Nogues-Bravo D, et al. Building mountain biodiversity: Geological and evolutionary processes. Science. 2019;365(6458):1114–9. doi: 10.1126/science.aax0151 [DOI] [PubMed] [Google Scholar]
  • 170.Nelson JS, Grande T, Wilson MVH. Fishes of the world. 5th ed. Hoboken, New Jersey: John Wiley & Sons; 2016. xli, 707 pages p. [Google Scholar]
  • 171.Zaffos A, Finnegan S, Peters SE. Plate tectonic regulation of global marine animal diversity. Proc Natl Acad Sci. 2017. doi: 10.1073/pnas.1702297114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 172.Bowen BW, Rocha LA, Toonen RJ, Karl SA, ToBo L. The origins of tropical marine biodiversity. Trends Ecol Evol. 2013;28(6):359–66. Epub 2013/03/05. doi: 10.1016/j.tree.2013.01.018 [DOI] [PubMed] [Google Scholar]
  • 173.Boag TH, Gearty W, Stockey RG. Metabolic tradeoffs control biodiversity gradients through geological time. Curr Biol. 2021. Epub 2021/05/08. doi: 10.1016/j.cub.2021.04.021 [DOI] [PubMed] [Google Scholar]
  • 174.Bennett JM, Sunday J, Calosi P, Villalobos F, Martinez B, Molina-Venegas R, et al. The evolution of critical thermal limits of life on Earth. Nat Commun. 2021;12(1):1198. Epub 2021/02/21. doi: 10.1038/s41467-021-21263-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 175.Yannic G, Hagen O, Leugger F, Karger DN, Pellissier L. Harnessing paleo-environmental modeling and genetic data to predict intraspecific genetic structure. Evol Appl. 2020;13(6):1526–42. Epub 2020/07/21. doi: 10.1111/eva.12986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 176.Warren BH, Hagen O, Gerber F, Thebaud C, Paradis E, Conti E. Evaluating alternative explanations for an association of extinction risk and evolutionary uniqueness in multiple insular lineages. Evolution. 2018;72(10):2005–24. Epub 2018/08/29. doi: 10.1111/evo.13582 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Roland G Roberts

15 Jul 2020

Dear Dr Hagen,

Thank you for submitting your manuscript entitled "gen3sis: The GENeral Engine for Eco-Evolutionary SImulationS on the origins of biodiversity" for consideration as a Methods and Resources paper by PLOS Biology.

Your manuscript has now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Please re-submit your manuscript within two working days, i.e. by Jul 17 2020 11:59PM.

Login to Editorial Manager here: https://www.editorialmanager.com/pbiology

During resubmission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF when you re-submit.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed all checks it will be sent out for review.

Given the disruptions resulting from the ongoing COVID-19 pandemic, please expect delays in the editorial process. We apologise in advance for any inconvenience caused and will do our best to minimize impact as far as possible.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor

PLOS Biology

Decision Letter 1

Roland G Roberts

14 Sep 2020

Dear Dr Hagen,

Thank you very much for submitting your manuscript "gen3sis: The GENeral Engine for Eco-Evolutionary SImulationS on the origins of biodiversity" for consideration as a Methods and Resources paper at PLOS Biology. Your manuscript has been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by three independent reviewers.

You'll see that while the reviewers recognise the potential merits of your approach, they are all struggling to identify the clear advance or unique utility that gen3sis represents over previous initiatives. While the open availability of your platform is a bonus, the reviewers are currently not fully persuaded by the examples presented or by your claims to be able to meet the stipulated challenges.

The reviews of your manuscript are appended below. Based on their specific comments and following discussion with the academic editor, I regret that we cannot accept the current version of the manuscript for publication. We remain interested in your study and we would be willing to consider resubmission of a comprehensively revised version that thoroughly addresses all the reviewers' comments. Note that one reviewer says "if space is an issue"; at PLOS Biology we have no space restrictions, but the manuscript should remain well structured and accessible. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript would be sent for further evaluation by the reviewers.

We appreciate that these requests represent a great deal of extra work, and we are willing to relax our standard revision time to allow you six months to revise your manuscript. We expect to receive your revised manuscript within 6 months.

Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type.

*Resubmission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this resubmission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor,

rroberts@plos.org,

PLOS Biology

*****************************************************

REVIEWERS' COMMENTS:

Reviewer #1:

Hagen and colleagues present a tool to simulate eco-evolutionary processes. The simulation framework can be used to generate null models of biodiversity, and to test macroecological and macroevolutionary processes across spatio-temporal scales. The framework is flexible, allowing users to specify various parameters. The authors suggest the platform represents a much-needed step forward in understanding the processes that shape biodiversity over Earth history. Similar simulations have been used to test potential processes that influence biodiversity, but these frameworks are not readily available for the broader community. Thus, this tool will be of broad interest but is not novel, and there are now numerous studies that have employed similar frameworks (see below). In general, I think this is a worthwhile contribution. However, there are a number of issues that need to be addressed prior to publication. Chief among them is a lack of clarity regarding the parameters and functionality of the simulation framework. More detail needs to be provided on virtually all aspects of the simulation framework, outlined below. The authors also provide two examples of implementation of the simulation framework. These examples seem half-baked, without clear hypotheses, null models, or explanation of how the simulations were performed. The authors need to spend more time discussing both examples, or, if space is an issue, eliminate one example so that the other can be given due diligence. Both of these are essential changes.

1. One of the benefits of a simulation framework is the ability to test competing processes and hypotheses, and to generate null models. The authors do not adequately formulate hypotheses for how the processes in the model affect biodiversity patterns, nor do they implement null models. For example, in the abstract they indicate that: "Via the process of niche evolution in response to changes in climate and the spatial distribution of habitat, our model predicts realistic island biodiversity dynamics, as well as a global latitudinal biodiversity gradient and species richness distribution". However, it is not at all clear what this means, and how niche evolution does or does not affect biodiversity patterns. Is niche evolution necessary to generate realistic biodiversity patterns? How does niche evolution produce the patterns, and which processes in the simulation (e.g., species, dispersal, extinction, carrying capacity) contribute most to the patterns they find? At what rate do niches need to evolve? These questions remain unclarified in the body of the text.

2. The case studies were useful for assessing how the simulations can be applied. However, similar to my comment above, it was not at all clear what the reader should take away from these exercises. In other words, I was left wondering what the authors learned from the modelling efforts: which processes were important in generating the realistic biodiversity patterns? Was it the process of speciation? Extinction? Dispersal? Again, the benefit of a simulation framework is the ability to test, explicitly, the relative contribution of processes to the generation of biodiversity, and to examine what type of patterns result if those processes are not employed (i.e., a null model). If space is a limitation, the authors should focus on only one case study and better explain both the methods and results to readers. In short, I was disappointed in the examples and poor explanation provided by the authors, especially since the authors suggest that these types of simulations can provide 'general rules' for the generation of biodiversity, and yet they do not focus on what they found regarding 'rules' of biodiversity, nor do they employ null models to ensure results are unique to specific processes.

3. Line 79: One of the unique aspects provided by the authors' simulation framework is the ability to assess the effect size of certain variables on the generation of realistic biodiversity patterns. This can be done by gradually increasing model complexity, and by implementation of null models. The authors should discuss this benefit in more detail (around line 79) and employ this framework in their example case studies.

4. The authors note that 'practical implementations of simulations have yet to be consolidated or widely used' (line 74). This is not strictly accurate, and the authors should point readers to notable exceptions, such as those below (not an exhaustive list):

Rangel, T. F. et al (2018). Modeling the ecology and evolution of biodiversity: Biogeographical cradles, museums, and graves. Science, 361(6399).

Rangel, T. F. et al (2007). Species richness and evolutionary niche dynamics: a spatial pattern-oriented simulation experiment. The American Naturalist, 170(4), 602-616.

Saupe, E. E. et al (2019). Spatio-temporal climate change contributes to latitudinal diversity gradients. Nature ecology & evolution, 3(10), 1419-1429.

Saupe, E. E. et al (2020). Extinction intensity during Ordovician and Cenozoic glaciations explained by cooling and palaeogeography. Nature Geoscience, 13(1), 65-70.

Saupe, E. E. et al (2019). Non‐random latitudinal gradients in range size and niche breadth predicted by spatial patterns of climate. Global Ecology and Biogeography, 28(7), 928-942.

5. Line 82 and 90: Gotelli and colleagues and Saupe and colleagues have noted that simulations can be used to examine how continental configuration and paleogeography influence biodiversity dynamics, which also provide examples of simulations on million-year time frames (e.g., Saupe et al. 2020. Nature Geoscience).

6. The authors should provide clarity on the process of extinction in the simulation framework. When does extinction occur, and can extinction criteria be modified by the user?

7. The authors refer to 'barriers' within the manuscript, but it was unclear whether such barriers can be specified, and how they could be specified. I.e., could a separate input file representing rivers, mountain ranges, etc be used to limit dispersal? Or, is climate the only potential barrier? Similarly, the authors indicate that oceanic pixels/cells are inhabitable for simulations focused on the terrestrial realm: this is very surprising, and I would like the authors to justify this choice (see line 363). Is this the case for the simulations in general, or can this feature be 'turned off'?

8. The simulation framework allows for trait divergence and trait evolution. This is excellent, but the authors should specify what types of traits can be modelled explicitly in the manuscript.

9. The links provided in the manuscript to the code are broken.

10. The authors generally describe the functions in the simulation framework, but each could benefit from more detail on options and functionality.

11. The process of speciation in the simulation framework was not clear. The authors indicate that clusters/populations gradually accumulate incompatibility, but it was not clear at what rate this occurs (line 223). Can the rate be altered? Is the rate affected by the rate of climate change? The description regarding trait divergence that eventually leads to speciation differs from the description of divergence on line 360, which instead refers to numbers of years for speciation to occur (e.g., 2, 4, 6, 8, 10 Ma). Can the authors clarify?

12. Conceptual island case study example (line 299): as noted before, this example could benefit from additional explanation. It was difficult to understand the authors' hypotheses/expectations, and their methodology. Did the authors perform simulations to see what processes should be tweaked to generate a flat biodiversity gradient on the island? What amount of trait evolution, and which ecological factors, interact to produce richness gradients?

~12.1: the authors note (line 305) that dispersal was limited to surrounding sites. Does this mean that populations could only disperse to surrounding cells? If so, does it follow that speciation could only occur if local extinction happened, which then isolated populations?

~12.1.1: Did the authors consider other dispersal thresholds?

~12.2: What does 'high' and 'low' trait evolution mean?

~12.3: I was surprised to see that diversity did not decrease when the hypothetical island decreased in size by erosional processes, following the species-area relationship. Do the authors have an explanation for why diversity did not decrease, which seems incongruent with empirical biodiversity patterns?

13. Latitudinal biodiversity gradient case study example: Again, it was not at all clear what were the hypotheses for this case study. Which processes did the authors expect to generate latitudinal biodiversity gradients? Speciation? Extinction? Carrying capacity alone?

~13.1: Saupe et al in NEE used simulations to evaluate processes that could generate latitudinal biodiversity gradients. From what I can see, the simulations are similar to those performed by Saupe and colleagues, although Saupe et al invoked fewer biotic processes (and therefore their simulations are potentially less realistic). This contribution seems relevant to cite here (line 419), in addition to reference 19, and should also be discussed more thoroughly throughout this case study. How do the findings and patterns of the authors compare to previously published simulations that used a similar simulation framework? Are the processes invoked similar? The same comment could also be made for the study by Rangel et al 2019 (Science), although this study was not global in scale: how do the findings compare to the authors' simulations, which also focused on simulating biodiversity gradients?

~13.2: The authors note they initiated the simulations from a single ancestor species over the terrestrial surface (line 347). The wording here is unclear, as is the authors' methodology. Does this imply that the simulation started from only one grid cell on Earth (if so, which?), or that random grid cells were occupied globally (if so, which?), but that they all belonged to the same species?

~13.3: The authors note on line 382 that 'LDG emerged systematically from the underlying modelled mechanisms', but how? Why? From what mechanisms?

~13.4: It wasn't clear how speciation or extinction was parameterized in these simulations. Please clarify.

~13.5: The authors assume a carrying capacity. This is interesting and likely realistic, but won't this automatically generate more diversity at low latitudes based on the authors' assumption that carrying capacity is predicated on energy? This does not seem a true test of process, as the scientific community already knows that energy is higher at low latitudes. Thus, if you assume more energy allows for a higher carrying capacity, it will automatically allow a latitudinal biodiversity gradient to form. What would happen if the authors do not assume a carrying capacity: is a latitudinal biodiversity gradient produced from the simulations? And, what differed in the simulations that did not produce biodiversity gradients, which the authors reference occurred around 10 percent of the time? I'd like to see more interrogation of these opposing patterns.

~13.6: The authors note that an increase in K correlated positively with LDG slope, which is not at all surprising. My question is: what other processes were involved? Speciation, extinction, dispersal? What process was dominant - i.e., has the largest effect size?

~13.7: Apologies if I missed this, but I did not see where the authors obtained their empirical richness patterns, from which to compare to their simulations (line 385).

~13.8: The authors do not indicate how patterns change when parameters are tweaked. Are patterns similar at different thresholds (i.e., they altered scale of dispersal, rates of evolution, speciation time, etc)? What is the sensitivity of biodiversity patterns to model parameterization?

~13.9: The climatic data used to power the latitudinal biodiversity gradient example are highly suspect (detailed in note 2S). There are many weaknesses to what the authors have done, although they have clearly put a lot of effort into the reconstructions. The key thing here is that the Koppen reconstructions are qualitative at best, with large uncertainties associated with them. A few (of many) criticisms would be:

* They are built from the distribution of sediments, but these have large climatic ranges associated with them. For instance, bauxites show that it is seasonally wet (but how seasonal is poorly defined) and warm (above ~22C) (see Price, G., P. Valdes and B. Sellwood, Prediction of modern bauxite occurrence: Implications for climate reconstruction. Palaeogeography Palaeoclimatology Palaeoecology, 1997. 131(1-2): p. 1-13.). The temperature could be anywhere between 22C or 32C, or indeed more! This impreciseness of the indicator is true for pretty much all lithologies used. Hence, the Koppen maps are a broad-brush estimate of the climate (and this is ignoring the fact that, for some time periods, there are large data gaps so the maps will be spatially crude as well).

* Koppen uses very broad categories. For instance, temperate climates are defined as anything between 0 and 18C! This is not very precise. As far as I understand, the authors perform a strange smoothing process (this needs to be clarified) to change these categories into something quantitative, but you could pretty much do anything here. They don't seem to think about basic climate phenomena (such as continentality), so the resulting temperatures will not be physically realistic. Other Koppen categories are mainly moisture-based, but the same comments apply.

* There is a recent paper that indicates temperature using present day lapse rates is extremely inaccurate.

* Interpolating to 170 kyr is spurious. The data is at stage level (~5Ma) and anything finer has no scientific input.

* Overall, I would be unconvinced that the reconstructions are quantitatively reliable. At best, they may reflect some very large scale (continental size) gradients, but I would have no confidence on smaller scales.

* Minor note: reference 3 and 4 seem to be the same in note S2.

14. Line 407: should read as 'flexibility'

15. The authors should examine the example code associated with the 'gen3sis' R package. The resulting map of alpha richness does not match empirical patterns very well, and the authors may want to consider a new example.

Reviewer #2:

This study presents a new R package for conducting simulations of eco-evolutionary dynamics to explore biodiversity patterns. The authors argue that a reliance on statistical and verbal models lacking mechanism has hindered progress in biogeography and macro-ecology and that computer simulations, in which these mechanisms can be encoded, can provide new insights into the causes of variation in biodiversity over time and space. The paper describes a general simulation model aimed to address this, that includes various processes such as dispersal, allopatric speciation, trait evolution and extinction, that take place on a dynamic landscape. The paper is well explained, the structure of the model is simple to follow and overall seems sensible. Two case studies are provided to illustrate the applicability of the model. The first, demonstrates that when island area rises and falls over time, this leads to a hump shaped trend in species richness, that is sensitive to how rapidly niche related traits evolve. The second, shows that when warm-wet environments have a higher individual carrying capacity, this leads to a latitudinal gradient in species richness.

I agree with the authors that these kinds of mechanistic simulation models are an additional useful tool in understanding biodiversity patterns. Researchers have been advocating the use of such models for many years and there have been a number of high profile studies implementing this kind of approach. I have to admit, however, that I struggled to identify in this paper what the key advance or new insight is over these earlier mechanistic modelling studies.

While as a modeller, I remain hopeful and open-minded, I am also sceptical that the general approach advocated here, will address the challenges to the field that the authors highlight. The problem, is that with these increasingly complex models the number of free parameters or hard encoded processes becomes enormous. Often the models are too complex to adequately explore parameter space and to fit to empirical data. As a result, earlier mechanistic modelling studies have resorting to picking a particular set of model assumptions and simply showing that this can produce realistic looking patterns but without formal hypothesis testing or parameter estimation. For instance, in the current study, the authors show that when warm-wet environments support more individuals then a latitudinal diversity gradient emerges. This is expected and of course not a new finding (e.g. Hubbell's 2001 neutral theory shows that metacommunities with more individuals contain more species), but more importantly, it doesn't provide much additional insight into whether this mechanism actually contributes to causing the LDG.

The problem comes back to the fact that there are multiple explanations for why richness varies with gradients in temperature and water availability and these causal hypotheses are extremely difficult to test because they all lead to the same prediction: that richness is highest in warm-wet environments. If you look at disciplines such as climate science, which the authors point to as an example of where mechanistic models have been extremely successful, the reason these complex models (e.g. GCMs) work is because we have a very good understanding of the basic physics and chemistry. Without a more thorough understanding of the mechanisms involved in generating and maintaining biodiversity, and without independent parameter estimates for most of the key processes, it is difficult to see this approach leading to major advances. This is a general criticism of the approach, and one not limited to this specific study, but I failed to see how the authors propose to address this problem.

I think the R package that the authors have developed would be a useful tool that I am sure ecologists would employ. Whether it really provides a 'general' model as the authors claim I'm not so sure. I think every ecologist will have different opinions about how a particular process should be modelled, or requirements for what should or should not be included in the model. For example, I was not convinced by the first case study on island dynamics presented in the paper, showing that with faster rates of niche evolution, species diversity is expected to decrease. This arises in the model because species that evolve rapidly may evolve to have niches that do not correspond to climate conditions on the island. But it is hard to see how this would actually happen in nature and it seems highly unrealistic: if the climate affects individual survival, then species would adapt to the conditions they encounter on the island and not just randomly drift through niche space. With faster rates of niche evolution species would be able to better track changing climate conditions and thus avoid extinction, leading to higher diversity.

One suggestion, would be for the authors to expand the scope of their experimental simulations. For instance, to simulate the evolution of biodiversity patterns under a particular set of model assumptions and parameters (stage 1). Then to employ a formal model fitting exercise where they run multiple simulations under different assumptions and parameter combinations and test the fit of these different scenarios to the patterns produced in stage 1. Testing how often the true model is recovered (type 1 and type 2 errors), how accurately and reliably the true parameter values can be estimated, and how these inferences depend on different assumptions would be an important first step to demonstrating the model is potentially useful. In the context of the current study, if the authors could demonstrate that richness correlates with energy-water availability because of differences in individual carrying capacity, rather than other prominent explanations (e.g. niche conservatism, niche partitioning etc) then this would be much more compelling.

Reviewer #3:

This is an interesting and well-written presentation of a new r package for ecological modelling. The goal of the paper is worthwhile - to create a general modular software framework for deeper integration of models in geographical ecology to improve theory and make it more explicit.

It's a little bit hard for me to tell whether the authors actually reach this worthy goal. They argue that fields like cosmology and climatology are heavily reliant on mechanistic models, which they are, but those models seem very different. In cosmology everything is highly deterministic and fully specifiable once you know the initial conditions, a modeller's dream. And climate modelling uses a small number of huge models, greatly dwarfing ecological parallels such as the Madingley model and clearly completely different from what is offered here. The premise that cosmology and climate uses models and so should ecology thus seems superficial and in my opinion doesn't add theoretical depth.

My main question is what this model is for. Is it a toy program that can be used for teaching? Is it a fully finished model? The authors cast it as a "modelling engine with a modular implementation", indicating that this is something that other people can hook up to their code, but how exactly they would do that is not clear. Is it really a module that can be tweaked and changed and incorporated into other people's research projects? Or really just a very customisable final model? There's a mention of the flexibility coming from functions being definable in an open way, but no example of this (that I can see). As an editorial comment, I don't think the code examples should be relegated to the supplement in a paper like this - let's see the package in use in the main text.

This question is especially pertinent given that, as they state, "a mechanstic understanding" in ecology "is elusive". Do we really know enough about the mechanisms to say that the implementation choices taken here are sensible? The authors state in the abstract that the model produces "realistic island biodiversity dynamics", but how do we know that the emerging dynamics are realistic? How much of the model framework is amenable to modification, and what elements are taken as assumed? I can't see that distinction made.

I have a few other implementation-based questions. It appears that "runtime critical" elements are implemented via RCpp, but looking at the github code, it's actually a fairly small part of the code that is written in C++. Is that really enough to make the model run fast? There are no runtimes given for the examples, which I guess is fair enough given the diversity of hardware, but would be nice to get an indication of how fast it is and could be made. I'm here assuming that the code does not call into any C++ libraries not included in the repo (it doesn't look like it).

Finally it would be nice to see some more expanded documentation for using the package. There are vignettes available, but are they sufficient to run the software for other people?

In conclusion, this is potentially interesting, but I don't at present feel I have sufficient information to evaluate whether the authors achieve what they promise with the package.

Decision Letter 2

Roland G Roberts

10 Feb 2021

Dear Dr Hagen,

Thank you very much for submitting a revised version of your manuscript "gen3sis: the general engine for eco-evolutionary simulations on the origins of biodiversity" for consideration as a Methods and Resources paper at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor and the original reviewers.

You'll see that while reviewer #3 is now satisfied, reviewers #1 and #2 continue to raise some significant concerns with your treatment of the literature (and specifically the novelty of your approach with respect to other studies, such as Rangel et al, Saupe et al), with the novelty and strength of your LDG case study, including the parameters and scenarios explored. The Academic Editor asked me to emphasise the need for you to better explain how the resource builds on existing work in the field.

In light of the reviews (below), we will not be able to accept the current version of the manuscript, but we would welcome re-submission of a much-revised version that takes into account the reviewers' comments. We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent for further evaluation by the reviewers.

IMPORTANT: I should also say that we will be willing to consult reviewers only once more on this manuscript, and if they remain unconvinced by the merits of your study after the next resubmission we would not invite you to revise again.

We expect to receive your revised manuscript within 3 months.

Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension. At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may end consideration of the manuscript at PLOS Biology.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point by point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Related" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosbiology/s/submission-guidelines#loc-materials-and-methods

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor,

rroberts@plos.org,

PLOS Biology

*****************************************************

REVIEWERS' COMMENTS:

Reviewer #1:

In general, I found the manuscript much improved. The R package will be useful to the evolutionary and ecological communities, although I disagree with the authors regarding its novelty. To me, the power of the R package is providing an easy-to-use framework that is flexible with appropriate vignettes explaining the parameters. As currently written, the authors (perhaps inadvertently) downplay the novelty and significance of previous, similar platforms.

Most of my questions regarding the framework and its flexibility were clarified by the authors. They do a nice job explaining the various parameters in Gen3sis, and I think the algorithm will be of interest to the community. However, and apologies if I missed this, it was still unclear to me how extinction occurred. I imagine extinction could occur via numerous routes, and this should be made clear.

I still found issues with the main LDG case study provided by the authors. Although I understand the authors provide the case study as an example of how the modelling framework operates, they also present the results as if the study is novel with conclusive findings. This is problematic, as the LDG study is not novel (the same question has been examined using a similar framework by numerous authors) and is missing key elements, which are discussed below. The authors would have convinced me of the power of their framework and the use of LDGs as a case study if they had evaluated more than three hypotheses and included null models. However, this was not done, even though the authors cite this functionality as one of the primary novelties of their framework (and it was requested by the reviewers previously).

1. Hypotheses:

a. The presentation and description of hypotheses requires further thought. The authors focus on three hypotheses: time for species accumulation, diversification rates, and ecological limits. The 'time to speciation' hypothesis is purposely vague in the literature because it is difficult to disentangle the various processes that could produce higher species richness at low latitudes using empirical data. However, a mechanistic model allows for these processes to be examined explicitly. Speciation, extinction, and dispersal are the only mechanisms that directly generate differential patterns of biodiversity across landscapes: how might these three processes contribute to greater species richness in the tropics given an older tropics, and which are supported by the M0 model? Usually, the 'tropics as older' hypothesis infers extinction rates were higher at high latitudes due to climate change (especially in the Northern Hemisphere), but all three could be involved (or not).

b. Regarding the carrying capacity hypothesis: the authors should discuss (even briefly) why higher abundance may lead to higher species richness. I understand that higher abundance would lead to more biomass, but not necessarily how it could lead to greater species richness. It is interesting to test this hypothesis in the Gen3sis framework, but the authors need to better explain the mechanics, as this is the key contribution of their modelling framework. Again, the only processes that can produce more species in a region compared to another region are differential rates of dispersal, speciation or extinction. Thus, how might carrying capacity generate differential rates of speciation, extinction, or dispersal? Do the authors suggest that lower carrying capacity will increase extinction of incipient species at high latitudes? Or, do the authors suggest that speciation and extinction rates are constant across latitudes, and the higher carrying capacity contributes to higher diversity in the tropics via dispersal of species into the tropics? Again, you found this model to be the best supported, and some discussion of how this occurs in the simulation - the mechanisms - is warranted and would be of great interest to readers.

c. Assumptions are provided for each of the models/hypotheses (M0, M1 and M2) in Note S1, with the exception of Model 0. Model 0 does not seem to be a true null model, and the assumptions should be specified. Indeed, proper null simulations are not provided by the authors, which would allow for the effects of certain parameters on patterns to be examined (e.g., absence of speciation, extinction, ecology). The authors mentioned null simulations are feasible (for example, in Note S1 'mechanisms can be completely turned off. In the case of ecology, this will lead to an ecologically neutral model'), and I was therefore surprised to see they were not performed. E.g., do flat LDGs result if climate does not change? If extinction or speciation is prevented, but dispersal allowed?

2. Results:

a. How did speciation, extinction, and dispersal contribute to LDG for M2? I found the better fit of M2 interesting, but I wanted to know more. This is not discussed by the authors and seems critical to the authors' conclusions.

b. The figures are very nice. What was interesting, however, was that the simulated LDG did not fit the empirical LDGs any better (or so it seemed based on Fig S6 and Fig. 4c) than the LDGs found by Rangel and colleagues and Saupe and colleagues. The perhaps poorer fit of model M2 to empirical data compared to previous models needs to be addressed, especially if the authors wish to highlight their conclusions throughout the manuscript (rather than use the 'case study' as a toy example).

c. I was particularly intrigued that all models (M0, M1 and M2) resulted in LDGs (noted by the authors on line 473). This seems an important and interesting finding, and I would have liked to see more information on how close simulated patterns from M0 and M1 are to empirical gradients. What are the correlation coefficients between simulated and empirical curves, and what did the LDG curves look like in these cases?

3. Discussion:

a. The authors discuss the congruence of their patterns with Saupe et al, but they do not compare their results to Rangel et al's seminal work focused on South America, which analysed the effect of various parameters, including niche evolution (absent from Saupe et al's model) on LDGs.

b. I assume the authors could not assess the effect of precipitation on speciation (found by Saupe and colleagues to be important), as the authors use only temperature as a variable to constrain niches?

Minor:

Line 439 and 456: remove apostrophe and replace with comma

Line 101: perhaps not interoperability, but rather implementation of models by others

Line 498: move the reference to the end of the clause, after 'of the LDG using computer models"

The references at the end of this sentence also do not examine LDGs sensu stricto, but do represent mechanistic computer models.

Lines 585-608: I think it would be useful for readers (enhance clarity) if these lines were moved to the introductory paragraph of the case study.

Reviewer #2:

I think the authors have gone to a lot of effort to address the comments from the last review and the paper is improved. I do believe that the R package that the authors have developed will be a useful tool. However, I'm afraid that I still feel that the empirical application of the model presented here is far from convincing. Given the authors claims that these kinds of complex models can provide new insights into large scale biodiversity patterns, providing a powerful example to demonstrate this is important.

The authors now focus on using their model to make inferences about the causes of the latitudinal diversity gradient. They explore three different scenarios: a time for speciation model, speciation rate and carrying capacity model. The authors conclude that the carrying capacity model best fits empirical patterns in terms of matching the slope of the LDG, the slope of the range size frequency distribution and the shape of the phylogenetic tree (phylogenetic imbalance). That a model with a latitudinal gradient in carrying capacity best explain the observed LDG is a strong conclusion to make but I really don't think the application and fitting of the model is robust enough to say this.

One concern is the parameter values used. There seemed to be little to no justification for the choice of parameter values used and as far as I could tell the authors do not provide evidence showing that their conclusions are robust to a comprehensive exploration of parameter space. For the parameters they examine, only a handful of different values are used. I understand that with such a complex model it is very hard to comprehensively explore parameter space, but I feel this is an important weakness of the current approach being advocated here. Using slightly lower or higher values, or indeed some value intermediate to the ones you have chosen, could change the conclusion of which scenario fits best. Of course, this possibility can never be ruled out entirely, but I think just trying 3 or 4 different values for a parameter is just nowhere near sufficient. To me the current implementation of the simulation remains an illustrative, that can be used to make statements likes 'here are the kinds of patterns that can arise under different scenarios under these assumptions' but it cannot be used to robustly test hypotheses.

The authors only keep simulations that fit certain criteria e.g. fewer than 50,000 species. But it seems like for the scenarios they explored most simulations are retained. This suggests that a wider range of parameter values needed to be explored - ideally you want to explore the full breadth of parameter space, so that in one scenario all species go extinct before the end of the simulation while in another they all exceed some maximum imposed value. This would tell you that you are reaching the bounds of realistic parameter values and that there is no need to explore more extreme values.

The authors comparison of scenarios also seem incomplete. First, as far as I could tell from the notes the authors compare a model where there is a latitudinal gradient in carrying capacity (M2) to models (m0 and m1) where there is no carrying capacity. But what about the scenario where there is a carrying capacity but it doesn't vary across latitude? This scenario is needed in order to demonstrate whether it is the presence of a carrying capacity or the gradient in carrying capacity which drives differences in model fit.

I had a query about m1 and the gradient in speciation rates. As far as I could tell the gradient in time to speciation is ~ 3 (i.e. 3 times faster at the hottest sites) regardless of the parameter values chosen (e.g. 4/1.3 = 3.1 and 20/6.7 = 3). Perhaps I'm missing something here, but in that case a clearer explanation is needed.

When comparing scenarios, the authors identify which model best matches the empirical data even though the number of parameters varies between models. Some kind of penalisation for model complexity seems necessary here. Also, the authors narrow down parameter space by simulating on a coarse grid of 4 degrees, before then applying those parameters at a finer resolution. To me this doesn't seem justifiable. The best parameters for explaining the spatial and phylogenetic patterns will almost certainly vary with the number of grid cells in the simulation. This will apply to both the absolute parameter values (e.g. a coarser grid with fewer cells will lead to lower richness) and relative parameter values (e.g. a coarser grid with fewer cells will limit the extent if variation in range size among species and thus tree imbalance).

Another issue is the choice of statistics used to compare empirical and simulated patterns. In table 2, the authors present a long list of spatial and phylogenetic metrics that their model can be used to predict. Why then only use three statistics and why these three? The choice seemed arbitrary and perhaps not obvious. If the model makes predictions of richness in each cell why not compare the spatial patterns rather than just the LDG slope which throws so much information away? Why look at phylogenetic imbalance? Given that one of the hypotheses the authors are testing is whether there is a carrying capacity to diversity, the temporal dynamics of species diversification would have seemed an obvious, indeed necessary, metric to include. Metrics have been developed for matching empirical and simulated LTT plots that I think would be more informative (e.g. Janzen et al 2015). Related to this, none of the metrics capture absolute species richness or range size. So, it could be that the parameter values that give the best match according to the 3 summary statistics chosen give implausible levels of richness. The metric of Janzen et al 2015 would help to address this. In short, I would be very sceptical that the metrics chosen here can robustly discriminate between the three different scenarios. I would also be concerned about the sensitivity of some of these statistics to other choices in the model and I don't think the current study presents a strong case for understanding how these statistics depend on the different assumptions of the model or the parameter values chosen. For instance, previous studies have shown that there are many factors that can cause differences in phylogenetic tree imbalance, from the geographic mode of speciation to assumptions about how species compete. Perhaps in the simulations the authors explored, observed tree imbalance is best matched by the model with a varying carrying capacity but change the way you model speciation or competition and this may no longer be the case.

Janzen, T., Höhna, S. and Etienne, R.S. (2015), Approximate Bayesian Computation of diversification rates from molecular phylogenies: introducing a new efficient summary statistic, the nLTT. Methods Ecol Evol, 6: 566-575. https://doi.org/10.1111/2041-210X.12350

L393 -What is the justification for assuming a globally distributed species? How does this initial condition influence the relative fit of the models?

L405 - it is not immediately clear how this is representing the 'time for species accumulation' model as the simulation starts with a single globally distributed species rather than one restricted to the tropics. I assume this is because at the start of the simulation most environments on Earth were hot and thus this is the ancestral niche. if that is the rationale then I think this needs to be spelt out more clearly.

L407 - Only temperature considered in site suitability? What about precipitation?

Reviewer #3:

I'm happy with the edits to the manuscript - this is a great paper and package. I'll add that I did not expect this when I started the review, but the author's comments and edits really convinced me of the usefulness of the engine presented here.

Decision Letter 3

Roland G Roberts

7 Jun 2021

Dear Dr Hagen,

Thank you for submitting your revised Methods and Resources entitled "gen3sis: the general engine for eco-evolutionary simulations on the origins of biodiversity" for publication in PLOS Biology. I have now obtained advice from two of the original reviewers and have discussed their comments with the Academic Editor. 

Based on the reviews, we will probably accept this manuscript for publication, provided you satisfactorily address the remaining points raised by the reviewers. Please also make sure to address the following data and other policy-related requests.

IMPORTANT: Please address the following:

a) Regarding the title, we wonder whether (especially given the reviewers' comments) it might be more accurate to say "a general engine" rather than "the"? Also, is it possible to make the second half of the title more explicit. We suggest "gen3sis: a general engine for eco-evolutionary simulations of the processes that shape Earth’s biodiversity" (based on a phrase from your Abstract), but are open to other suggestions.

b) Please address the remaining points from the reviewers. The Academic Editor asked me "to emphasise reviewer #1's comments and ask the authors to explore further the potential to build on this method in the Discussion as it is by no means conclusive."

c) Please attend to my Data Policy requests below. Essentially we'll need the numerical values presented in Figs 4BC, S2, S3B, S4ABC, S5ABCDEF, S6ABCDE, S7AB, S8ABC, S10, S11, S12, S13ABCDEFGHIJ, S14ABCDEFGHIJ, S15AB to be made available in some way; in addition, the location of the data should be clearly stated in the respective legends.

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

-  a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

-  a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable)

-  a track-changes file indicating any changes that you have made to the manuscript. 

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information  

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Early Version*

Please note that an uncorrected proof of your manuscript will be published online ahead of the final version, unless you opted out when submitting your manuscript. If, for any reason, you do not want an earlier version of your manuscript published online, uncheck the box. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Roli Roberts

Roland G Roberts, PhD,

Senior Editor,

rroberts@plos.org,

PLOS Biology

------------------------------------------------------------------------

DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797 

Many thanks for supplying the code and data required to reproduce the results in Github. I can see that the global heatmaps are best generated directly from those files, but for the simpler structured data in your paper, we do ask for all numerical values that underlie the Figures to be made available in one of the following forms:

1) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

2) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication. 

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figs 4BC, S2, S3B, S4ABC, S5ABCDEF, S6ABCDE, S7AB, S8ABC, S10, S11, S12, S13ABCDEFGHIJ, S14ABCDEFGHIJ, S15AB. NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

IMPORTANT: Please also ensure that figure legends in your manuscript include information on where the underlying data can be found (e.g. the supplementary data files and/or Github), and ensure your supplemental data file/s has a legend.

Please ensure that your Data Statement in the submission system accurately describes where your data can be found.

------------------------------------------------------------------------

DATA NOT SHOWN?

- Please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of these data or provide figures presenting the results and the data underlying the figure(s).

------------------------------------------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

The authors have done a nice job with their revisions, and I am now mostly satisfied with the manuscript. The model is well described, and its utility is apparent. I also appreciate how the authors have made their LDG example just that: an example. The authors have downplayed the results and discussion of the LDG analyses throughout, and I think this greatly helps with clarity and alleviates the concerns that I (and the other Reviewer) had. I still find issues with some of their LDG work. However, I think these issues become less important if the authors can provide even more explicit discussion that the analyses are simply illustrating the utility of their framework, and they are not mean to make broad claims about LDG generation.

1) I am not convinced by the author's justification for assuming a globally distributed species and how this affects the initial condition for inference of the models.

2) What does 'modelled process of environmental filtering' mean? (Abstract)

3) The authors partition existing hypotheses for the LDG into three broad categories: (i) time for species accumulation and niche conservatism, (ii) variation in diversification rates, and (iii) variation in ecological limit. Although useful to categorize the many existing hypotheses for the LDG, the categories devised by the authors seem inaccurate. For example, time to species accumulation does not need to invoke niche conservatism. Indeed, the 'tropics as older' hypothesis is primarily about how climate in the tropics has been stable for longer, which allows more species to accumulate because of reduced extinction. Niche conservatism can play a role in this hypothesis, but it does not need to do so. The second hypothesis (variation in diversification rates) should also include discussion of the tropics as a cradle vs museum. Rates of speciation and extinction can vary across latitudes for more reasons than increased metabolic and mutation rates. Finally, the third category invoked by the authors (variation in ecological limit) is also, ultimately, about reduced extinction due to differing ecological limits. This section needs to be made clearer, or - alternatively - the authors should focus solely on their subset of hypotheses, without making claims to the broader set of existing LDG hypotheses.

4) I am still concerned about the authors' protocol of removing simulations if they do not meet certain criteria. It is easy to force simulations to match empirical data if you only require the simulations to do so some of the time. This, however, does not necessarily reveal anything deep about underlying mechanisms. For example, the authors throw out any simulations that do not match the following acceptance criteria: (i) LDG between 5.4% and 1.1%, (ii) tree shape statistic, β between -1.4 and -0.3, and (iii) range size frequencies with a decrease in the number of large-range species with a tolerance of 5% (Line 533).

5) One of the best models found was M4, which imposed a carrying capacity. However, this carrying capacity did not vary by latitude. I was curious, then, what caused the LDG to emerge in M4? Do the authors have insight into this?

6) The authors indicate that their results "corroborates with a contribution of both speciation [22], extinction [6] and dispersal [149] in shaping the LDG" (Line 567). However, the authors found that dispersal occurs out of the tropics, not into the tropics (Line 564). Thus, I wondered how dispersal could contribute to the LDG in this case? Dispersal in this instance seems to be occurring in the reverse direction to elevate diversity in the tropics.

Line 252: population instead of populations?

Line 485: distribution, not distributions?

Line 580: South America, not South American

Line 596: Consider rewording to "we made sure to select parameters based on a range supported by the literature"

Line 648-651: consider rewording, as this sentence is a bit confusing.

Reviewer #2:

While I remain skeptical about the utility of these very complex models and would have expected to see a really impressive empirical demonstration first (the method would then sell itself!), I think that the computer model that the authors present will be a useful tool for others and will hopefully encourage further advances.

The authors have undoubtedly improved the implementation and analysis of their simulations examining different explanations for the LDG. They have also toned down their interpretation of these results, admitting that these should be viewed as a preliminary exploration. So, I think this is now much more balanced and robust.

In short, the authors have addressed my comments.

Decision Letter 4

Roland G Roberts

23 Jun 2021

Dear Dr Hagen,

On behalf of my colleagues and the Academic Editor, Andrew Tanentzap, I'm pleased to say that we can in principle offer to publish your Methods and Resources paper, "gen3sis: a general engine for eco-evolutionary simulations of the processes that shape Earth’s biodiversity" in PLOS Biology, provided you address any remaining formatting and reporting issues. These will be detailed in an email that will follow this letter and that you will usually receive within 2-3 business days, during which time no action is required from you. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have made the required changes.

IMPORTANT: Many thanks for providing the underlying data in the Zenodo deposition. However, please could you include the Zenodo URL in each relevant Figure legend, so that the Figures are standalone? e.g. in the legend for Fig 4, "Data presented here is available in S1 Data at https://zenodo.org/record/5006413..." I have told my colleagues to expect this change.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS: We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely,

Roli Roberts

Roland G Roberts, PhD 

Senior Editor 

PLOS Biology

rroberts@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Animation. Reconstructed dynamic landscape L1 (i.e., world 65 Ma) with the environmental values used for the main case study.

    (MP4)

    S2 Animation. Reconstructed dynamic landscape L2 (i.e., world 65 Ma) with the environmental values used for the main case study.

    (MP4)

    S3 Animation. Theoretical dynamic landscape (i.e., theoretical island) with the environmental values used for the supplementary case study.

    (MP4)

    S4 Animation. Dynamic simulated biodiversity patterns (i.e., M5 L1 world from 65 Ma to the present).

    The map shows the α diversity and the top and right graphs indicate the richness profile of longitude and latitude, respectively.

    (MP4)

    S1 Fig

    Divergence increase per time step di against the normalized occupied niche of isolated populations for models (A) M1, M2, M4, and M5, which assume temperature-independent divergence, and (B) M3, which assumes temperature-dependent divergence, where divergence relates to the mean of the realized temperature with 3 different dpower values.

    (PDF)

    S2 Fig. Nonexhaustive probability density functions of the explored dispersal parameters in a Weibull distribution with shape ɸ of 1, 2, and 5 and Ψ of 550, 650, 750, and 850.

    Data presented available in S2 Data at https://zenodo.org/record/5006413.

    (PDF)

    S3 Fig

    Models (i.e., M1, M2, M3, M4, and M5) (A) Kernel density estimate of the same explored parameters (i.e., divergence threshold and dispersal scale) for selected simulations based on a Pearson correlation of simulated versus best observed (i.e., cor > 0.4) and (B) performance quantified with the BIC. Omitted values from the parameter space were simulations generating an unacceptable best Pearson correlation to the empirical data (r ≤ 0.4), too many species (>35,000) or a weak richness gradient (<20 species between minimal and maximal α-richness). Data presented available in S3 Data at https://zenodo.org/record/5006413. BIC, Bayesian information criteria.

    (PDF)

    S4 Fig

    Summary statistics of the model fit to empirical data with and without environmental dynamics for (A) a Pearson correlation of standardized mean species number per latitude (LDGcurve), (B) a Pearson correlation of spatial α-diversity, and (C) the exact difference between lineage through time curves (nLTT). Data presented available in S2 Data at https://zenodo.org/record/5006413. nLTT, normalized lineage though time.

    (PDF)

    S5 Fig

    Standardized mean species number per latitude (LDGcurve) for empirical data (i.e., terrestrial mammals, birds, amphibians, and reptiles) and best matching simulation from models (A) M1, (B) M2, (C) M3, (D) M4, and (E) M5. Data presented available in S4 Data at https://zenodo.org/record/5006413.

    (PDF)

    S6 Fig

    Frequencies of Pearson correlation between simulated standardized mean species number per latitude (LDGcurve) against best matching empirical LDGcurve for each dynamic landscape L1 (in blue) and L2 (in pink) for models (A) M1, (B) M2, (C) M3, (D) M4, and (E) M5. Models M4 and M5 are the only ones producing correlations >0.5. Data presented available in S3 Data at https://zenodo.org/record/5006413.

    (PDF)

    S7 Fig. Effects of grid cell size on simulations of M2 L1.

    (A) Correlation of grid cell, LDG slope, and other summary statistics. (B) Simulated LDG slope and grid cell size, showing a significant effect of spatial resolution on LDG slope. Data presented available in S5 Data at https://zenodo.org/record/5006413. CPU, central processing unit; LDG, latitudinal diversity gradient.

    (PDF)

    S8 Fig. Frequencies of simulated normalized LDG slope (histogram) with empirical LDG for 4 main groups (dashed gray line) and acceptance range (red line).

    Frequencies for models (A) M1, (B) M2, (C) M3, (D) M4, and (E)M5 with total frequency and frequency discriminated for each landscape, i.e., L1 and L2. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient.

    (PDF)

    S9 Fig

    Normalized richness of (A) selected simulation, (B) terrestrial mammals, (C) birds, (D) amphibians, and (E) reptiles, with Pearson correlation values for comparisons between simulated and empirical data.

    (PDF)

    S10 Fig. Mean absolute evolutionary events (i.e., speciation and extinction) for every 1 myr for the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

    Data presented available in S6 Data at https://zenodo.org/record/5006413.

    (PDF)

    S11 Fig. Standardized speciation events for every 1 myr of the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

    Data presented available in S6 Data at https://zenodo.org/record/5006413.

    (PDF)

    S12 Fig. Standardized extinction events for every 1 myr of the top 7 best matching current spatial α-biodiversity simulations for each model with and without environmental dynamics.

    Data presented available in S6 Data at https://zenodo.org/record/5006413.

    (PDF)

    S13 Fig

    Correlation of model parameters and emerging patterns for all models and landscapes without deep-time environmental dynamics (A) M0 L1.0, (B) M0 L2.0, (C) M1 L1.0, (D) M1 L2.0, (E) M2 L1.0, and (F) M2 L2.0. Emerging patterns: (i) phylogeny beta is the phylogenetic tree imbalance statistic measured as the value that maximizes the likelihood in the β-splitting model; (ii) range quant 0.95% is the value of the 95% quantile of the species range area distribution; (iii) LDG % loss is the slope of the linear regression of species richness; (iv) richness r is the highest Pearson correlation between simulated and empirical α-diversity; (v) nLTT diff is the lowest difference between simulated and empirical nLTT curves; and (vi) LDG curve r is the highest Pearson correlation between simulated and empirical standardized mean species number per latitude. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

    (PDF)

    S14 Fig

    Correlation of model parameters and 3 emerging patterns for all models and landscapes considering deep-time environmental dynamics (A) M0 L1, (B) M0 L2, (C) M1 L1, (D) M1 L2, (E) M2 L1, and (F) M2 L2. Emerging patterns: (i) phylogeny beta is the phylogenetic tree imbalance statistic measured as the value that maximizes the likelihood in the β-splitting model; (ii) range quant 0.95% is the value of the 95% quantile of the species range area distribution; (iii) LDG % loss is the slope of the linear regression of species richness; (iv) richness r is the highest Pearson correlation between simulated and empirical α-diversity; (v) nLTT diff is the lowest difference between simulated and empirical nLTT curves; and (vi) LDG curve r is the highest Pearson correlation between simulated and empirical standardized mean species number per latitude. Data presented available in S3 Data at https://zenodo.org/record/5006413. LDG, latitudinal diversity gradient; nLTT, normalized lineage though time.

    (PDF)

    S15 Fig

    Results of the island case study showing (A) landscape size and environmental dynamics and (B) results of 3 experiments (i.e., lower, equal, and higher trait evolution compared with the temporal environmental variation). The time series in (B) shows γ richness (log10 scale) on theoretical oceanic islands, following the geomorphological dynamics of islands. Thick lines indicate the average of the replicates, whereas thin lines indicate SD envelopes (n = 30 for each trait evolutionary rate scenario). The dashed gray vertical bar crossing the entire plot indicates the period in which the island reaches its maximum size. Data presented available in S7 Data at https://zenodo.org/record/5006413.

    (PDF)

    S1 Note. Global case study: The emergence of the LDG in the Cenozoic.

    (DOCX)

    S2 Note. Island case study: Does trait evolution impact biodiversity dynamics?

    (DOCX)

    S3 Note. Gen3sis pseudo-code.

    (DOCX)

    Attachment

    Submitted filename: Response_Reviewers_gen3sis.docx

    Attachment

    Submitted filename: Response_Reviewers_gen3sis_R2.docx

    Attachment

    Submitted filename: Response_Reviewers_gen3sis_R3.docx

    Data Availability Statement

    Gen3sis is implemented in a mix of R and C++ code, and wrapped into an R-package. All high-level functions that the user may interact with are written in R, and are documented via the standard R / Roxygen help files for R-packages. Runtime-critical functions are implemented in C++ and coupled to R via the Rcpp framework. Additionally, the package provides several convenience functions to generate input data, configuration files and plots, as well as tutorials in the form of vignettes that illustrate how to declare models and run simulations. The software, under an open and free GPL3 license, can be downloaded from CRAN at https://CRAN.R-project.org/package=gen3sis. The development version, open to issue reporting and feature suggestions, is available at https://github.com/project-Gen3sis/R-package. Supporting information, such as notes, scripts, data, figures and animations, are available at https://zenodo.org/record/5006413, facilitating full reproducibility.


    Articles from PLoS Biology are provided here courtesy of PLOS

    RESOURCES