Skip to main content
Data in Brief logoLink to Data in Brief
. 2024 Apr 3;54:110384. doi: 10.1016/j.dib.2024.110384

A harmonized database of European forest simulations under climate change

Marc Grünig a,, Werner Rammer a, Katharina Albrich b, Frédéric André c, Andrey LD Augustynczik d, Friedrich Bohn e, Meike Bouwman f, Harald Bugmann g, Alessio Collalti h,i, Irina Cristal j, Daniela Dalmonech h,i, Miquel De Caceres k, Francois De Coligny l, Laura Dobor m, Christina Dollinger a, David I Forrester n, Jordi Garcia-Gonzalo j, José Ramón González j, Ulrike Hiltner g, Tomáš Hlásny m, Juha Honkaniemi b, Nica Huber g,w, Mathieu Jonard c, Anna Maria Jönsson o, Fredrik Lagergren o, Mats Nieberg p,q,r, Marco Mina s, Frits Mohren f, Christine Moos t, Xaxier Morin u, Bart Muys v, Mikko Peltoniemi b, Christopher PO Reyer p, Ilié Storms v, Dominik Thom a, Maude Toïgo u,x, Rupert Seidl a
PMCID: PMC11033166  PMID: 38646195

Abstract

Process-based forest models combine biological, physical, and chemical process understanding to simulate forest dynamics as an emergent property of the system. As such, they are valuable tools to investigate the effects of climate change on forest ecosystems. Specifically, they allow testing of hypotheses regarding long-term ecosystem dynamics and provide means to assess the impacts of climate scenarios on future forest development. As a consequence, numerous local-scale simulation studies have been conducted over the past decades to assess the impacts of climate change on forests. These studies apply the best available models tailored to local conditions, parameterized and evaluated by local experts. However, this treasure trove of knowledge on climate change responses remains underexplored to date, as a consistent and harmonized dataset of local model simulations is missing.

Here, our objectives were (i) to compile existing local simulations on forest development under climate change in Europe in a common database, (ii) to harmonize them to a common suite of output variables, and (iii) to provide a standardized vector of auxiliary environmental variables for each simulated location to aid subsequent investigations. Our dataset of European stand- and landscape-level forest simulations contains over 1.1 million simulation runs representing 135 million simulation years for more than 13,000 unique locations spread across Europe. The data were harmonized to consistently describe forest development in terms of stand structure (dominant height), composition (dominant species, admixed species), and functioning (leaf area index). Auxiliary variables provided include consistent daily climate information (temperature, precipitation, radiation, vapor pressure deficit) as well as information on local site conditions (soil depth, soil physical properties, soil water holding capacity, plant-available nitrogen). The present dataset facilitates analyses across models and locations, with the aim to better harness the valuable information contained in local simulations for large-scale policy support, and for fostering a deeper understanding of the effects of climate change on forest ecosystems in Europe.

Keywords: Process-based models, Vegetation dynamics, Europe's forests, Forest development, Forest structure, Forest composition, Forest functioning


Specifications Table

Subject Environmental Sciences: Ecological modeling
Specific subject area Harmonizing forest modeling simulations of climate change effects, process-based forest simulation models
Data format Raw, harmonized
Type of data Table, Database
Data collection Data were contributed from the European forest modeling community. Each contributor uploaded simulation data files and a metadata file containing information on the design and drivers of the simulation as CSV files to an R Shiny application. Upon submission, the data were stored in a designated folder with a unique identifier assigned to each contributor. Contributions were required to follow specific criteria, including process-based simulations and annual output information on key vegetation development indicators (i.e. proportion of tree species, canopy height, and leaf area index). Other requirements were that simulation outputs were on the level of tree species, as well as in the absence of disturbances and management (or business-as-usual management).
Data source location Technical University of Munich, TUM School of Life Sciences, Ecosystem Dynamics and Forest Management Group
Data accessibility Repository name: Data for: A harmonized database of European forest simulations under climate change
Data identification number: 10.5281/zenodo.10730807
Direct URL to data: https://zenodo.org/records/10730807

1. Value of the Data

  • Forest simulation model outputs from 17 different models were collected and harmonized. The dataset contains 1.1 million individual simulation runs over 135 million simulation years across 13,599 unique locations in Europe, covering large proportions of the climate and soil conditions in Europe's forests.

  • The database contains standardized output variables across all models. Specifically, harmonized simulation outputs are available for canopy height (structure), leaf area index (LAI; functioning) and tree species proportions (composition) at annual time step. To provide harmonized layers of context information for simulation results, we collated daily climate data for historic climate (1981–2005) and a set of climate change scenarios (2006–2100) and consistent soil proprieties for all simulations.

  • This is the first harmonized dataset of local forest model simulations at continental scale. The data collected here will support synthetic analyses of climate change impact on Europe's forests, and will facilitate comparative analyses across locations and models. Further, our dataset also helps to identify regions that remain underrepresented in model-based climate impact assessments and should thus be the focus of future studies.

2. Background

The objective was to collate projections on forest development under climate change derived from simulation models. Specifically, we compiled existing simulation data from previously conducted analyses using published models at the stand- to landscape-scale. Contributions to the dataset were made by several experts of the European forest modeling community, and all contributors are co-authors of this paper. Model outputs were compiled for three common state variables describing complementary aspects of forest ecosystems. By harmonizing the output variables across different models and adding standardized climate and soil data we created a novel, bottom-up dataset for broad-scale, multi-model assessments of climate change impacts of Europe's forests.

3. Data Description

The data are collected and stored in SQLite format. SQLite is a widely used open-source database format and can be accessed from all major data analysis platforms. One SQLite database contains the raw simulation outputs and a metadata table of all simulations including information about locations and harmonized soil conditions for those locations. This data follows the structure described in detail in the supplementary information (Tables S1 and S2). Simulation outputs with harmonized climate data are stored in one SQLite database per climate scenario. Tables in those databases follow the structure shown in Table 1. Further, a metadata table of all simulations, including information about locations and soil conditions for those locations is provided (Table S2).

Table 1.

Structure of the harmonized simulation database including standardized auxiliary data for all simulations. Discrete vegetation states were created by combining the three state variables and binning the respective continuous variables – see methods for details. Note that each row in Table 1 describes a column in the simulation database.

Section Column name Description
General SourceID ID to identify the source of the data (i.e. contributor).
SimulationID Unique numeric identifier of the simulation created by the contributor.
Year Year of the simulation starting with 1 or the calendar year (e.g., 2000).
Vegetation Discrete vegetation state Discrete state derived by combining the three state variables species composition, LAI class and dominant canopy height class (e.g. PIAB_3_20_22; see methods for details).
Dominant height In meters, harmonized (calculated from min, mean and max heights, details on the calculations are shown in the methods).
LAI Leaf Area Index (one-sided or projected) in m²/m².
Soil WHC Water holding capacity of the site (mm).
TextureSand % sand content of the soil.
TextureSilt % silt content of the soil.
TextureClay % clay content of the soil.
SoilDepth Depth of the plant-accessible soil (mm) without rocks (> 2 mm diameter).
AvailableNitrogen Plant available nitrogen (kg/ha/year).
Climate Scenario Combination of GCM and RCP from which daily data was obtained.
Temperature Columns “tas_1” to “tas_365” with daily mean temperature [°C].
Precipitation Columns “prec_1” to “prec_365” with daily precipitation [mm].
Radiation Columns “rad_1” to “rad_365” with daily radiation [W/m2].
Vapor pressure deficit (VPD) Columns “vpd_1” to “vpd_365” with daily vapor pressure deficit [kPa].

The database contains 1,117,453 simulation runs that together contain 135,375,583 simulation years. Simulations cover 13,599 unique locations across Europe and represent 92 tree species. Simulation data were provided from 19 research groups, using 17 different forest models. All simulations were created with locally tested and evaluated models that are well-documented and published in the peer-reviewed literature (Table 2). Note that as models are further developed over time, model versions used for the simulations may vary from the cited references in some cases. While all models contributed to the coverage of climate and soil conditions across Europe, their individual contributions varied in terms of geographic range and number of simulations provided. Likewise, for some models more data were available than for others. While the majority of simulations were run with iLand, MEDFATE simulations covered the largest climate and soil gradient. The simulations in the database consist of 90.5% climate change runs and 9.5% of the simulations were run under baseline or observed climate conditions. The database contains simulations from 12 stand-level models and four landscape-level models (Table 2, see Bugmann & Seidl, 2022 [1] for a review on modeling approaches). For balance between stand- and landscape-level simulations, a subset of 1-ha stands from the full landscape was sampled and used as individual simulation runs.

Table 2.

Models, number of simulations per model and their coverage of current climate and soil space of Europe. Percentage of climate and soil space refers to the area covered by the climate and soil space in which the model simulations are located. For this, we stratified the climate and soil space and checked which of the classes are covered by the simulations of each model. We then calculated the area covered by the classes that are represented by each model. Further details are described in the Experimental Design, Materials and Methods section. The Model type column distinguishes between stand-level (S) models and landscape-level (L) models.

MODEL OVERVIEW
MODEL Simulations % of all % clim % soil Model type Model reference
ILAND 821,979 73.2 21.4 43.1 L Seidl et al. 2012 [2]
4C 250,979 22.4 18.3 54.3 S Lasch-Born et al., 2020 [3]
MEDFATE 22,464 2.0 40.8 64.5 S De Cáceres et al., 2021 [4]
FORCLIM 9210 0.8 10.3 46.7 S Bugmann, 1996 [5]; Huber et al., 2021 [6]
LPJ-GUESS2.1 6156 0.6 15.5 46.6 L Smith et al., 2008 [7]
FORCEEPS 5040 0.5 4.8 6.7 S Morin et al., 2021 [8]
TREEMIG 2820 0.3 0.6 0.3 L Lischke et al., 2006 [9]
3PGN-BW 1428 0.1 40.0 59.8 S Augustynczik and Yousefpour, 2021 [10]
FORMIND 1008 0.1 24.2 6.0 S Fischer et al., 2016 [11]
3D-CMCC-FEM 367 < 0.1 13.1 7.3 S Collalti et al., 2018 [12]; Dalmonech et al., 2022 [13]
GOTILWA+ 336 < 0.1 10.1 7.2 S Nadal-Sala et al., 2013 [14]
SORTIE-ND 278 < 0.1 21.3 22.0 S Canham et al., 2005 [15]
PREBAS 252 < 0.1 6.6 6.2 S Minunno et al., 2019 [16]
3PGMIX 173 < 0.1 10.1 34.0 S Forrester and Tang, 2016 [17]
LANDSCAPEDNDC 38 < 0.1 11.3 7.2 L Haas et al., 2013 [18]
HETEROFOR 12 < 0.1 6.7 18.6 S Jonard et al., 2020 [19]

In geographic space, the simulation runs cluster in Central Europe, Spain (Catalonia), Finland and Sweden, i.e. areas that were analyzed particularly intensively in previous modeling studies (Fig. 1). To evaluate the proportion of the geographic area of Europe that was covered by the climate and soil conditions represented in the dataset, the climate and soil space (using the variables shown in Fig. 2) of the entire continent were stratified and strata in which simulations were located were obtained. As some of the strata cover more geographic area (i.e. climate and soil conditions that occur more often), a simulation in that strata could cover a larger percentage of climate and soil space than other simulations. The climate space covered by all simulation data spans 79% of Europe's geographic area and the covered soil space spanned 75.4% of the geographic area (Fig. 2). Areas not covered are mostly unforested regions in highly continental parts of eastern Europe (i.e. parts of Ukraine, western Romania), very warm regions in the Mediterranean (i.e. southern Spain, parts of Greece and Italy), and wet and very oceanic regions including large parts of the British Isles and southern Norway. For soil conditions, we find that mainly soils with very coarse or very fine texture, low nitrogen availability and low water holding capacity are not covered by simulations.

Fig. 1.

Fig. 1:

Locations of the simulations in geographical space. Simulations from different models are displayed with different colors and symbols. Note that the projection of the map is Lambert Azimuthal Equal-Area (LAEA), the true north can be identified by following the longitudinal lines plotted in grey at 0° and 20° East.

Fig. 2.

Fig. 2:

Climate and soil space covered by simulations. Climate space was stratified into unique combinations of mean annual temperature (MAT; ten classes; panel a), annual precipitation (ANP; 7 classes; panel c) and temperature seasonality (Seas; seven classes; panel e). Soil space was stratified to unique combinations of plant-available nitrogen (N; five classes, but 0–20 kg/ha/year is not covered; panel b), soil texture (classes from coarse (=1) to very fine (=5) according to the European Soil Database classification scheme for soil texture; panel d) and water holding capacity (WHC; five classes; panel f). Areas not covered by the data in this database are hatched. For more details and data sources see section 3.2.1 Climate and 3.2.2 Soil.

The database contains simulations for 92 species. While most species are represented in less than 1000 simulations (median 728), Fagus sylvatica, Picea abies, Larix decidua, and Pinus sylvestris are the most prevalent, each occurring in over 400,000 simulations (Fig. 3). Furthermore, the dataset contains simulations without forest management (39.2%) and simulations implementing common practices (60.8%).

Fig. 3.

Fig. 3:

Number of simulations in which the 20 most prevalent tree species in the dataset occur.

4. Experimental Design, Materials and Methods

4.1. Data collection

A central database of forest simulations was created from existing model outputs from previous simulation studies, collating information from a variety of stand- to landscape-scale forest simulation models across Europe. The focus lay on models that were locally evaluated, thus encapsulating the best available bottom-up understanding of quantitative forest ecosystem dynamics in Europe. Furthermore, the dataset was restricted to results from process-based models, as these are expected to be more robust under changing environmental conditions compared to purely empirical models [20]. Simulation outputs were provided by modeling groups in two files, a simulation data file and a metadata file containing information on the design and drivers of the simulation (Table S1 & S2). Forest simulations under climate change conditions were of particular interest, but simulation runs under baseline climate conditions were also included. The minimum requirements for contribution to the dataset were that the model conducting the simulation was process-based, and that simulation outputs provide annual information on basic indicators of vegetation development (i.e. proportion of tree species, canopy height, and leaf area index). While some models provide a broader range of output indicators, these three variables were chosen as least common denominator for describing forest composition, structure, and functioning. Furthermore, the simulations had to provide information at the level of individual tree species, models simulating plant functional types were not included. With regard to the initial conditions of the simulation, both generic initial conditions (such as simulations starting from bare ground) and runs initialized with the current state of the vegetation were included. Further, simulation runs were conducted either in the absence of management interventions or assumed business-as-usual management for the area under study. Natural disturbances were not considered in the simulations. Simulation data were compiled and analyzed at stand level, hence information from landscape-scale simulation models were considered as unique data vectors at the level of simulated stands (i.e., areas of homogeneous climate and soil conditions) within a landscape. The temporal extent of simulations was variable, and both historical and future time series were included.

4.2. Data harmonization

Simulation data went through initial checks to ensure that the metadata for each simulation were complete and IDs in the metadata table (Table S2) were matching the simulation data (Table S1). To harmonize simulation outputs across the different models and simulation studies we used discrete vegetation states as described by Rammer & Seidl (2019) [21]. Discrete vegetation states condense the complexity of forest vegetation by describing the structure, composition, and function of vegetation as distinct states. To that end, continuous variables were discretized, and a state was derived for every simulation year in the output of each model. With regard to stand composition, a species was categorized as dominant if it held more than 66% of a standʼs basal area. Admixed species were explicitly considered if their share was ≥20%. Based on these categories, a unique string was derived to describe species composition by concatenating the first two characters of the genus name and the first two characters of the species name (e.g. Pinus sylvestris to PISY), and combining all dominant and admixed species occurring in a stand. Letters for dominant species were capitalized, while admixed species were in lowercase letters (e.g. PISYfasy means Pinus sylvestris is the dominant species and has a share of more than 66%, with Fagus sylvatica as an admixed species with more than 20% of the basal area). To categorize ecosystem functioning, LAI (as a key indicator for the exchange of carbon, water, and energy of the system with the atmosphere) was grouped into three classes: class 1 for sparse (LAI < 2), class 2 for moderate (2 ≤ LAI ≤ 4), and class 3 for dense forest canopies (LAI > 4). For structure, the dominant height of the canopy was employed, as the vertical utilization of space is a key element of forest structure. Canopy height can further indicate the development stage of forest stands. The height information from different models (mean, minimum, maximum) was harmonized to dominant stand height. For stands dominated by Pinus sylvestris, Abies alba, Larix decidua, Picea abies, Fagus sylvatica and Quercus robur, allometric factors calibrated by Kahn (1994) [22] were applied to describe the relationship of stand dominant height to maximum tree height. For other species, a statistical model based on yield table data was derived to estimate dominant height from mean and maximum tree height [22]. Specifically, for 14 species, a linear mixed model with dominant height as dependent variable, mean and maximum height as predictor variable, and random slopes for the different species was calibrated (conditional R2 = 0.999; marginal R2 = 0.996). To make the analysis robust to individual outliers, dominant height was limited to lie between 0.8 times maximum height (lower limit) and maximum height (upper limit). Some models only provided information for mean height (but not maximum height), or for minimum and maximum height only (but not mean height). In the latter case, mean height was calculated as the arithmetic mean of minimum and maximum height. For simulations for which only mean height was available, maximum height was estimated using a linear mixed model. This model was calibrated on a random subsample of the dataset of all simulation data across all models that contained both maximum and mean height. In the linear mixed model, maximum height was used as the dependent variable, mean height as predictor variable and random effects for species were included (conditional R2 = 0.54; marginal R2 = 0.25). Maximum height was then predicted from mean heights with this model and subsequently calculated dominant height from mean and maximum height as described before. Subsequently, continuous information on dominant height was grouped into 2-m bins. Finally, the individual states for forest composition, structure, and function were combined into a unique string describing the state of the vegetation (e.g., PIABfasy_3_20_22 representing a stand dominated by Picea abies (PIAB) with admixed Fagus sylvatica (fasy) that has a dense forest canopy (LAI class 3) and a dominant height of between 20 and 22 m). This harmonization and discretization of the underlying simulation data resulted in a total of 18,598 distinct vegetation states being recorded in our database.

4.3. Auxiliary data

In addition to the harmonized simulation data and the respective metadata, a common data vector of auxiliary data was compiled. This vector contained standardized climate and soil data for each simulation, in order to facilitate the analysis and interpretation of the dataset.

4.3.1. Climate

To derive common and coherent climate data for all simulation runs included in the database, the simulation-specific climate time series (which was restricted to annual values for temperature and precipitation) provided by the modeling groups was matched with climate data from a common climate dataset. This approach allowed to address problems of different data resolution and representation in the data used for individual simulations (e.g., some models operate with monthly climate data while others use daily climate data, some models use maximum and minimum temperature as drivers while others use mean temperature, some models include climatic drivers beyond temperature and precipitation such as vapor pressure deficit (VPD) while others do not, some simulation runs were driven by detailed downscaled climate data while others used coarser-resolution climate information as input). As common climate database EURO—CORDEX climate data was harnessed for historical conditions as well as three RCP scenarios (RCP2.6, RCP4.5 and RCP8.5), each simulated with three global circulation models (MPI-M-MPI-ESM-LR, ICHEC-EC-EARTH, NCC—NorESM1-M, all downscaled with the SMHI-RCA4 RCM), resulting in 12 climate scenarios. The climate data was obtained in a 0.11° x 0.11° spatial resolution and daily temporal resolution from the Copernicus Climate Data Store. (https://cds.climate.copernicus.eu/cdsapp#!/dataset/projections-cordex-domains-single-levels?tab=overview). GCMs were selected to cover a broad gradient of temperature and precipitation conditions [23] representative for the current and future climatic conditions in Europe.

For the harmonized dataset, the goal was to obtain daily climate data from EURO—CORDEX for all simulations. To achieve this, the best matching EURO—CORDEX trajectory was assigned to each simulation and for each simulation year the daily data from the best matching year from that scenario was extracted. In more detail, information on the type of climate trajectory used for the simulation run (baseline run without climate change, SRES, or RCP scenario family) was obtained from the metadata of each simulation. Baseline conditions refer to historical time series, while SRES and RCP scenarios refer to simulations under future climate scenarios. The best matching EURO—CORDEX scenario was assigned to each simulation based on a comparison between slopes (i.e. temporal change) of temperature and precipitation used in the simulations with those calculated from EURO—CORDEX scenarios for the simulation location. The difference of the slope for temperature and precipitation was obtained and the overall difference was calculated as ∆T + 0.1* ∆P to choose the trajectory with the smallest overall difference. Whenever simulation metadata contained specific information on the GCM and/or RCP used for the climate forcing used in the simulation, this information was harnessed to limit selection options from the EURO—CORDEX database. For instance, if the climate trajectory of a simulation was run with an RCP2.6 scenario, the temperature and precipitation slopes were only compared with the three RCP2.6 scenarios contained in our EURO—CORDEX selection. The best matching trajectory was then adjusted to the mean temperature and precipitation level of the simulation. Specifically, the difference between mean temperatures of the simulation and climate scenario trajectory and the multiplicative difference for precipitation was added. This was necessary to make sure that the adjusted time series represented the climate used in the simulations, as they may differ due to the relatively coarse resolution of the gridded climate data. Next, a time series from the adjusted scenario data was constructed, matching the individual years to each year in the simulation. For this, an index combining the additive difference between mean annual temperatures and the multiplicative difference of annual precipitation (again with ∆T + 0.1* ∆P) was calculated for each pair of simulation data year and climate scenario year. To prevent multiple occurrences of the same year, one of the three best matching years was randomly sampled with replacement. Finally, the thus constructed daily climate time series data for all variables (temperature, precipitation, radiation and VPD) was stored along the simulation it represents.

4.3.2. Soil

Mirroring the approach taken with climate data, the simulation metadata provided by the modelling groups was facilitated to derive a consistent and quantitative soil data set across all simulations. While some simulation metadata contained exact numbers on all relevant soil variables considered (i.e. soil depth, soil texture (sand, silt, clay percentage), water holding capacity (WHC) and available nitrogen), others provided more descriptive values for water and nutrient conditions, such as soil water or fertility ratings (see Table S3). To complete missing data and convert descriptive ratings to quantitative soil characteristics pan-European datasets on soil information were leveraged. For soil depth, soil texture and WHC gridded data (1 km resolution) from the European Soil Data Center (ESDAC [24]) was obtained. Soil fertility was approximated by means of plant-available nitrogen, considering that nitrogen is the most important macro-nutrient for forests in Europe at the continental scale. Since data on plant-available nitrogen (i.e., the annual flux of mineralized nitrogen) is not available for large scales, we used a statistical approach to estimate mineralization rates based on fertility and climate data and combined them with data on nitrogen stocks. Specifically, the SQ1 nutrient availability map from the harmonized European soil database [25] was reclassified from the four classes of soil fertility considered therein to values of 100, 65, 40 and 20 kg/ha/year of plant-available nitrogen. In a next step, from the Soilgrids dataset [26] the nitrogen pool (in kg/m2) across Europe based on relative nitrogen content (g/kg) and bulk density (kg/m3) for soil layers up to 30 cm soil depth were obtained. The reclassified soil fertility map was divided by the nitrogen pool layer to approximate a coarse pseudo-mineralization rate. This pseudo-mineralization rate was then used as a dependent variable to calibrate a generalized linear model (GLM) with mean annual temperature, annual precipitation, temperature seasonality (obtained from CHELSA [27]; we used CHELSA data instead of EURO—CORDEX to create a layer of 1 km spatial resolution, matching the other soil layers) and soil pH (from Soilgrids) as predictor variables (D2 = 0.64). The calibrated model was used to project pseudo-mineralization rates for continental Europe at 1 × 1 km resolution providing more consistent estimates based on local soil and climate conditions. Finally, the nitrogen pool was multiplied with the modeled pseudo-mineralization rate to get an approximation of plant-available nitrogen.

4.4. Coverage of simulated data

The coverage of the current climate and soil space of Europe's forests by simulation data compiled here cover was investigated using the harmonized data (see Fig. 2). The historical climate was categorized into ten stratified bins of mean annual temperature (< −2 °C, −2 – 2 °C, 2–4 °C, 4 – 6 °C, 6 – 8 °C, 8 – 10 °C, 10 – 12 °C, 12 – 14 °C, 14 – 18 °C, > 18 °C), seven stratified bins of annual precipitation sum (< 400 mm, 400 – 550 mm, 550 – 750 mm, 750 – 1000 mm, 1000 – 1400 mm, 1400 – 2000 mm, > 2000 mm) and seven stratified bins of temperature seasonality (calculated as standard deviation of monthly mean temperature* 100, binned to 〈 200, 200 – 400, 400 – 500, 500 – 600, 600 – 700, 700 – 900, 〉 900). Simulations were assigned to their bins based on the climate grid cell (0.11°) of the simulation's location, and the area that is covered by the occupied bins was calculated. This approach likely underestimates the true climatic coverage of the simulations, as individual simulations (both stand-level and landscape-level) often encompass climatic gradients within a single cell. Soil conditions were categorized into five unique combinations of stratified values along the dimensions of soil texture (six classes from fine to coarse calculated with sand, silt and clay content), water holding capacity (<50 mm, 50 – 100 mm, 100 – 150 mm, 150 – 180 mm, >180 mm) and soil fertility (plant-available nitrogen of < 20, 20 – 40, 40 – 65, 65 – 100, >100 kg/ha/year). Again, each simulated location was located in the three-dimensional space of soil conditions to assess how well the compiled simulation data represented the soil conditions of Europe's forests.

Limitations

The majority of simulations were sourced from iLand model outputs (73.2%, as shown in Table 2), totaling over 800,000 entries stored in large tables, which were unwieldy to manage. To mitigate this, we divided the simulation data into smaller chunks with different unique identifiers. Furthermore, the imbalance in the number of simulations from the various models creates a bias that needs consideration during database utilization for analytical purposes.

Ethics Statement

We confirm to have read the ethical requirements for publication in Data in Brief and confirm that the current work does not involve human subjects, animal experiments, or any data collected from social media platforms.

Declaration of Generative AI and AI-Assisted Technologies in the Writing Process

During the preparation of this work the authors used ChatGPT in order to improve readability. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

CRediT authorship contribution statement

Marc Grünig: Conceptualization, Methodology, Software, Formal analysis, Data curation, Writing – original draft, Writing – review & editing, Visualization. Werner Rammer: Conceptualization, Methodology, Software, Formal analysis, Data curation, Writing – original draft, Writing – review & editing, Funding acquisition. Katharina Albrich: Data curation, Writing – review & editing. Frédéric André: Data curation, Writing – review & editing. Andrey L.D. Augustynczik: Data curation, Writing – review & editing. Friedrich Bohn: Data curation, Writing – review & editing. Meike Bouwman: Data curation, Writing – review & editing. Harald Bugmann: Data curation, Writing – review & editing. Alessio Collalti: Data curation, Writing – review & editing. Irina Cristal: Data curation, Writing – review & editing. Daniela Dalmonech: Data curation, Writing – review & editing. Miquel De Caceres: Data curation, Writing – review & editing. Francois De Coligny: Data curation, Writing – review & editing. Laura Dobor: Data curation, Writing – review & editing. Christina Dollinger: Data curation, Writing – review & editing. David I. Forrester: Data curation, Writing – review & editing. Jordi Garcia-Gonzalo: Data curation, Writing – review & editing. José Ramón González: Data curation, Writing – review & editing. Ulrike Hiltner: Data curation, Writing – review & editing. Tomáš Hlásny: Data curation, Writing – review & editing. Juha Honkaniemi: Data curation, Writing – review & editing. Nica Huber: Data curation, Writing – review & editing. Mathieu Jonard: Data curation, Writing – review & editing. Anna Maria Jönsson: Data curation, Writing – review & editing. Fredrik Lagergren: Data curation, Writing – review & editing. Mats Nieberg: Data curation, Writing – review & editing. Marco Mina: Data curation, Writing – review & editing. Frits Mohren: Data curation, Writing – review & editing. Christine Moos: Data curation, Writing – review & editing. Xaxier Morin: Data curation, Writing – review & editing. Bart Muys: Data curation, Writing – review & editing. Mikko Peltoniemi: Data curation, Writing – review & editing. Christopher PO Reyer: Data curation, Writing – review & editing. Ilié Storms: Data curation, Writing – review & editing. Dominik Thom: Data curation, Writing – review & editing. Maude Toïgo: Data curation, Writing – review & editing. Rupert Seidl: Conceptualization, Methodology, Writing – original draft, Writing – review & editing, Funding acquisition.

Acknowledgments

We acknowledge funding received from the European Union's Horizon 2020 research and innovation program under grant agreement no. 101000574 (RESONATE: Resilient forest value chains – enhancing resilience through natural and socio-economic responses). We further acknowledge support from the European Research Council under the European Union's Horizon 2020 research and innovation program (Grant Agreement 101001905).

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.dib.2024.110384.

Appendix. Supplementary materials

mmc1.docx (19.9KB, docx)

Data Availability

References

  • 1.Bugmann H., Seidl R. The evolution, complexity and diversity of models of long-term forest dynamics. J. Ecol. 2022;110(10):2288–2307. doi: 10.1111/1365-2745.13989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Seidl R., Rammer W., Scheller R.M., Spies T.A. An individual-based process model to simulate landscape-scale forest ecosystem dynamics. Ecol. Modell. 2012;231:87–100. [Google Scholar]
  • 3.Lasch-Born P., Suckow F., Reyer C.P., Gutsch M., Kollas C., Badeck F.W., Bugmann H.K., Grote R., Fürstenau C., Lindner M., Schaber J. Description and evaluation of the process-based forest model 4C v2. 2 at four European forest sites. Geosci. Model Dev. 2020;13(11):5311–5343. [Google Scholar]
  • 4.De Cáceres M., Mencuccini M., Martin-StPaul N., Limousin J.M., Coll L., Poyatos R., Cabon A., Granda V., Forner A., Valladares F., Martínez-Vilalta J. Unravelling the effect of species mixing on water use and drought stress in Mediterranean forests: a modelling approach. Agric. For. Meteorol. 2021;296 [Google Scholar]
  • 5.Bugmann H.K. A simplified forest model to study species composition along climate gradients. Ecology. 1996;77(7):2055–2074. [Google Scholar]
  • 6.Huber N., Bugmann H., Cailleret M., Bircher N., Lafond V. Stand-scale climate change impacts on forests over large areas: transient responses and projection uncertainties. Ecol. Appl. 2021;31(4):e02313. doi: 10.1002/eap.2313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Smith B., Prentice I.C., Sykes M.T. Representation of vegetation dynamics in the modelling of terrestrial ecosystems: comparing two contrasting approaches within European climate space. Glob. Ecol. Biogeogr. 2001:621–637. [Google Scholar]
  • 8.Morin X., Bugmann H., De Coligny F., Martin-StPaul N., Cailleret M., Limousin J.M., Ourcival J.M., Prevosto B., Simioni G., Toigo M., Vennetier M., Catteau E., Guillemot J. Beyond forest succession: a gap model to study ecosystem functioning and tree community composition under climate change. Funct. Ecol. 2021;35(4):955–975. [Google Scholar]
  • 9.Lischke H., Zimmermann N.E., Bolliger J., Rickebusch S., Löffler T.J. TreeMig: a forest-landscape model for simulating spatio-temporal patterns from stand to landscape scale. Ecol. Modell. 2006;199(4):409–420. [Google Scholar]
  • 10.Augustynczik A.L.D., Yousefpour R. Assessing the synergistic value of ecosystem services in European beech forests. Ecosyst. Serv. 2021;49 [Google Scholar]
  • 11.Fischer R., Bohn F., de Paula M.D., Dislich C., Groeneveld J., Gutiérrez A.G.…Huth A. Lessons learned from applying a forest gap model to understand ecosystem and carbon dynamics of complex tropical forests. Ecol. Modell. 2016;326:124–133. [Google Scholar]
  • 12.Collalti A., Thornton P.E., Cescatti A., Rita A., Borghetti M., Nolè A., Matteucci G. The sensitivity of the forest carbon budget shifts across processes along with stand development and climate change. Ecol. Appl. 2019;29(2):e01837. doi: 10.1002/eap.1837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dalmonech D., Marano G., Amthor J.S., Cescatti A., Lindner M., Trotta C., Collalti A. Feasibility of enhancing carbon sequestration and stock capacity in temperate and boreal European forests via changes to management regimes. Agric. For. Meteorol. 2022;327 [Google Scholar]
  • 14.Nadal-Sala D., Sabaté S., Gracia C. GOTILWA+: un modelo de procesos que evalúa efectos del cambio climático en los bosques y explora alternativas de gestión para su mitigación. Ecosistemas. 2013;22(3):29–36. [Google Scholar]
  • 15.Canham C.D., Murphy L.E., Papaik M.J. Institute of Ecosystem Studies. Millbrook; NY: 2005. SORTIE-ND: software for spatially-explicit simulation of forest dynamics. [Google Scholar]
  • 16.Minunno F., Peltoniemi M., Härkönen S., Kalliokoski T., Makinen H., Mäkelä A. Bayesian calibration of a carbon balance model PREBAS using data from permanent growth experiments and national forest inventory. For. Ecol. Manag. 2019;440:208–257. [Google Scholar]
  • 17.Forrester D.I., Tang X. Analysing the spatial and temporal dynamics of species interactions in mixed-species forests and the effects of stand density using the 3-PG model. Ecol. Modell. 2016;319:233–254. [Google Scholar]
  • 18.Haas E., Klatt S., Fröhlich A., Kraft P., Werner C., Kiese R.…Butterbach-Bahl K. LandscapeDNDC: a process model for simulation of biosphere–atmosphere–hydrosphere exchange processes at site and regional scale. Landsc. Ecol. 2013;28(4):615–636. [Google Scholar]
  • 19.Jonard M., André F., De Coligny F., De Wergifosse L., Beudez N., Davi H.…Vincke C. HETEROFOR 1.0: a spatially explicit model for exploring the response of structurally complex forests to uncertain future conditions–Part 1: carbon fluxes and tree dimensional growth. Geosci. Model Dev. 2020;13(3):905–935. [Google Scholar]
  • 20.Gustafson E.J. When relationships estimated in the past cannot be used to predict the future: using mechanistic models to predict landscape ecological dynamics in a changing world. Landsc. Ecol. 2013;28:1429–1437. [Google Scholar]
  • 21.Rammer W., Seidl R. A scalable model of vegetation transitions using deep neural networks. Methods Ecol. Evol. 2019;10(6):879–890. doi: 10.1111/2041-210X.13171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kahn M. Modellierung der Höhenentwicklung ausgewählter Baumarten in Abhängigkeit bom Standort. Forstwissenschaftliche Fakultet der Universität der Universität München und Bayerische forstliche Versuchs-und Forschungsanstalt. 1994 [Google Scholar]
  • 23.Parding K.M., Dobler A., McSweeney C.F., Landgren O.A., Benestad R., Erlandsen H.B.…Loukos H. GCMeval–An interactive tool for evaluation and selection of climate model ensembles. Clim. Serv. 2020;18 [Google Scholar]
  • 24.Hiederer R. Luxembourg: Publications Office of the European Union –2013 –47pp. – EUR26082EN Scientific and Technical Research series; 2013. Mapping Soil Properties for Europe - Spatial Representation of Soil Database Attributes. ISSN 1831-9424. [DOI] [Google Scholar]
  • 25.Fischer G., Nachtergaele F., Prieler S., van Velthuizen H.T., Verelst L., Wiberg D. 2008. Global Agro-ecological Zones Assessment for Agriculture (GAEZ 2008) [Google Scholar]
  • 26.Hengl T., Mendes de Jesus J., Heuvelink G.B., Ruiperez Gonzalez M., Kilibarda M., Blagotić A.…Kempen B. SoilGrids250m: global gridded soil information based on machine learning. PLoS One. 2017;12(2) doi: 10.1371/journal.pone.0169748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Karger D.N., Conrad O., Böhner J., Kawohl T., Kreft H., Soria-Auza R.W.…Kessler M. Climatologies at high resolution for the earth's land surface areas. Sci. Data. 2017;4(1):1–20. doi: 10.1038/sdata.2017.122. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (19.9KB, docx)

Data Availability Statement


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES