Skip to main content
Scientific Data logoLink to Scientific Data
. 2023 Sep 14;10:624. doi: 10.1038/s41597-023-02489-1

A pseudoproxy emulation of the PAGES 2k database using a hierarchy of proxy system models

Feng Zhu 1,, Julien Emile-Geay 2, Kevin J Anchukaitis 3,4, Nicholas P McKay 5, Samantha Stevenson 6, Zilu Meng 7
PMCID: PMC10502095  PMID: 37709805

Abstract

Paleoclimate reconstructions are now integral to climate assessments, yet the consequences of using different methodologies and proxy data require rigorous benchmarking. Pseudoproxy experiments (PPEs) provide a tractable and transparent test bed for evaluating climate reconstruction methods and their sensitivity to aspects of real-world proxy networks. Here we develop a dataset that leverages proxy system models (PSMs) for this purpose, which emulates the essential physical, chemical, biological, and geological processes that translate climate signals into proxy records, making these synthetic proxies more relevant to the real world. We apply a suite of PSMs to emulate the widely-used PAGES 2k dataset, including realistic spatiotemporal sampling and error structure. A hierarchical approach allows us to produce many variants of this base dataset, isolating the impact of sampling bias in time and space, representation error, sampling error, and other assumptions. Combining these various experiments produces a rich dataset (“pseudoPAGES2k”) for many applications. As an illustration, we show how to conduct a PPE with this dataset based on emerging climate field reconstruction techniques.

Subject terms: Palaeoclimate, Palaeoceanography

Background & Summary

Reconstructions of past climate have become integral to climate assessments1. Such reconstructions employ a wide variety of mathematical techniques, ranging from purely statistical2 to data assimilation techniques that fuse observations and model output320. To establish their relative merits, these reconstructions must be benchmarked against reference datasets. This is routinely done on subsets of the instrumental period using cross-validation, but such efforts tend to underestimate the true spread of reconstructions in the pre-instrumental era21, indicative of overfitting.

While pre-instrumental intercomparisons of reconstruction methods have occasionally been carried out with real-world proxy observations22,23, such efforts are fundamentally limited by the lack of a true benchmark: pre-instrumental climates were not, by definition, observed directly, so these intercomparisons can only inform on convergence or divergence, but cannot provide any metric of their closeness to the true climate.

To sidestep this hurdle, pseudoproxy experiments (PPEs) have long been used as a laboratory to benchmark climate reconstruction methods. The heart of PPEs is to start from the output of long integrations of a global climate model and to apply mathematical transformations to this output to mimic the processes whereby paleoclimate proxies register these climate variations in space and time24. Because the original climate is specified, and sampled perfectly in space and time, the ability of a reconstruction to recover this climate is known. Moreover, as the generating process of these “pseudoproxies” is specified, it can be manipulated to yield insights into the sources of uncertainty contributing to reconstruction error. While simple PPE designs are informative, the more realistic the target climate and pseudoproxy generation process, the more relevant this benchmark becomes, so there is considerable potential in this avenue of research16,2531.

Initial work used the simplistic assumption that paleotemperature proxies were a linear superposition of local temperature and Gaussian (white) noise, sampled uniformly in time3234. Over time, more realistic pseudoproxy constructions were developed, involving other climate variables, more elaborate noise models, realistic spatiotemporal sampling, and noise levels approximating real proxy networks16,28,35,36. Recent work29 has leveraged more realistic proxy system modeling (PSM) frameworks3742 to capture the essential physical, chemical, biological and geological processes that translate climate signals into the paleoclimate records that form the basis of climate reconstruction efforts (e.g. ref. 43). However, such models have yet to gain widespread use, so even recent efforts have sometimes employed a simplistic “temperature + noise” pseudoproxy design16,29,44.

The PAGES 2k Phase 2 global multi-proxy database (Fig. 1) has been widely used for studies of Common Era climate since its release43. It has played a central role for investigating the multi-decadal and longer-term surface temperature variability45,46 and the spatiotemporal temperature patterns of various climatic epochs23,47 over the Common Era. In addition, it has served as the principal data source for the latest version 2.1 of the Last Millennium Reanalysis (LMR) products48, and has become a common network template for pseudoproxy studies31,49. However, these PPE related studies used only a partial network and employed a simplistic “temperature + noise” design, and a systematic pseudoproxy emulation of the PAGES 2k network has yet to be produced. The PAGES 2k Phase 2 has a number of known biases that present challenge to global annual mean temperature reconstruction50, which need to be rigorously evaluated.

Fig. 1.

Fig. 1

The PAGES 2k Phase 2 network43.

Here, we do so by generating a pseudoproxy dataset that: (i) emulates the majority of the PAGES 2k Phase 2 network43, (ii) employs a more realistic data-generating mechanism with proxy system models (PSMs) and isotope-enabled climate model simulations, and (iii) explicitly separates sensor, archive, and observational effects. By combining various pseudoproxy designs, noise levels, and spatiotemporal sampling scenarios, we generate many digital avatars of the PAGES 2k network, supporting the evaluation of climate reconstruction methods in a wide variety of settings. To illustrate the use of this dataset, we show its application to a suite of climate field reconstructions10,48,51.

Methods

Reference climate

Our base climate utilizes the “iCESM1” last millennium simulation (iCESM-LM hereafter) generated by the isotope-enabled Community Earth System Model (iCESM)52. As an addition to the standard CESM, iCESM simulates the isotopic water fluxes transported between its five major isotope-enabled components, including the atmosphere model iCAM, the land model iCLM, the ocean model iPOP, the sea ice model iCICE, and the river runoff model iRTM. The atmosphere model iCAM tracks water tracers and isotopes in all phases through processes such as surface fluxes, boundary layer mixing, cloud physics, convection, and advection, and simulates precipitation δ18O variability with high fidelity53. The land model iCLM considers the water vapor flux and isotope fractionation in vegetated land surfaces54. Main processes include water isotope exchanges among soil, spaces under and above canopy, and leaves. The land and vegetation types and amount of canopy use a modern climatological mean with a constant seasonal cycle55. The ocean model iPOP transports water isotopes passively through resolved flow and parameterized turbulence, and the simulated seawater δ18O is validated under present-day climate conditions56. The sea ice model iCICE simulates the sinks of the isotopic water mass through melting and sublimation processes, and the sources through snowfall, sea ice growth, and vapor condensation52. All components coupled together provide a plausible simulation of the water isotope fields.

The iCESM-LM simulation applies the transient external forcings following the same setup for the CESM Last Millennium Ensemble (CESM-LME)57. The solar forcing comes from the total solar irradiance reconstruction by Vieira et al.58 patched with spectral variations from Schmidt et al.59. The last millennium volcanic forcing is based on the ice core-based index by Gao et al.60, while for the historical period, an eruption dataset by Ammann et al.61 is adopted. The greenhouse gas forcing, namely the concentrations of the main long-lasting greenhouse gases (i.e., CO2, CH4, N2O), are derived from Antarctic ice core analyses by Schmidt et al.59. For the land use and land cover boundary conditions, the reconstruction by Pongratz et al.62 and that by Hurtt et al.63 are merged together to yield a consistent land use change. The orbital forcing is computed in the model based on Berger et al.64. The ozone forcing comes from the Whole Atmosphere Community Climate Model (WACCM) and the prescribed aerosol forcing are applied only over the historical period. For more details, please refer to CESM-LME57.

Proxy network

Figure 1 shows the PAGES 2k Phase 2 Network43. It consists of 692 records from 648 globally distributed sites, archived in trees, corals and sclerosponges, marine sediments, lake sediments, glacier ice, documentary sources, speleothems, boreholes, bivalves, and hybrid records. Each archive includes single or multiple observation types, among which tree ring width (TRW), maximum latewood density (MXD), coral and sclerosponge δ18O and Sr/Ca, lake varve thickness, and ice core δ18O are essential to Common Era temperature reconstructions (e.g., LMR) and their PSMs have been developed by recent efforts already38. We therefore focus on these proxy types, and generate their emulations to form our pseudoproxy network (Fig. 2). For proxy sites located within the same model grid cell, the input climate signals are the same, while the generation mechanisms vary according to their proxy types.

Fig. 2.

Fig. 2

The spatiotemporal availability of the PAGES 2k pseudoproxy network with realistic and full temporal availability.

Proxy system modeling

Following the proxy system modeling framework37, we build our pseudoproxy network based on the iCESM output, leveraging the PSMs from the PRYSM toolbox38 and the CFR codebase65. The concept of PSMs encompasses both geophysical/chemical/biological process-based models, as well as statistical models; both can be either linear or nonlinear. In this study, both categories of PSMs have been adopted, depending on availability. A given PSM can only be applied if its inputs are within the scope of the available climate variables. In addition, the more complex the PSM, the more parameters it contains, and these parameters must generally be fitted to modern observations, lest they introduce additional sources of uncertainty.

As in all modeling endeavors, the choice of PSM is therefore a trade-off between “sins of omission” (excessive simplicity) and “sins of commission” (excessive complexity). The present dataset used the most complex PSMs where justified by scientific understanding and available data. When these conditions were not met, simpler PSMs were selected to avoid sins of commissions or logistical hurdles (e.g. model fields available at too coarse a resolution).

Statistical PSMs, although highly idealized, are still based on scientific understanding of the geophysical/chemical/biological processes leading to the transduction of climate signals into proxy archives. As shown in Tardif et al.48, even linear, statistical PSMs for tree-ring width that include bivariate and seasonal dependence can yield vastly more realistic results than the traditional fitting to annual temperature.

Forward modeling of tree ring width (TRW)

Tree-ring width (TRW) is a major observation source to investigate the Common Era climate. In the PAGES 2k database, TRW represents the largest network with 354 records, most of which are located in the Northern Hemisphere. Depending on the location and species, TRW chronologies may record not only temperature variations but also moisture conditions, although the climatic signals can be modulated by biological memory effects49,50,6672. The relationship between TRW and the environmental variables is thus complex, and TRW PSMs with various complexity levels have been developed since 2000, including TREERING200073, Vaganov-Shashkin (VS)74 and its simplified version VS-Lite7577, MAIDEN (Modeling and Analysis In DENdroecology)78,79, and even the land model ORCHIDEE (ORganizing Carbon and Hydrology In Dynamic EcosystEms)80.

This work used VS-Lite developed by Tolwinski-Ward et al.7577 to generate our pseudo-TRW network because of its overall skill79,81, simplicity, and capacity to be widely applied to the PAGES 2k sites. VS-Lite takes monthly temperature and precipitation signals as input, and emulates a threshold-dependent tree-ring monthly growth response to the climate with piece-wise linear growth response functions (Eq. (1)) determined by four parameters: the lower and upper thresholds for temperature and soil moisture, respectively:

gV(V)=0ifVV1(VV1)/(V2V1)ifV1VV21ifV2V, 1

where V represents T (temperature) or M (moisture). The overall growth response is then the minimum of these two response functions modulated by the insolation-based growth response (gE) (Eq. (2)), which is determined by the latitude of the site, and the final output TRW is the standardized series of the annual integration of the monthly growths, with an error term added on (Eq. (3)):

g=gEmin{gT,gM}, 2
TRW=standardizetstegdt+ζ, 3

where t represents time in month, ts and te denote the window for the integration, and ζ is a pink noise term (i.e. a stochastic process with a spectral density S(f)fβ, with β a positive constant). Setting ts < 0 (a specific month in the previous year) can help mimic the biological memory effect or other unaccounted for sources of low frequency variability in TRW66,69,72,8285. Following ref. 75, we set ts = −4 and ts = 12, which represents an integration window from September of the previous year to December of the current year for the Northern Hemisphere, and from March of the current year to June of the next year for the Southern Hemisphere. The pink noise term is added to further mimic other non-climatic processes such as the detrending process of TRW records, following the formulation of colored noise proposed in ref. 86 with tuned spectral scaling slope87,88 β = 2 and SNR = 1 (signal-to-noise ratio defined in standard deviation24,26,89). Without this term, the scaling slope of the simulated TRWs will be significantly flatter than that observed in the real records, especially for the low-frequency band40. The need for this can be viewed in two ways: on one hand, it suggests that tree-ring width records in the PAGES2k database contain more low-frequency than expected from the climate signal and simple persistence structure present in VS-Lite alone, perhaps due to data processing (detrending and standardization) or unaccounted for biological or ecological processes. Alternatively, this can be seen as a result of a “sin of omission” in VS-Lite and an incomplete mimic of the full range of biological processes important for the autocorrelation structure of temperature-sensitive tree-ring series.

The four threshold parameters T1, T2, M1, M2 are crucial to the behavior of the model. We calibrate them against the CRUTS monthly temperature and precipitation observations90 version 4.05, using the Bayesian inference method elaborated in ref. 76. This essentially generates optimal posterior probability distributions for each threshold parameter by updating the prior distributions over Monte-Carlo iterations, and yields the estimate of each parameter following maximum likelihood estimation (MLE). With the calibrated parameters, iCESM-simulated monthly temperature and precipitation signals can be translated to the corresponding pseudo-TRW series. An example of the generated pseudo-TRW chronology and its comparison to the real-world counterpart in time and frequency domains is shown in Fig. 3.

Fig. 3.

Fig. 3

The dashboard for the tree ring width record “NAm_136” in dataset “ppwn_SNRinf_rta”. The unit “NA” stands for “not applicable” as the variable is a standardized index and thus unitless. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Forward modeling of maximum latewood density (MXD)

Compared to TRW, maximum latewood density (MXD) more faithfully tracks growing season temperature history without distortions due to biological memory effects49,50,68,69,84,9195. As there is not yet a published, tractable proxy system model for MXD, here we use a simple univariate linear model to emulate the behavior of MXD series:

MXD=aTseasonal+b, 4

where a represents a linear slope factor, Tseasonal the average temperature over the growing season, and b the intercept. The growing season is calibrated against the CRUTS dataset, version 4.05. Following ref. 48, the season that yields the optimal regression skill is picked from an expert-curated pool of growing season candidates, including the default calendar year option (Jan-Dec) and variants of warm seasons (i.e., Jun-Aug, Mar-Aug, Jun-Nov for Northern Hemisphere, and Dec-Feb, Dec-May, Sep-Feb for Southern Hemisphere) during which trees are expected to grow. An example of a generated pseudo-MXD chronology is shown in Fig. 4.

Fig. 4.

Fig. 4

The dashboard for the maximum latewood density record “NAm_134” in dataset “ppwn_SNRinf_rta”. The unit “NA” stands for “not applicable” as the variable is a standardized index and thus unitless. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Forward modeling of coral and sclerosponge δ18O

In contrast to trees, corals and sclerosponges mainly cover the tropical ocean regions and are thus of great importance to investigating tropical climate variability, including El Niño Southern Oscillation (ENSO)17,96101. Following Brown et al.102, we use a bilinear model to simulate coral and sclerosponge δ18O based on the annual sea surface temperature (SST) and seawater δ18O (denoted as δ18Osw) signals:

δ18O=aSST+bδ18Osw, 5

where a = −0.22 represents the linear slope factor, and b = 0.97002 the conversion factor from VSMOW to VPDB. Thompson et al.103 state that since the δ18Osw network is scarce, they use sea surface salinity (SSS) to estimate δ18Osw. However, a salinity-based PSM is reliant on the SSS/δ18Osw relationships that are known to be nonstationary and are based on extremely limited observational data104; the original formulation based on δ18Osw is thus preferable given that the iCESM output is leveraged. An example of the generated pseudo-coral/sclerosponge δ18O chronology is shown in Fig. 5.

Fig. 5.

Fig. 5

The dashboard for the coral δ18O record “Ocn_075” in dataset “ppwn_SNRinf_rta”. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Forward modeling of coral and sclerosponge Sr/Ca

The skeletal trace element ratio Sr/Ca in corals and sclerosponges has a straightforward temperature interpretation. Following Corrège et al.105, we apply a simple univariate linear model based on the annual SST signal, but with fixed parameters:

Sr/Ca=aSST+b, 6

where a represents the linear slope factor with a Gaussian distribution with mean of −0.06 and standard deviation of 0.01, and b is the intercept with a mean value around 10.553 based on Table 1 in Corrège et al.105. In this study, we take a = −0.06 and b = 10.553. An example of the generated pseudo-coral/sclerosponge Sr/Ca chronology is shown in Fig. 6.

Fig. 6.

Fig. 6

The dashboard for the coral Sr/Ca record “Ocn_067” in dataset “ppwn_SNRinf_rta”. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Forward modeling of ice core δ18O

Glacier ice cores mainly cover the polar and mountain regions, where trees cannot grow. They are usually well-preserved and span a long time interval with annual time resolution, and are important to investigate long-term climate change. For ice core δ18O, we apply the corresponding module in the PRYSM toolbox38, which is based on the work of Johnsen106, Whillans and Grootes107, Cuffey and Steig108, Johnsen et al.109, and Küttel et al.110.

Its sensor model takes precipitation-weighted δ18O to emulate the δ18O input to ice:

δ18Oweighted=pδ18Op/p, 7

where p represents the monthly precipitation amount, and δ18Op the precipitation δ18O. The precipitation-weighted δ18O is then corrected based on the elevation difference between the proxy site and its nearest model grid point with a rate of −0.25 per 100 meters. Next, its archive model emulates the compaction and diffusion processes of isotopes in ice via a convolution with a Gaussian kernel G:

δ18Oice=Gδ18Oweighted 8

An example of the generated pseudo-ice core δ18O chronology is shown in Fig. 7.

Fig. 7.

Fig. 7

The dashboard for the ice core δ18O record “Arc_029” in dataset “ppwn_SNRinf_rta”. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Forward modeling of lake varve thickness

Varves, or annually laminated sediments, can be valuable temperature proxies for the Common Era due to their high-resolution and because they can be found in areas where other annually-resolved archives are absent. Varve thickness or mass accumulation rate are directly related to sediment input and deposition, which in turn can be strongly related to climate in some lakes, however many phenomena can affect varve thickness, and the relationship between climatic and environmental drivers and varve thickness is often complex and typically varies from lake to lake111. Temperature-driven varve thickness records are most common in the Arctic, where summer temperature can have strong and direct impacts on sediment transportation by melting winter snowpack and glaciers and extending the ice-free season.

The PAGES 2k Phase 2 database includes eight sites with varve thickness records interpreted to respond to temperature. Mechanistically simulating varve thickness is complex, highly site-specific, and not practical for most PPE studies. Nevertheless, most varve thickness records share characteristics that are readily and simply simulated. There are two key processes that we simulate. First, because varve thickness measures a depositional process, the distribution of varve thickness is zero-bounded and right-skewed, and is appropriately modeled with a Poisson or Gamma distribution112. Second, varve thickness records typically include substantial year-to-year memory. Unlike most sedimentary records, this is not due to bioturbation or post-depositional mixing (as this would destroy the laminations). However, glacial and sedimentary processes in the watershed and in the lake can be prone to significant memory, which affects the spectral characteristics of varve thickness records.

Here, we apply a simple model as below:

thickness=ΓTseasonal+Γ(b), 9

where Γ(·) represents a mapping from the original distribution to a Gamma distribution, Tseasonal is the seasonally-averaged temperature calculated based on the seasonality metadata of each site (Table 1)43, and b is a realization of fractional Brownian motion with Hurst index H = 0.75 and SNR = 1, a combination we find fits well with the real records. An example of the generated pseudo-lake varve thickness chronology is shown in Fig. 8.

Table 1.

The seasonality of each lake varve thickness site.

PAGES2k Site ID Seasonality
Arc_001 Jun - Aug
Arc_014 Dec - Mar
Arc_020 Jun - Aug
Arc_022 Jun - Aug
Arc_025 Jun - Aug
Arc_030 Jul - Sep
Arc_072 Mar - Oct
Arc_076 Jun
Fig. 8.

Fig. 8

The dashboard for the lake varve thickness record “Arc_025” in dataset “ppwn_SNRinf_rta”. “PSD” refers to power spectral density and is in the unit of power (squared unit of the proxy variable) per year.

Pseudoproxy production workflow

Figure 9 shows the general procedure for pseudoproxy generation. The starting point is the isotope-enabled Community Earth System Model (iCESM) last millennium plus historical simulation52 (Section Reference Climate), chosen so that the isotope-related proxies can be simulated with minimal assumptions.

Fig. 9.

Fig. 9

Flow chart of the general procedure for pseudoproxy generation.

Environmental variables are taken from the iCESM output, including air surface temperature, precipitation rate, sea level pressure, precipitation δ18O, seawater δ18O, and sea surface temperature (SST). Proxy metadata are taken from the PAGES 2k dataset (Section Proxy Network), including the location information, time axis, archive type, sensor, species, seasonality, etc.

These two sources of information (environmental variables and proxy metadata) are then fed to the PSMs for tree-ring width (TRW), maximum latewood density (MXD), coral/sclerosponge δ18O, coral/sclerosponge Sr/Ca, lake varve thickness, and ice core δ18O, which translate the climatic signals and generate the raw output in proxy space (Section Proxy Modeling).

The raw output is then treated as signal, and white noise is added with a set of signal-to-noise ratio (SNR, defined in standard deviation24,26,89) options (∞, 10, 2, 1, 0.5, 0.25)22,28, where SNR = ∞ is a noise-free case, SNR = 1 means that the signal and noise share an equal standard deviation, etc. We generate datasets with two types of temporal sampling: (1) full annual sampling over 850–2005 CE, and (2) the realistic temporal availability of each record (Fig. 2).

Because iCESM is a biased representation of reality, the pseudoproxies generated by this workflow inherit some of the same biases in low-order statistics like mean and variance. To facilitate comparison with real-world records, we apply a bias correction and variance matching against the real records, according to the mean and variance of the real proxy measurements over the common timespan to the pseudoproxy counterpart. Note that this step shifts and scales the time series, but has no impact on the statistical distribution (e.g., Gaussian or Gamma), nor the spectral characteristics (i.e., scaling slopes and peaks) of the simulated proxies.

As a benchmark, we also generate pseudoproxies following the traditional temperature-plus-noise method: the temperature signal at the grid cell nearest each proxy site is added with white noise with the same set of SNR options and the same two types of temporal sampling.

This workflow results in multiple pseudoproxy emulations of the PAGES 2k network, differing in:

design either “temperature-plus-white-noise” model (tpwn) or using the pseudoproxy models described above, with added white noise (ppwn)

noise level as quantified by the SNR of ∞ (pure signal), 10, 2, 1, 0.5 or 0.25.

temporal sampling either full annual sampling over 850–2005 CE (fta), or the realistic temporal availability of Fig. 2 (rta).

Data Records

Table 2 lists the pseudoproxy datasets generated in this study, which we call “pseudoPAGES2k”. The dataset IDs indicate the property of each dataset. For instance, “tpwn_SNR10_fta” means that the pseudoproxies are generated with the temperature-plus-white-noise method with SNR equals to 10 and full temporal availability, while “ppwn_SNR0.5_rta” means that the pseudoproxies are generated via the PSM hierarchy with white noise added on and SNR equals to 0.5, and the realistic temporal availability is applied, and so on and so forth. The datasets are archived at Zenodo113 (10.5281/zenodo.7652533).

Table 2.

A list of the “pseudoPAGES2k” pseudoproxy datasets.

Dataset ID Signal Noise SNR Sampling
ppwn_SNRinf_rta PSM generated none inf realistic
ppwn_SNR10_rta PSM generated white noise 10 realistic
ppwn_SNR2_rta PSM generated white noise 2 realistic
ppwn_SNR1_rta PSM generated white noise 1 realistic
ppwn_SNR0.5_rta PSM generated white noise 0.5 realistic
ppwn_SNR0.25_rta PSM generated white noise 0.25 realistic
ppwn_SNRinf_fta PSM generated none inf full
ppwn_SNR10_fta PSM generated white noise 10 full
ppwn_SNR2_fta PSM generated white noise 2 full
ppwn_SNR1_fta PSM generated white noise 1 full
ppwn_SNR0.5_fta PSM generated white noise 0.5 full
ppwn_SNR0.25_fta PSM generated white noise 0.25 full
tpwn_SNR10_rta nearest temperature white noise 10 realistic
tpwn_SNR2_rta nearest temperature white noise 2 realistic
tpwn_SNR1_rta nearest temperature white noise 1 realistic
tpwn_SNR0.5_rta nearest temperature white noise 0.5 realistic
tpwn_SNR0.25_rta nearest temperature white noise 0.25 realistic
tpwn_SNR10_fta nearest temperature white noise 10 full
tpwn_SNR2_fta nearest temperature white noise 2 full
tpwn_SNR1_fta nearest temperature white noise 1 full
tpwn_SNR0.5_fta nearest temperature white noise 0.5 full
tpwn_SNR0.25_fta nearest temperature white noise 0.25 full

“SNR” refers to signal-to-noise ratio defined in standard deviation24,26,89. The full sampling refers to 850–2005 CE with annual resolution, and the realistic sampling refers to the realistic temporal availability of each record.

The “iCESM1” last millennium simulation (iCESM-LM) used in this study can be accessed at a data server hosted by Rorbert Tardif at University of Washington (https://atmos.washington.edu/~rtardif/LMR/prior).

The PAGES 2k Phase 2 Network used in this study can be accessed at the National Center for Environmental Information’s World Data Service for Paleoclimatology (https://www.ncei.noaa.gov/access/paleo-search/study/21171).

Technical Validation

To verify if the generation procedure (Fig. 9) yields a realistic pseudoproxy emulation of the PAGES 2k database, we validate the generated pseudoproxies against the original records’ statistics, in both time and frequency domains. We emphasize that this is a validation specifically of the realism (and therefore utility) of the pseudoproxy generation procedure, rather than an evaluation of any single PSM or GCM, which has been done elsewhere. Rather, we aim to show that, coupling these models–imperfect though they may be–can produce pseudoproxies that emulate key characteristics of the target series. In the time domain, a good pseudoproxy emulation should reproduce the probability distribution shape of the real proxies; this may be assessed via split violin plots. In the frequency domain, a good emulation should reproduce the power spectral density (PSD) of the target series, indicating the energy partitioning per frequency interval, particularly the periodic and continuum114 characteristics of the series.

Figures 3 to 8 show examples for specific records, one site per proxy type. Since the real records may be unevenly-spaced in time, we leverage the Weighted Wavelet Z-transform (WWZ) method implemented in Pyleoclim115, to obtain the PSD curves. As illustrated by the PSD plots and the probability distribution plots, the pseudoproxies show an overall good agreement with the real records, including, for instance, the steep attenuation of high-frequencies in the ice core δ18O record shown in Fig. 7, and the long tail distribution of the varve thickness record shown in Fig. 8. To validate thoroughly the spectral characteristics, Fig. 10 shows the spectral analysis by proxy types. It can be seen that overall good agreement is achieved between the pseudoproxies and their real counterparts, indicating a realistic emulation from the spectral perspective. This should result in more realistic assessments of the spectral characteristics of reconstruction skill. We emphasize that the procedure of bias correction and variance matching has no bearing on these aspects of the validation, as it simply adds a scale and offset to the pseudoproxies, without modifying their probability distribution shape or spectral characteristics.

Fig. 10.

Fig. 10

Spectral analysis of the pseudoproxy records in dataset “ppwn_SNRinf_rta” by proxy type. The gray curves denote the power spectral density (PSD, in the unit of power per year, i.e., squared unit of the proxy variable per year) of the real records, while the colored curves denote that of the pseudoproxy records.

Usage Notes

To illustrate the many potential uses of this dataset, we provide Jupyter notebooks (Code Availability) for the basic analysis and visualization of the dataset, as well as applications to climate field reconstruction. Specifically, we provide Python-based examples of:

  • Loading and visualizing the pseudoPAGES2k dataset.

  • Filtering the pseudoPAGES2k dataset according to various criteria.

  • Generating dashboards like Figs. 38.

  • How to apply Paleoclimate Data Assimilation (in the vein of the Last Millennium Reanalysis48) to the pseudoPAGES2k dataset, and its use for benchmarking climate field reconstruction methods.

Other potential uses of this dataset and its production workflow include optimal sampling design116. A natural extension would be to add age uncertainties to these pseudoproxies, as done in ref. 31.

Acknowledgements

The authors acknowledge support from the National Natural Science Foundation of China (grant 42205053 to F.Z.), the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province (grant 22KJB170005 to F.Z.), the National Science Foundation (grants 2303530 and EAR 1948822 to F.Z.), and the Climate Program Office of the National Oceanographic and Atmospheric Administration (grants NA18OAR4310426 to J.E.G. and NA18OAR4310420 to K.J.A.).

Author contributions

J.E.G. and F.Z. designed the research and drafted the manuscript. F.Z. conducted all calculations and produced all the figures. All authors provided information about the proxy archives, proxy systems models, and/or climate models, and wrote the manuscript.

Code availability

The Jupyter notebooks illustrating the usage of the pseudoPAGES2k dataset can be accessed at 10.5281/zenodo.7652533 or https://github.com/fzhu2e/paper-pseudoPAGES2k.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.IPCC. Summary for policymakers. In Masson-Delmotte, V. et al. (eds.) Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (Cambridge University Press, 2021).
  • 2.Tingley MP, et al. Piecing together the past: statistical insights into paleoclimatic reconstructions. Quaternary Science Reviews. 2012;35:1–22. doi: 10.1016/j.quascirev.2012.01.012. [DOI] [Google Scholar]
  • 3.Jones JM, Widmann M. Early peak in Antarctic oscillation index. Nature. 2004;432:290–291. doi: 10.1038/432290b. [DOI] [PubMed] [Google Scholar]
  • 4.Goosse H, et al. The origin of the European “Medieval Warm Period”. Climate of the Past. 2006;2:99–113. doi: 10.5194/cp-2-99-2006. [DOI] [Google Scholar]
  • 5.Gebhardt C, Kühl N, Hense A, Litt T. Reconstruction of Quaternary temperature fields by dynamically consistent smoothing. Climate Dynamics. 2008;30:421–437. doi: 10.1007/s00382-007-0299-9. [DOI] [Google Scholar]
  • 6.Widmann M, Goosse H, van der Schrier G, Schnur R, Barkmeijer J. Using data assimilation to study extratropical Northern Hemisphere climate over the last millennium. Climate of the Past. 2010;6:627–644. doi: 10.5194/cp-6-627-2010. [DOI] [Google Scholar]
  • 7.Goosse, H. et al. Reconstructing surface temperature changes over the past 600 years using climate model simulations with data assimilation. Journal of Geophysical Research: Atmospheres 115, 10.1029/2009JD012737 (2010).
  • 8.Annan JD, Hargreaves JC. A new global reconstruction of temperature changes at the Last Glacial Maximum. Climate of the Past. 2013;9:367–376. doi: 10.5194/cp-9-367-2013. [DOI] [Google Scholar]
  • 9.Steiger NJ, Hakim GJ, Steig EJ, Battisti DS, Roe GH. Assimilation of Time-Averaged Pseudoproxies for Climate Reconstruction. Journal of Climate. 2014;27:426–441. doi: 10.1175/JCLI-D-12-00693.1. [DOI] [Google Scholar]
  • 10.Hakim GJ, et al. The last millennium climate reanalysis project: Framework and first results. Journal of Geophysical Research: Atmospheres. 2016;121:6745–6764. doi: 10.1002/2016JD024751. [DOI] [Google Scholar]
  • 11.Franke J, Brönnimann S, Bhend J, Brugnara Y. A monthly global paleo-reanalysis of the atmosphere from 1600 to 2005 for studying past climatic variations. Scientific Data. 2017;4:170076. doi: 10.1038/sdata.2017.76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Acevedo W, Fallah B, Reich S, Cubasch U. Assimilation of pseudo-tree-ring-width observations into an atmospheric general circulation model. Climate of the Past. 2017;13:545–557. doi: 10.5194/cp-13-545-2017. [DOI] [Google Scholar]
  • 13.Steiger NJ, Smerdon JE, Cook ER, Cook BI. A reconstruction of global hydroclimate and dynamical variables over the Common Era. Scientific Data. 2018;5:1–15. doi: 10.1038/sdata.2018.86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tierney JE, et al. Glacial cooling and climate sensitivity revisited. Nature. 2020;584:569–573. doi: 10.1038/s41586-020-2617-x. [DOI] [PubMed] [Google Scholar]
  • 15.Osman MB, et al. Globally resolved surface temperatures since the Last Glacial Maximum. Nature. 2021;599:239–244. doi: 10.1038/s41586-021-03984-4. [DOI] [PubMed] [Google Scholar]
  • 16.King JM, et al. A data assimilation approach to last millennium temperature field reconstruction using a limited high-sensitivity proxy network. Journal of Climate. 2021;-1:1–64. doi: 10.1175/JCLI-D-20-0661.1. [DOI] [Google Scholar]
  • 17.Zhu F, et al. A re-appraisal of the ENSO response to volcanism with paleoclimate data assimilation. Nature Communications. 2022;13:747. doi: 10.1038/s41467-022-28210-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shoji S, Okazaki A, Yoshimura K. Impact of proxies and prior estimates on data assimilation using isotope ratios for the climate reconstruction of the last millennium. Earth and Space Science. 2022;9:e2020EA001618. doi: 10.1029/2020EA001618. [DOI] [Google Scholar]
  • 19.Valler V, Franke J, Brugnara Y, Brönnimann S. An updated global atmospheric paleo-reanalysis covering the last 400 years. Geoscience Data Journal. 2022;9:89–107. doi: 10.1002/gdj3.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Annan JD, Hargreaves JC, Mauritsen T. A new global surface temperature reconstruction for the Last Glacial Maximum. Climate of the Past. 2022;18:1883–1896. doi: 10.5194/cp-18-1883-2022. [DOI] [Google Scholar]
  • 21.Smerdon JE, Pollack HN. Reconstructing earth’s surface temperature over the past 2000 years: the science behind the headlines. Wiley Interdisciplinary Reviews: Climate Change n/a–n/a. 2016 doi: 10.1002/wcc.418. [DOI] [Google Scholar]
  • 22.Wang J, Emile-Geay J, Guillot D, McKay NP, Rajaratnam B. Fragility of reconstructed temperature patterns over the common era: Implications for model evaluation. Geophysical Research Letters. 2015;42:7162–7170. doi: 10.1002/2015GL065265. [DOI] [Google Scholar]
  • 23.Neukom R, Steiger N, Gómez-Navarro JJ, Wang J, Werner JP. No evidence for globally coherent warm and cold periods over the preindustrial common era. Nature. 2019;571:550–554. doi: 10.1038/s41586-019-1401-2. [DOI] [PubMed] [Google Scholar]
  • 24.Smerdon JE. Climate models as a test bed for climate reconstruction methods: pseudoproxy experiments. Wiley Interdisciplinary Reviews: Climate Change. 2012;3:63–77. doi: 10.1002/wcc.149. [DOI] [Google Scholar]
  • 25.Smerdon JE, Kaplan A, Chang D, Evans MN. A Pseudoproxy Evaluation of the CCA and RegEM Methods for Reconstructing Climate Fields of the Last Millennium. Journal of Climate. 2010;23:4856–4880. doi: 10.1175/2010JCLI3328.1. [DOI] [Google Scholar]
  • 26.Smerdon, J. E., Kaplan, A., Zorita, E., González–Rouco, J. F. & Evans, M. N. Spatial performance of four climate field reconstruction methods targeting the Common Era. Geophysical Research Letters 38, 10.1029/2011GL047372 (2011).
  • 27.Smerdon JE, Kaplan A, Amrhein DE. Erroneous model field representations in multiple pseudoproxy studies: Corrections and implications*. J. Clim. 2010;23:5548–5554. doi: 10.1175/2010JCLI3742.1. [DOI] [Google Scholar]
  • 28.Wang J, Emile-Geay J, Guillot D, Smerdon JE, Rajaratnam B. Evaluating climate field reconstruction techniques using improved emulations of real-world conditions. Climate of the Past. 2014;10:1–19. doi: 10.5194/cp-10-1-2014. [DOI] [Google Scholar]
  • 29.Evans MN, Smerdon JE, Kaplan A, Tolwinski–Ward SE, González–Rouco JF. Climate field reconstruction uncertainty arising from multivariate and nonlinear properties of predictors. Geophysical Research Letters. 2014;41:9127–9134. doi: 10.1002/2014GL062063. [DOI] [Google Scholar]
  • 30.Smerdon, J. E., Coats, S. & Ault, T. R. Model-dependent spatial skill in pseudoproxy experiments testing climate field reconstruction methods for the Common Era. Climate Dynamics 1–22, 10.1007/s00382-015-2684-0 (2015).
  • 31.Nilsen T, Talento S, Werner JP. Constraining two climate field reconstruction methodologies over the north atlantic realm using pseudo-proxy experiments. Quaternary Science Reviews. 2021;265:107009. doi: 10.1016/j.quascirev.2021.107009. [DOI] [Google Scholar]
  • 32.Mann ME, Rutherford S. Climate reconstruction using ‘Pseudoproxies’. Geophys. Res. Lett. 2002;29:139–1. doi: 10.1029/2001GL014554. [DOI] [Google Scholar]
  • 33.Rutherford S, Mann ME, Delworth TL, Stouffer RJ. Climate field reconstruction under stationary and nonstationary forcing. J. Clim. 2003;16:462–479. doi: 10.1175/1520-0442(2003)016&#x00026;lt;0462:CFRUSA&#x00026;gt;2.0.CO;2. [DOI] [Google Scholar]
  • 34.Rutherford S, et al. Proxy-Based Northern Hemisphere Surface Temperature Reconstructions: Sensitivity to Method, Predictor. Network, Target Season, and Target Domain. J. Clim. 2005;18:2308–2329. [Google Scholar]
  • 35.Mann ME, Rutherford S, Wahl E, Ammann C. Robustness of proxy-based climate field reconstruction methods. Journal of Geophysical Research (Atmospheres) 2007;112:12109–+. doi: 10.1029/2006JD008272. [DOI] [Google Scholar]
  • 36.Gómez-Navarro JJ, Zorita E, Raible CC, Neukom R. Pseudo-proxy tests of the analogue method to reconstruct spatially resolved global temperature during the Common Era. Climate of the Past. 2017;13:629–648. doi: 10.5194/cp-13-629-2017. [DOI] [Google Scholar]
  • 37.Evans MN, Tolwinski-Ward SE, Thompson DM, Anchukaitis KJ. Applications of proxy system modeling in high resolution paleoclimatology. Quaternary Science Reviews. 2013;76:16–28. doi: 10.1016/j.quascirev.2013.05.024. [DOI] [Google Scholar]
  • 38.Dee S, et al. PRYSM: An open-source framework for PRoxY system modeling, with applications to oxygen-isotope systems. J. Adv. Model. Earth Syst. 2015;7:1220–1247. doi: 10.1002/2015MS000447. [DOI] [Google Scholar]
  • 39.Dee SG, Steiger NJ, Emile-Geay J, Hakim GJ. On the utility of proxy system models for estimating climate states over the common era. Journal of Advances in Modeling Earth Systems. 2016;8:1164–1179. doi: 10.1002/2016MS000677. [DOI] [Google Scholar]
  • 40.Dee SG, et al. Improved spectral comparisons of paleoclimate models and observations via proxy system modeling: Implications for multi-decadal variability. Earth and Planetary Science Letters. 2017;476:34–46. doi: 10.1016/j.epsl.2017.07.036. [DOI] [Google Scholar]
  • 41.Dee SG, Russell JM, Morrill C, Chen Z, Neary A. PRYSM v2.0: A Proxy System Model for Lacustrine Archives. Paleoceanography and Paleoclimatology. 2018;33:1250–1269. doi: 10.1029/2018PA003413. [DOI] [Google Scholar]
  • 42.Bothe O, Wagner S, Zorita E. Simple noise estimates and pseudoproxies for the last 21000 years. Earth System Science Data. 2019;11:1129–1152. doi: 10.5194/essd-11-1129-2019. [DOI] [Google Scholar]
  • 43.PAGES2k Consortium A global multiproxy database for temperature reconstructions of the Common Era. Scientific Data. 2017;4:170088 EP. doi: 10.1038/sdata.2017.88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Widmann, M., Franke, J., Goosse, H., Hakim, G. & Steiger, N. The DAPS data assimilation intercomparison experiment. In EGU General Assembly Conference Abstracts, EGU General Assembly Conference Abstracts, 19100 (2018).
  • 45.Neukom R, et al. Consistent multidecadal variability in global temperature reconstructions and simulations over the common era. Nature Geoscience. 2019;12:643–649. doi: 10.1038/s41561-019-0400-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wang J, et al. Evaluation of multidecadal and longer-term temperature changes since 850 CE based on Northern Hemisphere proxy-based reconstructions and model simulations. Science China Earth Sciences. 2020;63:1126–1143. doi: 10.1007/s11430-019-9607-x. [DOI] [Google Scholar]
  • 47.St. Klippel L, George S, Büntgen U, Krusic PJ, Esper J. Differing pre-industrial cooling trends between tree rings and lower-resolution temperature proxies. Climate of the Past. 2020;16:729–742. doi: 10.5194/cp-16-729-2020. [DOI] [Google Scholar]
  • 48.Tardif R, et al. Last Millennium Reanalysis with an expanded proxy database and seasonal proxy modeling. Climate of the Past. 2019;15:1251–1273. doi: 10.5194/cp-15-1251-2019. [DOI] [Google Scholar]
  • 49.Zhu F, Emile-Geay J, Hakim GJ, King J, Anchukaitis KJ. Resolving the Differences in the Simulated and Reconstructed Temperature Response to Volcanism. Geophysical Research Letters. 2020;47:e2019GL086908. doi: 10.1029/2019GL086908. [DOI] [Google Scholar]
  • 50.Anchukaitis KJ, Smerdon JE. Progress and uncertainties in global and hemispheric temperature reconstructions of the common era. Quaternary Science Reviews. 2022;286:107537. doi: 10.1016/j.quascirev.2022.107537. [DOI] [Google Scholar]
  • 51.Guillot D, Rajaratnam B, Emile-Geay J. Statistical paleoclimate reconstructions via Markov random fields. The Annals of Applied Statistics. 2015;9:324–352. doi: 10.1214/14-AOAS794. [DOI] [Google Scholar]
  • 52.Brady E, et al. The Connected Isotopic Water Cycle in the Community Earth System Model Version 1. Journal of Advances in Modeling Earth Systems. 2019;11:2547–2566. doi: 10.1029/2019MS001663. [DOI] [Google Scholar]
  • 53.Nusbaumer J, Wong TE, Bardeen C, Noone D. Evaluating hydrological processes in the Community Atmosphere Model Version 5 (CAM5) using stable isotope ratios of water. Journal of Advances in Modeling Earth Systems. 2017;9:949–977. doi: 10.1002/2016MS000839. [DOI] [Google Scholar]
  • 54.Wong TE, Nusbaumer J, Noone DC. Evaluation of modeled land-atmosphere exchanges with a comprehensive water isotope fractionation scheme in version 4 of the Community Land Model. Journal of Advances in Modeling Earth Systems. 2017;9:978–1001. doi: 10.1002/2016MS000842. [DOI] [Google Scholar]
  • 55.Oleson, K. W. et al. Technical description of version 4.0 of the Community Land Model (CLM). Tech. Rep., National Center for Atmospheric Research (2010).
  • 56.Zhang J, et al. Asynchronous warming and δ18O evolution of deep Atlantic water masses during the last deglaciation. Proceedings of the National Academy of Sciences. 2017;114:11075–11080. doi: 10.1073/pnas.1704512114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Otto-Bliesner BL, et al. Climate variability and change since 850 ce: An ensemble approach with the community earth system model. Bulletin of the American Meteorological Society. 2015;97:735–754. doi: 10.1175/BAMS-D-14-00233.1. [DOI] [Google Scholar]
  • 58.Vieira LEA, Solanki SK, Krivova NA, Usoskin I. Evolution of the solar irradiance during the Holocene. Astronomy & Astrophysics. 2011;531:A6. doi: 10.1051/0004-6361/201015843. [DOI] [Google Scholar]
  • 59.Schmidt GA, et al. Climate forcing reconstructions for use in PMIP simulations of the Last Millennium (v1.1) Geoscientific Model Development. 2012;5:185–191. doi: 10.5194/gmd-5-185-2012. [DOI] [Google Scholar]
  • 60.Gao, C., Robock, A. & Ammann, C. Volcanic forcing of climate over the past 1500 years: An improved ice core-based index for climate models. Journal of Geophysical Research: Atmospheres 113, 10.1029/2008JD010239 (2008).
  • 61.Ammann, C. M., Meehl, G. A., Washington, W. M. & Zender, C. S. A monthly and latitudinally varying volcanic forcing dataset in simulations of 20th century climate. Geophysical Research Letters 30, 10.1029/2003GL016875 (2003).
  • 62.Pongratz, J., Reick, C., Raddatz, T. & Claussen, M. A reconstruction of global agricultural areas and land cover for the last millennium. Global Biogeochemical Cycles 22, 10.1029/2007GB003153 (2008).
  • 63.Hurtt GC, et al. Harmonization of land-use scenarios for the period 1500– 2100: 600 years of global gridded annual land-use transitions, wood harvest, and resulting secondary lands. Climatic Change. 2011;109:117. doi: 10.1007/s10584-011-0153-2. [DOI] [Google Scholar]
  • 64.Berger A, Loutre M-F, Tricot C. Insolation and Earth’s orbital periods. Journal of Geophysical Research: Atmospheres. 1993;98:10341–10362. doi: 10.1029/93JD00222. [DOI] [Google Scholar]
  • 65.Zhu F, 2023. cfr: a Python package for Climate Field Reconstruction. Zenodo. [DOI]
  • 66.Fritts HC. Growth-rings of trees: their correlation with climate. Science. 1966;154:973–979. doi: 10.1126/science.154.3752.973. [DOI] [PubMed] [Google Scholar]
  • 67.Krakauer, N. Y. & Randerson, J. T. Do volcanic eruptions enhance or diminish net primary production? Evidence from tree rings. Global Biogeochemical Cycles 17, 10.1029/2003GB002076 (2003).
  • 68.Frank D, Büntgen U, Böhm R, Maugeri M, Esper J. Warmer early instrumental measurements versus colder reconstructed temperatures: shooting at a moving target. Quaternary Science Reviews. 2007;26:3298–3310. doi: 10.1016/j.quascirev.2007.08.002. [DOI] [Google Scholar]
  • 69.Esper J, Schneider L, Smerdon JE, Schöne BR, Büntgen U. Signals and memory in tree-ring width and density data. Dendrochronologia. 2015;35:62–70. doi: 10.1016/j.dendro.2015.07.001. [DOI] [Google Scholar]
  • 70.Stoffel M, et al. Estimates of volcanic-induced cooling in the Northern Hemisphere over the past 1,500 years. Nature Geoscience. 2015;8:784–788. doi: 10.1038/ngeo2526. [DOI] [Google Scholar]
  • 71.Zhang H, et al. Modified climate with long term memory in tree ring proxies. Environmental Research Letters. 2015;10:084020. doi: 10.1088/1748-9326/10/8/084020. [DOI] [Google Scholar]
  • 72.Lücke L, Hegerl G, Schurer A, Wilson R. Effects of memory biases on variability of temperature reconstructions. Journal of Climate. 2019 doi: 10.1175/JCLI-D-19-0184.1. [DOI] [Google Scholar]
  • 73.Fritts, H. et al. User manual for treering 2000. Tech. Rep., Laboratory of Tree-Ring Research (2000).
  • 74.Vaganov, E. A., Hughes, M. K. & Shashkin, A. V. Growth Dynamics of Conifer Tree Rings: Images of Past and Future Environments. Ecological Studies (Springer-Verlag, Berlin Heidelberg, 2006).
  • 75.Tolwinski-Ward SE, Evans MN, Hughes MK, Anchukaitis KJ. An efficient forward model of the climate controls on interannual variation in tree-ring width. Climate Dynamics. 2011;36:2419–2439. doi: 10.1007/s00382-010-0945-5. [DOI] [Google Scholar]
  • 76.Tolwinski-Ward SE, Anchukaitis KJ, Evans MN. Bayesian parameter estimation and interpretation for an intermediate model of tree-ring width. Climate of the Past. 2013;9:1481–1493. doi: 10.5194/cp-9-1481-2013. [DOI] [Google Scholar]
  • 77.Tolwinski-Ward SE, Tingley MP, Evans MN, Hughes MK, Nychka DW. Probabilistic reconstructions of local temperature and soil moisture from tree-ring data with potentially time-varying climatic response. Climate Dynamics. 2015;44:791–806. doi: 10.1007/s00382-014-2139-z. [DOI] [Google Scholar]
  • 78.Rezsöhazy, J. et al. Application and evaluation of the dendroclimatic process-based model MAIDEN during the last century in Canada and Europe. Climate of the Past 16, 1043–1059, 10.5194/cp-16-1043-2020. Publisher: Copernicus GmbH (2020).
  • 79.Rezsöhazy J, Gennaretti F, Goosse H, Guiot J. Testing the performance of dendroclimatic process-based models at global scale with the PAGES2k tree-ring width database. Climate Dynamics. 2021;57:2005–2020. doi: 10.1007/s00382-021-05789-7. [DOI] [Google Scholar]
  • 80.Druel A, Ciais P, Krinner G, Peylin P. Modeling the Vegetation Dynamics of Northern Shrubs and Mosses in the ORCHIDEE Land Surface Model. Journal of Advances in Modeling Earth Systems. 2019;11:2020–2035. doi: 10.1029/2018MS001531. [DOI] [Google Scholar]
  • 81.Evans, M. N., Zhu, F., Rezsöhazy, J. & Jeong, J. What complexity PSM for paleoclimate data assimilation? Results from a tree-ring width data model intercomparison study. In EOS, Transactions, AGU. Abstract PP041-0011 presented at the AGU Fall 2020 Meeting, 15 Dec (2020).
  • 82.Franke J, Frank D, Raible CC, Esper J, Brönnimann S. Spectral biases in tree-ring climate proxies. Nature Climate Change. 2013;3:360–364. doi: 10.1038/nclimate1816. [DOI] [Google Scholar]
  • 83.Zhang H, et al. Modified climate with long term memory in tree ring proxies. Environmental Research Letters. 2015;10:084020. doi: 10.1088/1748-9326/10/8/084020. [DOI] [Google Scholar]
  • 84.Esper J, et al. Large-scale, millennial-length temperature reconstructions from tree-rings. Dendrochronologia. 2018;50:81–90. doi: 10.1016/j.dendro.2018.06.001. [DOI] [Google Scholar]
  • 85.Büntgen U, et al. The influence of decision-making in tree ring-based climate reconstructions. Nature Communications. 2021;12:3411. doi: 10.1038/s41467-021-23627-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Kirchner JW. Aliasing in 1/f(alpha) noise spectra: origins, consequences, and remedies. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics. 2005;71:066110. doi: 10.1103/PhysRevE.71.066110. [DOI] [PubMed] [Google Scholar]
  • 87.Huybers P, Curry W. Links between annual, Milankovitch and continuum temperature variability. Nature. 2006;441:329–332. doi: 10.1038/nature04745. [DOI] [PubMed] [Google Scholar]
  • 88.Zhu, F. et al. Climate models can correctly simulate the continuum of global-average temperature variability. Proceedings of the National Academy of Sciences 201809959, 10.1073/pnas.1809959116 (2019). [DOI] [PMC free article] [PubMed]
  • 89.Mann, M. E., Rutherford, S., Wahl, E. & Ammann, C. Robustness of proxy-based climate field reconstruction methods. Journal of Geophysical Research: Atmospheres 112, 10.1029/2006JD008272 (2007).
  • 90.Harris I, Jones PD, Osborn TJ, Lister DH. Updated high-resolution grids of monthly climatic observations – the CRU TS3.10 Dataset. International Journal of Climatology. 2014;34:623–642. doi: 10.1002/joc.3711. [DOI] [Google Scholar]
  • 91.D’Arrigo R, Wilson R, Anchukaitis KJ. Volcanic cooling signal in tree ring temperature records for the past millennium. Journal of Geophysical Research: Atmospheres. 2013;118:9000–9010. doi: 10.1002/jgrd.50692. [DOI] [Google Scholar]
  • 92.Schneider L, et al. Revising midlatitude summer temperatures back to ad 600 based on a wood density network. Geophysical Research Letters. 2015;42:4556–4562. doi: 10.1002/2015GL063956. [DOI] [Google Scholar]
  • 93.Wilson R, et al. Last millennium northern hemisphere summer temperatures from tree rings: Part i: The long term context. Quaternary Science Reviews. 2016;134:1–18. doi: 10.1016/j.quascirev.2015.12.005. [DOI] [Google Scholar]
  • 94.Anchukaitis KJ, et al. Last millennium northern hemisphere summer temperatures from tree rings: Part ii, spatially resolved reconstructions. Quaternary Science Reviews. 2017;163:1–22. doi: 10.1016/j.quascirev.2017.02.020. [DOI] [Google Scholar]
  • 95.Björklund J, et al. Scientific merits and analytical challenges of tree-ring densitometry. Reviews of Geophysics. 2019;57:1224–1264. doi: 10.1029/2019RG000642. [DOI] [Google Scholar]
  • 96.Cobb KM, Charles CD, Cheng H, Edwards RL. El Niño/Southern Oscillation and tropical Pacific climate during the last millennium. Nature. 2003;424:271. doi: 10.1038/nature01779. [DOI] [PubMed] [Google Scholar]
  • 97.Cobb KM, et al. Highly Variable El Niño–Southern Oscillation Throughout the Holocene. Science. 2013;339:67–70. doi: 10.1126/science.1228246. [DOI] [PubMed] [Google Scholar]
  • 98.Emile-Geay J, Cobb KM, Mann ME, Wittenberg AT. Estimating Central Equatorial Pacific SST Variability over the Past Millennium. Part I: Methodology and Validation. Journal of Climate. 2013;26:2302–2328. doi: 10.1175/JCLI-D-11-00510.1. [DOI] [Google Scholar]
  • 99.Emile-Geay J, Cobb KM, Mann ME, Wittenberg AT. Estimating Central Equatorial Pacific SST Variability over the Past Millennium. Part II: Reconstructions and Implications. Journal of Climate. 2013;26:2329–2352. doi: 10.1175/JCLI-D-11-00511.1. [DOI] [Google Scholar]
  • 100.Tierney JE, et al. Tropical sea surface temperatures for the past four centuries reconstructed from coral archives. Paleoceanography. 2015;30:226–252. doi: 10.1002/2014PA002717. [DOI] [Google Scholar]
  • 101.Emile-Geay, J., Cobb, K. M., Cole, J. E., Elliot, M. & Zhu, F. Past ENSO Variability, chap. 5, 87–118, https://agupubs.onlinelibrary.wiley.com/doi/pdf/10.1002/496 9781119548164.ch5 (American Geophysical Union (AGU), 2020).
  • 102.Brown, J., Simmonds, I. & Noone, D. Modeling δ18 O in tropical precipitation and the surface ocean for present-day climate. Journal of Geophysical Research: Atmospheres 111, 10.1029/2004JD005611 (2006).
  • 103.Thompson, D. M., Ault, T. R., Evans, M. N., Cole, J. E. & Emile-Geay, J. Comparison of observed and simulated tropical climate trends using a forward model of coral δ 18 O. Geophysical Research Letters 38, 10.1029/2011GL048224 (2011).
  • 104.Stevenson S, et al. Twentieth Century Seawater δ18 O Dynamics and Implications for Coral-Based Climate Reconstruction. Paleoceanography and Paleoclimatology. 2018;33:606–625. doi: 10.1029/2017PA003304. [DOI] [Google Scholar]
  • 105.Corrège T. Sea surface temperature and salinity reconstruction from coral geochemical tracers. Palaeogeography, Palaeoclimatology, Palaeoecology. 2006;232:408–428. doi: 10.1016/j.palaeo.2005.10.014. [DOI] [Google Scholar]
  • 106.Johnsen S. Stable isotope homogenization of polar firn and ice. Proceedings of the Symposium on Isotopes and Impurities in Snow and Ice. 1977;118:210–219. [Google Scholar]
  • 107.Whillans IM, Grootes PM. Isotopic diffusion in cold snow and firn. Journal of Geophysical Research: Atmospheres. 1985;90:3910–3918. doi: 10.1029/JD090iD02p03910. [DOI] [Google Scholar]
  • 108.Cuffey KM, Steig EJ. Isotopic diffusion in polar firn: implications for interpretation of seasonal climate parameters in ice-core records, with emphasis on central Greenland. Journal of Glaciology. 1998;44:273–284. doi: 10.3189/S0022143000002616. [DOI] [Google Scholar]
  • 109.Johnsen S, et al. Diffusion of stable isotopes in polar firn and ice. Proceedings of the Symposium on Isotopes and Impurities in Snow and Ice. 2000;159:121–140. [Google Scholar]
  • 110.Küttel M, Steig EJ, Ding Q, Monaghan AJ, Battisti DS. Seasonal climate information preserved in West Antarctic ice core water isotopes: Relationships to temperature, large-scale circulation, and sea ice. Climate Dynamics. 2012;39:1841–1857. doi: 10.1007/s00382-012-1460-7. [DOI] [Google Scholar]
  • 111.Hodder K, Gilbert R, Desloges J. Glaciolacustrine varved sediment as an alpine hydroclimatic proxy. Journal of Paleolimnology. 2007;38:365–394. doi: 10.1007/s10933-006-9083-9. [DOI] [Google Scholar]
  • 112.Blaauw, M. & Christen, J. Flexible paleoclimate age-depth models using an autoregressive gamma process. Bayesian Analysis 6, 10.1214/ba/1339616472 (2011).
  • 113.Zhu F, 2023. The pseudopages2k dataset v1.4. Zenodo. [DOI]
  • 114.Ghil M, et al. Advanced spectral methods for climatic time series. Rev. Geophys. 2002;40:1003–1052. doi: 10.1029/2000RG000092. [DOI] [Google Scholar]
  • 115.Khider D, et al. Pyleoclim: Paleoclimate Timeseries Analysis and Visualization With Python. Paleoceanography and Paleoclimatology. 2022;37:e2022PA004509. doi: 10.1029/2022PA004509. [DOI] [Google Scholar]
  • 116.Comboul M, Emile-Geay J, Hakim GJ, Evans MN. Paleoclimate sampling as a sensor placement problem. Journal of Climate. 2015;28:7717–7740. doi: 10.1175/JCLI-D-14-00802.1. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Zhu F, 2023. cfr: a Python package for Climate Field Reconstruction. Zenodo. [DOI]
  2. Zhu F, 2023. The pseudopages2k dataset v1.4. Zenodo. [DOI]

Data Availability Statement

The Jupyter notebooks illustrating the usage of the pseudoPAGES2k dataset can be accessed at 10.5281/zenodo.7652533 or https://github.com/fzhu2e/paper-pseudoPAGES2k.


Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES