Abstract
In environmental monitoring, multiple spatial variables are often sampled at a geographical location that can depend on each other in complex ways, such as non-linear and non-Gaussian spatial dependence. We propose a new mixture copula model that can capture those complex relationships of spatially correlated multiple variables and predict univariate variables while considering the multivariate spatial relationship. The proposed method is demonstrated using an environmental application and compared with three existing methods. Firstly, improvement in the prediction of individual variables by utilising multivariate spatial copula compares to the existing univariate pair copula method. Secondly, performance in prediction by utilising mixture copula in the multivariate spatial copula framework compares with an existing multivariate spatial copula model that uses a non-linear principal component analysis. Lastly, improvement in the prediction of individual variables by utilising the non-linear non-Gaussian multivariate spatial copula model compares to the linear Gaussian multivariate cokriging model. The results show that the proposed spatial mixture copula model outperforms the existing methods in the cross-validation of actual and predicted values at the sampled locations.
Subject terms: Environmental sciences, Mathematics and computing
Introduction
Many environmental sampling is often observed multiple spatially correlated variables at a given geographical location. For instance, multiple topsoil heavy metal concentrations, such as cadmium, zinc, and copper, are sampled from the soil sample at a field location. In forestry, multiple biomass variables, such as bole, foliage, stump, branch, and root biomass, are sampled in a tree. Also, the spatial distribution of forest biomass variables may use to understand wildfire behaviour. These variables can depend on each other in complex ways, such as non-linear and non-Gaussian spatial dependence. The spatial modelling by considering these complex multivariate spatial dependence may increase the prediction accuracy of individual variables, which may help forest managers to minimise risk and save lives. This article focusses on the copula-based spatial modelling of spatially correlated multiple variables and predicts the individual variables while utilising multivariate spatial dependence of spatially correlated variables.
Gaussian-based linear kriging method is widely used to model spatial variables and provides a weighted average measure of linear spatial dependence. The kriging weights do not depend on the different values of samples and also assume linear Gaussian spatial dependence over the spatial domain1–6. However, spatial interpolation (prediction or simulation) based on a spatial model expects to behave differently for different values of samples. That is, spatial correlation between samples varies for the different quantiles of samples. Thus, Bárdossy7 introduced spatial copula method that can capture the spatial dependence of a spatial variable by considering the different values of samples. In the non-spatial setting, copula method is used to model the dependence between two or more non-spatial variables, which has widely applied in many fields, such as environmental science, finance, economics, medicine and engineering8–14. Bárdossy’s7 spatial copula method divides the distance over which spatial dependence exists into equally spaced intervals, also referred to as distance classes or spatial bins, and requires the same family of copulas to be fitted across all of the spatial bins. Gräler and Pebesma15 proposed a more flexible spatial copula model that permits copulas from different families to be fitted across the distance classes. The added flexibility of Gräler and Pebesma’s model, over Bárdossy’s model, permits increased accuracy in modelling and prediction. The spatial copula concept proposed by Bárdossy, Gräler and Pebesma has used in mining, forestry, soil sampling, hydrology, and other environmental applications8,16–24. However, these spatial copula methods enable modelling and predicting a univariate spatial variable without considering the multivariate dependence of spatially correlated multiple variables. Recently, Gnann et al.25 improved Bàrdossy’s7 method to interpolate a primary spatial variable while considering a secondary correlated spatial variable. However, Gnann et al.25 assumed that the joint distribution of primary and secondary variables follows Gaussian copula.
As a solution to model non-Gaussian multivariate spatial dependence in spatial copula framework, Musafer et al.26 proposed a multivariate spatial copula model, whereby the correlated spatial variables were transformed into spatially uncorrelated factors using non-linear principal component analysis (NLPCA). Then, Gräler and Pebesma’s univariate spatial copula model was used to model and predict spatially uncorrelated factors. Subsequent back transformation is required to transform predicted values to the scale of the original variables and to re-inject correlation. However, Musafer et al.’s26 method indirectly models the joint dependence between spatial variables through a black-box transformation. We directly extend Gräler and Pebesma’s univariate spatial copula to multivariate setting that jointly models spatially correlated multiple variables via a white-box mixture copula24. The mixture copula is a joint distribution function of multiple copulas that offers a more flexible framework for parametric statistical modelling and analysis. Also, a single copula family may not be able to capture tail dependencies but the mixture copula capture the tail dependencies as well24. The mixture copula has used in the non-spatial setting for modelling multivariate genomic data27, and modelling wave height and period28,29. We adapt the mixture copula in the spatial setting that offers a more flexible multivariate spatial copula framework for spatially correlated multiple variables.
Methods
The methodology for modelling spatially correlated multiple variables consists of two essential components: modelling each spatial variable separately using Gräler and Pebesma’s15 univariate spatial copula; then joining the univariate spatial copulas using the idea of mixture copula24. We also use the proposed spatial mixture model to predict individual variables using inverse conditional approach in a bivariate context30. However, the method can be used to predict more than two variables with a trivial generalisation of the bivariate setting.
Modelling
Let be the second-order stationary multivariate spatial random field with m spatial variables that are sampled at the same two-dimensional location , and let be the set of existing locations in the given spatial domain .
A spatial copula15 describes the joint spatial dependence of a univariate spatial variable at any two spatial locations x and , where h is the separation distance between two locations. Hence, spatial copulas model dependence of one spatial location relative to another spatial location, rather than modelling dependence using absolute locations.
The methodology for modelling spatially correlated multiple variables is simply shown in Fig. 1, and a detail procedure for the model development is provided in steps 1–4.
Figure 1.
A diagram for spatial mixture copula construction.
Step 1: For each spatial variable , models the marginal cumulative distribution functions (CDFs), such as Gamma, Weibull, Normal, Log-normal, and obtain the best fitted CDFs. Let denote the best fit CDF of , which is assumed to be same at each location x, i.e., .
The proposed method is based on the concept of distance dependent spatial copula15,31. Hence, the distances between every pair of locations are calculated. Suppose, , , and are four sampled locations of Z.
As given in Fig. 2, the distances are calculated for each location pair - as , where and are the coordinates of and , for , , respectively. Also, number of pairs are obtained with n sampled locations. The next important step of the methodology is spatial binning.
Figure 2.

Example plot to show the possible pairs with four locations.
The spatial dependence of copula-based spatial models depends on the distance between locations. As the distance between two points increases, spatial dependence between points decreases until it is independent or negligible enough to be considered independent. The distance at which independence occurs is referred to as the cut-off distance and is determined empirically using a correlogram. The correlogram plots the Kendall’s tau correlation coefficient for each spatial bin, and a curve is fitted through the plotted points. Similar to a variogram in Kriging, the cut-off distance is visually determined as the distance at which the curve plateaus.
Step 2: Based on the distance between pairs, place each sample pair into K equally spaced spatial bins as follows: , where is the cut-off distance. A correlogram is used to determine the cut-off distance as a plot of against the mean distance of each bin, which is calculated using the pairs belonging to relevant spatial bin. Figure 3 depicts an example correlogram.
Figure 3.

An example correlogram. The blue dashed line indicates the upper limit of cut-off distance at which pairs of points are no longer considered to be spatially dependent. Empirical values (black dots) overlaid with theoretical cubic smooth line.
Given the pairs of points for each spatial bin, spatial copula that describes the dependence of spatial variable at any two locations can be calculated as,
| 1 |
where is the index of the spatial bin, u and v are any selected quantiles of the corresponding univariate CDF of at locations x and .
The copulas for each bin are selected using maximum log-likelihood values of competing copulas, such as Gaussian, Student’s t, Clayton, Frank, Gumbel, and Joe30, which represent variety of dependence structures. Then, a mixture copula is used to determine the multivariate spatial dependence across bins as a weighted linear combination of copula.
Step 3: For each spatial bin k, use the spatial copulas in Eq. (1) to construct the mixture copula as,
| 2 |
where, is the mixture weight, , and .
An equal weight can be used in Eq. (2) if the correlogram of each variable is not significantly different. Otherwise, compare different weight combinations across bins and obtain the optimal weight combination. Moreover, for small distances, pairs of points will become extremely strong dependent and modelled using a comonotonic copula M(u, v). For large distances, pairs will become independent and modelled using a product copula 32, as follows
The mixture copula in Eq. (2) only describes the multivariate spatial dependence across individual bins. However, a spatial model should be able to capture spatial autocorrelation between bins15. For instance, points near the upper bound of the first bin and the lower bound of the second bin may have similar features; points near the upper bound of the second bin and lower bound of the third bin; and so on. Thus, the spatial dependence is incorporated using the distance dependent parameter that determines spatial dependence while incorporating spatial autocorrelation.
In practical situations, the first bin is modelled using the best fit copula for that bin, and subsequent bins are modelled using the convex linear combination of copulas with parameter 15. Further, pairs that fall above the cut-off distance are often omitted and not incorporated into the convex combination, assumed as an independent copula.
Step 4: Use Eq. (2), construct the distance dependent spatial mixture copula of as the convex linear combination of mixture copulas of each spatial bin as follows,
| 3 |
where for , is the mean distance, and denote upper limits of the chosen distances for the spatial bins.
Prediction
Prediction of individuals spatial variables at sampled locations based on the spatial mixture copula is described in a bivariate context. That is , then is the spatial mixture copula of and . The prediction method demonstrates the advantage of using a secondary correlated spatial variable in the prediction of a primary spatial variable25. Thus, an inverse conditional prediction approach is proposed30,[pp. 40–42] where a secondary correlated spatial variable is known when predicting the primary spatial variable .
Suppose is the primary variable of interest, then is correlated secondary variable. The prediction of at location x conditional on the known given value of can be generated at the same location x, using the copula CDF of . The procedure of the inverse conditional approach is given in steps 5-8,
Step 5: Obtain the joint CDF values of and using , and let T be the vector with joint CDF values.
Step 6: Obtain the marginal CDF values of using , and let R be the vector with marginal CDF values.
Step 7: Derive the conditional distribution of T, given , using the partial derivative of as follows,
let be the conditional predicted value of at location x.
Step 8: Take, .
The prediction of , given , can be described by simply switching the subscripts 1 and 2 in the steps 5–8. The proposed method can be validated against actual values at sampled locations by cross-validation, and three scenarios are considered with the existing methods.
Can any improvement in the prediction of individual variables be gained by utilising the multivariate spatial dependence using mixture copula over the univariate pair copula15?
Can any improvement in the prediction of individual variables be gained by utilising the mixture copula over the NLPCA transformation based spatial copula26?
Can any improvement in the prediction of individual variables be gained by utilising the non-linear non-Gaussian multivariate spatial dependence (spatial mixture copula) over the linear Gaussian multivariate spatial dependence (cokriging)33?
The cross-validation study is illustrated using mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE). The MAE, RMSE and MAPE can be calculated using the actual and predicted values at the sampled locations26. Also, accuracy in the reproduction of the bivariate relationship of and is evaluated based on the mean square error from the kernel density estimation (KDE MSE). The KDE MSE can be calculated by taking the mean of the squared differences between the bivariate KDEs of the actual and predicted data26.
Application
The proposed method was applied to model real forest data that was taken from georeferenced forest inventory plots in the US Department of Agriculture Forest Service Bartlett Experimental Forest (BEF) in Bartlett, New Hampshire34. The variables of interest were forest-wide biomass estimations within the area of 1053 hectares (measured in mg/ha). In this study, only foliage biomass () and bole biomass () were used that sampled at 335 two-dimensional locations.
The prediction of bole biomass can be used for carbon accounting purposes, and the prediction of foliage biomass can be used to identify regions with high values of foliage biomass. Also, the behaviour of wildfires depends on pools of biomass variables26,35.
Table 1 gives the summary statistics of the data. Figure 4a,b show the spatial distributions of and at observed locations. Figure 4c shows a strong bivariate non-linear relationship between and . The best marginal distributions were selected based on the maximum log-likelihood (ML) values. The Weibull distribution was achieved as the best distribution for based on the ML values, 65.03, 69.43, 33.26, 44.03, and the Gamma distribution was achieved as the best distribution for based on the ML values, 378.86, 378.28, 250.20, 377.51, of Gamma, Weibull, Normal, Log-normal distributions respectively. Then, the CDF values of and were calculated using the corresponding CDFs. The following steps for the modelling is only incorporated the CDF values of and (Step 1).
Table 1.
Summary statistics of and .
| Statistics | ||
|---|---|---|
| n | 335 | 335 |
| Mean | 0.334 | 0.119 |
| Standard deviation | 0.219 | 0.115 |
| Minimum | 0.200 | 0.010 |
| First quartile | 0.140 | 0.030 |
| Median | 0.300 | 0.090 |
| Third quartile | 0.510 | 0.160 |
| Maximum | 0.820 | 0.560 |
Figure 4.
BEF data. Spatial distributions of (a) , (b) , and (c) scatter plot between and .
The cut-off distance was selected as 800 m using the correlograms of variables, and ten equally spaced (80 m) spatial bins were created (see Table 2). Table 3 shows the best fit copulas and the estimated copula parameters, where and are the fitted univariate spatial copulas of and respectively (Step 2). The correlation across bins almost similar for each variable (see Table 2), and then equal weights were used. Table 4 shows the mixture copulas of each bin (Step 3).
Table 2.
BEF data: spatial binning.
| Bins | Mean distance | Kendall’s tau | |
|---|---|---|---|
| 0–80 | 68 | 0.31 | 0.23 |
| 80–160 | 105 | 0.20 | 0.18 |
| 160–240 | 209 | 0.11 | 0.10 |
| 720–800 | 758 | 0.03 | 0.03 |
Table 3.
The univariate spatial copulas for each bin.
| Bins | ||
|---|---|---|
| 0–80 | = Joe (1.71) | = Joe (1.48) |
| 80–160 | = Gumbel (1.31) | = Gaussian (0.29) |
| 160–240 | = Gumbel(1.16) | = Frank (1.12) |
| 240–320 | = Gumbel (1.10) | = Clayton (0.19) |
| 320–400 | = Gumbel (1.06) | = Clayton (0.13) |
| 400–480 | = Joe(1.09) | = Joe (1.08) |
| 480–560 | = Joe (1.07) | = Gumbel (1.03) |
| 560–640 | = Clayton (0.09) | = Clayton (0.06) |
| 640–720 | = Clayton(0.08) | = Clayton (0.05) |
| 720–800 | = Joe (1.05) | = Gumbel (1.03) |
Table 4.
The mixture copulas of each bin with ==0.5.
| Bins | Mean distance () | Mixture copula () |
|---|---|---|
| 0–80 | 68 | |
| 80–160 | 105 | |
| 160–240 | 209 | |
| 240–320 | 284 | |
| 320–400 | 368 | |
| 400–480 | 436 | |
| 480–560 | 518 | |
| 560–640 | 606 | |
| 640–720 | 678 | |
| 720–800 | 758 |
The mixture copulas in Table 4 were used to develop the distance dependent convex combination of mixture copulas as given in the Eq. (3), which is the proposed spatial mixture copula of the spatially correlated and (Step 4), is given by
where = = 0.31, = 0.61,, = 0.48.
The proposed spatial mixture copula method was used to predict and using the inverse conditional approach as described in the steps 5–8. Figure 5 shows the bivariate relationship of and . Table 5 shows the model validation results with the existing methods.
Figure 5.
Reproduction of bivariate relationship using various methods. Actual (red), predicted (black), given (green), and given (blue).
Table 5.
Model validation in prediction of and .
| Method | KDE | ||||||
|---|---|---|---|---|---|---|---|
| RMSE | MAE | MAPE | RMSE | MAE | MAPE | MSE | |
| Pair copula | 0.20 | 0.17 | 1.12 | 0.11 | 0.08 | 1.98 | 3.61 |
| Cokriging | 0.19 | 0.16 | 0.54 | 0.11 | 0.08 | 0.75 | 3.71 |
| NLPCA | 0.29 | 0.24 | 1.64 | 0.14 | 0.10 | 2.21 | 12.40 |
| Mixture copula | 0.14 | 0.13 | 0.56 | 0.06 | 0.05 | 0.63 | 1.34 |
Significant values are in [bold].
According to Table 5 almost all the RMSE, MAE, and MAPE values are the lowest for the and predictions based on the spatial mixture copula. The MAPE value of cokriging method is the smallest for the prediction that is very close to the spatial mixture copula. Thus, it can be seen that the proposed method outperformed in the prediction of and across the observed locations. Also, the proposed method accurately reproduces the bivariate relationship in terms of the minimum value of KDE MSE.
In Fig. 5, the univariate pair copula method does not reproduce the tail values of both variables. Cokriging is unable to predict tail values and follows a strictly linear relationship. Although the NLPCA spatial copula method reproduces the non-linear relationship, it cannot reproduce upper tails, specifically for . The prediction of individual variables using the novel spatial mixture copula method accurately predicts both upper and lower tail values, and conditional values of the variables reproduce the non-linear relationships between them. Thus, using mixture copula in the multivariate spatial copula framework is improved the accuracy in the univariate prediction.
Conclusions
This article proposed a new mixture copula method for modelling spatially correlated multiple variables. The proposed method models multiple spatial variables without any normalisation of the original variables, such as NLPCA transformation. The method was applied to model bivariate non-linear spatial variables (foliage biomass) and (bole biomass). The model performance was assessed in the cross-validation of actual versus predicted values at sampled locations. The use of multivariate spatial dependence in the univariate prediction, the strength of the mixture copula in the univariate prediction, and utilising non-linear non-Gaussian multivariate spatial dependence in the univariate prediction, were compared with the existing univariate pair copula, NLPCA spatial copula and cokriging methods, respectively. The results showed that the proposed spatial mixture copula model outperformed the existing methods in terms of the minimum values of RMSE, MAE, MAPE, and KDE MSE.
The method also applied to non-linear simulated bivariate correlated variables (see Supplementary online), where the spatial mixture copula outperformed the existing methods, in terms of predicting individual simulated variables and their bivariate relationship. The proposed method used equal weights for each variable for both BEF application and simulation study. However, further improvement to the spatial mixture model is the optimal weights selection of each variable in the mixture copula modelling. For instance, one spatial variable may have a strong spatial dependence across locations than the other variable, and the optimal weights selection may increase the prediction accuracy of each variable across locations. The prediction method is explained for the bivariate case (m = 2), however, it can be extended to multivariate setting. Also, the proposed spatial mixture copula can be extended to multivariate spatial sampling design methodology for optimally selecting additional sampling to reduce prediction uncertainty by leveraging spatially correlated multiple variables. Moreover, the proposed method assumes isotropic spatial dependence of spatial variables but can be extended to model spatially correlated anisotropic variables, which can be present in mining36 and soil variables37, for example.
The proposed method assumes that the spatial random field is stationary. However, the method can be extended to non-stationary spatial processes. For example, a non-stationary spatial process can be divided into several locally stationary processes. A univariate spatial copula can be modelled to each stationary process, and then a global non-stationary spatial copula can be constructed as a mixture of locally stationary spatial copulas. Furthermore, the proposed method assumes that all data points are known and collected at the same set of locations. However, measurements could be unavailable or difficult to sample at some locations, which is quite common in functional spatial data analysis. Thus, the proposed method can be extended for modelling and predicting complex functional spatial data.
Supplementary Information
Acknowledgements
The authors would like to thank the Queensland University of Technology for funding the present study. Mohomed Abraj is funded through an Australian Government Research Training Programme award and a Queensland University of Technology Higher Degree Research award.
Author contributions
M.A. model development, simulation, data analysis and interpretation. M.A. manuscript drafting. M.A., M.H.T. and Y. W. critical revision of the manuscript.
Data availability
In implementing the proposed model in this paper, R software version 3.6.3 (R Development Core Team 2020) was used. Specifically, R package “spcopula” version 0.2-4 of Gräler was entirely used in this study (see http://r-forge.r-project.org/projects/spcopula/), including key dependent packages, such as “copula”, “VineCopula” version 2.1.8, “sp”, “spBayes” “MASS”,“fitdistrplus”. The data is available in “spBayes” package, where only non-zero values of biomass were considered in this study. For simulation study, “gstat” package was mainly used that facilitated the unconditional prediction of Gaussian random fields. Moreover, in comparing the proposed model with the most relevant NLPCA spatial copula model, MATLAB software was used, specifically “Nonlinear PCA toolbox” of Scholz (see http://www.nlpca.org/matlab.html)38. Data for developed method can be found in Dropbox: https://www.dropbox.com/s/cdbmx89fgjul9cg/bef_data.csv?dl=0.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-022-18007-z.
References
- 1.Agyeman PC, et al. Prediction of nickel concentration in peri-urban and urban soils using hybridized empirical Bayesian kriging and support vector machine regression. Sci. Rep. 2022;12:1–16. doi: 10.1038/s41598-022-06843-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Behrens T, Schmidt K, MacMillan RA, Rossel RV. Multiscale contextual spatial modelling with the gaussian scale space. Geoderma. 2018;310:128–137. doi: 10.1016/j.geoderma.2017.09.015. [DOI] [Google Scholar]
- 3.Cui T, Pagendam D, Gilfedder M. Gaussian process machine learning and kriging for groundwater salinity interpolation. Environ. Model. Softw. 2021;144:105170. doi: 10.1016/j.envsoft.2021.105170. [DOI] [Google Scholar]
- 4.Gribov A, Krivoruchko K. Empirical bayesian kriging implementation and usage. Sci. Total Environ. 2020;722:137290. doi: 10.1016/j.scitotenv.2020.137290. [DOI] [PubMed] [Google Scholar]
- 5.Krivoruchko K, Gribov A. Evaluation of empirical bayesian kriging. Spat. Stat. 2019;32:100368. doi: 10.1016/j.spasta.2019.100368. [DOI] [Google Scholar]
- 6.Rostami AA, Karimi V, Khatibi R, Pradhan B. An investigation into seasonal variations of groundwater nitrate by spatial modelling strategies at two levels by kriging and co-kriging models. J. Environ. Manage. 2020;270:110843. doi: 10.1016/j.jenvman.2020.110843. [DOI] [PubMed] [Google Scholar]
- 7.Bárdossy A. Copula-based geostatistical models for groundwater quality parameters. Water Resour. Res. 2006 doi: 10.1029/2005WR004754. [DOI] [Google Scholar]
- 8.Abraj M, Hewaarachchi A. Joint return period estimation of daily maximum and minimum temperatures using copula method. Adv. Appl. Stat. 2021;66:175–190. doi: 10.17654/as066020175. [DOI] [Google Scholar]
- 9.Carreau J, Toulemonde G. Extra-parametrized extreme value copula: Extension to a spatial framework. Spat. Stat. 2020;40:100410. doi: 10.1016/j.spasta.2020.100410. [DOI] [Google Scholar]
- 10.D’Amico G, Petroni F. Copula based multivariate semi-markov models with applications in high-frequency finance. Eur. J. Oper. Res. 2018;267:765–777. doi: 10.1016/j.ejor.2017.12.016. [DOI] [Google Scholar]
- 11.Dolžan D, Bukovšek DK, Omladič M, Škulj D. Some multivariate imprecise shock model copulas. Fuzzy Sets Syst. 2022;428:34–57. doi: 10.1016/j.fss.2021.01.008. [DOI] [Google Scholar]
- 12.Li F, Zhou J, Liu C. Statistical modelling of extreme storms using copulas: A comparison study. Coast. Eng. 2018;142:52–61. doi: 10.1016/j.coastaleng.2018.09.007. [DOI] [Google Scholar]
- 13.Patton AJ. A review of copula models for economic time series. J. Multivar. Anal. 2012;110:4–18. doi: 10.1016/j.jmva.2012.02.021. [DOI] [Google Scholar]
- 14.Senarathne S, Drovandi CC, McGree J. Bayesian sequential design for copula models. TEST. 2020;29:454–478. doi: 10.1007/s11749-019-00661-7. [DOI] [Google Scholar]
- 15.Gräler B, Pebesma E. The pair-copula construction for spatial data: A new approach to model spatial dependency. Proced. Environ. Sci. 2011;7:206–211. doi: 10.1016/j.proenv.2011.07.036. [DOI] [Google Scholar]
- 16.Abraj, M. & Wijekoon, P. Analysis of wind speed and direction data in hambantota district of southern sri lanka. In Young Scientist Forum 6 th Symposium, 1–5 (National Science and Technology Commission, 2017).
- 17.Addo E, Chanda EK, Metcalfe AV. Spatial pair-copula model of grade for an anisotropic gold deposit. Math. Geosci. 2019;51:553–578. doi: 10.1007/s11004-018-9757-7. [DOI] [Google Scholar]
- 18.Durocher M, Chebana F, Ouarda TB. On the prediction of extreme flood quantiles at ungauged locations with spatial copula. J. Hydrol. 2016;533:523–532. doi: 10.1016/j.jhydrol.2015.12.029. [DOI] [Google Scholar]
- 19.Kazianka H, Pilz J. geoENV VII-Geostatistics for Environmental Applications. Springer; 2010. Spatial interpolation using copula-based geostatistical models; pp. 307–319. [Google Scholar]
- 20.Krupskii P, Huser R, Genton MG. Factor copula models for replicated spatial data. J. Am. Stat. Assoc. 2018;113:467–479. doi: 10.1080/01621459.2016.1261712. [DOI] [Google Scholar]
- 21.Li J, Bárdossy A, Guenni L, Liu M. A copula based observation network design approach. Environ. Model. Softw. 2011;26:1349–1357. doi: 10.1016/j.envsoft.2011.05.001. [DOI] [Google Scholar]
- 22.Marchant B, Saby N, Jolivet C, Arrouays D, Lark R. Spatial prediction of soil properties with copulas. Geoderma. 2011;162:327–334. doi: 10.1016/j.geoderma.2011.03.005. [DOI] [Google Scholar]
- 23.Musafer GN, Thompson MH, Kozan E, Wolff R. Spatial pair-copula modeling of grade in ore bodies: A case study. Nat. Resour. Res. 2017;26:223–236. doi: 10.1007/s11053-016-9314-3. [DOI] [Google Scholar]
- 24.Sohrabian B. Geostatistical prediction through convex combination of archimedean copulas. Spat. Stat. 2021;41:100488. doi: 10.1016/j.spasta.2020.100488. [DOI] [Google Scholar]
- 25.Gnann SJ, Allmendinger MC, Haslauer CP, Bárdossy A. Improving copula-based spatial interpolation with secondary data. Spat. Stat. 2018;28:105–127. doi: 10.1016/j.spasta.2018.07.001. [DOI] [Google Scholar]
- 26.Musafer GN, Thompson MH, Wolff RC, Kozan E. Nonlinear multivariate spatial modeling using nlpca and pair-copulas. Geogr. Anal. 2017;49:409–432. doi: 10.1111/gean.12126. [DOI] [Google Scholar]
- 27.Zhang Q, Shi X. A mixture copula bayesian network model for multimodal genomic data. Cancer Inform. 2017;16:1176935117702389. doi: 10.1177/1176935117702389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huang W, Dong S. Joint distribution of significant wave height and zero-up-crossing wave period using mixture copula method. Ocean Eng. 2021;219:108305. doi: 10.1016/j.oceaneng.2020.108305. [DOI] [Google Scholar]
- 29.Lin Y, Dong S, Tao S. Modelling long-term joint distribution of significant wave height and mean zero-crossing wave period using a copula mixture. Ocean Eng. 2020;197:106856. doi: 10.1016/j.oceaneng.2019.106856. [DOI] [Google Scholar]
- 30.Nelsen RB. An Introduction to Copulas. Springer; 2007. [Google Scholar]
- 31.Tobler WR. A computer movie simulating urban growth in the detroit region. Econ. Geogr. 1970;46:234–240. doi: 10.2307/143141. [DOI] [Google Scholar]
- 32.Zheng Y, Yang J, Huang JZ. Approximation of bivariate copulas by patched bivariate fréchet copulas. Insur. Math. Econ. 2011;48:246–256. doi: 10.1016/j.insmatheco.2010.11.002. [DOI] [Google Scholar]
- 33.Madani N, Emery X. A comparison of search strategies to design the cokriging neighborhood for predicting coregionalized variables. Stoch. Environ. Res. Risk Assess. 2019;33:183–199. doi: 10.1007/s00477-018-1578-1. [DOI] [Google Scholar]
- 34.Finley AO, Banerjee S, Carlin BP. spBayes: An R Package for univariate and multivariate hierarchical point-referenced spatial models. J. Stat. Softw. 2007 doi: 10.18637/jss.v019.i04. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kumar L, Mutanga O. Remote sensing of above-ground biomass. Remote Sens. 2017 doi: 10.3390/rs9090935. [DOI] [Google Scholar]
- 36.Addo E, Chanda EK, Metcalfe AV. Spatial pair-copula model of grade for an anisotropic gold deposit. Math. Geosci. 2018 doi: 10.1007/s11004-018-9757-7. [DOI] [Google Scholar]
- 37.Chen L-L, et al. Probabilistic assessment of slope failure considering anisotropic spatial variability of soil properties. Geosci. Front. 2022 doi: 10.1016/j.gsf.2022.101371. [DOI] [Google Scholar]
- 38.Scholz M. Validation of nonlinear pca. Neural Process. Lett. 2012;36:21–30. doi: 10.1007/s11063-012-9220-6. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
In implementing the proposed model in this paper, R software version 3.6.3 (R Development Core Team 2020) was used. Specifically, R package “spcopula” version 0.2-4 of Gräler was entirely used in this study (see http://r-forge.r-project.org/projects/spcopula/), including key dependent packages, such as “copula”, “VineCopula” version 2.1.8, “sp”, “spBayes” “MASS”,“fitdistrplus”. The data is available in “spBayes” package, where only non-zero values of biomass were considered in this study. For simulation study, “gstat” package was mainly used that facilitated the unconditional prediction of Gaussian random fields. Moreover, in comparing the proposed model with the most relevant NLPCA spatial copula model, MATLAB software was used, specifically “Nonlinear PCA toolbox” of Scholz (see http://www.nlpca.org/matlab.html)38. Data for developed method can be found in Dropbox: https://www.dropbox.com/s/cdbmx89fgjul9cg/bef_data.csv?dl=0.



