Abstract
Background.
Several studies have demonstrated that chromium (Cr) and cadmium (Cd) have adverse impacts on the environment and human health. These elements are present in electronic waste (e-waste) recycling sites. Several interpolation methods have been used to evaluate geographical impacts on humans and the environment.
Objectives.
The aim of the present paper is to compare the accuracy of inverse distance weighting (IDW) and ordinary kriging (OK) in topsoil analysis of e-waste recycling sites in Douala, Cameroon.
Methods.
Selecting the proper spatial interpolation method is crucial for carrying out surface analysis. Ordinary kriging and IDW are interpolation methods used for spatial analysis and surface mapping. Two sets of samples were used and compared. The performances of interpolation methods were evaluated and compared using cross-validation.
Results.
The results showed that the OK method performed better than IDW prediction for the spatial distribution of Cr, but the two interpolation methods had the same result for Cd (in the first set of samples). Results from Kolmogorov-Smirnov and Shapiro-Wilk tests showed that the data were normally distributed in the study area. The p value (0.302 and 0.773) was greater than 0.05 for Cr and for Cd (0.267 and 0.712). In the second set of samples, the OK method results (for Cd and Cr) were greatly diminished and the concentrations dropped, looking more like an average on the maps. However, the IDW interpolation gave a better representation of the concentration of Cd and Cr on the maps of the study area. For the second set of samples, OK and IDW for Cd and Cr had more similar results, especially in terms of root mean square error (RMSE).
Conclusions.
Many parameters were better identified from the RMSE statistic obtained from cross-validation after exhaustive testing. Inverse distance weighting appeared more adequate in limited urban areas.
Competing Interests.
The authors declare no competing financial interests
Keywords: Douala, IDW, interpolation, ordinary kriging, e-waste
Introduction
Soil is an important part of the urban ecosystem that directly and indirectly affects general quality of life.1 The study of spatial distribution and identification of contaminated urban soils and geographical areas with clear identification of source contamination is important for planning and decision making.2,3 Several studies have identified high variability in soils in urban areas.4,5 The use of spatial interpolations in geostatistics is increasingly required to understand and solve soil pollution problems in urban areas. Geostatistics describes patterns of spatial data. It provides an estimate and quantitative mapping of the pollution distribution with minimum variance.6 A number of factors can affect map quality. These include soil variability, as well as the sampling and interpolation method employed.7 Spatial interpolation techniques such as ordinary kriging (OK) and inverse distance weighted (IDW) are widely applied in soil geochemistry for the production of spatial distribution maps of soil parameters.8,3 Geostatistical methods can provide reliable estimates in locations that have not been sampled.9,10 Spatial prediction techniques, also known as spatial interpolation techniques, differ from classical modeling approaches as they incorporate information on the geographic position of the sample data points.11 The most common interpolation techniques evaluate the estimate for a property at any given location by a weighted average of nearby data.12
Questions on the appropriateness of various interpolation methods have evolved, as there is a wide range of these methods. Several comparative studies of relative accuracy have been performed for soil quality parameters. The methods used include geostatistical kriging-based techniques and the IDW method for deterministic interpolation.13 Both methods rely on the similarity of nearby sample points to estimate values and create a surface. Deterministic techniques use mathematical functions for interpolation. Geostatistics relies on both statistical and mathematical methods, which can be used to create surfaces and assess the credibility of the predictions. From a theoretical standpoint, OK is the optimal interpolation method.14 However, the correct application of OK requires an accurate determination of the spatial structure via semivariogram construction and model-fitting.12 Many studies have compared IDW and kriging to evaluate the best prediction method. In some cases, the performance of OK was reported to be better than IDW.14,15,16,17,18 Research was conducted by Laslett et al. to evaluate some interpolation methods for estimation of surface soil pH.19 They applied IDW and kriging methods over data obtained from digital elevation models and climate for estimations. They found the kriging method for interpolation to be the most suitable method.20 Dayani et al. and Hooker and Nathanail used the simple kriging estimator for mapping the pollution of heavy metals in order to estimate the concentration of lead in unsampled areas using the OK error maps to evaluate and control the error of the predicted map at unsampled locations.21,22
In other studies, IDW generally out-performed OK.13 The best results for mapping soil organic matter contents and soil nitrate levels were obtained with the IDW interpolation technique.23 The results have often been mixed.24,25,26
Given the variability of results obtained by previous studies, the present study aims to identify the best method for environmental and human health risk evaluation in specific locations in urban areas. This will be done by determining the spatial variability of selected heavy metals in the soil; assessing the accuracy of the IDW and OK interpolation techniques for mapping chromium (Cr) and cadmium (Cd) in the soil of electronic waste (e-waste) recycling sites in Douala, Cameroon; and identifying the spatial prediction method that best illustrates the spatial variability of the environmental and human risk exposure of certain heavy metals. This will enable the identification of the best method for areas where people are at greatest risk with the most urgent need for remediation measures.
Abbreviations
- e-waste
Electronic waste
- IDW
Inverse distance weighting
- OK
Ordinary kriging
- RMSE
Root mean square error
Methods
The study area is located in Douala, Cameroon. Douala city is situated at latitude 4°1′north and longitude 9°45′ east, on the Wouri estuary, approximately 50 km from the opening of the estuary into the Atlantic Ocean. Douala is divided into 6 districts governed by councils, where a mayor is elected by those councils to preside over the city. The city currently houses about 3.5 million people in a nucleated settlement pattern.27 The central area of the city, Akwa, has banks, commercial enterprises and other small-scale businesses. The topography surrounding Douala is characterized by a gentle slope from an altitude of approximately 57 m in the east to approximately 3 m along the Wouri River in the west.27 The sloping topography and high run-off rate in the Wouri estuarine system cause groundwater levels in the city to be shallow and above the soil surface in some areas. The city is consequently subjected to frequent severe floods almost year-round. This applies in particular to Mabanda and Bonendale in Bonaberi to the north and the Youpwe area in the south, as illustrated by Guevart et al. in a computer-generated view of the flood prone areas.27
Ngodi, Makea and New Bell are hotspots with the highest concentration of e-waste recycling activities in Douala. They are located in the town and are surrounded by other economic activities.
Sampling
A total of 30 soil samples, 10 per area, were collected from Makea, Ngodi and New Bell. Collection took place in the first half of 2017. The samples were collected with the aid of a soil auger at a depth of 5 to 15 cm, representing the topsoil. Each point of sample collection was recorded with a global positioning system with a precision error of ±3 m. This was used to obtain the altitude, longitude, and latitude figures of the locations, which were later imported into a geographic information system (ArcGIS 10.1). The samples were stored in aluminum foil before they were transferred into plastic bags and bottles. As the collection was done randomly, the recycling sites were not uniformly distributed.
The samples were analyzed with an atomic absorption spectrometer. A representative sample was first pulverized using a mortar and pestle. About 1 g of each sample was weighed into the dry digesting tube. A total of 5 ml of concentrated perchloric acid was added in the ratio and stirred. The digesting tube was placed in a water bath set at 100°C and boiled for 2 hours. To avoid caking, the sample was shaken vigorously at 20-minute intervals and the resulting solution, referred to as stock solution, was filtered and made up to 50 ml with distilled water, and stored in polyethylene bottles prior to instrumental analysis. The stock solution was used to directly determine the elements (Cr and Cd) with an atomic absorption spectrometer. The distribution of the points is shown in Figure 2. The slope degree and aspect were measured with the processing of an Advanced Spaceborne Thermal Emission and Reflection Radiometer image.
Spatial and statistical methods
The hierarchical Bayesian Poisson mixture model includes a prior distribution for the unknown parameters in the environment as well as the data distribution. The Poisson distribution is a model commonly used to count data. The Poisson distribution assumes that the variance and mean of the data are equal. This is not the case for the data that were used in this study. However, it is a good instrument that allows inference and estimation using software and a mixed model.
Hotspot analysis
Hotspot analysis is both a statistical and spatial method used in estimating hotspot concentration of particular elements or features. In the field of spatial interpolation, the Hot Spot Analysis tool (from ArcGIS) calculates the Getis-Ord Gi* statistic for each feature in a dataset. The resultant z-scores and p-values identify where features with either high or low values cluster in space. This tool works by looking at each feature within the context of neighboring features. A feature with a high value is interesting, but may not be a statistically significant hotspot. To be a statistically significant hotspot, a feature will have a high value and be surrounded by other features with high values as well. One of its assumptions is that the collected points should be equally distributed, with the same distance between each point collected;9 which was not the case in the field.28
Geoadditive models, spatially adaptive models and mixed models
Introduced by Kammann and Wand, geoadditive models analyze the spatial distribution of the variables while accounting for possible non-linear covariate effects.29 Moreover, the geoadditive model can be extended to include generalized responses, small area estimation, longitudinal data, missing data, and so on.30 It gained popularity in applied research as a flexible and interpretable regression technique because it maintains the assumption of additivity of the covariates effects, allowing the presence of nonlinear relationships with the response variable.31
The linear mixed model representation is a useful instrument because it allows estimation using mixed model methodology and software. They represent such effects by merging an additive model that accounts for the non-linear relationship between the variables, and that accounts for the spatial correlation by expressing both as a linear mixed model.32
Geostatistical analysis
There are two main categories of interpolation methods. The deterministic interpolation techniques (IDW, radial basis functions, global polynomial interpolation) that use mathematical functions such as Euclidean distance and the family of geostatistics which relies on both mathematical and statistical models that take positive autocorrelation into account.33 The present study compared the OK and IDW methods for Cr and Cd soil interpolation.
Ordinary kriging and inverse distance weighting methods
Ordinary kriging and IDW are the most popular spatial interpolation methods used in the field of environmental studies. Both methods have been developed based on the assumption that nearby points have more correlations and similarities than distant observations. The IDW interpolation is largely a reflection of Waldo Tobler's first law in geography which states that “everything is related to everything else, but near things are more related than distant things”.34 The IDW method assumes that the distance between neighbors is proportional to the similarities and the rate of correlations between them; this is defined as a distance reverse function of every point from nearby points.35 The IDW method works best with evenly distributed points in an area. It is possible to control the significance of known points upon the interpolated values, based on their distance from the output point with IDW. The weights for samples in IDW decrease with an increase in distance between the known samples and the estimated points. These weights are controlled by weighting powers, so that greater powers reduce the effect of farther estimated points and smaller powers distribute the weights more uniformly among the neighbors' points. Inverse distance weighting does not make assumptions about spatial relationships except for the basic assumption that nearby points will be more closely related than distant points to the value at the interpolated location.
The main factors affecting the accuracy of IDW interpolation are the “rate” of decreasing weight (defined as the power parameter of distance ‘p'), the size of the neighborhood and the number of neighbors which are relevant for the accuracy of results.
Inverse distance weighting was calculated using the Equation 1.36
where, u is the estimation location, , , , 1 , ui, i = 1,…., n, are the locations of the sample points within the neighborhood, Z*(u) is the inverse distance estimate at the estimation location, n is the number of sample points, λi, i = 1,…, n, are the weights assigned to each sample point, and Z(ui), i = 1,…, n, are the conditioning data at sample points. The weights are determined using Equation 2:
where, di are the Euclidian distances between estimation location and sample points and exponent p is the power or distance exponent value. Note that the sum of the inverse distance weights λi, i = 1,…, n, is equal to 1, that is (Equation 3):
The value applied for the power p varies generally from 1 to 4 and are commonly recommended. One of the main factors affecting the accuracy of the IDW is the value of the power parameter.
Ordinary kriging provides an estimate of an unsampled point based on the weighted average of observed neighboring points within a specific area. At the difference of IDW, the OK method is not deterministic but extends the proximity weighting approach of IDW to incorporate random components where the exact point is unknown. The weights in OK are decided based on the spatial structure parameters of a variogram which measures the relationships between squared differences between paired samples and their distances.8 The OK method samples input data and models the relationship between the variance in value and distance between points. The spatial autocorrelation between measured sample points was examined using semivariogram/covariance. Anisotropy semivariogram was used to model the spatial relationship in the dataset and to find the best fit model that passes through the points in the semivariogram.9 In order to estimate the spatial and statistical relationship as well as perform the interpolation and calculate the surface, the OK method used the semivariogram.
The OK equation is outlined in Equation 4:
where, Ẑ (s0) is the predicted location, λi is the unknown weight of the measured value of pairs of points at ith location, Z (si) is the measured value of pairs of points at ith location, and N is the number of measured values of pairs of points multiplied by the distance h.
The normality of data was tested using the Kolmogorov-Smirnov and Shapiro-Wilk normality test. These tests are used to detect sample of data normality from a population with a specific distribution.
When performing IDW and OK, cross validation is part of the output of the Geostatistical Analyst Toolbar results in ArcGIS 10.1. Cross validation provides summary measures of error and allows comparison between interpolation methods. The parameters used for the comparison are the mean error and root mean square error (RMSE). The mean error is the average difference between observed and predicted values. The RMSE shows how well the model predicts observed values. Low values for both error statistics indicate a better model.37
The mean error is used to determine the degree of bias in the estimates and is calculated using Equation 5.
where, ME is the mean error. The RMSE provides a measure of the error size that is it sensitive to outliers. The RMSE value can be calculated with Equation 6.
The mean absolute error provides an absolute measure of the size of the error. Mean absolute error is calculated with Equation 7.
where, MAE is the mean absolute error.
The mean error gives an indication of how well the data value fits into the neighborhood of the surrounding values. It is the result of the difference between the measured value and the estimated cross validation. The closer the average of the cross validation error is to zero, the error decreases; there is no apparent bias. Overestimation and underestimation of the model is indicated by a positive and negative bias, respectively. Thus, the mean standardized error is the mean error divided by the standard deviation where the mean error value depends on the data scale. The value of the mean standardized error should be as close to zero as possible. For an optimum result, the predictions should be as close to the measured values as possible. To assess the accuracy of the method, the RMSE is very useful; it indicates how closely the model predicts the measured values. The estimates are more accurate as long as the RMSE is close to zero. The average standard error is calculated to evaluate the deviation from the observation. If this value is close to zero, then the deviation from the observation is minimal.38
The variability in prediction is correctly assessed if the average standard error is close to the root-mean-squared prediction error. If the average standard error is greater than the root-mean-squared prediction error, the variability of the predictions is overestimated and vice versa. Therefore, the result of each prediction error divided by its estimated prediction standard error should be similar. Thus, if the prediction standard error is valid, the root-mean-square standardized error should be close to one. Greater than one root-mean-square standardized error values indicate an underestimation and less than one indicates that the prediction errors are overestimated.39,40
Geographic information system mapping procedure
The IDW and OK of the spatial analyst extension in ArcGIS 10.1 was used in the mapping of variables. All the measured points (Cr and Cd) were used in the calculation of each interpolated cell based on the results of the laboratory analysis of each sample. Thus, 15 points were removed randomly from the current data points from their original analysis. Then a test was conducted to see which method better predicts the concentrations in the samples that were left out. A feature dataset (Douala shapefile) was used for the mask. The legend classification was based on the concentration levels of pollutants in the hotspot.
Results
Table 1 shows the Cd and Cr concentrations that were measured in the laboratory. Each is attached to a particular location with global positioning system points. The values presented in Table 1 were used for the interpolations.
Table 1.
Sample no | Cd value mg/l | Cr value mg/l | Sample no | Cd value mg/l | Cr value mg/l | Sample no | Cd value mg/l | Cr value mg/l |
---|---|---|---|---|---|---|---|---|
Tests 21* | 10 | 20 | ||||||
Tests 25* | 10 | 20 | ||||||
NGO 1 | 30 | 90 | MAK 1 | 20 | 140 | NB 1 | 10 | 80 |
NGO 2 | 20 | 8 | MAK 2 | 20 | 130 | NB2 | 20 | 100 |
NGO 3 | 20 | 40 | MAK 3 | 20 | 80 | NB3 | 30 | 40 |
NGO 4 | 30 | 10 | MAK 4 | 20 | 150 | NB4 | 20 | 80 |
NGO 5 | 30 | 80 | MAK 5 | 20 | 130 | NB 5 | 20 | 40 |
NGO 6 | 30 | 40 | MAK 6 | 20 | 200 | NB6 | 30 | 140 |
NGO 7 | 30 | 130 | MAK 7 | 20 | 190 | NB7 | 20 | 90 |
NGO 8 | 30 | 70 | MAK 8 | 20 | 80 | NB 8 | 30 | 90 |
NGO 9 | 25 | 130 | MAK 9 | 20 | 90 | NB9 | 30 | 20 |
NGO 10 | 20 | 30 | MAK 10 | 10 | 90 | NB 10 | 30 | 20 |
*control sites
Abbreviations: S, site: NGO, Ngodi; MAK, Makea; NB, New Bell
Test site 21 and 25 are samples of the topsoil from a natural environment (control). As can be seen, there was a significant difference in concentrations between control samples and e-waste recycling sites samples.
One assumption of the geostatistical analysis is that the data have to be normally distributed. In order to assess the normality of data, the Kolmogorov-Smirnov and Shapiro-Wilk tests were applied. The results (Tables 2 and 3) showed that the data distributions were a good fit. The calculated p-values were greater than 0.05 and the data points were distributed around the line.
Table 2.
Kolmogorov-Smirnova | Shapiro-Wilk | |||||
---|---|---|---|---|---|---|
Statistics | ddl | Signification | Statistics | ddl | Signification | |
Cr value mg/l | 0.302 | 35 | 0.000 | 0.773 | 35 | 0.000 |
a Significance correction of Lilliefors
Abbreviation: ddl, Degree of freedom
Table 3.
Kolmogorov-Smirnova | Shapiro-Wilk | |||||
---|---|---|---|---|---|---|
Statistics | ddl | Signification | Statistics | ddl | Signification | |
Cd value mg/l | 0.267 | 35 | 0.000 | 0.712 | 35 | 0.000 |
a Significance correction of Lilliefors
Abbreviation: dll, Degree of freedom
Table 2 gives the result of the Kolmogorov-Smirnov and Shapiro-Wilk normality tests for Cr. For the two tests applied, the data are normally distributed in the study area. The p value (0.302 and 0.773) is greater than 0.05.
Observed values
Figure 2 shows the distribution of data points around the line, indicating that the data points are normally distributed along the line.
Table 3 gives the result of the normality test of Kolmogorov-Smirnov and Shapiro-Wilk for Cd. For the two tests applied, the data are normally distributed in the study area with p value of 0.267 and 0.712, which is greater than 0.05.
Observed value
Figure 3 shows the distribution of Cd data points around the line, indicating that the data points are normally distributed along the line and therefore over the study area.
Results of the interpolated maps comparing the two techniques
Figure 4 presents the Cr interpolation over the study area. The western and southwestern parts of the hotspot had the highest concentrations of Cr. However, the eastern and the northeastern areas were less polluted by Cr. The OK interpolation method was used to derive this map.
Figure 5 presents the Cd interpolation over the study area. The western part of the hotspot had the highest concentrations of Cd. The other parts of the study area were less polluted and are represented with the color yellow. Furthermore, the yellow color indicates an average concentration over the main part of the study area, while the northern part had the lowest concentration.
Figure 6 presents the IDW Cd interpolation over the study area. The western part of the hotspot had the highest concentrations of Cd. Other parts of the map which were less polluted are represented by the color yellow and are predominantly in the north and northeast.
Figure 7 shows the map of Cr interpolation over the study area. The west, southwest and portions of the north and northwest of the hotspot were the areas with the highest concentration of Cr. The eastern and the northeastern parts were less polluted with Cr.
Results of the interpolated maps by the two techniques (based on samples left out)
In order to further assess the performance of the two methods, 15 samples were randomly removed from the initial set of data. Then, the same analyses were performed with samples left out. The results of the interpolated maps are shown in Figures 8–11.
Figure 8 presents cadmium interpolation over the study area. The map has almost a uniform concentration around samples points all over the study area. It appears that the interpolation made a map averaging the concentration of the pollutant.
Figure 9 presents Cr interpolation over the study area. The concentration of Cr noticeably increases from the western to the eastern part of the hotspot.
Figure 10 presents Cd interpolation over the study area. The western and southeastern part of the hotspot had the highest concentration of Cd. However, the northern and the northeastern areas were less polluted by Cd.
Figure 11 shows the map of Cr interpolation over the study area. The west, southwest and portions of the north and northwest had the highest concentrations of Cr. The northeastern parts were less polluted by Cr.
Discussion
The OK map results provided a comprehensive distribution of pollutants in the study area, with the potential concentration values. The OK results (Cd and Cr) did not fully take into consideration the reality of the urban area in the hotspot. From 1–100 m distance, concentrations varied greatly. However, the IDW maps gave a more reliable representation of the situation. The e-waste recycling sites were relatively small, and therefore the concentrations were not spread over a wide spectrum in the study area. The IDW interpolation gave a better representation of the concentration of Cd and Cr in the maps of the study area.
Comparison of interpolation performance
For Cr, the comparison of cross validation results with OK and IDW showed varying errors from one point to the other. For the OK method (Table 4), the error ranged from ±1 to ±53, showing that between locations, the error was minimized or maximized, depending on the measured values. In the case of IDW, the error ranged from ±1 to ±73 with a mean error of 2.40, compared to −0.75 obtained in OK.
Table 4.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 219.00 | 175.5972 | 55.46796 |
Minimum | 119.00 | 157.4815 | −53.4602 |
Mean | 168.70 | 167.9408 | −0.75918 |
The mean of the predicted values was 167.94 for the OK method and 171 for the IDW method (Table 5). Of the two methods, the result from OK was closer to the actual mean of the measured values (168.7).
Table 5.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 219.00 | 202.7136 | 73.78169 |
Minimum | 119.00 | 140.5065 | −73.9935 |
Mean | 168.70 | 171.0687 | 2.368655 |
In the case of Cd, the comparison of cross validation results between OK and IDW shows that the error varies from one point to the other. For the OK method (Table 6), the error range of ±1 to ±8.3 shows that the error is minimized or maximized, depending on the measured values. However, the error range was not wide. In the case of IDW, the error ranged from ±1 to ±9.0 with a mean error of −0.40, compared to −0.26 for OK.
Table 6.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 39.5 | 38.0457 | 8.369907 |
Minimum | 26.5 | 29.31114 | −7.68887 |
Mean | 34.13 | 33.86917 | −0.26083 |
For Cd interpolation with OK and IDW, the accuracy in the results were similar, with IDW having a slight advantage. In this case, the results were very close to each other, with IDW showing slightly better results compared to OK. The mean of predicted values was 33.86 for the OK method and 33.72 for the IDW method (Table 7). Of the two methods, the result obtained from the OK method was closer to the mean of the measured values, but the IDW method reported a better difference.
Table 7.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 39.5 | 38.17815 | −9.05455 |
Minimum | 26.5 | 26.73521 | 8.054287 |
Mean | 34.13 | 33.72236 | −0.40764 |
Validation results: comparison of interpolation performance
The cross-validation comparison of the OK and IDW methods for Cd is shown in Figure 12. The mean value was −0.2 for OK and −0.2 for IDW. The root mean square was 4.21 for OK and 4.28 for IDW. From these two values, it can be concluded that OK and IDW methods can be used interchangeably as both are good for Cd modelling.
For the OK, the standardized mean error recorded was −0.04. This value was very close to zero, showing that the model was good Cd interpolation. The RMSE was 1.07 while the average standard error was 3.83. The OK method produced a good interpolation model for Cd prediction in the study area. It is important to note that the IDW method does not have this information listed in order to be compared with the OK technique. However, the root mean squares are close, showing that the two models are a good fit for Cd.
The cross-validation comparison of the OK and IDW methods for Cr is shown in Figure 13. The mean values were −0.75 and 2.36 for the OK and IDW methods, respectively. The root mean square was 25.94 for OK and 32.40 for IDW. Using these values, the OK method produced a better result compared to IDW for Cr.
For the OK method, the standardized mean error was −0.02, showing that the model was good for Cr interpolation. The root-mean-square error was 1.03 and the average standard error was 24.91. Thus, the OK result produced a better interpolation model for Cr prediction in the study area compared to IDW.
Comparison of maps results after leaving out samples
After leaving out 15 samples points in order to run the analysis again, The OK and IDW map results provided a comprehensive distribution of pollutants in the study area, with potential concentration values. The OK results (Cd and Cr) were greatly diminished and the concentrations dropped, looking more like an average. This is due to the fact that the included number of samples was smaller compared to the initial set of samples. The IDW interpolation gave a better representation of the concentration of Cd and Cr on the maps of the study area. In the case of Cr, the levels of concentration are more visible with a wider spatial occupation compared to the map with initial samples.
Comparison of interpolation performance after leaving out samples
For Cr, the comparison of cross validation results with OK and IDW showed varying errors from one point to the other.
For the OK method (Table 8), the error ranged from ±9 to ±13, showing that between locations, the error was minimized or maximized, depending on the measured values.
Table 8.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 219 | 188.42 | 13.03643 |
Minimum | 119 | 139.21 | 9.494681 |
Mean | 169 | 163.81 | 11.26555 |
In the case of IDW, the error was 1 with a mean error of 1, compared to 11.26 obtained in OK (Table 9). The error was brought to 1 for IDW, a reduction compared to the initial sample points interpolated.
Table 9.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 219 | 219 | 1 |
Minimum | 119 | 119 | 1 |
Mean | 169 | 169 | 1 |
For Cd interpolation with OK and IDW, the accuracy was similar to the initial set of samples (Tables 10 and 11). The results were very close to each other.
Table 10.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 39.5 | 34.7083 | 1.33377 |
Minimum | 26.9 | 33.9884 | 1.00824 |
Mean | 33.20 | 34.215333 | 1.32286 |
Table 11.
Measured | Predicted | Error | |
---|---|---|---|
Maximum | 39.5 | 39.5 | 1 |
Minimum | 26.9 | 26.9 | 1 |
Mean | 33.20 | 33.20 | 1 |
The mean of the predicted values was 34.21 for the OK method and 33.20 for the IDW method (Tables 10 and 11). From the two methods, the results obtained from both were closer to the mean of the measured values, but the IDW reported a smaller difference in terms of error.
The cross-validation comparison of the OK and IDW methods after dropping some of the samples for Cd is shown in Figure 14. The mean value was −0.12 for OK and −0.34 for IDW. The root mean square was 3.95 for OK and 5.19 for IDW. From these two values, it can be concluded that the OK and IDW methods can be used interchangeably as both are good for Cd modelling.
For the OK method, the standardized mean error was −0.02. This value was very close to zero, showing that the model was a good fit for Cd interpolation. The RMSE was 1.001, while the average standard error was 3.95. The OK and IDW methods produced good interpolation models with almost identical results for Cd prediction in the study area.
The cross-validation comparison of the OK and IDW methods for the samples of Cr left out is shown in Figure 15. The mean value was −0.59 for OK and 2.70 for IDW. The root mean square was 26.24 for OK and 26.99 for IDW. These results indicate that the OK and IDW methods can be used interchangeably, as both are good fits for Cr modelling.
For OK, the standardized mean error recorded was -0.06. This value was very close to zero, showing that the model was good for Cr interpolation. The RMSE was 1.130 and the average standard error was 23.59. Based on the mean, the OK method produced a good interpolation model for Cr prediction in the study area with samples left out. However, the root-mean-squared provided very similar results for OK and IDW.
Conclusions
The present study applied techniques based on fundamental theorems of surfaces to interpolate the spatial patterns of Cd and Cr in soils from e-waste recycling sites. It provided quantitative information on the best interpolation method for heavy metals in e-waste sites located in urban areas. Many studies have looked at the interpolation methods of heavy metals in soils,45 but few have conducted a comparison of interpolation methods for heavy metals coming from e-waste recycling sites in urban areas of developing countries. The findings demonstrated that OK had more accurate results compared to IDW for Cr. The OK interpolation method was more accurate than IDW and this aligns with conclusions of several other studies.41,22,42,43 Therefore, OK is more suitable for Cr, however not for Cd in urban areas, which was a slight advantage in favor of IDW. Ordinary kriging was considered to have generally superior performance compared to IDW as the prediction error was lower for the OK method. Ordinary kriging was more accurate for Cr, but it was on the same level as IDW for Cd. In order to validate this conclusion, 15 samples were left out and the same analyses were carried out to evaluate the best interpolation method. The OK method showed better results using this technique. However, in urban areas where people are concentrated in a limited space, the results showed IDW to be more accurate in evaluating exposure risk in specific locations. These spatial interpolation methods had various decision parameters. However, by adjusting the power parameter of IDW, a better and smoother result can be obtained. This partly explains its popularity as an interpolation method, especially in evaluating water pollution.44 The mean error and RMSE showed that OK was more suitable for Cr, but not for Cd compared to IDW results. Therefore, for interpolation of heavy metals in urban areas, OK is not considered to be the best method for interpolation. It is, however, recommended that the accuracy of the interpolation models be checked by evaluating the error of each model and using the one with the lowest error. These results can assist geographic information system specialists in selecting the best method for assessing pollution levels of heavy metals in e-waste sites.
Acknowledgments
This work is supported by the African Union through the Pan African University, Life and Earth Sciences Institute, and by UNESCO through the ANESI exchange program. The authors acknowledge Dr. E.A. Akinyele for his contribution in language revision and useful suggestions.
References
- 1.van Kamp I, Leidelmeijer K, Marsman G, de Hollander A. Urban environmental quality and human well-being: Towards a conceptual framework and demarcation of concepts; a literature study. Landsc Urban Plan [Internet] 2003 Sep 15;65(1–2):5–18. doi: 10.1016/S0169-2046(02)00232-3. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 2.Wu S, Xia X, Lin C, Chen X, Zhou C. Levels of arsenic and heavy metals in the rural soils of Beijing and their changes over the last two decades (1985–2008) J Hazard Mater [Internet] 2010;179(1–3):860–8. doi: 10.1016/j.jhazmat.2010.03.084. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [PubMed] [Google Scholar]
- 3.Zhang C. Using multivariate analyses and GIS to identify pollutants and their spatial patterns in urban soils in Galway, Ireland. Environ Pollut [Internet] 2006 Aug;142(3):501–11. doi: 10.1016/j.envpol.2005.10.028. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [PubMed] [Google Scholar]
- 4.Madrid L, Diaz-Barrientos E, Ruiz-Cortes E, Reinoso R, Biasioli M, Davidson CM, Duarte AC, Grcman H, Hossack I, Hursthouse AS, Kralj T, Ljung K, Otabbong E, Rodrigues S, Urquhart GJ, Ajmone-Marsan F. Variability in concentrations of potentially toxic elements in urban parks from six European cities. J Environ Monit [Internet] 2006 Nov;8(11):1158–65. doi: 10.1039/B607980F. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [PubMed] [Google Scholar]
- 5.Wei B, Yang L. A review of heavy metal contaminations in urban soils, urban road dusts and agricultural soils from China. Microchem J [Internet] 2010 Mar;94(2):99–107. doi: 10.1016/j.microc.2009.09.014. [cited 2019 Dec 12] Available from: [DOI] [Google Scholar]
- 6.Isaaks EH, Srivastava RM. An introduction to applied geostatistics. New York, NY: Oxford University Press; 1989. p. 561. p. [Google Scholar]
- 7.Sadler EJ, Busscher WJ, Bauer PJ, Karlen DL. Spatial scale requirements for precision farming: a case study in the southeastern USA. Agron J [Internet] 1998 Mar;90(2):191–7. doi: 10.2134/agronj1998.00021962009000020012x. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 8.Webster R, Oliver MA. 2nd ed. Chichester, England: John Wiley & Sons; 2007. Geostatistics for environmental scientists; p. 317. p. [Google Scholar]
- 9.Geostatistical analyst tutorial [Internet] Redlands, CA: Esri; 2010. p. 57. [cited 2018 Jan 23] p. Available from: http://help.arcgis.com/en/arcgisdesktop/10.0/pdf/geostatistical-analyst-tutorial.pdf. [Google Scholar]
- 10.Kerry R, Oliver MA. The analysis of ranked observations of soil structure using indicator geostatistics. Geoderma [Internet] 2007 Aug 15;140(4):397–416. doi: 10.1016/j.geoderma.2007.04.020. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 11.Cressie NA. Statistics for spatial data. Revised ed. New York, NY: John Wiley & Sons; 1993. p. 900. p. [Google Scholar]
- 12.Reza SK, Sarkar D, Baruah U, Das TH. Evaluation and comparison of ordinary kriging and inverse distance weighting methods for prediction of spatial variability of some chemical parameters of Dhalai district, Tripura. Agropedology. 2010; 20(1):38–48. [Google Scholar]
- 13.Nalder IA, Wein RW. Spatial interpolation of climatic normals: test of a new method in the Canadian boreal forest. Agric For Meteorol [Internet] 1998 Dec 4;92(4):211–25. doi: 10.1016/S0168-1923(98)00102-6. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 14.Hosseini E, Gallichand J, Marcotte D. Theoretical and experimental performance of spatial interpolation methods for soil salinity analysis. Trans ASAE. 1994; 37(6):1799–1807. [Google Scholar]
- 15.Dalthorp D, Nyrop J, Villani M. Estimation of local mean population densities of Japanese beetle grubs (Scarabaeidae: Coleoptera) Environ Entomol [Internet] 1999 Apr 1;28(2):255–65. doi: 10.1093/ee/28.2.255. [cited 2019 Dec 11] Available from: Subscription required to view. [DOI] [Google Scholar]
- 16.Kravchenko A, Bullock DG. A comparative study of interpolation methods for mapping soil properties. Agron J [Internet] 1999 May;91(3):393–400. doi: 10.2134/agronj1999.00021962009100030007x. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 17.Kravchenko AN. Influence of spatial structure on accuracy of interpolation methods. Soil Sci Soc Am J [Internet] 2003 Sep;67(5):1564–71. doi: 10.2136/sssaj2003.1564. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 18.Reinstorf F, Binder M, Schirmer M, Grimm-Strele J, Walther W. Comparative assessment of regionalisation methods of monitored atmospheric deposition loads. Atmos Environ [Internet] 2005 Jun;39(20):3661–74. doi: 10.1016/j.atmosenv.2005.03.006. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 19.Laslett GM, McBratney AB, Pahl PJ, Hutchinson MF. Comparison of several spatial prediction methods for soil pH. Eur J Soil Sci [Internet] 1987 Jun;38(2):325–341. doi: 10.1111/j.1365-2389.1987.tb02148.x. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 20.Mousavi S.R, Sarmadian F, Dehghani S, Sadikhani MR, Taati A. Evaluating inverse distance weighting and kriging methods in estimation of some physical and chemical properties of soil in Qazvin Plain. Eurasian J Soil Sci. 2017; 6(4):327–36. [Google Scholar]
- 21.Mahmoud D, Jahangard M, Mahdi N. Geostatistical analysis of Pb, Zn and Cd concentration in soil of Sepahanshahr suburb (south of Esfahan) J Water Soil Agric Sci Technol. 2010 Winter;23(4):67–76. [Google Scholar]
- 22.Hooker PJ, Nathanail CP. Risk-based characterisation of lead in urban soils. Chem Geol [Internet] 2006 Feb 28;226(3–4):340–51. doi: 10.1016/j.chemgeo.2005.09.028. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 23.Gotway CA, Ferguson RB, Hergert GW, Peterson TA. Comparison of kriging and inverse-distance methods for mapping soil parameters. Soil Sci Soc Am J [Internet] 1996 Jul;60(4):1237–47. doi: 10.2136/sssaj1996.03615995006000040040x. [cited 2019 Dec 11] Available from: Subscription required to view. [DOI] [Google Scholar]
- 24.Schloeder CA, Zimmerman NE, Jacobs MJ. Comparison of methods for interpolating soil properties using limited data. Soil Sci Soc Am J. 2001;65:470–9. [Google Scholar]
- 25.Mueller TG, Pierce F, Schabenberger O, Warncke DD. Map quality for site-specific fertility management. Soil Sci Soc Am J [Internet] 2001 Sep;65(5):1547–58. doi: 10.2136/sssaj2001.6551547x. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 26.Lapen DR, Hayhoe HN. Spatial analysis of seasonal and annual temperature and precipitation normals in southern Ontario, Canada. J Great Lakes Res [Internet] 2003;29(4):529–44. doi: 10.1016/S0380-1330(03)70457-2. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 27.Guevart E, Noeske J, Solle J, Essomba JM, Edjenguele M, Bita A, Mouangue A, Manga B. [Factors contributing to endemic cholera in Douala, Cameroon] Déterminats du cholera à Douala. Med Trop (Mars) 2006 Jun;66(3):283–91. French. [PubMed] [Google Scholar]
- 28.Tutorial: hot spot analysis (ArcGIS 10.1) [Internet] Redlands, CA: Esri; 2012. Aug 8, [cited 2019 Mar 13] Available from: http://www.arcgis.com/home/item.html?id=6626d5cc81a745f1b737028f7a519521. [Google Scholar]
- 29.Kammann EE, Wand MP. Geoadditive models. J R Stat Soc [Internet] 2003 Jan;52(1):1–18. doi: 10.1111/1467-9876.00385. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 30.Ruppert D, Wand MP, Carroll RJ. Semiparametric regression during 2003– 2007. Electron J Stat. 2009 Jan;3:1193–1256. doi: 10.1214/09-EJS525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Bocci C. [Florence, Italy]: University of Florence; 2009. Geoadditive models for data with spatial information [dissertation] p. 107. p. [Google Scholar]
- 32.Hastie T, Tibshirani R. Exploring the nature of covariate effects in the proportional hazards model. Biometrics [Internet] 1990 Dec;46(4):1005–16. [cited 2019 Dec 12] Available from: https://www.jstor.org/stable/2532444 Subscription required to view. [PubMed] [Google Scholar]
- 33.Johnston K, Ver Hoef JM, Krivoruchko K, Lucas N. Using ArcGIS geostatistical analyst. Redlands, CA: Esri; 2001. p. 306. p. [Google Scholar]
- 34.Hengl T. A practical guide to geostatistical mapping. 2nd ed. Amsterdam, Netherlands: Hengl; 2009. Nov, p. 291. p. [Google Scholar]
- 35.Yasrebi J, Saffari M, Fathi H, Karimian N, Moazallahi M, Gazni R. Evaluation and comparison of ordinary kriging and inverse distance weighting methods for prediction of spatial variability of some soil chemical parameters. Res J Biol Sci [Internet] 2009;4(1):93–102. [cited 2019 Dec 12] Available from: http://medwelljournals.com/abstract/?doi=rjbsci.2009.93.102. [Google Scholar]
- 36.Babak O, Deutsch CV. Statistical approach to inverse distance interpolation. Stoch Environ Res Risk Assess [Internet] 2009 Jul;23(5):543–53. doi: 10.1007/s00477-008-0226-6. [cited 2019 Dec 11] Available from: Subscription required to view. [DOI] [Google Scholar]
- 37.Deterministic methods for spatial interpolation [Internet] Redlands, CA: Esri; 2008. [cited 2018 Jan 23]; [about 1 screen] Available from: http://help.arcgis.com/en/arcgisdesktop/10.0/help/index.html#//003100000023000000.htm. [Google Scholar]
- 38.Prudhomme C, Reed DW. Mapping extreme rainfall in a mountainous region using geostatistical techniques: a case study in Scotland. Int J Climatol [Internet] 1999 Oct;19(12):1337–56. doi: 10.1002/(SICI)1097-0088(199910)19:12<1337::AIDJOC421>3.0.CO;2-G. [cited 2019 Dec 12] Available from: [DOI] [Google Scholar]
- 39.Krivoruchko K, Crawford CA. Assessing the uncertainty resulting from geoprocessing operations. In: Maguire DJ, Batty M, Goodchild MF, editors. GIS, spatial analysis, and modeling. Redlands, CA: ESRI Press; 2005. pp. 67–92. [Google Scholar]
- 40.Elena C, Xavier B. Modelling uncertainty using geostatistics, a case study in Ecuador; Proceedings of Spatial Accuracy 2016; 2016 Jul 5–8; Montpellier, France. Catania, Italy: International Spatial Accuracy Research Association; 2016. pp. 74–9. [Google Scholar]
- 41.Liu X, Wu J, Xu J. Characterizing the risk assessment of heavy metals and sampling uncertainty analysis in paddy field by geostatistics and GIS. Environ Pollut [Internet] 2006 May;141(2):257–64. doi: 10.1016/j.envpol.2005.08.048. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [PubMed] [Google Scholar]
- 42.Yang X, Wang L. Spatial analysis and hazard assessment of mercury in soil around the coal-fired power plant: a case study from the city of Baoji, China. Environ Geol [Internet] 2008;53(7):1381–8. doi: 10.1007/s00254-007-0747-1. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 43.Mahmoudabadi E, Sarmadian F, Savaghebi GH, Alijani Z. Accuracy assessment of geostatistical methods for zoning of heavy metals in soils of urban-industrial areas. Int Res J Appl Basic Sci. 2012; 3(5):991–9. [Google Scholar]
- 44.Madhloom HM, Al-Ansari N, Laue J, Chabuk A. Modeling spatial distribution of some contamination within the lower reaches of Diyala River using IDW interpolation. Sustain [Internet] 2018;10(1) doi: 10.3390/su10010022. [cited 2019 Dec 12]; Article 22 [13 p.]. Available from: [DOI] [Google Scholar]
- 45.Williams KE, Anderson SJ. Geostatistical methods for predicting soil moisture continuously in a Subalpine Basin. Am Soc Photogramm Remote Sensing [Internet] 2014 Apr;4:333–41. doi: 10.14358/PERS.80.4.333. [cited 2019 Dec 12] Available from: [DOI] [Google Scholar]
- 46.D'Acqui LP, Santi CA, Maselli F. Use of ecosystem information to improve soil organic carbon mapping of a Mediterranean island. J Environ Qual. 2007 Jan 9; 36(1):262–71. doi: 10.2134/jeq2005.0283. [DOI] [PubMed] [Google Scholar]
- 47.Hajizadeh Y, Mokhtari M, Faraji M, Mohammadi A, Nemati S, Ghanbari R, Abdolahnejad A, Fard RF, Nikoonahad A, Jafari N, Miri M. Trends of BTEX in the central urban area of Iran: a preliminary study of photochemical ozone pollution and health risk assessment. Atmos Pollut Res [Internet] 2018 Mar;9(2):220–9. doi: 10.1016/j.apr.2017.09.005. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 48.Lu GY, Wong DW. An adaptive inverse-distance weighting spatial interpolation technique. Comput Geosci [Internet] 2008 Sep;34(9):1044–55. doi: 10.1016/j.cageo.2007.07.010. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]
- 49.Oke AO, Sangodoyin AY, Ogedengbe K, Omodele T. Mapping of river water quality using inverse distance weighted interpolation in Ogun-Osun River Basin, Nigeria. Landsc Environ. 2013; 7(2):48–62. [Google Scholar]
- 50.odd MJ, Lowrance RR, Goovaerts P, Vellidis G, Pringle CM. Geostatistical modeling of the spatial distribution of sediment oxygen demand within a Coastal Plain blackwater watershed. Geoderma. 2010 Oct 15;159(1–2):53–62. doi: 10.1016/j.geoderma.2010.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Webster R, Oliver MA. Characterizing spatial processes: the covariance and variogram. In: Webster R, Oliver MA, editors. Geostatistics for environmental scientists. 2nd ed. Chichester, England: John Wiley & Sons; 2007. Chapter 4. [Google Scholar]
- 52.Zandi S, Ghobakhlou A, Sallis P. Evaluation of spatial interpolation techniques for mapping soil pH. 19th International Congress on Modelling and Simulation; 2011 Dec 12–16; Perth, Australia. Canberra, Australia: Modelling and Simulation Society of Australia and New Zealand; 2011. pp. 1153–9. [Google Scholar]
- 53.Zarco-Perello S, Simoes N. PeerJ [Internet] 2017. Nov 30, Ordinary kriging vs inverse distance weighting: spatial interpolation of the sessile community of Madagascar reef, Gulf of Mexico. [cited 2019 Dec 12];5. Article 4078 [24 p.] Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Zhang C, Tang Y, Xu X, Kiely G. Towards spatial geochemical modelling: use of geographically weighted regression for mapping soil organic carbon contents in Ireland. Appl Geochem [Internet] 2011 Jul;26(7):1239–48. doi: 10.1016/j.apgeochem.2011.04.014. [cited 2019 Dec 12] Available from: Subscription required to view. [DOI] [Google Scholar]