Abstract
This study proposes the development of a nonparametric regression model combined with geographically weighted regression. The regression model considers geographical factors and has a data pattern that does not follow a parametric form to overcome the problem of spatial heterogeneity and unknown regression functions. This study aims to model provincial food security index data in Indonesia with the GWSNR model, so finding the optimal knot point and the best geographic weighting is necessary. We propose the selection of optimal knot points using the Cross Validation (CV) and Generalized Cross Validation (GCV) methods. The optimal knot point will control the accuracy of the regression curve as we also consider the MSE value in showing the ability of the model. In addition, we determine the best geographic weighting and test the significance of the model parameters. We demonstrate the GWSNR model on food security index data. The best GWSNR model uses the Gaussian kernel weighting function and selects the optimal knot point as one-knot point based on the lowest CV and GCV values. Simultaneous and partial parameter test results show that there are 10 area classifications with different effects on each group of classification results. Some of the highlights of the proposed approach are:
-
•
The method is the development of a nonparametric regression model with geographic weighting, which combines nonparametric and spatial regression in modeling the national food security index.
-
•
There are three-knot points tested in nonparametric truncated spline regression and three geographic weightings in spatial regression. Then the optimal knot point and best bandwidth are determined using Cross Validation and Generalized Cross Validation.
-
•
This article will determine regional groupings in Indonesia in 2022 based on significant predictors in modeling the national food security index numbers.
Keywords: Food security index, Nonparametric regression, Geographic weighting function, Kernel function, Generalized cross-validation, Truncated spline
Method name: Geographically Weighted Spline Nonparametric Regression (GWSNR) Model
Graphical abstract
Specifications table
| Subject area: | Mathematics and Statistics |
| More specific subject area: | Statistics: Nonparametric Regression, Spatial Regression |
| Name of your method: | Geographically Weighted Spline Nonparametric Regression (GWSNR) Model |
| Name and reference of original method: | Sifriyani, I. N. Budiantara, S. H. Kartiko, and Gunardi, A new method of hypothesis test for truncated spline nonparametric regression influenced by spatial heterogeneity and application, Abstract and Applied Analysis. 2018 (2018). https://doi.org/10.1155/2018/9,769,150 |
| Resource availability: | National food security index and the predictors from publication of the agricultural data center and information system of the secretariat general of the ministry of agriculture. |
Background
Indonesia is committed to realizing sustainable development goals (SDGs), which include eliminating poverty and ending hunger, achieving food security, improving nutrition, and promoting sustainable agriculture. In achieving these targets, the National Food Agency has the task and role of coordinating, establishing, and implementing policies for preventing and handling food and nutrition insecurity [1]. There is an interest in sustainable development and meeting food needs, so researchers must discuss and research the food security index. This research aims to model and find factors influencing the national food security index.
Spatial effects and geographical factors influence food security index data. In statistics, spatial effects occur when the data shows heteroscedasticity and spatial autocorrelation. Regression analysis for geographically and locally oriented data can use the Geographically Weighted Regression (GWR) model. Fotheringham first introduced GWR in 1967 [2]. Research using GWR theory was carried out by Brunsdon et al. [3], Chasco et al. [4], Mennis et al. [5], Sefa mizrak et al. [6], Sifriyani et al. [[7], [8]], Fei Jiang et al. [9], and Abel Kebede Reda, et al. [10]. GWR development has also been carried out by statisticians such as [[11], [12], [13]]. The development of GWR carried out by statisticians is still in linear form. However, in reality, not all data is known to have a clear relationship pattern, or the regression curve is unknown [14]. So, nonparametric regression is an alternative approach to use in such cases.
Several types of non-parametric regression models are often discussed, such as Spline by Wahba, 1990; Green & Silverman, 1994; Budiantara et al., 1997; Budiantara, 2002. Kernel by Hardle, 1990, and Fourier Series by Antoniadis, 1994 and Wavelets by Antoniadis, 2001. Splines are segmented polynomials that have the property of flexibility. Splines are very dependent on their vertices. Truncated Spline is a segmented polynomial model that allows effective adaptation to the local characteristics of the data. So, Sifriyani et al. [15] developed the Geographically Weighted Spline Nonparametric Regression (GWSNR) model to solve spatial analysis problems where the regression curve is unknown, Then find test statistics for goodness of fit and validation of the GWSNR model [15].
Besides being influenced by spatial effects, food security index data has an unknown regression curve. So, the GWSNR model is suitable for use. This research aims to model provincial food security index data in Indonesia using the GWSNR model. Therefore, finding the optimal knot points and the best geographic weights is necessary. We propose optimal knot selection using Cross Validation (CV) and Generalized Cross Validation (GCV) methods. The optimal knot points will control the accuracy of the regression curve, and we also consider the MSE value in indicating the model's ability. Next, determine the best geographic weighting function, then estimate and test the significance of the model parameters. This research will determine regional groupings in Indonesia in 2022 based on significant predictors in modeling the national food security index numbers.
Method details
Nonparametric regression with a truncated spline approach
In nonparametric regression modeling, one of the approaches that can be used to estimate the regression curve is the truncated spline approach [16], (Sifriyani, et al., Spline and kernel mixe estimators in multivariable nonparametric regression for dengue hemorhagic fever model, 2023). The truncated spline in nonparametric regression is a model that has good statistical and visual interpretation [9,17], besides the truncated spline has high flexibility [18], (Sifriyani, D, M, A, & J, [17]). A truncated spline is a segmented polynomial model. These segmented polynomials provide greater flexibility than ordinary polynomials and this property allows the model to adapt more effectively to the local characteristics of the data [19]. Truncated splines of order with knot points where , with and is real konstanta. The function of nonparametric regression with a truncated spline approach presented in Eq. (1).
| (1) |
where is a real constant with is the unit of observation, is the number of predictor variables and is the number of knot points. Then the truncated function is given in Eq. (2).
| (2) |
Truncated splines have the following functional characteristics [20]:
-
1.
Function is a piece-wise polynomial of degree on interval .
-
2.
Function has a continuous derivative of degree
-
3.
is a step function with jump points .
The nonparametric regression model with a truncated spline approach [19] is given in Eq. (3).
| (3) |
where is the i th response variable, is the i th predictor variable, are real constant, are knot points, and are random errors assumed to be identical, independent, and normally distributed with zero mean and variance [8,15].
Geographically weighted regression
The Geographically Weighted Regression (GWR) model was first introduced by Fotheringham [2]. The GWR model is an extension of the classic linear regression model which was developed to model data with continuous response variables and consider spatial or location aspects. The response variable y in the GWR model is predicted by predictor variables, each of which has a regression coefficient depending on the location where the data is observed, so that each observation location has different regression parameter values [17,19]. GWR model of the relationship between the response variable and predictor variables at the i th location is given in Eq. (4).
| (4) |
is the value of the response variable at the point of the i th observation location, is the k-th predictor variable at the i-location observation, represents the geographic coordinates of the Longitude and Latitude of the i th observation location, is a constant/intercept of GWR, is the k-th parameter at the i th location associated with predictor variables and is the error at the i th location which is assumed to be independent, identical and normally distributed with zero mean and variance [[8], [20]].
Materials and model specifications
Constructing a geographically weighted spline nonparametric regression model
The Geographically Weighted Spline Nonparametric Regression (GWSNR) model is a development of nonparametric regression for spatial data with local parameter estimators for each observation location. The truncated spline approach is used to solve spatial analysis problems where the shape of the regression curve is unknown. In the regression model, we use the assumption that the errors are normally distributed with zero mean and variance at each location . The coordinate location is one of the important factors in determining the weights used to estimate the parameters of the model.
The form of the relationship between the response variables and the predictor variables at the -th location for the GWSNR regression model can be expressed mathematically in Eq. (5).
| (5) |
Eq. (5) is a GWSNR model of degree with areas. The components in Eq. (5) described as is the response variable at the i th location where , the variable is the -th predictor variable at the-th location where the symbol denotes the -th knot point on the component of the -th predictor variable where the symbol denotes the component parameter of the polynomial and defined as the k-th parameter of the -th predictor variable in the -th area. is the truncated component parameter of the truncated multivariable spline nonparametric regression model in GWR. is the -th parameter at the -th knot point and the -th predictor variable in the -th area [15].
Geographically weighted spine nonparametric regression model estimation process
The process of estimating the GWSNR model with the Maximum Likelihood Estimation (MLE) with weights is given as follows.
1. The probability density function for is shown in Eq. (6)
| (6) |
2. Generating the joint probability distribution function for the jth location is shown in Eq. (7)
| (7) |
3. Generate Likelihood Function is shown in Eq. (8)
| (8) |
4. Estimator , and can be obtained using the MLE method by solving the optimization in Eq. (9)
| (9) |
Estimator , and are completely shown in Theorem 1.
Theorem 1
Suppose the regression model Eq. (5) with normally distributed error with zero mean and variance and the weighted likelihood function is given by (8) then the MLE estimator for and are given by:
where
Regression curve estimator contains the polynomial component represented by the matrix and the truncated components represented by matrices . If , then the truncated multivariable spline nonparametric regression curve estimator in the GWR model will approach the polynomial parametric regression curve estimator in the GWR model.
Furthermore, if and matrix contains a linear function then the nonparametric multivariable spline truncated regression curve estimator in the GWR model will be a linear parametric regression curve estimator in the GWR model (multiple linear regression in the GWR model) developed by many researchers [[21], [22], [23], [24], [25], [26], [27], [28]].
Generalized cross validation (GCV) for optimal knot point selection
The knot point is the location on the axis where the regression function changes. The optimal knot point is obtained from the minimum GCV [29]. The GCV method formulation is given in Eq. (10).
| (10) |
Where,
Mean Square Error of geographically weighted spline nonparametric regression model
is a matrix which is obtained from the equation
: the identity matrics
: Knot points
: the number of obseravations.
Geographic weighting function
The geographic weighting function used is the Gaussian kernel function, the bisquare kernel function and the exponential function [30].
-
1.The Gaussian kernel function is stated in Eq. (11).
where is the geographic weighting, is the standard normal density function, is the distance from location-i to location-j and is the bandwidth value. The bandwidth value is a function smoothing parameter value whose value is always positive. shows the standard deviation of the distance vector .(11) -
2.The bisquare kernel function is stated in Eq. (12).
where is the Euclidean distance between location to location and b are the bandwidth values.(12) -
3.The exponential function is stated in Eq. (13).
is the distance from location-i to location-j and b is the bandwidth value or smoothing parameter whose value is always positive. In finding the optimum bandwidth value using the Cross Validation (CV) method which is mathematically defined in Eq. (14).(13)
where is the estimator on observations at location that were omitted from the estimation process. To get the optimum value, it is obtained from the value of the CV iteration results and the minimum CV value is taken. Additionally, The optimum bandwidth can be determined using GCV defined in Eq. (15).(14) (15) Where the sum of the main diagonal elements of the weight matrix. The optimum bandwidth is chosen by finding the smallest GCV. The smallest GCV is generated from the model that has the slightest error.
Geographically weighted spline nonparametric regression fitment test
The purpose of the statistical model fit test is to ascertain whether the Geographically Weighted Spline Nonparametric Regression model is more suitable for data analysis than the nonparametric global regression model [15,31]. The formulation of the GWSNR suitability test hypothesis is as follows:
-
(There is no significant difference between the GWSNR Model and the Global Model of truncated spline in multivariable nonparametric regression)
-
(There is a significant difference between the GWSNR Model and the Global Model of truncated spline in multivariable nonparametric regression)
The rejection area if or . The test statistic is given in Eq. (16).
| (16) |
where
is statistics test for geographically weighted spline nonparametric regression fitment test, is the response variable vector, n is the number of observation units, p is the number of predictor variables, m is the number of truncated spline orders and is the Identity matrix
Simultaneous GWSNR model parameter significance test
Simultaneous parameter significance test was carried out using the F test or analysis table of variance. The purpose of the test is to determine whether there is a simultaneous or simultaneous influence between the response variables on the predictor variable or not [8,15]. The formulation of the simultaneous test hypothesis is as follows:
The rejection area if or . The test statistical significance of simultaneous GWSNR model parameters is given in Eq. (17).
| (17) |
where
Partial parameter significance test
Formulation of the partial parameter test hypothesis of the model in GWSNR.
The rejection area if or . The test statistics [15,32] to be used is given in Eq. (18).
| (18) |
The component and is the ()-th diagonal element of matrix . If the significance level is , then the decision taken will be rejected H0 if the value
Research methods
Food security is a condition where food is met for all people and countries at all times, reflected in food that is nutritious, safe, high quality, diverse, nutritious, and affordable. The Food Security Index is a measure of several indicators used to produce a composite score of food security conditions in a region [1]. In this research, several predictor variables were used, which were thought to affect the food security index in 2022. The variables are followed as follows:
: Food Security Index
Rice Production
Red Chili Production
Shallot Production
Palm Oil Production
Beef Production
Production of chicken meat
Expenditure For Food
Percentage of Poor Population
Percentage of Population According to Inadequate Consumption
Percentage of Population with Food Insecurity
The data used in this research is secondary data. Food Security Index data can be accessed from the official website of the National Food Agency in a publication entitled Food Security Statistics for 2023. All predictors considered to influence the Food Security Index can be accessed from the websites of the Indonesian Central Statistics Agency and the National Food Agency. The research units used were 34 provinces in Indonesia. The Geographically Weighted Spline Nonparametric Regression Model procedures are spatial Mapping, descriptive statistical analysis, testing spatial effects, and GWSNR modeling. The stages in this research include:
-
1.
Spatial Mapping based on variable characteristics and observational data
-
2.
Descriptive Statistical Analysis.
-
3.
Testing the spatial effect with the Breusch-Pagan method.
-
4.
GWSNR modeling and determination of geographic weighting
-
5.
Calculate the Euclidean distance between observation locations based on geographic location
-
6.
Determine the optimum bandwidth based on the CV value of the Gaussian kernel function, the Bisquare kernel function and the Exponential kernel function for each observation location
-
7.
Selection of the optimum knot points using Cross Validation (CV) and Generalized Cross Validation (GCV) methods
-
8.
Estimation of GWSNR model parameters
-
9.
Testing the significance of the simultaneous parameters of the GWSNR model
-
10.
Testing the significance of the partial parameters of the GWSNR model
-
11.
Testing the suitability of the GWSNR model
-
12.
Area mapping based on significant variables.
-
13.
GWSNR Model Goodness and Accuracy Measures
-
14.
Interpretation of the model GWSNR. model significance test was carried out in three stages, namely Simultaneous model parameter test using analysis of variance or F test, Partial model parameter test using T test and Model suitability test aimed at testing differences in the GWSNR model and the Global Spline Nonparametric Regression model.
Spatial distribution of food security indices and predictor variables
The spatial distribution of each variable is shown in Fig. 1, Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6, Fig. 7, Fig. 8, Fig. 9, Fig. 10, Fig. 11. The value of each variable is displayed at the value intervals provided by the figure.
Fig. 1.
Spatial Distribution of Food security index in 2022
Fig. 2.
Spatial Distribution of Rice production in 2022
Fig. 3.
Spatial Distribution of Red chili production in 2022
Fig. 4.
Spatial Distribution of Shallot production
Fig. 5.
Spatial Distribution of Palm oil production
Fig. 6.
Spatial Distribution of Beef production
Fig. 7.
Spatial Distribution of Production of laying chicken meat
Fig. 8.
Spatial Distribution of Average monthly food expenditure per capita
Fig. 9.
Spatial Distribution of Percentage of poor population
Fig. 10.
Spatial Distribution of Percentage of population by status of inadequate food consumption
Fig. 11.
Spatial Distribution of Percentage of population with moderate or severe food insecurity, scale of experience of food security
Method validation
Descriptive statistics
Descriptive statistical data on research variables consist of average, minimum value, and maximum value. The results of descriptive statistical calculations are presented in Table 2.
Table 2.
Descriptive Statistics of Research Data.
| Variable | Minimum | Maximum | Mean |
|---|---|---|---|
| 35.48 | 83.82 | 72.43 | |
| 855 | 9789,588 | 1600,450 | |
| 1 | 343,067 | 40.017 | |
| 2 | 564,255 | 58,958.6 | |
| 1 | 8785,327 | 1378,073 | |
| 627 | 93,303 | 12,876 | |
| 10 | 38,874 | 4267.8 | |
| 453,031 | 923,933 | 634,229 | |
| 4,45 | 26,56 | 10,243 | |
| 1,78 | 37,37 | 11,324 | |
| 2,87 | 15,31 | 6,04 |
The average Food Security Index (IKP) in Indonesia is 72.43 %, with the lowest Food Security Index being 35.48 % and the highest being 83.82 %. The province with the highest IKP is Bali Province and the lowest IKP is in Papua Province. Descriptive statistics for the 2 variables are given in Table.
Spatial effect testing
The spatial heterogeneity test aims to determine whether there is a spatial effect on the food security index variable. If the test results detect a spatial effect, spatial modeling is the right step to use in the analysis of the food security index. The results of the spatial effect test are given in Table 3.
Table 3.
Breusch-Pagan Test.
| Breusch Pagan | Degrees of Freedom | p-value | Decision |
|---|---|---|---|
| 12.14 | 20 | 0.0091 | spatial heterogeneity effect |
Based on the results of the Breusch-Pagan test, it was found that p-value (0.0091) < (0.05) then decided to refuse with the understanding that there is a spatial heterogeneity effect on the food security index
Geographically weighted spline nonparametric regression modeling
Stages and results of Geographically Weighted Spline Nonparametric Regression Modeling starts with the process of estimating the model with the details of selecting geographic weights, determining the optimal knot point according to the observed data pattern used.
Geographic weighting selection
The steps for finding the GWSNR model begin with selecting the best geographic weighting. Geographical weighting is determined based on the optimal bandwidth value. The geographic weights used in this study are the Gaussian kernel function, the Bisquare kernel function and the Exponential kernel function. The following Table 4 results of calculations for the weighting function.
Table 4.
Geographic Weighting Function.
| Geographic Weighting Function | Bandwidth | CV | GCV |
|---|---|---|---|
| Kernel Gaussian Function | 19.34 | 1214.49 | 161.30 |
| Kernel Bisquare Function | 42.29 | 1255.29 | 166.72 |
| Kernel Exponential Function | 31.28 | 1215.06 | 164.02 |
Based on the results of the geographic weighting calculations shown in Table 4, the best weighting function is the Gaussian kernel function with a bandwidth of 19.34 and has the smallest CV value of 1.214,49 and the smallest GCV of 161.30.
Optimum knot point selection
Determination of the optimum knot point is obtained based on the minimum GCV value. The following is the minimum GCV value using 1 knot point, 2 knot point, and 3 knot point.
Based on Table 5, the best knot point is obtained by using one knot point with a minimum GCV value of 27.333. The optimum knot points with one knot point are as follows:
| = 98,742 | = 398.64 |
| = 3431.7 | = 457,740 |
| = 5644.5 | = 4.67 |
| = 87,854 | = 2.14 |
| = 1553.8 | = 2.99 |
Table 5.
Optimum Knot Point.
| Knot Point(s) | CV | GCV |
|---|---|---|
| 1 | 273.37 | 27.333 |
| 2 | 330.25 | 27.546 |
| 3 | 440.10 | 29.953 |
The knot point value is used for GWSNR modeling.
Geographically weighted spline nonparametric regression model estimation
The optimal knot point result obtained by the lowest GCV is at one knot point, so the general GWSNR model is given in Eq. (19).
| (19) |
The results of the GWSNR model parameters are inputted to each model. The modeling results obtained are 34 models according to 34 provinces of observational data. The GWSNR model for West Sumatra Province is given in Eq. (20), the GWSNR Model for East Java Province is given in Eq. (21), and the GWSNR Model for South Sulawesi Province is given in Eq. (22) below.
GWSNR model for West Sumatra Province
| (20) |
GWSNR model for East Java Province
| (21) |
GWSNR model for South Sulawesi Province
| (22) |
Comparison between estimator and variable data given in Fig. 12.
Fig. 12.
Distribution pattern of estimator and data variable
In Fig. 12 it shows that the results of the estimator approach the original data of the variable (Food Security Index).
Significance test of GWSNR model parameters
GWSNR model significance test was carried out in three stages, namely Simultaneous model parameter test using analysis of variance or F test, Partial model parameter test using T test and Model suitability test aimed at testing differences in the GWSNR model and the Global Spline Nonparametric Regression model.
Simultaneous significance test of GWSNR model parameters
The formula of the hypothesis of the simultaneous parameter significance test of the GWSNR model is
or
The results of the calculation of the simultaneous significance test of the GWSNR model are given in Table 6.
Table 6.
Analysis of Variance in Simultaneous Significance Test.
| Source of Diversity | Sum of Squares | Degree of Freedom | Middle Square | V* | p-value |
|---|---|---|---|---|---|
| Regression | 2962.30 | 33 | 89.77 | 7.03 | 4.68 x 10–5 |
| Residual | 217.11 | 17 | 12.77 | ||
| Total | 3179.40 | 50 |
The results of the analysis obtained values V* = 7.03 > = 3.36 or p-value (4.68 × 10–5) < (0.05). then decided to refuse . The results obtained predictor variables and simultaneously have a significant effect on the response variable .
Partial significance test of GWSNR model parameters
The formula of the hypothesis test of partial significance of GWSNR model parameters is
There is at least one or
This test uses a significance level = 0.05 and rejection criteria is rejected if p-value < . Based on the results of the partial parameter significance test, 10 group classifications based on significant variables were obtained in Table 7.
Table 7.
Significant Variables.
| Group Classification | Significant Variable(s) | Province |
|---|---|---|
| 1 | , , , , , , , , , and | Aceh, West Sumatera, Riau, and East Java |
| 2 | , , , , , , , , and | Kep. Riau, DKI Jakarta, West Java, and Bali |
| 3 | , , , , , , , and | Jambi, Central Java, South Sulawesi, West Sulawesi, Maluku, and Papua |
| 4 | , , , , , , and | North Sumatera, Bengkulu, Kep. Bangka Belitung, Central Kalimantan, and South Kalimantan |
| 5 | , , , , , and | West Nusa Tenggara Barat and West Kalimantan |
| 6 | , , , , , and | West Papua |
| 7 | , , , , and | North Sumatera, DI Yogyakarta, East Nusa Tenggara, North Kalimantan, North Sulawesi, Central Sulawesi, South-east Sulawesi, and North Maluku |
| 8 | , , , and | Lampung and Gorontalo |
| 9 | , , and | Banten |
| 10 | and | East Kalimantan |
The distribution map based on significant variables is given in Fig. 13.
Fig. 13.
Spatial distribution based on significant variables.
In Fig. 13. the areas that have been mapped based on groups of several predictor variables that are significant to food security index are on the same mapping because they have uniform characteristics. This is in accordance with the first law of geography by W Tobler.
GWSNR model fit test
The formulation of the GWSNR fit test hypothesis is
There is at least one or
The significance level used is = 0.05 and rejection criteria is rejected if p-value < . The results of statistical calculation of the model fit test are given in Table 8.
Table 8.
Model GWSNR Fit Test.
| Source of Diversity | Sum of Squares | Degree of Freedom | Middle Square | V | p-value |
|---|---|---|---|---|---|
| Regression | 56.26 | 13 | 4.33 | 0.34 | 0.0097 |
| Residual | 217.11 | 17 | 12.77 | ||
| Total | 273.37 | 30 |
Based on Table 8, we obtained that p-value (0.0097) < (0.05), then we decided to reject . The conclusion in this test is that the GWSNR model is more appropriate to use than the global model.
GWSNR model goodness-of-fit and accuracy measures
The measure of the goodness-of-fit and accuracy of the model used in this study is the coefficient of determination and Root Mean Square Error (RMSE) the results of which can be seen in Table 9 below.
Table 9.
Model Goodness-of-Fit and Accuracy Measures.
| Model | Value | RMSE |
|---|---|---|
| Nonparametric Regression | 24.02 | 4.28 |
| GWR | 92.78 | 3.41 |
| GWSNR | 95.16 | 2.57 |
The coefficient of determination for the GWSNR model is 95.16 % which indicates that the GWSNR model is for variables , , , , , , , and can explain the diversity of the Food Security Index of 34 provinces in Indonesia of 95.16 % with an RMSE value of 2.57.
GWSNR model interpretation
The interpretation of the GWR model for the province of West Sumatra is explained in the following description:
-
1.GWSNR model interpretation for variable (rice production) and other variables are considered constant. The effect of rice production on the Food Security Index can be interpreted in the Equation Model (23).
(23) The interpretation of this model is that when rice production is less than 98,742 tons, if rice production increases by 1 ton, the Food Security Index in Indonesia will increase by 1.44 10–4.
-
2.GWSNR model interpretation for variable (red chili production) and other variables are considered constant. The effect of red chili production on the Food Security Index can be interpreted in the Equation Model (24)
(24) The interpretation of this model is that when red chili production is less than 3431.7 tons, if red chili production increases by 1 ton, the Food Security Index in Indonesia will decrease by 3.77 10–3.
-
3.GWSNR model interpretation for variable (shallot production) and other variables are considered constant. The effect of shallot production on the Food Security Index can be interpreted in the Equation Model (25).
(25) The interpretation of this model is, when shallot production is less than 5.644,5 tons, then if shallot production increases by 1 ton, the Food Security Index in Indonesia will increase by .
-
4.GWSNR model interpretation for variable (oil palm production) and other variables are considered constant. The effect of palm oil production on the Food Security Index can be interpreted in the Equation Model (26).
(26) The interpretation of this model is when the palm oil production is less than 87,854 tons, if palm oil production increases by 1 ton, then the Food Security Index in Indonesia will decrease by 1.36 10–5.
-
5.GWSNR model interpretation for variable beef production and other variables are considered constant. The effect of beef production on the Food Security Index can be interpreted in the Equation Model (27).
(27) The interpretation of this model is that when beef production is less than 1553.8 tons, if beef production increases by 1 ton, the Food Security Index in Indonesia will decrease by 8.77 10–3.
-
6.GWSNR model interpretation for variable chicken meat production and other variables is considered constant. The effect of chicken meat production on the Food Security Index can be interpreted in the Equation Model (28).
(28) The interpretation of this model is, when chicken meat production is less than 398.64 tons, if chicken meat production increases by 1 ton, the Food Security Index in Indonesia will increase by 5.73 10–3.
-
7.GWSNR model interpretation for variable (expenditure on food) and other variables is considered constant. The effect of expenditure on food on the Food Security Index can be interpreted in the Equation Model (29).
(29) The interpretation of this model is, when expenditure on food is less than 457,740 rupiah, then if spending on food increases by 1 rupiah, then the Food Security Index in Indonesia will increase by 4.55 10–4.
-
8.GWSNR model interpretation for the variable the percentage of poor people and other variables is considered constant. The effect of the percentage of poor people on the Food Security Index can be interpreted in the Equation Model (30).
(30) The interpretation of this model is, when the percentage of poor people is less than 4.67 percent, then if the percentage of poor people increases by 1 percent, then the Food Security Index in Indonesia will decrease by 2.43.
-
9.GWSNR model interpretation for variable is the percentage of the population according to insufficient consumption status and other variables are considered constant. The effect of the percentage of the population according to the status of insufficient consumption on the Food Security Index can be interpreted in the Equation Model (31).
(31) The interpretation of this model is, when the percentage of the population according to the consumption insufficiency status is less than 2.14 percent, then if the percentage of the population according to the consumption insufficiency status increases by 1 percent, then the Food Security Index in Indonesia will increase by 1.58.
-
10.GWSNR model interpretation for the variable the percentage of the population with food insecurity and other variables is considered constant. The effect of the percentage of population with food insecurity on the Food Security Index can be interpreted in the Equation Model (32).
(32) The interpretation of this model is, when the percentage of the population with food insecurity is less than 2.99 percent, then if the percentage of the population with food insecurity increases by 1 percent, the Food Security Index in Indonesia will decrease by 33.46.
Conclusion
-
1.
Food security index data and predictor variables , , , , , , , and provinces in Indonesia Year 2022 has a spatial effect.
-
2.
The best GWSNR model is using the Gaussian kernel weighting function, while selecting the optimal knot point is one knot point based on the lowest CV and GCV values.
-
3.
The results of simultaneous and partial parameter tests are significant. Based on the results of the partial significance test, there are 10 area classifications based on significant predictor variables with different effects on each grouping classification results.
-
4.
The results of the model fit test stated that the GWSNR model was most appropriate for modeling food security index data.
-
5.
The GWSNR model goodness-of-fit is 95.16 % which indicates that the GWSNR model for variables , , , , , , , and can explain the diversity of the Food Security Index of 34 provinces in Indonesia of 92.78 % with an RMSE value of 3.41.
-
6.
Factors influencing the national food security index are rice production, red chili production, shallot production, palm oil production, beef production, laying hen meat production, average per capita food expenditure per month, percentage of poor people, percentage of population according to inadequate food consumption status, and percentage of population with food insecurity.
CRediT authorship contribution statement
Sifriyani: Software, Visualization, Data curation. I Nyoman Budiantara: Validation, Writing – review & editing, Supervision. Krishna Purnawan Candra: Software, Visualization, Data curation. Marisa Putri: Investigation, Resources, Writing – original draft, Project administration.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
Limitations
None.
Ethics statements
The dependent variable used in this study is the Food Security Index in Indonesia 2022. The predictor variables used in this study are Rice Production, Red Chili Production, Shallot Production, Palm Oil Production, Beef Production, Production of Laying Chicken Meat, Average Monthly Food Expenditure per capita, Percentage of Poor Population, Percentage of Population According to Food Consumption Insufficiency Status and Percentage of Population with Food Insecurity. The data was obtained through the publication of the Agricultural Data Center and Information System of the Secretariat General of the Ministry of Agriculture, 2(1) of 2022.
Acknowledgments
The authors gratefully acknowledge the funding of KEMENDIKBUD RISTEK Indonesia in 2024 [061/E5/PG.0200.PL/2024 and 622/UN/17.L1/HK/2024].
Supplementary material and/or additional information [OPTIONAL]
None.
Footnotes
Related research article: Sifriyani, I. N. Budiantara, S. H. Kartiko, and Gunardi, A new method of hypothesis test for truncated spline nonparametric regression influenced by spatial heterogeneity and application, Abstract and Applied Analysis. 2018 (2018). https://doi.org/10.1155/2018/9769150
For a published article: None
Data availability
Data will be made available on request.
References
- 1.pertanian P.d. Pertanian, Pusat Data dan Sistem Informasi Pertan. Sekr. Jenderal Kementerian; 2022. Statistik Ketahanan Pangan Tahun 2022 (Vol. 2) [Google Scholar]
- 2.F A.S., B C., C M. John Wiley & Sons Ltd; England: 2002. Geographically Weighted Regression: The Analysis of Spatialy Varying Relationships. [Google Scholar]
- 3.Brunsdon A., Fotheringham S., Charlton M.E. Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr. Anal. 1996;28(4):281–298. [Google Scholar]
- 4.Chasco, Garcia, & Vicens. (2007). Modeling Spatial Variations in Household Disposable income with Geographically Weighted Regression. Munich Personal RePEc Archive Paper, 1682.
- 5.Mennis J.L., Jordan L. The distribution of environmental equity: exploring spatial nonstationarity in multivariate models of air toxic releases. J. Ann. Assoc. Am. Geogr. 2005;95(2):249–268. [Google Scholar]
- 6.M S., C H. Determining of factors affecting the disaster resilience of countries by geographical weighted regression. Int. J. Disast. Risk Reduct. 2022;81(103311) [Google Scholar]
- 7.Sifriyani, Ruslan Analysis of The Causes of Flooding in Samarinda City Using Spatial Statistics: geographically Weighted Regression. AIP Conf. Proc. 2023;2554(1030002) [Google Scholar]
- 8.Sifriyani, Haryatmi, Budiantara I.N., Gunardi Geographically weighted regression with spline approach. Far East Journal of Mathematical Science. 2017;101(6):1183–1196. [Google Scholar]
- 9.Jiang F., Ma Y., Carroll R.J. A spline-assisted semiparametric approach to nonparametric measurement error models. Econom. Stat. 2023;2023 [Google Scholar]
- 10.K A., T L., G G., L D. Research in Transportation Economics Modelling the effect of spatial determinants on freight (trip) attraction: a spatially autoregressive geographically weighted regression approach. Res. Transport. Econ. 2022;99 [Google Scholar]
- 11.Wrenn D.H., Sam A.G. Geographically and temporally weighted likelihood regression: exploring the spatiotemporal determinants of land use change. J. Reg. Sci. Urban Econ. 2014;44(15):60–74. [Google Scholar]
- 12.Yu D. Exploring spatotemporally varying regressed relationships: the geographically weighted panel regression analysis. J. Int. Arch. Photogramm. 2010;38(6):134–139. [Google Scholar]
- 13.Zuhdi S., Saputro D.R., Widyaningsih P. Parameters estimation of geographically weighted ordinal logistic regression (GWOLR) model. J. Phys. Conf. Ser. 2017;855(1):1–5. [Google Scholar]
- 14.Sifriyani, Dani A.T., Fauziyah M., Budiantara I.N. Statistical modelling: a new regression curve approximation using mixed estimators truncated spline and epanechnikov kernel. Eng. Lett. 2023;31(4):1649–1655. [Google Scholar]
- 15.Sifriyani, Budiantara I.N., Kartiko S.H., Gunardi A new method of hypothesis test for trncated spline nonparametric regression influenced by spatial heterogeneity and application. Abstr. Appl. Analy. 2018;2018 [Google Scholar]
- 16.D K., K J.-Y. On spline estimators and prediction intervals in nonparametric regression. Comput. Stat. Data Anal. 2000;35(1):67–82. [Google Scholar]
- 17.Sifriyani D., M Y., M Z., A D., J S. Modeling of dengue hemorhagic fever cases in AWS Hospital Samarinda using bi-responses nonparametric regression with estimatprs spline truncated. Commun. Math. Biol. Neurosci. 2023;2023 [Google Scholar]
- 18.Sifriyani, H R. AIP Conf. Proc. 030004. 2023. Application of nonparametric truncated spline regression on infant mortality rate in Kalimantan. [Google Scholar]
- 19.Y L., P K.E. Spline nonparametric quasi-likelihood regression with in the frame ofthe accelerated failure time model. Comput. Stat. Data Anal. 2012;56(9):2675–2687. [Google Scholar]
- 20.Z C. Assessing the equivalence of nonparametric regression tests based on spline and local polynomial smoothers. J. Stat. Plan. Inference. 2004;126(1):73–95. [Google Scholar]
- 21.B F., N A. Households in potential economic distress: a geographically weighted regression model for Italy, 2001-2011. Spat. Stat. 2017;21(Part B):362–376. [Google Scholar]
- 22.Brunsdon, Fotheringham Some notes on parametric significance test for geographically weighted regression. J. Reg. Sci. 1999;39(3):497–524. [Google Scholar]
- 23.Demsar, Fotheringham, Charlton Exploring the spatio-temporal dynamics of geographical processes with geographically weighted regression and geovisual analytics. Inferen. Vis. 2008;7:181–197. [Google Scholar]
- 24.F A.S., C M., C R. Proceeding of the 9th International Conference on Geocomputation. 2007. Application of geographically weighted regression to a 19-year set of house price data in London to calivrate local hedonic price models. [Google Scholar]
- 25.H O.J., L C., G M.A. The spatial effect of alcohol availability on violence: a geographically weighted regression analysis. Appl. Geogr. 2021;150(102824) [Google Scholar]
- 26.Li Y., J Y., B J.A. Modeling spatially-varying ecological relationships using geographically weighted generalized linier model: a simulation study based on longline seabird bycatch. Fish. Res. 2016;181:14–24. [Google Scholar]
- 27.Lu B., Charlton M., Harris P., Fotheringham A.S. Geographically weighted regression with a non-euclidean distance metric: a case study using hedonic house price data. Int. J. Geograph. Inform. Sci. 2014;28(4):660–681. [Google Scholar]
- 28.W S.-S., Y H., G F., H R.-M. Spatial patterns and origins of heavy metals in Sheyang River catchment in Jiangsu, China based on geographically weighted regression. Sci. Tot. Environ. 2017;181:1518–1529. doi: 10.1016/j.scitotenv.2016.12.137. [DOI] [PubMed] [Google Scholar]
- 29.Sifriyani I., H, M Z. Application of nonparametric geographically weighted spline regression model for spatial mapping of open unemployment rate in Kalimantan. Journal of Physics: Conf. Series. 2021;2123 [Google Scholar]
- 30.Sifriyani H., S F., Ruslan Evaluatiom of forest productivity and governance of the preservation of tropical rain forest in Kalimantan using the NGWR-TS nonparametric geospatial method. EurAsian J. Biosci. 2019;13(2):2373–2379. [Google Scholar]
- 31.Sifriyani, Mandang I., A F.D., Ruslan Developing geographically weighted panel regression model for spatio-temporal analysis of covid-19 postove vases in Kalimantan, Indonesia. Journal of Southwest Jiatong University. 2022;57(3):112–136. [Google Scholar]
- 32.S A., Sifriyani, H M.N. regression nonparametric spline estimation on blood glucose of inpatients diabetes mellitus at Samarinda Hospital. Barekang: Jurnal Ilmu Matematika dan Terapan. 2023;17(1):0147–0154. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data will be made available on request.














