Kernel-based geographically and temporally weighted autoregressive model for house price estimation

Jooyong Shim; Changha Hwang

doi:10.1371/journal.pone.0205063

. 2018 Oct 11;13(10):e0205063. doi: 10.1371/journal.pone.0205063

Kernel-based geographically and temporally weighted autoregressive model for house price estimation

Jooyong Shim ¹, Changha Hwang ^2,^*

Editor: Fenghua Wen³

PMCID: PMC6182806 PMID: 30307975

Abstract

Spatiotemporal nonstationarity and autocorrelation are two crucial points in modeling geographical data. Previous studies have demonstrated that geographically and temporally weighted autoregressive (GTWAR) model accounts for both spatiotemporal nonstationarity and autocorrelation simultaneously to estimate house prices. Therefore, this paper proposes a kernel-based GTWAR (KBGTWAR) model by incorporating the basic principle of support vector machine regression into spatially and temporally varying coefficients model. The efficacy of KBGTWAR model is demonstrated through a case study on housing prices in the city of Shenzhen, China, from year 2004 to 2008. Comparing the existing models, KBGTWAR model obtains the lowest value for the residual sum of squares (RSS) and the highest value for the coefficient of determination R². Moreover, KBGTWAR model improves the goodness of fit of the existing GTWAR model from 12.0 to 4.5 in terms of RSS, from 0.914 to 0.968 in terms of R² and from 3.84 to 4.45 in terms of F-statistic. The results show that KBGTWAR model provides a comparatively high goodness of fit and sufficient explanatory power for both spatiotemporal nonstationarity and autocorrelation. The results of this study demonstrate that the proposed KBGTWAR model can be used to effectively formulate polices for real estate management.

Introduction

Analysis of relationships between an output variable and input variables in spatiotemporal fields has recently attracted attention in a data analysis community. Spatiotemporal models occur when data are obtained across time as well as space. Spatial or temporal autocorrelation exists between the observations. In addition, spatial or temporal nonstationarity arises in the relationships. Ordinary regression models have neglected these issues that violate the standard Gauss-Markov assumptions. Spatial econometrics models have been proposed to deal with these issues. Spatiotemporal regression models can contribute to better understanding complex phenomena studied in geographical information science and other fields.

Spatial models not only can account for spatial autocorrelation but also can deal with spatial lag dependence or/and spatial error dependence. There are two approaches for handling spatial autocorrelation. The first is the weight matrix approach, which uses a spatial weight matrix to deal with the spatial relationships between observations. This approach is based on the work of [1]. See for details [2] and [3]. The second is the geostatistical approach, which directly models the covariance matrix of the error terms. This approach is based on the work of [4]. See for details [5] and [6]. Moreover, [3] proposed a spatiotemporal autoregressive (STAR) model to incorporate temporal correlations between observations and found it powerful in the context of residential real estate. [7] developed a two-order STAR model with a Bayesian procedure to effectively detect and correct heteroscedasticity among the residuals.

The spatial models aforementioned have made remarkable contributions to effectively applying spatial and temporal process to regression modeling. However, these models assumed that a global level relationship exists across the study area. By the way, the assumption of stationary or structural stability over space and time is often impractical, because parameters tend to change depending on the study area. To deal with spatial heterogeneity in housing markets, a variety of localized modeling techniques have been proposed. As a result, [8] and [9] demonstrated the usefulness of locally weighted regression in modeling nonmonocentric cities. [10], [11] and [12] proposed geographically weighted regression (GWR), which has grown in popularity amongst real estate modeling methods. GWR model is a local spatial model for exploring spatial nonstationarity. To capture both spatial nonstationarity and spatial autocorrelation in a complex process, [13] proposed a GWR with spatially lagged variables to utilize the spatial variations. But this model did not consider temporal information. To model the effects of temporal heterogeneity, [14] and [15] proposed a geographically and temporally weighted regression (GTWR) to simultaneously capture both spatial and temporal nonstationarity by integrating temporal effects into the traditional GWR model. [16] developed a GTWR model based on travel time distance metrics. But these GTWR models do not consider spatial autocorrelation.

[17] asserted that it is better to deal with both spatial and temporal heterogeneity along with spatial autocorrelation effects in a mixed model. It is because spatial heterogeneity and autocorrelation are generally related in the context of modeling although the two problems are theoretically distinguished. However, it is not easy to build an integrated model with spatiotemporal autocorrelated and heterogeneous effects. [18] proposed a geographically and temporally weighted autoregressive (GTWAR) model to deal with spatiotemporal variations. In this paper we propose an efficient kernel-based GTWAR (KBGTWAR) model by incorporating the basic principle of support vector machine (SVM) regression into spatially and temporally varying coefficients model. SVM, first developed by [19] and his group at AT&T Bell Laboratories, has been successfully applied to a number of real world problems related to classification and regression problems.

The rest of this paper is organized as follows. Section 2 briefly describes the basic principle of GTWR and GTWAR models. Section 3 proposes an efficient KBGTWAR model. Section 4 and section 5 present case study and conclusion, respectively.

GTWAR model

In this section we briefly review the basic framework for GTWR and GTWAR models. We also review how to determine the adjustment parameters related with these two models.

GTWR model

The GWR model is a spatially varying coefficient regression approach for exploring spatial nonstationarity of a regression relationship for spatial data [10, 11, 12]. The GWR model extends the traditional linear regression model by allowing local rather than global parameters to be estimated. [14] developed a GTWR model to deal with spatial and temporal nonstationarity simultaneously by integrating temporal effects into the GWR model. [15] developed a GTWR model focusing on spatiotemporal kernel function definition and spatiotemporal bandwidth optimization. [16] developed a GTWR model based on non-Euclidean travel distance. The GTWR model can be expressed as follows:

\begin{matrix} y_{i} = β_{0} (u_{i}, v_{i}, t_{i}) + \sum_{k = 1}^{d} β_{k} (u_{i}, v_{i}, t_{i}) x_{k i} + ϵ_{i}, i = 1, \dots, n, \end{matrix}

(1)

where (u_i, v_i, t_i) is the coordinate of the observation i in space (u_i, v_i) at time t_i, β₀(u_i, v_i, t_i) indicates the intercept value, β_k(u_i, v_i, t_i) indicates the slope for each variable k and each space-time point i, and ϵ_i represents the random error with no correlation between different points. Here, local parameters β₀(u_i, v_i, t_i) and β_k(u_i, v_i, t_i) are continuous functions of the point (u_i, v_i, t_i). We notice that the GTWR model (1) is a spatially and temporally varying coefficient model.

For a given data set, these local parameters are estimated using the weighted least square procedure. The relevant weights W_ij, for j = 1, ⋯, n, indicate the proximity of each data point to the point (u_i, v_i, t_i). Let

\begin{matrix} β (u_{i}, v_{i}, t_{i}) = {(β_{0} (u_{1}, v_{1}, t_{1}), β_{1} (u_{1}, v_{1}, t_{1}), \dots, β_{d} (u_{1}, v_{1}, t_{1}))}^{t} \end{matrix}

(2)

be the vector of the local parameters for the space-time point i. Here, the superscript t represents the transpose of vector or matrix. Then, β(u_i, v_i, t_i) is estimated by

\begin{matrix} \hat{β} (u_{i}, v_{i}, t_{i}) = {(X^{t} W (u_{i}, v_{i}, t_{i}) X)}^{- 1} X^{t} W (u_{i}, v_{i}, t_{i}) y, \end{matrix}

(3)

where X is the n × (d + 1) matrix of input variables, y is the n-dimensional vector of output variable, and W(u_i, v_i, t_i) is an n × n weighting matrix of the form

\begin{matrix} W (u_{i}, v_{i}, t_{i}) = diag {W_{i 1}, W_{i 2}, \dots, W_{i n}} . \end{matrix}

(4)

In addition, the fitted value $\hat{y}$ is obtained as follows:

\begin{matrix} \hat{y} = (\begin{matrix} {\hat{y}}_{1} \\ {\hat{y}}_{2} \\ ⋮ \\ {\hat{y}}_{n} \end{matrix}) = (\begin{matrix} x_{1}^{a} {(X^{t} W (u_{1}, v_{1}, t_{1}) X)}^{- 1} X^{t} W (u_{1}, v_{1}, t_{1}) \\ x_{2}^{a} {(X^{t} W (u_{2}, v_{2}, t_{2}) X)}^{- 1} X^{t} W (u_{2}, v_{2}, t_{2}) \\ ⋮ \\ x_{n}^{a} {(X^{t} W (u_{n}, v_{n}, t_{n}) X)}^{- 1} X^{t} W (u_{n}, v_{n}, t_{n}) \end{matrix}) y, \end{matrix}

(5)

where $x_{i}^{a}$ is the ith row of the matrix X such that $x_{i}^{a} = (1, x_{1 i}, \dots, x_{d i})$ .

The weights W_ij are usually obtained through an adaptive kernel function. The adaptive kernel function attempts to adjust for the density of data points. This adaptive kernel function could use the same number of observed points in each local neighborhood set. The most commonly used adaptive kernel function is the Gaussian function

\begin{matrix} W_{i j} = exp (- \frac{d_{i j}^{2}}{h^{2}}), \end{matrix}

(6)

where d_ij is a spatiotemporal distance between points i and j and h is a nonnegative parameter known as bandwidth, which produces a decay of influence with distance. When the spatial and temporal distances between points i and j are given by ${(d_{i j}^{S})}^{2} = {(u_{i} - u_{j})}^{2} + {(v_{i} - v_{j})}^{2}, {(d_{i j}^{T})}^{2} = {(t_{i} - t_{j})}^{2}$ , we can construct the spatiotemporal distance as as a linear combination between ${(d_{i j}^{S})}^{2}$ and ${(d_{i j}^{T})}^{2}$ as follows:

\begin{matrix} {(d_{i j}^{S T})}^{2} = μ^{S} [{(u_{i} - u_{j})}^{2} + {(v_{i} - v_{j})}^{2}] + μ^{T} {(t_{i} - t_{j})}^{2}, \end{matrix}

(7)

where μ^S represents the scale factor of spatial distance and μ^T represents the scale factor of temporal distance. Thus, the weights W_ij can be expressed as

\begin{matrix} W_{i j} & = exp {- (\frac{μ^{S} [{(u_{i} - u_{j})}^{2} + {(v_{i} - v_{j})}^{2}] + μ^{T} {(t_{i} - t_{j})}^{2}}{h_{S T}^{2}})} \\ = exp {- (\frac{{(u_{i} - u_{j})}^{2} + {(v_{i} - v_{j})}^{2}}{h_{S}^{2}} + \frac{{(t_{i} - t_{j})}^{2}}{h_{T}^{2}})} \\ = exp {- (\frac{{(d_{i j}^{S})}^{2}}{h_{S}^{2}} + \frac{{(d_{i j}^{T})}^{2}}{h_{T}^{2}})} \end{matrix}

(8)

\begin{matrix} = & W_{i j}^{S} \times W_{i j}^{T}, \end{matrix}

(9)

where $h_{S T}^{2}, h_{S}^{2} = h_{S T}^{2} / μ^{S}$ and $h_{T}^{2} = h_{S T}^{2} / μ^{T}$ are the parameters of spatiotemporal, spatial, and temporal bandwidths, respectively. Thus, if the spatial and temporal bandwidths are determined, the weight matrix $W (u_{i}, v_{i}, t_{i})$ and $\hat{β} (u_{i}, v_{i}, t_{i})$ can be obtained.

[18] developed the improved GTWR (IGTWR) with the following spatiotemporal distance

\begin{matrix} {\begin{matrix} d_{i j}^{S T} = μ^{S} d_{i j}^{S} + μ^{T} d_{i j}^{T} + 2 \sqrt{μ^{S} μ^{T} d_{i j}^{S} d_{i j}^{T}} cos (ν), & t_{j} < t_{i} \\ d_{i j}^{S T} = \infty, & t_{j} > t_{i} \end{matrix}, \end{matrix}

(10)

where ν ∈ [0, π]. The adjustment parameters μ^S, μ^T and ν should be determined.

GTWAR model

To account for both spatiotemporal heterogeneity and spatial autocorrelation effects simultaneously, [18] developed a GTWAR, which combines the IGTWR model with the autocorrelation regression model. Spatial autocorrelation is considered by introducing the spatial lag $\sum_{j = 1}^{n} {\bar{W}}_{i j} y_{j}$ in a linear regression relationship [20].

\begin{matrix} y_{i} = ρ_{i} \sum_{j = 1}^{n} {\bar{W}}_{i j} y_{j} + ϵ_{i}, i = 1, 2, \dots, n, \end{matrix}

(11)

where ρ_i represents a spatial autoregressive parameter varying across geographical locations, ϵ_i denotes the random error in the relationship, and ${\bar{W}}_{i j}$ is the element in the ith row and jth column of the n × n spatial weight matrix $\bar{W}$ with ${\bar{W}}_{i i} = 0$ . The elements ${\bar{W}}_{i j}$ are typically row-normalized, such that for each i, $\sum_{j = 1}^{n} {\bar{W}}_{i j} = 1$ . Consequently, the spatial lag may be interpreted as a weighted average of the neighbors. We notice that the model (11) is a locally based autoregressive model. Incorporating (11) into the IGTWR model, [18] proposed the following GTWAR model:

\begin{matrix} y_{i} = β_{0} (u_{i}, v_{i}, t_{i}) + ρ (u_{i}, v_{i}, t_{i}) \sum_{j = 1}^{n} {\bar{W}}_{i j} y_{j} + \sum_{k = 1}^{d} β_{k} (u_{i}, v_{i}, t_{i}) x_{k i} + ϵ_{i}, i = 1, \dots, n, \end{matrix}

(12)

where ρ(u_i, v_i, t_i) is a scalar spatiotemporal autoregressive parameter at point i.

Model selection

Parameter estimation in GTWR and GTWAR is highly dependent on the adjustment parameters μ^S, μ^T and/or ν associated with the weighting function used. The selection of the adjustment parameters for GTWR and GTWAR can be determined using a cross validation (CV) approach or the corrected Akaike information criterion (AIC) from [12]. Since the adaptive kernel function generally uses the same number q of the nearest observed points, there is one more adjustment parameter. To obtain an optimal value of q, the CV or corrected AIC approach could be used. [14] argue that only the parameter ratio τ = μ^T/μ^S plays an important role in constructing weights. Hence, they set μ^S = 1 to reduce the number of adjustment parameters, and so only three parameters, q, μ^T and ν.

The CV function is defined as

\begin{matrix} C V (λ) = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}^{(- i)} (λ))}^{2}, \end{matrix}

(13)

where λ is the vector of adjustment parameters associated with the GTWR or GTWAR model and ${\hat{y}}^{(- i)} (λ)$ is the fitted value of y_i with the observation i omitted from the calibration process. The corrected AIC function is defined according to [21] as follows:

\begin{matrix} A I C_{c} (λ) = n log (\frac{R S S (λ)}{n}) + n log (2 π) + n (\frac{n + tr (H (λ))}{n - 2 - tr (H (λ))}), \end{matrix}

(14)

where n is the number of data points in data set, RSS is the residual sum of squares defined as $R S S (λ) = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}$ , and tr(H(λ)) is the trace of the hat matrix H(λ) associated with the GTWR or GTWAR model with the adjustment parameter vector λ, which satisfies the relationship $\hat{y} = H (λ) y$ . See (5) for the hat matrix related with the GTWR model and [18] for the hat matrix related with the GTWAR model. Here log denotes the natural logarithm. The adjustment parameters are achieved automatically with an optimization technique by minimizing the CV function (13) or the corrected AIC function (14).

KBGTWAR model

In this section we review SVM regression and illustrate how to develop KBGTWAR model. The underlying idea of KBGTWAR model is that the true mean specification is approximated by combining linear SVM regression with nonlinear feature mapping function of the coordinate vector (u_i, v_i, t_i) of the observed point i.

SVM regression

The foundations of SVM have been originally proposed by [19] and are gaining popularity due to many attractive features, and promising empirical performance. We now briefly review SVM regression. See for details [22]. Suppose we are given the data set $D = {(x_{i}, y_{i})}_{i = 1}^{n}$ with each covariate vector $x_{i} \in R^{d}$ and the output $y_{i} \in R$ . We basically illustrate the case of the linear SVM regression, taking the form

\begin{matrix} f (x) = w^{t} x + b, \end{matrix}

(15)

where b is the bias term. The goal of the linear SVM regression is to find a linear regression function $f (x)$ that approximates all pairs $(x_{i}, y_{i})$ with ϵ precision and is simultaneously as flat as possible. Flatness means that we seek a small w. One way to guarantee this is to minimize the norm ${∥ w ∥}^{2}$ . To make it feasible, we introduce slack variables $ξ_{i}, ξ_{i}^{*}$ representing upper and lower constraints on the outputs. This leads to the convex optimization problem

\begin{matrix} min_{w, b, ξ, ξ^{*}} J = \frac{1}{2} {∥ w ∥}^{2} + C \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{*}) \\ subject to {\begin{matrix} y_{i} - w^{t} x_{i} - b \leq ϵ + ξ_{i} \\ w^{t} x_{i} + b - y_{i} \leq ϵ + ξ_{i}^{*} \\ ξ_{i}, ξ_{i}^{*} \geq 0 \end{matrix} . \end{matrix}

(16)

The regularization parameter C > 0 determines the trade-off between the flatness of f and the fitting errors.

The key idea is to construct the primal Lagrange function

\begin{matrix} L = J & - & \sum_{i = 1}^{n} α_{i} (ϵ + ξ_{i} - y_{i} + w^{t} x_{i} + b) \\ - & \sum_{i = 1}^{n} α_{i}^{*} (ϵ + ξ_{i}^{*} + y_{i} - w^{t} x_{i} - b) - \sum_{i = 1}^{n} (η_{i} ξ_{i} + η_{i}^{*} ξ_{i}^{*}), \end{matrix}

(17)

where $α_{i}, α_{i}^{*}, η_{i}, η_{i}^{*} \geq 0$ are Lagrange multipliers. The conditions for optimality are given by

\begin{matrix} \frac{\partial L}{\partial w} = 0 & \to & w = \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) x_{i} \end{matrix}

(18)

\begin{matrix} \frac{\partial L}{\partial b} = 0 & \to & \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) = 0 \end{matrix}

(19)

\begin{matrix} \frac{\partial L}{\partial ξ_{i}^{(*)}} = 0 & \to & C - α_{i}^{(*)} - η_{i}^{(*)} = 0, i = 1, \dots, n \end{matrix}

(20)

Here, we refer to α_i and $α_{i}^{*}$ by $α_{i}^{(*)}$ .

Substituting (18), (19) and (20) into (17) yields the dual optimization problem.

\begin{matrix} min_{α_{i}, α_{i}^{*}} \frac{1}{2} \sum_{i, j = 1}^{n} (α_{i} - α_{i}^{*}) (α_{j} - α_{j}^{*}) x_{i}^{t} x_{j} + ϵ \sum_{i = 1}^{n} (α_{i} + α_{i}^{*}) - \sum_{i = 1}^{n} y_{i} (α_{i} - α_{i}^{*}) \\ subject to \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) = 0 and α_{i}, α_{i}^{*} \in [0, C] \end{matrix}

(21)

Solving the above optimization problem determines the Lagrange multipliers ${\hat{α}}_{i}, {\hat{α}}_{i}^{*}$ . Thus the optimal regression function can be rewritten as follows:

\begin{matrix} \hat{f} (x) = \sum_{i = 1}^{n} ({\hat{α}}_{i} - {\hat{α}}_{i}^{*}) x_{i}^{t} x + \hat{b} . \end{matrix}

(22)

Here, the optimal value of b is obtained by employing Karush-Kuhn-Tucker [23] conditions as follows:

\begin{matrix} \hat{b} = \frac{1}{n_{s}} \sum_{i \in I_{sv}} (y_{i} - \sum_{j = 1}^{n} ({\hat{α}}_{j} - {\hat{α}}_{j}^{*}) x_{j}^{t} x_{i} - ϵ \times sign ({\hat{α}}_{i} - {\hat{α}}_{i}^{*})), \end{matrix}

(23)

where n_s is the size of $I_{sv} = {i = 1, 2, \dots, n : 0 < | α_{i} - α_{i}^{*} | < C}$ .

We now consider how to make the linear SVM regression algorithm nonlinear. This could be achieved by simply preprocessing the the input vectors x_i by a nonlinear feature mapping function $ϕ : R^{d} \to F$ into some feature space $F$ , and then applying the linear SVM regression algorithm. Thus, we only need to use the kernel trick K(x_i, x_j) = ϕ(x_i)^t ϕ(x_j) in the Eqs (21), (22) and (23) associated with the linear SVM regression algorithm [24]. We never need to know explicitly what ϕ is. The most popular kernel is Gaussian kernel defined by

\begin{matrix} K (x_{i}, x_{j}) = exp (- ∥ x_{i} - x_{j} ∥^{2} / 2 κ), i, j = 1, \dots, n, \end{matrix}

(24)

where κ > 0 is the kernel parameter.

KBGTWAR model

Given the data set $D = {(u_{i}, x_{i}, y_{i})}_{i = 1}^{n}$ with each coordinate vector u_i = (u_i, v_i, t_i), covariate vector $x_{i} \in R^{d}$ and the output $y_{i} \in R$ , we consider the following GTWAR model reexpressed from (12):

\begin{matrix} y_{i} = ρ (u_{i}) \sum_{j = 1}^{n} {\bar{W}}_{i j} y_{j} + \sum_{k = 0}^{d} β_{k} (u_{i}) x_{k i} + ϵ_{i}, i = 1, \dots, n, \end{matrix}

(25)

where x_0i = 1 and x_ki is the kth component of x_i = (x_1i, ⋯, x_di)^t for k = 1, ⋯, d. For simplicity, we use the spatiotemporal weights matrix $\bar{W}$ constructed from q-nearest neighbors based on the following modified spatiotemporal distance

\begin{matrix} {\begin{matrix} d_{i j}^{S T} = \sqrt{{(d_{i j}^{S})}^{2} + {(d_{i j}^{T})}^{2}}, & t_{j} < t_{i} \\ d_{i j}^{S T} = \infty, & t_{j} > t_{i} \end{matrix} . \end{matrix}

(26)

When the observation j is one of q nearest neighbors of the observation i, ${\bar{W}}_{i j} = 1 / q$ , otherwise ${\bar{W}}_{i j} = 0$ . The diagonal elements are ${\bar{W}}_{i i} = 0$ . The spatiotemporal weights matrix $\bar{W}$ is row-normalized.

To develop KBGTWAR model, we apply the basic principle of the linear SVM regression to the GTWAR model (12) after preprocessing the coordinate vectors u_i by a nonlinear feature mapping function ϕ into some feature space $F$ . Thus, we first assume that ρ(u_i) and β_k(u_i) for k = 0, 1, ⋯, d are nonlinearly related to the coordinate vector u_i such that ρ(u_i) = w^t ϕ (u_i) + b ∈ [0, 1] or [−1, 1], $β_{k} (u_{i}) = w_{k}^{t} ϕ (u_{i}) + b_{k}$ , where w and w_k are the weight vectors of dimension d_h corresponding to ϕ(u_i). Here, ϕ is defined in an implicit way. An inner product in feature space has an equivalent kernel such that K(u_i, u_j) = ϕ (u_i)^t ϕ (u_j), provided certain conditions hold [24]. Several choices of the kernel function are possible. As mentioned before, Gaussian kernel is most widely used. Thus, we focus on the choice of an Gaussian kernel (24) in the sequel.

Using the basic idea of the linear SVM regression, we define the convex optimization problem:

\begin{matrix} min_{w, w_{k}, b, b_{k}, ξ, ξ^{*}} J = \frac{1}{2} {∥ w ∥}^{2} + \frac{1}{2} \sum_{k = 0}^{d} {∥ w_{k} ∥}^{2} + C \sum_{i = 1}^{n} (ξ_{i} + ξ_{i}^{*}) \\ subject to {\begin{matrix} y_{i} - (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y - \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k}) \leq ξ_{i} \\ (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y + \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k}) - y_{i} \leq ξ_{i}^{*} \\ ξ_{i}, ξ_{i}^{*} \geq 0 \\ 0 \leq w^{t} ϕ (u_{i}) + b \leq 1 if ρ \in [0, 1] \\ or - 1 \leq w^{t} ϕ (u_{i}) + b \leq 1 if ρ \in [- 1, 1] \end{matrix}, \end{matrix}

(27)

where the constant C > 0 is the regularization parameter and we use the notation ${\bar{W}}_{i}$ to refer to the ith row of matrix $\bar{W}$ . For simplicity we set up the size ϵ of the insensitive zone to zero.

Now we are going to construct the primal Lagrange function for the case where ρ ∈ [0, 1] or ρ ∈ [−1, 1]. For the case of ρ ∈ [0, 1], the Lagrange function is constructed as follows:

\begin{matrix} L = J & - & \sum_{i = 1}^{n} α_{i} (ξ_{i} - y_{i} + (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y + \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k})) \\ - & \sum_{i = 1}^{n} α_{i}^{*} (ξ_{i}^{*} + y_{i} - (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y - \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k})) \\ - & \sum_{i = 1}^{n} η_{i} ξ_{i} - \sum_{i = 1}^{n} η_{i}^{*} ξ_{i}^{*} - \sum_{i = 1}^{n} ν_{i} (w^{t} ϕ (u_{i}) + b) - \sum_{i = 1}^{n} ν_{i}^{*} (1 - w^{t} ϕ (u_{i}) - b), \end{matrix}

(28)

where $α_{i}, α_{i}^{*}, η_{i}, η_{i}^{*}, ν_{i}, ν_{i}^{*}$ are the Lagrange multipliers.

For the case of ρ ∈ [−1, 1], the Lagrange function is constructed as follows:

\begin{matrix} L = J & - & \sum_{i = 1}^{n} α_{i} (ξ_{i} - y_{i} + (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y + \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k})) \\ - & \sum_{i = 1}^{n} α_{i}^{*} (ξ_{i}^{*} + y_{i} - (w^{t} ϕ (u_{i}) + b) {\bar{W}}_{i} y - \sum_{k = 0}^{d} x_{k i} (w_{k}^{t} ϕ (u_{i}) + b_{k})) \\ - & \sum_{i = 1}^{n} η_{i} ξ_{i} - \sum_{i = 1}^{n} η_{i}^{*} ξ_{i}^{*} - \sum_{i = 1}^{n} ν_{i} (1 + w^{t} ϕ (u_{i}) + b) - \sum_{i = 1}^{n} ν_{i}^{*} (1 - w^{t} ϕ (u_{i}) - b), \end{matrix}

(29)

where $α_{i}, α_{i}^{*}, η_{i}, η_{i}^{*}, ν_{i}, ν_{i}^{*}$ are the Lagrange multipliers.

It is understood that the dual variables in (28) and (29) have to satisfy positivity constraints, i.e., $α_{i}, α_{i}^{*}, η_{i}, η_{i}^{*}, ν_{i}, ν_{i}^{*} \geq 0$ . It follows from the saddle point condition that the partial derivatives of $L$ with respect to the primal variables $(w, w_{k}, b, b_{k}, ξ_{i}, ξ_{i}^{*})$ have to vanish for optimality.

\begin{matrix} \frac{\partial L}{\partial w} = 0 & \to & w = \sum_{i = 1}^{n} ((α_{i} - α_{i}^{*}) {\bar{W}}_{i} y + ν_{i} - ν_{i}^{*}) ϕ (u_{i}) \end{matrix}

(30)

\begin{matrix} \frac{\partial L}{\partial w_{k}} = 0 & \to & w_{k} = \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) x_{k i} ϕ (u_{i}), k = 0, 1, \dots, d \end{matrix}

(31)

\begin{matrix} \frac{\partial L}{\partial b} = 0 & \to & \sum_{i = 1}^{n} ((α_{i} - α_{i}^{*}) {\bar{W}}_{i} y + (ν_{i} - ν_{i}^{*})) = 0 \end{matrix}

(32)

\begin{matrix} \frac{\partial L}{\partial b_{k}} = 0 & \to & \sum_{i = 1}^{n} (α_{i} - α_{i}^{*}) x_{k i} = 0, k = 1, \dots, d \end{matrix}

(33)

\begin{matrix} \frac{\partial L}{\partial ξ_{i}} = 0 & \to & C - α_{i} - η_{i} = 0, i = 1, \dots, n \end{matrix}

(34)

\begin{matrix} \frac{\partial L}{\partial ξ_{i}^{*}} = 0 & \to & C - α_{i}^{*} - η_{i}^{*} = 0, i = 1, \dots, n \end{matrix}

(35)

Classical Lagrangian duality enables the primal problem to be transformed to their dual problem. Substituting (30), (31), (32), (33), (34) and (35) into (28) and (29) yields two dual optimization problems, respectively. For the case of ρ ∈ [0, 1], the dual optimization problem is obtained as follows:

\begin{matrix} min_{α_{i}^{(*)}, ν_{i}^{(*)}} \frac{1}{2} \sum_{i, j} ({\bar{W}}_{i} y (α_{i} - α_{i}^{*}) + ν_{i} - ν_{i}^{*}) K_{i j} ({\bar{W}}_{j} y (α_{j} - α_{j}^{*}) + ν_{j} - ν_{j}^{*}) \\ + \frac{1}{2} \sum_{i, j} (α_{i} - α_{i}^{*}) x_{k i} K_{i j} x_{k j} (α_{j} - α_{j}^{*}) - \sum_{i = 1}^{n} y_{i} (α_{i} - α_{i}^{*}) + \sum_{i = 1}^{n} ν_{i}^{*} \\ subject to {\begin{matrix} 0 < α_{i} < C, 0 < α_{i}^{*} < C \\ ν_{i} \geq 0, ν_{i}^{*} \geq 0 \end{matrix} \end{matrix}

(36)

For the case of ρ ∈ [−1, 1], the dual optimization problem is obtained as follows:

\begin{matrix} min_{α_{i}^{(*)}, ν_{i}^{(*)}} \frac{1}{2} \sum_{i, j} ({\bar{W}}_{i} y (α_{i} - α_{i}^{*}) + ν_{i} - ν_{i}^{*}) K_{i j} ({\bar{W}}_{j} y (α_{j} - α_{j}^{*}) + ν_{j} - ν_{j}^{*}) \\ + \frac{1}{2} \sum_{i, j} (α_{i} - α_{i}^{*}) x_{k i} K_{i j} x_{k j} (α_{j} - α_{j}^{*}) - \sum_{i = 1}^{n} y_{i} (α_{i} - α_{i}^{*}) + \sum_{i = 1}^{n} (ν_{i} + ν_{i}^{*}) \\ subject to {\begin{matrix} 0 < α_{i} < C, 0 < α_{i}^{*} < C \\ ν_{i} \geq 0, ν_{i}^{*} \geq 0 \end{matrix} \end{matrix}

(37)

Solving the above quadratic programming (QP) problem (36) or (37) with the constraints determines the optimal Lagrange multipliers ${\hat{α}}_{i}, {\hat{α}}_{i}^{*}$ and ${\hat{ν}}_{i}, {\hat{ν}}_{i}^{*}$ . Thus, the estimated weight vectors $\hat{w}$ and ${\hat{w}}_{k}$ are obtained, respectively as follows:

\begin{matrix} \hat{w} = \sum_{i = 1}^{n} (({\hat{α}}_{i} - {\hat{α}}_{i}^{*}) {\bar{W}}_{i} y + {\hat{ν}}_{i} - {\hat{ν}}_{i}^{*}) ϕ (u_{i}) \end{matrix}

(38)

\begin{matrix} {\hat{w}}_{k} = \sum_{i = 1}^{n} ({\hat{α}}_{i} - {\hat{α}}_{i}^{*}) x_{k i} ϕ (u_{i}), k = 0, 1, \dots, d \end{matrix}

(39)

Thus, for a point u_i associated with the training data set $D$ , ρ(u_i) and β_k(u_i) are obtained, respectively as follows:

\begin{matrix} \hat{ρ} (u_{i}) = (D i a g {\bar{W} y} (\hat{α} - {\hat{α}}^{*}) + \hat{ν} - {\hat{ν}}^{*}) K_{i} + \hat{b} \end{matrix}

(40)

\begin{matrix} {\hat{β}}_{k} (u_{i}) = {(K_{i}^{t} \circ X^{k})}^{t} (\hat{α} - {\hat{α}}^{*}) + {\hat{b}}_{k}, k = 0, 1, \dots, d, \end{matrix}

(41)

where K_i is the ith row of the kernel matrix K whose elements are ϕ(u_i)^t ϕ(u_j) = K(u_i, u_j), X^k is the kth column of the n × (d + 1) design matrix X, and ∘ is the Hadamard product.

Here, $\hat{b}$ and ${\hat{b}}_{k}$ can be determined by the linear regression with the input vector ${({\bar{W}}_{i} y, X_{i})}^{t}$ and the output variable

\begin{matrix} y_{i} - (D i a g {\bar{W} y} (\hat{α} - {\hat{α}}^{*}) + \hat{ν} - {\hat{ν}}^{*}) K_{i} - ((X_{i} X^{t}) \circ K_{i}) (\hat{α} - {\hat{α}}^{*}), \end{matrix}

(42)

which is,

\begin{matrix} y_{i} - (D i a g {\bar{W} y} (\hat{α} - {\hat{α}}^{*}) + \hat{ν} - {\hat{ν}}^{*}) K_{i} - ((X_{i} X^{t}) \circ K_{i}) (\hat{α} - {\hat{α}}^{*}) \\ = & (b, b_{0}, b_{1}, \dots, b_{k}) {({\bar{W}}_{i} y, X_{i}^{t})}^{t} for i \in I_{sv}, \end{matrix}

(43)

where $I_{sv} = {i = 1, 2, \dots, n : 0 < | α_{i} - α_{i}^{*} | < C or | ν_{i} - ν_{i}^{*} | > 0}$ is obtained by exploiting Karush-Kuhn-Tucker conditions [23]. That is, the estimated values of $\hat{y}$ is obtained as follows:

\begin{matrix} \hat{y} = D i a g {\bar{W} y} \hat{ρ} + \sum_{k = 0}^{d} x_{k} \circ {\hat{β}}_{k}, \end{matrix}

(44)

where $\hat{ρ} = {(\hat{ρ} (u_{1}), \dots, \hat{ρ} (u_{n}))}^{t}$ and ${\hat{β}}_{k} = {({\hat{β}}_{k} (u_{1}), \dots, {\hat{β}}_{k} (u_{n}))}^{t}$ .

Model selection

The functional structure of the KBGTWAR model is characterized by the regularization parameter C, the kernel parameter κ and the number q of the nearest neighbors. These hyperparameters will affect the final model complexity. We now illustrate the model selection method which determines the optimal values of these hyperparameters of the KBGTWAR model. To choose these hyperparameters we utilize the AIC function which is defined according to [25] as follows:

\begin{matrix} A I C (λ) = \frac{R S S (λ)}{n} + \frac{2 d}{n} {\hat{σ}}^{2} (λ), \end{matrix}

(45)

where λ is the vector of hyperparameters, $R S S (λ) = \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}$ , ${\hat{σ}}^{2} (λ)$ denotes the estimate of noise variance defined as ${\hat{σ}}^{2} (λ) = \frac{1}{n - d} R S S (λ)$ , and d is the number of free parameters, i.e., the size of $I_{sv} = {i = 1, 2, \dots, n : 0 < | α_{i} - α_{i}^{*} | < C or | ν_{i} - ν_{i}^{*} | > 0}$ .

Case study

This section illustrates the performance of the proposed KBGTWAR model using house price data collected from 2001 to 2008 in Shenzhen, China. This data set was firstly used in [18]. Shenzhen is a special economic zone, which is situated in Guangdong Province immediately north of Hong Kong Special Administrative Region. This city forms part of the Pearl River Delta megalopolis. There are six administrative districts in Shenzhen, which are Luohu, Nanshan, Futian, Yantian, Bao’an and Longgang. By the way, the last two districts are not situated in the Special Zone. See for further details [18]. The house prices of Shenzhen continue to increase at an alarming rate due to rapid industrialization and urbanization. The study data on house prices were provided by the Shenzhen municipal bureau of land resources and housing management. From the study area 406 observations are available.

[18] reported that for empirical study thirteen input variables were used, but only six of them were statistically significant at the 90% confidence level according to their t-probabilities. The six input variables used in this study are land area (LANDA), distance from the nearest major road (DROAD), quality (QUAL), land price (LPRICE), property management spend (MAGT), and proximity to bus (TRAFF). The input variables and (u, v, t) coordinate variables are standardized such as (z − min(z))/(max(z) − min(z)). As in [26], a recent selling price is used as output variable, which stands as a proxy for the market value of the house. In fact, the logarithm of recent selling price is considered as output variable.

According to [18], there exist spatiotemporal autocorrelations in the output variable. Therefore, it is appropriate to use GTWAR-based models for this house price data set. For comparison, the global OLS model, the spatial autoregressive (SAR) model and three different GWR-based models including GWR, GTWAR and KBGTWAR are implemented using the same data set. The OLS and SAR models are employed to analyze the house price data by considering space coordinates and time as exogenous variables. GTWAR and KBGTWAR models are used to analyze the house price data with spatial and temporal considerations.

Using the cross validation (CV) technique, [18] determined that the optimal number of the nearest neighbors for GWR and GTWAR models is q = 51 and q = 44, respectively. In this paper we obtain q = 53 using the AIC technique for KBGTWAR model. As mentioned before, the spatiotemporal weights matrix $\bar{W}$ for KBGTWAR model is constructed using the spatiotemporal distance (26) instead of the spatiotemporal distance in [18]. Using this $\bar{W}$ , we obtain the value of Moran’s I for KBGTWAR model, which is 0.1105. This value indicates that spatiotemporal autocorrelation is positive and thus we need to use KBGTWAR model under the condition ρ ∈ [0, 1]. We examine parameter estimates for models under consideration and goodness of fit in terms of RSS and R². For the case of ρ ∈ [0, 1], the values of hyperparameters of KBGTWAR model are determined by the AIC method as (C, κ, q) = (50, 0.05, 53). The results are reported in Tables 1 and 2. In fact, the estimate of ρ for GTWAR and KBGTWAR is the median of 406 estimated ρ_i’s. Both median values are very similar when ρ ∈ [0, 1]. Fig 1 reports the estimated ρ ∈ [0, 1] values for 406 observed points. By comparing RSS and R² values, KBGTWAR gives significantly better fit of data than OLS, SAR, GWR and GTWAR models. The proposed KBGTWAR model provides the bigger F-statistic value than GWR and GTWAR models. Therefore, this indicates that it is more appropriate to model this particular data set with nonlinear local KBGTWAR model.

Table 1. Parameter estimate results of OLS, SAR and GWR for residential estate price data of Shenzhen.

	OLS	SAR (q = 44, ρ ∈ [−1, 1])	GWR (q = 51)
Parameter			Min	Med	Max
Intercept	8.0027	8.426	3.519	7.713	9.663
LANDA	0.4178	0.416	-4.750	0.236	5.011
DROAD	0.3627	0.361	-2.400	-0.310	2.928
QUAL	0.2469	0.246	-15.535	-0.304	19.686
LPRICE	1.5161	1.538	-3.734	2.125	12.279
MAGT	1.5734	1.574	-3.021	2.643	6.117
TRAFF	1.0397	1.042	-3.292	0.954	4.842
ρ		-0.0470
RSS	54.7	54.7		37.2
R²	0.612	0.612		0.736
F-statistic	69.38	69.38		1.63

Open in a new tab

Table 2. Parameter estimate results of GTWAR and KBGTWAR for residential estate price data of Shenzhen.

	GTWAR (q = 44, ρ ∈ [0, 1])			KBGTWAR (q = 53, ρ ∈ [0, 1])
Parameter	Min	Med	Max	Min	Med	Max
Intercept	0.805	8.020	16.663	6.656	7.285	7.852
LANDA	-6.699	0.571	6.192	-0.470	1.159	5.773
DROAD	-7.237	-0.016	8.884	-4.052	0.102	3.123
QUAL	-10.280	-0.941	5.490	-4.987	0.126	2.305
LPRICE	-6.377	1.300	10.584	-2.123	1.476	7.198
MAGT	-9.794	1.776	8.753	-2.363	1.702	5.819
TRAFF	-5.589	1.102	9.991	-1.792	0.656	3.109
ρ		0.1101			0.1088
RSS		12.0			4.5
R²		0.914			0.968
F-statistic		3.84			4.45

Open in a new tab

Fig 1 — The estimated spatial autoregressive parameter(ρ ∈ [0, 1]) values for 406 observed points.

We now investigate the spatial and temporal variations of three selected parameters, i.e., autoregressive parameter ρ, DROAD and QUAL coefficients. Figs 2 and 3 show the results. The plots in Fig 2 show the effects of sales time and the geographical location on each individual parameter. As seen from Fig 2, on the whole the autoregressive parameter and QUAL coefficient values do not show the apparent spatial and temporal variations. However, DROAD coefficient show somewhat apparent spatial and temporal variations. In particular, this coefficient changes nonlinearly in the sales time and Y-coordinate. The maps in Fig 3 show the distribution of each individual coefficient. As seen from Fig 3, the autoregressive parameter and QUAL coefficient do not show apparent spatial variations. However, DROAD coefficient changes from low in the south to high in the north. From Figs 2 and 3 we observe that the effect of DROAD coefficient on the house price depends on the location and the sales time.

Fig 2 — Autoregressive parameter ρ (top), DROAD (middle) and QUAL (bottom) coefficients on the sales time and the X, Y coordinates of the geographical location.

Fig 3 — Autoregressive parameter (top), DROAD (middle) and QUAL (bottom) coefficients.

Conclusions

In this paper, we proposed the KBGTWAR model to simultaneously account for spatiotemporal nonstationarity and autocorrelation that exist in the house prices. To devise KBGTWAR model we applied kernel technique to spatially and temporally varying coefficients and then utilized the basic principle of linear SVM to estimate the relevant parameters. Unlike GTWAR model, KBGTWAR model is basically nonlinear. Therefore, KBGTWAR model can deal with complex nonlinear trends, which are very common in spatiotemporal phenomena. KBGTWAR model can also appropriately explore the dynamic relationship between the output variable and the input variables, since this model is based on varying coefficients model which efficiently describes dynamic patterns of a regression relationship.

KBGTWAR model takes over all advantages of SVM and varying coefficients model that capture nonlinearities in the data, that have good prediction ability, and that are useful tools when the functional form of the relationship between the output variable and the input variables is left unspecified. In particular, KBGTWAR model works well under settings without strong assumptions on the distribution of the data. KBGTWAR model is also interpretable, since varying coefficients model is meaningfully interpretable.

However, as with all SVM-related models, KBGTWAR model requires lots of computing time in determining the optimal hyperparameters, and has serious computational problem for large data because it has to solve a large-scale quadratic programming problem to get the values of relevant parameters. These are disadvantages associated with KBGTWAR model.

This paper analyzed data reflecting the spatiotemporal nonstationarity and autocorrelation of house prices. OLS, SAR, GWR, GTWAR and KBGTWAR models used in the study were built based on house price data collected between 2001 and 2008 in the city of Shenzhen, China. The performances of these models were then compared based on RSS, R² and F-statistic. This paper demonstrates that the proposed KBGTWAR model provides good results in goodness of fit for the given example. To conclude, we proposed a more efficient KBGTWAR model to account for spatiotemporal nonstationarity and autocorrelation simultaneously.

Acknowledgments

The authors wish to thank Professor Huang who provided us with house price data. This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology with grant no. (NRF-2014R1A1A2054917, NRF-2018R1D1A1B07042349). This work was supported by “Human Resources Program in Energy Technology” of the Korea Institute of Energy Technology Evaluation and Planning (KETEP), granted financial resource from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20174030201740).

Data Availability

Data are third party and are available upon request from the case study whose authors may be contacted by emailing Bo Huang at bohuang@cuhk.edu.hk. The authors of this study accessed the data in the same manner, and did not have any special privileges to the data.

Funding Statement

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology with grant no. (NRF-2014R1A1A2054917, NRF-2018R1D1A1B07042349). This work was supported by “Human Resources Program in Energy Technology” of the Korea Institute of Energy Technology Evaluation and Planning (KETEP), granted financial resource from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20174030201740). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Cliff AD, Ord JK. Spatial processes: Models and applications. Pion, London; 1981. [Google Scholar]
2. Can A. Specification and estimation of hedonic housing price models. Reg Sci Urban Econ. 1992. September; 22(3): 453–474. 10.1016/0166-0462(92)90039-4 [DOI] [Google Scholar]
3. Pace RK, Gilley OW. Generalizing the OLS and grid estimators. Real Estate Econ. 1998. June; 26(2): 331–347. 10.1111/1540-6229.00748 [DOI] [Google Scholar]
4. Matheron G. Principles of geostatistics. Econ Geol. 1963. December; 58(8): 1246–1266. 10.2113/gsecongeo.58.8.1246 [DOI] [Google Scholar]
5. Basu A, Thibodeau T. Analysis of spatial autocorrelation in house price. J Real Estate Finance Econ. 1998. July;17(1): 61–85. 10.1023/A:1007703229507 [DOI] [Google Scholar]
6. Gelfand AE, Ghosh SK, Knight JR, Sirmans CF. Spatio-temporal modeling of residential sales data. J Bus Econ Statist. 1998. July; 16(3): 312–321. 10.1080/07350015.1998.10524770 [DOI] [Google Scholar]
7. Tu Y, Yu SM, Sun H. Transaction-based office price indexes: a spatiotemporal modeling approach. Real Estate Econ. 2004. May; 32(2): 297–328. 10.1111/j.1080-8620.2004.00093.x [DOI] [Google Scholar]
8. McMillen DP. One hundred fifty years of land values in Chicago: a nonparametric approach. J Urban Econ. 1996. July; 40(1): 100–124. 10.1006/juec.1996.0025 [DOI] [Google Scholar]
9. McMillen DP, McDonald JF. A nonparametric analysis of employment density in a polycentric city. J Reg Sci. 1997. November; 37(4): 591–612. 10.1111/0022-4146.00071 [DOI] [Google Scholar]
10. Brunsdon C, Fotheringham AS, Charlton M. Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr Anal. 1996. October; 28(4): 281–298. 10.1111/j.1538-4632.1996.tb00936.x [DOI] [Google Scholar]
11. Fotheringham AS, Charlton ME, Brunsdon C. The geography of parameter space: an investigation of spatial non-stationarity. Int J Geogr Inf Sci. 1996. July; 10(5): 605–627. 10.1080/02693799608902100 [DOI] [Google Scholar]
12. Fotheringham AS, Brunsdon C, Charlton M. Geographically weighted regression. John Wiley and Sons, Chichester; 2002. [Google Scholar]
13. Brunsdon C, Fotheringham AS, Charlton M. Some notes on parametric significance tests for geographically weighted regression. J Reg Sci. 1999. August;39(3): 497–524. 10.1111/0022-4146.00146 [DOI] [Google Scholar]
14. Huang B, Wu B, Barry T. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Int J Geogr Inf Sci. 2010. March; 24(3): 383–401. 10.1080/13658810802672469 [DOI] [Google Scholar]
15. Fotheringham AS, Crespo R, Yao J. Geographical and temporal weighted regression (GTWR). Geogr Anal. 2015. October; 47(4): 431–452. 10.1111/gean.12071 [DOI] [Google Scholar]
16. Liu J, Yang Y, Xu S, Zhao Y, Wang Y, Zhang F. A Geographically Temporal Weighted Regression Approach with Travel Distance for House Price Estimation. Entropy. 2016. August; 18(8): 303–312. 10.3390/e18080303 [DOI] [Google Scholar]
17.LeSage JP. The theory and practice of spatial econometrics. preprint (1999). Available at http://www.spatial-econometrics.com.
18. Wu B, Li R, Huang B. A geographically and temporally weighted autoregressive model with application to housing prices. Int J Geogr Inf Sci. 2014. April; 28(5): 1186–1204. 10.1080/13658816.2013.878463 [DOI] [Google Scholar]
19. Vapnik VN. The nature of statistical learning theory. Springer, New York; 1995. [Google Scholar]
20. Can A, Megbolugbe I. Spatial dependence and house price index construction. J Real Estate Finance Econ. 1997. January; 14(1): 203–222. 10.1023/A:1007744706720 [DOI] [Google Scholar]
21. Hurvich CM, Simonoff JS, Tsai CL. Smoothing parameter selection in nonparametric regression using an improved Akaike Information Criterion. J R Stat Soc Series B. 1998. April; 60(2): 271–293. 10.1111/1467-9868.00125 [DOI] [Google Scholar]
22. Smola AJ, Schölkopf B. A tutorial on support vector regression. Stat Comput. 2004. August; 14(3): 199–222. 10.1023/B:STCO.0000035301.49549.88 [DOI] [Google Scholar]
23.Kuhn HW, Tucker AW. Nonlinear programming. Proceedings of 2nd Berkeley Symposium, University of California Press, Berkeley; 1951.
24. Mercer J. Function of positive and negative type and their connection with theory of integral equations Philosophical Transactions of Royal Society A 1909; 415–446. [Google Scholar]
25. Cherkassky V, Ma Y. Comparison of model selection for regression. Neural Comput. 2003. July; 15(7): 1691–1714. 10.1162/089976603321891864 [DOI] [PubMed] [Google Scholar]
26. Sirmans G, Macpherson D, Zietz A. The composition of hedonic pricing models. J. Real Estate Lit. 2005. January; 13(1): 3–41. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[pone.0205063.ref001] 1. Cliff AD, Ord JK. Spatial processes: Models and applications. Pion, London; 1981. [Google Scholar]

[pone.0205063.ref002] 2. Can A. Specification and estimation of hedonic housing price models. Reg Sci Urban Econ. 1992. September; 22(3): 453–474. 10.1016/0166-0462(92)90039-4 [DOI] [Google Scholar]

[pone.0205063.ref003] 3. Pace RK, Gilley OW. Generalizing the OLS and grid estimators. Real Estate Econ. 1998. June; 26(2): 331–347. 10.1111/1540-6229.00748 [DOI] [Google Scholar]

[pone.0205063.ref004] 4. Matheron G. Principles of geostatistics. Econ Geol. 1963. December; 58(8): 1246–1266. 10.2113/gsecongeo.58.8.1246 [DOI] [Google Scholar]

[pone.0205063.ref005] 5. Basu A, Thibodeau T. Analysis of spatial autocorrelation in house price. J Real Estate Finance Econ. 1998. July;17(1): 61–85. 10.1023/A:1007703229507 [DOI] [Google Scholar]

[pone.0205063.ref006] 6. Gelfand AE, Ghosh SK, Knight JR, Sirmans CF. Spatio-temporal modeling of residential sales data. J Bus Econ Statist. 1998. July; 16(3): 312–321. 10.1080/07350015.1998.10524770 [DOI] [Google Scholar]

[pone.0205063.ref007] 7. Tu Y, Yu SM, Sun H. Transaction-based office price indexes: a spatiotemporal modeling approach. Real Estate Econ. 2004. May; 32(2): 297–328. 10.1111/j.1080-8620.2004.00093.x [DOI] [Google Scholar]

[pone.0205063.ref008] 8. McMillen DP. One hundred fifty years of land values in Chicago: a nonparametric approach. J Urban Econ. 1996. July; 40(1): 100–124. 10.1006/juec.1996.0025 [DOI] [Google Scholar]

[pone.0205063.ref009] 9. McMillen DP, McDonald JF. A nonparametric analysis of employment density in a polycentric city. J Reg Sci. 1997. November; 37(4): 591–612. 10.1111/0022-4146.00071 [DOI] [Google Scholar]

[pone.0205063.ref010] 10. Brunsdon C, Fotheringham AS, Charlton M. Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr Anal. 1996. October; 28(4): 281–298. 10.1111/j.1538-4632.1996.tb00936.x [DOI] [Google Scholar]

[pone.0205063.ref011] 11. Fotheringham AS, Charlton ME, Brunsdon C. The geography of parameter space: an investigation of spatial non-stationarity. Int J Geogr Inf Sci. 1996. July; 10(5): 605–627. 10.1080/02693799608902100 [DOI] [Google Scholar]

[pone.0205063.ref012] 12. Fotheringham AS, Brunsdon C, Charlton M. Geographically weighted regression. John Wiley and Sons, Chichester; 2002. [Google Scholar]

[pone.0205063.ref013] 13. Brunsdon C, Fotheringham AS, Charlton M. Some notes on parametric significance tests for geographically weighted regression. J Reg Sci. 1999. August;39(3): 497–524. 10.1111/0022-4146.00146 [DOI] [Google Scholar]

[pone.0205063.ref014] 14. Huang B, Wu B, Barry T. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Int J Geogr Inf Sci. 2010. March; 24(3): 383–401. 10.1080/13658810802672469 [DOI] [Google Scholar]

[pone.0205063.ref015] 15. Fotheringham AS, Crespo R, Yao J. Geographical and temporal weighted regression (GTWR). Geogr Anal. 2015. October; 47(4): 431–452. 10.1111/gean.12071 [DOI] [Google Scholar]

[pone.0205063.ref016] 16. Liu J, Yang Y, Xu S, Zhao Y, Wang Y, Zhang F. A Geographically Temporal Weighted Regression Approach with Travel Distance for House Price Estimation. Entropy. 2016. August; 18(8): 303–312. 10.3390/e18080303 [DOI] [Google Scholar]

[pone.0205063.ref017] 17.LeSage JP. The theory and practice of spatial econometrics. preprint (1999). Available at http://www.spatial-econometrics.com.

[pone.0205063.ref018] 18. Wu B, Li R, Huang B. A geographically and temporally weighted autoregressive model with application to housing prices. Int J Geogr Inf Sci. 2014. April; 28(5): 1186–1204. 10.1080/13658816.2013.878463 [DOI] [Google Scholar]

[pone.0205063.ref019] 19. Vapnik VN. The nature of statistical learning theory. Springer, New York; 1995. [Google Scholar]

[pone.0205063.ref020] 20. Can A, Megbolugbe I. Spatial dependence and house price index construction. J Real Estate Finance Econ. 1997. January; 14(1): 203–222. 10.1023/A:1007744706720 [DOI] [Google Scholar]

[pone.0205063.ref021] 21. Hurvich CM, Simonoff JS, Tsai CL. Smoothing parameter selection in nonparametric regression using an improved Akaike Information Criterion. J R Stat Soc Series B. 1998. April; 60(2): 271–293. 10.1111/1467-9868.00125 [DOI] [Google Scholar]

[pone.0205063.ref022] 22. Smola AJ, Schölkopf B. A tutorial on support vector regression. Stat Comput. 2004. August; 14(3): 199–222. 10.1023/B:STCO.0000035301.49549.88 [DOI] [Google Scholar]

[pone.0205063.ref023] 23.Kuhn HW, Tucker AW. Nonlinear programming. Proceedings of 2nd Berkeley Symposium, University of California Press, Berkeley; 1951.

[pone.0205063.ref024] 24. Mercer J. Function of positive and negative type and their connection with theory of integral equations Philosophical Transactions of Royal Society A 1909; 415–446. [Google Scholar]

[pone.0205063.ref025] 25. Cherkassky V, Ma Y. Comparison of model selection for regression. Neural Comput. 2003. July; 15(7): 1691–1714. 10.1162/089976603321891864 [DOI] [PubMed] [Google Scholar]

[pone.0205063.ref026] 26. Sirmans G, Macpherson D, Zietz A. The composition of hedonic pricing models. J. Real Estate Lit. 2005. January; 13(1): 3–41. [Google Scholar]

PERMALINK

Kernel-based geographically and temporally weighted autoregressive model for house price estimation

Jooyong Shim

Changha Hwang

Roles

Abstract

Introduction

GTWAR model

GTWR model

GTWAR model

Model selection

KBGTWAR model

SVM regression

KBGTWAR model

Model selection

Case study

Table 1. Parameter estimate results of OLS, SAR and GWR for residential estate price data of Shenzhen.

Table 2. Parameter estimate results of GTWAR and KBGTWAR for residential estate price data of Shenzhen.

Fig 1. Estimated spatial autoregressive parameter.

Fig 2. Effects of sales time and the geographical location on each individual parameter.

Fig 3. Spatial variation of each individual parameters.

Conclusions

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Kernel-based geographically and temporally weighted autoregressive model for house price estimation

Jooyong Shim

Changha Hwang

Roles

Abstract

Introduction

GTWAR model

GTWR model

GTWAR model

Model selection

KBGTWAR model

SVM regression

KBGTWAR model

Model selection

Case study

Table 1. Parameter estimate results of OLS, SAR and GWR for residential estate price data of Shenzhen.

Table 2. Parameter estimate results of GTWAR and KBGTWAR for residential estate price data of Shenzhen.

Fig 1. Estimated spatial autoregressive parameter.

Fig 2. Effects of sales time and the geographical location on each individual parameter.

Fig 3. Spatial variation of each individual parameters.

Conclusions

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases