Stochastic dynamic models and Chebyshev splines

Ruzong Fan; Bin Zhu; Yuedong Wang

doi:10.1002/cjs.11233

. Author manuscript; available in PMC: 2015 Jun 2.

Published in final edited form as: Can J Stat. 2014 Nov 3;42(4):610–634. doi: 10.1002/cjs.11233

Stochastic dynamic models and Chebyshev splines

Ruzong Fan ^1,^*, Bin Zhu ², Yuedong Wang ³

PMCID: PMC4451187 NIHMSID: NIHMS690965 PMID: 26045632

Abstract

In this article, we establish a connection between a stochastic dynamic model (SDM) driven by a linear stochastic differential equation (SDE) and a Chebyshev spline, which enables researchers to borrow strength across fields both theoretically and numerically. We construct a differential operator for the penalty function and develop a reproducing kernel Hilbert space (RKHS) induced by the SDM and the Chebyshev spline. The general form of the linear SDE allows us to extend the well-known connection between an integrated Brownian motion and a polynomial spline to a connection between more complex diffusion processes and Chebyshev splines. One interesting special case is connection between an integrated Ornstein–Uhlenbeck process and an exponential spline. We use two real data sets to illustrate the integrated Ornstein–Uhlenbeck process model and exponential spline model and show their estimates are almost identical.

Keywords: Brownian motion, Ornstein–Uhlenbeck process, reproducing kernel Hilbert space, smoothing splines, stochastic differential equations

1. INTRODUCTION

There exists a one-to-one correspondence between a reproducing kernel Hilbert space (RKHS) and Gaussian stochastic processes by the Kolmogorov consistency theorem (Cramér & Leadbetter, 1967; Wahba, 1990). This correspondence has led to the development of connections between general smoothing spline models and Gaussian stochastic processes which enables researchers to borrow strength across different fields. For example, stochastic models corresponding to smoothing splines have been used to derive the generalized maximum likelihood estimate of smoothing parameters and to construct Bayesian confidence intervals (Wahba, 1990). A specific connection between an L-spline and an integrated Brownian motion has been explored to develop efficient O(n) algorithms (Wecker & Ansley, 1983).

Specific connections between more complex processes and related smoothing splines have not been studied extensively. The purpose of this article is to establish a connection between a stochastic dynamic model (SDM) driven by a linear stochastic differential equation (SDE) and a Chebyshev spline. We construct a differential operator for the penalty function and develop an RKHS corresponding to the SDM and the Chebyshev spline. The general form of the SDE allows us to establish a connection between more complex diffusion processes and Chebyshev splines. As an interesting special case, an integrated Ornstein–Uhlenbeck process will be connected to an exponential spline.

The extended connection between the SDM driven by a linear SDE and a Chebyshev spline can be explored to motivate new SDMs based on spline models, and vice versa. As an illustration, we will present a partial spline model motivated by an SDM and an SDM motivated by logistic spline. We differ from the smoothing spline literature where one usually builds SDMs that connect to spline models of interest; here we will build spline models that connect to SDMs. We will present the construction of penalties corresponding to SDMs, which may be regarded as priors for regression functions in non-parametric regression models.

Estimation and computational methods for Chebyshev splines may be used to compute the posterior means of the corresponding SDMs, and vice versa. We will use two real data sets to show that the posterior mean of the integrated Ornstein–Uhlenbeck process in an SDM matches the penalized least squares estimate of the corresponding exponential spline.

Gaussian stochastic models have been widely used in many fields including physics, engineering, finance and biology (Ansley & Kohn, 1986; Rue & Martino, 2009; Stuart, 2010; Griebel & Hegland, 2010; Lindgren, Rue, & Lindstrom, 2011; Papaspiliopoulis et al., 2012). Some of the interesting special cases are given in Bishwal (2008). On the other hand, much work has been done in the area of smoothing splines. For instance, Pintore, Speckman, & Holmes (2006) used an RKHS representation to derive the smoothing spline with a particular inhomogeneous differential operator penalty; Furrer & Nychka (2007) built a framework to understand the asymptotic properties of Kriging and splines; in this framework Kriging estimators are interpreted as generalized smoothing splines. The differential operator L = D^q in Pintore, Speckman, & Holmes (2006) is a special case of that considered in this paper.

The remainder of this article is organized as follows. In Section 2, we introduce the SDM. In Section 3, we compute mean and covariance functions for a general class of Gaussian models driven by a linear SDE and construct corresponding Chebyshev splines. We extend the SDM in Section 4 with general differential operators and connect them to general Chebyshev splines. We use two real data sets to confirm the theoretical results in Section 5 and conclude with some remarks in Section 6.

2. STOCHASTIC DYNAMIC MODELS

We present an SDM driven by an SDE in the Section 2.1. Then we develop an equivalent stochastic integration equation and use this equivalent model to compute the mean and covariance functions of the dynamic system in Section 2.2.

2.1. Stochastic Dynamic Models

Consider a temporal SDM

Y (t, ω) = U (t, ω) + ε (t, ω), ω \in Ω, t \in [0, T],

(1)

where Y(t, ω) is the observation at time t, U(t, ω) is a latent stochastic process of interest observed at time t on the path ω, ε(t, ω) is an error term, Ω is the sample path probability space, and T is a positive real number which may be the right end point of time in consideration. We assume that U(·) and ε(·) are independent and ε(·) is a Gaussian white noise process with variance σ².

In this paper, we will first consider the following stochastic dynamic system for the latent stochastic process U(·)

\frac{d^{q} U (t)}{d t^{q}} = a_{0} (t) U_{q} + b (t) V (t), q \geq 0, t \in [0, T],

(2)

where U_k = d^kU(0)/dt^k for k = 0, …, q are initial values of U(·) and its derivatives up to the order q, and a₀(t) and b(t) are integrable deterministic functions. We assume that U₀, …, U_q are mutually independent and square integrable random variables. A more general equation for U(·) that corresponds to the general Chebyshev splines will be considered in Section 4.

We assume that the stochastic process V(·) in (2) is independent of U₀, …, U_q and is driven by the following SDE

d V (t, ω) = β (t, V (t, ω)) d t + σ (t, V (t, ω)) d B (t, ω), V (0, ω) = 0, t \in [0, T],

(3)

where β(t, V(t, ω)) is a drift term, σ²(t, V(t, ω)) is a diffusion coefficient, and B(t, ω) is a standard Brownian motion observed at time t on the path ω. We assume that the SDE (3) has a unique, continuous and adapted strong solution, and V(·) has finite moments of any order p ∈ [1,∞). This assumption holds when the coefficients in (3) satisfy the Lipschitz condition, |σ(t, x) − σ(t, y)|² + |β(t, x) − β(t, y)|² ≤ c₁(T)|x − y|², and the growth condition, |σ(t, x)|² + |β(t, x)|² ≤ c₂(T)(1 + x²) for all x, y ∈ R and t ∈ [0, T], where c₁(T) and c₂(T) are positive constants depending on T only (Ikeda & Watanabe, 1989).

Together, Equations (1), (2) and (3) define the SDM we study in this paper. The SDE (3) is a parametric Itô SDE with initial value equal to zero. The assumption about the initial value may be removed (see Remark 3.1). Zhu, Song, & Taylor (2011) considered a similar SDM with a₀(t) = 0 and b(t) = 1 in the stochastic dynamic system (2) without the initial condition V(0) = 0, and used an Ornstein–Uhlenbeck process V(·) to model the rate of changes of prostate specific antigen profiles. We consider the more general Equation (2) for the connection to Chebyshev splines.

It is noteworthy that a special case of the SDM has been connected to polynomial smoothing splines. Specifically, let a₀(t) = b(t) = 1, β(t, V(t)) = 0 and σ(t, V(t)) = σ_V, then U(·) is the same as the random effects model (1.5.8) in Wahba (1990) for a polynomial spline of order q + 1 (refer to Section S.5 in the Supplementary Materials for details). When q = 1 and V(·) is a standard Brownian motion, then U(·) is an integrated Brownian motion. Alternatively, one may want to use an Ornstein–Uhlenbeck process V(·) to model exponentially decreasing correlation. In this case, U(·) is an integrated Ornstein–Uhlenbeck process.

2.2. An Equivalent Stochastic Integration Equation

In this section, we first introduce a stochastic integration equation that is equivalent to the stochastic dynamic system (2). We then use the stochastic integration equation to compute the mean and covariance functions of the stochastic process U(·).

When q = 0, the stochastic dynamic system (2) has the form

U (t) = a_{0} (t) U_{0} + b (t) V (t),

(4)

and it is straightforward to compute the mean and covariance functions of U(·) in this case.

In the following discussion, we assume that q ≥ 1.We will show that the following stochastic integration equation is equivalent to the stochastic dynamic system (2)

U (t) = \sum_{i = 0}^{q - 1} ψ_{i} (t) U_{i} + A_{q} (t) U_{q} + \int_{0}^{t} ψ_{q} (t - s) d [b (s) V (s)],

(5)

where ψ_i(t) ≜ tⁱ/i!, $A_{q} (t) ≜ \int_{0}^{t} ψ_{q - 1} (t - s) a_{0} (s) d s$ , and V(·) is a stochastic process driven by the SDE defined in (3). We first show that (5) can be rewritten as

U (t) = \sum_{i = 0}^{q - 1} ψ_{i} (t) U_{i} + A_{q} (t) U_{q} + \int_{0}^{t} ψ_{q - 1} (t - s) b (s) V (s) d s .

(6)

To see that (5) can be rewritten as (6), applying Itô formula to b(s)V(s)(t − s)^q, s ≤ t, leads to

\int_{0}^{t} {(t - s)}^{q} d [b (s) V (s)] + \int_{0}^{t} b (s) V (s) d {(t - s)}^{q} = b (s) V (s) {(t - s)}^{q} |_{0}^{t} = 0 .

Hence, we have

\int_{0}^{t} {(t - s)}^{q} d [b (s) V (s)] = q \int_{0}^{t} b (s) V (s) {(t - s)}^{q - 1} d s .

(7)

Suppose that U(·) is given by the stochastic integration Equation (5). By Equation (7), we have the expression (6).

When q = 1, it is easy to see that taking the derivative on both sides of (6) with respect to t leads to (2). When q ≥ 2, taking the derivative on both sides of (6) with respect to t leads to

\frac{d U (t)}{d t} = \sum_{i = 1}^{q - 1} ψ_{i - 1} (t) U_{i} + A_{q - 1} (t) U_{q} + \int_{0}^{t} ψ_{q - 2} (t - s) b (s) V (s) d s + [ψ_{q - 1} (t - s) b (s) V (s)] |_{s = t} = \sum_{i = 1}^{q - 1} ψ_{i - 1} (t) U_{i} + A_{q - 1} (t) U_{q} + \int_{0}^{t} ψ_{q - 2} (t - s) b (s) V (s) d s .

Continuing this process to calculate higher order derivatives, we get d^qU(t)/dt^q = a₀(t)U_q + b(t)V(t). Therefore, the stochastic integration Equation (5) or (6) implies the stochastic differential system (2).

Conversely, assuming that U(·) is given by the stochastic differential system (2) and reversing the process above, it is not difficult to show that U(·) satisfies the stochastic integration Equation (5) or (6). The following lemma summarizes the above discussion.

Lemma 2.1

Suppose that the stochastic process V(·) is driven by the SDE (3). Then, the stochastic dynamic system (2) is equivalent to the stochastic integration Equation (5) or (6).

Note that when a₀(t) = b(t) = 1 and V(t) = σ_V B(t), we have A_q(t) = ψ_q(t) and $U (t) = \sum_{i = 0}^{q} ψ_{i} (t) U_{i} + σ_{V} \int_{0}^{t} ψ_{q} (t - s) d B (s)$ which is the stochastic process corresponding to the polynomial spline of order q + 1 (Kimeldorf & Wahba, 1970a,b; Wahba, 1990).

The mean and covariance functions of U(·) listed in the following proposition can be computed directly based on (4) when q = 0 or by applying the Fubini’s theorem to Equation (6) when q ≥ 1.

Proposition 2.1

For the stochastic process U(·) defined in (2) or equivalently in (5) or (6), when q ≥ 1, we have

E U (t) = \sum_{i = 0}^{q - 1} ψ_{i} (t) μ_{i} + A_{q} (t) μ_{q} + \int_{0}^{t} ψ_{q - 1} (t - s) b (s) E V (s) d s,

Cov (U (s), U (t)) = \sum_{i = 0}^{q - 1} ψ_{i} (s) ψ_{i} (t) σ_{i}^{2} + A_{q} (s) A_{q} (t) σ_{q}^{2} + \int_{0}^{t} \int_{0}^{s} ψ_{q - 1} (t - u) ψ_{q - 1} (s - υ) b (u) b (υ) Cov (V (u), V (υ)) d u d υ,

where μ_i ≜ E U_i and $σ_{i}^{2} ≜ Var (U_{i})$ When q = 0, we have E U(t) = a₀(t)μ₀ + b(t) E V(t) and $Cov (U (s), U (t)) = a_{0} (s) a_{0} (t) σ_{0}^{2} + b (s) b (t) Cov (V (s), V (t))$ .

There exists a unique RKHS that is congruent to the Hilbert space generated by the stochastic process U(·) (Berlinet & Thomas-Agnan, 2004). In the next section, we will make specific connections between Gaussian processes driven by a linear SDE and an RKHS.

3. GAUSSIAN PROCESS DRIVEN BY A LINEAR SDE AND CONNECTION TO CHEBYSHEV SPLINES

The stochastic process V(·) driven by the SDE (3) is a Markovian process. However, it is not necessarily a Gaussian process. In this section, we consider a Gaussian process driven by the following linear SDE

d V (t) = [β_{0} (t) + β_{1} (t) V (t)] d t + σ (t) d B (t), V (0) = 0, t \in [0, T],

(8)

where β₀(t), β₁(t) and σ(t) are deterministic, measurable, and bounded functions of time t (Karatzas & Shreve, 1988). The SDE (8) is a special case of (3) when β(t, V(t, ω)) is linear in V(·) and σ(t, V(t, ω)) is deterministic.

In this section, we establish a connection between the SDM where V(·) is driven by the linear SDE (8) and Chebyshev splines. In Section 3.1, we apply results in Section 2.2 to compute the mean and covariance functions for Gaussian processes driven by the linear SDE. In Section 3.2, we build connections between Gaussian processes and Chebyshev splines. In the subsections, we provide examples of some specific SDMs and their corresponding spline models.

3.1. Gaussian Process Driven by the Linear SDE (8)

Denote $Ψ (t) ≜ exp (\int_{0}^{t} β_{1} (s) d s)$ . By the Itô formula, one can show that (Karatzas & Shreve, 1988, pp. 354–355)

V (t) = Ψ (t) [\int_{0}^{t} \frac{β_{0} (s)}{Ψ (s)} d s + \int_{0}^{t} \frac{σ (s)}{Ψ (s)} d B (s)] .

(9)

Lemma 3.1

Suppose that the stochastic process V(·) is a Gaussian process driven by the linear SDE (8). Then, we have

E V (t) = Ψ (t) \int_{0}^{t} \frac{β_{0} (s)}{Ψ (s)} d s,

Cov (V (s), V (t)) = Ψ (s) Ψ (t) \int_{0}^{s \land t} {(\frac{σ (u)}{Ψ (u)})}^{2} d u .

The proof of Lemma 3.1 can be found in the Supplementary Materials. For q ≥ 1, denote

F (t, u) ≜ I (u \leq t) \int_{u}^{t} ψ_{q - 1} (t - τ) b (τ) Ψ (τ) d τ,

where I(u ≤ t) = 1 when u ≤ t and 0 otherwise. Combining results in Proposition 2.1 and Lemma 3.1, we have the following theorem.

Theorem3.1

Suppose that the stochastic process V(·) is a Gaussian process driven by the linear SDE (8) and $U_{i} ~ N (μ_{i}, σ_{i}^{2})$ for i = 0, …, q. Then, the stochastic process U(·) is a Gaussian process with the following mean and covariance functions

E U (t) = \sum_{i = 0}^{q - 1} ψ_{i} (t) μ_{i} + A_{q} (t) μ_{q} + \int_{0}^{t} F (t, s) \frac{β_{0} (s)}{Ψ (s)} d s,

Cov (U (s), U (t)) = \sum_{i = 0}^{q - 1} ψ_{i} (s) ψ_{i} (t) σ_{i}^{2} + A_{q} (s) A_{q} (t) σ_{q}^{2} + \int_{0}^{s \land t} F (s, u) F (t, u) {(\frac{σ (u)}{Ψ (u)})}^{2} d u,

where, for q = 0, we take $\sum_{i = 0}^{q - 1} ≜ 0$ , A₀(t) ≜ a₀(t), and F(t, u) ≜ b(t) Ψ(t)I(u ≤ t).

The proof of Theorem 3.1 is provided in the Supplementary Materials.

Remark 3.1

We have assumed that V(0) = 0 in the SDE (8). This assumption can be removed from the construction as follows. Let

\bar{V} (t) = Ψ (t) [U_{q} + \int_{0}^{t} \frac{β_{0} (s)}{Ψ (s)} d s + \int_{0}^{t} \frac{σ (s)}{Ψ (s)} d B (s)] .

Then, V̄(·) satisfies

d \bar{V} (t) = [β_{0} (t) + β_{1} (t) \bar{V} (t)] d t + σ (t) d B (t), \bar{V} (0) = U_{q}, t \in [0, T] .

(10)

When a₀(t) = Ψ(t), b(t) = 1, and V(·) is a solution of linear SDE (8), the stochastic differential system (2) is equivalent to

\frac{d^{q} U (t)}{d t^{q}} = \bar{V} (t) .

(11)

This means that d^qU(t)/dt^q is equal to the solution V̄(t) of the linear SDE (10) which starts from U_q. Both V̄(·) and U(·) defined in (11) are Gaussian processes.

3.2. Construction of a Chebyshev Spline Model

We now construct a Chebyshev spline model such that, up to a known function, the penalized least squares estimate equals the best linear unbiased estimator of the SDM defined by (1), (2) and (8).We note that the following development is in the opposite direction of the common approach employed in the spline literature where one constructs a stochastic process to connect with a smoothing spline model.

Define a differential operator

L f (t) ≜ \frac{d}{d t} (\frac{f^{(q)} (t)}{Ψ (t)}) = \frac{f^{(q + 1)} (t) - β_{1} (t) f^{(q)} (t)}{Ψ (t)} .

(12)

We may write L as Lf (u) = D [D^qf (u)/Ψ(u)] where D^q = d^q/dt^q is the qth derivative operator. The differential operator L is a special case of the general differential operator L_q+1 defined in Equation (4.64) of Gu (2013) for the Chebyshev spline with w_i(t) = 1 for i = 1, …, q and w_q+1(t) = Ψ(t). Then, from Equation (4.65) in Gu (2013), the Chebyshev system on [0, T] is ψ₀(t), ψ₁(t), …, ψ_q−1(t), ψ̃_q(t) and they span the null space ℋ₀ = span{ψ₀(t), ψ₁(t), …, ψ_q−1(t), ψ̃_q(t)} of the differential operator L, where

{\tilde{ψ}}_{q} (t) = \int_{0}^{t} ψ_{q - 1} (t - s) Ψ (s) d s .

Under an inner product $\sum_{i = 0}^{q} f^{(i)} (0) g^{(i)} (0)$ , {ψ₀(t),ψ₁(t), …, ψ_q−1(t),ψ̃_q(t)} forms an orthonormal basis of the null space ℋ₀. Furthermore, it can be shown that the Green’s function associated with the differential operator L is

G (t, u) = I (u \leq t) \int_{u}^{t} ψ_{q - 1} (t - τ) Ψ (τ) d τ,

which is given by the relation (4.67) in Gu (2013).

Consider the model space

W_{2}^{q + 1} [0, T] ≜ {f : f, f', \dots, f^{(q)} are absolutely continuous and \int_{0}^{T} {(L f (t))}^{2} h (t) d t < \infty}

(13)

with an inner product

(f, g) ≜ \sum_{i = 0}^{q} f^{(i)} (0) g^{(i)} (0) + \int_{0}^{T} L f (u) L g (u) h (u) d u,

(14)

where h(u) ≜ (Ψ(u)/σ(u))² is a weight function. Following the same arguments as in Section 4.5.2 of Gu (2013), we have the following result.

Theorem 3.2

Assume that a₀(t) = Ψ(t) and b(t) = 1. Then, $W_{2}^{q + 1} [0, T]$ is an RKHS. Let us denote the kernel of L as $ℋ_{0} = {f \in W_{2}^{q + 1} [0, T] : L f = 0}$ . Then, ψ₀(t), ψ₁(t), …, ψ_q−1(t), A_q(t) = ψ̃_q(t) form an orthonormal basis of ℋ₀, and $W_{2}^{q + 1} [0, T]$ can be decomposed into $W_{2}^{q + 1} [0, T] = ℋ_{0} \oplus ℋ_{1}$ , where

ℋ_{0} = span {ψ_{0} (t), ψ_{1} (t), \dots, ψ_{q - 1} (t), {\tilde{ψ}}_{q} (t)},

ℋ_{1} = {f \in W_{2}^{q + 1} [0, T] : f (0) = f' (0) = \dots = f^{(q)} (0) = 0} .

The reproducing kernels of ℋ₀ and ℋ₁ are respectively given by

R_{0} (s, t) = \sum_{i = 0}^{q - 1} ψ_{i} (s) ψ_{i} (t) + {\tilde{ψ}}_{q} (s) {\tilde{ψ}}_{q} (t),

R_{1} (s, t) = \int_{0}^{s \land t} G (s, u) G (t, u) {[h (u)]}^{- 1} d u .

The proof of Theorem 3.2 is straightforward based on the fact that A_q(t) = ψ̃_q(t) and G(t, u) = F(t, u) when a₀(t) = Ψ(t) and b(t) = 1.

Remark 3.2

In Theorem 3.2 we assumed that a₀(t) = Ψ(t). In general, assume that a₀(t) is strictly positive and a₀(0) = 1. It is not difficult to check that Dⁱψ_j(0) = 1 if i = j and 0 otherwise for 0 ≤ i, j ≤ q − 1. Furthermore, D^qψ_j(0) = 0 for 0 ≤ j ≤ q − 1, DⁱA_q(0) = 0 for 0 ≤ i ≤ q − 1, and D^qA_q(0) = a₀(0) = 1. Then {ψ₀(t), ψ₁(t), …, ψ_q−1(t), A_q(t)} forms an orthonormal basis of a subspace of $W_{2}^{q + 1} [0, T]$ under the inner product $\sum_{i = 0}^{q} f^{(i)} (0) g^{(i)} (0)$ . In addition, we have Lψ_i(t) = 0 for i = 0, 1, …, q− 1. It is easy to check that D^qA^q(t) = a₀(t). Therefore, LA_q(t) = 0 iff a₀(t) = Ψ(t). Consequently, A_q(t) does not belong to the space $ℋ_{0} = {f \in W_{2}^{q + 1} [0, T] : L f = 0}$ when a₀(t) ≠ Ψ(t). Nevertheless, when a₀(t) ≠ Ψ(t), a partial spline model may be constructed. See Section 3.3.2 for an example.

Consider the following nonparametric regression model

Y_{i} = f (t_{i}) + ε_{i}, i = 1, \dots, n, t_{i} \in [0, T] .

(15)

Assume that $f \in W_{2}^{q + 1} [0, T]$ . A Chebyshev spline is the solution to the following penalized least Squares

min_{f \in W_{2}^{q + 1} [0, T]} {\frac{1}{n} \sum_{i = 1}^{n} {[Y_{i} - f (t_{i})]}^{2} + λ \int_{0}^{T} {(L f (t))}^{2} h (t) d t},

(16)

where λ is a smoothing parameter and L is given in (12). Let y = (Y₁, …, Y_n)′, $Σ = {R_{1} (t_{i}, t_{j})}_{i, j = 1}^{n}, S = {(ψ_{0} (t_{i}), \dots, ψ_{q - 1} (t_{i}), A_{q} (t_{i}))}_{i = 1}^{n}$ , and M = Σ + nλI_n, where I_n is an n × n identity matrix. The solution to (16) can be represented as (Wang, 2011)

{\hat{f}}_{λ} (t) = \sum_{i = 0}^{q - 1} d_{i} ψ_{i} (t) + d_{q} A_{q} (t) + \sum_{k = 1}^{n} c_{i} R_{1} (t, t_{i}),

(17)

Where

(d_{0}, \dots, d_{q})' = {(S' M^{- 1} S)}^{- 1} S' M^{- 1} y,

(c_{1}, \dots, c_{n})' = M^{- 1} [I_{n} - S {(S' M^{- 1} S)}^{- 1} S' M^{- 1}] y .

Now consider n observations based on the SDM (1)

Y (t_{i}) = U (t_{i}) + ε (t_{i}), i = 1, \dots, n, t_{i} \in [0, T],

(18)

where the stochastic processes U(·) and V(·) are defined in (2) and (8), respectively. The best linear unbiased estimator of U(t) is the posterior mean E (U(t)|Y(t_i), i = 1, …, n) (Wahba, 1990). Denote

μ (t) ≜ \int_{0}^{t} G (t, s) β_{0} (s) / Ψ (s) d s .

(19)

In this article, we assume that μ(t) is known. Subtracting μ(t_i) on both sides of (18) and following the same arguments as in Wahba (1990) and Gu (2013), we have the following connection between the Chebyshev spline and the best linear unbiased estimator of U(t).

Proposition 3.1

Assume that a₀(t) = Ψ(t), b(t) = 1, and U₀, …, $U_{q} \overset{i i d}{~} N (0, a)$ . Denote

Û_{a} (t) = E (U (t) | Y (t_{i}), i = 1, \dots, n)

as the posterior mean where the dependence on the variance a is expressed explicitly. For any fixed t ∈ [0, T], when λ = σ²/n, we have

lim_{a \to \infty} Û_{a} (t) = μ (t) + {\hat{f}}_{λ} (t),

(20)

where f̂_λ (t) in (20) is the penalized least squares solution to (16) with observations y = (Y(t₁) − μ(t₁), …, Y(t_n) − μ(t_n))′.

Remark 3.3

The penalty can be simplified as follows

\int_{0}^{T} {(L f (t))}^{2} h (t) d t = \int_{0}^{T} {[\frac{f^{(q + 1)} (t) - β_{1} (t) f^{(q)} (t)}{σ (t)}]}^{2} d t .

(21)

It is clear that the construction of RKHS including inner product, basis function of the space ℋ₀ and reproducing kernel of the space ℋ₁ is independent of the function β₀(t), while whether there exists a drift term μ(t) defined in (19) depends on if β₀(t) = 0. The condition n λ = σ² does not depend on σ(t) since σ(t) is involved in the penalty. When σ(t) is a constant, say σ_V, it may be absorbed into the smoothing parameter and then we have the standard condition $n λ = σ^{2} / σ_{V}^{2}$ .

Remark 3.4

Assume that β₀(t) = 0. Then μ(t) = 0. Proposition 3.1 states that the best linear unbiased estimator of U(t) coincides with the smoothing spline estimate as a → ∞. This link has been explored to derive the generalized maximum likelihood (restricted maximum likelihood) estimate of the smoothing parameter λ. The variance function of Û_a(t) as a → ∞ has been used to construct Bayesian confidence intervals. See Wang (2011) for details.

Remark 3.5

Proposition 3.1 extends existing results to the case when μ(t) ≠ 0. Denote

Û = {lim}_{a \to \infty} (U_{a} (t_{1}), \dots, U_{a} (t_{n}))'

as the best linear unbiased estimates at design points, Y = (Y(t₁), …, Y(t_n))′, and μ = (μ(t₁), …, μ(t_n))′. Then, it can be seen that

Û = μ + ({\hat{f}}_{λ} (t_{1}), \dots, {\hat{f}}_{λ} (t_{n}))' = μ + H (λ) (Y - μ) = (I_{n} - H (λ)) μ + H (λ) Y,

Where H (λ) = I_n − nλM⁻¹[I_n − S(S′M⁻¹S)⁻¹S′M⁻¹] is the smoothing matrix. Thus, Û has the typical form of a shrinkage estimator.

3.3. Ornstein–Uhlenbeck Process and Exponential Spline

Consider an Ornstein–Uhlenbeck process V(·) that satisfies the SDE

d V (t) = θ (μ - V (t)) d t + σ_{V} d B (t), V (0) = 0,

(22)

where μ is the equilibrium value of the process, θ > 0 is the speed of reversion, and σ_V is the volatility. It is easy to see that the Ornstein–Uhlenbeck process is a special case of (8) with β₀(t) = μθ, β₁(t) = −θ, and σ(t) = σ_V.

The mean and covariance functions of V(·) are

E V (t) = μ [1 - exp (- θ t)],

Cov (V (s), V (t)) = \frac{σ_{V}^{2}}{2 θ} exp (- θ | s - t |) [1 - exp (- 2 θ s \land t)] .

In this subsection we assume that q ≥ 1. See Remark 3.7 for the exponential spline with q = 0. It is easy to check that Ψ(t) = exp(−θt), $h (t) = e^{- 2 θ t} σ_{V}^{- 2}$ and $F (t, u) = I (u \leq t) \int_{u}^{t} ψ_{q - 1} (t - τ) e^{- θ τ} b (τ) d τ$ . Assume that $U_{i} ~ N (μ_{i}, σ_{i}^{2})$ . Then the mean and covariance functions of U(·) are

E U (t) = \sum_{i = 0}^{q - 1} ψ_{i} (t) μ_{i} + A_{q} (t) μ_{q} + μ \int_{0}^{t} ψ_{q - 1} (t - s) b (s) (1 - e^{- θ s}) d s

Cov (U (s), U (t)) = \sum_{i = 0}^{q - 1} ψ_{i} (s) ψ_{i} (t) σ_{i}^{2} + A_{q} (s) A_{q} (t) σ_{q}^{2} + σ_{V}^{2} \int_{0}^{s \land t} F (s, u) F (t, u) e^{2 θ u} d u .

The differential operator in (12) reduces to Lf(u) = [f^(q+1)(u) + θf^(q)(u)] e^θu. Therefore, the penalty term in (16) is equal to $(λ / σ_{V}^{2}) \int_{0}^{T} {[f^{(q + 1)} (t) + θ f^{(q)} (t)]}^{2} d t$ . Note that f^(q+1) + θf^(q) = D^q−1(D² + θD)f, where D² + θD is differential operator for exponential spline (Wang, 2011).

3.3.1. Exponential spline

Consider a special case of model (2) with a₀(t) = e^−θt and b(t) = 1. When q = 1, the stochastic process U(·) can be represented as a summation of drift terms and an integrated Ornstein–Uhlenbeck process

U (t) = U_{0} + U_{1} \frac{1 - e^{- θ t}}{θ} + \int_{0}^{t} (t - s) d V (s) = U_{0} + U_{1} \frac{1 - e^{- θ t}}{θ} + \int_{0}^{t} V (s) d s .

(23)

The penalty $\int_{0}^{T} {[(D^{2} + θ D) f]}^{2} d t$ is the same as that for an exponential spline. It is easy to check that the basis functions of the space ℋ₀ in Theorem 3.2 are ψ₀(t) = 1 and A₁(t) = (1 − e^−θt)/θ, and the reproducing kernel of the space ℋ₁ is

R_{1} (s, t) = \frac{σ_{V}^{2}}{θ^{2}} [s \land t - \frac{e^{- θ s} + e^{- θ t}}{θ} (e^{θ s \land t} - 1) + \frac{e^{- θ (s + t)}}{2 θ} (e^{2 θ s \land t} - 1)],

(24)

which is equivalent to that of an exponential spline (Wang, 2011, pp 41–44). Furthermore,

μ (t) = \int_{0}^{t} \frac{F (t, u) β_{0} (u)}{Ψ (u)} d u = \int_{0}^{t} μ θ e^{θ u} d u \int_{u}^{t} e^{- θ τ} d τ = μ θ^{- 1} (θ t - 1 + e^{- θ t}) .

(25)

Therefore, when μ = 0, the integrated Ornstein–Uhlenbeck process U(·) defined in (23) is the corresponding stochastic process for the exponential spline. The case when μ ≠ 0 provides an extension of the exponential spline. We note that the integrated Ornstein–Uhlenbeck stochastic process with drift U(t) represents a major deviation from existing spline literature which involves the Brownian motion only.

The mean function of stochastic process U(·) is

E [U (t)] = μ_{0} + μ_{1} \frac{1 - e^{- θ t}}{θ} + μ (t + \frac{e^{- θ t} - 1}{θ}) .

When μ = 0, $μ = 0, \bar{V} (t) = Ψ (t) [U_{1} + \int_{0}^{t} σ_{V} e^{θ s} d B (s)]$ in Remark 3.1 satisfies

d \bar{V} (t) = - θ \bar{V} (t) d t + σ_{V} d B (t), \bar{V} (0) = U_{1}, t \in [0, T] .

Therefore, $U (t) = U_{0} + \int_{0}^{t} \bar{V} (s) d s$ and E[U(t)] = μ₀ + μ₁(1 − e^−θt)/θ.

When q > 1, the stochastic process U(·) can be represented as

U (t) = \sum_{i = 1}^{q - 1} ψ_{i} (t) U_{i} + A_{q} (t) U_{q} + \int_{0}^{t} ψ_{q - 1} (t - s) V (s) d s .

(26)

The penalty $\int_{0}^{T} {[D^{(q - 1)} (D^{2} + θ D) f]}^{2} d t$ adds polynomials up to the order q − 1 to the kernel space ℋ₀. Specifically, the orthonormal basis functions of ℋ₀ are ψ₀(t), ψ₁(t), …, ψ_q−1(t), and A_q(t), where $A_{q} (t) = \int_{0}^{t} ψ_{q - 1} (t - s) e^{- θ s} d s$ . The function A_q(t) can be calculated recursively by A_q(t) = ψ_q−1(t)/θ − A_q−1(t)/θ and A₁(t) = (1 − e^−θt)/θ. The Chebyshev spline in this case may be called a polynomial-exponential spline.

3.3.2. Partial exponential spline

Consider another special case of model (2) with a₀(t) = b(t) = 1. For q ≥ 1, the stochastic process U(·) can be represented as

U (t) = \sum_{i = 0}^{q} ψ_{i} (t) U_{i} + \int_{0}^{t} ψ_{q - 1} (t - s) V (s) d s .

(27)

Replacing a Brownian motion by the Ornstein–Uhlenbeck process V(·), the stochastic process U(·) in (27) extends the stochastic model for polynomial splines (Wahba, 1990) (see Eq. (S.5) in the Supplementary Materials). The mean and covariance of the stochastic process U(·) are

E U (t) = \sum_{i = 0}^{q} ψ_{i} (t) μ_{i} + \int_{0}^{t} ψ_{q - 1} (t - s) μ (1 - e^{- θ s}) d s,

Cov (U (s), U (t)) = \sum_{i = 0}^{q} ψ_{i} (s) ψ_{i} (t) σ_{i}^{2} + σ_{V}^{2} \int_{0}^{s \land t} F (s, u) F (t, u) e^{2 θ u} d u,

where $F (t, u) = I (u \leq t) \int_{u}^{t} ψ_{q - 1} (t - τ) e^{- θ τ} d τ$ .

For simplicity, we now consider the special case when q = 1. The stochastic process U(·) is given by

U (t) = U_{0} + U_{1} t + \int_{0}^{t} (t - s) d V (s) = U_{0} + U_{1} t + \int_{0}^{t} V (s) d s .

(28)

The integrated Ornstein–Uhlenbeck process in (28) extends the integrated Brownian motion [see (S.6) in Section S.5 of the Supplementary Materials]. The mean and covariance functions are

E U (t) = μ_{0} + μ_{1} t + μ (t + \frac{e^{- θ t} - 1}{θ}),

Cov (U (s), U (t)) = σ_{0}^{2} + s t σ_{1}^{2} + R_{1} (s, t),

(29)

where R₁ is given by (24).

Note that Theorem 3.2 does not apply in this case since a₀(t) ≠ Ψ(t). Compared with the exponential spline model, the basis function (1 − exp(−θt))/θ in ℋ₀ has been replaced by the function ψ₁(t) = t which is not orthogonal to the space ℋ₁ (Remark 3.2). Nevertheless, the stochastic process U(·) defined in (28) can be connected to the following partial spline model

Y_{i} = α_{1} + α_{2} t_{i} + f (t_{i}) + ε_{i}, i = 1, \dots, n, t_{i} \in [0, T] .

(30)

Assume that $f \in ℳ ≜ W_{2}^{2} [0, T] ⊖ {1, (1 - exp (- θ t)) / θ}$ under the inner product (14) with Lf (u) = [D²f (u) + θDf(u)] e^θu and $h (u) = e^{- 2 θ u} σ_{V}^{- 2}$ . Let α̂₁, α̂₂ and f̂_λ(t) be the solution to the following penalized least squares

min_{α_{1} \in R, α_{2} \in R, f \in ℳ} {\frac{1}{n} \sum_{i = 1}^{n} {[Y_{i} - (α_{1} + α_{2} t_{i} + f (t_{i}))]}^{2} + λ \int_{0}^{T} {(L f (t))}^{2} h (t) d t} .

(31)

Following similar arguments as in Section 3.2 one can show that lim_a→∞ Û_a(t) = μ(t)+ α̂₁ + α̂₂t + f̂_λ(t), where α̂₁, α̂₂ and f̂_λ(t) are the penalized least squares solutions to (31) with observations Y_i = Y(t_i) − μ(t_i) for i = 1, …, n where μ(t) is given in (25).

3.4. Logistic Spline and Its Corresponding SDM

The logistic spline is a special case of the Chebyshev spline with q = 0 and penalty $\int_{0}^{T} {[f' (t) - β_{1} (t) f (t)]}^{2} d t$ , where

β_{1} (t) = θ γ e^{- θ t} / (1 + γ e^{- θ t}), θ > 0, γ > 0.

(32)

To construct the corresponding SDM, consider a process V(·) driven by the following linear SDE

d V (t) = β_{1} (t) V (t) d t + σ_{V} d B (t), V (0) = 0 .

(33)

Equation (33) is a special case of (8) with β₀(t) = 0, β₁(t) is given in (32), and σ(t) = σ_V. It is not difficult to check that μ(t) = 0, Ψ(t) = (1 + γ)/(1 + γe^−θt), and

V (t) = \frac{σ_{V} B (t)}{1 + γ e^{- θ t}} + \frac{σ_{V} γ}{1 + γ e^{- θ t}} \int_{0}^{t} e^{- θ s} d B (s) .

Then, E V(t) = 0 and

Cov (V (s), V (t)) = Ψ (s) Ψ (t) \int_{0}^{s \land t} {(\frac{σ_{V}}{Ψ (u)})}^{2} d u = \frac{σ_{V}^{2}}{(1 + γ e^{- θ s}) (1 + γ e^{- θ t})} [s \land t + \frac{2 γ}{θ} (1 - e^{- θ (s \land t)}) + \frac{γ^{2}}{2 θ} (1 - e^{- 2 θ (s \land t)})] .

(34)

According to (21), the penalty $\int_{0}^{T} {(L f (t))}^{2} h (t) d t = σ_{V}^{- 2} \int_{0}^{T} {[f' (t) - β_{1} (t) f (t)]}^{2} d t$ is the same as that for the logistic spline up to a multiplying constant which can be absorbed into the smoothing parameter λ (Wang, 2011). Consider the special case of stochastic dynamic system (2) with a₀(t) = Ψ(t) and b(t) = 1. The basis function of the space ℋ₀ is Ψ(t) and the Green’s function is Ψ(t)I(u ≤ t). Then, the reproducing kernel for the space ℋ₁ is $R_{1} (s, t) = Ψ (s) Ψ (t) \int_{0}^{s \land t} {[σ_{V} / Ψ (u)]}^{2} d u$ . R₁ has the form in (34) which is the same as the reproducing kernel for the logistic spline (Eq. (2.61) in Wang (2011)). Thus, the SDM (1), (2) and (33) consist of the stochastic model for the logistic spline.

Remark 3.6

When q = 1, following similar argument it can be shown that the SDM (1), (2) and (33) is the stochastic model for the logistic spline discussed by Gu (2013, p. 161). The cases when q > 1 can be regarded as extensions of the logistic spline where basis functions ψ₀(t), …, ψ_q−1(t) are added to the null space ℋ₀.

Remark 3.7

The connection between an SDM driven by (33) and a Chebyshev spline discussed in this section holds for a general function β₁(t). Specifically, consider the SDE (33) with an unspecified β₁(t). Assume that q = 0, a₀(t) = Ψ(t) and b(t) = 1. Then, $V (t) = σ_{V} Ψ (t) \int_{0}^{t} d B (s) / Ψ (s)$ and $U (t) = Ψ (t) U_{0} + σ_{V} Ψ (t) \int_{0}^{t} d B (s) / Ψ (s)$ . According to Theorem 3.2, the RKHS $W_{2}^{1} [0, T] = {Ψ (t)} \oplus ℋ_{1}$ , where ℋ₁has the reproducing kernel $R_{1} (s, t) = Ψ (s) Ψ (t) \int_{0}^{s \land t} {[σ_{V} / Ψ (u)]}^{2} d u$ . The penalty of the corresponding Chebyshev spline is $σ_{V}^{- 2} \int_{0}^{T} {[f' (t) - β_{1} (t) f (t)]}^{2} d t$ . The logistic spline is a special case with β₁(t) = θγe^−θt/(1 + γe^−θt). It is not difficult to check that the function β₁(t) = −θ corresponds to the exponential spline with β₀(t) = 0. Other functions may be considered for β₁(t). For example, β₁(t) = −θt corresponds to a Chebyshev spline with the kernel space spanned by Ψ(t) = e^−θt²/2, the Gaussian function.

4. EXTENDED STOCHASTIC DYNAMIC MODELS AND THEIR CONNECTIONS TO CHEBYSHEV SPLINES

Motivated by the Chebyshev splines (Karlin & Ziegler, 1966; Kimeldorf & Wahba, 1971; Gu, 2013), we now consider a more general dynamic system by replacing D^q in (2) with the following differential operator

Π_{q} = D \frac{1}{a_{1}} D \frac{1}{a_{2}} \dots D \frac{1}{a_{q}},

(35)

where a₁, …, a_q are strictly positive and (i + 1)th differentiable functions with a_i(0) = 1. Specifically, we assume the following stochastic dynamic system for U(·)

Π_{q} U (t) = a_{0} (t) Π_{q} U (0) + b (t) V (t), q \geq 0, t \in [0, T] .

(36)

As in previous sections, we will consider two types of models for V(·): the general diffusion process determined by the SDE (3) and the Gaussian model driven by the linear SDE (8).

The following stochastic differential system has been considered by Kimeldorf & Wahba (1971):

Π_{q + 1} U (t) = \frac{d B (t)}{d t},

(37)

where $Π_{q + 1} = D \frac{1}{a_{0}} D \frac{1}{a_{1}} D \frac{1}{a_{2}} \dots D \frac{1}{a_{q}}$ is a (q + 1)-order differential operator of Chebyshev splines (Wahba, 1978, 1990). The stochastic dynamic system (37) is equivalent to Π_qU(t) = a₀(t)[Π_qU(0) + B(t)]. The stochastic dynamic system (36) extends (37) in two aspects: (1) Brownian motion B(t) is replaced by a general solution V(·) of an SDE, and (2) the coefficient b(t) of V(·) does not necessarily equal to a₀(t).

4.1. General Stochastic Dynamic Models

Following Kimeldorf & Wahba (1971) and Wahba (1990), denote

Π_{0} = I,

Π_{1} = D \frac{1}{a_{q}},

\begin{matrix} Π_{2} = D \frac{1}{a_{q - 1}} D \frac{1}{a_{q}}, \\ ⋮ \\ Π_{q - 1} = D \frac{1}{a_{2}} D \frac{1}{a_{3}} \dots D \frac{1}{a_{q}}, \end{matrix}

where I is the identity operator. In addition, define the following functions:

ω_{0} (t) = a_{q} (t),

\begin{matrix} ω_{1} (t) = a_{q} (t) \int_{0}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1}, \\ ⋮ \\ ω_{q - 1} (t) = a_{q} (t) \int_{0}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} \int_{0}^{t_{q - 1}} a_{q - 2} (t_{q - 2}) d t_{q - 2} \dots \int_{0}^{t_{2}} a_{1} (t_{1}) d t_{1}, \end{matrix}

ω_{q} (t) = a_{q} (t) \int_{0}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} \int_{0}^{t_{q - 1}} a_{q - 2} (t_{q - 2}) d t_{q - 2} \dots \int_{0}^{t_{2}} a_{1} (t_{1}) d t_{1} \int_{0}^{t_{1}} a_{0} (t_{0}) d t_{0} .

Note that for i, j = 0, 1, 2, …, q, we have

(Π_{i} ω_{j}) (0) = {\begin{matrix} 1 & if i = j \\ 0 & else . \end{matrix}

(38)

Define

X (t) = a_{q} (t) \int_{0}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} \dots \int_{0}^{t_{2}} a_{1} (t_{1}) d t_{1} \int_{0}^{t_{1}} b (t_{0}) d t_{0} \int_{0}^{t_{0}} d V (u) = a_{q} (t) \int_{0}^{t} b (t_{0}) V (t_{0}) d t_{0} \int_{t_{0}}^{t} a_{1} (t_{1}) d t_{1} \dots \int_{t_{q - 2}}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} .

(39)

We note a typo in Equation (7.3) of Kimeldorf & Wahba (1971) where … $\int_{0}^{t_{2}} a_{1} (t_{1}) d W (t_{1})$ should be … $\int_{0}^{t_{2}} a_{1} (t_{1}) W (t_{1}) d t_{1}$ .

Following similar arguments as in the Section 2.2, the stochastic differential system (36) is equivalent to the following stochastic integration equation

U (t) = \sum_{i = 0}^{q} ω_{i} (t) U_{i} + X (t),

(40)

where U_i = Π_iU(0). Let μ_i = EU_i and $σ_{i}^{2} = Var (U_{i})$ . Applying the Fubini’s Theorem to the stochastic integration equation (40), we have the following results which extends Proposition 2.1.

Proposition 4.1

Suppose that U(·) is given by the stochastic integration equation (40), where V(·) is driven by the SDE (3). Then,

E U (t) = \sum_{i = 0}^{q} ω_{i} (t) μ_{i} + a_{q} (t) \int_{0}^{t} b (τ) E V (τ) d τ \int_{τ}^{t} a_{1} (t_{1}) d t_{1} \dots \int_{t_{q - 2}}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1},

Cov (U (s), U (t)) = \sum_{i = 0}^{q} ω_{i} (s) ω_{i} (t) σ_{i}^{2} + Cov (X (s), X (t)),

Cov (X (s), X (t)) = a_{q} (s) a_{q} (t) \int_{0}^{s} \int_{0}^{t} b (u) b (υ) Cov (V (u), V (υ)) d u d υ \int_{u}^{s} a_{1} (s_{1}) d s_{1} \int_{υ}^{t} a_{1} (t_{1}) d t_{1} \dots \int_{s_{q - 2}}^{s} a_{q - 1} (s_{q - 1}) d s_{q - 1} \int_{t_{q - 2}}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} .

4.2. Gaussian Models Driven by a Linear SDE

Denote

\tilde{F} (t, u) = a_{q} (t) \int_{u}^{t} b (τ) Ψ (τ) d τ \int_{τ}^{t} a_{1} (t_{1}) d t_{1} \int_{t_{1}}^{t} a_{2} (t_{2}) d t_{2} \dots \int_{t_{q - 2}}^{t} a_{q - 1} (t_{q - 1}) d t_{q - 1} .

The following results extend Theorem 3.1 when the SDM is driven by the linear SDE (8).

Theorem 4.1

Suppose that the stochastic process V(·) is a Gaussian process driven by the linear SDE (8) and $U_{i} ~ N (μ_{i}, σ_{i}^{2})$ for i = 0, …, q. Then, the stochastic process U(·) given by (36) or (40) is a Gaussian process with the following mean and covariance functions

E U (t) = \sum_{i = 0}^{q} ω_{i} (t) μ_{i} + \int_{0}^{t} \tilde{F} (t, u) \frac{β_{0} (u)}{Ψ (u)} d u,

Cov (U (s), U (t)) = \sum_{i = 0}^{q} ω_{i} (s) ω_{i} (t) σ_{i}^{2} + \int_{0}^{s \land t} \tilde{F} (s, u) \tilde{F} (t, u) {(\frac{σ (u)}{Ψ (u)})}^{2} d u .

The proof of Theorem 4.1 is provided in the Supplementary Materials.

4.3. Connection to Chebyshev Splines

Again, let $Ψ (t) ≜ exp (\int_{0}^{t} β_{1} (s) d s)$ and h(t) ≜ (Ψ(t)/σ(t))² be a weight function. Consider the following differential operator

\tilde{L} f (u) = \frac{d}{d u} (\frac{Π_{q} f (u)}{Ψ (u)}) = \frac{D Π_{q} f (u) - β_{1} (u) Π_{q} f (u)}{Ψ (u)} .

Consider the model space

{\tilde{W}}_{2}^{q + 1} [0, T] ≜ {f : f, Π_{1} f, \dots, Π_{q} f are absolutely continuous and \int_{0}^{T} {(\tilde{L} f (t))}^{2} h (t) d t < \infty}

with inner product

(f, g) = \sum_{i = 0}^{q} Π_{i} f (0) Π_{i} g (0) + \int_{0}^{T} \tilde{L} f (u) \tilde{L} g (u) h (u) d u .

(41)

Theorem 4.2

Assume that a₀(t) = Ψ(t) and b(t) = 1. Then, the space ${\tilde{W}}_{2}^{q + 1} [0, T]$ is an RKHS with inner product (41). Let ${\tilde{ℋ}}_{0} = {f \in {\tilde{W}}_{2}^{q + 1} [0, T] : \tilde{L} f = 0}$ be the kernel of L̃. Then, {ω₀, ω₁,…, ω_q} forms an orthonormal basis of ℋ̃₀ and ${\tilde{W}}_{2}^{q + 1} [0, T]$ can be decomposed into ${\tilde{W}}_{2}^{q + 1} [0, T] = {\tilde{ℋ}}_{0} \oplus {\tilde{ℋ}}_{1}$ , where

{\tilde{ℋ}}_{0} = span {ω_{0}, ω_{1}, \dots, ω_{q}},

{\tilde{ℋ}}_{1} = {f \in {\tilde{W}}_{2}^{q + 1} [0, T] : f (0) = Π_{1} f (0) = \dots = Π_{q} f (0) = 0} .

The reproducing kernels of ℋ̃₀ and ℋ̃₁ are respectively given by

{\tilde{R}}_{0} (s, t) = \sum_{i = 0}^{q} ω_{i} (s) ω_{i} (t),

{\tilde{R}}_{1} (s, t) = \int_{0}^{s \land t} \tilde{F} (s, u) \tilde{F} (t, u) {[h (u)]}^{- 1} d u .

The proof of Theorem 4.2 is provided in the Supplementary Materials.

Now consider the nonparametric regression model (15) where $f \in {\tilde{W}}_{2}^{q + 1} [0, T]$ . A general Chebyshev spline is the solution to the penalized least squares (16) with $W_{2}^{q + 1} [0, T]$ and L being replaced by ${\tilde{W}}_{2}^{q + 1} [0, T]$ and L̃, respectively. Again, up to a constant function, the smoothing spline estimate equals the posterior mean of the SDM (1), (8) and (36). Specifically, assume that U₀, …, $U_{q} \overset{i i d}{~} N (0, a)$ . Denote Û_a = E(U(t)|Y(t_i), i = 1, …, n) and f̃_λ(t) as the penalized least squares solution with observations (Y(t₁) − μ̃(t₁), …, Y(t_n) − μ̃(t_n))′, where

\tilde{μ} (t) ≜ \int_{0}^{t} \tilde{F} (t, s) β_{0} (s) / Ψ (s) d s .

Then for any fixed t ∈ [0, T], when λ = σ²/n, it can be shown that lim_a→∞ Û_a(t) = μ̃(t) + f̃_λ(t).

4.4. Chebyshev Splines to Stochastic Dynamic Models

In the previous sections we started with SDE driven SDMs and built corresponding Chebyshev splines. We now describe a strategy that builds an SDE driven SDM that corresponds to the general Chebyshev spline defined in Section 4.3.

With the general Chebyshev spline, a₀(t), …, a_q(t) and a general weight function h(t) > 0 are given. Assume that $a_{i} (t) \in W_{2}^{i + 1} [0, t]$ for i = 0, …, q are strictly positive functions with a_i(0) = 1. From SDMs to Chebyshev splines, we have set $a_{0} (t) = Ψ (t) = exp (\int_{0}^{t} β_{1} (s) d s)$ and h(t) = (a₀(t)/σ(t))² in the previous sections, where β₁(t) and σ(t) are given in the linear SDE (8). Reversely, now we set $β_{1} (t) = a_{0}^{'} (t) / a_{0} (t)$ and $σ (t) = a_{0} (t) / \sqrt{h (t)}$ . Consider the following stochastic dynamic system

Π_{q} U (t) = a_{0} (t) U_{q} + V (t),

(42)

d V (t) = β_{1} (t) V (t) d t + σ (t) d B (t), V (0) = 0,

(43)

with initial conditions U_i = Π_iU(0) for i = 0, …, q. Then, it is not difficult to show that the general Chebyshev spline equals the posterior mean of the SDM (1), (42), and (43).

5. APPLICATIONS

Proposition 3.1 implies that with appropriate choices of parameters in the SDM and the smoothing parameter, the posterior mean coincides with the corresponding Chebyshev spline estimate. In particular, as discussed in Section 3.3.1, with U₀, $U_{1} \overset{i i d}{~} N (0, a)$ , a → ∞ and $n λ = σ^{2} / σ_{V}^{2}$ , the posterior mean of the integrated Ornstein–Uhlenbeck process (23) coincides with the exponential spline estimate, in which σ² is the variance of ε(t, ω) the measurement error in Equation (1) and $σ_{V}^{2}$ is the volatility term of integrated Ornstein–Uhlenbeck process (23), reflecting the fluctuation of the process. We now use two real data applications to illustrate this theoretical result.

Glomerular filtration rate (GFR) measures the flow rate of filtered fluid through the kidney. Progression of kidney disease is often assessed by change in GFR. Therefore it is important to estimate the trajectory of GFR based on observations (Li et al., 2014). The left panel of Figure 1 shows estimated GFR (eGFR) from a patient with chronic kidney disease. We first consider an exponential spline model discussed in Section 3.3.1 since the profile is close to exponential decay. To estimate the speed of reversion parameter θ, as in Wang (2011), we first fit a nonlinear regression model which is motivated by Equation (25) as y_i = β₁ + β₂ exp(−β₃t_i) + ε_i, i = 1, …, 62, and then set θ = β̂³ where β̂₃ is the non-linear least square estimate of β₃.We then fit the exponential spline using the ssr function in the R ASSIST package (Wang, 2011) with θ = β̂₃ and smoothing parameter selected by the generalized maximum likelihood method. The exponential spline fit is shown in the left panel of Figure 1 as the solid blue line.

GFR (a) and PSA (b) examples. Circles are observations, posterior means are dashed red lines, and exponential spline fits are solid blue lines.

The Ornstein–Uhlenbeck process provides a natural model for a process stabilizing around some equilibrium point. Such phenomena is often observed in the biological or biomedical application. For example, we consider modelling the eGFR profile by the SDM with integrated Ornstein–Uhlenbeck process (23), for which the parameters have natural interpretations in terms of the convergence of process. This is a special case of the stochastic velocity model with Ornstein–Uhlenbeck process discussed in Zhu, Song, & Taylor (2011). Thus the Markov chain Monte Carlo (MCMC) algorithm developed in Zhu, Song, & Taylor (2011) can be used to compute the posterior mean of the integrated Ornstein–Uhlenbeck process. Since patients will eventually lose renal function, we set the equilibrium value μ = 0.We estimated the parameter θ as the posterior mean of MCMC samples of θ, and set U₀, $U_{1} \overset{i i d}{~} N (0, 10^{4})$ . The MCMC was run for 45,000 iterations, in which the first 35,000 runs are discarded as the burn in and every 10th draw is saved. The posterior mean is shown in the left panel of Figure 1 as the red dashed line. As expected from the theoretical result, the exponential spline estimate is almost identical to the posterior mean of the integrated Ornstein–Uhlenbeck process.

We further present another real data application for the prostate specific antigen (PSA) profile commonly used to monitor recurrence of prostate cancer after treatment. Different from the GFR example, the rate of change of the PSA profile converges or the slope of the PSA profile is stabilized at a non-zero value (Figure 1, Zhu, Song, & Taylor, 2011). It’s reasonable to fit an SDM with integrated Ornstein–Uhlenbeck process and non-zero μ. For such a case, we illustrate that exponential spline can also be applied with a simple transformation. To make the estimates by two methods comparable, we fix θ̂; = 1. 15 and μ̂ = 0. 385 (posterior means in Table 3, Zhu, Song, & Taylor, 2011), while other parameters were estimated either by the MCMC algorithm for SDM or generalized maximum likelihood method for the exponential spline. To fit the exponential spline, the transformed variable z(t) = y(t)− μ̂{t + (e^−θ̂t − 1)/θ̂} is used as observations. The right panel of Figure 1 shows the observations and estimated profiles. Again, the exponential spline estimate is almost identical to the posterior mean of the integrated Ornstein–Uhlenbeck process.

6. DISCUSSION

Under the general framework of isometric mapping between the Hilbert space spanned by a second order stochastic process and the RKHS generated by its covariance kernel, we establish a specific connection between an SDM driven by a linear SDE and the Chebyshev spline. This connection provides a statistical structure and mechanism for estimating sample paths in a stochastic dynamic model as well as a justification for the somewhat ad hoc penalty in a penalized least squares. Our results extend the well-known connection between the integrated Brownian and the polynomial spline to the connection between an SDM and the Chebyshev spline, which is mutually beneficial for these two different areas. For example, fitting spline models with large data can be computationally expensive. Instead, we may fit the corresponding SDMs with efficient algorithms based on the Markov property (Kohn & Ansley, 1987; Zhu, Song, & Taylor, 2011).

In this paper we have assumed σ(t), β₀(t), and β₁(t) in (8) are known, which in practice need to be estimated. Under the assumption of a₀(t) = Ψ(t) and b(t) = 1, the SDM (1), (2) and (8) are determined by σ(t), β₀(t), and β₁(t). Similarly, the Chebyshev spline model may contain unknown parameters _ (Heckman & Ramsay, 2000). For example, the parameter θ in the penalty $\int_{0}^{T} {[(D^{2} + θ D) f]}^{2} d t$ of the exponential spline is usually unknown and corresponds to an unknown speed of reversion in the integrated Ornstein–Uhlenbeck process. Estimation methods have been proposed for parameters in Chebyshev spline models (Heckman & Ramsay, 2000; Wang & Ke, 2009 and references therein) and SDMs (Zhu, Song, & Taylor, 2011, and references therein). Connection and comparison between these parameter estimation methods in these two different fields will be studied in the future.

One important feature of the SDM considered in this paper is that the process V(·) can be non-stationary which makes it more flexible. For special cases discussed in this paper, we have assumed that σ(t) is a constant. With a general σ(t), Equation (21) corresponds to an adaptive penalty for spatial inhomogeneous functions (see Pintore, Speckman, & Holmes, 2006; Liu & Guo, 2010, and references therein). As a future research topic, we will explore the corresponding non-stationary processes to fit spline models with varying smoothing parameters.

Supplementary Material

NIHMS690965-supplement-01.pdf^{(71.5KB, pdf)}

ACKNOWLEDGEMENTS

We thank the editors, an associate editor and two referees for constructive comments that substantially improved an earlier draft and revision. This study was supported by the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Maryland, USA (Ruzong Fan).

BIBLIOGRAPHY

Ansley CF, Kohn R. The equivalence of two stochastic approaches to spline smoothing. Journal of Applied Probability. 1986;23:391–405. [Google Scholar]
Berlinet A, Thomas-Agnan C. Reproducing Kernel Hilbert Spaces in Probability and Statistics. Boston: Kluwer; 2004. [Google Scholar]
Bishwal JPN. Parameter Estimation in Stochastic Differential Equations. New York: Springer; 2008. [Google Scholar]
Cramér H, Leadbetter MR. Stationary and Related Stochastic Processes; Sample Function Properties and Their Applications. New York: Wiley; 1967. [Google Scholar]
Furrer EM, Nychka DW. A framework to understand the asymptotic properties of Kriging and splines. Journal of the Korean Statistical Society. 2007;36:57–76. [Google Scholar]
Griebel M, Hegland M. A finite element method for density estimation with Gaussian process priors. SIAM Journal on Numerical Analysis. 2010;47:4759–4792. [Google Scholar]
Gu C. Smoothing Spline ANOVA Models. 2nd ed. New York: Springer; 2013. [Google Scholar]
Heckman N, Ramsay JO. Penalized regression with model-based penalties. Canadian Journal of Statistics. 2000;28:241–258. [Google Scholar]
Ikeda N, Watanabe S. Stochastic Differential Equations and Diffusion Processes. 2nd ed. Amsterdam, New York: North-Holland Pub. Co.; 1989. [Google Scholar]
Karatzas K, Shreve SE. Brownian Motion and Stochastic Calculus. Springer-Verlag; 1988. pp. 354–355. [Google Scholar]
Karlin S, Ziegler Z. Chebyshevian spline functions. Siam Journal on Numerical Analysis. 1966;3:514–543. [Google Scholar]
Kimeldorf GS, Wahba G. A correspondence between Bayesian estimation on stochastic process and smoothing by splines. The Annals of Mathematical Statistics. 1970a;41:495–502. [Google Scholar]
Kimeldorf GS, Wahba G. Spline functions and stochastic processes. Sankhya: The Indian Journal of Statistics, Series A. 1970b;32:173–180. [Google Scholar]
Kimeldorf GS, Wahba G. Some results on Tchebycheffian spline functions. Journal of Mathematical Analysis and Applications. 1971;33:82–95. [Google Scholar]
Kohn R, Ansley CF. A new algorithm for spline smoothing based on smoothing a stochastic process. Siam Journal on Scientific and Statistical Computing. 1987;8:33–48. [Google Scholar]
Li L, Chang A, Rostand SG, Hebert L, Appel LJ, Astor BC, Lipkowitz MS, Wright JT, Kendrick C, Wang X, Greene TH. A within-patient analysis for time-varying risk factors of CKD progression. Journal of the American Society of Nephrology. 2014;25 doi: 10.1681/ASN.2013050464. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lindgren F, Rue H, Lindstrom J. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society B. 2011;73:423–498. [Google Scholar]
Liu Z, Guo W. Data driven adaptive spline smoothing. Statistica Sinica. 2010;20:1143–1163. [Google Scholar]
Papaspiliopoulis O, Pokern Y, Roberts GO, Stuart AM. Nonparametric estimation of diffusions: A differential equations approach. Biometrika. 2012;99:511–531. [Google Scholar]
Pintore A, Speckman P, Holmes CC. Spatially adaptive smoothing splines. Biometrika. 2006;93:113–125. [Google Scholar]
Rue H, Martino S. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society B. 2009;71:319–392. [Google Scholar]
Stuart AM. Inverse problems: A Bayesian perspective. Acta Numerica. 2010;19:451–559. [Google Scholar]
Wahba G. Improper priors, spline smoothing and the problem of guarding against model errors in regression. Journal of the Royal Statistical Society B. 1978;40:364–372. [Google Scholar]
Wahba G. Spline Models for Observational Data. Philadelphia, PA: Society for Industrial Mathematics; 1990. [Google Scholar]
Wang Y. Smoothing Splines, Methods and Applications. Boca Raton, FL: CRC Press, A Chapman & Hall Book; 2011. [Google Scholar]
Wang Y, Ke C. Smoothing Spline Semi-Parametric Nonlinear Regression Models. Journal of Computational and Graphical Statistics. 2009;18:165–183. [Google Scholar]
Wecker WE, Ansley CF. The signal extraction approach to non-linear regression and spline smoothing. Journal of the American Statistical Association. 1983;78:81–89. [Google Scholar]
Zhu B, Song PXK, Taylor JMG. Stochastic functional data analysis: a diffusion model-based approach. Biometrics. 2011;67:1295–1304. doi: 10.1111/j.1541-0420.2011.01591.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

NIHMS690965-supplement-01.pdf^{(71.5KB, pdf)}

[R1] Ansley CF, Kohn R. The equivalence of two stochastic approaches to spline smoothing. Journal of Applied Probability. 1986;23:391–405. [Google Scholar]

[R2] Berlinet A, Thomas-Agnan C. Reproducing Kernel Hilbert Spaces in Probability and Statistics. Boston: Kluwer; 2004. [Google Scholar]

[R3] Bishwal JPN. Parameter Estimation in Stochastic Differential Equations. New York: Springer; 2008. [Google Scholar]

[R4] Cramér H, Leadbetter MR. Stationary and Related Stochastic Processes; Sample Function Properties and Their Applications. New York: Wiley; 1967. [Google Scholar]

[R5] Furrer EM, Nychka DW. A framework to understand the asymptotic properties of Kriging and splines. Journal of the Korean Statistical Society. 2007;36:57–76. [Google Scholar]

[R6] Griebel M, Hegland M. A finite element method for density estimation with Gaussian process priors. SIAM Journal on Numerical Analysis. 2010;47:4759–4792. [Google Scholar]

[R7] Gu C. Smoothing Spline ANOVA Models. 2nd ed. New York: Springer; 2013. [Google Scholar]

[R8] Heckman N, Ramsay JO. Penalized regression with model-based penalties. Canadian Journal of Statistics. 2000;28:241–258. [Google Scholar]

[R9] Ikeda N, Watanabe S. Stochastic Differential Equations and Diffusion Processes. 2nd ed. Amsterdam, New York: North-Holland Pub. Co.; 1989. [Google Scholar]

[R10] Karatzas K, Shreve SE. Brownian Motion and Stochastic Calculus. Springer-Verlag; 1988. pp. 354–355. [Google Scholar]

[R11] Karlin S, Ziegler Z. Chebyshevian spline functions. Siam Journal on Numerical Analysis. 1966;3:514–543. [Google Scholar]

[R12] Kimeldorf GS, Wahba G. A correspondence between Bayesian estimation on stochastic process and smoothing by splines. The Annals of Mathematical Statistics. 1970a;41:495–502. [Google Scholar]

[R13] Kimeldorf GS, Wahba G. Spline functions and stochastic processes. Sankhya: The Indian Journal of Statistics, Series A. 1970b;32:173–180. [Google Scholar]

[R14] Kimeldorf GS, Wahba G. Some results on Tchebycheffian spline functions. Journal of Mathematical Analysis and Applications. 1971;33:82–95. [Google Scholar]

[R15] Kohn R, Ansley CF. A new algorithm for spline smoothing based on smoothing a stochastic process. Siam Journal on Scientific and Statistical Computing. 1987;8:33–48. [Google Scholar]

[R16] Li L, Chang A, Rostand SG, Hebert L, Appel LJ, Astor BC, Lipkowitz MS, Wright JT, Kendrick C, Wang X, Greene TH. A within-patient analysis for time-varying risk factors of CKD progression. Journal of the American Society of Nephrology. 2014;25 doi: 10.1681/ASN.2013050464. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Lindgren F, Rue H, Lindstrom J. An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society B. 2011;73:423–498. [Google Scholar]

[R18] Liu Z, Guo W. Data driven adaptive spline smoothing. Statistica Sinica. 2010;20:1143–1163. [Google Scholar]

[R19] Papaspiliopoulis O, Pokern Y, Roberts GO, Stuart AM. Nonparametric estimation of diffusions: A differential equations approach. Biometrika. 2012;99:511–531. [Google Scholar]

[R20] Pintore A, Speckman P, Holmes CC. Spatially adaptive smoothing splines. Biometrika. 2006;93:113–125. [Google Scholar]

[R21] Rue H, Martino S. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society B. 2009;71:319–392. [Google Scholar]

[R22] Stuart AM. Inverse problems: A Bayesian perspective. Acta Numerica. 2010;19:451–559. [Google Scholar]

[R23] Wahba G. Improper priors, spline smoothing and the problem of guarding against model errors in regression. Journal of the Royal Statistical Society B. 1978;40:364–372. [Google Scholar]

[R24] Wahba G. Spline Models for Observational Data. Philadelphia, PA: Society for Industrial Mathematics; 1990. [Google Scholar]

[R25] Wang Y. Smoothing Splines, Methods and Applications. Boca Raton, FL: CRC Press, A Chapman & Hall Book; 2011. [Google Scholar]

[R26] Wang Y, Ke C. Smoothing Spline Semi-Parametric Nonlinear Regression Models. Journal of Computational and Graphical Statistics. 2009;18:165–183. [Google Scholar]

[R27] Wecker WE, Ansley CF. The signal extraction approach to non-linear regression and spline smoothing. Journal of the American Statistical Association. 1983;78:81–89. [Google Scholar]

[R28] Zhu B, Song PXK, Taylor JMG. Stochastic functional data analysis: a diffusion model-based approach. Biometrics. 2011;67:1295–1304. doi: 10.1111/j.1541-0420.2011.01591.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Stochastic dynamic models and Chebyshev splines

Ruzong Fan

Bin Zhu

Yuedong Wang

Abstract

1. INTRODUCTION

2. STOCHASTIC DYNAMIC MODELS

2.1. Stochastic Dynamic Models

2.2. An Equivalent Stochastic Integration Equation

Lemma 2.1

Proposition 2.1

3. GAUSSIAN PROCESS DRIVEN BY A LINEAR SDE AND CONNECTION TO CHEBYSHEV SPLINES

3.1. Gaussian Process Driven by the Linear SDE (8)

Lemma 3.1

Theorem3.1

Remark 3.1

3.2. Construction of a Chebyshev Spline Model

Theorem 3.2

Remark 3.2

Proposition 3.1

Remark 3.3

Remark 3.4

Remark 3.5

3.3. Ornstein–Uhlenbeck Process and Exponential Spline

3.3.1. Exponential spline

3.3.2. Partial exponential spline

3.4. Logistic Spline and Its Corresponding SDM

Remark 3.6

Remark 3.7

4. EXTENDED STOCHASTIC DYNAMIC MODELS AND THEIR CONNECTIONS TO CHEBYSHEV SPLINES

4.1. General Stochastic Dynamic Models

Proposition 4.1

4.2. Gaussian Models Driven by a Linear SDE

Theorem 4.1

4.3. Connection to Chebyshev Splines

Theorem 4.2

4.4. Chebyshev Splines to Stochastic Dynamic Models

5. APPLICATIONS

Figure 1.

6. DISCUSSION

Supplementary Material

ACKNOWLEDGEMENTS

BIBLIOGRAPHY

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases