Entropy. 2021 Jun 17;23(6):765. doi: 10.3390/e23060765

Statistical Inference for Periodic Self-Exciting Threshold Integer-Valued Autoregressive Processes

Congmin Liu 1, Jianhua Cheng 1, Dehui Wang 2,*
Editor: Christian H Weiss
PMCID: PMC8234043  PMID: 34204491

Abstract

This paper considers periodic self-exciting threshold integer-valued autoregressive processes under the weaker condition that the innovations have a finite second moment, instead of a fully specified innovation distribution. The basic statistical properties of the model are discussed, quasi-likelihood inference for the parameters is investigated, and the asymptotic behavior of the estimators is obtained. Threshold estimates based on quasi-likelihood and least squares methods are given. Simulation studies show that the quasi-likelihood methods perform well with realistic sample sizes and may be superior to least squares and maximum likelihood methods. The practical application of the processes is illustrated by a time series dataset concerning the monthly counts of claimants collecting short-term disability benefits from the Workers’ Compensation Board (WCB). In addition, the forecasting problem for this dataset is addressed.

Keywords: periodic autoregression, integer-valued threshold models, parameter estimation

1. Introduction

There has been considerable interest in integer-valued time series because of their wide range of applications, including epidemiology, finance, and disease modeling. Examples of such data include the number of major earthquakes worldwide per year, monthly crime counts in a particular country or region, and monthly patient numbers in a hospital over a period of time. Following the first-order integer-valued autoregressive (INAR(1)) models introduced by Al-Osh and Alzaid [1], INAR models have been widely used; see Du and Li [2], Jung et al. [3], Weiß [4], Ristić et al. [5], Zhang et al. [6], Li et al. [7], Kang et al. [8] and Yu et al. [9], among others. However, for so-called piecewise phenomena, such as high thresholds, sudden bursts of large values, and time-varying volatility, the INAR model does not work well. Threshold models (Tong [10]; Tong and Lim [11]) have attracted much attention and have been widely used to model nonlinear phenomena. To capture piecewise behavior in integer-valued time series, Monteiro et al. [12] introduced a class of self-exciting threshold integer-valued autoregressive (SETINAR) models driven by independent Poisson-distributed random variables. Wang et al. [13] proposed a self-excited threshold Poisson autoregressive (SETPAR) model. Yang et al. [14] considered a class of SETINAR processes that capture flexible asymmetric and nonlinear responses without assuming distributions for the errors. Yang et al. [15] introduced an integer-valued threshold autoregressive process based on a negative binomial thinning operator (NBTINAR(1)).

In addition, many business, economic and meteorological time series exhibit a periodically varying pattern that repeats itself after a regular period of time, possibly driven by seasonal factors and human activities. To deal with processes exhibiting periodic patterns, Bennett [16] and Gladyshev [17] proposed periodically correlated random processes. Then, Bentarzi and Hallin [18], Lund and Basawa [19], Basawa and Lund [20], and Shao [21], among other authors, studied the periodic autoregressive moving-average (PARMA) models in some detail. To capture periodic behavior in integer-valued time series, Monteiro et al. [22] proposed the periodic integer-valued autoregressive model of order one (PINAR(1)) with period T, driven by a periodic sequence of independent Poisson-distributed random variables. Hall et al. [23] considered the extremal behavior of periodic integer-valued moving-average sequences. Santos et al. [24] introduced a multivariate PINAR model with time-varying parameters. The analysis of periodic self-exciting threshold integer-valued autoregressive (PSETINAR$(2;1,1)_T$) processes was introduced by Pereira et al. [25]. Manaa and Bentarzi [26] established the existence of higher-order moments and strict periodic stationarity for the PSETINAR$(2;1,1)_T$ processes; the CLS and CML methods were applied to estimate the parameters, using the nested sub-sample search (NeSS) algorithm proposed by Li and Tong [27] to estimate the periodic threshold parameters. A drawback of this PSETINAR$(2;1,1)_T$ model is that the mean and variance of the Poisson distribution are equal, which does not always hold for real data. Therefore, in this paper, we remove the Poisson assumption, specify only the relationship between the mean and variance of the observations, develop quasi-likelihood inference for the PSETINAR$(2;1,1)_T$ processes, and consider the estimation of the thresholds.

Quasi-likelihood is a non-parametric inference method proposed by Wedderburn [28]. It is very useful when exact distributional information is not available and only the relation between the mean and variance of the observations is given, and it enjoys a certain robustness of validity. Quasi-likelihood has been widely applied. For example, Azrak and Mélard [29] proposed a simple and efficient algorithm to evaluate the exact quasi-likelihood of ARMA models with time-dependent coefficients; Christou and Fokianos [30] studied probabilistic properties and quasi-likelihood estimation for negative binomial time series models; Li et al. [31] studied quasi-likelihood inference for the self-exciting threshold integer-valued autoregressive (SETINAR(2,1)) processes under a weaker condition; and Yang et al. [32] modeled overdispersed or underdispersed count data with generalized Poisson integer-valued autoregressive (GPINAR(1)) processes and investigated the maximum quasi-likelihood estimators.

The remainder of this paper is organized as follows. In Section 2, we redefine the PSETINAR$(2;1,1)_T$ processes under weak conditions and discuss their basic properties. In Section 3, we consider quasi-likelihood inference for the unknown parameters; threshold estimation is also discussed. Section 4 presents simulation results for the estimates. In Section 5, we give an application of the proposed processes to a real dataset and address the forecasting problem for this dataset. Concluding remarks are given in Section 6. All proofs are collected in Appendix A.

2. The Model and Its Properties

The periodic self-exciting threshold integer-valued autoregressive model of order one with two regimes, PSETINAR$(2;1,1)_T$ (originally proposed by Pereira et al. [25] and further studied by Manaa and Bentarzi [26]), is defined by the recursive equation:

$$X_t=\begin{cases}\alpha_t^{(1)}\circ X_{t-1}+Z_t,& X_{t-1}\le r_t,\\ \alpha_t^{(2)}\circ X_{t-1}+Z_t,& X_{t-1}> r_t,\end{cases}\qquad t\in\mathbb{Z}, \tag{1}$$

with threshold parameters $r_t=r_j$, autoregressive coefficients $\alpha_t^{(k)}=\alpha_j^{(k)}\in(0,1)$, for $k=1,2$, $t=j+sT$, $j=1,2,\dots,T$, $s\in\mathbb{Z}$, and $T\in\mathbb{N}_0$. Note that Equation (1) admits the representation

$$X_{j+sT}=\alpha_j^{(1)}\circ X_{j+sT-1}I_{j+sT-1}^{(1)}+\alpha_j^{(2)}\circ X_{j+sT-1}I_{j+sT-1}^{(2)}+Z_{j+sT}, \tag{2}$$

where

  • (i) $I_{j+sT-1}^{(1)}:=I\{X_{j+sT-1}\le r_j\}$ and $I_{j+sT-1}^{(2)}:=1-I_{j+sT-1}^{(1)}=I\{X_{j+sT-1}> r_j\}$, in which $\{r_j,\ j=1,2,\dots,T\}$ is a set of threshold values;

  • (ii) the thinning operator “$\circ$” is defined as

    $$\alpha_j^{(k)}\circ X_{j+sT-1}=\sum_{i=1}^{X_{j+sT-1}}U_i^{j+sT}\bigl(\alpha_j^{(k)}\bigr), \tag{3}$$

    in which $\{U_i^{j+sT}(\alpha_j^{(k)}),\ j=1,2,\dots,T,\ s\in\mathbb{Z}\}$ is a sequence of independent periodic Bernoulli random variables with $P\bigl(U_i^{j+sT}(\alpha_j^{(k)})=1\bigr)=1-P\bigl(U_i^{j+sT}(\alpha_j^{(k)})=0\bigr)=\alpha_j^{(k)}$, $k=1,2$;

  • (iii) $\{Z_{j+sT},\ j=1,2,\dots,T,\ s\in\mathbb{Z}\}$ constitutes a sequence of independent periodic random variables with $E(Z_{j+sT})=\lambda_j$ and $\operatorname{Var}(Z_{j+sT})=\sigma_{z,j}^2$, which is assumed to be independent of $\{X_{j+sT-1}\}$ and $\{\alpha_j^{(k)}\circ X_{j+sT-1}\}$.

Remark 1.

The innovation of the PSETINAR$(2;1,1)_T$ process defined by Pereira et al. [25] and Manaa and Bentarzi [26] is a sequence of independent periodic Poisson-distributed random variables with mean $\lambda_j$, that is, $\{Z_t\}\sim P(\lambda_j)$, where $t=j+sT$, $j=1,2,\dots,T$, $s\in\mathbb{Z}$. In this paper, we assume only $E(Z_{j+sT})=\lambda_j$ and $\operatorname{Var}(Z_{j+sT})=\sigma_{z,j}^2$ instead of a periodic Poisson distribution for $\{Z_{j+sT}\}$, so that the model is more flexible.
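To make the data-generating mechanism in (2) and (3) concrete, the following minimal simulation sketch draws a PSETINAR$(2;1,1)_T$ path using binomial thinning, assuming Poisson innovations (the Model I setting of Section 4); the function name and defaults are illustrative and not part of the original specification.

```python
# Minimal simulation sketch of the PSETINAR(2;1,1)_T recursion (2), assuming
# Poisson innovations as in Model I; names and defaults are illustrative only.
import numpy as np

rng = np.random.default_rng(0)

def simulate_psetinar(alpha1, alpha2, lam, r, N, x0=0):
    """alpha1, alpha2, lam, r: length-T arrays of seasonal parameters.

    Returns X_0, X_1, ..., X_{NT}; the season of time t is ((t - 1) % T) + 1.
    """
    T = len(lam)
    x = np.empty(N * T + 1, dtype=np.int64)
    x[0] = x0
    for t in range(1, N * T + 1):
        j = (t - 1) % T                     # 0-based season index
        a = alpha1[j] if x[t - 1] <= r[j] else alpha2[j]
        # binomial thinning a ∘ X_{t-1}, plus the periodic innovation
        x[t] = rng.binomial(x[t - 1], a) + rng.poisson(lam[j])
    return x

# Example: the T = 2 setting used for Figure 1, beta = (0.2,0.1,3,0.8,0.1,7), r = (8,4)
x = simulate_psetinar([0.2, 0.8], [0.1, 0.1], [3.0, 7.0], [8, 4], N=50)
```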

The following proposition establishes the conditional mean and the conditional variance of the PSETINAR$(2;1,1)_T$ process, which play an important role in the study of the process properties and in parameter estimation.

Proposition 1.

For any fixed $j=1,2,\dots,T$, with $T\in\mathbb{N}_0$, the conditional mean and the conditional variance of the process $\{X_t\}$ for $t=j+sT$ and $s\in\mathbb{Z}$ defined in (2) are given by

  • (i) $E\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=\alpha_j^{(1)}X_{j+sT-1}I_{j+sT-1}^{(1)}+\alpha_j^{(2)}X_{j+sT-1}I_{j+sT-1}^{(2)}+\lambda_j$,

  • (ii) $\operatorname{Var}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=\sum_{k=1}^{2}\alpha_j^{(k)}\bigl(1-\alpha_j^{(k)}\bigr)X_{j+sT-1}I_{j+sT-1}^{(k)}+\sigma_{z,j}^2$.

The following theorem states the ergodicity of the PSETINAR$(2;1,1)_T$ process (2). This property is useful in deriving the asymptotic properties of the parameter estimators.

Theorem 1.

For any fixed $j=1,2,\dots,T$, with $T\in\mathbb{N}_0$, the process $\{X_t\}$ for $t=j+sT$ and $s\in\mathbb{Z}$ defined in (2) is an ergodic Markov chain.

3. Parameters Estimation

Suppose we have a series of observations $\{X_{j+sT},\ j=1,2,\dots,T,\ s\in\mathbb{N}_0\}$ generated from the PSETINAR$(2;1,1)_T$ process. The goal of this section is to estimate the unknown parameter vector $\beta=(\beta_1,\dots,\beta_{3T})^\top\equiv(\alpha_1^{(1)},\alpha_1^{(2)},\lambda_1,\alpha_2^{(1)},\alpha_2^{(2)},\lambda_2,\dots,\alpha_T^{(1)},\alpha_T^{(2)},\lambda_T)^\top$ and the threshold parameter vector $r=(r_1,r_2,\dots,r_T)^\top$. This section is divided into two subsections. In Section 3.1, we estimate β by the maximum quasi-likelihood (MQL) method when the threshold values are known; in Section 3.2, we consider the MQL and conditional least squares (CLS) estimators of the thresholds r.

3.1. Estimation of Parameters β

As described in Proposition 1(ii), the variance of $X_t$ conditional on $X_{t-1}$ has an explicit form. Let $\theta_j\equiv(\theta_j^{(1)},\theta_j^{(2)},\sigma_{z,j}^2)$ with $\theta_j^{(k)}=\alpha_j^{(k)}(1-\alpha_j^{(k)})$, $k=1,2$, $j=1,2,\dots,T$; then $\operatorname{Var}(X_{j+sT}\mid X_{j+sT-1})$ admits the representation

$$V_{\theta_j}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=\operatorname{Var}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=\theta_j^{(1)}X_{j+sT-1}I_{j+sT-1}^{(1)}+\theta_j^{(2)}X_{j+sT-1}I_{j+sT-1}^{(2)}+\sigma_{z,j}^2,$$

for $j=1,2,\dots,T$, $s\in\mathbb{N}_0$.

As discussed in Wedderburn [28], we have the set of standard quasi-likelihood estimating equations:

$$\frac{\partial L(\beta)}{\partial\beta_i}=\sum_{s=0}^{N-1}\sum_{j=1}^{T}\frac{X_{j+sT}-E\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)}{V_{\theta_j}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)}\,\frac{\partial E\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)}{\partial\beta_i}=0, \tag{4}$$

for $i=1,\dots,3T$, where N is the total number of cycles. By solving (4), the quasi-likelihood estimator can be obtained.

This method is essentially a two-step estimation: if $\theta_j$ is unknown, we propose substituting a suitable consistent estimator of $\theta_j$ obtained by other means, which gives modified quasi-likelihood estimating equations that are then solved for the primary parameters of interest. In the modified quasi-likelihood estimating equations, we replace $\theta_j$ with a suitable consistent estimator $\hat\theta_j$. For simplicity of notation, we write $V_{\hat\theta_j}^{-1}=V_{\hat\theta_j}^{-1}(X_{j+sT}\mid X_{j+sT-1})$. This approach leads to the modified quasi-likelihood estimator $\hat\beta_{MQL}$ of β (see Zheng, Basawa and Datta [33]):

$$\hat\beta_{MQL}=Q_N^{-1}q_N, \tag{5}$$

where

$$Q_N=\begin{pmatrix}Q_{1,N}&0&\cdots&0\\0&Q_{2,N}&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&Q_{T,N}\end{pmatrix},\qquad q_N=\bigl(q_{1,N}^\top,q_{2,N}^\top,\dots,q_{T,N}^\top\bigr)^\top,$$

where the 0's are $3\times3$ null matrices, and $Q_{j,N}$ and $q_{j,N}$ $(j=1,2,\dots,T)$ are given by

$$Q_{j,N}=\begin{pmatrix}\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}^{2}I_{j+sT-1}^{(1)}&0&\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}I_{j+sT-1}^{(1)}\\0&\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}^{2}I_{j+sT-1}^{(2)}&\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}I_{j+sT-1}^{(2)}\\\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}I_{j+sT-1}^{(1)}&\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}I_{j+sT-1}^{(2)}&\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}\end{pmatrix},$$

$$q_{j,N}=\begin{pmatrix}\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT}X_{j+sT-1}I_{j+sT-1}^{(1)}\\\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT}X_{j+sT-1}I_{j+sT-1}^{(2)}\\\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT}\end{pmatrix}.$$

Note that we use the consistent estimator $\hat\theta_j=\bigl(\hat\alpha_j^{(1)}(1-\hat\alpha_j^{(1)}),\ \hat\alpha_j^{(2)}(1-\hat\alpha_j^{(2)}),\ \hat\sigma_{z,j}^2\bigr)$ in place of $\theta_j$.
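For illustration, the block-diagonal structure of (5) can be exploited directly: for each season j, $(\hat\alpha_j^{(1)},\hat\alpha_j^{(2)},\hat\lambda_j)$ solves the $3\times3$ linear system $Q_{j,N}b=q_{j,N}$. The sketch below assumes the series layout of the simulation sketch above and a preliminary $\hat\theta_j$; it is an illustration, not the authors' code.

```python
# Sketch of one 3x3 block of the MQL solve (5): season j's estimate solves
# Q_{j,N} b = q_{j,N}. theta_hat_j is a preliminary (theta^(1), theta^(2), sigma^2);
# the sketch assumes both regimes are visited (otherwise Q_{j,N} is singular).
import numpy as np

def mql_season(x, T, j, r_j, theta_hat_j):
    t = np.arange(1, len(x))
    t = t[(t - 1) % T == (j - 1)]            # times whose season is j (1-based j)
    xp, xc = x[t - 1].astype(float), x[t].astype(float)
    I1 = (xp <= r_j).astype(float)
    I2 = 1.0 - I1
    th1, th2, s2 = theta_hat_j
    w = 1.0 / (th1 * xp * I1 + th2 * xp * I2 + s2)   # V_{theta_j}^{-1} weights
    Q = np.array([[np.sum(w * xp**2 * I1), 0.0,                    np.sum(w * xp * I1)],
                  [0.0,                    np.sum(w * xp**2 * I2), np.sum(w * xp * I2)],
                  [np.sum(w * xp * I1),    np.sum(w * xp * I2),    np.sum(w)]])
    q = np.array([np.sum(w * xc * xp * I1),
                  np.sum(w * xc * xp * I2),
                  np.sum(w * xc)])
    return np.linalg.solve(Q, q)             # (alpha_j^(1), alpha_j^(2), lambda_j)
```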

Next, the following proposition gives consistent estimators $\hat\sigma_{z,j}^2$ of $\sigma_{z,j}^2$, which depend on consistent estimators $\hat\alpha_j^{(k)}$ and $\hat\lambda_j$ with $k=1,2$, $j=1,2,\dots,T$.

Proposition 2.

The following variance estimators for $\{Z_{j+sT}\}$, $j=1,2,\dots,T$, $s\in\mathbb{N}_0$, are consistent:

$$\text{(i)}\quad \hat\sigma_{1,z,j}^2=\frac{1}{N}\sum_{s=0}^{N-1}\Bigl(X_{j+sT}-\sum_{k=1}^{2}\hat\alpha_j^{(k)}X_{j+sT-1}I_{j+sT-1}^{(k)}-\hat\lambda_j\Bigr)^{2}-\frac{1}{N}\sum_{k=1}^{2}\sum_{s=0}^{N-1}\hat\alpha_j^{(k)}\bigl(1-\hat\alpha_j^{(k)}\bigr)X_{j+sT-1}I_{j+sT-1}^{(k)}, \tag{6}$$

$$\text{(ii)}\quad \hat\sigma_{2,z,j}^2=\hat\sigma_{x,j}^2-\hat p_j\Bigl[\bigl(\hat\alpha_j^{(1)}\bigr)^{2}\hat\sigma_j^{2}(1)+\hat\alpha_j^{(1)}\bigl(1-\hat\alpha_j^{(1)}\bigr)\hat\mu_j^{(1)}\Bigr]-\bigl(1-\hat p_j\bigr)\Bigl[\bigl(\hat\alpha_j^{(2)}\bigr)^{2}\hat\sigma_j^{2}(2)+\hat\alpha_j^{(2)}\bigl(1-\hat\alpha_j^{(2)}\bigr)\hat\mu_j^{(2)}\Bigr]-\hat p_j\bigl(1-\hat p_j\bigr)\Bigl(\hat\alpha_j^{(1)}\hat\mu_j^{(1)}-\hat\alpha_j^{(2)}\hat\mu_j^{(2)}\Bigr)^{2}, \tag{7}$$

for $k=1,2$, $j=1,2,\dots,T$, $s\in\mathbb{N}_0$, in which $\hat\alpha_j^{(k)}$ and $\hat\lambda_j$ are consistent estimators of $\alpha_j^{(k)}$ and $\lambda_j$ (for example, the CLS estimators given in Theorem 3.1 of Pereira et al. [25]); furthermore,

$$\bar X_j=\frac{1}{N}\sum_{s=0}^{N-1}X_{j+sT},\qquad \hat\sigma_{x,j}^2=\frac{1}{N}\sum_{s=0}^{N-1}\bigl(X_{j+sT}-\bar X_j\bigr)^{2},$$
$$N_j^{(k)}=\sum_{s=0}^{N-1}I_{j+sT-1}^{(k)},\qquad \hat\mu_j^{(k)}=\frac{1}{N_j^{(k)}}\sum_{s:\,I_{j+sT-1}^{(k)}=1}X_{j+sT},$$
$$\hat p_j=\frac{1}{N}\sum_{s=0}^{N-1}I_{j+sT-1}^{(1)},\qquad \hat\sigma_j^{2}(k)=\frac{1}{N_j^{(k)}}\sum_{s:\,I_{j+sT-1}^{(k)}=1}\bigl(X_{j+sT}-\hat\mu_j^{(k)}\bigr)^{2}.$$

The two estimators are based on the conditional variance $\operatorname{Var}(X_{j+sT}\mid X_{j+sT-1})$ and the variance $\operatorname{Var}(X_{j+sT})$, respectively. The details can be found in Appendix A.
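As a small illustration of estimator (6), the following sketch computes $\hat\sigma^2_{1,z,j}$ from residuals; the preliminary estimates a1, a2 and lam are hypothetical inputs assumed to be obtained elsewhere (e.g., by CLS).

```python
# Sketch of the residual-based variance estimator (6) for one season j,
# assuming preliminary consistent estimates a1, a2 (alphas) and lam (lambda_j).
import numpy as np

def sigma2_z_hat(x, T, j, r_j, a1, a2, lam):
    t = np.arange(1, len(x))
    t = t[(t - 1) % T == (j - 1)]
    xp, xc = x[t - 1].astype(float), x[t].astype(float)
    I1 = (xp <= r_j).astype(float)
    I2 = 1.0 - I1
    resid = xc - a1 * xp * I1 - a2 * xp * I2 - lam          # X - E(X | X_{-1})
    thin_var = a1 * (1 - a1) * xp * I1 + a2 * (1 - a2) * xp * I2
    return np.mean(resid**2) - np.mean(thin_var)
```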

To study the asymptotic behavior of the estimator $\hat\beta_{MQL}$, we make the following assumptions about the process $\{X_t\}$:

  • (C1) By Proposition 1 in Pereira et al. [25], we assume that $\{X_t\}$ is a strictly cyclostationary process;

  • (C2) $E|X_t|^4<\infty$.

Now, for the asymptotic properties of the quasi-likelihood estimator $\hat\beta_{MQL}$ given by (5), we have the following asymptotic distribution.

Theorem 2.

Let $\{X_t\}$ be a PSETINAR$(2;1,1)_T$ process defined in (2); then under assumptions (C1)–(C2), the estimator $\hat\beta_{MQL}$ given by (5) is asymptotically normal,

$$\sqrt N\bigl(\hat\beta_{MQL}-\beta\bigr)\xrightarrow{L}N\bigl(0,H^{-1}(\theta)\bigr),$$

where

$$H(\theta)=\begin{pmatrix}H_1(\theta)&0&\cdots&0\\0&H_2(\theta)&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&H_T(\theta)\end{pmatrix},$$

with matrices $H_j(\theta)$ $(j=1,2,\dots,T)$ given by

$$H_j(\theta)=\begin{pmatrix}E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}^{2}I_{j-1}^{(1)}\bigr]&0&E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}I_{j-1}^{(1)}\bigr]\\0&E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}^{2}I_{j-1}^{(2)}\bigr]&E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}I_{j-1}^{(2)}\bigr]\\E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}I_{j-1}^{(1)}\bigr]&E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}I_{j-1}^{(2)}\bigr]&E\bigl[V_{\theta_j}^{-1}(X_j\mid X_{j-1})\bigr]\end{pmatrix}.$$

It is worth mentioning that this theorem implies the consistency of the estimator $\hat\beta_{MQL}$.

3.2. Estimation of Thresholds Vector r

Note that in real data applications the threshold values are also unknown. In this subsection, we estimate the thresholds vector $r=(r_1,r_2,\dots,r_T)$. We adapt the nested sub-sample search (NeSS) algorithm (see, e.g., Yang et al. [15], Li and Tong [27], and Li et al. [31]) and use the conditional least squares (CLS) and modified quasi-likelihood (MQL) principles to estimate r.

For some fixed $\lambda=(\lambda_1,\lambda_2,\dots,\lambda_T)$, the application of the conditional least squares principle yields the sum of squared errors:

$$S_N(r,\lambda)=\sum_{s=0}^{N-1}\sum_{j=1}^{T}\left(X_{j+sT}-\sum_{k=1}^{2}\frac{\sum_{s=0}^{N-1}\bigl(X_{j+sT}X_{j+sT-1}I_{j+sT-1}^{(k)}-\lambda_j X_{j+sT-1}I_{j+sT-1}^{(k)}\bigr)}{\sum_{s=0}^{N-1}X_{j+sT-1}^{2}I_{j+sT-1}^{(k)}}\,X_{j+sT-1}I_{j+sT-1}^{(k)}-\lambda_j\right)^{2},$$

and then the thresholds vector r can be estimated by minimizing $S_N(r,\lambda)$,

$$\hat r=\mathop{\arg\min}_{r\in[\underline r,\,\bar r]}S_N(r,\lambda), \tag{8}$$

where $\underline r$ and $\bar r$ are known lower and upper bounds of r. In practice, they can be selected as the minimum and maximum values in each cycle of the sample. For convenience, we consider the alternative objective function

$$J_N(r,\lambda)=S_N-S_N(r,\lambda),$$

where

$$S_N=\sum_{s=0}^{N-1}\sum_{j=1}^{T}\left(X_{j+sT}-\frac{\sum_{s=0}^{N-1}\bigl(X_{j+sT}X_{j+sT-1}-\lambda_j X_{j+sT-1}\bigr)}{\sum_{s=0}^{N-1}X_{j+sT-1}^{2}}\,X_{j+sT-1}-\lambda_j\right)^{2}.$$

Now, the optimization in (8) is equivalent to

$$\hat r_{CLS}=\mathop{\arg\max}_{r\in[\underline r,\,\bar r]}J_N(r,\lambda), \tag{9}$$

where $\hat r_{CLS}$ is the conditional least squares estimator of the thresholds vector r.

Inspired by the conditional least squares method, we also investigate the estimation of r by the quasi-likelihood principle. The modified quasi-likelihood estimator $\hat r_{MQL}$ of r is obtained by maximizing

$$\tilde J_N(r,\lambda)=\tilde S_N-\tilde S_N(r,\lambda),$$

which yields

$$\hat r_{MQL}=\mathop{\arg\max}_{r\in[\underline r,\,\bar r]}\tilde J_N(r,\lambda), \tag{10}$$

where

$$\tilde S_N(r,\lambda)=\sum_{s=0}^{N-1}\sum_{j=1}^{T}V_{\hat\theta_j}^{-1}\left(X_{j+sT}-\sum_{k=1}^{2}\frac{\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}\bigl(X_{j+sT}X_{j+sT-1}I_{j+sT-1}^{(k)}-\lambda_j X_{j+sT-1}I_{j+sT-1}^{(k)}\bigr)}{\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}^{2}I_{j+sT-1}^{(k)}}\,X_{j+sT-1}I_{j+sT-1}^{(k)}-\lambda_j\right)^{2},$$

and

$$\tilde S_N=\sum_{s=0}^{N-1}\sum_{j=1}^{T}V_{\hat\theta_j}^{-1}\left(X_{j+sT}-\frac{\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}\bigl(X_{j+sT}X_{j+sT-1}-\lambda_j X_{j+sT-1}\bigr)}{\sum_{s=0}^{N-1}V_{\hat\theta_j}^{-1}X_{j+sT-1}^{2}}\,X_{j+sT-1}-\lambda_j\right)^{2}.$$

It is worth mentioning that the parameters $\lambda_j$, $j=1,\dots,T$, are unknown when we use (9) and (10) to estimate the thresholds vector r. As argued in Li and Tong [27], Yang et al. [14], and Yang et al. [15], when λ and r are one-dimensional parameters, any positive number can be chosen as the value of λ without risking a wrong result for $\hat r$. Fortunately, we also find by simulation that the estimates of r obtained by maximizing $\tilde J_N(r,\lambda)$ and $J_N(r,\lambda)$ do not depend on the value of λ. To give an intuitive impression of $\tilde J_N(r,\lambda)/N$, we generate a dataset from Model I (given in Section 4, i.e., $T=2$, $N=50$, $\beta=(0.2,0.1,3,0.8,0.1,7)$, $r=(8,4)$) and plot the shapes of $\tilde J_N(r,\lambda)/N$. From Figure 1, we can see that the shape of $\tilde J_N(r,\lambda)/N$ changes with λ, but the maximum in each subfigure is attained at the true thresholds vector $r=(8,4)$. In practice, we can take the mean of each cycle of the samples as $\lambda_j$, $j=1,2,\dots,T$.

Figure 1. The shapes of $\tilde J_N(r,\lambda)/N$.

Using the quasi-likelihood method to estimate the thresholds is in fact a three-step estimation procedure; we now present the algorithm used to implement it (a grid-search sketch of the threshold objective follows the steps):

  • Step 1: Choose the upper bound $\bar r$ and lower bound $\underline r$ of r, and solve (9) to get $\hat r_{CLS}$ with $\lambda_j=\bar X_j=\frac{1}{N}\sum_{s=0}^{N-1}X_{j+sT}$, $j=1,2,\dots,T$;

  • Step 2: Fix $\hat r_{CLS}$ at its current value and solve (6) or (7) to get $\hat\sigma_{z,j}^2$, $j=1,2,\dots,T$, where $\alpha_j^{(k)}$ and $\lambda_j$ with $k=1,2$ can be estimated by other methods; then solve (5) to get $\hat\beta_{MQL}$;

  • Step 3: Fix $\hat\theta_j=\bigl(\hat\alpha_{j,MQL}^{(1)}(1-\hat\alpha_{j,MQL}^{(1)}),\ \hat\alpha_{j,MQL}^{(2)}(1-\hat\alpha_{j,MQL}^{(2)}),\ \hat\sigma_{z,j}^2\bigr)$, $j=1,2,\dots,T$, at its estimated value from Step 2, choose the same bounds $\bar r$ and $\underline r$ as in Step 1, and solve (10) to get $\hat r_{MQL}$.
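Because $S_N(r,\lambda)$ decomposes over seasons, the search in Step 1 reduces to T one-dimensional searches. The following sketch uses an exhaustive grid rather than the full NeSS refinement, with the profiled $\alpha^{(k)}$ read off from the expression of $S_N(r,\lambda)$; it is illustrative only.

```python
# Grid-search sketch for Step 1: S_N(r, lambda) decomposes over seasons, so each
# r_j can be found by a one-dimensional search; exhaustive grid instead of NeSS.
import numpy as np

def sse_season(x, T, j, r_j, lam_j):
    t = np.arange(1, len(x))
    t = t[(t - 1) % T == (j - 1)]
    xp, xc = x[t - 1].astype(float), x[t].astype(float)
    fit = np.full(len(xc), lam_j)
    for reg in (xp <= r_j, xp > r_j):
        denom = np.sum(xp[reg] ** 2)
        if denom > 0:                        # profiled alpha^(k) for this regime
            alpha = np.sum((xc[reg] - lam_j) * xp[reg]) / denom
            fit[reg] += alpha * xp[reg]
    return np.sum((xc - fit) ** 2)

def r_hat_cls_season(x, T, j):
    t = np.arange(1, len(x))
    t = t[(t - 1) % T == (j - 1)]
    lam_j = x[t].mean()                      # seasonal sample mean, as in Step 1
    lo, hi = int(x[t - 1].min()), int(x[t - 1].max())
    # minimizing the SSE over the grid is equivalent to maximizing J_N(r, lambda)
    return min(range(lo, hi + 1), key=lambda r_j: sse_season(x, T, j, r_j, lam_j))
```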

4. Simulation Study

In this section, we conduct simulation studies to illustrate the finite-sample performance of the estimates. The initial value $X_0$ is fixed at 0. To illustrate the characteristics of data from the PSETINAR$(2;1,1)_T$ process, we first generate a dataset with the innovation distribution of Model I (defined below) and parameters $\beta=(0.2,0.45,1,\ 0.2,0.45,2,\ 0.65,0.45,1,\ 0.65,0.45,2,\ 0.2,0.45,3,\ 0.2,0.45,7,\ 0.8,0.45,7,\ 0.2,0.1,3,\ 0.8,0.1,7,\ 0.2,0.1,7,\ 0.8,0.45,2)$, $r=(3,3,3,1,3,3,5,9,3,6,7)$, $T=11$, $N=50$. The parameter vectors are randomly selected, with slight differences between the parameters of each cycle; the thresholds vector r was chosen so that there are enough data in each regime. Figure 2 shows the sample path of the first six cycles ($N=6$). We can see that, even with slight differences between the parameters of each cycle, the dataset still exhibits periodic characteristics.

Figure 2. Sample path of the first six cycles.

To report the performances of the estimates, we conduct simulation studies under the following three models:

Model I. Assume that $\{Z_t\}$ is a sequence of i.i.d. periodic Poisson-distributed random variables with mean $E(Z_t)=\operatorname{Var}(Z_t)=\lambda_j$ for $t=j+sT$, $j=1,2,\dots,T$, $s\in\mathbb{N}_0$.

Model II. Assume that $\{Z_t\}$ is a sequence of i.i.d. periodic geometric-distributed random variables with p.m.f. given by

$$P\bigl(Z_{j+sT}=z\bigr)=\frac{\lambda_j^{z}}{(1+\lambda_j)^{1+z}},\qquad z=0,1,2,\dots,$$

with $E(Z_t)=\lambda_j$, $\operatorname{Var}(Z_t)=\lambda_j(1+\lambda_j)$ for $t=j+sT$, $j=1,2,\dots,T$, $s\in\mathbb{N}_0$.

Model III. Assume that $\{Z_t\}$ is a sequence of i.i.d. mixture-distributed random variables,

$$Z_t=\Delta_t Z_{1t}+(1-\Delta_t)Z_{2t},$$

where $\{\Delta_t\}$ is a sequence of i.i.d. periodic Bernoulli-distributed random variables with $P(\Delta_t=1)=1-P(\Delta_t=0)=\rho_j$, $\rho=(\rho_1,\rho_2,\dots,\rho_T)$, for $t=j+sT$, $j=1,2,\dots,T$, $s\in\mathbb{N}_0$, which is independent of $\{Z_{it}\}$, $i=1,2$.

With $\{Z_{1t}\}$ as given in Model I and $\{Z_{2t}\}$ as given in Model II, one easily checks that $E(Z_t)=\lambda_j$ and $\operatorname{Var}(Z_t)=\lambda_j^2(1-\rho_j)+\lambda_j$.
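For concreteness, innovations from the three models can be drawn as follows; this is a sketch, and it uses the fact that the geometric p.m.f. above is NumPy's geometric law with success probability $1/(1+\lambda_j)$, shifted to start at zero.

```python
# Illustrative samplers for the innovation models I-III; the geometric pmf
# lambda^z / (1 + lambda)^(1+z) is numpy's geometric with p = 1/(1+lambda),
# shifted from support {1,2,...} to {0,1,...}.
import numpy as np

rng = np.random.default_rng(1)

def draw_innovation(model, lam, rho=None):
    if model == "I":                   # Poisson: mean = variance = lam
        return rng.poisson(lam)
    if model == "II":                  # geometric: mean lam, variance lam(1+lam)
        return rng.geometric(1.0 / (1.0 + lam)) - 1
    if model == "III":                 # mixture: mean lam, variance lam^2(1-rho)+lam
        if rng.random() < rho:
            return rng.poisson(lam)
        return rng.geometric(1.0 / (1.0 + lam)) - 1
    raise ValueError("model must be 'I', 'II' or 'III'")
```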

For each model, we generate the data with $X_0=0$, set $T=3$, and use sample sizes $n=NT=150,300,900$. All calculations are performed in R 3.6.2 with 1000 replications. We use the command constrOptim to optimize the objective function of the maximum likelihood estimation. The threshold vector is calculated by the algorithms discussed in Section 3.2; the other estimators are based on the explicit expressions.

4.1. Performance of $\hat\beta_{CLS}$, $\hat\beta_{MQL}$ and $\hat\beta_{CML}$

Pereira et al. [25] provided a theoretical basis for the conditional least squares (CLS) and conditional maximum likelihood (CML) estimators of the parameter vector β in the PSETINAR$(2;1,1)_T$ process but did not conduct a simulation study. Manaa and Bentarzi [26] provided the asymptotic properties of the estimators and compared their performance through a simulation study. To compare the performance of the three estimators $\hat\beta_{CLS}$, $\hat\beta_{CML}$ and $\hat\beta_{MQL}$ (given in Section 3), we conduct simulation studies under Models I to III. The parameters are selected as follows:

Series A. $\beta=(0.2,0.45,1,\ 0.2,0.45,2,\ 0.8,0.45,2)$, $r=(3,2,2)$.

Series B. $\beta=(0.65,0.45,1,\ 0.65,0.45,2,\ 0.35,0.45,2)$, $r=(2,2,3)$.

Series C. $\beta=(0.2,0.45,3,\ 0.2,0.45,7,\ 0.8,0.45,7)$, $r=(12,7,9)$.

To isolate the influence of parameter changes on the estimates, the series are chosen at random and the parameters are varied with $\alpha^{(k)}$, $k=1,2$, or λ held fixed separately. The thresholds are selected so that there are enough data in each regime.

Spectral analysis begins with the search for hidden periodicities and is an important part of frequency-domain time series analysis. The standard frequency-domain approach for detecting hidden periods is the periodogram method, proposed by Schuster [34]; a rigorous treatment is given in Fisher [35]. For a series of observations $\{X_t\}$, $t=1,2,\dots,n$, the periodogram is defined as

$$I_n(f_k)=\frac{1}{n}\Bigl|\sum_{t=1}^{n}X_t e^{-i2\pi f_k t}\Bigr|^{2}=a_k^{2}+b_k^{2}, \tag{11}$$

where

$$a_k=\begin{cases}\dfrac{1}{\sqrt n}\displaystyle\sum_{t=1}^{n}X_t\cos(2\pi f_k t),&k=1,2,\dots,\bigl[\tfrac{n-1}{2}\bigr],\\[2mm]\dfrac{1}{\sqrt n}\displaystyle\sum_{t=1}^{n}(-1)^{t}X_t,&k=\tfrac{n}{2},\end{cases}\qquad b_k=\begin{cases}\dfrac{1}{\sqrt n}\displaystyle\sum_{t=1}^{n}X_t\sin(2\pi f_k t),&k=1,2,\dots,\bigl[\tfrac{n-1}{2}\bigr],\\[2mm]0,&k=\tfrac{n}{2},\end{cases}$$

and the period is $T=\bigl[1/\arg\max_{f_k}I_n(f_k)\bigr]$, where $[\cdot]$ denotes the integer part of a number.

The sample path and periodogram of the Series A, B and C under Model I are plotted in Figure 3 to show the periodic characteristics. Because the period is three and short, it is difficult to see the period from the sample path, but the periodogram can show the period very well. In addition, the simulation results are summarized in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8 and Table 9.

Figure 3. The sample path and periodogram of Series A (top), B (middle) and C (bottom) in Model I.

Table 1.

Bias and MSE for Series A of Model I (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS 0.001 −0.001 0.001 −0.018 −0.004 0.006 0.008 0.005 −0.025
(0.052) (0.014) (0.253) (0.131) (0.024) (0.230) (0.160) (0.024) (0.326)
MQL 0.000 −0.002 0.006 −0.015 −0.004 0.002 0.011 0.006 −0.030
(0.054) (0.014) (0.266) (0.126) (0.023) (0.220) (0.156) (0.024) (0.316)
CML 0.024 0.010 −0.047 0.054 0.019 −0.079 0.003 0.007 −0.027
(0.024) (0.008) (0.117) (0.062) (0.016) (0.126) (0.047) (0.013) (0.134)
100 CLS 0.004 0.000 −0.006 0.013 −0.001 −0.005 0.002 −0.003 0.008
(0.026) (0.007) (0.132) (0.058) (0.011) (0.108) (0.085) (0.012) (0.168)
MQL 0.004 0.000 −0.006 0.013 −0.001 −0.006 −0.001 −0.004 0.012
(0.024) (0.007) (0.120) (0.057) (0.011) (0.105) (0.082) (0.011) (0.162)
CML 0.012 0.004 −0.023 0.036 0.007 −0.034 0.003 0.000 −0.001
(0.014) (0.004) (0.067) (0.036) (0.008) (0.073) (0.024) (0.006) (0.066)
300 CLS −0.003 −0.002 0.009 0.002 0.000 −0.005 −0.002 0.000 −0.001
(0.010) (0.003) (0.051) (0.020) (0.004) (0.034) (0.028) (0.004) (0.055)
MQL −0.002 −0.001 0.007 0.001 0.000 −0.004 −0.003 0.000 0.000
(0.009) (0.002) (0.045) (0.019) (0.004) (0.033) (0.027) (0.003) (0.053)
CML 0.000 0.000 0.000 0.003 0.001 −0.007 0.001 0.002 −0.006
(0.005) (0.001) (0.025) (0.014) (0.003) (0.024) (0.007) (0.002) (0.020)

Table 2.

Bias and MSE for Series B of Model I (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS 0.009 0.001 0.003 0.014 0.003 −0.015 −0.013 −0.009 0.032
(0.119) (0.015) (0.238) (0.166) (0.031) (0.365) (0.105) (0.026) (0.525)
MQL 0.010 0.001 0.003 0.013 0.003 −0.014 −0.012 −0.009 0.031
(0.129) (0.015) (0.241) (0.161) (0.030) (0.354) (0.104) (0.026) (0.516)
CML 0.006 0.003 0.001 0.014 0.006 −0.020 0.008 0.003 −0.019
(0.043) (0.007) (0.090) (0.062) (0.016) (0.150) (0.045) (0.014) (0.229)
100 CLS 0.007 0.000 −0.001 −0.022 −0.009 0.042 −0.004 −0.002 0.003
(0.061) (0.008) (0.133) (0.076) (0.014) (0.173) (0.046) (0.012) (0.222)
MQL 0.008 0.000 −0.003 −0.023 −0.010 0.044 −0.004 −0.002 0.003
(0.055) (0.007) (0.116) (0.076) (0.014) (0.172) (0.045) (0.012) (0.216)
CML 0.002 0.000 −0.001 −0.004 −0.001 0.013 0.000 0.001 −0.008
(0.018) (0.003) (0.040) (0.031) (0.008) (0.078) (0.027) (0.007) (0.127)
300 CLS 0.003 0.000 −0.003 0.002 0.000 0.002 −0.003 −0.001 −0.001
(0.020) (0.003) (0.043) (0.026) (0.005) (0.060) (0.017) (0.004) (0.081)
MQL 0.003 −0.001 −0.002 0.001 0.000 0.004 −0.002 0.000 −0.004
(0.019) (0.002) (0.039) (0.025) (0.005) (0.058) (0.016) (0.004) (0.077)
CML −0.002 −0.002 0.003 0.003 0.001 −0.001 −0.003 0.000 −0.002
(0.006) (0.001) (0.014) (0.009) (0.002) (0.025) (0.009) (0.003) (0.043)

Table 3.

Bias and MSE for Series C of Model I (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS −0.013 −0.010 0.146 −0.010 −0.003 0.053 −0.010 −0.007 0.054
(0.022) (0.011) (2.088) (0.082) (0.022) (1.915) (0.078) (0.026) (3.823)
MQL −0.010 −0.008 0.117 −0.010 −0.003 0.052 −0.014 −0.009 0.079
(0.022) (0.010) (2.000) (0.082) (0.021) (1.913) (0.075) (0.025) (3.709)
CML 0.003 0.001 −0.015 0.044 0.021 −0.201 0.003 0.000 −0.033
(0.012) (0.006) (1.119) (0.044) (0.013) (1.054) (0.025) (0.010) (1.286)
100 CLS 0.001 −0.002 0.015 −0.003 0.001 0.013 0.002 −0.003 0.022
(0.014) (0.006) (1.323) (0.043) (0.011) (1.046) (0.038) (0.012) (1.772)
MQL 0.000 −0.003 0.034 −0.002 0.001 0.008 0.001 −0.003 0.027
(0.012) (0.006) (1.203) (0.042) (0.011) (1.027) (0.037) (0.012) (1.726)
CML 0.006 0.001 −0.029 0.018 0.010 −0.085 0.011 0.003 −0.043
(0.007) (0.003) (0.672) (0.026) (0.007) (0.657) (0.012) (0.005) (0.620)
300 CLS 0.000 0.000 0.006 0.002 0.002 −0.014 0.006 0.003 −0.040
(0.006) (0.003) (0.586) (0.014) (0.004) (0.350) (0.013) (0.004) (0.606)
MQL 0.001 0.000 0.002 0.001 0.001 −0.010 0.005 0.002 −0.032
(0.005) (0.002) (0.527) (0.014) (0.004) (0.341) (0.012) (0.004) (0.589)
CML 0.002 0.001 −0.013 0.005 0.003 −0.030 0.003 0.002 −0.026
(0.003) (0.001) (0.262) (0.011) (0.003) (0.267) (0.004) (0.002) (0.201)

Table 4.

Bias and MSE for Series A of Model II (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS −0.013 −0.005 0.021 −0.016 −0.014 0.024 −0.025 −0.011 0.044
(0.073) (0.011) (0.247) (0.291) (0.032) (0.408) (0.330) (0.024) (0.449)
MQL −0.011 −0.005 0.019 −0.012 −0.013 0.019 −0.020 −0.010 0.040
(0.067) (0.011) (0.228) (0.287) (0.032) (0.402) (0.330) (0.024) (0.439)
CML 0.014 0.005 −0.026 0.041 0.011 −0.050 0.004 0.008 −0.016
(0.016) (0.005) (0.076) (0.040) (0.010) (0.158) (0.020) (0.007) (0.153)
100 CLS 0.003 0.003 −0.011 −0.013 −0.011 0.027 −0.002 0.001 −0.005
(0.032) (0.005) (0.116) (0.145) (0.016) (0.195) (0.170) (0.012) (0.219)
MQL 0.001 0.002 −0.006 −0.011 −0.010 0.024 −0.001 0.001 −0.006
(0.030) (0.005) (0.104) (0.143) (0.016) (0.194) (0.169) (0.011) (0.215)
CML 0.006 0.004 −0.014 0.020 0.006 −0.021 0.005 0.004 −0.019
(0.007) (0.002) (0.039) (0.022) (0.005) (0.080) (0.011) (0.003) (0.072)
300 CLS 0.001 0.000 −0.005 −0.003 0.000 0.009 −0.001 0.000 −0.001
(0.011) (0.002) (0.039) (0.050) (0.006) (0.067) (0.052) (0.004) (0.077)
MQL 0.000 0.000 −0.005 −0.003 −0.001 0.010 0.000 0.000 −0.002
(0.010) (0.001) (0.034) (0.049) (0.006) (0.067) (0.052) (0.004) (0.076)
CML 0.005 0.001 −0.011 0.000 0.004 0.000 0.002 0.002 −0.007
(0.003) (0.001) (0.013) (0.008) (0.002) (0.026) (0.004) (0.001) (0.026)

Table 5.

Bias and MSE for Series B of Model II (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS 0.009 0.003 −0.019 0.005 −0.012 0.016 0.006 −0.003 −0.002
(0.038) (0.007) (2.068) (0.382) (0.043) (5.495) (0.217) (0.026) (5.702)
MQL 0.008 0.003 −0.017 −0.055 −0.022 0.070 0.009 −0.002 −0.008
(0.037) (0.006) (1.995) (0.378) (0.043) (5.461) (0.220) (0.026) (5.718)
CML 0.007 0.004 −0.017 0.015 0.004 −0.025 0.014 0.007 −0.031
(0.005) (0.002) (0.590) (0.025) (0.006) (1.380) (0.008) (0.004) (1.326)
100 CLS −0.001 −0.002 0.007 −0.006 −0.004 0.007 −0.006 −0.002 0.011
(0.019) (0.003) (1.143) (0.190) (0.023) (3.017) (0.114) (0.011) (2.871)
MQL 0.000 −0.002 0.004 −0.005 −0.004 0.007 −0.006 −0.003 0.012
(0.018) (0.003) (1.091) (0.189) (0.023) (3.001) (0.115) (0.012) (2.882)
CML 0.006 0.002 −0.012 0.008 0.004 −0.017 0.001 0.007 −0.017
(0.002) (0.001) (0.238) (0.012) (0.003) (0.691) (0.004) (0.002) (0.660)
300 CLS −0.003 −0.001 0.004 −0.004 −0.001 −0.006 0.003 −0.002 −0.006
(0.006) (0.001) (0.361) (0.062) (0.007) (0.889) (0.033) (0.004) (0.848)
MQL −0.003 0.000 0.003 −0.002 0.000 −0.008 0.004 −0.002 −0.007
(0.006) (0.001) (0.345) (0.062) (0.007) (0.887) (0.033) (0.004) (0.849)
CML 0.000 0.001 −0.001 0.001 0.001 −0.011 0.004 0.002 −0.015
(0.001) (0.000) (0.069) (0.004) (0.001) (0.205) (0.001) (0.001) (0.222)

Table 6.

Bias and MSE for Series C of Model II (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS −0.004 −0.002 0.069 −0.019 −0.008 0.061 −0.011 −0.008 0.131
(0.038) (0.007) (2.068) (0.382) (0.043) (5.495) (0.217) (0.026) (5.702)
MQL −0.004 −0.002 0.067 −0.016 −0.007 0.051 −0.009 −0.007 0.122
(0.037) (0.006) (1.995) (0.378) (0.043) (5.461) (0.220) (0.026) (5.718)
CML 0.010 0.005 −0.019 0.037 0.014 −0.152 0.013 0.009 −0.038
(0.005) (0.002) (0.590) (0.025) (0.006) (1.380) (0.008) (0.004) (1.326)
100 CLS 0.000 0.000 −0.005 −0.020 −0.004 0.054 0.001 −0.008 0.046
(0.019) (0.003) (1.143) (0.190) (0.023) (3.017) (0.114) (0.011) (2.871)
MQL −0.002 −0.001 0.006 −0.020 −0.004 0.054 0.002 −0.008 0.045
(0.018) (0.003) (1.091) (0.189) (0.023) (3.001) (0.115) (0.012) (2.882)
CML 0.008 0.003 −0.059 0.016 0.005 −0.068 0.009 0.003 −0.047
(0.002) (0.001) (0.238) (0.012) (0.003) (0.691) (0.004) (0.002) (0.660)
300 CLS 0.000 −0.001 −0.007 −0.005 −0.001 0.010 −0.014 −0.004 0.071
(0.006) (0.001) (0.361) (0.062) (0.007) (0.889) (0.033) (0.004) (0.848)
MQL 0.000 −0.001 −0.008 −0.005 −0.001 0.011 −0.014 −0.004 0.072
(0.006) (0.001) (0.345) (0.062) (0.007) (0.887) (0.033) (0.004) (0.849)
CML 0.000 0.000 −0.012 0.005 0.001 −0.020 0.004 0.002 −0.021
(0.001) (0.000) (0.069) (0.004) (0.001) (0.205) (0.001) (0.001) (0.222)

Table 7.

Bias and MSE for Series A of Model III with N = 300 (MSE in parentheses).

$\rho$ Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
(0.9, 0.9, 0.9) CLS 0.002 0.002 −0.004 0.009 0.004 −0.014 −0.007 0.000 0.002
(0.010) (0.002) (0.049) (0.022) (0.004) (0.041) (0.026) (0.004) (0.055)
MQL 0.002 0.002 −0.004 0.009 0.004 −0.014 −0.007 −0.001 0.003
(0.009) (0.002) (0.042) (0.021) (0.004) (0.040) (0.026) (0.004) (0.053)
CML −0.021 −0.009 0.046 −0.043 −0.018 0.057 −0.055 −0.022 0.081
(0.006) (0.001) (0.027) (0.013) (0.003) (0.030) (0.012) (0.003) (0.034)
(0.8, 0.8, 0.8) CLS −0.001 −0.001 0.000 0.005 −0.004 0.005 −0.005 −0.004 0.012
(0.010) (0.002) (0.048) (0.026) (0.004) (0.044) (0.030) (0.004) (0.056)
MQL −0.001 −0.001 0.000 0.005 −0.004 0.006 −0.008 −0.005 0.016
(0.009) (0.002) (0.042) (0.026) (0.004) (0.043) (0.030) (0.004) (0.054)
CML −0.042 −0.018 0.088 −0.080 −0.040 0.122 −0.121 −0.049 0.183
(0.007) (0.002) (0.033) (0.015) (0.004) (0.041) (0.028) (0.004) (0.067)

Table 8.

Bias and MSE for Series B of Model III with N = 300 (MSE in parentheses).

$\rho$ Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
(0.9, 0.9, 0.9) CLS 0.003 0.001 −0.001 0.001 0.000 0.000 0.004 0.000 −0.006
(0.020) (0.003) (0.041) (0.031) (0.005) (0.068) (0.018) (0.004) (0.083)
MQL 0.002 0.001 0.001 0.001 −0.001 0.001 0.003 −0.001 −0.003
(0.018) (0.002) (0.036) (0.030) (0.005) (0.065) (0.017) (0.004) (0.080)
CML −0.023 −0.009 0.041 −0.080 −0.033 0.122 −0.065 −0.030 0.140
(0.007) (0.001) (0.018) (0.019) (0.004) (0.050) (0.014) (0.003) (0.069)
(0.8, 0.8, 0.8) CLS 0.001 0.001 −0.006 −0.005 −0.005 0.017 −0.002 −0.002 0.009
(0.023) (0.003) (0.045) (0.033) (0.006) (0.070) (0.018) (0.004) (0.083)
MQL 0.002 0.002 −0.008 −0.004 −0.005 0.016 −0.002 −0.002 0.010
(0.021) (0.002) (0.039) (0.032) (0.005) (0.067) (0.017) (0.004) (0.078)
CML −0.043 −0.015 0.064 −0.156 −0.065 0.240 −0.122 −0.054 0.263
(0.009) (0.001) (0.021) (0.040) (0.008) (0.104) (0.023) (0.005) (0.119)

Table 9.

Bias and MSE for Series C of Model III with N = 300 (MSE in parentheses).

$\rho$ Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
(0.9, 0.9, 0.9) CLS 0.003 0.001 −0.024 −0.014 −0.007 0.078 −0.002 −0.002 0.021
(0.006) (0.002) (0.534) (0.020) (0.005) (0.485) (0.014) (0.004) (0.643)
MQL 0.003 0.001 −0.020 −0.013 −0.007 0.077 −0.001 −0.002 0.018
(0.005) (0.002) (0.472) (0.020) (0.005) (0.484) (0.014) (0.004) (0.630)
CML −0.044 −0.028 0.432 −0.122 −0.064 0.631 −0.186 −0.097 1.250
(0.005) (0.002) (0.470) (0.020) (0.006) (0.595) (0.044) (0.013) (2.143)
(0.8, 0.8, 0.8) CLS 0.001 −0.001 −0.003 −0.008 −0.003 0.036 0.000 −0.001 0.005
(0.005) (0.002) (0.448) (0.023) (0.005) (0.494) (0.016) (0.004) (0.668)
MQL 0.000 −0.001 0.002 −0.008 −0.002 0.034 −0.001 −0.002 0.007
(0.005) (0.002) (0.407) (0.023) (0.005) (0.490) (0.015) (0.004) (0.661)
CML −0.074 −0.045 0.706 −0.158 −0.085 0.811 −0.296 −0.144 1.907
(0.008) (0.003) (0.754) (0.027) (0.009) (0.800) (0.098) (0.024) (4.230)

As expected, the biases and MSE of the estimators decrease as the sample size N increases, in agreement with their asymptotic properties: asymptotic unbiasedness and consistency. Most of the biases and MSE in Model II are larger than those in Model I; this may be because the variance of $\{Z_t\}$ in Model II is larger than in Model I, which increases the fluctuation of the data.

Table 1, Table 2, Table 3, Table 4, Table 5 and Table 6 summarize the simulation results for the different series under Models I and II. From these tables, we can see that most of the biases and MSE of $\hat\beta_{MQL}$ are smaller than those of $\hat\beta_{CLS}$, perhaps because the MQL method uses more information about the data than the CLS method and can therefore locate the optimum more accurately. In addition, most of the biases of $\hat\beta_{MQL}$ are smaller than those of $\hat\beta_{CML}$, while the MSE are larger; this is because CML exploits the full distribution, and when the assumed distribution is correct it is indeed better than MQL. It is worth mentioning that the CML method is more complicated and time-consuming than the MQL method in the simulations. We conclude that the MQL estimators are better than the CLS estimators, and that the CML estimators are not uniformly better than the MQL estimators.

To demonstrate the robustness of the MQL method, we consider simulations from Model III with different series using the CLS, MQL and CML methods, setting N=300 and $\rho=(0.9,0.9,0.9)$ and $(0.8,0.8,0.8)$, respectively. From Table 7, Table 8 and Table 9, we can see that when ρ varies from (0.9, 0.9, 0.9) down to (0.8, 0.8, 0.8), the effect on the CLS and MQL estimators is slight, and most of the biases and MSE of the MQL estimators are smaller than those of CLS. However, because an incorrect distribution is used, the biases and MSE of the CML estimators increase. This indicates that the MQL method is more robust than the CLS and CML methods.

4.2. Performance of $\hat r_{MQL}$ and $\hat r_{CLS}$

As discussed in Section 3.2, we estimate the thresholds vector by the conditional least squares and modified quasi-likelihood methods, and compare the performances of $\hat r_{MQL}$ and $\hat r_{CLS}$ in this subsection through simulation studies. From the results in Section 4.1, the contaminated data generated from Model III have little influence on the least squares and quasi-likelihood estimators, so we only simulate threshold estimation for the different series under Models I and II. We assess the performance of $\hat r$ by the bias, MSE and bias median, where the bias median is defined by

$$\mathrm{Bias}_{\mathrm{median}}=\operatorname*{median}_{i\in\{1,2,\dots,K\}}\bigl(\hat r_{ij}-r_{0j}\bigr),\qquad j=1,2,\dots,T,$$

where $\hat r_{ij}$ is the estimator of $r_{0j}$ in the i-th replication, $r_{0j}$ is the true value with $j=1,2,\dots,T$, and K is the number of replications. The simulation results are summarized in Table 10, Table 11, Table 12, Table 13, Table 14 and Table 15.

Table 10.

Bias, bias median and MSE for Series A of Model I.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 −0.167 0 0.447 0.042 0 0.550
r2 0.422 0 1.986 0.723 0 2.841
r3 0.457 0 1.975 0.947 0 3.779
100 r1 −0.107 0 0.151 −0.003 0 0.137
r2 0.224 0 1.378 0.570 0 2.428
r3 0.245 0 0.861 0.505 0 1.903
300 r1 −0.007 0 0.007 0.000 0 0.002
r2 0.027 0 0.283 0.117 0 0.477
r3 0.021 0 0.035 0.066 0 0.200

Table 11.

Bias, bias median and MSE for Series B of Model I.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 0.499 0 2.129 1.294 1 4.176
r2 0.538 0 2.320 0.868 0 3.142
r3 0.139 0 2.687 0.634 0 3.610
100 r1 0.555 1 1.933 1.301 1 3.597
r2 0.283 0 1.437 0.643 0 2.473
r3 0.107 0 2.537 0.599 0 3.431
300 r1 0.480 1 1.518 1.215 1 2.485
r2 0.021 0 0.213 0.141 0 0.489
r3 −0.095 0 1.191 0.261 0 1.825

Table 12.

Bias, bias median and MSE for Series C of Model I.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 −0.012 0 0.588 0.023 0 0.661
r2 0.268 0 5.378 0.541 0 5.909
r3 0.155 0 1.433 0.216 0 1.750
100 r1 0.015 0 0.079 0.023 0 0.081
r2 0.072 0 2.332 0.254 0 2.972
r3 0.041 0 0.325 0.050 0 0.330
300 r1 0.000 0 0.000 0.000 0 0.000
r2 −0.015 0 0.317 0.027 0 0.457
r3 0.002 0 0.004 0.002 0 0.004

Table 13.

Bias, bias median and MSE for Series A of Model II.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 0.027 0 1.231 0.407 0 2.227
r2 1.025 0 4.897 1.293 1 6.051
r3 1.582 1 7.600 2.003 1 9.905
100 r1 −0.013 0 0.489 0.185 0 0.723
r2 0.944 0 4.808 1.271 0 6.215
r3 1.539 0 8.391 2.005 1 11.269
300 r1 −0.042 0 0.066 0.022 0 0.070
r2 0.652 0 3.560 0.940 0 5.088
r3 0.605 0 3.243 1.062 0 6.540

Table 14.

Bias, bias median and MSE for Series B of Model II.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 1.231 1 5.527 2.134 2 9.638
r2 1.307 1 6.063 1.633 1 7.439
r3 0.840 0 6.658 1.237 1 8.211
100 r1 1.070 1 3.954 1.972 2 8.050
r2 1.208 0 5.772 1.561 1 7.375
r3 0.998 0 7.652 1.488 1 9.644
300 r1 1.059 1 3.143 1.829 2 5.611
r2 0.717 0 3.465 1.031 0 4.961
r3 0.617 0 5.925 1.153 0 8.549

Table 15.

Bias, bias median and MSE for Series C of Model II.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 −1.066 0 11.494 −0.859 0 12.671
r2 0.006 0 18.430 0.149 0 19.137
r3 −0.337 −1 27.211 −0.206 −1 27.764
100 r1 −0.130 0 4.220 0.078 0 5.250
r2 0.538 0 22.610 0.696 0 23.536
r3 0.241 0 26.911 0.386 0 28.340
300 r1 −0.040 0 0.236 −0.016 0 0.262
r2 1.213 0 26.909 1.389 0 28.515
r3 0.794 0 19.586 0.961 0 21.521

From Table 10, Table 11, Table 12, Table 13, Table 14 and Table 15, we can see that all the estimators perform better as the sample size N increases, which implies that they are consistent. The results in Table 10, Table 11 and Table 12 have smaller biases, bias medians and MSE than those in Table 13, Table 14 and Table 15; this might be because the variance of Model II is larger than that of Model I for each series. Moreover, almost all the biases, bias medians and MSE of the MQL estimators are smaller than those of the CLS estimators, and the MSE of some MQL estimators are even half those of the CLS. Because the thresholds are integer-valued, the bias median is a more reasonable measure of the accuracy of the estimators. We conclude that the MQL method estimates the thresholds much better than CLS.

In the simulations above, we generate the data with $X_0=0$; however, 0 is not the mean of the process. We therefore also generate longer series, discard an initial portion of the generated data, and use the remaining data for inference, i.e., “burn in” samples. Here, we generate series of length 1800 and run the simulations for Series A of Model I, Model II and Model III with $\rho=(0.8,0.8,0.8)$; the other simulation settings are the same as before. The results are listed in Table 16, Table 17, Table 18, Table 19 and Table 20. From these tables, we can see that with “burn in” samples the estimated results are similar to those obtained with initial value 0, which indicates that the initial value does not affect the estimation results.

Table 16.

Bias and MSE for Series A of Model I with “burn in” samples (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS −0.002 −0.008 0.012 −0.018 −0.012 0.029 0.001 0.004 −0.008
(0.067) (0.017) (0.338) (0.132) (0.024) (0.241) (0.168) (0.025) (0.351)
MQL 0.001 −0.006 0.006 −0.016 −0.012 0.027 0.002 0.004 −0.007
(0.066) (0.017) (0.331) (0.134) (0.024) (0.240) (0.174) (0.024) (0.347)
CML 0.032 0.008 −0.061 0.043 0.007 −0.044 0.000 0.004 −0.005
(0.027) (0.008) (0.124) (0.056) (0.015) (0.125) (0.046) (0.012) (0.125)
100 CLS −0.005 −0.006 0.014 −0.011 −0.005 0.012 −0.001 −0.002 0.011
(0.030) (0.009) (0.153) (0.063) (0.011) (0.106) (0.081) (0.012) (0.166)
MQL −0.006 −0.006 0.017 −0.012 −0.006 0.013 0.000 −0.002 0.008
(0.028) (0.008) (0.138) (0.061) (0.010) (0.103) (0.078) (0.012) (0.158)
CML 0.006 −0.001 −0.010 0.025 0.006 −0.031 0.000 0.001 0.003
(0.015) (0.004) (0.069) (0.035) (0.007) (0.069) (0.021) (0.006) (0.061)
300 CLS −0.001 −0.002 0.002 −0.003 −0.001 0.009 0.002 0.001 0.000
(0.010) (0.003) (0.052) (0.019) (0.004) (0.034) (0.024) (0.004) (0.050)
MQL 0.000 −0.001 0.000 −0.003 −0.001 0.008 0.001 0.001 0.002
(0.009) (0.002) (0.047) (0.019) (0.003) (0.033) (0.024) (0.003) (0.049)
CML 0.001 −0.001 −0.003 0.003 0.001 0.001 0.001 0.001 0.000
(0.006) (0.002) (0.027) (0.015) (0.003) (0.025) (0.007) (0.002) (0.021)

Table 17.

Bias and MSE for Series A of Model II with “burn in” samples (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS 0.009 0.000 −0.017 0.023 −0.011 0.005 −0.011 −0.004 0.005
(0.067) (0.011) (0.242) (0.306) (0.035) (0.424) (0.303) (0.026) (0.479)
MQL 0.010 0.000 −0.017 0.028 −0.010 0.000 −0.018 −0.005 0.012
(0.065) (0.010) (0.227) (0.302) (0.035) (0.420) (0.355) (0.026) (0.505)
CML 0.022 0.006 −0.042 0.053 0.013 −0.048 0.007 0.007 −0.032
(0.015) (0.004) (0.075) (0.045) (0.010) (0.151) (0.022) (0.007) (0.148)
100 CLS −0.011 −0.001 0.026 −0.013 0.000 0.015 −0.024 −0.007 0.022
(0.034) (0.005) (0.123) (0.151) (0.016) (0.210) (0.157) (0.012) (0.223)
MQL −0.011 −0.001 0.026 −0.013 −0.001 0.016 −0.022 −0.006 0.019
(0.033) (0.005) (0.117) (0.148) (0.016) (0.208) (0.156) (0.012) (0.219)
CML 0.006 0.005 −0.004 0.018 0.007 −0.013 0.009 0.003 −0.015
(0.008) (0.002) (0.039) (0.025) (0.005) (0.075) (0.010) (0.003) (0.076)
300 CLS −0.004 −0.001 0.005 −0.001 −0.002 0.008 0.000 −0.005 0.006
(0.010) (0.002) (0.037) (0.050) (0.005) (0.069) (0.052) (0.003) (0.074)
MQL −0.003 −0.001 0.004 −0.001 −0.002 0.009 0.001 −0.005 0.004
(0.010) (0.001) (0.034) (0.049) (0.005) (0.068) (0.051) (0.003) (0.073)
CML 0.001 0.001 −0.005 0.007 0.001 −0.001 0.000 −0.001 −0.002
(0.003) (0.001) (0.013) (0.008) (0.002) (0.028) (0.003) (0.001) (0.028)

Table 18.

Bias and MSE for Series A of Model III with $\rho=(0.8,0.8,0.8)$ and “burn in” samples (MSE in parentheses): CLS, MQL and CML.

N Method $\alpha_1^{(1)}$ $\alpha_1^{(2)}$ $\lambda_1$ $\alpha_2^{(1)}$ $\alpha_2^{(2)}$ $\lambda_2$ $\alpha_3^{(1)}$ $\alpha_3^{(2)}$ $\lambda_3$
50 CLS −0.087 −0.040 0.214 0.018 −0.004 −0.007 0.019 0.002 −0.030
(0.068) (0.016) (0.339) (0.153) (0.025) (0.248) (0.203) (0.026) (0.381)
MQL −0.011 −0.007 0.026 0.019 −0.003 −0.008 0.019 0.002 −0.031
(0.065) (0.014) (0.292) (0.155) (0.024) (0.244) (0.203) (0.026) (0.376)
CML −0.016 −0.009 0.039 −0.012 −0.024 0.047 −0.109 −0.045 0.153
(0.022) (0.007) (0.118) (0.042) (0.015) (0.140) (0.091) (0.017) (0.245)
100 CLS −0.044 −0.017 0.103 −0.015 −0.006 0.020 −0.005 −0.003 0.008
(0.033) (0.008) (0.162) (0.075) (0.012) (0.132) (0.100) (0.013) (0.199)
MQL −0.008 −0.002 0.013 −0.015 −0.006 0.020 −0.004 −0.003 0.008
(0.030) (0.007) (0.137) (0.074) (0.012) (0.129) (0.099) (0.012) (0.197)
CML −0.043 −0.017 0.088 −0.057 −0.033 0.093 −0.129 −0.048 0.186
(0.014) (0.004) (0.073) (0.027) (0.008) (0.083) (0.062) (0.010) (0.156)
300 CLS −0.016 −0.006 0.036 0.000 −0.002 0.000 0.004 −0.001 −0.003
(0.010) (0.002) (0.048) (0.026) (0.004) (0.043) (0.030) (0.004) (0.057)
MQL −0.003 −0.001 0.003 −0.001 −0.002 0.003 0.004 −0.001 −0.003
(0.009) (0.002) (0.043) (0.025) (0.004) (0.042) (0.029) (0.004) (0.054)
CML −0.047 −0.020 0.097 −0.081 −0.037 0.113 −0.112 −0.046 0.169
(0.007) (0.002) (0.035) (0.016) (0.004) (0.038) (0.025) (0.004) (0.061)

Table 19.

Bias, bias median and MSE for Series A of Model I with “burn in” samples.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 −0.180 0 0.400 0.053 0 0.393
r2 0.390 0 1.960 0.720 0 2.894
r3 0.580 0 2.322 0.963 0 3.583
100 r1 −0.099 0 0.143 −0.007 0 0.081
r2 0.198 0 1.142 0.491 0 1.975
r3 0.218 0 0.800 0.455 0 1.585
300 r1 −0.015 0 0.015 −0.004 0 0.004
r2 0.018 0 0.268 0.098 0 0.416
r3 0.018 0 0.036 0.058 0 0.170

Table 20.

Bias, bias median and MSE for Series A of Model II with “burn in” samples.

N Para. MQL CLS
Bias Median MSE Bias Median MSE
50 r1 −0.071 0 0.835 0.252 0 1.394
r2 1.156 0 5.640 1.436 1 6.878
r3 1.616 1 7.974 2.046 1 10.284
100 r1 −0.110 0 0.320 0.099 0 0.473
r2 1.172 0 5.662 1.477 1 7.063
r3 1.518 1 7.508 1.947 1 10.059
300 r1 −0.041 0 0.055 0.027 0 0.055
r2 0.648 0 3.364 0.940 0 4.884
r3 0.574 0 3.532 0.854 0 5.338

5. Real Data Example

In this section, we use the PSETINAR$(2;1,1)_T$ process to fit the series of monthly counts of claimants collecting short-term disability benefits. In the dataset, all the claimants are male, have cuts, lacerations or punctures, and are between the ages of 35 and 54. In addition, they all work in the logging industry and collect benefits from the Workers’ Compensation Board (WCB) of British Columbia. The dataset consists of 120 observations, from 1985 to 1994 (Freeland [36]). The computations were performed in R 3.6.2. The threshold vector was calculated by the algorithms described in Section 3.2 (the three-step NeSS algorithm combined with the quasi-likelihood principle, and the NeSS algorithm combined with the least squares principle). We use the command constrOptim to optimize the objective function of the maximum likelihood estimation. Figure 4 shows the sample path, ACF and PACF plots of the observations, from which it can be seen that this dataset is a dependent count time series with a periodic characteristic.

Figure 4. The sample path plot (a), ACF and PACF plots (b,c) for the counts of claimants.

We use the periodogram method to determine the period of this dataset and draw Figure 5, from which it can be seen that $I_n(f_k)$ reaches its maximum at $f_k=1/12$; we therefore conclude that $T=12$. This confirms the periodic character of the data, with one cycle per year.

Figure 5. The periodogram plot for the monthly counts of claimants.

Table 21 displays the descriptive statistics for the monthly counts of claimants collecting short-term disability benefits from the WCB. From Table 21, we can see that the mean and variance are approximately equal in some months, which suggests a periodic Poisson distribution for the innovations; however, some months, and the data as a whole, indicate overdispersion. Moreover, the dataset contains no zeros, its minimum value being one. This leads us to consider the periodic Poisson, periodic geometric, zero-truncated periodic Poisson and zero-truncated periodic geometric distributions for the innovations. Before the model fitting, we first estimate the threshold vector: $\hat r_{CLS}$ is calculated by (9) and $\hat r_{MQL}$ through (10) using the three-step algorithm. Table 22 summarizes the resulting $\hat r_{CLS}$ and $\hat r_{MQL}$. Because the dataset is small, when a regime contains fewer than two observations, or when the estimated threshold equals the maximum or minimum boundary value, we regard the corresponding month as having no piecewise behavior; by this criterion, March, July, and August show no piecewise phenomena.

Table 21.

Summary statistics for the monthly counts of claimants.

Whole Dataset Jan. Feb. Mar. Apr. May Jun. Jul. Aug. Sep. Oct. Nov. Dec.
Mean 6.1 4.2 3.8 4.6 4.9 7.0 7.1 8.5 7.5 7.2 7.2 7.2 4.4
Variance 11.8 2.2 3.3 1.8 9.0 14.7 5.9 28.9 12.5 12.0 12.2 14.8 6.9
Maximum 21 6 7 8 10 14 12 21 12 12 12 14 19
Minimum 1 2 1 3 1 2 3 3 2 2 2 2 1

Table 22.

Threshold estimators for the monthly counts of claimants.

Jan. Feb. Mar. Apr. May Jun. Jul. Aug. Sep. Oct. Nov. Dec.
r^CLS 3 4 7 5 5 6 10 4 9 6 7 6
r^MQL 3 4 7 5 5 6 10 4 9 6 7 5

To capture the piecewise phenomenon of this time series, we fit PINAR$(1)_{12}$ and PSETINAR$(2;1,1)_{12}$ models with period $T=12$, respectively. The PINAR(1) process proposed by Monteiro et al. [22] satisfies the recursive equation

$$X_t=\alpha_t\circ X_{t-1}+Z_t, \tag{12}$$

with $\alpha_t=\alpha_j\in(0,1)$ for $t=j+sT$ ($j=1,\dots,T$, $s\in\mathbb{N}_0$); the definition of the thinning operator “$\circ$” and of the innovation process $\{Z_t\}$ is the same as for the PSETINAR$(2;1,1)_T$ process.

It is worth mentioning that, for this dataset, the conditional least squares and quasi-likelihood methods produce non-admissible estimates for some months, so we use the conditional maximum likelihood approach to estimate the parameters. Next, we fit the PSETINAR$(2;1,1)_{12}$ and PINAR$(1)_{12}$ models combined with the four innovation distributions mentioned above; the threshold vectors are based on $\hat r_{MQL}$. The AIC and BIC are listed in Table 23; smaller values indicate a better fit. From the results in Table 23, we conclude that the PSETINAR$(2;1,1)_{12}$ model with zero-truncated periodic Poisson innovations is the most suitable. The conditional maximum likelihood estimates are listed in Table 24. Some of the estimates in Table 24, for example $\alpha^{(2)}$ for January, May, June, September, October and November, are not statistically significant, suggesting that in those months the number of claims is mainly driven by the innovation process.

Table 23.

The AIC and BIC of the claims data.

PSETINAR$(2;1,1)_{12}$ AIC BIC PINAR$(1)_{12}$ AIC BIC
Pois. 586.63 596.61 Pois. 592.12 599.38
Zero-truncated Pois. 581.65 591.64 Zero-truncated Pois. 594.44 601.71
Geom. 610.45 620.43 Geom. 605.56 612.82
Zero-truncated Geom. 586.36 596.34 Zero-truncated Geom. 595.15 602.42

Table 24.

CML estimators in the dataset.

Month $\alpha^{(1)}$ $\alpha^{(2)}$ $\lambda$
Jan. 0.112 8.907×10⁻⁸ 3.819
Feb. 0.227 0.032 3.060
Mar. 0.692 - 1.969
Apr. 0.999 0.240 2.048
May 0.586 8.521×10⁻⁹ 4.889
Jun. 0.265 4.316×10⁻⁸ 5.507
Jul. 0.360 - 5.942
Aug. 0.390 - 4.186
Sep. 0.380 3.366×10⁻⁷ 5.218
Oct. 0.502 1.027×10⁻⁷ 4.044
Nov. 0.433 2.776×10⁻⁸ 4.990
Dec. 0.508 0.222 1.000

Remark: “-” stands for not available.

To check the predictive ability of the PSETINAR$(2;1,1)_T$ model, we carry out h-step-ahead forecasting for varying h. The h-step-ahead conditional expectation point predictor of the PSETINAR$(2;1,1)_T$ model is given by

$$\hat X_{j+sT+h}=E\bigl(X_{j+sT+h}\mid X_{j+sT}\bigr),\qquad h=1,2,\dots.$$

Specifically, the one-step-ahead conditional expectation point predictor is given by

$$\hat X_{j+sT+1}=E\bigl(X_{j+sT+1}\mid X_{j+sT}\bigr)=\alpha_{j+1}^{(1)}X_{j+sT}I_{j+sT}^{(1)}+\alpha_{j+1}^{(2)}X_{j+sT}I_{j+sT}^{(2)}+\lambda_{j+1}.$$
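This predictor is a one-liner in code; the sketch below assumes seasonal parameter arrays indexed so that entry $j-1$ holds season j.

```python
# One-step-ahead conditional-expectation point predictor (sketch): the regime of
# X_{j+sT} relative to r_{j+1} selects the coefficient of season j+1.
def predict_one_step(x_now, season_next, alpha1, alpha2, lam, r):
    j = season_next - 1                    # 0-based index of season j+1
    a = alpha1[j] if x_now <= r[j] else alpha2[j]
    return a * x_now + lam[j]
```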

However, the conditional expectation will seldom produce integer-valued forecasts. Recently, coherent forecasting techniques have been recommended, which produce forecasts only in $\mathbb{N}_0$. This is achieved by computing the h-step-ahead forecasting conditional distribution. As pointed out by Möller et al. [37], with this approach the forecasts are easily obtained from the median or the mode of the forecasting distribution. In addition, Li et al. [38] and Kang et al. [8] have applied this method to forecast integer-valued processes, and Homburg et al. [39] discussed and compared prediction methods based on conditional distributions and Gaussian approximations, applying them to several integer-valued processes. For the PSETINAR$(2;1,1)_T$ process, the one-step-ahead conditional distribution of $X_{j+sT+1}$ given $X_{j+sT}$ is given by

$$P\bigl(X_{j+sT+1}=x_{j+sT+1}\mid X_{j+sT}=x_{j+sT}\bigr)=\sum_{i=0}^{\min(x_{j+sT},\,x_{j+sT+1})}\sum_{k=1}^{2}\binom{x_{j+sT}}{i}\bigl(\alpha_{j+1}^{(k)}\bigr)^{i}\bigl(1-\alpha_{j+1}^{(k)}\bigr)^{x_{j+sT}-i}I_{j+sT}^{(k)}\,P\bigl(Z_{j+sT+1}=x_{j+sT+1}-i\bigr).$$
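This p.m.f. is a binomial-thinning convolution and is straightforward to evaluate; the sketch below assumes zero-truncated Poisson innovations (the distribution selected via Table 23) purely for illustration.

```python
# Sketch of the one-step forecast pmf: binomial survivors of X_t convolved with
# the innovation pmf; zero-truncated Poisson innovations assumed for illustration.
from math import comb, exp, factorial

def zt_poisson_pmf(z, lam):
    """Zero-truncated Poisson pmf on {1, 2, ...}."""
    if z < 1:
        return 0.0
    return exp(-lam) * lam**z / (factorial(z) * (1.0 - exp(-lam)))

def forecast_pmf(x_now, y, alpha, lam):
    """P(X_{t+1} = y | X_t = x_now), with alpha the active regime coefficient."""
    return sum(comb(x_now, i) * alpha**i * (1.0 - alpha)**(x_now - i)
               * zt_poisson_pmf(y - i, lam)
               for i in range(min(x_now, y) + 1))
```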

Due to the existence of the threshold, when we use the conditional expectation method to predict $X_{j+sT+h}$, $h>1$, we must first predict $X_{j+sT+h-1}$ and compare it with the corresponding threshold before making the next prediction. We proceed in the same way for the conditional distribution method. (To prevent confusion, we call this method a point-wise conditional distribution forecast; predictors based entirely on the h-step-ahead conditional distribution, without intermediate-step prediction, are discussed later.) The mode of the h-step-ahead point-wise conditional distribution can be viewed as the point prediction. To compare the two forecasting methods, a standard descriptive measure of forecasting accuracy, the h-step-ahead predicted root mean squared error (PRMSE), is adopted:

$$\mathrm{PRMSE}=\sqrt{\frac{1}{K-K_0}\sum_{t=K_0+1}^{K}\bigl(X_{t+h}-\hat X_{t+h}\bigr)^{2}},\qquad h=1,2,\dots,$$

where K is the full sample size; we split the data into two parts and use the last $K-K_0$ observations as a forecasting evaluation sample. We forecast the values of the last year for $h=1,2,3,12$.
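The point-wise recursion described above is easy to state in code; in this sketch, each intermediate forecast is compared with its threshold before the next step.

```python
# Sketch of the point-wise h-step conditional-expectation forecast: iterate the
# one-step predictor, re-checking the regime of each intermediate forecast.
def predict_h_steps(x_now, season_now, h, alpha1, alpha2, lam, r, T):
    x, season = float(x_now), season_now
    for _ in range(h):
        season = season % T + 1            # advance the season cyclically
        j = season - 1
        a = alpha1[j] if x <= r[j] else alpha2[j]
        x = a * x + lam[j]
    return x
```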

The PRMSEs of the h-step-ahead point predictors are listed in Table 25. For the conditional expectation point predictors, the PRMSEs of PSETINAR$(2;1,1)_{12}$ with zero-truncated periodic Poisson innovations are smaller than those of PINAR$(1)_{12}$ with periodic Poisson and zero-truncated periodic Poisson innovations, which further shows the superiority of our model. The PRMSEs of the one-step-ahead point predictors are smaller than the others; this is natural, because the value of the previous moment serves as the explanatory variable. For PSETINAR$(2;1,1)_{12}$ with zero-truncated periodic Poisson innovations, the PRMSEs of the twelve-step-ahead predictors are smaller than those of the other h-step-ahead predictors except one-step-ahead, presumably because the period is 12. The PRMSE of the one-step-ahead conditional expectation point predictor is smaller than that of the point-wise conditional distribution point predictor, so the former method is better for this dataset.

Table 25.

PRMSE of the h-step-ahead point predictors.

h 1 2 3 12
Conditional expectation PSETINAR$(2;1,1)_{12}$ (zero-truncated Pois.) 2.641 3.019 3.433 2.929
PINAR$(1)_{12}$ (zero-truncated Pois.) 2.753 3.377 3.567 3.788
PINAR$(1)_{12}$ (Pois.) 2.724 3.407 3.704 4.008
Conditional distribution PSETINAR$(2;1,1)_{12}$ (zero-truncated Pois.) 2.814 3.000 3.109 2.930

The PRMSEs of the one-step-ahead fitted series calculated by conditional expectation and by conditional distribution are 2.434 and 3.565, respectively, which further illustrates that, for this dataset, one-step-ahead forecasting by conditional expectation is better than by conditional distribution. The original data and the fitted series (calculated by the one-step-ahead conditional expectation based on the observations at the previous moments) from the PSETINAR$(2;1,1)_{12}$ model with zero-truncated periodic Poisson innovations are plotted in Figure 6. The fitted trend is similar to the original data; except for points with large values (where the poor predictions may be due to a wrong regime classification), the model fits the data well.

Figure 6. Plot of fitted curves of the claims data.

Actually, we can obtain the h-step-ahead conditional distribution in general; as an example, the two-step-ahead and three-step-ahead conditional distributions are

$$P\bigl(X_{j+sT+2}=x_{j+sT+2}\mid X_{j+sT}=x_{j+sT}\bigr)=\sum_{m=0}^{n}P\bigl(X_{j+sT+1}=m\mid X_{j+sT}=x_{j+sT}\bigr)\,P\bigl(X_{j+sT+2}=x_{j+sT+2}\mid X_{j+sT+1}=m\bigr),$$

and

$$P\bigl(X_{j+sT+3}=x_{j+sT+3}\mid X_{j+sT}=x_{j+sT}\bigr)=\sum_{m=0}^{n}P\bigl(X_{j+sT+2}=m\mid X_{j+sT}=x_{j+sT}\bigr)\,P\bigl(X_{j+sT+3}=x_{j+sT+3}\mid X_{j+sT+2}=m\bigr),$$

where $m\in\{0,1,\dots,n\}$ ranges over the possible values of $X_{j+sT}$, $j=1,\dots,T$, $s\in\mathbb{N}_0$. For $h=1,2,3$, the h-step-ahead conditional distributions are plotted in Figure 7, where $x_{j+sT}$ represents the count of claimants in December 1993 and February 1994, respectively. The mode of the h-step-ahead conditional distribution can be viewed as the point prediction. The PRMSEs of the two-step-ahead and three-step-ahead point predictors for the last year are 3.227 and 3.215, respectively, which are larger than those of the point-wise conditional distribution method described before. For other datasets or models, the h-step-ahead forecasting conditional distribution may show some advantages. We will not go into details here.
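These are Chapman–Kolmogorov steps and can be iterated for any h; the following sketch reuses the hypothetical forecast_pmf from the one-step sketch above, truncating the support at n_max.

```python
# Sketch of the h-step conditional distribution via the recursion above, reusing
# the forecast_pmf sketch; n_max truncates the support {0, 1, ..., n}.
def forecast_dist_h(x_now, h, alpha1, alpha2, lam, r, n_max=60):
    """alpha1, alpha2, lam, r: sequences giving the parameters of the h future steps."""
    dist = {x_now: 1.0}
    for step in range(h):
        new = {}
        for m, p in dist.items():
            a = alpha1[step] if m <= r[step] else alpha2[step]
            for y in range(n_max + 1):
                q = forecast_pmf(m, y, a, lam[step])
                if q > 0.0:
                    new[y] = new.get(y, 0.0) + p * q
        dist = new
    return dist
```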

Figure 7. The h-step-ahead forecasting conditional distribution for the counts of claimants: (a–c) conditional on the count of claimants in December 1993; (d–f) conditional on the count of claimants in February 1994.

6. Conclusions

This paper extended the PSETINAR$(2;1,1)_T$ process proposed by Pereira et al. [25] by removing the Poisson assumption on $\{Z_t\}$ and considering the process under the weaker condition that the second moment of $\{Z_t\}$ is finite. The ergodicity of the process is established. MQL estimators of the model parameter vector β, and MQL and CLS estimators of the thresholds vector r, are obtained. Moreover, the simulations demonstrate the advantages of the quasi-likelihood method in comparison with the conditional maximum likelihood and conditional least squares methods. An application to a real dataset is presented, and the forecasting problem for this dataset is addressed.

In this paper, we only discuss the PSETINAR$(2;1,1)_T$ process for univariate time series. An extension to a multivariate PSETINAR$(2;1,1)_T$ process with a diagonal or cross-correlated autoregressive matrix is therefore a topic for future investigation. Beyond this extension, there are a number of interesting problems for future research in this area. For example, even a simple periodic model can have an inordinately large number of parameters; this is true for PSETINAR$(2;1,1)_T$ models and even more so for multi-period models. The development of dimension-reduction procedures to overcome the computational difficulties is therefore a pressing problem and remains a topic for future research.

Appendix A

Proof of Theorem 1. 

According to Theorem 2 of Tweedie [40] (see also Zheng and Basawa [41]), for the process defined by (2) and $j=1,2,\dots,T$, $s\in\mathbb{Z}$, we have

$$E\bigl(X_{j+sT}\mid X_{j+sT-1}=x\bigr)=\alpha_j^{(1)}xI_{j+sT-1}^{(1)}+\alpha_j^{(2)}xI_{j+sT-1}^{(2)}+\lambda_j\le\alpha_{j,\max}\,x+\lambda_j,$$

where $\alpha_{j,\max}=\max\{\alpha_j^{(1)},\alpha_j^{(2)}\}<1$.

Let $K=\Bigl[\dfrac{1+\lambda_j}{1-\alpha_{j,\max}}\Bigr]+1$, where $[\cdot]$ denotes the integer part of a number. Then for $x\ge K$, we have

$$E\bigl(X_{j+sT}\mid X_{j+sT-1}=x\bigr)\le x-1,$$

and for $x<K$,

$$E\bigl(X_{j+sT}\mid X_{j+sT-1}=x\bigr)\le\alpha_{j,\max}\,x+\lambda_j\le K+\lambda_j<\infty.$$

Therefore, the process {Xt} for t=j+sT defined in (2) is an ergodic Markov chain. □

Proof of Proposition 2. 

(i) From Proposition 1, we have

$$\operatorname{Var}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=E\Bigl[\bigl(X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\bigr)^{2}\Bigm|X_{j+sT-1}\Bigr]=E\Bigl[\bigl(X_{j+sT}-\alpha_j^{(1)}X_{j+sT-1}I_{j+sT-1}^{(1)}-\alpha_j^{(2)}X_{j+sT-1}I_{j+sT-1}^{(2)}-\lambda_j\bigr)^{2}\Bigm|X_{j+sT-1}\Bigr],$$

and

$$\operatorname{Var}\bigl(X_{j+sT}\mid X_{j+sT-1}\bigr)=\sum_{k=1}^{2}\alpha_j^{(k)}\bigl(1-\alpha_j^{(k)}\bigr)X_{j+sT-1}I_{j+sT-1}^{(k)}+\sigma_{z,j}^2,$$

with $k=1,2$, $j=1,\dots,T$, $s\in\mathbb{N}_0$; so by substituting suitable consistent estimators of $\alpha_j^{(k)}$ and $\lambda_j$, we obtain the consistent estimator of $\sigma_{z,j}^2$,

$$\hat\sigma_{1,z,j}^2=\frac{1}{N}\sum_{s=0}^{N-1}\Bigl(X_{j+sT}-\sum_{k=1}^{2}\hat\alpha_j^{(k)}X_{j+sT-1}I_{j+sT-1}^{(k)}-\hat\lambda_j\Bigr)^{2}-\frac{1}{N}\sum_{k=1}^{2}\sum_{s=0}^{N-1}\hat\alpha_j^{(k)}\bigl(1-\hat\alpha_j^{(k)}\bigr)X_{j+sT-1}I_{j+sT-1}^{(k)}.$$

(ii) Moreover, from model (2), we have

VarXj+sT=k=12VarαjkXj+sT1Ij+sT1k+VarZj+sT+2Covαj1Xj+sT1Ij+sT11,αj2Xj+sT1Ij+sT12,

where

VarαjkXj+sT1Ij+sT1k=VarEαjkXj+sT1Ij+sT1k|Xj+sT1+EVarαjkXj+sT1Ij+sT1k|Xj+sT1=αjk2VarXj+sT1Ij+sT1k+αjk1αjkEXj+sT1Ij+sT1k,

and

2Covαj1Xj+sT1Ij+sT11,αj2Xj+sT1Ij+sT12=2Eαj1Xj+sT1Ij+sT11Eαj2Xj+sT1Ij+sT12=2αj1αj2EXj+sT1Ij+sT11EXj+sT1Ij+sT12.

Note that

EXj+sT1Ij+sT11=EEXj+sT1Ij+sT11|Ij+sT11=1=EXj+sT1Ij+sT11|Ij+sT11=1PIj+sT11=1=EXj+sT1|Xj+sT1rjPIj+sT11=1.

Let pj=PIj+sT11=1 with j=1,,T,sN0, we can estimate it with p^j=1Ns=0N1Ij+sT11. Therefore, by substituting a suitable consistent estimator of αjk, based on moment estimation, we can get the estimator σ^2,z,j2 in Proposition 2. □
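The estimator \hat\sigma^2_{1,z,j} from part (i) is easy to compute in practice. Below is a minimal sketch, assuming consistent estimates a1_hat, a2_hat and lam_hat are already available; the function and variable names are our own.

import numpy as np

def sigma2_z_hat(x, j, T, a1_hat, a2_hat, lam_hat, r_j):
    """Moment estimator sigma^2_{1,z,j}: mean squared one-step residual for
    season j minus the estimated binomial-thinning variance contribution."""
    x = np.asarray(x)
    curr = x[j::T]                         # X_{j+sT}, s = 0, 1, ...
    prev = x[j - 1::T][: len(curr)]        # X_{j+sT-1}
    curr = curr[: len(prev)]
    low = prev <= r_j                      # regime indicator I^{(1)}
    mean_hat = np.where(low, a1_hat, a2_hat) * prev + lam_hat
    thin_var = np.where(low, a1_hat * (1 - a1_hat),
                        a2_hat * (1 - a2_hat)) * prev
    return float(np.mean((curr - mean_hat) ** 2) - np.mean(thin_var))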

Proof of Theorem 2. 

Let \mathcal F_{j+sT} = σ(X_0, X_1, …, X_{j+sT}) with j = 1, …, T, s ∈ ℕ0. First, suppose that θ is known. For the estimating equation

\[
S^{(1)}_{N,j}(\theta_j,\beta_j)=\sum_{s=0}^{N-1}V_{\theta_j}^{-1}\big(X_{j+sT}\mid X_{j+sT-1}\big)\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big)X_{j+sT-1}I^{(1)}_{j+sT-1},
\]

we have

\[
E\big[V_{\theta_j}^{-1}\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big)X_{j+sT-1}I^{(1)}_{j+sT-1}\,\big|\,\mathcal F_{j+sT-1}\big]
=V_{\theta_j}^{-1}X_{j+sT-1}I^{(1)}_{j+sT-1}\,E\big[X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\,\big|\,\mathcal F_{j+sT-1}\big]
=V_{\theta_j}^{-1}X_{j+sT-1}I^{(1)}_{j+sT-1}\,E\big[X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\,\big|\,X_{j+sT-1}\big]=0,
\]

and

\[
E\big(S^{(1)}_{s,j}(\theta_j,\beta_j)\mid\mathcal F_{j+sT-1}\big)=E\big[V_{\theta_j}^{-1}\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big)X_{j+sT-1}I^{(1)}_{j+sT-1}+S^{(1)}_{s-1,j}(\theta_j,\beta_j)\,\big|\,\mathcal F_{j+sT-1}\big]=E\big(S^{(1)}_{s-1,j}(\theta_j,\beta_j)\mid\mathcal F_{j+sT-1}\big)=S^{(1)}_{s-1,j}(\theta_j,\beta_j),
\]

so \{S^{(1)}_{s,j}(\theta_j,\beta_j),\ \mathcal F_{j+sT},\ j=1,2,\ldots,T,\ s\in\mathbb N_0\} is a martingale. By Theorem 1.1 of Billingsley [42], we have

\[
\frac{1}{N}\sum_{s=0}^{N-1}V_{\theta_j}^{-2}\big(X_{j+sT}\mid X_{j+sT-1}\big)\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big)^2X_{j+sT-1}^2I^{(1)}_{j+sT-1}
\xrightarrow{a.s.}E\big[V_{\theta_j}^{-2}\big(X_j\mid X_{j-1}\big)\big(X_j-\alpha_{j1}X_{j-1}I^{(1)}_{j-1}-\alpha_{j2}X_{j-1}I^{(2)}_{j-1}-\lambda_j\big)^2X_{j-1}^2I^{(1)}_{j-1}\big]
=E\Big\{V_{\theta_j}^{-2}\big(X_j\mid X_{j-1}\big)X_{j-1}^2I^{(1)}_{j-1}\,E\big[\big(X_j-\alpha_{j1}X_{j-1}I^{(1)}_{j-1}-\alpha_{j2}X_{j-1}I^{(2)}_{j-1}-\lambda_j\big)^2\,\big|\,X_{j-1}\big]\Big\}
=E\big\{V_{\theta_j}^{-1}\big(X_j\mid X_{j-1}\big)X_{j-1}^2I^{(1)}_{j-1}\big\}\equiv H_{j,11}(\theta_j).
\]

Thus, by the central limit theorem for martingales, we get

\[
\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\xrightarrow{L}N\big(0,H_{j,11}(\theta_j)\big).
\]

Similarly,

\[
\frac{1}{\sqrt N}S^{(2)}_{N,j}(\theta_j,\beta_j)\xrightarrow{L}N\big(0,H_{j,22}(\theta_j)\big),\qquad
\frac{1}{\sqrt N}S^{(3)}_{N,j}(\theta_j,\beta_j)\xrightarrow{L}N\big(0,H_{j,33}(\theta_j)\big),
\]

where

\[
S^{(2)}_{N,j}(\theta_j,\beta_j)=\sum_{s=0}^{N-1}V_{\theta_j}^{-1}\big(X_{j+sT}\mid X_{j+sT-1}\big)\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big)X_{j+sT-1}I^{(2)}_{j+sT-1},
\]

and

\[
S^{(3)}_{N,j}(\theta_j,\beta_j)=\sum_{s=0}^{N-1}V_{\theta_j}^{-1}\big(X_{j+sT}\mid X_{j+sT-1}\big)\big(X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j\big).
\]

For any c = (c_1, c_2, …, c_T)' ∈ ℝ^{3T} \ {(0, 0, …, 0)'} with c_j = (c_{j1}, c_{j2}, c_{j3}), j = 1, 2, …, T, to simplify notation, let

\[
j+sT\ne i+kT,\quad i,j=1,2,\ldots,T,\ s,k\in\mathbb N_0,
\]
\[
u_{j,s}=c_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}+c_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}+c_{j3},
\]
\[
w_{j,s}=X_{j+sT}-E\big(X_{j+sT}\mid X_{j+sT-1}\big)=X_{j+sT}-\alpha_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}-\alpha_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}-\lambda_j,
\]

and let n_{T,N} be a constant depending on N and T. Then

\[
E\Big[\sum_{j=1}^{T}\sum_{s=0}^{N-1}V_{\theta_j}^{-1}\big(X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\big)u_{j,s}\Big]^2
=\sum_{j=1}^{T}\sum_{s=0}^{N-1}E\big[V_{\theta_j}^{-2}w_{j,s}^2u_{j,s}^2\big]+n_{T,N}\,E\big[V_{\theta_j}^{-1}V_{\theta_i}^{-1}w_{j,s}w_{i,k}u_{j,s}u_{i,k}\big].\tag{A1}
\]

For the first term on the right-hand side of Equation (A1), we have

\[
E\big[V_{\theta_j}^{-2}w_{j,s}^2u_{j,s}^2\big]=E\big\{E\big(V_{\theta_j}^{-2}w_{j,s}^2u_{j,s}^2\mid X_{j+sT-1}\big)\big\}=E\big\{V_{\theta_j}^{-2}u_{j,s}^2E\big(w_{j,s}^2\mid X_{j+sT-1}\big)\big\}=E\big[V_{\theta_j}^{-1}\big(c_{j1}X_{j+sT-1}I^{(1)}_{j+sT-1}+c_{j2}X_{j+sT-1}I^{(2)}_{j+sT-1}+c_{j3}\big)^2\big];
\]

for the second term on the right-hand side of Equation (A1), we have

\[
E\big[V_{\theta_j}^{-1}V_{\theta_i}^{-1}w_{j,s}w_{i,k}u_{j,s}u_{i,k}\big]=E\big\{V_{\theta_j}^{-1}V_{\theta_i}^{-1}u_{j,s}u_{i,k}E\big(w_{j,s}w_{i,k}\mid X_{j+sT-1},X_{i+kT-1}\big)\big\}=0,
\]

which implies that Cov(S_{N,j}, S_{N,i}) = 0, where S_{N,j} = (S^{(1)}_{N,j}(\theta_j,\beta_j), S^{(2)}_{N,j}(\theta_j,\beta_j), S^{(3)}_{N,j}(\theta_j,\beta_j))', i, j = 1, 2, …, T, i ≠ j. Then we have

\[
\frac{c_j'}{\sqrt N}\big(S^{(1)}_{N,j}(\theta_j,\beta_j),S^{(2)}_{N,j}(\theta_j,\beta_j),S^{(3)}_{N,j}(\theta_j,\beta_j)\big)'
=\frac{1}{\sqrt N}\sum_{i=1}^{3}c_{ji}S^{(i)}_{N,j}(\theta_j,\beta_j)
=\frac{1}{\sqrt N}\sum_{s=0}^{N-1}V_{\theta_j}^{-1}\big(X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\big)u_{j,s}
\xrightarrow{L}N\Big(0,\,E\big[V_{\theta_j}^{-1}\big(X_j\mid X_{j-1}\big)\big(c_{j1}X_{j-1}I^{(1)}_{j-1}+c_{j2}X_{j-1}I^{(2)}_{j-1}+c_{j3}\big)^2\big]\Big),\quad N\to\infty;
\]

that is, c_j'(S^{(1)}_{N,j}, S^{(2)}_{N,j}, S^{(3)}_{N,j})'/\sqrt N converges to a normal distribution with mean zero and variance E[V_{\theta_j}^{-1}(X_j \mid X_{j-1})(c_{j1}X_{j-1}I^{(1)}_{j-1}+c_{j2}X_{j-1}I^{(2)}_{j-1}+c_{j3})^2].

Thus, by the Cramér–Wold device, it follows that

\[
\frac{1}{\sqrt N}\begin{pmatrix}S^{(1)}_{N,1}(\theta_1,\beta_1)\\S^{(2)}_{N,1}(\theta_1,\beta_1)\\S^{(3)}_{N,1}(\theta_1,\beta_1)\\\vdots\\S^{(1)}_{N,T}(\theta_T,\beta_T)\\S^{(2)}_{N,T}(\theta_T,\beta_T)\\S^{(3)}_{N,T}(\theta_T,\beta_T)\end{pmatrix}
\xrightarrow{L}N\left(0,\begin{pmatrix}H_1(\theta)&0&\cdots&0\\0&H_2(\theta)&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&H_T(\theta)\end{pmatrix}\right),
\]

where the 0's are 3×3 null matrices. Now, we replace V_{\theta_j}(X_{j+sT} \mid X_{j+sT-1}) by V_{\hat\theta_j}(X_{j+sT} \mid X_{j+sT-1}), where \hat\theta_j is a consistent estimator of \theta_j. We aim to obtain the result

\[
\frac{1}{\sqrt N}\begin{pmatrix}S^{(1)}_{N,1}(\hat\theta_1,\beta_1)\\S^{(2)}_{N,1}(\hat\theta_1,\beta_1)\\S^{(3)}_{N,1}(\hat\theta_1,\beta_1)\\\vdots\\S^{(1)}_{N,T}(\hat\theta_T,\beta_T)\\S^{(2)}_{N,T}(\hat\theta_T,\beta_T)\\S^{(3)}_{N,T}(\hat\theta_T,\beta_T)\end{pmatrix}
\xrightarrow{L}N\left(0,\begin{pmatrix}H_1(\theta)&0&\cdots&0\\0&H_2(\theta)&\cdots&0\\\vdots&\vdots&\ddots&\vdots\\0&0&\cdots&H_T(\theta)\end{pmatrix}\right).\tag{A2}
\]

To prove (A2), it suffices to check the following:

\[
\frac{1}{\sqrt N}S^{(i)}_{N,j}(\hat\theta_j,\beta_j)-\frac{1}{\sqrt N}S^{(i)}_{N,j}(\theta_j,\beta_j)\xrightarrow{P}0,\quad j=1,2,\ldots,T,\ i=1,2,3,\ N\to\infty.\tag{A3}
\]

For any ε > 0 and δ > 0, we have

\[
P\Big(\Big|\frac{1}{\sqrt N}S^{(1)}_{N,j}(\hat\theta_j,\beta_j)-\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\Big|>\epsilon\Big)
\le\sum_{k=1}^{2}P\big(|\theta_{j1k}-\theta_{jk}|>\delta\big)+P\big(|\sigma_{z,j1}^2-\sigma_{z,j}^2|>\delta\big)+P\Big(\sup_{D}\Big|\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_{j1},\beta_j)-\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\Big|>\epsilon\Big),
\]

where \theta_{j1} = (\theta_{j11}, \theta_{j12}, \sigma_{z,j1}^2)' and D = \{\theta_{j1}: |\theta_{j11}-\theta_{j1}|<\delta,\ |\theta_{j12}-\theta_{j2}|<\delta,\ |\sigma_{z,j1}^2-\sigma_{z,j}^2|<\delta\}. Since \hat\theta_j is a consistent estimator of \theta_j, we only need to prove that

\[
P\Big(\sup_{D}\Big|\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_{j1},\beta_j)-\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\Big|>\epsilon\Big)\to0,\quad N\to\infty.
\]

By the Markov inequality,

\[
P\Big(\sup_{D}\Big|\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_{j1},\beta_j)-\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\Big|>\epsilon\Big)
\le\frac{1}{\epsilon^2}E\Big[\sup_{D}\Big(\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_{j1},\beta_j)-\frac{1}{\sqrt N}S^{(1)}_{N,j}(\theta_j,\beta_j)\Big)^2\Big]
\le\frac{1}{\epsilon^2}E\Big\{\sup_{D}\frac{1}{N}\sum_{s=0}^{N-1}\big(V_{\theta_{j1}}^{-1}-V_{\theta_j}^{-1}\big)^2\big(X_{j+sT}-E(X_{j+sT}\mid X_{j+sT-1})\big)^2X_{j+sT-1}^2I^{(1)}_{j+sT-1}\Big\}
=\frac{1}{\epsilon^2}E\Big\{\sup_{D}\big(V_{\theta_{j1}}^{-1}(X_j\mid X_{j-1})-V_{\theta_j}^{-1}(X_j\mid X_{j-1})\big)^2\big(X_j-\alpha_{j1}X_{j-1}I^{(1)}_{j-1}-\alpha_{j2}X_{j-1}I^{(2)}_{j-1}-\lambda_j\big)^2X_{j-1}^2I^{(1)}_{j-1}\Big\}
=\frac{1}{\epsilon^2}E\Big\{\sup_{D}\big[(\theta_{j1}-\theta_{j11})X_{j-1}I^{(1)}_{j-1}+(\theta_{j2}-\theta_{j12})X_{j-1}I^{(2)}_{j-1}+(\sigma_{z,j}^2-\sigma_{z,j1}^2)\big]^2V_{\theta_{j1}}^{-2}(X_j\mid X_{j-1})V_{\theta_j}^{-1}(X_j\mid X_{j-1})X_{j-1}^2I^{(1)}_{j-1}\Big\}
\le\frac{1}{\epsilon^2}\sup_{D}\Big[(\theta_{j1}-\theta_{j11})^2m_1+(\theta_{j2}-\theta_{j12})^2m_2+(\sigma_{z,j}^2-\sigma_{z,j1}^2)^2m_3+2m_4|\theta_{j1}-\theta_{j11}||\theta_{j2}-\theta_{j12}|+2m_5|\theta_{j1}-\theta_{j11}||\sigma_{z,j}^2-\sigma_{z,j1}^2|+2m_6|\theta_{j2}-\theta_{j12}||\sigma_{z,j}^2-\sigma_{z,j1}^2|\Big]
\le\frac{c\delta^2}{\epsilon^2},
\]

where m_1, m_2, …, m_6 are some finite moments of the process {X_t} under assumption (C2), and c is a positive constant. A similar argument applies to S^{(2)}_{N,j}(\theta_j,\beta_j)/\sqrt N and S^{(3)}_{N,j}(\theta_j,\beta_j)/\sqrt N, j = 1, …, T. Letting δ → 0, we obtain (A3).

By the ergodic theorem, we have

\[
\frac{1}{N}Q_N\xrightarrow{P}H(\theta).
\]

After some calculation, we have

\[
Q_N\big(\hat\beta_{MQL}-\beta\big)=\big(S^{(1)}_{N,1}(\hat\theta_1,\beta_1),S^{(2)}_{N,1}(\hat\theta_1,\beta_1),S^{(3)}_{N,1}(\hat\theta_1,\beta_1),\ldots,S^{(1)}_{N,T}(\hat\theta_T,\beta_T),S^{(2)}_{N,T}(\hat\theta_T,\beta_T),S^{(3)}_{N,T}(\hat\theta_T,\beta_T)\big)'.
\]

Therefore,

\[
\sqrt N\big(\hat\beta_{MQL}-\beta\big)\xrightarrow{L}N\big(0,H^{-1}(\theta)\big).
\]

This completes the proof. □
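In practice, the MQL estimator for season j solves S^{(1)}_{N,j} = S^{(2)}_{N,j} = S^{(3)}_{N,j} = 0. The following minimal numerical sketch uses a generic root finder with a plug-in conditional variance V_θ, as in the proof; the function and variable names are our own simplification, not the authors' implementation.

import numpy as np
from scipy.optimize import root

def quasi_score(beta, curr, prev, low, theta):
    """Season-j quasi-score (S1, S2, S3) for beta = (alpha1, alpha2, lam);
    theta = (theta1, theta2, sigma2) parameterises the plug-in variance.
    curr, prev are the season-j pairs (X_{j+sT}, X_{j+sT-1}); low is the
    boolean regime indicator I^{(1)} (a numpy bool array)."""
    a1, a2, lam = beta
    th1, th2, s2 = theta
    mean = np.where(low, a1, a2) * prev + lam          # E(X_t | X_{t-1})
    V = np.where(low, th1, th2) * prev + s2            # V_theta(X_t | X_{t-1})
    resid = (curr - mean) / V
    return [np.sum(resid * prev * low),                # S1 (lower regime)
            np.sum(resid * prev * ~low),               # S2 (upper regime)
            np.sum(resid)]                             # S3 (intercept)

# usage sketch:
# sol = root(quasi_score, x0=[0.3, 0.3, 1.0], args=(curr, prev, low, theta_hat))
# a1_hat, a2_hat, lam_hat = sol.x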

Author Contributions

Conceptualization, C.L. and D.W.; methodology, C.L.; software, C.L.; validation, C.L., J.C. and D.W.; formal analysis, C.L.; investigation, C.L. and D.W.; resources, C.L. and D.W.; data curation, C.L.; writing—original draft preparation, C.L.; writing—review and editing, J.C. and D.W.; visualization, C.L.; supervision, J.C. and D.W.; project administration, D.W.; funding acquisition, D.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (No. 11871028, 11731015, 11901053, 12001229), and the Natural Science Foundation of Jilin Province (No. 20180101216JC).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset is available in Freeland [36].

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Al-Osh M.A., Alzaid A.A. First-order integer-valued autoregressive (INAR(1)) process. J. Time Ser. Anal. 1987;8:261–275. doi:10.1111/j.1467-9892.1987.tb00438.x.
2. Du J., Li Y. The integer-valued autoregressive (INAR(p)) model. J. Time Ser. Anal. 1991;12:129–142.
3. Jung R.C., Ronning G., Tremayne A.R. Estimation in conditional first order autoregression with discrete support. Stat. Pap. 2005;46:195–224. doi:10.1007/BF02762968.
4. Weiß C.H. Thinning operations for modeling time series of counts-a survey. AStA Adv. Stat. Anal. 2008;92:319–341. doi:10.1007/s10182-008-0072-3.
5. Ristić M.M., Bakouch H.S., Nastić A.S. A new geometric first-order integer-valued autoregressive (NGINAR(1)) process. J. Stat. Plan. Inference 2009;139:2218–2226. doi:10.1016/j.jspi.2008.10.007.
6. Zhang H., Wang D., Zhu F. Inference for INAR(p) processes with signed generalized power series thinning operator. J. Stat. Plan. Inference 2010;140:667–683. doi:10.1016/j.jspi.2009.08.012.
7. Li C., Wang D., Zhang H. First-order mixed integer-valued autoregressive processes with zero-inflated generalized power series innovations. J. Korean Stat. Soc. 2015;44:232–246. doi:10.1016/j.jkss.2014.08.004.
8. Kang Y., Wang D., Yang K. A new INAR(1) process with bounded support for counts showing equidispersion, underdispersion and overdispersion. Stat. Pap. 2021;62:745–767. doi:10.1007/s00362-019-01111-0.
9. Yu M., Wang D., Yang K., Liu Y. Bivariate first-order random coefficient integer-valued autoregressive processes. J. Stat. Plan. Inference 2020;204:153–176. doi:10.1016/j.jspi.2019.05.004.
10. Tong H. On a threshold model. In: Chen C.H., editor. Pattern Recognition and Signal Processing. Sijthoff and Noordhoff; Amsterdam, The Netherlands: 1978. pp. 575–586.
11. Tong H., Lim K.S. Threshold autoregression, limit cycles and cyclical data. J. R. Stat. Soc. B 1980;42:245–292. doi:10.1111/j.2517-6161.1980.tb01126.x.
12. Monteiro M., Scotto M.G., Pereira I. Integer-valued self-exciting threshold autoregressive processes. Commun. Stat. Theory Methods 2012;41:2717–2737. doi:10.1080/03610926.2011.556292.
13. Wang C., Liu H., Yao J., Davis R.A., Li W.K. Self-excited threshold Poisson autoregression. J. Am. Stat. Assoc. 2014;109:777–787. doi:10.1080/01621459.2013.872994.
14. Yang K., Li H., Wang D. Estimation of parameters in the self-exciting threshold autoregressive processes for nonlinear time series of counts. Appl. Math. Model. 2018;57:226–247. doi:10.1016/j.apm.2018.01.003.
15. Yang K., Wang D., Jia B., Li H. An integer-valued threshold autoregressive process based on negative binomial thinning. Stat. Pap. 2018;59:1131–1160. doi:10.1007/s00362-016-0808-1.
16. Bennett W.R. Statistics of regenerative digital transmission. Bell Syst. Tech. J. 1958;37:1501–1542. doi:10.1002/j.1538-7305.1958.tb01560.x.
17. Gladyshev E.G. Periodically and almost-periodically correlated random processes with a continuous time parameter. Theory Probab. Appl. 1963;8:173–177. doi:10.1137/1108016.
18. Bentarzi M., Hallin M. On the invertibility of periodic moving-average models. J. Time Ser. Anal. 1994;15:263–268. doi:10.1111/j.1467-9892.1994.tb00191.x.
19. Lund R., Basawa I.V. Recursive prediction and likelihood evaluation for periodic ARMA models. J. Time Ser. Anal. 2000;21:75–93. doi:10.1111/1467-9892.00174.
20. Basawa I.V., Lund R. Large sample properties of parameter estimates for periodic ARMA models. J. Time Ser. Anal. 2001;22:651–663. doi:10.1111/1467-9892.00246.
21. Shao Q. Mixture periodic autoregressive time series models. Stat. Probab. Lett. 2006;76:609–618. doi:10.1016/j.spl.2005.09.015.
22. Monteiro M., Scotto M.G., Pereira I. Integer-valued autoregressive processes with periodic structure. J. Stat. Plan. Inference 2010;140:1529–1541. doi:10.1016/j.jspi.2009.12.015.
23. Hall A., Scotto M., Cruz J. Extremes of integer-valued moving average sequences. Test 2010;19:359–374. doi:10.1007/s11749-009-0158-6.
24. Santos C., Pereira I., Scotto M.G. On the theory of periodic multivariate INAR processes. Stat. Pap. 2021;62:1291–1348. doi:10.1007/s00362-019-01136-5.
25. Pereira I., Scotto M.G., Nicolette R. Integer-valued self-exciting periodic threshold autoregressive processes. In: Gonçalves E., Oliveira P.E., Tenreiro C., editors. Contributions in Statistics and Inference. Celebrating Nazaré Mendes Lopes' Birthday. Vol. 47. Mathematics Department of the University of Coimbra; Coimbra, Portugal: 2015. pp. 81–92.
26. Manaa A., Bentarzi M. On a periodic SETINAR model. Commun. Stat. Simul. Comput. 2021. doi:10.1080/03610918.2021.1874416.
27. Li D., Tong H. Nested sub-sample search algorithm for estimation of threshold models. Stat. Sin. 2016;26:1543–1554. doi:10.5705/ss.2013.394t.
28. Wedderburn R.W.M. Quasi-likelihood functions, generalized linear models and the Gauss-Newton method. Biometrika 1974;61:439–447.
29. Azrak R., Mélard G. The exact quasi-likelihood of time-dependent ARMA models. J. Stat. Plan. Inference 1998;68:31–45. doi:10.1016/S0378-3758(97)00134-1.
30. Christou V., Fokianos K. Quasi-likelihood inference for negative binomial time series models. J. Time Ser. Anal. 2014;35:55–78. doi:10.1111/jtsa.12050.
31. Li H., Yang K., Wang D. Quasi-likelihood inference for self-exciting threshold integer-valued autoregressive processes. Comput. Stat. 2017;32:1597–1620. doi:10.1007/s00180-017-0748-9.
32. Yang K., Kang Y., Wang D., Li H., Diao Y. Modeling overdispersed or underdispersed count data with generalized Poisson integer-valued autoregressive processes. Metrika 2019;82:863–889. doi:10.1007/s00184-019-00714-9.
33. Zheng H., Basawa I.V., Datta S. Inference for pth-order random coefficient integer-valued autoregressive processes. J. Time Ser. Anal. 2006;27:411–440. doi:10.1111/j.1467-9892.2006.00472.x.
34. Schuster A. On the investigation of hidden periodicities with application to a supposed 26 day period of meteorological phenomena. Terr. Magn. 1898;3:13–41. doi:10.1029/TM003i001p00013.
35. Fisher R.A. Tests of significance in harmonic analysis. Proc. R. Soc. Lond. A 1929;125:54–59.
36. Freeland R.K. Statistical Analysis of Discrete Time Series with Application to the Analysis of Workers' Compensation Claims Data. Ph.D. Thesis. Management Science Division, Faculty of Commerce and Business Administration, University of British Columbia; Vancouver, BC, Canada: 1998.
37. Möller T.A., Silva M.E., Weiß C.H., Scotto M.G., Pereira I. Self-exciting threshold binomial autoregressive processes. AStA Adv. Stat. Anal. 2016;100:369–400. doi:10.1007/s10182-015-0264-6.
38. Li H., Yang K., Zhao S., Wang D. First-order random coefficients integer-valued threshold autoregressive processes. AStA Adv. Stat. Anal. 2018;102:305–331. doi:10.1007/s10182-017-0306-3.
39. Homburg A., Weiß C.H., Alwan L.C., Frahm G., Göb R. Evaluating approximate point forecasting of count processes. Econometrics 2019;7:30. doi:10.3390/econometrics7030030.
40. Tweedie R.L. Sufficient conditions for regularity, recurrence and ergodicity of Markov processes. Proc. Camb. Philos. Soc. 1975;78:125–136. doi:10.1017/S0305004100051562.
41. Zheng H., Basawa I.V. First-order observation-driven integer-valued autoregressive processes. Stat. Probab. Lett. 2008;78:1–9. doi:10.1016/j.spl.2007.04.017.
42. Billingsley P. Statistical Inference for Markov Processes. The University of Chicago Press; Chicago, IL, USA: 1961.


