Skip to main content
Entropy logoLink to Entropy
. 2025 May 17;27(5):537. doi: 10.3390/e27050537

Bootstrap Confidence Intervals for Multiple Change Points Based on Two-Stage Procedures

Li Hou 1, Baisuo Jin 1,*, Yuehua Wu 2,*, Fangwei Wang 1
Editor: Nikolay K Vitanov
PMCID: PMC12110430  PMID: 40422491

Abstract

This paper investigates the construction of confidence intervals for multiple change points in linear regression models. First, we detect multiple change points by performing variable selection on blocks of the input sequence; second, we re-estimate their exact locations in a refinement step. Specifically, we exploit an orthogonal greedy algorithm to recover the number of change points consistently in the cutting stage, and employ the sup-Wald-type test statistic to determine the locations of multiple change points in the refinement stage. Based on a two-stage procedure, we propose bootstrapping the estimated centered error sequence, which can accommodate unknown magnitudes of changes and ensure the asymptotic validity of the proposed bootstrapping method. This enables us to construct confidence intervals using the empirical distribution of the resampled data. The proposed method is illustrated with simulations and real data examples.

Keywords: change point, bootstrap, confidence interval, OGA, sup-Wald-type test

1. Introduction

Multiple change point detection is common in many applications, such as signal processing [1], medical diagnosis [2], industrial control [3], and oceanography [4]. For example, in the case of the detection of changes in one’s heart rates, the accurate identification of the change points can decompose the data into stationary segments, allowing clinicians to identify the behavior of the heart and diagnose diseases. The detection of multiple change points has important practical implications, which has prompted extensive research in the statistical community.

Implementing appropriate hypothesis testing is an important method for detecting multiple change points. Here are some examples. Ref. [5] proposed sup-Wald-type tests to test the null hypothesis of no change against alternative hypotheses containing an arbitrary number of changes to identify multiple structural changes in linear models. At the same time, the null hypothesis of s changes against s+1 changes is tested to determine the number of breakpoints. Later, ref. [6] developed dynamic programming principles to estimate multiple change points in linear regression. Ref. [7] proposed a genetic algorithm for detecting multiple breakpoints. However, these methods are very time-consuming. There is another line of research on multiple change point detection that transforms the problem into a variable selection problem. For example, ref. [8] used LASSO [9] to estimate the locations of multiple change points in a one-dimensional piecewise constant signal observed in white noise. Ref. [10] introduced a two-stage procedure based on adaptive LASSO [11], SCAD [12], and MCP [13] regularization methods that can simultaneously detect multiple change points in linear models. In addition, the two-stage procedure has been extended to accelerated failure time (AFT) models [14].

Despite the aforementioned developments, research on the inference for multiple change points remains limited. Under the null hypothesis, asymptotic [15] or approximate [16] distributions of test statistics have been derived, which enables quantifying uncertainty in the number of change points. For many testing procedures in change point analysis, the computation of critical values often relies on the asymptotic behavior of the test statistic under the null hypothesis. However, the convergence of the test statistic to its limiting distribution is often slow, and in some cases, the exact form of this distribution remains unknown. It has been noted that the bootstrap method is a computer-based statistical inference technique that can provide answers to a variety of statistical questions without relying on formulas. For example, ref. [17] proposed a bootstrap method in which the estimated error sequence is resampled with replacement to obtain confidence intervals for change points. Ref. [18] introduced an asymptotically valid confidence region for a single change point through the inversion of bootstrap tests. Ref. [19] studied the application of a circular overlapping block bootstrap method in the context of an at-most-one-change time series model with an abrupt change in the mean and dependent errors. In the field of a single change point estimation, the bootstrap technique has emerged as a valuable tool for approximating unknown probability distributions and the characteristics of change point estimators. These methods make it possible for us to provide confidence intervals for multiple change points. Ref. [20] addressed this problem in mean shift models by proposing the bootstrap construction of pointwise and uniform confidence intervals for multiple change points based on a moving sum procedure.

This paper aims to construct confidence intervals for multiple change points. By [10], the multiple change point detection problem can be formulated as a model selection problem. Before using bootstrapping, it is important to ensure that the estimates of the number of change points and the estimates of the within-segment parameters are consistent. However, regularization methods such as the LASSO and SCAD may suffer from the bias problem inherited from the penalty function [12,21]. Specifically, LASSO may select more irrelevant variables, leading to an overestimation of the number of change points. To alleviate this problem, we consider adopting the L0-regularization method to achieve consistency between the estimates of the number of change points and their locations. Although the subset selection, while unbiased, is often computationally expensive, several optimization strategies and algorithms have been proposed to overcome the computational difficulties. For example, ref. [22] introduced an iterative algorithm called the orthogonal greedy algorithm (OGA) for high-dimensional regression models, which sequentially selects input variables to be included in the linear regression model and proposed a procedure named OGA + HDIC + Trim. OGA is a fast stepwise forward regression method that starts with a null model and adds predictors via component-wise linear least squares estimation. HDIC is a high-dimensional information criterion used to enter predictors into the model along the OGA path. The Trim method defines a subset to exclude additional irrelevant variables from OGA + HDIC. To construct confidence intervals for multiple change points, we proceed as follows. We first cut the data sequence into segments. Due to its model selection consistency and expected convergence properties, we apply OGA + HDIC + Trim to find data segments containing change points. We then utilize sup-Wald-type test statistics to locate them. Finally, we apply the bootstrap method to construct confidence intervals for multiple change points using the bootstrap estimated centered error sequence.

The main contributions of this paper are as follows. As described above, we first propose a two-stage procedure to detect multiple change points by using OGA + HDIC + Trim procedure and sup-Wald-type test statistics. In the first stage, we cut the data sequence into segments. In addition, we give the asymptotic distributions of the change point estimators under certain conditions. Based on this framework, we further explore the application of bootstrapping techniques in constructing confidence intervals for the change points and demonstrate the validity of the bootstrap method, i.e., the proposed bootstrap 100(1α)%-confidence intervals asymptotically attain the coverage probability of 1α for a given α(0,1) [20]. Last but not least, we illustrate the effectiveness and applicability of the proposed method through extensive simulation studies and a real data example, respectively.

The rest of this paper is organized as follows. Section 2 details the detection of multiple change points using the OGA + HDIC + Trim procedure and sup-Wald-type test statistics. Furthermore, Section 3 introduces the resampling bootstrap method for the change point estimators based on a two-stage procedure and Section 4 gives the theoretical properties of the proposed bootstrap method. Section 5 presents extensive simulation studies. Section 6 applies the proposed method to the seismograms of the 1982 Urakawa–Oki earthquake. Technical proofs of the main results are relegated to the Appendix A. In this paper, vectors and matrices are denoted in bold type.

2. Multiple Change Point Detection Based on Two-Stage Procedures

Assume that (xi,yi),i=1,,n, satisfy the following linear regression model with s change points located at a1<<as as

yi=xiβ1+l=1sδlI(al<in)+εi=xiβ1+εi,if1ia1,xi(β1+δ1)+εi,ifa1<ia2,xi(β1+l=1sδl)+εi,ifas<in, (1)

where n is the sample size, xi=(xi,1,,xi,q) is a sequence of q-dimensional predictors; β1=(β1,1,,βq,1)0 is an unknown q-dimensional vector of regression coefficients; s is an unknown number of change points; 1<a1<<as<n are unknown change points; δl=(δ1,l,,δq,l),l=1,,s, are unknown changes in regression coefficient vectors at change points; and εi’s are unobservable random errors.

The estimation of the regression coefficients β1 and (β1+l=1jδl),j=1,s is hampered by the unknown parameters s, a1,,as. In light of [23], we propose to replace s in (1) with a predetermined number of segments, denoted by pn, to facilitate the estimation of the regression coefficients. This replacement allows us to identify the total number of change points. Specifically, we partition the data sequence into pn segments, where pn as n. This ensures that all segments excluding the first segment have length m, while the length of the first segment is n(pn1)m. Let Q1=1,2,,npn1m and Ql=npnl+1m+1,,npnlm, l=2,,pn. See (3) for more details on segmentation. We assume that m<min(aj+1aj)/2 for j=1,,s, so that each segment Ql, for l=1,,pn, contains at most one change point.

We denote the segment as kj if ajQkj. This partitioning results in the following model:

yi=xiβ1+l=2pndlInpnl+1m<inωi+εi, (2)

where

dkj=δj0,forj=1,,s,dl=0,foranylk1,,ks,

and ωi=dkjI(iTj) with Tj=npnkj+1) m+1,,aj. ωi=0 for all ij=1sQkj.

Remark 1.

By the definition of m, any two segments also contain at most one change point. From (2), it follows that

yi=xiβ1+l=1j1δl+εi,ifiQkj1,xiβ1+l=1jδlωi+εi,ifiQkj,xiβ1+l=1jδl+εi,ifiQkj+1.

Based on the partition, we can obtain least-squares estimates γ^l for l=kj1,kj,kj+1 within each segment.

  • 1. 

    When aj=n(pnkj)m, the change point coincides with the pre-specified cut-point and ωi=δj. The regression coefficients corresponding to the three segments are denoted by γl,l=kj1,kj,kj+1, and are equal to β1+l=1j1δl, β1+l=1j1δl, and β1+l=1jδl, respectively. The segment Qkj that contains the change point aj can be identified by γ^kj+1γ^kj0.

  • 1. 

    When aj<n(pnkj)m, due to the existence of ωi, the linear regression model is misspecified on Qkj, causing γ^kj to converge to the pseudo-true value γkj under model misspecification [24,25]. Since γkj is different from both β1+l=1j1δl and β1+l=1jδl, Qkj can be determined by the following: γ^kjγ^kj10 and γ^kj+1γ^kj0.

According to Remark 1, we reformulate the change-point detection problem as a high-dimensional variable selection task by constructing differences between coefficients. Therefore, (2) can be rewritten in matrix form as follows:

yn=Xnθ+en+εn, (3)

where yn=(y1,y2,,yn),

Xn=X(1),,X(pn)=X(1)00X(2)X(2)0XpnXpnXpn,

X(1)=(x1,,xn(pn1)m) and X(j)=(xn(pnj+1)m+1,,xn(pnj)m). In addition, the definitions of θ=(θ1,,θpn)=β1,(γ2β1),,(γpnγpn1), εn=(ε1,,εn), and en represents the artificial n-dimensional error due to model misspecification, whose elements are equal to zero for all ij=1sQkj.

It can be seen that θl=0 if l1,kj, or kj+1 for j=1,,s. Let An={1,k1,k1+1,k2,k2+1,,ks,ks+1}. It is important to note that θl=0q×1 is evident for all lAn. Therefore, the estimation of An comes down to identifying the non-zero elements from θ, which is the focus of the subsequent subsections.

2.1. Segment Selection

By the definition of m, each segment Qkj contains a change point. In this subsection, our goal is to select segments Qkj for j=1,,s. From the above statement, we can see that selecting all non-zero elements of θ can yield the estimation of An effectively. Therefore, after the cutting stage, the problem of detecting multiple change points is reduced to a variable selection problem in high-dimensional scenarios. We use the OGA + HDIC + Trim method for the segment selection.

We rewrite the model (3) as follows:

yn=j=1rnzjθj+ε˜n, (4)

where rn=pn·q, {z1,,zrn} are all the column vectors of xn, ε˜n=en+εn and θ=(θ1,,θrn). Without loss of generality, replace yn by ynyn1n and zj by zjzj1n. Define σ^Jd2=n1yn(InHJd)yn, where HJd is the orthogonal projection matrix onto the linear span of {zj,jJd}. For convenience, denote H=0. The proposed OGA+HDIC+Trim algorithm is outlined in Algorithm 1.

Algorithm 1 OGA + HDIC + Trim
Require: response vector ynRn, regressor matrix XnRn×rn.
 Initialzation: set d=0, r(0)=yn and J^0=.
While d=0,,Dn do
    Compute j^d+1=argmax1jrn,jJ^d1|zjr(d)|/(n1/2zj) and update J^d+1=J^dj^d+1;
    Compute zj^d+1 via zj^d+1=(In=1dHj^)zj^d+1, where Hj^=zj^zj^/zj^2;
    Compute r(d+1) via r(d+1)=(InHj^d+1)r(d).
end
    Compute the minimum of HDIC via
HDIC(J^d)=logσ^J^d2+(J^d)cnlog(rn)/n,d^n=argmin1dDnHDIC(J^d). (5)
return J^n via
J^n={j^:HDICJ^d{j^}>HDIC(J^d),1d^n},ifd^n>1;{j^1},ifd^n=1.

In this context, the value of Dn denotes the maximum number of iterations, and the convergence rate theory of OGA in [22] shows that Dn=O(n1/2(logrn)1/2). The parameter cn satisfies the conditions cn and cnlogrn=o(n12γ), where γ[0,1). Let θ^=(θ^1,,θ^pn) be the estimate obtained by applying the OGA+HDIC+Trim procedure. Therefore, we derive an estimate of An from J^n as A^n={l:θ^l0,l=1,,pn}. Denote

C^n=l:lA^n,l+1A^n,l=2,,pn+1=k^1,,k^s^, (6)

where k^1<<k^s^. It is clear that if l+1A^n,lA^n and l1A^n, then lC^n and l1C^n. Let Q(j)=QkjQkj+1. It only includes the change point aj since m<minj(aj+1aj)/2, and it ensures that no change point overlaps with the cut-point of Q(j). By following the steps outlined above and considering that the Wald test cannot detect the location of the change point at the partition boundary, we derive the following selected segments:

Q^(j)=Q^k^jQ^k^j+1={n(pnk^j+1)m+1,n(pnk^j1)m}, (7)

where Q^k^j={n(pnk^j+1)m+1,,n(pnk^j)m}.

2.2. Refining

By Theorem 1 in Section 4, s^ converges to s as n. Hence, we assume that for a large n, there exists Q^(j) such that ajQ^(j). We now show how to estimate this change point. Note that for iQ^(j)={n(pnk^j+1)m+1,n(pnk^j1)m}, we have

yi=xiβj+δjI(aj<inj(r))+εi,nj(l)inj(r), (8)

where nj(l)=n(pnk^j+1)m+1 and nj(r)=n(pnk^j1)m. Here, δj=βj+1βj and βj and βj+1 are unknown q-dimensional vectors of the regression coefficients on the line segment {nj(l),aj} and {aj,nj(r),}, respectively. nj(r)nj(l)+1=2m.

We compute the sup-Wald test statistics [26] and estimate aj by

a^j,h=argmaxhδ^j;hZj,hMjZj,hδ^j;h,nj(l)+q<h<nj(r)q, (9)

where Zj,h=(0,,0,xh+1,xh+2,,xnj(r)), Xj=(xnj(l),xnj(l)+1.,xnj(r)), and Mj=InXj(XjXj)1xj. We also obtain the estimates (β^j;h,δ^j;h) of (βj,δj) by regressing y(j)=(ynj(l),,ynj(r)) on Xj and Zj,h, respectively. The limiting behavior of a^j,h is given in Section 4.

Since the multiple change points are not dense in the data sequence, without loss of generality, we assume that m<min(aj+1aj)/2,j=1,,s1, so that each segment Q(j) contains at most one change point. Note that if m is too small, it leads to inconsistency in estimating the regression parameters and increases the computational time. Therefore, we need to avoid choosing too small a value for m. To address this issue, we define m=c0n according to Theorem 2 in Section 3, where c0 serves as a tuning parameter, and · is the ceiling function. We study the range of values of c0 on the interval [0.1,1.5]. The final value of m is determined using the Bayesian Information Criterion (BIC) as follows:

m^=argminmlogi=1nyixiβ^1,h+j=1s^δ^j,hI(a^j,h<in)2+s^·qlogn.

3. Bootstrap Confidence Intervals for Multiple Change Points

In this section, we construct bootstrap confidence intervals for multiple change points. It helps us to study the behavior of multiple change points in a linear regression model. Obviously, the two-stage procedure inherently involves the quantification of uncertainty in the number of change points and their respective locations.

We define the estimated residuals ε^i and the centered residuals ε˜i as follows:

ε^i=yixiβ^1,h+l=1s^δ^l,hI(a^l,h<in),ε˜i=ε^i1a^j,ha^j1,hl=a^j1,h+1a^j,hε^l,a^j1,h<ia^j,h, (10)

where a^0,h=1 and a^s^+1,h=n. Let εa^j1,h+1,,εa^j,h be independently and identically distributed (i.i.d.) random variables sampled from the empirical distribution function of {ε˜a^j1,h+1,,ε˜a^j,h}. We then consider the bootstrap observations defined as follows:

yi=xiβ^1,h+l=1s^δ^l,hI(a^l,h<in)+εi,i=1,,n, (11)

where εi represents the bootstrapped version of the residuals. To obtain an approximation to the distribution of a^j;h in (9), a bootstrap statistic of the estimate is defined as

a^j;hb=argmaxhbδ^j;hbzj,hbMjZj,hbδ^j;hb,j=1,,s^, (12)

where the number of change points s^ and the C^n are obtained by performing OGA on yn=(y1,,yn) as in (6). The data sequence segmentation remains unchanged. In this case, Zj,hb=(0,,0,xhb+1,,xnj(r)), and β^j;hb,δ^j;hb represents the least-squares estimates obtained by regressing y(j)=(ynj(l),,ynj(r)) on Xj and Zj,hb. Next, we describe the construction of bootstrap CIs for multiple change points.

  • 1.

    We generate a bootstrap sample (y1,,yn) by randomly sampling residuals (ε1,,εn) from the set {ε˜1,,ε˜n} as in (11);

  • 2.

    We apply the two-stage procedure and compute the local maximizer obtained as in (12) for each estimated segment;

  • 3.

    For a given bootstrap sample size B, we repeat Steps 1-2 B times and record a^j;hb(b), j=1,,s^, where b=1,,B.

Therefore, the bootstrap-based approximation for the change point aj can be constructed. Generally, for any α(0,1), the bootstrap 100(1α)% confidence interval for the change point aj is given by the following:

CIsj(α)=a^j,h+qU(α/2),a^j,h+qL(α/2), (13)

where qU(α/2)=supx;1Bb=1BIa^j,hb(b)a^j;hxα/2, and qL(α/2)=infx;1Bb=1BIa^j,hb(b)a^j;hx1α/2.

If C^nC^n, then a^j;hb is an estimate of a^j;h for j=1,,s^. If some elements of C^n are not in the set C^n, then a^j;h has no corresponding bootstrap estimate for some j. Hence, the bootstrap CI of a^j;h is constructed using {a^j;hb(b):a^j;hb(b),b=1,,B} instead of {a^j;hb(1),,a^j;hb(B)} in (13), which yields

CIsj(α)=a^j,h+qU(α/2),a^j,h+qL(α/2), (14)

where

qU(α/2)=supx:1Bb=1BIa^j,hb(b)a^j;hxα/2,

and

qL(α/2)=infx:1Bb=1BIa^j,hb(b)a^j;hx1α/2.

4. Theoretical Validity of the Bootstrap Confidence Intervals

To investigate the performance of the bootstrap CIs for multiple change points, we make the following assumptions.

Assumption 1.

If s1, then aj/nτj>0 for 1js. Furthermore, if s2, then min1js1τj+1τj>0.

Assumption 2.

εi,i=1,2,,n is a sequence of independently and identically distributed random variables with E(εi)=0 and E(εi2)=σ2.

Assumption 3.

ajnj(l)=2τjm, where τj(0,1) and · is the ceilling function.

Assumption 4.

sup(t2t1)1i=t1t2xixi/(t2t1) is stochastically bounded. εi is independent of the regressor xj for all i and j.

Assumption 5.

δj0, and δj1(2m)1/2+α=o(1) for some α(0,1/2) as n.

By Assumption 1, it follows that m/(aj+1aj)0, i.e., there is at most one change point in each segment for a large n. Assumption 2 follows that the residuals are independent and identically distributed and justifies the use of bootstrapping to estimate the central error, helping generate the sample distribution of the change points. Assumption 3 assumes that the shift point is bounded from the endpoints for asymptotic purposes. Assumption 4 requires that there is enough data around the change point and at the beginning and end of the sample so that the change point can be identified. The asymptotic distribution of a^j,h depends on various unknown quantities, with the magnitude of the change δj being the most significant. Assumption 5 is the minimum signal amplitude of the regression coefficient in the high-dimensional setting. The shift amplitude cannot be too small; otherwise, the change point will not be identified. Based on this, we give the necessary Assumption 5. Next, we establish the consistency of the number of change points s^ and the change point estimator a^j,h. The following theorem provides the consistency of the estimated number of change points.

Theorem 1.

Suppose that m, pn, Dn=O(n/log(rn))1/2, and log(rn)/n0 as n. Under Assumptions 1–5, we have

limnP(s^=s)=1;limnPajQ^(j),j=1,,ss^=s=1, (15)

where s^ and k^j,j=1,,s^ are given in (6).

Theorem 1 extends Theorem 4 in [22] to the multiple change points detection case. The consistency and asymptotic distribution of change points estimators are given below.

Theorem 2.

If i=t1t2xixi/t1t2pV as t2t1, under Assumptions 1–4, we have that when n, a^j,haj=Op(δj2) and

δjVδjσ2(a^j,haj)dargmax{W(c)|c|/2:cR},

where V is a strictly positive definite matrix, {W(c):cR} is a two-sided Wiener process, and δj is a fixed value or satisfies δj0, as specified in Assumption 5.

Subsequently, we establish the validity of the bootstrap CIs in (14). For future reference, we define some notations here. Any symbol with a superscript ∗ denotes an object under the bootstrap probability measure, rather than the original measure used in some of the other sections. For example, E(·) denotes the conditional expectation with respect to the bootstrap probability measure conditional on the original data. Similarly, P(·) denotes the conditional probability under the bootstrap measure.

Theorem 3.

Under the assumptions of Theorem 2, we have

supxR|P(a^j,hba^j,hx)P(a^j,hajx)|p0.

The proofs of Theorems 2 and 3 are given in Appendix A.

Combining Theorem 3 with Theorems 1 and 2 establishes the validity of the bootstrap method for multiple change points.

Corollary 1.

Under Assumptions 1–5, we have that, as n,

supxRPj=1sa^j,hba^j,hxPj=1sa^j,hajxp0.

Since PajCIsj(α)1α for each j=1,,s, by the Bonferroni correction, we have Pj=1sajCIsj(α/s)1α. The asymptotic validity of the proposed bootstrap CIs in (14) follows.

5. Simulation

In this section, we first present the simulation results for change point detection given in Section 2 and compare them with the two-stage multiple change point detection procedure involving LASSO (TSMCDlasso) by [10]. We also construct confidence intervals using the bootstrap method proposed in Section 3. We denote the CIs in (14) by bootstrapoga,wald.

5.1. Detection of Multiple Change Points

We consider the simulation setting where the change points aj, j=1,2,3 are, respectively, 150, 300, and 450, respectively, and generate data from the model

yt=2cos(tπ/30)+2sin(tπ/30)+0.1yt1+(3cos(tπ/30)+sin(tπ/30)+0.2yt1)I(150,600](t)+(2cos(tπ/30)0.3yt1)I(300,600](t)+(2cos(tπ/30)+2sin(tπ/30))I(450,600](t)+εt, (16)

where ε1,,εn are independent and follow the standard normal distribution. The simulated data are shown in Figure 1. In this model, {yt,t=1,,600} represents a periodic autocorrelation sequence with a period of 30 and an autocorrelation order of 1.

Figure 1.

Figure 1

Simulated regression data with change points indicated by dotted lines.

We perform 1000 Monte Carlo simulations for multiple change point estimation in Table 1. According to [22], we take cn=2 in (5), similar to AIC. We focus on counting the number of events for which {|a^jaj|5}. The percentage of the correct identifications of all change points, denoted as call(%), reflects the proportion of replicates for which |a^jaj|5 for all j. In addition, cj(%) represents the proportion of replicates for which |a^jaj|5. The mean and standard error of the estimated change points are calculated for the replicates for which the difference between the estimated change points and the true value is less than or equal to 50 (i.e., |a^jaj|50).

Table 1.

Performance of different methods for multiple change point detection.

Method call c1 c2 c3
TSPoga,wald 90.70 98.50 96.80 97.80
Mean 150.34 300.41 449.77
SE 1.66 2.31 2.22
TSMCDlasso 72.60 95.20 95.80 96.40
Mean 150.61 300.43 450.16
SE 2.42 2.31 2.19

From Table 1, we can see that TSPoga,wald generally outperforms TSMCDlasso in terms of yielding a correct identification rate. It is noteworthy that TSMCDlasso performs significantly worse than TSPoga,wald in identifying all change points, especially at a1, as evidenced by the lower c1 value. TSPoga,wald and TSMCDlasso show comparable estimation accuracy in terms of the mean and standard error (SE) of the estimated change points.

5.2. Bootstrap CIs

All results are based on 500 realizations in the simulation setting, and we use B=500 to give the corresponding bootstrap CIs. We assume the confidence level is (1α){0.9,0.95}. For each j, the coverage of the bootstrap CI is calculated as the proportion of simulated realizations where CIsj(α) contains aj. For all j, the coverage of the bootstrap CIs is calculated as the proportion of simulated realizations, where j=1sCIsj(α/s) contains all aj.

Table 2 confirms the effectiveness of our bootstrap method, as the empirical coverage probability for each aj is close to the nominal level. The overall coverage of the bootstrap confidence intervals for all j is slightly higher than the nominal level, which may be due to the complexity of multiple change point detection. The average computational time for the bootstrapoga,wald procedure is 1.44 min per Monte Carlo replication, as measured on an Intel(R) Core(TM) i9-14900K processor (3.20 GHz) with 64 GB of RAM.

Table 2.

Performance of bootstrapoga,wald.

(1α)% aall a1 a2 a3
90 93.80 91.80 93.80 91.00
95 96.80 95.80 95.80 95.60

6. Empirical Application

In this section, we illustrate the proposed method by an application to the east–west component of seismograms, recorded at Iwanai station during the first foreshock of the Urakawa–Oki earthquake in 1982. This dataset has been previously studied by [27]. The time series data are analyzed using autoregressive models (AR) of an order of 5. The estimates and visualization of the 95% confidence level bootstrap CIs are presented in Table 3 and Figure 2, respectively.

Table 3.

Bootstrap CIs for the change points.

Change Point 95% Bootstrap CIs 90% Bootstrap CIs
3074 [3072, 3084] [3073, 3080]
3914 [3877, 3952] [3882, 3948]

Figure 2.

Figure 2

Change points estimated by our method (vertical lines), with shaded areas representing the 95% confidence intervals around the change points. The results for the two change points are shown in red and blue, respectively.

It can be seen from Table 3 that change points are detected at 3074 and 3914. In [27], they found the estimated change points to be 3079 and 3929. It is noteworthy that [27]s’ results and ours are close. In geology, these two change points represent the arrival times of P-waves and S-waves, which are two types of seismic waves. From Figure 2, it is clear that the confidence interval (CI) of the first change point is narrower than that of the second change point.

This example demonstrates the applicability and effectiveness of our method in detecting change points in seismic data. By accurately identifying the locations of structural faults, we can gain insights into the underlying geological processes and improve our understanding of seismic events.

7. Conclusions

This paper effectively addresses the bias issues often encountered when using OGA for model selection and the estimation of segments containing change points in the cutting stage. The accuracy of multiple change point estimation is improved by applying sup-Wald-type test statistics in the refining stage. The proposed method successfully constructs confidence intervals for multiple change points by using a bootstrapping technique with a two-stage procedure to quantify the uncertainty of multiple change points. The reliable construction of confidence intervals makes this method a valuable addition to the field of change point analysis and regression modeling. The bootstrapping technique also guarantees asymptotic validity. Numerical studies demonstrate the statistical accuracy of the proposed method. Our method can also be applied together with block bootstrapping to the parameter changes of linear regression models with dependent errors.

Appendix A

Proof of Theorem 2.

We only sketch this proof because it is similar to the proof of Propositions 1 and 2 in [26]. To prove Theorem 2, we define V(h)=δ^j;h(Zj,hMjZj,h)δ^j;h. For the sake of simplicity, we denote h0=aj, which means that h0 is the change point in Ikj. If h=h0, we can obtain that Zj,h=Zj,h0, where Zj,h0=(0,,0,xh0,,xnj(r)). By Equation (9), a^j,h=argmaxhV(h). Note that

δ^j;h=Zj,hMjZj,h1(Zj,hMjZj,h0δj+Zj,hMjε),δ^j,h0=δj+Zj,h0MjZj,h01Zj,h0Mjε.

It follows that

V(h)V(h0)=δjZj,h0MjZj,hZj,hMjZj,h1Zj,hMjZj,h0Zj,h0MjZj,h0δj+v(h),

where

v(h)=2δj(Zj,h0MjZj,h)(Zj,hMjZj,h)1Zj,h0Mjε2δjZj,h0Mjε+εMjZj,h(Zj,hMjZj,h)1Zj,hMjεεMjZj,h0(Zj,h0MjZj,h0)1Zj,h0Mjε.

Define for hh0,

g(h)=δj(Zj,h0MjZj,h)(Zj,hMjZj,h)1(Zj,hMjZj,h0)Zj,h0MjZj,h0δj/h0h.

When h=h0, define g(h)=δjδj. Then, we have

V(h)V(h0)=|h0h|g(h)+v(h). (A1)

Let

ZΔ=Zj,hZj,h0=(0,,0,xh+1,,xh0,0,,0),h<h0,Zj,h0Zj,h=(0,,0,xh0+1,,xh,0,,0),h>h0,0,h=h0.

We have Zj,h0=Zj,hZΔsgn(h0h). It follows that

v(h)=v1(h)+v2(h)+v3(h)+v4(h)+v5(h)=2δjZΔεsgn(h0h)2δjZΔXj(XjXj)1Xjεsgn(h0h)2δj(ZΔMjZj,h)(Zj,hMjZj,h)1Zj,hMjεsgn(h0h)+εMjZj,h(Zj,hMjZj,h)1Zj,hMjεεMjZj,h0(Zj,h0MjZj,h0)1Zj,h0Mjε. (A2)

We now establish the convergence rate of the change point estimator a^j,h in Theorem 2. By Lemma A.2 in [26], there exists λ>0 such that for every ϵ>0, inf|hh0|>Cδj2g(h)λδj2 has a probability of at least 1ϵ. Therefore, we only need to prove that

P(|a^j,haj|>Cδj2)Psuphh0>Cδj2V(h)V(h0)PsuphK(C)v(h)(h0h)δj2>λϵ, (A3)

where K(C)=h:hh0>Cδj2andnj(l)+ηNjhnj(r)ηNj for a small number of η>0. It is easy to show that

suphK(C)|v2(h)||h0h|δj22δj1ZΔxj/(h0h)(xjxj)1xjε=Op(δj1Nj1/2)=op(1),suphK(C)v3(h)|h0h|δj2δj1(ZΔMjZj,h)/(h0h)(Zj,hMjZj,h)1Zj,hMjε=Op(δj1Nj1/2)=op(1),suphK(C)|v4(h)||h0h|δj2δj2(Zj,hMjZj,h)1/2Zj,hMjε2/|h0h|=Op(δj2|h0h|1)=Op(1),suphK(C)|v5(h)||h0h|δj2δj2(Zj,h0MjZj,h0)1/2Zj,h0Mjε2/|h0h|=Op(δj2|h0h|1)=Op(1).

By Lemma A.3 in [26], there exists a large C such that

PsuphK(C)|v1(h)||h0h|δj2>λ5PsuphK(C)δj1ZΔε/(h0h)>λ10100Bλ2Cϵ5.

This completes the proof of (A3).

To study the limiting distribution of a^j,h, we need to investigate the behavior of V(h) on D(C)={h:|hh0|Cδj2}. Since XjZΔ=Op(δj2) and Zj,hMjZΔ=Op(δj2), we have

|h0h|g(h)=δjZj,h0MjZj,hZj,hMjZj,h1Zj,hMjZj,h0Zj,h0MjZj,h0δj=δj{ZΔZΔZΔXj(XjXj)1XjZΔ}δjδj{(ZΔMjZj,h)(Zj,hMjZj,h)1(Zj,hMjZΔ)}δj=δjZΔZΔδj+op(1).

Consider v(h) in (A2). It is straightforward to prove that if |hh0|Cδj2, then v2(h)=Op(δj1Nj1/2)=op(1), and v3(h)=Op(δj1Nj1/2)=op(1). It can also be shown that

v4(h)+v5(h)=εMjZj,h0(Zj,hMjZj,h)1(Zj,h0MjZj,h0)1Zj,h0Mjε+εMjZΔ(Zj,hMjZj,h)1Zj,hMjε+εMjZj,h0(Zj,hMjZj,h)1ZΔMjε=op(1)

on D(C). Therefore, we obtain that

V(h)V(h0)=δjZΔZΔδj+2δjZΔεsgn(h0h)+op(1). (A4)

In light of the proof of Proposition 2 in [26], we obtain the limiting distribution in Theorem 2. □

In the following, we introduce lemmas that are instrumental in proving Theorem 3.

Lemma A1.

If δj is fixed or δj0 but satisfies Assumption 5, then as Nj,

var(εnj(l))=Nj1i=nj(l)nj(r)εiε¯Nj2+OpNj1.

where εnj(l) is defined in (11).

Proof f Lemma A1.

In view of (1) and (10), we have

ε˜i=xiI(aj<inj(r))1nj(r)ajl=aj+1nj(r)xlδj+(xix¯j)(βjβ^j,h)xiI(a^j,h<inj(r))1Njl=a^j,h+1nj(r)xlδ^j,h+εiε¯Nj, (A5)

where x¯j=1Njl=nj(l)nj(r)xl and ε¯Nj=1Njl=nj(l)nj(r)εl.

Assume aj<a^j,h (the other case can be handled in a similar way). Since E(εnj(l))=Nj1i=nj(l)nj(r)ε˜i=0, we have var(εnj(l))=Nj1i=nj(l)nj(r)ε˜i2. It follows from (A5) that

var(εnj(l))=1Nji=nj(l)nj(r)(εiε¯Nj)2+2Nji=nj(l)nj(r)(xix¯j)(βjβ^j,h)(εiε¯Nj)+1Nji=nj(l)nj(r)(xix¯j)(βjβ^j,h)2+1Nj2i=a^j,h+1nj(r)xi(δjδ^j,h)2+1Nj2i=aj+1a^j,hxiδj2+2Nj2i=aj+1a^j,hxiδjl=a^j,h+1nj(r)xl(δjδ^j,h). (A6)

According to Corollary 1 in [26], we have βjβ^j,h=Op(Nj1/2) and δjδ^j,h=Op(Nj1/2), which, combined with a^j,h=aj+Op(δj2), yields that Nj1i=aj+1a^j,hxiδj=Op(Nj1δj1) and Nj1l=a^j,h+1nj(r)xl(δjδ^j,h)=Op(Nj1/2). This completes the proof.

Lemma A2.

For every ϵ>0,

lim supNjPNj1i=nj(l)nj(r)(εi)2Var(εnj(l))ϵϵσ2.

Proof of Lemma A2.

We have for every ϵ>0 and every η>0,

PNj1i=nj(l)nj(r)εi2var(εnj(l))ϵ=PNj1i=nj(l)nj(r)εi2Nj1i=nj(l)nj(r)ε˜i2ϵPNj1i=nj(l)nj(r)εi2I(|εi|ηNj)Nj1i=nj(l)nj(r)ε˜i2I(|ε˜i|ηNj)ϵ+PNj1i=nj(l)nj(r)εi2I(|εi|<ηNj)Nj1i=nj(l)nj(r)ε˜i2I(|ε˜i|<ηNj)ϵ (A7)

Clearly, for every ϵ>0 and every η>0, as Nj, we have

Nj1i=nj(l)nj(r)ε˜i2I(|ε˜i|ηNj)0a.s.

It follows that

PNj1i=nj(l)nj(r)εi2IεiηNjϵ/22ϵNj1i=nj(l)nj(r)ε˜i2Iε˜iηNj0,a.s.

Since

lim supNjPNj1i=nj(r)nj(l)εi2Iεi<ηNjNj1i=nj(r)nj(l)ε˜i2Iε˜i<ηNjϵ/24ϵ2lim supNjNj2i=nj(r)nj(l)ε˜i4Iε˜i<ηNj4ϵ2η2lim supnNj1i=nj(r)nj(l)ε˜i2=4ϵ2η2σ2a.s.,

choosing η2=ϵ3/8 gives the assertion in Lemma A2. □

Lemma A3.

Under the assumption that i=t1t2xixi/t1t2pV as t2t1p, for every ϵ>0, there exists λ>0 such that gλδ^j,h2 with a probability of at least 1ϵ, where g=infk2h>Njηg(hb).

Lemma A4.

Under the assumption that i=t1t2xixi/t1t2pV as t2t1, for every ϵ>0 and λ>0, there exists N0>0, such that Nj>N0, P(|hbh|>Njη)<ϵ.

The proofs of Lemmas A3 and A4 follow from [26] (see Lemmas A.2 and A.4). We next establish the consistency of a^j,hb in Lemma A5, which serves as a key step in proving Theorem 3.

Lemma A5.

If δj is fixed or δj0 but satisfies Assumption 5, under Assumptions 1–4, we have

a^j,hba^j,h=Op(δ^j,h2).

Proof of Lemma A5.

We define V(hb)=(δ^j;hb)(Zj,hbMjZj,hb)δ^j;hb. By Equation (12), a^j,hb=argmaxhbV(hb). Denote h=a^j,h. If hb=h, then Zj,hb=Zj,h. It can be seen from (11) that

δ^j;hb=Zj,hbMjZj,hb1(Zj,hbMjZj,hδ^j,h+Zj,hbMjε),δ^j;h=δ^j,h+Zj,hMjZj,h1Zj,hMjε.

It follows that

V(hb)V(h)=δ^j,hZj,hMjZj,hbZj,hbMjZj,hb1Zj,hbMjZj,hZj,hMjZj,hδ^j,h+v(hb),

where

v(hb)=2δ^j,h(Zj,hMjZj,hb)(Zj,hbMjZj,hb)1Zj,hbMjε2δ^j,hZj,hMjε+εMjZj,hb(Zj,hbMjZj,hb)1Zj,hbMjεεMjZj,h(Zj,hMjZj,h)1Zj,hMjε.

Define

g(hb)=δ^j,h(Zj,hMjZj,hb)(Zj,hbMjZj,hb)1(Zj,hbMjZj,h)Zj,hMjZj,hδ^j,h/hhb,if hbhδ^j,hδ^j,h,if hb=h.

We have

V(hb)V(h)=|hhb|g(hb)+v(hb). (A8)

Again, since V(hb)Vh by definition, it suffices to prove, by Lemma A4, that

Psuphbh>Cδ^j,h2V(hb)Vh<ϵ.

By Lemmas A1 and A2, for any ϵ>0 and η>0, we have Phbh>Njη<ϵ for a large Nj. Therefore, to prove the above equation, we only need to show that

PsuphbK(C)V(hb)Vh<ϵ,

where K(C)=hb:hbh>Cδ^j,h2andnj(l)+Njηhbnj(r)ηNj for a small number η>0. Finding suphbK(C)V(hb) is equivalent to a restricted search, this is legitimate only after the consistency has been established. Thus, by Lemma A3, it suffices to show that

PsuphbK(C)|v(hb)||hhb|δ^j,h2>λ<ϵ. (A9)

Next, consider v(hb). Denote

ZΔb=Zj,hbZj,h=(0,,0,xhb+1,,xh,0,,0),hb<h,Zj,hZj,hb=(0,,0,xh+1,,xhb,0,,0),hb>h,0,hb=h.

It follows that Zj,hb=Zj,h+ZΔbsgnhhb. Thus, we have

v(hb)=v1(hb)+v2(hb)+v3(hb)+v4(hb)+v5(hb)=2δ^j,hZΔbεsgn(hhb)2δ^j,hZΔbxj(xjxj)1xjεsgn(hhb)2δ^j,h(ZΔbMjZj,hb)(Zj,hbMjZj,hb)1Zj,hbMjεsgn(hhb)+εMjZj,hb(Zj,hbMjZj,hb)1Zj,hbMjεεMjZj,h(Zj,hMjZj,h)1Zj,hMjε. (A10)

By Lemmas A1 and A2, we deduce that XjXj1Xjε=Nj1/2Op(1). Since Nj1/2+αδj10 and

Nj1/2+αδj1Nj1/2+αδ^j,h1=Nj1/2+αδj1(δ^j,hδj)δ^j,h1=Op(Nj1+αδj2),

by Corollary 1 in [26], we have Nj1/2+αδ^j,h1 tends to zero. It follows from ZΔbXj=|hhb|Op(1) that

suphbK(C)|v2(hb)||hhb|δ^j,h22δ^j,h1ZΔbXj/(hhb)(XjXj)1Xjε=Op(δ^j,h1Nj1/2)=op(1).

Similarly, we have (ZΔbMjZj,hb)=|hhb|Op(1), and (Zj,hbMjZj,hb)1Zj,hbMjε=Nj1/2Op(1) uniformly on K(C), which implies

suphbK(C)v3(hb)|hhb|δ^j,h2δ^j,h1(ZΔbMjZj,hb)/(hhb)(Zj,hbMjZj,hb)1Zj,hbMjε=Op(δ^j,h1Nj1/2)=op(1).

Furthermore, it is easy to show that the expressions suphbK(C)|v4(hb)||hhb|δ^j,h2 and

suphbK(C)|v5(h)||hhb|δ^j,h2 are uniformly Op(1) on K(C). By Lemma A.3 in [26],

PsuphbK(C)|v1(hb)||hhb|δ^j,h2>λ5PsuphbK(C)δ^j,h1ZΔbε/(hhb)>λ10100Bλ2Cϵ5

for large C. In conclution, the consistency of the bootstrap estimator has been established. □

Proof of Theorem 3.

Lemma A5 implies that when C is large, a^j,hb has a high probability of being outside the set K(C). Let D(C) denote the complement set of K(C) such that D(C)={hb:|hbh|Cδ^j,h2}. To study the limiting distribution, we consider the expression of V(hb)V(h) on D(C).

We use Zj,h=Zj,hbZΔbsgnhhb to obtain that

hhbg(hb)=δ^j,h{Zj,hMjZj,hZj,hMjZj,hb(Zj,hbMjZj,hb)1Zj,hbMjZj,h}δ^j,h=δ^j,h{ZΔbZΔbZΔbXj(XjXj)1XjZΔb}δ^j,hδ^j,hZΔbMjZj,hb(Zj,hbMjZj,hb)1Zj,hbMjZΔbδ^j,h. (A11)

Since XjZΔb=Op(δ^j,h2) and Zj,hbMjZΔb=Op(δ^j,h2), we have

|hhb|g(hb)=δ^j,hZΔbZΔbδ^j,h+op(1).

Since hhbCδ^j,h2, we deduce that v2(hb) and v3(hb) are both bounded by Op(Nj1/2δ^j,h1)=op(1), and v4(hb) and v5(hb) are both bounded by op(1) uniformly on D(C).

By the above results, we obtain that

V(hb)V(h)=δ^j,hZΔbZΔbδ^j,h+2δ^j,hZΔbεsgnhhb+op(1),

which combined with (A4) completes the proof. □

Author Contributions

Conceptualization, L.H., B.J. and Y.W.; methodology, L.H. and B.J.; data analysis, L.H. and F.W.; writing, L.H., Y.W. and F.W. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This work is supported by the National Natural Science Foundation of China (12231017, 72293573); and the Natural Science and Engineering Research Council of Canada (RGPIN-2023-05655).

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Haynes K., Eckley I.A., Fearnhead P. Computationally efficient changepoint detection for a range of penalties. J. Comput. Graph. Stat. 2017;26:134–143. doi: 10.1080/10618600.2015.1116445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Picard F., Robin S., Lavielle M., Vaisse C., Daudin J.J. A statistical approach for array CGH data analysis. BMC Bioinform. 2005;6:27. doi: 10.1186/1471-2105-6-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li J., Fearnhead P., Fryzlewicz P., Wang T. Automatic change-point detection in time series via deep learning. J. R. Stat. Soc. Ser. B Stat. Methodol. 2024;86:273–285. doi: 10.1093/jrsssb/qkae004. [DOI] [Google Scholar]
  • 4.Killick R., Fearnhead P., Eckley I.A. Optimal detection of changepoints with a linear computational cost. J. Am. Stat. Assoc. 2012;107:1590–1598. doi: 10.1080/01621459.2012.737745. [DOI] [Google Scholar]
  • 5.Bai J., Perron P. Estimating and testing linear models with multiple structural changes. Econometrica. 1998;66:47–78. doi: 10.2307/2998540. [DOI] [Google Scholar]
  • 6.Bai J., Perron P. Computation and analysis of multiple structural change models. J. Appl. Econom. 2003;18:1–22. doi: 10.1002/jae.659. [DOI] [Google Scholar]
  • 7.Davis R.A., Lee T.C.M., Rodriguez-Yam G.A. Structural break estimation for nonstationary time series models. J. Am. Stat. Assoc. 2006;101:223–239. doi: 10.1198/016214505000000745. [DOI] [Google Scholar]
  • 8.Harchaoui Z., Lévy-Leduc C. Multiple change-point estimation with a total variation penalty. J. Am. Stat. Assoc. 2010;105:1480–1493. doi: 10.1198/jasa.2010.tm09181. [DOI] [Google Scholar]
  • 9.Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 1996;58:267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x. [DOI] [Google Scholar]
  • 10.Jin B., Wu Y., Shi X. Consistent two-stage multiple change-point detection in linear models. Can. J. Stat. 2016;44:161–179. doi: 10.1002/cjs.11282. [DOI] [Google Scholar]
  • 11.Zou H. The adaptive lasso and its oracle properties. J. Am. Stat. Assoc. 2006;101:1418–1429. doi: 10.1198/016214506000000735. [DOI] [Google Scholar]
  • 12.Fan J., Li R. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties. J. Am. Stat. Assoc. 2001;96:1348–1360. doi: 10.1198/016214501753382273. [DOI] [Google Scholar]
  • 13.Zhang C. Nearly unbiased variable selection under minimax concave penalty. Ann. Stat. 2010;38:894–942. doi: 10.1214/09-AOS729. [DOI] [PubMed] [Google Scholar]
  • 14.Li J., Jin B. Multi-threshold accelerated failure time model. Ann. Stat. 2018;46:2657–2682. doi: 10.1214/17-AOS1632. [DOI] [Google Scholar]
  • 15.Eichinger B., Kirch C. A MOSUM procedure for the estimation of multiple random change points. Bernoulli. 2018;24:526–564. doi: 10.3150/16-BEJ887. [DOI] [Google Scholar]
  • 16.Fang X., Li J., Siegmund D. Segmentation and estimation of change-point models: False positive control and confidence regions. Ann. Stat. 2020;48:1615–1647. doi: 10.1214/19-AOS1861. [DOI] [Google Scholar]
  • 17.Antoch J., Hušková M., Veraverbeke N. Change-point problem and bootstrap. J. Nonparametr. Stat. 1995;5:123–144. doi: 10.1080/10485259508832639. [DOI] [Google Scholar]
  • 18.Dumbgen L. The asymptotic behavior of some nonparametric change-point estimators. Ann. Stat. 1991;19:1471–1495. doi: 10.1214/aos/1176348257. [DOI] [Google Scholar]
  • 19.Hušková M., Kirch C. Bootstrapping confidence intervals for the change-point of time series. J. Time Ser. Anal. 2008;29:947–972. doi: 10.1111/j.1467-9892.2008.00589.x. [DOI] [Google Scholar]
  • 20.Cho H., Kirch C. Bootstrap confidence intervals for multiple change points based on moving sum procedures. Comput. Stat. Data Anal. 2022;175:107552. doi: 10.1016/j.csda.2022.107552. [DOI] [Google Scholar]
  • 21.Lv J., Fan Y. A unified approach to model selection and sparse recovery using regularized least squares. Ann. Stat. 2009;37:3498–3528. doi: 10.1214/09-AOS683. [DOI] [Google Scholar]
  • 22.Ing C.K., Lai T.L. A stepwise regression method and consistent model selection for high-dimensional sparse linear models. Stat. Sin. 2011;21:1473–1513. doi: 10.5705/ss.2010.081. [DOI] [Google Scholar]
  • 23.Jin B., Shi X., Wu Y. A novel and fast methodology for simultaneous multiple structural break estimation and variable selection for nonstationary time series models. Stat. Comput. 2013;23:221–231. doi: 10.1007/s11222-011-9304-6. [DOI] [Google Scholar]
  • 24.White H. Maximum likelihood estimation of misspecified models. Econom. J. Econom. Soc. 1982;50:1–25. doi: 10.2307/1912526. [DOI] [Google Scholar]
  • 25.Flynn C.J., Hurvich C.M., Simonoff J.S. Efficiency for regularization parameter selection in penalized likelihood estimation of misspecified models. J. Am. Stat. Assoc. 2013;108:1031–1043. doi: 10.1080/01621459.2013.801775. [DOI] [Google Scholar]
  • 26.Bai J. Estimation of a change point in multiple regression models. Rev. Econ. Stat. 1997;79:551–563. doi: 10.1162/003465397557132. [DOI] [Google Scholar]
  • 27.Takanami T., Kitagawa G. Estimation of the arrival times of seismic waves by multivariate time series model. Ann. Inst. Stat. Math. 1991;43:407–433. doi: 10.1007/BF00053364. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data presented in this study are available on request from the corresponding authors.


Articles from Entropy are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES