Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2022 Oct 19:10.1111/insr.12521. Online ahead of print. doi: 10.1111/insr.12521

Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea

Won Son 1, Johan Lim 2, Donghyeon Yu 3,
PMCID: PMC9874640  PMID: 36710888

Summary

The fused lasso signal approximator (FLSA) is a smoothing procedure for noisy observations that uses fused lasso penalty on unobserved mean levels to find sparse signal blocks. Several path algorithms have been developed to obtain the whole solution path of the FLSA. However, it is known that the FLSA has model selection inconsistency when the underlying signals have a stair‐case block, where three consecutive signal blocks are either strictly increasing or decreasing. Modified path algorithms for the FLSA have been proposed to guarantee model selection consistency regardless of the stair‐case block. In this paper, we provide a comprehensive review of the path algorithms for the FLSA and prove the properties of the recently modified path algorithms' hitting times. Specifically, we reinterpret the modified path algorithm as the path algorithm for local FLSA problems and reveal the condition that the hitting time for the fusion of the modified path algorithm is not monotone in a tuning parameter. To recover the monotonicity of the solution path, we propose a pathwise adaptive FLSA having monotonicity with similar performance as the modified solution path algorithm. Finally, we apply the proposed method to the number of daily‐confirmed cases of COVID‐19 in Korea to identify the change points of its spread.

Keywords: Change points, fused lasso signal approximator, modified path algorithm, pathwise adaptive weight, solution path

1. INTRODUCTION

Consider observations {yi}i=1n from a model

yi=μi+ϵi, (1)

where ϵis are independently and identically from a distribution with mean 0 and variance σn2. Here, we assume that true underlying signals have block structures with μ1==μj1μj1+1==μjKμjK+1==μn. Under this assumption, it is essential to identify the unknown block structures to estimate the signal means. This problem is known as multiple‐change‐point detection. The literature contains various methods on multiple‐change‐point detection, including best subset selection (Yao, 1988; Yao & Au, 1989), circular binary segmentation (CBS) (Olshen et al.2004), and wild binary segmentation (WBS) (Fryzlewicz, 2014).

This study focuses on the fused lasso signal approximator (FLSA) that finds sparse signal blocks using the fused lasso penalty (Tibshirani et al.2005) on underlying mean levels. The FLSA obtains the signal estimate by minimising

fSFL(μ;y,λ1,λ2)=12yμ22+λ1μ1+λ2μTV, (2)

where y=(y1,,yn)T, μ=(μ1,,μn)T, μ1=j=1n|μj| is the 1‐norm of μ, and μTV=j=2n|μjμj1| is the total‐variation norm of μ. Given λ1 and λ2, the FLSA obtains the sparse block solution for μ, providing the estimates of both signal levels and block structures. As shown in Lemma A.1 in Friedman et al. (2007), the sparse estimate can be obtained directly from the minimiser μ^(0,λ2) of f(μ;y,0,λ2) by the soft‐thresholding operator μ^(λ1,λ2)=Softλ1(μ^(0,λ2)), where Softa(x)=sign(xi)max(|xi|a,0)i=1n for xn.

Furthermore, the FLSA has the model selection consistency (i.e. exact identification of the true block structure) when the underlying true block structure has no stair‐case blocks, where a stair‐case block denotes that three consecutive blocks that are either strictly increasing or decreasing (Rinaldo, 2009, 2014; Qian & Jia, 2016). The FLSA estimator is inconsistent in identifying the true signal blocks when the true block structure contains any stair‐case blocks. The preconditioned FLSA via Puffer transformation (Qian & Jia, 2016) and the modified path algorithm for FLSA (Son & Lim, 2019) are proposed to resolve this inconsistency. Recently, the error analysis for the FLSA has been done in literature (Lin et al.2016).

Besides the theoretical properties of the FLSA, it is critical to obtain its solution efficiently. The entire solution path of the FLSA can be obtained using either the path algorithm for the FLSA proposed by Hoefling (2010) or the path algorithm for the generalised lasso by Tibshirani & Taylor (2011). These two solution path algorithms solve the problem (2) but still show the model selection inconsistency for identifying true blocks with stair‐case blocks. Contrarily, two modified path algorithms have been developed recently to resolve the model selection inconsistency of the FLSA. The preconditioned FLSA with the Puffer transformation (Qian & Jia, 2016) converts the original problem into a Lasso problem with reparameterisation and applies the Puffer transformation. Essentially, the solution path algorithm for the preconditioned FLSA is based on the path algorithm for Lasso (Bradely Efron et al.2004). The modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019) reinterprets the FLSA path algorithm using the distances between two adjacent blocks and proposes a new distance criterion that guarantees the model selection consistency regardless of a stair‐case block. Note that the original study does not provide the overall objective function of mPath‐FLSA.

Our main contributions through this study are as follows. First, we provide a comprehensive review of the FLSA path algorithms and prove the properties of the hitting times of the recently modified path algorithm. Specifically, we provide an explicit form of the objective function of mPath‐FLSA. Furthermore, we apply the new representation to show that the hitting times of mPath‐FLSA can fail to increase monotonically with respect to λ2. For example, we suppose that there are four observations, y1=0.032, y2=0.787, y3=0.122, and y4=0.207. The hitting times for the FLSA solution path algorithm are (λ2(0),,λ2(3))=(0,0.1097,0.2731,0.3352), which satisfy the monotone property (i.e. λ2(0)λ2(3)). However, the hitting times of mPath‐FLSA are calculated as (λ2(0),,λ2(3))=(0,0.0822,0.2049,0.1676), which are not monotone in λ2. Section 3 provides additional details. Second, we propose a pathwise adaptive FLSA and its solution path algorithm to resolve the violation of the monotonicity of the solution path along λ2. The proposed method adopts the weighted fusion penalty terms j=2nwj(l)|μjμj1|, which are adaptively defined by the solutions at the hitting times (λ2(l))l=0n1 on the solution path along λ2. Finally, we provide a comprehensive numerical comparison of the existing path algorithms with the proposed algorithm. We compare four methods based on the entire solution path along λ2, the exact pattern recovery probability, and the estimation performance of the signal levels and block structures concerning the selection of tuning parameters.

The remainder of this paper is organised as follows. Section 2 contains a brief review of the existing solution path algorithms related to the FLSA. Section 3 presents a new interpretation of mPath‐FLSA besides the conditions for the non‐monotonicity of the solution path along λ2. The pathwise adaptive FLSA and its solution path algorithm that satisfies the monotone property in the solution path along λ2 are proposed in Section 4. In Section 5, a numerical study for measuring the probability of the exact recovery was conducted using the proposed algorithm and the other existing algorithms. Moreover, we compared the estimation performance with the optimal tuning parameter chosen by the Bayesian information criterion (BIC) and the extended BIC (EBIC). In Section 6, we apply the proposed method and existing FLSA path algorithms to the number of daily‐confirmed cases of COVID‐19 in Korea to identify events that affect the COVID‐19 spread. Finally, we conclude the paper with some remarks.

2. EXISTING SOLUTION PATH ALGORITHMS FOR FLSA

In this section, we review three existing solution path algorithms related to the FLSA; the path algorithm for the FLSA (Path‐FLSA) by Hoefling (2010), the path algorithm for the preconditioned FLSA (Path‐PFLSA) by Qian & Jia (2016), the modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019). Henceforth, to focus on the identification of true mean blocks, we consider the FLSA with λ1=0 as follows:

fFL(μ;y,λ2)=12yμ22+λ2j=2n|μjμj1|. (3)

Note that we omit the review of the solution path algorithm for the generalised Lasso by Tibshirani & Taylor (2011) because its solution path for the FLSA is the same as Path‐FLSA.

2.1. Path Algorithm for the FLSA (Path‐FLSA)

The FLSA path algorithm proposed by Hoefling (2010) provides a whole solution path of the FLSA along λ2 based on the fact that the solution path for λ2 is a piece‐wise linear in λ2. Originally, Hoefling (2010) proposes a path algorithm for the generalised FLSA problem

minμfFL(μ;y,λ2)=12yμ22+λ2(i,j)E|μiμj|, (4)

where E denotes a set of index pairs that correspond to fusion penalty terms. Here, we focus on the path algorithm for the one‐dimensional FLSA as in (3) (i.e. E={(1,2),(2,3),,(p1,p)}).

Specifically, let {B^k}k=1J^(λ2) be a partition of {1,,n} defined from the solution μ^FL(λ2) of the FLSA at λ2, where J^(λ2)1 is the number of change points in μ^FL(λ2). Further, we let {λ2(l)}l=0n1 be hitting times on the solution path, with two adjacent sets in the partition defined by the previous hitting time fused on each of them. For example, there is an index k{1,,nl+1} on λ2=λ2(l), such that B^k1(l1) and B^k(l1) are fused as B^k1(l)=B^k1(l1)B^k(l1). Thus, the solution μ^FL(λ2) for λ2=λ2(l) can be expressed explicitly as:

μ^iFL(λ2)=k=1nlν^k(l)(λ2)I(iB^k(l)), (5)

where b^k(l)=|B^k(l)|, ν^k(l)(λ2)=1b^k(l)iB^k(l)yi+c^k(λ2)=y¯k+c^k(λ2),

c^k(λ2)=2λ2b^k(l)ifν^k(λ2)<ν^k1(λ2),ν^k+1(λ2)<ν^k(λ2)2λ2b^k(l)ifν^k(λ2)>ν^k1(λ2),ν^k+1(λ2)>ν^k(λ2)0if(ν^k(λ2)ν^k1(λ2))(ν^k+1(λ2)ν^k(λ2))>0.

Now, suppose that we know a partition {B^k(l)}k=1nl by μ^FL(λ2(l)), such that B^k(l)={1,,n}, and consider λ2(λ2(l),λ2(l+1)), where there is no fusion in the FLSA estimate. The FLSA solution satisfies the following equation:

fνk=b^k(l)νkiB^k(l)yi+λ2(|νjνj1|+|νjνj+1|)=0, (6)

where |x| is a subdifferential of |x|, which is defined as |x|=1 if x>0, −1 if x<0, and z[1,1] if x=0. We can easily obtain the derivative of ν^k with respect to λ2 from (6) as

ν^kλ2=|ν^kν^k1|+|ν^kν^k+1|b^k(l). (7)

Thus, ν^k is linear in λ2(λ2(l),λ2(l+1)) because the numerator and denominator of (7) are unchanged for the given interval (λ2(l),λ2(l+1)). This property shows that the FLSA solution path is piecewise linear in λ2.

Given the solution ν^k on the hitting time λ2(l) and the derivative ν^k/λ2, the solution ν^k(λ2) for λ2(λ2(l),λ2(l+1)) can be easily obtained as

ν^k(λ2)=ν^k(λ2(l))+ν^kλ2δ, (8)

where δ=λ2λ2(l). Based on (8), the next hitting time is defined as

λ2(l+1)=min1knl1hk,k+1(λ2(l))=min1knl1ν^k+1(λ2(l))ν^k(λ2(l))ν^kλ2ν^k+1λ2+λ2(l) (9)

Given the Eqs. 8 and (9), the Path‐FLSA starts with λ2(0)=0 and sequentially calculates the hitting times. The Path‐FLSA stores all solutions at hitting times (λ2(0),,λ2(n1)) and their derivatives to obtain the solution for any λ2 with (8). Note that the FLSA solution path for generalised fusion penalty λ2(i,j)E|μiμj| could be split for some edge set E. For example, two adjacent solutions μ^i(l1) and μ^j(l1) for (i,j)E are fused at λ2(l) (i.e. μ^i(l1)μ^j(l1) and μ^i(l)=μ^j(l)) and split at λ2(t) for some t>l (i.e. μ^i(t1)=μ^j(t1) and μ^i(t)μ^j(t)) with the generalised fusion penalty. However, the one‐dimensional FLSA (i.e. E is an edge set {(1,2),(2,3),,(n1,n)} of a chain graph) only has hitting events on the solution path, as shown by Friedman et al. (2007) in Proposition 2.

2.2. Preconditioned FLSA with Puffer Transformation (PCD‐FLSA)

As explained in the Introduction, the FLSA faces the model selection inconsistency due to stair‐case blocks in the underlying signals. Qian & Jia (2016) connect the FLSA model selection inconsistency to the irrepresentable condition of the Lasso problem (Zhao & Yu, 2006) by the reparameterisation of the FLSA. To obtain the model selection consistency, Qian & Jia (2016) applied the Puffer transformation introduced by Jia & Rohe (2015) to the reparameterised FLSA, which is called the preconditioned FLSA with Puffer transformation (PCD‐FLSA).

Specifically, consider the reparameterisation of μ with θ as follows:

μ1=θ1,μ2=θ1+θ2,,μn=j=1nθjμ=Aθ, (10)

where A is a lower triangular matrix with nonzero elements equal to one. This reparameterisation can represent the objective function of the FLSA as

fL(θ;λ2)=12yAθ22+λ2θ[2:n]1, (11)

where θ[2:n]=(θ2,,θn)TRn1. The solution of (11) can be obtained by

θ^[2:n](λ2)=argminθ[2:n]12y˜X˜θ[2:n]22+λ2θ[2:n]1,andθ^1(λ2)=y¯x¯Tθ^[2:n](λ2), (12)

where y¯=(1/n)i=1nyi, y˜=yy¯1n, 1n is an n‐dimensional vector of ones, X˜=[x1x¯11n,,xn1x¯n11n]=X1nx¯T, x¯=(x¯1,,x¯n1)T, x¯j=1nTxj/n, and X=A[1:n,2:n]=[x1,,xn1], which denotes that X is an n×(n1) matrix defined by removing the first column of A. Henceforth, we refer to the reparameterised FLSA problem (12) as the transformed Lasso.

Based on the reparameterisation, the pattern recovery for μ using the FLSA is equivalent to the sign consistency of the Lasso estimator for θ[2:n]. For a linear model y˜=X˜θ[2:n]+ϵ with E(ϵ)=0, the conditions for the sign consistency of the Lasso estimator is well understood. One of the necessary conditions of the sign consistency of the Lasso estimator is the irrepresentable condition (Zhao & Yu, 2006) defined as follows:

X˜ScTX˜S(X˜STX˜S)1sign((θ[2:n])S)1ηfor someη(0,1], (13)

where S={j:(θ[2:n])j0}. Theorem 2 of Qian & Jia (2016) shows that the irrepresentable condition (13) holds if and only if one of the following two conditions holds.

  • (1)

    For a set of change points {j1,,jK}, K=1 or max1t<K(jt+1jt)=1.

  • (2)

    The underlying signal has no stair‐case blocks. That is, (μjtμjt1)(μjt+1μjt)<0.

The abovementioned condition (2) shows the inconsistency of the FLSA in identifying true change points when underlying signals have stair‐case blocks.

Qian & Jia (2016) proposed a preconditioned FLSA with a Puffer transformation to resolve the inconsistency of the FLSA. The Puffer transformation for Lasso is proposed by Jia & Rohe (2015) to make the Lasso estimator sign consistent when the irreresentable condition does not hold for the design matrix Xn×p with np. Let the singular value decomposition of X˜ be X˜=UDVT. The Puffer transformation is defined as Fn×n=UD1UT. Because X˜n×(n1) in (12), the results of Jia & Rohe (2015) are valid for the transformed Lasso problem in (12). Let Z=FX˜ and a=Fy˜ be the preconditioned design matrix and the response vector, respectively. The preconditioned FLSA with the Puffer transformation solves

minb12aZb22+λ2b1. (14)

Because ZTZ=In, the solution of the PCD‐FLSA has the explicit form b^(λ2)=Softλ2(ZTa), where Softa(x)=(sign(xi)max(|xi|a,0))i=1n. Thus, the solution path of b^(λ2) is piecewise linear in λ2, and the hitting times (or slope‐change points in the solution path) are λ2,PCDFLSA(i)=|ZTa|(i) for i=1,,n1, where x(i) is the ith smallest value in a vector x.

2.3. Modified Path Algorithm for FLSA (mPath‐FLSA)

Recently, Son & Lim (2019) provided a new interpretation of the path algorithm for the one‐dimensional FLSA (1D‐FLSA), which uses a new distance measure for two adjacent blocks to determine if they are fused. Furthermore, they used this interpretation to show the inconsistency of the FLSA in the presence of the stair‐case blocks and proposed a path algorithm with a modified distance measure that resolves this inconsistency.

Specifically, let λ2(0)=0<λ2(1)<,<λ2(n1) be the hitting times of the 1D‐FLSA. Further, let {Bk(l)}k=1nl be a partition of {1,,n} corresponding to the estimated blocks of μ^(λ2) for λ2[λ2(l),λ2(l+1)). Son & Lim (2019) developed a new distance measure d(B^k1(l),B^k(l)) for two adjacent blocks B^k1(l) and B^k(l) defined as

d(B^k1(l),B^k(l))=|y¯k1(l)y¯k(l)|1b^k1(l)sy¯k1(l)+1b^k(l)sy¯kl, (15)

where b^k(l)=|B^k(l)|, y¯k(l)=(1/b^k(l))iB^k(l)yi, and

sy¯k(l)=||y¯k(l)y¯k1(l)|+|y¯k(l)y¯k+1(l)||/2fork=2,,nl1|(|y¯k(l)y¯k1(l))|)|/2fork=nl||y¯k(l)y¯k+1(l)||/2fork=1. (16)

Son & Lim (2019) used the distance measure in (15) to show that the hitting time λ2(l+1) in (9) is equivalent to 12minkd(B^k1(l),B^k(l)). This interpretation shows that the 1D‐FLSA cannot fuse two consecutive stair‐case blocks with a finite λ2 because the denominator of the distance measure is defined as 0.

The abovementioned phenomenon for two consecutive stair‐case blocks promoted Son & Lim (2019) to propose the modified distance measure that guarantees the consistency of the 1D‐FLSA estimator in identifying true block structures regardless of the stair‐case blocks. The modified distance measure δ(B^k1(l),B^k(l)) for two consecutive blocks B^k1(l) and B^k(l) is defined as

δ(B^k1(l),B^k(l))=|y¯k1(l)y¯k(l)|1/b^k1(l)+1/b^k(l). (17)

A modified path algorithm is proposed using the modified distance measure with the hitting times defined as:

λ2,mPathFLSA(l+1)=12minkδ(B^k1(l),B^k(l))forl=0,,n2. (18)

Theorem 2 of Son & Lim (2019) shows that the 1D‐FLSA with a modified path algorithm consistently identifies the change points and signs of difference between two adjacent blocks. Note that the modified path algorithm guarantees a consistent estimator for identifying true block structures by adopting the modified distance measure δ(·,·), but this modification causes the modified path algorithm to solve a different problem that is not equivalent to the 1D‐FLSA, and the monotone increasing property of the hitting times could be violated (i.e. for some l, λ2(l)>λ2(l+1)). This phenomenon is explained in the next section.

3. NEW INTERPRETATION OF MODIFIED PATH ALGORITHM

In this section, we provide a novel interpretation of the mPath‐FLSA, including the exact target objective function of the mPath‐FLSA. It clarifies the differences between Path‐FLSA and mPath‐FLSA. Moreover, we illustrate a condition that violates the monotone increasing property of the hitting times in mPath‐FLSA.

Suppose we know that a partition {B^k(l)}k=1nl of {1,,n} for a given λ2,mPathFLSA(l)=12minkδ(B^k1(l1),B^k(l)), where λ2,mPathFLSA(l) is the lth hitting time of {y1,y2,,yn} based on the mPath‐FLSA. In this section, we denote λ2,mPathFLSA(l) as λ2(l) for notational simplicity. Let νk(l) be a parameter of the kth block mean at λ2[λ2(l),λ2(l+1)). A localised FLSA problem for a pair (k1,k) of the partition indices of {B^j(l)}j=1nl is define as

fl(νk1(l),νk(l);ηk)=12j{k1,k}iB^j(l)(yiνj(l))2+ηk|νk(l)νk1(l)|fork=2,,nl, (19)

where ηk0 is a tuning parameter of the localised FLSA problem for (B^k1(l),B^k(l)). In Theorem 1, we show that the hitting times of the mPath‐FLSA are equivalent to the minimum hitting times of (nl1) localised FLSA problems.

Theorem 1

Let λ2(l) be the lth hitting time of the modified path algorithm with y=(y1,,yn) and λ2(0)=0. Let ηj(l+1) be the hitting time of (19) for a partition pair (B^j1(l),B^j(l)). We define τ(l+1)=min2jnlηj(l+1), which is the minimum hitting time of fl(νj1,νj) for j=2,,nl. Then, ηj(l+1)=δ(B^j1(l),B^j(l)) and λ2(l+1)=τ(l+1)/2.

The necessary and sufficient conditions of the solution using the subgradient approach of fl(νj1,νj;ηj) are as follows:

flνj1(l)=iB^j1(l)yib^j1(l)νj1(l)ηj|νj(l)νj1(l)|=0flνj(l)=iB^j(l)yib^j(l)νj(l)+ηj|νj(l)νj1(l)|=0, (20)

where b^j(l)=|B^j(l)|, and |x| is the subdifferential of |x|. Let ν^j1(l) and ν^j(l) be the solutions of the Eq. (20). For ν^j1(l)>ν^j(l), the solution can be represented as ν^j1(l)=y¯j1(l)(ηk/b^j1(l)) and ν^j(l)=y¯j(l)+(ηk/b^j(l)). From ν^j1(l)>ν^j(l), the hitting time is ηk(l+1)=y¯j1(l)y¯j(l)1/b^j1(l)+1/b^j(l)=|y¯j(l)y¯j1(l)|1/b^j1(l)+1/b^j(l). Similarly, for ν^j1(l)<ν^j(l), the hitting time is ηj(l+1)=y¯j(l)y¯j1(l)1/b^j1(l)+1/b^j(l)=|y¯j(l)y¯j1(l)|1/b^j1(l)+1/b^j(l). Hence, ηj(l+1)=δ(B^j1(l),B^j(l)). Based on the definition of the minimum hitting time τ(l+1), τ(l+1) is equivalent to min2jnlδ(B^j1(l),B^j(l)) in the modified path algorithm. It proves λ2(l+1)=τ(l+1)/2.

Theorem 1 offers a novel interpretation of mPath‐FLSA. The mPath‐FLSA lacks an overall objective function, and it finds the next fusion block sequentially by solving independent localised FLSA problems for two adjacent blocks, called local fused lasso signal approximator (LFLSA). Henceforth, we refer to the mPath‐FLSA by Son and Lim (2019) as Path‐LFLSA. Moreover, the hitting times for each sub‐problems can be considered as the distance between the two blocks. Consequently, Path‐LFLSA can be considered as a hierarchical clustering algorithm for {1,2,,nl} with distance metric ρ(i,j)=δ(B^i(l),B^j(l)) if |ij|=1 and ρ(i,j)= if |ij|>1.

Furthermore, this interpretation can help identify conditions that violate the monotonicity in the hitting times of Path‐LFLSA. Theorem 2 shows the conditions for the occurrence of the non‐monotonic sequence terms.

Theorem 2

Let λ2(l)=min2jnl+1ηj(l1)/2 be the lth hitting time of the Path‐LFLSA with y=(y1,,yn) and λ2(0)=0. Suppose that k=argmin2jnlηj(l1), and the partition set P(l)={B^j(l)}j=1nl, where B^i(l)=B^i(l1) for 1ik2, B^k1(l)=B^k1(l1)B^k(l1), and B^i(l)=B^i+1(l1) for kinl. Then, the next hitting time λ2(l+1) is less than λ2(l) if ηk1(l+1)/2<λ2(l) or ηk(l+1)/2<λ2(l). Moreover, the violation λ2(l)>λ2(l+1) can be checked with the solution at λ2(l1). Let ωk(l1)=b^k(l1)/(b^k1(l1)+b^k(l1)). The violation occurs if and only if the following inequality hold for m=k2 (2<knl) or k+1 (2k<nl):

|y¯k(l1)y¯k1(l1)|>b^m(l1)(b^k1(l1)+b^k(l1))2b^k1(l1)b^k(l1)j{m,k1,k}b^j(l1)(1ωk(l1))y¯k1(l1)+ωk(l1)y¯k(l1)y¯m(l1). (21)

Due to the definition of λ2(l), λ2(l)ηj(l1)/2 for 2jnl+1. After lth hitting time, the set B^k1(l) is defined by the union of B^k1(l1) and B^k(l1), and the other blocks remain the same as in the previous partition P(l1). Thus, the pairs (B^j1(l),B^j(l)) for j=2,,k2,k+1,,nl also remain the same as in the previous partition set P(l1), which implies ηj(l)=ηj(l1)λ2(l) for jk1,k. Therefore, the next hitting time λ2(l+1) decreases only when ηk1(l+1)/2<λ2(l) or ηk(l+1)/2<λ2(l). Moreover, based on this assumption, we can express the lth hitting time as λ2(l)=ηk(l1)/2=|y¯k(l1)y¯k1(l1)|/(2/b^k(l1)+2/b^k1(l1)). Because y¯k2(l)=y¯k2(l1), y¯k1(l)=(1ωk(l1))y¯k1(l1)+ωk(l1)y¯k(l1), and y¯k(l)=y¯k+1(l1), we can express ηk1(l+1) and ηk(l+1) as ηk1(l+1)=|(1ωk(l1))y¯k1(l1)+ωk(l1)y¯k(l1)y¯k2(l1)|/(1/(b^k(l1)+b^k1(l1))+1/b^k2(l1)) and ηk(l+1)=|(1ωk(l1))y¯k1(l1)+ωk(l1)y¯k(l1)y¯k+1(l1)|/(1/(b^k(l1)+b^k1(l1))+1/b^k+1(l1)), respectively. Applying these observations to 2λ2(l)/ηk1(l+1)>1 and 2λ2(l)/ηk(l+1)>1, we obtain the condition (21).

To describe the violation λ2(l)>λ2(l+1), we consider three independent random variables (Y1,Y2,Y3) from the standard normal distribution N(0,1). Corollary 1 shows that the probability of the violation λ2(1)>λ2(2) is 4P((4Y1+Y2)/5<Y3<Y1)=4EΦ33Z|Z>00.121 when we apply the Path‐LFLSA to (Y1,Y2,Y3), where ZN(0,1) and Φ(z) is the cumulative distribution function of the standard normal distribution.

Corollary 1

Suppose Y1,Y2,Y3 are random samples from N(0,1). Let Z be a standard normal random variable and Φ(z) be the cumulative distribution function of Z. Let λ2(1)=minj=2,3ηj(0)/2 and λ2(2)=η2(1)/2 be the first and the second hitting times of the Path‐LFLSA with λ2(0)=0, respectively. Then, the probability P(λ2(1)>λ2(2)) is equal to 4P((4Y1+Y2)/5<Y3<Y1). In addition, P(λ2(1)>λ2(2)) can be calculated from the univariate conditional expectation 4EΦ33Z|Z>0.

A proof of Corollary 1 can be found in Appendix A of the supplementary material. The findings of this study reveal that the mPath‐FLSA, also called the Path‐LFLSA, is the clustering of the observed point based on the distance defined by the minimum of the next hitting times of the localised FLSA problems. We suggest using the indices l=0,1,,n1 of hitting times {λ2(l)} to represent the solution path of the Path‐LFLSA instead of directly drawing the solution path along λ2, which avoids falsely split points in the solution path. This phenomenon is illustrated in Section 5.

4. PATHWISE ADAPTIVE FLSA

The Path‐LFLSA motivated us to propose the pathwise adaptive FLSA, which is a weighted FLSA with pathwise adaptive weights and guarantees the monotonicity of the hitting times. Specifically, let λ2(l) be the lth hitting time and P(l)={B^j(l)}j=1nl be a partition of {1,2,,n} at λ2(l) such that B^i(l)B^j(l)= for ij and j=1nlB^j(l)={1,,n}. For λ2(λ2(l),λ2(l+1)], the pathwise adaptive FLSA (PA‐FLSA) minimises

fPA(μ;λ2)=12i=1n(yiμi)2+λ2j=2nwμ,j(l)|μjμj1|, (22)

where

wμ,j(l)=1|y¯t(l)y¯t1(l)|ifjB^t(l),j1B^t1(l),|y¯t(l)y¯t1(l)|>1MMotherwise

and M>0 is a sufficiently large weight that fuses the corresponding two parameters. Equivalently, we can express the PA‐FLSA problem in the following reduced form:

fl(ν1,,νnl;λ2)=12j=1nliB^j(l)(yiνj)2+λ2j=2nlwj(l)|νjνj1|, (23)

where wj(l)=|y¯j(l)y¯j1(l)|1 if |y¯j(l)y¯j1(l)|>M1 and M otherwise. Note that M>0 is also used in the reduced form to avoid the division by zero besides the numerical instability. For example, we set M=100 in our numerical study.

We now investigate the properties of PA‐FLSA. First, we express the solution of the PA‐FLSA in Lemma 1.

Lemma 1

Suppose that there are n intervals [λ2(0),λ2(1)), [λ2(1),λ2(2)), , [λ2(n),) with λ2(0)=0, where the fused set is unchanged in each interval. For λ2[λ2(l),λ2(l+1)), the solution of the pathwise adaptive FLSA estimator of (23) is given by

μ^iPA(λ2)=k=1nlν^k(l)(λ2)I(iB^k(l)),

where ν^k(l)=y¯k(l)+c^k(l)(λ2), b^k(l)=|B^k(l)|, |x| is the subdifferential of |x|, and

c^k(l)(λ2)=λ2b^k(l)wk+1(l)|ν^k+1(l)ν^k(l)|ifk=1λ2b^k(l)wk(l)|ν^k(l)ν^k1(l)|wk+1(l)|ν^k+1(l)ν^k(l)|if2knl1λ2b^k(l)wk(l)|ν^k(l)ν^k1(l)|ifk=nl

The subdifferential of fl(·) gives the proof of Lemma 1 directly (Bertsekas, 1999).

Based on Lemma 1, we can anticipate the PA‐FLSA to avoid the inconsistency of the FLSA in the presence of the stair‐case blocks. Specifically, for the FLSA, the bias term c^k is zero for the stair‐case blocks (i.e. (ν^kν^k1)(ν^k+1ν^k)>0), implying that the FLSA fails to move the estimates of the stair‐case blocks. However, for the PA‐FLSA, the c^k(l) for 2knl1 is defined as:

c^k(l)(λ2)=λ2b^k(l)wk(l)+wk+1(l)ifν^k1(l)>ν^k(l)andν^k(l)<ν^k+1(l)(local min.)λ2b^k(l)wk(l)+wk+1(l)ifν^k1(l)<ν^k(l)andν^k(l)>ν^k+1(l)(local max.)λ2b^k(l)wk(l)wk+1(l)ifν^k1(l)<ν^k(l)andν^k(l)<ν^k+1(l)(increasing stair‐block)λ2b^k(l)wk(l)wk+1(l)ifν^k1(l)>ν^k(l)andν^k(l)>ν^k+1(l)(decreasing stair‐block).

Thus, the PA‐FLSA has a nonzero bias term for the stair‐case blocks unless |y¯k(l)y¯k1(l)|=|y¯k+1(l)y¯k(l)|.

Second, we investigate the monotone fusion property of the PA‐FLSA, which denotes that μ^j and μ^j+1 remain fused at λ2>λ2 if μ^j and μ^j+1 are fused at λ2. The monotone fusion property for the FLSA was proved in Proposition 2 of Friedman et al. (2007). We show that the PA‐FLSA holds the monotone fusion property similar to the FLSA in Proposition 1. However, the PA‐FLSA adopts the weighted fusion penalty with the pathwise adaptive weights.

Proposition 1

In the PA‐FLSA, two parameters that are fused in the solution for λ2 are fused for all λ2>λ2.

Appendix B of the supplementary material contains a proof of Proposition 1.

Finally, we show that the solution path of the PA‐FLSA is piecewise linear in the given interval [λ2(l),λ2(l+1)) and discontinuous at the hitting times, which changes the pathwise adaptive weights in the Theorem 3 mentioned below.

Theorem 3

The solution path of the PA‐FLSA is a piecewise linear function of λ2 and discontinuous at the hitting times λ2(l), which changes the pathwise adaptive weights. The next hitting time λ2(l+1) can be obtained from the current solution at λ2(l) by

λ2(l+1)=λ2(l)+min2knlhk,k1(l)>0hk,k1(l),wherehk,k1(l)=ν^k(l)ν^k1(l)ν^k1(l)λ2ν^k(l)λ2. (24)

The partition {B^j(l)}j=1nl is unchanged for λ2[λ2(l),λ2(l+1)). Based on Lemma 1, we can represent the derivative of the solution with respect to λ2 for a given interval [λ2(l),λ2(l+1)) as

ν^kλ2=(b^k(l))1wk+1(l)sign(ν^k+1ν^k)fork=1(b^k(l))1wk(l)sign(ν^kν^k1)wk+1(l)sign(ν^k+1ν^k)fork=2,,nl1(b^k(l))1wk(l)sign(ν^kν^k1)fork=nl.

Thus, the solution paths of ν^k for k=1,,nl along λ2 are linear in the given interval [λ2(l),λ2(l+1)) because {wk(l)}k=1nl and {sign(ν^k+1ν^k)}k=1nl are unchanged. Based on this linearity, for λ2[λ2(l),λ2(l+1)), we can obtain the solution at λ2 using the equation:

ν^k(λ2)=ν^k(l)+ανkλ2,

where ν^k(l) is the minimiser of (23) at λ2=λ2(l) and α=λ2λ2(l)[0,λ2(l+1)λ2(l)). We can find the next hitting time hk,k1(l) for a pair (ν^k(λ2),ν^k1(λ2)) of two solution paths from the equation ν^k(λ2)=ν^k1(λ2), for k=2,,nl, as

hk,k1(l)=ν^k(l)ν^k1(l)ν^k1λ2ν^kλ2(0,λ2(l+1)λ2(l)].

Because these hitting times are only valid for the unchanged partition {B^j(l)}j=1nl (i.e. there is no fusion for λ2(λ2(l),λ2(l+1))), the next hitting time λ2(l+1) should satisfy the following equation:

λ2(l+1)λ2(l)=min2knlhk,k1(l)>0hk,k1(l).

Consider two PA‐FLSA problems with (wj(l))j=1n1 and (wj(l+1))j=1nl1 at λ2=λ2(l+1) to show the discontinuity of the solution path of the PA‐FLSA. We denote the first problem as the PA‐FLSA( l) and the second problem as the PA‐FLSA( l+1) to distinguish the two problems. Suppose that 3<q<nl and ν^q(l)(λ2) and ν^q1(l)(λ2) of the PA‐FLSA( l) are fused at λ2(l). Let s(ν^k(l),ν^k1(l))=|ν^k(l)ν^k1(l)|, b^[q1:q](l)=b^q1(l)+b^q(l), and y¯[q1:q](l)=(b^q1(l)y¯q1(l)+b^q(l)y¯q(l))/b^[q1:q]. Based on Lemma 1, we can express the solution ν^(l)(λ2(l+1)) of the PA‐FLSA( l) as

ν^k(l)(λ2(l+1))=y¯1(l)+λ2(l+1)b^1(l)w2(l)s(ν^2(l),ν^1(l))ifk=1y¯k(l)λ2(l+1)b^k(l)wk(l)s(ν^k(l),ν^k1(l))wk+1(l)s(ν^k+1(l),ν^k(l))ifk1,q1,q,nly¯[q1:q](l)λ2(l+1)b^[q1:q](l)wq1(l)s(ν^q1(l),ν^q2(l))wq+1(l)s(ν^q+1(l),ν^q(l))ifk=q1,qy¯nl(l)λ2(l+1)b^nl(l)wnl(l)s(ν^nl(l),ν^nl1(l))ifk=nl

Based on this assumption, we can represent the partition B^k(l+1) into B^k(l+1)=B^k(l) for 1kq2, B^q1(l+1)=B^q1(l)B^q(l), and B^k(l+1)=B^k+1(l) for qknl1. For notational simplicity, in the proof, we use the definition of the pathwise adaptive weight as wk(l)=|y¯k(l)y¯k1(l)|1. Thus, the following equations hold:

y¯k(l+1)=y¯k(l)for1kq2y¯[q1:q](l)fork=q1y¯k+1(l)forqknl1,wk(l+1)=wk(l)for2kq2|y¯[q1:q](l)y¯q2(l)|1fork=q1|y¯q+1(l)y¯[q1:q](l)|1fork=qwk+1(l)forq+1knl1.

Then, by Lemma 1, the solution ν^(l+1)(λ2(l+1)) of the PA‐FLSA( l+1) can be represented as

ν^k(l+1)(λ2(l+1))=y¯k(l)+λ2(l+1)b^k(l)w2(l)s(ν^2(l+1),ν^1(l+1))ifk=1y¯k(l)λ2(l+1)b^k(l)wk(l)s(ν^k(l+1),ν^k1(l+1))wk+1(l)s(ν^k+1(l+1),ν^k(l+1))if2kq3y¯k(l)λ2(l+1)b^k(l)wk(l)s(ν^k(l+1),ν^k1(l+1))wk+1(l+1)s(ν^k+1(l+1),ν^k(l+1))ifk=q2y¯[q1:q](l)λ2(l+1)b^[q1:q](l)wq1(l+1)s(ν^q1(l+1),ν^q2(l+1))wq(l+1)s(ν^q(l+1),ν^q1(l+1))ifk=q1y¯q+1(l)λ2(l+1)b^q+1(l)wq(l+1)s(ν^q(l+1),ν^q1(l+1))wq+2(l)s(ν^q+1(l+1),ν^q(l+1))ifk=qy¯k+1(l)λ2(l+1)b^k+1(l)wk+1(l)s(ν^k(l+1),ν^k1(l+1))wk+2(l)s(ν^k+1(l+1),ν^k(l+1))ifq<k<nl1y¯nl(l)λ2(l+1)b^nl(l)wnl(l)s(ν^nl1(l+1),ν^nl2(l+1))ifk=nl1

Let μ^i(l)(λ2(l+1))=j=1nlν^j(l)(λ2(l+1))I(iB^j(l)) and μ^(l+1)(λ2(l+1))=j=1nl1ν^j(l+1)(λ2(l+1))I(iB^j(l+1)). A comparison of the two solutions μ^(l)(λ2(l+1)) and μ^(l+1)(λ2(l+1)) can easily spot that the absolute differences between |μi(l)(λ2(l+1))μi(l+1)(λ2(l+1))| for ij=q2qB^j(l+1) are nonzero if wq1(l)wq1(l+1) and wq+1(l)wq(l+1). This completes the proof of the discontinuity of the solution path of the PA‐FLSA.

The results in Theorem 3 show that the hitting times λ2(l) of the PA‐FLSA increase monotonically. Unlike the original FLSA, the solution path of the PA‐FLSA is discontinuous at hitting times that change the pathwise adaptive weights. However, this discontinuity does not affect PA‐FLSA's performance in estimating true signal levels and identifying the change points. We will examine these phenomena in the numerical study.

5. NUMERICAL STUDY

This section presents a comprehensive numerical study. First, we numerically investigate the properties we have explored in the previous sections, including the violation of the monotone increasing property for the hitting times of the Path‐LFLSA and the discontinuity of the PA‐FLSA's solution path. Second, we numerically compare the probabilities of the exact pattern recovery of the four methods (Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA) under four scenarios. The first three scenarios are based on Qian & Jia (2016), and the fourth is novel for this study. We also conduct a comparison of the performances of the four methods in estimating the signal levels and identifying the block structures with the optimal tuning parameters chosen by the Bayesian information criterion (BIC) (Schwarz, 1978) and the extended BIC (EBIC) (Chen & Chen, 2008) in Appendix C of the supplementary material.

5.1. Comparison of Whole Solution Paths

In this subsection, we compare the whole solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. We consider yi=μi+ϵi, ϵiN(0,σ2), μ=(μ1,,μ8)T, and σ=0.25 to generate the underlying signal and noisy observations. The true mean value is set to μi=(0.5,0.5,0,0,0.5,0.5)T and the noisy observation is generated as y=(0.4314,0.4000,0.2140,0.5188,0.2379,0.4435)T. We applied the four methods to obtain the whole solution paths along λ2. The solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are depicted in Figure 1. The solution path of the Path‐LFLSA seems to have a split event around λ2=0.2, while the Path‐LFLSA actually has fusion events only. We also report the hitting times of the solution paths from Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA in Table 1 for details.

FIGURE 1.

FIGURE 1

Solution paths of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for μ=(μ1,,μ6)T

TABLE 1.

Hitting times of the whole solution paths from the four methods with the noisy observation y with λ2(0)=0

Method
λ2(1)
λ2(2)
λ2(3)
λ2(4)
λ2(5)
Path‐FLSA 0.0314 0.1832 0.2056 0.5266 0.8330
PCD‐FLSA 0.0314 0.2056 0.6140 0.7328 0.7567
Path‐LFLSA 0.0078 0.0514 0.1832 0.1317 0.4165
PA‐FLSA 0.0314 0.0521 0.1322 0.1838 0.2682

Table 1 shows that the fourth hitting time λ2(4)=0.1317 is less than the third hitting time λ2(3)=0.1832 for Path‐LFLSA, violating the monotone increasing property of the hitting times. Contrarily, the other solution path algorithms satisfy the monotone increasing property of the hitting times. As demonstrated in Theorem 2, this violation λ2(3)>λ2(4) can be checked with the solution at the second hitting time λ2=λ2(2). At λ2=λ2(2), y¯(2)=(0.4157,0.2140,0.5188,0.3407)T, b^(2)=(2,1,1,2)T, η(2)=(η2(2),η3(2),η4(2))=(0.2099,0.1832,0.2865)T. Thus, k=argminj=2,3,4ηj(2)=3, w3(2)=b^3(2)/(b^2(2)+b^3(2))=0.5, and m=k2=1 or m=k+1=4 in the condition (21). Thus, the following condition (21) holds for m=1:

0.7328=|y¯3(2)y¯2(2)|>b^1(2)(b^2(2)+b^3(2))2b^2(2)b^3(2)j=13b^j(2)(y¯2(2)+y¯3(2))/2y¯1(2)=0.5266.

As mentioned in Section 3, we suggest drawing the solution path of the Path‐LFLSA along indices of its hitting times, which is equivalent to the order of fusion events. We depict the solution paths along λ2 and the indices of the hitting times in Figure 2. The solution path along the indices of the hitting times (Figure 2 (b)) is more readable than the solution path along λ2 (Figure 2 (a)).

FIGURE 2.

FIGURE 2

Solution paths of Path‐LFLSA along λ2 and indices of hitting times for μ=(μ1,,μ6)T

Figure 1 also shows that the solution path of the PA‐FLSA has discontinuous points at the hitting times. For example, the values of ν^(2)(λ2(3)) and ν^(3)(λ2(3)) in the proof of Theorem 3 are different, where ν^(2)(λ2(3))=(0.3107,0.1764,0.2597)T and ν^(3)(λ2(3))=(0.3107,0.1672,0.2505)T. Table 1 shows that the hitting times of the PA‐FLSA are monotonically increasing, supporting Theorem 3.

Finally, we compared the fused sets at the hitting times. From Figure 1, the partitions from the Path‐FLSA are obtained as B^(1)={{1,2},{3},{4},{5},{6}}, B^(2)={{1,2},{3,4},{5},{6}}, B^(3)={{1,2},{3,4},{5,6}}, B^(4)={{1,2,3,4},{5,6}}, and B^(5)={{1,2,3,4,5,6}} for λ2(1),,λ2(5), respectively. Path‐LFLSA and PA‐FLSA have the same partitions B^(1)={{1,2},{3},{4},{5},{6}}, B^(2)={{1,2},{3},{4},{5,6}}, B^(3)={{1,2},{3,4},{5,6}}, B^(4)={{1,2,3,4},{5,6}}, and B^(5)={{1,2,3,4,5,6}} for λ2(1),,λ2(5), respectively. Thus, we observe that Path‐FLSA, Path‐LFLSA, and PA‐FLSA contain the true partition {{1,2},{3,4},{5,6}}. Contrarily, the partitions from the PCD‐FLSA are obtained as B^(1)={{1,2},{3},{4},{5},{6}}, B^(2)={{1,2},{3},{4},{5,6}}, B^(3)={{1,2,3},{4},{5,6}}, B^(4)={{1,2,3,4},{5,6}}, and B^(5)={{1,2,3,4,5,6}} for λ2(1),,λ2(5), respectively. The PCD‐FLSA fails to contain the true partition, and this difference is because of the Puffer transformation in the PCD‐FLSA. Note that the last observation describes the estimates of the methods from the one data set y only, not implying that the PCD‐FLSA generally fails to contain the true partition. In the following subsection, we evaluate the exact pattern recovery probabilities estimated by 1000 data sets to compare the general performance of containing the true partition.

5.2. Comparison of Exact Pattern Recovery Probabilities

In this section, we compare the probabilities of the exact pattern recovery of the four methods similar to Qian & Jia (2016), where the exact pattern recovery indicates that the solution path of the given method contains the solution of the true block structures. We considered four scenarios for the underlying block structures. Three scenarios are taken directly from Qian & Jia (2016) to reproduce their results and conduct a fair comparison, in which two scenarios have no stair‐case blocks, and one scenario has stair‐case blocks. Besides, the last scenario for our numerical study was a true block structure with multiple stair‐case blocks. These four scenarios can be divided into two categories based on the existence of the stair‐case blocks, which is equivalent to the failure of the irrepresentable condition of the transformed Lasso in (13). Again, for a fair comparison, we set n=430 and consider σ=0.05,0,10,,0.5 like Qian & Jia (2016). The noisy observations are from yi=μi+ϵi, where ϵiN(0,σ2) and μi are specified case by case. We generated 1000 data sets and checked the inclusion of the true block structures with the given solution path to measure the probability of the exact pattern recovery. To obtain the confidence intervals of the exact pattern recovery probabilities, we repeat the procedure for estimating the exact pattern recovery 50 times as well.

5.2.1. Two scenarios without stair‐case blocks

First, we considered the two true signal structures μ(1) and μ(2) used in Qian & Jia (2016) for the scenarios S1 and S2, respectively. The two true signal vectors for S1 and S2 are respectively defined as

μi(1)=01i40241i80181i1203121i1600161i2002201i2400241i2802281i3201321i3603361i4000401i430andμi(2)=01i15216i30031i60261i1200121i2102211i2400241i2552256i3700371i3852386i4000401i430.

With these two true signal vectors, the left hand sides (LHS) of the irrepresentable condition of the transformed Lasso are 0.975 and 0.9826 for μ(1) and μ(2), respectively. These values denote that the irrepresentation conditions hold for two signal vectors because there exists a constant η such that 0<η1LHS1. Figure 3 depicts the true signals and noisy observations for two scenarios with σ=0.25. Scenarios S1 and S2 have ten change points in the true signal. The differences between S1 and S2 are the lengths of the blocks and signal levels.

FIGURE 3.

FIGURE 3

True signals and noisy observations for two scenarios S1 and S2

Figure 4 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for S1 and S2. We confirmed that the probabilities of the exact pattern recovery of Path‐FLSA and PCD‐FLSA have the same shapes as in the simulation results in Qian & Jia (2016). Moreover, Figure 4 shows an interesting finding that the estimated probabilities of the exact pattern recovery of Path‐LFLSA and PA‐FLSA are almost similar and greater than those of Path‐FLSA and PCD‐FLSA for σ0.25. Specifically, for σ=0.5 and case S2, the probabilities of the exact pattern recovery for Path‐LFLSA and PA‐FLSA are estimated as 0.422 and 0.458, respectively, while those of Path‐FLSA and PCD‐FLSA are zero. For S1 and S2, we further observe that PA‐FLSA has a larger pattern recovery probability than Path‐LFLSA for relatively large σ levels (σ0.35).

FIGURE 4.

FIGURE 4

Plots of the estimated probabilities of the exact pattern recovery under σ=0.05,0.10,,0.5 for S1 and S2. Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities

5.2.2. Two scenarios with stair‐case blocks

Second, we consider two true signal structures μ(3) and μ(4) for the scenarios S3 and S4, respectively, where μ(3) is from Qian & Jia (2016) and μ(4) is a novel case with multiple stair‐case blocks. The true signal vectors μ(3) and μ(4) for S3 and S4 are defined as follows:

μi(3)=01i1002101i1100.1111i2102211i2200.1221i3202321i3300331i430andμi(4)=2.41i501.851i1001.2101i1500.6151i2000201i2500.6251i3001.2301i3501.8351i4002.4401i430.

As in the cases of S1 and S2, we calculated the LHS of the irrepresentable condition of the transformed Lasso for S3 and S4. Both the LHS values for μ(3) and μ(4) were calculated as 1.0. These LHS values indicate that the irrepresentation conditions failed for two signal vectors because a constant η such that 0<η1LHS=0 does not exist. Figure 5 depicts the true signals and noisy observations for two scenarios S3 and S4 with σ=0.25. Scenarios S3 and S4 have six and ten change points, respectively.

FIGURE 5.

FIGURE 5

True signals and noisy observations for two scenarios S3 and S4

Figure 6 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for S3 and S4. We can compare the characteristics of the four methods when the true signal has the stair‐case blocks. First, Path‐FLSA fails totally in finding the exact pattern (i.e. true block structure) for S3 and S4 because the estimated probabilities of the exact pattern recovery of Path‐FLSA for S3 and S4 are zero for all σ=0.05,,0.5. This observation supports the inconsistency of the FLSA under the existence of the stair‐case blocks. Second, Path‐LFLSA and PA‐FLSA are more robust to the noise level (i.e. error variance) than PCD‐FLSA. The estimated exact pattern recovery probabilities of Path‐LFLSA and PA‐FLSA decreased slower that of PCD‐FLSA. For example, the exact pattern recovery probability of PCD‐FLSA decreased from 1.000 at σ=0.05 to 0.001 at σ=0.15 while that of Path‐LFLSA and PA‐FLSA decreased from 1.000 and 1.000 at σ=0.05 to 0.549 and 0.595 at σ=0.15 for S4, respectively.

FIGURE 6.

FIGURE 6

Plots of the estimated probabilities of the exact pattern recovery under σ=0.05,0.10,,0.5 for S3 and S4. Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities

Overall, Path‐LFLSA and PA‐FLSA outperformed Path‐FLSA and PCD‐FLSA concerning the exact pattern recovery probability for all cases from S1 to S4. For Path‐LFLSA and PA‐FLSA, PA‐FLSA (Path‐LFLSA) was better than Path‐LFLSA (PA‐FLSA) for S1, S2, and S4 (S3), but the differences were small.

6. APPLICATION TO COVID‐19 INFECTION IN KOREA

In this section, we apply the four methods Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA to the daily‐confirmed cases of COVID‐19 infection in Korea. We downloaded a dataset of COVID‐19 infection status from the Public Data Portal (http://data.go.kr) by using OpenAPI through https://openapi.data.go.kr/openapi/service/rest/Covid19/getCovid19InfStateJson with the authorised key. The downloaded dataset contained 9 variables, including date of state (stateDt), a time of state (stateTime), and the number of cumulative confirmed cases of COVID‐19 (decideCnt). We set a target period as a period from 03‐01‐2020 to 03/31/2022 and used the cases whose times of state were 0:00. To obtain the daily‐confirmed cases of COVID‐19, we applied the first differencing to the number of cumulative confirmed cases, where the difference between the cumulative confirmed cases of COVID‐19 stated at 0:00 on 03/01/2020 and 03/02/2020 as the daily‐confirmed cases occurred on 03/01/2020.

We considered a logarithmic transformation to stabilise the variance of the observation before applying the four methods because the FLSA‐based procedures are sensitive to the noise level, as shown in Section 5. In addition, the number of daily confirmed cases is dramatically increasing from January 2022 in Korea. For example, the number of daily confirmed cases was 621,204 on March 16, 2022, while the number of daily confirmed patients was 3831 on January 1, 2022. Figure 7 depicts the log‐transformed daily‐confirmed cases of the target period with a gray line.

FIGURE 7.

FIGURE 7

Plots of the logarithmic transformed daily confirmed cases of COVID‐19 (Inline graphic), the estimates of Path‐FLSA (Inline graphic), PCD‐FLSA (Inline graphic), Path‐LFLSA (Inline graphic), and PA‐FLSA (Inline graphic) with the EBIC

To select the optimal tuning parameter, we used the following EBIC proposed by Chen & Chen (2008), which showed better results for the FLSA as described in Appendix C of the supplementary material,

EBIC(λ2)=nlog(RSS(λ2))+J^(λ2)log(n)+lognJ^(λ2),

where RSS(λ2)=i=1n(yiμ^iDB(λ2))2, μ^iDB(λ2)=y¯B^j for iB^j(λ2), y¯B^j=|B^j(λ2)|1iB^j(λ2)yi, and J^(λ2) is the number of the estimated blocks at λ2.

Figure 7 depicts the estimates by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. The number of identified change points for Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA were 82, 172, 55, and 39, respectively. As shown in Figure 7 (a), the Path‐FLSA identified many change points in the periods that seems to have trends or stair‐case blocks and also missed several local changes. For example, the Path‐FLSA missed the change points for a local peak from 05/04/2020 to 05/06/2020, while both the Path‐LFLSA and PA‐FLSA identified the change points on 05/03/2020 and 05/06/2020. Figure 7 (b) shows that the PCD‐FLSA estimates seem to have many false‐positive change points and also identify much more change points compared with the others. As shown in the case of S4 having the stair‐case blocks, the pattern recovery probabilities of the Path‐FLSA and PCD‐FLSA decrease very quickly when the noise level increases. With the residual of the fitted models, the estimates of the error standard deviation by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are 0.24, 0.14, 0.18, and 0.20, respectively. Thus, this observation is also consistent with the comparison result of the pattern recovery probability for S4. From the above observations, we focused on the comparison of the identified change points by the Path‐LFLSA and PA‐FLSA. We also depict the histograms and Q‐Q plots of the residuals of the four methods in Appendix D of the supplementary material. From the figures in Appendix D of the supplementary material, the residuals of the four methods are not exactly following the normal distribution but seem to follow the normal distribution within the interval [1.5,1.5]. It is also worth noting that the FLSA tends to find many change points when the underlying signal has a trend because the FLSA model is adequate for the piecewise constant mean model. For identifying the trend‐change points, we refer to the 1 trend filtering method proposed by Kim et al. (2009), which is more suitable to identify sparse trend‐change points.

Table 2 reports the identified change points by Path‐LFLSA and PA‐FLSA with the corresponding indices, dates, and debiased estimates. The gray rows in Table 2 denote the identified change points by either the Path‐LFLSA or the PA‐FLSA only. There are 32 change points commonly identified by the Path‐LFLSA and PA‐FLSA, and there are also 23 (7) change points identified by the Path‐LFLSA (PA‐FLSA) only. Among the seven change points identified by the PA‐FLSA only, the six change points at 03/07/2020, 08/12/2020, 09/11/2020, 10/24/2020, 01/01/2021 and 11/01/2021 are closed to the six change points identified by the Path‐LFLSA only, where most of the differences are one or two days. The change point by the PA‐FLSA on 01/31/2022 seems to be an intermediate point within the period having a trend. For the change points by the Path‐LFLSA only except for the six change points corresponding to the change points by the PA‐FLSA only, most of the identified change points catch the local peaks.

TABLE 2.

The estimated change points of Path‐LFLSA and PA‐FLSA

Path‐LFLSA PA‐LFLSA Path‐LFLSA PA‐LFLSA
Index Date
μ^DB
Index Date
μ^DB
Index Date
μ^DB
Index Date
μ^DB
6 03/06/2020 535.34 306 12/31/2020 992.99
7 03/07/2020 507.43 307 01/01/2021 984.45
10 03/10/2020 232.84 10 03/10/2020 199.89 315 01/09/2021 757.46 315 01/09/2021 749.87
35 04/04/2020 97.92 35 04/04/2020 97.92 322 01/16/2021 526.01 322 01/16/2021 526.01
39 04/08/2020 47.23 39 04/08/2020 47.23 351 02/14/2021 393.59
48 04/17/2020 26.23 48 04/17/2020 26.23 355 02/18/2021 561.75
64 05/03/2020 9.89 64 05/03/2020 9.89 372 03/07/2021 392.68
67 05/06/2020 3.91 67 05/06/2020 3.91 401 04/05/2021 460.51 401 04/05/2021 424.31
69 05/08/2020 15.72 420 04/24/2021 663.27
75 05/14/2020 30.47 468 06/11/2021 589.20
86 05/25/2020 19.05 86 05/25/2020 21.65 478 06/21/2021 444.14
112 06/20/2020 45.73 492 07/05/2021 683.50 492 07/05/2021 599.00
117 06/25/2020 34.80 520 08/02/2021 1437.73 520 08/02/2021 1437.73
138 07/16/2020 51.02 571 09/22/2021 1760.66
141 07/19/2020 33.56 579 09/30/2021 2617.28
147 07/25/2020 60.68 147 07/25/2020 47.02 587 10/08/2021 2001.65
163 08/10/2020 31.85 604 10/25/2021 1422.00
165 08/12/2020 33.91 611 11/01/2021 1784.54
167 08/14/2020 85.90 167 08/14/2020 131.79 625 11/15/2021 2134.56 625 11/15/2021 2242.17
185 09/01/2020 296.16 185 09/01/2020 296.16 639 11/29/2021 3372.36 639 11/29/2021 3372.36
194 09/10/2020 162.48 646 12/06/2021 5002.52 646 12/06/2021 5002.52
195 09/11/2020 159.73 665 12/25/2021 6532.64 665 12/25/2021 6532.64
202 09/18/2020 121.88 202 09/18/2020 119.86 688 01/17/2022 978.03 688 01/17/2022 3978.03
210 09/26/2020 87.69 695 01/24/2022 7080.12 695 01/24/2022 7080.12
212 09/28/2020 44.60 702 01/31/2022 16194.46
234 10/20/2020 77.29 705 02/03/2022 18074.79 705 02/03/2022 23355.52
238 10/24/2020 79.16 709 02/07/2022 36739.06 709 02/07/2022 36739.06
256 11/11/2020 112.47 256 11/11/2020 115.98 716 02/14/2022 54343.44 716 02/14/2022 54343.44
261 11/16/2020 211.76 261 11/16/2020 211.76 723 02/21/2022 99131.02 723 02/21/2022 99131.02
269 11/24/2020 333.21 269 11/24/2020 333.21 730 02/28/2022 158836.04 730 02/28/2022 158836.04
285 12/10/2020 564.42 285 12/10/2020 564.42 737 03/07/2022 226706.55 737 03/07/2022 226706.55

Note: The rows in gray denote the change points found by only one method.

On the other hand, interestingly, most of the commonly identified change points correspond to the period of the social distancing announced by the Ministry of Health and Welfare of Korea. For example, April 4, 2020, corresponds to the end of the intensive social distancing period from March 22, 2020, to April 5, 2020. In addition, the time periods between change points are similar to the periods of the social distancing policy by the Ministry of Health and Welfare of Korea, where the policy periods are usually two or three weeks. Finally, we should address that the sequence of the daily confirmed cases is observed through time, and it seems to have time‐dependent trends. Thus, the main assumptions of the FLSA for the model selection consistency is hardly satisfied, and then the FLSA‐based methods do not guarantee the model selection consistency. For example, among the commonly identified change points, the change points within four periods (03/10/2020 ∼ 05/06/2020, 11/16/2020 ∼ 12/10/2020, 11/15/2021∼ 12/25/2025, 02/03/2022 ∼ 03/07/2022) seem to be intermediate points within the periods having either an increasing or a decreasing trend.

However, this COVID‐19 spread example addresses that the Path‐LFLSA and PA‐FLSA are still applicable to find the main change points caused by an external event in a stable period (i.e. a period with a low trend effect). For example, both Path‐LFLSA and PA‐FLSA succeeded in finding the beginning day of the second wave of the COVID‐19 pandemic in Korea on August 15, 2020, when a mass rally was held near Gwanghwamun Square in Seoul, where the identified change point on August 14, 2020, in Table 2 denotes that the underlying mean value was changed on August 15, 2020. In addition, the identified change point on November 11, 2020, is close to the beginning day (11/4/2020) of the third wave reported in Seong et al. (2021) and the identified change point at 07/05/2021 is also close to the beginning day (7/7/2021) of the fourth wave 1. Recently, the number of daily confirmed cases has been rapidly increasing, and the fifth wave had was begun on 01/26/2022 2. Both Path‐LFLSA and PA‐FLSA found the change points on 01/24/2022, which is close to the beginning day of the fifth wave. The identified change points related to the second to the fifth waves are highlighted in red in Table 2.

7. CONCLUSION

In this study, we provide a new interpretation of the modified path algorithm for the FLSA by discovering the exact optimisation problems corresponding to the modified path algorithm, called Path‐LFLSA. Our discovery demonstrates that the modified path algorithm's hitting times are not monotonically increasing, and the violation of the monotone increasing property for the next hitting time can be verified by comparing the solution from the previous hitting time. We propose a pathwise adaptive FLSA with a weighted fusion penalty to recover the monotonicity of the hitting times. The comprehensive numerical study illustrates the whole solution paths of the four methods, including three existing ones and the proposed PA‐FLSA, and it also shows that the Path‐LFLSA and PA‐FLSA are less sensitive to noise levels for pattern recovery than the Path‐FLSA and PCD‐FLSA. Furthermore, our numerical study in Appendix C of the supplementary material provides a practical guideline for choosing the optimal tuning parameters of the Path‐LFLSA and PA‐FLSA that outperform Path‐FLSA and PCD‐FLSA to identify the true block structures and estimate the true signal levels. The application of Path‐LFLSA and PA‐FLSA with the optimal tuning parameter selection by EBIC to the number of daily‐confirmed cases of COVID‐19 infection found the change points related to the beginning days of the COVID‐19 pandemic waves from the second to the fifth in Korea.

Supporting information

insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf

ACKNOWLEDGEMENTS

W. Son's research is supported by the National Research Foundation of Korea (No. 2020R1F1A1A01051039), J. Lim's research is supported by the National Research Foundation of Korea (NRF‐2021R1A2C1010786), and D. Yu's research is supported by the National Research Foundation of Korea (NRF‐2022R1A5A7033499) and Inha University Research Grant.

Son, W. , Lim, J. , and Yu, D. (2022) Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea. International Statistical Review, 10.1111/insr.12521.

Footnotes

1

Korea officially in COVID‐19 fourth wave, an article in Korea Herald available at https://www.koreaherald.com/view.php?ud%3D20210707000868

2

Daily COVID‐19 Cases Exceed 13,000, 5th Wave Beginning, an article in KBS World available at https://world.kbs.co.kr/service/news_view.htm?lang%3De&Seq_Code%3D167226

REFERENCES

  1. Bertsekas, D. (1999). Nonlinear Programming. Athena Scientific.
  2. Bradely Efron, B. , Hastie, T. , Johnstone, I. & Tibshirani, R. (2004). Least angle regression. Ann. Stat., 32, 407–499. [Google Scholar]
  3. Chen, J & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. [Google Scholar]
  4. Friedman, J. , Hastie, T. , Hoefling, H. & Tibshirani, R. (2007). Pathwise coordinate optimization. Ann. Appl. Stat., 1, 302–332. [Google Scholar]
  5. Fryzlewicz, P. (2014). Wild binary segmentation for multiple change‐point detection. Ann. Stat., 42(6), 2243–2281. [Google Scholar]
  6. Hoefling, H. (2010). A path algorithm for the fused Lasso signal approximator. J. Comput. Graph. Stat., 19, 984–1006. [Google Scholar]
  7. Jia, J. & Rohe, K. (2015). Preconditioning the lasso for sign consistency. Electr. J. Stat., 9, 1150–1172. [Google Scholar]
  8. Kim, S.J. , Koh, K. , Boyd, S. & Gorinevsky, D. (2009). 1 trend filtering. SIAM Review, 51(2), 339–360. [Google Scholar]
  9. Lin, K. , Sharpnack, J. , Rinaldo, A. & Tibshirani, R.J. 2016. Approximate recovery in changepoint problems, from 2, estimation error rates. arXiv preprint, arXiv:1606.06746.
  10. Olshen, A.B. , Venkatraman, E. , Lucito, R. & Wigler, M. (2004). Circular binary segmentation for the analysis of array‐based dna copy number data. Biostatistics, 5(4), 557–572. [DOI] [PubMed] [Google Scholar]
  11. Qian, J. & Jia, J. (2016). On stepwise pattern recovery of the fused Lasso. Comput. Stat. Data Anal., 94, 221–237. [Google Scholar]
  12. Rinaldo, A. (2009). Properties and refinements of the fused lasso. Ann. Stat., 37, 2922–2952. [Google Scholar]
  13. Rinaldo, A. 2014. Corrections to properties and refinements of the fused lasso. available at https://www.stat.cmu.edu/∼arinaldo/Fused_Correction.png
  14. Schwarz, G. (1978). Estimating the dimension of a model. Ann. Stat., 6, 461–464. [Google Scholar]
  15. Seong, H. , Hyun, H.J. , Yun, J.G. , Noh, J.Y. , Cheong, H.J. , Kim, W.J. & Song, J.Y. (2021). Comparison of the second and third waves of the COVID‐19 pandemic in South Korea: Importance of early public health intervention. Int. J. Infectious Disease, 104, 742–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Son, W. & Lim, J. (2019). Modified path algorithm of fused Lasso signal approximator for consistent recovery of change points. J. Stat. Plan. Inference, 200, 223–238. [Google Scholar]
  17. Tibshirani, R. , Saunders, M. , Rosset, S. , Zhu, J. & Knight, K. (2005). Sparsity and smoothness via the fused lasso. J. R. Stat. Soc.: Ser. B (Stat. Methodol.), 67, 91–108. [Google Scholar]
  18. Tibshirani, R.J. & Taylor, J. (2011). The solution path of the generalized lasso. Ann. Stat., 39(3), 1335–1371. [Google Scholar]
  19. Yao, Y.C. (1988). Estimating the number of change‐points via Schwarz' criterion. Stat. Probab. Lett., 6(3), 181–189. [Google Scholar]
  20. Yao, Y.C. & Au, S.T. (1989). Least‐squares estimation of a step function. Sankhyā Indian J. Stat. Ser. A, 51(3), 370–381. [Google Scholar]
  21. Zhao, P. & Yu, B. (2006). On model selection consistency of lasso. J. Mach. Learn. Res., 7, 2541. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf


Articles from International Statistical Review are provided here courtesy of Wiley

RESOURCES