Summary
The fused lasso signal approximator (FLSA) is a smoothing procedure for noisy observations that uses fused lasso penalty on unobserved mean levels to find sparse signal blocks. Several path algorithms have been developed to obtain the whole solution path of the FLSA. However, it is known that the FLSA has model selection inconsistency when the underlying signals have a stair‐case block, where three consecutive signal blocks are either strictly increasing or decreasing. Modified path algorithms for the FLSA have been proposed to guarantee model selection consistency regardless of the stair‐case block. In this paper, we provide a comprehensive review of the path algorithms for the FLSA and prove the properties of the recently modified path algorithms' hitting times. Specifically, we reinterpret the modified path algorithm as the path algorithm for local FLSA problems and reveal the condition that the hitting time for the fusion of the modified path algorithm is not monotone in a tuning parameter. To recover the monotonicity of the solution path, we propose a pathwise adaptive FLSA having monotonicity with similar performance as the modified solution path algorithm. Finally, we apply the proposed method to the number of daily‐confirmed cases of COVID‐19 in Korea to identify the change points of its spread.
Keywords: Change points, fused lasso signal approximator, modified path algorithm, pathwise adaptive weight, solution path
1. INTRODUCTION
Consider observations from a model
| (1) |
where are independently and identically from a distribution with mean 0 and variance . Here, we assume that true underlying signals have block structures with . Under this assumption, it is essential to identify the unknown block structures to estimate the signal means. This problem is known as multiple‐change‐point detection. The literature contains various methods on multiple‐change‐point detection, including best subset selection (Yao, 1988; Yao & Au, 1989), circular binary segmentation (CBS) (Olshen et al., 2004), and wild binary segmentation (WBS) (Fryzlewicz, 2014).
This study focuses on the fused lasso signal approximator (FLSA) that finds sparse signal blocks using the fused lasso penalty (Tibshirani et al., 2005) on underlying mean levels. The FLSA obtains the signal estimate by minimising
| (2) |
where , , is the ℓ 1‐norm of , and is the total‐variation norm of . Given and , the FLSA obtains the sparse block solution for , providing the estimates of both signal levels and block structures. As shown in Lemma A.1 in Friedman et al. (2007), the sparse estimate can be obtained directly from the minimiser of by the soft‐thresholding operator , where for .
Furthermore, the FLSA has the model selection consistency (i.e. exact identification of the true block structure) when the underlying true block structure has no stair‐case blocks, where a stair‐case block denotes that three consecutive blocks that are either strictly increasing or decreasing (Rinaldo, 2009, 2014; Qian & Jia, 2016). The FLSA estimator is inconsistent in identifying the true signal blocks when the true block structure contains any stair‐case blocks. The preconditioned FLSA via Puffer transformation (Qian & Jia, 2016) and the modified path algorithm for FLSA (Son & Lim, 2019) are proposed to resolve this inconsistency. Recently, the error analysis for the FLSA has been done in literature (Lin et al., 2016).
Besides the theoretical properties of the FLSA, it is critical to obtain its solution efficiently. The entire solution path of the FLSA can be obtained using either the path algorithm for the FLSA proposed by Hoefling (2010) or the path algorithm for the generalised lasso by Tibshirani & Taylor (2011). These two solution path algorithms solve the problem (2) but still show the model selection inconsistency for identifying true blocks with stair‐case blocks. Contrarily, two modified path algorithms have been developed recently to resolve the model selection inconsistency of the FLSA. The preconditioned FLSA with the Puffer transformation (Qian & Jia, 2016) converts the original problem into a Lasso problem with reparameterisation and applies the Puffer transformation. Essentially, the solution path algorithm for the preconditioned FLSA is based on the path algorithm for Lasso (Bradely Efron et al., 2004). The modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019) reinterprets the FLSA path algorithm using the distances between two adjacent blocks and proposes a new distance criterion that guarantees the model selection consistency regardless of a stair‐case block. Note that the original study does not provide the overall objective function of mPath‐FLSA.
Our main contributions through this study are as follows. First, we provide a comprehensive review of the FLSA path algorithms and prove the properties of the hitting times of the recently modified path algorithm. Specifically, we provide an explicit form of the objective function of mPath‐FLSA. Furthermore, we apply the new representation to show that the hitting times of mPath‐FLSA can fail to increase monotonically with respect to . For example, we suppose that there are four observations, , , , and . The hitting times for the FLSA solution path algorithm are , which satisfy the monotone property (i.e. ). However, the hitting times of mPath‐FLSA are calculated as , which are not monotone in . Section 3 provides additional details. Second, we propose a pathwise adaptive FLSA and its solution path algorithm to resolve the violation of the monotonicity of the solution path along . The proposed method adopts the weighted fusion penalty terms , which are adaptively defined by the solutions at the hitting times on the solution path along . Finally, we provide a comprehensive numerical comparison of the existing path algorithms with the proposed algorithm. We compare four methods based on the entire solution path along , the exact pattern recovery probability, and the estimation performance of the signal levels and block structures concerning the selection of tuning parameters.
The remainder of this paper is organised as follows. Section 2 contains a brief review of the existing solution path algorithms related to the FLSA. Section 3 presents a new interpretation of mPath‐FLSA besides the conditions for the non‐monotonicity of the solution path along . The pathwise adaptive FLSA and its solution path algorithm that satisfies the monotone property in the solution path along are proposed in Section 4. In Section 5, a numerical study for measuring the probability of the exact recovery was conducted using the proposed algorithm and the other existing algorithms. Moreover, we compared the estimation performance with the optimal tuning parameter chosen by the Bayesian information criterion (BIC) and the extended BIC (EBIC). In Section 6, we apply the proposed method and existing FLSA path algorithms to the number of daily‐confirmed cases of COVID‐19 in Korea to identify events that affect the COVID‐19 spread. Finally, we conclude the paper with some remarks.
2. EXISTING SOLUTION PATH ALGORITHMS FOR FLSA
In this section, we review three existing solution path algorithms related to the FLSA; the path algorithm for the FLSA (Path‐FLSA) by Hoefling (2010), the path algorithm for the preconditioned FLSA (Path‐PFLSA) by Qian & Jia (2016), the modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019). Henceforth, to focus on the identification of true mean blocks, we consider the FLSA with as follows:
| (3) |
Note that we omit the review of the solution path algorithm for the generalised Lasso by Tibshirani & Taylor (2011) because its solution path for the FLSA is the same as Path‐FLSA.
2.1. Path Algorithm for the FLSA (Path‐FLSA)
The FLSA path algorithm proposed by Hoefling (2010) provides a whole solution path of the FLSA along based on the fact that the solution path for is a piece‐wise linear in . Originally, Hoefling (2010) proposes a path algorithm for the generalised FLSA problem
| (4) |
where denotes a set of index pairs that correspond to fusion penalty terms. Here, we focus on the path algorithm for the one‐dimensional FLSA as in (3) (i.e. ).
Specifically, let be a partition of defined from the solution of the FLSA at , where is the number of change points in . Further, we let be hitting times on the solution path, with two adjacent sets in the partition defined by the previous hitting time fused on each of them. For example, there is an index on , such that and are fused as . Thus, the solution for can be expressed explicitly as:
| (5) |
where , ,
Now, suppose that we know a partition by , such that , and consider , where there is no fusion in the FLSA estimate. The FLSA solution satisfies the following equation:
| (6) |
where is a subdifferential of , which is defined as if , −1 if , and if . We can easily obtain the derivative of with respect to from (6) as
| (7) |
Thus, is linear in because the numerator and denominator of (7) are unchanged for the given interval . This property shows that the FLSA solution path is piecewise linear in .
Given the solution on the hitting time and the derivative , the solution for can be easily obtained as
| (8) |
where . Based on (8), the next hitting time is defined as
| (9) |
Given the Eqs. 8 and (9), the Path‐FLSA starts with and sequentially calculates the hitting times. The Path‐FLSA stores all solutions at hitting times and their derivatives to obtain the solution for any with (8). Note that the FLSA solution path for generalised fusion penalty could be split for some edge set . For example, two adjacent solutions and for are fused at (i.e. and ) and split at for some (i.e. and ) with the generalised fusion penalty. However, the one‐dimensional FLSA (i.e. is an edge set of a chain graph) only has hitting events on the solution path, as shown by Friedman et al. (2007) in Proposition 2.
2.2. Preconditioned FLSA with Puffer Transformation (PCD‐FLSA)
As explained in the Introduction, the FLSA faces the model selection inconsistency due to stair‐case blocks in the underlying signals. Qian & Jia (2016) connect the FLSA model selection inconsistency to the irrepresentable condition of the Lasso problem (Zhao & Yu, 2006) by the reparameterisation of the FLSA. To obtain the model selection consistency, Qian & Jia (2016) applied the Puffer transformation introduced by Jia & Rohe (2015) to the reparameterised FLSA, which is called the preconditioned FLSA with Puffer transformation (PCD‐FLSA).
Specifically, consider the reparameterisation of with as follows:
| (10) |
where is a lower triangular matrix with nonzero elements equal to one. This reparameterisation can represent the objective function of the FLSA as
| (11) |
where . The solution of (11) can be obtained by
| (12) |
where , , is an ‐dimensional vector of ones, , , , and , which denotes that is an matrix defined by removing the first column of . Henceforth, we refer to the reparameterised FLSA problem (12) as the transformed Lasso.
Based on the reparameterisation, the pattern recovery for using the FLSA is equivalent to the sign consistency of the Lasso estimator for . For a linear model with , the conditions for the sign consistency of the Lasso estimator is well understood. One of the necessary conditions of the sign consistency of the Lasso estimator is the irrepresentable condition (Zhao & Yu, 2006) defined as follows:
| (13) |
where . Theorem 2 of Qian & Jia (2016) shows that the irrepresentable condition (13) holds if and only if one of the following two conditions holds.
-
(1)
For a set of change points , or .
-
(2)
The underlying signal has no stair‐case blocks. That is, .
The abovementioned condition (2) shows the inconsistency of the FLSA in identifying true change points when underlying signals have stair‐case blocks.
Qian & Jia (2016) proposed a preconditioned FLSA with a Puffer transformation to resolve the inconsistency of the FLSA. The Puffer transformation for Lasso is proposed by Jia & Rohe (2015) to make the Lasso estimator sign consistent when the irreresentable condition does not hold for the design matrix with . Let the singular value decomposition of be . The Puffer transformation is defined as . Because in (12), the results of Jia & Rohe (2015) are valid for the transformed Lasso problem in (12). Let and be the preconditioned design matrix and the response vector, respectively. The preconditioned FLSA with the Puffer transformation solves
| (14) |
Because , the solution of the PCD‐FLSA has the explicit form , where . Thus, the solution path of is piecewise linear in , and the hitting times (or slope‐change points in the solution path) are for , where is the smallest value in a vector .
2.3. Modified Path Algorithm for FLSA (mPath‐FLSA)
Recently, Son & Lim (2019) provided a new interpretation of the path algorithm for the one‐dimensional FLSA (1D‐FLSA), which uses a new distance measure for two adjacent blocks to determine if they are fused. Furthermore, they used this interpretation to show the inconsistency of the FLSA in the presence of the stair‐case blocks and proposed a path algorithm with a modified distance measure that resolves this inconsistency.
Specifically, let be the hitting times of the 1D‐FLSA. Further, let be a partition of corresponding to the estimated blocks of for . Son & Lim (2019) developed a new distance measure for two adjacent blocks and defined as
| (15) |
where , , and
| (16) |
Son & Lim (2019) used the distance measure in (15) to show that the hitting time in (9) is equivalent to . This interpretation shows that the 1D‐FLSA cannot fuse two consecutive stair‐case blocks with a finite because the denominator of the distance measure is defined as 0.
The abovementioned phenomenon for two consecutive stair‐case blocks promoted Son & Lim (2019) to propose the modified distance measure that guarantees the consistency of the 1D‐FLSA estimator in identifying true block structures regardless of the stair‐case blocks. The modified distance measure for two consecutive blocks and is defined as
| (17) |
A modified path algorithm is proposed using the modified distance measure with the hitting times defined as:
| (18) |
Theorem 2 of Son & Lim (2019) shows that the 1D‐FLSA with a modified path algorithm consistently identifies the change points and signs of difference between two adjacent blocks. Note that the modified path algorithm guarantees a consistent estimator for identifying true block structures by adopting the modified distance measure , but this modification causes the modified path algorithm to solve a different problem that is not equivalent to the 1D‐FLSA, and the monotone increasing property of the hitting times could be violated (i.e. for some , ). This phenomenon is explained in the next section.
3. NEW INTERPRETATION OF MODIFIED PATH ALGORITHM
In this section, we provide a novel interpretation of the mPath‐FLSA, including the exact target objective function of the mPath‐FLSA. It clarifies the differences between Path‐FLSA and mPath‐FLSA. Moreover, we illustrate a condition that violates the monotone increasing property of the hitting times in mPath‐FLSA.
Suppose we know that a partition of for a given , where is the hitting time of based on the mPath‐FLSA. In this section, we denote as for notational simplicity. Let be a parameter of the block mean at . A localised FLSA problem for a pair of the partition indices of is define as
| (19) |
where is a tuning parameter of the localised FLSA problem for . In Theorem 1, we show that the hitting times of the mPath‐FLSA are equivalent to the minimum hitting times of localised FLSA problems.
Theorem 1
Let be the hitting time of the modified path algorithm with and . Let be the hitting time of (19) for a partition pair . We define , which is the minimum hitting time of for . Then, and .
The necessary and sufficient conditions of the solution using the subgradient approach of are as follows:
(20) where , and is the subdifferential of . Let and be the solutions of the Eq. (20). For , the solution can be represented as and . From , the hitting time is . Similarly, for , the hitting time is . Hence, . Based on the definition of the minimum hitting time , is equivalent to in the modified path algorithm. It proves .
Theorem 1 offers a novel interpretation of mPath‐FLSA. The mPath‐FLSA lacks an overall objective function, and it finds the next fusion block sequentially by solving independent localised FLSA problems for two adjacent blocks, called local fused lasso signal approximator (LFLSA). Henceforth, we refer to the mPath‐FLSA by Son and Lim (2019) as Path‐LFLSA. Moreover, the hitting times for each sub‐problems can be considered as the distance between the two blocks. Consequently, Path‐LFLSA can be considered as a hierarchical clustering algorithm for with distance metric if and if .
Furthermore, this interpretation can help identify conditions that violate the monotonicity in the hitting times of Path‐LFLSA. Theorem 2 shows the conditions for the occurrence of the non‐monotonic sequence terms.
Theorem 2
Let be the hitting time of the Path‐LFLSA with and . Suppose that , and the partition set , where for , , and for . Then, the next hitting time is less than if or . Moreover, the violation can be checked with the solution at . Let . The violation occurs if and only if the following inequality hold for or :
(21)
Due to the definition of , for . After hitting time, the set is defined by the union of and , and the other blocks remain the same as in the previous partition . Thus, the pairs for also remain the same as in the previous partition set , which implies for . Therefore, the next hitting time decreases only when or . Moreover, based on this assumption, we can express the hitting time as . Because , , and , we can express and as and respectively. Applying these observations to and , we obtain the condition (21).
To describe the violation , we consider three independent random variables from the standard normal distribution . Corollary 1 shows that the probability of the violation is when we apply the Path‐LFLSA to , where and is the cumulative distribution function of the standard normal distribution.
Corollary 1
Suppose are random samples from . Let be a standard normal random variable and be the cumulative distribution function of . Let and be the first and the second hitting times of the Path‐LFLSA with , respectively. Then, the probability is equal to . In addition, can be calculated from the univariate conditional expectation .
A proof of Corollary 1 can be found in Appendix A of the supplementary material. The findings of this study reveal that the mPath‐FLSA, also called the Path‐LFLSA, is the clustering of the observed point based on the distance defined by the minimum of the next hitting times of the localised FLSA problems. We suggest using the indices of hitting times to represent the solution path of the Path‐LFLSA instead of directly drawing the solution path along , which avoids falsely split points in the solution path. This phenomenon is illustrated in Section 5.
4. PATHWISE ADAPTIVE FLSA
The Path‐LFLSA motivated us to propose the pathwise adaptive FLSA, which is a weighted FLSA with pathwise adaptive weights and guarantees the monotonicity of the hitting times. Specifically, let be the hitting time and be a partition of at such that for and . For , the pathwise adaptive FLSA (PA‐FLSA) minimises
| (22) |
where
and is a sufficiently large weight that fuses the corresponding two parameters. Equivalently, we can express the PA‐FLSA problem in the following reduced form:
| (23) |
where if and otherwise. Note that is also used in the reduced form to avoid the division by zero besides the numerical instability. For example, we set in our numerical study.
We now investigate the properties of PA‐FLSA. First, we express the solution of the PA‐FLSA in Lemma 1.
Lemma 1
Suppose that there are intervals , , , with , where the fused set is unchanged in each interval. For , the solution of the pathwise adaptive FLSA estimator of (23) is given by
where , , is the subdifferential of , and
The subdifferential of gives the proof of Lemma 1 directly (Bertsekas, 1999).
Based on Lemma 1, we can anticipate the PA‐FLSA to avoid the inconsistency of the FLSA in the presence of the stair‐case blocks. Specifically, for the FLSA, the bias term is zero for the stair‐case blocks (i.e. ), implying that the FLSA fails to move the estimates of the stair‐case blocks. However, for the PA‐FLSA, the for is defined as:
Thus, the PA‐FLSA has a nonzero bias term for the stair‐case blocks unless .
Second, we investigate the monotone fusion property of the PA‐FLSA, which denotes that and remain fused at if and are fused at . The monotone fusion property for the FLSA was proved in Proposition 2 of Friedman et al. (2007). We show that the PA‐FLSA holds the monotone fusion property similar to the FLSA in Proposition 1. However, the PA‐FLSA adopts the weighted fusion penalty with the pathwise adaptive weights.
Proposition 1
In the PA‐FLSA, two parameters that are fused in the solution for are fused for all .
Appendix B of the supplementary material contains a proof of Proposition 1.
Finally, we show that the solution path of the PA‐FLSA is piecewise linear in the given interval and discontinuous at the hitting times, which changes the pathwise adaptive weights in the Theorem 3 mentioned below.
Theorem 3
The solution path of the PA‐FLSA is a piecewise linear function of and discontinuous at the hitting times , which changes the pathwise adaptive weights. The next hitting time can be obtained from the current solution at by
(24)
The partition is unchanged for . Based on Lemma 1, we can represent the derivative of the solution with respect to for a given interval as
Thus, the solution paths of for along are linear in the given interval because and are unchanged. Based on this linearity, for ), we can obtain the solution at using the equation:
where is the minimiser of (23) at and . We can find the next hitting time for a pair of two solution paths from the equation , for , as
Because these hitting times are only valid for the unchanged partition (i.e. there is no fusion for ), the next hitting time should satisfy the following equation:
Consider two PA‐FLSA problems with and at to show the discontinuity of the solution path of the PA‐FLSA. We denote the first problem as the PA‐FLSA( ) and the second problem as the PA‐FLSA( ) to distinguish the two problems. Suppose that and and of the PA‐FLSA( ) are fused at . Let , , and . Based on Lemma 1, we can express the solution of the PA‐FLSA( ) as
Based on this assumption, we can represent the partition into for , , and for . For notational simplicity, in the proof, we use the definition of the pathwise adaptive weight as . Thus, the following equations hold:
Then, by Lemma 1, the solution of the PA‐FLSA( ) can be represented as
Let and . A comparison of the two solutions and can easily spot that the absolute differences between for are nonzero if and . This completes the proof of the discontinuity of the solution path of the PA‐FLSA.
The results in Theorem 3 show that the hitting times of the PA‐FLSA increase monotonically. Unlike the original FLSA, the solution path of the PA‐FLSA is discontinuous at hitting times that change the pathwise adaptive weights. However, this discontinuity does not affect PA‐FLSA's performance in estimating true signal levels and identifying the change points. We will examine these phenomena in the numerical study.
5. NUMERICAL STUDY
This section presents a comprehensive numerical study. First, we numerically investigate the properties we have explored in the previous sections, including the violation of the monotone increasing property for the hitting times of the Path‐LFLSA and the discontinuity of the PA‐FLSA's solution path. Second, we numerically compare the probabilities of the exact pattern recovery of the four methods (Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA) under four scenarios. The first three scenarios are based on Qian & Jia (2016), and the fourth is novel for this study. We also conduct a comparison of the performances of the four methods in estimating the signal levels and identifying the block structures with the optimal tuning parameters chosen by the Bayesian information criterion (BIC) (Schwarz, 1978) and the extended BIC (EBIC) (Chen & Chen, 2008) in Appendix C of the supplementary material.
5.1. Comparison of Whole Solution Paths
In this subsection, we compare the whole solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. We consider , , , and to generate the underlying signal and noisy observations. The true mean value is set to and the noisy observation is generated as . We applied the four methods to obtain the whole solution paths along . The solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are depicted in Figure 1. The solution path of the Path‐LFLSA seems to have a split event around , while the Path‐LFLSA actually has fusion events only. We also report the hitting times of the solution paths from Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA in Table 1 for details.
FIGURE 1.

Solution paths of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for
TABLE 1.
Hitting times of the whole solution paths from the four methods with the noisy observation with
| Method |
|
|
|
|
|
|||||
|---|---|---|---|---|---|---|---|---|---|---|
| Path‐FLSA | 0.0314 | 0.1832 | 0.2056 | 0.5266 | 0.8330 | |||||
| PCD‐FLSA | 0.0314 | 0.2056 | 0.6140 | 0.7328 | 0.7567 | |||||
| Path‐LFLSA | 0.0078 | 0.0514 | 0.1832 | 0.1317 | 0.4165 | |||||
| PA‐FLSA | 0.0314 | 0.0521 | 0.1322 | 0.1838 | 0.2682 |
Table 1 shows that the fourth hitting time is less than the third hitting time for Path‐LFLSA, violating the monotone increasing property of the hitting times. Contrarily, the other solution path algorithms satisfy the monotone increasing property of the hitting times. As demonstrated in Theorem 2, this violation can be checked with the solution at the second hitting time . At , , , . Thus, , , and or in the condition (21). Thus, the following condition (21) holds for :
As mentioned in Section 3, we suggest drawing the solution path of the Path‐LFLSA along indices of its hitting times, which is equivalent to the order of fusion events. We depict the solution paths along and the indices of the hitting times in Figure 2. The solution path along the indices of the hitting times (Figure 2 (b)) is more readable than the solution path along (Figure 2 (a)).
FIGURE 2.

Solution paths of Path‐LFLSA along and indices of hitting times for
Figure 1 also shows that the solution path of the PA‐FLSA has discontinuous points at the hitting times. For example, the values of and in the proof of Theorem 3 are different, where and . Table 1 shows that the hitting times of the PA‐FLSA are monotonically increasing, supporting Theorem 3.
Finally, we compared the fused sets at the hitting times. From Figure 1, the partitions from the Path‐FLSA are obtained as , , , , and for , respectively. Path‐LFLSA and PA‐FLSA have the same partitions , , , , and for , respectively. Thus, we observe that Path‐FLSA, Path‐LFLSA, and PA‐FLSA contain the true partition . Contrarily, the partitions from the PCD‐FLSA are obtained as , , , , and for , respectively. The PCD‐FLSA fails to contain the true partition, and this difference is because of the Puffer transformation in the PCD‐FLSA. Note that the last observation describes the estimates of the methods from the one data set only, not implying that the PCD‐FLSA generally fails to contain the true partition. In the following subsection, we evaluate the exact pattern recovery probabilities estimated by 1000 data sets to compare the general performance of containing the true partition.
5.2. Comparison of Exact Pattern Recovery Probabilities
In this section, we compare the probabilities of the exact pattern recovery of the four methods similar to Qian & Jia (2016), where the exact pattern recovery indicates that the solution path of the given method contains the solution of the true block structures. We considered four scenarios for the underlying block structures. Three scenarios are taken directly from Qian & Jia (2016) to reproduce their results and conduct a fair comparison, in which two scenarios have no stair‐case blocks, and one scenario has stair‐case blocks. Besides, the last scenario for our numerical study was a true block structure with multiple stair‐case blocks. These four scenarios can be divided into two categories based on the existence of the stair‐case blocks, which is equivalent to the failure of the irrepresentable condition of the transformed Lasso in (13). Again, for a fair comparison, we set and consider like Qian & Jia (2016). The noisy observations are from , where and are specified case by case. We generated 1000 data sets and checked the inclusion of the true block structures with the given solution path to measure the probability of the exact pattern recovery. To obtain the confidence intervals of the exact pattern recovery probabilities, we repeat the procedure for estimating the exact pattern recovery 50 times as well.
5.2.1. Two scenarios without stair‐case blocks
First, we considered the two true signal structures and used in Qian & Jia (2016) for the scenarios and , respectively. The two true signal vectors for and are respectively defined as
With these two true signal vectors, the left hand sides (LHS) of the irrepresentable condition of the transformed Lasso are 0.975 and 0.9826 for and , respectively. These values denote that the irrepresentation conditions hold for two signal vectors because there exists a constant such that . Figure 3 depicts the true signals and noisy observations for two scenarios with . Scenarios and have ten change points in the true signal. The differences between and are the lengths of the blocks and signal levels.
FIGURE 3.

True signals and noisy observations for two scenarios and
Figure 4 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for and . We confirmed that the probabilities of the exact pattern recovery of Path‐FLSA and PCD‐FLSA have the same shapes as in the simulation results in Qian & Jia (2016). Moreover, Figure 4 shows an interesting finding that the estimated probabilities of the exact pattern recovery of Path‐LFLSA and PA‐FLSA are almost similar and greater than those of Path‐FLSA and PCD‐FLSA for . Specifically, for and case , the probabilities of the exact pattern recovery for Path‐LFLSA and PA‐FLSA are estimated as 0.422 and 0.458, respectively, while those of Path‐FLSA and PCD‐FLSA are zero. For and , we further observe that PA‐FLSA has a larger pattern recovery probability than Path‐LFLSA for relatively large levels .
FIGURE 4.

Plots of the estimated probabilities of the exact pattern recovery under for and . Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities
5.2.2. Two scenarios with stair‐case blocks
Second, we consider two true signal structures and for the scenarios and , respectively, where is from Qian & Jia (2016) and is a novel case with multiple stair‐case blocks. The true signal vectors and for and are defined as follows:
As in the cases of and , we calculated the LHS of the irrepresentable condition of the transformed Lasso for and . Both the LHS values for and were calculated as 1.0. These LHS values indicate that the irrepresentation conditions failed for two signal vectors because a constant such that does not exist. Figure 5 depicts the true signals and noisy observations for two scenarios and with . Scenarios and have six and ten change points, respectively.
FIGURE 5.

True signals and noisy observations for two scenarios S3 and S4
Figure 6 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for and . We can compare the characteristics of the four methods when the true signal has the stair‐case blocks. First, Path‐FLSA fails totally in finding the exact pattern (i.e. true block structure) for and because the estimated probabilities of the exact pattern recovery of Path‐FLSA for and are zero for all . This observation supports the inconsistency of the FLSA under the existence of the stair‐case blocks. Second, Path‐LFLSA and PA‐FLSA are more robust to the noise level (i.e. error variance) than PCD‐FLSA. The estimated exact pattern recovery probabilities of Path‐LFLSA and PA‐FLSA decreased slower that of PCD‐FLSA. For example, the exact pattern recovery probability of PCD‐FLSA decreased from 1.000 at to 0.001 at while that of Path‐LFLSA and PA‐FLSA decreased from 1.000 and 1.000 at to 0.549 and 0.595 at for , respectively.
FIGURE 6.

Plots of the estimated probabilities of the exact pattern recovery under for and . Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities
Overall, Path‐LFLSA and PA‐FLSA outperformed Path‐FLSA and PCD‐FLSA concerning the exact pattern recovery probability for all cases from to . For Path‐LFLSA and PA‐FLSA, PA‐FLSA (Path‐LFLSA) was better than Path‐LFLSA (PA‐FLSA) for S1, S2, and S4 (S3), but the differences were small.
6. APPLICATION TO COVID‐19 INFECTION IN KOREA
In this section, we apply the four methods Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA to the daily‐confirmed cases of COVID‐19 infection in Korea. We downloaded a dataset of COVID‐19 infection status from the Public Data Portal (http://data.go.kr) by using OpenAPI through https://openapi.data.go.kr/openapi/service/rest/Covid19/getCovid19InfStateJson with the authorised key. The downloaded dataset contained 9 variables, including date of state (stateDt), a time of state (stateTime), and the number of cumulative confirmed cases of COVID‐19 (decideCnt). We set a target period as a period from 03‐01‐2020 to 03/31/2022 and used the cases whose times of state were 0:00. To obtain the daily‐confirmed cases of COVID‐19, we applied the first differencing to the number of cumulative confirmed cases, where the difference between the cumulative confirmed cases of COVID‐19 stated at 0:00 on 03/01/2020 and 03/02/2020 as the daily‐confirmed cases occurred on 03/01/2020.
We considered a logarithmic transformation to stabilise the variance of the observation before applying the four methods because the FLSA‐based procedures are sensitive to the noise level, as shown in Section 5. In addition, the number of daily confirmed cases is dramatically increasing from January 2022 in Korea. For example, the number of daily confirmed cases was 621,204 on March 16, 2022, while the number of daily confirmed patients was 3831 on January 1, 2022. Figure 7 depicts the log‐transformed daily‐confirmed cases of the target period with a gray line.
FIGURE 7.

Plots of the logarithmic transformed daily confirmed cases of COVID‐19 (
), the estimates of Path‐FLSA (
), PCD‐FLSA (
), Path‐LFLSA (
), and PA‐FLSA (
) with the EBIC
To select the optimal tuning parameter, we used the following EBIC proposed by Chen & Chen (2008), which showed better results for the FLSA as described in Appendix C of the supplementary material,
where , for , , and is the number of the estimated blocks at .
Figure 7 depicts the estimates by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. The number of identified change points for Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA were 82, 172, 55, and 39, respectively. As shown in Figure 7 (a), the Path‐FLSA identified many change points in the periods that seems to have trends or stair‐case blocks and also missed several local changes. For example, the Path‐FLSA missed the change points for a local peak from 05/04/2020 to 05/06/2020, while both the Path‐LFLSA and PA‐FLSA identified the change points on 05/03/2020 and 05/06/2020. Figure 7 (b) shows that the PCD‐FLSA estimates seem to have many false‐positive change points and also identify much more change points compared with the others. As shown in the case of having the stair‐case blocks, the pattern recovery probabilities of the Path‐FLSA and PCD‐FLSA decrease very quickly when the noise level increases. With the residual of the fitted models, the estimates of the error standard deviation by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are 0.24, 0.14, 0.18, and 0.20, respectively. Thus, this observation is also consistent with the comparison result of the pattern recovery probability for . From the above observations, we focused on the comparison of the identified change points by the Path‐LFLSA and PA‐FLSA. We also depict the histograms and Q‐Q plots of the residuals of the four methods in Appendix D of the supplementary material. From the figures in Appendix D of the supplementary material, the residuals of the four methods are not exactly following the normal distribution but seem to follow the normal distribution within the interval . It is also worth noting that the FLSA tends to find many change points when the underlying signal has a trend because the FLSA model is adequate for the piecewise constant mean model. For identifying the trend‐change points, we refer to the trend filtering method proposed by Kim et al. (2009), which is more suitable to identify sparse trend‐change points.
Table 2 reports the identified change points by Path‐LFLSA and PA‐FLSA with the corresponding indices, dates, and debiased estimates. The gray rows in Table 2 denote the identified change points by either the Path‐LFLSA or the PA‐FLSA only. There are 32 change points commonly identified by the Path‐LFLSA and PA‐FLSA, and there are also 23 (7) change points identified by the Path‐LFLSA (PA‐FLSA) only. Among the seven change points identified by the PA‐FLSA only, the six change points at 03/07/2020, 08/12/2020, 09/11/2020, 10/24/2020, 01/01/2021 and 11/01/2021 are closed to the six change points identified by the Path‐LFLSA only, where most of the differences are one or two days. The change point by the PA‐FLSA on 01/31/2022 seems to be an intermediate point within the period having a trend. For the change points by the Path‐LFLSA only except for the six change points corresponding to the change points by the PA‐FLSA only, most of the identified change points catch the local peaks.
TABLE 2.
The estimated change points of Path‐LFLSA and PA‐FLSA
| Path‐LFLSA | PA‐LFLSA | Path‐LFLSA | PA‐LFLSA | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Index | Date |
|
Index | Date |
|
Index | Date |
|
Index | Date |
|
||||
| 6 | 03/06/2020 | 535.34 | 306 | 12/31/2020 | 992.99 | ||||||||||
| 7 | 03/07/2020 | 507.43 | 307 | 01/01/2021 | 984.45 | ||||||||||
| 10 | 03/10/2020 | 232.84 | 10 | 03/10/2020 | 199.89 | 315 | 01/09/2021 | 757.46 | 315 | 01/09/2021 | 749.87 | ||||
| 35 | 04/04/2020 | 97.92 | 35 | 04/04/2020 | 97.92 | 322 | 01/16/2021 | 526.01 | 322 | 01/16/2021 | 526.01 | ||||
| 39 | 04/08/2020 | 47.23 | 39 | 04/08/2020 | 47.23 | 351 | 02/14/2021 | 393.59 | |||||||
| 48 | 04/17/2020 | 26.23 | 48 | 04/17/2020 | 26.23 | 355 | 02/18/2021 | 561.75 | |||||||
| 64 | 05/03/2020 | 9.89 | 64 | 05/03/2020 | 9.89 | 372 | 03/07/2021 | 392.68 | |||||||
| 67 | 05/06/2020 | 3.91 | 67 | 05/06/2020 | 3.91 | 401 | 04/05/2021 | 460.51 | 401 | 04/05/2021 | 424.31 | ||||
| 69 | 05/08/2020 | 15.72 | 420 | 04/24/2021 | 663.27 | ||||||||||
| 75 | 05/14/2020 | 30.47 | 468 | 06/11/2021 | 589.20 | ||||||||||
| 86 | 05/25/2020 | 19.05 | 86 | 05/25/2020 | 21.65 | 478 | 06/21/2021 | 444.14 | |||||||
| 112 | 06/20/2020 | 45.73 | 492 | 07/05/2021 | 683.50 | 492 | 07/05/2021 | 599.00 | |||||||
| 117 | 06/25/2020 | 34.80 | 520 | 08/02/2021 | 1437.73 | 520 | 08/02/2021 | 1437.73 | |||||||
| 138 | 07/16/2020 | 51.02 | 571 | 09/22/2021 | 1760.66 | ||||||||||
| 141 | 07/19/2020 | 33.56 | 579 | 09/30/2021 | 2617.28 | ||||||||||
| 147 | 07/25/2020 | 60.68 | 147 | 07/25/2020 | 47.02 | 587 | 10/08/2021 | 2001.65 | |||||||
| 163 | 08/10/2020 | 31.85 | 604 | 10/25/2021 | 1422.00 | ||||||||||
| 165 | 08/12/2020 | 33.91 | 611 | 11/01/2021 | 1784.54 | ||||||||||
| 167 | 08/14/2020 | 85.90 | 167 | 08/14/2020 | 131.79 | 625 | 11/15/2021 | 2134.56 | 625 | 11/15/2021 | 2242.17 | ||||
| 185 | 09/01/2020 | 296.16 | 185 | 09/01/2020 | 296.16 | 639 | 11/29/2021 | 3372.36 | 639 | 11/29/2021 | 3372.36 | ||||
| 194 | 09/10/2020 | 162.48 | 646 | 12/06/2021 | 5002.52 | 646 | 12/06/2021 | 5002.52 | |||||||
| 195 | 09/11/2020 | 159.73 | 665 | 12/25/2021 | 6532.64 | 665 | 12/25/2021 | 6532.64 | |||||||
| 202 | 09/18/2020 | 121.88 | 202 | 09/18/2020 | 119.86 | 688 | 01/17/2022 | 978.03 | 688 | 01/17/2022 | 3978.03 | ||||
| 210 | 09/26/2020 | 87.69 | 695 | 01/24/2022 | 7080.12 | 695 | 01/24/2022 | 7080.12 | |||||||
| 212 | 09/28/2020 | 44.60 | 702 | 01/31/2022 | 16194.46 | ||||||||||
| 234 | 10/20/2020 | 77.29 | 705 | 02/03/2022 | 18074.79 | 705 | 02/03/2022 | 23355.52 | |||||||
| 238 | 10/24/2020 | 79.16 | 709 | 02/07/2022 | 36739.06 | 709 | 02/07/2022 | 36739.06 | |||||||
| 256 | 11/11/2020 | 112.47 | 256 | 11/11/2020 | 115.98 | 716 | 02/14/2022 | 54343.44 | 716 | 02/14/2022 | 54343.44 | ||||
| 261 | 11/16/2020 | 211.76 | 261 | 11/16/2020 | 211.76 | 723 | 02/21/2022 | 99131.02 | 723 | 02/21/2022 | 99131.02 | ||||
| 269 | 11/24/2020 | 333.21 | 269 | 11/24/2020 | 333.21 | 730 | 02/28/2022 | 158836.04 | 730 | 02/28/2022 | 158836.04 | ||||
| 285 | 12/10/2020 | 564.42 | 285 | 12/10/2020 | 564.42 | 737 | 03/07/2022 | 226706.55 | 737 | 03/07/2022 | 226706.55 | ||||
Note: The rows in gray denote the change points found by only one method.
On the other hand, interestingly, most of the commonly identified change points correspond to the period of the social distancing announced by the Ministry of Health and Welfare of Korea. For example, April 4, 2020, corresponds to the end of the intensive social distancing period from March 22, 2020, to April 5, 2020. In addition, the time periods between change points are similar to the periods of the social distancing policy by the Ministry of Health and Welfare of Korea, where the policy periods are usually two or three weeks. Finally, we should address that the sequence of the daily confirmed cases is observed through time, and it seems to have time‐dependent trends. Thus, the main assumptions of the FLSA for the model selection consistency is hardly satisfied, and then the FLSA‐based methods do not guarantee the model selection consistency. For example, among the commonly identified change points, the change points within four periods (03/10/2020 ∼ 05/06/2020, 11/16/2020 ∼ 12/10/2020, 11/15/2021∼ 12/25/2025, 02/03/2022 ∼ 03/07/2022) seem to be intermediate points within the periods having either an increasing or a decreasing trend.
However, this COVID‐19 spread example addresses that the Path‐LFLSA and PA‐FLSA are still applicable to find the main change points caused by an external event in a stable period (i.e. a period with a low trend effect). For example, both Path‐LFLSA and PA‐FLSA succeeded in finding the beginning day of the second wave of the COVID‐19 pandemic in Korea on August 15, 2020, when a mass rally was held near Gwanghwamun Square in Seoul, where the identified change point on August 14, 2020, in Table 2 denotes that the underlying mean value was changed on August 15, 2020. In addition, the identified change point on November 11, 2020, is close to the beginning day (11/4/2020) of the third wave reported in Seong et al. (2021) and the identified change point at 07/05/2021 is also close to the beginning day (7/7/2021) of the fourth wave 1. Recently, the number of daily confirmed cases has been rapidly increasing, and the fifth wave had was begun on 01/26/2022 2. Both Path‐LFLSA and PA‐FLSA found the change points on 01/24/2022, which is close to the beginning day of the fifth wave. The identified change points related to the second to the fifth waves are highlighted in red in Table 2.
7. CONCLUSION
In this study, we provide a new interpretation of the modified path algorithm for the FLSA by discovering the exact optimisation problems corresponding to the modified path algorithm, called Path‐LFLSA. Our discovery demonstrates that the modified path algorithm's hitting times are not monotonically increasing, and the violation of the monotone increasing property for the next hitting time can be verified by comparing the solution from the previous hitting time. We propose a pathwise adaptive FLSA with a weighted fusion penalty to recover the monotonicity of the hitting times. The comprehensive numerical study illustrates the whole solution paths of the four methods, including three existing ones and the proposed PA‐FLSA, and it also shows that the Path‐LFLSA and PA‐FLSA are less sensitive to noise levels for pattern recovery than the Path‐FLSA and PCD‐FLSA. Furthermore, our numerical study in Appendix C of the supplementary material provides a practical guideline for choosing the optimal tuning parameters of the Path‐LFLSA and PA‐FLSA that outperform Path‐FLSA and PCD‐FLSA to identify the true block structures and estimate the true signal levels. The application of Path‐LFLSA and PA‐FLSA with the optimal tuning parameter selection by EBIC to the number of daily‐confirmed cases of COVID‐19 infection found the change points related to the beginning days of the COVID‐19 pandemic waves from the second to the fifth in Korea.
Supporting information
insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf
ACKNOWLEDGEMENTS
W. Son's research is supported by the National Research Foundation of Korea (No. 2020R1F1A1A01051039), J. Lim's research is supported by the National Research Foundation of Korea (NRF‐2021R1A2C1010786), and D. Yu's research is supported by the National Research Foundation of Korea (NRF‐2022R1A5A7033499) and Inha University Research Grant.
Son, W. , Lim, J. , and Yu, D. (2022) Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea. International Statistical Review, 10.1111/insr.12521.
Footnotes
Korea officially in COVID‐19 fourth wave, an article in Korea Herald available at https://www.koreaherald.com/view.php?ud%3D20210707000868
Daily COVID‐19 Cases Exceed 13,000, 5th Wave Beginning, an article in KBS World available at https://world.kbs.co.kr/service/news_view.htm?lang%3De&Seq_Code%3D167226
REFERENCES
- Bertsekas, D. (1999). Nonlinear Programming. Athena Scientific.
- Bradely Efron, B. , Hastie, T. , Johnstone, I. & Tibshirani, R. (2004). Least angle regression. Ann. Stat., 32, 407–499. [Google Scholar]
- Chen, J & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. [Google Scholar]
- Friedman, J. , Hastie, T. , Hoefling, H. & Tibshirani, R. (2007). Pathwise coordinate optimization. Ann. Appl. Stat., 1, 302–332. [Google Scholar]
- Fryzlewicz, P. (2014). Wild binary segmentation for multiple change‐point detection. Ann. Stat., 42(6), 2243–2281. [Google Scholar]
- Hoefling, H. (2010). A path algorithm for the fused Lasso signal approximator. J. Comput. Graph. Stat., 19, 984–1006. [Google Scholar]
- Jia, J. & Rohe, K. (2015). Preconditioning the lasso for sign consistency. Electr. J. Stat., 9, 1150–1172. [Google Scholar]
- Kim, S.J. , Koh, K. , Boyd, S. & Gorinevsky, D. (2009). trend filtering. SIAM Review, 51(2), 339–360. [Google Scholar]
- Lin, K. , Sharpnack, J. , Rinaldo, A. & Tibshirani, R.J. 2016. Approximate recovery in changepoint problems, from estimation error rates. arXiv preprint, arXiv:1606.06746.
- Olshen, A.B. , Venkatraman, E. , Lucito, R. & Wigler, M. (2004). Circular binary segmentation for the analysis of array‐based dna copy number data. Biostatistics, 5(4), 557–572. [DOI] [PubMed] [Google Scholar]
- Qian, J. & Jia, J. (2016). On stepwise pattern recovery of the fused Lasso. Comput. Stat. Data Anal., 94, 221–237. [Google Scholar]
- Rinaldo, A. (2009). Properties and refinements of the fused lasso. Ann. Stat., 37, 2922–2952. [Google Scholar]
- Rinaldo, A. 2014. Corrections to properties and refinements of the fused lasso. available at https://www.stat.cmu.edu/∼arinaldo/Fused_Correction.png
- Schwarz, G. (1978). Estimating the dimension of a model. Ann. Stat., 6, 461–464. [Google Scholar]
- Seong, H. , Hyun, H.J. , Yun, J.G. , Noh, J.Y. , Cheong, H.J. , Kim, W.J. & Song, J.Y. (2021). Comparison of the second and third waves of the COVID‐19 pandemic in South Korea: Importance of early public health intervention. Int. J. Infectious Disease, 104, 742–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Son, W. & Lim, J. (2019). Modified path algorithm of fused Lasso signal approximator for consistent recovery of change points. J. Stat. Plan. Inference, 200, 223–238. [Google Scholar]
- Tibshirani, R. , Saunders, M. , Rosset, S. , Zhu, J. & Knight, K. (2005). Sparsity and smoothness via the fused lasso. J. R. Stat. Soc.: Ser. B (Stat. Methodol.), 67, 91–108. [Google Scholar]
- Tibshirani, R.J. & Taylor, J. (2011). The solution path of the generalized lasso. Ann. Stat., 39(3), 1335–1371. [Google Scholar]
- Yao, Y.C. (1988). Estimating the number of change‐points via Schwarz' criterion. Stat. Probab. Lett., 6(3), 181–189. [Google Scholar]
- Yao, Y.C. & Au, S.T. (1989). Least‐squares estimation of a step function. Sankhyā Indian J. Stat. Ser. A, 51(3), 370–381. [Google Scholar]
- Zhao, P. & Yu, B. (2006). On model selection consistency of lasso. J. Mach. Learn. Res., 7, 2541. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf
