Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea

Won Son; Johan Lim; Donghyeon Yu

doi:10.1111/insr.12521

. 2022 Oct 19:10.1111/insr.12521. Online ahead of print. doi: 10.1111/insr.12521

Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea

Won Son ¹, Johan Lim ², Donghyeon Yu ^3,^✉

PMCID: PMC9874640 PMID: 36710888

Summary

The fused lasso signal approximator (FLSA) is a smoothing procedure for noisy observations that uses fused lasso penalty on unobserved mean levels to find sparse signal blocks. Several path algorithms have been developed to obtain the whole solution path of the FLSA. However, it is known that the FLSA has model selection inconsistency when the underlying signals have a stair‐case block, where three consecutive signal blocks are either strictly increasing or decreasing. Modified path algorithms for the FLSA have been proposed to guarantee model selection consistency regardless of the stair‐case block. In this paper, we provide a comprehensive review of the path algorithms for the FLSA and prove the properties of the recently modified path algorithms' hitting times. Specifically, we reinterpret the modified path algorithm as the path algorithm for local FLSA problems and reveal the condition that the hitting time for the fusion of the modified path algorithm is not monotone in a tuning parameter. To recover the monotonicity of the solution path, we propose a pathwise adaptive FLSA having monotonicity with similar performance as the modified solution path algorithm. Finally, we apply the proposed method to the number of daily‐confirmed cases of COVID‐19 in Korea to identify the change points of its spread.

Keywords: Change points, fused lasso signal approximator, modified path algorithm, pathwise adaptive weight, solution path

1. INTRODUCTION

Consider observations ${y_{i}}_{i = 1}^{n}$ from a model

y_{i} = μ_{i} + ϵ_{i},

(1)

where $ϵ_{i} s$ are independently and identically from a distribution with mean 0 and variance $σ_{n}^{2}$ . Here, we assume that true underlying signals have block structures with $μ_{1} = \dots = μ_{j_{1}} \neq μ_{j_{1} + 1} = \dots = μ_{j_{K}} \neq μ_{j_{K} + 1} = \dots = μ_{n}$ . Under this assumption, it is essential to identify the unknown block structures to estimate the signal means. This problem is known as multiple‐change‐point detection. The literature contains various methods on multiple‐change‐point detection, including best subset selection (Yao, 1988; Yao & Au, 1989), circular binary segmentation (CBS) (Olshen et al., 2004), and wild binary segmentation (WBS) (Fryzlewicz, 2014).

This study focuses on the fused lasso signal approximator (FLSA) that finds sparse signal blocks using the fused lasso penalty (Tibshirani et al., 2005) on underlying mean levels. The FLSA obtains the signal estimate by minimising

f_{S F L} (μ; y, λ_{1}, λ_{2}) = \frac{1}{2} ‖ y - μ ‖_{2}^{2} + λ_{1} ‖ μ ‖_{1} + λ_{2} ‖ μ ‖_{TV},

(2)

where $y = {(y_{1}, \dots, y_{n})}^{T}$ , $μ = {(μ_{1}, \dots, μ_{n})}^{T}$ , $‖ μ ‖_{1} = \sum_{j = 1}^{n} | μ_{j} |$ is the ℓ ₁‐norm of $μ$ , and $‖ μ ‖_{TV} = \sum_{j = 2}^{n} | μ_{j} - μ_{j - 1} |$ is the total‐variation norm of $μ$ . Given $λ_{1}$ and $λ_{2}$ , the FLSA obtains the sparse block solution for $μ$ , providing the estimates of both signal levels and block structures. As shown in Lemma A.1 in Friedman et al. (2007), the sparse estimate can be obtained directly from the minimiser ${\hat{μ}}_{(0, λ_{2})}$ of $f (μ; y, 0, λ_{2})$ by the soft‐thresholding operator ${\hat{μ}}_{(λ_{1}, λ_{2})} = {Soft}_{λ_{1}} ({\hat{μ}}_{(0, λ_{2})})$ , where ${Soft}_{a} (x) = {(sign (x_{i}) \max (| x_{i} | - a, 0))}_{i = 1}^{n}$ for $x \in ℝ^{n}$ .

Furthermore, the FLSA has the model selection consistency (i.e. exact identification of the true block structure) when the underlying true block structure has no stair‐case blocks, where a stair‐case block denotes that three consecutive blocks that are either strictly increasing or decreasing (Rinaldo, 2009, 2014; Qian & Jia, 2016). The FLSA estimator is inconsistent in identifying the true signal blocks when the true block structure contains any stair‐case blocks. The preconditioned FLSA via Puffer transformation (Qian & Jia, 2016) and the modified path algorithm for FLSA (Son & Lim, 2019) are proposed to resolve this inconsistency. Recently, the error analysis for the FLSA has been done in literature (Lin et al., 2016).

Besides the theoretical properties of the FLSA, it is critical to obtain its solution efficiently. The entire solution path of the FLSA can be obtained using either the path algorithm for the FLSA proposed by Hoefling (2010) or the path algorithm for the generalised lasso by Tibshirani & Taylor (2011). These two solution path algorithms solve the problem (2) but still show the model selection inconsistency for identifying true blocks with stair‐case blocks. Contrarily, two modified path algorithms have been developed recently to resolve the model selection inconsistency of the FLSA. The preconditioned FLSA with the Puffer transformation (Qian & Jia, 2016) converts the original problem into a Lasso problem with reparameterisation and applies the Puffer transformation. Essentially, the solution path algorithm for the preconditioned FLSA is based on the path algorithm for Lasso (Bradely Efron et al., 2004). The modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019) reinterprets the FLSA path algorithm using the distances between two adjacent blocks and proposes a new distance criterion that guarantees the model selection consistency regardless of a stair‐case block. Note that the original study does not provide the overall objective function of mPath‐FLSA.

Our main contributions through this study are as follows. First, we provide a comprehensive review of the FLSA path algorithms and prove the properties of the hitting times of the recently modified path algorithm. Specifically, we provide an explicit form of the objective function of mPath‐FLSA. Furthermore, we apply the new representation to show that the hitting times of mPath‐FLSA can fail to increase monotonically with respect to $λ_{2}$ . For example, we suppose that there are four observations, $y_{1} = - 0.032$ , $y_{2} = 0.787$ , $y_{3} = - 0.122$ , and $y_{4} = 0.207$ . The hitting times for the FLSA solution path algorithm are $(λ_{2}^{(0)}, \dots, λ_{2}^{(3)}) = (0,0 . 1097,0 . 2731,0 . 3352)$ , which satisfy the monotone property (i.e. $λ_{2}^{(0)} \leq \dots \leq λ_{2}^{(3)}$ ). However, the hitting times of mPath‐FLSA are calculated as $(λ_{2}^{(0)}, \dots, λ_{2}^{(3)}) = (0,0 . 0822,0 . 2049,0 . 1676)$ , which are not monotone in $λ_{2}$ . Section 3 provides additional details. Second, we propose a pathwise adaptive FLSA and its solution path algorithm to resolve the violation of the monotonicity of the solution path along $λ_{2}$ . The proposed method adopts the weighted fusion penalty terms $\sum_{j = 2}^{n} w_{j}^{(l)} | μ_{j} - μ_{j - 1} |$ , which are adaptively defined by the solutions at the hitting times ${(λ_{2}^{(l)})}_{l = 0}^{n - 1}$ on the solution path along $λ_{2}$ . Finally, we provide a comprehensive numerical comparison of the existing path algorithms with the proposed algorithm. We compare four methods based on the entire solution path along $λ_{2}$ , the exact pattern recovery probability, and the estimation performance of the signal levels and block structures concerning the selection of tuning parameters.

The remainder of this paper is organised as follows. Section 2 contains a brief review of the existing solution path algorithms related to the FLSA. Section 3 presents a new interpretation of mPath‐FLSA besides the conditions for the non‐monotonicity of the solution path along $λ_{2}$ . The pathwise adaptive FLSA and its solution path algorithm that satisfies the monotone property in the solution path along $λ_{2}$ are proposed in Section 4. In Section 5, a numerical study for measuring the probability of the exact recovery was conducted using the proposed algorithm and the other existing algorithms. Moreover, we compared the estimation performance with the optimal tuning parameter chosen by the Bayesian information criterion (BIC) and the extended BIC (EBIC). In Section 6, we apply the proposed method and existing FLSA path algorithms to the number of daily‐confirmed cases of COVID‐19 in Korea to identify events that affect the COVID‐19 spread. Finally, we conclude the paper with some remarks.

2. EXISTING SOLUTION PATH ALGORITHMS FOR FLSA

In this section, we review three existing solution path algorithms related to the FLSA; the path algorithm for the FLSA (Path‐FLSA) by Hoefling (2010), the path algorithm for the preconditioned FLSA (Path‐PFLSA) by Qian & Jia (2016), the modified path algorithm for the FLSA (mPath‐FLSA) by Son & Lim (2019). Henceforth, to focus on the identification of true mean blocks, we consider the FLSA with $λ_{1} = 0$ as follows:

f_{F L} (μ; y, λ_{2}) = \frac{1}{2} ‖ y - μ ‖_{2}^{2} + λ_{2} \sum_{j = 2}^{n} | μ_{j} - μ_{j - 1} | .

(3)

Note that we omit the review of the solution path algorithm for the generalised Lasso by Tibshirani & Taylor (2011) because its solution path for the FLSA is the same as Path‐FLSA.

2.1. Path Algorithm for the FLSA (Path‐FLSA)

The FLSA path algorithm proposed by Hoefling (2010) provides a whole solution path of the FLSA along $λ_{2}$ based on the fact that the solution path for $λ_{2}$ is a piece‐wise linear in $λ_{2}$ . Originally, Hoefling (2010) proposes a path algorithm for the generalised FLSA problem

\min_{μ} f_{F L} (μ; y, λ_{2}) = \frac{1}{2} ‖ y - μ ‖_{2}^{2} + λ_{2} \sum_{(i, j) \in E} | μ_{i} - μ_{j} |,

(4)

where $E$ denotes a set of index pairs that correspond to fusion penalty terms. Here, we focus on the path algorithm for the one‐dimensional FLSA as in (3) (i.e. $E = {(1,2), (2,3), \dots, (p - 1, p)}$ ).

Specifically, let ${{\hat{B}}_{k}}_{k = 1}^{\hat{J} (λ_{2})}$ be a partition of ${1, \dots, n}$ defined from the solution ${\hat{μ}}^{F L} (λ_{2})$ of the FLSA at $λ_{2}$ , where $\hat{J} (λ_{2}) - 1$ is the number of change points in ${\hat{μ}}^{F L} (λ_{2})$ . Further, we let ${λ_{2}^{(l)}}_{l = 0}^{n - 1}$ be hitting times on the solution path, with two adjacent sets in the partition defined by the previous hitting time fused on each of them. For example, there is an index $k \in {1, \dots, n - l + 1}$ on $λ_{2} = λ_{2}^{(l)}$ , such that ${\hat{B}}_{k - 1}^{(l - 1)}$ and ${\hat{B}}_{k}^{(l - 1)}$ are fused as ${\hat{B}}_{k - 1}^{(l)} = {\hat{B}}_{k - 1}^{(l - 1)} \cup {\hat{B}}_{k}^{(l - 1)}$ . Thus, the solution ${\hat{μ}}^{F L} (λ_{2})$ for $λ_{2} = λ_{2}^{(l)}$ can be expressed explicitly as:

{\hat{μ}}_{i}^{F L} (λ_{2}) = \sum_{k = 1}^{n - l} {\hat{ν}}_{k}^{(l)} (λ_{2}) I (i \in {\hat{B}}_{k}^{(l)}),

(5)

where ${\hat{b}}_{k}^{(l)} = | {\hat{B}}_{k}^{(l)} |$ , ${\hat{ν}}_{k}^{(l)} (λ_{2}) = \frac{1}{{\hat{b}}_{k}^{(l)}} \sum_{i \in {\hat{B}}_{k}^{(l)}} y_{i} + {\hat{c}}_{k} (λ_{2}) = {\bar{y}}_{k} + {\hat{c}}_{k} (λ_{2})$ ,

{\hat{c}}_{k} (λ_{2}) = \{\begin{matrix} \frac{2 λ_{2}}{{\hat{b}}_{k}^{(l)}} & if & {\hat{ν}}_{k} (λ_{2}) < {\hat{ν}}_{k - 1} (λ_{2}), {\hat{ν}}_{k + 1} (λ_{2}) < {\hat{ν}}_{k} (λ_{2}) \\ - \frac{2 λ_{2}}{{\hat{b}}_{k}^{(l)}} & if & {\hat{ν}}_{k} (λ_{2}) > {\hat{ν}}_{k - 1} (λ_{2}), {\hat{ν}}_{k + 1} (λ_{2}) > {\hat{ν}}_{k} (λ_{2}) \\ 0 & if & ({\hat{ν}}_{k} (λ_{2}) - {\hat{ν}}_{k - 1} (λ_{2})) ({\hat{ν}}_{k + 1} (λ_{2}) - {\hat{ν}}_{k} (λ_{2})) > 0 \end{matrix} .

Now, suppose that we know a partition ${{\hat{B}}_{k}^{(l)}}_{k = 1}^{n - l}$ by ${\hat{μ}}^{F L} (λ_{2}^{(l)})$ , such that $⋃ {\hat{B}}_{k}^{(l)} = {1, \dots, n}$ , and consider $λ_{2} \in (λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ , where there is no fusion in the FLSA estimate. The FLSA solution satisfies the following equation:

\frac{\partial f}{\partial ν_{k}} = {\hat{b}}_{k}^{(l)} ν_{k} - \sum_{i \in {\hat{B}}_{k}^{(l)}} y_{i} + λ_{2} (\partial | ν_{j} - ν_{j - 1} | + \partial | ν_{j} - ν_{j + 1} |) = 0,

(6)

where $\partial | x |$ is a subdifferential of $| x |$ , which is defined as $\partial | x | = 1$ if $x > 0$ , −1 if $x < 0$ , and $z \in [- 1,1]$ if $x = 0$ . We can easily obtain the derivative of ${\hat{ν}}_{k}$ with respect to $λ_{2}$ from (6) as

\frac{\partial {\hat{ν}}_{k}}{\partial λ_{2}} = - \frac{\partial | {\hat{ν}}_{k} - {\hat{ν}}_{k - 1} | + \partial | {\hat{ν}}_{k} - {\hat{ν}}_{k + 1} |}{{\hat{b}}_{k}^{(l)}} .

(7)

Thus, ${\hat{ν}}_{k}$ is linear in $λ_{2} \in (λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ because the numerator and denominator of (7) are unchanged for the given interval $(λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ . This property shows that the FLSA solution path is piecewise linear in $λ_{2}$ .

Given the solution ${\hat{ν}}_{k}$ on the hitting time $λ_{2}^{(l)}$ and the derivative $\partial {\hat{ν}}_{k} / \partial λ_{2}$ , the solution ${\hat{ν}}_{k} (λ_{2})$ for $λ_{2} \in (λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ can be easily obtained as

{\hat{ν}}_{k} (λ_{2}) = {\hat{ν}}_{k} (λ_{2}^{(l)}) + \frac{\partial {\hat{ν}}_{k}}{\partial λ_{2}} δ,

(8)

where $δ = λ_{2} - λ_{2}^{(l)}$ . Based on (8), the next hitting time is defined as

λ_{2}^{(l + 1)} = \min_{1 \leq k \leq n - l - 1} h_{k, k + 1} (λ_{2}^{(l)}) = \min_{1 \leq k \leq n - l - 1} \frac{{\hat{ν}}_{k + 1} (λ_{2}^{(l)}) - {\hat{ν}}_{k} (λ_{2}^{(l)})}{\frac{\partial {\hat{ν}}_{k}}{\partial λ_{2}} - \frac{\partial {\hat{ν}}_{k + 1}}{\partial λ_{2}}} + λ_{2}^{(l)}

(9)

Given the Eqs. 8 and (9), the Path‐FLSA starts with $λ_{2}^{(0)} = 0$ and sequentially calculates the hitting times. The Path‐FLSA stores all solutions at hitting times $(λ_{2}^{(0)}, \dots, λ_{2}^{(n - 1)})$ and their derivatives to obtain the solution for any $λ_{2}$ with (8). Note that the FLSA solution path for generalised fusion penalty $λ_{2} \sum_{(i, j) \in E} | μ_{i} - μ_{j} |$ could be split for some edge set $E$ . For example, two adjacent solutions ${\hat{μ}}_{i}^{(l - 1)}$ and ${\hat{μ}}_{j}^{(l - 1)}$ for $(i, j) \in E$ are fused at $λ_{2}^{(l)}$ (i.e. ${\hat{μ}}_{i}^{(l - 1)} \neq {\hat{μ}}_{j}^{(l - 1)}$ and ${\hat{μ}}_{i}^{(l)} = {\hat{μ}}_{j}^{(l)}$ ) and split at $λ_{2}^{(t)}$ for some $t > l$ (i.e. ${\hat{μ}}_{i}^{(t - 1)} = {\hat{μ}}_{j}^{(t - 1)}$ and ${\hat{μ}}_{i}^{(t)} \neq {\hat{μ}}_{j}^{(t)}$ ) with the generalised fusion penalty. However, the one‐dimensional FLSA (i.e. $E$ is an edge set ${(1,2), (2,3), \dots, (n - 1, n)}$ of a chain graph) only has hitting events on the solution path, as shown by Friedman et al. (2007) in Proposition 2.

2.2. Preconditioned FLSA with Puffer Transformation (PCD‐FLSA)

As explained in the Introduction, the FLSA faces the model selection inconsistency due to stair‐case blocks in the underlying signals. Qian & Jia (2016) connect the FLSA model selection inconsistency to the irrepresentable condition of the Lasso problem (Zhao & Yu, 2006) by the reparameterisation of the FLSA. To obtain the model selection consistency, Qian & Jia (2016) applied the Puffer transformation introduced by Jia & Rohe (2015) to the reparameterised FLSA, which is called the preconditioned FLSA with Puffer transformation (PCD‐FLSA).

Specifically, consider the reparameterisation of $μ$ with $θ$ as follows:

(μ_{1} = θ_{1}, μ_{2} = θ_{1} + θ_{2}, \dots, μ_{n} = \sum_{j = 1}^{n} θ_{j}) \equiv μ = A θ,

(10)

where $A$ is a lower triangular matrix with nonzero elements equal to one. This reparameterisation can represent the objective function of the FLSA as

f_{L} (θ; λ_{2}) = \frac{1}{2} ‖ y - A θ ‖_{2}^{2} + λ_{2} ‖ θ_{[2 : n]} ‖_{1},

(11)

where $θ_{[2 : n]} = {(θ_{2}, \dots, θ_{n})}^{T} \in R^{n - 1}$ . The solution of (11) can be obtained by

{\hat{θ}}_{[2 : n]} (λ_{2}) = \underset{θ_{[2 : n]}}{argmin} \frac{1}{2} ‖ \tilde{y} - \tilde{X} θ_{[2 : n]} ‖_{2}^{2} + λ_{2} ‖ θ_{[2 : n]} ‖_{1}, and {\hat{θ}}_{1} (λ_{2}) = \bar{y} - {\bar{x}}^{T} {\hat{θ}}_{[2 : n]} (λ_{2}),

(12)

where $\bar{y} = (1 / n) \sum_{i = 1}^{n} y_{i}$ , $\tilde{y} = y - \bar{y} 1_{n}$ , $1_{n}$ is an $n$ ‐dimensional vector of ones, $\tilde{X} = [x_{1} - {\bar{x}}_{1} 1_{n}, \dots, x_{n - 1} - {\bar{x}}_{n - 1} 1_{n}] = X - 1_{n} {\bar{x}}^{T}$ , $\bar{x} = {({\bar{x}}_{1}, \dots, {\bar{x}}_{n - 1})}^{T}$ , ${\bar{x}}_{j} = 1_{n}^{T} x_{j} / n$ , and $X = A_{[1 : n, 2 : n]} = [x_{1}, \dots, x_{n - 1}]$ , which denotes that $X$ is an $n \times (n - 1)$ matrix defined by removing the first column of $A$ . Henceforth, we refer to the reparameterised FLSA problem (12) as the transformed Lasso.

Based on the reparameterisation, the pattern recovery for $μ$ using the FLSA is equivalent to the sign consistency of the Lasso estimator for $θ_{[2 : n]}$ . For a linear model $\tilde{y} = \tilde{X} θ_{[2 : n]} + ϵ$ with $E (ϵ) = 0$ , the conditions for the sign consistency of the Lasso estimator is well understood. One of the necessary conditions of the sign consistency of the Lasso estimator is the irrepresentable condition (Zhao & Yu, 2006) defined as follows:

‖ {\tilde{X}}_{S^{c}}^{T} {\tilde{X}}_{S} {({\tilde{X}}_{S}^{T} {\tilde{X}}_{S})}^{- 1} sign ({(θ_{[2 : n]})}_{S}) ‖_{\infty} \leq 1 - η for some η \in (0,1],

(13)

where $S = {j : {(θ_{[2 : n]})}_{j} \neq 0}$ . Theorem 2 of Qian & Jia (2016) shows that the irrepresentable condition (13) holds if and only if one of the following two conditions holds.

(1)
For a set of change points ${j_{1}, \dots, j_{K}}$ , $K = 1$ or $\max_{1 \leq t < K} (j_{t + 1} - j_{t}) = 1$ .
(2)
The underlying signal has no stair‐case blocks. That is, $(μ_{j_{t}} - μ_{j_{t - 1}}) (μ_{j_{t + 1}} - μ_{j_{t}}) < 0$ .

The abovementioned condition (2) shows the inconsistency of the FLSA in identifying true change points when underlying signals have stair‐case blocks.

Qian & Jia (2016) proposed a preconditioned FLSA with a Puffer transformation to resolve the inconsistency of the FLSA. The Puffer transformation for Lasso is proposed by Jia & Rohe (2015) to make the Lasso estimator sign consistent when the irreresentable condition does not hold for the design matrix $X \in ℝ^{n \times p}$ with $n \geq p$ . Let the singular value decomposition of $\tilde{X}$ be $\tilde{X} = U D V^{T}$ . The Puffer transformation is defined as $F_{n \times n} = U D^{- 1} U^{T}$ . Because $\tilde{X} \in ℝ^{n \times (n - 1)}$ in (12), the results of Jia & Rohe (2015) are valid for the transformed Lasso problem in (12). Let $Z = F \tilde{X}$ and $a = F \tilde{y}$ be the preconditioned design matrix and the response vector, respectively. The preconditioned FLSA with the Puffer transformation solves

\min_{b} \frac{1}{2} ‖ a - Z b ‖_{2}^{2} + λ_{2} ‖ b ‖_{1} .

(14)

Because $Z^{T} Z = I_{n}$ , the solution of the PCD‐FLSA has the explicit form $\hat{b} (λ_{2}) = {Soft}_{λ_{2}} (Z^{T} a)$ , where ${Soft}_{a} (x) = {(sign (x_{i}) \max (| x_{i} | - a, 0))}_{i = 1}^{n}$ . Thus, the solution path of $\hat{b} (λ_{2})$ is piecewise linear in $λ_{2}$ , and the hitting times (or slope‐change points in the solution path) are $λ_{2, P C D - F L S A}^{(i)} = | Z^{T} a |_{(i)}$ for $i = 1, \dots, n - 1$ , where $x_{(i)}$ is the $i th$ smallest value in a vector $x$ .

2.3. Modified Path Algorithm for FLSA (mPath‐FLSA)

Recently, Son & Lim (2019) provided a new interpretation of the path algorithm for the one‐dimensional FLSA (1D‐FLSA), which uses a new distance measure for two adjacent blocks to determine if they are fused. Furthermore, they used this interpretation to show the inconsistency of the FLSA in the presence of the stair‐case blocks and proposed a path algorithm with a modified distance measure that resolves this inconsistency.

Specifically, let $λ_{2}^{(0)} = 0 < λ_{2}^{(1)} < \dots, < λ_{2}^{(n - 1)}$ be the hitting times of the 1D‐FLSA. Further, let ${B_{k}^{(l)}}_{k = 1}^{n - l}$ be a partition of ${1, \dots, n}$ corresponding to the estimated blocks of $\hat{μ} (λ_{2})$ for $λ_{2} \in [λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ . Son & Lim (2019) developed a new distance measure $d ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)})$ for two adjacent blocks ${\hat{B}}_{k - 1}^{(l)}$ and ${\hat{B}}_{k}^{(l)}$ defined as

d ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)}) = \frac{| {\bar{y}}_{k - 1}^{(l)} - {\bar{y}}_{k}^{(l)} |}{\frac{1}{{\hat{b}}_{k - 1}^{(l)}} s ({\bar{y}}_{k - 1}^{(l)}) + \frac{1}{{\hat{b}}_{k}^{(l)}} s ({\bar{y}}_{k}^{(l)})},

(15)

where ${\hat{b}}_{k}^{(l)} = | {\hat{B}}_{k}^{(l)} |$ , ${\bar{y}}_{k}^{(l)} = (1 / {\hat{b}}_{k}^{(l)}) \sum_{i \in {\hat{B}}_{k}^{(l)}} y_{i}$ , and

s ({\bar{y}}_{k}^{(l)}) = \{\begin{array}{ll} | (\partial | {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k - 1}^{(l)} | + \partial | {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k + 1}^{(l)} |) | / 2 & for k = 2, \dots, n - l - 1 \\ | (\partial | {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k - 1}^{(l)}) |) | / 2 & for k = n - l \\ | (\partial | {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k + 1}^{(l)} |) | / 2 & for k = 1 \end{array} .

(16)

Son & Lim (2019) used the distance measure in (15) to show that the hitting time $λ_{2}^{(l + 1)}$ in (9) is equivalent to $\frac{1}{2} \min_{k} d ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)})$ . This interpretation shows that the 1D‐FLSA cannot fuse two consecutive stair‐case blocks with a finite $λ_{2}$ because the denominator of the distance measure is defined as 0.

The abovementioned phenomenon for two consecutive stair‐case blocks promoted Son & Lim (2019) to propose the modified distance measure that guarantees the consistency of the 1D‐FLSA estimator in identifying true block structures regardless of the stair‐case blocks. The modified distance measure $δ ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)})$ for two consecutive blocks ${\hat{B}}_{k - 1}^{(l)}$ and ${\hat{B}}_{k}^{(l)}$ is defined as

δ ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)}) = \frac{| {\bar{y}}_{k - 1}^{(l)} - {\bar{y}}_{k}^{(l)} |}{1 / {\hat{b}}_{k - 1}^{(l)} + 1 / {\hat{b}}_{k}^{(l)}} .

(17)

A modified path algorithm is proposed using the modified distance measure with the hitting times defined as:

λ_{2, m P a t h - F L S A}^{(l + 1)} = \frac{1}{2} \min_{k} δ ({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)}) for l = 0, \dots, n - 2 .

(18)

Theorem 2 of Son & Lim (2019) shows that the 1D‐FLSA with a modified path algorithm consistently identifies the change points and signs of difference between two adjacent blocks. Note that the modified path algorithm guarantees a consistent estimator for identifying true block structures by adopting the modified distance measure $δ (\cdot, \cdot)$ , but this modification causes the modified path algorithm to solve a different problem that is not equivalent to the 1D‐FLSA, and the monotone increasing property of the hitting times could be violated (i.e. for some $l$ , $λ_{2}^{(l)} > λ_{2}^{(l + 1)}$ ). This phenomenon is explained in the next section.

3. NEW INTERPRETATION OF MODIFIED PATH ALGORITHM

In this section, we provide a novel interpretation of the mPath‐FLSA, including the exact target objective function of the mPath‐FLSA. It clarifies the differences between Path‐FLSA and mPath‐FLSA. Moreover, we illustrate a condition that violates the monotone increasing property of the hitting times in mPath‐FLSA.

Suppose we know that a partition ${{\hat{B}}_{k}^{(l)}}_{k = 1}^{n - l}$ of ${1, \dots, n}$ for a given $λ_{2, m P a t h - F L S A}^{(l)} = \frac{1}{2} \min_{k} δ ({\hat{B}}_{k - 1}^{(l - 1)}, {\hat{B}}_{k}^{(l -)})$ , where $λ_{2, m P a t h - F L S A}^{(l)}$ is the $l th$ hitting time of ${y_{1}, y_{2}, \dots, y_{n}}$ based on the mPath‐FLSA. In this section, we denote $λ_{2, m P a t h - F L S A}^{(l)}$ as $λ_{2}^{(l)}$ for notational simplicity. Let $ν_{k}^{(l)}$ be a parameter of the $k th$ block mean at $λ_{2} \in [λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ . A localised FLSA problem for a pair $(k - 1, k)$ of the partition indices of ${{\hat{B}}_{j}^{(l)}}_{j = 1}^{n - l}$ is define as

f_{l} (ν_{k - 1}^{(l)}, ν_{k}^{(l)}; η_{k}) = \frac{1}{2} \sum_{j \in {k - 1, k}} \sum_{i \in {\hat{B}}_{j}^{(l)}} {(y_{i} - ν_{j}^{(l)})}^{2} + η_{k} | ν_{k}^{(l)} - ν_{k - 1}^{(l)} | for k = 2, \dots, n - l,

(19)

where $η_{k} \geq 0$ is a tuning parameter of the localised FLSA problem for $({\hat{B}}_{k - 1}^{(l)}, {\hat{B}}_{k}^{(l)})$ . In Theorem 1, we show that the hitting times of the mPath‐FLSA are equivalent to the minimum hitting times of $(n - l - 1)$ localised FLSA problems.

Theorem 1

Let $λ_{2}^{(l)}$ be the $l th$ hitting time of the modified path algorithm with $y = (y_{1}, \dots, y_{n})$ and $λ_{2}^{(0)} = 0$ . Let $η_{j}^{(l + 1)}$ be the hitting time of (19) for a partition pair $({\hat{B}}_{j - 1}^{(l)}, {\hat{B}}_{j}^{(l)})$ . We define $τ^{(l + 1)} = \min_{2 \leq j \leq n - l} η_{j}^{(l + 1)}$ , which is the minimum hitting time of $f_{l} (ν_{j - 1}, ν_{j})$ for $j = 2, \dots, n - l$ . Then, $η_{j}^{(l + 1)} = δ ({\hat{B}}_{j - 1}^{(l)}, {\hat{B}}_{j}^{(l)})$ and $λ_{2}^{(l + 1)} = τ^{(l + 1)} / 2$ .

The necessary and sufficient conditions of the solution using the subgradient approach of $f_{l} (ν_{j - 1}, ν_{j}; η_{j})$ are as follows:

$\begin{array}{rcl} \frac{\partial f_{l}}{\partial ν_{j - 1}^{(l)}} = \sum_{i \in {\hat{B}}_{j - 1}^{(l)}} y_{i} - {\hat{b}}_{j - 1}^{(l)} ν_{j - 1}^{(l)} - η_{j} \partial | ν_{j}^{(l)} - ν_{j - 1}^{(l)} | = 0 \\ \frac{\partial f_{l}}{\partial ν_{j}^{(l)}} = \sum_{i \in {\hat{B}}_{j}^{(l)}} y_{i} - {\hat{b}}_{j}^{(l)} ν_{j}^{(l)} + η_{j} \partial | ν_{j}^{(l)} - ν_{j - 1}^{(l)} | = 0, \end{array}$ (20)

where ${\hat{b}}_{j}^{(l)} = | {\hat{B}}_{j}^{(l)} |$ , and $\partial | x |$ is the subdifferential of $| x |$ . Let ${\hat{ν}}_{j - 1}^{(l)}$ and ${\hat{ν}}_{j}^{(l)}$ be the solutions of the Eq. (20). For ${\hat{ν}}_{j - 1}^{(l)} > {\hat{ν}}_{j}^{(l)}$ , the solution can be represented as ${\hat{ν}}_{j - 1}^{(l)} = {\bar{y}}_{j - 1}^{(l)} - (η_{k} / {\hat{b}}_{j - 1}^{(l)})$ and ${\hat{ν}}_{j}^{(l)} = {\bar{y}}_{j}^{(l)} + (η_{k} / {\hat{b}}_{j}^{(l)})$ . From ${\hat{ν}}_{j - 1}^{(l)} > {\hat{ν}}_{j}^{(l)}$ , the hitting time is $η_{k}^{(l + 1)} = \frac{{\bar{y}}_{j - 1}^{(l)} - {\bar{y}}_{j}^{(l)}}{1 / {\hat{b}}_{j - 1}^{(l)} + 1 / {\hat{b}}_{j}^{(l)}} = \frac{| {\bar{y}}_{j}^{(l)} - {\bar{y}}_{j - 1}^{(l)} |}{1 / {\hat{b}}_{j - 1}^{(l)} + 1 / {\hat{b}}_{j}^{(l)}}$ . Similarly, for ${\hat{ν}}_{j - 1}^{(l)} < {\hat{ν}}_{j}^{(l)}$ , the hitting time is $η_{j}^{(l + 1)} = \frac{{\bar{y}}_{j}^{(l)} - {\bar{y}}_{j - 1}^{(l)}}{1 / {\hat{b}}_{j - 1}^{(l)} + 1 / {\hat{b}}_{j}^{(l)}} = \frac{| {\bar{y}}_{j}^{(l)} - {\bar{y}}_{j - 1}^{(l)} |}{1 / {\hat{b}}_{j - 1}^{(l)} + 1 / {\hat{b}}_{j}^{(l)}}$ . Hence, $η_{j}^{(l + 1)} = δ ({\hat{B}}_{j - 1}^{(l)}, {\hat{B}}_{j}^{(l)})$ . Based on the definition of the minimum hitting time $τ^{(l + 1)}$ , $τ^{(l + 1)}$ is equivalent to $\min_{2 \leq j \leq n - l} δ ({\hat{B}}_{j - 1}^{(l)}, {\hat{B}}_{j}^{(l)})$ in the modified path algorithm. It proves $λ_{2}^{(l + 1)} = τ^{(l + 1)} / 2$ .

Theorem 1 offers a novel interpretation of mPath‐FLSA. The mPath‐FLSA lacks an overall objective function, and it finds the next fusion block sequentially by solving independent localised FLSA problems for two adjacent blocks, called local fused lasso signal approximator (LFLSA). Henceforth, we refer to the mPath‐FLSA by Son and Lim (2019) as Path‐LFLSA. Moreover, the hitting times for each sub‐problems can be considered as the distance between the two blocks. Consequently, Path‐LFLSA can be considered as a hierarchical clustering algorithm for ${1,2, \dots, n - l}$ with distance metric $ρ (i, j) = δ ({\hat{B}}_{i}^{(l)}, {\hat{B}}_{j}^{(l)})$ if $| i - j | = 1$ and $ρ (i, j) = \infty$ if $| i - j | > 1$ .

Furthermore, this interpretation can help identify conditions that violate the monotonicity in the hitting times of Path‐LFLSA. Theorem 2 shows the conditions for the occurrence of the non‐monotonic sequence terms.

Theorem 2

Let $λ_{2}^{(l)} = \min_{2 \leq j \leq n - l + 1} η_{j}^{(l - 1)} / 2$ be the $l th$ hitting time of the Path‐LFLSA with $y = (y_{1}, \dots, y_{n})$ and $λ_{2}^{(0)} = 0$ . Suppose that $k = {argmin}_{2 \leq j \leq n - l} η_{j}^{(l - 1)}$ , and the partition set $P^{(l)} = {{\hat{B}}_{j}^{(l)}}_{j = 1}^{n - l}$ , where ${\hat{B}}_{i}^{(l)} = {\hat{B}}_{i}^{(l - 1)}$ for $1 \leq i \leq k - 2$ , ${\hat{B}}_{k - 1}^{(l)} = {\hat{B}}_{k - 1}^{(l - 1)} \cup {\hat{B}}_{k}^{(l - 1)}$ , and ${\hat{B}}_{i}^{(l)} = {\hat{B}}_{i + 1}^{(l - 1)}$ for $k \leq i \leq n - l$ . Then, the next hitting time $λ_{2}^{(l + 1)}$ is less than $λ_{2}^{(l)}$ if $η_{k - 1}^{(l + 1)} / 2 < λ_{2}^{(l)}$ or $η_{k}^{(l + 1)} / 2 < λ_{2}^{(l)}$ . Moreover, the violation $λ_{2}^{(l)} > λ_{2}^{(l + 1)}$ can be checked with the solution at $λ_{2}^{(l - 1)}$ . Let $ω_{k}^{(l - 1)} = {\hat{b}}_{k}^{(l - 1)} / ({\hat{b}}_{k - 1}^{(l - 1)} + {\hat{b}}_{k}^{(l - 1)})$ . The violation occurs if and only if the following inequality hold for $m = k - 2$ $(2 < k \leq n - l)$ or $k + 1$ $(2 \leq k < n - l)$ :

$| {\bar{y}}_{k}^{(l - 1)} - {\bar{y}}_{k - 1}^{(l - 1)} | > \frac{{\hat{b}}_{m}^{(l - 1)} {({\hat{b}}_{k - 1}^{(l - 1)} + {\hat{b}}_{k}^{(l - 1)})}^{2}}{{\hat{b}}_{k - 1}^{(l - 1)} {\hat{b}}_{k}^{(l - 1)} \sum_{j \in {m, k - 1, k}} {\hat{b}}_{j}^{(l - 1)}} |(1 - ω_{k}^{(l - 1)}) {\bar{y}}_{k - 1}^{(l - 1)} + ω_{k}^{(l - 1)} {\bar{y}}_{k}^{(l - 1)} - {\bar{y}}_{m}^{(l - 1)}| .$ (21)

Due to the definition of $λ_{2}^{(l)}$ , $λ_{2}^{(l)} \leq η_{j}^{(l - 1)} / 2$ for $2 \leq j \leq n - l + 1$ . After $l th$ hitting time, the set ${\hat{B}}_{k - 1}^{(l)}$ is defined by the union of ${\hat{B}}_{k - 1}^{(l - 1)}$ and ${\hat{B}}_{k}^{(l - 1)}$ , and the other blocks remain the same as in the previous partition $P^{(l - 1)}$ . Thus, the pairs $({\hat{B}}_{j - 1}^{(l)}, {\hat{B}}_{j}^{(l)})$ for $j = 2, \dots, k - 2, k + 1, \dots, n - l$ also remain the same as in the previous partition set $P^{(l - 1)}$ , which implies $η_{j}^{(l)} = η_{j}^{(l - 1)} \geq λ_{2}^{(l)}$ for $j \neq k - 1, k$ . Therefore, the next hitting time $λ_{2}^{(l + 1)}$ decreases only when $η_{k - 1}^{(l + 1)} / 2 < λ_{2}^{(l)}$ or $η_{k}^{(l + 1)} / 2 < λ_{2}^{(l)}$ . Moreover, based on this assumption, we can express the $l th$ hitting time as $λ_{2}^{(l)} = η_{k}^{(l - 1)} / 2 = | {\bar{y}}_{k}^{(l - 1)} - {\bar{y}}_{k - 1}^{(l - 1)} | / (2 / {\hat{b}}_{k}^{(l - 1)} + 2 / {\hat{b}}_{k - 1}^{(l - 1)})$ . Because ${\bar{y}}_{k - 2}^{(l)} = {\bar{y}}_{k - 2}^{(l - 1)}$ , ${\bar{y}}_{k - 1}^{(l)} = (1 - ω_{k}^{(l - 1)}) {\bar{y}}_{k - 1}^{(l - 1)} + ω_{k}^{(l - 1)} {\bar{y}}_{k}^{(l - 1)}$ , and ${\bar{y}}_{k}^{(l)} = {\bar{y}}_{k + 1}^{(l - 1)}$ , we can express $η_{k - 1}^{(l + 1)}$ and $η_{k}^{(l + 1)}$ as $η_{k - 1}^{(l + 1)} = | (1 - ω_{k}^{(l - 1)}) {\bar{y}}_{k - 1}^{(l - 1)} + ω_{k}^{(l - 1)} {\bar{y}}_{k}^{(l - 1)} - {\bar{y}}_{k - 2}^{(l - 1)} | / (1 / ({\hat{b}}_{k}^{(l - 1)} + {\hat{b}}_{k - 1}^{(l - 1)}) + 1 / {\hat{b}}_{k - 2}^{(l - 1)})$ and $η_{k}^{(l + 1)} = | (1 - ω_{k}^{(l - 1)}) {\bar{y}}_{k - 1}^{(l - 1)} + ω_{k}^{(l - 1)} {\bar{y}}_{k}^{(l - 1)} - {\bar{y}}_{k + 1}^{(l - 1)} | / (1 / ({\hat{b}}_{k}^{(l - 1)} + {\hat{b}}_{k - 1}^{(l - 1)}) + 1 / {\hat{b}}_{k + 1}^{(l - 1)}),$ respectively. Applying these observations to $2 λ_{2}^{(l)} / η_{k - 1}^{(l + 1)} > 1$ and $2 λ_{2}^{(l)} / η_{k}^{(l + 1)} > 1$ , we obtain the condition (21).

To describe the violation $λ_{2}^{(l)} > λ_{2}^{(l + 1)}$ , we consider three independent random variables $(Y_{1}, Y_{2}, Y_{3})$ from the standard normal distribution $N (0,1)$ . Corollary 1 shows that the probability of the violation $λ_{2}^{(1)} > λ_{2}^{(2)}$ is $4 P ((4 Y_{1} + Y_{2}) / 5 < Y_{3} < Y_{1}) = 4 E (Φ (- 3 \sqrt{3} Z) | Z > 0) \approx 0.121$ when we apply the Path‐LFLSA to $(Y_{1}, Y_{2}, Y_{3})$ , where $Z \sim N (0,1)$ and $Φ (z)$ is the cumulative distribution function of the standard normal distribution.

Corollary 1

Suppose $Y_{1}, Y_{2}, Y_{3}$ are random samples from $N (0,1)$ . Let $Z$ be a standard normal random variable and $Φ (z)$ be the cumulative distribution function of $Z$ . Let $λ_{2}^{(1)} = \min_{j = 2,3} η_{j}^{(0)} / 2$ and $λ_{2}^{(2)} = η_{2}^{(1)} / 2$ be the first and the second hitting times of the Path‐LFLSA with $λ_{2}^{(0)} = 0$ , respectively. Then, the probability $P (λ_{2}^{(1)} > λ_{2}^{(2)})$ is equal to $4 P ((4 Y_{1} + Y_{2}) / 5 < Y_{3} < Y_{1})$ . In addition, $P (λ_{2}^{(1)} > λ_{2}^{(2)})$ can be calculated from the univariate conditional expectation $4 E (Φ (- 3 \sqrt{3} Z) | Z > 0)$ .

A proof of Corollary 1 can be found in Appendix A of the supplementary material. The findings of this study reveal that the mPath‐FLSA, also called the Path‐LFLSA, is the clustering of the observed point based on the distance defined by the minimum of the next hitting times of the localised FLSA problems. We suggest using the indices $l = 0,1, \dots, n - 1$ of hitting times ${λ_{2}^{(l)}}$ to represent the solution path of the Path‐LFLSA instead of directly drawing the solution path along $λ_{2}$ , which avoids falsely split points in the solution path. This phenomenon is illustrated in Section 5.

4. PATHWISE ADAPTIVE FLSA

The Path‐LFLSA motivated us to propose the pathwise adaptive FLSA, which is a weighted FLSA with pathwise adaptive weights and guarantees the monotonicity of the hitting times. Specifically, let $λ_{2}^{(l)}$ be the $l th$ hitting time and $P^{(l)} = {{\hat{B}}_{j}^{(l)}}_{j = 1}^{n - l}$ be a partition of ${1,2, \dots, n}$ at $λ_{2}^{(l)}$ such that ${\hat{B}}_{i}^{(l)} ⋂ {\hat{B}}_{j}^{(l)} = \emptyset$ for $i \neq j$ and $⋃_{j = 1}^{n - l} {\hat{B}}_{j}^{(l)} = {1, \dots, n}$ . For $λ_{2} \in (λ_{2}^{(l)}, λ_{2}^{(l + 1)}]$ , the pathwise adaptive FLSA (PA‐FLSA) minimises

f_{P A} (μ; λ_{2}) = \frac{1}{2} \sum_{i = 1}^{n} {(y_{i} - μ_{i})}^{2} + λ_{2} \sum_{j = 2}^{n} w_{μ, j}^{(l)} | μ_{j} - μ_{j - 1} |,

(22)

where

w_{μ, j}^{(l)} = \{\begin{matrix} \frac{1}{| {\bar{y}}_{t}^{(l)} - {\bar{y}}_{t - 1}^{(l)} |} & if j \in {\hat{B}}_{t}^{(l)}, j - 1 \in {\hat{B}}_{t - 1}^{(l)}, | {\bar{y}}_{t}^{(l)} - {\bar{y}}_{t - 1}^{(l)} | > \frac{1}{M} \\ M & otherwise \end{matrix}

and $M > 0$ is a sufficiently large weight that fuses the corresponding two parameters. Equivalently, we can express the PA‐FLSA problem in the following reduced form:

f_{l} (ν_{1}, \dots, ν_{n - l}; λ_{2}) = \frac{1}{2} \sum_{j = 1}^{n - l} \sum_{i \in {\hat{B}}_{j}^{(l)}} {(y_{i} - ν_{j})}^{2} + λ_{2} \sum_{j = 2}^{n - l} w_{j}^{(l)} | ν_{j} - ν_{j - 1} |,

(23)

where $w_{j}^{(l)} = {| {\bar{y}}_{j}^{(l)} - {\bar{y}}_{j - 1}^{(l)} |}^{- 1}$ if $| {\bar{y}}_{j}^{(l)} - {\bar{y}}_{j - 1}^{(l)} | > M^{- 1}$ and $M$ otherwise. Note that $M > 0$ is also used in the reduced form to avoid the division by zero besides the numerical instability. For example, we set $M = 100$ in our numerical study.

We now investigate the properties of PA‐FLSA. First, we express the solution of the PA‐FLSA in Lemma 1.

Lemma 1

Suppose that there are $n$ intervals $[λ_{2}^{(0)}, λ_{2}^{(1)})$ , $[λ_{2}^{(1)}, λ_{2}^{(2)})$ , $\dots$ , $[λ_{2}^{(n)}, \infty)$ with $λ_{2}^{(0)} = 0$ , where the fused set is unchanged in each interval. For $λ_{2} \in [λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ , the solution of the pathwise adaptive FLSA estimator of (23) is given by

${\hat{μ}}_{i}^{P A} (λ_{2}) = \sum_{k = 1}^{n - l} {\hat{ν}}_{k}^{(l)} (λ_{2}) I (i \in {\hat{B}}_{k}^{(l)}),$

where ${\hat{ν}}_{k}^{(l)} = {\bar{y}}_{k}^{(l)} + {\hat{c}}_{k}^{(l)} (λ_{2})$ , ${\hat{b}}_{k}^{(l)} = | {\hat{B}}_{k}^{(l)} |$ , $\partial | x |$ is the subdifferential of $| x |$ , and

${\hat{c}}_{k}^{(l)} (λ_{2}) = \{\begin{matrix} \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k + 1}^{(l)} \partial | {\hat{ν}}_{k + 1}^{(l)} - {\hat{ν}}_{k}^{(l)} |) & if k = 1 \\ - \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} \partial | {\hat{ν}}_{k}^{(l)} - {\hat{ν}}_{k - 1}^{(l)} | - w_{k + 1}^{(l)} \partial | {\hat{ν}}_{k + 1}^{(l)} - {\hat{ν}}_{k}^{(l)} |) & if 2 \leq k \leq n - l - 1 \\ - \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} \partial | {\hat{ν}}_{k}^{(l)} - {\hat{ν}}_{k - 1}^{(l)} |) & if k = n - l \end{matrix}$

The subdifferential of $f_{l} (\cdot)$ gives the proof of Lemma 1 directly (Bertsekas, 1999).

Based on Lemma 1, we can anticipate the PA‐FLSA to avoid the inconsistency of the FLSA in the presence of the stair‐case blocks. Specifically, for the FLSA, the bias term ${\hat{c}}_{k}$ is zero for the stair‐case blocks (i.e. $({\hat{ν}}_{k} - {\hat{ν}}_{k - 1}) ({\hat{ν}}_{k + 1} - {\hat{ν}}_{k}) > 0$ ), implying that the FLSA fails to move the estimates of the stair‐case blocks. However, for the PA‐FLSA, the ${\hat{c}}_{k}^{(l)}$ for $2 \leq k \leq n - l - 1$ is defined as:

{\hat{c}}_{k}^{(l)} (λ_{2}) = \{\begin{aligned} \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} + w_{k + 1}^{(l)}) & if {\hat{ν}}_{k - 1}^{(l)} > {\hat{ν}}_{k}^{(l)} and {\hat{ν}}_{k}^{(l)} < {\hat{ν}}_{k + 1}^{(l)} (local min.) \\ - \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} + w_{k + 1}^{(l)}) & if {\hat{ν}}_{k - 1}^{(l)} < {\hat{ν}}_{k}^{(l)} and {\hat{ν}}_{k}^{(l)} > {\hat{ν}}_{k + 1}^{(l)} (local max.) \\ - \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} - w_{k + 1}^{(l)}) & if {\hat{ν}}_{k - 1}^{(l)} < {\hat{ν}}_{k}^{(l)} and {\hat{ν}}_{k}^{(l)} < {\hat{ν}}_{k + 1}^{(l)} (increasing stair‐block) \\ \frac{λ_{2}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} - w_{k + 1}^{(l)}) & if {\hat{ν}}_{k - 1}^{(l)} > {\hat{ν}}_{k}^{(l)} and {\hat{ν}}_{k}^{(l)} > {\hat{ν}}_{k + 1}^{(l)} (decreasing stair‐block) \end{aligned} .

Thus, the PA‐FLSA has a nonzero bias term for the stair‐case blocks unless $| {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k - 1}^{(l)} | = | {\bar{y}}_{k + 1}^{(l)} - {\bar{y}}_{k}^{(l)} |$ .

Second, we investigate the monotone fusion property of the PA‐FLSA, which denotes that ${\hat{μ}}_{j}$ and ${\hat{μ}}_{j + 1}$ remain fused at $λ_{2}^{'} > λ_{2}$ if ${\hat{μ}}_{j}$ and ${\hat{μ}}_{j + 1}$ are fused at $λ_{2}$ . The monotone fusion property for the FLSA was proved in Proposition 2 of Friedman et al. (2007). We show that the PA‐FLSA holds the monotone fusion property similar to the FLSA in Proposition 1. However, the PA‐FLSA adopts the weighted fusion penalty with the pathwise adaptive weights.

Proposition 1

In the PA‐FLSA, two parameters that are fused in the solution for $λ_{2}$ are fused for all $λ_{2}^{'} > λ_{2}$ .

Appendix B of the supplementary material contains a proof of Proposition 1.

Finally, we show that the solution path of the PA‐FLSA is piecewise linear in the given interval $[λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ and discontinuous at the hitting times, which changes the pathwise adaptive weights in the Theorem 3 mentioned below.

Theorem 3

The solution path of the PA‐FLSA is a piecewise linear function of $λ_{2}$ and discontinuous at the hitting times $λ_{2}^{(l)}$ , which changes the pathwise adaptive weights. The next hitting time $λ_{2}^{(l + 1)}$ can be obtained from the current solution at $λ_{2}^{(l)}$ by

$λ_{2}^{(l + 1)} = λ_{2}^{(l)} + \min_{\underset{h_{k, k - 1}^{(l)} > 0}{2 \leq k \leq n - l}} h_{k, k - 1}^{(l)}, where h_{k, k - 1}^{(l)} = \frac{{\hat{ν}}_{k}^{(l)} - {\hat{ν}}_{k - 1}^{(l)}}{\frac{\partial {\hat{ν}}_{k - 1}^{(l)}}{\partial λ_{2}} - \frac{\partial {\hat{ν}}_{k}^{(l)}}{\partial λ_{2}}} .$ (24)

The partition ${{\hat{B}}_{j}^{(l)}}_{j = 1}^{n - l}$ is unchanged for $λ_{2} \in [λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ . Based on Lemma 1, we can represent the derivative of the solution with respect to $λ_{2}$ for a given interval $[λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ as

$\frac{\partial {\hat{ν}}_{k}}{\partial λ_{2}} = \{\begin{array}{ll} {({\hat{b}}_{k}^{(l)})}^{- 1} w_{k + 1}^{(l)} sign ({\hat{ν}}_{k + 1} - {\hat{ν}}_{k}) & for k = 1 \\ - {({\hat{b}}_{k}^{(l)})}^{- 1} (w_{k}^{(l)} sign ({\hat{ν}}_{k} - {\hat{ν}}_{k - 1}) - w_{k + 1}^{(l)} sign ({\hat{ν}}_{k + 1} - {\hat{ν}}_{k})) & for k = 2, \dots, n - l - 1 \\ - {({\hat{b}}_{k}^{(l)})}^{- 1} w_{k}^{(l)} sign ({\hat{ν}}_{k} - {\hat{ν}}_{k - 1}) & for k = n - l \end{array} .$

Thus, the solution paths of ${\hat{ν}}_{k}$ for $k = 1, \dots, n - l$ along $λ_{2}$ are linear in the given interval $[λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ because ${w_{k}^{(l)}}_{k = 1}^{n - l}$ and ${sign ({\hat{ν}}_{k + 1} - {\hat{ν}}_{k})}_{k = 1}^{n - l}$ are unchanged. Based on this linearity, for $λ_{2} \in [λ_{2}^{(l)}, λ_{2}^{(l + 1)}$ ), we can obtain the solution at $λ_{2}$ using the equation:

${\hat{ν}}_{k} (λ_{2}) = {\hat{ν}}_{k}^{(l)} + α \frac{\partial ν_{k}}{\partial λ_{2}},$

where ${\hat{ν}}_{k}^{(l)}$ is the minimiser of (23) at $λ_{2} = λ_{2}^{(l)}$ and $α = λ_{2} - λ_{2}^{(l)} \in [0, λ_{2}^{(l + 1)} - λ_{2}^{(l)})$ . We can find the next hitting time $h_{k, k - 1}^{(l)}$ for a pair $({\hat{ν}}_{k} (λ_{2}), {\hat{ν}}_{k - 1} (λ_{2}))$ of two solution paths from the equation ${\hat{ν}}_{k} (λ_{2}) = {\hat{ν}}_{k - 1} (λ_{2})$ , for $k = 2, \dots, n - l$ , as

$h_{k, k - 1}^{(l)} = \frac{{\hat{ν}}_{k}^{(l)} - {\hat{ν}}_{k - 1}^{(l)}}{\frac{\partial {\hat{ν}}_{k - 1}}{\partial λ_{2}} - \frac{\partial {\hat{ν}}_{k}}{\partial λ_{2}}} \in (0, λ_{2}^{(l + 1)} - λ_{2}^{(l)}] .$

Because these hitting times are only valid for the unchanged partition ${{\hat{B}}_{j}^{(l)}}_{j = 1}^{n - l}$ (i.e. there is no fusion for $λ_{2} \in (λ_{2}^{(l)}, λ_{2}^{(l + 1)})$ ), the next hitting time $λ_{2}^{(l + 1)}$ should satisfy the following equation:

$λ_{2}^{(l + 1)} - λ_{2}^{(l)} = \min_{\underset{h_{k, k - 1}^{(l)} > 0}{2 \leq k \leq n - l}} h_{k, k - 1}^{(l)} .$

Consider two PA‐FLSA problems with ${(w_{j}^{(l)})}_{j = 1}^{n - 1}$ and ${(w_{j}^{(l + 1)})}_{j = 1}^{n - l - 1}$ at $λ_{2} = λ_{2}^{(l + 1)}$ to show the discontinuity of the solution path of the PA‐FLSA. We denote the first problem as the PA‐FLSA( $l$ ) and the second problem as the PA‐FLSA( $l + 1$ ) to distinguish the two problems. Suppose that $3 < q < n - l$ and ${\hat{ν}}_{q}^{(l)} (λ_{2})$ and ${\hat{ν}}_{q - 1}^{(l)} (λ_{2})$ of the PA‐FLSA( $l$ ) are fused at $λ_{2}^{(l)}$ . Let $s ({\hat{ν}}_{k}^{(l)}, {\hat{ν}}_{k - 1}^{(l)}) = \partial | {\hat{ν}}_{k}^{(l)} - {\hat{ν}}_{k - 1}^{(l)} |$ , ${\hat{b}}_{[q - 1 : q]}^{(l)} = {\hat{b}}_{q - 1}^{(l)} + {\hat{b}}_{q}^{(l)}$ , and ${\bar{y}}_{[q - 1 : q]}^{(l)} = ({\hat{b}}_{q - 1}^{(l)} {\bar{y}}_{q - 1}^{(l)} + {\hat{b}}_{q}^{(l)} {\bar{y}}_{q}^{(l)}) / {\hat{b}}_{[q - 1 : q]}$ . Based on Lemma 1, we can express the solution ${\hat{ν}}^{(l)} (λ_{2}^{(l + 1)})$ of the PA‐FLSA( $l$ ) as

${\hat{ν}}_{k}^{(l)} (λ_{2}^{(l + 1)}) = \{\begin{array}{ll} {\bar{y}}_{1}^{(l)} + \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{1}^{(l)}} (w_{2}^{(l)} s ({\hat{ν}}_{2}^{(l)}, {\hat{ν}}_{1}^{(l)})) & if k = 1 \\ {\bar{y}}_{k}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} s ({\hat{ν}}_{k}^{(l)}, {\hat{ν}}_{k - 1}^{(l)}) - w_{k + 1}^{(l)} s ({\hat{ν}}_{k + 1}^{(l)}, {\hat{ν}}_{k}^{(l)})) & if k \neq 1, q - 1, q, n - l \\ {\bar{y}}_{[q - 1 : q]}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{[q - 1 : q]}^{(l)}} (w_{q - 1}^{(l)} s ({\hat{ν}}_{q - 1}^{(l)}, {\hat{ν}}_{q - 2}^{(l)}) - w_{q + 1}^{(l)} s ({\hat{ν}}_{q + 1}^{(l)}, {\hat{ν}}_{q}^{(l)})) & if k = q - 1, q \\ {\bar{y}}_{n - l}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{n - l}^{(l)}} (w_{n - l}^{(l)} s ({\hat{ν}}_{n - l}^{(l)}, {\hat{ν}}_{n - l - 1}^{(l)})) & if k = n - l \end{array}$

Based on this assumption, we can represent the partition ${\hat{B}}_{k}^{(l + 1)}$ into ${\hat{B}}_{k}^{(l + 1)} = {\hat{B}}_{k}^{(l)}$ for $1 \leq k \leq q - 2$ , ${\hat{B}}_{q - 1}^{(l + 1)} = {\hat{B}}_{q - 1}^{(l)} \cup {\hat{B}}_{q}^{(l)}$ , and ${\hat{B}}_{k}^{(l + 1)} = {\hat{B}}_{k + 1}^{(l)}$ for $q \leq k \leq n - l - 1$ . For notational simplicity, in the proof, we use the definition of the pathwise adaptive weight as $w_{k}^{(l)} = | {\bar{y}}_{k}^{(l)} - {\bar{y}}_{k - 1}^{(l)} |^{- 1}$ . Thus, the following equations hold:

${\bar{y}}_{k}^{(l + 1)} = \{\begin{array}{cl} {\bar{y}}_{k}^{(l)} & for 1 \leq k \leq q - 2 \\ {\bar{y}}_{[q - 1 : q]}^{(l)} & for k = q - 1 \\ {\bar{y}}_{k + 1}^{(l)} & for q \leq k \leq n - l - 1 \end{array}, w_{k}^{(l + 1)} = \{\begin{array}{cl} w_{k}^{(l)} & for 2 \leq k \leq q - 2 \\ {(| {\bar{y}}_{[q - 1 : q]}^{(l)} - {\bar{y}}_{q - 2}^{(l)} |)}^{- 1} & for k = q - 1 \\ {(| {\bar{y}}_{q + 1}^{(l)} - {\bar{y}}_{[q - 1 : q]}^{(l)} |)}^{- 1} & for k = q \\ w_{k + 1}^{(l)} & for q + 1 \leq k \leq n - l - 1 \end{array} .$

Then, by Lemma 1, the solution ${\hat{ν}}^{(l + 1)} (λ_{2}^{(l + 1)})$ of the PA‐FLSA( $l + 1$ ) can be represented as

${\hat{ν}}_{k}^{(l + 1)} (λ_{2}^{(l + 1)}) = \{\begin{array}{ll} {\bar{y}}_{k}^{(l)} + \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{k}^{(l)}} (w_{2}^{(l)} s ({\hat{ν}}_{2}^{(l + 1)}, {\hat{ν}}_{1}^{(l + 1)})) & if k = 1 \\ {\bar{y}}_{k}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} s ({\hat{ν}}_{k}^{(l + 1)}, {\hat{ν}}_{k - 1}^{(l + 1)}) - w_{k + 1}^{(l)} s ({\hat{ν}}_{k + 1}^{(l + 1)}, {\hat{ν}}_{k}^{(l + 1)})) & if 2 \leq k \leq q - 3 \\ {\bar{y}}_{k}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{k}^{(l)}} (w_{k}^{(l)} s ({\hat{ν}}_{k}^{(l + 1)}, {\hat{ν}}_{k - 1}^{(l + 1)}) - w_{k + 1}^{(l + 1)} s ({\hat{ν}}_{k + 1}^{(l + 1)}, {\hat{ν}}_{k}^{(l + 1)})) & if k = q - 2 \\ {\bar{y}}_{[q - 1 : q]}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{[q - 1 : q]}^{(l)}} (w_{q - 1}^{(l + 1)} s ({\hat{ν}}_{q - 1}^{(l + 1)}, {\hat{ν}}_{q - 2}^{(l + 1)}) - w_{q}^{(l + 1)} s ({\hat{ν}}_{q}^{(l + 1)}, {\hat{ν}}_{q - 1}^{(l + 1)})) & if k = q - 1 \\ {\bar{y}}_{q + 1}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{q + 1}^{(l)}} (w_{q}^{(l + 1)} s ({\hat{ν}}_{q}^{(l + 1)}, {\hat{ν}}_{q - 1}^{(l + 1)}) - w_{q + 2}^{(l)} s ({\hat{ν}}_{q + 1}^{(l + 1)}, {\hat{ν}}_{q}^{(l + 1)})) & if k = q \\ {\bar{y}}_{k + 1}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{k + 1}^{(l)}} (w_{k + 1}^{(l)} s ({\hat{ν}}_{k}^{(l + 1)}, {\hat{ν}}_{k - 1}^{(l + 1)}) - w_{k + 2}^{(l)} s ({\hat{ν}}_{k + 1}^{(l + 1)}, {\hat{ν}}_{k}^{(l + 1)})) & if q < k < n - l - 1 \\ {\bar{y}}_{n - l}^{(l)} - \frac{λ_{2}^{(l + 1)}}{{\hat{b}}_{n - l}^{(l)}} (w_{n - l}^{(l)} s ({\hat{ν}}_{n - l - 1}^{(l + 1)}, {\hat{ν}}_{n - l - 2}^{(l + 1)})) & if k = n - l - 1 \end{array}$

Let ${\hat{μ}}_{i}^{(l)} (λ_{2}^{(l + 1)}) = \sum_{j = 1}^{n - l} {\hat{ν}}_{j}^{(l)} (λ_{2}^{(l + 1)}) I (i \in {\hat{B}}_{j}^{(l)})$ and ${\hat{μ}}^{(l + 1)} (λ_{2}^{(l + 1)}) = \sum_{j = 1}^{n - l - 1} {\hat{ν}}_{j}^{(l + 1)} (λ_{2}^{(l + 1)}) I (i \in {\hat{B}}_{j}^{(l + 1)})$ . A comparison of the two solutions ${\hat{μ}}^{(l)} (λ_{2}^{(l + 1)})$ and ${\hat{μ}}^{(l + 1)} (λ_{2}^{(l + 1)})$ can easily spot that the absolute differences between $| μ_{i}^{(l)} (λ_{2}^{(l + 1)}) - μ_{i}^{(l + 1)} (λ_{2}^{(l + 1)}) |$ for $i \in \cup_{j = q - 2}^{q} {\hat{B}}_{j}^{(l + 1)}$ are nonzero if $w_{q - 1}^{(l)} \neq w_{q - 1}^{(l + 1)}$ and $w_{q + 1}^{(l)} \neq w_{q}^{(l + 1)}$ . This completes the proof of the discontinuity of the solution path of the PA‐FLSA.

The results in Theorem 3 show that the hitting times $λ_{2}^{(l)}$ of the PA‐FLSA increase monotonically. Unlike the original FLSA, the solution path of the PA‐FLSA is discontinuous at hitting times that change the pathwise adaptive weights. However, this discontinuity does not affect PA‐FLSA's performance in estimating true signal levels and identifying the change points. We will examine these phenomena in the numerical study.

5. NUMERICAL STUDY

This section presents a comprehensive numerical study. First, we numerically investigate the properties we have explored in the previous sections, including the violation of the monotone increasing property for the hitting times of the Path‐LFLSA and the discontinuity of the PA‐FLSA's solution path. Second, we numerically compare the probabilities of the exact pattern recovery of the four methods (Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA) under four scenarios. The first three scenarios are based on Qian & Jia (2016), and the fourth is novel for this study. We also conduct a comparison of the performances of the four methods in estimating the signal levels and identifying the block structures with the optimal tuning parameters chosen by the Bayesian information criterion (BIC) (Schwarz, 1978) and the extended BIC (EBIC) (Chen & Chen, 2008) in Appendix C of the supplementary material.

5.1. Comparison of Whole Solution Paths

In this subsection, we compare the whole solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. We consider $y_{i} = μ_{i}^{*} + ϵ_{i}$ , $ϵ_{i} \sim N (0, σ^{2})$ , $μ = {(μ_{1}, \dots, μ_{8})}^{T}$ , and $σ = 0.25$ to generate the underlying signal and noisy observations. The true mean value is set to $μ_{i}^{*} = {(0 . 5,0 . 5,0, 0, - 0.5, - 0.5)}^{T}$ and the noisy observation is generated as $y = {(0 . 4314,0 . 4000, - 0 . 2140,0 . 5188, - 0.2379, - 0.4435)}^{T}$ . We applied the four methods to obtain the whole solution paths along $λ_{2}$ . The solution paths of the Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are depicted in Figure 1. The solution path of the Path‐LFLSA seems to have a split event around $λ_{2} = 0.2$ , while the Path‐LFLSA actually has fusion events only. We also report the hitting times of the solution paths from Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA in Table 1 for details.

Solution paths of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for $μ = {(μ_{1}, \dots, μ_{6})}^{T}$

TABLE 1.

Hitting times of the whole solution paths from the four methods with the noisy observation $y$ with $λ_{2}^{(0)} = 0$

Method

λ_{2}^{(1)}

λ_{2}^{(2)}

λ_{2}^{(3)}

λ_{2}^{(4)}

λ_{2}^{(5)}

Path‐FLSA

0.0314

0.1832

0.2056

0.5266

0.8330

PCD‐FLSA

0.0314

0.2056

0.6140

0.7328

0.7567

Path‐LFLSA

0.0078

0.0514

0.1832

0.1317

0.4165

PA‐FLSA

0.0314

0.0521

0.1322

0.1838

0.2682

Open in a new tab

Table 1 shows that the fourth hitting time $λ_{2}^{(4)} = 0.1317$ is less than the third hitting time $λ_{2}^{(3)} = 0.1832$ for Path‐LFLSA, violating the monotone increasing property of the hitting times. Contrarily, the other solution path algorithms satisfy the monotone increasing property of the hitting times. As demonstrated in Theorem 2, this violation $λ_{2}^{(3)} > λ_{2}^{(4)}$ can be checked with the solution at the second hitting time $λ_{2} = λ_{2}^{(2)}$ . At $λ_{2} = λ_{2}^{(2)}$ , ${\bar{y}}^{(2)} = {(0.4157, - 0 . 2140,0 . 5188, - 0.3407)}^{T}$ , ${\hat{b}}^{(2)} = {(2,1, 1,2)}^{T}$ , $η^{(2)} = (η_{2}^{(2)}, η_{3}^{(2)}, η_{4}^{(2)}) = {(0 . 2099,0 . 1832,0 . 2865)}^{T}$ . Thus, $k = {argmin}_{j = 2,3, 4} η_{j}^{(2)} = 3$ , $w_{3}^{(2)} = {\hat{b}}_{3}^{(2)} / ({\hat{b}}_{2}^{(2)} + {\hat{b}}_{3}^{(2)}) = 0.5$ , and $m = k - 2 = 1$ or $m = k + 1 = 4$ in the condition (21). Thus, the following condition (21) holds for $m = 1$ :

0.7328 = | {\bar{y}}_{3}^{(2)} - {\bar{y}}_{2}^{(2)} | > \frac{{\hat{b}}_{1}^{(2)} {({\hat{b}}_{2}^{(2)} + {\hat{b}}_{3}^{(2)})}^{2}}{{\hat{b}}_{2}^{(2)} {\hat{b}}_{3}^{(2)} \sum_{j = 1}^{3} {\hat{b}}_{j}^{(2)}} |({\bar{y}}_{2}^{(2)} + {\bar{y}}_{3}^{(2)}) / 2 - {\bar{y}}_{1}^{(2)}| = 0.5266 .

As mentioned in Section 3, we suggest drawing the solution path of the Path‐LFLSA along indices of its hitting times, which is equivalent to the order of fusion events. We depict the solution paths along $λ_{2}$ and the indices of the hitting times in Figure 2. The solution path along the indices of the hitting times (Figure 2 (b)) is more readable than the solution path along $λ_{2}$ (Figure 2 (a)).

Solution paths of Path‐LFLSA along $λ_{2}$ and indices of hitting times for $μ = {(μ_{1}, \dots, μ_{6})}^{T}$

Figure 1 also shows that the solution path of the PA‐FLSA has discontinuous points at the hitting times. For example, the values of ${\hat{ν}}^{(2)} (λ_{2}^{(3)})$ and ${\hat{ν}}^{(3)} (λ_{2}^{(3)})$ in the proof of Theorem 3 are different, where ${\hat{ν}}^{(2)} (λ_{2}^{(3)}) = {(0 . 3107,0 . 1764, - 0.2597)}^{T}$ and ${\hat{ν}}^{(3)} (λ_{2}^{(3)}) = {(0 . 3107,0 . 1672, - 0.2505)}^{T}$ . Table 1 shows that the hitting times of the PA‐FLSA are monotonically increasing, supporting Theorem 3.

Finally, we compared the fused sets at the hitting times. From Figure 1, the partitions from the Path‐FLSA are obtained as ${\hat{B}}^{(1)} = {{1,2}, {3}, {4}, {5}, {6}}$ , ${\hat{B}}^{(2)} = {{1,2}, {3,4}, {5}, {6}}$ , ${\hat{B}}^{(3)} = {{1,2}, {3,4}, {5,6}}$ , ${\hat{B}}^{(4)} = {{1,2, 3,4}, {5,6}}$ , and ${\hat{B}}^{(5)} = {{1,2, 3,4, 5,6}}$ for $λ_{2}^{(1)}, \dots, λ_{2}^{(5)}$ , respectively. Path‐LFLSA and PA‐FLSA have the same partitions ${\hat{B}}^{(1)} = {{1,2}, {3}, {4}, {5}, {6}}$ , ${\hat{B}}^{(2)} = {{1,2}, {3}, {4}, {5,6}}$ , ${\hat{B}}^{(3)} = {{1,2}, {3,4}, {5,6}}$ , ${\hat{B}}^{(4)} = {{1,2, 3,4}, {5,6}}$ , and ${\hat{B}}^{(5)} = {{1,2, 3,4, 5,6}}$ for $λ_{2}^{(1)}, \dots, λ_{2}^{(5)}$ , respectively. Thus, we observe that Path‐FLSA, Path‐LFLSA, and PA‐FLSA contain the true partition ${{1,2}, {3,4}, {5,6}}$ . Contrarily, the partitions from the PCD‐FLSA are obtained as ${\hat{B}}^{(1)} = {{1,2}, {3}, {4}, {5}, {6}}$ , ${\hat{B}}^{(2)} = {{1,2}, {3}, {4}, {5,6}}$ , ${\hat{B}}^{(3)} = {{1,2, 3}, {4}, {5,6}}$ , ${\hat{B}}^{(4)} = {{1,2, 3,4}, {5,6}}$ , and ${\hat{B}}^{(5)} = {{1,2, 3,4, 5,6}}$ for $λ_{2}^{(1)}, \dots, λ_{2}^{(5)}$ , respectively. The PCD‐FLSA fails to contain the true partition, and this difference is because of the Puffer transformation in the PCD‐FLSA. Note that the last observation describes the estimates of the methods from the one data set $y$ only, not implying that the PCD‐FLSA generally fails to contain the true partition. In the following subsection, we evaluate the exact pattern recovery probabilities estimated by 1000 data sets to compare the general performance of containing the true partition.

5.2. Comparison of Exact Pattern Recovery Probabilities

In this section, we compare the probabilities of the exact pattern recovery of the four methods similar to Qian & Jia (2016), where the exact pattern recovery indicates that the solution path of the given method contains the solution of the true block structures. We considered four scenarios for the underlying block structures. Three scenarios are taken directly from Qian & Jia (2016) to reproduce their results and conduct a fair comparison, in which two scenarios have no stair‐case blocks, and one scenario has stair‐case blocks. Besides, the last scenario for our numerical study was a true block structure with multiple stair‐case blocks. These four scenarios can be divided into two categories based on the existence of the stair‐case blocks, which is equivalent to the failure of the irrepresentable condition of the transformed Lasso in (13). Again, for a fair comparison, we set $n = 430$ and consider $σ = 0 . 05,0, 10, \dots, 0.5$ like Qian & Jia (2016). The noisy observations are from $y_{i} = μ_{i} + ϵ_{i}$ , where $ϵ_{i} \sim N (0, σ^{2})$ and $μ_{i}$ are specified case by case. We generated 1000 data sets and checked the inclusion of the true block structures with the given solution path to measure the probability of the exact pattern recovery. To obtain the confidence intervals of the exact pattern recovery probabilities, we repeat the procedure for estimating the exact pattern recovery 50 times as well.

5.2.1. Two scenarios without stair‐case blocks

First, we considered the two true signal structures $μ^{(1)}$ and $μ^{(2)}$ used in Qian & Jia (2016) for the scenarios $S1$ and $S2$ , respectively. The two true signal vectors for $S1$ and $S2$ are respectively defined as

μ_{i}^{(1)} = \{\begin{array}{rcl} 0 & 1 \leq i \leq 40 \\ 2 & 41 \leq i \leq 80 \\ - 1 & 81 \leq i \leq 120 \\ 3 & 121 \leq i \leq 160 \\ 0 & 161 \leq i \leq 200 \\ 2 & 201 \leq i \leq 240 \\ 0 & 241 \leq i \leq 280 \\ 2 & 281 \leq i \leq 320 \\ - 1 & 321 \leq i \leq 360 \\ 3 & 361 \leq i \leq 400 \\ 0 & 401 \leq i \leq 430 \end{array} and μ_{i}^{(2)} = \{\begin{array}{rcl} 0 & 1 \leq i \leq 15 \\ 2 & 16 \leq i \leq 30 \\ 0 & 31 \leq i \leq 60 \\ 2 & 61 \leq i \leq 120 \\ 0 & 121 \leq i \leq 210 \\ 2 & 211 \leq i \leq 240 \\ 0 & 241 \leq i \leq 255 \\ 2 & 256 \leq i \leq 370 \\ 0 & 371 \leq i \leq 385 \\ 2 & 386 \leq i \leq 400 \\ 0 & 401 \leq i \leq 430 \end{array} .

With these two true signal vectors, the left hand sides (LHS) of the irrepresentable condition of the transformed Lasso are 0.975 and 0.9826 for $μ^{(1)}$ and $μ^{(2)}$ , respectively. These values denote that the irrepresentation conditions hold for two signal vectors because there exists a constant $η$ such that $0 < η \leq 1 - LHS \leq 1$ . Figure 3 depicts the true signals and noisy observations for two scenarios with $σ = 0.25$ . Scenarios $S1$ and $S2$ have ten change points in the true signal. The differences between $S1$ and $S2$ are the lengths of the blocks and signal levels.

True signals and noisy observations for two scenarios $S1$ and $S2$

Figure 4 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for $S1$ and $S2$ . We confirmed that the probabilities of the exact pattern recovery of Path‐FLSA and PCD‐FLSA have the same shapes as in the simulation results in Qian & Jia (2016). Moreover, Figure 4 shows an interesting finding that the estimated probabilities of the exact pattern recovery of Path‐LFLSA and PA‐FLSA are almost similar and greater than those of Path‐FLSA and PCD‐FLSA for $σ \geq 0.25$ . Specifically, for $σ = 0.5$ and case $S2$ , the probabilities of the exact pattern recovery for Path‐LFLSA and PA‐FLSA are estimated as 0.422 and 0.458, respectively, while those of Path‐FLSA and PCD‐FLSA are zero. For $S1$ and $S2$ , we further observe that PA‐FLSA has a larger pattern recovery probability than Path‐LFLSA for relatively large $σ$ levels $(σ \geq 0.35)$ .

Plots of the estimated probabilities of the exact pattern recovery under $σ = 0 . 05,0 . 10, \dots, 0.5$ for $S1$ and $S2$ . Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities

5.2.2. Two scenarios with stair‐case blocks

Second, we consider two true signal structures $μ^{(3)}$ and $μ^{(4)}$ for the scenarios $S3$ and $S4$ , respectively, where $μ^{(3)}$ is from Qian & Jia (2016) and $μ^{(4)}$ is a novel case with multiple stair‐case blocks. The true signal vectors $μ^{(3)}$ and $μ^{(4)}$ for $S3$ and $S4$ are defined as follows:

μ_{i}^{(3)} = \{\begin{array}{rcl} 0 & 1 \leq i \leq 100 \\ - 2 & 101 \leq i \leq 110 \\ - 0.1 & 111 \leq i \leq 210 \\ 2 & 211 \leq i \leq 220 \\ 0.1 & 221 \leq i \leq 320 \\ - 2 & 321 \leq i \leq 330 \\ 0 & 331 \leq i \leq 430 \end{array} and μ_{i}^{(4)} = \{\begin{array}{rcl} - 2.4 & 1 \leq i \leq 50 \\ - 1.8 & 51 \leq i \leq 100 \\ - 1.2 & 101 \leq i \leq 150 \\ - 0.6 & 151 \leq i \leq 200 \\ 0 & 201 \leq i \leq 250 \\ 0.6 & 251 \leq i \leq 300 \\ 1.2 & 301 \leq i \leq 350 \\ 1.8 & 351 \leq i \leq 400 \\ 2.4 & 401 \leq i \leq 430 \end{array} .

As in the cases of $S1$ and $S2$ , we calculated the LHS of the irrepresentable condition of the transformed Lasso for $S3$ and $S4$ . Both the LHS values for $μ^{(3)}$ and $μ^{(4)}$ were calculated as 1.0. These LHS values indicate that the irrepresentation conditions failed for two signal vectors because a constant $η$ such that $0 < η \leq 1 - LHS = 0$ does not exist. Figure 5 depicts the true signals and noisy observations for two scenarios $S3$ and $S4$ with $σ = 0.25$ . Scenarios $S3$ and $S4$ have six and ten change points, respectively.

True signals and noisy observations for two scenarios S3 and S4

Figure 6 depicts the estimated probabilities and their 95% confidence intervals of the exact pattern recovery of Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA for $S3$ and $S4$ . We can compare the characteristics of the four methods when the true signal has the stair‐case blocks. First, Path‐FLSA fails totally in finding the exact pattern (i.e. true block structure) for $S3$ and $S4$ because the estimated probabilities of the exact pattern recovery of Path‐FLSA for $S3$ and $S4$ are zero for all $σ = 0.05, \dots, 0.5$ . This observation supports the inconsistency of the FLSA under the existence of the stair‐case blocks. Second, Path‐LFLSA and PA‐FLSA are more robust to the noise level (i.e. error variance) than PCD‐FLSA. The estimated exact pattern recovery probabilities of Path‐LFLSA and PA‐FLSA decreased slower that of PCD‐FLSA. For example, the exact pattern recovery probability of PCD‐FLSA decreased from 1.000 at $σ = 0.05$ to 0.001 at $σ = 0.15$ while that of Path‐LFLSA and PA‐FLSA decreased from 1.000 and 1.000 at $σ = 0.05$ to 0.549 and 0.595 at $σ = 0.15$ for $S4$ , respectively.

Plots of the estimated probabilities of the exact pattern recovery under $σ = 0 . 05,0 . 10, \dots, 0.5$ for $S3$ and $S4$ . Dashed lines denote the 95% confidence intervals of the exact pattern recovery probabilities

Overall, Path‐LFLSA and PA‐FLSA outperformed Path‐FLSA and PCD‐FLSA concerning the exact pattern recovery probability for all cases from $S1$ to $S4$ . For Path‐LFLSA and PA‐FLSA, PA‐FLSA (Path‐LFLSA) was better than Path‐LFLSA (PA‐FLSA) for S1, S2, and S4 (S3), but the differences were small.

6. APPLICATION TO COVID‐19 INFECTION IN KOREA

In this section, we apply the four methods Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA to the daily‐confirmed cases of COVID‐19 infection in Korea. We downloaded a dataset of COVID‐19 infection status from the Public Data Portal (http://data.go.kr) by using OpenAPI through https://openapi.data.go.kr/openapi/service/rest/Covid19/getCovid19InfStateJson with the authorised key. The downloaded dataset contained 9 variables, including date of state (stateDt), a time of state (stateTime), and the number of cumulative confirmed cases of COVID‐19 (decideCnt). We set a target period as a period from 03‐01‐2020 to 03/31/2022 and used the cases whose times of state were 0:00. To obtain the daily‐confirmed cases of COVID‐19, we applied the first differencing to the number of cumulative confirmed cases, where the difference between the cumulative confirmed cases of COVID‐19 stated at 0:00 on 03/01/2020 and 03/02/2020 as the daily‐confirmed cases occurred on 03/01/2020.

We considered a logarithmic transformation to stabilise the variance of the observation before applying the four methods because the FLSA‐based procedures are sensitive to the noise level, as shown in Section 5. In addition, the number of daily confirmed cases is dramatically increasing from January 2022 in Korea. For example, the number of daily confirmed cases was 621,204 on March 16, 2022, while the number of daily confirmed patients was 3831 on January 1, 2022. Figure 7 depicts the log‐transformed daily‐confirmed cases of the target period with a gray line.

Plots of the logarithmic transformed daily confirmed cases of COVID‐19 (), the estimates of Path‐FLSA (), PCD‐FLSA (), Path‐LFLSA (), and PA‐FLSA () with the EBIC

Inline graphic — Plots of the logarithmic transformed daily confirmed cases of COVID‐19 (), the estimates of Path‐FLSA (), PCD‐FLSA (), Path‐LFLSA (), and PA‐FLSA () with the EBIC

To select the optimal tuning parameter, we used the following EBIC proposed by Chen & Chen (2008), which showed better results for the FLSA as described in Appendix C of the supplementary material,

EBIC (λ_{2}) = n \log (RSS (λ_{2})) + \hat{J} (λ_{2}) \log (n) + \log (\binom{n}{\hat{J} (λ_{2})}),

where $RSS (λ_{2}) = \sum_{i = 1}^{n} {(y_{i} - {\hat{μ}}_{i}^{D B} (λ_{2}))}^{2}$ , ${\hat{μ}}_{i}^{D B} (λ_{2}) = {\bar{y}}_{{\hat{B}}_{j}}$ for $i \in {\hat{B}}_{j} (λ_{2})$ , ${\bar{y}}_{{\hat{B}}_{j}} = | {\hat{B}}_{j} (λ_{2}) |^{- 1} \sum_{i \in {\hat{B}}_{j} (λ_{2})} y_{i}$ , and $\hat{J} (λ_{2})$ is the number of the estimated blocks at $λ_{2}$ .

Figure 7 depicts the estimates by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA. The number of identified change points for Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA were 82, 172, 55, and 39, respectively. As shown in Figure 7 (a), the Path‐FLSA identified many change points in the periods that seems to have trends or stair‐case blocks and also missed several local changes. For example, the Path‐FLSA missed the change points for a local peak from 05/04/2020 to 05/06/2020, while both the Path‐LFLSA and PA‐FLSA identified the change points on 05/03/2020 and 05/06/2020. Figure 7 (b) shows that the PCD‐FLSA estimates seem to have many false‐positive change points and also identify much more change points compared with the others. As shown in the case of $S4$ having the stair‐case blocks, the pattern recovery probabilities of the Path‐FLSA and PCD‐FLSA decrease very quickly when the noise level increases. With the residual of the fitted models, the estimates of the error standard deviation by Path‐FLSA, PCD‐FLSA, Path‐LFLSA, and PA‐FLSA are 0.24, 0.14, 0.18, and 0.20, respectively. Thus, this observation is also consistent with the comparison result of the pattern recovery probability for $S4$ . From the above observations, we focused on the comparison of the identified change points by the Path‐LFLSA and PA‐FLSA. We also depict the histograms and Q‐Q plots of the residuals of the four methods in Appendix D of the supplementary material. From the figures in Appendix D of the supplementary material, the residuals of the four methods are not exactly following the normal distribution but seem to follow the normal distribution within the interval $[- 1 . 5,1 . 5]$ . It is also worth noting that the FLSA tends to find many change points when the underlying signal has a trend because the FLSA model is adequate for the piecewise constant mean model. For identifying the trend‐change points, we refer to the $ℓ_{1}$ trend filtering method proposed by Kim et al. (2009), which is more suitable to identify sparse trend‐change points.

Table 2 reports the identified change points by Path‐LFLSA and PA‐FLSA with the corresponding indices, dates, and debiased estimates. The gray rows in Table 2 denote the identified change points by either the Path‐LFLSA or the PA‐FLSA only. There are 32 change points commonly identified by the Path‐LFLSA and PA‐FLSA, and there are also 23 (7) change points identified by the Path‐LFLSA (PA‐FLSA) only. Among the seven change points identified by the PA‐FLSA only, the six change points at 03/07/2020, 08/12/2020, 09/11/2020, 10/24/2020, 01/01/2021 and 11/01/2021 are closed to the six change points identified by the Path‐LFLSA only, where most of the differences are one or two days. The change point by the PA‐FLSA on 01/31/2022 seems to be an intermediate point within the period having a trend. For the change points by the Path‐LFLSA only except for the six change points corresponding to the change points by the PA‐FLSA only, most of the identified change points catch the local peaks.

TABLE 2.

The estimated change points of Path‐LFLSA and PA‐FLSA

Path‐LFLSA

PA‐LFLSA

Path‐LFLSA

PA‐LFLSA

Index

Date

{\hat{μ}}^{D B}

Index

Date

{\hat{μ}}^{D B}

Index

Date

{\hat{μ}}^{D B}

Index

Date

{\hat{μ}}^{D B}

03/06/2020

535.34

306

12/31/2020

992.99

03/07/2020

507.43

307

01/01/2021

984.45

03/10/2020

232.84

03/10/2020

199.89

315

01/09/2021

757.46

315

01/09/2021

749.87

04/04/2020

97.92

04/04/2020

97.92

322

01/16/2021

526.01

322

01/16/2021

526.01

04/08/2020

47.23

04/08/2020

47.23

351

02/14/2021

393.59

04/17/2020

26.23

04/17/2020

26.23

355

02/18/2021

561.75

05/03/2020

9.89

05/03/2020

9.89

372

03/07/2021

392.68

05/06/2020

3.91

05/06/2020

3.91

401

04/05/2021

460.51

401

04/05/2021

424.31

05/08/2020

15.72

420

04/24/2021

663.27

05/14/2020

30.47

468

06/11/2021

589.20

05/25/2020

19.05

05/25/2020

21.65

478

06/21/2021

444.14

112

06/20/2020

45.73

492

07/05/2021

683.50

492

07/05/2021

599.00

117

06/25/2020

34.80

520

08/02/2021

1437.73

520

08/02/2021

1437.73

138

07/16/2020

51.02

571

09/22/2021

1760.66

141

07/19/2020

33.56

579

09/30/2021

2617.28

147

07/25/2020

60.68

147

07/25/2020

47.02

587

10/08/2021

2001.65

163

08/10/2020

31.85

604

10/25/2021

1422.00

165

08/12/2020

33.91

611

11/01/2021

1784.54

167

08/14/2020

85.90

167

08/14/2020

131.79

625

11/15/2021

2134.56

625

11/15/2021

2242.17

185

09/01/2020

296.16

185

09/01/2020

296.16

639

11/29/2021

3372.36

639

11/29/2021

3372.36

194

09/10/2020

162.48

646

12/06/2021

5002.52

646

12/06/2021

5002.52

195

09/11/2020

159.73

665

12/25/2021

6532.64

665

12/25/2021

6532.64

202

09/18/2020

121.88

202

09/18/2020

119.86

688

01/17/2022

978.03

688

01/17/2022

3978.03

210

09/26/2020

87.69

695

01/24/2022

7080.12

695

01/24/2022

7080.12

212

09/28/2020

44.60

702

01/31/2022

16194.46

234

10/20/2020

77.29

705

02/03/2022

18074.79

705

02/03/2022

23355.52

238

10/24/2020

79.16

709

02/07/2022

36739.06

709

02/07/2022

36739.06

256

11/11/2020

112.47

256

11/11/2020

115.98

716

02/14/2022

54343.44

716

02/14/2022

54343.44

261

11/16/2020

211.76

261

11/16/2020

211.76

723

02/21/2022

99131.02

723

02/21/2022

99131.02

269

11/24/2020

333.21

269

11/24/2020

333.21

730

02/28/2022

158836.04

730

02/28/2022

158836.04

285

12/10/2020

564.42

285

12/10/2020

564.42

737

03/07/2022

226706.55

737

03/07/2022

226706.55

Open in a new tab

Note: The rows in gray denote the change points found by only one method.

On the other hand, interestingly, most of the commonly identified change points correspond to the period of the social distancing announced by the Ministry of Health and Welfare of Korea. For example, April 4, 2020, corresponds to the end of the intensive social distancing period from March 22, 2020, to April 5, 2020. In addition, the time periods between change points are similar to the periods of the social distancing policy by the Ministry of Health and Welfare of Korea, where the policy periods are usually two or three weeks. Finally, we should address that the sequence of the daily confirmed cases is observed through time, and it seems to have time‐dependent trends. Thus, the main assumptions of the FLSA for the model selection consistency is hardly satisfied, and then the FLSA‐based methods do not guarantee the model selection consistency. For example, among the commonly identified change points, the change points within four periods (03/10/2020 ∼ 05/06/2020, 11/16/2020 ∼ 12/10/2020, 11/15/2021∼ 12/25/2025, 02/03/2022 ∼ 03/07/2022) seem to be intermediate points within the periods having either an increasing or a decreasing trend.

However, this COVID‐19 spread example addresses that the Path‐LFLSA and PA‐FLSA are still applicable to find the main change points caused by an external event in a stable period (i.e. a period with a low trend effect). For example, both Path‐LFLSA and PA‐FLSA succeeded in finding the beginning day of the second wave of the COVID‐19 pandemic in Korea on August 15, 2020, when a mass rally was held near Gwanghwamun Square in Seoul, where the identified change point on August 14, 2020, in Table 2 denotes that the underlying mean value was changed on August 15, 2020. In addition, the identified change point on November 11, 2020, is close to the beginning day (11/4/2020) of the third wave reported in Seong et al. (2021) and the identified change point at 07/05/2021 is also close to the beginning day (7/7/2021) of the fourth wave 1. Recently, the number of daily confirmed cases has been rapidly increasing, and the fifth wave had was begun on 01/26/2022 2. Both Path‐LFLSA and PA‐FLSA found the change points on 01/24/2022, which is close to the beginning day of the fifth wave. The identified change points related to the second to the fifth waves are highlighted in red in Table 2.

7. CONCLUSION

In this study, we provide a new interpretation of the modified path algorithm for the FLSA by discovering the exact optimisation problems corresponding to the modified path algorithm, called Path‐LFLSA. Our discovery demonstrates that the modified path algorithm's hitting times are not monotonically increasing, and the violation of the monotone increasing property for the next hitting time can be verified by comparing the solution from the previous hitting time. We propose a pathwise adaptive FLSA with a weighted fusion penalty to recover the monotonicity of the hitting times. The comprehensive numerical study illustrates the whole solution paths of the four methods, including three existing ones and the proposed PA‐FLSA, and it also shows that the Path‐LFLSA and PA‐FLSA are less sensitive to noise levels for pattern recovery than the Path‐FLSA and PCD‐FLSA. Furthermore, our numerical study in Appendix C of the supplementary material provides a practical guideline for choosing the optimal tuning parameters of the Path‐LFLSA and PA‐FLSA that outperform Path‐FLSA and PCD‐FLSA to identify the true block structures and estimate the true signal levels. The application of Path‐LFLSA and PA‐FLSA with the optimal tuning parameter selection by EBIC to the number of daily‐confirmed cases of COVID‐19 infection found the change points related to the beginning days of the COVID‐19 pandemic waves from the second to the fifth in Korea.

Supporting information

insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf

Click here for additional data file.^{(276.2KB, pdf)}

ACKNOWLEDGEMENTS

W. Son's research is supported by the National Research Foundation of Korea (No. 2020R1F1A1A01051039), J. Lim's research is supported by the National Research Foundation of Korea (NRF‐2021R1A2C1010786), and D. Yu's research is supported by the National Research Foundation of Korea (NRF‐2022R1A5A7033499) and Inha University Research Grant.

Son, W. , Lim, J. , and Yu, D. (2022) Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea. International Statistical Review, 10.1111/insr.12521.

Footnotes

Korea officially in COVID‐19 fourth wave, an article in Korea Herald available at https://www.koreaherald.com/view.php?ud%3D20210707000868

Daily COVID‐19 Cases Exceed 13,000, 5th Wave Beginning, an article in KBS World available at https://world.kbs.co.kr/service/news_view.htm?lang%3De&Seq_Code%3D167226

REFERENCES

Bertsekas, D. (1999). Nonlinear Programming. Athena Scientific.
Bradely Efron, B. , Hastie, T. , Johnstone, I. & Tibshirani, R. (2004). Least angle regression. Ann. Stat., 32, 407–499. [Google Scholar]
Chen, J & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. [Google Scholar]
Friedman, J. , Hastie, T. , Hoefling, H. & Tibshirani, R. (2007). Pathwise coordinate optimization. Ann. Appl. Stat., 1, 302–332. [Google Scholar]
Fryzlewicz, P. (2014). Wild binary segmentation for multiple change‐point detection. Ann. Stat., 42(6), 2243–2281. [Google Scholar]
Hoefling, H. (2010). A path algorithm for the fused Lasso signal approximator. J. Comput. Graph. Stat., 19, 984–1006. [Google Scholar]
Jia, J. & Rohe, K. (2015). Preconditioning the lasso for sign consistency. Electr. J. Stat., 9, 1150–1172. [Google Scholar]
Kim, S.J. , Koh, K. , Boyd, S. & Gorinevsky, D. (2009). $ℓ_{1}$ trend filtering. SIAM Review, 51(2), 339–360. [Google Scholar]
Lin, K. , Sharpnack, J. , Rinaldo, A. & Tibshirani, R.J. 2016. Approximate recovery in changepoint problems, from $ℓ_{2},$ estimation error rates. arXiv preprint, arXiv:1606.06746.
Olshen, A.B. , Venkatraman, E. , Lucito, R. & Wigler, M. (2004). Circular binary segmentation for the analysis of array‐based dna copy number data. Biostatistics, 5(4), 557–572. [DOI] [PubMed] [Google Scholar]
Qian, J. & Jia, J. (2016). On stepwise pattern recovery of the fused Lasso. Comput. Stat. Data Anal., 94, 221–237. [Google Scholar]
Rinaldo, A. (2009). Properties and refinements of the fused lasso. Ann. Stat., 37, 2922–2952. [Google Scholar]
Rinaldo, A. 2014. Corrections to properties and refinements of the fused lasso. available at https://www.stat.cmu.edu/∼arinaldo/Fused_Correction.png
Schwarz, G. (1978). Estimating the dimension of a model. Ann. Stat., 6, 461–464. [Google Scholar]
Seong, H. , Hyun, H.J. , Yun, J.G. , Noh, J.Y. , Cheong, H.J. , Kim, W.J. & Song, J.Y. (2021). Comparison of the second and third waves of the COVID‐19 pandemic in South Korea: Importance of early public health intervention. Int. J. Infectious Disease, 104, 742–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
Son, W. & Lim, J. (2019). Modified path algorithm of fused Lasso signal approximator for consistent recovery of change points. J. Stat. Plan. Inference, 200, 223–238. [Google Scholar]
Tibshirani, R. , Saunders, M. , Rosset, S. , Zhu, J. & Knight, K. (2005). Sparsity and smoothness via the fused lasso. J. R. Stat. Soc.: Ser. B (Stat. Methodol.), 67, 91–108. [Google Scholar]
Tibshirani, R.J. & Taylor, J. (2011). The solution path of the generalized lasso. Ann. Stat., 39(3), 1335–1371. [Google Scholar]
Yao, Y.C. (1988). Estimating the number of change‐points via Schwarz' criterion. Stat. Probab. Lett., 6(3), 181–189. [Google Scholar]
Yao, Y.C. & Au, S.T. (1989). Least‐squares estimation of a step function. Sankhyā Indian J. Stat. Ser. A, 51(3), 370–381. [Google Scholar]
Zhao, P. & Yu, B. (2006). On model selection consistency of lasso. J. Mach. Learn. Res., 7, 2541. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

insr12521‐sup‐0001‐Supp_PATH_FLSA_rev_v1.pdf

Click here for additional data file.^{(276.2KB, pdf)}

[insr12521-bib-0001] Bertsekas, D. (1999). Nonlinear Programming. Athena Scientific.

[insr12521-bib-0002] Bradely Efron, B. , Hastie, T. , Johnstone, I. & Tibshirani, R. (2004). Least angle regression. Ann. Stat., 32, 407–499. [Google Scholar]

[insr12521-bib-0003] Chen, J & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. [Google Scholar]

[insr12521-bib-0004] Friedman, J. , Hastie, T. , Hoefling, H. & Tibshirani, R. (2007). Pathwise coordinate optimization. Ann. Appl. Stat., 1, 302–332. [Google Scholar]

[insr12521-bib-0005] Fryzlewicz, P. (2014). Wild binary segmentation for multiple change‐point detection. Ann. Stat., 42(6), 2243–2281. [Google Scholar]

[insr12521-bib-0006] Hoefling, H. (2010). A path algorithm for the fused Lasso signal approximator. J. Comput. Graph. Stat., 19, 984–1006. [Google Scholar]

[insr12521-bib-0007] Jia, J. & Rohe, K. (2015). Preconditioning the lasso for sign consistency. Electr. J. Stat., 9, 1150–1172. [Google Scholar]

[insr12521-bib-0008] Kim, S.J. , Koh, K. , Boyd, S. & Gorinevsky, D. (2009). $ℓ_{1}$ trend filtering. SIAM Review, 51(2), 339–360. [Google Scholar]

[insr12521-bib-0009] Lin, K. , Sharpnack, J. , Rinaldo, A. & Tibshirani, R.J. 2016. Approximate recovery in changepoint problems, from $ℓ_{2},$ estimation error rates. arXiv preprint, arXiv:1606.06746.

[insr12521-bib-0010] Olshen, A.B. , Venkatraman, E. , Lucito, R. & Wigler, M. (2004). Circular binary segmentation for the analysis of array‐based dna copy number data. Biostatistics, 5(4), 557–572. [DOI] [PubMed] [Google Scholar]

[insr12521-bib-0011] Qian, J. & Jia, J. (2016). On stepwise pattern recovery of the fused Lasso. Comput. Stat. Data Anal., 94, 221–237. [Google Scholar]

[insr12521-bib-0012] Rinaldo, A. (2009). Properties and refinements of the fused lasso. Ann. Stat., 37, 2922–2952. [Google Scholar]

[insr12521-bib-0013] Rinaldo, A. 2014. Corrections to properties and refinements of the fused lasso. available at https://www.stat.cmu.edu/∼arinaldo/Fused_Correction.png

[insr12521-bib-0014] Schwarz, G. (1978). Estimating the dimension of a model. Ann. Stat., 6, 461–464. [Google Scholar]

[insr12521-bib-0015] Seong, H. , Hyun, H.J. , Yun, J.G. , Noh, J.Y. , Cheong, H.J. , Kim, W.J. & Song, J.Y. (2021). Comparison of the second and third waves of the COVID‐19 pandemic in South Korea: Importance of early public health intervention. Int. J. Infectious Disease, 104, 742–745. [DOI] [PMC free article] [PubMed] [Google Scholar]

[insr12521-bib-0016] Son, W. & Lim, J. (2019). Modified path algorithm of fused Lasso signal approximator for consistent recovery of change points. J. Stat. Plan. Inference, 200, 223–238. [Google Scholar]

[insr12521-bib-0017] Tibshirani, R. , Saunders, M. , Rosset, S. , Zhu, J. & Knight, K. (2005). Sparsity and smoothness via the fused lasso. J. R. Stat. Soc.: Ser. B (Stat. Methodol.), 67, 91–108. [Google Scholar]

[insr12521-bib-0018] Tibshirani, R.J. & Taylor, J. (2011). The solution path of the generalized lasso. Ann. Stat., 39(3), 1335–1371. [Google Scholar]

[insr12521-bib-0019] Yao, Y.C. (1988). Estimating the number of change‐points via Schwarz' criterion. Stat. Probab. Lett., 6(3), 181–189. [Google Scholar]

[insr12521-bib-0020] Yao, Y.C. & Au, S.T. (1989). Least‐squares estimation of a step function. Sankhyā Indian J. Stat. Ser. A, 51(3), 370–381. [Google Scholar]

[insr12521-bib-0021] Zhao, P. & Yu, B. (2006). On model selection consistency of lasso. J. Mach. Learn. Res., 7, 2541. [Google Scholar]

PERMALINK

Path algorithms for fused lasso signal approximator with application to COVID‐19 spread in Korea

Won Son

Johan Lim

Donghyeon Yu

Summary

1. INTRODUCTION

2. EXISTING SOLUTION PATH ALGORITHMS FOR FLSA

2.1. Path Algorithm for the FLSA (Path‐FLSA)

2.2. Preconditioned FLSA with Puffer Transformation (PCD‐FLSA)

2.3. Modified Path Algorithm for FLSA (mPath‐FLSA)

3. NEW INTERPRETATION OF MODIFIED PATH ALGORITHM

Theorem 1

Theorem 2

Corollary 1

4. PATHWISE ADAPTIVE FLSA

Lemma 1

Proposition 1

Theorem 3

5. NUMERICAL STUDY

5.1. Comparison of Whole Solution Paths

FIGURE 1.

TABLE 1.

FIGURE 2.

5.2. Comparison of Exact Pattern Recovery Probabilities

5.2.1. Two scenarios without stair‐case blocks

FIGURE 3.

FIGURE 4.

5.2.2. Two scenarios with stair‐case blocks

FIGURE 5.

FIGURE 6.

6. APPLICATION TO COVID‐19 INFECTION IN KOREA

FIGURE 7.

TABLE 2.

7. CONCLUSION

Supporting information

ACKNOWLEDGEMENTS

Footnotes

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases