Convex recovery of continuous domain piecewise constant images from nonuniform Fourier samples

Greg Ongie; Sampurna Biswas; Mathews Jacob

doi:10.1109/TSP.2017.2750111

Author manuscript; available in PMC: 2019 Jan 1.

Published in final edited form as: IEEE Trans Signal Process. 2017 Sep 7;66(1):236–250. doi: 10.1109/TSP.2017.2750111


PMCID: PMC6101269 NIHMSID: NIHMS942074 PMID: 30140146

Abstract

We consider the recovery of a continuous domain piecewise constant image from its non-uniform Fourier samples using a convex matrix completion algorithm. We assume the discontinuities/edges of the image are localized to the zero level-set of a bandlimited function. This assumption induces linear dependencies between the Fourier coefficients of the image, which results in a two-fold block Toeplitz matrix constructed from the Fourier coefficients being low-rank. The proposed algorithm reformulates the recovery of the unknown Fourier coefficients as a structured low-rank matrix completion problem, where the nuclear norm of the matrix is minimized subject to structure and data constraints. We show that exact recovery is possible with high probability when the edge set of the image satisfies an incoherency property. We also show that the incoherency property is dependent on the geometry of the edge set curve, implying higher sampling burden for smaller curves. This paper generalizes recent work on the super-resolution recovery of isolated Diracs or signals with finite rate of innovation to the recovery of piecewise constant images.

Index Terms: Off-the-Grid Image Recovery, Structured Low-Rank Matrix Completion, Finite Rate of Innovation

I. Introduction

The direct recovery of continuous domain signals by convex optimization is emerging as a powerful alternative to traditional discrete domain compressed sensing [1]–[3]. The ability of these continuous domain “off-the-grid” schemes to minimize discretization errors makes them attractive in practical applications, where only the low-pass measurements of the signal are available. The history of such continuous domain signal recovery algorithms dates back to Prony [4], where the recovery of a linear combination of exponentials from uniform samples is considered. Prony-like algorithms recover the signal by estimating an annihilating polynomial whose zeros correspond to the frequencies of the exponentials. Work by Liang et al. [5], [6] and the finite rate of innovation (FRI) framework [7] extended Prony-like methods to recover more general signals that reduce to a sparse linear combination of Dirac delta functions under an appropriate transformation (e.g., differential operators, convolution). Recently, several authors have further extended FRI methods to recover such signals from their non-uniform Fourier samples [3], [8]–[11] by exploiting the low-rank structure of an enhanced matrix built from Fourier data (e.g., a Hankel matrix in 1-D). Recovery guarantees exist for certain classes of these signals when the singularities are isolated and well-separated [2], [3], [12].

The signal models discussed above have limited flexibility in exploiting the extensive additional structure present in multi-dimensional imaging problems. In particular, the edges in multi-dimensional images are connected and can be modeled as smooth curves or surfaces. While discrete image representations to capture this structure have been the subject of extensive research [13], [14], similar continuous domain representations have attracted less attention. We recently introduced a novel framework to recover piecewise polynomial images, whose edges are localized to smooth curves, from their uniform [15], [16] and non-uniform [11] Fourier samples; our framework generalizes a recent extension of FRI models to curves [17]. We assume that the partial derivatives of the signal vanish outside the zero level-set of a bandlimited function, which is only true for piecewise smooth signals. This relation translates to a linear system of convolution equations involving the uniform Fourier samples of the partial derivatives, which can be compactly represented as the multiplication of a specific structured matrix with the Fourier coefficients of the bandlimited function. We have introduced theoretical guarantees for the recovery of such images from uniform samples [15], [16]. Our earlier work has shown that the structured matrix built from the Fourier coefficients of piecewise constant images is low-rank [11], [16], which we used to recover the image from its non-uniform Fourier samples with good performance in practical applications. We have also introduced a computationally efficient algorithm termed GIRAF, which works on the original signal samples rather than the structured high-dimensional matrix [18], [19]; the computational complexity of this algorithm is comparable to discrete total variation regularization, which makes this scheme readily applicable to large-scale imaging problems, such as undersampled dynamic magnetic resonance image reconstruction [20].

The main focus of the present paper is to introduce theoretical guarantees on the recovery of continuous domain piecewise constant images from non-uniform Fourier samples via a convex structured low-rank matrix completion algorithm. Our main result shows that the number of non-uniform samples sufficient to recover the image is proportional to the complexity of the edge set, as measured by the bandwidth of the edge set function, and to an incoherence measure related to the edge set geometry. We additionally show that the recovery is robust to noise and model-mismatch.

The proof of the main result builds on [3], which proved similar recovery guarantees for the recovery of multi-dimensional isolated Diracs from non-uniform Fourier samples by minimizing the nuclear norm of an “enhanced” multi-level Hankel matrix. That work showed that the number of samples necessary for recovery depends on the number of Diracs and on an incoherence measure of the signal that can be defined solely in terms of the relative locations of the Diracs. However, the theory in [3] relies heavily on an explicit factorization of the enhanced matrix (e.g., Vandermonde factorization of a Hankel matrix in the 1-D case), which is only available when the singularities are isolated and finite in number. Since the singularities in the proposed class of piecewise constant images (i.e., the image edges) are neither isolated nor finite, the recovery guarantees in [3] cannot be directly extended to our setting. Instead, to achieve our result, we give a new characterization of the row and column spaces of the structured matrix arising in our setting. We show this new characterization allows us to derive an incoherence measure based solely on geometric properties of the edge set. In particular, we derive an upper bound for the incoherence measure that is related to the size of the edge set curve. The results show that a high sampling burden is associated with the estimation of images with smaller piecewise constant regions, which is consistent with intuition.

We note that the signal models in [1]–[3] do not include the class of piecewise constant images considered in this work. In particular, all of the above models assume the discontinuities to be finite in number and well separated, unlike in our setting. Recently, [12] adapted the results in [3] to introduce recovery guarantees for Fourier interpolation of a variety of finite-rate-of-innovation signal models [7], including piecewise constant functions. However, these results are limited to the 1-D setting and share the assumption that the discontinuities/innovations of the signal are finite and isolated. Furthermore, the structured matrix lifting considered in this work is different from those considered in [3] and [12]. Specifically, the structured matrix lifting in this work consists of two vertically concatenated multi-level Toeplitz matrices (i.e., block Toeplitz with Toeplitz blocks), whose entries are built from the weighted Fourier coefficients of the images. This is substantially different from the structured matrix liftings considered in [3] (unweighted, single-block, multi-level Hankel) and [12] (weighted, single-block, single-level Hankel). Finally, we note that a preliminary version of the results presented in this paper has been published previously in the conference paper [21] without proofs. The present work includes considerably more details and proofs, and major improvements to the main theorem.

A. Notation

Bold lower-case letters x are used to indicate vector quantities, bold upper-case X to denote matrices, and calligraphic script 𝒳 for general linear operators. We typically reserve lower-case greek letters μ, γ, etc. for trigonometric polynomials (3) and upper-case greek letters Λ,Ω, etc. for their coefficient index sets, i.e. finite subsets of the integer lattice ℤ², with cardinality denoted by |Λ|. We write Λ+Ω for the dilation of the index set Ω by Λ, i.e. the Minkowski sum {k + ℓ: k ∈ Λ, ℓ ∈ Ω}, and write 2Λ to mean Λ + Λ, 3Λ = 2Λ + Λ, etc. We also denote the contraction of Ω by Λ by Ω:Λ = {ℓ ∈ Ω: ℓ − k ∈ Ω for all k ∈ Λ}.
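The index-set operations above can be made concrete with a small sketch. The helper names `rect`, `minkowski_sum`, and `contraction` are illustrative choices rather than notation from the paper, and index sets are assumed to be stored as Python sets of integer pairs.

```python
from itertools import product

def rect(height, width):
    """Centered rectangular index set in Z^2 (odd side lengths assumed)."""
    return {(ky, kx)
            for ky, kx in product(range(-(height // 2), height // 2 + 1),
                                  range(-(width // 2), width // 2 + 1))}

def minkowski_sum(lam, omega):
    """Dilation Lambda + Omega = {k + l : k in Lambda, l in Omega}."""
    return {(k[0] + l[0], k[1] + l[1]) for k in lam for l in omega}

def contraction(omega, lam):
    """Omega : Lambda = {l in Omega : l - k in Omega for all k in Lambda}."""
    return {l for l in omega
            if all((l[0] - k[0], l[1] - k[1]) in omega for k in lam)}

lam0 = rect(3, 3)
two_lam0 = minkowski_sum(lam0, lam0)        # 2*Lambda_0, a 5 x 5 square
shifts = contraction(rect(7, 7), lam0)      # shifts of a 3 x 3 set inside 7 x 7
```

For centered rectangles, dilation adds side lengths (minus one) and contraction undoes it, so `contraction(rect(7, 7), rect(3, 3))` is again the 5 x 5 square.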

II. Background

A. 2-D Piecewise Constant Images with Bandlimited Edges

In this work we consider a continuous domain piecewise constant model for images,

f (r) = \sum_{i = 1}^{N} a_{i} 1_{U_{i}} (r), for all r = (x, y) \in {[0, 1]}^{2},

(1)

where a_i ∈ ℂ, 1_U denotes the characteristic function of the set U, and each U_i ⊂ [0, 1]² is a simply connected region with piecewise smooth boundary ∂U_i. We study the recovery of such an image from a sampling of its Fourier coefficients f̂ specified by

\hat{f} [k] = \int_{{[0, 1]}^{2}} f (r) e^{- j 2 π k \cdot r} d r, k \in Ω \subset ℤ^{2} .

(2)

Following [16], we further assume that the edge set of the piecewise constant image, specified by E := ∪_i∂U_i, coincides with the zero set of a 2-D bandlimited function:

E = {r \in {[0, 1]}^{2} : μ (r) = 0}, with μ (r) = \sum_{k \in Λ} c [k] e^{j 2 π k \cdot r},

(3)

where the coefficients c[k] ∈ ℂ, and Λ is a finite subset of ℤ². We call any function μ in the form (3) a trigonometric polynomial, and we say μ is bandlimited to Λ, i.e., the Fourier coefficients μ̂ are supported within Λ. For short, we will write {μ = 0} for the zero set of μ considered as a subset of [0, 1]².

Define the degree of a trigonometric polynomial μ, denoted by deg(μ) = (K,L), to be the linear dimensions of the smallest rectangle containing the support set {k : μ̂ [k] ≠ 0}. In [16] we proved that for every curve E given by the zero set of a trigonometric polynomial, there exists a unique minimal degree trigonometric polynomial¹ μ₀ such that E = {μ₀ = 0} and if μ is any other trigonometric polynomial with {μ₀ = 0} ⊂ {μ = 0}, then deg(μ₀) ≤ deg(μ) entrywise. By extension, we define the degree of a curve E to be equal to the degree of its minimal degree polynomial μ₀. We also say the curve E is bandlimited to Λ₀ ⊂ ℤ², where Λ₀ is the minimal rectangular index set containing the support of μ̂₀. Intuitively, the degree/bandwidth of a curve gives a quantitative measure of its complexity. For example, in [16] we show the number of connected components of a curve is bounded in terms of its degree.

B. Recovery from uniform Fourier samples

We have shown in [16] that when μ is any trigonometric polynomial that vanishes on the edge set of the piecewise constant image f, the gradient ∇f = (∂_xf, ∂_yf) satisfies the property

μ \nabla f = 0,

(4)

where equality in (4) is understood in the sense of distributions (see, e.g., [22]). The spatial domain annihilation relation (4) translates directly to the following convolution annihilation relation in Fourier domain:

\sum_{k \in Λ} \hat{\partial f} [ℓ - k] \hat{μ} [k] = 0, \forall ℓ \in ℤ^{2} .

(5)

Here $\hat{\partial f} [k] = j 2 π (k_{x} \hat{f} [k], k_{y} \hat{f} [k])$ for k = (k_x, k_y). Note the equations in (5) are linear with respect to the coefficients μ̂.

Suppose we have access to samples of the Fourier coefficients f̂ on a finite rectangular grid Γ ⊂ ℤ², and suppose μ is bandlimited to Λ₁ ⊂ Γ. Then we can build the system of equations in (5) for all ℓ belonging to the index set Λ₂ ⊂ Γ, where Λ₂ is the set of all integer shifts of Λ₁ contained in Γ. In this case (5) can be compactly represented in matrix form as

T (\hat{f}) h = [\begin{matrix} T_{x} (\hat{f}) \\ T_{y} (\hat{f}) \end{matrix}] h = 0,

(6)

where 𝒯_x(f̂), 𝒯_y(f̂) ∈ ℂ^{|Λ₂|×|Λ₁|} are matrices corresponding to the discrete 2-D convolution with the arrays k_x f̂[k_x, k_y] and k_y f̂[k_x, k_y] for (k_x, k_y) ∈ Γ, respectively (after omitting the inconsequential factor j2π). Here we use h to denote the vectorized version of the filter (μ̂ [k] : k ∈ Λ₁), where the index set Λ₁ is called the filter support. The matrices 𝒯_x(f̂) and 𝒯_y(f̂) have a block Toeplitz with Toeplitz blocks structure. See Figure 2 for an illustration of the construction of 𝒯 (f̂).

Fig. 2 — Construction of the structured matrix lifting 𝒯 (f̂) considered in this work. From a rectangular array of the Fourier coefficients f̂ [k_x, k_y] of a continuous domain image f(x, y), the weighted arrays k_x f̂ [k_x, k_y] and k_y f̂ [k_x, k_y] are constructed. The matrices 𝒯_x(f̂) and 𝒯_y(f̂) are then obtained by extracting all vectorized patches from the weighted arrays, and loading these into the rows of 𝒯_x(f̂) and 𝒯_y(f̂). The resulting matrices 𝒯_x(f̂) and 𝒯_y(f̂) have a block Toeplitz with Toeplitz blocks structure. Finally 𝒯 (f̂) is formed by vertically concatenating the blocks 𝒯_x(f̂) and 𝒯_y(f̂).
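The lifting and the annihilation relation (6) can be checked numerically with a minimal NumPy sketch. The function names `lift` and `strip_coefficients` are hypothetical, and the test image is an assumption made for convenience: the indicator of the vertical strip {a < x < b}, whose Fourier coefficients have a closed form and whose edge set is the zero set of μ(x, y) = (e^{j2πx} − e^{j2πa})(e^{j2πx} − e^{j2πb}). The filter support is taken as a shifted 3 × 3 square, which only relabels the rows of the lifted matrix.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view

def lift(fhat, filt_shape):
    """Stack T_x(fhat) over T_y(fhat); fhat is on a centered grid (rows k_y, cols k_x)."""
    ny, nx = fhat.shape
    ky, kx = np.meshgrid(np.arange(ny) - ny // 2,
                         np.arange(nx) - nx // 2, indexing="ij")
    blocks = []
    for weight in (kx, ky):
        patches = sliding_window_view(weight * fhat, filt_shape)
        # Flip each patch so that (row . h) is a "valid" 2-D convolution sample.
        patches = patches[:, :, ::-1, ::-1]
        blocks.append(patches.reshape(-1, filt_shape[0] * filt_shape[1]))
    return np.vstack(blocks)

def strip_coefficients(a, b, n):
    """Fourier coefficients, on an n x n centered grid, of the indicator of {a < x < b}."""
    k = np.arange(n) - n // 2
    row = np.empty(n, dtype=complex)
    nz = k != 0
    row[nz] = (np.exp(-2j * np.pi * k[nz] * a)
               - np.exp(-2j * np.pi * k[nz] * b)) / (2j * np.pi * k[nz])
    row[~nz] = b - a
    fhat = np.zeros((n, n), dtype=complex)
    fhat[n // 2, :] = row          # the image is constant in y, so only k_y = 0 survives
    return fhat

a, b, n = 0.3, 0.7, 9
T = lift(strip_coefficients(a, b, n), (3, 3))

# Coefficients of mu for k_x = 0, 1, 2 (k_y = 0), placed in the middle filter row.
za, zb = np.exp(2j * np.pi * a), np.exp(2j * np.pi * b)
mu_hat = np.zeros((3, 3), dtype=complex)
mu_hat[1, :] = [za * zb, -(za + zb), 1.0]
h = mu_hat.ravel()

residual = np.linalg.norm(T @ h)   # should vanish up to rounding error
```

Because the filter here is larger than the support of μ in the k_y direction, shifted copies of `mu_hat` also lie in the nullspace, so the lifted matrix is rank deficient by more than one.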

Equation (6) shows that 𝒯 (f̂) is rank deficient, since it has the non-trivial vector h in its nullspace. In addition, when the filter support Λ₁ defining 𝒯 (f̂) is sufficiently big, we can also show 𝒯 (f̂) is low-rank. This is because if μ₀ is the minimal degree polynomial for the edge set, then any multiple μ = γ · μ₀ bandlimited to Λ₁ will satisfy the annihilation equation (4). In Fourier domain, this means the vector

h = (({\hat{μ}}_{0} * \hat{γ}) [k] : k \in Λ_{1})

(7)

is in the nullspace of 𝒯 (f̂). Hence if the filter support Λ₁ is larger than the support Λ₀ of μ₀, 𝒯 (f̂) has a large nullspace and is low-rank. The following result from [16] gives an exact characterization of the rank of 𝒯 (f̂), which will be important for this work:

Theorem 1

[16] Suppose f is a piecewise constant image (1) whose edge set E = {μ₀ = 0} is the zero set of a trigonometric polynomial μ₀ bandlimited to Λ₀. Let 𝒯 (f̂) be built with filter size Λ₁ ⊇ Λ₀, then

rank T (\hat{f}) \leq ∣ Λ_{1} ∣ - ∣ Λ_{1} : Λ_{0} ∣

(8)

where |Λ₁| is the number of indices in Λ₁ and |Λ₁: Λ₀| is the number of integer shifts of Λ₀ contained in Λ₁. Moreover, equality holds in (8) if Γ ⊇ 2Λ₁+Λ₀ and if the edge set does not contain any singular points. In this case, the nullspace of 𝒯 (f̂) consists of all vectors in the form (7).

Note that R := |Λ₁| − |Λ₁: Λ₀| is a measure of the bandwidth of μ₀ and hence is indicative of the complexity of the edge set curve E = {μ₀ = 0}. In the remainder of this work we assume the conditions in Theorem 1 that guarantee the equality rank 𝒯 (f̂) = R holds, in particular Γ ⊇ 2Λ₁+Λ₀.
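As a worked instance of this rank formula, assume rectangular supports, with Λ₀ of size K₀ × L₀ and Λ₁ of size K₁ × L₁ (K₁ ≥ K₀, L₁ ≥ L₀). Counting the integer shifts of Λ₀ that fit inside Λ₁ gives the closed form below; the 3 × 3 and 7 × 7 sizes are the ones quoted in the caption of Fig. 3.

```latex
\begin{align*}
|\Lambda_1 : \Lambda_0| &= (K_1 - K_0 + 1)(L_1 - L_0 + 1), \\
R &= K_1 L_1 - (K_1 - K_0 + 1)(L_1 - L_0 + 1), \\
\text{e.g. } K_0 = L_0 = 3,\ K_1 = L_1 = 7:\quad
R &= 49 - 5 \cdot 5 = 24 .
\end{align*}
```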

If we take Λ₁ = Λ₀, the above result shows that the Fourier samples f̂ [k] on Γ ⊇ 3Λ₀ are sufficient for the recovery of the minimal degree polynomial μ₀, since in this case μ̂₀ can be identified as the unique non-trivial nullspace vector of 𝒯 (f̂). The following theorem states that once μ₀ is available, f is the unique solution to the annihilation equations (4) and (5):

Theorem 2

[16]. Suppose f is a piecewise constant image (1) whose edge set E = {μ₀ = 0} is the zero set of a trigonometric polynomial μ₀ bandlimited to Λ₀. Suppose the Fourier sampling set Γ ⊇ Λ₀. If g ∈ L¹([0, 1]²) satisfies

μ_{0} \nabla g = 0 subject to \hat{g} [k] = \hat{f} [k] for all k \in Γ,

(9)

then g = f almost everywhere.

In principle, this result allows us to solve for the amplitudes of regions of the piecewise constant function f by plugging in the known μ₀ into the equation (9) and solving a linear system, similar to Prony’s method. However, for complicated piecewise constant images with many regions, it may be more practical to use the approximations introduced in [16].

III. Recovery from non-uniform Fourier samples

The theory presented in Section II shows that the exact recovery of a continuous domain piecewise constant image with a bandlimited edge set is possible when we collect Fourier samples of the image on a sufficiently large uniform grid in Fourier domain. However, the recovery procedure breaks down when we have non-uniform or missing samples, which is often the case in practical settings, e.g., compressed sensing MRI [23]. Therefore, we propose and analyze a method to interpolate the missing samples to a uniform grid in Fourier domain, which guarantees full recovery of the image in spatial domain.

Recall that Theorem 1 says that the structured matrix 𝒯 (f̂) built from the Fourier coefficients f̂ [k], k ∈ Γ, where Γ ⊂ ℤ² is a uniform rectangular grid, is known to be low-rank precisely when f is a piecewise constant image with a bandlimited edge set. Hence we propose to recover f̂ [k], k ∈ Γ from its samples at non-uniform locations Ω ⊂ Γ as the solution to the convex matrix completion problem:

min_{\hat{g} [k], k \in Γ} {‖ T (\hat{g}) ‖}_{*} subject to \hat{g} [k] = \hat{f} [k] for all k \in Ω

(10)

where ||·||_* denotes the nuclear norm, i.e., the sum of the singular values of a matrix, which is the convex relaxation of the rank functional. Note that (10) is different from the standard low-rank matrix completion setting studied in [24], [25] in that the low-rank matrix 𝒯 (f̂) is structured and parameterized by the coefficient vector f̂. Similar structured low-rank matrix completion schemes have been proposed for the recovery of signals from non-uniform Fourier samples [3], [12] and used with empirical success in MRI applications [10], [11], [26]. The main focus of this paper is to determine the sufficient number of samples that will ensure exact recovery of the Fourier coefficients of f on the reconstruction grid Γ with high probability.
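For readers less familiar with the objective in (10), the following short sketch evaluates the nuclear norm directly from its definition. The function name `nuclear_norm` and the random rank-2 test matrix are assumptions made for illustration; NumPy's built-in `np.linalg.norm(X, "nuc")` computes the same quantity.

```python
import numpy as np

def nuclear_norm(X):
    """Sum of the singular values of X."""
    return np.linalg.svd(X, compute_uv=False).sum()

rng = np.random.default_rng(0)
# A generic 8 x 6 matrix of rank 2, built as a product of thin factors.
X = rng.standard_normal((8, 2)) @ rng.standard_normal((2, 6))
Y = rng.standard_normal((8, 6))

value = nuclear_norm(X)
# Unlike the rank, the nuclear norm is a norm and obeys the triangle inequality,
# which is what makes program (10) convex.
gap = nuclear_norm(X) + nuclear_norm(Y) - nuclear_norm(X + Y)
```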

A. Role of incoherence

Several authors have shown that the sufficient number of samples for low-rank matrix recovery by nuclear norm minimization to succeed is dependent on the incoherence of the sampling basis with respect to the matrix to be recovered [3], [25]. Similarly, our results depend on an incoherence measure derived from the structure of the matrix 𝒯 (f̂) and properties of the piecewise constant image f. In particular, define ℘_U and ℘_V to be the orthogonal projections onto the column space and row space of 𝒯 (f̂), respectively, i.e., if 𝒯 (f̂) = UΣV^* is the rank-R singular value decomposition then ℘_UX = UU^*X, ℘_VX = XVV^*. In Appendix B, we show that the structured matrix 𝒯 (f̂) can be expanded using an orthonormal basis of matrices A_k such that

T (\hat{f}) = \sum_{k \in Γ / {0}} \hat{f} [k] w [k] A_{k}

(11)

where w[k], k ∈ Γ/{0} are a set of positive weights that do not depend on f̂. Similar to results in [3], [12], [25], we prove that nuclear norm minimization (10) recovers the exact low-rank solution with high probability provided we can uniformly bound the norms of the projections of the sampling basis matrices A_k onto the row and column spaces of 𝒯 (f̂):

Proposition 3

Consider 𝒯 (f̂) of rank R corresponding to a piecewise constant function f whose edge set coincides with the zero set of μ₀, let ρ be the incoherency measure associated with μ₀ to be defined in the sequel, and set c_s = |Γ|/|Λ₁|. Then we have

max_{k \in Γ} {‖ P_{U} A_{k} ‖}_{F}^{2} \leq \frac{ρ R c_{s}}{∣ Γ ∣},

(12)

max_{k \in Γ} {‖ P_{V} A_{k} ‖}_{F}^{2} \leq \frac{ρ R c_{s}}{∣ Γ ∣}

(13)

The proof in Section VIII-F relies on the characterizations of the row and column spaces of 𝒯 (f̂) derived in Lemma 6 and Lemma 8, respectively, in the next section. These results will be used in the derivation of the main theorem in Section IX.

B. Main Results

We now present our main results, which determine the sufficient number of random Fourier samples for the convex structured low-rank matrix completion program (10) to succeed with high probability. Our first theorem addresses the case of recovery from noiseless Fourier samples:

Theorem 4

Let f be a continuous domain piecewise constant image (1), whose edge-set is described by the zero-set of the trigonometric polynomial μ₀ bandlimited to Λ₀ (see (3)). Let Ω ⊂ Γ be an index set drawn uniformly at random within Γ. Then there exists a universal constant c > 0 such that the solution to (10) is f̂ with probability exceeding 1 − |Γ|⁻², provided

∣ Ω ∣ > c ρ c_{s} R {log}^{4} ∣ Γ ∣,

(14)

where R = |Λ₁| − |Λ₁ : Λ₀| = rank 𝒯 (f̂), c_s = |Γ|/|Λ₁|, and ρ ≥ 1 is an incoherence measure depending on the geometry of the edge-set, to be defined in the sequel.

To better understand the dependence of the bound in (14) on the filter size Λ₁ and the edge set bandwidth Λ₀, assume for simplicity that Λ₁ is some dilation of Λ₀, that is, Λ₁ = αΛ₀, where α > 1 is an integer. In this case, the factor c_s R in (14) simplifies to

(\frac{∣ Λ_{1} ∣ - ∣ Λ_{1} : Λ_{0} ∣}{∣ Λ_{1} ∣}) ∣ Γ ∣ \leq (\frac{α^{2} - {(α - 1)}^{2}}{α^{2}}) ∣ Γ ∣ \leq \frac{2 ∣ Γ ∣}{α} .

(15)

Therefore, assuming the other constants in (14) are fixed, the number of measurements sufficient for exact recovery is proportional to the reciprocal of the dilation factor α. This suggests taking the filter size Λ₁ to be as large as allowed by Theorem 4. Namely, Λ₁ should satisfy 2Λ₁ + Λ₀ = Γ, i.e., the side-lengths of the filter support Λ₁ should be roughly half those of the reconstruction grid Γ. With the filter support Λ₁ fixed in this way, we have Γ = (2α + 1)Λ₀, and so |Γ| ≤ (2α + 1)²|Λ₀|. Inserting this bound into (15) gives

c_{s} R = O (α ∣ Λ_{0} ∣) .

(16)

Combined with (14), this shows that the number of measurements sufficient for exact recovery is on the order of |Λ₀|, up to incoherence and log factors.
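The step from (15) to (16) can be spelled out as follows. This is a sketch of the arithmetic only, using the bound |Γ| ≤ (2α + 1)²|Λ₀| stated above and the elementary inequality 2α + 1 ≤ 3α for α ≥ 1; the explicit constant 18 is a by-product of this estimate and is not claimed to be sharp.

```latex
\begin{align*}
c_s R \;\le\; \frac{2\,|\Gamma|}{\alpha}
      \;\le\; \frac{2\,(2\alpha + 1)^2\,|\Lambda_0|}{\alpha}
      \;\le\; \frac{2\,(3\alpha)^2\,|\Lambda_0|}{\alpha}
      \;=\; 18\,\alpha\,|\Lambda_0| .
\end{align*}
```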

The proof of Theorem 4, detailed in Appendix B, is in line with the approach of [3]. In particular, we prove the result by constructing an approximate dual certificate using the well-known “golfing scheme” of [25]. The main differences between the proof of the above result and that in [3] result from the differences in the matrix structure and hence in the characterization of the incoherence between the row and column subspaces of 𝒯 (f̂) and the sampling basis. In particular, the matrix 𝒯 (f̂) we consider is obtained by stacking two block Toeplitz with Toeplitz blocks (BTTB) matrices whose entries are the weighted Fourier coefficients of f, as opposed to a single unweighted BTTB matrix in [3]. The approach in [3] relies on an explicit low-rank factorization of a BTTB matrix in terms of Vandermonde-like matrices². Since this factorization is not available in our setting, we use algebraic properties of trigonometric polynomials to give a new characterization of the row and column spaces of the matrix. In particular, we show in Section IV that similar Vandermonde-like basis matrices exist for the row and column space of the lifted matrix, and use these to derive a related incoherence measure that satisfies the bounds in Prop. 3.

C. Recovery in the presence of noise and model-mismatch

We now generalize (10) to the setting where we have noisy or corrupted Fourier samples

{\hat{f}}_{n} [k] = \hat{f} [k] + η [k], k \in Ω,

(17)

where η[k] ∈ ℂ is a vector of noise. In this case, we pose recovery as

min_{\hat{g}} {‖ T (\hat{g}) ‖}_{*} subject to {‖ P_{Ω} ({\hat{f}}_{n} - \hat{g}) ‖}_{2} \leq δ .

(18)

where δ > 0 is an estimate of the ℓ²-norm of the error ||η||, and ℘_Ω denotes projection onto Ω. We make no assumptions on the statistics of the noise η. In particular, η can represent errors due to model-mismatch, such as when the image is not perfectly piecewise constant, or when the edge set of the image does not coincide perfectly with the zero level-set of a bandlimited function.

The following theorem shows that when the deviation of f̂_n from f̂ is small, the modified recovery program (18) recovers a solution that is close in norm to f̂ under the same sampling conditions as Theorem 4.

Theorem 5

Let f be specified by (1), whose edge-set is described by the zero-set of the trigonometric polynomial μ₀ bandlimited to Λ₀ with associated incoherence measure ρ. Let Ω ⊂ Γ be an index set drawn uniformly at random within Γ such that |Ω| satisfies the bound (14) in Theorem 4. If the measurements f̂_n satisfy ||℘_Ω(f̂_n − f̂)||₂ ≤ δ, then the solution ĝ to (18) satisfies

{‖ T (\hat{f}) - T (\hat{g}) ‖}_{F} \leq 5 {∣ Γ ∣}^{2} δ .

(19)

with probability exceeding 1 − |Γ|⁻².

See Section IV in the Supplementary Materials for proof. The bound (19) allows us to quantify the effect of model-mismatch on recovery. In particular, suppose the image f_n represents a perturbation from an ideal piecewise constant image f such that their difference in L²-norm is δ-small:

{‖ f_{n} - f ‖}_{L^{2}} = {(\int_{{[0, 1]}^{2}} {∣ f_{n} (r) - f (r) ∣}^{2} d r)}^{\frac{1}{2}} \leq δ .

(20)

Then by Parseval’s theorem, the measurements of f̂_n satisfy ||℘_Ω(f̂_n − f̂)||₂ ≤ δ, hence Theorem 5 applies. From (19) we obtain the bound ||𝒯 (f̂) − 𝒯 (ĝ)||_F ≤ 5|Γ|² ||f_n − f||_{L²}. This shows that if the image f_n is close to the ideal piecewise constant image f in spatial domain L²-norm, then the matrix 𝒯 (ĝ) we recover using (18) will be close in norm to 𝒯 (f̂) with high probability.
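The Parseval step used here can be written out explicitly. The chain below is a sketch that restricts the sum over all Fourier coefficients to the sampled set Ω and then applies Parseval's theorem on [0, 1]²:

```latex
\begin{align*}
\| \mathcal{P}_\Omega(\hat f_n - \hat f) \|_2^2
  &= \sum_{k \in \Omega} \bigl| \hat f_n[k] - \hat f[k] \bigr|^2
   \;\le\; \sum_{k \in \mathbb{Z}^2} \bigl| \hat f_n[k] - \hat f[k] \bigr|^2 \\
  &= \int_{[0,1]^2} | f_n(r) - f(r) |^2 \, dr
   \;=\; \| f_n - f \|_{L^2}^2 \;\le\; \delta^2 .
\end{align*}
```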

IV. Row and column spaces of 𝒯 (f̂) and incoherence

In this section we define an incoherence measure ρ that satisfies the desired bounds in Prop. 3. We show that the incoherence measure depends only on the geometry of the edge set of the image. The incoherence measure is derived from a new characterization of the row and column spaces of the matrix 𝒯 (f̂) in terms of Vandermonde-like basis matrices.

A. Row and column spaces of 𝒯 (f̂)

Our first lemma gives a basis for the row space of 𝒯 (f̂):

Lemma 6

A basis of the row space of 𝒯 (f̂) is given by the columns of the |Λ₁| × R Vandermonde-like matrix

E_{row} (P) : = \frac{1}{\sqrt{∣ Λ_{1} ∣}} (\begin{matrix} e^{j 2 π k_{1} \cdot r_{1}} & \dots & e^{j 2 π k_{1} \cdot r_{R}} \\ ⋮ & & ⋮ \\ e^{j 2 π k_{∣ Λ_{1} ∣} \cdot r_{1}} & \dots & e^{j 2 π k_{∣ Λ_{1} ∣} \cdot r_{R}} \end{matrix})

(21)

where {k₁, …, k_{|Λ₁|}} is a linear indexing of elements in Λ₁, and P = {r₁, …, r_R} is a set of R = |Λ₁| − |Λ₁ : Λ₀| distinct points on the edge set curve {μ₀ = 0} chosen such that the columns of E_row are linearly independent.

The careful reader will have noticed that Lemma 6 takes for granted the existence of a set of points P = {r₁, …, r_R} ⊂ {μ₀ = 0} such that the columns of E_row(P) are linearly independent. Call such a set P a set of admissible nodes for the curve {μ₀ = 0}. The following result shows that sets of admissible nodes always exist and are easy to construct:

Lemma 7

Let μ₀ be bandlimited to Λ₀. Any set of M ≥ R + |Λ₀| distinct points on the curve {μ₀ = 0} contains a subset of R points that are a set of admissible nodes.

The next lemma shows that we can characterize the column space of 𝒯 (f̂) in a similar way as the row space:

Lemma 8

A basis of the column space of 𝒯 (f̂) is given by the columns of the 2|Λ₂| × R weighted Vandermonde-like matrix:

E_{col} (P) = \frac{1}{\sqrt{∣ Λ_{2} ∣}} (\begin{array}{ccc} \frac{w_{1, x}}{‖ w_{1} ‖} e^{j 2 π k_{1} \cdot r_{1}} & \dots & \frac{w_{R, x}}{‖ w_{R} ‖} e^{j 2 π k_{1} \cdot r_{R}} \\ ⋮ & & ⋮ \\ \frac{w_{1, x}}{‖ w_{1} ‖} e^{j 2 π k_{∣ Λ_{2} ∣} \cdot r_{1}} & \dots & \frac{w_{R, x}}{‖ w_{R} ‖} e^{j 2 π k_{∣ Λ_{2} ∣} \cdot r_{R}} \\ \frac{w_{1, y}}{‖ w_{1} ‖} e^{j 2 π k_{1} \cdot r_{1}} & \dots & \frac{w_{R, y}}{‖ w_{R} ‖} e^{j 2 π k_{1} \cdot r_{R}} \\ ⋮ & & ⋮ \\ \frac{w_{1, y}}{‖ w_{1} ‖} e^{j 2 π k_{∣ Λ_{2} ∣} \cdot r_{1}} & \dots & \frac{w_{R, y}}{‖ w_{R} ‖} e^{j 2 π k_{∣ Λ_{2} ∣} \cdot r_{R}} \end{array}),

(22)

where {k₁, …, k_{|Λ₂|}} is a linear indexing of elements in Λ₂ and P = {r₁, …, r_R} is a set of admissible nodes for the curve {μ₀ = 0}. The weight vectors w_i = (w_{i,x}, w_{i,y}) are described by the formula (52) in Appendix VIII, and depend only on the edge set {μ₀ = 0}, the nodes P, and the filter support Λ₁.

See Section VIII-C for the proofs of Lemmas 6 and 7, and Section VIII-E for the proof of Lemma 8.

B. Incoherence measure

We now show how to define an incoherence measure ρ that satisfies the desired bounds in Prop. 3. Consider the Gram matrix G(P) = [E_row(P)]^*E_row(P), where P is any set of R points r₁, …, r_R on the edge set curve {μ₀ = 0}. It is easy to see from the definition (21) that the entries of G(P) are specified by

{(G (P))}_{i, j} = \frac{1}{∣ Λ_{1} ∣} D_{Λ_{1}} (r_{i} - r_{j}), 1 \leq i, j \leq R,

(23)

where D_{Λ₁}(r) := Σ_{k∈Λ₁} e^{j2πk·r} is the Dirichlet kernel supported on Λ₁. Note that G(P) has ones along the diagonal, and the magnitude of the off-diagonal entries is dictated by the distances |r_i − r_j| and the filter support Λ₁. We now define the incoherence measure ρ associated with the edge set E = {μ₀ = 0} in terms of G(P).

Definition 9

Suppose the edge set curve E = {μ₀ = 0} has bandwidth Λ₀ (see (3)), and set R = |Λ₁| − |Λ₁: Λ₀|. Define the incoherence measure ρ by

ρ = min_{\begin{matrix} P \subset {μ_{0} = 0} \\ ∣ P ∣ = R \end{matrix}} \frac{1}{λ_{\min} [G (P)]},

(24)

where λ_min[G(P)] is the minimum eigenvalue of G(P).

Put in words, among all possible arrangements of R points along the edge-set {μ₀ = 0}, we seek the arrangement such that the minimum eigenvalue of G(P) is as large as possible. Intuitively, the optimal arrangement will maximize the minimum separation distance among the R points, and ρ can be thought of as a measure of this geometric property. In particular, edge set curves that enclose a small area, and hence require the points P to be closely spaced along the curve, will result in a large value of ρ. According to Theorem 4, the measurement burden will be high for such curves.
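For a fixed node set P, the quantity 1/λ_min[G(P)] in (24) is straightforward to evaluate numerically; when P lies on the edge set, this value is an upper bound on ρ, since ρ is the minimum over all such P. The sketch below is illustrative only: the function names are hypothetical, the filter support is assumed to be a centered 7 × 7 square, and the 24 nodes are placed on a circle purely for convenience (a circle is not claimed to be the zero set of a trigonometric polynomial).

```python
import numpy as np

def e_row(points, half_width):
    """Vandermonde-like matrix (21) for a centered square filter support.

    points: (R, 2) array of (x, y) nodes; the support has side 2*half_width + 1.
    """
    k = np.arange(-half_width, half_width + 1)
    kx, ky = np.meshgrid(k, k, indexing="xy")
    K = np.stack([kx.ravel(), ky.ravel()], axis=1)      # (|Lambda_1|, 2)
    return np.exp(2j * np.pi * (K @ points.T)) / np.sqrt(K.shape[0])

def gram(points, half_width):
    """Gram matrix G(P) = E_row(P)^* E_row(P)."""
    E = e_row(points, half_width)
    return E.conj().T @ E

def inverse_min_eig(points, half_width):
    """1 / lambda_min[G(P)] for the given node set P."""
    return 1.0 / np.linalg.eigvalsh(gram(points, half_width)).min()

def dirichlet(r, half_width):
    """Dirichlet kernel of the centered square support, evaluated at r = (x, y)."""
    k = np.arange(-half_width, half_width + 1)
    return np.exp(2j * np.pi * k * r[0]).sum() * np.exp(2j * np.pi * k * r[1]).sum()

theta = 2 * np.pi * np.arange(24) / 24
points = 0.5 + 0.3 * np.stack([np.cos(theta), np.sin(theta)], axis=1)
G = gram(points, 3)
value = inverse_min_eig(points, 3)
```

Since G(P) has unit diagonal, its eigenvalues average to one, so `inverse_min_eig` is never smaller than one, in agreement with ρ ≥ 1 in Theorem 4.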

Note that curves corresponding to a particular bandwidth can come in different sizes. Specifically, for a fixed μ₀ with bandwidth Λ₀ consider the family of curves {μ₀ = α}, where α is a scalar. One can change α to obtain multiple curves with exactly the same bandwidth, each of which corresponds to a different level-set of μ₀. These level-sets will have different incoherence measures, depending on how large or small the level-set curves are. This shows the incoherence of an edge set captures something besides its bandwidth. See Figure 3 for an illustration.

Fig. 3 — Illustration of edge set incoherence measure ρ. In (a) are the level-sets of trigonometric polynomial μ₀ bandlimited to Λ₀ of size 3×3. These curves all have the same bandwidth, Λ₀, but come in different sizes. In (b)–(d) we show R = 24 nodes on the curve giving the indicated bound on incoherence parameter ρ defined in (24), assuming a filter Λ₁ of size 7×7. Observe that the incoherence measure increases as the curve gets smaller. This indicates the smaller curves have a significant sampling burden.

We can give the incoherence measure of an edge set a more precise geometric interpretation based on the minimum separation distance of a set of admissible nodes. We generalize a bound on the condition number of Vandermonde matrices derived in [27] to the case of the Vandermonde-like matrix (21), and use this to derive a bound for the incoherence parameter ρ.

Theorem 10

Assume that the points $P = {(x_{i}, y_{i})}_{i = 1}^{R}$ belonging to the curve {μ₀ = 0} satisfy |x_i − x_j | > Δ and |y_i − y_j | > Δ for all i ≠ j. Assume the filter support Λ₁ ⊂ ℤ² is a square region symmetric around the origin of size $\sqrt{∣ Λ_{1} ∣} \times \sqrt{∣ Λ_{1} ∣}$ . Then

ρ \leq {(1 - \frac{1}{\sqrt{∣ Λ_{1} ∣} Δ})}^{- 2},

(25)

where ρ is the incoherence parameter (24) associated with the curve {μ₀ = 0}.

See Section I of the Supplementary Materials for the proof. The bound in (25) shows that the incoherence is close to one (i.e., is as small as possible) when $Δ ≫ 1 / \sqrt{∣ Λ_{1} ∣}$ . Since Δ is the spacing between each pair of points on the curve, to achieve a larger Δ spacing, and hence a smaller ρ, requires a larger curve. This suggests that fewer measurements are required to recover a larger curve, which is consistent with the findings in the isolated Dirac setting [27], [28].
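To get a feel for how quickly the bound (25) approaches one, the following minimal sketch evaluates its right-hand side for a few separations Δ. The function name `incoherence_bound` and the example values of Δ are illustrative assumptions, not taken from the paper; the 7×7 filter matches the setting of Fig. 3.

```python
def incoherence_bound(filter_side: int, delta: float) -> float:
    """Right-hand side of (25) for a square filter with side sqrt(|Lambda_1|).

    The expression is only meaningful when sqrt(|Lambda_1|) * delta > 1.
    """
    x = 1.0 / (filter_side * delta)
    if x >= 1.0:
        raise ValueError("need sqrt(|Lambda_1|) * delta > 1 for the bound to apply")
    return (1.0 - x) ** -2


# Hypothetical separations for a 7x7 filter (sqrt(|Lambda_1|) = 7).
for delta in (0.2, 0.3, 0.5):
    print(f"delta = {delta}: rho <= {incoherence_bound(7, delta):.3f}")
```

Larger separations give a smaller bound, which is the quantitative form of the statement that larger curves carry a lower sampling burden.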

V. Numerical Experiments

A. Algorithms

For small to moderate problem sizes the nuclear norm minimization problem (10) can be solved efficiently with the alternating direction method of multipliers (ADMM) algorithm, which results in a modification of the singular value thresholding (SVT) algorithm [29]. This approach has been proposed for related structured low-rank matrix completion problems in several works, e.g., [3], [11], [12], [30]. We adopt this approach here as well for our small-scale numerical experiments. A detailed implementation of this algorithm can be found in, e.g., [28]. However, we note that for large-scale problems, such as those encountered in realistic imaging applications, more efficient approaches need to be adopted, because in these cases the lifted matrix is often too large to be held in memory. A fast algorithm for solving an approximation to (10) for large-scale problems is given in [19].
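The core step shared by SVT-type algorithms is the soft-thresholding of singular values, which is the proximal operator of the nuclear norm. The sketch below shows only that step; it is not the authors' full ADMM implementation, and the function name and threshold parameter `tau` are illustrative.

```python
import numpy as np


def singular_value_threshold(X: np.ndarray, tau: float) -> np.ndarray:
    """Shrink every singular value of X by tau, clipping at zero."""
    U, s, Vh = np.linalg.svd(X, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)
    # Scale the columns of U by the shrunk singular values, then recombine.
    return (U * s_shrunk) @ Vh


# Example: thresholding a random matrix lowers its rank when tau exceeds
# some of its singular values.
rng = np.random.default_rng(0)
X = rng.standard_normal((8, 5))
Y = singular_value_threshold(X, tau=2.0)
print(np.linalg.matrix_rank(X), np.linalg.matrix_rank(Y))
```

In an ADMM scheme for a problem like (10), a step of this kind would alternate with updates that enforce the structure and data constraints.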

B. Phase transitions

In Fig. 4, we study the probability of exact recovery under different assumptions on the filter size and edge set of the image. For these experiments the reconstruction grid Γ was of size 65 × 65. We generated synthetic random piecewise constant functions with known edge set bandwidth (see Fig. 3(c)), and attempted to recover their Fourier coefficients in Γ from random samples in Ω at the specified undersampling factor. For each set of parameters we ran 10 random trials. We count the recovery as “exact” if the recovered coefficients f̂ satisfied ||f̂ − f̂₀||/||f̂₀|| < 10⁻³, where f̂₀ is the ground truth. The exact recovery rate was then obtained by averaging over the 10 trials.
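The exact-recovery criterion and the averaging over trials can be sketched as follows. The helper names and the synthetic "recoveries" are hypothetical stand-ins; in the actual experiment each recovered array would come from solving (10).

```python
import numpy as np


def is_exact(f_rec: np.ndarray, f_true: np.ndarray, tol: float = 1e-3) -> bool:
    """Relative-error test ||f_rec - f_true|| / ||f_true|| < tol."""
    return bool(np.linalg.norm(f_rec - f_true) / np.linalg.norm(f_true) < tol)


def exact_recovery_rate(trials, tol: float = 1e-3) -> float:
    """Fraction of (recovered, ground truth) pairs that pass the test."""
    return float(np.mean([is_exact(rec, true, tol) for rec, true in trials]))


# Two synthetic trials: one nearly perfect, one with 10% relative error.
rng = np.random.default_rng(0)
f_true = rng.standard_normal(100) + 1j * rng.standard_normal(100)
trials = [(f_true * (1 + 1e-5), f_true), (f_true * 1.1, f_true)]
rate = exact_recovery_rate(trials)
print(rate)
```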

First, in Fig. 4(a), we studied the effect of changing the filter size Λ₁ on the recovery while keeping other parameters constant. We fixed the edge-set bandwidth to |Λ₀| = 9×9 and varied the filter size as |Λ₁| = (2K +1)×(2K +1) for K = 1, …, 30. We call K the filter bandwidth. Note that Theorem 4 has restrictions on how large Λ₁ can be. The maximum filter bandwidth for which Theorem 4 holds in this case was K = 15 (red line in Figure 4(a)); however, we extended the filter size beyond this value to observe the behavior of the algorithm outside of this regime. As predicted by Theorem 4, we find that the optimal performance is obtained when Λ₁ is as large as Theorem 4 allows (roughly half the size of Γ in each dimension).

Next, in Fig. 4(b), we study the recovery as a function of the bandwidth of the edge-set of the image. The filter bandwidth was fixed at K = 15, and we varied the edge-set bandwidth as |Λ₀| = (2K₀ +1)×(2K₀ +1). The phase transition shows dependence |Ω| = O(|Λ₀|) as predicted by Theorem 4.

C. Comparison with TV minimization on real MRI data

We also compare the proposed Fourier domain interpolation scheme against standard discrete TV minimization in spatial domain:

min_{u \in ℂ^{N}} T V (u) subject to P_{Ω} (Fu) = P_{Ω} (F u_{0}) .

(26)

Here u ∈ ℂ^N with N = N_xN_y is a 2-D array representing a discrete N_x × N_y image, u₀ ∈ ℂ^N is the image to be recovered, F ∈ ℂ^{N×N} denotes the unitary 2-D discrete Fourier transform (DFT) matrix acting on N_x × N_y arrays, P_Ω is the projection onto the index set of sampling locations Ω ⊂ [N_x]×[N_y], and TV(·) denotes the (isotropic) total variation semi-norm:

T V (u) = \sum_{i = 1}^{N} {({∣ {(\partial_{1} u)}_{i} ∣}^{2} + {∣ {(\partial_{2} u)}_{i} ∣}^{2})}^{\frac{1}{2}}

(27)

where ∂₁ and ∂₂ are finite difference operators in the horizontal and vertical directions, respectively. The problem (26) has been studied extensively [31]–[36] as a model for undersampled MRI reconstruction and other inverse problems in imaging.
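As a concrete reading of (27), the sketch below computes the isotropic TV of a 2-D array. The forward-difference convention u[n] − u[n+1] and the circular boundary handling are assumptions made here for illustration; the paper does not fix these details at this point.

```python
import numpy as np


def isotropic_tv(u: np.ndarray) -> float:
    """Isotropic total variation of a 2-D (possibly complex) array."""
    d1 = u - np.roll(u, -1, axis=1)  # horizontal differences, circular boundary
    d2 = u - np.roll(u, -1, axis=0)  # vertical differences, circular boundary
    return float(np.sum(np.sqrt(np.abs(d1) ** 2 + np.abs(d2) ** 2)))


# Indicator of a 4x4 square inside an 8x8 image: the TV is concentrated on
# the boundary of the square.
u = np.zeros((8, 8))
u[2:6, 2:6] = 1.0
print(isotropic_tv(u))
```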

In Fig. 5 we perform an experiment comparing TV minimization against the proposed approach on real MRI data. For this experiment we used a fully-sampled four-coil single-slice acquisition consisting of 256 × 256 Cartesian k-space samples, which was compressed to a single virtual coil using an SVD-based technique [37]. The data in the single virtual coil was observed to have smoothly varying complex phase in image domain. To compensate for this source of model mismatch, we further preprocessed the data by removing the complex phase in image domain. We note that this preprocessing step is unrealistic for a true MRI experiment. However, the optimization problem (10) could be modified to incorporate an estimate of the smoothly varying phase in the measurement model; we omit this step for simplicity. Finally, we retrospectively undersampled the preprocessed virtual single coil data, taking 50% uniform random samples. We find that the proposed structured low-rank recovery shows a significant improvement in recovery error over standard total variation as measured by SNR = 20log₁₀(||f̂||/||f̂^* − f̂||), where f̂^* is the recovered data and f̂ is the ground truth. The error images indicate the proposed method more faithfully recovers the true edges of the image.
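The SNR figure of merit used above can be computed as in the following sketch. The function name `snr_db` and the synthetic data are illustrative and unrelated to the MRI dataset.

```python
import numpy as np


def snr_db(f_true: np.ndarray, f_rec: np.ndarray) -> float:
    """SNR = 20 log10(||f_true|| / ||f_rec - f_true||), in decibels."""
    return float(20 * np.log10(np.linalg.norm(f_true) / np.linalg.norm(f_rec - f_true)))


# A uniform 10% amplitude error corresponds to an SNR of about 20 dB.
f_true = np.ones(100)
f_rec = 1.1 * f_true
print(snr_db(f_true, f_rec))
```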

VI. Discussion

Discrete domain total-variation minimization has played a central role in compressed sensing from its inception [31], [32], which models the image to be recovered as (approximately) piecewise constant. Since the present work can be thought of as an extension of compressed sensing type guarantees to the continuous domain setting, it is fruitful to explore the connections between our continuous domain model and discrete domain total variation.

At first glance, the structured low-rank matrix completion problem (10) may seem far removed from the TV-minimization problem (26). But, in fact, one can show TV-minimization (26) is equivalent to nuclear norm minimization of a related structured matrix lifting in Fourier domain. Specifically, (26) is equivalent to

min_{u} {‖ C (Fu) ‖}_{*} subject to P_{Ω} (Fu) = P_{Ω} (F u_{0}) .

(28)

Here

C (Fu) = [\begin{matrix} C_{x} (Fu) \\ C_{y} (Fu) \end{matrix}] \in ℂ^{2 N \times N}

(29)

and C_x(Fu), C_y(Fu) are block circulant matrices with circulant blocks whose first columns are specified by the arrays v_x = F∂_xu and v_y = F∂_yu, respectively. Assuming circular boundary conditions, we can write (v_x)[k_x, k_y] = (1 − e^{j2πk_x/N_x})(Fu)[k_x, k_y] and (v_y)[k_x, k_y] = (1 − e^{j2πk_y/N_y})(Fu)[k_x, k_y].
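The Fourier multipliers quoted above can be checked numerically. The sketch assumes the difference convention (∂_x u)[n] = u[n] − u[n+1] with circular wrap-around, which is the convention that produces the factor 1 − e^{j2πk/N} under NumPy's DFT sign convention. `np.fft.fft2` is unnormalized rather than unitary, but the multiplier identity does not depend on the normalization.

```python
import numpy as np

rng = np.random.default_rng(0)
Ny, Nx = 6, 8
u = rng.standard_normal((Ny, Nx)) + 1j * rng.standard_normal((Ny, Nx))

# Circular finite differences (assumed convention: u[n] - u[n+1]).
dx_u = u - np.roll(u, -1, axis=1)
dy_u = u - np.roll(u, -1, axis=0)

# Frequency-domain weights 1 - exp(j 2 pi k / N) along each axis.
kx = np.arange(Nx)[None, :]
ky = np.arange(Ny)[:, None]
Fu = np.fft.fft2(u)
vx = (1 - np.exp(2j * np.pi * kx / Nx)) * Fu
vy = (1 - np.exp(2j * np.pi * ky / Ny)) * Fu

print(np.allclose(np.fft.fft2(dx_u), vx), np.allclose(np.fft.fft2(dy_u), vy))
```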

We find it interesting to use this re-formulation of TV-minimization to better understand the proposed approach. In Table I we summarize the similarities and differences. One essential difference is the dimensions of the matrix liftings. In particular, the matrix lifting we propose has dimensions 2|Λ₂| × |Λ₁|, with |Λ₁| ≪ |Λ₂| whereas the matrix lifting associated with TV in (28) has dimensions 2N × N. If the reconstruction grid size is the same in both cases, i.e., |Γ| = N, then the proposed matrix lifting has substantially fewer columns than the one associated with TV. This is due to our assumption that edge set of the image has low bandwidth. In other words, we restrict the degrees of freedom of the model by constructing a lifting with fewer columns. We believe this difference may explain the success of the proposed method over TV-minimization observed empirically in Section V.

TABLE I.

Comparison of proposed scheme with discrete total variation minimization

	TV-minimization	Proposed

Spatial domain	discrete	continuous
Derivative operator	finite differences	exact derivative
Singularity set	discrete points	connected curves

Frequency domain	discrete	discrete
Frequency weighting w_i[k]	1 − e^{j2πk_i/N_i}	j2πk_i
Lifted matrix structure	two-level circulant	two-level Toeplitz
Rank of lifted matrix	sparsity of discrete gradient	bandwidth of edge set


VII. Conclusion

We derived performance guarantees for the recovery of piecewise constant images from random non-uniform Fourier samples via a convex structured low-rank matrix completion problem. This was achieved by adapting results in [3] to the case of a low-rank block two-fold Toeplitz matrix with an additional weighting scheme that arises naturally when considering piecewise constant images. We also define incoherence measures that rely only on geometric properties of the edge set, which indicate that the sampling burden is higher for images with smaller piecewise constant regions.

The recovery guarantees in this work studied the case of uniform random samples. However, in practice we observe that recovery also works well with other types of variable density random sampling, where the low spatial frequencies are more heavily sampled. It would be interesting to adapt our results to a wider variety of sampling distributions, and to identify the optimal sampling strategy for signals belonging to our image model.

Fig. 1 — Annihilation of a piecewise constant function as a multiplication in spatial domain (top) and as a convolution in Fourier domain (bottom). The partial derivatives of a piecewise constant function are supported on the edge set. If there is a bandlimited function μ that is zero along the edge set, then the spatial domain product of μ with the gradient ∇f = (∂_x f, ∂_y f) is identically zero. In Fourier domain, this is equivalent to the annihilation of the arrays j2πk_x f̂[k_x, k_y] and j2πk_y f̂[k_x, k_y] by 2-D convolution with a finite filter determined by the Fourier coefficients μ̂.

Acknowledgments

This work is supported by grants NIH 1R01EB019961-01A1 and ONR N00014-13-1-0202.

VIII. Appendix A: Incoherence Bounds

A. Notation and Preliminaries

To simplify our arguments, we will convert the linear operators 𝒯(f̂) and 𝒯(f̂)* defined in Fourier domain to linear operators acting on spaces of trigonometric polynomials (3) in spatial domain. Specifically, for any index set Ω ⊂ ℤ², let B_Ω denote the vector space of all trigonometric polynomials that have coefficients supported within Ω. Similarly, we denote the space of vector fields ρ = (ρ₁, ρ₂) with components ρ₁, ρ₂ ∈ B_Ω as $B_{Ω}^{2}$ . We set 𝒮(f) = ℱ𝒯(f̂) ℱ⁻¹, where ℱ is the Fourier transform of a periodic function on [0, 1]². For any index set Λ, define the Dirichlet kernel D_Λ(r) := Σ_{k∈Λ} e^{j2πk·r}. For all φ ∈ B_Λ₁, the action of the linear operator $S (f) : B_{Λ_{1}} \to B_{Λ_{2}}^{2}$ can be expressed compactly as

S (f) φ = D_{Λ_{2}} * (φ \nabla f) \in B_{Λ_{2}}^{2},

(30)

where φ∇f is understood as a tempered distribution, and the convolution is applied separately to each vector field component. Here convolution with D_Λ₂ is a bandlimiting operation. Similarly, for $ρ = (ρ_{1}, ρ_{2}) \in B_{Λ_{2}}^{2}$ , the adjoint 𝒮(f)^* acts as

S {(f)}^{*} ρ = D_{Λ_{1}} * (ρ \cdot \nabla f) \in B_{Λ_{1}}

(31)

which is the spatial domain equivalent of the adjoint matrix 𝒯(f̂)^*. More explicitly, if f = 1_U where U is a simply connected region with smooth boundary ∂U, a straightforward argument using the divergence theorem shows that the function 𝒮(f)φ is given pointwise as the weighted curve integral

(S (f) φ) (r) = \oint_{\partial U} D_{Λ_{2}} (r - r^{'}) φ (r^{'}) n (r^{'}) d s (r^{'}),

(32)

for all r ∈ [0, 1]², where n(r′) is the outward unit normal to the curve ∂U at r′, and ds is the arc-length element. Likewise, 𝒮(f)^*ρ is the function given pointwise by

(S {(f)}^{*} ρ) (r) = \oint_{\partial U} D_{Λ_{1}} (r - r^{'}) [ρ (r^{'}) \cdot n (r^{'})] d s (r^{'}),

(33)

for all r ∈ [0, 1]². These formulas can be generalized to an arbitrary piecewise constant function f = Σ_i a_i1_{U_i} by linearity. However, in the remainder we focus on the case where f = 1_U to simplify our arguments.

B. Fundamental subspaces of 𝒮(f) and dimensions

Under the conditions of Theorem 1, the nullspace of 𝒯(f̂) is spanned by shifts of the minimal annihilating filter, $\hat{μ_{0}}$ . In spatial domain, this space consists of all multiples of the minimal degree polynomial γ = η μ₀ such that γ is bandlimited to Λ₁. We denote this space by

{(μ_{0})}_{Λ_{1}} : = \{η μ_{0} : η \in B_{Λ_{1} : Λ_{0}}\} .

(34)

Note that (μ₀)_Λ₁ is a subspace of B_Λ₁ with dimension |Λ₁: Λ₀|. Therefore, the dimension of the kernel of 𝒮(f), denoted by ker 𝒮(f), is given by

dim ker S (f) = ∣ Λ_{1} : Λ_{0} ∣ .

(35)

By the rank-nullity theorem, the dimension of the image of 𝒮(f), denoted by im 𝒮(f), is

dim im S (f) = ∣ Λ_{1} ∣ - ∣ Λ_{1} : Λ_{0} ∣ = R .

(36)

Likewise, the dimension of the coimage im 𝒮(f)^* is also R. Furthermore, since im 𝒮(f)^* = [ker 𝒮(f)]^⊥, we have

im S {(f)}^{*} = {(μ_{0})}_{Λ_{1}}^{⊥}

(37)

This means that any γ ∈ B_Λ₁ is in the row space if and only if γ is orthogonal to every trigonometric polynomial of the form η μ₀ ∈ B_Λ₁, or equivalently,

〈 γ, η μ_{0} 〉 = \int_{{[0, 1]}^{2}} γ (r) \overline{η (r) μ_{0} (r)} d r = 0

(38)

for all η ∈ B_{Λ₁: Λ₀}.

C. Basis for the coimage of 𝒮(f) (corresponding to the row space of 𝒯(f̂))

Let s ∈ [0, 1]², and set φ_s ∈ B_Λ₁ to be the translated Dirichlet kernel:

φ_{s} (r) = D_{Λ_{1}} (r - s) for all r \in {[0, 1]}^{2} .

(39)

Equivalently, φ_s ∈ B_Λ₁ is the trigonometric polynomial specified in Fourier domain as

\hat{φ_{s}} [k] = \begin{cases} e^{- j 2 π s \cdot k} & \text{if } k \in Λ_{1} \\ 0 & \text{if } k \notin Λ_{1} \end{cases} .

(40)

Observe that the inner product of φ_s with any other trigonometric polynomial η ∈ B_Λ₁ is given by the point-evaluation of η at s:

〈 η, φ_{s} 〉 = \sum_{k \in Λ_{1}} \hat{η} [k] e^{j 2 π k \cdot s} = η (s) .

(41)
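The point-evaluation identity (41) is easy to confirm numerically. In the sketch below, the index set Λ₁ is taken to be a 7×7 square and the coefficients of η are random; both choices, and the helper names, are illustrative assumptions.

```python
import numpy as np


def square_index_set(K: int) -> np.ndarray:
    """Index set {-K, ..., K}^2 as an integer array of shape ((2K+1)^2, 2)."""
    k = np.arange(-K, K + 1)
    kx, ky = np.meshgrid(k, k, indexing="ij")
    return np.stack([kx.ravel(), ky.ravel()], axis=1)


def evaluate(coeffs: np.ndarray, Lam: np.ndarray, r: np.ndarray) -> complex:
    """Evaluate the trigonometric polynomial sum_k coeffs[k] exp(j 2 pi k.r)."""
    return complex(np.sum(coeffs * np.exp(2j * np.pi * (Lam @ r))))


rng = np.random.default_rng(0)
Lam1 = square_index_set(3)                       # 7x7 filter support
eta_hat = rng.standard_normal(len(Lam1)) + 1j * rng.standard_normal(len(Lam1))

s = np.array([0.3, 0.7])
phi_hat = np.exp(-2j * np.pi * (Lam1 @ s))       # coefficients in (40)

# Inner product computed in Fourier domain: sum_k eta_hat[k] * conj(phi_hat[k]).
inner = complex(np.sum(eta_hat * np.conj(phi_hat)))
print(np.isclose(inner, evaluate(eta_hat, Lam1, s)))
```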

Suppose now that the point s satisfies μ₀(s) = 0. In this case, we see that φ_s is necessarily in the coimage $im S {(f)}^{*} = {(μ_{0})}_{Λ_{1}}^{⊥}$ since we have

〈 γ μ_{0}, φ_{s} 〉 = γ (s) μ_{0} (s) = 0.

(42)

for any multiple of the minimal polynomial γμ₀ ∈ B_Λ₁, i.e., any element in ker 𝒮(f) = (μ₀)_Λ₁.

We will now show how to construct a basis for the coimage of 𝒮(f) out of elements having the form φ_{r_i} for some r_i, i = 1, …, R, belonging to the zero set of μ₀. For an arbitrary collection of R points $\{r_{i}\}_{i = 1}^{R} \subset \{μ_{0} = 0\}$ , we are not guaranteed that the set of functions $\{φ_{r_{i}}\}_{i = 1}^{R}$ is linearly independent. However, we will show that there exists a constant M = M(Λ₀,Λ₁) such that for any M distinct points $\{r_{i}\}_{i = 1}^{M} \subset \{μ_{0} = 0\}$ we can always find a subset of R linearly independent basis functions from the collection $\{φ_{r_{i}}\}_{i = 1}^{M}$ . The constant M is related to the maximum number of isolated zeros that a system of two trigonometric polynomials can have. The following lemma, which is a consequence of the BKK bound in enumerative algebraic geometry (see, e.g., [38]), puts a bound on M. See Section II of the supplementary material for the proof.

Lemma 11

Let Λ₁ and Λ₀ be rectangular index sets such that Λ₀ ⊂ Λ₁, and set R = |Λ₁| − |Λ₁: Λ₀|. For any μ₀, μ₁ trigonometric polynomials bandlimited to Λ₀ and Λ₁, respectively, the maximum number M of isolated solutions of μ₀(r) = μ₁(r) = 0 is bounded as

M < R + ∣ Λ_{0} ∣ .

(43)

We now prove equivalents of Lemma 6 and Lemma 7 in terms of the spatial domain operator 𝒮(f):

Lemma 12

Let {r₁, …, r_N} be any collection of N distinct points on the curve {μ₀ = 0}, where N ≥ R + |Λ₀|. Then the coimage space $im S {(f)}^{*} = {(μ_{0})}_{Λ_{1}}^{⊥}$ is spanned by the set of shifted Dirichlet kernels φ_i(r) = D_Λ₁ (r − r_i) for all i = 1, …, N, i.e.,

span {φ_{r_{i}}}_{i = 1}^{N} = {(μ_{0})}_{Λ_{1}}^{⊥} .

(44)

In particular, there exists a subset of R = |Λ₁| − |Λ₁: Λ₀| elements of ${φ_{r_{i}}}_{i = 1}^{N}$ that is a basis for the coimage space ${(μ_{0})}_{Λ_{1}}^{⊥}$ .

Proof

All the functions φ_{r_i} are in ${(μ_{0})}_{Λ_{1}}^{⊥}$ since we have 〈φ_i, γμ₀〉 = γ(r_i)μ₀(r_i) = 0 because each r_i belongs to the zero set of μ₀. This implies that

span {φ_{r_{i}}}_{i = 1}^{N} \subseteq {(μ_{0})}_{Λ_{1}}^{⊥} .

(45)

Our focus is on proving (44) with equality. For this, it is sufficient to show that any vector orthogonal to $span {φ_{i}}_{i = 1}^{N}$ is in (μ₀)_Λ₁. Assume that there is a vector η(r) ∈ B_Λ₁ that is in the orthogonal complement space of $span {φ_{i}}_{i = 1}^{N}$ . This is only possible if

〈 η, φ_{i} 〉 = η (r_{i}) = 0, for all i = 1, \dots, N .

(46)

Therefore, η and μ₀ have N zeros in common. By Lemma 11 this is only possible if η contains μ₀ as a factor. This implies that all vectors in the orthogonal complement space of $span {φ_{i}}_{i = 1}^{N}$ are in (μ₀)_Λ₁, or equivalently

span {φ_{r_{i}}}_{i = 1}^{N} \supseteq {(μ_{0})}_{Λ_{1}}^{⊥},

(47)

which together with (45) proves (44).

Finally, we also know that the dimension of ${(μ_{0})}_{Λ_{1}}^{⊥}$ is equal to R < N. Thus, one can select a subset of R basis functions φ_i that are linearly independent and hence a basis for ${(μ_{0})}_{Λ_{1}}^{⊥}$ .

Translating this result to Fourier domain, we see that the row space of 𝒯(f̂) is spanned by the vectors of Fourier coefficients $(\hat{φ_{i}} [k] : k \in Λ_{1}) \in ℂ^{∣ Λ_{1} ∣}$ , for i = 1, …, R. Equivalently, this can be expressed as the columns of the Vandermonde-like matrix E_row specified by (21), which proves Lemmas 6 and 7.

D. Discretization of curve integrals: quadrature formula

Using the results from the previous subsection, we now introduce a quadrature formula for curve integrals, which we will use to determine the range space im 𝒮(f) in the next subsection.

Let γ be any function in B_Λ for any Λ ⊇ Λ₀. Then from the orthogonal decomposition $B_{Λ} = {(μ_{0})}_{Λ} \oplus {(μ_{0})}_{Λ}^{⊥}$ we can decompose γ as

γ (r) = \sum_{i = 1}^{S} a_{i} D_{Λ} (r - r_{i}) + φ (r) μ_{0} (r),

(48)

where S = |Λ|−|Λ: Λ₀|, and where ${D_{Λ} (r - r_{i})}_{i = 1}^{S}$ defines a basis of ${(μ_{0})}_{Λ}^{⊥}$ . Here, the coefficients a_i in (48) are obtained uniquely as

[\begin{matrix} a_{1} \\ ⋮ \\ a_{S} \end{matrix}] = D^{- 1} [\begin{matrix} γ (r_{1}) \\ ⋮ \\ γ (r_{S}) \end{matrix}],

(49)

where D ∈ ℝ^{S×S} is the symmetric matrix with entries [D]_{i,j} = D_Λ(r_i − r_j) for 1 ≤ i, j ≤ S. The above expression can be compactly expressed as a = D⁻¹g, where g = (γ(r₁), …, γ(r_S))^T.
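The relation a = D⁻¹g can be illustrated with a small numerical sketch. The nodes below are placed on a circle purely for illustration and are not claimed to lie on the zero set of any particular μ₀; the term φμ₀ in (48) is simply omitted, since it would vanish at nodes on the curve. The helper name `dirichlet_gram` is hypothetical.

```python
import numpy as np


def dirichlet_gram(nodes: np.ndarray, K: int) -> np.ndarray:
    """Matrix with entries D_Lambda(r_i - r_j) for Lambda = {-K, ..., K}^2."""
    k = np.arange(-K, K + 1)
    kx, ky = np.meshgrid(k, k, indexing="ij")
    Lam = np.stack([kx.ravel(), ky.ravel()], axis=1)   # shape (|Lambda|, 2)
    E = np.exp(2j * np.pi * (nodes @ Lam.T))           # shape (S, |Lambda|)
    # (E E^H)[i, j] = sum_k exp(j 2 pi k.(r_i - r_j)); the imaginary part
    # cancels because Lambda is symmetric about the origin.
    return (E @ E.conj().T).real


# Hypothetical nodes: S = 8 equally spaced points on a circle in [0, 1]^2.
S, K = 8, 2
theta = 2 * np.pi * np.arange(S) / S
nodes = 0.5 + 0.3 * np.stack([np.cos(theta), np.sin(theta)], axis=1)

D = dirichlet_gram(nodes, K)
a_true = np.arange(1.0, S + 1)     # coefficients of the kernel expansion
g = D @ a_true                     # samples gamma(r_i) of the expansion
a = np.linalg.solve(D, g)          # recover the coefficients, a = D^{-1} g
print(np.allclose(a, a_true))
```

Solving the linear system with `np.linalg.solve` avoids forming D⁻¹ explicitly, which is usually preferable numerically.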

Lemma 13

Let f = 1_U where U is a simply connected region with smooth boundary ∂U, which is the zero levelset of μ₀ ∈ B_Λ₀ and let γ ∈ B_Λ. Consider the curve integral of the form

q = \oint_{\partial U} γ (r) n (r) d s (r),

(50)

where n(r) = ∇f(r)/|∇f(r)| is the unit normal on the curve ∂U. The curve integral can be evaluated using the quadrature formula

q = \sum_{i = 1}^{S} γ (r_{i}) w_{i},

(51)

where the S = |Λ| − |Λ : Λ₀| points ${r_{i}}_{i = 1}^{S}$ belong to the curve {μ₀ = 0}, and the corresponding weight vectors w_i ∈ ℝ², i = 1, …, S, are specified by

[\begin{matrix} w_{1} \\ ⋮ \\ w_{S} \end{matrix}] = D^{- 1} [\begin{matrix} v_{1} \\ ⋮ \\ v_{S} \end{matrix}] .

(52)

where v_i = ∮_∂U D_Λ(r − r_i)n(r)ds(r) ∈ ℝ².

Proof

Decomposing γ(r) using (48), and noting that the term φ(r)μ₀(r) vanishes on ∂U because μ₀ = 0 there, we obtain

\oint_{\partial U} γ (r) n (r) d s (r) = \sum_{i = 1}^{S} a_{i} \underbrace{\oint_{\partial U} D_{Λ} (r - r_{i}) n (r) d s (r)}_{: = v_{i}}

(53)

The above sum can be expressed in the vector form as

\sum_{i = 1}^{S} a_{i} v_{i} = a^{*} V = g^{*} D^{- 1} V

(54)

where $V = {[v_{1}^{T}, \dots, v_{S}^{T}]}^{T} \in ℂ^{S \times 2}$ . Setting $W = D^{- 1} V = {[w_{1}^{T}, \dots, w_{S}^{T}]}^{T} \in ℂ^{S \times 2}$ we obtain (51).

E. Basis for the range of 𝒮(f) (corresponding to the column space of 𝒯(f̂))

We now introduce a basis set for im 𝒮(f), which will be used to prove Lemma 8.

Lemma 14

The range of 𝒮(f), denoted by im 𝒮(f) is specified by

im S (f) = span {w_{i} D_{Λ_{2}} (r - r_{i})}_{i = 1}^{R}

(55)

for an appropriate choice of points ${r_{i}}_{i = 1}^{R} \subset {μ_{0} = 0}$ with R = |Λ₁| − |Λ₁: Λ₀|, and where the weight vectors w_i are specified by (52).

Proof

Consider an arbitrary element ρ = (ρ₁, ρ₂) ∈ im 𝒮(f). We can express ρ as ρ = 𝒮(f)ψ = ℬ_Λ₂ (ψ∇f) = D_Λ₂ * (ψ∇f) for some ψ ∈ B_Λ₁. By the curve integral representation (32), we have

ρ (r) = \oint_{\partial U} ψ (s) D_{Λ_{2}} (r - s) n (s) d s = \sum_{i = 1}^{S} ψ (r_{i}) D_{Λ_{2}} (r - r_{i}) w_{i},

(56)

where we applied Lemma 13 in the last step with S = |Γ| − |Γ : Λ₀|, since the integrand ψ(s) D_Λ₂ (r − s) belongs to B_Γ. The above relation shows that any ρ(s) ∈ im 𝒮(f) can be expressed as a linear combination of the functions D_Λ₂ (s − r_i)w_i, for i = 1, …, S. Thus, we have $im S (f) \subset span {D_{Λ_{2}} (r - r_{i}) w_{i}}_{i = 1}^{S}$ . We also know that dim (im 𝒮(f)) = R < S. This implies that we can select a subset of R vectors from the set ${D_{Λ_{2}} (r - r_{i}) w_{i}}_{i = 1}^{S}$ that are linearly independent, which will span im 𝒮(f), and hence define a basis.

Correspondingly, the column space of 𝒯(f̂) is spanned by the Fourier coefficients of the basis vectors w_iD_Λ₂ (r − r_i), or the columns of the 2|Λ₂| × R weighted Vandermonde-like matrix E_col specified by (22).

F. Incoherence Bounds

1) Projection onto row subspace

Let E_row = E_row(P) be any basis for the row space V of 𝒯(f̂) specified by (21), whose columns are vectorized Fourier coefficients of the translated and normalized Dirichlet kernels $φ_{i} (r) = \frac{1}{\sqrt{∣ Λ_{1} ∣}} D_{Λ_{1}} (r - r_{i})$ , i = 1, …,R, for some set of admissible nodes P = {r₁, …, r_R} ⊂ {μ₀ = 0}. Projecting the measurement basis matrix A_k onto V, we have

{‖ P_{V} A_{k} ‖}_{F}^{2} = {‖ A_{k} E_{row} {(E_{row}^{*} E_{row})}^{- 1} E_{row}^{*} ‖}_{F}^{2} \leq {[λ_{\min} (E_{row}^{*} E_{row})]}^{- 1} {‖ A_{k} E_{row} ‖}_{F}^{2}

Since A_k selects |ω(k)| rows of E_row, each of which has R entries of magnitude $1 / \sqrt{∣ Λ_{1} ∣}$ , we have

{‖ A_{k} E_{row} ‖}_{F}^{2} = \frac{1}{∣ ω (k) ∣} \cdot R \cdot ∣ ω (k) ∣ \cdot \frac{1}{∣ Λ_{1} ∣} = \frac{R}{∣ Λ_{1} ∣} = \frac{R c_{s}}{∣ Γ ∣}

(57)

where c_s = |Γ|/|Λ₁|. Hence,

{‖ P_{V} A_{k} ‖}_{F}^{2} \leq {[λ_{\min} (E_{row}^{*} E_{row})]}^{- 1} \frac{R c_{s}}{∣ Γ ∣} .

(58)

Minimizing over all sets of admissible nodes P in the construction of E_row gives the final bound

{‖ P_{V} A_{k} ‖}_{F}^{2} \leq \frac{ρ R c_{s}}{∣ Γ ∣} .

(59)

2) Projection onto column space

Let E_col = E_col(P) be a basis for the column space of 𝒯(f̂) specified by (22), whose columns are vectorized Fourier coefficients of the translated and weighted Dirichlet kernels $\frac{1}{\sqrt{∣ Λ_{2} ∣}} \frac{w_{i}}{‖ w_{i} ‖} D_{Λ_{2}} (r - r_{i})$ , for some set of admissible nodes P = {r₁, …, r_R} ⊂ {μ₀ = 0}. Observe the columns of E_col are defined to have unit ℓ²-norm. Following the same steps as in the row space bound, we have

{‖ P_{U} A_{k} ‖}_{F}^{2} = {‖ E_{col} {(E_{col}^{*} E_{col})}^{- 1} E_{col}^{*} A_{k} ‖}_{F}^{2} \leq {[λ_{\min} (E_{col}^{*} E_{col})]}^{- 1} {‖ E_{col}^{*} A_{k} ‖}_{F}^{2}

Expanding the norm ${‖ E_{col}^{*} A_{k} ‖}_{F}^{2}$ gives

{‖ E_{col}^{*} A_{k} ‖}_{F}^{2} = \frac{1}{∣ Λ_{2} ∣} \sum_{i = 1}^{R} \frac{1}{∣ ω (k) ∣} \sum_{ℓ \in ω (k)} {| 〈 \frac{ℓ}{‖ ℓ ‖}, \frac{w_{i}}{‖ w_{i} ‖} 〉 |}^{2} \leq \frac{R}{∣ Λ_{2} ∣} \leq \frac{R c_{s}}{∣ Γ ∣} .

Hence, we have

{‖ P_{U} A_{k} ‖}_{F}^{2} \leq \frac{ρ^{'} R c_{s}}{∣ Γ ∣} .

(60)

where ρ′ is defined similarly to ρ as:

ρ^{'} = \min_{\substack{P \subset \{μ_{0} = 0\} \\ ∣ P ∣ = R}} \frac{1}{λ_{\min} [E_{col} {(P)}^{*} E_{col} (P)]},

(61)

Finally, we show how to bound ρ′ by ρ in (60). Observe that we can re-define ρ and ρ′ in terms of the minimum singular value of the basis matrices E_row(P) and E_col(P), according to the correspondences:

λ_{\min} (E_{col} {(P)}^{*} E_{col} (P)) = σ_{\min}^{2} (E_{col} (P)), λ_{\min} (E_{row} {(P)}^{*} E_{row} (P)) = σ_{\min}^{2} (E_{row} (P)) .

We will show $σ_{\min}^{2} (E_{row} (P)) \leq σ_{\min}^{2} (E_{col} (P))$ , or equivalently, [λ_min(E_col(P)^*E_col(P))]⁻¹ ≤ [λ_min(E_row(P)^*E_row(P))]⁻¹, for any set P consisting of R points on the edge set. The claim then follows immediately by taking the minimum over all such sets P.

To ease notation, we drop the dependence on the set P in the following. Observe that we can express E_col as

E_{col} = [\begin{matrix} {\tilde{E}}_{col} W_{x} \\ {\tilde{E}}_{col} W_{y} \end{matrix}]

(62)

where $W_{x} = diag (\frac{w_{1, x}}{‖ w_{1} ‖}, \dots, \frac{w_{R, x}}{‖ w_{R} ‖}), W_{y} = diag (\frac{w_{1, y}}{‖ w_{1} ‖}, \dots, \frac{w_{R, y}}{‖ w_{R} ‖})$ , and Ẽ_col ∈ ℂ^{|Λ₂|×R} is the Vandermonde-like matrix given entrywise by [Ẽ_col]_{i,j} = e^{j2πk_i·r_j}, for all k_i ∈ Λ₂, 1 ≤ j ≤ R. In other words, Ẽ_col has the same structure as E_row, but is built with respect to Λ₂ instead of Λ₁. In particular, since we always assume Λ₁ ⊂ Λ₂, the matrix E_row can be embedded as a submatrix of Ẽ_col by restricting the rows of Ẽ_col to those indexed by Λ₁. By the variational characterization of the minimum singular value of a matrix, we have

σ_{\min}^{2} (E_{col}) = \min_{‖ u ‖ = 1} {‖ E_{col} u ‖}^{2} = \min_{‖ u ‖ = 1} ( {‖ {\tilde{E}}_{col} W_{x} u ‖}^{2} + {‖ {\tilde{E}}_{col} W_{y} u ‖}^{2} ) \geq σ_{\min}^{2} ({\tilde{E}}_{col}) \underbrace{({‖ W_{x} u ‖}^{2} + {‖ W_{y} u ‖}^{2})}_{= 1}

(63)

Finally, since E_row is a submatrix of Ẽ_col, we also have $σ_{\min}^{2} (E_{row}) \leq σ_{\min}^{2} ({\tilde{E}}_{col})$ , which together with (63) gives the desired inequality.
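The submatrix step can be checked numerically: deleting rows from a tall matrix cannot increase its smallest singular value. The sketch below uses unnormalized Vandermonde-like matrices built on random nodes, with a 5×5 index set nested inside a 9×9 one; the nodes, sizes, and helper names are illustrative assumptions.

```python
import numpy as np


def vandermonde_like(K: int, nodes: np.ndarray) -> np.ndarray:
    """Matrix with entries exp(j 2 pi k_i . r_j), k_i in {-K, ..., K}^2."""
    k = np.arange(-K, K + 1)
    kx, ky = np.meshgrid(k, k, indexing="ij")
    Lam = np.stack([kx.ravel(), ky.ravel()], axis=1)
    return np.exp(2j * np.pi * (Lam @ nodes.T))     # shape (|Lambda|, R)


def sigma_min(A: np.ndarray) -> float:
    """Smallest singular value of A."""
    return float(np.linalg.svd(A, compute_uv=False).min())


rng = np.random.default_rng(0)
nodes = rng.random((6, 2))                # R = 6 hypothetical nodes in [0, 1]^2
E_small = vandermonde_like(2, nodes)      # rows indexed by a 5x5 set (Lambda_1)
E_big = vandermonde_like(4, nodes)        # rows indexed by a 9x9 superset (Lambda_2)

# The rows of E_small are a subset of the rows of E_big.
print(sigma_min(E_small) <= sigma_min(E_big))
```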

IX. Appendix B: Proof of Main Theorem

A. Reformulation in lifted domain

We now reformulate the recovery of f̂ as a matrix recovery problem in the lifted domain. The matrices 𝒯_x(f̂) and 𝒯_y(f̂) contain several copies of the weighted entries k_x f̂[k] and k_y f̂ [k], respectively. We use ω(k) to denote the set of locations (α₁, α₂) in the matrix 𝒯_x(f̂) or 𝒯_y(f̂) that contain the entry k_x f̂[k] or k_y f̂[k] (this set is the same in either case).

We define the sampling matrices $A_{k} = [\begin{matrix} A_{1, k} \\ A_{2, k} \end{matrix}] \in ℂ^{2 ∣ Λ_{2} ∣ \times ∣ Λ_{1} ∣}$ , for each k = (k₁, k₂) ∈ Γ, where

{(A_{i, k})}_{α} = \begin{cases} \frac{k_{i}}{‖ k ‖ \sqrt{∣ ω_{i} (k) ∣}} & \text{if } α = (α_{1}, α_{2}) \in ω (k) \\ 0 & \text{else} \end{cases}

(64)

for i = 1, 2. The matrices {A_k}_{k∈Γ} form an orthonormal basis for the space of matrices defined by the range of the matrix lifting 𝒯; we will call any matrix in the range of 𝒯 a structured matrix. For any set of coefficients {ĝ[k]}_{k∈Γ} we can expand the structured matrix 𝒯(ĝ) as

T (\hat{g}) = \sum_{k \in Γ} \hat{g} [k] ‖ k ‖ \sqrt{∣ ω_{i} (k) ∣} A_{k} .

(65)

We denote the projection operator corresponding to a single sampling location k by 𝒜_k(X) = 〈A_k,X〉 A_k. Since {A_k}_{k∈Γ} is an orthonormal basis for the space of structured matrices, for any structured matrix X we have Σ_{k∈Γ} 𝒜_k(X) = 𝒜(X) = X. Since {A_k}_{k∈Γ} is not a basis for general X ∈ ℂ^{2|Λ₂|×|Λ₁|}, we also define the projection operator onto the space orthogonal to the space of structured matrices by 𝒜_⊥(X) = (ℐ − 𝒜)(X), where ℐ is the identity operator. In particular, the constraint 𝒜_⊥(X) = 0 implies that X is a structured matrix.
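The roles of 𝒜 and 𝒜_⊥ are perhaps easiest to see in a simplified setting. The sketch below uses ordinary one-level, unweighted Toeplitz matrices as a stand-in for the weighted two-fold block Toeplitz lifting of the paper: each basis matrix is constant on one diagonal and normalized to unit Frobenius norm, so that 𝒜 averages along diagonals. All names and sizes are illustrative.

```python
import numpy as np


def toeplitz_basis(n_rows: int, n_cols: int) -> list:
    """Orthonormal basis of n_rows x n_cols Toeplitz matrices, one per diagonal."""
    basis = []
    for d in range(-(n_rows - 1), n_cols):
        idx = [(i, i + d) for i in range(n_rows) if 0 <= i + d < n_cols]
        A = np.zeros((n_rows, n_cols))
        for i, j in idx:
            A[i, j] = 1.0 / np.sqrt(len(idx))   # len(idx) plays the role of |omega(k)|
        basis.append(A)
    return basis


def project_structured(X: np.ndarray, basis: list) -> np.ndarray:
    """A(X) = sum_k <A_k, X> A_k, the projection onto structured matrices."""
    return sum(np.vdot(A, X) * A for A in basis)


n_rows, n_cols = 4, 3
basis = toeplitz_basis(n_rows, n_cols)

# A Toeplitz matrix built from one value per diagonal.
c = np.arange(1.0, n_rows + n_cols)
T = np.array([[c[j - i + n_rows - 1] for j in range(n_cols)] for i in range(n_rows)])

rng = np.random.default_rng(0)
X = rng.standard_normal((n_rows, n_cols))
X_perp = X - project_structured(X, basis)       # A_perp(X) = (I - A)(X)

print(np.allclose(project_structured(T, basis), T))
print(np.allclose(project_structured(X_perp, basis), 0.0))
```

A structured matrix is left unchanged by 𝒜, while the residual 𝒜_⊥(X) of a generic matrix has no component along any basis element.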

The recovery of f from its partial Fourier samples f̂[k], k ∈ Ω, can thus be reformulated as the completion of a structured matrix X from its measurements 𝒜_k, k ∈ Ω. Since the matrix is structured, we have 𝒜_⊥(X) = 0. We thus reformulate (10) as the structured low-rank recovery problem:

{minimize}_{X} {‖ X ‖}_{*} subject to Q_{Ω} (X) = Q_{Ω} (T (\hat{f})),

(66)

where the operator 𝒬_Ω, which satisfies 𝔼[𝒬_Ω] = ℐ, is defined as:

Q_{Ω} = \frac{∣ Γ ∣}{∣ Ω ∣} A_{Ω} + A^{⊥}

(67)

B. Conditions for perfect recovery

The tangent space T of the matrix X is defined as $T : = \{U X_{1}^{H} + X_{2} V^{H} : X_{1} \in ℂ^{∣ Λ_{2} ∣ \times R}, X_{2} \in ℂ^{∣ Λ_{1} ∣ \times R}\}$ where X = UΛV^H is the singular value decomposition of X. The orthogonal complement of T is denoted by T^⊥. We first show that if ℘_T ≈ ℘_T𝒬_Ω℘_T, and if an approximate dual certificate satisfying certain conditions exists, then we obtain perfect recovery.

Lemma 15

Consider a multiset Ω that contains m random indices. Suppose the sampling operator 𝒬_Ω obeys

‖ P_{T} - P_{T} Q_{Ω} P_{T} ‖ \leq \frac{1}{2}

(68)

and there exists a dual certificate matrix W satisfying

Q_{Ω}^{⊥} (W) = 0

(69)

{‖ P_{T} (W - U V^{*}) ‖}_{F} \leq \frac{1}{6 n}

(70)

‖ P_{T}^{⊥} (W) ‖ \leq \frac{1}{2} .

(71)

Then, 𝒯(f̂) is the unique solution to (66), where n = |Γ| and m = |Ω|.

See Section III-A of the supplementary material for the proof. Equation (68) suggests that 𝒬_Ω ≈ ℐ on the tangent space. The conditions (69), (70), and (71) indicate the existence of a W that approximates the exact dual certificate UV^*. The above lemma is in line with [3, Lemma 1], with the exception of the third condition, indicated by (70). To satisfy (68), we bound the deviation of ℘_T𝒬_Ω℘_T from ℘_T in the following lemma.

Lemma 16

Suppose (12) holds. Then we have

‖ P_{T} - P_{T} Q_{Ω} P_{T} ‖ \leq ε \leq \frac{1}{2}

(72)

with probability exceeding 1 − n⁻⁴, provided that m > c₁ρR c_s log(n).

We prove this using [39, Theorem 1.6]. (See Section III-B of supplementary material)

C. Construction of the approximate dual certificate W

We will now use the golfing scheme of [3], [25] to construct an approximate dual certificate W that satisfies (69), (70), and (71). In particular, we generate j₀ independent random sampling sets Ω_i, 1 ≤ i ≤ j₀, each containing m̃ = m/j₀ samples corresponding to sampling with replacement. We start with F₀ = UV^* and proceed as follows:

1. Set F₀ = UV^* and $j_{0} = 3 {log}_{\frac{1}{ε}} n$ .
2. For all i with 1 ≤ i ≤ j₀, set F_i = ℘_T(ℐ − 𝒬_{Ω_i})℘_T (F_{i−1}).
3. Set $W = \sum_{j = 1}^{j_{0}} Q_{Ω_{j}} F_{j - 1}$ .

Step 3 ensures that W satisfies (69) since each term W_j = 𝒬_{Ω_j}F_{j−1} satisfies $Q_{Ω}^{⊥} (W_{j}) = 0$ . The recursive construction also satisfies (70). In particular,

{‖ P_{T} (W - U V^{*}) ‖}_{F} = {‖ P_{T} F_{j_{0}} ‖}_{F} \leq ε^{j_{0}} {‖ F_{0} ‖}_{F} = ε^{j_{0}} \sqrt{R} \leq ε^{j_{0}} n

Now we focus on showing that W satisfies (71). Note that if j₀ is chosen as $3 {log}_{\frac{1}{ε}} n$ , assuming n > 6, we have ${(ε)}^{j_{0}} n < \frac{1}{6 n}$ .
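The choice of j₀ can be sanity-checked with a short calculation: since ε^{j₀} = ε^{3 log_{1/ε} n} = n⁻³, we get ε^{j₀} n = n⁻², which is below 1/(6n) exactly when n > 6. The sketch below verifies this numerically for the 65 × 65 grid of Section V; the value ε = 1/2 and the function name are illustrative assumptions.

```python
import math


def golfing_rounds(n: int, eps: float) -> float:
    """j0 = 3 log_{1/eps}(n); in practice this would be rounded up to an integer."""
    return 3 * math.log(n) / math.log(1 / eps)


n, eps = 65 * 65, 0.5
j0 = golfing_rounds(n, eps)
lhs = eps ** j0 * n            # analytically equal to n ** -2
print(j0, lhs, 1 / (6 * n))
```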

Lemma 17

For any matrix M, there exists some numerical constant c₂ such that

‖ (I - Q_{Ω}) (M) ‖ \leq c_{2} \sqrt{\frac{n log n}{m}} {‖ M ‖}_{A, 2} + \frac{c_{2} n log n}{m} {‖ M ‖}_{A, \infty},

(73)

with probability at least 1 − n⁻¹⁰. Here,

{‖ M ‖}_{A, \infty} = max_{k \in Γ} | \frac{〈 A_{k}, M 〉}{∣ ω_{k} ∣} |

(74)

{‖ M ‖}_{A, 2} = \sqrt{\sum_{k \in Γ} \frac{{∣ 〈 A_{k}, M 〉 ∣}^{2}}{∣ ω_{k} ∣}}

(75)

See Section III-C of supplementary material for proof.

Lemma 18

Assume that there exists a constant μ₅ such that $ω_{k} {‖ P_{T} (A_{k}) ‖}_{A, 2} \leq \frac{μ_{5} R}{n}$ . For any matrix M, we have

{‖ P_{T} [(I - Q_{Ω}) (M)] ‖}_{A, 2} \leq c_{3} \sqrt{\frac{μ_{5} R log n}{m}} ({‖ M ‖}_{A, 2} + \sqrt{\frac{n log n}{m}} {‖ M ‖}_{A, \infty}),

with probability at least 1 − n⁻¹⁰.

See Section III-D of the Supplementary Materials for proof.

Lemma 19

For any matrix M ∈ T, there exists some numerical constant c₄, such that

{‖ P_{T} [(I - Q_{Ω}) (M)] ‖}_{A, \infty} \leq c_{4} \sqrt{\frac{ρ c_{s} R log n}{m}} \sqrt{\frac{ρ c_{s} R}{n}} {‖ M ‖}_{A, 2} + \frac{c_{4} ρ c_{s} R log n}{m} {‖ M ‖}_{A, \infty},

(76)

with probability at least 1 − n⁻¹⁰.

See Section III-F of the Supplementary Materials for proof. From the golfing scheme, we have $‖ P_{T^{⊥}} (W) ‖ \leq \sum_{j = 1}^{j_{0}} ‖ P_{T^{⊥}} Q_{Ω_{j}} P_{T} F_{j - 1} ‖$ . Using Lemma 17 and substituting from Lemma 18 and Lemma 19, we have

‖ P_{T^{⊥}} Q_{Ω_{j}} F_{j - 1} ‖ \leq {(\frac{1}{2})}^{j_{0} - 1} c_{2} \left\{ \sqrt{\frac{n log n}{\tilde{m}}} {‖ F_{0} ‖}_{A, 2} + \frac{n log n}{\tilde{m}} {‖ F_{0} ‖}_{A, \infty} \right\}

The last inequality holds if m̃ = m/j₀ ≫ max (μ₅, ρc_s)Rlog n. Substituting for $j_{0} = 3 {log}_{\frac{1}{ε}} (n)$ assumed in the golfing scheme, we require m ≫ c₆ max (μ₅, ρc_s)Rlog² n to satisfy the above inequality. See Section III-G of the Supplementary Materials for details. We will now present the lemmas bounding ||F₀||_{𝒜,2} and ||F₀||_{𝒜,∞}, where F₀ = UV^*.

Lemma 20

With the incoherence measure ρ, one can bound

{‖ U V^{*} ‖}_{A, \infty} \leq \frac{ρ c_{s} R}{n}

(77)

{‖ U V^{*} ‖}_{A, 2}^{2} \leq \frac{c_{7} μ_{3} c_{s} {log}^{2} (n) R}{n}

(78)

{‖ P_{T} (\sqrt{ω_{α}} A_{α}) ‖}_{A, 2}^{2} \leq \frac{c_{7} μ_{3} c_{s} {log}^{2} (n) R}{n}, \forall α \in Γ

(79)

where μ₃ = 3ρ and c₇ is some constant.

See Section III-H of the Supplementary Materials for proof.

From (79), we see that the constant μ₅ in Lemma 18 can be chosen as μ₅ = c₇ μ₃ c_s log²(n) such that $ω_{k} {‖ P_{T} (A_{k}) ‖}_{A, 2}^{2} \leq \frac{μ_{5} R}{n}$. Substituting for μ₅, we observe that the dominant term depends on log⁴(n). Thus, ‖𝒫_{T^⊥} 𝒬_{Ω_i} F_{j−1}‖ < 1/2 if m > c₆ c₇ c_s (3ρ) R log⁴(n).
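The log⁴(n) dependence comes from combining the log²(n) factor in μ₅ with the log²(n) factor in the sample requirement. A sketch of the substitution, assuming n is large enough that 3c₇ log²(n) ≥ 1 so that μ₅ dominates ρc_s:

```latex
% Worked step: substituting \mu_5 into m \gg c_6 \max(\mu_5, \rho c_s) R \log^2 n.
% Assumption: 3 c_7 \log^2 n \ge 1, so that \max(\mu_5, \rho c_s) = \mu_5.
\mu_5 = c_7\, \mu_3\, c_s \log^{2} n = 3 c_7\, \rho\, c_s \log^{2} n
\;\ge\; \rho c_s ,
\\[4pt]
c_6 \max(\mu_5, \rho c_s)\, R \log^{2} n
  = c_6 \left( 3 c_7\, \rho\, c_s \log^{2} n \right) R \log^{2} n
  = c_6\, c_7\, c_s\, (3\rho)\, R \log^{4} n .
```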

Footnotes

More precisely, μ₀ is unique up to multiplication by a phase factor $e^{j 2π k · r}$ for some k ∈ ℤ².

The structured matrices considered in [3] are block Hankel with Hankel block matrices (BHHB), but this difference is purely cosmetic: every BTTB matrix can be re-expressed as BHHB after a permutation of its rows and columns. In particular, the Vandermonde-like factorization of BHHB matrices in [3] carries over to BTTB matrices.

Contributor Information

Greg Ongie, Department of EECS, University of Michigan, Ann Arbor, MI 48108 USA.

Sampurna Biswas, Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA, 52245 USA.

Mathews Jacob, Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA, 52245 USA.

References

1. Bhaskar BN, Recht B. Atomic norm denoising with applications to line spectral estimation. Communication, Control, and Computing, 2011 49th Annual Allerton Conference on; IEEE; 2011. pp. 261–268.
2. Candès EJ, Fernandez-Granda C. Towards a mathematical theory of super-resolution. Communications on Pure and Applied Mathematics. 2014;67(6):906–956.
3. Chen Y, Chi Y. Robust spectral compressed sensing via structured matrix completion. Information Theory, IEEE Trans on. 2014;60(10):6576–6601.
4. Stoica P, Moses RL. Introduction to spectral analysis. Vol. 1. Prentice Hall; Upper Saddle River, NJ: 1997.
5. Haacke E, Liang ZP, Izen S. Superresolution reconstruction through object modeling and parameter estimation. Acoustics, Speech and Signal Processing, IEEE Trans on. 1989 Apr;37(4):592–595.
6. Haacke EM, Liang ZP, Izen SH. Constrained reconstruction: A superresolution, optimal signal-to-noise alternative to the Fourier transform in magnetic resonance imaging. Medical Physics. 1989;16(3):388–397. doi: 10.1118/1.596427.
7. Vetterli M, Marziliano P, Blu T. Sampling signals with finite rate of innovation. Signal Processing, IEEE Trans on. 2002;50(6):1417–1428.
8. Jin KH, Lee D, Ye JC. A general framework for compressed sensing and parallel MRI using annihilating filter based low-rank Hankel matrix. 2015 preprint arXiv:1504.00532.
9. Jin KH, Lee D, Ye JC. A novel k-space annihilating filter method for unification between compressed sensing and parallel MRI. IEEE ISBI. 2015.
10. Haldar JP. Low-rank modeling of local k-space neighborhoods (LORAKS) for constrained MRI. Medical Imaging, IEEE Trans on. 2014;33(3):668–681. doi: 10.1109/TMI.2013.2293974.
11. Ongie G, Jacob M. Recovery of piecewise smooth images from few Fourier samples. SampTA. 2015:543–547. doi: 10.1137/15M1042280.
12. Ye JC, Kim JM, Jin KH, Lee K. Compressive sampling using annihilating filter-based low-rank interpolation. IEEE Trans on Information Theory. 2016.
13. Starck JL, Candès EJ, Donoho DL. The curvelet transform for image denoising. Image Processing, IEEE Trans on. 2002;11(6):670–684. doi: 10.1109/TIP.2002.1014998.
14. Do MN, Vetterli M. The contourlet transform: an efficient directional multiresolution image representation. Image Processing, IEEE Trans on. 2005;14(12):2091–2106. doi: 10.1109/tip.2005.859376.
15. Ongie G, Jacob M. Super-resolution MRI using finite rate of innovation curves. IEEE ISBI. 2015.
16. Ongie G, Jacob M. Off-the-grid recovery of piecewise constant images from few Fourier samples. 2015 May;:543–547. doi: 10.1137/15M1042280. arXiv:1510.00384.
17. Pan H, Blu T, Dragotti PL. Sampling curves with finite rate of innovation. Signal Processing, IEEE Trans. on; 2014.
18. Ongie G, Jacob M. A fast algorithm for structured low-rank matrix recovery with applications to undersampled MRI reconstruction. IEEE International Symposium on Biomedical Imaging; 2016.
19. Ongie G, Jacob M. GIRAF: A fast algorithm for structured low-rank matrix recovery. 2016. doi: 10.1109/isbi.2016.7493322. arXiv:1609.07429.
20. Balachandrasekaran A, Ongie G, Jacob M. Accelerated dynamic MRI using structured low rank matrix completion. 2016 IEEE International Conference on Image Processing (ICIP); Institute of Electrical and Electronics Engineers (IEEE); 2016.
21. Ongie G, Biswas S, Jacob M. Structured matrix recovery of piecewise constant signals with performance guarantees. International Conference on Image Processing; 2016.
22. Strichartz RS. A Guide to Distribution Theory and Fourier Transforms. World Scientific Pub Co Pte Lt; 2003.
23. Lustig M, Donoho D, Pauly JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magnetic Resonance in Medicine. 2007;58(6):1182–1195. doi: 10.1002/mrm.21391.
24. Candès E, Recht B. Exact matrix completion via convex optimization. Commun ACM. 2012 Jun;55(6):111–119.
25. Gross D. Recovering low-rank matrices from few coefficients in any basis. Information Theory, IEEE Trans on. 2011;57(3):1548–1566.
26. Shin PJ, Larson PE, Ohliger MA, Elad M, Pauly JM, Vigneron DB, Lustig M. Calibrationless parallel imaging reconstruction based on structured low-rank matrix completion. Magnetic Resonance in Medicine. 2013. doi: 10.1002/mrm.24997.
27. Moitra A. Super-resolution, extremal functions and the condition number of Vandermonde matrices. Proceedings of the 47th Annual ACM on Symposium on Theory of Computing; 2015; pp. 821–830.
28. Ye JC, Kim JM, Jin KH. Compressive sampling using structured low-rank interpolation. 2015 preprint arXiv:1511.08975.
29. Cai JF, Candès EJ, Shen Z. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization. 2010;20(4):1956–1982.
30. Fazel M, Pong TK, Sun D, Tseng P. Hankel matrix rank minimization with applications to system identification and realization. SIAM Journal on Matrix Analysis and Applications. 2013;34(3):946–977.
31. Candès EJ, Romberg J, Tao T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans on Information Theory. 2006;52(2):489–509.
32. Candès EJ, Romberg JK, Tao T. Stable signal recovery from incomplete and inaccurate measurements. Communications on Pure and Applied Mathematics. 2006;59(8):1207–1223.
33. Needell D, Ward R. Near-optimal compressed sensing guarantees for total variation minimization. IEEE Trans on Image Processing. 2013;22(10):3941–3949. doi: 10.1109/TIP.2013.2264681.
34. Needell D, Ward R. Stable image reconstruction using total variation minimization. SIAM Journal on Imaging Sciences. 2013;6(2):1035–1058.
35. Krahmer F, Ward R. Stable and robust sampling strategies for compressive imaging. IEEE Trans on Image Processing. 2014;23(2):612–622. doi: 10.1109/TIP.2013.2288004.
36. Poon C. On the role of total variation in compressed sensing. SIAM Journal on Imaging Sciences. 2015;8(1):682–720.
37. Zhang T, Pauly JM, Vasanawala SS, Lustig M. Coil compression for accelerated imaging with Cartesian sampling. Magnetic Resonance in Medicine. 2013;69(2):571–582. doi: 10.1002/mrm.24267.
38. Li T, Wang X. The BKK root count in cn. Mathematics of Computation of the American Mathematical Society. 1996;65(216):1477–1484.
39. Tropp J. User-friendly tail bounds for sums of random matrices. Foundations of Computational Math. 2012;12(4):389–434.

