Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jan 1.
Published in final edited form as: IEEE Trans Signal Process. 2017 Sep 7;66(1):236–250. doi: 10.1109/TSP.2017.2750111

Convex recovery of continuous domain piecewise constant images from nonuniform Fourier samples

Greg Ongie 1, Sampurna Biswas 2, Mathews Jacob 3,
PMCID: PMC6101269  NIHMSID: NIHMS942074  PMID: 30140146

Abstract

We consider the recovery of a continuous domain piecewise constant image from its non-uniform Fourier samples using a convex matrix completion algorithm. We assume the discontinuities/edges of the image are localized to the zero level-set of a bandlimited function. This assumption induces linear dependencies between the Fourier coefficients of the image, which results in a two-fold block Toeplitz matrix constructed from the Fourier coefficients being low-rank. The proposed algorithm reformulates the recovery of the unknown Fourier coefficients as a structured low-rank matrix completion problem, where the nuclear norm of the matrix is minimized subject to structure and data constraints. We show that exact recovery is possible with high probability when the edge set of the image satisfies an incoherency property. We also show that the incoherency property is dependent on the geometry of the edge set curve, implying higher sampling burden for smaller curves. This paper generalizes recent work on the super-resolution recovery of isolated Diracs or signals with finite rate of innovation to the recovery of piecewise constant images.

Index Terms: Off-the-Grid Image Recovery, Structured Low-Rank Matrix Completion, Finite Rate of Innovation

I. Introduction

The direct recovery of continuous domain signals by convex optimization is emerging as a powerful alternative to traditional discrete domain compressed sensing [1]–[3]. The ability of these continuous domain “off-the-grid” schemes to minimize discretization errors makes them attractive in practical applications, where only the low-pass measurements of the signal are available. The history of such continuous domain signal recovery algorithms dates back to Prony [4], where the recovery of a linear combination of exponentials from uniform samples is considered. Prony-like algorithms recover the signal by estimating an annihilating polynomial whose zeros correspond to the frequencies of the exponentials. Work by Liang et al. [5], [6] and the finite rate of innovation (FRI) framework [7] extended Prony-like methods to recover more general signals that reduce to a sparse linear combination of Dirac delta functions under an appropriate transformation (e.g., differential operators, convolution). Recently, several authors have further extended FRI methods to recover such signals from their non-uniform Fourier samples [3], [8]– [11] by exploiting the low-rank structure of an enhanced matrix built from Fourier data (e.g., a Hankel matrix in 1-D). Recovery guarantees exists for certain classes of these signals when the singularities are isolated and well-separated [2], [3], [12].

The signal models discussed above have limited flexibility in exploiting the extensive additional structure present in multi-dimensional imaging problems. In particular, the edges in multidimensional images are connected and can be modeled as smooth curves or surfaces. While discrete image representations to capture this structure have been the subject extensive research [13], [14], similar continuous domain representations have attracted less attention. We recently introduced a novel framework recover piecewise polynomial images, whose edges are localized to smooth curves, from their uniform [15], [16] and non-uniform [11] Fourier samples; our framework generalizes a recent extension of FRI models to curves [17]. We assume that the partial derivatives of the signal vanish outside the zero level-set of a bandlimited function, which is only true for piecewise smooth signals. This relation translates to a linear system of convolution equations involving the uniform Fourier samples of the partial derivatives, which can be compactly represented as the multiplication of a specific structured matrix with the Fourier coefficients of the bandlimited function. We have introduced theoretical guarantees for the recovery of such images from uniform samples [15], [16]. Our earlier work has shown that the structured matrix built from the Fourier coefficients of piecewise constant images is low-rank [11], [16], which we used to recover the image from its non-uniform Fourier samples with good performance in practical applications. We have also introduced an computationally efficient algorithm termed as GIRAF, which works on the original signal samples rather than the structured high-dimensional matrix [18], [19]; the computational complexity of this algorithm is comparable to discrete total variation regularization, which makes this scheme readily applicable to large-scale imaging problems, such as undersampled dynamic magnetic resonance image reconstruction [20].

The main focus of the present paper is to introduce theoretical guarantees on the recovery of continuous domain piecewise constant images from non-uniform Fourier samples via a convex structured low-rank matrix completion algorithm. Our main result shows number of non-uniform samples to recovery the image is proportional to the complexity of the edge set, as measured by the bandwidth of the edge set function, and an incoherence measure related to the edge set geometry. We additionally show that the recovery is robust to noise and model-mismatch.

The proof of the main result builds off of [3], which proved similar recovery guarantees for the recovery of multi dimensional isolated Diracs from non-uniform Fourier samples by minimizing the nuclear norm of an “enhanced” multi-level Hankel matrix. This work showed that the number of samples necessary for recovery depends the number of Diracs and on an incoherence measure of the signal, that can be defined solely in terms of the relative locations of the Diracs. However, the theory in [3] relies heavily on an explicit factorization of the enhanced matrix (e.g., Vandermonde factorization of a Hankel matrix in the 1-D case), which is only available when the number of singularities are isolated and finite. Since the singularities in the proposed class of piecewise constant images (i.e., the image edges) are not isolated nor finite, the recovery guarantees in [3] cannot be directly extended to our setting. Instead, to achieve our result, we give a new characterization of the row and column spaces of the structured matrix arising in our setting. We show this new characterization allows us to derive an incoherence measure based solely on geometric properties of the edge set. In particular, we derive an upper bound for the incoherence measure that is related to the size of edge set curve. The results show that high sampling burden is associated with the estimation of images with smaller piecewise constant regions, which is consistent with intuition.

We note that the signal models in [1]–[3] do not include the class of piecewise constant images considered in this work. In particular, all of the above models assume the discontinuities to be finite in number and well separated, unlike in our setting. Recently, [12] adapted the results in [3] to introduce recovery guarantees for Fourier interpolation of a variety of finite-rate-of-innovation signal models [7], including piecewise constant functions. However, these results are limited to the 1-D setting and share the assumption than the discontinuities/innovations of the signal are finite and isolated. Furthermore, the structured matrix lifting considered in this work is different than those considered in [3] and [12]. Specifically, the structured matrix lifting in this work consists of two vertically concatenated multi-level Toeplitz matrices (i.e., block Toeplitz with Toeplitz blocks), whose entries are built from the weighted Fourier coefficients of the images. This is substantially different from the structured matrix liftings considered in [3] (unweighted, one block, single block multilevel Hankel) and [12] (weighted, one block, single-level Hankel). Finally, we note that a preliminary version of the results presented in this has been published previously in the conference paper [21] without proofs. The present work includes considerably more details and proofs, and major improvements to the main theorem.

A. Notation

Bold lower-case letters x are used to indicate vector quantities, bold upper-case X to denote matrices, and calligraphic script 𝒳 for general linear operators. We typically reserve lower-case greek letters μ, γ, etc. for trigonometric polynomials (3) and upper-case greek letters Λ,Ω, etc. for their coefficient index sets, i.e. finite subsets of the integer lattice ℤ2, with cardinality denoted by |Λ|. We write Λ+Ω for the dilation of the index set Ω by Λ, i.e. the Minkowski sum {k + : k ∈ Λ, ∈ Ω}, and write 2Λ to mean Λ + Λ, 3Λ = 2Λ + Λ, etc. We also denote the contraction of Ω by Λ by Ω:Λ = { ∈ Ω: k ∈ Ω for all k ∈ Λ}.

II. Background

A. 2-D Piecewise Constant Images with Bandlimited Edges

In this work we consider a continuous domain piecewise constant model for images,

f(r)=i=1Nai1Ui(r),forallr=(x,y)[0,1]2, (1)

where ai ∈ ℂ, 1U denotes the characteristic function of the set U, and each Ui ⊂ [0, 1]2 is a simply connected regions with piecewise smooth boundaries ∂Ui. We study the recovery of such an image from a sampling of its Fourier coefficients specified by

f^[k]=[0,1]2f(r)e-j2πk·r;kΩ2. (2)

Following [16], we further assume that the edge set of the piecewise constant image, specified by E := ∪iUi, coincides with the zero set of a 2-D bandlimited function:

E={r[0,1]2:μ(r)=0},withμ(r)=kΛc[k]ej2πk·r, (3)

where the coefficients c[k] ∈ ℂ, and Λ is a finite subset of ℤ2. We call any function μ in the form (3) a trigonometric polynomial, and we say μ is bandlimited to Λ, i.e., the Fourier coefficients μ̂ are supported within Λ. For short, we will write {μ = 0} for the zero set of μ considered as a subset of [0, 1]2.

Define the degree of a trigonometric polynomial μ, denoted by deg(μ) = (K,L) to be the linear dimensions of the smallest rectangle containing the support set {k : μ̂ [k] ≠ 0}. In [16] we proved that for every curve E given by the zero set of a trigonometric polynomial, there exists a unique minimal degree trigonometric polynomial1 μ0 such that E = {μ0 = 0} and if μ is any other trigonometric polynomial with {μ0 = 0} ⊂ {μ = 0}, then deg(μ0) ≤ deg(μ) entrywise. By extension, we define the degree of a curve E to be equal to the degree of of its minimial degree polynomial μ0. We also say the curve E is bandlimited to Λ0 ⊂ ℤ2, where Λ0 is the minimal rectangular index set containing the support of μ̂. Intuitively, the degree/bandwidth of a curve gives a quantitative measure of its complexity. For example, in [16] we show the number of connected components of a curve is bounded in terms of its degree.

B. Recovery from uniform Fourier samples

We have shown in [16] that when μ is any trigonometric polynomial that vanishes on the edge set of the piecewise constant image f, the gradient f = (∂xf,yf) satisfies the property

μf=0, (4)

where equality in (4) is understood in the sense of distributions (see, e.g., [22]). The spatial domain annihilation relation (4) translates directly to the following convolution annihilation relation in Fourier domain:

kΛf^[-k]μ^[k]=0,2. (5)

Here f^[k]=j2π(kxf^[k],kyf^[k]) for k = (kx, ky). Note the equations in (5) are linear with respect to the coefficients μ̂.

Suppose we have access to samples of the Fourier coefficients on a finite rectangular grid Γ ⊂ ℤ2, and suppose μ is bandlimited to Λ1 ⊂ Γ. Then we can build the system of equations in (5) for all belonging to the index set Λ2 ⊂ Γ, where Λ2 is the set of all integer shifts of Λ1 contained in Γ. In this case (5) can be compactly represented in matrix form as

T(f^)h=[Tx(f^)Ty(f^)]h=0, (6)

where 𝒯x(), 𝒯y() ∈ ℂ|Λ2|×|Λ1| are matrices corresponding to the discrete 2-D convolution with the arrays kx[kx, ky] and kx[kx, ky] for (kx, ky) ∈ Γ, respectively (after omitting the inconsequential factor j2π). Here we use h to denote the vectorized version of the filter (μ̂ [k] : k ∈ Λ1), where the index set Λ1 is called the filter support. The matrices 𝒯x() and 𝒯y() have a block Toeplitz with Toeplitz blocks structure. See Figure 2 for an illustration of the construction of 𝒯 ().

Fig. 2.

Fig. 2

Construction of the structured matrix lifting 𝒯 () considered in this work. From a rectangular array of the Fourier coefficients [kx, ky] of a continuous domain image f(x, y), the weighted arrays kx [kx, ky] and ky [kx, ky] are constructed. The matrices 𝒯x() and 𝒯y() are then obtained by extracting all vectorized patches from the weighted arrays, and loading these into the rows of 𝒯x() and 𝒯y(). The resulting matrices 𝒯x() and 𝒯y() have a block Toeplitz with Toeplitz blocks structure. Finally 𝒯 () is formed by vertically concatenating the blocks 𝒯x() and 𝒯y().

Equation (6) shows that 𝒯 () is rank deficient, since it has the non-trivial vector h in its nullspace. In addition, when the filter support Λ1 defining 𝒯 () is sufficiently big, we can also show 𝒯 () is low-rank. This is because if μ0 is the minimal degree polynomial for the edge set, then any multiple of μ = γ · μ0 bandlimited to Λ1 will satisfy the annihilation equation (4). In Fourier domain, this means the vector

h=((μ^0γ^)[k]:kΛ1) (7)

is in the nullspace of 𝒯 (). Hence if the filter support Λ1 is larger than support Λ0 of μ0, 𝒯 () has a large nullspace and is low-rank. The following result from [16] gives an exact characterization of the rank of 𝒯 (), which will be important for this work:

Theorem 1

[16] Suppose f is a piecewise constant image (1) whose edge set E = {μ0 = 0} is the zero set of a trigonometric polynomial μ0 bandlimited to Λ0. Let 𝒯 () be built with filter size Λ1 ⊇ Λ0, then

rankT(f^)Λ1-Λ1:Λ0 (8)

where |Λ1| is the number of indices in Λ1 and |Λ1: Λ0| is the number of integer shifts of Λ0 contained in Λ1. Moreover, equality holds in (8) if Γ ⊇ 2Λ10 and if the edge set does not contain any singular points. In this case, the nullspace of 𝒯 () consists of all vectors in the form (7).

Note that R := |Λ1||Λ1: Λ0| is a measure of the bandwidth of μ0 and hence is indicative of the complexity of the edge set curve E = {μ0 = 0}. In the remainder of this work we assume the conditions in Theorem 1 that guarantee the equality rank 𝒯 () = R holds, in particular Γ ⊇ 2Λ10.

If we take Λ1 = Λ0, the above result shows Fourier samples of in Γ ⊇ 3Λ0 is sufficient for the recovery of the minimal degree polynomial μ0, since in this case μ̂0 can be identified as the unique non-trivial nullspace vector of 𝒯 (). The following theorem states that once μ0 is available, f is the unique solution to the annihilation equations (4) and (5):

Theorem 2

[16]. Suppose f is a piecewise constant image (1) whose edge set E = {μ0 = 0} is the zero set of a trigonometric polynomial μ0 bandlimited to Λ0. Suppose the Fourier sampling set Γ ⊇ Λ0. If gL1([0, 1]2) satisfies

μ0g=0subjecttog^[k]=f^[k]forallkΓ, (9)

then g = f almost everywhere.

In principle, this result allows us to solve for the amplitudes of regions of the piecewise constant function f by plugging in the known μ0 into the equation (9) and solving a linear system, similar to Prony’s method. However, for complicated piecewise constant images with many regions, it may be more practical to use the approximations introduced in [16].

III. Recovery from non-uniform Fourier samples

The theory presented in Section I shows that the exact recovery of a continuous domain piecewise constant image with a bandlimited edge set is possible when we collect Fourier samples of the image on a sufficiently large uniform grid in Fourier domain. However, the recovery procedure breaks down when we have non-uniform or missing samples, which is often the case in practical settings, e.g., compressed sensing MRI [23]. Therefore, we propose and analyze a method to interpolate the missing samples to a uniform grid in Fourier domain, which guarantees full recovery of the image in spatial domain.

Recall that Theorem 1 says that the structured matrix 𝒯 () built from the Fourier coefficients [k], k ∈ Γ, where Γ ⊂ ℤ2 is a uniform rectangular grid, is known to be low-rank precisely when f is a piecewise constant image with a bandlimited edge set. Hence we propose to recover [k], k ∈ Γ from its samples at non-uniform locations Ω ⊂ Γ as the solution to the convex matrix completion problem: min

ming^[k],kΓT(g^)subjecttog^[k]=f^[k]forallkΩ (10)

where ||·||* denotes the nuclear norm, i.e., the sum of the singular values of a matrix, which is the convex relation of the rank functional. Note that (10) is different than the standard low-rank matrix completion setting studied in [24], [25] in that the low-rank matrix 𝒯 () is structured and parameterized by the coefficient vector . Similar structured low-rank matrix completion schemes have been proposed for the recovery of signals from non-uniform Fourier samples [3], [12] and used with empirical success in MRI applications [10], [11], [26]. The main focus of this paper is to determine the sufficient number of samples that will ensure exact recovery of the Fourier coefficients of f on the reconstruction grid Γ with high probability.

A. Role of incoherence

Several authors have shown that the sufficient number of samples for low-rank matrix recovery by nuclear norm minimization to succeed is dependent on the incoherence of the sampling basis with respect to the matrix to be to be recovered [3], [25]. Similarly, our results depend on an incoherence measure derived from the structure of the matrix 𝒯 () and properties of the piecewise constant image f. In particular, define ℘U and ℘V to be the orthogonal projections onto the column space and row space of 𝒯 (), respectively, i.e., if 𝒯 () = UΣV* is the rank-R singular value decomposition then ℘UX = UU*X, ℘VX = XVV*. In Appendix B, we show that the structured matrix 𝒯 () can be expanded using orthonormal basis of matrices Ak such that

T(f^)=kΓ/{0}f^[k]w[k]Ak (11)

where w[k], k ∈ Γ/{0} are a set of positive weights that do not depend on . Similar to results in [3], [12], [25], we prove that nuclear norm minimization (10) recovers the exact low-rank solution with high probability provided we can uniformly bound the norms of the projections of the sampling basis matrices Ak onto the row and column spaces of 𝒯 ():

Proposition 3

Consider 𝒯 () of rank R corresponding to a piecewise constant function f whose edge set coincides with the zero set of μ0, let ρ be the incoherency measure associated with μ0 to be defined in the sequel, and set cs = |Γ|/|Λ1|. Then we have

maxkΓPUAkF2ρRcsΓ, (12)
maxkΓPVAkF2ρRcsΓ (13)

The proof in Section VIII-F relies on the row and column spaces of 𝒯 () derived in Lemma 8 and Lemma 6 in the next section. These results will be used in the derivation of the main theorem in Section IX.

B. Main Results

We now present our main results, which determine the sufficient number of random Fourier samples for the convex structured low-rank matrix completion program (10) to succeed with high probability. Our first theorem addresses the case of recovery from noiseless Fourier samples:

Theorem 4

Let f be a continuous domain piecewise constant image (1), whose edge-set is described by the zero-set of the trigonometric polynomial μ0 bandlimited to Λ0 (see (3)). Let Ω ⊂ Γ be an index set drawn uniformly at random within Γ. Then there exists a universal constant c > 0 such that the solution to (10) is f̂ with probability exceeding 1 − |Γ|−2, provided

Ω>cρcsRlog4Γ, (14)

where R = |Λ1||Λ1 : Λ0| = rank 𝒯 (), cs = |Γ|/|Λ1|, c is a universal constant, and ρ ≥ 1 is an incoherence measure depending on the geometry of the edge-set, to be defined in the sequel.

To better understand the dependence of the bound in (14) on the filter size Λ1 and the edge set bandwidth Λ0, assume for simplicity that Λ1 is some dilation of Λ0, that is, Λ1 = αΛ0, where α > 1 is an integer. In this case, the factor cs R in (14) simplifies to

(Λ1-Λ1:Λ0Λ1)Γ(α2-(α-1)2α2)Γ2Γα. (15)

Therefore, assuming the other constants in (14) are fixed, the number of measurements sufficient for exact recovery is proportional to the reciprocal of the dilation factor α. This suggests taking the filter size Λ1 to be as large as allowed by Theorem 4. Namely, Λ1 should satisfy 2Λ1 + Λ0 = Γ, i.e., the side-lengths of filter support Λ1 should be roughly half those of the reconstruction grid Γ. Fixing the filter support Λ1 to obey this bound, then Γ = (2α + 1)Λ0, and so |Γ| ≤ (2α + 1)2|Λ0|. Inserting this bound into (15) gives

csR=O(αΛ0). (16)

Combined with (14), this shows that the number of measurements sufficient for exact recovery is on the order of |Λ0|, up to incoherence and log factors.

The proof of Theorem 4, detailed in Appendix B, is in line with the approach of [3]. In particular, we prove the result by constructing an approximate dual certificate using the well-known “golfing scheme” of [25]. The main differences between in the proof of the above result and that in [3] results from the differences in the matrix structure and hence the characterization of the incoherency between the row and column subspaces of 𝒯 () with the sampling basis. In particular, the matrix 𝒯 () we consider is obtained by stacking two block Toeplitz with Toeplitz blocks (BTTB) matrices whose entries are the weighted Fourier coefficients of f, as opposed to a single unweighted BTTB matrix in [3]. The approach in [3] relies on an explicit low-rank factorization of a BTTB matrix in terms of Vandermonde-like matrices2. Since this factorization is not available in our setting, we use algebraic properties of trigonometric polynomials to give a new characterization of the row and column spaces of the matrix. In particular, we show in Section IV that similar Vandermonde-like basis matrices exist for the row and column space of the lifted matrix, and use these to derive a related incoherence measure that satisfies the bounds in Prop. 3.

C. Recovery in the presence of noise and model-mismatch

We now generalize (66) to the setting where we have noisy or corrupted Fourier samples

f^n[k]=f^[k]+η[k],kΩ, (17)

where η[k] ∈ ℂ is a vector of noise. In this case, we pose recovery as

ming^T(g^)subjecttoPΩ(f^n-g^)2δ. (18)

where δ > 0 is an estimate of the 2-norm of the error ||η||, and ℘Ω denotes projection onto Ω. We make no assumptions on the statistics of the noise η. In particular, η can represent errors due to model-mismatch, such as when the image is not perfectly piecewise constant, or when the edge set of the image does not coincide perfectly with the zero level-set of a bandlimited function.

The following theorem shows that when the deviation of n from is small, the modified recovery program (18) recovers a solution that is close in norm to under the same sampling conditions as Theorem 4.

Theorem 5

Let f be specified by (1), whose edge-set is described by the zero-set of the trigonometric polynomial μ0 bandlimited to Λ0 with associated incoherence measure ρ. Let Ω ⊂ Γ be an index set drawn uniformly at random within Γ such that |Ω| satisfies the bound (14) in Theorem 4. If the measurements f̂n satisfy ||Ω(n)||2δ, then the solution ĝ to (18) satisfies

T(f^)-T(g^)F5Γ2δ. (19)

with probability exceeding 1 − |Γ|−2.

See Section IV in the Supplementary Materials for proof. The bound (19) allows us to quantify the effect of model-mismatch on recovery. In particular, suppose the image fn represents a perturbation from an ideal piecewise constant image f such that their difference in L2-norm is δ-small:

fn-fL22=([0,1]2fn(r)-f(r)2dr)12δ. (20)

Then by Parseval’s theorem, the measurements of n satisfy ||℘Ω(n)||2δ, hence Theorem 5 applies. From (19) we obtain the bound ||𝒯 () − 𝒯 (ĝ)||F ≤ 5|Γ|2||fnf||L2. This shows that if the image fn is close to the ideal piecewise constant image f in spatial domain L2-norm, then the matrix 𝒯 (ĝ) we recover using (18) will be close in norm to 𝒯 () with high probability.

IV. Row and column spaces of 𝒯 () and incoherence

In this section we define an incoherence measure ρ that satisfies the desired bounds in Prop. 3. We show that the incoherence measure depends only on the geometry of the edge set of the image. The incoherence measure is derived from a new characterization of the row and column spaces of the matrix 𝒯 () in terms of Vandermonde-like basis matrices.

A. Row and column spaces of 𝒯 (f̂)

Our first lemma gives a basis for the row space of 𝒯 ():

Lemma 6

A basis of the row space of 𝒯 () is given by the columns of the |Λ1| × R Vandermonde-like matrix

Erow(P):=1Λ1(ej2πk1·r1ej2πk1·rRej2πkΛ1·r1ej2πkΛ1·rR) (21)

where {k1,, k|Λ1|} is a linear indexing of elements in Λ1, and P = {r1,, rR} is a set of R = |Λ1||Λ1 : Λ0| distinct points on the edge set curve {μ0 = 0} chosen such that the columns of Erow are linearly independent.

The careful reader will have noticed that Lemma 6 takes for granted the existence of a set of points P = {r1, ...., rR} ⊂ {μ0 = 0} such that the columns of Erow(P) is linearly independent. Call such a set P a set of admissible nodes for the curve {μ0 = 0}. The following result shows that sets of admissible nodes always exist and are easy to construct:

Lemma 7

Let μ0 be bandlimited to Λ0. Any set of MR + |Λ0| distinct points on the curve {μ0 = 0} contains a subset of R points that are a set of admissible nodes.

The next lemma shows that we can characterize the column space of 𝒯 () in a similar way as the row space:

Lemma 8

A basis of the column space of 𝒯 () is given by the columns of the 2|Λ2| × R weighted Vandermonde-like matrix:

Ecol(P)=1Λ2(w1,xw1ej2πk1·r1wR,xwRej2πk1·rRw1,xw1ej2πkΛ2·r1wR,xwRej2πkΛ2·rRw1,yw1ej2πk1·r1wR,ywRej2πk1·rRw1,yw1ej2πkΛ2·r1wR,ywRej2πkΛ2·rR), (22)

where where {k1,, k|Λ2|} is a linear indexing of elements in Λ2 and P = {r1, ...., rR} is a set of admissible nodes for the curve {μ0 = 0}. The weight vectors wi = (wi,x, wi,y), are described by the formula (52) in Appendix VIII, and depend only on the edge set {μ0 = 0}, the nodes P, and the filter support Λ1.

See Section VIII-C for the proofs of Lemmas 6 and 7, and Section VIII-E for the proof of Lemma 8.

B. Incoherence measure

We now show how to define an incoherence measure ρ that satisfies the desired bounds in Prop. 3. Consider the Gram matrix G(P) = [Erow(P)]*Erow(P), where P is any set of R points r1,, rR on the edge set curve {μ = 0}. It is easy to see from the definition (21) that the entries of G(P) are specified by

(G(P))i,j=1Λ1DΛ1(ri-rj),1i,jR, (23)

where DΛ1(r) := Σk∈Λ1ej2πk·r is the Dirichlet kernel supported on Λ1. Note that G(P) has ones along the diagonal, and the magnitude of the off-diagonal entries is dictated by the distances |rirj | and the filter support Λ1. We now define the incoherence measure ρ associated with the edge set E = {μ0 = 0} in terms of G(P).

Definition 9

Suppose the edge set curve E = {μ0 = 0} has bandwidth Λ0 (see (3)), and set R = |Λ1||Λ1: Λ0|. Define the incoherence measure ρ by

ρ=minP{μ0=0}P=R1λmin[G(P)], (24)

where λmin[G(P)] is the minimum eigenvalue of G(P).

Put in words, among all possible arrangements of R points along the edge-set {μ0 = 0}, we seek the arrangement such that the minimum eigenvalue G(P) is as large as possible. Intuitively, the optimal arrangement will maximize the minimum separation distance among the R points, and ρ can be thought of as a measure of this geometric property. In particular, edge set curves that enclose a small area, and hence require the points P to be closely spaced along the curve, will result in a large value of ρ. According to Theorem 4, the measurement burden will be high for such curves.

Note that curves corresponding to a particular bandwidth can come in different sizes. Specifically, for a fixed μ0 with bandwidth Λ0 consider the family of curves {μ0 = α}, where α is a scalar. One can change α to obtain multiple curves with exactly the same bandwidth, each of which correspond to a different levelset of μ0. These level-sets will have different incoherence measures, depending on how large or small the level-set curves are. This shows the incoherence of an edge set captures something besides its bandwidth. See Figure 3 for an illustration.

Fig. 3.

Fig. 3

Illustration of edge set incoherence measure ρ. In (a) are the level-sets of trigonometric polynomial μ0 bandlimited to Λ0 of size 3×3. These curves all have the same bandwidth, Λ0, but come in different sizes. In (b)–(d) we show R = 24 nodes on the curve giving the indicated bound on incoherence parameter ρ defined in (24), assuming a filter Λ1 of size 7×7. Observe that the incoherence measure increases as the curve gets smaller. This indicates the smaller curves have a significant sampling burden.

We can give incoherency measure of an edge set a more precise geometric interpretation based on the minimum separation distance of a set of admissible nodes. We generalize a bound on the condition number of Vandermonde matrices derived in [27] to the case of the Vandermonde-like matrix (21), and use this to derive a bound for the incoherence parameter ρ.

Theorem 10

Assume that the points P={(xi,yi)}i=1R belonging to the curve {μ0 = 0} satisfy |xixj | > Δ and |yiyj | > Δ for all ij. Assume the filter support Λ1 ⊂ ℤ2 is a square region symmetric around the origin of size Λ1×Λ1. Then

ρ(1-1Λ1Δ)-2, (25)

where ρ is the incoherence parameter (24) associated with the curve {μ0 = 0}.

See Section I of the Supplementary Materials for the proof. The bound in (25) shows that the incoherence is close to one (i.e., is as small as possible) when Δ1/Λ1. Since Δ is the spacing between each pair of points on the curve, to achieve a larger Δ spacing, and hence a smaller ρ, requires a larger curve. This suggests that fewer measurements are required to recover a larger curve, which is consistent with the findings in the isolated Dirac setting [27], [28].

V. Numerical Experiments

A. Algorithms

For small to moderate problem sizes the nuclear norm minimization problem (10) can be solved efficiently with the alternating directions method of multipliers (ADMM) algorithm, which results in a modification of the singular value thresholding (SVT) algorithm [29]. This approach has been proposed for related structured low-rank matrix completion problems in several works, e.g., [3], [11], [12], [30]. We adopt this approach here as well for our small-scale numerical experiments. A detailed implementation of this algorithm can be found in, e.g., [28]. However, we note that for large scale problems, such as those encountered in realistic imaging applications, more efficient approaches need to be adopted, because often in these cases the lifted matrix is too large to be held in memory. A fast algorithm for solving an approximation to (10) for large-scale problems is given in [19].

B. Phase transitions

In Fig. 4, we study the probability of exact recovery under different assumptions on the filter size and edge set of the image. For these experiments the reconstruction grid Γ was of size 65 × 65. We generated synthetic random piecewise constant functions with known edge set bandwidth (see Fig. 3(c)), and attempted to recover their Fourier coefficients in Γ from random samples in Ω at the specified undersampling factor. For each set of parameters we ran 10 random trials. We count the recovery as “exact” if the recovered coefficients satisfied ||f0||/||0|| < 10−3, where 0 is the ground truth. The exact recovery rate was then obtained by averaging over the 10 trials.

Fig. 4.

Fig. 4

Phase transition experiments. We generated random piecewise constant images with known edge-set bandwidth and study the success rate proposed structured low-rank matrix completion scheme under two conditions: in (a) we vary the filter size Λ1 while keeping the edge-set bandwidth K0 fixed, in (b) we vary the edge-set bandwidth K while keeping the filter size fixed. Examples of the randomly generated data are shown in (c).

First, in Fig. 4(a), we studied the effect of changing the filter size Λ1 on the recovery while keeping other parameters constant. We fixed the edge-set bandwidth to |Λ0| = 9×9 and varied the filter size as |Λ1| = (2K +1)×(2K +1) for K = 1,, 30. We call K the filter bandwidth. Note that Theorem 4 has restrictions on how large Λ1 can be. The maximum filter bandwidth for which Theorem 4 holds in this case was K = 15 (red line in Figure 4(a)), however we extended the filter size to observe the behavior of the algorithm outside of this regime. As predicted by Theorem 4, we find that the optimal performance is obtained when Λ1 is the largest as allowed by Theorem 4 (roughly half the size of Γ in each dimension).

Next, in Fig. 4(b), we study the recovery as a function of the bandwidth of the edge-set of the image. The filter bandwidth was fixed at K = 15, and we varied the edge-set bandwidth as |Λ0| = (2K0 +1)×(2K0 +1). The phase transition shows dependence |Ω| = O(|Λ0|) as predicted by Theorem 4.

C. Comparison with TV minimization on real MRI data

We also compare the proposed Fourier domain interpolation scheme against standard discrete TV minimization in spatial domain:

minuNTV(u)subjecttoPΩ(Fu)=PΩ(Fu0). (26)

Here u ∈ ℂN with N = NxNy is a 2-D array representing a discrete Nx × Ny image, u0 ∈ ℂN is the image to be recovered, F ∈ ℂN×N denotes the unitary 2-D discrete Fourier transform (DFT) matrix acting on N1 × N2 arrays, PΩ is projection onto the index of sampling locations Ω ⊂ [Nx]×[Ny], and TV (·) denotes the (isotropic) total variation semi-norm:

TV(u)=i=1N((1u)i2+(2u)i2)12 (27)

where 1 and 2 are finite difference operators in the horizontal and vertical directions, respectively. The problem (26) has been studied extensively [31]–[36] as a model for undersampled MRI reconstruction and other inverse problems in imaging.

In Fig. 5 we perform an experiment comparing against TV minimization and the proposed approach on real MRI data. For this experiment we used a fully-sampled four-coil single-slice acquisition consisting of 256 × 256 Cartesian k-space samples, which was compressed to a single virtual coil using an SVD-based technique [37]. The data in the single virtual coil was observed to have smoothly varying complex phase in image domain. To compensate for this source of model-mismatch, we further pre-preprocessed the data by removing the complex phase in image domain. We note that this preprocessing step is unrealistic for a true MRI experiment. However, the optimization problem (10) could be modified to incorporate an estimate of the smoothly varying phase in the measurement model; we omit this step for simplicity. Finally, we retrospectively undersampled the pre-processed virtual single coil data, taking 50% uniform random samples. We find that the proposed structured low-rank recovery shows significant improvement recovery error over standard total variation as measured by SNR = 20log10(||||/||*f̂||), where * is the recovered data and is the ground truth. The error images indicate the proposed method more faithfully recovers the true edges of the image.

Fig. 5.

Fig. 5

Recovery of MRI data from 2-fold random uniform undersampling. Error images shown in bottom right.

VI. Discussion

Discrete domain total-variation minimization has played a central role in compressed sensing from its inception [31], [32], which models the image to be recovered as (approximately) piecewise constant. Since the present work can be thought of as an extension of compressed sensing type guarantees to the continuous domain setting, it is fruitful to explore the connections between our continuous domain model and discrete domain total variation.

At first glance, the structured low-rank matrix completion problem (10) may seem far removed from the TV-minimization problem (26). But, in fact, one can show TV-minimization (26) is equivalent to nuclear norm minimization of a related structured matrix lifting in Fourier domain. Specifically, (26) is equivalent to

minvC(Fu)subjecttoPΩ(Fu)=PΩ(Fu0). (28)

Here

C(Fu)=[Cx(Fu)Cy(Fu)]2N×N (29)

and 𝒮x(Fu), 𝒮y(Fu) are block circulant with circulant blocks matrices whose first column is specified by the arrays vx = F∂xu and vy = F∂xu. Assuming circular boundary conditions, we can write (vx)[kx, ky] = (1 − ej2πkx/Nx)(Fu)[kx, ky] and (vy)[kx, ky] = (1 − ej2πky/Ny)(Fu)[kx, ky].

We find it interesting to use this re-formulation of TV-minimization to better understand the proposed approach. In Table I we summarize the similarities and differences. One essential difference is the dimensions of the matrix liftings. In particular, the matrix lifting we propose has dimensions 2|Λ2| × |Λ1|, with |Λ1||Λ2| whereas the matrix lifting associated with TV in (28) has dimensions 2N × N. If the reconstruction grid size is the same in both cases, i.e., |Γ| = N, then the proposed matrix lifting has substantially fewer columns than the one associated with TV. This is due to our assumption that edge set of the image has low bandwidth. In other words, we restrict the degrees of freedom of the model by constructing a lifting with fewer columns. We believe this difference may explain the success of the proposed method over TV-minimization observed empirically in Section V.

TABLE I.

Comparison of proposed scheme with discrete total variation minimization

TV-minimization Proposed

Spatial domain discrete continuous
Derivative operator finite differences exact derivative
Singularity set discrete points connected curves

Frequency domain discrete discrete
Frequency weighting wi[k] 1 − ej2πki/Ni j2πki
Lifted matrix structure two-level circulant two-level Toeplitz
Rank of lifted matrix sparsity of discrete gradient bandwidth of edge set

VII. Conclusion

We derived performance guarantees for the recovery of piecewise constant images from random non-uniform Fourier samples via a convex structured low-rank matrix completion problem. This was achieved by adapting results in [3] to the case of a low-rank block two-fold Toeplitz matrix with an additional weighting scheme that arises naturally when considering piecewise constant images. We also define incoherence measures that rely only on geometric properties of the edge set, which indicate that the sampling burden is higher for images with smaller piecewise constant regions.

The recovery guarantees in this work studied the case of uniform random samples. However, in practice we observe that recovery works well with when considering other types of variable density random sampling, where the low spatial frequencies are more heavily sampled. It would be interesting to adapt our results to a wider variety of sampling distributions, and to identify the optimal sampling strategy for signals belonging to our image model.

Fig. 1.

Fig. 1

Annihilation of a piecewise constant function as a multiplication in spatial domain (top) and as a convolution in Fourier domain (bottom). The partial derivatives of a piecewise constant function are supported on the edge set. If there is a bandlimited function μ that is zero along the edge set, then the spatial domain product of μ with the gradient ∇f = (∂xf,yf) is identically zero. In Fourier domain, this is equivalent to the annihilation of the arrays j2πkxf[kx, ky] and j2πkyf[kx, ky] by 2-D convolution with a finite filter determined by the Fourier coefficients μ̂.

Acknowledgments

This work is supported by grants NIH 1R01EB019961-01A1 and ONR N00014-13-1-0202.

VIII. Appendix A: Incoherence Bounds

A. Notation and Preliminaries

To simplify our arguments, we will convert the linear operators 𝒯() and 𝒯()* defined in Fourier domain to linear operators acting on spaces of trigonometric polynomials (3) in spatial domain. Specifically, for any index set Ω ⊂ ℤ2, let BΩ denote the vector space of all trigonometric polynomials that have coefficients supported within Ω. Similarly, we denote the space of vector fields ρ = (ρ1, ρ2) with components ρ1, ρ2BΩ as BΩ2. We set 𝒮(f) = ℱ𝒯() ℱ−1, where ℱ is the Fourier transform of a periodic function on [0, 1]2. For any index set Λ, define the Dirichlet kernel DΛ1 (r) := Σk∈Λ1 ej2πk·r. For all φBΛ1, the action of the linear operator S(f):BΛ1BΛ22 can be expressed compactly as

S(f)φ=DΛ2(φf)BΛ12, (30)

where φf is understood as a tempered distribution, and the convolution is applied separately to each vector field component. Here convolution with DΛ2 is a bandlimiting operation. Simliarly, for ρ=(ρ,ρ2)BΛ22, the adjoint 𝒮(f)* acts as

S(f)ρ=DΛ1(ρ·f)BΛ1 (31)

which is the spatial domain equivalent of the adjoint matrix 𝒯()*. More expliclty, if f = 1U where U is a simply connected region with smooth boundary ∂U, a straightforward argument using the divergence theorem shows that the function 𝒮(f)φ is given pointwise as the weighted curve integral

(S(f)φ)(r)=UDΛ2(r-r)n(r)ds(r), (32)

for all r ∈ [0, 1]2, where n(r′) is the outward unit normal to the curve ∂U at r′, and ds is the arc-length element. Likewise, 𝒮(f)*ρ is the function given pointwise by

(S(f)ρ)(r)=UDΛ1(r-r)[ρ(r)·n(r)]ds(r), (33)

for all r ∈ [0, 1]2. These formulas can be generalized to an arbitrary piecewise constant function f = Σi ai1Ui by linearity. However, in the remainder we focus on the case where f = 1U to simplify our arguments.

B. Fundamental subspaces of 𝒮(f) and dimensions

Under the conditions of Theorem 1, the nullspace of 𝒯() is spanned by shifts of the minimal annihilating filter, μ0^. In spatial domain, this space consists of all multiples of the minimal degree polynomial γ = η μ0 such that γ is bandlimited to Λ1. We denote this space by

(μ0)Λ1:={ημ0:ηBΛ1:Λ0}. (34)

Note that (μ0)Λ1 is a subspace of BΛ1 with dimension |Λ1: Λ0|. Therefore, the dimension of the kernel of 𝒮(f), denoted by ker 𝒮(f), is given by

dimkerS(f)=Λ1:Λ0. (35)

By the rank-nullity theorem, the dimension of the image of 𝒮(f), denoted by im 𝒮(f), is

dimimS(f)=Λ1-Λ1:Λ0=R. (36)

Likewise, the dimension of the coimage im 𝒮(f)* is also R. Furthermore, since im 𝒮(f)* = [kerS(f)], we have

imS(f)=(μ0)Λ1 (37)

This means that any γBΛ1 is in the row space if and only if γ is orthogonal to every trigonometric polynomial of the form η μ0BΛ1, or equivalently,

γ,η,μ0=[0,1]2γ(r)η(r)μ0(r)¯dr=0 (38)

for all ηBΛ1: Λ0.

C. Basis for the coimage of 𝒮(f) (corresponding to the row space of 𝒯(f̂))

Let s ∈ [0, 1]2, and set φsBΛ1 to be the translated Dirichlet kernel:

φs(r)=DΛ1(r-s)forallr[0,1]2. (39)

Equivalently, φsBΛ1 is the trigonometric polynomial specified in Fourier domain as

φs^[k]={e-j2πs·kifkΛ10ifkΛ1. (40)

Observe that the inner product of φs with any other trigonometric polynomial ηBΛ1 is given by the point-evaluation of η at s:

η,φs=kΛ1η^[k]ej2πk·s=η(s). (41)

Suppose now that the point s satisfies μ0(s) = 0. In this case, we see that φs is necessarily in the coimage imS(f)=(μ0)Λ1 since we have

γμ0,φs=γ(s)μ0(s)=0. (42)

for any multiple of the minimal polynomial γμ0BΛ1, i.e., any element in ker 𝒮(f) = (μ0)Λ1.

We will now show how to construct a basis for the coimage of 𝒮(f) out of elements having the form φri for some ri, i = 1,,R belonging to the zero set of μ0. For an arbtirary collection of R points {ri}i=1R{μ0=0}, we are not guaranteed that the set of functions {φri}i=1R is linearly independent. However, we will show that there exists a constant M = M0,Λ1) such that for any M distinct points {ri}i=1M{μ0=0} we can always find a subset of R linearly independent basis functions from the collection {φri}i=1M. The constant M is related the maximum number of isolated zeros that a system of two trigonometric polynomials can have. The following lemma, which is a consequence of the BKK bound in enumerative algebraic geometry (see, e.g., [38]), puts a bound on M. See section II of the supplementary material for proof.

Lemma 11

Let Λ1 and Λ0 be rectangular index sets such that Λ0 ⊂ Λ1, and set R = |Λ1||Λ1: Λ0|. For any μ0, μ1 trigonometric polynomials bandlimited to Λ0 and Λ1, respectively, the maximum number M of isolated solutions of μ0(r) = μ1(r) = 0 is bounded as

M<R+Λ0. (43)

We now prove equivalents of Lemma 6 and Lemma 7 in terms of the spatial domain operator 𝒮(f):

Lemma 12

Let {r1,., rN} be any collection of N distinct points on the curve {μ0 = 0}, where N ≥ R + |Λ0|. Then the coimage space imS(f)=(μ0)Λ1 is spanned by the set of shifted Dirichlet kernels φi(r) = DΛ1 (rri) for all i = 1,,N, i.e.,

span{φri}i=1N=(μ0)Λ1. (44)

In particular, there exists a subset of R = |Λ1||Λ1: Λ0| elements of {φri}i=1N that is a basis for the coimage space (μ0)Λ1.

Proof

All the functions φri are in (μ0)Λ1 since we have ≪φi, γμ0〉 = γ(ri)μ0(ri) = 0 because each ri belong to the zero set of μ0. This implies that

span{φri}i=1M(μ0)Λ1. (45)

Our focus is on proving (44) with equality. For this, it is sufficient to show that any vector orthogonal to span{φi}i=1N is in (μ0)Λ. Assume that there is a vector η(r) ∈ BΛ1 that is in the orthogonal complement space of span{φi}i=1N. This is only possible if

η,φi=η(ri)=0,foralli=1,,N. (46)

Therefore, both η and μ0 have N zeros in common. By Lemma 11 this is only possible if η contains μ0 as a factor. This implies that all vectors in the orthogonal complement space of span{φi}i=1M are in (μ0)Λ1, or equivalently

span{φri}i=1M(μ0)Λ1, (47)

which together with (45) proves (44).

Finally, we also know that the dimension of (μ0)Λ1 is equal to R <M. Thus, one can select a subset of R basis functions φi that are linearly independent and hence a basis for (μ0)Λ1.

Translating this result to Fourier domain, we see that the row space of 𝒯() is spanned by the vectors of Fourier coefficients (φi^[k]:kΛ1)Λ1, for i = 1,,R. Equivalently, this can be expressed as the columns of the Vandermonde-like matrix Erow specified by (21), which proves Lemma 6 and 7.

D. Discretization of curve integrals: quadrature formula

Using the results from the previous subsection, we now introduce a quadrature formula for curve integrals, which we will use to determine the range space im 𝒮(f) in the next subsection.

Let γ be any function in BΛ for any Λ ⊇ Λ0. Then from the orthogonal decomposition BΛ=(μ0)Λ(μ0)Λ we can decompose γ as

γ(r)=i=1SaiDΛ(r-ri)+φ(r)μ0(r), (48)

where S = |Λ||Λ: Λ0|, and where {DΛ(r-ri)}i=1S defines a basis of (μ0)Λ. Here, the coefficients ai in (48) are obtained uniquely as

[a1aS]=D-1[γ(r1)γ(rS)], (49)

where D ∈ ℝS×S is the symmetric matrix with entries [D]i,j = DΛ(rirj) for 1 ≤ i, jS. The above expression can be compactly expressed as a = D−1g, where g = (γ(r1),, γ(rS))T.

Lemma 13

Let f = 1U where U is a simply connected region with smooth boundary ∂U, which is the zero levelset of μ0BΛ0 and let γBΛ. Consider the curve integral of the form

q=Uγ(r)n(r)ds(r), (50)

where n(r) = ∇f(r)/|f(r)| is the unit normal on the curve ∂U. The curve integral can be evaluated using the quadrature formula

q=i=1Sγ(ri)wi, (51)

where the S = |Λ||Λ : Λ0| points {ri}i=1S belong to the curve {μ0 = 0}, and the cooresponding weight vectors wi ∈ ℝ2, i = 1, .., S, are specified by

[w1wS]=D-1[v1vS]. (52)

where vi = ∮∂U DΛ(rri)n(r)ds(r) ∈ ℝ2.

Proof

Decomposing γ(r) using (48), we obtain

Uγ(r)n(r)ds(r)=i=1SaiUDΛ(r-ri)n(r)ds(r):=vi (53)

The above sum can be expressed in the vector form as

i=1Saivi=aV=gD-1V (54)

where V=[v1T,,vST]TR×2. Setting W=D-1V=[w1T,,wST]TR×2 we obtain (51).

E. Basis for the range of 𝒮(f) (corresponding to the column space of 𝒯(f̂))

We now introduce a basis set for im 𝒮(f), which will be used to prove Lemma 8.

Lemma 14

The range of 𝒮(f), denoted by im 𝒮(f) is specified by

imS(f)=span{wiDΛ2(r-ri)}i=1R (55)

for an appropriate choice of points {ri}i=1R{μ0=0} with R = |Λ1||Λ1: Λ0|, and where the weight vectors wi are specified by (52).

Proof

Consider an arbitrary element ρ = (ρ, ρ2) ∈ im 𝒮(f). We can express ρ as ρ = 𝒮(f)ψ = ℬΛ2 (ψf) = DΛ2 * (ψf) for some ψBΛ1. By the definition in (33), we have

ρ(r)=Uψ(s)DΛ2(r-s)n(s)ds=i=1Sψ(ri)DΛ2(r-ri)wi, (56)

where we Lemma 13 in the last step with S = |Γ||Γ : Λ0| since the integrand ψ(s)* DΛ2 (rs) belongs to BΓ. The above relation shows that any ρ(s) ∈ im 𝒮(f) can be expressed as the linear combination of the functions DΛ2 (sri)wi, for i = 1, .., S. Thus, we have imS(f)span{DΛ2(r-ri)wi}i=1S. We also know that dim (im 𝒮(f)) = R < S. This implies that we can select a subset of R vectors from the set {DΛ2(r-ri)wi}i=1S that are linearly independent, which will span im 𝒮(f), and hence define a basis.

Correspondingly, the column space of 𝒯() is spanned by the Fourier coefficients of the basis vectors wiDΛ2 (rri), or the columns of the 2|Λ2| × R weighted Vandermonde-like matrix Ecol specified by (22).

F. Incoherence Bounds

1) Projection onto row subspace

Let Erow = Erow(P) be any basis for the row space V of 𝒯() specified by (21), whose columns are vectorized Fourier coefficients of the translated and normalized Dirichlet kernels φi(r)=1Λ1DΛ1(r-ri), i = 1,,R, for some set of admissible nodes P = {r1,, rR} ⊂ {μ0 = 0}. Projecting the measurement basis matrix Ak onto V, we have

PVAkF2=AkErow(ErowErow)-1ErowF2[λmin(ErowErow)]-1AkErowF2

Since Ak selects (k)| rows of Erow, each of which has R entries of magnitude 1/Λ1, we have

AkErowF2=1ω(k)·R·ω(k)·1Λ1=RΛ1=RcsΓ (57)

where cs = |Γ|/|Λ1|. Hence,

PVAkF2[λmin(ErowErow)]-1RcsΓ. (58)

Minimizing over all sets of admissible nodes P in the construction of Erow gives the final bound

PVAkF2ρRcsΓ. (59)

2) Projection onto column space

Let Ecol = Ecol(P) be a basis for the column space of 𝒯() specified by (22), whose columns are vectorized Fourier coefficients of the translated and weighted Dirichlet kernels 1Λ2wiwiDΛ2(r-ri), for some set of admissible nodes P = {r1,, rR} ⊂ {μ0 = 0}. Observe the columns of Ecol are defined to have unit 2-norm. Following the same steps as in the row space bound, we have

PUAkF2=Ecol(EcolEcol)-1EcolAkF2[λmin(EcolEcol)]-1EcolAkF2

Expanding the norm EcolAkF2 gives

EcolAkF2=1Λ2i=1R1ω(k)ω(k)|,wiwi|2RΛ2RcsΓ.

Hence, we have

PUAkF2ρRcsΓ. (60)

where ρ′ is defined similarly to ρ as:

ρ=minP{μ0=0}P=R1λmin[Ecol(P)Ecol(P)], (61)

Finally, we show how to bound ρ′ by ρ in (60). Observe that we can re-define ρ and ρ′ in terms of the minimum singular value of the basis matrices Erow(P) and Ecol(P), according to the correspondences:

λmin(Ecol(P)Ecol(P))=σmin2(Ecol(P)),λmin(Erow(P)Erow(P))=σmin2(Erow(P)).

We will show σmin2(Erow(P))σmin2(Ecol(P)), or equivalently, [λmin(Ecol(P)*Ecol(P))]−1 ≤ [λmin(Erow(P)*Erow(P)]−1, for any set P consisting of R points on the edge set. The claim then follows immediately by taking the minimum over all such sets P.

To ease notation, we drop the dependence on the set P in the following. Observe that we can express Ecol as

Ecol=[EcolWxEcolWy] (62)

where Wx=diag(w1,xw1,,wR,xwR),Wy=diag(w1,yw1,,wR,ywR), and col ∈ ℂ|Λ2|×R is the Vandermonde-like matrix given entrywise by [col]i,j = ej2πki·ri, for all ki ∈ Λ2, 1 ≤ jR. In other words, col has the same structure as Erow, but is built with respect to Λ2 instead of Λ1. In particular, since we always assume Λ1 ⊂ Λ2, the matrix Erow can be embedded as a submatrix of col by restricting the rows of col to those indexed by Λ1. By the variational characterization of the minimum singular value of a matrix, we have

σmin2(Ecol)=minu=1Ecolu2=minu=1EcolWxu2+EcolWyu2σmin2(Ecol)(Wxu2+Wyu2)=1 (63)

Finally, since Erow is a submatrix of col, we also have σmin2(Erow)σmin2(Ecol), which together with (63) gives the desired inequality.

IX. Appendix B: Proof of Main Theorem

A. Reformulation in lifted domain

We now reformulate the recovery of as a matrix recovery problem in the lifted domain. The matrices 𝒯x() and 𝒯y() contain several copies of the weighted entries kx[k] and ky [k], respectively. We use ω(k) to denote the set of locations (α1, α2) in the matrix 𝒯x() or 𝒯y() that contain the entry kx[k] or ky[k] (this set is the same in either case).

We define the sampling matrices Ak=[A1,kA2,k]2Λ2×Λ1, for each k = (k1, k2) ∈ Γ, where

(Ai,k)α={kikωi(k),ifα=(α1,α2)ω(k)0else (64)

for i = 1, 2. The matrices {Ak}k∈Γ form an orthonormal basis for the space of matrices defined by the range of the matrix lifting 𝒯; we will call any matrix in the range of 𝒯 a structured matrix. For any set of coefficients {ĝ[k]}k∈Γ we can expand the structured matrix 𝒯(ĝ) as

T(g^)=kΓg^[k]kwi(k)Ak. (65)

We denote the projection operator corresponding to a single sampling location k by 𝒜k(X) = 〈Ak,XAk. Since {Ak}k∈Γ is an orthonormal basis, for any structured matrix X, we have Σk∈Γ 𝒜k(X) = 𝒜(X) = X. Since Ak is not the basis for a general X ∈ ℂ2|Λ2|×|Λ1|, we also define the projection operator to the space orthogonal to the space of structured matrices by 𝒜(X) = (ℐ − 𝒜)(X), where ℐ is the identity operator. In particular, the constraint 𝒜(X) = 0 implies that X is a structured matrix.

The recovery of f from its partial Fourier samples [k], k ∈ Ω, can thus be reformulated as the completion of a structured matrix X from its measurements 𝒜k, k ∈ Ω. Since the matrix is structured, we have 𝒜(X) = 0. We thus reformulate (10) as the structured low-rank recovery problem:

minimizeXXsubjecttoQΩ(X)=QΩ(T(f^)), (66)

where 𝒬Ω that satisifies 𝔼[𝒬Ω] = ℐ is defined as:

QΩ=ΓΩAΩ+A (67)

B. Conditions for perfect recovery

The tangent space T of the matrix X is defined as T:={UX1H+X2VH:X1Λ2×R,X2Λ1×R} where X = UΛVH is the singular value decomposition of X. The orthogonal complement of T is denoted by T. We first show that if ℘T ≈ ℘T𝒬ΩT, and if an approximate dual certificate that satisfies certain conditions exist, we obtain perfect recovery.

Lemma 15

Consider a multiset Ω that contains m random indices. Suppose the sampling operator 𝒬Ω obeys

PT-PTQΩPT12 (68)

and there exists a dual certificate matrix W satisfying

QΩ(W)=0 (69)
PT(W-UV)F16n (70)
PT(W)12. (71)

Then, 𝒯() is the unique solution to (66), where n = |Γ| and m = |Ω|.

See Section III-A of supplementary material for proof. Equation (68) suggests that 𝒬Ω ≈ ℐ on the tangent space. The conditions (69), (70), and (71) indicates the existence of a W, which approximates the exact dual certificate UV*. The above lemma is in line with [3, lemma 1], with the exception of the third condition, indicated by (70). To satisfy (68), we bound the deviation of ℘T𝒬ΩT from ℘T in the following lemma.

Lemma 16

Suppose (12) holds. Then we have

PT-PTQΩPTε12 (72)

with probability exceeding 1 − n−4, provided that m > c1ρR cs log(n).

We prove this using [39, Theorem 1.6]. (See Section III-B of supplementary material)

C. Construction of the approximate dual certificate W

We will now use the golfing scheme of [3], [25] to construct an approximate dual certificate W, which satisfies (69), (70), and (71). In particular, we generate j0 independent random sampling sets Ωi; 1 ≤ ij0, each containing = m/j0 samples corresponding to sampling with replacement. We start with F0 = UV*, and follow the following steps:

  1. F0 = UV* and set j0-3log1εn.

  2. i(1 ≤ ij0), Fi = ℘𝒯(ℐ − 𝒬Ωi)℘T (Fi−1)

  3. W=j=1j0QΩiFj-1

Step 3 ensures that W satisfies (69) since each term Wi = 𝒬ΩiFj−1 satisfies QΩ(Wi)=0. The recursive construction also satisfies (70). In particular,

PT(W-UV)F=PTFj0Fεj0F0F=εj0Rεj0n

Now we focus on showing that W satisfies (71). Note that if j0 is chosen as 3log1εn, assuming n > 6, we have (ε)j0n<16n.

Lemma 17

For any matrix M, there exists some numerical constant c2 such that

(I-QΩ)(M)c2nlognmMA,2+c2nlognmMA,, (73)

with probability at least 1 − n−10. Here,

MA,=maxkΓ|Ak,Mωk| (74)
MA,2=kΓAk,M2ωk (75)

See Section III-C of supplementary material for proof.

Lemma 18

Assume that there exists a constant μ5 such that ωkPT(Ak)A,2μ5Rn. For any matrix M, we have

PT[(I-QΩ)(M)]A,2c3μ5Rlognm(MA,2+nlognmMA,),

with probability at least 1 − n−10.

See Section III-D of the Supplementary Materials for proof.

Lemma 19

For any matrix MT, there exists some numerical constant c4, such that

PT[(I-QΩ)(M)]A,c4ρcsRlognmρcsRnMA,2+c4ρcsRlognmMA,, (76)

with probability at least 1 − n−10.

See Section III-F of the Supplementary Materials for proof. From the golfing scheme, we have PT(W)j=1j0PTQΩiPTFj-1. Using lemma 17 and substituting from lemma 18 and lemma 19, we have

PTQΩiFj-1(12)j0-1c2{nlognmF0A,2+nlognmF0A,}

The last inequality holds if = m/j0 ≫ max (μ5, ρcs)Rlog n. Substituting for j0=3log1ε(n) assumed in the golfing scheme, we require mc6 max (μ5, ρcs)Rlog2 n to satisfy the above inquality. See Section III-G of the Supplementary Materials for details. We will now present the lemmas bounding ||F0||𝒜,2 and ||F0||𝒜,∞, where F0 = UV*.

Lemma 20

With the incoherence measure ρ, one can bound

UVA,ρcsRn (77)
UVA,22c7μ3cslog2(n)Rn (78)
PT(ωαAα)A,22c7μ3cslog2(n)Rn,αΓ (79)

for μ3 = 3ρ and c7 is some constant.

See Section III-H of the Supplementary Materials for proof. From (79), we see that the constant μ5 in lemma 19 can be chosen as μ5 = c7 μ3 cs log2(n) such that ωkPT(Ak)A,2μ5Rn. Substituting for μ5, we observe that the dominant term has its dependence on log4(n). Thus, ||℘T 𝒬ΩiFj−1||< 1/2 if m > c6c7 cs (3ρ) R log4(n).

Footnotes

1

More precisely, μ0 is unique up to multiplication by a phase factor ej2πk·r for some k ∈ ℤ2.

2

The structured matrices considered in [3] are block Hankel with Hankel block matrices (BHHB), but this difference is purely cosmetic: every BTTB matrix can be re-expressed as BHHB after a permutation of its rows and columns. In particular, the Vandermonde-like factorization of BHHB matrices in [3] carries over to BTTB matrices.

Contributor Information

Greg Ongie, Department of EECS, University of Michigan, Ann Arbor, MI 48108 USA.

Sampurna Biswas, Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA, 52245 USA.

Mathews Jacob, Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA, 52245 USA.

References

  • 1.Bhaskar BN, Recht B. Atomic norm denoising with applications to line spectral estimation. Communication, Control, and Computing, 2011 49th Annual Allerton Conference on; IEEE; 2011. pp. 261–268. [Google Scholar]
  • 2.Candès EJ, Fernandez-Granda C. Towards a mathematical theory of super-resolution. Communications on Pure and Applied Mathematics. 2014;67(6):906–956. [Google Scholar]
  • 3.Chen Y, Chi Y. Robust spectral compressed sensing via structured matrix completion. Information Theory, IEEE Trans on. 2014;60(10):6576–6601. [Google Scholar]
  • 4.Stoica P, Moses RL. Introduction to spectral analysis. Vol. 1 Prentice hall; Upper Saddle River, NJ: 1997. [Google Scholar]
  • 5.Haacke E, Liang ZP, Izen S. Superresolution reconstruction through object modeling and parameter estimation. Acoustics, Speech and Signal Processing, IEEE Trans on. 1989 Apr;37(4):592–595. [Google Scholar]
  • 6.Haacke EM, Liang ZP, Izen SH. Constrained reconstruction: A superresolution, optimal signal-to-noise alternative to the fourier transform in magnetic resonance imaging. Medical Physics. 1989;16(3):388–397. doi: 10.1118/1.596427. [DOI] [PubMed] [Google Scholar]
  • 7.Vetterli M, Marziliano P, Blu T. Sampling signals with finite rate of innovation. Signal Processing, IEEE Trans on. 2002;50(6):1417–1428. [Google Scholar]
  • 8.Jin KH, Lee D, Ye JC. A general framework for compressed sensing and parallel MRI using annihilating filter based low-rank hankel matrix. 2015 preprint arXiv:1504.00532. [Google Scholar]
  • 9.Jin KH, Lee D, Ye JC. A novel k-space annihilating filter method for unification between compressed sensing and parallel mri. IEEE ISBI. 2015
  • 10.Haldar JP. Low-rank modeling of local k-space neighborhoods (LORAKS) for constrained MRI. Medical Imaging, IEEE Trans on. 2014;33(3):668–681. doi: 10.1109/TMI.2013.2293974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ongie G, Jacob M. Recovery of piecewise smooth images from few fourier samples. SampTA. 2015:543–547. doi: 10.1137/15M1042280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ye JC, Kim JM, Jin KH, Lee K. Compressive sampling using annihilating filter-based low-rank interpolation. IEEE Trans on Information Theory. 2016 [Google Scholar]
  • 13.Starck JL, Candès EJ, Donoho DL. The curvelet transform for image denoising. Image Processing, IEEE Trans on. 2002;11(6):670–684. doi: 10.1109/TIP.2002.1014998. [DOI] [PubMed] [Google Scholar]
  • 14.Do MN, Vetterli M. The contourlet transform: an efficient directional multiresolution image representation. Image Processing, IEEE Trans on. 2005;14(12):2091–2106. doi: 10.1109/tip.2005.859376. [DOI] [PubMed] [Google Scholar]
  • 15.Ongie G, Jacob M. Super-resolution MRI using finite rate of innovation curves. IEEE ISBI. 2015 [Google Scholar]
  • 16.Ongie G, Jacob M. Off-the-grid recovery of piecewise constant images from few fourier samples. 2015 May;:543–547. doi: 10.1137/15M1042280. arXiv:1510.00384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pan H, Blu T, Dragotti PL. Sampling curves with finite rate of innovation. Signal Processing, IEEE Trans. on; 2014. [Google Scholar]
  • 18.Ongie G, Jacob M. A fast algorithm for structured low-rank matrix recovery with applications to undersampled MRI reconstruction. IEEE International Symposium on Biomedical Imaging; 2016; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ongie G, Jacob M. Giraf: A fast algorithm for structured low-rank matrix recovery giraf: A fast algorithm for structured low-rank matrix recovery. 2016 doi: 10.1109/isbi.2016.7493322. arXiv:1609.07429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Balachandrasekaran A, Ongie G, Jacob M. Accelerated dynamic MRI using structured low rank matrix completion. 2016 IEEE International Conference on Image Processing (ICIP); Institute of Electrical and Electronics Engineers (IEEE); 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ongie G, Biswas S, Jacob M. Structured matrix recovery of piecewise constant signals with performance guarantees. International Conference on Image Processing; 2016; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Strichartz RS. A Guide to Distribution Theory and Fourier Transforms. World Scientific Pub Co Pte Lt; 2003. [Google Scholar]
  • 23.Lustig M, Donoho D, Pauly JM. Sparse mri: The application of compressed sensing for rapid mr imaging. Magnetic resonance in medicine. 2007;58(6):1182–1195. doi: 10.1002/mrm.21391. [DOI] [PubMed] [Google Scholar]
  • 24.Candès E, Recht B. Exact matrix completion via convex optimization. Commun ACM. 2012 Jun;55(6):111–119. [Google Scholar]
  • 25.Gross D. Recovering low-rank matrices from few coefficients in any basis. Information Theory, IEEE Trans on. 2011;57(3):1548–1566. [Google Scholar]
  • 26.Shin PJ, Larson PE, Ohliger MA, Elad M, Pauly JM, Vigneron DB, Lustig M. Calibrationless parallel imaging reconstruction based on structured low-rank matrix completion. Magnetic Resonance in Medicine. 2013 doi: 10.1002/mrm.24997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Moitra A. Super-resolution, extremal functions and the condition number of vandermonde matrices. Proceedings of the 47th Annual ACM on Symposium on Theory of Computing; 2015; pp. 821–830. [Google Scholar]
  • 28.Ye JC, Kim JM, Jin KH. Compressive sampling using structured low-rank interpolation. 2015 preprint arXiv:1511.08975. [Google Scholar]
  • 29.Cai JF, Candès EJ, Shen Z. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization. 2010;20(4):1956–1982. [Google Scholar]
  • 30.Fazel M, Pong TK, Sun D, Tseng P. Hankel matrix rank minimization with applications to system identification and realization. SIAM Journal on Matrix Analysis and Applications. 2013;34(3):946–977. [Google Scholar]
  • 31.Candès EJ, Romberg J, Tao T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans on information theory. 2006;52(2):489–509. [Google Scholar]
  • 32.Candes EJ, Romberg JK, Tao T. Stable signal recovery from incomplete and inaccurate measurements. Communications on Pure and Applied Mathematics. 2006;59(8):1207–1223. [Google Scholar]
  • 33.Needell D, Ward R. Near-optimal compressed sensing guarantees for total variation minimization. IEEE Trans on Image Processing. 2013;22(10):3941–3949. doi: 10.1109/TIP.2013.2264681. [DOI] [PubMed] [Google Scholar]
  • 34.Needell D, Ward R. Stable image reconstruction using total variation minimization. SIAM Journal on Imaging Sciences. 2013;6(2):1035–1058. [Google Scholar]
  • 35.Krahmer F, Ward R. Stable and robust sampling strategies for compressive imaging. IEEE Trans on image processing. 2014;23(2):612–622. doi: 10.1109/TIP.2013.2288004. [DOI] [PubMed] [Google Scholar]
  • 36.Poon C. On the role of total variation in compressed sensing. SIAM Journal on Imaging Sciences. 2015;8(1):682–720. [Google Scholar]
  • 37.Zhang T, Pauly JM, Vasanawala SS, Lustig M. Coil compression for accelerated imaging with cartesian sampling. Magnetic resonance in medicine. 2013;69(2):571–582. doi: 10.1002/mrm.24267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Li T, Wang X. The BKK root count in cn. Mathematics of Computation of the American Mathematical Society. 1996;65(216):1477–1484. [Google Scholar]
  • 39.Tropp J. User-friendly tail bounds for sums of random matrices. Foundations of computational Math. 2012;12(4):389–434. [Google Scholar]

RESOURCES