Abstract
This paper presents a technique for solving the multichannel blind deconvolution problem. The authors observe the convolution of a single (unknown) source with K different (unknown) channel responses; from these channel outputs, the authors want to estimate both the source and the channel responses. The authors show how this classical signal processing problem can be viewed as solving a system of bilinear equations, and in turn can be recast as recovering a rank-1 matrix from a set of linear observations. Results of prior studies in the area of low-rank matrix recovery have identified effective convex relaxations for problems of this type and efficient, scalable heuristic solvers that enable these techniques to work with thousands of unknown variables. The authors show how a priori information about the channels can be used to build a linear model for the channels, which in turn makes solving these systems of equations well-posed. This study demonstrates the robustness of this methodology to measurement noises and parametrization errors of the channel impulse responses with several stylized and shallow water acoustic channel simulations. The performance of this methodology is also verified experimentally using shipping noise recorded on short bottom-mounted vertical line arrays.
I. INTRODUCTION
Acoustic remote sensing, such as ocean tomography and geoacoustic inversion, typically relies on a controlled active source to probe the environment.1,2 In this case, the channel impulse response (CIR), which is the Green's function for a given ocean environment, can readily be obtained from recorded signals since the broadcast source signal is known. A potential alternative for performing acoustic remote sensing is to use sources of opportunity (e.g., radiating ships) instead of deploying and operating controlled sources. However, using sources of opportunity means that the broadcast source signal is then unknown; thus, estimating both the CIR and the actual source signal using only recorded signals, a process commonly referred to as blind deconvolution, is generally an ill-posed problem without any additional a priori information or assumptions, especially for propagating sound through an ocean waveguide as it is distorted by multipath effects. This paper develops a scalable methodology that solves the multichannel deconvolution problem for a source of opportunity radiating in an ocean waveguide using a priori information about CIRs between the source and the array of receiver elements. Because the method is not performed in a totally blind manner, it is referred to as myopic deconvolution.
For decades, the multichannel deconvolution problem has been an active field of research with applications in wireless communication,3,4 image restoration,5,6 seismic signal processing,7 and underwater acoustics.8–11 The essence of blind (or myopic) deconvolution rests on the exploitation of either channel structures12 or input source properties, or both. In the underwater acoustics community, the blind deconvolution problem has been studied by many researchers and several algorithms have been proposed.13–16 Traditional multichannel blind deconvolution methods can broadly be categorized into two main approaches: deterministic methods and statistical methods.17 Deterministic methods, such as subspace methods,18 usually reduce the blind deconvolution problem to a regularized least squares problem, hence critically depend on an accurate estimate of the duration of the CIR (i.e., channel length). These methods also entail high computational cost and sensitivity to noise. Statistical methods19,20 typically rely on statistical information about an unknown source signal, which, in practice, require a long recording duration to build up the statistical information.
This paper examines a sensing scenario under the multiple-channel framework [Fig. 1(a)], in which a network of distributed sensors (e.g., a vertical line array) listens to the same source [Fig. 1(b)]. Compared to traditional multichannel blind deconvolution approaches, the proposed myopic deconvolution approach does not rely on accurate channel length estimation or any statistical information about the input source, so it can be used with short recording duration (e.g., a single snapshot). Instead, the proposed approach investigates the multichannel deconvolution problem as a low-rank matrix recovery problem, which significantly differs from the problem formulation of the aforementioned blind deconvolution approaches. Compared to the recently developed single-channel blind deconvolution method using a similar formulation,21 the multichannel deconvolution method exploits the spatial diversity of CIRs across channels to solve the blind deconvolution problem.
FIG. 1.
(Color online) Multichannel deconvolution problem: (a) Problem illustration. (b) Underwater acoustic channel model.
Under the general deconvolution framework [Fig. 1(a)], the multichannel deconvolution problem is typically posed as a bilinear system of equations that can be recast as a system of linear equations whose solution has a rank-1 structural constraint. This system of linear equations is solved using the low-rank matrix recovery tools developed in recent years.22,23 The proposed approach estimates CIRs and source by solving an optimization program. This formulation has the advantage that a priori information about CIRs can be incorporated by adding constraints. Furthermore, this also allows one to use standard numerical techniques to solve the problem on a relatively large scale. The numerical experiments in Sec. IV demonstrate recovery of 100 channels with 200 taps each from a source of length 1000. Additionally, when the dimension of the subspace containing the admissible CIRs is consistent with the available a priori information, and it is small compared to the length of measurements, the myopic deconvolution can be achieved using a small number of receivers.
The remainder of this paper is organized as follows. Section II presents the theoretical formulation of the proposed myopic deconvolution method based on recasting the multichannel deconvolution problem as a low-rank matrix recovery problem. A general formulation of the method based on the assumption that CIRs are time-limited is first introduced, and the formulation is then extended to CIRs that can be represented by a general linear model. Section III introduces a linear model for parameterizing multipath CIRs in ocean waveguides in order to incorporate a priori knowledge of the arrival-time structure of the CIR in the low-rank matrix recovery problem. Section IV first investigates the performance and the robustness of the proposed deconvolution method using numerical simulations of general stylized channels such as time-limited channels and pulse-like channels. This section then presents numerical simulation results for realistic acoustic channels in an ocean waveguide and deconvolution results using at-sea data recorded in shallow water using bottom-mounted short vertical line arrays. Finally, Sec. V summarizes the findings of this article.
II. MULTICHANNEL DECONVOLUTION VIA LOW-RANK RECOVERY
A. The multichannel deconvolution problem scenario
The multichannel deconvolution problem is illustrated in Fig. 1(a). A common source signal s(t) drives K different channels with the CIRs . Over a period of time, we observe samples of convolution outputs , where . From these observations alone, we wish to estimate both and s(t). This problem has direct application in underwater acoustic passive sensing as shown in Fig. 1(b): for instance, assuming sound from a shipping source that propagates toward each hydrophone based on each CIR hk(t). So under this model, each hydrophone from a vertical line array (forming multiple channels) listens to the same ship noise. The goal is to deconvolve the CIRs and the source by measuring only the outputs of a hydrophone array. This problem does not have a unique solution if we know nothing about the source signal s(t) or the CIR hk(t). However, if we make some structural assumptions about the channels, namely that they live in some known subspace with a low dimension, then the problem, which becomes well posed, can be solved by a fast and scalable algorithm.
If we treat both the source s(t) and the CIR hk(t) as unknowns, and our observations consist of linear combinations of entries of s(t) and the hk(t) multiplied by one another, multichannel blind deconvolution is challenging since it is equivalent to solving a system of bilinear equations. To precisely state the problem, an inverse problem is written down using the language of linear algebra. We assume that both the s(t) and hk(t) are band limited, so recovering discretized samples hk[n] of hk(t) and s[n] of s(t), spaced at the corresponding Nyquist rate or closer, is the same as recovering the CIRs themselves and the source (limited to this frequency band). We define the general convolution model as
| (1) |
This problem does not have a unique solution if we assume s[n] and hk[n] have no structure, because many signals and CIRs can produce the same observations yk[ℓ]. Even if we were to observe an infinite number of samples over an infinite amount of time and treat the channels jointly, the hk[n] and s[n] would still not be uniquely determined.
B. FIR channels
The story does improve, however, if we make structural assumptions about the CIR hk[n]. For example, we might assume that the hk[n]'s are time-limited [i.e., finite impulse response (FIR) channels] such that hk[n] can be nonzero only for 0 ≤ n ≤ N - 1. Then we can limit the index summation in the convolution model Eq. (1)
| (2) |
Equation (2) has unknowns, and L bilinear equations that combine these unknowns in different ways: each sample of yk involves a sum over products of various combinations of and . Treating the channels jointly adds more equations and more unknowns. However, since the channels are all being driven by a common source, the number of unknowns is smaller than the number of equations. The system of equations corresponding to the observations has KL equations and unknowns. Thus, when the number of channels K is large and L > N, the number of equations in fact is much larger than the number of unknowns.
Only comparing the number of equations to the number of unknowns does not tell the whole story, especially since the equations are non-linear (bilinear in the unknowns of ). The observations above reveal in some loose sense that as the number of observations grows, so does the ratio of the information we have (equations) versus the information we do not (variables). We take another step toward formalizing this trade-off by recasting the bilinear equations in Eq. (2) as a system of linear equations with a rank constraint. The common source used in Eq. (2) and the CIR hk can be defined as a vector as below
| (3) |
Then we can arrange all pairs of variables appearing in the sum in Eq. (2) in a large matrix X0
| (4) |
Each observation yk[ℓ], an example of which is shown graphically in Fig. 2, is now a linear combination of entries in X0. Concatenating the observations from each channel into a single vector , we form a linear system of equations
| (5) |
where takes sums over skew diagonals of submatrices . We can write out this linear operator in the form of inner products of measurement matrix and the rank-1 matrix , where . Measurement matrix Akℓ defines the skew-diagonal sum operation yielding observation yk[ℓ], and we can write yk[ℓ] as below.
| (6) |
An example of Akℓ is given in Eq. (11). Then for all measurements across channels, we form the following expression:
| (7) |
FIG. 2.
(Color online) To illustrate the proposed approach, the matrix X0 defined in Eq. (4) is explicitly written with a number of measurements per channel of L = 9, channel length N = 3, and a number of channels K = 3. Each observation yk[ℓ] is a sum along one of the skew diagonals of the a submatrix of X0, as illustrated by the red lines above. Recovering an L+N - 1 × NK matrix from measurements of this type is virtually impossible, but explicitly incorporating the fact that X0 has rank-1 structure into the recovery makes the recovery possible for appropriate values of L, N, K.
We can now treat the recovery of the CIR hk and the source s as a matrix recovery problem. The vector y contains KL different linear combinations of entries of a matrix. Regardless of the values of L, N, and K, the number of equations will always be less than the number of entries in the unknown matrix, indicating that the system of linear equations in Eq. (5) is underdetermined. However, by definition in Eq. (4), X0 is a rank-1 matrix. To recover X0 from y, we might search for a matrix that satisfies
| (8) |
For an arbitrary y, a solution that satisfies the constraints above exactly may or may not exist. Given y, we might search instead for such a feasible solution by solving
| (9) |
The deconvolution problem is now recast as a low-rank matrix recovery problem, which can then be solved using low-rank matrix recovery tools developed in recent years, such as nuclear norm minimization, low-rank approximation, iterative hard thresholding, and alternating methods.24 Studies in this area of low-rank matrix recovery have identified effective convex relaxations25,26 for problems of this type and efficient, scalable heuristic solvers that enable these techniques to work with thousands of unknown variables. A description of the heuristic solver used in this paper is given in the Appendix. The Burer-Monteiro heuristic solver is based on the low-rank factorization method and notably efficient in memory. Numerical simulations using this solver are presented in Sec. IV.
C. A general linear model for the CIR
The linear operator in Sec. II B was derived from the assumption that the CIR is time-limited, so the length of each CIR is N. More generally, we can incorporate a priori information about the CIR (e.g., expected arrival-time structure in a multipath environment) into a general linear model, that is, we can assume each CIR hk can be written as
| (10) |
where will be called the channel subspace matrix, is the coefficient vector, N is the length of the vector hk, and D is the number of columns of the matrix Ck. Therefore, the CIR hk lives in a subspace that is the linear span of the columns of the matrix Ck. The time-limited CIR example in Sec. II B is a special case, which corresponds to . We shall note here the length of the channel N no longer needs to satisfy the constraint L > N, but we require that the length of the channel coefficient vector D satisfies L > D, since the number of unknowns of the CIR per channel is now D instead of N.
With the subspace model in place, the goal is now to recover the source s and the channel subspace coefficients . As before, we recast this problem as recovering the rank-1 matrix . Each observation yk[ℓ] is again a linear combination of the entries in X0. Here the new linear operator is implemented with the channel model embedded. For example, we can write out explicitly how to calculate yk[0] using linear algebra with N = 4
| (11) |
In this case, each measurement matrix is the corresponding skew diagonal sum matrix times the channel subspace matrix. We still keep the general notation the same.
III. LINEAR MODELS FOR MULTIPATH CIR IN OCEAN WAVEGUIDES
In Sec. II, the multichannel deconvolution problem has been recast as a low-rank matrix recovery problem, and we require the ambient dimension of the measurements to be bigger than the dimension of the CIR to solve the problem. Because a priori knowledge of the CIR is usually available, dimension reduction of the CIR using a linear model is possible. For instance, in an ocean waveguide, an accurate linear model for CIRs can be fit using a priori knowledge of the environmental parameters (expected variations of the sound-speed profile) and the source-receiver configuration.27 Incorporating a linear model for CIRs reduces the dimension of the subspace spanned by the admissible CIRs such that the deconvolution becomes more tractable numerically. Two types of linear models for the CIR in ocean waveguides are discussed below.
A. The multipath CIR with known arrival-times and unknown amplitudes
Our first CIR model, based on the ray approximation, assumes that the CIR consists of multiple pulses that arrive at different time delays with different amplitudes. This model can be interpreted as the same pulse emitted from a common source but taking multiple paths to reach the receivers, as illustrated in Fig. 1(b). The CIR hk(t) can be written as a sum of pulse function p(t) with different delays t1,t2,…,tD and different amplitudes
| (12) |
In this CIR linear model, we know the pulse profile p(t) [as an example, we assume the pulse is a band limited Gaussian-windowed sinusoid , and the decay parameter τ is determined by the effective bandwidth of the pulse] and pulse time delays t1, t2, …, tD as the a priori information, and the unknowns are the amplitude coefficients of each ray-like arrivals. Discretizing hk(t) and p(t - td) generate vectors and . In terms of linear algebra, Eq. (12) implies that the CIR hk lies in D-dimension subspace spanned by vectors instead of the N-dimension vector length (D < N). The subspace matrix for the CIR hk, which is explained in Eq. (10), is simply constructed by stacking D time-domain pulse basis vectors column-wise. Therefore, Ck can be written as
| (13) |
B. The multipath CIR with uncertain arrival-times and unknown amplitudes
In practice, the arrival-time structure of the CIR in ocean waveguides is never known exactly as a result of environmental fluctuations (e.g., sound speed variations). However, assuming that the range of expected sound-speed variations can be estimated, using for instance an oceanographic model, we can assume that the arrival-times ti (i = 1… D) for each ray arrival lies within a known time-interval Δti, that is, each ray arrival might occur within the time window ti ± Δti. Examining all possible pulses that arrive within one time interval ti ± Δti shows that they form a collection of shifted pulses that lie on a certain region of a manifold. Depending on the range of the time interval, the region of the manifold differs. Now the model for pulses on the manifold in terms of the arrival-time is nonlinear. The manifold needs to be embedded into a linear model that can approximate the CIR by a linear subspace with preferably a low dimension so that the deconvolution method can be implemented. Since all the pulses that lie on the manifold are highly correlated, the principal component analysis (PCA) is performed here to build a low dimension linear subspace for all shifted pulses.
A systematic way to construct the lower dimension subspace based on PCA is introduced in this section. An over-complete subspace matrix, denoted by B1o, is first formed by column-wise stacking vectors that represent all pulses arriving within the time interval ti ± Δti. Using the same notation as in Sec. III A for shifted pulse basis vectors, we form the matrix B1o as
where δt is a tiny shifting grid for each pulse. However, all those time-shifted pulses [i.e., Fig. 3(a) is an example of those shifted pulses] are highly correlated. We then perform a PCA using a singular value decomposition of this collection of shifted pulse basis vectors to reduce its dimension, resulting in a low-dimension linear model for approximating any pulse arriving in this time region. The matrix B1o can be written as . Because the singular value spectrum of this matrix decays to near zero quickly as shown in Fig. 3(b), we only need the first R (R is a small number in this typical example) singular values of B1o to capture nearly all energy (more than 99.999% in simulation) of the singular values. Then the R corresponding singular vectors [i.e., Fig. 3(c) is an example of the first R = 6 singular vectors] form the basis of the low dimension subspace. Column-wise stacking of these R singular vectors generates a matrix B1 that can accurately represent any pulse that arrives within the time region t1 ± Δt. The matrix B1 is written as
| (14) |
The same procedure is performed for each path of the CIR and generates corresponding for each time region. Finally, the subspace matrix Ck, introduced in Eq. (10) for the CIR hk is then formed by
| (15) |
Because of the uncertainty of the paths' arrival-time, the CIR hk is approximated by a subspace with the dimension R × D compared to a subspace of the dimension D in Eq. (13).
FIG. 3.
(Color online) (a) An illustration of the small time shifts of a band limited basis pulse (400–600 Hz). All shifted pulses form an over-complete subspace matrix B1o for any ray-like pulse that arrives within an uncertainty window of duration Δt = 3 ms. (b) The singular value spectrum of the corresponding over-complete matrix B1o. The plot contains the first 20 of 600 singular values. The first R = 6 singular values capture more than 99% of the total energy. (c) The pulse subspace matrix B1, consisting of the R = 6 principal components of B1o.
We need to point out that PCA is not the only method of finding the low dimension subspace for pulses that arrive within a known time region. Based on the a priori information of the CIR structure, other efficient subspaces that can approximate hk with a linear model also exist. For example, the discrete prolate spheroidal sequences28 can form a highly efficient basis that represents band limited and time concentrated pulses, which can also be an adequate linear model for the current application.
IV. NUMERICAL RESULTS
In this section, the multichannel myopic deconvolution method is implemented on various types of channel structures and the effectiveness and robustness of the deconvolution algorithm are demonstrated for both stylized and realistic CIRs. In simulations, the common driving source signal, , where L + N − 1 = 1000, is Gaussian white noise filtered in an arbitrary bandwidth representative of shipping noise spectra (400–600 Hz) with a sampling frequency fs = 2000 Hz.
A. Stylized simulations
Stylized channel structures introduced in Secs. II and III are simulated in this section as direct implementations of the proposed deconvolution method. These general CIR assumptions do not directly represent any specific underwater acoustic channels, but are to illustrate the method. Realistic underwater acoustic channel simulations will be discussed later in Sec. IV C.
1. Time-limited channels
The first simulation investigates the time-limited channel model that corresponds to the scenario illustrated in Fig. 2. For time-limited channels, CIR vectors have finite length, which means that elements of the CIR vector hk[n] are nonzero only for 0 ≤ n ≤ N − 1 and they are normally distributed random variables (i.e., FIR channels). Figure 4(b) illustrates a typical recovery result for parameters N = 200 (the CIR vector length) and K = 100 (number of recording channels). In this case, both the source signal and all of the CIRs (21 000 variables) are recovered with relatively small errors from 100 observed outputs (100 000 total samples). The myopic deconvolution method here benefits from having totally unstructured CIRs, which efficiently increases the mixing of the source signal and CIRs and thus the diversity of observations in the K = 100 recording signals.
FIG. 4.
(Color online) Examples of recovery for finite-duration CIRs (N = 200 taps) using K = 100 channels. (a) Comparison of the original source signal (solid line) and the recovered source signal (dashed line). The relative error is 3 × 10−5 (only the first 300 samples out of 1000 samples are plotted). (b) Comparison of the true underlying CIRs (plotting only the first 50 taps out of N = 200 for the first 3 channels) and the recovered CIRs. The average relative error across all 100 channels is 10−4.
2. Channels with known arrival-times and unknown amplitudes
To examine the method proposed in Sec. III A, we implement a general linear model for CIRs with a finite number of discrete arrivals with a priori known arrival-times but unknown amplitudes. This linear model is realized for multipath channels in an ocean waveguide as introduced in Sec. III. In this simulation scenario, each CIR consisted of three different individual pulse arrivals that correspond to D = 3, defined in Sec. III A. The basis pulse p(t) throughout the rest of simulations is defined as a Gaussian window pulse in the frequency band of 400–600 Hz. The length of the CIR vector is N = 500, and the number of channels is K = 50. Because accurate time delays t1k, t2k, t3k for each pulse are assumed to be known, we only need to estimate the amplitudes of each pulse, which are uk[1], uk[2], uk[3].
Simulation results presented in Fig. 5(a) demonstrate the exact deconvolution of the first five CIRs [the common source signal is also exactly recovered as in Fig. 4(a), but not plotted in Fig. 5]. The method proposed in this article yields an accurate performance if the CIR linear channel model is precise and the dimension of the subspace representing the model is small. The multipath CIR model is only one special example commonly applied in the ocean waveguide. Another typical channel model in other applications is sparse channel, that is, each CIR has D non-zero components at known sparse locations, but we do not know the corresponding amplitudes at each location.29
FIG. 5.
(Color online) (a) Comparison of the actual CIR and recovered CIR for five channels assuming known arrival-times and unknown amplitudes for the three pulse-like arrivals of each CIR. The average relative error across all K = 50 channels is 2 × 10−10. (b) Same as (a) but assuming uncertain arrival-times (Δt = 1.5 ms) and unknown amplitudes. The average relative error across all K = 50 channels is 3 × 10−5.
3. Channels with uncertain arrival-times and unknown amplitudes
The channel model introduced in Sec. III B assuming uncertain arrival-times and unknown amplitudes is examined hereafter. Parameters of the simulation are similar to those used in Sec. IV A 2 except that the arrival-times of each pulse-like arrival of each CIR can lie anywhere within the time interval ti ± Δt (i = 1… D), assuming the same uncertainty window Δt = 1.5 ms for all arrivals for the sake of simplicity. The uncertainty of the arrival-time increases the dimension of the subspace that represents the CIR, and yields a small model error in representing CIRs because of truncating insignificant singular values. The results in Fig. 5(b) demonstrate that the method still accurately estimated the CIR.
B. Robustness analysis
1. The effect of additive measurement noise
The recovery performance in the presence of additive measurement noises is examined in this section. Gaussian measurement noise ek(t) is added on each channel's convolution measurements y1(t),…, yK(t). In this simulation, the CIR model is the same as in Sec. IV A 2. The recovery error and measurement noise are plotted in a log-log scale in Fig. 6. The measurement signal-to-noise ratio (SNR) is defined as , and the CIR recovery relative error is defined as . The result shows that our algorithm is robust to additive noise: as the noise level increases, no catastrophic failure occurs, and the recovery error grows linearly. Hence, the proposed deconvolution method recovers the CIR and the source accurately, and the performance is robust to noise when a linear model represented by Eq. (10) exists.
FIG. 6.
(Color online) Variations of the recovery error (in logarithmic scale) for the deconvolution method (averaged over 100 independent realizations) vs SNR of the recorded measurements.
2. The effect of model errors
In the real environment, there is not only the measurement noise, but also model errors that are caused by lack of accurate a priori information to build an exact linear model for the CIR. The model error incurred in representing the CIR by the subspace matrix can be defined as . In the simulation, the channel model error is generated under the assumption that we do not have information about the later and weaker arrival pulses: in typical shallow water environments, those later arrivals have significantly weaker amplitudes and more randomly distributed arrival-times, when compared to the earlier arrivals, as they correspond to ray paths cumulating more reflections from the absorbing ocean bottom and the fluctuating ocean surface. For example, the true CIR shown in Fig. 7(a) consists of five pulses that arrive at time t1, t2, t3, t4, t5, but the a priori information only provides the knowledge of the first three pulses that arrive at time t1, t2, t3. The linear model that is generated by including stronger arrivals only at time t1, t2, t3 would introduce model error in approximating the true CIR. The best approximation for the CIR is also depicted in Fig. 7(a). Channel model error is defined as the normalized Euclidean distance between the true CIR and CIR approximation using the linear model, which is . Figure 7(b) illustrates the method is also robust to this type of channel model error if the model error (averaged here over 100 independent realizations) is within an upper bound of ≈10 dB. It is worth noting that the model error ek can be generated and measured by other variables as well. For example, the model error can be caused by a tilted array rather than a perfect vertical array; therefore, the accuracy of a priori information will depend on a variable that measures the overall array tilt or individual receiver locations more generally.
FIG. 7.
(Color online) (a) Example of channel model error caused by lack of accurate a priori information on the arrival-time structure of the actual CIR (upper plot) containing five arrivals. Lower plot shows the approximated CIR assuming only three arrivals are present. (b) Variations of the recovery error (in logarithmic scale) for the deconvolution method (averaged over 100 independent realizations) vs channel model error.
C. Numerical simulations in an ocean waveguide
In this section numerical simulations are conducted in a generic shallow water environment to assess the performance of the myopic deconvolution method for the case of a surface source arbitrarily set at depth of 5 m, which represents a shipping source. This source is assumed to broadcast Gaussian white noise filtered in the frequency band 400–600 Hz as a simple model for shipping noise.30 Figure 8(a) describes the shallow water environment used for the simulations. All environmental parameters including the sound speed profile [Fig. 8(b)] were selected to be representative of the experimental scenario and the actual environment introduced in Sec. IV D. Additionally, as implemented in the actual at-sea experiment, only a very short bottom-mounted vertical linear array (VLA) with 16 elements and 1-m spacing located from 128 to 143 m deep were simulated here. The range between the source and the VLA elements was arbitrarily set to 2 km and the corresponding CIR waveforms were computed using the normal-mode formulation implemented with the software KRAKEN [Fig. 8(c), blue line].31
FIG. 8.
(Color online) (a) Computational shallow water waveguide. (b) Experimentally-measured sound speed profile. (c) A priori information of the ray arrival-time structure of the CIRs for K = 16 channels. For each ray arrival, the width of the black shaded area indicates the duration of the corresponding uncertainty time-window 2Δt for the shallow source shown in (a). (d) Comparison of the actual CIRs (blue solid line) obtained from normal-mode simulations and the recovered CIRs (red dashed line) across all K = 16 channels using the deconvolution method for the shallow source shown in (a). The average relative error across all K = 16 channels is ≈10−1.
A priori information of the ray arrival-time structure of the CIRs was obtained independently from a simple ray tracing simulation, using the standard BELLHOP (Ref. 32) model, and used for the CIR parametrization discussed in Sec. III [Fig. 8(b)] (note that no amplitude information for the various paths was used and that the ray simulations do not account for any dispersive effects presented in the actual CIR waveforms computed from normal-mode simulations). A classic shallow-water multipath structure can be observed where for instance the first wavefront corresponds to the direct and first surface-bounced arriving nearly simultaneously for this very shallow source. The uncertainty time window of each simulated ray arrival (of up to Δt = 7 ms) corresponds to variations of the sound-speed profile of ±3 m/s which are representative of the expected variations of the environmental parameters at the experimental site described in Sec. IV D. The ray arrival-time intervals are then further combined if their corresponding uncertainty windows overlap. The resulting ray arrival-time uncertainty windows are then used to construct the CIR subspace using the same method described in Sec. III B. Figure 8(c) shows that a small relative error of more than −20 dB is achieved with the CIRs estimated from myopic deconvolution (red dashed line) when compared to the original CIRs computed from normal-mode simulations (blue solid line) even though the a priori information available about the CIRs is not very accurate in this shallow-ocean acoustic channel application (i.e., resulting in fairly large uncertainty time window of up to Δt = 7 ms).
Furthermore, Figs. 9(a) and 9(b) demonstrate that similar performance can be achieved even when we reduce the number of recording channels to only eight and four channels. These simulation results indicate that the proposed deconvolution method can be used with a very limited number of receivers, provided that fairly accurate a priori information of the arrival-time structure of the CIR is available. This means that the myopic deconvolution method does not require a dense array to perform well, which is advantageous in comparison to some traditional deconvolution methods such as the ray-based deconvolution (RBD) method9 which not only it needs a long aperture, but the array should also be relatively dense with appropriate element spacing for beamforming to work.
FIG. 9.
(Color online) Same as Fig. 8(c) but using only the simulated-received waveforms on (a) K = 8 channels or (b) K = 4 channels for the myopic deconvolution method. The relative error across all channels for both cases is ≈10−1.
D. Deconvolution of experimentally measured shipping noise recordings
The myopic deconvolution method is applied here to shipping noise recorded on a short bottom-mounted vertical line array which was moored in a shallow and nearly range-independent section of the Coronado Bank (water depth ≈150 m), approximately 20 km offshore of San Diego, CA. Based on prior studies, the environmental parameters for the test site were estimated to be similar to those shown in Fig. 8(a). The VLA had 16 elements uniformly spaced by 1 m; with the first element approximately 7 m above the seafloor. Other technical features of the hydrophone array deployment and the electronic system have been described previously.33–35 The research vessel (R/V) New Horizon was used as a surface source of opportunity to test the myopic deconvolution method using the recorded shipping noise data on the VLA. Since no ground truth was available for the actual CIR between the R/V and the VLA, estimated CIRs from the myopic deconvolution were compared to the estimated CIRs independently obtained using a previously described RBD method.
To do so, K = 16 CIRs between the R/V and the VLA were first estimated using RBD for an arbitrarily selected location of the R/V ≈600 m away from the VLA, referred to hereafter as the “library” location. Based on visual inspection of spectrograms of the recorded shipping noise34,35 from the R/V at this distance, its dominant frequency band was found to be 300–800 Hz and the 10-s long snapshot of recorded data was filtered in this frequency band. These K = 16 estimated CIRs with RBD [see Fig. 10(a)] were then used to infer a priori arrival-time structure of the CIRs (in the vicinity of this selected library location) as input to the myopic deconvolution method. In order to estimate the arrival-time uncertainty for each ray-path of the CIRs, we used historical sound-speed profiles measurements from CTD casts collected in the area and representative of the ocean sound speed fluctuations (≈3 m/s) occurring during the time period of the experiments. By running multiple ray simulations using the software BELLHOP for a source-array separation distance of 2 km, through this collection of sound-speed profiles we obtained a maximum uncertainty window of Δt = 3 ms for all considered ray arrivals. An arrival-time window of Δt = 3 ms was used here for each ray-like arrival.
FIG. 10.
(Color online) Experimental results using shipping noise. (a) Estimated CIRs using the library data set from the RBD method when the R/V was ≈600 m from the VLA. (b) Same as (a) but using a different data set (the event set) when the R/V was within the same location on a different day. (c) Comparison of the estimated CIRs from either the RBD (red line) or the myopic deconvolution method (green line) using the same event data set. (d) Comparison of the estimated CIRs with RBD using either the event data set [red line, same as shown in (b)] or the library data set [blue line, same as shown in (a)].
The myopic deconvolution method was then applied to another set of shipping noise data (referred to as “event” data) recorded from the same R/V when it passed again across the library location a day later to estimate the corresponding CIRs (referred to as recovered CIRs) between the R/V and the VLA [see Fig. 10(c)]. As a validation, the CIRs were also estimated independently using the RBD method for the same event data which closely compared to the recovered CIRs from the myopic deconvolution method [see Figs. 10(b) and 10(c)] with an average correlation coefficient across the 16 channels of 0.85. It can also be noted that the CIRs estimated from RBD using the library data (i.e., used only as a priori information for the myopic deconvolution method) and the event data (which matched the CIRs obtained from the myopic deconvolution method) did significantly differ—as expected—due to variations of the environmental parameters across the selected 2 days at this shallow water test site: the average correlation coefficient across the 16 channels was only 0.58. Finally, Fig. 11 shows that the myopic deconvolution method can also be implemented successfully using only a smaller number of receiver elements (respectively, eight and four elements), provided that sufficiently accurate a priori information of the arrival-time structure of the CIRs is available (e.g., using simulated data for the selected test site). If such a priori information is available, this may be a significant advantage of the myopic deconvolution method over the RBD method which requires an array with sufficiently long aperture to beamform individually the different ray arrivals composing the CIR wavefronts.9
FIG. 11.
(Color online) (a) and (b): Same as Figs. 10(c) and 10(d) but using only K = 8 channels for the myopic deconvolution method. (c) and (d): Same as Figs. 10(c) and 10(d) but using only K = 4 channels for the myopic deconvolution method.
V. CONCLUSIONS
This article introduced a method for solving the deconvolution problem using acoustic sources of opportunity and a priori information about the CIR via the low-rank recovery problem formulation. For the shallow-water acoustic channel application, we developed a systematic way to build an efficient and accurate linear model incorporating a priori information about the expected CIRs' arrival-time structure so that the low-rank recovery method can be implemented for the proposed myopic deconvolution method. Stylized numerical simulations demonstrated that the proposed method perfectly deconvolved both the noise source signal and CIRs simultaneously. The proposed method was also shown to be robust in the presence of additive measurement noises and model errors in the CIR parameterization. Furthermore the applicability of the proposed myopic deconvolution method for estimated CIRs in shallow water using shipping noise and short bottom-mounted VLAs was demonstrated both numerically and experimentally. This method is likely to be applicable to other environments supporting waveguide-like propagation (e.g., seismic or structural waveguides). The proposed myopic deconvolution method can naturally be extended to the multiple input multiple output deconvolution problem (i.e., the case of multiple sources of opportunities) and receivers.
ACKNOWLEDGMENT
This work was funded by the Office of Naval Research code 32. The authors would like to thank Dr. William A. Kuperman and the Marine Physical Laboratory team at the Scripps Institution of Oceanography for providing experimental dataset.
APPENDIX
1. The heuristic solver
The multichannel deconvolution problem becomes computationally intractable as the number of channels increases. To recover X0, we are searching over a matrix of size that minimizes the nuclear norm which satisfies the linear constrains. If we want to recover K = 100 channels with each channel of length N = 500, and a source of length L+N − 1 = 1000, then the rank-1 matrix X0 is of size 1000 × 50 000, which the storage becomes an issue in computation. However, the nonlinear programming algorithm developed by Burer and Monteiro23 using low-rank factorization can greatly reduce the number of variables, and in the meanwhile, provides reliable performance adapted in our application.
By construction, we know that the target matrix X0 is rank-1, so instead of iterating matrix X of size to minimize its nuclear norm, we can reformulate the problem to optimize over vectors and , where X = ZHT (note that Z and H correspond to the source signal vector s and CIR vectors h in Eq. (4), respectively). Hence we only need to store two vectors Z and H in the memory for each iteration. This reformulation is driven by the fact that the nuclear norm is equal to the minimum Frobenius norm factorization
| (A1) |
We can then minimize the following augmented Lagrange term:
| (A2) |
This new formulation of the problem is non-convex because is a linear combination of the product of two unknowns. But the Burer-Monteiro heuristic shows that under certain conditions, the local minima in Eq. (A2) are also the global minima.36 This Burer-Monteiro heuristic solver is used in state-of-the-art large scale implementations of matrix recovery problems. To minimize the augmented Lagrangian term in Eq. (A2), the inner operation is executed using Limited-memory Broyden-Fletcher-Goldfarb-Shanno algorithm (LBFGS) with Matlab solver “minfunc” developed by Recht et al.22 The parameters of the augmented Lagrangian are updated according to the schedule proposed by Burer and Monteiro.23 The parameter λ determines the trade-off between the fidelity of the solution to the measurements y and its conformance to the model. The term involving σ measures the Euclidean norm of the infeasibility, and σ is a penalty parameter. We update both parameters for each LBFGS minimization. We also use a gradient-based algorithm because the function and the gradient evaluations of the augmented Lagrange term Eq. (A2) with respect to vectors Z, H can be performed very quickly.
References
- 1. Munk W., Worcester P., and Wunsch C., Ocean Acoustic Tomography ( Cambridge University Press, New York, 2009), pp. 2–30. [Google Scholar]
- 2. Dowling D. R. and Sabra K. G., “ Acoustic remote sensing,” Ann. Rev. Fluid Mech. 47, 221–243 (2015). 10.1146/annurev-fluid-010814-014747 [DOI] [Google Scholar]
- 3. Xu G., Liu H., Tong L., and Kailath T., “ A least-squares approach to blind channel identification,” IEEE Trans. Signal Process. 43(12), 2982–2993 (1995). 10.1109/78.476442 [DOI] [Google Scholar]
- 4. Giannakis G. B. and Serpedin E., “ Linear multichannel blind equalizers of nonlinear fir volterra channels,” IEEE Trans. Signal Process. 45(1), 67–81 (1997). 10.1109/78.552206 [DOI] [Google Scholar]
- 5. Harikumar G. and Bresler Y., “ Perfect blind restoration of images blurred by multiple filters: Theory and efficient algorithms,” IEEE Trans. Image Process. 8(2), 202–219 (1999). 10.1109/83.743855 [DOI] [PubMed] [Google Scholar]
- 6. Souidene W., Abed-Meraim K., and Beghdadi A., “ A new look to multichannel blind image deconvolution,” IEEE Trans. Image Process. 18(7), 1487–1500 (2009). 10.1109/TIP.2009.2018566 [DOI] [PubMed] [Google Scholar]
- 7. Mendel J. M., Optimal Seismic Deconvolution: An Estimation-based Approach ( Academic Press, New York, 1983), pp. 1–26. [Google Scholar]
- 8. Sabra K. and Dowling D., “ Blind deconvolution in oceanic waveguides using artificial time reversal,” J. Acoust. Soc. Am. 116, 262–271 (2004). 10.1121/1.1751151 [DOI] [Google Scholar]
- 9. Sabra K. G., Song H.-C., and Dowling D. R., “ Ray-based blind deconvolution in ocean sound channels,” J. Acoust. Soc. Am. 127(2), EL42–EL47 (2010). 10.1121/1.3284548 [DOI] [PubMed] [Google Scholar]
- 10. Abadi S. H., Rouseff D., and Dowling D. R., “ Blind deconvolution for robust signal estimation and approximate source localization,” J. Acoust. Soc. Am. 131(4), 2599–2610 (2012). 10.1121/1.3688502 [DOI] [PubMed] [Google Scholar]
- 11. Abadi S. H., Song H. C., and Dowling D. R., “ Broadband sparse-array blind deconvolution using frequency-difference beamforming,” J. Acoust. Soc. Am. 132(5), 3018–3029 (2012). 10.1121/1.4756920 [DOI] [PubMed] [Google Scholar]
- 12. Chapin S. R., Ioup G. E., Ioup J. W., and Smith G. B., “ Statistical analysis of acoustic Green's functions in a shallow ocean environment,” J. Acoust. Soc. Am. 109(5), 2295–2295 (2001). 10.1121/1.4744047 [DOI] [Google Scholar]
- 13. Smith G. B., “ Blind deconvolution for multipath mitigation in shallow water acoustics,” J. Acoust. Soc. Am. 107(5), 2868–2868 (2000). 10.1121/1.429301 [DOI] [Google Scholar]
- 14. Broadhead M. K. and Pflug L. A., “ Performance of some sparseness criterion blind deconvolution methods in the presence of noise,” J. Acoust. Soc. Am. 107(2), 885–893 (2000). 10.1121/1.428270 [DOI] [PubMed] [Google Scholar]
- 15. Roan M. J., Gramann M. R., Erling J. G., and Sibul L. H., “ Blind deconvolution applied to acoustical systems identification with supporting experimental results,” J. Acoust. Soc. Am. 114(4), 1988–1996 (2003). 10.1121/1.1610465 [DOI] [PubMed] [Google Scholar]
- 16. Cazzolato B. S., Nelson P., Joseph P., and Brind R. J., “ Numerical simulation of optimal deconvolution in a shallow-water environment,” J. Acoust. Soc. Am. 110(1), 170–185 (2001). 10.1121/1.1379081 [DOI] [Google Scholar]
- 17. Tong L. and Perreau S., “ Multichannel blind identification: From subspace to maximum likelihood methods,” IEEE Proc. 86, 1951–1968 (1998). 10.1109/5.720247 [DOI] [Google Scholar]
- 18. Moulines E., Duhamel P., Cardoso J.-F., and Mayrargue S., “ Subspace methods for the blind identification of multichannel fir filters,” IEEE Trans. Signal Process. 43(2), 516–525 (1995). 10.1109/78.348133 [DOI] [Google Scholar]
- 19. Giannakis G. B. and Mendel J. M., “ Identification of nonminimum phase systems using higher order statistics,” IEEE Trans. Acoust., Speech Signal Process. 37(3), 360–377 (1989). 10.1109/29.21704 [DOI] [Google Scholar]
- 20. Broadhead M. K., Pflug L. A., and Field R. L., “ Use of higher order statistics in source signature estimation,” J. Acoust. Soc. Am. 107(5), 2576–2585 (2000). 10.1121/1.428645 [DOI] [PubMed] [Google Scholar]
- 21. Ahmed A., Recht B., and Romberg J., “ Blind deconvolution using convex programming,” IEEE Trans. Inf. Theory. 60(3), 1711–1732 (2014). 10.1109/TIT.2013.2294644 [DOI] [Google Scholar]
- 22. Recht B., Fazel M., and Parrilo P. A., “ Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization,” SIAM Review 52(3), 471–501 (2010). 10.1137/070697835 [DOI] [Google Scholar]
- 23. Burer S. and Monteiro R., “ A nonlinear programming algorithm for solving semidefinite programs via low-rank factorization,” Math. Program., Ser. B 95, 329–357 (2003). 10.1007/s10107-002-0352-8 [DOI] [Google Scholar]
- 24. Davenport M. and Romberg J., “ An overview of low-rank matrix recovery from incomplete observations,” IEEE J. Sel. Topics Signal Process. 10(4), 608–622 (2016). 10.1109/JSTSP.2016.2539100 [DOI] [Google Scholar]
- 25. Candès E. J. and Recht B., “ Exact matrix completion via convex optimization,” Found. Comput. Math. 9(6), 717–772 (2009). 10.1007/s10208-009-9045-5 [DOI] [Google Scholar]
- 26. Candès E. J. and Tao T., “ The power of convex relaxation: Near-optimal matrix completion,” IEEE Trans. Inf. Theory 56(5), 2053–2080 (2010). 10.1109/TIT.2010.2044061 [DOI] [Google Scholar]
- 27. Jensen F. B., Kuperman W. A., Porter M. B., and Schmidt H., Computational Ocean Acoustics ( Springer Science & Business Media, New York, 2011), pp. 103–107. [Google Scholar]
- 28. Davenport M. A. and Wakin M. B., “ Compressive sensing of analog signals using discrete prolate spheroidal sequences,” Appl. Comput. Harmonic Anal. 33(3), 438–472 (2012). 10.1016/j.acha.2012.02.005 [DOI] [Google Scholar]
- 29. Romberg J., Tian N., and Sabra K., “ Multichannel blind deconvolution using low rank recovery,” in SPIE Defense, Security, and Sensing, International Society for Optics and Photonics (International Society for Optics and Photonics, 2013), pp. 87500E–87500E. [Google Scholar]
- 30. Wenz G. M., “ Acoustic ambient noise in the ocean: Spectra and sources,” J. Acoust. Soc. Am. 34(12), 1936–1956 (1962). 10.1121/1.1909155 [DOI] [Google Scholar]
- 31. Porter M. B., The KRAKEN normal mode program, No. NRL/MR/5120-92-6920 (Naval Research Lab., Washington, DC, 1992), pp. 1–198.
- 32. Porter M. B., “ The bellhop manual and user's guide: Preliminary draft,” Technical Report, Heat, Light, and Sound Research, Inc., La Jolla, CA, pp. 1–57 (2011).
- 33. Skinner J. and Hodgkiss W., “ A networked/autonomous receiving array system,” in IEEE OCEANS (2008), pp. 1–6. [Google Scholar]
- 34. Leroy C., Lani S., Sabra K. G., Hodgkiss W. S., Kuperman W., and Roux P., “ Enhancing the emergence rate of coherent wavefronts from ocean ambient noise correlations using spatio-temporal filters,” J. Acoust. Soc. Am. 132(2), 883–893 (2012). 10.1121/1.4731231 [DOI] [PubMed] [Google Scholar]
- 35. Lani S. W., Sabra K. G., Hodgkiss W. S., Kuperman W., and Roux P., “ Coherent processing of shipping noise for ocean monitoring,” J. Acoust. Soc. Am. 133(2), EL108–EL113 (2013). 10.1121/1.4776775 [DOI] [PubMed] [Google Scholar]
- 36. Burer S. and Monteiro R. D., “ Local minima and convergence in low-rank semidefinite programming,” Math. Program. 103(3), 427–444 (2005). 10.1007/s10107-004-0564-1 [DOI] [Google Scholar]











