Abstract
Multidimensional NMR spectroscopy typically employs phase-sensitive detection, which results in hypercomplex data (and spectra) when utilized in more than one dimension. Nonuniform sampling approaches have become commonplace in multidimensional NMR, enabling dramatic reductions in experiment time, increases in sensitivity and/or increases in resolution. In order to utilize nonuniform sampling optimally, it is necessary to characterize the relationship between the spectrum of a uniformly sampled data set and the spectrum of a subsampled data set. In this work we construct an algebra of hypercomplex numbers suitable for representing multidimensional NMR data along with partial-component nonuniform sampling (i.e. the hypercomplex components of data points are subsampled). This formalism leads to a modified DFT–convolution relationship involving a partial-component, hypercomplex point-spread function set. The framework presented here is essential for the continued development and appropriate characterization of partial-component nonuniform sampling.
Keywords: aliasing, discrete Fourier transform (DFT), convolution, point-spread function (PSF), compressed sensing
Introduction
Ever since NMR was invented by Rabi et al. [1], and further developed by Bloch and Purcell et al. [2–4], scientists have continued to devise methods for enhancing sensitivity and resolution. Ernst and Anderson introduced the Fourier Transform NMR in the mid 1960s [5], which shortened experiment times and increased spectral sensitivity. Jean Jeener conceived of the first 2D experiment in 1971 and it was ultimately carried out by Ernst et al. in 1975 [6]. The long experiment times required by multidimensional NMR led to Bodenhausen and Ernst’s accordion experiment in 1981 [7], which launched the concept of nonuniform sampling (NUS) and further motivated the development of non-Fourier spectral reconstruction [8]. The next significant change in sampling strategy that impacted sensitivity came with the hypercomplex representation of phase-sensitive data by States et al. [9], which was later formalized by Delsuc [10]. Each hypercomplex entry in the multidimensional data array contains a set of real valued components. The component values give the magnitudes of the signal recorded at various phase angles.
Multidimensional NMR has incrementally evolved over the last several decades through the continued development of novel NUS schemes and non-Fourier reconstruction techniques (see [11] for review). One contributing factor to the improved understanding of NUS is the advent of rigorous, quantitative methods for characterizing NUS schemes [12–15] and reconstruction methods [8, 16–19]. Central to understanding these efforts has been the Discrete Fourier Transform (DFT)–Convolution Theorem, which states for complex-valued vectors a and b,
(1) |
where is the DFT operator, ∘ is the Hadamard product (i.e. elementwise multiplication), and * is convolution. For a “data vector” a and a “sampling function” b, the DFT–Convolution Theorem states that the DFT of the sampling function ( (b)), often referred to as the point-spread function (PSF), relates the spectrum obtained by complete uniform sampling ( (a)) to the spectrum obtained by subsampling ( (a ∘ b)). In particular, the sampling artifacts in the PSF enter into the subsampled spectrum by way of convolution with the uniformly sampled spectrum. It should be noted that while non-Fourier reconstruction methods are often designed to suppress sampling artifacts and generally achieve better sensitivity than the DFT of zero-augmented subsampled data, the PSF is still appropriate for the relative comparison of sampling schemes.
Until the recent introduction of random-phase detection (RPD, [20]), subsampling ignored the multi-component nature of hypercomplex data; samples included either all the components of a data value or none of them (we call this full-component sampling). In contrast, RPD, which is the first presentation of a partial-component sampling scheme, collects only one component from each data value. In general, partial-component subsampling of multidimensional, hypercomplex data can not be handled by the classic DFT–Convolution Theorem. This paper formalizes a modified DFT–Convolution Theorem to accommodate partial-component subsampling and serves as an extension of the work by Delsuc [10]. In addition, the derivation of the modified DFT-Convolution relationship provides a means for characterizing the sampling-induced spectral aliases that arise from any partial-component sampling scheme.
The derivation reveals the proper form of the PSF under partial-component subsampling. These results modernize the theoretical foundations of multidimensional NMR and provide a platform for quantifying the spectral sensitivity of the next generation of novel sampling schemes.
Methods
The following subsections introduce an algebra of hypercomplex numbers along with the DFT, convolution and Hadamard operators necessary to define a modified DFT–Convolution relationship suitable for analysis of partial-component, nonuniform sampling of hypercomplex, multidimensional NMR data. Note that the algebra developed here is quite different from the quaternion or octonion algebras commonly employed in other fields [21–23]; among other things, our multiplication is commutative.
An Algebra of Hypercomplex Numbers
Two fundamental properties of hypercomplex NMR data are:
Applying a 90-degree phase shift in dimension i followed by a 90-degree phase shift in dimension j yields the same result as applying the two phase shifts in the opposite order.
Applying a 180-degree phase shift in any dimension results simply in a change of sign.
The work of Delsuc [10] is extended by defining an algebra of hypercomplex numbers that satisfies the above properties.
Each dimension in a d-dimensional hypercomplex number system has real and imaginary components. A single real-valued unit, denoted by “1”, is common to all dimensions and is the identity element of the algebra, whereas each dimension k = 1, …, d has a unique imaginary unit uk. The imaginary units are the generators of the algebra and, along with the identity element, can be used to construct a set of 2d basis elements of the algebra. The particular set of basis elements or components (casually referred to as “phases” in NMR) employed here is defined as
(2) |
where φ1 ···φd is a product of units performed according to the multiplication rules:
(3) |
These rules express the fundamental properties of hypercomplex NMR noted at the top of this section, provided that multiplication by uk is interpreted as a phase shift by 90 degrees in dimension k.
A value in the d-dimensional hypercomplex space ℍd is a weighted sum over the component set. For example, consider the three-dimensional case. The generators u1, u2 and u3 produce the component set
(4) |
and a general member of the space ℍ3 can be written as
(5) |
where the coefficients a, b, …, h are real numbers. In the context of multidimensional NMR, the real unit along each dimension is typically referred to as “R” and the imaginary unit along each dimension is referred to as “I”. In this notation, for example, the component u1u3 from is expressed as “IRI” and f above could be referred to as the “IRI” coefficient of x.
As a simpler example, the 1-dimensional hypercomplex space ℍ1 has only one imaginary unit, u1. Numbers in this space have only two components, R and I, and they can be written in the form x = a + bu1. It’s easy to see that the multiplication operation defined in Equation (3) makes this space isomorphic to the usual complex numbers.
The exponential function is defined for hypercomplex numbers according to the usual Taylor series expansion. Using this formula one can show that the familiar properties of exponentials continue to hold: If x and y are hypercomplex numbers then ex+y = ex · ey, and for any real number a and imaginary unit uk we have eauk = cos(a) + uk sin(a).
We will write “x{φ}” for the real-valued coefficient of the component φ ∈ appearing in x. Thus in Equation (5) above, f = x{u1u3}. It follows that a general member of ℍd can be written as
(6) |
A multidimensional array X of hypercomplex numbers is similarly expressed as
(7) |
where X{φ} is now the real-valued array whose elements are the coefficients of the component φ. The hypercomplex element of X located at the indices k1, …, kd will be written as “X[k1, …, kd]”. This system of notation puts the square brackets and curly brackets on equal footing, emphasizing how indexing along dimension is similar to indexing along component.
DFT
Let be a hypercomplex vector of length N (the subscript on “ℍ” indicates the dimensionality of the phase-space and the superscript indicates the size of the vector). The hypercomplex DFT on dimension j of t is the length-N vector f defined by:
(8) |
This is the same as the usual formula for the DFT of a complex vector except that the ordinary imaginary unit i has been replaced with uj. We write f = (t), with the subscript j indicating the dimension on which acts.
The DFT operator is extended to multidimensional arrays by performing a single DFT on all the sub-vectors parallel to the j-th dimension. Thus, if T is a d-dimensional array of hypercomplex values then
(9) |
where Nj is the size of T ’s j-th dimension. Note that the dimension index j is used here in two different ways: It specifies the location of elements in T (i.e., kj, nj, and Nj) and also the hypercomplex unit in the exponential factor (i.e., uj). By contrast, Equation (8) uses j only to select a hypercomplex unit, independently of the single dimension the vector f comprises.
A full DFT of a multidimensional hypercomplex data array involves performing a single transform in each of the dimensions: (T) = (··· (T) ···). The order of the transforms is irrelevant because the uj units commute; the series of subscripts has been omitted from the multidimensional Fourier operator on the left hand side to indicate that all dimensions have been transformed. Written out in full, this becomes
(10) |
Convolution
Let be hypercomplex vectors. The convolution of f and g (written as f * g) is the length-N vector h defined by:
(11) |
The vectors f and g are treated as periodic, with element indices that fall out of the range {0, …, N − 1} wrapped back in range through the modulo operator. The hypercomplex convolution defined above differs from the classic complex-valued convolution only in that the multiplication operator (·) is now hypercomplex multiplication performed according to Equation (3).
Convolution is defined similarly for multidimensional, hypercomplex data sets. The sums run over all the dimensions:
(12) |
With these definitions, the DFT–Convolution theorem (Equation (1)) holds for hypercomplex vectors (if is replaced by ) and for multidimensional arrays of hypercomplex values.
NMR Data and Sampling in Time/Phase
In the original NUS scheme, for each hypercomplex point in the data set, either all the components are sampled or none of them are (i.e. full-component sampling). This can be represented mathematically using the Hadamard operator. Let T be a fully sampled data set; then the NUS data set is specified by T ∘ S, where S is a multidimensional array of the same size as T whose elements are 1 (for the data points that are sampled) or 0 (for the points that are not). S is called the sampling function.
The Hadamard product will not suffice for the more general case of partial-component subsampling (e.g. RPD, in which each element of the array is sampled at exactly one of its components). To handle such schemes we introduce a modified Hadamard operator (⊙) and define partial-component subsampling as
(13) |
where the sampling function S is written as a hypercomplex array in which the coefficient values are 1 for the points and components that are sampled and 0 for those that are not. (It is worth emphasizing that the hypercomplex representation of S is for notational convenience only. The sampling function should be viewed not as a single array of hypercomplex values, but rather as a set of real valued arrays, with one array for each hypercomplex component. While it may be useful to store or refer to the set as a single hypercomplex array, it does not act mathematically as a hypercomplex array.) In the modified Hadamard product of two arrays, the values are multiplied element-wise and component-wise within elements. In other words, for each component φ
(14) |
and therefore
(15) |
The modified Hadamard product on hypercomplex data arrays is thus a summation of real valued Hadamard products.
Unfortunately the classic DFT–Convolution relationship does not hold for the modified Hadamard operator. Instead, a separate relation holds for each component φ:
(16) |
which results from applying Equation (1) to Equation (14). Multiplying each side by φ and adding leads, by way of Equation (7), to
(17) |
This expresses the spectrum of a partial-component NUS data set, (T′), in terms of a convolution between component-isolated spectra of the fully sampled data set, (T {φ} · φ), and a component-isolated point-spread function, (S{φ}). The latter is a member of the component-isolated PSF set
(18) |
The relationships among the functional components of Equation (17) are shown in Figure 1; a visualization of these relationships is shown in Figure 2, in which a 1-dimensional spectrum containing a synthetic peak in the real component and another in the imaginary component is paired with RPD sampling. The component-isolated PSF set defined in Equation (18) is shown in the cyan box. The yellow box shows that the true peaks are aliased by the separation of T into its components. However, due to the change in sign, the convolution products in the magenta box show constructive interference at the true locations of the synthetic peaks (left of center in the real component, right of center in the imaginary component) and destructive interference at the aliased locations of the synthetic peaks (right of center in the real component, left of center in the imaginary component).
It is instructive to apply this analysis to the degenerate case of full-component subsampling. In this situation, for each point in the time domain either all the components are sampled or none of them are. Hence for each element of S, either all the component coefficients are 1 or all are 0. It follows that the component-isolated sampling functions S{φ} are all equal to a single real-valued sampling function S′, which then distributes out of the summation in Equation (17). The end result is
(19) |
which agrees with Equation (1) as expected.
PSF Peak-to-sidelobe Ratio (PSR)
Let T be a multidimensional, hypercomplex array of time domain data and let S be the corresponding sampling function. In characterizing the aliasing artifacts present in the PSF, we observe that all of the component-isolated PSFs defined in Equation (18) enter into the summation of convolutions in Equation (17). Further, the components of the PSFs “mix” with the components of (T) and are recombined through the summation to determine the components of (T ⊙ S). Given these interconnections, it is appropriate to measure the spectral aliasing in terms of the power of the PSF taken across all components:
(20) |
where the squaring is taken element-wise so that power(PSF) is a real-valued multidimensional array of the same size as S.
The PSF power gives an upper bound on the magnitude of spectral aliasing. Consider a subsampled data set and let φ1 and φ2 be two of the hypercomplex components. The φ1 and φ2 components of the data will necessarily be correlated with each other because they are subsets of the same fully sampled signal. The structure of S may be such that (S{φ1}) and (S{φ2}) both have non-central peaks at the same location, but of opposite amplitudes. When convolved with (T {φ1}) and (T {φ2}) according to the right-hand side of Equation (17), it is possible for the aliased peaks to cancel out in the summation over the components and not appear in the final spectrum. Since the interference of sampling artifacts depends on the sampling function and on the structure of the time domain data, it is not possible to decouple the contributions in the general case. As such, we employ the PSF power and expect that alternative metrics may be developed in the future.
The PSF power shows the multidimensional distribution of artifacts and careful visual inspection may reveal features of a candidate sampling scheme. However, the conclusions may be subjective and difficult to automate. As a first-order approximation to sensitivity, the peak-to-sidelobe ratio (PSR, an adapted form of the sidelobe-to-peak ratio of Lustig et al. [24]) may be employed. PSR is defined as the ratio between (1) the central element of the PSF power (i.e. the element at zero frequency) and (2) the largest non-central element of the PSF power. The sampling function corresponding to complete sampling has a PSF power with a central element proportional to the square root of the number of sampled points; all the other elements of the PSF power are 0, leading to an infinite PSR. At lower sampling coverages, the central element is smaller and the structure embedded in the sampling function reveals itself as non-central peaks in the PSF, thereby reducing the PSR to a finite value. For example, the sampling function in Figure 2, has PSR=8.6.
Conclusion
Given a hypercomplex, multidimensional data array, a full-component sampling function may by expressed as a single real-valued array and the subsampled data is obtained with a Hadamard product. The classic DFT–Convolution Theorem is applicable and the hypercomplex DFT of the real-valued sampling function defines the PSF, which serves to characterize the sampling artifacts introduced in the NUS spectrum. A partial-component sampling function has a real-valued array for each component, and the subsampled data is obtained by taking the Hadamard product of each component of the sampling function with the corresponding component of the data array. The component-weighted summation of Hadamard products propagates through the classic DFT–Convolution Theorem, which leads to the central result of this paper.
While the modifications to the classic DFT–Convolution relationship necessary to accommodate partial-component subsampling may appear minor, their implications are significant. The component-weighted summation of convolutions on the right hand side of Equation (17) effectively superimposes the convolution products. Even though each of the component-isolated PSFs is completely determined by the sampling function, the extent to which their artifacts constructively or destructively mix is partially determined by the specific values of the time domain data being subsampled. In the general case, this “many-to-one” mapping can not be decoupled, thereby complicating metrics based solely on the PSF, as discussed in the PSR section above and demonstrated in Figure 2.
NMR spectroscopists have relied on the classic DFT–Convolution Theorem to govern the characterization of sampling artifacts since the inception of NUS. Subsampling schemes have evolved and have outgrown the underlying mathematical foundation. The derivations in this paper provide the formal foundation for a new generation of subsampling schemes, which is essential for proper characterization and optimization.
We formalize a hypercomplex algebra suitable for multidimensional NMR data.
The DFT–Convolution Theorem is modified to accommodate partial-component sampling.
The point-spread function set indicates spectral sensitivity for nonuniform sampling.
The framework is essential for development of partial-component nonuniform sampling.
Acknowledgments
We thank Dr. Jonathan Helmus for helpful discussions. Funding from the US National Institutes of Health (P01-GM47467 and R01-RR20125) is gratefully acknowledged.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Rabi, Zacharias JR, Millman S, Kusch P. A new method of measuring nuclear magnetic moment. Physical Review. 1938;53:318–318. doi: 10.1002/jmri.1880020203. [DOI] [PubMed] [Google Scholar]
- 2.Bloch F. Nuclear induction. Physical Review. 1946;70(7–8):460–474. [Google Scholar]
- 3.Bloch F, Hansen WW, Packard M. The nuclear induction experiment. Physical Review. 1946;70(7–8):474–485. [Google Scholar]
- 4.Purcell EM, Torrey HC, Pound RV. Resonance absorption by nuclear magnetic moments in a solid. Physical Review. 1946;69(1–2):37–38. [Google Scholar]
- 5.Ernst RR, Anderson WA. Application of Fourier transform spectroscopy to magnetic resonance. Review of Scientific Instruments. 1966;37(1):93–102. [Google Scholar]
- 6.Müller Luciano, Kumar Anil, Ernst RR. Two-dimensional carbon-13 NMR spectroscopy. The Journal of Chemical Physics. 1975;63:5490–5491. [Google Scholar]
- 7.Bodenhausen Geoffrey, Ernst RR. The accordion experiment, a simple approach to three-dimensional NMR spectroscopy. Journal of Magnetic Resonance. 1981;45:367–373. [Google Scholar]
- 8.Barna Jennifer CJ, Laue Ernest D, Mayger Michael R, Skilling John, Worrall Simon JP. Reconstruction of phase-sensitive two-dimensional nuclear-magnetic resonance spectra using maximum entropy. Biochemical Society Transactions. 1986;14:1262–1263. [Google Scholar]
- 9.States DJ, Haberkorn RA, Ruben DJ. A two-dimensional nuclear Overhauser experiment with pure absorption phase in four quadrants. Journal of Magnetic Resonance (1969) 1982;48(2):286–292. [Google Scholar]
- 10.Delsuc Marc A. Spectral representation of 2D NMR spectra by hypercomplex numbers. Journal of Magnetic Resonance (1969) 1988;77(1):119–124. [Google Scholar]
- 11.Maciejewski Mark W, Mobli Mehdi, Schuyler Adam D, Stern Alan S, Hoch Jeffrey C. Data sampling in multidimensional NMR: Fundamentals and strategies. Topics in Current Chemistry. 2012;316:49–78. doi: 10.1007/128_2011_185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Coggins Brian E, Zhou Pei. Sampling of the NMR time domain along concentric rings. Journal of Magnetic Resonance. 2007;184(2):207–21. doi: 10.1016/j.jmr.2006.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hyberts Sven G, Takeuchi Koh, Wagner Gerhard. Poisson-gap sampling and forward maximum entropy reconstruction for enhancing the resolution and sensitivity of protein NMR data. Journal of the American Chemical Society. 2010;132(7):2145–2147. doi: 10.1021/ja908004w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schuyler Adam D, Maciejewski Mark W, Arthanari Haribabu, Hoch Jeffrey C. Knowledge-based nonuniform sampling in multidimensional NMR. Journal of Biomolecular NMR. 2011;50(3):247–262. doi: 10.1007/s10858-011-9512-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rovnyak David, Sarcone Mark, Jiang Ze. Sensitivity enhancement for maximally resolved two-dimensional NMR by nonuniform sampling. Magnetic Resonance in Chemistry. 2011;49(8):483–491. doi: 10.1002/mrc.2775. [DOI] [PubMed] [Google Scholar]
- 16.Hoch JC, Stern AS. Maximum entropy reconstruction, spectrum analysis and deconvolution in multidimensional nuclear magnetic resonance. Methods in Enzymology. 2001;338:159–178. doi: 10.1016/s0076-6879(02)38219-3. [DOI] [PubMed] [Google Scholar]
- 17.Stern Alan S, Li Kuo-Bin, Hoch Jeffrey C. Modern spectrum analysis in multidimensional NMR spectroscopy: comparison of linear-prediction extrapolation and maximum-entropy reconstruction. Journal of the American Chemical Society. 2002;124(9):1982–1993. doi: 10.1021/ja011669o. [DOI] [PubMed] [Google Scholar]
- 18.Kim Seho, Szyperski Thomas. GFT NMR, a new approach to rapidly obtain precise high-dimensional NMR spectral information. Journal of the American Chemical Society. 2003;125(5):1385–1393. doi: 10.1021/ja028197d. [DOI] [PubMed] [Google Scholar]
- 19.Kupče Ēriks, Freeman Ray. Projection-reconstruction of three-dimensional NMR spectra. Journal of the American Chemical Society. 2003;125(46):13958–13959. doi: 10.1021/ja038297z. [DOI] [PubMed] [Google Scholar]
- 20.Maciejewski Mark W, Fenwick Matthew, Schuyler Adam D, Stern Alan S, Gorbatyuk Vitaliy, Hoch Jeffrey C. Random phase detection in multidimensional NMR. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(40):16640–16644. doi: 10.1073/pnas.1103723108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ell Todd Anthony. Quaternion-Fourier transforms for analysis of two-dimensional linear time-invariant partial differential systems. Proceedings of the 32nd IEEE Conference on Decision and Control. 1993;2:1830–1841. [Google Scholar]
- 22.Ell Todd A, Sangwine Stephen J. Hypercomplex Fourier transforms of color images. Image Processing, IEEE Transactions on Image Processing. 2007;16(1):22–35. doi: 10.1109/tip.2006.884955. [DOI] [PubMed] [Google Scholar]
- 23.Sangwine Stephen J, Ell Todd A. Complex and hypercomplex discrete Fourier transforms based on matrix exponential form of Euler’s formula. 2010 Arxiv preprint arXiv:1001.4379. [Google Scholar]
- 24.Lustig Michael, Donoho David, Pauly John M. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magnetic Resonance in Medicine. 2007;58(6):1182–1195. doi: 10.1002/mrm.21391. [DOI] [PubMed] [Google Scholar]