Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jan 1.
Published in final edited form as: IEEE Trans Med Imaging. 2017 Sep 25;37(1):185–199. doi: 10.1109/TMI.2017.2756072

Single- and Multiple-Shell Uniform Sampling Schemes for Diffusion MRI Using Spherical Codes

Jian Cheng 1,, Dinggang Shen 2, Pew-Thian Yap 3,, Peter J Basser 4,
PMCID: PMC5867228  NIHMSID: NIHMS925741  PMID: 28952937

Abstract

In diffusion MRI (dMRI), a good sampling scheme is important for efficient acquisition and robust reconstruction. Diffusion weighted signal is normally acquired on single or multiple shells in q-space. Signal samples are typically distributed uniformly on different shells to make them invariant to the orientation of structures within tissue, or the laboratory coordinate frame. The Electrostatic Energy Minimization (EEM) method, originally proposed for single shell sampling scheme in dMRI, was recently generalized to multi-shell schemes, called Generalized EEM (GEEM). GEEM has been successfully used in the Human Connectome Project (HCP). However, EEM does not directly address the goal of optimal sampling, i.e., achieving large angular separation between sampling points. In this paper, we propose a more natural formulation, called Spherical Code (SC), to directly maximize the minimal angle between different samples in single or multiple shells. We consider not only continuous problems to design single or multiple shell sampling schemes, but also discrete problems to uniformly extract sub-sampled schemes from an existing single or multiple shell scheme, and to order samples in an existing scheme. We propose five algorithms to solve the above problems, including an incremental SC (ISC), a sophisticated greedy algorithm called Iterative Maximum Overlap Construction (IMOC), an 1-Opt greedy method, a Mixed Integer Linear Programming (MILP) method, and a Constrained Non-Linear Optimization (CNLO) method. To our knowledge, this is the first work to use the SC formulation for single or multiple shell sampling schemes in dMRI. Experimental results indicate that SC methods obtain larger angular separation and better rotational invariance than the state-of-the-art EEM and GEEM. The related codes and a tutorial have been released in DMRITool 1.

Index Terms: Sampling, q-space, Multiple Shells, Uniform Spherical Sampling, Diffusion MRI, Spherical Codes

I. Introduction

Diffusion MRI (dMRI) is a unique imaging methodology to explore white matter structure and architecture in the human brain by measuring diffusion of water molecules. In dMRI, signal measurements are a limited number of samples of the diffusion signal E(q) in 3D q-space. Reconstruction in dMRI entails fitting the signal samples by using a well-designed model, including Diffusion Tensor Imaging (DTI) [1], and various models in High Angular Resolution Diffusion Imaging (HARDI) [2]–[10]. After model fitting, meaningful quantities can be estimated, including diffusion tensor and fractional anisotropy in DTI [1], the ensemble average propagator, diffusion Orientation Distribution Function (dODF), fiber ODF (fODF), and the Generalized Fractional Anisotropy (GFA) in HARDI [2]–[9], etc. An appropriate data sampling scheme in q-space is important for all such dMRI acquisition and reconstruction applications in order to recover as much information as possible by using a minimal number of measurements.

The optimal sampling scheme that yields the best reconstruction quality is dependent on the diffusion model, reconstruction algorithm, and the ground truth diffusion signal. 1) A typical DTI sampling scheme is a single shell scheme with about 30 samples and a low b value (≤1200 s/mm2) [11], while multi-shell schemes are preferred (or necessarily needed in some cases) with a higher maximal b value (≥2000 s/mm2) in HARDI methods [4], [5], [9], [10]. 2) Even for the same diffusion model, different reconstruction algorithms, including least squares, robust non-linear estimation with various regularizations, etc., may have different preferred schemes [12]. For a given diffusion model, there may be several model fitting (e.g., reconstruction) algorithms. Robust estimation may prefer larger b values due to its robustness to noise [13], [14], and compressed sensing based reconstruction could reduce the number of data samples, compared with traditional least square based reconstruction [15], [16]. 3) Different variabilities of DWI signals may prefer different schemes. An efficient scheme should allocate more samples in the domains with larger signal changes than in domains with smaller signal changes. Error propagation analysis showed that the optimal b value for DTI is inversely proportional to the mean diffusivity of the latent signal [11]. Therefore, it is infeasible to develop a general optimal sampling scheme that works best for all signal variabilities and all reconstruction methods. However, a necessary property for an optimal sampling scheme is that the samples should be spherically uniformly distributed with no directional preference, because 1) an optimal sampling scheme should be invariant to the orientation of tissue structures and the laboratory coordinate frame in which the data is scanned; 2) we have to design one single scheme for all different variabilities of signals in human brain. Thus, existing works [11], [17]–[24] focus on designing single or multiple shell spherically uniform sampling schemes with input parameters S, {Ks}s=1S, where S is the number of shells and Ks is the number of samples in the s-th shell. These parameters are user-determined based on the consideration of diffusion signals, models, reconstruction algorithms, noise level, and the scan time limit.

Uniform single shell sampling schemes are widely used in dMRI, where samples in q-space are uniformly distributed on a sphere (i.e., a fixed b-value). Spherical tessellation using spherical polyhedrons can generate uniform single shell schemes with some specific fixed numbers of samples determined by polyhedron types [2]. On the other hand, the Electrostatic Energy Minimization (EEM) method proposed in dMRI by Jones et al., [11] is the most popular way to generate a uniform single shell sampling scheme with an arbitrary number of samples. EEM considers the samples as electrons on sphere, and estimates the sample configuration by minimizing the electrostatic energy based on Coulomb’s law:

min{ui}i=1Ki<jf2(ui,uj),fα(ui,uj)=1ui-uj2α+1ui+uj2α, (1)

where the second term in fα(ui, uj) is used because antipodal symmetric samples provide redundant information in dMRI data reconstruction. The problem in Eq. (1) is the so-called Thomson problem 2. Some best known solutions of EEM have been collected in CAMINO [25]. For the multi-shell case, recently it was reported that staggered samples in different shells are better for reconstruction, compared with repeated spherical samples in different shells [17]–[20], [24]. [19] and [20] tried to design a multi-shell scheme with S shells by separating a single shell scheme into S subsets respectively in different ways to minimize the electrostatic energy. [24] proposed to a rotate single shell scheme and use the rotated scheme in different shells. [17] and [18] generalized EEM from the single shell case to the multiple shell case, called Generalized EEM (GEEM), by considering the electrostatic energies both in each individual shell and in the combined shell with all samples. The obtained multi-shell schemes by GEEM showed large angular separations in shells than [19], [20], and have been successfully used in the Human Connectome Project (HCP) [26]. Although EEM and GEEM are widely used, the electrostatic energy formulation does not directly maximize the angular separation between measurements. Samples in q-space are not electrons, and it is still not clear how electrostatic energy relates to dMRI reconstruction.

A good sampling scheme should have large angular separation such that the reconstruction has large angular resolution and good rotational invariance. Thus we propose a mathematically more natural way for sampling scheme design, which directly maximizes the minimal angular difference between sampling points, i.e., covering radius, on the unit sphere. This way to determine sample configuration on sphere is essentially called the Spherical Code (SC) formulation 3, or Tammes problem 4. In many diffusion MRI applications, the information redundancy is considered in local spatial neighborhoods and local angular neighborhoods [27]–[30], which is based on a natural assumption that if the samples ui and uj are close in the sphere, then the DWI signal samples E(qui) and E(quj) are also close. Thus, based on this assumption, maximizing the separation angles between nearest sampling points is related to minimizing the information redundancy in measured DWI samples. The SC formulation directly maximizes the minimal separation angles between the nearest samples. While in the EEM formulation, the electrostatic energy is determined by distances between all sample pairs, not just the nearest samples. Moreover, the covering radius (i.e., minimal separation angle between samples) is a natural and direct measurement of “angular resolution” of spherical sampling schemes, compared with the electrostatic energy. Some pioneer works of HARDI uses finer tessellation gradient table (i.e., larger number of samples with smaller separation angles) to investigate the non-Gaussianity of DWI signals [2], [31]. Wedeen et al., [32] used 4πK to roughly measure the angular resolution of schemes. Caruyer et al., [17], [18] evaluated schemes by different methods using the covering radius. Thus, it is natural to directly maximize the covering radius of sampling schemes.

SC is related with the famous thirteen spheres problem raised by Isaac Newton, and has been well studied in mathematics [33]–[35]. Tóth [33] proved an upper bound for the covering radius of K samples on a sphere. There is also a collection of best known solutions for the SC problem with the antipodal symmetry constraint in 𝕊2 [34] 5. It can be proven that the SC problem is equivalent to EEM problem in Eq. (1) when α → ∞. Thus, Papadakis et al., [36] tried to approximately solve SC problem by iteratively increasing α in Eq. (1). However as α increases, the optimization problem has more local minima and becomes more unstable because the cost function tends to be discontinuous and floating-point overflow occurs [36]. Since the SC formulation directly maximizes the minimal distance between samples, it yields larger angular separation than EEM formulation. However to our knowledge, the SC formulation and the collected best known solutions have not yet been used in the dMRI domain. Although SC is well-studied in the mathematical literature, its current formulation is limited to a continuous single shell, and is not applicable to multiple shells, nor to the discrete problem to uniformly extract subsets from an existing scheme.

In this paper, we propose the SC formulation in diffusion MRI with total five algorithms, not only for continuous problems to design single and multiple shell schemes, but also for discrete problems to uniformly sub-sample an existing scheme, and for ordering an existing scheme. Preliminary parts of this work were reported in two conference papers [37], [38]. In this paper, we categorize sampling scheme design problems in dMRI, propose total 5 algorithms in a unified framework of minimizing the mean covering radius, and provide additional examples, results, derivations, and insights.

The paper is organized as follows. Section II proposes the SC formulation and shows how the existing sampling scheme design problems in dMRI can be formulated as optimization problems using SC. Section III proposes 5 algorithms to solve the problems in Section II. Section IV demonstrates some experimental results of synthetic data and real data.

II. Spherical Code (SC) and Sampling Problems in DMRI

A. Spherical Code (SC) Formulation

For a given set of spherical samples {uiS2}i=1K, we define the covering radius 3 for the i-th sample as

di({ul}l=1K)=minji,jarccosuiTuj. (2)

where the absolute value operator is used because the antipodal symmetric samples have the same role in dMRI data reconstruction. We also define the covering radius for all samples as the minimal angular distance between all sample pairs, i.e.,

d({ul}l=1K)=minidi({ul}l=1K)=minji,i,jarccosuiTuj, (3)

The SC formulation on a single shell is to find K samples {ui}i=1K such that the covering radius is maximized:

max{uiD}i=1Kd({ui}i=1K) (4)

where 𝔻 ⊆ 𝕊2 is the solution domain. If 𝔻 = 𝕊2, Eq. (4) is a continuous SC (CSC) problem for selecting K points from the continuous sphere 𝕊2. If D={un}n=1N, a set of N predetermined points on 𝕊2, then Eq. (4) is a discrete SC (DSC) problem for selecting K from N points. For CSC, [34] proposed to iteratively optimize a continuous cost function that approximates the original cost function in Eq. (4). The authors also released a collection of best known solutions for the SC problem in 𝕊2 [34] 5. Note that the original SC problem only considers CSC, i.e., 𝔻 = 𝕊2, while in this paper we propose both CSC and DSC formulations and generalize them to multi-shell case for designing sampling schemes in dMRI.

For multi-shell sampling, SC is to find a set of points {us,i} by maximizing the weighted mean of covering radii:

max{us,iD}wS-1s=1Sd({us,i}i=1Ks)+(1-w)d({us,i}i=1,,Ks;s=1,,S), (5)

where S is the number of shells, Ks is the number of points on the s-th shell, us,i is the i-th point on the s-th shell, and w is a weighting factor between the mean covering radius of the S shells and the covering radius of the combined shell containing all points from the S shells. w is normally set to 0.5. Because of the second term, the estimated samples in different shell are staggered if w ≠ 1.

B. Uniform Sampling Scheme Design Problems in dMRI

Spherical sampling problems in diffusion MRI can be classified into two categories, the continuous category (P-C) which determines samples in the continues sphere, and the discrete category (P-D), i.e., the sub-sampling problem, which determines samples in a discrete set of spherical samples. These two problem categories include the following sampling scheme design problems.

  • Single-shell continuous problem (P-C-S). Given a number K, how does one determine K uniform samples on sphere, such that these samples are separated as far as possible? The problem is for single shell scheme design. P-C-S can be solved by CSC in Eq. (4) with 𝔻 = 𝕊2.

  • Multi-shell continuous problem (P-C-M). Given S numbers {Ks}s=1S, how does one determine a scheme with S shells, Ks points in the s-th shell, such that samples are separated as far as possible not only in each single shell, but also in the combined shell with all samples? The problem is for multi-shell sampling scheme design. P-C-M can be solved by CSC in Eq. (5) with 𝔻 = 𝕊2.

  • Single subset from single set problem (P-D-SS). Given N known points {ui}i=1N on sphere, how does one uniformly select K samples from the given N samples, such that these samples are separated as far as possible? The problem is to reduce the number of samples in an existing single shell scheme. It also relates to P-C-S, because after discretizing the continuous sphere using many points, solving P-D-SS is an approximation of solving P-C-S. P-D-SS can be solved by DSC in Eq. (4) with D={ui}i=1N.

  • Multiple subsets from multiple sets problem (P-D-MM). Given points {us,i}, i = 1, 2,..., Ns, s = 1, 2,..., S, how does one uniformly select Ks from the Ns samples for the s-th shell, such that samples are separated as far as possible not only in each single shell, but also in the combined shell with all samples? P-D-MM is to reduce the number of samples in an existing multi-shell scheme. It can be solved by DSC in Eq. (5) with 𝔻 = {us,i}.

  • Multiple subsets from single set problem (P-D-MS). Given a single set of points {ui}i=1N on sphere, how does one uniformly select multiple sets of samples from a given single set of samples, such that the samples are separated as far as possible not only in each set, but also in the combined set with all samples? It relates to P-C-M by discretizing the continuous sphere using many points. P-D-MS can be solved by DSC in Eq. (5) with D={ui}i=1N. [19], [20] actually tried to solve P-D-MS by minimizing electrostatic energy in different heuristic search ways. We propose to solve it by using a better algorithm called Mixed Integer Linear Programming (MILP).

  • Acquisition ordering problem (P-O). Given a single or multiple shell scheme, how does one optimize the acquisition order of the samples, such that an interruption at any point of the acquisition can still yield a nearly uniform sampling scheme [22], [23]? This problem can be solved by using incremental SC (ISC) as described in III-A.

P-C-S is a special case of P-C-M, and P-D-SS is a special case of P-D-MM (with S = 1 and w = 1). In the following context, although P-O belongs to the discrete category, we will use P-C to denote P-C-S and P-C-M, P-D to denote P-D-SS, P-D-MM, and P-D-MS.

III. Algorithms for the SC formulation

In this section, we propose five algorithms to solve the above sampling scheme design problems. Table I summarizes the algorithms applicable to the corresponding problems. Note that although one algorithm may work for several problems, only the problems in bold are suggested for good performance. ISC and IMOC do not need an initialization. MILP does not require an initialization, although a good initialization may reduce the computation time. The algorithms in brackets in the third column are suggested for good initializations. The algorithms can also be cascaded, i.e., one algorithm can use the result by another algorithm as the initialization. Table II lists the suggested algorithms for P-O, P-C and P-D, where IMOC + 1-Opt + CNLO means CNLO uses the scheme by IMOC + 1-Opt as its initialization.

TABLE I.

Proposed algorithms to solve the problems listed in Section II-B.

Algorithms Problems Initialization

ISC P-C, P-D, P-O No
IMOC P-C, P-D No
1-Opt P-C, P-D Yes (IMOC)
CNLO P-C Yes (IMOC+1-Opt)
MILP P-C, P-D Not required (IMOC + 1-Opt)

TABLE II.

Problems listed in Section II-B and suggested algorithms.

Problems Suggested algorithms Initialization

P-C IMOC + 1-Opt + CNLO No
P-D MILP Not required (IMOC+1-Opt)
P-O ISC No

A. Incremental SC (ISC)

The similar incremental strategy used for incremental EEM (IEEM) [39] and incremental GEEM (IGEEM) [17], [18] can be also used to propose a greedy incremental SC estimation, called incremental SC (ISC). ISC works for P-D in both Eq. (4) and Eq. (5) with D={ui}i=1N. At step k, we estimate one point u ∈ 𝔻 that maximizes the cost function based on the k − 1 points estimated in previous iterations. ISC can be also applied to P-C, i.e., 𝔻 = 𝕊2, by approximating 𝕊2 using a large number of uniformly distributed points. In practice, we normally use 20481 points from a 7 order tessellation of the icosahedron. ISC can obtain reasonable schemes in seconds. For P-O to optimize acquisition order of a given scheme {us,i}, we can use ISC with 𝔻 = {us,i}. In this way, the first partial set of samples are nearly uniformly separated. It was reported in [40] that optimal ordering reduces the risk of systematic errors due to gradient heating and subject motion.

B. Iterative Maximum Overlap Construction (IMOC)

Incremental methods (IEEM [39], IGEEM [18] and ISC) are to obtain a scheme with reasonable uniform coverage when the acquisition is terminated and only a subset of samples are used. These incremental methods all have the same limitations. 1) Because they are developed to work reasonably well for any subset with first several samples, the cost function (i.e., electrostatic energy or covering radius) of the full scheme is not optimal. 2) When using a finer uniform sample set, the cost function of the obtained sampling scheme may not be improved, which is beyond our expectation.

We propose a sophisticated greedy algorithm, called Iterative Maximum Overlap Construction (IMOC), to iteratively search the covering radius and sequentially generate samples with maximal spherical overlap on the sphere. IMOC works for both P-C and P-D. See the IMOC in Alg. 1 for designing single shell schemes. IMOC uses MOC in Alg. 2 to verify whether a candidate covering radius θ is able to construct an acceptable scheme with K samples. We define the coverage set of a point x as C(x, θ) = {y | arccos(|yTx|) < θ,y ∈ 𝕊2}. With a given candidate θ, MOC generates samples one by one. In the i-th step, MOC finds the best point ui whose coverage C(ui, θ) has the largest overlap with the existing total coverage set CS, and then adds C(ui, θ) into CS. See Alg. 2. If all K points can be obtained by MOC using θ, then the best covering radius is no less than θ, which means θ can be increased. If MOC cannot find K points using θ, then θ can be decreased. Therefore, IMOC performs a binary search of the covering radius θ from (0, θub(2K)), where θub(2K) in Eq. (6) is an upper bound of the covering radius with K points with antipodal symmetry consideration (i.e., total 2K points) [33].

Algorithm 1.

Iterative Maximum Overlap Construction (IMOC) for Single shell Scheme Design:

graphic file with name nihms925741f5.jpg

Algorithm 2.

Maximum Overlap Construction (MOC) for Single shell Scheme Design:

graphic file with name nihms925741f6.jpg

Algorithm 3.

IMOC for Multiple shell Scheme Design:

graphic file with name nihms925741f7.jpg
θub(2K)=arccos4-csc2(πK6(K-1)) (6)

See Fig. 1(a) for a demonstration of MOC when solving P-C-S in 𝕊1 in the 2D case. It is obvious that IMOC yields the globally optimal solution in 𝕊1, while ISC and IGEEM cannot.

Fig. 1. Sketch map of IMOC.

Fig. 1

(a) demonstrates MOC algorithm 2 for P-C-S in 𝕊1, where u1, u2 and u3 are selected one by one with a given θ, and the yellow arc denotes the total coverage set. Figures (b) and (c) are sketch maps for the multi-shell case P-C-M. (b) shows the compromise between the covering radius θ0 in the combined shell and the covering radius θs in the s-th shell, and demonstrates the binary search path (the dashed blue line segment) and the obtained covering radii in IMOC. The grey area contains all feasible covering radii, and the red curve denotes all Pareto optimal covering radii. (c) demonstrates MOC for 3 shell case. Three colors denote three shells. The numbers near the points are the selection orders in MOC for the points. Point 1 (i.e., u1,1) is first selected for shell 1, then point 2 (u2,1) for shell 2 and point 3 (u3,1) for shell 3. The red circle around u1,1 denotes its coverage set C(u1,1, θ1) which is added to CS1, and the dashed black circle around u1,1 denotes C(u1,1, θ0) which is added to CS0. Point 4 (u1,2) is selected based on CSs, ∀s = 0, 1, 2, 3.

IMOC and MOC algorithms for multiple shell sampling scheme design are shown in Alg. 3 and 4. For multiple shell case, IMOC needs to search (θ0, θ1,..., θS) in S + 1 dimensional space, where θ0 is the covering radius of the combined shell with all samples and θs (s > 0) is the covering radius of the s-th shell. There is a compromise between θ0 and θs (s > 0). The cost function in Eq. (5) essentially converts a multi-objective optimization problem into a single objective optimization using a weighted summation. There are infinite Pareto optimal covering radii to the multi-objective optimization problem. IMOC only searches the covering radii in one dimensional space determined by the upper bounds {θsub}s=0S, which in practice provides good covering radii for both individual shells and the combined shell. See Alg. 3 and Fig. 1(b) and (c). MOC in Alg. 4 first selects one point us,1 for each shell. Then it selects the best shell s and the best point us,i whose coverage set C(us,i, θs) has the largest overlap area with the existing total coverage set CSs ∪ CS0 among all shells and all candidate points outside the existing total coverage set.

Similarly with ISC, for P-C, IMOC can analytically select the best point in each iteration in 2D case in 𝕊1. See Fig. 1(a). However, in 𝕊2, IMOC requires a fine uniform discretization of the sphere to find the best point in each step with the largest overlap area by counting the number of points in overlaping sets. For efficient implementation of IMOC, we consider 3 issues. 1) KD-tree is used for efficient nearest neighbor search. 2) The overlap area As(x) for a candidate point x in the s-th shell can be stored for next step. In the i + 1-th step, As(x) is different from the i-th step and needs to be re-calculated, only if the determined point us′,i in the i-th step satisfies arccos (|xTus′,i|) < θ0, if ss′, or arccos (|xT us′,i|) < θs, if s = s′. 3) In MOC, for P-C, the candidate points are selected from a small “outside surface” of CS, i.e., {x | x ∉ CS, ∃y ∈ CS, s.t. arccos (|xTy|) ≤ δ}, instead of the entire complementary set (𝕊2 − CS). δ is set as twice of the covering radius of the fine uniform sampling set used in IMOC. It is obvious that in the 2D case, this modification does not change the result because the point in each step of MOC is always in the “outside surface”. See Fig. 1(a). For P-C-S and P-C-M in 𝕊2, the results by IMOC do not change neither in our experiments, although we cannot give a proof. These three issues significantly speed up MOC, because the first one reduces the time for each neighborhood search, and the other two largely reduce the number of neighborhood searches. In our implementation, IMOC only requires a few seconds on an ordinary laptop.

C. 1-Opt Algorithm

We propose a greedy algorithm called 1-Opt. For both P-C and P-D, with a given initialization scheme, the 1-Opt algorithm at each step finds the best point pair (us,i from current result and x ∈ 𝔻) such that after setting us,i as x the covering radius for us,i can be best increased. For single shell case, 1-Opt is to increase di defined in Eq. (2). For multi-shell case, 1-Opt is to increase both covering radii of us,i in the combined shell, i.e., ds,i=argmax(s,i)(s,j),j,sarccosus,iTus,j and in the s-th shell, i.e., di=argmaxji,jarccosus,iTus,j. See Alg. 5. 1-Opt is to increase the covering radius for every individual sample. 1-Opt requires an initialization, and a good initialization normally results in a good result. In practice, we use the result by IMOC as the initialization of 1-Opt.

Algorithm 4.

MOC for Multiple shell Scheme Design:

graphic file with name nihms925741f8.jpg

Algorithm 5.

1-Opt for Multi-Shell Scheme Design:

graphic file with name nihms925741f9.jpg

Note that if IMOC is used to design a scheme with many samples for P-C, then there sometimes is a hole area in the result of IMOC. This is because of the discretization used in IMOC for P-C. By applying 1-Opt to the IMOC result, even though 1-Opt sometimes cannot increase the overall covering radius for the whole scheme, it can better arrange the points to fix the hole. See Fig. 3.

Fig. 3.

Fig. 3

Multi-shell sampling schemes with 90 × 3 samples generated by three methods, and the covering radii of 3 shells and the combined shell in schemes shown in Table III. The colors differentiate the sampling points from the three shells. Note that there is a hole area in the scheme by IMOC, which can be fixed by 1-Opt and CNLO.

D. Constrained Non-Linear Optimization (CNLO)

For P-C, SC problem in Eq. (4) can be converted into a continuous optimization, called Constrained Non-Linear Optimization (CNLO):

maxθ,{ui}i=1Kθ (7a)
s.t.arccos(uiTuj)θ,i<jK; (7b)
uiTui=1,i; (7c)

where Eq. (7b) means all separation angles are larger than θ, and Eq. (7c) means the estimated samples are on the unit sphere. After the problem is solved, the solution θ* is the estimated covering radius and there exists i and j such that arccos(ui)Tuj=θ. Note that both cost function and constraints are continuous.

CNLO in Eq. (7) has total K(K-1)2 inequality constraints. Thus for P-C-S or P-C-M with a large number of samples, CNLO may be very slow due to large number of constraints. CNLO is a local optimization method which requires an initialization. Thus for a given initialization {pi} we propose a sped up version of CNLO to reduce the number of inequality constraints via max

maxθ,{ui}i=1Kθ (8a)
s.t.arccos(uiTuj)θ,ifpiTpjcos(2δ0+θub),i<jK; (8b)
arccos(uiTpi)δ0,i; (8c)
uiTui=1,i; (8d)

where δ0 controls the difference between the final results {ui} and the initialization {pi}, and the inequality constraints which have two initialization points far from 2δ0 + θub do not need to be considered in Eq. (8b), because θub is an upper bound. If δ0 = 0, then the result is just the initialization. If δ0 = π/2, then the result of Eq. (8) is the same as the result of Eq. (7). When setting δ0 as a small float number, the number of constraints can be largely reduced and CNLO can be sped up.

Similarly, with a given initialization {ps,i}, P-C-M in Eq. (5) can be solved using CNLO in Eq. (9):

max{θs},θ0,{us,i}w1Si=1Sθs+(1-w)θ0 (9a)
s.t.arccos(us,iTus,j)θs,ifps,iTps,jcos(2δ0+θsub),s,i<jKs; (9b)
arccos(us,iTus,j)θ0,ifps,iTps,jcos(2δ0+θsub),s<s,iKs,jKs; (9c)
arccos(us,iTps,i)δ0,s,i; (9d)
θsθ0,s; (9e)
us,iTus,i=1,s,i; (9f)

δ0 controls the difference between the result and the initialization. Eq. (9b) means that the separation angles of samples in the s-th shell are larger than its corresponding covering radius θs. Eq. (9c) means that the separation angles of samples in two different shells are larger than the covering radius θ0 for the combined shell with all samples. Eq. (9e) means that the covering radii for all single shells are always larger than the covering radius for the combined shell, in other words, Eq. (9e) means that the separation angles of samples in the same shell are larger than θ0.

CNLO in Eq. (8) and Eq. (9) are continuous non-convex constrained optimization problems. We use sequential quadratic programming (SQP) to solve CNLO. In each step, SQP solves a quadratic programming problem which locally approximates the original non-linear problem. In practice, we use the SQP solver implemented in NLOPT library [41] (NLOPT_LD_SLSQP). CNLO yields a locally optimal solution with a given initialization. We suggest to use the result by IMOC + 1-Opt as the initialization. In practice, we set δ0 = 0.1, and keep performing CNLO by updating the initialization {pi} (or {ps,i}) using the previously obtained result.

E. Mixed Integer Linear Programming (MILP)

We propose a Mixed Integer Linear Programming (MILP) algorithm to solve P-D. For D={ui}i=1N, Eq. (4) can be solved using MILP:

maxθ,{hi}i=1Nθ (10a)
s.t.arccos(uiTuj)θ-(2-hi-hj)M,i>j (10b)
θ1bθθub(2K) (10c)
i=1Nhi=K;hi=0,1,i (10d)

where hi = 1 means that ui is selected as one of the K points, θlb is a lower bound of the covering radius, which can be set to 0 or the covering radius from an existing sampling scheme, θub(2K) is an upper bound of the covering radius in Eq. (6), and M is the difference between the maximal distance (denoted as dmax) and the minimal distance (denoted as dmin) of any two points ui, uj ∈ 𝔻, ij. After solving MILP, the solution of θ, denoted as θ*, is the covering radius of the selected K samples. The constraint in Eq. (10b) only takes effect when two points are both selected, i.e., hi = hj = 1, and is automatically satisfied when hi = 0 or hj = 0, because we set M large enough such that arccos(uiTuj)dminθ-(2-hi-hj)M, when hi + hj ≤ 1.

Similarly, Eq. (5) with 𝔻 = {us,i} can be solved using MILP:

max{θs},{hs,i}wS-1s=1Sθs+(1-w)θ0 (11a)
s.t.arccos(uiTuj)θs-(2-hs,i-hs,j)M,s,i>j (11b)
arccos(uiTuj)θ0-(2-hs,i-hs,j)M,s,s,i>j (11c)
θslbθsθub(2Ks),s;θ0lbθ0θub(2s=1SKs) (11d)
i=1Nhs,i=Ks,s;s=1Shs,i1,i;hs,i=0,1,i,s (11e)

where hs,i = 1 means that ui is selected as one of the Ks points on the s-th shell, θslb and θub(2Ks) are the lower and upper bounds of the covering radius on the s-th shell, and θ0lb and θub(2i=1SKs) are the lower and upper bounds for the combined shell with all points. Similarly with Eq. (10b), constraints in Eq. (11b) and Eq. (11c) are respectively for the first and second terms in Eq. (11a). s=1Shs,i1 makes ui to be selected at most in one shell such that the estimated samples are staggered across shells. Note that the above MILP in Eq. (11) is for P-D-MS. For P-D-MM, Eq. (11b) and Eq. (11c) need to be replaced by Eq. (12a) and Eq. (12b)

arccos(us,iTus,j)θs-(2-hs,i-hs,j)M,s,i>j (12a)
arccos(us,iTus,j)θ0-(2-hs,i-hs,j)M,s,s,i>j (12b)

where hs,i = 1 means us,i is selected as one of the Ks points on the s-th shell, and s=1Shs,i1 can be removed from Eq. (11e).

Note that although in general the MILP is NP hard, it can be solved with a globally optimal solution using branch and bound method which iteratively solves the relaxed LP program. In our implementation, we solve Eq. (10) and Eq. (11) using GUROBI [42], which can yield the globally optimal solution or at least a reasonable solution within 10 minutes for P-D. In practice, we progressively increase the lower bound θlb based on the solutions estimated in previous iterations to find a better feasible solution within 10 minutes which is good enough in experiments.

IV. Experiments

A. Effect of Discretization in Incremental Greedy Methods

Incremental greedy methods (i.e., IEEM [39], IGEEM [18], ISC, and IMOC), all require a uniform sample set as a discretization of 𝕊2 to determine the best sample in each step. It is obvious that a finer uniform set may not yield better results for ISC in 2D case. We test the discretization effect in 𝕊2 for both single shell and multi-shell scheme design. We consider two uniform sample sets respectively with 81 samples and 20482 samples from spherical tessellation. With these two uniform sets, IEEM, ISC, IMOC, and IMOC + 1-Opt are performed to generate single shell scheme with K ∈ [5,80], where IMOC + 1-Opt means 1-Opt with the initialization by IMOC. The left subfigure of Fig. 2 depicts the covering radii of schemes obtained by different methods with different uniform sets. It also shows the best known single shell schemes using the SC concept collected by Dr. Sloane 5, denoted as GRASSC, and the best known single shell schemes using EEM collected in CAMINO [25]. When K ∈ [50,80] is close to 81 (the number of uniform samples in the coarse uniform set), covering radii by IEEM and ISC are not improved using the finer set, and when K < 50 is far from 81, IEEM and ISC obtain larger covering radii using finer discretization. When using the finer uniform set, covering radii by IMOC and IMOC + 1-Opt are improved, close to the best known solutions in GRASSC, and are much larger than covering radii of the schemes in CAMINO. ISC, IMOC, IMOC + 1-Opt are performed to generate schemes with 3 shells, K ∈ [5,25] per shell by using these two uniform sets. The right subfigure of Fig. 2 shows mean of covering radii of three shells and the covering radii of the combined shell. It also demonstrates that when K ×3 is close to 81, covering radii of the combined shell by ISC are not improved using the finer set, while IMOC has no such issue. The experiment shows that IMOC and IMOC + 1-Opt generally obtain better results than ISC and IEEM for P-C. CNLO can further improve the results by IMOC + 1-Opt, and the covering radii by IMOC + 1-Opt + CNLO obtains the best results, very closed to the solutions in GRASSC in single shell case. Thus, as shown in Table II, ISC is suggested for P-O, and IMOC + 1-Opt + CNLO is suggested for P-C.

Fig. 2. Effect of discretization.

Fig. 2

The left figure shows the covering radii of single shell schemes with K samples obtained by methods using two uniform sets with 81 samples and 20481 samples. The right figure shows the covering radii of multi-shell schemes with K × 3 samples, where θs is the mean of covering radii in three shells, and θ0 is the covering radius of the combined shell.

B. Covering Radii in Sampling Schemes

We evaluate the proposed method by generating multi-shell schemes with K ×3 samples. We test two cases with K = 28 and K = 90, considering these two cases were also used for evaluation in [18], and the multi-shell scheme with 90 × 3 samples by IGEEM was also used in HCP [18], [26].

Table III shows the covering radii of the schemes by different methods, where IMOC + 1-Opt + CNLO means CNLO using the result by IMOC + 1-Opt as the initialization. Sparse and Optimal Acquisition (SOA) in [20], which separates a uniform scheme into several sets by using a greedy algorithm, is performed using the software released by the author 6. The result by SOA for the scheme with 90 × 3 samples is not shown, because it did not terminate after running for 2 days. The solid rotation method in [24] starts from the single shell scheme with 28 samples by EEM, and finds the optimal three rotations for the 3 shells, such that the electrostatic energy in the combined shell is minimized. The scheme by the solid rotation method is the 3 rotated EEM schemes, and the covering radii of the scheme were extracted from [18]. Note that we run GEEM by using the latest software released by the authors 7, which obtains better covering radii for individual shells but a slightly worse covering radius for the combined shell than the original results shown in [18]. As references, the table also lists the best known results for single shell with K and K ×3 samples using EEM from CAMINO [25] and the SC formulation from 5 [34]. Note that the precomputed schemes by EEM and GRASSC are only for the single shell case, not for the multi-shell case. If this single shell scheme (with 28 or 90 samples) is repeated for 3 shells (i.e., with 3 different b values), then the covering radius for the combined shell (with 84 samples or 270 samples) is actually 0°. The scheme with 84 samples by EEM is not related with the scheme with 28 samples by EEM. IMOC obtains better covering radii than incremental methods, i.e., IGEEM, ISC, and it is even better than MILP, which is because the globally optimal schemes by MILP are determined only from a coarse uniform set with 321 samples, not from a fine uniform set. It is impractical for MILP to work with 20481 samples, because MILP is NP hard. For both 28 ×3 and 90 ×3 cases, IMOC + 1-Opt + CNLO obtains best schemes with largest covering radii in both 3 individual shells and the combined shell, and the covering radii by IMOC + 1-Opt + CNLO in these 3 shells are even better than best known scheme collected in CAMINO using EEM [11], [25], similarly with the left subfigure in Fig. 2.

TABLE III.

Covering radii of multi-shell sampling schemes with 28 × 3 and 90 × 3 samples generated by various methods with the corresponding running time.

Shell 1 (28) Shell 2 (28) Shell 3 (28) Combined (28 × 3) Running Time

SOA [20] 15.9° 15.8° 16.7° 15.1° 36 min
Solid Rotation [24] 25.7° 25.7° 25.7° 7.4° not available
IGEEM [18] 19.2° 19.7° 19.3° 4.7° 0.3 s
GEEM [18] 23.6° 24.3° 23.4° 12.3° 3.7 s
ISC (N = 20481) 21.3° 19.3° 21.1° 10.5° 0.3 s
MILP (N = 321) 23.8° 23.8° 24.3° 13.3° 10 min
IMOC (N = 20481) 24.3° 24.3° 24.3° 14.0° 10.5 s
IMOC + 1-Opt (N = 20481) 24.3° 24.4° 24.3° 14.0° 12.8 s
IMOC + 1-Opt + CNLO 26.1° 26.3° 26.9° 14.4° 39.7 s

EEM (CAMINO) [11], [25] 25.7° 25.7° 25.7° 15.6° not available
GRASSC 5 [34] 27.8° 27.8° 27.8° 16.2° not available

Shell 1 (90) Shell 2 (90) Shell 3 (90) Combined (90 × 3) Running Time

IGEEM [18] 10.8° 10.3° 10.5° 2.4° 0.6 s
GEEM [18] 12.9° 13.3° 12.6° 7.2° 11 min
ISC (N = 20481) 10.4° 9.7° 10.4° 4.6° 0.6 s
MILP (N = 321) 13.3° 13.5° 13.3° 7.9° 10 min
IMOC (N = 20481) 13.5° 13.5° 13.5° 7.8° 7.8 s
IMOC + 1-Opt (N = 20481) 13.6° 13.5° 13.5° 7.8° 29.6 s
IMOC + 1-Opt + CNLO 14.6° 14.6° 14.7° 8.4° 31 min

EEM (CAMINO) [11], [25] 15.1° 15.1° 15.1° not available not available
GRASSC 5 [34] 15.7° 15.7° 15.7° not available not available

Table III also shows the running time for different methods on an ordinary laptop. Incremental methods (IGEEM, ISC, IMOC) are all very fast. IGEEM and ISC have similar running time, because they can be implemented in the same efficient way to maintain the table of increased cost functions between the selected sample set and each unselected sample, and update the table when each new sample is selected. A similar acceleration strategy can be used for 1-Opt. MILP is terminated after 10 minutes, because we found that the cost function of MILP changes very slow after 10 minutes. Compared with the 28×3 scheme, the running time of estimating the 90×3 scheme by all methods except IMOC increases largely. The running time of GEEM increases from 3.7 seconds to 11 minites, and the running time of CNLO increases from 39.7 seconds to 31 minutes, because of the larger number of constraints. The running time of IMOC decreases for the 90 ×3 scheme, probably because the iteration number of MOC decreases for the 90 ×3 scheme, compared with the 28 ×3 scheme. It should be noted that the running time of scheme design methods is actually not very important for practical usage, because the schemes are optimized only once for scanners.

Fig. 3 shows the schemes by IMOC, IMOC + 1-Opt, and IMOC + 1-Opt + CNLO. It shows that 1) IMOC + 1-Opt can fix the hole in the scheme by IMOC; 2) IMOC + 1-Opt increases the covering radii of individual shells of the scheme by IMOC, although the covering radius of the combined shell keeps the same (7.78°); 3) CNLO can refine the scheme by IMOC + 1-Opt. Thus for P-C, we do not use IMOC as a stand-alone method. Instead, IMOC + 1-Opt + CNLO is suggested for P-C, where two greedy methods (i.e., IMOC + 1-Opt) are used to set initialization of gradient based optimization method CNLO.

C. Rotational Invariance in Reconstruction

We test rotational invariance of the schemes with 28 × 3 samples by different methods in Table III. We generate synthetic diffusion signal from a mixture of cylinder model [43], where two cylinders cross with 60° and share the cylinder parameters of length L = 5 mm, radius ρ = 5 μm, diffusivity D0 = 2.02 × 10−3 mm2/s. With each tested scheme, we rotate the model and generate signals 20481 times with the rotation angles determined by the uniform sample set with 20481 samples. Then we perform Non-Negative Spherical Deconvolution (NNSD) [9] with the spherical harmonic (SH) order 8 to estimate fODFs, detect the peaks, and calculate the mean angular differences by comparing the detected peaks with the ground-truth fiber directions in these 20481 tests. A good sampling scheme should give low mean angular differences with low standard deviations. Table IV lists the mean and standard deviation of angular differences obtained by NNSD using different schemes. IMOC + 1-Opt +CNLO yields the lowest angular differences with the lowest deviation, and IMOC has the lower mean and deviation than IGEEM.

TABLE IV.

Angular differences between estimated directions and ground-truth fiber directions using different schemes by various methods.

SOA [20] IGEEM [17], [18] GEEM [18] IMOC IMOC + 1-Opt IMOC + 1-Opt + CNLO
Angular Difference 1.41° ± 0.75° 1.67° ± 0.81° 1.23° ± 0.73° 1.43° ± 0.77° 1.31° ± 0.71° 1.16° ± 0.64°

We compare the 28×3 schemes and 90×3 schemes in DWI signal reconstruction by using tensorial Spherical Polar Fourier Imaging (SPFI) [16]. The DWI signals are synthesized using a mixture of tensor model with two symmetric tensors with eigenvalues [1.7, 0.2, 0.2] × 10−3 mm2=s with a crossing angle of 60°. We rotate the model and generate signals 20481 times with the rotation angles determined by the uniform sample set with 20481 samples. Tensorial SPFI with spherical order of 8 and radial order of 4 is used to fit the data and re-sample the DWI signals at continues 3 shells with a fine tessellation of 20481 samples. Then normalized mean Root-Mean-Squared-Error (RMSE) of DWI signals in 3 shells is calculated. We also add Rician noise with SNR=30 (i.e., standard deviation of 1=30) for 1000 trails, and re-calculate mean normalized RMSE in reconstruction using noisy samples. Table V shows the mean and standard deviation of normalized RMSE in reconstruction using noise-free samples and noisy samples. It shows that IMOC + 1-Opt + CNLO obtains lower mean and std of normalized RMSE values in reconstruction, especially when the number of samples is small and SNR is high. However, when using a larger number of noisy samples in SPFI (e.g., 90 × 3 scheme with noise, SNR=30), the normalized RMSE results by IMOC + 1-Opt + CNLO and GEEM are similar. It also shows that the 90 × 3 scheme by IMOC has much larger standard deviation (i.e., rotational invariance) of normalized RMSE than schemes by other methods, which is because of the hole area effect in IMOC. IMOC + 1-Opt and IMOC + 1-Opt + CNLO reduce the rotational invariance of IMOC.

TABLE V.

Normalized RMSE of DWI signal reconstruction using tensorial SPFI.

SOA [20] IGEEM [17], [18] GEEM [18] IMOC IMOC + 1-Opt IMOC + 1-Opt + CNLO

28 × 3, no noise 0.46% ± 0.048% 0.42% ± 0.046% 0.38% ± 0.062% 0.38% ± 0.0058% 0.36% ± 0.054% 0.35% ± 0.055%
90 × 3, no noise not available 0.176% ± 0.0044% 0.174% ± 0.0022% 0.179% ± 0.016% 0.176% ± 0.0035 0.172% ± 0.0021%
28 × 3, SNR=30 4.39% ± 0.58% 4.33% ± 0.60% 4.31% ± 0.58% 4.35% ± 0.61% 4.33% ± 0.59% 4.31% ± 0.56%
90 × 3, SNR=30 not available 2.57% ± 0.34% 2.55% ± 0.33% 2.58% ± 0.35% 2.56% ± 0.34% 2.54% ± 0.33%

D. MILP for Separation of Sampling Schemes

We evaluate the effectiveness of the proposed MILP in Eq. (11) with w = 1 to solve P-D-MS, by verifying whether it can separate a set of samples into several subsets, keeping the points in each subset as uniform as possible. We compared MILP with the subsetpoints program in CAMINO [11], [25] and dirsplit in MRtrix 8 for the same task. For this evaluation, we randomly mix two sets of uniform points, where one set has 81 points generated by spherical tessellation (with covering radius 15.9°) and the other set has 60 points (with covering radius 18.3°) from CAMINO generated by EEM. Separation of these 141 points into subsets respectively with 81 and 60 samples should ideally give results that match the original uniform point sets. MILP obtained results that exactly match the original point sets within 1 second. subsetpoints in CAMINO, which minimizes electrostatic energy by using simulated annealing, saves the result every hour when it runs. It obtains two incorrect points in each subset after running for 2 hours, 7 incorrect points after 8 hours, and the correct result after 9 hours. Although the correct result has been obtained, the program continues to run for hours until the simulated annealing temperature is finally small enough. dirsplit in MRtrix initializes two sets and then randomly swaps 2 directions in each step to reduce the electrostatic energy. dirsplit can only separate the scheme into 2 schemes with nearly equal numbers 71 and 70. With default parameters and multithreading, it finished running after 16 minutes. However, the covering radii in these two sets are 6.4° and 5.1° respectively, different from the original two uniform sets.

E. MILP for sub-sampling of multi-shell HCP scheme

In order to reduce the scanning time and reuse the existing data, we may want to uniformly select a sub-sampled scheme from an existing scheme. IGEEM has been successfully used in multi-shell diffusion data acquisition in HCP [17], [18], [26]. The scheme has 3 shells with b values 1000, 2000, and 3000 s=mm2, 90 samples per shell. See top left subfigure in Fig. 4. MILP and a random selection are performed to obtain two sub-sampled schemes with 45×3 samples. It is a problem of P-D-MM, which to our knowledge can not be solved by any existing released routine (e.g., in CAMINO or MRtrix). MILP yields the covering radius 7.0° in the combined shell, while the random selection yields 2.4°, which is the same as the original HCP scheme. See first row in Fig. 4. We also perform NNSD with SH order 8 on a subject with the full scheme of 90 × 3 samples and two sub-sampled schemes with 45 × 3 samples from MILP and random selection. The corresponding fODF fields with detected peaks are displayed in Fig. 4 with GFA as backgrounds. Peaks are detected from the fODFs with GFA higher than 0.5. Fig. 4 shows that the estimated fODF and peaks from sub-sampled scheme by MILP are much closer to the results from the full scheme, compared with the random selection. Note that fODFs near grey matter areas by the scheme from random selection are very noisy, compared with the results using the full scheme. That is because that random selection cannnot keep the uniformity in the sub-sampled scheme. In the region visualized in Fig. 4, the mean angular difference of the detected peaks is 4.2° between the results from the full scheme and the sub-sampled scheme by MILP, and 5.9° between the results from the full scheme and the sub-sampled scheme by random selection. The normalized RMSE of the estimated GFA of fODFs is 7.5% for the sub-sampled scheme by MILP and is 18.9% for the scheme by random selection.

Fig. 4.

Fig. 4

First row: full HCP scheme, sub-sampled schemes by MILP and random selection, respectively. Second row: fODFs fields estimated from the data sets corresponding to schemes, where fODF glyphs are colored by directions, the background is the GFA map, and the yellow tubes denote detected peaks.

V. Discussion

A. Sampling scheme design without antipodal symmetry

The SC formulation and EEM formulation [11], [17]–[23] both are to design sampling schemes with the antipodal symmetry constraint, based on prior knowledge that the diffusion signal is antipodal symmetric, i.e., E(q) = E(−q). However, some works proposed to model asymmetry of diffusion signals [44] or fiber ODFs [45], which may prefer sampling schemes without the antipodal symmetry constraint. Moreover general uniform single or multiple shell sampling schemes (i.e., without the antipodal symmetry constraint) are also be useful in some applications other than diffusion MRI.

Although the proposed SC formulation and 5 algorithms focus on sampling scheme design with antipodal symmetry, they can be easily performed to design general sampling schemes without the antipodal symmetry constraint. Actually the original SC formulation in mathematics [33], [35] does not consider the antipodal symmetry constraint. If the absolute value operator is removed from the definition of covering radius in Eq. (3), then Eq. (4) and Eq. (5) are SC formulations respectively for general single and multiple shell scheme design, where Eq. (4) is the original SC formulation in [33], [35]. ISC and 1-Opt can be used for general uniform sampling scheme design straightforwardly, based on the revised definition of covering radius. For IMOC, if we revise the definition of the coverage set of a point x as C(x, θ) = {y | arccos(yT x) < θ, y ∈ 𝕊2}, without the absolute value operator, then IMOC can be used for general uniform scheme design. In Fig. 1(a), with the revised definition, the covering set does not include the yellow arc generated by the dashed samples in the antipodal positions. For CNLO and MILP, if we remove the absolute operator in Eq. (7), (8), (9), (10) and (11), then CNLO and MILP can be used for general uniform sampling scheme design without the antipodal symmetry constraint. For each proposed algorithm, the strategy and procedure remain the same if the antipodal symmetry constraint is removed.

B. Sampling scheme design with other considerations

1) Diffusion models and model fitting

As discussed in the introduction part, both the SC formulation and EEM formulation are to design uniform sampling schemes in single or multiple shells with parameters S and {Ks}s=1S. This is based on the fact that diffusion signals in voxels have various preferred orientations, while we need to design one scheme for all signals in human brain. The number of b values S, and the number of samples in each shell {Ks}s=1S, are both user determined. Different diffusion models, e.g., tensor model and various HARDI models, may prefer different distributions of b values in the radial part of the q-space, and different number of samples in low or b values. Moreover, even for the same model, different model fitting algorithms may prefer different schemes [12]. The samples with high b values capture more orientational information, while have lower single-to-noise ratio (SNR), than the samples with low b values. Thus robust estimation algorithms [9], [14], [16], which are less sensitive to noise, may prefer larger number of samples with high b values.

2) Eddy current correction

Recent works on eddy current correction [29], [46], [47] showed that three kinds of acquisitions are useful for eddy current correction, i.e., 1) the blip-up blip-down acquisition, which scans the the data twice with flipped phase-encoding directions; 2) the acquisition with gradients on the whole sphere that includes both {ui} and {−ui}; 3) the acquisition with gradients on the whole sphere without the antipodal symmetry constraint. Schemes by SC can be directly used in the blip-up blip-down acquisition with flipped phase-encoding directions. and can be used in the second kind of acquisition by incorporating the flipped gradients {−ui}. For the third acquisition on the whole sphere without the antipodal symmetry constraint, we can perform the proposed optimization methods without the constraint, as described in Section V-A.

3) Gradient reorientation

The uniform schemes generated by SC or EEM are used to scan DWI data. Some preprocessing steps, including motion correction, eddy current correction, registration of DWIs, etc., may require gradient reorientation [48], [49]. Gradient reorientation changes each individual gradient, and also the separation angles between gradients. Thus even though the designed schemes by SC for scanning data are optimized with large angular separation, the reorientated scheme for model fitting may be suboptimal. This issue also exists for schemes by EEM, where the electrostatic energy is sub-optimal after reorientation. The gradient reorientation issue has not been considered in existing works on uniform scheme design [11], [17]–[23], because the transform matrix for each gradient vector is dependent on DWI images and also the image registration algorithm used in the preprocessing steps. In this paper, we show that in both SC and EEM formulations, uniform sampling scheme design before gradient reorientation is an approximation of uniform sampling scheme design after gradient reorientation, in an approximation algorithm point of view 9.

Intuitively, if the transform matrix is close to the identity matrix, then the reoriented scheme are closed to the original scheme which has been well optimized. This intuition can be quantitatively described in Proposition 1, which shows that if the transform matrix is inside a ball of the identity matrix with radius δ, then the inner product of two reoriented gradient vectors x1Tx2 is inside a range centered at the inner product of the original two gradient vectors u1Tu2 with a radius of 2δ+δ2. The proof of this proposition is shown in Appendix A.

Proposition 1: Angular separation after gradient reorientation

Let u1 and u2 be two gradient vectors in 𝕊2. Denote the two gradient vectors after reorientation as x1 and x2, and the corresponding transform matrices as T1 and T2 such that xi = Tiui, i = 1, 2. If ||TiI||2δ, where I is the identity matrix, then the reoriented gradient vectors satisfies x1Tx2-u1Tu22δ+δ2.

Note that Proposition 1 works for gradient reorientation using rotation matrices and also general affine transforms [48], where if Ai is the affine matrix used to reorient ui such that xi=AiuiAiui2, then Ti=AiAiui2. Proposition 1 means that the separation angle of two reoriented gradients arccos(x1Tx2) is in [ arccos(u1Tu2+2δ+δ2),arccos(u1Tu2-2δ-δ2)]. And the minimal separation angle of any two reoriented gradients is also in a range defined by the separation angle of two original gradient vectors. Thus, in an approximation algorithm point of view 9, maximizing the angular separation of the original scheme for scanner is to approximately maximize the angular separation of the reorientated scheme for model fitting. In practice, the transform matrix between a reoriented gradient vector and an original gradient vector in motion correction or eddy current correction is close to the identity matrix, i.e., ||TI||2 < δ, and δ is a small value. If the transform matrix T is the rotation about z-axis with the degree of 5°, then ε = 0.123 that is about 12% of the norm of I. In practice, the rotation angle of the transform matrix is smaller than 5°. Thus the change of the separation angle after gradient reorientation is small.

For the EEM formulation, as shown in Eq. (1), the electrostatic energy between ui and uj is

f2(ui,uj)=1ui-uj22+1ui+uj22=11-(uiTuj)2[11-(uiTuj-2δ-δ2)2,11-(uiTuj+2δ+δ2)2] (13)

Based on Proposition 1, after gradient reorientation, the electrostatic energy between xi and xj is

f2(xi,xj)=11-(xiTxj)2[11-(uiTuj-2δ-δ2)2,11-(uiTuj+2δ+δ2)2] (14)

Note that here we assume uiTuj+2δ+δ2<1. The length of the range is

11-(uiTuj+2δ+δ2)2-11-(uiTuj-2δ-δ2)2=4uiTuj(2δ+δ2)1-(uiTuj+2δ+δ2)2)(1-(uiTuj-2δ-δ2)2)4c0(2δ+δ2)(1-(c0+2δ+δ2)2)(1-(c0-2δ-δ2)2)=L(c0,δ) (15)

where c0=maxi<juiTuj<1-2δ-δ2. Thus, the electrostatic energy of the reoriented sampling scheme Σi< j f2(ui, uj), and the electrostatic energy of the original scheme Σi< j f2(xi, xj), are both inside a range, and we have

|i<jf2(ui,uj)-i<jf2(xi,xj)|K(K-1)L(c0,δ) (16)

where K is the number of samples in the single shell scheme. Thus, minimizing the electrostatic energy of the original scheme is to approximately minimize the energy of the reoriented scheme. When δ is small, i.e., the transform matrix is close to the identity matrix, the range length in Eq. (16) is small and the electrostatic energies of the original scheme and the reoriented scheme are close.

We have shown that uniform sampling scheme design before gradient reorientation is an approximation of uniform sampling scheme design after gradient reorientation. However, it should be noted that if the transform matrix due to gradient reorientation is far from the identity matrix, the approximation error can be large. As shown in experiments, the proposed SC method yields larger angular separation (i.e., covering radius) between samples than the state-of-the-art EEM and GEEM method. If we assume that gradient reorientation reduces the covering radius of all schemes by the same amount (< 5°), then, after gradient reorientation, the scheme by SC still has larger covering radius than the scheme by EEM.

4) Minimizing gradient heating

The SC formulation in this paper is to design uniform sampling scheme in shells, similarly with the works based on EEM [11], [17]–[23], which have not yet considered gradient heating specifically. We have proposed to use ISC for P-O, where the first partial set of samples are nearly uniformly separated. It was reported in [40] that optimal ordering not only ensures that usable data are available if the scan needs to be cut short, but also reduces the risk of systematic errors due to gradient heating and subject motion. Minimizing gradient heating is beyond of the content of this paper, and gradient heating may be reduced in different ways [50], [51]. It can be a future work to investigate which way of reducing gradient heating is more appropriate to be incorporated into current SC framework in this paper.

VI. Conclusion

In this paper, we proposed to use the Spherical Code concept, which directly maximizes angular separations between samples, to design single- and multi-shell uniform sampling schemes. We showed that various continuous and discrete sampling scheme design problems can be formulated using SC. We proposed five novel algorithms (i.e., ISC, IMOC, 1-Opt, MILP, CNLO) based on SC for uniform sampling scheme design. See Table I. ISC is mainly to order samples in an existing scheme. IMOC and 1-Opt are to quickly obtain schemes in deterministic greedy ways. Although IMOC is a greedy incremental method, it obtains globally optimal solution in 𝕊1. For the sphere 𝕊2 in dMRI, covering radii of schemes by IMOC are very close to the best known solutions in single shell case 5, and are much larger than EEM in CAMINO. CNLO is a local optimization method which refines a given initialization (normally from IMOC+1-Opt). MILP is used to perform sub-sampling in an existing scheme. It can obtain globally optimal solution or a good result within 10 minutes. The experiments showed that compared with the state-of-the-art methods [18], [20] based on electrostatic energy which is used in HCP, the multi-shell scheme by IMOC + 1-Opt + CNLO yields larger covering radii in general, and better rotational invariance in dMRI reconstruction when the SNR is high and the number of samples is small. However, when a large number of noisy samples are used in DWI reconstruction, the sampling schemes by different methods may have similar reconstruction results. The proposed SC methods are to maximize the angular separation between samples, independent of diffusion models and diffusion reconstruction. The optimal sampling scheme that yields the best reconstruction quality is dependent on the diffusion model, reconstruction algorithm, and the ground truth diffusion signal. It is a future work to optimize the sampling schemes for a given specific diffusion model and a given reconstruction algorithm, so that the final result of the diffusion reconstruction is optimized. We have released some demos and codes in DMRITool 1.

Acknowledgments

The authors would like to thank Dr. Ruiliang Bai and Dr. Alexandru Avram for useful discussions. This work was supported in part by funds provided by the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) (ZIA-HD000266), and NIH grants (NS093842 and EB022880). The data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

Appendix A

Proof of Proposition 1

Denote Ti = I + Ci, then we have ||Ci||2δ, and

x1Tx2=u1T(I+C1T)(I+C2)u2=u1Tu2+u1T(C1T+C2)u2+u1TC1TC2u2

Let c=u1T(C1T+C2)u2+u1TC1TC2u2, then

x1Tx2-u1Tu2c

Because u1T(C1T+C2)u2C1T+C222δ, and u1TC1TC2u2C1T2C22=δ2, we have c ≤ 2δ + δ2. Then

x1Tx2-u1Tu22δ+δ2 (17)

Footnotes

Contributor Information

Jian Cheng, SQITS, NIBIB, NICHD, National Institutes of Health.

Dinggang Shen, Department of Radiology and Biomedical Research Imaging Center (BRIC), the University of North Carolina at Chapel Hill. Department of Brain and Cognitive Engineering, Korea University, Seoul 02841, Republic of Korea.

Pew-Thian Yap, Department of Radiology and Biomedical Research Imaging Center (BRIC), the University of North Carolina at Chapel Hill.

Peter J. Basser, SQITS, NIBIB, NICHD, National Institutes of Health

References

  • 1.Basser PJ, Mattiello J, LeBihan D. MR Diffusion Tensor Spectroscropy and Imaging. Biophysical Journal. 1994;66:259–267. doi: 10.1016/S0006-3495(94)80775-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Tuch DS, Reese TG, Wiegell MR, Makris N, Belliveau JW, Wedeen VJ. High Angular Resolution Diffusion Imaging Reveals Intravoxel White Matter Fiber Heterogeneity. Magnetic Resonance in Medicine. 2002;48:577–582. doi: 10.1002/mrm.10268. [DOI] [PubMed] [Google Scholar]
  • 3.Tuch DS. Q-ball imaging. Magnetic Resonance in Medicine. 2004;52:1358–1372. doi: 10.1002/mrm.20279. [DOI] [PubMed] [Google Scholar]
  • 4.Cheng J, Ghosh A, Jiang T, Deriche R. Model-free and Analytical EAP Reconstruction via Spherical Polar Fourier Diffusion MRI. Medical Image Computing and Computer-Assisted Intervention (MICCAI’10); sep 2010; pp. 590–597. [DOI] [PubMed] [Google Scholar]
  • 5.Özarslan E, Koay CG, Shepherd TM, Komlosh ME, Ïrfanoğlu MO, Pierpaoli C, Basser PJ. Mean apparent propagator (MAP) MRI: a novel diffusion imaging method for mapping tissue microstructure. NeuroImage. 2013;78:16–32. doi: 10.1016/j.neuroimage.2013.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Descoteaux M, Angelino E, Fitzgibbons S, Deriche R. Regularized, Fast and Robust Analytical Q-ball Imaging. Magnetic Resonance in Medicine. 2007;58:497–510. doi: 10.1002/mrm.21277. [DOI] [PubMed] [Google Scholar]
  • 7.Cheng J, Ghosh A, Deriche R, Jiang T. Model-Free, Regularized, Fast, and Robust Analytical Orientation Distribution Function Estimation. Medical Image Computing and Computer-Assisted Intervention (MICCAI’10); sep 2010; pp. 648–656. [DOI] [PubMed] [Google Scholar]
  • 8.Tournier JD, Calamante F, Connelly A. Robust determination of the fibre orientation distribution in diffusion MRI: non-negativity constrained super-resolved spherical deconvolution. NeuroImage. 2007;35(4):1459–1472. doi: 10.1016/j.neuroimage.2007.02.016. [DOI] [PubMed] [Google Scholar]
  • 9.Cheng J, Deriche R, Jiang T, Shen D, Yap PT. Non-Negative Spherical Deconvolution (NNSD) for estimation of fiber Orientation Distribution Function in single-/multi-shell diffusion MRI. NeuroImage. 2014;101:750–764. doi: 10.1016/j.neuroimage.2014.07.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Jeurissen B, Tournier JD, Dhollander T, Connelly A, Sijbers J. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. NeuroImage. 2014;103:411–426. doi: 10.1016/j.neuroimage.2014.07.061. [DOI] [PubMed] [Google Scholar]
  • 11.Jones DK, Horsfield MA, Simmons A. Optimal strategies for measuring diffusion in anisotropic systems by magnetic resonance imaging. Magnetic Resonance in Medicine. 1999;42:515–525. [PubMed] [Google Scholar]
  • 12.Cheng H, Zhu J. Diffusion Magnetic Resonance Imaging (dMRI) 2016 [Google Scholar]
  • 13.Koay CG, Chang LC, Carew JD, Pierpaoli C, Basser PJ. a unifying theoretical and algorithmic framework for least squares methods of estimation in diffusion tensor imaging. Journal of Magnetic Resonance. 2006;182:115–125. doi: 10.1016/j.jmr.2006.06.020. [DOI] [PubMed] [Google Scholar]
  • 14.Chang LC, Jones DK, Pierpaoli C. RESTORE: robust estimation of tensors by outlier rejection. Magnetic resonance in medicine. 2005;53(5):1088–1095. doi: 10.1002/mrm.20426. [DOI] [PubMed] [Google Scholar]
  • 15.Cheng J, Merlet S, Caruyer E, Ghosh A, Jiang T, Deriche R. Compressive Sensing Ensemble Average Propagator Estimation via L1 Spherical Polar Fourier Imaging. MICCAI Workshop on Computational Diffusion MRI (CDMRI’11); 2011. [Google Scholar]
  • 16.Cheng J, Shen D, Yap P-T, Basser PJ. Tensorial Spherical Polar Fourier diffusion MRI with Optimal Dictionary Learning. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI’15); Springer; 2015. pp. 174–182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Caruyer E, Cheng J, Lenglet C, Sapiro G, Jiang T, Deriche R. Optimal Design of Multiple Q-shells experiments for Diffusion MRI. MICCAI Workshop on Computational Diffusion MRI (CDMRI’11); 2011. [Google Scholar]
  • 18.Caruyer E, Lenglet C, Sapiro G, Deriche R. Design of multishell sampling schemes with uniform coverage in diffusion MRI. Magnetic Resonance in Medicine. 2013 doi: 10.1002/mrm.24736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zhan L, Leow AD, Aganj I, Lenglet C, Sapiro G, Yacoub E, Harel N, Toga AW, Thompson PM. Differential information content in staggered multiple shell hardi measured by the tensor distribution function. Biomedical Imaging: From Nano to Macro, 2011 IEEE International Symposium on; IEEE; 2011. pp. 305–309. [Google Scholar]
  • 20.Koay CG, Özarslan E, Johnson KM, Meyerand ME. Sparse and optimal acquisition design for diffusion MRI and beyond. Medical physics. 2012;39(5):2499–2511. doi: 10.1118/1.3700166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Koay CG. Pseudometrically constrained centroidal voronoi tessellations: Generating uniform antipodally symmetric points on the unit sphere with a novel acceleration strategy and its applications to diffusion and three-dimensional radial MRI. Magnetic resonance in medicine. 2014;71(2):723–734. doi: 10.1002/mrm.24715. [DOI] [PubMed] [Google Scholar]
  • 22.Dubois J, Poupon C, Lethimonnier F, Le Bihan D. Optimized diffusion gradient orientation schemes for corrupted clinical dti data sets. Magnetic Resonance Materials in Physics, Biology and Medicine. 2006;19(3):134–143. doi: 10.1007/s10334-006-0036-0. [DOI] [PubMed] [Google Scholar]
  • 23.Cook P, Symms M, Boulby P, Alexander D. Optimal acquisition orders of diffusion-weighted MRI measurements. Journal of Magnetic Resonance Imaging. 2007;25(5):1051–1058. doi: 10.1002/jmri.20905. [DOI] [PubMed] [Google Scholar]
  • 24.De Santis S, Assaf Y, Evans C, Jones D. Improved precision in the charmed model of white matter through sampling scheme optimization and model parsimony testing. Proceedings of the International Society for Magnetic Resonance in Medicine; 2011. p. 3928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Cook P, Bai Y, Nedjati-Gilani S, Seunarine K, Hall M, Parker G, Alexander D. Camino: Open-source diffusion-MRI reconstruction and processing. ISMRM; 2006. [Google Scholar]
  • 26.Sotiropoulos SN, Jbabdi S, Xu J, Andersson JL, Moeller S, Auerbach EJ, Glasser MF, Hernandez M, Sapiro G, Jenkinson M, et al. Advances in diffusion MRI acquisition and processing in the Human Connectome Project. NeuroImage. 2013;80:125–43. doi: 10.1016/j.neuroimage.2013.05.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Hagmann P, Jonasson L, Deffieux T, Meuli R, Thiran J-P, Wedeen VJ. Fibertract segmentation in position orientation space from high angular resolution diffusion MRI. NeuroImage. 2006;32(2):665–675. doi: 10.1016/j.neuroimage.2006.02.043. [DOI] [PubMed] [Google Scholar]
  • 28.St-Jean S, Coupé P, Descoteaux M. Non local spatial and angular matching: Enabling higher spatial resolution diffusion MRI datasets through adaptive denoising. Medical image analysis. 2016;32:115–130. doi: 10.1016/j.media.2016.02.010. [DOI] [PubMed] [Google Scholar]
  • 29.Andersson JL, Sotiropoulos SN. An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. Neuroimage. 2016;125:1063–1078. doi: 10.1016/j.neuroimage.2015.10.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Saghafi B, Kim J, Chen G, Shi F, Lin W, Yap PT, Shen D. Spatio-angular consistent construction of neonatal diffusion MRI atlases. Human Brain Mapping. 2017;38(6):3175–3189. doi: 10.1002/hbm.23583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Frank LR. Characterization of anisotropy in high angular resolution diffusion-weighted MRI. Magnetic Resonance in Medicine. 2002;47(6):1083–1099. doi: 10.1002/mrm.10156. [DOI] [PubMed] [Google Scholar]
  • 32.Wedeen VJ, Hagmann P, Tseng WYI, Reese TG, Weisskoff RM. Mapping Complex Tissue Architecture With Diffusion Spectrum Magnetic Resonance Imaging. Magnetic Resonance In Medicine. 2005;54:1377–1386. doi: 10.1002/mrm.20642. [DOI] [PubMed] [Google Scholar]
  • 33.Tóth LF. On the densest packing of spherical caps. The American Mathematical Monthly. 1949;56(5):330–331. [Google Scholar]
  • 34.Conway JH, Hardin RH, Sloane NJ. Packing lines, planes, etc.: Packings in Grassmannian spaces. Experimental mathematics. 1996;5(2):139–159. [Google Scholar]
  • 35.Musin OR, Tarasov AS. The tammes problem for n= 14. Experimental Mathematics. 2015;24(4):460–468. [Google Scholar]
  • 36.Papadakis NG, Murrills CD, Hall LD, Huang CL-H, Adrian Carpenter T. Minimal gradient encoding for robust estimation of diffusion anisotropy. Magnetic Resonance Imaging. 2000;18(6):671–679. doi: 10.1016/s0730-725x(00)00151-x. [DOI] [PubMed] [Google Scholar]
  • 37.Cheng J, Shen D, Yap P-T. Designing single- and multiple-shell sampling schemes for diffusion MRI using spherical code. Medical Image Computing and Computer-Assisted Interventio (MICCAI’14); Springer; 2014. pp. 281–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Cheng J, Shen D, Yap P-T, Basser PJ. Novel single and multiple shell uniform sampling schemes for diffusion MRI using spherical codes. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI’15); Springer; 2015. pp. 28–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Deriche R, Calder J, Descoteaux M. Optimal real-time q-ball imaging using regularized kalman filtering with incremental orientation sets. Medical image analysis. 2009;13(4):564–579. doi: 10.1016/j.media.2009.05.008. [DOI] [PubMed] [Google Scholar]
  • 40.Pierpaoli C. Artifacts in diffusion MRI. Oxford University Press; New York: 2010. pp. 303–317. [Google Scholar]
  • 41.Johnson SG. The nlopt nonlinear-optimization package. 2014. [Google Scholar]
  • 42.I. Gurobi Optimization. Gurobi optimizer reference manual. 2014 [Online]. Available: http://www.gurobi.com.
  • 43.Özarslan E, Shepherd TM, Vemuri BC, Blackband SJ, Mareci TH. Resolution of complex tissue microarchitecture using the diffusion orientation transform (DOT) NeuroImage. 2006;31:1086–1103. doi: 10.1016/j.neuroimage.2006.01.024. [DOI] [PubMed] [Google Scholar]
  • 44.Liu C, Bammer R, Acar B, Moseley ME. Characterizing Non-Gaussian Diffusion by Using Generalized Diffusion Tensors. Magnetic Resonance In Medicine. 2004;51:925–937. doi: 10.1002/mrm.20071. [DOI] [PubMed] [Google Scholar]
  • 45.Reisert M, Kellner E, Kiselev VG. About the geometry of asymmetric fiber orientation distributions. IEEE transactions on medical imaging. 2012;31(6):1240–1249. doi: 10.1109/TMI.2012.2187916. [DOI] [PubMed] [Google Scholar]
  • 46.Irfanoglu MO, Modi P, Nayak A, Hutchinson EB, Sarlls J, Pierpaoli C. DR-BUDDI (Diffeomorphic Registration for Blip-Up blip-Down Diffusion Imaging) method for correcting echo planar imaging distortions. NeuroImage. 2015;106:284–299. doi: 10.1016/j.neuroimage.2014.11.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Graham MS, Drobnjak I, Zhang H. Realistic simulation of artefacts in diffusion MRI for validating post-processing correction techniques. NeuroImage. 2016;125:1079–1094. doi: 10.1016/j.neuroimage.2015.11.006. [DOI] [PubMed] [Google Scholar]
  • 48.Alexander DC, Pierpaoli C, Basser PJ, Gee JC. Spatial Transformations of Diffusion Tensor Magnetic Resonance Images. IEEE Transactions On Medical Imaging. 2001;20:1131–1139. doi: 10.1109/42.963816. [DOI] [PubMed] [Google Scholar]
  • 49.Leemans A, Jones DK. The B-matrix must be rotated when correcting for subject motion in DTI data. Magnetic resonance in medicine. 2009;61(6):1336–1349. doi: 10.1002/mrm.21890. [DOI] [PubMed] [Google Scholar]
  • 50.Sjölund J, Szczepankiewicz F, Nilsson M, Topgaard D, Westin CF, Knutsson H. Constrained optimization of gradient waveforms for generalized diffusion encoding. Journal of Magnetic Resonance. 2015;261:157–168. doi: 10.1016/j.jmr.2015.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Hutter J, Slator PJ, Price AN, Gomes ADS, McCabe L, Kuklisova MM, Aljabar P, Rutherford M, Hajnal JV. Diffusion imaging with intra volume interleaving of b values. ISMRM; 2017; 2017. [Google Scholar]

RESOURCES