Abstract
A fast full-wave numerical approach was developed for simulating high-field multi-channel radio-frequency (RF) receive coil arrays in magnetic resonance imaging (MRI). To improve the efficiency, the impedance matrix was compressed by a multilevel Adaptive Cross Approximation (ACA) method. Furthermore, careful organization of multiple coil simulations was applied so that the impedance matrix associated with biological subjects is constructed and pre-conditioned only once. Numerical examples demonstrate the efficacy of the proposed approach for RF coil simulations.
1. Introduction
Multi-channel RF receive arrays have become the method of choice for signal reception in high-field (≥ 3 Tesla) MRI (Roemer et al 1990, Pruessmann et al 1999). The signal-to-noise ratio (SNR) is improved by using small elements of higher sensitivity. The coverage of a large coil is retained by optimally combining the sensitivity profile of small coils. The performance of RF coil arrays depends on both coil/subject and inter-element electromagnetic interactions (electric and magnetic coupling). Coil/subject interactions determine the tuning and the sensitivity profile of each coil element. Coil coupling determines the noise correlation and therefore the combined sensitivity map.
Full-wave simulations are effective to study these electromagnetic interactions (Jin 1999). On the one hand, they enable performance improvement for a general population when designing coils. On the other hand, subject-specific simulations are often desired when interpreting high-field images. Thus one needs fast simulation speed to examine a large number of different coils with a fixed generic subject model, or to simulate a fixed coil array with a subject-specific model.
In the past, several full-wave methods have been applied to RF coil simulations. These include the finite-difference time-domain (FDTD) method (Taflove and Hagness 2005, Zhao et al 2002), the finite-element method (FEM)(Silvester and Ferrari 1996, Petropoulos et al 1993), and the integral-equation (IE) methods (Chew et al 2000, Wang and Duyn 2006). The FDTD is simple to implement but lacks geometric modeling accuracy of RF coils. The FEM is superior at geometric modeling, but is computationally expensive and requires non-trivial three-dimensional (3D) unstructured mesh generation (George 1991). The IE method has comparable geometrical modeling capability as the FEM. It is more accurate (no dispersion errors as in the FDTD and the FEM) and does not require high-quality 3D meshes. Hybrid approaches that combine the advantages of different methods have also been proposed (Wang and Duyn 2008). However, to the authors’ best knowledge, no methods have been developed specifically for improving the speed of simulating RF coil arrays.
In this article, we present a fast full-wave simulation strategy based on the surface integral-equation (SIE) method (Chew et al 2000, Wang and Duyn 2006). Efficiency is improved by applying two approaches. First of all, a multilevel ACA method was applied to compress the impedance matrix (Bebendorf 2000, Zhao et al 2005). This results in less floating-point operations (FLOPs) when iteratively solving a linear system of equations (Saad 2003). Secondly, simulations of multiple coil elements were organized so that the impedance matrix representing biological subjects is constructed only once. This reduces the CPU time because matrix fill-in often takes longer than iterative solving the matrix. Results of a 7 Tesla 32-channel array demonstrates a 6.6-fold speed-up with 1.68% decrease of numerical accuracy.
2. Surface Integral-Equation Method
The SIE method in this article applies the Poggio-Miller-Chang-Harrington-Wu-Tsai (PMCHWT) formulation to model dielectric bodies and the Electrical Field Integral Equation (EFIE) to model conductors (Chew et al 2000, Wang and Duyn 2006). For simplicity, let us consider one dielectric body. The computational domain is divided into interior and exterior regions . In each region, the total field is the superimposition of the incident and the scattering fields.
| (1) |
| (2) |
The scattering field generated by surface electric current and magnetic current can be written as
| (3) |
| (4) |
According to surface equivalent theorem (Jackson 1999), and are equal to those generated by electric and magnetic currents on the interior phantom surface, i.e., and , in an infinite homogeneous region filled with and . The same applies to and , which are equal to those generated by electric and magnetic currents on the exterior phantom surface, i.e., and , and conduction currents on coil surface in an infinite homogeneous region filled with and . In each medium, operators and are written as
| (5) |
| (6) |
where can be either or , and is Green’s function
| (7) |
In the above, and we adopted the convention.
In the PMCHWT formulation, the interior field is coupled to the the exterior field by enforcing field continuity on phantom surfaces, i.e., and . The coupled problem is solved by applying Galerkin’s method with a set of basis and testing functions. This yields the following so-called impedance matrix
| (8) |
where and are unknown electric and magnetic currents on phantom surface respectively. Each sub-matrix is given by
| (9) |
| (10) |
| (11) |
| (12) |
| (13) |
| (14) |
One of the advantages of the SIE method is that only surfaces are meshed. Compared to volume methods such as the FEM, this results in much fewer unknowns to be solved. However, the impedance matrix in Eq. (8) is dense because it represents mutual interaction between currents. The computational costs of storing and inverting dense matrices are typically one polynomial order higher than sparse matrices in the FEM (Golub and Van Loan 1996, Saad 2003). To reduce memory cost, one approach is to apply symmetry. Although the global impedance matrix is asymmetric, a certain part of the impedance matrix is either symmetric or skew-symmetric as denoted in Eq. (8). Furthermore, and are all symmetric sub-blocks. In most RF coil simulations, the impedance matrix is mainly associated with biological subjects. In this scenario, about of the entire impedance matrix needs actual storage.
3. Fast Simulation Scheme
3.1. The Multilevel ACA Method
The multilevel ACA method is a way to reduce both memory and CPU time in iterative matrix solvers (Bebendorf 2000, Zhao et al 2005). In order to understand the principle, we first take a look at an example of the so-called outer-product format (Golub and Van Loan 1996)
| (15) |
The N-by-N matrix can be written exactly as the product of a N-by-1 matrix (or a vector) and a 1-by-N matrix (or the transpose of a vector) . This equivalent format favors the computation of matrix-vector products, e.g., where is a N-by-1 vector, which are the dominant operation in an iterative matrix solver (Saad 2003). Using the conventional matrix format of , it requires storage units and FLOPs for one matrix-vector multiplication. On the contrary, the equivalent outer-product format only requires storage units and one vector inner product and one scalar-vector product, a total of FLOPs for one matrix-vector multiplication. When is large, the outer-product format reduces computational costs by one polynomial order. For this reason, the outer-product format can be viewed as matrix compression.
The rank of the matrix in Eq.(15) is one according to definition. In general, a matrix can be written as the sum of sets of outer products, i.e.,
| (16) |
To determine the rank of , one can apply the singular value decomposition (SVD), column-pivoted QR or column-pivoted LU (Golub and Van Loan 1996). The actual computational efficiency of matrix compression via outer-products depends on rank . Outer-products are advantageous in matrix-vector multiplications if .
The ACA method is an on-the-fly approach to compress the impedance matrix of the SIE method via outer-products. Before introducing the ACA, one needs to make sure that the impedance matrix can be compressed. According to Eq. (7), the magnitude of Green’s function varies by . The spatial profile is smooth in far-field and can be interpolated by low-order polynomials. This interpolatability indicates a degenerated degree of linear independence in the part of the impedance matrix that represents far-field mutual interactions (far-field blocks). As rank serves as a measure of the degree of linear independence, far-field blocks are rank-deficient and can be compressed via outer-products. Or in other words, compression should only be selectively applied to far-field blocks (Rokhlin 1985).
In the ACA method, identification of far-field interactions starts from dividing a computational domain into multiple sub-domains. This is illustrated in Fig. 1. If the radii of the circumscribed spheres of two sub-domains and the geometrical center-to-center distance satisfy the following criteria (Bebendorf 2000)
| (17) |
where is the so-called admissible ratio, the mutual interactions between the two sub-domains are determined to be weak and the associated sub-matrix should be compressed.
Figure 1.

The 32-channel receive coil array (a) and the sub-domains of the decomposed head model (b).
Matrix compression can take place once the entire impedance matrix is known. Alternatively, a more CPU-efficient approach can be derived from the column-pivoted Crout algorithm. The Crout algorithm is a variant of the LU decomposition but with a specific memory accessing sequence. It decomposes a matrix on-the-fly and has been used in out-of-the-core matrix decomposition (Duff et al 1986). Upon finishing the LU decomposition, one matrix is factorized into the product of two matrices, which resembles the format in Eq. (16). Meanwhile, column-pivoted LU can be used to determine matrix ranks (Golub and Van Loan 1996). Thus column-pivoted Crout is a rank-revealing on-the-fly matrix factorization method that we need.
The ACA method is essentially the column-pivoted Crout algorithm without actually performing pivoting‡. The detail of the ACA algorithm is listed here for completeness (Bebendorf 2000)
The ACA algorithm
In the context of electromagnetic simulations, ACA can be physically interpreted as an alternating and iterative way of finding the most significant transmitters and receivers in two groups of antennas (Zhao et al 2005). The strongest receiver is first selected after examining the interactions between an arbitrary transmitter and all receivers. By reciprocity, the strongest receiver then transmits and the strongest transmitter is determined by examining the interactions. Then the next strongest receiver and transmitters are found similarly and the algorithm proceeds.
According to antenna theory, the minimum rank of far-field interactions is two, which correspond to the two orthogonal linear polarizations in far-field (Heldring et al 2007). The ACA determines the actual rank by examining the convergency of the Frobenius norm (Golub and Van Loan 1996),
| (18) |
where denotes a singular value. If the relative change of Frobenius norm converges to a prescribed tolerance , matrix compression stops. Thus indicates the amount of uncertainty introduced by the ACA to the impedance matrix. As shown in (Wang and Duyn 2010), also indicates the additional solution error introduced by matrix compression. Thus it provides a practical tool to control simulation errors.
The compressed (in Eq. (8)) of the impedance matrix associated with the human head (in Fig. 1) is illustrated in Fig. 2, where dark blocks represent incompressible interactions and light-colored blocks represent compressed ones with the associated rank denoted inside. Instead of compressing far-field interactions between individual sub-domains, one can combine sub-domains into larger groups and reduce the total number of compression required. This is a multilevel scheme and its detail can be found in (Wang and Duyn 2010). The resulting rank map of this multilevel ACA is also shown in Fig. 2, where compression on a larger scale can be identified clearly.
Figure 2.

Rank map of the single-level ACA method (a) and the multilevel ACA method (b).
3.2. Multi-Coil Simulations
When simulating different RF coil elements, the phantom does not change its shape and constitution. Only coils and coil/subject interactions change. Thus we can generate matrix blocks and (in Eq. (8)) that are associate with subjects once and apply them repeatedly. When simulating a single element, the CPU time of matrix fill-in is comparable or even longer than that of solving the matrix iteratively (see examples below). Therefore, this scheme reduces the CPU time of multiple coil simulations by a factor of two or more in practice.
4. Numerical Example
We simulated a 32-channel receive coil array for human brain proton imaging at 7.0 Tesla (300 MHz). This array was designed in cooperation with, and built by, Nova Medical Inc. (Wilmington, MA, USA). It comprises 32 small loop coils modeled by quadrilateral elements as shown in Fig. 1. On average, between 20 to 30 roof-top basis functions were applied to model each element (Chew et al 2000). The head model was adapted from the specific anthropomorphic mannequin (SAM) model, which was published by the IEEE standard 1528-2003, by scaling it in the two transversal directions according to the size of the actual subject being scanned. The model has a relative permittivity of and a conductivity of . These values correspond to those of the white matter at 300 MHz (Gabriel 1996).
Both the combined SNR map and the geometric noise amplification factor (g-Factor) were simulated (Pruessmann et al 1999). This required 32 simulations to calculate the profiles and the noise correlation matrix (Wang and Duyn 2006). The generalized minimal residual method (GMRES) method with a successive over relaxation (SOR) pre-conditioner was applied (Saad 2003). In order to construct the pre-conditioner, a sparse representation of the impedance matrix was extracted at first by retaining all elements in a row that are larger than 0.5% of the largest element of that row. The stopping criteria of the GMRES method was 0.001. In the ACA method, was 0.01. The computer program was developed in with double precision numbers. Simulations were performed on a 3.0 GHz AMD Opteron 256 processor.
4.1. Evaluation
The proposed approach was evaluated at first by comparing its results with the conventional approach without matrix compression (Wang and Duyn 2006). The SNR map was calculated by combining the simulated profiles with a noise correlation matrix obtained by adding coil noise to simulated noise correlation. The amount of noise added corresponded to a measured 2:1 unloaded-to-loaded Q ratio. Note that if one knows the detail of the electronics on coils, accurate simulation of factors can be performed (Wang et al 2007).
Table I lists the computational costs of simulating one element by using different approaches. Both matrix fill-in and iterative solution time were reduced when applying the multilevel ACA method. Meanwhile, memory consumption is reduced by a factor of 2.6. Note that matrix symmetry as mentioned in Section 2 was applied in the conventional approach as well. Otherwise the memory cost of the conventional approach will be 2.67 times higher.
Table 1.
Comparison of computational costs, where is fill-in time, is pre-conditioning time and is the total iterative solution time.
| Memory (MB) | ||||
|---|---|---|---|---|
| Conventional approach | 14.8 | 1.7 | 16 | 296 |
| Single-level ACA | 10 | 2.7 | 8.1 | 138 |
| Multi-level ACA | 8.6 | 2.5 | 6 | 111 |
On the other hand, matrix pre-conditioning time increased by a factor of 1.5. This is because each row of the impedance matrix needs to be reconstructed in the multilevel ACA method for pre-conditioning. Since the impedance matrix of the phantom is generated and pre-conditioned just once, this additional CPU overhead is expected to have an insignificant impact.
The simulated combined SNR maps are compared in Fig. 3. As can be seen, the results are nearly identical. Accuracy was further evaluated by using a 2-norm error defined by
| (19) |
where the sum was taken over all pixels. As shown in Table II, the relative error introduced by the multilevel ACA method is 1.68%. This corresponds well to the ACA tolerance of , which indicates 1% uncertainty in the compressed impedance matrix.
Figure 3.

Combined SNR maps computed by conventional approach (a) and the proposed approach (b).
Table 2.
Comparison of the total simulation time of 32 elements and the relative error.
| Total CPU (s) | ||
|---|---|---|
| Conventional approach | 1270 | (N/A) |
| Single-level ACA | 238 | 1.61% |
| Multi-level ACA | 192 | 1.68% |
Table II also compares the total CPU time of simulating all 32 coil elements. The multilevel ACA finished in 192 seconds, or a 6.6-fold speed-up. Note that different coil elements may require different number of iterations of the GMRES solver (between 20 to 39 in this example). Typically, if an element is closer to the phantom, the coil/subject interaction is stronger and more iterations are expected.
We also mention that compared to the FDTD method, the conventional approach is already two orders of magnitude faster if similar modeling accuracy is required (Wang and Duyn 2006). The inefficiency of the FDTD is mainly due to its inferior geometry modeling capability of RF coils.
4.2. Experiment Verification
Experiments were performed on a General Electric (Waukesha, WI, USA) 7.0 Tesla whole body scanner. Axial multi-slice data were acquired using a gradient-echo sequence with a field-of-view (FOV) and 2-mm spatial resolution. The repetition time (TR) is 3.0 s and the echo time (TE) is 9.6 ms. The thickness and spacing of 128 axial slices were 1 mm. Image reconstruction and computation of the noise amplification factor (g-Factor) maps (for rate 2, 3 and 4 in the left-right direction) were performed as described in (deZwart et al 2002). The brain mask used to compute the coil sensitivity maps and the g-Factor maps were extrapolated results over 4 mm. Furthermore, the FOV was truncated to to narrowly encompass the head.
Results on a single axial slice 8-cm below the top of the head was used to compare simulation and measured results. Since g-Factor maps are sensitive to masks and the definition of the FOV, the brain mask from the acquired data was used to compute the simulated g-Factor maps as well. FOV of the simulated data was also reduced to the same . The simulated and measured g-Factor maps at acceleration rate four are shown in Figs. 4, where same gray scale ranging from 1.0 to 3.0 was applied. A close match of simulation and measured results is observed. Further comparison of the average g-Factors results on the same slice are listed in Table III, where it shows a maximum difference of about 5.5%. This discrepancy may result from the approximate head model, residual coil mutual coupling in experiment and other experiential conditions not considered in simulations.
Figure 4.

The simulated combined SNR map and the masking of simulation results with the brain profile (a), the simulated (b) and the measured (c) g-Factor maps at acceleration rate four.
Table 3.
Comparison of average g-Factors at different acceleration rates.
| Acceleration rate | 2 | 3 | 4 |
|---|---|---|---|
| Simulated g-Factor | 1.01 | 1.14 | 1.73 |
| Measured g-Factor | 1.02 | 1.20 | 1.83 |
5. Conclusions and Discussions
We presented a full-wave approach for fast simulations of high-field MRI receive coil arrays. In order to improve efficiency, the impedance matrix was compressed by applying a multilevel ACA method. Furthermore, multiple coil simulations were organized so that the impedance matrix associated with biological subjects is constructed only once. Numerical examples demonstrated the validity and the effectiveness of the proposed approach. It was shown that on a single CPU, 32 full-wave simulations of a human head coil array finished within a few minutes. As demonstrated separately in (Wang et al 2010), this method provides online simulation results with realistic subject models generated by an in-house image processing program.
Though examples in this article show that a single-medium phantom yields good agreement between simulation and experimental results of receive coil arrays, inhomogeneous phantoms are useful in certain applications, especially in the prediction of the Specific Absorption Rate (SAR) of transmit coils. Incorporating tissue inhomogeneity can be accomplished in several ways. For instance, the surface of each organ can be meshed and the proposed SIE method can be carried out straightforwardly. Alternatively, different types of tissues can be modeled by finite-difference or finite-element methods, and the proposed SIE method provides a boundary condition such as the one in the finite-element boundary-integral (FEBI) method (Chew et al 2000). The first approach has been implemented and verified by the electromagnetic scattering of a double-layer sphere. We plan to extend the proposed method to simulate inhomogeneous phantoms in the future.
As shown in (Wang and Duyn 2010), the computational complexity of the multilevel ACA method applied in this article scales by . Several approaches can be applied to further improve simulation speed when the number of unknowns becomes very large, e.g., modeling the whole body. For instance, advanced matrix solvers such as the one proposed in (Lucente et al 2008) can be applied to reduce matrix inversion time. On the other hand, the application of multi-processor, multi-core or Graphic Processing Unit (GPU) is expected to reduce simulation time significantly. As demonstrated in (Lezar and Davidson 2010), the application of GPUs can increase the speed by an order of magnitude in finite-element simulations. Since the ACA method divides computational domains into packets, it is parallelization-friendly. We focus on algorithm development in this article and the hardware implementation on different parallel platforms will be studied in the future.
Finally, we noticed the actual number of iterations depends on the relative position of RF coils with respect to phantom. Thus developing robust matrix inversion techniques, which are less sensitive to the aforementioned factor and other potential performance-degrading factors, such as coil tuning and the Q-factor, is important when considering a broad spectrum of RF coils. This topic will ensue in our future research as well.
Acknowledgments
This work was supported in part by the NINDS intramural research program. The authors gratefully acknowledge Dr. Alan P. Koretsky for insightful discussions.
Footnotes
The results are not pure lower and upper triangular matrices. There are scattered zero entries
References
- Bebendorf M 2000. Approximation of boundary element matrices. Numer. Math 86 565–589 [Google Scholar]
- Chew WC, Jin JM, Michielssen E and Song JM eds 2000. Fast and efficient algorithms in computational electromagnetics (Reading, MA: Artech House Publishers; ) [Google Scholar]
- Duff IS, Erisman AM, and Reid JK 1986. Direct methods for sparse matrices (Reading, Oxford: Oxford Science Publications; ) [Google Scholar]
- de Zwart JA, Ledden PJ, Kellman P, van Gelderen P and Duyn JH 2002. Design of a SENSE optimized high sensitivity MRI receive coil for brain imaging. Magn. Reson. Med 47 1218–1227 [DOI] [PubMed] [Google Scholar]
- Gabriel C 1996. Compilation of the dielectric properties of body tissues at RF and microwave frequencies. (Technical Report: Brooks Air Force: AL/OE-TR-1996-0037; ) [Google Scholar]
- George PL 1991. Automatic mesh generation: application to finite element methods. (Readings: John Wiley & Sons; ) [Google Scholar]
- Golub GH and Van Loan CF 1996. Matrix computations (3rd Ed.) (Reading, Baltimore: Johns Hopkins University Press; ) [Google Scholar]
- Heldring A, Rius JM, Tamayo JM, Parrón J and Úbeda E 2007. Fast direct solution of method of moments linear system IEEE Trans. Antennas and Propagat 55 3220–3228 [Google Scholar]
- Jackson JD 1999. Classical electrodynamics, (Reading, New York: John Wiley & Sons; ) [Google Scholar]
- Jin JM 1999. Electromagnetic Analysis and Design in Magnetic Resonance Imaging (Reading, FL: CRC Press; ) [Google Scholar]
- Lezar E and Davidson D 2010 An investigation of GPU-based acceleration for finite element waveguide eigenanalysis. The 10th International Workshop on Finite Elements for Microwave Engineering, Meredith, New Hampshire, Oct 2010 [Google Scholar]
- Lucente E, Monorchio A and Mittra R 2008. An iteration-free MoM approach based on excitation independent characteristic basis functions for solving large multiscale electromagnetic scattering problems. IEEE Trans. Antennas and Propagation 56 999–1007 [Google Scholar]
- Petropoulos LS, Haacke EM, Brown RW and Boerner E 1993. Predicting RF field penetration in heterogeneous bodies using a 3-D finite element approach: preliminary results. Magn. Reson. Med 30 366–372 [DOI] [PubMed] [Google Scholar]
- Pruessmann KP, Weiger M, Scheidegger MB and Boesiger P 1999. SENSE: sensitivity encoding for fast MRI. Magn. Reson. Imag 42 952–962 [PubMed] [Google Scholar]
- Rokhlin V 1985. Rapid solution of integral equations of classic potential theory. J. Comput. Phys 60 187–207 [Google Scholar]
- Roemer PB, Edelstein WA, Hayes CE, Souza SP and Mueller OM 1990. The NMR phased array. Magn. Reson. Imag 16 192–225. [DOI] [PubMed] [Google Scholar]
- Saad Y 2003. Iterative methods for sparse linear systems (Reading, PA: SIAM; ) [Google Scholar]
- Silvester PP and Ferrari RL 1996. Finite elements for electrical engineers (3rd edition) (Reading, UK: Cambridge University Press; ) [Google Scholar]
- Taflove A and Hagness SC 2005. Computational electrodynamics: the finite-Difference time-domain method, 3rd ed. (Reading, Boston: Artech House; ) [Google Scholar]
- Wang S, Duyn JH 2006. Analysing radio-frequency coil arrays in high-field magnetic resonance imaging by the combined field integral equation method. Phys. Med. Biol 51 3211–3229 [DOI] [PubMed] [Google Scholar]
- Wang S, Dodd S and Duyn JH 2007 Full-wave evaluations of loading factors of small animal RF receive coils. 15th Scientific Meeting and Exhibition, International Society for Magnetic Resonance in Medicine, Berlin, Germany, May 2007 [Google Scholar]
- Wang S, Duyn JH 2008. Time-domain finite-difference/finite-element hybrid simulations of radio frequency coils in magnetic resonance imaging. Phys. Med. Biol 53 2677–2692 [DOI] [PubMed] [Google Scholar]
- Wang S, deZwart JA and Duyn JH 2010 Subject-specific evaluation of multi-channel receive coil arrays by fast integral-equation method. 18th Scientific Meeting and Exhibition, International Society for Magnetic Resonance in Medicine, Toronto, Canada, May 2010 [Google Scholar]
- Wang S, Duyn JH 2010. A topology-based multilevel fast integral-equation method. Microwave and optical technology letters 52 2195–2198 [Google Scholar]
- Zhao H, Crozier S and Liu F 2002. Finite difference time domain (FDTD) method for modeling the effect of switched gradients on the human body in MRI, Magn. Reson. Med 48 1037–1042 [DOI] [PubMed] [Google Scholar]
- Zhao K, Vouvakis MN and Lee J-F 2005. The adaptive cross approximation algorithm for accelerated method of moments computations of EMC problems. IEEE Trans. Electromagnetic Compatibility 47 763–773 [Google Scholar]
