Abstract
Quantitative susceptibility mapping (QSM) is a powerful MRI technique that has shown great potential in quantifying tissue susceptibility in numerous neurological disorders. However, the intrinsic ill-posed dipole inversion problem greatly affects the accuracy of the susceptibility map. We propose QSMGAN: a 3D deep convolutional neural network approach based on a 3D U-Net architecture with increased receptive field of the input phase compared to the output and further refined the network using the WGAN with gradient penalty training strategy. Our method generates accurate QSM maps from single orientation phase maps efficiently and performs significantly better than traditional non-learning-based dipole inversion algorithms. The generalization capability was verified by applying the algorithm to an unseen pathology--brain tumor patients with radiation-induced cerebral microbleeds.
Keywords: Magnetic Resonance Imaging, Quantitative Susceptibility Mapping, Dipole Field Inversion, Deep Convolutional Neural Networks, Generative Adversarial Networks, Cerebral Microbleeds
1. Introduction
Quantitative susceptibility mapping (QSM) is a recent phase-based quantitative magnetic resonance imaging (MRI) technique that enables in vivo quantification of magnetic susceptibility, a tissue parameter that is altered in a variety of neurological disorders (Liu et al., 2015; Wang et al., 2017). QSM has been shown to quantify changes in vascular injury, such as the formation of cerebral microbleeds (CMBs) over time, hemorrhage, and stroke (Klohs et al., 2011; Liu et al., 2012). Iron deposition in the deep gray matter due to aging or disease can also be investigated using QSM (Li et al., 2014). In neurodegenerative diseases such as Parkinson’s disease (N. He et al., 2015), Alzheimer’s disease (Acosta-Cabronero et al., 2013) and Huntington’s disease (Van Bergen et al., 2016), QSM can quantify the paramagnetic iron deposition related to disease progression and potentially generate biomarkers for diagnosing and managing neurodegenerative disease patients.
Although QSM has been demonstrated to have great potential in both research studies and clinical practice, accurate and reproducible quantification of tissue susceptibility requires multiple steps of careful data processing including phase reconstruction, coil combination (for multi-channel coils) (Hammond et al., 2007), multi-echo phase combination (for multi-echo sequences) (Eckstein et al., 2017), background phase removal (Li et al., 2015b; Liu et al., 2011a; Sun and Wilman, 2014) and phase-susceptibility dipole inversion (Li et al., 2015a; Liu et al., 2011b, 2009). Among them, the dipole inversion step is considered the most difficult because it is intrinsically an ill-posed inverse problem (Deistung et al., 2016). This is due to the representation of the relationship between magnetic field perturbation and susceptibility distribution as a convolution, which can be more efficiently calculated as a point-wise multiplication in frequency space except along the conical surface where zero values result in missing data or noise amplification when solving for the inverse. To overcome this issue, missing data can be recovered by acquiring at least three scans with different relative orientations of the volume-of-interest in the main magnetic field, as with Calculation Of Susceptibility through Multiple Orientation Sampling (COSMOS, Liu et al., 2009). This requires a subject to change head orientation between repeated scans, which has several disadvantages that significantly limit its application in practice: 1) the scan time is prolonged since multiple repeated scans are required, increasing both the cost of QSM and the risk of motion artifact; 2) co-registration of the different orientation images are required, which both increases processing time and introduces errors due to misalignment; and 3) modern high-field head coils are usually configured to be very close to the subject’s head for higher sensitivity, limiting the ability to rotate one’s head and degrading the quality of the QSM calculation. As a result, this approach is usually impractical for patient studies despite its superior potential to alleviate the ill-posed dipole inversion.
Over the past decade, many approaches have been developed to address the ill-posed inverse problem. Thresholded K-space Division (TKD) simply thresholds the dipole kernel to a predetermined non-zero value to avoid dividing by zero (Shmueli et al., 2009). Morphology Enabled Dipole Inversion (MEDI) regularizes the ill-posed inversion problem by imposing edge preservation constraints derived from magnitude images (Liu et al., 2011b). Compressed Sensing Compensated inversion (CSC) exploits the fact that missing k-space data satisfies the compressed sensing requirement and applies a sparse L1 norm to regularize the problem (Wu et al., 2012). Quantitative Susceptibility Mapping by Inversion of a Perturbation Field Model (QSIP) approaches the problem by inversion of a perturbation model and makes use of a tissue/air susceptibility atlas (Poynton et al., 2015). These traditional methods suffer from three major limitations: 1) they either suffer from significant streaking artifacts or require careful hyperparameter tuning; 2) can be iterative and therefore take minutes to hours to compute; 3) they result in vastly different susceptibility quantifications, limiting reproducibility and making it difficult to compare studies that use different algorithms.
Recently, Deep Convolutional Neural Networks (DCNNs) have shown great potential in computer vision tasks such as image classification (K. He et al., 2015), semantic segmentation (Long et al., 2014) and object detection (Ren et al., 2015). Among various deep neural network architectures, U-Net (Ronneberger et al., 2015) has become the most popular backbone for many medical image-related problems (Gong et al., 2018; Kleesiek et al., 2016; Zbontar et al., 2018) due to its effectiveness and universality. Bollmann et al. (Bollmann et al., 2019) and Yoon et al. (Yoon et al., 2018) adopted the U-Net structure and extended it to 3D to solve the dipole inversion problem of QSM by training the network to learn the inversion using patches of various sizes as the input. Since its inception in 2014, Generative Adversarial Networks (GANs) (Goodfellow et al., 2014) have been incorporated into CNNs to further improve performance of segmentation, classification, and especially contrast generation tasks (Arjovsky et al., 2017; Hammernik et al., 2018; Nie et al., 2018; Radford et al., 2015; Yang et al., 2018; Zhu et al., 2017) by combining a generator that is trained to generate more realistic and accurate images with a discriminator that is trained to distinguish the real from the generated images. This idea of adversarial learning has recently been extended to applications in medical imaging (Mardani et al., 2019; Schlemper et al., 2018; Seitzer et al., 2018; Zhu et al., 2019a, 2019b). The goals of this study were to for the first time: 1) incorporate the physical principles of the dipole inversion model that describes the susceptibility-phase relationship into the training of a deep neural network to generate QSM images and 2) harness the power of adversarial learning in this new application. We achieved these aims by: 1) modifying the structure of the 3D U-Net proposed by Bollmann et al. and Yoon et al. (Bollmann et al., 2019 and Yoon et al., 2018) to incorporate an increased receptive field of the input phase image patches in conjunction with a cropping of resulting output in order to emulate the dipole physics within the structure of the model; and 2) by utilizing a GAN to regularize the model training process and further improve the accuracy of QSM dipole inversion.
2. Materials and Methods
2.1. Theory of QSM dipole inversion and GANs
Assuming that the susceptibility-induced magnetization is regarded as a magnetic dipole and the orientation of the main magnetic field B0 is defined as the z-axis in the imaging Cartesian coordinate, the magnetic field perturbation and susceptibility distribution is related by a convolution, which can be efficiently calculated by a point-wise multiplication in frequency space (Liu et al., 2015).
(1) |
Where ΔBz is the local field perturbation, B0 is the main magnetic field, χ represents the tissue susceptibility, k is the frequency space vector and kz is the z-component. In practice, we measure ΔBz by phase variation and solve the inverse problem for the susceptibility distribution χ. However, notice that when , the bracket term on the right-hand side becomes close to zero, which causes missing measurements or noise amplification when solving the inverse problem, making it ill-posed.
Assume y(d) = ΔBz(d) is the acquired tissue phase of the subject and x(d) is the susceptibility map of the subject we want to solve in the ill-posed phase-susceptibility dipole inversion problem, and function f represents the relationship between them, then we can simplify equation (1) with:
(2) |
To solve the dipole inversion problem, we are finding a function h that gives:
(3) |
where is an estimate of the true susceptibility map x. The idea of GANs is to define a game between two competing components (networks): the discriminator (D) and the generator (G). G takes an input and generates a sample that D receives and tries to distinguish from a real sample. The goal of G is to “fool” D by generating more realistic samples. In this case, we use G as the function h:
(4) |
The adversarial game between G and D is a minimax objective:
(5) |
where is the distribution of true susceptibility maps and is the distribution of tissue phases. To stabilize the training process, we adopt the method of Wasserstein GAN (WGAN) (Arjovsky et al., 2017), and the value function for WGAN is:
(6) |
where is the set of 1-Lipschitz functions, which can be enforced by adding a gradient penalty (GP) term to the value function (Gulrajani et al., 2017):
(7) |
where λgp is a parameter that controls the weight of the gradient penalty. Since the goal for G in this task is to recover/reconstruct QSM from a certain input tissue phase, we also included an L1 loss as content loss in the objective function of G:
(8) |
where Ladv is the adversarial loss indicated in equation (6).
2.2. QSMGAN framework
We designed a 3D U-Net architecture similar to Bollmann et al., 2019 and Yoon et al., 2018 as the generator part of the QSMGAN framework as shown in Figure 1. In each U-Net block, there are two 3x3x3 Conv3d-BatchNorm-LeakyReLU (negative slope of 0.2) layers, where Conv3d is the commonly used 3D convolution layer, the BatchNorm (batch normalization) accelerates and stabilizes the optimization and the LeakyReLU facilitates the training of the GAN. 3D average pooling was used to down-sample the image patch as proposed in the classic U-Net architecture, while 3D transpose convolution was applied to restore the resolution in the up-sampling path while incorporating the low frequency information back in the model. At the end of the generator, we applied a cropping layer to focus the training on only the center part of the patch. For the discriminator part of the QSMGAN, we designed a 3D patch-based convolutional neural network where each block of the network is composed of a 3D convolution (4x4x4 kernel size and stride 2) and a LeakyReLU (negative slope of 0.2). The four blocks in the network lower the input patch to 1/16 of the original size and the 3D convolution layer at the end converts the resulting patch to a binary output corresponding to the prediction of real and fake QSM patches.
2.3. Subjects and data acquisition
Eight healthy volunteers (average age 28, M/F=3/5) were recruited for this study as the training and validation dataset for QSMGAN. All volunteers were scanned with a 3D multi-echo gradient-recalled sequence (4 echoes, TE= 6/9.5/13/16.5ms, TR=50ms, FA=20°, bandwidth=50kHz, 0.8mm isotropic resolution, FOV=24x24x15cm) using a 32-channel phase-array coil on a 7T MRI scanner (GE Healthcare Technologies, Milwaukee, WI, USA). The sequence was repeated three times on each volunteer with different head orientations (normal position, tilted forward and tilted left) to acquire data for COSMOS reconstruction. GRAPPA-based parallel imaging (Beatty et al., 2014) with an acceleration factor of 3 and 16 auto-calibration lines were also adopted to reduce the scan time of each orientation to about 17 minutes.
To evaluate the generalization ability of our networks, we used a cohort of 12 patients with brain tumors who had developed CMBs years after being treated with radiation therapy. This type of vascular injury was an ideal pathology to test the generalizability of our network because they can both be extremely small in size and difficult to detect, and have very high susceptibility values compared to normal brain tissue due to deposits of hemosiderin. These patients were scanned using the same 7T QSM protocol as the healthy volunteers subjects except the slice thickness was 1.0mm. Only one orientation scan was performed on each patient. After the GRAPPA reconstruction, the image volumes were resampled to 0.8mm isotropic resolution to match the input of the deep learning models.
2.4. QSM data processing and dataset preparation
The raw k-space data were retrieved from the scanner and processed on a Linux workstation using in-house software developed in Matlab 2015b (Mathworks Inc., Natick, MA, USA). The following processing steps (summarized in Figure 2) were performed to obtain the tissue phase maps required for input to the QSMGAN and the calculation of the gold standard COSMOS-QSM which was used as the learning target of the QSMGAN: 1) GRAPPA reconstruction was applied to interpolate the missing k-space lines due to parallel imaging acceleration and channel-wise inverse Fourier transform was applied to obtain the coil magnitude and phase images; 2) coil images were combined to obtain robust echo magnitude and phase images using the MCPC-3D-S method (Eckstein et al., 2017); 3) raw phase was unwrapped using a Laplacian-based algorithm (Li et al., 2011); 4) FSL BET (Jenkinson et al., 2012) was applied on magnitude images from all echoes to obtain a composite brain mask from the intersection of each individual echo mask; 5) V-SHARP (Wu et al., 2012) was used to remove the background field phase to get the tissue phase map; 6) images from different orientations were co-registered using magnitude images with FSL FLIRT (Jenkinson et al., 2012); 7) the dipole field inversion was solved using the COSMOS algorithm (Liu et al., 2009). In addition, TKD (Shmueli et al., 2009), MEDI (Liu et al., 2011b) and iLSQR (Li et al., 2015a) QSM maps were also reconstructed from single orientation data for evaluation and comparison. A threshold of 0.15 was selected for the TKD algorithm, and λ=2000 was used in MEDI. The reconstructed single orientation tissue phase maps from the patient data were 1) used to compute iLSQR QSM and 2) fed into both the 3D U-Net and QSMGAN networks to generate COSMOS-like QSM.
2.5. Training and validation
The 8 healthy subjects were divided into 5 for training, 1 for validation, and 2 for testing. All three orientations were included in the dataset so the total number of scans in the training/validation/test set was 15/3/6. To build the training set, tissue phase and susceptibility patches were sampled by center coordinates with a gap of 8 voxels in all three spatial dimensions. Since background occupies most of the image volume, we sampled 90% patches from inside the brain and only 10% from the background to increase the efficiency of the training. For validation and testing, the input tissue phase volume was divided into non-overlapping patches according to the output patch size and the susceptibility map was reconstructed patch-wise by feeding the input tissue phase patch into the trained network. Figure 3 demonstrates the relationship between the receptive field and input/output patch size.
To assist the neural network training, we multiplied the input phase by a scale factor of 100 and then transformed the output by a scaled hyperbolic tangent operation to get the surrogate target :
This transform not only converts the range of the target susceptibility map to [−1, 1], which aids in the network training, but also results in a more Gaussian distributed histogram, helping the network learn values in different ranges (Figure 4).
As the baseline network, we first trained the U-Net based generator separately with the pairs of input and output patch sizes listed in Table 1. To train the generator, an Adam optimizer with a learning rate of 1e-4 was used and betas were set to (0.5, 0.999). The network was trained for 40,000 iterations with a batch size of 16 that was lowered to 8 for larger input patch sizes. L1 loss was used as the loss function for the baseline network.
Table 1.
3D U-Net Patch Size (input→output) | L1 error (1e-3) | PSNR | NMSE |
---|---|---|---|
32→32 | 1.490±0.184 | 42.25±1.01 | 0.302±0.056 |
48→32 | 1.403±0.204 | 43.07±1.22 | 0.252±0.063 |
64→32 | 1.316±0.230 | 43.39±1.37 | 0.237±0.072 |
96→32 | 1.319±0.216 | 43.38±1.32 | 0.237±0.068 |
48→48 | 1.424±0.195 | 42.58±1.13 | 0.281±0.061 |
64→48 | 1.309±0.210 | 43.53±1.31 | 0.229±0.065 |
96→48 | 1.310±0.212 | 43.37±1.28 | 0.237±0.067 |
128→48 | 1.311±0.215 | 43.40±1.31 | 0.236±0.068 |
64→64 | 1.389±0.211 | 42.87±1.21 | 0.264±0.063 |
96→64 | 1.316±0.207 | 43.46±1.28 | 0.233±0.066 |
128→64 | 1.322±0.211 | 43.32±1.27 | 0.240±0.067 |
To train the QSMGAN, we again started with the baseline network and then: 1) fixed the generator G and trained D for 20,000 iterations to ensure that D was well trained, as suggested by Gulrajani et al. (Gulrajani et al., 2017); and 2) trained G and D together for 40,000 iterations. During each iteration, D (the critic) was updated 5 times with the gradient penalty λgp = 100. Adam optimizers were used for both G and D and the learning rate was lowered to 1e-5. To balance the content loss and adversarial loss, λc was set to 1 and λadv to 0.01.
2.6. Evaluation metrics
To evaluate the quality of the predicted QSM map reconstructed by the network (), we calculated and compared the following metrics: 1) L1 error = ; 2) Peak Signal-to-Noise Ratio (PSNR) = , where R() computes the voxel value range of the input image and MSE() computes the mean squared error between the reconstructed image and the target image; 3) Normalized Mean Squared Error (NMSE)= ; 4) High-frequency error norm (HFEN); and 5) Structure similarity index (SSIM) as described in Langkammer et al., 2018. A Wilcoxon signed rank test was used to test for statistical significant differences in quality metrics between the optimized 3D U-Net and QSMGAN.
Radiation-induced CMBs from each patient were segmented on reconstructed susceptibility weighted images (SWI) using in-house software (Chen et al., 2018; Morrison et al., 2018). The resulting CMB masks were eroded by 1 voxel in all directions to remove the blooming artifact present on SWI and then applied to the iLSQR QSM, 3D U-Net and QSMGAN maps in order to quantify the median CMB susceptibility from the 3 different QSM images. The number of CMBs were also counted for each patient using each of the 3 QSM maps by an experienced rater after blinded randomization of the images. A Kruskall-Wallis test was used to test for significant differences in median CMB susceptibility and CMB count among the 3 QSM methods and Bland-Altman Plots were used to visualize any discrepancies.
3. Results
3.1. Baseline 3D U-Net
We experimented with combinations of three different input patch sizes (323, 483, 643) and 5 output patch sizes (323, 483, 643, 963, 1283, with input > output) for the baseline 3D U-Net. Figure 5 demonstrates the qualitative effects of different input-output size pairs (shown on axial slices) while Table 1 compares the quantitative metrics (L1, PSNR, NMSE) used to evaluate the quality of the resulting QSM maps. When the input patch size was the same as the output patch size, the inversion error increased towards the edge of the patch, resulting in visible discontinuities in a grid-like pattern in the reconstructed QSM map. The higher L1 error, lower PSNR and higher NMSE supports this phenomenon quantitatively. When we increased the input patch size and applied center cropping at the end of the U-Net as shown in Figure 3, the patch edge artifact decreased and the metrics improved. Among the different combinations of patch sizes, the input patch size of 643 and the output patch size of 483 (64→48) provided the best balance between sufficient accuracy of the U-Net dipole inversion and low computation burden/efficiency. Therefore, for the QSMGAN evaluation we used the 64→48 3D U-Net as a basic building block.
3.2. Effectiveness of QSMGAN
Using the 64→48 3D U-Net as the generator, the metric-wise benefit of using QSMGAN over the 3D U-Net is shown by the quantitative metrics listed in Table 2. (p=0.03 for all metrics of 3D U-Net v.s. QSMGAN) Column 4 and 5 in Figure 6 and Figure 7 demonstrates the visual comparison of reconstructed QSM of 3D U-Net and QSMGAN, where the adversarial training further improved the quality of the reconstructed QSM map by reducing both residual blurring and the remaining edge discontinuity artifacts from the relatively smaller input patch size, providing a more accurate and detailed mapping of susceptibility compared to the 3D U-Net baseline.
Table 2.
Methods | L1 error (1e-3) | PSNR | NMSE | HFEN | SSIM |
---|---|---|---|---|---|
TKD | 2.826±0.178 | 38.82±1.69 | 0.496±0.076 | 99.84±4.86 | 0.806±0.023 |
MEDI | 2.909±0.194 | 41.24±1.71 | 0.539±0.059 | 100.99±5.02 | 0.912±0.027 |
iLSQR | 2.193±0.227 | 42.03±1.45 | 0.410±0.088 | 74.40±7.15 | 0.896±0.025 |
3D U-Net 64-->48 | 1.309±0.210 | 43.53±1.31 | 0.229±0.065 | 48.45±8.30 | 0.944±0.018 |
QSMGAN 64-->48 | 1.199±0.215 | 44.16±1.42 | 0.200±0.065 | 45.68±8.53 | 0.952±0.018 |
3.3. Comparison with non-learning-based methods
Compared to 3 common ‘non-learning-based’ QSM dipole inversion algorithms (TKD, MEDI and iLSQR), our QSMGAN approach had 42-59% reductions in NMSE and L1 error in the test datasets while increasing PSNR by 4-13% as shown in Table 2. Figure 6 and Figure 7 show examples of QSM slices from the two test subjects generated from our QSMGAN compared to non-learning-based algorithms. Although TKD had the lowest computational complexity, it also resulted in the most streaking artifacts. Despite its smooth appearance, MEDI was the least uniform with relatively high L1 error and inaccurate contrast of some fine structures such as vessels. It also required the longest computation time of all of the methods (about 2 hours on a regular desktop workstation). Although iLSQR QSM had lower L1 error than TKD and MEDI, it was visually noisier than all other methods. QSMGAN not only resulted in the best L1 error, PSNR, NMSE, HFEN, and SSIM, but achieved the most similar QSM map to COSMOS in only 2 seconds of reconstruction time per scan, the same order of time complexity as with the TKD method.
3.4. Application of networks in patients with radiation-induced CMBs
To evaluate the generalization ability of our networks, we tested our network in a cohort of 12 patients with brain tumors treated with prior radiation therapy. The median susceptibility values for each CMB and total number of CMBs per patient based on iLSQR, 3D U-Net and QSMGAN were not significantly different among methods (Kruskal-Wallis test p=0.149 and p=0.936, respectively; see Figure 8). This comparison demonstrates that the proposed QSMGAN could be well generalized to previously unseen pathology with extreme suscepbitility values. Figure 9 demonstrates the robustness of QSMGAN to artifacts from imperfect preprocessing steps such as skull stripping and background phase removal as well as its ability to generate more uniform susceptibility maps. Patient 8 (row 1) suffered from poor brain extraction and background field removal that resulted in severe susceptibility artifacts from the air-tissue interface in the sinuses in the iLSQR QSM image. Although 3D U-Net partially alleviated this problem, QSMGAN provided the most uniform and highest quality susceptibility map with the least amount of residual artifacts. Patient 12 (row 2) had residual background phase that obscured the detection of a microbleed (denoted by the red arrow) that was correctly visualized on both the deep learning-based QSM maps.
4. Discussion
Although in theory the phase-susceptibility relationship in QSM is global, meaning the tissue phase is determined by the susceptibility of all locations in the imaging volume, we still adopted a patch-based deep learning approach similar to Yoon et al., 2018 for several reasons. Since the network is 3D, the patch-based method can significantly reduce the computation complexity and memory requirement compared to whole-volume based approaches, especially when conducting high-resolution QSM. For example, if we needed to generate a full QSM volume with a 256x256x150 matrix size using the entire volume as an input to the 3D U-Net architecture, even the most advanced GPU with 32GB of graphics memory would not be able to fit a single training sample. The patch-based method also converts one single scan into hundreds of input images, even before data augmentation. Since COSMOS requires a relatively long scan time and is cumbersome to conduct, training a more generalizable deep convolutional network is beneficial when only a limited amount of data is available. Because the phase is mostly determined by nearby susceptibilities due to the properties of the susceptibility-phase convolutional kernel, the patch-based approach yields a good approximation of the dipole inversion.
As Table 1 demonstrates, increasing the input patch size and applying center cropping at the end of the 3D U-Net significantly improved the quality of the reconstructed QSM maps. This can be intuitively described by Figure 3, where when the input patch size equaled the output patch size, an output voxel near the center of the patch (Figure 3a) could receive information from the entire patch. However, a voxel near the edge of the output patch (Figure 3b) would only receive information from the orange region and a large portion of the phase information from the gray region would be missing, reducing the ability of the network to accurately solve for the susceptibility. When we increased the input patch size (Figure 3c) and cropped the output patch such that only the center of the patch was considered a valid QSM prediction, voxels near the edge of the patch regained phase input information thereby increasing the accuracy of the quantified susceptibility values.
Another observation from Table 1 is that the medium output patch size (483) achieved the best QSM reconstruction performance. The smaller patch size (323) performed worse because the output voxels received less information, introducing more error to the patch approximation of global convolution. Unexpectedly, the larger patch size (643) didn’t provide any extra benefit to the dipole inversion. This might be due to the fact that it introduced more variables into the computation process and increased the difficulty of training a good network for QSM reconstruction. In addition, for each output patch size, using excessively large input patches (such as 96-->32) did not further reduce the error but slightly downgraded the QSM quality. This might be due to increased information far from the output patch interfering with the dipole inversion.
A disadvantage of using an excessively large input patch size is the dramatically increased computational complexity and GPU memory requirement. Note that the network is three-dimensional and the computational complexity and memory requirement of training the networks roughly increases with the input patch size by O(n3). The center cropping we applied to ensure a large enough receptive field, only exacerbated this problem, greatly reducing the efficiency of the prediction process. For example, if we increased the input patch size from 323 to 643, the training/prediction time and memory became 8x as long and only 1/8 of the computed patches were utilizied. Based on the observation that excessively large input patch sizes greatly increased the computational burden without improving the quality of the resulting QSM maps, we selected the 64→48 3D U-Net as the base network to integrate with the GAN.
The rationale for the GAN training, which included adding a discriminator or “critic”, was to guide the generator (or the 3D U-Net) to further refine its result so that it could not be distinguished from a real COSMOS QSM patch. Although it took a long time (48 hours) to train the QSMGAN, once the training was finished the discriminator was no longer needed. As a result, reconstruction or prediction of the QSM map for a new scan/subject from tissue phase only required one forward pass through the 3D U-Net for each input patch, thereby resulting in a computational complexity that is identical to the 3D U-Net baseline.
Although the QSMGAN was trained only on healthy volunteer data, when applying the network to patient data, it successfully recovered the previously unseen pathology of cerebral microbleeds and assigned values similar to those obtained from iLSQR. This demonstrated that the networks avoided overfitting and managed to learn the underlying dipole convolution relationship between tissue phase and susceptibility sources. We also observed unforeseen robustness to imperfect preprocessing from QSMGAN. This was likely due to the fact that our QSMGAN was trained on carefully processed training data with little artifacts, so the generator would favor outputs with similar image quality and therefore tended to remove any abnormal suscepbility sources and remaining background phase components. Although our QSMGAN was trained on only brain images because QSM has been most widely utilized in the brain, the network can easily be trained using data from other organs of interest.
5. Conclusions
In this study, we implemented a 3D U-Net deep convolutional neural network approach to improve the dipole inversion problem in QSM reconstruction. To better approximate the global convolution property in the phase-susceptibility relationship through patch-based neural networks, we enlarged the input patch size and introduced center cropping to ensure an increased input receptive field for all neural network outputs. This cropping technique provided significantly lower edge discontinuity artifacts and higher accuracy. Including a generative adversarial network based on the WGAN-GP technique further improved the stability of training process, the image quality, and the accuracy of the susceptibility quantification. Compared to the other traditional non-learning dipole inversion algorithms such as TKD, MEDI and iLSQR, our proposed method could efficiently generate more accurate, COSMOS-like QSM maps from single-orientation, background-field-removed, tissue phase images. When tested on patients with radiation-induced CMBs, QSMGAN improved the robustness of the QSM reconstruction without sacrificing the sensitivity of CMB detection. Future directions include investigating the network’s ability to generalize to other scan parameters (such as TE, TR, and image resolution) and evaluating the performance of QSMGAN in patients with different pathologies and in other organs to ultimately improve patient care.
Table 3.
Subject | iLSQR | 3D U-Net | QSMGAN |
---|---|---|---|
1 | 5 | 4 | 5 |
2 | 14 | 3 | 3 |
3 | 10 | 11 | 13 |
4 | 12 | 9 | 7 |
5 | 29 | 25 | 26 |
6 | 18 | 15 | 15 |
7 | 3 | 5 | 6 |
8 | 4 | 5 | 4 |
9 | 7 | 9 | 10 |
10 | 10 | 11 | 13 |
11 | 25 | 22 | 21 |
12 | 25 | 23 | 22 |
Total | 158 | 142 | 145 |
Acknowledgement:
This research is supported by NIH NICHD grant R01 NS099564.
Contributor Information
Yicheng Chen, UCSF/UC Berkeley Graduate Program in Bioengineering, University of California, San Francisco and Berkeley, California; Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California.
Angela Jakary, Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California.
Sivakami Avadiappan, Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California.
Christopher P. Hess, Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California
Janine M. Lupo, UCSF/UC Berkeley Graduate Program in Bioengineering, University of California, San Francisco and Berkeley, California; Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, California
Bibliography
- Acosta-Cabronero J, Williams GB, Cardenas-Blanco A, Arnold RJ, Lupson V, Nestor PJ, 2013. In vivo quantitative susceptibility mapping (QSM) in Alzheimer’s disease. PLoS One 8. 10.1371/journal.pone.0081093 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arjovsky M, Chintala S, Bottou L, 2017. Wasserstein GAN. 10.2507/daaam.scibook.2010.27 [DOI]
- Beatty PJ, Chang S, Holmes JH, Wang K, Brau ACS, Reeder SB, Brittain JH, 2014. Design of k-space channel combination kernels and integration with parallel imaging. Magn. Reson. Med 71, 2139–2154. 10.1002/mrm.24883 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bollmann S, Rasmussen KGB, Kristensen M, Blendal RG, Østergaard LR, Plocharski M, O’Brien K, Langkammer C, Janke A, Barth M, 2019. DeepQSM - using deep learning to solve the dipole inversion for quantitative susceptibility mapping. Neuroimage 195, 373–383. 10.1016/j.neuroimage.2019.03.060 [DOI] [PubMed] [Google Scholar]
- Chen Y, Villanueva-Meyer JE, Morrison MA, Lupo JM, 2018. Toward Automatic Detection of Radiation-Induced Cerebral Microbleeds Using a 3D Deep Residual Network. J. Digit. Imaging 10.1007/s10278-018-0146-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deistung A, Schweser F, Reichenbach JR, 2016. Overview of quantitative susceptibility mapping. NMR Biomed. 10.1002/nbm.3569 [DOI] [PubMed] [Google Scholar]
- Eckstein K, Dymerska B, Bachrata B, Bogner W, Poljanc K, Trattnig S, Robinson SD, 2017. Computationally Efficient Combination of Multi-channel Phase Data From Multi-echo Acquisitions (ASPIRE). Magn. Reson. Med c, 1–11. 10.1002/mrm.26963 [DOI] [PubMed] [Google Scholar]
- Gong E, Pauly JM, Wintermark M, Zaharchuk G, 2018. Deep learning enables reduced gadolinium dose for contrast-enhanced brain MRI. J. Magn. Reson. Imaging 48, 330–340. 10.1002/jmri.25970 [DOI] [PubMed] [Google Scholar]
- Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y, 2014. Generative Adversarial Nets. Adv. Neural Inf. Process. Syst 27 2672–2680. [Google Scholar]
- Gulrajani I, Ahmed F, Arjovsky M, Dumoulin V, Courville A, 2017. Improved Training of Wasserstein GANs. 10.1073/pnas.1509047112 [DOI] [Google Scholar]
- Hammernik K, Klatzer T, Kobler E, Recht MP, Sodickson DK, Pock T, Knoll F, 2018. Learning a variational network for reconstruction of accelerated MRI data. Magn. Reson. Med. 79, 3055–3071. 10.1002/mrm.26977 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hammond KE, Lupo JM, Xu D, Metcalf M, Kelley DAC, Pelletier D, Chang SM, Mukherjee P, Vigneron DB, Nelson SJ, 2007. Development of a robust method for generating 7.0 T multichannel phase images of the brain with application to normal volunteers and patients with neurological diseases. 10.1016/j.neuroimage.2007.10.037 [DOI] [PMC free article] [PubMed]
- He K, Zhang X, Ren S, Sun J, 2015. Deep Residual Learning for Image Recognition.
- He N, Ling H, Ding B, Huang J, Zhang Y, Zhang Z, Liu C, Chen K, Yan F, 2015. Region-specific disturbed iron distribution in early idiopathic Parkinson’s disease measured by quantitative susceptibility mapping. Hum. Brain Mapp. 36, 4407–4420. 10.1002/hbm.22928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM, 2012. FSL. Neuroimage 62, 782–790. 10.1016/J.NEUROIMAGE.2011.09.015 [DOI] [PubMed] [Google Scholar]
- Kleesiek J, Urban G, Hubert A, Schwarz D, Maier-Hein K, Bendszus M, Biller A, 2016. Deep MRI brain extraction: A 3D convolutional neural network for skull stripping. Neuroimage 129, 460–469. 10.1016/j.neuroimage.2016.01.024 [DOI] [PubMed] [Google Scholar]
- Klohs J, Deistung A, Schweser F, Grandjean J, Dominietto M, Waschkies C, Nitsch RM, Knuesel I, Reichenbach JR, Rudin M, 2011. Detection of cerebral microbleeds with quantitative susceptibility mapping in the ArcAbeta mouse model of cerebral amyloidosis. J. Cereb. Blood Flow Metab. 31, 2282–2292. 10.1038/jcbfm.2011.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langkammer C, Schweser F, Shmueli K, Kames C, Li X, Guo L, Milovic C, Kim J, Wei H, Bredies K, Buch S, Guo Y, Liu Z, Meineke J, Rauscher A, Marques JP, Bilgic B, 2018. Quantitative susceptibility mapping: Report from the 2016 reconstruction challenge. Magn. Reson. Med. 79, 1661–1673. 10.1002/mrm.26830 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, Wang N, Yu F, Han H, Cao W, Romero R, Tantiwongkosi B, Duong TQ, Liu C, 2015a. A method for estimating and removing streaking artifacts in quantitative susceptibility mapping. Neuroimage 108, 111–122. 10.1016/j.neuroimage.2014.12.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, Wu B, Batrachenko A, Bancroft-Wu V, Morey RA, Shashi V, Langkammer C, De Bellis MD, Ropele S, Song AW, Liu C, 2014. Differential developmental trajectories of magnetic susceptibility in human brain gray and white matter over the lifespan. Hum. Brain Mapp. 35, 2698–2713. 10.1002/hbm.22360 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, Wu B, Liu C, 2015b. iHARPERELLA: an improved method for integrated 3D phase unwrapping and background phase removal. Proc. Intl. Soc. Mag. Reson. Med. 23, 3313. [Google Scholar]
- Li W, Wu B, Liu C, 2011. Quantitative susceptibility mapping of human brain reflects spatial variation in tissue composition. Neuroimage 55, 1645–1656. 10.1016/j.neuroimage.2010.11.088 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu C, Wei H, Gong N, Cronin M, Dibb R, Decker K, 2015. Quantitative Susceptibility Mapping: Contrast Mechanisms and Clinical Applications. Tomography 1, 3–17. 10.18383/j.tom.2015.00136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu T, Khalidov I, Rochefort L De, Spincemaille P, 2011a. A novel background field removal method for MRI using projection onto dipole fields ( PDF ) 1129–1136. 10.1002/nbm.1670 [DOI] [PMC free article] [PubMed]
- Liu T, Liu J, De Rochefort L, Spincemaille P, Khalidov I, Ledoux JR, Wang Y, 2011b. Morphology enabled dipole inversion (MEDI) from a single-angle acquisition: Comparison with COSMOS in human brain imaging. Magn. Reson. Med. 66, 777–783. 10.1002/mrm.22816 [DOI] [PubMed] [Google Scholar]
- Liu T, Spincemaille P, Rochefort L De, Kressler, Wang Y, 2009. Calculation of Susceptibility Through Multiple Magnetic Field Map to Susceptibility Source Image in 204, 196–204. 10.1002/mrm.21828 [DOI] [PubMed] [Google Scholar]
- Liu T, Surapaneni K, Lou M, Cheng L, Spincemaille P, Wang Y, 2012. Cerebral microbleeds: burden assessment by using quantitative susceptibility mapping. Radiology 262, 269–78. 10.1148/radiol.11110251 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Long J, Shelhamer E, Darrell T, 2014. Fully Convolutional Networks for Semantic Segmentation. [DOI] [PubMed]
- Mardani M, Gong E, Cheng JY, Vasanawala SS, Zaharchuk G, Xing L, Pauly JM, 2019. Deep generative adversarial neural networks for compressive sensing MRI. IEEE Trans. Med. Imaging 38, 167–179. 10.1109/TMI.2018.2858752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morrison MA, Payabvash S, Chen Y, Avadiappan S, Shah M, Zou X, Hess CP, Lupo JM, 2018. A user-guided tool for semi-automated cerebral microbleed detection and volume segmentation: Evaluating vascular injury and data labelling for machine learning. NeuroImage Clin. 20, 498–505. 10.1016/j.nicl.2018.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nie D, Trullo R, Lian J, Wang L, Petitjean C, Ruan S, Wang Q, Shen D, 2018. Medical Image Synthesis with Deep Convolutional Adversarial Networks. IEEE Trans. Biomed. Eng. 9294, 1–11. 10.1109/TBME.2018.2814538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poynton C, Jankinson M, Adalsteinsson E, Sullivan E, Pfefferbaum A, Wells W, 2015. Quantitative Susceptibility Mapping by Inversion of a Perturbation Field Model: Correlation with Brain Iron in Normal Aging. IEEE Trans. Med. Imaging 34, 339–353. 10.1002/cncr.29075.Familial [DOI] [PMC free article] [PubMed] [Google Scholar]
- Radford A, Metz L, Chintala S, 2015. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks 1–16. 10.1051/0004-6361/201527329 [DOI]
- Ren S, He K, Girshick R, Sun J, 2015. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. [DOI] [PubMed]
- Ronneberger O, Fischer P, Brox T, 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation 1–8.
- Schlemper J, Yang G, Ferreira P, Scott A, McGill LA, Khalique Z, Gorodezky M, Roehl M, Keegan J, Pennell D, Firmin D, Rueckert D, 2018. Stochastic deep compressive sensing for the reconstruction of diffusion tensor cardiac MRI. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics) 11070 LNCS, 295–303. 10.1007/978-3-030-00928-1_34 [DOI] [Google Scholar]
- Seitzer M, Yang G, Schlemper J, Oktay O, Würfl T, Christlein V, Wong T, Mohiaddin R, Firmin D, Keegan J, Rueckert D, Maier A, 2018. Adversarial and perceptual refinement for compressed sensing MRI reconstruction. Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics) 11070 LNCS, 232–240. 10.1007/978-3-030-00928-1_27 [DOI] [Google Scholar]
- Shmueli K, de Zwart JA, van Gelderen P, Li T-Q, Dodd SJ, Duyn JH, 2009. Magnetic susceptibility mapping of brain tissue in vivo using MRI phase data. Magn. Reson. Med. 62, 1510–22. 10.1002/mrm.22135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun H, Wilman AH, 2014. Background field removal using spherical mean value filtering and Tikhonov regularization. Magn. Reson. Med. 71, 1151–1157. 10.1002/mrm.24765 [DOI] [PubMed] [Google Scholar]
- Van Bergen JMG, Hua J, Unschuld PG, Lim IAL, Jones CK, Margolis RL, Ross CA, Van Zijl PCM, Li X, 2016. Quantitative susceptibility mapping suggests altered brain iron in premanifest Huntington disease. Am. J. Neuroradiol. 37, 789–796. 10.3174/ajnr.A4617 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y, Spincemaille P, Liu Z, Dimov A, Deh K, Li J, Zhang Y, Yao Y, Gillen KM, Wilman AH, Gupta A, Tsiouris AJ, Kovanlikaya I, Chiang GCY, Weinsaft JW, Tanenbaum L, Chen W, Zhu W, Chang S, Lou M, Kopell BH, Kaplitt MG, Devos D, Hirai T, Huang X, Korogi Y, Shtilbans A, Jahng GH, Pelletier D, Gauthier SA, Pitt D, Bush AI, Brittenham GM, Prince MR, 2017. Clinical quantitative susceptibility mapping (QSM): Biometal imaging and its emerging roles in patient care. J. Magn. Reson. Imaging 1–21. 10.1002/jmri.25693 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu B, Li W, Guidon A, Liu C, 2012. Whole brain susceptibility mapping using compressed sensing. Magn. Reson. Med. 67, 137–147. 10.1002/mrm.23000 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang G, Yu S, Dong H, Slabaugh G, Dragotti PL, Ye X, Liu F, Arridge S, Keegan J, Guo Y, Firmin D, 2018. DAGAN: Deep De-Aliasing Generative Adversarial Networks for Fast Compressed Sensing MRI Reconstruction. IEEE Trans. Med. Imaging 37, 1310–1321. 10.1109/TMI.2017.2785879 [DOI] [PubMed] [Google Scholar]
- Yoon J, Gong E, Chatnuntawech I, Bilgic B, Lee Jingu, Jung W, Ko J, Jung H, Setsompop, Zaharchuk G, Kim EY, Pauly J, Lee Jongho, 2018. Quantitative susceptibility mapping using deep neural network: QSMnet. Neuroimage 179, 199–206. 10.1016/J.NEUROIMAGE.2018.06.030 [DOI] [PubMed] [Google Scholar]
- Zbontar J, Knoll F, Sriram A, Muckley MJ, Bruno M, Defazio A, Parente M, Geras KJ, Katsnelson J, Chandarana H, Zhang Z, Drozdzal M, Romero A, Rabbat M, Vincent P, Pinkerton J, Wang D, Yakubova N, Owens E, Zitnick CL, Recht MP, Sodickson DK, Lui YW, 2018. fastMRI: An Open Dataset and Benchmarks for Accelerated MRI 1–29.
- Zhu J, Yang G, Lio P, 2019a. How Can We Make Gan Perform Better in Single Medical Image Super-Resolution? A Lesion Focused Multi-Scale Approach 1669–1673. 10.1109/isbi.2019.8759517 [DOI] [Google Scholar]
- Zhu J, Yang G, Lio P, 2019b. Lesion focused super-resolution 56. 10.1117/12.2512576 [DOI]
- Zhu JY, Park T, Isola P, Efros AA, 2017. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proc. IEEE Int. Conf. Comput. Vis. 2017-Octob, 2242–2251. 10.1109/ICCV.2017.244. [DOI] [Google Scholar]