Abstract
This work investigates the use of configuration state imaging together with deep neural networks to develop quantitative MRI techniques for deployment in an interventional setting. A physics modeling technique for inhomogeneous fields and heterogeneous tissues is presented and used to evaluate the theoretical capability of neural networks to estimate parameter maps from configuration state signal data. All tested normalization strategies achieved similar performance in estimating and . Varying network architecture and data normalization had substantial impacts on estimated flip angle and , highlighting their importance in developing neural networks to solve these inverse problems. The developed signal modeling technique provides an environment that will enable the development and evaluation of physics-informed machine learning techniques for MR parameter mapping and facilitate the development of quantitative MRI techniques to inform clinical decisions during MR-guided treatments.
Keywords: Quantitative MRI, Physics informed neural networks
1. Introduction
Magnetic resonance imaging (MRI) has been a critical tool for gathering clinical information about soft tissues within the body. Conventionally, MRI has offered radiologists qualitative insights into the identities and properties of biological tissues. Quantitative measurements of MR-sensitive parameters have shown substantial clinical value for the evaluation and monitoring of cancer [10, 12, 14]. As the use of MRI to guide interventional procedures increases, quantitative MRI (qMRI) has the potential to substantially impact oncology interventions. Clinical adoption of qMRI has been limited by the requirements for collecting and processing the data needed to make these measurements [5, 6]. Advances in MRI acquisition and reconstruction, together with modern deep learning techniques, have the potential to address time and computing constraints that have impeded the clinical deployment of qMRI. Deep learning techniques must first overcome several technical challenges posed by this context.
Large, diverse training datasets are typically required to achieve reliable performance of deep learning models. In qMRI, such datasets are not available and are costly to create [6]. Additionally, varying imaging parameters and system imperfections across scanners and protocols adversely impact current deeplearning models in MRI [11, 13]. These inconsistencies are exacerbated in the interventional context. Finally, for any qMRI technique to achieve the desired clinical impact, the uncertainty and repeatability must be adequately understood to inform clinical decisions [2]. Physics-informed neural networks (PINNs) have the potential to address data and generalization challenges faced by current machine learning approaches in qMRI [6, 18]. To support the development of new PINN approaches to quantitative MRI, this work develops physics modeling techniques for inhomogeneous fields and heterogeneous tissues with protons bound to water and to fat contributing to the received signal. This modeling technique is then used to evaluate the theoretical capability of neural networks to estimate parameter maps from simulated configuration state imaging data.
Unbalanced steady-state sequences have the capacity to sample multiple configuration states that uniquely encode both tissue properties and field maps [8, 15]. For this work, a multi-echo configuration state imaging sequence previously developed for brain imaging was evaluated [4], with monopolar readout gradients to facilitate imaging fat. Gradients are balanced along the phase encoding directions and unbalanced along the frequency encoding direction. The unbalanced frequency encoding gradient results in many configuration states in steady-state, which are sampled as each pathway becomes coherent during readout (Fig. 1a). The different encoding of flip angle and information in various configuration states (Fig. 1b) suggests that a mapping could be learned to separate flip angle and information from a single set of configuration state signals at a single nominal flip angle. The resulting data enable the creation of quantitative , and maps from a single acquisition while also reconstructing anatomically meaningful and interpretable source images. Analytical models estimating these parameters from configuration state imaging often necessitate making many assumptions that cannot be satisfied in the real imaging environment or require the acquisition of additional information [1, 9]. The potential for neural networks to relax these assumptions in a data-informed way and extract relevant additional information (explicitly or implicitly) provides an alternative approach for solving these inverse problems.
2. Methods
2.1. MR Forward Model
The Bloch Equation.
For an isolated population of hydrogen spins, the behavior of the bulk nuclear magnetic moment, , in the presence of an external magnetic field is defined by the Bloch equation,
(1) |
where is the gyromagnetic ratio, is the external magnetic field, and are the longitudinal and transverse relaxation times, respectively, and is the equilibrium magnetization in the direction of the main magnetic field.
To enable the modeling of a heterogeneous environment where signal may originate from many spin populations with varying local magnetic fields and resonance frequencies, we developed a technique for simulating MR signal evolution for samples with arbitrary NMR spectra and parameters in the presence of typical field inhomogeneities.
Bloch Solver.
The on-resonance frequency for a particular spin population is proportional to the strength of the external magnetic field, , where the proportionality constant is also the gyromagnetic ratio . Depending on their local chemical environment, spins of the same type of nucleus may exhibit different resonance frequencies, described by their chemical shift, . For example, the primary peak of the NMR spectra for lipid-bound protons has a chemical shift of relative to water protons.
We allow tissue-specific parameters , and the population resonance frequency to be spatially varying, while the external field may vary over space and time. Different system features are considered along the longitudinal and transverse directions of the external magnetic field. Longitudinally, the external field consists of the main magnetic field, , magnetic gradient fields, , and main field spatial inhomogeneity, . The x- and y -components of the external field are defined by the applied radio-frequency excitation field, , which can be considered as a separable function of the time-varying excitation envelope (on-resonance frequency demodulated), , and the spatially-varying excitation efficiency, .
Equation 1 can be written as the combination of two operations on : precession with angular velocity and relaxation governed by and . Demodulating the effects of on-resonance precession about the main magnetic field, precessional angular velocity is given by
(2) |
The solution was approximated using a symmetric operator splitting technique [7], where a piecewise constant field was assumed (10μs resolution when temporally varying) and the new magnetic moment obtained by sequential application of rotation and relaxation operators. Specifically, the magnetic moment after an interval is approximated by applying half the total rotation, applying relaxation and recovery, and then applying the remaining rotation:
(3) |
is the axis-angle rotation matrix for rotation about the unit vector by the angle . The relaxation components are given by
(4) |
(5) |
Signal Model.
The Bloch equation alone is inadequate to model some practical NMR phenomena, such as spin echoes [17]. In practice, the measured MR signal demonstrates a reversible decay of signal due to local variations in resonance frequency and dephasing due to nonzero gradients, in addition to irreversible decay. This is modeled by the collective evolution of a population of spins with varying chemical shifts and positions along the readout direction (a 2D grid of spectroscopic axis and position). Reversible decay due to inhomogeneous broadening is observed as a monoexponential decay following . Because different phase coherence pathways are known to have varying levels of gradient-echo- and spin-echo-like properties, the adopted technique must be able to independently model each of these components. This is achieved by solving the Bloch equation for spins with varying chemical shift and adding them together with a Lorentzian weighting (). Along this spectroscopic axis, the resonance frequency increment must be small enough to prevent time-domain aliasing. This limit can be expressed as
(6) |
where is the total simulated duration, is the maximum allowed relative error (in this work, 10−3), and is the combined rate including the effects of both irreversible and reversible signal decay,
(7) |
After simulation, the signal at a given readout time was computed as the weighted sum of the magnetization vectors of all the spins for that voxel, the spectroscopic axis added together with Lorentzian weighting and the position axis with uniform weighting. MATLAB code for the Bloch solver and signal model are available at https://github.com/fuslab-uofu/mri-signal-model.
2.2. Neural Network Training
Configuration state signals were simulated for a range of MR parameters observed in breast cancer patients in the literature [3, 16] (). Other parameters were randomly initialized from, with efficiency varying from 5–150%, and in ±3 μT. The sequence had fixed TR=20 ms and a main magnetic field strength , with 250 repetitions to reach steady-state. Random configurations were used to generate 262,144 samples. When included, gaussian noise was added to the real and imaginary channels separately. Target values were normalized by dividing by the maximum value. At training time, samples were divided into a random 80/20 training/evaluation split.
Neural networks were trained to estimate , and achieved flip angle (FA) using three different normalization schemes: 1) normalization on a sample basis, dividing by the magnitude of the 0th pathway; 2) normalization on a pathway basis, dividing by the maximum achieved signal intensity for a particular pathway over the full dataset; and 3) batch normalization (to standard normal distribution) of each individual feature (4 pathways × 3 echoes = 12 features). Networks had a fully connected architecture with 12 inputs features and a single output with GELU activation at the input to each hidden node, supervised on the simulated data for 200 epochs using Adam optimizer (initial learning rate 1e–4) and plateau learning rate scheduling. Networks were trained without and with added noise (1% of mean, when added), and three variations of network depth and breadth were used for training (number of hidden layers/nodes: [16, 32, 64, 32, 16]; [256, 512, 512, 256]; or [512, 1024, 512]).
Additional experiments were conducted evaluating the performance of networks predicting all relaxation parameters with field maps as additional inputs. Network inputs consisted of sample-normalized signal magnitude values and the offset and achieved FA for each sample (4 pathways × 3 echoes + 2 = 14 inputs). A fully connected network with [512, 1024, 512] hidden nodes and GELU activation at the input to each hidden node were trained for 200 epochs with , and as targets, both with and without added noise (noise also added to field map values as 1% of max). To evaluate the need for field maps as inputs, equivalent networks were trained using the same scheme, except without FA and/or values as inputs. An additional network was trained with the same relaxometry prediction architecture with achieved flip angle as input, but using the output of the batch-normalized FA estimation network as the input FA value.
As a demonstration of the intended final use case, fully connected neural networks were also trained on real data acquired in a salt pork phantom (TR=20 ms; TE=2.9/8.5/14 ms; FA=10°; FOV/sample spacing =256/2 mm cube; total time=2 min 11s), supervised on T1 and T2 maps of the same sample obtained ( and maps were not acquired in this dataset). Inputs were 3 × 3 × 3 patches of 4 pathways × 3 echoes, normalized using pathway-based normalization. Hidden layers had [512, 1024, 512, 256] nodes, with GELU activation on inputs to hidden layers. All neural network training was conducted using Python 3.9.17 with PyTorch 2.0.1.
3. Results
Results for networks with [512, 1024, 512] hidden nodes trained to estimate , and achieved FA on noisy simulated data with various normalization strategies are summarized in Table 1. Sample normalized inputs gave the most accurate predictions for and (Fig. 2b–c). This normalization scheme resulted in poor fits for estimating and FA (Fig. 2a, d), with noiseless training giving nearly equivalent results to the noisy condition (noiseless RMSE/R: 398/0.51 for , 2.3/0.85 for FA). Pathway-based normalization resulted in the best performance for and estimation, with slightly degraded performance in estimating and (Fig. 2e–h).
Table 1.
Normalization | Sample norm + additional inputs | ||||||||
---|---|---|---|---|---|---|---|---|---|
| |||||||||
Target | Metric | Sample | Pathway | Batch | None | {FA, B0} | FA | B 0 | est. FA |
| |||||||||
T1 (ms) | RMSE | 399 | 25 | 69 | 398 | 36 | 36 | 398 | 59 |
Slope | 0.26 | 0.99 | 0.97 | 0.26 | 0.99 | 0.99 | 0.26 | 0.98 | |
Intercept | 734 | 5.5 | 26 | 743 | 6.5 | 8.8 | 743 | 17.6 | |
R | 0.51 | 1.00 | 0.99 | 0.51 | 1.00 | 1.00 | 0.51 | 0.99 | |
| |||||||||
T2 (ms) | RMSE | 1.6 | 2.1 | 6.1 | 2.1 | 1.7 | 1.6 | 2.0 | 2.0 |
Slope | 1.00 | 1.01 | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
Intercept | 0.12 | 0.28 | 0.98 | 0.15 | 0.079 | 0.21 | 0.15 | 0.14 | |
R | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| |||||||||
(ms) | RMSE | 5.7 | 6.8 | 7.9 | 6.0 | 5.8 | 5.7 | 6.0 | 7.1 |
Slope | 0.99 | 0.99 | 0.98 | 0.99 | 0.99 | 0.99 | 0.99 | 0.98 | |
Intercept | 0.61 | 0.63 | 0.92 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
R | 1.00 | 1.00 | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| |||||||||
FA (°) | RMSE | 2.3 | 0.054 | 0.23 | |||||
Slope | 0.72 | 1.00 | 1.00 | ||||||
Intercept | 2.8 | 0.00057 | 0.038 | ||||||
R | 0.85 | 1.00 | 1.00 |
Providing achieved FA as additional information at the network inputs resulted in improved ability to estimate , with negligible improvement in and estimation (Table 1). Figure 3 shows the prediction performance in real data acquired in salt pork, comparing the acquired and maps with network training results.
4. Discussion
These results provide a preliminary demonstration of the theoretical capacity for configuration state imaging to serve as the basis for rapid quantitative MRI. Trained neural networks were successful in predicting achieved flip angle, , even with noisy data. Unlike the highly undersampled acquisitions used for MR fingerprinting, this technique provides meaningful anatomical images and a readily interpretable basis for neural network-based contrast translation.
The results presented here demonstrate the potential impact of architectural and training decisions on the ability of fully connected neural networks to solve these inverse problems. Even for closely related problems, the optimal configuration may depend on the specific parameter of interest. Sample-based normalization provided the best evaluation performance in estimating and , but the worst performance for estimating and achieved FA. Including estimated FA as an input to relaxation parameter estimation dramatically improved the ability to estimate . Physically meaningful intermediate stages and architectural biases may therefore be important in training networks for this application.
The developed signal modeling technique provides a simulation environment that can test network designs on controlled datasets that include challenges expected in the MRI environment of interest, facilitating the investigation of the limits and guarantees of the resulting machine learning system. Although not evaluated in this work, the technique is extensible to model additional challenges including motion and flow, field drift, and partial voluming. This will enable evaluation of deep operator learning architectures to incorporate sequence parameters (TR, TE) and scanner characteristics (slew rate, field strength) as extra information, providing better tolerance of the variation expected in clinical workflows. The required complexity of forward models for self-supervised physics-informed training can also be evaluated in this environment. It is expected that increasing the incorporation of physics information into the network architecture and training will further improve network performance on real data.
Future work will evaluate the transfer of learned mappings from simulated data to real data and the application of neural interpolators to estimate parameter values for real data. It may also be feasible to leverage redundant spatial information across the pathways to improve SNR of real data. The impact of magnetization transfer, which is not accounted for in the current model, will also be evaluated. Further development of this technique will enable the acquisition of quantitative maps that facilitate clinical decision making during MR-guided treatments.
Acknowledgments.
Funded through NIH R01CA259686.
Footnotes
Disclosure of Interests. The authors have no competing interests to declare that are relevant to the content of this article.
References
- 1.Bruder H, Fischer H, Graumann R, Deimling M: A new steady-state imaging sequence for simultaneous acquisition of two mr images with clearly different contrasts. Magnetic Resonance in Medicine 7, 35–42 (May 1988). 10.1002/mrm.1910070105 [DOI] [PubMed] [Google Scholar]
- 2.Cashmore MT, McCann AJ, Wastling SJ, McGrath C, Thornton J, Hall MG: Clinical quantitative mri and the need for metrology. The British Journal of Radiology 94, 20201215 (March 2021). 10.1259/BJR.20201215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Chen Y, Panda A, Pahwa S, Hamilton JI, Dastmalchian S, McGivney DF, Ma D, Batesole J, Seiberlich N, Griswold MA, Plecha D, Gulani V: Three-dimensional mr fingerprinting for quantitative breast imaging. Radiology 290, 33–40 (October 2018). 10.1148/radiol.2018180836 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Cheng CC, Preiswerk F, Madore B: Multi-pathway multi-echo acquisition and neural contrast translation to generate a variety of quantitative and qualitative image contrasts. Magnetic resonance in medicine 83, 2310–2321 (June 2020). 10.1002/mrm.28077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cheng HLM, Stikov N, Ghugre NR, Wright GA: Practical medical applications of quantitative mr relaxometry. Journal of Magnetic Resonance Imaging 36, 805–824 (2012). 10.1002/jmri.23718 [DOI] [PubMed] [Google Scholar]
- 6.Feng L, Ma D, Liu F: Rapid mr relaxometry using deep learning: An overview of current techniques and emerging trends. NMR in Biomedicine 35, e4416 (April 2022). 10.1002/nbm.4416 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Graf C, Rund A, Aigner CS, Stollberger R: Accuracy and performance analysis for bloch and bloch-mcconnell simulation methods. Journal of Magnetic Resonance 329, 107011 (2021). 10.1016/j.jmr.2021.107011 [DOI] [PubMed] [Google Scholar]
- 8.Hennig J: Echoes–how to generate, recognize, use or avoid them in mr-imaging sequences. part i: Fundamental and not so fundamental properties of spin echoes. Concepts in Magnetic Resonance 3, 125–143 (7 1991). 10.1002/cmr.1820030302 [DOI] [Google Scholar]
- 9.Heule R, Ganter C, Bieri O: Triple echo steady-state (tess) relaxometry. Magnetic Resonance in Medicine 71, 230–237 (2014). 10.1002/mrm.24659 [DOI] [PubMed] [Google Scholar]
- 10.Jarrett AM, Kazerouni AS, Wu C, Virostko J, Sorace AG, DiCarlo JC, Hormuth DA, Ekrut DA, Patt D, Goodgame B, Avery S, Yankeelov TE: Quantitative magnetic resonance imaging and tumor forecasting of breast cancer patients in the community setting. Nature Protocols 16, 5309–5338 (2021). 10.1038/s41596-021-00617-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Knoll F, Hammernik K, Kobler E, Pock T, Recht MP, Sodickson DK: Assessment of the generalization of learned image reconstruction and the potential for transfer learning. Magnetic Resonance in Medicine 81, 116–128 (2019). 10.1002/mrm.27355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lescher S, Jurcoane A, Veit A, Bähr O, Deichmann R, Hattingen E: Quantitative t1 and t2 mapping in recurrent glioblastomas under bevacizumab: earlier detection of tumor progression compared to conventional mri. Neuroradiology 57, 11–20 (2015). 10.1007/s00234-014-1445-9 [DOI] [PubMed] [Google Scholar]
- 13.Liu F, Samsonov A, Chen L, Kijowski R, Feng L: Santis: Sampling-augmented neural network with incoherent structure for mr image reconstruction. Magnetic Resonance in Medicine 82, 1890–1904 (2019). 10.1002/mrm.27827 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mai J, Abubrig M, Lehmann T, Hilbert T, Weiland E, Grimm MO, Teichgräber U, Franiel T: T2 mapping in prostate cancer. Investigative Radiology 54 (2019). 10.1097/RLI.0000000000000520 [DOI] [PubMed] [Google Scholar]
- 15.Mizumoto CT, Yoshitome E: Multiple echo ssfp sequences. Magnetic Resonance in Medicine 18, 244–250 (1991). 10.1002/mrm.1910180126 [DOI] [PubMed] [Google Scholar]
- 16.Rakow-Penner R, Daniel B, Yu H, Sawyer-Glover A, Glover GH: Relaxation times of breast tissue at 1.5t and 3t measured using ideal. Journal of Magnetic Resonance Imaging 23, 87–91 (January 2006). 10.1002/jmri.20469 [DOI] [PubMed] [Google Scholar]
- 17.Russek SE, Stupic KF, Biller JR, Boss MA, Keenan KE, Mirowski E: Electromagnetics for Quantitative Magnetic Resonance Imaging, pp. 95–147. World Scientific; (2020). 10.1142/9789813270268_0003 [DOI] [Google Scholar]
- 18.Willard J, Jia X, Xu S, Steinbach M, Kumar V: Integrating scientific knowledge with machine learning for engineering and environmental systems (2020). 10.48550/ARXIV.2003.04919 [DOI] [Google Scholar]