Abstract
Purpose:
Machine learning (ML) has been increasingly used to quantify chemical exchange saturation transfer (CEST) effect. ML models are typically trained using either measured data or fully simulated data. However, training with measured data often lacks sufficient training data, while training with fully simulated data may introduce bias due to limited simulations pools. This study introduces a new platform that combines simulated and measured components to generate partially synthetic CEST data, and to evaluate its feasibility for training ML models to predict amide proton transfer (APT) effect.
Methods:
Partially synthetic CEST signals were created using an inverse summation of APT effects from simulations and the other components from measurements. Training data were generated by varying APT simulation parameters and applying scaling factors to adjust the measured components, achieving a balance between simulation flexibility and fidelity. First, tissue-mimicking CEST signals along with ground truth information were created using multiple-pool model simulations to validate this method. Second, an ML model was trained individually on partially synthetic data, in vivo data, and fully simulated data, to predict APT effect in rat brains bearing 9L tumors.
Results:
Experiments on tissue-mimicking data suggest that the ML method using the partially synthetic data is accurate in predicting APT. In vivo experiments suggest that our method provides more accurate and robust prediction than the training using in vivo data and fully synthetic data.
Conclusion:
Partially synthetic CEST data can address the challenges in conventional ML methods.
Keywords: Chemical exchange saturation transfer (CEST), amide proton transfer (APT), machine learning, tumor
INTRODUCTION
Chemical exchange saturation transfer (CEST) is a magnetic resonance imaging (MRI) contrast mechanism that enables the detection of low-concentration solute molecules with exchangeable/coupling protons, which are not visible using conventional MRI techniques (1–5). In CEST imaging, a long radio frequency (RF) pulse is used to saturate the solute protons, and the resulting change in the water signal due to saturation transfer is measured. By plotting the water signal as a function of the RF frequency offset, a CEST Z-spectrum is obtained, allowing the observation of all CEST effects arising from exchangeable or coupling protons. One specific CEST effect of interest is the amide proton transfer (APT) effect, which occurs at 3.5 ppm from water. APT arises from the chemical exchange between amide protons in mobile proteins or peptides and water protons (6,7). APT has gained significant attention as a promising technique for protein imaging and pH imaging. It has demonstrated potential in various diagnostic applications, including tumor detection (8–11), ischemic stroke (12–14), multiple sclerosis (15,16), traumatic brain injury (17), Alzheimer’s disease (18–22), and Parkinson’s disease (23,24).
Until now, achieving specific APT imaging has been a challenging task due to various confounding factors. APT signals in biological tissues are influenced not only by the APT effect but also by overlapping signals from the direct water saturation (DS) effect, semisolid magnetization transfer (MT) effect, and nearby amine CEST effect. Multiple-pool Lorentzian fit is a commonly used data analysis method to isolate the APT effect from confounding signals. However, its accuracy strongly depends on the fitting model as well as the imaging signal-to-noise ratio (SNR), initial values, boundaries, and experimental conditions. Recently, machine learning methods has emerged as a promising approach for quantifying APT effects, offering improved robustness and processing speed compared to the multiple-pool Lorentzian fit (25–27). Typically, the ML model is trained on measured in vivo data. However, the training using measured in vivo data usually has two limitations: 1) poor-quality ground truth data; and 2) insufficient training data. Previous attempts have been made to generate ground truth data using the multiple-pool Lorentzian fitted APT effect from each voxel, but the low SNR of CEST signals from a single voxel makes it unsuitable for reliable ground truth data for training. In addition, although models trained on a limited number of patients or healthy subjects can be generalized to unseen patients, improving generalizability requires more training data. First, APT effects vary across different disease types and stages, emphasizing the need for a broader range of training data (8–11). Second, variations in other sample parameters such as water relaxations, amine CEST, nuclear Overhauser enhancement (NOE), and MT effect add complexity to the prediction process in different diseases (28–31). Training data that encompass these diverse features would facilitate more robust predictions. However, acquiring such data is impractical.
The use of synthetic data generated through simulations has become a valuable approach for addressing challenges related to data collection, data annotation (ground truth), and data quality assurance (32,33). In the field of MRI, synthetic data have been successfully employed to train ML models for quantitative parameter prediction (34–43). Synthetic data is categorized as fully synthetic or partially synthetic data. Fully synthetic data solely consists of simulated data, while partially synthetic data involve replacing selected features with simulated data while preserving real data components. In an ideal scenario, fully synthetic data would be generated by numerically simulating the Bloch-McConnell equations, considering all exchanging pools and variations in sample parameters, to closely replicate CEST signals from tissues. However, this proves challenging in practice, due to the unknown values of numerous sample parameters in tissues. Consequently, a wide range of sample parameters must be simulated to ensure the synthetic data encompasses the real sample parameters. This approach may generate training data that contains irrelevant features, potentially leading to overfitting. Furthermore, as the size of the training dataset increases, it may become difficult to fit all the training data into memory. Partially synthetic data offers a more practical solution by combining simulated data with real data components. By incorporating simulated features into real data, partially synthetic data provides a more comprehensive representation of the underlying characteristics of the imaging data. This approach allows for more accurate modeling and prediction of quantitative parameters using ML models.
In this study, we introduced a novel platform that integrates simulated components and measured components to generate partially synthetic CEST data. This data encompasses a wide range of pathological changes and provides accurate ground truth information, addressing the limitations of ML CEST imaging using either the measured data or fully synthetic data. First, we validated the capability of this platform to generate partially synthetic CEST data that accurately mimics real data. Second, we evaluated the accuracy of predicting the APT effect using a ML model trained on the partially synthetic CEST data. Finally, we compared the performance of the ML model trained on the partially synthetic CEST data with that trained directly on measured in vivo data and fully synthetic CEST data, to demonstrate the advantages of our method.
METHODS
Animal preparation
Eight rats with 9L tumors were included in this study. For the induction of brain tumor, each rat was injected with 1 × 105 9L glioblastoma cells and imaged after 15-20 days. All rats were immobilized and anesthetized with a 2%/98% isoflurane/oxygen mixture during data acquisition. Respiration was monitored to be stable, and a constant rectal temperature of 37°C was maintained throughout the experiments using a warm-air feedback system (SA Instruments, Stony Brook, NY, USA). All animal procedures were approved by the Animal Care and Usage Committee of Vanderbilt University Medical Center.
MRI
All measurements were performed on a Varian DirectDrive™ horizontal 9.4 T magnet with a 38-mm Litz RF coil (Doty Scientific Inc. Columbia, SC). CEST measurements were performed by applying a continuous-wave CEST sequence with a 5s-rectangular saturation pulse followed by single-shot spin-echo echo planar imaging (SE-EPI) acquisition with repetition time (TR) of 7s. The time of echo (TE) was 27ms. Z-spectra were acquired with RF frequency offsets (Δω) at ±4000, ±3500, ±3000, and from −2000 to 2000 Hz, with a step of 50 Hz (−10 to 10 ppm on 9.4T) and an RF saturation power (ω1) of 1μT. Control image intensity (S0) was obtained by setting Δω to 100 kHz (250 ppm on 9.4T). The acquisition of a Z-spectrum took around 15mins. MT pool size ratio (fm) and the observed water longitudinal relaxation time (T1obs = 1/R1obs) were measured using a selective inversion recovery (SIR) method (44). All images were acquired with a matrix size of 64 × 64, a field of view of 30 × 30mm2, and one acquisition.
Quantification metrics
To isolate the target CEST/NOE effect from confounding factors, a reference signal that contains contributions only from the confounding factors, excluding the target effect, and a label signal that contains all contributions are typically obtained. Conventionally, a chemical exchange saturation transfer ratio (CESTR) metric is calculated by subtracting from . The CESTR is given by,
| (1) |
Additionally, an apparent exchange-dependent relaxation (AREX) metric (45) which inversely subtracts from with normalization is given by,
| (2) |
where represents the exchanging/coupling effect in the rotating frame. For slow exchanging/coupling pools (e.g. APT and NOE), can be described by (46)
| (3) |
where and are the solute concentration and solute-water exchange rate, respectively; is the solute resonance frequency; and is the solute transverse relaxation rate.
Multiple-pool Lorentzian fit
The multiple-pool Lorentzian fit approach was used to process the CEST Z-spectra to isolate each pool. The model function for each Lorentzian is defined as:
| (4) |
Where, S represents CEST image intensity. , represents a Lorentzian line with a central frequency offset from water , full width at half maximum , and peak amplitude . N is the number of fitted pools. In the brain, several pools have been reported, including amide at 3.5ppm (6), amine at 3ppm (47,48), guanidinium at 2ppm (49,50), hydroxyls below 1ppm (51), water, NOE at −1.6ppm (NOE(−1.6)) (52,53), NOE at −3.5ppm (NOE(−3.5)) (31,54), and MT at −2.3ppm (55). However, due to their proximity in frequency offsets and broad peak shapes, it is challenging to separate amine, guanidine, and hydroxyls from each other. Therefore, for processing the CEST Z-spectra, they are typically treated as a single amine pool centered at 2ppm (56). Here, a six-pool Lorentzian fit model including amide (L1), amine (L2), water (L3), NOE(−1.6) (L4), NOE(−3.5) (L5), and MT (L6) was used to process the entire CEST Z-spectra. The fitting was performed to achieve the lowest root mean square (RMS) of residuals between the measured data and model with the least-squares optimization. Supporting Information Table S1 lists the starting points and boundaries of the fit. In the multiple-pool Lorentzian fit model, was obtained by doing the subtraction between 1 and the sum of all the Lorentzian fits, and for each exchanging/coupling pool was obtained by doting the subtraction between 1 and the sum of all Lorentzian fits except that of the corresponding pool (57). The multiple-pool Lorentzian fitted APT spectrum was quantified by Eq. (1).
Generation of partially synthetic CEST data
In a multiple-pool model including CEST/NOE, DS, and MT, the CEST signal has been descripted by the inverse summation of these effects in the rotating frame (45,46). This relationship enables us to effectively incorporate the desired combination of simulated and measured components to create the partially synthetic CEST data. In this study, we made modifications to this relationship and formulated Eq. (5) to generate the partially synthetic CEST signals.
| (5) |
where is the effective water relaxation in the rotating frame; , , , and are APT, NOE, amine CEST, and MT effects in the rotating frame, respectively (45,46); and and are two scaling factors. Supporting information Table S2 describes each parameters.
Specifically, we used and components from measurements, as well as , , and components from simulations, to generate partially synthetic CEST data. The component was quantified by the AREX metric in Eq. (2), employing the multiple-pool Lorentzian fitted Sref for amines. The component was calculated using the formula R1obsL6/(1-L6), where L6 is also obtained from the multiple-pool Lorentzian fit (see Supplementary Information Theory for the derivation). The and components were calculated using Eq. (3). The component was obtained using the following equations:
| (6) |
where was calculated using
| (7) |
in which is the semisolid longitudinal relaxation rate. is obtained from measurement to align with .
Sample parameters including , , T1, and T2, along with the scaling factors r for each pool were varied to generate diverse training data, as detailed in supporting information Table S3. The target CESTR quantified APT spectrum was obtained by the subtraction of two partially synthetic CEST Z-spectra with/without the amide pool using Eq. (5).
Generation of tissue-mimicking CEST data
To evaluate the accuracy of the ML model trained on the partially synthetic CEST data, we generated tissue-mimicking CEST data that provided ground truth information. The tissue-mimicking CEST Z-spectra were created through numerical simulation of the Bloch-McConnell equation, using the same sequence parameters as those employed in the MRI experiments. To simulate the major tissue components found in the brain, we employed a seven-pool model that included amide at 3.5ppm, amine at 3ppm, guanidinium at 2ppm, water, NOE(−1.6), NOE(−3.5), and MT (56). For the purpose of testing other methods, we varied sample parameters including , , T1, and T2 for each pool, as detailed in supporting information Table S4. The values of , , and T2 for water and amide pool, as well as the value of for NOE(−3.5), were different from those used to create the partially synthetic data. Additionally, the values of , , and/or T2 for guanidine, amine, and MT were varied to introduce variations in the amplitude and width of their CEST/MT effects. The ground truth CESTR-quantified APT spectrum for this tissue-mimicking data was obtained by the subtraction of two CEST Z-spectra with/without the amide pool through the simulation of the Bloch equations.
Generation of fully synthetic CEST data
The fully synthetic CEST Z-spectra were also created through numerical simulation of the Bloch-McConnell equation, using the same sequence parameters as those employed in creating the partially synthetic data and followed the same models employed for the tissue-mimicking data. The target CESTR-quantified APT spectrum (also ground truth) for this fully synthetic data was obtained by subtracting two CEST Z-spectra with/without the amide pool through the simulation of the Bloch equations.
Data analysis
1) To validate the feasibility to use Eq. (5) for generating accurate partially synthetic CEST signals, we used the average of each multiple-pool Lorentzian fitted CEST/MT component, , and (; ) from the measurements on the eight rat brains to generate a synthetic CEST Z-spectrum and compared this spectrum with the original measured CEST Z-spectrum. It is worth noting that, in this validation process, the and components were quantified by the AREX metric, employing the multiple-pool Lorentzian fitted for them, instead of using simulations. Additionally, the component was calculated using the formula R1obsSref_w/Slab, where Sref_w represents the reference signal from the water pool, but not the simulation (see Supplementary Information Theory for the derivation).
2) To validate the accuracy of the ML method using the partially synthetic CEST data to predict APT signals, we applied it to process the tissue-mimicking data and compared the results with the conventional multiple-pool Lorentzian fitted results. First, from a few randomly selected Z-spectra within the tissue-mimicking data, we obtained the multiple-pool Lorentzian fitted components and . These fitted components, along with simulated , and , were used to generate the partially synthetic data. This data is called partially synthetic data generated with measured components from tissue-mimicking data to distinguish it from the partially synthetic data generated with measured components from in vivo data. The target CESTR quantified APT spectrum was obtained by the subtraction of two partially synthetic CEST Z-spectra with/without the amide pool using Eq. (5). Next, we trained the ML model using this partially synthetic data. The trained ML model was then applied to predict the CESTR quantified APT spectra for 1000 testing samples of tissue-mimicking data. We obtained the average difference between the ML-predicted APT spectra and the ground truth APT spectra of the tissue-mimicking data within the frequency range of 2ppm to 5ppm. This average difference (i.e., loss) served as a measure of the ML model’s accuracy. Similarly, we obtained the average difference between the multiple-pool Lorentzian fitted APT spectra and the ground truth APT spectra from the tissue-mimicking data. By comparing these average differences from these two methods, we were able to assess whether the ML approach using partially synthetic data outperforms the multiple-pool Lorentzian fit.
3) To evaluate the advantage of the ML method using the partially synthetic CEST data compared to using the in vivo data alone, we conducted two types of comparisons. In type 1, from an average of randomly selected Z-spectra from a varying number of voxels (i.e, 50, 100, 500 voxels) and of all voxels (2157 voxels) within the measured data from five rats, we obtained the multiple-pool Lorentzian fitted components and . Using these components, we generated partially synthetic data and trained the ML model on these data. We then applied the model to predict the APT spectrum on the other three rats. Additionally, we directly trained the ML model on in vivo Z-spectra from the corresponding voxels to those used in creating the partially synthetic data, using the multiple-pool Lorentzian fitted APT spectrum for each voxel as the target. We also augmented the training data created from all voxels of the five rat brains (2157 voxels) by averaging each pair of Z-spectra, resulting in an expansion of the dataset to nearly 2000 times its original size. The multiple-pool Lorentzian fitted APT spectra from the augmented Z spectra were used as targets. This trained ML model was employed to predict APT spectra from the other three rats. This type 1 comparison can also demonstrate the model’s generalizability to other subjects. In type 2, from the averaged Z-spectra from all normal tissues or all tumor tissues of the eight rats, we obtained the multiple-pool Lorentzian fitted components and and generate the partially synthetic data. Additionally, we trained the ML model on all measured Z-spectra from all voxels in either tumors or normal tissues, using the multiple-pool Lorentzian fitted APT spectrum for each voxel as the target. Furthermore, we augmented the training data created from all voxels in normal tissues (1923 voxels) or in tumor tissues (491 voxels) of the eight rats by averaging each pair of Z-spectra, resulting in an expansion of the dataset to nearly 2000 or 500 times its original size respectively. The target was created by using the multiple pool Lorentzian fitted APT spectrum from each of the augmented Z-spectrum. We then applied our NN model to process the data from other parts of the rat brains. This type 2 comparison can also show the model’s generalizability to other pathologies.
4) To evaluate the advantage of the ML method using the partially synthetic CEST data compared to using fully synthetic data, we also conducted two types of comparisons. In both these two types, the multiple-pool Lorentzian fitted components and were obtained from the average of 50 randomly selected measured Z-spectra, which were then used to create partial synthetic data. In type 1, the fully synthetic data were generated with the same sample size as the partially synthetic data for fair comparison. Specifically, , , and T2 for water, amide, and NOE(−3.5) pools were set the same as those used in creating the partial synthetic data. The fs of amine and MT were set to be and , respectively. In type 2, the fully synthetic data were generated to have three times the size of the partially synthetic data by adding more simulations with varied of guanidine, as detailed in Supporting information Table S5.
Machine Learning (ML)
Fig. 1 shows a flowchart of the machine learning workflow, along with the utilized neural network (NN). The input data consisted of the Z-spectra with ranging from −10ppm to −5ppm, −0.5ppm to 0.5ppm, and 2.5ppm to 10ppm at 9.4T (39 data points). The data from −5ppm to −0.5ppm and 0.5ppm to 2.5ppm were excluded since their impact on the CEST signal at 3.5ppm is negligible. The output data consisted of the amplitude (A) and width (W) of the target amide CESTR peak. A was calculated by setting to in the amide CESTR peak, and W was obtained as the full width at half maximum (FWHM) of the amide CESTR peak. The ML-predicted APT spectrum was generated by applying a Lorentzian function based on the predicted A and W (25). An NN model with two dense layers, each comprising 100 neurons and an output layer with two nodes was employed to predict the A and W values. The rectified linear unit activation (ReLU) activation function applied after each dense layer and mean square error (MSE) loss function were utilized. The NN model was trained on four types of data - partially synthetic data with measured components from the tissue-mimicking data, partially synthetic data with measured components from in vivo data, in vivo data, and the fully synthetic data. For the training using partially synthetic data and the fully synthetic data, gaussian noises with a standard deviation of 0.01 were added to the Z-spectra to simulate real tissue signals. The NN model was implemented in MATLAB R2022a using the Adam optimizer for 4000 epochs, with a learning rate of 1x10−3 and a batch size of 64. Supporting Information Fig. S2, shows the training and validation loss versus the number of epochs. Training was stopped when validation loss began to increase, while training loss continued to decrease. The training took ~3h while using partially synthetic and fully synthetic data and ~20 mins for in vivo data without augmentation. The ML prediction of the APT map for a rat brain took approximately 1.6 seconds on a Dell System with Intel(R) Xeon(R) CPU E5-1607 v2 3.00 GHz processor.
Fig. 1.

Flowchart of the machine learning method.
Statistics
The tumor ROIs were outlined based on the map, with values less than a threshold of 7%. The contralateral normal tissue ROIs were chosen to mirror the tumor ROIs. Student’s t-tests were employed to compare the ROI-averaged signals. We considered P < 0.05 to be statistically significant. The normal tissue ROIs, used for selecting training data, were chosen by subtracting the whole brain ROI from the tumor ROI.
RESULTS
Validation of the feasibility to generate partially synthetic CEST data using Eq. (5)
Figs. 2a and 2b show the multiple-pool Lorentzian fitted , , , , and spectra from the averaged Z-spectrum of all normal tissues. Fig. 2c shows a comparison of the synthetic Z-spectrum generated using Eq. (5) with the original measured Z-spectrum. The close resemblance between the synthetic and measured Z-spectra indicates the feasibility of constructing CEST signals using different multiple-pool Lorentzian fitted components according to Eq. (5). Fig. 2d presents a comparison between the measured Z-spectrum and the synthetic Z-spectrum without the APT effect. The discrepancy between the two spectra represents the APT contribution. Thus, by substituting the multiple-pool Lorentzian fitted APT components with simulated APT components, partially synthetic CEST data can be generated. The huge peak at 0ppm in Fig. 2b is due to the inverse of a very small value at 0ppm. Supporting information Fig. S1 shows these spectra from all tumors, showing close resemblance between the synthetic and measured Z-spectra as well.
Fig. 2.

Multiple-pool Lorentzian fitted (a) and fitted APT, amine CEST, NOE(−1.6), NOE(−3.5), and MT spectra (b) from the average of the measured CEST Z-spectra in normal tissues in eight rat brains. Comparison between the measured CEST Z-spectra and synthetic Z-spectrum generated using all multiple-pool Lorentzian fitted components (c) as well as between the measured CEST Z-spectra and synthetic CEST Z-spectrum without APT (d).
Validation of the accuracy of the ML method using partially synthetic CEST data generated with measured components from the tissue-mimicking data.
Fig. 3a displays a representative Z-spectrum from the tissue-mimicking data. Fig. 3b presents a comparison of the corresponding APT spectra from the ML prediction, multiple-pool Lorentzian fit, and the ground truth. This ML model was trained on partially synthetic data, with the measured components fitted from one randomly selected Z-spectrum within tissue-mimicking data. It can be observed that the ML predicted APT spectrum closely resembles the ground truth, whereas the Lorentzian fitted APT spectrum exhibits a difference. Fig. 3c illustrates the comparison of losses between the ML method and the multiple-pool Lorentzian fit for all testing samples. The results indicate that the ML method accurately predicts the APT spectra of all testing samples, with a mean loss less than 8.9x10−4. In contrast, the multiple-pool Lorentzian fit shows significant bias in some of the testing samples. Supporting information Fig. S3 compares losses between the ML method and the multiple-pool Lorentzian fit, with the measured components used to generate the partially synthetic data, fitted from ten other randomly selected Z-spectra, as well as from the average of 50, 100, 500, and 1000 randomly selected Z-spectra, within the tissue-mimicking data. The results consistently demonstrate the accuracy of the ML method using partially synthetic data in predicting APT, while the multiple-pool Lorentzian fit exhibits overestimation in some of the testing samples.
Fig. 3.

(a) A representative Z-spectrum from the tissue-mimicking data. (b) A comparison of the corresponding APT spectra from the ML prediction, multiple-pool Lorentzian fit, and ground truth. (c) A comparison of losses between the ML prediction and the multiple-pool Lorentzian fit for all testing data.
Comparison of the ML model trained on partially synthetic CEST data generated with measured components from in vivo data, directly on the in vivo data, and on fully synthetic CEST data.
Fig. 4 displays the average CEST Z-spectra from the three testing rats, and the corresponding average APT spectra from the multiple-pool Lorentzian fit, ML prediction using the partially synthetic data and the in vivo data with the type 1 selection as well as using the in vivo data with data augmentation. Fig. 5 shows the average CEST Z-spectra from the eight rats, and the corresponding averaged APT spectra from the multiple-pool Lorentzian fit, ML prediction using the partially synthetic data and the in vivo data with the type 2 selection, as well as ML prediction using fully synthetic data with the type 1 and type 2 simulations. Supporting information Fig. S4 and S5 show these plots in Fig. 4 and 5, respectively, with the standard deviation across subjects. The ML predicted APT spectra, trained using all types of partially synthetic data, are roughly consistent, suggesting that our method does not depend on the size of samples for extracting the measured components. In addition, the amplitude is lower than the multiple-pool Lorentzian fitted APT spectra, due to the overestimation of APT using the multiple-pool Lorentzian fit as shown in Fig. 3. In contrast, the ML-predicted APT spectra trained using all types of in vivo data is not consistent. Generally, for training the ML using in vivo data, the higher the voxel count, the closer is predicted APT spectra to the multiple-pool Lorentzian fit, indicating that this type of training requires a larger dataset. The ML-predicted APT spectra trained using all types of fully synthetic data also shows significant deviations from the multiple-pool Lorentzian fitted APT spectra. Specifically, type 1 simulations yield much lower ML-predicted APT spectra, while type 2 simulations yield much higher ML-predicted APT spectra compared to the multiple-pool Lorentzian fitted APT spectra. These findings suggest that training with fully synthetic data heavily relies on the simulation models employed. Supporting information Fig. S6 and Fig. S7 display the pixel-wise regression plots between the multiple-pool Lorentzian fitted APT amplitude and the predicted APT amplitude from all the ML models presented in Fig. 4 and Fig. 5, respectively. Notably, while the R2 values for the training using the partially synthetic data are very high (>0.98), indicating successful training, the R2 values for the training using in vivo data and fully synthetic data are considerably lower, indicating unsuccessful training. Additionally, the R2 values gradually increase (0.596, 0.617, 0.633, 0.701, and 0.859) for ML using in vivo data with type 1 selection of 50, 100, 500, 2157 voxels and with data augmentation, as well as increase from 0.776 and 0.831 to 0.903 and 0.958 for ML with type 2 selection of tumors and normal tissues after using data augmentation, confirming that it rely on significantly larger training data to achieve higher predictive accuracy.
Fig. 4.

Measured CEST Z-spectra from tumors and contralateral normal tissues in the three testing rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains (c, d), and the corresponding APT spectra from the ML prediction using the measured in vivo data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains as well as with data augmentation (e, f). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-f) for comparison. Data are averaged across different subjects.
Fig. 5.

Measured CEST Z-spectra from tumors and contralateral normal tissues in the eight rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 2 selection of tumors and normal tissues (c, d), the corresponding APT spectra from the ML prediction using the measured in vivo data for type 2 selection of tumors and normal tissues (e, f), and ML prediction using fully synthetic data with the type 1 and type 2 simulations (g, h). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-h) for comparison. Data are averaged across different subjects.
Fig. 6 shows the APT amplitude maps from the three testing rat brains using the Lorentzian fitting and ML prediction. For ML prediction, partially synthetic data and in vivo data with type 1 selection are used, along with in vivo data with data augmentation. Fig. 7 shows the APT amplitude maps from the one rat brain using the Lorentzian fitting and ML prediction using partially synthetic data, in vivo data with the type 2 selection, in vivo data with the type 2 selection with data augmentation, as well as ML prediction using fully synthetic data with the type 1 and type 2 simulations. Supporting information Fig. S8 shows the APT amplitude maps from the rest seven rat brains. Supporting information Fig. S9 shows the anatomy images from the eight rat brains. The ML-predicted APT map, trained using all types of partially synthetic data, demonstrates good image quality. However, ML prediction using in vivo data and fully synthetic data does not yield satisfactory image quality in terms of the contrast between tumor and normal tissues when compared to the anatomy images. Notably, the ML-predicted APT map trained using in vivo data with data augmentation exhibits good image quality but with reduced contrast between tumor and normal tissues. This suggests that even with augmentation, some features may not be adequately captured in the training data.
Fig. 6.

APT amplitude maps from three testing rat brains (from left to right – Rat 6, Rat 7, Rat 8) using the Lorentzian fitting (a), ML prediction using partially synthetic data with the measured components from the in vivo data with type 1 selection of 50 (b), 100 (c), 500 (d), and 2157 (e) voxels from five rat brains, ML prediction using the measured in vivo data with type 1 selection of 50 (f), 100 (g), 500 (h), and 2157 (i) voxels from five rat brains as well as ML prediction using the measured in vivo data using all voxels from five rat brains with data augmentation (j).
Fig. 7.

APT amplitude maps from the one rat brain (Rat 1) using the Lorentzian fitting (a), ML prediction using partially synthetic data with the measured components from the in vivo data with type 2 selection of tumors (b) and normal tissues (c), ML prediction using the measured in vivo data with type 2 selection of tumors (d) and normal tissues (e), ML prediction using the measured in vivo data with type 2 selection of tumors with data augmentation (f) and normal tissues with data augmentation (g), as well as ML prediction using fully synthetic data with the type 1 (h) and type 2 (i) simulations.
Evaluation of the ML predicted APT imaging to detect tumors
In Fig. 8, the statistical difference in the , , the ML predicted APT amplitude using partially synthetic data with measured components from the type 1 selection of 50 voxels, and multiple-pool Lorentzian fitted APT amplitude are presented. All these parameters exhibit significant differences between tumor and contralateral normal tissues. Tumors display significantly higher APT amplitudes compared to contralateral normal tissues, which is consistent with previous findings (6,7). Notably, the multiple-pool Lorentzian fitted APT amplitudes from both tumor and contralateral normal tissues are higher than those obtained from the ML prediction using the partially synthetic data, which is due to the overestimation of APT effect using the fitting approach as shown in Fig.3.
Fig. 8.

Statistical differences in (a), (b), the ML predicted APT amplitude using partially synthetic data with measured components from the type 1 selection of 50 voxels (c), and the multiple-pool Lorentzian fitted APT amplitude, between tumor and normal tissues in the three testing rat brains.
DISCUSSION
We have introduced a novel platform for generating partially synthetic CEST data by combining simulated components with measured components. Utilizing this approach in machine learning offers several advantages: 1) The ability to create training data with any sample parameters (e.g., APT/NOE , and , the amplitude of amine CEST and MT effects) by adjusting the parameters of the simulated components and scaling factors of the measured components. This addresses the constraint of limited training data when using only measured data in machine learning. 2) By incorporating the measured components, our approach maintains high simulation fidelity. In biological tissues, there are multiple amines from 2-3ppm and hydroxyls below 1ppm with broad ranges of exchange rate (50,51,58–62). Simulating all these pools accurately is challenging due to the unknown parameter ranges. However, the average contributions of these pools can be roughly reflected by the fitted , as shown in Fig. 2b. Additionally, the broad nature of MT pool makes it difficult to estimate its peak width and offset accurately, but these sample parameters do not need to be simulated when using the fitted spectrum. This addresses the challenge of inaccurate simulation models when using fully synthetic data in machine learning. 3) The partially synthetic data provides high-quality ground truth data, ensuring reliable training. In contrast, the training using Z-spectra from each voxel to obtain ground truth data suffers from the low SNR. In addition, the use of the multiple-pool Lorentzian fit to obtain ground truth data may have bias. Supporting information Fig. S10 demonstrates a correlation between the amine and the loss between the multiple-pool Lorentzian fit and the ground truth, indicating the influence of the amine CEST effect on the accuracy of the multiple-pool Lorentzian fit. This finding aligns with our previous study showing that the multiple-pool Lorentzian fit performs better when there are no amine pools (56).
It should be noted that we utilized the line shape information of the amine and MT pool to create the partially synthetic data, but not their amplitudes. The amplitude information is compensated for by the scaling factor, ensuring that even if the amplitude is not accurately fitted using the multiple-pool model Lorentzian fit, it does not affect the accuracy of the ML model trained on this partially synthetic data. We did not vary the width of the and peaks when creating the training data in this paper. However, it is theoretically possible to adjust the widths by varying their Lorentzian fitted widths, which would enable the creation of more training data covering a broader range of variations in different pathologies. Although we did not explore this approach in our study, we conducted validation using tissue-mimicking data that contained varied widths of the amine and MT effects. We found that our method successfully predicted all the testing tissue-mimicking data, indicating its robustness and effectiveness even with varying widths of the amine and MT peaks. The in Eq. (5) can also be obtained from the multiple-pool Lorentzian fitted by tuning the peak amplitude and width. Here, we used Eq. (3) to calculate because it is more directly related to the underlying exchange parameters, so it is easier to determine the range of the simulation parameters by searching the literature.
To evaluate the robustness of the NN trained on the partially synthetic data, we reproduced Fig. 3 with varying levels of noise in supporting information Fig. S11. It can be found that our method shows accurate prediction even with low-SNR data.
As a proof of concept, the new platform to create the synthetic CEST data using Eq. (5) is based on the acquisition of steady-state signals. Theoretically, the method can be extended to create non-steady-state synthetic CEST signal using (46,63),
| (8) |
where, tp is the total saturation time, and is the summation of all components in their rotating frame (i.e., the denominator in Eq. (5)). The multiple-pool Lorentzian fitted components can be obtained from a baseline steady-state acquisition in one or a few subjects. Then, together with the simulated components and Eq. (8), the platform can be extended to non-steady state conditions. In human brains, different tissues such as white matter, gray matter, and cerebrospinal fluid may exhibit different features of the fitted amine and MT effects in proximity to the amide offset. In this case, inclusion of and from these tissues into the training data, albeit from a limited number of subjects, can encompass all possible features.
In this paper, we predicted the CESTR quantified APT effect, instead of the AREX quantified APT effect, since the CESTR has been widely used previously although the mechanisms of these two quantification metrics is still not yet fully understood. To predict the AREX quantified APT, similar training methods can be also used.
CONCLUSION
The utilization of partially synthetic data can effectively address the challenges of limited data availability and the absence of high-quality ground truth data in ML methods using measured data. It also provides enhanced fidelity and robustness compared to using fully synthetic data.
Supplementary Material
Supporting information Fig. S1 Multiple-pool Lorentzian fitted Reff (a) and fitted R_ex^APT, R_ex^amines, R_ex^NOE(−1.6) , R_ex^NOE(−3.5) , R_ex^MT spectra (b) from the average of the measured CEST Z-spectra in tumors in eight rat brains. Comparison between the measured CEST Z-spectra and synthetic Z-spectrum generated using all multiple-pool Lorentzian fitted components (c) as well as between the measured CEST Z-spectra and synthetic CEST Z-spectrum without APT (d).
Supporting information Fig. S2 Heatmap showing the mean squared error (MSE) loss versus the number of epochs for the training and validation using different model configurations. The solid line shows the training loss, while the dashed lines indicate the validation loss. The optimal training was found with 2 layers and 100 neurons with 4000 epochs. The learning rate was kept constant at 1×10−3 with Adam optimizer and ReLU activation was used.
Supporting information Fig. S3 Comparison of losses between the ML method and the multiple-pool Lorentzian fit for all testing data. ML models were trained on fourteen partially synthetic datasets, with the measured components fitted from ten other randomly selected Z-spectra (a-j), as well as the average of 50 (k), 100 (l), 500 (m), and 1000 (n) randomly selected Z-spectra, within the tissue-mimicking data. The mean losses are 7.2148×10-4, 7.2766×10-4, 7.2757×10-4, 7.0909×10-4, and 7.0852×10-4 in (a-j), (k), (l), (m), and (n), respectively.
Supporting information Fig. S4 Measured CEST Z-spectra from tumors and contralateral normal tissues in the three testing rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains (c, d), and the corresponding APT spectra from the ML prediction using the measured in vivo data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains as well as with data augmentation (e, f). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-f) for comparison.
Supporting information Fig. S5 Measured CEST Z-spectra from tumors and contralateral normal tissues in the eight rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 2 selection of tumors and normal tissues (c, d), the corresponding APT spectra from the ML prediction using the measured in vivo data for type 2 selection of tumors and normal tissues (e, f), and ML prediction using fully synthetic data with the type 1 and type 2 simulations (g, h). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-h) for comparison.
Supporting information Fig. S6 Pixel-wise regression plots between the multiple-pool Lorentzian fitted APT amplitude and the predicted APT amplitude from all the ML models trained using the partially synthetic data for type 1 selection of 50 (a), 100 (b), 500 (c), and 2157 (d) voxels within five rat brains, using the measured in vivo data for type 1 selection of 50 (e), 100 (f), 500 (g), and 2157 (h) voxels within five rat brains as well as with data augmentation (i).
Supporting information Fig. S7 Pixel-wise regression plots between the multiple-pool Lorentzian fitted APT amplitude and the predicted APT amplitude from all the ML models trained using the partially synthetic data for type 2 selection of tumors and normal tissues (a, b), using the measured in vivo data for type 2 selection of tumors and normal tissues (c, d), using the measured in vivo data for type 2 selection of tumors with data augmentation and normal tissues with data augmentation (e, f), and using fully synthetic data with the type 1 and type 2 simulations (g, h).
Supporting information Fig. S8 APT amplitude maps from the seven rat brains (from left to right, Rat 2, Rat 3, Rat 4, Rat 5, Rat 6. Rat 7, Rat 8) using the Lorentzian fitting (a), ML prediction using partially synthetic data with the measured components from the in vivo data with type 2 selection of tumors (b) and normal tissues (c), ML prediction using the measured in vivo data with type 2 selection of tumors (d) and normal tissues (e), ML prediction using the measured in vivo data with type 2 selection of tumors with data augmentation (f) and normal tissues with data augmentation (g), as well as ML prediction using fully synthetic data with the type 1 (h) and type 2 (i) simulations.
Supporting information Fig. S9 T2-weighted anatomy images from the eight rat brains. From left to right, Row 1 shows anatomy images for Rat 1 to Rat 4 respectively, and Row 2 shows the anatomy images for Rat 5 to Rat 8 respectively.
Supporting information Fig. S10 Scatter plots of losses from the ML method using the partially synthetic data (a) and from the multiple-pool Lorentzian fit (b) of all testing tissue-mimicking data vs. amine . The partially synthetic data were obtained with the measured components fitted from a randomly selected Z-spectrum within the tissue-mimicking data.
Supporting information Fig. S11 Left column: a representative Z-spectrum from the tissue-mimicking data (clean) together with a few noisy Z-spectra with SNR of 200 (a), 150 (b), 100 (c), 50 (d). Middle column: A comparison of the corresponding APT spectra from the ML prediction, multiple-pool Lorentzian fit, and ground truth. Right column: A comparison of the corresponding losses between the ML prediction and the multiple-pool Lorentzian fit for all testing data.
Supporting information Table S1. Starting points and boundaries of the amplitude, width, and offset of all pools in the Lorentzian fit. The unit of peak width and offset is ppm.
Supporting information Table S2. Description of the parameters used.
Supporting information Table S3. List of all sample parameters used to create the partially synthetic CEST data using Eq. (3) and Eq. (5). Since the NOE(−1.6) effect is far from the APT effect, it was not included in creating the synthetic training data.
Supporting information Table S4. List of all sample parameters used to create the tissue-mimicking data using numerical simulation of the Bloch-McConnell equation.
Supporting information Table S5. List of all sample parameters used to create the fully synthetic CEST data for type 1 and type 2 simulations. For type 1 simulations, the fs of guanidine is fixed at 0.03%.
Acknowledgments
Grant Sponsor: R21 AR074261, R03 EB029078, R01 EB029443
DATA AVAILABILITY STATEMENT
The code for simulations that support the findings of this study are openly available in CESTLabZu at https://github.com/CESTlabZu/MachineLearningbasedAPTImaging. The in vivo data used in this study are available on request from the corresponding author upon request.
REFERENCES
- 1.Zhou JY, van Zijl PCM. Chemical exchange saturation transfer imaging and spectroscopy. Prog Nucl Mag Res Sp 2006;48(2-3):109–136. [Google Scholar]
- 2.van Zijl PCM, Yadav NN. Chemical Exchange Saturation Transfer (CEST): What is in a Name and What Isn’t? Magnetic Resonance In Medicine 2011;65(4):927–948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kim J, Wu Y, Guo Y, Zheng H, Sun PZ. A review of optimization and quantification techniques for chemical exchange saturation transfer MRI toward sensitive in vivo imaging. Contrast media & molecular imaging 2015;10(3):163–178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wu B, Warnock G, Zaiss M, Lin C, Chen M, Zhou Z, Mu L, Nanz D, Tuura R, Delso G. An overview of CEST MRI for non-MR physicists. Ejnmmi Phys 2016;3(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.van Zijl PCM, Lam WW, Xu JD, Knutsson L, Stanisz GJ. Magnetization Transfer Contrast and Chemical Exchange Saturation Transfer MRI. Features and analysis of the field-dependent saturation spectrum. Neuroimage 2018;168:222–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhou J, Payen JF, Wilson DA, Traystman RJ, van Zijl PC. Using the amide proton signals of intracellular proteins and peptides to detect pH effects in MRI. Nature medicine 2003;9(8):1085–1090. [DOI] [PubMed] [Google Scholar]
- 7.Zhou JY, Heo HY, Knutsson L, van Zijl PCM, Jiang SS. APT-weighted MRI: Techniques, current neuro applications, and challenging issues. Journal Of Magnetic Resonance Imaging 2019;50(2):347–364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jiang SS, Eberhart CG, Zhang Y, Heo HY, Wen ZB, Blair L, Qin HM, Lim M, Quinones-Hinojosa A, Weingart JD, Barker PB, Pomper MG, Laterra J, van Zijl PCM, Blakeley JO, Zhou JY. Amide proton transfer-weighted magnetic resonance image-guided stereotactic biopsy in patients with newly diagnosed gliomas. Eur J Cancer 2017;83:9–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Choi YS, Ahn SS, Lee SK, Chang JH, Kang SG, Kim SH, Zhou JY. Amide proton transfer imaging to discriminate between low- and high-grade gliomas: added value to apparent diffusion coefficient and relative cerebral blood volume. Eur Radiol 2017;27(8):3181–3189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Togao O, Hiwatashi A, Yamashita K, Kikuchi K, Keupp J, Yoshimoto K, Kuga D, Yoneyama M, Suzuki SO, Iwaki T, Takahashi M, Iihara K, Honda H. Grading diffuse gliomas without intense contrast enhancement by amide proton transfer MR imaging: comparisons with diffusion- and perfusion-weighted imaging. Eur Radiol 2017;27(2):578–588. [DOI] [PubMed] [Google Scholar]
- 11.Jiang SS, Eberhart CG, Lim M, Heo HY, Zhang Y, Blair L, Wen ZB, Holdhoff M, Lin D, Huang P, Qin HM, Quinones-Hinojosa A, Weingart JD, Barker PB, Pomper MG, Laterra J, van Zijl PCM, Blakeley JO, Zhou JY. Identifying Recurrent Malignant Glioma after Treatment Using Amide Proton Transfer-Weighted MR Imaging: A Validation Study with Image-Guided Stereotactic Biopsy. Clin Cancer Res 2019;25(2):552–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sun PZ, Zhou J, Sun W, Huang J, van Zijl PC. Detection of the ischemic penumbra using pH-weighted MRI. Journal of Cerebral Blood Flow & Metabolism 2007;27(6):1129–1136. [DOI] [PubMed] [Google Scholar]
- 13.Harston GW, Tee YK, Blockley N, Okell TW, Thandeswaran S, Shaya G, Sheerin F, Cellerini M, Payne S, Jezzard P, Chappell M, Kennedy J. Identifying the ischaemic penumbra using pH-weighted magnetic resonance imaging. Brain : a journal of neurology 2015;138(Pt 1):36–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Heo HY, Zhang Y, Burton TM, Jiang SS, Zhao YS, van Zijl PCM, Leigh R, Zhou JY. Improving the detection sensitivity of pH-weighted amide proton transfer MRI in acute stroke patients using extrapolated semisolid magnetization transfer reference signals. Magnetic Resonance In Medicine 2017;78(3):871–880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Dula AN, Asche EM, Landman BA, Welch EB, Pawate S, Sriram S, Gore JC, Smith SA. Development of Chemical Exchange Saturation Transfer at 7T. Magnetic Resonance In Medicine 2011;66(3):831–838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.By S, Barry RL, Smith AK, Lyttle BD, Box BA, Bagnato FR, Pawate S, Smith SA. Amide proton transfer CEST of the cervical spinal cord in multiple sclerosis patients at 3T. Magnetic Resonance In Medicine 2018;79(2):806–814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Zhang H, Wang WZ, Jiang SS, Zhang Y, Heo HY, Wang XL, Peng Y, Wang J, Zhou JY. Amide proton transfer-weighted MRI detection of traumatic brain injury in rats. J Cerebr Blood F Met 2017;37(10):3422–3432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wang R, Li SY, Chen M, Zhou JY, Peng DT, Zhang C, Dai YM. Amide Proton Transfer Magnetic Resonance Imaging of Alzheimer’s Disease at 3.0 Tesla: A Preliminary Study. Chinese Med J-Peking 2015;128(5):615–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wang RR, Chen PD, Shen ZW, Lin GS, Xiao G, Dai ZZ, Zhang BN, Chen YF, Lai LH, Zong XD, Li Y, Tang YY, Wu RH. Brain Amide Proton Transfer Imaging of Rat With Alzheimer’s Disease Using Saturation With Frequency Alternating RF Irradiation Method. Front Aging Neurosci 2019;11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang ZW, Zhang CQ, Yao J, Gao F, Gong T, Jiang SS, Chen WB, Zhou JY, Wang GB. Amide proton transfer-weighted magnetic resonance imaging o human brain aging at 3 Tesla. Quant Imag Med Surg 2020;10(3):727-+. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhang ZW, Zhang CQ, Yao J, Chen X, Gao F, Jiang SS, Chen WB, Zhou JY, Wang GB. Protein-based amide proton transfer-weighted MR imaging of amnestic mild cognitive impairment. Neuroimage-Clin 2020;25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Guo ZX, Jiang YC, Qin XY, Mu RH, Meng ZN, Zhuang ZY, Liu FZ, Zhu XQ. Amide Proton Transfer-Weighted MRI Might Help Distinguish Amnestic Mild Cognitive Impairment From a Normal Elderly Population. Front Neurol 2021;12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Li CM, Peng S, Wang R, Chen HB, Su W, Zhao XN, Zhou JY, Chen M. Chemical exchange saturation transfer MR imaging of Parkinson’s disease at 3 Tesla. Eur Radiol 2014;24(10):2631–2639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Li CM, Wang R, Chen HB, Su W, Li SH, Zhao XN, Zhou JY, Qiao J, Lou BH, Song GD, Chen M. Chemical exchange saturation transfer MR imaging is superior to diffusion-tensor imaging in the diagnosis and severity evaluation of Parkinson’s disease: a study on substantia nigra and striatum. Front Aging Neurosci 2015;7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Glang F, Deshmane A, Prokudin S, Martin F, Herz K, Lindig T, Bender B, Scheffler K, Zaiss M. DeepCEST 3T: Robust MRI parameter determination and uncertainty quantification with neural networks-application to CEST imaging of the human brain at 3T. Magnetic Resonance In Medicine 2020;84(1):450–466. [DOI] [PubMed] [Google Scholar]
- 26.Kim B, Schar M, Park H, Heo HY. A deep learning approach for magnetization transfer contrast MR fingerprinting and chemical exchange saturation transfer imaging. Neuroimage 2020;221. [DOI] [PubMed] [Google Scholar]
- 27.Huang JP, Lai JHC, Tse KH, Cheng GWY, Liu Y, Chen ZL, Han XQ, Chen L, Xu JD, Chan KWY. Deep neural network based CEST and AREX processing: Application in imaging a model of Alzheimer’s disease at 3 T. Magnetic Resonance In Medicine 2022;87(3):1529–1545. [DOI] [PubMed] [Google Scholar]
- 28.Xu JZ, Zaiss M, Zu ZL, Li H, Xie JP, Gochberg DF, Bachert P, Gore JC. On the origins of chemical exchange saturation transfer ( CEST) contrast in tumors at 9.4 T. NMR in biomedicine 2014;27(4):406–416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhang XY, Wang F, Li H, Xu JZ, Gochberg DF, Gore JC, Zu ZL. CEST imaging of fast exchanging amine pools with corrections for competing effects at 9.4 T. NMR in biomedicine 2017;30(7). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhang XY, Wang F, Xu JZ, Gochberg DF, Gore JC, Zu ZL. Increased CEST specificity for amide and fast-exchanging amine protons using exchange-dependent relaxation rate. NMR in biomedicine 2018;31(2). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Jones CK, Huang A, Xu JD, Edden RAE, Schar M, Hua J, Oskolkov N, Zaca D, Zhou JY, McMahon MT, Pillai JJ, van Zijl PCM. Nuclear Overhauser enhancement (NOE) imaging in the human brain at 7 T. Neuroimage 2013;77(0):114–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Frangi AF, Tsaftaris SA, Prince JL. Simulation and Synthesis in Medical Imaging. Ieee T Med Imaging 2018;37(3):673–679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chen RJ, Lu MY, Chen TY, Williamson DFK, Mahmood F. Synthetic data in machine learning for medicine and healthcare. Nat Biomed Eng 2021;5(6):493–497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cohen O, Zhu B, Rosen MS. MR fingerprinting Deep RecOnstruction NEtwork (DRONE). Magnetic Resonance In Medicine 2018;80(3):885–894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Perlman O, Ito H, Herz K, Shono N, Nakashima H, Zaiss M, Chiocca EA, Cohen O, Rosen MS, Farrar CT. Quantitative imaging of apoptosis following oncolytic virotherapy by magnetic resonance fingerprinting aided by deep learning. Nat Biomed Eng 2022;6(5):648–657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Cai CB, Wang C, Zeng YQ, Cai SH, Liang D, Wu YW, Chen Z, Ding XH, Zhong JH. Single-shot T-2 mapping using overlapping-echo detachment planar imaging and a deep convolutional neural network. Magnetic Resonance In Medicine 2018;80(5):2202–2214. [DOI] [PubMed] [Google Scholar]
- 37.Zhang J, Wu J, Chen SJ, Zhang ZY, Cai SH, Cai CB, Chen Z. Robust Single-Shot T-2 Mapping via Multiple Overlapping-Echo Acquisition and Deep Neural Network. Ieee T Med Imaging 2019;38(8):1801–1811. [DOI] [PubMed] [Google Scholar]
- 38.Li SM, Wu J, Ma LC, Cai SH, Cai CB. A simultaneous multi-slice T-2 mapping framework based on overlapping-echo detachment planar imaging and deep learning reconstruction. Magnetic Resonance In Medicine 2022;87(5):2239–2253. [DOI] [PubMed] [Google Scholar]
- 39.Gavazzi S, van den Berg CAT, Savenije MHF, Kok HP, de Boer P, Stalpers LJA, Lagendijk JJW, Crezee H, van Lier ALHMW. Deep learning-based reconstruction of in vivo pelvis conductivity with a 3D patch-based convolutional neural network trained on simulated MR data. Magnetic Resonance In Medicine 2020;84(5):2772–2787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bollmann S, Rasmussen KGB, Kristensen M, Blendal RG, Ostergaard LR, Plocharski M, O’Brien K, Langkammer C, Janke A, Barth M. DeepQSM - using deep learning to solve the dipole inversion for quantitative susceptibility mapping. Neuroimage 2019;195:373–383. [DOI] [PubMed] [Google Scholar]
- 41.Jung W, Yoon J, Choi JY, Kim JM, Nam Y, Kim EY, Lee J, Ji S. Exploring linearity of deep neural network trained QSM: QSMnet. Neuroimage 2020;211. [DOI] [PubMed] [Google Scholar]
- 42.Della Maggiora G, Castillo-Passi C, Qiu WQ, Liu S, Milovic C, Sekino M, Tejos C, Uribe S, Irarrazaval P. DeepSPIO: Super Paramagnetic Iron Oxide Particle Quantification Using Deep Learning in Magnetic Resonance Imaging. Ieee T Pattern Anal 2022;44(1):143–153. [DOI] [PubMed] [Google Scholar]
- 43.Jung S, Lee H, Ryu K, Song JE, Park M, Moon WJ, Kim DH. Artificial neural network for multi-echo gradient echo-based myelin water fraction estimation. Magnetic Resonance In Medicine 2021;85(1):394–403. [DOI] [PubMed] [Google Scholar]
- 44.Gochberg DF, Gore JC. Quantitative magnetization transfer imaging via selective inversion recovery with short repetition times. Magnetic Resonance In Medicine 2007;57(2):437–441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zaiss M, Zu ZL, Xu JZ, Schuenke P, Gochberg DF, Gore JC, Ladd ME, Bachert P. A combined analytical solution for chemical exchange saturation transfer and semi-solid magnetization transfer. NMR in biomedicine 2015;28(2):217–230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zaiss M, Bachert P. Exchange-dependent relaxation in the rotating frame for slow and intermediate exchange - modeling off-resonant spin-lock and chemical exchange saturation transfer. NMR in biomedicine 2013;26(5):507–518. [DOI] [PubMed] [Google Scholar]
- 47.Cai K, Haris M, Singh A, Kogan F, Greenberg JH, Hariharan H, Detre JA, Reddy R. Magnetic resonance imaging of glutamate. Nature medicine 2012;18(2):302–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Sun C, Zhao Y, Zu Z. Validation of the presence of fast exchanging amine CEST effect at low saturation powers and its influence on the quantification of APT. Magnetic Resonance in Medicine 2023;DOI: 10.1002/mrm.29742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Cai K, Singh A, Poptani H, Li W, Yang S, Lu Y, Hariharan H, Zhou XJ, Reddy R. CEST signal at 2ppm (CEST@2ppm) from Z-spectral fitting correlates with creatine distribution in brain tumor. NMR in biomedicine 2015;28(1):1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zhang XY, Xie JP, Wang F, Lin EC, Xu JZ, Gochberg DF, Gore JC, Zu ZL. Assignment of the molecular origins of CEST signals at 2ppm in rat brain. Magnetic Resonance In Medicine 2017;78(3):881–887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Haris M, Cai KJ, Singh A, Hariharan H, Reddy R. In vivo mapping of brain myo-inositol. Neuroimage 2011;54(3):2079–2085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhang XY, Wang F, Afzal A, Xu JZ, Gore JC, Gochberg DF, Zu ZL. A new NOE-mediated MT signal at around-1.6 ppm for detecting ischemic stroke in rat brain. Magn Reson Imaging 2016;34(8):1100–1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zhang XY, Wang F, Jin T, Xu JZ, Xie JP, Gochberg DF, Gore JC, Zu ZL. MR imaging of a novel NOE-mediated magnetization transfer with water in rat brain at 9.4T. Magnetic Resonance In Medicine 2017;78(2):588–597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Zhao Y, Sun CS, Zu ZL. Assignment of molecular origins of NOE signal at-3.5 ppm in the brain. Magnetic Resonance In Medicine 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Hua J, Jones CK, Blakeley J, Smith SA, van Zijl PCM, Zhou JY. Quantitative description of the asymmetry in magnetization transfer effects around the water resonance in the human brain. Magnetic Resonance In Medicine 2007;58(4):786–793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zhang XY, Wang F, Li H, Xu JZ, Gochberg DF, Gore JC, Zu ZL. Accuracy in the quantification of chemical exchange saturation transfer (CEST) and relayed nuclear Overhauser enhancement (rNOE) saturation transfer effects. NMR in biomedicine 2017;30(7). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Windschuh J, Zaiss M, Meissner JE, Paech D, Radbruch A, Ladd ME, Bachert P. Correction of B1-inhomogeneities for relaxation-compensated CEST imaging at 7T. Nmr in Biomedicine 2015;28(5):529–537. [DOI] [PubMed] [Google Scholar]
- 58.Chen L, Zeng HF, Xu X, Yadav NN, Cai SH, Puts NA, Barker PB, Li T, Weiss RG, van Zijl PCM, Xu JD. Investigation of the contribution of total creatine to the CEST Z-spectrum of brain using a knockout mouse model. NMR in biomedicine 2017;30(12). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Cai K, Haris M, Singh A, Kogan F, Greenberg JH, Hariharan H, Detre JA, Reddy R. Magnetic resonance imaging of glutamate. Nature medicine 2012;18(2):302–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cui J, Zu ZL. Towards the molecular origin of glutamate CEST (GluCEST) imaging in rat brain. Magnetic Resonance In Medicine 2020;83(4):1405–1417. [DOI] [PubMed] [Google Scholar]
- 61.Chen L, Barker PB, Weiss RG, van Zijl PCM, Xu JD. Creatine and phosphocreatine mapping of mouse skeletal muscle by a polynomial and Lorentzian line-shape fitting CEST method. Magnetic Resonance In Medicine 2019;81(1):69–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Chen L, Schar M, Chan KWY, Huang JP, Wei ZL, Lu HZ, Qin Q, Weiss RG, van Zijl PCM, Xu JD. In vivo imaging of phosphocreatine with artificial neural networks. Nat Commun 2020;11(1). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Zaiss M, Bachert P. Chemical exchange saturation transfer (CEST) and MR Z-spectroscopy in vivo: a review of theoretical approaches and methods. Physics in medicine and biology 2013;58(22):R221–269. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supporting information Fig. S1 Multiple-pool Lorentzian fitted Reff (a) and fitted R_ex^APT, R_ex^amines, R_ex^NOE(−1.6) , R_ex^NOE(−3.5) , R_ex^MT spectra (b) from the average of the measured CEST Z-spectra in tumors in eight rat brains. Comparison between the measured CEST Z-spectra and synthetic Z-spectrum generated using all multiple-pool Lorentzian fitted components (c) as well as between the measured CEST Z-spectra and synthetic CEST Z-spectrum without APT (d).
Supporting information Fig. S2 Heatmap showing the mean squared error (MSE) loss versus the number of epochs for the training and validation using different model configurations. The solid line shows the training loss, while the dashed lines indicate the validation loss. The optimal training was found with 2 layers and 100 neurons with 4000 epochs. The learning rate was kept constant at 1×10−3 with Adam optimizer and ReLU activation was used.
Supporting information Fig. S3 Comparison of losses between the ML method and the multiple-pool Lorentzian fit for all testing data. ML models were trained on fourteen partially synthetic datasets, with the measured components fitted from ten other randomly selected Z-spectra (a-j), as well as the average of 50 (k), 100 (l), 500 (m), and 1000 (n) randomly selected Z-spectra, within the tissue-mimicking data. The mean losses are 7.2148×10-4, 7.2766×10-4, 7.2757×10-4, 7.0909×10-4, and 7.0852×10-4 in (a-j), (k), (l), (m), and (n), respectively.
Supporting information Fig. S4 Measured CEST Z-spectra from tumors and contralateral normal tissues in the three testing rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains (c, d), and the corresponding APT spectra from the ML prediction using the measured in vivo data for type 1 selection of 50, 100, 500, and 2157 voxels within five rat brains as well as with data augmentation (e, f). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-f) for comparison.
Supporting information Fig. S5 Measured CEST Z-spectra from tumors and contralateral normal tissues in the eight rats (a, b), the corresponding APT spectra from the ML prediction using the partially synthetic data for type 2 selection of tumors and normal tissues (c, d), the corresponding APT spectra from the ML prediction using the measured in vivo data for type 2 selection of tumors and normal tissues (e, f), and ML prediction using fully synthetic data with the type 1 and type 2 simulations (g, h). The multiple-pool Lorentzian fitted APT spectra were also plotted in (c-h) for comparison.
Supporting information Fig. S6 Pixel-wise regression plots between the multiple-pool Lorentzian fitted APT amplitude and the predicted APT amplitude from all the ML models trained using the partially synthetic data for type 1 selection of 50 (a), 100 (b), 500 (c), and 2157 (d) voxels within five rat brains, using the measured in vivo data for type 1 selection of 50 (e), 100 (f), 500 (g), and 2157 (h) voxels within five rat brains as well as with data augmentation (i).
Supporting information Fig. S7 Pixel-wise regression plots between the multiple-pool Lorentzian fitted APT amplitude and the predicted APT amplitude from all the ML models trained using the partially synthetic data for type 2 selection of tumors and normal tissues (a, b), using the measured in vivo data for type 2 selection of tumors and normal tissues (c, d), using the measured in vivo data for type 2 selection of tumors with data augmentation and normal tissues with data augmentation (e, f), and using fully synthetic data with the type 1 and type 2 simulations (g, h).
Supporting information Fig. S8 APT amplitude maps from the seven rat brains (from left to right, Rat 2, Rat 3, Rat 4, Rat 5, Rat 6. Rat 7, Rat 8) using the Lorentzian fitting (a), ML prediction using partially synthetic data with the measured components from the in vivo data with type 2 selection of tumors (b) and normal tissues (c), ML prediction using the measured in vivo data with type 2 selection of tumors (d) and normal tissues (e), ML prediction using the measured in vivo data with type 2 selection of tumors with data augmentation (f) and normal tissues with data augmentation (g), as well as ML prediction using fully synthetic data with the type 1 (h) and type 2 (i) simulations.
Supporting information Fig. S9 T2-weighted anatomy images from the eight rat brains. From left to right, Row 1 shows anatomy images for Rat 1 to Rat 4 respectively, and Row 2 shows the anatomy images for Rat 5 to Rat 8 respectively.
Supporting information Fig. S10 Scatter plots of losses from the ML method using the partially synthetic data (a) and from the multiple-pool Lorentzian fit (b) of all testing tissue-mimicking data vs. amine . The partially synthetic data were obtained with the measured components fitted from a randomly selected Z-spectrum within the tissue-mimicking data.
Supporting information Fig. S11 Left column: a representative Z-spectrum from the tissue-mimicking data (clean) together with a few noisy Z-spectra with SNR of 200 (a), 150 (b), 100 (c), 50 (d). Middle column: A comparison of the corresponding APT spectra from the ML prediction, multiple-pool Lorentzian fit, and ground truth. Right column: A comparison of the corresponding losses between the ML prediction and the multiple-pool Lorentzian fit for all testing data.
Supporting information Table S1. Starting points and boundaries of the amplitude, width, and offset of all pools in the Lorentzian fit. The unit of peak width and offset is ppm.
Supporting information Table S2. Description of the parameters used.
Supporting information Table S3. List of all sample parameters used to create the partially synthetic CEST data using Eq. (3) and Eq. (5). Since the NOE(−1.6) effect is far from the APT effect, it was not included in creating the synthetic training data.
Supporting information Table S4. List of all sample parameters used to create the tissue-mimicking data using numerical simulation of the Bloch-McConnell equation.
Supporting information Table S5. List of all sample parameters used to create the fully synthetic CEST data for type 1 and type 2 simulations. For type 1 simulations, the fs of guanidine is fixed at 0.03%.
Data Availability Statement
The code for simulations that support the findings of this study are openly available in CESTLabZu at https://github.com/CESTlabZu/MachineLearningbasedAPTImaging. The in vivo data used in this study are available on request from the corresponding author upon request.
