Abstract
Dynamic positron emission tomography and kinetic modeling play a critical role in tracer development research using small animals. Kinetic modeling from dynamic PET imaging requires accurate knowledge of an input function, ideally determined through arterial blood sampling. Arterial cannulation in mice, however, requires complex, time-consuming and terminal surgery, meaning that longitudinal studies are impossible. The aim of the current work was to develop and evaluate a non-invasive, deep-learning-based prediction model (DLIF) that directly takes the PET data as input to predict a usable input function. We first trained and evaluated the DLIF model on 68 [18F]Fluorodeoxyglucose mouse scans with image-derived targets using cross validation. Subsequently, we evaluated the performance of a trained DLIF model on an external dataset consisting of 8 mouse scans where the input function was measured by continuous arterial blood sampling. The results showed that the predicted DLIF and image-derived targets were similar, and the net influx rate constants following from Patlak modeling using DLIF as input function were strongly correlated to the corresponding values obtained using the image-derived input function. There were somewhat larger discrepancies when evaluating the model on the external dataset, which could be attributed to systematic differences in the experimental setup between the two datasets. In conclusion, our non-invasive DLIF prediction method may be a viable alternative to arterial blood sampling in small animal [18F]FDG imaging. With further validation, DLIF could overcome the need for arterial cannulation and allow fully quantitative and longitudinal experiments in PET imaging studies of mice.
Keywords: dynamic positron emission tomography (PET), small-animal PET 18F-FDG PET/CT, Patlak analysis, arterial input function estimation, glucose metabolism, deep learning, prediction model
1. Introduction
Small animal positron emission tomography (PET) is a non-invasive medical imaging tool that is essential in the development of new molecular imaging tracers, drugs, diagnostic procedures and disease therapies (1–3) In particular, dynamic PET imaging plays a critical role in the imaging of small animals, as it can visualize the time-dependent tracer uptake in vivo, and by the application of tracer kinetic modeling, it allows quantification of biochemical processes, such as glucose metaolism (4). Specifically for irreversible tracers, Patlak analysis can be applied to compute the influx rate constant, , which, for [18F]Fluorodeoxyglucose ([18F]FDG), is proportional to the metabolic rate of glucose (5, 6). The advantage of the Patlak model is that it may provide faster and more accurate calculations on voxel-wise, parametric images of , compared to full tracer kinetic modeling (7, 8). A prerequisite for both tracer kinetic modeling and Patlak analysis is that both the tissue uptake curve and the time-activity curve of the tracer in the blood, known as the arterial inut function (AIF), are known (9).
In preclinical PET imaging of rodents, arterial blood sampling is considered the gold-standard method to sample the input function. However, it requires complex and time-consuming surgery to insert an arterial catheter and allows only a limited blood volume to be withdrawn, without altering the animal physiology (10). Longitudinal experiments with arterial blood sampling from the superficial branch of the femoral artery has been described in the literature for rats (11–13). For mice, however, the most common approach is to cannulate the carotid artery (14). This usually requires terminal surgery because the animals cannot be awakened after the experiments, meaning that longitudinal studies with arterial blood sampling in mice are impossible. To overcome these shortcomings, several alternative methods have been proposed, including the use of a population-based AIF template (15), image-derived input function (IDIF) (16), and simultaneous estimation (17, 18). Although these methods overcome the need for arterial blood sampling, they still have limited practical usability. For instance, the population-based approach neglects individual physiological differences and scan-dependent variations, and requires at least one blood sample for curve scaling. The IDIF approach must be corrected for the limited spatial and temporal resolution of the PET imaging system, image noise, and cardiac and respiratory motion (10, 19–21). Simultaneous estimation could estimate both the AIF and kinetic parameters, but it assumes a known mathematical AIF model and requires at least one late blood sample for parameter estimation (17, 18, 22, 23).
Machine learning and deep learning methods have been used increasingly in recent years for many medical applications, including segmentation, classification, and regression problems (24). Specifically, we have proposed the use of Gaussian Processes and long-short-term-memory (LSTM) models for AIF estimation (25, 26). These models required time-activity curves from up to five tissue regions as model input for the prediction of the AIF. Thus, our earlier approaches were limited by the need for manual delineation of several regions of interest as input, which implies domain specific knowledge, and most importantly, long data processing time. An alternative AIF estimation method is the multilayer 3D-residual network with a regression module, proposed in (27). This offers a promising similarity in shape to the IDIF, though it faced challenges predicting some parts of the curve. Recently, a combined deep-learning-based and model-based method was proposed to estimate the parameters of the input function (28), however this model was only evaluated in a phantom study, and it assumes a specific mathematical AIF model. Nevertheless, there is a need to overcome the limitations associated with the aforementioned methods for sampling or estimating the arterial input function, which would increase practical usability of quantitative PET imaging and allow for longitudinal studies of mice.
In this work, we propose a deep learning model designed to take the four-dimensional PET data as input and a deep-learning-derived input function (DLIF) as output. The approach avoids the need of tedious and time-consuming manual segmentation of regions of interest, has no assumptions of a specific AIF model, and does not require any blood samples for calibration purposes. We evaluate the model by comparing the predicted input function to the reference ones, as well as using both voxel-wise and regional Patlak analysis. Thus, by overcoming the drawbacks of blood sampling and the limitations of other AIF estimation techniques, with proper validation, DLIF would allow for fully quantitative and longitudinal PET imaging of mice, and as such provide an instrumental step forward for quantitative small animal imaging research.
2. Methods
2.1. Datasets
The small animal imaging data used to train and evaluate the DLIF model was collected in retrospect from two independent cohorts, acquired at UiT The Arctic University of Norway (UiT) and Université de Sherbrooke (UdS). The same PET imaging system and radiotracer was used at both centers, however, there were some systematic differences in the experimental methods and available animal strains in the datasets (Table 1). The UiT dataset was used for training and initial evaluation of the model. Because a ground truth AIF was unavailable in this dataset, an IDIF was used as training targets. On the other hand, in the UdS dataset continuous arterial blood sampling was performed simultaneously with PET acquisition. This dataset was used for additional evaluation of the DLIF model.
Table 1.
UiT | UdS | |
---|---|---|
Number of animals | ||
Balb/c | 17 | 3 |
NZBWF1 | 51 | – |
C57/BL/6 | – | 2 |
CD-1 | – | 3 |
Total number of animals | 68 | 8 |
Age [weeks] | 24 8 | n/a |
Weight [g] | ||
Fasting time | 3 h 50 min min | No fasting |
Time in anaesthesia prior to PET | 1 h 17 min min | 45 min min |
Blood glucose [mmol/L] | ||
Injected dose [MBq] | 10. | |
Input function | IDIF | AIF |
2.1.1. UiT dataset
Animals and preparations
Preclinical PET/computed tomography (CT) data of 68 mouse scans were collected in retrospect from an already completed research study (29). This animal study was approved by the Competent Authority on Animal Research, the Norwegian Food Safety Authority; FOTS id 6676/2015. Thirty-six female mice from two strains [NZBWF1, Jax stock # 10008 ()] and [BALB/cAnNCrl ()], purchased from The Jackson Laboratory and Charles River Laboratories, respectively, were included. The mice were fasted for 3 h 50 min min, weighed, and anesthetized for 1 h 17 min min prior to tracer injection in an oxygen-isoflurane mixture (4% and 2% isoflurane for induction and maintenance, respectively), to reduce animal stress and allow stabilization of breathing and heart rate (29). Blood glucose was measured in venous blood to 6.9 mmol/L mmol/L prior to tracer administration, using a glucose meter (FreeStyle Lite, Abott Laboratories). A catheter, made from polyethylene tubing and a 30 gauge needle, was placed into the caudal vein to allow tracer injection.
Image acquisition
The PET/CT scans were performed using a TriumphTM LabPET-8TM small animal PET/CT scanner. Each mouse was scanned between 1–5 times at different ages (range 7–37 weeks), weighing g at imaging time. The anesthetized mice were centered in the field-of-view of the PET/CT scanner, while lying on a 35 ∘C heated bed inside an animal imaging cell, with sensors monitoring heart and breathing rate. Tracer administration was conducted by the injection of MBq [18F]FDG in 100 sterile saline through a tail-vein catheter during 30 s. For 56 scans, injections were performed with an infusion pump, while 12 scans were injected manually followed by 20 flush of sterile saline. A 60-min list-mode PET acquisition was started at injection time, followed by CT imaging for PET attenuation and scatter correction.
Image reconstruction and processing
The PET images were reconstructed into 44 time frames (, , and s) using a 3-dimensional maximum-likelihood estimator algorithm with 50 iterations. Corrections for detector efficiency, radioactive decay, random coincidences, dead time, attenuation and scatter were applied. The voxels were normalized into standardized uptake value (SUV) [g/ml] (30). Each time frame had an image matrix size of voxels. In the current study, only time points up to 45 min were included to match the external UdS dataset, so the last three time frames from the reconstruction were discarded in all following analysis. Furthermore, in order to reduce the memory load during model training, the image dimensions were cropped to , which still encompassed the most vital regions of the mouse.
Volume of interest delineation
Volumes of interest were delineated using PMOD 3.8 (PMOD Technologies Ltd.) in either dynamic PET or static PET space, the latter which was formed by averaging the last 20 min of the dynamic PET acquisition. Delineations of vena cava, myocardium, left ventricle and brain were performed in a standardized and reproducible way, as described in (25). In short, vena cava was defined in a 0.6 mm radius sphere centered on a peak voxel in an early time step of the dynamic PET sequence; myocardium was delineated as voxels above 40–60% of the max voxel value above background in the whole heart in static PET space; left ventricle was defined as the region encompassed by the myocardium uptake; and brain was delineated as a 2 mm radius sphere in the dorsal region of the skull, visually identified in static PET space. All VOIs were applied to the dynamic PET images, and the mean time-activity curve was extracted from each VOI.
IDIF calculation
The IDIF targets for each mouse scan was formed from a parameterized model fit to image-derived data points derived from vena cava and left ventricle volume-of-interest, as described in (25).
2.1.2. UdS dataset
Animals and preparations
The UdS dataset consisted of eight female mice from three different strains (C57/BL/6 (), CD-1 (), and BALB/c ()). Four mice were purchased from Charles River Laboratories, while four were donations from the central animal facility with unknown origin. Animal experiments were performed following the recommendations of the Canadian Council on Animal Care and were approved by the Université de Sherbrooke in-house Ethics Committee for Animal Experiments under Protocol 2022–3463. Animals had free access to food and water before the experiments. Animals were anesthetized with isoflurane (2% L/min O2) for 45 min min prior to tracer injection, while being cannulated in the caudal vein for tracer injection (prefilled with heparinized saline, 0.9%, 50 U/ml), and in the carotid artery for blood withdrawal, as described in (14). The animal temperature was regulated, and the heartbeat and breathing were monitored to ensure that physiological conditions were maintained as stable during the scan. Animal weight was g at imaging time. Blood glucose was measured in venous blood to 9.0 mmol/L mmol/L prior to tracer administration.
Image acquisition and arterial blood sampling
The anesthetized mice were placed in a TriumphTM LabPET-8TM small animal PET/CT scanner with the heart centered in the field of view. An ultrahigh sensitivity blood counter (UHS-BC) was placed on a table in front of the scanner (detector to animal distance 60 cm). The mice were injected with MBq of [18F]FDG in 100–200 sterile saline during 30 s through a 20 cm long PE10 catheter. The withdrawal pump speed was set to up to 5 min, then 7 for 36 min, corresponding to a total of 327 (15%) blood loss during the imaging experiment. This is within the maximum recommended blood loss from a single blood sample study (31). A 45-min list-mode PET acquisition was started at injection time. For the UdS data, CT imaging for PET attenuation and scatter correction was not performed.
Image reconstruction and processing and analysis
The PET images were reconstructed into 41 time frames, using the same framing, reconstruction algorithm and image corrections as described for the UiT dataset. However, because CT imaging was not available for the UdS data, attenuation and scatter correction was not performed. The measured blood curve was corrected for decay, delay, dead time and dispersion, as described in (14). Volumes of interest and an IDIF was also generated for this dataset, in a similar manner as for the UiT data.
2.2. Model architecture
The proposed DLIF model architecture is shown in Figure 1. First, the spatial dimensions of each of the 41 volumes associated to each of the time frames of a single sample are reduced, while at the same time relevant spatial information is extracted and noise filtered out. The main expedient consists of feeding the volumes in parallel to four layers of 3D convolutional filters coupled with batch normalization, rectified linear unit (ReLU) activation functions and finally, 3D maxpooling layers. Choices of the different components are well described in the literature (32), but for instance, batch normalization has been shown to speed up the training process and act as a regularizer to avoid overfitting (33), while ReLU seems the most reasonable choice for activation function when the input and output data are non-negative quantities (34). Also, halving the spatial dimensions with maxpooling while doubling the number of filters for each layer is a well-known approach for obtaining a rich yet compact representation of the input, and many well-known architectures adopt this strategy (35, 36). Once the smallest spatial dimensions are reached (), these 41 volumes of 16 features are flattened, so that each time frame is represented by a vector of features. These vectors are reduced to a length of by two cascaded multilayer perceptrons. Subsequently, 16 filters of 1D convolution are applied along the time axis, to capture the temporal correlations among the neighboring time frames. Finally, a multilayer perceptron takes the extracted features and outputs a 41 long vector as output, representing the predicted DLIF.
2.3. Training regimes
Two different training regimes were used. First, the DLIF model was trained and evaluated with the UiT dataset, using 17-fold cross validation. Cross validation is commonly applied for model performance evaluation in settings with limited available training data (37). By iteratively splitting the dataset to use 64 samples for training and 4 samples for testing in each of the 17 folds, this allowed to have all 68 available samples in the test set once. Next, a new DLIF model was trained using all samples from the UiT dataset, and subsequently tested on the UdS dataset. In this setting, 50 runs with different random initializations were performed to obtain statistics over the DLIF predictions.
In both training regimes the Adam optimizer (38) with standard hyperparameters was selected to perform the minimization of a mean square error loss between the ground truth and the DLIF. Training was performed for epochs with a learning rate of .
The DLIF models were implemented in Python 3.11.5, using PyTorch 2.1.0.
2.4. Model evaluation and statistical analyses
For both training regimes, the DLIF-predicted curves were first compared point by point to the respective reference input function using orthogonal regression. Orthogonal regression was chosen when comparing the measured and the predicted input function during regression analysis, because it assumes measurement error in both variable pairs, as opposed to standard linear regression, which assumes measurement error in only the independent variable (39). Patlak modeling (5, 6) was implemented in an in-house developed script (Python 3.11.5) and used to calculate the influx rate constant, , for each voxel, as well as for brain and myocardium tissues, using both the predicted DLIF and each reference input function. For the UdS dataset, the regional influx rate constants for brain and myocardium tissues were also calculated using the IDIF and compared to those obtained with the AIF.
DLIF- and IDIF-based influx estimates were compared to those obtained from respective reference input function using paired -test () and orthogonal regression. Normality was assessed using quantile-quantile plots. All statistical analyses were implemented in an in-house developed script (Python 3.11.5).
3. Results
Two datasets were used in these experiments. The UiT dataset, consisting of 68 mouse PET scans and an IDIF as reference input function, was used to train and evaluate the model through 17-fold cross validation. Next, the UiT dataset was used as a whole for training while the UdS dataset, consisting of 8 mouse PET scans with corresponding AIF from arterial blood sampling, was used as an external test set.
3.1. Cross-validation
The overall input function curve shape, with an early peak and a vanishing tail, is captured by the DLIF model, as shown for the three mouse scan examples in Figure 2. Investigating the data points from all samples, there is a strong overall linear relationship (slope: 0.84) and strong correlation (correlation coefficient: 0.91) between DLIF and IDIF (Figure 3A). The DLIF model slightly underestimates the IDIF peak, shown as a deviation from the linear model for data points with high SUV value in Figure 3A.
Following DLIF prediction, Patlak modeling was performed to investigate the potential usefulness of the DLIF model for glucose metabolic rate measurements. Figure 3B displays all voxel-wise data points as scatterplot, using DLIF and IDIF as input function, respectively, while Table 2 and Supplementary Figure S2 presents statistics and data from the regional values, for brain and myocardium tissues, respectively. The voxel-wise influx rate constants displayed a strong linear relationship (slope: 1.05) and a strong correlation (correlation coefficient: 0.94) for the majority of the mouse scans. A few outlier cases are obvious as colored data points deviating from the identity line in Figure 3B. Nevertheless, visual comparison of the voxel-wise influx rate constants calculated with IDIF and DLIF, respectively, for the mouse scan with minimum and maximum errors from Figure 2, still show promising similarities (Figure 4). The regional Patlak analysis (Table 2 and Supplementary Figure S2) indicated good agreement between the population average influx rate constants for brain and myocardium tissues (average errors: ) and a strong correlation (correlation coefficient: 0.78–0.95). The obtained P values, surpassing the significance level, suggested insufficient evidence to reject the null hypothesis of significant differences between the groups.
Table 2.
Dataset | Model | Statistics of | Brain | Myocardium |
---|---|---|---|---|
UiT | Reference IDIF | Estimate (ml/g/min) | ||
DLIF | Estimate (ml/g/min) | |||
Average error (%) | ||||
Correlation coefficient | 0.78 | 0.95 | ||
test value | 0.15 | 0.09 | ||
UdS | Reference AIF | Estimate (ml/g/min) | ||
DLIF | Estimate (ml/g/min) | |||
Average error (%) | ||||
Correlation coefficient | 0.94 | 0.91 | ||
test value | 0.87 | 0.38 | ||
IDIF | Estimate (ml/g/min) | |||
Average error (%) | ||||
Correlation coefficient | 0.84 | 0.97 | ||
test value | 0.36 | 0.09 |
∗Estimate (ml/g/min) and average error (%) are expressed as mean confidence interval. Average error was calulated as . Correlation coefficient, and values are calculated from (, ) pairs within each dataset.
3.2. External evaluation
A DLIF model trained on the full UiT dataset was applied to the UdS data for the purpose of external evaluation. The general curve shape is captured by the DLIF model and is in good agreement for some mouse scans (Figure 5A). However, there are also examples where the DLIF model fails to predict the tail (Figure 5B) or the peak (Figure 5C) in this external dataset. The predictions for all test mouse scans are shown in Supplementary Figure S1. For most mouse scans, the DLIF model is unable to predict the AIF peak, while the tail regions are generally in better agreement with the mean DLIF curve (Supplementary Figure S1). This is also visible in the comparison of individual data points (Figure 6A). There is a strong correlation between the data points (correlation coefficient: 0.70), with a linear tendency (slope: 0.61), although with discrepancies, especially around the peak, corresponding to high data point values.
The influx rate constant from Patlak modeling, obtained using DLIF and reference AIF as input function, respectively, showed a strong linear relationship (slope: 1.12) and a strong correlation (correlation coefficient: 0.90) for the voxel-wise calculations (Figure 6B). Visual comparison of the voxel-wise influx rate constants calculated with AIF and DLIF, respectively, for the mouse scan with minimum and maximum errors from Figure 5, also show promising similarities (Figure 7). The regional comparisons for brain and myocardium tissues (Table 2 and Supplementary Figure S3) indicated average errors of 20% and, respectively, with a strong correlation (correlation coefficient: 0.91–0.94). The obtained values for brain and myocardium tissues, surpassing the significance level, suggested insufficient evidence to reject the null hypothesis of significant differences between the groups. Table 2 and Supplementary Figure S4 also displays the corresponding comparisons between obtained with the reference AIF and the one obtained using the IDIF in the UdS dataset. These results indicate similar agreement for the brain region compared to DLIF (mean error: , correlation coefficient: 0.84, value: 0.36), while for myocardium tissues, larger and significant deviations from the reference AIF were obtained, compared to DLIF (mean error: , correlation coefficient: 0.97, value: 0.09).
4. Discussion
Tracer kinetic modeling from dynamic PET imaging requires accurate knowledge of the AIF, ideally determined through arterial blood sampling. The aim of the current study was to develop and evaluate a deep-learning-based prediction model, DLIF, that takes directly the four-dimensional PET data as input, in order to predict a usable input function.
DLIF offers several other advantages relative to currently available methods for AIF estimation. Compared to arterial blood sampling, a trained DLIF model is a non-invasive method, implying simple and convenient use, without the need for surgery, thus allowing longitudinal PET experiments in mice. Compared to our previously published machine learning derived input function (MLDIF) (25, 26), DLIF reduces the subjective bias and preprocessing time by avoiding the need for manual extraction of input time-activity curves. Other common AIF estimation methods, including IDIF (10, 19–21) or simultaneous estimation (17, 18, 22, 23), require correction for the partial volume effect, which is non-trivial, scanner-dependent, and must be carefully measured, or still require a late blood sample for calibration purposes. In contrast, our experiments indicate that a trained DLIF model allows the prediction of both the shape and the amplitude of an input function, using solely image-derived input data, thus without the need for a blood sample for AIF scaling.
4.1. Cross-validation
We first trained and evaluated the DLIF model on 68 mouse scans with IDIF targets using cross-validation. These experiments indicated that DLIF could capture the overall input function curve shape, with an early peak and a vanishing tail, and as such, could be a useful approach for estimating an IDIF (Figure 2). Data points between DLIF and IDIF input functions showed a linear behavior and were highly correlated (Figure 3A). As the input function curve itself is not the interesting result in most dynamic PET studies, we evaluated the influx rate constant, , from Patlak modeling using the reference IDIF as input function, and compared it to the corresponding , when using DLIF as input function. This analysis was done on both voxel-wise (Figures 3B and 4), and on a regional level for brain and myocardium tissues (Table 2, Supplementary Figure S2). In the voxel-wise scatterplot (Figure 3B), each mouse scan is displayed with a different color. The individual slope coefficients for the majority of mouse scans were around 1, however, 10 (15%) of the 68 mouse scans had more severe deviations from the identity line, with slope values outside the range of (Supplementary Table S1). One of these was attributed to tracer injection problems (yellow data points in Figure 3B). Six of the 10 outliers (60%) were BALB/c mice. This strain was represented by only 17 (25%) of the 68 mouse scans in the training data. It is well known that tracer kinetics and glucose metabolism can vary significantly between different mouse strains, especially for diseased strains (40, 41). Based on the unbalanced number of samples from the two strains, we hypothesize that the DLIF model is sensitive to differences in uptake pattern between different mouse strains, and thus is unable to learn the possible variations among the samples. Among the remaining 3 outliers, 2 mouse scans were attributed to significantly higher myocardium uptake, which could partly explain why the DLIF model was less accurate for these samples. The remaining outlier had an unusual positioning in the mouse bed with its spine being curled, and not elongated, as for most other mice, which could affect the DLIF model output (42). Also note that the small P-values found in Figure 3 () are expected for comparisons of large sample sizes (43).
The regional analysis indicated similar mean values for brain and myocardium tissues, with mean errors below 10 %, strong correlation (correlation coefficients: brain: 0.78, myocardium: 0.95), and non-significant differences between the influx rate constants derived using DLIF and reference IDIF as input function (Table 2 and Supplementary Figure S3). These findings were similar to our previously published MLDIF model for brain, where we reported correlation coefficients of 0.56 for brain, and 0.90 for myocardium, with average errors of 7% and 4%, respectively [Table 3 in (25)]. The MLDIF model, however, was based on extracted time-activity curves from 5 manually defined tissue regions (myocardium, brain, liver, muscle and brown fat) as model input. Although the results were similar when using DLIF instead of MLDIF, DLIF undoubtedly overcomes the time-consuming need for manual tissue region delineation in the input images. This removes potential bias in the delineation step, and significantly simplifies the processing pipeline, as well as shortens the application time required to use the DLIF model.
4.2. External evaluation
The DLIF model, was trained with reference IDIF data targets due to the limited availability of high quality and high quantity small animal PET datasets with ground truth arterial blood sampling. To investigate the potential application of DLIF to predict a useful blood input function, we performed model training on the full 68 mouse scan dataset with IDIF targets, and subsequently evaluated the performance on the external UdS dataset, which contained continuous arterial blood samples as ground truth. The predicted DLIF curves were in good agreement with the ground truth AIF for some mouse scans in the UdS dataset (Figure 5), while there were larger discrepancies for others (Supplementary Figure S1). In general, the tail of the DLIF prediction was in better agreement compared to the peak. This discrepancy could be explained by a larger variation in the peak region for the UdS data (Figure 6A). There were several systematic differences in the data collection methodology between the UiT training data and the UdS test data, which could explain these deviations. Most importantly, the UdS mice were not fasted before the experiments, which is reflected in the slightly higher blood glucose values of the UdS data. It is well known that fasting affects the glucose metabolism in mice, and contributes to reducing the variability between samples (44–47), and specifically, myocardium uptake is affected by fasting, a region that we have shown is important for the prediction of the input function (25). Another difference between the studies was that the injection volume in the UdS data was not fixed. Four mice were injected using 100 , while four mice were injected with 200 . This volume difference could most likely introduce a variability in the peak of the input function, which was not accounted for in the training data used for the DLIF model. Also, for the UdS data, no attenuation or scatter correction was performed, because an associated CT scan was missing from the dataset. This could introduce underestimations of the measured activity concentration depending on tissue position (48), something which is not accounted for by the DLIF model. Lastly, the UdS test mice were of three different strains, and four of the mice could not be traced to a specific animal supplier, which could introduce additional bias in the data.
Despite these discrepancies, following Patlak modeling, the voxel-wise influx rate constants obtained using DLIF and reference AIF as input function for the UdS data showed an overall strong linear relationship and strong correlation (Figure 6B), even though systematic deviations are obvious for different mice. We found small but systematic underestimations of the slope coefficient for all BALB/c mice (, average slope coefficient: 0.8), while the slope coefficients for C57/BL/6 (, average slope coefficient: 1.4) and CD-1 (, average slope coefficient: 1.5) were larger and systematically overestimated (Supplementary Table S2). While 17 (25%) of the 68 mouse scans used for training of the DLIF model were from the BALB/c strain, the C57/BL/6 and CD-1 strains were not represented among the training data. Similar to our discussion around Figure 3B, this further indicates that the DLIF model is sensitive to uptake patterns in mouse strains that were not part of the training data.
As evident from Figure 6A, the lower SUV values originating from the input function tail are closer to the identity line, compared to larger SUV values, belonging to the peak. The linear fit during Patlak graphical analysis is based on the steady state part following the input function peak (6). Although the integral of the full input function is present in the equation, the impact of the peak on the is minimal, and consequently, this method is robust to noise and bias around the input function peak. This could explain why the Patlak scatterplot (Figure 6B) still showed high correlation in the voxelwise analysis, despite the discrepancy of many of the peak regions (Supplementary Figure S1). The regional comparison of calculated using reference AIF and DLIF (Table 2 and Supplementary Figure S2) indicated mean errors of 20% and in for brain and myocardium regions, respectively, with a strong correlation, and non-significant differences between the groups. These differences could be expected because of the mentioned differences between the UiT training data, and the UdS test data. Still, while DLIF-based calculation of showed slightly larger errors for brain tissue, compared to IDIF, both results were non-significantly different from the influx derived with the AIF. For myocardium tissue, the average error in the DLIF-based calculation of was similar to the error obtained during cross-validation experiments, while the corresponding error for the IDIF-based calculation was larger and significantly different from the AIF-based influx. Interestingly, although IDIF is a commonly used method for AIF estimation in the literature (16, 49), our results indicate that there might still be significant discrepancies between IDIF and AIF for myocardium tissues, which could be overcome by using the DLIF method. Again, we note that the small P-values found in Figure 6 () are expected for comparisons of large sample sizes (43). With all these mentioned limitations and differences between the UiT training data and the UdS test data, we argue that the DLIF model trained on IDIF reference targets still shows promising potential when compared to the external UdS data with reference AIF targets.
4.3. Limitations
As depicted in Figures 3A and 6A, the DLIF model sometimes over- or underestimates the reference input function in early time frames, evident as vertical and horizontal data points around , respectively. For these cases, we hypothesize that the DLIF model is unable to handle slight time-shifts in the input data. This will be investigated in future research. Our results furthermore indicate that the DLIF approach is sensitive to the specific mouse strain that it was trained on. Further prerequisites for the DLIF approach is that representative training data have been collected for the specific tracer, imaging system, and imaging protocol.
4.4. Future work
Although our work provided a comparison between DLIF and an AIF in the UdS test data, these findings must be further investigated in future research because of the large differences between the experimental methods in the UiT and UdS datasets. For instance, future work could include studying the dependency of the DLIF model to factors such as tracer, mouse strain, and variations in different experimental conditions and imaging protocols. Future research must also evaluate a DLIF model trained on a reference AIF measured in arterial blood.
The DLIF approach was in this work evaluated with [18F]FDG on a specific small animal PET imaging system. With further comprehensive validation, we suggest that the DLIF model could be retrained for other tracers and imaging systems. It is also conceivable that tracers requiring metabolite-correction may be modelled. DLIF could also have relevant applications in clinical human PET imaging (26). The accuracy of the DLIF models for a particular PET application will, in the end, depend on the quality, quantity and relevance of the available training data. Nevertheless, if properly validated, DLIF could provide a simplified, low-bias method for performing quantitative and longitudinal PET imaging studies in mice.
4.5. Conclusion
In conclusion, we demonstrated that our non-invasive DLIF prediction method may be a viable alternative to arterial blood sampling in [18F]FDG imaging of mice. The proposed approach does not require manual segmentation for model input, and it is not depending on a late calibration blood sample or any partial-volume correction. The resulting influx rate constants from Patlak modeling agreed well with image-derived reference values and promising agreement was obtained when comparing to data with continuous arterial blood sampling. With further validation, DLIF could overcome the need for arterial cannulation and allow fully quantitative and longitudinal experiments in PET imaging studies of mice.
Acknowledgments
We would like to thank Ana Oteiza, Montserrat Martin-Armas, Kristin Fenton and Esmaeil Dorraji, who helped to acquire the UiT data. We would also like to thank Mélanie Archambault and Véronique Dumulon-Perreault, who helped to acquire the UdS data.
Funding Statement
The author(s) declare financial support was received for the research, authorship, and/or publication of this article.
This research was supported by Tromsø Research Foundation (19_PET-NUKL) through the 180∘N Norwegian Nuclear Medicine Consortium.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: Protection of intellectual property and innovation. Requests to access these datasets should be directed to samuel.kuttner@uit.no.
Ethics statement
The animal study was approved by Competent Authority on Animal Research, the Norwegian Food Safety Authority; FOTS id 6676/2015 and Université de Sherbrooke in-house Ethics Committee for Animal Experiments under Protocol 2022–3463. The study was conducted in accordance with the local legislation and institutional requirements.
Author contributions
SK: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing; LTL: Methodology, Software, Writing – review & editing; LC: Formal Analysis, Investigation, Writing – review & editing; OS: Formal Analysis, Software, Writing – review & editing; RL: Conceptualization, Writing – review & editing; MK: Methodology, Supervision, Writing – review & editing; RS: Conceptualization, Funding acquisition, Supervision, Writing – review & editing; RJ: Conceptualization, Methodology, Supervision, Writing – review & editing.
Conflict of interest
RL is a co-founder and acting Chief Scientific Officer of Imaging Research & Technology Inc.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnume.2024.1372379/full#supplementary-material
References
- 1.Hicks RJ, Dorow D, Roselt P. Pet tracer development—a tale of mice, men. Cancer Imaging. (2006) 6:S102–6. 10.1102/1470-7330.2006.9098 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yao R, Lecomte R, Crawford ES. Small-animal pet: what is it,, why do we need it? J Nucl Med Technol. (2012) 40:157–65. 10.2967/jnmt.111.098632 [DOI] [PubMed] [Google Scholar]
- 3.Cunha L, Horvath I, Ferreira S, Lemos J, Costa P, Vieira D, et al. Preclinical imaging: an essential ally in modern biosciences. Mol Diagn Ther. (2014) 18:153–73. 10.1007/s40291-013-0062-3 [DOI] [PubMed] [Google Scholar]
- 4.Gunn RN, Gunn SR, Cunningham VJ. Positron emission tomography compartmental models. J Cereb Blood Flow Metab. (2001) 21:635–52. 10.1097/00004647-200106000-00002 [DOI] [PubMed] [Google Scholar]
- 5.Patlak CS, Blasberg RG, Fenstermacher JD. Graphical evaluation of blood-to-brain transfer constants from multiple-time uptake data. J Cereb Blood Flow Metab. (1983) 3:1–7. 10.1038/jcbfm.1983.1 [DOI] [PubMed] [Google Scholar]
- 6.Patlak CS, Blasberg RG. Graphical evaluation of blood-to-brain transfer constants from multiple-time uptake data. generalizations. J Cereb Blood Flow Metab. (1985) 5:584–90. 10.1038/jcbfm.1985.87 [DOI] [PubMed] [Google Scholar]
- 7.Bouallègue FB, Vauchot F, Mariano-Goulart D. Comparative assessment of linear least-squares, nonlinear least-squares,, Patlak graphical method for regional and local quantitative tracer kinetic modeling in cerebral dynamic 18F-FDG pet. Med Phys. (2019) 46:1260–71. 10.1002/mp.13366 [DOI] [PubMed] [Google Scholar]
- 8.Wang G, Rahmim A, Gunn RN. Pet parametric imaging: past, present, and future. IEEE Trans Radiat Plasma Med Sci. (2020) 4:663–75. 10.1109/TRPMS.2020.3025086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Alf MF, Wyss MT, Buck A, Weber B, Schibli R, Krämer SD. Quantification of brain glucose metabolism by 18F-FDG pet with real-time arterial and image-derived input function in mice. J Nucl Med. (2013) 54:132–8. 10.2967/jnumed.112.107474 [DOI] [PubMed] [Google Scholar]
- 10.Laforest R, Sharp TL, Engelbach JA, Fettig NM, Herrero P, Kim J, et al. Measurement of input functions in rodents: challenges and solutions. Nucl Med Biol. (2005) 32:679–85. 10.1016/j.nucmedbio.2005.06.012 [DOI] [PubMed] [Google Scholar]
- 11.Meyer PT, Circiumaru V, Cardi CA, Thomas DH, Bal H, Acton PD. Simplified quantification of small animal [18F]FDG pet studies using a standard arterial input function. Eur J Nucl Med Mol Imaging. (2006) 33:948–54. 10.1007/s00259-006-0121-7 [DOI] [PubMed] [Google Scholar]
- 12.Sijbesma JW, Zhou X, García DV, Houwertjes MC, Doorduin J, Kwizera C, et al. Novel approach to repeated arterial blood sampling in small animal pet: application in a test-retest study with the adenosine a1 receptor ligand [11C]MPDX. Mol Imaging Biol. (2016) 18:715–23. 10.1007/s11307-016-0954-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rey-Bretal D, Moscoso A, Gómez-Lado N, Fernández-Ferreiro A, Silva-Rodríguez J, Álvaro R, et al. Feasibility of longitudinal brain pet with real-time arterial input function in rats. Mol Imaging Biol. (2021) 23:350–60. 10.1007/s11307-020-01556-y [DOI] [PubMed] [Google Scholar]
- 14.Convert L, Sarrhini O, Paillé M, Salem N, Charette PG, Lecomte R. The ultra high sensitivity blood counter: a compact, MRI-compatible, radioactivity counter for pharmacokinetic studies in l volumes. Biomed Phys Eng Express. (2022) 8:035022. 10.1088/2057-1976/AC4C29 [DOI] [PubMed] [Google Scholar]
- 15.Takikawa S, Dhawan V, Spetsieris P, Robeson W, Chaly T, Dahl R, et al. Noninvasive quantitative fluorodeoxyglucose PET studies with an estimated input function derived from a population-based arterial blood curve. Radiology. (1993) 188:131–6. 10.1148/radiology.188.1.8511286 [DOI] [PubMed] [Google Scholar]
- 16.Lanz B, Poitry-Yamate C, Gruetter R. Image-derived input function from the vena cava for 18F-FDG PET studies in rats and mice. J Nucl Med. (2014) 55:1380–8. 10.2967/jnumed.113.127381 [DOI] [PubMed] [Google Scholar]
- 17.Bartlett EA, Ananth M, Rossano S, Zhang M, Yang J, Lin S, et al. Quantification of positron emission tomography data using simultaneous estimation of the input function: validation with venous blood and replication of clinical studies. Mol Imaging Biol. (2019) 21:926–34. 10.1007/s11307-018-1300-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Roccia E, Mikhno A, Ogden T, Mann JJ, Laine A, Angelini E, et al. Quantifying brain [18F]FDG uptake noninvasively by combining medical health records and dynamic PET imaging data. IEEE J Biomed Heal. Inform. (2019) 2194:1–7. 10.1109/JBHI.2018.2890459 [DOI] [PubMed] [Google Scholar]
- 19.Frouin V, Comtat C, Reilhac A, Grégoire MC. Correction of partial-volume effect for pet striatal imaging: fast implementation and study of robustness. J Nucl Med. (2002) 43:1715–26. PMID: [PubMed] [Google Scholar]
- 20.Kim E, Shidahara M, Tsoumpas C, McGinnity CJ, Kwon JS, Howes OD, et al. Partial volume correction using structural-functional synergistic resolution recovery: comparison with geometric transfer matrix method. J Cereb Blood Flow Metab. (2013) 33:914–20. 10.1038/jcbfm.2013.29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fang YHD, Muzic RF. Spillover, partial-volume correction for image-derived input functions for small-animal 18F-FDG pet studies. J Nucl Med. (2008) 49:606–14. 10.2967/jnumed.107.047613 [DOI] [PubMed] [Google Scholar]
- 22.Feng D, Wong KP, Wu CM, Siu WC. A technique for extracting physiological parameters, the required input function simultaneously from PET image measurements: theory and simulation study. IEEE Trans Inf Technol Biomed. (1997) 1:243–54. 10.1109/4233.681168 [DOI] [PubMed] [Google Scholar]
- 23.Wong KP, Feng D, Meikle SR, Fulham MJ. Simultaneous estimation of physiological parameters and the input function—in vivo PET data. IEEE Trans Inf Technol Biomed. (2001) 5:67–76. 10.1109/4233.908397 [DOI] [PubMed] [Google Scholar]
- 24.Shen D, Wu G, Suk HI. Deep learning in medical image analysis. Annu Rev Biomed Eng. (2017) 19:221. 10.1146/annurev-bioeng-071516-044442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kuttner S, Wickstrøm KK, Kalda G, Dorraji SE, Martin-Armas M, Oteiza A, et al. Machine learning derived input-function in a dynamic 18 F-FDG PET study of mice. Biomed Phys Eng Express. (2020) 6:015020. 10.1088/2057-1976/ab6496 [DOI] [PubMed] [Google Scholar]
- 26.Kuttner S, Wickstrøm KK, Lubberink M, Burman J, Sundset R, Jenssen R, et al. Cerebral blood flow measurements with 15 o-water pet using a non-invasive machine-learning-derived arterial input function. J Cereb Blood Flow Metab. (2021) 41:2229–41. 10.1177/0271678X21991393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wang L, Ma T, Yao S, Ye Q, Coughlin J, Pomper M, et al. Direct estimation of input function based on fine-tuned deep learning method in dynamic PET imaging. J Nucl Med. (2020) 61:1394. [Google Scholar]
- 28.Varnyú D, Szirmay-Kalos L. Blood input function estimation in positron emission tomography with deep learning. In: 2021 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC) (2021). p. 1–7. 10.1109/NSS/MIC44867.2021.9875543 [DOI] [Google Scholar]
- 29.Dorraji ES, Oteiza A, Kuttner S, Martin-Armas M, Kanapathippillai P, Garbarino S, et al. Positron emission tomography and single photon emission computed tomography imaging of tertiary lymphoid structures during the development of lupus nephritis. Int J Immunopathol Pharmacol. (2021) 35:1–13. 10.1177/20587384211033683 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Keyes JW. SUV: standard uptake or silly useless value? J Nucl Med. (1995) 36:1836–9. PMID: [PubMed] [Google Scholar]
- 31.Diehl KH, Hull R, Morton D, Pfister R, Rabemampianina Y, Smith D, et al. A good practice guide to the administration of substances and removal of blood, including routes and volumes. J Appl Toxicol. (2001) 21:15–23. 10.1002/jat.727 [DOI] [PubMed] [Google Scholar]
- 32.Goodfellow I, Bengio Y, Courville A. Deep Learning. MIT Press; (2016). http://www.deeplearningbook.org [Google Scholar]
- 33.Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv (2015). doi: 10.48550/arXiv.1502.03167
- 34.Maas AL, Hannun AY, Ng AY. Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the International Conference in Machine Learning (ICML) (2013), vol. 30, 3 [Google Scholar]
- 35.Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer; (2015). p. 234–41. [Google Scholar]
- 36.He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016). p. 770–8. [Google Scholar]
- 37.Wong TT. Performance evaluation of classification algorithms by k-fold and leave-one-out cross validation. Pattern Recognit. (2015) 48:2839–46. 10.1016/j.patcog.2015.03.009 [DOI] [Google Scholar]
- 38.Reddi SJ, Kale S, Kumar S. On the convergence of ADAM and beyond. In: Proc. Int. Conf. Learn. Represent. (ICLR) (2018). [Google Scholar]
- 39.Carr JR. Orthogonal regression: a teaching perspective. Int J Math Educ Sci Technol. (2012) 43:134–43. 10.1080/0020739X.2011.573876 [DOI] [Google Scholar]
- 40.Martic-Kehl MI, Ametamey SM, Alf MF, Schubiger PA, Honer M. Impact of inherent variability and experimental parameters on the reliability of small animal pet data. EJNMMI Res. (2012) 2:26. 10.1186/2191-219X-2-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Shao X, Liu Y, Zhou M, Xu M, Chen Y, Huang H, et al. Dynamic evolution and mechanism of myocardial glucose metabolism in different functional phenotypes of diabetic cardiomyopathy—a study based on 18F-FDG micropet myocardial metabolic imaging. Diabetol Metab Syndr. (2023) 15:64. 10.1186/s13098-023-01038-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Shorten C, Khoshgoftaar TM. A survey on image data augmentation for deep learning. J Big Data. (2019) 6:60. 10.1186/s40537-019-0197-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lantz B. The large sample size fallacy. Scand J Caring Sci. (2013) 27:487–92. 10.1111/j.1471-6712.2012.01052.x [DOI] [PubMed] [Google Scholar]
- 44.Fueger BJ, Czernin J, Hildebrandt I, Tran C, Halpern BS, Stout D, et al. Impact of animal handling on the results of 18F-FDG pet studies in mice. J Nucl Med. (2006) 47:999–1006. PMID: [PubMed] [Google Scholar]
- 45.Wong KP, Sha W, Zhang X, Huang SC. Effects of administration route, dietary condition, and blood glucose level on kinetics and uptake of 18F-FDG in mice. J Nucl Med. (2011) 52:800–7. 10.2967/jnumed.110.085092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Kreissl MC, Stout DB, Wong KP, Wu HM, Caglayan E, Ladno W, et al. Influence of dietary state and insulin on myocardial, skeletal muscle and brain [f]-fluorodeoxyglucose kinetics in mice. EJNMMI Res. (2011) 1:8. 10.1186/2191-219X-1-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Langah RAK, Spicer KM, Chang R, Rosol M. Inhibition of physiologic myocardial FDG uptake in normal rodents: comparison of four pre-scan preparation protocols. Adv J Mol Imaging. (2012) 2:21–30. 10.4236/ami.2012.23004 [DOI] [Google Scholar]
- 48.Ali HH, Bodholdt RP, Jørgensen JT, Myschetzky R, Kjaer A. Importance of attenuation correction (AC) for small animal pet imaging. Diagnostics. (2012) 2:42–51. 10.3390/diagnostics2040042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wu HM, Sui G, Lee CC, Prins ML, Ladno W, Lin HD, et al. In vivo quantitation of glucose metabolism in mice using small-animal PET and a microfluidic device. J Nucl Med. (2007) 48:837–45. 10.2967/jnumed.106.038182 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data analyzed in this study is subject to the following licenses/restrictions: Protection of intellectual property and innovation. Requests to access these datasets should be directed to samuel.kuttner@uit.no.