Abstract
Background
Respiratory motion during the single photon emission computed tomography (SPECT) acquisition can cause blurring artifacts in the reconstructed images, leading to inaccurate estimates for activity and absorbed doses.
Purpose
To address the impact of respiratory motion, we utilized a new optical surface imaging (OSI) system to extract the respiratory signals for phase sorting and verified its effectiveness through simulation and patient data. Additionally, we implemented GPU‐accelerated motion‐incorporated reconstruction algorithms for the SPECT projections, integrating motion information to produce motion‐free images from all acquired data.
Methods
We used the 4D XCAT Phantom to generate attenuation maps and activity images across different respiratory phases, with activity distributions based on patient images. SPECT projections were simulated using the SIMIND Monte Carlo program with Poisson noise. The OSI system was modeled by introducing Gaussian noise into the point clouds on the body surface within the attenuation map. The body surface images were registered across phases using a Gaussian mixture model combined with principal component analysis. The extracted respiratory signals were compared to the center‐of‐light (COL) approach, with or without filtering and kidney masking. The OSI method was further validated by comparing respiratory signals derived from a real patient using OSI to simultaneous cone‐beam CT (CBCT) projections. Two motion‐incorporated techniques, namely, 4D reconstruction (4D‐Recon) and post‐reconstruction registration and summation (post‐Recon), were compared with non‐motion‐corrected images (non‐MC) and single‐phase gating (Gating). The quantitative evaluation of image quality utilized recovery coefficients (RC), contrast recovery coefficients (CRC), and uncertainty estimation.
Results
In simulation, the correlation between the ground‐truth and OSI‐based signals remained high and stable (0.99 ± 0.004, p‐value 0.001 vs. COL‐filter with kidney masking). While the kidney mask improved performance (0.87 ± 0.07 without filtering and 0.90 ± 0.06 with filtering, p‐value 0.001), it was less effective and more uncertain than the OSI method. Validation with patient data showed high consistency in breathing frequencies and phase alignment between CBCT‐based and OSI‐based signals. For reconstruction, both 4D‐Recon and post‐Recon significantly enhanced RC and CRC compared to non‐MC, with less uncertainty than Gating. In addition, 4D‐Recon outperformed post‐Recon in certain aspects.
Conclusions
Our novel respiratory signal extraction approach based on OSI demonstrated superior accuracy and reliability compared to a data‐driven method. Applying motion‐incorporated SPECT reconstruction using these accurate breathing signals has the potential to enhance image quality and improve absorbed dose quantification in radiopharmaceutical therapy. The relevant reconstruction algorithms are also made available for public use in the open‐source library PyTomography.
Keywords: nonrigid motion correction, quantitative SPECT, respiratory‐gated 4D imaging, surface tracking
1. INTRODUCTION
Respiratory motion during the single photon emission computed tomography (SPECT) data acquisition affects image quality by causing signal distortion. 1 This leads to blurring, reduced contrast in organs and lesions, increased noise, and artifacts, such as ghosting, splitting, and deformation of activity sources, as well as spillover or dispersion of activity. 2 , 3 , 4 In addition, with the growing use of SPECT for dosimetry in radiopharmaceutical therapies, quantitative SPECT is expected to play a key role in the optimization task of minimizing toxicities to normal organs while maximizing absorbed doses to tumors. 5 , 6 , 7 However, images acquired without accounting for respiratory motion may lead to inaccuracies in the estimated activities. Research indicates that in liver tumor dosimetry using SPECT scans, respiratory motion reduces activity recovery and tumor/non‐tumor ratios by an average of 30 compared to acquisitions without motion. 8 Therefore, efforts are required to reduce or minimize the impact of respiratory motion.
Accurate and robust respiratory signal sensing is a prerequisite for achieving motion correction (MC). 9 , 10 , 11 The signal facilitates the grouping of projection data acquired at different angles, but corresponding to the same respiratory state, into identical bins, which can then be used for motion‐incorporated reconstruction. Some studies have proposed using deep learning methods to extract respiratory signal 12 ; however, they still require external motion‐tracking devices to train the network. Although 4D CT can directly observe tumor motion, it significantly increases the patient's imaging dose compared to 3D CT. 13 Additionally, the respiratory signal obtained from 4D CT may not accurately reflect the conditions during SPECT scanning, as the two are not acquired synchronously.
Data‐driven gating (DDG) methods that utilize emission data have been investigated in positron emission tomography (PET), 14 , 15 where count rates are much higher than in SPECT, and all angular views are acquired simultaneously. However, additional challenges arise in conventional rotating SPECT systems, where projection data are available for only two or three angles at a given time, and count rates are lower. These factors lead to significant noise and variations, challenging routine deployment of the DDG method. 11 Furthermore, to the best of our knowledge, most SPECT systems operate in static or frame mode rather than list‐mode, which is necessary for rebinning projections into chronological time frames. 16 This limitation further hinders the implementation of the DDG method.
Newly developed optical surface imaging (OSI) systems, such as AlignRT (VisionRT, London, Great Britain) and Catalyst (C‐Rad, Uppsala, Sweden), have been introduced in external beam radiation therapy. 17 These systems enable continuous and touchless scanning of patients' external surfaces, and several studies have shown clear correlations between external surfaces and internal structures. 18 , 19 Moreover, implementing this method does not require purchasing expensive equipment, a depth camera alone is sufficient. In addition, the meaning of obtaining accurate respiratory signals is ultimately reflected in the quality of reconstructed images. Unlike rigid‐body motion, which can be corrected using six‐degree‐of‐freedom vectors from an external tracking techniques, respiratory motion involves internal deformations that are more challenging to correct due to its nonrigid nature and its occurrence within the body. 20
As a proof of concept and in preparation for applying the OSI system to SPECT imaging, we developed a novel method for extracting respiratory signals for phase sorting, aiming to validate its feasibility using both simulation and patient data. The results from the OSI system were compared with ground truth data and the DDG method. For motion‐incorporated reconstruction, we integrated the deformation vector field (DVF) into the ordered subset expectation maximization (OSEM) reconstruction of SPECT images and compared the improvement in image quality with MC to that of static or single‐phase reconstructions. To our knowledge, no previous study has applied this method for motion‐compensated reconstruction in SPECT.
2. METHODS
2.1. Digital phantom
The digital 4D XCAT phantom utilized in this study enables the creation of realistic anthropomorphic configurations that are well‐suited for SPECT simulation. 21 Activity images at 10 respiratory phases were generated. A tumor with a 20 mm in diameter was positioned in the liver. The ground truth respiratory signal was determined using Equation (1) and input into the XCAT software. The magnitudes were set to +10 and ‐20 mm for the anterior–posterior direction and diaphragm displacement, respectively. Since the tumor's motion curve is not directly set, its motion depends on its position in the body. This setting follows Huang et al.'s method 22
| (1) |
The activity settings used in the simulation were derived from the study by Kayal et al.,
23
based on SPECT images acquired 1 h after the injection of 6.85 GBq of
‐labeled DOTATATE, as presented in Table 1. The process began with the generation of over 1 billion counts for the activity map, captured by the virtual detectors. These counts were then normalized to match the activity levels of each organ, resulting in 120 noise‐free SPECT projections for each breathing phase.
24
Each projection had dimensions of 128 × 128 pixels. Poisson noise was then added to simulate projections obtained in clinical scans. It is worth noting that this process transfers the expected mean count to the actual number of counts for each scan. Therefore, adding Poisson noise multiple times to the same noise‐free projection can generate different scans under identical conditions, which is crucial for estimating the uncertainty.
TABLE 1.
Activity setting for simulated images in SIMIND.
| Organs | Activity concentration (MBq/mL) |
|---|---|
| Lungs | 0.0445 |
| Liver | 0.2531 |
| Bones | 0.0576 |
| Left kidney | 0.5074 |
| Right kidney | 0.5155 |
| Spleen | 0.5313 |
| Tumor | 0.9564 |
| Background | 0.0526 |
To increase the efficiency of Monte Carlo simulations, we utilized a computer with 64 CPUs, with each CPU running the simulation separately to achieve parallel computation. The dual‐head Siemens SPECT, equipped with a parallel medium‐energy general‐purpose (MEGP) collimator, was simulated to match our clinical protocol on a Siemens Symbia SPECT/CT camera. The characteristics of the collimator, such as transparency, spatial resolution, sensitivity, and so forth, are considered in the SIMIND calculations. The specific parameters of the collimator can be found in the “SI‐ME” mode of the SIMIND “collim.col” file. The “scattwin” scoring routine is employed to simulate the triple‐energy window (TEW) scatter correction method. For
, the photopeak energy range is 187.2–228.8 keV, with the lower energy window at 169.4–187.2 keV and the higher at 228.8–252.9 keV.
Improving the quality of OSEM‐reconstructed images requires more than just projection data. Additional details about the imaging system are essential. SIMIND can generate a density map aligned with the pixel size and matrix dimensions of the SPECT projections, enabling attenuation correction during reconstruction. Furthermore, an ideal SPECT detector would detect only photons traveling perpendicular to the collimator holes. However, this is not the case in practice. The angle at which photons can enter is influenced by the parameters of the collimator and scintillator. To address this issue, point spread function (PSF) modeling has been developed, with parameters that can be derived based on the distance between the source and the detector. 25 As a result, the system matrix used in OSEM reconstruction incorporates geometry transformation, TEW scatter correction, attenuation correction, and PSF modeling.
2.2. Respiratory signals extraction – OSI simulations
Point clouds within the 64 × 64 body surface region of interest (ROI) on the attenuation map were used to simulate the OSI system. To assess the proposed method's robustness against optical signal noise, the Gaussian noise with a standard deviation of 0.5 mm was added to the surface motion, consistent with the accuracy of existing optical surface monitoring systems. 26 A Gaussian mixture model (GMM) 27 was utilized for image registration of OSI across different phases (Figure 1) to obtain the DVF (), as described in Equation (2). The core idea of the GMM registration framework is to represent input point clouds as GMMs. The point set registration problem is then reformulated as aligning two Gaussian mixtures by minimizing a statistical discrepancy measure between them. Finally, principal component analysis (PCA) was applied to reduce the dimensionality of . The weight of the first PCA component () was then utilized as the raw signal for the OSI method.
| (2) |
where and represent the optical surfaces in the reference and target phases, respectively. represents the DVF used for transforming from reference phase to target phase. represents the average DVF; represents each principal component; represents the corresponding weight.
FIGURE 1.

(a) An optical surface image from XCAT phantom body segmentation, with a red square marking the 64 × 64 pixel ROI for respiratory signal extraction. (b) Registration of surface point clouds to the reference phase using the GMM. GMM, Gaussian mixture model; ROI, region of interest.
2.3. Respiratory signals extraction – Data‐driven simulations
We utilized the center‐of‐light (COL) method, one of the most commonly used DDG techniques, to extract respiratory signals for comparison. This method calculates the axial COL of projection counts in objects with respiratory motion. The continuous motion can then be estimated at a specific frequency and converted into organ displacements in the superior–inferior direction. 11 , 28 To minimize the interference from background noise and organ signals, we compared the COL obtained from various preprocessed projections: filtering (applying a 3D filter with 32 × 32 spatial and 2‐temporal dimension box kernel), 29 kidney masking (obtained from the 3D image and projected to different angles), and threshold sets for counting rates. The kidney mask was selected because the kidney's motion is highly synchronized with respiratory phases, owing to its close proximity to the diaphragm. Furthermore, its high activity concentration enhances noise reduction and allows the COL method to extract an optimal breathing signal.
Compared to the time‐based gating scheme, amplitude‐based gating schemes produced SPECT images with fewer motion artifacts. 30 In addition to the flat amplitude phase (EI), the amplitude on one side is equally divided into three gates. Consequently, the entire breathing cycle was divided into seven amplitude‐based segments (as shown in Figure 2). Since the breathing cycle typically lasts approximately 5 s, the XCAT cycle time was set accordingly. To account for the rapid changes during the middle inhalation (MI) and middle exhalation (ME) phases, both were assigned a duration of 0.4218 s in our original curve setting. Therefore, to simplify the problem, the simulated gating projection scan was set to a duration of 0.5 s for applying the COL algorithm. The difference in sub‐projections used for the COL process between full‐cycle and gating scan durations is illustrated in Figure 3. It is shown that the gating image obtained from the 0.5‐s scan appears to be much noisier than that from the standard 15‐s scan.
FIGURE 2.

The respiratory cycle can be divided into seven distinct phases: II, MI, terminal inhalation (TI), end of inhalation (EI), IE, ME, and EE. EE, end of exhalation; EI, end of inhalation; IE, initial exhalation; II, initial inhalation; ME, middle exhalation; MI, middle inhalation; TI, terminal inhalation.
FIGURE 3.

Simulated SPECT projection for the reference phase (end of expiration), Poisson noise was added. A 15‐s scan (ideal breath‐hold) and a 0.5‐s scan (used for respiratory gating) were shown to compare. SPECT, single photon emission computed tomography.
2.4. Validation of OSI method on patient data
To verify our method using real patient data, we utilized data from a patient who underwent a cone‐beam CT (CBCT) scan before receiving external radiation therapy. The CBCT scan ensured accurate patient positioning and verification. 31 During the scan, we positioned a depth camera to record the optical surface information, which was then applied to the OSI method described in Section 2.2. The pronounced difference in x‐ray attenuation between the dense muscle of the diaphragm and the low‐density, air‐filled lungs creates a well‐defined edge in 2D CBCT projections, ensuring the diaphragm's motion is clearly visible. After processing, it can be used as the ground truth for the respiratory signal. 32 The x‐ray voltage is set to 120 kVp, with a current of 20 mA, and the exposure time for each projection is 16 ms.
The depth camera used to capture the patient's surface information is the Azure Kinect (Microsoft Inc., Redmond, WA, USA), an RGB‐D camera popular in research for artificial intelligence applications. It includes a time‐of‐flight depth sensor, which captures depth information by measuring the time difference of amplitude‐modulated continuous wave signals in the near‐infrared spectrum.
33
Each point in the image is assigned a value corresponding to its depth along the Z‐axis, creating a depth matrix. In addition to depth maps, the camera can also capture color images and infrared images. The infrared image has the same dimensions as the depth map and visually represents the objects captured in the depth map. The depth resolution of the camera is 1 mm. The minimum detectable thoracic motion is determined by 1/ mm, where is the angle at which the depth camera is positioned relative to the ground. For example, when is 60
, the smallest detectable depth change is approximately 1.15 mm.
During the experiment, the depth camera is mounted on a bracket at the foot of the bed, angled downward toward the patient. We begin the depth camera acquisition prior to activating the CBCT scanner. Both the CBCT scanner's 2D projections and the depth camera can record their respective start times and frame intervals. Therefore, in this study, we manually correct the time shift in the camera data to ensure alignment with the projection data during analysis. For full technical integration of the camera into the SPECT system, true time synchronization between the two systems would be required, which involves some engineering adjustments and collaboration with scanner manufacturers. The sampling intervals for CBCT and depth camera are 0.182 and 0.033 s (1/30 s), respectively. The data obtained are shown in Figure 4
FIGURE 4.

(a) Image acquired by the depth camera in real‐time during the CBCT scan; (b) 2D projection obtained from the CBCT scan, with the red box indicating the ROI used to extract respiratory signals; (c) Vertical gradient calculation to enhance the clarity of the diaphragm dome; (d) Summation of pixel values along each row; (e) Combination of ROI line maps from all projections. CBCT, cone‐beam computed tomography; ROI, region of interest.
We utilized the Amsterdam Shroud method 34 , 35 to extract respiratory surrogate signals from 2D CBCT projections, as illustrated in Figure 4. The process involved applying a mean filter with a 3 × 3 averaging kernel to smooth the image, followed by calculating the vertical gradient to enhance the edges of diaphragm dome. The ROI of the diaphragm was then selected, and the projection was summed along the coronal plane, converting the 2D image with dimensions 512 × 512 into a line graph of 512 × 1. This procedure was repeated for each projection, and the resulting ROI line graphs were combined to create a 2D image, where the horizontal axis represents the projection number (i.e., time) and the vertical axis represents the position of structures along the craniocaudal axis of the flat panel detector. 36 Finally, the weighted center of the combined ROI line map was calculated by normalizing column values between 0 and 1, yielding centroids for each column that represent the extracted respiratory signal. 37
2.5. Motion‐incorporated reconstruction
To simulate the patient's SPECT scan with an accurate and reliable breathing signal as the basis for gating, we aimed to achieve a motion‐incorporated reconstruction, demonstrating its crucial role in ensuring dosimetry accuracy. Symmetries in the breathing cycle permit only half the data to be used. Using the XCAT digital phantom software and the SIMIND Monte Carlo program, we obtained 25 time points of breathing with a 0.1‐s interval between each point, as shown in Figure 5, to simulate the list‐mode data of SPECT. The position of organs within 0.1 s can be regarded as static,
38
while a gate consisting of multiple 0.1 s intervals will be blurred due to breathing. Consequently, the data for each gate is the average of the time points within its coverage and scaled by its duration. For instance, our test used a 15‐s projection scan time, and the duration times for the Gates 1–4 are 4.2, 2.4, 5.4, and 3.0 s, respectively. Additionally, to simulate images from noise‐less projections, we increased the scan time by a factor of 10. This mode can be achieved through various methods, such as using isotopes with higher gamma yields and more efficient gamma detection (e.g.,
), replacing NaI detectors (as simulated in this study) with more sensitive CZT detectors, extending the scanning time per projection, utilizing equipment with more detectors, or increasing the patient's injection activity. To accommodate these potential improvements, we added this noise‐less mode.
FIGURE 5.

Simulated time points for the motion‐incorporated reconstruction, with black dots indicating the position of each point along the curve. Data from half the cycle is utilized.
The 4D reconstruction algorithm (4D‐Recon) for image reconstruction, proposed by Qiao et al., 39 is defined in Equation (3). We also evaluated the post‐reconstruction registration and summation method for combining independently reconstructed images (post‐Recon, Equation 4). These two motion‐incorporated reconstruction algorithms were compared with images reconstructed without motion correction (non‐MC) and those using single‐phase projections (Gating). The ground truth was established using full‐cycle scans without motion (Static).
| (3) |
where is the image estimate of the iteration; is the system matrix, is the projection data for phase , and is the expected scatter vector for phase (obtained from the TEW scatter correction). is a vector where all elements are equal to 1, and it is used to construct the normalized back‐projection. and are the DVF for transforming between the reference phase to the target phase .
| (4) |
where is the reconstructed image of phase ; represents the DVF for transferring phase to the reference phase.
These algorithms require motion transforms between phases ( and , hereafter referred to as DVF), whose accuracy significantly impacts the effectiveness of MC. 40 The DVF can be obtained by: (i) reconstructing the SPECT projections for each phase independently (Gating SPECT), and (ii) registering the reconstructed Gating SPECT via the target DVF. Additionally, we tested these algorithms by applying the ground truth DVF produced by XCAT using its “vectors” mode (by setting “mode = 4” in the parameter file). This mode outputs the motion vectors for each voxel of the reference phase image to each target phase, providing the ground truth motion vectors. The time point of each phase was set at the middle of each breathing gate to simulate the ideal DVF that can be obtained from 4D CT. To ensure accurate deformable registration for the Gating SPECT, this study utilized the pTV‐reg algorithm as proposed by Vishnevskiy et al., 41 along with their publicly available Matlab package. Gate 3 (TI+IE in Figure 2), with its central position and longer coverage time, serves as the reference gate for both 4D‐Recon and post‐Recon.
The attenuation correction and the TEW scatter correction were applied in all of the reconstructions. All reconstructions in this study were performed using PyTomography, an open‐source Python library initiated by our lab for fast, GPU‐accelerated medical image reconstruction. 25 As the computation time of 4D‐Recon and post‐Recon scales with the number of gates, GPU acceleration is essential for the practical application of this function. To encourage community use, we have released nine tutorials demonstrating how to use the toolbox and integrate the models in PyTomography. The PyTomography project can be accessed at https://github.com/PyTomography. The source codes for this MC study are also made publicly available.
Quantitative evaluation of image quality utilized the recovery coefficient (RC) and contrast recovery coefficient (CRC, Equation 5). To assess the uncertainty of RC and CRC, each algorithm was evaluated using 10 noise realizations with varying random seeds, and the uncertainty range was represented as the mean ± standard deviation. The tumor ROI was determined by the ground truth in the XCAT file. To calculate the tumor's CRC, an ROI in the liver was used, consisting of two 30‐mm diameter spheres positioned within a uniform area.
| (5) |
where and represent the mean values within the tumor and liver regions, respectively, while is the true contrast value between them.
3. RESULTS
3.1. Simulated respiratory signals
The Pearson correlation results for comparing different respiratory signals are presented in Table 2. We applied 20 different noise realizations for the OSI method. In the COL method based on Dual‐head SPECT simulation, two 0.5‐s projections at supplementary angles were combined after flipping one side to reduce the impact of noise. Therefore, we tested 60 different noise realizations for the 120 projections.
TABLE 2.
Pearson correlation between respiratory curves from various methods and ground truth.
| Algorithm | |||
|---|---|---|---|
| Scenario | COL | COL‐filter | OSI |
| Baseline | 0.62 ± 0.20 | 0.71 ± 0.19 | 0.99 ± 0.004 |
| 40% Threshold | 0.64 ± 0.18 | 0.57 ± 0.28 | — |
| Kidney's masking | 0.87 ± 0.07 | 0.90 ± 0.06 | — |
Note: Values are mean ± standard deviation.
Abbreviations: COL, center‐of‐light method; OSI, optical surface imaging.
The correlation between the ground‐truth signal and the OSI‐based signal remains highly consistent and stable (0.99 ± 0.004, p‐0.001 when comparing OSI and COL‐filter with Kidney's masking). The correlations for COL and COL‐filter exhibit unsatisfactory outcomes, with means and standard deviations of 0.62 ± 0.20 and 0.71 ± 0.19, respectively). A noticeable enhancement was observed when using the kidney region mask, achieving 0.87 ± 0.07 and 0.90 ± 0.06 using the filter, both with p‐0.001). However, these still performed less effectively than the OSI method, as further illustrated by the comparison signals depicted in Figure 6, the grey shaded area indicates the uncertainty of respiratory curves when using the COL method with different post‐processing techniques, while the red shaded area represents the OSI method for comparison.
FIGURE 6.

Comparing the respiratory curves from various methods. Different colors represent the error range of each method. The six subplots represent different post‐processing methods of COL. COL, center‐of‐light.
3.2. Validation of OSI method on patient data
Based on the patient data, Figure 7a shows the respiratory signals extracted from CBCT 2D projections and the OSI method. A Gaussian filter with a standard deviation of 2 was applied to the OSI signal to reduce high‐frequency noise. Given that the sampling interval of the OSI (0.033 s) is substantially shorter than that of CBCT (0.182 s), the filtered OSI signal was resampled to align with the lower sampling frequency of the CBCT projection data. This adjustment facilitates a more intuitive comparison of their respective spectral distributions.
FIGURE 7.

(a) Respiratory signals extracted based on the patient data. (b) Spectrum analysis of the respiratory signals.
We used the discrete Fourier transform (DFT) to compare the two respiratory motion signals in the frequency domain, as shown in Figure 7b. The frequency of the respiratory motion was determined by the maximum amplitude on the spectrogram, indicating the largest component proportion. Both the OSI signal and the projection signal showed the same breathing frequency (0.2839 Hz), corresponding to a breathing cycle of approximately 3.5 s.
3.3. Motion‐incorporated reconstruction
For clinical observation, Figure 8a shows the central cross‐section of the tumor area, reconstructed using different algorithms described in Section 2.5. The voxel values are converted from counts to mega Becquerels per milliliter using the scanning time of each projection and a calibration factor derived from a point source simulation in SIMIND. For quantitative analysis, the RC and CRC results for the tumor from noisy (15‐s scan per projection) and noiseless (150‐s scan per projection) are presented in Figure 8b. This figure also displays the uncertainty range (mean ± standard deviation) resulting from 10 noise realizations. They also demonstrate the impacts of applying DVF from Gating images or the ground‐truth data. With GPU acceleration achieved using an NVIDIA Tesla V100, each iteration with six subsets requires approximately 7.5 s of computation. For instance, completing 10 iterations with six subsets would take only about 75 s, meeting the clinical requirements for efficiency.
FIGURE 8.

(a) Cross‐sections of reconstructed images: OSEM with 10 iterations and 6 subsets. (b)RC and CRC using six subsets for OSEM across various iterations. To avoid overlap, the Static and Gating results are shown as the area between two dashed lines. “Noisy” refers to a 15‐s scan per projection in simulations, while “Noiseless” refers to a 150‐s scan per projection. CRC, Contrast recovery coefficient; OSEM, ordered subset expectation maximization; RC, recovery coefficient.
4. DISCUSSION
The respiratory signals extracted by the OSI system show a high correlation with the ground truth, as verified by both simulation results and patient data, as illustrated in Sections 3.1 and 3.2. Furthermore, because this experiment directly observes the patient's diaphragm movement through CBCT,
42
the reference curves are closer to the patient's actual breathing movements.
36
The results shown in Table 2 indicate that OSI offers advantages in terms of accuracy and stability compared to the existing data‐driven methods (p‐value 0.001). It is even significantly better than the COL‐filter with Kidney's masking, which requires an accurate mask to cover the entire breathing cycle of the kidney. Achieving this would lead to additional workload and may need the 4D‐CT. As shown in Figure 6, using kidney masking for COL can significantly reduce uncertainty and make the curves closer to the ground truth. However, it still remains less accurate than OSI due to noisy projection images from the gating SPECT. Thus, if this method is used for guiding motion‐incorporated reconstruction, the EE phase, for example, could be misclassified as the IE phase or other phases, leading to different phases being mixed together. Consequently, the reconstructed images would be similar to non‐MC images. In this study, we simulated
‐DOTATATE, as it is widely used in radiopharmaceutical therapy for neuroendocrine tumors. Further studies could explore various types of radiolabeled agents targeting specific therapeutic applications. For simulations, only the activity settings for each organ, as outlined in Table 1, need to be adjusted.
The higher sampling rate of the depth camera enables it to capture high‐frequency movements, such as sudden patient movements. Additionally, its relatively low cost (approximately $400) increases its potential for wider adoption. The device is also compact and portable, requiring only a standard mounting bracket to ensure compatibility with various systems. Furthermore, the OSI method is a touchless approach that does not require markers to be securely attached to the patient's body, 38 , 43 thereby improving clinical efficiency. Markers may introduce artifacts or affect the accuracy of attenuation correction during reconstruction if they fall within the SPECT signal acquisition area. Additional types of devices have been evaluated for respiratory gating in emission tomography, such as spirometers 44 and the pressure belt‐based systems. 45 However, these devices must be worn by patients and may increase errors and workload. To implement the COL method and motion‐incorporated reconstruction, normal scanning projections must be divided into shorter time‐period sub‐projections. This step is crucial because, without it, organs cannot be assumed to remain nearly stationary within the respiratory gating images. Using the list‐mode data format is essential for capturing this detailed motion data. However, due to significant data storage requirements, clinical SPECT systems typically bin the data. Given the limitations of our current results, we were unable to experimentally verify 4D reconstruction for SPECT images, which will be a focus of our future research.
For the reconstructed images shown in Figure 8a from the 4D digital phantom simulation, as expected, non‐MC reconstruction results in a more blurred image. This situation could happen when performing MC based on nonideal respiratory signals (such as the aforementioned COL method). For quantitative analysis, as shown in Figure 8b, increasing the number of iterations results in a decrease in bias but an increase in variability. This is reflected in the fact that the shaded areas become closer to the true value but expands as more iterations are used. For both noisy and noiseless images, the reconstructed images with MC, in contrast to the non‐MC results, notably enhance RC and CRC. Moreover, a longer scanning time or higher injection dose (noiseless mode) will increase this difference. In addition, as shown in Figure 8b, the 4D‐Recon shows greater accuracy (RC and CRC for “XCAT DVF”) or lower uncertainty (RC for “Gating DVF”) compared to the post‐Recon method. Similar conclusions have been confirmed in several PET studies. 10 , 46 The Gating method achieves bias levels comparable to MC methods; however, due to utilizing only partially acquired data, it incurs higher uncertainty.
Comparing the results of the “Gating DVF” and “XCAT DVF”, the changes in RC and CRC are not significant. This similarity may be due to the localized high‐dose characteristics of the tumor region, which enhance image registration accuracy for the Gating images, making the DVFs from two methods consistent. However, the more accurate DVF appears to provide a greater benefit for the 4D‐Recon compared to the post‐Recon. This is evident in the “XCAT DVF” plots in Figure 8b, where 4D‐Recon outperforms post‐Recon, a trend not seen in the “Gating DVF” plots. Therefore, if this distinction is not critical, we recommend using the Gating DVF for 4D‐Recon, as it is more accessible. However, the post‐Recon and 4D‐Recon methods represent one class of motion compensation approaches, where respiratory motion is first estimated from individually reconstructed respiratory bins and then corrected in the reconstructed images. Other approaches, such as applying MC directly to the 2D projection data before image reconstruction, 47 require further comparative analysis to determine their optimal application scenarios.
Regarding the limitations of this study, the current SPECT scan results are based on Monte Carlo simulations. Therefore, the actual improvement of the OSI method requires further validation through real scans. This involves conducting experiments with an anthropomorphic phantom injected with
, specifically designed to simulate patient breathing. The phantom's thoracic cavity surface should be made from an elastic material that mimics the rise and fall of breathing movements, enabling the depth camera to acquire the signals. Consequently, the method proposed in this paper remains in the proof‐of‐concept stage. Additionally, the study assessed the effectiveness of the OSI method exclusively on data from normal‐sized adult patients. Further validation is necessary to establish its applicability to other patient groups, such as pediatric, obese, and elderly populations.
5. CONCLUSIONS
We evaluated a respiratory phase sorting approach using the optical surface signals and compared it with a data‐driven method, utilizing both simulated and patient data. Overall, our approach combines several advantages, including high accuracy and stability, having non‐contact operation, no need to observe the ROI mask in projections, and better compatibility with various imaging systems. Furthermore, we achieved 4D reconstruction of SPECT by integrating DVF into the OSEM algorithm and confirmed that it produces significantly higher‐quality images compared to those without MC, thereby improving the accuracy of radiopharmaceutical therapy. Additionally, with our GPU‐accelerated PyTomography library, the 4D reconstruction, which we release publicly can be completed in significantly shorter time, enabling extensive studies and applications.
CONFLICT OF INTEREST STATEMENT
The authors have no relevant conflicts of interest to disclose.
ACKNOWLEDGMENTS
This work was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants RGPIN‐2019‐06467 and RGPIN‐2021‐02965; Canadian Institutes of Health Research (CIHR) Project Grant PJT‐180251; National Natural Science Foundation of China (12275012, 12475309, 12411530076, 82202941); Beijing Natural Science Foundation (Z210008); China International Talent Exchange Foundation (JC202502001F); Fundamental Research Funds for the Central Universities/Clinical Medicine Plus X ‐ Young Scholars Project of Peking University (PKU2024LCXQ033); Ten Thousand People Project of Ministry of Education (7111400062).
Li C, Polson LA, Wu X, Zhang Y, Uribe C, Rahmim A. Optical surface information‐based respiratory phase‐sorting and motion‐incorporated reconstruction for SPECT imaging. Med Phys. 2025;52:4330–4340. 10.1002/mp.17769
REFERENCES
- 1. Kovalski G, Israel O, Keidar Z, Frenkel A, Sachs J, Azhari H. Correction of heart motion due to respiration in clinical myocardial perfusion SPECT scans using respiratory gating. J Nucl Med. 2007;48:630‐636. [DOI] [PubMed] [Google Scholar]
- 2. Ivanovic M, Pellot‐Barakat C, Weber D, Loncaric S, Shelton D. Effects of patient motion in coincidence studies on hybrid PET/SPECT system. In: 2000 IEEE Nuclear Science Symposium. Conference Record (Cat. No. 00CH37149) . Vol 3. IEEE; 2000:16‐49. [Google Scholar]
- 3. Pretorius PH, King MA. Spillover compensation in the presence of respiratory motion embedded in SPECT perfusion data. IEEE Trans Nucl Sci. 2008;55:537‐542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Rahmim A, Bloomfield P, Houle S, et al. Motion compensation in histogram‐mode and list‐mode EM reconstructions: beyond the event‐driven approach. IEEE Trans Nucl Sci. 2004;51:2588‐2596. [Google Scholar]
- 5. Dickson JC, Armstrong IS, Gabiña PM, et al. EANM practice guideline for quantitative SPECT‐CT. Eur J Nucl Med Mol Imaging. 2023;50:980‐995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Ljungberg M, Celler A, Konijnenberg MW, Eckerman KF, Dewaraja YK, Sjögreen‐Gleisner K. MIRD pamphlet no. 26: joint EANM/MIRD guidelines for quantitative 177Lu SPECT applied for dosimetry of radiopharmaceutical therapy. J Nucl Med. 2016;57:151‐162. [DOI] [PubMed] [Google Scholar]
- 7. James SS, Bednarz B, Benedict S, et al. Current status of radiopharmaceutical therapy. Int J Radiat Oncol Biol Phys. 2021;109:891‐901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Bastiaannet R, Viergever MA, De Jong HW. Impact of respiratory motion and acquisition settings on SPECT liver dosimetry for radioembolization. Med Phys. 2017;44:5270‐5279. [DOI] [PubMed] [Google Scholar]
- 9. Nehmeh S, Erdi YE, Pan T, Pevsner A, et al. Four‐dimensional (4D) PET/CT imaging of the thorax: 4D PET/CT. Med Phys. 2004;31:3179‐3186. [DOI] [PubMed] [Google Scholar]
- 10. Rahmim A, Tang J, Zaidi H. Four‐dimensional image reconstruction strategies in cardiac‐gated and respiratory‐gated PET imaging. PET Clin. 2013;8:51‐67. [DOI] [PubMed] [Google Scholar]
- 11. Pretorius PH, King MA. Data‐driven respiratory signal estimation from temporally finely sampled projection data in conventional cardiac perfusion SPECT imaging. Med Phys. 2022;49:282‐294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Chen Y, Pretorius PH, Lindsay C, Yang Y, King MA. Respiratory signal estimation for cardiac perfusion SPECT using deep learning. Med Phys. 2024;51:1217‐1231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Bryce‐Atkinson A, Marchant T, Rodgers J, et al. Quantitative evaluation of 4D cone beam CT scans with reduced scan time in lung cancer patients. Radiother Oncol. 2019;136:64‐70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Kesner AL, Kuntner C. A new fast and fully automated software based algorithm for extracting respiratory signal from raw PET data and its comparison to other methods. Med Phys. 2010;37:5550‐5559. [DOI] [PubMed] [Google Scholar]
- 15. Büther F, Dawood M, Stegger L, et al. List mode–driven cardiac and respiratory gating in PET. J Nucl Med. 2009;50:674‐681. [DOI] [PubMed] [Google Scholar]
- 16. Tumpa TR, Acuff SN, Gregor J, Lee S, Hu D, Osborne DR. A data‐driven respiratory motion estimation approach for PET based on time‐of‐flight weighted positron emission particle tracking. Med Phys. 2021;48:1131‐1143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Li G. Advances and potential of optical surface imaging in radiotherapy. Phys Med Biol. 2022;67:16TR02. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Fayad H, Pan T, François Clement J, Visvikis D. Correlation of respiratory motion between external patient surface and internal anatomical landmarks. Med Phys. 2011;38:3157‐3164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Fayad H, Pan T, Pradier O, Visvikis D. Patient specific respiratory motion modeling using a 3D patient's external surface. Med Phys. 2012;39:3386‐3395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Feng B, Gifford HC, Beach RD, Boening G, Gennert MA, King MA. Use of three‐dimensional Gaussian interpolation in the projector/backprojector pair of iterative reconstruction for compensation of known rigid‐body motion in SPECT. IEEE Trans Med Imaging. 2006;25:838‐844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Segars WP, Sturgeon G, Mendonca S, Grimes J, Tsui BM. 4D XCAT phantom for multimodality imaging research. Med Phys. 2010;37:4902‐4915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Huang Y, Dong Z, Wu H, Li C, Liu H, Zhang Y. Deep learning‐based synthetization of real‐time in‐treatment 4D images using surface motion and pretreatment images: a proof‐of‐concept study. Med Phys. 2022;49:7016‐7024. [DOI] [PubMed] [Google Scholar]
- 23. Kayal G, Chauvin M, Vergara‐Gil A, et al. Generation of clinical SPECT/CT images based on Monte Carlo simulation with GATE. Phys Med. 2021;85:24‐31. [DOI] [PubMed] [Google Scholar]
- 24. Gustafsson J, Brolin G, Cox M, Ljungberg M, Johansson L, Gleisner KS. Uncertainty propagation for SPECT/CT‐based renal dosimetry in 177Lu peptide receptor radionuclide therapy. Phys Med Biol. 2015;60:8329. [DOI] [PubMed] [Google Scholar]
- 25. Polson LA, Fedrigo R, Li C, et al. PyTomography: a python library for medical image reconstruction. SoftwareX. 2025;29:102020. [Google Scholar]
- 26. Nguyen D, Farah J, Barbet N, Khodri M. Commissioning and performance testing of the first prototype of AlignRT a and ‐dedicated surface guided radiation therapy platform. Physica Med. 2020;80:159‐166. [DOI] [PubMed] [Google Scholar]
- 27. Jian B, Vemuri BC. Robust point set registration using gaussian mixture models. IEEE Trans Pattern Anal Mach Intell. 2010;33:1633‐1645. [DOI] [PubMed] [Google Scholar]
- 28. Ren S, Jin X, Chan C, et al. Data‐driven event‐by‐event respiratory motion correction using TOF PET list‐mode centroid of distribution. Phys Med Biol. 2017;62:4741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Reymann M, Vija A, Maier A. Method for comparison of data driven gating algorithms in emission tomography. Phys Med Biol. 2023;68:185024. [DOI] [PubMed] [Google Scholar]
- 30. Zhang D, Pretorius PH, Ghaly M, Zhang Q, King MA, Mok GS. Evaluation of different respiratory gating schemes for cardiac SPECT. J Nucl Cardiol. 2020;27:634‐647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Li C, Zhou L, Deng J, et al. A generalizable new figure of merit for dose optimization in dual energy cone beam CT scanning protocols. Phys Med Biol. 2023;68:185021. [DOI] [PubMed] [Google Scholar]
- 32. Chen M, Siochi RA. Diaphragm motion quantification in megavoltage cone‐beam CT projection images. Med Phys. 2010;37:2312‐2320. [DOI] [PubMed] [Google Scholar]
- 33. Kurillo G, Hemingway E, Cheng M‐L, Cheng L. Evaluating the accuracy of the azure kinect and kinect v2. Sensors. 2022;22:2469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Yan H, Wang X, Yin W, et al. Extracting respiratory signals from thoracic cone beam CT projections. Phys Med Biol. 2013;58:1447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Zijp L, Sonke J‐J, van Herk M. Extraction of the respiratory signal from sequential thorax cone‐beam X‐ray images. In: International Conference on the Use of Computers in Radiation Therapy , 2004:507‐509.
- 36. Rit S, van Herk M, Zijp L, Sonke J‐J. Quantification of the variability of diaphragm motion and implications for treatment margin construction. Int J Radiat Oncol Biol Phys. 2012;82:e399‐e407. [DOI] [PubMed] [Google Scholar]
- 37. Dong Z, Yu S, Szmul A, et al. Simulation of a new respiratory phase sorting method for 4D‐imaging using optical surface information towards precision radiotherapy. Comput Biol Med. 2023;162:107073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Pretorius PH, Johnson KL, Dahlberg ST, King MA. Investigation of the physical effects of respiratory motion compensation in a large population of patients undergoing Tc‐99m cardiac perfusion SPECT/CT stress imaging. J Nucl Cardiol. 2020;27:80‐95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Qiao F, Pan T, Clark JW, Mawlawi OR. A motion‐incorporated reconstruction method for gated PET studies. Phys Med Biol. 2006;51:3769. [DOI] [PubMed] [Google Scholar]
- 40. Sun T, Wu Y, Wei W, et al. Motion correction and its impact on quantification in dynamic total‐body 18F‐fluorodeoxyglucose PET. EJNMMI physics. 2022;9:62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Vishnevskiy V, Gass T, Székely G, Goksel O. Total variation regularization of displacements in parametric image registration. In: International MICCAI Workshop on Computational and Clinical Challenges in Abdominal Imaging , Springer; 2014:211‐220. [Google Scholar]
- 42. Cerviño LI, Chao AK, Sandhu A, Jiang SB. The diaphragm as an anatomic surrogate for lung tumor motion. Phys Med Biol. 2009;54:3529. [DOI] [PubMed] [Google Scholar]
- 43. McNamara JE, Pretorius PH, Johnson K, et al. A flexible multicamera visual‐tracking system for detecting and correcting motion‐induced artifacts in cardiac SPECT slices. Med Phys. 2009;36:1913‐1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Didierlaurent D, Ribes S, Caselles O, et al. A new respiratory gating device to improve 4D PET/CT. Med Phys. 2013;40:032501. [DOI] [PubMed] [Google Scholar]
- 45. Chang G, Chang T, Clark Jr JW, Mawlawi OR. Design and performance of a respiratory amplitude gating device for PET/CT imaging. Med Phys. 2010;37:1408‐1412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Feng T, Wang J, Fung G, Tsui B. Non‐rigid dual respiratory and cardiac motion correction methods after, during, and before image reconstruction for 4D cardiac PET. Phys Med Biol. 2015;61:151. [DOI] [PubMed] [Google Scholar]
- 47. Robert A, Rit S, Jomier J, Sarrut D. Data‐driven motion compensated SPECT reconstruction for liver radioembolization. In: Fully 3D Image Reconstruction in Radiology and Nuclear Medicine , 2021:81‐84.
