Validation of CT dose-reduction simulation

Parinaz Massoumzadeh; Steven Don; Charles F Hildebolt; Kyongtae T Bae; Bruce R Whiting

doi:10.1118/1.3031114

. 2008 Dec 15;36(1):174–189. doi: 10.1118/1.3031114

Validation of CT dose-reduction simulation

Parinaz Massoumzadeh ^1,^a), Steven Don ¹, Charles F Hildebolt ¹, Kyongtae T Bae ^1,^b), Bruce R Whiting ^1,^c)

PMCID: PMC2673664 PMID: 19235386

Abstract

The objective of this research was to develop and validate a custom computed tomography dose-reduction simulation technique for producing images that have an appearance consistent with the same scan performed at a lower mAs (with fixed kVp, rotation time, and collimation). Synthetic noise is added to projection (sinogram) data, incorporating a stochastic noise model that includes energy-integrating detectors, tube-current modulation, bowtie beam filtering, and electronic system noise. Experimental methods were developed to determine the parameters required for each component of the noise model. As a validation, the outputs of the simulations were compared to measurements with cadavers in the image domain and with phantoms in both the sinogram and image domain, using an unbiased root-mean-square relative error metric to quantify agreement in noise processes. Four-alternative forced-choice (4AFC) observer studies were conducted to confirm the realistic appearance of simulated noise, and the effects of various system model components on visual noise were studied. The “just noticeable difference (JND)” in noise levels was analyzed to determine the sensitivity of observers to changes in noise level. Individual detector measurements were shown to be normally distributed (p>0.54), justifying the use of a Gaussian random noise generator for simulations. Phantom tests showed the ability to match original and simulated noise variance in the sinogram domain to within 5.6%±1.6% (standard deviation), which was then propagated into the image domain with errors less than 4.1%±1.6%. Cadaver measurements indicated that image noise was matched to within 2.6%±2.0%. More importantly, the 4AFC observer studies indicated that the simulated images were realistic, i.e., no detectable difference between simulated and original images (p=0.86) was observed. JND studies indicated that observers’ sensitivity to change in noise levels corresponded to a 25% difference in dose, which is far larger than the noise accuracy achieved by simulation. In summary, the dose-reduction simulation tool demonstrated excellent accuracy in providing realistic images. The methodology promises to be a useful tool for researchers and radiologists to explore dose reduction protocols in an effort to produce diagnostic images with radiation dose “as low as reasonably achievable.”

Keywords: computed tomography, CT, noise modeling, dose reduction, bowtie filter, tube current modulation, observer study, four-alternative forced-choice (4AFC), just noticeable difference (JND)

INTRODUCTION

The use of computed tomography (CT) in modern healthcare continues to grow rapidly due to its excellent low-contrast tissue resolution, three-dimensional information, and rapid acquisition times.¹ Despite its high diagnostic value, there are concerns about the risks associated with exposure of patients to ionizing radiation,² and the principle of “as low as reasonably achievable” (ALARA) has been proposed as the goal for radiation dose in clinical practice.³ To accomplish this, efforts are underway to control radiation exposure through improved techniques and avoidance of unnecessary examinations. A challenge in this effort is the ability to acquire clinical images at reduced dose for the purpose of studying the effects of increased noise on diagnostic performance. Concerns about radiation risks limit the ability to perform repeated scans on the same patient with different techniques or suboptimal protocols, in order to determine the effects of dose reduction. Patient motion and variation in contrast-delivery time would also make it difficult to acquire identical images differing only in dose; thus it has been problematic to gather data for studies of the impact of noise on diagnostic accuracy.

Given this scenario, an attractive alternative would be a method to simulate reduced-dose images by adding synthetic noise retrospectively to existing images. This approach has been used for projection radiography, where realistic noise can be created by direct addition of properly filtered random-noise fields in the image domain.⁴^,⁵^,⁶ CT images, created by filtered backprojection, however, have nonlocal noise properties, which are very difficult to simulate directly in the image domain.⁷ On the other hand, it is straightforward to inject synthetic noise, corresponding to a decrease in tube current at the same beam quality (kVp) and collimation, into the projection (sinogram) domain to create realistic simulated reconstructions. In fact, it appears that all the major manufacturers have developed noise simulation software for internal testing,⁸^,⁹^,¹⁰^,¹¹ and promising results have been reported.¹² Unfortunately, general application of this method in the academic research community has been hampered due to the lack of availability of raw scan data and file format information, which manufacturers considered proprietary. Newer scan systems now allow archiving of raw data files and vendors are becoming more willing to collaborate with researchers, leading to an increase in research based on sinogram data, as evidenced by recent reports.¹⁰^,¹³^,¹⁴^,¹⁵^,¹⁶^,¹⁷

A key concern with any such simulation tool is the level of realism in the images produced. Publications to date have validated methods with phantom experiments, reporting matched variances that are not statistically significantly different.¹² Details of these techniques and their implementation are limited, however, and there are often caveats about what range of dose reduction or what protocols can be accurately simulated.¹⁰ Furthermore, there has been limited discussion about the sensitivity to changes in noise level in medical images, and what level of accuracy is required for matching noise levels. In this article, a simulation method is described in detail, including: incorporating an accurate CT noise model;¹⁸ experimental methods to determine parameters necessary for the model; and validation experiments with both objective and subjective measures of image quality, to assess accuracy and to establish just noticeable differences¹⁹ (JNDs) in noise simulation. This general approach can be adapted for other scanner systems. Finally, potential applications of this tool are discussed.

METHOD AND MATERIALS

The research goal was to characterize and validate a method for use in tube-current reduction simulation at a constant kVp and detector geometry. A procedure for simulating reduced-dose images was developed, using a model to generate synthetic CT noise, along with methods to experimentally determine the parameters used in the model. Validation techniques were developed to quantitatively measure the agreement of noise properties in both the sinogram and image domain. The realistic appearance of synthetic noise in images was confirmed by performing observer studies with radiologists. Additionally, tests were conducted to ascertain the effect of various model components on visual noise and also to establish the “just noticeable difference”¹⁹ in noise levels for observers.

Simulations and reconstruction procedure

The procedure developed for simulating clinical images corresponding to a given tube-current reduction consisted of the following steps:²⁰

(1)
Data files containing measured sinogram data were exported from a CT scanner using optical disks or network transfer. (Such data access is now available on many multirow detector CT scanners.)
(2)
Scan protocol information was extracted from the sinogram data header, i.e., tube voltage, tube-current modulation, rotation time, measurement period, and collimation. (Format structure of the data files is unique for each scanner design and must be obtained from the manufacturer.)
(3)
Attenuation measurements, A_m=log(S₀∕S_m) (the logarithm of the ratio of unattenuated signal S₀ to measured signal S_m), and the tube current for each gantry measurement were extracted from the data file.
(4)
The sinogram data were converted from a logarithmic attenuation space to a transmission space measurement, T_m=exp(−A_m).
(5)
The transmission sinogram data (T_m) were multiplied by the bowtie filter transmission profile p(d) for each detector position d, and by the incident flux Q₀(g) at each gantry step g to produce measured linear sinogram data, I_m(d,g)=Q₀(g)p(d)T_m(d,g). These factors can be obtained by techniques described in the subsequent Secs. 2C2, 2D2, 2D3.
(6)
The amount of additive variance ν_a required to transform each high exposure measurement to the desired noise level was calculated from the model [Eq. 6] and parameters provided in the Sec. 2D4.
(7)
The simulated signal was obtained by adding the product of a Gaussian random number (MATLAB randn, with zero mean and unity variance) and the square root of added variance (obtained in step 6) to the existing high exposure scan, $I_{sim} = I_{m} + \sqrt{(ν_{a})} randn$ .
(8)
The logarithm of the ratio of unattenuated signal to the simulated signal was calculated to obtain a simulated attenuation sinogram, A_sim(d,g)=log(Q₀(g)p(d)∕I_sim), and this quantity was inserted back into a data file similar to the source file.
(9)
Using vendor-supplied reconstruction software, the new simulated sinogram data were used to reconstruct image slices.

Noise model

An accurate acquisition-noise model is essential for realistic CT current-reduction simulation. The sinogram noise model for a CT system must incorporate factors such as the exiting x-ray spectrum of an object, the instantaneous incident flux level, the position of the detector in the fan beam, and the amount of electronic noise of the scanner. Scanner components included in this model were bowtie filters, tube-current modulation, polyenergetic x-ray spectra, and energy-integrating detectors. Measurement noise was considered to be uncorrelated between detectors, as has been reported by the researchers at State University of New York (SUNY) at Stony Brook.¹³^,²¹ Measured energy-integrating signals can be shown to obey compound Poisson statistics,¹⁸ for which the mean signal $\bar{S}$ and its variance for a given spectrum are

\bar{S} = κ λ,

(1)

σ_{S}^{2} = κ^{2} λ + N_{0},

(2)

where λ is the mean number of total quanta incident on the detector, and κ is a scaling factor dependent on the x-ray spectrum and detector sensitivity. The scaling factor κ was considered to be a constant in this model, although it has a small dependence on spectra for clinical scan conditions.¹⁸ The electronic background noise N₀ of the scanner may be a function of amplifier gain and offset, etc. The mean number of total quanta depends linearly on the mAs and detector size, which allows a straightforward simulation for changes in current at a constant spectrum, i.e., kVp. (Changes in spectra result in more complicated dependencies that were not considered in this research project.)

The measured signal-to-noise ratio was characterized by the noise equivalent quanta (NEQ), defined as the square of the mean signal divided by the signal variance²²

NEQ = \frac{{\bar{S}}^{2}}{σ_{S}^{2}} = \frac{{(κ λ)}^{2}}{κ^{2} λ + N_{0}} .

(3)

For NEQ>20, it is appropriate to use a Gaussian random number generator for synthesis of simulated noise because the central limit theorem ensures normal (Gaussian) distributions for large numbers of events.²³ If Gaussian noise, with mean zero and variance ν_a, is added to each measurement in x-ray flux space, the resulting NEQ may be written as

{NEQ}_{sim, low} = \frac{{(κ λ)}^{2}}{κ^{2} λ + ν_{a} + N_{0, Hi}} .

(4)

This is valid because the synthetic and measured noise mechanisms are independent, normal statistical processes; hence, the total variance is the sum of these variances.

To simulate the CT signal for a lower tube current, let ρ be the ratio of the desired lower tube current mA_sim to the original tube current, ρ≡(mA_sim)∕mA_original⩽1. The mean number of quanta (λ) in Eq. 3 can be replaced by ρλ to give a target NEQ, and an appropriate system noise can be inserted into the high dose scan [Eq. 4] to match the target. Consequently, by equating the NEQ from Eq. 3 to the NEQ_sim,low from Eq. 4,

\frac{{(κ λ)}^{2}}{κ^{2} λ + ν_{a} + N_{0, Hi}} = \frac{{(κ ρ λ)}^{2}}{κ^{2} ρ λ + N_{0, Low}},

(5)

the amount of required additive variance (v_a) can be obtained

ν_{a} = κ^{2} λ (\frac{1}{ρ} - 1) + (\frac{N_{0, Low}}{ρ^{2}} - N_{0, Hi}) .

(6)

The mean of the total quanta (λ), incident on the detector at gantry location g and detector d, is proportional to the quanta flux incident on the object times the transmittance of the object, T(d,g)=exp[−A(d,g)]. The flux incident on the object equals the output of the x-ray tube

Q_{0} (g) = K c m A (g) s,

(7)

(where K is a constant, c is the collimation, mA is the tube current at gantry step g, and s is the measurement time) multiplied by the bowtie filter transmittance [p(d), which is nonuniform across the fan beam at detector position d]. Therefore, λ in Eq. 6 can be written as

λ (d, g) = Q_{0} (g) p (d) T (d, g),

(8)

resulting in

v_{a} (d, g, ρ) = κ^{2} Q_{0} (g) p (d) T (d, g) (\frac{1}{ρ} - 1) + (\frac{N_{0, Low}}{ρ^{2}} - N_{0, Hi}) .

(9)

Note: ρ=1 indicates a full-dose scan with no additional variance, i.e., ν_a(d,g,ρ)=0; ρ⪡1 indicates an extremely low-dose scan with large ν_a(d,g,ρ). The components necessary for the developed model [the function p(d) and constants κ, K, and N₀] were determined empirically by a series of measurements for each scanner, as described below.

Experimental techniques

Methods were developed to determine the parameters required for the noise model in the simulation software. As a validation, the outputs of the simulations were compared to actual measurements in both the sinogram and image domains. Observer studies were performed to confirm the realistic appearance of simulated images. Additionally, the effect of various components of the noise model on simulations was studied. Institutional Review Board approval was obtained for use of data files taken from clinical patient scans.

Scanners and software

Scans were performed on three 16-row scanners (Somatom Sensation 16, Siemens Medical Solutions, Forchheim, Germany) located in the BJC Hospital campus in St. Louis, MO. CT sinogram data files were downloaded from clinical scanners using optical disks or network transfer. The files contained various types of information, including attenuation measurements (A_m—see Appendix A1), the instantaneous tube current for each gantry measurement, and protocol settings (e.g., electronic integration time and collimation width).

MATLAB software (MathWorks, Natick, MA) was used for data analysis, algorithm development, numeric computation, and data visualization. Image slices were reconstructed from sinograms with offline software provided by Siemens Medical Solutions using a research reconstruction code that closely matches the clinical scanner consol reconstruction. Image domain analysis and display were performed with ImageJ,²⁴ (NIH, Washington, DC). For the forced-choice observer tasks, individual reconstructed images were loaded from DICOM files into MATLAB and a combined montage of multiple images [two for two-alternative forced choice (2AFC), four for four-alternative forced choice (4AFC)] was created, which was then loaded into a single DICOM file for display. For all images, the display brightness and contrast were adjusted to be appropriate for the clinical viewing task, e.g., a cadaver head (used to determine observer sensitivity to simulated noise) had a brightness level of 50 Hounsfield units (HU) and a window width of 150 HU. Using ImageJ, observers would sequence through the image sets, with their choices recorded by an attendant.

Scan parameter experiments

To determine the bowtie filter profile and the x-ray flux scaling factor, an empty gantry (air only) was scanned at four current levels [e.g., 50, 150, 300, and 500 “effective mAs” with 120 kVp and 16×0.75 mm collimation. (Effective mAs is the parameter setting available on Siemens CT consoles, used by the technologist to specify image quality for a particular patient protocol. It is defined as the product of tube current and rotation time divided by the beam pitch; thus, it provides a dose surrogate. As such, it does not directly relate to the physical tube current and rotation time. These parameters always were determined independently from the header of the CT sinogram data file for the experimental conditions reported here.)] Two protocols were used, body (pitch of 1.0 and 0.5 s rotation time) and head (pitch of 0.5 and rotation time of 1.0 s), each of which invoked a different bowtie filter.

To determine the magnitude of electronic system noise [N₀ in Eq. 2], a fabricated, poly(methylmethacrylate) (PMMA) cylinder phantom (35.4 cm diameter) was scanned, using a series of tube currents ranging from the lowest to the highest available settings (50, 150, 300, and 500 effective mAs), 120 kVp, 16×0.75 mm collimation, pitch of 1, and 0.5 s rotation time. The combination of the varying attenuation levels in the phantom (maximum log attenuation of 7.7) and the different current levels provided a controlled range of flux levels spanning the whole clinical range of signal intensity (10 natural logarithm units), including reasonable changes of x-ray spectrum due to beam hardening. The phantom was centered on and aligned along the gantry axis by a fixture, such that only the phantom was present within the scanner’s field of view, thus isolating the attenuation properties of the phantom from any extraneous structures, i.e., there was no patient table present in the scan.

To validate the simulation model for use with tube-current modulation,²⁵ an azimuthally asymmetric object, consisting of a human skull, was scanned with two mAs settings (50 and 250 effective mAs), and with tube-current modulation both on and off, using 120 kVp, 16×0.75 mm collimation, pitch equal 1.0, and 0.5 s rotation time. To determine the interscanner variability of the scaling factor K, 17 clinical patient scans, with tube-current modulation on (11 scans) and off (6 scans), were collected from three scanners in the BJC Hospital complex in St. Louis, MO. Noise calculations for the direct air exposure portions outside the scanned object were performed and were assessed in comparison to the tube-current magnitude recorded in the file header, as a check on the accuracy of the tube-current modulation model.

For studies of observer sensitivity to the simulated noise and a determination of the JND in noise levels, a preserved adult male cadaver head was scanned with 50, 150, 300, and 500 effective mAs on a 16-row scanner, using 120 kVp, 16×0.75 mm collimation, pitch equal 1.0, and a rotation time of 0.5 s. Simulated sinograms with different noise levels were used to reconstruct images.

Determination of simulation parameters

Statistical properties of sinogram data

A fundamental assumption used in step 7 of the simulation procedure (Sec. 2A) is that the noise fluctuations are normally distributed. The statistical properties of detector measurements were studied by applying a chi-square (χ²) goodness-of-fit test²⁶ (MATLAB, chi2gof) to the sinogram data, against the null hypothesis that the data were drawn from a normal distribution. For air or detrended cylinder scans, transmittance measurements (e^−A(d,g,r)) of detector d and row r were sampled over G gantry steps (either 2000 or subsets of 1160 each). A χ² test was performed on the measurements for each detector to compute a p value, which is defined as the probability that a χ² value greater than the observed χ² would be obtained from measurements of a true normal population. As discussed by Press et al.,²⁷ the use of the χ² test involves an implicit assumption that the measurement process itself, as well as the fundamental signal, is normally distributed, in particular that systematic outliers in the data can be avoided. In a CT scan, there are sometimes random events, such as current spikes or small objects on the gantry shroud in the field of view, that produce outlying data points that resist detrending and can lower p values by factors of 10. For example, a set of 2000 measurements from a detector may have a p value of 0.005, while analysis of its two subregions of 1000 points will each have a p value greater than 0.1. (The opposite is also observed: two subregions may have p values less than 0.05, but when combined the resultant p value is greater than 0.05.) Press et al. suggest that smaller acceptance criteria (α values) may be appropriate in the presence of such non-normal measurement errors, e.g., using 0.001 rather than the conventional 0.05 criterion. An acceptance criterion for individual p values must also take into consideration that multiple, independent tests were being performed for many detectors, which will lead to random occurrences of small p values even for a normal population. In this case, it is hard to state significance by simply comparing the percentage of detectors that do not satisfy the extreme criterion.¹³ This is often accounted for by adjusting the acceptance criterion as a function of the number of tests, e.g., a Bonferroni correction²⁸ can be applied with α^′=1−(1−α)^1∕n for n tests. (In a test with 672 detectors, the Bonferroni correction for α=0.05 would yield α^′=0.000 076.) This in effect follows the suggestion of Press et al., for a lower acceptance criterion, but reveals little about the characteristics of the total measurement population.

Rather than considering just a single extreme p value as the test for normality, the evaluation that was utilized was based on the distribution of all measured p values, which is required to be uniformly distributed between zero and one for all measurement sets for the Gaussian hypothesis to be valid. The p values generated from the above χ² test for each detector were analyzed by Fisher’s method²⁹ with a test statistic $Χ_{FM}^{2} = - 2 \sum_{d = 1}^{N_{d}} \log (p_{d})$ , which is defined as a χ² test having 2N_d degrees of freedom. This was evaluated using the complement of an incomplete gamma function (MATLAB, gammainc). The extremum probability (p_X value) that $Χ_{FM}^{2}$ is greater than that of a normal distribution is then compared with the conventional criterion α=0.05.

These tests were applied for the range of four currents (50, 150, 300, and 500 effective mAs) for both an empty gantry air scan and a highly attenuating object (35.4 cm PMMA cylinder) using the body protocol. In the case of the cylinder phantom, detectors in the region near the cylinder edge experience large variations in signal due to small positioning errors coupled with high gradients in the object profile;³⁰ therefore, these regions were excluded from the analysis. Low-frequency signal changes, such as tube-current or detector instability, were often present in addition to the stochastic quantum noise in the measurements; thus, a detrending operation was performed using a cross-gradient operator,³¹ see Appendix A2. The primary beam through the center of the cylinder was attenuated by maximum of about e^−7.7, corresponding to intensities that were about 2000 times less than those in air. This provided the lowest flux conditions that could be experimentally measured. The p_X values (obtained by Fisher’s method to characterize all the detectors in a given scan experiment) were themselves analyzed by Fisher’s method to test the overall Gaussian nature of sinogram data.

Bowtie filter profile

The following technique was used to determine the form of the bowtie filter profile, p(d). Because the x-ray path length through a patient is typically smaller in the periphery of the body relative to its center, bowtie filters shape the intensity of the incident x-ray beam into a nonuniform flux across the fan beam to minimize dose to the patient³² and to obtain equal signal over all rays exiting the patient. Specific bowtie profiles are provided for scans of particular parts of the anatomy, e.g., head bowties have a narrower fan beam of flux compared to body filters. The result of the bowtie filter is a reduced flux in the outer region of the fan beam. If there is no attenuating object present, this creates higher noise levels in periphery regions relative to the isocenter. CT scanners are calibrated to produce data for filtered backprojection that will have a uniform mean attenuation of zero for air; so a scan of an empty gantry produces a sinogram with essentially constant mean value (zero) for attenuation (or unity for transmittance) for each detector across the fan beam; this profile for one detector row is shown in Fig. 1a. However, due to the variation in flux level across the fan beam, variance across the field of view of detectors is not constant [Fig. 1b]. [Note: Engel et al.,³³ recently demonstrated that the presence of objects adds measureable scatter signal (up to 10% of primary flux) in direct exposure regions. This effect was not analyzed in this study.] As is shown in Appendix A1, measuring the noise in a detrended transmission signal as a function of detector position d provides an estimate of the relative transmission of the bowtie filter, p(d).

The effect of bowtie filter on signal statistics. (a) Plot of mean transmittance signal as a function of detector position for air scans at 50 (points) and 500 mA s (line) in a 16-row scanner, with mean transmittance of $\bar{T_{d} (50 mA)} = 1.0002 \pm 0.0019$ and $\bar{T_{d} (500 mA)} = 1.0001 \pm 0.0009$ over 2000 gantry steps. (b) Plot of the variance of transmission signal for the same two scans. While the means are similar and relatively constant, the variance is non-uniform, being greatest in the low-flux periphery areas, and its magnitude is inversely proportional to the current.

The variance of the cross-gradient (Appendix A2) at each detector position was computed for multiple gantry positions [e.g., 1400 (for head protocol) or 2500 (for body protocol) gantry steps], and the profile of this variance versus detector index was fit to an eighth-order polynomial function; the normalized inverse of this fit serves as the bowtie filter transmission profile, p(d). The accuracy of the profile was estimated by computing the root-mean-square (rms) of the relative error between the measured variance and the inverse of the fitted profile, $E_{rms} = \sqrt{1 ∕ N_{d} \sum_{d = 1}^{N_{d}} {(σ_{T}^{2} (d) - (1 ∕ p (d) λ (d)) ∕ (1 ∕ (p (d) λ (d)))))}^{2}} = \sqrt{\sum_{d = 1}^{N_{d}} {[1 - p (d) λ σ_{T}^{2} (d)]}^{2} ∕ N_{d}} .$ As shown in Appendix A4, E_rms must be corrected for bias due to stochastic fluctuations of the measurements. (For comparing measurements to the profile, there is no noise contribution to bias from the profile polynomial.)

Incident flux

The following technique was developed to measure the scaling factor relating the tube current to the x-ray quantum flux. Solving Eq. 7 for the scaling factor K gives

K = \frac{Q_{0} (g)}{c m A (g) s} .

(10)

As shown in the Appendix A1, Q₀(g) is inversely proportional to the variance of the transmittance signal $σ_{T}^{2} (d, g)$ and the bowtie filter profile, p(d). Therefore, Q₀(g) can be estimated by averaging the detrended noise $σ_{CG}^{2} (d, g) = σ_{T}^{2} (d, g) ∕ 4$ given in Appendix A4 over all detectors for an air scan $[\bar{T} (d, g) = 1]$ ,

Q_{0} (g) = \frac{1}{N_{d}} \sum_{d = 1}^{N_{d}} \frac{1}{4 p (d) σ_{CG}^{2} (d, g)} .

(11)

K can be estimated by averaging Eq. 10 over all gantries

\bar{K} = \frac{1}{N_{g}} \sum_{g = 1}^{N_{g}} [\frac{\bar{Q_{0} (g)}}{c m A (g) s}] .

(12)

To determine the intrascanner variability of K, air scan measurements were performed at four different current levels (50, 150, 300, 500 effective mAs), and a mean and standard deviation for K was calculated. To estimate interscanner variability, means of K for three different scanners in the BJC Hospital complex were calculated and a standard deviation computed.

A variation of this technique can be useful in determining the flux level in any clinical scan, including those using tube-current modulation, even if the instantaneous tube current value is not available. Most scans have sinogram data that include some regions outside the patient with direct exposure (air only) measurements. These air regions in the sinogram data can be segmented, e.g., by thresholding for transmittances greater than 0.99, to collect samples with direct exposures. The variance of cross gradient of the signal from the selected segmented regions normalized by the bowtie filter profile represents the flux (Q₀) of the scan given by Eq. 11. This method can thus be used to measure tube-current modulation or to validate tube-current parameters extracted from data file headers.

Electronic system noise

Electronic system noise is several orders of magnitude smaller than the quantum noise present in the direct beam, and hence, only becomes appreciable in the presence of a highly attenuating object. A method for measuring this component is described by Whiting et al.¹⁸ and involves scanning a precisely aligned cylinder object; fitting the sinogram profile signals to a parameterized model; and obtaining the signal statistics from fluctuations about this mean “reference truth” model. From such measurements, variance can be plotted as a function of signal magnitude, with the intercept of the fitted linear equation being equal to the system noise.

A less complicated and more convenient estimate of system noise was obtained by scanning a highly attenuating cylinder and assuming that the sinogram attenuation profile of the cylinder was stationary during one rotation. (A cylinder could be centered to within 3–4 mm of the isocenter, which leads to systematic intensity modulation in the profile near the edges of the cylinder that would distort estimates of stochastic noise. The magnitude of this systematic modulation as a function of position in the cylinder profile was estimated by the methods of Whiting and Muka³⁴ or Earl.³⁰ Data were analyzed only in regions where stochastic noise comprised more than 98% of the signal energy, which was typically the central 200–300 detectors.) The mean and variance of the transmittance signal [T(d,g)=e^−A(d,g)] were calculated for each detector location. Rewriting Eqs. 1, 2 in terms of the transmittance gave the measured variance in terms of the measured mean, $σ_{S}^{2} = κ \bar{S} + N_{0}$ , with two adjustable parameters, κ and N₀. These parameters were determined by using a nonlinear Nelder–Mead simplex optimization (MATLAB, fminsearch) to minimize the mean-square relative error between the measured and modeled variance, averaged over all detectors (see Fig. 2). Edges of the cylinder, which contain high variance due to positioning sensitivity caused by large edge gradients, were excluded. This resulted in an estimate for the incident flux scaling factor (κ) and system noise (N₀).

Plot of transmittance variance divided by mean transmittance as a function of detector position for scan of a 35.4 cm cylinder for 50 mA s. For uniform flux (no bowtie) with no additive system noise, this would be expected to be a constant, but the bowtie filter and system noise cause excess variance. Points represent the measurement and the solid line represents the noise model of Eqs. 1, 2, with parameters (K=3242 and N₀=38) selected to minimize the square of the relative error. The dotted line corresponds to cylinder profile (not to scale) as a function of detector position.

Influence of model components

The effect of the bowtie filter and background noise on image properties was observed by producing simulated images with different combinations of these components enabled, such as including or excluding the bowtie filter in the simulation of low-dose cylinder scan from a high-dose cylinder scan. Noise statistics (means, standard deviations) were measured in regions of interest (ROI) in the images, and comparisons were made between them. In addition, visual inspection of simulated clinical images, with and without the components, revealed the visual impact they had on image features.

Simulation validation

To validate the dose-reduction simulation, statistics from both the sinogram and image domain were compared for low-dose simulations (obtained from high-dose scans) and actual measured low-dose scans. Observer studies were also conducted to assure that simulated images appeared realistic.

Quantitative analysis in sinogram domain

Scans of a 35.4-cm diameter PMMA cylinder were obtained with four current levels (50, 150, 300, and 500 effective mAs), using 120 kVp, 16×0.75 mm collimation, pitch equal 1.0, and a 0.5 s rotation time. Sinogram data at the higher effective mAs were used as the basis for a lower-dose simulated sinogram. In the sinogram data domain, the variance was calculated for all detectors over 3000 gantry steps, as was the rms relative error between the original and simulated scans $[rmsRE = \sqrt{{((original - simulated) ∕ simulated)}^{2}}]$ . As shown in Appendix A4, this rms relative error has a bias due to the stochastic noise in the variance measurements; a correction was applied to the rms relative error by subtraction of the computed bias from the mean-square relative error. The magnitude of this effect was estimated by performing two simulations with the same data and same parameters—the magnitude of this rms relative error was 7.29%, in agreement with a calculated bias of 8.3%.

Quantitative analysis in image domain

In the image domain, multiple ROIs were selected, and the means and standard deviations were calculated for corresponding locations in both original and simulated slices. For cylinder phantom scans, eleven slices were selected from images and ten ROIs were chosen in each slice [Fig. 3a], resulting in a total of 110 ROIs for comparison. Simulated lower-dose scans were prepared from higher dose scans: three sets for 50 mAs (from 150, 300, 500), two sets for 150 mAs (from 300 and 500), and one set for 300 mAs (from 500 mAs ). This gave a total of six sets of simulated ROIs to be compared with three sets of original ROIs, for a total of 660 comparisons.

Images of cylinder and cadaver head, showing selected ROI. Statistics for each ROI were collected and analyzed. Plot of image-domain noise shown in Fig. 12 to determine accuracy of simulations.

For cadaver-head scans, 31 slices were selected, with seven ROIs in each slice [Fig. 3b], including areas of exterior air and interior areas of bone and soft tissue, for a total of 217 ROIs per current level. Again, four current levels were acquired, with three original measurements and six simulated measurements from the higher dose scans, for a total of 1302 ROIs for comparison. Between scans, the cadaver head changed orientation slightly, so that reconstructed images could not be exactly aligned and the difference image contained major artifacts. Therefore, the means of ROI statistics were calculated for exterior and interior regions of each scan, and the relative error between means was reported, rather than computing the rmsRE on a pixel-by-pixel basis.

The rmsRE between the standard deviations of the ROIs was reported as a measure of accuracy. Note that, due to the stochastic noise of individual scans, even with a perfect simulation model there would be a finite rms relative error between any two scans. The analysis in Appendix A4 provides an estimate of this variance floor as a function of individual measurement variances, and a correction was applied (subtracting a variance floor from the measured variance) to remove this bias from the reported results.

Image domain observer study

In addition to quantitative testing of simulation results, it is important that observer studies be performed to insure that simulations appear realistic to radiologists. Images of a cadaver head were used to conduct a four-alternative forced-choice observer study³⁵ (4AFC) for the ability to distinguish simulated from original images. Scans of the cadaver head were performed at four current levels (50, 150, 300, and 500 mAs ) (Fig. 4). The raw data from the higher current levels were used for a simulation of images at the lowest current level of 50 mAs , providing images from four source currents (150, 300 mAs , and two instances from the 500 mAs scan). Six original and 12 simulated image slice positions (out of 31 reconstructed slices available) were selected for the observer experiment. Sets of four images were created, with a random selection of one original and three simulated. The four images were randomly placed in the four-up matrix. The observer was instructed to select one image (the “original”) that had different noise characteristics from the other three. Forty sets of four images (total of 160 images) were displayed one set at a time, and an attendant manually recorded the position of the image choice selected as the original by the observer. The relative frequency with which an observer picked each image from the set was calculated, with the expectation that a 25% frequency would indicate random selection (no difference between the original and simulated images). Because the selection is a binomial process, the responses are not expected to be normally distributed. (Normal curves were fit to the relative frequencies and the normality of the data distributions were tested with Shapiro–Wilk W tests. If data were not normally distributed, the arcsine transformation was performed on the relative frequencies,³⁶ and the normality of the transformed data was tested.) The arcsine-transformed data were entered into a repeated measures analysis of variance. Statistical analyses were performed with JMP Statistical Software (SAS Institute, Inc., Cary, NC).

Image of four-up cadaver head used in 4AFC study. Upper left simulated 150–50 mAs ; upper right original 50 mAs , lower left simulated 300–50 mAs , and lower right simulated 500–50 mAs .

Six radiologists with a mean clinical experience of 10.7 years (range: 7–17 years) participated in the experiment. Before the study, they received the following orientation in distinguishing between the original and simulated images. Two image sets were provided. Each set consisting of the original images at all four dose levels presented next to each other, as well as samples of lower-dose simulated images generated from the three higher-dose raw data scans (same slice location as shown in first step) presented next to the corresponding original image (total of 12 sample dose levels). The observers were allowed to view all samples with no time limit and an attendant was present to answer any questions.

Just noticeable difference

To establish guidelines for the accuracy required in noise simulation, experiments were performed to determine the amount of difference between image noise levels that could be discerned by observers. This was measured as the JND,¹⁹ which is defined as the point at which an observer operates midway between random guessing and perfect certainty. A 2AFC experiment was conducted, for which the JND performance level is 75%{=[random guessing(50%)+perfect certainty(100%)]∕2}. Images were prepared from a 500-mAs cadaver head scan, containing 151 slices; all the image slices had nominally comparable noise levels. A reference low-dose simulated noise level was set at 50 mA s and compared with five higher-dose levels, with dose differing by 10% (55 mAs ), 20% (60 mAs ), 30% (80 mAs ), 40% (90 mAs ), and 100% (100 mAs ). For each of the above noise levels, 40 slices were randomly selected from the 151 slices available, and paired simulated images at the two different dose levels were displayed next to each other, with a random order for the position of the lower dose image (Fig. 5). Observers were asked to select the image in the pair that had lower noise, which was recorded by an attendant. The percentage of correct choices was calculated for each noise level; no detectable difference in noise levels is expected to result in a 50% correct rate, while a large difference in noise levels would give a 100% correct rate. Results of percent correct were plotted versus dose level for the group average and for each individual to determine the dose level at which the JND rate was 75%.

Presentation image for two cadaver head slices used in JND study (simulated images were prepared from a 500-mAs cadaver head scan). Image on the right is simulated at 50 mAs current level, left image is simulated at 60 mAs current level, corresponding to 20% difference in dose level and a 10% difference in standard deviation.

Five radiologists with a mean clinical experience of 11.4 years (range: 7–17 years) participated in the study. Each received an orientation session to the observer task, consisting of viewing one image slice that was simulated with varying noise levels, with the four different noise levels presented side-by-side for inspection.

RESULTS