Abstract
A theoretical framework for detection or discrimination tasks with list-mode data is developed. The object and imaging system are rigorously modeled via three random mechanisms: randomness of the object being imaged, randomness in the attribute vectors, and, finally, randomness in the attribute vector estimates due to noise in the detector outputs. By considering the list-mode data themselves, the theory developed in this paper yields a manageable expression for the likelihood of the list-mode data given the object being imaged. This, in turn, leads to an expression for the optimal Bayesian discriminant. Figures of merit for detection tasks via the ideal and optimal linear observers are derived. A concrete example discusses detection performance of the optimal linear observer for the case of a known signal buried in a random lumpy background.
1. Introduction
This paper is the fifth in a sequence of contributions to the theory of objective or task-based assessment of image quality [1–4], colloquially referred to as the OAIQs. The premise of these papers is that image quality for medical or scientific images must be defined in terms of the ability of some observer to extract useful information from the images. This task-based approach to image quality is expounded in the book Foundations of Image Science [5], and it is developed in detail in many additional papers in this journal [6–22] and elsewhere [23–26]. These papers consider a variety of detection and estimation tasks, and they treat human observers as well as mathematical model observers, including the ideal Bayesian observer for detection and discrimination tasks, and the ideal linear observer, known in the image-quality literature as the Hotelling observer [27]. Many of them also consider the channelized Hotelling observer [28], which is often a good predictor of the performance of a human observer on a detection task.
One thing that all of these papers have in common, however, is that they deal with images stored as arrays of pixel values or other forms of histograms. In the present paper we consider imaging systems in which a fundamentally different form of data storage, called list mode, is used. This storage mode is useful, and often unavoidable, when individual photons are detected and characterized by their position on the detector, time of detection, energy, or other attributes.
A wide variety of photon-counting detectors is used in such systems. For optical imaging, a common approach [29–32] is to use a microchannel plate or other image intensifier so that a single optical photon on the detector input can produce many secondary electrons on the intensifier output. The secondary electrons are then converted to electrical output signals by an array of electrodes or by an output phosphor and an array of photodetectors. Alternatively, the optical photons can be incident on a large-area avalanche photodiode (APD), and the amplified charge can be collected by electrodes along the edges of the APD [33]. In all of these cases, one incident optical photon produces a set of electrical signals from which the two-dimensional (x, y) position of the photon can be estimated, and it is also straightforward to estimate its time of arrival. Thus detectors in this category estimate three attributes, x, y and t, for each detected optical photon. These estimated attributes are continuous variables, and there is no need to bin them into pixel arrays or into discrete time frames.
Optical or infrared imaging spectrometers [34] can include photon-counting detectors and estimation of spatial and spectral coordinates.
Similar methods are used for detecting x-ray or gamma-ray photons and estimating their attributes. A prime example is the Anger scintillation camera [35], which has been the standard imaging instrument in clinical nuclear medicine for over 50 years. Used to image the distribution of a gamma-ray emitting radiopharmaceutical in the patient’s body, the Anger camera consists of a parallel-bore collimator or pinhole to form the image, a scintillation crystal to convert each absorbed gamma ray to a flash of light, a thick glass window to allow the light to spread, and an array of photomultipliers (PMTs) to collect the light and convert it to electrical signals. Because of its high energy, each gamma-ray photon produces thousands of optical photons, just as if an image intensifier were present, and each PMT detects some fraction of the light from each flash. All of the resulting signals are used to estimate the location within the scintillation crystal where the gamma photon was absorbed, and the final spatial resolution is much better than the size of a single PMT. As the crystal is necessarily thick in order to absorb the gamma rays efficiently, it is helpful to estimate the depth of interaction z as well as the lateral position, (x, y). It is also possible to estimate the energy of the gamma rays from the PMT signals, which is useful in discriminating against photons that have undergone Compton scattering in the body. Finally, because the radiotracer in the patient can vary with time, it is useful to record the time of arrival of each gamma ray. Thus each gamma photon detected in an Anger camera can be characterized by five attributes: interaction position (x, y, z), energy ℰ and time of arrival t. Moreover, when the camera is used in single-photon emission computed tomography (SPECT), images must be acquired from multiple directions, and we can add an angular variable specifying the projection direction, for a total of six attributes per gamma-ray event.
Even more attributes can be measured in positron-emission tomography (PET) [36], where the radiopharmaceuticals are labeled with isotopes that decay by emission of positrons (antielectrons). A positron has a range of a few millimeters in tissue, and at the end of this range it annihilates with an electron and produces two simultaneous gamma-ray photons, each of energy 511 keV, moving in opposite directions. Typically a ring of scintillation detectors surrounding the patient captures the two coincident photons, and in the most sophisticated systems, x, y, z, ℰ, and t are estimated for each photon, making a total of ten estimated attributes per nuclear decay event.
Similar photon-counting systems are applicable in many other fields. X-ray computed tomography (CT) is now using photon-counting semiconductor detectors with energy resolution [37, 38], and advanced maximum-likelihood estimation methods can be used to get sub-pixel spatial resolution [39, 40]. Compton telescopes in gamma-ray astronomy [41–43] necessarily record multiple attributes of two or more interactions produced by a single incident gamma photon.
By analogy to PET, photon-counting detectors and attribute estimation also readily apply to other forms of two-photon emission, e.g., spontaneous parametric down-conversion [44] or two-photon emission from semiconductors [45]. Even quantum-entangled two-photon imaging and interferometry [46, 47] could be performed with these methods.
When many attributes are estimated for each photon event, it may be impractical to store them in conventional binned arrays. If we measure P attributes and bin each to B bits, then it requires 2BP bins to store the data. For example, if we measure six attributes and discretize each to 8 bits (256 bins per attribute), then we need 248 ≈ 3 × 1014 bins. Because the total number of detected events is certain to be small compared to 1014, almost all of these bins would contain no data, with a small fraction containing one event each.
The alternative to binning is a simple list of the attributes. If J events are detected and P attributes of each are estimated to 8-bit accuracy, we need PJ bytes to store the whole data set. For example, 109 events with six attributes per event requires a manageable 6 GBytes of storage in list-mode. List-mode data storage is now routine when photon-counting detectors are used in medical tomography [38], and it is increasingly being used in other imaging methods as suitable detectors are developed.
There is a large literature on tomographic reconstruction from list-mode data [48–56], but relatively little has been done on task-based assessment of image quality. Barrett et al. [57] developed an expression for detectability of a nonrandom signal on a nonrandom background from list-mode data, and Parra and Barrett [51] considered simple estimation and detection problems in time-of-flight PET. A number of papers have also treated detection performance on tomographic images reconstructed from list-mode data, using observer models that are standard for pixel-array images [49, 58, 59]. Figures of merit obtained this way thus apply to the joint quality of the raw list-mode data and the reconstruction algorithm.
The goal of this paper is to develop a rigorous statistical theory of image quality for detection or discrimination tasks with list-mode data. The theory includes the randomness of the objects being imaged; randomness in how each object generates data in photon-counting detectors; randomness in the resulting estimated attributes, and the effect of various mathematical observers, including the ideal observer and the Hotelling observer. In order to focus on the intrinsic quality of list-mode data sets, these observers act directly on the attribute lists, and image reconstruction is not considered. The theory is equally applicable to imaging of optical photons and x-ray or gamma-ray photons.
This paper is organized as follows. Section 2 introduces the mathematical notation and derives basic results concerning the ideal (Bayesian) observer, which are then applied in Sections 3 and 4 to two relevant cases: detection of a known signal in a known background, and detection of a known signal in a random background, respectively. Section 5 considers optimal linear detection with list-mode data, and concepts developed in this section are applied in Section 6 to a concrete detection problem. Finally, Section 7 concludes this paper by summarizing the main results.
2. General Theory
Much of the terminology and the basic statistics used in this paper are found in an earlier paper entitled “List-mode Likelihood” [57], and further discussion is given in [5].
2.A. Terminology and Notation
An object being imaged is assumed to be a scalar-valued function of spatial position, independent of time for simplicity. When we wish to emphasize its functional dependence, we denote the object as f(r), where r is a two-dimensional (or three-dimensional) position vector. The units of f(r) are emitted photons per unit area (or volume) per unit time.
We shall also use the notation f to mean a vector description of the object; in principle, f is the vector in Hilbert space that corresponds to the function f(r), but in computer studies f may be a finite expansion in pixels, voxels or other basis functions [5].
The object is viewed by some photon-counting imaging system either for a specified time or until a specified number of detection events has been reached. In an earlier publication [57] these two modes were called preset-time and preset-counts, respectively, and it was shown that slightly different likelihood expressions were obtained in the two cases. In this paper we assume a preset acquisition time, denoted τ, so the number of counts J is a random variable, obeying Poisson statistics.
Each detection event has some attributes, such as position, energy and time of arrival. For the jth event, these attributes are collected into a P-dimensional attribute vector Aj, (j = 1, …, J), where P is the number of attributes per event and J is the number of events. We do not perfectly measure these attributes, but instead estimate them from detector signals. The estimated attribute vector for one event is denoted Âj, and the full list of measured attributes for J events [57], is
where Âj is referred to as a list entry, and the curly brackets denote a set. In preset-time mode, the data set G for inferences about the object consists of the J list entries and J itself, so we define G ≡ (𝒜̂, J).
2.B. Statistical Models
For list-mode image reconstruction [48–52,56,60] or for assessment of image quality from list-mode data [51, 57], we need a likelihood, which is the probability of G conditional on some hypothesis Hk about the object. For a detection or classification task, Hk is the hypothesis that the object was drawn from class k. For an estimation task the hypothesis is that some parameter vector takes on a certain value from a continuous or discrete range of values.
By the usual rules of conditional probabilities, we can write the likelihood as
The notation here needs some comment. We customarily [5] use pr(…) for the probability density function (PDF) for a continuous-valued random variable or vector and Pr(…) for the probability of a discrete-valued random variable or vector. Here, G is a random vector with both continuous and discrete components, so we use pr(…) for this mixed statistical description. Moreover, subscripts on pr(…) and Pr(…) are omitted unless needed to avoid ambiguity, and the random quantity is assumed to be the argument of the function. Otherwise the notation is conventional, with pr(𝒜̂|J, Hk) being read as the PDF of 𝒜̂ conditional on J and Hk, and Pr(J|Hk) indicating the probability of J conditional on Hk only.
If the objects being imaged are random, we can express the likelihood as an expectation with respect to the random objects:
| (1) |
where 〈…〉 denotes ensemble average. In principle the integral above is over all components of the object vector, though in practice the expectation might be performed by Monte Carlo sampling. Unless otherwise noted, all integrals in this paper are assumed to run over the full, usually infinite, range of the variables of integration.
For a given object, the events are statistically independent, so
and hence
Further applying familiar rules of probability calculus, we find
where pr(Aj|r) is the probability density function of the true attribute vector conditioned on the event having originated at a point r in the object.
Now, with
and (1), the likelihood is given by
From earlier work [5, 57], we know that
| (2) |
where s(r) is the system sensitivity, defined as the probability that a photon emitted at point r is recorded in the list. Thus pr(Âj|f) can be written in either of two equivalent forms:
| (3) |
With the first of these forms, the overall probability density of the list-mode data set is
| (4) |
Note that, conditioned on f, J is a Poisson random variable, so Pr(J|f) can be written as
where
| (5) |
Thus a factor of [∫dr f(r)s(r)]J in (4) cancels, and we find
In this expression, the task is defined by pr(f|Hk); the modality is defined by pr(Aj|f) or equivalently, ∫dr pr(Aj|r) pr(r|f); and the properties of the detector are characterized by pr(Âj|Aj).
With the second form in (3), which does not show the role of the detector explicitly, we have
| (6) |
where, we recall, the angle brackets 〈…〉f|Hk denote an average over objects in class k.
Other equivalent forms are
| (7) |
3. Detection of a Known Signal on a Known Background
3.A. Likelihood Ratio
For an SKE/BKE (signal known exactly, background known exactly [5]) detection task, Hk is specified by the known object, which we can denote as f0 in the signal-absent case and f1 ≡ f0 + Δf in the signal-present case. Thus [cf. (7)],
The expression above is obtained from (7) by assuming pr(f|Hk) = δ(f − fk), where δ(…) denotes the delta function. The log-likelihood is given by
The ideal observer uses the likelihood ratio Λ(G|f0) = pr(G|H1)/pr(G|H0) or its logarithm λ(G|f0) = log Λ(G|f0) as the test statistic. For the present SKE/BKE problem, the likelihood ratio is given by
| (8) |
and its logarithm becomes
| (9) |
If the signal is weak, the logarithms can be expanded to yield:
| (10) |
3.B. Figures of Merit for SKE/BKE Detection by an Ideal Observer
We can use the log of the likelihood ratio from (10) to compute an ideal-observer detectability by noting that the terms in the sum in (9) are independent and identically distributed, so the log-likelihood is normally distributed by the central-limit theorem. Normal log-likelihoods have some powerful properties, discussed in detail in Sec. 4C of [3]. In particular, the means and variances of a normal log-likelihood under the two hypotheses must be related by:
| (11) |
where the angle brackets denote expectations with respect to the list-mode data under the subscripted hypotheses; thus 〈λ〉k is a shorthand for 〈λ(G)〉G|fk ≡ ∫dG λ(G)pr(G|fk).
The ideal-observer SNR in this problem is given by [3]
Moreover, for any normally distributed test statistic, the area under the ROC curve (AUC) is given by [5] , where erf(…) denotes the error function. Thus, because the log-likelihood is normal, the ideal-observer performance, specified by either SNR or AUC, is fully determined by 〈λ〉0.
Let us now compute 〈λ〉0 for the expression in (10), term by term. The first term, −τ∫dr Δf(r)s(r), is not random in this SKE/BKE problem, so it is unchanged by the averaging. The average of the second term is given by
where, in the second line we have used (4) and in the fourth line we have used ∫dÂj pr(Âj|r) = 1. Finally, we note that all J terms in the sum yield the same result, so, with (5),
which just cancels the nonrandom first term in (10).
If we truncate (10) after the terms quadratic in Δf and perform manipulations similar to the above, we obtain
| (12) |
Thus the ideal-observer SNR2 for a weak-signal SKE/BKE problem is given by
| (13) |
Note that SNR2 increases linearly with the acquisition time and hence with the mean number of photons in the data set.
An equivalent form is
This form is useful if we wish to evaluate SNR2 by drawing Monte Carlo samples of Â.
4. SKE/BKS Detection
The assumption of a known background in Sec. 3 is obviously unrealistic, and we should treat the background as stochastic. In this section we modify the results of Sec. 3 to include the effect of background randomness. The key difference is that the individual Âj are statistically independent for a given object, but the randomness of the object induces statistical dependence. Indeed:
4.A. Likelihood Ratio
Consider the detection of a known signal that, when it is present, is simply added to a random background b(r). In the literature, this task is referred to as SKE/BKS (signal known exactly, background known statistically). For this task, the object under the signal-absent hypothesis is given by f(r) = b(r), and under the signal-present hypothesis it is given by f(r) = b(r) + Δf(r). In vector notation, the two hypotheses are H0 : f = b and H1 : f = b + Δf, where now the background b is the only random quantity in the problem; its probability density function is denoted prb(b). Thus averaging over f conditional on H0 is the same as averaging over b.
From (1),
The likehood ratio is
Because Δf is nonrandom, we can take it out of the expectation integrals and then expand Λ(G) in a Taylor series as
| (14) |
where
and
The notation used here is discussed in detail in Sec. A.10.2 of [5]. In brief, ∂/∂f is the gradient with respect to f; that is, if f is represented as an N × 1 column vector with components fn, then ∂pr(G|f)/∂f is a column vector with components ∂pr(G|f)/∂fn. With this notation, Δf†[∂pr(G|f)/∂f] indicates the scalar product of the two vectors Δf and ∂pr(G|f)/∂f. Similarly, ∂2pr(G|f)/∂f∂f† is an N × N matrix of second derivatives, and the quadratic form Δf†[∂2pr(G|f)/∂f∂f†]Δf is to be interpreted as the scalar product of Δf with [∂2pr(G|f)/∂f∂f†]Δf. If f is the infinite-dimensional Hilbert-space vector corresponding to the function f(r), then ∂/∂f is interpreted as a functional or Frèchet derivative [5].
The notation does not show it, but a and B are random functions of the entire list-mode data set G.
From (14), the logarithm of the likelihood ratio is
Through terms quadratic in Δf,
| (15) |
This expression is the generalization of (9) for detection of weak nonrandom signals on random backgrounds (SKE/BKS tasks).
4.B. Figures of Merit
The first term in (15) is given explicitly by
From (7), we know that
With the chain rule, we can show that
where the product over j′ now has J − 1 factors, with j′ = j excluded.
Taking the average with respect to prb(f) and doing a little algebra, we find
With (5),
Thus, with (7),
| (16) |
To simplify this complicated expression, we note that
If we interpret pr(𝒜̂|J, f) as a function of f for fixed 𝒜̂ and J, then pr(𝒜̂|J, f) is the list-mode likelihood for estimation of f. By definition, pr(𝒜̂|J, f) peaks at the maximum-likelihood estimate of f, which, under H0, we denote as b̂ML. If this peak is sharp compared to the width of prb(f), we can say that
| (17) |
Of course, b̂ML depends on the list-mode data, 𝒜̂, but we suppress this dependence for notational convenience.
When we use the delta function to perform the two averages over f in (16), many cancellations occur and we find
For notational clarity, we define
| (18) |
Notice that
and, similarly
where we used the fact that ∫dG pr(G|f) = 1. Using the results above and with the help of (11), we have
In the double sum over j and j′, there are J terms with j = j′ and J2 − J terms with j ≠ j′. For each case, the terms are identically distributed, so we can write
| (19) |
with the understanding that j ≠ j′.
In more detail, the first term in (19) is
where we have used (18), (5) and (1). Similarly, the second term in (19) is
Because J conditional on its mean is a Poisson random variable, and because 〈J2 − J〉 = 〈J〉2 for a Poisson random variable,
Moreover, conditional on b̂ML, Φ(Âj, b̂ML) and Φ(Âj′, b̂ML) are statistically independent for j ≠ j′, and the averages can performed with (5) and (1). After some algebra, we find
Thus the second and third terms in (19) cancel, and we are left with [cf. (12)]
| (20) |
This expression gives the performance of an ideal observer on a weak-signal SKE/BKS task, provided the approximation in (17) is valid.
In a form that is amenable to Monte Carlo evaluation [cf. (13)], we can also write
| (21) |
If the approximations above are not applicable (and that would happen, for example, when the signal is not weak so that the approximation in (14) is not valid) or when computation requirements to evaluate (21) are prohibitive, other computational methods can be used. As an example, Kupinski et al. [15] show the application of Markov chain Monte Carlo (MCMC) techniques [61] to the problem of estimating ideal-observer detection performance by calculating Λ(G) for randomly generated datasets G. Although [15] assumed pixelated data, the same scheme can be adapted to the case of list-mode data G
| (22) |
where Φ1, …, ΦN are parameterizations of the background object f = fΦn MCMC-generated from list-mode data G, and Λ(G|fΦn) is the SKE/BKE likelihood ratio
defined in (8). We refer the interested reader to [61, 62] for a detailed theoretical treatment of MCMC methods. Details of our MCMC calculation will be provided in a separate paper currently under preparation. Theoretical results derived by Barrett et al. [3] show that for the case of the log-likelihood ratio, SNR2 = −8 log〈Λ1/2〉0 in which the ensemble average 〈Λ1/2〉0 is estimated as the sample average of the Λ1/2(G1), …, Λ1/2(GM), where each Λ(Gm) is calculated via MCMC sampling of (22).
5. List-mode Hotelling Observer
5.A. Poisson Point Process in Attribute Space
For the conventional Hotelling observer t(g) = wTg operating on binned data g [5, 28], the signal s to be detected is the difference 〈g〉1 − 〈g〉0 in the mean data under the two hypotheses and , in which is the mean covariance matrix of g. It is easy to see, however, that the difference in means of two list-mode data sets is not a useful discriminant. Consider, for example, a simple photon-counting imaging system where the attributes are the 2D interaction positions on a detector plane. The mean position is of little use if both the signal-absent and signal-present objects are symmetrically distributed around some origin; the mean of every attribute is zero under both hypotheses in that case.
We can, however, salvage the Hotelling observer and other concepts from linear-systems analysis by using the attribute list to construct a Poisson point process in attribute space [57, 63], defined as
The intuitive meaning of this point process can be seen by continuing with the example where the attributes are 2D positions, in which case we construct u(Â) by placing a delta-function point at each of the J locations.
The point process is specified by J + 1 random variables: the J attribute vectors in 𝒜̂ and J itself. Because the terms in the sum over j are independent and identically distributed, the mean of u(Â), conditional on f, is given by
This equation defines a linear operator ℒ such that
or, more abstractly,
This is the list-mode counterpart of the familiar relation [5] for binned data, g̅ = ℋf. The key difference is that ℋ is a mapping from an infinite-dimensional Hilbert space (𝕃2(ℝ3) if r is a three-dimensional vector) to an M-dimensional Euclidean space, where M is the total number of bins in g. The operator ℒ, on the other hand, maps 𝕃2(ℝ3) to 𝕃2(ℝP), where P is the number of attributes per event [63].
To compute the Hotelling test statistic or the Hotelling SNR2, we need not only the mean of u(Â) but also its covariance function, which we denote Ku|f(Â, Â′) for a fixed object or Ku|Hk(Â, Â′) when we consider an ensemble of objects appropriate to hypothesis Hk. The definitions are
where
| (23) |
with f̅k(r) being the mean of the object ensemble for Hk.
Methods for computing these covariance functions are detailed in Barrett and Myers [5] (see especially Secs. 8.5.3, 11.3.3 and 11.3.7). The results are
| (24) |
where ℒ† is the adjoint [5] of ℒ and
| (25) |
with Kf|Hk(r, r′) being the covariance function (the kernel of the operator 𝒦f|Hk) of the random objects under Hk. Operator notations for (23) and (24) are
where ℐÂ is the unit operator in attribute space, with kernel δ(Â − Â′).
5.B. Hotelling Test Statistic and Figure of Merit
For a binary discrimination task, a linear observer computes a scalar product between some template w and the data to get a test statistic t, and then compares t to a threshold to make the decision. For list-mode data, the test statistic is a scalar product in attribute space, with the form
From (23), the mean of t(G) under Hk is
The average of the variance of t(G) under the two hypotheses is
where
or, in operator form,
With these results, the figure of merit for this general linear discriminant in attribute space is [3]
where
It can be shown (Barrett and Myers [5], Sec. 13.2.12) that this figure of merit is maximized when
This choice of w defines the list-mode Hotelling observer, first given by Lehovich [63], for which the figure of merit is
| (26) |
Explicitly, from (23), (24), and (25), the operator 𝒦̅u has the kernel given by
| (27) |
where and, similarly, . Because covariance operators are positive-semidefinite, the “diagonal” term on the first line of (27) guarantees that the inverse in (26) exists, provided only that the integral in square brackets in (27) does not vanish for any Â.
Various methods for obtaining or approximating without actually performing the operator inverse are discussed in Barrett and Myers [5], Sec. 14.3.2; the results there are specifically for covariance matrices, but they all generalize readily to operators. Some insights might be gained by manipulating the expression for . For example,
| (28) |
where we introduced
Notice that the operator ℬ̅ is diagonal; so its inverse is easy to calculate.
6. Example
The model system for the example presented here is a planar (2D) optical source imaged with a lens onto a photon-counting imaging detector such as a multianode position-sensitive photomultiplier (MAPMT) [64]. The same mathematics applies to a planar gamma-ray source imaged with a pinhole or parallel-hole collimator onto a scintillation camera [5] which uses a photomultiplier array to sense the light from a scintillation event produced by a gamma ray. In both cases, we can think of the jth photon (optical or gamma-ray) as emerging from the object at a point rj = (xj, yj) and impinging on the detector at Rj = (Xj, Yj). Then the signals from the PMTs are used to estimate the position Rj, with the estimate being denoted by R̂j = (X̂j, Ŷj). Thus the true attribute of the jth detected photon is Aj = Rj, and the list-mode data set is
We assume that the magnification of the imaging system is unity and that the sensitivity s(r) is a constant s0, independent of position but proportional to the square of the numerical aperture of the optics (or the square of the diameter of the pinhole in the gamma-ray case). The optics is assumed to be linear and shift invariant; and its point spread function (PSF) is denoted by popt(R), which is normalized such that ∫ dR popt(R) = 1. Thus
The detector is also assumed to be linear and shift invariant, and its PSF is denoted by pdet(R), so that
Thus both PSFs, normally interpreted deterministically, play the role of conditional PDFs on photon position in this case [5]. In this paper we assume, for simplicity, that both PSFs are Gaussian:
As in previous sections, the object being imaged is divided into signal and background components. The signal, when present, is taken to be Gaussian:
where and rs is a positive number characterizing the width of the signal.
To specify a random background we use the stationary lumpy-background model [65], for which
where the lump function is also assumed to be a Gaussian
Here the number of lumps, K, is a Poisson random variable with mean K̅ and the lump centers rk are random 2D vectors uniformly distributed over a square area of size w × w, and the detector is also w × w. With these assumptions, f(r) is a filtered, stationary, Poisson random process. For further details, see Rolland [65] or Barrett and Myers [5]. Because the signal is known, the covariance Kf|Hk(r, r′) is the same under the two hypotheses and given by [65]
in which we defined p = K̅/w2. We will assume rb ≫ w and rs ≫ w, so that the density pr(R̂|r) can be approximated with the convolution (popt ∗ pdet) (R̂ − r) between popt and pdet evaluated at R̂ − r
where . With this result, the quantity [ℒ𝒦̅fℒ†](R̂, R̂′) assumes the form
In a similar way,
For the example considered here, the kernel of the operator 𝒦̅u is
and if , we can also write
By the same token, ℬ̅−1 ≈ (pf0)−1ℐR̂, so that
Notice that ℬ̅−1 ℒ𝒦̅fℒ† is shift invariant, this allows us to write
| (29) |
provided that
The form above can be verified by substituting and performing the integrals. Then from (28)
in which we have used the fact that 𝒰 is unitary (𝒰†𝒰 = 𝒰𝒰† = ℐ). Notice that ℐρ + τ𝒟 is diagonal, thus its inverse is readily available. Using (26) and (28) we can write
where
If we carry out the calculation, we find
| (30) |
To simplify the notation, let us introduce α = f0τ, β = (Δf0)2/(pf0), , and γ = a/b. With a change of variables [66, 67], we obtain
| (31) |
where the function 2F1(a, b; c; z) denotes the hypergeometric series [66]:
where (x)n = x(x + 1) … (x + n − 1).
We are now interested in studying the behavior of as the exposure time τ increases. We will consider three different cases: rs < rb, rs = rb, and rs > rb and we will use to denote the values of SNR2 for each case.
The rs = rb case is simple to handle. Calculation of the integral yields
As expected, increases as the exposure time τ increases. The same is true for the rs ≠ rb cases. Indeed, by expanding the series in (31) and noticing that α/(1 + α) is positive and monotonic increasing with α = f0τ, we see that must increase as τ increases.
For the rs ≠ rb cases, consider first
where
If rs < rb, then γ < 1 and (1)n/(1 + γ)n > 1/(n + 1) provided that n ≠ 0. Therefore, after substitutions
The result we just obtained shows that, for the rs < rb case and as τ increases, grows faster than it grew in the rs = rb case. With a Taylor expansion for 1/τ ≈ 0, we can make this concept more precise
On the other hand, if rs > rb
and
in which and we have used Γ(γ + 1) = γΓ(γ) [68]. Upon substitution
Fig. 1 reports plots of for three different cases as functions of total mean count number J̅, which is related to the exposure time as J̅ = τpf0w2, where w is the detector side length. For comparison, Fig. 1 also includes plots of SNR2 for the ideal observer calculated via MCMC methods, which we briefly alluded to at the end of Sec. 4.B. The ideal observer’s SNR2 is asymptotically linear with the total mean count number. This asymptotic behavior confirms the result of (20), which was made possible by the assumption in (17). On the other hand, for low counts, the approximation in (17) breaks down and, as shown in Fig. 1, the ideal observer’s SNR2 is no longer proportional to the total mean count number (or, equivalently, to the exposure time).
Fig. 1.
Plots of SNR2 for three different cases. For all plots, Δf0 = 100 s−1, f0 = 1000 s−1, p = 20000 m−2, σ = 0.001 m, and rb = 0.005 m. For the solid curves, rs = 0.0045 m; for the dashed curves, rs = rb = 0.0050 m; and, finally, for the dash-dotted curves rs = 0.0055 m. Thick curves corresponds to the SNR2 for the Hotelling observer, while thin curves to the MCMC-calculated SNR2 for the ideal observer
The results derived above show that the keeps growing as the exposure time increases only for the rs ≤ rb case. Otherwise, is upper limited. Furthermore, grows as the signal intensity grows, and it decreases if the background intensity and/or the lump density p increases. The peculiar behavior we have in the rs > rb case can be intuitively explained by noting that the signal has a Gaussian shape, so do the lumps in the background. If the signal is wider that the lumps, it is possible to have a false positive when a few lumps cluster at the signal location in such a way that they actually look like the signal we are looking for. On the other hand, when the signal is narrower that the lumps (rs < rb), false positives are reduced because no lump arrangement will look like the signal we are to detect.
The rs = rb case deserves a different discussion. One could argue that because rs = rb, a background lump placed at r = (0, 0) could mistakenly be interpreted as the signal Δf we want to detect, which in our case is indeed centered at (0, 0) and it is a scaled version of the lump function ℓ(r). Thus, the should be bounded when rs = rb. However, if we recall that the lump centers r1, …, rK are 2D points uniformly distributed over a square of size w × w, we immediately realize that the probability of having a lump centered at (0, 0) is zero. Thus, with probability 1, a lump centered at (0, 0) must be understood as the signal Δf.
7. Summary and Conclusions
In this paper, we have considered the problem of signal detection with list-mode data. Two observers have been considered: the ideal observer and the optimal linear observer, known in the literature as the Hotelling observer. Three distinct processes are involved in the development of these observers: randomness in the object being imaged, randomness in the attribute vectors, and, finally, randomness in their estimates from noisy detector outputs. The detection problems we have considered are detection of a known signal in a known background and detection of a known signal in a random background. Associated figures of merit have been calculated as well. In particular, the ideal observer’s SNR2 is found to be asymptotically proportional to the exposure time. The optimal linear observer is applied to list-mode data in an example at the end of the paper. A followup paper on the Markov chain Monte Carlo calculation of SNR2 for the ideal observer is currently under preparation.
Acknowledgments
The authors would like to acknowledge Eric Clarkson and Matthew Kupinski for stimulating discussions and NIH grants R37 EB000803 and P41 EB002035 for financial support.
References
- 1.Barrett HH. Objective assessment of image quality: Effects of quantum noise and object variability. J. Opt. Soc. Am. A. 1990;7:1266–1278. doi: 10.1364/josaa.7.001266. [DOI] [PubMed] [Google Scholar]
- 2.Barrett HH, Denny JL, Wagner RF, Myers KJ. Objective assessment of image quality. II. Fissher information, Fourier crosstalk, and figures of merit for task performance. J. Opt. Soc. Am. A. 1995;12:834–852. doi: 10.1364/josaa.12.000834. [DOI] [PubMed] [Google Scholar]
- 3.Barrett HH, Abbey CK, Clarkson E. Objective assessment of image quality. III. ROC metrics, ideal observers, and likelihood-generating functions. J. Opt. Soc. Am. A. 1998;15:1520–1535. doi: 10.1364/josaa.15.001520. [DOI] [PubMed] [Google Scholar]
- 4.Barrett HH, Myers KJ, Devaney N, Dainty C. Objective assessment of image quality. IV. Application to adaptive optics. J. Opt. Soc. Am. A. 2006;23:3080–3105. doi: 10.1364/josaa.23.003080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Barrett HH, Myers KJ. Foundations of Image Science. Hoboken, NJ: Wiley-Interscience; 2004. [Google Scholar]
- 6.Myers KJ, Barrett HH, Borgstrom MC, Patton DD, Seeley GW. Effect of noise correlation on detectability of disk signals in medical imaging. J. Opt. Soc. Am. A. 1985;2:1752–1759. doi: 10.1364/josaa.2.001752. [DOI] [PubMed] [Google Scholar]
- 7.Smith WE, Barrett HH. Hotelling trace criterion as a figure of merit for the optimization of imaging systems. J. Opt. Soc. Am. A. 1986;3:717–725. [Google Scholar]
- 8.Fiete RD, Barrett HH, Smith WE, Myers KJ. Hotelling trace criterion and its correlation with human-observer performance. J. Opt. Soc. Am. A. 1987;4:945–953. doi: 10.1364/josaa.4.000945. [DOI] [PubMed] [Google Scholar]
- 9.Hanson KM. Method of evaluating image-recovery algorithms based on task performance. J. Opt. Soc. Am. A. 1990;7:1294–1304. [Google Scholar]
- 10.Burgess AE. Statistically defined backgrounds: performance of a modified nonprewhitening observer model. J. Opt. Soc. Am. A. 1994;11:1237–1242. doi: 10.1364/josaa.11.001237. [DOI] [PubMed] [Google Scholar]
- 11.Burgess AE, Li X, Abbey CK. Visual signal detectability with two noise components: anomalous masking effects. J. Opt. Soc. Am. A. 1997;14:2420–2442. doi: 10.1364/josaa.14.002420. [DOI] [PubMed] [Google Scholar]
- 12.Bochud FO, Abbey CK, Eckstein MP. Visual signal detection in structured backgrounds. III. Calculation of figures of merit for model observers in statistically non-stationary backgrounds. J. Opt. Soc. Am. A. 2000;17:193–205. doi: 10.1364/josaa.17.000193. [DOI] [PubMed] [Google Scholar]
- 13.Abbey CK, Eckstein MP. Derivation of a detectability index for correlated responses in multiple-alternative forced-choice experiments. J. Opt. Soc. Am. A. 2000;17:2101–2104. doi: 10.1364/josaa.17.002101. [DOI] [PubMed] [Google Scholar]
- 14.Eckstein MP, Abbey CK, Bochud FO. Visual signal detection in structured backgrounds. IV. Figures of merit for model performance in multiple-alternative forced-choice detection tasks with correlated responses. J. Opt. Soc. Am. A. 2000;17:206–217. doi: 10.1364/josaa.17.000206. [DOI] [PubMed] [Google Scholar]
- 15.Kupinski MA, Hoppin JW, Clarkson E, Barrett HH. Ideal-observer computation in medical imaging with use of Markov-chain Monte Carlo techniques. J. Opt. Soc. Am. A. 2003;20:430–438. doi: 10.1364/josaa.20.000430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chen L, Barrett HH. Task-based lens design with application to digital mammography. J. Opt. Soc. Am. A. 2005;22:148–167. doi: 10.1364/josaa.22.000148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Rolland J, O’Daniel J, Akcay C, DeLemos T, Lee K-I, Cheong KS, Clarkson E, Chakrabarti R, Ferris R. Task-based optimization and performance assessment in optical coherence imaging. J. Opt. Soc. Am. A. 2005;22:1132–1142. doi: 10.1364/josaa.22.001132. [DOI] [PubMed] [Google Scholar]
- 18.Caucci L, Barrett HH, Devaney N, Rodríguez JJ. Application of the Hotelling and ideal observers to detection and localization of exoplanets. J. Opt. Soc. Am. A. 2007;24:B13–B24. doi: 10.1364/josaa.24.000b13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Burgess AE, Judy PF. Signal detection in power-law noise: effect of spectrum exponents. J. Opt. Soc. Am. A. 2007;24:B52–B60. doi: 10.1364/josaa.24.000b52. [DOI] [PubMed] [Google Scholar]
- 20.Abbey CK, Eckstein MP. Classification images for simple detection and discrimination tasks in correlated noise. J. Opt. Soc. Am. A. 2007;24:B110–B124. doi: 10.1364/josaa.24.00b110. [DOI] [PubMed] [Google Scholar]
- 21.Dixon TD, Canga EF, Nikolov SG, Troscianko T, Noyes JM, Canagarajah CN, Bull DR. Selection of image fusion quality measures: objective, subjective, and metric assessment. J. Opt. Soc. Am. A. 2007;24:B125–B135. doi: 10.1364/josaa.24.00b125. [DOI] [PubMed] [Google Scholar]
- 22.Platiša L, Goossens B, Vansteenkiste E, Park S, Gallas BD, Badano A, Philips W. Channelized Hotelling observers for the assessment of volumetric imaging data sets. J. Opt. Soc. Am. A. 2011;28:1145–1163. doi: 10.1364/JOSAA.28.001145. [DOI] [PubMed] [Google Scholar]
- 23.Wagner RF, Brown DG. Unified SNR analysis of medical imaging systems. Phys. Med. Biol. 1985;30:489–518. doi: 10.1088/0031-9155/30/6/001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Frey EC, Gilland KL, Tsui BMW. Application of task-based measures of image quality to optimization and evaluation of three-dimensional reconstruction-based compensation methods in myocardial perfusion SPECT. IEEE Trans. Med. Imag. 2002;21:1040–1050. doi: 10.1109/TMI.2002.804437. [DOI] [PubMed] [Google Scholar]
- 25.He X, Frey EC, Links JM, Gilland KL, Segars WP, Tsui BMW. A mathematical observer study for the evaluation and optimization of compensation methods for myocardial SPECT using a phantom population that realistically models patient variability. IEEE Trans. Nucl. Sci. 2004;51:218–224. [Google Scholar]
- 26.Gang GJ, Lee J, Prince JL, Stayman JW, Tward DJ, Zbijewski W, Siewerdsen JH. Analysis of Fourier-domain task-based detectability index in tomosynthesis and cone-beam CT in relation to human observer performance. Med. Phys. 2011;38:1754–1768. doi: 10.1118/1.3560428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hotelling H. The generalization of Student’s ratio. Ann. of Math. Stat. 1931;2:360–378. [Google Scholar]
- 28.Myers KJ, Barrett HH. Addition of a channel mechanism to the ideal-observer model. J. Opt. Soc. Am. A. 1987;4:2447–2457. doi: 10.1364/josaa.4.002447. [DOI] [PubMed] [Google Scholar]
- 29.Papaliolios C, Mertz L. New two-dimensional photon camera. In: Crawford DL, editor. Instrumentation in Astronomy IV. vol. 331. Proc. SPIE; 1982. pp. 360–369. vol. 331. [Google Scholar]
- 30.Papaliolios C, Nisenson P, Ebstein S. Speckle imaging with the PAPA detector. Appl. Optics. 1985;24:287–292. doi: 10.1364/ao.24.000287. [DOI] [PubMed] [Google Scholar]
- 31.Timothy JG, Morgan JS, Slater DC, Kasle DB, Bybee RL. MAMA detector systems: A status report. In: Huffman RE, editor. Ultraviolet technology III. vol. 1158. Proc. SPIE; 1989. pp. 104–117. vol. 1158. [Google Scholar]
- 32.Timothy JG. Optical detectors for spectroscopy. Publications of the Astronomical Society of the Pacific. 1983;95:810–834. [Google Scholar]
- 33.Bruyndonckx P, Lemaître C, Schaart D, Maas M, (jan) van der Laan DJ, Krieguer M, Devroede O, Tavernier S. Towards a continuous crystal APD-based PET detector design. Nucl. Instrum. and Meth. A. 2007;571:182–186. [Google Scholar]
- 34.Bostroem KA, Aloisi A, Bohlin R, Diaz R, Dixon V, Goudfrooij P, Hodge P, Lennon D, Long C, Niemi S, Osten R, Proffitt C, Walborn N, Wheeler T, Wolfe M, York B, Zheng W. STIS Instrument Handbook, Version 10.0. Baltimore, MD: Space Telescope Science Institute; 2010. [Google Scholar]
- 35.Anger HO. Scintillation camera. Rev. Sci. Instrum. 1958;29:27–33. [Google Scholar]
- 36.Wernick MN, Aarsvold JN. Emission Tomography: The Fundamentals of PET and SPECT. San Diego, CA: Elsevier Academic Press; 2004. [Google Scholar]
- 37.Johns PC, Dubeau J, Gobbi DG, Li M, Dixit MS. Opto-Canada: SPIE Regional Meeting on Optoelectronics, Photonics, and Imaging. vol. TD01. Proc. SPIE; 2002. Photon-counting detectors for digital radiography and X-ray computed tomography; pp. 367–369. vol. TD01. [Google Scholar]
- 38.Shikhaliev PM, Xu T, Molloi S. Photon counting computed tomography: Concept and initial results. Med. Phys. 2005;32:427–436. doi: 10.1118/1.1854779. [DOI] [PubMed] [Google Scholar]
- 39.Hunter WCJ, Barrett HH, Furenlid LR. Calibration method for ML estimation of 3D interaction position in a thick gamma-ray detector. IEEE Trans. Nucl. Sci. 2009;56:189–196. doi: 10.1109/TNS.2008.2010704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hesterman JY, Caucci L, Kupinski MA, Barrett HH, Furenlid LR. Maximum-likelihood estimation with a contracting-grid search algorithm. IEEE Trans. Nucl. Sci. 2010;57:1077–1084. doi: 10.1109/TNS.2010.2045898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Takahashi T, Nakazawa K, Kamae T, Tajima H, Fukazawa Y, Nomachi M, Kokubun M. High resolution CdTe detectors for the next-generation multi-Compton gamma-ray telescope. In: Truemper JE, Tananbaum HD, editors. X-Ray and Gamma-Ray Telescopes and Instruments for Astronomy. vol. 4851. Proc. SPIE; 2002. pp. 1228–1235. vol. 4851. [Google Scholar]
- 42.Bellm EC, Boggs SE, Bandstra MS, Bowen JD, Perez-Becker D, Wunderer CB, Zoglauer A, Amman M, Luke H-K, Chang J-L, Chiu J-S, Liang Y-H, Chang Z-K, Liu W-C, Hung C-H, Lin PN, Huang MA, Jean P. Overview of the nuclear Compton telescope. IEEE Trans. Nucl. Sci. 2009;56:1250–1256. [Google Scholar]
- 43.Zoglauer A, Andritschke R, Kanbach G. Data analysis for the MEGA prototype. New Astron. Rev. 2004;48:231–235. [Google Scholar]
- 44.Kitaeva GK, Penin AN. Spontaneous parametric down-conversion. JETP Lett. 2005;82:330–335. [Google Scholar]
- 45.Hayat A, Ginzburg P, Orenstein M. Observation of two-photon emission from semiconductors. Nature Photon. 2008;2:238–241. [Google Scholar]
- 46.Nasr MB, Minaeva O, Goltsman GN, Sergienko AV, A. Saleh BE, Teich MC. Submicron axial resolution in an ultrabroadband two-photon interferometer using superconducting single-photon detect. Opt. Express. 2008;16:15104–15108. doi: 10.1364/oe.16.015104. [DOI] [PubMed] [Google Scholar]
- 47.Abouraddy AF, Stone PR, Sergienko AV, Saleh BEA, Teich MC. Entangled-photon imaging of a pure phase object. Phys. Rev. Lett. 2004;93 doi: 10.1103/PhysRevLett.93.213903. 213903. [DOI] [PubMed] [Google Scholar]
- 48.Reader AJ, Ally S, Bakatselos F, Manavaki R, Walledge RJ, Jeavons AP, Julyan PJ, Zhao S, Hastings DL, Zweit J. One-pass list-mode EM algorithm for high-resolution 3-D PET image reconstruction into large arrays. IEEE Trans. Nucl. Sci. 2002;49:693–699. [Google Scholar]
- 49.Khurd I-T, Hsiao P, Rangarajan A, Gindi G. A globally convergent regularized ordered-subset EM algorithm for list-mode reconstruction. IEEE Trans. Nucl. Sci. 2004;51:719–725. [Google Scholar]
- 50.Reader AJ, Erlandsson K, Ott RJ, Flower MA. Attenuation and scatter correction of list-mode data driven iterative and analytic image reconstruction algorithms for rotating 3D PET systems. IEEE Trans. Nucl. Sci. 1999;46:2218–2226. [Google Scholar]
- 51.Parra L, Barrett HH. List-mode likelihood: EM algorithm and image quality estimation demonstrated on 2-D PET. IEEE Trans. Med. Imag. 1998;17:228–235. doi: 10.1109/42.700734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Snyder DL, Politte DG. Image reconstruction from list-mode data in an emission tomography system having time-of-flight measurements. IEEE Trans. Nucl. Sci. 1983;30:1843–1849. [Google Scholar]
- 53.Reader AJ, Erlandsson K, Flower MA, Ott RJ. Fast accurate iterative 28 reconstruction for low-statistics positron volume imaging. Phys. Med. Biol. 1998;43:835–846. doi: 10.1088/0031-9155/43/4/012. [DOI] [PubMed] [Google Scholar]
- 54.Byrne C. Likelihood maximization for list-mode emission tomographic image reconstruction. IEEE Trans. Med. Imag. 2001;20:1084–1092. doi: 10.1109/42.959305. [DOI] [PubMed] [Google Scholar]
- 55.Huesman RH, Klein GJ, Moses WW, Qi J, Reutter BW, Virador PRG. List-mode maximum-likelihood reconstruction applied to positron emission mammography (PEM) with irregular sampling. IEEE Trans. Med. Imag. 2000;19:532–537. doi: 10.1109/42.870263. [DOI] [PubMed] [Google Scholar]
- 56.Levkovitz R, Falikman D, Zibulevsky M, Ben-Tal A, Nemirovski A. The design and implementation of COSEN, an iterative algorithm for fully 3-D listmode data. IEEE Trans. Med. Imag. 2001;20:633–642. doi: 10.1109/42.932747. [DOI] [PubMed] [Google Scholar]
- 57.Barrett HH, White T, Parra LC. List-mode likelihood. J. Opt. Soc. Am. A. 1997;14:2914–2923. doi: 10.1364/josaa.14.002914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Surti S, Karp JS, Popescu LM, Daube-Witherspoon ME, Werner M. Investigation of image quality and NEC in a TOF-capable PET scanner. IEEE Nuclear Science Symposium Conference Record; Rome; Italy. 2004. pp. 4032–4037. [Google Scholar]
- 59.Guérin B, Fakhri GE. Novel scatter compensation of list-mode PET data using spatial and energy dependent corrections. IEEE Trans. Med. Imag. 2011;30:759–773. doi: 10.1109/TMI.2010.2095025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Snyder DL. Parameter estimation for dynamic studies in emission-tomography systems having list-mode data. IEEE Trans. Nucl. Sci. 1984;31:925–931. [Google Scholar]
- 61.Gilks WR, Richardson S, Spiegelhalter DJ. Introducing Markov chain Monte Carlo. In: Gilks WR, Richardson S, Spiegelhalter DJ, editors. Markov chain Monte Carlo in Practice. chap. 1. Boca Raton, FL: Chapman & Hall/CRC; 1996. [Google Scholar]
- 62.Neal RM. Tech. Rep. CRG-TR-1. Department of Computer Science, University of Toronto; 1993. Probabilistic inference using Markov chain Monte Carlo methods. [Google Scholar]
- 63.Lehovich A. Ph.D. thesis. Tucson, AZ: University of Arizona; 2005. List-mode SPECT reconstruction using empirical likelihood. [Google Scholar]
- 64.Korpar S, Križan P, Pestotnik R, Gorišek A, Stanovnik A, Starič M, Škrk D. Multianode photomultipliers as position-sensitive detectors of single photons. Nucl. Instrum. Methods Phys. Res., A. 2000;442:316–321. [Google Scholar]
- 65.Rolland JP, Barrett HH. Effect of random background inhomogeneity on observer detection performance. J. Opt. Soc. Am. A. 1992;9:649–658. doi: 10.1364/josaa.9.000649. [DOI] [PubMed] [Google Scholar]
- 66.Wang ZX, Guo DR. Special functions. Singapore: World Scientific; 1989. [Google Scholar]
- 67.Apelblat A. Table of Definite and Infinite Integrals. Amsterdam, Netherlands: Elsevier Scientific Pub. Co.; 1983. [Google Scholar]
- 68.Andrews GE, Askey R, Roy R. Special functions. Cambridge, England: Cambridge University Press; 2000. [Google Scholar]

