A stochastic analysis of distance estimation approaches in single molecule microscopy - quantifying the resolution limits of photon-limited imaging systems

Sripad Ram; E Sally Ward; Raimund J Ober

doi:10.1007/s11045-012-0175-6

Author manuscript; available in PMC: 2014 Sep 1.

Published in final edited form as: Multidimens Syst Signal Process. 2013 Sep;24(3):503–542. doi: 10.1007/s11045-012-0175-6

PMCID: PMC4053535 NIHMSID: NIHMS380603 PMID: 24932067

Abstract

Optical microscopy is an invaluable tool to visualize biological processes at the cellular scale. In the recent past, there has been significant interest in studying these processes at the single molecule level. An important question that arises in single molecule experiments concerns the estimation of the distance of separation between two closely spaced molecules. Presently, there exist different experimental approaches to estimate the distance between two single molecules. However, it is not clear which of these approaches provides the best accuracy for estimating the distance. Here, we address this problem rigorously by using tools of statistical estimation theory. We derive formulations of the Fisher information matrix for the underlying estimation problem of determining the distance of separation from the acquired data for the different approaches. Through the Cramer-Rao inequality, we derive a lower bound to the accuracy with which the distance of separation can be estimated. We show through Monte-Carlo simulations that the bound can be attained by the maximum likelihood estimator. Our analysis shows that the distance estimation problem is in fact related to the localization accuracy problem, the latter being a distinct problem that deals with how accurately the location of an object can be determined. We have carried out a detailed investigation of the relationship between the Fisher information matrices of the two problems for the different experimental approaches considered here. The paper also addresses the issue of a singular Fisher information matrix, which presents a significant complication when calculating the Cramer-Rao lower bound. Here, we show how experimental design can overcome the singularity. Throughout the paper, we illustrate our results by considering a specific image profile that describes the image of a single molecule.

Keywords: Marked point process, Photon statistics, Performance bounds, Fluorescence microscopy, Resolution limits, Rayleigh’s criterion

1 Introduction

The study of biomolecular interactions that occur within a cell is fundamental to all areas of basic biomedical research. The optical microscope is one of the most preferred tools to study biomolecular interactions, as it enables the direct visualization of these processes in real time. For instance, several technological advances in the past decade have made it possible to image individual biomolecules with an optical microscope even in live biological cells (Moerner (2007); Ober et al (2004a)). In many concrete applications, it is important to know the distance of separation between the biomolecules, as this has significant biological implications. The resolution limit of the optical microscope plays a crucial role in determining the ability to measure the distance of separation between biomolecules. Classical resolution criteria such as Rayleigh’s criterion, although extensively used, are well known to be based on heuristic notions that render them inadequate for present day microscopy systems. Therefore quantifying the resolution limit is a very important problem with significant implications on the nature and type of studies that can be carried out with an optical microscope.

Current experimental approaches to studying single molecule interactions can be broadly classified into two categories. In one set of approaches, which we refer to as the simultaneous detection approach (Figure 1A), photon emission from the point sources occurs simultaneously during image acquisition and hence the acquired images contain signal from both point sources (Santos and Young (2000); Ram et al (2006a); Chao et al (2009a, b)). In the other set of approaches, which we refer to as the separate detection approach (Figure 1C), photon emission from the point sources is temporally separated (e.g., stochastic photoactivation (Betzig et al (2006); Rust et al (2006); Hess et al (2006)) and blinking (Lidke et al (2005); Lagerholm et al (2006))). Hence the acquired images typically contain signal from only one of the point sources. For both types of approaches, the analysis of the acquired data is carried out using a parameter estimation framework. For example, in the case of the simultaneous detection approach the distance between the point sources is determined by fitting a pair of suitably parameterized image profiles to the acquired data. In the case of the separate detection approach, the analysis involves independently localizing the point sources and then deducing the distance. It has been reported that both approaches are capable of accurately measuring nanometer scale distances, well below the classical resolution criteria. However, an important question arises as to the fundamental performance limits of the two experimental approaches in measuring the distance of separation.

Fig. 1: Different experimental approaches to determine the distance of separation between two identical point sources. Panel A illustrates the simultaneous detection approach in which photon emission from both point sources occurs during image acquisition. In this approach, the data consists of a single image that contains signal from both point sources. Panel B illustrates the special case of the simultaneous detection approach, where the image of one of the point sources is additionally available. Here, the data consists of a pair of images where one of the images contains signal from only one point source, whereas the other image contains signal from both point sources. Panel C illustrates the separate detection approach, where photon emission from the point sources is temporally separated. Here, the data consists of a pair of images, where each image contains signal from only one of the point sources.

In this paper, we use the tools of statistical signal processing to investigate this question in a rigorous manner. We formulate the resolution problem as a parameter estimation problem of determining the distance between two closely spaced point sources. The issue of resolvability of the two point sources then becomes a question of how accurately the distance can be estimated, i.e., how large is the standard deviation of the distance estimator. In this context, it is important to know what is the lowest possible standard deviation with which the distance can be estimated, as this can be used as a benchmark for the resolvability of the point sources. For this, we make use of the Cramer-Rao inequality (Rao (1965)) which, through the inverse Fisher information matrix, provides a lower bound to the variance of any unbiased estimator of an unknown parameter. Thus, in the present context we interpret the Cramer-Rao lower bound of the distance parameter as a measure of resolvability of the two point sources.

Here, we derive formulations of the Fisher information matrix for the parameter estimation problem that underlies the data analysis for the two approaches. Our analysis shows that the Fisher information matrices for the two techniques exhibit very distinct behaviors. For instance, in the simultaneous detection approach the Fisher information matrix depends on the distance of separation between the point sources. In contrast, for the separate detection approach the Fisher information matrix is independent of the distance of separation. As we will see, the distance dependence of the Fisher information matrix has several implications. In particular, for the simultaneous detection approach the Fisher information matrix becomes singular when the distance goes to zero, assuming that the two point sources have identical image profiles and photon detection rates, which is the case in most imaging applications. An immediate implication is that for very small distances, the Cramer-Rao lower bound of the distance will be numerically very large, thereby predicting poor resolvability of the point sources. On the other hand, the Fisher information matrix for the separate detection approach is invertible for all values of the distance, including when the distance is equal to zero.

Another problem that is of significance in the present context is the localization accuracy problem, which deals with how accurately the location of an object can be determined (Wong et al (2011); Ram et al (2006b); Ober et al (2004b); Rohr (2007)). For the separate detection approach, the localization accuracy problem naturally arises as part of the data analysis procedure. For the simultaneous detection approach, the localization accuracy problem arises as a special case where in some applications the image of one of the point sources is additionally available (e.g., photobleaching (Ram et al (2006a); Gordon et al (2004); Qu et al (2004))), which, in turn, can be used as a priori information (Figure 1B). Here, we investigate the relationship between the Fisher information matrix of the two approaches and that of the localization accuracy problem. Our analysis shows that for the separate detection approach, the expression for the Fisher information matrix is equivalent to that of the localization accuracy problem, whereas for the simultaneous detection approach the equivalence is attained only when the distance of separation between the point sources becomes very large (i.e., d → ∞). In this context, we also investigate the singularity of the Fisher information matrix for the simultaneous detection approach. In particular, we show that the singularity can be removed when the location coordinates of one of the point sources are known a priori.

Previously, we have examined the distance estimation problem for optical microscopes, where we derived analytical expressions for the Fisher information matrix. In Ram et al (2006a), we investigated the 2D imaging scenario for the simultaneous detection approach, where the point sources were assumed to be located on the x axis of the plane of focus in the object space. In Chao et al (2009a,b), we considered the 3D imaging scenario for the simultaneous detection approach, where the point sources were assumed to be located anywhere in the object space. In Chao et al (2009c), we reported numerical calculations of the Cramer-Rao lower bound for the two detection approaches considered here. In the present work, we rigorously analyze the relationship between the Fisher information matrices of the two experimental approaches considered here and that of the localization accuracy problem.

In the past, other groups have investigated the distance estimation problem by adopting a simplified data model, where the acquired data is described as a deterministic signal corrupted by additive noise (Helstrom (1964); Smith (2005); Shahram and Milanfar (2004)). Because photon/light emission from a point source is inherently a random phenomenon (Young (1996)), it is important to take into account the stochastic nature of the signal (i.e., the photon statistics) from the point sources, especially when dealing with photon-limited imaging systems (O’Sullivan et al (1998)). In our (prior and current) work, we have adopted a stochastic framework and modeled the acquired data as a spatio-temporal random process (marked Poisson process). In this way we explicitly take into account the photon statistics. Thus the results and analyses presented in this paper provide a broad framework to investigate the resolution limits for a wide variety of low light level imaging applications.

The paper is organized as follows. In Section 3, we derive general expressions of the Fisher information matrix for the estimation problem that underlies the simultaneous detection approach. We also derive the Fisher information matrix for a concrete scenario in optical microscopy where the image of an object is considered to be spatially invariant. In Section 4, we discuss the relationship between the Fisher information matrix for the simultaneous detection approach and that of the localization accuracy problem. In Section 5, we consider a special case of the simultaneous detection approach, where we assume that the location coordinates of one of the point sources are known and derive the Fisher information matrix. As we will see, the analysis of this special case provides important insights into the relationship between the two approaches considered here. In Section 6, we derive the Fisher information matrix for the separate detection approach. Finally, in Section 7 we validate our results by demonstrating that the maximum likelihood estimator of the distance attains the Cramer-Rao lower bound for the different experimental approaches considered here. Throughout the paper, we illustrate our results with examples relevant to single molecule microscopy.

2 Stochastic framework

We assume an acquired image to consist of the time points and the spatial coordinates of the detected photons and model it as a spatio-temporal random process. We refer to this process as the image detection process 𝒢 (see Ram et al (2006b) for details). The parameter space Θ is assumed to be an open subset of ℝⁿ and the detector that is used to capture the photons is denoted as 𝒞, where 𝒞 ⊆ ℝ² is open. The temporal part of 𝒢 is modeled as an inhomogeneous Poisson process with intensity Λ_θ called the photon detection rate and the spatial part of 𝒢 is modeled as a sequence of mutually independent random variables with densities {f_θ,τ}_τ≥t₀ called the photon distribution profile. It is assumed that the spatial and temporal components are mutually independent of each other and that f_θ,τ satisfies the regularity conditions necessary for the calculation of the Fisher information matrix (Ram et al (2006b); Kay (1993)).

The general expression of the Fisher information matrix for the image detection process 𝒢 is given by (Ram et al (2006b))

I (θ) = \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{Λ_{θ} (τ) f_{θ, τ} (r)} {(\frac{\partial [Λ_{θ} (τ) f_{θ, τ} (r)]}{\partial θ})}^{T} \frac{\partial [Λ_{θ} (τ) f_{θ, τ} (r)]}{\partial θ} drd τ, θ \in Θ,

(1)

where [t₀, t] denotes the time interval during which the data is acquired and the integration variable r denotes the 2D Cartesian coordinates (x,y). In the above equation, no specific assumptions have been made regarding the functional form of f_θ,τ or Λ_θ. Therefore, the above expression of I(θ) is applicable to a wide variety of imaging conditions, such as coherent/incoherent/partially-coherent light sources, polarized illumination and detection, etc. We note that the above equation is applicable to both stationary and moving objects, since we allow the density f_θ,τ, which describes the image profile of the object, to vary in time.
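As a minimal numerical sketch of eq. 1, consider the simplest possible setting: a single stationary point source, a constant and known photon detection rate Λ₀, and a 2D Gaussian photon distribution profile of width σ. The Gaussian profile, the function names and the grid parameters below are illustrative assumptions and are not taken from the paper. Under these assumptions eq. 1 reduces to N ∫ (∂f/∂x₀)² / f dr, where N = Λ₀(t − t₀) is the expected photon count, and the entry for the location coordinate x₀ should be close to N/σ².

```python
import math


def gaussian_profile(x, y, x0, y0, sigma):
    """2D Gaussian photon distribution profile (an assumed stand-in image profile)."""
    r2 = (x - x0) ** 2 + (y - y0) ** 2
    return math.exp(-r2 / (2.0 * sigma ** 2)) / (2.0 * math.pi * sigma ** 2)


def fisher_information_x0(n_photons, x0, y0, sigma, half_width=6.0, n=240):
    """Midpoint-rule estimate of the (x0, x0) entry of eq. 1.

    Assumes a constant, known photon detection rate, so that the time
    integral contributes only the expected photon count n_photons.
    The detector is truncated to +/- half_width * sigma around the source.
    """
    lo_x = x0 - half_width * sigma
    lo_y = y0 - half_width * sigma
    h = 2.0 * half_width * sigma / n
    total = 0.0
    for i in range(n):
        x = lo_x + (i + 0.5) * h
        for j in range(n):
            y = lo_y + (j + 0.5) * h
            f = gaussian_profile(x, y, x0, y0, sigma)
            if f > 0.0:
                # Analytic derivative of the Gaussian profile with respect to x0.
                df_dx0 = f * (x - x0) / sigma ** 2
                total += df_dx0 ** 2 / f
    return n_photons * total * h * h


if __name__ == "__main__":
    info = fisher_information_x0(n_photons=1000.0, x0=0.0, y0=0.0, sigma=1.0)
    print(info, 1.0 / math.sqrt(info))  # information and the bound on std of x0
```

The square root of the reciprocal of this entry is the corresponding lower bound on the standard deviation of any unbiased estimator of x₀ in this one-parameter setting.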

In order to quantify and compare the performance of the various experimental approaches considered in this paper, we make use of the Cramer-Rao inequality (Rao (1965)), which states that for any unbiased estimator θ̂ of an n × 1 vector parameter θ, Cov(θ̂) ≥ I⁻¹(θ), θ ∈ Θ, where I(θ) denotes the Fisher information matrix and it is assumed that the inverse exists. From this inequality, it immediately follows that the i^th leading diagonal entry of the inverse Fisher information matrix ([I⁻¹(θ)]_ii) provides a lower bound to the variance of the estimates of the i^th component of the parameter vector (θ_i), i = 1, …, n.

Throughout the paper, we adopt a parameterization in which the locations of the two point sources are specified in terms of their Cartesian coordinates, i.e., (x₀₁, y₀₁) and (x₀₂, y₀₂). Hence the expressions for the Fisher information matrix will be given in terms of this parameterization. As we will see in subsequent sections, this parameterization not only simplifies the derivation of the Fisher information matrix for the different experimental approaches considered here, but it also helps in the analysis of the relationship between the distance estimation problem and the localization accuracy problem. To derive the Cramer-Rao lower bound for the distance parameter d, we require the analytical expression for the (inverse) Fisher information matrix of d. For this, we make use of the following coordinate transformation formula (Kay (1993))

I^{- 1} (d) = (\frac{\partial d}{\partial θ}) I^{- 1} (θ) {(\frac{\partial d}{\partial θ})}^{T}, d \in [0, \infty),

(2)

where θ = (x₀₁, y₀₁, x₀₂, y₀₂), I⁻¹(θ) denotes the inverse Fisher information matrix corresponding to θ, and

{(\frac{\partial d}{\partial θ})}^{T} ≔ \frac{1}{d} (\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix}), θ \in Θ .
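Eq. 2 can be evaluated directly once I⁻¹(θ) is available. The sketch below is illustrative only: the function name `distance_crlb` and the example inverse Fisher information matrix are hypothetical and are not taken from the paper. It forms the Jacobian ∂d/∂θ defined above and returns the square root of I⁻¹(d), i.e., the lower bound on the standard deviation of the distance estimates.

```python
import math


def distance_crlb(theta, inv_fisher):
    """Lower bound on the standard deviation of d, obtained from eq. 2.

    theta      : (x01, y01, x02, y02)
    inv_fisher : 4x4 inverse Fisher information matrix for theta (nested lists)
    """
    x01, y01, x02, y02 = theta
    dx, dy = x02 - x01, y02 - y01
    d = math.hypot(dx, dy)
    if d == 0.0:
        raise ValueError("the Jacobian of d is undefined at d = 0")
    # Row vector (partial d / partial theta), as defined after eq. 2.
    jac = [-dx / d, -dy / d, dx / d, dy / d]
    var_d = sum(
        jac[i] * inv_fisher[i][j] * jac[j] for i in range(4) for j in range(4)
    )
    return math.sqrt(var_d)


if __name__ == "__main__":
    # Hypothetical inverse Fisher matrix: every coordinate has variance bound 4.0
    # and the coordinates are uncorrelated.
    inv_fisher = [[4.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    print(distance_crlb((0.0, 0.0, 3.0, 4.0), inv_fisher))
```

For an inverse Fisher information matrix of the form v·I (uncorrelated coordinates with a common variance bound v), the Jacobian has squared norm 2, so the bound on the standard deviation of d is √(2v) regardless of the orientation of the two point sources.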

3 Fisher information matrix for the simultaneous detection approach

In the simultaneous detection approach, the acquired image is assumed to contain the signal from both objects. Hence the photon detection rate Λ_θ and the photon distribution profile f_θ,τ can be written as

Λ_{θ} (τ) = Λ_{θ, 1} (τ) + Λ_{θ, 2} (τ), θ \in Θ, τ \geq t_{0},

(3)

f_{θ, τ} (r) = ε_{θ, 1} (τ) f_{θ, τ, 1} (r) + ε_{θ, 2} (τ) f_{θ, τ, 2} (r), r = (x, y) \in 𝒞, θ \in Θ, τ \geq t_{0},

(4)

where 𝒞 denotes the detector, Λ_θ,1, Λ_θ,2 and f_θ,τ,1, f_θ,τ,2 denote the photon detection rates and the photon distribution profiles of the two objects, respectively, and ε_θ,i(τ) ≔ Λ_θ,i(τ)/(Λ_θ,1 (τ) + Λ_{θ, 2} (τ)), θ ∈ Θ, τ ≥ t₀, i = 1, 2.

The results in this section are divided into two parts. In Section 3.1, we first derive general expressions of the Fisher information matrix for the simultaneous detection approach (Theorem 1). Here, we make no assumptions regarding the specific functional form of the photon detection rates Λ_θ,i or the photon distribution profiles f_θ,τ,i, i = 1, 2. Hence these results provide a general framework that is applicable to a wide variety of imaging scenarios.

In Section 3.2, we consider a concrete scenario (spatially invariant case) in optical microscopy where we assume a specific functional form for the photon distribution profiles f_θ,τ,i, i = 1, 2, which are expressed as a scaled and shifted version of the image of the objects. We then derive the Fisher information matrix for this functional form of f_θ,τ,i, i = 1, 2 (Theorem 2). As will be shown, the resulting Fisher information matrix can be expressed as a product decomposition of the form DCD^T, where D is an orthogonal matrix and C is a positive semidefinite matrix. Under weak assumptions of spatial symmetry for the image of the objects (which are typically satisfied in most situations), the product decomposition greatly simplifies the calculation of the Fisher information matrix and also facilitates the derivation of an analytical expression for the inverse Fisher information matrix (Corollary 1).

3.1 General expression of the Fisher information matrix

In many imaging applications, the unknown parameter vector θ can be expressed as θ = (θ_f, θ_Λ), where θ_f denotes the spatial component and θ_Λ denotes the temporal component. The spatial component θ_f typically consists of parameters that specify the location of one or more objects and the temporal component θ_Λ consists of parameters that specify the photon detection rates of the objects.

In the following theorem, we express the Fisher information matrix as a 2 × 2 block matrix. The terms in the leading diagonal (i.e., S_sim and T_sim) correspond to the Fisher information matrix of the spatial θ_f and temporal θ_Λ components while the terms in the off-diagonal (i.e., R_sim and $R_{sim}^{T}$ ) correspond to the coupling between the spatial and temporal components. We derive expressions for three practical scenarios. In the first scenario, we derive a general expression for the Fisher information matrix. In the second scenario, we consider the case where the photon detection rates are related to one another by a known scalar function β, i.e., β(τ)Λ_θ,1 (τ) = Λ_θ,2 (τ) for τ ≥ t₀ and θ ∈ Θ, where β(τ) ≥ 0. In some applications, the photon detection rates of the objects are assumed to be the same, i.e., Λ_θ,1 (τ) = Λ_θ,2(τ), τ ≥ t₀. We note that this condition is a special case of the second scenario considered here with β(τ) = 1, τ ≥ t₀. For this scenario we show that the Fisher information matrix becomes block diagonal, which implies that the spatial θ_f and temporal θ_Λ components become decoupled. We note that this decoupling simplifies the subsequent analysis of the Fisher information matrix. In the third scenario, we assume that the photon distribution profiles of the objects are equal, i.e., f_θ,τ,1 (r) = f_θ,τ,2 (r) for r ∈ 𝒞, θ ∈ Θ and τ ≥ t₀. This scenario arises in many applications, where the image profiles of the objects are assumed to be identical. For this scenario also we show that the Fisher information matrix becomes block diagonal.

Theorem 1 Let Θ ⊆ ℝⁿ. For θ ≔ (θ_f, θ_Λ) ∈ Θ, let 𝒢(Λ_θ, {f_θ,τ}_τ≥t₀, 𝒞) be an image detection process, where Λ_θ and f_θ,τ are defined in eqs. 3 and 4, respectively. Assume that for θ ∈ Θ, τ ≥ t₀ and i = 1, 2,

A1 (∂f_θ,τ,i(r)/∂θ_Λ) = 0, r ∈ 𝒞,
A2 (∂Λ_θ,i(τ)/∂θ_f) = 0.
- 1.
  Then the Fisher information matrix of 𝒢 corresponding to the acquisition time interval [t₀, t] for the simultaneous detection approach is given by
  $I_{sim} (θ) = [\begin{matrix} S_{sim} (θ) & R_{sim} (θ) \\ R_{sim}^{T} (θ) & T_{sim} (θ) \end{matrix}], θ \in Θ,$
  where for θ ∈ Θ,
  $S_{sim} (θ) ≔ \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ} (τ)}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} \frac{\partial f_{θ, τ} (r)}{\partial θ_{f}} drd τ,$ (5)
  
  $R_{sim} (θ) ≔ \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ} (τ)}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} drd τ,$ (6)
  
  $T_{sim} (θ) ≔ \int_{t_{0}}^{t} \frac{1}{Λ_{θ} (τ)} {(\frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} d τ + \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ} (τ)}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}})}^{T} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} drd τ .$ (7)
- 2.
  For β(τ) ≥ 0, τ ≥ t₀, assume, in addition to A1 and A2, that
A3 β(τ)Λ_θ,1(τ) = Λ_θ,2(τ), τ ≥ t₀ and θ ∈ Θ.

Then the Fisher information matrix of 𝒢 corresponding to the acquisition time interval [t₀, t] for the simultaneous detection approach is given by
$I_{sim} (θ) = [\begin{matrix} {S̃}_{sim} (θ) & 0 \\ 0 & {T̃}_{sim} (θ) \end{matrix}], θ \in Θ,$
where for θ ∈ Θ,
${S̃}_{sim} (θ) ≔ \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ, 1} (τ)}{f_{θ, τ, 1} (r) + β (τ) f_{θ, τ, 2} (r)} {(\frac{\partial [f_{θ, τ, 1} (r) + β (τ) f_{θ, τ, 2} (r)]}{\partial θ_{f}})}^{T} \times \frac{\partial [f_{θ, τ, 1} (r) + β (τ) f_{θ, τ, 2} (r)]}{\partial θ_{f}} drd τ,$

${T̃}_{sim} (θ) ≔ \int_{t_{0}}^{t} \frac{1 + β (τ)}{Λ_{θ, 1} (τ)} {(\frac{\partial Λ_{θ, 1} (τ)}{\partial θ_{Λ}})}^{T} \frac{\partial Λ_{θ, 1} (τ)}{\partial θ_{Λ}} d τ .$
- 3.
  For θ ∈ Θ and τ ≥ t₀, assume, in addition to A1 and A2, that
A4 f_θ,τ,1(r) = f_θ,τ,2(r) for r ∈ 𝒞.

Then the Fisher information matrix of 𝒢 corresponding to the acquisition time interval [t₀, t] for the simultaneous detection approach is given by

I_{sim} (θ) = [\begin{matrix} {S̄}_{sim} (θ) & 0 \\ 0 & {T̄}_{sim} (θ) \end{matrix}], θ \in Θ,

where for θ ∈ Θ,

{S̄}_{sim} (θ) ≔ \int_{t_{0}}^{t} Λ_{θ} (τ) d τ \int_{𝒞} \frac{1}{f_{θ, τ, 1} (r)} {(\frac{\partial f_{θ, τ, 1} (r)}{\partial θ_{f}})}^{T} \frac{\partial f_{θ, τ, 1} (r)}{\partial θ_{f}} dr,

{T̄}_{sim} (θ) ≔ \int_{t_{0}}^{t} \frac{1}{Λ_{θ} (τ)} {(\frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} d τ .

Proof See Section A.1 in Appendix for proof.
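As a worked consistency check on results 2 and 3 of Theorem 1, consider an illustrative special case that is our own assumption rather than an example from the paper: both objects share the same constant but unknown photon detection rate Λ₀, so that θ_Λ = Λ₀ and β(τ) = 1. If the photon distribution profiles are also identical (A4), both results apply and should yield the same temporal block.

```latex
% Assumption (illustrative): \Lambda_{\theta,1}(\tau) = \Lambda_{\theta,2}(\tau) = \Lambda_0,
% \theta_\Lambda = \Lambda_0, \beta(\tau) = 1, so that \Lambda_\theta(\tau) = 2 \Lambda_0.
\tilde{T}_{sim}(\theta)
  = \int_{t_0}^{t} \frac{1 + 1}{\Lambda_0}
    \left( \frac{\partial \Lambda_0}{\partial \Lambda_0} \right)^{2} d\tau
  = \frac{2 (t - t_0)}{\Lambda_0},
\qquad
\bar{T}_{sim}(\theta)
  = \int_{t_0}^{t} \frac{1}{2 \Lambda_0}
    \left( \frac{\partial (2 \Lambda_0)}{\partial \Lambda_0} \right)^{2} d\tau
  = \frac{2 (t - t_0)}{\Lambda_0},
\qquad
\left[ I_{sim}^{-1}(\theta) \right]_{\Lambda_0 \Lambda_0}
  = \frac{\Lambda_0}{2 (t - t_0)} .
```

Because the matrix is block diagonal, the bound for Λ₀ is simply the reciprocal of the temporal block. It coincides with the variance of the estimator N/(2(t − t₀)), where N is the total photon count, which is Poisson distributed with mean 2Λ₀(t − t₀).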

In many applications it is important to know whether the Fisher information matrix I(θ) is (block) diagonal. For instance, it is well known that under certain conditions the maximum likelihood estimator of a vector parameter θ is asymptotically Gaussian distributed with mean θ and covariance I⁻¹(θ) (see Van den Bos (2007)). From the above Theorem, we see that if the photon detection rates can be expressed as a scalar function of one another or if the photon distribution profiles are identical, then I(θ) becomes block diagonal. This implies that the maximum likelihood estimates of the spatial (θ_f) and temporal (θ_Λ) components of the unknown vector parameter θ are asymptotically independent. Moreover, if an efficient estimator of θ exists (i.e., an estimator whose covariance matrix is equal to I⁻¹(θ), θ ∈ Θ), then the estimates of θ_f and θ_Λ are uncorrelated. Another implication of block diagonality is that the Cramer-Rao lower bound of the spatial component θ_f is independent of the number of unknown parameters in the temporal component θ_Λ, and vice versa.

Remark 1 In result 2 of Theorem 1, we showed that the Fisher information matrix I_sim(θ) is block diagonal if β(τ)Λ_θ,1(τ) = Λ_θ,2(τ) for τ ≥ t₀ and θ ∈ Θ, where β(τ) ≥ 0, τ ≥ t₀, is a known scalar function. We note that I_sim(θ) will also be block diagonal when the roles of the two rates are interchanged, i.e., when Λ_θ,1(τ) = β(τ)Λ_θ,2(τ), τ ≥ t₀ and θ ∈ Θ for β(τ) ≥ 0, τ ≥ t₀.

3.2 Fisher information matrix for the spatially invariant case

We next investigate a concrete scenario in optical microscopy where the image of the objects is spatially invariant, and we derive the Fisher information matrix for the simultaneous detection approach. Here, we introduce a specific parameterization of the spatial component θ_f of the parameter vector θ given by θ_f = θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, where (x₀₁, y₀₁) and (x₀₂, y₀₂) denote the Cartesian coordinates of the two objects, and Θ_c is the parameter space that is an open subset of ℝ⁴. We consider the infinitely large detector 𝒞 = ℝ². For any given imaging condition, this infinite detector provides the best case scenario, where all the photons that reach the detector plane are detected.

In many microscopy applications, the image of an object can be considered to be invariant with respect to shifts in the object location (Young (1996)). In the present context, the photon distribution profile f_{θ_c,τ,i}, i = 1, 2, can be expressed as a scaled and shifted version of the image of the object and is given by

f_{θ_{c}, τ, i} (r) = \frac{1}{M^{2}} q_{i} (\frac{x}{M} - x_{0 i}, \frac{y}{M} - y_{0 i}), r = (x, y) \in ℝ^{2},

(8)

where θ_c ∈ Θ_c, τ ≥ t₀, i = 1, 2, M denotes the total lateral magnification of the optical system, and q_i denotes the image function of the i^th object, i = 1, 2. An image function q is defined as the image of an object at unit magnification when the object is located at the origin of the coordinate axes. By definition, f_{θ_c,τ,i}, i = 1, 2, is assumed to satisfy the regularity conditions that are necessary for the calculation of the Fisher information matrix. Hence we impose appropriate conditions on the image functions, which are given in Definition 6 (see Appendix).
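The scaling and shifting in eq. 8 can be sketched as follows. The Gaussian image function, its width and the function names are assumptions made purely for illustration (eq. 8 itself places no such restriction on q_i). Object-space coordinates (x₀, y₀) are mapped to detector coordinates through the magnification M, and the factor 1/M² keeps the profile normalized on the detector.

```python
import math


def gaussian_image_function(x, y, sigma=0.1):
    """Assumed image function q: image of an object at the origin at unit magnification."""
    norm = 2.0 * math.pi * sigma ** 2
    return math.exp(-(x * x + y * y) / (2.0 * sigma ** 2)) / norm


def photon_distribution_profile(x, y, x0, y0, magnification, q=gaussian_image_function):
    """Eq. 8: scaled and shifted version of the image function q.

    (x, y)   : detector coordinates
    (x0, y0) : object location in object space
    """
    m = magnification
    return q(x / m - x0, y / m - y0) / m ** 2


if __name__ == "__main__":
    m, x0, y0 = 100.0, 0.5, -0.2
    # The profile peaks at the detector position (M * x0, M * y0).
    print(photon_distribution_profile(m * x0, m * y0, x0, y0, m))
```

With M = 100 and an object at (0.5, −0.2), the profile is centered at (50, −20) on the detector and is 100 times wider than the image function, while its integral over the detector remains equal to one.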

In many imaging experiments, the temporal component θ_Λ of the vector parameter θ is either assumed to be known or the photon detection rates are unknown but assumed to be equal (Λ_θ,1(τ) = Λ_θ,2(τ), τ ≥ t₀). In the former case, the Fisher information matrix of the simultaneous detection approach I_sim(θ) trivially reduces to that of the spatial component θ_f i.e., I_sim(θ) = S_sim(θ), θ ∈ Θ. In the latter case, the Fisher information matrices of the spatial and temporal components are decoupled as shown in Result 2 of Theorem 1. Therefore in this section, we focus our analysis on the Fisher information matrix for the spatial component θ_f.

Without loss of generality, we assume that the photon detection rates of the objects are known, and hence we have

Λ_{θ_{c}} (τ) = Λ_{1} (τ) + Λ_{2} (τ), τ \geq t_{0}, θ_{c} \in Θ_{c},

(9)

where Λ₁ and Λ₂ denote the photon detection rates of the two objects. Further, the photon distribution profile f_θ,τ is given by

f_{θ_{c}, τ} (r) ≔ ε_{1} (τ) f_{θ_{c}, τ, 1} (r) + ε_{2} (τ) f_{θ_{c}, τ, 2} (r), r \in ℝ^{2}, θ_{c} \in Θ_{c}, τ \geq t_{0} .

(10)

where ε_i(τ) = Λ_i(τ)/(Λ₁(τ) + Λ₂(τ)), and f_{θ_c,τ,i} is given by eq. 8 for i = 1, 2, τ ≥ t₀ and θ_c ∈ Θ_c.

In the next Theorem we derive an analytical expression of the Fisher information matrix for the spatial component θ_c pertaining to the specific parameterization of the photon detection rate Λ_{θ_c} and the photon distribution profile f_{θ_c,τ} given in eqs. 9 and 10, respectively. Here, we express the Fisher information matrix S_sim(θ_c) as a 2 × 2 block matrix. As we shall see in Section 4, this expression will be used to analyze its relationship with the Fisher information matrix for the localization accuracy problem. We also derive a product decomposition for S_sim(θ_c). This decomposition simplifies the calculation of the inverse of S_sim(θ_c) and enables us to obtain an analytical expression for the same (Corollary 1).

Theorem 2 Let Θ_c ⊆ ℝ⁴. For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, let 𝒢(Λ_{θ_c}, {f_{θ_c,τ}}_τ≥t₀, 𝒞) be an image detection process, where Λ_{θ_c} and f_{θ_c,τ} are given by eqs. 9 and 10, respectively.

1.
For θ_c ∈ Θ_c, the Fisher information matrix of the spatial component corresponding to the acquisition time interval [t₀, t] for the simultaneous detection approach is given by

S_{sim} (θ_{c}) = (\begin{matrix} K_{11} (θ_{c}) & K_{12} (θ_{c}) \\ K_{12}^{T} (θ_{c}) & K_{22} (θ_{c}) \end{matrix}),

(11)

where for θ_c ∈ Θ_c and i, j = 1, 2,

K_{ij} (θ_{c}) ≔ \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{i} (τ) Λ_{j} (τ)}{Λ_{1} (τ) q_{1} (x - x_{01}, y - y_{01}) + Λ_{2} (τ) q_{2} (x - x_{02}, y - y_{02})} \times (\begin{matrix} \frac{\partial q_{i} (x - x_{0 i}, y - y_{0 i})}{\partial x} \frac{\partial q_{j} (x - x_{0 j}, y - y_{0 j})}{\partial x} & \frac{\partial q_{i} (x - x_{0 i}, y - y_{0 i})}{\partial x} \frac{\partial q_{j} (x - x_{0 j}, y - y_{0 j})}{\partial y} \\ \frac{\partial q_{i} (x - x_{0 i}, y - y_{0 i})}{\partial y} \frac{\partial q_{j} (x - x_{0 j}, y - y_{0 j})}{\partial x} & \frac{\partial q_{i} (x - x_{0 i}, y - y_{0 i})}{\partial y} \frac{\partial q_{j} (x - x_{0 j}, y - y_{0 j})}{\partial y} \end{matrix}) dxdyd τ .

(12)

2.
Let $d = \sqrt{{(x_{02} - x_{01})}^{2} + {(y_{02} - y_{01})}^{2}}$ and define $Θ_{c}^{0} = \{(x_{01}, y_{01}, x_{02}, y_{02}) | (x_{01}, y_{01}) = (x_{02}, y_{02})\}$. Then for $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$, the Fisher information matrix S_sim(θ_c) given in result 1 of this Theorem can be written as

S_{sim} (θ_{c}) = D (θ_{c}) C (θ_{c}) D^{T} (θ_{c}),

where for $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$,

D (θ_{c}) ≔ (\begin{matrix} D̃ (θ_{c}) & 0 \\ 0 & D̃ (θ_{c}) \end{matrix}), D̃ (θ_{c}) ≔ \frac{1}{d} (\begin{matrix} x_{02} - x_{01} & - (y_{02} - y_{01}) \\ y_{02} - y_{01} & x_{02} - x_{01} \end{matrix}),

(13)

C (θ_{c}) ≔ (\begin{matrix} C_{11} (θ_{c}) & C_{12} (θ_{c}) \\ C_{12}^{T} (θ_{c}) & C_{22} (θ_{c}) \end{matrix}),

(14)

C_{ij} (θ_{c}) ≔ \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{i} (τ) Λ_{j} (τ)}{Λ_{1} (τ) q_{1} (x + \frac{d}{2}, y) + Λ_{2} (τ) q_{2} (x - \frac{d}{2}, y)} \times (\begin{matrix} q_{i, x}^{'} (x, y) q_{j, x}^{'} (x, y) & q_{i, x}^{'} (x, y) q_{j, y}^{'} (x, y) \\ q_{i, y}^{'} (x, y) q_{j, x}^{'} (x, y) & q_{i, y}^{'} (x, y) q_{j, y}^{'} (x, y) \end{matrix}) dxdyd τ, i, j = 1, 2,

(15)

with

q_{i, ζ}^{'} (x, y) ≔ \begin{cases} \frac{\partial q_{1} (x + \frac{d}{2}, y)}{\partial ζ}, & i = 1, \\ \frac{\partial q_{2} (x - \frac{d}{2}, y)}{\partial ζ}, & i = 2, \end{cases} \quad (x, y) \in ℝ^{2}, ζ \in \{x, y\} .

(16)

3.
Assume that q₁ and q₂ are symmetric along the y axis with respect to y = 0, i.e., q_i(x, y) = q_i(x, −y), (x, y) ∈ ℝ² and i = 1, 2. Then for $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$ and i, j = 1, 2, C_ij(θ_c) is given by
$C_{ij} (θ_{c}) ≔ \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{i} (τ) Λ_{j} (τ)}{Λ_{1} (τ) q_{1} (x + \frac{d}{2}, y) + Λ_{2} (τ) q_{2} (x - \frac{d}{2}, y)} \times (\begin{matrix} q_{i, x}^{'} (x, y) q_{j, x}^{'} (x, y) & 0 \\ 0 & q_{i, y}^{'} (x, y) q_{j, y}^{'} (x, y) \end{matrix}) dxdyd τ .$ (17)

Proof Substituting for f_{θ_c,τ} and Λ_{θ_c} in the expression for I_ff (θ) given by eq. 5 (see result 1 of Theorem 1) and using Lemma 2, we obtain result 1. For proof of results 2 and 3, please see Section A.2 in Appendix.

In result 1 of the above Theorem, we obtained a block matrix representation of the Fisher information matrix S_sim(θ_c). The leading diagonal terms correspond to the individual contributions from the two objects and the off-diagonal terms correspond to the coupling between the two objects. As we will show in the next Section, the coupling plays an important role in the analysis of the relationship between the Fisher information matrix for the simultaneous detection approach and that for the localization accuracy problem.

The product decomposition D(θ_c)C(θ_c)D^T (θ_c) of S_sim(θ_c) that we obtained in result 2 of the above Theorem has an interesting structure. The matrix C(θ_c) is a special case of S_sim(θ_c) where the y coordinates of the two objects are assumed to be the same, i.e., y₀₂ = y₀₁, and the x coordinates of the two objects are equidistant from the origin. Note that the matrix D(θ_c) is orthogonal (i.e., D⁻¹(θ_c) = D^T (θ_c)). It should be pointed out that the product decomposition holds only when (x₀₁, y₀₁) ≠ (x₀₂, y₀₂), i.e., when the distance d is not equal to zero, since at (x₀₁, y₀₁) = (x₀₂, y₀₂) the matrix D(θ_c) is not defined. An implication of this product decomposition is that for a given $θ_{c}^{s} = (x_{01}^{s}, y_{01}^{s}, x_{02}^{s}, y_{02}^{s})$ such that $(x_{01}^{s}, y_{01}^{s}) \neq (x_{02}^{s}, y_{02}^{s})$ , the Fisher information matrix for $θ_{c}^{s}$ can be obtained by first computing the Fisher information matrix for $(- \frac{d}{2}, 0, \frac{d}{2}, 0)$ and then pre- and post-multiplying it with $D (θ_{c}^{s})$ and $D^{T} (θ_{c}^{s})$ , respectively, where d denotes the distance between the two objects. In many practical situations, the image of the objects is symmetric along the y (and the x) axis. As shown in result 3 of Theorem 2, when this condition is satisfied, several entries of the matrix C(θ_c) become zero, which in turn simplifies the calculation of C(θ_c).
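The orthogonality of D(θ_c) is easy to confirm numerically. The following is a minimal sketch in plain Python that builds D̃(θ_c) and D(θ_c) from eq. 13 and checks that D(θ_c)D^T(θ_c) is the identity. The function names and the coordinate values are illustrative assumptions and are not part of the original analysis.

```python
import math

def d_tilde(x01, y01, x02, y02):
    """2x2 block of eq. 13; only defined for two distinct locations."""
    dx, dy = x02 - x01, y02 - y01
    d = math.hypot(dx, dy)
    if d == 0.0:
        raise ValueError("D is undefined when the two locations coincide")
    return [[dx / d, -dy / d],
            [dy / d, dx / d]]

def d_matrix(x01, y01, x02, y02):
    """4x4 block-diagonal matrix D(theta_c) = diag(D_tilde, D_tilde)."""
    t = d_tilde(x01, y01, x02, y02)
    z = [0.0, 0.0]
    return [t[0] + z, t[1] + z, z + t[0], z + t[1]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

# Hypothetical coordinates (in nm), chosen only for illustration.
D = d_matrix(10.0, -5.0, 40.0, 35.0)
DDt = matmul(D, transpose(D))  # should equal the 4x4 identity
```

Because D̃(θ_c) is a planar rotation, the same check succeeds for any pair of distinct locations, while coincident locations raise an error, mirroring the restriction to Θ_c \ Θ_c^0.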

Remark 2 Consider the scenario when the distance between the two objects is zero, i.e. x₀₁ = x₀₂ and y₀₁ = y₀₂. For this scenario, the Fisher information matrix S_sim(θ_c) given in result 1 of Theorem 2 is singular, if the photon detection rates and the image functions of the two objects are identical, i.e., Λ₁ = Λ₂ and q₁ = q₂ (also see Section 4.1). However, for distinct photon detection rates and image functions, S_sim(θ_c) will, in general, be invertible even when the distance between the objects is zero.

In the following Corollary, we make use of the product decomposition of the Fisher information matrix S_sim(θ_c) and the orthogonality of D(θ_c) to obtain an analytical expression for the inverse of S_sim(θ_c) when the distance d between the objects is non-zero.

Corollary 1 Define $Θ_{c}^{0} = {(x_{01}, y_{01}, x_{02}, y_{02}) | (x_{01}, y_{01}) = (x_{02}, y_{02})}$ . For $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$ , let S_sim(θ_c) be given by result 2 of Theorem 2, D(θ_c) be given by eq. 13 and C_ij(θ_c), i, j = 1, 2, be given by eq. 17. Assume that q₁ and q₂ are symmetric along the y axis with respect to y = 0, i.e., q_i(x, y) = q_i(x, −y), (x, y) ∈ ℝ², i = 1, 2. Then for $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$ , we have

S_{sim}^{- 1} (θ_{c}) = D (θ_{c}) H (θ_{c}) D^{T} (θ_{c}),

where for $θ_{c} \in Θ_{c} \setminus Θ_{c}^{0}$ ,

H (θ_{c}) = (\begin{matrix} Γ (θ_{c}) & 0 \\ 0 & Γ (θ_{c}) \end{matrix}) (\begin{matrix} C_{22} (θ_{c}) & - C_{12} (θ_{c}) \\ - C_{12} (θ_{c}) & C_{11} (θ_{c}) \end{matrix}) (\begin{matrix} Γ (θ_{c}) & 0 \\ 0 & Γ (θ_{c}) \end{matrix}), Γ (θ_{c}) ≔ (\begin{matrix} \frac{1}{\sqrt{Σ_{11} (θ_{c})}} & 0 \\ 0 & \frac{1}{\sqrt{Σ_{22} (θ_{c})}} \end{matrix}),

(18)

with

Σ_{ii} (θ_{c}) ≔ {[C_{11} (θ_{c})]}_{ii} {[C_{22} (θ_{c})]}_{ii} - {({[C_{12} (θ_{c})]}_{ii})}^{2}, i = 1, 2, θ_{c} \in Θ_{c} \setminus Θ_{c}^{0} .

(19)

Proof The expression for $S_{sim}^{- 1} (θ_{c})$ is obtained by making use of the product decomposition of S_sim(θ_c) and using the expression for the inverse of a block matrix (Zhang (1999)).
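The inverse formula of Corollary 1 can be checked numerically. The sketch below, in plain Python, assembles C(θ_c) from diagonal blocks (the form given by eq. 17), builds H(θ_c) from eqs. 18 and 19, and confirms that D H D^T inverts D C D^T. The diagonal entries of the blocks and the rotation angle are arbitrary illustrative numbers, not values computed from an image function.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(r) for r in zip(*a)]

def blocks_to_4x4(p11, p12, p21, p22):
    """Assemble a 4x4 matrix from four diagonal 2x2 blocks given as (u, v) pairs."""
    return [[p11[0], 0.0, p12[0], 0.0],
            [0.0, p11[1], 0.0, p12[1]],
            [p21[0], 0.0, p22[0], 0.0],
            [0.0, p21[1], 0.0, p22[1]]]

# Arbitrary diagonal blocks C11, C12, C22 (hypothetical values).
c11, c12, c22 = (4.0, 3.0), (1.0, 0.5), (5.0, 2.0)
C = blocks_to_4x4(c11, c12, c12, c22)

# Eq. 19: Sigma_ii = [C11]_ii [C22]_ii - ([C12]_ii)^2.
sigma = [c11[i] * c22[i] - c12[i] ** 2 for i in range(2)]
gamma = [1.0 / math.sqrt(s) for s in sigma]
zero = (0.0, 0.0)
G = blocks_to_4x4(gamma, zero, zero, gamma)
neg12 = tuple(-v for v in c12)
M = blocks_to_4x4(c22, neg12, neg12, c11)
H = matmul(matmul(G, M), G)  # eq. 18

# D(theta_c) for two objects on a line at 30 degrees to the x-axis.
phi = math.radians(30.0)
c, s = math.cos(phi), math.sin(phi)
D = [[c, -s, 0.0, 0.0], [s, c, 0.0, 0.0],
     [0.0, 0.0, c, -s], [0.0, 0.0, s, c]]

S = matmul(matmul(D, C), transpose(D))
S_inv = matmul(matmul(D, H), transpose(D))
P = matmul(S, S_inv)
residual = max(abs(P[i][j] - (1.0 if i == j else 0.0))
               for i in range(4) for j in range(4))
```

The residual is at the level of floating-point round-off, which reflects the fact that diagonal blocks commute, so the block-matrix inverse reduces to an entrywise 2 × 2 inverse.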

4 Simultaneous detection approach and the localization accuracy problem

In many optical microscopy applications, one of the central questions concerns the accuracy with which the location of a microscopic object (e.g., single molecule, biological sub-cellular structure such as a vesicle) can be determined, since this has several implications on the nature and type of studies that can be carried out (see Wong et al (2011); Ober et al (2004b)). The Fisher information matrix for the problem of estimating the location of the i^th object from its image is given by (see Ram et al (2006b); Ober et al (2004b))

Q_{i} ≔ \int_{t_{0}}^{t} Λ_{i} (τ) d τ \int_{ℝ^{2}} \frac{1}{q_{i} (x, y)} (\begin{matrix} {(\frac{\partial q_{i} (x, y)}{\partial x})}^{2} & \frac{\partial q_{i} (x, y)}{\partial x} \frac{\partial q_{i} (x, y)}{\partial y} \\ \frac{\partial q_{i} (x, y)}{\partial x} \frac{\partial q_{i} (x, y)}{\partial y} & {(\frac{\partial q_{i} (x, y)}{\partial y})}^{2} \end{matrix}) dxdy,

(20)

where q_i and Λ_i denote the image function and the photon detection rate of the i^th object, respectively, for i = 1, 2. The above equation was derived using the same stochastic framework used in this paper, and it is assumed that the image contains signal from only the i^th object.

In the following theorem we show how the Fisher information matrix S_sim(θ_c) for the spatially invariant case of the simultaneous detection approach (Theorem 2) is related to the Fisher information matrix for the localization accuracy problem. Specifically, we show that when the distance tends to infinity, the Fisher information matrix S_sim(θ_c) becomes equivalent to that of two independent localization accuracy problems.

Theorem 3 For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, let S_sim(θ_c) be given by result 1 of Theorem 2. For i = 1, 2, let Q_i be given by eq. 20. Let Λ₁ and Λ₂, and q₁ and q₂ denote the photon detection rates and the image functions of the two objects, respectively. Assume that for i = 1, 2, ζ ∈ {x, y} and y ∈ ℝ,

A1 ${lim}_{x \to \pm \infty} q_{i} (x, y) = 0,$
A2 ${lim}_{x \to \pm \infty} \frac{\partial q_{2} (x, y)}{\partial ζ} = 0 .$

Then

S_{sim}^{inf} ≔ lim_{x_{02} \to \infty} S_{sim} (θ_{c}) = lim_{x_{02} \to \infty} (\begin{matrix} K_{11} (θ_{c}) & K_{12} (θ_{c}) \\ K_{12}^{T} (θ_{c}) & K_{22} (θ_{c}) \end{matrix}) = (\begin{matrix} Q_{1} & 0 \\ 0 & Q_{2} \end{matrix}),

where K_ij(θ_c), i, j = 1, 2 is given by eq. 12.

Proof See Section A.3 in Appendix for proof.

We would like to point out that in deriving the above result we assumed x₀₂ to go to infinity. In general, the above result will hold when any one of the other coordinates, i.e., x₀₁, y₀₁ or y₀₂, is assumed to go to infinity. From the above Theorem we see that as the distance of separation becomes sufficiently large, the leading diagonal terms (K₁₁(θ_c) and K₂₂(θ_c)) of the Fisher information matrix S_sim(θ_c) for the simultaneous detection approach reduce to those of the localization accuracy problem for the two point sources (i.e., Q₁ and Q₂), and the off-diagonal term K₁₂(θ_c) goes to zero. Note that the off-diagonal term represents the coupling between the two point sources.

From a practical standpoint, the knowledge of the behavior of the off-diagonal term as a function of the distance would enable the experimenter to determine whether it is necessary to calculate the full Fisher information matrix for the simultaneous detection approach or to only calculate the Fisher information matrix for the localization accuracy problem. As we will see in the next section the latter is typically much easier to calculate, since a closed form analytical expression can be obtained.
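The decay of the coupling term with distance can be illustrated with a deliberately simplified model. The sketch below is not the 2D Airy calculation of this paper: it assumes a one-dimensional Gaussian image function of width `sigma` and equal, constant photon counts, and computes the 2 × 2 Fisher information matrix for the two positions by trapezoidal integration. All names and parameter values are illustrative assumptions.

```python
import math

def gauss(u, sigma):
    return math.exp(-0.5 * (u / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def dgauss(u, sigma):
    return -u / sigma ** 2 * gauss(u, sigma)

def fisher_1d(d, n_photons=1.0, sigma=1.0, step=0.01, pad=8.0):
    """Return (F11, F12, F22) for two 1D Gaussian profiles at -d/2 and +d/2.

    Each object contributes n_photons expected photons; the entries are
    the 1D analogue of K_ij in eq. 12.
    """
    x1, x2 = -d / 2.0, d / 2.0
    lo, hi = x1 - pad * sigma, x2 + pad * sigma
    n = int(round((hi - lo) / step))
    h = (hi - lo) / n
    f11 = f12 = f22 = 0.0
    for k in range(n + 1):
        x = lo + k * h
        w = 0.5 if k in (0, n) else 1.0  # trapezoidal end-point weights
        q1, q2 = gauss(x - x1, sigma), gauss(x - x2, sigma)
        g1, g2 = dgauss(x - x1, sigma), dgauss(x - x2, sigma)
        denom = q1 + q2
        f11 += w * g1 * g1 / denom
        f12 += w * g1 * g2 / denom
        f22 += w * g2 * g2 / denom
    scale = n_photons * h
    return f11 * scale, f12 * scale, f22 * scale

for d in (0.0, 1.0, 3.0, 10.0):
    f11, f12, f22 = fisher_1d(d)
    print(f"d = {d:5.1f}  F11 = {f11:.6f}  F12 = {f12:.6f}  det = {f11 * f22 - f12 ** 2:.6f}")
```

In this stand-in model the determinant vanishes at d = 0 (the singular case of Remark 2), and for d much larger than `sigma` the diagonal entries approach the single-object value n_photons/sigma² while the off-diagonal entry tends to zero, which is the behavior stated in Theorem 3.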

4.1 Example 1

We next illustrate the results derived in the prior sections by considering a specific image function and calculate the Fisher information matrix for the simultaneous detection approach and for the localization accuracy problem. Here, we make use of the Cramer-Rao inequality to obtain a lower limit to the accuracy (i.e., standard deviation) of the estimates of the parameters of interest (see below). We assume the photon detection rates to be constant and equal i.e., Λ₁(τ) = Λ₂(τ) = Λ₀, τ ≥ t₀. We also assume the image functions to be identical and be given by the Airy profile, which, according to optical diffraction theory describes the image of an in-focus point source that is illuminated by incoherent, unpolarized light (Born and Wolf (1999)). The analytical expression for the image functions can be written as

q_{1} (x, y) = q_{2} (x, y) ≔ \frac{J_{1}^{2} (\frac{2 π n_{a}}{λ} \sqrt{x^{2} + y^{2}})}{π (x^{2} + y^{2})}, (x, y) \in ℝ^{2},

(21)

where J₁ denotes the first order Bessel function of the first kind, n_a > 0 denotes the numerical aperture of the objective lens used to image the point source and λ > 0 denotes wavelength of the detected photons.

By making use of the Cramer-Rao inequality, we define three different quantities, namely the 2D fundamental resolution measure (FREM) for the simultaneous detection approach, the limit to the accuracy of the location coordinates for the simultaneous detection approach, and the fundamental limit to the localization accuracy. Then in Corollary 2, we consider two limiting cases of the distance parameter d, i.e., d → 0 and d → ∞, and derive analytical expressions of the 2D FREM for the simultaneous detection approach. In Section 4.1.1, we numerically calculate the above quantities for different values of d and discuss their implications.

Definition 1 The 2D FREM for the simultaneous detection approach is defined as $δ_{d}^{sim} ≔ \sqrt{I_{sim}^{- 1} (d)}, d \in [0, \infty)$ , where $I_{sim}^{- 1} (d)$ is obtained by substituting $S_{sim}^{- 1} (θ_{c})$ (Corollary 1) in the transformation formula given by eq. 2.

Definition 2 The limits to the accuracy of the location coordinates x_0i and y_0j for the simultaneous detection approach are defined as $δ_{x_{0 i}}^{sim} ≔ \sqrt{{[S_{sim}^{- 1} (θ_{c})]}_{(2 i - 1) (2 i - 1)}}$ and $δ_{y_{0 j}}^{sim} ≔ \sqrt{{[S_{sim}^{- 1} (θ_{c})]}_{(2 j) (2 j)}}$ , respectively, where i, j = 1, 2 and $S_{sim}^{- 1} (θ_{c})$ denotes the inverse Fisher information matrix given by Corollary 1 for θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c.

Definition 3 The fundamental limit to the localization accuracy of the x-coordinate of the i^th object is defined as $δ_{x}^{loc, i} ≔ \sqrt{{[Q_{i}^{- 1}]}_{11}}$ , i = 1, 2, and for the y-coordinate it is defined as $δ_{y}^{loc, i} ≔ \sqrt{{[Q_{i}^{- 1}]}_{22}}$ , i = 1, 2, where Q_i is given by eq. 20, for i = 1, 2.

For the specific image functions and photon detection rates considered in this example, it can be shown that (see Ober et al (2004b))

δ^{loc} = δ_{x}^{loc, i} = δ_{y}^{loc, i} ≔ \frac{λ}{2 π n_{a} \sqrt{Λ_{0} (t - t_{0})}}, i = 1, 2 .

(22)
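To give a sense of scale, eq. 22 can be evaluated directly. The sketch below is a plain-Python evaluation of that formula. The wavelength, numerical aperture, photon detection rate and acquisition time are assumed values chosen for illustration; they are not taken from the figures of this paper.

```python
import math

def delta_loc(wavelength_nm, numerical_aperture, rate_per_s, duration_s):
    """Fundamental limit to the localization accuracy for an Airy profile (eq. 22), in nm."""
    n_photons = rate_per_s * duration_s  # Lambda_0 * (t - t_0)
    return wavelength_nm / (2.0 * math.pi * numerical_aperture * math.sqrt(n_photons))

# Assumed parameters: 520 nm light, n_a = 1.45, 3000 photons/s for 1 s.
d_loc = delta_loc(520.0, 1.45, 3000.0, 1.0)
print(f"delta_loc = {d_loc:.3f} nm")
```

The limit scales as the inverse square root of the expected photon count, so collecting four times as many photons halves δ^loc.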

Corollary 2 For d ∈ [0, ∞), let $δ_{d}^{sim}$ denote the 2D FREM for the simultaneous detection approach. For i = 1, 2, let Λ_i and q_i denote the photon detection rate and the image function of the i^th object, respectively.

1.
Assume that q₁(x, y) = q₂(x, y), (x, y) ∈ ℝ² and Λ₁(τ) = Λ₂(τ), τ ≥ t₀. Then ${lim}_{d \to 0} δ_{d}^{sim} = \infty$ .
2.
For i = 1, 2, assume that q_i is radially symmetric, i.e., there exists a q_i such that $q_{i} (x, y) ≔ q_{i} (\sqrt{x^{2} + y^{2}})$ , (x, y) ∈ ℝ². Then
$lim_{d \to \infty} δ_{d}^{sim} = \sqrt{{(δ_{rs, 1}^{loc})}^{2} + {(δ_{rs, 2}^{loc})}^{2}},$
where for i = 1, 2,
$δ_{rs, i}^{loc} ≔ \frac{1}{\sqrt{π κ_{i} \int_{t_{0}}^{t} Λ_{i} (τ) d τ}}$ with $κ_{i} ≔ \int_{0}^{\infty} \frac{1}{q_{i} (r)} {(\frac{\partial q_{i} (r)}{\partial r})}^{2} rdr .$ (23)
3.
Let δ^loc be given by eq. 22. For i = 1, 2, let q_i be an Airy profile that is given by eq. 21 and Λ₁(τ) = Λ₂(τ) = Λ₀, τ ≥ t₀. Then ${lim}_{d \to \infty} δ_{d}^{sim} = \sqrt{2} δ^{loc}$ .

Proof 1. By definition $δ_{d}^{sim} = \sqrt{I_{sim}^{- 1} (d)}$ , where $I_{sim}^{- 1} (d)$ is obtained by substituting $S_{sim}^{- 1} (θ_{c})$ (Corollary 1) in the transformation formula in eq. 2. When d → 0, we have x₀₁ → x₀₂ and y₀₁ → y₀₂, and from Remark 2 it immediately follows that S_sim(θ_c) is singular, where θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c and S_sim(θ_c) is given by eq. 5. From this the result follows.

2. Without loss of generality, we assume that d → ∞ implies x₀₂ → ∞. For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, consider the term S_sim(θ_c) which is given by eq. 5. Using Theorem 3 and Lemma 3 (see Appendix), we have

lim_{x_{02} \to \infty} S_{sim} (θ_{c}) = [\begin{matrix} Q_{1} & 0 \\ 0 & Q_{2} \end{matrix}] = [\begin{matrix} \frac{1}{{(δ_{rs, 1}^{loc})}^{2}} 1_{2 \times 2} & 0 \\ 0 & \frac{1}{{(δ_{rs, 2}^{loc})}^{2}} 1_{2 \times 2} \end{matrix}],

(24)

where Q_i, i = 1, 2, denotes the Fisher information matrix for the localization accuracy problem (eq. 20) and 1_2×2 denotes the 2 × 2 identity matrix. Define Δ_x ≔ x₀₂ − x₀₁ and Δ_y ≔ y₀₂ − y₀₁. Consider the term

lim_{x_{02} \to \infty} {(\frac{\partial d}{\partial θ_{c}})}^{T} = lim_{x_{02} \to \infty} \frac{1}{d} (\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix}) = lim_{x_{02} \to \infty} \frac{1}{\sqrt{Δ_{x}^{2} + Δ_{y}^{2}}} (\begin{matrix} - Δ_{x} \\ - Δ_{y} \\ Δ_{x} \\ Δ_{y} \end{matrix}) = lim_{x_{02} \to \infty} (\begin{matrix} - \frac{1}{\sqrt{1 + \frac{Δ_{y}^{2}}{Δ_{x}^{2}}}} \\ - \frac{1}{\sqrt{\frac{Δ_{x}^{2}}{Δ_{y}^{2}} + 1}} \\ \frac{1}{\sqrt{1 + \frac{Δ_{y}^{2}}{Δ_{x}^{2}}}} \\ \frac{1}{\sqrt{\frac{Δ_{x}^{2}}{Δ_{y}^{2}} + 1}} \end{matrix}) = (\begin{matrix} - 1 \\ 0 \\ 1 \\ 0 \end{matrix}) .

(25)

Using eqs. 24 and 25 in eq. 2 and taking the limit x₀₂ → ∞, we have

lim_{x_{02} \to \infty} I_{sim}^{- 1} (d) = (\begin{matrix} - 1 & 0 & 1 & 0 \end{matrix}) {[\begin{matrix} \frac{1}{{(δ_{rs, 1}^{loc})}^{2}} 1_{2 \times 2} & 0 \\ 0 & \frac{1}{{(δ_{rs, 2}^{loc})}^{2}} 1_{2 \times 2} \end{matrix}]}^{- 1} (\begin{matrix} - 1 \\ 0 \\ 1 \\ 0 \end{matrix}) = {(δ_{rs, 1}^{loc})}^{2} + {(δ_{rs, 2}^{loc})}^{2} .

From this the result follows.

3. The Airy profile given in eq. 21 is radially symmetric. Hence substituting for q_i and Λ_i, i = 1, 2, in eq. 23, we have $δ_{rs, 1}^{loc} = δ_{rs, 2}^{loc} = δ^{loc}$ and from this the result immediately follows.
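Eq. 23 and the large-distance limit in result 2 can be checked numerically for a simple radially symmetric profile. The sketch below assumes a 2D Gaussian image function of width `sigma` (a stand-in chosen because the answer is known in closed form, δ = sigma/√N; it is not the Airy profile of eq. 21), evaluates κ by trapezoidal integration, and combines the two localization limits as in result 2. The photon counts and `sigma` are illustrative assumptions.

```python
import math

def kappa(q, dq, r_max, n=20000):
    """Trapezoidal estimate of kappa = int_0^inf (dq/dr)^2 / q * r dr (eq. 23)."""
    h = r_max / n
    total = 0.0
    for k in range(n + 1):
        r = k * h
        w = 0.5 if k in (0, n) else 1.0
        total += w * dq(r) ** 2 / q(r) * r
    return total * h

def delta_rs(q, dq, n_photons, r_max):
    """Localization limit of eq. 23 for a radially symmetric profile q(r)."""
    return 1.0 / math.sqrt(math.pi * kappa(q, dq, r_max) * n_photons)

sigma = 80.0  # assumed Gaussian width in nm

def q(r):
    return math.exp(-0.5 * (r / sigma) ** 2) / (2.0 * math.pi * sigma ** 2)

def dq(r):
    return -r / sigma ** 2 * q(r)

n1, n2 = 3000.0, 1500.0  # assumed expected photon counts of the two objects
d1 = delta_rs(q, dq, n1, 10.0 * sigma)
d2 = delta_rs(q, dq, n2, 10.0 * sigma)

# Result 2 of Corollary 2: the limit of the 2D FREM as d -> infinity.
frem_inf = math.sqrt(d1 ** 2 + d2 ** 2)
print(d1, d2, frem_inf)
```

For equal photon counts the two limits coincide and the combination reduces to √2 times the single-object limit, which is the structure of result 3.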

4.1.1 Results

Here we numerically calculate the various quantities defined in Definitions 1–3. For this purpose, we assume the two point sources to be equidistant from the origin and to lie on a line segment that passes through the origin and subtends an angle of 45° with respect to the x-axis. We choose this specific configuration, since some of the calculated values (in particular, $δ_{x_{0 i}}^{sim}$ and $δ_{y_{0 i}}^{sim}$ , i = 1, 2) become equal, which simplifies the presentation of the results.

Fig. 2 shows the behavior of the 2D FREM $δ_{d}^{sim}$ as a function of the distance of separation. The figure also shows the limit to the accuracy of x₀₁ and x₀₂ for the simultaneous detection approach, i.e., $δ_{x_{01}}^{sim}$ and $δ_{x_{02}}^{sim}$ , respectively (the results for y₀₁ and y₀₂ are analogous), as well as the fundamental limit to the localization accuracy δ^loc (eq. 22). According to Rayleigh’s resolution criterion, two identical point sources are said to be resolved in a microscope if their distance of separation is greater than or equal to 0.61λ/n_a, where n_a denotes the numerical aperture of the microscope and λ denotes the wavelength of light emitted by the point sources. For the specific numerical values considered in Figure 2, Rayleigh’s resolution limit is ≈ 219 nm, and according to this criterion distances below 219 nm cannot be resolved. In contrast, in Figure 2 we see that the numerical value of the 2D FREM $δ_{d}^{sim}$ is relatively small for a range of distances below the classical resolution limit of 219 nm. An immediate implication of this result is that if there exists an efficient estimator, then these distances can be determined with an accuracy as predicted by $δ_{d}^{sim}$ .
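The Rayleigh limit quoted above is a one-line calculation. In the sketch below the wavelength (520 nm) and numerical aperture (1.45) are assumptions: they are one combination that gives a value close to the ≈ 219 nm stated in the text, and the parameters actually used for Figure 2 are not restated in this section.

```python
def rayleigh_limit(wavelength_nm, numerical_aperture):
    """Rayleigh's resolution criterion, 0.61 * lambda / n_a, in nm."""
    return 0.61 * wavelength_nm / numerical_aperture

# Assumed values; other (wavelength, n_a) pairs give the same limit.
limit_nm = rayleigh_limit(520.0, 1.45)
print(f"Rayleigh limit = {limit_nm:.1f} nm")
```

Unlike the 2D FREM, this criterion depends only on the wavelength and the numerical aperture, and not on the number of detected photons.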

Note that as the distance of separation becomes very small, $δ_{d}^{sim}$ becomes numerically large thereby predicting poor accuracy in estimating the distance of separation. This is expected since under the assumptions of identical photon detection rates and image functions, when the distance d goes to zero the corresponding Fisher information matrix becomes singular and the 2D FREM $δ_{d}^{sim}$ becomes infinitely large (result 1 of Corollary 2). As the distance of separation increases, $δ_{d}^{sim}$ becomes smaller thereby predicting a relatively high accuracy in determining the distance between the two point sources. In particular, for large distances $δ_{d}^{sim}$ levels off at a constant value set by the fundamental limit to the localization accuracy δ^loc. This is expected, as it was shown in Theorem 3 that when d → ∞, the Fisher information matrix for the simultaneous detection approach reduces to an expression that is equivalent to two independent localization accuracy problems. For the specific image functions considered here, $δ_{d}^{sim} = \sqrt{2} δ^{loc}$ in the limit d → ∞ (result 3 of Corollary 2).

The results for $δ_{x_{01}}^{sim}$ and $δ_{x_{02}}^{sim}$ are analogous to those for $δ_{d}^{sim}$ . Note that although $δ_{x_{01}}^{sim} (δ_{x_{02}}^{sim})$ and δ^loc provide lower bounds to the accuracy with which the x-coordinate of a point source can be determined, their behaviors are very different. In particular, $δ_{x_{01}}^{sim}$ and $δ_{x_{02}}^{sim}$ depend on the distance and become infinitely large in the limit d → 0 (see Remark 2), whereas δ^loc is independent of the distance and remains finite for all values of d.

The above discussion raises the question of under what conditions $δ_{x_{01}}^{sim}$ and $δ_{x_{02}}^{sim}$ , and more importantly $δ_{d}^{sim}$ , will remain finite as the distance goes to zero. In the next Section, we investigate this problem by considering a special case of the simultaneous detection approach where we assume that one of the location coordinates is known. As we will see in Section 5.1, for this special case the limit to the accuracy of the distance d remains finite as d → 0 for the specific image profiles and photon detection rates considered in the present example.

5 Special case of the simultaneous detection approach - location of one of the objects is known

It has been shown experimentally that distances well below the classical resolution criteria (e.g., Rayleigh’s resolution criterion) can be resolved in a regular optical microscope when the location coordinates of one of the point sources are known a priori (Ram et al (2006a); Gordon et al (2004); Qu et al (2004)). For example, in a concrete experimental setting such a scenario arises when one wishes to study the interaction between a stationary object and a slow moving object. In many cases, the location coordinates of the stationary object can be determined a priori (for instance from an image that only contains the stationary object) and therefore can be assumed to be known. An important question then arises as to how accurately the distance between the two objects can be determined when the location of one of the objects is known. Here we address this problem by deriving the Fisher information matrix for this specific scenario.

For the present discussion, we assume that the acquired data consists of a pair of images, where one of the images contains photons from only one of the objects (for example, the stationary one) and the other image contains photons from both objects. Here, we assume that the location coordinates (x₀₁, y₀₁) of object 1 are determined from the first image and the location coordinates (x₀₂, y₀₂) of object 2 are determined from the second image. In the following Theorem, we derive the expression for the Fisher information matrix for the problem of estimating the location coordinates of the objects from such a pair of images. We assume that the photon detection rates of the objects are known. Further, we also assume the spatially invariant case (analogous to Section 3.2), where the photon distribution profile of the i^th object f_{θ_c,τ,i}, i = 1, 2, is expressed as a scaled and shifted version of the image of that object (see eq. 8).

As we will show, the Fisher information matrix reduces to that of two independent localization accuracy problems. We also show that the Fisher information matrix is invertible for all values of the location coordinates of the two objects including when the location coordinates are the same (i.e., when the distance equals zero).

Theorem 4 Let Θ_c ⊆ ℝ⁴ be open. For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, τ ≥ t₀ and i = 1, 2, let f_{θ_c,τ,i} and Λ_i denote the photon distribution profile and the photon detection rate of the i^th object, respectively, where f_{θ_c,τ,i} is given by eq. 8. For θ_c ∈ Θ_c and τ ≥ t₀, let Λ(τ) ≔ Λ₁(τ) + Λ₂(τ), and f_{θ_c,τ} be given by eq. 10. For θ_c ∈ Θ_c, let 𝒢₁(Λ₁, {f_{θ_c,τ,1}}_τ≥t₀, ℝ²) and 𝒢₂(Λ, {f_{θ_c,τ}}_τ≥t₀, ℝ²) denote two independent image detection processes.

1.
Then for the two independent image detection processes 𝒢₁ and 𝒢₂, the Fisher information matrix of the spatial component corresponding to the acquisition time interval [t₀, t] for the special case of the simultaneous detection approach is given by
$S_{sim, sp} (θ_{c}) ≔ [\begin{matrix} Q_{1} & 0 \\ 0 & K_{22} (θ_{c}) \end{matrix}], θ_{c} \in Θ_{c},$ (26)
where Q₁ is given by eq. 20 and K₂₂(θ_c), θ_c ∈ Θ_c, is given by eq. 12.
2.
For θ_c ∈ Θ_c, S_sim,sp(θ_c) is invertible including when (x₀₁, y₀₁) = (x₀₂, y₀₂).

Proof See Section A.4 in Appendix for proof.

From result 1 of the above Theorem we see that the Fisher information matrix for the special case of the simultaneous detection approach is a block diagonal matrix. The first term Q₁ (eq. 20) in the leading diagonal pertains to the Fisher information matrix for the localization accuracy problem corresponding to the location coordinates (x₀₁, y₀₁) of object 1. The second term K₂₂(θ_c) (eq. 12) in the leading diagonal is a component of the Fisher information matrix for the spatially invariant case of the simultaneous detection approach in which both location coordinates are unknown and are determined from a single image (Theorem 2). Importantly, this component K₂₂(θ_c) is equivalent to the Fisher information matrix of the localization accuracy problem for the location coordinates (x₀₂, y₀₂) of object 2 in the presence of an extraneous background signal given by Λ₁q₁, where Λ₁ and q₁ denote the photon detection rate and the image function of object 1, respectively. In this context, we would like to note that the effect of an extraneous background term on the localization accuracy problem has been extensively investigated before (Ram et al (2006b); Ober et al (2004b)).

In result 2 of the above Theorem, we showed that the Fisher information matrix is, in general, invertible for all values of the location coordinates of the two objects including when (x₀₁, y₀₁) = (x₀₂, y₀₂), i.e., when the distance between the two objects is zero. This is in contrast to the result obtained in Section 3.2, where we saw that the Fisher information matrix for the simultaneous detection approach becomes singular and therefore non-invertible when the distance is zero (assuming identical image profiles and photon detection rates; see Remark 2).

This brings out a very important aspect of the analyses carried out here. Specifically, the a priori knowledge of the location coordinates of one of the objects reduces the Fisher information matrix of the distance estimation problem to that of two independent localization accuracy problems. More importantly, it also removes the singularity of the Fisher information matrix when the distance is zero. The above result also explains the prior experimental observations of measuring nanometer scale distances well below the classical resolution criteria in a regular optical microscope when a priori information regarding the location coordinates of one of the objects is known (Gordon et al (2004); Qu et al (2004)). In the next section, we further illustrate this through a specific example where we show that the CRLB of the distance parameter remains finite when the distance goes to zero.

We note that in the derivation of the above theorem, the Fisher information matrix for the second image only depends on the location coordinates of object 2, since it is assumed that the location of object 1 is known. However, since the second image contains signal from both objects, it also provides information about the location of object 1. Hence this can be used to improve the location estimates of object 1. A detailed analysis of such a scenario has been previously carried out by us, where, analogous to Theorem 4, we derived the Fisher information matrix for a pair of images but considered the case where both location coordinates were estimated from the second image (Ram et al (2006a)).

Remark 3 The results derived in the above Theorem pertain to the Fisher information matrix for the spatial component θ_f (=θ_c) of the unknown parameter vector θ, and we have assumed the temporal component θ_Λ of θ (and in turn the photon detection rates of the objects) to be known. The above results will hold even if the temporal component θ_Λ is unknown provided the photon detection rates of the objects are related to one another through a scalar function β, i.e. Λ_θ,1(τ) = β(τ)Λ_θ,2(τ), θ ∈ Θ and τ ≥ t₀. This is due to the fact that under this condition, the Fisher information matrices for the spatial component θ_f and the temporal component θ_Λ are decoupled (see result 2 of Theorem 1). It should be pointed out that the assumption Λ_θ,1 = βΛ_θ,2, θ ∈ Θ is satisfied in many practical situations since the photon detection rates of the objects are typically assumed to be the same (i.e., β = 1).

5.1 Example 2

We now illustrate the results derived in the previous section by considering a specific image profile. Analogous to Section 4.1, we assume the image functions q₁ and q₂ to be identical Airy profiles given by eq. 21 and set the photon detection rates to be constant and equal, i.e., Λ₁(τ) = Λ₂(τ) = Λ₀, τ ≥ t₀. We also define the 2D FREM for the special case of the simultaneous detection approach, which we denote as $δ_{d}^{sim, sp}$ . In Corollary 3, we consider two limiting cases of the distance parameter d, i.e., d → 0 and d → ∞ and derive analytical expressions for $δ_{d}^{sim, sp}$ for the specific image functions and photon detection rates considered here.

Definition 4 The 2D FREM for the special case of the simultaneous detection approach is defined as $δ_{d}^{sim, sp} ≔ \sqrt{I_{sim, sp}^{- 1} (d)}, d \in [0, \infty)$ , where $I_{sim, sp}^{- 1} (d)$ is obtained by substituting $S_{sim, sp}^{- 1} (θ_{c})$ (result 2 of Theorem 4) in the transformation formula given by eq. 2.

Corollary 3 For d ∈ [0, ∞), let $δ_{d}^{sim, sp}$ denote the 2D FREM for the special case of the simultaneous detection approach. For i = 1, 2, let Λ_i and q_i denote the photon detection rate and the image function of the i^th object, respectively. Assume that Λ₁(τ) = Λ₂(τ), τ ≥ t₀, q₁(x, y) = q₂(x, y), (x, y) ∈ ℝ², and that q₁ is radially symmetric, i.e., there exists a q₁ such that $q_{1} (x, y) = q_{1} (\sqrt{x^{2} + y^{2}})$ for (x, y) ∈ ℝ².

Then

1.
${lim}_{d \to 0} δ_{d}^{sim, sp} = \sqrt{3} δ_{rs, 1}^{loc}$
2.
${lim}_{d \to \infty} δ_{d}^{sim, sp} = \sqrt{2} δ_{rs, 1}^{loc}$ , where $δ_{rs, 1}^{loc}$ is given by eq. 23.
3.
Let q₁ be an Airy profile that is given by eq. 21 and Λ₁(τ) = Λ₀, τ ≥ t₀. Then
$lim_{d \to 0} δ_{d}^{sim, sp} = \sqrt{3} δ^{loc}, lim_{d \to \infty} δ_{d}^{sim, sp} = \sqrt{2} δ^{loc},$
where δ^loc is given by eq. 22.

Proof 1. By definition, q₁ is radially symmetric and hence from Lemma 3 it follows that $Q_{1}^{- 1} = {(δ_{rs, 1}^{loc})}^{2} 1_{2 \times 2}$ , where Q₁ denotes the Fisher information matrix for the localization accuracy problem of object 1 (eq. 20) and 1_2×2 denotes the 2 × 2 identity matrix. Using this and eq. 2, we get

{(δ_{d}^{sim, sp})}^{2} = I_{sim, sp}^{- 1} (d) ≔ \frac{\partial d}{\partial θ_{c}} S_{sim, sp}^{- 1} (θ_{c}) {(\frac{\partial d}{\partial θ_{c}})}^{T} = \frac{1}{d^{2}} {(\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix})}^{T} (\begin{matrix} Q_{1}^{- 1} & 0 \\ 0 & K_{22}^{- 1} (θ_{c}) \end{matrix}) (\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix}) = {(δ_{rs, 1}^{loc})}^{2} + \frac{1}{d^{2}} (\begin{matrix} x_{02} - x_{01} & y_{02} - y_{01} \end{matrix}) K_{22}^{- 1} (θ_{c}) (\begin{matrix} x_{02} - x_{01} \\ y_{02} - y_{01} \end{matrix}), d \in [0, \infty),

(27)

where S_sim,sp(θ_c) is given by eq. 26 and K₂₂(θ_c) is given by eq. 12. Because the photon detection rates and the image functions of the objects are assumed to be identical, we have Q₁ = Q₂, where Q_i, i = 1, 2, is given by eq. 20. Using this, we have

lim_{x_{01} \to x_{02}, y_{01} \to y_{02}} K_{22} (θ_{c}) = lim_{x_{01} \to x_{02}, y_{01} \to y_{02}} \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{2}^{2} (τ)}{Λ_{1} (τ) q_{1} (x - x_{01}, y - y_{01}) + Λ_{2} (τ) q_{2} (x - x_{02}, y - y_{02})} \times (\begin{matrix} {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x})}^{2} & \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} \\ \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} & {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y})}^{2} \end{matrix}) dxdyd τ = \frac{Λ_{0} (t - t_{0})}{2} \int_{ℝ^{2}} \frac{1}{q_{2} (x - x_{02}, y - y_{02})} \times (\begin{matrix} {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x})}^{2} & \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} \\ \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} & {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y})}^{2} \end{matrix}) dxdy = \frac{1}{2} Q_{2} = \frac{1}{2} Q_{1} = \frac{1}{2 {(δ_{rs, 1}^{loc})}^{2}} 1_{2 \times 2},

(28)

where we have used the shift-invariant property of Lebesgue integrals in the penultimate step. Define Δ_x ≔ x₀₂ − x₀₁ and Δ_y ≔ y₀₂ − y₀₁. Consider the term

lim_{x_{01} \to x_{02}, y_{01} \to y_{02}} \frac{1}{d} (\begin{matrix} x_{02} - x_{01} \\ y_{02} - y_{01} \end{matrix}) = lim_{x_{01} \to x_{02}} lim_{y_{01} \to y_{02}} (\begin{matrix} \frac{1}{\sqrt{1 + \frac{Δ_{y}^{2}}{Δ_{x}^{2}}}} \\ \frac{1}{\sqrt{\frac{Δ_{x}^{2}}{Δ_{y}^{2}} + 1}} \end{matrix}) = (\begin{matrix} 1 \\ 0 \end{matrix}) .

(29)

Substituting eqs. 28 and 29 in eq. 27 and taking the limit d → 0, we get

lim_{d \to 0} {(δ_{d}^{sim, sp})}^{2} = lim_{d \to 0} I_{sim, sp}^{- 1} (d) = lim_{x_{01} \to x_{02}, y_{01} \to y_{02}} I_{sim, sp}^{- 1} (d) = {(δ_{rs, 1}^{loc})}^{2} + lim_{x_{01} \to x_{02}, y_{01} \to y_{02}} \frac{1}{d^{2}} (\begin{matrix} x_{02} - x_{01} & y_{02} - y_{01} \end{matrix}) K_{22}^{- 1} (θ_{c}) (\begin{matrix} x_{02} - x_{01} \\ y_{02} - y_{01} \end{matrix}) = {(δ_{rs, 1}^{loc})}^{2} + 2 {(δ_{rs, 1}^{loc})}^{2} (\begin{matrix} 1 & 0 \end{matrix}) 1_{2 \times 2} (\begin{matrix} 1 \\ 0 \end{matrix}) = 3 {(δ_{rs, 1}^{loc})}^{2} .

From this the result immediately follows.

2. Proof is analogous to that of result 2 of Corollary 2.

3. The Airy profile given in eq. 21 is radially symmetric. Substituting for q₁ and Λ₁ in results 1 and 2 of this Corollary, we get the desired results.
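The d → 0 limit in result 1 can be reproduced with a few lines of arithmetic. The sketch below evaluates the quadratic form of eq. 27 with the limiting blocks Q₁⁻¹ = (δ_rs,1^loc)² 1_2×2 and K₂₂⁻¹ = 2(δ_rs,1^loc)² 1_2×2 (from eq. 28). The angle `phi`, which parametrizes the direction along which the two locations approach each other, is an illustrative addition: eq. 29 uses one particular path, and since the limiting K₂₂⁻¹ is a multiple of the identity, the result does not depend on that choice.

```python
import math

def frem_sq_sp_at_zero(delta_rs, phi):
    """Limit of (delta_d^{sim,sp})^2 as d -> 0, approaching along direction phi."""
    # Unit gradient of d with respect to (x01, y01, x02, y02).
    g = [-math.cos(phi), -math.sin(phi), math.cos(phi), math.sin(phi)]
    # Diagonal of the limiting inverse Fisher information matrix:
    # Q1^{-1} = delta^2 * 1 and K22^{-1} = 2 * delta^2 * 1.
    diag = [delta_rs ** 2, delta_rs ** 2, 2.0 * delta_rs ** 2, 2.0 * delta_rs ** 2]
    return sum(gi * gi * di for gi, di in zip(g, diag))

# With delta_rs = 1 the limit should equal sqrt(3) for every direction.
values = [math.sqrt(frem_sq_sp_at_zero(1.0, math.radians(a)))
          for a in (0.0, 30.0, 45.0, 90.0)]
print(values)
```

The first term, (δ_rs,1^loc)², comes from localizing object 1 in the image that contains it alone, and the second, 2(δ_rs,1^loc)², from localizing object 2 against the signal of object 1, which at d = 0 halves the Fisher information available for object 2.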

Figure 3 shows the 2D FREM $δ_{d}^{sim, sp}$ as a function of the distance for the special case of the simultaneous detection approach, in which the location coordinates (x₀₁, y₀₁) of one of the objects are assumed to be known. The figure also shows the 2D FREM $δ_{d}^{sim}$ for the simultaneous detection approach when both location coordinates are assumed to be unknown (Section 1), and, as a reference, the fundamental limit to the localization accuracy δ^loc (eq. 22). From the figure we see that $δ_{d}^{sim}$ becomes infinitely large as d → 0. In contrast, $δ_{d}^{sim, sp}$ first increases, then decreases, and remains finite even when d = 0. In particular, for the specific image functions and photon detection rates considered here, $δ_{d}^{sim, sp} = \sqrt{3} δ^{loc}$ when d = 0 (result 1 of Corollary 3). An immediate implication of this result is that if the location coordinates of one of the objects are known, then it is possible to determine very small (nanometer scale) distances with relatively high accuracy in an optical microscope. As the distance of separation increases, the 2D FREM $δ_{d}^{sim, sp}$ behaves analogously to $δ_{d}^{sim}$ . In particular, $δ_{d}^{sim, sp} = \sqrt{2} δ^{loc}$ when the distance becomes infinitely large. This implies that for very large distances of separation, the limit to the accuracy of estimating the distance is a constant that is independent of the distance.

Fig. 3. Behavior of the 2D FREM $δ_{d}^{sim, sp}$ for the special case of the simultaneous detection approach. Panel A shows $δ_{d}^{sim, sp}$ for a distance range of 10 – 300 nm for the special case of the simultaneous detection approach when the location coordinates (x₀₁, y₀₁) of object 1 are known (⊳). The panel also shows the 2D FREM $δ_{d}^{sim}$ for the simultaneous detection approach when both location coordinates are unknown (∘). Panel B shows the same as Panel A for a distance range of 1 – 50 nm. In all the panels, (—) denotes the fundamental limit to the localization accuracy δ^loc (eq. 22), and in Panel A the vertical dashed line denotes Rayleigh’s resolution limit. The numerical values used to generate the above plots are identical to those used in Figure 2.

We would like to point out that the analyses carried out in this Section have implications in a broader context of dealing with a singular Fisher information matrix, which represents a significant complication in the analysis of parameter estimation problems (e.g., see Stoica and Marzetta (2001)). In particular our results illustrate how a priori information can be used to eliminate the singularity of the Fisher information matrix. It is important to note that the choice of a priori information intimately depends on the specifics of the experimental design, i.e., how the data is captured. This further underscores the importance of carrying out a rigorous analysis of the Fisher information matrix, as it provides the necessary insight into choosing the most appropriate experimental approach from the point of view of obtaining the best accuracy in estimating the parameters of interest.

6 Fisher information matrix for the separate detection approach

We next consider the case where the location coordinates of the two objects are independently estimated from two separate images. Such a scenario arises in a class of experimental techniques in which the photon emissions from the objects are temporally separated (e.g., stochastic photoactivation (Betzig et al (2006); Rust et al (2006); Hess et al (2006)) and blinking (Lidke et al (2005); Lagerholm et al (2006))). In the following Theorem, we derive an analytical expression of the Fisher information matrix for the separate detection approach. Here, we assume the acquired data to consist of a pair of images, where the first image contains signal from only object 1 and the second image contains signal from only object 2. As we will see, the Fisher information matrix for the separate detection approach reduces to that of two independent localization accuracy problems.

Theorem 5 Let Θ_c ⊆ ℝ⁴ be open. For θ_c ∈ Θ_c, τ ≥ t₀ and i = 1, 2, let Λ_i and f_{θ_c,τ,i} denote the photon detection rate and the photon distribution profile of the i^th object, respectively, where f_{θ_c,τ,i} is given by eq. 8. For θ_c ∈ Θ_c, let 𝒢₁(Λ₁, {f_{θ_c,τ,1}}_τ≥t₀, ℝ²) and 𝒢₂(Λ₂, {f_{θ_c,τ,2}}_τ≥t₀, ℝ²) denote two independent image detection processes.

1. Then for the two independent image detection processes 𝒢₁ and 𝒢₂, the Fisher information matrix of the spatial component corresponding to the acquisition time interval [t₀, t] for the separate detection approach is given by
$S_{sep} (θ_{c}) = [\begin{matrix} Q_{1} & 0 \\ 0 & Q_{2} \end{matrix}], θ_{c} \in Θ_{c},$ (30)
where Q_i, i = 1, 2, is given in eq. 20.
2. For θ_c ∈ Θ_c, S_sep(θ_c) is invertible, including when (x₀₁, y₀₁) = (x₀₂, y₀₂).

Proof Proof is analogous to that of Theorem 4.

From the above Theorem, we see that the Fisher information matrix for the separate detection approach is block diagonal and is equivalent to two independent localization accuracy problems in the absence of any extraneous background signal. Note that the Fisher information matrix for the separate detection approach is independent of the location coordinates of the objects. This is in contrast to the simultaneous detection approach, where we saw that the Fisher information matrix depended on the location coordinates of the two objects (Theorem 2). In addition to this, for the simultaneous detection approach when both object coordinates are unknown the Fisher information matrix becomes block diagonal and reduces to that of two independent localization accuracy problems (in the absence of any extraneous background signal) only when the distance becomes infinitely large i.e., d → ∞ (Theorem 3).

6.1 Example 3

To illustrate the result derived in this section, we consider a specific image function. Analogous to Sections 4.1 and 5.1, we assume the image functions q₁ and q₂ to be identical Airy profiles given by eq. 21 and set the photon detection rates to be constant and equal, i.e., Λ₁(τ) = Λ₂(τ) = Λ₀, τ ≥ t₀. We also define the 2D FREM for the separate detection approach $δ_{d}^{sep}$ . Then in Corollary 4, we derive an analytical expression for $δ_{d}^{sep}$ for the specific image functions and photon detection rates considered here.

Definition 5 The 2D FREM for the separate detection approach is defined as $δ_{d}^{sep} ≔ \sqrt{I_{sep}^{- 1} (d)}$ , d ∈ [0, ∞), where $I_{sep}^{- 1} (d)$ is obtained by substituting $S_{sep}^{- 1} (θ_{c})$ (result 2 of Theorem 5) in the transformation formula given by eq. 2.

Corollary 4 For d ∈ [0, ∞), let $δ_{d}^{sep}$ denote the 2D FREM for the separate detection approach. For i = 1, 2, let Λ_i and q_i denote the photon detection rate and the image function of the i^th object, respectively.

1. For i = 1, 2, assume that q_i is radially symmetric, i.e., there exists a q_i such that $q_{i} (x, y) ≔ q_{i} (\sqrt{x^{2} + y^{2}})$ , (x, y) ∈ ℝ². Then for d ∈ [0, ∞), we have
$δ_{d}^{sep} = \sqrt{{(δ_{rs, 1}^{loc})}^{2} + {(δ_{rs, 2}^{loc})}^{2}},$
where for i = 1, 2, $δ_{rs, i}^{loc}$ is given by eq. 23.
2. For i = 1, 2, let q_i be an Airy profile that is given by eq. 21 and Λ₁(τ) = Λ₂(τ) = Λ₀, τ ≥ t₀. Then for d ∈ [0, ∞), $δ_{d}^{sep} = \sqrt{2} δ^{loc}$ , where δ^loc is given by eq. 22.

Proof 1. Using eq. 2 and Lemma 3, we have

{(δ_{d}^{sep})}^{2} = I_{sep}^{- 1} (d) ≔ \frac{\partial d}{\partial θ_{c}} S_{sep}^{- 1} (θ_{c}) {(\frac{\partial d}{\partial θ_{c}})}^{T} = \frac{1}{d^{2}} {(\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix})}^{T} (\begin{matrix} Q_{1}^{- 1} & 0 \\ 0 & Q_{2}^{- 1} \end{matrix}) (\begin{matrix} - (x_{02} - x_{01}) \\ - (y_{02} - y_{01}) \\ (x_{02} - x_{01}) \\ (y_{02} - y_{01}) \end{matrix}) = \frac{1}{d^{2}} {(\begin{matrix} - Δ_{x} \\ - Δ_{y} \\ Δ_{x} \\ Δ_{y} \end{matrix})}^{T} (\begin{matrix} {(δ_{rs, 1}^{loc})}^{2} & 0 & 0 & 0 \\ 0 & {(δ_{rs, 1}^{loc})}^{2} & 0 & 0 \\ 0 & 0 & {(δ_{rs, 2}^{loc})}^{2} & 0 \\ 0 & 0 & 0 & {(δ_{rs, 2}^{loc})}^{2} \end{matrix}) (\begin{matrix} - Δ_{x} \\ - Δ_{y} \\ Δ_{x} \\ Δ_{y} \end{matrix}) = \frac{1}{d^{2}} ((Δ_{x}^{2} + Δ_{y}^{2}) {(δ_{rs, 1}^{loc})}^{2} + (Δ_{x}^{2} + Δ_{y}^{2}) {(δ_{rs, 2}^{loc})}^{2}) = {(δ_{rs, 1}^{loc})}^{2} + {(δ_{rs, 2}^{loc})}^{2}, d \in [0, \infty),

where Δ_x = x₀₂ − x₀₁ and Δ_y = y₀₂ − y₀₁. From this the result immediately follows.

2. The Airy profile given in eq. 21 is radially symmetric. Hence substituting for Λ_i and q_i, i = 1, 2, in result 1 of this Corollary, the result immediately follows.

From the above result we see that the 2D FREM for the separate detection approach $δ_{d}^{sep}$ is a constant and is independent of the distance of separation, if the image functions of the objects are radially symmetric. More specifically, when the image functions are assumed to be Airy profiles (eq. 21), then the 2D FREM $δ_{d}^{sep}$ is $\sqrt{2}$ times the fundamental limit to the localization accuracy δ^loc. This is in contrast to the simultaneous detection approach where the 2D FREM $δ_{d}^{sim}$ (as well as $δ_{d}^{sim, sp}$ ) depends on the distance and only in the limiting case when d becomes infinitely large, $δ_{d}^{sim} = \sqrt{2} δ^{loc}$ (Corollary 2). An immediate implication of the above result is that, if there exists an efficient estimator of the distance for the separate detection approach, then all distances can be determined with the same level of accuracy when the image profiles are radially symmetric.
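The distance independence stated in Corollary 4 can be checked numerically. The following is a minimal sketch, not part of the original analysis: it evaluates the transformation formula of eq. 2 with the block-diagonal inverse of S_sep for several randomly placed object pairs. The function name `frem_sep` and the numerical values of the two localization limits are hypothetical and chosen only for illustration.

```python
import numpy as np

def frem_sep(theta_c, delta1, delta2):
    """2D FREM for the separate detection approach via the transformation formula.

    theta_c = (x01, y01, x02, y02); delta1 and delta2 are the localization
    accuracy limits of the two objects (radially symmetric image functions).
    """
    x01, y01, x02, y02 = theta_c
    dx, dy = x02 - x01, y02 - y01
    d = np.hypot(dx, dy)
    # Jacobian of d with respect to theta_c (well defined for d > 0)
    jac = np.array([-dx, -dy, dx, dy]) / d
    # Inverse of the block-diagonal S_sep when Q_i = 1_{2x2} / delta_i^2 (Lemma 3)
    s_inv = np.diag([delta1**2, delta1**2, delta2**2, delta2**2])
    return float(np.sqrt(jac @ s_inv @ jac))

rng = np.random.default_rng(0)
delta1, delta2 = 1.0, 1.5  # hypothetical localization limits (nm)
values = [frem_sep(rng.uniform(-500, 500, size=4), delta1, delta2) for _ in range(5)]
expected = np.hypot(delta1, delta2)  # sqrt(delta1^2 + delta2^2)
```

Every entry of `values` agrees with `expected` up to floating point error, whatever the positions drawn, which is the content of result 1 of the Corollary.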

7 Simulations

In the previous sections we investigated the Fisher information matrix of the distance d and calculated the 2D FREM for different experimental approaches. An important question then arises as to whether, for a given experimental approach, there exists an unbiased estimator that can attain the corresponding 2D FREM. In this section we address this question: we use the Maximum Likelihood (ML) estimator to determine the distance d from simulated data and compare its performance (i.e., standard deviation) to the 2D FREM for the different experimental approaches. We consider all three approaches, i.e., the simultaneous detection approach, the special case of the simultaneous detection approach when one of the object locations is known, and the separate detection approach. We generate the acquired data through Monte-Carlo simulations, which are discussed below. Here, we consider the data generation process for an ideal (non-pixelated) detector, where the acquired data consists of the spatial coordinates of the detected photons. We then use the maximum likelihood estimation algorithm on the simulated data to estimate the location coordinates of the objects, and from this we deduce the distance. Table 1 lists the standard deviations of the distance estimates for the different experimental approaches considered here. As we will see, the ML estimator is unbiased and attains the 2D FREM for a range of distances when the sample size is sufficiently large.

Table 1.

Results of the maximum likelihood estimator of the distance for the different experimental approaches considered here. Table A shows the results for the simultaneous detection approach. Table B shows the results for the special case of the simultaneous detection approach, where one of the location coordinates is independently determined and is assumed to be known. Table C shows the results of the separate detection approach. The numerical values used to generate the data are identical to those used in Fig. 2. For all the data sets, the mean and standard deviation are obtained from 2000 maximum likelihood estimates of the distance.

A. Simultaneous detection approach
Data set #	True value of distance (nm)	Mean of distance estimates (nm)	Std. dev. of distance estimates (nm)	Resolution measure $δ_{d}^{sim}$ (nm)
1	10	10.22	5.87	5.89
2	20	20.01	4.14	4.23
3	50	49.99	2.67	2.65
4	100	99.99	2.14	2.12
5	200	200.06	1.97	1.93
6	500	500	1.64	1.68
B. Special case of the simultaneous detection approach
Data set #	True value of distance (nm)	Mean of distance estimates (nm)	Std. dev. of distance estimates (nm)	Resolution measure $δ_{d}^{sim, sp}$ (nm)
1	10	10.15	1.56	1.81
2	20	20.03	1.68	1.82
3	50	50.03	1.85	1.85
4	100	100.02	1.91	1.91
5	200	200.03	1.78	1.74
6	500	500.01	1.56	1.6
C. Separate detection approach
Data set #	True value of distance (nm)	Mean of distance estimates (nm)	Std. dev. of distance estimates (nm)	Resolution measure $δ_{d}^{sep}$ (nm)
1	10	10.15	1.48	1.47
2	20	20.02	1.49	1.47
3	50	50.04	1.50	1.47
4	100	100.01	1.48	1.47
5	200	200.03	1.50	1.47
6	500	500.03	1.45	1.47


7.1 Data simulation

We consider the two objects to be identical point sources. We set the photon detection rates of the two objects to be equal and constant, i.e. Λ_θ,1(τ) ≔ Λ₀, τ ≥ t₀ and Λ_θ,2(τ) ≔ Λ₀, τ ≥ t₀ and assume the image functions q₁ and q₂ to be identical Airy profiles given by eq. 21. We generate a sequence of images {𝒥_θ,1, 𝒥_θ,2, …, 𝒥_{θ,N_max}}, where N_max denotes the total number of images. For k = 1, …, N_max, the k^th image is given by 𝒥_θ,k ≔ {𝒥_θ,1,k, 𝒥_θ,2,k}, where

𝒥_{θ, i, k} ≔ {(x_{1}^{i, k}, y_{1}^{i, k}), (x_{2}^{i, k}, y_{2}^{i, k}), \dots, (x_{N_{i, k}}^{i, k}, y_{N_{i, k}}^{i, k})}, i = 1, 2, k = 1, \dots, N_{max},

(31)

denotes the signal from the i^th object in the k^th image for k = 1, …, N_max and i = 1, 2. In the above equation, N_i,k denotes the number of detected photons from the i^th object in the k^th image for i = 1, 2 and k = 1, …, N_max, and is a realization of the Poisson random variable with mean Λ₀(t − t₀). The sequence ${(x_{m}^{i, k}, y_{m}^{i, k}); m = 1, \dots, N_{i, k}}$ denotes the spatial coordinates of the detected photons from the i^th object in the k^th image for i = 1, 2 and k = 1, …, N_max, and is a realization of N_i,k random variables with density f_{θ_c,τ,i} given by eq. 8, which is generated by using a method described in Ober et al (2004b).
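The data generation step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure used to produce Table 1: a symmetric 2D Gaussian stands in for the Airy profile of eq. 21 (the sampling method of Ober et al (2004b) is not reproduced here), the magnification is taken to be 1, and the function name `simulate_image` and all numerical values are hypothetical.

```python
import numpy as np

def simulate_image(centers, rate, exposure, sigma, rng):
    """Return one image as a list of photon coordinate arrays, one per object.

    centers  : sequence of (x0, y0) object locations (nm)
    rate     : constant photon detection rate Lambda_0 (photons/s)
    exposure : acquisition time t - t0 (s)
    sigma    : width of the stand-in Gaussian profile (nm); an assumption,
               the paper uses an Airy profile instead
    """
    image = []
    for x0, y0 in centers:
        # Number of detected photons: Poisson with mean Lambda_0 * (t - t0)
        n_photons = rng.poisson(rate * exposure)
        # Photon positions: i.i.d. draws from the profile centered on the object
        coords = rng.normal(loc=(x0, y0), scale=sigma, size=(n_photons, 2))
        image.append(coords)
    return image

rng = np.random.default_rng(1)
# Hypothetical values: two objects 50 nm apart, 3000 expected photons each
image = simulate_image([(0.0, 0.0), (50.0, 0.0)], rate=3000.0, exposure=1.0,
                       sigma=80.0, rng=rng)
```

Each array in `image` plays the role of one set 𝒥_θ,i,k in eq. 31. For the simultaneous detection approach the two arrays would be pooled into a single image, whereas for the separate detection approach they are kept as two images.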

7.2 Maximum likelihood estimator

For a general parameter estimation problem, the maximum likelihood estimator can be written as argmax_θ ln(ℒ(θ | 𝒵)), where 𝒵 denotes the data and ℒ(θ | ·) denotes the likelihood function. For the simultaneous detection approach, the acquired data pertaining to the k^th image is given by 𝒵 = 𝒥_θ,k = {𝒥_θ,1,k, 𝒥_θ,2,k}, k = 1, …, N_max, where 𝒥_θ,i,k is defined in eq. 31 and θ = θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ ℝ⁴.

For the special case of the simultaneous detection approach when the location coordinates of one of the objects are known, the acquired data consists of a pair of images {𝒵₁, 𝒵₂}. We assume 𝒵₁ to be the image that contains the signal from object 1, i.e., 𝒵₁ = 𝒥_θ₁,1,k, and 𝒵₂ to be the image that contains the signal from both objects, i.e., 𝒵₂ = {𝒥_θ₁,1,k, 𝒥_θ₂,2,k}, where 𝒥 is defined in eq. 31, θ_i = (x_0i, y_0i) ∈ ℝ², i = 1, 2 and k = 1, …, N_max. Here, we carry out two independent ML estimations, one on each image, i.e., argmax_θ₁ ln(ℒ(θ₁ | 𝒵₁)) and argmax_θ₂ ln(ℒ(θ₂ | 𝒵₂, θ̂₁)). Note that while carrying out the maximum likelihood estimation with the second image 𝒵₂, we set the value of θ₁ to be equal to θ̂₁, where θ̂₁ denotes the maximum likelihood estimate of θ₁, which is determined from the first image.

For the separate detection approach, the acquired data consists of a pair of images {𝒵₁, 𝒵₂}, each of which contains the image of only one of the objects. Here, we have 𝒵₁ = 𝒥_θ₁,1,k and 𝒵₂ = 𝒥_θ₂,2,k, where 𝒥 is defined in eq. 31 for θ_i = (x_0i, y_0i) ∈ ℝ², i = 1, 2 and k = 1, …, N_max. For this approach, we carry out independent ML estimations on each image, i.e., argmax_θ₁ ln(ℒ(θ₁ | 𝒵₁)) and argmax_θ₂ ln(ℒ(θ₂ | 𝒵₂)).

In all three imaging scenarios, the ML estimates are determined computationally by using a gradient-based optimization algorithm (fminunc) in the MATLAB programming language.
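To illustrate the estimation procedure for the separate detection approach, the following sketch runs a small Monte-Carlo experiment. It is an illustration under stated assumptions rather than the implementation used for Table 1: a symmetric 2D Gaussian of width σ stands in for the Airy profile, in which case the ML location estimate is simply the sample mean of the photon coordinates and no numerical optimizer is needed, and the localization limit is σ/√N for N expected photons. All function names and numerical values are hypothetical.

```python
import numpy as np

def ml_location_gaussian(coords):
    """ML estimate of (x0, y0) for a symmetric Gaussian profile: the sample mean."""
    return coords.mean(axis=0)

def monte_carlo_distance(d_true, n_mean, sigma, n_trials, rng):
    """Estimate d from pairs of separately acquired images, n_trials times."""
    estimates = np.empty(n_trials)
    for k in range(n_trials):
        locs = []
        for center in ((0.0, 0.0), (d_true, 0.0)):
            n = max(rng.poisson(n_mean), 1)  # guard against an empty image
            coords = rng.normal(loc=center, scale=sigma, size=(n, 2))
            locs.append(ml_location_gaussian(coords))
        # Distance deduced from the two independent location estimates
        estimates[k] = np.linalg.norm(locs[1] - locs[0])
    return estimates

rng = np.random.default_rng(2)
sigma, n_mean, d_true = 80.0, 3000, 200.0  # hypothetical values (nm, photons, nm)
est = monte_carlo_distance(d_true, n_mean, sigma, n_trials=2000, rng=rng)
# sqrt(2) * delta_loc for the stand-in Gaussian profile
bound = np.sqrt(2.0) * sigma / np.sqrt(n_mean)
print(f"mean = {est.mean():.2f} nm, std = {est.std(ddof=1):.2f} nm, bound = {bound:.2f} nm")
```

For profiles without a closed-form estimator, such as the Airy profile, the sample mean would be replaced by a numerical maximization of the log-likelihood.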

7.3 Comparison of ML estimator performance to the 2D FREM

Table 1 shows the results of the ML estimator for the different experimental approaches considered here. The table lists mean and standard deviation of the distance estimates as well as the 2D FREM of the distance. From the table we see that for all the experimental approaches considered here, the mean value of the distance estimates is very close to the true value suggesting that the ML estimator is unbiased. Moreover, for a range of distances, the standard deviation of the distance is also consistently close to the 2D FREM thereby suggesting that the ML estimator is capable of achieving the theoretically best possible accuracy provided the sample size is sufficiently large. Note that the standard deviation of the ML estimates for the separate detection approach is almost a constant for a range of distances in agreement with the 2D FREM, which in turn shows that different distances can be estimated with the same level of accuracy.

A comparison of the standard deviations of the distance estimates (as well as the 2D FREMs) for the three approaches shows that for a range of distances considered in Table 1, the separate detection approach provides the best accuracy (i.e., the smallest 2D FREM/standard deviation) for determining the distance, followed by the special case of the simultaneous detection approach, and then followed by the simultaneous detection approach.
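As a rough consistency check, the tabulated 2D FREM values can be related to one another through the limiting results stated earlier. This is only arithmetic on the rounded entries of Table 1, so agreement is expected to within rounding. Using the constant value 1.47 nm listed for the separate detection approach (Table C) and the relations $δ_{d}^{sep} = \sqrt{2} δ^{loc}$ and $δ_{d}^{sim, sp} = \sqrt{3} δ^{loc}$ at d = 0:

```latex
\delta^{loc} = \frac{\delta_{d}^{sep}}{\sqrt{2}} \approx \frac{1.47}{1.414} \approx 1.04 \text{ nm},
\qquad
\lim_{d \to 0} \delta_{d}^{sim, sp} = \sqrt{3}\, \delta^{loc} \approx 1.80 \text{ nm}.
```

The second value is close to the 1.81 nm listed for the special case of the simultaneous detection approach at d = 10 nm (Table B).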

Acknowledgements

This research was supported in part by the National Institutes of Health (R01 GM085575) and by a postdoctoral fellowship to S. R. from the National Multiple Sclerosis Society (FG-1798-A-1).

Appendix

A Appendix

Definition 6 A function q : ℝ² → [0, ∞) is said to be an image function if the following properties are satisfied (see (Ram et al, 2006b, pg 37)).

1. ∫_ℝ² q(x, y)dxdy = 1,
2. $\frac{\partial q (x, y)}{\partial x}$ and $\frac{\partial q (x, y)}{\partial y}$ exist for every (x, y) ∈ ℝ²,
3. $\int_{ℝ^{2}} | \frac{\partial q (x, y)}{\partial x} | dxdy < \infty$ , $\int_{ℝ^{2}} | \frac{\partial q (x, y)}{\partial y} | dxdy < \infty$ , and
4. $\int_{ℝ^{2}} \frac{1}{q (x, y)} {(\frac{\partial q (x, y)}{\partial x})}^{2} dxdy < \infty$ , $\int_{ℝ^{2}} \frac{1}{q (x, y)} {(\frac{\partial q (x, y)}{\partial y})}^{2} dxdy < \infty$ , and $\int_{ℝ^{2}} \frac{1}{q (x, y)} \frac{\partial q (x, y)}{\partial x} \frac{\partial q (x, y)}{\partial y} dxdy < \infty$ .

Lemma 1 For θ = (θ_f, θ_Λ) ∈ Θ, τ ≥ t₀ and i = 1, 2, let f_θ,τ,i and Λ_θ,i denote the photon distribution profile and the photon detection rate of the i^th object, respectively, and let Λ_θ and f_θ,τ be given by eqs. 3 and 4, respectively. Let 𝒞 denote the detector.

1.
For θ ∈ Θ and τ ≥ t₀, if β(τ)Λ_θ,1 (τ) = Λ_θ,2 (τ) for some β(τ) ≥ 0 that is independent of θ, then $\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} = 0$ , θ ∈ Θ, τ ≥ t₀, r ∈ 𝒞.
2.
For θ ∈ Θ and τ ≥ t₀, if f_θ,τ,1(r) = f_θ,τ,2(r), r ∈ 𝒞, then $\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} = 0$ , θ ∈ Θ, τ ≥ t₀, r ∈ 𝒞.

Proof 1. For θ ∈ Θ, τ ≥ t₀ and i = 1, 2, let ε_θ,i(τ) = Λ_θ,i(τ)/Λ_θ(τ). Consider the term

\frac{\partial ε_{θ, 1} (τ)}{\partial θ_{Λ}} + \frac{\partial ε_{θ, 2} (τ)}{\partial θ_{Λ}} = \frac{Λ_{θ} (τ) \frac{\partial Λ_{θ, 1} (τ)}{\partial θ_{Λ}} - Λ_{θ, 1} (τ) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}}{Λ_{θ}^{2} (τ)} + \frac{Λ_{θ} (τ) \frac{\partial Λ_{θ, 2} (τ)}{\partial θ_{Λ}} - Λ_{θ, 2} (τ) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}}{Λ_{θ}^{2} (τ)} = \frac{Λ_{θ} (τ) (\frac{\partial Λ_{θ, 1} (τ)}{\partial θ_{Λ}} + \frac{\partial Λ_{θ, 2} (τ)}{\partial θ_{Λ}}) - (Λ_{θ, 1} (τ) + Λ_{θ, 2} (τ)) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}}{Λ_{θ}^{2} (τ)} = \frac{Λ_{θ} (τ) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} - Λ_{θ} (τ) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}}{Λ_{θ}^{2} (τ)} = 0, θ \in Θ, τ \geq t_{0},

(32)

where we have used the fact that Λ_θ(τ) ≔ Λ_θ,1(τ) + Λ_θ,2(τ), τ ≥ t₀ and θ ∈ Θ. Consider the term

\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} = \frac{\partial ε_{θ, 1} (τ)}{\partial θ_{Λ}} f_{θ, τ, 1} (r) + ε_{θ, 1} (τ) \frac{\partial f_{θ, τ, 1} (r)}{\partial θ_{Λ}} + \frac{\partial ε_{θ, 2} (τ)}{\partial θ_{Λ}} f_{θ, τ, 2} (r) + ε_{θ, 2} (τ) \frac{\partial f_{θ, τ, 2} (r)}{\partial θ_{Λ}},

(33)

where θ ∈ Θ, τ ≥ t₀ and r ∈ 𝒞. Substituting A1 in eq. 33 and using eq. 32, we have for θ ∈ Θ, τ ≥ t₀ and r ∈ 𝒞

\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} = f_{θ, τ, 1} (r) (\frac{\partial ε_{θ, 1} (τ)}{\partial θ_{Λ}} + \frac{\partial ε_{θ, 2} (τ)}{\partial θ_{Λ}}) = 0 .

2. Using A2 we have, $ε_{θ, 1} (τ) = \frac{1}{1 + β (τ)}$ , θ ∈ Θ and τ ≥ t₀, and $ε_{θ, 2} (τ) = \frac{β (τ)}{1 + β (τ)}$ , θ ∈ Θ and τ ≥ t₀. Since β(τ) is independent of θ for τ ≥ t₀, $\frac{\partial ε_{θ, i} (τ)}{\partial θ_{Λ}} = 0$ , θ ∈ Θ, τ ≥ t₀ and i = 1, 2. Substituting this in eq. 33 the result follows.
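The proportional-rate case of Lemma 1 can be illustrated with a finite-difference check. The sketch below is only an illustration under assumptions: the photon detection rates are taken as Λ₁ = θ_Λ and Λ₂ = β θ_Λ with a fixed β, two Gaussians stand in for the photon distribution profiles, and all names and numerical values are hypothetical.

```python
import numpy as np

def gaussian(r, center, sigma=1.0):
    """Stand-in photon distribution profile (assumption, not the Airy profile)."""
    diff = np.asarray(r) - np.asarray(center)
    return np.exp(-diff @ diff / (2 * sigma**2)) / (2 * np.pi * sigma**2)

def mixture(r, theta_rate, beta):
    """f = eps1 * f1 + eps2 * f2 with Lambda_1 = theta_rate, Lambda_2 = beta * theta_rate."""
    lam1, lam2 = theta_rate, beta * theta_rate
    eps1, eps2 = lam1 / (lam1 + lam2), lam2 / (lam1 + lam2)
    return eps1 * gaussian(r, (0.0, 0.0)) + eps2 * gaussian(r, (1.5, 0.0))

r, beta, h = (0.4, -0.2), 0.7, 1e-3
# Central finite difference of f with respect to the rate parameter
deriv = (mixture(r, 1000.0 + h, beta) - mixture(r, 1000.0 - h, beta)) / (2 * h)
```

The value of `deriv` vanishes up to floating point error because ε₁ = 1/(1 + β) and ε₂ = β/(1 + β) do not depend on the rate parameter.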

Lemma 2 For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, τ ≥ t₀ and i = 1, 2, let f_{θ_c,τ,i} be given by eq. 8. Let M > 0. Then for θ_c ∈ Θ_c and τ ≥ t₀, we have

1. $\frac{\partial f_{θ_{c}, τ, i} (r)}{\partial x_{0 i}} = - M \frac{\partial f_{θ_{c}, τ, i} (r)}{\partial x}$ , r = (x, y) ∈ ℝ², i = 1, 2.
2. $\frac{\partial f_{θ_{c}, τ, i} (r)}{\partial y_{0 i}} = - M \frac{\partial f_{θ_{c}, τ, i} (r)}{\partial y}$ , r = (x, y) ∈ ℝ², i = 1, 2.

Proof 1. For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c and i = 1, 2, define $u_{i} ≔ \frac{x}{M} - x_{0 i}$ and $v_{i} ≔ \frac{y}{M} - y_{0 i}$ . Then for i = 1, 2, we have

\frac{\partial f_{θ_{c}, τ, i} (r)}{\partial x_{0 i}} = \frac{1}{M^{2}} \frac{\partial q_{i} (\frac{x}{M} - x_{0 i}, \frac{y}{M} - y_{0 i})}{\partial x_{0 i}} = \frac{1}{M^{2}} \frac{\partial q_{i} (u_{i}, v_{i})}{\partial u_{i}} \frac{\partial u_{i}}{\partial x_{0 i}} = - \frac{1}{M^{2}} \frac{\partial q_{i} (u_{i}, v_{i})}{\partial u_{i}} = - \frac{1}{M^{2}} \frac{\partial q_{i} (\frac{x}{M} - x_{0 i}, \frac{y}{M} - y_{0 i})}{\partial x} \frac{\partial x}{\partial u_{i}} = - M \frac{1}{M^{2}} \frac{\partial q_{i} (\frac{x}{M} - x_{0 i}, \frac{y}{M} - y_{0 i})}{\partial x} = - M \frac{\partial f_{θ_{c}, τ, i} (r)}{\partial x},

for r = (x, y) ∈ ℝ², θ_c ∈ Θ_c and τ ≥ t₀.

2. Proof is similar to that of result 1.
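The identity of Lemma 2 can be verified numerically with central finite differences. The following sketch assumes a symmetric Gaussian as a stand-in image function and a hypothetical magnification M = 100. It uses the form f(x, y) = q(x/M − x₀, y/M − y₀)/M² that appears in the first step of the proof.

```python
import numpy as np

M = 100.0  # hypothetical lateral magnification

def q(u, v, sigma=1.0):
    """Stand-in image function (symmetric Gaussian); an assumption for illustration."""
    return np.exp(-(u**2 + v**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)

def f(x, y, x0, y0):
    """Photon distribution profile f(x, y) = q(x/M - x0, y/M - y0) / M^2."""
    return q(x / M - x0, y / M - y0) / M**2

x, y, x0, y0 = 130.0, -40.0, 0.8, 0.1  # arbitrary evaluation point and object location
h = 1e-5
# Derivative with respect to the object coordinate x0
d_x0 = (f(x, y, x0 + h, y0) - f(x, y, x0 - h, y0)) / (2 * h)
# Derivative with respect to the detector coordinate x (step scaled by M)
d_x = (f(x + h * M, y, x0, y0) - f(x - h * M, y, x0, y0)) / (2 * h * M)
residual = d_x0 + M * d_x  # Lemma 2 predicts this to vanish
```

The same check with the roles of x and y exchanged illustrates result 2.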

Lemma 3 For i = 1, 2, let Q_i be given by eq. 20, and Λ_i and q_i denote the photon detection rate and the image function of the i^th object, respectively. For i = 1, 2, assume that q_i is radially symmetric with respect to the origin, i.e., there exists a q_i such that $q_{i} (x, y) = q_{i} (\sqrt{x^{2} + y^{2}})$ for (x, y) ∈ ℝ² and i = 1, 2. Then for i = 1, 2,

Q_{i} = \frac{1}{{(δ_{rs, i}^{loc})}^{2}} 1_{2 \times 2},

where 1_2×2 denotes the 2 × 2 identity matrix and $δ_{rs, i}^{loc}$ , i = 1, 2, is given by eq. 23.

Proof By definition, q_i, i = 1, 2, is symmetric along the x and y axes with respect to the origin. Using this, it can be shown that (see (Ram et al, 2006b, pg 39))

Q_{i} = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) Diag [\int_{ℝ^{2}} \frac{1}{q_{i} (x, y)} {(\frac{\partial q_{i} (x, y)}{\partial x})}^{2} dxdy, \int_{ℝ^{2}} \frac{1}{q_{i} (x, y)} {(\frac{\partial q_{i} (x, y)}{\partial y})}^{2} dxdy],

where Diag denotes the diagonal matrix. Further, using the fact that q_i, i = 1, 2, is radially symmetric, we have

{[Q_{i}]}_{11} = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) \int_{ℝ^{2}} \frac{1}{q_{i} (x, y)} {(\frac{\partial q_{i} (x, y)}{\partial x})}^{2} dxdy = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) \int_{0}^{2 π} \int_{0}^{\infty} \frac{1}{q_{i} (r)} {(\frac{\partial q_{i} (r)}{\partial r} \frac{\partial r}{\partial x})}^{2} rdrd ϕ = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) \int_{0}^{2 π} {cos}^{2} (ϕ) d ϕ \int_{0}^{\infty} \frac{1}{q_{i} (r)} {(\frac{\partial q_{i} (r)}{\partial r})}^{2} rdr = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) (\int_{0}^{2 π} \frac{1 + cos (2 ϕ)}{2} d ϕ) κ_{i} = (\int_{t_{0}}^{t} Λ_{i} (τ) d τ) π κ_{i} = \frac{1}{{(δ_{rs, i}^{loc})}^{2}},

where i = 1, 2, and κ_i is defined in eq. 23. Similarly, we can show that for i = 1, 2, ${[Q_{i}]}_{22} = 1 / {(δ_{rs, i}^{loc})}^{2}$ .
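The structure asserted by Lemma 3 can be checked by direct numerical integration. The sketch below is an illustration under assumptions: a symmetric Gaussian of width σ stands in for the radially symmetric image function, for which the matrix reduces to (N/σ²) 1_2×2 with N the integral of Λ_i over [t₀, t], and the grid, the value of N, and the helper name `integrate` are hypothetical.

```python
import numpy as np

sigma = 1.0  # width of the stand-in Gaussian image function (assumption)
grid = np.linspace(-10 * sigma, 10 * sigma, 1001)
x, y = np.meshgrid(grid, grid, indexing="ij")
step = grid[1] - grid[0]

q = np.exp(-(x**2 + y**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)
dq_dx = -x / sigma**2 * q  # analytical partial derivatives of the Gaussian
dq_dy = -y / sigma**2 * q

def integrate(values):
    """Riemann sum over the grid; adequate here because the integrand decays fast."""
    return float(values.sum() * step**2)

n_photons = 3000.0  # hypothetical value of the integral of Lambda_i over [t0, t]
Q = n_photons * np.array([
    [integrate(dq_dx**2 / q), integrate(dq_dx * dq_dy / q)],
    [integrate(dq_dx * dq_dy / q), integrate(dq_dy**2 / q)],
])
delta_loc = 1.0 / np.sqrt(Q[0, 0])  # localization limit implied by Q
```

The off-diagonal entries of `Q` vanish and the two diagonal entries coincide, so `Q` is a multiple of the identity and `delta_loc` equals σ/√N for this profile.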

A.1 Proof of Theorem 1

Proof 1. Substituting for Λ_θ and f_θ,τ in eq. 1, and using assumptions A1 – A2 we get

I_{sim} (θ) = \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{Λ_{θ} (τ) f_{θ, τ} (r)} (\begin{matrix} Λ_{θ} (τ) {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} \\ Λ_{θ} (τ) {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}})}^{T} + f_{θ, τ} (r) {(\frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} \end{matrix}) \times (\begin{matrix} Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{f}} & Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} \end{matrix}) drd τ = [\begin{matrix} S_{sim} (θ) & \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} (Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}) drd τ \\ {(\int_{t_{0}}^{t} \int_{𝒞} \frac{1}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} (f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} + Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}}) drd τ)}^{T} & \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{Λ_{θ} (τ) f_{θ, τ} (r)} {(Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} (Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}) drd τ \end{matrix}] .

(34)

By definition, f_θ,τ is a probability density function, which satisfies the regularity conditions that are necessary for the calculation of the Fisher information matrix (Kay (1993)). Hence we have for θ ∈ Θ and τ ≥ t₀,

\int_{𝒞} \frac{\partial f_{θ, τ} (r)}{\partial θ} dr = (\begin{matrix} \int_{𝒞} \frac{\partial f_{θ, τ} (r)}{\partial θ_{f}} dr \\ \int_{𝒞} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} dr \end{matrix}) = (\begin{matrix} \frac{\partial}{\partial θ_{f}} \int_{𝒞} f_{θ, τ} (r) dr \\ \frac{\partial}{\partial θ_{Λ}} \int_{𝒞} f_{θ, τ} (r) dr \end{matrix}) = (\begin{matrix} \frac{\partial}{\partial θ_{f}} 1 \\ \frac{\partial}{\partial θ_{Λ}} 1 \end{matrix}) = (\begin{matrix} 0 \\ 0 \end{matrix}) .

(35)

Using eq. 35, we have

{[I_{sim} (θ)]}_{12} = {[I_{sim} (θ)]}_{21}^{T} = \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} (Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}) drd τ = \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ} (τ)}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{f}})}^{T} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} drd τ = R_{sim} (θ), θ \in Θ .

(36)

Using eq. 35 and the fact that ∫_𝒞 f_θ,τ (r)dr = 1 for θ ∈ Θ and τ ≥ t₀, we have

{[I_{sim} (θ)]}_{22} = \int_{t_{0}}^{t} \int_{𝒞} \frac{1}{Λ_{θ} (τ) f_{θ, τ} (r)} {(Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} (Λ_{θ} (τ) \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} + f_{θ, τ} (r) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}}) drd τ = \int_{t_{0}}^{t} \int_{𝒞} \frac{Λ_{θ} (τ)}{f_{θ, τ} (r)} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}})}^{T} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} drd τ + \int_{t_{0}}^{t} (\int_{𝒞} {(\frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}})}^{T} dr) \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} d τ + \int_{t_{0}}^{t} \frac{1}{Λ_{θ} (τ)} {(\frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} \frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}} d τ + \int_{t_{0}}^{t} {(\frac{\partial Λ_{θ} (τ)}{\partial θ_{Λ}})}^{T} \int_{𝒞} \frac{\partial f_{θ, τ} (r)}{\partial θ_{Λ}} drd τ = T_{sim} (θ), θ \in Θ .

(37)

Substituting eqs. 36 and 37 in eq. 34, the result immediately follows.

2. Using assumptions A1 and A3 it can be shown that (∂f_θ,τ (r)/∂θ_Λ) = 0, r ∈ 𝒞, θ ∈ Θ, τ ≥ t₀ (see result 3 of Lemma 1 in Appendix). Substituting this and using assumption A3 in eqs. 5, 6 and 7, we obtain the desired result.

3. Using assumptions A1 and A4 it can be shown that (∂f_θ,τ (r)/∂θ_Λ) = 0, r ∈ 𝒞, θ ∈ Θ, τ ≥ t₀ (see result 2 of Lemma 1 in Appendix). Further, by assumption A4 we have f_θ,τ(r) = f_θ,τ,1(r)(ε_θ,1(τ) + ε_θ,2(τ)) = f_θ,τ,1(r), r ∈ 𝒞 and τ ≥ t₀. Substituting these results in eqs. 5, 6 and 7, we obtain the desired result.

A.2 Proof of results 2 and 3 of Theorem 2

Proof 2. For $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , define s_x ≔ (x₀₁ + x₀₂)/2, s_y ≔ (y₀₁ + y₀₂)/2 and ϕ ≔ tan⁻¹((y₀₂ − y₀₁)/(x₀₂ − x₀₁)). Then we have $x_{01} = s_{x} - \frac{d cos ϕ}{2}, y_{01} = s_{y} - \frac{d sin ϕ}{2}, x_{02} = s_{x} + \frac{d cos ϕ}{2}, y_{02} = s_{y} + \frac{d sin ϕ}{2}$ . Substituting this in result 1 of Theorem 2 and using the shift-invariant property of Lebesgue integrals, we get for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ ,

S_{sim} (θ_{c}) ≔ \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Λ_{1} (τ) q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ) + Λ_{2} (τ) q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ)} \times [\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ)}{\partial x} \\ Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ)}{\partial y} \\ Λ_{2} (τ) \frac{\partial q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ)}{\partial x} \\ Λ_{2} (τ) \frac{\partial q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ)}{\partial y} \end{matrix}] {[\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ)}{\partial x} \\ Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ)}{\partial y} \\ Λ_{2} (τ) \frac{\partial q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ)}{\partial x} \\ Λ_{2} (τ) \frac{\partial q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ)}{\partial y} \end{matrix}]}^{T} dxdyd τ .

(38)

For (x, y) ∈ ℝ², τ ≥ t₀ and $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , let

Q_{θ_{c}}^{+} (x, y, τ) ≔ Λ_{1} (τ) q_{1} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ),

(39)

Q_{θ_{c}}^{-} (x, y, τ) ≔ Λ_{2} (τ) q_{2} (x - \frac{d}{2} cos ϕ, y - \frac{d}{2} sin ϕ) .

(40)

For ϕ ∈ (0, 2π), define T_ϕ : ℝ² → ℝ²

(\begin{matrix} x \\ y \end{matrix}) \mapsto (\begin{matrix} u \\ v \end{matrix}) = (\begin{matrix} x cos ϕ + y sin ϕ \\ - x sin ϕ + y cos ϕ \end{matrix}) .

The transformation T_ϕ maps the coordinates of a point on the 2D plane when the coordinate axes are rotated by an angle ϕ. Let $P^{\pm} ≔ (x \pm \frac{d}{2} cos ϕ, y \pm \frac{d}{2} sin ϕ)$ . Then

{P̃}^{\pm} ≔ T_{ϕ} P^{\pm} = (\begin{matrix} cos ϕ & sin ϕ \\ - sin ϕ & cos ϕ \end{matrix}) (\begin{matrix} x \pm \frac{d}{2} cos ϕ \\ y \pm \frac{d}{2} sin ϕ \end{matrix}) = (\begin{matrix} x cos ϕ + y sin ϕ \pm \frac{d}{2} \\ - x sin ϕ + y cos ϕ \end{matrix}) .

(41)

Using eq. 41, we have for τ ≥ t₀ and $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ ,

(Q_{θ_{c}}^{+} ◦ T_{ϕ}) (x, y, τ) = Λ_{1} (τ) q_{1} (T_{ϕ} (x + \frac{d}{2} cos ϕ, y + \frac{d}{2} sin ϕ)) = Λ_{1} (τ) q_{1} (T_{ϕ} (P^{+})) = Λ_{1} (τ) q_{1} ({P̃}^{+}) = Λ_{1} (τ) q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ), (x, y) \in ℝ^{2},

(42)

(Q_{θ_{c}}^{-} ◦ T_{ϕ}) (x, y, τ) = Λ_{2} (τ) q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ), (x, y) \in ℝ^{2} .

(43)

Similarly, for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , τ ≥t₀ and ζ ∈ {x, y},

(\frac{\partial Q_{θ_{c}}^{+}}{\partial ζ} ◦ T_{ϕ}) (x, y) = Λ_{1} (τ) \frac{\partial q_{1} (T_{ϕ} (P^{+}))}{\partial ζ} = Λ_{1} (τ) \frac{\partial q_{1} ({P̃}^{+})}{\partial ζ} = Λ_{1} (τ) \frac{\partial q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial ζ}, (x, y) \in ℝ^{2},

(44)

(\frac{\partial Q_{θ_{c}}^{-}}{\partial ζ} ◦ T_{ϕ}) (x, y) = Λ_{2} (τ) \frac{\partial q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial ζ}, (x, y) \in ℝ^{2} .

(45)

By definition, the determinant of the Jacobian of T_ϕ is given by

Det [T_{ϕ}^{'}] ≔ Det [\begin{matrix} cos ϕ & sin ϕ \\ - sin ϕ & cos ϕ \end{matrix}] = 1, ϕ \in (0, 2 π),

(46)

and for (u, v) ≔ T_ϕ(x, y),

dudv = | Det [T_{ϕ}^{'}] | dxdy = dxdy .

(47)
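The action of T_ϕ on the shifted points P^± (eq. 41) and the unit Jacobian determinant (eq. 46) can be confirmed numerically. The sketch below is a simple illustration; the angle, the distance, the test point, and the helper name `rotation` are hypothetical choices.

```python
import numpy as np

def rotation(phi):
    """Matrix of T_phi: coordinates of a point after rotating the axes by phi."""
    return np.array([[np.cos(phi), np.sin(phi)],
                     [-np.sin(phi), np.cos(phi)]])

phi, d = 0.6, 50.0  # hypothetical angle (rad) and distance (nm)
x, y = 12.0, -7.0   # arbitrary point on the plane
T = rotation(phi)
# Half-distance offset along the line joining the two objects
shift = 0.5 * d * np.array([np.cos(phi), np.sin(phi)])

p_plus = T @ (np.array([x, y]) + shift)   # image of P^+ under T_phi
p_minus = T @ (np.array([x, y]) - shift)  # image of P^- under T_phi
u, v = T @ np.array([x, y])               # rotated coordinates of (x, y)
det_T = np.linalg.det(T)
```

In the rotated coordinates the two shifted points differ from (u, v) only in the first component, by ±d/2, and the determinant equals one, so the change of variables leaves the area element unchanged.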

Substituting eqs. 42 – 47 in the expression for S_sim(θ_c) given in eq. 38 and making use of the change of variables Theorem (Rudin (1987)) we get,

S_{sim} (θ_{c}) = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Q_{θ_{c}}^{+} (x, y, τ) + Q_{θ_{c}}^{-} (x, y, τ)} (\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} (x, y, τ)}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} (x, y, τ)}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} (x, y, τ)}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} (x, y, τ)}{\partial y} \end{matrix}) {(\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} (x, y, τ)}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} (x, y, τ)}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} (x, y, τ)}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} (x, y, τ)}{\partial y} \end{matrix})}^{T} dxdyd τ = \int_{t_{0}}^{t} \int_{T_{ϕ} (ℝ^{2})} ((\frac{1}{Q_{θ_{c}}^{+} + Q_{θ_{c}}^{-}} (\begin{matrix} \frac{\partial Q_{θ_{c}}^{+}}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+}}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-}}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-}}{\partial y} \end{matrix}) {(\begin{matrix} \frac{\partial Q_{θ_{c}}^{+}}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+}}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-}}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-}}{\partial y} \end{matrix})}^{T}) \circ T_{ϕ}) (x, y, τ) | Det [T_{ϕ}^{'}] | dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Q_{θ_{c}}^{+} (T_{ϕ} (P^{+})) + Q_{θ_{c}}^{-} (T_{ϕ} (P^{-}))} (\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} (T_{ϕ} (P^{+}))}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} (T_{ϕ} (P^{+}))}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} (T_{ϕ} (P^{-}))}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} (T_{ϕ} (P^{-}))}{\partial y} \end{matrix}) {(\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} (T_{ϕ} (P^{+}))}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} (T_{ϕ} (P^{+}))}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} (T_{ϕ} (P^{-}))}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} (T_{ϕ} (P^{-}))}{\partial y} \end{matrix})}^{T} dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Q_{θ_{c}}^{+} ({P̃}^{+}) + Q_{θ_{c}}^{-} ({P̃}^{-})} (\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} ({P̃}^{+})}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} ({P̃}^{+})}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} ({P̃}^{-})}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} ({P̃}^{-})}{\partial y} \end{matrix}) {(\begin{matrix} \frac{\partial Q_{θ_{c}}^{+} ({P̃}^{+})}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{+} ({P̃}^{+})}{\partial y} \\ \frac{\partial Q_{θ_{c}}^{-} ({P̃}^{-})}{\partial x} \\ \frac{\partial Q_{θ_{c}}^{-} ({P̃}^{-})}{\partial y} \end{matrix})}^{T} dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Λ_{1} (τ) q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ) + Λ_{2} (τ) q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)} \times (\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial x} \\ Λ_{1} (τ) \frac{\partial q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial y} \\ Λ_{2} (τ) \frac{\partial q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial x} \\ Λ_{2} (τ) \frac{\partial q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial y} \end{matrix}) {(\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial x} \\ Λ_{1} (τ) \frac{\partial q_{1} (x cos ϕ + y sin ϕ + \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial y} \\ Λ_{2} (τ) \frac{\partial q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial x} \\ Λ_{2} (τ) \frac{\partial q_{2} (x cos ϕ + y sin ϕ - \frac{d}{2}, - x sin ϕ + y cos ϕ)}{\partial y} \end{matrix})}^{T} dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Λ_{1} (τ) q_{1} (u + \frac{d}{2}, v) + Λ_{2} (τ) q_{2} (u - \frac{d}{2}, v)} (\begin{matrix} Λ_{1} (τ) (cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{1} (τ) (sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \end{matrix}) \times {(\begin{matrix} Λ_{1} (τ) (cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{1} (τ) (sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \end{matrix})}^{T} dudvd τ, θ_{c} \in Θ_{c},

(48)

where u ≔ x cos ϕ + y sin ϕ and v ≔ −x sin ϕ + y cos ϕ. Further, for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , τ ≥ t₀ and (x, y) ∈ ℝ², we have

(\begin{matrix} Λ_{1} (τ) (cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{1} (τ) (sin ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} - sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \\ Λ_{2} (τ) (sin ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} + cos ϕ \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v}) \end{matrix}) = [\begin{matrix} cos ϕ & - sin ϕ & 0 & 0 \\ sin ϕ & cos ϕ & 0 & 0 \\ 0 & 0 & cos ϕ & - sin ϕ \\ 0 & 0 & sin ϕ & cos ϕ \end{matrix}] [\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} \\ Λ_{1} (τ) \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v} \\ Λ_{2} (τ) \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} \\ Λ_{2} (τ) \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v} \end{matrix}] = \frac{1}{d} [\begin{matrix} x_{02} - x_{01} & - (y_{02} - y_{01}) & 0 & 0 \\ y_{02} - y_{01} & x_{02} - x_{01} & 0 & 0 \\ 0 & 0 & x_{02} - x_{01} & - (y_{02} - y_{01}) \\ 0 & 0 & y_{02} - y_{01} & x_{02} - x_{01} \end{matrix}] [\begin{matrix} Λ_{1} (τ) \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial u} \\ Λ_{1} (τ) \frac{\partial q_{1} (u + \frac{d}{2}, v)}{\partial v} \\ Λ_{2} (τ) \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial u} \\ Λ_{2} (τ) \frac{\partial q_{2} (u - \frac{d}{2}, v)}{\partial v} \end{matrix}] = D (θ_{c}) (\begin{matrix} Λ_{1} (τ) q_{1, x}^{'} (x, y) \\ Λ_{1} (τ) q_{1, y}^{'} (x, y) \\ Λ_{2} (τ) q_{2, x}^{'} (x, y) \\ Λ_{2} (τ) q_{2, y}^{'} (x, y) \end{matrix}),

where D(θ_c) is defined in eq. 13, $q_{i, ζ}^{'}, i = 1, 2$ , ζ ∈ {x, y} is given by eq. 16 and we have used the fact that cos ϕ ≔ (x₀₂ − x₀₁)/d and sin ϕ ≔ (y₀₂ − y₀₁)/d. Substituting the above expression in eq. 48, the result immediately follows.
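The chain-rule step used above, which expresses the derivatives with respect to (x, y) as a rotation of the derivatives with respect to (u, v), can be checked numerically. The sketch below assumes an anisotropic Gaussian profile for q₁ (so that the rotation has a visible effect); the profile, the helper names `q`, `grad_q` and `g`, and all numerical values are assumptions made for illustration only.

```python
import numpy as np

def q(u, v, su=1.0, sv=2.0):
    """Assumed anisotropic Gaussian image profile."""
    return np.exp(-0.5 * ((u / su) ** 2 + (v / sv) ** 2)) / (2 * np.pi * su * sv)

def grad_q(u, v, su=1.0, sv=2.0):
    """Analytical gradient of q with respect to (u, v)."""
    val = q(u, v, su, sv)
    return np.array([-u / su**2 * val, -v / sv**2 * val])

phi, d = 0.7, 0.8  # illustrative angle and separation
c, s = np.cos(phi), np.sin(phi)

def g(x, y):
    """q evaluated at the rotated and shifted point (u + d/2, v)."""
    u = x * c + y * s
    v = -x * s + y * c
    return q(u + d / 2, v)

# Central finite differences of g with respect to x and y.
x0, y0, h = 0.4, -0.9, 1e-5
grad_xy = np.array([
    (g(x0 + h, y0) - g(x0 - h, y0)) / (2 * h),
    (g(x0, y0 + h) - g(x0, y0 - h)) / (2 * h),
])

# Chain rule: one 2x2 block of the block-diagonal rotation matrix in the text.
u0, v0 = x0 * c + y0 * s, -x0 * s + y0 * c
R = np.array([[c, -s], [s, c]])
grad_from_uv = R @ grad_q(u0 + d / 2, v0)

print("finite differences:", grad_xy)
print("rotation of (u, v) gradient:", grad_from_uv)
```

The matrix `R` is the 2 × 2 block that appears twice on the diagonal of the 4 × 4 matrix in the expression above, once for each object.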

3. To prove this result we need to show that the off-diagonal terms of C_ij(θ_c) are zero, for i, j = 1, 2 and $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ . For $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , τ ≥ t₀ and (x, y) ∈ ℝ², let

W_{θ_{c}}^{1} (x, y, τ) ≔ Λ_{1} (τ) q_{1} (x + \frac{d}{2}, y), W_{θ_{c}}^{2} (x, y, τ) ≔ Λ_{2} (τ) q_{2} (x - \frac{d}{2}, y) .

(49)

Define T_Y : ℝ² × [t₀, ∞) → ℝ² × [t₀, ∞), (x, y, τ) ↦ (x, −y, τ). Since q₁ and q₂ are symmetric along the y axis with respect to y = 0, we have $W_{θ_{c}}^{1} (x, y, τ) = (W_{θ_{c}}^{1} \circ T_{Y}) (x, y, τ)$ and $W_{θ_{c}}^{2} (x, y, τ) = (W_{θ_{c}}^{2} \circ T_{Y}) (x, y, τ)$ for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , (x, y) ∈ ℝ² and τ ≥ t₀. This implies that for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ , (x, y) ∈ ℝ² and τ ≥ t₀, we have

U_{θ_{c}}^{\pm} (x, y, τ) = Λ_{1} (τ) q_{1} (x + \frac{d}{2}, y) \pm Λ_{2} (τ) q_{2} (x - \frac{d}{2}, y) = (U_{θ_{c}}^{\pm} \circ T_{Y}) (x, y, τ),

(50)

\frac{\partial W_{θ_{c}}^{i} (x, y, τ)}{\partial x} = (\frac{\partial W_{θ_{c}}^{i}}{\partial x} \circ T_{Y}) (x, y, τ), i = 1, 2,

(51)

\frac{\partial W_{θ_{c}}^{i} (x, y, τ)}{\partial y} = - (\frac{\partial W_{θ_{c}}^{i}}{\partial y} \circ T_{Y}) (x, y, τ), i = 1, 2 .

(52)

Consider the term [C₁₁(θ_c)]₁₂, where C₁₁(θ_c) is given by eq. 15. Using eqs. 50, 51 and 52 we have

{[C_{11} (θ_{c})]}_{12} = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{Λ_{1} (τ) q_{1} (x + \frac{d}{2}, y) + Λ_{2} (τ) q_{2} (x - \frac{d}{2}, y)} \times (Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2}, y)}{\partial x}) (Λ_{1} (τ) \frac{\partial q_{1} (x + \frac{d}{2}, y)}{\partial y}) dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{U_{θ_{c}}^{+} (x, y, τ)} \frac{\partial W_{θ_{c}}^{1} (x, y, τ)}{\partial x} \frac{\partial W_{θ_{c}}^{1} (x, y, τ)}{\partial y} dxdyd τ = - \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{(U_{θ_{c}}^{+} \circ T_{Y}) (x, y, τ)} (\frac{\partial W_{θ_{c}}^{1}}{\partial x} \circ T_{Y}) (x, y, τ) (\frac{\partial W_{θ_{c}}^{1}}{\partial y} \circ T_{Y}) (x, y, τ) dxdyd τ = - \int_{t_{0}}^{t} \int_{ℝ^{2}} ((\frac{1}{U_{θ_{c}}^{+}} \frac{\partial W_{θ_{c}}^{1}}{\partial x} \frac{\partial W_{θ_{c}}^{1}}{\partial y}) \circ T_{Y}) (x, y, τ) dxdyd τ = - \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{1}{U_{θ_{c}}^{+} (x, y, τ)} \frac{\partial W_{θ_{c}}^{1} (x, y, τ)}{\partial x} \frac{\partial W_{θ_{c}}^{1} (x, y, τ)}{\partial y} dxdyd τ = - {[C_{11} (θ_{c})]}_{12}, θ_{c} \in Θ_{c} \ Θ_{c}^{0},

where we have used the change of variables theorem in the final step. From the above equation it follows that [C₁₁(θ_c)]₁₂ = [C₁₁(θ_c)]₂₁ = 0, $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ . Similarly, by using eqs. 50, 51 and 52, we can show that [C₁₂(θ_c)]₁₂ = [C₁₂(θ_c)]₂₁ = 0, and [C₂₂(θ_c)]₁₂ = [C₂₂(θ_c)]₂₁ = 0 for $θ_{c} \in Θ_{c} \ Θ_{c}^{0}$ . From this the result follows.
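The symmetry argument can be illustrated numerically. The following sketch approximates [C₁₁(θ_c)]₁₂ and [C₁₁(θ_c)]₁₁ by a Riemann sum on a grid, under assumptions that are not part of the proof: a 2D Gaussian image profile for both objects, constant photon detection rates (so that the time integral reduces to a factor `duration`), and arbitrarily chosen numerical values.

```python
import numpy as np

def gauss(x, y, sigma):
    """Assumed 2D Gaussian image profile, symmetric with respect to y = 0."""
    return np.exp(-(x**2 + y**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)

d, sigma = 1.5, 1.0                         # separation and profile width (assumed)
lam1, lam2, duration = 1000.0, 600.0, 1.0   # constant rates and t - t0 (assumed)

axis = np.linspace(-10.0, 10.0, 801)
X, Y = np.meshgrid(axis, axis, indexing="ij")
h = axis[1] - axis[0]

W1 = lam1 * gauss(X + d / 2, Y, sigma)      # W^1 of eq. 49
W2 = lam2 * gauss(X - d / 2, Y, sigma)      # W^2 of eq. 49
dW1_dx = -(X + d / 2) / sigma**2 * W1       # even in y
dW1_dy = -Y / sigma**2 * W1                 # odd in y

# Off-diagonal entry: the integrand is odd in y, so the sum should cancel.
c11_12 = duration * np.sum(dW1_dx * dW1_dy / (W1 + W2)) * h**2
# Diagonal entry for comparison: the integrand is non-negative.
c11_11 = duration * np.sum(dW1_dx**2 / (W1 + W2)) * h**2

print("[C11]_12 ~", c11_12)
print("[C11]_11 ~", c11_11)
```

Because the grid is symmetric about y = 0, contributions at (x, y) and (x, −y) cancel in the off-diagonal sum up to floating-point rounding, mirroring the change of variables (x, y) ↦ (x, −y) used in the proof.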

Lemma 4 For θ_c = (x₀₁, y₀₁, x₀₂, y₀₂) ∈ Θ_c, let K₁₂(θ_c) be given by eq. 12 and for i = 1, 2 let Q_i be given by eq. 20. Then for θ_c ∈ Θ_c and i, j = 1, 2, we have

{[K_{12} (θ_{c})]}_{ij} \leq \sqrt{{[Q_{1}]}_{ii} {[Q_{2}]}_{jj}} < \infty .

Proof Define Δ_x = x₀₂ − x₀₁ and Δ_y = y₀₂ − y₀₁. Applying the Cauchy-Schwarz inequality to the term [K₁₂(θ_c)]₁₁ and using the fact that Λ₁,Λ₂, q₁, q₂ ≥ 0, we have for θ_c ∈ Θ_c

{[K_{12} (θ_{c})]}_{11} = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1} (τ) Λ_{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} \times \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial x} dxdyd τ \leq {(\int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1} (τ) Λ_{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} dxdyd τ)}^{\frac{1}{2}} \times {(\int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1} (τ) Λ_{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} {(\frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial x})}^{2} dxdyd τ)}^{\frac{1}{2}} \leq {(\int_{t_{0}}^{t} Λ_{2} (τ) d τ)}^{\frac{1}{2}} {(\int_{ℝ^{2}} \frac{1}{q_{1} (x, y)} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} dxdy)}^{\frac{1}{2}} \times {(\int_{t_{0}}^{t} Λ_{1} (τ) d τ)}^{\frac{1}{2}} {(\int_{ℝ^{2}} \frac{1}{q_{2} (x - Δ_{x}, y - Δ_{y})} {(\frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial x})}^{2} dxdy)}^{\frac{1}{2}} = {(\int_{t_{0}}^{t} Λ_{1} (τ) d τ \int_{ℝ^{2}} \frac{1}{q_{1} (x, y)} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} dxdy)}^{\frac{1}{2}} \times {(\int_{t_{0}}^{t} Λ_{2} (τ) d τ \int_{ℝ^{2}} \frac{1}{q_{2} (x, y)} {(\frac{\partial q_{2} (x, y)}{\partial x})}^{2} dxdy)}^{\frac{1}{2}} = \sqrt{{[Q_{1}]}_{11} {[Q_{2}]}_{11}} < \infty,

where we have used the shift invariant property of Lebesgue integrals in the penultimate step and the properties of image functions (see Definition 6) in the last step. The bounds for the remaining entries of K₁₂(θ_c) are proved in the same way.
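The bound of Lemma 4 can be illustrated for a concrete profile. The sketch below evaluates [K₁₂(θ_c)]₁₁, [Q₁]₁₁ and [Q₂]₁₁ by a Riemann sum, assuming (for illustration only) identical 2D Gaussian profiles of width `sigma`, constant photon detection rates over an interval of length `duration`, and arbitrary offsets `delta_x` and `delta_y`. For this assumed profile the localization term has the closed form [Q_i]₁₁ = Λ_i · duration / σ², which the code uses as a sanity check on the numerical integration.

```python
import numpy as np

sigma = 1.0
lam1, lam2, duration = 1000.0, 600.0, 1.0   # constant rates (assumed)
delta_x, delta_y = 1.5, 0.5                 # x02 - x01 and y02 - y01 (assumed)

axis = np.linspace(-12.0, 12.0, 481)
X, Y = np.meshgrid(axis, axis, indexing="ij")
h = axis[1] - axis[0]

def gauss(x, y):
    """Assumed 2D Gaussian image profile."""
    return np.exp(-(x**2 + y**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)

q1 = gauss(X, Y)
q2 = gauss(X - delta_x, Y - delta_y)
dq1_dx = -X / sigma**2 * q1
dq2_dx = -(X - delta_x) / sigma**2 * q2

# Cross term [K12]_11 and the two localization terms [Q1]_11, [Q2]_11.
k12_11 = duration * np.sum(lam1 * lam2 * dq1_dx * dq2_dx / (lam1 * q1 + lam2 * q2)) * h**2
q1_11 = duration * lam1 * np.sum(dq1_dx**2 / q1) * h**2
q2_11 = duration * lam2 * np.sum(dq2_dx**2 / q2) * h**2
bound = np.sqrt(q1_11 * q2_11)

print("[K12]_11            ~", k12_11)
print("sqrt([Q1]_11 [Q2]_11) ~", bound)
print("closed form [Q1]_11 :", lam1 * duration / sigma**2)
```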

A.3 Proof of Theorem 3

Proof Consider the term K₁₁(θ_c) given in eq. 12. By definition, the integral expression of K₁₁(θ_c) is measurable for every θ_c ∈ Θ_c. Define Δ_x ≔ x₀₂ − x₀₁ and Δ_y ≔ y₀₂ − y₀₁. Using the shift invariant property of Lebesgue integrals, and the fact that q_i(x, y) ≥ 0 and Λ_i(τ) ≥ 0 for i = 1, 2, (x, y) ∈ ℝ² and τ ≥ t₀, we have for θ_c ∈ Θ_c

K_{11} (θ_{c}) ≔ \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1}^{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} \times (\begin{matrix} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} & {(\frac{\partial q_{1} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ \leq \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1}^{2} (τ)}{Λ_{1} (τ) q_{1} (x, y)} (\begin{matrix} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} & {(\frac{\partial q_{1} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ = Q_{1} .

(53)

By definition of the image function (see Definition 6), we have for ζ₁ = x and ζ₂ = y, $\int_{ℝ^{2}} \frac{1}{q_{1} (x, y)} \frac{\partial q_{1} (x, y)}{\partial ζ_{i}} \frac{\partial q_{1} (x, y)}{\partial ζ_{j}} dxdy < \infty$ for i, j = 1, 2. This implies that K₁₁(θ_c) is dominated by the expression given in eq. 53 for every θ_c ∈ Θ_c. By definition of the image function (see Definition 6), q₁(x, y) and $\frac{\partial q_{1} (x, y)}{\partial x}$ are continuous for every x ∈ ℝ. Hence the integrand of K₁₁(θ_c) is continuous for every x ∈ ℝ. Hence by using the Theorem on changing integration and limits for Lebesgue integrals (see (Apostol, 1974, pg 281)), we have

lim_{x_{02} \to \infty} K_{11} (θ_{c}) = lim_{x_{02} \to \infty} \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1}^{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} \times (\begin{matrix} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} & {(\frac{\partial q_{1} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} lim_{x_{02} \to \infty} \frac{Λ_{1}^{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} (\begin{matrix} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} & {(\frac{\partial q_{1} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ = \int_{t_{0}}^{t} Λ_{1} (τ) d τ \int_{ℝ^{2}} \frac{1}{q_{1} (x, y)} (\begin{matrix} {(\frac{\partial q_{1} (x, y)}{\partial x})}^{2} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{1} (x, y)}{\partial y} & {(\frac{\partial q_{1} (x, y)}{\partial y})}^{2} \end{matrix}) dxdy = Q_{1},

where we have used assumption A1 in the next to last step. Similarly, we can show that lim_{x₀₂→∞} K₂₂(θ_c) = Q₂. For the term K₁₂(θ_c), by definition, the integrand is measurable. Further, by definition of the image function, the integrand of K₁₂(θ_c) is continuous for every x ∈ ℝ. From Lemma 4 (see Appendix) we know that the entries of K₁₂(θ_c) are dominated by integral expressions that are independent of θ_c ∈ Θ_c and are bounded. Hence, using the above results pertaining to K₁₂(θ_c) and assumptions A1 and A2, we apply the Theorem on changing integration and limits for Lebesgue integrals (see (Apostol, 1974, pg 281)) to obtain

lim_{x_{02} \to \infty} K_{12} (θ_{c}) = \int_{t_{0}}^{t} \int_{ℝ^{2}} lim_{x_{02} \to \infty} \frac{Λ_{1} (τ) Λ_{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + Λ_{2} (τ) q_{2} (x - Δ_{x}, y - Δ_{y})} \times (\begin{matrix} \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial x} & \frac{\partial q_{1} (x, y)}{\partial x} \frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial y} \\ \frac{\partial q_{1} (x, y)}{\partial y} \frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial x} & \frac{\partial q_{1} (x, y)}{\partial y} \frac{\partial q_{2} (x - Δ_{x}, y - Δ_{y})}{\partial y} \end{matrix}) dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{1} (τ) Λ_{2} (τ)}{Λ_{1} (τ) q_{1} (x, y) + 0} (\begin{matrix} \frac{\partial q_{1} (x, y)}{\partial x} 0 & \frac{\partial q_{1} (x, y)}{\partial x} 0 \\ \frac{\partial q_{1} (x, y)}{\partial y} 0 & \frac{\partial q_{1} (x, y)}{\partial y} 0 \end{matrix}) dxdyd τ = 0 .
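The two limits can be observed numerically for a specific profile. The sketch below evaluates the (1,1) entries of K₁₁(θ_c) and K₁₂(θ_c) for increasing separations along the x axis, assuming (for illustration only) identical 2D Gaussian profiles, constant photon detection rates and Δ_y = 0. The helper name `k_entries` and all numerical values are assumptions.

```python
import numpy as np

sigma = 1.0
lam1, lam2, duration = 1000.0, 600.0, 1.0   # constant rates (assumed)

xs = np.linspace(-10.0, 25.0, 701)
ys = np.linspace(-10.0, 10.0, 401)
X, Y = np.meshgrid(xs, ys, indexing="ij")
cell = (xs[1] - xs[0]) * (ys[1] - ys[0])

def gauss(x, y):
    """Assumed 2D Gaussian image profile."""
    return np.exp(-(x**2 + y**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)

q1 = gauss(X, Y)
dq1_dx = -X / sigma**2 * q1
# [Q1]_11, written as the integral of (x / sigma^2)^2 q1 to avoid dividing tiny numbers.
q1_11 = duration * lam1 * np.sum((X / sigma**2) ** 2 * q1) * cell

def k_entries(delta_x):
    """Riemann-sum approximations of [K11]_11 and [K12]_11 for a shift delta_x."""
    q2 = gauss(X - delta_x, Y)
    dq2_dx = -(X - delta_x) / sigma**2 * q2
    denom = lam1 * q1 + lam2 * q2
    k11 = duration * np.sum(lam1**2 * dq1_dx**2 / denom) * cell
    k12 = duration * np.sum(lam1 * lam2 * dq1_dx * dq2_dx / denom) * cell
    return k11, k12

separations = [0.5, 2.0, 4.0, 8.0, 12.0]
results = [k_entries(s) for s in separations]
for s, (k11, k12) in zip(separations, results):
    print(f"delta_x = {s:5.1f}   [K11]_11 / [Q1]_11 = {k11 / q1_11:.6f}   [K12]_11 / [Q1]_11 = {k12 / q1_11:.2e}")
```

For small separations the ratio [K₁₁]₁₁ / [Q₁]₁₁ stays well below one, which reflects the loss of information caused by the overlapping image of the second object. As the separation grows the ratio approaches one and the cross term becomes negligible.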

A.4 Proof of Theorem 4

Proof 1. The image detection processes 𝒢₁ and 𝒢₂, which describe the first and second images, respectively, are assumed to be statistically independent of each other. Hence the general expression for the Fisher information matrix can be written as

S_{sim, sp} (θ_{c}) = S_{sim, sp, 1} (θ_{c}) + S_{sim, sp, 2} (θ_{c}), θ_{c} \in Θ_{c},

where S_sim,sp,1 and S_sim,sp,2 denote the Fisher information matrices corresponding to the image detection processes 𝒢₁ and 𝒢₂, respectively. In the present case, we assume without loss of generality that (x₀₁, y₀₁) are the location coordinates determined from the first image. Then it immediately follows that S_sim,sp,1(θ_c) = Q₁ for θ_c ∈ Θ_c, where Q₁ denotes the Fisher information matrix for the localization accuracy problem corresponding to object 1 and is given by eq. 20.

To derive an expression for S_sim,sp,2(θ_c), we make use of the fact that for the second image the location coordinates (x₀₁, y₀₁) of object 1 can be assumed to be known a priori, since they have already been determined from the first image. Hence for the second image only the location coordinates (x₀₂, y₀₂) of the second object are unknown parameters. From this it immediately follows that the expression for S_sim,sp,2(θ_c) is identical to K₂₂(θ_c), which is a component of the Fisher information matrix S_sim(θ_c) for the problem of estimating θ_c when the location coordinates of both objects are unknown (Theorem 2).

2. To show that S_sim,sp(θ_c) is invertible, we require that $Q_{1}^{- 1}$ and $K_{22}^{- 1} (θ_{c})$ exist for every θ_c ∈ Θ_c. We prove the result by contradiction. Define Δ_x ≔ x₀₁ − x₀₂ and Δ_y ≔ y₀₁ − y₀₂. For θ_c ∈ Θ_c, consider the term

K_{22} (θ_{c}) = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{2}^{2} (τ)}{Λ_{1} (τ) q_{1} (x - x_{01}, y - y_{01}) + Λ_{2} (τ) q_{2} (x - x_{02}, y - y_{02})} \times (\begin{matrix} {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x})}^{2} & \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} \\ \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial x} \frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y} & {(\frac{\partial q_{2} (x - x_{02}, y - y_{02})}{\partial y})}^{2} \end{matrix}) dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} \frac{Λ_{2}^{2} (τ)}{Λ_{1} (τ) q_{1} (x - Δ_{x}, y - Δ_{y}) + Λ_{2} (τ) q_{2} (x, y)} (\begin{matrix} {(\frac{\partial q_{2} (x, y)}{\partial x})}^{2} & \frac{\partial q_{2} (x, y)}{\partial x} \frac{\partial q_{2} (x, y)}{\partial y} \\ \frac{\partial q_{2} (x, y)}{\partial x} \frac{\partial q_{2} (x, y)}{\partial y} & {(\frac{\partial q_{2} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ = \int_{t_{0}}^{t} \int_{ℝ^{2}} h_{θ_{c}} (x, y, τ) (\begin{matrix} {(\frac{\partial q_{2} (x, y)}{\partial x})}^{2} & \frac{\partial q_{2} (x, y)}{\partial x} \frac{\partial q_{2} (x, y)}{\partial y} \\ \frac{\partial q_{2} (x, y)}{\partial x} \frac{\partial q_{2} (x, y)}{\partial y} & {(\frac{\partial q_{2} (x, y)}{\partial y})}^{2} \end{matrix}) dxdyd τ,

(54)

where for θ_c ∈ Θ_c,

h_{θ_{c}} (x, y, τ) ≔ \frac{Λ_{2}^{2} (τ)}{Λ_{1} (τ) q_{1} (x - Δ_{x}, y - Δ_{y}) + Λ_{2} (τ) q_{2} (x, y)}, (x, y) \in ℝ^{2}, τ \geq t_{0} .

Assume that there exists an image function q₂ such that the Fisher information matrix K₂₂(θ_c) is singular for θ_c ∈ Θ_c. Hence by eq. 54, it immediately follows that

Det [K_{22} (θ_{c})] = \int_{t_{0}}^{t} \int_{ℝ^{2}} h_{θ_{c}} (x, y, τ) {(\frac{\partial q_{2} (x, y)}{\partial x})}^{2} dxdyd τ \int_{t_{0}}^{t} \int_{ℝ^{2}} h_{θ_{c}} (x, y, τ) {(\frac{\partial q_{2} (x, y)}{\partial y})}^{2} dxdyd τ - {(\underbrace{\int_{t_{0}}^{t} \int_{ℝ^{2}} h_{θ_{c}} (x, y, τ) \frac{\partial q_{2} (x, y)}{\partial x} \frac{\partial q_{2} (x, y)}{\partial y} dxdyd τ}_{T_{2}})}^{2} = 0, θ_{c} \in Θ_{c} .

Note that the above expression pertains to the limiting case of equality of the Cauchy-Schwarz inequality applied to the term T₂. Hence by applying the condition for equality, we have for k ≠ 0

\frac{\partial q_{2} (x, y)}{\partial x} - k \frac{\partial q_{2} (x, y)}{\partial y} = 0, (x, y) \in ℝ^{2} .

(55)

The above equation is analogous to the classical one-dimensional transport equation whose solutions are given by (Strauss, 1992, pg 6–7)

q_{2} (x, y) = F (x + \frac{y}{k}), (x, y) \in ℝ^{2},

where F is defined on ℝ. As q₂ is an image function satisfying the regularity conditions, we know that q₂ is continuous on ℝ². Hence it follows that F is also continuous on ℝ. Further, q₂(x, y) ≥ 0, (x, y) ∈ ℝ² and hence F (x) ≥ 0, x ∈ ℝ. Since q₂ integrates to one, F is not identically zero, and by continuity there exist a constant K > 0 and a finite interval ℐ = (a, b) ⊂ ℝ such that F (x) ≥ K, x ∈ ℐ. Making use of the fact that $\int_{ℝ^{2}} q_{2} (x, y) dxdy = 1$ (since q₂ is an image function) and substituting for q₂ in terms of F, we have

1 = \int_{ℝ^{2}} q_{2} (x, y) dxdy = \int_{ℝ^{2}} F (x + \frac{y}{k}) dxdy = \int_{ℝ} (\int_{ℝ} F (x + \frac{y}{k}) dx) dy = \int_{ℝ} (\int_{ℐ} F (x + \frac{y}{k}) dx + \int_{ℝ \ ℐ} F (x + \frac{y}{k}) dx) dy \geq \int_{ℝ} (\int_{ℐ} Kdx + \int_{ℝ \ ℐ} F (x + \frac{y}{k}) dx) dy = K (b - a) \int_{ℝ} dy + \int_{ℝ} (\int_{ℝ \ ℐ} F (x + \frac{y}{k}) dx) dy = \infty,

which is a contradiction. Hence K₂₂(θ_c) is invertible for θ_c ∈ Θ_c. Similarly we can show that Q₁ is also invertible. From this the result follows.
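The two ingredients of the contradiction can be illustrated numerically: a profile of the form F(x + y/k) has gradients that all point in the same direction, which makes the matrix of integrated gradient products singular, and such a profile cannot have finite mass on ℝ². The sketch below is a rough illustration under stated assumptions: F(s) = exp(−s²/2), k = 2, the weight 1/q in place of h_{θ_c} (the rank argument holds for any positive weight), and integration over finite windows. The helper names `info_matrix` and `ridge_mass` are hypothetical.

```python
import numpy as np

k = 2.0
axis = np.linspace(-5.0, 5.0, 401)
X, Y = np.meshgrid(axis, axis, indexing="ij")
h = axis[1] - axis[0]

def info_matrix(q, dq_dx, dq_dy):
    """Riemann-sum approximation of the integral of (1/q) grad(q) grad(q)^T."""
    w = 1.0 / q
    return h**2 * np.array([
        [np.sum(w * dq_dx * dq_dx), np.sum(w * dq_dx * dq_dy)],
        [np.sum(w * dq_dx * dq_dy), np.sum(w * dq_dy * dq_dy)],
    ])

# Ridge profile q(x, y) = F(x + y/k): it solves dq/dx - k dq/dy = 0 (eq. 55).
s = X + Y / k
ridge = np.exp(-0.5 * s**2)
M_ridge = info_matrix(ridge, -s * ridge, -s * ridge / k)

# Gaussian profile for comparison: gradients span both directions.
gauss = np.exp(-0.5 * (X**2 + Y**2)) / (2 * np.pi)
M_gauss = info_matrix(gauss, -X * gauss, -Y * gauss)

det_ridge = np.linalg.det(M_ridge)
det_gauss = np.linalg.det(M_gauss)
print("determinant, ridge profile   :", det_ridge)
print("determinant, Gaussian profile:", det_gauss)

def ridge_mass(half_width, n=801):
    """Mass of the ridge profile over the square window [-half_width, half_width]^2."""
    a = np.linspace(-half_width, half_width, n)
    xx, yy = np.meshgrid(a, a, indexing="ij")
    step = a[1] - a[0]
    return np.sum(np.exp(-0.5 * (xx + yy / k) ** 2)) * step**2

masses = [ridge_mass(L) for L in (5.0, 10.0, 20.0)]
print("ridge mass for growing windows:", masses)
```

The mass of the ridge profile keeps growing with the window size, so it cannot be normalized to one on ℝ², whereas the Gaussian profile has a strictly positive determinant.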

Contributor Information

Sripad Ram, Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX, USA.

E. Sally Ward, Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX, USA.

Raimund J. Ober, Email: ober@utdallas.edu, Department of Immunology, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Electrical Engineering, University of Texas at Dallas, Richardson, TX, USA.

References

Apostol TM. Mathematical analysis. Boston, USA: Addison Wesley Publishing Company; 1974.
Betzig E, Patterson GH, Sougrat R, Lindwasser OW, Olenych S, Bonifacino JS, Davidson MW, Lippincott-Schwartz J, Hess HF. Imaging intracellular fluorescent proteins at nanometer resolution. Science. 2006;313:1642–1645. doi: 10.1126/science.1127344.
Born M, Wolf E. Principles of Optics. Cambridge, UK: Cambridge University Press; 1999.
Van den Bos A. Parameter estimation for scientists and engineers. New York, USA: John Wiley and Sons; 2007.
Chao J, Ram S, Abraham A, Ward ES, Ober RJ. Resolution in three-dimensional microscopy. Optics Communications. 2009a;282:1751–1761. doi: 10.1016/j.optcom.2009.01.062.
Chao J, Ram S, Ward ES, Ober RJ. A 3D resolution measure for optical microscopy; IEEE International Symposium on Biomedical Imaging; 2009b. pp. 1115–1118.
Chao J, Ram S, Ward ES, Ober RJ. A comparative study of high resolution microscopy imaging modalities using a three-dimensional resolution measure. Optics Express. 2009c;17:24,377–24,402. doi: 10.1364/OE.17.024377.
Gordon MP, Ha T, Selvin PR. Single molecule high resolution imaging with photobleaching. Proceedings of the National Academy of Sciences USA. 2004;101:6462–6465. doi: 10.1073/pnas.0401638101.
Helstrom CW. The detection and resolution of optical signals. IEEE Transactions on Information Theory. 1964;10:275–287.
Hess ST, Girirajan TPK, Mason MD. Ultra-high resolution imaging by fluorescence photoactivation localization microscopy. Biophysical Journal. 2006;91:4258–4272. doi: 10.1529/biophysj.106.091116.
Kay SM. Fundamentals of statistical signal processing. New Jersey, USA: Prentice Hall PTR; 1993.
Lagerholm BC, Averett L, Weinreb GE, Jacobson K, Thompson NL. Analysis method for measuring submicroscopic distances with blinking quantum dots. Biophysical Journal. 2006;91:3050–3060. doi: 10.1529/biophysj.105.079178.
Lidke KA, Rieger B, Jovin TM, Heintzmann R. Superresolution by localization of quantum dots using blinking statistics. Optics Express. 2005;13:7052–7062. doi: 10.1364/opex.13.007052.
Moerner WE. New directions in single-molecule imaging and analysis. Proceedings of the National Academy of Sciences USA. 2007;104:12,596–12,602. doi: 10.1073/pnas.0610081104.
Ober RJ, Martinez C, Lai X, Zhou J, Ward ES. Exocytosis of IgG as mediated by the receptor, FcRn: An analysis at the single molecule level. Proceedings of the National Academy of Sciences USA. 2004a;101:11,076–11,081. doi: 10.1073/pnas.0402970101.
Ober RJ, Ram S, Ward ES. Localization accuracy in single molecule microscopy. Biophysical Journal. 2004b;86:1185–1200. doi: 10.1016/S0006-3495(04)74193-4.
O’Sullivan JA, Blahut RE, Snyder DL. Information-theoretic image formation. IEEE Transactions on Information Theory. 1998;44:2094–2123.
Qu X, Wu D, Mets L, Scherer NF. Nanometer-localized multiple single-molecule fluorescence microscopy. Proceedings of the National Academy of Sciences USA. 2004;101:11,298–11,303. doi: 10.1073/pnas.0402155101.
Ram S, Ward ES, Ober RJ. Beyond Rayleigh’s criterion: a resolution measure with application to single-molecule microscopy. Proceedings of the National Academy of Sciences USA. 2006a;103:4457–4462. doi: 10.1073/pnas.0508047103.
Ram S, Ward ES, Ober RJ. A stochastic analysis of performance limits for optical microscopes. Multidimensional Systems and Signal Processing. 2006b;17:27–58.
Rao CR. Linear statistical inference and its applications. New York, USA: Wiley; 1965.
Rohr K. Theoretical limits of localizing 3D landmarks and features. IEEE Transactions on Biomedical Engineering. 2007;54:1613–1620. doi: 10.1109/TBME.2007.902589.
Rudin W. Real and Complex Analysis. New York, USA: McGraw Hill; 1987.
Rust MJ, Bates M, Zhuang X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nature Methods. 2006;4:793–795. doi: 10.1038/nmeth929.
Santos A, Young IT. Model-based resolution: applying the theory in quantitative microscopy. Applied Optics. 2000;39:2948–2958. doi: 10.1364/ao.39.002948.
Shahram M, Milanfar P. Imaging below the diffraction limit: A statistical analysis. IEEE Transactions on Image Processing. 2004;13:677–689. doi: 10.1109/tip.2004.826096.
Smith ST. Statistical resolution limits and the complexified Cramer-Rao bound. IEEE Transactions on Signal Processing. 2005;53:1597–1609.
Stoica P, Marzetta TL. Parameter estimation problems with singular Fisher information matrices. IEEE Transactions on Signal Processing. 2001;49:87–90.
Strauss WA. Partial differential equations - an introduction. John Wiley and Sons; 1992.
Wong Y, Lin Z, Ober RJ. Limit of the accuracy of parameter estimation for moving single molecules imaged by fluorescence microscopy. IEEE Transactions on Signal Processing. 2011;59:895–908. doi: 10.1109/TSP.2010.2098403.
Young IT. Quantitative microscopy. IEEE Engineering in Medicine and Biology. 1996;15:59–66.
Zhang F. Matrix Theory. New York, USA: Springer Verlag; 1999.

[R1] Apostol TM. Mathematical analysis. Boston, USA: Addison Wesley Publishing Company; 1974. [Google Scholar]

[R2] Betzig E, Patterson GH, Sougrat R, Lindwasser OW, Olenych S, Bonifacino JS, Davidson MW, Lippincott-Schwartz J, Hess HF. Imaging intracellular fluorescent proteins at nanometer resolution. Science. 2006;313:1642–1645. doi: 10.1126/science.1127344. [DOI] [PubMed] [Google Scholar]

[R3] Born M, Wolf E. Principles of Optics. Cambridge, UK: Cambridge University Press; 1999. [Google Scholar]

[R4] Van des Bos A. Parameter estimation for scientists and engineers. New York, USA: John Wiley and Sons; 2007. [Google Scholar]

[R5] Chao J, Ram S, Abraham A, Ward ES, Ober RJ. Resolution in three-dimensional microscopy. Optics Communications. 2009a;282:1751–1761. doi: 10.1016/j.optcom.2009.01.062. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] Chao J, Ram S, Ward ES, Ober RJ. A 3D resolution measure for optical microscopy; IEEE International Symposium on Biomedical Imaging; 2009b. pp. 1115–1118. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Chao J, Ram S, Ward ES, Ober RJ. A comparative study of high resolution microscopy imaging modalities using a three-dimensional resolution measure. Optics Express. 2009c;17:24,377–24,402. doi: 10.1364/OE.17.024377. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] Gordon MP, Ha T, Selvin PR. Single molecule high resolution imaging with photobleaching. Proceedings of the National Academy of Sciences USA. 2004;101:6462–6465. doi: 10.1073/pnas.0401638101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Helstrom CW. The detection and resolution of optical signals. IEEE Transactions on Information Theory. 1964;10:275–287. [Google Scholar]

[R10] Hess ST, Girirajan TPK, Mason MD. Ultra-high resolution imaging by fluorescence photoactivation localization microscopy. Biophysical Journal. 2006;91:4258–4272. doi: 10.1529/biophysj.106.091116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Kay SM. Fundamentals of statistical signal processing. New Jersey, USA: Prentice Hall PTR; 1993. [Google Scholar]

[R12] Lagerholm BC, Averett L, Weinreb GE, Jacobson K, Thompson NL. Analysis method for measuring submicroscopic distances with blinking quantum dots. Biophysical Journal. 2006;91:3050–3060. doi: 10.1529/biophysj.105.079178. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] Lidke KA, Rieger B, Jovin TM, Heintzmann R. Superresolution by localization of quantum dots using blinking statistics. Optics Express. 2005;13:7052–7062. doi: 10.1364/opex.13.007052. [DOI] [PubMed] [Google Scholar]

[R14] Moerner WE. New directions in single-molecule imaging and analysis. Proceedings of the National Academy of Sciences USA. 2007;104:12,596–12,602. doi: 10.1073/pnas.0610081104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Ober RJ, Martinez C, Lai X, Zhou J, Ward ES. Exocytosis of IgG as mediated by the receptor, FcRn: An analysis at the single molecule level. Proceedings of the National Academy of Sciences USA. 2004a;101:11,076–11,081. doi: 10.1073/pnas.0402970101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Ober RJ, Ram S, Ward ES. Localization accuracy in single molecule microscopy. Biophysical Journal. 2004b;86:1185–1200. doi: 10.1016/S0006-3495(04)74193-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] O’Sullivan JA, Blahut RE, Snyder DL. Information-theoretic image formation. IEEE Transactions on Information Theory. 1998;44:2094–2123. [Google Scholar]

[R18] Qu X, Wu D, Mets L, Scherer NF. Nanometer-localized multiple single-molecule fluorescence microscopy. Proceedings of the National Academy of Sciences USA. 2004;101:11,298–11,303. doi: 10.1073/pnas.0402155101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] Ram S, Ward ES, Ober RJ. Beyond Rayleigh’s criterion: a resolution measure with application to single-molecule microscopy. Proceedings of the National Academy of Sciences USA. 2006a;103:4457–4462. doi: 10.1073/pnas.0508047103.

[R20] Ram S, Ward ES, Ober RJ. A stochastic analysis of performance limits for optical microscopes. Multidimensional Systems and Signal Processing. 2006b;17:27–58.

[R21] Rao CR. Linear statistical inference and its applications. New York, USA: Wiley; 1965.

[R22] Rohr K. Theoretical limits of localizing 3D landmarks and features. IEEE Transactions on Biomedical Engineering. 2007;54:1613–1620. doi: 10.1109/TBME.2007.902589.

[R23] Rudin W. Real and Complex Analysis. New York, USA: McGraw Hill; 1987.

[R24] Rust MJ, Bates M, Zhuang X. Sub-diffraction-limit imaging by stochastic optical reconstruction microscopy (STORM). Nature Methods. 2006;3:793–795. doi: 10.1038/nmeth929.

[R25] Santos A, Young IT. Model-based resolution: applying the theory in quantitative microscopy. Applied Optics. 2000;39:2948–2958. doi: 10.1364/ao.39.002948.

[R26] Shahram M, Milanfar P. Imaging below the diffraction limit: A statistical analysis. IEEE Transactions on Image Processing. 2004;13:677–689. doi: 10.1109/tip.2004.826096.

[R27] Smith ST. Statistical resolution limits and the complexified Cramer-Rao bound. IEEE Transactions on Signal Processing. 2005;53:1597–1609.

[R28] Stoica P, Marzetta TL. Parameter estimation problems with singular Fisher information matrices. IEEE Transactions on Signal Processing. 2001;49:87–90.

[R29] Strauss WA. Partial differential equations - an introduction. John Wiley and Sons; 1992.

[R30] Wong Y, Lin Z, Ober RJ. Limit of the accuracy of parameter estimation for moving single molecules imaged by fluorescence microscopy. IEEE Transactions on Signal Processing. 2011;59:895–908. doi: 10.1109/TSP.2010.2098403.

[R31] Young IT. Quantitative microscopy. IEEE Engineering in Medicine and Biology. 1996;15:59–66.

[R32] Zhang F. Matrix Theory. New York, USA: Springer Verlag; 1999.
