General Approach to First-Order Error Prediction in Rigid Point Registration

Andrei Danilchenko; J Michael Fitzpatrick

doi:10.1109/TMI.2010.2091513

. Author manuscript; available in PMC: 2015 Oct 15.

Published in final edited form as: IEEE Trans Med Imaging. 2010 Nov 11;30(3):679–693. doi: 10.1109/TMI.2010.2091513

General Approach to First-Order Error Prediction in Rigid Point Registration

Andrei Danilchenko ^1,^✉, J Michael Fitzpatrick ²

PMCID: PMC4607070 NIHMSID: NIHMS727176 PMID: 21075718

Abstract

A general approach to the first-order analysis of error in rigid point registration is presented that accommodates fiducial localization error (FLE) that may be inhomogeneous (varying from point to point) and anisotropic (varying with direction) and also accommodates arbitrary weighting that may also be inhomogeneous and anisotropic. Covariances are derived for target registration error (TRE) and for weighted fiducial registration error (FRE) in terms of covariances of FLE, culminating in a simple implementation that encompasses all combinations of weightings and anisotropy. Furthermore, it is shown that for ideal weighting, in which the weighting matrix for each fiducial equals the inverse of the square root of the cross covariance of its two-space FLE, fluctuations of FRE and TRE are mutually independent. These results are validated by comparison with previously published expressions and by simulation. Furthermore, simulations for randomly generated fiducial positions and FLEs are presented that show that correlation is negligible correlation coefficient < 0.1 in the exact case for both ideal and uniform weighting (i.e., no weighting), the latter of which is employed in commercial surgical guidance systems. From these results we conclude that for these weighting schemes, while valid expressions exist relating the covariance of FRE to the covariance of TRE, there are no measures of the goodness of fit of the fiducials for a given registration that give to first order any information about the fluctuation of TRE from its expected value and none that give useful information in the exact case. Therefore, as estimators of registration accuracy, such measures should be approached with extreme caution both by the purveyors of guidance systems and by the practitioners who use them.

Index Terms: Accuracy, anisotropic, error analysis, fiducial, registration, surgical guidance, weighting

I. Introduction

In 2009, a new method of first-order analysis of the point-registration problem was introduced [2]. The method was developed to facilitate the statistical analysis of a phenomenon that had been observed years earlier, first by Steinmeier using phantoms [3] and more recently by Woerdeman in vivo [4]. These investigators attempted to establish for several fiducial-based, surgical navigation systems, a correlation between the displayed estimates of accuracy and the true accuracy. Steinmeier found no correlation, and Woerdeman observed a negligible correlation coefficient of only 0.08. Thus, lower than normal estimated accuracies were not accompanied by lower than normal true accuracies, nor were there correlations for higher than normal values. The navigational systems’ estimates of accuracy are based on the goodness of fit of the fiducial points after registration, and the surprising result that these estimates seem to be unrelated to truth, suggests that there is an underlying problem with this method of error prediction. The 2009 work [2] investigated this problem for a special case in which expected error in fiducial localization is isotropic and identical for all fiducials, and it proved to first order that fluctuations in registration accuracy are statistically independent of the fluctuations of any estimate of that accuracy based on the goodness of fit. In the present work, we extend this analysis to the general case. We show that, when ideal weighting is used for each fiducial, this independence is maintained, and we show by simulations that, without weighting, the correlation between them is on the order of that low value observed by Woerdeman. In addition, we provide new expressions for error statistics that are more general than those previously provided. We begin in Section II by providing background on, and precise definitions of, error in these navigation systems. In Section III we present our new approach and derive independence and the new statistical expressions. In Sections IV and V we compare our results to previously published results and to results from computer simulations. In Sections VI and VII we discuss our theory and its results and present our conclusions.

II. Background

Fiducial-based navigation has become a ubiquitous adjunct to surgery when rigid anatomy—bone or soft tissue constrained by bone—is involved. The major commercial guidance systems—the StealthStation Surgical Navigation System (Medtronic, Inc., Minneapolis, MN) and the VectorVision System (BrainLAB AG, Feldkirchen, Germany), each provide it as an option in addition to surface-based navigation, and both options incorporate rigid point registration in their algorithms to determine the position of the surgeon’s tracked probe or tracked instrument. The registration approach analyzed in the following sections is employed by both of these systems. Fiducials, often called “markers,” are provided that typically adhere to the skin, but the selection of anatomical “fiducials” are supported by these systems as well. Both fiducial and surface registration approaches are often called “frameless” stereotaxy, in reference to the cumbersome stereotactic frame that they are designed to replace. These frameless approaches are offered by virtually every surgical center as an adjunct to neurosurgery, spine surgery, and orthopedic surgery, and they are provided as well by radiation oncology centers as an adjunct to radiosurgery.

Because fiducials by their nature are discrete objects, they provide positional information only at isolated regions of the anatomy. Information about the position or movement of anatomy outside these regions can be inferred only if all these regions move as a single rigid object. As a result, fiducial-based navigation systems employ only rigid transformations during the registration process. To the extent that the anatomy is non-rigid, rigid transformations are inappropriate, but for surgical resections involving bone, such as a vertebra or a femur, or tissue and bone that move as a single rigid object, such as the head, rigidity is a reasonable assumption. Failure of the anatomy to remain rigid will cause systems’ predictions of fiducial-registration accuracy, such as those studied by Steinmeier and Woerdeman, to fail, but nonrigidity is not the source of failure that we present here. Indeed Steinmeier’s experiments involved only rigid phantoms. Instead we confine our attention to those applications in which rigidity is a valid assumption.

For rigid anatomy, the image-guidance problem is reduced to the determination of the rigid transformation that the anatomy has undergone between image acquisition and intervention. That transformation can be estimated via fiducials that are attached to the anatomy before imaging and remain in place through the registration procedure. The position of each fiducial is localized in both image space and physical space, and then a transformation from image space to physical space is usually chosen such that it minimizes the fiducial registration error (FRE), where, in the simplest method, which is employed by commercial systems, FRE is the root-mean-square of distances between corresponding fiducials after the registration, i.e., the distance between the localized position of each fiducial as transformed from image space to physical space and the position of that corresponding fiducial localized in physical space. In the general case, the displacement between the transformed image point and the physical point is multiplied by a “weighting” matrix, which may be different for each fiducial, and FRE is the root-mean-square of the lengths of the resulting vectors. The weighting matrix is used in cases when some information about the nature of fiducials localization procedure is available. The simplest method described above is equivalent to uniform weighting, and it is employed by commercial systems in the absence of such information. Because of fiducial localization error (FLE), the registration is invariably flawed to some degree, and FRE is a readily available measure of registration error that serves as the most commonly used indicator of the goodness of fit of the of the corresponding points.

Sibson in 1979 investigated the statistical relation between FRE and FLE for the case of isotropic FLE and provided an expression that can be used to relate their expected root-mean-square values as follows [5]:

〈 {FRE}^{2} 〉 = (1 - \frac{2}{N}) 〈 {FLE}^{2} 〉

(1)

where 〈·〉 means “expected value of.” Expressions for FRE for each individual fiducial have also been derived [10]. FRE is an easily measured quantity, but it is not as important as target registration error, TRE, which is the displacement from its true position of a registered point not used as a fiducial¹. The target error TRE(r) at a point r of interest to the surgeon is the true measure of registration error. In 1998 and 2001, the statistical relation between TRE and FLE was investigated for isotropic FLE and expressions were provided relating their expected root-mean-square values [6], [7]

〈 {∣ TRE (r) ∣}^{2} 〉 = \frac{1}{N} (1 + \frac{1}{3} \sum_{k = 1}^{3} \frac{d_{k}^{2}}{f_{k}^{2}}) 〈 {FLE}^{2} 〉

(2)

where d_k is the distance of x from principal axis k of the fiducial configuration, $f_{k}^{2}$ is the mean of the squared distances of the fiducials from that axis. Equation (1) can be used to estimate the distribution of FLE from measurements of FRE for many cases. With this estimate available, (2) can then be exploited to predict TRE based on the fiducial configuration when fiducials are used (a) on a patient, (b) on a coordinate reference frame attached to a patient, and (c) on a surgical probe [8]. Equation (2) is not appropriate for FLE that is anisotropic, which means that it varies with direction, inhomogeneous, which means that it is different for different points. For anisotropic and inhomogeneous FLE, more complex expressions than that of (2) have recently been derived by Wiles [9], [23] and Moghari [11].

Since both 〈FRE²〉 and 〈|TRE(r)|²〉 are proportional to 〈FRE²〉, it might be expected that for a given registration, if FRE is small, then |TRE(r)| will be small and when one is large the other will be large. While it may seem intuitive to expect the size of TRE to be correlated with that of FRE, it is easy to construct counter-examples, as is shown by Fig. 1. That figure shows two examples. For one of them FRE is zero while |TRE(r)| is large, and for the other |TRE(r)| is zero while FRE is large. The key difference is that in the first example the set of FLEs are equivalent to a rigid transformation, while in the second example, they are not. Fig. 1(a) shows image space, while (b) and (d) show physical space. The localizations are assumed to be perfect in image space. The “rigid” set of localization errors occur in (b), while the errors in (d) cannot be represented exactly or even approximately by any rigid transformation.

Fig. 1 shows schematically that, despite the fact that 〈FRE²〉 and 〈|TRE(r)|²〉 are related functionally through (1) and (2), TRE is not necessarily related to FRE for specific registration instances. Thus, the fact that their means are related does not imply that their variations around those means are related. It is easy to construct such examples, but no example or set of examples can answer the question, “Are the fluctuations² of FRE and TRE about their means correlated?” In the next sections we will show that, for the case of small FLE, when either of the two most common weighting schemes is employed, the answer to this question is, “No.”

We begin by proving this answer mathematically to first order in localization error for the case of so-called “ideal weighting”. That proof requires a statistical analysis of a linear model of the problem of rigid point registration. Recently a new approach [2] for such an analysis was developed for the special case of isotropic, homogeneous FLE with uniform weighting (i.e., un-weighted). In the next section, we extend that approach to the general case of anisotropic, inhomogeneous FLE with arbitrary weighting, and we use the approach both to investigate the dependence of FRE and TRE and to provide unified formulas for their respective covariances that are more general than any previous expressions.

III. Derivations

Our derivations are based on a linearization of the rigid, point registration problem. We begin with a statement of the registration problem, followed by the linearization of the problem for small FLE.

A. Registration Problem

The fundamental problem of rigid, fiducial registration is to find the rotation matrix R and translation vector t that minimize the expression

{FRE}^{2} = \sum_{i = 1}^{N} {∣ W_{i} (R (x_{i} + Δ x_{i}) + t - (y_{i} + Δ y_{i})) ∣}^{2}

(3)

where FRE is the weighted fiducial registration error x_i,y_i, are 3×1 vectors representing corresponding points in two spaces, where at least three of the x_i are not collinear, Δx_i and Δy_i represent localization errors in the two spaces, W_i is a 3×3 non-singular weighting matrix, which may be a function of R (see the discussion of “ideal weighting” at the beginning of Section III-G). Clearly the solution, R, t, obtained by minimizing FRE² in (3) is unchanged if all the W_i are multiplied by the same factor w, while FRE² itself is multiplied by w². Thus, size of FRE as defined by (3) is somewhat arbitrary. For example, the general form for the uniform weighting scheme employed by commercial systems is W_i = wI, where I is the 3×3 identity matrix, and for this weighting FRE equals times the root-means square of the distance between fiducials after registration. We can remove this arbitrary factor of w from FRE by stipulating that the W_i be normalized such that $\sum_{1}^{N} {∣ W_{i} ∣}^{2} = \sum_{1}^{N} trace (W_{i}^{t} W_{i}) = 3$ . With this stipulation, uniform weighting requires that W_i = N^−1/2I, and FRE recaptures its meaning as the unweighted root-mean square of the fiducial fitting errors and satisfies (1).

The rotation matrix is restricted to be orthogonal, which means that R^tR = I. The points, y_i, arise from the application of the true, but unknown, rigid transformation, R⁽⁰⁾, t⁽⁰⁾, to the x, as follows:

y_{i} = R^{(0)} x_{i} + t^{(0)} .

(4)

R⁽⁰⁾, t⁽⁰⁾ is the transformation that is sought by the minimization of FRE in (3), but because of fiducial localization error, the transformation that minimizes FRE, namely R, t, will differ from the true transformation R⁽⁰⁾, t⁽⁰⁾. We model the Δx_i and Δy_i as random, independent vectors of fiducial localization error, with each of their three components being normally distributed with zero means and covariances $\sum_{i}^{(1)}$ and $\sum_{i}^{(2)}$ , respectively. Thus, the localization errors in the two spaces may be anisotropic and inhomogeneous. While these statistics are known, it is assumed that no other information about the errors themselves is available. If any relationships among the specific errors for a given set of localizations or if any components of any of the errors were known, that information could be employed in the registration process, and the derivations and proofs below would need to be modified. It is helpful to rewrite (3) as

{FRE}^{2} = \sum_{i = 1}^{N} {∣ W_{i} ({R x}_{i} + t - y_{i} - Δ ξ_{i}) ∣}^{2}

(5)

where Δξ_i = Δy_i−RΔx_i, is the “two-space” localization error vector for point-pair i, and its elements are normally distributed with zero means and covariances $R \sum_{i}^{(1)} R^{t} + \sum_{i}^{(2)}$ . We define $\sum_{i} = R \sum_{i}^{(1)} R^{t} + \sum_{i}^{(2)}$ to be the “two-space” covariance of FLE. Once the rotation and translation that minimize FRE² have been found, they can be used to determine the vector TRE of target registration error at any point r that is not used as a fiducial point

TRE (r) = R r + t - (R^{(0)} r + t^{(0)}) .

(6)

B. Linearization

Sibson showed how derivations of error statistics for this problem could be made tractable by (a) assuming that FLE is small, (b) approximating to first-order in FLE, (c) moving the origin of the coordinate system to the centroid of the x_i, (d) reorienting the coordinate system along the principal axes of the x_i, and (e) separating the rotational and translational problems [5]. He used this approach to investigate FRE; we and others have used it to investigate TRE [6], [7], [9], [11] and FRE [10], [19]. For our present investigation, we have developed a new approach. We employ (a) and (b), but we omit (c), (d), and (e). Thus, we solve to first order in FLE, but we retain the laboratory coordinate system, and we solve for both the rotation and the translation simultaneously. Solving the first-order problem allows us to cast the problem as a simple, unconstrained set of 3N linear algebraic equations in six unknowns of the form: Cq = e, where C is a 3N×6 matrix, e is a 3N×1 vector, and q is a 6×1 vector whose elements are Δθ₁, Δθ₂, Δθ₃, Δt₁, Δt₂, Δt₃, where Δθ_k, is an angle of rotation (in radians) about axis k, and Δt_k, is a translation along that axis.

We begin by defining $x_{i}^{(0)} = R^{(0)} x_{i}$ , and Δt = t − t⁽⁰⁾, which allows us to transform the vector whose squared length appears in the summation in (3) as follows:

\begin{array}{l} R (x_{i} + Δ x_{i}) + t - (y_{i} + Δ y_{i}) = R (R^{{(0)}^{t}} R^{(0)}) (x_{i} + Δ x_{i}) + t - R^{(0)} x_{i} - t^{(0)} - Δ y_{i} \\ = ({R R}^{{(0)}^{t}} - I) R^{(0)} x_{i} + Δ t - (Δ y - R^{(0)} Δ x_{i}) + ({R R}^{{(0)}^{t}} - I) R^{(0)} Δ x_{i} \\ = Δ {R x}_{i}^{(0)} + Δ t - Δ ξ_{i}^{(0)} + Δ {R R}^{(0)} Δ x_{i} \end{array}

(7)

where we have used (4) and have defined

Δ R = {R R}^{{(0)}^{t}} - I

(8)

and

Δ ξ_{i}^{(0)} = (Δ y_{i} - R^{(0)} Δ x_{i}) .

(9)

Using (7) in (3) gives us

{FRE}^{2} = \sum_{i = 1}^{N} {| W_{i} (Δ {R x}_{i}^{(0)} + Δ t - Δ ξ_{i}^{(0)} + Δ {R R}^{(0)} Δ x_{i}) |}^{2} .

(10)

From assumption (a), we know that the components of $Δ ξ_{i}^{(0)}$ and Δx_i, will be small. If they were all zero, then (10) tells us that we could achieve a perfect minimum of zero for (3) by setting ΔR and Δt to zero. For finite but small $Δ ξ_{i}^{(0)}$ and Δx_i, we see from this same equation that, when FRE is minimized, the elements of both ΔR and Δt will be of first order in $Δ ξ_{i}^{(0)}$ and Δx_i. We note that the elements of ΔR^tΔR can be expected to be small relative to those of ΔR and that the elements of ΔRR⁽⁰⁾Δx_i will be small relative to those of $Δ ξ_{i}^{(0)}$ , because they are all of second degree in the elements of $Δ ξ_{i}^{(0)}$ and Δx_i, which themselves can be expected to be small. Following Sibson, we now make the assumption that the $Δ ξ_{i}^{(0)}$ and Δx_i are small enough to render the elements of ΔR^tΔR negligible in comparison to the elements of ΔR and the elements of ΔRR⁽⁰⁾Δx_i negligible in comparison to the elements of $Δ ξ_{i}^{(0)}$ . We note that

\begin{array}{l} Δ R^{t} Δ R = {({R R}^{{(0)}^{t}} - I)}^{t} ({R R}^{{(0)}^{t}} - I) \\ = I - R^{(0)} R^{t} - {R R}^{{(0)}^{t}} + I \\ = - (Δ R^{t} + Δ R) \end{array}

(11)

where we have used the orthogonality of rotation matrices to go from the first line to the second. From (11), we see that, if the elements of ΔR^tΔR are indeed negligible in comparison to the elements of ΔR, then ΔR^t + ΔR ≈ 0, which means that ΔR is antisymmetric, i.e., ΔR^t= −ΔR. Thus, the nonlinear restriction R^tR = I is replaced by a linear one.

This relaxed constraint is essential to the original approach of Sibson on the statistics of FRE [5], the approaches of the subsequent work on the statistics of TRE [6], [7], [9], and our current approach. However, we show in simulations below that our results hold to an excellent approximation even without linearization. The difference being that for the linearized equations we can prove that they hold for all configurations, all $\sum_{i}^{(1)}$ and all $\sum_{i}^{(2)}$ , whereas for the nonlinear version, we can show agreement only for those cases that are simulated. Using our definitions of ΔR and Δt in (6) gives

\begin{array}{l} TRE (r) = ({R R}^{{(0)}^{t}} - I) R^{(0)} r + t - t^{(0)} \\ = Δ {R R}^{(0)} + Δ t \end{array}

(12)

where we have defined r⁽⁰⁾ = R⁽⁰⁾r (similarly to our definition above of $x_{i}^{(0)}$ ). We note finally, that, to first order $\sum_{i} = R^{(0)} \sum_{i}^{(1)} R^{{(0)}^{t}} + \sum_{i}^{(2)}$ .

C. Some Useful Results From Linear Algebra

The minimization of (10), subject to ΔR^t= −ΔR with ΔRR⁽⁰⁾Δx_i ignored in comparison to $Δ ξ_{i}^{(0)}$ is equivalent to finding the least-squares solution to the following over-determined set of 3N linear equations in the six unknowns, Δθ₁, Δθ₂, Δθ₃, Δt₁, Δt₂, Δt₃

{(W_{i} Δ {R x}_{i}^{(0)})}_{j} + {(W_{i} Δ t)}_{j} = {(W_{i} Δ ξ_{i}^{(0)})}_{j}, i = 1, 2, \dots N, j = 1, 2, 3,

(13)

where

Δ R = [\begin{matrix} 0 & - Δ θ_{3} & Δ θ_{2} \\ Δ θ_{3} & 0 & - Δ θ_{1} \\ - Δ θ_{2} & Δ θ_{1} & 0 \end{matrix}]

(14)

and the subscript j enumerates the components of the three-dimensional vectors.

The meaning of ΔR can be understood by adding I to both sides of its definition in (8) and multiplying on the right by R⁽⁰⁾. The result, (I+ΔR)R⁽⁰⁾ = R shows that the rotation matrix that we seek by minimizing (3) is the equivalent to the true rotation followed by a “rotation” I+ΔR. Thus, I+ΔR approximates the rotation away from the truth caused by fiducial localization error. If localization error were zero, then ΔR would also equal zero. Our assumption that localization error is merely small results in the approximation, ΔR^t = −ΔR. The displacement Δr of a point r caused by that small rotation is Δr = (I+ΔR)r − r = ΔRr, which, because ΔR^t is equal to −ΔR, can be written in the form of the familiar cross product for small rotations, Δr = ΔRr = Δθ × r, where Δθ = (Δθ₁, Δθ₂, Δθ₃). This cross product produces the exact expression for Δr in the limit of small rotation angle $∣ Δ θ ∣ = {(Δ θ_{1}^{2} + Δ θ_{2}^{2} + Δ θ_{3}^{2})}^{1 / 2}$ . Equation (14) can also be derived by representing the exact rotation I+ΔR in terms of Euler angles, expanding each sine and each cosine in that representation via a Taylor’s series in the angles Δθ₁, Δθ₂, and Δθ₃, and dropping all but zeroth- and first-order terms from those expansions. The first-order terms all equal one and give rise to the identity I; the first-order terms produce the expression given for ΔR in (14).

We now define the 3N×6 matrix C

C = [\begin{matrix} W_{1} X_{1}^{(0)} & W_{1} \\ W_{2} X_{2}^{(0)} & W_{2} \\ \dots & \dots \\ W_{N} X_{N}^{(0)} & W_{N} \end{matrix}]

(15)

where

X_{i}^{(0)} = [\begin{matrix} 0 & x_{i 3}^{(0)} & - x_{i 2}^{(0)} \\ - x_{i 3}^{(0)} & 0 & x_{i 1}^{(0)} \\ x_{i 2}^{(0)} & - x_{i 1}^{(0)} & 0 \end{matrix}]

(16)

and $x_{i k}^{(0)} = the k th component of x_{i}^{(0)}$ . By defining the 3N×1 vector e as follows:

e_{3 (i - 1) + j} = {(W_{i} Δ ξ_{i}^{(0)})}_{j}, i = 1, \dots, N, j = 1, 2, 3

(17)

and using our definition of C, we can transform (13) into

C q = e .

(18)

With (18), we have transformed the problem into a generic linear-algebra problem, and we need to establish some properties of the least-squares solution to that problem. We begin by considering the singular-value decomposition [15] of C

C = U Λ V^{t}

(19)

where U is a 3N×3N orthogonal matrix, Λ is a 3N×6 diagonal matrix whose diagonal elements Λ_ii are the singular values of C, and V is a 6×6 orthogonal matrix³. We will require that the six singular values Λ_ii of C all be nonzero. This requirement is equivalent to the requirement that the columns of C be independent. We can prove that they are independent via proof by contradiction. Thus, we assume instead that the columns C of are linearly dependent, which means that for some set of six constants, a₁, a₂, …, a₆, $\sum_{i = 1}^{6} a_{i} C_{i} = 0$ , where each C_i is a column of C. Because each column is of length 3N, this constraint is equivalent to 3N equations, which have the form

\begin{matrix} x_{i 2}^{(0)} a_{3} - x_{i 3}^{(0)} a_{2} = a_{4} \\ x_{i 3}^{(0)} a_{1} - x_{i 1}^{(0)} a_{3} = a_{5} \\ x_{i 1}^{(0)} a_{2} - x_{i 2}^{(0)} a_{1} = a_{6} \end{matrix}} for i = 1, 2, \dots, N .

(20)

These 3N equations are in turn equivalent to the N vector equations $x_{i}^{(0)} \times a_{1, 2, 3} = a_{4, 5, 6}$ , where a_α,β,γ is the 3×1 column vector whose elements are a_α, a_β, a_γ. In order for all N of these cross products to be equal to the same constant vector N, all points $x_{i}^{(0)}$ must be collinear. We note from Section III-A above that at least three of the points x_i must not be collinear and that rotating and translating these points to produce the points $x_{i}^{(0)}$ does not change the linearity of the points. Thus, our assumption that the columns of C are dependent has led to a contradiction. Therefore, the columns of C are in fact independent, and therefore the singular values of C are all nonzero.

The least-squares solution of (18) is given as follows [15],

q_{min} = C^{+} e

(21)

where

C^{+} = V Λ^{+} U^{t}

(22)

and Λ⁺ is a 6×3N diagonal matrix for which $Λ_{i i}^{+} = Λ_{i i}^{- 1}$ . We note that $Λ_{i i}^{+}$ are well defined because, as we have shown above, the $Λ_{i i}^{+}$ are nonzero. The elements of the 3N×1 vector

f_{min} = e - C q_{min}

(23)

are the residual errors in the solution and can be rewritten as follows:

f_{min} = (I - {CC}^{+}) e

(24)

where is the 3N×3N identity matrix. Using (19) and (22) in (24), we have

\begin{array}{l} f_{min} = (I - {CC}^{+}) e \\ = (I - U Λ Λ^{+} U^{t}) e \\ = U (I - Λ Λ^{+}) U^{t} e . \end{array}

(25)

For i, j = 1, …, 3N, we have ${(Λ Λ^{+})}_{i j} = \sum_{k = 1}^{6} Λ_{i k} Λ_{k j}^{+} = \sum_{k = 1}^{6} Λ_{i k} Λ_{k j}^{- 1}$ , from which we find that the first six diagonal elements of ΛΛ⁺ are equal to 1 and the last 3N − 6 are equal to 0. Furthermore, we see that the first six diagonal elements of I − ΛΛ⁺ are equal to 0 and the last 3N − 6 are equal to 1. Using these diagonal elements of I − ΛΛ⁺ in the last line of (25), and defining

\tilde{e} = U^{t} e

(26)

we find that

{(f_{min})}_{k} = \sum_{j = 1}^{6} U_{k j} \cdot 0 \cdot {\tilde{e}}_{j} + \sum_{j = 7}^{3 N} U_{k j} \cdot 1 \cdot {\tilde{e}}_{j} = \sum_{j = 7}^{3 N} U_{k j} {\tilde{e}}_{j} .

(27)

Thus we see that each element of f_min is a function of only the last 3N − 6 elements of ẽ.

D. FRE in Terms of ẽ

We now return to the registration problem. We note that the elements of f_min in (27) are the components of the individual fiducial registration error vectors, FRE_i, of those points. Thus, (f_min)₃₍_i₋₁₎₊_j = FRE_ij for i = 1, …, N, j = 1, 2, 3. Therefore, each component of FRE_i for each fiducial point is a function of only the last 3N − 6 elements of ẽ.

E. TRE in Terms of ẽ

Our expression for TRE(r) in (12) can be transformed into a simple matrix-times-vector multiplication

δ = D q

(28)

by defining δ_j = TRE(r)_j, j = 1, 2, 3 and defining the 3×6 matrix

D = [\begin{matrix} 0 & r_{3}^{(0)} & - r_{2}^{(0)} & 1 & 0 & 0 \\ - r_{3}^{(0)} & 0 & r_{1}^{(0)} & 0 & 1 & 0 \\ r_{2}^{(0)} & - r_{1}^{(0)} & 0 & 0 & 0 & 1 \end{matrix}]

(29)

where $r_{k}^{(0)} = the k th component of r^{(0)}$ . In (28), we use the least-squares solution of (18), as given by (21), and the decomposition of as given by (22), to get

δ_{min} = {D q}_{min} = D C^{+} e = D V Λ^{+} U^{t} e = D V Λ^{+} \tilde{e} .

(30)

Thus,

{(δ_{min})}_{k} = \sum_{j = 1}^{6} \sum_{l = 1}^{3 N} {(D V)}_{k j} Λ_{j l}^{+} {\tilde{e}}_{l} = \sum_{j = 1}^{6} {(D V)}_{k j} Λ_{j j}^{- 1} {\tilde{e}}_{j}

(31)

where, in the second step, we have used the explicit form of Λ⁺ given just after (22). From (31) we see that each component of TRE(r) is a function of only the first 6 elements of ẽ.

F. General Formulation for the Covariances of FRE and TRE

Equation (31) can be employed to derive a general expression for Σ_FRE, the covariance of FRE, and Σ_FRE, the covariance of TRE. First, we define two 3N×3N block-diagonal matrices: a covariance matrix Σ for the entire set of FLEs and a weighting matrix W for the entire set of fiducial points. Each matrix is nonzero only within 3×3 blocks on the diagonal. Specifically, Σ₃₍_i₋₁₎₊_j_,3(_i₋₁₎₊_j_′ = (Σ_i)_jj_′ and W₃₍_i₋₁₎₊_j_,3(_i₋₁₎₊_j_′ = (W_i)_jj_′, for i = 1 … N, j = 1, 2, 3, j′ = 1, 2, 3, and all other elements of each matrix equal zero. With these definitions it can be seen from the definition e of given by (17) that

〈 {e e}^{t} 〉 = W \sum W^{t} .

(32)

Our expression for Σ_FRE can be gotten by noting that ${(\sum_{FRE})}_{k l} = 〈 {(f_{min})}_{k} {(f_{min}^{t})}_{l} 〉$ and then using (27), followed by (26) and (32), to get

\begin{array}{l} {(\sum_{FRE})}_{k l} = \sum_{j, j^{'} = 7}^{3 N} U_{k j} U_{l j^{'}} 〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉 \\ = \sum_{j, j^{'} = 7}^{3 N} U_{k j} U_{l j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} . \end{array}

(33)

The summation over three diagonal elements corresponding to the same marker provides an expression for an individual weighted FRE of that marker

\begin{array}{l} 〈 {FRE}_{i}^{2} 〉 = \sum_{k = 3 i - 2}^{3 i} \sum_{j, j^{'} = 7}^{3 N} U_{k j} U_{k j^{'}} 〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉 \\ = \sum_{k = 3 i - 2}^{3 i} \sum_{j, j^{'} = 7}^{3 N} U_{k j} U_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} . \end{array}

(34)

The trace of Σ_FRE provides a general expression for 〈FRE²〉

\begin{array}{l} 〈 {FRE}^{2} 〉 = \sum_{k = 1}^{3 N} {(\sum_{FRE})}_{k k} \\ = \sum_{k = 1}^{3 N} \sum_{j, j^{'} = 7}^{3 N} U_{k j} U_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} \\ = \sum_{j = 7}^{3 N} {(U^{t} W \sum W^{t} U)}_{j j} . \end{array}

(35)

The expressions in (33) through (35) give statistics for the weighted FRE. Statistics for unweighted FRE (but with weighting used in the registration step), which we designate by using lower case, fre, are given by replacing U_jk and U_kj_′ by (W⁻¹U)_kj and (W⁻¹U)_kj_′ in each of these equations, resulting in

{(\sum_{fre})}_{k l} = \sum_{j, j^{'} = 7}^{3 N} {(W^{- 1} U)}_{k j} {(W^{- 1} U)}_{l j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}}

(36)

〈 {fre}_{i}^{2} 〉 = \sum_{k = 3 i - 2}^{3 i} \sum_{j, j^{'} = 7}^{3 N} {(W^{- 1} U)}_{k j} {(W^{- 1} U)}_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}}

(37)

and

〈 {fre}^{2} 〉 = \sum_{j, j^{'} = 7}^{3 N} {(U^{t} W^{- 2} U)}_{j j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} .

(38)

For isotropic weighting, i.e., W_i = w_iI, it can with some manipulation be seen that $〈 {FRE}_{i}^{2} 〉 = w_{i} 〈 {fre}_{i}^{2} 〉$ .

Our expression for Σ_TRE can be gotten by remembering that in TRE(r)_j = δ_j (31). Therefore,

\begin{array}{l} {(\sum_{TRE})}_{k l} = 〈 {(δ_{min})}_{k} {(δ_{min})}_{l} 〉 \\ = \sum_{j, j^{'} = 1}^{6} {(D V)}_{k j} {(D V)}_{l j^{'}} \frac{〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉}{Λ_{j j} Λ_{j^{'} j^{'}}} . \end{array}

(39)

By applying (26) and (32) to 〈ẽ_jẽ_j_′〉, we achieve the following result:

{(\sum_{TRE})}_{k l} = \sum_{j, j^{'} = 1}^{6} {(D V)}_{k j} {(D V)}_{l j^{'}} \frac{{(U^{t} W \sum W^{t} U)}_{j j^{'}}}{Λ_{j j} Λ_{j^{'} j^{'}}} .

(40)

The trace of Σ_TRE provides a general expression for 〈TRE²〉

\begin{array}{l} 〈 {TRE}^{2} 〉 = \sum_{k = 1}^{3} {(\sum_{TRE})}_{k k} \\ = \sum_{k = 1}^{3} \sum_{j, j^{'} = 1}^{6} {(D V)}_{k j} {(D V)}_{k j^{'}} \times \frac{{(U^{t} W \sum W^{t} U)}_{j j^{'}}}{Λ_{j j} Λ_{j^{'} j^{'}}} . \end{array}

(41)

It should be noted that (33) through (41) are completely general. Thus they are appropriate both for general fiducial localization error (i.e., possibly inhomogeneous and possibly anisotropic) and for general weighting (i.e., possibly not ideal). We have included in Fig. 4 in the Appendix an implementation of (33), (35), (40), and (41) written in Matlab (MathWorks, Inc., Natick, MA). This function is likewise completely general. While these equations have all been derived specifically for three-dimensional space, they all hold for the 2-D case as well with only minor changes to accommodate the reduced dimensionality. The details are given in Appendix C of the Appendix.

Fig. 4 — Computer code to implement the derived formulas. The code is a function written in Matlab (The Mathworks, Natick, MA). Its input and output parameters are described in comments (lines beginning with a percent sign) and in the text.

G. Statistical Independence of FRE and TRE for Ideal Weighting

In Fig. 1, we show examples for which FRE is zero and TRE is nonzero. It can easily be seen from (5) that the number of such examples is infinite. Indeed, whenever there exists a rigid transformation R, t for which Rx_i + t = y_i + Δξ_i for all i, then FRE is zero, but unless all the Δξ_i are zero, TRE will be nonzero. The simplest example is for the localization errors in a given space to be equivalent to a global translation, i.e., all the Δx_i are equal to the same nonzero vector v_x, and all the Δy_i are equal to some other nonzero vector v_y ≠ v_x. As a result, each Δξ_i = v_y − v_x, the rigid transformation that minimizes FRE² is, R = R⁽⁰⁾, t = t⁽⁰⁾ + v_y − v_x, and TRE(r) = v_y − v_x ≠ 0. Substituting these values into (5) reveals that each of the terms in the summation is zero. Thus, each FRE_i = 0. There is likewise an infinity of examples in which TRE is zero and FRE is nonzero. These examples occur when no rigid transformation can reduce the effect of the localization errors, and the transformation that minimizes (5) is the identity. One such example is also shown in Fig. 1. These examples suggest that FRE may not be a good predictor of TRE, but they provide little insight into their statistical relationship. In this section we examine the statistical relationship between FRE and TRE for our model of independent, normally distributed localization errors when an important weighting scheme—“ideal weighting”—is employed, and we show that for this weighting scheme FRE and TRE are statistically independent.

Ideal weighting is important in the rigid point registration problem because it maximizes the probability that the resulting transformation is the true one. We derive ideal weighting in Appendix I-A of the Appendix and show that it has the form $W_{i} = w \times \sum_{i}^{- 1 / 2} = w \times {(R \sum_{i}^{(1)} R^{t} + \sum_{i}^{(2)})}^{- 1 / 2}$ , where w is a constant, which for normalized weighting is equal to $3^{1 / 2} {(\sum_{1}^{N} trace (\sum_{i}^{- 1}))}^{- 1 / 2}$ . Ideal weighting is not new. It was introduced into point registration as early as 1998 by Ohta and Kanatani [16], and it was employed by West et al. in 2001 [17] and Moghari et al.. in 2009 [11], [19]. In both of these 2009 papers a first-order registration equation equivalent to (18) was derived for the special case in which ideal weighting is employed and $\sum_{i}^{1} = 0$ ⁴. As mentioned at the end of Section III-B, to first order R can be replaced by R⁽⁰⁾ in the expression for Σ_i. This approximation is adequate for our purposes, and with it we can see that the R dependence of ideal weighting does not affect the first-order character of (13). Using this weighting in (32) gives us

〈 {e e}^{t} 〉 = W \sum W^{t} = w^{2} \times \sum^{- 1 / 2} \sum \sum^{- 1 / 2} = w^{2} I .

(42)

Since 〈ee^t〉 is diagonal, the elements e_i are uncorrelated; since they are uncorrelated and also normally distributed, they are statistically independent [2], [18]. Because the diagonal elements equal w², the variance of each e_i is w². As shown in Appendix I-B, since U is orthogonal, the elements of ẽ are also independent and normally distributed with variance equal to w². In Section III-D, we found that each component of FRE_i for each fiducial point, which is also an element of f_min, is a function of only the last 3N − 6 elements of ẽ, while in Section III-E we found that each component of TRE(r), which is also a component of δ_min, is a function of only the first six elements of ẽ. Since the elements of ẽ are independent of each other, any functions of nonoverlapping subsets of them are independent each other as well. It is important to note that this conclusion depends crucially on the fact that all 3N of the elements of ẽ are mutually independent. That independence is a consequence of ideal weighting.

Therefore, for the case of ideal weighting, every component of FRE_i of every one of the N fiducial points is statistically independent of any component of the vector TRE(r) at any point r. Thus fluctuations in TRE(r) are, in our first-order approximation, independent of fluctuations of any and all functions of the goodness of fit! We note that the overall fiducial registration error, FRE, from (3), is given in the linearized problem by (10) and can be expressed in terms of f_min as $FRE = {(f_{min}^{t} f_{min})}^{1 / 2}$ and that the length of the vector TRE(r) can be expressed in terms of δ_min as $TRE (r) = {(δ_{min}^{t} δ_{min})}^{1 / 2}$ . Since the elements of f_min are statistically independent of those of δ_min, any function of only f_min is statistically independent of any function of only δ_min. As primary examples, FRE is statistically independent of |TRE(r)|. Since they are independent, they are also uncorrelated. We let CC (FRE, TRE) be the correlation coefficient of FRE and |TRE(r)|

CC (FRE, TRE) = \frac{〈 (FRE - μ_{FRE}) (TRE - μ_{TRE}) 〉}{(σ_{FRE} σ_{TRE})}

(43)

where we have abbreviated |TRE(r)| as TRE, and are letting μ and σ denote means and standard deviations. It easy to see that, to first order in FLE, CC (FRE, TRE) = 0, meaning that there is to first order no correlation between FRE and TRE.

H. Two-Dimensional Space

The results of this section hold not only for 3-D space, but for 2-D space as well. The details are provided in Appendix I-C of the Appendix along with the 2-D forms of (33) through (41), but these equations can be transliterated to two dimensions merely by making the following changes to the summation limits:3N → 2N, 7 → 4, 3i → 2i, 3i − 2 → 2i − 1, and 6 → 3.

I. Comparison With Previously Published Derivations

We compare our formulations with five previous derivations. These derivations, which are spread among many publications with differing notations, are, unlike our general derivation, limited in every case to uniform weighting (i.e., no weighting) or to ideal weighting. In our validation section below, we compare our formulas with these other authors’ formulations on specific cases and find agreement in every case.

The first result is an expression for the probability density of FRE² when ideal weighting is employed. We note that ${FRE}^{2} = f_{min}^{t} f_{min} = \sum_{k = 1}^{3 N} {(f_{min})}_{k}^{2}$ , which by (27) equals $\sum_{j = 7}^{3 N} \sum_{l = 7}^{3 N} {\tilde{e}}_{j} {\tilde{e}}_{l} \sum_{k = 1}^{3 N} U_{k l} U_{k j}$ , and since U is orthogonal, we have ${FRE}^{2} = \sum_{j = 7}^{3 N} {\tilde{e}}_{j}^{2}$ . As noted in Section III-G, when ideal weighting is employed, the ẽ_j become statistically independent and normally distributed with equal variances (= w²). Therefore, FRE² is chi-square distributed with 3N − 6 degrees-of-freedom. The probability density of such a function is well known [22]

p ({FRE}^{2}) = \frac{{FRE}^{(3 N - 8)} e^{- ({FRE}^{2}) / 2}}{w^{2} 2^{(3 / 2 N - 3)} Γ (\frac{3}{2} N - 3)}

(44)

where Γ denotes the Gamma function⁵. This probability density is identical to that derived by Sibson for the case of isotropic, homogeneous FLE with no weighting [5]. Here we have not only re-derived a thirty-year-old result but also shown that it applies to a more general case, namely inhomogeneous, anisotropic FLE with ideal weighting. It differs from the distribution for ideal weighting published recently by Moghari [19] for fre (instead of FRE). Also, for ideal weighting, since the ẽ_j are independent and normally distributed with variances equal to w², we have 〈FRE²〉 = (3N − 6)w², and for normalized weighting

〈 {FRE}^{2} 〉 = \frac{3 (3 N - 6)}{\sum_{1}^{N} trace (\sum_{i}^{- 1})} .

(45)

For the special case of homogeneous FLE (45) reduces to $〈 {FRE}^{2} 〉 = 3 (3 N - 6) / \sum_{k}^{3} σ_{k}^{2}$ , where $σ_{k}^{2}$ is the variance of FLE along one of its principal axes, and for the case of homogenous, isotropic FLE, which is the case originally treated by Sibson, it reduces exactly to (1).

The second result is an expression for 〈FRE²〉 for ideal weighting, first derived for the case of isotropic, homogeneous FLE in 1998 [6] and given above by (2) and derived for general FLE in 2009 [11]. With ideal weighting we find from (32) and (42) that (U^tWΣW^tU)_jj_′ = δ_ij_′ (i.e., the Kronecker delta). Using this result in (41) yields

〈 {TRE}^{2} 〉 = \sum_{k, j = 1}^{3, 6} \frac{{(D V)}_{k j}^{2}}{(Λ_{j j}^{2})} .

(46)

Despite the considerable detail in our derivations of this result, its calculation is quite simple, requiring evaluation only of (16), (15), (19), (29), and (46) in that order.

The third result is an expression for 〈TRE²〉 when FLE is inhomogeneous and anisotropic but no weighting is employed, first derived in 2009 by Wiles first for the homogeneous case [9] and then for the general case [23]. Our expression for this case is gotten simply by removing W from the expression in (41) and noting that, in the construction of the block-diagonal covariance matrix Σ, the nonzero blocks Σ, are all equal.

The fourth result is the decomposition of TRE into three statistically independent orthogonal normally distributed components when FLE is homogeneous and isotropic and no weighting is employed, which was first accomplished in 2001 [7]. We begin by noting that, for this case, having no weighting is equivalent to employing ideal weighting with w = 1. We then have (U′WΣW′U)_jj_′ = δ_ij_′, as in the second result above. Using this result in (39), we find that ${(\sum_{TRE})}_{k l} = \sum_{j = 1}^{6} {(D V)}_{k j} {(D V)}_{l_{j}} / Λ_{j j}^{2}$ . Performing an eigen decomposition yields $\sum_{TRE} = V_{TRE} Λ^{(TRE)} V_{TRE}^{'}$ . Since Σ_TRE is symmetric, V_TRE is orthogonal. If we define $η = V_{δ}^{'} δ_{min}$ , then, since V_TRE is orthogonal and the elements of ẽ are independent and normally distributed, the three components, η_k, are uncorrelated, normally distributed, orthogonal components of TRE with variances equal to η_k.

The fifth result is the set of expressions in (36) through (38) for the statistics of the unweighted individual fiducial registration error with ideal weighting used the registration step when FLE is homogeneous or isotropic. Analytical expressions for these two cases were first derived in 2009 [19]. They reduce in the homogeneous and isotropic case to expressions first derived in 2008 [10].

IV. Validation

Our theory has three major results, which can be summarized as follows: For arbitrary weighting and arbitrary FLE, (a) (33) gives the covariance of FRE, (b) (40) gives the covariance of TRE, (c) FRE and TRE are statistically independent. Our other results are derived from (a) and (b). We tested our results as follows (1) We compared (33), (37), (41) and (46) to previously published analytic expressions for special cases. (2) We compared (33) and (39) to true values obtained by means of simulations, and we measured their cross correlation and dependence. All calculations were performed using Matlab 2010b (Mathworks, Natick, MA) on a Dell Latitude D830 with an Intel Centrino Duocore 2.2 GHz CPU with 2 GB RAM.

A. Comparisons to True Values via Simulations

For each simulation experiment, the following was performed: Randomly select a set S(N) of N unperturbed fiducial positions x_i inside a 200 mm cube and one random target position r inside a 400 mm cube (the cubes share a common corner). Then repeat the following 15 times.

For each x_i, define a covariance FLE matrix $\sum_{i}^{(1)} = V_{i}^{(1)} Λ_{i}^{(1)} {(V_{i}^{(1)})}^{t}$ , where the columns of $V_{i}^{(1)}$ are the principal axes of FLE and the diagonal elements ${(Λ_{i}^{(1)})}_{j j} = {(σ_{i}^{(1)})}_{j}^{2}$ the variances of the independent components of FLE.
Construct a random rotation R⁽⁰⁾ and a random translation t⁽⁰⁾ and apply them to the x_i to produce N corresponding unperturbed positions y_i [see (4)].
Generate a covariance FLE matrix $\sum_{i}^{(2)}$ for each y_i in a similar manner to that used to define $\sum_{i}^{(1)}$ .
Generate a weighting matrix W_i for each point.
Repeat the following steps 10 000 times.
1. Perturb each fiducial position in each space to produce x_i + Δx and y_i + Δy according to $\sum_{i}^{(1)}$ and $\sum_{i}^{(2)}$ , respectively, with random, normally distributed, zero-mean components.
2. Employ a registration algorithm to find R and t that minimize the exact registration formula of (3).
3. Calculate TRE(r) and FRE.
Calculate CC(FRE, TRE) for the 10 000 registrations of Step 5.
Perform the chi-square test described in Appendix I-D of the Appendix to detect dependence between FRE and TRE for the 10 000 registrations of Step 5.
Compute rms(TRE) and rms(FRE) for the 10 000 registrations of Step 5.
Calculate rms(TRE) = 〈TRE²〉^1/2 and rms(FRE) = 〈FRE²〉^1/2 our formulas.
Calculate CC between rms(TRE) from Steps 8 and 9; calculate CC between rms(FRE) from Steps 8 and 9.

The registration algorithm employed to accomplish Step 5b, depends on the weighting being used. For uniform weighting, the method of singular-value decomposition was employed (e.g., [20, Algorithm 8.1]). For ideal weighting, the algorithm of [21] was employed, except that in Step 1 of that algorithm, $W_{i} = w \times {(R \sum_{i}^{(1)} R^{t} + \sum_{i}^{(2)})}^{- 1 / 2} = 〈 {fre}_{i}^{2} 〉$ , as given in Section III-G. This amendment to the algorithm is necessary for ideal weighting when FLE is anisotropic and/or inhomogeneous.

We employed these simulation experiments to determine the accuracy of 1) our expressions for TRE and FRE and 2) our claims that TRE and FRE are statistically independent and hence uncorrelated. As pointed out in [2], a failure to detect dependence does not guarantee that correlation is zero. The more specific correlation test may succeed where the dependence test fails, so we apply separate tests for correlation and dependence—the t-test to detect correlation and the chi-square test based on contingency tables to detect dependence [22]. The latter test is described Appendix I-D of the Appendix. For all tests we chose p = 0.05 with Bonferroni correction applied where appropriate. These tests were also used in [2].

V. Results

A. Comparison to Previously Published Expressions

We compared (33), (37), (41), and (46) with previously published and validated analytic expressions for the first three cases and the fifth case in Section III-A. Regarding (33), as pointed out in Section III-A, the simplification of that equation produced by the assumption of isotropy and homogeneity for FLE yields (45), which agrees exactly with the formula for FRE² derived by Sibson that case. We compared the outputs of (41) to the formulas for V_i = 〈TRE²〉 published by Moghari [11] and Wiles [9] for a large set of inputs. Similarly we compared (37) to the formula for $V_{i} = 〈 {fre}_{i}^{2} 〉$ published by Moghari [19]. For each comparison, we repeated the following 1000 times: We generated sets S(N) of random fiducials and a random target and in the manner described above for the simulations, with N = 3, 4, 5,…, 10, 20, 30 and 40 fiducials. We generated a two-space covariance Σ_i in the same manner as that used in the simulations to generate $\sum_{i}^{(1)}$ , based on V_i and (σ_i)_j. We chose random orientations for each set of principle axes and chose random values of (σ_i)_j. We compared (46), with the formulation of Moghari, who provided an analytic formula for the case of isotropic, inhomogeneous FLE with ideal weighting [11], and we compared (41) with W set to the identity (i.e., no weighting) to the formula given by Wiles’s for the case of anisotropic, homogeneous FLE with no weighting [9]. We compared (37) to Moghari’s formula for the case of isotropic inhomogeneous FLE with ideal waiting [17]. In every case, our outputs agreed with those of these previously published formulas (maximum difference = 1.2 × 10⁻¹⁰). In summary, our new general formulas agree with existing formulas, each of which is limited to a special case.

B. Comparison With Simulations

The important new feature that our theory supports is arbitrary weighting for arbitrary FLE. The sets of possible weightings and FLEs are each infinite, but we focused on two types of weighting—ideal weighting and uniform weighting, which is equivalent to no weighting—and on two sizes of FLE—small, which we defined as rms(FLE) < 1 mm, and large. For the latter case we used rms(FLE) = 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, and 70 mm. Our motivation for the small-versus-large-FLE dichotomy is to explore the limitations of our linear approximation, which is strictly correct only in the limit of small FLE. For small FLE, we tested with N = 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, and 40 fiducials. For the large FLE values, we limited our testing to four fiducials in order to control the requisite simulation time, which for each value of N requires 24 h for 15 values of rms(FLE). For the case of no weighting, we investigated only the case of small FLE, because we found correlation without moving above 1 mm.

Utilizing the simulation scheme above, in every case, we found the values calculated in Steps (8) and (9) to be statistically equivalent (no significant difference via paired t-test at p < 0.05). The maximum absolute percentage differences were less than 1.5% for FLE in the range from 1 to 10 mm (n = 555) and were less than 4.1% for FLE in the range from 1 to 50 mm (n = 120). For Step 10, in which we calculated the CC between the values of rms(TRE) of Steps 8 and 9, and the CC between the values of rms(FRE) calculated in Steps 8 and 9, we found in both cases that CC was well above 99.9%. The minimal lower bound of 95% confidence was 99.99%. Thus, our formulas for both FRE and TRE agree with simulation and show only a slight degradation in accuracy even when FLE is far larger than any value that would be tolerated in surgical guidance (50 mm is equal to one-fourth of the side of the 200 mm cube within which the fiducials were randomly placed for this simulation).

Figs. 2 and 3 summarize the results of our investigation into the correlation and dependence of FRE and TRE. As predicted by our theory, neither correlation nor dependence is detectable when ideal weighting is employed, and FLE is less than 1 mm [Fig. 2(a)]. For FLE > 20 mm, correlation is statistically significant but remains negligible (CC < 0.1) even for FLE = 70 mm [Fig. 2(b)]. For uniform weighting (i.e., no weighting) a very small correlation is observed, CC ≤ 0.1 that decreases with increasing N [Fig. 3(a)]. This correlation is statistically significant for all values of N, showing that, while FRE has some predictive power when higher order terms are included, that power is negligible. The dependence that this tiny correlation implies was undetectable for all but a very few cases. Detection is achieved when the chi-square statistic q (Appendix I-D of the Appendix) rises above a critical value shown by the horizontal line in Fig. 3(b). Such values were observed very rarely, as can be seen by the sub-critical means and, for N ≥ 20, the sub-critical values of mean + standard deviation.

Fig. 2 — FRE-TRE correlation coefficient CC for ideal weighting (a) FLE < 1 mm with varying number of fiducials (b) Varying FLE with four fiducials. In each case, the solid line is the mean of 15 sets of 10,000 registrations for randomly selected fiducial and target positions (see text). The dashed lines are mean ± one standard deviation. Correlation is insignificant at p < 0.05 for all of (a) and for FLE < 10 mm in (b). Statistically significant but negligible (<0.1) correlation occurs for FLE ≥ 20 mm, showing that, while FRE has some predictive power when higher order terms are included, the power is negligible.

Fig. 3 — FRE-TRE statistics for FRE < 1 mm when uniform weighting (i.e., no weighting) is employed (a) Correlation coefficient CC versus number of fiducials. A statistically significant but negligible (<0.1), correlation is apparent that decreases with increasing number of fiducials (b) The chi-square statistic q for dependence versus number of fiducials. The dashed lines are mean ± one standard deviation. The horizontal solid line in (b) is the critical level above which values of q indicate dependence. The slight dependence implied by the small correlation in (a) is not detected by q, whose mean remains below the critical level.

Finally, we repeated our experiments with targets positions chosen to be equal to fiducial positions in order to determine whether some dependence between TRE and FRE might appear in the special case in which a target is close to a fiducial. No such correlation is possible according to our theoretical development, but we explored this case to eliminate the possibility that appreciable second-order dependence might appear. The results were the same as for the randomly chosen target positions—no detectable dependence and no significant correlation.

VI. Discussion

We have generalized a method of first-order analysis of the point-registration problem introduced in 2009 [2] so that it accommodates inhomogeneous and anisotropic FLE with arbitrary weighting. With this generalization we are able to provide first-order expressions for the covariance matrices of FRE and TRE—(33) and (40)—an expression for individual FRE—(34)—and expressions for 〈FRE²〉 and 〈TRE²〉—(35) and (41). As a result, we have unified the theory of first-order FRE and TRE statistics, which has heretofore been solved piecemeal only for special cases, each of which involved a separate formulation [6], [7], [9], [11], [19]. We have included in Appendix I-E of the Appendix an implementation of (33), (35), (40), and (41) in Matlab. With this implementation, we have reduced to 25 lines of Matlab not only all the many special cases of weightings and FLE anisotropies heretofore spread over many pages of many papers with different notations but also all other possible combinations of weightings, inhomogeneities, and anisotropies.

In addition, by means of our new method we are able to generalize the major result of [2]: namely, that, to first order, FRE and TRE are independent when FLE is homogeneous and isotropic. Our generalization reveals that, in addition, if ideal weighting is employed, FRE and TRE are independent to first order when FLE is inhomogeneous and/or anisotropic. Ideal weighting, which we define in Section III-G, is not a new concept. It was introduced to point registration by Ohta and Kanatani in 1998 [16] and used by West et al.. in 2001 [17] and by Moghari et al.. in 2009 [11], [19] as well, but the proof that it produces independence between FRE and TRE is new.

We note that this independence in no way detracts from the importance of (1) and (2), from the generalizations of these expressions provided in the excellent work by Wiles [9], [23], Balachandran [10], and Moghari [11], [19], and further generalizations given in (33) through (38), (40) and (41), each of which relates one expectation value to another. Equation (1) and Wiles’s generalization of that relationship [23] can be used to estimate the FLE covariance from measurements of FRE obtained from repeated registrations based on independent fiducial localizations. Based on this estimate, (2) or one of its generalizations can then be used to estimate an expectation value for TRE for a given fiducial configuration and target position. What we have shown is that the value of FRE observed for a given registration is unreliable as means to estimate the deviation of TRE from its expected value for that same registration.

We have compared our results to previously published formulas for FRE and TRE and to our own simulations to validate our new formulation. While there is an infinite variety of FLE patterns and weighting schemes available, we focused on inhomogeneous and anisotropic FLE with both ideal weighting and uniform weighting. We generated random fiducial and target positions and used both small FLE—rms(FLE) < 1 mm, and large FLE—up to 70 mm. We found excellent agreement in all cases. We also used simulations to evaluate correlation and dependence between FRE and TRE when the first-order approximation is not made. We found, in agreement with our theoretical results, that correlation and dependence approach zero for both weighting schemes as FLE approaches zero, for every case tested. We found for very large FLE that, while there is clearly a statistically significant correlation at higher order, that correlation is so small as to have negligible value as a predictor of TRE.

Our experiments on uniform weighting explain why neither Steinmeier nor Woerdeman, both of whom experimented with systems that apply this weighting scheme, were able to detect appreciable correlation between FRE and TRE, and it provides further support for the argument advanced in [2] that estimates of fluctuations of registration accuracy, i.e., TRE, based on the goodness of fit of the fiducials, e.g., FRE, are unreliable.

VII. Conclusion

By means of a new unified approach to the first-order analysis of the point registration problem, we are able to provide general expressions for the covariances of fiducial registration error and of target registration error. These two expressions and all the other expressions that we derived from them are appropriate for any set of fiducial localization error patterns, whether inhomogeneous or not and whether anisotropic or not. Furthermore, they are appropriate for any arbitrary set of weightings in the point registration problem, including ideal weighting and uniform weighting. Furthermore, this approach allows us to show that, for ideal weighting, FRE and TRE are to first order neither correlated nor dependent. We have verified these results by means of comparison to previous derivations and to simulations. Finally, by means of simulation we have shown that, for all cases examined, correlation between FRE and TRE is also negligible for uniform weighting even for the exact case. These results reinforce the message delivered in [2] that for ideal weighting and uniform weighting, the latter of which is used in commercial guidance systems, fluctuations of measures of the goodness of fit of the fiducials, e.g., fiducial registration error, have no statistical relationship, or at most a negligible relationship, to fluctuations of individual measures of registration accuracy, i.e., target registration error. Therefore, as estimators of registration accuracy, they are unreliable and should approach with extreme caution both by the purveyors of guidance systems and by the practitioners who use them.

Acknowledgments

This work was supported in part by the National Institute on Deafness and Other Communication Disorders under Grant RO1DC008408 and Grant R01DC010184, in part by the National Institute of Biomedical Imaging and Bioengineering under Grant R01EB006136, and in part by Vanderbilt University’s International Office.

The authors would like to thank the anonymous reviewers whose suggestions greatly improved the form and clarity of this paper.

Appendix

Derivation of Ideal Weighting

The standard approach to finding the rigid transformation by minimizing (3) is based on maximum likelihood. Maximum likelihood was introduced into the point registration problem by Ohta and Kanatani in 1998 [16] and used by West et al. in 2001 [17] and by Moghari in 2009 [11], [19]. It has also been invoked for intensity-based registration methods [23]. The goal in point registration is to find the transformation that is most likely to be correct given the measured positions of the fiducial points in the two spaces. That goal can be reached only if the matrices W_i in that equation provide “ideal” weighting. In this section we derive the ideal weighting matrices and show that they provide the maximum-likelihood solution. We begin with the given sets { $x_{i}^{'}$ } and { $u_{i}^{'}$ }, i = 1, …, N of measured vector positions in two spaces, where $x_{i}^{'} = x_{i} + Δ x_{i}$ and $y_{i}^{'} = y_{i} + Δ y_{i}$ , x_i and y_i are the true but unknown positions of corresponding points in the respective spaces, and Δx_i and Δy_i are their unknown localization errors. We wish to find the most likely rigid transformation, but, as we do, we will also find the most likely true positions. Specifically, we wish to find the rotation matrix and translation vector R,t and the {x_i} and{y_i} that together maximize the conditional probability density $p (R, t, {y_{i}} ∣ {x_{i}^{'}}, {y_{i}^{'}})$ . The set of true points {x_i} is missing from the variables in this density because they are completely determined by R,t, and the {y_i}, since by definition x_i = R^ty_i − R^tt.

Thus, we wish to maximize $p (R, t, {y_{i}} ∣ {x_{i}^{'}}, {y_{i}^{'}})$ with respect to R,t, and {y_i} for fixed { $x_{i}^{'}$ } and { $y_{i}^{'}$ }. From Bayes’ law, we have that

p (R, t, {y_{i}} ∣ {x_{i}^{'}}, {y_{i}^{'}}) = p ({x_{i}^{'}}, {y_{i}^{'}} ∣ R, t, {y_{i}}) \frac{p (R, t {y_{i}})}{p ({x_{i}^{'}}, {y_{i}^{'}})}

(47)

and, absent prior knowledge of $p ({x_{i}^{'}}, {y_{i}^{'}})$ , we neglect its dependence on R,t, and {y_i}. Furthermore, $p ({x_{i}^{'}}, {y_{i}^{'}})$ is fixed because both { $x_{i}^{'}$ } and { $y_{i}^{'}$ } are fixed. Therefore, we may achieve our goal by maximizing the conditional probability $p ({x_{i}^{'}}, {y_{i}^{'}} ∣ R, t, {y_{i}})$ with respect to R,t, and {y_i}.

We now assume that all localization errors are independent and that their components have zero mean and are normally distributed about their principal axes. With this assumption we have

p ({x_{i}^{'}}, {y_{i}^{'}} ∣ R, t, {y_{i}}) = c exp (- \sum_{i = 1}^{N} (Δ x_{i}^{t} \sum_{x i}^{- 1} Δ x_{i} + Δ y_{i}^{t} \sum_{y i}^{- 1} Δ y_{i}))

(48)

where Σ_xi and Σ_yi are respectively the covariances of Δx_i and Δy_i. It can be seen from our definitions above and from minor manipulation that $Δ x_{i}^{t} \sum_{x i}^{- 1} Δ x_{i}$ equals ${({R x}_{i}^{'} + t - y_{i})}^{t} S_{x i} ({R x}_{i}^{'} + t - y_{i})$ , where $S_{x i} = R \sum_{x i}^{- 1} R^{t}$ , and that $Δ y_{i}^{t} \sum_{y i}^{- 1} Δ y_{i}$ equals ${(y_{i}^{'} - y_{i})}^{t} S_{y i} (y_{i}^{'} - y_{i})$ , where $S_{y i} = \sum_{y i}^{- 1}$ . After these substitutions, we have $p (R, t, {x_{i}}, {y_{i}} ∣ {x_{i}^{'}}, {y_{i}^{'}}) = c exp (- G (R, t, {x_{i}^{'}}, {y_{i}^{'}}, {y_{i}}))$ , where

G (R, t, {x_{i}^{'}}, {y_{i}^{'}}, {y_{i}}) = \sum_{i = 1}^{N} {({R x}_{i}^{'} + t - y_{i})}^{t} \times R \sum_{x i}^{- 1} R^{t} ({R x}_{i}^{'} + t - y_{i}) + {(y_{i}^{'} - y_{i})}^{t} S_{y i} (y_{i}^{'} - y_{i}) .

(49)

Thus, in order to maximize the likelihood, we need to minimize $G (R, t, {x_{i}^{'}}, {y_{i}^{'}}, {y_{i}})$ with respect to R,t, and {y_i}. We minimize with respect to the {y_i} by taking the derivative of G with respect to each of the components of each of the and setting all 3N derivatives to zero. The result is

y_{i} = {(S_{x i} + S_{y i})}^{- 1} (S_{x i} ({R x}_{i}^{'} + t) + S_{y i} y_{i}^{'}) .

(50)

We note that the expression for y_i given in (50), which is a weighted mean of the ith transformed measured point in x space and its corresponding measured point in y space, provides the best available approximation for the true positions needed in all the expressions for TRE and FRE statistics derived in the body of this paper. In the case of isotropic, homogeneous FLE, as, for example, in (2), this expression reduces to $y_{i} = (({R x}_{i}^{'} + t) + y_{i}^{'}) / 2$ , i.e., a simple unweighted average.

Finally, we substitute our expression for y_i into G to obtain (after considerable manipulation)

G (R, t, {x_{i}^{'}}, {y_{i}^{'}}, {y_{i}}) = \sum_{i = 1}^{N} {({R x}_{i}^{'} + t - y_{i}^{'})}^{t} W_{i}^{2} ({R x}_{i}^{'} + t - y_{i}^{'})

(51)

where W_i = (RΣ_xiR^t + Σ_yi)^−1/2. In order to reach our goal, the remaining task is to find the transformation R,t that minimizes G. In this minimization process, we can use W_i = w(RΣ_xiR^t + Σ_yi)^−1/2, where w is any constant. For this reason we refer to W_i = w(RΣ_xiR^t + Σ_yi)^−1/2 as “ideal” weighting. The right side of (51) has the same form as the right side of (3), but the weighting is unspecified in (3), whereas the derivation above shows that only ideal weighting can achieve maximum likelihood.

Statistical Independence of the Elements of ẽ

In this appendix we show that, if the M elements of the vector e are statistically independent, normally distributed variables with zero mean, then, if the M×M matrix V is orthogonal, the following vector has these same three characteristics:

{\tilde{e}}_{i} = \sum_{k = 1}^{M} V_{k i} e_{k} .

(52)

If the mean of e is zero, then the mean of ẽ is zero.

That the mean is zero is easily seen by computing the expectation value, which we denote by 〈.〉, of both sides of (52)
$〈 {\tilde{e}}_{i} 〉 = 〈 \sum_{k = 1}^{M} V_{k i} e_{k} 〉 = \sum_{k = 1}^{M} V_{k i} 〈 e_{k} 〉 = \sum_{k = 1}^{M} V_{k i} \cdot 0 = 0.$ (53)
If the elements of e are uncorrelated with equal variances and V is orthogonal, then the elements ẽ are uncorrelated.

Because of (53), the correlation of ẽ_i and ẽ_j is equal to 〈(ẽ_i − 〈ẽ_i〉) (ẽ_j − 〈ẽ_j〉)〉
$\begin{array}{l} 〈 ({\tilde{e}}_{i} - 〈 {\tilde{e}}_{i} 〉) ({\tilde{e}}_{j} - 〈 {\tilde{e}}_{j} 〉) 〉 = 〈 \sum_{k}^{M} V_{k i} (e_{k} - 〈 e_{k} 〉) \sum_{l}^{M} V_{l j} (e_{l} - 〈 e_{l} 〉) 〉 \\ = \sum_{k}^{M} \sum_{l}^{M} V_{k i} V_{l j} 〈 (e_{k} - 〈 e_{k} 〉) (e_{l} - 〈 e_{l} 〉) 〉 \\ = σ^{2} \sum_{k}^{M} \sum_{l}^{M} V_{k i} V_{l j} δ_{k l} \\ = σ^{2} \sum_{k}^{M} V_{k i} V_{k j} = σ^{2} δ_{i j} \end{array}$ (54)

where we used the fact that the elements of e are uncorrelated and equal in going from the first line of (54) to the second, defined $σ^{2} = 〈 e_{i}^{2} 〉$ to be the variance of an element of e, and used the orthogonality of V in the last step.

If the elements of e are independent and normally distributed with equal variances and V is orthogonal, then the elements of ẽ are independent and are normally distributed with the same variance.

We note that, since the elements e_i are independent and normally distributed with variance σ², their joint probability density has the form

\begin{array}{l} p_{1, 2, \dots, M} (e_{1}, e_{2}, \dots, e_{N}) = \prod_{i = 1}^{M} {(σ \sqrt{2 π})}^{- 1} exp (\frac{- e_{i}^{2}}{(2 σ^{2})}) \\ = {(σ \sqrt{2 π})}^{- M} exp (\frac{- 1}{2 σ^{2}} \sum_{i = 1}^{M} e_{i}^{2}) \end{array}

(55)

where we have used the fact that the means of the e_i are zero⁶. Because each ẽ_i is a linear combination of the e_i, their joint density has the form (by generalizing the derivations in [18, p. 199–201 ] from two variables to M variables)

q_{1, 2, \dots, M} ({\tilde{e}}_{1}, {\tilde{e}}_{2}, \dots, {\tilde{e}}_{M}) = {∣ J ∣}^{- 1} p_{1, 2, \dots, M} (z_{1}, z_{2}, \dots, z_{M})

(56)

where J = det(V^t) and z = (V^t)⁻¹ẽ. Since V is orthogonal, |J| = 1 and (V^t)⁻¹ = V. Using these properties and (55) in (56) gives us that

\begin{array}{l} q_{1, 2, \dots, M} ({\tilde{e}}_{1}, {\tilde{e}}_{2}, \dots {\tilde{e}}_{M}) = {(σ \sqrt{2 π})}^{- M} exp (- \sum_{i = 1}^{M} \frac{{(V \tilde{e})}_{i}^{2}}{(2 σ^{2})}) \\ = {(σ \sqrt{2 π})}^{- M} exp (- \frac{{(V \tilde{e})}^{t} V \tilde{e}}{2 σ^{2}}) \\ = {(σ \sqrt{2 π})}^{- M} exp (- \frac{{\tilde{e}}^{t} V^{t} V \tilde{e}}{2 σ^{2}}) \\ = {(σ \sqrt{2 π})}^{- M} exp (- \frac{{\tilde{e}}^{t} \tilde{e}}{2 σ^{2}}) \\ = {(σ \sqrt{2 π})}^{- M} exp (- \sum_{i = 1}^{M} \frac{{\tilde{e}}_{i}^{2}}{2 σ^{2}}) \end{array}

(57)

where we have used the orthogonality of V in going from the second line of (57) to the third line. Equation (57) can be written as a product of functions of the individual ẽ_i as follows: $q_{1, 2, \dots M} ({\tilde{e}}_{1}, {\tilde{e}}_{2}, \dots, {\tilde{e}}_{M}) = \prod_{i = 1}^{M} q_{i} ({\tilde{e}}_{i})$ , where $q_{i} ({\tilde{e}}_{i}) = {(σ \sqrt{2 π})}^{- 1} exp (- {\tilde{e}}_{i}^{2} / (2 σ^{2}))$ , which is of normal form. Since the joint probability density of the ẽ_i is equal to the product of individual functions, the ẽ_i are mutually statistically independent. Since the individual functions are of normal form, they are normally distributed with variance σ².

Two-Dimensional Case

The formulas and the proof of independence are derived in Section III only for 3-D transformations, but they all carry over to the 2-D case as well. We highlight here the salient changes that result when the points and the transformations are 2-D. First, x_i, y_iΔx_i, Δy_i, Δξ_i, and $Δ ξ_{i}^{(0)}$ are 2×1. Second, W_i, R, R⁽⁰⁾, $\sum_{i}^{(1)}, \sum_{i}^{(2)}$ , Σ_i, I, and ΔR, are now 2×2 matrices, t, Δt, and

Δ R = [\begin{matrix} 0 & - Δ θ \\ Δ θ & 0 \end{matrix}]

(58)

Third, C is a now a 2N×3 matrix, and q is a three-element vector whose elements are Δθ, Δt₁, Δt₂. Fourth,

X_{i}^{(0)} = [\begin{matrix} - x_{i 2}^{(0)} \\ x_{i 1}^{(0)} \end{matrix}]

(59)

and

e_{2 (i - 1) + j} = {(W_{i} Δ ξ_{i}^{(0)})}_{j}, i = 1, \dots, N, j = 1, 2

(60)

Fifth, Λ is 2N×3, V is 3×3, Λ⁺ is 3×2N, and the proof that all singular values (there are now only three of them) are nonzero involves only the third of the three equations in (20). It is now the first three (instead of first six) diagonal elements of ΛΛ⁺ that are equal to 1, and the last 2N −3 (instead of the last 2N − 6) are equal to 0. As a result, each component of FRE_i for each fiducial point is a function of only the last 2N − 3 elements of ẽ_i, and each component of TRE(r) is a function of only the first three elements of ẽ. The proof that the 2N elements of ẽ are independent makes no reference to the dimension of the space (see Appendix I-B above). Therefore, FRE_i and TRE(r) are statistically independent for the 2-D case, just as they are for the 3-D case.

Sixth, the block-diagonal matrices have slightly different forms to accommodate the reduced spatial dimensionality: Σ₂₍_i₋₁₎₊_j_,2(_i₋₁₎₊_j_′ = (Σ_i)_jj_′ and W₂₍_i₋₁₎₊_j_,2(_i₋₁₎₊_j_′ = (W_i)_jj_′, for i = 1,…, N, j = 1, 2, j′ = 1,2 with all other elements equaling zero.

Seventh, (33) through (41) must be altered to accommodate the reduced dimensionality, as follows:

\begin{array}{l} {(\sum_{FRE})}_{k l} = \sum_{j, j^{'} = 4}^{2 N} U_{k j} U_{l j^{'}} 〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉 \\ = \sum_{j, j^{'} = 4}^{2 N} U_{k j} U_{l j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} \end{array}

(61)

\begin{array}{l} {FRE}_{i}^{2} = \sum_{k = 2 i - 1}^{2 i} {(\sum_{j, j^{'} = 4}^{2 N} U_{k j} U_{k j^{'}} 〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉)}_{k k} \\ = \sum_{k = 2 i - 1}^{2 i} {(\sum_{j, j^{'} = 4}^{2 N} U_{k j} U_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}})}_{k k} \end{array}

(62)

\begin{array}{l} 〈 {FRE}^{2} 〉 = \sum_{k = 1}^{2 N} {(\sum_{FRE})}_{k k} \\ = \sum_{k = 1}^{2 N} \sum_{j, j^{'} = 4}^{2 N} U_{k j} U_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}} \\ = \sum_{j = 4}^{2 N} {(U^{t} W \sum W^{t} U)}_{j j} \end{array}

(63)

{(\sum_{fre})}_{k l} = \sum_{j, j^{'} = 4}^{2 N} {(W^{- 1} U)}_{k j} {(W^{- 1} U)}_{l j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}}

(64)

〈 {fre}_{i}^{2} 〉 = \sum_{k = 2 i - 1}^{2 i} \sum_{j, j^{'} = 4}^{2 N} {(W^{- 1} U)}_{k j} {(W^{- 1} U)}_{k j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}}

(65)

〈 {fre}^{2} 〉 = \sum_{j, j^{'} = 4}^{2 N} {(U^{t} W^{- 2} U)}_{j j^{'}} {(U^{t} W \sum W^{t} U)}_{j j^{'}}

(66)

{(\sum_{TRE})}_{k l} = 〈 {(δ_{min})}_{k} {(δ_{min})}_{l} 〉 = \sum_{j, j^{'} = 1}^{3} {(D V)}_{k j} {(D V)}_{l j^{'}} \frac{〈 {\tilde{e}}_{j} {\tilde{e}}_{j^{'}} 〉}{Λ_{j j} Λ_{j^{'} j^{'}}}

(67)

{(\sum_{TRE})}_{k l} = \sum_{j, j^{'} = 1}^{3} {(D V)}_{k j} {(D V)}_{l j^{'}} \frac{{(U^{t} W \sum W^{t} U)}_{j j^{'}}}{Λ_{j j} Λ_{j^{'} j}}

(68)

〈 {TRE}^{2} 〉 = \sum_{k = 1}^{2} {(\sum_{TRE})}_{k k} = \sum_{k = 1}^{2} \sum_{j, j^{'} = 1}^{3} {(D V)}_{k j} {(D V)}_{k j^{'}} \frac{{(U^{t} W \sum W^{t} U)}_{j j^{'}}}{Λ_{j j} Λ_{j^{'} j^{'}}} .

(69)

Chi-Square Test for Dependence

To detect dependence, we employ the standard chi-square test based on the contingency tables [22]. Each element, C(i,j), i = 1,2, …, N_c, j = 1,2, …, N_c, of an N_c×N_c contingency table is a count of the number of registrations for which FRE_i ≤ FRE < FRE_i₊₁ and TRE_j ≤ TRE < TRE_j₊₁, where the thresholds, FRE_i and TRE_j can be chosen arbitrarily. The resulting table is then used to calculate the following test statistic: $q = \sum_{i, j = 1}^{N_{C}} {(C (i, j) - C_{0} (i, j))}^{2} / C_{0} (i, j)$ , where C₀(i, j) is the count that is to be expected when FRE and TRE are independent, namely, C₀(i, j) = C₁(i)C₂(j)/M, where M is the total of all counts, $C_{1} (i) = \sum_{j^{'} = 1}^{N_{C}} C (i, j^{'})$ , and $C_{2} (j) = \sum_{i^{'} = 1}^{N_{C}} C (i^{'}, j)$ . The variance in q over repeated experiments (i.e., with different random localization errors) can be reduced by choosing the thresholds FRE_i and TRE_j such that $\sum_{j^{'} = 1}^{N_{C}} C (i_{1}, j^{'}) \approx \sum_{j^{'} = 1}^{N_{C}} C (i_{2}, j^{'})$ and $\sum_{i^{'} = 1}^{N_{C}} C (i^{'}, j_{1}) \approx \sum_{i^{'} = 1}^{N_{C}} C (i^{'}, j_{2})$ . If q is sufficiently large, then it can be concluded that FRE and TRE are dependent. A “sufficiently large” value is that of the chi-square distribution (i.e., cumulative distribution) $χ_{ν}^{2} (1 - p_{0})$ ⁷, where 1 − p₀ is the desired confidence in the truth of dependence. The number, v, of degrees of freedom for this problem equals (N_C −1)². Thus, a value of q that is larger than $χ_{ν}^{2} (1 - p_{0})$ indicates that dependence between FRE and TRE is detected at p ≤ p₀. Failure to detect q exceeding $χ_{ν}^{2} (1 - p_{0})$ suggests that our theory of independence can be extended from the linear approximation to the exact problem encountered in surgical guidance.

Computer Code to Implement the Derived Formulas

We provide in Fig. 4 a Matlab function to implement our new formulas for FRE and TRE. The values that it returns correspond to our equations as follows: (RMS_TRE)² = 〈TRE²〉 from (41), (RMS_FRE)² = 〈FRE²〉 from (35), Cov_TRE = Σ_TRE from (40), and Cov_FRE = Σ_FRE from (33). The inputs correspond to our definitions above as follows: $X 0 (:, i) = x_{i}^{(0)} = R^{(0)} x_{i}$ , W(:,:,i) = W_i, $Cov_FLE (:, :, i) = \sum_{i} = R^{(0)} \sum_{i}^{(1)} R^{{(0)}^{t}} + \sum_{i}^{(2)}$ , and r0 = r⁽⁰⁾ = R⁽⁰⁾r.

Like the formulas that it implements, this code is general. This relatively simple function handles the special cases of uniform weighting for homogeneous, isotropic FLE, first solved by Fitzpatrick et al. [6], uniform weighting for homogeneous, anisotropic FLE first solved by Wiles et al. [9], and ideal weighting for inhomogeneous, anisotropic FLE first solved by Moghari et al. [11], and it handles all other cases as well.

Footnotes

For FLE, FRE, and TRE, we use boldface (only) when we are denoting a vector quantity

Also called “deviations”

For convenience, we have augmented both U and Λ to the decomposition described in [15] by adding 3N − 6 columns to U so as to make it orthogonal and adding Λ rows of zeros to U.

⁴

Equation (30) of [11] and (9) of [19].

⁵

If the distribution for |FRE| is desired, we note that ρ(|FRE|) = 2|FRE|ρ (FRE²).

⁶

Zero means are not necessary for this derivation. They simply reduce its complexity. Since the means are in fact zero, we take advantage of that fact to simplify the equations.

⁷

Note that $χ_{ν}^{2}$ denotes a variable, while $χ_{ν}^{2} (1 - p_{0})$ denotes a distribution.

This paper is a modification of a paper presented February 2010 at the SPIE Medical Imaging Symposium.

Contributor Information

Andrei Danilchenko, Email: andrei.v.danilchenko@vanderbilt.edu, Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37235 USA.

J. Michael Fitzpatrick, Email: j.michael.fitzpatrick@vanderbilt.edu, Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN 37235 USA.

References

1.Danilchenko A, Fitzpatrick JM. General approach to error prediction in point registration. presented at the SPIE Med. Imag; San Diego, CA. Feb. 2010.. [Google Scholar]
2.Fitzpatrick JM. Fiducial registration error and target registration error are uncorrelated. presented at the SPIE Med. Imag; Lake Buena Vista, FL. Feb. 2009.. [Google Scholar]
3.Steinmeier R, Rachinger J, Kaus M, Ganslandt O, Huk W, Fahlbusch R. Factors influencing the application accuracy of neuron-avigation systems. Stereotactic Functional Neurosurg. 2000;75:188–202. doi: 10.1159/000048404. [DOI] [PubMed] [Google Scholar]
4.Woerdeman PA, Willems PW, Noordmans HJ, Tulleken CA, van der Sprenkel JW. Application accuracy in frameless image-guided neurosurgery: A comparison study of three patient-to-image registration methods. J Neurosurg. 2007 Jun;106(6):1012–6. doi: 10.3171/jns.2007.106.6.1012. [DOI] [PubMed] [Google Scholar]
5.Sibson R. Studies in the robustness of multidimensional scaling: Perturbational analysis of classical scaling. Journal of the Royal Statistical Society: Series B. 1979;41:217–229. [Google Scholar]
6.Fitzpatrick JM, West JB, Maurer CR., Jr Predicting error in rigid-body, point-based image registration. IEEE Trans Med Imag. 1998 Oct;17(5):694–702. doi: 10.1109/42.736021. [DOI] [PubMed] [Google Scholar]
7.Fitzpatrick JM, West JB. The distribution of target error in rigid-body, point-based registration. IEEE Trans Med Imag. 2001 Sep;20(9):917–927. doi: 10.1109/42.952729. [DOI] [PubMed] [Google Scholar]
8.West JB, Maurer CR., Jr Designing optically tracked instruments for image-guided surgery. IEEE Trans Med Imag. 2004 May;23(5):533–545. doi: 10.1109/tmi.2004.825614. [DOI] [PubMed] [Google Scholar]
9.Wiles AD, Likholyot A, Frantz DD, Peters TM. A statistical model for point-based target registration error with anisotropic fiducial localizer error. IEEE Trans Med Imag. 2007 Mar;27(3):378–390. doi: 10.1109/TMI.2007.908124. [DOI] [PubMed] [Google Scholar]
10.Balachandran R, Fitzpatrick JM. The distribution of registration error of a fiducial marker in rigid-body point-based registration. presented at the SPIE; San Diego, CA. 2008. [Google Scholar]
11.Moghari M, Abolmaesumi P. Distribution of target registration error for anisotropic and inhomogeneous fiducial localization error. IEEE Trans Med Imag. 2009 Jun;28(6):799–813. doi: 10.1109/TMI.2009.2020751. [DOI] [PubMed] [Google Scholar]
12.Suess O, Picht T, Kuehn B, Mularski S, Brock M, Kombos T. Neurosurgery without rigid pin fixation of the head in left frontotemporal tumor surgery with intraoperative speech mapping. Operative Neurosurg Suppl. 2007;2(60):330–338. doi: 10.1227/01.NEU.0000255378.80216.52. [DOI] [PubMed] [Google Scholar]
13.Fitzpatrick JM, Galloway RL. The use of fiducials in the registration of 3D images. Automedica Special Issue: Eng Approaches Neurological Surg. 2001;20 [Google Scholar]
14.Fitzpatrick JM. In: Detecting Failure, Assessing Success Medical Image Registration. Hajnal JV, Hill DLG, Hawkes DJ, editors. Boca Raton, FL: CRC; 2001. pp. 117–139. [Google Scholar]
15.Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical Recipes in C: The Art of Scientific Computing. 2. New York: Cambridge Univ. Press; 1992. [Google Scholar]
16.Ohta N, Kanatani K. Lecture Notes in Computer Science. Berlin, Germany: Springer; 1998. Optimal estimation of 3-D rotation and reliability evaluation; pp. 175–187. [Google Scholar]
17.West JB, Fitzpatrick JM, Batchelor PG. Point-based registration under a similarity transform. Proc. SPIE Med. Imag.; San Diego, CA. Feb. 2001; pp. 611–622. [Google Scholar]
18.Papoulis A, U Pillai S. Probability, Random Variables, and Stochastic Processes. 4. Boston, MA: McGraw Hill; 2002. pp. 266–268. [Google Scholar]
19.Moghari M, Abolmaesumi P. Distribution of fiducial registration error in rigid-body point-based registration. IEEE Trans Med Imag. 2009 Nov;28(11):1791–1801. doi: 10.1109/TMI.2009.2024208. [DOI] [PubMed] [Google Scholar]
20.Fitzpatrick JM, Hill DLG, Maurer CR., Jr . Registration. In: Sonka M, Fitzpatrick JM, editors. Medical Image Processing: Volume II of the Handbook of Medical Imaging. Bellingham, WA: SPIE; 2000. pp. 447–513. [Google Scholar]
21.Balachandran R, Fitzpatrick JM. Iterative solution for rigid-body point-based registration with anisotropic weighting. presented at the SPIE Med. Imag; Lake Buena Vista, FL. Feb. 2009.. [Google Scholar]
22.Spiegel MR. Probability and Statistics, Schaum’s Outline Series. New York: McGraw-Hill; 1975. pp. 116–116. [Google Scholar]
23.Wiles AD. Navigation accuracy of image-guided interventional systems. Dept. Med. Biophys., Univ. Western Ontario; London, ON, Canada: 2009. [Google Scholar]
24.Roche A, Malandain G, Ayache N. Unifying maximum likelihood approaches in medical image registration. Int J Imag Syst Technol: Special Issue 3D Imag. 2000;11:71–80. [Google Scholar]

[R1] 1.Danilchenko A, Fitzpatrick JM. General approach to error prediction in point registration. presented at the SPIE Med. Imag; San Diego, CA. Feb. 2010.. [Google Scholar]

[R2] 2.Fitzpatrick JM. Fiducial registration error and target registration error are uncorrelated. presented at the SPIE Med. Imag; Lake Buena Vista, FL. Feb. 2009.. [Google Scholar]

[R3] 3.Steinmeier R, Rachinger J, Kaus M, Ganslandt O, Huk W, Fahlbusch R. Factors influencing the application accuracy of neuron-avigation systems. Stereotactic Functional Neurosurg. 2000;75:188–202. doi: 10.1159/000048404. [DOI] [PubMed] [Google Scholar]

[R4] 4.Woerdeman PA, Willems PW, Noordmans HJ, Tulleken CA, van der Sprenkel JW. Application accuracy in frameless image-guided neurosurgery: A comparison study of three patient-to-image registration methods. J Neurosurg. 2007 Jun;106(6):1012–6. doi: 10.3171/jns.2007.106.6.1012. [DOI] [PubMed] [Google Scholar]

[R5] 5.Sibson R. Studies in the robustness of multidimensional scaling: Perturbational analysis of classical scaling. Journal of the Royal Statistical Society: Series B. 1979;41:217–229. [Google Scholar]

[R6] 6.Fitzpatrick JM, West JB, Maurer CR., Jr Predicting error in rigid-body, point-based image registration. IEEE Trans Med Imag. 1998 Oct;17(5):694–702. doi: 10.1109/42.736021. [DOI] [PubMed] [Google Scholar]

[R7] 7.Fitzpatrick JM, West JB. The distribution of target error in rigid-body, point-based registration. IEEE Trans Med Imag. 2001 Sep;20(9):917–927. doi: 10.1109/42.952729. [DOI] [PubMed] [Google Scholar]

[R8] 8.West JB, Maurer CR., Jr Designing optically tracked instruments for image-guided surgery. IEEE Trans Med Imag. 2004 May;23(5):533–545. doi: 10.1109/tmi.2004.825614. [DOI] [PubMed] [Google Scholar]

[R9] 9.Wiles AD, Likholyot A, Frantz DD, Peters TM. A statistical model for point-based target registration error with anisotropic fiducial localizer error. IEEE Trans Med Imag. 2007 Mar;27(3):378–390. doi: 10.1109/TMI.2007.908124. [DOI] [PubMed] [Google Scholar]

[R10] 10.Balachandran R, Fitzpatrick JM. The distribution of registration error of a fiducial marker in rigid-body point-based registration. presented at the SPIE; San Diego, CA. 2008. [Google Scholar]

[R11] 11.Moghari M, Abolmaesumi P. Distribution of target registration error for anisotropic and inhomogeneous fiducial localization error. IEEE Trans Med Imag. 2009 Jun;28(6):799–813. doi: 10.1109/TMI.2009.2020751. [DOI] [PubMed] [Google Scholar]

[R12] 12.Suess O, Picht T, Kuehn B, Mularski S, Brock M, Kombos T. Neurosurgery without rigid pin fixation of the head in left frontotemporal tumor surgery with intraoperative speech mapping. Operative Neurosurg Suppl. 2007;2(60):330–338. doi: 10.1227/01.NEU.0000255378.80216.52. [DOI] [PubMed] [Google Scholar]

[R13] 13.Fitzpatrick JM, Galloway RL. The use of fiducials in the registration of 3D images. Automedica Special Issue: Eng Approaches Neurological Surg. 2001;20 [Google Scholar]

[R14] 14.Fitzpatrick JM. In: Detecting Failure, Assessing Success Medical Image Registration. Hajnal JV, Hill DLG, Hawkes DJ, editors. Boca Raton, FL: CRC; 2001. pp. 117–139. [Google Scholar]

[R15] 15.Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical Recipes in C: The Art of Scientific Computing. 2. New York: Cambridge Univ. Press; 1992. [Google Scholar]

[R16] 16.Ohta N, Kanatani K. Lecture Notes in Computer Science. Berlin, Germany: Springer; 1998. Optimal estimation of 3-D rotation and reliability evaluation; pp. 175–187. [Google Scholar]

[R17] 17.West JB, Fitzpatrick JM, Batchelor PG. Point-based registration under a similarity transform. Proc. SPIE Med. Imag.; San Diego, CA. Feb. 2001; pp. 611–622. [Google Scholar]

[R18] 18.Papoulis A, U Pillai S. Probability, Random Variables, and Stochastic Processes. 4. Boston, MA: McGraw Hill; 2002. pp. 266–268. [Google Scholar]

[R19] 19.Moghari M, Abolmaesumi P. Distribution of fiducial registration error in rigid-body point-based registration. IEEE Trans Med Imag. 2009 Nov;28(11):1791–1801. doi: 10.1109/TMI.2009.2024208. [DOI] [PubMed] [Google Scholar]

[R20] 20.Fitzpatrick JM, Hill DLG, Maurer CR., Jr . Registration. In: Sonka M, Fitzpatrick JM, editors. Medical Image Processing: Volume II of the Handbook of Medical Imaging. Bellingham, WA: SPIE; 2000. pp. 447–513. [Google Scholar]

[R21] 21.Balachandran R, Fitzpatrick JM. Iterative solution for rigid-body point-based registration with anisotropic weighting. presented at the SPIE Med. Imag; Lake Buena Vista, FL. Feb. 2009.. [Google Scholar]

[R22] 22.Spiegel MR. Probability and Statistics, Schaum’s Outline Series. New York: McGraw-Hill; 1975. pp. 116–116. [Google Scholar]

[R23] 23.Wiles AD. Navigation accuracy of image-guided interventional systems. Dept. Med. Biophys., Univ. Western Ontario; London, ON, Canada: 2009. [Google Scholar]

[R24] 24.Roche A, Malandain G, Ayache N. Unifying maximum likelihood approaches in medical image registration. Int J Imag Syst Technol: Special Issue 3D Imag. 2000;11:71–80. [Google Scholar]

PERMALINK

General Approach to First-Order Error Prediction in Rigid Point Registration

Andrei Danilchenko

J Michael Fitzpatrick

Roles

Abstract

I. Introduction

II. Background

Fig. 1.

III. Derivations

A. Registration Problem

B. Linearization

C. Some Useful Results From Linear Algebra

D. FRE in Terms of ẽ

E. TRE in Terms of ẽ

F. General Formulation for the Covariances of FRE and TRE

Fig. 4.

G. Statistical Independence of FRE and TRE for Ideal Weighting

H. Two-Dimensional Space

I. Comparison With Previously Published Derivations

IV. Validation

A. Comparisons to True Values via Simulations

V. Results

A. Comparison to Previously Published Expressions

B. Comparison With Simulations

Fig. 2.

Fig. 3.

VI. Discussion

VII. Conclusion

Acknowledgments

Appendix

Derivation of Ideal Weighting

Statistical Independence of the Elements of ẽ

Two-Dimensional Case

Chi-Square Test for Dependence

Computer Code to Implement the Derived Formulas

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases