Skip to main content
Taylor & Francis Open Select logoLink to Taylor & Francis Open Select
. 2015 Apr 22;110(509):368–379. doi: 10.1080/01621459.2014.908779

Size and Shape Analysis of Error-Prone Shape Data

Jiejun Du 1, Ian L Dryden 1, Xianzheng Huang 1
PMCID: PMC4456951  PMID: 26109745

Abstract

We consider the problem of comparing sizes and shapes of objects when landmark data are prone to measurement error. We show that naive implementation of ordinary Procrustes analysis that ignores measurement error can compromise inference. To account for measurement error, we propose the conditional score method for matching configurations, which guarantees consistent inference under mild model assumptions. The effects of measurement error on inference from naive Procrustes analysis and the performance of the proposed method are illustrated via simulation and application in three real data examples. Supplementary materials for this article are available online.

Keywords: Complex normal, Configuration, Landmark, Ordinary Procrustes analysis, Quaternion

INTRODUCTION

Data capturing the size and shape of an object are of great interest in many branches of science. For instance, facial recognition as a routine task in forensics relies on shape data of faces; chemists study shapes of molecules to understand and manipulate chemical properties; and study on shapes of complex organisms is an important part of research in biology and medicine. One approach that has a long history for characterizing sizes and shapes entails defining landmarks on an object, the collection of which is referred to as the configuration of the object (Kendall 1984; Bookstein 1991; Dryden and Mardia 1998). Then the shape data for this object consist of the geometric information of these landmarks in the configuration after removing location, rotation, and (possibly) scale. This way of measuring shape data has been adopted in biology, for example, among many other fields where applied scientists understand well the choice and interpretation of landmarks. Even though usually there are solid scientific grounds or mathematical motivations for the choice of landmarks, locating them on a subject and/or measuring their relative locations is often prone to error. This results in shape data as an error-contaminated measure of the true underlying shape of an object.

Assuming shape data free of measurement error, ordinary Procrustes analysis (Goodall 1991), referred to as OPA, is a conventional approach to match one configuration onto the other, which is an important step in size and shape comparisons among different objects. This method involves translation, rotation, and scaling of one configuration to match it onto the other configuration as closely as possible. In the presence of measurement error, naive implementation of OPA matches the noisy version of a configuration onto the other one. A natural question is whether or not this naive matching can reveal the same information regarding size and shape comparisons as when one matches the error-free configurations. For example, in the study of rat skulls (Bookstein 1991; Kenobi, Dryden, and Le 2010; Mardia et al. 2013), X-rays of rat skulls were recorded from age 7 days to 150 days, and it is of interest to estimate the change in size and shape of the skull as a rat grows. Assuming configurations of skulls are measured precisely, if one implements OPA to match the skull of a rat recorded at an earlier time onto the other skull of the same rat recorded later, then the amount of scaling entailed in OPA reflects the amount of growth of the skull during this time window. Suppose the reality is that configurations of skulls cannot be measured precisely, and thus the observed configurations are noisy surrogates of the unobserved true configurations. To study the growth pattern of rat skulls, it is important to understand the effects of measurement error on inference from naive OPA. We will show in Section 3 that matching two error-contaminated configurations via naive OPA can mask important distinctions between two configurations. This motivates new proposals that can account for measurement error when comparing sizes and shapes of objects. Note that for nontrivial matching the number of landmarks is more than the number of dimensions in which the data lie.

The remainder of the article is organized as follows. In Section 2 models for shape data accounting for measurement error are formulated for the purpose of matching two true configurations in two or three dimensions. Results regarding the effects of measurement error on inference from naive OPA are presented in Section 3. In Section 4, we propose the conditional score method. Simulation studies are reported in Section 5 to illustrate the performance of the proposed methods in comparison with naive OPA. These methods are applied to three real data examples in Section 6. We conclude the article with discussions of our findings and future research directions in Section 7.

2. MEASUREMENT ERROR MODELS

2.1. Models for Two-Dimensional Size-and-Shape Data

For a two-dimensional configuration, it is mathematically convenient to denote the location of a landmark by a complex number, with the real and imaginary parts representing the x- and y-coordinates of the landmark, respectively. Let X and Y be two configurations of interest, each consisting of K( ⩾ 3) landmarks. With the complex-value representation, both X and Y are elements in the K-dimensional complex space, CK. More specifically, X = (X 1, …, XK)t and Y = (Y 1, …, YK)t, where Xl, YlC1 correspond to the lth landmark in the configurations, for l = 1, …, K. To match X onto Y via OPA, the following linear model that relates Y and X is assumed,

Y=β01K+β1X+ϵ, (1)

where β0C1 is the translation parameter, β1(0)C1 is the scale-and-rotation parameter, and multiplying β1 by X in effect scales and rotates X, 1K is the K × 1 vector of ones, and ϵ=(ϵ1,...,ϵK)tCK is the mean-zero random error. The interpretation of the complex multiplication in (1), β1 X, can be made more transparent by looking into the lth landmark (l = 1, …, K), for which β1 Xl is equivalent to ‖β1e iθ Xl, and in real arithmetic,

β1cosθsinθ-sinθcosθRe(Xl)Im(Xl),

where, for tC1, ‖t‖ is the norm of t, Re (t) and Im (t) denote the real and imaginary part of t, respectively, and θ ∈ [0, 2π) is the rotation parameter. Under the model given by (1), matching X onto Y as closely as possible can be formulated as a least-square problem, where one minimizes the squared distance given by Y-β01K-β1X2 over β=(β0,β1)tC2. Here, for ACK, denote by A* the transpose of the conjugate of A, ‖A2 = A*A is the squared Euclidean norm of A.

Solutions to the above least-square problem are the outcomes of OPA (Dryden and Mardia 1998, sec. 5.2). In fact, OPA for two-dimensional shape data can be viewed as the complex version of the least-square method for real-value linear regression. This intimate connection between OPA and real-value linear regression leads us to formulate the upcoming measurement error models for shape data, and further inspires our proposal of the conditional score method, which has successes in drawing inference based on error-contaminated data modeled by real-value regression models.

Instead of assuming X and Y are observed directly as in most existing literature on shape analysis, we consider the scenario where the actual observed configurations result from contaminating the true configurations with measurement error. Measurement error in Y does not cause complications from a modeling perspective, as one may view such measurement error part of ϵ in (1). Caution needs to be taken regarding measurement error in X however, for which the reasons will become clear in Section 3. For notational convenience, henceforth, we only consider X as the error-prone unobserved configuration, and denote W=(W1,...,WK)tCK as an error-contaminated measure of X. More specifically, we assume classical measurement error (Carroll et al. 2006, sec. 1.2) and W relates to X according to

W=X+U, (2)

where U=(U1,...,UK)tCK is the mean-zero nondifferential measurement error (Carroll et al. 2006, sec. 2.5), independent of X and ϵ. Together (1) and (2) give the two component models of the measurement error model for the observed configurations (Y, W).

2.2. Models for Three-Dimensional Size-and-Shape Data

Conventionally, a three-dimensional configuration of K( ⩾ 4) landmarks is denoted by a K × 3 real matrix. Take configuration X as an example, the lth row of X, for l = 1, …, K, is X t l, where X l is an element in the three-dimensional real space R3 that consists of the x-, y-, and z-coordinates of the lth landmark, respectively. A more compact representation of X is attained by using quaternions (Horn 1987; Zhang 1997), which can be viewed as a generalization of complex numbers. Let q = a + bi + cj + dk be a quaternion number in the one-dimensional quaternion space Q1, where i, j, k are three imaginary units, a, b, c, and d are in R1, among which a and (b, c, d) are the real part and imaginary parts of q, respectively. Denote by XQK the quaternion version of X, then X=(X1,...,XK)t, where the real part of XlQ1 is zero and the imaginary parts of it correspond to the three elements in X l, for l = 1, …, K. Similarly, let Y, E, W, and U be the K × 1 quaternion versions of K × 3 real matrices Y, ϵ, W, and U, respectively. Then the classical measurement error model for the observed three-dimensional configurations (Y,W) consists of the following two component models,

Y=γ1K+qXq+E, (3)
W=X+U, (4)

where γQ1 is the translation parameter with the real part equal to zero, qQ1 is the scale-and-rotation parameter, and q is the conjugate of q. For a quaternion q = a + bi + cj + dk, its conjugate is defined as q=a-bi-cj-dk. Appendix A of the supplementary materials provides a brief tutorial of quaternion arithmetic and the geometric interpretation of quaternion multiplication.

Alternatively, one may convert the above measurement error model to its real-value version by using the fact that, for the lth landmark (l = 1, …, K), qXlq is equivalent to Q X l (Kunze and Schaeben 2004), where

Q=a2+b2-c2-d22(bc-ad)2(ac+bd)2(bc+ad)a2-b2+c2-d22(dc-ab)2(bd-ac)2(ab+cd)a2-b2-c2+d2.

This yields the following model equivalent to (3),

Y=1Kβ0t+XQt+ϵ, (5)

where β0R3 is the translation parameter, of which the three elements are the imaginary parts of γ in (3), and ϵ is the K × 3 random error. The real-value version of (4) is simply (2) with W, X, and U now all being K × 3 real-value matrices.

Finally, the conventional OPA for three-dimensional shape data is developed based on the following model,

Y=1Kβ0t+β1XΓ+ϵ, (6)

where β1(>0)R1 is the scale parameter, Γ is a 3 × 3 rotation matrix in SO(3), and SO(3) denotes the special orthogonal group (Dryden and Mardia 1998, sec. 4.1). To match X onto Y via OPA, one minimizes the squared distance given by Y-1Kβ0t-β1XΓ2 over β0, β1, and Γ. Here, for a K × 3 real matrix A, ‖A2 is the squared Euclidean Frobenius norm of A defined by tr(A t A), where “tr” refers to the trace of a matrix. Comparing (5) and (6) reveals that X Q t in (5) corresponds to the same operations on X implemented by β1XΓ in (6), and β1 = a 2 + b 2 + c 2 + d 2.

3. NAIVE PROCRUSTES ANALYSIS

3.1. Bias Analysis in Two-Dimensional Size-and-Shape Matching

Matching two-dimensional X onto Y via OPA yields an estimator of β given by β^=(β^0,β^1)t=(XD*XD)-1XD*Y, where XD=[1KX] is the K × 2 complex-value design matrix associated with (1) (Dryden and Mardia 1998, sec. 3.2). Naive implementation of OPA using the observed configurations (Y, W) results in a naive estimator of β given by β^W=(β^0,W,β^1,W)t=(WD*WD)-1WD*Y, where WD=[1KW].

To study the effects of measurement error on naive OPA in a more concrete setting, we assume landmarks in X, ϵ, and U in (1) and (2) follow complex normal (CN) distributions (Goodman 1963; Kent 1994; Konno 2007). More specifically, it is assumed that, for l = 1, …, K, XlCN(μl,2σx2), ϵlCN(0,2σϵ2), and UlCN(0,2σu2), independently. For a complex normal random variable, its variance being 2σ2 explicitly implies that the variance of the real and imaginary parts of it are both σ2, and these two parts are uncorrelated (Goodman 1963, Example 3.1). Although the assumption of isotropic variance for each of these complex random quantities may not hold in all applications, it is a sensible starting point from which we were able to discover the following results that provide some practically important insights on the impact of measurement error.

Denote by μX=(μ1,...,μK)t the mean of X, where K ⩾ 3. Proposition 3.1 given next is derived under a special case where μX=μ1K for μC1. The follow-up Proposition 3.2 is established under a general setting where one allows μl differ across l = 1, …, K. Expectations appear in both propositions are defined with respect to the distribution of (Y, W) under the above normality assumptions on landmarks.

Proposition 3.1. —

Under the above complex normality assumptions on Xl, εl, and Ul, for l = 1, …, K, if μX=μ1K, where μC1, then

E(β^0,W)=β0+1-σx2σx2+σu2μβ1, (7)
E(β^1,W)=σx2σx2+σu2β1. (8)

Results in Proposition 3.1 are the same in spirit as those derived for real-value simple linear regression with classical measurement error (Fuller 1987, sec. 1.1). The proof, omitted here, is parallel with that given in Fuller (1987, sec. 1.1) except for the change from real normal distributions to complex normal distributions for relevant random variables in our setting (see Du 2012). An important quantity arising from Fuller’s derivations that also emerges in (7) and (8) is the so-called reliability ratio, σ2 x/(σ2 x + σ2 u), denoted by λ, which quantifies the severity of error contamination on the unobserved scalar predictor in simple linear regression. The same interpretation of λ carries over to two-dimensional shape analyses if one assumes that error contamination on all landmarks of a configuration in both x- and y-coordinates are comparable, which is a realistic assumption in many applications. Because λ ∈ [0, 1], (8) indicates an attenuation effect of measurement error on the naive estimator of β1, which is a well-known consequence of ignoring measurement error when estimating the slope parameter in real-value simple linear regression (Carroll et al. 2006, sec. 3.2). In our context of matching two configurations, the attenuation effect translates to underestimating the amount of scaling when matching X onto Y. Because the expectation in (8) is a scalar multiple of β1, the naive rotation is not compromised by measurement error in terms of unbiasedness. Finally, according to (7), measurement error does not compromise (naive) inference on β0 either when μ = 0.

Although it is theoretically reassuring to have Proposition 3.1 in agreement with existing findings in the context of simple linear regression, the assumption of μX=μ1K is overly restrictive for shape data as it forces an object shrink to a point after random noise is removed. It is practically and theoretically more interesting to relax this assumption. However, with μXμ1K, E(β^W) cannot be easily derived in closed form. Proposition 3.2 provides the dominating terms of this expectation when σ2 x + σu 2 is small.

Proposition 3.2. —

Under the above complex normality assumptions on Xl, εl, and Ul, for l = 1, …, K, if μX=(μ1,...,μK)tμ1K, where μC1, then

E(β^0,W)=β0+2σu2μˇXμX,c2K-1-1Kβ1+oσx2+σu2, (9)
E(β^1,W)=β1-2σu2μX,c2K-5+2+2KμX2μX,c2β1+z`oσx2+σu2, (10)

where μˇX=1KtμX/K and μX,c=μX-μˇX1K.

The proof for Proposition 3.2 is relegated to Appendix B of the supplementary materials. Note that, for a centered configuration X, one has μˇX=0 and thus μX=μX,c. In this case, the dominating bias in β^0,W in (9) vanishes. Moreover, according to (10), the dominating bias in β^1,W reduces to

-2σu2μX,c2K-3+2Kβ1, (11)

which is negative for K ⩾ 3, suggesting a negative (dominating) bias in β^1,W. This is reminiscent of the attenuation effect of measurement error on naive scale estimation implied by Proposition 3.1. Furthermore, (11) indicates that, with K, β1, and σ2 u fixed, the attenuation is less noticeable when μX,c is larger, or, equivalently, when the landmarks in the mean configuration μX spread out more around the center. In summary, Proposition 3.2 suggests that ignoring measurement error in the error-prone-centered configuration is less harmful if this unobserved configuration comes from a population whose mean configuration consists of more diffuse landmarks. Otherwise, naive OPA can substantially underestimate β1. As centering configurations (in two or three dimensions) is routinely done in practice, we assume centered X and Y henceforth.

3.2. Bias Analysis in Three-Dimensional Size-and-Shape Matching

The traditional OPA that matches three-dimensional configurations, X and Y, yields the following estimators of the matching parameters appearing in (6) (Dryden and Mardia 1998, sec. 5.2),

β^0=0,Γ^=TVt,β^1=tr(YtXΓ^)tr(XtX), (12)

where T, V ∈ SO(3) result from the singular value decomposition, YtX=YXVΛTt, in which Λ=diag(λ1,λ2,λ3), and λ1 ⩾ λ2 ⩾ |λ3| are square roots of the eigenvalues of X t Y Y t X, among which only λ3 is not necessarily positive (Dryden and Mardia 1998, sec. 4.2). Note that λ3 is negative if and only if det(XtY)<0, and the optimal rotation is unique if λ2 + λ3 > 0, where the eigenvalues are nondegenerate and optimally signed (Kent and Mardia 2001), which we assume throughout. It follows that naive OPA matches W onto Y and yields the following counterpart naive estimators,

β^0,W=0,Γ^W=TWVWt,β^1,W=tr(YWtΓ^W)tr(WtW), (13)

where T W and V W are similarly defined as T and V in (12) with X replaced by W.

In what follows, we view the estimators in (12) as the ideal estimators, which can be computed only when the error-free configurations, that is, the true configurations (Y, X), are observed. Assuming common measurement error variance, σ2 u, for all K( ⩾ 4) landmarks of X in all three (x-, y-, and z-) coordinates, we investigate how the naive estimators compare with the ideal estimators given the true configurations. Our findings are summarized in the following proposition, where the expectation is conditional on (Y, X), which makes the distributional assumption on X l and ϵl (l = 1, …, K) imposed in Propositions 3.1 and 3.2 irrelevant here.

Proposition 3.3. —

Under the assumption of small σu,

Γ^W=Γ^+σuTDVt+σu2TD2Vt/2+Oσu3, (14)
E(β^1,W)=β^1-(3K-2)σu2X2β^1+Oσu3, (15)

where D = [dij]i, j = 1, 2, 3 has elements given by dij = (cjicij)/(λi + λj) if ij, and dij = 0 otherwise, in which cij, for i, j = 1, 2, 3, are elements in C = V t Y t Z T/(‖X‖‖Y‖), and Z = Uu.

The result in (14) is a direct extension of Proposition 3 in Kent and Mardia (2001), and (15) follows from (13) and (14), which are elaborated in the proof provided in Appendix C of the supplementary materials. Note that, given (Y, X), Z is the only random quantity in C, thus the dominating bias terms in (14) are random merely due to the dependence of D on Z (via C). Noticing that Z has mean zero and dij’s are linear in cij’s (i, j = 1, 2, 3), the second term on the right-hand side of (14) has mean zero given (Y, X). This implies that Γ^W is expected to be close to the ideal estimator Γ^ when error contamination is not substantial. In contrast, the dominating bias in β^1,W according to (15) can be more noticeable. More specifically, this dominating bias is always negative, and is smaller in absolute value when ‖X‖ is larger, with K, β^1, and σ2 u fixed. These findings bear obvious resemblance with the conclusions drawn based on the dominating bias in (11) under Proposition 3.2.

The consent of Propositions 3.1–3.3 is that the estimator of the scale parameter resulting from naive OPA is most affected in terms of consistency by measurement error among all estimators involved in matching configurations. This raises concerns especially when the size of objects is the focal point of a study, such as in the study of rat skulls’ growth described in Section 1, which was recently revisited by Mardia et al. (2013) who carried out a Bayesian analysis with size being a key concept in their investigation. In the upcoming section, we derive unbiased score functions for the measurement error models formulated in Section 2, followed by estimating equations based on these scores. The goal is to obtain consistent estimators of the parameters involved in matching X onto Y using error-contaminated data (Y, W). In the sequel, denote by Θ the collection of unknown parameters in the first component model, (1) and (5) (or (3)), of the measurement error model.

4. CONDITIONAL SCORE METHOD

4.1. Conditional Score for Two-Dimensional Shape Data

Following the derivations of conditional score for real-value linear measurement error models in Carroll et al. (2006, sec. 7.2), but using complex normal whenever real normal is used in their derivations, we first establish that, if one views β1, σ2 ε, and σ2 u as known constants and Xl as unknown parameters, then, for l = 1, …, K, Δl=Wl+β1Ylσu2/σϵ2is a sufficient statistic for Xl, where t denotes the conjugate of t for tC1. Then, under the normality assumption on εl and Ul, we derive the first two moments of Yl conditioning on Δl given by E(Yl|Δl,Θ)=(β0+β1Δl)/(1+β12σu2/σϵ2) and var (Yl|Δl,Θ)=2σϵ2/(1+β12σu2/σϵ2), for l = 1, …, K. Finally, using the idea of generalized method of moments (Hansen 1982), we define the following complex-value score function, referred to as the conditional score, for l = 1, …, K,

ψ(Yl,Δl,Θ)=Yl-E(Yl|Δl,Θ)Yl-E(Yl|Δl,Θ)ΔlK-pK-Yl-E(Yl|Δl,Θ)2 var (Yl|Δl,Θ),

where p is the number of parameters in Θ excluding σ2 ε. By construction, ψ(Yl,Δl,Θ) is an unbiased complex vector-value score. Following M-estimation theory (Huber 1967), the solution to the system of score equations, l=1Kψ(Yl,Δl,Θ)=0, is a consistent estimator of Θ, denoted by Θ^ and referred to as the conditional score estimator. A variance estimator for Θ^ can be straightforwardly derived following the sandwich variance construction for M-estimators (Stefanski and Boos 2002).

An appealing feature of the above line of derivation is that its validity does not depend on the distribution of the unobserved configuration X, as {Xl}K l = 1 are viewed as unknown parameters and conditioned out by introducing {Δl}K l = 1.

4.2. Conditional Score for Three-Dimensional Shape Data

To derive the conditional score associated with the three-dimensional measurement error model, we alternate between the quaternion version of the model, that is, (3) along with (4), and the real version given by (5) in conjunction with (2). Similar to the normality assumptions in Sections 3.1 and 4.1, using the real-value representation for a configuration, we assume ϵlN(0,Σϵ) and UlN(0,Σu), for l = 1, …, K, independently, where Σϵ and Σu are 3 × 3 variance-covariance matrices that can be anisotropic. As commented at the end of Section 4.1, which is also implied in the proof in Appendix D in the supplementary materials, no distributional assumption on X l is needed for the validity of the conditional score method.

Under the above distributional assumptions, we first prove in Appendix D that

Δl=Wl+ΣuQtΣϵ-1Yl (16)

is a sufficient statistic for X l if one views X l as a parameter whereas all parameters in (16) are known constants. Then we derive the first two moments of Y l given Δl, which realizes conditioning out X l due to the sufficiency of Δl. These conditional moments are, for l = 1, …, K, E(Yl|Δl,Θ)=β0+Q(I3+ΣuQtΣϵ-1Q)-1(Δl-ΣuQtΣϵ-1β0) and cov (Yl|Δl,Θ)=Σϵ-Q(I3+ΣuQtΣϵ-1Q)-1ΣuQt, where I 3 is the 3 × 3 identity matrix. Next, following the same strategy employed in Section 4.1 for score construction, we obtain the following conditional score for three-dimensional shape data, for l = 1, …, K,

4.2. (17)

where p is the number of parameters in Θ excluding those in Σϵ, and the second (block) component of the score vector uses quaternion multiplication to attain a concise presentation, in which Dl is the quaternion version of Δl. For two quaternions, qr = ar + bri + crj + drk, for r = 1, 2, the quaternion multiplication used in (17) is defined by q 1 q 2 = a 1 a 2b 1 b 2c 1 c 2d 1 d 2 + (b 1 a 2 + a 1 b 2d 1 c 2 + c 1 d 2)i + ( − b 1 d 2 + a 1 c 2 + d 1 b 2 + c 1 a 2)j + ( − c 1 b 2 + d 1 a 2 + a 1 d 2 + b 1 c 2)k. Finally, we have the system of estimating equations, l=1Kψ(Yl,Δl,Θ)=0, the solution to which is a consistent estimator of Θ referred to as the conditional score estimator, whose variance estimator is derived according to the sandwich variance construction for M-estimators.

4.3. Measurement Error Variance Estimation

In Sections 4.1 and 4.2, parameters in the first component model of a measurement error model, (1) and (5) (or (3)), are the only unknown parameters estimated by solving the conditional score equations. In practice, the measurement error variance(s) involved in the second component model, (2) or (4), is (are) typically unknown. Hence, it is necessary to estimate σ2 u or Σu first to implement the proposed methods. It is well understood in the measurement error community that, for linear models, when there is only one error-contaminated measure for each value of the true covariate, the measurement error variance is intrinsically unidentifiable using data (Y, W). When there are multiple measures for each value of the true covariate, Carroll et al. (2006) provided an estimator of the measurement error variance in the context of real-value linear measurement error models (Carroll et al. 2006, eq. (4.3)).

Tailored for shape data, we develop a new strategy for estimating the measurement error variance when replicate measures are available. For notational simplicity, two-dimensional shape data are used next to illustrate the estimation, where σ2 u is the only unknown variance parameter in (2). Moreover, it is assumed that there are two replicate measures for the true configuration X, denoted by W 1, w and W 2, w, where the second subscript “w” is added to distinguish them from the notation for the lth landmark of W, Wl, used in Section 2.1. Viewing each replicate measure as a result of some transformation of X, one may assume that

W1,w=α0,11K+α1,1X+U1,w, (18)
W2,w=α0,21K+α1,2X+U2,w, (19)

where α0,rC1 is the translation parameter, α1,r(0)C1 is the scale-and-rotation parameter, and Ur,wCK is the measurement error in the rth replicate, for r = 1, 2. Combining (18) and (19) to cancel X yields

W2,w=γ01K+γ1W1,w+Uw,[6pt] (20)

where γ0 = α0, 2 − α0, 1α1, 21, 1, γ1 = α1, 21, 1, and U w = U 2, wU 1, wα1, 21, 1. Under the assumption that all 2K landmarks in {U r, w, r = 1, 2} are independent and identically distributed according to CN(0,2σu2), the K landmarks in U w are independent and identically distributed according to CN{0,2σu2(1+γ12)}. Now one may implement OPA to match W 1, w onto W 2, w and obtain estimates of γ0 and γ1, denoted by γ^0 and γ^1, respectively. Then the error variance in (20), that is, 2σ2 u(1 + ‖γ12) as a whole, can be estimated using the mean residual squared distance given by W2,w-γ^01K-γ^1W1,w2/K. It follows that an estimator of σ2 u is σ^u2=W2,w-γ^01K-γ^1W1,w2/{2K(1+γ^12)}.

We conducted extensive simulation studies to experiment with this strategy (results reported in Du 2012). Empirical evidence from these experiments suggests that this strategy can yield very accurate estimate of σ2 u when K is moderate or large (say, K ⩾ 30) and the reliability ratio λ is above 0.5. When there are R( > 2) replicate measures for X, one may follow the above procedure to estimate σ2 u repeatedly using all different pairs of replicates, then take the average of these resulting estimates as the final σ^u2. After a final estimate of σ2 u is computed, one may treat one of the replicate measures of X as W in Section 4.1 and plug in σ^u2 as σ2 u. Alternatively, one may use the average of all R replicate measures of X as W and plug in σ^u2/R as σ2 u when implementing the method proposed in Section 4.1. Parallel arguments carry over to the case with three-dimensional shape data to obtain an estimate of the measurement error variance to be plugged in the conditional scores in Section 4.2. To focus on comparing the conditional score method with naive OPA, we assume the measurement error variance known in the simulation studies presented in the upcoming section.

5. EMPIRICAL EVIDENCE

5.1. Simulation Studies for Two-Dimensional Shape Data

In this section, we present simulation studies under the general setting in Proposition 3.2. We also include simulation studies under the special setting with μX=μ1K as in Proposition 3.1 in Appendix E in the supplementary materials. The goal of these experiments is to empirically illustrate properties of the estimators from naive OPA and the performance of the conditional score method described in Section 4.1. All simulations reported in this article are conducted using R (R Development Core Team 2012) or in SAS 9.2, and the shapes library (Dryden 2012) is used to implement (naive) OPA.

To create true configurations, we first generate {Re(μl), Im(μl)}, for l = 1, …, 10, independently from uniform(− g, g), where g = 5, 10. Using one set of realizations, {μl}10 l = 1, we create a configuration X with landmarks XlCN(μl,2), for l = 1, …, 10. Given one simulated X, another configuration Y is generated according to (1) with β0 = 1 + 2i, β1 = 2 + i, and {εl}10 l = 1 generated independently from CN(0,2). Finally, an error-contaminated version of X, namely, W, is created according to (2) with {Ul}10 l = 1 independently simulated from CN(0,2σu2), where σ2 u is set at different values to attain the reliability ratio λ = 0.5, 0.8, and 1. Note that W coincides with X when λ = 1. Naive estimates, Θ^W=(β^0,W,β^1,W,σ^ϵ,W2)t, and conditional score estimates, Θ^=(β^0,β^1,σ^ϵ2)t, of the parameters in (1) are obtained based on (Y, W) at each g–λ combination. Summary statistics of these estimates from 1000 Monte Carlo (MC) replicates are provided in Table 1.

Table 1 . Averages of naive estimates and averages of conditional score estimates across 1000 Monte Carlo replicates from the simulation study in Section 5.1. Numbers in parentheses are Monte Carlo standard errors of the averages. True parameter values are Re(β0) = 1, Im(β0) = 2, Re(β1) = 2, Im(β1) = 1, and σ2 ε = 1. “Naive” refers to naive estimates; “CSE” refers to conditional score estimates.

  Re(β0) Im(β0) Re(β1) Im(β1) σ2ε
  Elements in μX are generated from uniform( − 5, 5)
  λ = 0.5
Naive 0.96 (0.01) 2.10 (0.01) 1.82 (0.00) 0.91 (0.00) 5.42 (0.03)
CSE 1.01 (0.01) 2.03 (0.01) 1.99 (0.00) 1.00 (0.00) 1.26 (0.03)
  λ = 0.8
Naive 1.00 (0.01) 2.02 (0.01) 1.95 (0.01) 0.98(0.00) 2.17 (0.01)
CSE 1.01 (0.01) 2.00 (0.01) 2.00 (0.00) 1.00(0.00) 1.09 (0.01)
  λ = 1
Naive 0.99 (0.01) 2.00 (0.00) 1.99 (0.00) 1.00 (0.00) 0.98 (0.00)
CSE 0.99 (0.01) 2.00 (0.00) 2.00 (0.00) 1.00 (0.00) 1.04 (0.01)
  Elements in μX are generated from uniform( − 10, 10)
  λ = 0.5
Naive 0.92 (0.01) 2.02 (0.01) 1.95 (0.01) 0.97 (0.01) 5.77 (0.03)
CSE 1.02 (0.01) 2.02 (0.01) 2.00 (0.00) 1.00 (0.01) 1.30 (0.03)
  λ = 0.8
Naive 0.97 (0.01) 1.99 (0.01) 1.99 (0.01) 0.99 (0.00) 2.20 (0.01)
CSE 0.99 (0.01) 1.99 (0.01) 2.00 (0.01) 1.00 (0.00) 1.10 (0.01)
  λ = 1
Naive 1.00 (0.01) 1.99 (0.00) 2.00 (0.00) 1.00 (0.00) 0.98 (0.00)
CSE 1.00 (0.00) 2.00 (0.00) 2.00 (0.00) 1.00 (0.00) 1.04 (0.00)

When λ is as low as 0.5, results in Table 1 suggest significant bias in β^0,W and substantial attenuation in β^1,W. As λ increases, that is, as error contamination lessens, the bias in β^W becomes less significant. The comparison between the naive estimates when g = 5 (upper half of Table 1) with those when g = 10 (lower half of Table 1) indicates that the former β^W are more biased than the latter. This comparison reinforces the implication of the dominating bias in (11) under Proposition 3.2, which is that β^W is less compromised by measurement error when μl’s are more variable across l = 1, …, K. In contrast, the conditional score estimates β^ exhibit performance one would expect for a consistent estimator at all levels of g–λ combination.

In Appendix F in the supplementary materials, we present simulation studies to illustrate the sandwich standard error estimates for conditional score estimators as stated in Section 4. Empirical evidence there indicates that the sandwich standard error estimators are reliable when K is not small (say, K ⩾ 30).

5.2. Simulation Studies for Three-Dimensional Shape Data

For three-dimensional shape data, each landmark μl(Q1) in μX consists of three imaginary parts, for l = 1, …, K, resulting in a total of 3K real numbers in the entire mean configuration μX. To control the spread of μl’s across l = 1, …, K, we generate these 3K real numbers independently from uniform(− g, g), where g varies from 0 to 10, allowing one to observe the effect of the spread among the landmarks in μX on different estimates. Given a set of simulated {μl}K l = 1, X l is generated from N(E(X l), I 3), where the three imaginary parts of μl constitute E(X l), for l = 1, …, K with K = 30. Then, based on a realization of X, we generate Y according to (6) with β1 = 2.25, Σϵ=σϵ2I3 with σ2 ε = 1, and Γ as the rotation matrix corresponding to the unit axis of rotation given by (0.5, 0.33, 0.8)t and the angle of rotation equal to 60°. Finally, W is obtained based on (2) with Σu to be specified in detail next. Slightly different from simulations in Section 5.1, here we acknowledge the fact that the true configurations (Y, X) have mean zero and set β0 at zero in (5) and (6), rather than estimating β0. This leaves one with unknown parameters including the scale parameter β1, the rotation Γ, and the model error variance σ2 ε to be estimated. Using each of 1000 pairs of configurations (Y, W) at a fixed gΣu combination, we implement naive OPA and the conditional score method to obtain two sets of estimates for the unknown parameters.

With g = 5, Figure 1 depicts the MC average of β^1 and that of β^1,W versus λ (left panel), the same comparison for estimates of σ2 ε (middle panel), and finally Γ^-Γ and its naive counterpart (right panel), when Σu=σu2I3 with σ2 u varying over a range to produce λ increasing from 0.8 to 1 (in Figure 1(a)) or when Σu is equal to (in Figure 1(b))

h-10.250.20.250.20.40.30.250.30.35, (21)

where h varies from 1 to 30. As a multivariate generalization of the reliability ratio, here we define λ=tr{(Σx+Σu)-1Σx}/3, where Σx is the variance-covariance of X l, which is equal to I 3 in this experiment. As h increases from 1 to 30, Σu given in (21) results in λ varying from 0.800 to 0.989. Figure 2 presents the same comparisons between estimates versus g when λ = 0.85, which is obtained by first setting Σu=(3/17)I3 (in Figure 2(a)), and second (in Figure 2(b)) setting Σu at

0.19-0.1-0.2-0.10.20.15-0.20.150.25. (22)

In both Figures 1(a) and 2(a), with an isotropic diagonal Σu, estimates of Γ from two methods are very similar across the board, a phenomenon that reconciles with the comments made on (14) following Proposition 3.3. But when the assumption of isotropic and independent measurement error along all three coordinates is violated, as in both Figures 1(b) and 2(b), Γ^W is more compromised by measurement error. As for the other parameters, Figure 1 suggests that naive estimates are more adversely affected by measurement error at lower levels of λ, and the conditional score estimates are much more robust and accurate across different λ. Moreover, Figure 2 reveals that a more diffuse μX can substantially alleviate bias in the naive estimates of the matching parameters due to measurement error. This observation is consistent with the implication of (15) under Proposition 3.3.

Figure 1 . Averages of naive estimates (dashed lines) and averages of conditional score estimates (solid lines) across 1000 Monte Carlo replicates versus reliability ratio λ from simulations in Section 5.2, with g = 5, (a) Σu=σu2I3, and (b) Σu given in (21). Dotted lines correspond to the true parameter values.

Figure 1

Figure 2 . Averages of naive estimates (dashed lines) and averages of conditional score estimates (solid lines) across 1000 Monte Carlo replicates versus g from simulations in Section 5.2, with λ = 0.85 resulting from (a) Σu=(3/17)I3, and (b) Σu given in (22). Dotted lines correspond to the true parameter values.

Figure 2

6. APPLICATION TO REAL DATA

In what follows, we apply both naive OPA and the conditional score method to three real data examples, with the first two concerning two-dimensional configurations and the third considering three-dimensional configurations.

Example 1 —

(Rat skulls). A detailed description of the data of rat skulls brought up in Section 1 is given in Bookstein (1991). Besides Kenobi, Dryden, and Le (2010) and Mardia et al. (2013), many researchers have analyzed the data assuming the recorded skull configurations are precise (Goodall and Lange 1989; Monteiro 1999; Le and Kume 2000; Kent et al. 2001). For illustration purposes, we assume that the skull configurations, each consisting of eight landmarks, are contaminated with measurement error. The data include information on 18 rats, whose skulls were X-rayed at ages 7, 14, 21, 30, 40, 60, 90, and 150 days. For each rat, viewing the first (at age 7 days) recorded two-dimensional skull configuration as W and each of the latter seven recorded skull configurations as Y at that time point, we match W onto Y at each of the seven later time points, first by implementing OPA naively, then using the conditional score method. Because the skull of each rat was X-rayed only once at each time point, the measurement error variance is unidentifiable in this study. As typically done in the measurement error literature when σ2 u cannot be estimated, we conduct a sensitivity analysis by setting σ2 u at different values, and we monitor how the naive inference compares with inference from the conditional score method at different assumed levels of σ2 u. More specifically, at each time point starting from age 14 days, we use data of all 18 rats to estimate σ2 x + σ2 u. Then we set σ2 u at two levels to attain the estimated λ equal to 0.5 and 0.9. From a practical standpoint, λ = 0.5 suggests severe measurement error contamination, and λ = 0.9 implies moderate error contamination.

Figure 3 presents the estimated scale resulting from naive OPA and that from the conditional score method across seven time points at different levels of λ for two randomly selected rats. The pictorial comparison between β^1 and β^1,W highlights one key finding in Section 3, which is that naive OPA tends to underestimate the scale parameter.

Figure 3 . Naive estimates (dashed lines) and conditional score estimates (solid lines for λ = 0.5, dotted lines for λ = 0.9) of the scale parameter ‖β1‖ at seven time points based on the rat data in Example 1.

Figure 3

As pointed out by a referee, a natural extension of the above analysis is to embrace the longitudinal nature of the study and carry out a longitudinal size and shape analysis for the entire sample of 18 rats. For this extension, one may consider a longitudinal model in a very similar spirit as the longitudinal functional model proposed by Greven et al. (2010). This is beyond the scope of our current study, and we provide a brief discussion in Appendix G in the supplementary materials on a longitudinal model for configurations and its connection with the longitudinal functional model in Greven et al. (2010).

Example 2 —

(Brain templates). In this example, we consider template matching in medical imaging where candidate templates of brain images are generated by an automatic algorithm. For each axial MRI brain scan, eight landmarks are estimated on the corpus callosum according to a graphical template algorithm (Amit 1997; Dryden 2003), and the resulting candidate templates are subject to estimation error. Because, given one slice of the brain scan, these candidate templates can be viewed as error-contaminated measures of the same underlying structure in the corpus callosum, the true scale parameter when matching one template onto the other, while appropriately accounting for measurement error, is equal to one. Using all available 604 candidate templates corresponding to the same slice of brain scan, we obtain σ^u20.002 and σ^x2+σ^u20.015, yielding an estimated reliability ratio as 0.860. Then we choose a particular pair of templates as Y and W. Matching W onto Y via naive OPA yields the estimated scale as 0.951 (0.061) and the estimated rotation as 2.116° (2.956), with sandwich standard error estimates in parentheses, and the counterpart conditional score estimates are 0.999 (0.062) and 2.116° (2.947). With such a small number of landmarks (K = 8), high variability in these estimates is inevitable. Although we cannot conclude statistically significant discrepancy between the naive scale estimate and conditional score estimate (with the high variability in estimates), we observe across all 182,106 pairs (from 604 templates with Y and W assigned at random for each pair) that the former is never larger than the latter. Figure 4 depicts β^1,W/β^1 versus β^1 when matching 4950 pairs resulting from the first 100 templates.

Figure 4 . Ratio of β^1,W over β^1 versus β^1 based on 100 brain templates, resulting in 4950 pairs, in Example 2. Estimated standard errors of β^1 (s.e.) associated with different numerical symbols are: s.e.∈ (0, 0.1) for “1,” s.e.∈ [0.1, 0.5) for “2,” s.e.∈ [0.5, 1) for “3,” s.e.∈ [1, 2) for “4,” s.e.∈ [2, 3) for “5,” and s.e.∈ [3, 4.21) for “6.”.

Figure 4

Example 3 —

(Face data). Evison et al. (2010) analyzed three-dimensional facial images of healthy volunteers, which were collected using a Geometrix FaceVision® FV802 Biometric Camera (ALIVE Tech, Cumming, GA). To demonstrate the conditional score method for three-dimensional shape data in comparison with naive OPA, we implement these two methods using data from one volunteer, whose facial configuration consisting of 61 landmarks was measured twice by each of the two operators in the study.

More specifically, denote by W 1, w and W 2, w the two replicate measures for this volunteer’s face collected by one operator, and by Y a facial configuration of the same volunteer measured by the other operator. Using the duplicate measures, W 1, w and W 2, w, and applying the method related in Section 4.3, we obtain σ^u20.007. This estimate of the measurement error variance, in contrast to the estimate of σ2 x + σ2 u (≈ 5.216), results in the estimated reliability ratio equal to 0.999, suggesting minimal error contamination in the observed facial images. This is also implied by Evison et al. (2010), who stated that the three-dimensional coordinates of landmarks were collected with high precision using a software tool. The estimated scale resulting from matching W 1, w onto Y using naive OPA and the conditional score estimate are 0.988 (0.006) and 0.990 (0.006), respectively, with the corresponding estimated standard errors in parentheses. As expected when measurement error is scarce, these two estimates are very similar. To simulate a scenario with slightly more error contamination as expected from CCTV cameras in the field, we add isotropic measurement error to W 1, w to reduce the reliability ratio to 0.9, and repeat the two analyses using the further contaminated configuration. This round of analyses yields the naive estimate of the scale as 0.845 (0.020) and the conditional score estimate as 0.920 (0.024), which demonstrates that naive OPA leads to an attenuated scale estimate.

7. DISCUSSION

We have shown theoretically and empirically that comparisons of two error-contaminated configurations via naive OPA can be misleading, with the adverse effects of measurement error mostly manifested in the attenuated scale estimation. Besides the severity of error contamination, the interplay of several factors that affect the degree of deterioration in inference from naive OPA is revealed in our bias analyses in Section 3. One noteworthy finding is that ignoring measurement error does less harm when the unobserved true configuration consists of more diffuse landmarks. Comforting as this finding sounds, one may not know with absolute certainty if the true configuration has landmarks diffuse enough to counteract measurement error in one particular application. The proposed conditional score method that accounts for measurement error and yields consistent inference for all parameters of interest is a valuable addition to the methodology available for size and shape analyses.

Formulating the measurement error models for shape data using complex/quaternion random variables allows us to derive the conditional scores in ways that are similar to those employed in the context of real-value linear measurement error models. However, although it is assumed in most real-value regression analyses that (Yl, Xl, Wl) are independent across l = 1, …, K, it is more natural to view (Yl, Xl, Wl) correlated across l = 1, …, K for shape data. This correlation creates some technical challenges in studying the bias of naive estimators but not in deriving the conditional scores, as knowledge about such correlation is neither needed nor used in constructing conditional scores. But now that correlation does exist, it is worth considering next how to construct new unbiased scores that account for such correlation. By taking into account the correlation among landmarks within a configuration, the new scores can be more efficient than the conditional scores proposed in the current study.

When more than two shapes are of interest, generalized Procrustes analysis, or GPA, can be used to estimate the mean size and shape (Dryden and Mardia 1998, sec. 5.3). In the presence of measurement error, one may fit the conditional score method into the algorithm of GPA where each error-prone shape is matched onto the estimated mean shape obtained at each iteration. But this becomes unnecessary if the main goal is to estimate the mean size and shape of these error-prone configurations, as ignoring measurement error does not affect the outcome of (naive) GPA regarding mean shape estimation in terms of consistency. This argument is in line with that (naive) predictions are not affected by measurement error, except for efficiency, in real-value linear measurement error models. Regardless, conditional score estimates of the matching parameters can be useful in the case, say, when an accurately measured rat skull becomes available and one wishes to predict the size and shape of this skull at a future time point. Assuming this rat and the rats in Example 1 in Section 6 come from the same population, then using the conditional score estimates of the matching parameters obtained there for prediction of this new rat skull is clearly more appealing than using the estimates from naive OPA.

As suggested by a referee, instead of the additive measurement error in (2), one may view the true configuration X as a set of realizations of the function, X(d), where d(D) is an argument of the function representing the true location of a landmark; then one may view the observed configuration W as another set of realizations of the same function, X(dW), where dW(D) is a noisy version of the target location d. Adopting this viewpoint, one needs a model that relates dW and d, which may lead to a model connecting W and X. Either using (2) or this strategy, with appropriate measurement error models formulated for error-prone shape data, the large battery of methodologies in the existing literature on measurement error can be potentially useful hints for solving other problems in shape analyses in the presence of measurement error. These include the widely adopted regression calibration and SIMEX (Carroll et al. 2006, chaps. 4 and 5), and other methods under the Bayesian framework (Carroll et al. 2006, chap. 9). The study reported in this article opens up this new avenue of research in shape analysis.

SUPPLEMENTARY MATERIALS

The supplement to this article contains Appendices A–G referenced in Sections 2.2, 3, 4.2, 5.1, and 6.

REFERENCES

  1. Amit Y. Graphical Shape Templates for Automatic Anatomy Detection With Applications to MRI Brain Scans. IEEE Transactions on Medical Imaging. 1997;16:28–40. doi: 10.1109/42.552053. [DOI] [PubMed] [Google Scholar]
  2. Booktein F.L. Morphometric Tools for Landmark Data: Geometry and Biology. Cambridge: Cambridge University Press; 1991. [Google Scholar]
  3. Carroll R.J. Ruppert D. Stefanski L.A. Crainiceanu C.M. Measurement Error in Non-Linear Models: A Modern Perspective. (2nd ed.), Boca Raton, FL: Chapman and Hall/CRC; 2006. [Google Scholar]
  4. Dryden I.L. Mathe-matical Methods in Computer Vision. New York: Springer-Verlag; 2003. Statistical Shape Analysis in High-Level Vision, [Google Scholar]
  5. ——— . shapes Package. Vienna, Austria: R Foundation for Statistical Computing; 2012. Contributed package. Available at http://www.R-project.org. [Google Scholar]
  6. Dryden I.L. Mardia K.V. Statistical Shape Analysis. Chichester: Wiley; 1998. [Google Scholar]
  7. Du J. Ph.D. disserta-tion, Department of Statistics, University of South Carolina; 2012. Measurement Error Models in Shape Analysis, [Google Scholar]
  8. Evison M.P. Dryden I.L. Fieller N. Mallett X.D. Morecroft L. Schofield D. Bruegge R.V. Key Parameters of Face Shape Variation in 3D in a Large Sample. Journal of Forensic Sciences. 2010;55:159–162. doi: 10.1111/j.1556-4029.2009.01213.x. [DOI] [PubMed] [Google Scholar]
  9. Fuller W.A. Measurement Error Models. New York: Wiley; 1987. [Google Scholar]
  10. Goodall C. Procrustes Methods in the Statistical Analysis of Shape. Journal of the Royal Statistical Society, Series B. 1991;53:285–339. [Google Scholar]
  11. Goodall C.R. Lange N. Proceedings of the 21st Symposium on the Interface Between Computing Science and Statistics. 1989. Growth Curve Models for Correlated Triangular Shapes; pp. 445–454. with discussion. [Google Scholar]
  12. Goodman N.R. Statistical Analysis Based on a Certain Multivariate Complex Gaussian Distribution. The Annals of Mathematical Statistics. 1963;34:152–177. an introduction. [Google Scholar]
  13. Greven S. Crainiceanu C. Caffo B. Reich D. Longitudinal Functional Principal Component Analysis. Electronic Journal of Statistics. 2010;4:1022–1054. doi: 10.1214/10-EJS575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Hansen L.P. Large Sample Properties of Generalized Method of Moments Estimators. Econometrica. 1982;50:1029–1054. [Google Scholar]
  15. Horn B.P. Closed-Form Solution of Absolute Orientation Using Unit Quaternions. Journal of the Optical Society of America, Series A. 1987;4:629–642. [Google Scholar]
  16. Huber P.J. The Behavior of Maximum Likelihood Estimates Under Nonstandard Conditions. Proceedings of the 5th Berkeley Symposium. 1967;1:221–233. [Google Scholar]
  17. Kendall D.G. Shape Manifolds, Procrustean Metrics, and Complex Projective Spaces. Bulletin of the London Mathematical Society. 1984;16:81–121. [Google Scholar]
  18. Kenobi K. Dryden I.L. Le H. Shape Curves and Geodesic Modeling. Biometrika. 2010;97:567–584. [Google Scholar]
  19. Kent J.T. The Complex Bingham Distribution and Shape Analysis. Journal of the Royal Statistical Society, Series B. 1994;56:285–299. [Google Scholar]
  20. Kent J.T. Mardia K.V. Shape, Procrustes Tangent Projections and Bilateral Symmetry. Biometrika. 2001;88:469–485. [Google Scholar]
  21. Kent J.T. Mardia K.V. Morris R.J. Aykroyd R.D. Functional Models of Growth for Landmark Data, In: Mardia K.V., editor; Aykroyd R.G., editor. Proceedings in Functional and Spatial Data Analysis. Leeds: University of Leeds; 2001. pp. 109–115. [Google Scholar]
  22. Konno Y. Estimation of Normal Covariance Matrices Parameterized by Irreducible Symmetric Cones Under Stein’s Loss. Journal of Multivariate Analysis. 2007;98:259–316. [Google Scholar]
  23. Kunze K. Schaeben H. The Bingham Distribution of Quaternions and Its Spherical Radon Transform in Texture Analysis. Mathematical Geology. 2004;36:917–943. [Google Scholar]
  24. Le H. Kume A. Detection of Shape Changes in Biological Features. Journal of Microscopy. 2000;200:140–147. doi: 10.1046/j.1365-2818.2000.00744.x. [DOI] [PubMed] [Google Scholar]
  25. Mardia K.V. Fallaize C.J. Barber S.J. Jackson R.M. Theobald D.L. Bayesian Alignment of Similarity Shapes. The Annals of Applied Statistics. 2013;7:989–1009. doi: 10.1214/12-AOAS615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Monteiro L.R. Multivariate Regression Models and Geometric Morphometrics: The Search for Causal Factors in the Analysis of Shape. Systematic Biology. 1999;48:192–199. doi: 10.1080/106351599260526. [DOI] [PubMed] [Google Scholar]
  27. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2012. [Google Scholar]
  28. Stefanski L.A. Boos D.D. The Calculus of M-Estimation. The American Statistician. 2002;56:29–38. [Google Scholar]
  29. Zhang F. Quaternions and Matrices of Quaternions, Linear Algebra and Its Applications. 1997;251:21–57. [Google Scholar]

Articles from Journal of the American Statistical Association are provided here courtesy of Taylor & Francis

RESOURCES