Abstract
The analysis of ratios of body measurements is deeply ingrained in the taxonomic literature. Whether for plants or animals, certain ratios are commonly indicated in identification keys, diagnoses, and descriptions. They often provide the only means for separation of cryptic species that mostly lack distinguishing qualitative characters. Additionally, they provide an obvious way to study differences in body proportions, as ratios reflect geometric shape differences. However, when it comes to multivariate analysis of body measurements, for instance, with linear discriminant analysis (LDA) or principal component analysis (PCA), interpretation using body ratios is difficult. Both techniques are commonly applied for separating similar taxa or for exploring the structure of variation, respectively, and require standardized raw or log-transformed variables as input. Here, we develop statistical procedures for the analysis of body ratios in a consistent multivariate statistical framework. In particular, we present algorithms adapted to LDA and PCA that allow the interpretation of numerical results in terms of body proportions. We first introduce a method called the “LDA ratio extractor,” which reveals the best ratios for separation of two or more groups with the help of discriminant analysis. We also provide measures for deciding how much of the total differences between individuals or groups of individuals is due to size and how much is due to shape. The second method, a graphical tool called the “PCA ratio spectrum,” aims at the interpretation of principal components in terms of body ratios. Based on a similar idea, the “allometry ratio spectrum” is developed which can be used for studying the allometric behavior of ratios. Because size can be defined in different ways, we discuss several concepts of size. Central to this discussion is Jolicoeur's multivariate generalization of the allometry equation, a concept that was derived only with a heuristic argument. Here we present a statistical derivation of the allometric size vector using the method of least squares. The application of the above methods is extensively demonstrated using published data sets from parasitic wasps and rock crabs.
Keywords: Allometry, Chalcidoidea, Hymenoptera, LDA ratio extractor, morphometry, multivariate statistics, PCA ratio spectrum
The use of ratios of measurements (i.e., of body proportions), has a long tradition and is deeply ingrained in morphometric taxonomy (Reyment et al. 1984, Winston 1999, Lestrel 2000, Schuh and Brower 2009). In many animal groups, the indication of such ratios is a standard of species descriptions, diagnoses, or identification keys (Mayr and Ashlock 1991). This is especially true for many arthropods, where ratios are a convenient means for distinguishing between morphologically similar species which often differ significantly in body proportions but not in qualitative characters. In certain insect groups, such as parasitic wasps, numerous ratios are routinely reported (e.g., Townes and Townes 1981, Kasparyan 1989, Noyes 2004, Horstmann 2009) and sometimes up to 30 ratios form the main body of a species description (see, e.g., Graham 1969, Graham 1991). Often the use of ratios is rather implicit in descriptive terms, for instance, when leaves are described as being “narrow” or “broad,” both attributes that could be translated into ratios without loss of information. In fact, botanists use numerous such terms for various plant parts that could be partly or wholly substituted by ratios (Stuessy 2009). Ratios are also used for phylogenetic analysis where they are treated as continuous characters (Thiele 1993, Wiens 2000, Rae 2002, Goloboff et al. 2006).
Besides tradition and ease of application, the widespread use of ratios is certainly related to a common way of looking at the shape of organisms. A taxonomist who notices similarity or dissimilarity in proportions of two specimens can always adequately translate them into a series of ratios. Any two individuals are then recognized as having the same shape (i.e., the same body proportions), when all measurements differ by a (positive) constant factor, for instance, when all of them are doubled. It does not matter if a head length to width ratio is, say, 2:4 mm or 4:8 mm, as long as the ratio (0.5) is the same, the shape (as captured by the ratio) is the same. The geometric shape expressed by ratios is thus invariant for a particular measure of size (Mosimann 1970).
Often it is useful to go one step further and analyze more than two linear distances in a single analysis with the help of multivariate statistical methods. Over the past decades, a wide array of tools has been developed in the field of multivariate morphometry (Reyment et al. 1984, Marcus 1990, Claude 2008). These methods help to unravel hidden population structure or to arrive at a better differentiation of groups, in other words, they give insights in the multivariate data structure that cannot be achieved solely by ratio analysis. Standard applications are principal component analysis (PCA) and Fisher's linear discriminant analysis (LDA), both with raw data (often transformed into logarithmic scale) as the primary input (see Pimentel 1979 for a readable account for biologists and Sorensen and Foottit 1992 for illustrative applications in insect systematics). Both methods aim to transform the original variables into a new system of coordinate axes, whereby most of the variance is contained in the first two or three axes. Traditionally, the results are then presented as scatter plots. However, the geometric meaning of these plots differs from the one obtained by the analysis of body ratios (Bookstein 1989, Claude 2008).
For this reason, we present versions of the classical LDA and PCA algorithms that are directly adapted to body proportions. In particular, we develop tools that allow us to interpret the numerical results obtained by these multivariate analyses in terms of the body sizes and body proportions of the individuals in question. The first method, adapted to LDA and called the “LDA ratio extractor,” allows the extraction of the ratios that are most informative for distinguishing between two or more groups. In this context, we also introduce a measure for deciding how much of the variation between individuals or groups of individuals is due to shape differences and how much is due to size differences. The second tool, called the “PCA ratio spectrum,” allows the interpretation of principal components in terms of ratios. In a similar manner, the “allometry ratio spectrum” can be used to assess the extent of allometric behavior in ratios. Furthermore, we present several concepts of size and discuss their relation to multivariate allometry (Klingenberg 1996). Central to this discussion is allometric size (Jolicoeur 1963), a concept that was derived only heuristically. In the Appendix, we therefore provide a statistical derivation of Jolicoeur's allometric size vector using the method of least squares. Finally, the above methods are illustrated with a data set from parasitic wasps (Baur 2002) and a classic data set from rock crabs (Campbell and Mahon 1974). The former is ideally suited for our purpose as ratios are commonly used in the taxonomy of these wasps (see above). The latter is often used for testing new statistical methods; it is included here because of the strong allometric behavior of certain variables.
The mathematical framework, especially the definition of shape and size used in this paper, is adopted from the work of Mosimann (1970), Darroch and Mosimann (1985), Sampson and Siegel (1985), and Rao and Suryawanshi (1996) who has a long and acknowledged history in morphometry (see, e.g., Pimentel 1979; Reyment et al. 1984, Marcus 1990, Klingenberg 1996, Dryden and Mardia 1998, Richtsmeier et al. 2002, Claude 2008). The papers of Mosimann (1970) and Darroch and Mosimann (1985) established the theoretical foundation for the use of body ratios in multivariate analysis and thus provided an ideal starting point for our methods. Sampson and Siegel (1985) and Rao and Suryawanshi (1996) were more concerned with particular definitions of size and shape. In contrast to these authors, our focus is on interpretation of body proportions rather than mere size and shape. Of course, other concepts for the analysis of size and shape (e.g., Cadima and Jolliffe 1996, McCoy et al. 2006, Claude 2008, Hotz et al. 2010) or the analysis of ratios (e.g., Aitchison 1986 for compositional data) have been proposed, but these are, in our opinion, less suited in our context (see below).
METHODOLOGY
The methods presented below consist of a number of steps that are briefly itemized here. The data are first standardized and transformed into logarithms, then the shape space is defined and a suitable size vector chosen. Based on these steps, the best ratios for separation of groups are extracted using a new algorithm adapted to LDA, called the LDA ratio extractor. Associated with this method is a particular measure that allows us to compare the discriminatory power of size with that of shape. The second new tool, called the PCA ratio spectrum, allows us to interpret the axes of a PCA in terms of ratios. A related method, the allometry ratio spectrum, is suitable for examination of the allometric behavior of ratios. Computation of all examples was done with the R statistical software, version 2.11.1 (R Development Core Team 2010) (for obtaining data sets and R files for all methods presented here, see Supplementary Material section).
As mentioned in the introduction, the mathematical framework adopted here originates from Mosimann (1970) and followers. A statistical framework frequently used in the Earth Sciences is Aitchison's analysis of compositional data, also called simplicial analysis (Aitchison 1986, Pawlowsky-Glahn and Egozcue 2001). Typically, compositional data vectors have positive components that sum up to one: imagine, for instance, a rock composed of three minerals in proportions 20%, 50%, and 30%. The corresponding data points (0.2,0.5,0.3) lie on a so-called simplex. The unit-sum constraint means a loss of 1 degree of freedom and requires special statistical tools, many of which have been developed by John Aitchison and his followers. We chose not to apply simplicial analysis to morphometric body ratios for two main reasons: First, ratios do not naturally satisfy the unit-sum constraint. Second, ratios have a complicated interrelationship not present in compositional data: the ratios a/b and b/c completely determine the ratio a/c. One could, alternatively, renormalize all body measurements to unit sum and thus obtain scale-free data on a simplex. This would free the path to simplicial analysis. However, it is not obvious to us how to extract statistical information about ratios from these renormalized data in a natural way. Also, our variants of LDA and PCA in Euclidean space would first have to be adapted to simplicial data, and it is not obvious how to do this, either. For these reasons, we preferred Mosimann's framework to that of Aitchison.
Standardizing the Data
For certain multivariate methods, it is important to standardize the data beforehand, otherwise, larger variables will dominate the analysis. As an example, let u = (u1,…,up) represent vectors of body measurements associated with N individuals of some animal population. It may happen that u1, say, is many times larger than u2 and u3, and so the ratio u2/u3 will be largely dominated by the ratios u1/u2 and u1/u3. For this reason the variables ui should be transformed in a way that they are all in the same order of magnitude. A convenient way to achieve this is to divide each variable by its geometric population mean (Claude 2008). The transformed variables will be called yi. They and their ratios vary around 1. In this scale, a value of yi = 1.2, for example, means that the individual's corresponding body trait is 20% larger than the (geometric) average over the population (strictly speaking, this standardization is only crucial in PCA but has no impact on LDA).
Space of log-ratios.—
Our interpretation of results from statistical analysis of shape will mainly take place in the space of ratios (or body proportions)
For p variables, there are in principle p2 ratios; observe, however, that only p(p − 1)/2 of these are informative and that even less, namely p − 1, can vary freely.
The relations between ratios being of multiplicative nature, it is common in multivariate morphometry to pass to log-transformed values (Reyment et al. 1984, Klingenberg 1996, Claude 2008). This transformation allows the application of linear statistical methods and furthermore avoids some problems associated with the statistical analysis of ratios (see Hills 1978, in response to Atchley et al. 1976).
We thus denote xi = logyi and
(1) |
Following Aitchison (1983), we call the numbers dijlog-ratios. Note that due to our standardization of the original data, the mean of the variables xj is zero. Also, if rij≈1, we have
and thus the log-ratios roughly correspond to the deviation of the ratios from 100%.
Shape
As mentioned in the introduction, a ratio can be calculated from any two body measurements and be used to describe the form of a specimen. A ratio thus represents one way for defining shape (Claude 2008). Mosimann (1970) generalized this particular concept of shape for many measurements by posing the question, “When do two individuals have the same shape with respect to a finite number of measurements?”. His definitions form the basis for our methods and are in the following formally introduced.
To the (standardized) body measurements y = (y1,…,yp)T of some individual, we would like to assign a set of numbers encapsulating the individual's body shape. We assume that these numbers can be calculated by formulas of the form y1b1y2b2·⋯·ypbp. As shape values should be invariant under scaling λy, the exponents must satisfy the shape restriction
(2) |
Passing to the log-values xi, we define
(3) |
to be the shape function associated to the vector of coefficients b = (b1,…,bp) subject to the shape restriction (2). We will also standardize b to length 1 (‖b‖ = 1). Geometrically, these constraints mean that b is a unit vector at right angles to the vector 1 = (1,…,1)T, that is, it lies in the p − 1 dimensional subspace 1⊥ (“shape space”) orthogonal to the vector 1. If
(4) |
denotes the orthogonal projection onto the shape space 1⊥, then we calculate the shape values (z1,…,zp) according to
(5) |
The vector b represents a direction in shape space, and the shape function β(x) is the scalar product of z with the vector b:
Log-ratios dij are represented by the log-ratio vectors
(6) |
where ei and ej are the i-th and j-th standard base vector in ℝp. We collect these vectors to a set ℬ = {bij}1 ≤ i < j ≤ p. The fact that there are many linearly independent subsets of ℬ spanning 1T reflects the interdependence of body ratios and poses a major problem for the interpretation of statistical results in terms of body proportions. We will address this problem below.
Size
Analogous to shape functions, a size function can be defined. We stipulate a size function to be of the form y1a1y2a2·…·ypap, but this time the exponents fulfill the size restriction
(7) |
Thus, an individual with all body measurements doubled, say, will be twice as large. In terms of the log-values x, we define
(8) |
to be the size function corresponding to the size vector a = (a1,…,ap). Three size vectors have been commonly proposed in the literature: Isometric size, allometric size, and shape-uncorrelated size, whose definitions are presented in the following. Shape-uncorrelated size is discussed here for the sake of completeness. In developing the methodology below, our focus will be on isometric and allometric size.
Isometric size.—
The “democratic” way is to give equal weight to all body measurements. This is tantamount to the choice a0 = (1/p)1, and the size α0(x) = a0Tx is simply the arithmetic mean of x. In many cases, the size α0(x) and the shape values z will show significant correlation over the population. This is a sign of the presence of allometry.
Allometric size.—
Allometry was first observed by Cuvier and intensively studied by Huxley and Teissier for bivariate data (e.g., body weight vs. some body trait); see Gayon (2000) for a short history of allometry. A generalization to multivariate data sets was proposed by Jolicoeur (1963). He arrived at his definition of allometric size in a rather heuristic way, whereas we propose in the Appendix a statistical model that leads to Jolicoeur's generalization in a natural manner. One way to pass from the bivariate to the multivariate case is by putting forth the question: Which is the measure of body size fitting optimally into the set of bivariate allometric power laws
for suitable coefficients di and exponents ci? A mathematically more precise formulation is given in the Appendix. The answer to this question is the size function associated to the size vector aJ spanning the first principal component of the log-values x, a fact that is proved in the Appendix by means of the least squares method. More precisely, aJ: = a1/1Ta1 where a1 is the unit eigenvector of the population covariance matrix Σ = E(xxT) corresponding to the largest eigenvalue λ1: Σa1 = λ1a1.
Shape-uncorrelated size.—
A choice of size function that represents the other extreme to allometric size was proposed by Sampson and Siegel (1985) and by Rao and Suryawanshi (1996). Their size vector aR has the property that size and shape over the population are uncorrelated. The shape-uncorrelated size vector is given by aR: = Σ − 11/1TΣ − 11.
The size vector aR is harder to interpret geometrically than aJ. An interpretation is offered in Rao and Suryawanshi (1996): A unit increase in shape-uncorrelated size represents the same average increase (or decrease) in all the variables x1,…,xp. It is also proved in Rao and Suryawanshi (1996) that aRTx is the only size function that is stochastically independent of shape if x has a multivariate normal distribution. This was already shown by Sampson and Siegel (1985) for linear size functions but it holds even true for nonlinear size functions.
The LDA Ratio Extractor: Selecting the Best Ratios with Discriminant Analysis
As mentioned above, LDA is a standard tool in multivariate morphometry. It often allows to distinguish most similar taxa but the numerical results obtained are then hard to interpret. Our aim is to adapt standard LDA in a way that its results admit a convenient interpretation in terms of the body proportions of the specimens under study. Our algorithm is recursive and the basic idea is as follows. In a first step, the ratio with the largest discriminating power is determined. Then a ratio is chosen that has maximal discriminating power but at the same time is as little correlated as possible to the first ratio. If needed, further ratios can be picked out in the same manner.
Suppose that the values x1,x2 stem from two distinct groups with mean m1,m2, and a common (nonsingular) within-groups covariance matrix Σ. Then Fisher discriminant vector w is determined by
(9) |
and ‖w‖ = 1. The vector w is a mixture of size and shape.
Often taxonomists prefer to perform LDA purely within shape space 1⊥, that is to ignore the effects of size. Hence, the method is presented entirely in the shape space. The common within-groups covariance matrix of the shape values zi = Pxi, i = 1,2, is given by Σ1 = PΣP, which is symmetric and positive definite on the subspace 1⊥. Because it is singular in ℝp, its pseudo-inverse must be used to perform the LDA. By singular value decomposition, there exists an orthogonal transformation matrix O in such a way that
Set . The shape discrimination vector w1 is now determined by
(10) |
It is hard to interpret w1 in terms of body proportions because it is a mixture of ratios and, worse, can be written in infinitely many ways as a linear combination of log-ratio vectors (cf., formula 6) from set ℬ. In the next paragraph, we develop an algorithm that extracts the most informative body ratios for between-groups distinction.
Extracting ratios.—
Let x denote the combined data set in which both groups x1 and x2 have been centered to 0 individually. Thus, E(x) = 0 and var(x) = Σ. The dominant log-ratio vector from ℬ with respect to discrimination between groups is the one that has the largest correlation with w1 in the data set x. More precisely, we consider the correlation coefficients
and set
(11) |
The discriminating power of a vector v∈ℝp can be measured by the standard distance D(v), that is, the difference of the means of vTxi, i = 1,2, divided by the common within-groups standard deviation:
(12) |
The term “standard distance” was introduced in Flury and Riedwyl (1986) (the square of D is sometimes called Rayleigh coefficient). Note that w1 is the vector in 1⊥ that maximizes D(b) among all shape vectors b∈1⊥. By (10) and because Pw1 = w1, we have for any b∈1⊥:
Thus, we observe that b1 defined in (11) has the strongest discriminating power among all log-ratio vectors bij∈ℬ. The highest possible standard distance for discrimination within size-and-shape space ℝp is Dtot: = D(w), where the discrimination vector w is given by (9). It is necessary to list the values
(13) |
in order to get the magnitude of the discriminating power of each ratio. In this listing, the log-ratio bij with the second largest value Dij is likely to be already largely explained by b1 due to the strong correlations between ratios. For this reason, we restrict the shape space 1⊥ to the subspace H2 such that the values bTx for b∈H2 are uncorrelated to b1Tx. It is easy to check that H2 is orthogonal to the vector Σb1. Projection onto H2 is given by the matrix
where M is the p×2-matrix M = [a0|Σb1]. Set Σ2 = P2ΣP2 and calculate the (unit length) discrimination vector w2 according to
where Σ2+ is the pseudo-inverse of Σ2 (which has rank p − 2). Now, let b2 be the log-ratio vector bij that shows largest correlation to w2. Iteration of this procedure leads to the following algorithm to compute the sequence of ratios bi, i = 1,…,p − 1:
Let M1 = a0 and initialize k = 1.
- Set Pk = I − Mk(MkTMk) − 1MkT and Σk = PkΣPk. Determine the pseudo-inverse Σk+ and set
3. Let bk = argmaxbij∈ℬc(bij,wk).
- 4. Add the column Σbk to the matrix Mk:
5. Increase k by one unit (unless i = p − 1), and continue at Step 2.
In practice, only a few iterations will be performed because the first two or three log-ratios b1,b2,… will already explain most of the discrimination between the two groups.
Extracting ratios for multiple groups.—
Suppose we are given K groups (classes) x1,…,xK with means m1,…,mK and a common within-groups covariance matrix Σ. The between-groups covariance matrix is defined by
where m is the total mean and nk is the number of individuals in each group. A frequently used criterion for discrimination in the multiple group case is
The unit vector v1 maximizing Q(·) is the eigenvector of Σ − 1B with largest eigenvalue. The generalization of our two-group algorithm explained above to the multiple group case is the following:
Let M1 = a0 and initialize k = 1.
Set Pk = I − Mk(MkTMk) − 1MkT, Σk = PkΣPk and Bk = PkBPk. Determine the pseudo-inverse Σk+ and let wk be the eigenvector of Σk+B with largest eigenvalue.
- Determine
- 4. Add the column Σbk to the matrix Mk:
5. Increase k by one unit and continue at Step 2.
The philosophy behind this algorithm is exactly the same as in the two-group case: First, we determine the linear discriminant wk and choose the log-ratio vector bij with strongest correlation to wk. Then we project to a subspace of shape vectors that are uncorrelated to all log-ratio vectors that have already been chosen. Again, two or three iterations will be sufficient in practice.
Judging the influence of size.—
As mentioned above, the LDA ratio extractor was developed in the shape space that is convenient for most circumstances. Sometimes, however, it might be informative to know how well particular groups are separated in relation to size. In order to assess how much of the total separation is due to size, we define Dsize: = D(a0)/Dtot and Dshape = D(w1)/Dtot. One can then view the number
(14) |
as a measure of how well size discriminates in comparison with shape.
The PCA Ratio Spectrum: Interpreting Principal Components with Ratios
PCA is a very widely used method in multivariate statistics (Jolliffe 2004). In contrast to LDA, specimens are not assigned to different groups for a PCA but are treated as a single group. The resulting scatterplots can then be used to explore the structure of variation in this group. It might be the case that the pattern recovers groupings based on other sets of characters (qualitative morphology, molecular markers, etc.), which would give them additional weight. Usually, individual principal components are interpreted in terms of the original variables (see Jolicoeur and Mosimann 1960 and Manly 2005 for lucid examples). The method developed below allows an interpretation using ratios. The main ingredient of this method is a diagram that we call the PCA ratio spectrum. It allows the user to immediately read off the dominant ratios as well as their interrelationships (recall that ratios are always interdependent in a complex fashion as their number is larger than the degree of freedom in the data).
The technical details of this method and its theoretical justification are presented below. Let the random vector x with E(x) = 0 and cov(x) = Σ (assumed to be nonsingular) represent body measurements of a given population. The first principal components vector u1 = (ui)i = 1,…,p of the shape values z = Px is the eigenvector of Σ1 = PΣP corresponding to the largest eigenvalue λ1 of Σ1:
For a log-ratio vector bij, we have
(15) |
This fact allows a simple graphical interpretation of the first principal component u in terms of body proportions: The numerical values (coefficients) of the components of u1 are drawn as points on the real line. We call this diagram the PCA ratio spectrum of the vector u1. To a pair of points ui, uj on the spectrum with a large difference corresponds a body proportion log(yi/yj) that contributes substantially to the first principal component; on the other hand, close points on the spectrum contribute little. The PCA ratio spectrum represents a mixture of all body proportions and shows how much each of them contributes to the variation in relation to the others. This can be illustrated with the example given in Figure 2b. As can be seen by their comparable separation in the spectrum, the ratios gaster breadth:gaster length and postmarginal vein:tergum 7 length have similar explaining power for the variance. On the other hand, the ratio eye breadth:scape length has no explanatory power because the corresponding points are very close in the spectrum.
If desired, the same procedure can be applied to the second and following principal components. Let us emphasize again that the method can only be applied in a statistically consistent manner when a PCA is performed within the shape space.
Statistical stability of the PCA ratio spectrum.—
Sometimes it might be useful to test whether the PCA ratio spectrum is statistically stable. Instability occurs when the largest eigenvalue λ1 is not sufficiently distinct from the smaller eigenvalues of Σ1, though this rarely might be the case in practice. In order to obtain confidence intervals for the points ui on the PCA ratio spectrum we assume that the values x and hence z are normally distributed. More precisely: Let be a random sample created from a multivariate normal distribution 𝒩(0,Σ1). Denote by the sample covariance matrix and by the standardized first principal components vector of , pointing in the same half-space as u1. The sampling distribution of is complicated but Anderson has established its large-sample distribution (see theorem 13.5.1 in Anderson 2003). It follows from this result that for sufficiently large sample size n, the marginal distribution of the i-th component of the random vector is approximatively normally distributed according to where
(16) |
Here, λ1 > λ2 ≥ ⋯ ≥ λp − 1 are the positive eigenvalues of the matrix Σ1 (which has rank p − 1) and ui,k are the elements of the matrix U = (u1|⋯|up − 1) formed by the corresponding standardized eigenvectors u1,⋯,up − 1. (The eigenvector up corresponding to λp = 0 is proportional to the isometric size vector a0.) Graphically, we represent the 68% confidence intervals [ui − σi,ui + σi] as perpendicular bars of length 2σi at the corresponding point ui on the spectrum (Fig. 2b). If the interval lengths are not too large compared with the separation of the points on the spectrum—as is the case in Figure 2b—then the spectrum can be considered as statistically stable. Even when the normal assumption is violated, the confidence intervals still give some indication of the stability of the spectrum.
Alternatively, one can also sample the original values z directly from the empirical distribution and obtain similar intervals with a bootstrap. The latter was used for estimating the confidence intervals in Figure 2b.
The Allometry Ratio Spectrum: Assessing Allometric Behavior of Ratios
The idea of a ratio spectrum introduced above is also useful for extracting body ratios that show allometric behavior. For a given size vector (like a0 or aJ), the body ratio that shows the most distinctive allometric growth can be interpreted as the one whose covariance with the body sizes aTx is maximal. We obtain
where we have set Σa = :d = (di)i = 1,⋯,p. Hence, exactly as in the preceding paragraph, the body proportions with strongest allometric growth along the size vector a can be read off the allometry ratio spectrum of d. A reasonable choice of a size vector is Jolicoeur's size vector aJ. In that case, we have d∝aJ and thus the allometric body proportions can be directly determined by the spectrum of aJ. An illustration of such a spectrum is given in Figure 4.
RESULTS
Discriminating Species
As an illustration of how to apply the LDA ratio extractor, we revisit a statistical analysis from Baur (2002) where morphometric data from two species of parasitic wasps were examined, namely the species Pteromalus albipennis Walker, 1835 and P. solidaginis Graham and Gijswijt, 1991 from the Pteromalus albipennis group (Insecta: Hymenoptera: Chalcidoidea). The analysis is based on p = 23 characters (called “head breadth,” “OOL,” “eye height,” etc.) measured on n1 = 32 individuals from P. albipennis (Group 1) and n2 = 19 individuals from P. solidaginis (Group 2), see Baur (2002) for a complete description. The common within-group variance is estimated by
where are the estimated covariance matrices of the two groups.
Before performing LDA, we would like to add a word of caution rarely mentioned in the textbooks: If the total number of individuals n = n1 + n2 is not distinctly larger than the number p of body traits, the results from an LDA can be completely spurious. The reason is that the dimension is large enough that a separating plane is likely to exist between the two groups even if the sample points are completely random. As a rule of thumb, one should always have A theoretical justification of this rule is given in MacKay (p. 490 2003).
By applying the LDA ratio extractor introduced in the Methodology section, we obtain OOL:gaster length as the most discriminating ratio. We get Dsize = 0.064 and Dshape = 0.964, hence δ = 0.063 (cf., formula 14). Thus, discrimination between the groups stems mostly from shape differences. The next discriminating body ratio being as little correlated as possible with OOL:gaster length is eye breadth:marginal vein. Its standard distance Dij (see formula 13) is 2.1 as compared with the standard distance Dij = 5.6 for the first ratio. As can also be seen from the scatterplot in Figure 1a, the discriminating power as compared with the first ratio is already much lower. Figure 1b shows the next two ratios extracted from the algorithm, funicle 1 length:propodeum length and scape length:postmarginal vein, with standard distances Dij = 2.3 and Dij = 1.7, respectively. By looking at the plots in Figure 1a and b, one could be tempted to simply combine the first (OOL:gaster length) with the third ratio (funicle 1 length:propodeum length) to arrive at an even better separation of groups. However, one should bear in mind that these ratios are highly correlated and therefore stand for more or less the same information.
Interpreting Principal Components
Figure 2a shows the results of a PCA on the same data set, but this time the two Pteromalus species were entered in the analysis as a single group. A PCA is always useful for examining the structure of variation in a single population, for instance, when it is difficult to assign specimens to different groups beforehand (Pimentel 1979, Reyment et al. 1984, Claude 2008). It can also give additional weight for groupings based on other features. In this case, the specimens in the scatterplot were labeled as either P. albipennis or P. solidaginis according to qualitative character differences, such as coloration or forewing pilosity, and host plant association (see Graham and Gijswijt 1991). As can be seen from Figure 2a, the first principal component is fully congruent with the separation of species. For the interpretation of this component, the PCA ratio spectrum is displayed in Figure 2b. Most of the variation is explained by ratios like gaster breadth:tergum 7 length that correspond to points lying at the opposite end of the spectrum. On the other hand, ratios formed from characters lying adjacent to each other in the spectrum, like marginal vein:postmarginal vein, explain very little. This is visualized in the scatterplot of the two ratios (Fig. 2c). Of course, also the ratio spectra of the second and third principal component could be drawn and sometimes this might be illuminating as well, for instance, for explaining the structure of variation within each species.
The above analysis exemplifies the use of our methodology in the shape space. Sometimes a researcher might be interested to examine differences in the size of the specimens, for instance, for investigating the influence of ecological parameters or different food regimes on populations (McCoy et al. 2006). Here, one could simply plot the isometric size axis (see Size section above) against the first principal component in shape space. From Figure 3 it is evident that the mean size of Pteromalus solidaginis is smaller, but that its range still lies within P. albipennis.
Assessing Allometry
We will illustrate the use of the allometry ratio spectrum on a classical data set of specimens of the purple rock crab Leptograpsus variegatus (Fabricius, 1793) (Crustacea: Brachyura: Grapsidae) from Western Australia (see Campbell and Mahon 1974). These occur in two color forms, blue and orange. Mahon collected 50 individuals from each color form and from each sex and made five body measurements: carapace length (CL) and width (CW), width of frontal lobe (FL), rear width (RW), and body depth (BD). We calculated the allometric size vectors aJ for the body measurements of the males of both the blue and the orange morph. Figure 4 shows the corresponding allometric ratio spectra for both morphs. As can be seen, the ratio BD:RW shows the largest allometric growth whereas for CL:CW allometry is negligible in both groups. Figure 5 confirms this conclusion: There we display a scatter plot for the orange type males of the isometric sizes versus the log-ratios of BD:RW and CL:CW, respectively. Whereas the first ratio (Fig. 5a) visibly has a strong correlation with isometric size, as is characteristic for allometry as explained in the Methodology section, this is much less the case for the second ratio (Fig. 5b).
It is useful to test allometry versus isometry, that is, to test the null hypothesis that aJ = a0. Such a test, under the hypothesis of normality and relatively large sample size, was developed by Anderson (2003) (see section 11.6.2). Adapted to our situation, the P value of the null hypothesis is given by Prob(χp − 12 > κ) where the test value κ is determined by
Here, Σ is the covariance matrix of the sample x of size n and λ1 is its largest eigenvalue. For the male Leptograpsus, the P values are virtually zero for both color types, hence the null hypothesis that no allometry is present can safely be rejected.
DISCUSSION
As initially mentioned, a number of body measurements are commonly collected in taxonomic research. This mainly serves two purposes. First, the raw or log-transformed variables are entered in some kind of standard multivariate statistical analysis (MVA) for studying character variation and for discrimination of taxa. PCA and LDA are among the methods of choice in this respect and are the ones we refer to with MVA below. Second, the same measurements are integrated in descriptive works, but this time by calculating ratios (indeed, the numerical output from MVA would be far too awkward for inclusion in descriptions and identification keys). Of course, it would be most useful if, say, a discriminant function could be interpreted in terms of ratios that then could be directly used for a species description. One could, for instance, expect some guidelines for the choice of ratios. So far, this was not possible because the two kinds of analysis were not directly comparable (see below). Thus, ratio analysis usually adheres to certain standards established for a particular group, rather than following the insights gained from MVA. A case in point is the study of the Encarsia meritoria species complex (Insecta: Hymenoptera: Aphelinidae) by Polaszek et al. (2004), where some of the best ratios used for species discrimination were not even included in their elaborate PCA and LDA.
The incompatibility of MVA and ratio analysis results from the way, size and shape functions are defined for each method (see Fig. 6 for further details). However, the methods presented here, namely the newly developed LDA ratio extractor and the PCA ratio spectrum, solve these problems by using the same definitions for size and shape. Therefore, the results from MVA can now be interpreted in terms of ratios that, in turn, can be directly incorporated in a variety of descriptive taxonomic works. In fact, a more sophisticated use of ratios may be achieved, as is demonstrated by our application of the LDA ratio extractor to the data set from parasitic wasp species of the family Pteromalidae. Here, the best ratios found for separating the two Pteromalus species were OOL (distance of lateral ocellus to eye margin):gaster (abdomen) length, funicle 1 (antenna) length:propodeum length, etc. (see Results section and Fig. 1). These ratios relate characters from widely separated body parts and differ from those commonly used in the taxonomy of pteromalid wasps. For instance, in Graham (1969), still the standard reference in the field (Grissell and Schauff 1997), ratios are exclusively formed from characters lying adjacent to each other, like eye height:breadth or thorax length:breadth (see also Graham and Gijswijt 1991). Evidently, the variation of such ratios among specimens can—to a certain extent—be judged by eye. However, as demonstrated here, these ratios are apparently not the best ones for discrimination. It is of course very difficult if not impossible to judge by eye the discriminating power of ratios based on widely separated characters, a task that is best done analytically with the help of an algorithm such as the one presented in this paper.
The present methodology can thus easily be embedded in a consistent statistical frame work for the multivariate analysis of morphometric data. In particular, it allows us to interpret the results of a PCA and LDA entirely in terms of ratios, which themselves form the core information of most quantitative taxonomic works. The important point of the new methodology is to determine the shape values and to choose a particular size vector beforehand. For the size function, we mainly considered the isometric size vector a0, except for the allometry ratio spectrum, which relates to Jolicoeur's allometric size vector aJ. Of course, other definitions of shape and size are possible (see Bookstein 1989 for a review). By using the “back-projection” method of Burnaby (1966), some authors (e.g., Klingenberg 1996, McCoy et al. 2006) choose to define their shape values by projecting the log-data x on the space orthogonal to the allometric vector aJ. The reason for this is to transform away shape effects related to allometric growth. According to this view, size is represented by the first, shape by all the following principal components of the log-data. It is, however, unclear how these shape values could be properly interpreted in terms of body proportions; in particular, no ratio-spectrum can be assigned in a mathematically consistent way to “shape” vectors orthogonal to aJ. Moreover, the allometric growth law in its bivariate or multivariate versions is just a convenient statistical model and by no means a “law of nature” (Gould 1966). In our opinion, allometry should rather be treated as a hypothesis to be tested after the size values are determined rather than be incorporated into the framework from the very beginning. We therefore prefer to analyze allometric variation with help of the allometry ratio spectrum, as demonstrated above (see Results section).
Our new methods are obviously rooted in the field of multivariate morphometrics (Reyment et al. 1984). The latter is occasionally dubbed traditional morphometrics (Marcus 1990), as opposed to “modern morphometrics” (Claude 2008) such as the analysis of landmarks (geometric morphometrics, Adams et al. 2004, Zelditch et al. 2004) or outlines (e.g., elliptic Fourier analysis, Lestrel 1989, Lestrel 2000). The main reason why we stay within multivariate morphometrics is simply given by the nature of our data. Landmark and outline data are ideally suited for fixed objects, such as a skull or the body of a fish. For an insect with articulated extremities, those methods are of limited use unless one is willing to study the form of the head, thorax, or wings in separate analyses. This can and should be done. Nevertheless, it is often useful to include measurements from all over the body in a single analysis. For instance, a taxonomist trying to distinguish between two most similar species will be happy about any discriminating character. What if they are best separated by the ratio of, say, the length of the hind leg and the eye height? As we have shown above, it is here where methods of multivariate morphometrics, adapted for the analysis of ratios, could play a major role.
SUPPLEMENTARY MATERIAL
Supplementary Material
Acknowledgments
We thank Ian T. Jolliffe, Institute for Engineering, Computing & Mathematics, Exeter, for critical reading of the manuscript. Seraina Klopfstein, Swedish Museum of Natural History, Stockholm, and two reviewers also made numerous valuable comments and suggestions. We are finally grateful to Yvonne Kranz-Baltensperger, Christian Kropf and Elsa Obrecht, Natural History Museum, Bern, for discussion and useful corrections.
APPENDIX
Statistical Derivation of the Allometric Size Vector
We would like to arrive at an estimation of the allometric size vector starting from a statistical model of the allometric growth hypothesis. Let y be the original data and a size function. According to Huxley (1932), each trait yi when graphed against the individual's size α(y) should satisfy the power law
(A1) |
Here we consider di as positive random variables and ci as constant coefficients. We shall use the approach of least squares to statistically estimate the coefficients ai and ci. Taking logarithms on both sides of (A1), we get
where μi = E(logdi) and E(ϵi) = 0. In vector notation, this reads
with E(ϵ) = 0. Because E(x) = 0, we conclude μ = 0. We estimate a and c in a way that the sum of squares is minimal:
where S(a,c) = E‖ϵ‖2. We have
Calculating vector derivatives with respect to a and c we get
and
Setting both equations equal to 0 and dropping the hats over and , we arrive at the system of equations:
Multiplying the second equation from the left by Σ − 1, solving for c and plugging the result into the first equation, one can see that a is an eigenvector of Σ with eigenvalue
and c = a/‖a‖2. Replacing these results in S(a,c) one gets:
Evidently, this expression is minimal if λ is the largest eigenvalue of Σ. Let a1 denote the unit vector representing the first principal component of the data x. Imposing the size restriction, we arrive at the solution
and cJ = aJ/‖aJ‖2. Historically, Jolicoeur (1963) was the first to introduce a multivariate generalization of Huxley's allometric power law and he proposed our aJ as a measure of size (or rather a1 to be precise). He did not, however, give a statistical model to motivate his definition.
References
- Adams DC, Rohlf FJ, Slice DE. Geometric morphometrics: ten years of progress following the "revolution. Ital. J. Zool. 2004;71:5–16. [Google Scholar]
- Aitchison J. Principal component analysis of compositional data. Biometrika. 1983;70:57–65. [Google Scholar]
- Aitchison J. The statistical analysis of compositional data. Monographs on Statistics and Applied Probability. London: Chapman and Hall; 1986. [Google Scholar]
- Anderson TW. An introduction to multivariate statistical analysis. 3rd ed. New York: Wiley; 2003. [Google Scholar]
- Atchley WR, Gaskins TC, Anderson D. Statistical properties of ratios. I. Empirical results. Syst. Zool. 1976;25:137–148. [Google Scholar]
- Baur H. The power of multivariate statistical methods in the taxonomy of Pteromalidae (Hymenoptera: Chalcidoidea) In: Melika G, Thuróczy C, editors. Parasitic wasps: evolution, systematics, biodiversity and biological control. 2002. Budapest (Hungary): Agroinform. p. 73–81. [Google Scholar]
- Bookstein FL. "Size and shape": a comment on semantics. Syst. Zool. 1989;38:173–180. [Google Scholar]
- Burnaby TP. Growth-invariant discriminant functions and generalized distances. Biometrics. 1966;22:96–110. [Google Scholar]
- Cadima JFCL, Jolliffe IT. Size- and shape-related principal component analysis. Biometrics. 1996;52:710–716. [Google Scholar]
- Campbell NA, Mahon RJ. Multivariate study of variation in two species of rock crab of the genus Leptograpsus. Aust. J. Zool. 1974;22:417–425. [Google Scholar]
- Claude J. Morphometrics with R. Use R! New York: Springer; 2008. [Google Scholar]
- Darroch JN, Mosimann JE. Canonical and principal components of shape. Biometrika. 1985;72:241–252. [Google Scholar]
- Dryden IL, Mardia KV. Chichester (UK): Wiley; 1998. Statistical shape analysis. [Google Scholar]
- Flury BK, Riedwyl H. Standard distance in univariate and multivariate analysis. Am. Stat. 1986;40:249–251. [Google Scholar]
- Gayon J. History of the concept of allometry. Am. Zool. 2000;40:748–758. [Google Scholar]
- Goloboff PA, Mattoni CI, Quinteros AS. Continuous characters analyzed as such. Cladistics. 2006;22:589–601. doi: 10.1111/j.1096-0031.2006.00122.x. [DOI] [PubMed] [Google Scholar]
- Gould SJ. Allometry and size in ontogeny and phylogeny. Biol. Rev. 1966;41:587–640. doi: 10.1111/j.1469-185x.1966.tb01624.x. [DOI] [PubMed] [Google Scholar]
- Graham MWRdV. The Pteromalidae of North-Western Europe. B. Brit. Mus. Nat. Hist. Entomol. Suppl. 1969;16:1–908. [Google Scholar]
- Graham MWRdV. A reclassification of the European Tetrastichinae (Hymenoptera: Eulophidae): revision of the remaining genera. Mem. Am. Entomol. Inst. 1991;49:1–322. [Google Scholar]
- Graham MWRdV, Gijswijt MJ. A new species of Pteromalus (Hymenoptera: Chalcidoidea) from France, associated with Solidago virgaurea. Entomol. Ber. Amst. 1991;51:153–155. [Google Scholar]
- Grissell EE, Schauff ME. A handbook of the families of Nearctic Chalcidoidea. 1997. (Hymenoptera). 2nd ed. Washington (DC): Entomological Society of Washington. [Google Scholar]
- Hills M. On ratios: a response to Atchley, Gaskins, and Anderson. Syst. Zool. 1978;27:61–62. [Google Scholar]
- Horstmann K. Revision of the western Palearctic species of Dusona Cameron (Hymenoptera: Ichneumonidae: Campopleginae) Spixiana. 2009;32:45–110. [Google Scholar]
- Hotz T, Huckemann S, Munk A, Gaffrey D, Sloboda B. Shape spaces for prealigned star-shaped objects: studying the growth of plants by principal components analysis. J.R. Stat. Soc. C Appl. Stat. 2010;159:127–143. [Google Scholar]
- Huxley JS. Baltimore (MD): The Johns Hopkins University Press; 1932. Problems of relative growth (with an introduction by Frederick B. Churchill and an essay by Richard E. Strauss) (reprint edition) [Google Scholar]
- Jolicoeur P. The multivariate generalization of the allometry equation. Biometrics. 1963;19:497–499. [Google Scholar]
- Jolicoeur P, Mosimann JE. Size and shape variation in the painted turtle: a principal component analysis. Growth. 1960;24:339–354. [PubMed] [Google Scholar]
- Jolliffe IT. Principal component analysis. 2nd ed. New York: Springer; 2004. [Google Scholar]
- Kasparyan DR. Leiden. Brill; 1989. Ichneumonidae (Subfamily Tryphoninae), tribe Tryphonini. Fauna of the USSR, Hymenoptera 3. (the Netherlands) [Google Scholar]
- Klingenberg CP. Multivariate allometry. In: Marcus LF, Corti M, Loy A, Naylor GJP, Slice DE, editors. Advances in morphometrics. New York: Plenum Press; 1996. pp. 23–49. [Google Scholar]
- Lestrel PE. A method for analyzing complex two-dimensional shapes: elliptic fourier functions. Am. J. Phys. Anthropol. 1989;72:257–258. [Google Scholar]
- Lestrel PE. Recent advances in human biology. Volume 7. Singapore: World Scientific; 2000. Morphometrics for the life sciences. [Google Scholar]
- MacKay DJC. Information theory, inference, and learning algorithms. Cambridge: Cambridge University Press; 2003. [Google Scholar]
- Manly BFJ. Multivariate staistical methods: a primer. 3rd ed. London: Chapman and Hall; 2005. [Google Scholar]
- Marcus LF. Traditional morphometrics. In: Rohlf FJ, Bookstein FL, editors. Proceedings of the Michigan morphometrics workshop. Special publication 2. Ann Arbor (MI): University of Michigan Museum of Zoology; 1990. pp. 77–122. [Google Scholar]
- Mayr E, Ashlock PD. Principles of systematic zoology. 2nd ed. New York: McGraw-Hill; 1991. [Google Scholar]
- McCoy W, Bolker BM, Osenberg CW, Miner BG, Vonesh JR. Size correction: comparing morphological traits among populations and environments. Oecologia. 2006;148:547–554. doi: 10.1007/s00442-006-0403-6. [DOI] [PubMed] [Google Scholar]
- Mosimann JE. Size allometry: size and shape variables with characterizations of the lognormal and generalized gamma distributions. J. Am. Stat. Assoc. 1970;65:930–945. [Google Scholar]
- Noyes JS. Encyrtidae of Costa Rica (Hymenoptera: Chalcidoidea), 2: Metaphycus and related genera, parasitoids of scale insects (Coccoidea) and whiteflies (Aleyrodidae) Mem. Am. Entomol. Inst. 2004;73:1–459. [Google Scholar]
- Pawlowsky-Glahn V, Egozcue JJ. Geometric approach to statistical analysis on the simplex. Stoch. Environ. Res. Risk Assess. 2001;15:384–398. [Google Scholar]
- Pimentel R.A.. 1979. Morphometrics: the multivariate analysis of morphological data. Dubuque (IA): Kendall/Hunt. [Google Scholar]
- Polaszek A, Manzari S, Quicke DLJ. Morphological and molecular taxonomic analysis of the Encarsia meritoria species-complex (Hymenoptera, Aphelinidae), parasitoids of whiteflies (Hemiptera, Aleyrodidae) of economic importance. Zool. Scripta. 2004;33:403–421. [Google Scholar]
- R Development Core Team. R: a language and environment for statistical computing [Internet] Vienna (Austria): R Foundation for Statistical Computing. 2010 Available from: http://www.R-project.org. [Google Scholar]
- Rae TC. Scaling, polymorphism and cladistic analysis. In: MacLeod N, Forey PL, editors. Morphology, shape and phylogeny. Systematic association special volume series 64. Boca Raton (FL): CRC Press; 2002. pp. 45–52. [Google Scholar]
- Rao CR, Suryawanshi S. Statistical analysis of shape of objects based on landmark data. Proc. Natl. Acad. Sci. U.S.A. 1996;93:12132–12136. doi: 10.1073/pnas.93.22.12132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reyment RA, Blackith RE, Campbell NA. Multivariate morphometrics. 2nd ed. London: Academic Press; 1984. [Google Scholar]
- Richtsmeier J, Deleon VB, Lele SR. The promise of geometric morphometrics. Yearb. Phys. Anthropol. 2002;45:63–91. doi: 10.1002/ajpa.10174. [DOI] [PubMed] [Google Scholar]
- Sampson PD, Siegel AF. The measure of "size" independent of "shape" for multivariate lognormal populations. J. Am. Stat. Assoc. 1985;80:910–914. [Google Scholar]
- Schuh RT, Brower AVZ. Biological systematics: principles and applications. 2nd ed. Ithaca (NY): Cornell University Press; 2009. [Google Scholar]
- Sorensen JT, Foottit R. Ordination in the study of morphology, evolution and systematics of insects. Amsterdam: Elsevier; 1992. applications and quantitative genetic rationals. [Google Scholar]
- Stuessy TF. Plant taxonomy: the systematic evaluation of comparative data. 2nd ed. New York: Columbia University Press; 2009. [Google Scholar]
- Thiele K. The holy grail of the perfect character: the cladistic treatment of morphometric data. Cladistics. 1993;9:275–304. doi: 10.1111/j.1096-0031.1993.tb00226.x. [DOI] [PubMed] [Google Scholar]
- Townes HK, Townes M. A revision of the Serphidae (Hymenoptera) Mem. Am. Entomol. Inst. 1981;32:1–541. [Google Scholar]
- Wiens JJ. Washington (DC): Smithsonian Institution Press; 2000. Phylogenetic analysis of morphological data. [Google Scholar]
- Winston JE. New York: Columbia University Press; 1999. Describing species: practical taxonomic procedures for biologists. [Google Scholar]
- Zelditch ML, Swiderski DL, Sheets HD, Fink WL. Geometric morphometrics for biologists: a primer. Amsterdam: Elsevier; 2004. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.