Abstract
In this paper, we propose a new method for computing a distance between two shapes embedded in three-dimensional space. Instead of comparing directly the geometric properties of the two shapes, we measure the cost of deforming one of the two shapes into the other. The deformation is computed as the geodesic between the two shapes in the space of shapes. The geodesic is found as a minimizer of the Onsager–Machlup action, based on an elastic energy for shapes that we define. Its length is set to be the integral of the action along that path; it defines an intrinsic quasi-metric on the space of shapes. We illustrate applications of our method to geometric morphometrics using three datasets representing bones and teeth of primates. Experiments on these datasets show that the variational quasi-metric we have introduced performs remarkably well both in shape recognition and in identifying evolutionary patterns, with success rates similar to, and in some cases better than, those obtained by expert observers.
Keywords: biological shapes, minimum action path, conformal maps
1. Introduction
The functions of all life forms depend on organization in space and time. Spatial organization involves the interactions of shapes with their surroundings, while organization in time concerns changes in shape. Geometric methods therefore play an essential role in the analyses and simulations of biological systems. This idea has a long history in biology, as illustrated in the work of D'Arcy Thompson [1]. Geometry and topology are now used regularly for representing, searching, simulating, analysing and comparing biological systems. In structural molecular biology, for example, the notion that the structure (or shape) of a protein is a major determinant of its function has led to the development of methods for representing, measuring and comparing protein structures [2–4]. Similar methods have been developed for larger biological systems. Brain morphometry, for example, is concerned with the measurement of the brain geometric structures and the changes they undergo during development, ageing, learning, disease and evolution [5–9].
In a chapter titled ‘The comparison of related forms’, Thompson explored how differences in the forms of related animals can be described by means of simple mathematical transformations [1]. This inspired the development of several shape comparison techniques, whose aim is to define a map between two shapes that can be used to measure their similarity. This is a challenging problem, as the space of possible maps is extremely large and difficult to characterize mathematically. Morphometricians usually attempt to reduce the dimension of this space by enforcing correspondences between specific landmarks. Landmark-based methods that find maps between two shapes work on the premise that knowledge of a mapping on a small number of correspondences can be extended to give the full map between the two surfaces of interest [10–12]. By contrast, landmark-free methods skip the search for landmarks altogether. They rely on a geometric representation of the shape in which some automatically selected points are assigned a signature, under the premise that points with similar signatures are more likely to correspond. Spectral techniques fit in this category. Rustamov, for example, introduces the global point signature of a point on the shape, which encodes both the eigenvalues and the eigenfunctions of the Laplace–Beltrami operator evaluated at that point [13]. Guibas and co-workers [14] introduced the heat kernel signature, a similar robust and multi-scale invariant defined on points on the surface of the shape. Fischl et al. [15,16] introduced an average convexity that they used to drive the alignment of cortical surfaces. McCane [17] developed a variational method for matching curves in two or three dimensions by optimizing their parametrizations. Laga et al. [18] developed statistical models of shapes based on the squared root velocity function. They recently implemented this procedure to study the shapes of plant leaves.
All the methods described above focus on the differences in the geometry of the two shapes to be compared. An alternative approach is to attempt to capture the cost of deforming one shape into the other. This is implemented, for example, in the concept of the earth mover's distance, or Wasserstein metric, which has been used for image retrieval [19] as well as in different contexts for shape comparisons [20–22]. The large deformation diffeomorphic metric mapping method follows a similar concept: it captures a flow of transformations between the two shapes to be compared by introducing a concept of time, with the constraint that those transformations remain diffeomorphic. It has been adapted to matching points [23], curves [24], as well as surfaces [25,26]. Plotkin [27] also considered time when comparing two curves, introducing a variational distance that generalizes the concept of Euclidean distance to high-dimensional objects. This distance is computed as the length of a geodesic on the space of curves, where the geodesic is derived as a solution to a differential equation [27]. It has been used for computing folding pathways for proteins [28], as well as for aligning the curves representing protein structures [29]. None of those methods, however, consider physical constraints in the deformation process between the two objects of study. This can lead to non-physical transformations, such as the appearance of knots when comparing two curves [30].
The problem of non-physical transformations mentioned above highlights in fact the actual challenges that need to be solved: generating a physically sound geodesic flow between two shapes that leads to an actual metric in the space of such shapes. Addressing the former challenge, namely the generation of a physically sound trajectory, usually involves designing energy functions that yield deformation paths with a specified behaviour [31]. Examples of such energies include functions that imitate a physical energy dissipation induced by the deformation of a viscous material [32]. Other measures capture stretching and bending variations of the surface (or thin layers around this surface) of the shape [33–35]. Once the energy function is derived, the best deformation path is chosen as the path that requires the least total energy. Finding such a path usually relies on variational mechanical approaches [34,36,37]. Our motivation in this work is to expand on this concept of deriving physically sound paths between two shapes, focusing mostly on designing an efficient variational approach for generating such paths that is based on the concept of minimum action paths.
We propose a new distance between shapes that captures the complexities of deforming one of the two shapes into the other. Specifically, we consider the generalization of the conventional notion of distance to polymeric objects introduced by Plotkin [27], with two major differences. First, we extend its applications from curves to surfaces. Second, we modify the functional for computing the geodesic distance with the introduction of an energy term to enforce physically sound elastic deformations. We use for this purpose the Onsager–Machlup action functional [38] that is defined to study finite temperature transitions over finite time intervals. In addition to defining the action functional for shape deformation, we propose a method for finding the corresponding minimum action path. The action along this path defines a distance that depends only on the geometry of the initial and final configurations of the shape considered. This distance is a special case of an intrinsic quasi-metric from differential geometry with the Onsager–Machlup action corresponding to a Finsler function [39]. We note that the present paper develops on our own work on computing transition paths between different conformations of biomolecules [40,41].
While three-dimensional data representing a shape come in many forms, we concentrate on the important case where the surface of the shape is available and described with a discrete triangular mesh. Mathematically, these objects are two-dimensional Riemannian manifolds in the smooth case, and piecewise-flat surfaces in the discrete setting. We work with both representations. The method we propose to compute the distance between two such shapes proceeds in three steps. Given two shapes represented by two meshes with the same combinatorics, but different geometry, we construct first an energy surface that has those two shapes as minima. Second, we find the path with minimum action along this energy surface, where the action functional is the Onsager–Machlup action. Finally, we set the distance between the two shapes to be the integral of the action along this path.
Working with shapes described by discrete triangular meshes introduces two issues that we address in this paper. First, there are no reasons to assume that the combinatorics of the meshes representing the two surfaces of interest are identical. As such, the problem of constructing a deformation path between the two surfaces includes two sub-problems: parametrization and deformation. Parametrization corresponds to finding a mapping of the mesh structure of one surface onto the other, while deformation is concerned with finding a sequence of geometric realizations of this mesh as it evolves from its geometry in the start surface to its geometry on the target surface. While there have been recent developments of frameworks that solve those two sub-problems simultaneously [42–44], the approach we propose solves those problems sequentially. For the parametrization problem, we rely on a recent conformal diffeomorphic mapping technique that we have developed for the specific case of surfaces of genus zero [45–47]. The main focus of this paper is on the second sub-problem, which is solved using the minimum action method briefly discussed above and that will be described in detail in the following section. The second issue relates to the definition of an energy that captures the property of a triangular mesh. Many such energies have been defined, such as the as-rigid-as-possible energy and extensions [48–50] that measure the deviation from rigidity, and the shell models that capture stretching and bending [34,51]. In this work, we rely on an ‘elastic energy’ that is inspired from the concept of elastic networks in computational structural biology. Those networks are coarse-grained representations of biomolecules that significantly simplify the modelling of their dynamics [52]. The elastic energy of such a network measures the changes in the lengths of its springs, which we map in our method to the edges of the triangular mesh that is considered. This energy is similar in design to the Riemannian metric introduced by Kilian et al. [53].
The paper is organized as follows. Section 2 provides the mathematical background for our method. In particular, we cover the definition of the variational distance between shapes, the Onsager–Machlup action functional, the definition of the energy surface in the space of shapes, and the method used to find a minimum action path on this energy surface. The following section covers the details of the implementation for shapes represented by discrete surface meshes. In particular, we include details on the pre-processing step of mapping a mesh from one surface to another, as this is a necessary step if the two input meshes do not have the same combinatorics. We also introduce the elastic energy of a discrete surface. In §4, we discuss the applications of the deformation part of our method on generic, high-resolution non-rigid shapes from the TOSCA dataset [54]. Section 5 presents and discusses the results obtained by our method on three test cases, representing three regions of the skeletal anatomy of a collection of primates [22]. We conclude the paper with a brief discussion on future developments of the method.
2. A generalized variational distance between multidimensional objects
We follow the formalism introduced by Plotkin [27]. For illustration, let us start with the simple case of a distance between two shapes of dimension 0. The distance between the corresponding two points A and B at position XA and XB in a Euclidean space
is the minimal arclength travelled in the ‘deformation’ from A to B. This deformation can be captured by a transformation X(t), a vector function parametrized with a time variable t, such that X(0) = XA and X(T) = XB, for a given large time T. The infinitesimal distance travelled is a function of time,
. Using the variation of this functional
, the minimal distance between A and B is then defined by [55]
![]() |
2.1 |
where X(t)* is the minimal transformation between A and B. It is easy to see that this distance maps to the traditional Euclidean distance between two points. It can be easily expanded to multidimensional objects [27].
This definition of distance, however, does not account directly for the shape of the space on which the two points A and B reside. The shape of that space can be captured by a more general functional
of the deformation X(t). Following the concept of Finsler manifold [39], we equip the space of shapes M with an induced quasi-metric
that is an extension of the distance defined for points:
![]() |
2.2 |
We apply this definition to compute distances between two conformations of a shape in
. In the following, we define our choice for the functional
for such shapes and describe the method we propose for finding the corresponding minimum transformation X(t)*. For generality, we do not state the exact energy function within the functional
, and leave this for the following section in which we describe the implementation of this method to discrete shapes. We note that this formalism is inspired by our own work on computing transition paths between two conformations of a biomolecule [40,41]. We refer the reader to the corresponding papers for extended descriptions of the methods.
2.1. An action for elastic deformation of shapes
Let us consider a shape M described by a position vector X. Assuming that we are only interested in effects over long time scales, the dynamic behaviour of this shape under a potential energy U can be described by the overdamped Langevin equation:
| 2.3 |
where γ is a friction coefficient, U(X) is the potential energy of the system, ∇(U(X)) is the gradient of U, and B is an uncorrelated random force with zero mean. Note that by rescaling the units of time, it is possible to set γ = 1.
Equation (2.3) is a stochastic differential equation; as such, it does not have an analytical solution. However, given a trajectory X(t) over the time interval [0, T], it is possible to evaluate the probability that equation (2.3) would have generated that trajectory. Following the formalism introduced by Onsager & Machlup [38], this probability is given by
| 2.4 |
where α is a normalizing factor, kB is the Boltzmann constant and Te the temperature. The action
is given by
| 2.5 |
where the Lagrangian
is defined by
| 2.6 |
Given two possible configurations XA and XB for the shape M, the path of maximum likelihood by which the solution of equation (2.3) connects these two configurations is the one that maximizes the probability given by equation (2.4), i.e. the path that minimizes the action
with the boundary conditions X(0) = XA and X(T) = XB. In the following section, we define a general form for the energy function U that enables the implementation of fast solutions for equation (2.3).
2.2. A quadratic elastic energy for shapes
Let EA(XA) and EB(XB) be the energy values for the shape M in configurations XA and XB, respectively. Let us assume that those energies are values of two analytical functions, EA(X) and EB(X), defined such that both have a unique minimum, at XA and XB, respectively. Assuming small displacements around those minima, these energy functions can be approximated as UA(X) and UB(X) using a second-order Taylor expansion:
![]() |
2.7 |
and
![]() |
2.8 |
where ∇(EA)(XA) and HA(XA) = ∇∇(EA)(XA) are the derivatives and Hessian of EA at XA, respectively, with similar definitions for ∇(EB)(XB) and HB(XB). As we assume that EA is minimum at XA, its first derivatives are set to 0 at its minimum XA. We can also assume that EA(XA) = 0. The same applies to EB. Therefore,
| 2.9 |
and
| 2.10 |
The energy function U for any conformation X is then defined as
| 2.11 |
This is the form of the energy function that will be used for solving equation (2.3). We note that U is the combination of two quadratic functions; this significantly simplifies the process of solving this equation. This is discussed in the next section. We do note that the introduction of the min function leads to an energy function U and consequently of a functional
that are only C0. As such, this is only an approximation.
2.3. A minimum action path between two conformations of a shape
For a given T, the path X that minimizes the action
defined in equation (2.5) satisfies the Euler–Lagrange equation:
| 2.12 |
subject to X(0) = XA and X(T) = XB, where ∇∇(U) is the Hessian of the potential energy U defined above.
Let us consider a possible trajectory solution to the Euler–Lagrange equations. Because of the format of U defined above, this trajectory can be seen as the concatenation of two parts, one in a quadratic well WA whose minimum is XA, and one in the quadratic well WB whose minimum is XB. Those two trajectories meet at the transition state, Xts, at time ts with 0 < ts < T. In well WA,
| 2.13 |
and
| 2.14 |
The equation of motion is therefore
| 2.15 |
with the two boundary conditions X(0) = XA and X(ts) = Xts. As HA(XA) is a constant, the solution of equation (2.15) is then
| 2.16 |
where fA(HA, t) is a matrix function with the real function fA(x, t) given by
| 2.17 |
Note that when
,
.
Similarly, in well WB, the equation of motion becomes
| 2.18 |
with the two boundary conditions X(ts) = Xts and X(T) = XB. As HB(XB) is a constant, the solution of equation (2.18) is then
| 2.19 |
where fB(HB, t) is a matrix function with the real function fB(x, t) given by
| 2.20 |
Note again that when
,
.
Equations (2.16) and (2.19) describe the trajectories, i.e. the minimum action path for the shape, before and after the transition state Xts, respectively. We need to find both Xts and ts. We note first that the two trajectories are continuous, and meet at Xts: XA(ts) = XB(ts) = Xts. For a given ts, we can compute Xts by imposing velocity continuity at this point for the two trajectories. Finally, we find ts by imposing continuity for the energy at Xts.
3. Implementation
While there are many ways to describe a shape in our three-dimensional world, we concentrate on the important case where the shape is defined from its surface. In the discrete case, this surface, M, is described with a discrete triangular mesh,
, with
,
and
denoting the vertices, edges and triangles of
, respectively. The geometry of this mesh is defined by a configuration vector X that contains the coordinates of all the vertices in
. The method described above applies directly to computing the distance between two configurations XA and XB of the same mesh. It can be easily adapted however to computing the distance between two surfaces MA and MB, if the meshes
and
that represent them have the same number of vertices with the same connectivities between them. When the two meshes have different combinatorics, we need to generate first a parametrization between the two surfaces, i.e. a mapping such that the combinatorics of one mesh can be transported from one surface onto the other [56,57]. We have recently developed such a method for the specific case of surfaces of genus zero [45–47]. In the following subsection, we briefly describe this method. We then define the elastic energy E for such a mesh. Finally, we briefly describe MeshPath, our program for computing the distance between two configurations of a mesh. MeshPath is available under OpenSource upon request.
3.1. Surface parametrization
Let MA and MB be two surfaces of genus zero, represented by the meshes
and
, respectively. Both meshes are taken to be triangular. The goal is to define a discrete map
, where f(MA) is a geometric realization of
onto MB, that is as close as possible to an isometry, i.e. that minimizes the distortion of pairwise distances between vertices. The key observation [58] that makes the problem tractable is that any such function f can be understood as the composition of three discrete conformal mapping functions, CA, m and C−1B, where CA and CB are discrete conformal maps from MA and MB to the sphere, respectively, and m a conformal map of the sphere to itself. Once the discrete maps CA and CB are known (see [45,59] for details), f is fully parametrized by m. The group of such conformal self-mappings of the sphere is well understood and is called the Möbius group. Any transformation m in this group is defined by specifying the image of three points, and thus has six degrees of freedom. The mapping f is then constructed by optimizing m so that the composition C−1B ° m ° CA is as near to an isometry as possible, i.e. that it minimizes the functional:
![]() |
3.1 |
Here
and
denote the sets of edges in the meshes
and
, respectively. Aij is the sum of the areas of the two triangles adjacent to the edge (vi, vj) and l(vi, vj) is the distance between the two vertices vi and vj, namely the length of the edge (vi, vj). This method is implemented into the program MatchSurface [45,47]. We have used this program for mesh parametrization for all cases presented in the Result section.
We note that the infimum of Dsd(f) as f varies over all conformal diffeomorphisms from MA to MB defines a metric on the space of surfaces of genus zero [60]. We refer to the corresponding distance as dsd [47].
3.2. An elastic energy for a discrete triangular mesh
Let M be a surface of a shape A represented by the triangular mesh
. We define X(0) to be the ground state conformation of this mesh, and X(t) to be its conformation at a time t, namely a new geometric realization of the mesh with the same combinatorics. We note that the mesh
is basically a set of vertices, with edges between them. As such, it is similar to an elastic network model (ENM) that is used extensively in molecular structural biology to study the dynamics of biomolecules [52,61,62]. We have therefore defined the energy EA of shape A to be the elastic energy E of the mesh
, as defined by the formalism of ENM, as described below.
Let vertex vi be characterized by its position Xi = (xi, yi, zi). For two vertices vi and vj that are connected with an edge in
, we set rij and r0ij to be the length of this edge in any conformation X(t) and in the ground-state conformation X(0), respectively. The elastic energy EA of the mesh is then set to
![]() |
3.2 |
where kij is the force constant of the ‘spring’ formed by the pair of vertices vi and vj. In the original ENM introduced by Tirion, the elastic constants kij are set to be the same for all pairs of vertices [52]. We follow this setting and choose kij = 1.
Using the elastic energy EA defined in equation (3.2), the quadratic energy (see equation (2.10)) of the mesh is then given by
| 3.3 |
where the Hessian H(X(0)) is the matrix containing the second derivatives of EA computed for the conformation X(0). From equation (3.2), it is easy to find that the 3 × 3 submatrix Hij of this Hessian corresponding to the two vertices i and j of an edge (vi, vj) in the mesh is given by
![]() |
3.4 |
and that the 3 × 3 submatrix Hii on the diagonal of H is given by
| 3.5 |
3.3. MeshPath: a program to compute the distance between two shapes
Let A and B be two shapes defined by their surfaces MA and MB, themselves represented by the meshes
and
, respectively. We assume that the two meshes have the same combinatorics; if not, the procedure described above is applied to map
onto MB. The corresponding image is used in place of
. To find the distance between those two shapes, we compute the minimum action path between the conformations of
and
using the following algorithm.

This algorithm mimics the algorithm we developed for computing the minimum action path between two conformations of a biomolecule [41]. The reader is referred to that paper for details. The constant TOL in step (4) is chosen to be 1 × 10−6. Finally, to compute the action of the (discrete) converged trajectory, we use the generalized trapezoidal rule:
![]() |
3.6 |
where
defines the variational distance between the two shapes, which we refer to as dact.
We note that for most applications, the meshes will have their own combinatorics. For comparing any two shapes M1 and M2 represented with the meshes
and
, we parametrize first
onto
and compute both the geometric and variational distances between
and its image
. We then repeat the process by parametrizing
onto
. The final distances dsd and dact between the two shapes are the minima of the corresponding distances for the two parametrizations.
4. Distances between generic, discrete surfaces with exact correspondence
The method described in this paper is designed to compute the distance between two shapes by building a geodesic between those two shapes whose action is minimal. As those shapes are described by triangular meshes, it involves two steps, a parametrization step and a deformation step. As the novelty of the method relates to the latter, we tested it first on objects that have the same triangulation, such that the parametrization step is trivial. Figure 1 illustrates with three examples the minimum action trajectories that are generated between two such objects with PathMesh. We have considered centaur0 and centaur1 (figure 1a), dog0 and dog1 (figure 1b), and cat0 and cat2 (figure 1c) from the TOSCA high-resolution dataset [54]. All three trajectories were computed with a total time T set to 50. In all three cases, the transition along the path occurred at around T = 25, namely in the middle of the trajectories. While the transitions look very natural, we do notice that some deformations of the shapes occur in the vicinity of the transition state, at the level of the left front leg of the centaur, at the right back leg of the dog and even more pronounced at the level of the tail of the cat.
Figure 1.
Minimum action paths between two shapes. Different poses along the geodesic paths computed with PathMesh between two conformations of a centaur (a), of a dog (b) and of a cat (c). The timing for each pose is given as percentage of the total time given for the construction of the path.
The total effort needed to deform one shape into another is expected to be related to how different those two shapes are. The difficulty is to define what ‘different’ means, in a way that a measure of such differences defines a metric in the space of shapes. PathMesh was designed to provide one such measure of differences by computing dact, a quasi-metric for shapes (see above). We compared this distance measure to a more traditional measure of similarities between shapes, namely the coordinate root mean square deviation (RMSD). We chose for that purpose the TOSCA high-resolution dataset [54], available at http://tosca.cs.technion.ac.il. This dataset contains a total of 80 objects regrouped into nine classes, including 11 cats, nine dogs, three wolves, eight horses, six centaurs, four gorillas, 12 female figures (Victoria), and two different male figures, David and Michael, containing 7 and 20 poses, respectively. Objects within one class have the same triangulation and an equal number of vertices that are in direct correspondence. For each class, one mesh (usually the one with index 0) was chosen as a reference, and the distances between the other members of the class and this reference were computed in two different ways. First, the RMSD over the vertices of the triangulation is computed after removal of rigid-body transformations (rotation and translation). The meshes are scaled such that their average edge lengths are set to 1. Second, a path is generated between one object and its reference conformation using PathMesh, and the total action along that path defines the distance dact. In all those computations, the elastic constants for the mesh energies are set to 1, and the trajectories are computed with T = 50. We note that RMSD, energies, actions and time are all given in arbitrary units. Comparisons of the action distances and RMSDs are given in figure 2.
Figure 2.

Distances between discrete shapes with exact vertex correspondence. The cost of deforming one shape from one conformation into another, as measured by the minimal action along the deformation trajectory, dact, is plotted against the root mean square distance (RMSD) between the coordinates of the vertices of the triangulation of the shape in the two conformations, after removal of rigid body transformations. The calculations are performed for nine classes of shapes, corresponding to the high-resolution dataset included in TOSCA [54] (see text for details). Note that both dact and RMSD are in arbitrary units, scaled such that the average edge lengths of the meshes are set to 1.
As expected, the larger the differences between two shapes (as measured by the RMSD), the larger the action-based distance between the two shapes. This behaviour is observed over a broad range of geometric differences between the shapes, up to 100 (with the unit being the average edge length of the meshes considered). The relationship between the two measures of shape differences however is not linear. Interestingly, this relationship is unchanging over all classes included in the TOSCA high-resolution dataset. This is likely due to the fact that the deformations that were applied on the different objects are consistent from one class to another.
Using a single thread on a 4 GHz Intel Core I7, the average CPU time to compute a trajectory between two conformations of a mesh with 15 000 vertices is 470 s, and 2200 s for meshes with 50 000 vertices. We note that most of this cost comes from the computation of a function of a matrix times a vector, which can be performed efficiently, in parallel, using a Krylov-based method (see [41] for details). The parallel version of PathMesh leads to a gain of a factor of 3 when it is run over four threads.
5. Distances between biological shapes
To test the effectiveness of our full procedure, which includes parametrization and generation of a minimum action path for computing a variational, action-based distance between shapes, dact, we applied it on three independent datasets, representing three regions of the skeletal anatomy of a collection of primates. The first dataset, MT1, contains 61 proximal first metatarsals of prosimian primates, New and Old World monkeys, the second dataset, RADIUS, contains 45 distal radii of apes and humans, and the third dataset, TEETH, includes 116 second mandibular molars of prosimian primates and non-primate close relatives. They were originally assembled by Boyer et al. who used them in a similar study with different shape comparison measures based on an ‘earth mover’ metric, and on a ‘continuous Procrustes’ (cP) distance [22]. Using these datasets allows us to evaluate the performances of the variational method proposed above. The evaluation is done by comparing the distances dact between surfaces included in the three datasets to the geometric distance dsd, and to the distances based on landmarks identified by trained morphologists. There are two such sets of experimental distances for the metatarsal dataset, referred to as ‘observer1’ and ‘observer2’, and one set for the RADIUS dataset and the TEETH dataset, named ‘observer1’. We note that the variational distance, dact, in common with dsd, does not require preliminary selection of landmarks on the surfaces. Geometric morphometricians on the other hand have determined landmarks on each surface, choosing them to reflect correspondences considered biologically and evolutionarily meaningful (see SI appendix, Materials of ref. [22]). These landmarks determine a ‘discrete’ Procrustes distance between any two surfaces, which we refer to as dobs. Finally, we note that we had previously compared dsd with the continuous Procrustes distance dcP on the same datasets and found that the former performs best [47].
5.1. A simple shape recognition application
Shape recognition is a natural application of the variational distance described here. As a simple illustration, we considered four surfaces of proximal metatarsal bones, two from baboons, Pu80771 and PuF in MT1, and two from lemurs, Lf170750 and Lf170755 in MT1. The two surfaces PuF (baboon A) and Lf170755 (lemur A) form a mini library of shapes, while the surfaces Pu80771 (baboon B) and Lf170755 (lemur B) are two test surfaces, which we refer to as ‘surface A’ and ‘surface B’. We studied the deformations of all combinations of test versus library surfaces by computing the minimum action paths between them. In figure 3, we show the evolution of the energy (as defined in equation (2.11)) along the corresponding paths. When the surface A is deformed onto the surfaces for baboon A and lemur A, the corresponding minimum actions dact are 0.08 and 0.71, respectively, identifying correctly surface A as corresponding to a baboon. By contrast, surface B is closer to lemur B, and therefore corresponds to a lemur.
Figure 3.
A simple shape recognition experiment. Two surfaces, A and B, of ‘unknown’ proximal metatarsal bones are compared to the surfaces of two known bones, one from baboon (baboon A), and one from lemur (lemur A) (it is known in fact that A is the surface of a bone of a baboon while B is the surface of a bone of a lemur). The comparisons are performed by deforming elastically each surface towards one of the reference surfaces, using the minimum action principle. The energies (as defined by equation (2.11)) of the configurations of the surfaces along the different paths are plotted against ‘time’. Both time and energy have arbitrary units. Note the presence of the cusp at the transition; this cusp is a consequence of the energy function that is used (see §2 for details). The deformations of surface A onto baboon A and of surface B onto lemur A have the lowest action, identifying them correctly as coming from a baboon and a lemur, respectively (see text for details). The geometries of the surfaces at the start, end, and at the transition of each trajectory are shown. (Online version in colour.)
Each of the three distances (dact, dsd and dobs) defines a matrix for each dataset, containing all pairwise distances between the surfaces included in that dataset. To measure the effectiveness of each distance, we compare those matrices in three different ways.
5.2. Correlations between the different distance measures
The distances considered here rank pairs of surfaces according to their similarity. The relative performances of dact and dsd with respect to the observer distance dobs can then be computed using a correlation analysis. As correlation is searching for a linear relationship and the distances may have different dynamic ranges, we first coded those distances with a simple scheme. The distributions of distances are broken into 10 intervals of equal size (in terms of percent of the total number of distances), and the distance belonging to interval i is coded with the value i. Table 1 gives the corresponding correlation coefficients between the discretely coded distances.
Table 1.
Correlations between observer distances, geometric (dsd) and variational (dact) distances.
| dataset | no. pairs | dobs1/dact | dobs2/dact | dobs1/dsd | dobs2/dsd | dobs1/dobs2 |
|---|---|---|---|---|---|---|
| metatarsal (all) | 1830 | 0.77 | 0.77 | 0.78 | 0.78 | 0.87 |
| radius (all) | 990 | 0.54 | n.a. | 0.58 | n.a. | n.a. |
| teeth (all) | 4851 | 0.60 | n.a. | 0.52 | n.a. | n.a. |
The distances based on manual assignments of landmarks by morphometricians may be considered as a reference, since they are based on extensive expert knowledge, though they are not deemed perfect. Note that there is variability between morphometricians, though the correlations between the results of two such observers are high. On all three datasets, the geometric distance dsd and the variational distance dact correlate well with those reference distances.
In figure 4, we compare four (coded) distance matrices for the MT1 dataset.
Figure 4.
Comparisons of the observer distance matrices with the geometry-based (dsd) and action-based (dact) distance matrices for the MT1 dataset of proximal metatarsal bones. X- and Y-axes are the bone index within the dataset, with the bones being sorted first based on the species they correspond to. Note that all four distances clearly separate the bones of the simian species from the bones of the prosimians.
In all four distance matrices, we clearly identify the separation between the bones of the simians (Primates) and the bones of the prosimians (Euprimates). The structures of the matrices within those two groups, however, differ for the four distance measures. Note that this difference also exists between the two observer distance matrices. The Primates group for example contain four species, with one of them only including one sample. The observer 2 distance matrix clearly highlights the presence of the three main groups, as well as identifies the last bone (corresponding to the unique sample) as being different. This separation is less clear in the observer 1 distance matrix. In the distance matrix based on dsd, the primates are mostly organized into two groups, while the action-based distance matrix shows the three dominant subgroups of primates, but regroups the unique sample with the third subgroup. Overall however, the graphical representations of the four matrices confirm the high level of similarity observed in the correlation analysis (table 1).
5.3. Receiver operating characteristic analyses of the distance measures
For a second comparison of the three distance measures, we evaluated their performances using a receiver operating characteristic (ROC) analysis [63]. In this approach a ‘gold standard’ is defined, based on a choice of level in the phylogeny of the specimens, species, genus, family or superfamily. A pair of surfaces is then defined as similar, or ‘positive’, if the corresponding specimens belong to the same taxonomic group considered, and ‘negative’ otherwise. For varying thresholds of the distance measure under study, all pairs of surfaces whose distances fall below the threshold are then assumed positive, while those above it are deemed negative. The pairs that agree with the standard are called true positives (TP), while those that do not are false positives (FP). An ROC analysis compares the rate of TP (also called sensitivity) to the rate of FP (which corresponds to one minus the specificity). It is scored with the proportion of area under the corresponding curve (AUC). An AUC of 1 indicates that all TP are detected first. This corresponds to the ideal distance measure. On the other hand, the diagonal curve leads to an AUC of 0.5. In this case, TP and FP appear at the same rate, and the distance measure contains no information.
The results of ROC analyses based on the three distances dsd, dact and dobs are given in table 2 and illustrated in figure 5.
Table 2.
Area under the curve (AUC) of ROC analyses on the observer, geometric (dsd) and variational (dact) distances.
| first metatarsal |
radius |
teeth |
|||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | dobs1 | dobs2 | dsd | dact | N | dobs | dsd | dact | N | dobs | dsd | dact | |
| species | 13 | 0.94 | 0.97 | 0.96 | 0.96 | 5 | 0.87 | 0.87 | 0.89 | n.a. | |||
| genera | 13 | 0.94 | 0.97 | 0.96 | 0.96 | 4 | 0.86 | 0.80 | 0.85 | 24 | 0.98 | 0.96 | 0.94 |
| families | 9 | 0.91 | 0.95 | 0.95 | 0.93 | n.a. | 17 | 0.87 | 0.85 | 0.80 | |||
| superfamilies | 2 | 0.96 | 0.97 | 0.85 | 0.86 | n.a. | 5 | 0.64 | 0.77 | 0.66 | |||
Figure 5.
ROC analyses of three distance measures of surface similarities. We compare the effectiveness of an observer-based distance, dobs, a geometric-based distance, dsd, and an action-based variational distance, dact, to detect similarities that identify with species in a set of 61 metatarsal surfaces (a), in a set of 45 radius surfaces (b), and with genera in a set of 99 teeth surfaces (c). ‘True’ relationships are defined by the phylogeny of the specimens included in those datasets. An ROC curve that falls close to the first diagonal (shown as a dashed line) reveals poor performance, while an ROC curve that first follows the y-axis, i.e. whose area under the curve is the largest, indicates good performance. The results indicate that dact performs at least as well as dsd and is comparable to the observer-based distance.
Species or genera identification based on landmarks manually defined by experts is expected to perform best. Indeed, the ROC curves derived from the observer distances illustrate excellent classification results, with AUC values above 0.85 in all cases, above 0.9 for the metatarsal dataset, and above 0.95 for the teeth dataset. Note that even with human expertise included, the classification is not perfect. In addition, we observe differences between the results obtained by two distinct experts on the MT1 dataset. The two distance measures dsd and dact alleviate the difficulty of defining landmarks on the two surfaces to be compared. The distances they compute reflect either different geometric properties (as measured by dsd), or differences in their elastic flexibilities (as measured by dact). We find that the distance dact introduced in this study performs as well as dsd on all three datasets, at all phylogenic classification levels. Those two distances perform as well as the observer, landmark-based, distances, with differences that are of similar magnitude to the differences measured between distinct observers.
In such an ROC analysis, it is worth focusing on the small distances, as those are often the most reliable ones and the most relevant for applications. For each distance measure, we therefore computed a ROC50 value, computed as follows. For each of the first 50 false positives that are found, the number of true positives with a smaller distance is recorded. The sum of all of these numbers is then divided by the number of false positives (50), and finally divided by the total number of possible true positives in the dataset considered, thereby defining the ROC50. Results are given in table 3 for all three datasets. Like the AUC values, we note differences between the ROC50 values for the two observers on the MT1 dataset. The geometric distance and the variational distance perform similarly. The latter do seem to outperform both the observer and dsd distances on the RADIUS dataset; this dataset however is the smallest, with only 45 samples, and the differences may not be statistically significant.
Table 3.
ROC50 values for the ROC analyses on the observer, geometric (dsd) and variational (dact) distances.
| first metatarsal |
radius |
teeth |
|||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | dobs1 | dobs2 | dsd | dact | N | dobs | dsd | dact | N | dobs | dsd | dact | |
| species | 13 | 0.41 | 0.57 | 0.40 | 0.35 | 5 | 0.26 | 0.45 | 0.54 | n.a. | |||
| genera | 13 | 0.41 | 0.57 | 0.40 | 0.35 | 4 | 0.30 | 0.37 | 0.46 | 24 | 0.56 | 0.67 | 0.74 |
| families | 9 | 0.29 | 0.50 | 0.40 | 0.38 | n.a. | 17 | 0.28 | 0.38 | 0.42 | |||
| superfamilies | 2 | 0.74 | 0.80 | 0.43 | 0.42 | n.a. | 5 | 0.06 | 0.06 | 0.06 | |||
5.4. Classification of shapes
The ROC analysis ranks distances between specimens and assesses if this ranking is compatible with an existing classification; it does not perform the classification itself. We extend the ROC analysis to the actual problem of classification by performing a third set of computational experiments. Each experiment involves a dataset of surfaces/specimens, D, a taxon, T, and a distance measure, d. We begin by randomly dividing the sets of surfaces in D into two groups of approximately equal size. The first group serves as a training set to define the taxa, while the second group serves as a test set. A test surface is classified by assigning it to the taxon of its nearest neighbour in the training set. This is much akin to the fold recognition experiment illustrated in figure 3 and used in the protein structure prediction community [64]. The results are stored in a confusion matrix, C, whose element C(i, j) reports the number of test surfaces corresponding to specimens that belong to taxon i that have been classified as belonging to taxon j. The accuracy of the classifier d is then defined to be the ratio of the trace of the confusion matrix over the sum of all its elements (i.e. the percentage of correctly classified specimens). To remove the influence of the initial division of the dataset into test and training sets, the procedure is repeated 10 000 times. We performed these experiments for the three datasets, for the three distance measures and for different levels in the taxonomy of the specimens. The results are reported in table 4.
Table 4.
Success rates of classification experiments, based on the dobs, dsd and dact distances.
| first metatarsal |
radius |
teeth |
|||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | dobs1 | dobs2 | dsd | dact | N | dobs | dsd | dact | N | dobs | dsd | dact | |
| species | 13 | 74a | 86 | 81 | 84 | 5 | 76 | 85 | 86 | n.a. | |||
| genera | 13 | 74 | 86 | 81 | 84 | 4 | 85 | 90 | 92 | 24 | 87 | 91 | 87 |
| families | 9 | 83 | 94 | 93 | 94 | n.a. | 17 | 91 | 93 | 91 | |||
| superfamilies | 2 | 100 | 100 | 100 | 99 | n.a. | 5 | 94 | 95 | 95 | |||
aPercentage of correctly classified specimens, computed over 10 000 experiments.
Classifications based on the three distances perform similarly on the three anatomical datasets. We note again a difference between the two observers on the first metatarsal dataset. These differences indicate the difficulties in defining consistent landmarks on anatomical surfaces even for experienced morphometricians.
A distance matrix that contains the results of an all-against-all comparison of all specimens included in a dataset can be turned into a tree using a variety of clustering techniques. We used the unweighted pair group method with arithmetic mean (UPGMA) method to build trees based on the two distance measures (dobs and dact) for the RADIUS dataset, as implemented in the software package PHYLIP [65]. UPMGA constructs a tree by minimizing the net disagreement between the matrix pairwise distances and the distances measured on the tree. Results are shown in figure 6. We note that those are phenetic trees, in opposition to phylogenetic trees that are built from molecular sequencing data.
Figure 6.
Morphology-based UPMGA trees for the specimens included in the RADIUS dataset: (a) is derived from distances computed using landmarks placed on each surface by experienced morphometricians, while (b) is computed using the variational distance dact (this work). Branches of the trees are coloured according to the actual species of the specimens. The trees were computed using the program Neighbour from the software package PHYLIP [65], and subsequently drawn using the package MEGA6 [66]. Note that this illustrates the qualitative differences between the trees; a more quantitative analysis is provided in the text.
The UPMGA tree based on dact shows a high level of agreement with the actual phylogeny of the specimens considered. Homo sapiens and Gorilla are clearly well separated from the other species, forming pure clades. In addition, Pongo and Pan are well separated. Note that even the bonobos (pygmy chimpanzees) are regrouped within the clade for chimpanzees. The tree based on the observer distances shows more misassociations, with one chimpanzee showing within Pongo, and reversely one Pongo showing within the chimpanzees, and two gorillas separated from the main gorilla clade.
To quantify the differences between the trees generated from the observer (dobs), action (dact) and geometric (dsd) distance matrices computed for the RADIUS dataset, we first rescaled those distance matrices so that all distances ranged between 0 and 1, and regenerated the UPMGA trees. The four trees are then compared using the TreeDist program from the software package PHYLIP. TreeDist computes the symmetric distance of Robinson & Foulds [67] to evaluate the similarity between two trees. We find that the distances between the observers' tree and the action-based and geometry-based trees are 72 and 78, respectively. While we cannot assess the meaning of the absolute values of these distances, and the significance of the differences between those values, we do notice that the observers' tree resembles most the tree computed with the action distance measure introduced here.
6. Discussion
We have developed a new method for measuring the similarities between two shapes. This method is designed to capture the complexities of deforming one of the shapes into the other. Specifically, we consider the generalization of the conventional notion of distance. We set the distance between two shapes to be the length of the geodesic that joins those two shapes, where the geodesic is computed as the minimizer of an Onsager–Machlup action functional. We have introduced an elastic energy for shapes represented by surface meshes, as well as a method for computing the minimizer of the corresponding action. The integral of the action along this path defines a distance that depends only on the geometry of the initial and final configurations of the shape considered. We have illustrated its use both on generic shapes with exact correspondences, and on sets of specific biological shapes on which we performed shape recognition and classification.
While data representing a shape come in many forms, we have focused on the specific case in which a shape is represented by its surface, which in turn is described with a discrete triangular mesh. With such a representation, computing a deformation between two conformations of a shape requires solving two steps, namely finding a parametrization (computing representations of the two conformations with exact correspondence between their vertices and mesh structures), and computing the deformation that leads to minimal effort, in our case minimal action. We note first that solving these two steps sequentially is not a necessity: the elastic shape analysis techniques developed by Laga and co-workers solve those steps simultaneously [42–44]. We believe, however, that those two steps include specific problems that require different treatments, and prefer to handle them separately. Our solutions to those problems rely on our own earlier work, first on the geometry of shapes for performing the parametrization, and on the generation of the minimal action path between two conformations of a biomolecule [40,41] for the deformation step. As such, it carries both their advantages and their limitations. With respect to parametrization, for example, our method relies on a conformal registration method that is described in [45–47]. We note that this method is currently limited to meshes representing surfaces of genus zero, with or without boundaries. Much work remains to be done for performing a parametrization that is not limited to a specific genus.
The method proposed to generate a path between two conformations of a shape is loosely derived from physics. We equate the deformation of a shape to a diffusive process. The probability of a trajectory for such a diffusive process can be computed as an integration over the trajectory of a functional, given by the Onsager–Machlup function. This argument can be reversed to find the trajectory with the highest probability, or equivalently of minimal action. This leads to a second order two-point boundary value ODE problem. With a special choice for the energy of a shape, those equations can be solved analytically. The elastic energy we have introduced that satisfies this condition is much akin to the energy of an ENM and as such suffers from the same drawbacks. First, it should be noted that it is really a geometry-based energy function, and not a physics-based energy function. As such, it lacks the ability to capture the properties of the surface, such as stretching and bending [34,51]. In addition, it does not account for steric hindrance, which may lead to self-intersection of the surface during the deformation. There is a second problem with the energy function that comes from our definition of the (simplified) energy surface for the space of conformations for the shape considered. By defining this energy surface as the lower envelope of the quadratic wells at two prescribed minima, namely the start and target conformations of the shape, we observe problems of smoothness at the crossing between the wells. This leads to the presence of a cusp in energy at the transition (figure 3), which in turn leads to unphysical deformations of the surfaces, as observed in figure 1 for both the cat and the dog near the transition. We are currently working on implementing solutions that circumvent those limitations, in line with our recent developments when solving for minimum action paths for biomolecules [41].
MeshPath is a method for solving the Euler–Lagrange equations corresponding to the constraint of minimum action along the deformation path for a shape. As it solves these equations of motion analytically, it is extremely efficient and directly amenable to study large systems, despite the fact that it relies on the computation of Hessians of the energy functions. We have shown applications on meshes that include tens of thousands of vertices (from the high-resolution dataset from TOSCA), with computing times of the order of thousands of seconds on a unique thread, and of the order of hundreds of seconds on multiple threads.
The main application of PathMesh presented in this paper falls in the field of geometric morphometry, using three datasets representing bones and teeth of primates. Experiments on these datasets have shown that it performs remarkably well both in shape recognition and in identifying evolutionary patterns. We note that while we show successful taxon recognition based on geometric information for one of the datasets considered (the RADIUS dataset, see figure 6), taxonomy is not the intended purpose of this method. Instead, we restrict its current applications to providing robust estimates of the distances between three-dimensional shapes, with one finality being to support phylogeny reconstruction. As the advent of computers and the developments of robust and fast scientific computing techniques to quantify phenotypes, there has been a wealth of studies attempting to provide a quantitative view of evolution in biology, starting with classification and taxonomy. The development of genomics in the last three decades has led many to question the relevance of geometrics morphometrics for such studies. Indeed, the assessments of genetic variations from automated genomic analyses have greatly improved our understanding of the relationships between the genotype and phenotype of individuals of many species (for a review, see [68]). Those relationships serve as ground for building the phylogeny of organisms, i.e. the history of their lineages as they change through time. As a consequence, phylogenetics is playing a central role in the study of evolution [69]. In comparison, it is much harder to come up with good models of the genetics that underlie morphological changes [70]. As a consequence, the utility of morphological data in phylogenic research has become increasingly questioned. As a response to the criticisms expressed against geometric morphometry, MacLeod et al. [71] and more recently Boyer et al. [72] have emphasized the need to automate and standardize morphological studies, starting with the determination of geometric correspondence between shapes. The method developed in this paper is one attempt towards this goal.
Acknowledgments
I thank Yaron Lipman for making the data from Boyer et al. [22] freely available. I also thank the anonymous reviewers for their insightful comments and suggestions.
Data accessibility
The TOSCA high-resolution dataset is available at http://tosca.cs.technion.ac.il. The biological datasets, representing three regions of the skeletal anatomy of a collection of primates, are available on Yaron Lipman's website (http://www.wisdom.weizmann.ac.il/~ylipman/CPsurfcomp/).
Competing interests
I declare I have no competing interests.
Funding
The Ministry of Education of Singapore is acknowledged for financial support through grant no. MOE2012-T3-1-008.
References
- 1.Thompson DW. 1917. On growth and form. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 2.Max N, Getzoff E. 1988. Spherical harmonic molecular surfaces. IEEE Comput. Graph. Appl. 8, 42–50. ( 10.1109/38.7748) [DOI] [Google Scholar]
- 3.Koehl P. 2006. Protein structure classification. In Reviews in computational chemistry, vol. 22 (eds Lipkowitz KB, Cundari TR, Gillet VJ, Boyd B), pp. 1–56. Hoboken, NJ: John Wiley & Sons. [Google Scholar]
- 4.Kolodny R, Petrey D, Honig B. 2006. Protein structure comparison: implications for the nature of fold space, and structure and function prediction. Curr. Opin. Struct. Biol. 16, 393–398. ( 10.1016/j.sbi.2006.04.007) [DOI] [PubMed] [Google Scholar]
- 5.Kötter R, Wanke E. 2005. Mapping brains without coordinates. Phil. Trans. R. Soc. B 360, 751–766. ( 10.1098/rstb.2005.1625) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Otte A, Halsband U. 2006. Brain imaging tools in neurosciences. J. Physiol. Paris 99, 281–292. ( 10.1016/j.jphysparis.2006.03.011) [DOI] [PubMed] [Google Scholar]
- 7.Gholipour A, Kehtarnavaz N, Briggs R, Devous M, Gonipath K. 2007. Brain functional localization: a survey of image registration techniques. IEEE Trans. Med. Imaging 26, 427–451. ( 10.1109/TMI.2007.892508) [DOI] [PubMed] [Google Scholar]
- 8.MacKenzie-Graham A, Boline J, Toga AW. 2007. Brain atlases and neuroanatomic imaging. Methods Mol. Biol. 401, 183–194. ( 10.1007/978-1-59745-520-6_11) [DOI] [PubMed] [Google Scholar]
- 9.Pantazis D, Joshi A, Jiang J, Shattuck D, Bernstein LE, Damasio H, Leahy RM. 2010. Comparison of landmark-based and automatic methods for cortical surface registration. Neuroimage 49, 2479–2493. ( 10.1016/j.neuroimage.2009.09.027) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bronstein AM, Bronstein MM, Kimmel R. 2006. Generalized multidimensional scaling: a framework for isometry-invariant partial surface matching. Proc. Natl Acad. Sci. USA 103, 1168–1172. ( 10.1073/pnas.0508601103) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Huang Q, Adams B, Wicke M, Guibas L. 2008. Non-rigid registration under isometric deformations. Comput. Graph. Forum 27, 1449–1457. ( 10.1111/j.1467-8659.2008.01285.x) [DOI] [Google Scholar]
- 12.Lasowski R, Tevs A, Seidel H-P, Wand M. 2009. A probabilistic framework for partial intrinsic symmetries in geometric data. In IEEE Int. Conf. on Computer Vision, pp. 963–970. Piscataway, NJ: IEEE ( 10.1109/ICCV.2009.5459356) [DOI] [Google Scholar]
- 13.Rustamov RM. 2007. Laplace–Beltrami eigenfunctions for deformation invariant shape representation. In Proc. 5th Eurographics Symp. on Geometry Processing, Barcelona, Spain, pp. 225–233. Aire-la-Ville, Switzerland: Eurographics Association. [Google Scholar]
- 14.Sun J, Ovsjanikov M, Guibas L. 2009. A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum 28, 1383–1392. ( 10.1111/j.1467-8659.2009.01515.x) [DOI] [Google Scholar]
- 15.Fischl B, Sereno MI, Dale AM. 1999. Cortical surface-based analysis. II. Inflation, flattening, and a surface-based coordinate system. Neuroimage 9, 195–207. ( 10.1006/nimg.1998.0396) [DOI] [PubMed] [Google Scholar]
- 16.Fischl B, Sereno M, Tootell R, Dale A. 1999. High-resolution inter-subject averaging and a coordinate system for the cortical surface. Human Brain Mapping 8, 272–284. ( 10.1002/(SICI)1097-0193(1999)8:4%3C272::AID-HBM10%3E3.0.CO;2-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.McCane B. 2013. Shape variation in outline shapes. Syst. Biol. 62, 134–146. ( 10.1093/sysbio/sys080) [DOI] [PubMed] [Google Scholar]
- 18.Laga H, Kurtek S, Srivastava A, Miklavcic SJ. 2014. Landmark-free statistical analysis of the shape of plant leaves. J. Theoret. Biol. 363, 41–52. ( 10.1016/j.jtbi.2014.07.036) [DOI] [PubMed] [Google Scholar]
- 19.Rubner Y, Tomasi C, Guibas LJ. 1998. A metric for distributions with applications to image databases. In 6th Int. Conf. on Computer Vision, pp. 59–66. Piscataway, NJ: IEEE. ( 10.1109/ICCV.1998.710701) [DOI] [Google Scholar]
- 20.Grauman K, Darrell T. 2004. Fast contour matching using approximate earth mover's distance. In Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Washington, DC, USA, 27 June–2 July 2004 ( 10.1109/CVPR.2004.1315035) [DOI] [Google Scholar]
- 21.Lipman Y, Daubechies I. 2011. Conformal Wasserstein distances: comparing surfaces in polynomial time. Adv. Math. 227, 1047–1077. ( 10.1016/j.aim.2011.01.020) [DOI] [Google Scholar]
- 22.Boyer DM, Lipman Y, StClair E, Puente J, Patel BA, Funkhouser T, Jernvall J, Daubechies I. 2011. Algorithms to automatically quantify the geometric similarity of anatomical surface. Proc. Natl Acad. Sci. USA 108, 18 221–18 226. ( 10.1073/pnas.1112822108) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Glaunès J, Vaillant M, Miller MI. 2004. Landmark matching via large deformation diffeomorphisms on the sphere. J. Math. Imaging Vision 20, 179–200. ( 10.1023/B:JMIV.0000011323.32914.f3) [DOI] [Google Scholar]
- 24.Valliant M, Glaunès J. 2005. Surface matching via currents. In Information processing in medical imaging (eds GE Christensen, M Sonka). Lecture Notes in Computer Science, vol. 3565, pp. 381–392. Berlin, Germany: Springer ( 10.1007/11505730_32) [DOI] [PubMed] [Google Scholar]
- 25.Valliant M, Qiu A, Glaunès J, Miller MI. 2007. Difffeomorphic metric surface mapping in subregion of the superior temporal gyrus. Neuroimage 34, 1149–1159. ( 10.1016/j.neuroimage.2006.08.053) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Zhong J, Qiu A. 2010. Multi-manifold diffeomorphic metric mapping for aligning cortical hemispheric surfaces. Neuroimage 49, 355–365. ( 10.1016/j.neuroimage.2009.08.026) [DOI] [PubMed] [Google Scholar]
- 27.Plotkin SS. 2007. Generalization of distance to higher dimensional objects. Proc. Natl Acad. Sci. USA 104, 14 899–14 904. ( 10.1073/pnas.0607833104) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mohazab AR, Plotkin SS. 2008. Minimal folding pathways for coarse-grained biopolymer fragments. Biophys. J. 95, 5496–5507. ( 10.1529/biophysj.108.135046) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mohazab AR, Plotkin SS. 2009. Structural alignment using the generalized euclidean distance between conformations. Int. J. Quant. Chem. 109, 3217–3228. ( 10.1002/qua.22347) [DOI] [Google Scholar]
- 30.Mohazab AR, Plotkin SS. 2013. Polymer uncrossing and knotting in protein folding, and their role in minimal folding pathways. PLoS ONE 8, e53642 ( 10.1371/journal.pone.0053642) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Adams B, Ovsjanikov M, Wand M, Seidel H-P, Guibas L. 2008. Meshless modeling of deformable shapes and their motions. In Proc. 2008 ACM SIGGRAPH/Eurographics Symp. on Computer Animation, Dublin, Ireland, 7–9 July 2008, pp. 77–86. Aire-la-Ville, Switzerland: Eurographics Association. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Fuchs M, Jütler B, Scherzer O, Yang H. 2009. Shape metrics based on elastic deformation. J. Math. Imaging Vis. 35, 86–102. ( 10.1007/s10851-009-0156-z) [DOI] [Google Scholar]
- 33.Srivastava A, Klassen E, Joshi SH, Jermyn IH. 2011. Shape analysis of elastic curves in Euclidean spaces. IEEE Trans. Pattern Anal. Mach. Intell. 33, 1415–1428. ( 10.1109/TPAMI.2010.184) [DOI] [PubMed] [Google Scholar]
- 34.Heeren B, Rumpf M, Wardetzky M, Wirth B. 2012. Time-discrete geodesics in the space of shells. Comput. Graph. Forum 31, 1755–1764. ( 10.1111/j.1467-8659.2012.03180.x) [DOI] [Google Scholar]
- 35.Heeren B, Rumpf M, Shröder P, Wardetzky M, Wirth B. 2014. Exploring the geometry of the space of shells. Comput. Graph. Forum 33, 247–256. ( 10.1111/cgf.12450) [DOI] [Google Scholar]
- 36.Wirth B, Bar L, Rumpf M, Sapiro G. 2011. A continuum mechanical approach to geodesics in shape space. Int. J. Comput. Vis. 93, 293–318. ( 10.1007/s11263-010-0416-9) [DOI] [Google Scholar]
- 37.Rumpf M, Wirth B. 2015. Variational time discretization of geodesic calculus. IMA J. Numer. Anal 35, 1011–1046. ( 10.1093/imanum/dru027) [DOI] [Google Scholar]
- 38.Onsager L, Machlup S. 1953. Fluctuations and irreversible processes. Phys. Rev. 91, 1505–1512. ( 10.1103/PhysRev.91.1505) [DOI] [Google Scholar]
- 39.Cartan E. 1933. Sur les espaces de finsler. C. R. Acad. Sci. Paris 196, 582–586. [Google Scholar]
- 40.Doniach S, Franklin J, Koehl P, Delarue M. 2007. MinActionPath: maximum likelihood trajectory for large-scale structural transitions in a coarse-grained locally harmonic energy landscape. Nucl. Acids Res. 35, W477–W482. ( 10.1093/nar/gkm342) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Koehl P. 2016. Minimum action transition paths connecting minima on an energy surface. J. Chem. Phys. 145, 184111 ( 10.1063/1.4966974) [DOI] [PubMed] [Google Scholar]
- 42.Kurtek S, Srivastava A, Klassen E, Laga H. 2013. Landmark-guided elastic shape analysis of sherically parameterized surfaces. Comput. Graph. Forum 32, 429–438. ( 10.1111/cgf.12063) [DOI] [Google Scholar]
- 43.Kurtek S, Jermyn IA, Xie Q, Klassen E, Laga H. 2016. Elastic shape analysis of surfaces and images. In Riemannian computing in computer vision (eds Turaga PK, Srivastava A), pp. 257–277. Cham, Switzerland: Springer International Publishing. [Google Scholar]
- 44.Laga H, Xie Q, Jermyn IH, Srivastava A. In press Numerical inversion of SRNF maps for elastic shape analysis of genus-zero surfaces. IEEE Trans. Pattern Anal. Mach. Intell. ( 10.1109/TPAMI.2016.2647596) [DOI] [PubMed] [Google Scholar]
- 45.Koehl P, Hass J. 2014. Automatic alignment of genus-zero surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 36, 466–478. ( 10.1109/TPAMI.2013.139) [DOI] [PubMed] [Google Scholar]
- 46.Hass J, Koehl P. 2014. How round is a protein? Exploring protein structures for globularity using conformal mapping. Front. Mol. Biosci. 1, 26 ( 10.3389/fmolb.2014.00026) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Koehl P, Hass J. 2015. Landmark-free geometric methods in biological shape analysis. J. R. Soc. Interface 12, 20150795 ( 10.1098/rsif.2015.0795) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Alexa M, Cohen-Or D, Levin D. 2000. As-rigid-as-possible shape interpolation. In Proc. 27th Annu. Conf. on Computer Graphics and Interactive Techniques, pp. 157–164. New York, NY: ACM Press. [Google Scholar]
- 49.Sorkine O, Alexa M. 2007. As-rigid-as-possible surface modeling. In Proc. 5th Eurographics Symp. on Geometry Processing, Barcelona, Spain, 4–6 July 2007, pp. 109–116. Aire-la-Ville, Switzerland: Eurographics Association. [Google Scholar]
- 50.Levi Z, Gotsman C. 2015. Smooth rotation enhanced as-rigid-as-possible mesh animation. IEEE Trans. Vis. Comput. Graph. 21, 264–277. ( 10.1109/TVCG.2014.2359463) [DOI] [PubMed] [Google Scholar]
- 51.Grinspun E, Hirani AN, Desbrun M, Schröder P. 2003. Discrete shells. In Proc. 2003 ACM SIGGRAPH/Eurographics Symp. on Computer Animation, San Diego, CA, USA, 26–27 July 2003, pp. 62–67. Aire-la-Ville, Switzerland: Eurographics Association. [Google Scholar]
- 52.Tirion M. 1996. Large amplitude elastic motions in proteins from a single parameter, atomic analysis. Phys. Rev. Lett. 77, 1905–1908. ( 10.1103/PhysRevLett.77.1905) [DOI] [PubMed] [Google Scholar]
- 53.Kilian M, Mitra NJ, Pottmann H. 2007. Geometric modeling in shape space. ACM Trans. Graph. 26, 64 ( 10.1145/1276377.1276457) [DOI] [Google Scholar]
- 54.Bronstein AM, Bronstein MM, Kimmel R. 2008. Numerical geometry of non-rigid shapes. New York, NY: Springer Science+Business. [Google Scholar]
- 55.Gelfand IM, Fomin SV. 2000. Calculus of variations. New York, NY: Dover. [Google Scholar]
- 56.Floater MS, Hormann K. 2005. Surface parameterization: a tutorial and survey. In Advances in multiresolution for geometric modelling (eds Dodgson NA, Floater MS, Sabin MA). Mathematics and Visualization, pp. 157–186. Berlin, Germany: Springer. [Google Scholar]
- 57.Sheffer A, Praun E, Rose K. 2006. Mesh parameterization methods and their applications. Found. Trends Comput. Graph. Vis. 2, 105–171. ( 10.1561/0600000011) [DOI] [Google Scholar]
- 58.Lipman Y, Funkhouser T. 2009. Möbius voting for surface correspondence. ACM Trans. Graph. 28, 72–83. ( 10.1145/1531326.1531378) [DOI] [Google Scholar]
- 59.Springborn B, Schröder P, Pinkall U. 2008. Conformal equivalence of triangle meshes. ACM Trans. Graph. 27, 77 ( 10.1145/1360612.1360676) [DOI] [Google Scholar]
- 60.Hass J, Koehl P. 2015. A metric for genus-zero surfaces. (https://arxiv.org/abs/1507.00798) [Google Scholar]
- 61.Sanejouand YH. 2013. Elastic network models: theoretical and empirical foundations. Methods Mol. Biol. 914, 601–616. ( 10.1007/978-1-62703-017-5_23) [DOI] [PubMed] [Google Scholar]
- 62.Lopez-Blanco JR, Chacon P. 2016. New generation of elastic network models. Curr. Opin. Struct. Biol. 37, 46–53. ( 10.1016/j.sbi.2015.11.013) [DOI] [PubMed] [Google Scholar]
- 63.Fawcett T. 2006. An introduction to ROC analysis. Pattern Recogn. Lett. 27, 861–874. ( 10.1016/j.patrec.2005.10.010) [DOI] [Google Scholar]
- 64.Bowie JU, Lüthy R, Eisenberg D. 1991. A method to identify protein sequences that fold into a known three-dimensional structure. Science 253, 164–170. ( 10.1126/science.1853201) [DOI] [PubMed] [Google Scholar]
- 65.Felsenstein J. 1989. PHYLIP—phylogeny inference package (version 3.2). Cladistics 5, 164–166. [Google Scholar]
- 66.Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. 2013. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729. ( 10.1093/molbev/mst197) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Robinson DF, Foulds LR. 1981. Comparison of phylogenetic trees. Math. Biosci. 53, 131–147. ( 10.1016/0025-5564(81)90043-2) [DOI] [Google Scholar]
- 68.Houle D, Govindaraju DR, Omholt S. 2010. Phenomics: the next challenge. Nat. Rev. 11, 855–866. ( 10.1038/nrg2897) [DOI] [PubMed] [Google Scholar]
- 69.Doolittle WF. 1999. Phylogenetic classification and the universal tree. Science 284, 2124–2129. ( 10.1126/science.284.5423.2124) [DOI] [PubMed] [Google Scholar]
- 70.Wiens JJ. 2000. Phylogenetic analysis of morphological data. Washington, DC: Smithsonian Books. [Google Scholar]
- 71.MacLead N, Benfield M, Culverhouse P. 2010. Time to automate identification. Nature 467, 154–155. ( 10.1038/467154a) [DOI] [PubMed] [Google Scholar]
- 72.Boyer DM, Puente J, Gladman JT, Glynn C, Mukherjee S, Yapuncich GS, Daubechies I. 2015. A new fully automated approach for aligning and comparing shapes. Anat. Rec. 298, 249–276. ( 10.1002/ar.23084) [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The TOSCA high-resolution dataset is available at http://tosca.cs.technion.ac.il. The biological datasets, representing three regions of the skeletal anatomy of a collection of primates, are available on Yaron Lipman's website (http://www.wisdom.weizmann.ac.il/~ylipman/CPsurfcomp/).













