Abstract
Particle-based shape modeling (PSM) is a popular approach to automatically quantify shape variability in populations of anatomies. The PSM family of methods employs optimization to automatically populate a dense set of corresponding particles (as pseudo landmarks) on 3D surfaces to allow subsequent shape analysis. A recent deep learning approach leverages implicit radial basis function representations of shapes to better adapt to the underlying complex geometry of anatomies. Here, we propose an adaptation of this method using a traditional optimization approach that allows more precise control over the desired characteristics of models by leveraging both an eigenshape and a correspondence loss. Furthermore, the proposed approach avoids using a black-box model and allows more freedom for particles to navigate the underlying surfaces, yielding more informative statistical models. We demonstrate the efficacy of the proposed approach to state-of-the-art methods on two real datasets and justify our choice of losses empirically.
Keywords: Statistical Shape Modeling, Optimization, Radial Basis Function Interpolation, Polyharmonic Splines
1. INTRODUCTION
Statistical Shape Modeling (SSM) is a family of techniques used to quantify shape variability given a population of anatomies. SSM has advanced the understanding of different diseases and disorders [1-6]. SSM relies on representing shape variation in a compact way to perform subsequent statistical analyses. The two most popular shape representations used for building SSMs are deformation fields and landmarks. Deformation fields encode implicit transformations between population samples and a pre-defined (or learned) atlas. Landmarks are explicit points spread on shape surfaces that correspond across the population and serve as proxies to the surface for subsequent statistical analysis [7]. Shape analysis used to be performed by manually placing landmarks and performing statistics on them, but this was labor-intensive, not reproducible, and required extensive domain expertise (e.g., radiologists). Modern computational methods allow the automatic computation of deformations or placement of landmarks, shifting the SSM field to data-driven characterization of population-level variabilities that is objective, reproducible, and scalable. These methods include minimum description length – MDL [8], particle-based shape modeling – PSM [9] for landmarks, and frameworks based on Large Deformation Diffeomorphic Metric Mapping for deformation fields (e.g., [10]).
Landmark-based representations, also known as point distribution models (PDMs), have been favored for analyzing surface variability due to their simplicity, computational efficiency, and interpretability for statistical analyses [7]. However, such scalability suffers when considering that some applications require thousands of densely placed landmarks to capture shape structure faithfully when representing intricate shape surfaces with localized or convoluted shape features that may live between landmarks. The state-of-the-art PDM method, PSM [9], produces dense particle placements. However, it provides a uniform surface sampling, and thus fail to adapt to the underlying geometrical intricacies at different scales.
Image2SSM [11] first proposed the use of radial basis function shape representations (RBF-shapes) for the SSM task. RBF-shapes are continuous surface representations instead of simply discrete surface proxies. They allow particles to self-adapt to poorly represented surface areas while building the statistical model. Despite its benefits, the black-box nature of such a deep network optimization undermines the user’s ability to understand and control the process of building SSMs, not to mention the computational cost of parameterizing the PDM construction with a deep network. Furthermore, parameterizing landmarks using a network limits particles manipulation at both training and inference times, leading to poor surface adherence and configuration congruency.
To address these limitations, this paper proposes a non-deep optimization-based solution that leverages RBF-shapes to build SSMs. It has the advantage of being self-adaptive to the underlying geometry and can further reduce the number of required particles for intricate surfaces. Additionally, not using a black box model deep learning approach gives our method much more control over the particle spread and placement since they are not parametrized with a deep neural network. We also introduce the simultaneous use of correspondence and eigenshape losses to improve configuration consistency while emphasizing dominant modes of variation. Additionally, enhancing traditional optimization techniques contributes to the advancement of the deep learning variants for PDM construction (e.g., [12-15]) by providing stronger shape priors to train the networks. We show that our method performs similarly or better than PSM [9] and Image2SSM [11] on real datasets in correspondence metrics and two-way surface-to-surface distance. We also show how the correspondence and eigenshape losses function to build better PDMs. The proposed method will be integrated into the open-source package for building PDMs, ShapeWorks [9].
2. METHODS
A statistical shape model should balance an accurate geometrical representation of the individual surfaces in the given cohort while preserving landmark configuration congruency across the surfaces, referred to as correspondence, encoding shape variations in a compact set of parameters. The proposed method optimizes four losses using stochastic gradient descent to achieve such a balance by explicitly emphasizing and controlling the different characteristics of an SSM. The objective is to optimize for a set of landmarks (or control points in RBF literature). These control points are fit to a cohort of shapes in the form of segmentations . We optimize for a set of control points for each input shape, where the shape PDM is denoted by and . The RBF-shape can then reconstruct a full surface using these points and their surface normals by interpolating between them. This provides online information to the model regarding how well each part of the surface is represented, leading to adaptive updates.
This section will first explain how the RBF-shape can estimate reconstruction error at each region and how each of the four losses is involved in the optimization.
2.1. Radial Basis Function Surface Representation
RBF-shape is an implicit surface representation based on radial basis functions, which can represent intricate shapes by leveraging both surface control points and normals to inform shape reconstructions [16,17]. It uses a set of control points at the zero-level set and a pair of off-surface points (aka dipoles) with a signed distance and along the surface normal of each control point. This is illustrated in Fig. 1.
Fig. 1.
A set of control points and their respective dipoles can define an implicit signed distance function within a narrow band that can be efficiently queried for surface representation accuracy at any narrow band point.
We refer to the set of control points and their dipoles as for shape , where with . The shape’s implicit function, which maps a point to its signed distance to the surface can then be written as:
(1) |
where can be any RBF basis function (e.g., the thin plate spline , the biharmonic or the triharmonic ) and and encode the linear component of the surface. We obtain by solving a system of equations formed by Eq 1 over and regularization factors to form a fully determined system, fully described in [16, 17]. The function provides an implicit shape representation that is resolution agnostic and can be queried to construct segmentations and surface meshes at arbitrary resolution. This representation excels at adapting to the underlying surface geometry, leading to better shape representation using fewer control points.
2.2. SSM Optimization
Control points for each surface are initialized using the same set of points sampled uniformly at random. During the optimization, the four losses ensure that the control points (1) live on the surface, (2) maintain the same configuration across all shapes, (3) push the population-level variability to dominant modes of variation, and (4) are spread according to the reconstruction error of the RBF-shape representation. Together, these losses promote a balance between the geometric representation of the individual surfaces and maintaining consistent placement of control points across the population while resulting in a compact statistical model.
Surface loss:
The surface loss aims to force control points to lie on the surface. For each shape , we define this loss as
(2) |
where is the true distance to the closest surface point to , obtained by querying the signed distance transform.
Correspondence loss:
The notion of control points correspondence across the shape population can be quantified by the Frobenius norm of the covariance of the control points. In simple terms, this loss attempts to collapse each control point to its mean across all shapes. It is written as
(3) |
where is the mean control points, and the Frobenius norm.
Eigenshape loss:
This loss pushes control points to align with the most dominant modes of variability. In [9], this loss is the one used as the correspondence loss (derived from [18]). This objective does not improve correspondence of the control points, but rather improves compactness by aligning control points to the principal components. This observation aligns with the source claims about this loss in [18], which described this objective as being equivalent to minimizing the Mahalanobis distance of the system. However, due to the eigenvalue scaling inherent in the Mahalanobis distance definition, missed correspondences get captured by the principal components and are not corrected by this loss, making it a poor loss to enforce correspondence. This shortcoming is shown empirically in our results section, hence our choice of using the Frobenius norm loss to enforce correspondence. Nonetheless, including this loss is imperative to improving the compactness of the resulting shape model, which is a desirable property in the analysis that does not severely interfere with good sampling and correspondence.
Given a minibatch of size , the eigenshape loss is formulated using the differential entropy of the samples in the minibatch, assuming a Gaussian distribution.
(4) |
where here indicates the random variable of the shape space and is the matrix determinant. Both the correspondence and eigenshape loss are triggered starting from the second epoch as soon as a mean particle set is available.
Sampling loss:
This loss encourages the control points to adapt to the underlying geometry. We query the distance at randomly sampled points that lie within a narrow band at distance from the surface. The control points are then guided to poorly represented areas by minimizing their distance to the surrounding narrow band points, scaled by the queried approximate violation intensity.
To define surrounding narrow band points, let be the matrix containing the pairwise distances between each narrow band point and each control point , such that, for the shape, the , element is . We use a soft minimum function to allow each narrow band point to influence all nearby control points, written as , where the , element of is computed as . This soft minimum is normalized over . Finally, the RBF approximation squared error at the narrow band points is captured by , where . Let , where is a ones-vector of size . The sampling loss is then written as
(5) |
where indicates the Hadamard (elementwise) multiplication of matrices, is a weighting variable, and mean computes the average over all the matrix elements. Adding 1 to gives each narrow band point at least a unit pull.
Total loss:
We optimize minibatches of size at a time with a total loss written as follows:
(6) |
where , , , and are hyperparameters of each losses and , are the control points and their surface normals of the samples in the minibatch.
3. RESULTS
Our results demonstrate that the proposed method outperforms Image2SSM [11] in both two-way surface-to-surface distance and correspondence metrics while improving slightly over PSM [9]. Additionally, we showcase the effect of the correspondence loss and the eigenshape loss on a single major axis variation ellipsoid toy example to justify our choice of loss functions.
3.1. Datasets and Implementation Details
We perform tests on two real datasets. First, a dataset of signed distance transforms from 40 proximal femur CT scans devoid of pathologies. The second dataset consists of 814 left atrium signed distance transforms, a challenging dataset due to size and high shape variability.
We used the PyTorch 1.12.1 Autograd functionality to automatically backpropagate our losses using the SGD optimizer. As a preprocessing step, a set of uniformly sampled control points is used to initialize the optimization on a single shape. These control points then serve as the initialization for all shapes during the optimization of all control points. This preprocessing is not strictly necessary as the main optimization can be initialized with the same random set of points, but it reduces the total number of iterations needed by leveraging this reasonable prior. This is in contrast to the PSM initialization, which utilizes a multi-scale particle splitting routine to initialize the shape model.
At each epoch, 10,000 narrow band points are sampled from the narrow band of each surface to accelerate computation. We used the biharmonic kernel for the basis function, but the results are observed to be agnostic to this choice. The hyperparameters we used are learning_rate = 1, α = 0.1, β = 1.0, γ = 0.01, ζ = 0.01, and c = 10 for femurs; changing ζ = 0.1, γ = 0.001, ζ = 0.001 for left atria; and α = 0.01, and c = 1 for ellipsoids.
3.2. Shape Model Quality
We evaluate our approach against the state-of-the-art Image2SSM [11] and PSM [9] in building a model with 128 particles. Fig. 2 shows that our approach matches or outperforms Image2SSM and PSM in the maximum two-way surface-to-surface distance of the reconstruction while yielding a slightly poorer mean distance than PSM. It resolves the worst surface misrepresentations but fails to capture the general surface as closely without an explicit snap-to-surface routine like PSM uses.
Fig. 2.
The two-way surface-to-surface distance between Image2SSM, PSM, and our approach.
Fig. 3 shows conventional metrics of compactness (percentage of variance captured), specificity (ability to generate realistic shapes), and generalization (ability to represent unseen shape instances) [19] for femurs and left atria compared to Image2SSM and PSM for the first ten modes of variation, where it yields comparable compactness but better specificity and generalization.
Fig. 3.
Shows the compactness (higher is better), specificity (lower is better), and generalization (lower is better) for ten modes on femurs and left atria.
3.3. Correspondence and Eigenshape Losses
For simplicity, we show the effects of the correspondence and eigenshape loss on a population of ellipsoids where only the major axis varies. Fig. 4(a) shows how a missed correspondence is not only not fixed by the correspondence loss, but is captured in its main mode of variation. Fig. 4(b) shows how even with perfect correspondence, courtesy of the correspondence loss, the percentage variance of the main mode and compactness are worse than 4(a) due to the lack of eigenshape loss. Fig. 4(c) shows how we can achieve a compact model with good correspondences using both losses. All in all, this shows not only that the eigenshape loss is inadequate to resolve correspondences, but it can also generate spurious variability in the main modes of variation.
Fig. 4.
By changing and , we show the results of running our optimization with (a) only correspondence loss, (b) only eigenshape loss, and (c) both. The red circle highlights two control points that are swapped for 6 out of 20 shapes.
4. CONCLUSION
We demonstrate an approach for improving PDM construction using a traditional optimization technique. Our proposed losses offer new insights regarding the development of correspondence models that better capture variability while being faithful to the surface representations. Future work involves fully exploring the effect of existing losses for correspondence and how they interact against the eigenshape loss, as well as furthering RBF-shape’s ability to capture surface error to enable surface-informed non-uniform sampling.
Acknowledgments
The National Institutes of Health supported this work under grant numbers NIBIB-U24EB029011 and NIAMS-R01AR076120. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Footnotes
COMPLIANCE WITH ETHICAL STANDARDS
This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Institutional Review Board at the University of Utah IRB#00056086 for the femur dataset and IRB#00072747 for the left atrium dataset.
6. REFERENCES
- [1].Atkins Penny R, Elhabian Shireen Y, Agrawal Praful, Harris Michael D, Whitaker Ross T, Weiss Jeffrey A, Peters Christopher L, and Anderson Andrew E, “Quantitative comparison of cortical bone thickness using correspondence-based shape modeling in patients with cam femoroacetabular impingement,” Journal of Orthopaedic Research, vol. 35, no. 8, pp. 1743–1753, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Bhalodia Riddhish, Dvoracek Lucas A, Ayyash Ali M, Kavan Ladislav, Whitaker Ross, and Goldstein Jesse A, “Quantifying the severity of metopic craniosynostosis: A pilot study application of machine learning in craniofacial surgery,” Journal of Craniofacial Surgery, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Lenz Amy, Krähenbühl Nicola, Peterson Andrew, Lisonbee Rich, Hintermann Beat, Saltzman Charles, Barg Alexej, and Anderson Andrew, PhD, “Statistical shape modeling of the talocrural joint using a hybrid multi-articulation joint approach,” Scientific Reports, vol. 11, pp. 7314, April 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].van Buuren MMA, Arden NK, Bierma-Zeinstra SMA, Bramer WM, Casartelli NC, Felson DT, Jones G, Lane NE, Lindner C, Maffiuletti NA, van Meurs JBJ, Nelson AE, Nevitt MC, Valenzuela PL, Verhaar JAN, Weinans H, and Agricola R, “Statistical shape modeling of the hip and the association with hip osteoarthritis: a systematic review,” Osteoarthritis and Cartilage, vol. 29, no. 5, pp. 607–618, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Merle Christian, Innmann Moritz M, Waldstein Wenzel, Pegg Elise C, Aldinger Peter R, Gill Harinderjit S, Murray David W, and Grammatopoulos George, “High variability of acetabular offset in primary hip osteoarthritis influences acetabular reaming—a computed tomography–based anatomic study,” The Journal of Arthroplasty, vol. 34, no. 8, pp. 1808–1814, 2019. [DOI] [PubMed] [Google Scholar]
- [6].Bruse Jan L, McLeod Kristin, Biglino Giovanni, Ntsinjana Hopewell N, Capelli Claudio, Hsia Tain-Yen, Sermesant Maxime, Pennec Xavier, Taylor Andrew M, Schievano Silvia, et al. , “A statistical shape modelling framework to extract 3d shape biomarkers from medical imaging data: assessing arch morphology of repaired coarctation of the aorta,” BMC medical imaging, vol. 16, pp. 1–19, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Sarkalkan Nazli, Weinans Harrie, and Zadpoor Amir A, “Statistical shape and appearance models of bones,” Bone, vol. 60, pp. 129–140, 2014. [DOI] [PubMed] [Google Scholar]
- [8].Davies Rhodri H, Twining Carole J, Cootes Timothy F, Waterton John C, and Taylor Christopher J, “A minimum description length approach to statistical shape modeling,” IEEE transactions on medical imaging, vol. 21, no. 5, pp. 525–537, 2002. [DOI] [PubMed] [Google Scholar]
- [9].Cates Joshua, Elhabian Shireen, and Whitaker Ross, “Shapeworks: particle-based shape correspondence and visualization software,” in Statistical Shape and Deformation Analysis, pp. 257–298. Elsevier, 2017. [Google Scholar]
- [10].Durrleman Stanley, Prastawa Marcel, Charon Nicolas, Korenberg Julie R, Joshi Sarang, Gerig Guido and Alain Trouvé, “Morphometry of anatomical shape complexes with dense deformations and sparse parameters,” NeuroImage, vol. 101, pp. 35–49, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Xu Hong and Elhabian Shireen Y., “Image2ssm: Reimagining statistical shape models from images with radial basis functions,” in Medical Image Computing and Computer Assisted Intervention — MICCAI 2023, Greenspan Hayit, Madabhushi Anant, Mousavi Parvin, Salcudean Septimiu, Duncan James, Syeda-Mahmood Tanveer, and Taylor Russell, Eds., Cham, 2023, pp. 508–517, Springer Nature Switzerland. [Google Scholar]
- [12].Bhalodia Riddhish, Elhabian Shireen Y., Kavan Ladislav, and Whitaker Ross T., “Deepssm: A deep learning framework for statistical shape modeling from raw images,” in Shape In Medical Imaging at MICCAI. 2018, vol. 11167 of Lecture Notes in Computer Science, pp. 244–257, Springer. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Adams Jadie, Bhalodia Riddhish, and Elhabian Shireen, “Uncertain-deepssm: From images to probabilistic shape models,” Shape in Medical Imaging : International Workshop, ShapeMI 2020, Held in Conjunction with MICCAI 2020, Lima, Peru, October 4, 2020, Proceedings, vol. 12474, pp. 57–72, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Adams Jadie and Elhabian Shireen, “From images to probabilistic anatomical shapes: A deep variational bottleneck approach,” in Medical Image Computing and Computer Assisted Intervention—MICCAI 2022: 25th International Conference, Singapore, September 18–22, 2022, Proceedings, Part II. Springer, 2022, pp. 474–484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Adams Jadie and Elhabian Shireen Y, “Fully bayesian vib-deepssm,” in International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 2023, pp. 346–356. [Google Scholar]
- [16].Turk Greg and O’Brien James F., “Variational implicit surfaces,” 1999. [Google Scholar]
- [17].Carr JC, Beatson RK, Cherrie JB, Mitchell TJ, Fright WR, McCallum BC, and Evans TR, “Reconstruction and representation of 3d objects with radial basis functions,” in Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, New York, NY, USA, 2001, SIGGRAPH ’01, p. 67–76, Association for Computing Machinery. [Google Scholar]
- [18].Kotcheff Aaron C.W. and Taylor Chris J., “Automatic construction of eigenshape models by direct optimization,” Medical Image Analysis, vol. 2, no. 4, pp. 303–314, 1998. [DOI] [PubMed] [Google Scholar]
- [19].Davies Rhodri H., “Learning shape: Optimal models for analysing natural variability,” 2004. [Google Scholar]