Abstract
Data-driven methods are becoming an essential part of computational mechanics due to their advantages over traditional material modeling. Deep neural networks are able to learn complex material response without the constraints of closed-form models. However, data-driven approaches do not a priori satisfy physics-based mathematical requirements such as polyconvexity, a condition needed for the existence of minimizers for boundary value problems in elasticity. In this study, we use a recent class of neural networks, neural ordinary differential equations (N-ODEs), to develop data-driven material models that automatically satisfy polyconvexity of the strain energy. We take advantage of the properties of ordinary differential equations to create monotonic functions that approximate the derivatives of the strain energy with respect to deformation invariants. The monotonicity of the derivatives guarantees the convexity of the energy. The N-ODE material model is able to capture synthetic data generated from closed-form material models, and it outperforms conventional models when tested against experimental data on skin, a highly nonlinear and anisotropic material. We also showcase the use of the N-ODE material model in finite element simulations of reconstructive surgery. The framework is general and can be used to model a large class of materials, especially biological soft tissues. We therefore expect our methodology to further enable data-driven methods in computational mechanics.
Keywords: Machine learning, Constitutive modeling, Nonlinear finite elements, Skin mechanics
1. Introduction
The past two decades have witnessed the emergence of data-driven models for describing various engineering systems and physical phenomena in wide ranging applications, from improving medical diagnosis [1], to weather forecasting [2], energetic material property prediction [3], and turbulence modeling [4]. Data-driven methods such as deep neural networks are becoming popular in materials science and materials engineering in applications such as fatigue life prediction [5], identification of material parameters [6], systems identification [7], and constitutive modeling [8, 9, 10, 11, 12].
The notion of a free energy function, and the definition of the stress by differentiation of the free energy with respect to the deformation lies at the core of nonlinear elasticity, from hyperelastic to energy dissipating materials [13]. The stress, in turn, has to be a strongly elliptic function of the deformation gradient in order to satisfy the existence of traveling waves with real wave speeds [14]. For hyperelastic materials, Morrey showed that there exists a deformation field that minimizes the total potential energy if the strain energy density is a quasiconvex function of the deformation gradient with some additional growth conditions on this function [15]. In practice, it is difficult to check the quasiconvexity condition. Instead, Ball showed that a reasonable requirement is the polyconvexity of the strain energy density [16]. Polyconvexity implies quasiconvexity and is a stronger requirement than quasiconvexity. Yet, it is still flexible enough to capture realistic materials. Convexity of the strain energy density with respect to would also guarantee the existence of minimizers for potential energy functionals, but convexity with respect to is not compatible with large deformation elasticity; this condition is too strong [17]. For example, convexity with respect to is not compatible with the principle of frame indifference. Traditionally, material model development has relied on expressing the strain energy as an explicit and differentiable analytical function of the deformation [18, 19, 20]. The notion of polyconvexity has been an important factor in the development of these closed-form material models [21]. In particular, advances in modeling of soft tissue material behavior has prompted the development of nonlinear and anisotropic strain energies that satisfy polyconvexity [22].
Selection of an appropriate expert-constructed material model for a specific material usually requires advanced expertise and significant trial and error. There are a large number of models in the literature designed to fit different families of materials. Even in specific fields such as skin mechanics there is no consensus on the choice of a material model [23, 24, 25]. Furthermore, the analytical form of the strain energy function may be too restrictive for many applications, resulting in poor prediction performance. Sensitivity with respect to parameters is another issue, for instance when exponential models are used [26].
A recent trend in material modeling has been the use of deep neural networks to describe either the strain energy or its derivatives [9, 12, 27, 28]. Neural networks of sufficient complexity can be used to learn, reproduce and predict the behavior of any elastic material, which resolves many issues of expert-constructed models. However, the convexity of the strain energy function in data-driven frameworks is usually ignored or enforced with additional loss terms [9, 27, 12]. Using this approach does not guarantee the convexity of the strain energy. The penalty terms will push the solution space to resemble that of a convex function in the training region. However, even if convexity checks are satisfied exactly on all the training points, there is no guarantee that convexity extends outside the training points, or beyond the boundaries of the training region. Furthermore, adding loss terms to ensure convexity adds to the nonlinearity of the loss space which makes finding the global minimum a daunting task. These loss terms also result in lengthy calculations during training and may limit the flexibility of the neural networks. Thus, there is a need for data-driven methods that automatically satisfy the polyconvexity requirements but can still capture the behavior of any material.
In this study we use a new type of neural networks known as neural ordinary differential equations (N-ODE) [29] to estimate the derivatives of the strain energy function with respect to invariants of deformation. Polyconvexity of the strain energy is automatically satisfied in our formulation. We train the N-ODEs using synthetic as well as experimental stress-stretch data from porcine skin biaxial experiments obtained from [12]. Finally, we demonstrate the applicability of the N-ODE material model in finite element simulations. The diagram in Fig. 1 shows an overview of our methodology.
Figure 1:
Workflow of the training and inference processes in the N-ODE material model (left), continuous transformation of the hidden state in the N-ODE from modified invariants , to the derivatives of the strain energy (right).
2. Materials and Methods
2.1. Construction of polyconvex strain energy functions
As mentioned previously, we seek to satisfy the polyconvexity of the strain energy function since it is compatible with large deformation hyperelasticity and, with additional growth conditions on the strain energy, guarantees the existence of minimizers for the total potential energy [22, 16, 30, 17]. A function is polyconvex in if there is a function with such that is convex on the extended domain formed by , , . Constructing general functions that satisfy this requirement can be challenging, but a sufficiently flexible subset of polyconvex functions of can be obtained through an additive decomposition [31]
(1) |
with , , each a convex function. Furthermore, to guarantee objectivity of the strain energy, is usually not expressed as a function of the full deformation gradient but instead of the right Cauchy Green deformation . In that case the following form of the strain energy will retain polyconvexity with respect to ,
(2) |
provided are convex non-decreasing functions of their respective arguments [31], with , the first two invariants of , and . The reason why constructing a function based on (2) yields polyconvex functions as expressed in (1) is because the invariants , of are convex non-decreasing functions of and , respectively. Explicitly, the invariants of , , , are related to , cof ,
(3) |
(4) |
To consider anisotropy, additional pseudo-invariants need to be considered. A popular option is , where is a structural tensor defined by the anisotropy direction , which usually corresponds to a fiber orientation in soft tissue in the reference configuration. It can be shown that is a convex function of . Consequently, we can include additional terms that depend on in the strain energy function retaining polyconvexity while including the effects of anisotropy. We consider two different fiber directions and , corresponding to the vectors and in the reference configuration. Lastly, we note that since , , are convex functions of , and is convex with respect to , we can compose convex non-decreasing functions of linear combinations of these invariants and retain polyconvexity [31, 22, 32]. Here, we propose the following general form of the strain energy function:
(5) |
where the second to last term includes convex non-decreasing functions of all possible linear combinations of the invariants introduced so far. The weights are restricted to the interval (0, 1). We could include additional terms depending on the specific application, for instance, anisotropic invariants that are convex with respect to , such as , can be added to this framework.
Turning the attention back to (5), each of the and needs to be convex non-decreasing. This is equivalent to the derivatives being monotonic functions, with in the domain of . In the next section we show how to leverage N-ODEs to generate monotonic derivative functions and thus polyconvex strain energies.
2.2. Neural ordinary differential equations
We are interested in finding functions that create monotonic maps between inputs and outputs . N-ODEs are a novel architecture of neural networks that generalizes some successful models, such as residual networks [33]. The key idea is to replace the number of discrete layers of classical neural networks by a continuous transformation of the hidden state by a learnable function. In this sense, the concept of depth of the neural network is replaced by time. This results in the construction of an ordinary differential equation system:
(6) |
Here represents the hidden state of the neural network. The function is represented by a fully-connected neural network with trainable parameters . The relationship between the input and the output for this model can be obtained by integrating the system in an arbitrary interval of time:
(7) |
Here, we assign the input to the initial condition and the output to the final state .
We know from ordinary differential equation analysis that the solution trajectories never intersect each other in the state space, as illustrated in Fig. 3. Intersection of trajectories would imply that for a given point in the state space there is more than one value for the rate of change , which contravenes the definition of the ordinary differential equation system. In one-dimensional case, this condition implies that for two different trajectories and
(8) |
(9) |
These conditions can be succinctly written in terms of the input and output of the neural ordinary differential equations as:
(10) |
which correspond exactly to the requirements for a monotonic function. To satisfy the polyconvexity requirements, we also need to ensure that these functions are non-negative in the domain of the input. To achieve this property, we need to ensure that
(11) |
where is lowest possible value of the input. Although there are multiple ways to satisfy this condition, here we focus on the simplest case, where and . This particular scenario can be achieved by shifting the inputs and removing all the bias parameters from the neural network that approximates and setting the initial condition of the ODE as . In this case, the neural network that approximates the right-hand side of the ordinary differential equation can be written as:
(12) |
where is a non-linear activation function applied element-wise, is the depth of the network and are the learnable parameters. With this setup, we have shown that the functions approximated by one-dimensional N-ODEs are monotonic and non-negative, and suitable to construct polyconvex strain energy functions.
Figure 3:
An illustration of how the N-ODE maps points in the input space to partial derivatives of the strain energy. Time evolution of (a) to , (b) to , (c) to and (d) to , and, (e) the shape of the resulting functions . The notation is used to denote shifted invariants as inputs to the N-ODE in order to satisfy the initial condition , which is one solution to achieve non-negative derivatives with a vanishing stress at the identity map.
Anisotropic, hyperelastic, and fully incompressible materials
To demonstrate the potential of the proposed framework, we will attempt to learn an anisotropic, hyperelastic and incompressible materials. We have introduced the strain energy, , which is a function of the right Cauchy-Green deformation tensor, , and two material direction vectors, and . For incompressible materials, the last term of the strain energy, , is replaced by the constraint with a Lagrange multiplier. Furthermore, in the case of hyperelastic materials the free energy has no other contribution. The second Piola-Kirchhoff stress tensor, , follows from the Doyle-Erickson formula by differentiating the strain energy with respect to [34],
(13) |
Often times in finite element packages, the Cauchy stress, , is used, which can be obtained via a push forward operation. To solve the equilibrium equations we also need the elasticity tensor:
(14) |
The full expansion of the elasticity tensor is shown in the Supplement. Also note that if the finite element implementation is done in the deformed configuration, the spatial version of the elasticity tensor, , can be obtained via a push forward operation.
Therefore, given N-ODEs describing the strain energy derivatives, these functions can be used directly to determine the stress for a given deformation by evaluating (13). For nonlinear finite element simulations, evaluation of the tangent (14) requires the second derivatives of the strain energy, which in our case involves taking derivatives of the N-ODEs with respect to their inputs.
Data-driven polyconvex constitutive models using neural ordinary differential equations
We have shown that N-ODEs produce monotonic functions. We therefore use this tool to build the derivatives of the terms in eq. (5), i.e. , , etc. as illustrated in Fig. 1. To guarantee that the partial derivatives with respect to the invariants are consistent with a unique potential, we use N-ODEs to interpolate the derivatives of the individual contributions to the energy listed in eq. (5). For instance, for the terms in eq. (5) that depend on a single invariant, a N-ODE is used to capture each. For terms that depend on pairs of invariants, consider the change of variable , then, a N-ODE is used to capture the derivative function . By producing monotonic non-negative derivative functions, we guarantee that the underlying strain energy terms are convex non-decreasing functions of the invariants (or linear combinations of them), which in turn guarantees polyconvexity of the energy with respect to F. A total of 10 N-ODEs are used, each corresponding to a derivative of the strain energy terms in eq. (5). The partial derivatives are assembled consistently from the derivatives and . An extended discussion of how the predicted derivatives are consistent with an underlying potential is presented in the Supplemental material. The architecture of each of the N-ODEs is the same and summarized in Table 1.
Table 1:
Neural network architecture
Layer | Number of nodes | Activation function |
---|---|---|
Input | 1 | None |
Hidden layer 1 | 5 | tanh |
Hidden layer 2 | 5 | tanh |
Output | 1 | Linear |
While the monotonicity condition is guaranteed directly by using N-ODEs, to satisfy the non-negative conditions, , we first note that , for an incompressible material [31]. For these invariants, setting shifted invariants , as inputs to the N-ODE and setting biases to zero gives the initial condition . A single non-negative bias is added after the evaluation of each of these two N-ODEs, i.e. , for , , resulting in , at , . In this way, the functions , are convex non-decreasing with a minimum of , at , , which allows us to capture polyconvex models such as Mooney-Rivlin and neo-Hookean. We remark that this formulation can be extended to compressible materials using modified version of , . Namely, the usual split of the deformation gradient into isochoric and volumetric parts results in the isochoric invariant , such that is polyconvex with respect to [31]. The isochoric invariant is not polyconvex with respect to . However, dividing by an appropriate power of , one can obtain modified invariants which are polyconvex [31].
For the anisotropic terms, the domain of the input is , , with for the identity map. We set shifted invariants , as the inputs of the N-ODEs and remove all biases to get , , and , for , . For the mixed terms in eq (5) we also set all biases to zero, but, additionally, we check if the output of the N-ODE is non-negative otherwise it is set to zero.
The N-ODE framework outlined here is general, it will always produce monotonic derivative functions and thus convex functions of the invariants. Specifying the minimum of the energy and the stress-free state can be achieved in variety of ways and we show one convenient solution. Note also that it is possible to incorporate anisotropic compression terms by considering the invariants and , which are convex with respect to cof and represent area changes orthogonal to the fibers [32, 22]. Extension to compressible materials is also straightforward. Lastly, even though we focus on hyperelasticity, polyconvex strain energies can be used as building blocks to describe dissipative mechanisms.
Model calibration and verification
The training and validation data for the N-ODEs are taken from biaxial tests on porcine skin. The data corresponds to five experimental protocols described in Table 2. In this case, the experiments are described in terms of the principal stretches of the deformation gradient:
(15) |
With the assumption of plane stress incompressible behavior, leads to plus a boundary condition to evaluate the pressure Lagrange multiplier .
Table 2:
Biaxial experimental protocols. and represent the stretches imposed in the and directions and is the stress in the direction. The parameter is monotonically increased during the experiments to stretch the tissue. The last component of the deformation gradient is directly obtained from the incompressible constraint, while the vanishing normal stress is imposed as the necessary boundary condition to solve for the pressure .
Loading | |||
---|---|---|---|
Off-x | 0 | ||
Off-y | 0 | ||
Equibiaxial | 0 | ||
Strip-x | 1 | 0 | |
Strip-y | 1 | 0 |
We also test the N-ODE against synthetic data generated using four popular analytical models summarized in the next subsection.
Training is performed with the ADAM optimization algorithm (with a step size of 1.e-4, , , ) in 100000 epochs with a batch size of 100 . Training is performed with JAX high-performance computing library for Python [35].
Holzapfel-Gasser-Ogden (HGO)
The HGO material model proposed in [19] assumes there are two families of fibers that contribute to the energy through an exponential term. The strain energy is,
(16) |
with dependence on two anisotropy directions ,
(17) |
The parameters controlling the anisotropic contribution are . The strain invariants are the same as defined in eq. (4). The isotropic contribution depends on the first invariant, , and is that of a neo-Hookean solid
(18) |
with parameter . The incompressibility constraint is imposed through the Lagrange multiplier .
Gasser-Ogden-Holzapfel (GOH)
The GOH material model was initially proposed to model arterial walls [18]. It is an extension to the HGO model which considers a single fiber family but incorporates fiber dispersion. Since then, it has been used to model other soft tissues including skin. The strain energy density function in this model
(19) |
where
(20) |
(21) |
and
(22) |
The parameters are the same as those used in the HGO model, except for the fiber dispersion parameter . The strain invariants, and are the same as defined in (4), with the single anisotropy direction .
Generalized Mooney Rivlin (MR)
Originally, the MR models was proposed to capture the mechanical response of rubber-like materials in large strains [36, 37]. There are a number of different formulations for MR models, here we use
(23) |
parameterized by , , .
Fung-type models
The strain energy function proposed by Fung et al. was one of the first models of soft tissue to capture the strain-stiffening anisotropy of collagenous tissue [20]. Unlike the previous material models, the strain energy proposed by Fung et al. is directly in terms of the Green-Lagrange strain, ,
(24) |
with parameters , , , . Note that equation (24) is a model of a two-dimensional material. However, since its original development, many generalizations of the model have been developed that extend its functionality to the third dimension. One such generalization is due to [38] and [39]
(25) |
with parameters and . In this text we use the 2D formulation for demonstration of convexity in Fig. 5 and 3D formulation for all other comparisons.
Figure 5:
Strain energy contours for synthetic data generated using (a) GOH and (c) 2D Fung type material models and (b), (d) the predictions of the N-ODE after training on these datasets, respectively. The training region corresponded to tensile biaxial deformations, as indicated, while the shaded region was not included during learning. When fed non-convex data, the N-ODE still produces a polyconvex output because it is built in the formulation.
Finite element implementation
A major motivation for the data-driven framework described here is to make it readily usable in large scale finite element simulations and realistic applications. Since the N-ODEs directly encode energy derivatives, it is straighforward to implement the N-ODE material model through the UANISOHYPER subroutine in the commercial finite element package Abaqus. The implementation is available in the Github link at the end.
3. Results
The model should estimate the stresses for arbitrary deformations of the material. In experimental mechanics of thin specimens such as skin, the most appropriate method to quantify material behavior is through biaxial tests in terms of the principal stretches, , . However, as outlined above, the model inputs are the invariants of the right Cauchy-Green deformation tensor, . In Fig. 2a we show how the input space for biaxial experiments maps into the input space of the N-ODEs, Fig. 2b,c. The protocols in Table 2 are shown as curves in 2a, the scatter points in shades of blue denote deformations in the feasible region of space, and the scatter points in gray denote infeasible regions. Points colored gray are not achievable during the testing of thin specimens like skin because they correspond to a compressive state under which thin membranes would buckle. The boundary between the feasible and infeasible regions are uniaxial tests in the and directions.
Figure 2:
Input space of the N-ODE in (a) space, space, and (c) space. Points in the input space where testing is not feasible are shown with grey markers, whereas feasible points are in shades of blue. The points are sampled from a uniform distribution within , . The boundary of the feasible region is marked by the uniaxial tests in and directions in space in (a). The map to the space of invariants is nonlinear and condenses the points into a narrow cone in the space in (b). Including anisotropy, the input domain for the data-driven models continues to show how evenly space points in the space are nonlinearly mapped to the domain of the N-ODEs.
The same points from Fig. 2a are mapped into the invariant space. A projection onto the space is depicted in Fig. 2b, and a projection onto is shown in Fig. 2c. The space , corresponds to the input space for the isotropic material behavior. The equibiaxial loading protocol falls in the middle of the feasible region in the space, but it is at the boundary of the space. The uniaxial loading cases, which determine the boundary of the feasible region in space also continue to bound the feasible domain in the space. Even though the testing points in Fig. 2a are evenly distributed in space, they form a narrow cone in the space. The map is highly nonlinear. Since the material model is a function of the invariants, the performance of data-driven models depends on sampling the invariant space, which is not necessarily achieved by simply covering evenly spaced points in space.
In Fig. 3 we illustrate the principle behind the N-ODE map. Points from the input space (translated invariants , i.e ) are integrated in time based on the neural network, which encodes the right-hand side of an ODE. The partial derivatives of the strain energy function are defined as the solution of a N-ODE at the fictitious time . Only the mappings for the invariants , , and are shown in Fig. 3a–d. The monotonicity of the transformation is evident in these figures, it follows from the fact that curves originating at different initial condition never intersect as they are integrated in time. Fig. 3e represents the direct relationship between input and output without the pseudo-time axis.
Comparisons against synthetic data
We start by training the N-ODE material model with synthetic stress-stretch data generated from the four closed-form analytical models introduced in section 2.2. We plot the training data as well as the predictions of the N-ODE for each of the four cases in Fig. 4. Even though the analytical models have each a completely different functional form, the same N-ODE architecture is capable of replicating the different synthetic datasets with high accuracy. The average absolute error is small in all cases. The highest error occurs for the 3D Fung model, this is partly due to the fact that the Fung model is not guaranteed to be polyconvex, which makes it impossible for the NODE to capture its response in regions where convexity is not satisfied. An example later on clarifies this further. The best fit of the N-ODE is for the Mooney-Rivlin material. Among the four synthetic datasets, the Mooney-Rivlin is the least nonlinear. The other three analytical models include an exponential term that describes a rapid strain stiffening typical of soft tissues. Concomitantly, the exponential term in these models poses challenges for model calibration due to the extreme sensitivity to parameters [26]. We also note that training takes on the order of 6 minutes on an Apple M1 CPU for each of the synthetic data sets.
Figure 4:
Synthetic training data and N-ODE predictions using (a) GOH, (b) Mooney Rivlin, (c) HGO and (d) 3D Fung type material models. The N-ODE is able to interpolate synthetic data from closed-form constitutive equations.
Next, we demonstrate how the N-ODE automatically satisfies polyconvexity of the strain energy. We train the N-ODE material model with synthetic data from GOH and Fung models. We show in Figs. 5a,c the predicted contours of strain energy of the two analytical models and the corresponding N-ODE approximations are in Figs. 5b,d. GOH is an inherently convex model [18], which is reflected in the contour of the energy as a function of the Green Lagrange strain components for parameters , , , , . (Fig. 5a). Naturally, the N-ODE trained with this model also predicts a convex function (Fig. 5b). The Fung model, on the other hand, is known to exhibit non-convex behavior for some choice of parameters, for instance , , , . The N-ODE model cannot interpolate these data, it can only find a convex approximation as seen in Fig. 5d.
Performance on experimental data
The primary objective of developing N-ODE material models is being able to capture experimental data without the restrictions of closed-form energy functions. The experimental data for this example consists of data from five biaxial tests as described in Table 2, performed on porcine skin specimens. We divide the data in two sets, train the data on three of the biaxial test data and validate against the remaining biaxial test data as shown in Fig. 6.
Figure 6:
Predictions of the N-ODE material model after training on porcine data. Test results from (a) equibiaxial, (b) Strip-x and (c) Strip-y testing protocols were used for training while (d) Off-x and (e) Off-y test results were reserved for validation.
The N-ODE reproduces the training data almost identically, with an average error of 0.012 MPa in the worst case scenario. The error in the validation tests is naturally higher than in the training set, but it is still in a reasonable range with average errors 0.088 MPa for off-x and 0.019 MPa for off-y data.
In Fig. 7, we train both the N-ODE and the four analytical models with data from a different porcine skin specimen subjected to same loading as described in Table 2. Similarly to before, we split the data into training and validation sets. The maximum principal stress predictions, , from each of the analytical models as well as the N-ODE model are plotted as the graph of a surface over the , plane (Fig. 7a–e). The experimental data are plotted as curves colored based on the absolute error between these curves and the predicted response surface. Fig. 7f shows a box plot of the error distribution for each model. None of the analytical models are capable of capturing the behavior of the material in the training set, whereas the N-ODE replicates the behavior in that region flawlessly. Errors naturally increase overall in the validation set. Yet, even in the validation set, the N-ODE has the lowest median errors. Note also that under high equibiaxial stretches, the stresses in GOH, HGO and Fung models increase to unreasonably high values. This is due to the exponential form of these analytical models. The N-ODE, on the other hand, maintains convexity but does not infer an exponential growth beyond the training region.
Figure 7:
Surfaces of maximum principal stress and curves with the experimental data colored based on error for (a) GOH, (b) Mooney Rivlin, (c) HGO, (d) Fung and (e) N-ODE. Box plots of error distribution for the five models (f).
Table 3 summarizes the performances of the five models from another series of tests. In each row of Table 3 we train the models on the first 80% of the loading path and test against the remaining 20%. The average error from the validation test is shown in the Table. In other words, these tests are done to measure the ability of each of the models to extrapolate away from the training region, toward larger deformations. The last row in Table 3 contains the average of the errors among the five individual tests. The N-ODE has the lowest error in three of the five cases and has the lowest average error over the five tests, at 0.048 MPa. The second best performing model, GOH, has 32% higher error relative to the N-ODE model. The Supplemental material shows one additional example in which we compare a much simpler N-ODE against the GOH model.
Table 3:
Mean absolute error, in MPa, for the validation set for the 4 analytical models and the N-ODE. Training/fitting is performed with the first 80% of data points of each loading protocol while the last 20% are held for validation.
Protocol | GOH | MR | HGO | 3D Fung | N-ODE |
---|---|---|---|---|---|
Off-x | 0.083 | 0.114 | 0.245 | 0.246 | 0.009 |
Off-y | 0.010 | 0.024 | 0.071 | 0.123 | 0.062 |
Equibiaxial | 0.037 | 0.100 | 0.141 | 0.035 | 0.031 |
Strip-x | 0.056 | 0.123 | 0.051 | 0.133 | 0.098 |
Strip-y | 0.120 | 0.095 | 0.051 | 0.194 | 0.038 |
Average | 0.062 | 0.091 | 0.112 | 0.146 | 0.048 |
Finite element simulations
The results from finite element simulations in Abaqus for a range of geometries and loading scenarios using the N-ODE material model are show-cased here.
First, three basic loading scenarios were applied to a 5 × 5 cm specimen as shown in Fig. 8 For a simple uniaxial tension test in the direction, unconstrained in the resulting stresses match the analytical evaluation (see Fig. (6), indicating that the subroutine is functioning as desired (Fig.8a).
Figure 8:
Finite element simulations using the N-ODE material model. Boundary conditions on the undeformed geometry are shown in the first column, deformed geometry in the second column, and contours of and on the deformed geometry are depicted in the last two columns for (a) uniaxial tension, (b) shearing and stretching, (c) torsional loading scenarios. The material is anisotropic, with the directions of anisotropy depicted as quiver plots in the reference geometries. The red vector field corresponds to the stiffer orientation , while the black vector field corresponds to .
In Fig. 8b the left side of the specimen is held fixed while the right side is displaced both in x and y directions, whereas in 8c a torsional loading is applied. For the shearing deformation we see a band of stress for the component, and concentration of the component at the corners but otherwise fairly uniform and small stresses since the upper and bottom surfaces are unconstrained. When a torque is applied in Fig. 8c, stress concentrations develop in the corners where there is the most deformations as expected from the clamping condition on the boundary. In all cases the simulations converged quadratically, demonstrating that the N-ODE model can be used as a constitutive equation in stable, nonlinear finite element simulations. Solution of the three models in Fig. 8 take roughly 31 sec, 49 sec and 2 min 35 sec, respectively. The same problems take 14 sec, 17 sec and 1 min 2 sec when the GOH material model is used instead.
Next, we performed tissue expansion simulations on 10 × 10 × 0.3 cm3 a patch of skin (Fig 9a). A rectangular expander of dimensions 8 × 8 cm2 underneath the skin is modeled by the fluid cavity feature in Abaqus. The expander is inflated to 20, 30 and 40 cm3 resulting in the strain contours shown in Fig. 8b and c.
Figure 9:
Finite element simulations of tissue expansion using the N-ODE material model in UANISOHYPER. Model setup (a), and contours of strain (b and c) on the deformed geometry after the expander is inflated to 20, 40 and 60 cm3, respectively.
Lastly, a surgical operation was simulated on the scalp of a cancer patient with a patient specific geometry reported in [40]. Figs.10 a–e show the model during various stages of surgery. Fig.10a shows the initial geometry of the model, Fig.10b–e show contours of maximum principal stress on the deformed geometry when the sutures near the ear are completed, and when the sutures on top of the cranium are brought together. Fig.10f. shows a photo of the patient’s scalp prior to surgery and Fig.10g a post-operative photo of the same region with the predicted contours of maximum principal stress superimposed.
Figure 10:
Virtual surgery finite element model performed in Abaqus using the N-ODE material model through the user defined UANISOHYPER subroutine. Initial geometry of the scalp (a), contours of maximum principal stress on the deformed geometry after the lower sutures are completed (b), and as the upper sutures are stretched to close the wound by 30%, 60% and 100% (c-e), respectively. Photographs of the scalp of the patient before the surgery (f) and after the surgery superimposed with contours of maximum principal stress (g).
4. Discussion
In this paper we present the first automatically polyconvex data-driven framework for constitutive modeling of nonlinear anisotropic materials. Most proposed methods either do not impose convexity or rely on additional loss terms to penalize deviation from convexity during model training [41, 12].
Our approach is general enough to approximate a large class of material models. Although there might be polyconvex strain energy functions that are not captured by our solution, they might not be needed in practice. In general, our approach can represent any degree of non-linearity for a given invariant and reproduce certain types of interactions between invariants. We could arbitrarily increase the complexity of the model by adding second order interactions (i.e. ) and by composing the resulting strain energy function with other convex non-decreasing functions. If we do this multiple times, we could create deep networks to approximate an even larger class of strain energy functions. However, from what we have seen in our experiments, the capacity of the functions presented in this work is enough to model the behavior of highly nonlinear and anisotropic materials, e.g. soft tissues.
The proposed framework is based on the principal invariants of the right Cauchy Green deformation tensor, which is commonplace in the development of anisotropic polyconvex strain energies [21]. However, it could be reformulated for a different set of basic invariants such as the principal stretches [42], or the invariants of the right stretch tensor [43, 44]. Specifically for anisotropy, we have restricted our attention to the pseudo-invariants and , which are polyconvex functions of the deformation that measure the square of the stretch along the directions and . Other anisotropic pseudo-invariants are possible. For example, as discussed in the Methods section, and are also polyconvex functions that can be used to capture energy associated with compression of the anisotropy directions [31]. We did not consider terms such as , because our data consisted of biaxial tests and because it is common to assume that fibers in soft tissues do not contribute to the energy under compression [19]. Incorporating additional directions of anisotropy, for instance based on some fiber probability distribution [45, 46], would be straightforward. The results with only two directions were already satisfactory and we did not include additional anisotropic pseudo-invariants for that reason.
Alternative conditions of convexity have been considered in the literature. Among them, convexity of the strain energy with respect to is often sought in constitutive models [41, 47]. However, the conditions under which convexity with respect to guarantees global minimizers to boundary value problems in nonlinear elasticity is not fully elucidated, see for example the recent work by Gao et al. [48]. As pointed out in [49] and [50], convexity with respect to and polyconvexity with respect to are not equivalent. Thus, in this work we focused on the condition of polyconvexity established by Ball [16].
Although the proposed approach works excellently for the cases we tested, we acknowledge that there are alternatives to arrive at similar results. For example, any invertible neural network architecture should work because a function that posses an inverse in one-dimensions is monotonic. N-ODEs are an elegant and efficient solution to this problem, because the gradient can be computed with a system of ODEs rather than back-propagation, which can be expensive for a deep neural network. We also note that there are other architectures to guarantee convexity of functions based on similar principles [51]. We think our method has two main advantages: first, one can directly incorporate domain knowledge by selecting the terms and interactions that are relevant rather than using all possible inputs (strain invariants and combinations). This approach also adds interpretability, as each of the fitted functions can be inspected and compared to existing models. Second, by approximating derivatives of the strain energy function rather than the strain energy function itself, we decrease the degree of differentiability of the neural network and can employ simpler activation functions, such as ReLU. Alternative methods to deal with the issues of differentiability are integrable neural networks or regularization terms during model training [41, 52]. Even though we do not have direct access to the strain energy function, the fact that the derivative functions are all one-dimensional enables the use of standard quadrature rules to efficiently integrate the strain energy if needed.
Lastly, we remark that in this paper we have focused on incompressible, and hyperelastic behavior. This is not a limitation of the framework but simply the application explored in detail here. The framework produces polyconvex strain energies applicable to a much wider set of problems. For example, consideration of compressible behavior would require a suitable volumetric strain energy function which can also be data-driven and convex. An example of a compressible material is given in the Supplement. Modeling of viscoelastic materials or elastoplastic deformations can also be formulated using data-driven polyconvex strain energies [53, 54, 55].
Conclusions
We present a data-driven framework to construct automatically polyconvex strain energy functions. The formulation is based on N-ODEs. We showcase the framework by using it to model the nonlinear, anisotropic, hyperelastic, and incompressible behavior of skin. The data-driven framework outperforms closed-form constitutive models. Our results foment the appeal of data-driven methods to better capture experimental material response without the constraints of analytical expressions for the strain energy while still satisfying the basic requirements of physically realistic models, i.e. the notion of polyconvexity. Additionally, the formulation is invariant and allows for the efficient computation of the second derivatives of the strain energy, which further enables the use of our data-driven framework in finite element simulations. We therefore anticipate that this work will further cement the use of data-driven methods in computational mechanics.
Supplementary Material
Acknowledgements
This work was supported by the National Institute of Arthritis and Musculoskeletal and Skin Diseases of the National Institute of Health under award R01AR074525.
Footnotes
Supplementary material
Code associated with this publication is available at a Github repository https://github.com/tajtac/NODE_v2
References
- [1].Ma J, Wu F, Zhu J, Xu D, Kong D, A pre-trained convolutional neural network based method for thyroid nodule diagnosis, Ultrasonics 73 (2017) 221–230. [DOI] [PubMed] [Google Scholar]
- [2].Guhathakurta P, Long-range monsoon rainfall prediction of 2005 for the districts and sub-division kerala with artificial neural network, Current Science 90 (2006) 773–779. [Google Scholar]
- [3].Casey AD, Son SF, Bilionis I, Barnes BC, Prediction of energetic material properties from electronic structure using 3d convolutional neural networks, Journal of Chemical Information and Modeling 60 (2020) 4457–4473. [DOI] [PubMed] [Google Scholar]
- [4].Duraisamy K, Iaccarino G, Xiao H, Turbulence modeling in the age of data, Annual Review of Fluid Mechanics 51 (2019) 357–377. [Google Scholar]
- [5].Lee J, Almond D, Harris B, The use of neural networks for the prediction of fatigue lives of composite materials, Composites Part A: Applied Science and Manufacturing 30 (1999) 1159–1169. [Google Scholar]
- [6].Lu L, Dao M, Kumar P, Ramamurty U, Karniadakis GE, Suresh S, Extraction of mechanical properties of materials through deep learning from instrumented indentation, Proceedings of the National Academy of Sciences of the United States of America 117 (2020) 7052–7062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Wang Z, Huan X, Garikipati K, Variational system identification of the partial differential equations governing microstructure evolution in materials: Inference over sparse and spatially unrelated data, Computer Methods in Applied Mechanics and Engineering 377 (2021) 113706. [Google Scholar]
- [8].Le BA, Yvonnet J, He Q-C, Computational homogenization of nonlinear elastic materials using neural networks, International Journal for Numerical Methods in Engineering 104 (2015) 1061–1084. [Google Scholar]
- [9].Liu M, Liang L, Sun W, A generic physics-informed neural network-based constitutive model for soft biological tissues, Computer Methods in Applied Mechanics and Engineering 372 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Peng GC, Alber M, Tepole AB, Cannon WR, De S, DuraBernal S, Garikipati K, Karniadakis G, Lytton WW, Perdikaris P, et al. , Multiscale modeling meets machine learning: What can we learn?, Archives of Computational Methods in Engineering (2020) 1–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Zhang X, Garikipati K, Machine learning materials physics: Multi-resolution neural networks learn the free energy and nonlinear elastic response of evolving microstructures, Computer Methods in Applied Mechanics and Engineering 372 (2020) 113362. [Google Scholar]
- [12].Tac V, Sree VD, Rausch MK, Tepole AB, Data-driven modeling of the mechanical behavior of anisotropic soft biological tissue, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Marsden JE, Hughes TJ, Mathematical foundations of elasticity, Courier Corporation, 1994. [Google Scholar]
- [14].Kuhl E, Askes H, Steinmann P, An illustration of the equivalence of the loss of ellipticity conditions in spatial and material settings of hyperelasticity, European Journal of Mechanics-A/Solids 25 (2006) 199–214. [Google Scholar]
- [15].Morrey CB Jr, Quasi-convexity and the lower semicontinuity of multiple integrals., Pacific journal of mathematics 2 (1952) 25–53. [Google Scholar]
- [16].Ball JM, Convexity conditions and existence theorems in nonlinear elasticity, Archive for rational mechanics and Analysis 63 (1976) 337–403. [Google Scholar]
- [17].Rindler F, Calculus of variations, volume 5, Springer, 2018. [Google Scholar]
- [18].Gasser TC, Ogden RW, Holzapfel GA, Hyperelastic modelling of arterial layers with distributed collagen fibre orientations, Journal of the royal society interface 3 (2005) 15–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Holzapfel GA, Gasser TC, Ogden RW, A new constitutive framework for arterial wall mechanics and a comparative study of material models, Journal of Elasticity 61 (2000) 1–48. [Google Scholar]
- [20].Fung YC, Fronek F, Patitucci P, Pseudoelasticity of arteries and the choice of its mathematical expression, American Journal of Physiology 237 (1979) H620–H631. [DOI] [PubMed] [Google Scholar]
- [21].Chagnon G, Rebouah M, Favier D, Hyperelastic energy densities for soft biological tissues: a review, Journal of Elasticity 120 (2015) 129–160. [Google Scholar]
- [22].Ehret AE, Itskov M, A polyconvex hyperelastic model for fiber-reinforced materials in application to soft tissues, Journal of Materials Science 42 (2007) 8853–8863. [Google Scholar]
- [23].Limbert G, Skin Biophysics: From Experimental Characterisation to Advanced Modelling, volume 22, Springer, 2019. [Google Scholar]
- [24].Jor JW, Parker MD, Taberner AJ, Nash MP, Nielsen PM, Computational and experimental characterization of skin mechanics: identifying current challenges and future directions, Wiley Interdisciplinary Reviews: Systems Biology and Medicine 5 (2013) 539–556. [DOI] [PubMed] [Google Scholar]
- [25].Mueller B, Elrod J, Distler O, Schiestl C, Mazza E, On the reliability of suction measurements for skin characterization, Journal of Biomechanical Engineering 143 (2021) 021002. [DOI] [PubMed] [Google Scholar]
- [26].Lee T, Turin SY, Gosain AK, Bilionis I, Tepole AB, Propagation of material behavior uncertainty in a nonlinear finite element model of reconstructive surgery, Biomechanics and Modeling in Mechanobiology 17 (2018) 1857–1873. [DOI] [PubMed] [Google Scholar]
- [27].Leng Y, Calve S, Tepole AB, Predicting the mechanical properties of fibrin using neural networks trained on discrete fiber network data, arXiv preprint arXiv:2101.11712 (2021). [Google Scholar]
- [28].Vlassis NN, Sun W, Sobolev training of thermodynamic-informed neural networks for interpretable elasto-plasticity models with level set hardening, Computer Methods in Applied Mechanics and Engineering 377 (2021). [Google Scholar]
- [29].Chen RTQ, Rubanova Y, Bettencourt J, Duvenaud D, Neural ordinary differential equations, 2019. [Google Scholar]
- [30].Schröder J, Anisotropic polyconvex energies, in: Poly-, quasi-and rank-one convexity in applied mechanics, Springer, 2010, pp. 53–105. [Google Scholar]
- [31].Schröder J, Neff P, Invariant formulation of hyperelastic transverse isotropy based on polyconvex free energy functions, International journal of solids and structures 40 (2003) 401–445. [Google Scholar]
- [32].Balzani D, Neff P, Schröder J, Holzapfel G, A polyconvex framework for soft biological tissues. Adjustment to experimental data, International Journal of Solids and Structures 43 (2006) 6052–6070. [Google Scholar]
- [33].He K, Zhang X, Ren S, Sun J, Deep residual learning for image recognition, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778. [Google Scholar]
- [34].Doyle T, Ericksen J, Nonlinear elasticity, volume 4 of Advances in Applied Mechanics, Elsevier, 1956, pp. 53–115. [Google Scholar]
- [35].Bradbury J, Frostig R, Hawkins P, Johnson MJ, Leary C, Maclaurin D, Necula G, Paszke A, VanderPlas J, Wanderman-Milne S, Zhang Q, JAX: composable transformations of Python+NumPy programs, 2018. [Google Scholar]
- [36].Mooney M, A theory of large elastic deformation, Journal of Applied Physics 11 (1940) [Google Scholar]
- [37].Rivlin RS, Large elastic deformations of isotropic materials iv. further developments of the general theory, Philosophical Transactions of the Royal Society A 241 (1948). [Google Scholar]
- [38].Chuong CJ, Fung YC, On Residual Stresses in Arteries, Journal of Biomechanical Engineering 108 (1986) 189–192. _eprint: https://asmedigitalcollection.asme.org/biomechanical/article-pdf/108/2/189/5665801/189_1.pdf. [DOI] [PubMed] [Google Scholar]
- [39].Deng SX, Tomioka J, Debes JC, Fung YC, New experiments on shear modulus of elasticity of arteries, American Journal of Physiology-Heart and Circulatory Physiology 266 (1994) H1–H10. [DOI] [PubMed] [Google Scholar]
- [40].Lee T, Turin SY, Stowers C, Gosain AK, Tepole AB, Personalized computational models of tissue-rearrangement in the scalp predict the mechanical stress signature of rotation flaps, The Cleft Palate-Craniofacial Journal 58 (2021) 438–445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Vlassis NN, Ma R, Sun W, Geometric deep learning for computational mechanics part i: anisotropic hyperelasticity, Computer Methods in Applied Mechanics and Engineering 371 (2020) 113299. [Google Scholar]
- [42].Shariff M, Physical invariants for nonlinear orthotropic solids, International journal of solids and structures 48 (2011) 1906–1914. [Google Scholar]
- [43].Steigmann DJ, On isotropic, frame-invariant, polyconvex strain-energy functions, Quarterly Journal of Mechanics and Applied Mathematics 56 (2003) 483–491. [Google Scholar]
- [44].Steigmann DJ, Frame-invariant polyconvex strain-energy functions for some anisotropic solids, Mathematics and mechanics of Solids 8 (2003) 497–506. [Google Scholar]
- [45].Lanir Y, A structural theory for the homogeneous biaxial stress-strain relationships in flat collagenous tissues, Journal of biomechanics 12 (1979) 423–436. [DOI] [PubMed] [Google Scholar]
- [46].Sacks MS, Biaxial mechanical evaluation of planar biological materials, Journal of elasticity and the physical science of solids 61 (2000) 199–246. [Google Scholar]
- [47].Holzapfel GA, Ogden RW, Constitutive modelling of passive myocardium: a structurally based framework for material characterization., Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 367 (2009) 3445–75. [DOI] [PubMed] [Google Scholar]
- [48].Gao DY, Neff P, Roventa I, Thiel C, On the convexity of nonlinear elastic energies in the right cauchy-green tensor, Journal of Elasticity 127 (2017) 303–308. [Google Scholar]
- [49].Sivaloganathan J, Spector SJ, On the uniqueness of energy minimizers in finite elasticity, Journal of Elasticity 133 (2018) 73–103. [Google Scholar]
- [50].Wilber JP, Walton JR, The Convexity Properties of a Class of Constitutive Models for Biological Soft Issues, Mathematics and Mechanics of Solids 7 (2002) 217–235. [Google Scholar]
- [51].Amos B, Xu L, Kolter JZ, Input convex neural networks, in: International Conference on Machine Learning, PMLR, pp. 146–155. [Google Scholar]
- [52].Teichert GH, Natarajan A, Van der Ven A, Garikipati K, Machine learning materials physics: Integrable deep neural networks enable scale bridging by learning free energy functions, Computer Methods in Applied Mechanics and Engineering 353 (2019) 201–216. [Google Scholar]
- [53].Bonet J, Gil AJ, Ortigosa R, A computational framework for polyconvex large strain elasticity, Computer Methods in Applied Mechanics and Engineering 283 (2015) 1061–1094. [Google Scholar]
- [54].Krishnan J, Steigmann DJ, A polyconvex formulation of isotropic elastoplasticity theory, IMA Journal of Applied Mathematics 79 (2014) 722–738. [Google Scholar]
- [55].Nordsletten D, Capilnasiu A, Zhang W, Wittgenstein A, Hadjicharalambous M, Sommer G, Sinkus R, Holzapfel GA, A viscoelastic model for human myocardium, arXiv preprint arXiv:2105.06671 (2021). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.