An improved parameter estimation and comparison for soft tissue constitutive models containing an exponential function

Ankush Aggarwal

doi:10.1007/s10237-017-0889-3

. 2017 Mar 1;16(4):1309–1327. doi: 10.1007/s10237-017-0889-3

An improved parameter estimation and comparison for soft tissue constitutive models containing an exponential function

Ankush Aggarwal ^1,^✉

PMCID: PMC5511618 PMID: 28251368

Abstract

Motivated by the well-known result that stiffness of soft tissue is proportional to the stress, many of the constitutive laws for soft tissues contain an exponential function. In this work, we analyze properties of the exponential function and how it affects the estimation and comparison of elastic parameters for soft tissues. In particular, we find that as a consequence of the exponential function there are lines of high covariance in the elastic parameter space. As a result, one can have widely varying mechanical parameters defining the tissue stiffness but similar effective stress–strain responses. Drawing from elementary algebra, we propose simple changes in the norm and the parameter space, which significantly improve the convergence of parameter estimation and robustness in the presence of noise. More importantly, we demonstrate that these changes improve the conditioning of the problem and provide a more robust solution in the case of heterogeneous material by reducing the chances of getting trapped in a local minima. Based upon the new insight, we also propose a transformed parameter space which will allow for rational parameter comparison and avoid misleading conclusions regarding soft tissue mechanics.

Keywords: Soft tissues, Biomechanics, Constitutive laws, Nonlinear elasticity, Parameter estimation, Inverse modeling

Introduction

Formulating an accurate constitutive law for soft tissues has been a contentious research topic for several decades (Maurel et al. 1998, chap. 4). Significant advances have been made since the seminal work by Fung and others, and a multitude of hyperelastic constitutive laws have been proposed for describing the stress–strain behavior of different soft tissues. These include myocardium (Humphrey and Yin 1987), arteries (Holzapfel et al. 2000), ligaments (Natali et al. 2003; Weiss and Gardiner 2001), heart valves (May-Newman and Yin 1998).

A common feature of many of the proposed constitutive laws is the presence of an exponential function. This stems from the classic study by Fung et al. (1972), which demonstrated that stiffness of the soft tissues is proportional to stress. The exponential nature has been shown to be a result of collagen fiber recruitment and rotation that happens at the microstructural level (Lanir 1983; Billiar and Sacks 2000). However, fiber-level mechanics entails mesoscale calculations leading to high computational cost. Therefore, the phenomenological models containing an exponential are seen as better suited for tissue- or organ-scale biomechanical studies.

With a wealth of insight available on the suitability of constitutive laws, focus has been increasing on using them within biomechanical models to predict the mechanical behavior of soft tissues. Hence, accurate determination of the elastic parameters involved in the stress–strain relationships is a critical step in such predictive modeling. Various ex vivo and in vitro testing methods have been developed, and recently, methods applicable to in vivo dataset are gaining more attention as they allow the elastic properties to be characterized in tissues’ native environment.

Exponential function results in a highly nonlinear stress–strain relationship, even if we discount the geometric nonlinearity due to large deformation. In spite of this marked difference between soft tissue constitutive laws and other commonly used strain energy functions, e.g., for linear and rubber-like elastic material, standard techniques are used for parameter estimation and comparison of soft tissues. We aim to study the effect of nonlinearity of an exponential on soft tissue constitutive laws and related elastic parameters, and seek to improve upon the existing standard methods.

This work is motivated by results from our recent study (Aggarwal and Sacks 2016), where an inverse model for bioprosthetic valve was developed. The study was designed to determine mechanical properties of a bioprosthetic valve leaflet by matching its deformed shape. The constitutive law contained an exponential function $σ \sim A e^{B ϵ}$ (although within a weighted integral and with neo-Hookean term), and the proposed framework included estimating two elastic parameters A and B (in the original work $c_{0}$ and $c_{1}$ were used, instead we use A and B for consistency in this manuscript). It was observed that the objective function contained a long and narrow valley in the parameter space (Fig. 1a), which resulted in a slow convergence of the inverse model, especially once the iterative solution entered the valley.

The objective function with a long narrow valley is reminiscent of Rosenbrock function used in optimization textbooks (Fig. 1b), which is a challenging minimization problem because of its shape (Rosenbrock 1960). Furthermore, the flat shape of the valley means that parameters along the valley generate closely similar stress–strain response. Thus, the two parameters A and B are highly covariant along this valley. On the other hand, changing parameters transverse to the valley dramatically affects the stress–strain response (Fig. 2). Therefore, a simple comparison of elastic parameters for tissue samples might present a grossly wrong picture. For example, parameters corresponding to points 1 and 5 are much farther apart compared to points 8 and 9 (Fig. 2). However, the former produce a much closer stress–strain response compared to the latter.

Fig. 2 — Parameters along the valley of the functional produce similar stress–strain behavior (*left*), whereas those across produce dramatically different response (*right*) (adapted from Aggarwal and Sacks 2016)

These observations raise multiple questions: did the valley shape result from that one particular problem or is it a general feature of soft tissues? Can we improve the convergence of parameter estimation? Is there a more rational way to compare the elastic parameters of two tissue samples? If only two parameters lead to a challenging parameter estimation, how will the problem behave in case of heterogeneous media (four or more parameters), where one cannot visualize the objective function? Here, we aim to answer these questions by using ideas from elementary algebra and analyzing multiple cases by extending those ideas.

Methods and cases considered

Before starting the analysis, we present implementation details of the various functions and biomechanical models studied here. The problems were divided into two categories: displacement-controlled (DC) and force-controlled (FC). In DC cases, the input variable was the deformation or strain, and stress or forces were fitted to estimate parameters. On the other hand, in FC cases, the input variable was the force or stress, and strain or deformation were fitted.

One-dimensional curve fitting

Starting in one dimension, the simplest function with an exponential is

\begin{matrix} σ (A, B, ϵ) = A e^{B ϵ} . \end{matrix}

Here $ϵ$ represents strain and is the input, while $σ$ is the output representing stress. Thus, (1) is a DC case, and the inverse of this relationship

\begin{matrix} ϵ (A, B, σ) = \frac{log (σ / A)}{B} \end{matrix}

is an FC case. Since the stress function (1) is not zero at $ϵ = 0$ , we also considered a more realistic representation of stress

\begin{matrix} σ (A, B, ϵ) & = A (e^{B ϵ} - 1) and its inverse \end{matrix}

\begin{matrix} ϵ (A, B, σ) & = \frac{log (σ / A + 1)}{B} . \end{matrix}

We considered a model with higher nonlinearity:

\begin{matrix} σ (A, B, ϵ) = A (e^{B ϵ^{2}} - 1) . \end{matrix}

Lastly, the exponential function is sometimes truncated and linearized beyond an upper bound on strain $ϵ_{ub}$ (Fan and Sacks 2014). This may represent the condition when all the fibers have been recruited, thus producing a linear stress–strain response thereafter. Accordingly, we considered the following model

\begin{matrix} σ (A, B, ϵ) \\ = \{\begin{matrix} A (e^{B ϵ} - 1) & for ϵ \leq ϵ_{ub} \\ A [(e^{B ϵ_{ub}} - 1) + B e^{B ϵ_{ub}} (ϵ - ϵ_{ub})] & for ϵ > ϵ_{ub} \end{matrix} \end{matrix}

The stress function (6) was defined such that both stress $σ$ and stiffness $\partial σ / \partial ϵ$ remain continuous at $ϵ = ϵ_{ub}$ . In all one-dimensional problems, the input was used to calculate output which was then matched to observed data in order to obtain the optimum parameters A and B. We call this process “curve fitting.”

Multi-dimensional curve fitting

Hyperelastic constitutive laws for soft tissues are generally defined in two or three dimensions, based upon deformation gradient $F$ (Holzapfel 2000). We studied two commonly used constitutive laws here; however, the results are expected to be extensible to most others. First is the Gasser–Ogden–Holzapfel (GOH) model (Gasser et al. 2006):

\begin{matrix} Ψ (A, B, F) = \frac{A}{2 B} (e^{BQ} - 1) + Ψ_{matrix} (F), \end{matrix}

where $Q = {(κ I_{1} + (1 - 3 κ) I_{4} - 1)}^{2}, I_{1} = tr (C)$ is the first invariant of $C = F^{T} F, I_{4} = C : N \otimes N$ is the fourth invariant with material direction vector $N$ , and $κ$ controls the “degree of anisotropy.” $Ψ_{matrix}$ represents the contribution from the ground matrix in the tissue, and it is modeled as a compressible neo-Hookean solid:

\begin{matrix} Ψ_{matrix} (F) = \frac{μ}{2} (I_{1} - 3) - μ log (J) + \frac{λ}{2} {(log (J))}^{2}, \end{matrix}

where $J = \sqrt{det (C)}$ represents the volume change.

Second is the simplified structural model (SM) (Fan and Sacks 2014):

\begin{matrix} Ψ (A, B, F) & = \int_{θ} Γ (θ) A (\frac{e^{B (N \cdot E \cdot N)} - 1}{B} - N \cdot E \cdot N) d θ \\ + Ψ_{matrix} (F), \end{matrix}

where $E = \frac{1}{2} (C - I)$ and $Γ (θ)$ is the fiber orientation function. $Γ (θ)$ is modeled as a truncated Gaussian in angle $θ \in [- π / 2, π / 2]$ with standard deviation (SD) $τ$ and peak at $θ = ω$ :

\begin{matrix} Γ (θ) = d_{e} \frac{1}{P} exp (- \frac{{(θ - ω)}^{2}}{2 τ^{2}}) + \frac{(1 - d_{e})}{π}, \end{matrix}

where $P = \int_{- π / 2}^{π / 2} exp (- \frac{{(θ - ω)}^{2}}{2 τ^{2}}) d θ$ normalizes the distribution.

If we assume that microstructural parameters, such as $κ, N, d_{e}, ω$ and $τ$ , and ground matrix properties are known, both constitutive models (7) and (9) have only two elastic parameters A and B to be determined. Even an approximation of the ground matrix elastic properties provides a good estimate of the overall mechanical behavior for many cases, such as bioprosthetic valves (Aggarwal and Sacks 2016). For both models, second Piola–Kirchhoff stress can be derived by the standard relation $S = 2 \frac{\partial Ψ}{\partial C}$ , and it is easy to see that $S \sim A e^{B (\cdot)}$ . In (7), the exponent is BQ, whereas in (9), we have an integral of exponentials. Thus, the two models represent two distinct classes of constitutive laws.

GOH (7) and SM (9) models were used in multi-dimensional curve fitting, where known deformation gradients were used to calculate the stresses. These stresses were then matched to the observed stresses in order to determine the elastic parameters. Since the known input was deformation and stress was the output, these curve fitting problems belong to the DC category. We note that for these constitutive laws, there is no closed form solution of the inverse relation, i.e., strain or deformation gradient as a function of stress. Therefore, multi-dimensional curve fitting could not be performed for the FC case.

Inverse models

For cases where an explicit relation from input to output is not available (such as the FC case for multi-dimensional problems) or where the input parameters vary spatially and cannot be represented using a single set of values, curve fitting cannot be performed. In such situations, it is more appropriate to solve an inverse model, where the observations are matched to the outcome of a finite element simulation. This finite element model uses a predetermined constitutive law, and we tested both GOH (7) and SM (9) models.

We considered two inverse modeling problems, and all of the finite element simulations were performed using FEBio (Maas et al. 2012). First problem is that of a biaxial testing of a thin planar tissue sample (Fig. 3), which was studied for both DC and FC cases. In the DC case, known uniform displacement boundary conditions were applied on the sample edge, and total reaction forces on the edges were matched to the observed values. On the other hand, in the FC case, known uniform forces were applied on the sample edge, and average edge displacements were matched to observed values. A detailed description of the setup, such as the sample size, boundary conditions, is specified in “Details of biaxial simulation.”

Fig. 3 — Simulation setup of the planar biaxial stretching of a square tissue sample, a variable microstructure and b described boundary/load conditions on a quadrilateral mesh

The second problem studied here is the shape matching of a semilunar tissue sample under static pressure loading (Fig. 4), which represents the closing of a bioprosthetic valve leaflet. Since the input for this problem is pressure traction, only FC case was possible. Details of the shape matching procedure were described in our previous work (Aggarwal and Sacks 2016) and are summarized in “Details of tissue pressurization simulation.” All of the problems considered in this study are summarized in Table 1.

Table 1.

Summary of all problems considered, indicating force-controlled (FC) and displacement-controlled (DC) cases

Problem	Model	Input	Output	Type
1D curve fitting	(1)	Strain $ϵ$	Stress $σ$	DC
	(3)	Strain $ϵ$	Stress $σ$	DC
	(5)	Strain $ϵ$	Stress $σ$	DC
	(6)	Strain $ϵ$	Stress $σ$	DC
	(2)	Stress $σ$	Strain $ϵ$	FC
	(4)	Stress $σ$	Strain $ϵ$	FC
Multi-D curve fitting	(9)	Deformation gradient $F$	$2 nd$ PK Stress $S$	DC
Multi-D curve fitting	(7)	Deformation gradient $F$	$2 nd$ PK Stress $S$	DC
Inverse modeling of biaxial setup	SM (9)	Displacement boundary conditions	Edge forces	DC
	GOH (7)	Displacement boundary conditions	Edge forces	DC
	SM (9)	Force on edges	Edge deformation	FC
	GOH (7)	Force on edges	Edge deformation	FC
Inverse modeling of shape matching	SM (9)	Pressure	Deformed shape	FC
Inverse modeling of shape matching	GOH (7)	Pressure	Deformed shape	FC

Open in a new tab

Generic notation

For all the cases considered, both DC and FC, we define a generic notation for the analysis. $\bar{f} (x_{i})$ denotes the observed data at independent input variable $x_{i}$ for $i = 1 \dots N$ , and $f (A, B, x_{i})$ denotes the model. A and B are the parameters to be determined by fitting $f (A, B, x_{i})$ to $\bar{f} (x_{i})$ . In the present context of soft tissue mechanics x denotes the input applied during testing, e.g., applied strain, displacement or load, $\bar{f}$ represents the observed output, e.g., stress, force, deformed shape or strain, and f(A, B, x) is the same quantity computed using our model. Since we are interested in functions of the form $f (A, B, x) \sim A e^{h (B, x)}$ or its inverse, A has units of stress and B is dimensionless.

To fit our model to the observed data, we define a functional $F = | | f - \bar{f} | |$ . The minimum point of this functional corresponds to the optimum parameters $\overset{˘}{A}, \overset{˘}{B} = {\arg \min}_{A, B} F$ , where f and $\bar{f}$ are “closest.” We use the norm $| | \cdot | |$ in a general sense, and two options were explored:

\begin{matrix} 2-norm: F & = | | f - \bar{f} {| |}_{2} \end{matrix}

\begin{matrix} log-norm: F & = | | f - \bar{f} {| |}_{log} = | | \bar{log} (f) - \bar{log} (\bar{f}) {| |}_{2} \end{matrix}

The 2-norm is the standard Euclidean norm, which $= \sum_{i} (f (x_{i}) - \bar{f} (x_{i}))^{2}$ for discrete input $x_{i}$ or $= \int_{0}^{X} (f (x) - \bar{f} (x))^{2} d x$ for continuous input $x \in (0, X)$ . The “log-norm” uses logarithm of f and $\bar{f}$ in the standard Euclidean norm. The logarithm function is denoted as $\bar{log}$ to clarify its modified form

\begin{matrix} \bar{log} (x) = \{\begin{matrix} log (x) & if x > 0 \\ x & if x \leq 0 \end{matrix} . \end{matrix}

In general, there could be constraints on A and B in our minimization problem. These constraints are physically motivated, for example from thermodynamics of strain energy density and convexity requirements for stress function. Here, we considered two constraints that both A and B must be positive.

In order to exclude the effect of noise, the observed data were generated synthetically for known values of the parameters $\bar{A}$ and $\bar{B}$ . Hence, we denote the observed data as $\bar{f} (x) = f (\bar{A}, \bar{B}, x)$ , where $\bar{A}$ and $\bar{B}$ are known a priori. Clearly, in this case, the global minimum of $F$ should occur at $\overset{˘}{A} = \bar{A}$ and $\overset{˘}{B} = \bar{B}$ . All the parameters to be determined are collectively denoted as $c$ , and the model f and data $\bar{f}$ at all inputs $x_{i}$ combined into a vector are denoted as $f$ and $\bar{f}$ , respectively.

Lastly, we define two curves in the (A, B) parameter space where one of the first derivatives of the functional $F$ vanishes:

\begin{matrix} A_{1}^{min} (B) : \overset{def}{=} {\frac{\partial F (A, B)}{\partial A}|}_{A_{1}^{min} (B), B} & = 0 and \end{matrix}

14a

\begin{matrix} A_{2}^{min} (B) : \overset{def}{=} {\frac{\partial F (A, B)}{\partial B}|}_{A_{2}^{min} (B), B} & = 0 . \end{matrix}

14b

We name them A-partial minima (APM) and B-partial minima (BPM) curves, respectively, or PM curves collectively. The two PM curves intersect at the global minimum $(\overset{˘}{A}, \overset{˘}{B})$ , and the shape and adjacency of the two curves help develop an intuitive and qualitative understanding of the analysis. Whenever possible, closed form expressions were evaluated for PM curves. For other problems, the PM curves were determined numerically. For APM, this was done by fixing B at various values and minimizing $F$ with respect to A, and vice versa for BPM.

Parameter estimation

In order to calculate the elastic parameters for soft tissues, a numerical optimization has to be performed to minimize the function $F (c)$ . We focus on gradient-based line search methods, which proceed iteratively in two steps: (1) determine a direction along which $F$ will decrease and (2) determine the step size to move along that direction (also known as line search). For calculating the direction in step 1, the simplest choice is the steepest descent direction $- \nabla F$ . However, it leads to extremely slow convergence for functionals with a narrow valley, such as the Rosenbrock function (Nocedal and Wright 2006). On the other hand, using the full Newton’s method

\begin{matrix} (\nabla^{2} F) Δ c = - \nabla F \end{matrix}

gives second-order convergence. However, it requires calculation of second derivatives for the Hessian $\nabla^{2} F$ and solving a linear system of equations (15), making it computationally expensive. For least-square functionals, such as the 2- and log-norms (11,12), their form allows a simpler approximation of the Hessian. If $J = \partial f / \partial c$ , then the functional gradient is $\nabla F = J^{T} (\bar{f} - f)$ and the first-order approximation of the Hessian is $\nabla^{2} F \approx J^{T} J$ . This approximation leads to the Gauss–Newton algorithm

\begin{matrix} (J^{T} J) Δ c = J^{T} (f - \bar{f}), \end{matrix}

which gives approximately second-order convergence and requires only the first derivative to be computed.

Comparing the different gradient-based methods for fitting the one-dimensional exponential function (1), we found that the Gauss–Newton algorithm performed the best. This is consistent with observations about the Rosenbrock function (Nocedal and Wright 2006). Henceforth, we used the Gauss–Newton algorithm (Algorithm 1) for all problems. For the line search in step 2, we used a simple backtracking algorithm, which took into account any constraints on the parameters and situations of failed $f$ calculation (e.g., due to non-convergence of the finite element solver). graphic file with name 10237_2017_889_Figa_HTML.jpg

Values of $TOL = 10^{- 10}$ and $δ = 10^{- 5}$ were used for all calculations. Convergence of nonlinear minimization can strongly depend on the initial guess or the starting point (denoted using subscript 0, $c_{0}$ , etc.). Therefore, for each problem, multiple minimizations were performed with starting points spanning the parameter space. Hence, $10 \times 10$ starting guesses were chosen uniformly distributed in the span of $A_{0} \in [0.005, 0.1]$ and $B_{0} \in [20, 100]$ . The resulting convergence statistics—number of iterations (#Iter), number of $f (c)$ evaluations (#Eval) and number of parallel evaluations (#Paral)—were reported as mean ± SD. The number of parallel evaluations was important since the central difference calculations for determining $J$ were done in parallel. However, during the line search, evaluations had to be performed sequentially, adding to the computational cost.

Noise

In order to study the effect of noise on the accuracy of the estimated parameters, we added random noise of varying magnitude to the target vector:

\begin{matrix} \bar{f} (x) = f (\bar{A}, \bar{B}, x) (1 + ν rand [- 1, 1]) . \end{matrix}

Here $ν$ represents the noise level, which was varied from 0.01 to 0.04, and the random number between $- 1$ and 1 had uniform probability. The effect of the noise was quantified by an error in the estimated optimum parameters:

\begin{matrix} e (ν) = \sqrt{{(\bar{A} - \overset{˘}{A})}^{2} + {(\bar{B} - \overset{˘}{B})}^{2}} . \end{matrix}

Clearly, $e (0) = 0$ since without noise the global minimum coincides with the true minimum.

Heterogeneous model

In all of the problems considered so far, we assumed that the elastic parameters did not vary over the tissue sample. However, heterogeneity is a common feature of biomechanical systems. In order to study the parameter estimation properties for a heterogeneous system, we considered the simplest problem of two materials in a biaxial testing setup. That is, the tissue sample is made of two types of tissues with two sets of elastic parameters— $(A_{1}, B_{1})$ and $(A_{2}, B_{2})$ (Fig. 5). The boundary and loading conditions remained the same as in the biaxial inverse model (“Details of biaxial simulation”). We considered only the SM constitutive law for both DC and FC cases. If both tissues in the sample have the same microstructural properties, then the system is symmetric, i.e., both $(A_{1}, B_{1}, A_{2}, B_{2})$ and $(A_{2}, B_{2}, A_{1}, B_{1})$ give exactly the same response. This leads to a singular Hessian whenever $A_{1} = A_{2}$ and $B_{1} = B_{2}$ . Therefore, in order to break this symmetry and make the Hessian non-singular, we used $τ_{1} = π / 6$ and $τ_{2} = π / 7$ .

Fig. 5 — For the heterogeneous model example, consider the situation where tissue sample is made up two materials—1 and 2 with elastic parameters $(A_{1}, B_{1})$ and $(A_{2}, B_{2})$ , respectively. $τ_{1} \neq τ_{2}$ is required to break the symmetry of the problem and make its Hessian non-singular everywhere

In total, there are four unknown elastic parameters— $A_{1}, B_{1}, A_{2}$ and $B_{2}$ , which leads to a four-dimensional (4D) parameter space. As a consequence, the functional cannot be visualized and scanning the entire parameter space for starting points becomes prohibitively expensive. Therefore, we only scanned a diagonal plane in that 4D space, where the starting parameters were the same for the two materials: $A_{1, 0} = A_{2, 0}$ and $B_{1, 0} = B_{2, 0}$ . Hence, $8 \times 8$ starting guess points were chosen which were uniformly distributed in the span of $log (A_{1, 0}) \in [- 4, - 2]$ and $B_{1, 0} \in [20, 100]$ . The parameter estimation was performed without adding any noise to the system.

Results

Analysis

Before carrying out parameter estimation, we analyze the properties of various models described in the previous section and the functional $F$ constructed for them. The analysis is divided into DC and FC cases.

Displacement-controlled cases

We start by looking at the simplest curve fitting involving an exponential function (1). Assuming that the “observed” data has no noise and that the number of observations is infinite, we have $\bar{σ} = \bar{A} e^{\bar{B} ϵ} \forall ϵ \in [0, E]$ . Here, $E > 0$ defines the maximum value of strain $ϵ$ at which the stress has been observed. Our model (1) needs to be fit to $\bar{σ}$ and, thereby, determine the optimum parameters A and B.

As a first step, we simply take our functional as the 2-norm of the difference between the observed data and model: $F = | | σ - \bar{σ} {| |}_{2} = \int_{0}^{E} {(A e^{B ϵ} - \bar{A} e^{\bar{B} ϵ})}^{2} d ϵ$ , which can be evaluated analytically (27) (details of all analytical derivations are in “Details of the analytical derivations”). Clearly, $F = 0$ at $A = \bar{A}$ and $B = \bar{B}$ , and $F > 0$ everywhere else. Thus, it has a global minimum at $(\bar{A}, \bar{B})$ , which corresponds to the true solution. Also, it can be easily verified that there are no other local minima in this functional. That is, $(\bar{A}, \bar{B})$ is the unique global minimum and $F$ is convex everywhere. Furthermore, this is a highly nonlinear functional, and its contour plot contains a long narrow valley similar to that observed in the inverse model of a bioprosthetic valve (Figs. 2, 6a). Hence, interestingly, curve fitting of a simple exponential function reproduces the behavior seen for a complex inverse modeling problem.

Fig. 6 — Functional for displacement-controlled (DC) cases plotted as a *contour*; global minimum is indicated using a *green circle*, and the PM curves are plotted using *colored lines*—APM (*blue*) and BPM (*red*). *Left and center column plots* are using 2-norm in (A, B) and ( $log (A), B$ ) space, respectively, while the *right column plots* are using log-norm

For this function, the APM and BPM curves can also be determined in closed form (29). These curves lie very close together in the valley of the functional (Fig. 6a). Since a point in the parameter space where both derivatives are zero represents the global minimum, the proximity of the two PM curves implies that both derivatives are approximately zero in the whole valley region. That is why, even though the functional is convex, parameters along the valley produce a similar response $σ (ϵ)$ , as observed previously (Fig. 2).

Based upon elementary algebra, for fitting an exponential function, it is significantly easier if the function is linearized by taking a logarithm before the 2-norm. That is, $F = | | log (σ) - log (\bar{σ}) {| |}_{2}$ . This leads to a quadratic functional in $log (A)$ and B (31), where both APM and BPM curves are straight lines (33) (Fig. 6c). Clearly, this norm also satisfies the condition of a unique global minimum at $(\bar{A}, \bar{B})$ . More importantly, the two PM curves are not close anymore and are visually distinct (Fig. 6c). This is reflected in the functional shape as an absence of a valley.

Since $σ > 0$ and $\bar{σ} > 0$ for this model for all strain values, the norm $F = | | log (σ) - log (\bar{σ}) {| |}_{2}$ is equivalent to the log-norm (12). There is one subtle difference between the 2-norm and log-norm functionals: the latter is defined in a transformed parameter space of $(log (A), B)$ instead of (A, B). Therefore, in order to objectively compare the two norms, we look at the 2-norm in the $(log (A), B)$ parameter space (Fig. 6b). In this case, the APM and BPM curves are still close together; however, the valley becomes relatively straight.

We extend these ideas to function (3), which is a better representation of stress–strain relation. With 2-norm, we obtain a similar curved valley in the (A, B) space (Fig. 6d), where the PM curves are even closer together. In the $(log (A), B)$ space using 2-norm, the PM curves and the valley are more straight (Fig. 6e). For the log-norm, modified log function (13) has to be used, since $σ$ goes to zero for $ϵ = 0$ making its log undefined. For (3) with log-norm, it is not possible to obtain analytical expressions for the functional and PM curves. Thus, these were determined numerically using representative values of $\bar{A} = 0.02$ and $\bar{B} = 45$ . Again, we observe disappearance of the valley, and the two PM curves become distinct for $B ⪆ 10$ (Fig. 6f).

We perform similar analysis for multi-dimensional constitutive laws using DC curve fitting. In this case, the functional and PM curves cannot be determined analytically for either of the norms, so numerical calculations were performed using SM and GOH models for the same representative parameter values ( $\bar{A} = 0.02$ and $\bar{B} = 45$ ). The resulting functionals behave very similar to the one-dimensional problems. The 2-norm has a similar valley which is curved in the (A, B) space and practically straight in the $(log (A), B)$ space, and the PM curves are extremely close in both cases (Fig. 6g–l)1. Using the log-norm, the PM curves become distinct and no valley is observed (Fig. 6i, l). Lastly, a small difference can be noticed between the 2-norm functionals for two constitutive laws: the PM curves for SM model are closer together as compared to the GOH model.

Force-controlled cases

Similar to the analysis of DC cases, we start with the simplest FC problem: the inverse of an exponential function (2). We assume that the observed data is of the same form without any noise, i.e., $\bar{ϵ} (σ) = (log (σ / \bar{A})) / \bar{B}$ , and that the observations were made continuously in the range $σ \in [1, Σ]$ . Here, $Σ$ is the maximum stress at which strain was observed. In order to fit our model to the observed data, as a first step, we take the 2-norm functional: $F = | | ϵ - \bar{ϵ} {| |}_{2}$ . This can be evaluated analytically and results in a rational functional (35). Similar to the DC case, a long and narrow valley is observed (Fig. 7a). Here also, we find that the PM curves (37) and (39) essentially overlap. Furthermore, the valley and PM curves become relatively straight in the $(log (A), B)$ space (Fig. 7b)—another similarity to the DC case.

Fig. 7 — Functional for force-controlled (FC) cases plotted as a *contour*; global minimum is indicated using a *green circle*, and the PM curves are plotted using *colored lines*—APM (*blue*) and BPM (*red*). All functionals are evaluated using 2-norm but plotted in different parameter spaces; from *left to right column*: (A, B), ( $log (A), B$ ), and ( $log (A) / B, 1 / B$ ) space, respectively

However, unlike the DC case, changing the norm to log-norm, or any other norm, does not make the model linear, and the valley shape of the functional persists. Instead, if the function (2) is rewritten as:

\begin{matrix} ϵ (σ) = \frac{1}{B} log (σ) - \frac{log (A)}{B}, \end{matrix}

it is easy to see that defining a new pair of parameters

\begin{matrix} α & = \frac{log (A)}{B} and \end{matrix}

20a

\begin{matrix} β & = \frac{1}{B}, \end{matrix}

20b

makes the strain $ϵ (σ) = β log (σ) - α$ linear in parameters $α$ and $β$ . Hence, the parameter estimation problem using 2-norm becomes a linear least-square problem. In other words, the transformation of parameters from A, B to $α, β$ makes the 2-norm functional $F (α, β) = | | ϵ (α, β) - \bar{ϵ} {| |}_{2}$ quadratic (Fig. 7c). Concomitantly, the PM curves (36) and (38) become straight lines that are distinct from each other.

This idea is extended to function (4), where, even though the transformation does not make the function exactly linear, it exhibits a similar behavior in the functional shape and PM curves (Fig. 7d–f). That is, in the original parameter space (A, B), the 2-norm functional has a narrow and curved valley. Transforming the parameter space to $(log (A), B)$ , the valley becomes straight, but the PM curves remain close together. Lastly, using the transformation $(log (A) / B, 1 / B)$ , the valley ceases to exist, and the PM curves become distinctly different.

For the FC cases, curve fitting cannot be performed for multi-dimensional constitutive models. Instead, we calculate the functional and PM curves for biaxial inverse model using SM. Even this multi-dimensional problem, which involves a highly complex stress–strain function, behaves very similar to the 1D problems. With 2-norm, we obtain a narrow and curved valley in the (A, B) space (Fig. 7g), which becomes straight in the $(log (A), B)$ space (Fig. 7h). In both of these cases the PM curves remain close together. However, in the $(log (A) / B, 1 / B)$ space we do not see any valley and the PM curves become distinct (Fig. 7i). The functional shape and PM curves can be useful indicators for the parameter estimation, as we see next.