A COMPUTATIONAL MEASURE THEORETIC APPROACH TO INVERSE SENSITIVITY PROBLEMS II: A POSTERIORI ERROR ANALYSIS

T BUTLER; D ESTEP; J SANDELIN

doi:10.1137/100785958

. Author manuscript; available in PMC: 2013 May 9.

Published in final edited form as: SIAM J Numer Anal. 2012 Jan 19;50(1):22–45. doi: 10.1137/100785958

A COMPUTATIONAL MEASURE THEORETIC APPROACH TO INVERSE SENSITIVITY PROBLEMS II: A POSTERIORI ERROR ANALYSIS^{^*}

T BUTLER ^†, D ESTEP ^‡, J SANDELIN ^§

PMCID: PMC3649878 NIHMSID: NIHMS391571 PMID: 23667271

Abstract

In part one of this paper [T. Butler and D. Estep, SIAM J. Numer. Anal., to appear], we develop and analyze a numerical method to solve a probabilistic inverse sensitivity analysis problem for a smooth deterministic map assuming that the map can be evaluated exactly. In this paper, we treat the situation in which the output of the map is determined implicitly and is difficult and/or expensive to evaluate, e.g., requiring the solution of a differential equation, and hence the output of the map is approximated numerically. The main goal is an a posteriori error estimate that can be used to evaluate the accuracy of the computed distribution solving the inverse problem, taking into account all sources of statistical and numerical deterministic errors. We present a general analysis for the method and then apply the analysis to the case of a map determined by the solution of an initial value problem.

Keywords: a posteriori error analysis, adjoint problem, density estimation, inverse sensitivity analysis, nonparametric density estimation, sensitivity analysis, set-valued inverse

1. Introduction

In part one of this paper [4], we develop and analyze a numerical method to solve an inverse stochastic sensitivity analysis problem for a smooth deterministic map. Namely, given a (probability) measure on the output of the map, compute the (probability) measure on the input space (comprising data and/or parameters) that produce the output measure. This is the stochastic version of the deterministic inverse problem for the map, and it is also the direct inversion of the forward stochastic sensitivity analysis problem for the map. As such, it deals directly with the inverse of the map in question, rather than, say, a statistical model of the output of the map.

In [4], we formulate this inverse problem using the law of total probability and then analyze an approximate solution method assuming that the map in question can be evaluated exactly. The solution method borrows heavily from techniques used inmeasure theory. The computed solution provides a systematic method for approximating the probability of any specified event in the input space.

However, our interest lies in situations in which the output of the map is determined implicitly and is difficult and/or expensive to evaluate, e.g., requiring the solution of a differential equation. In addition, we wish to consider the situation in which the measure on the output of the map is only described approximately using a finite number of samples. These practical discretization choices introduce additional numerical errors that affect the computed inverse distribution.

In this paper, we carry out an analysis of the effects of these two numerical discretizations on the computed inverse distribution. As a consequence of the analysis, we prove that the numerical error of the approximate parameter density computed from the algorithm for solving the inverse problem converges to zero as the discretization (both statistical and deterministic) converge to zero. But our main goal is the derivation of an a posteriori error estimate that can be used to evaluate the accuracy of the computed distribution solving the inverse problem, taking into account all sources of statistical and numerical deterministic errors.

While our particular interest is a numerical analysis of the method constructed for the inverse problem in [4], aspects of the analysis we present hold general interest. We present a general error analysis for a computed probability distribution accounting for the effects of finite sampling and errors in each sample resulting from evaluation of a numerically approximated map. Because of the application to inverse problems, we also require error estimates for gradients of quantities of interest computed from numerical solutions of ordinary differential equations, which again is of interest in other contexts, e.g., applications to optimization.

1.1. The inverse problem

As mentioned, the problem we study in [4] is a direct inversion of the forward stochastic sensitivity problem for a deterministic model. We consider an operator q(λ) that maps values in a parameter (and data) space Λ to an output space D. We assume there is a parameter volume measure μ_Λ on Λ that determines the volume of sets in Λ. The volume measure depends on the units of measure used for the parameters and also reflects the structural dependency among the parameters, e.g., depending on whether μ_Λ is a product measure. The volume measure is specified as part of the model that defines the map q(λ) since the parameters must be explicitly defined in the physical model that determines q. We assume that μ_Λ is absolutely continuous with respect to the Lebesgue measure and the volume V of Λ is finite.

The deterministic model can be expressed in terms of a likelihood function L(q|λ) of the output q values given the input parameter values λ, where L(q|λ) = δ(q–q(λ)) is the unit mass distribution at q = q(λ). If we specify a density σ_Λ(λ) on the parameter space Λ, then the law of total probability implies

ρ_{D} (q) = \int_{Λ} L (q ∣ λ) σ_{Λ} (λ) d μ_{Λ} (λ) .

(1.1)

The stochastic inverse sensitivity analysis problem that we study is the inversion of the integral equation (1.1). Namely, we assume that an observed probability density $ρ_{D} (q (λ))$ is given on the output value q(λ) and we seek to compute the corresponding parameter density σ_Λ(λ) that yields $ρ_{D} (q (λ))$ via (1.1).

1.2. The solution method

In [4], we present a computational measure theoretic algorithm to approximate the solution of the inverse problem (1.1) by a simplefunction σ_Λ_,M’(λ) with respect to a partition ${b_{i}}_{i = 1}^{M^{'}}$ of Λ. Paraphrasing the main result from [4], we have the following theorem.

Theorem 1.1. Given a measurable set A ⊂ Λ, we can approximate P(A) using the simple function

σ_{Λ, M^{'}} (λ) = \sum_{k = 1}^{M^{'}} \frac{P (b_{i})}{μ_{Λ} (b_{i})} 1_{b_{i}} (λ) .

(1.2)

The constructive proof yields a computational algorithm that generates a probability P(b_i) for each cell b_i, using only calculations of volumes in Λ. The main steps of the algorithm are based on the following observations:

The probability of an interval of output data [q_m, q_M) $\subset D$ is equal to the probability of the region generalized contours defined by A = q⁻¹([q_m, q_M)).
If $ρ_{D} (q)$ is constant on [q_m, q_M), then the probability of b∩A for any event b ⊂ Λ is equal to the probability of A times the ratio of volumes μ_Λ(b∩A)/μ_Λ(A).

Thus, the algorithm proceeds by first approximating $ρ_{D} (q)$ with a simple function, which induces regions of contours with probabilities defined by the approximate output density. Then, the ratios of volumes for each cell in the partition ${_{b_{i}}}_{i = 1}^{M^{'}}$ are computed with respect to all the induced regions of contours. From this, we obtain P(b_i) for each cell and obtain (1.2).

The main focus of the analysis in [4] is on the convergence of the approximate representation to the true representation on a given partition ${_{b_{i}}}_{i = 1}^{M^{'}}$ assuming that map the map is evaluated exactly.

1.3. Sources of error

In this paper, we analyze and estimate errors affecting the values {P(b_i)} in the representation (1.2) for a fixed partition ${_{b_{i}}}_{i = 1}^{M^{'}}$ . Since we fix the partition ${_{b_{i}}}_{i = 1}^{M^{'}}$ , we simplify the notation in [4] by dropping the hat on the piecewise-linear representation q̂(λ) of q(λ).

In particular, we consider two sources of error that affect the approximation of the representation σ_Λ_,M’(λ). The first is “statistical error” that arises if the observed probability density $ρ_{D} (q (λ))$ is known only through a finite collection of random samples. This type of error affects the left-hand side of (1.1). For example, finite sampling of the distribution of random variable q(λ) is used when the observed distribution is complicated to evaluate or when it is determined by experimental observations. Given an analytic, easy-to-evaluate distribution function for q(λ), we need not perform any sampling.

The second source of error arises when we use numerical approximations in the evaluation of the map q, e.g., as happens if q involves solving a differential equation. This means that we use approximate values of q and its gradient to form the approximate representation q̂(λ) ≈ q̂(λ). This source of error affects the evaluation of the likelihood function in (1.1).

In this paper, we present two kinds of error analysis. We give an a priori convergence analysis that shows that the error tends to zero as the discretization is refined. This analysis uses error bounds that are robust in the sense of holding under general conditions but which are generally orders of magnitude too large for particular computed solutions. Our main purpose in this paper is to give an a posteriori error analysis that provides the means to compute an relatively accurate error estimate on any particular computed solution. The latter result is important for the purposes of uncertainty quantification and for distributing computational resources in order to achieve a desired accuracy with efficiency.

We let F(t) denote the probability distribution on Λ that represents (1.2), where $t \in R^{d}$ , and

F (t) = P ({λ ∣ λ \leq t}) = P (λ \geq t) .

(1.3)

Here the inequality, λ ≤ t, is considered componentwise. We use F_q(t) to denote the probability distribution function of q(λ). To simplify the presentation, we assume b_i is contained in a region of contours A_i induced by the simple function approximation to $ρ_{D} (q)$ . If no sampling is used to evaluate $ρ_{D} (q)$ or F_q(t), then the algorithm yields

P (b_{i}) = F_{q} (q (b_{i})) \frac{\int_{b_{i}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)}, 1 \leq i \leq M^{'},

(1.4)

where q(b_i) = {q(λ), λ ∈ b_i}. (If b_i ⊄ A_i, then we alter (1.4) to sum over the regions of induced contours A_j such that b_i ∩ A_j ≠ ∅.) Using (1.4) in (1.3) gives

F (t) = \sum_{i = 1}^{M^{'}} F_{q} (q (b_{i})) \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} .

(1.5)

For the first source of error, we let F_q(t) denote a sample distribution function computed from a finite collection of error-free sample values {Q₁, ∆ , $Q_{N}$ },

F_{q, N} (t) = \frac{1}{N} \sum_{n = 1}^{N} 1 (Q_{n} \leq t) .

This leads to an approximation of $F_{N} (t) \approx F (t)$ defined

F_{N} (t) = \sum_{i = 1}^{M^{'}} F_{q, N} (q (b_{i})) \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} .

Next we consider the use of an approximation q̂(λ) ≈ q(λ), which leads to an error in computation of $F_{q, N} (t)$ . We define the approximate sample distribution function ${\tilde{F}}_{N} (t)$ as

{\tilde{F}}_{N} (t) = \sum_{i = 1}^{M^{'}} F_{q, N} (\tilde{q} (b_{i})) \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} .

(1.6)

We calculate probabilities using (1.6) and seek to determine the error $F (t) - {\tilde{F}}_{N} (t)$ . We decompose the error to get

(1.7)

2. General error analysis for a computed probability distribution

We begin by bounding $E_{S}$ . We conduct an a posteriori analysis similar to that used for nonparametric density estimation for elliptic problems with randomly perturbed diffusion coefficients in [13]. The error in the distribution is bounded by

E_{S} (t) \leq \sum_{i = 1}^{M^{'}} ∣ F_{q} (q (b_{i})) - F_{q N} (q (b_{i})) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} .

(2.1)

Using standard statistical arguments [13], for any ∊ > 0,

sup_{t \in R} ∣ F_{q} (t) - F_{q, N} (t) ∣ \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2}

(2.2)

with probability greater than 1 – ∊. It is possible to prove other forms of this bound [13]. Using (2.2) in (2.1) yields for any ∊ > 0,

E_{S} (t) \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2} \sum_{i = 1}^{M^{'}} \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ}}{\int_{A_{i}} d μ_{Λ} (λ)} \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2}

(2.3)

with probability greater than 1 – ∊.

For $E_{D}$ , we assume a bound or estimate E_i for the error in q̂(b_i) on each cell b_i. More precisely, the piecewise linear function q is defined on the partition {B_i} of Λ, where q(λ) = q(μ_i) + ∇q(μ_i)(λ – μ_i) on B_i and μ_i is a chosen value in B_i, and q̂(λ) = q̂(μ_i) + ∇q̂(μ_i)(λ – μ_i) on B_i. Hence the error has the form

q (λ) - \tilde{q} (λ) = (q (μ_{i}) - \tilde{q} (μ_{i})) + (\nabla q (μ_{i}) - \nabla \tilde{q} (μ_{i})) (λ - μ_{i}) on B_{i} .

(2.4)

Hence, we require estimates or bounds for the errors in both q̂(μ_i) and ∇q̂(μ_i), respectively. The derivation of the a priori bound or a posteriori error estimate is specific to a particular map q. In section 3, we derive the necessary estimates for nonlinear ordinary differential equations. Similar results hold for elliptic problems [5].

For convenience, we choose the fine partition {b_i} so that for each 1 ≤ i ≤ M’, b_i ⊂ B_j for some 1 ≤ j ≤ M. Thus, for all cells b_i ⊂ B_j for a fixed j, there is the same deterministic error term associated with q̂(b_i). We let E_j, 1 ≤ j ≤ M, denote the deterministic error associated with each q̂(b_i) for all b_i ⊂ B_j. Using an analogous argument as in [13],

\begin{matrix} E_{D} & \leq \sum_{i = 1}^{M^{'}} ∣ F_{q, N} (M_{i} + E) - F_{q, N} (m_{i} - E) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} \\ \leq \sum_{i = 1}^{M^{'}} ∣ F_{q} (M_{i} + E) - F_{q, N} (m_{i} + E) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} \\ + \sum_{i = 1}^{M^{'}} ∣ F_{q} (m_{i} - E) - F_{q, N} (m_{i} - E) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)} \\ + \sum_{i = 1}^{M^{'}} ∣ F_{q} (m_{i} + E) - F_{q} (m_{i} - E) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)}, \end{matrix}

where E = max_j |E_j|, M_i = max q(b_i), and m_i = min q(b_i).

Using (2.2) for the first two terms on the right-hand side of the inequality we have that for any ∊ > 0

E_{D} \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2} + \sum_{i = 1}^{M^{'}} ∣ F_{q} (M_{i} + E) - F_{q} (m_{i} - E) ∣ \frac{\int_{b_{i} \cap {λ \leq t}} d μ_{Λ} (λ)}{\int_{A_{i}} d μ_{Λ} (λ)}

with probability greater than 1 – ∊. Assuming Lipschitz continuity of the distribution F_q with constant L, for any ∊ > 0,

E_{D} \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2} + L max_{1 \leq i \leq M^{'}} (M_{i} - m_{i}) \times E

with probability greater than 1 – ∊.

Putting together the bounds yields the next theorem.

Theorem 2.1. For any ∊ > 0,

∣ F (t) - {\tilde{F}}_{N} (t) ∣ \leq 2 {(\frac{\log (∊^{- 1})}{2 N})}^{1 ∕ 2} + L max_{1 \leq i \leq M^{'}} (max q (b_{i}) - min q (b_{i})) \times E

(2.5)

with probability greater than 1 – ∊. If no sampling is used to evaluate $ρ_{D} (q)$ or F_q(t), then

∣ F (t) - \tilde{F} (t) ∣ \leq L max_{1 \leq i \leq M^{'}} (max q (b_{i}) - min q (b_{i})) \times E .

(2.6)

Note that F̂(t) is the distribution calculated using exact values of $ρ_{D} (q)$ but approximate values of q.

3. Application to nonlinear ordinary differential equations

We apply the general error analysis to a finite dimensional map q defined implicitly by the solution to a differential equation that depends on a finite number of parameters in the model. We consider the initial value problem

{\begin{matrix} \dot{y} = f (y; λ_{1}), & t > 0, \\ y (0) = λ_{0}, \end{matrix}

(3.1)

where $y \in R^{n}$ , $f : R^{n + p} \to R^{n}$ is smooth, and $λ = {(λ_{1}^{⊺}, λ_{0}^{⊺})}^{⊺} \in Λ \subset R^{d} (d = p + n)$ are the parameters. We solve (3.1) to calculate a linear functional of the solution, or a quantity of interest,

q (y) = \int_{0}^{T} 〈 y, ψ 〉 d t .

(3.2)

We assume that the solution y of (3.1) depends (implicitly) on parameters λ in a smooth way and denote solutions of (3.1) as y_λ and the quantity of interest as q(λ) to emphasize the implicit dependence of the quantity of interest on the parameters. The smooth dependence of solutions to (3.1) on parameters λ implies the dependence of the quantity of interest on λ is also smooth.

3.1. Construction of the piecewise-linear representation

Computing the gradient information is problematic for a differential equation. We use an adjoint equation and variational analysis to do this implicitly. We solve the initial value problem at a reference parameter value $μ = {(μ_{1}^{⊺}, μ_{0}^{⊺})}^{⊺}$ ,

{\begin{matrix} {\dot{y}}_{μ} = f (y_{μ}; μ_{1}), & t > 0, \\ y (0) = μ_{0}, \end{matrix}

(3.3)

where (y_μ, μ) is a reference point. We define the exact adjoint problem,

{\begin{matrix} - \dot{ϕ} - D_{y} f {(y_{μ}; μ_{1})}^{T} ϕ = ψ, & T > t \geq 0, \\ ϕ (T) = 0 . \end{matrix}

(3.4)

The following theorem [25] relates the value of q(λ) to q(μ) for λ near μ.

Theorem 3.1. If f(y; λ) is twice continuously differentiable with respect to both y and λ and Lipschitz continuous in both y and λ, then the quantity of interest is Fréchet differentiable at (y_μ, μ) with derivative $\nabla q (μ) : R^{d} \to R$ given by

\nabla q (μ) [λ] = 〈 (λ_{0} - μ_{0}), ϕ (0) 〉 + \int_{0}^{T} 〈 D_{λ_{1}} f (y_{μ}; μ) (λ_{1} - μ_{1}), ϕ 〉 d t .

(3.5)

Additionally,

q (λ) \approx q (μ) + \nabla q (μ) [λ] .

(3.6)

In the absence of numerical error,

q (λ) \approx \int_{0}^{T} 〈 y_{μ}, ψ 〉 d t + 〈 (λ_{0} - μ_{0}), ϕ (0) 〉 + \int_{0}^{T} 〈 D_{λ_{1}} f (y_{μ}; μ_{1}) (λ_{1} - μ_{1}), ϕ 〉 d t

(3.7)

for λ close to μ.

The global piecewise-linear approximation on the partition ${B_{i}}_{i = 1}^{M}$ of Λ is constructed by using the local linearization on each cell B_i to obtain

\hat{q} (λ) ≔ \sum_{i = 1}^{M} (q (μ_{i}) + 〈 \nabla_{q} (μ_{i}), (λ - μ_{i}) 〉) 1_{B_{i}} (λ),

(3.8)

where μ_i is the reference parameter value chosen in cell B_i.

3.2. Discretization

The a posteriori error estimate uses a variational analysis after introducing an adjoint problem. The variational analysis makes it natural to write the discretization method in the finite element framework. This is not restrictive as most common finite difference schemes can be written as a finite element method with a particular choice of quadrature for evaluating integrals.

A finite element method is based on the variational formulation of the differential equation. For the differential equation,

{\begin{matrix} \dot{x} = g (x, t), & 0 < t \leq T, \\ x (0) = x_{0}, \end{matrix}

(3.9)

the problem is to find $x \in C^{1} ([0, 1])$ such that

{\begin{matrix} \int_{0}^{t} (\dot{x}, v) d s = \int_{0}^{t} (g (x, t), v) d s, & t > 0, \\ x (0) = x_{0} \end{matrix}

(3.10)

for all $v \in C^{1} ([0, 1])$ . (We use g instead of f because there are several problems that have to be solved below.)

We compute a solution on the interval [0, T], and we discretize the interval 0 = t₀ < t₁ < … < t_N = T with time intervals I_j = (t_j–1, t_j) and time steps k_j = t_j–t_j–1. The finite element method produces a piecewise polynomial approximation. We use $P^{q} (I_{j})$ to denote the space of polynomials of degree q and less on time interval I_j and define the space of piecewise polynomials,

V^{(q)} = {U : U ∣ I_{j} \in P^{(q)} (I_{j}), j = 1, 2, 3, \dots,} .

We consider the discontinuous Galerkin (dGq) finite element method that produces an approximate solution $X \in V^{(q)}$ [18]. Since X may be discontinuous at time nodes, we define $X_{j = 1}^{+} = {lim}_{t ↓ t_{j - 1}} X (t), X_{j - 1}^{-} = {lim}_{t ↑ t_{j - 1}} X (t)$ , and ${[X]}_{j - 1} = X_{j - 1}^{+} - X_{j - 1}^{-}$ . The approximation is computed interval by interval. We set X₀ = x₀. Then we compute $X \in P^{q} (t_{j - 1}, t_{j})$ successively for j = 1, 2, … , N, satisfying

\int_{t_{j - 1}}^{t_{j}} (\dot{X}, v) d s + ({[X]}_{j - 1}, v_{j - 1}^{+}) = \int_{t_{j - 1}}^{t_{j}} (g (X, t), v) d s, for all v \in P^{q} (t_{j - 1}, t_{j}) .

(3.11)

Remark 3.1. If g(x, t) ≡ g(x) and q = 0, the dG0 approximation matches the backward Euler approximation at the time nodes. In general, we may obtain various difference schemes, e.g., the subdiagonal Pade schemes, by employing quadrature to evaluate the integrals in (3.11) [17, 27]. There is also a continuous Galerkin (cG) approximation that produces yet other classes of approximations [9]. We carry out the analysis below for the dG scheme assuming the integrals in (3.11) are computed exactly. The extension of the a posteriori analysis to handle quadrature and the cG method is straightforward [12] and we do not discuss this further.

3.3. The effect of using an approximate solution on the piecewise-linear representation

The main interest is in treating the effects of using a numerical approximation Y_μ in the linearization of the forward problem used to construct an adjoint. We define the approximate adjoint using (3.4) with “perturbed” operator D_yf(Y_μ; μ₁),

{\begin{matrix} - \dot{Φ} - D_{y} f {(Y_{μ}; μ_{1})}^{T} Φ = ψ, & T > t \geq 0, \\ Φ (T) = 0 . \end{matrix}

(3.12)

We assume f(y; λ) is twice continuously differentiable with respect to both y and λ, so that standard convergence results for Y_μ imply that over some (short) time interval [0, T],

{∥ D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1}) ∥}_{V} \leq K {∥ y_{μ} - Y_{μ} ∥}_{U},

(3.13)

where ∥·∥_V and ∥·∥_U are the L²([0, T]) norm of some appropriate matrix and vector norms of the arguments, respectively.

Let q̂(λ) denote the approximate quantity of interest calculated using (3.6) with Y_μ and Φ in place of y_μ and φ,

\tilde{q} (λ) = \int_{0}^{T} 〈 Y_{μ}, ψ 〉 d t + 〈 (λ_{0} - μ_{0}), Φ (0) 〉 + \int_{0}^{T} 〈 D_{λ_{1}} f (Y_{μ}; μ_{1}) (λ_{1} - μ_{1}), Φ 〉 d t

(3.14)

with error q(λ) – q̂(λ). Taking the difference of (3.7) and (3.14) gives

graphic file with name nihms-391571-f0002.jpg

(3.15)

Term I is a linear functional of the error y_μ – Y_μ and it can be estimated using standard a posteriori analysis techniques as described below. Term II measures theeffect of using Y_μ and Φ on the sensitivity of q(λ) to changes in the initial conditions. Term III measures the effect of using Y_μ and Φ on the sensitivity of q(λ) to changes in model parameters.

The terms II and III depend linearly on the vector λ – μ. The analysis below produces estimates that also depend on this vector linearly so that the error estimates for these terms are also linear functions of this vector. Thus, following the analysis described below for p linearly independent vectors λ – μ, we obtain a set of error estimates such that the error defined by II and III for any vector λ – μ can be written as a linear combination from this set of error estimates.

3.4. Convergence and order of accuracy

We can use straightforward a priori error analysis on (3.15) to show that |q(λ) – q̂(λ) converges at the same order as the dGq method over a short time period under the assumption that f is twice continuously differentiable.

3.5. Estimate of the error in a quantity of interest

We compute an a posteriori error estimate using variational analysis and adjoint problems [9, 18, 7, 8, 12, 27, 6]. We begin by recalling the a posteriori estimate of error in a quantity of interest. Let X denote the dGq approximation to (3.9) and let e = x – X, where x solves (3.9) exactly. We linearize around X in the sense of perturbing the operator to arrive at the adjoint problem

{\begin{matrix} - \dot{v} = {\bar{g^{'} (x, X)}}^{T} ϕ + ψ_{1} (t), & T > t \geq 0, \\ v (T) = ψ_{2}, \end{matrix}

(3.16)

where $\bar{g^{'} (x, X)} = \int_{0}^{1} g^{'} (s x + (1 - s) X, t)$ . For simplicity, we use g^’ for $\bar{g^{'} (x, X)}$ below. If ψ₁(t) ≡ 0, then the quantity of interest is (e(T), ψ₂). If ψ₂ = 0, then the quantity of interest is $\int_{0}^{T} (e (t), ψ_{i} (t))$ dt.

Assume ψ₁(t) ≡ 0 in (3.16). Take the inner product of the adjoint problem with e and integrate from 0 to T to obtain

- \int_{0}^{T} (\dot{v}, e) d t - \int_{0}^{T} ({(g^{'})}^{T} v, e) d t = 0 .

(3.17)

We decompose (3.17) into a sum of integral equations over each time interval in the discretization and integrate by parts over each interval to get

- \sum_{n = 1}^{N} (e, v) ∣_{t_{n - 1}}^{t_{n}} + \sum_{n = 1}^{N} \int_{t_{n - 1}}^{t_{n}} (v, \dot{e}) d t - \sum_{n = 1}^{N} \int_{t_{n - 1}}^{t_{n}} (v, g^{'} e) d t = 0 .

(3.18)

Since e = x – X might be discontinuous at the boundaries of each interval, we expand the first term on the right-hand side of (3.18) to

- \sum_{i = 1}^{N} (e, v) ∣_{t_{n - 1}}^{t_{n}} = (e (0), v (0)) + \sum_{n = 2}^{N} ({[X]}_{n - 1}, v_{n - 1}) - (e (T), v (T))

(3.19)

with _n–1 = ∂(t_n–1). Substitution of (3.19) into (3.18) and rearranging the terms yields

(e (T), v (T)) = (e (0), v (0)) + \sum_{n = 1}^{N} ({[X]}_{n - 1}, v_{n - 1}) + \sum_{n = 1}^{N} \int_{t_{n - 1}}^{t_{n}} (v, \dot{e}) d t - \int_{t_{n - 1}}^{t_{n}} (v, g^{'} e) d t .

Substituting e = x – X and using ẋ – g(x) = 0 gives

(e (T), v (T)) = (e (0), v (0)) + \sum_{n = 2}^{N} ({(X)}_{n - 1}, v_{n - 1}) - \sum_{n = 1}^{N} \int_{t_{n - 1}}^{t_{n}} (\dot{X} - g (X), v) d t .

(3.20)

Similarly, if ψ₂ = 0 and ψ₁(t) is nonzero for some t ∈ (0, T), we obtain

\int_{0}^{T} (e, ψ_{1}) d t = (e (0), v (0)) + \sum_{n = 2}^{N} ({[X]}_{n - 1}, v_{n - 1}) - \sum_{n = 1}^{N} \int_{t_{n - 1}}^{t_{n}} (\dot{X} - g (X), v) d t .

(3.21)

We summarize as the following theorem.

Theorem 3.2. A computable estimate of the error in a quantity of interest of (3.9) is obtained by solving (3.16) and computing either (3.20) or (3.21).

Implementation details

Using the a posteriori estimate involves several important practical considerations. We discuss two.

Often “Galerkin orthogonality” is used to introduce a projection of the adjoint solution into the approximation space for the forward solution. This makes the estimate easier to compute and has the effect of “localizing” the error contributions from each time step.

The estimate (3.20) is computable provided that we can compute the adjoint solution ∂. This raises several issues. The first is that we cannot use $\bar{g^{'} (x, X)}$ in practice since this requires the unknown solution x. Typically, we use $\bar{g^{'} (x, X)} \approx \bar{g^{'} (X, X)} = g^{'} (X)$ . The effect of this approximation on the computation of ∂ can be analyzed, e.g., [10]. The error depends on the accuracy of X, so typical short time error bounds can be proved. The second issue is that in practice we solve the adjoint problem using a numerical method, typically using a higher-order method than used for the forward solution.

The consequence is that in practice we use an approximate adjoint solution. We can alter the analysis below to take into the account the effect on the estimate, but this significantly complicates the presentation of the results while it is generally not significant.

3.6. Estimating term II in (3.15)

We first observe that term II is a linear functional of the error arising from solving the exact adjoint with the approximate adjoint. We adapt the a posteriori analysis to estimate the error of this approximation. We define the adjoint to the approximate adjoint as

{\begin{matrix} \dot{w} = D_{y} f (Y_{μ}; μ_{1}) w, & 0 < t \leq T, \\ w (0) = (λ_{0} - μ_{0}) . \end{matrix}

Since ẇ – D_y f(Yμ; μ) = 0, we have

\begin{matrix} 0 & = \int_{0}^{T} 〈 \dot{w} - D_{y} f (Y_{μ}; μ_{1}) w, (ϕ - Φ) 〉 d t \\ = [〈 w (T), (ϕ (T) - Φ (T)) 〉 - 〈 (w (0), (ϕ (0) - Φ (0))) 〉 - \int_{0}^{T} 〈 w, (\dot{ϕ} - \dot{Φ}) 〈 d t] - \int_{0}^{T} 〈 w, D_{y} f {(Y_{μ}; μ_{1})}^{T} (ϕ - Φ) 〉 d t \\ = - 〈 (λ_{0} - μ_{0}), (ϕ (0) - Φ (0)) 〉 + \int_{0}^{T} 〈 w, [- \dot{ϕ} - D_{y} f {(Y_{μ}; μ_{1})}^{T} ϕ + \dot{Φ} + D_{y} f {(Y_{μ}; μ_{1})}^{T} Φ] 〉 d t . \end{matrix}

This gives

II = \int_{0}^{T} 〈 w, [- \dot{ϕ} - D_{y} f {(Y_{μ}; μ_{1})}^{T} ϕ + \dot{Φ} + D_{y} f {(Y_{μ}; μ_{1})}^{T} Φ] 〉 d t .

(3.22)

By adding and subtracting D_yf(Y_μ; μ₁)^⊺ϕ to the differential equation in (3.4) for the exact adjoint, we have

- \dot{ϕ} - D_{y} f {(Y_{μ}; μ_{1})}^{T} ϕ = {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1})]}^{T} ϕ + ψ .

(3.23)

Substituting (3.23) into (3.22) and using (3.12), we have

II = \int_{0}^{T} 〈 w, {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1})]}^{T} Φ 〉 d t

(3.24)

+ \int_{0}^{T} 〈 w, {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1})]}^{T} (ϕ - Φ) 〉 d t .

(3.25)

We show that the second term on the right-hand side of the last equation is higherorder and estimate the first term on the right-hand side. If f(y; λ) is three times continuously differentiable, then we use Taylor’s theorem to get

graphic file with name nihms-391571-f0003.jpg

where J denotes the n × n identity matrix and the vector operator denoted vec is a map from $R^{l \times m} \to R^{l m}$ defined by stacking the columns (in order) of a matrix to form a column vector. We let

ψ_{I I} = {[D_{y} (vec (D_{y} f {(Y_{μ}; μ_{1})}^{T}))]}^{T} {[Φ^{T} \otimes J]}^{T} w .

The first term on the right-hand side is a linear functional of the error y_μ – Y_μ and can be estimated by Theorem 3.2.

We now show that the second term is higher-order. Let η = ϕ – Φ then

{\begin{matrix} - \dot{η} = D_{y} f {(y_{μ}; μ_{1})}^{T} ϕ - D_{y} f {(Y_{μ}; μ_{1})}^{T} Φ, & T > t \geq 0, \\ η (T) = 0 . \end{matrix}

(3.26)

If Y_μ is sufficiently close to y_μ over [0, T], then

D_{y} f {(Y_{μ}; μ_{1})}^{T} = D_{y} f {(y_{μ}; μ_{1})}^{T} + ∊ (t), t \in [0, T],

(3.27)

where ∊(t) is a perturbation matrix satisfying ∥∊(t)∥ ≤ C ∥y_μ – Y_μ∥ for some C > 0 and all t ∈ [0, T]. Substituting (3.27) into (3.26) gives

{\begin{matrix} - \dot{η} = D_{y} f {(y_{μ}; μ_{1})}^{T} η + ∊ (t) Φ (t), & T > t \geq 0, \\ η (T) = 0 . \end{matrix}

(3.28)

Let Σ(t) denote the fundamental matrix of (3.29); then

η (t) = - Σ (t) \int_{T}^{t} {[Σ (s)]}^{- 1} ∊ (s) Φ (s) d s .

This implies that

∥ η (t) ∥ \geq ∥ Σ (t) ∥ \int_{0}^{T} ∥ Σ {(s)}^{- 1} ∥ ∥ ∊ (s) ∥ ∥ Φ (s) ∥ d s \leq C {∥ y_{μ} - Y_{μ} ∥}_{U} .

(3.29)

Here, ∥·∥_U is interpreted as before to mean the L²([0, T]) norm of a given vector norm of the argument, and C > 0 is some constant that bounds the product of sup_t∈[0,T] ∥Σ(t)∥, sup_t∈[0,T] ∥Σ(t)⁻¹∥, and sup_t∈[0,T] ∥Φ(t)∥. Thus, by Lipschitz continuity of the first derivatives of f(y; λ) and (3.29),

∣ \int_{0}^{T} 〈 w, {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (y_{μ}; μ_{1})]}^{T} (ϕ - Φ) 〉 d t ∣ \leq C {∥ y_{μ} - Y_{μ} ∥}_{U}^{2} .

3.7. Estimating term III in (3.15)

Add and subtract 〈D_λf(Y_μ; μ₁)(λ₁ – μ₁),ϕ〉 and write III = IIIa + IIIb, where

\begin{matrix} IIIa & = \int_{0}^{T} 〈 (D_{λ} f (y_{μ}; μ_{1})) - D_{λ} f (Y_{μ}; μ_{1}) (λ_{1} - μ_{1}), ϕ 〉 d t, \\ IIb & = \int_{0}^{T} 〈 D_{λ} f (Y_{μ}; μ_{1}) (λ_{1} - μ_{1}), (ϕ - Φ) 〉 d t, \end{matrix}

and estimate IIIa and IIIb.

Estimating term IIIa. Add and subtract 〈(D_λf(y_μ; μ₁) – D_μf(Yμ; μ₁))(μ₁ – μ₁), Φ〉 and write IIIa = IIIaa + IIIab, where

\begin{matrix} IIIaa & = \int_{0}^{T} 〈 (D_{λ} f (y_{μ}; μ_{1}) - D_{λ} f (Y_{μ}; μ_{1})) (λ_{1} - μ_{1}), (ϕ - Φ) 〉 d t, \\ IIIab & = \int_{0}^{T} 〈 (D_{λ} f (y_{μ}; μ_{1}) - D_{λ} f (Y_{μ}; μ_{1})) (λ_{1} - μ_{1}), Φ 〉 d t . \end{matrix}

We show that IIIaa is higher-order. We know that ∥ϕ – Φ∥ ≤ C ∥y_μ – Y_μ∥_U for some constant C > 0; therefore

∣ IIIaa ∣ \leq C {∥ y_{μ} - Y_{μ} ∥}_{U}^{2}

for some constant C > 0.

Again assuming that f(y; λ) is three times continuously differentiable,

(D_{λ} f (y_{μ}; μ_{1}) - D_{λ} f (y_{μ}; μ_{1})) (λ_{1} - μ_{1}) \approx [{(λ_{1} - μ_{1})}^{T} \otimes J] [D_{y} (vec (D_{λ} f (Y_{μ}; μ_{1})))] (y_{μ} - Y_{μ}) .

We substitute this estimate into IIIab so that

\begin{matrix} IIIab & \approx \int_{0}^{T} 〈 [{(λ_{1} - μ_{1})}^{T} \otimes J] [D_{y} (vec (D_{λ} f (Y_{μ}; μ_{1})))] (y_{μ} - Y_{μ}), Φ 〉 d t \\ = \int_{0}^{T} 〈 (y_{μ} - Y_{μ}), {[D_{y} (vec (D_{λ} f (Y_{μ}; μ_{1})))]}^{T} {[{(λ_{1} - μ_{1})}^{T} \otimes J]}^{T} Φ 〉 d t . \end{matrix}

We let

ψ_{I I I a b} = {[D_{y} (v e c (D_{λ} f (Y_{μ}; μ_{1})))]}^{T} {[{(λ_{1} - μ_{1})}^{T} \otimes J]}^{T} Φ .

Thus, we have represented IIIab as a linear functional of the error in y_μ – Y_μ, which can be estimated by Theorem 3.2.

Estimating term IIIb. We let ψ_IIIb = D_λf(Y_μ, μ₁)(λ₁ – μ₁) so that

IIb = \int_{0}^{T} 〈 ψ_{I I b}, (ϕ - Φ) 〉 d t .

Thus, IIIb is a linear functional of the error in the adjoint solutions ϕ – Φ. We apply Theorem 3.2. We again define an adjoint to the approximate adjoint as

{\begin{matrix} \dot{z} - D_{y} f (Y_{μ}; μ_{1}) z = ψ_{I I b}, & 0 < t \leq T, \\ z (0) = 0 . \end{matrix}

We perform a standard variational argument to obtain

graphic file with name nihms-391571-f0004.jpg

Using (3.26)–(3.28) in the right-hand side above, we have

IIIb = \int_{0}^{T} 〈 z, {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1})]}^{T} Φ 〉 d t - \int_{0}^{T} 〈 z, {[D_{y} f (y_{μ}; μ_{1}) - D_{y} f (Y_{μ}; μ_{1})]}^{T} (ϕ - Φ) 〉 d t .

The two terms on the right-hand side are analagous to (3.24) and (3.25). The second term on the right-hand side has already been proved to be higher-order. Therefore, the second term is neglected in the estimate. The first term is estimated similarly to how (3.24) was estimated. We define

{\tilde{ψ}}_{I I I b} = {[D_{y} (vec (D_{y} f {(Y_{μ}; μ_{1})}^{T}))]}^{T} {(Φ^{T} \otimes J)}^{T} z,

and the first term is approximated by

\int_{0}^{T} 〈 {\tilde{ψ}}_{I I I b}, (Y_{μ} - Y_{μ}) 〉 d t,

which is a linear functional of the error of y_μ – Y_μ and is estimable by Theorem 3.2. This completes the proof of Theorem 3.3.

Theorem 3.3.

Let Y_μ and Φ denote the numerical solutions to the initial value problem (3.3) and the approximate adjoint problem (3.12), respectively.
Apply Theorem 3.2 to estimate termI.
Let p_m be the number of model parameters and pi the number of initial conditions (p_m + p_i = p)
For I = 1, … , p do
if i ≤ p_m then

Let z denote the solutions to the adjoint to the approximate adjoint problem

{\begin{matrix} \dot{x} - D_{y} f (Y_{μ}; μ_{1}) x = D_{λ} f (Y_{μ}, μ_{1}) δ_{i}, & 0 < t \leq T, \\ x (0) = 0, \end{matrix}

where δ_i denotes the ith standard basis vector in $R^{p m}$

Set

\begin{matrix} ψ_{I I I a b} & = {[D_{y} (vec (D_{λ} f (Y_{μ}; μ_{1})))]}^{T} {[δ_{i}^{T} \otimes J]}^{T} Φ, \\ {\tilde{ψ}}_{I I I b} & = {[D_{y} (vec (D_{y} f {(Y_{μ}; μ_{1})}^{T}))]}^{T} {[Φ^{T} \otimes J]}^{T} z . \end{matrix}

Solve (3.12) with data given by the above vectors and use Theorem 3.2 to compute the standard error representations given by

e_{1}^{i} ≔ \int_{0}^{T} 〈 (y_{μ} - Y_{μ}), ψ_{I I a b} 〉 d t, e_{2}^{i} ≔ \int_{0}^{T} 〈 {\tilde{ψ}}_{I I b}, (y_{μ} - Y_{μ}) 〉 d t .

else

Let w denote the solutions to the adjoint to the approximate adjoint problem

{\begin{matrix} \dot{x} - D_{y} f (Y_{μ}; μ_{1}) x - 0, & 0 < t \leq T, \\ x (0) = δ_{i}, \end{matrix}

where δ_i denotes the ith standard basis vector in $R^{p i}$

Set

ψ_{I I} = {[D_{y} (vec (D_{y} f {(Y_{μ}; μ_{1})}^{T}))]}^{T} {[Φ^{T} \otimes J]}^{T} w

Solve (3.12) with data given by the above vectors and use Theorem 3.2 to compute the standard error representations given by

e_{3}^{i} ≔ \int_{0}^{T} 〈 ψ_{I}, (y_{μ} - Y_{μ}) 〉 d t

end if end for Fix λ and set u := λ – μ, where

u = {(ν_{1} \dots, ν_{p m} ρ_{1} \dots ρ_{p i})}^{T} .

The error q(λ) – q̂(λ) is given by

q (λ) - \tilde{q} (λ) = e_{u} + h . o . t,

where e_u is the computable error estimate given by

e_{u} = e_{0} + \sum_{i = 1}^{p m} ν_{i} \sum_{j = 1}^{2} e_{j}^{i} + \sum_{i = 1}^{p_{i}} p_{i} e_{3}^{i} .

This theorem provides a means of computing the E_i required in Theorem 2.1. Set E_i = max_{λ∈B_i} e_u. For convex polygonal cells B_i, computation of E_i is straightforward since e_u is a linear function of λ, so the maximum occurs on the boundary.

4. Examples

We present examples that illustrate the properties of the computable a posteriori error estimate. In the case of deterministic computations, it is standard to test the accuracy of the estimate by direct comparison to the error on problems for which the actual error is known or can be approximated using an extremely accurate reference solution. However, it is more complicated to test accuracy for the estimates we have derived for stochastic computations because of the nature of the a posteriori bound we use for the stochastic component of the error.

We have explored the accuracy of the a posteriori bound on the effects of finite sampling in [13, 14]. Likewise, the accuracy of the a posterior error estimate for deterministic problems is well recorded; e.g., see [9, 18, 7, 8, 12] and many research papers. We do not repeat tests on these aspects here. Rather, in Examples 1 and 2 we explore the accuracy of the a posteriori error estimates on errors in computed derivatives that are needed for the estimate on the solution of the inverse problem. These estimates are new so their properties have not been explored in the literature. Finally, in Example 3 we present an example in which we check the a posteriori estimate against a direct approximation of the error.

In all the examples, the numerical solution and error estimates are computed using GAASP.¹ We use a first-order discontinuous Galerkin (dG1) method for the forward solve and a second-order continuous Galerkin (cG2) method for all of the adjoint solves. We use the adaptive time step capability in GAASP to control the numerical integration error. In the first two examples, we terminate time step refinement once the error estimate corrects the estimated gradient by less than 10%.

4.1. Example 1

The first example is a coupled linear system with four parameters:

{\begin{matrix} {\dot{x}}_{1} = \frac{x_{1}}{λ_{1} (1 + t)} - λ_{2} t x_{2}, & 0 < t \leq T, \\ {\dot{x}}_{2} = \frac{x_{2}}{λ_{3} (1 + t)} + λ_{4} t x_{1}, & 0 < t \leq T, \\ x_{1} (0) = x_{1}, 0, \\ x_{2} (0) = x_{2}, 0 . \end{matrix}

(4.1)

The adjoint problem is

{\begin{matrix} - {\dot{ϕ}}_{1} - \frac{1}{λ_{1} (1 + t)} ϕ_{1} + λ_{4} t ϕ_{2} = ψ_{1}, & T > t \geq 0, \\ - {\dot{ϕ}}_{2} + λ_{2} t ϕ_{1} - \frac{1}{λ_{3} (1 + t)} ϕ_{2} = ψ_{2}, & T > t \geq 0, \\ ϕ_{1} (T) = ϕ_{1}, T, \\ ϕ_{2} (T) = ϕ_{2} T . \end{matrix}

(4.2)

Computing the true errors requires knowledge of the exact ϕ. To this end, we choose ψ(t) = (ψ₁, ψ₂)^⊺ and ϕ(T) = (ϕ_1,T, ϕ_2,T)^⊺ so that ϕ(t) = (t, 1)^⊺.

In this linear example, II = 0. We report the error estimates for term III. We take μ = (2, 2, 2, 2)^⊺, so $y_{μ} = {(\sqrt{1 + t} cos (t^{2}), \sqrt{1 + t} sin (t^{2}))}^{⊺}$ .

We consider both T = 3 and T = 10. We plot the forward solutions y_μ and Y_μ for T = 3 with two different time steps in Figure 4.1. Table 4.1 shows the error estimate results for T = 3. Since the computed error estimates tend to be accurate, we can often compute a corrected gradient by adding the error estimate to the computed (estimated) gradient. We see improvement in the corrected gradient by comparing the fourth and last columns of Table 4.1. At T = 3, Y_μ is a good approximation of y_μ at the coarse time step of 0.2 as seen in Figure 4.1, so the second derivative calculations involving Y_μ used in the error estimates produce accurate error estimates beginning at this time step.

Fig. 4.1 — Solutions to (4.1) for T = 3. Left two plots: Y_μ,1 and Y_μ,2 with a time step of 0.2. Right two plots: Y_μ,1 and Y_μ,2 with a time step of 0.1. The dotted lines indicate the corresponding exact solutions y_μ,1 and y_μ,2 evaluated on the same time mesh as the dashed-lined numerical approximations Y_μ,1 and Y_μ,2.

Table 4.1. Example 1 results for the three partial derivatives of the solution at T = 3.

Δ t	true ∂_1q	est. ∂_1q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_1q+Est. true ∂_1q

0.2	1.71E – 01	1.62E – 01	8.97E – 03	9.43E – 03	−7.53E – 07	9.43E – 03	1.052	1.00
0.1	1.71E – 01	1.69E – 01	1.58E – 03	1.47E – 03	−7.32E – 07	1.47E – 03	0.931	0.999
0.05	1.71E – 01	1.70E – 01	2.21E – 04	1.97E – 04	2.58E – 06	1.99E – 04	0.904	1.00

Δ t	true ∂_2q	est. ∂_2q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_2q+Est. true ∂_2q

0.2	−3.17E + 00	−2.95E + 00	−2.27E – 01	−1.76E – 01	1.07E – 06	−1.76E – 01	0.776	0.984
0.1	−3.17E + 00	−3.14E + 00	−2.91E – 02	−2.29E – 02	−7.29E – 07	−2.29E – 02	0.788	0.998
0.05	−3.17E + 00	−3.17E + 00	−3.54E – 03	−2.81E – 03	−9.58E – 06	−2.82E – 03	0.796	1.00

Δ t	true ∂_3q	est. ∂_3q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_3q+Est. true ∂_3q

0.2	5.36E – 01	5.30E – 01	5.82E – 03	4.40E – 03	−4.19E – 07	4.40E – 03	0.757	0.997
0.1	5.36E – 01	5.35E – 01	7.24E – 04	5.59E – 04	−6.41E – 07	5.58E – 04	0.771	1.00
0.05	5.36E – 01	5.36E – 01	8.69E – 05	6.77E – 05	9.55E – 08	6.78E – 05	0.780	1.00

Δ t	true ∂_4q	est. ∂_4q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_4q+Est. true ∂_4q

0.2	2.78E – 01	2.45E – 01	3.27E – 02	3.53E – 02	7.92E – 07	3.53E – 02	1.08	1.01
0.1	2.78E – 01	2.72E – 01	5.92E – 03	5.58E – 03	−1.09E – 06	5.57E – 03	0.941	0.999
0.05	2.78E – 01	2.77E – 01	8.36E – 04	7.50E – 04	−8.04E – 06	7.41E – 04	0.887	1.00

Open in a new tab

We plot the forward solutions y_μ and Y_μ for T = 10 with four different time steps in Figures 4.2 and 4.3. The oscillations of y_μ increase in magnitude and with higher frequency as time increases. As seen in Table 4.2, when the error estimate is of the same order of magnitude as the estimated gradient, the estimate cannot be used to correct the gradient.

Fig. 4.2 — Solutions to (4.1) for T = 10. Left two plots: Y_μ,1 and Y_μ,2 with a time step of 0.2. Right two plots: Y_μ,1 and Y_μ,2 with a time step of 0.1. The dotted lines indicate the corresponding exact solutions y_μ,1 and y_μ,2 evaluated on the same time mesh as the dashed-lined numerical approximations Y_μ,1 and Y_μ,2.

Fig. 4.3 — Solutions to (4.1) for T = 10. Left two plots: Y_μ,1 and Y_μ,2 with a time step of 0.05. Right two plots: Y_μ,1 and Y_μ,2 with a time step of 0.025. The dotted lines indicate the corresponding exact solutions y_μ,1 and y_μ,2 evaluated on the same time mesh as the dashed-lined numerical approximations Y_μ,1 and Y_μ,2.

Table 4.2. Example 1 results for the three partial derivatives of the solution at T = 10.

Δ t	true ∂_1q	est. ∂_1q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_1q+Est. true ∂_1q

0.2	–1.35E – 02	6.30E – 02	–7.66E – 02	9.59E – 03	2.20E – 05	9.61E – 03	–0.126	–5.36
0.1	–1.35E – 02	5.75E – 02	–7.10E – 02	–1.07E – 01	–5.60E – 06	–1.07E – 01	1.51	3.66
0.05	–1.35E – 02	9.58E – 03	–2.31E – 02	–2.36E – 02	3.40E – 06	–2.36E – 02	1.02	1.04
0.025	–1.35E – 02	–9.20E – 03	–4.39E – 03	–3.91E – 03	–9.80E – 06	–3.92E – 03	0.893	0.965

Δ t	true ∂_2q	est. ∂_2q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_2q+Est. true ∂_2q

0.2	1.40E + 01	–3.39E – 01	1.44E + 01	1.26E + 01	2.07E – 04	1.26E + 01	0.876	0.873
0.1	1.40E + 01	–6.37E – 01	1.47E + 01	5.82E + 00	–3.24E – 04	5.82E + 00	0.397	0.370
0.05	1.40E + 01	7.68E + 00	6.34E + 00	4.85E + 00	–2.87E – 04	4.85E + 00	0.765	0.894
0.0250	1.40E + 01	1.30E + 01	9.91E – 01	7.88E – 01	–2.53E – 04	7.87E – 01	0.794	0.985

Δ t	true ∂_3q	est. ∂_3q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_3q+Est. true ∂_3q

0.2	4.51E – 01	4.63E – 01	–1.29E – 02	–1.14E – 02	2.18E – 05	–1.14E – 02	0.882	1.02
0.1	4.51E – 01	4.64E – 01	–1.32E – 02	–5.13E – 03	–9.29E – 06	–5.14E – 03	0.388	1.02
0.05	4.51E – 01	4.56E – 01	–5.73E – 03	–4.37E – 03	–2.20E – 06	–4.37E – 03	0.763	1.00
0.025	4.51E – 01	4.51E – 01	–8.98E – 04	–7.10E – 04	–8.74E – 06	–7.19E – 04	0.801	1.00

Δ t	true ∂_4q	est. ∂_4q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_4q+Est. true ∂_4q

0.2	–9.52E – 01	–1.16E – 01	–8.37E – 01	1.11E – 01	2.18E – 04	1.11E – 01	–0.133	1.36
0.1	–9.52E – 01	–1.78E – 01	–7.75E – 01	–1.17E + 00	–3.28E – 04	–1.18E + 00	1.52	0.458
0.05	–9.52E – 01	–7.01E – 01	–2.51E – 01	–2.58E – 01	–2.83E – 04	–2.58E – 01	1.03	1.01
0.025	–9.52E – 01	–9.04E – 01	–4.81E – 02	–4.27E – 02	–2.52E – 04	–4.29E – 02	0.892	0.995

Open in a new tab

4.2. Example 2

The second example is a nonlinear problem with two parameters:

{\begin{matrix} \dot{x} = - (λ_{1} + \sin (λ_{2} t)) x^{2}, & 0 < t \leq T, \\ x (0) = x_{0} . \end{matrix}

(4.3)

We set μ = (−0.1, 20)^⊺ so y_μ(t) = 20/ (20 + 1 + (−0.1)20t – cos(20t)). The quantity of interest is y(T ). The adjoint problem is

{\begin{matrix} - \dot{ϕ} + 2 y_{μ} (- 0.1 + \sin (20 t)) ϕ = 0, & T > t \geq 0, \\ ϕ (T) = 1 . \end{matrix}

(4.4)

The solution to (4.4) is ϕ(t) = C(20 + 1 + 20(−0.1)t – cos(20t))², where C is chosen so ϕ(T ) = 1. Since (4.3) is nonlinear, we report the error estimates for both terms II and III.

We show results for both T = 3.9 and T = 10. We plot the forward solutions y_μ and Y_μ and adjoint solutions ϕ and Φ for T = 3.9 and T = 10 with three different time steps in Figures 4.4 and 4.5, respectively. Tables 4.3 and 4.4 show the error estimate results for T = 3.9 and T = 10, respectively.

Fig. 4.4 — Solutionsto (4.3) for T = 3.9 with a time step of 0.3 (left), 0.15 (middle), and 0.075 (right). Top plots: Y_μ and y_μ. Bottom plots: Φ and φ. The dotted lines indicate the corresponding exact solutions y_μ and φ evaluated on the same time mesh theY as dashed-lined numerical approximations μ and Φ.

Fig. 4.5 — Solutions to (4.3) for T = 10 with a time step of 0.04 (left), 0.02 (middle), and 0.01 (right). Top plots: Y_μ and y_μ for time interval [9, 10]. Bottom plots: Φ and ϕ. The dotted lines indicate the corresponding exact solutions y_μ and ϕ evaluated on the same time mesh as the dashed-lined numerical approximations Y_μ and Φ.

Table 4.3. Example 2 results for the two partial derivatives of the solution (upper) and the adjoint solution (lower) at T = 3.9.

Δ t	true ∂_1q	est. ∂_1q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_1q+Est. true ∂_1q

0.3	−7.89E + 00	−4.42E + 00	−3.47E + 00	−4.06E + 00	−1.49E + 00	−5.55E + 00	1.60	1.26
0.15	−7.89E + 00	−9.32E + 00	1.43E + 00	5.88E – 01	8.18E – 01	1.41E + 00	0.986	1.00
0.075	−7.89E + 00	−8.11E + 00	2.12E – 01	8.05E – 02	1.01E – 01	1.82E – 01	0.858	1.00

Δ t	true ∂_2q	est. ∂_2q	Error	IIIab	IIb	Estimate	Error Estimate	est. ∂_2q+Est. true ∂_2q

0.3	−1.94E – 01	2.26E – 01	−4.20E – 01	9.44E – 01	1.09E + 00	2.03E + 00	−4.83	−11.7
0.15	−1.93E – 01	5.62E – 02	−2.50E – 01	−1.91E – 01	−5.68E – 02	−2.48E – 01	0.992	0.990
0.075	−1.93E – 01	−1.78E – 01	−1.55E – 02	−4.55E – 03	−1.44E – 02	−1.90E – 02	1.22	1.02

Δ t	φ(0)	Φ(0)	Error	II	Error Estimate	Φ(0)+ Est. φ(0)

0.3	2.02E + 00	2.49E + 00	−4.64E – 01	1.19E + 00	−2.57	1.82
0.15	2.02E + 00	2.35E + 00	−3.31E – 01	−3.54E – 01	1.07	0.988
0.075	2.02E + 00	2.07E + 00	−5.03E – 02	−4.93E – 02	0.981	1.00

Open in a new tab

Table 4.4. Example 2 results for the two partial derivatives of the solution (upper) and the adjoint solution (lower) at T = 10.

Δ t	∂_1q (true)	∂_1q (est)	∂_1q (est) ∂_1q (true)	True Error	Term IIIab	Term IIb	Error Estimate	Error Ratio	∂_1q (est)+Error Estimate ∂_1q (true)

0.04	−1.52E + 04	−1.91E + 04	1.25290	3.85E + 03	5.24E + 02	7.08E + 03	7.60E + 03	1.97601	0.75317
0.02	−1.52E + 04	−1.56E + 04	1.02488	3.78E + 02	4.93E + 01	2.60E + 02	3.10E + 02	0.81829	1.00452
0.01	−1.52E + 04	−1.53E + 04	1.00369	5.61E + 01	5.98E + 00	3.61E + 01	4.21E + 01	0.75009	1.00092

Δ t	∂_2q (true)	∂_2q (est)	∂_2q (est) ∂_2q (true)	True Error	Term IIIab	Term IIb	Error Estimate	Error Ratio	∂_2q (est)+Error Estimate ∂_2q (true)

0.04	6.66E + 02	7.03E + 02	1.05505	−3.67E + 01	1.17E + 03	−1.12E + 03	4.73E + 01	−1.29028	1.12609
0.02	6.66E + 02	6.79E + 02	1.01928	−1.28E + 01	8.71E + 01	−9.40E + 01	−6.94E + 00	0.54057	1.00886
0.01	6.66E + 02	6.68E + 02	1.00328	−2.19E + 00	1.02E + 01	−1.17E + 01	−1.48E + 00	0.67836	1.00106

Δ t	φ(0)	Φ(0)	Φ(0) φ(0)	True Error	Term II	Error Ratio	Φ(0)+Error Estimate φ(0)

0.04	1.52E + 03	1.90E + 03	1.25043	−3.81E + 02	−7.63E + 02	2.00188	0.74909
0.02	1.52E + 03	1.56E + 03	1.02465	−3.75E + 01	−3.11E + 01	0.83007	1.00419
0.01	1.52E + 03	1.53E + 03	1.00366	−5.57E + 00	−4.23E + 00	0.75982	1.00088

Open in a new tab

4.3. Example 3

We consider the nonlinear example first presented in [4]:

{\begin{matrix} \dot{x} = λ_{1} \sin (λ_{2} x), & 0 < t \leq T, \\ x (0) = 1 . \end{matrix}

The quantity of interest is the average value of x(t) over the time interval [0, 2]. Thus, we set ψ(t) = 1_[0,2](t)/2 in the adjoint problem. We use a time step of 0.25 to solve at each point of a 20 20 grid of uniformly spaced parameter values in Λ = [.8, 1.2] × [.1, π × – .1] and compute the simple function approximation of the σ_Λ shown in Figure 4.6, where we use 10,000 samples of the quantity of interest to approximate the output density. We denote the associated distribution function by F̃⁽¹⁾(t).

Fig. 4.6 — Left: The global piecewise-linear approximation to q(λ) using a coarse 20 × 20 set of cells. The circles in each cell indicate the reference parameter used to linearize q(λ) in that cell. Right: A contour plot of the computed probability distribution on a grid of 60 × 60 cells corresponding to a normal distribution on q(λ).

We have max_1≤i≤M’ (max q(b_i) – min q(b_i)) ≤ 2.69 × 10⁻¹ and the corresponding estimate is E ≤ 7.53 × 10⁻⁰⁴. The normal distribution imposed on the quantity of interest has a small variance (approximately 6.72 10⁻⁰³) so the Lipschitz constant of the distribution is bounded by 5. Thus, using (2.5) with ∊ = 0.05, we have that

∣ F (t) - {\tilde{F}}^{1} (t) ∣ \leq 2.55 \times 10^{- 22}

with probability 95%.

In order to compare the computed a posteriori error estimate to the true error, we directly approximate the error in a computed solution by using another more accurate solution. We use a time step of 1.0 × 10⁻⁰² to compute solutions to the forward problem at each parameter in the 20 × 20 grid, and we invert using 10⁸ samples of the output data and use the same resolution in Λ of 60 × 60 small cells to obtain another approximate distribution function that we denote F̃⁽²⁾(t). We compare F̃⁽¹⁾(t) to F̃⁽²⁾(t) and compare to the error bound above. We evaluate the difference in these distributions at the upper-right corner of each b_i and plot the absolute value of the difference in Figure 4.7. The maximum computed absolute value of error at these points is less than 6.70 × 10⁻⁰³, which is within the error bound above.

Fig. 4.7 — Plot of absolute values of approximate errors in probabilities over the 60 × 60 grid of cells used to approximate the solution to the inverse problem. The errors are approximated using a more accurate approximation F̃⁽²⁾(t) computed using a refined numerical solution with a time step of 10⁻², resulting in E < 10⁻⁷, and 10⁸ samples of the output density to make statistical errors small. The maximum in this plot is approximately 6.70 × 10⁻⁰³, which is less than the computed bound 2.55 × 10⁻⁰².

Acknowledgments

The work of this author was supported in part by the Department of Energy (DE-FG02-05ER25699) and the National Science Foundation (DGE-0221595003, MSPACSE-0434354).

The work of this author was supported in part by the Defense Threat Reduction Agency (HDTRA1-09-1-0036), the Department of Energy (DE-FG02-04ER25620, DEFG02-05ER25699, DE-FC02-07ER54909, DE-SC0001724), Lawrence Livermore National Laboratory (B573139, B584647), the National Aeronautics and Space Administration (NNG04GH63G), the National Science Foundation (DMS-0107832, DMS-0715135, DGE-0221595003, MSPA-CSE-0434354, ECCS-0700559), Idaho National Laboratory (00069249), NSF/NIGMS (R01GM096192), and the Sandia Corporation (PO299784).

The work of this author was supported in part by the Department of Energy (DE-FG02-04ER25620, DE-FG02-05ER25699) and the National Science Foundation (DGE-0221595003, MSPA-CSE-0434354).

Footnotes

Received by the editors February 16, 2010; accepted for publication (in revised form) September 29, 2011; published electronically January 19, 2012. http://www.siam.org/journals/sinum/50-1/78595.html

Write to estep@math.colostate.edu for information.

REFERENCES

[1].Bernardo JM. Reference posterior distributions for Bayesian inference. J. Roy. Statist. Soc. 1979;41:113–147. [Google Scholar]
[2].Billingsley P. Probability and Measure. John Wiley & Sons; New York: 1995. [Google Scholar]
[3].Butler T, Estep D. A measure-theoretic computational method for inverse sensitivity problems III: Multiple output quantities of interest. in preparation. [DOI] [PMC free article] [PubMed] [Google Scholar]
[4].Butler T, Estep D. A computational measure theoretic approach to inverse sensitivity problems I: Basic method and analysis. SIAM J. Numer. Anal. doi: 10.1137/100785958. to appear. [DOI] [PMC free article] [PubMed] [Google Scholar]
[5].Butler T, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO; 2009. Computational Measure Theoretic Approach to Inverse Sensitivity Analysis: Methods and Analysis. [Google Scholar]
[6].Carey V, Estep D, Johansson A, Larson M, Tavener S. Blockwise adaptivity for time dependent problems based on coarse scale adjoint solutions. SIAM J. Sci. Comput. 2010;32:2121–2145. [Google Scholar]
[7].Eriksson K, Estep D, Hansbo P, Johnson C. Acta Numer. Cambridge University Press; Cambridge, UK: 1995. Introduction to adaptive methods for differential equations, in Acta Numerica 4, 1995; pp. 105–158. [Google Scholar]
[8].Eriksson K, Estep D, Hansbo P, Johnson C. Computational Differential Equations. Cambridge University Press; Cambridge, UK: 1996. [Google Scholar]
[9].Estep D, French D. Global error control for the continuous Galerkin finite element method for ordinary differential equations. RAIRO Modél. Math. Anal. Numér. 1994;28:815–852. [Google Scholar]
[10].Estep D, Ginting V, Shadid J, Tavener S. An a posteriori-a priori analysis of multiscale operator splitting. SIAM J. Numer. Anal. 2008;46:1116–1146. [Google Scholar]
[11].Estep D, Holst MJ, Måalqvist A. Nonparametric density estimation for randomly perturbed elliptic problems III: Convergence, complexity, and generalizations. J. Appl. Math. Comput. to appear. [Google Scholar]
[12].Estep D, Larson MG, Williams RD. Estimating the error of numerical solutions of systems of reaction-diffusion equations. Mem. Amer. Math. Soc. 2000;146 [Google Scholar]
[13].Estep D, Måalqvist A, Tavener S. Nonparametric density estimation for randomly perturbed elliptic problems I: Computational methods, a posteriori analysis, and adaptive error control. SIAM J. Sci. Comput. 2009;31:2935–2959. [Google Scholar]
[14].Estep D, Måalqvist A, Tavener S. Nonparametric density estimation for randomly perturbed elliptic problems II: Applications and adaptive modeling. J. Numer. Methods Engrg. 2009;80:846–867. [Google Scholar]
[15].Estep D, Neckels D. Fast and reliable methods for determining the evolution of uncertain parameters in differential equations. J. Comput. Phys. 2006;213:530–556. [Google Scholar]
[16].Estep D, Neckels D. Fast methods for determining the evolution of uncertain parameters in reaction-diffusion equations. Comput. Methods Appl. Mech. Engrg. 2007;196:3967–3979. [Google Scholar]
[17].Estep D, Stewart A. The dynamical behavior of the discontinuous Galerkin method and related difference schemes. Math. Comput. 2002;71:1075–1103. [Google Scholar]
[18].Estep D. A posteriori error bounds and global error control for approximation of ordinary differential equations. SIAM J. Numer. Anal. 1995;32:1–48. [Google Scholar]
[19].Huelsenbeck JP, et al. Potential applications and pitfalls of Bayesian inference of phylogeny. Syst. Biol. 2002;51:673–688. doi: 10.1080/10635150290102366. [DOI] [PubMed] [Google Scholar]
[20].Folland G. Real Analysis. John Wiley & Sons; New York: 1999. [Google Scholar]
[21].Gentle JE. Random Number Generation and Monte Carlo Methods. Springer; New York: 2003. [Google Scholar]
[22].Gilks WR, Richardson S, Spiegelhalter DJ. Markov Chain Monte Carlo in Practice. CRC Press; Boca Raton, FL: 1995. [Google Scholar]
[23].Kaipio J, Somersalo E. Statistical and Computational Inverse Problems. Springer; New York: 2005. [Google Scholar]
[24].Knill DC, Richards W. Perception as Bayesian Inference. Cambridge University Press; Cambridge, UK: 1996. [Google Scholar]
[25].Neckels D, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO: 2005. Variational Methods for Uncertainty Quantification. [Google Scholar]
[26].Robert CP, Casella G. Monte Carlo Statistical Methods. Springer; New York: 2004. [Google Scholar]
[27].Sandelin J, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO: 2006. Global Estimate and Control of Model, Numerical, and Parameter Error. [Google Scholar]
[28].Serfling RJ. Approximation Theorems of Mathematical Statistics. John Wiley & Sons; New York: 1980. [Google Scholar]
[29].Tarantola A. Inverse Problem Theory and Methods for Model Parameter Estimation. SIAM; Philadelphia: 2005. [Google Scholar]

[R1] [1].Bernardo JM. Reference posterior distributions for Bayesian inference. J. Roy. Statist. Soc. 1979;41:113–147. [Google Scholar]

[R2] [2].Billingsley P. Probability and Measure. John Wiley & Sons; New York: 1995. [Google Scholar]

[R3] [3].Butler T, Estep D. A measure-theoretic computational method for inverse sensitivity problems III: Multiple output quantities of interest. in preparation. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] [4].Butler T, Estep D. A computational measure theoretic approach to inverse sensitivity problems I: Basic method and analysis. SIAM J. Numer. Anal. doi: 10.1137/100785958. to appear. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] [5].Butler T, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO; 2009. Computational Measure Theoretic Approach to Inverse Sensitivity Analysis: Methods and Analysis. [Google Scholar]

[R6] [6].Carey V, Estep D, Johansson A, Larson M, Tavener S. Blockwise adaptivity for time dependent problems based on coarse scale adjoint solutions. SIAM J. Sci. Comput. 2010;32:2121–2145. [Google Scholar]

[R7] [7].Eriksson K, Estep D, Hansbo P, Johnson C. Acta Numer. Cambridge University Press; Cambridge, UK: 1995. Introduction to adaptive methods for differential equations, in Acta Numerica 4, 1995; pp. 105–158. [Google Scholar]

[R8] [8].Eriksson K, Estep D, Hansbo P, Johnson C. Computational Differential Equations. Cambridge University Press; Cambridge, UK: 1996. [Google Scholar]

[R9] [9].Estep D, French D. Global error control for the continuous Galerkin finite element method for ordinary differential equations. RAIRO Modél. Math. Anal. Numér. 1994;28:815–852. [Google Scholar]

[R10] [10].Estep D, Ginting V, Shadid J, Tavener S. An a posteriori-a priori analysis of multiscale operator splitting. SIAM J. Numer. Anal. 2008;46:1116–1146. [Google Scholar]

[R11] [11].Estep D, Holst MJ, Måalqvist A. Nonparametric density estimation for randomly perturbed elliptic problems III: Convergence, complexity, and generalizations. J. Appl. Math. Comput. to appear. [Google Scholar]

[R12] [12].Estep D, Larson MG, Williams RD. Estimating the error of numerical solutions of systems of reaction-diffusion equations. Mem. Amer. Math. Soc. 2000;146 [Google Scholar]

[R13] [13].Estep D, Måalqvist A, Tavener S. Nonparametric density estimation for randomly perturbed elliptic problems I: Computational methods, a posteriori analysis, and adaptive error control. SIAM J. Sci. Comput. 2009;31:2935–2959. [Google Scholar]

[R14] [14].Estep D, Måalqvist A, Tavener S. Nonparametric density estimation for randomly perturbed elliptic problems II: Applications and adaptive modeling. J. Numer. Methods Engrg. 2009;80:846–867. [Google Scholar]

[R15] [15].Estep D, Neckels D. Fast and reliable methods for determining the evolution of uncertain parameters in differential equations. J. Comput. Phys. 2006;213:530–556. [Google Scholar]

[R16] [16].Estep D, Neckels D. Fast methods for determining the evolution of uncertain parameters in reaction-diffusion equations. Comput. Methods Appl. Mech. Engrg. 2007;196:3967–3979. [Google Scholar]

[R17] [17].Estep D, Stewart A. The dynamical behavior of the discontinuous Galerkin method and related difference schemes. Math. Comput. 2002;71:1075–1103. [Google Scholar]

[R18] [18].Estep D. A posteriori error bounds and global error control for approximation of ordinary differential equations. SIAM J. Numer. Anal. 1995;32:1–48. [Google Scholar]

[R19] [19].Huelsenbeck JP, et al. Potential applications and pitfalls of Bayesian inference of phylogeny. Syst. Biol. 2002;51:673–688. doi: 10.1080/10635150290102366. [DOI] [PubMed] [Google Scholar]

[R20] [20].Folland G. Real Analysis. John Wiley & Sons; New York: 1999. [Google Scholar]

[R21] [21].Gentle JE. Random Number Generation and Monte Carlo Methods. Springer; New York: 2003. [Google Scholar]

[R22] [22].Gilks WR, Richardson S, Spiegelhalter DJ. Markov Chain Monte Carlo in Practice. CRC Press; Boca Raton, FL: 1995. [Google Scholar]

[R23] [23].Kaipio J, Somersalo E. Statistical and Computational Inverse Problems. Springer; New York: 2005. [Google Scholar]

[R24] [24].Knill DC, Richards W. Perception as Bayesian Inference. Cambridge University Press; Cambridge, UK: 1996. [Google Scholar]

[R25] [25].Neckels D, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO: 2005. Variational Methods for Uncertainty Quantification. [Google Scholar]

[R26] [26].Robert CP, Casella G. Monte Carlo Statistical Methods. Springer; New York: 2004. [Google Scholar]

[R27] [27].Sandelin J, Colorado State University . Ph.D. thesis, Department of Mathematics. Fort Collins, CO: 2006. Global Estimate and Control of Model, Numerical, and Parameter Error. [Google Scholar]

[R28] [28].Serfling RJ. Approximation Theorems of Mathematical Statistics. John Wiley & Sons; New York: 1980. [Google Scholar]

[R29] [29].Tarantola A. Inverse Problem Theory and Methods for Model Parameter Estimation. SIAM; Philadelphia: 2005. [Google Scholar]

PERMALINK

A COMPUTATIONAL MEASURE THEORETIC APPROACH TO INVERSE SENSITIVITY PROBLEMS II: A POSTERIORI ERROR ANALYSIS*

T BUTLER

D ESTEP

J SANDELIN

Abstract

1. Introduction

1.1. The inverse problem

1.2. The solution method

1.3. Sources of error

2. General error analysis for a computed probability distribution

3. Application to nonlinear ordinary differential equations

3.1. Construction of the piecewise-linear representation

3.2. Discretization

3.3. The effect of using an approximate solution on the piecewise-linear representation

3.4. Convergence and order of accuracy

3.5. Estimate of the error in a quantity of interest

Implementation details

3.6. Estimating term II in (3.15)

3.7. Estimating term III in (3.15)

4. Examples

4.1. Example 1

Fig. 4.1.

Table 4.1. Example 1 results for the three partial derivatives of the solution at T = 3.

Fig. 4.2.

Fig. 4.3.

Table 4.2. Example 1 results for the three partial derivatives of the solution at T = 10.

4.2. Example 2

Fig. 4.4.

Fig. 4.5.

Table 4.3. Example 2 results for the two partial derivatives of the solution (upper) and the adjoint solution (lower) at T = 3.9.

Table 4.4. Example 2 results for the two partial derivatives of the solution (upper) and the adjoint solution (lower) at T = 10.

4.3. Example 3

Fig. 4.6.

Fig. 4.7.

Acknowledgments

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

A COMPUTATIONAL MEASURE THEORETIC APPROACH TO INVERSE SENSITIVITY PROBLEMS II: A POSTERIORI ERROR ANALYSIS^{^*}