Approximation methods for piecewise deterministic Markov processes and their costs

Peter Kritzer; Gunther Leobacher; Michaela Szölgyenyi; Stefan Thonhauser

doi:10.1080/03461238.2018.1560357

2019 Jan 9;2019(4):308–335. doi: 10.1080/03461238.2018.1560357

PMCID: PMC6474733 PMID: 31058276

ABSTRACT

In this paper, we analyse piecewise deterministic Markov processes (PDMPs), as introduced in Davis (1984). Many models in insurance mathematics can be formulated in terms of the general concept of PDMPs. There one is interested in computing certain quantities of interest such as the probability of ruin or the value of an insurance company. Instead of explicitly solving the related integro-(partial) differential equation (an approach which can only be used in a few special cases), we adapt the problem in a manner that allows us to apply deterministic numerical integration algorithms such as quasi-Monte Carlo rules; this is in contrast to applying random integration algorithms such as Monte Carlo. To this end, we reformulate a general cost functional as a fixed point of a particular integral operator, which allows for iterative approximation of the functional. Furthermore, we introduce a smoothing technique which is applied to the integrands involved, in order to use error bounds for deterministic cubature rules. We prove a convergence result for our PDMP approximation, which is of independent interest as it justifies phase-type approximations on the process level. We illustrate the smoothing technique for a risk-theoretic example, and compare deterministic and Monte Carlo integration.

KEYWORDS: Risk theory, piecewise deterministic Markov process, quasi-Monte Carlo methods, phase-type approximations, dividend maximisation

MATHEMATICS SUBJECT CLASSIFICATION (2010): 60J25, 91G60, 65D32

1. Introduction

Many models in risk theory can be formulated as piecewise deterministic Markov processes (PDMPs) – a general class of finite-variation sample path Markov processes introduced by Davis (1984). This applies, among others, to the classical Cramér–Lundberg model, the renewal risk models, and multi-portfolio models recently introduced by Albrecher & Lautscham (2015). Moreover, PDMPs are sufficiently general to allow for non-constant model parameters, i.e. quantities such as the hazard rate or the premium rate may be state dependent. Examples of PDMPs and their control in the field of insurance mathematics are, e.g. Dassios & Embrechts (1989), Embrechts & Schmidli (1994), Schäl (1998), Rolski (1999), Cai et al. (2009), Leobacher & Ngare (2016), and Eichler et al. (2017).

The general theory of PDMPs is well developed, see for example the monographs by Davis (1993), Jacobsen (2006), or Bäuerle & Rieder (2011) for general results on PDMPs and their optimal control. More specialised contributions to the control theory of PDMPs can be found in Davis (1993), Lenhart & Liao (1985), Costa & Davis (1989), Dempster & Ye (1992), Almudevar (2001), Forwick et al. (2004), Bäuerle & Rieder (2010), Costa & Dufour (2013), or Davis & Farid (1999) for viscosity solutions of associated Hamilton–Jacobi–Bellman equations, and Colaneri (2017) for a general comparison principle for solutions to control problems for PDMPs.

For the numerical treatment of (control) problems for PDMPs, however, only problem-specific solutions have been provided. A standard approach is to link expected values representing a quantity of interest in the problem to the solution of an associated integro-(partial) differential equation, see, e.g. Asmussen & Albrecher (2010). In only very few cases is it possible to derive an explicit solution to this integro-(partial) differential equation. Requiring an explicit solution typically restricts the complexity of the model significantly. One possibility is to solve the integro-(partial) differential equation numerically. This carries all the intricacies and difficulties of a combined numerical method for differential and integral equations. Alternatively one can apply crude Monte Carlo methods, see, e.g. Riedler (2013). Those methods, while robust, are limited in speed by the Monte Carlo convergence rate. Another – highly sophisticated – approach uses quantisation of the jump distribution, see de Saporta et al. (2016).

In this article we concentrate on particularly easy to implement methods similar to Monte Carlo. The aim is to adapt the problem in a way that also allows for deterministic numerical integration algorithms such as quasi-Monte Carlo (QMC). QMC has been applied successfully to problems in risk theory, see Tichy (1984), Coulibaly & Lefèvre (2008), Siegl & Tichy (2000), Albrecher & Kainhofer (2002), and Preischl et al. (2018). It should be noted that the finiteness of the total variation needed for the convergence estimate (Albrecher & Kainhofer 2002, Theorem 1) has not been proven.

We would like to highlight two features of our approach. Inspired by Albrecher & Kainhofer (2002), we reformulate a general cost functional as a fixed point of a particular integral operator, which allows for iterative approximation of the functional. In terms of numerical integration this means that we get a high-dimensional integration problem of fixed dimension, where the dimension is a multiple of the number of iterations. Having a fixed dimension is required for the application of standard QMC or other deterministic cubature rules.

The application of QMC requires some degree of regularity of the integrand. Only in rare cases will this requirement be satisfied automatically. The examples from risk theory considered here lead to non-smooth integrands. For these situations, we introduce a smoothing technique which, in its simplest case, leads to $C^{2}$ integrands. From the earlier considerations, we obtain deterministic error bounds for those. We prove convergence in distribution of the ‘smoothed processes’ to the original ones, which implies convergence of the corresponding expected values for every initial value of the process. In Section 2.1 we even obtain uniform convergence with respect to the initial value in a particular setup from risk theory.

Our convergence result has an additional benefit for a typical situation in risk-theoretic modelling. In the literature on the analysis of ruin probabilities, or more generally, on Gerber–Shiu functions, the assumption of a claim size distribution of mixed exponential or phase-type form is quite common. Apart from the possibility to obtain explicit expressions for quantities of interest in such setups, this modelling approach is motivated by the fact that the class of phase-type distributions is dense in the class of distributions with support on $[0, \infty)$ , see Rolski (1999, Theorem 8.2.3). Under mild assumptions on the claim size distribution we want to approximate, our convergence result applies and justifies the phase-type approximation procedure even on the process level. Furthermore, efficient and easy to implement numerical methods for the computation of important targets such as Gerber-Shiu functions and expected discounted future dividend payments of an insurance company are of particular importance when models become more general and hence also more complicated. This makes our contribution valuable from both the analytical and the numerical point of view.

We would like to emphasise that the methods presented here per se do not provide solutions to optimal control problems, which is the main application of PDMPs in risk theory. However, the integration algorithms as introduced here can be used in a policy iteration procedure for calculating costs associated with a fixed policy.

The paper is structured as follows. In Section 2 we recall the definition of a PDMP and provide some risk-theoretic examples. In Section 3 we derive the fixed point approach for valuation of a cost functional of a PDMP. Section 4 reviews deterministic numerical integration of possibly multivariate $C^{k}$ functions. Subsequently, Section 5 is devoted to the aforementioned smoothing procedure, and presents a stability result. Section 6 contains an application of the smoothing to one of the risk-theoretic examples and a comparative study of deterministic and Monte Carlo integration for this example.

2. Piecewise deterministic Markov processes

In this section we first define PDMPs. Then we give a couple of examples of practical interest.

A PDMP is a continuous-time stochastic process with (possibly random) jumps, which follows a deterministic flow, e.g. the solution of an ordinary differential equation (ODE), between jump times. We will not give the most general definition of PDMPs here, but instead refer to the monograph by Davis (1993). For a subset A of $R^{d}$ we denote by $A^{\circ}, \bar{A}$ , and $\partial A$ its interior, closure, and boundary, respectively. We write $B (A)$ for the Borel σ-algebra on A.

Definition 2.1

Let $A \subseteq R^{d}$ . A function $ϕ : A \times R \to R^{d}$ is called a flow on A, if

ϕ is continuous,

$ϕ (x, 0) = x$ for all $x \in A$ ;

for all $x \in A$ and all $s, t \in R$ it holds that if $ϕ (x, t) \in A$ and $ϕ (ϕ (x, t), s) \in A$ then $ϕ (x, t + s) = ϕ (ϕ (x, t), s)$ .

For fixed $x \in A$ , let $ϕ^{- 1} (x, A) = {t \in R : ϕ (x, t) \in A}$ . Then the function $ϕ (x, \cdot) : ϕ^{- 1} (x, A) \to A$ is called a trajectory of the flow.

If ϕ is a flow on A, then we write $\partial_{ϕ}^{-} A = \{x \in \partial A : \exists ϵ \in (0, \infty) \text{ such that } \forall t \in (0, ϵ) : ϕ (x, t) \in A^{\circ}\}$ and $\partial_{ϕ}^{+} A = \{x \in \partial A : \exists ϵ \in (0, \infty) \text{ such that } \forall t \in (0, ϵ) : ϕ (x, - t) \in A^{\circ}\}$ .

Thus $\partial_{ϕ}^{-} A$ consists of the points on the boundary of A from which the trajectory moves into $A^{\circ}$ immediately, and $\partial_{ϕ}^{+} A$ consists of the points on the boundary of A to which a trajectory moves from $A^{\circ}$ without passing other points on the boundary in-between. Furthermore, we write $\partial_{ϕ}^{1} A : = \partial_{ϕ}^{-} A ∖ \partial_{ϕ}^{+} A$ .

Remark 2.2

The classical example of a flow arises through ODEs. Let $g : R^{d} \to R^{d}$ be Lipschitz continuous. By the classical Picard–Lindelöf theorem on existence and uniqueness of solutions of ODEs we have that for every $x \in R^{d}$ there exists a continuously differentiable function $κ : R \to R^{d}$ such that $κ (0) = x$ and $κ^{'} (s) = g (κ (s))$ for all $s \in R$ . For $t \in R$ we define $ϕ (x, t) = κ (t)$ . The function ϕ defines a flow on $R^{d}$ . If $A \subseteq R^{d}$ , then the restriction of ϕ to $A \times R$ is a flow on A.
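As a small illustration of Remark 2.2, consider the scalar linear ODE $z^{'} = c + ρ z$ , whose right-hand side is Lipschitz continuous and whose solution is known explicitly as $ϕ (x, t) = (x + c / ρ) e^{ρ t} - c / ρ$ . The following Python sketch checks the two algebraic flow properties of Definition 2.1 numerically. The function name `flow` and the parameter values are illustrative assumptions and are not taken from the paper.

```python
import math

def flow(x, t, c=1.0, rho=0.05):
    """Explicit solution at time t of z' = c + rho*z with z(0) = x (rho != 0)."""
    return (x + c / rho) * math.exp(rho * t) - c / rho

x, s, t = 2.0, 0.7, 1.3
# Property phi(x, 0) = x
assert math.isclose(flow(x, 0.0), x)
# Flow property phi(x, t + s) = phi(phi(x, t), s)
assert math.isclose(flow(x, t + s), flow(flow(x, t), s))
```

Since the flow is defined for negative times as well, the same check can be repeated with negative values of `s` or `t`.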

Definition 2.3

Let K be a finite set and let $d : K \to N$ be a function which satisfies that, for every $k \in K$ , $E_{k} \subseteq R^{d (k)}$ and $ϕ_{k}$ is a flow on $E_{k}$ with $E_{k} = E_{k}^{\circ} \cup \partial_{ϕ_{k}}^{1} E_{k}$ .

The state space $(E, E)$ of a PDMP is the measurable space defined by $E = ⋃_{k \in K} ({k} \times E_{k})$ and $E = σ ({{k} \times B : k \in K, B \in B (E_{k})})$ .

The flow of a PDMP is defined by $ϕ = {ϕ_{k}}_{k \in K}$ .

The active boundary of the PDMP is defined by $Γ^{*} = ⋃_{k = 1}^{K} \partial_{ϕ_{k}}^{+} E_{k}$ . Furthermore, we define a σ-algebra on $E \cup Γ^{*}$ by $E^{*} = σ ({{k} \times B : k \in K, B \in B (E_{k} \cup \partial_{ϕ_{k}}^{+} E_{k})})$ .

The jump intensity λ of a PDMP is defined by a family of functions $λ = {λ_{k}}_{k \in K}$ with $λ_{k} : E_{k} \to [0, \infty)$ measurable and bounded for all $k \in K$ .

The jump kernel Q of a PDMP is a function $Q : E \times (E \cup Γ^{*}) \to [0, 1]$ such that $Q (A, \cdot)$ is $E^{*}$ - $B ([0, 1])$ measurable for every $A \in E$ , and $Q (\cdot, x)$ is a probability measure on $(E, E)$ for every $x \in E$ with $Q ({x}, x) = 0$ .

We call the triple $(ϕ, λ, Q)$ the local characteristics of a PDMP.

Given a state space $(E, E)$ and local characteristics $(ϕ, λ, Q)$ of a PDMP we define the function $t^{*} : E \to [0, \infty]$ by

t^{*} (k, y) = \begin{cases} \inf \{t > 0 : ϕ_{k} (y, t) \in \partial_{ϕ_{k}}^{+} E_{k}\} & \text{if } \exists t > 0 : ϕ_{k} (y, t) \in \partial_{ϕ_{k}}^{+} E_{k}, \\ \infty & \text{otherwise.} \end{cases}

Definition 2.4

Let $(E, E)$ be a state space and let $(ϕ, λ, Q)$ be local characteristics of a PDMP, let $x \in E$ , and let $(Ω, F, P)$ be a probability space. A piecewise deterministic Markov process starting in x is a stochastic process $X : [0, \infty) \times Ω \to E$ which satisfies the following. There exists a sequence of random variables $(T_{n})_{n \in N}$ with $T_{n} \in [0, \infty]$ and $T_{n} \leq T_{n + 1}$ a.s. for all $n \in N$ , and with $lim_{n \to \infty} T_{n} = \infty$ a.s., such that

it holds $P$ -a.s. that $X_{0} = x$ ,

for all $n \in N$ , $t \in [T_{n}, T_{n + 1})$ , and for $(k, y) \in E$ with $X_{T_{n}} = (k, y)$ it holds $P$ -a.s. that $X_{t} = ϕ_{k} (y, t - T_{n})$ ,

for all $s, t \in [0, \infty)$ it holds $P$ -a.s. that
$\begin{aligned} & P (T_{n + 1} - T_{n} > t \mid X_{s} = (k, y) \text{ and } T_{n} \leq s < T_{n + 1}) \\ & \quad = \begin{cases} e^{- \int_{0}^{t} λ_{k} (ϕ_{k} (y, τ)) d τ} & \text{if } 0 < t < t^{*} (k, y), \\ 0 & \text{if } t \geq t^{*} (k, y), \end{cases} \end{aligned}$

for all $n \in N$ and all $A \in E$ it holds $P$ -a.s. that
$P (X_{T_{n + 1}} \in A | X_{T_{n} -}) = Q (A, X_{T_{n}}) .$
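The survival function in Definition 2.4 suggests a direct way of simulating inter-jump times. The sketch below uses thinning, a standard technique for inhomogeneous Poisson processes that is swapped in here for illustration and is not the method of this paper. It assumes that the intensity is bounded by a known constant; all names (`sample_jump_time`, `c_lam`, `t_star`) are hypothetical.

```python
import math
import random

def sample_jump_time(y, flow, intensity, c_lam, t_star, rng):
    """Sample the time until the next jump of a PDMP component started in y.

    flow(y, t)   -- deterministic flow phi_k(y, t)
    intensity(z) -- jump intensity lambda_k(z), assumed 0 <= lambda_k <= c_lam
    t_star       -- time at which the flow hits the active boundary (may be math.inf)
    The loop terminates only if t_star is finite or the intensity is not
    identically zero along the trajectory.
    """
    t = 0.0
    while True:
        # Candidate event of a homogeneous Poisson process with rate c_lam
        t += rng.expovariate(c_lam)
        if t >= t_star:
            return t_star  # forced jump at the active boundary
        # Accept the candidate with probability lambda(phi(y, t)) / c_lam
        if rng.random() * c_lam < intensity(flow(y, t)):
            return t
```

For a constant intensity equal to `c_lam` every candidate is accepted, so the sampled time is an exponential random variable capped at `t_star`, in agreement with the two cases of the survival function above.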

Theorem 2.5

Let $(E, E)$ be a state space and let $(ϕ, λ, Q)$ be local characteristics of a PDMP, let $x \in E$ . There exist a probability space $(Ω, F, P_{x})$ and a stochastic process $X : [0, \infty) \times Ω \to E$ such that X is a PDMP starting in x with state space E and local characteristics $(ϕ, λ, Q)$ . Furthermore, X has the strong Markov property.

Proof.

The proof of Theorem 2.5 for a more general setup that also allows for the possibility of explosions and countable K can be found in Davis (1993, Section 2.25).

Figure 1 illustrates a path of a PDMP.

Let $f : E \to R$ be a function. For all $k \in K$ we denote by $f_{k}$ the function $f_{k} : E_{k} \to R$ which satisfies for all $x \in E_{k}$ that $f_{k} (x) = f (k, x)$ . It is not hard to see that f is measurable if and only if $f_{k}$ is measurable for every $k \in K$ . We say that f is n-times continuously differentiable, if for every $k \in K$ there exists an open set $A_{k} \subseteq R^{d (k)}$ with $E_{k} \subseteq A_{k}$ and an n-times continuously differentiable function ${\hat{f}}_{k} : A_{k} \to R$ such that $f_{k} = {\hat{f}}_{k} |_{E_{k}}$ . We write $C^{n} (E, R^{m})$ for the space of n-times differentiable functions on E and $C_{b}^{n} (E, R^{m})$ for the space of functions in $C^{n} (E, R^{m})$ for which all derivatives are bounded. Moreover, $C_{0}^{n} (E, R^{m})$ is the space of functions in $C_{b}^{n} (E, R^{m})$ for which all derivatives vanish at infinity.

Further, for $f : E \to R$ , a PDMP X, and $t \in (0, \infty)$ we write $E (f (X_{t}) | X_{0} = x) =: E_{x} (f (X_{t}))$ .

In the remainder of this section we provide some illustrative examples from risk theory. For other examples and applications in different fields we refer to Davis (1993), de Saporta et al. (2012), and Riedler (2013).

2.1. Examples

2.1.1. Classical Cramér–Lundberg model

Let $X = (X_{t})_{t \geq 0}$ be a stochastic process given by

X_{t} = x + c t - S_{t}, t \geq 0,

(1)

where $x, c \geq 0$ , $N = (N_{t})_{t \geq 0}$ is a homogeneous Poisson process with intensity $λ_{N} > 0$ , ${Y_{i}}_{i \in N}$ is a family of positive i.i.d. random variables with distribution function $F_{Y}$ , and $S_{t} = \sum_{i = 1}^{N_{t}} Y_{i}$ for all $t \geq 0$ . A common assumption in this kind of model is the independence of ${Y_{i}}_{i \in N}$ and N. In risk theory the process X represents a standard model for the surplus of an insurance portfolio. A quantity of interest is the probability of X ever becoming negative, i.e. we are interested in $P (τ < \infty)$ , where $τ = inf {t \geq 0 : X_{t} < 0}$ . The model translates into a PDMP via

$K = {1, 2}$ ,
$E_{1} = [0, \infty)$ , $E_{2} = (- \infty, 0)$ ,
$ϕ_{1} (y, t) = y + c t$ $\forall y \in E_{1}$ and $\forall t \in R$ , $ϕ_{2} (y, t) = y$ $\forall y \in E_{2}$ and $\forall t \in R$ ,
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{2} (y) = 0$ $\forall y \in E_{2}$ .
For $B_{1} \in B (E_{1})$ , $B_{2} \in B (E_{2})$ , and $B = ({1} \times B_{1}) \cup ({2} \times B_{2})$ ,
$Q (B, (1, y)) = P (Y \in y - B_{1}) + P (Y \in y - B_{2})$
for $y \in E_{1}$ , and $Q (B, (2, y)) = P (Y \in y - B_{2})$ ,

where we have used the notation $y - B = {y - y^{'} : y^{'} \in B}$ for all $y \in R$ and $B \in B (R)$ . For $y \in E_{2}$ , any definition for Q will do, since the jump intensity is 0 there, but the above definition is provided for definiteness.
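To make the PDMP description concrete, the following Python sketch simulates the classical Cramér–Lundberg model by alternating the deterministic flow $y + c t$ with claim jumps, and estimates the ruin probability by crude Monte Carlo. It rests on assumptions that are not part of the model above: claims are taken to be exponentially distributed with rate `mu`, and simulation stops at a finite `horizon`, which introduces a small truncation bias. For exponential claims the classical closed-form ruin probability is available for comparison. All function names are illustrative.

```python
import math
import random

def ruined_before(x, c, lam, mu, horizon, rng):
    """Simulate one path with Exp(mu) claims; True if ruin occurs before `horizon`."""
    t, surplus = 0.0, x
    while True:
        w = rng.expovariate(lam)        # inter-jump time
        t += w
        if t > horizon:
            return False
        surplus += c * w                # deterministic flow between jumps
        surplus -= rng.expovariate(mu)  # claim size
        if surplus < 0:                 # jump into the ruin component E_2
            return True

def ruin_probability_mc(x, c, lam, mu, horizon, n_paths, seed=0):
    """Crude Monte Carlo estimate of the (finite-horizon) ruin probability."""
    rng = random.Random(seed)
    hits = sum(ruined_before(x, c, lam, mu, horizon, rng) for _ in range(n_paths))
    return hits / n_paths

def ruin_probability_exact(x, c, lam, mu):
    """Classical infinite-horizon formula for Exp(mu) claims (requires c > lam / mu)."""
    return lam / (c * mu) * math.exp(-(mu - lam / c) * x)
```

Because the drift is non-negative, ruin can only occur at a claim time, which is why the surplus is checked only after each jump.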

2.1.2. Cramér–Lundberg model with dividend payments

A classical modification of the model from Section 2.1.1 is the introduction of a dividend barrier at level $b > 0$ . Then, once the surplus reaches the barrier, the incoming premium rate is immediately distributed as a dividend. Furthermore, if the process starts above b, the excess is distributed as a lump sum dividend, such that $X_{0 +} = min {x, b}$ . A typical quantity of interest is the expected value of discounted future dividend payouts until ruin of the company, which is given by

V (x) = \begin{cases} E_{x} (\int_{0}^{τ} e^{- δ t} c 1_{\{X_{t} = b\}} d t) & \text{if } x \leq b, \\ x - b + E_{b} (\int_{0}^{τ} e^{- δ t} c 1_{\{X_{t} = b\}} d t) & \text{if } x > b, \end{cases}

(2)

where $δ > 0$ is a preference-based discount factor and $τ = inf {t \geq 0 : X_{t} < 0}$ . The model translates into a PDMP via

$K = {1, 2, 3}$ ,
$E_{1} = [0, b)$ , $E_{2} = (- \infty, 0)$ , $E_{3} = {b}$ ,
$ϕ_{1} (y, t) = y + c t$ $\forall y \in E_{1}$ and $\forall t \in R$ , $ϕ_{2} (y, t) = y$ $\forall y \in E_{2}$ and $\forall t \in R$ , $ϕ_{3} (y, t) = y$ $\forall y \in E_{3}$ and $\forall t \in R$ ,
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{2} (y) = 0$ $\forall y \in E_{2}$ , $λ_{3} (y) = λ_{N}$ $\forall y \in E_{3}$ .
For $B_{k} \in B (E_{k})$ , $1 \leq k \leq 3$ , and $B = ({1} \times B_{1}) \cup ({2} \times B_{2}) \cup ({3} \times B_{3})$ ,
$Q (B, (1, y)) = P (Y \in y - B_{1}) + P (Y \in y - B_{2})$
for $y \in E_{1}$ , $Q (B, (2, y)) = P (Y \in y - B_{2})$ for $y \in E_{2}$ , and
$Q (B, (3, y)) = P (Y \in y - B_{1}) + P (Y \in y - B_{2})$
for $y \in E_{3}$ . Finally, $Q (B, (1, y)) = 1_{B_{3}} (y) (3, y)$ for $y \in \partial_{ϕ_{1}}^{1} E_{1} = {b}$ .

Note that only initial values $x \in (- \infty, b]$ translate to a viable initial value for the PDMP. However, this is sufficient for determining $V (x)$ for all $x \in R$ via (2).

2.1.3. Cramér–Lundberg model with time dependent dividend barrier

In Albrecher & Kainhofer (2002) the model from Section 2.1.2 is further extended to include a time dependent barrier $b : [0, \infty) \to [0, \infty)$ of the form

b (t) = {(b_{0}^{m} + \frac{t}{α})}^{1 / m},

where $α, b_{0} > 0$ , $m > 1$ . The quantity of interest is again the expected value of discounted future dividend payments until the time of ruin, i.e.

V (x) = E_{x} (\int_{0}^{τ} e^{- δ t} (c - b_{t}) 1_{{X_{t} = b_{t}}} d t),

for $x \leq b_{0}$ , where again $τ = inf {t \geq 0 : X_{t} < 0}$ and $δ > 0$ is a preference-based discount factor. The model translates into a PDMP via

$K = {1, 2, 3}$ ,
$E_{1} = {(s, y) \in R^{2} : 0 \leq y < b (s)}$ , $E_{2} = {(s, y) \in R^{2} : y < 0}$ , $E_{3} = {(s, y) \in R^{2} : y = b (s)}$ ,
$ϕ_{1} ((s, y), t) = (s + t, y + c t)$ $\forall (s, y) \in E_{1}$ and $\forall t \in R$ , $ϕ_{2} ((s, y), t) = (s + t, y)$ $\forall (s, y) \in E_{2}$ and $\forall t \in R$ , $ϕ_{3} ((s, y), t) = (s + t, b (s + t))$ $\forall (s, y) \in E_{3}$ and $\forall t \in R$ ,
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{2} (y) = 0$ $\forall y \in E_{2}$ , $λ_{3} (y) = λ_{N}$ $\forall y \in E_{3}$ .
For $B_{k} \in B (E_{k})$ , $1 \leq k \leq 3$ , and $B = ({1} \times B_{1}) \cup ({2} \times B_{2}) \cup ({3} \times B_{3})$ ,
$Q (B, (1, (s, y))) = P (Y \in y - ({s} \times R) \cap B_{1}) + P (Y \in y - ({s} \times R) \cap B_{2})$
for $(s, y) \in E_{1}$ , $Q (B, (2, (s, y))) = P (Y \in y - ({s} \times R) \cap B_{2})$ for $(s, y) \in E_{2}$ , and
$Q (B, (3, (s, y))) = P (Y \in y - ({s} \times R) \cap B_{1}) + P (Y \in y - ({s} \times R) \cap B_{2})$
for $(s, y) \in E_{3}$ . Finally, $Q (B, (1, (s, y))) = 1_{B_{3}} ((s, y)) (3, (s, y))$ for $(s, y) \in \partial_{ϕ_{1}}^{1} E_{1} = E_{3}$ .

2.1.4. Cramér–Lundberg model with loan

In Dassios & Embrechts (1989) the model from Section 2.1.2 is modified such that the insurance company is not ruined when the surplus hits zero, but has the possibility to take up a loan at an interest rate $ρ > 0$ . The time of ruin is given by $τ = inf {t \geq 0 : X_{t} < - c / ρ}$ . The corresponding quantity of interest is

V (x) = E_{x} (\int_{0}^{τ} e^{- δ t} c 1_{{X_{t} = b}} d t),

for $x \leq b$ , where $δ > 0$ is a preference-based discount factor. The model translates into a PDMP via

$K = {1, 2, 3, 4, 5}$ ,
$E_{1} = [0, b)$ , $E_{2} = (- (c / ρ), 0)$ , $E_{3} = {b}$ , $E_{4} = (- \infty, - (c / ρ))$ , $E_{5} = {- c / ρ}$ ,
$ϕ_{1} (y, t) = y + c t$ $\forall y \in E_{1}$ and $\forall t \in R$ , $ϕ_{2}$ is the flow of the ODE $z^{'} = c + ρ z$ at $(y, t)$ $\forall y \in E_{2}$ and $\forall t \in R$ , $ϕ_{3} (y, t) = y$ $\forall y \in E_{3}$ and $\forall t \in R$ , $ϕ_{4} (y, t) = y$ $\forall y \in E_{4}$ and $\forall t \in R$ , $ϕ_{5} (y, t) = y$ $\forall y \in E_{5}$ and $\forall t \in R$ ,
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{2} (y) = λ_{N}$ $\forall y \in E_{2}$ , $λ_{3} (y) = λ_{N}$ $\forall y \in E_{3}$ , $λ_{4} (y) = 0$ $\forall y \in E_{4}$ , $λ_{5} (y) = 0$ $\forall y \in E_{5}$ .
For $B_{k} \in B (E_{k})$ , $1 \leq k \leq 5$ , and $B = ⋃_{k = 1}^{5} ({k} \times B_{k})$ ,
$Q (B, (1, y)) = P (Y \in y - B_{1}) + P (Y \in y - B_{2}) + P (Y \in y - B_{4})$
for $y \in E_{1}$ , $Q (B, (2, y)) = P (Y \in y - B_{2}) + P (Y \in y - B_{4})$ for $y \in E_{2}$ , and
$Q (B, (3, y)) = P (Y \in y - B_{1}) + P (Y \in y - B_{2})$
for $y \in E_{3}$ . Finally, $Q (B, (1, y)) = 1_{B_{3}} (y) (3, y)$ for $y \in \partial_{ϕ_{1}}^{1} E_{1} = {b}$ , and $Q (B, (2, y)) = 1_{B_{2}} (y) (1, y)$ for $y \in \partial_{ϕ_{2}}^{1} E_{2} = {0}$ .

2.1.5. Multidimensional Cramér–Lundberg model

In Albrecher & Lautscham (2015) a two-dimensional extension of the model in Section 2.1.2 is studied. The model is based on two independent surplus processes describing two insurance portfolios, $X_{t}^{(j)} = x^{(j)} + c^{(j)} t - S_{t}^{(j)}$ , $j \in {1, 2}$ , where $c^{(1)}, c^{(2)} \geq 0$ and $S^{(j)}$ are compound Poisson processes with intensities $λ^{(1)}, λ^{(2)}$ and jump size distributions $F_{Y^{(1)}}, F_{Y^{(2)}}$ . Furthermore, $b^{(1)}, b^{(2)} \geq 0$ are barriers. As a new feature, the drift of the component at the barrier is added to the other component's drift, causing faster growth of the latter. Dividends are only paid when both surplus processes have reached their individual barriers. We show how the model translates into a PDMP, namely

\begin{aligned} E_{1} & = {(x^{(1)}, x^{(2)}) \in R^{2} : 0 \leq x^{(1)} < b^{(1)}, 0 \leq x^{(2)} < b^{(2)}}, \\ E_{2} & = {(x^{(1)}, x^{(2)}) \in R^{2} : b^{(1)} = x^{(1)}, 0 \leq x^{(2)} < b^{(2)}}, \\ E_{3} & = {(x^{(1)}, x^{(2)}) \in R^{2} : 0 \leq x^{(1)} < b^{(1)}, b^{(2)} = x^{(2)}}, \\ E_{4} & = {(x^{(1)}, x^{(2)}) \in R^{2} : b^{(1)} = x^{(1)}, b^{(2)} = x^{(2)}}, \\ E_{5} & = R^{2} ∖ (E_{1} \cup E_{2} \cup E_{3} \cup E_{4}) . \end{aligned}

The flow is given by

\begin{aligned} ϕ_{1} (x, t) = x + (\begin{matrix} c^{(1)} \\ c^{(2)} \end{matrix}) t, ϕ_{2} (x, t) = x + (\begin{matrix} 0 \\ c^{(1)} + c^{(2)} \end{matrix}) t, ϕ_{3} (x, t) = x + (\begin{matrix} c^{(1)} + c^{(2)} \\ 0 \end{matrix}) t, \end{aligned}

and $ϕ_{4} (x, t) = ϕ_{5} (x, t) = x$ for all $x \in R^{2}$ , $t \geq 0$ . It remains to describe the jump behaviour. We get deterministic ‘jumps’ at the active boundaries of $E_{1}, E_{2}, E_{3}$ which do not manifest themselves as jumps of the process, i.e. $Q (A, (1, x)) = 1_{A} ((2, x))$ for $(1, x) \in \partial_{ϕ_{1}}^{1} (E_{1})$ and similarly for the other active boundaries. Since each surplus process is a compound Poisson process with drift, jumps in the components occur due to realisations of independent identically distributed exponential random variables (independence implies that mutual jumps occur with probability zero). The two-dimensional process thus jumps at the minimum of the individual jump times. This means that we have a constant jump intensity $λ_{k} = λ^{(1)} + λ^{(2)}$ for $k = 1, 2, 3, 4$ , and $λ_{5} = 0$ . If a jump occurs at time $t \geq 0$ , it happens with probability $λ^{(1)} / (λ^{(1)} + λ^{(2)})$ in the first surplus process with jump size distribution $F_{Y^{(1)}}$ , and with probability $λ^{(2)} / (λ^{(1)} + λ^{(2)})$ in the second surplus process with jump size distribution $F_{Y^{(2)}}$ . It remains to describe the jump kernel for the jumps from $x \in E$ . To this end define, for $k_{1}, k_{2} \in {1, 2, 3, 4}$ and $B \in B (E_{k_{2}}) \subseteq B (R^{2})$ , and $(y^{(1)}, y^{(2)}) \in E_{k_{1}}$ ,

\begin{aligned} B^{(1)} & = {z^{(1)} \in R : (z^{(1)}, z^{(2)}) \in B, z^{(2)} = y^{(2)}}, \\ B^{(2)} & = {z^{(2)} \in R : (z^{(1)}, z^{(2)}) \in B, z^{(1)} = y^{(1)}} . \end{aligned}

Furthermore,

Q ({k_{2}} \times B, (k_{1}, y^{(1)}, y^{(2)})) = \frac{λ^{(1)}}{λ^{(1)} + λ^{(2)}} F_{Y^{(1)}} (y^{(1)} - B^{(1)}) + \frac{λ^{(2)}}{λ^{(1)} + λ^{(2)}} F_{Y^{(2)}} (y^{(2)} - B^{(2)}) .
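The two weights in the jump kernel come from the competition between the two independent claim arrival streams. The Python sketch below, with the hypothetical helper `next_jump`, samples the next jump of the two-dimensional process as the minimum of two independent exponential waiting times and reports which portfolio is hit. It can be used to check empirically that the waiting time has rate $λ^{(1)} + λ^{(2)}$ and that the first portfolio is hit with relative frequency close to $λ^{(1)} / (λ^{(1)} + λ^{(2)})$ .

```python
import random

def next_jump(lam1, lam2, rng):
    """Return (waiting time, index of the jumping component) for two independent Poisson streams."""
    w1 = rng.expovariate(lam1)  # next claim time in portfolio 1
    w2 = rng.expovariate(lam2)  # next claim time in portfolio 2
    return (w1, 1) if w1 < w2 else (w2, 2)

rng = random.Random(0)
draws = [next_jump(1.0, 3.0, rng) for _ in range(20000)]
mean_wait = sum(w for w, _ in draws) / len(draws)        # should be near 1 / (1 + 3)
share_first = sum(1 for _, j in draws if j == 1) / len(draws)  # should be near 1 / (1 + 3)
```

Equivalently, one could draw a single exponential waiting time with rate $λ^{(1)} + λ^{(2)}$ and then select the component at random with the two weights; the version above is chosen because it mirrors the verbal description.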

A quantity of interest in this model is again the expected value of discounted future dividend payments until the time of ruin of one of the portfolios,

V (x^{(1)}, x^{(2)}) = E_{x^{(1)}, x^{(2)}} (\int_{0}^{τ} e^{- δ t} (c^{(1)} + c^{(2)}) 1_{E_{4}} (X_{t}^{(1)}, X_{t}^{(2)}) d t),

(3)

for $x^{(1)} \leq b^{(1)}$ , $x^{(2)} \leq b^{(2)}$ , with $τ = inf {t \geq 0 : (X_{t}^{(1)}, X_{t}^{(2)}) \in E_{5}}$ , and $δ > 0$ being a preference-based discount factor.

3. Iterated integrals and a fixed point approach

In this section we derive a method for numerical approximation of the quantities of interest appearing in the models introduced in the previous section. We rewrite the quantity of interest as a sum of integrals with fixed dimension and an error term that goes to zero exponentially fast with increasing dimension of the integral. This allows for the use of deterministic integration rules. The starting point for the derivation of this integral representation is the observation that the quantity of interest is a fixed point of a certain integral operator associated to the PDMP.

Definition 3.1

Suppose there exists a set $K^{c} \subseteq K$ such that for all $k \in K^{c}$ it holds that $λ_{k} (x) = 0$ , and $ϕ_{k} (x, t) = x$ for all $x \in E_{k}$ and all $t \in R$ . We call $E^{c} := ⋃_{k \in K^{c}} E_{k}$ a cemetery of the PDMP.

Definition 3.2

Let a PDMP be given and let $E^{c} \neq \emptyset$ be a cemetery of the PDMP. A running reward function $ℓ : E \to R$ is a measurable function satisfying $ℓ |_{E^{c}} \equiv 0$ . A terminal cost function $Ψ : E^{c} \to R$ is a measurable function satisfying $Ψ |_{E ∖ E^{c}} \equiv 0$ . The cost functional $V : E \to R$ corresponding to $E^{c}, ℓ, Ψ$ is defined by

$V (x) = E_{x} (\int_{0}^{τ} e^{- δ t} ℓ (X_{t}) d t + e^{- δ τ} Ψ (X_{τ})),$ (4)

where $τ = inf {t \geq 0 : X_{t} \in E^{c}}$ .

Let $T_{1}$ be the first jump time. Equation (4) can be rewritten as follows,

\begin{aligned} V (x) = & E_{x} [(\int_{0}^{T_{1}} e^{- δ t} ℓ (ϕ (x, t)) d t + \int_{T_{1}}^{τ} e^{- δ t} ℓ (ϕ (X_{T_{1}}, t - T_{1})) d t + e^{- δ τ} Ψ (X_{τ})) 1_{{T_{1} < τ}} \\ + (\int_{0}^{τ} e^{- δ t} ℓ (ϕ (x, t)) d t + e^{- δ τ} Ψ (ϕ (x, τ))) 1_{{τ < T_{1}}} \\ + (\int_{0}^{T_{1}} e^{- δ t} ℓ (ϕ (x, t)) d t + e^{- δ T_{1}} Ψ (X_{T_{1}})) 1_{{T_{1} = τ}}] . \end{aligned}

Since X is a PDMP and hence a strong Markov process, this yields $V = H + G V$ with $H : E \to R$ , $G : C^{2} (E, R) \to R$ defined by

\begin{aligned} H (x) & = E_{x} [(\int_{0}^{T_{1}} e^{- δ t} ℓ (ϕ (x, t)) d t) 1_{{T_{1} < τ}} \\ + (\int_{0}^{τ} e^{- δ t} ℓ (ϕ (x, t)) d t + e^{- δ τ} Ψ (ϕ (x, τ))) 1_{{τ < T_{1}}} \\ + (\int_{0}^{T_{1}} e^{- δ t} ℓ (ϕ (x, t)) d t + e^{- δ T_{1}} Ψ (X_{T_{1}})) 1_{{T_{1} = τ}}], \\ G V (x) & = E_{x} [e^{- δ T_{1}} V (X_{T_{1}}) 1_{{T_{1} < τ}}] . \end{aligned}

(5)

Recall that for every $t \geq 0$ it holds that $P_{x} (T_{1} > t) = \exp (- \int_{0}^{t} λ (ϕ (x, s)) d s) =: 1 - F_{W} (t, x)$ and denote the corresponding density by $f_{W}$ . With this, the function $H$ and the operator $G$ admit representations as integrals,

\begin{aligned} H (x) & = \int_{0}^{t^{*} (x)} f_{W} (t, x) [\int_{0}^{t} e^{- δ s} ℓ (ϕ (x, s)) d s + e^{- δ t} \int_{E^{c}} Ψ (y) Q (d y, ϕ (x, t))] d t \\ + (1 - F_{W} (t^{*} (x), x)) [\int_{0}^{t^{*} (x)} e^{- δ s} ℓ (ϕ (x, s)) d s + e^{- δ t^{*} (x)} Ψ (ϕ (x, t^{*} (x)))], \\ G V (x) & = \int_{0}^{t^{*} (x)} f_{W} (t, x) e^{- δ t} \int_{E} V (y) Q (d y, ϕ (x, t)) d t . \end{aligned}

Note that $H (x)$ corresponds to the expected discounted rewards collected before the first jump at time $T_{1}$ when starting in x. $G V (x)$ represents the expected discounted rewards from time $T_{1}$ onwards conditional on the event ${X_{T_{1}} \notin E^{c}, X_{0} = x}$ . Iterating the above steps $n \in N$ times leads to

V (x) = G^{n} V (x) + \sum_{i = 0}^{n - 1} G^{i} H (x) .

(6)

Lemma 3.3

Let $Ψ : E^{c} \to R$ and $ℓ : E \to R$ be bounded, for all $k \in K$ assume that the functions $λ_{k}$ are bounded by $C_{λ} \in (0, \infty),$ and for all $x \in E$ let $t^{*} (x) = \infty$ . Then for all $x \in E$ and for all $n \in N$ it holds that $| G^{n} V (x) | \leq C_{V} (C_{λ} / (C_{λ} + δ))^{n}$ , where $C_{V} = (∥ ℓ ∥_{\infty} / δ) + ∥ Ψ ∥_{\infty}$ , and, in particular, it holds that $lim_{n \to \infty} G^{n} V (x) = 0$ uniformly in $x \in E$ .

Proof.

The boundedness of ℓ and Ψ implies that also V is bounded by $C_{V} = (∥ ℓ ∥_{\infty} / δ) + ∥ Ψ ∥_{\infty}$ . Using the strong Markov property and Equation (5) we have by induction on n,

$\begin{aligned} G^{n} V (x) & = E_{x} [e^{- δ T_{1}} G^{n - 1} V (X_{T_{1}}) 1_{{T_{1} < τ}}] \\ = E_{x} [e^{- δ T_{1}} E_{X_{T_{1}}} [e^{- δ (T_{n} - T_{1})} V (X_{T_{n}}) 1_{{T_{n} < τ}}] 1_{{T_{1} < τ}}] \\ = E_{x} [E_{X_{T_{1}}} [e^{- δ T_{n}} V (X_{T_{n}}) 1_{{T_{n} < τ}} 1_{{T_{1} < τ}}]] \\ = E_{x} [e^{- δ T_{n}} V (X_{T_{n}}) 1_{{τ > T_{n}}}], \end{aligned}$ (7)

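where we used $1_{{T_{n} < τ}} 1_{{T_{1} < τ}} = 1_{{T_{n} < τ}}$ in the last equality. Recall that $P (T_{n} - T_{n - 1} > t | T_{n - 1}, X_{T_{n - 1}}) = \exp (- \int_{0}^{t} λ (ϕ (X_{T_{n - 1}}, s)) d s) \geq \exp (- t C_{λ})$ . For every $n \in N$ let $Z_{n} \sim Erlang (n, C_{λ})$ be an Erlang-distributed random variable, i.e. the sum of n independent exponential random variables with rate $C_{λ}$ . By the above bound, $T_{n}$ stochastically dominates $Z_{n}$ , so that $E_{x} [e^{- δ T_{n}}] \leq E [e^{- δ Z_{n}}]$ . Combining this with (7) we get that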

$|G^{n} V (x)| \leq C_{V} E_{x} [e^{- δ T_{n}}] \leq C_{V} E [e^{- δ Z_{n}}] = C_{V} {(\frac{C_{λ}}{C_{λ} + δ})}^{n} .$

The latter expression converges to zero as $n \to \infty$ uniformly in $x \in E$ .

Combining Lemma 3.3 with (6) results in the error estimate

|V (x) - \sum_{i = 0}^{n - 1} G^{i} H (x)| \leq C_{V} {(\frac{C_{λ}}{C_{λ} + δ})}^{n} .

(8)
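The bound (8) can be turned into a rule for choosing the number of iterations. Solving $C_{V} (C_{λ} / (C_{λ} + δ))^{n} \leq ε$ for n gives $n \geq \log (ε / C_{V}) / \log (C_{λ} / (C_{λ} + δ))$ . The short Python sketch below evaluates this; the function name and the tolerance parameter `eps` are illustrative assumptions.

```python
import math

def iterations_needed(eps, c_v, c_lam, delta):
    """Smallest n with c_v * (c_lam / (c_lam + delta))**n <= eps."""
    if eps >= c_v:
        return 0
    q = c_lam / (c_lam + delta)  # contraction factor from Lemma 3.3, 0 < q < 1
    return math.ceil(math.log(eps / c_v) / math.log(q))
```

For example, with `c_v = 1`, `c_lam = 1` and `delta = 1` the factor is 1/2, so a tolerance of 0.1 requires four iterations. When δ is small relative to $C_{λ}$ the factor is close to one and the required n, and hence the dimension of the integrals, grows accordingly.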

Finally, we obtain the following representation,

\begin{aligned} G^{i - 1} H (x_{0}) & = \int_{t_{1} = 0}^{t^{*} (x_{0})} f_{W} (t_{1}, x_{0}) e^{- δ t_{1}} \int_{x_{1} \in E} \int_{t_{2} = 0}^{t^{*} (x_{1})} f_{W} (t_{2}, x_{1}) e^{- δ t_{2}} \int_{x_{2} \in E} \dots \\ & \quad \times \int_{t_{i - 1} = 0}^{t^{*} (x_{i - 2})} f_{W} (t_{i - 1}, x_{i - 2}) e^{- δ t_{i - 1}} \\ & \quad \int_{x_{i - 1} \in E} H (x_{i - 1}) Q (d x_{i - 1}, ϕ (x_{i - 2}, t_{i - 1})) d t_{i - 1} \dots Q (d x_{1}, ϕ (x_{0}, t_{1})) d t_{1} \\ & = \int_{t_{1} = 0}^{t^{*} (x_{0})} \int_{x_{1} \in E} \dots \int_{t_{i - 1} = 0}^{t^{*} (x_{i - 2})} \int_{x_{i - 1} \in E} (\prod_{j = 1}^{i - 1} f_{W} (t_{j}, x_{j - 1}) e^{- δ t_{j}}) \\ & \quad H (x_{i - 1}) Q (d x_{i - 1}, ϕ (x_{i - 2}, t_{i - 1})) d t_{i - 1} \dots Q (d x_{1}, ϕ (x_{0}, t_{1})) d t_{1} . \end{aligned}

(9)

In (9) we denote by ${t_{j}}_{j \in {1, \dots, i - 1}}$ the family of inter-jump times and by ${x_{j}}_{j \in {1, \dots, i - 1}}$ the family of post-jump locations.

Remark 3.4

Solving the integral $G^{i - 1} H (x_{0})$ brings several advantages compared to a crude Monte Carlo approach. First, (9) is an integral with a fixed dimension. Hence, it can be approximated using deterministic integration rules like QMC, for which deterministic error bounds are available. Second, the bias of restricting oneself to a fixed number of jumps can be estimated uniformly in $x = x_{0}$ using the bias estimate in Lemma 3.3. Third, rare events like surviving a large number of jumps are – in this formulation – not rare in the sense that it is unlikely to draw such a realisation, which has the effect of importance sampling.

4. Cubature rules for $C^{κ}$ -functions

In order to obtain convergence estimates for numerical integration methods such as QMC methods or other cubature rules, we need more regularity of the integrands than they admit in many practical applications. For example, we may need to bound a certain norm of the Hessian matrix of the integrand. In Section 5, we will rewrite the problem introduced in Section 3 so that the integrand is a function $f : [0, 1]^{d} \to R$ which satisfies $f \in C^{2} ([0, 1]^{d})$ , or more generally $f \in C^{κ} ([0, 1]^{d})$ for some $κ \in N$ . We outline two different methods for treating such integrands f by cubature rules.

4.1. QMC methods

QMC methods are equal-weight cubature rules with M deterministically chosen integration nodes. Let the integrand $f : [0, 1]^{d} \to R$ satisfy $f \in C^{2} ([0, 1]^{d})$ . In order to obtain a convergence estimate for numerical integration of f using QMC, we require a so-called Koksma–Hlawka type inequality. The original Koksma–Hlawka inequality bounds the integration error of a QMC rule by the product of the variation of the integrand (in the sense of Hardy and Krause) and the so-called discrepancy of the integration node set (see, e.g. Niederreiter (1992, Chapter 2)). We remark, however, that we cannot easily apply the classical Koksma–Hlawka inequality in this paper, as we cannot rely on the integrands to have bounded variation in the sense of Hardy and Krause. Hence, we are going to resort to a variant of the Koksma–Hlawka inequality which was recently proven in Pausinger & Svane (2015). Let $Q_{M, d} (f) = 1 / M \sum_{j = 1}^{M} f (x_{j})$ be a QMC rule using M integration nodes $x_{1}, \dots, x_{M} \in [0, 1)^{d}$ . Then by Pausinger & Svane (2015, Theorem 3.12) we have

|\int_{[0, 1]^{d}} f (x) d x - Q_{M, d} (f)| \leq (sup_{x \in [0, 1]^{d}} f (x) - inf_{x \in [0, 1]^{d}} f (x) + \frac{d}{16} M (f)) {Disc}_{I} (x_{1}, \dots, x_{M}),

(10)

where $M (f) = sup_{x \in [0, 1]^{d}} ∥Hess (f, x)∥$ , $Hess (f, x)$ is the Hessian matrix of f at $x$ , $∥\cdot∥$ denotes the usual operator norm, and where ${Disc}_{I} (x_{1}, \dots, x_{M})$ is the isotropic discrepancy of the integration node set,

{Disc}_{I} (x_{1}, \dots, x_{M}) = sup_{\begin{matrix} C \subseteq [0, 1]^{d} \\ C convex \end{matrix}} |\frac{1}{M} \sum_{j = 1}^{M} 1_{{x_{j} \in C}} - μ_{d} (C)|,

where $μ_{d}$ denotes the Lebesgue measure on $R^{d}$ . Now let $x_{1}, \dots, x_{M} \in [0, 1]^{d}$ . In Niederreiter (1992, Chapter 2) it is shown that

{Disc}_{I} (x_{1}, \dots, x_{M}) \leq 8 d {({Disc}_{*} (x_{1}, \dots, x_{M}))}^{1 / d},

where by ${Disc}_{*} (x_{1}, \dots, x_{M})$ we denote the star discrepancy of $x_{1}, \dots, x_{M}$ , defined as

{Disc}_{*} (x_{1}, \dots, x_{M}) = sup_{a \in [0, 1)^{d}} |\frac{1}{M} \sum_{j = 1}^{M} 1_{{x_{j} \in [0, a)}} - μ_{d} ([0, a))|,

where $[0, a)$ denotes $[0, a_{1}) \times \dots \times [0, a_{d})$ for $a = (a_{1}, \dots, a_{d})$ . It is well known that common point sequences that are employed in QMC methods, such as Sobol' sequences or Halton sequences, have a star discrepancy of order $(\log M)^{d} / M$ (and it is known that this order can, if at all, only be improved with respect to the exponent of the log-term). Hence, by using, e.g. Sobol' points in a QMC method for numerically integrating a $C^{2}$ -function, we cannot expect an error that converges to zero faster than $(\log M) / M^{1 / d}$ .

As we shall see below, this order of magnitude can, with respect to the disadvantageous dependence on d, not be improved further for $C^{2}$ -functions. However, there is room for improvement if we make additional smoothness assumptions on the integrand.
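To make the equal-weight rule $Q_{M, d}$ concrete, the following Python sketch applies it with the first M points of a two-dimensional Halton sequence to a smooth test integrand whose integral is known in closed form. The test function, the bases (2, 3) and the values of M are assumptions chosen for illustration only; the sketch does not evaluate the isotropic discrepancy appearing in (10).

```python
import math


def radical_inverse(n, base):
    """Van der Corput radical inverse of the integer n >= 0 in the given base."""
    inv, scale = 0.0, 1.0 / base
    while n > 0:
        n, digit = divmod(n, base)
        inv += digit * scale
        scale /= base
    return inv


def halton(m, bases):
    """First m points of the Halton sequence in dimension len(bases)."""
    return [tuple(radical_inverse(n, b) for b in bases) for n in range(m)]


def qmc_rule(f, points):
    """Equal-weight rule Q_{M,d}(f) = (1/M) * sum_j f(x_j)."""
    return sum(f(x) for x in points) / len(points)


def f(x):
    # Smooth (C^2) test integrand on [0,1]^2; an illustrative assumption.
    return math.cos(x[0]) * math.cos(x[1])


exact = math.sin(1.0) ** 2  # integral of cos(x1)*cos(x2) over the unit square
for m in (64, 256, 1024):
    err = abs(qmc_rule(f, halton(m, (2, 3))) - exact)
    print(m, err)
```

For this particular integrand the observed errors are typically far smaller than what the worst-case bound built on the isotropic discrepancy suggests. Scrambled Sobol' or Halton generators, as provided for instance by SciPy's `scipy.stats.qmc` module, could be substituted for the hand-written Halton points.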

4.2. Product rules

In Hinrichs et al. (2017) it is shown that, by using products of Gauss rules, one can obtain the following result. Let $f : [0, 1]^{d} \to R$ be such that $f \in C^{κ}$ for some $κ \in N$ . Then, by using a product rule $Q_{G, \tilde{M}, d}$ of $\tilde{M}$ -point Gauss quadrature rules, one obtains

|\int_{[0, 1]^{d}} f (x) d x - Q_{G, \tilde{M}, d} (f)| \leq c_{κ} d {\tilde{M}}^{- κ} {∥f∥}_{C^{κ}}, for \tilde{M} \geq κ + 1,

(11)

where $c_{κ} = (π / 2) (e / (6 \sqrt{3}))^{κ}$ , and where

{∥f∥}_{C^{κ}} = max_{\begin{matrix} β \in N_{0}^{d} \\ {∥β∥}_{1} \leq κ \end{matrix}} {∥D^{β} (f)∥}_{L_{\infty}},

where $D^{β}$ denotes the (weak) partial derivative of order $β$ for $β \in N_{0}^{d}$ . A d-fold Gauss product rule as described above uses $M = {\tilde{M}}^{d}$ points in total, and hence yields a convergence order of $M^{- κ / d}$ . It is known due to Bakhvalov (1959) that this convergence order is best possible. For the special case $κ = 2$ , we only obtain a relatively small improvement over the bound implied by (10). However, there is an additional advantage in the bound (11). By requiring that the function f satisfies additional smoothness assumptions, namely that $f \in C^{κ}$ for some $κ \in N$ which is possibly larger than 2, we obtain an improved convergence rate. Hence, we face a trade-off between imposing a higher degree of smoothness on the integrand f to obtain a higher accuracy in the quadrature rule, and the error we make by smoothing the integrand to that extent. It is therefore likely that the method needs to be fine-tuned on a case-by-case basis. In practice, product rules often cannot be applied since, for example, integrating a d=1024-variate integrand using only $\tilde{M} = 2$ integration nodes per coordinate requires a point set consisting of $M = 2^{1024}$ integration nodes. To overcome the latter problem, it might be useful to apply the theory of weighted integration as introduced in Sloan & Woźniakowski (1998), possibly combined with truncation (see, e.g. Kritzer et al. (2016)) or multivariate decomposition methods (see, e.g. Kuo (2010)). A detailed analysis of these approaches applied to the present problem is left open for future research.
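The growth of the node count $M = {\tilde{M}}^{d}$ can be seen in a small Python sketch of a d-fold product of the two-point Gauss–Legendre rule on $[0, 1]$ . The names `product_rule` and `total_points` are hypothetical, and the polynomial integrand is an assumption chosen because the two-point rule integrates it exactly.

```python
import itertools
import math

# Two-point Gauss-Legendre rule mapped from [-1, 1] to [0, 1].
GAUSS2_NODES = (0.5 - 0.5 / math.sqrt(3.0), 0.5 + 0.5 / math.sqrt(3.0))
GAUSS2_WEIGHTS = (0.5, 0.5)


def product_rule(f, d, nodes=GAUSS2_NODES, weights=GAUSS2_WEIGHTS):
    """d-fold tensor product of a one-dimensional rule on [0, 1]."""
    total = 0.0
    for idx in itertools.product(range(len(nodes)), repeat=d):
        w = math.prod(weights[i] for i in idx)
        total += w * f(tuple(nodes[i] for i in idx))
    return total


def total_points(m_tilde, d):
    """Number of nodes M = m_tilde**d used by the product rule."""
    return m_tilde ** d


# The two-point rule is exact for polynomials of degree <= 3 per coordinate.
approx = product_rule(lambda x: x[0] ** 3 * x[1] ** 2, d=2)
print(approx, 1.0 / 12.0)

# Node count for d = 1024 with two nodes per coordinate, shown via its digit count.
print(len(str(total_points(2, 1024))))
```

The loop over `itertools.product` visits every one of the ${\tilde{M}}^{d}$ nodes, which is precisely what makes the approach infeasible for large d.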

5. Smoothing of the integrand

The integrand in (9) is not necessarily a $C^{κ}$ -function. Therefore, in this section we provide a technique for smoothing the integrand in order to apply convergence results for integration rules that are described in Section 4.

The piecewise construction of the process described in Definition 2.4 leads to the situation that $X_{t} = ϕ (X_{T_{j - 1}}, t - T_{j - 1})$ for $t \in [T_{j - 1}, T_{j})$ is a function of $X_{T_{j - 1}}$ and $T_{j - 1}$ . In particular, all subsequent pre-jump locations depend on all previous post-jump locations and jump times, via ϕ and λ. Consequently, regularity of the integrand depends on regularity of the flow ϕ and the intensity function λ. The analysis in this section is restricted to the case where the flow originates from autonomous ODEs, i.e. for all $k \in K$ there exist Lipschitz continuous functions $g_{k} : R^{d (k)} \to R^{d (k)}$ such that $(\partial / \partial t) ϕ_{k} (y, t) = g_{k} (ϕ_{k} (y, t))$ . General results from the literature on ODEs, see, e.g. Grigorian (2009), yield that the derivatives $(\partial / \partial y) ϕ_{k}, (\partial^{2} / \partial y^{2}) ϕ_{k}, (\partial / \partial t) ϕ_{k}$ can be described by so-called associated first- and second-order variational equations for which one requires $g_{k}$ to be a $C^{2}$ -function.

For the density $f_{W}$ of the inter-jump times to be $C^{2}$ we need that $λ \in C^{2} (E, R)$ . Also we need $ℓ \in C_{b}^{2} (E, R)$ , and $Ψ \in C_{b}^{2} (E, R)$ since they appear in the integral defining $H$ .

A serious problem with respect to smoothness arises if the PDMP model allows for jumps from the active boundary. Suppose $(k, y) \in E$ and $t^{*} (k, y) < \infty$ . Then, conditional on $X_{t} = (k, y)$ , the time of the next jump is distributed as $min (T, t^{*} (k, y))$ , where T has distribution function $F_{T} (t) = 1 - \exp (- \int_{0}^{t} λ_{k} (ϕ_{k} (y, s)) d s)$ . But in general neither $t^{*} (k, y)$ nor $min (T, t^{*} (k, y))$ will depend smoothly on y, even if $λ_{k}$ has arbitrarily high regularity. We are not aware of a general remedy for this problem. However, for all PDMP models put forward in Section 2.1, the jumps from the active boundary do not constitute jumps of the original problem. In the following subsection we describe by example how PDMPs can be approximated by PDMPs that do not allow for jumps from the boundary.

Concerning the jump kernel Q, it is hard to state general sufficient regularity conditions. An exemplary favourable situation arises if the jump kernel Q admits a $C^{2}$ -density $f_{Y}$ in the sense that $Q (A, x) = \int_{A} f_{Y} (x_{1}, x) d x_{1}$ for all $A \in E$ and all $x \in E$ . In the one-dimensional examples from risk theory in Sections 2.1.1–2.1.4, this condition is satisfied and for the two-dimensional example in Section 2.1.5 we present a smoothing technique in Section 5.2.

5.1. Smoothing of the flow

Consider the example from Section 2.1.4 without dividend barrier. We can describe the problem alternatively with a state space consisting of three components:

$K = {1, 2, 3}$ ,
$E_{1} = (- (c / ρ), \infty)$ , $E_{2} = (- \infty, - (c / ρ))$ , $E_{3} = {- (c / ρ)}$ ,
$ϕ_{1}$ is determined by an autonomous ODE of the form $g_{1} : R \to R$ ,
$\begin{aligned} g_{1} (y) & = \begin{cases} c, & if y \in (0, \infty), \\ c + ρ y, & if y \in (- \frac{c}{ρ}, 0], \\ 0, & if y \in (- \infty, - \frac{c}{ρ}], \end{cases} \end{aligned}$ (12)
for some $c > 0, ρ > 0$ . The function $ϕ_{2}$ is given by $ϕ_{2} (y, t) = y$ $\forall y \in E_{2}$ and $\forall t \in R$ , and $ϕ_{3}$ by $ϕ_{3} (y, t) = y$ $\forall y \in E_{3}$ and $\forall t \in R$ ,
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{2} (y) = 0$ $\forall y \in E_{2}$ , $λ_{3} (y) = 0$ $\forall y \in E_{3}$ .
For $B = ({1} \times B_{1}) \cup ({2} \times B_{2}) \cup ({3} \times B_{3}) \in E$ ,
$\begin{aligned} Q (B, (1, y)) & = P (Y \in y - B_{1}) + P (Y \in y - B_{2}) + P (Y \in y - B_{3}) & (for y \in E_{1}), \\ Q (B, (2, y)) & = P (Y \in y - B_{2}) & (for y \in E_{2}), \\ Q (B, (3, y)) & = P (Y \in y - B_{2}) + P (Y \in y - B_{3}) & (for y \in E_{3}) . \end{aligned}$

Here, $g_{1}$ is not differentiable at 0. However, we may smoothen this kink (a discontinuity of the derivative) using a ‘smoothened Heaviside function’. Note that $Γ^{*} = \emptyset$ .

Definition 5.1

Let $κ \in N \cup {0}$ . We call a function $h : R \to [0, 1]$ a $C^{κ}$ -Heaviside function, if

$h (y) = 0$ for y<−1,

$h (y) = 1$ for y>1,

h is non-decreasing,

$h (y) + h (- y) = 1$ ,

h is κ-times continuously differentiable.

Lemma 5.2

Let $κ \in N \cup {0},$ and let $f : R \to R$ be a piecewise $C^{κ}$ -function with discontinuity in $ξ \in R,$ i.e. let there exist $C^{κ}$ -functions $f_{1}, f_{2} : R \to R$ with $f = f_{1}$ on $(- \infty, ξ)$ and $f = f_{2}$ on $(ξ, \infty)$ . Let h be a $C^{κ}$ -Heaviside function. For every $ϵ > 0$ define $f^{ϵ} : R \to R$ by $f^{ϵ} (y) = f_{1} (y) h (- \frac{y - ξ}{ϵ}) + f_{2} (y) h (\frac{y - ξ}{ϵ})$ . Then,

$f^{ϵ} \in C^{κ}$ for every $ϵ > 0,$

$f^{ϵ} |_{R ∖ (ξ - ϵ, ξ + ϵ)} = f |_{R ∖ (ξ - ϵ, ξ + ϵ)}$ for every $ϵ > 0,$

for all $y \in R ∖ {ξ}$ it holds that $lim_{ϵ \to 0 +} f^{ϵ} (y) = f (y),$

for all $δ > 0$ it holds that $lim_{ϵ \to 0 +} sup_{y \in R ∖ (ξ - δ, ξ + δ)} | f^{ϵ} (y) - f (y) | = 0$ .

Proof.

The elementary proof is left to the reader.

Besides the symmetric smoothing $f^{ϵ}$ , there are further possible choices: from the left $f^{ϵ -} (y) = f_{1} (y) h (- \frac{y - ξ + ϵ}{ϵ}) + f_{2} (y) h (\frac{y - ξ + ϵ}{ϵ})$ and from the right $f^{ϵ +} (y) = f_{1} (y) h (- \frac{y - ξ - ϵ}{ϵ}) + f_{2} (y) h (\frac{y - ξ - ϵ}{ϵ})$ . Figure 2 depicts these three possible smoothings for a function with a discontinuity in $ξ = 1$ . A concrete example for a function h that satisfies the above requirements is given by

h (y) = \begin{cases} 0 & if y < - 1, \\ \frac{1}{2} + \frac{15 y}{16} - \frac{5 y^{3}}{8} + \frac{3 y^{5}}{16} & if y \in [- 1, 1], \\ 1 & if y > 1. \end{cases}

(13)

For every $ϵ > 0$ , a smoothed version of the function $g_{1}$ defined in (12) is given by

g_{1}^{ϵ} (y) = (c + ρ y) h (- \frac{y}{ϵ}) + c h (\frac{y}{ϵ}) .
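The construction can be sketched in Python as follows. The function `h` implements (13); the helper `smooth` is a hypothetical name, and its blending weights follow the formula for $g_{1}^{ϵ}$ above, i.e. the piece valid to the left of $ξ$ is multiplied by $h (- (y - ξ) / ϵ)$ and the piece valid to the right by $h ((y - ξ) / ϵ)$ . The values of c, ρ and ϵ are illustrative assumptions.

```python
def h(y):
    """C^2-Heaviside function from (13)."""
    if y < -1.0:
        return 0.0
    if y > 1.0:
        return 1.0
    return 0.5 + 15.0 * y / 16.0 - 5.0 * y ** 3 / 8.0 + 3.0 * y ** 5 / 16.0


def smooth(f_left, f_right, xi, eps):
    """Blend f_left (valid left of xi) and f_right (valid right of xi)
    on the interval (xi - eps, xi + eps)."""
    def f_eps(y):
        s = (y - xi) / eps
        return f_left(y) * h(-s) + f_right(y) * h(s)
    return f_eps


# Illustrative parameter values (assumptions, not taken from the paper).
c, rho, eps = 1.0, 0.1, 0.05
g1_eps = smooth(lambda y: c + rho * y, lambda y: c, xi=0.0, eps=eps)

for y in (-0.2, -0.025, 0.0, 0.025, 0.2):
    print(y, g1_eps(y))
```

Outside $(- ϵ, ϵ)$ the smoothed drift coincides with the original one, so only the behaviour in an ϵ-neighbourhood of the kink is altered.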

We can finally formulate a PDMP corresponding to the new model, where the flow has been smoothened,

$K = {1, 2, 3}$ ,
$E_{1} = (- (c / ρ), \infty)$ , $E_{2} = (- \infty, - (c / ρ))$ , $E_{3} = {- (c / ρ)}$ ,
$(\partial / \partial t) ϕ_{1}^{ϵ} (y, t) = g_{1}^{ϵ} (ϕ_{1}^{ϵ} (y, t))$ $\forall y \in E_{1}$ and $\forall t \in R$ , $ϕ_{k} (y, t) = y$ $\forall y \in E_{k}$ and $\forall t \in R$ , $k \in {2, 3}$ ;
$λ_{1} (y) = λ_{N}$ $\forall y \in E_{1}$ , $λ_{k} (y) = 0$ $\forall y \in E_{k}$ , $k \in {2, 3}$ ;
for $B = ({1} \times B_{1}) \cup ({2} \times B_{2}) \cup ({3} \times B_{3}) \in E$ ,
$\begin{aligned} Q (B, (1, y)) & = P (Y \in y - B_{1}) + P (Y \in y - B_{2}) + P (Y \in y - B_{3}) & (for y \in E_{1}), \\ Q (B, (2, y)) & = P (Y \in y - B_{2}) & (for y \in E_{2}), \\ Q (B, (3, y)) & = P (Y \in y - B_{2}) + P (Y \in y - B_{3}) & (for y \in E_{3}) . \end{aligned}$

Figure 2. — Illustration of smoothing a piecewise $C^{2}$ -function with a discontinuity in $ξ = 1$ .

Note that $Γ^{*} = \emptyset$ . Since the dividend barrier b is never reached, we also have to smoothen the reward function in a way that the region where dividends are paid can be reached, i.e. $ℓ^{ϵ} (y) = c h (\frac{y - b + ϵ}{ϵ})$ . We will show convergence of the corresponding value functions in Section 6.

5.2. Smoothing of jump measures

We give an example for a class of jump measures that can be approximated by measures leading to $C^{2}$ -integrands in (9).

Let $(E, E)$ be the state space of a PDMP and let $(ϕ, λ, Q)$ be its local characteristics. Let the probability kernel Q satisfy the following assumption.

Assumption 5.3

We assume that

there exists a positive integer n such that for every $k \in K$ , and every $y \in E_{k}$ , there exist sets $B_{1} (k, y), \dots, B_{n} (k, y)$ such that

for every $j \in {1, \dots, n}$ there exists $k_{1} \in K$ such that $B_{j} (k, y) \subseteq E_{k_{1}}$ ,

for every $j \in {1, \dots, n}$ it holds that ${(\bar{y}, z) : \bar{y} \in E_{k}, z \in B_{j} ((k, \bar{y}))}$ is a connected $C^{2}$ -manifold,

for every $k \in K$ and every $j \in {1, \dots, n}$ the mapping from $E_{k}$ to $R$ , $\bar{y} \mapsto Q (B_{j} ((k, \bar{y})), x)$ is $C^{2}$ ,

for all $x \in E$ it holds that $\sum_{j = 1}^{n} Q (B_{j} (x), x) = 1$ ,

for every $x \in E$ and every $j \in {1, \dots, n}$ there exists a $C^{2}$ -mapping $G_{j, x} : [0, 1]^{\dim (B_{j})} \to B_{j}$ such that for all $A \in E$ it holds that
$Q (A \cap B_{j}, x) = μ_{\dim (B_{j})} (G_{j, x}^{- 1} (A \cap B_{j})) Q (B_{j}, x),$
where $μ_{m}$ denotes the m-dimensional Lebesgue measure,

for every $k \in K$ and every $j \in {1, \dots, n}$ the mapping from $E_{k} \times [0, 1]^{\dim (B_{j})}$ to $⋃_{l \in K} E_{l}$ , $(y, u) \mapsto G_{j, (k, y)} (u)$ is $C^{2}$ .

Note that Assumption 5.3(1) implies that, for every $x \in E$ , $B_{j} (x)$ is a $C^{2}$ -manifold, and that for all $x_{1} = (k_{1}, y_{1}), x_{2} = (k_{2}, y_{2}) \in E$ with $k_{1} = k_{2}$ we have $\dim B_{j} (x_{1}) = \dim B_{j} (x_{2})$ .

Under Assumption 5.3 we have for $x \in E$ and for $f \in C_{b}^{2} (E, R)$ that

\int_{E} f (y) Q (d y, x) = \sum_{j = 1}^{n} p_{j} (x) \int_{[0, 1]^{\dim (B_{j} (x))}} f (G_{j, x} (u)) d u,

where $p_{j} (x) = Q (B_{j} (x), x)$ for all $x \in E$ . For the integral in (9) this implies that we have iterated sums for each jump, which increases the complexity for large numbers of jumps. Instead, we may write the sum as an integral over $[0, 1]$ ,

\int_{E} f (y) Q (d y, x) = \int_{0}^{1} \sum_{j = 1}^{n} 1_{[q_{j - 1} (x), q_{j} (x))} (u_{0}) \int_{[0, 1]^{\dim (B_{j} (x))}} f (G_{j, x} (u)) d u d u_{0},

where $q_{0} (x) = 0$ and $q_{j} (x) = p_{1} (x) + \dots + p_{j} (x)$ . However, with this ‘trick’ we have lost the property of the integrand being $C^{2}$ . So, using again our smoothened Heaviside function $h : R \to [0, 1]$ , we can smoothen the indicator functions,

\begin{aligned} \int_{E} f (y) Q^{ϵ} (d y, x) \\ = \int_{0}^{1} \sum_{j = 1}^{n} (h (\frac{u_{0} - q_{j - 1} (x)}{ϵ}) + h (\frac{q_{j} (x) - u_{0}}{ϵ}) - 1) \int_{[0, 1]^{\dim (B_{j} (x))}} f (G_{j, x} (u)) d u d u_{0} \\ = \int_{0}^{1} \int_{[0, 1]^{\dim (B_{j} (x))}} \sum_{j = 1}^{n} (h (\frac{u_{0} - q_{j - 1} (x)}{ϵ}) + h (\frac{q_{j} (x) - u_{0}}{ϵ}) - 1) f (G_{j, x} (u_{1}, \dots, u_{\dim (B_{j} (x))})) d u d u_{0} . \end{aligned}

This expression, considered as a function of x, is $C^{2}$ as it is a composition of $C^{2}$ -functions.

Theorem 5.4

In the setup of this section we have for all $f \in C_{b}^{0} (E, R)$ that

$lim_{ϵ \to 0} \int_{E} f (y) Q^{ϵ} (d y, x) = \int_{E} f (y) Q (d y, x) .$

Proof.

It holds that

$\begin{aligned} | \int_{E} f (y) (Q^{ϵ} (d y, x) - Q (d y, x)) | \\ = | \sum_{j = 1}^{n} \int_{0}^{1} (h (\frac{u_{0} - q_{j - 1} (x)}{ϵ}) + h (\frac{q_{j} (x) - u_{0}}{ϵ}) - 1 - 1_{[q_{j - 1} (x), q_{j} (x))} (u_{0})) d u_{0} \\ \times \int_{[0, 1]^{\dim (B_{j} (x))}} f (G_{j, x} (u)) d u | \\ \leq \sum_{j = 1}^{n} \int_{0}^{1} | h (\frac{u_{0} - q_{j - 1} (x)}{ϵ}) + h (\frac{q_{j} (x) - u_{0}}{ϵ}) - 1 - 1_{[q_{j - 1} (x), q_{j} (x))} (u_{0}) | d u_{0} \\ \times \int_{[0, 1]^{\dim (B_{j} (x))}} | f (G_{j, x} (u)) | d u . \end{aligned}$

For our concrete example of h the first integral can be estimated by $\frac{5}{8} ϵ$ . Thus

$| \int_{E} f (y) (Q^{ϵ} (d y, x) - Q (d y, x)) | \leq \frac{5 ϵ n}{8} ∥ f ∥_{\infty},$

yielding the statement of the theorem.
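The constant $\frac{5}{8} ϵ$ can be checked numerically for the concrete h from (13). The sketch below assumes that the smoothed indicator of $[q_{j - 1}, q_{j})$ has the form $h ((u_{0} - q_{j - 1}) / ϵ) + h ((q_{j} - u_{0}) / ϵ) - 1$ , which equals 1 well inside the interval and 0 well outside, and that $q_{j} - q_{j - 1} \geq 2 ϵ$ with both transition zones contained in $[0, 1]$ . The function names and the chosen values of $q_{j - 1}$ , $q_{j}$ and ϵ are illustrative assumptions.

```python
def h(y):
    """C^2-Heaviside function from (13)."""
    if y < -1.0:
        return 0.0
    if y > 1.0:
        return 1.0
    return 0.5 + 15.0 * y / 16.0 - 5.0 * y ** 3 / 8.0 + 3.0 * y ** 5 / 16.0


def smoothed_indicator(u, q_lo, q_hi, eps):
    """Smoothed indicator of [q_lo, q_hi); assumes q_hi - q_lo >= 2 * eps."""
    return h((u - q_lo) / eps) + h((q_hi - u) / eps) - 1.0


def l1_distance(q_lo, q_hi, eps, n=100_000):
    """Midpoint-rule approximation of the integral over [0, 1] of
    |smoothed indicator - indicator|."""
    step = 1.0 / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * step
        indicator = 1.0 if q_lo <= u < q_hi else 0.0
        total += abs(smoothed_indicator(u, q_lo, q_hi, eps) - indicator)
    return total * step


eps = 0.05
print(l1_distance(0.3, 0.6, eps), 5.0 * eps / 8.0)
```

Each of the two edges of the interval contributes $2 ϵ \int_{0}^{1} (1 - h (v)) d v = \frac{5}{16} ϵ$ , which adds up to the stated $\frac{5}{8} ϵ$ .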

Now, consider the example from Section 2.1.5. Here, a jump can be either a jump in $x_{1}$ -direction or a jump in $x_{2}$ -direction, i.e.

\begin{aligned} X_{T_{j}} = \begin{cases} X_{T_{j} -} + (Y_{1}, 0) & with probability \frac{λ_{1}}{λ_{1} + λ_{2}}, \\ X_{T_{j} -} + (0, Y_{2}) & with probability \frac{λ_{2}}{λ_{1} + λ_{2}} . \end{cases} \end{aligned}

In this case we can find functions $G_{1}, G_{2} : [0, 1] \to [0, \infty)$ such that $Y_{1} \overset{d}{\sim} G_{1} (Θ_{1})$ and $Y_{2} \overset{d}{\sim} G_{2} (Θ_{2})$ for uniform random variables $Θ_{1}, Θ_{2}$ . Hence,

\begin{aligned} \int_{E} f (y) Q (d y, (x_{1}, x_{2})) \approx \int_{0}^{1} \int_{[0, 1]^{2}} & h (ϵ^{- 1} (\frac{λ_{1}}{λ_{1} + λ_{2}} - u)) f (x_{1} + G_{1} (ϑ_{1}), x_{2}) \\ + h (ϵ^{- 1} (u - \frac{λ_{1}}{λ_{1} + λ_{2}})) f (x_{1}, x_{2} + G_{2} (ϑ_{2})) d ϑ_{1} d ϑ_{2} d u . \end{aligned}

Remark 5.5

If we consider, say, i=100 in (9), then we get a very high number of terms to be summed in the integral. However, we always assume ϵ to be very small; in particular, we may assume that per jump at most two, and in most situations only one, of the terms $h (ϵ^{- 1} (u - q_{j - 1} (x))) + h (ϵ^{- 1} (q_{j} (x) - u)) - 1$ , $j \in {1, \dots, n}$ , are nonzero.

5.3. Convergence

In this section we prove a general convergence result for approximated versions of PDMPs with smoothing as above. We will exploit results on Feller processes presented in Kallenberg (2002, Chapter 19) and Ethier & Kurtz (1986, Chapters 4.2 and 4.8). For the remainder of this section we make the following assumptions:

(i) $t^{*} (x) = \infty$ for all $x \in E$ ,
(ii) $λ \in C_{b} (E, R)$ ,
(iii) for all $f \in C_{b} (E)$ the mapping $x \mapsto \int_{E} f (\bar{x}) Q (d \bar{x}, x)$ is continuous.

With this, we can utilise the following theorem.

Theorem 5.6 (Davis 1993, Theorem 27.6)

If $t^{*} (x) = \infty$ for all $x \in E,$ if $λ \in C_{b} (E, R),$ and if the mapping $x \mapsto \int_{E} f (y) Q (d y, x)$ is continuous for all $f \in C_{b} (E, R),$ then the PDMP is a Feller process.

We give an example of a class of jump kernels which comprises the jump kernels of the one-dimensional examples in Section 2.1 and which satisfies (iii).

Example 5.7

Let $E_{k} \subseteq R$ be an interval for every $k \in K$ and let $f_{Y}$ be a bounded density function on $R$ . Furthermore, let, for every $x = (k, y) \in E$ and every $A \in E$ , $Q (A, (k, y)) = \sum_{j \in K} \int_{(y - A) \cap E_{j}} f_{Y} (\bar{y}) d \bar{y}$ . Then for every $f \in C_{b} (E, R)$ it holds that

$\begin{aligned} |\int_{E} f (x) Q (d x, (k, y_{1})) - \int_{E} f (x) Q (d x, (k, y_{2}))| \\ = |\sum_{j \in K} \int_{R} 1_{E_{j}} (y_{1} - \bar{y}) f_{j} (y_{1} - \bar{y}) f_{Y} (\bar{y}) d \bar{y} - \sum_{j \in K} \int_{R} 1_{E_{j}} (y_{2} - \bar{y}) f_{j} (y_{2} - \bar{y}) f_{Y} (\bar{y}) d \bar{y}| \\ \leq \sum_{j \in K} |\int_{R} 1_{E_{j}} (y_{1} - \bar{y}) f_{j} (y_{1} - \bar{y}) f_{Y} (\bar{y}) d \bar{y} - \int_{R} 1_{E_{j}} (y_{2} - \bar{y}) f_{j} (y_{2} - \bar{y}) f_{Y} (\bar{y}) d \bar{y}| \\ \leq \sum_{j \in K} \int_{R} | 1_{E_{j}} (y_{1} - \bar{y}) f_{j} (y_{1} - \bar{y}) - 1_{E_{j}} (y_{2} - \bar{y}) f_{j} (y_{2} - \bar{y}) | f_{Y} (\bar{y}) d \bar{y} . \end{aligned}$

Since, by assumption, all $f_{j}$ are continuous and all $E_{j}$ are intervals, it holds that $| 1_{E_{j}} (y_{1} - \bar{y}) f_{j} (y_{1} - \bar{y}) - 1_{E_{j}} (y_{2} - \bar{y}) f_{j} (y_{2} - \bar{y}) |$ is bounded by $2 ∥ f_{j} ∥_{\infty}$ and goes to zero as $y_{1} \to y_{2}$ for almost all $\bar{y}$ .

Therefore, bounded convergence implies that the above sum converges to 0. From this the desired continuity follows.

The generator of X in the setup of the current section is given by

A f (x) = X f (x) + λ (x) \int_{E} (f (\bar{x}) - f (x)) Q (d \bar{x}, x), x \in E,

(14)

where for $x = (k, y) \in E$ we define $X f (x)$ by $(X f)_{k} (y) = (\partial / \partial t) f_{k} (ϕ_{k} (y, t)) |_{t = 0}$ . Note that for $f \in C_{b}^{1} (E, R)$ this means $(X f) (y) = g (y) \cdot \nabla f (y)$ . So the domain $D (A)$ of the generator consists of all functions in $C_{b} (E, R)$ which are continuously differentiable along the trajectories of the flow on all components, cf. Ethier & Kurtz (1986, p. 8), and $C_{b}^{1} (E, R) \subseteq D (A)$ .

Definition 5.8 (Kallenberg 2002, Chapter 19)

Let A be a closed linear operator with domain of definition $D (A)$ . A core for A is a linear subspace $D \subseteq D (A)$ such that the restriction $A | D$ has closure A.

Proposition 5.9 (Kallenberg 2002, Proposition 19.9)

If $A$ is the generator of a Feller semigroup $(P_{t})_{t \geq 0},$ then any dense, $(P_{t})_{t \geq 0}$ -invariant subspace $D \subseteq D (A)$ is a core for $A$ .

Proposition 5.10

Under the assumptions made in this section, and for $A$ being defined as in (14), it is true that $C_{b}^{\infty} (E, R)$ is a core for $A$ .

Proof.

We certainly have that $C_{b}^{\infty} (E, R)$ is a dense subspace of $C_{b} (E, R)$ . Furthermore, the transition semigroup satisfies $P_{t} : C_{b} (E, R) \to C_{b} (E, R)$ for all $t \in [0, \infty)$ , see Davis (1993, p. 76), since the PDMP is Feller by Theorem 5.6.

We have to prove that $C_{b}^{\infty} (E, R)$ is invariant under $(P_{t})_{t \in [0, \infty)}$ . We show this by proving that, for all $k \in N$ , $P_{t} C_{b}^{k} (E, R) \subseteq C_{b}^{k} (E, R)$ . For k=0 this is just the Feller property. Since all derivatives are bounded in the $sup$ -norm, differentiation and application of $P_{t}$ commute, i.e. $(\partial^{k} / \partial x^{k}) P_{t} f = P_{t} (\partial^{k} / \partial x^{k}) f \in C_{b} (E, R)$ for all $k \in N$ . Consequently, $C_{b}^{\infty} (E, R)$ is a core for $A$ .

Theorem 5.11 (Kallenberg 2002, Theorem 19.25)

Let X be a Feller process in E with semigroup $(P_{t})_{t \geq 0}$ and generator $A$ with domain $D (A),$ and for all $n \in N$ let $X^{n}$ be Feller processes in E with semigroups $(P_{t}^{n})_{t \geq 0}$ and generators $A^{n}$ with domains $D (A^{n})$ . Let D be a core for $A$ . Then the following statements are equivalent:

(i) for every $f \in D$ there exists a sequence $(f^{n})_{n \in N}$ with $f^{n} \in D (A^{n})$ for all $n \in N$ and such that $f^{n} \to f$ and $A^{n} f^{n} \to A f$ uniformly as $n \to \infty,$

(ii) for all t>0 we have $P_{t}^{n} \to P_{t}$ as $n \to \infty$ in the strong operator topology,

(iii) for every $f \in C_{0} (E, R)$ and every $t_{0} \in (0, \infty)$ it holds that $P_{t}^{n} f \to P_{t} f$ as $n \to \infty$ uniformly for $t \in [0, t_{0}],$

(iv) if $X_{0}^{n} \overset{d}{\to} X_{0}$ in E, then $X^{n} \overset{d}{\to} X$ in $D ([0, \infty), E)$ .

Remark 5.12

The notion of weak convergence of processes in Item (iv) needs an explanation. Here, $D ([0, \infty), E)$ is the space of càdlàg functions, equipped with the Skorokhod topology, see Ethier & Kurtz (1986, p. 118). With this topology, $D ([0, \infty), E)$ is a Borel subset of a Polish space and for a sequence $(X^{n})_{n \in N}$ of $D ([0, \infty), E)$ -valued random variables (i.e. processes in E with càdlàg paths), and a $D ([0, \infty), E)$ -valued random variable X we have $X^{n} \overset{d}{\to} X$ if and only if $lim_{n \to \infty} E (F (X^{n})) = E (F (X))$ for all bounded Skorokhod continuous functions $F : D ([0, \infty), E) \to R$ , see Kurtz & Protter (1996, Section 6) or Ethier & Kurtz (1986, Chapter 3). We do not wish to go into the details of the notion of Skorokhod continuity. It suffices to mention that from Kurtz & Protter (1996, Section 8, Example 8.1) we know that for given continuous functions $f_{1} : E \times [0, \infty) \to R^{d}$ and $f_{2} : [0, \infty) \to [0, \infty)$ , and fixed $t \in [0, \infty)$ , the following functionals are Skorokhod continuous:

$\begin{aligned} F_{1} (ω) & = f_{1} (ω (t), t) (for ω \in D ([0, \infty), E)), \\ F_{2} (ω) & = \int_{0}^{t} f_{2} (t - s) f_{1} (ω (s), s) d s (for ω \in D ([0, \infty), E)) . \end{aligned}$

Lemma 5.13

Let $f : E \to R$ be continuous and bounded, then the functional

$F_{3} (ω) = \int_{0}^{\infty} e^{- δ s} f (ω (s)) d s (for ω \in D ([0, \infty), E))$

is Skorokhod continuous.

Proof.

Let σ denote the Skorokhod metric on $D ([0, \infty), E)$ . Let $ϵ > 0$ and fix $ω_{1} \in D ([0, \infty), E)$ . There exists t>0 such that $\int_{t}^{\infty} e^{- δ s} ∥ f ∥_{\infty} d s < ϵ / 4$ . By Skorokhod continuity of $F_{2}$ (with $f_{1} (x, s) = f (x)$ and $f_{2} (r) = e^{- δ (t - r)}$ ) there exists an $η > 0$ such that for all $ω_{2} \in D ([0, \infty), E)$ it holds that, if $σ (ω_{1}, ω_{2}) < η$ then $| \int_{0}^{t} e^{- δ s} f (ω_{1} (s)) d s - \int_{0}^{t} e^{- δ s} f (ω_{2} (s)) d s | < ϵ / 2$ . Therefore,

$\begin{aligned} | F_{3} (ω_{1}) - F_{3} (ω_{2}) | & = |\int_{0}^{\infty} e^{- δ s} f (ω_{1} (s)) d s - \int_{0}^{\infty} e^{- δ s} f (ω_{2} (s)) d s| \\ \leq |\int_{0}^{t} e^{- δ s} f (ω_{1} (s)) d s - \int_{0}^{t} e^{- δ s} f (ω_{2} (s)) d s| + 2 \int_{t}^{\infty} e^{- δ s} ∥ f ∥_{\infty} d s < ϵ . \end{aligned}$

We remark that a function $f : E \to R$ is continuous if and only if $f_{k} : E_{k} \to R$ is continuous for all k. In particular, every indicator function of a component ${k} \times E_{k}$ is continuous.

We are in the position to show that cost functionals indeed commute with weak limits of PDMPs.

Lemma 5.14

Let X be a PDMP and $(X^{n})_{n \in N}$ be a sequence of PDMPs on the same state space E and with the same cemetery $E^{c},$ and let $ℓ : E \to R$ and $Ψ : E \to R$ be a running reward function and a terminal cost function, respectively. Assume that both ℓ and Ψ are continuous and bounded. Assume further that $X_{0}^{n} = x$ for all $n \in N$ and $X_{0} = x,$ and $X^{n} \overset{d}{\to} X$ in $D ([0, \infty), E)$ .

Then

$E_{x} (\int_{0}^{τ} e^{- δ t} ℓ (X_{t}^{n}) d t + e^{- δ τ} Ψ (X_{τ}^{n})) \to E_{x} (\int_{0}^{τ} e^{- δ t} ℓ (X_{t}) d t + e^{- δ τ} Ψ (X_{τ}))$

as $n \to \infty$ .

Proof.

Recall that $ℓ \equiv 0$ on $E^{c}$ , and $Ψ \equiv 0$ on $E ∖ E^{c}$ , so that $\int_{0}^{\infty} e^{- δ s} ℓ (ω (s)) d s = \int_{0}^{τ} e^{- δ s} ℓ (ω (s)) d s$ and $\int_{0}^{\infty} δ e^{- δ s} Ψ (ω (s)) d s = \int_{τ}^{\infty} δ e^{- δ s} Ψ (ω (s)) d s$ . Thus by Lemma 5.13 the mappings $ω \mapsto \int_{0}^{τ} e^{- δ s} ℓ (ω (s)) d s$ and $ω \mapsto \int_{τ}^{\infty} δ e^{- δ s} Ψ (ω (s)) d s$ are Skorokhod continuous.

Moreover, if ω is a path of the PDMPs, then it holds that $ω (s) = ω (τ)$ for all $s \geq τ$ , so that $\int_{τ}^{\infty} δ e^{- δ s} Ψ (ω (s)) d s = e^{- δ τ} Ψ (ω (τ))$ . This completes the proof.

Also, finite time ruin probabilities, i.e. the probability of the PDMP reaching the cemetery before a given time horizon t, commute with weak limits, as we show next.

Lemma 5.15

Let X be a PDMP and $(X^{n})_{n \in N}$ be a sequence of PDMPs on the same state space E and with the same cemetery $E^{c}$ . Assume further that $X_{0}^{n} = x$ for all $n \in N$ and $X_{0} = x,$ and $X^{n} \overset{d}{\to} X$ in $D ([0, \infty), E)$ .

Then $lim_{n \to \infty} P_{x} (X_{t}^{n} \in E^{c}) = P_{x} (X_{t} \in E^{c})$ for all $t \geq 0$ .

Proof.

Consider a functional of the same form as $F_{1}$ in Remark 5.12, with $f_{1} = 1_{E^{c}}$ . Since the cemetery is a union of entire components ${k} \times E_{k}$ , and is therefore a union of connected components of E, the indicator function of the cemetery is continuous. Therefore if we define $ψ (x, t) = P_{x} (τ \leq t) = P_{x} (X_{t} \in E^{c}) = E_{x} (1_{E^{c}} (X_{t}))$ and $ψ^{n} (x, t) = P_{x} (τ^{n} \leq t) = P_{x} (X_{t}^{n} \in E^{c}) = E_{x} (1_{E^{c}} (X_{t}^{n}))$ , $n \in N$ , we have $lim_{n \to \infty} ψ^{n} (x, t) = ψ (x, t)$ for all $x \in E$ and for all $t \geq 0$ .

The following theorem specifies conditions under which Theorem 5.11 is applicable in the PDMP setting.

Theorem 5.16

Let X be a Feller PDMP with local characteristics $(ϕ, λ, Q)$ and let $X^{n},$ $n \in N,$ be Feller PDMPs with local characteristics $(ϕ^{n}, λ^{n}, Q^{n})$ . Further, let the following assumptions hold:

$g^{n} \to g$ and $λ^{n} \to λ$ as $n \to \infty,$ uniformly in $x \in E,$

for all $f \in C_{b}^{\infty} (E, R),$
$lim_{n \to \infty} sup_{x \in E} |\int_{E} f (y) Q^{n} (d y, x) - \int_{E} f (y) Q (d y, x)| = 0,$ (15)

$X_{0}^{n} \overset{d}{\to} X_{0}$ in E.

Then $X^{n} \overset{d}{\to} X$ in $D ([0, \infty), E)$ .

Proof.

Let $D (A^{n})$ , $n \in N$ , and $D (A)$ be the domains of the generators $A^{n}$ , $n \in N$ , and $A$ , corresponding to $X^{n}$ and X, respectively. For $f^{n} \in D (A^{n})$ we have

$\begin{aligned} A^{n} f^{n} (x) & = X^{n} f^{n} (x) + λ^{n} (x) \int_{E} (f^{n} (y) - f^{n} (x)) Q^{n} (d y, x), \\ (X^{n} f^{n}) (x) & = (g^{n}) (x) \cdot \nabla (f^{n}) (x) . \end{aligned}$

By Proposition 5.10, $D = C_{b}^{\infty} (E, R)$ is a core for all generators involved. For every $f \in D$ we set $f^{n} = f$ for all $n \in N$ , such that trivially $f^{n} \to f$ as $n \to \infty$ . Next, observe that we have for all $n \in N$ ,

$\begin{aligned} | A^{n} f (x) - A f (x) | \leq | g^{n} (x) \cdot \nabla f (x) - g (x) \cdot \nabla f (x) | \\ + |λ^{n} (x) \int_{E} (f (y) - f (x)) Q^{n} (d y, x) - λ (x) \int_{E} (f (y) - f (x)) Q (d y, x)| \\ = | (g^{n} (x) - g (x)) \cdot \nabla f (x) | + |λ^{n} (x) \int_{E} (f (y) - f (x)) Q^{n} (d y, x) - λ (x) \int_{E} (f (y) - f (x)) Q (d y, x)| \\ \leq ∥ g^{n} - g ∥_{\infty} ∥ \nabla f ∥_{\infty} + ∥ f ∥_{\infty} |λ^{n} (x) \int_{E} Q^{n} (d y, x) - λ (x) \int_{E} Q (d y, x)| \end{aligned}$ (16)

$\begin{aligned} + |λ^{n} (x) \int_{E} f (y) Q^{n} (d y, x) - λ (x) \int_{E} f (y) Q (d y, x)| . \end{aligned}$ (17)

Since $Q^{n}$ , $n \in N$ , and Q are probability measures on $(E, B (E))$ , and since, by assumption, $g^{n} \to g$ and $λ^{n} \to λ$ uniformly in $x \in E$ , the terms in (16) converge to zero. The term in (17) can be estimated as follows,

$\begin{aligned} |λ^{n} (x) \int_{E} f (y) Q^{n} (d y, x) - λ (x) \int_{E} f (y) Q (d y, x)| \\ \leq ∥ λ^{n} ∥_{\infty} |\int_{E} f (y) Q^{n} (d y, x) - \int_{E} f (y) Q (d y, x)| + |\int_{E} f (y) Q (d y, x)| ∥ λ^{n} - λ ∥_{\infty} . \end{aligned}$

The latter expression tends to zero, since for all $x \in E$ it was assumed that (15) holds, and since $λ^{n} \to λ$ uniformly in $x \in E$ . Thus, Item (i) of Theorem 5.11 holds. This implies that Item (iv) of Theorem 5.11 holds. The latter is equivalent to the assertion of this theorem.

Remark 5.17

Note that in the Feller case we can move to another external state only due to a purely random jump, i.e. a jump determined by $Q^{n}$ for $n \in N$ or Q. Therefore, if we assume uniform convergence of the local characteristics across all state components, and in particular also $Q^{n} \to Q$ in the sense of (15), then the result of Theorem 5.16 still holds.

Since uniform convergence of the local characteristics and the assumption that $t^{*} (x) = \infty$ are essential in the proof of Theorem 5.16, we need an alternative argument for situations with an active boundary or for situations in which a smooth approximation fails. A prototypical univariate example for both cases is a drift of the form $g (x) = c 1_{{x \leq b}}$ for some $b \in R$ . Here one faces either a discontinuity or a subdivision of $R$ into two state components, i.e. $R = {x \in R : x \leq b} \cup {x \in R : x > b}$ , with a continuous drift on each component. For a specific example, we find a method for dealing with this particular situation in the next section.

6. Application to the Cramér–Lundberg model with loan

In this section we apply our smoothing technique to the example presented in Section 2.1.4 and calculate the quantity of interest using different numerical integration methods. In this setup, $ϕ_{1}$ solves the ODE $(\partial / \partial t) ϕ_{1} (y, t) = g_{1} (ϕ_{1} (y, t))$ $\forall y \in E_{1}$ and $\forall t \in R$ , with

$\begin{aligned} g_{1} (y) = \begin{cases} c & \text{if } y \in (0, \infty), \\ c + ρ y & \text{if } y \in (- \frac{c}{ρ}, 0], \\ 0 & \text{if } y \in (- \infty, - \frac{c}{ρ}] . \end{cases} \end{aligned}$

In the setup of Section 2.1.4, the quantity of interest is the expected value of discounted future dividend payments until the time of ruin. The cemetery $E^{c}$ is given by $E^{c} = ({2} \times E_{2}) \cup ({3} \times E_{3})$ , the running reward ℓ is given by $ℓ_{1} \equiv 0, ℓ_{4} \equiv c$ , and the terminal cost is $Ψ (x) = 0$ for $x \in E^{c}$ . For $x \in E$ , $t \geq 0$ , let

$L (t, x) = \int_{0}^{t} e^{- δ s} ℓ (ϕ (s, x)) d s .$

Since g is not differentiable in 0 and $t^{*} (x) < \infty$ for all $x = (1, y)$ with $y \in E_{1}$ , we replace g by a smoothed version and we also modify ℓ accordingly. For $ϵ > 0$ let

$\begin{aligned} g_{1}^{ϵ} (y) = \begin{cases} c & \text{if } y \in (ϵ, b - ϵ], \\ \frac{c (b - y)^{3} (15 ϵ (y - b) + 6 (b - y)^{2} + 10 ϵ^{2})}{ϵ^{5}} & \text{if } y \in (b - ϵ, b), \\ c + ρ y & \text{if } y \in (- \frac{c}{ρ}, - ϵ), \\ c + \frac{ρ (y + 3 ϵ) (y - ϵ)^{3}}{16 ϵ^{3}} & \text{if } y \in [- ϵ, ϵ], \\ 0 & \text{if } y \in (- \infty, - \frac{c}{ρ}] \cup [b, \infty) . \end{cases} \end{aligned}$
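As a quick sanity check of the piecewise definition above, the following Python sketch evaluates $g_{1}$ and $g_{1}^{ϵ}$ and measures how far apart they are below $b - ϵ$. The parameter values are borrowed from the numerical experiment in Section 6.1; the function names and the grid-based maximum are illustrative assumptions and not part of the original implementation.

```python
# Illustrative sketch only: parameter values as in Section 6.1.
C, RHO, B, EPS = 5.0, 0.05, 3.24289, 0.01


def g1(y):
    """Original drift g_1 (not differentiable at 0)."""
    if y > 0.0:
        return C
    if y > -C / RHO:
        return C + RHO * y
    return 0.0


def g1_eps(y, eps=EPS):
    """Smoothed drift g_1^eps, following the piecewise definition above."""
    if y <= -C / RHO or y >= B:
        return 0.0
    if y < -eps:
        return C + RHO * y
    if y <= eps:
        # polynomial blend between c + rho*y and c on [-eps, eps]
        return C + RHO * (y + 3.0 * eps) * (y - eps) ** 3 / (16.0 * eps ** 3)
    if y <= B - eps:
        return C
    # quintic ramp from c down to 0 on (b - eps, b)
    u = B - y
    return C * u ** 3 * (15.0 * eps * (y - B) + 6.0 * u ** 2 + 10.0 * eps ** 2) / eps ** 5


def max_gap_below_barrier(n=200_001, lo=-1.0):
    """Largest |g_1 - g_1^eps| on an equidistant grid in [lo, b - eps]."""
    hi = B - EPS
    step = (hi - lo) / (n - 1)
    return max(abs(g1(lo + k * step) - g1_eps(lo + k * step)) for k in range(n))
```

On this grid the observed gap stays below $3 ϵ ρ / 16$, and the pieces agree at the junction points $- ϵ$, $ϵ$ and $b - ϵ$.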

Observe that $g_{1}^{ϵ} \in C^{2} (R)$ , that $lim_{y ↗ b} g_{1}^{ϵ} (y) = 0$ and that $g_{1}^{ϵ} \geq 0$ . For $ϵ > 0$ define the PDMP $X^{ϵ}$ so that for all $y \in R$ its flow $ϕ_{1}^{ϵ} (\cdot, y)$ is the solution to the ODE $\frac{d}{d t} ϕ_{1}^{ϵ} (t, y) = g_{1}^{ϵ} (ϕ_{1}^{ϵ} (t, y))$ with $ϕ_{1}^{ϵ} (0, y) = y$ . Apart from that, all specifications are the same as for the original PDMP X. In addition, we replace $ℓ_{1}$ by

$ℓ_{1}^{ϵ} (y) = c h (\frac{y - b + ϵ}{ϵ}),$

where h can be chosen as in (13) and we define

$L^{ϵ} (t, x) = \int_{0}^{t} e^{- δ s} ℓ^{ϵ} (ϕ^{ϵ} (s, x)) d s .$

We aim at computing $G^{i - 1} H$ for the smoothed process, in order to observe how (9) simplifies in this example. By the definition of the cemetery, $G^{i - 1} H (x) = 0$ for all $x = (k, z) \in E$ with $k \in {2, 3}$ . For $x = (1, z)$ with $z \in E_{1}$ , any jumps bigger than $z + c / ρ$ lead to the cemetery, so we only need to integrate over jump sizes up to $z + c / ρ$ . Thus, we get that

$\begin{aligned} G V (x) = G V ((1, z)) & = \int_{0}^{\infty} f_{W} (t, x) e^{- δ t} \int_{E} V (y) Q (d y, ϕ (x, t)) d t \\ & = \int_{0}^{\infty} f_{W} (t, x) e^{- δ t} \int_{0}^{z + c / ρ} V ((1, z - y)) d F_{Y} (y) d t . \end{aligned}$

Moreover, since λ is constant on $E_{1}$ it holds for all $x = (1, z)$ with $z \in E_{1}$ , $t \geq 0$ that $f_{W} (t, x) = λ_{N} e^{- λ_{N} t}$ , where $λ_{N}$ is as in Section 2.1.1. For $x = (1, z)$ with $z \in E_{1}$ we get

$\begin{aligned} G^{i - 1} H (x) & = \int_{t_{1} = 0}^{\infty} λ_{N} e^{- (λ_{N} + δ) t_{1}} \int_{y_{1} = 0}^{χ_{1^{-}} + \frac{c}{ρ}} \dots \int_{t_{i - 1} = 0}^{\infty} λ_{N} e^{- (λ_{N} + δ) t_{i - 1}} \int_{y_{i - 1} = 0}^{χ_{(i - 1)^{-}} + \frac{c}{ρ}} \\ & \quad \int_{t_{i} = 0}^{\infty} λ_{N} e^{- λ_{N} t_{i}} L^{ϵ} (t_{i}, χ_{(i - 1)}) d t_{i} d F_{Y} (y_{i - 1}) d t_{i - 1} \dots d F_{Y} (y_{1}) d t_{1}, \end{aligned}$ (18)

where the functions $χ_{j^{-}}, χ_{j}$ , $j = 1, 2, \dots, i - 1$ solve $χ_{j^{-}} = ϕ_{1}^{ϵ} (t_{j}, χ_{j - 1})$ and $χ_{j} = χ_{j^{-}} - y_{j}$ .

Thus $χ_{j^{-}}$ depends on $t_{1}, \dots, t_{j}$ and $y_{1}, \dots, y_{j - 1}$ , whereas $χ_{j}$ depends on $t_{1}, \dots, t_{j}$ and $y_{1}, \dots, y_{j}$ . However, this dependence has been suppressed in (18) for the sake of readability.

Assumption 6.1

The jump distribution admits a density $f_{Y} = F_{Y}^{'}$ , with $f_{Y} \in C_{0}^{2}$ .

In what follows, suppose that Assumption 6.1 holds. We apply the variable substitution $t_{j} = - \ln (v_{j})$ and $y_{j} = (χ_{j^{-}} + c / ρ) z_{j}$ , where $v_{j} \in [0, 1], z_{j} \in [0, 1]$ , and write ${\hat{χ}}_{j} (v_{1}, \dots, v_{j}, z_{1}, \dots, z_{j}) = χ_{j^{-}} (t_{1}, \dots, t_{j}, y_{1}, \dots, y_{j})$ . We then put

$ν (v_{1}, \dots, v_{i}, z_{1}, \dots, z_{i - 1}) = L^{ϵ} (- \ln (v_{i}), {\hat{χ}}_{i - 1} (v_{1}, \dots, v_{i - 1}, z_{1}, \dots, z_{i - 1})),$

which leads to

$\begin{aligned} G^{i - 1} H (x) & = \int_{v_{1} = 0}^{1} \dots \int_{v_{i} = 0}^{1} \int_{z_{1} = 0}^{1} \dots \int_{z_{i - 1} = 0}^{1} λ_{N}^{i} [\prod_{j = 1}^{i - 1} v_{j}^{δ + λ_{N} - 1}] v_{i}^{λ_{N} - 1} ν (v_{1}, \dots, v_{i}, z_{1}, \dots, z_{i - 1}) \\ & \quad \times [\prod_{j = 1}^{i - 1} f_{Y} (z_{j} ({\hat{χ}}_{j} + \frac{c}{ρ})) ({\hat{χ}}_{j} + \frac{c}{ρ}) d z_{j}] \prod_{j = 1}^{i} d v_{j} . \end{aligned}$ (19)
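The time substitution $t_{j} = - \ln (v_{j})$ behind (19) can be checked in one dimension: it turns $\int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) t} d t$ into $\int_{0}^{1} λ_{N} v^{δ + λ_{N} - 1} d v$ , and both equal $λ_{N} / (λ_{N} + δ)$ . The Python sketch below compares the two forms with a midpoint rule; the values of $λ_{N}$ and δ are those of Section 6.1, while the truncation point `t_max`, the grid sizes and the function names are assumptions made only for this illustration.

```python
import math

LAM, DELTA = 4.0, 0.02  # lambda_N and delta as in Section 6.1


def integral_half_line(t_max=40.0, n=200_000):
    """Midpoint rule for the original form on [0, t_max] (tail is negligible)."""
    h = t_max / n
    return h * sum(LAM * math.exp(-(LAM + DELTA) * (k + 0.5) * h) for k in range(n))


def integral_unit_interval(n=20_000):
    """Midpoint rule for the substituted form on [0, 1], with t = -ln(v)."""
    h = 1.0 / n
    return h * sum(LAM * ((k + 0.5) * h) ** (LAM + DELTA - 1.0) for k in range(n))


EXACT = LAM / (LAM + DELTA)
```

The substituted integrand lives on a bounded domain, which is what allows cubature rules on the unit cube to be applied to (19).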

Due to the recursive structure of the functions ${\hat{χ}}_{1}, {\hat{χ}}_{2}, \dots, {\hat{χ}}_{i - 1}$ , the Jacobian matrix of the substitution is lower triangular, so that its determinant is the product of the diagonal elements. To be able to apply (10) in a meaningful way, we need to bound the Hessian of the integrand. If, for example, the jump size distribution is the Gamma distribution with parameters $α, β > 0$ , i.e. $d F_{Y} (y) = (y^{α - 1} β^{α} e^{- β y}) / Γ (α) d y$ , then this boils down to the condition $β \geq 3$ and $δ + λ > 3$ , which implies that the integrand is bounded in $0$ . In the original problem statement this corresponds to an additional integrability condition on the jump size distribution. Finally, for $x \in E$ of the form $x = (4, b)$ we have

$G^{i - 1} H ((4, b)) = \int_{0}^{\infty} λ e^{- λ t} e^{- δ t} \int_{0}^{b + c / ρ} G^{i - 2} H ((1, b - y)) d F_{Y} (y) d t .$

Remark 6.2

Section 5.3 deals, in a fairly general setting, with the stability of the considered functional of the process with respect to the smoothing parameter ε. Unfortunately, because of the discontinuity of the drift g in the present example, we cannot achieve uniform convergence of the smoothed drift around the barrier level b; only point-wise convergence is achieved.

Theorem 6.3

In the setup of this section, the following assertion holds true. There exists C>0 such that $∥ V^{ϵ} - V ∥_{\infty} \leq C ϵ$ .

Proof.

Recall that

$V (x) = E_{x} (\int_{0}^{τ} e^{- δ s} ℓ (X_{s}) d s) \quad \text{and} \quad V^{ϵ} (x) = E_{x} (\int_{0}^{τ^{ϵ}} e^{- δ s} ℓ^{ϵ} (X_{s}^{ϵ}) d s),$

where $τ = inf {t \geq 0 : X_{t} \in E^{c}}$ and $τ^{ϵ} = inf {t \geq 0 : X_{t}^{ϵ} \in E^{c}}$ .

It is readily checked that $sup_{y \in (- c / ρ, b - ϵ)} | g_{1} (y) - g_{1}^{ϵ} (y) | \leq 3 ϵ ρ / 16$ and that $| g_{1} (y_{1}) - g_{1} (y_{2}) | \leq ρ | y_{1} - y_{2} |$ for all $y_{1}, y_{2} \in (- c / ρ, b - ϵ)$ . Hence we get from Kamke (1964, Theorem 8, p. 111) that

$| ϕ_{1}^{ϵ} (t, y) - ϕ_{1} (t, y) | < \frac{3 ϵ}{16} (e^{ρ t} - 1)$

for all $y \in (- c / ρ, b - ϵ)$ and for all $t \in [0, min {θ_{b - ϵ}^{ϵ}, {\tilde{θ}}_{b - ϵ}}]$ , where

$\begin{aligned} θ_{b - ϵ}^{ϵ} & = inf {t \geq 0 : ϕ_{1}^{ϵ} (t, y) \geq b - ϵ}, \\ {\tilde{θ}}_{b - ϵ} & = inf {t \geq 0 : ϕ_{1} (t, y) \geq b - ϵ}, \\ {\tilde{θ}}_{b} & = inf {t \geq 0 : ϕ_{1} (t, y) \geq b} . \end{aligned}$

Since $g_{1}^{ϵ}$ and $g_{1}$ coincide on $(- c / ρ, b - ϵ) ∖ (- ϵ, ϵ)$ and $g_{1}^{ϵ} \geq g_{1} \geq 0$ , we can refine this estimate to get

$| ϕ_{1}^{ϵ} (t, y) - ϕ_{1} (t, y) | < \frac{3 ϵ}{16} (e^{ρ C (ϵ)} - 1),$

for all $t \in [0, min {θ_{b - ϵ}^{ϵ}, {\tilde{θ}}_{b - ϵ}}]$ , where $C (ϵ) \in [0, \infty)$ is the time needed for the trajectory $ϕ_{1} (\cdot, y)$ to cross $(- ϵ, ϵ)$ . Note that $g_{1}^{ϵ} \geq g_{1} \geq 0$ yields $ϕ_{1}^{ϵ} (t, y) \geq ϕ_{1} (t, y)$ for all $t \in [0, {\tilde{θ}}_{b - ϵ}]$ , and hence ${\tilde{θ}}_{b - ϵ} \geq θ_{b - ϵ}^{ϵ}$ . For $t \geq min {θ_{b - ϵ}^{ϵ}, {\tilde{θ}}_{b - ϵ}} = θ_{b - ϵ}^{ϵ}$ we have by construction that $| ϕ_{1}^{ϵ} (t, y) - ϕ_{1} (t, y) | \leq ϵ$ . In total we get

$| ϕ_{1}^{ϵ} (t, y) - ϕ_{1} (t, y) | \leq ϵ max (1, \frac{3}{16} (e^{ρ C (ϵ)} - 1)) .$ (20)

Since $\lim_{ϵ \to 0} C (ϵ) = 0$ , it holds that $| ϕ_{1}^{ϵ} (t, y) - ϕ_{1} (t, y) | \leq ϵ$ for sufficiently small $ϵ > 0$ .
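The flow estimate can be probed numerically before the barrier region is reached. The Python sketch below integrates both ODEs with a classical Runge-Kutta scheme from one starting point and records the largest gap between the two flows, together with the largest amount by which that gap exceeds the Kamke-type bound $\frac{3 ϵ}{16} (e^{ρ t} - 1)$ . Parameter values are those of Section 6.1; the starting point $y = - 0.5$ , the time horizon, the step size and all function names are assumptions chosen only for this illustration.

```python
import math

C, RHO, B, EPS = 5.0, 0.05, 3.24289, 0.01  # values from Section 6.1


def g1(y):
    """Original drift."""
    if y > 0.0:
        return C
    return C + RHO * y if y > -C / RHO else 0.0


def g1_eps(y):
    """Smoothed drift, restricted here to the region below b - eps."""
    if y <= -C / RHO:
        return 0.0
    if y < -EPS:
        return C + RHO * y
    if y <= EPS:
        return C + RHO * (y + 3.0 * EPS) * (y - EPS) ** 3 / (16.0 * EPS ** 3)
    return C


def rk4_step(f, y, h):
    """One classical fourth-order Runge-Kutta step for y' = f(y)."""
    k1 = f(y)
    k2 = f(y + 0.5 * h * k1)
    k3 = f(y + 0.5 * h * k2)
    k4 = f(y + h * k3)
    return y + h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0


def compare_flows(y0=-0.5, t_end=0.5, h=1e-4):
    """Return (largest flow gap, largest excess of the gap over the bound)."""
    y = y_eps = y0
    max_gap, max_excess = 0.0, 0.0
    for k in range(1, int(round(t_end / h)) + 1):
        y = rk4_step(g1, y, h)
        y_eps = rk4_step(g1_eps, y_eps, h)
        gap = abs(y_eps - y)
        bound = 3.0 * EPS / 16.0 * (math.exp(RHO * k * h) - 1.0)
        max_gap = max(max_gap, gap)
        max_excess = max(max_excess, gap - bound)
    return max_gap, max_excess
```

With these settings both trajectories stay below $b - ϵ$ , so only the smoothing around 0 contributes to the gap.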

Recall that $T_{1}$ is the time of the first jump of X conditional on $X_{0} = (1, y)$ . Since the jump intensity is constant on $E_{1}$ , $T_{1}$ is exponentially distributed with intensity $λ_{N}$ . Hence, we can write

$\begin{aligned} V ((1, y)) & = E_{(1, y)} (L (T_{1}, (1, y)) + e^{- δ T_{1}} V (X_{T_{1}})) \\ = \int_{0}^{\infty} λ_{N} e^{- λ_{N} s} (L (s, (1, y)) + e^{- δ s} \int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1} (s, y))) d s \\ = \int_{0}^{\infty} λ_{N} e^{- λ_{N} s} L (s, (1, y)) d s + \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} \int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1} (s, y)) d s, \end{aligned}$

and analogously for $V^{ϵ}$ . We write $V ((1, y)) = V_{1} (y)$ and $V^{ϵ} ((1, y)) = V_{1}^{ϵ} (y)$ for $y \in E_{1}$ . Therefore,

$\begin{aligned} | V_{1} (y) - V_{1}^{ϵ} (y) | \leq \int_{0}^{\infty} λ_{N} e^{- λ_{N} s} | L (s, (1, y)) - L^{ϵ} (s, (1, y)) | d s \\ + \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} |\int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1} (s, y)) - \int_{E} V^{ϵ} (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y))| d s \\ \leq \int_{0}^{\infty} λ_{N} e^{- λ_{N} s} | L (s, (1, y)) - L^{ϵ} (s, (1, y)) | d s \\ + \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} |\int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1} (s, y)) - \int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y))| d s \\ + \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} |\int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y)) - \int_{E} V^{ϵ} (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y))| d s . \end{aligned}$ (21)

For $x = (1, y)$ and $s \geq 0$ it holds that $L^{ϵ} (s, x) = 0$ for $s \leq θ_{b - 2 ϵ}^{ϵ}$ , and

$L^{ϵ} (s, x) = \int_{0}^{s} e^{- δ r} ℓ^{ϵ} (ϕ^{ϵ} (r, x)) d r = c \int_{0}^{s} e^{- δ r} h ((ϕ^{ϵ} (r, x) - b + ϵ) / ϵ) d r \leq c \int_{θ_{b - 2 ϵ}^{ϵ}}^{s} e^{- δ r} d r$

for $s \geq θ_{b - 2 ϵ}^{ϵ}$ . On the other hand, we have that, for $x = (1, y)$ and $s \geq 0$ , $L (s, x) = 0$ for $s \leq {\tilde{θ}}_{b}$ , and

$L (s, x) = c \int_{{\tilde{θ}}_{b}}^{s} e^{- δ r} d r$

for $s > {\tilde{θ}}_{b}$ . Using $ϕ_{1}^{ϵ} (s, y) \geq ϕ_{1} (s, y)$ for all $s \geq 0$ , we get ${\tilde{θ}}_{b} \geq θ_{b - 2 ϵ}^{ϵ}$ , such that

$| L^{ϵ} (s, (1, y)) - L (s, (1, y)) | = L^{ϵ} (s, (1, y)) - L (s, (1, y)) \leq c \int_{θ_{b - 2 ϵ}^{ϵ}}^{{\tilde{θ}}_{b}} e^{- δ r} d r$

for all $s \geq 0$ . Hence,

$\int_{0}^{\infty} λ_{N} e^{- λ_{N} s} | L (s, (1, y)) - L^{ϵ} (s, (1, y)) | d s \leq c \int_{θ_{b - 2 ϵ}^{ϵ}}^{{\tilde{θ}}_{b}} e^{- δ r} d r \leq c ({\tilde{θ}}_{b} - θ_{b - 2 ϵ}^{ϵ}) .$

Now ${\tilde{θ}}_{b} - θ_{b - 2 ϵ}^{ϵ} \leq (b - (b - 2 ϵ - ϵ C_{1} (ϵ))) / c = ϵ (2 + C_{1} (ϵ)) / c$ , where $C_{1} (ϵ) = max (1, \frac{3}{16} (e^{ρ C (ϵ)} - 1))$ , see (20). With this, the first term in (21) can be estimated by

$\int_{0}^{\infty} λ_{N} e^{- λ_{N} s} | L (s, (1, y)) - L^{ϵ} (s, (1, y)) | d s \leq ϵ (2 + C_{1} (ϵ)) .$ (22)

Next, observe that (we remind the reader that the states $x \in E$ are denoted by $x = (k, y)$ , which is why in the following the terms $y_{1}, y_{2}$ are not to be confused with the integration variables $y_{j}$ used in and below (18)),

$\begin{aligned} |\int_{E} V (x_{1}) Q (d x_{1}, (1, y_{1})) - \int_{E} V (x_{1}) Q (d x_{1}, (1, y_{2}))| \\ = |\int_{0}^{y_{1} + c / ρ} V_{1} (y_{1} - z) f_{Y} (z) d z - \int_{0}^{y_{2} + c / ρ} V_{1} (y_{2} - z) f_{Y} (z) d z| \\ \leq |\int_{min (y_{1}, y_{2}) + c / ρ}^{max (y_{1}, y_{2}) + c / ρ} V_{1} (z) f_{Y} (z) d z| \leq ∥ V_{1} ∥_{\infty} ∥ f_{Y} ∥_{\infty} | y_{1} - y_{2} | . \end{aligned}$

Combining this with (20), we can estimate the second term in (21) by

$\begin{aligned} \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} |\int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1} (s, y)) - \int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y))| d s \end{aligned}$ (23)

$\begin{aligned} \leq \frac{λ_{N}}{λ_{N} + δ} ∥ V_{1} ∥_{\infty} ∥ f_{Y} ∥_{\infty} sup_{s \geq 0} | ϕ_{1} (s, y) - ϕ_{1}^{ϵ} (s, y) | \leq \frac{λ_{N}}{λ_{N} + δ} ∥ V_{1} ∥_{\infty} ∥ f_{Y} ∥_{\infty} ϵ \max (1, \frac{3}{16} (e^{ρ C (ϵ)} - 1)) . \end{aligned}$ (24)

Furthermore, since

$|\int_{E} V (x_{1}) Q (d x_{1}, (1, y_{2})) - \int_{E} V^{ϵ} (x_{1}) Q (d x_{1}, (1, y_{2}))| \leq {∥V_{1} - V_{1}^{ϵ}∥}_{\infty},$

the third term in (21) can be estimated as follows,

$\begin{aligned} \int_{0}^{\infty} λ_{N} e^{- (λ_{N} + δ) s} |\int_{E} V (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y)) - \int_{E} V^{ϵ} (x_{1}) Q (d x_{1}, ϕ_{1}^{ϵ} (s, y))| d s \\ \leq \frac{λ_{N}}{λ_{N} + δ} ∥ V_{1} - V_{1}^{ϵ} ∥_{\infty} . \end{aligned}$ (25)

Taking the supremum over $y \in E_{1}$ in (21) and using (22), (23), and (25) we obtain that

$∥ V_{1} - V_{1}^{ϵ} ∥_{\infty} \leq C ϵ + \frac{λ_{N}}{λ_{N} + δ} ∥ V_{1} - V_{1}^{ϵ} ∥_{\infty}$

for some constant C and for sufficiently small ε. Thus,

$\frac{δ}{λ_{N} + δ} ∥ V_{1} - V_{1}^{ϵ} ∥_{\infty} \leq C ϵ,$

which completes the proof.

6.1. Numerical experiment

We now solve the example presented above numerically, using the following parameter values: the initial value of the PDMP is $x_{0} = 0$ , the premium income rate is $c = 5$ , the credit rate is $ρ = 0.05$ , the intensity of the Poisson process is $λ = 4$ , the jump size distribution is given by $F_{Y} (x) = 1 - e^{- α x}$ for all $x \in [0, \infty)$ with $α = 1$ , and the discount rate is $δ = 0.02$ . With these values, the optimal dividend threshold according to Dassios & Embrechts (1989) is $b = 3.24289$ . Furthermore, we set the smoothing parameter $ϵ = 0.01$ . To compute the flow, it suffices to solve the corresponding ODE once and to store the solution for repeated use.

We implemented Monte Carlo (random), QMC with the Sobol' sequence (Sobol), and QMC with a scrambled version of the Halton sequence (scrambled Halton), where scrambling refers to a permutation of digits (see, e.g. Owen (2000)). The Sobol' point generator we used was taken from Frances Y. Kuo's homepage (Kuo n.d.) and is based on Joe & Kuo (2008).

The reference solution was calculated using Monte Carlo with $M = 5000 \cdot 2^{10} = 5120000$ sample paths and d=1024, meaning that the maximum number of jumps we allow for is 512. In our plots we show the results plotted over an increasing number of integration nodes $M \in {50 \cdot 2^{j} : 1 \leq j \leq 16}$ .
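For orientation, a plain Monte Carlo estimate of the discounted dividends can be sketched in a few lines of Python. The sketch simulates the unsmoothed process with the closed-form flow (exponential growth below 0, linear growth between 0 and b, dividends at rate c while the process sits at b) and truncates each path after 512 jumps, mirroring the truncation mentioned above. It is a simplified stand-in and not the implementation used for the experiments; all function names, the seed and the default sample size are assumptions.

```python
import math
import random

# Parameter values from Section 6.1.
C, RHO, LAM, ALPHA, DELTA, B = 5.0, 0.05, 4.0, 1.0, 0.02, 3.24289
RUIN_LEVEL = -C / RHO


def time_to_zero(y):
    """Time for the flow y' = c + rho*y to climb from y < 0 to 0."""
    return math.log((C / RHO) / (y + C / RHO)) / RHO


def time_to_barrier(y):
    """Time for the deterministic flow to travel from y up to b."""
    if y >= B:
        return 0.0
    if y >= 0.0:
        return (B - y) / C
    return time_to_zero(y) + B / C


def flow(y, w):
    """Position after time w, assuming w <= time_to_barrier(y)."""
    if y >= 0.0:
        return y + C * w
    t0 = time_to_zero(y)
    if w <= t0:
        return (y + C / RHO) * math.exp(RHO * w) - C / RHO
    return C * (w - t0)


def discounted_dividends(rng, y0=0.0, max_jumps=512):
    """One sample of the discounted dividends, truncated after max_jumps."""
    y, t, total = y0, 0.0, 0.0
    for _ in range(max_jumps):
        w = rng.expovariate(LAM)           # waiting time until the next claim
        tb = time_to_barrier(y)
        if w > tb:                          # barrier reached: dividends at rate c
            total += C * (math.exp(-DELTA * (t + tb)) - math.exp(-DELTA * (t + w))) / DELTA
            y = B
        else:
            y = flow(y, w)
        t += w
        y -= rng.expovariate(ALPHA)         # claim size
        if y <= RUIN_LEVEL:                 # ruin: the path ends in the cemetery
            break
    return total


def mc_estimate(n_paths=2000, seed=1):
    """Sample mean and its estimated standard error."""
    rng = random.Random(seed)
    samples = [discounted_dividends(rng) for _ in range(n_paths)]
    mean = sum(samples) / n_paths
    var = sum((s - mean) ** 2 for s in samples) / (n_paths - 1)
    return mean, math.sqrt(var / n_paths)
```

Calling `mc_estimate()` returns the sample mean and its estimated standard error; since dividends are paid at rate c at most, every sample is bounded by $c / δ$ .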

Figure 3 shows the estimated standard deviation (root mean square error) of the estimation, which is calculated by using 50 repetitions with randomly shifted versions of our integration nodes.
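The repetition scheme behind such an error estimate can be illustrated on a toy problem. The Python sketch below builds Halton points, applies independent uniform random shifts modulo 1, and computes the sample standard deviation of the resulting estimates over 50 repetitions. Note the assumptions: it uses the unscrambled Halton sequence rather than the digit-permuted variant or the Sobol' generator mentioned above, and the four-dimensional product integrand (with exact integral 1) is a hypothetical test function, not the integrand (19).

```python
import math
import random

PRIMES = (2, 3, 5, 7)  # one base per coordinate


def radical_inverse(n, base):
    """Van der Corput radical inverse of the integer n in the given base."""
    inv, scale = 0.0, 1.0 / base
    while n > 0:
        n, digit = divmod(n, base)
        inv += digit * scale
        scale /= base
    return inv


def halton_points(n_points, dim):
    """First n_points of the Halton sequence in dimension dim (index starts at 1)."""
    return [[radical_inverse(k, PRIMES[j]) for j in range(dim)]
            for k in range(1, n_points + 1)]


def toy_integrand(u):
    """Hypothetical test function prod_j (0.5 + u_j); its integral over [0,1]^d is 1."""
    prod = 1.0
    for uj in u:
        prod *= 0.5 + uj
    return prod


def shifted_estimates(n_points=1024, dim=4, reps=50, seed=0):
    """One QMC estimate per random shift of the same point set."""
    rng = random.Random(seed)
    points = halton_points(n_points, dim)
    estimates = []
    for _ in range(reps):
        shift = [rng.random() for _ in range(dim)]
        total = sum(toy_integrand([(p[j] + shift[j]) % 1.0 for j in range(dim)])
                    for p in points)
        estimates.append(total / n_points)
    return estimates


def mean_and_std(values):
    """Sample mean and sample standard deviation."""
    m = sum(values) / len(values)
    var = sum((v - m) ** 2 for v in values) / (len(values) - 1)
    return m, math.sqrt(var)
```

Each shifted estimate is unbiased, so the spread across shifts gives a practical error estimate for a rule that is otherwise deterministic.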

Funding Statement

P. Kritzer is supported by the Austrian Science Fund (FWF): Project F5506-N26, which is part of the Special Research Program ‘Quasi-Monte Carlo Methods: Theory and Applications’. P. Kritzer is partially supported by the National Science Foundation (NSF) [grant number DMS-1638521] to the Statistical and Applied Mathematical Sciences Institute. P. Kritzer, G. Leobacher, and M. Szölgyenyi gratefully acknowledge the partial support of the Erwin Schrödinger International Institute for Mathematics and Physics (ESI) in Vienna under the thematic programme ‘Tractability of High Dimensional Problems and Discrepancy’. G. Leobacher is supported by the Austrian Science Fund (FWF): Project F5508-N26, which is part of the Special Research Program ‘Quasi-Monte Carlo Methods: Theory and Applications’. M. Szölgyenyi is supported by the AXA Research Fund grant ‘Numerical Methods for Stochastic Differential Equations with Irregular Coefficients with Applications in Risk Theory and Mathematical Finance’ and supported by the Vienna Science and Technology Fund (WWTF): Project MA14-031.

Acknowledgments

The authors would like to thank an anonymous referee for useful comments on how to improve the presentation of the results. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation. Part of this article was written while G. Leobacher was affiliated with the Institute of Financial Mathematics and Applied Number Theory, Johannes Kepler University Linz, Altenbergerstraße 69, 4040 Linz, Austria. A part of this article was written while M. Szölgyenyi was affiliated with the Institute of Statistics and Mathematics, Vienna University of Economics and Business, Welthandelsplatz 1, 1020 Vienna, Austria.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

Albrecher H. & Kainhofer R. (2002). Risk theory with a nonlinear dividend barrier. Computing 68(4), 289–311. doi: 10.1007/s00607-001-1447-4
Albrecher H. & Lautscham V. (2015). Dividends and the time of ruin under barrier strategies with a capital-exchange agreement. Anales del Instituto de Actuarios Españoles 21(3), 1–30.
Almudevar A. (2001). A dynamic programming algorithm for the optimal control of piecewise deterministic Markov processes. SIAM Journal on Control and Optimization 40(2), 525–539. doi: 10.1137/S0363012999364474
Asmussen S. & Albrecher H. (2010). Ruin probabilities, 2nd ed. Advanced Series on Statistical Science and Applied Probability. Hackensack, NJ: World Scientific.
Bakhvalov N. S. (1959). On the approximate calculation of multiple integrals. Vestnik MGU, Series Mathematical, Mechanics & Astronomy Physical Chemistry 4, 3–18. In Russian.
Bäuerle N. & Rieder U. (2010). Optimal control of piecewise deterministic Markov processes with finite time horizon. Modern Trends in Controlled Stochastic Processes: Theory and Applications 123, 143.
Bäuerle N. & Rieder U. (2011). Markov decision processes with applications to finance. Heidelberg: Universitext, Springer.
Cai J., Feng R. & Willmot G. E. (2009). On the expectation of total discounted operating costs up to default and its applications. Advances in Applied Probability 41(2), 495–522. doi: 10.1239/aap/1246886621
Colaneri K., Eksi Z., Frey R. & Szölgyenyi M. (2017). Optimal liquidation under partial information with price impact. arXiv:1606.05079
Costa O. L. & Davis M. H. A. (1989). Impulse control of piecewise-deterministic processes. Mathematics of Control, Signals, and Systems (MCSS) 2(3), 187–206. doi: 10.1007/BF02551384
Costa O. L. & Dufour F. (2013). Continuous average control of piecewise deterministic Markov processes. New York: Springer.
Coulibaly I. & Lefèvre C. (2008). On a simple quasi-Monte Carlo approach for classical ultimate ruin probabilities. Insurance: Mathematics and Economics 42(3), 935–942.
Dassios A. & Embrechts P. (1989). Martingales and insurance risk. Communications in Statistics. Stochastic Models 5(2), 181–217. doi: 10.1080/15326348908807105
Davis M. H. A. (1984). Piecewise-deterministic Markov processes: A general class of nondiffusion stochastic models. Journal of the Royal Statistical Society Series B 46(3), 353–388. With discussion.
Davis M. H. A. (1993). Markov models and optimization. Monographs on Statistics and Applied Probability. London: Chapman & Hall.
Davis M. H. A. & Farid M. (1999). Piecewise-deterministic processes and viscosity solutions. In W. M. McEneaney, G. George Yin, and Q. Zhang, eds., Stochastic Analysis, Control, Optimization and Applications. Boston: Springer. P. 249–268.
de Saporta B., Dufour F. & Zhang H. (2016). Numerical methods for simulation and optimization of piecewise deterministic Markov processes. Mathematics and Statistics Series. London: ISTE; Hoboken, NJ: John Wiley & Sons, Inc.
de Saporta B., Dufour F., Zhang H. & Elegbede C. (2012). Optimal stopping for the predictive maintenance of a structure subject to corrosion. Journal of Risk and Reliability 226(2), 169–181.
Dempster M. A. H. & Ye J. J. (1992). Necessary and sufficient optimality conditions for control of piecewise deterministic Markov processes. Stochastics: An International Journal of Probability and Stochastic Processes 40(3–4), 125–145.
Eichler A., Leobacher G. & Szölgyenyi M. (2017). Utility indifference pricing of insurance catastrophe derivatives. European Actuarial Journal 7(2), 515–534. doi: 10.1007/s13385-017-0154-2
Embrechts P. & Schmidli H. (1994). Ruin estimation for a general insurance risk model. Advances in Applied Probability 26(2), 404–422. doi: 10.2307/1427443
Ethier S. N. & Kurtz T. G. (1986). Markov processes. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. New York: John Wiley & Sons, Inc.
Forwick L., Schäl M. & Schmitz M. (2004). Piecewise deterministic Markov control processes with feedback controls and unbounded costs. Acta Applicandae Mathematica 82(3), 239–267. doi: 10.1023/B:ACAP.0000031200.76583.75
Grigorian A. (2009). Ordinary differential equation. Lecture notes. https://www.math.uni-bielefeld.de/grigor/odelec2009.pdf
Hinrichs A., Novak E., Ullrich M. & Woźniakowski H. (2017). Product rules are optimal for numerical integration in classical smoothness spaces. Journal of Complexity 38, 39–49. doi: 10.1016/j.jco.2016.09.001
Jacobsen M. (2006). Point process theory and applications. Probability and its Applications. Boston, MA: Birkhäuser Boston, Inc.
Joe S. & Kuo F. Y. (2008). Constructing Sobol' sequences with better two-dimensional projections. SIAM Journal of Scientific Computation 30, 2635–2654. doi: 10.1137/070709359
Kallenberg O. (2002). Foundations of modern probability, 2nd ed. Probability and its Applications (New York). New York: Springer-Verlag.
Kamke E. (1964). Differentialgleichungen. I. Gewöhnliche Differentialgleichungen. 5th ed. Leipzig: Akademische Verlagsgesellschaft.
Kritzer P., Pillichshammer F. & Wasilkowski G. W. (2016). Very low truncation dimension for high dimensional integration under modest error demand. Journal of Complexity 35, 63–85. doi: 10.1016/j.jco.2016.02.002
Kuo F. Y. (n.d.). F. Y. Kuo's homepage. http://web.maths.unsw.edu.au/~fkuo/sobol/index.html. Last visited 14/12/2017.
Kuo F. Y., Sloan I. H., Wasilkowski G. W. & Woźniakowski H. (2010). Liberating the dimension. Journal of Complexity 26, 422–454. doi: 10.1016/j.jco.2009.12.003
Kurtz T. G. & Protter P. E. (1996). Weak convergence of stochastic integrals and differential equations I. In D. Talay and L. Tubaro, eds., Probabilistic Models for Nonlinear Partial Differential Equations. Berlin, Heidelberg: Springer.
Lenhart S. & Liaot Y. (1985). Integro-differential equations associated with optimal stopping time of a piecewise-deterministic process. Stochastics: An International Journal of Probability and Stochastic Processes 15(3), 183–207. doi: 10.1080/17442508508833356
Leobacher G. & Ngare P. (2016). Utility indifference pricing of derivatives written on industrial loss indexes. Journal of Computational and Applied Mathematics 300, 68–82. doi: 10.1016/j.cam.2015.11.028
Niederreiter H. (1992). Random number generation and quasi-Monte Carlo methods. CBMS-NSF Regional Conference Series in Applied Mathematics, Philadelphia: SIAM.
Owen A. B. (2000). Monte Carlo, quasi-Monte Carlo, and randomized quasi-Monte Carlo. In H. Niederreiter and J. Spanier, eds., Monte Carlo and Quasi-Monte Carlo Methods 1998. Springer. P. 86–97.
Pausinger F. & Svane A. M. (2015). A Koksma–Hlawka inequality for general discrepancy systems. Journal of Complexity 31, 773–793. doi: 10.1016/j.jco.2015.06.002
Preischl M., Thonhauser S. & Tichy R. F. (2018). Integral equations, quasi-Monte Carlo methods and risk modeling. In J. Dick, F. Y. Kuo and H. Woźniakowski, eds., Contemporary Computational Mathematics – A Celebration of the 80th Birthday of Ian Sloan, Vol. 1, 2. Cham: Springer. P. 1051–1074.
Riedler M. G. (2013). Almost sure convergence of numerical approximations for piecewise deterministic Markov processes. Journal of Computational and Applied Mathematics 239, 50–71. doi: 10.1016/j.cam.2012.09.021
Rolski T., Schmidli H., Schmidt V. & Teugels J. (1999). Stochastic processes for insurance and finance. Wiley Series in Probability and Statistics. New York: John Wiley & Sons.
Schäl M. (1998). On piecewise deterministic Markov control processes: Control of jumps and of risk processes in insurance. Insurance: Mathematics and Economics 22(1), 75–91.
Siegl T. & Tichy R. F. (2000). Ruin theory with risk proportional to the free reserve and securitization. Insurance: Mathematics and Economics 26(1), 59–73.
Sloan I. H. & Woźniakowski H. (1998). When are quasi-Monte Carlo algorithms efficient for high dimensional integrals? Journal of Complexity 14, 1–33. doi: 10.1006/jcom.1997.0463
Tichy R. F. (1984). Über eine zahlentheoretische Methode zur numerischen Integration und zur Behandlung von Integralgleichungen. Österreichische Akademie der Wissenschaften Mathematisch-Naturwissenschaftliche Klasse. Sitzungsberichte. Abteilung II 193(4–7), 329–358.

[CIT0001] Albrecher H. & Kainhofer R. (2002). Risk theory with a nonlinear dividend barrier. Computing 68(4), 289–311. doi: 10.1007/s00607-001-1447-4 [DOI] [Google Scholar]

[CIT0002] Albrecher H. & Lautscham V. (2015). Dividends and the time of ruin under barrier strategies with a capital-exchange agreement. Anales del Instituo de Actuarios Espanoles 21(3), 1–30. [Google Scholar]

[CIT0003] Almudevar A. (2001). A dynamic programming algorithm for the optimal control of piecewise deterministic Markov processes. SIAM Journal on Control and Optimization 40(2), 525–539. doi: 10.1137/S0363012999364474 [DOI] [Google Scholar]

[CIT0004] Asmussen S. & Albrecher H. (2010). Ruin probabilities, 2nd ed Advanced Series on Statistical Science and Applied Probability Hackensack, NJ: World Scientific. [Google Scholar]

[CIT0005] Bakhvalov N. S. (1959). On the approximate calculation of multiple integrals. Vestnik MGU, Series Mathematical, Mechanics & Astronomy Physical Chemistry 4, 3–18. In Russian. [Google Scholar]

[CIT0006] Bäuerle N. & Rieder U. (2010). Optimal control of piecewise deterministic Markov processes with finite time horizon. Modern Trends in Controlled Stochastic Processes: Theory and Applications 123, 143. [Google Scholar]

[CIT0007] Bäuerle N. & Rieder U. (2011). Markov decision processes with applications to finance. Heidelberg: Universitext, Springer. [Google Scholar]

[CIT0008] Cai J., Feng R. & Willmot G. E. (2009). On the expectation of total discounted operating costs up to default and its applications. Advances in Applied Probability 41(2), 495–522. doi: 10.1239/aap/1246886621 [DOI] [Google Scholar]

[CIT0009] Colaneri K., Eksi Z., Frey R. & Szölgyenyi M. (2017). Optimal liquidation under partial information with price impact. arXiv:1606.05079

[CIT0010] Costa O. L. & Davis M. H. A. (1989). Impulse control of piecewise-deterministic processes. Mathematics of Control, Signals, and Systems (MCSS) 2(3), 187–206. doi: 10.1007/BF02551384 [DOI] [Google Scholar]

[CIT0011] Costa O. L. & Dufour F. (2013). Continuous average control of piecewise deterministic Markov processes. New York: Springer. [Google Scholar]

[CIT0012] Coulibaly I. & Lefèvre C. (2008). On a simple quasi-Monte Carlo approach for classical ultimate ruin probabilities. Insurance: Mathematics and Economics 42(3), 935–942. [Google Scholar]

[CIT0013] Dassios A. & Embrechts P. (1989). Martingales and insurance risk. Communications in Statistics. Stochastic Models 5(2), 181–217. doi: 10.1080/15326348908807105 [DOI] [Google Scholar]

[CIT0014] Davis M. H. A. (1984). Piecewise-deterministic Markov processes: A general class of nondiffusion stochastic models. Journal of the Royal Statistical Society Series B 46(3), 353–388. With discussion. [Google Scholar]

[CIT0015] Davis M. H. A. (1993). Markov models and optimization. Monographs on Statistics and Applied Probability London: Chapman & Hall. [Google Scholar]

[CIT0016] Davis M. H. A. & Farid M. (1999). Piecewise-deterministic processes and viscosity solutions. In W. M. McEneaney, G. George Yin, and Q. Zhang, eds., Stochastic Analysis, Control, Optimization and Applications. Boston: Springer. P. 249–268.

[CIT0017] de Saporta B., Dufour F. & Zhang H. (2016). Numerical methods for simulation and optimization of piecewise deterministic markov processes. Mathematics and Statistics Series London: ISTE; Hoboken, NJ: John Wiley & Sons, Inc. [Google Scholar]

[CIT0018] de Saporta B., Dufour F., Zhang H. & Elegbede C. (2012). Optimal stopping for the predictive maintenance of a structure subject to corrosion. Journal of Risk and Reliability 226(2), 169–181. [Google Scholar]

[CIT0019] Dempster M. A. H. & Ye J. J. (1992). Necessary and sufficient optimality conditions for control of piecewise deterministic Markov processes. Stochastics: An International Journal of Probability and Stochastic Processes 40(3–4), 125–145. [Google Scholar]

[CIT0020] Eichler A., Leobacher G. & Szölgyenyi M. (2017). Utility indifference pricing of insurance catastrophe derivatives. European Actuarial Journal 7(2), 515–534. doi: 10.1007/s13385-017-0154-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CIT0021] Embrechts P. & Schmidli H. (1994). Ruin estimation for a general insurance risk model. Advances in Applied Probability 26(2), 404–422. doi: 10.2307/1427443 [DOI] [Google Scholar]

[CIT0022] Ethier S. N. & Kurtz T. G. (1986). Markov processes. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics New York: John Wiley & Sons, Inc. [Google Scholar]

[CIT0023] Forwick L., Schäl M. & Schmitz M. (2004). Piecewise deterministic Markov control processes with feedback controls and unbounded costs. Acta Applicandae Mathematica 82(3), 239–267. doi: 10.1023/B:ACAP.0000031200.76583.75 [DOI] [Google Scholar]

[CIT0024] Grigorian A. (2009). Ordinary differential equation. Lecture notes. https://www.math.uni-bielefeld.de/grigor/odelec2009.pdf

[CIT0025] Hinrichs A., Novak E., Ullrich M. & Woźniakowski H. (2017). Product rules are optimal for numerical integration in classical smoothness spaces. Journal of Complexity 38, 39–49. doi: 10.1016/j.jco.2016.09.001 [DOI] [Google Scholar]

[CIT0026] Jacobsen M. (2006). Point process theory and applications. Probability and its Applications Boston, MA: Birkhäuser Boston, Inc. [Google Scholar]

[CIT0027] Joe S. & Kuo F. Y. (2008). Constructing Sobol' sequences with better two-dimensional projections. SIAM Journal on Scientific Computing 30, 2635–2654. doi: 10.1137/070709359

[CIT0028] Kallenberg O. (2002). Foundations of modern probability. 2nd ed. Probability and its Applications (New York). New York: Springer-Verlag.

[CIT0029] Kamke E. (1964). Differentialgleichungen. I. Gewöhnliche Differentialgleichungen. 5th ed. Leipzig: Akademische Verlagsgesellschaft.

[CIT0030] Kritzer P., Pillichshammer F. & Wasilkowski G. W. (2016). Very low truncation dimension for high dimensional integration under modest error demand. Journal of Complexity 35, 63–85. doi: 10.1016/j.jco.2016.02.002

[CIT0031] Kuo F. Y. (n.d.). F. Y. Kuo's homepage. http://web.maths.unsw.edu.au/~fkuo/sobol/index.html. Last visited 14/12/2017.

[CIT0032] Kuo F. Y., Sloan I. H., Wasilkowski G. W. & Woźniakowski H. (2010). Liberating the dimension. Journal of Complexity 26, 422–454. doi: 10.1016/j.jco.2009.12.003

[CIT0033] Kurtz T. G. & Protter P. E. (1996). Weak convergence of stochastic integrals and differential equations I. In D. Talay and L. Tubaro, eds., Probabilistic Models for Nonlinear Partial Differential Equations. Berlin, Heidelberg: Springer.

[CIT0034] Lenhart S. & Liao Y. (1985). Integro-differential equations associated with optimal stopping time of a piecewise-deterministic process. Stochastics: An International Journal of Probability and Stochastic Processes 15(3), 183–207. doi: 10.1080/17442508508833356

[CIT0035] Leobacher G. & Ngare P. (2016). Utility indifference pricing of derivatives written on industrial loss indexes. Journal of Computational and Applied Mathematics 300, 68–82. doi: 10.1016/j.cam.2015.11.028

[CIT0036] Niederreiter H. (1992). Random number generation and quasi-Monte Carlo methods. CBMS-NSF Regional Conference Series in Applied Mathematics, Philadelphia: SIAM.

[CIT0037] Owen A. B. (2000). Monte Carlo, quasi-Monte Carlo, and randomized quasi-Monte Carlo. In H. Niederreiter and J. Spanier, eds., Monte Carlo and Quasi-Monte Carlo Methods 1998. Springer. pp. 86–97.

[CIT0038] Pausinger F. & Svane A. M. (2015). A Koksma–Hlawka inequality for general discrepancy systems. Journal of Complexity 31, 773–793. doi: 10.1016/j.jco.2015.06.002

[CIT0039] Preischl M., Thonhauser S. & Tichy R. F. (2018). Integral equations, quasi-Monte Carlo methods and risk modeling. In J. Dick, F. Y. Kuo and H. Woźniakowski, eds., Contemporary Computational Mathematics – A Celebration of the 80th Birthday of Ian Sloan, Vol. 1, 2. Cham: Springer. pp. 1051–1074.

[CIT0040] Riedler M. G. (2013). Almost sure convergence of numerical approximations for piecewise deterministic Markov processes. Journal of Computational and Applied Mathematics 239, 50–71. doi: 10.1016/j.cam.2012.09.021

[CIT0041] Rolski T., Schmidli H., Schmidt V. & Teugels J. (1999). Stochastic processes for insurance and finance. Wiley Series in Probability and Statistics. New York: John Wiley & Sons.

[CIT0042] Schäl M. (1998). On piecewise deterministic Markov control processes: Control of jumps and of risk processes in insurance. Insurance: Mathematics and Economics 22(1), 75–91.

[CIT0043] Siegl T. & Tichy R. F. (2000). Ruin theory with risk proportional to the free reserve and securitization. Insurance: Mathematics and Economics 26(1), 59–73.

[CIT0044] Sloan I. H. & Woźniakowski H. (1998). When are quasi-Monte Carlo algorithms efficient for high dimensional integrals? Journal of Complexity 14, 1–33. doi: 10.1006/jcom.1997.0463

[CIT0045] Tichy R. F. (1984). Über eine zahlentheoretische Methode zur numerischen Integration und zur Behandlung von Integralgleichungen. Österreichische Akademie der Wissenschaften Mathematisch-Naturwissenschaftliche Klasse. Sitzungsberichte. Abteilung II 193(4–7), 329–358.

4. Cubature rules for $C^{κ}$ -functions