Distorted probability operator for dynamic portfolio optimization in times of socio-economic crisis

Kerem Uğurlu; Tomasz Brzeczek

doi:10.1007/s10100-022-00834-0

2022 Dec 9:1–18. Online ahead of print. doi: 10.1007/s10100-022-00834-0

PMCID: PMC9734642 PMID: 36531521

Abstract

Robust optimal control of discrete-time Markov chains with finite terminal time T and bounded costs or wealth is studied using probability distortion. The time inconsistency of these distortion operators, and hence the failure of dynamic programming for them, is discussed. Dynamic versions of these operators are therefore introduced, and their suitability for dynamic programming is demonstrated. Based on the dynamic programming algorithm, the existence of an optimal policy is justified, and an application of the theory to portfolio optimization is presented along with a numerical study.

Keywords: Probability distortion, Markov decision processes, Dynamic programming, Risk management, Mathematical finance

Introduction

Political and economic crises since the 1990s and the recent global Covid-19 pandemic have caused unstable market periods. Most recently, during the pandemic, stock prices witnessed sharp drops and peaks (Batabyal and Killinis 2021). Such a turmoil event is called a black swan. Black swans are said to be unpredictable with mathematical models because they are not reflected in historical data (Phillips 2019; Werther 2017). However, the monthly expected rate of return of any stock investment estimated from a given data period is certainly overestimated if turbulence is going to occur (Focard and Fabozzi 2009). Estimates from good market periods are biased by overestimating the probability of extremely high positive returns. Distortion operators are a well-known technique for mitigating such optimistic expectations. They are used frequently in behavioural finance (see e.g. Kahneman and Tversky 1979, Kahneman and Tversky 1992, Zhou 2010). Their use is motivated by empirical studies in behavioural finance and aims to model the human tendency to exaggerate small probabilities of extreme events (Wakker 2010). It is therefore natural to consider distortion operators for estimating risk during a global crisis such as the recent pandemic. They reduce the error of naively predicting future returns from a current state of the market that is changing all the time. The distortion operator is shown to be robust to optimistic high returns: by down-weighting the chance of positive returns, it is more sensitive to the risk of negative returns than the expectation operator. In particular, the distortion operator is risk-averse. It has a lot in common with value investment strategies, which have been shown to outperform the market in return-risk efficiency even in the long term, as observed since the economic turmoil of 2008 (see Elze 2012, Majewski et al. 2020).

In that respect, models with risk aversion have a long history, and they are represented via alternative approaches. One of these approaches uses concave utility functions to model risk aversion (see e.g. Chung and Sobel 1987, Fleming and Sheu 1999, Fleming and Sheu 2000, Jaquette 1973, Jaquette 1976 and the references therein), where the utility functions are required to satisfy some regularity properties. Another approach uses the so-called coherent risk measures introduced in Artzner et al. (1999). Here, the risk measures are again required to satisfy some axioms, which model the random outcomes while taking risk awareness into consideration. We refer the reader to Artzner et al. (2007); Ruszczynski (2010); Cheridito et al. (2006); Eichhorn and Romisch (2005); Follmer and Penner (2006); Fritelli et al. (2002, 2005) and the references therein for further details on risk measures.

On the other hand, although modelling random gains and losses using probability distortion goes back at least to the 1970s (Kahneman and Tversky 1979), its axiomatic incorporation into multiperiod settings is still absent in the literature. There are a few recent works in this direction. To mention a few, He and Zhou (2011) study a portfolio optimization problem in continuous time using probability distortion, whereas Kun et al. (2018) study a discrete-time controlled Markov chain over an infinite horizon. In another work, (Ma et al. yyy) assume monotonicity of the cost/gain functions and present their results under that assumption. The literature in the multiperiod setting is scarce because the distortion operator is of limited use in control problems: it does not satisfy the “Dynamic Programming Principle” (DPP), also called the “Bellman Optimality Principle”. Namely, a sequence of optimization problems with the corresponding optimal controls is called time-consistent if the optimal strategy obtained by solving the optimal control problem (OCP) at time s remains optimal when the OCP is solved at a later time $t > s$ (Bjork et al. 2017). OCPs are of vital importance in various fields of operations research. In this paper, we introduce a dynamic version of probability distortion that does not suffer from time-inconsistency. The DPP can be applied readily in our framework under controlled Markov chains, and it additionally yields the existence of an optimal policy.

The rest of the paper is organized as follows. In Sect. 2, we first describe probability distortion on random variables in a static one-period case. Next, we introduce the concept of dynamic probability distortion on stochastic processes in a multi-temporal discrete-time setting. In Sect. 3, we introduce the controlled Markov chain framework that we work in. In Sect. 4, we state and solve the optimal control problem under study by characterizing both the optimal value and the optimal policies as solutions of the dynamic programming equations. In Sect. 5, we illustrate our results on a portfolio optimization problem and conclude the paper.

Probability distortion

In this section, the concept of probability distortion along with the corresponding operator and its properties are introduced. These definitions are further extended to the multi-temporal/dynamic setting.

Probability distortion on random variables

Let $(Ω, F, P)$ be a probability space and denote by $L_{+}^{\infty} (Ω, F, P)$ the set of non-negative essentially bounded random variables on $(Ω, F)$ .

Definition 2.1

A mapping $w : [0, 1] \to [0, 1]$ is called a distortion function, if it is continuous, strictly increasing, and satisfies $w (0) = 0$ and $w (1) = 1$ .
For any $ξ \in L_{+}^{\infty} (Ω, F, P)$ , the operator with respect to the distortion function w is defined by
$\begin{matrix} ρ (ξ) ≜ \int_{0}^{\infty} w (P (ξ \geq z)) d z \end{matrix}$ 2.1
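As an illustration of (2.1), the following minimal Python sketch evaluates the distortion operator for a non-negative random variable with finitely many values, in which case the integral reduces to a finite sum over the steps of the survival function. The function name `distortion_value` and its arguments are illustrative assumptions and do not appear in the paper.

```python
from typing import Callable, Sequence


def distortion_value(values: Sequence[float],
                     probs: Sequence[float],
                     w: Callable[[float], float]) -> float:
    """Evaluate rho(xi) = int_0^inf w(P(xi >= z)) dz for a discrete xi >= 0."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        # On (prev, v] the survival probability P(xi >= z) equals `tail`.
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)  # guard against floating-point undershoot
    return total


# xi takes the values 1 and 2 with probability 1/2 each.
plain = distortion_value([1.0, 2.0], [0.5, 0.5], lambda x: x)         # w(x) = x: the expectation
distorted = distortion_value([1.0, 2.0], [0.5, 0.5], lambda x: x ** 0.5)  # w(x) = sqrt(x)
```

With the identity distortion the sketch returns the expectation of the variable; with the square-root distortion it returns $1 + {(\frac{1}{2})}^{1 / 2}$ .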

Lemma 2.1

(i)
Let $x, y, α \in [0, 1]$ , $ξ \in L_{+}^{\infty} (Ω, F, P)$ and $w : [0, 1] \to [0, 1]$ be a distortion function that satisfies
$\begin{matrix} w (α x + (1 - α) y) \geq α w (x) + (1 - α) w (y) . \end{matrix}$ 2.2
Then, $ρ (ξ) \geq E [ξ]$ . Namely, for $ξ$ representing the nonnegative bounded random losses, $ρ (\cdot)$ evaluates a bigger risk for $ξ$ than $E [\cdot]$ does.
(ii)
Conversely, suppose w satisfies for $α \in [0, 1]$
$\begin{matrix} w (α x + (1 - α) y) \leq α w (x) + (1 - α) w (y), \end{matrix}$ 2.3
then $ρ (ξ) \leq E [ξ]$ . Namely, for $ξ$ representing the nonnegative bounded random gains, $ρ (\cdot)$ evaluates a smaller gain for $ξ$ than $E [\cdot]$ does.

Proof

We will only prove the first part. By $w (0) = 0$ and (2.2) with $y = 0$ , we have $w (α x) \geq α w (x)$ for any $α \in [0, 1]$ . In particular, for $x = 1$ , we get $w (α) \geq α$ for any $α \in [0, 1]$ . Thus, $w (P (ξ \geq z)) \geq P (ξ \geq z)$ for any $z \geq 0$ . Integrating both sides over $[0, \infty)$ and using $E [ξ] = \int_{0}^{\infty} P (ξ \geq z) d z$ for non-negative $ξ$ , we conclude the result. $□$

Remark 2.1

Lemma 2.1 implies that (2.2), respectively (2.3), is an appropriate property of the distortion function w for modelling risk averse behaviour towards random costs, respectively towards random profits.
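The two inequalities of Lemma 2.1 can be checked numerically on a small example. The sketch below is only an illustration under stated assumptions: the three-point distribution is made up, the concave distortion is taken to be $w (x) = x^{1 / 2}$ and the convex one $w (x) = x^{2}$ , and `distortion_value` is a hypothetical helper for discrete random variables.

```python
def distortion_value(values, probs, w):
    """rho(xi) = int_0^inf w(P(xi >= z)) dz for a discrete, non-negative xi."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)   # survival probability is `tail` on (prev, v]
        prev = v
        tail = max(tail - p, 0.0)
    return total


# Hypothetical distribution, chosen only for illustration.
values = [0.0, 1.0, 4.0]
probs = [0.2, 0.5, 0.3]

expectation = sum(v * p for v, p in zip(values, probs))
rho_concave = distortion_value(values, probs, lambda x: x ** 0.5)  # satisfies (2.2)
rho_convex = distortion_value(values, probs, lambda x: x ** 2)     # satisfies (2.3)

print(rho_convex, expectation, rho_concave)
```

The concave distortion inflates the evaluation of a loss above its expectation, while the convex one deflates the evaluation of a gain below it, which is the risk-averse reading given in Remark 2.1.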

Lemma 2.2

Let $ρ : L_{+}^{\infty} (Ω, F, P) \to R$ be the distortion operator as in (2.1). Then

(i)
$ρ$ is positively translation invariant, i.e., $ρ (ξ + c) = ρ (ξ) + c$ for $c \geq 0$ . In particular, $ρ (c) = c$ for any $c \geq 0$ .
(ii)
$ρ$ is positively homogeneous, i.e. $ρ (λ ξ) = λ ρ (ξ)$ for $λ \geq 0$ .
(iii)
$ρ$ is monotone, i.e. $ρ (ξ_{1}) \leq ρ (ξ_{2})$ for $ξ_{1}, ξ_{2} \in L_{+}^{\infty} (Ω, F, P)$ and $ξ_{1} \leq ξ_{2}$ .

Proof

(i)

$\begin{matrix} ρ (ξ + c) & = \int_{0}^{\infty} w (P (ξ + c \geq z)) d z \\ & = \int_{0}^{\infty} w (P (ξ \geq z - c)) d z \\ & = \int_{- c}^{0} w (P (ξ \geq z)) d z + \int_{0}^{\infty} w (P (ξ \geq z)) d z \\ & = c + ρ (ξ) \end{matrix}$
Moreover, we have
$\begin{matrix} ρ (0) & = \int_{0}^{\infty} w (P (0 \geq z)) d z \\ & = \int_{\{0\}} w (P (0 \geq z)) d z \\ & = \int_{\{0\}} w (1) d z = 0 \end{matrix}$
Hence, by the first equality above, we have $ρ (0 + c) = c$ for $c \geq 0$ .
(ii)

$\begin{matrix} ρ (λ ξ) & = \int_{0}^{\infty} w (P (λ ξ \geq z)) d z \\ & = λ \int_{0}^{\infty} w (P (ξ \geq \frac{z}{λ})) d \frac{z}{λ} \\ & = λ ρ (ξ) \end{matrix}$
(iii)
Since $ξ_{1} \leq ξ_{2}$ and w is monotone, we have for any $z \geq 0$
$\begin{matrix} P (ξ_{1} \geq z) & \leq P (ξ_{2} \geq z) \\ w (P (ξ_{1} \geq z)) & \leq w (P (ξ_{2} \geq z)) \end{matrix}$
Thus, we have $ρ (ξ_{1}) \leq ρ (ξ_{2})$ . $□$
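The three properties of Lemma 2.2 can also be verified numerically. The following sketch is a sanity check rather than a proof: the distribution, the constant `c`, the scalar `lam` and the helper `distortion_value` are all hypothetical choices, and the two random variables are assumed to live on the same three-point sample space.

```python
import math


def distortion_value(values, probs, w):
    """rho(xi) = int_0^inf w(P(xi >= z)) dz for a discrete, non-negative xi."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)
    return total


w = math.sqrt                      # one admissible distortion function
probs = [0.2, 0.5, 0.3]            # probabilities of the three sample points
xi1 = [0.5, 2.0, 3.0]              # xi_1 evaluated at each sample point
xi2 = [0.5, 2.5, 4.0]              # xi_2 >= xi_1 pointwise
c, lam = 1.5, 2.0

rho1 = distortion_value(xi1, probs, w)
shifted = distortion_value([x + c for x in xi1], probs, w)    # rho(xi_1 + c)
scaled = distortion_value([lam * x for x in xi1], probs, w)   # rho(lam * xi_1)
rho2 = distortion_value(xi2, probs, w)
```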

Dynamic probability distortion on stochastic processes

The main issue occurs when one tries to extend (2.1) to the multi-period setting. In particular, it is not clear what the “conditional version” of the distortion operator is. Hence, the corresponding operator for the multi-temporal dynamic setting is constructed first.

Fix $T \in N_{0}$ and denote $T ≜ [0, 1, \dots, T]$ and $\tilde{T} ≜ [0, 1, \dots, T - 1]$ . Let $Ω$ be the sample space with its sigma algebra denoted by $F$ . Let $F_{0} \subset F_{1} \subset \dots \subset F_{T} \subset F$ be a filtration and $P$ a probability measure on $Ω$ such that $(Ω, F, {(F_{t})}_{t \in T}, P)$ is a stochastic basis. Let $ξ = {(ξ_{t})}_{t \in T}$ be a discrete-time non-negative stochastic process that is adapted to the filtration ${(F_{t})}_{t \in T}$ and uniformly bounded, that is, $\sup_{t \in T} \operatorname{ess\,sup} (ξ_{t}) < \infty$ . In that case we write $ξ \in L_{+} (Ω, {(F_{t})}_{t \in T}, P)$ . Then, if we define for $t \in T$

\begin{matrix} ρ_{t} (ξ_{T}) ≜ \int_{0}^{\infty} w (P (ξ_{T} \geq z | F_{t})) d z, \end{matrix}

we do not necessarily have

\begin{matrix} ρ (ξ_{T}) = ρ (ρ_{t} (ξ_{T})) . \end{matrix}

In particular, the “tower property” of the expectation operator fails for distortion operators (see Example 2.1 below). In the context of stochastic optimization, this implies that the optimization problem becomes “time-inconsistent”, i.e. the “Dynamic Programming Principle” (DPP) does not hold. On the other hand, for $w (x) = x$ , the distortion operator (2.1) reduces to the expectation operator: writing $E_{t} [ξ_{T}] ≜ E [ξ_{T} | F_{t}]$ , we have $E [ξ_{T}] = E [E_{t} [ξ_{T}]]$ by the tower property, and the DPP holds.

Analogous to Definition 2.1, we define first dynamic distortion mappings on a filtered probability space $(Ω, F, {(F_{t})}_{t \in T}, P)$ in a multitemporal setting as follows.

Definition 2.2

Let $t \in \tilde{T}$ and $ξ_{t + 1} \in L_{+}^{\infty} (Ω, F_{t + 1}, P)$ . Also consider the distortion function $w (\cdot)$ as in Definition 2.1.

A one-step dynamic distortion mapping $ϱ_{t + 1 | t} : L_{+}^{\infty} (Ω, F_{t + 1}, P) \to L_{+}^{\infty} (Ω, F_{t}, P)$ is defined as
$\begin{matrix} ϱ_{t + 1 | t} (ξ_{t + 1}) & ≜ \int_{0}^{\infty} w (P (ξ_{t + 1} \geq z_{t + 1} | F_{t})) d z_{t + 1} \end{matrix}$
A mapping $ϱ_{t} : L_{+}^{\infty} (Ω, F_{T}, P) \to L_{+}^{\infty} (Ω, F_{t}, P)$ is called a dynamic distortion mapping if it is a composition of one-step dynamic distortion mappings of the form
$\begin{matrix} ϱ_{t} ≜ ϱ_{t + 1 | t} \circ \dots \circ ϱ_{T | T - 1} \end{matrix}$

Remark 2.2

Definition 2.2 is well defined. Indeed, let $ξ_{T} \in L_{+}^{\infty} (Ω, F_{T}, P)$ . Going backwards iteratively, by the properties of w and the construction of $ϱ_{t}$ , uniform boundedness and $F_{s}$ -measurability are preserved at each $s \in [t, \dots, T]$ , so that $ϱ_{t}$ maps $ξ_{T}$ into $L_{+}^{\infty} (Ω, F_{t}, P)$ . Furthermore, by construction $ϱ_{s} (\cdot) = ϱ_{s} (ϱ_{t} (\cdot))$ for $0 \leq s \leq t \leq T$ . In particular, it is a time-consistent operator.
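The backward composition in Definition 2.2 can be sketched on a finite scenario tree, where conditioning on $F_{t}$ amounts to fixing a node at depth t. In the sketch below, the tree encoding (a terminal value, or a list of `(conditional probability, subtree)` pairs), the example numbers and the function names are assumptions made for illustration only.

```python
def distortion_value(values, probs, w):
    """One-step operator: int_0^inf w(P(xi >= z)) dz for a discrete xi >= 0."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)
    return total


def dynamic_distortion(node, w):
    """Compose one-step distortion mappings backwards from the leaves to `node`."""
    if isinstance(node, (int, float)):       # leaf: terminal value xi_T
        return float(node)
    probs = [p for p, _ in node]
    # First evaluate each child, then apply the one-step mapping at this node.
    values = [dynamic_distortion(child, w) for _, child in node]
    return distortion_value(values, probs, w)


# Hypothetical two-period tree with conditional branch probabilities.
tree = [
    (0.5, [(0.5, 4.0), (0.5, 1.0)]),
    (0.5, [(0.3, 2.0), (0.7, 0.0)]),
]
value_at_0 = dynamic_distortion(tree, lambda x: x ** 2)
```

Because the value at the root is computed only from the values at the depth-one nodes, the recursion is time consistent by construction, in the sense of Remark 2.2.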

Lemma 2.3

For $t \in T$ , let $ϱ_{t} : L_{+}^{\infty} (Ω, F_{T}, P) \to L_{+}^{\infty} (Ω, F_{t}, P)$ be the dynamic distortion operator as in Definition 2.2 and $ξ, ξ_{1}, ξ_{2} \in L_{+}^{\infty} (Ω, F_{T}, P)$ . Then

(i)
$ϱ_{t}$ is positively translation invariant, i.e., $ϱ_{t} (ξ + c) = ϱ_{t} (ξ) + c$ $P$ -a.s., if c is nonnegative and $F_{t}$ measurable.
(ii)
$ϱ_{t}$ is positively homogeneous, i.e. $ϱ_{t} (λ ξ) = λ ϱ_{t} (ξ)$ $P$ -a.s. for any scalar $λ \geq 0$ .
(iii)
$ϱ_{t}$ is monotone, i.e. $ϱ_{t} (ξ_{1}) \leq ϱ_{t} (ξ_{2})$ $P$ -a.s. for $ξ_{1} \leq ξ_{2}$ , $P$ -a.s..

Proof

The proof is a simple modification of Lemma 2.2. $□$

Next, we illustrate the failure of towering property that causes time inconsistency via the following example.

Example 2.1

Let X and Y be two i.i.d. random variables on some probability space $(Ω, F, P)$ with $P (X = 1) = P (X = 2) = \frac{1}{2}$ , $P (Y = 1) = P (Y = 2) = \frac{1}{2}$ and $w (x) = x^{1 / 2}$ . Let $ξ_{1} = X$ and $ξ_{2} = X + Y$ , with $F_{1} = σ (X)$ and $F_{2} = σ (X, Y)$ . Then

\begin{matrix} ρ_{2 | 1} (ξ_{2}) & = X + ρ_{2 | 1} (Y) \\ & = X + ρ (Y) \\ & = X + w (1) + w (\frac{1}{2}), \end{matrix}

where we use Lemma 2.3 (i) in the first equality and the independence of Y from $F_{1}$ in the second. Similarly, we have

\begin{matrix} ρ_{1 | 0} \circ ρ_{2 | 1} (ξ_{2}) & = ρ (X) + w (1) + w (\frac{1}{2}) \\ & = 2 w (1) + 2 w (\frac{1}{2}) \\ & = 2 + 2 {(\frac{1}{2})}^{1 / 2} . \end{matrix}

On the other hand, we have

\begin{matrix} ρ (ξ_{2}) & = \int_{0}^{\infty} w (P (ξ_{2} \geq z)) d z \\ & = \int_{[0, 2]} w (P (ξ_{2} \geq z)) d z + \int_{(2, 3]} w (P (ξ_{2} \geq z)) d z + \int_{(3, 4]} w (P (ξ_{2} \geq z)) d z \\ & = 2 w (1) + \int_{(2, 3]} w (3 / 4) d z + \int_{(3, 4]} w (1 / 4) d z \\ & = 2 + w (3 / 4) + w (1 / 4) \\ & = 2 + {(\frac{3}{4})}^{1 / 2} + {(\frac{1}{4})}^{1 / 2} \end{matrix}

Hence, $ρ_{1 | 0} \circ ρ_{2 | 1} (ξ_{2}) > ρ (ξ_{2})$ by strict concavity of w. We further note that the two expressions would be equal to each other, if $w (x) = x$ .
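The two quantities compared in Example 2.1 can be reproduced numerically. The sketch below assumes only the data of the example (X and Y i.i.d. on {1, 2} with equal probabilities and $w (x) = x^{1 / 2}$ ); the helper `distortion_value` and the variable names are illustrative.

```python
import math


def distortion_value(values, probs, w):
    """int_0^inf w(P(xi >= z)) dz for a discrete, non-negative xi."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)
    return total


w = math.sqrt
outcomes, half = [1.0, 2.0], [0.5, 0.5]

# Nested evaluation: first distort X + Y given X = x, then distort over X.
inner = [distortion_value([x + y for y in outcomes], half, w) for x in outcomes]
nested = distortion_value(inner, half, w)

# Static evaluation: distort the unconditional law of X + Y directly.
sums = [x + y for x in outcomes for y in outcomes]
static = distortion_value(sums, [0.25] * 4, w)

print(nested, static)   # approximately 3.414 and 3.366
```

Replacing `w` by the identity makes the two numbers coincide, which is the tower property of the expectation.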

Controlled Markov chain framework

In this section, we introduce the necessary background on discrete-time controlled Markov processes, also known as Markov decision processes (MDPs) (see e.g. Hernandez-Lerma and Lasserre 1996), that we are going to work with, now within the dynamic probability distortion framework.

We take the control model

\begin{matrix} M_{t} : = (X_{t}, A_{t}, K_{t}, Q_{t}, F, r_{t}) \end{matrix}

with the following components:

$X_{t}$ and $A_{t}$ denote the state and action (or control) space, respectively, which are assumed to be Borel spaces, that is, Borel subsets of complete and separable metric spaces with their corresponding Borel $σ$ -algebras $B (X_{t})$ and $B (A_{t})$ .
For each $x \in X_{t}$ , let $A_{t} (x) \subset A_{t}$ be the set of all admissible controls in the state $x$ . We assume that $A_{t} (x)$ is compact for $t \in T$ and denote
$\begin{matrix} K_{t} : = \{(x, a) : x \in X_{t}, a \in A_{t} (x)\} \end{matrix}$ 3.1
as the set of feasible state-action pairs.
We define the system function as
$\begin{matrix} x_{t + 1} ≜ F_{t} (x_{t}, a_{t}, η_{t}) \end{matrix}$ 3.2
for all $t \in \tilde{T}$ with $x_{t} \in X_{t}$ and $a_{t} \in A_{t}$ , and i.i.d. random variables ${(η_{t})}_{t \in \tilde{T}}$ defined on a probability space $(Y, B (Y), P^{η})$ and taking values in $Y$ , which is a complete separable Borel space. We assume that the mapping $(x, a) \to F_{t} (x, a, y)$ in (3.2) is continuous on $X_{t} \times A_{t}$ for every $y \in Y$ at every $t \in \tilde{T}$ .
Let
$\begin{matrix} Ω ≜ \otimes_{t = 0}^{T} X_{t} \end{matrix}$
and, for $t \in T$ , let
$\begin{matrix} F_{t} & = σ (X_{0}, A_{0}, \dots, X_{t - 1}, A_{t - 1}, X_{t}) \end{matrix}$
be the filtration of increasing $σ$ -algebras.
Let $F_{t}$ be the family of measurable functions and $π_{t} \in F_{t}$ with $π_{t} : X_{t} \to A_{t}$ for $t \in \tilde{T}$ . A sequence ${(π_{t})}_{t \in \tilde{T}}$ of functions $π_{t} \in F_{t}$ is called an admissible control policy (or simply a policy), and the function $π_{t} (\cdot)$ is called the decision rule or control at time t. We denote by $Π$ the set of all admissible control policies.
Let $r_{t} (x_{t}, a_{t}) : X_{t} \times A_{t} \to R_{+}$ for $t \in \tilde{T}$ and $r_{T} : X_{T} \to R_{+}$ be the non-negative real-valued reward-per-stage and terminal reward function, respectively. For ${(π_{t})}_{t \in \tilde{T}} \in Π$ , we write
$\begin{matrix} r_{t} (x_{t}, π_{t}) & ≜ r_{t} (x_{t}, π_{t} (x_{t})) \\ ≜ r_{t} (x_{t}, a_{t}) . \end{matrix}$
Let $π \in Π$ and $x_{0} \in X_{0}$ be given. Then, there exists a unique probability measure $P^{π}$ on $(Ω, F)$ such that, given $x \in X_{t}$ , a measurable set $B_{t + 1} \subset X_{t + 1}$ and $(x_{t}, a_{t}) \in K_{t}$ , for any $t \in \tilde{T}$ , we have
$\begin{matrix} Q_{t + 1} (B_{t + 1} | x_{t}, a_{t}) ≜ P_{t + 1}^{π} (x_{t + 1} \in B_{t + 1} | x_{t}, a_{t}, \dots, x_{0}) . \end{matrix}$
Here, $Q_{t + 1} (B_{t + 1} | x_{t}, a_{t})$ is the stochastic kernel (see e.g. Hernandez-Lerma and Lasserre 1996). Namely, for each pair $(x_{t}, a_{t}) \in K_{t}$ , $Q_{t + 1} (\cdot | x_{t}, a_{t})$ is a probability measure on $X_{t + 1}$ , and for each $B_{t + 1} \in B_{t + 1} (X_{t + 1})$ , $Q_{t + 1} (B_{t + 1} | \cdot, \cdot)$ is a measurable function on $K_{t}$ . We remark that at each $t \in \tilde{T}$ , the stochastic kernel depends only on $(x_{t}, a_{t})$ rather than on the whole history $(x_{0}, a_{0}, x_{1}, a_{1}, \dots, x_{t}, a_{t})$ . By (3.2), we have
$\begin{matrix} Q_{t + 1} (B_{t + 1} | x_{t}, a_{t}) = \int_{Y} I_{B_{t + 1}} [F (x_{t}, a_{t}, y)] d P^{η} (y), B_{t + 1} \in B (X_{t + 1}), \end{matrix}$
where $I_{B_{t + 1}}$ denotes the indicator function of $B_{t + 1}$ .

Assumption 3.1

The reward functions $r_{t} (x_{t}, a_{t})$ for $t \in \tilde{T}$ and $r_{T} (x_{T})$ are nonnegative, continuous in their arguments and uniformly bounded, i.e. there exists $M < \infty$ such that $0 \leq r_{t} (x_{t}, a_{t}) \leq M$ and $0 \leq r_{T} (x_{T}) \leq M$ .
The multi-function (also known as a correspondence or point-to-set function) $x \to A_{t} (x)$ is upper semi-continuous (u.s.c.). That is, if $\{x^{m}\} \subset X_{t}$ and $\{a^{m}\}$ with $a^{m} \in A_{t} (x^{m})$ are sequences such that $x^{m} \to \bar{x}$ and $a^{m} \to \bar{a}$ , then $\bar{a} \in A_{t} (\bar{x})$ for $t \in \tilde{T}$ .
For every state $x \in X_{t}$ , the admissible action set $A_{t} (x)$ is compact for $t \in \tilde{T}$ .

Optimal control problem

Main Result

For every $t \in \tilde{T}$ , $x_{t} \in X_{t}$ and $π \in Π$ , let

\begin{matrix} V_{t} (x_{t}, π) & ≜ ϱ_{t} (\sum_{i = t}^{T - 1} r_{i} (x_{i}, π_{i}) + r_{T} (x_{T})) \end{matrix}

be the performance evaluation from time $t \in \tilde{T}$ onwards using the policy $π \in Π$ given the initial condition $x_{t} \in X_{t}$ . The corresponding optimal (i.e. maximal) value is then

\begin{matrix} V_{t}^{*} (x_{t}) & ≜ \sup_{π \in Π} V_{t} (x_{t}, π) \end{matrix}

4.1

A control policy $π^{*} = {(π_{t}^{*})}_{t \in \tilde{T}}$ is said to be optimal if it attains the maximum in (4.1), that is

\begin{matrix} V_{t}^{*} (x) & = V_{t} (x, π^{*}) \quad \text{for all } x \in X_{t} \text{ and } t \in \tilde{T} . \end{matrix}

4.2

Thus, the optimal control problem is to find an optimal policy and the associated optimal value (4.2) for all $t \in T$ . We now present the main result of the paper.

Theorem 4.1

The optimization problem (4.1) obeys dynamic programming principle and has an optimal policy $π^{*} \in Π$ . Furthermore, $V_{t}^{*} (x_{t})$ is continuous in its argument.

Proof of Theorem 4.1

To prove Theorem 4.1, we need the following key lemma.

Lemma 4.1

Let $K_{t}$ be defined as in (3.1). Let $V : K_{t} \to R$ be a nonnegative continuous function. For $x_{t} \in X_{t}$ , define

\begin{matrix} V^{*} (x_{t}) ≜ \sup_{a \in A_{t} (x_{t})} V (x_{t}, a) . \end{matrix}

Then there exists a $B (X_{t})$ measurable mapping $π_{t}^{*} : X_{t} \to A_{t}$ such that, for any $x_{t} \in X_{t}$ ,

\begin{matrix} V^{*} (x_{t}) = V (x_{t}, π_{t}^{*} (x_{t})) \end{matrix}

4.3

and $V^{*} : X_{t} \to R$ is continuous.

Proof

By Lemma 4.1 in Rieder (1978), there exists a $B (X_{t})$ measurable mapping $π_{t}^{*} : X_{t} \to A_{t}$ such that (4.3) holds and $V^{*} (x_{t})$ is upper semi-continuous. Moreover, since $V (\cdot, \cdot)$ is continuous, $\sup_{a \in A_{t} (x_{t})} V (x_{t}, a)$ is lower semi-continuous in $x_{t}$ as well. Hence, $V^{*} (\cdot)$ is continuous. $□$

Lemma 4.2

Suppose Assumption 3.1 holds true. Then, the supremum in (4.1) is attained by some $B (X_{t})$ measurable mapping $π_{t}^{*} (x_{t}) = a_{t}^{*}$ for $t \in \tilde{T}$ . Furthermore, each $V_{t}^{*}$ is continuous.

Proof

We show only the case $t = T - 1$ . The other cases follow by going backwards iteratively down to $t = 0$ . We first show that

\begin{matrix} (x_{T - 1}, a_{T - 1}) \to \int_{0}^{\infty} w (P^{π} (r_{T} (F (x_{T - 1}, a_{T - 1}, η_{T - 1})) \geq z_{T})) d z_{T} \end{matrix}

is continuous in its arguments. Let $(x_{T - 1}^{m}, a_{T - 1}^{m}) \to (x_{T - 1}, a_{T - 1})$ as $m \to \infty$ . Then, we have

\begin{matrix} \lim_{m \to \infty} V_{T - 1} (x_{T - 1}^{m}, a_{T - 1}^{m}) & = \lim_{m \to \infty} \int_{0}^{\infty} w (P^{π} (r_{T} (F (x_{T - 1}^{m}, a_{T - 1}^{m}, η_{T - 1})) \geq z_{T})) d z_{T} \\ & = \int_{0}^{\infty} \lim_{m \to \infty} w (P^{π} (r_{T} (F (x_{T - 1}^{m}, a_{T - 1}^{m}, η_{T - 1})) \geq z_{T})) d z_{T} \\ & = \int_{0}^{\infty} w (\lim_{m \to \infty} P^{π} (r_{T} (F (x_{T - 1}^{m}, a_{T - 1}^{m}, η_{T - 1})) \geq z_{T})) d z_{T} \\ & = \int_{0}^{\infty} w (P^{π} (\lim_{m \to \infty} r_{T} (F (x_{T - 1}^{m}, a_{T - 1}^{m}, η_{T - 1})) \geq z_{T})) d z_{T} \\ & = \int_{0}^{\infty} w (P^{π} (r_{T} (F (x_{T - 1}, a_{T - 1}, η_{T - 1})) \geq z_{T})) d z_{T} \end{matrix}

The second equality follows from the boundedness of $r_{T} (\cdot)$ and $w (\cdot)$ and the Lebesgue dominated convergence theorem. The third equality follows by continuity of $w (\cdot)$ , the fourth equality follows by continuity of the probability measure, and the fifth equality follows by continuity of the transition function $F (\cdot, \cdot, \cdot)$ as in (3.2) and of $r_{T} (\cdot)$ . Hence, $V_{T - 1} (\cdot, \cdot)$ is continuous in its arguments. The result follows by Lemma 4.1. $□$

Now, we are ready to prove Theorem 4.1.

Proof of Theorem 4.1

We have

\begin{matrix} V_{T - 1}^{*} (x_{T - 1}) & = \sup_{a_{T - 1} \in A_{T - 1} (x_{T - 1})} \int_{0}^{\infty} w (P^{π} (r_{T} (F (x_{T - 1}, a_{T - 1}, η_{T - 1})) \geq z_{T})) d z_{T} \\ & = \int_{0}^{\infty} w (P^{π} (r_{T} (F (x_{T - 1}, π_{T - 1}^{*} (x_{T - 1}), η_{T - 1})) \geq z_{T})) d z_{T} . \end{matrix}

By Lemma 4.2, there exists a $B (X_{T - 1})$ measurable mapping $π_{T - 1}^{*} \in F_{T - 1}$ such that $π_{T - 1}^{*} (x_{T - 1}) = a_{T - 1}^{*}$ , and $V_{T - 1}^{*} (x_{T - 1})$ is continuous. Hence, using Lemma 2.3 (i) for $t = T - 2$ , we have

\begin{matrix} V_{T - 2}^{*} (x_{T - 2}) & = \sup_{\begin{matrix} a_{T - 2} \in A_{T - 2} (x_{T - 2}) \\ a_{T - 1} \in A_{T - 1} (x_{T - 1}) \end{matrix}} \{\int_{0}^{\infty} w (P^{π} (r_{T - 2} (x_{T - 2}, a_{T - 2}) + V_{T - 1}^{*} (F (x_{T - 2}, a_{T - 2}, η_{T - 2})) \geq z_{T - 1})) d z_{T - 1}\} \end{matrix}

By Lemma 4.2 again, it admits an optimal action $a_{T - 2}^{*} \in A_{T - 2}$ such that

\begin{matrix} V_{T - 2}^{*} (x_{T - 2}) & = \int_{0}^{\infty} w (P^{π} (r_{T - 2} (x_{T - 2}, a_{T - 2}^{*}) + V_{T - 1}^{*} (F (x_{T - 2}, a_{T - 2}^{*}, η_{T - 2})) \geq z_{T - 1})) d z_{T - 1} \end{matrix}

Going backwards iteratively, we conclude that dynamic programming holds and that (4.1) admits an optimal policy $π^{*} \in Π$ attaining the supremum, which depends only on $x_{t}$ at each $t \in \tilde{T}$ . Furthermore, $V_{t}^{*} (\cdot)$ is continuous, again by Lemma 4.2. Hence, we conclude the proof. $□$

Based on our main result, the methodology is as follows. Given the distortion operator along with the controlled Markov process, one checks whether the framework with its reward functions and control sets satisfies the requirements in Assumption 3.1. If it does, Theorem 4.1 shows that dynamic programming can be applied to find the optimal controls along with the optimal value function. The next section exemplifies this.
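The backward recursion used in the proof can be sketched for a model with finitely many states and actions. This is a simplification of the Borel-space setting of Sect. 3, and every name below (`backward_induction`, `transition`, the toy states and rewards) is a hypothetical choice made for illustration. The stage reward is pulled out of the one-step distortion by positive translation invariance.

```python
def distortion_value(values, probs, w):
    """int_0^inf w(P(xi >= z)) dz for a discrete, non-negative xi."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)
    return total


def backward_induction(states, actions, transition, reward, terminal_reward, horizon, w):
    """Return (V_0, policy) where policy[t][x] is a maximizing action at time t."""
    value = {x: terminal_reward(x) for x in states}        # V_T = r_T
    policy = [dict() for _ in range(horizon)]
    for t in reversed(range(horizon)):
        new_value = {}
        for x in states:
            best_a, best_v = None, float("-inf")
            for a in actions:
                nxt = transition(t, x, a)                   # list of (prob, next_state)
                probs = [p for p, _ in nxt]
                vals = [value[y] for _, y in nxt]
                v = reward(t, x, a) + distortion_value(vals, probs, w)
                if v > best_v:
                    best_a, best_v = a, v
            new_value[x], policy[t][x] = best_v, best_a
        value = new_value
    return value, policy


# Toy one-period model (assumed data): a safe and a risky action.
states, actions = ["low", "mid", "high"], ["safe", "risky"]
terminal = {"low": 0.0, "mid": 1.0, "high": 3.0}


def transition(t, x, a):
    return [(1.0, "mid")] if a == "safe" else [(0.5, "low"), (0.5, "high")]


args = (states, actions, transition, lambda t, x, a: 0.0, terminal.get, 1)
v_convex, pi_convex = backward_induction(*args, lambda p: p ** 2)   # risk-averse for gains
v_linear, pi_linear = backward_induction(*args, lambda p: p)        # plain expectation
```

In this toy model the convex distortion selects the safe action, whereas the undistorted expectation selects the risky one.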

An application to portfolio optimization

Model

Suppose an investor has a portfolio of n stocks. The prices of n stocks at $t \in T$ are denoted by

\begin{matrix} S_{t} ≜ (S_{t}^{1}, \dots, S_{t}^{n}) . \end{matrix}

The price of stock $i \in [1, 2, \dots, n]$ at time $t \in \tilde{T}$ , denoted by $S_{t}^{i}$ , has dynamics

\begin{matrix} S_{t + 1}^{i} = (1 + r^{i}) S_{t}^{i} \quad \text{with probability } p^{i}, \end{matrix}

5.1

where $- 1 < r^{i} < 1$ is the proportional return rate of the price $S_{t}^{i}$ of the ith stock. Let $P^{η} (\cdot)$ denote the joint probability mass function of $S_{t}$ for $t \in T$ . Let $π = {(π_{t})}_{t \in \tilde{T}}$ be the policy of the investor, where $π_{t}$ stands for the number of shares of each of the n stocks that the investor holds at time $t \in \tilde{T}$ , with

\begin{matrix} π_{t} : S_{t} \to R^{n}, \end{matrix}

where $π_{t}$ is $B (S_{t})$ measurable. We assume that the investor can take long or short positions of bounded size. Namely, we assume that $‖ π_{t} (x) ‖ \leq C$ for some $C > 0$ , for all $x \in S_{t}$ and $t \in \tilde{T}$ . We denote by $Π$ the set of admissible strategies ${(π_{t})}_{t \in \tilde{T}}$ that are $B (S_{t})$ measurable and uniformly bounded by C. We assume that the portfolio is self-financing in the sense

\begin{matrix} Y_{t + 1} = π_{t}^{⊺} S_{t + 1} \quad \text{for } t \in \tilde{T}, \end{matrix}

with $Y_{t + 1}$ being the value of the portfolio and $S_{t + 1}$ being the n-dimensional vector as defined in (5.1) at time $t + 1$ such that, denoting $y_{- 1} ≜ y_{0}$ ,

\begin{matrix} Δ Y_{t} & ≜ Y_{t} - y_{t - 1}, \end{matrix}

is the difference of the total wealth between time t and $t - 1$ for $t \in T$ . Hence, the reward function at $t \in T$ reads as

\begin{matrix} r_{t} (s_{t - 1}, s_{t}, π_{t}) = Δ Y_{t} \\ r_{T} (s_{T - 1}, s_{T}) = Δ Y_{T} \\ Y_{t} = y_{t - 1} + r_{t} (s_{t - 1}, s_{t}, π_{t}), \end{matrix}

Let $w (x) = x^{2}$ for $x \in [0, 1]$ be the distortion function. For a fixed $π_{T - 1}$ , given $Y_{T - 1} = y_{T - 1}$ and $S_{T - 1} = s_{T - 1}$ , the performance measure is defined by

\begin{matrix} ϱ_{T - 1} (Y_{T}) & ≜ \int_{0}^{\infty} (P^{π} (y_{T - 1} + π_{T - 1}^{⊺} (S_{T} - s_{T - 1}) \geq z_{T} | y_{T - 1}, s_{T - 1}))^{2} d z_{T} \\ & = y_{T - 1} + \int_{0}^{\infty} (P^{π} (π_{T - 1}^{⊺} (S_{T} - s_{T - 1}) \geq z_{T} | y_{T - 1}, s_{T - 1}))^{2} d z_{T} \\ & = V_{T - 1} (y_{T - 1}, s_{T - 1}, π) \end{matrix}

such that

\begin{matrix} V_{T - 1}^{*} (y_{T - 1}, s_{T - 1}) = \max_{π \in Π} V_{T - 1} (y_{T - 1}, s_{T - 1}, π) . \end{matrix}

Here, Lemma 2.3 is used in the second equality. Hence, going backwards iteratively, we have at each time $t \in \tilde{T}$

\begin{matrix} V_{t} (y_{t}, s_{t}, π) & = y_{t} + \int_{0}^{\infty} (P^{π} (V_{t + 1} (Y_{t + 1}, S_{t + 1}, π) \geq z_{t + 1} | y_{t}, s_{t}))^{2} d z_{t + 1} \\ V_{t}^{*} (y_{t}, s_{t}) & = \max_{π \in Π} V_{t} (y_{t}, s_{t}, π) . \end{matrix}

5.2

The application fits the framework of Theorem 4.1. Indeed, it is immediate to see that the conditions of Assumption 3.1 are satisfied by the reward function and the action sets. Thus, Theorem 4.1 can be applied. In particular, it justifies dynamic programming, which yields an optimal strategy ${(π_{t}^{*})}_{t \in \tilde{T}} \in Π$ attaining the maximum in (5.2) along with the optimal value $V_{t}^{*}$ at each time $t \in \tilde{T}$ .

Numerical example

Consider the discrete time set $T = \{0, 1, 2\}$ and two stocks, Stock A and Stock B. The prices of Stock A and Stock B at $t \in T$ are denoted by $S_{t}^{A}$ and $S_{t}^{B}$ , respectively. The prices at $t = 0$ are $S_{0}^{A} = S_{0}^{B} = 1$ . Additionally, suppose that an investor can buy Stock A, Stock B, or the portfolio Stock AB with equal shares of both stocks. The initial wealth to be invested equals 1. The investor can switch between Stock A, Stock B, and the portfolio that splits current wealth equally between both stocks at each time $t = 0, 1, 2$ . The current wealth changes as the stock prices go up or down with movement rates denoted by $r^{A}$ and $r^{B}$ , respectively. They are independent random variables satisfying

\begin{matrix} P (r^{A} = 0.22) & = P (r^{A} = - 0.20) = \frac{1}{2} \\ P (r^{B} = 0.12) & = P (r^{B} = - 0.10) = \frac{1}{2} . \end{matrix}

In particular, the expected return of each choice is positive: $E [r^{A}] = E [r^{B}] = E [0.5 r^{A} + 0.5 r^{B}] = 0.01$ . Hence, a 2-period investment in the same stock, as well as in the equal-shares portfolio, is expected to give a return of $1.01^{2} - 1 = 0.0201$ . The distortion function is $w (x) = x^{2}$ on $x \in [0, 1]$ . Hence, the distorted weight of the upper outcome is $w (P (r^{A} = 0.22)) = {(\frac{1}{2})}^{2} = \frac{1}{4}$ and that of the lower outcome is $1 - w (P (r^{A} = 0.22)) = 1 - {(\frac{1}{2})}^{2} = \frac{3}{4}$ . The highest return using the distortion operator at $t = 0$ equals

\begin{matrix} ϱ_{1 | 0}^{B} = 0.912 > ϱ_{1 | 0}^{AB} = 0.902 > ϱ_{1 | 0}^{A} = 0.864 . \end{matrix}

5.3

Here $ϱ_{1 | 0}^{A}$ denotes the value if the investor starts by putting all his wealth into Stock A; $ϱ_{1 | 0}^{B}$ and $ϱ_{1 | 0}^{AB}$ are defined accordingly. By (5.3), the optimal strategy is to invest in Stock B for $t = 1$ and to continue this investment for $t = 2$ . It is depicted in Fig. 2 below. However, the return estimated with dynamic probability distortion is $0.912 - 1 = - 0.088$ . In particular, it is negative.
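As a hand check of (5.3), and assuming that the reported figures are two-period values obtained by compounding one-step distorted values (which positive homogeneity, Lemma 2.3 (ii), permits), the one-step values of a unit wealth under $w (x) = x^{2}$ are

\begin{matrix} ϱ (1 + r^{B}) & = \int_{0}^{0.90} w (1) d z + \int_{0.90}^{1.12} w (\frac{1}{2}) d z = 0.90 + 0.22 \cdot \frac{1}{4} = 0.955, \\ ϱ (1 + r^{A}) & = \int_{0}^{0.80} w (1) d z + \int_{0.80}^{1.22} w (\frac{1}{2}) d z = 0.80 + 0.42 \cdot \frac{1}{4} = 0.905 . \end{matrix}

Holding Stock B for both periods then gives $0.955^{2} \approx 0.912$ , and starting in Stock A before switching to Stock B gives $0.905 \cdot 0.955 \approx 0.864$ , which matches the largest and smallest values in (5.3).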

Fig. 2 — Starting with Investing in Stock B

Moreover, Fig. 1 shows that if one starts by investing in Stock A at $t = 0$ , then the optimal decision is to switch to Stock B at $t = 1$ . In this case the dynamic operator gives the value $ϱ_{0}^{A} = 0.864$ , and the return at $t = 2$ is $0.864 - 1 = - 0.136$ . Similarly, Fig. 3 shows that if one starts by investing equal shares in Stock A and Stock B at $t = 0$ , then the investor should switch to Stock B, and the distortion operator gives $ϱ_{0}^{AB} = 0.902$ . The results in Figs. 1, 2 and 3 reveal that the distortion operator prevents an investor from taking risk by investing in Stock A even though the expected rate of return is equal in every period. The investor chooses Stock B irrespective of the first stake at $t = 0$ . Furthermore, the strategy is time consistent. Figs. 1, 2 and 3 summarize the corresponding actions, where the bold fonts at each time epoch denote the optimal actions.
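The numerical example can be re-computed with a short script. The sketch below rests on assumptions that the text does not spell out: wealth is fully reinvested each period, the two return rates are independent within and across periods, and the value with one period left scales linearly in current wealth by positive homogeneity. All function and variable names are illustrative.

```python
from itertools import product


def distortion_value(values, probs, w):
    """int_0^inf w(P(xi >= z)) dz for a discrete, non-negative xi."""
    total, prev, tail = 0.0, 0.0, 1.0
    for v, p in sorted(zip(values, probs)):
        total += (v - prev) * w(tail)
        prev = v
        tail = max(tail - p, 0.0)
    return total


R_A = [(0.5, 0.22), (0.5, -0.20)]   # (probability, return rate) of Stock A
R_B = [(0.5, 0.12), (0.5, -0.10)]   # (probability, return rate) of Stock B


def gross_returns(action):
    """One-period wealth multipliers for holding A, B, or the equal-share mix AB."""
    if action == "A":
        return [(p, 1 + r) for p, r in R_A]
    if action == "B":
        return [(p, 1 + r) for p, r in R_B]
    return [(pa * pb, 1 + 0.5 * ra + 0.5 * rb)
            for (pa, ra), (pb, rb) in product(R_A, R_B)]


def one_step_factor(action, w):
    outcomes = gross_returns(action)
    return distortion_value([g for _, g in outcomes], [p for p, _ in outcomes], w)


w = lambda x: x ** 2
factor = {a: one_step_factor(a, w) for a in ("A", "B", "AB")}

# Last decision: the best one-step factor applies whatever the current wealth is.
best_last = max(factor, key=factor.get)

# Two-period values from unit wealth for each possible first action.
two_period = {a0: factor[a0] * factor[best_last] for a0 in factor}
print(best_last, {a: round(v, 3) for a, v in two_period.items()})
```

Under these assumptions the script selects Stock B at the last decision for every starting choice, reproduces the values 0.912 and 0.864 of (5.3) to three decimals, and gives a value of about 0.90 for the equal-share start, in line with the ordering in (5.3).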

Fig. 3 — Starting with Investing in Stock AB

Conclusion and future work

This paper discusses the assessment of the multi-period risk of an investment using distortion operators. The peculiarity of these operators lies in the fact that they give optimal controls that are not time consistent in a multi-period framework. The dynamic distortion operator introduced in this paper is a multi-period aggregation of the distortion operator that satisfies time consistency and hence the dynamic programming principle. Furthermore, it is demonstrated how these operators work and how they apply to the investment problem. The main economic implication of the presented methodology is as follows. If observed probabilities are distorted but observed outcomes of the stock return rate are viable, then the dynamic distortion operators produce an optimal multi-period investment strategy that is robust to risk and return effective.

Further research will focus on empirical results of investment strategies based on distortion operators. These results should be compared with alternative strategies using the same set of empirical data. For instance, the proposed framework can be compared with the strategies presented in Elze (2012); Majewski et al. (2020), which outperform the market in return-risk efficiency. A range of downside risk measures and models that are alternatives to the presented investment optimization methodology was proposed and applied by Barro and Canestrelli (2014). Second order stochastic dominance (SSD) relations (Batabyal and Killinis 2021) and convex risk measures (Ruszczynski 2010; Follmer and Penner 2006; Fritelli et al. 2002) are further alternatives to the methodology proposed in this work, and their performance on empirical data is to be compared with it.

Declarations

Conflict of interest

Kerem Uğurlu is the first author of the manuscript. Kerem Uğurlu has been financially supported by Nazarbayev University under the Project SSH2020016 “Robust Methods in Financial Mathematics and Stochastic Control” during the preparation of this manuscript. There is no conflict of interest between the authors of this manuscript.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Kerem Uğurlu, Email: kerem.ugurlu@nu.edu.kz.

Tomasz Brzeczek, Email: tomasz.brzeczek@put.poznan.pl.

References

Artzner P, Delbaen F, Eber JM, Heath D. Coherent measures of risk. Math Financ. 1999;9:203–228. doi: 10.1111/1467-9965.00068.
Artzner P, Delbaen F, Eber JM, Heath D, Ku H. Coherent multiperiod risk adjusted values and Bellman's principle. Ann Oper Res. 2007;152:5–22. doi: 10.1007/s10479-006-0132-6.
Barro D, Canestrelli E. Downside risk in multiperiod tracking error models. Cent Eur J Oper Res. 2014;22(2):263–283. doi: 10.1007/s10100-013-0290-y.
Batabyal S, Killinis R. Economic policy uncertainty and stock market returns: evidence from Canada. J Econ Asymmetr. 2021;24(e00215):1–14.
Bellman R. On the theory of dynamic programming. Proc Natl Acad Sci. 1952;38:716. doi: 10.1073/pnas.38.8.716.
Bjork T, Khapko M, Murgoci A. On time-inconsistent stochastic control in continuous time. Financ Stoch. 2017;21:331–360. doi: 10.1007/s00780-017-0327-5.
Branda M, Kopa M. On relations between DEA-risk models and stochastic dominance efficiency tests. Cent Eur J Oper Res. 2014;22(1):13–35. doi: 10.1007/s10100-012-0283-2.
Cheridito P, Delbaen F, Kupper M. Dynamic monetary risk measures for bounded discrete-time processes. Electron J Probab. 2006;11:57–106. doi: 10.1214/EJP.v11-302.
Chung K, Sobel MJ. Discounted MDP’s: distribution functions and exponential utility maximization. SIAM J Control Optim. 1987;25:49–62. doi: 10.1137/0325004.
Eichhorn A, Romisch W. Polyhedral risk measures in stochastic programming. SIAM J Optim. 2005;16:69–95. doi: 10.1137/040605217.
Elze G. Value investors anomaly: return enhancement by portfolio replication - an empiric portfolio strategic analysis. Cent Eur J Oper Res. 2012;20(4):633–647. doi: 10.1007/s10100-011-0214-7.
Fleming WH, Sheu SJ. Optimal long term growth rate of expected utility of wealth. Ann Appl Probab. 1999;9:871–903. doi: 10.1214/aoap/1029962817.
Fleming WH, Sheu SJ. Risk-sensitive control and an optimal investment model. Math Financ. 2000;10:197–213. doi: 10.1111/1467-9965.00089.
Focard S, Fabozzi FJ. Black swans and white eagles: on mathematics and finance. Math Meth Oper Res. 2009;69(3):379–394. doi: 10.1007/s00186-008-0243-8.
Follmer H, Penner I. Convex risk measures and the dynamics of their penalty functions. Stat Decis. 2006;24:61–96.
Fritelli M, Rosazza Gianin E. Putting order in risk measures. J Bank Financ. 2002;26:1473–1486. doi: 10.1016/S0378-4266(02)00270-4.
Frittelli M, Rosazza Gianin E. Dynamic convex risk measures. In: Risk measures for the 21st century. Chichester: Wiley; 2005. pp. 227–248.
He XD, Zhou XY. Portfolio choice via quantiles. Math Financ. 2011;21(2):203–231.
Hernandez-Lerma O. Adaptive Markov control processes. New York: Springer-Verlag; 1989.
Hernandez-Lerma O, Lasserre JB. Discrete-time Markov control processes: basic optimality criteria. New York: Springer; 1996. doi: 10.1007/978-1-4612-0729-0.
Jaquette SC. Markov decision processes with a new optimality criterion: discrete time. Ann Stat. 1973;1:496–505. doi: 10.1214/aos/1176342415.
Jaquette SC. Utility criterion for Markov decision processes. Manag Sci. 1976;23:43–49. doi: 10.1287/mnsc.23.1.43.
Kahneman D, Tversky A. Prospect theory: an analysis of decision under risk. Econometrica. 1979;47:263–292. doi: 10.2307/1914185.
Kahneman D, Tversky A. Advances in prospect theory: cumulative representation of uncertainty. J Risk Uncertain. 1992;5:297–323. doi: 10.1007/BF00122574.
Kara G, Ozmen A, Weber GW. Stability advances in robust portfolio optimization under parallel piped uncertainty. Cent Eur J Oper Res. 2019;27:241–261. doi: 10.1007/s10100-017-0508-5.
Kun L, Cheng J, Marcus SI. Probabilistically distorted risk-sensitive infinite-horizon dynamic programming. Automatica. 2018;97:1–6. doi: 10.1016/j.automatica.2018.07.028.
Kurum E, Kasirga Y, Weber GW. A classification problem of credit risk rating investigated and solved by optimisation of the ROC curve. Cent Eur J Oper Res. 2012;20(3):529–557. doi: 10.1007/s10100-011-0224-5.
Majewski S, Tarczynski W, Tarczynska-Luniewska M. Measuring investors’ emotions using econometric models of trading volume of stock exchange indexes. Invest Manag Financ Innov. 2020;17(3):281–291.
Ma J, Wong T, Zhang J. Time-consistent conditional expectation under probability distortion. Preprint.
Phillips E. Nassim Taleb heads international banking’s first Grey/Black Swan committee. Q Rev Econ Financ. 2019;72:117–122. doi: 10.1016/j.qref.2018.11.005.
Rieder U. Measurable selection theorems for optimisation problems. Manuscr Math. 1978;24:115–131. doi: 10.1007/BF01168566.
Ruszczynski A. Risk-averse dynamic programming for Markov decision processes. Math Program B. 2010;125:235–261. doi: 10.1007/s10107-010-0393-3.
Wakker P. Prospect theory: for risk and ambiguity. Cambridge: Cambridge University Press; 2010.
Werther GFA. Improving finance and risk management foresight abilities: Growing past the black swan mindset through integrative assessment. J Risk Manag Financ Institut. 2017;10(4):353–364.
Zhou X. Mathematicalising behavioural finance. In: Proceedings of the International Congress of Mathematicians, Hyderabad, India; 2010.

