Abstract
In this paper we consider the problem of portfolio optimization involving uncertainty in the probability distribution of the assets returns. Starting with an estimate of the mean and covariance matrix of the returns of the assets, we define a class of admissible distributions for the returns and show that optimizing the worst-case risk of loss can be done in a numerically efficient way. More precisely, we show that determining the asset allocation that minimizes the distributionally robust risk can be done using quadratic programming and a one line search. Effectiveness of the proposed approach is shown using academic examples.
I. Introduction
This paper falls under the umbrella of a large body literature aimed at selection of portfolio weights to optimize a specified performance metric; see [1], [2] and references therein for a survey of results that has been written on the related portfolio selection problem. Generically speaking, the main objective of portfolio selection is the construction of portfolios that maximize expected returns while maintaining the risk of losses below a prescribed level (or equivalently minimize the risk of loss subject to minimum expected return). Usually this is accomplished using performance measures which assume the availability of expected returns and the covariance matrix of the assets.
Despite the theoretical progress over the years, Markowitz’s [3] mean-variance framework is still the major model used in practice today in portfolio selection problems. Given the estimated mean μ and covariance Σ of random returns, the Markowitz’s mean variance model considers the variance minimization in portfolio selection problem subject to minimum expected portfolio return γ. In this framework, the variance of the returns, wTΣw, is used as surrogate for risk. More precisely, for an n-asset problem, the portfolio weights are determined by solving the following optimization problem
| (1) |
where denotes the unit simplex.
There are many other approaches to this problem that use different ways to access/control risk. One of the popular alternative portfolio risk measures is value-at-risk (VaR) [4], [5] that gained notoriety due to its simplicity and flexibility [1]. However, VaR also has some important limitations. One of the major limitations of VaR is that it is not a coherent risk measure since it is non-subadditive [6] and it does not measure worst case losses [7]. Furthermore, VaR is non-convex and requires the knowledge of the whole distribution [8], [6]. In order to overcome these limitations, Rockafellar and Uryasev [9], [10] introduced conditional-value-at-risk (CVaR) that has addressed some of the shortcomings of VaR. Unlike VaR, it is a coherent measure of risk since it satisfies the sub-additivity property [11]. Additionally, the minimization of CVaR is a convex optimization problem [12].
One main shortcomings of these portfolio optimization methods is that resulting optimal portfolios are sensitive to the problem parameters which are estimated from historical data [13]. Recently, in order to overcome the effect of estimation errors robust portfolio selection methods have been adopted. Many formulations proposed in the field of robust portfolio optimization have focused on the worst case performance of the portfolio over admissible sets for the mean and covariance of the returns. These methods involve the use of uncertainty set, that includes the worst possible realizations of the uncertain input parameters. In addition to that significant efforts have been made towards formulating robust approaches from Markowitz based mean-variance analysis using box [14], ellipsoidal [15] and separable uncertainty sets [16], [17], [18].
The main drawbacks of the approaches to robust portfolio optimization that are available in the literature is that they either assume the distribution of random returns is normal or they minimize a surrogate measure of risk, not risk itself. This is done to “escape” one of the main problems that one is faced with when trying to optimize risk: Computing and optimizing risk, in general, requires the evaluation of multi-dimensional integrals, a numerical problem for which a “good” solution still does not exist [19].
In this paper, we provide a novel approach to the robust portfolio optimization problem aimed at addressing the above limitations. First, given the uncertainty of information that is usually available on assets, we do not assume that we know their distribution, we only assume that their distribution belongs to a class of distributions that is built on the available mean and covariance information. Second, we provide a way of exactly optimizing worst-case risk without the need for the estimation of the values of multi-dimension integrals. More precisely, we show that optimizing worst-case risk can be done using convex quadratic optimization together with a one line search.
A. Organization of the paper
In Section II-B we provide a precise statement of the robust portfolio optimization problem addressed in this paper. In Section III a reformulation of the robust portfolio optimization problem is presented and the proof of its exactness is provided in Section IV. Numerical examples showing the effectiveness of the proposed approach are shown in Section V. Concluding remarks are provided in Section VI.
B. Notation
The sets of real numbers are denoted by . The Euclidean norm of column vectors is denoted by ∥x∥2. The probability of a set is denoted by Prob{·}. Vectors are denoted by bold letters. We use min(x) and max(x) to denote the minimum and maximum element of the vector x respectively.
II. Problem Statement
In this paper, we concentrate on the single-period portfolio selection problem. Let the random vector
denote the random returns of n risky assets. We assume that the probability distribution of these assets is unknown but that its mean μ and covariance Σ has been estimated from historical data.
We use notation
to denote the vector of portfolio weights with wi being fraction of the portfolio to be invested in the i-th stock. We require wi ≥ 0 and
Hence, without loss of generality the set of all possible portfolio allocations is given by
| (2) |
A. Class of Admissible Asset Distributions
Since the only information available on the probability distribution of the returns X is the mean μ and variance Σ, we propose the following class of admissible distributions which we denote by : Let the random vector X have the form
where Δ is a zero mean random vector whose probability density function (pdf) f(δ) is of the form
where satisfies
g(y) ≥ 0 for all y ≥ 0
g(y) = 0 for all y > ymax
g(y1) ≥ g(y2) for all 0 ≤ y1 ≤ y2
and the quantity ymax represents a uniform bound on all admissible distributions. The set of all distributions of the form above is denoted by .
The class above includes, but is not limited to, usual distributions such as truncated Gaussian with mean μ and variance Σ and uniform distributions over ellipsoids of the form
for any y ≤ ymax.
Remark 1: Other class of distributions [20] can be considered when formulating the robust portfolio optimization problem. To keep the presentation clear and concise, this will be discussed in future work.
B. Problem Formulation
Before we proceed with the definition of the problem addressed in this paper, the definition of risk needs to be introduced: Let α be a given threshold value. Then, the distributionally robust risk is defined as
| (3) |
where Probf denotes the probability computed using pdf f belonging to the class of distributions introduced above. Then, the distributionally robust portfolio allocation problem is defined as
| (4) |
III. Equivalent Numerically Friendly Reformulation
Since computing probability is, in general, an NP-hard problem, in this section we present a “simple” optimization problem which i) is equivalent to the distributionally robust portfolio allocation problem (4) and ii) does not require the computation of multidimensional integrals.
Consider the optimization problem
| (5) |
The following result shows the equivalence between the above problem and the distributionally robust portfolio allocation problem.
Theorem 1: Let , , α > 0 and be given. Then, Problem (5) is equivalent to Problem (4) in the following sense: w* achieves the optimum of the Problem (5) if and only if w* achieves the optimum of the Problem (4).
Proof: See Section IV. ■
Remark 2: The result above shows that, in the provided setup, minimizing risk and minimizing variance is different, as it is expected. Minimizing risk is equivalent to minimizing a trade-off between variance and expectation of the portfolio return. However, if expected return is fixed then risk minimization and variance minimization are equivalent and the distributionally robust portfolio allocation problem is, in this case, equivalent to Markowitz approach.
Remark 3: When μTw = β is fixed, the Problem (5) becomes a convex quadratic optimization problem, i.e., minimization of a convex quadratic function subject to convex constraints. Hence, this problem can be numerically solved using quadratic optimization together with a one line search; see Algorithm 1.

IV. Proof of Theorem 1
Consider the problem
| (6) |
which can be seen to be equivalent to the risk minimization problem (4) by noting that
By exploiting this equivalence, we consider the problem (6) in the proof of Theorem 1. Before presenting the proof of the theorem we give the following result that is essential part of our proof.
Lemma 1: Let the random vector X is of the form X = μ+Δ where the distribution f for Δ is taken to be unknown but assumed to belong to the class and
Then
is achieved when Δ has a uniform distribution over the set
| (7) |
where ymax represents the uniform bound on the support of the distribution of Δ.
Proof: In [20], it is proven that
is achieved when Δ has a truncated uniform distribution, i.e., Δ is uniformly distributed over the set
for some 0 ≤ t ≤ ymax.
To simplify rest of the proof, we apply volume-preserving linear map from an ellipsoid to a hypersphere. We can always find a linear transformation such that
where Σ−1/2 symmetric matrix such that Σ−1 = Σ−1/2Σ−1/2. So that, the ellipsoid εt becomes
which is a hypersphere with a radius . Now, let denote the distance of the hyperplane to the origin. Then, corresponds the height of the hyperspherical cap. The n-dimensional volume of this hyperspherical cap VC of height and radius is given by [21]
where Γ is the gamma function. Then, the probability of this hyperspherical cap, , is given by
where denotes the volume of the hypersphere and M is constant that does not depend on .
It is clear that the probability of this hyperspherical cap is monotonic with respect to which is an increasing function of . This implies the probability
is decreasing function of . Hence, it immediately implies that the minimum is attained at ■
Before proceeding with the proof of the theorem, we give some definitions that we use in our reasoning.
We first introduce the concept of floating body in a similar fashion to the definition given in [22]. Given a probability level 0 < ϵ < 1/2, we define the floating body Rϵ of the probability distribution is a convex symmetric set for which each supporting hyperplane cuts off a set of probability ϵ. More precisely, the floating body is defined as:
Definition 1: (Floating body) Let 0 < ϵ < 1/2 be given and consider a set . For any direction u ∈ Rn, ∥u∥2 = 1, let H(u) be the supporting hyperplane of Rϵ normal to u. Also, let be the half-space defined by H(u) which does not contain the origin. Then, is the floating body associated with ϵ of the uniform probability measure over if, given any direction ∥u∥2 = 1,
It is proven in [22] that the convex floating body exists as long as the support set of the uniform distribution is convex and symmetric.
Definition 2: (Dual norm) Let ∥·∥ be a norm in . Then the dual norm is defined as
Definition 3: (Minkowski functional) Let the set . Then, the Minkowski functional associated with the set , denoted by ∥x∥ϵ, is given by
for all . Note that the expression above defines a norm when the set is compact, convex and symmetric.
A. Proof of the Theorem 1
As a direct application of Lemma 1 and under the assumptions, the maximization problem (6) can be recast as
| (8) |
where Δ has a uniform distribution over the set defined in (7). Without loss of generality, let 0 < ϵ < 1/2 and suppose that
where X = μ + Δ. So, plugging X in the above equation yields
Since the set is a halfspace, by the definition of the floating body, satisfying the above inequality is equivalent to satisfy the inequality given by
for all , where is the floating body of the probability distribution of Δ.
Then, by symmetry of the floating body , the inequality can be equivalently rewritten as
| (9) |
Thus, given the definition of dual norm and the Minkowski functional, this is equivalent to
where ∥w∥ϵ,* denotes the dual norm of the Minkowski functional associated with its floating body .
So, if we can determine the quantity ∥w∥ϵ,*, then we can numerically solve the resulting problem.
Since Δ has an elliptical density function of the form described in Section II-A, given 0 < ϵ < 1/2, the floating body is of the form
where r(ϵ) > 0.
It follows from [23] that letting
leads to if and only if ∥Δy∥2 ≤ 1 where the symmetric matrix Σ−1/2 is such that Σ−1 = Σ−1/2Σ−1/2. Then, the inequality (9) can be rewritten as
for all ∥Δy∥2 ≤ 1. Since the dual norm of 2-norm is the 2-norm, we have
| (10) |
which leads to
| (11) |
Hence, risk minimization, the problem (8), is equivalent to maximizing the radius r(ϵ) of the floating body .
The r(ϵ) is easily verified to be non-increasing function of ϵ with r(ϵ) > 0 for 0 < ϵ < 1/2. Thus, r(ϵ) ≤ r(ϵ*) implies 1/r(ϵ*) ≤ 1/r(ϵ) so that maximizing the r(ϵ) is equivalent to minimizing 1/r(ϵ). Then the problem becomes
| (12) |
where Σ is covariance matrix of the distribution x.
V. Numerical results
In this section, we present numerical results comparing the solution of a risk minimization (R-M) problem (5) to the solution of the variance minimization (V-M) problem (1) for different choices of minimum expected returns γ. Our portfolio weights are restricted to lie in the set which is defined in (2). The threshold value α is set as (α = 0), that means we aim at minimizing the risk of losses.
In the results, and correspond to the optimal portfolios of (R-M) and (V-M) problem respectively. Given these optimal portfolios, we use the Monte Carlo simulations to estimate the worst-case risk which is defined as Probf {X : XT w* ≤ 0} where X = μ + Δ.
As we can see from the results in Section IV, the worst-case distribution of Δ is the uniform distribution over the ellipsoid with largest radius. Hence, in our simulations, distributionally robust risk is estimated using Monte Carlo simulations with Δ being uniformly distributed over the set
Furthermore, randomly generated estimates of mean and covariance of random returns were used in the provided examples. Our results were obtained using CVX, a package for specifying and solving convex programs [24], [25].
Example 1: In this example, we consider a portfolio involving n = 5 assets where the mean is randomly generated as μ = [0.34, 0.68, 0.40, 0.28, 0.46]⊤.
Example 2: In this example, we consider a portfolio involving n = 7 assets where the mean is randomly generated as μ = [0.98, 0.33, 0.35, 0.54, 0.60, 0.51, 0.05]⊤.
Tables I and II show the optimal portfolios and corresponding worst-case risk values of (R-M) and (V-M) problems. These results indicates that when the requested expected return γ is low the two approaches give different allocations as expected. Actually, the requirement on expected return is not binding for low levels of risk for the risk minimization scheme. In those cases the average return is higher than the lower bound imposed. For mean-variance minimization scheme, the allocation calculated yields the minimum allowed average return for all cases. Moreover, these two schemes leads to same results when high expected return is requested.
TABLE I:
Comparison of R-M and V-M with 5 assets
| γ | 0.2 | 0.4 | 0.6 | |||
|---|---|---|---|---|---|---|
| R – M | V – M | R – M | V – M | R – M | V – M | |
| w | 0.140 | 0.207 | 0.140 | 0.207 | 0.009 | 0.000 |
| 0.382 | 0.190 | 0.382 | 0.190 | 0.670 | 0.653 | |
| 0.196 | 0.185 | 0.196 | 0.185 | 0.125 | 0.084 | |
| 0.080 | 0.211 | 0.080 | 0.211 | 0.000 | 0.000 | |
| 0.203 | 0.206 | 0.203 | 0.206 | 0.196 | 0.263 | |
| μT w | 0.503 | 0.430 | 0.503 | 0.430 | 0.600 | 0.600 |
| Risk | 0.180 | 0.227 | 0.180 | 0.227 | 0.202 | 0.202 |
TABLE II:
Comparison of R-M and V-M with 7 assets
| γ | 0.60 | 0.80 | 0.95 | |||
|---|---|---|---|---|---|---|
| R – M | V – M | R – M | V – M | R – M | V – M | |
| w | 0.345 | 0.149 | 0.561 | 0.563 | 0.912 | 0.912 |
| 0.076 | 0.137 | 0.000 | 0.000 | 0.000 | 0.000 | |
| 0.091 | 0.143 | 0.000 | 0.000 | 0.000 | 0.000 | |
| 0.148 | 0.150 | 0.136 | 0.139 | 0.000 | 0.000 | |
| 0.189 | 0.140 | 0.204 | 0.189 | 0.088 | 0.088 | |
| 0.151 | 0.141 | 0.102 | 0.109 | 0.000 | 0.000 | |
| 0.000 | 0.140 | 0.000 | 0.000 | 0.000 | 0.000 | |
| μT w | 0.669 | 0.489 | 0.800 | 0.800 | 0.950 | 0.950 |
| Risk | 0.079 | 0.229 | 0.095 | 0.095 | 0.139 | 0.139 |
VI. Conclusion
In this paper, we provide a novel approach to the problem of portfolio optimization when the distribution of the returns of the assets is not known. More precisely, we start with estimates of the mean and covariance matrix of the asset returns and define a class of distributions that contains common distributions compatible with this a-priori information. With this class of admissible distributions at hand, we show that minimizing the worst-case risk of loss with respect to the admissible class of distributions can be done in an efficient way. More precisely, we show that determining the optimal asset allocation can be done with quadratic optimization together with a one line search. Effort is now being put in obtaining similar results for other classes of distributions and studying the case where there is uncertainty in the estimation of mean and variance of the returns.
VII. Acknowledgments
The authors would like to thank Prof. B. Ross Barmish for the very helpful discussions on the topic addressed in this paper.
This work was supported by the National Institutes of Health Grant R01 HL142732 and the National Science Foundation Grant 1808266
References
- [1].Fabozzi Frank J, Dashan Huang, and Zhou Guofu. Robust portfolios: contributions from operations research and finance. Annals of operations research, 176(1):191–220, 2010. [Google Scholar]
- [2].Brandt Michael W. Portfolio choice problems. In Handbook of financial econometrics: Tools and techniques, pages 269–336. Elsevier, 2010. [Google Scholar]
- [3].Markowitz HM. Portfolio selection, journal of finance 7. Markowitz HM—1952., pages 77–91, 1952. [Google Scholar]
- [4].Ahn Dong-Hyun, Boudoukh Jacob, Richardson Matthew, and Robert F Whitelaw. Optimal risk management using options. The Journal of Finance, 54(1):359–375, 1999. [Google Scholar]
- [5].Basak Suleyman and Shapiro Alexander. Value-at-risk-based risk management: optimal policies and asset prices. The review of financial studies, 14(2):371–405, 2001. [Google Scholar]
- [6].Artzner Philippe, Delbaen Freddy, Eber Jean-Marc, and Heath David. Coherent measures of risk. Mathematical finance, 9(3):203–228, 1999. [Google Scholar]
- [7].Sarykalin Sergey, Serraino Gaia, and Uryasev Stan. Value-at-risk vs. conditional value-at-risk in risk management and optimization. In State-of-the-art decision-making tools in the information-intensive age, pages 270–294. Informs, 2008. [Google Scholar]
- [8].Zhu Shushang and Fukushima Masao. Worst-case conditional value-at-risk with application to robust portfolio management. Operations research, 57(5):1155–1168, 2009. [Google Scholar]
- [9].R Tyrrell Rockafellar Stanislav Uryasev, et al. Optimization of conditional value-at-risk. Journal of risk, 2:21–42, 2000. [Google Scholar]
- [10].Rockafellar R Tyrrell and Uryasev Stanislav. Conditional value-at-risk for general loss distributions. Journal of banking & finance, 26(7):1443–1471, 2002. [Google Scholar]
- [11].Pflug Georg Ch. Some remarks on the value-at-risk and the conditional value-at-risk. In Probabilistic constrained optimization, pages 272–281. Springer, 2000. [Google Scholar]
- [12].Kibzun Andrey I and Kuznetsov Evgeniy A. Analysis of criteria var and cvar. Journal of Banking & Finance, 30(2):779–796, 2006. [Google Scholar]
- [13].Best Michael J and Grauer Robert R. On the sensitivity of mean-variance-efficient portfolios to changes in asset means: some analytical and computational results. The review of financial studies, 4(2):315–342, 1991. [Google Scholar]
- [14].Halldórsson Bjarni V and Tütüncü Reha H. An interior-point method for a class of saddle-point problems. Journal of Optimization Theory and Applications, 116(3):559–590, 2003. [Google Scholar]
- [15].Kim Jang Ho, Kim Woo Chang, and Fabozzi Frank J. Recent developments in robust portfolios with a worst-case approach. Journal of Optimization Theory and Applications, 161(1):103–121, 2014. [Google Scholar]
- [16].Tütüncü Reha H and Koenig M. Robust asset allocation. Annals of Operations Research, 132(1–4):157–187, 2004. [Google Scholar]
- [17].Goldfarb Donald and Iyengar Garud. Robust portfolio selection problems. Mathematics of operations research, 28(1):1–38, 2003. [Google Scholar]
- [18].Erdogan E, Goldfarb D, and Iyengar G. Robust portfolio management. 2004.
- [19].Nemirovski Arkadi, Juditsky Anatoli, Lan Guanghui, and Shapiro Alexander. Robust stochastic approximation approach to stochastic programming. SIAM Journal on optimization, 19(4):1574–1609, 2009. [Google Scholar]
- [20].Barmish BR, Lagoa CM, and Tempo R. Radially truncated uniform distributions for probabilistic robustness of control systems. In Proceedings of the 1997 American Control Conference (Cat. No. 97CH36041), volume 1, pages 853–857. IEEE, 1997. [Google Scholar]
- [21].Li Shengqiao. Concise formulas for the area and volume of a hyperspherical cap. Asian Journal of Mathematics and Statistics, 4(1):66–70, 2011. [Google Scholar]
- [22].Schütt Carsten and Werner Elisabeth. The convex floating body. Mathematica Scandinavica, pages 275–290, 1990. [Google Scholar]
- [23].Constantino M Lagoa Xiang Li, and Sznaier Mario. Probabilistically constrained linear programs and risk-adjusted controller design. SIAM Journal on Optimization, 15(3):938–951, 2005. [Google Scholar]
- [24].Grant Michael and Boyd Stephen. CVX: Matlab software for disciplined convex programming, version 2.1. http://cvxr.com/cvx, March 2014.
- [25].Grant Michael and Boyd Stephen. Graph implementations for nonsmooth convex programs. In Blondel V, Boyd S, and Kimura H, editors, Recent Advances in Learning and Control, Lecture Notes in Control and Information Sciences, pages 95–110. Springer-Verlag Limited, 2008. [Google Scholar]
