Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Mar 14.
Published in final edited form as: Proc IEEE Conf Decis Control. 2020 Mar 12;2019:1526–1531. doi: 10.1109/cdc40024.2019.9029381

Distributionally Robust Portfolio Optimization

I E Bardakci 1, C M Lagoa 1
PMCID: PMC7956065  NIHMSID: NIHMS1676999  PMID: 33723475

Abstract

In this paper we consider the problem of portfolio optimization involving uncertainty in the probability distribution of the assets returns. Starting with an estimate of the mean and covariance matrix of the returns of the assets, we define a class of admissible distributions for the returns and show that optimizing the worst-case risk of loss can be done in a numerically efficient way. More precisely, we show that determining the asset allocation that minimizes the distributionally robust risk can be done using quadratic programming and a one line search. Effectiveness of the proposed approach is shown using academic examples.

I. Introduction

This paper falls under the umbrella of a large body literature aimed at selection of portfolio weights to optimize a specified performance metric; see [1], [2] and references therein for a survey of results that has been written on the related portfolio selection problem. Generically speaking, the main objective of portfolio selection is the construction of portfolios that maximize expected returns while maintaining the risk of losses below a prescribed level (or equivalently minimize the risk of loss subject to minimum expected return). Usually this is accomplished using performance measures which assume the availability of expected returns and the covariance matrix of the assets.

Despite the theoretical progress over the years, Markowitz’s [3] mean-variance framework is still the major model used in practice today in portfolio selection problems. Given the estimated mean μ and covariance Σ of random returns, the Markowitz’s mean variance model considers the variance minimization in portfolio selection problem subject to minimum expected portfolio return γ. In this framework, the variance of the returns, wTΣw, is used as surrogate for risk. More precisely, for an n-asset problem, the portfolio weights are determined by solving the following optimization problem

minwWwTΣwsubject to μwγ, (1)

where W denotes the unit simplex.

There are many other approaches to this problem that use different ways to access/control risk. One of the popular alternative portfolio risk measures is value-at-risk (VaR) [4], [5] that gained notoriety due to its simplicity and flexibility [1]. However, VaR also has some important limitations. One of the major limitations of VaR is that it is not a coherent risk measure since it is non-subadditive [6] and it does not measure worst case losses [7]. Furthermore, VaR is non-convex and requires the knowledge of the whole distribution [8], [6]. In order to overcome these limitations, Rockafellar and Uryasev [9], [10] introduced conditional-value-at-risk (CVaR) that has addressed some of the shortcomings of VaR. Unlike VaR, it is a coherent measure of risk since it satisfies the sub-additivity property [11]. Additionally, the minimization of CVaR is a convex optimization problem [12].

One main shortcomings of these portfolio optimization methods is that resulting optimal portfolios are sensitive to the problem parameters which are estimated from historical data [13]. Recently, in order to overcome the effect of estimation errors robust portfolio selection methods have been adopted. Many formulations proposed in the field of robust portfolio optimization have focused on the worst case performance of the portfolio over admissible sets for the mean and covariance of the returns. These methods involve the use of uncertainty set, that includes the worst possible realizations of the uncertain input parameters. In addition to that significant efforts have been made towards formulating robust approaches from Markowitz based mean-variance analysis using box [14], ellipsoidal [15] and separable uncertainty sets [16], [17], [18].

The main drawbacks of the approaches to robust portfolio optimization that are available in the literature is that they either assume the distribution of random returns is normal or they minimize a surrogate measure of risk, not risk itself. This is done to “escape” one of the main problems that one is faced with when trying to optimize risk: Computing and optimizing risk, in general, requires the evaluation of multi-dimensional integrals, a numerical problem for which a “good” solution still does not exist [19].

In this paper, we provide a novel approach to the robust portfolio optimization problem aimed at addressing the above limitations. First, given the uncertainty of information that is usually available on assets, we do not assume that we know their distribution, we only assume that their distribution belongs to a class of distributions that is built on the available mean and covariance information. Second, we provide a way of exactly optimizing worst-case risk without the need for the estimation of the values of multi-dimension integrals. More precisely, we show that optimizing worst-case risk can be done using convex quadratic optimization together with a one line search.

A. Organization of the paper

In Section II-B we provide a precise statement of the robust portfolio optimization problem addressed in this paper. In Section III a reformulation of the robust portfolio optimization problem is presented and the proof of its exactness is provided in Section IV. Numerical examples showing the effectiveness of the proposed approach are shown in Section V. Concluding remarks are provided in Section VI.

B. Notation

The sets of real numbers are denoted by . The Euclidean norm of column vectors xn is denoted by ∥x2. The probability of a set is denoted by Prob{·}. Vectors are denoted by bold letters. We use min(x) and max(x) to denote the minimum and maximum element of the vector x respectively.

II. Problem Statement

In this paper, we concentrate on the single-period portfolio selection problem. Let the random vector

X=[X1X2Xn]Tn,

denote the random returns of n risky assets. We assume that the probability distribution of these assets is unknown but that its mean μ and covariance Σ has been estimated from historical data.

We use notation

w=[w1w2wn]Tn,

to denote the vector of portfolio weights with wi being fraction of the portfolio to be invested in the i-th stock. We require wi ≥ 0 and

i=1nwi=1,

Hence, without loss of generality the set of all possible portfolio allocations is given by

W={wn:i=1nwi=1,wi0i=1,,n}. (2)

A. Class of Admissible Asset Distributions

Since the only information available on the probability distribution of the returns X is the mean μ and variance Σ, we propose the following class of admissible distributions which we denote by F: Let the random vector X have the form

X=μ+Δ

where Δ is a zero mean random vector whose probability density function (pdf) f(δ) is of the form

f(δ)=g(δTΣ1δ)

where g:++ satisfies

  1. g(y) ≥ 0 for all y ≥ 0

  2. g(y) = 0 for all y > ymax

  3. g(y1) ≥ g(y2) for all 0 ≤ y1y2

and the quantity ymax represents a uniform bound on all admissible distributions. The set of all distributions of the form above is denoted by F.

The class F above includes, but is not limited to, usual distributions such as truncated Gaussian with mean μ and variance Σ and uniform distributions over ellipsoids of the form

E={x:x=μ+δ,δTΣ1δy}

for any yymax.

Remark 1: Other class of distributions [20] can be considered when formulating the robust portfolio optimization problem. To keep the presentation clear and concise, this will be discussed in future work.

B. Problem Formulation

Before we proceed with the definition of the problem addressed in this paper, the definition of risk needs to be introduced: Let α be a given threshold value. Then, the distributionally robust risk is defined as

Riskα(w)sup fFProbf{X:XTwα} (3)

where Probf denotes the probability computed using pdf f belonging to the class of distributions F introduced above. Then, the distributionally robust portfolio allocation problem is defined as

minwWRiskα(w)subject to μTwγ, (4)

III. Equivalent Numerically Friendly Reformulation

Since computing probability is, in general, an NP-hard problem, in this section we present a “simple” optimization problem which i) is equivalent to the distributionally robust portfolio allocation problem (4) and ii) does not require the computation of multidimensional integrals.

Consider the optimization problem

minwWΣ1/2w2α+μTwsubject to μTwγ. (5)

The following result shows the equivalence between the above problem and the distributionally robust portfolio allocation problem.

Theorem 1: Let μn, Σn×n, α > 0 and F be given. Then, Problem (5) is equivalent to Problem (4) in the following sense: w* achieves the optimum of the Problem (5) if and only if w* achieves the optimum of the Problem (4).

Proof: See Section IV. ■

Remark 2: The result above shows that, in the provided setup, minimizing risk and minimizing variance is different, as it is expected. Minimizing risk is equivalent to minimizing a trade-off between variance and expectation of the portfolio return. However, if expected return is fixed then risk minimization and variance minimization are equivalent and the distributionally robust portfolio allocation problem is, in this case, equivalent to Markowitz approach.

Remark 3: When μTw = β is fixed, the Problem (5) becomes a convex quadratic optimization problem, i.e., minimization of a convex quadratic function subject to convex constraints. Hence, this problem can be numerically solved using quadratic optimization together with a one line search; see Algorithm 1.

II.

IV. Proof of Theorem 1

Consider the problem

maxwWinf fFProbf{X:XTwα}subject to μTwγ, (6)

which can be seen to be equivalent to the risk minimization problem (4) by noting that

inf fFProbf{X:XTwα}=1Riskα(w).

By exploiting this equivalence, we consider the problem (6) in the proof of Theorem 1. Before presenting the proof of the theorem we give the following result that is essential part of our proof.

Lemma 1: Let the random vector X is of the form X = μ+Δ where the distribution f for Δ is taken to be unknown but assumed to belong to the class F and

μ{X:XTwα}

Then

inf fFProbf{X:XTwα}

is achieved when Δ has a uniform distribution over the set

R={Δn:ΔTΣ1Δymax} (7)

where ymax represents the uniform bound on the support of the distribution of Δ.

Proof: In [20], it is proven that

inf fFProbf{X:XTwα}

is achieved when Δ has a truncated uniform distribution, i.e., Δ is uniformly distributed over the set

Et={Δn:ΔTΣ1Δt}

for some 0 ≤ tymax.

To simplify rest of the proof, we apply volume-preserving linear map from an ellipsoid to a hypersphere. We can always find a linear transformation such that

Δ˜=1det(Σ1/2)Σ1/2Δ

where Σ−1/2 symmetric matrix such that Σ−1 = Σ−1/2Σ−1/2. So that, the ellipsoid εt becomes

E˜t={Δ˜n:Δ˜TΔ˜t˜}

which is a hypersphere with a radius t˜. Now, let h˜ denote the distance of the hyperplane {X˜:X˜Tw=α˜} to the origin. Then, t˜h˜ corresponds the height of the hyperspherical cap. The n-dimensional volume of this hyperspherical cap VC of height t˜h˜ and radius t˜ is given by [21]

VC=πn12t˜nΓ(n+12)0cos1(h˜t˜)sinn(w)dw

where Γ is the gamma function. Then, the probability of this hyperspherical cap, Prob{C}=Prob{X˜:X˜Twα˜}, is given by

Prob{C}=VCVE˜t=M0cos1(h˜t˜)sinn(w)dw

where VE˜t denotes the volume of the hypersphere E˜t and M is constant that does not depend on t˜.

It is clear that the probability of this hyperspherical cap is monotonic with respect to cos1(h˜/t˜) which is an increasing function of t˜. This implies the probability

Prob{X˜:X˜Twα˜}=1Prob{C}

is decreasing function of t˜. Hence, it immediately implies that the minimum is attained at t˜=ymax

Before proceeding with the proof of the theorem, we give some definitions that we use in our reasoning.

We first introduce the concept of floating body in a similar fashion to the definition given in [22]. Given a probability level 0 < ϵ < 1/2, we define the floating body Rϵ of the probability distribution is a convex symmetric set for which each supporting hyperplane cuts off a set of probability ϵ. More precisely, the floating body is defined as:

Definition 1: (Floating body) Let 0 < ϵ < 1/2 be given and consider a set R. For any direction uRn, ∥u2 = 1, let H(u) be the supporting hyperplane of Rϵ normal to u. Also, let H+(u) be the half-space defined by H(u) which does not contain the origin. Then, Rϵ is the floating body associated with ϵ of the uniform probability measure over R if, given any direction ∥u2 = 1,

Prob(H+(u))=ϵ.

It is proven in [22] that the convex floating body Rϵ exists as long as the support set R of the uniform distribution is convex and symmetric.

Definition 2: (Dual norm) Let ∥·∥ be a norm in n. Then the dual norm is defined as

x*max{xTy:y1}.

Definition 3: (Minkowski functional) Let the set Rϵn. Then, the Minkowski functional associated with the set Rϵ, denoted by ∥xϵ, is given by

xϵinf {t>0:1txRϵ}

for all xn. Note that the expression above defines a norm when the set Rϵ is compact, convex and symmetric.

A. Proof of the Theorem 1

As a direct application of Lemma 1 and under the assumptions, the maximization problem (6) can be recast as

maxwWProb{X:XTwα}subject to μwγ, (8)

where Δ has a uniform distribution over the set R defined in (7). Without loss of generality, let 0 < ϵ < 1/2 and suppose that

Prob{XTwα}1ϵ

where X = μ + Δ. So, plugging X in the above equation yields

Prob{(μ+Δx)Twα}1ϵ

Since the set {Δn:(μ+Δ)Twα} is a halfspace, by the definition of the floating body, satisfying the above inequality is equivalent to satisfy the inequality given by

(μ+Δ)Twα

for all ΔRϵ, where Rϵ is the floating body of the probability distribution of Δ.

Then, by symmetry of the floating body Rϵ, the inequality can be equivalently rewritten as

ΔTwα+μTw. (9)

Thus, given the definition of dual norm and the Minkowski functional, this is equivalent to

wϵ,*α+μTw.

where ∥wϵ,* denotes the dual norm of the Minkowski functional associated with its floating body Rϵ.

So, if we can determine the quantity ∥wϵ,*, then we can numerically solve the resulting problem.

Since Δ has an elliptical density function of the form described in Section II-A, given 0 < ϵ < 1/2, the floating body Rϵ is of the form

Rϵ={Δn:ΔTΣ1Δr2(ϵ)}.

where r(ϵ) > 0.

It follows from [23] that letting

Δy=1r(ϵ)Σ1/2Δ

leads to ΔRϵ if and only if ∥Δy2 ≤ 1 where the symmetric matrix Σ−1/2 is such that Σ−1 = Σ−1/2Σ−1/2. Then, the inequality (9) can be rewritten as

r(ϵ)Σ1/2Δyα+μTw

for all ∥Δy2 ≤ 1. Since the dual norm of 2-norm is the 2-norm, we have

r(ϵ)Σ1/2w2α+μTw, (10)

which leads to

r(ϵ)α+μTwΣ1/2w2, (11)

Hence, risk minimization, the problem (8), is equivalent to maximizing the radius r(ϵ) of the floating body Rϵ.

The r(ϵ) is easily verified to be non-increasing function of ϵ with r(ϵ) > 0 for 0 < ϵ < 1/2. Thus, r(ϵ) ≤ r(ϵ*) implies 1/r(ϵ*) ≤ 1/r(ϵ) so that maximizing the r(ϵ) is equivalent to minimizing 1/r(ϵ). Then the problem becomes

minwWΣ1/2w2α+μTw (12)

where Σ is covariance matrix of the distribution x.

V. Numerical results

In this section, we present numerical results comparing the solution of a risk minimization (R-M) problem (5) to the solution of the variance minimization (V-M) problem (1) for different choices of minimum expected returns γ. Our portfolio weights are restricted to lie in the set W which is defined in (2). The threshold value α is set as (α = 0), that means we aim at minimizing the risk of losses.

In the results, wRM* and wVM* correspond to the optimal portfolios of (R-M) and (V-M) problem respectively. Given these optimal portfolios, we use the Monte Carlo simulations to estimate the worst-case risk which is defined as supfF Probf {X : XT w* ≤ 0} where X = μ + Δ.

As we can see from the results in Section IV, the worst-case distribution of Δ is the uniform distribution over the ellipsoid with largest radius. Hence, in our simulations, distributionally robust risk is estimated using Monte Carlo simulations with Δ being uniformly distributed over the set

R={Δn:ΔTΣ1Δymax}.

Furthermore, randomly generated estimates of mean and covariance of random returns were used in the provided examples. Our results were obtained using CVX, a package for specifying and solving convex programs [24], [25].

Example 1: In this example, we consider a portfolio involving n = 5 assets where the mean is randomly generated as μ = [0.34, 0.68, 0.40, 0.28, 0.46].

Example 2: In this example, we consider a portfolio involving n = 7 assets where the mean is randomly generated as μ = [0.98, 0.33, 0.35, 0.54, 0.60, 0.51, 0.05].

Tables I and II show the optimal portfolios and corresponding worst-case risk values of (R-M) and (V-M) problems. These results indicates that when the requested expected return γ is low the two approaches give different allocations as expected. Actually, the requirement on expected return is not binding for low levels of risk for the risk minimization scheme. In those cases the average return is higher than the lower bound imposed. For mean-variance minimization scheme, the allocation calculated yields the minimum allowed average return for all cases. Moreover, these two schemes leads to same results when high expected return is requested.

TABLE I:

Comparison of R-M and V-M with 5 assets

γ 0.2 0.4 0.6
RM VM RM VM RM VM
w 0.140 0.207 0.140 0.207 0.009 0.000
0.382 0.190 0.382 0.190 0.670 0.653
0.196 0.185 0.196 0.185 0.125 0.084
0.080 0.211 0.080 0.211 0.000 0.000
0.203 0.206 0.203 0.206 0.196 0.263
μT w 0.503 0.430 0.503 0.430 0.600 0.600
Risk 0.180 0.227 0.180 0.227 0.202 0.202

TABLE II:

Comparison of R-M and V-M with 7 assets

γ 0.60 0.80 0.95
RM VM RM VM RM VM
w 0.345 0.149 0.561 0.563 0.912 0.912
0.076 0.137 0.000 0.000 0.000 0.000
0.091 0.143 0.000 0.000 0.000 0.000
0.148 0.150 0.136 0.139 0.000 0.000
0.189 0.140 0.204 0.189 0.088 0.088
0.151 0.141 0.102 0.109 0.000 0.000
0.000 0.140 0.000 0.000 0.000 0.000
μT w 0.669 0.489 0.800 0.800 0.950 0.950
Risk 0.079 0.229 0.095 0.095 0.139 0.139

VI. Conclusion

In this paper, we provide a novel approach to the problem of portfolio optimization when the distribution of the returns of the assets is not known. More precisely, we start with estimates of the mean and covariance matrix of the asset returns and define a class of distributions that contains common distributions compatible with this a-priori information. With this class of admissible distributions at hand, we show that minimizing the worst-case risk of loss with respect to the admissible class of distributions can be done in an efficient way. More precisely, we show that determining the optimal asset allocation can be done with quadratic optimization together with a one line search. Effort is now being put in obtaining similar results for other classes of distributions and studying the case where there is uncertainty in the estimation of mean and variance of the returns.

VII. Acknowledgments

The authors would like to thank Prof. B. Ross Barmish for the very helpful discussions on the topic addressed in this paper.

This work was supported by the National Institutes of Health Grant R01 HL142732 and the National Science Foundation Grant 1808266

References

  • [1].Fabozzi Frank J, Dashan Huang, and Zhou Guofu. Robust portfolios: contributions from operations research and finance. Annals of operations research, 176(1):191–220, 2010. [Google Scholar]
  • [2].Brandt Michael W. Portfolio choice problems. In Handbook of financial econometrics: Tools and techniques, pages 269–336. Elsevier, 2010. [Google Scholar]
  • [3].Markowitz HM. Portfolio selection, journal of finance 7. Markowitz HM—1952., pages 77–91, 1952. [Google Scholar]
  • [4].Ahn Dong-Hyun, Boudoukh Jacob, Richardson Matthew, and Robert F Whitelaw. Optimal risk management using options. The Journal of Finance, 54(1):359–375, 1999. [Google Scholar]
  • [5].Basak Suleyman and Shapiro Alexander. Value-at-risk-based risk management: optimal policies and asset prices. The review of financial studies, 14(2):371–405, 2001. [Google Scholar]
  • [6].Artzner Philippe, Delbaen Freddy, Eber Jean-Marc, and Heath David. Coherent measures of risk. Mathematical finance, 9(3):203–228, 1999. [Google Scholar]
  • [7].Sarykalin Sergey, Serraino Gaia, and Uryasev Stan. Value-at-risk vs. conditional value-at-risk in risk management and optimization. In State-of-the-art decision-making tools in the information-intensive age, pages 270–294. Informs, 2008. [Google Scholar]
  • [8].Zhu Shushang and Fukushima Masao. Worst-case conditional value-at-risk with application to robust portfolio management. Operations research, 57(5):1155–1168, 2009. [Google Scholar]
  • [9].R Tyrrell Rockafellar Stanislav Uryasev, et al. Optimization of conditional value-at-risk. Journal of risk, 2:21–42, 2000. [Google Scholar]
  • [10].Rockafellar R Tyrrell and Uryasev Stanislav. Conditional value-at-risk for general loss distributions. Journal of banking & finance, 26(7):1443–1471, 2002. [Google Scholar]
  • [11].Pflug Georg Ch. Some remarks on the value-at-risk and the conditional value-at-risk. In Probabilistic constrained optimization, pages 272–281. Springer, 2000. [Google Scholar]
  • [12].Kibzun Andrey I and Kuznetsov Evgeniy A. Analysis of criteria var and cvar. Journal of Banking & Finance, 30(2):779–796, 2006. [Google Scholar]
  • [13].Best Michael J and Grauer Robert R. On the sensitivity of mean-variance-efficient portfolios to changes in asset means: some analytical and computational results. The review of financial studies, 4(2):315–342, 1991. [Google Scholar]
  • [14].Halldórsson Bjarni V and Tütüncü Reha H. An interior-point method for a class of saddle-point problems. Journal of Optimization Theory and Applications, 116(3):559–590, 2003. [Google Scholar]
  • [15].Kim Jang Ho, Kim Woo Chang, and Fabozzi Frank J. Recent developments in robust portfolios with a worst-case approach. Journal of Optimization Theory and Applications, 161(1):103–121, 2014. [Google Scholar]
  • [16].Tütüncü Reha H and Koenig M. Robust asset allocation. Annals of Operations Research, 132(1–4):157–187, 2004. [Google Scholar]
  • [17].Goldfarb Donald and Iyengar Garud. Robust portfolio selection problems. Mathematics of operations research, 28(1):1–38, 2003. [Google Scholar]
  • [18].Erdogan E, Goldfarb D, and Iyengar G. Robust portfolio management. 2004.
  • [19].Nemirovski Arkadi, Juditsky Anatoli, Lan Guanghui, and Shapiro Alexander. Robust stochastic approximation approach to stochastic programming. SIAM Journal on optimization, 19(4):1574–1609, 2009. [Google Scholar]
  • [20].Barmish BR, Lagoa CM, and Tempo R. Radially truncated uniform distributions for probabilistic robustness of control systems. In Proceedings of the 1997 American Control Conference (Cat. No. 97CH36041), volume 1, pages 853–857. IEEE, 1997. [Google Scholar]
  • [21].Li Shengqiao. Concise formulas for the area and volume of a hyperspherical cap. Asian Journal of Mathematics and Statistics, 4(1):66–70, 2011. [Google Scholar]
  • [22].Schütt Carsten and Werner Elisabeth. The convex floating body. Mathematica Scandinavica, pages 275–290, 1990. [Google Scholar]
  • [23].Constantino M Lagoa Xiang Li, and Sznaier Mario. Probabilistically constrained linear programs and risk-adjusted controller design. SIAM Journal on Optimization, 15(3):938–951, 2005. [Google Scholar]
  • [24].Grant Michael and Boyd Stephen. CVX: Matlab software for disciplined convex programming, version 2.1. http://cvxr.com/cvx, March 2014.
  • [25].Grant Michael and Boyd Stephen. Graph implementations for nonsmooth convex programs. In Blondel V, Boyd S, and Kimura H, editors, Recent Advances in Learning and Control, Lecture Notes in Control and Information Sciences, pages 95–110. Springer-Verlag Limited, 2008. [Google Scholar]

RESOURCES