Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2015 Oct 15;10(10):e0140546. doi: 10.1371/journal.pone.0140546

Robust Portfolio Optimization Using Pseudodistances

Aida Toma 1,2,¤,*,#, Samuela Leoni-Aubin 3,#
Editor: Enrico Scalas4
PMCID: PMC4607458  PMID: 26468948

Abstract

The presence of outliers in financial asset returns is a frequently occurring phenomenon which may lead to unreliable mean-variance optimized portfolios. This fact is due to the unbounded influence that outliers can have on the mean returns and covariance estimators that are inputs in the optimization procedure. In this paper we present robust estimators of mean and covariance matrix obtained by minimizing an empirical version of a pseudodistance between the assumed model and the true model underlying the data. We prove and discuss theoretical properties of these estimators, such as affine equivariance, B-robustness, asymptotic normality and asymptotic relative efficiency. These estimators can be easily used in place of the classical estimators, thereby providing robust optimized portfolios. A Monte Carlo simulation study and applications to real data show the advantages of the proposed approach. We study both in-sample and out-of-sample performance of the proposed robust portfolios comparing them with some other portfolios known in literature.

Introduction

Since Markowitz [1] formulated the idea of diversification of investments, the mean-variance approach has been widely used in practice in asset allocation and portfolio management, despite many sophisticated models proposed in literature. On the other hand, some drawbacks of the standard Markowitz approach are reported in literature (see [2]). One of the critical weaknesses of the classical mean-variance analysis is its lack of robustness. Since the classical estimators of the mean and the covariance matrix, which are inputs in the optimization procedure, are very sensitive to the presence of gross errors or atypical events in data, the weights of the resulted portfolio, which are outputs of this procedure, can be drastically affected by these atypical data. This fact was proved in [3] by using the influence function approach. In order to remove this drawback and to construct portfolios not overly affected by deviations of the data from the assumed model, many methods have been proposed in literature. For an overview on the methods used for robust portfolio optimization we refer to [4]. Among the methods which improve the stability of portfolio weights by using robust estimators of the mean and covariance, we recall those proposed by Vaz-de Melo and Camara [5] which use M-estimators, Perret-Gentil and Victoria-Feser [3] which use the translated biweight S-estimator, Welsch and Zhou [6] which use minimum covariance determinant estimator and winsorization, DeMiguel and Nogales [7] which use both M- and S-estimators, Ferrari and Paterlini [8] which use Maximum Lq-Likelihood Estimators. These contributions have the merit to consider the role of robust estimation for improving the mean-variance portfolios. On the other hand, it is known that traditional robust estimators suffer significant losses in efficiency compared with the maximum likelihood estimator. Therefore, a trade-off robustness-efficiency should be carefully analyzed.

Our contribution to robust portfolio optimization is developed within a minimum pseudodistance framework. We can say that the minimum pseudodistance methods for estimation take part to the same cathegory with the minimum divergence methods. The minimum divergence estimators are defined by minimizing some appropriate divergence between an assumed model and the true model underlying the data. Depending on the choice of the divergence, minimum divergence estimators can afford considerable robustness at minimal loss of efficiency. However, the classical approaches based on divergence minimization require nonparametric density estimation, which can be problematic in multi-dimensional settings. Some proposals to avoid the nonparametric density estimation in minimum divergence estimation have been made in [9, 10] and [11] and robustness properties of such estimators have been studied in [12], [13].

The pseudodistance that we use was originally introduced in [10], where it was called “type-0” divergence. Corresponding minimum divergence estimators have also been studied in [10]. It was also derived (using a cross entropy argument) and extensively studied in [14] where is called γ-divergence. More details about this divergence were provided in [15]. It was also introduced in [16] in the context of decomposable pseudodistances. The minimum pseudodistance estimators for general parametric models have been presented in [16, 17] and consist in minimization of an empirical version of a pseudodistance between the assumed model and the true model underlying the data. These estimators have the advantages of not requiring any prior smoothing and conciliate robustness with high efficiency, usually requiring distinct techniques. In some papers, the name of pseudodistance was used instead of divergence since it was considered that the divergences satisfy the information processing property, which is not the case of the pseudodistance. The interest on statistical methods based on information measures and particularly on divergences has grown substantially in recent years. We refer to the monographs [15, 18] for description of research and applications in this field, as well as to some recent articles [8, 1921] developing such methods in applicative directions.

The contribution of the present paper is as follows. First we prove and discuss theoretical properties of the minimum pseudodistance estimators of multivariate location and covariance in the case of multivariate normal distribution, such as affine equivariance, B-robustness, asymptotic normality and asymptotic relative efficiency, as well as empirical properties based on Monte Carlo simulations. The behavior of the estimators depends on a tuning positive parameter α which controls the trade-off between robustness and efficiency. When the data are consistent with normality and α → 0, the estimation method corresponds to the maximum likelihood method (MLE) which is known to have full asymptotic efficiency at the model. When α > 0, the estimator gains robustness, while keeping high efficiency. The minimum pseudodistance estimators can be easily used in place of the classical mean and covariance matrix estimators, thereby providing robust and efficient mean-variance optimized portfolios. We prove asymptotic properties of portfolio weights based on minimum pseudodistance estimators, such as B-robustness and asymptotic normality. Then, on the basis of real data, we analyze different robust portfolios based on robust minimum pseudodistance estimators, studying both their in-sample and out-of-sample behavior and comparing them with some other portfolios known in literature. Among the analyzed portfolios count: optimal mean-variance portfolios using the classical MLE or minimum pseudodistance estimators or S-estimators, minimum-variance portfolios using the classical MLE or minimum pseudodistance estimators or S-estimators, as well as the equally-weighted portfolio. We considered the cases when short selling is allowed, respectively when short selling is not allowed. The out-of-sample empirical performance of the considered portfolios is evaluated using the measures: turnover, the out-of-sample portfolio variance and the out-of-sample Sharpe ratio. Our theoretical and numerical results show that the optimal portfolios based on minimum pseudodistance estimators are much more stable to extreme events than those obtained by plugging-in the MLEs and compare well with other optimal robust portfolios known in literature.

The outline of the paper is as follows: in Section 2, we shortly describe the Markowitz’s mean-variance model whose inputs are estimations of location and covariance of asset returns. The minimum pseudodistance estimators of location and covariance are introduced in Section 3. Here we prove theoretical properties of these estimators in the case of multivariate normal distribution, such as the affine equivariance and B-robustness. We also determine the asymptotic covariance matrices of the estimators and discuss the asymptotic relative efficiency. The estimators of the portfolio weights together with their asymptotic properties are presented in Section 4. In Sections 5 and 6, a Monte Carlo simulation study and then applications on real data show the advantages of the new approach. Here we illustrate both in-sample as well out-of-sample performance of the portfolios.

Portfolio optimization model

We consider a portfolio formed by N financial assets. The returns of the assets are characterized by the random vector X = (X 1, …, X N), where X i denotes the random variable associated to the return of the asset i, i = 1, …, N. Let p = (p 1, …, p N) be the vector of weights associated to the portfolio, where p i represents the proportion of the investor’s capital invested in the asset i. The total return of the portfolio is given by the random variable

pX=p1X1++pNXN.

Supposing that the random vector X follows a multivariate normal distribution NN(μ,Σ), where μ is the vector containing the mean returns of the assets and Σ is the covariance matrix of the returns of the assets, the mean of the portfolio return can be written as R(p) = p μ and the portfolio variance as S(p) = p Σp.

The Markowitz approach for optimal portfolio selection (see [1]) consists in solving the following optimization problem. For a given investor’s risk aversion λ > 0, the mean-variance optimization selects the optimal portfolio p*, solution of

argmaxp{R(p)-λ2S(p)} (1)

with the constraint p e N = 1, e N being the N × 1 vector of ones. The set of optimal portfolios for all possible values of the risk aversion parameter λ defines the mean-variance efficient frontier. The solution of the above optimization problem is explicit and the optimal portfolio weights, for a fixed value of λ, are given by

p*=1λΣ-1(μ-ηeN) (2)

where

η=eNΣ-1μ-λeNΣ-1eN.

This is the case when short selling is allowed. When short selling is not allowed, we have a supplementary constraint in the optimization problem, namely all the weights p i are positive.

When the true parameters μ and Σ and the portfolio weights are all known, then we have the true efficient frontier. An estimated efficient frontier can be obtained by using estimators of the mean and covariance matrix. Throughout this paper we denote by μ^ and Σ^ the estimators of the parameters μ and Σ, and by p*^ the estimator of the optimal portfolio weights, as resulting with Eq (2)

p*^=1λΣ^-1[μ^-eNΣ^-1μ^-λeNΣ^-1eNeN]. (3)

The mean and the covariance matrix of the returns are in practice estimated by their sample counterparts, i.e. the maximum likelihood estimators under the multivariate normal model. It is known that, under normality, the maximum likelihood estimators are the most efficient. However, in the presence of outlying observations, the asymptotic bias of these estimators can be arbitrarily large and this bias is induced to the corresponding optimal portfolio weights. For this reason, μ and Σ should be robustly estimated.

Robust estimators of the location and covariance

Minimum pseudodistance estimators

For two probability measures P and Q, admitting densities p, respectively q with respect to the Lebesgue measure, the pseudodistances that we consider are defined through

Rα(P,Q):=1α+1lnpαdP+1α(α+1)lnqαdQ-1αlnpαdQ (4)

for α > 0 and satisfy the limit relation

Rα(P,Q)R0(P,Q):=lnqpdQforα0.

Note that R 0(P, Q) coincides with the modified Kullback-Leibler divergence. More details about Eq (4) and corresponding minimum divergence estimators can be found in [10, 1416].

Let P be a parametric model with parameter space Θd and assume that every probability measure P θ in P has a density p θ with respect to the Lebesgue measure. The family of minimum pseudodistance estimators of the unknown parameter θ 0 is obtained by replacing the hypothetical probability measure P θ0 in the pseudodistances R α(P θ, P θ0) by the empirical measure P n pertaining to the sample and then minimizing R α(P θ, P n) with respect to θ on the parameter space. We note that the pseudodistances R α are also used in [22] in order to define optimal robust M-estimators using the Hampel infinitesimal approach.

Let X 1, …, X T be a sample on XNN(μ,Σ) and denote by θ = (μ, Σ) the parameter of interest. A minimum pseudodistance estimator θ^=(μ^,Σ^) of θ is defined by

θ^:=arginfθRα(Pθ,Pn)

which can be written equivalently as

θ^={argsupθ1TCα(θ)i=1Tpθα(Xi)ifα>0argsupθ1Ti=1Tlnpθ(Xi)ifα=0 (5)

where p θ is the N-variate normal density and Cα(θ)=(pθα+1dλ)αα+1. Note that, the choice α = 0 leads to the definition of the classical MLE. Throughout the paper we will also use the notation ‖xμΣ−1: = [(xμ)Σ−1(xμ)]1/2. A simple calculation shows that

Cα(θ)=(12π)Nα22(α+1)(detΣ-1)α2α+1(α+1)Nαα+1.

For α > 0, the estimator Eq (5) can be expressed as

θ^=argsupθ(detΣ-1)αα+1i=1Texp(-α2Xi-μΣ-12).

By direct differentiation with respect to μ and Σ, we see that the estimators of these parameters are solutions of the system

i=1T(Xi-μ)exp(-α2Xi-μΣ-12)=0i=1T[1α+1Σ-(Xi-μ)(Xi-μ)t]exp(-α2Xi-μΣ-12)=0

which can be rewritten as

μ=i=1Texp(-α2Xi-μΣ-12)i=1Texp(-α2Xi-μΣ-12)Xi (6)
Σ=i=1T(α+1)exp(-α2Xi-μΣ-12)i=1Texp(-α2Xi-μΣ-12)(Xi-μ)(Xi-μ). (7)

Affine equivariance

The location and dispersion estimators defined above are affine equivariant. More precisely, if μ^(X) and Σ^(X) are estimators corresponding to a sample X = (X 1, …, X T), then

μ^(AX+b)=Aμ^(X)+b (8)
Σ^(AX+b)=AΣ^(X)A (9)

for any N × N nonsingular matrix A and any bN. Indeed, let A be a nonsingular matrix, bN and Y = (Y 1, …, Y T), Y i: = AX i + b. Estimators μ^(Y) and Σ^(Y) are solutions of the system obtained from Eqs (6) and (7) by replacing X i with Y i. Then, by replacing Y i with AX i+b, we find μ^(X)=A1(μ^(Y)b) and Σ^(X)=A1Σ^(Y)(A)1. Hence, Eqs (8) and (9) hold.

Influence functions

A fundamental tool used for studying statistical robustness is the influence function. Recall that, a map T defined on a set of probability measures and parameter space valued is a statistical functional corresponding to an estimator θ^n of the parameter θ, if θ^n=T(Pn), where P n is the empirical measure associated to the sample. The influence function of T at P θ is defined by

IF(x;T,Pθ):=T(P˜εx)ε|ε=0

where P˜ɛx:=(1ɛ)Pθ+ɛδx, ɛ > 0, δ x being the Dirac measure putting all mass at x. Whenever the influence function is bounded with respect to x the corresponding estimator is called robust (see [23]).

The statistical functionals associated to the minimum pseudodistance estimators of μ and Σ are μ(P) and Σ(P) defined by the solutions of the system

(x-μ)exp(-α2x-μΣ-12)dP(x)=0[(x-μ)(x-μ)exp(-α2x-μΣ-12)--1α+1Σexp(-α2x-μΣ-12)]dP(x)=0.

This system can be rewritten under the form

w1(x-μΣ-1)(x-μ)dP(x)=0 (10)
[w2(x-μΣ-1)x-μΣ-12(x-μ)(x-μ)-w3(x-μΣ-1)Σ]dP(x)=0 (11)

where

w1(t)=exp(-α2t2),w2(t)=exp(-α2t2)t2,w3(t)=1α+1exp(-α2t2). (12)

We note that the solutions of the system given by Eqs (10) and (11), when w 1, w 2, w 3 are arbitrary weight functions, define statistical functionals of general M-estimators of (μ, Σ) (see [24] and [25]). According to the results presented in [25], the influence functions for general M-estimators of μ and Σ are given by

IF(x;μ,Pμ,Σ)=(x-μ)wμ(x-μΣ-1) (13)
IF(x;Σ,Pμ,Σ)=(x-μ)(x-μ)wη(x-μΣ-1)-Σwδ(x-μΣ-1) (14)

where

wμ(x-μΣ-1)=w1(x-μΣ-1)EP0[w1(y)+1Nw1(y)y]wη(x-μΣ-1)=N(N+2)w2(x-μΣ-1)x-μΣ-12EP0[Nw2(y)+w2(y)y]wδ(x-μΣ-1)=Nw3(x-μΣ-1)-2w2(x-μΣ-1)EP0[w2(y)y-Nw3(y)y]++(N+2)w2(x-μΣ-1)EP0[Nw2(y)+w2(y)y]

P 0 denoting the probability measure associate to the N-variate standard normal distribution and ‖⋅‖ the Euclidian norm.

For the weight functions w 1, w 2, w 3 from Eq (12), corresponding to the minimum pseudodistance estimators, we get

wμ(t)=(α+1)N+2exp(-α2t2) (15)
wη(t)=(α+1)N+4exp(-α2t2) (16)
wδ(t)=(α+1)N+2exp(-α2t2) (17)

and replacing in Eqs (13) and (14) we obtain

IF(x;μ,Pμ,Σ)=(α+1)N+2(x-μ)exp(-α2x-μΣ-12) (18)
IF(x;Σ,Pμ,Σ)=(α+1)N+4[(x-μ)(x-μ)-1α+1Σ]exp(-α2x-μΣ-12). (19)

Both influence functions are bounded with respect to x. Therefore the minimum pseudodistance estimators of μ and Σ are robust.

In Fig 1 the influence function for the first component of the minimum pseudodistance estimator of the mean is represented. Here P 0 is the bivariate standard normal law and the constant corresponding to the estimator was chosed α = 0.5. The influence function is bounded and also redescending. A major feature of the minimum pseudodistance estimators considered in this paper is that they are redescending (the influence functions tend to 0, when ∣∣x∣∣ tends to infinity) and this represents an important advantage from the robustness point of view.

Fig 1. The influence function for the first component of the minimum pseudodistance estimator of the mean.

Fig 1

Asymptotic normality

For general parametric models, under some regularity conditions, the minimum pseudodistance estimators are asymptotically normal distributed (see [16]). In this section, we derive the asymptotic covariance matrices of the mean and the covariance matrix minimum pseudodistance estimators. We adopt the influence function approach and make use of the general results for affine equivariant location and dispersion M-estimators as presented in [26] and [23].

When the observations correspond to the standard N-variate normal law P 0, under appropriate conditions, μ^ is asymptotically normal distributed with the asymptotic covariance matrix

V(μ,P0)=EP0{IF(Z;μ,P0)IF(Z;μ,P0)}=dμI (20)

where dμ:=EP0{Z2wμ2(Z)}/N and I is the identity matrix. Formula (20) has been established in [26] for general affine equivariant location M-estimators. The estimator μ^ belongs to this class. For the weight w μ from Eq (15) we get dμ=(α+1)N+2/(2α+1)N+2, hence the asymptotic covariance matrix of the minimum pseudodistance estimator μ^ is

V(μ,P0)=(α+12α+1)N+2I.

When the observations correspond to the normal law P μ, the asymptotic covariance matrix of μ^ is given by

V(μ,Pμ,Σ)=EPμ,Σ{IF(X;μ,Pμ,Σ)IF(X;μ,Pμ,Σ)}=dμΣ=(α+12α+1)N+2Σ. (21)

Similar results hold for vecs(Σ^), where vecs is the operation that stacks the N + N(N − 1)/2 non-redundant elements of Σ into a vector, as follows:

vecs(Σ):=(σ11/2,,σNN/2,σ21,σ31,,σN,N-1).

According to the results presented in [26], when the observations come from the N-variate standard normal law P 0, the asymptotic covariance matrix corresponding to an affine equivariant M-estimator of the covariance matrix is given by

V(Σ,P0)=EP0{vecsIF(Z;Σ,P0)vecsIF(Z;Σ,P0)}=dη(I-1Nww)+dτ·1Nww

where w:=(eN,0N(N1)/2), dη:=EP0{Z4wη2(Z)}/(N(N+2)) and dτ:=EP0{wτ2(Z)}/(2N) with w τ: = t 2 w η(t) − Nw δ(t), w η, w δ and w τ being specific to the M-estimator in question.

In our case, w η and w δ are given by Eqs (16) and (17), hence

wτ(t)=(α+1)N+4[t2-Nα+1]exp(-α2t2).

After some calculation, we obtain

dη=(α+12α+1)N+4anddτ=Nα2(α+1)N+22(2α+1)N+4+(α+12α+1)N+4,

therefore,

V(Σ,P0)=(α+12α+1)N+4I+α2(α+1)N+22(2α+1)N+4ww. (22)

When the observations correspond to the law P μ, the asymptotic covariance matrix of vecs(Σ^) can be established by using the formula from [23] p.282, which in our notations writes as follows

V(Σ,Pμ,Σ)=[vecs[Σ12SΣ12]vecsS]V(Σ,P0)[vecs[Σ12SΣ12]vecsΣ]. (23)

According to [23] p.272, for a given N × N matrix Σ*, it holds

[vecs[Σ12SΣ12]vecsS]vecsΣ*=vecs(Σ12Σ*Σ12).

Particularly,

[vecs[Σ12SΣ12]vecsS]vecsI=vecsΣ. (24)

Note that w=2vecsI and combining Eqs (23), (22) and (24), we get

V(Σ,Pμ,Σ)=(α+12α+1)N+4[vecs[Σ12SΣ12]vecsS][vecs[Σ12SΣ12]vecsS]++α2(α+1)N+2(2α+1)N+4vecsΣ(vecsΣ).

For symmetry reasons, the minimum pseudodistance location and covariance estimators are asymptotically uncorrelated and hence asymptotically independent. This is valid for location and covariance M-estimators in general, as it is underlined in various articles, for example in [24].

Asymptotic relative efficiency

In order to assess the efficiency of the proposed estimators with respect to that of the MLE, we adopt as measure the asymptotic relative efficiency (ARE). For a parameter θ taking values in d and an estimator θ^ which is asymptotically d-variate normal with mean θ and nonsingular covariance matrix V(θ, P), the asymptotic relative efficiency with respect to that of the MLE is defined as

ARE(θ^,P)=(detV0(θ,P)detV(θ,P))1/d,

V 0(θ, P) being the asymptotic covariance matrix of the MLE of θ when the observations follow the law P (see [27]). Although the asymptotically most efficient estimator is given by the MLE, the particular MLE can be drastically inefficient when the underlying distribution departs even a little bit from the assumed nominal distribution. Therefore the trade-off between robustness and efficiency should be carefully analyzed.

Due to the asymptotic independence of the mean and the covariance matrix minimum pseudodistance estimators, the asymptotic relative efficiency of θ^=(μ^,vecs(Σ^)) can be expressed as

ARE(θ^,Pμ,Σ)=(detV0(θ,Pμ,Σ)detV(θ,Pμ,Σ))2N(N+3)=(detV0(μ,Pμ,Σ)detV0(Σ,Pμ,Σ)detV(μ,Pμ,Σ)detV(Σ,Pμ,Σ))2N(N+3). (25)

Using Eqs (21) and (23), formula (25) can be written as

ARE(θ^,Pμ,Σ)=(detV0(μ,P0)detV0(Σ,P0)detV(μ,P0)detV(Σ,P0))2N(N+3).

A direct calculation shows that

detV(μ,P0)=(α+12α+1)N(N+2)detV(Σ,P0)=(α+12α+1)N(N+1)(N+4)2(1+Nα22(α+1)2).

Particularly, for α = 0, we find the similar quantities for the MLE, namely det V 0(μ, P 0) = 1 and det V 0(Σ, P 0) = 1. Hence

ARE(θ^,Pμ,Σ)=1(α+12α+1)N2+7N+8N+3(1+Nα22(α+1)2)2N(N+3). (26)

Note that, for fixed N and α, ARE(θ^,Pμ,Σ) is the same, whatever μ or Σ.

In Table 1 values of the asymptotic relative efficiency Eq (26) are given. As it can be seen, when N or α increases, the asymptotic relative efficiency ARE(θ^,Pμ,Σ) decreases. Therefore, values of α close to zero will provide high efficiency and in the meantime the robustness of the estimation procedure.

Table 1. Asymptotic relative efficiency of the minimum pseudodistance estimators.

N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
1 1 0.98151 0.93871 0.76904 0.63774 0.53033
2 1 0.97704 0.92429 0.72086 0.57042 0.45266
3 1 0.97273 0.91051 0.67698 0.51187 0.38814
4 1 0.96851 0.89718 0.63647 0.46018 0.33370
5 1 0.96435 0.88419 0.59879 0.41420 0.28738
6 1 0.96025 0.87148 0.56360 0.37311 0.24778
7 1 0.95619 0.85902 0.53065 0.33629 0.21380
8 1 0.95215 0.84679 0.49975 0.30322 0.18460
9 1 0.94815 0.83477 0.47073 0.27350 0.15946
10 1 0.94418 0.82294 0.44345 0.24674 0.13779

The estimator of the optimal portfolio weights

We consider the estimator p*^ of the optimal portfolio weights, as given by Eq (3), with μ^ and Σ^ minimum pseudodistance estimators.

The influence function of the estimator p*^ is proportional to the influence functions of the estimators μ^ and Σ^. More precisely,

IF(x;p*,Pμ,Σ)=-Σ-1IF(x;Σ,Pμ,Σ)p*+1λΣ-1{IF(x;μ,Pμ,Σ)++eNΣ-1[IF(x;Σ,Pμ,Σ)Σ-1μ-IF(x;μ,Pμ,Σ)]eNeNΣ-1eN++(eNΣ-1IF(x;Σ,Pμ,Σ)Σ-1eN)(eNΣ-1μ-λ)eN(eNΣ-1eN)2} (27)

where IF(x; μ, P μ) and IF(x; Σ, P μ) are those from Eqs (18) and (19). This formula is obtained by considering the statistical functional associated to the optimal portfolio weights,

p*(P)=1λΣ-1(P)[μ(P)-eNΣ-1(P)μ(P)-λeNΣ-1(P)eNeN]

where Σ−1(P) denotes the statistical functional corresponding to Σ^1, and then deriving the influence function, taking also into account that

IF(x;Σ-1,Pμ,Σ)=-Σ-1IF(x;Σ,Pμ,Σ)Σ-1.

On the basis of the direct proportionality between the influence function IF(x; p*, P μ) and the influence functions IF(x; μ, P μ) and IF(x; Σ, P μ), we deduce that the global robustness of μ^ and Σ^ is transferred to the plug-in estimator p*^.

The consistency of the estimator of the optimal portfolio weights can be obtained using continuity arguments (see also [28]).

Note that, for the market parameters (μ, Σ), Eq (1) is a convex optimization problem and possesses at least one optimal solution since the space of the feasible portfolios on which we optimize is compact. In addition, the solution is unique and is given by Eq (2). Since the problem (1) has a unique solution for each (μ, Σ) in a neighborhood of a given (μ 0, Σ0), for a fixed λ, the mapping (μ, Σ) → p* = p*(λ, μ, Σ) is continuous in a neighborhood of (μ 0, Σ0). Indeed, according to Theorem 4.2.1 from [29], the optimal set mapping is Hausdorff upper semicontinuous due to the compactness of the feasibility set and due to the continuity of the objective function. As the solution is unique, upper semicontinuity of the set-valued map yields continuity in the usual sense.

Then, for a given λ, p^*=p^*(λ,μ^,Σ^) is a consistent estimator of the true optimal portfolio weights p*=p*(λ,μ^,Σ^), if μ^,Σ^ are consistent estimators of μ and Σ. Indeed, the almost sure convergence and the convergence in probability remain valid after continuous transformations. The consistency of the estimators μ^,Σ^ is implied by arguments from the theory of M-estimators, as developed by Huber (see [24] p.176). Using the uniqueness of the optimal solution, we have the continuity of the function (μ, Σ) → p*(λ, μ, Σ), as noted above. Therefore, the consistency of the estimator of the optimal portfolio weights holds.

On the other hand, by using the multivariate Delta method, the asymptotic normality of p*^ is kept, as well. Given the i.i.d. observations X 1, …, X T from P μ, since μ^ and vecs(Σ^) are asymptotically normal and the function h(θ)=1λΣ1(μηeN) with θ = (μ ,(vecsΣ)) is differentiable, by applying the multivariate Delta method, it holds n(p*^p*)N(0,V(p*,Pμ,Σ)) where V(p*, P μ) = Dh(θ)V(θ, P μ)Dh(θ), Dh(θ) is the differential of h in θ and

V(θ,Pμ,Σ)=(V(μ,Pμ,Σ)00V(Σ,Pμ,Σ)).

Monte Carlo simulations

We performed Monte Carlo simulations in order to assess the performance of the minimum pseudodistance estimators of the mean and covariance matrix, for both contaminated and non-contaminated data.

In a first study, we considered the multivariate normal distribution NN(μ0,Σ0), with μ 0 = 0 and Σ0 a N × N matrix with variances equal to 1 and covariances all equal to 0.2. We generated samples of size T in which about (1 − ɛ)T observations are from NN(μ0,Σ0), while a smaller portion ɛT is from the contaminating distribution NN(μc,Σc) with μ c = −4 and Σc = 4Σ0. We considered N ∈ {2, 5, 10, 20, 50} and ɛ ∈ {0, 0.05, 0.1, 0.2}. For each setting, we generated 1000 samples and for each sample we computed minimum pseudodistance estimates μ^ and Σ^ corresponding to α ∈ {0, 0.1, 0.2, 0.5, 0.75, 1}.

The estimates μ^ and Σ^, which are solutions of the system of Eqs (6) and (7), were obtained using the following reweighting algorithm.

Let s ∈ {0, 1, …, s*} denotes the iteration step.

1. If s = 0

μ (s) and Σ(s) are set to be initial estimates of location and scale;

2. For 0 < s < s*,

μ(s)=i=1Twi(s-1)XiΣ(s)=i=1T(α+1)wi(s-1)(Xi-μ(s))(Xi-μ(s))

where

wi(s)=exp(-α2(Xi-μ(s))(Σ(s))-1(Xi-μ(s)))i=1Texp(-α2(Xi-μ(s))(Σ(s))-1(Xi-μ(s))).

At step 1, we used maximum likelihood estimates as initial estimates of location and covariance. For details on general convergence behavior of reweighting algorithms we refer to [30]. If α > 0, the above procedure associates low weights to the observations that disagree sensibly with the model. If α = 0, all the observations receive the same weight and the estimators are the maximum likelihood ones

μ^ML=1Ti=1TXiΣ^ML=1Ti=1T(Xi-μ^ML)(Xi-μ^ML).

We present simulation based estimates of the mean square error given by

MSE^=1nsi=1nsθ^i-θ02

where n s is the number of samples (in our case n s = 1000), θ0=(μ0,vech(Σ0)) and θ^i=(μ^i,vech(Σ^i)) is an estimation corresponding to the sample i. Here vech(Σ) is “the vector half”, namely the n(n+1)/2-dimensional column vector obtained by stacking the columns of the lower triangle of Σ, including the diagonal, one below the other.

Tables 2 and 3 present simulation based estimates of the mean square error, when the sample size is T = 10 ⋅ N, respectively when T = 100 ⋅ N.

Table 2. Simulation based estimates of the mean square error, when T = 10 ⋅ N.

ɛ = 0%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 0.343 0.358 0.384 0.559 0.804 1.177
5 0.513 0.530 0.593 1.340 4.806 5.324
10 0.760 0.817 0.945 10.471 11.389 12.197
20 1.290 1.429 2.069 29.979 38.830 47.064
ɛ = 5%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 4.425 0.888 0.533 0.593 0.849 1.202
5 18.816 1.077 0.662 1.565 4.985 5.437
10 41.312 0.951 1.022 10.646 11.470 12.600
20 145.172 1.517 2.273 30.339 39.561 47.072
ɛ = 10%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 11.554 4.605 0.945 0.694 0.923 1.294
5 43.446 4.075 0.749 1.758 5.052 5.449
10 143.395 1.325 1.091 10.720 11.454 12.648
20 503.319 1.648 2.422 30.776 39.758 47.693
ɛ = 20%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 32.696 24.612 9.955 1.118 1.190 1.475
5 132.542 53.841 1.869 2.171 5.362 5.625
10 441.209 19.233 1.241 10.751 11.751 12.745
20 1613.373 1.930 3.742 31.361 40.292 49.644

Table 3. Simulation based estimates of the mean square error, when T = 100 ⋅ N.

ɛ = 0%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 0.035 0.035 0.039 0.051 0.067 0.084
5 0.050 0.052 0.057 0.087 0.135 0.204
10 0.075 0.081 0.093 0.185 0.395 8.703
20 0.129 0.142 0.181 0.910 28.127 31.790
50 0.287 0.359 0.685 171.329 219.263 262.867
ɛ = 5%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 2.504 0.304 0.076 0.060 0.068 0.092
5 10.863 0.207 0.066 0.092 0.136 0.217
10 37.549 0.129 0.100 0.191 0.409 9.329
20 136.364 0.157 0.194 0.891 28.354 33.200
50 809.024 0.377 0.724 186.600 268.848 321.844
ɛ = 10%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 8.910 2.493 0.285 0.066 0.073 0.096
5 39.015 2.203 0.089 0.098 0.142 0.240
10 133.655 0.404 0.107 0.207 0.474 9.746
20 493.386 0.182 0.203 1.249 28.467 33.925
50 2922.483 0.400 0.772 189.327 273.664 327.170
ɛ = 20%
N α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
2 28.470 18.606 7.524 0.106 0.102 0.115
5 124.702 44.374 0.327 0.113 0.168 0.272
10 429.087 17.016 0.128 0.232 0.592 10.423
20 1576.600 0.335 0.229 3.460 29.091 34.815
50 9334.156 0.453 0.887 190.613 282.404 348.252

When there is no contamination, the MLE (α = 0) performs the best, whatever the dimension N. On the other hand, the estimations obtained with the minimum pseudodistance estimators in this case are very close to those provided by the MLE, when α is not far from zero (for example α = 0.1 and α = 0.2). In the presence of contamination, the minimum pseudodistance estimators give much better results than the MLE, in all considered cases. In most cases, the choices α = 0.1, α = 0.2 provide the best results in terms of robustness. In the meantime, these choices correspond to estimation procedures with high asymptotic relative efficiency, according to the results from Table 1. When the contamination is more pronounced, i.e. ɛ = 10% or ɛ = 20%, and the dimension N is low, i.e. N = 2, the choices α = 0.5 and α = 0.75 provide better robust estimates, but the asymptotic relative efficiencies of the corresponding estimation procedures are too low. Thus, values of α close to zero, such as α = 0.1, α = 0.2, represent choices that offer an equilibrium between robustness and efficiency. We notice that, even for highly multivariate data (N = 50 in our examples), when the number of observations is sufficiently high, the minimum pseudodistance estimators with α = 0.1, α = 0.2 give excellent results both in the case of clean data, as well as in case of contamination. The simulation results presented in Tables 2 and 3 show that increasing sample size leads to improved estimations.

Moreover, we present an example for highly correlated data. Here we considered the normal distribution NN(μ,Σ) with N = 5, μ = 0, Σ the matrix with the diagonal terms 2 and the other entries 1. The contaminating distribution is NN(μc,Σc), where μ c = 4 and Σc = Σ, the contamination level being ɛ ∈ {0, 0.1, 0.2}. In this case, the mean square errors of the estimations are given in Table 4, the number of considered samples being again n s = 1000. These results show that even in the case when the correlation level of the data is high and the data are contaminated, the minimum pseudodistance estimators give significantly better results than the MLE, the best results being obtained for α = 0.75 and α = 1.

Table 4. Simulation based estimates of the mean square error, for highly correlated data, T = 100 ⋅ N.

α = 0 α = 0.1 α = 0.2 α = 0.5 α = 0.75 α = 1
ɛ = 0% 0.1999 0.2033 0.2281 0.3311 0.5232 0.8612
ɛ = 10% 32.3561 23.0288 13.9551 1.7271 0.9666 1.1272
ɛ = 20% 101.7404 97.5348 88.9003 34.6433 5.3518 2.0940

Application for financial data

In-sample analysis

We analyze 172 monthly log-returns of 8 MSCI Indexes (France, Germany, Italy, Japan, Pacific Ex JP, Spain, United Kingdom and USA) from January 1998 to April 2012 with the aim to construct robust and efficient portfolios. The data are provided by MSCI (see [31]). Boxplots for these data are presented in Fig 2.

Fig 2. Boxplots for monthly log-returns corresponding to the MSCI Indexes.

Fig 2

(1: France, 2: Germany, 3: Italy, 4: Japan, 5: Pacific ex JP, 6: Spain, 7: United Kingdom, 8: USA.)

For these indexes, estimates of the expected return and of the variance are represented in Fig 3. Note that the estimates of the expected returns obtained with the minimum pseudodistance estimators are larger than the maximum likelihood ones. In the meantime, the minimum pseudodistance estimates of the variances are smaller than those provided by the MLE.

Fig 3. Expected returns estimates (left) and variance estimates (right) for the 8 MSCI Indexes.

Fig 3

(1: France, 2: Germany, 3: Italy, 4: Japan, 5: Pacific ex JP, 6: Spain, 7: United Kingdom, 8: USA.)

Estimates of the mean vector and of the covariance matrix computed with different minimum pseudodistance estimators are used to determine efficient frontiers. In Fig 4 we plot efficient frontiers for the case “short selling not allowed”. Similar results can be obtained in the case “short selling allowed”. In both cases, the frontiers based on the minimum pseudodistance estimations dominate those based on the classical maximum likelihood estimations, yielding portfolios with larger expected returns and smaller risks. Thus, the robust estimates reduce the volatility effects which typically affects the results of the traditional approaches.

Fig 4. Mean-variance efficient frontiers.

Fig 4

The next step in our analysis is to identify the influential observations which are responsible for the shift of the efficient frontier. We perform this study in the case “short selling”. In this sense, we use the data influence measure (DIM) as diagnostic tool (see [3]). This is defined as the Euclidian norm of the influence function of the estimator of weights based on maximum likelihood estimators of μ and Σ. More precisely,

DIM(x,p*^)=[IF(x;p*,Pμ,Σ)IF(x;p*,Pμ,Σ)]1/2

where IF(x; p*, P μ) is given by Eq (27) with IF(x; μ, P μ) and IF(x; Σ, P μ) given by the formulas (18) and (19) in the case α = 0. In order to compute DIM, the true parameters values μ, Σ, p* have to be known. In practice, these parameters should be estimated in a robust way, such that DIM is not affected by the outlying observations it is supposed to detect.

In Fig 5 (left hand side) we represent the influence of each of the 172 observations on the estimator of the optimal portfolio weights based on maximum likelihood estimators of μ and Σ. Since DIM is related to a specific portfolio on the efficient frontier, we made a choice, namely the level of the portfolio variance has been set to 0.005. This choice corresponds to λ = 3.85. The necessary robust estimates of μ, Σ, p* have been obtained with minimum pseudodistance estimators corresponding to α = 0.2. The most influential observations as detected by DIM correspond to negative economic events associated with known financial crisis periods: 1998 Russian financial crisis (August 1998), “dot-com crash” of 2000–2002 and 2007–2012 global financial crisis. On the other hand, the influence of these observations is substantially reduced when using robust procedures. This fact can be seen in the right hand side of Fig 5 where we represent the influence of each of the 172 observations on the robust estimator of the optimal portfolio weights based on the minimum pseudodistance estimators of μ and Σ corresponding to α = 0.2. Reducing the influence of outlying observations leads to optimal portfolios with higher returns and smaller variances.

Fig 5. The influence of the observations on the classical/robust estimator of portfolio weights.

Fig 5

Out-of-sample evaluation

In this subsection, we illustrate the out-of-sample stability of the proposed portfolios. For this purpose we use the same empirical data set as in the preceding subsection. The evaluated portfolios are: optimal mean-variance portfolios using the classical MLE or minimum pseudodistance estimators (MPE) or S-estimators, minimum-variance portfolios using the classical MLE or minimum pseudodistance estimators or S-estimators, as well as the equally-weighted portfolio. The S-estimators are well known robust estimators of multivariate location and covariance, allowing the flexibility of choosing the breakdown point ɛ*, which is the amount of data deviating from the reference model that an estimator can accept while giving meaningful information (see for example [32]). We considered in our study S-estimators based on the Tukey’s biweight function with breakdown point ɛ* = 0.25 or ɛ* = 0.5. For the optimal mean-variance portfolios we used λ = 3.85, as in the preceding section, but similar results can be obtained for other finite values of λ, too. The minimum-variance portfolios correspond to λ = ∞, as it is known. We consider both cases, when short selling is allowed or not.

We compare the out-of-sample empirical performance of portfolios using three measures: turnover, the out-of-sample portfolio variance and the out-of-sample Sharpe ratio. For this we apply a “rolling-horizon” procedure as in [7]. First, we choose a window over which to perform the estimation. We denote the length of the estimation window by τ < T, where T is the size of the entire data set. For our examples, we use an estimation window of τ = 100 data points, T being 172. Then, using the data in the first estimation window, we compute the weights for the considered portfolios. We repeat this procedure for the next window, by including the data for the next month and dropping the data for the earliest month. We continue doing this until the end of the data set is reached. At the end of this process, we have generated Tτ portfolio weight vectors for each strategy, that is the vectors ptk for t ∈ {τ, …, T − 1}, k denoting the strategy. For a strategy k, let pj,tk denotes the portfolio weight in asset j chosen at time t, pj,t+k the portfolio weight in asset j before rebalancing but at t+1 (considering the change in prices from t to t+1) and pj,t+1k the portfolio weight in asset j at time t+1, after rebalancing. The portfolio turnover is defined as

Turnover=1T-τ-1t=τT-1j=1N|pj,t+1k-pj,t+k|.

The weights pj,t+k are computed using the formula

pj,t+k=1+Xjt+11+(Xt+1)ptk·pj,tk

X t+1 representing the data at the time t + 1. The out-of-sample return at time t + 1, corresponding to the strategy k, is defined as (ptk)Xt+1. For each strategy k, using these out-of-sample returns, the out-of-sample variance is defined by

(σ^k)2=1T-τ-1t=τT-1((ptk)Xt+1-μ^k)2withμ^k=1T-τt=τT-1(ptk)Xt+1

and the Sharpe ratio is defined by SR^k=μ^kσ^k.

In Tables 5 and 6 we report the turnover, the out-of-sample variance and the Sharpe ratio for optimal mean-variance portfolios (with λ = 3.85) and for minimum-variance portfolios in the case “short selling allowed”, respectively in the case “short selling not allowed”. For estimation of the mean and covariance we considered the classical MLE, minimum pseudodistance estimators for several values of α, as well as S-estimators. We note that the portfolios that minimize the variance have better turnover than the portfolios that optimize the trade-off between mean and variance, confirming the fact that the policies that ignore estimates of the expected returns lead to better stability results in the out-of-sample analysis. The robust portfolios using minimum pseudodistance estimators attain higher Sharpe ratios and lower turnover than the traditional portfolios using MLE, the influence of asset returns that deviate from normality being more reduced.

Table 5. Turnover, out-of-sample portfolio variance and Sharpe ratio for optimal mean-variance portfolios (Mean-var) and minimum-variance portfolios (Min-var), corresponding to different estimators of mean and covariance, in the case “short selling allowed”.

Mean-var Min-var
Estimators Turnover (σ^k)2 SR^k Turnover (σ^k)2 SR^k
MLE 1.5665 0.0105 0.0368 0.2507 0.0021 -0.0378
MPE, α = 0.1 1.4717 0.0161 0.0727 0.2109 0.0020 -0.0282
MPE, α = 0.2 1.9083 0.0292 0.0880 0.1838 0.0019 -0.0322
MPE, α = 0.25 2.4381 0.0400 0.0911 0.1753 0.0019 -0.0367
SE, ɛ* = 0.25 1.4119 0.0126 0.0631 0.2283 0.0020 -0.0291
SE, ɛ* = 0.5 1.5506 0.0218 0.0878 0.1966 0.0019 -0.0302

Table 6. Turnover, out-of-sample portfolio variance and Sharpe ratio for optimal mean-variance portfolios (Mean-var) and minimum-variance portfolios (Min-var), corresponding to different estimators of mean and covariance, in the case “short selling not allowed”.

Mean-var Min-var
Estimators Turnover (σ^k)2 SR^k Turnover (σ^k)2 SR^k
MLE 0.2068 0.0054 0.0032 0.0519 0.0025 -0.0304
MPE, α = 0.1 0.1510 0.0060 0.0186 0.0405 0.0025 -0.0264
MPE, α = 0.2 0.1475 0.0059 0.0170 0.0370 0.0025 -0.0220
MPE, α = 0.25 0.1562 0.0059 0.0138 0.0364 0.0026 -0.0199
MPE, α = 0.3 0.1594 0.0059 0.0129 0.0364 0.0026 -0.0184
MPE, α = 0.5 0.1084 0.0058 0.0254 0.0363 0.0026 -0.0148
MPE, α = 0.75 0.0235 0.0060 0.0319 0.0511 0.0028 -0.0050
MPE, α = 1 0.0143 0.0060 0.0345 0.1219 0.0028 -0.0259
SE, ɛ* = 0.25 0.1771 0.0058 0.0125 0.0435 0.0025 -0.0273
SE, ɛ* = 0.5 0.1320 0.0059 0.0173 0.0384 0.0026 -0.0224

When short selling is allowed, the mean-variance portfolios with the best turnover are those corresponding to the S-estimators with ɛ* = 0.25, respectively to the minimum pseudodistance estimators with α = 0.1, the Sharpe ratios of these portfolios being very close. When short selling is not allowed, the best mean-variance portfolios in terms of turnover and Sharpe ratio are those corresponding to the minimum pseudodistance estimators with α = 0.75 and α = 1. In the case of minimum-variance portfolios, the best turnover is obtained for minimum pseudodistance estimators (α = 0.2, 0.25, 0.3, 0.5) and the best Sharpe ratio is obtained for the minimim pseudodistance estimators with α = 0.75, both in the case “short selling not allowed”.

We also compare our results with those corresponding to the equally-weighted portfolio which is known to have a good out-of-sample behavior, even in the case of contaminated data. For our data set and this portfolio, we obtained: Turnover = 0.02234, (σ^k)2=0.0025 and SR^k=0.0272. Note that, among all the considered portfolios, the equally-weighted portfolio has the smallest turnover. In the meantime, all the minimum-variance portfolios using minimum pseudodistance estimators, in the case “short selling not allowed”, have better Sharpe ratio than the equally-weighted portfolio. For example, the minimum-variance portfolio using minimum pseudodistance estimators with α = 0.5 represents a good choice in terms of turnover and Sharpe ratio.

The boxplots of portfolio weights give a graphical representation of the stability of the different portfolio policies. By applying the “rolling-horizon” procedure, we obtain Tτ portfolio weight vectors for each strategy. In Figs 6 and 7, each boxplot represent the variability of the weight assigned to a particular asset in a minimum-variance portfolio. As it can be seen, the robust minimum-variance portfolios based on minimum pseudodistance estimators are characterized by a better out-of-sample stability than the classical minimum-variance portfolios. We choose for these examples α = 0.25, but we obtained similar results for other values of α, too. Also, in this example, the weights of the minimum-variance portfolio based on minimum pseudodistance estimators are more stable than the weights of the minimum-variance portfolio based on S-estimators (although the out-of-sample behaviors of the portfolios based on these two type of estimators are very close).

Fig 6. Boxplots of the weights corresponding to minimum-variance portfolios.

Fig 6

Different estimators of the covariance matrix are used, in the case “short selling allowed”.

Fig 7. Boxplots of the weights corresponding to minimum-variance portfolios.

Fig 7

Different estimators of the covariance matrix are used, in the case “short selling not allowed”.

Our theoretical and numerical results show that the optimal portfolios based on minimum pseudodistance estimators are much more stable to extreme events than those obtained by plugging-in the MLEs and compare well with other optimal robust portfolios. When α is not far from 0, the minimum pseudodistance estimators of μ and Σ combine robustness with high efficiency and these qualities are transferred to the portfolio weights estimator. The numerical results based on simulations or real data show that α = 0.2, 0.25 represent good choices in terms of robustness and efficiency, but also higher values of α lead to good results in some situations. All these aspects recommend the new procedure as a viable alternative to existing robust portfolio selection methods.

Supporting Information

S1 Appendix. Appendix.

Codes of the Figures.

(DOC)

Data Availability

Data were obtained from the third party MSCI (https://www.msci.com/web/msci). Boxplots for relevant data are included within the paper, and interested readers may obtain the full dataset by contacting Orsolya Czintos at MSCI: Orsolya.Czintos@msci.com

Funding Statement

This work was supported by a grant of the Romanian National Authority for Scientific Research, CNCS-UEFISCDI, project number PN-II-RU-TE-2012-3-0007 (http://uefiscdi.gov.ro) (AT). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Markowitz HM. Mean-variance analysis in portfolio choice and capital markets. J Finance. 1952;7: 77–91. [Google Scholar]
  • 2. Michaud R. The Markowitz optimization enigma: is optimized optimal? Financ Anal J. 1989;45: 31–42. 10.2469/faj.v45.n1.31 [DOI] [Google Scholar]
  • 3.Perret-Gentil C, Victoria-Feser MP. Robust mean-variance portfolio selection; 2005. Preprint. Available: FAME Research Paper, no. 140. papers.ssrn.com/sol3/papers.cfm?abstract_id = 721509
  • 4. Fabozzi FJ, Huang D, Zhou G. Robust portfolios: contributions from operations research and finance. Ann Oper Res. 2010;176: 191–220. 10.1007/s10479-009-0515-6 [DOI] [Google Scholar]
  • 5. Vaz-de Melo B, Camara RP. Robust multivariate modeling in finance. Int J Manag Finance. 2005;4: 12–23. [Google Scholar]
  • 6. Welsch RE, Zhou X. Application of robust statistics to asset allocation models. Revstat Stat J. 2007;5: 97–114. [Google Scholar]
  • 7. DeMiguel V, Nogales FJ. Portfolio selection with robust estimation. Oper Res. 2009;57: 560–577. 10.1287/opre.1080.0566 [DOI] [Google Scholar]
  • 8.Ferrari D, Paterlini S. Efficient and robust estimation for financial returns: an approach based on q-entropy; 2010. Preprint. Available: Materiali di discussione, no. 623, Universita degli Studi di Modena e Reggio Emilia. www.dep.unimore.it/materiali_discussione/0623.pdf
  • 9. Basu A, Harris IR, Hjort NL, Jones MC. Robust and efficient estimation by minimizing a density power divergence. Biometrika. 1998;85: 549–559. 10.1093/biomet/85.3.549 [DOI] [Google Scholar]
  • 10. Jones MC, Hjort NL, Harris IR, Basu A. A comparison of related density-based minimum divergence estimators. Biometrika. 2001;88: 865–873. 10.1093/biomet/88.3.865 [DOI] [Google Scholar]
  • 11. Broniatowski M, Keziou A. Parametric estimation and tests through divergences and the duality technique. J Multivar Anal. 2009;100: 16–36. 10.1016/j.jmva.2008.03.011 [DOI] [Google Scholar]
  • 12. Toma A, Leoni-Aubin S. Robust tests based on dual divergence estimators and saddlepoint approximations. J Multivar Anal. 2010;101: 1143–1155. 10.1016/j.jmva.2009.11.001 [DOI] [Google Scholar]
  • 13. Toma A, Broniatowski M. Dual divergence estimators and tests: Robustness results. J Multivar Anal. 2011;102: 20–36. 10.1016/j.jmva.2010.07.010 [DOI] [Google Scholar]
  • 14. Fujisawa H, Eguchi S. Robust parameter estimation with a small bias against heavy contamination. J Multivar Anal. 2008;99: 2053–2081. 10.1016/j.jmva.2008.02.004 [DOI] [Google Scholar]
  • 15. Basu A, Shioya H, Park C. Statistical inference: the minimum pseudodistance approach. Florida: CRC Press, Boca Raton; 2011. [Google Scholar]
  • 16. Broniatowski M, Toma A, Vajda I. Decomposable pseudodistances and applications in statistical estimation. J Stat Plan Inference. 2012;142: 2574–2585. 10.1016/j.jspi.2012.03.019 [DOI] [Google Scholar]
  • 17. Toma A, Leoni-Aubin S. Portfolio selection using minimum pseudodistance estimators. Econ Comput Econ Cybern Stud Res. 2013;47: 97–112. [Google Scholar]
  • 18. Pardo L. Statistical inference based on divergence measures. CRC Press, Taylor & Francis; 2005. [Google Scholar]
  • 19. Vonta F, Karagrigoriou A. Generalized measures of divergence in survival analysis and reliability. J Appl Probab. 2010;47: 216–234. 10.1239/jap/1269610827 [DOI] [Google Scholar]
  • 20. Preda V, Dedu S, Sheraz M. New measure selection for Hunt-Devolder semi-Markov regime switching interest rate models. Physica A. 2014;407: 350–359. 10.1016/j.physa.2014.04.011 [DOI] [Google Scholar]
  • 21. Preda V, Dedu S, Gheorghe C. New classes of Lorenz curves by maximizing Tsallis entropy under mean and Gini equality and inequality constraints. Physica A. 2015; 436: 925–932. 10.1016/j.physa.2015.05.092 [DOI] [Google Scholar]
  • 22. Toma A, Leoni-Aubin S. Optimal robust M-estimators using Renyi pseudodistances. J Multivar Anal. 2013;115: 359–373. 10.1016/j.jmva.2012.10.003 [DOI] [Google Scholar]
  • 23. Hampel FR, Ronchetti E, Rousseeuw PJ, Stahel W. Robust statistics: the approach based on influence functions. New York: Wiley; 1986. [Google Scholar]
  • 24. Huber PJ. Robust covariances In: Gupta S, Moore D, editors. Statistical decision theory and related topics II. New York: Academic Press; 1977. pp. 165–191. [Google Scholar]
  • 25. Jaupi L, Saporta G. Using the influence function in robust principal component analysis In: Morgenthaler S, Ronchetti E, Stahel WA, editors. New directions in statistical data analysis and robustness. Verlag: Birchauser; 1993. pp. 147–156. [Google Scholar]
  • 26. Gervini D. The influence function of the Stahel-Donoho estimator of multivariate location and scatter. Stat Probab Lett. 2002;60: 425–435. 10.1016/S0167-7152(02)00315-2 [DOI] [Google Scholar]
  • 27. Serfling R. Asymptotic relative efficiency in estimation In: Lovric M, editor. International encyclopedia of statistical sciences. New-York: Springer; 2011. pp. 68–72. [Google Scholar]
  • 28. Schöttle K, Werner R. Consistency of robust portfolio estimators. OR and Management Sciences. 2006; pp. 1–23. [Google Scholar]
  • 29. Bank B, Guddat J, Klatte D, Kummer B, Tammer K. Non-linear parametric optimization. Basel: Birchauser; 1983. [Google Scholar]
  • 30. Arslan O. Convergence behavior of an iterative reweighting algorithm to compute multivariate M-estimates for location and scatter. J Stat Plan Inference. 2004;118: 115–128. 10.1016/S0378-3758(02)00402-0 [DOI] [Google Scholar]
  • 31.www.msci.com
  • 32. Davies PL. Asymptotic behavior of S-estimators of multivariate location parameters and dispersion matrices. Ann Stat. 1987;15: 1269–1292. 10.1214/aos/1176350505 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix. Appendix.

Codes of the Figures.

(DOC)

Data Availability Statement

Data were obtained from the third party MSCI (https://www.msci.com/web/msci). Boxplots for relevant data are included within the paper, and interested readers may obtain the full dataset by contacting Orsolya Czintos at MSCI: Orsolya.Czintos@msci.com


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES