Skip to main content
Taylor & Francis Open Select logoLink to Taylor & Francis Open Select
. 2016 Jan 8;50(4):917–929. doi: 10.1080/02331888.2015.1111892

Optimal designs for copula models

E Perrone a, WG Müller a,*
PMCID: PMC4936440  PMID: 27453616

Abstract

Copula modelling has in the past decade become a standard tool in many areas of applied statistics. However, a largely neglected aspect concerns the design of related experiments. Particularly the issue of whether the estimation of copula parameters can be enhanced by optimizing experimental conditions and how robust all the parameter estimates for the model are with respect to the type of copula employed. In this paper an equivalence theorem for (bivariate) copula models is provided that allows formulation of efficient design algorithms and quick checks of whether designs are optimal or at least efficient. Some examples illustrate that in practical situations considerable gains in design efficiency can be achieved. A natural comparison between different copula models with respect to design efficiency is provided as well.

Keywords: copulas, design measure, Fisher information, stochastic dependence, clinical trials

AMS Subject Classification: 62K05

1. Introduction

Due to their flexibility in describing dependencies and the possibility of separating marginal and joint effects copula models have become a popular device for coping with multivariate data. in many areas of applied statistics eg. for insurances,[1] econometrics,[2] medicine,[3] marketing,[4] spatial extreme events,[5] time series analysis,[6] even sports [7] and particularly in finance.[8]

The concept of copulas, however, has only been rarely employed in experimental design with notable exceptions of spatial design in [9,10], and sequential trials in [11]. The design question for copula parameter estimation has to our knowledge just been raised in [12], where a brute-force simulated annealing optimization was employed for the solution of a specific problem. By this paper we provide the necessary theory for fully embedding the situation into optimal design theory. Particularly we provide a Kiefer–Wolfowitz type equivalence theorem [13] in Section 5 as a basis for a substantial analysis of the arising issues in the example sections.

To be more concrete, let us consider a vector xT=(x1,,xr)X of control variables, where XRr is a compact set. The results of the observations and of the expectations in a regression experiments are the vectors:

y(x)=(y1(x),,ym(x)),E[Y(x)]=E[(Y1,,Ym)]=η(x,β)=(η1(x,β),,ηm(x,β)),

where β=(β1,,βk) is a certain unknown vector of marginal parameters to be estimated and ηi (i=1,,m) are known functions. Let us call FYi(yi(x,β)) the marginal cumulative distributions of each Yi for all i=1,,m and fY(y(x,β),α) the joint probability density function of the random vector Y, where α=(α1,,αl) are unknown (copula) parameters. In the remainder of the paper we will focus on the case m=2, but generalizations of our results are possible.

Definition 2.

Let I=[0,1]. A two-dimensional copula (or 2-copula) is a bivariate function C:I×II with the following properties:

  1. for every u1, u2I
    C(u1,0)=0,C(u1,1)=u1,C(0,u2)=0,C(1,u2)=u2; (1)
  2. for every u1, u2, u3, u4I such that u1u3 and u2u4,
    C(u3,u4)C(u3,u2)C(u1,u4)+C(u1,u2)0.

Now let FY be a joint cumulative distribution function (cdf) with marginal cdfs FY1 and FY2. According to Sklar's theorem [14] there exists then a 2-copula C such that

FY(y1,y2)=C(FY1(y1),FY2(y2)) (2)

for all reals y1, y2. If FY1 and FY2 are continuous, then C is unique; otherwise, C is uniquely defined on Ran(FY1)×Ran(FY2). Conversely, if C is a 2-copula and FY1 and FY2 are distribution functions, then the function FY given by Equation (2) is a joint distribution with marginals FY1 and FY2.

2. Design issues

We need to quantify the amount of information on both (trend and copula) sets of parameters α and β respectively from the regression experiment embodied in the Fisher information matrix, which for an elemental information at a particular control x in the sense of Atkinson et al. [15] is a (k+l)×(k+l) matrix defined as

m(x,β,α)=mββ(x)mβα(x)mβαT(x)mαα(x), (3)

where the submatrix mββ(x) is the (k×k) matrix with the (i,j)th element defined as

E2βiβjlog[fY(y(x,β),α)], (4)

and the submatrices mβα(x) (k×l) and mαα(x) (l×l) are defined accordingly. Here we model the dependence between Y1 and Y2 with a copula function Cα(FY1(y1(x,β)),FY2(y2(x,β))) and find the joint density of the random variables from

fY(y(x,β),α)=2y1y2Cα(FY1(y1(x,β)),FY2(y2(x,β)).

Definition 4.

For a concrete (discrete) experiment with N independent observations at nN support points x1,,xn, the corresponding total information matrix is

M(ξ,β,α)=N1i=1nwim(xi,β,α),i=1nwi=1,ξ=x1xnw1wn,

with so-called design weights wi.

The aim of approximate optimal design theory is concerned with finding an optimal design measure ξ(β,α), such that it maximizes some scalar function φ(M(ξ,β,α)), the so-called design criterion. In the following we will consider only D-optimality, that is, the criterion φ(M)=logdetM, if M is non-singular. There exist several well written monographs on optimal design theory and its application, but in this paper we follow mainly the style and notation of Silvey.[16]

3. Equivalence theory

The cornerstone of a theoretical investigation into optimal design is usually the formulation of a Kiefer–Wolfowitz type equivalence relation, which is given in the following theorem. It is a generalized version of a theorem given without proof in [17] and follows from a multivariate version of the basic theorem given in [16], its full proof can be found in the Appendix.

Theorem 3.1.

Denote by (β¯,α¯) fixed values (local guesses) for the parameter vector. Then, the following properties are equivalent:

  • ξ is D-optimal;

  • tr[M(ξ,β¯,α¯)1m(x,β¯,α¯)](k+l), xX;

  • ξ minimize maxxXtr[M(ξ,β¯,α¯)1m(x,β¯,α¯)], over all ξΞ.

This theorem provides simple checks for D-optimality through the maxima of

d(x,ξ)=tr[M(ξ,β¯,α¯)1m(x,β¯,α¯)],

which is usually called sensitivity function. It also allows us the use of standard design algorithms such as of the Fedorov-Wynn-type,[18,19] which will yield an optimal approximate design ξ=x1xnw1wn.

Note that these resulting optimal designs will now depend not only upon the marginal model structure, but also upon the chosen copula and through the induced nonlinearities potentially also on the unknown parameter values for α and β, which is why we are resorting to localized designs around the values (β¯,α¯). A sensitivity analysis with respect to the effect of these choice on a particular example can be found in [20].

Definition 6.

For the comparison of designs define D-Efficiency of the design ξ with respect to the design ξ as the ratio

|M(ξ,β¯,α¯)||M(ξ,β¯,α¯)|1/(k+l), (5)

where (k+l) is the number of the model parameters. We will report all our findings in percentage losses of these D-efficiencies.

4. Examples

A main question now of course concerns whether ignorance or wrong guesses of copula function and/or parameters may lead to inefficiencies of the designs.

4.1. Tools

For that purpose let us here give the list of copulas used in our examples (for more details see, eg. [21] or [22]). We provide the copula function along with the so-called Kendall's τ, which is a dependence measure that allows us to conveniently relate different copulas (for a definition and a more exhaustive comparison see [23]).

Definition 8.

  1. Product Copula, which represents the independence case:
    C(u1,u2)=u1u2,
    with τ=0.
  2. Gaussian Copula:
    Cα(u1,u2)=12Π1α2Φ1(u1)Φ1(u2)expz122αz1z2+z222(1α2)dz1dz2,
    with α[1,1] and τ=(2/Π)arcsin(α).
  3. Farlie–Gumbel–Morgenstern (FGM):
    Cα(u1,u2)=u1u2[1+α(1u1)(1u2)],
    with α[1,1] and τ=29α.
  4. Clayton:
    Cα(u1,u2)=[max(u1α+u2α1,0)](1/α),
    with α(0,+) and τ=α/(α+2).
  5. Frank:
    Cα(u1,u2)=1αln1+(eαu11)(eαu21)eα1,
    with α(,+), and τ=1(4/α)(1(1/α)0α(t/(et1))dt).
  6. Gumbel:
    Cα(u1,u2)=exp([(lnu1)α+(lnu2)α]1/α),
    with α[1,+) and τ=(α1)/α.

4.2. The linear case

Let us first consider a simple example reported in [18]. For each design point x[0,1], we may observe an independent pair of random variables Y1 and Y2, such that

E[Y1(x)]=β1+β2x+β3x2,E[Y2(x)]=β4x+β5x3+β6x4,

which is linear in β and has dependence described by the product copula with Gaussian margins. Since this case is covered by Theorem 1, we were able to compute the optimal design ξ by a standard algorithm and we display it in Figure 1 along with its sensitivity function. As rather typical ξ is supported on only a small number (here four) of design points, smaller than the number of parameters. From the sensitivity function we can see that it is indeed optimum as it reaches (and not exceeds) the number of parameters at all design points. Furthermore our optimal design coincides with the one reported in [18], namely

ξ=xiwi=00.380.761.00.160.280.230.33. (6)

Figure 1.

Figure 1.

Sensitivity function (left axis) and optimal design (right axis) for the Fedorov example.

Let us consider a more general case, for which the joint distribution is described by a Gaussian copula and we thus allow the random variables Y1 and Y2 to be dependent. In this case the joint probability function of the random vector Y=(Y1,Y2) is simply

FY(y1,y2)=Cα(Φ(y1η1(x,β)),Φ(y2η2(x,β)))=Φ2(y1η1(x,β),y2η2(x,β);α), (7)

where Φ2(,;α) denotes the bivariate normal cdf with correlation α(1,1) and Φ denotes the cdf of the standard normal distribution N(0,1) (see [24]).

Our computations gave rise to the following

Corollary 4.2

For different values of α the optimal design is the same as for the independence case, which is the Gaussian case with α=0.

Note, that the sensitivity function now has a different scaling (with a maximum at 7) as we have an additional copula parameter. This corollary, however, is hardly surprising as this fact coincides with the classic findings for the multivariate Gaussian distribution by Krafft and Schaefer.[25]

But now for a contrast consider the FGM copula. Following our approach, we must calculate the density corresponding to the function:

Cα(Φ(Y1(x;β)),Φ(Y2(x;β)))=Φ(Y1(x;β))Φ(Y2(x;β))×[1+α(1Φ(Y1(x;β)))(1Φ(Y2(x;β)))],

which eventually leads to expressions like

E2βiβjlog2y1y2Cα(Φ(Y1(x;β)),Φ(Y2(x;β)))

for the information matrix. These integrals are not analytically solvable, but we can evaluate them numerically and we can use the algorithm in order to find the optimum designs.

Not surprisingly those optimal designs do depend upon the choice of α¯ (i.e. the assumed dependence) – and similar calculations can be performed for other copula functions as well. Some results are subsumed in Table 1, which displays the loss in D-efficiency that occurs by using the optimal design ξ from Equation (6) compared to the respective optimal designs for various copula models and Kendall's τ. It can be seen that these losses are generally quite small for all considered copulas.

Table 1. Losses in D-efficiency (in bold) by ignoring the dependence in per cent.

  FGM Clayton Frank
τ α¯ D-eff α¯ D-eff α¯ D-eff
0.15 0.67 0.29 n.d. −1.37 0.10
0.10 0.45 0.23 n.d. −0.90 0.10
0.05 0.22 0.59 n.d. −0.45 0.10
0.05 0.22 0.68 0.10 0.16 0.45 0.10
0.10 0.45 0.39 0.22 0.13 0.90 0.10
0.15 0.67 0.28 0.35 0.34 1.37 0.10
0.35 n.d. 1.08 0.11 3.51 0.11
0.75 n.d. 6.00 0.27 14.13 0.16

4.3. A binary bivariate model

In order to better investigate the role of the copula parameter, we analyse a more elaborate example with potential applications in clinical trials. Let us formally introduce the model. We consider a bivariate binary response (Yi1,Yi2), i=1,,n with four possible outcomes {(0,0),(0,1),(1,0),(1,1)} where 1 usually represents a success and 0 a failure (of eg. a drug treatment). For a single observation denote the joint probabilities of Y1 and Y2 by py1,y2=pr(Y1=y1,Y2=y2) for (y1,y2=0,1). In a clinical trial context Y1 and Y2 could represent efficacy and toxicity of a tested drug.

Now, define

p11=Cα(π1,π2),p10=π1p11,p01=π2p11,p00=1π1π2+p11. (8)

The complete log-likelihood for the bivariate binary model is then given by

l(θ;y)=i=1nwili(θ;y),θ=(β1,β2,α), (9)

where β1 and β2 are the parameters associated with the respective margins and the log-likelihood for a single observation is given by

li(θ;y)=y1y2logp11+y1(1y2)logp10+(1y1)y2logp01+(1y1)(1y2)logp00. (10)

As shown in [26] the Fisher information matrix for a single observation can then be written as

M(θ,ξi)=pθTP1+11p11p10p01eeTpθ, (11)

where p=(p11,p10,p01), P=diag(p) and e=(1,1,1)T. Some useful formulae for calculating information matrices in copula models can also be found in [27].

A particular case of the introduced model has already been analysed in [17]. In that work, the authors assume the marginal probabilities of success given by the models

logπi1πi=βi1+βi2x,i=1,2 (12)

with x[0,10] and ‘localized’ parameters β¯1=[1,1] and β¯2=[2,0.5]. The considered joint cdf is the Gumbel cdf, which corresponds to the following choice for the probability of success p11:

p11=FY1,Y2(π1,π2)=π1π2(1+α(1π1)(1π2)),

that is, in terms of copulas, the FGM copula.

In [17], the choice of this model is highlighted by arguing that it allows for dependence between efficacy and toxicity and it is claimed that including estimation of α rather then independently analysing efficacy and toxicity is preferable. We reanalyse this example studying both the role of the copula parameter and the impact of a particular type of dependence structure on the D-optimal designs obtained.

To get a clearer idea of the role played by the copula parameter, let us first focus on the benchmark case of independence, described both by the Product copula and the FGM copula with α=0. However, using these two copulas has substantially different interpretation and effect. In the one case (Product copula) we completely ignore potential dependence, whereas in the other case (FGM copula) we allow for its estimation, but assume it inexistent.

In a second step, to examine the impact of the dependence structure, we compare the model analysed in [17] with the more general ones proposed in [12]. Those have the same assumptions on the marginals probabilities as in [17], as well as same design space and initial parameter for the betas, but the dependencies are instead represented by using the copulas Frank, Gumbel, and Clayton. Note that in [12] the authors employed a brute-force simulated annealing algorithm for their calculations and had no means for checking definitive optimality, which is now possible through the equivalence theorem (Theorem 5.1) provided.

Now, using the D-optimal designs for the FGM copula and for the Product copula as benchmarks, we note the losses in D-efficiency in per cent as reported respectively in Table 2 (Product copula) and in Table 3 (FGM). In both cases, the losses are much stronger than in the previous example.

Table 2. Losses in D-efficiency (in bold) by ignoring the dependence in per cent (product copula).

  Frank Clayton Gumbel
τ α¯ D-eff α¯ D-eff α¯ D-eff
0.11 1.00 1.72 0.24 1.75 1.12 0.95
0.45 5.00 1.31 1.68 1.49 1.84 1.29
0.66 10.00 1.87 3.98 0.71 3.00 2.31
0.76 15.00 2.89 6.42 2.84 4.21 2.99
0.82 20.00 3.10 8.89 9.48 5.45 3.25

Table 3. Losses in D-efficiency (in bold) in per cent with respect to the FGM copula (α=0).

  Frank Clayton Gumbel
τ α¯ D-eff α¯ D-eff α¯ D-eff
0.11 1.00 0.01 0.24 2.42 1.12 0.87
0.45 5.00 0.36 1.68 1.13 1.84 0.5
0.66 10.00 3.18 3.98 1.34 3.00 2.84
0.76 15.00 5.63 6.42 5.54 4.21 5.13
0.82 20.00 6.24 8.89 13.94 5.45 6.12

Analysing these results by focusing on the Frank and the Gumbel copulas, one can notice that lower losses in Table 3 correspond to the lower values of τ. Conversely, the losses already become much higher in Table 3 for a moderate level of the association τ. Moreover, looking at the results for the Clayton copula, we even have lower losses by ignoring the dependence for almost all the levels of the association τ. This suggests, at first glance, that it is not generally preferable to insert a dependence parameter to be estimated, since this might increase the losses when the model is chosen badly.

Interestingly, if the copula parameter is not estimated, that is, if the model is just a four parameter model, the optimal designs found are almost the same for all the investigated copulas, and can be represented by

ξ=xiwi=02.806.790.420.360.22.

Evidently, the structure of the dependence has an impact as soon as its parameter requires estimation. In Figure 2 we display the designs and sensitivity functions for a representative case contrasting the very different optimal designs for a Clayton and a Gumbel copula with identical Kendall's τ. It will therefore be of great practical value to compare different copulas with respect to their optimal design properties (as well as eventually to be able to efficiently discriminate between them).

Figure 2.

Figure 2.

The optimal designs and the sensitivity functions for the binary example (Clayton left, Gumbel right). The copula parameters chosen correspond to Kendall's τ=0.816.

A first such step of comparing different copula models was taken in [12] where the authors evaluate designs for various copula choices against each other (in their Table 8). However, they have been using the same parameter values for all the copulas without considering the different meaning of the copula parameter for various copula families. Therefore, we instead provide in Table 4 an improved comparison between different dependence structures along the same Kendall's τ values by exploiting the relationship between the copula parameter to the measure of concordance τ. Thanks to this comparison, now the pure impact of the choice of the copula is highlighted. It turns out, that even in extreme case the efficiency losses are only small to moderate. They are greatest if Frank or Gumbel are used instead of Clayton, which may be explained by their opposing representations of tail dependencies.

Table 4. Losses in D-efficiency (in bold) by comparing the true copula model with the assumed one for a fixed Kendall's τ.

True Copula Frank Clayton Gumbel
Assumed Copula Clayton Gumbel Frank Gumbel Frank Clayton
τ=0.11 2.24 0.67 1.99 2.70 0.82 2.75
τ=0.45 0.26 0.03 0.26 0.11 0.03 0.15
τ=0.66 1.09 0.11 1.04 1.28 0.14 1.57
τ=0.76 4.27 0.02 3.87 4.08 0.01 4.73
τ=0.82 8.24 0.01 10.91 10.96 0.01 8.43

4.4. A more flexible model

Let us now allow the strength of the dependence itself be dependent upon the regressors x, a situation completely covered by our equivalence theorem. Thus here the copula parameters themselves become model-dependent such as, for example, in [28]. As in our context only positive associations (between efficacy and toxicity) make sense we consider in the following the τ modelled by a logistic:

τ(x,α1)=eα1x1+eα1x, (13)

which takes values in [0,1] for α1[0,1].

Considering the Archimedian copulas Clayton C1 and Gumbel C2, the following relationships between the Kendall's τ and the copula parameters hold:

τC1(x,α1)=γ1(x,α1)γ1(x,α1)+2andso γ1(x,α1)=2τC1(x,α1)1τC1(x,α1)for the Clayton family;τC2(x,α1)=γ2(x,α1)1γ2(x,α1)andso γ2(x,α1)=11τC2(x,α1)for the Gumbel family.

Then, we model the current probability of success by a convex combination of the Clayton and the Gumbel copulas

C(π1,π2;α1,α2)=α2C1(π1,π2;γ1(x,α1))+(1α2)C2(π1,π2;γ2(x,α1)),

and when we link them at the same τ values we end up with

C(π1,π2;α1,α2)=α2C1(π1,π2;2eα1x)+(1α2)C2(π1,π2;1+eα1x).

The added flexibility of such a model is that the impact of the dependence structure and the association level is reflected by two different parameters. While the α2 parameter is strictly related to the structure of the dependence, the α1 parameter is only related to the measure of association Kendall's τ.

In Table 5 we report the efficiency losses with respect to the independence case (Product Copula). By fixing three localized values α1¯, we assume three different intervals for τ. Corresponding to these intervals we fixed four localized values α2¯. From the table it is clear that when the range of the τ increases, the losses in terms of D-efficiency can become quite substantial. By focusing on the various localized values for α2, it is evident that also the structure of the dependence plays a big role in the design obtained. Here, we can see that when the highest weight in the convex combination is on the Clayton copula, the efficiency losses are lowest.

Table 5. Losses in D-efficiency for the convex combination model in per cent (in bold).

  τ[0.5,0.95] τ[0.5,0.99] τ[0.5,0.995]
α2¯ Loss in D-eff. Loss in D-eff. Loss in D-eff.
0.1 11.15 31.97 47.22
0.3 6.51 25.11 40.33
0.6 2.50 17.20 32.27
0.9 1.11 10.90 24.96

5. Discussion

In general, our theory forms the basis to investigate further showcase examples from the literature, like, for example, in [29] or eventually treat mixed discrete/continuous type models like in [30]. Particularly for the latter, but also quite generally the methods provided in this paper can thus be expected to be valuable for real applications from clinical trials, environmental sampling, industrial experiments, etc.

Although here we provide only examples on limited types of copulas, we might expect similar or greater effects for some more non-symmetric copulae (see eg. [31]), which are subject to our current investigations.

Note that in the convex combination example by focusing on α2 we could find designs with the sole purpose of efficiently discriminating between different copula models, which we plan to do future research on.

Acknowledgements

We thank F. Durante, M. Stehlík, L. Pronzato, J. Rendas and E.P. Klement for fruitful discussions and a referee for constructive remarks.

Appendix

Equivalence theorem

For all the basics in what follows cf.[16] For a given vector of parameters (β,α), let M(β,α) be the set of the information matrices generated as ξ ranges over the class of all set of probability distribution on X. Then M(β,α) is the convex hull of {m(x,β,α): xX}.

Let us now recall the definition of two derivatives that will play an important role in our theory.

Definition A ((Gâteaux and Fréchet derivative)).

Considering two elements M1 and M2 in M, the Gâteaux derivative of φ at M1 in the direction of M2 is:

Gφ(M1,M2)=limε0+1ε{φ(M1+εM2)φ(M1)},

the Fréchet derivative of φ at M1 in the direction of M2 is:

Fφ(M1,M2)=limε0+1ε{φ{(1ε)M1+εM2}φ(M1)}.

The following are the properties of the derivatives that we defined before: the concavity of φ implies that

1ε[φ{(1ε)M1+εM2}φ(M1)]

is a non-increasing function of ϵ in 0<ε1. Hence when φ is concave, Fφ(M1,M2) exists if we allow the value +.

It is clear that if we put ε=1 in the previous equation, we obtain: Fφ(M1,M2)φ(M2)+φ(M1).

According to the definitions of Fréchet and Gâteaux derivatives, we can stress the following relationship between them: Fφ(M1,M2)=Gφ(M1,M2M1). Then, if we assume the differentiability of φ it is clear that for scalars ai

FφM1,aiMi=aiFφ(M1,Mi).

Theorem A.2.

Suppose to have a fixed parameters vector (β¯,α¯), a concave function φ on M(β¯,α¯) which is also differentiable at all points of M(β¯,α¯) where φ(M)<, so where a φ optimal measure exists.

Then the following are equivalent:

  • ξ is φ-optimal;

  • Fφ(M(ξ,β¯,α¯),M(ξ,β¯,α¯))0, ξΞ;

  • Fφ(M(ξ,β¯,α¯),m(x,β¯,α¯))0, xX;

  • maxxXFφ(M(ξ,β¯,α¯),m(x,β¯,α¯))=minξΞmaxxXFφ(M(ξ,β¯,α¯),m(x,β¯,α¯)).

Proof.

Let us prove the theorem by double implications.

  1. (i)(ii) ξ is φ-optimal. This means that φ(M(ξ,β¯,α¯)) is maximal.

    For the properties of the function φ, the following relation holds:
    φ{(1ε)M(ξ,β¯,α¯)+εM(ξ,β¯,α¯)}φ{M(ξ,β¯,α¯)}0
    for ε[0,1] and all ξΞ.
    For all the elements of M(β¯,α¯) holds that
    (1ε)M(ξ,β¯,α¯)+εM(ξ,β¯,α¯)=M{(1ε)ξ+εξ}
    and this means, from the definition of the Fréchet derivative, that
    Fφ{M(ξ,β¯,α¯),M(ξ,β¯,α¯)}0
    for all ξΞ.
  2. (ii)(iii) Since m(x,β¯,α¯) are elements of the convex hull M(β¯,α¯), the condition (iii) follows directly from the hypothesis.

  3. (iii)(iv) For a particular control x and the elemental informations m, since Fφ{M(ξ,β¯,α¯),M(ξ,β¯,α¯)}=0, it must be that:
    maxxXFφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}0
    But, according to the hypothesis, we have that for the design ξ
    maxxXFφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}0.
    Hence
    maxxXFφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}=0=minξmaxxXFφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}.
  4. (iv)(i) Suppose now that ξ satisfies the hypothesis, then
    maxxXFφ{M(ξ,β¯,α¯),m(x,β¯,α¯))}=0,
    that means that Fφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}0, xX. According to the definition of the matrices MM, any M can be written as M(ξ,β¯,α¯)=i=nrwim(xi,β¯,α¯), where i=nrwi=1 and wi>0 for every i=1,,n.
    Then, since φ is differentiable at M(ξ,β¯,α¯), it holds that:
    Fφ{M(ξ,β¯,α¯),M(ξ,β¯,α¯)}=i=1nλiFφ{M(ξ,β¯,α¯),m(x,β¯,α¯)}0
    for every ξΞ.
    This means, clearly, that
    φ(M(ξ,β¯,α¯))φ(M(ξ,β¯,α¯))0
    for every ξΞ, then ξ is φ-optimal.

D-optimality

Let consider now as design criterion the following function:

φ(M)=logdetMif M is non-singularotherwise

A design that maximizes such a φ function is called D-optimal design.

In the case of D-optimality the Fréchet and the Gâteaux derivatives have the following expression:

Gâteaux derivative

logdet(M1+εM2)logdetM1=logdet(I+εM2M11)=log{1+εtr(M2M11)}+O(ε2)=εtr(M2M11)+O(ε2)

Hence, Gφ(M1,M2)=tr(M2M11).

Fréchet derivative

Fφ(M1,M2)=Gφ(M1,M2M1)=tr((M2M1)M11)=tr(M2M11I(k+l))=tr(M2M11)(k+l)

where (k+l) is the number of the model parameters.

We are ready now to give an equivalence theorem which holds in the particular case of the D-criterion.

Theorem A.3.

For a fixed parameters vector (β¯,α¯), the following properties are equivalent:

  • ξ is D-optimal;

  • tr(M(ξ,β¯,α¯)1m(x,β¯,α¯))(k+l), xX;

  • ξ minimize maxxXtr(M(ξ,β¯,α¯)1m(x,β¯,α¯)), over all ξΞ.

Proof.

The proof comes directly from the Theorem A.1 by imputing the Fréchet derivative for the D-criterion.

Funding Statement

This work has been supported by the project ANR-2011-IS01-001-01 ‘DESIRE’ and Austrian Science Fund (FWF) I 833-N18.

Disclosure statement

No potential conflict of interest was reported by the authors.

ORCID

W.G. Müller http://orcid.org/0000-0002-3564-766X

References

  1. Valdez EA. Understanding relationships using copulas. N Am Actuar J. 1998;2(1):1–25. [Google Scholar]
  2. Trivedi PK, Zimmer DM. Copula modeling: an introduction for practitioners. Found Trends Econ. 2006;1(1):1–111. [Google Scholar]
  3. Nikoloulopoulos AK, Karlis D. Multivariate logit copula model with an application to dental data. Stat Med. 2008;27(30):6393–6406. doi: 10.1002/sim.3449. [DOI] [PubMed] [Google Scholar]
  4. Danaher PJ, Smith MS. Modeling multivariate distributions using copulas: applications in marketing. Mark Sci. 2011;30(1):4–21. [Google Scholar]
  5. Wadsworth JL, Tawn JA. Dependence modelling for spatial extremes. Biometrika. 2012;99(2):253–272. [Google Scholar]
  6. Patton AJ. A review of copula models for economic time series. J Multivariate Anal. 2012;110:4–18. [Google Scholar]
  7. McHale I, Scarf P. Modelling the dependence of goals scored by opposing teams in international soccer matches. Stat Model. 2011;11(3):219–236. [Google Scholar]
  8. Cherubini U, Luciano E, Vecchiato W. Copula methods in finance. Chichester: Wiley; 2004. [Google Scholar]
  9. Li J, Bárdossy A, Guenni L, Liu M. A copula based observation network design approach. Environ Model Softw. 2011;26(11):1349–1357. [Google Scholar]
  10. Pilz J, Kazianka H, Spöck G. Some advances in Bayesian spatial prediction and sampling design. Spat Statist. 2012;1:65–81. [Google Scholar]
  11. Schmidt R, Faldum A, Witt O, Gerß J. Adaptive designs with arbitrary dependence structure. Biom J. 2014;56(1):86–106. doi: 10.1002/bimj.201200234. [DOI] [PubMed] [Google Scholar]
  12. Denman NG, McGree JM, Eccleston JA, Duffull SB. Design of experiments for bivariate binary responses modelled by Copula functions. Comput Statist Data Anal. 2011;55(4):1509–1520. [Google Scholar]
  13. Kiefer J, Wolfowitz J. The equivalence of two extremum problems. Canad J Math. 1960;12:363–366. [Google Scholar]
  14. Sklar A. Fonctions de répartition à n dimensions et leurs marges. Publications de l'Institut de Statistique de Paris. 1959;8:229–231. [Google Scholar]
  15. Atkinson AC, Fedorov VV, Herzberg AM, Zhang R. Elemental information matrices and optimal experimental design for generalized regression models. J Statist Plann Inference. 2014;144:81–91. [Google Scholar]
  16. Silvey SD. Optimal design (science paperbacks) London: Chapman & Hall; 1980. [Google Scholar]
  17. Heise MA, Myers RH. Optimal designs for bivariate logistic regression. Biometrics. 1996;52(2):613–624. [Google Scholar]
  18. Fedorov VV. The design of experiments in the multiresponse case. Theory Probab Appl. 1971;16(2):323–332. [Google Scholar]
  19. Wynn HP. The sequential generation of D-optimum experimental designs. Ann Math Statist. 1970;41(5):1655–1664. [Google Scholar]
  20. Perrone E. A study on robustness in the optimal design of experiments for copula models. In: Steland A, Rafajłowicz E, Szajowski K, editors. Stochastic models, statistics and their applications. Springer Proceedings in Mathematics & Statistics, Vol. 122. Wrocław, Poland: Springer International Publishing; 2015. p. 335–342.
  21. Nelsen RB. An introduction to copulas (Springer series in statistics) 2nd ed. New York: Springer; 2007. [Google Scholar]
  22. Durante F, Sempi C. Copula theory: an introduction. In: Bickel P, Diggle P, Fienberg S, Gather U, Olkin I, Zeger S, Jaworski P, Durante F, Härdle WK, Rychlik T, editors. Copula theory and its applications. Lecture Notes in Statistics, chapter 1, Vol. 198. Berlin, Heidelberg: Springer; 2010. p. 3–31.
  23. Michiels F, De Schepper A. A copula test space model: how to avoid the wrong copula choice. Kybernetika. 2008;44(6):864–878. [Google Scholar]
  24. Meyer C. The bivariate normal copula. Comm Statist Theory Methods. 2013;42(13):2402–2422. [Google Scholar]
  25. Krafft O, Schaefer M. D-optimal designs for a multivariate regression model. J Multivariate Anal. 1992;42(1):130–140. [Google Scholar]
  26. Dragalin V, Fedorov V. Adaptive designs for dose-finding based on efficacy–toxicity response. J Statist Plann Inference. 2006;136(6):1800–1823. [Google Scholar]
  27. Schepsmeier U, Stöber J. Derivatives and Fisher information of bivariate copulas. Stat. Papers. 2014;55(2):525–542. [Google Scholar]
  28. Noh H, Ghouch AE, Bouezmarni T. Copula-based regression estimation and inference. J Amer Statist Assoc. 2013;108(502):676–688. [Google Scholar]
  29. Oakes D, Ritz J. Regression in a bivariate copula model. Biometrika. 2000;87(2):345–352. [Google Scholar]
  30. de Leon AR, Wu B. Copula-based regression models for a bivariate mixed discrete and continuous outcome. Stat Med. 2011;30(2):175–185. doi: 10.1002/sim.4087. [DOI] [PubMed] [Google Scholar]
  31. Klement EP, Mesiar R. How non-symmetric can a copula be? Comment Math Univ Carolin. 2006;47(1):141–148. [Google Scholar]

Articles from Statistics are provided here courtesy of Taylor & Francis

RESOURCES