Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Apr 28.
Published in final edited form as: J Econom. 2010 Jul 1;157(2):129–142. doi: 10.1016/j.jeconom.2009.10.021

Estimation and model selection of semiparametric multivariate survival functions under general censorship

Xiaohong Chen a,*, Yanqin Fan b, Demian Pouzo c, Zhiliang Ying d
PMCID: PMC4002182  NIHMSID: NIHMS491941  PMID: 24790286

Abstract

We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root-n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided.

Keywords: Multivariate survival models, Misspecified copulas, Penalized pseudo-likelihood ratio, Fixed or random censoring, Kaplan–Meier estimator

1. Introduction

Economic, financial, and medical multivariate survival data are typically non-normally distributed and exhibit nonlinear dependence among their component variables. A class of semiparametric multivariate survival models that has proven to be useful in modeling such data is the class of semiparametric copula-based multivariate survival functions in which the marginal survival functions are nonparametric, but the copula functions characterizing the dependence structure between the component variables are parameterized. More specifically, let X = (X1, …, Xd)′ be the survival variables of interest with a d-variate joint survival function: Fo(x1, …, xd) = P(X1 > x1, …, Xd > xd) and marginal survival functions Fj0()(j=1,,d). Assume that Fj0(j=1,,d) are continuous. A straightforward application of Sklar's (1959) theorem shows that there exists a unique d-variate copula function Co such that F0(x1,,xd)C0(F10(x1),,Fd0(xd)), where the copula Co(·) : [0, 1]d → [0, 1] is itself a multivariate probability distribution function; it captures the dependence structure among the component variables X1, …, Xd. This decomposition of the joint survival function leads naturally to the class of semiparametric multivariate survival functions in which the marginal survival functions are unspecified, but the copula function is parameterized: Co(u1, …, ud) = Co(u1, …, ud; αo) for some parametric copula function Co(u1, …, ud; α) and some value α0A. As a multivariate survival function in this class depends on nonparametric functions of only one dimension, it achieves dimension reduction while maintaining a more flexible form than purely parametric survival functions. This class of semiparametric multivariate survival functions has been used widely in survival analysis, where modeling and estimating the dependence structure between survival variables is of importance. See Joe (1997), Nelsen (1999), Oakes (1989, 1994), Frees and Valdez (1998) and Li (2000) for examples of such applications.

A semiparametric copula-based multivariate survival model has two sets of unknown parameters: the unknown marginal survival functions Fj0,j=1,,d, and the copula parameter αo of the parametric copula function Co(u1, …, ud; αo). For complete data (i.e., data without censoring or truncation), Oakes (1994) and Genest et al. (1995) propose a two-step estimation procedure: in first step the marginal distribution functions 1Fj0,j=1,,d are estimated by the rescaled empirical distribution functions, in the second step the copula parameter αo is estimated by maximizing the estimated log-likelihood function. For randomly right censored data, Shih and Louis (1995) independently propose the same two-step procedure, except that the Kaplan–Meier estimators of marginal survival functions are used in the first step. For a random sample of size n, Genest et al. (1995) establish the root-n consistency and asymptotic normality of their two-step estimator of αo. For randomly right censored data, Shih and Louis (1995) derive similar large sample properties of their two-step estimator of αo under the assumption of bounded partial derivatives of score functions. Unfortunately, this assumption is violated by many commonly used copulas including the Gaussian copula, the Student's t copula, Clayton copula and Gumbel copula. In addition, Shih and Louis (1995) assume that the censoring scheme is i.i.d. random and the parametric copula function is correctly specified.

A closely related important issue in applying this class of semiparametric survival functions to a given data set is how to choose an appropriate parametric copula, as different parametric copulas lead to survival functions that may have very different dependence properties. A number of existing papers has attempted to address this issue. For complete data, we refer to Chen and Fan (2005, 2006a) for a detailed discussion of existing approaches and references. For bivariate censored data, existing work include Frees and Valdez (1998), Klugman and Parsa (1999), Wang and Wells (2000), Chen and Fan (2007), and Denuit et al. (2006). Frees and Valdez (1998) and Klugman and Parsa (1999) consider fully parametric models of bivariate distribution (or survival) functions, and they address model selection of parametric copulas and parametric marginals for insurance company data on losses and allocated loss adjustment expenses (ALAEs). The particular data set they use was collected by the US Insurance Services Office in which loss is censored by a fixed censoring mechanism and ALAE is not censored. Using various model selection techniques including AIC/BIC, Frees and Valdez (1998) select the Pareto marginal distributions and the Gumbel copula, while Klugman and Parsa (1999) select inverse paralogistic for loss marginal distribution, inverse Burr for ALAE marginal distribution and the Frank copula. Wang and Wells (2000), Denuit et al. (2006) and Chen and Fan (2007) consider model selection of semiparametric bivariate distribution (or survival) functions in which they do not specify marginals, but restrict the parametric copulas to be in the Archimedean family. In particular, Wang and Wells (2000) propose a model selection procedure for comparing copulas in the one-parameter Archimedean family, allowing for various censoring mechanisms, as long as a consistent nonparametric estimator for the bivariate joint distribution (or survival) function is available. Their selection procedure is based on comparing point estimates of the integrated squared difference between the true Archimedean copula and a parametric copula; the one with the smallest value of the integrated squared difference is chosen over the rest of the one-parameter Archimedean copulas. Denuit et al. (2006) apply Wang and Wells's (2000) procedure to copula model selection for the same Loss-ALAE data set studied in Frees and Valdez (1998). They use a nonparametric estimator of the bivariate distribution that takes into account the fixed censoring mechanism underlying the Loss-ALAE data. They examine four one-parameter Archimedean copulas (Gumbel, Clayton, Frank and Joe) and select Gumbel copula since it yields the smallest estimated integrated squared difference. Chen and Fan (2007) propose a model selection test for comparing multiple semiparametric bivariate survival functions by taking into account the randomness in the estimated integrated squared difference. However, their test is still only applicable to model selection of parametric copulas within the Archimedean family only. It is known that a one or two-parameter Archimedean copula family could be too restrictive to capture various dependence structures among multivariate variables. In addition, the semiparametric model selection procedures in Wang and Wells (2000), Denuit et al. (2006) and Chen and Fan (2007) require consistent nonparametric estimation of the joint distribution function and the limiting distributions are complicated. As a result, even for a parametric Archimedean copula family, these tests are difficult to implement for multivariate (higher than bivariate) data with general censorship.

In this paper we bridge the gap in existing work for estimating and selecting a semiparametric multivariate copula-based survival model by (i) allowing for data to be censored under various censoring mechanisms, (ii) using nonparametric estimation of marginal survival functions only, (iii) permitting any parametric copula specification, which may be misspecified, non-Archimedean, and its score function may have unbounded partial derivatives. For random samples without censoring, Chen and Fan (2005) already consider the Pseudo-likelihood estimation of copula parameters and Pseudo-likelihood ratio (PLR) model selection test for semiparametric multivariate copula-based distribution models, accounting for (ii) and (iii). In this paper, we extend their results to allow for general right censorship. In particular, we first establish the convergence of the two-step estimator of the copula parameter to the pseudo-true value defined as the value of the parameter that minimizes the Kullback-Leibler Information Criterion (KLIC) between the parametric copula induced multivariate density and the unknown true density. We then derive its root-n asymptotically normal distribution and provide a simple consistent asymptotic variance estimator by accounting for (i), (ii) and (iii). These results are used to establish the asymptotic distribution of the penalized PLR statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. We also propose a standardized version of the test, whose limiting null distribution is easy to simulate. To illustrate the usefulness of our testing procedure, we apply it to copula model selection for the loss-ALAE data, taking into account the underlying censoring mechanism in the data and allowing parametric copulas to exhibit more flexible dependence structures than those in the Archimedean family. We find that the standardized test is generally more powerful than the non-standardized test.

The rest of this paper is organized as follows. Section 2 introduces the model selection criterion function and the two-step estimation of the copula dependence parameter. In Section 3, we study the large sample properties of the pseudo-likelihood estimator of the copula parameter allowing for independent but general right censorship and misspecified parametric copulas. In Section 4, we present the limiting null distributions of the (penalized) PLR test statistics for model selection among multiple semiparametric copula models for multivariate censored data. Section 5 provides an empirical application to the Loss-ALAE data set and Section 6 briefly concludes. All technical proofs are gathered into the Appendix.

2. Model selection criterion and parameter estimation

To simplify notation, we shall present our results for bivariate survival models only. Obviously, all these results have straightforward extensions to multivariate copula models for survival data with any finite dimension.

In the following we shall use (D1, D2) to denote the censoring variables. Thus under the right censorship, one observes (X~1, X~2) = (X1 ^ D1, X2 ^ D2) and a pair of indicators, (δ1, δ2) = (I{X1D1}, I{X2D2}), where a ^ b = min(a, b) for real numbers a and b and I{·} is the indicator function. We assume that the censoring variables (D1, D2) are independent of the survival variables (X1, X2). Let Fj0(xj)=P(Xj>xj) denote the true but unknown marginal survival function of Xj for j = 1, 2. Suppose n independent (but possibly non-identically distributed) observations {(X~1t,X~2t,δ1t,δ2t)}t=1n are available, where (X~1t, X~2t) = (X1t ^ D1t, X2t ^ D2t) and (δ1t, δ2t) = (I{X1tD1t}, I{X2tD2t}). Denote Ut=(U1t,U2t)=(F10(X~1t),F20(X~2t)).

2.1. Model selection criterion

Let {Ci(u1,u2;αi):αiAiRpi} be a class of parametric copulas with i = 1, 2, …, M. By Sklar's (1959) theorem, each parametric copula family i corresponds to a parametric likelihood Li,n(αi)t=1ni(U1t,U2t,δ1t,δ2t;αi), where

i(u1t,u2t,δ1t,δ2t;αi)=δ1tδ2tlogci(u1t,u2t;αi)+δ1t(1δ2t)logCi(u1t,u2t;αi)u1+δ2t(1δ1t)logCi(u1t,u2t;αi)u2+(1δ1t)(1δ2t)logCi(u1t,u2t;αi),

where ci(u1,u2;αi)=2Ci(u1,u2;αi)u1u2 is the density function of copula Ci(u1, u2; αi).

In this paper, we are interested in testing whether a benchmark model (say copula model 1) performs significantly better than the rest of the copula models according to the KLIC. Let E0 denote the expectation with respect to the true probability measure. Define

αin=argαiAimaxn1t=1nE0[i(U1t,U2t,δ1t,δ2t;αi)]

as the pseudo-true value that minimizes the KLIC between the i-th parametric copula family induced multivariate density and the unknown true density. To conclude that copula model 1 performs significantly better than the rest of the copula models calls for a formal statistical test, where the null hypothesis is:

H0:i=2,,Mmaxn1t=1nE0[i(U1t,U2t,δ1t,δ2t;αin)]1(U1t,U2t,δ1t,δ2t;α1n)]0,

meaning that none of the copula models 2, …, M is closer to the true model (according to KLIC) than model 1, and the alternative hypothesis is:

H1:i=2,,Mmaxn1t=1nE0[i(U1t,U2t,δ1t,δ2t;αin)]1(U1t,U2t,δ1t,δ2t;α1n)]>0,

meaning that there exists a copula model from 2, …, M that is closer to the true model (according to KLIC) than model 1.

2.2. Two-step estimation

To construct a test statistic for the null hypothesis H0 against the alternative H1, we need estimates of (U1t,U2t)=(F10(X~1t),F20(X~2t)) and αin for i = 1, …, M.

For j = 1, 2, let F~j() be the Kaplan–Meier estimator of Fj0()=P(Xj>):

F~1(x)=X~1(t)x(11nt+1)δ1(t),
F~2(x)=X~2(t)x(11nt+1)δ2(t),

where X~j(1)X~j(2)X~j(n) are order statistics of {X~jt}t=1n for j = 1, 2, and {δj(t)}t=1n(j=1,2) are similarly defined. Then under independent censoring, F~j() is consistent for Fj0(), j = 1, 2; see e.g., Lai and Ying (1991).

Given the definition of αin, a natural estimator for it is the pseudo-likelihood estimator α^in:

α^in=argαiAimaxn1t=1ni(F~1(X~1t),F~2(X~2t),δ1t,δ2t;αi),i=1,,M.

Since this estimation procedure involves the first-step nonparametric estimation of the marginal survival functions Fj0(), j = 1, 2, the estimator α^in is also called the “two-step” estimator.

Note that no assumption is made on the censoring variables (D1t, D2t) other than their independence with the survival variables (X1t, X2t). As a result, various censoring mechanisms are allowed, including the simple random censoring, fixed censoring, and of course no censoring. If the censoring variables are fixed at Djt = +∞ for j = 1, 2, α^in becomes the estimator proposed in Genest et al. (1995). If the censoring variables (D1t, D2t) are i.i.d. with a continuous joint survival function, α^in becomes the estimator proposed in Shih and Louis (1995). Assuming that the parametric copula density ci(u1, u2; αi) is correctly specified and that log ci(u1, u2; αi) has bounded partial derivatives with respect to u1, u2, Shih and Louis (1995) establish the root-n asymptotic normality of α^in and provide a consistent estimator of its asymptotic variance for i.i.d. randomly censored data.

The censoring mechanism for the loss-ALAE data is non-random; ALAE is not censored and Loss is censored by a constant which differs from each individual to another. Results in Shih and Louis (1995) may not be directly applicable to this data set even under a correct specification of the copula function. Moreover, for model selection, we need to establish the asymptotic properties of the two-step estimator under copula misspecification. This will be done in the next section for a general censoring mechanism.

2.3. Penalized pseudo-likelihood ratio criteria

To test the null hypothesis H0 against the alternative H1, we use the PLR statistic:

LRn(F~1,F~2;α^in,α^1n)=L~i,n(α^in)L~1n(α^1n),i=2,,M,

where

L~i,n(α^in)1nt=1ni(F~1(X~1t),F~2(X~2t),δ1t,δ2t;α^in),i=1,,M.

In most applications, several parametric copula families are compared which may have different numbers of parameters. To take this into account, we follow the approach in Sin and White (1996) by adopting a general penalization of model complexity. Let Pen(pi, n) denote a penalization term such that Pen(pi, n) increases with pi dim(Ai), decreases with n, and Pen(pi, n)/n → 0. Then the penalized PLR statistic is

PLRn(F~1,F~2);α^in,α^1n)=LRn(F~1,F~2;α^in,α^1n)Pen(pi,n)Pen(p1,n)n=LRn(F~1,F~2;α^in,α^1n)+op(1).

We note that Pen(pi, n) = pi corresponds to AIC, and Pen(pi, n) = 0.5pi log n corresponds to BIC criterion.

In many existing applications of copula models, AIC has been used to compare different families of parametric copula models. To be more specific, let

AICi=2nt=1ni(F~1(X~1t),F~2(X~2t),δ1t,δ2t;α^in)+2pin,i=1,,M.

Then the values of AICi for i = 1, …, M are compared; copula model 1 will be selected if AIC1 = min{AICi : 1 ≤ iM} or equivalently if

LRn(F~1,F~2;α^in,α^1n)pip1n<0,i=2,,M. (2.1)

Noting, however, that PLRn(F~1,F~2;α^in,α^1n) (such as AICi) is a random variable, the fact that PLRn(F~1,F~2;α^in,α^1n)<0 for i = 2, …, M (or inequality (2.1) holds) for one sample {X~1t,X~2t,δ1t,δ2t}t=1n may not imply that copula model 1 performs significantly better than the rest of the models; it may occur by chance. As we will show in the next section, PLRn(F~1,F~2;α^in,α^1n)=n1t=1nE0[i(U1t,U2t,δt1,δ2t;αin)1(U1t,U2t,δ1t,δ2t;α1n)]+op(1) for i = 2, …, M. To conclude that copula model 1 performs significantly better than the rest of the models we need to perform a formal statistical test for H0 against H1.

To test H0, we have to take into account the randomness of the (penalized) PLR statistic. More precisely, we need to derive the asymptotic distributions of α^in and the test statistics under the null hypothesis. This will be accomplished in Sections 3 and 4 of this paper.

3. Asymptotic properties of the two-step estimator under copula misspecification

As mentioned in the previous section, asymptotic properties of the two-step estimator are established for randomly censored data in Shih and Louis (1995) under the assumptions that the parametric copula density correctly specifies the true copula density and that its score function has bounded partial derivatives. In this section, we will extend their results to a more general censoring mechanism and allow for misspecified parametric copulas whose score functions may have unbounded partial derivatives.

Recall that ARp is the parameter space. For α, αA we use ∥α − α*∥ to denote the usual Euclidean metric. To simplify notation, we now let

(u1,u2;α)=δ1δ2logc(u1,u2;α)+δ1(1δ2)logC(u1),u2;α)u1+δ2(1δ1)logC(u1),u2;α)u2+(1δ1)(1δ2)logC(u1,u2;α),

where c(u1, u2; α) is the density of the parametric copula C(u1, u2; α). Then the pseudo-true copula parameter value is αn=argmaxαAn1t=1nE0[(U1t,U2t;α)], and its two-step estimator is α^n=argmaxαAn1t=1n(F~1(X~1t),F~2(X~2t);α).

Finally we denote

α(u1,u2;α)=(u1,u2;α)α,
j(u1,u2;α)=(u1,u2;α)uj,(j=1,2),
αα(u1,u2;α)=2(u1,u2;α)α2and
αj(u1,u2;α)=2(u1,u2;α)ujαforj=1,2.

3.1. Consistency

The following conditions are sufficient to ensure the convergence of the two-step estimator α^n to the pseudo true value αn.

  • C1
    • (i)
      The sequence of survival variables, {(X1t,X2t)}t=1n is an i.i.d. sample from an unknown survival function Fo(x1, x2) with continuous marginal survival functions Fj0(),j=1,2;
    • (ii)
      The sequence of censoring variables {D1t,D2t}t=1n is an independent sample with joint survival functions {Gt(x1,x2)}t=1n={P(D1t>x1,D2t>x2)}t=1n and marginal survival functions {Gjt()}t=1n,j=1,2;
    • (iii)
      The censoring variables (D1t, D2t) are independent of survival variables (X1t, X2t) and there is no mass concentration at 0 in the sense that lim supnn1t=1n(1Gjt(η))0 as η → 0.
  • C2
    Let A be a compact subset of Rp. For every ∊ > 0,
    αA:ααnliminf1nt=1n[E0{(U1t,U2t;αn}]E0{(U1t,U2t;α)}]>0.
  • C3

    The true (unknown) copula function Co(u1, u2) has continuous partial derivatives.

  • C4
    • (i)
      For any (u1, u2) ∈ (0, 1)2, ℓ(u1, u2; α) is a continuous function of αA
    • (ii)
      Let Lt=supαA(U1t,U2t;α) and Ltα=supαAα(U1t,U2t;α) Then,
      Klimnlimsupn1t=1nE0{LtI(LtK)+LtαI(LtαK)}=0;
    • (iii)
      For any η > 0, ∊ > 0, there is K > 0 such that (u1,u2;α)K(u1,u2;α) for all αA and all uj ∈ [η, 1) such that 1uj(1uj),j=1,2
  • C5

    If {Xjt}t=1n are subject to non-trivial censoring (i.e., Djt ≠ ∞), then F~j is truncated at the tail in the sense that for some τj,F~j(xj)=F~j(τj) for all xj ≥ τj and lim inf n−1 t=1nGjt(τj)F0(τj)>0.

Note that in contrast to the censoring mechanism in Shih and Louis (1995) Condition C1(ii) allows the censoring variables {(D1t,D2t)}t=1n to be non-identically distributed. In addition, no assumption is made on the joint survival function Gt(x1, x2) of the censoring variables (D1t, D2t). Hence Condition C1(ii) includes the fixed censoring mechanism in which each survival variable (X1t, X2t) is censored at a pre-specified, fixed time (D1t, D2t) which may differ from one observation to another, in which case, the survival function Gt(x1, x2) is degenerate at (D1t, D2t). It also allows the variables X1t and X2t to have different censoring mechanisms, one random and the other fixed or one censored and the other uncensored. For example, the censoring mechanism for the Loss-ALAE data is such that Loss is censored by a fixed censoring mechanism and ALAE is uncensored. As a result, the observed variables {(X~1t,X~2t)}t=1n may not be identically distributed and the identifiably unique maximizer αn defined in Condition C2 may depend on n. Condition C5 is imposed to handle the possible tail instability of the Kaplan–Meier estimator, especially for non-identically distributed censoring times. The truncation can be achieved by simply using Djt ∧ τj as the censoring variables. Thus, without loss of generality, we shall assume that Djt ∧ τj are the censoring variables so that X~jtτj. The simple truncation at τj can be changed to the more elaborate tail modification. We refer to Lai and Ying (1991) for the issue of tail instability and modification. Finally, because we allow the left tail of the copula to blow up as well, we shall set (F~1(X~1t),F~2(X~2t);α)=0 whenever F~j(X~jt)=1 for j = 1 or 2.

Proposition 3.1. Under conditions C1–C5, we have:

  • (1)
    α^nαnop(1);
  • (2)
    1nt=1n(F~1(X~1t),F~2(X~2t);α^n)=1nt=1nE0{(F10(X~1t),F20(X~2t);αn)}+op(1).

Proposition 3.1(1) states that the two-step estimator α^n is a consistent estimator of the pseudo true value αn. If the censoring mechanism is random, then αn=α which does not depend on n. In addition, if the parametric copula correctly specifies the true copula, then α=α0, where αo is such that C(u1, u2; αo) = Co(u1, u2) for almost all (u1, u2) ∈ (0, 1)2.

3.2. Asymptotic normality

Recall that (U1t,U2t)=(F10(X~1t),F20(X~2t)). For j = 1, 2, we denote

Wj(X~jt,δjt;αn)E0[αj(U1s,U2s;αn)Ijo(X~jt,δjt)(X~js)X~jt,δjt],
Ijo(X~jt,δjt)(X~js)Fjo(X~js)[X~jsdnjt(u)Pn,j(u)X~jsI{X~jtu}dΛj(u)Pn,j(u)],

with Λj(u)log(Fj0(u)) the cumulative hazard function of Xj, Njt(u)δjtI{X~jtu} and dNjt (u) = Njt (u) − Njt (u−), and Pn,j(u)n1k=1nP(X~jku)=n1k=1nGjk(u)Fj0(u).

Let Var0 denote the variance with respect to the true probability measure. The following conditions are sufficient to ensure the asymptotic normality of α^n.

  • A1
    • (i)
      C2 holds with αnint(A*) for all n, where A* is a compact subset of A;
    • (ii)
      Bnn1t=1nE0{αα(U1t,U2t;αn)} has all its eigenvalues bounded below and above by some finite positive constants;
    • (iii)
      Σnn1t=1nVar0{α(U1t,U2t;αn)+W1(X~1t,δ1t;αn)+W2(X~2t,δ2t;αn)} has all its eigenvalues bounded below and above by some finite positive constants;
    • (iv)
      {α(U1t,U2t;αn)+W1(X~1t,δ1t;αn)+W2(X~2t,δ2t;αn)}t=1n satisfies Lindeberg condition.
  • A2

    Functions ℓαα(u1, u2; α) and ℓαj(u1, u2; α), j = 1, 2, are well-defined and continuous in (u1,u2,α)(0,1)2×A.

  • A3
    • (i)
      α(u1,u2;αn)q{u1(1u1)}a1{u2(1u2)}a2 for some q > 0 and aj ≥ 0 such that lim supn1t=1nE0[{U1t(1U1t)}1a1{U2t(1U2t)}2a2]<;
    • (ii)
      αj(u1,u2;αn)const.{uj(1uj)}bj{uk(1uk)}ak for some bk, ak and jk such that lim supn1t=1nE0[{Ujt(1Ujt)}ξjbj{Ukt(1Ukt)}ak]< for some ξj ∈ (0, 1/2).
  • A4
    • (i)
      Let Ltαj=supαAαj(U1t,U2t;α) and Ltαα=supαAαα(U1t,U2t;α). Then,
      Klimnlimsupn1t=1nE0{LtαjI(LtαjK)+LtααI(LtααK)}=0;
    • (ii)
      For any η > 0 and any ∊ > 0, there is K > 0, such that α(u1,u2;α)+αα(u1,u2;α)K{α(u1,u2;α)+αα(u1,u2;α)} for all αA and all uj ∈ [η, 1) such that 1uj(1uj), j = 1, 2.

Shih and Louis (1995) require bounded α(u1,u2;αn) and αj(u1,u2;αn) for j = 1, 2, however, this requirement is not satisfied by many popular copula functions such as Gaussian copula, t-copula, Gumbel copula and Clayton copula. Conditions A3 and A4 relax the boundedness requirement, and allow the score function and its partial derivatives with respect to the first two arguments to blow up at the boundaries. Similar conditions have been verified for Gaussian, Frank and Clayton copulas in Chen and Fan (2006b).

Proposition 3.2. Under conditions C1–C5 and A1–A4, we have: BnΣn12n(α^nαn)N(0,Ip) in distribution, where Bn and Σn are defined A1.

Proposition 3.2 extends Theorem 2 in Shih and Louis (1995) in two directions: (i) it allows for more general censoring mechanisms than the simple random censoring in Shih and Louis (1995), and (ii) it allows for the possibility that the parametric copula may not specify the true copula correctly. As a result, there are several differences between Proposition 3.2 and Theorem 2 in Shih and Louis (1995): First, since the censoring variables {D1t,D2t}t=1n may not be identically distributed, Bn and Σn may depend on n; Second, since the parametric copula may misspecify the true copula, the information matrix equality may not hold. Consequently, the asymptotic variance of n(α^nαn), Bn1ΣnBn1, can not be reduced to [Bn1+n1Bn1t=1nVar0{W1(X~1t,δ1t;αn)+W2(X~2t,δ2t;αn)}Bn1] as in Shih and Louis (1995). For complete data, Proposition 3.2 reduces to that in Chen and Fan (2005).

To estimate the asymptotic variance Bn1ΣnBn1 of n(α^nαn), we let

B^n=n1t=1nαα(F~1(X~1t),(F~2(X~2t);α^n),
Σ^n=1nt=1n{α(F~1(X~1t),F~2(X~2t);α^n)+W^1(X~1t,δ1t;α^n)+W^2(X~2t,δ2t;α^n)}{α(F~1(X~1t),F~2(X~2t);α^n);+W^1(X~1t,δ1t;α^n+W^2(X~2t,δ2t;α^n),},

with

W^1(X~1t,δ1t;α^n)=1nst,s=1nα1(F~1(X~1s),F~2(X~2s);α^n)I^10(X~1t,δ1t)(X~1s),
W^2(X~2t,δ2;α^n)=1nst,s=1nα2(F~1(X~1s),F~2(X~2s);α^n)I^20(X~2t,δ2t)(X~2s),

in which for j = 1, 2,

I^jo(X^jt,δjt)(X~js)=F~j(X~js)[I{X~jtX~js}δjtn1k=1nI{X~jkX~jt}1nk=1nI{X~jsX~jl}I{X~jtX~jl}δjl[n1k=1nI{X~jkX~jl}]2]. (3.1)

We note that an alternative expression for I^j0(X~jt,δjt)(X~js) is:

I^jo(X~jt,δjt)(X~js)=F~j(X~js)[I{X~jtX~js,δjt=1}P^n,j(X~jt)X~jlX~jsI{X~jtX~jl}ΔΔ^j(X~jl)P^n,j(X~jl)],

where P^n,j(u)n1k=1nI{X~jku},

ΔΔ^j(u)=I{Y¯j(u)>0}Y¯j(u)dNj(u),Y¯j(u)=k=1nI{X~jku},
Nj(u)=k=1nNjk(u),

in which ΔΛ^j(u) is so-called Nelson's estimator. This is because

X~jlX~jsI{X~jtX~jl}ΔΛ^j(X~jl)P^n,j(X~jl)=X~jlX~jsI{X~jtX~jl}δjlP^n,j(X~jl)k=1nI{X~jkX~jl}=1nl=1nI{X~jsX~jl}I{X~jtX~jl}δjl[n1k=1nI{X~jkX~jl}]2.

By the consistency of the Kaplan-Meier estimators and α^n, and by applying the law of large numbers to independent observations, we can the following result, which provides a consistent variance estimator.

Proposition 3.3. Under conditions C1C5 and A1A4, the asymptotic variance of n12α^n can be consistently estimated by B^nΣ^nB^n, where B^n is the generalized inverse of B^n.

4. Pseudo-likelihood ratio test for model comparison

By applying Proposition 3.1(2) we immediately obtain the probability limit of the PLR statistic.

Proposition 4.1. Suppose for i = 1, …, M, the copula model i satisfies the conditions of Proposition 3.1. Then

LRn(F~1,F~2;α^in,α^1n)=1nt=1nE0{i(U1t,U2t;αin)1(U1t,U2t;α1n)}+op(1),

where Ujt=Fj0(X~jt) for j = 1, 2.

In the following, we adopt the convention that all the notations involving the copula function C(u1, u2; α) introduced in Section 3 are now indexed by a subscript i for i = 1, …, M to make explicit their dependence on the parametric copula model i. In addition, we define Ut=(U1t,U2t)=(F10(X~1t),F20(X~2t)),

et=(e2t,,eMt),
eit{i(Ut;αin)1(Ut;α1n)}+j=12{Qi,j(X~jt,δjt;αin)Q1,j(X~jt,δjt;α1n)},

where for i = 1, …, M and for j = 1, 2,

Qi,j(X~jt,δjt;αin)E0[i,j(U1s,U2s;αin)Ijo(X~jt,δjt)(X~js)X~jt,δjt].

It is easy to see that 1nt=1n{etE0(et)} has the same asymptotic distribution as a multivariate normal random variable with mean zero and variance Ωn, where

Ωn=1nt=1nE0[(etE0{et})(etE0{et})]=(σik)i,k=2M,
σik=1nt=1nE0[(eitE0{eit})(ektE0{ekt})].

It is easy to compute a consistent estimator Ω^n for Ωn:

Ω^n=1nt=1n[(e^t1ns=1ne^s)(e^t1ns=1ne^s)]=(σ^ik)i,k=2M,
σ^ik=1nt=1n[(e^it1ns=1ne^is)(e^kt1ns=1ne^ks)], (4.1)

where e^t=(e^2t,,e^Mt) and for i = 2, …, M,

e^it{i(F~1(X~1t),F~2(X~2t);α^in)1(F~1(X~1t),F~2(X~2t);α^1n)}+j=12{Q^i,j(X~jt,δjt;α^in)Q^1,j(X~jt,δjt;α^1n)},

in which

Q^i,j(X~jt,δjt;α^in)=1nst,s=1ni,j(F~1(X~1s),F~2(X~2s);α^in)I^jo(X~jt,δjt)(X~js),

for i = 1, …, M and j = 1, 2 with I^j0(X~jt,δjt)(X~js) given in (3.1).

Before we present the test statistics, we recall the following definition from Chen and Fan (2005): For model i ∈ {2, …, M},

Models 1 and i are generalized non-nested if the set {(v1,v2):c1(v1,v2;αin)ci(v1,v2;αin)} has positive Lebesgue measure;

Models 1 and i are generalized nested if c1(v1,v2;α1n)=ci(v1,v2;αin) for almost all (v1, v2) ∈ (0, 1)2.

Given the definition of the pseudo true value αin, the closest ci(;αin) to the true copula c0 (according to KLIC) in a parametric class of copulas {ci(;αi):αiAi} depends on the true (but unknown) copula. Hence it is not obvious a priori whether two parametric classes of copulas are generalized non-nested or generalized nested.

Remark 4.1. Define

σiia1nt=1nVar0[i(Ut;αin)1(Ut;α1n)].

It is obvious that if models 1 and i are generalized nested, then i(U1t,U2t;αin)=1(U1t,U2t;α1n) almost surely, eit = 0 almost surely, and σiia=0. Following the proof of Proposition 3 in Chen and Fan (2005), we can show that if σii0=0 then models 1 and i are generalized nested, and σii = 0. Therefore it is easy to test whether the models 1 and i are generalized nested by testing σiia=0, which may be done by using its consistent estimator:

σ^iia=1nt=1n[{i(F~1(X~1t),F~2(X~2t);α^in)1(F~1(X~1t),F~2(X~2t);α^1n)}LRn(F~1,F~2;α^in,α^1n)]2.

See Chen and Fan (2005) for details.

The following proposition provides the basis for our tests. Note that we allow for some but not all of the candidate models i ∈ {2, …, M} to be generalized nested with the benchmark model 1.

Proposition 4.2. For i = 1, 2, …, M, assume that the copula model i satisfies conditions of Proposition 3.2 and that {eit : t = 1, …, n} satisfies Lindeberg condition. If Ωn=(σii)i,k=2M is finite and its largest eigenvalue is positive uniformly in n, then:

  • (1)
    n12[LRn(F~1,F~2;α^in,α^1n)n1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]]i=2,,M=1nt=1n{etE0(et)}+op(1),
    (Z2,,ZM)indistribution,with(Z2,,ZM)N(0,Ωn).
  • (2)
    Ω^n=Ωn+op(1).

Proposition 4.2 and the continuous mapping theorem imply

maxi=2,,Mn12{LRn(F~1,F~2;α^in,α^1n)n1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]}maxi=2,,MZi.

Define

Tnmaxi=2,,M[n12LRn(F~1,F~2;α^in,α^1n)].

Proposition 4.2 implies that under the Least Favorable Configuration (LFC), i.e.,

n1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=0fori=2,,M,

Tn → maxi=2,…,M Zi in distribution. This allows us to construct a test for H0. Suppose the largest eigenvalue of Ωn is positive uniformly in n, then we will reject H0 if Tn > Zα, where Zα is the upper α-percentile of the distribution of maxi=2,…,M Zi.

The asymptotic power properties of this test against fixed alternatives and Pitman local alternatives follow immediately from Proposition 4.2 and are summarized in the following proposition.

Proposition 4.3. Suppose all conditions of Proposition 4.2 are satisfied. Then the test based on Tn is consistent against fixed alternatives of the form H1 and has non-trivial power against local alternatives satisfying

maxi=2,,Mlimn{n12t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]}>0.

Note that if the censoring mechanism is random, then the local alternatives in Proposition 4.3 can be written in the more familiar form:

maxi=2,,ME0[i(U1t,U2t;αi)1(U1t,U2t;α1)]=K1n,

for a positive constant K.

In general, the distribution of maxi=2,…,M Zi is unknown, since the asymptotic variance Ωn of (Z2, #x2026;, ZM) depends on α1n,,αMn. Following White (2000), one can use either “Monte-Carlo RC” p-value or “bootstrap RC” p-value to implement this test. As noted in Chen and Fan (2005), Hansen (2003), and Romano and Wolf (2005), the finite sample power of this test may be improved by standardization. In our empirical application, we have computed both “Monte-Carlo RC” p-value using

TnS=maxi=2,,M{n12LRn(F~1,F~2;α^in,α^1n)σ^iiGb(σ^ii)},

and “bootstrap RC” p-value based on

TnI=max[maxi=2,,M{n12LRn(F~1,F~2;α^in,α^1n)σ^iiGb(σ^ii)},0],

where σ^ii is a consistent estimator of σii such as the one given in (4.1), b = bn → 0 as n → ∞, and Gb(·) is a is smoothed trimming which trims out small σ^ii. The particular trimming function being used in our empirical study is

Gb(x)=xgb(z)dz={0,x<bxgb(z)dz,bx2b1,x>2b}

where gb(χ) = (b−1g(b−1 χ−1) and g(z) = B(a+1)−1za(1−z)a,z ∈ [0, 1] for some positive integer a ≥ 1, where B(a) = Γ(a)2/Γ(2a) is the beta function and Γ(a) is the Euler gamma function.

We note that the standardized tests TnS and TnI proposed here allow that some candidate models are generalized nested with the benchmark model, since the trimming Gb(σ^ii) in TnS and TnI removes the effect of generalized nested models (with the benchmark model) on its limiting distribution. By a minor modification of the proof of Theorem 7 in Chen and Fan (2005), we immediately obtain the following result:

Proposition 4.4. Suppose all conditions of Proposition 4.2 are satisfied. If b → 0 and nb → ∞, then under the null hypothesis H0, the limiting distribution of TnI is given by that of maxiSNB(Ziσii,0), where

SNB={i{2,,M}:σii>0andn1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=0}.

Proposition 4.4 implies that the asymptotic null distribution of TnI depends on models that are generalized non-nested with the benchmark and satisfy

n1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=0,

and hence is unknown. We propose the following bootstrap procedure to approximate the asymptotic null distribution of TnI:

  • Step1

    Generate a bootstrap sample by random draws with replacement from a consistent nonparametric estimator of the unknown joint distribution of (X1t, X2t) that takes into account the censoring scheme. Denote (F~1, F~2, α~in, α~1n) as the bootstrap analogs of (F~1, F~2, α~in, α~1n).

  • Step2

    Compute the bootstrap value TinLRn(F~1,F~2;α~in,α~1n) of TinLRn(F~1,F~2;α~in,α~1n), i = 2, …, M, and define its recentered value as TinC=TinTinI(Tinan), where an → 0 is a small positive (possibly random) number such that nan.

  • Step3
    Compute the bootstrap value of TnI as
    TnI=max2iM{nTinCσ^iiGb(σ^ii),0};
  • Step4

    Repeat Steps 1–3 for a large number of times and use the empirical distribution function of the resulting values TnI to approximate the null distribution of TnI.

We note that the above bootstrap procedure is very similar to that proposed in Chen and Fan (2005), except that in Step 1 we generate bootstrap samples from a consistent nonparametric estimator of the joint distribution that takes account of the censoring. For example, for bivariate random right censoring, we could sample from the bivariate Kaplan–Meier estimator; see Dabrowska (1989). See Davison and Hinkley (1997, page 85) for additional ways to generate bootstrap sample for censored data. The consistency of this standardized bootstrap RC test TnIb could be established by a minor modification of the proof of Theorem 8 in Chen and Fan (2005).

Remark 4.2. Recall that

PLRn(F~1,F~2;α^in,α^1n)=LRn(F~1,F~2;α^in,α^1n)Pen(pi,n)Pen(p1,n)n.

If maxi=2,,M[Pen(pi,n)Pen(p1,n)]n0 (which is automatically satisfied with AIC and BIC), then

PLRn(F~1,F~2;α^in,α^1n)=LRn(F~1,F~2;α^in,α^1n)+op(n12)fori=2,,M.

Therefore, penalization could be incorporated in the tests. Define

TnP=maxi=2,,M[n12PLRn(F~1,F~2;α^in,α^1n)],
TnSP=maxi=2,,M{n12PLRn(F~1,F~2;α^in,α^1n)σ^iiGb(σ^ii)},

and

TnIP=max[maxi=2,,M{n12PLRn(F~1,F~2;α^in,α^1n)σ^iiGb(σ^ii)},0]. (4.2)

Then we can conduct the test using TnP (or TnSP or TnIP) instead of Tn (or TnS or TnI).

5. An empirical application

In this section, we illustrate our testing procedure for the selection of multiple copula-based survival functions by using insurance company data on losses and ALAEs. The particular data set we use was collected by the US Insurance Services Office and has been analyzed in some detail in Frees and Valdez (1998), Klugman and Parsa (1999), and Denuit et al. (2006).

Two alternative approaches have been used in the literature to model multivariate survival data; that of the multivariate distribution function and that of the multivariate survival function. It is important to realize that in the context of semiparametric copula-based models, the copula in a semiparametric copula-based distribution function corresponds to its survival copula in the corresponding semiparametric survival function. To be specific, consider the bivariate case. Let (X1, X2) be the survival variables of interest with a joint survival function Fo(x1, x2) = Pr(X1 > x1, X2 > x2) and marginal survival functions Fj0(), j = 1, 2. Let H(x1, x2) denote the corresponding joint cumulative distribution function (cdf) with marginal distributions Hj(·), j = 1, 2. Assume that H11F10 and H21F10 are continuous. By the Sklar's (1959 theorem, there exists a unique copula function Ch such that H(x1, x2) ≡ Ch(H1(x1), H2(x2)), which in turn implies that the representation

Fo(x1,x2)Co(F1o(x1),F2o(x2)),

holds where

Co(u1,u2)u1+u21+Ch(1u1,1u2)

is itself a copula function, known as a survival copula. Hence the bivariate distribution function Ch(H1(x1), H2(x2)) and the bivariate survival function C0(F10(x1),F20(x2)), where Fj0() is the survival function of Hj(·) and Co is the survival copula of Ch represent the same model.

In Frees and Valdez (1998) and Klugman and Parsa (1999), fully parametric modeling of the joint distribution of the loss and ALAE has been examined; using various model selection techniques including AIC/BIC, Frees and Valdez (1998) select Pareto marginals and Gumbel copula, while Klugman and Parsa (1999) select inverse paralogistic for the loss, inverse Burr for ALAE and the Frank copula. Denuit et al. (2006) adopt a semiparametric distribution framework in which the marginal distributions of loss and ALAE are left unspecified, but their copula is modeled parametrically via a one-parameter Archimedean copula. Their model selection procedure is the same as that in Wang and Wells (2000) except that the joint distributions of loss and ALAE are estimated differently. They examined four one-parameter Archimedean copulas: Gumbel, Clayton, Frank and Joe, and select the same Gumbel copula as Frees and Valdez (1998). Compared with Denuit et al. (2006), we do not restrict the parametric copulas to be Archimedean. In addition, our test takes into account the randomness of the selection criterion. Chen and Fan (2005) have also studied this data set, but since their model selection test is applicable to uncensored data only, they restrict their analysis to the subset of 1466 complete data. We now apply our proposed test to the original censored data with 1500 data points.

The scatterplots for loss and ALAE presented in Frees and Valdez (1998) and Denuit et al. (2006) reveal positive right tail dependence between loss and ALAE: large losses tend to be associated with large ALAE's. This is because expensive claims generally need some time to be settled and induce considerable costs for the insurance company. Actuaries therefore expect positive dependence between large losses and large ALAE's. On the other hand, these plots do not reveal any visible left tail dependence between the two variables. As a result, it is not surprising that the Gumbel copula is chosen in Frees and Valdez (1998) and Denuit et al. (2006). To shed some light on the robustness of this result to the set of copula families being considered, we add three more copula families to the set considered in Denuit et al. (2006): Gaussian copula, survival Clayton, mixture of Clayton and Gumbel copulas; see Appendix B for expressions of these seven copulas and their partial derivatives. Survival Clayton has right tail dependence and the mixture of Clayton and Gumbel exhibits both left tail and right tail dependence unless the weights are degenerate. the Gaussian copula does not have tail dependence and is thus expected to fit poorly. They are included here in the set of copulas to see if the power of the test is adversely affected by the presence of poor copula candidates in the selection set.1

To facilitate comparison, we also apply our tests to the subset of 1466 complete data. The results of the “Monte Carlo RC” test TnP (using the AIC penalization factor) for the original censored data are presented in Table 1 and those for the subset of 1466 complete data are presented in Table 2, with 500,000 Monte Carlo repetitions. For each copula, we estimated its parameter(s) by the two-step procedure and computed the value of the AIC. To apply our model selection test we need to choose a benchmark model. In view of the existing results, we first use the Gumbel copula as the benchmark. For the Gumbel benchmark, we found the p-value of the test to be 1 with or without taking into account censoring. This provides strong evidence that none of the other six copulas performs significantly better than the Gumbel copula for the loss-ALAE data. This is consistent with the selection result based on comparing the values of the AIC only; Gumbel followed by mixture of Clayton and Gumbel, then by survival Clayton and then by Joe. The parameter estimates for the mixture of Clayton and Gumbel provide additional evidence in favor of the Gumbel copula; the estimates of the weight on Clayton are only 0.0003 when censoring is taken into account and 0.0002 when censoring is not taken into account. In addition, the estimates of the parameter in the Gumbel copula obtained by fitting the mixture of Clayton and Gumbel are very close to the estimates obtained by fitting the Gumbel copula alone for both the subset of complete data and the original censored data. To see if the test is sensitive to the choice of the benchmark model, we also used each of the remaining six copulas as the benchmark.

Table 1.

Monte Carlo p-values of the test for the original dataset subject to censoring.

Benchmark p-value of Inline graphic p-value of Inline graphic AIC 2-step estimator
Gumbel 1.0000 0.9980 −0.1447 1.4428
Clayton 0.0015 0.0004 −0.0000 0.5152
Frank 0.0688 0.0394 −0.1009 0.0473
Joe 0.3968 0.2533 −0.1263 1.6466
Gaussian 0.1692 0.0724 −0.1125 0.4668
Survival Clayton 0.6295 0.4298 −0.1380 0.7825
Mix Clayton & Gumbel 0.9469 0.9794 −0.1420 (0.1505, 1.4433, 0.0003)

Table 2.

Monte Carlo p-values of the test for the subset without censoring.

Benchmark p-value of Inline graphic p-value of Inline graphic AIC 2-step estimator
Gumbel 1.0000 0.9940 −0.2560 1.4254
Clayton 0.0037 0.0008 −0.1203 0.5098
Frank 0.1197 0.0834 −0.2160 0.0494
Joe 0.3530 0.1643 −0.2384 1.6105
Gaussian 0.2499 0.1442 −0.2286 0.4604
Survival Clayton 0.5570 0.3412 −0.2472 0.7440
Mix Clayton & Gumbel 0.9382 0.9590 −0.2530 (0.1572, 1.4256, 0.0002)

For each of the Tables 1 and 2, we present two versions of the Monte Carlo tests based on the non-standardized test, TnP, and the standardized test, TnSP, as described in Remark 4.2.2 Comparing the first two columns in Tables 1 and 2, we see that both tests yield similar high p-values when the benchmark is either Gumbel or the mixture of Clayton and Gumbel; for all the other cases, the standardized test TnSP yields significantly lower p-values than those of TnP. This indicates that the standardized version of the test is generally more powerful than the original non-standardized test.

Additionally, we present a bootstrap version of the test based on TnIP (using the AIC penalization factor). We generate a bootstrap sample by random draws with replacement from a consistent non-parametric estimator of the bivariate joint distribution that takes into account the censoring scheme. For this loss-ALAE data set, we could draw bootstrap samples either from the bivariate Kaplan–Meier estimator of Dabrowska (1989), or from the estimator of Akritas (1994) and Denuit et al. (2006). Let TnI,P be the counterpart of TnI for one bootstrap iteration, we write the re-centered bootstrap test statistic as TnIC,P=TnI,PTnIP×I{TnIPan}, where for simplicity we use the same parameter values (a, bn, an) = (1, n−1/2, 0:025n−1/2 log log n) as those in Chen and Fan (2005). In this empirical application we use 100 bootstrap repetitions. The bootstrap p-values in Tables 3 and 4 overwhelmingly support the conclusion that the Gumbel copula fits the loss-ALAE data the best among the seven copulas we considered. This finding is consistent with existing results in the literature. The fact that the results in Tables 3 and 4 are so close to each other confirms the statement in Denuit et al. (2006) that the limited amount of censored points present in this Loss-ALAE data does not seem to affect the copula selection result.

Table 3.

Bootstrap p-values of the test for the original dataset subject to censoring.

Benchmark p-value of Inline graphic AIC Two-step estimate
Gumbel 1.0000 −0.1447 1.4428
Clayton 0.0000 −0.0000 0.5152
Frank 0.0000 −0.1009 0.0473
Joe 0.1010 −0.1263 1.6466
Gaussian 0.0517 −0.1125 0.4668
Survival Clayton 0.1414 −0.1380 0.7825
Mix Clayton & Gumbel 0.9900 −0.1420 (0.1505, 1.4433, 0.0003)

Table 4.

Bootstrap p-values of the test for the subset without censoring.

Benchmark p-value of Inline graphic AIC Two-step estimate
Gumbel 1.0000 −0.2560 1.4254
Clayton 0.0000 −0.1203 0.5098
Frank 0.0000 −0.2160 0.0494
Joe 0.1052 −0.2384 1.6105
Gaussian 0.0202 −0.2286 0.4604
Survival Clayton 0.0909 −0.2472 0.7440
Mix Clayton & Gumbel 0.9963 −0.2530 (0.1572, 1.4256, 0.0002)

Finally, by comparing the bootstrap p-values in Tables 3 and 4 with the Monte Carlo p-values in Tables 1 and 2, we notice that the standardized “bootstrap RC” test is in general more powerful than the standardized “Monte Carlo RC” test, which in turn is more powerful than the non-standardized “Monte Carlo RC” test. Nevertheless, it is noteworthy that the standardized “bootstrap RC” test is computationally much more intensive than the standardized “Monte Carlo RC” test. For an AMD Athlon(tm) 64 Processor, 1.18 GHz and 384 Mb of RAM, for each benchmark case, the standardized “bootstrap RC” test (with 100 bootstrap replications) takes about 10,500 computer seconds, whereas the standardized “Monte Carlo RC” test (with 500,000 Monte Carlo repetitions) only takes about 350 computer seconds. Moreover, we are happy to see that the standardized “Monte Carlo RC” test and the standardized “bootstrap RC” test yield very similar rankings and lead to the same conclusion that the Gumbel copula fits the loss-ALAE data the best.

6. Conclusion

Many models of semiparametric multivariate survival functions are characterized by nonparametric marginal survival functions and parametric copula functions, where different copulas imply different dependence structures. In this paper, we first establish large sample properties of the two-step estimator of copula dependence parameter when the parametric copula function may be misspecified and when data may be subject to an independent but otherwise general right censorship. We then provide a penalized pseudo-likelihood ratio test for selecting among multiple semiparametric copula models for multivariate survival data. An empirical application to the famous Loss-ALAE insurance data set indicates the usefulness of our theoretical results.

Although our theoretical results allow for general right censoring scheme, we still assume that the data is independent and is subject to independent censoring. In some economic and financial applications, data could be serially dependent and may be subject to dependent censorship. The two-step estimator and its large sample properties have been extended to time series settings in Chen and Fan (2006a,b), but their results do not allow for any censoring. We shall extend the results in this paper to allow for time series and/or dependent censoring in another paper.

Acknowledgements

We thank a guest co-editor and the anonymous referees for detailed suggestions which greatly improved the paper. We thank Professors Frees and Valdez for kindly providing the loss-ALAE data, which were collected by the US Insurance Services Office (ISO). Chen and Fan acknowledge financial support from the National Science Foundation. Ying acknowledges financial support from the National Science Foundation and the National Institute of Health. Part of the work was initiated during Chen and Ying's visit to the Institute for Mathematical Sciences at the National University of Singapore whose hospitality and support are acknowledged.

Appendix A. Technical proofs

We first introduce additional notation: Njt(x)=δjtI(X~jtx), Jjt(u)=I(X~jtu), Mjt(x)=Njt(x)xJjt(u)dΛj(u) and Λj(u)=logFjo(u) the marginal cumulative hazard function of Xj, j = 1, 2.

Lemma A.1. Suppose that Conditions C1 and C5 are satisfied. Then: (i) the marginal Kaplan–Meier estimators are uniformly strongly consistent: supxτjF~j(x)Fjo(x)0 a.s. for j = 1, 2; (ii) they can be expressed as martingle integrals:

F~j(x)Fjo(x)=Fjo(x)xF~j(u)Fjo(u)t=1ndMjt(u)t=1nI(X~jtu)=Fjo(x)xt=1ndMjt(u)Fjo(u)t=1nGjt(u)+op(n12),

where op() is uniform in x ∈ [0, τj], for j = 1, 2.

Proof of Lemma A.1. Because of Condition C5, the risk set size in (−∞, τj] is of order n. Consequently, the uniform strong consistency is a special case of Theorem 3 of Lai and Ying (1991). The martingale integral approximation follows from formula (3.2.13) of Gill (1980) and the consistency of the Kaplan–Meier estimator.

Lemma A.2. Let x^j=inf{x:F~j(x)<1}, j = 1, 2. There exists τ0 > 0 such that for every ∊ > 0, there is an η > 0 such that

lim infPn(infx^jxτ01F~j(x)1Fjo(x)η)>1,j=1,2.

Proof of Lemma A.2. For notational convenience, subscript j 1, 2 will be omitted. By definition,

F~(x)=t:X~tx(1δtk=1nJk(X~t))exp{xk=1ndNk(u)k=1nJk(u)}.

The right-hand side is bounded by 123xk=1ndNk(u)k=1nJk(u), x ≤ τ0 for suitably chosen τ0, provided that τ0k=1ndNk(u)k=1nJk(u)<log(23), which holds for all large n. Thus,

1F~(x)+1n23{t=1nI(Ctτ0)I(Xtx)+1n}. (A.1)

By a theorem of van Zuijlen (1978, Theorem 1.1), for any ∊ > 0, there exists η such that

P{t=1nI(Ctτ0)I(Xtx)+1nηt=1nI(Ctτ0)Fo(x)}>1. (A.2)

Since liminfn1t=1nI(Ctτ0)>0, it follows from (A.1), (A.2) and the fact that 1F~(x)n1 for all xx^ that the lemma holds.

Proof of Proposition 3.1. The main ideas here are to use the uniform consistency of the Kaplan–Meier estimator and the identifiability Condition C2. Write

n1t=1n{(F~1(X~1t),F~2(X~2t);α)E0[(F1o(X~1t),F2o(X~2t);α)]}=n1t=1n{(F~1(X~1t),F~2(X~2t);α)(U1t,U2t;α)}+n1t=1n{(U1t,U2t;α)E0[(U1t,U2t;α)]}. (A.3)

We first show that the first term on the right-hand side of (A.3) is of order op(1), uniformly in αA. Under Condition C5, F~j(x)F~j(τj), j = 1, 2, are bounded away from 0. By continuity of ℓ() on (0,1)×(0,1)×A and Lemma A.1, the first term, with summation over t such that both F~1(X~1t) and F~2(X~2t) are bounded away from 0, is of order op(1), uniformly in αA. i.e., for every η > 0,

limnsupαAn1t=1n(F~1(X~1t),F~2(X~2t);α)(U1t,U2t;α)×I(X~1tX~2tη)=0. (A.4)

It remains to show that for every ∊ > 0, there exists η > 0 such that

P{supαAn1t=1n(F~1(X~1t),F~2(X~2t);α)I(X~jtη)},j=1,2, (A.5)

and

P{supαAn1t=1n((U1t),U2t;α)I(X~jtη)},j=1,2. (A.6)

By Lemma A.2 and Condition C4(iii), there exists K > 0 such that

P{supαAn1t=1n(F~1(X~1t),F~2(X~2t);α)I(X~jtη)>KsupαAn1t=1n((U1t),U2t;α)I(X~jtη)}<3.

Therefore, to show (A.5) and (A.6), it suffices to show that for any ∊* > 0, there exists η such that

P{n1t=1nLtI(X~jtη)}23,j=1,2. (A.7)

By Condition C4(ii) and the Markov inequality, to show (A.7), we only need to show that for any K* > 0, there exists η such that

P{n1t=1nLtI(Lt<K)I(X~jtη)}13,j=1,2. (A.8)

But, again by the Markov inequality, the left-hand side of (A.8) is bounded by

Kn1t=1nP{X~jtη}

which can be made arbitrarily small by Condition C1.

We next show that the second term is also of order op(1). By Condition C4(ii), it suffices to show that for every K > 0,

n1t=1n{(U1t,U2t;α)I(max{Lt,Lαt}K)E0[(U1t,U2t;α)I(max{Lt,Lαt}K)]}

converges to 0 uniformly in αA. But this sequence converges to 0 a.s. for every α and has uniformly bounded derivatives over the compact set A, and, therefore, the convergence must be uniform.

Proof of Proposition 3.2. The proof can be done by essentially combining the techniques of Shih and Louis (1995) and Chen and Fan (2005). A critical part is how to appropriately control the tail behavior.

By the mean-value theorem, we can linearly expand the pseudo-likelihood score function at αn to get

α^αn=B~n11nt=1nα(F~1(X~1t),F~2(X~2t);αn), (A.9)

where B~n=1nt=1nαα(F~1(X~1t),F~2(X~2t);α~n) for some α~n on the line segment between αn and α^n. Under Condition A4, we can apply the same argument for proving (A.5) to show that supαAn1t=1nαα(F~1(X~1t),F~2(X~2t);α)I(X~jtη) is asymptotically negligible as η → 0. This in conjunction with Condition A2 and the consistency of F~j and α^n, implies that B~nBn1Ip in probability as n → ∞.

Again by the mean-value theorem,

t=1nα(F~1(X~1t),F~2(X~2t);αn)=t=1nα(F1o(X~1t),F2o(X~2t);αn)+j=12t=1nαj(U~1t,U~2t;αn){F~j(X~jt)Fjo(X~jt)}=D1n+D2n, (A.10)

where (U~1t, U~2t) lies on the line segment between (F~1(X~1t),F~2(X~2t)) and (F~10(X~1t),F~20(X~2t)).

By Lemma A.1,

D2n=j=12t=1nαj(U~1t,U~2t;αn)Fjo(X~jt)×X~jtF~j(u)Fjo(u)sdMjs(u)sI(X~jsu). (A.11)

Let D2n(η)=j=12D2n,j(η) denote the right-hand side of (A.11) with the summation restricted to those terms such that X~jtη We next show that for some ∊j > 0,

n12D2n,j(η)=Op(1)(1Fjo(η))j,j=1,2, (A.12)

where Op(1) is uniform over η > 0. For any ξ ∈ (0, 1), since

E0{(1Fjo(x))ξ2xF~j(u)Fjo(u)sdMjs(u)sI(X~jsu)}2E0{x{F~j(u)F0j(u)}2{1Fo(u)}ξsI(X~jsu)I(maxtX~jtu)dΛjo(u)},

it follows from Lenglart's inequality (Gill, 1980, Theorem 2.4.2) that

E0{(1Fjo(x))ξ2xF~j(u)Fjo(u)sdMjs(u)sI(X~jsu)}2=Op(1)x{F~j(u)F0j(u)}2{1Fo(u)}ξsI(X~jsu)I(maxtX~jtu)dΛjo(u)=Op(1)n1x{1Fo(u)}ξd{1Fo(u)}=Op(1)n1{1Fo(x)}1ξ, (A.13)

where Op(1) is uniform in x and the second equality follows from Lemma A.2 and van Zuijlen (1978, Theorem 1.1). From (A.13), Lemma A.2 (with ξ = 2ξj) and Condition A3, we have, ignoring the right tail,

n12D2n,j(η)=Op(1)n1t=1nUjtξjbjUktak(1Fjo(η))12ξj=Op(1)(1Fjo(η))12ξj.

Hence, (A.12) holds with ∊j = 1 − 2ξj, j = 1, 2.

In view of (A.12), we can essentially pretend that ℓαj in (A.10) does not blow up at the tail. Therefore, (A.11) implies that for j = 1, 2,

D2n,j=s=1nn1t=1nαj(U~1t,U~2t;αn)UjtI(X~jtu)n1t=1nI(X~u)dMjs(u)=s=1nn1t=1nE0{αj(U1t,U2t;αn)UjtI(X~jtu)}Pnj(u)dMjs(u)+op(n12). (A.14)

From (A.9)(A.11) and (A.14), we see that α^nαn is asymptotically a sum of independent zero-mean random vectors. Given Condition A1, Proposition 3.2 now follows from the standard multivariate central limit theorem for independent but non-identically distributed random variables.

Proof of Propositions 3.3. The consistency of the variance estimator clearly follows from the laws of large numbers, the consistency of the Kaplan–Meier estimator and of α^n, when the possible “tail instability” is ignored. To control the tail behavior, we can applied the same techniques as in the proofs of Propositions 3.1 and 3.2. The details are omitted.

Proof of Proposition 4.2. For i = 1, …, M, by the definition of α^in, we have

t=1ni,α(F~1(X~1t),F~2(X~2t);α^in)=0.

Hence,

t=1ni(F~1(X~1t),F~2(X~2t);αin)=t=1ni(F~1(X~1t),F~2(X~2t);α^in)+12(αinα^in)t=1ni,αα(F~1(X~1t),F~2(X~2t);αin)(αinα^in),

where αin is between αin and α^in. By conditions C2–C5, A1–A4 and Proposition 3.2, we have

12n(αinα^in)t=1ni,αα(F~1(X~1t),F~2(X~2t);αin)(αinα^in)=12(αinα^in)Bin(αinα^in)+op(1n).

Hence,

1nt=1ni(F~1(X~1t),F~2(X~2t);α^in)=1nt=1ni(F~1(X~1t),F~2(X~2t);αin)+12(αinα^in)Bin(αinα^in)+op(1n).

As a result, we get for all i = 2, …, M,

LRn(F~1,F~2;α^in,α^1n)1nt=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=Ai,n+Di,n+op(1n),

where

Ai,nLRn(F~1,F~2;αin,α1n)1nt=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=1nt=1n{[i(U1t,U2t;αin)1(U1t,U2t;α1n)]E0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]+j=121nt=1n[{i,j(U1t,U2t;αin)1,j(U1t,U2t;α1n)}×{F~j(X~jt)Fjo(x~jt)}]+op(1n),
Dn12(αinα^in)Bin(αinα^in)12(α1nα^1n)B1n(α1nα^1n).

By Proposition 3.2, we have Dn = Op(n−1).

For generalized non-nested models, Using a proof similar to that of Proposition 3.2, we obtain:

n×Ai,n=1nt=1n[eitE0(eit)]+op(1)=Op(n12),

hence n×Ai,n converges in distribution to a N(0,σii). Therefore,

n[LRn(F~1,F~2;αin,α1n)1nt=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]]

converges in distribution to a N(0,σii).

For generalized nested models, the term Ai,n becomes zero almost surely, we have

LRn(F~1,F~2;α^in,α^1n)1nt=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]=Di,n+op(1n),

where by Proposition 3.2, 2nDi,n is distributed as a weighted sum of independent χ[1]2 random variables.

Proof of Proposition 4.3. Note that

Tn=maxi=2,,M[n12LRn(F~1,F~2;α^in,α^1n)]=maxi=2,,M[n12{LRn(F~1,F~2;α^in,α^1n)n1t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]}+n12t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]]maxi=2,,M[Zi+limn{n12t=1E0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]}].

Let

Kin=limn{n12t=1nE0[i(U1t,U2t;αin)1(U1t,U2t;α1n)]}.

Then

P(Tn>Zα)P(maxi=2,,M{Zi+Kin}>Zα)P(maxi=2,,MZi+maxi=2,,MKin>Zα).

For fixed alternatives, maxi=2,…,M Kin = +∞ and so P (Tn > Zα) → 1. For local alternatives such that maxi=2,…,MKin > 0,

P(maxi=2,,MZi+maxi=2,,MKin>Zα)>P(maxi=2,,MZi>Zα)=α.

Hence limn→∞ P (Tn > Zα) > α.

Appendix B. Expressions of copulas and their derivatives

In the Appendix B we describe the seven copulas and their derivatives that we have used in the empirical application Section 5.3 Let (X1, X2) be the lifetime variables of interest with joint survival function Fo(x1, x2) = Pr(X1 > x1, X2 > x2) and continuous marginal survival functions Fjo(x1,x2)=Pr(X1>x1,X2>x2). Let H (x1, x2) denote the corresponding joint cumulative distribution function (cdf) with marginal distributions Hj1Fjo,j=1,2. By the Sklar's (1959) theorem, there exists a unique copula function Ch on [0, 1]2 such that

H(x1,x2)Ch(H1(x1),H2(x2)),

or equivalently

Fo(x1,x2)Co(F1o(x1),F2o(x2)),

holds with

Co(u1,u2)u1+u21+Ch(1u1,1u2), (B.1)

where the copula function Co() is sometimes called the survival copula (of Ch).

It is easy to see that, for any j ∈ {1, 2}

Couj(u1,u2)=1Chuj(1u1,1u2), (B.2)

in fact, for any partial derivative of order k higher than 2 we have that

kCouj1ujk(u1,u2)=(1)kkChuj1ujk(1u1,1u2), (B.3)

where ji ∈ {1, 2}. Note that this last equation implies that

co(u1,u2)=ch(1u1,1u2), (B.4)

where co and ch are the copula densities associated to Co and Ch, respectively.

Using relations (B.2)(B.4), by replacing vj = 1 − uj in the expressions of partial derivatives of a copula Ch and its density ch, we immediately obtain the expressions for the partial derivatives of the survival copula Co and its density co. Therefore, in the following we only provide expressions for the partial derivatives of several copula functions Ch and their densities ch that we have used in the empirical application.

Gumbel copula

The Gumbel copula and its density are given by

Ch(v1,v2)=exp{((log(v1))α+(log(v2))α)1α}, (B.5)

and

ch(v1,v2)=1ChChv1Chv2T1,

with T1 = ((α − 1)(− log(Ch))−1 + 1). Following Frees and Valdez (1998), we can express the partial derivative of Ch with respect to vj, j = 1, 2, as

Chvj(v1,v2)=Ch(v1,v2)vj(log(vj)log(Ch(v1,v2)))α1. (B.6)

Hence, a little algebra implies4

2Chvj2=(log(vj)log(Ch))α1(ChvjvjChvj2)+(α1)Chlog(Ch)vj2(log(vj)log(Ch))α2(α1)log(vj)(log(vj)log(Ch))α2vjlog(Ch)2Chvj.

The partial derivative of the copula density ch with respect to vj, j = 1, 2, is given by

chvj=1Ch2(Chvj)2ChviT1+1Ch2Chvj2ChviT1+1ChChvjchT1+1ChChvjChviT1vj,

where

T1vj=(α1)(log(Ch))2ChChvj.

Clayton copula

The Clayton copula and its density are given by

Ch(v1,v2)=(v1α+v2α1)1α,

and

ch(v1,v2)=(1+α)v1(1+α)v2(1+α)(v1α+v2α1)(1α+2).

Hence the second order partial derivative of Ch with respect to vj, j = 1, 2, is given by

2Chvj2=(1+α)(1Ch(Chvj)21vjChvj),

where

Chvj=vj(1+α)(v1α+v2α1)1α1.

The first order partial derivative of the copula density ch with respect to vj, j = 1, 2, is given by

chvj=ch((1+2α)ChChvj(1+α)vj).

Frank copula

The Frank copula and its density are given by

Ch(v1,v2)=1log(α)log(1(1αv1)(1αv2)1α),

and

ch(v1,v2)=log(α1)αv1αv21α(1(1αv1)(1αv2)1α)2.

After some algebra, the second order partial derivative of Ch with respect to vj, j = 1, 2, is given by

2Chvj2=log(α)(Chvj(Chvj)2)

where

Chvj=(1(1αv1)(1αv2)1α)1(1αvi)αvj1α,

and the first order derivative of the copula density ch with respect to vj, j = 1, 2, is given by

chvj=log(α)(12Chvj)ch.

Joe copula

The Joe copula and its density are given by

Ch(v1,v2)=1(v1α+v2αv1αv2α)1α

and

ch(v1,v2)=v1α1v2α1T21α1(α(1α)(1v1α)(1v2α)T2),

where vj=1vj and T2=v1α+v2αv1αv2α.

The second order partial derivative of the copula Ch with respect to vj, j = 1, 2, is given by

2Chvj2=(1α)vjα1(1viα)T21α1(vjα1(1viα)T21vj).

After some tedious algebra, the first order partial derivative of the copula density ch with respect to vj, j = 1, 2, is given by

chvj=(α1)viα1vjα2T21α1(1+vjα(1viα)T21)×(α(1α)(1v1α)(1v2α)T2)+(1α)αviα1vj2α2(1viα)T21α3×(T2(1vjα)(1viα)).

Gaussian copula

The Gaussian copula and its density are given by

Ch(v1,v2)=Φα(Φ1(v1),Φ1(v2)),

where ϕα is the bivariate standard normal distribution with correlation α, Φ is the scalar standard normal distribution, and

ch(v1,v2)=ϕα(Φ1(v1),Φ1(v2))ϕ(Φ1(v1))ϕ(Φ1(v2)),

where ϕ is the density function of Φ, and ϕα is the density function of ϕα.

The second order partial derivative of the copula Ch with respect to vj, j = 1, 2, is given by

2Chvj2=[Φ1(vi)Φ1(vj)αs2π(1α2)32×exp(12Φ1(vj)22αΦ1(vj)s+s21α2)ds]ϕ(Φ(vj)1)2+ChvjΦ(vj)1ϕ(Φ(vj)1)1.

The first order partial derivative of the copula density ch with respect to vj, j = 1, 2, is given by

chvj=(Φ(vj)1Φ(vj)1Φ(vj)1α1α2)ϕ(Φ(vj)1)1ch.

Mixture copula

A mixture copula Ch(v1, v2; α), with its parameter α = (α1, α2, λ), is simply given by

Ch(v1,v2;α)=λCh1(v1,v2;α1)+(1λ)Ch2(v1,v2;α2),0λ1,

where Ch1(v1,v2;α1) is one copula (such as the Clayton copula in our application) with its parameter α1, and Ch1(v1,v2;α2) is another copula (such as the Gumbel copula in our application) with its parameter α2. Then it is clear that the partial derivatives of Ch are simply the linear combination of the partial derivatives of the two copulas:

kChvjk=λkCh1vjk+(1λ)kCh2vjk,j=1,2.

Footnotes

1

Since our test is developed for semiparametric copula-based survival functions instead of distribution functions, we use the survival copulas of these seven copula functions in implementing our test. However, we present our empirical results in terms of copulas of the corresponding semiparametric distribution functions in order to compare our results with existing results just cited.

2

When computing the test statistic TnSP, we have used a = 1 and bn = 10/n2.

3

In the empirical application we have used both analytical derivatives and numerical derivatives, while the results based on analytical derivatives perform slightly better. Since these analytical derivatives for copulas are tedious to compute, we include them in this Appendix B so that readers could use them in other applications as well.

4

We leave the dependence on (v1, v2) implicit, to ease the notational burden.

References

  1. Akritas M. Nearest neighbor estimation of a bivariate distribution under random censoring. Annals of Statistics. 1994;22:1299–1327. [Google Scholar]
  2. Chen X, Fan Y. Pseudo-likelihood ratio tests for model selection in semiparametric multivariate copula models. The Canadian Journal of Statistics. 2005;33:389–414. [Google Scholar]
  3. Chen X, Fan Y. Estimation and model selection of semiparametric copula-based multivariate dynamic models under copula misspecification. Journal of Econometrics. 2006a;135:125–154. [Google Scholar]
  4. Chen X, Fan Y. Estimation of copula-based semiparametric time series models. Journal of Econometrics. 2006b;130:307–335. [Google Scholar]
  5. Chen X, Fan Y. A model selection test for bivariate failure-time data. Econometric Theory. 2007;23:414–439. [Google Scholar]
  6. Dabrowska D. Kaplan–Meier estimate on the plane: Weak convergence, LIL, and the bootstrap. Journal of Multivariate Analysis. 1989;29:308–325. [Google Scholar]
  7. Davison AC, Hinkley DV. Bootstrap Methods and Their Application. Cambridge University Press; 1997. [Google Scholar]
  8. Denuit M, Purcaru O, Van Keilegom I. Bivariate Archimedean copula modelling for censored data in non-life insurance. Journal of Actuarial Practice. 2006;13:5–32. [Google Scholar]
  9. Frees E, Valdez E. Understanding relationships using copulas. North American Actuarial Journal. 1998;2:1–25. [Google Scholar]
  10. Genest C, Ghoudi K, Rivest L. A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika. 1995;82:543–552. [Google Scholar]
  11. Gill R. Math. Centre Tracts. Vol. 124. Mathematisch Centrum; Amsterdam: 1980. Censoring and Stochastic Integrals. [Google Scholar]
  12. Hansen RP. Manuscript. Brown University; 2003. A test for superior predictive ability. [Google Scholar]
  13. Joe H. Multivariate Models and Dependence Concepts. Chapman & Hall/CRC; London: 1997. [Google Scholar]
  14. Klugman S, Parsa R. Fitting bivariate loss distributions with copulas. Insurance: Mathematics and Economics. 1999;24:139–148. [Google Scholar]
  15. Lai T, Ying Z. Estimating a distribution function with truncated and censored data. Annals of Statistics. 1991;19:417–442. [Google Scholar]
  16. Li D. On default correlation: A copula function approach. Journal of Fixed Income. 2000:43–54. [Google Scholar]
  17. Nelsen R. An Introduction to Copulas. Springer; New York: 1999. [Google Scholar]
  18. Oakes D. Bivariate survival models induced by frailties. Journal of the American Statistical Association. 1989;84:487–493. [Google Scholar]
  19. Oakes D. Multivariate survival distributions. Journal of Nonparametric Statistics. 1994;3:343–354. [Google Scholar]
  20. Romano JP, Wolf M. Stepwise multiple testing as formalized data snooping. Econometrica. 2005;73:1237–1282. [Google Scholar]
  21. Shih J, Louis T. Inferences on the association parameter in copula models for bivariate survival data. Biometrics. 1995;51:1384–1399. [PubMed] [Google Scholar]
  22. Sin C, White H. Information criteria for selecting possibly misspecified parametric models. Journal of Econometrics. 1996;71:207–225. [Google Scholar]
  23. Sklar A. Fonctions de r'epartition'a n dimensionset leurs marges. Publications of the Institute of Statistics University Paris. 1959;8:229–231. [Google Scholar]
  24. van Zuijlen MCA. Properties of the empirical distribution function for independent nonidentically distributed random variables. Annals of Probability. 1978;6:250–266. [Google Scholar]
  25. Wang W, Wells M. Model selection and semiparametric inference for bivariate failure-time data. Journal of the American Statistical Association. 2000;95:62–76. [Google Scholar]
  26. White H. A reality check for data snooping. Econometrica. 2000;68:1097–1126. [Google Scholar]

RESOURCES