Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Nov 25.
Published in final edited form as: Stat Med. 2013 May 30;32(24):4275–4284. doi: 10.1002/sim.5857

Causal Inference for Bivariate Longitudinal Quality of Life Data in Presence of Death Using Global Odds Ratios

Keunbaik Lee 1, Michael J Daniels 2,*
PMCID: PMC3935993  NIHMSID: NIHMS555857  PMID: 23720372

SUMMARY

In longitudinal clinical trials, if a subject drops out due to death, certain responses, such as those measuring quality of life (QOL), will not be defined after the time of death. Thus, standard missing data analyses, e.g., under ignorable dropout, are problematic because these approaches implicitly ‘impute’ values of the response after death. In this paper we define a new survivors average causal effect for a bivariate response in a longitudinal quality of life study that had a high dropout rate with the dropout often due to death (or tumor progression). We show how principal stratification, with a few sensitivity parameters, can be used to draw causal inferences about the joint distribution of these two ordinal quality of life measures.

1. Introduction

In a longitudinal clinical trial, if a subject drops out due to death, the responses will often not be defined after the dropout time. Thus, standard missing data analyses are problematic because they implicitly (or explicitly) ‘impute’ values of response after dropout. In this circumstance the joint distribution of the longitudinal responses and death times can be used to conduct an analysis [1, 2, 3, 4]. However, this approach stratifies on death times (that occur after randomization) and thus the estimated treatment effect is not a causal one. Similar problems occur by restricting analyses to only the survivors on each treatment arm.

To obtain the causal effect of treatment, we first introduce potential outcomes. Potential outcomes are all the outcomes that would be observed if each of the treatments had been applied to each of the subjects [5, 6, 7]. So for example, in the case of a binary treatment Z, the two potential outcomes would be Y(Z) for Z = 0, 1. The causal effect of the treatment could be defined as the difference, Y(1) − Y(0). In a randomized trial, this can be estimated as the differences in the sample mean for those randomized to each treatment. Causal inference becomes more complex when there is no randomization (e.g., in observational studies) and in settings such as when an outcomes of interest is unobserved due to death (as mentioned above). We will explore the latter situation.

‘Principal stratification’ uses potential outcomes and partitions subjects into sets with respect to posttreatment variables [8]. The principal strata are not affected by treatment assignment. Causal effects are defined within these principal strata. To estimate the causal effect of treatment in our setting, one can define a ‘survivors average causal effect (SACE)’ [8, 9, 10, 11]; we describe this further in Section 3, but it is a causal treatment comparison among those who would survive on both treatment arms (which corresponds to one of the principal strata). Here, survival is the posttreatment variable. Egleston et al. [9] proposed assumptions to identify the SACE among survivors at a fixed time for a binary response and implemented a sensitivity analysis for some of those assumptions. Lee and Daniels [10] estimated the SACE for ordinal data in a two treatment arm setting, while Lee et al. [11] introduced a set of relevant SACEs in the setting of three treatment arms with ordinal outcomes. In a related setting, Cheng [12] defined an average causal effect for univariate ordinal outcomes.

In this paper, we define a novel SACE for bivariate ordinal outcomes using the ratio of global odds ratios [13]. We also present a (simple) set of assumptions to identify the SACE. Instead of just using the data at the time point of interest (the end of the study) to estimate the SACE, we will use all available longitudinal data via models for bivariate longitudinal data which is essential due to the high rate of dropouts unrelated to death in this study. We do this by introducing a partial ignorability assumption [14] that is weaker than that used in earlier work by conditioning on baseline covariates [11].

The paper is organized as follows. In Section 2, we describe our motivating data set. In Section 3, we formally define the causal effect of interest, provide some assumptions that are necessary to identify it, and then show how it can be estimated from these assumptions along with the observed data. In Section 4, we analyze data from the longitudinal quality of life (QOL) study. Finally, conclusions and extensions are provided in Section 5.

2. Description of Data

Our approach is motivated by data from a colorectal cancer clinical trial that was designed to compare survival among three treatments (IFL, FOLFOX, IROX) [15]. The main objective of this trial was to find a better treatment for colorectal cancer. The median survival for patients receiving FOLFOX was higher than IROX and IFL. IFL was the standard of care at the time of the trial. Secondarily, it was of interest to determine if the treatment differed with respect to QOL since the toxicity profiles were quite different on the treatment arms.

Previous work [4, 10, 11] focused on a single QOL measure. In this paper, we focus on two QOL measures, insomnia and outlook (measure of anxiety). Insomnia (outlook) was measured on a 5 point ordinal scale, 1: It is almost impossible for me to get a decent night’s sleep (I am worried and scared about things); 2: I have difficulty sleeping almost every night (I am worried and a little frightened about things); 3: I frequently have trouble getting to sleep and staying asleep (I am quite worried, but unafraid); 4: I have occasional spells of sleeplessness (I am a little worried about things); 5: I sleep as well as I always have (I am not fearful or worried). The connection between these two is of interest as high insomnia is related to a more negative outlook (anxiety) [16].

Forty-one percent of those dropped out from the study did so due to tumor progression or death. A naive treatment comparison among the survivors on each treatment is not causal. We outline an approach to estimate the causal effect of the treatment difference (among survivors) next. In the following, we focus on (only) two of the treatments (IFL and FOLFOX).

3. Principal Stratification Approach

We first introduce some further notation for our setting. Denote Txi as the treatment indicator for subject i,

Txi={0,ifsubjectiisassignedtoIFL;1,ifsubjectiisassignedtoFOLFOX,

and let Dit(Txi) be tumor-progression/death indicator for subject i at visit t,

Dit(Txi)={0,alive;1,tumorprogressedordead.

Obviously, if Dit(Txi) = 1, then Dis(Txi) = 1 for s > t. To maintain clarity, we will refer to the tumor progression/death indicator Dit(·) as the death indicator for what follows.

We define four (principal) strata, At(l, m) = {i|(Dt(0), Dt(1)) = (l, m)} based on the pairs of death indicators (the posttreatment variable) at each time. Within each of these strata, we can define a causal effect. At(0, 0) is the stratum for the subjects who would be alive under both arms at visit t; At(1, 0) is for the subjects who would be alive under the experimental treatment (FOLFOX) but not alive under the standard of care treatment (IFL) at visit t; At(0, 1) is for the subjects who would be alive under IFL but not alive under the FOLFOX at visit t; At(1, 1) is for the subjects who would not be alive under either treatment at visit t.

Now, we define a more general set of potential outcomes for each subject under each treatment at each time. Let Yit1(Txi) and Yit2(Txi) be the potential outcomes for subject i at time t. The full set of potential variables for subject i at visit t is

Pit={Dit(0),(Yit1(0),Yit2(0);Dit(0)=0),Dit(1),(Yit1(1),Yit2(1);Dit(1)=0)},

where, for treatment tx, Yit1(tx) and Yit2(tx) are the potential responses if a subject i is alive at visit t(Dit(tx) = 0).

Let Yit = (Yit1, Yit2). If the patient is alive (Dit(Txi) = 0), define Rit to be the indicator that Yit = Yit(Txi) is observed. In the following, we assume monotone dropout (Rit = 1 ⇒ Rit−1 = 1). So, the observed data for individual i is

Oit={Txi,Dit,(Rit;Dit=0),(Yit;Dit=0,Rit=1)}.

Next we define the causal effect of interest that will be of interest in the strata corresponding to surviving on both treatment arms. We then discuss assumptions to identify it given the observed data.

Causal Effect

We suppress the subscript i here for clarity. We define a causal effect that captures the treatment difference on the joint distribution of insomnia and outlook. More specifically, the causal effect of interest will be the ratio of generalized odds ratios (GOR) [17] under each treatment in the strata corresponding to surviving at time T on both treatments,

SACEk1,k2=GOR(k1,k2,Tx=1;AT(0,0))GOR(k1,k2,Tx=0;AT(0,0)). (1)

The GOR for two ordinal responses is defined as

GOR(k1,k2,Tx=j;AT(0,0))=P(YT1(j)k1,YT2(j)k2AT(0,0))P(YT1(j)>k1,YT2(j)>k2AT(0,0))P(YT1(j)k1,YT2(j)>k2AT(0,0))P(YT1(j)>k1,YT2(j)k2AT(0,0)). (2)

GORs are convenient summaries of the relationship between ordered categorical responses and are an extension of the simple odds ratio for a 2 × 2 contingency table. As can be seen in (2), the GOR corresponds to collapsing the contingency table into a 2×2 table [18] via cumulative probabilities; in particular, collapsing (or dichotomizing) the ordinal variable YT1(j) as ≤ k1 vs. > k1 and similarly for YT1(j) with k2. In our case, each GOR is the odds ratio of probabilities that both QOL responses at time T are smaller than k1 and k2 (in our example, high insomnia and outlook) for subjects who would be alive under both treatment arms. SACEk1k2 is the ratio of GORs for the two treatment arms. k1 and k2 would be chosen by the investigator.

The ratio of odds ratios was first proposed as a measure of the interaction effects of two dichotomous exposure factors on a binary response in [19]. Here, we have extended this idea to accommodate bivariate ordinal outcomes in the setting of causal inference.

The GOR in (2) consists of four joint probabilities. Using the following relationship from [17], it is clear we will only need to estimate FiT12(k1, k2|Tx = j, AT (0, 0)) = P(YT1(j) ≤ k1, YT2(j) ≤ k2|AT (0, 0)) to estimate (2),

GOR(k1,k2,Tx=j;AT(0,0))=FiT12(k1,k2Tx=j,AT(0,0)){1-FiT1(k1Tx=j,AT(0,0))-FiT2(k2Tx=j,AT(0,0))+FiT12(k1,k2Tx=j,AT(0,0))}{FiT1(k1Tx=j,AT(0,0))-FiT12(k1,k2Tx=j,AT(0,0))}{FiT2(k2Tx=j,AT(0,0))-FiT12(k1,k2Tx=j,AT(0,0))}, (3)

where

FiT12(k1,k2Tx=j,AT(0,0))=P(YT1(j)k1,YT2(j)k2AT(0,0)),FiT1(k1Tx=j,AT(0,0))=P(YT1(j)k1AT(0,0))=P(YT1(j)k1,YT2(j)K2AT(0,0)),FiT2(k2Tx=j,AT(0,0))=P(YT2(j)k2AT(0,0))=P(YT1(j)K1,YT2(j)k2AT(0,0)).

for j = 0, 1. Next, we outline assumptions necessary to identify the SACE.

3.1. Assumptions for Identification of a Causal Effect

We first make the Stable Unit Treatment Value Assumption (SUTVA, [6]) which states that a subject’s potential outcomes are unrelated to the treatment status of other subjects. Next, we list the other assumptions necessary to identify the causal effect of interest. However, before doing that, we need to define a few more quantities.

Let gT(tx, w) = P(DT (tx) = 0|w) for tx = 0, 1 where w represent baseline covariates. gT is the marginal probability of being alive at time T given baseline covariates. We assume that

logit[gT(tx,w)]=wTψtx. (4)

Note that we can easily be more flexible than the simple logistic regression model above (e.g., using regression trees [20]). Also, define

pT(w)=P(DT(1)=0DT(0)=0,w).

pT(w) is the probability of being alive on FOLFOX at time T, given that also alive on IFL at time T for a patient with baseline covariates, w.

Assumption 1 (Ignorability)

Tx ⊥ Inline graphic.

where Inline graphic = ( Inline graphic, ···, Inline graphic) is the history of potential outcomes up to and including the outcomes at visit T.

Assumption 1 states that the treatment arm is unrelated to the set of potential outcomes. This holds since the treatments were randomized.

Assumption 2 (Stochastic Monotonicity)

We assume pT(w) is given by

pT(w)=gT(1,w)+ρ{U(w)-gT(1,w)},0ρ1,

where U(w)=min{1,gT(1,w)gT(0,w)}. The term, U(w), corresponds to the bounds on the conditional distribution given gT (1, w). This modeling assumption has been used previously in Roy et al. [21] and Lee et al. [11] for QOL data. Note that if ρ = 0, then pT(w)=gT(1,w), which implies that DT(1) is independent of DT(0). If ρ = 1, then pT(w)=U(w), which is the largest possible probability that is compatible with the marginal distributions. This assumption implies that the probability of being alive on the experimental arm (FOLFOX) is more likely conditional that one would also be alive on the standard of care arm (IFL).

Assumption 3 (Proportional Odds Reduction of Sensitivity Parameters)

odds(YT1(1)k1,YT2(1)k2AT(1,0))odds(YT1(1)k1,YT2(1)k2AT(0,0))=τ,odds(YT1(0)k1,YT2(0)k2AT(0,1))odds(YT1(0)k1,YT2(0)k2AT(0,0))=λ, (5)

where 0 < τ, λ < ∞. This assumptions characterizes the relationship between potential outcomes of interest in different strata.

From (5), the joint probability of insomnia and outlook under the FOLFOX arm among the group that is alive under the FOLFOX arm but not alive under the standard of care treatment at visit T is given by,

P(YT1(1)k1,YT2(1)k2AT(1,0))=τP(YT1(1)k1,YT2(1)k2AT(0,0))1-(1-τ)P(YT1(1)k1,YT2(1)k2AT(0,0)). (6)

The functional form (6) was introduced in Marshall and Olkin [22] for a family of distributions from a survival function and Sankaran and Jayakumar [23] interpreted the family using the odds function. The sensitivity parameters, τ and λ, are the odds ratios of the joint probability of insomnia and outlook under the FOLFOX arm among the group that is alive under the FOLFOX arm but not alive under the standard of care (IFL) treatment at visit T as compared to the group that is alive under both arms. We assume the two sensitivity parameters, τ and λ, do not depend on k1 and k2 (proportional odds type assumption). A large value of τ means that the joint probability of insomnia and outlook under the FOLFOX arm among patients in stratum AT (1, 0) is higher than that among patients in stratum AT (0, 0). When τ = 1, the two probabilities are same. We also have similar interpretation of λ for stratum AT (0, 1) versus AT (0, 0).

Similar assumptions in related contexts were used in Egleston et al. [9], Lee and Daniels [10], and Lee et al. [11] in the context of a univariate response. Here, we have provided a related extension in the more complex setting of a bivariate response. Assumptions with just a few sensitivity parameters, like Assumption 3, are essential to conducting a feasible sensitivity analysis [24].

3.2. Identification of the SACE

The following theorem confirms that the observed data, Inline graphic, along with Assumptions 1–3 are sufficient to identify the SACE.

Theorem I

Under assumptions 1, 2, and 3, SACEk1k2 is identified.

The proof is given in the Supplementary Materials/Web Appendix.

3.3. Estimation of SACE

Recall that a lot of missing data in this study was due to dropout unrelated to death. To use all available data on survivors to estimate the SACE, we need one more assumption.

Assumption 4: (Conditional MAR)

Rt ⊥ {Yt, …, YT}|DT = 0, Ȳt−1, Tx, w for t < T.

This assumption states that missingness of outcome at visit t is independent of the value of the outcome given the previous history of outcomes (Ȳt−1), treatment (Tx), and baseline covariates (w) for all subjects with progression/death time larger than T(DT = 0). It is MAR conditional on progression/death time; assumptions like these have been termed ‘partially ignorable’ [14]. This assumption will allow us to use all the available data on survivors for the analysis. The addition of baseline covariates is a simple weakening of the assumption proposed in [11].

Models for the observed data

To use all the available longitudinal data, we can use any joint model for bivariate longitudinal ordinal data. In this paper, we consider the following marginalized model which was introduced in Lee et al. [25],

logit{P(Yit1kxit,DT=0,tx)}=β10k+xitTβ1+α1tx, (7)
logit{P(Yit2kxit,DT=0,tx)}=β20k+xitTβ2+α2tx, (8)
logit{P(Yit1kbit1,xit,DT=0,tx)}=Δit1k+bit1, (9)
logit{P(Yit2kbit2,xit,DT=0,tx)}=Δit2k+bit2,biTi.i.d.N(0,i), (10)

where bi = (bi11, bi12, ···, biT1, biT2)T, xit is a vector of covariates including wi, and tx is an indicator for treatment (tx = 0(IFL) and 1 (FOLFOX)). In marginal models (7) and (8), βj0k are intercepts for the marginal models for each ordinal response that satisfy the monotonicity property,

βj01<<βj0K-1,

βj are the regression coefficients of the covariates xit, and αj are the coefficients of tx for j = 1, 2. The intercepts in the two random effects models (9–10), Δitjk are determined based on the relationship between the marginal, (7–8), and the conditional, (9–10), cumulative probabilities; they can be calculated by a Newton-Raphson algorithm. These types of marginalized models were first introduced in Heagerty [26]. For further details on the computations, see the Supplementary Materials/Web Appendix.

We specify Σi = Ri ⊗ Σ2 where Ri is a correlation matrix, Σ2 is a covariance matrix between responses at the same time, and ⊗ denotes the Kronecker product. We consider the following forms for Ri and Σ2,

Ri=(1ρi12ρi,1,niρi121ρi,2,niρi,1,niρi,1,ni1)and2=(σ12σ12σ12σ22). (11)

Ri can be a relatively simple structure such as AR(1) or more complex using partial autocorrelations ([27, 28]). The covariance matrix Σi captures the correlation of the observed bivariate responses, both longitudinally and inter-response. For further details on this model, see Lee et al. [25] and for details on the form of Ri in our application, see Section 4. As is typical in non-Bayesian approaches, we implicitly assume conditional independence between the potential outcomes. In principle, we could do sensitivity to this assumption, but the only impact would be minor changes (if any) to the standard errors.

To estimate the SACE, we need to estimate the joint probability of insomnia and outlook for those who did not progress/die before the end of the study, P(YiT1k1, YiT2k2|DiT = 0, tx). As stated earlier, this can be estimated using any model for the observed data. For the observed data model specified above, this probability can be expressed as

P(YiT1k1,YiT2k2DiT=0,tx)=Ew{P(YiT1k1biT1,DiT=0,tx,w)P(YiT2k2biT2,DiT=0,tx,w)ϕ(biT)dbiT}=letEw{hT,tx(k1,k2,w)}. (12)

The inner part of the integral in (12) can be estimated using the maximum likelihood estimates of the parameters (7)–(10) in the model. We prefer to make inference here unconditional on baseline covariates. As a result, we take expectations over the distribution of the baseline covariates, w. The expectations in (12) can be estimated using its empirical expectations. That is, an arbitrary function f(w; tx), can be estimated as

E^w{f(w;tx)}=1Ni=1Nf^(wi;tx).

Related work (Sjolander et al., 2009) defines the causal effect conditional on baseline covariates. See the appendix for further details on the entire estimation algorithm.

Overall algorithm to estimate the SACE

Now we present the complete algorithm to estimate the SACE. For a fixed ρ, τ, and λ,

  1. EstimategT (tx, w) in (4) for tx = 0, 1. Then use these to estimate pT in Assumption 2.

  2. Estimate hT,tx(k1, k2; w) for tx = 0, 1 in (12) using the mle of the parameters in (7)–(10).

  3. Estimate P(YT1(tx) ≤ k1, YT2 (tx) ≤ k2|AT (0, 0)) for tx = 0, 1 using the estimates in Steps 1 and 2 and the expressions (A.5) and (A.8) given in the Supplementary Materials/Web Appendix.

  4. Estimate GOR(k1, k2, Tx = tx; AT (0, 0)) for tx = 0, 1 by plugging in the estimates from Step 3 into the expression for the GOR in (3). Then, plug these estimates into (1) to estimate SACEk1,k2.

To calculate standard errors for SACE, we use the delta method (details in the Supplementary Materials/Web appendix).

4. Data Analysis

We use the approach proposed in Section 3 to analyze the QOL data described in Section 2. The data consists of the 466 subjects randomized to either IFL or FOLFOX. The observation time of the response was defined as the (12 week) time window during which the survey was filled out (0=baseline, 1=1–84 days after going on study, 2=85–168 days after going on study,…, 5=337–420 days after going on study). We have the actual progression/death times for subjects who dropped out due to progression/death and also for those who dropped out for other reasons. The number of subjects in each 12 week progression/death window by treatment group is given in Table I. We fit our models to only those subjects who had not died or progressed by the end of the study (the 6th 12 week window in Table I).

Table I.

Break down of time of Progression/Death (in 12 week windows) by treatment groups for QOL data. Proportions in parentheses. The last 12 week window corresponds to ‘after study.’

Treatment 12 week window of progression/death
Total
1 2 3 4 5 6
IFL 15 (0.064) 18 (0.077) 25 (0.107) 21 (0.090) 29 (0.124) 126 (0.539) 234
FOLFOX 7 (0.030) 8 (0.034) 21 (0.091) 17 (0.073) 22 (0.095) 157 (0.677) 232
Total 22 (0.047) 26 (0.056) 46 (0.099) 38 (0.082) 51 (0.109) 283 (0.607) 466

Because very few patients reported category 1 for both QOL measures (insomnia and outlook), we collapsed category 1 and 2 into one category (category 1) and then categories 3–5 were relabeled as categories 2–4, respectively. As covariates for the models in (9)–(10), we included treatment (tx=1 for FOLFOX; tx=0 for IFL), (re-scaled) visit number (TIME=0.0, 0.1, ···, 0.5), and baseline covariates which were age (AGE), patient performance status (PS), and number of sites of metastatic disease (NOSITE). These covariates were categorized as in Lee et al. (2010) (age < 70 (AGE=0) and ≥ 70 (AGE=1); performance status is high {0, 1} (PS=0) and low {2, 3, 4} (PS=1); number of sites of metastatic disease was 1 (NOSITE1, NOSITE2)=(0,0), 2 (NOSITE1, NOSITE2)=(1,0), and > 2 (NOSITE1, NOSITE2)=(0,1)). The correlation matrix Ri(tx) in (11) is assumed to have an AR(1) structure.

As indicated in Table I for each subject, we know when they died or progressed if it happened before the study ended. To estimate survival probabilities, gT (tx, w), in (4) we fitted a logistic regression for each treatment arm with baseline covariates as indicated above. The estimated regression coefficients, ψ̂ in (4), are given in Table II. On IFL treatment, the coefficient of PS was significant. This indicates that the estimated survival probability on the IFL treatment was less for the low performance group than for the high performance group. On FOLFOX treatment, the coefficients of PS and NOSITE2 were significant. This indicates that the estimated survival probability on the FOLFOX treatment was less for the low performance group than for the high performance and was less for patients with more than 2 sites of metastatic disease than those with only one.

Table II.

Maximum likelihood estimates of the regression coefficients, ψtx for the model in (4). Standard errors are in parentheses. AGE=I{age < 70}; PS=I{performance status ∈ (2, 3, 4)}; NOSITE1 = I{# of sites = 2}; NOSITE2 = I{# of sites > 2}.

IFL FOLFOX
Intercept 0.51 (0.23) 1.32 (0.27)
AGE −0.39 (0.33) −0.29 (0.35)
PS −2.41* (1.06) −1.53* (0.66)
NOSITE1 −0.04 (0.32) −0.55 (0.35)
NOSITE2 −0.67 (0.34) −0.91* (0.36)
*

indicates significance with 95% confidence level.

Table III presents the estimated target probabilities (12) on the two treatment arms computed using the longitudinal model for the observed data specified in Section 3.3. Overall, the estimated cumulative joint probabilities on FOLFOX were higher than those on the IFL. These quantities were used to estimate the SACE using the algorithm in Subsection 3.

Table III.

Maximum likelihood estimates of P(YT1k1, YT2k2|DT = 0, Tx). P (YT1k1, YT2k2|DT = 0, Tx) is the joint cumulative probability of insomnia and outlook for those who did not progress/die before the end of study.

IFL
k2
FOLFOX
k2
1 2 3 4 1 2 3 4
k1 1 0.023 0.031 0.055 0.061 0.028 0.039 0.070 0.077
2 0.059 0.084 0.169 0.193 0.069 0.099 0.205 0.234
3 0.125 0.188 0.460 0.572 0.137 0.206 0.509 0.632
4 0.161 0.251 0.719 1.000 0.168 0.262 0.735 1.000

The estimated correlation between insomnia and outlook from Σ2 in (11) was 0.352, which indicates a positive relation (i.e., high insomnia is positively correlated with negative outlook). Also, the estimated correlation (ρ) for serial dependence in (11) was 0.794 which indicates a strong serial correlation.

We can estimate the SACEk1,k1 for all k1, k2 ∈ {1,…, 4} ⊗ {1,…, 4}. However, since the treatment effect on high insomnia and negative outlook (k1 = 1, k2 = 1) was of interest, we estimated SACE1,1 which corresponds to high insomnia and negative outlook (k1 = 1, k2 = 1). Since the identification of SACE relies on untestable assumptions, we implemented a sensitivity analysis to draw inference about SACE by varying the sensitivity parameters τ and λ in Assumption 3. We chose a very wide range for the sensitivity parameters, [0.1, 6.0]. The estimated values of SACE1,1 and associated p-values as a function of λ and τ under H0: log SACE = 0 are given in Figure 1 for various values of ρ.

Figure 1.

Figure 1

Maximum likelihood estimates of log SACEK−1, K−1 with ρ = 0, 0.5, 1.0, and corresponding p-values for H0: log SACEK−1, K−1 = 0.

When ρ = 0 (cf: assumption 2), which corresponds to dying on FOLFOX being independent of dying on IFL, the estimated values of SACE1,1 increased as λ and τ respectively increased and decreased (Figure 1(a)). The same pattern happened for ρ = 0.5 which corresponds to a moderate correlation between dying on the two treatments (Figure 1(c)). In contrast, when ρ = 1 (high correlation), the estimated values of SACE1,1(1, 0) decreased slowly as τ increased (Figure 1(e)). So when τ is less than 1, the probability of high insomnia and negative outlook on the FOLFOX treatment was greater than the corresponding probability on the IFL treatment. When τ was greater than 1, the probability of high insomnia and negative outlook on the FOLFOX treatment was lower. However, for all values of ρ there were no significant difference between these probabilities (Figure 1(b), 1(d), 1(f)).

5. Conclusion

In this paper, we have proposed a causal inference approach for bivariate ordinal outcomes in the presence of dropout due to progression or death. To do this we proposed a novel causal estimand and an appropriate set of assumptions to identify it. We used all available longitudinal data (not just the data at the time point of interest) via a longitudinal model and a partial ignorability assumption. We implemented a sensitivity analysis for nonidentifiable assumptions which were parameterized parsimoniously.

Based on this approach, we concluded that patients’ insomnia and outlook levels were not influenced by treatment. However, we note that the power was low for this comparison with few patients still on study at one year. In our approach, we somewhat offset the small sample size at year one via modelling assumptions which allowed us to use all available longitudinal data.

Different choices of the SACE and/or examining several SACE at once would require different and/or more complex alternatives to Assumption 3.

Supplementary Material

Computational details

Acknowledgments

We would like to thank Dr. Daniel Sargent and Ms. Erin Green of the Mayo Clinic for providing the data and for their help in data collection and clarifying some issues with data. This project was supported by NIH grant CA85295 and Basic Science Research Program through the National Research Foundation of Korea (KRF) funded by the Ministry of Education, Science and Technology (2012-004002).

References

  • 1.Hogan JW, Laird NM. Mixture models for the joint distribution of repeated measures and event times. Statistics in Medicine. 1997;16:239–257. doi: 10.1002/(sici)1097-0258(19970215)16:3<239::aid-sim483>3.0.co;2-x. [DOI] [PubMed] [Google Scholar]
  • 2.Pauler DK, McCoy S, Moinpour C. Pattern mixture models for longitudinal quality of life studies in advanced stage disease. Statistics in Medicine. 2003;22:795–809. doi: 10.1002/sim.1397. [DOI] [PubMed] [Google Scholar]
  • 3.Kurland BF, Heagerty PJ. Directly parameterized regression conditioning on being alive: analysis of longitudinal data truncated by deaths. Biostatistics. 2005;6:241–258. doi: 10.1093/biostatistics/kxi006. [DOI] [PubMed] [Google Scholar]
  • 4.Lee K, Daniels M. A class of markov models for longitudinal ordinal data. Biometrics. 2007;63:1060–1067. doi: 10.1111/j.1541-0420.2007.00800.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]
  • 6.Rubin DB. Bayesian inference for causal effects. Annals of Statistics. 1978;6:34–58. [Google Scholar]
  • 7.Holland P. Statistics and causal inference. Journal of the American Statistical Association. 1986;81:945–961. [Google Scholar]
  • 8.Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58:21–29. doi: 10.1111/j.0006-341x.2002.00021.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Egleston BL, Scharfstein DO, Freeman ER, West SK. Causal inference for non-mortality outcomes in the presence of death. Biostatistics. 2007;8:526–545. doi: 10.1093/biostatistics/kxl027. [DOI] [PubMed] [Google Scholar]
  • 10.Lee K, Daniels M. Marginalized models for longitudinal ordinal data with application to quality of life studies. Statistics in Medicine. 2008;27:4359–4380. doi: 10.1002/sim.3352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lee K, Daniels MJ, Sargent DJ. Causal effects of treatments for informative missing data due to progression/death. Journal of the American Statistical Association. 2010;105:912–929. doi: 10.1198/jasa.2010.ap08739.. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cheng J. Estimation and inference for the causal effect of receiving treatment on a multinomial outcome. Biometrics. 2009;65:96–103. doi: 10.1111/j.1541-0420.2008.01020.x. [DOI] [PubMed] [Google Scholar]
  • 13.Williamson J, Kim K. A global odds ratio regression model for bivariate ordered categorical data from ophthalmologic studies. Statistics in Medicine. 1996;15:1507–1518. doi: 10.1002/(SICI)1097-0258(19960730)15:14<1507::AID-SIM316>3.0.CO;2-Z. [DOI] [PubMed] [Google Scholar]
  • 14.Harel O, Schafer JL. Partial and latent ignorability in missing-data problems. Biometrika. 2009;96:37–50. [Google Scholar]
  • 15.Goldberg RM, Sargent DJ, Morton RF, Fuchs CS, Ramanathan RK, Williamson SK, Findlay BP, Pitot HC, Albets SR. A Randomized controlled trial of fluorouracil plus leucovorin, irinotecan, and oxaliplatin combinations in patients with previously untreated metastatic colorectal cancer. Journal of Clinical Oncology. 2004;22:23–30. doi: 10.1200/JCO.2004.09.046. [DOI] [PubMed] [Google Scholar]
  • 16.Redeker NS, Lev EL, Ruggiero J. Insomnia, fatigue, anxiety, depression, and quality of life of cancer patients undergoing chemotherapy. Research and Theory for Nursing Practice. 2000;14:275–290. [PubMed] [Google Scholar]
  • 17.Williamson J, Kim K, Lipsitz SR. Analyzing bivariate ordinal data using a global odds ratio. Journal of the American Statistical Association. 1995;90:1432–1437. [Google Scholar]
  • 18.Liu I. Describing ordinal odds ratios for stratified t × c tables. Biometrical Journal. 2003;6:730–750. [Google Scholar]
  • 19.Heibron D. The analysis of ratios of odds ratios in stratified contingency tables. Biometrics. 1981;37:55–66. [Google Scholar]
  • 20.Breiman L, Friedman J, Olshen R, Stone C. Classification and Regression Trees. Belmont, CA: Wadsworth; 1984. [Google Scholar]
  • 21.Roy J, Hogan JW, Marcus BH. Principal stratification with predictors of compliance for randomized trials with 2 active treatments. Biostatistics. 2008;9:277–289. doi: 10.1093/biostatistics/kxm027. [DOI] [PubMed] [Google Scholar]
  • 22.Marshall AW, Olkin I. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika. 1997;84:641–652. [Google Scholar]
  • 23.Sankaran PG, Jayakumar K. On proportional odds models. Statistical Papers. 2008;49:779–789. [Google Scholar]
  • 24.Egleston BL, Scharfstein, MacKenzie E. On estimation of the survivor average causal effect in observational studies when important confounders are missing due to death. Biometrics. 2009;65:497–504. doi: 10.1111/j.1541-0420.2008.01111.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lee K, Daniels MJ, Joo Y. Flexible marginalized models for bivariate longitudinal ordinal data. Biostatistics. 2013 doi: 10.1093/biostatistics/kxs058. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Heagerty PJ. Marginally specified logistic-normal models for longitudinal binary data. Biometrics. 1999;55:688–698. doi: 10.1111/j.0006-341x.1999.00688.x. [DOI] [PubMed] [Google Scholar]
  • 27.Joe H. Generating random correlation matrices based on partial correlations. Journal of Multivariate Analysis. 2006;97:2177–2189. [Google Scholar]
  • 28.Daniels MJ, Pourahmadi M. Modeling covariance matrices via partial autocorrelations. Journal of Multivariate Analysis. 2009;100:2352–2363. doi: 10.1016/j.jmva.2009.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Computational details

RESOURCES