Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2013 Nov 13;69(4):850–860. doi: 10.1111/biom.12096

Bias and Sensitivity Analysis When Estimating Treatment Effects from the Cox Model with Omitted Covariates

Nan Xuan Lin 1,2, Stuart Logan 1, William Edward Henley 1,2,*
PMCID: PMC4230475  PMID: 24224574

Abstract

Summary

Omission of relevant covariates can lead to bias when estimating treatment or exposure effects from survival data in both randomized controlled trials and observational studies. This paper presents a general approach to assessing bias when covariates are omitted from the Cox model. The proposed method is applicable to both randomized and non-randomized studies. We distinguish between the effects of three possible sources of bias: omission of a balanced covariate, data censoring and unmeasured confounding. Asymptotic formulae for determining the bias are derived from the large sample properties of the maximum likelihood estimator. A simulation study is used to demonstrate the validity of the bias formulae and to characterize the influence of the different sources of bias. It is shown that the bias converges to fixed limits as the effect of the omitted covariate increases, irrespective of the degree of confounding. The bias formulae are used as the basis for developing a new method of sensitivity analysis to assess the impact of omitted covariates on estimates of treatment or exposure effects. In simulation studies, the proposed method gave unbiased treatment estimates and confidence intervals with good coverage when the true sensitivity parameters were known. We describe application of the method to a randomized controlled trial and a non-randomized study.

Keywords: Bias analysis, Cox model, Omitted covariates, Sensitivity analysis, Survival analysis, Unmeasured confounding

1. Introduction

Treatment or exposure effects are commonly estimated from survival or other time-to-event data using the Cox model. The gold standard design for conducting such evaluations is the randomized controlled trial because randomization acts to balance measured and unmeasured confounders. Although it is common for researchers to present unadjusted analyses, it is recommended to adjust proportional hazards models for all measured covariates in randomized studies to maximise power to detect treatment effects (Hernandez, Eijkemans, and Steyerberg, 2006). Gail, Wieand, and Piantadosi (1984) derived asymptotic formulae for the bias in estimates of treatment effects when balanced covariates are omitted from the Cox model. It was shown that when censoring is moderate, the Cox model yielded more biased estimates of treatment effect than analysis with the exponential model.

In practice, randomized experiments may be difficult to conduct for reasons of cost, logistics or ethics (Black, 1996). The increasing availability of electronic medical record databases and population-based studies is creating new opportunities for using observational data to assess the effect of medical treatments and exposures, Ghani et al., 2001;, 2009. A major challenge in using clinical databases in this way is addressing the potential bias introduced due to unmeasured differences between the treatment groups(Klungel et al., 2004). Lin, Psaty, and Kronmal (1998) presented approximate formulae for the bias due to omission of a binary or continuous confounder when estimating treatment effects from censored survival time data using the Cox model. The bias formulae were used as the basis for a method of conducting sensitivity analysis to assess how the point and interval estimates of the treatment effect vary under a range of assumptions about the unmeasured confounder. The idea behind this approach is that the plausibility of the estimated treatment effects will increase if the inferences are insensitive over a wide range of relevant scenarios.

In this paper, we develop a general framework for estimating bias and conducting sensitivity analysis when covariates are omitted from the Cox model. Formulating the problem more broadly than previous work, we consider the combined influence of three different sources of bias: (1) bias due to omitting a balanced covariate; (2) bias due to censoring; (3) bias due to the missing covariate being a confounder. The proposed approach is applicable to both randomized trials and observational studies, and provides explicit formulae for arbitrary distributions of measured and unmeasured confounders. We consider the general case in which the censoring distribution can depend on treatment or other covariates. The treatment variable can be either a binary or continuous exposure.

The paper is organized as follows. Asymptotic bias formulae, derived from the large sample properties of the partial maximum likelihood estimators, are presented in Section 2. Simulation studies conducted to investigate the accuracy of the bias formulae and to characterize the impact of the different sources of bias are presented in Section 3. Section 4 discusses how the bias formulae can be used to develop a new method of sensitivity analysis for treatment effects in proportional hazards models. The method is applied to data from a randomized controlled trial and a non-randomized study in Section 5.

2. Bias Formulae

We denote random variables by upper case letters and their values by lower case letters. Suppose Inline graphic are K measured covariates with joint distribution Inline graphic, and Inline graphic are q unmeasured covariates with conditional joint distribution Inline graphic. Let T and Inline graphic represent the true event/failure time and possible censoring time respectively. We assume failure and censoring times are independent conditional on x (i.e., Inline graphic). We observe Inline graphic, where Inline graphic, and Inline graphic if Inline graphic and 0 otherwise. The true hazard is assumed to be

graphic file with name biom0069-0850-m11.jpg 1

where Inline graphic is the baseline hazard function and Inline graphic and Inline graphic are coefficients for X and C, respectively. But since C is omitted, one is forced to fit the model

graphic file with name biom0069-0850-m15.jpg 2

where Inline graphic are the coefficients when C is missing. Let Inline graphic be Inline graphic independent replicates of Inline graphic. Then the average partial log-likelihood based on (2) is

graphic file with name biom0069-0850-m20.jpg 3

where Inline graphic if Inline graphic and 0 otherwise. It is shown in Web Appendix A that as Inline graphic, the score function Inline graphic has the limit

graphic file with name biom0069-0850-m25.jpg 4

for Inline graphic, where Inline graphic Inline graphic is the mean over the uncensored subjects, Inline graphic is under the density Inline graphic and Inline graphic is the survival function of censoring time conditional on X. Inclusion of Inline graphic allows the censoring distribution to depend on covariates.

The system of equations 1988, Inline graphic, relate Inline graphic and Inline graphic, and therefore the asymptotic biases Inline graphic can be evaluated from them. The first-order Taylor series approximation is

graphic file with name biom0069-0850-m36.jpg 5

2.1. The Distributions of Uncensored Subjects

Let

graphic file with name biom0069-0850-m37.jpg

be the uncensoring probability conditional on x and c, where Inline graphic is the density of model (1) and Inline graphic is the survival function of censoring time.

The density of the observed event times is then given by

graphic file with name biom0069-0850-m40.jpg

The mean of Inline graphic for uncensored subjects is

graphic file with name biom0069-0850-m42.jpg 6

2.2. Extension of the Results of Lin et al. (1998)

Lin et al. (1998) proposed bias formulae for survival analysis with unmeasured confounders based on the assumption of rare events (small Inline graphic) or small Inline graphic. For binary x, the proposed bias approximation is

graphic file with name biom0069-0850-m45.jpg 7

The simulation of Lin et al. (1998) showed that 2002 are good approximations when Inline graphic was generated from the uniform Inline graphic distribution and the censoring percentage is 90%.

Using the assumption of rare events and the simulation settings in Lin et al. (1998), Web Appendix B shows that Equation 1988 reduces to a simple equation of Inline graphic and Inline graphic:

graphic file with name biom0069-0850-m50.jpg 8

which leads to the formulae 2002 when Inline graphic and Inline graphic or Inline graphic. The Equation 2006 therefore provides a general extension of the results in Lin et al. (1998) to arbitrary distributions of X and C.

3. Bias Analysis

3.1. Bias Analysis for a Binary Treatment with a Single Omitted Covariate

We now show the asymptotic formula for the bias in the important special case of a single missing covariate C and a binary exposure variable X taking values 1 or 0 with probabilities p and Inline graphic, respectively.

The Equation 1988 leads to (see proof in Web Appendix C)

graphic file with name biom0069-0850-m55.jpg 9

with

graphic file with name biom0069-0850-m56.jpg

where the expectations Inline graphic and Inline graphic are under Inline graphic and

graphic file with name biom0069-0850-m60.jpg

respectively, and Inline graphic is the ratio of uncensoring rates between control and treatment groups.

From 2004, it can be seen that the relation between Inline graphic and Inline graphic mainly depends on three factors (corresponding to the three sources of bias): the effect of the missing covariate, Inline graphic; censoring mechanism, Inline graphic, Inline graphic and Inline graphic; and the ratio of conditional expectations, Inline graphic. The latter ratio represents how much the density of C varies between Inline graphic and Inline graphic and, hence, measures the extent to which C is a confounder.

The bias is also affected by the cumulative baseline hazard function Inline graphic. But if times are not censored, Inline graphic is an exponential variable with the rate Inline graphic and 2004 reduces to

graphic file with name biom0069-0850-m74.jpg 10

where Inline graphic. As a result, the bias is independent of the form of Inline graphic in the absence of censorship.

When Inline graphic, C is not a confounder. In this case, Equation 1991 shows that Inline graphic and, consequently, the MLE of the Cox model is still biased even if C is a balanced covariate. Bretagnolle and Huber-Carol (1985) studied the bias in this case and showed that the estimated effect is biased toward zero as Inline graphic increases. This is because the event times with Inline graphic tend to zero as Inline graphic and tend to Inline graphic as Inline graphic. Consequently the subjects with Inline graphic cannot provide information about Inline graphic in the limiting case. However, the subjects with Inline graphic do still supply information about Inline graphic and hence the limit of Inline graphic as Inline graphic is not zero for binary C. An illustration of this explanation is given in Web Figure 1.

Following 2008, the first-order Taylor series approximation is

graphic file with name biom0069-0850-m90.jpg 11

3.2. Accuracy of Asymptotic Formulae and Taylor Series Approximations

Figure 1 shows a comparison of the asymptotic and simulated biases and Taylor series approximation under the influence of different sources of bias. We generated 10,000 x from Inline graphic. The confounder C was generated from Inline graphic for the binary confounder, and from Inline graphic for the normal confounder. The event times t were generated from model (1) with Inline graphic, Inline graphic and Inline graphic taking 100 sequence values from Inline graphic to 10. For the censoring cases, we let Inline graphic with Inline graphic. The observed times were given by Inline graphic.

Figure 1.

Figure 1

Comparison of simulated biases, asymptotic biases and first-order Taylor series approximations for different types of omitted covariate and censorship. Since Inline graphic is the asymptotic value of the MLE Inline graphic and the sample size=10,000 is large, we calculated the simulated bias by Inline graphic. The asymptotic biases and Taylor series approximations were obtained from 2004 and 2001, respectively. Monte Carlo integration was used to approximate the expectations in formulae. (a) Binary confounder c: (Inline graphic), censored; (b) Normal confounder c: (Inline graphic), censored; (c) Binary confounder c: (Inline graphic), censored; (d) Normal confounder c: (Inline graphic), censored; (e) Binary balanced c: (Inline graphic), uncensored; (f) Normal balanced c: (Inline graphic), uncensored.

Figure 1 shows that the simulated and asymptotic biases are seen to agree closely, confirming that these asymptotic formulas adequately describe the biases. The accuracies of the Taylor series approximations decrease as Inline graphic gets large, because the approximation error is of the order Inline graphic.

For more modest values of Inline graphic, for example Inline graphic and Inline graphic, the biases will have similar patterns but be shifted up as Inline graphic (see Web Figures 2 and 3). In Web Figure 8, we let Inline graphic and Inline graphic to allow the distribution of censoring to depend on treatment group. The figure illustrates how different choices of censoring function can impact on the biases.

3.3. Bias of Omitting a Balanced Covariate in Randomized Studies

Figure 1e and f show the biases when a balanced covariate is omitted. It is clear that omission of a relevant covariate leads to biased treatment estimates for the Cox model, even in randomized studies.

The reason is that the parameters Inline graphic and Inline graphic are measuring different features of the population. When we model the hazard as

graphic file with name biom0069-0850-m120.jpg

the interpretation of Inline graphic is the hazard ratio between Inline graphic and Inline graphic while the values of c are fixed. But in randomized studies (where we assume Inline graphic), when we model the marginal hazard as

graphic file with name biom0069-0850-m125.jpg 12

the interpretation of Inline graphic is the hazard ratio between Inline graphic and Inline graphic while c is marginalized. Similarly, Inline graphic is the hazard when Inline graphic, and Inline graphic is the hazard when Inline graphic and c is marginalized. The superscript Inline graphic emphasizes that they do not have the same interpretation.

When c is integrated out, the marginal hazards 2008 for Inline graphic and Inline graphic are not proportional over time, and the MLE of Inline graphic represents an average over time of the log marginal hazard ratios between Inline graphic and Inline graphic (Lin and Wei, 1989). Therefore, it will lead to bias if we use a marginal hazard ratio Inline graphic to estimate a hazard ratio Inline graphic. In randomized studies, as outlined in Section 2001, usually Inline graphic and Inline graphic will attenuate to some limit between 0 and Inline graphic as Inline graphic.

3.4. The Limits of Biases as Inline graphic

One phenomenon that can be noticed from Figure 1 is that all biases increase with Inline graphic but always tend to some limits, no matter if C is a confounder or not. The reason is that the marginal hazard ratio has finite limits as Inline graphic tends to Inline graphic and Inline graphic. For example, for Inline graphic, the marginal hazard is

graphic file with name biom0069-0850-m151.jpg

The ratio, Inline graphic, tends to Inline graphic as Inline graphic and

graphic file with name biom0069-0850-m155.jpg

3.5. The Effect of Censoring

Figure 2a shows the effect of censoring on the bias of omitting a balanced covariate. The event times were generated from (1) with Inline graphic Inline graphic and Inline graphic. The possible censoring times Inline graphic were simulated from uniform Inline graphic with Inline graphic.

Figure 2.

Figure 2

The effect of overall censoring and confounding on bias: (a) biases of omitting a balanced covariate where Inline graphic data are censored; (b) biases under different strengths of confounding, Inline graphic and Inline graphic when Inline graphic data are censored.

Following the result A-4 in Web Appendix, the uncensoring probability can be written as

graphic file with name biom0069-0850-m166.jpg

where Inline graphic is the density of possible censoring times.

Under the simulation settings, Inline graphic and Inline graphic. The probability of censoring conditional on x is thus

graphic file with name biom0069-0850-m170.jpg 13

The values of Inline graphic and Inline graphic were then solved from such that the probabilities of censoring were the same for Inline graphic and Inline graphic and could be Inline graphic, Inline graphic, Inline graphic, Inline graphic, and Inline graphic. The number of event times n was fixed at 100,000 and the total sample size was Inline graphic.

Figure 2a shows that censoring influences the bias in two different ways. The bias increases as the censoring percentage increases from Inline graphic to Inline graphic, but decreases as the censoring percentage increases from Inline graphic to Inline graphic. The bias is plotted for a wider range of censoring percentages in Web Figure 4.

The reason for this inconsistent effect of censoring is as follows: when the censoring percentage increases (0–50%) and Inline graphic, the subjects with Inline graphic, which provide most of the information about Inline graphic, are likely to be censored, and consequently, the bias is increased. But as the censorship rate increases further (50–90%), almost all of the few events occur with Inline graphic and almost all the times with Inline graphic are censored. So nearly all the subjects supplying information about Inline graphic have the same value of Inline graphic (Chastang, Byar, and Piantadosi, 1988). If the sample size is sufficiently large, the bias will tend to zero as the censoring percentage tends to Inline graphic. A similar explanation applies for Inline graphic. An illustration of this explanation is given in Web Figure 5.

3.6. The Effect of Confounding

Of particular relevance to non-randomized studies, we considered the influence of different levels of confounding on the bias function when 50% of the data are censored (Fig. 2b). We generated Inline graphic and consider three scenarios with Inline graphic. The difference Inline graphic represents the imbalance of the distributions of Inline graphic between Inline graphic and Inline graphic and so measures the strength of confounding. As Inline graphic increases, it can be seen that the estimate is biased upwards for Inline graphic and downwards when Inline graphic. For the case Inline graphic, the bias would be affected in the other direction.

3.7. The Effect of Additional Measured Covariates

In practice, the analyst is likely to have access to additional measured covariates (possibly confounders) that would need to be adjusted for, in addition to the exposure variable X (and the unmeasured confounder, C).

Under the approach of Lin et al. (1998), an additional covariate Z does not affect the bias if the mean of C conditional on x and z is additive in x and z, that is Inline graphic (VanderWeele, 2008). However, our simulation results in Figure 3a–c show that an additional covariate may introduce a small degree of bias when Inline graphic is large. We generated 100,000 Inline graphic and Inline graphic. The additional covariate Z was simulated from Inline graphic, Inline graphic and Inline graphic for Figure 3a–c, respectively. Under these data-generating processes, Inline graphic and the additivity assumption is satisfied.

Figure 3.

Figure 3

The effect of additional measured covariates on the simulated bias Inline graphic: (a) Inline graphic; (b) Inline graphic; (c) Inline graphic; (d) the effect of increasing the number of measured covariates on the simulated bias when Inline graphic and Inline graphic and 3.

A sample of 100,000 survival times was generated from Inline graphic and Inline graphic. The data were then fitted by the reduced model Inline graphic. It can be seen that the bias is not impacted by the distribution of Z, but is affected by Inline graphic, when Inline graphic is large. The results were similar when we allowed censoring to depend on X and Z by assuming Inline graphic (see Web Figure 9).

It is then natural to investigate the influence of more than one additional covariate when Inline graphic is large. To simplify the problem, we examine the case where all the additional covariates are binary and independent of each other, with the same coefficient Inline graphic. As the bias is only significant for large negative Inline graphic, we set Inline graphic. The results displayed in Figure 3d, show the bias increases slightly with the number of covariates and the increments are not linear.

4. Sensitivity Analysis

The aim of our proposed method of sensitivity analysis is to assess how the point and interval estimators for Inline graphic or associated P-value would change given clinically plausible values of the sensitivity parameters Inline graphic and Inline graphic.

4.1. Point Estimate

For a sample with Inline graphic observed times Inline graphic, of which n are uncensored Inline graphic, from 1988 we have the relation between Inline graphic and Inline graphic approximately relies on the equations Inline graphic with

graphic file with name biom0069-0850-m237.jpg 14

where the expectation Inline graphic can be calculated analytically or approximately with respect to Inline graphic.

Write Inline graphic. Due to the functional invariance property of MLE, the point estimate of the true value Inline graphic is then Inline graphic. The function Inline graphic and its inverse Inline graphic relate Inline graphic and Inline graphic, and play a key role in sensitivity analysis.

The baseline survivor function Inline graphic in is estimated by solving

graphic file with name biom0069-0850-m248.jpg

where Inline graphic is the Breslow (1972) estimator:

graphic file with name biom0069-0850-m250.jpg

The survival function of censoring can be also approximated by the Breslow (1972) estimator by considering events as “censored” observations and censored observations as “events” (Satten and Datta, 2001).

4.2. P-Values

In many applications, we are interested in evaluating the evidence the data give about a null hypothesis Inline graphic (for example, that a hazard ratio equals one). Using Inline graphic, this null hypothesis is equivalent to Inline graphic and the two-sided P-value is therefore

graphic file with name biom0069-0850-m261.jpg 15

where Inline graphic is the cumulative distribution function of Inline graphic and Inline graphic is the standard error of Inline graphic.

4.3. Confidence Intervals

Since the distribution of Inline graphic might be slightly skewed (see example in Web Figure 6), the traditional way of using standard error to calculate confidence intervals (CI) could be misleading. An alternative way is to construct CI by the highest density interval. To do this, we generate B bootstrap samples Inline graphic from the multivariate normal distribution Inline graphic, where Inline graphic is the covariance matrix of Inline graphic. The sample of the kth parameter Inline graphic, Inline graphic is then obtained from Inline graphic for Inline graphic. The highest density interval of Inline graphic can be computed from the sample Inline graphic by using the emp.hpd function in the R package TeachingDemos.

Table 1.

Simulated bias of point estimates and coverage of 95% confidence intervals for the hazard ratio associated with treatment under two methods of sensitivity analysis, when censoring is moderate. The equations and were used to estimate Inline graphic and its confidence interval in the last two columns

unadjusted Lin et al. (1998) Inline graphic
Inline graphic Inline graphic Inline graphic Inline graphic Inline graphic Fraction Censored (%) Bias Coverage (%) Bias Coverage (%) Bias Coverage (%)
100 0.56 1 0.1 0.9 50 0.78 39 0.01 97 −0.04 95
0.57 0.3 0.7 51 0.35 83 −0.02 91 0.00 98
0.58 0.5 0.5 51 −0.08 90 −0.08 90 −0.03 99
0.35 2 0.1 0.9 50 1.38 3 −0.04 96 0.07 96
0.36 0.3 0.7 50 0.42 78 −0.21 87 −0.06 97
0.35 0.5 0.5 50 −0.24 82 −0.24 82 0.01 100
0.20 3 0.1 0.9 50 1.71 0 −0.12 91 0.15 91
0.22 0.3 0.7 49 0.36 84 −0.40 72 −0.11 99
0.21 0.5 0.5 50 −0.44 68 −0.44 68 0.00 100
500 0.57 1 0.1 0.9 50 0.76 0 −0.02 95 0.00 95
0.58 0.3 0.7 50 0.32 42 −0.06 90 −0.01 98
0.57 0.5 0.5 50 −0.10 90 −0.10 90 −0.01 99
0.34 2 0.1 0.9 50 1.27 0 −0.15 82 −0.04 92
0.34 0.3 0.7 50 0.43 11 −0.20 70 0.03 99
0.34 0.5 0.5 50 −0.30 44 −0.30 44 0.01 100
0.20 3 0.1 0.9 50 1.65 0 −0.18 81 0.04 89
0.20 0.3 0.7 50 0.38 21 −0.38 28 −0.02 100
0.21 0.5 0.5 50 −0.48 4 −0.48 4 0.01 100
1000 0.57 1 0.1 0.9 50 0.73 0 −0.04 93 −0.02 96
0.57 0.3 0.7 50 0.30 11 −0.07 89 0.01 99
0.58 0.5 0.5 50 −0.10 80 −0.10 80 −0.01 100
0.34 2 0.1 0.9 50 1.29 0 −0.13 80 0.00 94
0.34 0.3 0.7 50 0.40 1 −0.23 41 0.00 99
0.34 0.5 0.5 50 −0.30 7 −0.30 7 −0.01 100
0.20 3 0.1 0.9 50 1.65 0 −0.18 65 0.03 90
0.20 0.3 0.7 50 0.40 0 −0.36 6 0.00 99
0.21 0.5 0.5 50 −0.49 0 −0.49 0 −0.01 100

However, the bootstrap method may become computationally inefficient, when the dimension of Inline graphic is high (e.g., Inline graphic). We thus give an approximation by using the confidence bounds of Inline graphic. Suppose we are interested in the parameter Inline graphic and its confidence interval Inline graphic. As shown in Section 1989, the effect of additional measured covariates is negligible. It means that the solution of Inline graphic would not change appreciably if we ignore all the covariates except Inline graphic in .

In addition, Inline graphic is usually a monotonically increasing function of Inline graphic in practice. Let Inline graphic be the confidence interval of Inline graphic. The lower bound Inline graphic then can be estimated from the equation

graphic file with name biom0069-0850-m289.jpg 16

Similarly, Inline graphic can be obtained from the above equation by substituting Inline graphic by Inline graphic. Our simulation shows that this approximation is sufficiently accurate and very efficient.

4.4. Simulation Study

Lin et al. (1998) proposed a simple method for sensitivity analysis. Here we conducted simulation studies to compare their method with our approach.

Table 1996 shows the biases of point estimators and coverage of Inline graphic CIs in 1,000 simulation replications, when given the true Inline graphic and Inline graphic. To compare with the method of Lin et al. (1998), we used similar simulation settings to theirs: Inline graphic, Inline graphic, Inline graphic, Inline graphic and Inline graphic was solved from so as to ensure moderate levels of censorship (fraction censored was about 50%). It is clear that our proposed method gives almost unbiased point estimates and good coverage of confidence intervals. The method of Lin et al. (1998) gets worse as Inline graphic increases, because it only addresses the bias attributable to confounding. The results for light (Inline graphic) and heavy (Inline graphic) censorships are presented in Web Tables 1 and 2, respectively. We note that both methods of sensitivity analysis gave biased treatment estimates when censoring was heavy and the sample size was small (Inline graphic). However, since the accuracy of approximation increases with the number of observed events, the proposed method is asymptotically unbiased irrespective of the censoring rate. The minimum sample size at which the method achieves approximately unbiased estimates increases with the censoring rate, and for a censoring rate as high as 90% is about Inline graphic.

5. Real Examples for Sensitivity Analysis

5.1. Vitamin and Minerals Trial

Ellis et al. (2008) conducted a randomized controlled trial assessing the effect of antioxidant and folinic acid supplementation on developmental outcomes for children with Down syndrome. Comparing infants allocated to folinic acid (Inline graphic) with those who were not (Inline graphic), the estimated hazard ratio for age of sitting was 1.25 (95Inline graphic confidence interval 0.88–1.78). These results did not change appreciably after adjustment for area of residence, maternal ethnicity, birth weight, and social class.

We now assess the impact on the treatment estimates for age at sitting of assuming a binary confounder, c, has been omitted from the model, where Inline graphic. As this is a randomized controlled trial and any random imbalance in the prevalence of the unmeasured confounder between treatment groups is likely to be small, we restrict Inline graphic. Assuming the true prevalence of the omitted covariate for treatment groups combined is 0.5, the probability of a confounding effect Inline graphic of more than 0.2 by chance is 0.02 for the trial sample of size Inline graphic.

Figure 4a shows the sensitivity of the lower limit of the confidence interval for the hazard ratio of folinic acid to adjustment for an unmeasured binary covariate of specified properties, where we set Inline graphic. For Inline graphic, the difference in probabilities Inline graphic must be Inline graphic for the treatment effect to become significant. The same conclusion can be obtained from the contour plot in Web Figure 3 which shows results of a similar sensitivity analysis for the P-value of the treatment estimate. The results for antioxidant supplementation in Web Table 3 show that the treatment effect is significant only when Inline graphic and Inline graphic. Given the nature of the study design, the conditions required for the treatment effects to be significant are implausible, suggesting that the original findings of non-significance are robust to the presence of realistic levels of unmeasured confounding.

Figure 4.

Figure 4

Contour plots of sensitivity analysis results: (a) the lower bounds of the 95% confidence intervals (use ) for the hazard ratio of folinic acid on age of sitting for children with Down syndrome; (b)the P-values (use (15)) for the two-sided test that the log-hazard ratio of deprivation score Inline graphic.

A simulation study was conducted with similar sample size and censoring rates to the vitamin and mineral trial, providing support for the validity of the treatment estimates presented in the sensitivity analysis (see Web Table 4). However, we note that in this illustrative application, the width of the confidence intervals suggests the sensitivity analysis, in common with the original analysis, lacks power to establish statistical significance for small studies.

5.2. Leukaemia and Deprivation Study (Non-Randomized)

Henderson, Shimakura, and Gorst (2002) analyzed the effect of a social deprivation score X (where lower values indicate less affluent areas) on the time in years since diagnosis with acute myeloid leukemia to death (Inline graphic). The estimated hazard ratio for a 1 point increase in x was 1.03 (P-value = 0.0012) after adjustment for age, gender and white blood cell count, indicating that prognosis is less good if the patient lives in a more deprived residential location.

We now consider a potential unmeasured binary confounder C, which affects both survival time T and the deprivation score X. We generated c from Inline graphic, where Inline graphic were solved from

graphic file with name biom0069-0850-m323.jpg

such that the marginal distribution is Inline graphic and the desired Inline graphic is obtained.

Figure 4b shows the sensitivity of P-value for different choice of Inline graphic and Inline graphic. It shows that even if the correlation is strong, that is Inline graphic, the hazard ratio of the confounder needs to be Inline graphic for the hazard ratio of x to become non-significant at the Inline graphic level. It seems unlikely that such an important covariate would be missed, suggesting that the original finding of a significant effect of deprivation score is robust to the presence of realistic levels of unmeasured confounding.

A simulation study was conducted with the same sample size (Inline graphic), covariate X and censoring rate (Inline graphic) as this non-randomized study. To extend the range of scenarios considered, survival times were simulated assuming the true value of Inline graphic was 0 (i.e., assuming the continuous exposure has no effect on survival). Here the emphasis was on comparing the extent to which the sensitivity analysis methods avoid false rejection of the null hypothesis Inline graphic. The results are summarized in Web Table 5 and provide further support for the validity of the proposed formulae when applied to data from non-randomized designs.

6. Discussion

We explored a general framework for assessing bias in treatment estimates from the Cox model with omitted covariates. Bias formulae based on asymptotic properties of the likelihood estimator were presented and validated in simulation experiments. The results showed that the confounding biases for censored survival data are typically complicated. However, the proposed approach made it possible to describe the influence of three different sources of bias: omission of a balanced covariate, data censoring and unmeasured confounding. Figure 5 characterises the sources of bias:

  1. In thei absence of a missing covariate, the bias curve remains at zero (the solid line); when a balanced covariate is omitted, the effect is underestimated to a limit as Inline graphic increases (the dashed line).

  2. When the data are censored, the bias is maximized at Inline graphic censoring but decreases with heavy censorship.

  3. When the missing covariate is a confounder, the shape of bias changes. If the association between x and c is positive, the limits increase on the right side but decrease on the left side, and hence the slope of bias increases. Conversely, if the association between x and c is negative, the limits decrease on the right side but increase on the left side.

Figure 5.

Figure 5

An illustration of the influence of the different sources of bias when estimating binary treatment effects from the Cox proportional hazards model with an omitted binary covariate. (a) solid: no missing data, no bias; dashed: bias due to omitting a balanced covariate. (b) solid: bias due to omitting a balanced covariate; dashed: bias due to omitting a balanced covariate and censoring. (c) solid: bias due to omitting a balanced covariate and censoring; dashed: bias due to omitting a confounder and censoring.

Although the bias formula is applicable under a range of assumptions, this paper has focused on considering the simple case of a binary exposure and a single unmeasured confounder. Further simulation work showed that the bias increased slightly in the presence of one or more measured confounders for large values of Inline graphic. The extension to multiple unmeasured confounders is straightforward. If there are several missing covariates Inline graphic with coefficients Inline graphic, then we can interpret c as the composite score, Inline graphic, with Inline graphic (Lin et al., 1998). Lin et al. (1998) also argue that the choice of a single unmeasured confounder is a less severe restriction when all the known confounders are adjusted for in the survival model.

The bias formula was used as the basis for proposing a new method to assess the sensitivity of estimates of treatment effects to omission of relevant covariates. Simulation experiments were conducted to compare the method with the approach of Lin et al. (1998), a special case of the proposed method when the rate of censoring is high. The method of Lin et al. (1998) has the benefit of ease of implementation, being based on a simple adjustment formula, but its relative performance deteriorates as the magnitude of Inline graphic increases. In contrast, the simulations indicate that the proposed method can provide sufficiently unbiased treatment estimates, and associated confidence intervals with good coverage, over a wide range of scenarios, when the true sensitivity parameters Inline graphic and Inline graphic are known.

Sensitivity analysis is a flexible approach to addressing omission of covariates that makes it possible to assess the impact of ’clinically plausible’ levels of unmeasured confounding and other sources of bias on the treatment estimates (Groenwold, 2010). However, it does not provide a single precise estimate of treatment effectiveness nor does it help identify the nature of any bias from omitting covariates. A number of alternative strategies for tackling unmeasured confounding have been proposed that do attempt to provide explicit estimates of causal effects. An overview of these different methods was given in Aleyamehu et al (1996), including instrumental variables and the prior event rate ratio method (Tannen et al., 2009).

The method of sensitivity analysis proposed in this paper could be extended in a number of ways. First, incorporating adjustment for the propensity score into the sensitivity analysis would provide an efficient way of controlling for the effect of measured covariates (Rosenbaum, 1991). Other possible developments include consideration of specific distributional forms (both univariate and multivariate) for the unmeasured confounder(s) to provide special cases of the generic bias formulae for a wider range of common confounding models.

Omission of relevant covariates is a common source of bias when estimating treatment or exposure effects from survival data. Although we cannot directly adjust for unmeasured covariates, their potential impact can be assessed by means of sensitivity analyses. Indeed, Groenwold et al. (2010) argue that all analyses of causal associations in observational data should include an assessment of robustness to unmeasured confounding. The current study provides new tools for conducting sensitivity analysis for survival outcomes, with applicability to both randomized controlled trials and observational studies. Implementation of the methods requires numerical evaluation of the appropriate bias formulae. This can be achieved using Monte Carlo methods and illustrative R code is available on request from the authors.

7. Supplementary Materials

Web appendices, tables and figures referenced in Sections 2, 2.2, 3.1, 3.2, 3.5, 4.3, 4.4, 5.1 and 5.2 are available with this paper at the Biometrics website on Wiley Online Library.

Acknowledgments

We thank Prof. Robin Henderson for providing the leukaemia and deprivation data. We are grateful for the helpful comments of the editor, associate editor and two referees. This research was funded by the Medical Research Council [grant number G0902158]. William Henley and Stuart Logan were supported by the National Institute for Health Research (NIHR) Collaboration for Leadership in Applied Health Research and Care (CLAHRC) for the South West Peninsula. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.

Supporting Information

All Supplemental Data may be found in the online version of this article.

Supporting Information.

biom0069-0850-sd1.pdf (369.3KB, pdf)

References

  1. Black N. Why we need observational studies to evaluate the effectiveness of health care. British Medical Journal. 1996;312:1215–1218. doi: 10.1136/bmj.312.7040.1215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Breslow NE. Discussion of the paper by D. R. Cox. J R Statist, Soc B. 1972;34:216–217. [Google Scholar]
  3. Bretagnolle J, Huber-Carol C. Sous-estimation des contrastes due a l'oubli de variables pertinentes dans le modele de Cox pour des durees de survie avec censure. C.R.A.S. 1985;300:359–363. [Google Scholar]
  4. Chastang C, Byar D. Piantadosi S. A quantitative study of the bias in estimating the treatment effect caused by omitting a balanced covariate in survival models. Statistics in Medicine. 1988;7:1243–1255. doi: 10.1002/sim.4780071205. [DOI] [PubMed] [Google Scholar]
  5. Ellis JM, Tan HK, Gilbert RE, Muller DP, Henley WE, Moy R, Pumphrey R, Ani C, Davies S, Edwards V, Green H, Salt A, Logan S. Supplementation with antioxidants and folinic acid for children with Down syndrome: Randomised controlled trial. British Medical Journal. 2008;336:594–597. doi: 10.1136/bmj.39465.544028.AE. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Gail M, Wieand S, Piantadosi S. Biased estimates of treatment effect in randomized experiments with nonlinear regressions and omitted covariates. Biometrika. 1984;71:431–444. [Google Scholar]
  7. Ghani AC, Henley WE, Donnelly CA, Mayer S. Anderson RM. Comparison of the effectiveness of non-nucleoside reverse transcriptase inhibitor-containing and protease inhibitor-containing regimens using observational databases. AIDS. 2001;15:1133–1142. doi: 10.1097/00002030-200106150-00008. [DOI] [PubMed] [Google Scholar]
  8. Groenwold RH, Nelson DB, Nichol KL, Hoes AW. Hak E. Sensitivity analyses to estimate the potential impact of unmeasured confounding in causal research. International Journal of Epidemiology. 2010;39:107–117. doi: 10.1093/ije/dyp332. [DOI] [PubMed] [Google Scholar]
  9. Henderson R, Shimakura S. Gorst D. Modelling spatial variation in leukaemia survival data. Journal of the American Statistical Association. 2002;97:965–972. [Google Scholar]
  10. Hernandez AV, Eijkemans MJ, Steyerberg EW. Randomized controlled trials with time-to-event outcomes: How much does prespecified covariate adjustment increase power. Annals of Epidemiology. 2006;16:41–48. doi: 10.1016/j.annepidem.2005.09.007. [DOI] [PubMed] [Google Scholar]
  11. Klungel OH, Martens EP, Psaty BM, Grobbee DE, Sullivan SD, Stricker BH, Leufkens HG. de Boer A. Methods to assess intended effects of drug treatment in observational studies are reviewed. Journal of Clinical Epidemiology. 2004;57:1223–1231. doi: 10.1016/j.jclinepi.2004.03.011. [DOI] [PubMed] [Google Scholar]
  12. Lin DY, Psaty BM. Kronmal RA. Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Biometrics. 1998;54:948–963. [PubMed] [Google Scholar]
  13. Lin DY, Wei LJ. The robust inference for the Cox proportional hazards model. Journal of the American Statistics. 1989;84:207–224. [Google Scholar]
  14. Rosenbaum PR. Discussing Hidden bias in observational studies. Annals of Internal Medicine. 1991;115:901–905. doi: 10.7326/0003-4819-115-11-901. [DOI] [PubMed] [Google Scholar]
  15. Satten GA, Datta S. The Kaplan–Meier Estimator as an inverse-probability-of-censoring weighted average. American Statistician. 2001;55:207–210. doi: 10.1198/000313001317098185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Tannen RL, Weiner MG, Xie D. Use of primary care electronic medical record database in drug efficacy research on cardiovascular outcomes: comparison of database and randomised controlled trial findings. British Medical Journal. 2009 doi: 10.1136/bmj.b81. 338, b81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. VanderWeele TJ. Sensitivity analysis: distributional assumptions and confounding assumptions. Biometrics. 2008;64:645–649. doi: 10.1111/j.1541-0420.2008.01024.x. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information.

biom0069-0850-sd1.pdf (369.3KB, pdf)

Articles from Biometrics are provided here courtesy of Wiley

RESOURCES