Summary
The timing of antiretroviral therapy (ART) initiation for HIV and tuberculosis (TB) co-infected patients needs to be considered carefully. CD4 cell count can be used to guide decision making about when to initiate ART. Evidence from recent randomized trials and observational studies generally supports early initiation but does not provide information about effects of initiation time on a continuous scale. In this paper, we develop and apply a highly flexible structural proportional hazards model for characterizing the effect of treatment initiation time on a survival distribution. The model can be fitted using a weighted partial likelihood score function. Construction of both the score function and the weights must accommodate censoring of the treatment initiation time, the outcome, or both. The methods are applied to data on 4903 individuals with HIV/TB co-infection, derived from electronic health records in a large HIV care program in Kenya. We use a model formulation that flexibly captures the joint effects of ART initiation time and ART duration using natural cubic splines. The model is used to generate survival curves corresponding to specific treatment initiation times; and to identify optimal times for ART initiation for subgroups defined by CD4 count at time of TB diagnosis. Our findings potentially provide ‘higher resolution’ information about the relationship between ART timing and mortality, and about the differential effect of ART timing within CD4 subgroups.
Keywords: Antiretroviral therapy (ART), Electronic health records, Informative censoring, Inverse weighting, Marginal structural model, Time-varying confounders
1. Introduction
1.1 Overview and Objectives
Co-infection with HIV and tuberculosis (TB) is a significant public health issue in the developing world. Investigation of the most effective strategies for integration of antiretroviral therapy (ART) for HIV with standard treatment for tuberculosis (TB) has been a topic of intensive research, including at least three major randomized controlled trials (RCT) (Havlir et al., 2011; Abdool Karim et al., 2011; Blanc et al., 2011). Current World Health Organization (WHO) guidelines recommend immediate initiation of TB therapy, with concomitant or closely sequenced ART for HIV, depending on presentation at diagnosis (WHO, 2013).
Initiating ART too early increases the potential for drug toxicity and the risk of TB-associated immune reconstitution inflammatory syndrome (IRIS), a potentially fatal condition (Havlir et al., 2011). On the other hand, late initiation of ART increases risk for morbidity and mortality associated with AIDS (Abdool Karim et al., 2010). Although optimal timing of ART initiation during the TB treatment period is difficult to determine (Abdool Karim et al., 2011), evidence from randomized trials strongly supports earlier initiation for those who have advanced HIV disease at the time of TB diagnosis, with more equivocal conclusions for those at less advanced stages (Havlir et al., 2011; Abdool Karim et al., 2011; Blanc et al., 2011).
Although randomized trials form a strong evidence base, limitations remain. First, the randomized trials compare protocols defined in terms of time intervals for ART initiation, rather than actual timing. Second, the sample sizes are relatively small, with participants distributed over multiple sites, in some cases in different countries within the same trial. Third, randomized trials are by necessity conducted under tightly controlled conditions, and may not fully reflect the experience of patients in routine care settings. Data derived from electronic health records (EHR) provide an opportunity to augment the evidence base using larger-scale data collected in a representative care setting. In particular, EHR enables investigation of the effect of ART timing at higher resolution because actual initiation times are available and subgroups of interest have larger sample sizes. However, any analysis of EHR and other types of observational data must address issues related to nonrandom treatment allocation, irregular sampling, and complex missing data patterns.
In this paper we develop a modeling framework for drawing causal inferences about the timing of treatment initiation when the outcome is an event time; in this case, time to death. We apply the model to data drawn from the medical records of individuals enrolled in the Academic Model Providing Access to Healthcare (AMPATH), a large-scale HIV care program in western Kenya (Einterz et al., 2007). The data have rich information, but the observational nature of the data poses several complications for our analysis. First, treatment initiation time is not randomly allocated. Second, many individuals have incomplete information on exposure, outcome, or both. For example, some patients died before initiating ART, which censors ART initiation time; for others, incomplete follow up leads to censoring of death time, ART initiation time, or both. Figure 1 shows the four observed-data patterns of ART initiation and mortality following initiation of TB treatment. Third, the functional form of the causal effect of initiation time on mortality rate is not known.
To address these issues, we formulate a structural causal hazard model that captures the effects of both timing and duration of ART on hazard of mortality using unspecified smooth functions, and we derive methods for generating consistent estimates of model parameters under nonrandom treatment allocation and complex censoring patterns. Using output from the fitted model, we generate estimates of the functional causal relationship between ART timing and mortality. Our method can be used to estimate the survival distribution — and therefore any functional of the survival distribution — associated with any specific ART initiation time. Our analyses of the AMPATH data uses one-year survival as the primary endpoint because it is consistent with one of the major randomized trials (Havlir et al., 2011), but any functional of the survival distribution could be used.
This paper is organized as follows: the remainder of Section 1 provides additional background on issues related to HIV/TB co-infection and reviews statistical approaches related to estimating causal effect of treatment timing. Section 2 describes notation, defines optimal initiation time, and provides detailed specification of our marginal structural proportional hazards model. Section 3 develops the methods for fitting the model to observational data where both treatment timing and death time are subject to censoring. Section 4 presents results from applying our analysis of AMPATH data, with emphasis on new insights that can be gained, relative to findings from randomized trials. Section 5 summarizes a simulation study that examines properties of model-based inferences. In Section 6 we put the results in context by comparing them with RCT findings and current treatment guidelines, and outline directions for further research.
1.2 Treatment of Individuals Co-infected with HIV and Tuberculosis
Generally, treatment of TB itself is a 6-month regimen. The World Health Organization (WHO) recommends that TB treatment should be initiated first, with ART initiated within the first 8 weeks for all patients and within the first 2 weeks for those with CD4 less than 50. However, complications associated with early initiation of ART may discourage treatment adherence and cause adverse effects resulting from drug interactions and IRIS (WHO, 2013). Recent RCTs provide an evidence base for these recommendations. For brevity, ART initiation time is hereafter defined relative to the start of TB treatment, and measured in weeks. The AACTG study A5221 found reduced 48-week mortality in the CD4<50 subgroup, comparing the early (0 – 2) to the late (8 – 12) initiation arm (Havlir et al., 2011). The SAPIT trial recorded significantly higher mortality rate for the initiating interval 24 – 28 compared to 0 – 4 or 8 – 12 (Abdool Karim et al., 2011). In the CAMELIA trial, significantly reduced hazard of death was found in the early initiation arm (2 versus 8) (Blanc et al., 2011).
The question of ART timing has also been examined in at least three observational studies. An analysis of 322 co-infected patients from Spain showed longer survival for those starting ART in the first 2 months of TB treatment, compared to 3 months (Velasco et al., 2009). A prospective cohort study of 667 individuals in Thailand demonstrated an increased risk of death with longer delays in ART initiation (Varma et al., 2009). An analysis of 308 co-infected adults in Rwanda concluded that early ART initiation reduced mortality in persons with CD4 count below 100 (Franke et al., 2011).
1.3 Data Source: AMPATH Medical Record System (AMRS)
AMPATH provides care to over 160,000 HIV-positive individuals at 143 sites in western Kenya. The AMPATH Medical Record System (AMRS), one of the largest of its kind in sub-Saharan Africa, contains more than 100 million clinical observations from over 300,000 enrolled patients (Rachlis et al., 2015). Our analysis makes use of clinical encounter data from 4903 adults with HIV/TB co-infection drawn from the AMRS between March 1, 2004 and April 18, 2008. The dataset contains individual-level information at baseline on the following variables: age, gender, WHO stage (a 4-level ordinal diagnostic indicator of HIV severity), weight, CD4 count, clinic visit (urban versus rural), marital status and post-primary education. The dataset includes also longitudinal information on ART initiation status, death, and CD4 count. Among all patients presenting with HIV/TB co-infection, 26% have incomplete information on ART initiation time (case III and IV in Figure 1).
These data were generated before the findings from the studies referenced above were released; this, combined with the exercise of clinical judgment in making treatment decisions, yields significant variation in ART initiation time, even among those whose clinical profiles are similar in terms of recorded information.
1.4 Statistical Methods for Treatment Duration and Timing
Our methods address inference about causal effect of treatment initiation time on a survival distribution. The related problem of causal effect of treatment duration (assuming initiation time is the same for everyone) has been investigated in two papers by Johnson and Tsiatis (2004, 2005); the first considers the case where duration is discrete and the second extends to the more complex case of continuous time. In both cases, duration time is subject to censoring that may be related to measured covariates, and inverse probability weighting is used to address nonrandom treatment allocation. Both papers restrict attention to the mean or regression function related to a single endpoint; in our setting, we use survival time as the endpoint, which introduces the need to deal with censoring of both the exposure and the endpoint. Our approach to inference builds on Johnson and Tsiatis (2004, 2005), particularly as it relates to the use of a Radon-Nikodym derivative for deriving appropriate weight functions (see also Murphy et al., 2001). Work by Xiao et al. (2014) uses regression splines to model the weighted sum of treatment duration and estimate the effect of cumulative exposure to ART. Our model includes a term for treatment duration as a function of time, separately captures effect of timing, and allows the effect of duration to depend on timing (provided sufficient data are available).
The analysis by Franke et al. (2011), referenced above, uses a model that has a similar formulation to ours. They use pooled logistic regression with inverse probability weights to fit a marginal structural model to a sample from 308 individuals. Our paper formalizes that approach for continuous-time settings and considerably relaxes distributional assumptions about the underlying potential outcomes distributions.
The fitted model can be used to estimate the entire distribution of potential outcomes corresponding to a given initiation time. The estimated causal relationships are highly data-driven in the sense that key parts of the structural model are left unspecified, including the baseline hazard function, the instantaneous effect of treatment as a function of initiation time, and the effect of treatment duration once initiated. The added versatility of our model is motivated by the desire to take full advantage of the information available in our much larger cohort (which, to our knowledge, is larger than any cohort associated with a published analysis on HIV/TB co-infection).
2. Formulation of Potential Outcomes Model
2.1 Notation for Potential Outcomes and Observed Data
Let t denote time elapsed from the initiation of TB therapy, and let tmax denote maximum follow-up time. Let Ta > 0 denote the potential outcome corresponding to time of death under the scenario that ART is initiated at time a, where a ≥ 0 is continuous. We use T∞ to denote death time corresponding to any a > tmax.
Information on potential outcomes is observed according to the cases depicted in Figure 1. Let A be the random variable representing ART initiation time and let T = Ta denote the survival time corresponding to initiating ART at A. Let C denote a censoring time. As shown in Figure 1, either or both of A and Ta may be right censored, for example due to loss to follow up. The observed data available for drawing inference about the distribution of potential outcomes are as follows: the observed follow up time for the mortality outcome is T* = min(T, C), with event indicator ΔT = I(T < C). Similarly, the observed follow up time for ART initiation time is A* = min(A, T*) = min(A, T, C), with event indicator ΔA = I(A < T*). Notice that A can be right censored by T, as shown in Case II of Figure 1. Because large ART initiation times are infrequently observed in our data, we administratively censor A and T* at tmax for cases where A ≥ tmax and T* ≥ tmax, respectively.
Each individual has p covariates, some of which may be time varying. We use L(t), t ≥ 0 to denote the p × 1 vector containing the most recently observed value of each covariate at time t, with L(t) = {L(s) : 0 ≤ s < t} representing the observed covariate history up to but not including t. For each individual, therefore, we observe a copy of {L̄(A*), T*, ΔT, A*, ΔA}. The letter A was originally used for AZT (zidovudine) and L for lymphocyte cell count in earlier causal inference literature (Miguel Hernan, personal communication).
2.2 Marginal Structural Model for Mortality
We assume Ta follows a marginal structural proportional hazards model of the form
(1) |
where λa(t) is the hazard function for Ta, λ∞(t) is the reference hazard for T∞, and r(t, a) > 0 is the (time-dependent) hazard ratio function. We parameterize r(t, a) in terms of three functions, g1(a), g2(t – a) and g3(a(t – a)), respectively denoting effects of timing, duration, and timing-duration interaction. Specifically, we assume
(2) |
where g1(·), g2(·) and g3(·) are unspecified smooth functions that are twice continuously differentiable, with the second derivative set to zero at the interval boundaries for a, t – a and a(t – a). The hazard function λa(t) can depend on baseline covariates L(0) by elaborating model (2) or by using a stratified version of λ∞(t), as demonstrated in Section 4.
The model can be understood by considering some simple cases. Figure 2 illustrates three simplified versions of the hazard model and the corresponding survival models. For clarity, models in Figure 2 assume λ∞(t) is constant (our model allows it to remain unspecified) and assumes g3(a(t – a)) = 0 (no interaction between a and t – a). In Figure 2(a), we set g1(a) = β1 and g2(t – a) = 0, which implies the log hazard of mortality is log{λ∞(t)} prior to treatment initiation (t < a) and β1 + log{λ∞(t)} thereafter (t ≥ a). This version assumes the instantaneous effect of initiating ART on hazard of mortality is not a function of initiation timing. In Figure 2(b), we allow g1(a) to vary with a, so that the effect of ART initiation does depend on its timing. In Figure 2(c), we additionally set g2(t – a) = (t – a)β2, where β2 is a scalar, which implies that the instantaneous effect of initiating ART is g1(a), and that the effect of being on ART at any given time t depends on duration.
Turning to the more general formulation, we parameterize g1(·), g2(·) and g3(·) using natural cubic splines constructed from piecewise third-order polynomials that pass through a set of control points, or knots. A natural cubic spline has continuous first and second derivatives at the knots, and is linear beyond the boundary knots. Basis functions generated under these constraints are called B-spline functions (Hastie et al., 2009). Parameterizing (2) in terms of B-splines yields
(3) |
where b1 (·) = (b11 (·), …, b1K1 (·))⊤ is a K1 × 1 vector representing a B-spline basis function of degree K1 in a, and b2(·) = (b21 (·), …, b2K2 (·))⊤ and b3(·) = (b31 (·), …, b3K3 (·))⊤, having dimension K2 × 1 and K3 × 1, respectively, are defined similarly as functions of t – a and a(t – a). The parameters β1, β2 and β3 are, respectively, K1 × 1, K2 × 1 and K3 × 1 vectors of coefficients for the basis functions b1(·), b2(·) and b3(·). The model for the risk function can be written in compact form as r(t, a) = exp{X⊤(a, t)β}, where
(4) |
and . The causal effect of ART initiation time on the survival distribution is therefore encoded in β, and the formulation in terms of B-spline basis functions allows the use of weighted proportional hazards regression methods to obtain consistent estimates of β.
3. Estimation of Structural Model and Optimal Initiation Time
3.1 Overview
In our data, the decision to initiate ART depends on information available to the physician, and is therefore not randomly allocated. Moreover, A will be right censored when an individual dies or leaves the study before treatment initiation. We propose to estimate β by maximizing an inverse-probability-weighted partial likelihood score function. We show how to construct and estimate weights that lead to consistent estimates of β under specific assumptions about treatment allocation and censoring, and in Web Appendix Section 1, we provide a heuristic justification based on the use of Radon-Nikodym derivatives (Murphy et al., 2001).
In deriving the weighted estimating equations for β, we first consider the hypothetical case where A is not randomized and where T is always observed. Next, we move to the case where A may be censored by T (i.e., where death occurs prior to treatment initiation), but there is no censoring by C. We then describe the case where A is not randomized, and possibly censored by T. Finally, we generalize our approach to allow for censoring by C. We then show how to construct causal contrasts between different treatment initiation times using output from the fitted model.
3.2 Randomized Treatment Assignment
If treatment is randomly allocated according to a known probability density fA(·), we can use the standard partial likelihood score equations to derive consistent estimators of β. For this simple case we assume A is observed for everyone and there is no censoring by C. For t > 0, let NT(t) = I {T ≤ t, ΔT = 1} denote the zero-one counting process associated with T. Let Y(t) = 1 if an individual is still at risk for death and under observation at t, and Y(t) = 0 otherwise. The partial likelihood score equations can be written
where
(5) |
where X(Ai, t) is the design matrix based on (4) and
Let ER(·) denote expectation under randomized treatment assignment. Under randomization of A, n−1 Σi D(Ai, Ti; β) is an unbiased estimator of ER {D(A, T; β)}. The estimating function Σi Di(β) is a stochastic integral of a predictable process with respect to a martingale, and as such ER {D(A, T; β0)} = 0 at the true value β0 of β; hence, the root β̂ of the estimating equation is a consistent estimator of β (Fleming and Harrington, 2005, pp. 297-298).
Now consider the case where A is still randomly allocated, but death may occur before initiation of treatment, so that A* = min(A, T) and ΔA = I(A < T). We continue to assume no censoring by C. Following Johnson and Tsiatis (2005), the mean of an individual score contribution can be represented as the sum of contributions from those with observed and censored values of A; i.e., . Contributions corresponding to censored values of A have expectation
(6) |
To evalute (6), note that , and recall that , so that X(a, t) = 0 for a ≥ t. This implies D(a, Ti, β) is a constant function of a, with D(a, Ti, β) = −X̄(Ti, β) for and 0 otherwise. The integral in (6) therefore reduces to −X̄(Ti, β){1 – FA(A*)}, and the expectation itself reduces to .
Pulling this all together, we see that when death may occur before treatment initiation, and when there is no censoring by C, n−1 Σi Di(β) is an unbiased estimator of Er {ΔAD(A, T; β) – (1 – ΔA)X̄(T, β) }, which implies that the solution β̂ to the modified estimating equations
(7) |
will yield a consistent estimator of β.
3.3 Non-random Allocation of Treatment
Suppose now that A is not randomly allocated, but that treatment allocation can be considered ignorable in the sense that
(8) |
where 𝒯{a≥t} = {Ta : a ≥ t} is the set of potential failure times associated with initiation times beyond t. This assumption states that initiation of treatment at time t is sequentially randomized in the sense that it is independent of future potential outcomes, conditionally on observed covariate history L̄(t) (Robins, 1999).
Let ℙr(·) denote the data distribution under randomized treatment, and let ℙO(·) denote the same under non-random allocation of treatment. Recall that the observable data for each individual, under either randomized or non-randomized allocation of treatment, is {A*, ΔA, T*, ΔT}. Following Murphy et al. (2001) and Johnson and Tsiatis (2005), under the sequential randomization assumption in (8) and some regularity conditions, including the positivity assumption referenced in Web Appendix Section 1, the distribution of {A*, ΔA,T*, ΔT} under ℙr(·) is absolutely continuous with respect to the distribution of {A*, ΔA, T*, ΔT} under ℙO(·), and a version of the Radon-Nikodym (R-N) derivative is
(9) |
An estimating equation that is a function of observed data and is unbiased under the distribution of ℙr(·) can be re-weighted by the R-N derivative to obtain an unbiased estimating equation using the same observed data, but now under the distribution ℙO(·) (Murphy et al., 2001). Define weights and as
The R-N derivative in (9) suggests using the weighted estimating equation
(10) |
where D* and X̄* are evaluated using weighted risk set indicators
Up to now we have assumed no censoring by C. Let NA(t) = I {A ≤ t, Δ = 1}, for t > 0, denote the zero-one counting process for treatment initiation, with N̄A(t) representing treatment history information up to time t. Similarly, define NC(t) = I(C ≤ t, ΔT = 0). We assume that censoring at t can depend on covariate and treatment history, but conditionally on these is independent of future potential outcomes. This assumption can be expressed formally in terms of the hazard function associated with NC(t),
(11) |
As with the treatment initiation process, we can define a weight function associated with censoring,
This leads to a final modification of the estimating equations for β to accommodate covariate-and treatment-dependent censoring,
(12) |
where D** and X̄** are evaluated using weighted risk set indicators
Writing (12) in terms of counting process notation used in (5) makes the distinct contributions of the four observation patterns in Figure 1 more transparent,
In particular all individuals censored by C, either before or after initiating treatment (cases II and IV in Figure 1), contribute information about β via contributions to the risk set up to the observed censoring time C.
3.4 Estimation of the Weights
The weights in (12) depend on the marginal and conditional density functions of A and C, which can be estimated from fitted models of the marginal and conditional intensity processes associated with NA(t) and NC(t). To estimate fA(t | L̄(t)), we assume λA (t | L̄(t)) follows a proportional hazards regression parameterized using a finite-dimensional parameter αA,
(13) |
where r is a user-specified regression function. (We use a proportional hazards formulation for convenience, but any regression formulation can be used here). The conditional density function fA(t | L̄(t)) can be estimated using the empirical cumulative hazard,
where α̂A is the maximum partial likelihood estimator for model (13).
To estimate the unknown marginal probability density fA(t), we use the Nelson-Aalen estimator for the cumulative hazard function and λ̂A(t) = dΛ̂A(t) for the hazard function. The estimated CDF and density functions are obtained using F̂A(t) = 1 – exp{−Λ̂A(t)} and f̂A(t) = λ̂A (t){1 – F̂A(t)}, respectively. The weight estimators are therefore a function of F̂A, f̂A, and α̂A,
The weights can be estimated in a similar fashion by specifying and fitting a regression model of the form to obtain estimates of fC (t | N̄A(t), L̄(t)) and FC (t | N̄A(t), L̄(t)), and using the Nelson-Aalen estimator to obtain estimates of fC(t) and FC(t).
3.5 Optimal Initiation Time
Referring back to our structural hazard model (2), the survival function for the potential outcome Ta corresponding to initiation time a is Sa(t) = exp{−Λa(t)}, where
(14) |
The g functions are estimated using weighted partial likelihood score as described above, and the baseline cumulative hazard Λ∞(t) is estimated using the (weighted) Breslow estimator that arises from the fitting process. The survivor function Sa(t) can therefore be estimated for any combination of a and t, enabling estimation of specific causal contrasts such as mortality ratios Sa(t)/Sa′(t) for a ≠ a′.
With sufficient amounts of data, we also can use model output to infer optimal timing for treatment initiation. An optimal initiation time is defined as the value of a that maximizes an objective function written in terms of a functional of the distribution of potential outcomes. Specifically, let Fa(t) = P(Ta ≤ t) denote the CDF associated with Ta, and let θa = θ(Fa) denote a scalar functional of Fa (e.g., the mean ∫ t dFa(t) or median ). For a given functional θa, the optimal initiation time is aopt = arg maxaθa.
In our application we use one-year survival θa = 1 – Fa(t0), with t0 set to 52 weeks, as the primary endpoint. Our estimates of aopt were somewhat unstable because the one-year mortality curve as a function of initiation time appears to be monotone increasing from zero (i.e., nonconvex). Alternatively, we compare mortality rates for contextually motivated initiation time intervals (trials referenced in Section 1) to draw inferences about optimal timing. We elaborate in Section 4.
4. Application to AMPATH data
Our analysis dataset contains 4903 HIV/TB co-infected patients who had TB therapy initiated and had a CD4 count below 350 at the start of TB therapy. Under guidelines in place at the time, these patients were eligible for ART initiation. Baseline (t = 0) is defined as the time of TB treatment initiation. The total number of deaths is 541. To avoid influence of sparsely distributed large values of ART initiation times, we administratively censor the data at 1.5 years (tmax). Of the 3302 patients in case II and 1060 patients in case IV of Figure 1, 1335 and 38 were administratively censored at tmax, respectively.
At the time when these data were collected, baseline CD4 was a key marker used to decide ART initiation time. To be consistent with AMPATH guidelines, we divide baseline CD4 count into groups defined by the intervals [0, 50], (50, 200], (200, 350]. Median weeks to ART initiation is 4 for CD4 ≤ 50, 8 for CD4 ∈ (50, 200] and 12 for CD4 ∈ (200, 350].
Baseline covariates and time-varying CD4 count (1.9 CD4 measures per year per person) were used to fit the hazard models λA(t |L̄(t)) and λC(t | N̄A(t), L̄(t)) leading to treatment and censoring weights. Each model included the main effect of each covariate; higher-order polynomial terms for continuous variables were tested and found not to add information. Estimated model coefficients are summarized in Web Table 2. Bootstrap re-sampling with 1000 replicates is used to estimate standard errors of the estimated mean one-year survival; censoring and ART initiation time models are re-fit within each bootstrap sample.
All calculations are carried out in R. The hazard models for ART initiation time and censoring time are fitted using R function coxph. The structural model given by (3) is fitted using weighted partial likelihood, also using coxph (Therneau, 2015), with spline basis functions for g1(a), g2(t – a) and g3(a(t – a)) generated using the R function ns. We place knots at the 25th, 50th and 75th percentile of the uncensored values of A* for g1(·); T* – A* for g2(·); and A*(T* – A*) for g3(·).
In Web Appendix Section 3, we describe steps of fitting the structural model (3), show plots of each estimated g function, and summarize key findings from the plots. Plots of g functions for the CD≤50 subgroup appear in Figure 3.
The plot of ĝ1(a) in the first panel of Figure 3 shows that the instantaneous effect of ART initiation is U-shaped, with maximum benefit (lowest mortality hazard) just after 10 weeks, and lower effectiveness with longer delays. In the second panel, ĝ2(t — a) depicts the effect of treatment duration, and generally indicates that longer duration times are associated with lower mortality hazard. The increasing trend in the interaction term ĝ3(a(t — a)) suggests that delayed ART initiation reduces the effect of duration on treatment. The quadratic trend in g(x0032)1(a) supports the notion that immediate treatment initiation carries some risk of elevated mortality that is possibly balanced out by the benefit of remaining on treatment longer. The net causal effect is summarized in plots comparing one-year mortality rate between regimes a and a′, where a = 0 and a′ > 0 (Figure 5, described below).
Figure 4 shows estimated mortality curves, derived using (14), for selected ART initiation times a ∈ {0, 8, 24, ∞} and stratified by baseline CD4 subgroup. Differences between these curves are causal contrasts. For each distinct value of a, mortality rate is clearly highest for those with baseline CD4 ∈ [0, 50]. Looking within CD4 strata, comparing curves for a = 0 and a = 8 to those with a > 8 suggests a benefit of early initiation for all groups, with greatest benefit for those in the lowest CD4 stratum. Figure 5 shows one-year mortality and the causal treatment effect Ŝa (52) – Ŝ0 (52) as a function of treatment initiation time, stratified by baseline CD4 subgroup. The effect of early initiation is most pronounced for CD4 ≤ 50. The results suggest immediate initiation for CD4≤ 200. Diagnostic plots suggest the model fits the data well (Web Figure 4).
Finally, we can emulate comparisons between regimens reported in the randomized trials cited earlier. We can mimic random allocation of treatment initiation time to specific intervals by assuming a distribution for A that is independent of outcomes and covariates, and compare interval-specific mortality rates to draw inferences about treatment timing. In our data example, we assume A follows a uniform distribution, and calculate one-year survival probability associated with initiating in a given interval [t1, t2) via ∫a∈[t1,t2). {1 – F̂a(t0)}dFU(a), where FU(·) is the CDF of the uniform distribution on [0, tmax]. To illustrate, following the SAPIT trial, we compare one-year mortality for initiating intervals [0, 4] and [8, 12] weeks. For those with CD4 ≤ 50 (n = 1540), our model estimated the mean mortality rate to be .16 (.14, .19) for [0, 4] and .20 (.17, .24) for [8, 12], with a p-value of .02 for the difference between the means, as compared to .10 and .20 respectively for the two regimens from the trial (n = 72), with a p-value of .17 for the incidence-rate ratio.
Our results suggest ART should be initiated within 8 weeks of the initiation of TB therapy for AMPATH patients with CD4 counts lower than 200. There is a marked increase in expected one-year mortality when ART is initiated after 8 weeks (see Figure 5). For patients with CD4 ∈ (200, 350], while early initiation still results in lowest one-year mortality, there is not strong statistical evidence to support the conclusion that a specific ART initiation time (or time interval) will lead to reduced one-year mortality. Our results are consistent with AMPATH guidelines (Web Table 3) in treating HIV/TB co-infected patients, and are consistent with general findings of randomized controlled trials.
5. Simulation
We conducted a simulation to study properties of our weighted estimator when the model is correctly specified and to evaluate sensitivity of the weighted estimator to violations of the no unmeasured confounding assumption. We simulate data from a simplified version of our structural model (1). We examine bias, variability and confidence interval coverage rates related to estimates of one-year mortality for different choices of a under three scenarios: 1) under random allocation of treatment; 2) with measured confounding and 3) in the presence of unmeasured confounding. The simulation results show near-zero bias and nominal coverage probability for Scenario 1. In the presence of measured confounding (Senario 2), the weighted estimator eliminates nearly all the bias and coverage probabilities are close to nominal levels as compared to the unweighted estimator. Unmeasured confounding (Scenario 3) produces bias in proportion to the degree of confounding. We provide details about the data generation algorithm, the parameter values and results in Web Appendix Section 2.
6. Summary and Discussion
Timing of ART initiation is important in HIV/TB co-infection. Determining the optimal ART initiation time is difficult because of the need to achieve balance between the risk for mortality and potential adverse events associated with early antiviral therapy initiation. Three randomized controlled trials (AACTG, SAPIT, and CAMELIA) support earlier initiation of ART for those with very low CD4 count, but initiation times are quantified on an interval scale. Our data, derived from electronic health records, allow higher-resolution inference for initiation time on a continuous scale, are sampled from a well-defined population, and reflect outcomes from actual clinical practice.
Our results are largely consistent with the findings of the clinical trials, and provide important reinforcement and elaboration of those findings. Our model provides estimates of mortality for any potential initiation time ranging from 0 to 40 weeks, and moreover it captures the separate effects of ART initiation and ART duration. This latter feature enables examination of the potential trade-offs associated with early initiation, illustrated in Figure 3. In addition to reinforcing the findings from recent trials, inference about grouped-interval-based optimal initiation times for ART are consistent with current AMPATH guidelines.
Our model assumes that both treatment initiation and censoring are independent of potential mortality outcomes, conditionally on baseline and time-dependent covariates. Given that treatment decisions are based on clinical indicators that we included in our weight model, the treatment ignorability assumption has substantive justification. It is possible that censoring could be associated with higher death rate. In the future, we will develop formal representations of potential selection biases, and use those as a basis for examining sensitivity to violations of assumptions about ignorable treatment assignment and censoring.
Our model, though flexible, can be extended in several directions. First, an important structural assumption is that the effects of initiation, duration and their interaction are additive on the log hazard scale; this requirement could be relaxed by introducing a multidimensional spline function. Fitting this sort of model would likely require a larger dataset. Second, our model considers regimens that are static in the sense that they are dependent on a baseline covariate (here, CD4 count). A potential topic of further research is comparison of dynamic treatment regimes, which are defined in terms of an individual's evolving history; e.g., ‘initiate ART when CD4 first drops below x’ (Robins et al., 2008). Although we considered this approach, in AMPATH — as in most resource-constrained HIV care environments — data needed for these comparisons are limited because CD4 and other HIV disease markers are measured infrequently (typically every six months, but sometimes less often).
Finally, focused clinical questions that have generated considerable data from both randomized trials and observational studies are potentially fertile ground for development of new methods that combine, synthesize or compare available evidence. This activity has particular importance in the field of HIV, where large observational databases are forming the basis for important and far-reaching policy related to treatment and policy interventions (Günthard et al., 2016).
Supplementary Material
Acknowledgments
The authors are grateful to E. Jane Carter, MD, Rami Kantor, MD, Michael Littman, PhD, Tao Liu, PhD, Xi (Rossi) Luo, PhD (Brown University); Brent Johnson, PhD (University of Rochester), and Beverly Musick, MS (Indiana University) for helpful input and discussion, and to the editor, associate editor, and two anonymous reviewers whose comments and suggestions led to substantial improvements in the manuscript. Beverly Musick also constructed the dataset that was used for this analysis. This work was partially funded by grants R01-AI-108441, R01-CA-183854, U01-AI-069911, and P30-AI-42853 from the U.S. National Institutes of Health, and contract number 623-A-00-0-08-00003-00 from United States Agency for International Development (USAID).
Footnotes
Supplementary Materials: Web Appendices, Tables, and Figures referenced in Sections 3, 4, 5, and the R code used to implement our approach are available with this paper at the Biometrics website on Wiley Online Library.
References
- Abdool Karim SS, Naidoo K, Grobler A, et al. Timing of initiation of antiretroviral drugs during tuberculosis therapy. New England Journal of Medicine. 2010;362:697–706. doi: 10.1056/NEJMoa0905848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abdool Karim SS, Naidoo K, Grobler A, et al. Integration of antiretroviral therapy with tuberculosis treatment. New England Journal of Medicine. 2011;365:1492–1501. doi: 10.1056/NEJMoa1014181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blanc FX, Sok T, Laureillard D, et al. Earlier versus later start of antiretroviral therapy in HIV-infected adults with tuberculosis. New England Journal of Medicine. 2011;365:1471–1481. doi: 10.1056/NEJMoa1013911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Einterz RM, Kimaiyo S, Mengech HN, et al. Responding to the HIV pandemic: the power of an academic medical partnership. Academic Medicine. 2007;82:812–818. doi: 10.1097/ACM.0b013e3180cc29f1. [DOI] [PubMed] [Google Scholar]
- Fleming TR, Harrington DP. Counting Processes and Survival Analysis. 2nd John Wiley & Sons; 2005. [Google Scholar]
- Franke MF, Robins JM, Mugabo J, et al. Effectiveness of early antiretroviral therapy initiation to improve survival among HIV-infected adults with tuberculosis: a retrospective cohort study. PLoS Medicine. 2011;8:e1001029. doi: 10.1371/journal.pmed.1001029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Günthard H, Saag M, Benson C, et al. Antiretroviral drugs for treatment and prevention of HIV infection in adults: 2016 recommendations of the International Antiviral Society–USA Panel. Journal of the American Medical Association. 2016;316:191–210. doi: 10.1001/jama.2016.8900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer; 2009. [Google Scholar]
- Havlir DV, Kendall MA, Ive P, et al. Timing of antiretroviral therapy for HIV-1 infection and tuberculosis. New England Journal of Medicine. 2011;365:1482–1491. doi: 10.1056/NEJMoa1013607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson BA, Tsiatis AA. Estimating mean response as a function of treatment duration in an observational study, where duration may be informatively censored. Biometrics. 2004;60:315–323. doi: 10.1111/j.0006-341X.2004.00175.x. [DOI] [PubMed] [Google Scholar]
- Johnson BA, Tsiatis AA. Semiparametric inference in observational duration-response studies, with duration possibly right-censored. Biometrika. 2005;92:605–618. [Google Scholar]
- Murphy S, Van Der Laan M, Robins J. Marginal mean models for dynamic regimes. Journal of the American Statistical Association. 2001;96:1410–1423. doi: 10.1198/016214501753382327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rachlis B, Ochieng D, Geng E, et al. Evaluating outcomes of patients lost to follow-up in a large comprehensive care treatment program in western kenya. Journal of Acquired Immune Deficiency Syndromes. 2015;68:e46. doi: 10.1097/QAI.0000000000000492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robins J, Orellana L, Rotnitzky A. Estimation and extrapolation of optimal treatment and testing strategies. Statistics in Medicine. 2008;27:4678–4721. doi: 10.1002/sim.3301. [DOI] [PubMed] [Google Scholar]
- Robins JM. Marginal structural models versus structural nested models as tools for causal inference. Statistical Models in Epidemiology, the Environment and Clinical Trials. 1999;116:95–133. [Google Scholar]
- Therneau TM. A Package for Survival Analysis in S. version 2.38 2015 [Google Scholar]
- Varma JK, Nateniyom S, Akksilp S, et al. HIV care and treatment factors associated with improved survival during TB treatment in Thailand: an observational study. BMC Infectious Diseases. 2009;9:42. doi: 10.1186/1471-2334-9-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Velasco M, Castilla V, Sanz J, et al. Effect of simultaneous use of highly active antiretroviral therapy on survival of HIV patients with tuberculosis. Journal of Acquired Immune Deficiency Syndromes. 2009;50:148–152. doi: 10.1097/QAI.0b013e31819367e7. [DOI] [PubMed] [Google Scholar]
- WHO. Consolidated guidelines on the use of antiretroviral drugs for treating and preventing HIV infection. World Health Organization; 2013. [PubMed] [Google Scholar]
- Xiao Y, Abrahamowicz M, Moodie EE, Weber R, Young J. Flexible marginal structural models for estimating the cumulative effect of a time-dependent treatment on the hazard: reassessing the cardiovascular risks of didanosine treatment in the swiss HIV cohort study. Journal of the American Statistical Association. 2014;109:455–464. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.