Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Oct 29.
Published in final edited form as: Biometrics. 2012 Feb 21;68(3):10.1111/j.1541-0420.2011.01738.x. doi: 10.1111/j.1541-0420.2011.01738.x

Impact of Timing of Starting Treatment Following Infection with Application to Initiating HAART in HIV-Positive Patients

Judith J Lok 1, Victor DeGruttola 1
PMCID: PMC3811162  NIHMSID: NIHMS490373  PMID: 22352840

Summary

We estimate how the effect of antiretroviral treatment depends on the time from HIV-infection to initiation of treatment, using observational data. A major challenge in making inferences from such observational data is that treatment is not randomly assigned; e.g., if time of initiation depends on disease status, this dependence will induce bias in the estimation of the effect of interest. To overcome this, we develop a new class of Structural Nested Mean Models (SNMMs) to estimate the impact of time of initiation of treatment after infection on an outcome measured a fixed duration after initiation, compared to the effect of not initiating treatment. This leads to a SNMM which models the effect of multiple dosages of treatment on a time-dependent outcome, unlike most existing SNNMs, which focus on the effect of one dosage of treatment on an outcome measured at the end of the study. Our identifying assumption is that there are no unmeasured confounders. We illustrate our methods using the observational AIEDRP (Acute Infection and Early Disease Research Program) Core01 database on HIV. The current standard of care in HIV-infected patients is Highly Active Anti-Retroviral Treatment (HAART). However, the optimal time to start HAART has not yet been identified. The presented new class of SNNMs allows one to estimate how the effect of one year of HAART depends on the time between estimated date of infection and treatment initiation and on patient characteristics. We conclude that HAART substantially affects immune reconstitution in the early and acute phase of HIV-infection.

Keywords: Causal inference, CD4+ T-lymphocyte counts, Counterfactuals, Differential effects of HAART, HIV/AIDS, Longitudinal data, Observational studies, Structural Nested Mean Models

1. Introduction

The question of when to start therapy in HIV-positive patients is arguably the most pressing issue in HIV research now, and affects the life of millions of HIV-positive individuals. Providing quantitative evidence to inform this debate is challenging, since no clinical trial data are available, and one has to rely on observational data. One statistical challenge in estimating the effect of time of treatment initiation from observational data arises from selection effects/confounding. For example, patients with a steeper decline of CD4+ T-lymphocyte count (a measure of immune function, hereafter referred to as CD4 count) following infection may receive treatment earlier than others, since they have a worse prognosis. Hence, any method must adjust for the confounding factors that impact both (1) time at which treatment is started and (2) the prognosis with respect to the outcome of interest. If these confounders are also predicted by past treatment, as is well-known to be the case for the CD4 count in HIV-positive patients, standard methods, conditioning on the confounders, lead to biased results (Robins et al. (1992), Hernán et al. (2005), Robins et al. (2000), Hernán et al. (2000)). Causal inference methods are necessary.

Structural Nested Mean Models (SNMMs) have already been developed for repeated outcomes, such as those described in Robins (1994) that focus on the effect of one dosage of treatment given data observed immediately before treatment. By contrast, we are interested in the effect of multiple dosages of treatment given in one year. The latter is not simply the sum of the effects of the individual dosages of treatment: these effects are conditional on data observed immediately before the specific dosage, and thus each effect is conditional on a different past. Adding them would require a model for all relevant covariates given past treatment and covariate history, and such a model would likely result in model misspecification and bias (Robins (1986, 1987a,b, 1988, 1989)). SNMMs modeling the effect of multiple dosages at once have been investigated in Lok et al. (2007), but those results are for an outcome measured at the end of the study.

This article proposes methods to investigate how the effect of a fixed duration of treatment, measured after this duration, depends on the time from infection to treatment initiation, using observational data. To be more specific, our effect of interest is the expected difference between the outcome after say X months under the treatment regime “start immediately” versus under the treatment regime “do not treat for X months”, given past observed patient characteristics. To accomodate both time-varying outcomes and the effect of multiple dosages of treatment, we develop a new class of SNMMs. Our assumptions and methods are similar to Robins (1994, 1998), Robins et al. (1992) and Lok et al. (2004, 2007). As for those classical causal inference methods, our identifying assumption is that there is no unmeasured confounding (introduced in e.g. Robins et al. (1992)), and that censoring is not necessarily independent, but only depends on past observed characteristics (Missing At Random, or MAR, introduced in e.g. Robins et al. (1995)). We apply our new class of SNNMs to the AIEDRP data, to estimate the effect of one year of HAART, Highly Active Anti-Retroviral Treatment, in the acute and early stage of HIV-1-infection, and how this effect depends on the time between estimated date of infection and initiation of HAART. Our outcome of interest is the state of the immune system as measured by increase in CD4 count. To date, no clinical trial data exist for this investigation, and one has to rely on observational data.

Although our focus is on the effect of time of initiation of HAART in HIV-positive patients, our methods have broad application to investigations where interest is in the effect of a treatment initiated after a certain duration of some event. The outcome of interest can be any outcome measured after some fixed time following start of treatment.

This article is organized as follows. Section 2 introduces the observational AIEDRP (Acute Infection and Early Disease Research Program) Core01 database, describes what we want to estimate, and why. Section 3 introduces a new class of SNNMs, and describes our estimators as solutions to unbiased estimating equations. Section 4 applies these methods to the AIEDRP data. Section 5 concludes this article with some discussion.

2. Motivating data: impact of timing of starting HAART following infection in HIV-positive patients

Guidelines about when to start HAART in HIV-positive patients have recently undergone several revisions (Hammer et al. (2006, 2008); Thompson et al. (2010); Panel on Antiretroviral Guidelines for Adults and Adolescents (2008 , 2009); ?); WHO Dept. of HIV/AIDS (2010)). Delaying the initiation of HAART has the advantage of postponing the onset of adverse events or drug resistance, but may also lead to irreversible immune system damage. Application of our methods to observational data on treatment initiation will help provide insight into these tradeoffs. Current clinical guidelines regarding HIV-positive patients recommend initiation of HAART when CD4 drops below 350–500; viral load may also play a role in the decision of when to start (Thompson et al. (2010); Panel on Antiretroviral Guidelines for Adults and Adolescents (2011); WHO Dept. of HIV/AIDS (2010)). There is considerable interest in whether treatment at an earlier disease stage may help to preserve immune function (Thompson et al. (2010); Panel on Antiretroviral Guidelines for Adults and Adolescents (2009); WHO Dept. of HIV/AIDS (2010); Phillips and Emery (2009); Wood and Lawn (2009); Kitahata et al. (2009); Sabin and Phillips (2009)). The current interest in using treatment to control epidemic spread heightens interest in these issues, as early treatment can only be ethically justified if it benefits individual patients, regardless of the potential for community-wide benefits (Granich et al. (2009), DeGruttola et al. (2008)). Furthermore, current CDC efforts aim for earlier HIV-diagnoses (Branson et al. (2006)), which, if successful, will increase the need to make such decisions, compared to a setting where most patients are only diagnosed when they have advanced disease.

This paper estimates how the effect of treatment depends on the time from estimated date of infection to treatment initiation. This work will therefore not lead to a recommended CD4 count to start HAART, but rather will provide insight into the response of the immune system on early treatment [[Victor: would you please beef this up? Is it true that this response is unknown and there are no other available data on acute and early infection?]].

We have applied our new class of SNMMs to the observational AIEDRP (Acute Infection and Early Disease Research Program) Core01 database on 1762 HIV-positive patients, diagnosed and followed during acute or early HIV-1-infection. These data arose from a cohort of patients who were followed from soon after the estimated time of infection, and in some cases even from before infection. Our hypothesis is that shorter time from HIV-1-infection to initiation of HAART will be associated with improved immune reconstitution following one year of HAART.

Dates of infection were estimated using a set of rules based on EIA, Western Blot, CD4 cell percentage and viral load (Hecht et al. (2006); Smith et al. (2006)). 1203 patients initiated treatment under follow-up, and 559 did not. Among those who started treatment, 293 and 214 patients did so in the acute and early phases, respectively (defined as < 90 and > 90 days from Estimated Date of Infection), and stayed on treatment for at least one year. In addition, there are 129 patients with at least 3 years of follow-up who did not start treatment within 3 years of infection. Because the large AIEDRP Core01 database has considerable variability in the time of initiating treatment and in characteristics of patients at this time, it is an excellent basis for investigating the impact of timing of treatment on immune reconstitution. There was a broad range of time from infection to treatment initiation (interquartile spread 1.5 to 6 months). The mean time on treatment was about 1.3 years, but about a quarter of the patents were treated for more than 2 years during follow-up. The viral load and CD4 count range at time of treatment initiation was very broad (interquartile spread viral load 17,000 to 400,000 copies and CD4 count 340 to 630 cells/ml). Of note: with CD4 counts between 350 and 500, the recommendations regarding treatment initiation from e.g. Panel on Antiretroviral Guidelines for Adults and Adolescents (2011) are not as firm, and the panel was undecided about patients with a CD4 count above 500. Hence, we are studying the population that is most interesting with respect to the discussion on treatment initiation. As date of start of follow-up visits we took the first visit before or at which both viral load and CD4 count were measured.

The main question we want to answer is how the effect of one year of HAART, measured by increases in CD4 count over a year, depends on the time between the estimated date of infection and treatment initiation. We define this effect more specifically as the expected difference between the outcome after one year under the treatment regime “start HAART immediately” versus under the treatment regime “do not treat this year”, given past observed patient characteristics. This includes a decline in untreated patients and a rise in treated patients, and is similar to an intention-to-treat analysis in a randomized clinical trial, since we have not corrected for treatment stops and interruptions.

This effect of interest cannot be estimated by simply averaging over patients following these two treatment regimes since, due to the observational nature of the data, there will be confounding by indication: the patients who are treated do not have the same prognosis as the patients who are untreated. This can e.g. be seen in Figures 1 and 2, as follows. Figure 1 displays the unadjusted mean values of CD4 count one year after the start of HAART for patients who start treatment from 0 to 18 months after infection. It also includes the mean CD4 of the comparator group measured at the same times; patients in this group did not start treatment for at least one year after the month of initiation of therapy for the group to which they are compared. We restricted this analysis to patients who had a visit at the month of treatment initiation and 11, 12 or 13 months afterwards. Only those starting treatment within 18 months and their comparators are included in the plot, because the numbers are small for later months. While there are months in which treated patients have even lower mean CD4 increases than those untreated, this very likely results from selection/confounding. This belief is confirmed by a similar naive analysis of increase in CD4 following initiation of HAART, Figure 2: if the treated and untreated had the same initial CD4, the difference between the two lines in Figure 2 should be about the same as in Figure 1. Thus, we need methods to estimate the treatment effect in the presence of confounding by indication.

Figure 1.

Figure 1

mean CD4 count after 1 year, in subjects with visit this month, with and without ART

Figure 2.

Figure 2

mean CD4 count increase after 1 year, in subjects with visit this month, with and without ART

3. Methods

3.1 Setting and notation

Prior to Section 3.5, we assume that all patients are followed monthly, at month 0, 1, …, K + 1, but visits may be missed, also (and especially) in the earliest months of infection. Extension to fixed times τk is straightforward. 0, the time origin for analysis, is estimated time of infection. We assume that the outcome, henceforth assumed to be the CD4 count, is measured one year after each of these visits. Yk is the CD4 count at month k. Ȳk is the history of CD4 count until month k. Ak is the treatment at month k. We investigate the effect of a binary treatment Ak, which is either given (coded as Ak = 1) or not (coded as Ak = 0) at each month k. Āk is the treatment history until month k. We only consider the impact of starting therapy, and do not consider issues of treatment interruption or compliance; such an analysis is analogous to intent-to-treat in randomized studies. T is the month of treatment initiation, with T = K + 1 if treatment never started. Similarly, Lk are the covariates at month k, and k is the covariate history until month k. Inline graphic is the space in which k takes its values. We suppose that at each visit month k, treatment decisions Ak are made after Lk is measured and known. Throughout this paper we will suppress the subscript i for patients. Yk() is the (counterfactual, not always measured) CD4 count at month k under “no treatment”, and Y¯k() is the CD4 history under “no treatment” until month k. Lk() are the covariates at month k under “no treatment”, with history L¯k(). Yk(m) is the CD4 count at month k under “start treatment at month m”. We assume that observations and counterfactuals on the different patients are independent and identically distributed (Rubin (1978)).

In our application and in clinical practice, Y, A and L will be measured at multiple time points which vary across patients. To get measurements of the outcome for each patient at each month k, we use interpolation, except for visits just prior to onset of treatment, for which we carry the last observation forward. We assume that visits may take place at every month. To account for missed visits, we include the visit pattern as part of the covariates that are measured.

The first subsections assume that full data on all subjects is available; Section 3.5 considers the adaptation of our results to censored outcomes.

3.2 Model

Our model for treatment effect is similar to the model in Lok et al. (2007), but differs in that we consider a different outcome, the CD4 count after one year, for each treatment initiation time:

Definition 3.1

For mk,

γmk(l¯m)=E[Yk+12(m)-Yk+12()L¯m()=l¯m,T=m]. (1)

Note that this is a model for treatment effect, because it compares the outcome under different fixed treatments in the same group of patients. In fact, it is a model for the treatment effect in the treated, since it conditions on the patient starting treatment at month m. This model compares the means, for a patient whose treatment started at month m with covariates m, of Yk+12(m) and Yk+12(). The former refers to the outcome at month k + 12 had the patient started treatment at month m, and the latter, to the outcome at that month had the patient never started treatment. The model implies that for such a patient, the expected difference between these outcomes is γmk(l¯m). More intuitively, it says that the conditional expectation of the outcome is shifted by γmk(l¯m) if treatment started at month m compared to never starting. Shifts in CD4 greater than 0 imply that treatment is beneficial. Note that for mk + 12, there is no difference in treatments between Yk+12(m) and Yk+12(); because of Consistency Assumption 3.4 below, γmk equals zero. Note also that with this definition, the effect on the one-year increase in CD4 since month k is the same as the effect on the CD4 count after one year, because choosing the outcome Y to be CD4 increase since treatment initiation would affect both Yk+12(m) and Yk+12() the same way. The effect of one year of treatment initiated at month k, given past covariate history k, in persons initiating treatment at month k, is

γkk(l¯k). (2)

We will assume a semi-parametric structure. We require:

Assumption 3.2: (Structural Nested Mean Model)

For m = k, …, k + 11, suppose that γm,ψk(l¯m) is a correctly specified model for E[Yk+12(m)-Yk+12()L¯m()=l¯m,T=m], with the true parameter equal to ψ0.

In the following, ψ is always a parameter in ℝp.

3.3 The Assumption of No Unmeasured Confounding and the Consistency Assumption

As in Robins et al. (1992), Robins (1994, 1998), Keiding et al. (1999), and Keiding (1999), to distinguish between treatment effect and selection bias, we require the Assumption of No Unmeasured Confounding. This implies that we have information on all factors that both: (1) influence treatment decisions and (2) possibly predict a patient’s prognosis with respect to the outcome of interest. We note that Yk+12(), the outcome at month k + 12 without treatment, is an indication of the patient’s prognosis with respect to the outcome of interest. If there is no unmeasured confounding, treatment decisions at month m (described by Am) are independent of this unmeasured prognosis Yk+12(), given the observed past treatment and covariate history Ām−1 and m.

Assumption 3.3: (No Unmeasured Confounding - formalization)

AmYk+12()L¯m,A¯m-1 (3)

for m = 0, …, K, where ⫫ means “is independent of” (Dawid (1979)).

Note that the observed outcome Yk+12 could not play the role of Yk+12() here, because if treatment affects the outcome, treatment decisions may well affect Yk+12 (but not Yk+12()).

In order to estimate the effect of treatment, we need the following consistency assumption. Note that if a person is not treated until month k, there is no difference in treatment between Yk and Yk(). For this and similar occasions, we assume that then the outcome would have been the same:

Assumption 3.4: (Consistency)

If Tk, Yk=Yk() and L¯k=L¯k(). Y (T) = Y.

This is a usual consistency assumption for Structural Nested Models.

3.4 Estimation without censoring

The proofs in this section differ from Lok et al. (2007) in that we consider the outcome at different months k+12 since infection, and from Robins (1994) in that we use a different model for treatment effect.

Definition 3.5

On Āk−1 = 0̄, for kK − 11, define

H(k)=Yk+12-γTk(L¯T)=Yk+12-γψ0,Tk(L¯T). (4)

For the true ψ0, Hψ0(k) = H(k). H(k) mimics a counterfactual outcome:

Theorem 3.6: (Mimicking counterfactual outcomes.)

Under Consistency Assumption 3.4, for kK − 11 and m with kmK,

E[H(k)L¯m,A¯m-1=0¯,Am]=E[Yk+12()L¯m,A¯m-1=0¯,Am].

Note that for mk + 12 this result is trivial, since H(k)=Yk+12=Yk+12() there. The proof of Theorem 3.6 can be found in the Web-Appendix.

As we will explain now, using Theorem 3.6, models for the treatment process A based on the observed past treatment and covariate history are used as a tool for the estimation of treatment effect, as in, for example, Robins et al. (1992); Robins (1994, 1998); Keiding et al. (1999). Define

p(m):=P(Am=1A¯m-1=0¯,L¯m), (5)

for m = 0, …, K, the probability of treatment initiation given past observed patient characteristics. p(m) is an unknown that needs to be estimated. Henceforth we assume that pθ(m) is a correctly specified model for p(m), with θ0 the true parameter. Specifying a model pθ(m) for p(m) is potentially easy, since it is a model for the start of treatment in practice, and clinicians may have clear ideas about this process, at least qualitatively. A flexible model that can reflect p(m) is a standard pooled (over patients and visits) logistic regression model: where Ām−1 = 0̄ (no treatment before month m) and a visit took place at month m,

logitpθ(m)=θ·f(L¯m),

with θ⃗0 the true (nuisance) parameter vector and f⃗(m) ∈ ℝdimθ a vector depending on m pre-specified by the data analyst. We assume that treatment can only start at a visit, so that pθ(m) = 0 unless a visit took place. We estimate the parameters in this model as usual by maximum partial likelihood, resulting in unbiased estimating equations for θ0.

The idea behind estimation is that under the Assumption of No Unmeasured Confounding 3.3, Yk+12() does not help to predict treatment initiation given this patient was not treated before month m and given his or her covariate history until month m. Hψ0(k) has the same conditional expectation as Yk+12(), according to Theorem 3.6. Hence, neither would Hψ0(k) help to predict treatment initiation given this same history. So, one can add αHψ0(k) to the pooled logistic regression model, and look for α̂(ψ) = 0. This approach is intuitively appealing since the “true α” should be zero. This can be done by using standard software using e.g. a grid search; this was conjectured in ?Robins (1998), proved in Lok (2008), and explained in a partial likelihood context in Lok (2007). As we will see later, we will not need this intuition or grid search for estimation, since we will restrict to models γ which are linear in ψ. Therefore, we will only need the following theorem (it turns out that the grid search solves estimating equations like these, as explained in e.g. Lok (2008)):

Theorem 3.7: (Unbiased estimating equations)

Suppose that Consistency Assumption 3.4 and the Assumption of No Unmeasured Confounding 3.3 hold. Consider any qmk:L¯mRp, k = 0, …, K − 11, m = k, …, k + 11, which are measurable, bounded, and vector-valued. Define Then

Ek=0K-11m=kk+11qmk(L¯m)H(k)1A¯m-1=0¯1visit(m)(Am-p(m))=0,

with 1visit(m) the indicator of whether a visit took place at month m. Thus if pθ(m) and γψ are correctly specified (parametric) models for p (m) and γ, then

Pnk=0K-11m=kk+11qmk(L¯m)Hψ(k)1A¯m-1=0¯1visit(m)(Am-pθ(m))=0,

stacked with the estimating equations for θ0, with Pn the empirical measure PnX=1/ni=1nXi, are unbiased estimation equations for both the parameter ψ and the (nuisance) parameter θ. The qmk here are allowed to depend on ψ and θ, as long as they are measurable and bounded for (θ0, ψ0).

The proof of Theorem 3.7 can be found in the Web-Appendix. In general, when estimating ψ of dimension dimψ, one would choose a number of dimψ estimating equations of the form of Theorem 3.7, by specifying q⃗ of dimension dimψ. We will return to specifying q⃗ in Section 4.

We could have included terms with mk+12. However, for those, there is no power to estimate ψ0. Where 1Āk+11=0̄ equals 1, Hψ(k) = Yk+12, a function of k+12, and hence such a term is of the form

qm,ψk(L¯m)1A¯m-1=0¯1visit(m)(Am-pθ(m)),

which has expectation 0 for each ψ and the true θ0: conditioning on m, Ām−1 fixes all factors except for (Amp (m)), which has conditional expectation 0.

3.5 Estimation in the presence of censoring

We need to include censoring by end of follow-up in our framework, since patients can be lost to follow-up in practice. Write Ck = 0 if a patient is uncensored at month k, and Ck = 1 otherwise. We will use Inverse Probability of Censoring Weighting (IPCW) to handle censoring (see e.g. Hernán et al. (2005) or Robins et al. (1995)). Following those authors, we suppose that censoring is Missing At Random (MAR):

Assumption 3.8: (Missing At Random (MAR))

(L¯,A¯)Ck+1L¯k,A¯k,C¯k=0¯.

Under this assumption, censoring may depend on past observed data. However, it states that censoring is independent of future patient characteristics and future treatments (had they been observable) given past observed data. Like those previous authors, we also need a Positivity Assumption, to assure that data are available for estimation. As usually done with Inverse Probability of Censoring Weighting, we henceforth assume that (Ȳ, Ā, ) takes values in a discrete space.

Assumption 3.9: (Positivity)

Suppose that P(K = 0̄, Ā = ā, = ) > 0 if P(Ā = ā, L̄ = )> 0.

Then we have the following extension of Theorem 3.7:

Theorem 3.10: (Unbiased estimating equations with censoring)

Suppose that the conditions of Theorem 3.7, MAR Assumption 3.8 and Positivity Assumption 3.9 hold. Write

EEmk(ψ,θ)=qmk(L¯m)Hψ(k)1A¯m-1=0¯1visit(m)(Am-pθ(m)).

Then

Pnk=0K-11m=kk+11EEmk(ψ,θ)1C¯k+12=0¯(p=k+1k+12P(Cp=0L¯p-1,A¯p-1,C¯p-1=0¯))-1=0

is an unbiased estimation equation for both the parameter ψ and the (nuisance) parameter θ.

The proof of this Theorem can be found in the Web-Appendix.

3.6 Fitting the models and asymptotics

In the estimating equations of Theorem 3.10, we still have the expression P (Am = 1|m, Ām−1 = 0̄), which is hard to estimate directly in the presence of censoring. The following Lemma solves that issue:

Lemma 3.11

Under Missing At Random Assumption 3.8,

P(Am=1L¯m,A¯m-1=0¯,C¯m=0¯)=P(Am=1L¯m,A¯m-1=0¯)

on {Ām−1 = 0̄, m = 0̄}.

The proof of this Lemma can be found in the Web-Appendix.

Fitting a model γψ can thus be done as follows. If one uses e.g. also a standard pooled logistic regression model for the censoring probabilities, with true parameter η0, one can fit this model first, resulting in η̂. Then, one can fit a model for the treatment initiation probabilities θ̂ using Lemma 3.11, which allows one to estimate the treatment initiation probabilities using the observed data, using standard software for logistic regression, e.g. SAS PROC LOGISTIC.

To facilitate calculating ψ̂, we have the following corrolary of Theorem 3.10:

Corollary 3.12

If γ is linear in ψ, this approach leads to a linear restriction on ψ once the parameter θ has been estimated. This leads to a closed form expression for ψ̂.

This follows immediately from Theorem 3.10, since if γ is linear in ψ, Hψ is linear in ψ. This closed form expression for ψ̂ facilitates estimation, since such linear restrictions can be solved using e.g. SAS PROC IML.

If one models both censoring and for treatment initiation with pooled logistic regression models, the resulting estimators η̂ and θ̂ are zeroes of unbiased estimating equations of the form PnG = 0. With many other models this will be the case as well. Then, the approach outlined above leads to the same estimators as stacking the resulting estimating equations for η0, θ0, and ψ0 from Theorem 3.10, and solving for η̂, θ̂, and ψ̂. This follows since the estimating equations for η and θ do not depend on the other parameters and completely identify η̂ and θ̂. Stacking the estimating equations leads to joint unbiased estimating equations for the joint parameters (θ0, ψ0, η0). Thus, the resulting estimators are just solutions to unbiased estimating equations PnG = 0. Because of extensive theory about unbiased estimating equations of the form PnG = 0 (see e.g Van der Vaart (1998)), they are, under regularity conditions, consistent and asymptotically normal. One can use the sandwich estimator of the asymptotic variance, but this would require estimating covariances for the estimating equations of η0, θ0 and ψ0. Instead, we implemented the bootstrap, for all 3 parameters (η, θ, ψ) at the same time.

4. Impact of Time to Start HAART on Immune Reconstitution: the AIEDRP Core 1 Database

In our application, described in Section 2, our main interest is in how the effect of one year of HAART depends on the time between estimated date of infection and initiation of treatment. As outcome, we focus on the CD4 increase one year after starting treatment. We used the AIEDRP data described in Section 2. We assume that all visits are recorded in the database and discuss approaches to relaxing this condition in the discussion. Central to our analysis was the ability to adjust for confounding in the initiation of treatment; we considered as confounders: time-varying measurements of CD4 count, viral load, month of treatment initiation, time since the last visit, and demographic factors (and functions of these).

To adjust for confounding by indication and informative censoring, we have applied the analysis method described in Section 3. Initially we assumed that for m = k, …, k + 11,

γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m), (6)

with (k + 12 − m) the duration of treatment from month m to month k + 12, is a correctly specified model for treatment effect γ (Definition 3.1). This specification says that the effect of starting treatment at month m, as compared with no treatment, on the outcome measured at month k + 12, depends linearly on the duration of treatment, k+12−m. The coefficient may depend in a quadratic way on the month the of treatment initiation m.

To fit this model, first, we fitted pooled logistic regression models for treatment initiation and censoring as explained in Section 3.6. Then, applying Theorem 3.10, we chose qmk=0 for mk and

qkk(L¯k)=(12-E[Tr(k,k+12)A¯k=0¯,L¯k])(1kk2),

to get 3 estimating equations for ψ after estimation of θ. Here, Tr(k, k + 12) is the number of months of treatment between month k and 12 months later; e.g., if a patient started treatment at month 3, Tr(2, 14) = 11: of all months between 2 and 14, this patient was treated for 11 months. We estimated the conditional expectations in a prior regression step; in fact, we first estimated the relevant conditional probability of Tr(k, k + 12) being non-zero using SAS PROC LOGISTIC and then PROC REG to get the estimated conditional expectation given that Tr(k, k + 12) is non-zero, restricting the dataset to Ak−1 = 0̄ because we only needed those values with this choice of q, see Theorem 3.10. We did the fitting with only Ak = 0 but saved residuals for all patients by using the WEIGHT statement in SAS, weighting only those patients who have Ak = 0. It can be shown that, under stricter conditions and if there is no censoring, this choice of q⃗ is optimal within the class of estimating equations that set qmk=0 where km (Lok and Robins (2010); deriving this is beyond the scope of the current article). Note that any choice of q here is valid; this choice was made with a goal of gaining efficiency. Notice that, although we used estimated parameters in our choice of q⃗, the relevant unbiased estimating equations for those parameters can again be thought of as stacked with our original estimating equations as described in Section 3.6, so consistency and asymptotic normality still hold, also if we mis-specify the parameters in the fit of q⃗ (of course, as long as the assumptions described in Section 3 are satisfied). Finally, notice that with this initial model (6),

Hψ(k)=Yk+12-(ψ1+ψ2T+ψ3T2)Tr(k,k+12).

We fitted the model by solving the resulting linear estimating equations for ψ̂ in Theorem 3.10 (with θ̂ and η̂ replacing the true parameters), as described in Section 3.6. We used SAS PROC IML.

Figure 3 displays the result of fitting this model for the earlier treatment initiation times, in the gray lines. For each displayed month, these are estimates for the difference in CD4 increase over the next year between the treated and untreated groups, after adjustment for confounding. We did separate analyses for the first and the second year from estimated date of infection, Figures 3 and 4, respectively, because we have much more data in the first year; fitting a single model would put most weight on the data from this year, and estimates for the second would largely be extrapolation. As could have been expected, the difference between untreated and treated patients increases after adjustment for confounding. This probably reflects confounding by indication in the naive analysis. The (pointwise) confidence intervals in Figures 3 and 4 were obtained by bootstrap, using Efron’s percentile method (see e.g. Van der Vaart (1998)).

Figure 3.

Figure 3

effects of 1 year of treatment on CD4

Figure 4.

Figure 4

effects of 1 year of treatment on CD4; 3 parameters

The parameter estimates, along with bootstrap confidence intervals, for this initial model are displayed in Table 1. As expected, treatment increases CD4 count. The trend supports benefit of earlier treatment, but the confidence intervals show that the evidence to support this trend is not strong.

Table 1.

estimates, 95%-CIs, and Z-scores (bootstrap, 5000 replicates) of parameters for year 1.

Model: γm,ψk(L¯m)=(ψ1+ψ2m)(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 24.90 (22,28) 14.2 <.0001
ψ2 −0.73 (−1.5,0.1) −1.7 0.08
Model: γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 24.02 (19,29) 9.5 <.0001
ψ2 −0.19 (−2.6,2.1) −0.2 0.87
ψ3 −0.06 (−0.3,0.2) −0.5 0.64
Model: γm,ψk(L¯m)=(ψ1+ψ2k)(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 25.35 (21,30) 11.7 <.0001
ψ2 −0.66 (−1.5,0.2) −1.5 0.12
Model: γm,ψk(L¯m)=(ψ1+ψ2k+ψ3k2)(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 22.08 (15,29) 6.1 <.0001
ψ2 1.18 (−2.1,4.5) 0.7 0.48
ψ3 −0.16 (−0.4,0.1) −1.2 0.24
Model: γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2+ψ4k)(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 22.60 (15,30) 5.9 <.0001
ψ2 −1.22 (−6.0,3.5) −0.5 0.61
ψ3 −0.07 (−0.3,0.2) −0.5 0.59
ψ4 1.16 (−3.4,5.6) 0.5 0.61
Model: γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m)+ψ4(k+12-m)2
 parameter estimate 95%-CI Z-score p-value
ψ1 12.24 (−55,84) 0.4 0.72
ψ2 −0.19 (−2.6,2.1) −0.2 0.87
ψ3 −0.06 (−0.3,0.2) −0.4 0.65
ψ4 0.91 (−4.6,6.0) 0.3 0.73
Model: γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m)+ψ4lvlm(k+12-m)
 parameter estimate 95%-CI Z-score p-value
ψ1 18.97 (3.5,36) 2.3 0.02
ψ2 −0.10 (−2.5,2.2) −0.1 0.93
ψ3 −0.06 (−0.3,0.2) −0.5 0.64
ψ4 5.34 (−11,21) 0.6 0.52
Model: γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m)+ψ4lvlm
 parameter estimate 95%-CI Z-score p-value
ψ1 13.38 (−21,42) 0.1 0.94
ψ2 0.06 (−2.5,2.5) 0.0 1.00
ψ3 −0.07 (−0.3,0.2) 0.0 1.00
ψ4 13.66 (−22,59) 0.0 0.97

Alternatively, we could assume that again, the treatment effect depends linearly on the duration of treatment, but with a coefficient depending in a quadratic way on the time the outcome is measured, k+12:

γm,ψk(L¯m)=(ψ1+ψ2k+ψ3k2)(k+12-m). (7)

For this model, we chose qmk=0 for mk and

(12-E[Tr(k,k+12)A¯k=0¯,L¯k](12k-E[TTr(k,k+12)A¯k=0¯,L¯k])(12k2-E[T2Tr(k,k+12)A¯k=0¯,L¯k])).

Using the same techniques as for model (6), this leads to the black lines in Figure 3. We conclude that although these models cannot both be true, they lead to similar conclusions regarding how the effect of one year of treatment depends on month of treatment initiation.

Possibly, the mean effect of treatment also depends non-linearly on the duration of treatment. In that case, one could add terms like ψ4(k + 12 − m)2, or have the non-linear term also depend on month of treatment initiation. The mean effect of treatment may also depend on covariates, like log viral load, resistance mutations, and regimen used. To incorporate these, one can extend the model by including terms like ψ4lvlm(k + 12 − m), if the mean effect depends on the log viral load (lvl). We chose some function of (k + 12 − m), the duration of treatment between month m and month k + 12, since the duration of treatment may well predict its effect. If, for example, the treatment effect only depends on the log viral load at treatment initiation for the first month of treatment, one might add terms like ψ4lvlm. We fitted some different 4-parameter models, with model (6) as the basic model and adding different fourth-parameter specifications. The dataset was not rich enough to fit more than 4 parameters, as can be seen from the confidence intervals in Table 1.

To fit these 4-parameter models, we proceeded as above. To find a good choice of q⃗, first consider a more elaborate model:

γm,ψk(L¯m)=(ψ1+ψ2m+ψ3m2)(k+12-m)+ψ4k+ψ5(k+12-m)2+ψ6lvlm(k+12-m)+ψ7lvlm, (8)

for m = k, …, k+11. Similar to the choice of q⃗ for model (6), for model (8) we would use qmk=0 for mk and

qkk(L¯m)=(12-E[Tr(k,k+12)A¯k=0¯,L¯m]12k-E[TTr(k,k+12)A¯k=0¯,L¯k]12k2-E[T2Tr(k,k+12)A¯k=0¯,L¯k]12k-kE[Tr(k,k+12)A¯k=0¯,L¯k]122-E[Tr(k,k+12)2A¯k=0¯,L¯k]12lvlk-E[lvlTTr(k,k+12)A¯k=0¯,L¯k]lvlk-E[lvlTAk+11A¯k=0¯,L¯k]).

To get a choice of q⃗ for a 4-parameter model, we selected the corresponding rows of this q⃗. As for the 3-parameter models above, these 4-parameter q⃗ are optimal in the absence of censoring within the class of estimators with qmk=0 for mk. For model (8),

Hψ(k)=Yk+12-(ψ1+ψ2T+ψ3T2+ψ4k+ψ5Tr(k,k+12)+ψ6lvlT)Tr(k,k+12)-ψ7lvlTAk+11

For all 4-parameter models, Hψ can be found by setting all other parameters in this expression to 0. We did not estimate the effect of CD4 count at time of treatment initiation because increase in CD4 count in the three years after initiating treatment seems to be largely independent of pre-treatment CD4 count (Lok et al. (2010)).

Table 1 displays the estimated parameters with bootstrap confidence intervals also for these 4-parameter models as well as for the 2- and 3-parameter models, for the first 12 months. [[Victor: I was thinking about adding approximate p-values for the parameters by inverting the bootstrap confidence intervals, and then based on that deciding which of these 4-parameter models should be preferred. Do you think that makes sense? One of the referees wanted to know what model should be preferred, and I think that is a reasonable question.]] [[Some words here about the direction of the nonsignificant results: is the treatment effect larger for patients with a larger viral load? I will have to re-do Table 1 given that we prefer model (6) rather than the model we had before, model (7).]]

5. Discussion

This paper extends the Structural Nested Mean Models (SNMMs) from Lok et al. (2007), which describe the effect of multiple dosages of treatment, to an endpoint that varies with time. Robins (1994) did this for the usual SNMM, describing the effect of one dosage of treatment. We have applied this new method to the AIEDRP Core01 database on patients with recent HIV-infection.

We assumed a Structural Nested Mean Model for the way in which treatment affects the outcome of interest. We identified a large class of unbiased estimating equations for the effect of treatment on immune reconstitution after one year of treatment. For our analysis, we chose the optimal estimator within the class with qmk, the free quantity in the estimating equations, equal to 0 for mk, thus avoiding a lot of mis-specification of nuisance parameters had we chosen the optimal estimator itself. Our choice may not be optimal, but turned out to behave much better than a naive initial choice. Additional estimating equations for models with more parameters can be chosen by stacking the extra estimating equations that are described in Section 4. Note that any choice of qmk leads to consistent, asymptotically normal estimates; the choice determines just precision. We are continuing to explore properties of other estimators, including doubly robust ones, see e.g. Lok and Robins (2010).

We investigated the effect of having started treatment, regardless of whether it was maintained for a year. It would also be of interest to study the effect of HAART assuming that it is maintained for a year. Enough treatment options are available now to make this a conceivable counterfactual. In order to do the estimation, we could proceed as in the current paper, but censor patients once they go off their initiated treatment. This most likely introduces informative censoring, which can be accommodated as described in Section 3.5. Currently available information is not sufficiently precise to carry out such analysis; additional data must be collected to address this issue.

In order to apply our methods, we have interpolated the outcome variables if data before and after the time of interest were available, so that we could assume that full data are available until the time of censoring. This may lead to bias, and it would be of interest to develop methods that avoid this. In order to predict treatment intitiation, we have carried the last observation forward if data on specific covariates was missing at the time of the visit. This may be reasonable, given that physicians most likely do not have more information on those covariates either, and we included the time since the last visit in the covariates.

To weaken the Assumption of No Unmeasured Confounding, one could consider applying methods from e.g. Robins et al. (1999) or Robins (1998). Also, we assumed that all visits are recorded in the database. If that were not the case one could construct a database with monthly records, carrying predictors of treatment forward and interpolating outcomes, and assume that treatment can be started at every month. pθ(k) then predicts a visit with treatment initiation (Ak = 1). In addition, while the method we describe is based on interpolation to get outcome measurements for each month k, one could also make use of splines fit to data on CD4 count increase or other outcomes of interest.

We based our analysis on a time scale of months since estimated date of infection. This may not be a real limitation, since in any case, the actual date of infection is generally unknown. Extension of our approach to investigate the treatment effect as a function of the date of infection itself, that is, to incorporate uncertainty in estimated infection times, is an interesting topic for future research. If interval estimates were obtained, one could sample from an assumed distribution with mean and variance taken as the point estimates. For each resample, one could obtain estimates of the effect of treatment on CD4 increase as well as variance estimates for these estimated effects. Using the methods of Little and Rubin (1987), one might obtain point estimates from these resamples as well as interval estimates that take into account the uncertainty in infection time. This may need to be done with special care since there may be additional confounding issues: e.g. CD4 cell percentage was used to estimate the date of infection, and CD4 cell percentage may also be predictive of the treatment effect we are estimating.

The methods in this paper can easily be adapted to any situation where a treatment process gives rise to a counting process N(t), t ∈ [0, τ], e.g. counting the number of treatment changes up to time t. A model for this treatment process can be a tool to estimate the effect of treatment, as p(k) in our paper. If the treatment process is only measured at 0 = τ0 < τ1 < … < τK = τ, one can assume that N only jumps at 0 = τ0 < τ1 < … < τK = τ, too. In our application, the counting process N would take the value 0 until treatment starts, and the value 1 once treatment has started.

The model specification for the treatment effect of interest is rather arbitrary. Therefore, we have also included 4-parameter models. In addition, we have included two different 3-parameter models, see Section 4 and Figure 3. Both 3-parameter models lead to similar results regarding the dependence of the treatment effect on initiation time. More experience with model specification in clinical practice needs to be developed, since this is largely unexplored territory. In addition, in future work we plan to develop tests for model misspecification using ideas about overidentification tests, using that we have more potential unbiased estimating equations than parameters.

It would also be of interest to extend the models to include continuous time measurements of Ā and ; such inclusion would make timing of visits more flexible. However, preliminary efforts suggest that such an approach requires additional assumptions: aside from some regularity conditions, one may have to assume that there is no jump in outcome after one year of treatment, and that there is no instantaneous treatment effect. Also, one may need to assume a parametric model for treatment initiation, e.g. a time–dependent Weibull proportional hazard model. Properties of our estimators using a Cox proportional hazards model for treatment initiation have not been proven, even in the setting of an outcome measured at the end of the study or a survival outcome. For the latter situation, however, Lok (2001) proved that this model leads to unbiased estimating equations, though not of the form PnX, and this approach has been applied in e.g. Robins et al. (1992), Keiding et al. (1999), and Keiding (1999).

Acknowledgments

We are grateful to the patients who volunteered for AIEDRP, to the AIEDRP study team, and to Susan Little and Davey Smith for their help and advice in interpreting the AIEDRP database. We would like to thank James M. Robins for insightful and fruitful discussions. We are indebted to Ray Griner for programming the confidence intervals. This work was sponsored by NIH grant NIAID R01 AI 51164, AI43638, AI074621, R01-GM48704, R37 AI032475, and the Milton Fund.

WEB-APPENDIX Impact of Timing of Starting Treatment Following Infection with Application to Initiating HAART in HIV-Positive Patients

This Web-Appendix has the proofs of the theorems and the lemma.

Proof of Theorem 3.6

We prove this theorem by backward induction, for k fixed, starting with m = K and ending with m = k. Basis: m = K. We distinguish between AK = 0 and AK = 1. First, AK = 0. Under K, ĀK−1 = 0̄, AK = 0, Yk+12()=Yk+12 because this person is never treated and Consistency Assumption 3.4, and H(k) = Yk+12, too, since TK +1 so that γTk is zero. Next, for AK = 1, ĀK−1 = 0̄ and AK = 1 is the same as T = K. Moreover, because of Consistency Assumption 3.4 and the definitions of H(k) and γ,

E[H(k)L¯K=l¯K,T=K]=E[Yk+12-γKk(l¯K)L¯K=l¯K,T=K]=E[Yk+12(K)-γKk(l¯K)L¯K()=l¯K,T=K]=E[Yk+12()L¯K()=l¯K,T=K].

Another application of the Consistency Assumption replaces L¯K() by K again, finishing the proof for m = K.

Induction step

we show that if the theorem holds for m + 1, it also holds for m. We distinguish between Am = 1 and Am = 0. First, Am = 0. Because of the induction hypothesis,

E[H(k)L¯m+1,A¯m=0¯,Am+1]=E[Yk+12()L¯m+1,A¯m=0¯,Am+1].

The result for Am = 0 follows because E[H(k)|m = m, Ām−1 = 0̄, Am = 0] is the expectation of the left hand side of this given m = m, Ām = 0̄, and similarly for the right hand side.

Next, consider Am = 1. Note that then, Ām−1 = 0̄, Am is the same as T = m. Hence, because of the definition of H(k) and Consistency Assumption 3.4,

E[H(k)L¯m=l¯m,T=m]=E[Yk+12-γmk(L¯m)L¯m=l¯m,T=m]=E[Yk+12(m)-γmk(L¯m)L¯m()=l¯m,T=m]=E[Yk+12()L¯m()=l¯m,T=m],

where for the last equality we used Definition 3.1 of γ. Another application of the Consistency Assumption finishes the induction step, and hence the proof of the theorem.

Proof of Theorem 3.7

We can leave out the visit-indicator since if there is no visit at month m and treatment did not start before, Am = p(m) = 0. We show that each term has expectation 0.

E(qmk(L¯m)H(k)1A¯m-1=0¯(Am-p(m)))=E(E[qmk(L¯m)H(k)1A¯m-1=0¯(Am-p(m))L¯m,A¯m-1,Am])=E(qmk(L¯m)E[H(k)L¯m,A¯m-1=0¯]1A¯m-1=0¯(Am-p(m)))=E(qmk(L¯m)E[H(kL¯m,A¯m-1=0¯)]1A¯m-1=0¯(Am-p(m)))=E(E[qmk(L¯m)E[H(k)L¯m,A¯m-1=0¯]1A¯m-1=0¯(Am-p(m))L¯m,A¯m-1])=E(qmk(Lm)E[H(k)L¯m,A¯m-1=0¯]1A¯m-1=0¯E[Am-p(m)L¯m,A¯m-1=0¯])=0,

where for the third equality we use Theorem 3.6 together with the Assumption of No Unmeasured Confounding 3.3, and for the fifth equality we use that the expression includes the factor 1Ām−1=0̄.

Proof of Theorem 3.10

We show this for the product starting at p = 1 instead of k + 1, which suffices because the extra factors can be incorporated in qkm when Āk−1 = 0̄; the latter is the only value of interest since EEkm includes 1Ām−1=0̄. So, first, note that because of Theorem 3.7

0=E(EEkm(ψ0,θ0))=E(EEkm(ψ0,θ0)E[1{C¯k+12=0¯}L¯k+12,A¯k+12]P(C¯k+12=0¯L¯k+12,A¯k+12))=E(E[EEkm(ψ0,θ0)1{C¯k+12=0¯}P(C¯k+12=0¯L¯k+12,A¯k+12)L¯k+12,A¯k+12])=E(EEkm(ψ0,θ0)1{C¯k+12=0¯}P(C¯k+12=0¯L¯k+12,A¯k+12)),

where for the second equality we use that because of Positivity Assumption 3.9 we are not dividing by 0, for the third equality we use that EEkm(ψ0,θ0) and the conditional probability are k+12, Āk+12-measurable, and for in the last equality we use the Law of Iterated Expectations. Next, note that under MAR,

P(C¯k+12=0¯L¯k+12,A¯k+12)=p=1k+12P(Cp=0L¯k+12,A¯k+12,C¯p-1=0¯)=p=1k+12P(Cp=0L¯p-1,A¯p-1,C¯p-1=0¯).

Proof of Lemma 3.11

On {Ām−1 = 0̄, m = 0̄}, the conditioning events are not null events with probability 1. Hence

P(Am=1L¯m,A¯m-1=0¯,C¯m=0¯)=P(Am=1,L¯m,A¯m-1=0¯,C¯m=0¯)P(L¯m,A¯m-1=0¯,C¯m=0¯)=P(Am=1,L¯m,A¯m-1=0¯)p=1mP(Cp=0Am=1,L¯m,A¯m-1=0¯,C¯p-1=0¯)P(L¯m,A¯m-1=0¯)p=1mP(Cp=0L¯m,A¯m-1=0¯,C¯p-1=0¯)=P(Am=1L¯m,A¯m-1=0¯)p=1kP(Cp=0L¯p-1,A¯p-1=0¯,C¯p-1=0¯)p=1mP(Cp=0L¯p-1,A¯p-1=0¯,C¯p-1=0¯)

because of Missing At Random Assumption 3.8.

References

  1. Branson BM, Handsfield HH, Lampe MA, Janssen RS, Taylor AW, Lyss SB, Clark JE. Revised recommendations for hiv testing of adults, adolescents, and pregnant women in health-care settings. Technical report, Centers of Disease Control and Prevention (CDC) MMWR Recomm Rep. 2006;55:1–17. quiz CE1–4. [PubMed] [Google Scholar]
  2. Dawid AP. Conditional independence in statistical theory (with discussion) Journal of the Royal Statistical Society B. 1979;41:1–31. [Google Scholar]
  3. DeGruttola V, Little S, Schooley R. Controlling the HIV epidemic, without a vaccine! AIDS. 2008;22:2554–2555. doi: 10.1097/QAD.0b013e32831940d3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Granich R, Gilks C, Dye C, Cock KD, Williams B. Universal voluntary HIV testing with immediate antiretroviral therapy as a strategy for elimination of HIV transmission: a mathematical model. The Lancet. 2009;373:48–57. doi: 10.1016/S0140-6736(08)61697-9. [DOI] [PubMed] [Google Scholar]
  5. Hammer SM, Eron JJ, Reiss P, Schooley RT, Thompson M, Walmsley S, Cahn P, Fischl MA, Gatell JM, Hirsch MS, Jacobsen DM, Montaner JSG, Richman DD, Yeni PG, Volberding PA. Antiretroviral treatment of adult HIV infection: 2008 recommendations of the International AIDS Society-USA panel. Journal of the American Association. 2008;300:555–570. doi: 10.1001/jama.300.5.555. [DOI] [PubMed] [Google Scholar]
  6. Hammer SM, Saag MS, Schechter M, Montaner JS, Schooley RT, Jacobsen DM, Thompson MA, Carpenter CC, Fischl MA, Gazzard BG, Gatell JM, Hirsch MS, Katzenstein DA, Richman DD, Vella S, Yeni PG, panel PAVIASU. Treatment for adult HIV infection: 2006 recommendations of the International AIDS Society-USA panel. Journal of the American Association. 2006;296:827–843. [PubMed] [Google Scholar]
  7. Hecht FM, Wang L, Collier A, Little S, Markowitz M, Margolick J, Kilby JM, Daar E, Conway B, Holte S, Network A. A multicenter observational study of the potential benefits of initiating combination antiretroviral therapy during acute HIV infection. Journal of Infectious Disease. 2006;194:725–733. doi: 10.1086/506616. [DOI] [PubMed] [Google Scholar]
  8. Hernán MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology. 2000;11:561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]
  9. Hernán MA, Cole SR, Margolick J, Cohen M, Robins JM. Structural accelerated failure time models for survival analysis in studies with time-varying treatments. Pharmacoepidemiology and drug safety. 2005;14:477–491. doi: 10.1002/pds.1064. [DOI] [PubMed] [Google Scholar]
  10. Keiding N. Event history analysis and inference from observational epidemiology. Statistics in Medicine. 1999;18:2353–2363. doi: 10.1002/(sici)1097-0258(19990915/30)18:17/18<2353::aid-sim261>3.0.co;2-#. [DOI] [PubMed] [Google Scholar]
  11. Keiding N, Filiberti M, Esbjerg S, Robins JM, Jacobsen N. The graft versus leukemia effect after bone marrow transplantation: a case study using structural nested failure time models. Biometrics. 1999;55:23–28. doi: 10.1111/j.0006-341x.1999.00023.x. [DOI] [PubMed] [Google Scholar]
  12. Kitahata MM, Gange SJ, Abraham AG, Merriman B, Saag MS, Justice AC, et al. Effect of early versus deferred antiretroviral therapy for HIV on survival. New England Journal of Medicine. 2009;360:1815–1826. doi: 10.1056/NEJMoa0807252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Little RJA, Rubin DB. Statistical Analysis with Missing Data. Wiley; New York: 1987. [Google Scholar]
  14. Lok JJ. PhD thesis. Department of Mathematics, Free University of Amsterdam; 2001. Statistical modelling of causal effects in time. http://www.math.vu.nl/research/theses/pdf/lok.pdf. [Google Scholar]
  15. Lok JJ. Structural nested models and standard software: a mathematical foundation through partial likelihood. Scandinavian Journal of Statistics. 2007;34:186–206. [Google Scholar]
  16. Lok JJ. Statistical modelling of causal effects in continuous time. Annals of Statistics. 2008;36:1464–1507. arXiv: math.ST/0410271 at http://arXiv.org. [Google Scholar]
  17. Lok JJ, Bosch RJ, Benson CA, Collier AC, Robbins GK, Shafer RW, Hughes MD for the ALLRT team. Long-term increase in CD4+ T-cell counts during combination antiretroviral therapy for HIV-1 infection. AIDS. 2010;24:1867–1876. doi: 10.1097/QAD.0b013e32833adbcf. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lok JJ, Gill RD, van der Vaart AW, Robins JM. Estimating the causal effect of a time–varying treatment on time-to-event using structural nested failure time models. Statistica Neerlandica. 2004;58:271–295. [Google Scholar]
  19. Lok JJ, Hernán MA, Robins JM. Optimal start of HAART treatment in HIV positive patients. Proceedings of the Joint Statistical Meetings 2007. 2007:1149–1160. Available on request to jlok@hsph.harvard.edu. [Google Scholar]
  20. Lok JJ, Robins JM. Choice of optimal estimators in structural nested mean models with application to initiating HAART in HIV positive patients after varying duration of infection. Technical report. 2010 Available on request to jlok@hsph.harvard.edu. [Google Scholar]
  21. Panel on Antiretroviral Guidelines for Adults and Adolescents. Guidelines for the use of antiretroviral agents in HIV-1-infected adults and adolescents. Washington, DC: Department of Health and Human Services; 2008. Nov 3, [Accessed 12/09/2009]. Available at http://aidsinfo.nih.gov/contentfiles/AdultandAdolescentGL001226.pdf. [Google Scholar]
  22. Panel on Antiretroviral Guidelines for Adults and Adolescents. Guidelines for the use of antiretroviral agents in HIV-1-infected adults and adolescents. Department of Health and Human Services; 2009. Dec 1, [Accessed 12/09/2009]. (Available at http://aidsinfo.nih.gov/contentfiles/AdultandAdolescentGL.pdf.) [Google Scholar]
  23. Panel on Antiretroviral Guidelines for Adults and Adolescents. Guide-lines for the use of antiretroviral agents in HIV-1-infected adults and adolescents. Department of Health and Human Services; 2011. Jan 10, [Accessed 04/15/2011]. Available at http://www.aidsinfo.nih.gov/ContentFiles/AdultandAdolescentGL.pdf. [Google Scholar]
  24. Phillips AN, Emery S. Predicting the potential benefits of early initiation of ART: time to do a trial to find out. Current Opinions in HIV AIDS. 2009;4:165–166. doi: 10.1097/COH.0b013e328329ec32. [DOI] [PubMed] [Google Scholar]
  25. Robins JM. A new approach to causal inference in mortality studies with a sustained exposure period – Applications to control of the healthy worker survivor effect. Mathematical Modelling. 1986;7:1393–1512. [Google Scholar]
  26. Robins JM. A graphical approach to the identification and estimation of causal parameters in mortality studies with sustained exposure periods. Journal of Chronic Disease. 1987a;40:139S–161S. doi: 10.1016/s0021-9681(87)80018-8. [DOI] [PubMed] [Google Scholar]
  27. Robins JM. Addendum to “A new approach to causal inference in mortality studies with a sustained exposure period – Application to control of the healthy worker survivor effect. Computers and Mathematics with Applications. 1987b;14:923–945. [Google Scholar]
  28. Robins JM. The control of confounding by intermediate variables. Statistics in Medicine. 1988;8:679–701. doi: 10.1002/sim.4780080608. [DOI] [PubMed] [Google Scholar]
  29. Robins JM. The analysis of randomized and non-randomized AIDS treatment trials using a new approach to causal inference in longitudinal studies. In: Sechrest L, Freeman H, Bailey A, editors. Health service research methodology: a focus on AIDS. NCHSR, U.S. Public Health Service; Washington, D.C: 1989. pp. 113–159. [Google Scholar]
  30. Robins JM. Correcting for non-compliance in randomized trials using structural nested mean models. Communications in Statistics. 1994;23:2379–2412. [Google Scholar]
  31. Robins JM. Structural nested failure time models. In: Armitage P, Colton T, Andersen PK, Keiding N, editors. Survival analysis, volume 6 of Encyclopedia of Biostatistics. 2 John Wiley and Sons; Chichester, UK: 1998. pp. 4372–4389. [Google Scholar]
  32. Robins JM, Blevins D, Ritter G, Wulfsohn M. G–estimation of the effect of prophylaxis therapy for pneumocystis carinii pneumonia on the survival of AIDS patients. Epidemiology. 1992;3:319–336. doi: 10.1097/00001648-199207000-00007. [DOI] [PubMed] [Google Scholar]
  33. Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiolog. Epidemiology. 2000;11:550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
  34. Robins JM, Rotnitzky A, Scharfstein DO. Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In: Halloran ME, Berry D, editors. Statistical models in epidemiology: the environment and clinical trials. Vol. 116. Springer-Verlag; New York: 1999. pp. 1–92. [Google Scholar]
  35. Robins JM, Rotnitzky A, Zhao LP. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association. 1995;90:106–121. [Google Scholar]
  36. Rubin DB. Bayesian inference for causal effects: the role of randomization. Annals of Statistics. 1978;6:34–58. [Google Scholar]
  37. Sabin CA, Phillips AN. Should HIV therapy be started at a CD4 cell count above 350 cells/microl in asymptomatic HIV-1-infected patients? Current Opinions in Infectious Diseases. 2009;22:191–197. doi: 10.1097/qco.0b013e328326cd34. [DOI] [PubMed] [Google Scholar]
  38. Smith DM, Strain MC, Frost SDW, Pillai SK, Wong JK, Wrin T, Liu Y, Petropolous CJ, Daar ES, Little SJ, Richman DD. Lack of neutralizing antibody response to HIV-1 predisposes to superinfection. Virology. 2006;355:1–5. doi: 10.1016/j.virol.2006.08.009. [DOI] [PubMed] [Google Scholar]
  39. Thompson MA, Aberg JA, Cahn P, Montaner JSG, Rizzardini G, Telenti A, Gatell JM, Guenthard HF, Hammer SM, Hirsch MS, Jacobsen DM, Reiss P, Richman DD, Volberding PA, Yeni P, Schooley RT. Antiretroviral Treatment of Adult HIV Infection 2010 Recommendations of the International AIDS SocietyUSA Panel. Journal of the American Association. 2010;304:321–333. doi: 10.1001/jama.2010.1004. [DOI] [PubMed] [Google Scholar]
  40. Van der Vaart AW. Cambridge series in statistical and probabilistic mathematics. Cambridge University Press; Cambridge: 1998. Asymptotic statistics. [Google Scholar]
  41. WHO Dept. of HIV/AIDS. Priority Interventions. HIV/AIDS prevention, treatment and care in the health sector. Version 2.0. 2010 Jul; http://www.who.int/hiv/pub/guidelines/9789241500234_eng.pdf.
  42. Wood R, Lawn SD. Should the CD4 threshold for starting ART be raised? Lancet. 2009;373:1314–1316. doi: 10.1016/S0140-6736(09)60654-1. [DOI] [PubMed] [Google Scholar]

RESOURCES