Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2015 Aug 3;17(1):135–148. doi: 10.1093/biostatistics/kxv025

Identifying a set that contains the best dynamic treatment regimes

Ashkan Ertefaie 1,*,, Tianshuang Wu 2,, Kevin G Lynch 3, Inbal Nahum-Shani 4
PMCID: PMC4679070  PMID: 26243172

Abstract

A dynamic treatment regime (DTR) is a treatment design that seeks to accommodate patient heterogeneity in response to treatment. DTRs can be operationalized by a sequence of decision rules that map patient information to treatment options at specific decision points. The sequential, multiple assignment, randomized trial (SMART) is a trial design that was developed specifically for the purpose of obtaining data that informs the construction of good (i.e. efficacious) decision rules. One of the scientific questions motivating a SMART concerns the comparison of multiple DTRs that are embedded in the design. Typical approaches for identifying the best DTRs involve all possible comparisons between DTRs that are embedded in a SMART, at the cost of greatly reduced power to the extent that the number of embedded DTRs (EDTRs) increase. Here, we propose a method that will enable investigators to use SMART study data more efficiently to identify the set that contains the most efficacious EDTRs. Our method ensures that the true best EDTRs are included in this set with at least a given probability. Simulation results are presented to evaluate the proposed method, and the Extending Treatment Effectiveness of Naltrexone SMART study data are analyzed to illustrate its application.

Keywords: Double robust, Marginal structural model, Multiple comparisons with the best, SMART designs

1. Introduction

A dynamic treatment regime (DTR) is a treatment design that seeks to accommodate patient heterogeneity in response to treatments (Murphy and others, 2001; Murphy, 2003; Robins, 2004). In DTRs the type andInline graphicor dose of the treatment is adapted over time according to the patient's characteristics and progress in treatment. At each decision point (i.e. specific point in time in which a treatment is to be considered or altered), decision rules are used to map individual characteristics to a specific type of treatment or dosage. Recently, there has been an increased interest in sequential, multiple assignment, randomized trials (SMARTs), which were developed specifically to provide empirical evidence that informs the construction of optimal DTRs (Lavori and Dawson, 2000; Nahum-Shani and others, 2012; Chakraborty and Moodie, 2013; Chakraborty and Murphy, 2014; Laber and others, 2014).

One scientific question motivating a SMART concerns the comparison of DTRs that are embedded in the design. It aims to identify the best DTR or the set that contains the best DTRs among those that are embedded in the design. In other words, the goal is to screen out ineffective DTRs. This question can be framed as a special case of the general multiple comparison problem.

Methods for multiple comparisons can be used to group sample means, such that within each group, population means are not significantly different (Scheffe, 1953; Tukey, 1953). Current approaches for identifying the set of best DTRs perform all possible comparisons among embedded DTRs (EDTRs). In such a setting, standard multiple comparison approaches used to control for Type I error result in a loss of statistical power (Hsu, 1984; Hsu, 1996). Consequently, important differences between DTRs might go undetected (Saville, 1990; Keselman and others, 1999). Here, we propose a more efficient approach for identifying the set of best DTRs. This approach builds on the work of Hsu (1981), which identifies the best set of means by conducting multiple comparisons with the best (MCB), namely by comparing the best mean with others. Applying this approach will result in fewer comparisons relative to standard approaches, and hence improved power.

The current manuscript will extend the MCB toolbox for analyzing data from SMART studies. The contribution of this paper is two-fold. First, we provide and illustrate, for the first time, a method that can be used to efficiently address an important scientific question that motivates many SMART studies. This question concerns the need to identify the optimal DTR, or several optimal DTRs from a list of DTRs embedded in a SMART study. Enabling researchers to address this scientific question can support clinical decision making, offering clinicians a set of efficacious DTRs to choose from based on other considerations such as cost and patient preferences. The second contribution concerns the correlation structure of the estimators derived from SMART data. The method proposed by Hsu requires a known correlation structure (up to a constant). In SMART, the correlation structure of estimators is not known a priori. Therefore, generalization of the method is warranted.

We briefly introduce SMART designs and explain the structure of SMART data in Section 2. We then present two methods to estimate the mean outcome under each DTR in Section 3. The framework of MCB in SMART settings is introduced in Section 4. We conduct a simulation study in Section 5 to examine the performance of our method. We illustrate the method with analyses of the Extending Treatment Effectiveness of Naltrexone (EXTEND) study in Section 6. The last section contains some concluding remarks. Proofs are given in Appendix A of the supplementary materials available at Biostatistics online.

2. Preliminaries

2.1. Sequential, multiple assignment, randomized trials

The SMART is a clinical trial design in which each individual proceeds through stages of treatments (Lavori and others, 2000; Murphy, 2005; Lei and others, 2012; Nahum-Shani and others, 2012). At each treatment stage, individuals are randomized to one of the available treatment options at that stage, where the subsequent treatment options may depend on an embedded tailoring variable observed at the current stage. For example, in the EXTEND study, at stage 1, patients were randomized to one of two definitions of non-response while receiving naltrexone (NTX): (1) Stringent criterion—a patient is a non-responder if (s)he has two or more heavy drinking days in the first 8 weeks; (2) Lenient criterion—a patient is a non-responder if (s)he has five or more heavy drinking days in the first 8 weeks. At stage 2, non-responders were re-randomized to combined behavioral intervention (CBI) Inline graphicNTX or CBI alone. Individuals who did not meet their non-response criterion were re-randomized to telephone disease management Inline graphic or NTX alone. Thus, in this two-stage design, the embedded tailoring variable is the responseInline graphicnon-response status to initial NTX.

2.2. Data structure

For simplicity, we focus on SMARTs with two stages. The observed data on each individual are given by a trajectory Inline graphic. Inline graphic, for Inline graphic is a set of covariates available at the beginning of stage Inline graphic. By Inline graphic, we denote the treatment options at the beginning of stage Inline graphic; Inline graphic is a binary variable that is coded 1 if an individual has been re-randomized at stage 2, and coded 0 otherwise. Finally, Inline graphic is the continuous primary outcome. The treatment and the covariate history through Inline graphic are denoted by Inline graphic and Inline graphic, respectively. We use lowercase letters to refer to the possible values of the corresponding capital letter random variable.

In SMART settings, the stage-2 treatment options may depend on embedded tailoring variables, which are part or all of the observed history up to and including time 2, and we denote them as Inline graphic. In the EXTEND study, Inline graphic is the response(R)Inline graphicnon-response(NR) status to stage-1 treatment (i.e. Inline graphic). Hence, for each individual, we conceptualize a v-treatment trajectory Inline graphic. For responders and non-responders, we set Inline graphic and Inline graphic, respectively, with probability 1. This basically means that, for responders, Inline graphic does not apply and vice versa. We use the v-treatment trajectory to model the marginal structural model (MSM) discussed in Section 3. Note, the v-treatment trajectory and treatment history are not necessarily the same. In fact, in this example, the treatment history is 2D, while the v-treatment trajectory is 3D.

2.3. Embedded DTRs

An EDTR is one DTR that participants can follow as part of the study design. In the EXTEND study, there are eight EDTRs: (1) start with lenient definition. If the patient is non-responsive, offer NTX+CBI; if the patient is responsive, offer NTX+TDM. (2) Start with lenient definition. If the patient is non-responsive, offer NTX+CBI; if the patient is responsive, offer NTX. (3) Start with lenient definition. If the patient is non-responsive, offer CBI; if the patient is responsive, offer NTX+TDM. (4) Start with lenient definition. If the patient is non-responsive, offer CBI; if the patient is responsive, offer NTX. The other four EDTRs are similar except that they start with stringent definition. Note that a given v-treatment trajectory Inline graphic can be consistent with more than one EDTR. For example, a responder to the lenient definition with Inline graphic is following both EDTRs (1) and (3).

3. Estimation

Let Inline graphic be the population outcome mean under the Inline graphicth EDTR for Inline graphic where Inline graphic is the number of EDTRs in a SMART. Here, we provide two methods that are based on weighted least squares minimizations and used throughout this paper as tools to estimate the mean outcome under each EDTR. The first approach would be to postulate an MSM Inline graphic for the outcome given the observed v-treatment trajectory Inline graphic and define Inline graphic as a known function of Inline graphic for all Inline graphic. Let Inline graphic be the empirical average. The parameters of the MSM can be estimated using the following estimating equation:

3. (3.1)

where Inline graphic, and

3.

where Inline graphic is the treatment option determined by EDTRInline graphic at stage 2 given Inline graphic, and Inline graphic is the treatment option determined by EDTRInline graphic at stage 1. The indicator function selects individuals whose treatment history is consistent with the Inline graphicth EDTR given Inline graphic. This method is referred to as inverse probability weighting (IPW) (Robins, 1999; Hernán and others, 2000; Robins and others, 2000). The treatment trajectory is used to define the MSM function. For example, in the EXTEND study, the MSM would be Inline graphic. We denote the solutions of this equation as Inline graphic. Hence, the mean outcome under each EDTR can be estimated as Inline graphic, where Inline graphic is a Inline graphic matrix. The Inline graphicth row of Inline graphic is the contrast corresponding to EDTRInline graphic (see Section 5).

The second approach is based on the augmented IPW (AIPW), which is a more efficient version of IPW developed by Robins and others (2008) and Orellana and others (2010). Let Inline graphic. The corresponding estimating equation for a two-stage design is given by

3. (3.2)

where Inline graphic, Inline graphic, and

3.

To obtain estimators of Inline graphic, we postulate parametric models for the unknown functions Inline graphic and Inline graphic parameterized by Inline graphic and replace them with their estimated values Inline graphic and Inline graphic. The estimates may be obtained by fitting two least squares models. We denote the solutions of (3.2) as Inline graphic and, similar to the first approach, we define Inline graphic.

Estimator (3.2) is double robust in the sense that it results in an unbiased estimate of Inline graphic if either Inline graphic or the treatment assignment probabilities are correctly specified (van der Laan and Robins, 2003; Bang and Robins, 2005; Davidian and others, 2005; Orellana and others, 2010). Although we are focusing on randomized trials and treatment assignment probabilities are known by design, for efficiency we estimate these probabilities non-parametrically using the available data (Robins and others, 1995; Hirano and others, 2003). One may also postulate a parametric model to estimate these probabilities given the observed covariateInline graphictreatment history.

The following proposition provides the asymptotic behaviors of estimators Inline graphic and Inline graphic obtained by (3.1) and (3.2), respectively, which is an immediate consequence of (Orellana and others, 2010, Lemma 3). In the proposition, the superscript Inline graphic denotes IPW or AIPW.

Proposition 3.1. —

Let Inline graphic, where Inline graphic is a Inline graphic matrix with the Inline graphicth row of Inline graphic being the contrast corresponding to the Inline graphicth EDTR. Then, under the standard regularity assumptions, Inline graphic, where Inline graphic, and Inline graphic with

graphic file with name M82.gif

The asymptotic variance Inline graphic may be estimated consistently by replacing the expectations with expectations with respect to the empirical measure and Inline graphic with its estimate Inline graphic and denoted as Inline graphic.

4. Multiple comparison with the best

Let Inline graphic be the true set of best EDTRs and Inline graphic be a set of EDTRs that cannot be differentiated from the best EDTR using the available data. In the previous section, we discussed our procedures to estimate the mean outcome under each EDTR, Inline graphic and Inline graphic for Inline graphic. Since our methodology holds for both IPW and AIPW approaches to estimation, for simplicity of notation, we drop the superscripts IPW and AIPW and refer to the estimator of Inline graphic as Inline graphic. In this section, we generalize the MCB method introduced by Hsu (1981) to SMART settings. The goal is to find EDTRs that are not significantly different from the EDTR with the maximum outcome, say Inline graphic. Hence, a natural criterion would be to include index Inline graphic in the set Inline graphic if the standardized difference Inline graphic is greater than a constant for all Inline graphic. This can be written as

4. (4.1)

where Inline graphic is a constant and Inline graphic, which can be estimated using the variance formula in Proposition 3.1. The challenge is to find Inline graphic such that Inline graphic includes the true best EDTR with probability at least Inline graphic; that is, Inline graphic for any Inline graphic. In cases where there are more than one best EDTR, Inline graphic includes each index Inline graphic with at least Inline graphic probability. This condition will be satisfied if we find Inline graphic such that under the null hypothesis (i.e. all EDTRs are equally good), the set Inline graphic includes each index Inline graphic with probability Inline graphic. In other words, when Inline graphic is known, Inline graphic must satisfy

4. (4.2)

where Inline graphic are multivariate normal random variables with mean 0 and covariance matrix Inline graphic such that Inline graphic. The above equality can be written as

4.

where Inline graphic is the marginal cdf of Inline graphic. Note that, for Inline graphic, the constant Inline graphic for Inline graphic. Hence, in our setting where Inline graphic represents the Type I error rate, we can assume that Inline graphic is a positive constant.

Hsu (1981) present an equation that can be used to find the constant Inline graphic when the structure of the covariance matrix Inline graphic is known up to a constant. This is the case in a standard regression where Inline graphic. Note that in this case, given the design matrix, Inline graphic is known up to a constant Inline graphic. In Hsu's setting, the constant Inline graphic is a function of the correlation matrix and thus it is not a function of Inline graphic. In the MSM, however, the structure of the design matrix is random because it depends on intermediate outcomes (i.e. variables observed before stage 2 and after stage 1 treatment assignment) that are not included in the design matrix, such as response or non-response status (i.e. embedded tailoring variables). In such a setting, the constant Inline graphic will be a function of an unknown Inline graphic which is estimated by Inline graphic using the observed data. Theorem 4.1 generalizes the idea in Hsu to cases where the structure of the design matrix is unknown. We use the notation Inline graphic to reflect the dependence of Inline graphic to Inline graphic.

Theorem 4.1. —

Define the estimated set of best EDTRs as Inline graphic, where Inline graphic, and Inline graphic satisfies

graphic file with name M144.gif

with Inline graphic being multivariate normal random variables with mean 0 and unknown covariance matrix Inline graphic, which is estimated by Inline graphic. Then, asymptotically, Inline graphic contains the true best EDTR with probability at least Inline graphic.

Let Inline graphic be the difference between the Inline graphicth and the Inline graphicth EDTR. The probability of including an EDTRInline graphic in the estimated set of best EDTRs for any given Inline graphic is

4. (4.3)

where Inline graphic and is distributed as a standard normal random variable. Accordingly, the estimated set size (ESS) of Inline graphic is defined as Inline graphic. Note, under the null hypothesis, where all EDTRs are equally good, Inline graphic. The following theorem shows that the probability of including an inferior Inline graphic in the estimated set Inline graphic decays to zero exponentially for Inline graphic as the difference between the best and the Inline graphicth EDTR increases.

Theorem 4.2. —

Let Inline graphic follow a multivariate normal distribution with mean zero and unknown variance matrix. Define Inline graphic for Inline graphic as non-negative random variables. Let Inline graphic, Inline graphic; we have

graphic file with name M169.gif (4.4)

where Inline graphic is a constant that depends on Inline graphic and Inline graphic but is independent of Inline graphic.

Note that Inline graphic, which decays to zero with rate Inline graphic. This implies that, for a fixed Inline graphic, as Inline graphic increases the probability of including an inferior Inline graphic to Inline graphic decreases with rate Inline graphic. Also, in the statement of Theorem 4.2, replacing Inline graphic with Inline graphic, for Inline graphic, shows the exponential decay rate in (4.3).

Remark 4.3. —

Let Inline graphic and Inline graphic be the covariance matrix of Inline graphic and Inline graphic, respectively. Since Inline graphic, for any fixed sample size and a set of Inline graphics, the efficient estimator AIPW results in an ESS which is less than or equal to the one obtained by the inefficient estimator IPW (see Figures 1 and 2).

Fig. 1.

Fig. 1.

Simulation SMART design Example 1: the vertical axes are the estimated set (of best) size (ESS) and horizontal axes are the difference between the best and the second best EDTR.

Fig. 2.

Fig. 2.

Simulation SMART design Example 2: the vertical axes are the estimated set (of best) size (ESS) and horizontal axes are the difference between the best and the second best EDTR.

5. Simulation study

This section provides empirical evidence for the theoretical results presented in the manuscript. We compare the estimated sets of best obtained by the IPW and AIPW methods and show that the latter method screens out the ineffective EDTRs more efficiently. We examine the performance of the proposed method using two different types of SMART designs. We describe that the form of the MSM Inline graphic may vary based on the design structure. We also discuss the effect that misspecifying the function Inline graphic has on estimating the parameters of the MSM and the mean outcome under each EDTR.

In all simulation scenarios, baseline variables Inline graphic and Inline graphic are generated from standard normal, and Inline graphic is based on a Bernoulli distribution with probability 0.5. The intermediate outcomes are Inline graphic and Inline graphic. The estimator IPW and AIPW refer to (3.1) and (3.2), respectively, while AIPWInline graphic refers to an AIPW estimator where Inline graphic functions are misspecified. Some of the tables and figures corresponding to this section are presented in Appendix C of supplementary material available at Biostatistics online.

5.1. SMART design: Example 1

This is a type of SMART design in which just a subset of individuals are re-randomized at stage 2. In our simulation, this subset is non-responders to stage-1 treatment (see Figure 1 of supplementary material available at Biostatistics online). Thus, the embedded tailoring variable Inline graphic is the indicator of responder or non-responder status, respectively. Four DTRs are embedded in this design depending on v-treatment trajectory Inline graphic; these are listed in Table 1 of supplementary material available at Biostatistics online. Note that because there is only one treatment option for responders, the v-treatment trajectory does not include Inline graphic. We generate these SMART data with sample sizes 100, 200, 300, and 400 from the following generative model. The stage-2 treatment option Inline graphic is generated from a Bernoulli distribution with probability 0.5. The outcome is generated from a normal distribution with mean Inline graphic, with variance Inline graphic, where Inline graphic. The main effect of treatment options are parameterized with Inline graphic. The true Inline graphics are given by

5.1.

Table 1.

Simulation SMART design Example 1: inference about the parameters Inline graphic using IPW, AIPW, and AIPWInline graphic where the latter represents the misspecified scenario

Inline graphic
Inline graphic
IPW
AIPW
AIPWInline graphic
IPW
AIPW
AIPWInline graphic
Parameter Bias SD Bias SD Bias SD Bias SD Bias SD Bias SD
Inline graphic 0.010 0.24 0.002 0.23 0.007 0.24 0.004 0.12 0.000 0.12 0.007 0.12
Inline graphic 0.001 0.24 0.002 0.18 0.005 0.18 0.011 0.12 0.002 0.09 0.002 0.10
Inline graphic 0.002 0.17 0.003 0.07 0.002 0.10 0.000 0.08 0.002 0.04 0.004 0.05
Inline graphic 0.013 0.41 0.007 0.32 0.014 0.39 0.015 0.21 0.004 0.16 0.013 0.20
Inline graphic 0.011 0.33 0.003 0.27 0.004 0.31 0.007 0.17 0.000 0.14 0.009 0.15
Inline graphic 0.009 0.41 0.003 0.32 0.010 0.39 0.015 0.21 0.000 0.16 0.005 0.20
Inline graphic 0.007 0.33 0.003 0.27 0.000 0.31 0.007 0.17 0.004 0.14 0.001 0.15

We also consider a misspecified scenario where Inline graphic and Inline graphic are assumed to be working models. Moreover, the MSM is

5.1.

Hence, the true parameter value Inline graphic, which means, for Inline graphic, Inline graphic is the true best EDTR and, for Inline graphic, Inline graphic is the true best EDTR. Table 1 presents the point estimate and standard errors of the parameters Inline graphic and Inline graphic estimated using IPW, AIPW and AIPWInline graphic, where

5.1.

The rows of this matrix represent Inline graphic listed in Table 1 of supplementary material available at Biostatistics online. In Table 1, we set Inline graphic and generated 1000 datasets of sizes 100 and 400. Our results show that AIPW reduces the standard error by up to 60% compared with IPW, and even when Inline graphic functions are misspecified Inline graphic maintains unbiasedness, but some of the standard errors increase. In fact, under our misspecification scenario AIPW still has better performance than IPW. We see a similar pattern in estimation of the mean outcome under different EDTRs.

Figure 1 shows how fast the size of the set of best Inline graphic converges to 1 as Inline graphic increases when the parameters of each EDTR is estimated using IPW and AIPW. Note that, for Inline graphic, the true set size Inline graphic is 1. For each Inline graphic, we generated 500 datasets and defined the ESS as the empirical average of the set sizes for each dataset. This figure shows that when the parameters Inline graphic of the MSM are estimated using AIPW, the ESS decreases to 1 faster than when using IPW. This is due to the more efficient estimation of Inline graphics. Documented R code for this example is available in Appendix B of supplementary material available at Biostatistics online.

5.2. SMART design: Example 2

In some SMART designs, stage-2 randomization depends on prior treatment and an intermediate outcome such as response indicator (see Figure 2 of supplementary material available at Biostatistics online). We generate datasets of sizes 100, 200, 300 and 400 from the following generative model. The stage-2 treatment options are generated from a multinomial distribution with probability 0.25 coded as Inline graphic and 4. Let Inline graphic be the non-response and response indicator, respectively. Then, if Inline graphic (i.e. Inline graphic satisfies condition A), there is no randomization, while individuals with Inline graphic (i.e. Inline graphic satisfies condition B) will be randomized to one of the four stage-2 treatment options. Hence, the v-treatment trajectory in this example is Inline graphic. Five DTRs are embedded in this design depending on the treatment trajectory Inline graphic; these are listed in Table 2 of supplementary material available at Biostatistics online.

Table 2.

Simulation SMART design Example 2: inference about the parameters Inline graphic using IPW, AIPW, and Inline graphic where the latter represents the misspecified scenario

Inline graphic
Inline graphic
IPW
AIPW
AIPWInline graphic
IPW
AIPW
Inline graphic
Parameter Bias SD Bias SD Bias SD Bias SD Bias SD Bias SD
Inline graphic 0.004 0.35 0.001 0.30 0.005 0.30 0.003 0.18 0.001 0.15 0.002 0.15
Inline graphic 0.004 0.75 0.003 0.40 0.002 0.48 0.004 0.37 0.001 0.20 0.002 0.23
Inline graphic 0.010 0.81 0.005 0.30 0.003 0.47 0.000 0.40 0.002 0.14 0.004 0.23
Inline graphic 0.011 0.81 0.008 0.30 0.008 0.49 0.010 0.40 0.005 0.14 0.001 0.23
Inline graphic 0.013 0.81 0.006 0.31 0.001 0.47 0.005 0.40 0.006 0.14 0.003 0.24
Inline graphic 0.004 0.35 0.001 0.30 0.005 0.30 0.003 0.18 0.001 0.15 0.002 0.15
Inline graphic 0.008 0.66 0.004 0.37 0.007 0.46 0.007 0.32 0.002 0.18 0.004 0.21
Inline graphic 0.018 0.66 0.009 0.38 0.010 0.45 0.007 0.33 0.004 0.19 0.008 0.21
Inline graphic 0.019 0.66 0.012 0.38 0.015 0.48 0.017 0.33 0.007 0.19 0.005 0.22
Inline graphic 0.021 0.66 0.010 0.38 0.006 0.45 0.012 0.33 0.008 0.18 0.007 0.21

The outcome is generated from a normal distribution with mean Inline graphic with variance Inline graphic, where Inline graphic. Thus, the true Inline graphics are

5.2.

We also consider a misspecified scenario where Inline graphic and Inline graphic are assumed to be working models. The MSM is

5.2.

Hence, the true parameter value Inline graphic, which means that, for positive and negative Inline graphics, Inline graphic and Inline graphic are the best EDTRs, respectively. Table 2 presents the bias and standard errors of the parameters Inline graphic and Inline graphic estimated using IPW, AIPW, and Inline graphic, where

5.2.

The rows of this matrix represent Inline graphic listed in Table 2 of supplementary material available at Biostatistics online. In Table 2, we set Inline graphic and generated 1000 datasets of sizes 100 and 400. Our results show that AIPW reduces the standard error of Inline graphics and Inline graphics by up to 55% compared with IPW. The misspecified scenario, where the interaction terms in both Inline graphic functions are ignored, results in estimators with slightly larger standard errors compared with AIPW.

Figure 2 shows how fast the size of the set of best converges to 1 as Inline graphic grows when the parameters of each EDTR are estimated using IPW and AIPW. Note that, for Inline graphic the true set size Inline graphic is 1. For each Inline graphic, we generated 500 datasets and defined the ESS as the empirical average of the set sizes for each dataset. This figure shows that when the parameters Inline graphic of the marginal model are estimated using AIPW, the ESS decreases to 1 faster than when using IPW. This is due to more efficient estimation of Inline graphics. The plot of ESS when estimated using Inline graphic is omitted since it is similar to IPW in this simulation.

6. Illustrative data analysis

The EXTEND study was a 24-week, multistage clinical trial that enrolled alcohol-dependent patients (Lei and others, 2012). At stage 1, patients are randomized with probability 0.5 to either the stringent or lenient definitions of non-response while receiving NTX. Participants were assessed weekly for drinking behavior, and starting at week 3, as soon as the participant met hisInline graphicher assigned criterion for early non-response, heInline graphicshe was immediately re-randomized to one of two “rescue” tactics: (1) offering CBI in addition to NTX (i.e. Inline graphic); or (2) offering CBI alone (i.e. CBI). Participants who did not meet their assigned criterion for early non-response by the end of week 8 (i.e. responders to NTX) were re-randomized at that point (i.e. end of week 8) to one of two “maintenance” tactics: either (1) adding TDM to NTX (i.e. NTX+TDM) or offering NTX alone (NTX). Figure 3 (Appendix C of supplementary material available at Biostatistics online) depicts this two-stage SMART design.

For illustration, we focus on a simplified version of this trial. Let the primary outcome Inline graphic denote the Penn Alcohol Craving Scale (PACS) score over 24 weeks. Lower PACSs are preferable. Let Inline graphic denote the non-response criterion coded as Inline graphic for stringent and Inline graphic1 for lenient. The embedded tailoring variable Inline graphic in this design is the responseInline graphicnon-response status. The stage-2 treatment options for responders are NTX (Inline graphic) and Inline graphic (Inline graphic) and for non-responders the rescue treatment options are CBI (Inline graphic) and Inline graphic (Inline graphic). Additionally, let Inline graphic denote the indicator for whether Inline graphic or not Inline graphic the patient was a responder to the initial NTX treatment. Figure 3 in Appendix C of supplementary material available at Biostatistics online shows the number of patients assigned to each treatment option. By design, there are Inline graphic EDTRs in this SMART based on different combinations of Inline graphic, which are listed in Table 3 of supplementary material available at Biostatistics online.

Table 3.

Extend trial: inference about the parameters Inline graphic using IPW and AIPW

IPW
AIPW
Parameter Est. SD Est. SD
Inline graphic 8.86 0.45 8.84 0.47
Inline graphic Inline graphic0.99 0.45 Inline graphic0.90 0.44
Inline graphic Inline graphic0.24 0.34 Inline graphic0.09 0.27
Inline graphic Inline graphic0.07 0.28 Inline graphic0.21 0.13
Inline graphic 7.56 0.76 7.65 0.67
Inline graphic 7.71 0.74 8.06 0.67
Inline graphic 8.05 0.71 7.83 0.70
Inline graphic 8.19 0.69 8.24 0.70
Inline graphic 9.53 0.81 9.44 0.76
Inline graphic 9.68 0.80 9.85 0.77
Inline graphic 10.02 0.83 9.62 0.70
Inline graphic 10.17 0.82 10.03 0.72

Baseline variables include PACS before stage 1 (Inline graphic) and gender (Inline graphic). The intermediate outcomes are the average PACS during stage 1 (Inline graphic) and the standard error of the measured PACS during stage 1 (Inline graphic). We consider the following MSM: Inline graphic. One may add the interaction terms Inline graphic and Inline graphic to this model. Also, we consider Inline graphic and Inline graphic.

We estimated the parameter vector Inline graphic and Inline graphic using both the IPW (3.1) and AIPW (3.2) estimators and the results are presented in Table 3, where

6.

The rows of this matrix represent Inline graphic listed in Table 3 of supplementary material available at Biostatistics online. The point estimate and standard errors for Inline graphic and Inline graphic are very close using both estimators. However, the parameters corresponding to Inline graphic and Inline graphic have smaller standard errors when estimated using AIPW. Moreover, our procedure screens out Inline graphic and Inline graphic when the parameter vector Inline graphic is estimated using AIPW, but using IPW results in keeping all eight EDTRs in the set of best. In other words, when using MCB with the AIPW approach to estimate the mean outcome under each EDTR, results indicated that DTRs that begin with NTX, classifies patients as non-responders by using a stringent criterion, and offers CBI alone to non-responders and NTX or Inline graphic to responders, do not belong to the set of best EDTRs.

7. Discussion

An important research question motivating many SMART studies concerns the selection of the best (i.e. most efficacious) DTR among a set of DTRs that are embedded in a SMART. However, this is often not possible due to a small sample size. In this manuscript, we propose a method that can be used to identify the set that contains the best DTR. We frame the problem as a special case of multiple comparison and show that the constructed set of best contains the true best DTR with at least a given probability. We use the AIPW estimator to estimate the mean under each DTR, and our simulation results show that, for any given sample size, the cardinality of the constructed set of best is less than the cardinality obtained by IPW estimators, while maintaining the Type I error rate. Moreover, we prove that the probability of inclusion of an inferior DTR in the constructed set of best decays exponentially as the difference between the best and the inferior DTR grows.

Currently most SMART designs are sized such that an investigator can detect either a given stage-1 or stage-2 treatment effect or a given difference between two DTRs with a given probability. One important extension of this work would be to devise a method that can be used to plan SMART sample sizes such that the constructed set of best includes at most Inline graphic DTRs, for a fixed difference between the best and the worst DTRs, with a given probability. This will be more consistent with the goal of SMART designs in many applications.

Supplementary material

Supplementary Material is available at http://biostatistics.oxfordjournals.org.

Funding

This work was supported in part by grants P50 DA010075, R01 AA019092, R01 AA014851, RC1 AA019092, SES 1260782, P01 AA016821 from the National Institute on Drug Abuse (NIDA) and National Science Foundation (NSF).

Supplementary Material

Supplementary Data

Acknowledgments

The authors are grateful for valuable comments from Professor Susan Murphy and Xi Lu. Conflict of Interest: None declared.

References

  1. Bang H., Robins (2005). Doubly robust estimation in missing data and causal inference models. Biometrics 614, 962–972. [DOI] [PubMed] [Google Scholar]
  2. Chakraborty B., Moodie E. (2013) Statistical Methods for Dynamic Treatment Regimes. Berlin: Springer. [Google Scholar]
  3. Chakraborty B., Murphy S. A. (2014). Dynamic treatment regimes. Annual Review of Statistics and its Application 1, 447–464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Davidian M., Tsiatis A., Leon S. (2005). Semiparametric estimation of treatment effect in a pretest–posttest study with missing data. Statistical Science 203, 261–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Hernán M. Á., Brumback B., Robins J. M. (2000). Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology 115, 561–570. [DOI] [PubMed] [Google Scholar]
  6. Hirano K., Imbens G. W., Ridder G. (2003). Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 714, 1161–1189. [Google Scholar]
  7. Hsu J. (1996) Multiple Comparisons: Theory and Methods. London: Chapman & Hall, CRC Press. [Google Scholar]
  8. Hsu J. C. (1981). Simultaneous confidence intervals for all distances from the “Best”. The Annals of Statistics 95, 1026–1034. [Google Scholar]
  9. Hsu J. C. (1984). Constrained simultaneous confidence intervals for multiple comparisons with the best. The Annals of Statistics 123, 1136–1144. [Google Scholar]
  10. Keselman H., Cribbie R., Holland B. (1999). The pairwise multiple comparison multiplicity problem: an alternative approach to familywise and comparison wise Type I error control. Psychological Methods 41, 58. [Google Scholar]
  11. Laber E. B., Lizotte D. J., Qian M., Pelham W. E., Murphy S. A. (2014). Dynamic treatment regimes: technical challenges and applications. Electronic Journal of Statistics 81, 1225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Lavori P. W., Dawson R. (2000). A design for testing clinical strategies: biased adaptive within-subject randomization. Journal of the Royal Statistical Society: Series A (Statistics in Society) 1631, 29–38. [Google Scholar]
  13. Lavori P. W., Dawson R., Rush A. J. (2000). Flexible treatment strategies in chronic disease: clinical and research implications. Biological Psychiatry 486, 605–614. [DOI] [PubMed] [Google Scholar]
  14. Lei H., Nahum-Shani I., Lynch K., Oslin D., Murphy S. (2012). A SMART design for building individualized treatment sequences. Annual Review of Clinical Psychology 8, 21–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Murphy S., Van Der Laan M., Robins J. (2001). Marginal mean models for dynamic regimes. Journal of the American Statistical Association 96456, 1410–1423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Murphy S. A. (2003). Optimal dynamic treatment regimes. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 652, 331–355. [Google Scholar]
  17. Murphy S. A. (2005). An experimental design for the development of adaptive treatment strategies. Statistics in Medicine 2410, 1455–1481. [DOI] [PubMed] [Google Scholar]
  18. Nahum-Shani I., Qian M., Almirall D., Pelham W. E., Gnagy B., Fabiano G. A., Waxmonsky J. G., Yu J., Murphy S. A. (2012). Experimental design and primary data analysis methods for comparing adaptive interventions. Psychological Methods 174, 457–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Orellana L., Rotnitzky A., Robins J. M. (2010). Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, part I: main content. The International Journal of Biostatistics 62, 1557–4679. [PubMed] [Google Scholar]
  20. Robins J., Orellana L., Rotnitzky A. (2008). Estimation and extrapolation of optimal treatment and testing strategies. Statistics in Medicine 2723, 4678–4721. [DOI] [PubMed] [Google Scholar]
  21. Robins J. M. (1999). Association, causation, and marginal structural models. Synthese 1211, 151–179. [Google Scholar]
  22. Robins J. M. (2004). Optimal structural nested models for optimal sequential decisions. Proceedings of the Second Seattle Symposium in Biostatistics. Berlin: Springer, pp. 189–326.
  23. Robins J. M., Hernan M. A., Brumback B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology 115, 550–560. [DOI] [PubMed] [Google Scholar]
  24. Robins J. M., Rotnitzky A., Zhao L. P. (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association 90429, 106–121. [Google Scholar]
  25. Saville D. J. (1990). Multiple comparison procedures: the practical solution. The American Statistician 442, 174–180. [Google Scholar]
  26. Scheffe H. (1953). A method for judging all contrasts in the analysis of variance. Biometrika 40(1–2), 87–110. [Google Scholar]
  27. Tukey J. W. (1953) The Problem of Multiple Comparisons: Introduction and Parts A, B, and C. Princeton: Princeton University. [Google Scholar]
  28. van der Laan M., Robins J. (2003) Unified Methods for Censored Longitudinal Data and Causality. Berlin: Springer. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES