Abstract
TechStep is a randomized trial of a mobile health interventions targeted towards transgender adolescents. The interventions include a short message system, a mobile-optimized web application, and electronic counseling. The primary outcomes are self-reported sexual risk behaviors and uptake of HIV preventing medication. In order that we may evaluate the efficacy of several different combinations of interventions, the trial has a sequentially randomized design. We use a causal framework to formalize the estimands of the primary and key secondary analyses of the TechStep trial data. Targeted minimum loss-based estimators of these quantities are described and studied in simulation.
1. Introduction
Transgender individuals are estimated to make up between 0.39% and 0.53% of the US population [7, 14], with higher percentages of high school students identifying as transgender [9]. Transgender, gender non-conforming, and gender non-binary (referred to here collectively as trans) demonstrate elevated rates of risky sexual behaviors relative to their cisgender peers, leaving this population vulnerable to infection with human immunodeficiency virus (HIV) [21]. A potential avenue for improvement in the health of trans youth is through mobile technology-based interventions. Such interventions circumvent the face-to-face interaction required for traditional health care delivery, and are therefore highly scalable. Moreover, technology-based interventions may be extremely low cost, confidential, and can cultivate communication amongst users, which may be important to trans youth who face high levels of stigma in healthcare and employment settings [20, 18]. It is therefore of interest to design mobile interventions tailored to trans youth, and to evaluate the efficacy of these interventions in preventing HIV infection.
The TechStep study is an ongoing randomized controlled trial (RCT) of mHealth interventions in high-risk HIV-negative trans youths and young adults (15–24 years old). Three trans-specific mHealth interventions were developed for TechStep: a short messaging service (SMS or text), a mobile-optimized webapp, and an eCoaching intervention, which offers one-on-one risk behavior counseling administered through a mobile video conferencing service [13, 30]. The goal of the trial is to evaluate the efficacy of these interventions for reducing sexual risk behaviors (e.g., condomless anal or vaginal intercourse, engagement in sex work, sex while feeling the effects of alcohol or drugs) and increasing uptake of pre-exposure prophylaxis (PrEP). The trial will have a total of nine months of follow-up with clinical assessments on participants taken at baseline and at three-, six-, and nine months after enrollment.
The availability of multiple intervention modalities complicates the design of an RCT to evaluate its efficacy. We must consider whether the mHealth interventions should be offered individually, all together, or in a sequential fashion. A priori eCoaching might be expected to be the most effective intervention of the three; however, from an implementation perspective, scalability may be limited due to its cost. These and other considerations led to a sequential multi-arm randomized trial (SMART) design for the TechStep trial.
SMARTs randomize participants to an initial intervention and monitor participants after receipt of the intervention [12, 15, 2, 16, 17, 6]. Based on interim data, participants may become eligible for re-randomization to a new intervention. After receipt of the intervention, participants are monitored again. Based on these new data (and all previous data), participants may again become eligible for re-randomization to a new intervention. This process of monitoring and re-randomizing to different interventions continues until a pre-specified timepoint. The data generated from such a trial can be used to evaluate the efficacy of one or several treatment rules, that is, rules for how participants should be assigned treatment at each time.
SMART trials have great potential to advance HIV prevention science. Results of such trials can move researchers closer to personalized care for individuals, and enable evaluation of interventions that are feasible to implement. This is highly relevant in the field of HIV prevention, where competing prevention modalities are emerging and must be evaluated head-to-head in randomized trials. This is true not just of behavioral interventions, like the ones considered here, but also when considering pharmacological interventions such as pre-exposure prophylaxis (PrEP) or preventive HIV vaccines. Current thinking in the HIV prevention field is that some prevention modalities will be acceptable and effective in some individuals, but not in others. Thus, utilizing trial designs that can aid our understanding of which interventions are effective for which individuals may help improve comprehensive efforts at HIV prevention [8]. Nevertheless, in spite of the potential benefits of SMART designs, their usage in the context of HIV prevention is extremely limited. In this paper, we discuss the motivation for a SMART design in the TechStep trial, the statistical methods needed to provide a robust and efficient analysis of the trial, and a simulation study to assess the power of the design to answer key questions of interest. We hope that this work will provide researchers with a better understanding for the motivation of SMART HIV prevention trials, as well as some ideas for how such trials can be planned and analyzed in practice.
2. Scientific questions and implications for design
2.1. Primary questions
To identify a primary question of interest, we first define our outcomes and interventions of interest. For the outcome, we focus on the cumulative number of risky sexual encounters, defined as acts of condomless anal intercourse, engagement in sex work, or sex while feeling the effects of alcohol or drugs. We will consider an intervention effective if it reduces the average number of risky sexual encounters over the duration of the trial.
Ideally, a primary analysis should assess an intervention that is feasible to implement in a broader clinical setting [5]. We reasoned that many clinics could offer either a the text- or an webapp-based intervention, but not both. Moreover, such low intensity interventions may be sufficiently efficacious for many participants in reducing risky sexual behaviors. However, there may be individuals for whom the low intensity interventions would be insufficient to change risk behaviors. These individuals may derive greater benefit from the more intensive and personalized eCoaching intervention. For these reasons, we are motivated to compare three interventions: (i) control intervention consisting of a static website with trans-specific HIV and risk reduction information and local and national trans resources, (ii) text + eCoaching, where participants receive text messaging for ninety days and those who do not improve their risk behaviors receive three months of eCoaching in addition to text messaging, and (iii) webapp + eCoaching, where participants receive access to a webapp for ninety days and those who do not improve receive eCoaching for three months in addition to the webapp. The primary goal of the trial is to compare average number of risky encounters for participants who receive intervention (ii) vs. (i) and who receive intervention (iii) vs. (i).
2.2. Secondary questions
Some health clinics may be unable to staff trained eCoaching counselors, and thus interventions (ii) and (iii) may be infeasible to implement in some clinical settings. However, clinics could likely support either a text message system or a webapp, since these interventions are extremely low cost. Thus, there is good motivation to evaluate two additional interventions: (iia) text messaging only, where participants receive text messaging for six months and (iiia) webapp only, where participants receive access to a mobile-optimized webapp for six months. A key secondary goal of the trial is to compare the average number of risky encounters for participants who receive intervention (iia) vs. (i) and who receive intervention (iiia) vs. (i).
While an eCoaching intervention is the most intensive and costly mHealth intervention we considered, it is also hypothesized a-priori to be the most effective. Thus, it is also relevant to assess the marginal benefit of offering eCoaching to participants whose behavior does not improve under other mHealth interventions. To this end, another key secondary goal of the trial is to compare the average number of risk encounters for participants who receive intervention (ii) vs. (iia) and (iii) vs. (iiia). Such information could be provided to key stakeholders weighing the benefits of implementing a more intensive and expensive eCoaching intervention on top of a low intensity, first line mobile health intervention.
3. Causal Estimands
In this section, we detail the design of the TechStep trial that enables us to answer the questions above. We introduce notation that describes data collected in trial and translate the primary and secondary questions into formal causal estimands. Finally, we establish identifiability of these estimands based on the observed trial data.
Figure 1 illustrates the design of the TechStep study. At the baseline visit, participant information is collected, which we denote by L0 = (Y0,W0), where Y0 denotes sexual risk behaviors and W0 other (e.g., demographic) information collected. While in the actual TechStep trial, Y0 will consist of multiple measures of risk behaviors, for simplicity, here we consider that Y0 is scalar-valued. We use A0 to denote the intervention randomly assigned with equal probability to participants at enrollment, with A0 = 0 denoting that standard of care was assigned, A0 = 1 denoting that a first-line text intervention was assigned, and A0 = 2 denoting that a first-line webapp intervention was assigned. We use C1 to denote whether a participant attends the three-month clinic visit, so that C1 = 0 denotes that the participant did not attend the visit, and C1 = 1 denotes that the participant did attend the visit. For participants who attend the three-month visit, we measure additional information on sexual risk behaviors over the last three months. In particular, L1 = (Y1,W1) denotes a vector of participant information collected at the three-month visit, where, as above, we use Y1 to denote sexual risk behaviors over the first three months of the study, and W1 denotes other information collected on participants. Based on this information, participants may become eligible for re-randomization. For simplicity, here we consider that participants with Y1 ≥ Y0 and Y1 ≠ 0, i.e., those who have at least some risky behaviors and whose behaviors do not improve under the first-line intervention, are re-randomized 2:1 to either “step-up” to eCoaching or to maintain their current intervention. Individuals who have improved risk behaviors Y1 <Y0 or Y1 = 0 are not eligible for re-randomization and so maintain their current intervention with probability one. We use A1 to denote the intervention assigned to participants at the three-month visit and set A1 = 3 if a participant is chosen to step-up to eCoaching, while A1 = A0 otherwise. Continuing on, C2 denotes whether a participant attends the six month clinic visit and L2 = (Y2,W2) to denote information collected at this visit. Similarly, C3 and Y3 are used to denote the same quantities collected at the nine month clinic visit. We denote by Y = Y1 +Y2 +Y3 the cumulative outcome over the duration of the study. Thus, the time-ordered data for a typical study participant can be represented as a random variable O = (L0,A0,C1,L1,Y1,A1,C2,L2,Y2,C3,Y3). We denote by ℙ the distribution of the observed data unit O and assume a nonparametric model for ℙ.
Figure 1:

A design schematic for the TechStep trial. Randomization probabilities are shown beneath A. labels. At each study visit, we note intervention label for participants along each path. Censoring variables are also shown along paths with the question mark denoting that the probability of censoring is unknown and must be estimated from the data.
To formalize our causal question of interest, we use a nonparametric structural equation model (NPSEM). This model assumes that each observed variable is a function of its parent variables and an exogenous error term. We adopt a model that assumes the parents of a given observed variable are all variables that temporally precede it. For example, L0 has no parent variables, the parent of A0 is L0, the parent of C1 is A0 and L0, and so on. We define interventions on this NPSEM that correspond to interventions (i), (ii), (iia), (iii), and (iiia) above. In addition to intervening on the mHealth variables (A0 and A1), these interventions enforce that participants attend each clinic visit. We define a function d that maps A0,Y0,Y1 to {0,1,2,3}, and defines our rules for stepping people up to the eCoaching intervention,
The interventions of interest are shown in Table 1. We denote by Y(i), Y(ii), Y(iia), Y(iii), Y(iiia) the counterfactual outcomes generated by interventions (i), (ii), (iia), (iii), and (iiia), respectively. For example, represents the total number of reported risky sexual behaviors over the duration of the trial, if, possibly counter to fact, individual j received standard of care throughout the trial. We use Y(·) to generically denote one of these counterfactual outcomes. The distribution of the counterfactual variable Y(·) is denoted by ℙ(·). We are interested in for each intervention. Our primary and secondary questions can be answered using statistical inference about parameters of the counterfactual outcome distributions. In particular, we quantify intervention effects in terms of additive differences in the average number of mean outcome over the duration of the trial. The causal estimands of interest pertaining to each of our primary and secondary questions are shown in Table 2.
Table 1:
Hypothetical interventions on nonparametric structural equation model.
| Label | NPSEM intervention | Description |
|---|---|---|
| (i) | A0 = 0,C1 = 1,A1 = 0,C2 = 1,C3 = 1 | standard of care for 6 months and attend each visit |
| (ii) |
A0 = 1,C1 = 1 A1 = d(1, Y0, Y1) C2 = 1,C3 = 1 |
text messages for 3 months, step up to eCoaching if no improvement at 6 months and attend each visit |
| (iia) | A0 = 1,C1 = 1,A1 = 1, C2 = 1,C3 = 1 | text messages for 6 months and attend each clinic visit |
| (iii) |
A0 = 2,C1 = 1 A1 = d(2, Y0, Y1) C2 = 1,C3 = 1 |
webapp for 3 months, step up to eCoaching if no improvement at 6 months and attend each visit |
| (iiia) | A0 = 2,C1 = 1,A1 = 2, C2 = 1,C3 = 1 | webapp for 6 months and attend each clinic visit |
Table 2:
Causal estimands used to answer primary and secondary questions of interest.
| Question | Causal estimand | Description |
|---|---|---|
| Primary | difference in mean outcome under text + step intervention versus standard of care | |
| difference in mean outcome under webapp + step intervention versus standard of care | ||
| Secondary | difference in mean outcome under text-only versus standard of care | |
| difference in mean outcome under webapp-only versus standard of care | ||
| difference in mean outcome under text + step versus text only | ||
| difference in mean outcome under webapp + step intervention versus webapp only |
In this section, we discuss identification of using ℙ, the distribution of the observed data. Nearly identical arguments can be used to establish identifiability of the other causal estimands. We use the longitudinal G-formula to establish identifiability; see [22, 1, 11] for more on the assumptions under which this identification result holds. We draw particular attention to the assumption of sequential randomization of intervention and missingness, referred to colloquially as the assumption of no unmeasured (time-varying) confounding. The SMART design of the trial ensures that intervention rules are randomly assigned and thus that observed intervention values are independent of counterfactual outcomes. However, as with any clinical trial, we expect that participants will miss clinic visits. Thus, in order that the sequential randomization assumption is satisfied, we require that amongst those participants who receive intervention (ii), Ct is independent of Y(ii) given , the covariate information collected up to time t − 1. That is, the covariate history should include all individual characteristics that are related both to participants’ propensity for risky sexual behaviors and for attending future clinic visits. We will adopt this bar notation throughout to denote a collection past measurements of particular variables. For example, for t = 1,2,3. We will sometimes write to denote a unit vector, the length of which will be apparent from the context. For example, denotes the event that C1 = 1,C2 = 1,C3 = 1, while denotes the event that C1 = 1,C2 = 1, and so on.
If the assumptions described above are satisfied, then can be written as a functional of ℙ, for which we require the following definitions. We define as the conditional cumulative distribution function (CDF) of Y given A0 = 1,A1 = d(1,y0,y1), , and evaluated at y. In words, describes the probability of having fewer than or equal to y risky sexual encounters over the duration of the trial, amongst the participants in intervention arm (ii) that attended each of the three follow-up visits and had observed covariate history . Similarly, we define as the conditional CDF of L2 given A0 = 1,A1 = d(1,y0,y1), , evaluated at ℓ2, and we define as the conditional CDF of L1 given A0 = 1,C1 = 1, and L0 = ℓ0. Finally, we define Q0(ℓ0) as the conditional CDF of L0 evaluated at ℓ0. The G-formula for the causal estimand of interest is
where is recursively defined as follows,
| (1) |
| (2) |
| (3) |
4. Statistical Methodology
Well-controlled randomized trials generally allow researchers to utilize covariate-unadjusted statistical methods to draw inference on causal effects, since, by virtue of the randomized design, counterfactual outcomes are marginally independent of intervention assignment. However, the same is not true in SMART trials, as sequential randomization probabilities depend on past covariates. Thus, these covariates must be accounted for by the estimator in order to draw valid inference. A more challenging issue is that typically at least some follow-up data are missing in randomized trials, for example, due to participants moving out of the study region. Simple strategies, such as complete-case analysis, run the risk of incurring bias due to informative missingness. To draw more robust inference, we can utilize estimators that account for all measured covariates, not just those that are used to determine re-randomization probabilities. If the measured covariates are rich enough to satisfy the sequential randomization assumption on missingness, then we should be able to draw valid inference about intervention effects, in spite of the informative nature of the missingness.
We will analyze the TechStep trial using targeted minimum loss-based estimation (TMLE) [28, 26, 27]. TMLE is a general framework for generating plug-in estimators that solve a set of user-selected equations [24]. In the present problem, we sequentially generate estimates of the parameters of the G-formula (1)–(3) that solve the efficient influence function estimating equation. We refer interested readers to [19, 23] for technical derivations of the underpinnings of TMLE in the present context. We provide a detailed description of the implementation of the estimator in the appendix.
From a high level, our estimator is built in several stages. In the first stage, we obtain estimates of randomization probabilities and the probability of a participant attending each clinic visit. Collectively, we refer to these probabilities as the propensity scores. These propensity scores, in particular, the components pertaining to missingness probabilities are used in the procedure to control for the possibility of time-varying confounding based on measured participant characteristics. With estimates of the propensity scores in hand, we proceed to estimating the outcome process using a sequence of regressions and starting at the final timepoint. Each of these outcome regressions involves two steps: obtaining an initial regression estimate and augmenting this estimate using so-called fluctuation submodels. The purpose of the first step is to obtain an estimator that adjusts for potential time-varying confounders of the outcome and missingness process, while the second step ensures that the efficient influence function estimating equation is satisfied. This latter step endows the TMLE with desirable robustness and efficiency properties. Notably, the TMLE controls for potential time-varying confounding through estimation of propensity scores and through sequential estimation of the outcome process. The augmentation step is the crucial step in wedding these two approaches. In particular, this step ensures that, so long as either the outcome process or the propensity scores are consistently estimated, the resultant TMLE is consistent. That is, the TMLE is doubly robust. For more on double-robustness in the present context, see [1, 11].
5. Simulation
We performed extensive simulations to determine the sample size required to detect clinically meaningful differences in average risk behaviors in the TechStep trial. We designed a data generating process that mimicked the design of the trial and analyzed the simulation data using the proposed statistical methodology. Simulated data were generated as follows. For simplicity, we assumed W0 was scalar-valued and drawn from a Bernoulli(1/2) distribution. Baseline sexual risk behaviors Y0 were drawn from a Poisson(exp(γ0)) distribution, where γ0 is the log-average number of risk behaviors. Using estimates from published data [10, 21], we set γ0 = log(1.5). The treatment A0 was assigned randomly with 1/3 probability of control, text-, and webapp-based interventions. Data for the 3, 6, and 9-month visits were generated as follows. For t = 1,2,3, we drew Ct from Bernoulli(logit−1(α0)) distribution; thus, α0 controls the level of participant dropout. Given Ct = 1 (i.e., that the participant attended clinic visit t), we drew Yt from a Poisson distribution with conditional mean . Here, x1∨x2 and x1∧x2 denote the smaller and larger of two real-valued numbers x1,x2, respectively. Note that this corresponds to a Markov model, where the only factors of participants’ history that influence their risk behaviors are their most recently measured risk behaviors (Yt−1), their most recent treatment (A(t−1)∧1), and baseline covariate W0. We consider a wide range of scenarios by selecting different values of simulation parameters (Table 3). For each combination of simulation parameters, we analyzed five hundred data sets. Power for each choice of n and for each of the comparisons shown in Table 2 was estimated by the proportion of simulations in which the two-sided level 0.05 Wald test rejected the null hypothesis of no difference in average number of risky behaviors. To estimate the conditional probability of censoring, we used main terms logistic regression. To estimate the sequential outcome regression, we used the super learner based on five-fold cross-validation with a library of candidate regressions that consisted of an intercept-only model, a Poisson regression model, a negative binomial regression model, and the highly adaptive LASSO [3]. The latter is a flexible semiparametric regression estimator.
Table 3:
Simulation parameter values and descriptions. Values of γi are log rate ratios.
| Parameter | Values | Notes |
|---|---|---|
| n | 200, 250, 300 | Sample size |
| γ1 | 0, −0.11, −0.22, −0.36 | Text eff. on risky behaviors |
| γ2 | 0, −0.22, −0.51 | Webapp eff. on risky behaviors |
| γ3 | 0, −0.11, −0.22, −0.36, −0.69*, −0.92*, −1.20*, −1.61* | ECoaching eff. on risky behaviors |
| α0 | −4.06, −3.35 | Loss-to-followup of 5% and 10% |
denotes that simulations were performed only when γ1 = 0,−0.11.
For the primary analysis, we found that type I error rate was adequately controlled for each sample size considered (Figure 2). Across a wide range of settings, we estimated that we have 80% power to detect a difference in 1.60 average risk behaviors comparing text+step versus control at n = 300, 1.75 at n = 250, and 1.90 at n = 200. These results seem to be robust across situations where the majority of the benefit is derived from the text versus from the eCoaching component of the intervention. Increasing censoring from 5% to 10% (left versus right column of Figure 2) tended to decrease power by 3–5%. For the secondary analysis examining the effect of text+step versus text-only interventions we found that type I error rates were slightly higher than the nominal level in the smallest sample size (Figure 3). We found similar power curves for this secondary analysis as with the primary analysis. There tended to be higher power to detect differences when there was a small effect of the text intervention (γ1 = −0.11) relative to no effect of the text intervention (γ1 = 0). Results for the simulation studying the secondary analysis that compares efficacy of text-only versus app-only are included in the appendix. These results were similar to the primary analysis with 80% power to detect differences in average number of risk behaviors of between 1.60 and 1.90. Results for comparing text-only versus app-only interventions are shown in Figure 4. We found a minor increase in type I error for this comparison using the proposed test. Across the various settings, there is 80% power to detect an average difference of about two risk behaviors between the text-only and app-only interventions.
Figure 2:

Probability of rejecting (no effect of text-step versus control) as a function of the size of the effect, . Dotted lines indicate the nominal type I error rate of the Wald test (0.05) and power of 80%.
Figure 3:

Probability of rejecting (no effect of text + step versus text-only) as a function of the size of the effect, . Dotted lines indicate the nominal type I error rate of the Wald test (0.05) and power of 80%.
Figure 4:

Probability of rejecting (no effect of text-only versus app-only) as a function of the size of the effect, . Dotted lines indicate the nominal type I error rate of the Wald test (0.05) and power of 80%.
6. Discussion
Based on these simulation results and other criteria, a target of n = 250 was set for enrollment in the TechStep trial. Our simulation results suggest this sample size will afford sufficient power to detect clinically meaningful differences between interventions, and balances power appropriately between the primary analyses and key secondary analyses.
Further research is warranted in several areas. First, our Wald tests were slightly anti-conservative for testing differences between the text+step versus text condition. We will explore alternative standard error estimators including cross-validated influence function-based standard errors. Second, we may wish to evaluate whether and how to adjust for intervention engagement (e.g., webapp para-data) in the primary and secondary analyses. If such information is predictive of risky behaviors, then incorporating such information may lead to improved power to detect differences amongst the interventions. Finally, we will explore different techniques for time-varying confounder selection. Extensive surveys will be collected at each clinic visit, so W0,W1,W2 are all potentially high-dimensional vectors. We will evaluate different strategies for selecting components to include in the primary and secondary analyses.
Acknowledgments
Supported by United States National Institute of Health Grant 5U19HD089881 and U24HD089880
Appendix
To provide a concrete example of how to implement a TMLE in the present problem, we consider estimation of in a setting where W0,W1, and W2 are scalar-valued.
Estimating propensity scores
Our first task is to estimate the randomization probabilities for the text + step-to-eCoaching intervention. Specifically, we require estimates of
While these probabilities are known by design, our estimators will utilize empirical estimates to account for chance imbalances between intervention arms. To that end, we define
Note that the latter estimator accounts for our step-up criteria. If a participant’s sexual risk behaviors do not improve, then they are eligible for step up and may be re-randomized to the eCoaching intervention. We estimate this re-randomization probability using an empirical proportion. If, on the other hand, a participant’s sexual risk behaviors improve, then that participant is not eligible for step up and stays on the text intervention with probability 1.
The second step is to estimate the probability of missing each clinic visit as a function of past covariates. In particular, for each covariate history , we require an estimate of
These estimates can be obtained, for example, using logistic regression. Considering estimation of , we could posit a logistic regression model, and regress the binary outcome C1 onto basis functions of L0 amongst participants randomized to the text intervention. Alternatively, we could posit a logistic regression model for the probability of attending the first clinic visit as a function of A0 and L0, and regress C1 onto basis functions of L0 and A0 amongst all participants. As a concrete example, consider the main-terms logistic regression model for ,
where γ = (γ0,γ1,γ2,γL)Τ and γL is a two-dimensional vector of parameters associated with W0 and Y0. We note that this regression pools across interventions to estimate the conditional probability of attending the first clinic visit. An estimate of γ can be obtained via maximum likelihood. We use to denote the fitted value from this regression for an observation with A0 = a0 and L0 = ℓ0. For any ℓ0 our estimate of is
To estimate we take a similar approach. For example, we could posit a logistic regression model for the probability of attending the second clinic visit as a function of and , and regress C2 onto basis functions of and using data from all participants who attended the first clinic visit. As a concrete example, consider the main-terms logistic regression model for ,
where η = (η0,η1,…,η4,ηL) and ηL is a four-length vector of regression parameters associated with Y0,W0,Y1, and W1. An estimate of η can be obtained via maximum likelihood, and similarly as above, we use to denote the fitted value from this regression for an observation with A0 = a0, A1 = a1, and . For any our estimate of is
Finally, we estimate the conditional probability of attending the third clinic visit similarly as with the second to obtain , an estimate of .
Estimating sequential outcome regressions
We now turn to obtaining an estimate of , the conditional mean of the cumulative number of risky sexual behaviors amongst participants in the text + step-to-eCoaching who attended each clinic visit given their sexual risk behaviors and covariates measured through the second clinic visit. We can write (3) as
where we defined . The second equality follows from the fact that we are conditioning on , which includes past sexual risk behaviors y1 and y2. Thus, in order to estimate , it suffices to estimate . For example, since Y3 is a count variable, we could posit a generalized linear model with Poisson family and log link function for . As with estimation of missingness probabilities, we will use regressions that pool over intervention groups, by regressing the outcome Y3 onto basis functions of and . To provide a concrete example, we consider the following pooled, log-linear regression model for ,
An estimate of β can be obtained via maximum likelihood, and we denote by the fitted value from this regression for an observation with and . For any an estimate of is
| (4) |
Next, we augment our initial estimate of by estimating the parameter of a fluctuation submodel associated with the augmentation covariate
Recall, that TMLE is a general purpose tool for generating estimators that solve user-specified equations. In the current problem, we need an estimate of that satisfies
| (5) |
This can be achieved as follows. First, we compute maximum observed value of Y3, denoted by s3[n] and we scale Y3 to the unit interval, . Next, we define the augmented regression model,
Estimating the parameter ε3 corresponds to fitting a logistic regression in subset of participants , where the outcome is , the regressor is H3n, and the model includes an offset equal to the logit of the scaled fitted value. Denoting by ε3n the estimate of ε3, the augmented estimate of is
Thus, for a given , the augmented estimate of is
It is straightforward to show that satisfies equation (5), as required.
Our next task it to obtain an estimate of , the conditional mean of amongst participants in the text + step-to-eCoaching intervention who attended clinic visits one and two, and given covariate-history measured through the first follow-up clinic visit. We can write (2) as
where we defined . Thus, to estimate , it suffices to estimate . This quantity could be estimated, as above, using a pooled log-linear regression model,
We could estimate the parameters of this regression as follows. First, the fitted value is computed for i = 1,…,n and these values are used to create the regression outcome . The regression model is fit using observations with . We denote by the fitted value from this regression for a given observation of O with and . The initial estimate of for this observation is thus
As above, the next step is to augment the initial estimate of using an appropriate fluctuation submodel. To that end, we define the augmentation covariate
The goal of this augmentation step is to generate an estimate of that satisfies
| (6) |
This can be achieved using a similar strategy as above. First, we compute the maximum observed value of , say s2n. We then generate a scaled outcome . Next, we define an augmented regression model for ,
Estimating the parameter ε2 corresponds with fitting a logistic regression in the subset of participants with , where the regression outcome is , the single covariate is H2n, and the model includes an offset equal to the logit of the scaled fitted value. Denoting by ε2n the estimate of ε2, the augmented estimate of is
Thus, the augmented estimate of is . It is straightforward to show that and together satisfy equation (6).
Our next task it to obtain an estimate of , the baseline-covariate-conditional mean of amongst participants initially in the text intervention who attended clinic visit one. We can write (1) as
where we defined . Obtaining an initial, and subsequently, an augmented estimate of proceeds similarly as with . We first obtain an initial estimate of , e.g., through log-linear regression of the outcome on A0,W0,Y0 amongst observations with C1 = 1. Given an observation of O with L0 = ℓ0, our estimate of is .
We augment using the covariate
with the goal generating an estimate of that satisfies
| (7) |
We again use logistic regression to this end. The scaled outcome is , where s1n is the maximum observed value of . As in previous steps, we regress the scaled outcome onto the covariate H1n with offset equal to amongst observations with C1 = 1.
Denoting by ε1n the estimate of ε1, the augmented estimate of is
It is straightforward to show that and together satisfy equation (6).
The final step is to plug-in the estimate into right-hand-side of equation (3). To evaluate this plug in estimator, we also need an estimate of Q0, the CDF of L0. For this, we use the empirical distribution . The final TMLE estimator is thus
Note on propensity score and outcome regression estimators
In the above description of TMLE, we used initial estimators based on parametric models to illustrate key ideas. However, TMLE methodology is in no way limited to such estimators. In particular, we may wish to consider more flexible regression techniques in order to mitigate the risk of residual bias incurred by regression model misspecification. To that end, it may be propitious to utilize super learning [25], a generalization of regression stacking [29, 4]. A super learner can build estimators of regression quantities by considering all possible convex combinations of a library of pre-specified regression estimators. That is to say, a fitted value from a super learner regression is a weighted combination of fitted values from various candidate regression estimators. The candidate regression estimators can range from parametric regression estimators to highly data-adaptive estimators. The super learner selects weights assigned to each candidate estimator by finding the convex weights that minimize a cross-validated risk criteria, for example, cross-validated mean squared-error. The use of super learner is particularly appealing in the context of randomized trials in that the method can be fully pre-specified and is essentially a nonparametric method so long as sufficiently flexible regressions are included in the library. For these reasons, in the analysis of the TechStep trial, we will use the super learner to estimate the conditional probabilities of missingness and the sequential outcome regressions described in the previous paragraph.
Confidence intervals and hypothesis tests based on influence functions
We note that our specific labeling of interventions (e.g., A0 = 1 correpsonds to text, A0 = 2 to app, etc…) is arbitrary. Moreover, by redefining the rule d(a0,y0,y1) (e.g., to equal a constant for any value of a0), then we can directly apply the methodology described above to estimate the counterfactual outcome under any of the interventions of interest. Let denote the TMLE of . Similarly, let denote the augmented sequential outcome regression estimate and the augmentation covariate. Define the estimated influence function of evaluated on observation i as , where
An estimate of the standard error of is
and the confidence interval , where z1−α/2 is the 1−α/2 quantile of a standard Normal distribution, will have asymptotic coverage no smaller than 1−α.
Similar influence function-based standard error estimates can be used to describe the asymptotic covariance between any two TMLEs. For example, the asymptotic covariance of is consistently estimated by
Thus, by the Delta method, an estimate of the standard error of is
A Wald test that rejects the null hypothesis whenever
will have asymptotic level α.
References
- [1].Bang H, Robins JM: Doubly robust estimation in missing data and causal inference models. Biometrics 61(4), 962–973 (2005) [DOI] [PubMed] [Google Scholar]
- [2].Bembom O, van der Laan MJ: Analyzing sequentially randomized trials based on causal effect models for realistic individualized treatment rules. Statistics in Medicine 27(19), 3689–3716 (2008) [DOI] [PubMed] [Google Scholar]
- [3].Benkeser D, van der Laan MJ: The highly adaptive LASSO estimator. In: 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA), pp. 689–696. IEEE; (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Breiman L: Stacked regressions. Machine learning 24(1), 49–64 (1996) [Google Scholar]
- [5].Brown CH, Curran G, Palinkas LA, Aarons GA, Wells KB, Jones L, Collins LM, Duan N, Mittman BS, Wallace A, et al. : An overview of research and evaluation designs for dissemination and implementation. Annual Review of Public Health 38, 1–22 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Chakraborty B, Moodie E: Statistical methods for dynamic treatment regimes. Springer; (2013) [Google Scholar]
- [7].Crissman HP, Berger MB, Graham LF, Dalton VK: Transgender demographics: a household probability sample of US adults, 2014. American Journal of Public Health 107(2), 213–215 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Janes H, Donnell D, Gilbert PB, Brown ER, Nason M: Taking stock of the present and looking ahead: envisioning challenges in the design of future HIV prevention efficacy trials. The Lancet HIV 6(7), e475–e482 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Johns MM, Lowry R, Andrzejewski J, Barrios LC, Demissie Z, McManus T, Rasberry CN, Robin L, Underwood JM: Transgender identity and experiences of violence victimization, substance use, suicide risk, and sexual risk behaviors among high school students – 19 states and large urban school districts, 2017. Morbidity and Mortality Weekly Report 68(3), 67 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Kann L, McManus T, Harris WA, Shanklin SL, Flint KH, Queen B, Lowry R, Chyen D, Whittle L, Thornton J, et al. : Youth risk behavior surveillance – United States, 2017. MMWR Surveillance Summaries 67(8), 1 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].van der Laan MJ, Gruber S: Targeted minimum loss based estimation of causal effects of multiple time point interventions. The International Journal of Biostatistics 8(1), 1–34 (2012). DOI 10.1515/1557-4679.1370 [DOI] [PubMed] [Google Scholar]
- [12].Lavori PW, Dawson R: A design for testing clinical strategies: biased adaptive within-subject randomization. Journal of the Royal Statistical Society: Series A (Statistics in Society) 163(1), 29–38 (2000) [Google Scholar]
- [13].Lentferink AJ, Oldenhuis HK, de Groot M, Polstra L, Velthuijsen H, van Gemert-Pijnen JE: Key components in eHealth interventions combining self-tracking and persuasive ecoaching to promote a healthier lifestyle: a scoping review. Journal of Medical Internet Research 19(8), e277 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Meerwijk EL, Sevelius JM: Transgender population size in the United States: a meta-regression of population-based probability samples. American Journal of Public Health 107(2), e1–e8 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Murphy SA: An experimental design for the development of adaptive treatment strategies. Statistics in Medicine 24(10), 1455–1481 (2005) [DOI] [PubMed] [Google Scholar]
- [16].Murphy SA, Bingham D: Screening experiments for developing dynamic treatment regimes. Journal of the American Statistical Association 104(485), 391–408 (2009) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Nahum-Shani I, Qian M, Almirall D, Pelham WE, Gnagy B, Fabiano GA, Waxmonsky JG, Yu J, Murphy SA: Experimental design and primary data analysis methods for comparing adaptive interventions. Psychological Methods 17(4), 457 (2012) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Perez-Brumer A, Nunn A, Hsiang E, Oldenburg C, Bender M, Beauchamps L, Mena L, MacCarthy S: “we don’t treat your kind”: Assessing HIV health needs holistically among transgender people in Jackson, Mississippi. PLoS One 13(11), e0202389 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Petersen M, Schwab J, Gruber S, Blaser N, Schomaker M, van der Laan M: Targeted maximum likelihood estimation for dynamic and static longitudinal marginal structural working models. Journal of Causal Inference 2(2), 147–185 (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Reisner SL, Hughto JMW, Dunham EE, Heflin KJ, Begenyi JBG, Coffey-Esquivel J, Cahill S: Legal protections in public accommodations settings: A critical public health issue for transgender and gender-nonconforming people. The Milbank Quarterly 93(3), 484–515 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Reisner SL, Jadwin-Cakmak L, Sava L, Liu S, Harper GW: Situated vulnerabilities, sexual risk, and sexually transmitted infections’ diagnoses in a sample of transgender youth in the United States. AIDS Patient Care and STDs 33(3), 120–130 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Robins JM: Robust estimation in sequentially ignorable missing data and causal inference models. In: Proceedings of the American Statistical Association Section on Bayesian Statistical Science, pp. 6–10 (1999) [Google Scholar]
- [23].Schnitzer ME, van der Laan MJ, Moodie EE, Platt RW: LTMLE with clustering. In: Targeted Learning in Data Science, pp. 233–251. Springer; (2018) [Google Scholar]
- [24].van der Laan MJ, Benkeser D, Sofrygin O: Targeted minimum loss-based estimation. Wiley StatsRef: Statistics Reference Online pp. 1–8 (2018) [Google Scholar]
- [25].van der Laan MJ, Polley EC, Hubbard AE: Super learner. Statistical Applications in Genetics and Molecular Biology 6(1) (2007) [DOI] [PubMed] [Google Scholar]
- [26].van der Laan MJ, Rose S: Targeted learning: causal inference for observational and experimental data. Springer Science & Business Media; (2011) [Google Scholar]
- [27].van der Laan MJ, Rose S: Targeted learning in data science: causal inference for complex longitudinal studies. Springer; (2018) [Google Scholar]
- [28].van der Laan MJ, Rubin D: Targeted maximum likelihood learning. The International Journal of Biostatistics 2(1) (2006) [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Wolpert DH: Stacked generalization. Neural Networks 5(2), 241–259 (1992) [Google Scholar]
- [30].Yousuf H, Reintjens R, Slipszenko E, Blok S, Somsen G, Tulevski I, Hofstra L: Effectiveness of web-based personalised e-Coaching lifestyle interventions. Netherlands Heart Journal 27(1), 24–29 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
