Skip to main content
Biostatistics (Oxford, England) logoLink to Biostatistics (Oxford, England)
. 2014 Sep 3;16(2):295–310. doi: 10.1093/biostatistics/kxu040

A discrete-time survival model with random effects for designing and analyzing repeated low-dose challenge experiments

Chaeryon Kang 1, Ying Huang 1,*, Christopher J Miller 2
PMCID: PMC4786638  PMID: 25190513

Abstract

Repeated low-dose (RLD) challenge designs are important in HIV vaccine research. Current methods for RLD designs rely heavily on an assumption of homogeneous risk of infection among animals, which, upon violation, can lead to invalid inferences and underpowered study designs. We propose to fit a discrete-time survival model with random effects that allows for heterogeneity in the risk of infection among animals and allows for predetermined challenge dose changes over time. Based on this model, we derive likelihood ratio tests and estimators for vaccine efficacy. A two-stage approach is proposed for optimizing the RLD design under cost constraints. Simulation studies demonstrate good finite sample properties of the proposed method and its superior performance compared to existing methods. We illustrate the application of the heterogeneous infection risk model on data from a real simian immunodeficiency virus vaccine study using Rhesus Macaques. The results of our study provide useful guidance for future RLD experimental design.

Keywords: Discrete-time survival model with random effects, Heterogeneous infection risk, HIV vaccine prevention research, Repeated low-dose challenge experiment, Sample size calculation

1. Introduction

Recently, a repeated low-dose (RLD) challenge design has become a standard approach to using non-human primate (NHP) models in HIV vaccine research. Instead of exposing animals to a single high dose of virus to induce infection as in traditional challenge experiments, in RLD experiments animals are repeatedly challenged with a relatively low dose of virus. For example, (Qureshi and others (2012)), adapted RLD experiments to determine whether the combination of host range mutant adenovirus type-5 and simian immunodeficiency virus (SIV; Ad5 SIVmac239 GagInline graphicPolInline graphicNef) vaccine has a significant effect on SIV infection. In this study, animals were randomized to five treatment groups; all animals were repeatedly challenged with the same three escalating doses of SIV viruses regardless of treatment group. RLD experiments have advantages compared with the single high-dose design: they reflect the low probability of HIV transmission in humans more realistically and can provide more statistical power to detect the effects of HIV vaccine (Ellenberger and others, 2006; García-Lerma and others, 2008; Hudgens and Gilbert, 2009; Hudgens and others, 2009; Reynolds and others, 2010).

In HIV vaccine research using RLD experiments, it is of major interest to test the effects of the vaccine in preventing HIV infection and to evaluate the magnitude of the vaccine efficacy. Vaccine efficacy is characterized by the vaccine-induced percent reduction in the risk of infection either at each challenge or up to a certain time point during the experiment. Vaccine efficacy has been commonly evaluated using the nonparametric log-rank test and Kaplan–Meier estimator (García-Lerma and others, 2008; Reynolds and others, 2010; Qureshi and others, 2012) or a discrete-time survival model. In an RLD experiment, animals are examined for infection status after each challenge; this produces repeated binary infection outcomes each time. The probability of infection after each challenge can be expressed as a product of Bernoulli trials in the discrete-time survival model.

Current discrete-time survival models to estimate the effects of vaccine in RLD experiments rely heavily on an assumption of homogeneous risk of infection among animals (e.g. García-Lerma and others, 2008; Qureshi and others, 2012), which, upon violation, can lead to invalid inferences and underpowered study designs. Ignoring heterogeneity among animals can result in underestimated standard errors of the vaccine efficacy estimates and result in confidence intervals with poor coverage (Hudgens and Gilbert, 2009; Moerbeek, 2012). To relax the homogeneity assumption, Hudgens and Gilbert (2009) modeled the transmission probability with a Inline graphic-distribution assuming independent transmission probabilities across challenges within animals. In this study, we propose to use a discrete-time survival model with random effects to model data from the RLD design, assuming an animal's risks of infection across challenges are independent of each other conditional on random effects. The conditional independence assumption is realistic considering the potential heterogeneity among animals due to biological variation and unobserved covariates. By incorporating random effects, our model flexibly accommodates heterogeneity among animals as well as within-animal dependence with respect to the risk of infection after each challenge.

This article has two goals. The primary goal is to develop a flexible statistical model that can take into account between-animal heterogeneity in RLD experiments while allowing for adjustment of covariates such as the time-dependent challenge doses, as in Qureshi and others (2012). In the present study, we propose to fit a discrete-time survival model with a Inline graphic-distributed random effect and a complementary log–log (clog–log) link function, which allows for closed forms for the marginal likelihood function and the vaccine efficacy. We estimate model parameters by maximizing the marginal likelihood function and derive asymptotic variance formulae for inferences about vaccine efficacy.

The second goal of this article is to provide guidance on how to design future RLD experiments in terms of the choices for sample size and maximum number of challenges per animal under limited resources. While there is an intensive literature on the design of clinical trials using a single high-dose challenge, design components in RLD experiments have been seldom studied. Because of the complexity of the design, there is no simple analytic formula for calculating sample size andInline graphicor maximum number of challenges given desired operational criteria. Previously, Hudgens and Gilbert (2009) used simulation studies to investigate the effects of sample size and maximum number of challenges on statistical power in RLD experiments under the assumption of homogenous risk of infection. In the present study, we propose a two-stage procedure to determine the optimal RLD design under financial constraints allowing for between-animal heterogeneity in the risk of infection.

The remainder of this article is organized as follows. In Section 2, we introduce a heterogeneous infection risk model and an inference about vaccine efficacy. The optimization of the study design for RLD experiments is discussed in Section 3. We investigate performance of the proposed methods through two intensive simulation studies in Section 4. In Section 5, an application of the proposed risk modeling method to the NHP study described in Qureshi and others (2012) is presented. Finally, we conclude the article with a discussion of our findings and future research topics.

2. A heterogeneous infection risk model

2.1. Notation and assumptions

Here we consider an RLD experiment in which each animal is randomly assigned to a treatment (vaccine) or a placebo group, and all animals are repeatedly challenged with the same series of SIVs. Let Inline graphic be the subject index, Inline graphic and Inline graphic be time to infection and time to censoring, respectively, and Inline graphic be the time to event outcome variable whichever comes first between Inline graphic and Inline graphic. An infection indicator is denoted by Inline graphic, where Inline graphic if Inline graphic is satisfied, 0 otherwise. Suppose the animals’ infection statuses are collected after each challenge. Inline graphic is thus discrete and can be represented as the total number of challenges an animal receives until infected or censored. Let Inline graphic be a Inline graphic vector of time-invariant baseline covariates such as the indicator of vaccination status. Let Inline graphic be a vector of time-variant covariates associated with the challenge dose that animal Inline graphic receives at time Inline graphic. For example, when Inline graphic different levels of challenge doses are applied to each animal, in this article we model Inline graphic, where Inline graphic for all dose levels (reference level), Inline graphic for the Inline graphicth dose level, and Inline graphic otherwise for Inline graphic. Alternatively, Inline graphic can include polynomial terms of challenge doses at time Inline graphic if one is interested in continuous dose effects. Let Inline graphic denote the history of Inline graphic up to time Inline graphic. We observe Inline graphic independent identically distributed (i.i.d.) samples, Inline graphic. We consider a study design where dose levels are predetermined by protocol for the whole study period and do not vary with individual subjects. That is, Inline graphic is determined in advance for all individuals under study for Inline graphic within the planned study period and is conceptually an external time-dependent covariate as described in Kalbfleisch and Prentice (2011).

Let Inline graphic denote the subject-specific random effect. We make the following three assumptions. First, the infection risks across challenges within an animal are assumed to be independent of each other conditional on the random effect and covariates included in the risk model. Second, non-informative censoring is assumed, conditional on the random effect and covariates, which is a reasonable assumption for a well-controlled RLD experiment. Finally, we assume that there is no “memory” of challenge history as commonly assumed in RLD literature (García-Lerma and others, 2008; Hudgens and Gilbert, 2009; Hudgens and others, 2009; Regoes, 2012). That is, for an animal not yet infected before a challenge, the probability of infection at this particular challenge depends only on the current challenge but not on previous challenges.

2.2. A discrete-time survival model with random effects and a marginal likelihood approach

Let Inline graphic and Inline graphic represent vectors of regression parameters for Inline graphic and Inline graphic, for example, the effects of vaccination and challenge doses (Inline graphic), respectively. To take into account between-animal heterogeneity under the aforementioned assumptions, we model the risk of infection as follows:

2.2. (2.1)

where the random effect Inline graphic follows a specific distribution and Inline graphic is a link function. Model (2.1) could be extended to include interactions between dose-levels and treatment: Inline graphic, where Inline graphic, and Inline graphic quantifies how the treatment effect changes with dose level. In this study, we use a clog–log link for Inline graphic and assume that Inline graphic is independent of other covariates and follows a Inline graphic-distribution with mean 1 and variance Inline graphic. A Inline graphic-distribution with various shapes covers many commonly used exponential-family distributions. Modeling the risk of infection (2.1) with a Inline graphic-distributed random effect and a clog–log link function leads to a closed-form expression for the marginal likelihood and the vaccine efficacy, which is computationally efficient. We refer to Conaway (1990), Scheike and Jensen (1997), and Coull and others (2006) for further discussion on the advantages of this combination of the random effects model and link function.

In a discrete-time survival model, given that Inline graphic is an external covariate, the conditional survival function at time Inline graphic is Inline graphic, and the conditional probability of infection at time Inline graphic is Inline graphic. The marginal likelihood function for Inline graphic subjects indexed by Inline graphic is given by

2.2. (2.2)

where Inline graphic is the cumulative distribution function (CDF) of Inline graphic. Let Inline graphic denote the linear predictor at the time of challenge Inline graphic. As shown in supplementary material available at Biostatistics online, Section S1,

2.2.

where Inline graphic assuming Inline graphic. Therefore, we have a closed-form formula for the marginal log-likelihood function:

2.2.

where Inline graphic is the standard empirical measure for Inline graphic.

2.3. Estimation and inference

We estimate parameters in (2.1) and variance Inline graphic by maximizing the log likelihood (2.2), for example, through Fisher scoring using iteratively reweighted least squares. The variability of the random effects can be expressed more intuitively by an intracluster correlation, Inline graphic, between underlying continuous responses. In particular, let Inline graphic be a binary outcome indicating infection status for animal Inline graphic after the Inline graphicth challenge assuming no event occurs before Inline graphic. Let Inline graphic be the underlying continuous latent outcome such that Inline graphic if Inline graphic and Inline graphic otherwise. Suppose Inline graphic, where Inline graphic, the individual error term, has a reverse extreme value distribution with the CDF Inline graphic and variance Inline graphic. Under the model with a Inline graphic-distributed random effect and a clog–log link, the intrasubject correlation coefficient for the underlying continuous outcome equals Inline graphic. Equivalently, we have Inline graphic. For more details, we refer to Coull and others (2006) and Rodriguez and Elo (2003).

The null hypothesis of no between-animal heterogeneity, Inline graphic, is equivalent to the null hypothesis of no random effects, Inline graphic, for all animals. Under the null hypothesis,

2.3. (2.3)

which is a likelihood function assuming independent risk of infection across challenges. As shown in supplementary material available at Biostatistics online, Section S1, (2.2) converges to (2.3) as Inline graphic. Let Inline graphic and Inline graphic denote parameter spaces under the alternative and null hypotheses of zero between-animal heterogeneity, respectively. We conduct the likelihood ratio test (LRT) to reject the null hypothesis for a large value of the LR statistic: Inline graphic. Under the null hypothesis of Inline graphic, the value of Inline graphic lies on the boundary of the parameter space, Inline graphic, such that Inline graphic converges to a mixture of Inline graphic distributions Inline graphic, where Inline graphic is the Inline graphicth percentile of a Inline graphic distribution with Inline graphic degrees of freedom (d.f.), as presented in Self and Liang (1987), Goldman and Whelan (2000), and Hudgens and Gilbert (2009). We reject Inline graphic if Inline graphic.

The test for the effect of vaccine is equivalent to the test of the null hypothesis Inline graphic. Let Inline graphic and Inline graphic denote parameter spaces under the alternative and null hypotheses of a zero vaccine effect, respectively. We reject the null hypothesis if Inline graphic, where Inline graphic, and Inline graphic is the difference in the number of parameters between the two-nested models.

2.4. Estimation of vaccine efficacy

Hudgens and Gilbert (2009) defined two types of vaccine efficacy. The first is vaccine efficacy for preventing infection before or at the time of challenge Inline graphic:

2.4.

which is the relative reduction in the risk of infection before or at time Inline graphic for the vaccine group compared to the placebo group. Inline graphic indicates that the vaccine is effective in reducing the risk of infection before or at time Inline graphic, whereas Inline graphic indicates that the vaccine is not effective or has a negative effect. Under the heterogeneous infection risk model described in Section 2.2, Inline graphic. The second type of vaccine efficacy is the perchallenge vaccine efficacy, defined as the relative reduction in the risk of infection caused by vaccination at a particular challenge, conditional on non-infection before the challenge. Perchallenge VE at dose-level Inline graphic under the heterogeneous infection risk model equals Inline graphic, where Inline graphic is a vector of variables of length Inline graphic for the Inline graphicth dose level with the Inline graphicth element being Inline graphic, Inline graphic and Inline graphic (reference level). Inline graphic allows characterization of the vaccine's effect at a specific level of exposure, whereas Inline graphic represents a vaccine effect integrated over multiple levels of exposures. Let Inline graphic and Inline graphic be MLE of Inline graphic and Inline graphic. The covariance matrices of Inline graphic and Inline graphic can be calculated using the Delta method as given in supplementary material available at Biostatistics online, Section S1.

3. Optimization of the design of RLD experiments under cost constraints

In practice, we have limited resources for conducting RLD experiments. It is of interest to optimize the study design such that a desired operational criterion, for example, precision of VE estimators or power of the study, can be maximized. As pointed out by Moerbeek (2012) and Zhang and Ahn (2011) for the cluster randomized design with discrete survival outcomes, no analytical formula is available yet for calculating sample size and power; Monte-Carlo simulation studies are commonly used to determine design components. The results of the simulation study in Hudgens and Gilbert (2009) demonstrated that the statistical power for the test of vaccine efficacy increases with larger sample size, larger maximum number of challenges per animal, and higher risk of infection per exposure in the control group. Optimizing an RLD design typically requires excessive simulations under various combinations of sample size and number of challenges.

Motivated by the study in Zhang and Ahn (2011), we propose a computationally efficient two-stage approach to identify the optimal pair of sample size and maximum number of challenges, denoted by (Inline graphic), in order to maximize the operational criteria of interest under financial constraints. We illustrate how the proposed approach can be used to guide the design of an RLD experiment to optimize the efficiency of estimating perchallenge vaccine efficacy, Inline graphic. The same strategy applies to other criteria such as Inline graphic and study power.

Suppose the financial costs of adding an animal and adding a challenge per animal are Inline graphic and Inline graphic, respectively, and the maximum budget Inline graphic for the experiment is fixed. Suppose Inline graphic unique levels of escalating challenge doses will be applied to each animal as in Qureshi and others (2012), and each dose level is applied to an animal for Inline graphic times Inline graphic. Let Inline graphic denote the probability of an animal receiving the Inline graphicth challenge for Inline graphic. The average cost to challenge Inline graphic animals for up to Inline graphic times at each dose level equals Inline graphic. Define Inline graphic as the largest Inline graphic for a given Inline graphic satisfying this constraint: Inline graphic, where Inline graphic denotes the largest integer not greater than Inline graphic. Since Inline graphic and Inline graphic are not independent conditional on Inline graphic, seeking the best pair (Inline graphic) that minimizes the variance for estimators of Inline graphic is not straightforward in practice, particularly for a complex study design like an RLD experiment.

Let Inline graphic for a prespecified Inline graphic. We propose to use a two-stage approach to find Inline graphic. In the first stage, we conduct a simulation study with a large sample size (Inline graphic) to estimate Inline graphic for each Inline graphic, denoted by Inline graphic. In the second stage, we find the Inline graphic that minimizes the variance estimate of Inline graphic. In particular, let Inline graphic denote the variance estimate based on data with Inline graphic. We approximate Inline graphic with Inline graphic. Using Inline graphic as the reference design, we evaluate the efficiency at Inline graphic relative to Inline graphic by

3. (3.1)

We compute Inline graphic. The resultant Inline graphic would achieve the best efficiency for estimating Inline graphic under a fixed total cost. Compared with the typical Monte-Carlo simulation studies that evaluate variances of Inline graphic estimators for every Inline graphic combinations, the computational burden reduces from Inline graphic to Inline graphic using the proposed two-stage approach.

4. Simulation studies

We illustrate the proposed heterogeneous infection risk model and the two-stage approach to optimizing RLD design with two intensive simulation studies. In the first simulation study, we compare the discrete-time survival model with random effects (hereafter “heterogeneous model”) with a discrete-time survival model assuming independence in the risk of infection across challenges within animals (“homogeneous model”), with respect to statistical power, Type I error rate and the precision of parameter estimates. In the second simulation study, we explore the best pair of sample size and maximum number of challenges per animal for various levels of vaccine efficacy and between-animal heterogeneity under a fixed total cost.

For both simulation studies, we considered RLD studies in which animals are 1 : 1 randomized to a vaccine group and a placebo group and challenged with the same three increasing levels of challenge doses (Inline graphic) to mimic the setting in Qureshi and others (2012). Each animal was allowed up to Inline graphic challenges. Let Inline graphic be the indicator of assignment to the vaccine group and Inline graphic as defined in Section 2.1. Let Inline graphic be a random effect with Inline graphic generated from a Inline graphic-distribution with mean 1 and variance Inline graphic. The conditional probability of infection at each challenge was modeled as in (2.1) with the clog–log link function. We observed the binary outcome Inline graphic, the survival time Inline graphic, and the infection indicator Inline graphic. We maximized the likelihood functions in (2.2) and (2.3) using the optim function in the R package (R Core Team, 2012).

4.1. Simulation Study 1

4.1.1. Set-up

In the first simulation study, each animal was allowed up to 5 challenges at each dose level, with 15 the maximum number of challenges for each animal. Data were generated with different Inline graphic values for zero (Inline graphic), weak (Inline graphic or Inline graphic), moderate (Inline graphic or Inline graphic), or strong (Inline graphic or Inline graphic) within-animal dependence. We set Inline graphic and Inline graphic corresponding to perchallenge infection probabilities among placebo recipients of 0.02, 0.16, and 0.39 at dose levels 1, 2, and 3, respectively. Sample sizes Inline graphic or 1000 were explored. For each simulation scenario, 1000 Monte-Carlo data were generated.

4.1.2. Results

Without within-animal dependence (Inline graphic), average estimates for model parameters and for perchallenge VEs using the heterogeneous model were comparable with those estimated using the homogeneous model, with slightly larger variances. In the presence of weak within-animal dependence (Inline graphic), the homogeneous model produced highly biased estimates while the heterogeneous model produced unbiased estimates even when sample size was small (Table 1). In addition, perchallenge VE tended to be underestimated by fitting the homogeneous model with much worse coverage rates, while the heterogeneous model produced unbiased estimates with coverage rates close to the target nominal level (Table 2). Results for the settings with moderate or strong within-animal dependence, presented in supplementary material available at Biostatistics online, Tables S1 and S2, showed the apparent superiority of the heterogeneous model. Supplementary material available at Biostatistics online, Figure S1, shows the estimate of VE(t) versus Inline graphic. The homogeneous model underestimated VE(t), particularly at the early stages of the study, while the estimated VE(t) curve using the heterogeneous model was very close to the true VE(t) curve.

Table 1.

Results of simulation study 1 to compare the homogeneous infection risk model (Homogeneous) and heterogeneous infection risk model (Heterogeneous) at the sample sizes Inline graphic or Inline graphic under the following two cases: data were generated under the independent risk of infection Inline graphic and data were generated under the weak strength of dependence in the risk of infection Inline graphic across challenges within each animal. Monte-Carlo mean (mean) and standard deviation (MCSD) of estimates and Inline graphic confidence interval coverage rates Inline graphic based on the normal approximation of estimates Inline graphic CR) using Inline graphic simulations are reported

Data without within-animal dependence (Inline graphic)
Data with within-animal dependence (Inline graphic)
Fitting model Homogeneous
Heterogeneous
Homogeneous
Heterogeneous
Sample size True value 40 200 1000 40 200 1000 True value 40 200 1000 40 200 1000
Inline graphic Mean Inline graphic3.90 Inline graphic5.33 Inline graphic3.95 Inline graphic3.90 Inline graphic5.27 Inline graphic3.93 Inline graphic3.89 Inline graphic3.89 Inline graphic5.20 Inline graphic4.07 Inline graphic4.03 Inline graphic5.02 Inline graphic3.94 Inline graphic3.90
MCSD 4.53 0.30 0.12 4.42 0.30 0.12 4.05 0.31 0.13 3.89 0.33 0.14
95% CR 88.7 95.3 96.0 88.4 95.8 96.0 92.1 94.4 84.7 92.2 95.2 95.8
Inline graphic Mean 2.16 3.57 2.20 2.16 3.59 2.23 2.17 2.22 3.09 2.01 1.96 3.34 2.28 2.22
MCSD 4.54 0.31 0.13 4.42 0.32 0.13 4.05 0.34 0.14 3.87 0.36 0.15
95% CR 89.6 95.5 96.8 89.7 96.2 96.8 86.8 84.2 48.2 91.4 94.5 95.0
Inline graphic Mean 3.20 4.69 3.25 3.20 4.83 3.36 3.25 3.42 3.78 2.64 2.59 4.62 3.46 3.41
MCSD 4.54 0.31 0.13 4.42 0.36 0.14 4.07 0.33 0.14 3.96 0.51 0.22
95% CR 89.5 95.9 95.5 91.3 97.1 98.1 70.5 30.1 0.0 93.3 95.8 95.0
Inline graphic Mean Inline graphic1.00 Inline graphic1.05 Inline graphic1.00 Inline graphic1.00 Inline graphic1.14 Inline graphic1.06 Inline graphic1.03 Inline graphic1.00 Inline graphic0.69 Inline graphic0.70 Inline graphic0.68 Inline graphic1.03 Inline graphic1.03 Inline graphic1.00
MCSD 0.39 0.17 0.07 0.44 0.19 0.08 0.42 0.18 0.08 0.66 0.28 0.12
95% CR 93.8 94.0 94.5 97.9 98.1 97.3 84.1 56.5 1.3 94.8 95.2 95.8
Inline graphic Mean 0.00 0.13 0.08 0.04 0.89 0.94 0.86 0.87
MCSD 0.21 0.12 0.05 0.96 0.44 0.19

Inline graphic, Inline graphic, where Inline graphic denotes estimates from the Inline graphicth simulation for Inline graphic.

Table 2.

Results of simulation Study 1 to compare the homogeneous infection risk model (Homogeneous) and heterogeneous infection risk model (Heterogeneous) at the sample sizes Inline graphic or Inline graphic under the following two casesInline graphic data were generated under the independent risk of infection Inline graphic and data were generated under the weak strength of dependence in the risk of infection Inline graphic across challenges within each animal. Monte-Carlo mean (mean) and standard deviation (MCSD) of perchallenge VE estimates at each dose level Inline graphic and Inline graphic and Inline graphic confidence interval coverage rates Inline graphic based on the normal approximation of perchallenge VE Inline graphic CR) and normal approximation of the logit-transformed perchallenge VE Inline graphic CR (LInline graphic using Inline graphic simulations are reported

Data without within-animal dependence (Inline graphic)
Data with within-animal dependence (Inline graphic)
Fitting model Homogeneous
Heterogeneous
Homogeneous
Heterogeneous
Sample size True value 40 200 1000 40 200 1000 True value 40 200 1000 40 200 1000
Inline graphic Mean 0.63 0.62 0.63 0.63 0.65 0.64 0.64 0.63 0.45 0.49 0.49 0.56 0.62 0.62
MCSD 0.15 0.06 0.03 0.15 0.06 0.03 0.23 0.09 0.04 0.30 0.10 0.04
95% CR 90.3 94.3 94.6 92.5 95.0 96.3 97.2 73.1 2.6 92.4 93.8 95.7
95% CR (L) 94.5 95.0 94.9 98.6 97.1 96.8 99.1 72.7 2.6 97.8 96.5 96.4
Inline graphic Mean 0.61 0.61 0.61 0.61 0.63 0.62 0.62 0.59 0.44 0.48 0.47 0.53 0.59 0.59
MCSD 0.15 0.06 0.03 0.15 0.06 0.03 0.22 0.09 0.04 0.27 0.10 0.04
95% CR 90.6 94.3 94.9 92.3 94.8 96.4 96.7 81.5 9.6 92.4 94.5 95.8
95% CR (L) 94.7 95.0 95.1 98.3 96.4 96.7 98.4 83.6 9.1 96.7 96.2 96.1
Inline graphic Mean 0.57 0.56 0.57 0.57 0.57 0.58 0.58 0.52 0.43 0.47 0.46 0.45 0.51 0.52
MCSD 0.14 0.06 0.03 0.14 0.06 0.03 0.22 0.09 0.04 0.23 0.09 0.04
95% CR 91.3 93.8 94.7 90.7 94.4 95.4 95.2 93.7 66.9 93.1 94.5 95.4
95% CR (L) 94.4 95.2 94.5 94.7 95.1 95.8 96.6 98.4 67.6 96.2 95.8 95.5

Inline graphic, Inline graphic, where Inline graphic denotes estimates from the Inline graphicth simulation for Inline graphic.

For settings with Inline graphic, LRT testing for vaccine effects using the homogenous model had elevated Type I error rates, while LRT using the heterogeneous model and the log-rank test by contrast had Type I error rates close to the target nominal level (Table 3). LRT using the heterogeneous model produced the highest power; the gain in power by accounting for within-animal dependence increased as Inline graphic increased. In general, power for testing vaccine effects decreases as Inline graphic increases. Consequently, a much larger sample size will be required to detect a weak vaccine effect in the presence of moderate or strong within-animal dependence. Supplementary material available at Biostatistics online, Table S3, shows the results of LRT for testing heterogeneity (Inline graphic) based on the heterogeneous model. LRT had well-controlled Type I error rates, and the power of the test increased with the sample size and the magnitude of the within-animal dependence.

Table 3.

Results of simulation Study 1 to compare the Type I error rate and power of the tests for the vaccine effects at different strength of within-animal dependence Inline graphic or Inline graphic and sample sizes Inline graphic or Inline graphic. Three tests are compared: The likelihood ratio test (LRT) using the homogeneous infection risk model (LRT-HomoInline graphic LRT using the heterogeneous infection risk model (LRT-HeteroInline graphic and the log-rank test (Log-rank). Rejection rates Inline graphic over Inline graphic replications using Inline graphic Inline graphic distribution under the null Inline graphic and the alternative Inline graphic hypotheses are reported for the Type I error rate and powerInline graphic respectively

Inline graphic
Inline graphic
Inline graphic
Inline graphic
LRT
LRT
LRT
LRT
Sample size Homo Hetero Log-rank Homo Hetero Log-rank Homo Hetero Log-rank Homo Hetero Log-rank
Type I error rate 40 7.2 6.8 7.2 6.7 6.1 5.1 6.8 7.4 5.4 6.4 7.5 5.0
200 5.9 5.4 5.9 5.8 5.3 4.5 6.9 5.7 5.3 7.1 4.8 5.5
1000 5.6 5.2 5.2 5.9 4.9 5.1 5.7 5.6 4.5 6.6 5.3 5.4
Power 40 82.6 81.2 81.3 44.0 45.3 41.4 24.6 29.0 22.3 14.9 18.7 12.5
200 100.0 100.0 100.0 98.3 98.8 98.3 79.9 87.3 79.2 39.2 57.5 37.9
1000 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 100.0 95.6 99.8 95.6

More results of the simulation study investigating the performance of the heterogeneous model are presented in supplementary material available at Biostatistics online, Tables S4–S9 in Sections S2 and S3. A similar pattern comparing heterogeneous and homogenous models was observed in the extended model that includes interactions between the dose levels and vaccine assignment (supplementary material available at Biostatistics online, Tables S7–S9). We also investigated the robustness of the proposed model under risk model misspecification. Results are presented in supplementary material available at Biostatistics online, Tables S10–S11 in Section S4. In general, the proposed heterogeneous model has a compatible or slightly better performance compared with the homogeneous model or the log-rank test under a moderate degree of working model misspecification.

4.2. Simulation Study 2

4.2.1. Set-up

We considered Inline graphic or 4.93 corresponding to Inline graphic or 0.75, respectively. Values for Inline graphic were chosen such that the perchallenge probabilities of infection in the placebo group for Inline graphic and 4.93, equal 0.1, 0.09, and 0.08, respectively, at dose level 1, 0.18, 0.17, and 0.14 at dose level 2, and 0.27, 0.24, and 0.19 at dose level 3. Various values for Inline graphic corresponding to Inline graphic or 0.7 were considered, where Inline graphic is perchallenge vaccine efficacy at the lowest level dose. We assumed that the cost for adding an animal is 50 times the cost for an extra challenge per animal and set the maximum budget to be Inline graphic. A range of the maximum number of challenges per dose level, Inline graphic, were considered to allow for at least two challenges at each dose level.

4.2.2. Results

Our goal was to identify the optimal pair of sample size and maximum number of challenges per animal to achieve the best efficiency in estimating Inline graphic given budget constraints. In the first stage, we estimated the variance of Inline graphic for each Inline graphic using 10 simulated datasets, each containing Inline graphic samples. In the second stage, we calculated Inline graphic, the efficiency of Inline graphic for Inline graphic relative to {n(2), 2} by (3.1) and averaged Inline graphic over the 10 simulated datasets to obtain a smooth estimate. We then identified the best pair of Inline graphic that achieved the greatest average relative efficiency compared with {n(2), 2}. Figure 1 displays Inline graphic for all Inline graphic and for the selected Inline graphic by the proposed approach. The efficiency gain of the selected pair compared with the reference pair Inline graphic ranges from 2 to 78% for varying Inline graphic and Inline graphic. Under fixed within-animal dependence, with the increase in vaccine efficacy, the maximum efficiency was achieved at a larger number of challenges and thus a smaller sample size. Under fixed vaccine efficacy, with an increase in within-animal dependence, the maximum efficiency was achieved at a larger sample size and thus a smaller number of challenges. Comparisons of the estimated Inline graphic based on the two-stage method and those obtained from the standard Monte-Carlo simulations for Inline graphic and Inline graphic or 0.55 are given in supplementary material available at Biostatistics online, Figure S2. The pattern of estimated Inline graphic versus Inline graphic was similar between the two.

Fig. 1.

Fig. 1.

The relative efficiency (RE) of the Inline graphic estimator for each pair of Inline graphic compared with Inline graphic. Inline graphic is defined by Inline graphic. Intracluster correlation Inline graphic and perchallenge vaccine efficacy at the first dose Inline graphic are explored. The selected pair of Inline graphic by the proposed approach is indicated on the X-axis, and Inline graphic is shown. Black line and gray lines represent the average Inline graphic and Inline graphic over 10 replications. The sample size of the data for the first-stage simulation Inline graphic.

5. Data analysis

In the NHP study described by Qureshi and others (2012), 43 adult male Rhesus Macaca mulatta (RM) were randomized to five treatment groups: Ad5 Vx-SIV (Inline graphic), Ad5 Vx-empty (8), Vx-SIV (9), Vx-empty (9), and naive control (8). All macaques were challenged by a series of escalating penile exposures including three levels of SIVmac 251 viruses: Inline graphic of virus 10 times weekly (the lowest level dose), followed by Inline graphic of virus 10 times weekly (the middle level dose), and finally Inline graphic of virus twice a day (the highest level dose). Animals were evaluated for infection status after each challenge. Time to infection was defined as the first week in which Inline graphicvRNA copies are >2 and stay >2 for subsequent measurements within 4 weeks. The process of immunization can be found in detail in Qureshi and others (2012).

Previously, Qureshi and others (2012) fit the discrete-time survival model to the data assuming homogeneous and constant risk of infection. In this section, we reanalyze the data allowing for possible heterogeneity in the risk of infection across animals and allowing for adjustment of changing dose levels over time. Results for the average risk of infection across different time periods, as presented in Qureshi and others (2012), suggest possible differences in vaccine efficacy across challenge dose levels, particularly between the highest and the other two dose levels. To allow for some flexibility in modeling VE as a function of the challenging dose, we include the interaction term between the highest dose level and treatment assignment in the model of infection risk, in addition to main effects for each dose level and treatment: Inline graphic. A similar interaction model was considered in analyzing an HIV vaccine trial in Robb and others (2012). We fit both the homogeneous and the heterogeneous infection risk models. A goodness-of-fit test (Pan and Lin, 2005) for the proposed heterogeneous model did not show any significant deviation from the model assumptions (supplementary material available at Biostatistics online, Figure S3 in Section S6).

Using the heterogeneous model, the LRT did not find any significant within-animal dependence. However, the power to detect small heterogeneity in this study with 43 animals is very limited. Regoes (2012) studied heterogeneous susceptibility in eight datasets for HIV using an RLD design and failed to find a statistically significant heterogeneous susceptibility except for the data from Letvin 11 stm which had the largest sample size (43 animals for each of two vaccine groups). The results in Regoes (2012) and those in our analysis suggest that larger sample size is important for detecting heterogeneity in the risk of infection. Given the limited power of the test in practice, fitting a heterogeneous model to the data allowing for potential heterogeneity across animals can be useful to complement andInline graphicor confirm the results of the simpler homogeneous model, even in the presence of nonsignificant testing results.

There were no statistically significant differences identified in the risk of infection among the five vaccine groups based on the LRT (Inline graphic and 0.202 for the homogeneous and the heterogeneous models, respectively). Based on both the homogenous and heterogeneous models, estimated perchallenge VE comparing composite groups of particular interest as defined in Qureshi and others (2012) are reported along with 95% confidence intervals (Table 4). The negative perchallenge vaccine efficacy estimate for Ad5-Vx-SIV vaccine against Ad5-Vx-empty or other SIV-negative vaccines in the lowest and the middle dose levels suggests a possibly greater risk of infection in the Ad5 seropositive animals immunized with the Ad5 SIV. This result is consistent with Qureshi and others (2012). More investigation through larger studies, however, is required to confirm this observation. Estimated perchallenge probabilities of infection are given in supplementary material available at Biostatistics online, Tables S12–S14 in Section S6. Estimation of perchallenge infection risk and vaccine efficacy for each specific dose level was not achievable in earlier analyses that assumed constant infection probability and VE across challenges and animals. These measures provide valuable information to biologists for planning future RLD experiments.

Table 4.

Perchallenge VE Inline graphic estimates (Estimate) along with standard error estimates (SE) and 95% confidence intervals (95% CI) obtained by the homogeneous and heterogeneous infection risk models including the interaction between the highest dose level and vaccine group in the NHP data Qureshi and others (2012) are reported. Confidence interval is built using normal approximation of Inline graphic

The lowest level virus (Inline graphic)
The middle level virus (Inline graphic)
The highest level virus (Inline graphic)
Model Comparison group Estimate SE 95% CI Estimate SE 95% CI Estimate SE 95% CI
Homogeneous Ad5 Vx-SIV versus Ad5 Vx-empty Inline graphic3.58 5.12 (Inline graphic13.61, 6.45) Inline graphic3.49 4.96 (Inline graphic13.21, 6.24) 0.65 0.33 (0.00, 1.30)
Ad5 Vx-SIV Inline graphic Vx-SIV versus All others Inline graphic0.46 0.67 (Inline graphic1.78, 0.85) Inline graphic0.45 0.65 (Inline graphic1.72, 0.82) 0.29 0.39 (Inline graphic0.48, 1.05)
Ad5 Vx-SIV versus All others Inline graphic0.15 0.65 (Inline graphic1.43, 1.12) Inline graphic0.15 0.63 (Inline graphic1.38, 1.08) 0.58 0.39 (Inline graphic0.19, 1.34)
Heterogeneous Ad5 Vx-SIV versus Ad5 Vx-empty Inline graphic3.60 5.44 (Inline graphic14.26, 7.06) Inline graphic3.50 5.22 (Inline graphic13.73, 6.73) 0.65 0.41 (Inline graphic0.15, 1.44)
Ad5 Vx-SIV Inline graphic Vx-SIV versus All others Inline graphic0.47 0.77 (Inline graphic1.98, 1.05) Inline graphic0.45 0.74 (Inline graphic1.89, 0.99) 0.28 0.46 (Inline graphic0.61, 1.18)
Ad5 Vx-SIV versus All others Inline graphic0.16 0.76 (Inline graphic1.64,1.33) Inline graphic0.15 0.73 (Inline graphic1.59, 1.28) 0.58 0.41 (Inline graphic0.23, 1.38)

Akaike information criterion with a correction for finite sample sizes (Inline graphic) of the homogeneous and the heterogeneous infection risk models including the main effects for the five vaccine groups and dose levels and their interactions for the highest dose levels are 212.12 and 216.03, respectively. Inline graphic, where Inline graphic is the number of parameters, Inline graphic is the number of animals, and Inline graphic is the estimated log likelihood. Smaller value is more preferred.

6. Discussion

In this article, we propose to fit a discrete-time survival model with random effects to take into account the potential heterogeneity among animals and the resulting within-animal dependence arising in RLD experiments. Simulation studies demonstrate that in the presence of heterogeneity, the homogeneous model ignoring within-animal dependence can have an inflated Type I error rate, loss of power for testing the vaccine's effect, biased estimates of vaccine efficacy, and low coverage rates. These problems were resolved by fitting a heterogeneous infection risk model. In the absence of between-animal heterogeneity, the heterogeneous infection risk model produces results comparable with the homogeneous infection risk model. We also propose a two-stage approach to determine the optimal balance between the number of animals and the number of challenges, in order to maximize the efficiency of VE estimates under limited resources. The usefulness of the method in designing an RLD experiment is demonstrated through simulation studies.

The heterogeneous infection risk model is more plausible from a biological point of view compared to a homogeneous model. In practice, however, most NHP studies on HIV vaccines have a small sample size with weak vaccine efficacy, which can lead to insufficient power to detect heterogeneity, as discussed in Regoes (2012). Nevertheless, heterogeneity among animals is commonly believed to exist. Our methods address the need to assess and accommodate the existence of heterogeneity in risk modeling, even when the study may not be powerful enough to lead to significant test results.

The heterogeneous infection risk model developed in this article is computationally advantageous. It leads to a simple closed form for the marginal likelihood function. This allows us to avoid numerical integrations that are oftentimes required for other types of random effects models. Moreover, we are able to derive vaccine efficacy as a simple function of regression parameters.

Further research in the statistical methods is warranted to accommodate more complex but realistic scenarios arising in RLD experiments. One of the worthwhile avenues of research is to relax the assumption of no “memory” of challenge history (Regoes and others, 2005; Regoes, 2012). As challenges proceed, the differences in vaccine efficacy between vaccinated and unvaccinated groups will be diminished if previous challenges can cause immunization. Time-varying markers reflecting the potential immunization status of each animal might help investigating this issue. Identifying and modeling unintended immunization resulting from past challenges is a difficult task that requires future development in both scientific understanding and statistical techniques.

Supplementary material

Supplementary Material is available at http://biostatistics.oxfordjournals.org.

Funding

This work was supported by the National Institutes of Health (P30 CA015704, R01 GM106177, R37AI05465, and R01 CA152089).

Supplementary Material

Supplementary Data

Acknowledgements

Conflict of Interest: None declared.

References

  1. Conaway M. R. A random effects model for binary data. Biometrics. 1990;46((2)):317–328. [Google Scholar]
  2. Coull B. A., Houseman E. A., Betensky R. A. A computationally tractable multivariate random effects model for clustered binary data. Biometrika. 2006;93((3)):587–599. [Google Scholar]
  3. Ellenberger D., Otten R. A., Li B., Aidoo M., Rodriguez I. V., Sariol C. A., Martinez M., Monsour M., Wyatt L., Hudgens M. G. HIV-1 DNA/MVA vaccination reduces the per exposure probability of infection during repeated mucosal SHIV challenges. Virology. 2006;352((1)):216–225. doi: 10.1016/j.virol.2006.04.005. and others. [DOI] [PubMed] [Google Scholar]
  4. García-Lerma J. G., Otten R. A., Qari S. H., Jackson E., Cong M., Masciotra S., Luo W., Kim C., Adams D. R., Monsour M. Prevention of rectal SHIV transmission in macaques by daily or intermittent prophylaxis with emtricitabine and tenofovir. PLOS Medicine. 2008;5((2)):e28. doi: 10.1371/journal.pmed.0050028. and others. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Goldman N., Whelan S. Statistical tests of γ-distributed rate heterogeneity in models of sequence evolution in phylogenetics. Molecular Biology and Evolution. 2000;17((6)):975–978. doi: 10.1093/oxfordjournals.molbev.a026378. [DOI] [PubMed] [Google Scholar]
  6. Hudgens M. G., Gilbert P. B. Assessing vaccine effects in repeated low-dose challenge experiments. Biometrics. 2009;65((4)):1223–1232. doi: 10.1111/j.1541-0420.2009.01189.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Hudgens M. G., Gilbert P. B., Mascola J. R., Wu C. D., Barouch D. H., Self S. G. Power to detect the effects of HIV vaccination in repeated low-dose challenge experiments. Journal of Infectious Diseases. 2009;200((4)):609–613. doi: 10.1086/600891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kalbfleisch J. D., Prentice R. L. The Statistical Analysis of Failure Time Data. Hoboken, New Jersey: John Wiley & Sons; 2011. Volume 360. [Google Scholar]
  9. Moerbeek M. Sample size issues for cluster randomized trials with discrete-time survival endpoints. Methodology. 2012;8((4)):146–158. [Google Scholar]
  10. Pan Z., Lin D. Y. Goodness-of-fit methods for generalized linear mixed models. Biometrics. 2005;61((4)):1000–1009. doi: 10.1111/j.1541-0420.2005.00365.x. [DOI] [PubMed] [Google Scholar]
  11. Qureshi H., Ma Z. M., Huang Y., Hodge G., Thomas M. A., DiPasquale J., DeSilva V., Fritts L., Bett A. J., Casimiro D. R. Low-dose penile SIVmac251 exposure of rhesus macaques infected with adenovirus type 5 (Ad5) and then immunized with a replication-defective Ad5-based SIV Gag/Pol/Nef vaccine recapitulates the results of the phase IIb Step trial of a similar HIV-1 vaccine. Journal of Virology. 2012;86((4)):2239–2250. doi: 10.1128/JVI.06175-11. and others. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2012. ISBN 3-900051-07-0. [Google Scholar]
  13. Regoes R. R. The role of exposure history on HIV acquisition: insights from repeated low-dose challenge studies. PLOS Computational Biology. 2012;8((11)):e1002767. doi: 10.1371/journal.pcbi.1002767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Regoes R. R., Longini I. M., Feinberg M. B., Staprans S. I. Preclinical assessment of HIV vaccines and microbicides by repeated low-dose virus challenges. PLOS Medicine. 2005;2((8)):e249. doi: 10.1371/journal.pmed.0020249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Reynolds M. R., Weiler A. M., Piaskowski S. M., Kolar H. L., Hessell A. J., Weiker M., Weisgrau K. L., León E. J., Rogers W. E., Makowsky R. Macaques vaccinated with simian immunodeficiency virus SIVmac239 δ nef delay acquisition and control replication after repeated low-dose heterologous SIV challenge. Journal of Virology. 2010;84((18)):9190–9199. doi: 10.1128/JVI.00041-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Robb M. L., Rerks-Ngarm S., Nitayaphan S., Pitisuttithum P., Kaewkungwal J., Kunasol P., Khamboonruang C., Thongcharoen P., Morgan P., Benenson M. Ad hoc analysis of behavior and time as co-variates of the Thai phase III efficacy trial: RV 144. The Lancet Infectious Diseases. 2012;12((7)):531. doi: 10.1016/S1473-3099(12)70088-9. and others. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Rodrıguez G., Elo I. Intra-class correlation in random-effects models for binary data. The Stata Journal. 2003;3((1)):32–46. [Google Scholar]
  18. Scheike T. H., Jensen T. K. A discrete survival model with random effects: an application to time to pregnancy. Biometrics. 1997;53((1)):318–329. [PubMed] [Google Scholar]
  19. Self S. G., Liang K. Y. Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association. 1987;82((398)):605–610. [Google Scholar]
  20. Zhang S., Ahn C. Adding subjects or adding measurements in repeated measurement studies under financial constraints. Statistics in Biopharmaceutical Research. 2011;3((1)):54–64. doi: 10.1198/sbr.2010.10022. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Biostatistics (Oxford, England) are provided here courtesy of Oxford University Press

RESOURCES