Estimating vaccine efficacy over time after a randomized study is unblinded

Anastasios A Tsiatis; Marie Davidian

doi:10.1111/biom.13509

2021 Aug 13. Online ahead of print. doi: 10.1111/biom.13509


PMCID: PMC8444907 PMID: 34174097

Abstract

The COVID‐19 pandemic due to the novel coronavirus SARS‐CoV‐2 has inspired remarkable breakthroughs in the development of vaccines against the virus and the launch of several phase 3 vaccine trials in Summer 2020 to evaluate vaccine efficacy (VE). Trials of vaccine candidates using mRNA delivery systems developed by Pfizer‐BioNTech and Moderna have shown substantial VEs of 94–95%, leading the US Food and Drug Administration to issue Emergency Use Authorizations and subsequent widespread administration of the vaccines. As the trials continue, a key issue is the possibility that VE may wane over time. Ethical considerations dictate that trial participants be unblinded and those randomized to placebo be offered study vaccine, leading to trial protocol amendments specifying unblinding strategies. Crossover of placebo subjects to vaccine complicates inference on waning of VE. We focus on the particular features of the Moderna trial and propose a statistical framework based on a potential outcomes formulation within which we develop methods for inference on potential waning of VE over time and estimation of VE at any postvaccination time. The framework clarifies assumptions made regarding individual‐ and population‐level phenomena and acknowledges the possibility that subjects who are more or less likely to become infected may be crossed over to vaccine differentially over time. The principles of the framework can be adapted straightforwardly to other trials.

Keywords: crossover, inverse probability weighting, potential outcomes, randomized phase 3 vaccine trial, waning vaccine efficacy

1. INTRODUCTION

The primary objective of a vaccine trial is to estimate vaccine efficacy (VE). Typically, these trials are double‐blind, placebo‐controlled studies in which participants are randomized to either vaccine or placebo and followed for the primary endpoint. This endpoint is often time to viral infection, on which inference on VE is based, where VE is defined as a measure of reduction in infection risk for vaccine relative to placebo, expressed as a percentage.

Vaccine trials have become the focus of immense global interest as a result of the COVID‐19 disease pandemic due to the novel coronavirus SARS‐CoV‐2 (COVID‐19 Vaccine Tracker). The pandemic inspired unprecedented scientific breakthroughs in the rapid development of vaccines against SARS‐CoV‐2, culminating in the launch of several large phase 3 vaccine trials in Summer 2020. Trials in the United States studying the vaccine candidates using messenger RNA (mRNA) delivery systems developed by Pfizer‐BioNTech and Moderna began in July 2020 and demonstrated substantial evidence of VEs of 94–95% at interim analyses, leading the US Food and Drug Administration (FDA) to issue Emergency Use Authorizations (EUAs) for both vaccines in December 2020 and to the rollout of vaccination programs shortly thereafter.

Implicit in the primary analysis in these trials is the assumption that VE is constant over the study period, and, with primary endpoint time to infection, VE is represented by (1 − the ratio of the hazard rate for vaccine to that for placebo), estimated based on a Cox proportional hazards model. As the trials continue following the EUAs, among the many issues to be addressed is the possibility that VE may wane over time. Principled evaluation of the nature and extent of waning of VE is of critical public health importance, as waning has implications for measures to control the pandemic. Were all participants in the trials to continue on their randomized assignments (study vaccine or placebo), evaluation of potential waning of VE would be straightforward. However, once efficacy is established, ethical considerations dictate the possibility of unblinding all participants and offering the study vaccine to those randomized to placebo. After consultation with stakeholders, Pfizer and Moderna issued amendments to their trial protocols specifying unblinding strategies and modifications to planned analyses.

Crossover of placebo subjects to the study vaccine of necessity complicates inference on waning of VE and has inspired recent research (Follmann et al., 2020; Fintzi and Follmann, 2021; Lin et al., 2021). We propose a statistical framework within which we develop methods for inference on whether or not VE wanes over time based on data where subjects are unblinded and those on placebo may cross over to study vaccine and in which assumptions made regarding individual and population phenomena are made transparent. It is possible that subjects who are more or less likely to become infected could be unblinded and cross over to vaccine differentially over time, which could lead to biased inferences due to confounding; accordingly, this possibility is addressed explicitly in the framework. The first author (AAT) has the privilege of serving on the Data and Safety Monitoring Board for all U.S. government‐sponsored COVID‐19 vaccine trials and is thus well acquainted with the unblinding approach for the Moderna trial. Accordingly, the development is based on the specifics of this trial, but the principles can be adapted to the features of other trials.

In Section 2, we review the Moderna trial and the resulting data. We present a conceptual framework in which we define VE precisely as a function of time postvaccination in Section 3. In Section 4, we develop a formal statistical framework within which we propose methodology for estimation of VE and describe its practical implementation in Section 5. Simulations demonstrating performance are presented in Section 6.

2. CLINICAL TRIAL STRUCTURE AND DATA

We first describe the timeline of the Moderna Coronavirus Efficacy (COVE) trial (Baden et al., 2020) on the scale of calendar time. The trial opened on July 27, 2020 (time 0), and reached full accrual at time $T_{A}$ (October 23, 2020). On December 11, 2020, denoted by $T_{P}$ , the FDA issued an EUA for the Pfizer vaccine, followed by an EUA for the Moderna mRNA‐1273 vaccine on $T_{M} =$ December 18, 2020. Amendment 6 of the study protocol was issued on December 23, 2020 and specified the unblinding strategy (see figure 2 of the protocol) under which, starting on $T_{U} =$ December 24, 2020, study participants are scheduled on a rolling basis over several months for Participant Decision clinic visits (PDCVs) at which they will be unblinded. If originally randomized to vaccine, participants continue to be followed; if randomized to placebo, participants can receive the Moderna vaccine at the PDCV or refuse and either seek another vaccine outside the study or remain unvaccinated. Let $T_{C}$ denote the time at which all PDCVs have taken place. The study will continue until time $T_{F}$ at which all participants will have completed full follow up at 24 months after initial treatment assignment. Assume that the analysis of VE using the methods in Sections 4.4 and 5 takes place at time $T_{C} \leq L \leq T_{F}$ , where all participants have achieved the primary endpoint, requested to be unblinded, or attended the PDCV by L. The Moderna vaccine is administered in two doses, ideally 4 weeks apart, and is not thought to achieve full efficacy until 2 weeks following the second dose. Accordingly, the primary endpoint is defined as symptomatic viral infection occurring after a lag of $ℓ = 6$ weeks following the initial dose.

Under this scheme, we characterize the data on a given participant as follows. Let $0 \leq E \leq T_{A}$ denote the calendar time at which the subject entered the trial, $X$ denote baseline covariates, and $A = 0$ (1) if assigned to placebo (vaccine). Denote observed time to infection on the scale of calendar time as U, and $Δ = I (U \leq L)$ , where $I (B) = 1$ if B is true and 0 otherwise. At $T_{P}$ , availability of the Pfizer vaccine commenced, at which point some subjects not yet infected requested to be unblinded. Denote by R (calendar time) the minimum of (i) time to such an unblinding, in which case $T_{P} \leq R < T_{U}$ , and define $Γ = 1$ ; (ii) time of PDCV, so $T_{U} \leq R < T_{C}$ , and let $Γ = 2$ ; or (iii) time to infection, in which case $R = U$ and $Γ = 0$ . If $Γ \geq 1$ and $A = 1$ , so that the subject was randomized to vaccine, she/he continues to be followed; if $A = 0$ , she/he can agree to receive the Moderna vaccine, $Ψ = 1$ or refuse, $Ψ = 0$ . We distinguish the cases $Γ = 1$ and 2 to acknowledge different unblinding dynamics before and after $T_{U}$ . Because a very small number of participants requested unblinding before $T_{P}$ , and although the protocol allows participants to refuse unblinding at PDCV, all subjects are strongly encouraged to unblind, we do not include these possibilities in the formulation.

Table 1 summarizes the timeline and observed data. The trial data are thus

O_{i} = \{E_{i}, X_{i}, A_{i}, U_{i}, Δ_{i}, R_{i}, Γ_{i}, I (Γ_{i} \geq 1, A_{i} = 0) Ψ_{i}\}, \quad i = 1, \dots, n,

(1)

independent and identically distributed (iid) across i.
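To make the structure of one observed record $O_{i}$ in (1) concrete, the following is a minimal sketch of how it might be held in code. The class name, field names, and validation rules are illustrative assumptions derived from the definitions in this section; they are not part of the trial's data specification.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class ParticipantRecord:
    """One observed record O_i; all times are on the calendar-time scale.

    Hypothetical container for illustration only.
    """
    E: float                    # study entry time, 0 <= E <= T_A
    A: int                      # 0 = placebo, 1 = vaccine
    U: float                    # observed time to infection
    Delta: int                  # I(U <= L)
    R: float                    # time of unblinding or infection, whichever is first
    Gamma: int                  # 0 = infected first, 1 = requested unblinding, 2 = PDCV
    Psi: Optional[int] = None   # observed only when Gamma >= 1 and A == 0
    X: dict = field(default_factory=dict)  # baseline covariates

    def __post_init__(self):
        if self.A not in (0, 1):
            raise ValueError("A must be 0 or 1")
        if self.Gamma not in (0, 1, 2):
            raise ValueError("Gamma must be 0, 1, or 2")
        if self.Gamma == 0 and self.R != self.U:
            raise ValueError("Gamma = 0 requires R = U")
        offered = self.Gamma >= 1 and self.A == 0
        if offered and self.Psi not in (0, 1):
            raise ValueError("Psi must be 0 or 1 for unblinded placebo subjects")
        if not offered and self.Psi is not None:
            raise ValueError("Psi is defined only when Gamma >= 1 and A = 0")

    def crossover_term(self) -> int:
        """Return I(Gamma >= 1, A = 0) * Psi, the last component of O_i."""
        if self.Gamma >= 1 and self.A == 0:
            return self.Psi
        return 0
```

The last component of (1) is written as a product so that it is defined for every participant: it equals $Ψ$ for unblinded placebo subjects and 0 otherwise, which `crossover_term` mirrors.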

TABLE 1. Summary of notation. All times are on the scale of calendar time, where time 0 is the start of the trial

Trial Milestones

| Variable | Definition |
| --- | --- |
| $T_{A}$ | Full accrual reached, October 23, 2020 |
| $T_{P}$ | Pfizer granted EUA, December 11, 2020 |
| $T_{M}$ | Moderna granted EUA, December 18, 2020 |
| $T_{U}$ | Participant Decision clinic visits (PDCVs) commence, December 24, 2020 |
| $T_{C}$ | PDCVs conclude |
| $T_{F}$ | Follow‐up concludes, trial ends |
| $ℓ$ | Lag between initial vaccine dose and full efficacy, 6 weeks, $T_{P} - T_{A} > ℓ$ |
| $L$ | Time of analysis of vaccine efficacy using the proposed methods; $L >$ time at which all subjects have achieved the endpoint, requested unblinding, or attended the PDCV, $L \leq T_{F}$ |

Observed Data on a Trial Participant

| Variable | Definition |
| --- | --- |
| $E$ | Study entry time, $0 \leq E \leq T_{A}$ |
| $X$ | Baseline information |
| $A$ | Treatment assignment, placebo, $A = 0$ , or vaccine, $A = 1$ |
| $U, Δ$ | Time to symptomatic infection, indicator of infection by time $L$, $Δ = I (U \leq L)$ |
| $R, Γ$ | Time to requested unblinding, PDCV/requested unblinding, or infection, whichever comes first |
| | $Γ = 0$ : $R = U$ , infection occurs before requested/offered unblinding |
| | $Γ = 1$ : $R =$ time to requested unblinding, $T_{P} \leq R < T_{U}$ |
| | $Γ = 2$ : $R =$ time to PDCV or requested unblinding, $T_{U} \leq R < T_{C}$ |
| $Ψ$ | For $A = 0$ , $Γ \geq 1$ , indicator of whether subject receives Moderna vaccine, $Ψ = 1$ , or refuses and seeks another vaccine outside the study or remains unvaccinated, $Ψ = 0$ |


3. CONCEPTUALIZATION OF VACCINE EFFICACY

Similar to Halloran et al. (1996) and Longini and Halloran (1996), we consider the following framework in which to conceptualize VE. The study population, comprising individuals for which inference on VE is of interest, is that of individuals susceptible to infection, represented by the trial participants. There is a population of individuals outside the trial with which trial participants interact, assumed to be much larger than the number of participants, so that interactions among participants are much less likely than interactions with the outside population. The probability that a trial participant will become infected at calendar time t depends on three factors: $c (t)$ , the contact rate, the number of contacts with the outside population per unit time; $p (t)$ , the prevalence of infections in the outside population at t; and $π (t)$ , the transmission probability at t, the probability a susceptible individual in the study population will become infected per contact with an infected individual from the outside population. Dependence of $π (t)$ on time acknowledges the emergence of new variants of the virus, which may be more or less virulent, as in the COVID‐19 pandemic. Assuming random mixing, $p (t) c (t)$ is the contact rate at time t with infected individuals, and the infection rate at time t is $p (t) c (t) π (t)$ .

We adapt this framework to the COVID‐19 pandemic. The prevalence rate in the pandemic can vary substantially in time and space, so denote by S the trial site at which a participant is enrolled, and let $p (t, s)$ be the prevalence at time t at site $S = s$ . Although $p (t, s)$ varies by t and s, assume that it is unaffected by the individuals in the trial and thus represents an external force. We view the contact rate as individual specific; accordingly, for an arbitrary individual in the study population, let the random variables ${c_{0}^{b} (t), c_{1}^{b} (t), c_{0}^{u} (t), c_{01 ℓ}^{u} (t), c_{1}^{u} (t)}$ denote potential contact rates. These potential outcomes can be regarded as individual‐specific behavioral characteristics of trial participants, where some may be more careful and make fewer contacts while others take more risks, and behavior can vary over time and by vaccination and blinding status. Here, $c_{a}^{b} (t)$ is the contact rate at time t if the individual were to receive vaccine, $a = 1$ , or placebo, $a = 0$ , and be blinded to this assignment; by virtue of blinding, it is reasonable to take $c_{1}^{b} (t) = c_{0}^{b} (t) = c^{b} (t)$ .

As in Table 1, letting ℓ denote the lag between initial dose and full efficacy, $c_{01 ℓ}^{u} (t)$ reflects behavior of a placebo subject who is unblinded, receives the Moderna vaccine, and is within ℓ weeks of vaccination. Likewise, $c_{1}^{u} (t)$ reflects behavior of any unblinded Moderna vaccine recipient after ℓ, both those originally randomized to placebo and crossed over to the vaccine and those originally randomized to vaccine. Thus, $c_{01 ℓ}^{u} (t)$ allows for more cautious behavior before full efficacy is achieved for recently vaccinated placebo subjects; in the trial, all subjects randomized to vaccine were past the full efficacy lag at the time of unblinding (as in Table 1, $T_{P} - T_{A} \geq ℓ$ ). Similar to the stable unit treatment value assumption (Rubin, 1980), assume that $c_{1}^{u} (t)$ is the same if the individual was randomized to vaccine and unblinded before t or was randomized to placebo and subsequently unblinded and crossed over to the Moderna vaccine before t. The rate $c_{0}^{u} (t)$ reflects behavior of an unblinded placebo subject who does not cross over to the Moderna vaccine and does not play a role in the development, and, as demonstrated in Section 4.4, such subjects do not contribute to the analysis of VE.

Finally, for an arbitrary participant, let the random variable $π_{0} (t)$ be the potential individual‐specific transmission probability per contact at t if she/he were to receive placebo, and let $π_{1} (t, τ)$ be the same if she/he were to receive study vaccine and have been vaccinated for $τ \geq 0$ units of time. As we now demonstrate, this formulation allows us to represent VE as a function of τ and thus consider whether or not VE wanes over time since vaccination.

With the set of potential outcomes for an arbitrary individual in the study population who enrolls at site S thus given by $\{c^{b} (t), c_{0}^{u} (t), c_{01 ℓ}^{u} (t), c_{1}^{u} (t), t > 0, π_{0} (t), π_{1} (t, τ), τ \geq 0\}$ , the infection rate in the study population at calendar time t if all individuals were to receive placebo and be blinded to that assignment is $I_{0}^{b} (t) = E \{p (t, S) c^{b} (t) π_{0} (t)\}$ ; likewise, the infection rate at t if all individuals were to receive vaccine at time $t - τ$ and be blinded to that assignment is $I_{1}^{b} (t, τ) = E \{p (t, S) c^{b} (t) π_{1} (t, τ)\}$ . The relative infection rate at t is then

R^{b} (t, τ) = \frac{I_{1}^{b} (t, τ)}{I_{0}^{b} (t)} = \frac{E \{p (t, S) c^{b} (t) π_{1} (t, τ)\}}{E \{p (t, S) c^{b} (t) π_{0} (t)\}} .

(2)

Accordingly, VE at time t after vaccination at $t - τ$ is $V E (t, τ) = 1 - R^{b} (t, τ)$ , reflecting the proportion of infections at t that would be prevented if the study population were vaccinated and on study vaccine for τ units of time during the blinded phase of the study.
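As a small numerical illustration of (2), the sketch below evaluates the two population infection rates as averages over a toy population and forms $V E = 1 - R^{b}$. The two individuals and all numeric values are invented for illustration, and the empirical mean stands in for the expectation. The point of the example is that (2) is a ratio of expectations, which generally differs from the average of individual‐level ratios.

```python
from statistics import fmean

# Hypothetical individuals at a fixed (t, tau): prevalence p(t, S),
# blinded contact rate c^b(t), and transmission probabilities
# pi_0(t) (placebo) and pi_1(t, tau) (vaccine). Values are made up.
people = [
    {"p": 0.01, "c": 10.0, "pi0": 0.20, "pi1": 0.020},
    {"p": 0.02, "c": 5.0, "pi0": 0.10, "pi1": 0.005},
]


def infection_rate(population, pi_key):
    """Empirical analogue of E{p(t, S) c^b(t) pi(t)} for the chosen pi."""
    return fmean([ind["p"] * ind["c"] * ind[pi_key] for ind in population])


def vaccine_efficacy(population):
    """VE = 1 - I_1^b / I_0^b, following (2)."""
    return 1.0 - infection_rate(population, "pi1") / infection_rate(population, "pi0")


# Average of individual-level rate ratios, shown only for contrast.
mean_of_ratios = fmean([ind["pi1"] / ind["pi0"] for ind in people])

ve = vaccine_efficacy(people)
```

With these inputs the ratio of expectations is 0.00125 / 0.015, whereas the mean of the individual ratios is 0.075, so the two summaries disagree because individuals with more exposure receive more weight in (2).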

In the sequel, we assume that $R^{b} (t, τ)$ and thus $V E (t, τ)$ depend only on τ and write $R^{b} (τ)$ and $V E (τ) = 1 - R^{b} (τ)$ . This assumption embodies the belief that, although infection rates may change over time, the relative effect of vaccine to placebo remains approximately constant and holds if (i) ${π_{1} (t, τ), π_{0} (t)} ⊥ {S, c^{b} (t)} | X$ , where $⊥$ means “independent of” and this independence is conditional on $X$ ; and (ii) $E {π_{1} (t, τ) | X} / E {π_{0} (t) | X} = q (τ)$ , so does not depend on t and $X$ . Condition (i) reflects the interpretation of $π_{1} (t, τ)$ and $π_{0} (t)$ as inherent biological characteristics of an individual, whereas S and $c^{b} (t)$ are external and behavioral characteristics, respectively; thus, once common individual and external baseline covariates are taken into account, biological and geographic/behavioral characteristics are unrelated. Condition (ii) implies that, although new viral variants may change transmission probabilities under both vaccine and placebo over time, this change stays in constant proportion, and this proportion is similar for individuals with different characteristics. Further discussion is given in Section 7 and Web Appendix B of the Supporting Information.

Within this framework, the goal of inference on waning of VE based on the data from the trial can be stated precisely as inference on $V E (τ) = 1 - R^{b} (τ)$ , $τ \geq ℓ$ , so reflecting VE after full efficacy is achieved. It is critical to recognize that, like estimands of interest in most clinical trials, $V E (τ)$ represents VE at time since vaccination τ under the original conditions of the trial, under which all participants are blinded. The challenge we address in subsequent sections is how to achieve valid inference on $V E (τ)$ , $τ \geq ℓ$ , using data from the modified trial in which blinded participants are unblinded in a staggered fashion, with placebo subjects offered the option to receive the study vaccine.

We propose a semiparametric model within which we cast this objective. Let $I_{01 ℓ}^{u} (t, τ) = E {p (t, S) c_{01 ℓ}^{u} (t) π_{1} (t, τ)}$ , $τ < ℓ$ , and $I_{1}^{u} (t, τ) = E {p (t, S) c_{1}^{u} (t) π_{1} (t, τ)}$ , $τ \geq ℓ$ , be the infection rates in the study population at t if all individuals were to receive vaccine at time $t - τ$ and be unblinded to that fact. Analogous to (i) above, assume that ${π_{1} (t, τ), π_{0} (t)} ⊥ {S, c_{01 ℓ}^{u} (t), c_{1}^{u} (t)} | X$ , and continue to assume condition (ii). Then, for two values $τ_{1}, τ_{2}$ of τ, it is straightforward that (see Web Appendix A of the Supporting Information)

\begin{matrix} \frac{I_{01 ℓ}^{u} (t, τ_{1})}{I_{01 ℓ}^{u} (t, τ_{2})} & = & \frac{R^{b} (τ_{1})}{R^{b} (τ_{2})}, τ_{1}, τ_{2} < ℓ; \\ \frac{I_{1}^{u} (t, τ_{1})}{I_{1}^{u} (t, τ_{2})} & = & \frac{R^{b} (τ_{1})}{R^{b} (τ_{2})}, τ_{1}, τ_{2} \geq ℓ . \end{matrix}

(3)

Defining $I_{01 ℓ}^{u} (t) = I_{01 ℓ}^{u} (t, 0) = E {p (t, S) c_{01 ℓ}^{u} (t) π_{1} (t, 0)}$ and $I_{1}^{u} (t) = I_{1}^{u} (t, ℓ) = E {p (t, S) c_{1}^{u} (t) π_{1} (t, ℓ)}$ , by (3) with $τ_{1} = τ$ and $τ_{2} = 0$ (ℓ) on the left (right) hand side, the infection rates at t if all individuals in the study population were unblinded and to receive vaccine at time $t - τ$ are

\begin{matrix} I_{01 ℓ}^{u} (t, τ) & = & I_{01 ℓ}^{u} (t) \frac{R^{b} (τ)}{R^{b} (0)}, τ < ℓ; \\ I_{1}^{u} (t, τ) & = & I_{1}^{u} (t) \frac{R^{b} (τ)}{R^{b} (ℓ)}, τ \geq ℓ . \end{matrix}

(4)

Likewise, from (2), the infection rate at t if all individuals in the study population were blinded and to receive vaccine at time $t - τ$ is

I_{1}^{b} (t, τ) = I_{0}^{b} (t) R^{b} (τ) .

(5)

We now represent the infection rate ratio $R^{b} (τ)$ as

\begin{aligned}
R^{b} (τ; θ) &= \exp \{ζ (τ)\} I (τ < ℓ) + \exp \{θ_{0} + g (τ - ℓ; θ_{1})\} I (τ \geq ℓ), \\
θ &= (θ_{0}, θ_{1}^{T})^{T},
\end{aligned}

(6)

where $ζ (τ)$ is a function of τ; $θ_{0}$ and $θ_{1}$ are real‐ and vector‐valued parameters, respectively; and $g (u; θ_{1})$ is a real‐valued function of $u$ such that $g (0; θ_{1}) = 0$ for all $θ_{1}$ and $g (u; 0) = 0$ . For example, taking $g (u; θ_{1}) = θ_{1} u$ yields $R^{b} (τ; θ) = \exp \{θ_{0} + θ_{1} (τ - ℓ)\}$ , $τ \geq ℓ$ , in which case $θ_{1} = 0$ implies that $V E (τ) = 1 - R^{b} (τ)$ , $τ \geq ℓ$ , does not change with time since vaccination, and $θ_{1} > 0$ indicates that $V E (τ)$ decreases with increasing τ; that is, exhibits waning. More complex specifications of $g (u; θ_{1})$ using splines (e.g., Fintzi and Follmann, 2021) or piecewise constant functions could be made; for example, for $v_{1} < v_{2} \leq L$ ,

\begin{matrix} g (u; θ_{1}) & = & θ_{11} I (v_{1} < u \leq v_{2}) + θ_{12} I (u > v_{2}), \\ θ_{1} & = & {(θ_{11}, θ_{12})}^{T} . \end{matrix}

(7)

Because interest focuses only on $τ \geq ℓ$ , we leave $ζ (τ)$ unspecified.
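The model (6) for $τ \geq ℓ$ can be sketched in a few lines. The function names, the choice of weeks as the time unit, and the parameter values used in the usage note are assumptions for illustration; only the linear form and the piecewise constant form (7) of $g$ come from the text. Because $ζ (τ)$ is left unspecified, the sketch declines to evaluate the model for $τ < ℓ$.

```python
import math

LAG_WEEKS = 6.0  # the lag ℓ, here measured in weeks


def g_linear(u, theta1):
    """g(u; θ1) = θ1 u, with θ1 passed as a length-1 sequence."""
    return theta1[0] * u


def make_g_piecewise(v1, v2):
    """Return g of (7): θ11 I(v1 < u <= v2) + θ12 I(u > v2)."""
    def g(u, theta1):
        return theta1[0] * (v1 < u <= v2) + theta1[1] * (u > v2)
    return g


def rate_ratio(tau, theta0, theta1, g, lag=LAG_WEEKS):
    """R^b(τ; θ) of (6), restricted to τ >= ℓ."""
    if tau < lag:
        raise ValueError("zeta(tau) is unspecified, so R^b is not evaluated for tau < lag")
    return math.exp(theta0 + g(tau - lag, theta1))


def vaccine_efficacy(tau, theta0, theta1, g, lag=LAG_WEEKS):
    """VE(τ) = 1 - R^b(τ; θ)."""
    return 1.0 - rate_ratio(tau, theta0, theta1, g, lag)
```

For instance, with the hypothetical value $θ_{0} = \log 0.05$ and linear $g$, setting $θ_{1} = 0$ gives a constant VE of 0.95 for all $τ \geq ℓ$, while any $θ_{1} > 0$ makes `vaccine_efficacy` decrease in `tau`.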

Under this model, (5) and (4) can be written as

\begin{aligned}
I_{1}^{b} (t, τ) &= I_{0}^{b} (t) [\exp \{ζ (τ)\} I (τ < ℓ) + \exp \{θ_{0} + g (τ - ℓ; θ_{1})\} I (τ \geq ℓ)], \\
I_{01 ℓ}^{u} (t, τ) &= I_{01 ℓ}^{u} (t) \exp \{ζ (τ)\}, \quad τ < ℓ, \\
I_{1}^{u} (t, τ) &= I_{1}^{u} (t) \exp \{g (τ - ℓ; θ_{1})\}, \quad τ \geq ℓ .
\end{aligned}

(8)

Thus, to estimate $V E (τ)$ for any $τ \geq ℓ$ and make inference on potential waning of VE, we develop a principled approach to estimation of $θ$ based on the data from the modified trial in which participants are unblinded and those on placebo may cross over to study vaccine.
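The step from (4) to the last line of (8) can be checked directly. The short derivation below is a sketch that uses only (4), (6), and the constraint $g (0; θ_{1}) = 0$; it introduces no new quantities.

```latex
% Ratio appearing in the second line of (4), evaluated under model (6)
% for tau >= ell, using g(0; theta_1) = 0.
\begin{aligned}
\frac{R^{b}(\tau; \theta)}{R^{b}(\ell; \theta)}
  &= \frac{\exp\{\theta_{0} + g(\tau - \ell; \theta_{1})\}}
          {\exp\{\theta_{0} + g(0; \theta_{1})\}} \\
  &= \exp\{g(\tau - \ell; \theta_{1})\}, \qquad \tau \geq \ell .
\end{aligned}
```

Because $θ_{0}$ cancels in this ratio, the unblinded vaccine infection rate $I_{1}^{u} (t, τ)$ in (8) involves $θ_{1}$ only, whereas $θ_{0}$ enters solely through the blinded rate $I_{1}^{b} (t, τ)$.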

4. STATISTICAL FRAMEWORK

4.1. Motivation

Estimation of $V E (τ)$ , equivalently $R^{b} (τ)$ , would be straightforward for any $τ \geq ℓ$ over the entire follow‐up period if all participants remained blinded and on their assigned treatments throughout the trial. However, subjects randomized to placebo, when unblinded, have the option to receive the study vaccine on or after $T_{P}$ . For $τ < T_{P}$ , it is possible to estimate $R^{b} (τ)$ because, due to randomization, for $t < T_{P}$ we have representative samples of blinded subjects on vaccine and placebo and thus information on $I_{1}^{b} (t, τ)$ and $I_{0}^{b} (t)$ , so can estimate $θ_{0}$ and components of $θ_{1}$ identified for such τ; for example, in (7) depending on the values of $v_{1}$ and $v_{2}$ . At $T_{P} \leq t < T_{C}$ , the data comprise a mixture of blinded and unblinded participants, where, within the latter group, those on placebo may have opted to receive study vaccine or refuse. Here, information, albeit diminishing during $[T_{P}, T_{C})$ , on $I_{1}^{b} (t, τ)$ and $I_{0}^{b} (t)$ is available from participants not yet unblinded, which contributes to estimation of $θ_{0}$ and components of $θ_{1}$ . Information is also available on $I_{1}^{u} (t, τ)$ from individuals who were originally randomized to vaccine and provide information on longer τ and from individuals who recently crossed over to study vaccine and provide information on shorter τ. For $t \geq T_{C}$ , there are no longer blinded subjects, so that information is available only on $I_{1}^{u} (t, τ)$ . For these latter groups, for longer $τ_{1} \geq ℓ$ and shorter $τ_{2} \geq ℓ$ , $I_{1}^{u} (t, τ_{1}) / I_{1}^{u} (t, τ_{2}) = \exp [g (τ_{1} - ℓ; θ_{1}) - g (τ_{2} - ℓ; θ_{1})]$ , and, because of the mixture of times since vaccination, $θ_{1}$ can be fully estimated.

Through the following potential outcomes formulation and under suitable assumptions, in the next several sections, we develop an approach to estimation of $θ$ based on the observed data (1) that embodies the foregoing intuitive principles.

4.2. Potential outcomes formulation

Denote by $T_{0}^{*} (e, r)$ the potential time to infection on the scale of patient time for an arbitrary individual in the study population if she/he were to enter the trial at calendar time e, receive placebo and be blinded to that fact, and, if not infected by calendar time r, be unblinded and cross over to study vaccine at r. Let $T_{0}^{*} (e) = T_{0}^{*} (e, \infty)$ , if she/he is never crossed over to receive vaccine. Similarly, define $T_{1}^{*} (e, r)$ to be the potential time to infection (patient time scale) for an arbitrary individual if she/he were to enter the trial at e, receive vaccine and be blinded to that fact, and, if not infected by r, be unblinded at r; and define $T_{1}^{*} (e) = T_{1}^{*} (e, \infty)$ . We make the consistency assumptions that $T_{0}^{*} (e, r) = T_{0}^{*} (e)$ if $T_{0}^{*} (e) < r$ and $T_{1}^{*} (e, r) = T_{1}^{*} (e)$ if $T_{1}^{*} (e) < r$ . For $a = 0, 1$ , denote the hazard at calendar time t, $t > e$ , by

λ_{a} (t, e, r) = \lim_{d t \to 0} d t^{- 1} pr \{t \leq T_{a}^{*} (e, r) + e < t + d t | T_{a}^{*} (e, r) + e \geq t\}, \quad a = 0, 1,

(9)

where the addition of e induces a shift from patient to calendar time. Denote the set of all potential outcomes as $W^{*} = {T_{0}^{*} (e, r), T_{1}^{*} (e, r); e > 0, r > e}$ .

The development in Section 3 is in terms of infection rates at the individual‐specific and population levels. Population‐level hazard rates such as (9) are not equivalent to population‐level infection rates. However, we argue in Web Appendix C of the Supporting Information that, because the probabilities of infection under vaccine and placebo during the course of the trial are small, population‐level hazard rates and population‐level infection rates are approximately equivalent; this assumption is implicit in the standard primary analysis noted in Section 1. Thus, to reflect this, we use familiar notation and write $λ^{b} (t) = I_{0}^{b} (t)$ , $λ_{ℓ}^{u} (t) = I_{01 ℓ}^{u} (t)$ , and $λ^{u} (t) = I_{1}^{u} (t)$ . Under these conditions, using (8), we can write for $t > e$

\begin{aligned}
λ_{0} (t, e, r) = {} & λ^{b} (t) I (t < r) + λ_{ℓ}^{u} (t) \exp \{ζ (t - r)\} I (0 \leq t - r < ℓ) \\
& + λ^{u} (t) \exp \{g (t - r - ℓ; θ_{1})\} I (t - r \geq ℓ),
\end{aligned}

(10)

\begin{aligned}
λ_{1} (t, e, r) = {} & λ^{b} (t) [\exp \{ζ (t - e)\} I (t - e < ℓ) + \exp \{θ_{0} + g (t - e - ℓ; θ_{1})\} I (t - e \geq ℓ)] I (t < r) \\
& + λ^{u} (t) \exp \{g (t - e - ℓ; θ_{1})\} I (t \geq r),
\end{aligned}

(11)

where (11) follows because $r \geq T_{P}$ , $e \leq T_{A}$ , $T_{P} - T_{A} > ℓ$ . Define the counting processes for infection by $N_{a}^{*} (t, e, r) = I {T_{a}^{*} (e, r) + e \leq t}$ and $N_{a}^{*} (t, e) = N_{a}^{*} (t, e, \infty)$ , and the at‐risk processes by $Y_{a}^{*} (t, e, r) = I {T_{a}^{*} (e, r) + e \geq t}$ and $Y_{a}^{*} (t, e) = Y_{a}^{*} (t, e, \infty)$ , $a = 0, 1$ (Fleming and Harrington, 2005). From the above consistency assumptions, if $t < r$ , then $N_{a}^{*} (t, e, r) = N_{a}^{*} (t, e)$ , $Y_{a}^{*} (t, e, r) = Y_{a}^{*} (t, e)$ , $a = 0, 1$ . For $a = 0, 1$ , let $Λ_{a} (t, e, r) = \int_{0}^{t} λ_{a} (u, e, r) d u$ be the cumulative hazard. Because $E {d N_{a}^{*} (t, e, r) | Y_{a}^{*} (t, e, r)} = d Λ_{a} (t, e, r) Y_{a}^{*} (t, e, r)$ , $a = 0, 1$ , it follows that ${d N_{a}^{*} (t, e, r) - d Λ_{a} (t, e, r) Y_{a}^{*} (t, e, r)}$ , $a = 0, 1$ , are mean‐zero counting process increments. Thus, any linear combination of these increments over $t, e, r$ can be used to define unbiased estimating functions in $W^{*}$ of quantities of interest. In Web Appendix D of the Supporting Information, we formulate a particular set of estimating functions that, based on iid potential outcomes $W_{i}^{*}$ , $i = 1, \dots, n$ , lead to consistent and asymptotically normal estimators for ${Λ^{b} (t), Λ^{u} (t), θ^{T}}^{T}$ , $Λ^{k} (t) = \int_{0}^{t} λ^{k} (u) d u$ , $k = b, u$ . Because interest focuses on $V E (τ)$ for $τ \geq ℓ$ , estimation of $Λ_{ℓ}^{u} (t) = \int_{0}^{t} λ_{ℓ}^{u} (u) d u$ and $ζ (\cdot)$ is not considered and is reflected in the specification of the linear combinations; see Web Appendix D.
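The piecewise structure of the hazards (10) and (11) can be made explicit with a short sketch. Everything supplied by the caller here is hypothetical: the baseline hazards $λ^{b}$, $λ_{ℓ}^{u}$, $λ^{u}$ are passed as functions of calendar time, $ζ$ is an arbitrary user‐chosen function (the paper leaves it unspecified), and time is measured in days so that the lag is 42. The check in `hazard_vaccine` reflects the condition $T_{P} - T_{A} > ℓ$, under which a vaccine recipient is always past the lag when unblinded.

```python
import math


def g_linear(u, theta1):
    """Illustrative choice g(u; θ1) = θ1 u."""
    return theta1[0] * u


def const(value):
    """Constant baseline hazard, used only to build toy inputs."""
    return lambda t: value


def hazard_placebo(t, e, r, lam_b, lam_u_lag, lam_u, zeta, g, theta1, lag):
    """λ_0(t, e, r) of (10): blinded placebo, then crossover to vaccine at r."""
    if t <= e:
        raise ValueError("hazard is defined for t > e")
    if t < r:                      # still blinded on placebo
        return lam_b(t)
    since_crossover = t - r
    if since_crossover < lag:      # vaccinated, within the lag
        return lam_u_lag(t) * math.exp(zeta(since_crossover))
    return lam_u(t) * math.exp(g(since_crossover - lag, theta1))


def hazard_vaccine(t, e, r, lam_b, lam_u, zeta, g, theta0, theta1, lag):
    """λ_1(t, e, r) of (11): blinded vaccine, then unblinded at r."""
    if t <= e:
        raise ValueError("hazard is defined for t > e")
    since_dose = t - e
    if t < r:                      # blinded phase
        if since_dose < lag:
            return lam_b(t) * math.exp(zeta(since_dose))
        return lam_b(t) * math.exp(theta0 + g(since_dose - lag, theta1))
    if since_dose < lag:           # excluded by T_P - T_A > lag
        raise ValueError("unblinding before the lag is outside the scope of (11)")
    return lam_u(t) * math.exp(g(since_dose - lag, theta1))
```

Note that $θ_{0}$ appears only in the blinded branch of `hazard_vaccine`, matching the cancellation built into (8).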

For fixed t, $0 \leq t \leq L$ , the estimating functions for $Λ^{b} (t)$ and $Λ^{u} (t)$ are, respectively,

\begin{aligned}
E_{Λ^{b}}^{*} \{W^{*}; Λ^{b} (t), θ\} = I (t < T_{C}) \Big( & \int_{0}^{\min (t, T_{A})} \{d N_{0}^{*} (t, e) - d Λ^{b} (t) Y_{0}^{*} (t, e)\} {\tilde{w}}_{0} (t, e) \, d e \\
& + I (t \geq ℓ) \int_{0}^{\min (t - ℓ, T_{A})} [d N_{1}^{*} (t, e) - d Λ^{b} (t) \exp \{θ_{0} + g (t - e - ℓ; θ_{1}) I (t - e \geq ℓ)\} Y_{1}^{*} (t, e)] {\tilde{w}}_{1} (t, e) \, d e \Big),
\end{aligned}

(12)

\begin{aligned}
E_{Λ^{u}}^{*} \{W^{*}; Λ^{u} (t), θ\} = {} & I (t \geq T_{P} + ℓ) \Big( \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t - ℓ, T_{C})} [d N_{0}^{*} (t, e, r) - d Λ^{u} (t) \exp \{g (t - r - ℓ; θ_{1}) I (t - r \geq ℓ)\} Y_{0}^{*} (t, e, r)] w_{0} (t, e, r) \, d r \, d e \Big) \\
& + I (t \geq T_{P}) \Big( \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t, T_{C})} [d N_{1}^{*} (t, e, r) - d Λ^{u} (t) \exp \{g (t - e - ℓ; θ_{1})\} Y_{1}^{*} (t, e, r)] w_{1} (t, e, r) I (t \geq r) \, d r \, d e \Big),
\end{aligned}

(13)

where ${\tilde{w}}_{a} (t, e)$ and $w_{a} (t, e, r)$ , $a = 0, 1$ , are arbitrary nonnegative weight functions, specification of which is discussed later. The estimating function for $θ$ is given by

\begin{aligned}
E_{θ}^{*} \{W^{*}; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\} = {} & \int_{ℓ}^{T_{C}} \int_{0}^{\min (t - ℓ, T_{A})} \begin{pmatrix} 1 \\ g_{θ} (t - e - ℓ) \end{pmatrix} [d N_{1}^{*} (t, e) - d Λ^{b} (t) \exp \{θ_{0} + g (t - e - ℓ; θ_{1}) I (t - e \geq ℓ)\} Y_{1}^{*} (t, e)] {\tilde{w}}_{1} (t, e) \, d e \\
& + \int_{T_{P} + ℓ}^{L} \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t - ℓ, T_{C})} \begin{pmatrix} 0 \\ g_{θ} (t - r - ℓ) \end{pmatrix} [d N_{0}^{*} (t, e, r) - d Λ^{u} (t) \exp \{g (t - r - ℓ; θ_{1}) I (t - r \geq ℓ)\} Y_{0}^{*} (t, e, r)] w_{0} (t, e, r) \, d r \, d e \\
& + \int_{T_{P}}^{L} \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t, T_{C})} \begin{pmatrix} 0 \\ g_{θ} (t - e - ℓ) \end{pmatrix} [d N_{1}^{*} (t, e, r) - d Λ^{u} (t) \exp \{g (t - e - ℓ; θ_{1})\} Y_{1}^{*} (t, e, r)] w_{1} (t, e, r) I (t \geq r) \, d r \, d e,
\end{aligned}

(14)

where $g_{θ} (u) = \partial / \partial θ_{1} \{g (u; θ_{1})\}$ . Analogous to Yang et al. (2018), envisioning (12)–(14) as characterizing a system of estimating functions $E^{*} \{W^{*}; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\} = [E_{Λ^{b}}^{*} \{W^{*}; Λ^{b} (t), θ\}, E_{Λ^{u}}^{*} \{W^{*}; Λ^{u} (t), θ\}, 0 \leq t \leq L, E_{θ}^{*} \{W^{*}; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\}^{T}]^{T}$ , if we could observe $W_{i}^{*}$ , $i = 1, \dots, n$ , we would estimate $d Λ^{b} (\cdot), d Λ^{u} (\cdot), θ$ by solving the estimating equations $\sum_{i = 1}^{n} E^{*} \{W_{i}^{*}; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\} = 0$ .

4.3. Identifiability assumptions

Of course, the potential outcomes $W_{i}^{*}$ , $i = 1, \dots, n$ , are not observed. However, we now present assumptions under which we can exploit the developments in the last section to derive estimating equations yielding estimators based on the observed data (1).

Define the indicator that a participant is observed to be infected at time t by $d N (t) = I (U = t, Δ = 1)$ , the observed at‐risk indicator at t by $Y (t) = I (E < t \leq U)$ , and

\begin{aligned}
I_{0} (t, e) &= (1 - A) I (E = e) I (R \geq t), \\
I_{1} (t, e) &= A I (E = e) I (R \geq t), \\
I_{01} (t, e, r) &= (1 - A) I (E = e) \{I (R = r, Γ = 1, Ψ = 1) + I (R = r, Γ = 2, Ψ = 1)\}, \\
I_{11} (t, e, r) &= A I (E = e) \{I (R = r, Γ = 1) + I (R = r, Γ = 2)\} .
\end{aligned}

(15)

$I_{a} (t, e) = 1$ indicates that a subject entering the trial at time e and randomized to placebo ( $a = 0$ ) or vaccine ( $a = 1$ ) has not yet been infected or unblinded by t. For $t > r$ , $I_{01} (t, e, r) = 1$ indicates that a subject randomized to placebo at time e is unblinded (either by request or at a PDCV) at time r and crosses over to study vaccine at r, and $I_{11} (t, e, r) = 1$ if a subject randomized to vaccine at time e is unblinded at r. Make the consistency assumptions

\begin{matrix} I_{a} (t, e) d N (t) & = & I_{a} (t, e) d N_{a}^{*} (t, e), \\ I_{a} (t, e) Y (t) & = & I_{a} (t, e) Y_{a}^{*} (t, e), \\ a & = & 0, 1, \\ I_{01} (t, e, r) d N (t) & = & I_{01} (t, e, r) d N_{0}^{*} (t, e, r), \\ I_{01} (t, e, r) Y (t) & = & I_{01} (t, e, r) Y_{0}^{*} (t, e, r), \\ I_{11} (t, e, r) d N (t) & = & I_{11} (t, e, r) d N_{1}^{*} (t, e, r), \\ I_{11} (t, e, r) Y (t) & = & I_{11} (t, e, r) Y_{1}^{*} (t, e, r) . \end{matrix}

(16)

We now make assumptions similar in spirit to those adopted in observational studies. By randomization,

A ⊥ (X, E, W^{*}),

(17)

where we subsume the site indicator S in $X$ , and let $p_{A} = pr (A = 1)$ . It is realistic to assume that the mix of baseline covariates changes over the accrual period; for example, during the trial, because of lagging accrual of elderly subjects and subjects from underrepresented groups, an effort was made to increase participation of these groups in the latter part of the accrual period. Accordingly, we allow the distribution of entry time E to depend on $X$ , and denote its conditional density as $f_{E | X} (e | x)$ . We make the no unmeasured confounders assumption

E ⊥ W^{*} | X .

(18)

Define the hazard functions of unblinding in the periods between the Pfizer EUA and the start of PDCVs and after the start of PDCVs, respectively, as

\begin{aligned}
λ_{R, 1} (r | X, A, E, W^{*}) &= \lim_{d r \to 0} pr (r \leq R < r + d r, Γ = 1 | R \geq r, X, A, E, W^{*}), \quad T_{P} \leq r < T_{U}, \\
λ_{R, 2} (r | X, A, E, W^{*}) &= \lim_{d r \to 0} pr (r \leq R < r + d r, Γ = 2 | R \geq r, X, A, E, W^{*}), \quad T_{U} \leq r < T_{C},
\end{aligned}

where $λ_{R, j} (r | X, A, E, W^{*}) = 0$ for $r \geq T_{U}$ ( $j = 1$ ) and $r \geq T_{C}$ ( $j = 2$ ). Because the accrual period was short relative to the length of follow‐up, we take these unblinding hazard functions to not depend on E, although including such dependence is straightforward; and, similar to a noninformative censoring assumption, to not depend on $W^{*}$ and write

λ_{R, j} (r | X, A, E, W^{*}) = λ_{R, j} (r | X, A), j = 1, 2 .

(19)

Define $K_{R} (r | X, A) = \exp [- \{Λ_{R, 1} (r | X, A) + Λ_{R, 2} (r | X, A)\}]$, where $Λ_{R, j} (r | X, A) = \int_{T_{j}}^{r} λ_{R, j} (u | X, A) d u$, with $T_{j} = T_{P}$ ($j = 1$) or $T_{j} = T_{U}$ ($j = 2$). Because $λ_{R, 1} (r | X, A)$ and $λ_{R, 2} (r | X, A)$ are defined on the nonoverlapping intervals $[T_{P}, T_{U})$ and $[T_{U}, T_{C})$, respectively, with $K_{R, j} (r | X, A) = \exp \{- Λ_{R, j} (r | X, A)\}$, $j = 1, 2$,

K_{R} (r | X, A) = \begin{cases} 1, & r < T_{P}, \\ K_{R, 1} (r | X, A), & T_{P} \leq r < T_{U}, \\ K_{R, 1} (T_{U} | X, A) K_{R, 2} (r | X, A), & T_{U} \leq r < T_{C}, \\ 0, & r \geq T_{C} . \end{cases}

Finally, define $f_{R, j} (r | X, A) = K_{R} (r | X, A) λ_{R, j} (r | X, A)$ , $j = 1, 2$ .
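To make the piecewise structure of $K_{R}$ and $f_{R, j}$ concrete, the following minimal sketch evaluates them under the simplifying assumption of constant unblinding hazards on each interval. The function names `k_r` and `f_r`, the constant hazards `lam1` and `lam2`, and the default values of $T_{P}$, $T_{U}$, $T_{C}$ (taken from the simulation settings in Section 6) are illustrative assumptions, not part of the methodology, which leaves the hazards general and allows dependence on $X$ and A.

```python
import math


def k_r(r, lam1, lam2, t_p=19.0, t_u=21.0, t_c=31.0):
    """Probability of still being blinded at calendar time r (weeks).

    Assumes, purely for illustration, a constant hazard lam1 on [t_p, t_u)
    and a constant hazard lam2 on [t_u, t_c).
    """
    if r < t_p:
        return 1.0  # no unblinding before t_p
    if r < t_u:
        return math.exp(-lam1 * (r - t_p))
    if r < t_c:
        # K_{R,1}(T_U) * K_{R,2}(r)
        return math.exp(-lam1 * (t_u - t_p)) * math.exp(-lam2 * (r - t_u))
    return 0.0  # all participants are unblinded by t_c


def f_r(r, j, lam1, lam2, t_p=19.0, t_u=21.0, t_c=31.0):
    """f_{R,j}(r) = K_R(r) * lambda_{R,j}(r), for j = 1 or 2."""
    if j == 1:
        lam = lam1 if t_p <= r < t_u else 0.0
    else:
        lam = lam2 if t_u <= r < t_c else 0.0
    return k_r(r, lam1, lam2, t_p, t_u, t_c) * lam
```

With `lam1 = 0.036`, `1 - k_r(21.0, 0.036, lam2)` is about 0.07 for any `lam2`, which matches the roughly 7% requested unblinding during $[T_{P}, T_{U})$ used in Section 6.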

Let $pr (Ψ = 1 | X, E, Γ, R, W^{*})$ be the probability that a placebo participant unblinded at R agrees to receive the Moderna vaccine. Similar to (19), we assume that this probability does not depend on $E, W^{*}$ ; moreover, because the unblinding interval $[T_{P}, T_{C})$ is short relative to the length of follow‐up, we assume that it does not depend on R but does depend on the unblinding dynamics at R. Thus, write

\begin{matrix} pr (Ψ = 1 | X, E, Γ, R, W^{*}) = pr (Ψ = 1 | X, Γ) = p_{Ψ} (X, Γ) . \end{matrix}

(20)

4.4. Observed data estimating equations

We now outline, under the assumptions (16)–(20), which we take to hold henceforth, how we can develop unbiased estimating equations based on the observed data yielding consistent and asymptotically normal estimators for $d Λ^{b} (\cdot), d Λ^{u} (\cdot), θ$ . The basic premise is to use inverse probability weighting (IPW) to probabilistically represent potential outcomes in terms of the observed data to mimic the estimating functions (12)–(14).

Considering (15), define the inverse probability weights

\begin{aligned}
h_{0} (t, e | X) &= (1 - p_{A}) f_{E | X} (e | X) K_{R} (t | X, A = 0), \\
h_{1} (t, e | X) &= p_{A} f_{E | X} (e | X) K_{R} (t | X, A = 1), \\
h_{01} (e, r | X) &= (1 - p_{A}) f_{E | X} (e | X) \{f_{R, 1} (r | X, A = 0) p_{Ψ} (X, Γ = 1) + f_{R, 2} (r | X, A = 0) p_{Ψ} (X, Γ = 2)\}, \\
h_{11} (e, r | X) &= p_{A} f_{E | X} (e | X) \{f_{R, 1} (r | X, A = 1) + f_{R, 2} (r | X, A = 1)\} .
\end{aligned}

We show in Web Appendix E of the Supporting Information that

\begin{matrix} E \{\frac{I_{0} (t, e) d N (t)}{h_{0} (t, e | X)} | X, W^{*}\} = d N_{0}^{*} (t, e), \\ E \{\frac{I_{0} (t, e) Y (t)}{h_{0} (t, e | X)} | X, W^{*}\} = Y_{0}^{*} (t, e), \end{matrix}

(21)

\begin{matrix} E \{\frac{I_{1} (t, e) d N (t)}{h_{1} (t, e | X)} | X, W^{*}\} = d N_{1}^{*} (t, e), \\ E \{\frac{I_{1} (t, e) Y (t)}{h_{1} (t, e | X)} | X, W^{*}\} = Y_{1}^{*} (t, e), \end{matrix}

(22)

\begin{matrix} E \{\frac{I_{01} (t, e, r) d N (t)}{h_{01} (e, r | X)} | X, W^{*}\} = d N_{0}^{*} (t, e, r), \\ E \{\frac{I_{01} (t, e, r) Y (t)}{h_{01} (e, r | X)} | X, W^{*}\} = Y_{0}^{*} (t, e, r), \end{matrix}

(23)

\begin{matrix} E \{\frac{I_{11} (t, e, r) d N (t)}{h_{11} (e, r | X)} | X, W^{*}\} = d N_{1}^{*} (t, e, r), \\ E \{\frac{I_{11} (t, e, r) Y (t)}{h_{11} (e, r | X)} | X, W^{*}\} = Y_{1}^{*} (t, e, r) . \end{matrix}

(24)

To obtain observed data analogs to the estimating functions (12)–(14), based on the equalities in (21)–(24), we substitute the IPW expressions in the conditional expectations on the left‐hand sides. Using (15) and (21)–(22), the analog to (12) is given by

\begin{aligned}
E_{Λ^{b}} \{O; Λ^{b} (t), θ\}
&= I (t < T_{C}) \Big( \int_{0}^{\min (t, T_{A})} \frac{I_{0} (t, e)}{h_{0} (t, e | X)} \{d N (t) - d Λ^{b} (t) Y (t)\} {\tilde{w}}_{0} (t, e) d e \\
&\qquad + I (t \geq ℓ) \int_{0}^{\min (t - ℓ, T_{A})} \frac{I_{1} (t, e)}{h_{1} (t, e | X)} [d N (t) - d Λ^{b} (t) \exp \{θ_{0} + g (t - e - ℓ; θ_{1}) I (t - e \geq ℓ)\} Y (t)] {\tilde{w}}_{1} (t, e) d e \Big) \\
&= I (t < T_{C}) \Big( \frac{(1 - A) I (R \geq t)}{h_{0} (t, E | X)} \{d N (t) - d Λ^{b} (t) Y (t)\} {\tilde{w}}_{0} (t, E) \\
&\qquad + \frac{A I (E + ℓ \leq t \leq R)}{h_{1} (t, E | X)} [d N (t) - d Λ^{b} (t) \exp \{θ_{0} + g (t - E - ℓ; θ_{1})\} Y (t)] {\tilde{w}}_{1} (t, E) \Big) .
\end{aligned}

(25)

Likewise, using (23)–(24), the analog to (13) is

\begin{aligned}
E_{Λ^{u}} \{O; Λ^{u} (t), θ\}
&= I (t \geq T_{P} + ℓ) \Big( \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t - ℓ, T_{C})} \frac{I_{01} (t, e, r)}{h_{01} (e, r | X)} [d N (t) - d Λ^{u} (t) \exp \{g (t - r - ℓ; θ_{1}) I (t - r \geq ℓ)\} Y (t)] w_{0} (t, e, r) d r d e \Big) \\
&\quad + I (t \geq T_{P}) \Big( \int_{0}^{T_{A}} \int_{T_{P}}^{\min (t, T_{C})} \frac{I_{11} (t, e, r)}{h_{11} (e, r | X)} [d N (t) - d Λ^{u} (t) \exp \{g (t - e - ℓ; θ_{1})\} Y (t)] w_{1} (t, e, r) I (t \geq r) d r d e \Big) \\
&= I (t \geq T_{P} + ℓ) \Big( \frac{(1 - A) I (t - R \geq ℓ) \{I (Γ = 1, Ψ = 1) + I (Γ = 2, Ψ = 1)\}}{h_{01} (E, R | X)} [d N (t) - d Λ^{u} (t) \exp \{g (t - R - ℓ; θ_{1})\} Y (t)] w_{0} (t, E, R) \Big) \\
&\quad + I (t \geq T_{P}) \Big( \frac{A I (t > R) \{I (Γ = 1) + I (Γ = 2)\}}{h_{11} (E, R | X)} [d N (t) - d Λ^{u} (t) \exp \{g (t - E - ℓ; θ_{1})\} Y (t)] w_{1} (t, E, R) \Big) .
\end{aligned}

(26)

An entirely similar representation $E_{θ} \{O; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\}$ of (14) in terms of the observed data can be deduced and is suppressed for brevity.

To simplify notation, based on (25), (26), and the analogous expression for (14), define

\begin{aligned}
d {\tilde{N}}^{b} (t) &= d N (t) \Big\{ \frac{(1 - A) I (R \geq t) {\tilde{w}}_{0} (t, E)}{h_{0} (t, E | X)} + \frac{A I (E + ℓ \leq t \leq R) {\tilde{w}}_{1} (t, E)}{h_{1} (t, E | X)} \Big\}, \\
{\tilde{Y}}^{b} (t) &= Y (t) \Big[ \frac{(1 - A) I (R \geq t) {\tilde{w}}_{0} (t, E)}{h_{0} (t, E | X)} + \frac{A I (E + ℓ \leq t \leq R) {\tilde{w}}_{1} (t, E)}{h_{1} (t, E | X)} \exp \{θ_{0} + g (t - E - ℓ; θ_{1})\} \Big], \\
d {\tilde{N}}^{u} (t) &= d N (t) \Big[ \frac{(1 - A) I (t - R \geq ℓ) \{I (Γ = 1, Ψ = 1) + I (Γ = 2, Ψ = 1)\} w_{0} (t, E, R)}{h_{01} (E, R | X)} \\
&\qquad + \frac{A I (t > R) \{I (Γ = 1) + I (Γ = 2)\} w_{1} (t, E, R)}{h_{11} (E, R | X)} \Big], \\
{\tilde{Y}}^{u} (t) &= Y (t) \Big[ \frac{(1 - A) I (t - R \geq ℓ) \{I (Γ = 1, Ψ = 1) + I (Γ = 2, Ψ = 1)\} w_{0} (t, E, R)}{h_{01} (E, R | X)} \exp \{g (t - R - ℓ; θ_{1})\} \\
&\qquad + \frac{A I (t > R) \{I (Γ = 1) + I (Γ = 2)\} w_{1} (t, E, R)}{h_{11} (E, R | X)} \exp \{g (t - E - ℓ; θ_{1})\} \Big] .
\end{aligned}

It is important to recognize that these expressions are equal to zero for an unblinded placebo participant who is at risk at time t, $Y (t) = 1$ , and who has refused study vaccine, $Ψ = 0$ , which is equivalent to censoring such a subject, as she/he cannot provide information on study vaccine after t. These expressions are also equal to zero at time t for an at‐risk unblinded placebo participant who receives study vaccine but has been vaccinated for less than ℓ weeks at t and for an at‐risk blinded vaccine participant vaccinated for less than ℓ weeks at t, reflecting the fact that such individuals do not contribute information on VE for $τ \geq ℓ$ until times t at which they have reached full efficacy, in which case the expressions are ≥0. Moreover, by excluding the at‐risk unblinded placebo participants vaccinated for less than ℓ weeks at t, the behavior reflected by $c_{01 ℓ}^{u} (t)$ does not play a role.

Define also

\begin{matrix} Z^{b} (t) = A (\begin{matrix} 1 \\ g_{θ} (t - E - ℓ) \end{matrix}), \\ Z^{u} (t) = A (\begin{matrix} 0 \\ g_{θ} (t - E - ℓ) \end{matrix}) + (1 - A) (\begin{matrix} 0 \\ g_{θ} (t - R - ℓ) \end{matrix}) . \end{matrix}

Then, it is straightforward that the observed‐data estimating functions are

\begin{aligned}
E_{Λ^{b}} \{O; Λ^{b} (t), θ\} &= d {\tilde{N}}^{b} (t) - d Λ^{b} (t) {\tilde{Y}}^{b} (t), \\
E_{Λ^{u}} \{O; Λ^{u} (t), θ\} &= d {\tilde{N}}^{u} (t) - d Λ^{u} (t) {\tilde{Y}}^{u} (t), \\
E_{θ} \{O; Λ^{b} (\cdot), Λ^{u} (\cdot), θ\} &= \int_{0}^{T_{C}} Z^{b} (t) \{d {\tilde{N}}^{b} (t) - d Λ^{b} (t) {\tilde{Y}}^{b} (t)\} + \int_{T_{P}}^{L} Z^{u} (t) \{d {\tilde{N}}^{u} (t) - d Λ^{u} (t) {\tilde{Y}}^{u} (t)\} .
\end{aligned}

Letting ${\tilde{N}}_{i}^{b} (t)$ , ${\tilde{N}}_{i}^{u} (t)$ , ${\tilde{Y}}_{i}^{b} (t)$ , ${\tilde{Y}}_{i}^{u} (t)$ , $Z_{i}^{b} (t)$ , and $Z_{i}^{u} (t)$ denote evaluation at $O_{i}$ in (1), the foregoing developments lead to the set of observed‐data estimating equations

\begin{aligned}
\sum_{i = 1}^{n} \{d {\tilde{N}}_{i}^{b} (t) - d Λ^{b} (t) {\tilde{Y}}_{i}^{b} (t)\} &= 0, \\
\sum_{i = 1}^{n} \{d {\tilde{N}}_{i}^{u} (t) - d Λ^{u} (t) {\tilde{Y}}_{i}^{u} (t)\} &= 0,
\end{aligned}

(27)

\begin{matrix} \sum_{i = 1}^{n} [\int_{0}^{T_{C}} Z_{i}^{b} (t) {d {\tilde{N}}_{i}^{b} (t) - d Λ^{b} (t) {\tilde{Y}}_{i}^{b} (t)} \\ + \int_{T_{P}}^{L} Z_{i}^{u} (t) {d {\tilde{N}}_{i}^{u} (t) - d Λ^{u} (t) {\tilde{Y}}_{i}^{u} (t)}] = 0 . \end{matrix}

(28)

For fixed $θ$ , the estimators for $d Λ^{b} (t)$ and $d Λ^{u} (t)$ are the solutions to the equations in (27) given by

\begin{matrix} d {\hat{Λ}}^{b} (t) & = & {\{\sum_{i = 1}^{n} {\tilde{Y}}_{i}^{b} (t)\}}^{- 1} \sum_{i = 1}^{n} d {\tilde{N}}_{i}^{b} (t), \\ d {\hat{Λ}}^{u} (t) & = & {\{\sum_{i = 1}^{n} {\tilde{Y}}_{i}^{u} (t)\}}^{- 1} \sum_{i = 1}^{n} d {\tilde{N}}_{i}^{u} (t) . \end{matrix}

(29)
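The closed form in (29) is a ratio of weighted sums at each time t. A minimal sketch, assuming the per-participant quantities $d {\tilde{N}}_{i} (t)$ and ${\tilde{Y}}_{i} (t)$ have already been computed at a single time t and are passed in as plain lists (the function name and list-based interface are hypothetical):

```python
def hazard_increment(d_n_tilde, y_tilde):
    """Return sum_i dN~_i(t) / sum_i Y~_i(t) for one time point t.

    d_n_tilde: weighted event indicators at t, one entry per participant.
    y_tilde:   weighted at-risk contributions at t, one entry per participant.
    Returns 0.0 when nobody contributes at t, so the ratio is never 0/0.
    """
    denom = sum(y_tilde)
    if denom == 0:
        return 0.0
    return sum(d_n_tilde) / denom


# Four participants contributing at t with unit weights, one infected at t.
increment = hazard_increment([0, 1, 0, 0], [1, 1, 1, 1])
```

When every weight equals 1 and no vaccine-effect multiplier enters ${\tilde{Y}}_{i} (t)$, the ratio is the number of events at t divided by the number at risk, the familiar Nelson–Aalen-type increment; the weighted version generalizes this by reweighting each contribution.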

Substituting these expressions in (28) yields the estimating equation in $θ$ given by

\begin{matrix} \sum_{i = 1}^{n} [\int_{0}^{T_{C}} {Z_{i}^{b} (t) - {\bar{Z}}^{b} (t)} d {\tilde{N}}_{i}^{b} (t) \\ + \int_{T_{P}}^{L} {Z_{i}^{u} (t) - {\bar{Z}}^{u} (t)} d {\tilde{N}}_{i}^{u} (t)] = 0, \end{matrix}

(30)

\begin{matrix} {\bar{Z}}^{b} (t) & = & {\{\sum_{i = 1}^{n} {\tilde{Y}}_{i}^{b} (t)\}}^{- 1} \sum_{i = 1}^{n} Z_{i}^{b} (t) {\tilde{Y}}_{i}^{b} (t), \\ {\bar{Z}}^{u} (t) & = & {\{\sum_{i = 1}^{n} {\tilde{Y}}_{i}^{u} (t)\}}^{- 1} \sum_{i = 1}^{n} Z_{i}^{u} (t) {\tilde{Y}}_{i}^{u} (t), \end{matrix}

which can be solved in $θ$ to yield the estimator $\hat{θ}$ .

5. PRACTICAL IMPLEMENTATION AND INFERENCE

Choice of the weight functions ${\tilde{w}}_{0} (t, e)$ , ${\tilde{w}}_{1} (t, e)$ , $w_{0} (t, e, r)$ , and $w_{1} (t, e, r)$ is arbitrary but can play an important role in the performance of the resulting estimators. We recommend taking a fixed value $\tilde{x}$ of $X$ , for example, the sample mean, and setting ${\tilde{w}}_{a} (t, e) = h_{a} (t, e | \tilde{x})$ and $w_{a} (t, e, r) = h_{a 1} (e, r | \tilde{x})$ , $a = 0, 1$ , where the latter does not depend on t. The resulting weights $h_{a} (t, e | \tilde{x}) / h_{a} (t, e | X)$ and $h_{a 1} (e, r | \tilde{x}) / h_{a 1} (e, r | X)$ , $a = 0, 1$ , are referred to as stabilized weights (Robins et al., 2000), as they mitigate the effect of small inverse probability weights that can give undue influence to a few observations. Note that dependence of the inverse probability weights on $p_{A}$ cancels in construction of stabilized weights. Moreover, if there is no confounding, in that $λ_{R, j} (r | X, A)$ , $j = 1, 2$ in (19), $f_{E | X} (e | X)$ , and $p_{Ψ} (X, Γ)$ do not depend on $X$ , the stabilized weights are equal to 1. Interpretation of the stabilized weights is discussed further in Web Appendix F.

If the “survival probabilities” for R, $K_{R, j} (r | X, A)$ , and the densities $f_{R, j} (r | X, A)$ , $j = 1, 2$ , and $f_{E | X} (e | X)$ in the inverse probability weights, which appear in the expressions in the estimating equation (30), were known, (30) could be solved to yield an estimator for $θ$ and in particular $θ_{1}$ characterizing VE waning. As these quantities are unknown, models must be posited for them, leading to estimators that can be substituted in (30). We propose the use of Cox proportional hazards models for $λ_{R, j} (r | X, A)$ , $j = 1, 2$ , in (19), which can be fitted using the data $\{X_{i}, A_{i}, R_{i}, I (Γ_{i} = j)\}$ , $i = 1, \dots, n$ ; and for the hazard of entry time E given $X$ , which can be fitted using $(E_{i}, X_{i})$ , $i = 1, \dots, n$ . A binary, for example, logistic, regression model can be used to represent $p_{Ψ} (X, Γ)$ and fitted using $(X_{i}, Γ_{i}, Ψ_{i})$ for i such that $A_{i} = 0$ .

For individual i, the stabilized weights involve the quantities $f_{R, j} (R_{i} | \tilde{x}, a) / f_{R, j} (R_{i} | X_{i}, a)$ , $j = 1, 2$ , $a = 0, 1$ , and $f_{E | X} (E_{i} | \tilde{x}) / f_{E | X} (E_{i} | X_{i})$ . With proportional hazards models as above with predictors $ϕ_{j} (X, β_{j})$ , say, it is straightforward that $f_{R, j} (R_{i} | \tilde{x}, a) / f_{R, j} (R_{i} | X_{i}, a) = [\exp {ϕ_{j} (\tilde{x}, β_{j})} K_{R} (R_{i} | \tilde{x}, a)] / [\exp {ϕ_{j} (X_{i}, β_{j})} K_{R} (R_{i} | X_{i}, a)]$ , where the baseline hazard cancels from numerator and denominator, and similarly for $f_{E | X} (E_{i} | \tilde{x}) / f_{E | X} (E_{i} | X_{i})$ . Thus, the estimated stabilized weights involve only the estimated cumulative hazard functions and estimators for the $β_{j}$ , each of which is root‐n consistent and asymptotically normal.
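The cancellation of the baseline hazard in the density ratio can be illustrated for the simplest case, a single proportional hazards model such as the one for entry time E. The sketch below assumes the survival function has the usual proportional hazards form $\exp \{- Λ_{0} (t) e^{ϕ}\}$, where the cumulative baseline hazard `cum_base_hazard` and the linear predictors `lp_ref` (at $\tilde{x}$) and `lp_i` (at $X_{i}$) are supplied by the user; all names are hypothetical, and the two-period structure of $K_{R}$ is deliberately not reproduced here.

```python
import math


def stabilized_density_ratio(cum_base_hazard, lp_ref, lp_i):
    """Ratio f(t | x_ref) / f(t | x_i) under one proportional hazards model.

    The density is lambda_0(t) * exp(lp) * exp(-Lambda_0(t) * exp(lp)), so the
    baseline hazard lambda_0(t) cancels and only the cumulative baseline
    hazard Lambda_0(t) and the two linear predictors are needed.
    """
    surv_ref = math.exp(-cum_base_hazard * math.exp(lp_ref))
    surv_i = math.exp(-cum_base_hazard * math.exp(lp_i))
    return (math.exp(lp_ref) * surv_ref) / (math.exp(lp_i) * surv_i)
```

If the covariates have no effect, so that `lp_ref == lp_i`, the ratio is exactly 1, in line with the remark above that stabilized weights equal 1 when there is no confounding.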

As sketched in Web Appendix G, with stabilized weights set equal to one or estimated, (30) can be solved easily in $θ$ via a Newton–Raphson algorithm. Web Appendix G also gives a heuristic argument demonstrating that $\hat{θ}$ is asymptotically normal, which leads to an expression for its approximate sampling variance using the sandwich technique.

6. SIMULATIONS

We report on simulation studies demonstrating performance of the methods, each involving 1000 Monte Carlo replications, based roughly on the Moderna trial. We took $p_{A} = 0.5$ and $T_{A} = 12$ , $T_{P} = 19$ , $T_{U} = 21$ , and $T_{C} = 31$ , where all times are in weeks, and considered an analysis at calendar time $L = 52$ weeks, with $n = 30{,}000$. In all cases, $g (u; θ_{1}) = θ_{1} I (u > v)$ where $v = 20$ weeks and $θ_{0} = \log (0.05)$ , corresponding to VE = 95% prior to time v, so that, depending on θ₁, VE potentially wanes following v. We consider $θ_{1} = \log (7)$ , corresponding to VE = 65% after time v, and $θ_{1} = 0$ , corresponding to no waning.
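The correspondence between $(θ_{0}, θ_{1})$ and the stated VE values can be checked directly from $V E (τ) = 1 - \exp \{θ_{0} + g (τ; θ_{1})\}$ with the step function $g$ above. The helper name `ve` is hypothetical; the numerical settings are those of this section.

```python
import math

THETA0 = math.log(0.05)  # log relative infection rate before waning
V = 20.0                 # weeks since full efficacy at which waning may begin


def ve(tau, theta1, theta0=THETA0, v=V):
    """Vaccine efficacy at time tau: 1 - exp{theta0 + theta1 * I(tau > v)}."""
    g = theta1 if tau > v else 0.0
    return 1.0 - math.exp(theta0 + g)


ve_early = ve(10.0, math.log(7))   # 1 - 0.05     = 0.95
ve_late = ve(30.0, math.log(7))    # 1 - 0.05 * 7 = 0.65
ve_no_waning = ve(30.0, 0.0)       # 1 - 0.05     = 0.95
```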

Because the trial and unblinding process are ongoing, we were not able to base our generative scenarios on data from the trial. Owing to the complexity of the trial and multiple potential sources of confounding, to facilitate exploration of a range of conditions while controlling computational complexity and intensity, we focused on several basic scenarios meant to represent varying degrees of confounding consistent with our expectations for the most likely sources of such confounding in the trial. Specifically, we took $f_{E | X} (e | X)$ and $λ_{R, 2} (r | X, A)$ to not depend on $X$ (or A in the latter case) in any scenario, reflecting mostly random entry and PDCV unblinding processes. In scenarios involving confounding, we took $λ_{R, 1} (r | X, A)$ , corresponding to the period $[T_{P}, T_{U})$ in which “requested unblinding” occurred, and the “agreement process” $p_{Ψ} (X, Γ)$ to depend on $X$ , as described below, reflecting our belief that these processes could be associated with participant characteristics.

In the first set of simulations, we consider two cases: (i) no confounding, where all of $λ_{R, j} (r | X, A)$ , $j = 1, 2$ , $f_{E | X} (e | X)$ , and $p_{Ψ} (X, Γ)$ do not depend on $X$ ; and (ii) confounding, where $λ_{R, 1} (r | X, A)$ and $p_{Ψ} (X, Γ)$ depend on $X$ as above. In both (i) and (ii), the entry process was $E \sim U (0, T_{A})$ , that is, uniform on $[0, T_{A}]$ , and the unblinding process during PDCVs was $U (T_{U}, T_{C})$ ; see below. In each simulation experiment, for each participant in each Monte Carlo data set, we first generated $A \sim Bernoulli (p_{A})$ , two baseline covariates $X_{1} \sim Bernoulli (p_{X_{1}} = 0.5)$ and $X_{2} \sim N (μ_{X_{2}} = 45, σ_{X_{2}}^{2} = 10^{2})$ , and E as above. To obtain R, we generated $G_{1}$ to be exponential with hazard $λ_{R, 1} (r | X, A) = \exp [{\tilde{β}}_{10} + \{{\tilde{β}}_{11} (X_{1} - p_{X_{1}}) + {\tilde{β}}_{12} (X_{2} - μ_{X_{2}})\} (1 - A) + \{{\tilde{β}}_{13} (X_{1} - p_{X_{1}}) + {\tilde{β}}_{14} (X_{2} - μ_{X_{2}})\} A]$ , where ${\tilde{β}}_{10} = \log (0.036)$ , corresponding to roughly 7% unblinding during $[T_{P}, T_{U})$ , and $({\tilde{β}}_{11}, {\tilde{β}}_{12}, {\tilde{β}}_{13}, {\tilde{β}}_{14}) = (0, 0, 0, 0)$ for (i), no confounding, and $(- 0.8, - 0.08, 0.8, 0.08)$ for (ii), confounding. With $R_{1} = T_{P} + G_{1}$ and $R_{2} \sim U (T_{U}, T_{C})$ , we let $\tilde{Γ} = 1 + I (R_{1} \geq T_{U})$ and $\tilde{R} = R_{1} I (\tilde{Γ} = 1) + R_{2} I (\tilde{Γ} = 2)$ . We generated Ψ as Bernoulli $\{p_{Ψ} (X, \tilde{Γ})\}$ , with $p_{Ψ} (X, \tilde{Γ}) = expit \{{\tilde{γ}}_{0} + {\tilde{γ}}_{1} (X_{1} - p_{X_{1}}) + {\tilde{γ}}_{2} (X_{2} - μ_{X_{2}}) + {\tilde{γ}}_{3} \tilde{Γ}\}$ and $expit (u) = (1 + e^{- u})^{- 1}$ , where ${\tilde{γ}}_{0} = 1.4$ , corresponding to approximately 80% agreement to receive the study vaccine by unblinded placebo participants, and $({\tilde{γ}}_{1}, {\tilde{γ}}_{2}, {\tilde{γ}}_{3}) = (0, 0, - 0.1)$ for (i) and $(- 0.8, - 0.08, - 0.1)$ for (ii).
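The generation of $(\tilde{R}, \tilde{Γ}, Ψ)$ for one participant can be sketched as follows. This is an illustrative reading of the description above using only the Python standard library, not the authors' R code; the function name `draw_unblinding`, its argument layout, and the use of `random.Random` are assumptions.

```python
import math
import random

T_P, T_U, T_C = 19.0, 21.0, 31.0  # weeks, as in the simulation settings


def expit(u):
    return 1.0 / (1.0 + math.exp(-u))


def draw_unblinding(rng, x1, x2, a, beta, gamma,
                    beta10=math.log(0.036), p_x1=0.5, mu_x2=45.0):
    """Draw (R~, Gamma~, Psi) for one participant.

    beta  = (b11, b12, b13, b14): covariate effects on the requested-unblinding
            hazard, the first pair for placebo (a = 0), the second for vaccine.
    gamma = (g0, g1, g2, g3): coefficients of the agreement model.
    """
    b11, b12, b13, b14 = beta
    lin = (beta10
           + (b11 * (x1 - p_x1) + b12 * (x2 - mu_x2)) * (1 - a)
           + (b13 * (x1 - p_x1) + b14 * (x2 - mu_x2)) * a)
    r1 = T_P + rng.expovariate(math.exp(lin))  # requested unblinding time
    r2 = rng.uniform(T_U, T_C)                 # unblinding at a PDCV
    gamma_t = 1 + int(r1 >= T_U)               # 1 = by request, 2 = at a PDCV
    r_t = r1 if gamma_t == 1 else r2
    g0, g1, g2, g3 = gamma
    p_psi = expit(g0 + g1 * (x1 - p_x1) + g2 * (x2 - mu_x2) + g3 * gamma_t)
    psi = 1 if rng.random() < p_psi else 0     # agrees to receive study vaccine
    return r_t, gamma_t, psi


rng = random.Random(0)
draws = [draw_unblinding(rng, 1, 45.0, 0, (0, 0, 0, 0), (1.4, 0, 0, -0.1))
         for _ in range(20000)]
frac_requested = sum(1 for _, g, _ in draws if g == 1) / len(draws)
frac_agree = sum(p for _, _, p in draws) / len(draws)
```

Under the no-confounding setting, the probability of requested unblinding is $1 - \exp (- 0.036 \times 2) \approx 0.07$, so `frac_requested` should be close to 7%, and `frac_agree` should be a little under 80%.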

To generate $U, Δ$ , we first generated $T_{0}^{*} (E, R)$ and $T_{1}^{*} (E, R)$ based on (10)–(11), with $λ^{b} (t) = λ^{b} = \exp \{δ_{0} + δ_{1} (X_{1} - p_{X_{1}}) + δ_{2} (X_{2} - μ_{X_{2}}) + Z\}$ , where $(δ_{0}, δ_{1}, δ_{2}) = \{\log (0.0006), 0.4, 0.04\}$ , leading to approximately a 3% infection rate for placebo participants over L, and $Z \sim N (0, 0.04)$ ; $λ_{ℓ}^{u} (t) = λ_{ℓ}^{u} = λ^{b}$ ; $ζ (t) = 0$ ; and $λ^{u} (t) = λ^{u} = 1.25 λ^{b}$ , so that $λ_{a} (t, e, r)$ in (10)–(11), $a = 0, 1$ , are piecewise constant hazards. $T_{0}^{*} (E, R)$ and $T_{1}^{*} (E, R)$ were obtained via inverse transform sampling. We then generated U (calendar time) as $U = E + A T_{1}^{*} (E, R) + (1 - A) [I \{T_{0}^{*} (E, R) < \tilde{R}\} T_{0}^{*} (E, R) + I \{T_{0}^{*} (E, R) \geq \tilde{R}\} \{Ψ T_{0}^{*} (E, R) + (1 - Ψ) T_{r}^{*}\}]$ , where $T_{r}^{*} = \tilde{R} + G_{2}$ for $G_{2}$ exponential with hazard $λ^{b}$ ; infection times for unblinded placebo participants who decline vaccine are not used in the analysis. Finally, we set $Δ = I (U < L)$ , and defined $R = U I (U \leq \tilde{R}) + \tilde{R} I (U > \tilde{R})$ and $Γ = \tilde{Γ} I (U > R)$ . Although we obtained Ψ for all n participants, Ψ is used only when $A = 0$ , $Γ \geq 1$ .
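Inverse transform sampling from a piecewise constant hazard amounts to solving $H (t) = - \log u$ for a uniform draw u, where H is the piecewise linear cumulative hazard. The sketch below is generic: the breakpoints and rates are hypothetical inputs, and mapping them to the specific hazards $λ_{a} (t, e, r)$ in (10)–(11) is left to the reader.

```python
import math


def sample_piecewise_exponential(u, breaks, rates):
    """Invert the survival function S(t) = exp{-H(t)} at u in (0, 1).

    The hazard equals rates[k] on [breaks[k], breaks[k + 1]); the last piece
    extends to infinity. Returns math.inf if the cumulative hazard never
    reaches -log(u), for example when the final rate is zero.
    """
    target = -math.log(u)  # cumulative hazard that must be accumulated
    cum = 0.0
    for k, rate in enumerate(rates):
        start = breaks[k]
        end = breaks[k + 1] if k + 1 < len(breaks) else math.inf
        if rate <= 0.0:
            continue  # no hazard accumulates on this piece
        piece = rate * (end - start)
        if cum + piece >= target:
            return start + (target - cum) / rate
        cum += piece
    return math.inf


# Hazard 1 on [0, 1) and 2 on [1, inf): H(t) = 2 is reached at t = 1.5.
t_example = sample_piecewise_exponential(math.exp(-2.0), [0.0, 1.0], [1.0, 2.0])
```

In a simulation, u would be drawn from a uniform distribution on (0, 1) for each participant, with the breakpoints set by that participant's entry, full-efficacy, and unblinding times.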

For each combination of (i) and (ii) and (a) $θ_{1} = \log (7)$ and (b) $θ_{1} = 0$ , we estimated $θ$ and thus $V E (τ)$ for $τ \leq v$ and $τ > v$ two ways: taking the stabilized weights equal to 1, so disregarding possible confounding, and with estimated stabilized weights. The latter were obtained by fitting proportional hazards models for entry time E with linear predictor $ν_{1} X_{1} + ν_{2} X_{2}$ and for $λ_{R, j} (r | X, A)$ , $j = 1, 2$ , with linear predictors $β_{11} X_{1} + β_{12} X_{2} + β_{13} A + β_{14} X_{1} A + β_{15} X_{2} A$ and $β_{21} X_{1} + β_{22} X_{2}$ , respectively; and a logistic regression model for $p_{Ψ} (X, Γ) = expit {(γ_{10} + γ_{11} X_{1} + γ_{12} X_{2}) I (Γ = 1) + (γ_{20} + γ_{21} X_{1} + γ_{22} X_{2}) I (Γ = 2)}$ .

Table 2 presents the results for estimation of θ₁, dictating waning; $V E_{\leq 20} = 1 - \exp (θ_{0})$ , VE prior to $v = 20$ weeks; and $V E_{> 20} = 1 - \exp (θ_{0} + θ_{1})$ , VE after $v = 20$ weeks. Because the Monte Carlo distributions of some of these quantities exhibited slight skewness (for the VE quantities, likely due to the exponentiation), we report both the Monte Carlo mean and median. Estimation of $V E_{\leq 20}$ shows virtually no bias for both (a) and (b); that for $V E_{> 20}$ in case (a) shows minimal bias and virtually none for (b). In all cases, standard errors obtained via the sandwich technique as outlined in Web Appendix G along with the delta method for the VEs track the Monte Carlo standard deviations. Under both (i) no confounding and (ii) confounding, estimation of the stabilized weights appears to have little consequence for precision of the estimators relative to setting them equal to 1. Wald 95% confidence intervals, exponentiated for the VEs, achieve nominal coverage. For (b) and each combination of stabilized weights set equal to 1 or estimated and (i), no confounding, and (ii), confounding, we also calculated the empirical Type I error achieved by a Wald test at level of significance 0.05 for VE waning addressing the null and alternative hypotheses $H_{0} : θ_{1} \leq 0$ versus $H_{1} : θ_{1} > 0$ . These values are 0.04 and 0.06 when using stabilized weights set equal to 1 under (i) and (ii), respectively; the analogous values with estimated weights are 0.05 and 0.05 under (i) and (ii).
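One plausible reading of “Wald 95% confidence intervals, exponentiated for the VEs” is that the interval is formed on the log relative-rate scale ($θ_{0}$ or $θ_{0} + θ_{1}$) and then mapped through $1 - \exp (\cdot)$. The sketch below implements that reading together with the one-sided Wald test of $H_{0} : θ_{1} \leq 0$; it is an interpretation under stated assumptions, not the authors' code, and the function names are hypothetical. The standard errors are taken as given, for example from the sandwich estimator.

```python
import math
from statistics import NormalDist


def ve_wald_ci(log_rr_hat, se, level=0.95):
    """Wald interval on the log relative-rate scale, mapped to the VE scale.

    Because VE = 1 - exp(log_rr) is decreasing in log_rr, the upper limit for
    log_rr gives the lower limit for VE, and vice versa.
    """
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    lower = 1.0 - math.exp(log_rr_hat + z * se)
    upper = 1.0 - math.exp(log_rr_hat - z * se)
    return lower, upper


def one_sided_wald_reject(theta1_hat, se, alpha=0.05):
    """Reject H0: theta1 <= 0 in favor of H1: theta1 > 0 at level alpha."""
    return theta1_hat / se > NormalDist().inv_cdf(1.0 - alpha)


# Illustrative inputs only: a point estimate at the true theta0 with SE 0.4.
ci_low, ci_high = ve_wald_ci(math.log(0.05), 0.4)
```

An interval built this way never exceeds 1, which a symmetric interval computed directly on the VE scale would not guarantee when VE is near 0.95.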

TABLE 2.

Simulation results based on 1000 Monte Carlo replications, first scenario. Mean = mean of Monte Carlo estimates, Med = median of Monte Carlo estimates, SD = standard deviation of Monte Carlo estimates, SE = average of standard errors obtained via the sandwich technique/delta method, Cov = empirical coverage of nominal 95% Wald confidence interval (transformed for $V E$ ). $V E_{\leq 20} = 1 - \exp (θ_{0})$ , VE prior to $v = 20$ weeks; $V E_{> 20} = 1 - \exp (θ_{0} + θ_{1})$ , VE after $v = 20$ weeks. True values: (a) $θ_{1} = \log (7) = 1.946$ , $V E_{\leq 20} = 0.95$ , $V E_{> 20} = 0.65$ ; (b) $θ_{1} = 0$ , $V E_{\leq 20} = V E_{> 20} = 0.95$

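| Scenario | Quantity | Mean (wt = 1) | Med (wt = 1) | SD (wt = 1) | SE (wt = 1) | Cov (wt = 1) | Mean (wt est.) | Med (wt est.) | SD (wt est.) | SE (wt est.) | Cov (wt est.) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| (i), no confounding; (a) $θ_{1} = \log (7)$ | θ₁ | 1.961 | 1.935 | 0.310 | 0.308 | 0.95 | 1.983 | 1.959 | 0.303 | 0.310 | 0.96 |
| | $V E_{\leq 20}$ | 0.950 | 0.953 | 0.019 | 0.019 | 0.95 | 0.950 | 0.952 | 0.019 | 0.019 | 0.95 |
| | $V E_{> 20}$ | 0.634 | 0.663 | 0.183 | 0.174 | 0.96 | 0.626 | 0.662 | 0.188 | 0.177 | 0.96 |
| (ii), confounding; (a) $θ_{1} = \log (7)$ | θ₁ | 2.030 | 2.013 | 0.325 | 0.320 | 0.95 | 1.990 | 1.973 | 0.346 | 0.335 | 0.95 |
| | $V E_{\leq 20}$ | 0.951 | 0.953 | 0.019 | 0.018 | 0.96 | 0.951 | 0.952 | 0.019 | 0.019 | 0.95 |
| | $V E_{> 20}$ | 0.614 | 0.647 | 0.199 | 0.185 | 0.95 | 0.619 | 0.665 | 0.201 | 0.186 | 0.94 |
| (i), no confounding; (b) $θ_{1} = 0$ | θ₁ | −0.020 | −0.019 | 0.433 | 0.422 | 0.95 | 0.007 | 0.019 | 0.421 | 0.424 | 0.96 |
| | $V E_{\leq 20}$ | 0.950 | 0.952 | 0.020 | 0.019 | 0.95 | 0.950 | 0.952 | 0.020 | 0.019 | 0.96 |
| | $V E_{> 20}$ | 0.947 | 0.954 | 0.032 | 0.030 | 0.96 | 0.946 | 0.953 | 0.033 | 0.031 | 0.95 |
| (ii), confounding; (b) $θ_{1} = 0$ | θ₁ | 0.053 | 0.045 | 0.446 | 0.436 | 0.95 | 0.011 | −0.004 | 0.452 | 0.450 | 0.96 |
| | $V E_{\leq 20}$ | 0.951 | 0.952 | 0.019 | 0.019 | 0.96 | 0.950 | 0.952 | 0.020 | 0.019 | 0.95 |
| | $V E_{> 20}$ | 0.944 | 0.951 | 0.035 | 0.032 | 0.96 | 0.945 | 0.954 | 0.036 | 0.033 | 0.95 |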


In the first set of simulations, the confounding induced by our generative choices led to little to no bias in the estimators for θ₁ and the VEs prior to and after 20 weeks. Notably, modeling and fitting of the stabilized weights to adjust for potential confounding shows little effect relative to setting the stabilized weights to 1. To the extent that this scenario is a plausible approximation to actual conditions of the trial, it may be that confounding will not be a serious challenge for the analysis of VE waning.

To examine the ability of the methods with estimated stabilized weights to adjust for confounding that potentially could be sufficiently strong to bias results, we carried out additional simulations under settings (a) $θ_{1} = \log (7)$ and (b) $θ_{1} = 0$ with (ii) confounding in which our choices of generative parameters induce a stronger association between the potential infection times and the agreement process. Specifically, we took instead $(δ_{0}, δ_{1}, δ_{2}) = {\log (0.0006), 0.7, 0.07}$ and $({\tilde{γ}}_{0}, {\tilde{γ}}_{1}, {\tilde{γ}}_{2}, {\tilde{γ}}_{3}) = (1.4, - 1.0, - 0.1, - 0.1)$ , with all other settings identical to those above.

Table 3 shows the results. The estimators for θ₁ and $V E_{> 20}$ are slightly biased when stabilized weights are set equal to 1, although coverage probability for the latter is at the nominal level. This feature is mitigated by use of estimated stabilized weights. Coverage probability for θ₁ is somewhat lower than nominal. Under (b), empirical Type I error achieved by a Wald test at level of significance 0.05 of $H_{0} : θ_{1} \leq 0$ versus $H_{1} : θ_{1} > 0$ is 0.12 when stabilized weights are equal to 1, demonstrating the potential for biased inference; Type I error is 0.06 using estimated stabilized weights, leading to a more reliable test.

TABLE 3.

Simulation results based on 1000 Monte Carlo replications, second scenario. Entries are as in Table 2. True values: (a) $θ_{1} = \log (7) = 1.946$ , $V E_{\leq 20} = 0.95$ , $V E_{> 20} = 0.65$ ; (b) $θ_{1} = 0$ , $V E_{\leq 20} = V E_{> 20} = 0.95$

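| Scenario | Quantity | Mean (wt = 1) | Med (wt = 1) | SD (wt = 1) | SE (wt = 1) | Cov (wt = 1) | Mean (wt est.) | Med (wt est.) | SD (wt est.) | SE (wt est.) | Cov (wt est.) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| (ii), confounding; (a) $θ_{1} = \log (7)$ | θ₁ | 2.125 | 2.100 | 0.315 | 0.299 | 0.93 | 2.009 | 2.008 | 0.346 | 0.325 | 0.94 |
| | $V E_{\leq 20}$ | 0.952 | 0.953 | 0.017 | 0.016 | 0.97 | 0.950 | 0.952 | 0.017 | 0.017 | 0.96 |
| | $V E_{> 20}$ | 0.581 | 0.611 | 0.191 | 0.182 | 0.95 | 0.613 | 0.640 | 0.179 | 0.175 | 0.96 |
| (ii), confounding; (b) $θ_{1} = 0$ | θ₁ | 0.171 | 0.149 | 0.436 | 0.403 | 0.92 | 0.050 | 0.053 | 0.447 | 0.426 | 0.95 |
| | $V E_{\leq 20}$ | 0.951 | 0.953 | 0.017 | 0.017 | 0.97 | 0.950 | 0.952 | 0.018 | 0.017 | 0.96 |
| | $V E_{> 20}$ | 0.937 | 0.945 | 0.038 | 0.034 | 0.95 | 0.942 | 0.949 | 0.034 | 0.032 | 0.95 |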


Overall, we speculate that, because the Moderna study is a randomized, double‐blind trial, the unblinding period is finite and eventually all participants are unblinded, the refusal rate is likely to be low, and infection rates are low, confounding may not lead to substantial bias in estimation of VE waning.

7. DISCUSSION

We have proposed a conceptual framework based on potential outcomes for study of VE in which assumptions on biological, behavioral, and other phenomena are made transparent. The methods provide a mechanism to account for possible confounding. The corresponding statistical framework combines information from blinded and unblinded participants over time. We focus on the setting of the Moderna phase 3 trial, but the principles can be adapted to other settings, including the blinded crossover design of Follmann et al. (2020), and apply to ongoing and future vaccine trials in which unblinding may well occur throughout and some participants may refuse the study vaccine in favor of already‐licensed products. Extension of the methods to the setting where time between doses varies across vaccinated participants due to either deviations from the protocol or by design would require modification of the framework presented here to represent VE as a function of both time between doses and time since vaccination.

Our approach and those of Lin et al. (2021) (LZG), Follmann et al. (2020), and Fintzi and Follmann (2021) for estimation of VE waning use a calendar time formulation and Cox hazard models. As do we, Follmann et al. (2020) and Fintzi and Follmann (2021) include data from placebo participants who cross over to study vaccine, whereas LZG censor such subjects and propose a sensitivity analysis. Because the approach of Follmann et al. (2020) and Fintzi and Follmann (2021) is based on a randomized crossover design that maintains the blind, confounding is not addressed, while LZG adjust for confounding via regression modeling. In our methodology, confounding is addressed through a potential outcomes formulation and IPW. As we do, Follmann et al. (2020) and Fintzi and Follmann (2021) represent $V E (τ)$ using parametric or flexible spline models; LZG model $V E (τ)$ nonparametrically.

Through condition (ii) in Section 3, (ii) $E {π_{1} (t, τ) | X} / E {π_{0} (t) | X} = q (τ)$ , the methods embed the assumption that VE is similar across current and emerging viral variants. If the analyst is unwilling to adopt an assumption like condition (ii), then it is not possible to rule out that the data from the blinded (prior to $T_{P}$ ) and unblinded (starting at $T_{P}$ ) phases of the trial reflect very different variant mixtures. In this case, calendar time and time since vaccination cannot be disentangled, and thus, it is not possible to evaluate VE solely as a function of time since vaccination. However, it may be possible to evaluate the ratio of infection rates under vaccine at any time t (and thus variant mixture in force at t) after different times since vaccination τ₁ and τ₂, say, during the unblinded phase of the trial, namely, $I_{1}^{u} (t, τ_{1}) / I_{1}^{u} (t, τ_{2})$ , $t \geq T_{P}$ . The infection rates can be estimated based on the infection status data at time t from vaccinated individuals who received vaccine at times $t - τ_{1}$ and $t - τ_{2}$ , respectively. These infection rates and their ratio will reflect information about the waning of the vaccine itself under the conditions at time t, and, in fact, this infection rate ratio can be viewed as the ratio of vaccine efficacies at τ₁ and τ₂. However, because after $T_{C}$ information on $I_{0}^{u} (t)$ will no longer be available, it is not possible to deduce VE itself for $t \geq T_{C}$ . But if data external to the trial became available that provide information on VE at t, even for small τ, it may be possible to integrate this information with that from the infection rates to gain insight into VE as a function of τ.

Supporting information

Web Appendices referenced in Sections 3–6 and R code implementing the simulation scenarios in Section 6 are available with this paper at the Biometrics website on Wiley Online Library. An R package, VEwaning, implementing the methodology is available on GitHub at https://github.com/sth1402/VEwaning and at the Comprehensive R Archive Network (CRAN) at https://cran.r-project.org/web/packages/VEwaning/index.html.


ACKNOWLEDGMENTS

The authors thank Dean Follmann for helpful discussions and insights. The authors are grateful to Shannon Holloway for creating the R package VEwaning, noted in the Supporting Information.

Tsiatis AA, Davidian M. Estimating vaccine efficacy over time after a randomized study is unblinded. Biometrics. 2021;1–14. 10.1111/biom.13509

Contributor Information

Anastasios A. Tsiatis, Email: tsiatis@ncsu.edu

Marie Davidian, Email: davidian@ncsu.edu

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article, as no datasets are generated or analyzed in this paper. The methods developed in the paper are proposed to enable future analyses of data from ongoing vaccine trials from which the required data are not yet fully accrued.

REFERENCES

Baden, L.R. , El Sahly, H.M. , Essink, B. , Kotloff, K. , Frey, S. , Novak, R. et al. for the COVE Study Group (2020) Efficacy and safety of the mRNA‐1273 SARS‐CoV‐2 vaccine. New England Journal of Medicine, 384, 403–416. 10.1056/NEJMoa2035389. [DOI] [PMC free article] [PubMed] [Google Scholar]
COVID‐19 Vaccine Tracker . https://covid19.trackvaccines.org/. Last updated 11 July 2021.
Fintzi, J. and Follmann, D. (2021) Assessing vaccine durability in randomized trials following placebo crossover. arXiv preprint arXiv:2101.01295v3. [DOI] [PMC free article] [PubMed]
Fleming, T.R. and Harrington, D.P. (2005) Counting processes and survival analysis. New York: Wiley. [Google Scholar]
Follmann, D. , Fintzi, J. , Fay, M.P. , Janes, H.E. , Baden, L. , El Sahly, H. et al. (2020) Assessing durability of vaccine effect following blinded crossover in COVID‐19 vaccine efficacy trials. medRxiv. 2020 Dec 14;2020.12.14.20248137. 10.1101/2020.12.14.20248137. [DOI]
Halloran, M.E. , Longini, I.M. and Struchiner, C.J. (1996) Estimability and interpretation of vaccine efficacy using frailty mixing models. American Journal of Epidemiology, 144, 83–97. [DOI] [PubMed] [Google Scholar]
Lin, D.‐Y. , Zeng, D. and Gilbert, P.B. (2021) Evaluating the long‐term efficacy of COVID‐19 vaccines. Clinical Infectious Diseases, ciab226. 10.1093/cid/ciab226. [DOI] [PMC free article] [PubMed] [Google Scholar]
Longini, I.M. and Halloran, M.E. (1996) Frailty mixture model for estimating vaccine efficacy. Journal of the Royal Statistical Society, Series C, 45, 165–173. [Google Scholar]
Moderna Clinical Study Protocol, Amendment 6, 23 December 2020, available at https://www.modernatx.com/sites/default/files/content_documents/Final%20mRNA‐1273‐P301%20Protocol%20Amendment%206%20‐%2023Dec2020.pdf
Robins, J.M. , Hernán, M.A. and Brumback, B. (2000) Marginal structural models and causal inference in epidemiology. Epidemiology, 11, 550–560. [DOI] [PubMed] [Google Scholar]
Rubin, D.B. (1980) Bias reduction using Mahalanobis‐metric matching. Biometrics, 36, 293–298. [Google Scholar]
Yang, S. , Tsiatis, A.A. and Blazing, M. (2018) Modeling survival distribution as a function of time to treatment discontinuation: A dynamic treatment regime approach. Biometrics, 74, 900–909. [DOI] [PMC free article] [PubMed] [Google Scholar]
