Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2021 Aug 13:10.1111/biom.13509. Online ahead of print. doi: 10.1111/biom.13509

Estimating vaccine efficacy over time after a randomized study is unblinded

Anastasios A Tsiatis 1, Marie Davidian 1,
PMCID: PMC8444907  PMID: 34174097

Abstract

The COVID‐19 pandemic due to the novel coronavirus SARS CoV‐2 has inspired remarkable breakthroughs in the development of vaccines against the virus and the launch of several phase 3 vaccine trials in Summer 2020 to evaluate vaccine efficacy (VE). Trials of vaccine candidates using mRNA delivery systems developed by Pfizer‐BioNTech and Moderna have shown substantial VEs of 94–95%, leading the US Food and Drug Administration to issue Emergency Use Authorizations and subsequent widespread administration of the vaccines. As the trials continue, a key issue is the possibility that VE may wane over time. Ethical considerations dictate that trial participants be unblinded and those randomized to placebo be offered study vaccine, leading to trial protocol amendments specifying unblinding strategies. Crossover of placebo subjects to vaccine complicates inference on waning of VE. We focus on the particular features of the Moderna trial and propose a statistical framework based on a potential outcomes formulation within which we develop methods for inference on potential waning of VE over time and estimation of VE at any postvaccination time. The framework clarifies assumptions made regarding individual‐ and population‐level phenomena and acknowledges the possibility that subjects who are more or less likely to become infected may be crossed over to vaccine differentially over time. The principles of the framework can be adapted straightforwardly to other trials.

Keywords: crossover, inverse probability weighting, potential outcomes, randomized phase 3 vaccine trial, waning vaccine efficacy

1. INTRODUCTION

The primary objective of a vaccine trial is to estimate vaccine efficacy (VE). Typically, these trials are double‐blind, placebo‐controlled studies in which participants are randomized to either vaccine or placebo and followed for the primary endpoint. This endpoint is often time to viral infection, on which inference on VE is based, where VE is defined as a measure of reduction in infection risk for vaccine relative to placebo, expressed as a percentage.

Vaccine trials have become the focus of immense global interest as a result of the COVID‐19 disease pandemic due to the novel coronavirus SARS‐CoV‐2 (COVID‐19 Vaccine Tracker). The pandemic inspired unprecedented scientific breakthroughs in the rapid development of vaccines against SARS‐CoV‐2, culminating in the launch of several large phase 3 vaccine trials in Summer 2020. Trials in the United States studying the vaccine candidates using messenger RNA (mRNA) delivery systems developed by Pfizer‐BioNTech and Moderna began in July 2020 and demonstrated substantial evidence of VEs of 94–95% at interim analyses, leading the US Food and Drug Administration (FDA) to issue Emergency Use Authorizations (EUAs) for both vaccines in December 2020 and to the rollout of vaccination programs shortly thereafter.

Implicit in the primary analysis in these trials is the assumption that VE is constant over the study period, and, with primary endpoint time to infection, VE is represented by (1 − the ratio of the hazard rate for vaccine to that for placebo), estimated based on a Cox proportional hazards model. As the trials continue following the EUAs, among the many issues to be addressed is the possibility that VE may wane over time. Principled evaluation of the nature and extent of waning of VE is of critical public health importance, as waning has implications for measures to control the pandemic. Were all participants in the trials to continue on their randomized assignments (study vaccine or placebo), evaluation of potential waning of VE would be straightforward. However, once efficacy is established, ethical considerations dictate the possibility of unblinding all participants and offering the study vaccine to those randomized to placebo. After consultation with stakeholders, Pfizer and Moderna issued amendments to their trial protocols specifying unblinding strategies and modifications to planned analyses.

Crossover of placebo subjects to the study vaccine of necessity complicates inference on waning of VE and has inspired recent research (Follmann et al., 2020; Fintzi and Follmann, 2021; Lin et al. 2021). We propose a statistical framework within which we develop methods for inference on whether or not VE wanes over time based on data where subjects are unblinded and those on placebo may cross over to study vaccine and in which assumptions made regarding individual and population phenomena are made transparent. It is possible that subjects who are more or less likely to become infected could be unblinded and cross over to vaccine differentially over time, which could lead to biased inferences due to confounding; accordingly, this possibility is addressed explicitly in the framework. The first author (AAT) has the privilege of serving on the Data and Safety Monitoring Board for all U.S. government‐sponsored COVID‐19 vaccine trials and is thus well acquainted with the unblinding approach for the Moderna trial. Accordingly, the development is based on the specifics of this trial, but the principles can be adapted to the features of other trials.

In Section 2, we review the Moderna trial and the resulting data. We present a conceptual framework in which we define VE precisely as a function of time postvaccination in Section 3. In Section 4, we develop a formal statistical framework within which we propose methodology for estimation of VE and describe its practical implementation in Section 5. Simulations demonstrating performance are presented in Section 6.

2. CLINICAL TRIAL STRUCTURE AND DATA

We first describe the timeline of the Moderna Coronavirus Efficacy (COVE) trial (Baden et al., 2020) on the scale of calendar time. The trial opened on July 27, 2020 (time 0), and reached full accrual at time TA (October 23, 2020). On December 11, 2020, denoted by TP, the FDA issued an EUA for the Pfizer vaccine, followed by an EUA for the Moderna mRNA‐1273 vaccine on TM= December 18, 2020. Amendment 6 of the study protocol was issued on December 23, 2020 and specified the unblinding strategy (see figure 2 of the protocol) under which, starting on TU= December 24, 2020, study participants are scheduled on a rolling basis over several months for Participant Decision clinic visits (PDCVs) at which they will be unblinded. If originally randomized to vaccine, participants continue to be followed; if randomized to placebo, participants can receive the Moderna vaccine at the PDCV or refuse and either seek another vaccine outside the study or remain unvaccinated. Let TC denote the time at which all PDCVs have taken place. The study will continue until time TF at which all participants will have completed full follow up at 24 months after initial treatment assignment. Assume that the analysis of VE using the methods in Sections 4.4 and 5 takes place at time TCLTF, where all participants have achieved the primary endpoint, requested to be unblinded, or attended the PDCV by L. The Moderna vaccine is administered in two doses, ideally 4 weeks apart, and is not thought to achieve full efficacy until 2 weeks following the second dose. Accordingly, the primary endpoint is defined as symptomatic viral infection occurring after a lag of =6 weeks following the initial dose.

Under this scheme, we characterize the data on a given participant as follows. Let 0ETA denote the calendar time at which the subject entered the trial, X denote baseline covariates, and A=0 (1) if assigned to placebo (vaccine). Denote observed time to infection on the scale of calendar time as U, and Δ=I(UL), where I(B)=1 if B is true and 0 otherwise. At TP, availability of the Pfizer vaccine commenced, at which point some subjects not yet infected requested to be unblinded. Denote by R (calendar time) the minimum of (i) time to such an unblinding, in which case TPR<TU, and define Γ=1; (ii) time of PDCV, so TUR<TC, and let Γ=2; or (iii) time to infection, in which case R=U and Γ=0. If Γ1 and A=1, so that the subject was randomized to vaccine, she/he continues to be followed; if A=0, she/he can agree to receive the Moderna vaccine, Ψ=1 or refuse, Ψ=0. We distinguish the cases Γ=1 and 2 to acknowledge different unblinding dynamics before and after TU. Because a very small number of participants requested unblinding before TP, and although the protocol allows participants to refuse unblinding at PDCV, all subjects are strongly encouraged to unblind, we do not include these possibilities in the formulation.

Table 1 summarizes the timeline and observed data. The trial data are thus

Oi={Ei,Xi,Ai,Ui,Δi,Ri,Γi,I(Γi1,Ai=0)Ψi},i=1,,n, (1)

independent and identically distributed (iid) across i.

TABLE 1.

Summary of notation. All times are on the scale of calendar time, where time 0 is the start of the trial

Variable Definition
Trial Milestones
TA
Full accrual reached, October 23, 2020
TP
Pfizer granted EUA, December 11, 2020
TM
Moderna granted EUA, December 18, 2020
TU
Participant Decision clinic visits (PDCVs) commence, December 24, 2020
TC
PDCVs conclude
TF
Follow‐up concludes, trial ends
Lag between initial vaccine dose and full efficacy, 6 weeks, TPTA>
L Time of analysis of vaccine efficacy using the proposed methods; L> time at which all subjects have achieved the endpoint, requested unblinding, or attended the PDCV, LTF
Observed Data on a Trial Participant
E Study entry time, 0ETA
X
Baseline information
A Treatment assignment, placebo, A=0, or vaccine, A=1
U,Δ
Time to symptomatic infection, indicator of infection by time L, Δ=I(UL)
R,Γ
Time to requested unblinding, PDCV/requested unblinding, or infection, whichever comes first
  Γ=0: R=U, infection occurs before requested/offered unblinding
  Γ=1: R= time to requested unblinding, TPR<TU
  Γ=2: R= time to PDCV or requested unblinding, TUR<TC
Ψ If A=0, Γ1, indicator or whether subject receives Moderna vaccine, Ψ=1, or refuses and seeks another vaccine outside the study or remains unvaccinated, Ψ=0

3. CONCEPTUALIZATION OF VACCINE EFFICACY

Similar to Halloran et al. (1996) and Longini and Halloran (1996), we consider the following framework in which to conceptualize VE. The study population, comprising individuals for which inference on VE is of interest, is that of individuals susceptible to infection, represented by the trial participants. There is a population of individuals outside the trial with which trial participants interact, assumed to be much larger than the number of participants, so that interactions among participants are much less likely than interactions with the outside population. The probability that a trial participant will become infected at calendar time t depends on three factors: c(t), the contact rate, the number of contacts with the outside population per unit time; p(t), the prevalence of infections in the outside population at t; and π(t), the transmission probability at t, the probability a susceptible individual in the study population will become infected per contact with an infected individual from the outside population. Dependence of π(t) on time acknowledges the emergence of new variants of the virus, which may be more or less virulent, as in the COVID‐19 pandemic. Assuming random mixing, p(t)c(t) is the contact rate at time t with infected individuals, and the infection rate at time t is p(t)c(t)π(t).

We adapt this framework to the COVID‐19 pandemic. The prevalence rate in the pandemic can vary substantially in time and space, so denote by S the trial site at which a participant is enrolled, and let p(t,s) be the prevalence at time t at site S=s. Although p(t,s) varies by t and s, assume that it is unaffected by the individuals in the trial and thus represents an external force. We view the contact rate as individual specific; accordingly, for an arbitrary individual in the study population, let the random variables {c0b(t),c1b(t),c0u(t),c01u(t),c1u(t)} denote potential contact rates. These potential outcomes can be regarded as individual‐specific behavioral characteristics of trial participants, where some may be more careful and make fewer contacts while others take more risks, and behavior can vary over time and by vaccination and blinding status. Here, cab(t) is the contact rate at time t if the individual were to receive vaccine, a=1, or placebo, a=0, and be blinded to this assignment; by virtue of blinding, it is reasonable to take c1b(t)=c0b(t)=cb(t).

As in Table 1, letting ℓ denote the lag between initial dose and full efficacy, c01u(t) reflects behavior of a placebo subject who is unblinded, receives the Moderna vaccine, and is within ℓ weeks of vaccination. Likewise, c1u(t) reflects behavior of any unblinded Moderna vaccine recipient after ℓ, both those originally randomized to placebo and crossed over to the vaccine and those originally randomized to vaccine. Thus, c01u(t) allows for more cautious behavior before full efficacy is achieved for recently vaccinated placebo subjects; in the trial, all subjects randomized to vaccine were past the full efficacy lag at the time of unblinding (as in Table 1, TPTA). Similar to the stable unit treatment value assumption (Rubin, 1980), assume that c1u(t) is the same if the individual was randomized to vaccine and unblinded before t or was randomized to placebo and subsequently unblinded and crossed over to the Moderna vaccine before t. The rate c0u(t) reflects behavior of an unblinded placebo subject who does not cross over to the Moderna vaccine and does not play a role in the development, and, as demonstrated in Section 4.4, such subjects do not contribute to the analysis of VE.

Finally, for an arbitrary participant, let the random variable π0(t) be the potential individual‐specific transmission probability per contact at t if she/he were to receive placebo, and let π1(t,τ) be the same if she/he were to receive study vaccine and have been vaccinated for τ0 units of time. As we now demonstrate, this formulation allows us to represent VE as a function of τ and thus consider whether or not VE wanes over time since vaccination.

With the set of potential outcomes for an arbitrary individual in the study population who enrolls at site S thus given by {cb(t),c0u(t),c01u(t),c1u(t)t>0,π0(t),π1(t,τ),τ0}, the infection rate in the study population at calendar time t if all individuals were to receive placebo and be blinded to that assignment is I0b(t)=E{p(t,S)cb(t)π0(t)}; likewise, the infection rate at t if all individuals were to receive vaccine at time tτ and be blinded to that assignment is I1b(t,τ)=E{p(t,S)cb(t)π1(t,τ)}. The relative infection rate at t is then

Rb(t,τ)=I1b(t,τ)I0b(t)=E{p(t,S)cb(t)π1(t,τ)}E{p(t,S)cb(t)π0(t)}. (2)

Accordingly, VE at time t after vaccination at tτ is VE(t,τ)=1Rb(t,τ), reflecting the proportion of infections at t that would be prevented if the study population were vaccinated and on study vaccine for τ units of time during the blinded phase of the study.

In the sequel, we assume that Rb(t,τ) and thus VE(t,τ) depend only on τ and write Rb(τ) and VE(τ)=1Rb(τ). This assumption embodies the belief that, although infection rates may change over time, the relative effect of vaccine to placebo remains approximately constant and holds if (i) {π1(t,τ),π0(t)}{S,cb(t)}|X, where means “independent of” and this independence is conditional on X; and (ii) E{π1(t,τ)|X}/E{π0(t)|X}=q(τ), so does not depend on t and X. Condition (i) reflects the interpretation of π1(t,τ) and π0(t) as inherent biological characteristics of an individual, whereas S and cb(t) are external and behavioral characteristics, respectively; thus, once common individual and external baseline covariates are taken into account, biological and geographic/behavioral characteristics are unrelated. Condition (ii) implies that, although new viral variants may change transmission probabilities under both vaccine and placebo over time, this change stays in constant proportion, and this proportion is similar for individuals with different characteristics. Further discussion is given in Section 7 and Web Appendix B of the Supporting Information.

Within this framework, the goal of inference on waning of VE based on the data from the trial can be stated precisely as inference on VE(τ)=1Rb(τ), τ, so reflecting VE after full efficacy is achieved. It is critical to recognize that, like estimands of interest in most clinical trials, VE(τ) represents VE at time since vaccination τ under the original conditions of the trial, under which all participants are blinded. The challenge we address in subsequent sections is how to achieve valid inference on VE(τ), τ, using data from the modified trial in which blinded participants are unblinded in a staggered fashion, with placebo subjects offered the option to receive the study vaccine.

We propose a semiparametric model within which we cast this objective. Let I01u(t,τ)=E{p(t,S)c01u(t)π1(t,τ)}, τ<, and I1u(t,τ)=E{p(t,S)c1u(t)π1(t,τ)}, τ, be the infection rates in the study population at t if all individuals were to receive vaccine at time tτ and be unblinded to that fact. Analogous to (i) above, assume that {π1(t,τ),π0(t)}{S,c01u(t),c1u(t)}|X, and continue to assume condition (ii). Then, for two values τ1,τ2 of τ, it is straightforward that (see Web Appendix A of the Supporting Information)

I01u(t,τ1)I01u(t,τ2)=Rb(τ1)Rb(τ2),τ1,τ2<;I1u(t,τ1)I1u(t,τ2)=Rb(τ1)Rb(τ2),τ1,τ2. (3)

Defining I01u(t)=I01u(t,0)=E{p(t,S)c01u(t)π1(t,0)} and I1u(t)=I1u(t,)=E{p(t,S)c1u(t)π1(t,)}, by (3) with τ1=τ and τ2=0 (ℓ) on the left (right) hand side, the infection rates at t if all individuals in the study population were unblinded and to receive vaccine at time tτ are

I01u(t,τ)=I01u(t)Rb(τ)Rb(0),τ<;I1u(t,τ)=I1u(t)Rb(τ)Rb(),τ. (4)

Likewise, from (2), the infection rate at t if all individuals in the study population were blinded and to receive vaccine at time tτ is

I1b(t,τ)=I0b(t)Rb(τ). (5)

We now represent the infection rate ratio Rb(τ) as

Rb(τ;θ)=exp{ζ(τ)}I(τ<)+exp{θ0+g(τ;θ1)}I(τ),θ=(θ0,θ1T)T, (6)

where ζ(τ) is a function of τ; θ0 and θ1 are real‐ and vector‐valued parameters, respectively; and g(u;θ1) is a real‐valued function of such that g(0;θ1)=0 for all θ1 and g(u;0)=0. For example, taking g(u;θ1)=θ1u yields Rb(τ;θ)=exp{θ0+θ1(τ)}, τ, in which case θ1=0 implies that VE(τ)=1Rb(τ), τ, does not change with time since vaccination, and θ1>0 indicates that VE(τ) decreases with increasing τ; that is, exhibits waning. More complex specifications of g(u;θ1) using splines (e.g., Fintzi and Follmann, 2021) or piecewise constant functions could be made; for example, for v1<v2L,

g(u;θ1)=θ11I(v1<uv2)+θ12I(u>v2),θ1=(θ11,θ12)T. (7)

Because interest focuses only on τ, we leave ζ(τ) unspecified.

Under this model, (5) and (4) can be written as

I1b(t,τ)=I0b(t)exp{ζ(τ)}I(τ<)+exp{θ0+g(τ;θ1)}I(τ),I01u(t,τ)=I01u(t)exp{ζ(τ)},τ<,I1u(t,τ)=I1u(t)exp{g(τ;θ1)},τ. (8)

Thus, to estimate VE(τ) for any τ and make inference on potential waning of VE, we develop a principled approach to estimation of θ based on the data from the modified trial in which participants are unblinded and those on placebo may cross over to study vaccine.

4. STATISTICAL FRAMEWORK

4.1. Motivation

Estimation of VE(τ), equivalently Rb(τ), would be straightforward for any τ over the entire follow‐up period if all participants remained blinded and on their assigned treatments throughout the trial. However, subjects randomized to placebo, when unblinded, have the option to receive the study vaccine on or after TP. For τ<TP, it is possible to estimate Rb(τ) because, due to randomization, for t<TP we have representative samples of blinded subjects on vaccine and placebo and thus information on I1b(t,τ) and I0b(t), so can estimate θ0 and components of θ1 identified for such τ; for example, in (7) depending on the values of v 1 and v 2. At TPt<TC, the data comprise a mixture of blinded and unblinded participants, where, within the latter group, those on placebo may have opted to receive study vaccine or refuse. Here, information, albeit diminishing during [TP,TC), on I1b(t,τ) and I0b(t) is available from participants not yet unblinded, which contributes to estimation of θ0 and components of θ1. Information is also available on I1u(t,τ) from individuals who were originally randomized to vaccine and provide information on longer τ and from individuals who recently crossed over to study vaccine and provide information on shorter τ. For tTC, there are no longer blinded subjects, so that information is available only on I1u(t,τ). For these latter groups, for longer τ1 and shorter τ2, I1u(t,τ1)/I1u(t,τ2)=exp[g{τ1;θ1}g{τ2;θ1}], and, because of the mixture of times since vaccination, θ1 can be fully estimated.

Through the following potential outcomes formulation and under suitable assumptions, in the next several sections, we develop an approach to estimation of θ based on the observed data (1) that embodies the foregoing intuitive principles.

4.2. Potential outcomes formulation

Denote by T0(e,r) the potential time to infection on the scale of patient time for an arbitrary individual in the study population if she/he were to enter the trial at calendar time e, receive placebo and be blinded to that fact, and, if not infected by calendar time r, be unblinded and cross over to study vaccine at r. Let T0(e)=T0(e,), if she/he is never crossed over to receive vaccine. Similarly, define T1(e,r) to be the potential time to infection (patient time scale) for an arbitrary individual if she/he were to enter the trial at e, receive vaccine and be blinded to that fact, and, if not infected by r, be unblinded at r; and define T1(e)=T1(e,). We make the consistency assumptions that T0(e,r)=T0(e) if T0(e)<r and T1(e,r)=T1(e) if T1(e)<r. For a=0,1, denote the hazard at calendar time t, t>e, by

λa(t,e,r)=limdt0dt1pr{tTa(e,r)+e<t+dt|Ta(e,r)+et},a=0,1, (9)

where the addition of e induces a shift from patient to calendar time. Denote the set of all potential outcomes as W={T0(e,r),T1(e,r);e>0,r>e}.

The development in Section 3 is in terms of infection rates at the individual‐specific and population levels. Population‐level hazard rates such as (9) are not equivalent to population‐level infection rates. However, we argue in Web Appendix C of the Supporting Information that, because the probabilities of infection under vaccine and placebo during the course of the trial are small, population‐level hazard rates and population‐level infection rates are approximately equivalent; this assumption is implicit in the standard primary analysis noted in Section 1. Thus, to reflect this, we use familiar notation and write λb(t)=I0b(t), λu(t)=I01u(t), and λu(t)=I1u(t). Under these conditions, using (8), we can write for t>e

λ0(t,e,r)=λb(t)I(t<r)+λu(t)exp{ζ(tr)}×I(0tr<)+λu(t)exp{g(tr;θ1)}I(tr), (10)
λ1(t,e,r)=λb(t)exp{ζ(te)}I(te<)+exp{θ0+g(te;θ1)}I(te)I(t<r)+λu(t)exp{g(te;θ1)}I(tr), (11)

where (11) follows because rTP, eTA, TPTA>. Define the counting processes for infection by Na(t,e,r)=I{Ta(e,r)+et} and Na(t,e)=Na(t,e,), and the at‐risk processes by Ya(t,e,r)=I{Ta(e,r)+et} and Ya(t,e)=Ya(t,e,), a=0,1 (Fleming and Harrington, 2005). From the above consistency assumptions, if t<r, then Na(t,e,r)=Na(t,e), Ya(t,e,r)=Ya(t,e), a=0,1. For a=0,1, let Λa(t,e,r)=0tλa(u,e,r)du be the cumulative hazard. Because E{dNa(t,e,r)|Ya(t,e,r)}=dΛa(t,e,r)Ya(t,e,r), a=0,1, it follows that {dNa(t,e,r)dΛa(t,e,r)Ya(t,e,r)}, a=0,1, are mean‐zero counting process increments. Thus, any linear combination of these increments over t,e,r can be used to define unbiased estimating functions in W of quantities of interest. In Web Appendix D of the Supporting Information, we formulate a particular set of estimating functions that, based on iid potential outcomes Wi, i=1,,n, lead to consistent and asymptotically normal estimators for {Λb(t),Λu(t),θT}T, Λk(t)=0tλk(u)du, k=b,u. Because interest focuses on VE(τ) for τ, estimation of Λu(t)=0tλu(u)du and ζ(·) is not considered and is reflected in the specification of the linear combinations; see Web Appendix D.

For fixed t, 0tL, the estimating functions for Λb(t) and Λu(t) are, respectively,

EΛb{W;Λb(t),θ}=I(t<TC)0min(t,TA){dN0(t,e)dΛb(t)Y0(t,e)}w0(t,e)de+I(t)0min(t,TA)dN1(t,e)dΛb(t)exp{θ0+g(te;θ1)×I(te)}Y1(t,e)w1(t,e)de, (12)
EΛu{W;Λu(t),θ}=I(tTP+)(0TATPmin(t,TC)[dN0(t,e,r)dΛu(t)exp{g(tr;θ1)I(tr)}Y0(t,e,r)]×w0(t,e,r)drde)+I(tTP)(0TATPmin(t,TC)[dN1(t,e,r)dΛu(t)exp{g(te;θ1)}×Y1(t,e,r)]w1(t,e,r)I(tr)drde), (13)

where wa(t,e) and wa(t,e,r), a=0,1, are arbitrary nonnegative weight functions, specification of which is discussed later. The estimating function for θ is given by

Eθ{W;Λb(·),Λu(·),θ}=TC0min(t,TA)(1gθ(te))×[dN1(t,e)dΛb(t)exp{θ0+g(te;θ1)×I(te)}Y1(t,e)]w1(t,e)de+TP+L0TATPmin(t,TC)(0gθ(tr))×[dN0(t,e,r)dΛu(t)exp{g(tr;θ1)×I(tr)}Y0(t,e,r)]w0(t,e,r)drde+TPL0TATPmin(t,TC)(0gθ(te))×[dN1(t,e,r)dΛu(t)exp{g(te;θ1)}×Y1(t,e,r)]w1(t,e,r)I(tr)drde, (14)

where gθ(u)=/θ1{g(u;θ1)}. Analogous to Yang et al. (2018), envisioning (12)–(14) as characterizing a system of estimating functions E{W;Λb(·),Λu(·),θ}=[EΛb{W;Λb(t),θ},EΛu{W;Λu(t),θ},0tL,Eθ{W;Λb(·),Λu(·),θ}T]T, if we could observe Wi, i=1,n, we would estimate dΛb(·),dΛu(·),θ by solving the estimating equations i=1nE(Wi;Λb(·),Λu(·),θ)}=0.

4.3. Identifiability assumptions

Of course, the potential outcomes Wi, i=1,,n, are not observed. However, we now present assumptions under which we can exploit the developments in the last section to derive estimating equations yielding estimators based on the observed data (1).

Define the indicator that a participant is observed to be infected at time t by dN(t)=I(U=t,Δ=1), the observed at‐risk indicator at t by Y(t)=I(E<tU), and

I0(t,e)=(1A)I(E=e)I(Rt),I1(t,e)=AI(E=e)I(Rt),I01(t,e,r)=(1A)I(E=e){I(R=r,Γ=1,Ψ=1)+I(R=r,Γ=2,Ψ=1)},I11(t,e,r)=AI(E=e){I(R=r,Γ=1)+I(R=r,Γ=2)}. (15)

Ia(t,e)=1 indicates that a subject entering the trial at time e and randomized to placebo (a=0) or vaccine (a=1) has not yet been infected or unblinded by t. For t>r, I01(t,e,r)=1 indicates that a subject randomized to placebo at time e is unblinded (either by request or at a PDCV) at time r and crosses over to study vaccine at r, and I11(t,e,r)=1 if a subject randomized to vaccine at time e is unblinded at r. Make the consistency assumptions

Ia(t,e)dN(t)=Ia(t,e)dNa(t,e),Ia(t,e)Y(t)=Ia(t,e)Ya(t,e),a=0,1,I01(t,e,r)dN(t)=I01(t,e,r)dN0(t,e,r),I01(t,e,r)Y(t)=I01(t,e,r)Y0(t,e,r),I11(t,e,r)dN(t)=I11(t,e,r)dN1(t,e,r),I11(t,e,r)Y(t)=I11(t,e,r)Y1(t,e,r). (16)

We now make assumptions similar in spirit to those adopted in observational studies. By randomization,

A(X,E,W), (17)

where we subsume the site indicator S in X, and let pA=pr(A=1). It is realistic to assume that the mix of baseline covariates changes over the accrual period; for example, during the trial, because of lagging accrual of elderly subjects and subjects from underrepresented groups, an effort was made to increase participation of these groups in the latter part of the accrual period. Accordingly, we allow the distribution of entry time E to depend on X, and denote its conditional density as fE|X(e|x). We make the no unmeasured confounders assumption

EW|X. (18)

Define the hazard functions of unblinding in the periods between the Pfizer EUA and the start of PDCVs and after the start of PDCVs, respectively, as

λR,1(r|X,A,E,W)=limdr0pr(rR<r+dr,Γ=1|Rr,X,A,E,W),TPr<TU,λR,2(r|X,A,E,W)=limdr0pr(rR<r+dr,Γ=2|Rr,X,A,E,W),TUr<TC,

where λR,j(r|X,A,E,W)=0 for rTU (j=1) and rTC (j=2). Because the accrual period was short relative to the length of follow‐up, we take these unblinding hazard functions to not depend on E, although including such dependence is straightforward; and, similar to a noninformative censoring assumption, to not depend on W and write

λR,j(r|X,A,E,W)=λR,j(r|X,A),j=1,2. (19)

Define KR(r|X, A)=exp[{ΛR,1(r|X,A)+ΛR,2(r|X, A)}], ΛR,j(r|X, A)=TjrλR,j(u|X, A)du, Tj=TP (j=1), or Tj=TU (j=2). Because λR,1(r|X,A) and λR,2(r|X,A) are defined on the nonoverlapping intervals [TP,TU) and [TU,TC), respectively, with KR,j(r|X,A)=exp{ΛR,j(r|X,A)}, j=1,2,

KR(r|X,A)=1,r<TP,=KR,1(r|X,A),TPr<TU,=KR,1(TU|X,A)KR,2(r|X,A),TUr<TC,=0,rTC.

Finally, define fR,j(r|X,A)=KR(r|X,A)λR,j(r|X,A), j=1,2.

Let pr(Ψ=1|X,E,Γ,R,W) be the probability that a placebo participant unblinded at R agrees to receive the Moderna vaccine. Similar to (19), we assume that this probability does not depend on E,W; moreover, because the unblinding interval [TP,TC) is short relative to the length of follow‐up, we assume that it does not depend on R but does depend on the unblinding dynamics at R. Thus, write

pr(Ψ=1|X,E,Γ,R,W)=pr(Ψ=1|X,Γ)=pΨ(X,Γ). (20)

4.4. Observed data estimating equations

We now outline, under the assumptions (16)–(20), which we take to hold henceforth, how we can develop unbiased estimating equations based on the observed data yielding consistent and asymptotically normal estimators for dΛb(·),dΛu(·),θ. The basic premise is to use inverse probability weighting (IPW) to probabilistically represent potential outcomes in terms of the observed data to mimic the estimating functions (12)–(14).

Considering (15), define the inverse probability weights

h0(t,e|X)=(1pA)fE|X(e|X)KR(t|X,A=0),h1(t,e|X)=pAfE|X(e|X)KR(t|X,A=1),h01(e,r|X)=(1pA)fE|X(e|X)×{fR,1(r|X,A=0)pΨ(X,Γ=1)+fR,2(r|X,A=0)pΨ(X,Γ=2)},h11(e,r|X)=pAfE|X(e|X){fR,1(r|X,A=1)+fR,2(r|X,A=1)}.

We show in Web Appendix E of the Supporting Information that

EI0(t,e)dN(t)h0(t,e|X)|X,W=dN0(t,e),EI0(t,e)Y(t)h0(t,e|X)|X,W=Y0(t,e), (21)
EI1(t,e)dN(t)h1(t,e|X)|X,W=dN1(t,e),EI1(t,e)Y(t)h1(t,e|X)|X,W=Y1(t,e), (22)
EI01(t,e,r)dN(t)h01(e,r|X)|X,W=dN0(t,e,r),EI01(t,e,r)Y(t)h01(e,r|X)|X,W=Y0(t,e,r), (23)
EI11(t,e,r)dN(t)h11(e,r|X)|X,W=dN1(t,e,r),EI11(t,e,r)Y(t)h11(e,r|X)|X,W=Y1(t,e,r). (24)

To obtain observed data analogs to the estimating functions (12)–(14), based on the equalities in (21)–(24), we substitute the IPW expressions in the conditional expectations on the left‐hand sides. Using (15) and (21)–(22), the analog to (12) is given by

EΛb{O;Λb(t),θ}=I(t<TC)(0min(t,TA)I0(t,e)h0(t,e|X){dN(t)dΛb(t)Y(t)}w0(t,e)de+I(t)0min(t,TA)I1(t,e)h1(t,e|X)×[dN(t)dΛb(t)exp{θ0+g(te;θ1)×I(te)}Y(t)]w1(t,e)de)=I(t<TC)×((1A)I(Rt)h0(t,E|X){dN(t)dΛb(t)Y(t)}w0(t,E)+AI(E+tR)h1(t,E|X)[dN(t)dΛb(t)×exp{θ0+g(tE;θ1)}Y(t)]w1(t,E)). (25)

Likewise, using (23)–(24), the analog to (13) is

EΛu{O;Λu(t),θ}=I(tTP+)(0TATPmin(t,TC)I01(t,e,r)h01(e,r|X)×[dN(t)dΛu(t)exp{g(tr;θ1)I(tr)}Y(t)]×w0(t,e,r)drde)+I(tTP)(0TATPmin(t,TC)I11(t,e,r)h11(e,r|X)[dN(t)dΛu(t)exp{g(te;θ1)}Y(t)]×w1(t,e,r)I(tr)drde)=I(tTP+)×((1A)I(tR){I(Γ=1,Ψ=1)+I(Γ=2,Ψ=1)}h01(E,R|X)×[dN(t)dΛu(t)exp{g(tR;θ1)}Y(t)]w0(t,E,R))+I(tTP)(AI(t>R){I(Γ=1)+I(Γ=2)}h11(E,R|X)×[dN(t)dΛu(t)exp{g(tE;θ1)}Y(t)]w1(t,E,R)). (26)

A entirely similar representation Eθ{O;Λb(·)Λu(·),θ} of (14) in terms of the observed data can be deduced and is suppressed for brevity.

To simplify notation, based on (25), (26), and the analogous expression for (14), define

dNb(t)=dN(t)(1A)I(Rt)w0(t,E)h0(t,E|X)+AI(E+tR)w1(t,E)h1(t,E|X),Yb(t)=Y(t)(1A)I(Rt)w0(t,E)h0(t,E|X)+AI(E+tR)w1(t,E)h1(t,E|X)×exp{θ0+g(tE;θ1)}],dNu(t)=dN(t)×(1A)I(tR){I(Γ=1,Ψ=1)+I(Γ=2,Ψ=1)}w0(t,E,R)h01(E,R|X)+AI(t>R){I(Γ=1)+I(Γ=2)}w1(t,E,R)h11(E,R|X),Yu(t)=Y(t)×(1A)I(tR){I(Γ=1,Ψ=1)+I(Γ=2,Ψ=1)}w0(t,E,R)h01(E,R|X)×exp{g(tR;θ1)+AI(t>R){I(Γ=1)+I(Γ=2)}w1(t,E,R)h11(E,R|X)exp{g(tE;θ1).

It is important to recognize that these expressions are equal to zero for an unblinded placebo participant who is at risk at time t, Y(t)=1, and who has refused study vaccine, Ψ=0, which is equivalent to censoring such a subject, as she/he cannot provide information on study vaccine after t. These expressions are also equal to zero at time t for an at‐risk unblinded placebo participant who receives study vaccine but has been vaccinated for less than ℓ weeks at t and for an at‐risk blinded vaccine participant vaccinated for less than ℓ weeks at t, reflecting the fact that such individuals do not contribute information on VE for τ until times t at which they have reached full efficacy, in which case the expressions are ≥0. Moreover, by excluding the at‐risk unblinded placebo participants vaccinated for less than ℓ weeks at t, the behavior reflected by c01u(t) does not play a role.

Define also

Zb(t)=A1gθ(tE),Zu(t)=A0gθ(tE)+(1A)0gθ(tR).

Then, it is straightforward that the observed‐data estimating functions are

EΛb{O;Λb(t),θ}=dNb(t)dΛb(t)Yb(t),EΛu{O;Λu(t),θ}=dNu(t)dΛu(t)Yu(t),Eθ{O;Λb(·)Λu(·),θ}=0TCZb(t){dNb(t)dΛb(t)Yb(t)}+TPLZu(t){dNu(t)dΛu(t)Yu(t)}.

Letting Nib(t), Niu(t), Yib(t), Yiu(t), Zib(t), and Ziu(t) denote evaluation at Oi in (1), the foregoing developments lead to the set of observed‐data estimating equations

i=1n{dNib(t)dΛb(t)Yib(t)=0,i=1n{dNiu(t)dΛu(t)Yiu(t)=0, (27)
i=1n0TCZib(t){dNib(t)dΛb(t)Yib(t)}+TPLZiu(t){dNiu(t)dΛu(t)Yiu(t)}0TC=0. (28)

For fixed θ, the estimators for dΛb(t) and dΛu(t) are the solutions to the equations in (27) given by

dΛ^b(t)=i=1nYib(t)1i=1ndNib(t),dΛ^u(t)=i=1nYiu(t)1i=1ndNiu(t). (29)

Substituting these expressions in (28) yields the estimating equation in θ given by

i=1n0TC{Zib(t)Z¯b(t)}dNib(t)+TPL{Ziu(t)Z¯u(t)}dNiu(t)=0, (30)
Z¯b(t)=i=1nYib(t)1i=1nZib(t)Yib(t),Z¯u(t)=i=1nYiu(t)1i=1nZiu(t)Yiu(t),

which can be solved in θ to yield the estimator θ^.

5. PRACTICAL IMPLEMENTATION AND INFERENCE

Choice of the weight functions w0(t,e), w1(t,e), w0(t,e,r), and w1(t,e,r) is arbitrary but can play an important role in the performance of the resulting estimators. We recommend taking a fixed value x of X, for example, the sample mean, and setting wa(t,e)=ha(t,e|x) and wa(t,e,r)=ha1(e,r|x), a=0,1, where the latter does not depend on t. The resulting weights ha(t,e|x)/ha(t,e|X) and ha1(e,r|x)/ha1(e,r|X), a=0,1, are referred to as stabilized weights (Robins et al., 2000), as they mitigate the effect of small inverse probability weights that can give undue influence to a few observations. Note that dependence of the inverse probability weights on pA cancels in construction of stabilized weights. Moreover, if there is no confounding, in that λR,j(r|X,A), j=1,2 in (19), fE|X(e|X), and pΨ(X,Γ) do not depend on X, the stabilized weights are equal to 1. Interpretation of the stabilized weights is discussed further in Web Appendix F.

If the “survival probabilities” for R, KR,j(r|X,A), and the densities fR,j(r|X,A), j=1,2, and fE|X(e|X) in the inverse probability weights, which appear in the expressions in the estimating equation (30), were known, (30) could be solved to yield an estimator for θ and in particular θ1 characterizing VE waning. As these quantities are unknown, models must be posited for them, leading to estimators that can be substituted in (30). We propose the use of Cox proportional hazards models for λR,j(r|X,A), j=1,2, in (19), which can be fitted using the data {Xi,Ai,Ri,I(Γi=j)}, i=1,n; and for the hazard of entry time E given X, which can be fitted using (Ei,Xi), i=1,,n. A binary, for example, logistic, regression model can be used to represent pΨ(X,Γ) and fitted using (Xi,Γi,Ψi) for i such that Ai=0.

For individual i, the stabilized weights involve the quantities fR,j(Ri|x,a)/fR,j(Ri|Xi,a), j=1,2, a=0,1, and fE|X(Ei|x)/fE|X(Ei|Xi). With proportional hazards models as above with predictors ϕj(X,βj), say, it is straightforward that fR,j(Ri|x,a)/fR,j(Ri|Xi,a)=[exp{ϕj(x,βj)}KR(Ri|x,a)]/[exp{ϕj(Xi,βj)}KR(Ri|Xi,a)], where the baseline hazard cancels from numerator and denominator, and similarly for fE|X(Ei|x)/fE|X(Ei|Xi). Thus, the estimated stabilized weights involve only the estimated cumulative hazard functions and estimators for the βj, each of which is root‐n consistent and asymptotically normal.

As sketched in Web Appendix G, with stabilized weights set equal to one or estimated, (30) can be solved easily in θ via a Newton–Raphson algorithm. A heuristic argument demonstrating that θ^ is asymptotically normal leading to an expression for its approximate sampling variance using the sandwich technique is given in Web Appendix G.

6. SIMULATIONS

We report on simulation studies demonstrating performance of the methods, each involving 1000 Monte Carlo replications, based roughly on the Moderna trial. We took pA=0.5 and TA=12, TP=19, TU=21, and TC=31, where all times are in weeks, and consider an analysis at calendar time L=52 weeks, with n= 30,000. In all cases, g(u,θ1)=θ1I(u>v) where v=20 weeks and θ0=log(0.05), corresponding to VE = 95% prior to time v, so that, depending on θ1, VE potentially wanes following v. We consider θ1=log(7), corresponding to VE = 65% after time v, and θ1=0, corresponding to no waning.

Because the trial and unblinding process are ongoing, we were not able to base our generative scenarios on data from the trial. Owing to the complexity of the trial and multiple potential sources of confounding, to facilitate exploration of a range of conditions while controlling computational complexity and intensity, we focused on several basic scenarios meant to represent varying degrees of confounding consistent with our expectations for the most likely sources of such confounding in the trial. Specifically, we took fE|X(e|X) and λR,2(r|X,A) to not depend on X (or A in the latter case) in any scenario, reflecting mostly random entry and PDCV unblinding processes. In scenarios involving confounding, we took λR,1(r|X,A), corresponding to the period [TP,TU) in which “requested unblinding” occurred, and the “agreement process” pΨ(X,Γ) to depend on X, as described below, reflecting our belief that these processes could be associated with participant characteristics.

In the first set of simulations, we consider two cases: (i) no confounding, where all of λR,j(r|X,A), j=1,2, fE|X(e|X), and pΨ(X, Γ) do not depend on X; and (ii) confounding, where λR,1(r|X,A) and pΨ(X, Γ) depend on X as above. In both (i) and (ii), the entry process EU(0,TA), that is, uniform on [0,TA], and the unblinding process during PDCVs was U(TU,TC); see below. In each simulation experiment, for each participant in each Monte Carlo data set, we first generated ABernoulli(pA), two baseline covariates X1Bernoulli(pX1=0.5) and X2N(μX2=45, σX22=102), and E as above. To obtain R, we generated G 1 to be exponential with hazard λR,1(r|X, A)=exp[β10+{β11(X1pX1)+β12(X2μX2)}(1A)+{β13(X1pX1)+β14(X2μX2)}A], where β10=log(0.036), corresponding to roughly 7% unblinding during [TP, TU), and (β11, β12, β13, β14)=(0,0,0,0,0) for (i), no confounding, and (0.8,0.08,0.8,0.08) for (ii), confounding. With R1=TP+G1 and R2U(TU,TC), we let Γ=1+I(R1TU) and R=R1I(Γ=1)+R2I(Γ=2). We generated Ψ as Bernoulli{pΨ(X, Γ)}, pΨ(X, Γ)= expit {γ0+γ1(X1pX1)+γ2(X2μX2)+γ3Γ}, expit (u)=(1+eu)1, where γ0=1.4, corresponding to approximately 80% agreement to receive the study vaccine by unblinded placebo participants, and (γ1, γ2, γ3)=(0,0,0.1) for (i) and =(0.8,0.08,0.1) for (ii).

To generate U,Δ, we first generated T0(E,R) and T1(E,R) based on (10)–(11), with λb(t)=λb=exp{δ0+δ1(X1pX1)+δ2(X2μX2)+Z}, where (δ0,δ1,δ2)={log(0.0006),0.4,0.04}, leading to approximately a 3% infection rate for placebo participants over L, and ZN(0,0.04); λu(t)=λu=λb; ζ(t)=0; and λu(t)=λu=1.25λb, so that λa(t,e,r) in (10)–(11), a=0,1, are piecewise constant hazards. T0(E,R) and T1(E,R) were obtained via inverse transform sampling. We then generated U (calendar time) as U=E+AT1(E,R)+(1A)[I{T0(E,R)<R}T0(E,R)+I{T0(E,R)R}{ΨT0(E,R)+(1Ψ)Tr}, where Tr=R+G2 for G 2 exponential with hazard λb; infection times for unblinded placebo participants who decline vaccine are not used in the analysis. Finally, we set Δ=I(U<L), and defined R=UI(UR)+RI(U>R) and Γ=ΓI(U>R). Although we obtained Ψ for all n participants, Ψ is used only when A=0, Γ1.

For each combination of (i) and (ii) and (a) θ1=log(7) and (b) θ1=0, we estimated θ and thus VE(τ) for τv and τ>v two ways: taking the stabilized weights equal to 1, so disregarding possible confounding, and with estimated stabilized weights. The latter were obtained by fitting proportional hazards models for entry time E with linear predictor ν1X1+ν2X2 and for λR,j(r|X,A), j=1,2, with linear predictors β11X1+β12X2+β13A+β14X1A+β15X2A and β21X1+β22X2, respectively; and a logistic regression model for pΨ(X,Γ)=expit{(γ10+γ11X1+γ12X2)I(Γ=1)+(γ20+γ21X1+γ22X2)I(Γ=2)}.

Table 2 presents the results for estimation of θ1, dictating waning; VE20=1exp(θ0), VE prior to v=20 weeks; and VE>20=1exp(θ0+θ1), VE after v=20 weeks. Because the Monte Carlo distribution of some of these quantities exhibited slight skewness, those for the VE quantities likely due to the exponentiation, we report both Monte Carlo mean and median. Estimation of VE20 shows virtually no bias for both (a) and (b); that for VE>20 in case (a) shows minimal bias and virtually none for (b). In all cases, standard errors obtained via the sandwich technique as outlined in Web Appendix G along with the delta method for the VEs track the Monte Carlo standard deviations. Under both (i) no confounding and (ii) confounding, estimation of the stabilized weights appears to have little consequence for precision of the estimators relative to setting them to equal to 1. Wald 95% confidence intervals, exponentiated for the VEs, achieve nominal coverage. For (b) and each combination of stabilized weights set equal to 1 or estimated and (i), no confounding, and (ii), confounding, we also calculated the empirical Type I error achieved by a Wald test at level of significance 0.05 for VE waning addressing the null and alternative hypotheses H0:θ10 versus H1:θ1>0. These values are 0.04 and 0.06 when using stabilized weights set equal to 1 under (i) and (ii), respectively; the analogous values with estimated weights are 0.05 and 0.05 under (i) and (ii).

TABLE 2.

Simulation results based on 1000 Monte Carlo replications, first scenario. Mean = mean of Monte Carlo estimates, Med = median of Monte Carlo estimates, SD = standard deviation of Monte Carlo estimates, SE = average of standard errors obtained via the sandwich technique/delta method, Cov = empirical coverage of nominal 95% Wald confidence interval (transformed for VE). VE20=1exp(θ0), VE prior to v=20 weeks; VE>20=1exp(θ0+θ1), VE after v=20 weeks. True values: (a) θ1=log(7)=1.946, VE20=0.95, VE>20=0.65; (b) θ1=0, VE20=VE>20=0.95

Stabilized weights = 1 Stabilized weights estimated
Mean Med SD SE Cov Mean Med SD SE Cov
(i), no confounding; (a) θ1=log(7)
θ1 1.961 1.935 0.310 0.308 0.95 1.983 1.959 0.303 0.310 0.96
VE20
0.950 0.953 0.019 0.019 0.95 0.950 0.952 0.019 0.019 0.95
VE>20
0.634 0.663 0.183 0.174 0.96 0.626 0.662 0.188 0.177 0.96
(ii), confounding; (a) θ1=log(7)
θ1 2.030 2.013 0.325 0.320 0.95 1.990 1.973 0.346 0.335 0.95
VE20
0.951 0.953 0.019 0.018 0.96 0.951 0.952 0.019 0.019 0.95
VE>20
0.614 0.647 0.199 0.185 0.95 0.619 0.665 0.201 0.186 0.94
(i), no confounding; (b) θ1=0
θ1 −0.020 −0.019 0.433 0.422 0.95 0.007 0.019 0.421 0.424 0.96
VE20
0.950 0.952 0.020 0.019 0.95 0.950 0.952 0.020 0.019 0.96
VE>20
0.947 0.954 0.032 0.030 0.96 0.946 0.953 0.033 0.031 0.95
(ii), confounding; (b) θ1=0
θ1 0.053 0.045 0.446 0.436 0.95 0.011 −0.004 0.452 0.450 0.96
VE20
0.951 0.952 0.019 0.019 0.96 0.950 0.952 0.020 0.019 0.95
VE>20
0.944 0.951 0.035 0.032 0.96 0.945 0.954 0.036 0.033 0.95

In the first set of simulations, the confounding induced by our generative choices led to little to no bias in the estimators for θ1 and the VEs prior to and after 20 weeks. Notably, modeling and fitting of the stabilized weights to adjust for potential confounding shows little effect relative to setting the stabilized weights to 1. To the extent that this scenario is a plausible approximation to actual conditions of the trial, it may be that confounding will not be a serious challenge for the analysis of VE waning.

To examine the ability of the methods with estimated stabilized weights to adjust for confounding that potentially could be sufficiently strong to bias results, we carried out additional simulations under settings (a) θ1=log(7) and (b) θ1=0 with (ii) confounding in which our choices of generative parameters induce a stronger association between the potential infection times and the agreement process. Specifically, we took instead (δ0,δ1,δ2)={log(0.0006),0.7,0.07} and (γ0,γ1,γ2,γ3)=(1.4,1.0,0.1,0.1), with all other settings identical to those above.

Table 3 shows the results. The estimators for θ1 and VE>20 are slightly biased when stabilized weights are set equal to 1, although coverage probability for the latter is at the nominal level. This feature is mitigated by use of estimated stabilized weights. Coverage probability for θ1 is somewhat lower than nominal. Under (b), empirical Type I error achieved by a Wald test at level of significance 0.05 of H0:θ10 versus H1:θ1>0 is 0.12 when stabilized weights are equal to 1, demonstrating the potential for biased inference; Type I error is 0.06 using estimated stabilized weights, leading to a more reliable test.

TABLE 3.

Simulation results based on 1000 Monte Carlo replications, second scenario. Entries are as in Table 2. True values: (a) θ1=log(7)=1.946, VE20=0.95, VE>20=0.65; (b) θ1=0, VE20=VE>20=0.95

Stabilized weights = 1 Stabilized weights estimated
Mean Med SD SE Cov Mean Med SD SE Cov
(ii), confounding; (a) θ1=log(7)
θ1 2.125 2.100 0.315 0.299 0.93 2.009 2.008 0.346 0.325 0.94
VE20
0.952 0.953 0.017 0.016 0.97 0.950 0.952 0.017 0.017 0.96
VE>20
0.581 0.611 0.191 0.182 0.95 0.613 0.640 0.179 0.175 0.96
(ii), confounding; (b) θ1=0
θ1 0.171 0.149 0.436 0.403 0.92 0.050 0.053 0.447 0.426 0.95
VE20
0.951 0.953 0.017 0.017 0.97 0.950 0.952 0.018 0.017 0.96
VE>20
0.937 0.945 0.038 0.034 0.95 0.942 0.949 0.034 0.032 0.95

Overall, we speculate that, because the Moderna study is a randomized, double‐blind trial, the unblinding period is finite and eventually all participants are unblinded, the refusal rate is likely to be low, and infection rates are low, confounding may not lead to substantial bias in estimation of VE waning.

7. DISCUSSION

We have proposed a conceptual framework based on potential outcomes for study of VE in which assumptions on biological, behavioral, and other phenomena are made transparent. The methods provide a mechanism to account for possible confounding. The corresponding statistical framework combines information from blinded and unblinded participants over time. We focus on the setting of the Moderna phase 3 trial, but the principles can be adapted to other settings, including the blinded crossover design of Follmann et al. (2020), and apply to ongoing and future vaccine trials in which unblinding may well occur throughout and some participants may refuse the study vaccine in favor of already‐licensed products. Extension of the methods to the setting where time between doses varies across vaccinated participants due to either deviations from the protocol or by design would require modification of the framework presented here to represent VE as a function of both time between doses and time since vaccination.

Our approach and those of Lin et al. (2021) (LZG), Follmann et al. (2020), and Fintzi and Follmann (2021) for estimation of VE waning use a calendar time formulation and Cox hazard models. As do we, Follmann et al. (2020) and Fintzi and Follmann (2021) include data from placebo participants who cross over to study vaccine, whereas LZG censor such subjects and propose a sensitivity analysis. Because the approach of Follmann et al. (2020) and Fintzi and Follmann (2021) is based on a randomized crossover design that maintains the blind, confounding is not addressed, while LZG adjust for confounding via regression modeling. In our methodology, confounding is addressed through a potential outcomes formulation and IPW. As we do, Follmann et al. (2020) and Fintzi and Follmann (2021) represent VE(τ) using parametric or flexible spline models; LZG model VE(τ) nonparametrically.

Through condition (ii) in Section 3, (ii) E{π1(t,τ)|X}/E{π0(t)|X}=q(τ), the methods embed the assumption that VE is similar across current and emerging viral variants. If the analyst is unwilling to adopt an assumption like condition (ii), then it is not possible to rule out that the data from the blinded (prior to TP) and unblinded (starting at TP) phases of the trial reflect very different variant mixtures. In this case, calendar time and time since vaccination cannot be disentangled, and thus, it is not possible to evaluate VE solely as a function of time since vaccination. However, it may be possible to evaluate the ratio of infection rates under vaccine at any time t (and thus variant mixture in force at t) after different times since vaccination τ1 and τ2, say, during the unblinded phase of the trial, namely, I1u(t,τ1)/I1u(t,τ2), tTP. The infection rates can be estimated based on the infection status data at time t from vaccinated individuals who received vaccine at times tτ1 and tτ2, respectively. These infection rates and their ratio will reflect information about the waning of the vaccine itself under the conditions at time t, and, in fact, this infection rate ratio can be viewed as the ratio of vaccine efficacies at τ1 and τ2. However, because after TC information on I0u(t) will no longer be available, it is not possible to deduce VE itself for tTC. But if data external to the trial became available that provide information on VE at t, even for small τ, it may be possible to integrate this information with that from the infection rates to gain insight into VE as a function of τ.

Supporting information

Web Appendices referenced in Sections 3–6 and R code implementing the simulation scenarios in Section 6 are available with this paper at the Biometrics website on Wiley Online Library. An R package, VEwaning, implementing the methodology is available on GitHub at https://github.com/sth1402/VEwaning and at the Comprehensive R Archive Network (CRAN) at https://cran.r‐project.org/web/packages/VEwaning/index.html.

ACKNOWLEDGMENTS

The authors thank Dean Follmann for helpful discussions and insights. The authors are grateful to Shannon Holloway for creating the R package VEWaning, noted in the Supporting Information.

Tsiatis AA, Davidian M. Estimating vaccine efficacy over time after a randomized study is unblinded. Biometrics. 2021;1–14. 10.1111/biom.13509

Contributor Information

Anastasios A. Tsiatis, Email: tsiatis@ncsu.edu

Marie Davidian, Email: davidian@ncsu.edu.

DATA AVAILABILITY STATEMENT

Data sharing is not applicable to this article, as no datasets are generated or analyzed in this paper. The methods developed in the paper are proposed to enable future analyses of data from ongoing vaccine trials from which the required data are not yet fully accrued.

REFERENCES

  1. Baden, L.R. , El Sahly, H.M. , Essink, B. , Kotloff, K. , Frey, S. , Novak, R. et al. for the COVE Study Group (2020) Efficacy and safety of the mRNA‐1273 SARS‐CoV‐2 vaccine. New England Journal of Medicine, 384, 403–416. 10.1056/NEJMoa2035389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. COVID‐19 Vaccine Tracker . https://covid19.trackvaccines.org/. Last updated 11 July 2021.
  3. Fintzi, J. and Follmann, D. (2021) Assessing vaccine durability in randomized trials following placebo crossover. arXiv preprint arXiv:2101.01295v3. [DOI] [PMC free article] [PubMed]
  4. Fleming, T.R. and Harrington, D.P. (2005) Counting processes and survival analysis. New York: Wiley. [Google Scholar]
  5. Follmann, D. , Fintzi, J. , Fay, M.P. , Janes, H.E. , Baden, L. , El Sahly, H. et al. (2020) Assessing durability of vaccine effect following blinded crossover in COVID‐19 vaccine efficacy trials. medRxiv. 2020 Dec 14;2020.12.14.20248137. 10.1101/2020.12.14.20248137. [DOI]
  6. Halloran, M.E. , Longini, I.M. and Struchiner, C.J. (1996) Estimability and interpretation of vaccine efficacy using frailty mixing models. American Journal of Epidemiology, 144, 83–97. [DOI] [PubMed] [Google Scholar]
  7. Lin, D.‐Y. , Zeng, D. and Gilbert, P.B. (2021) Evaluating the long‐term efficacy of COVID‐19 vaccines. Clinical Infectious Diseases, ciab226. 10.1093/cid/ciab226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Longini, I.M. and Halloran, M.E. (1996) Frailty mixture model for estimating vaccine efficacy. Journal of the Royal Statistical Society, Series C, 45, 165–173. [Google Scholar]
  9. Moderna Clinical Study Protocol, Amendment 6, 23 December 2020, available at https://www.modernatx.com/sites/default/files/content_documents/Final%20mRNA‐1273‐P301%20Protocol%20Amendment%206%20‐%2023Dec2020.pdf
  10. Robins, J.M. , Hernán, M.A. and Brumback, B. (2000) Marginal structural models and causal inference in epidemiology. Epidemiology, 11, 550–560. [DOI] [PubMed] [Google Scholar]
  11. Rubin, D.B. (1980) Bias reduction using Mahalanobis‐metric matching. Biometrics, 36, 293–298. [Google Scholar]
  12. Yang, S. , Tsiatis, A.A. and Blazing, M. (2018) Modeling survival distribution as a function of time to treatment discontinuation: A dynamic treatment regime approach. Biometrics, 74, 900–909. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Web Appendices referenced in Sections 3–6 and R code implementing the simulation scenarios in Section 6 are available with this paper at the Biometrics website on Wiley Online Library. An R package, VEwaning, implementing the methodology is available on GitHub at https://github.com/sth1402/VEwaning and at the Comprehensive R Archive Network (CRAN) at https://cran.r‐project.org/web/packages/VEwaning/index.html.

Data Availability Statement

Data sharing is not applicable to this article, as no datasets are generated or analyzed in this paper. The methods developed in the paper are proposed to enable future analyses of data from ongoing vaccine trials from which the required data are not yet fully accrued.


Articles from Biometrics are provided here courtesy of Wiley

RESOURCES