Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

ArXiv logoLink to ArXiv
[Preprint]. 2023 Mar 8:arXiv:2203.12509v4. [Version 4]

Double Negative Control Inference in Test-Negative Design Studies of Vaccine Effectiveness

Kendrick Qijun Li 1, Xu Shi 2, Wang Miao 3, Eric Tchetgen Tchetgen 4
PMCID: PMC8963685  PMID: 35350548

Abstract

The test-negative design (TND) has become a standard approach to evaluate vaccine effectiveness against the risk of acquiring infectious diseases in real-world settings, such as Influenza, Rotavirus, Dengue fever, and more recently COVID-19. In a TND study, individuals who experience symptoms and seek care are recruited and tested for the infectious disease which defines cases and controls. Despite TND’s potential to reduce unobserved differences in healthcare seeking behavior (HSB) between vaccinated and unvaccinated subjects, it remains subject to various potential biases. First, residual confounding bias may remain due to unobserved HSB, occupation as healthcare worker, or previous infection history. Second, because selection into the TND sample is a common consequence of infection and HSB, collider stratification bias may exist when conditioning the analysis on testing, which further induces confounding by latent HSB. In this paper, we present a novel approach to identify and estimate vaccine effectiveness in the target population by carefully leveraging a pair of negative control exposure and outcome variables to account for potential hidden bias in TND studies. We illustrate our proposed method with extensive simulation and an application to study COVID-19 vaccine effectiveness using data from the University of Michigan Health System.

Keywords: Causal inference, proximal causal inference, selection bias, unmeasured confounding

1. Introduction

1.1. Text-negative design studies of vaccine effectiveness

The test-negative design (TND) has become a standard approach to evaluate real-world vaccine effectiveness (VE) against the risk of acquiring infections diseases (Chung et al., 2020; Flannery et al., 2019; Jackson et al., 2017; Rolfes et al., 2019; Tenforde et al., 2021). In an outpatient Influenza VE test-negative design, for example, symptomatic individuals seeking care and meeting eligibility criteria are enrolled and their Influenza virus infection status is subsequently confirmed via a laboratory test. VE against flu infection is then measured by comparing the prevalence of vaccination between the test-positive “cases” and test-negative “controls” (Jackson and Nelson, 2013; Jackson et al., 2017). Besides Influenza, the TND and its variants have also been applied to study VE against pneumococcal disease (Broome, Facklam, and Fraser, 1980), dengue (Anders et al., 2018; Utarini et al., 2021), rotavirus (Boom et al., 2010; Schwartz et al., 2017), and other infectious diseases. Recently, the TND has increasingly been used in post-licensure evaluation of COVID-19 VE (Dagan et al., 2021; Dean, Hogan, and Schnitzer, 2021; Hitchings et al., 2021; Olson et al., 2022; Patel, Jackson, and Ferdinands, 2020; Thompson et al., 2021).

Test-negative designs are believed to reduce unmeasured confounding bias due to healthcare seeking behavior (HSB), whereby care seekers are more likely to be vaccinated, have healthier behaviors that reduces the risk of infection, and get tested when ill (Jackson et al., 2006; Shrank, Patrick, and Brookhart, 2011). By restricting analysis to care seekers who are tested for the infection in view (e.g. Influenza or COVID-19), the vaccinated and unvaccinated are more likely to share similar HSB and underlying health characteristics. Misclassification of infection status is also reduced because the analysis is restricted to tested individuals (Jackson and Nelson, 2013).

Sullivan et al. (2016) used directed acyclic graphs (DAG) to illustrate the rationale behind TND in the context of evaluating VE against Influenza infection, as shown in Figures 1(a) and (b). We denote Influenza vaccination status by A and Influenza infection by Y, so that the arrow AY represents VE against flu infection. Selection into the TND study sample, denoted by S, is triggered by a subject experiencing flu-like symptoms or acute respiratory illness, seeking care at clinics or hospitals, and getting tested for Influenza infection, hence the YS edge. Healthcare seeking behavior, denoted by HSB, may affect S,A, and Y because subjects with certain health seeking proclivities may be more likely to seek care, take annual flu shots, and participate in healthy and preventative behaviors. The above variables are subject to effects of other clinical or demographic factors, such as age, season and high-risk conditions, included in Figure 1 as confounders X. The TND assumes that by restricting recruitment to care seekers, the study subjects have identical healthcare seeking behavior; in other words, conditioning the analysis on S=1 necessarily leads to HSB= 1, which blocks the effects of HSB (Figure 1(b)). The effects of X are further adjusted for by including these factors in a logistic regression model or by inverse probability weighting (Bond, Sullivan, and Cowling, 2016; Thompson et al., 2021).

Figure 1:

Figure 1:

Causal relationships of variables in a test-negative design. Sullivan, Tchetgen Tchetgen, and Cowling, 2016 used (a) to illustrate the causal relationship between variables in a test-negative design in the general population, and used (b) to illustrate the assumption implicit in the common approach to estimate VE from the study data that study subjects have identical healthcare seeking behavior (HSB) (Sullivan, Tchetgen Tchetgen, and Cowling, 2016). (c) shows that if HSB remains partially unobserved, then the backdoor paths AHSBY and AHSBS=1Y indicate unmeasured confounding bias and selection bias, respectively. Other unmeasured confounders, such as occupation as a healthcare worker and previous infection, open additional backdoor paths between A and Y and result in additional confounding bias. (d) shows a simplified DAG from (c) that combines the unmeasured confounders into a single variable U. (e) illustrates our approach to estimate VE leveraging negative control exposure Z and outcome W. Dashed arrows indicate effects that are not required. (f) shows a scenario with the AY arrow where the causal odds ratio can still be identified under additional assumptions.

However, the TND remains subject to potential hidden bias. First, the assumption that all study subjects seeking care are lumped into a single category HSB= 1 may be unrealistic. It may be more realistic that HSB is not a deterministic function of S and remains a source of confounding bias even after conditioning on S. Furthermore, there might be other mismeasured or unmeasured confounders, denoted as U. For example, healthcare workers are at increased risk of flu infection due to higher exposure to flu patients and are more likely to seek care and receive vaccination due to health agency guidelines (Black et al., 2018). Previous flu infection history may also be a source of confounding if it alters the likelihood of vaccination and care seeking, while also providing immunity against circulating strains (Krammer, 2019; Sullivan, Tchetgen Tchetgen, and Cowling, 2016). These unmeasured or mismeasured potential sources of confounding, if not properly accounted for, can result in additional confounding bias, as illustrated in Figure 1(c). Finally, collider stratification bias is likely present due to conditioning on S, which is a common consequence of HSB, other risk factors (X,U), and Influenza infection Y (Lipsitch, Jha, and Simonsen, 2016). That is, conditioning on S unblocks the backdoor path A(X,U,HSB)SY, which would in principle be blocked if study subjects had identical levels of HSB and other risk factors (Sullivan, Tchetgen Tchetgen, and Cowling, 2016).

Accounting for these potential sources of bias is well known to be challenging, and potentially infeasible without additional assumptions or data. This can be seen in Figure 1(d), which is a simplified version of Figure 1(c) where the unmeasured confounders U include individuals’ occupation as a healthcare worker, previous flu infection, HSB, and so on. Figure 1(d) indicates that the unmeasured confounders U induce both confounding bias through the path AUY and collider stratification bias through the path AUSY. In presence of both unmeasured confounding and collider bias, causal bounds may be available (Gabriel, Sachs, and Sjölander, 2020) but likely too wide to be informative; causal identification in TND therefore remains to date an important and outstanding open problem in the causal inference literature which we aim to resolve.

1.2. Negative control methods

In recent years, negative control variables have emerged as powerful tools to detect, reduce and potentially correct for unmeasured confounding bias (Lipsitch, Tchetgen Tchetgen, and Cohen, 2010; Miao, Geng, and Tchetgen Tchetgen, 2018; Shi, Miao, and Tchetgen Tchetgen, 2020). The framework requires that at least one of two types of negative control variables are available which are a priori known to satisfy certain conditions: a negative control exposure (NCE) known to have no direct effect on the primary outcome; or a negative control outcome (NCO), known not to be an effect of the primary exposure. Such negative control variables are only valid and therefore useful to address unmeasured confounding in a given setting to the extent that they are subject to the same source of confounding as the exposure-outcome relationship of primary interest. Thus, the observed association between a valid NCE and the primary outcome (conditional on the primary treatment and observed covariates) or that between a valid NCO and the primary exposure can indicate the presence of residual confounding bias. For example, in a cohort study to investigate flu VE against hospitalization and death among seniors, to detect the presence of confounding bias due to underlying health characteristics, Jackson et al. (2006) used hospitalization/death before and after the flu season as NCOs and found that the association between flu vaccination and hospitalization was virtually the same before and during the flu season, suggesting that the lower hospitalization rate observed among vaccinated seniors versus unvaccinated seniors was partially due to healthy user bias.

Recently, new causal methods have been developed to not only detect residual confounding when present, but also to potentially de-bias an observational estimate of a treatment causal effect in the presence of unmeasured confounders when both an NCE and an NCO are available, referred to as the double negative control (Miao, Geng, and Tchetgen Tchetgen, 2018; Shi et al., 2020; Tchetgen Tchetgen et al., 2020). In this recent body of work, the double negative control design was extended in several important directions including settings in which proxies of treatment and outcome confounding routinely measured in well designed observational studies may be used as negative control variables, a framework termed proximal causal inference; longitudinal settings where one is interested in the joint effects of time-varying exposures (Ying et al., 2021), potentially subject to both measured and unmeasured confounding by time-varying factors; and in settings where one aims to estimate direct and indirect effects in mediation analysis subject to unmeasured confounding or unmeasured mediators (Dukes, Shpitser, and Tchetgen Tchetgen, 2021; Ghassami, Shpitser, and Tchetgen Tchetgen, 2021). Additional recent papers in this fast-growing literature include Qi, Miao, and Zhang (2021), Liu and Tchetgen Tchetgen (2021), Egami and Tchetgen Tchetgen (2021), Kallus, Mao, and Uehara (2021), Imbens, Kallus, and Mao (2021), Deaner (2018), Deaner (2021), Ghassami et al. (2021), Mastouri et al. (2021) and Ghassami, Shpitser, and Tchetgen Tchetgen (2022). Importantly, existing identification results in negative control and proximal causal inference literature has been restricted to i.i.d settings (Miao, Shi, and Tchetgen Tchetgen, 2018) and time series settings (Shi et al., 2021), and to date, to the best of our knowledge, outcome-dependent sampling settings such as TND, have not been considered, particularly one where confounding and selection bias might co-exist.

1.3. Outline

The rest of the paper is organized as followed: we introduce notation and the identification challenge in view in Sections 2.1. Next we develop the identification strategy and describe a new debiased estimator under a double negative control TND study in Section 2.22.4, assuming (1) homogeneous VE across strata defined by all measured and unmeasured confounders and (2) no direct effect of vaccination on selection into the TND sample. In Section 2.5, we relax the homogeneous VE assumption and describe identification and estimation allowing for VE to depend on observed covariates. In Section 2.6, we relax the assumption of no direct effect of vaccination on selection and introduce the assumptions under which our VE estimator is unbiased on the odds ratio scale. In Section 3, we demonstrate the performance of our method with simulation. In Section 4, the approach is further illustrated in an application to estimate COVID-19 VE against infection in a TND study nested within electronic health records from University of Michigan Health System. We then conclude with a discussion in Section 5.

2. Method

2.1. Preliminary: estimation under no unmeasured confounding and no selection bias

To fix ideas, we first review estimation assuming all confounders (U,X) are fully observed and the study sample is randomly drawn (rather than selected by testing) from source population, referred to as the “target population”. That is, we observe data on (A,Y,U,X) which are independent and identically distributed in the target population. For each individual, we write Y(a) as the binary potential infection outcome had, possibly contrary to fact, the person’s vaccination status been A=a, a=0,1. Our goal is to provide identification and estimation strategies for the causal risk ratio (RR) defined as RR=E[Y(1)]/E[Y(0)]. Let β0 denote the log causal RR, i.e., RR=expβ0. Following Hudgens and Halloran (2006) and Struchiner and Halloran (2007), we define VE as one minus the causal RR: VE=1RR=1expβ0. The potential outcomes and the observed data are related through the following assumptions:

Assumption 1.

(Identification conditions of mean potential outcomcs).

  1. (Consistency) Y(a)=Y if A=a almost surely for a=0,1;

  2. (Exchangeability) AY(a)U,X for a=0,1.

  3. (Positivity) 0<P(A=aU,X) almost surely a=0,1.

Assumption 1(a) states that the infection status of a subject with vaccination status A=a is equal to the corresponding potential outcome Y(a). This further requires that the treatment is sufficiently well-defined and a subject’s potential outcome is not affected by the treatment of other subjects (Cole and Frangakis, 2009). Assumption 1(b) states that treatment is exchangeable within strata of (U,X), i.e. there is no unmeasured confounding given (U,X). We develop methods that allow U to be ummeasured in Section 2.3. Assumption 1(c) states that for all realized values of (U,X) there is at least one individual with an opportunity to get treatment a=0,1

Let Q(A=a,U,X)=1/P(A=aU,X) denote the inverse of the probability of vaccination status A=a given confounders (Rosenbeum, 1987). Under Assumption 1, it is well known that, if U were observed, the mean potential outcome for treatment a in the general population can be identified by inverse probability of treatment weighting (IPTW):

EYa=EIA=aQA=a,U,XY, (1)

for a=0,1. Therefore, the log causal RR β0 satisfies the following equation

EI(A=1)Q(A=1,U,X)Yexpβ0E[I(A=0)Q(A=0,U,X)Y]=0.

Equivalently, we have

EV0A,Y,U,X;β0=0 (2)

for the unbissed estimating function, where V0(A,Y,U,X;β)=(1)1AQ(A,U,X)Yexp(βA).

2.2. Tackling selection bias under a semiparametric risk model

Next, consider a TND study for which data (A,Y,X,U) is observed only for the tested individuals with S=1. Because S is impacted by other factors such as infection, the estimating function V0A,Y,U,X;β0 may not be unbiased with respect to the study sample; i.e.

EV0A,Y,U,X;β0S=10 (3)

without another assumption about the selection process into the TND sample.

For a study sample of size n from a TND, we denote the i-th study subject’s variables as Ai,Yi,Ui,Xi, i=1,,n. For generalizability, we first make the key assumption that vaccination A is unrelated to selection S other than through a subject’s infection status Y and confounders (U,X).

Assumption 2

(Treatment-independent sampling). SAY,U,X.

In the test-negative design, this assumption requires that an individual’s decision to seek care and get tested only depends on the presence of symptoms and his/her underlying behavioral or socioeconomic characteristics, including HSB (contained in (U,X)); a person’s vaccination status does not directly Influence their selection process. The DAGs in Figure 1(a)(e) in fact encode this conditional independence condition. We will relax this assumption in Section 2.6.

Assumption 3

(No effect modification by a latent confounder). For a=0,1,

P(Y=1A=a,U,X)=expβ0ag(U,X) (4)

where g(U,X) is an unknown function restricted by 0P(Y=1A,U,X)1 almost surely.

Assumption 3 defines a semiparametric multiplicative risk model which posits that vaccine effectiveness, measured on the RR scale, is constant across (U,X) strata in the target population. In other words, the effect of vaccination A on the risk of infection Y is not modified by confounders U,X. In Section 2.5, we will relax the assumption to allow for effect modification by measured confounders X. Infection risk for control subjects P(Y=1A=0,U,X)=g(U,X) is left unspecified and thus defines the nonparametric component of the model.

Under Assumptions 1 and 3, one can verify that expβ0=E[Y(1)]/E[Y(0)], which is the marginal causal RR. The potential infection outcome means E[Y(1)] and E[Y(0)] in the target population cannot be identified due to the study selection process without an additional restriction. Nevertheless, the estimating equation (2) implies that it may still be possible to identify β0 without necessarily identifying E[Y(0)] and E[Y(1)]. The following proposition indicates that the same is true when the data are subject to selection bias of certain structure.

Proposition 1.

Under Assumptions 13, the parameter β0 satisfies

EV0A,Y,U,X;β0S=1=0. (5)

The proof of Proposition 1 is in Appendix A . From Proposition 1, the IPTW estimating function V0 derived from the target population is also unbiased with respect to the study sample.

Under Assumptions 13, one can estimate β0 with βˆ as the solution to

1ni=1n(1)1AiQAi,Ui,XiYiexp(βˆ0Ai)=0, (6)

where n is the size of the selected sample, Q^Ai,Ui,Xi=P^A=AiUi,Xi is the estimated probability of have vaccination status A=Ai given confounders Ui,Xi. The resulting estimator

β0=i=1nQˆAi,Ui,XiAiYi/i=1nQˆAi,Ui,Xi1AiYi

is essentially the IPIW estimator of marginal RR in Schnitzer (2022) assuming Ui,Xi’s are all observed.

However, Q(A,U,X) cannot be estimated because U is unobserved. Furthermore, even if U were observed, QˆAi,Ui,Xi may not be identified from the TND sample due to selection bias. In the next section, we describe a new framework to account for unmeasured confounding in a TND setting, leveraging negative control exposure and outcome variables.

2.3. Tackling unmeasured confounding bias leveraging negative controls

2.3.1. Negative control exposure (NCE) and treatment confounding bridge function

As shown in Figure 1(e), suppose that one has observed a valid possibly vector-valued NCE, denoted as Z, which is a priori known to satisfy the following key independence conditions:

Assumption 4

(NCE independence conditions). Z(Y,S)A,U,X.

Assumption 4 essentially states that any existing ZY association conditional on (X,A) in the target population must be a consequence of their respective association with U, therefore indicating the presence of confounding bias. Importantly, the NCE must a priori be known to have no causal effect on infection status (Mino, Shi, and Tchetgen Tchetgen, 2018). Likewise, the association between Z and S conditional on (X,A) is completely due to their respective association with U. Figure 1(e) presents a graphical illustration of an NCE that satisfies Assumption 4.

In the Influenza VE setting, a candidate NCE can be vaccination status for the preceding year, or other vaccination status such as Tdap (Tetanus, Diphtheria, Pertussis) vaccine, as both are known to effectively provide no protection against the circulating flu strain in a given year. We now provide an intuitive description of our approach to leverage Z as an imperfect proxy of U for identification despite being unable to directly observe U.

To illustrate the rationale behind identification, ignore selection bias for now and suppose that Q(A,U)=α0+α1A+α2U, suppressing messured confounders X. Although U is unobserved, suppose further that Z satisfies E[ZA,U]=γ0+γ1A+γ2U. Then we have that

Q(A,U)=E[q(A,Z)A,U],U=E[U˜(A,Z)A,U],

where U˜=Zγ0γ1A/γ2. Replacing U with U˜ in Q(A,U), we get q(A,Z)=α0+α1A+α2U˜(A,Z), which does not depend on unmeasured confounder U and can represent the inverse probability of vaccination as Q(A,U)=E[q(A,Z)A,U]. If all parameters of q were known, it would naturally follow that the IPTW method in (1) can be recovered by

E[Y(a)]=E{I(A=a)E[q(A,Z)A,U]Y}=A.4E[I(A=a)q(A,Z)Y],

Therefore, β0 can be identified if the distribution of (A,Y,Z) in the target population is available provided that parameters indexing q can be identified.

The above insight motivates the following assumption:

Assumption 5

(treatment confounding bridge function). There exists a function q(A,Z,X) that satisfies, for every a, u and x,

Q(A=a,U=u,X=x)=E[q(A,Z,X)A=a,U=u,X=x] (7)

The function q that satisfies (7) is called a treatment confounding bridge function, as it bridges the observed NCE with the unobserved propensity score (Cui et al., 2020). Below we give two examples where the integral equation (7) can easily be solved and the treatment confounding bridge function q admits a closed form solution.

Example 1.

(Binary U and Z) Suppose that U is binary, and so is the NCE Z. For simplicity we suppress X. The integral equation (7) can then be written as z01q(a,z)P(Z=zU=u,A=a)=P(A=aU=u)1,

or equivalently, z01pza.uq(a,z)=1 for each a,u{0,1}, where pza.u=P(Z=z,A=aU=u). Therefore, the treatment confounding bridge function q(a,z) solves the linear equation system

PZ,AUq(a,0)q(a,1)=11,wherePZ,AU=p0a.0p1a.0p0a.1p1a.1.

If the matrix PZ,AU is invertible, then q(a,z) has a closed form solution given by

q(a,z)=p1a.1p1a.0+p0a.0p0a.1p1a.1+p1a.0z/p0a.0p1a.1p0a.1p1a.0. (8)

The result can be extended to the cases where Z is polytomous as detailed in Appendix D.

Example 2.

(Continuous U and Z) Suppose the unmeasured confounder U and the NCE Z are continwous. Further assume that

AU,XBernoulli(1+expμ0AμUAUμXAX1)ZA,U,XNμ0Z+μAZA+μUZU+μXZX,σZ2.

By the derivation in Appendix E, the treatment confounding bridge function q(A;Z;X) is

q(A,Z,X)=1+exp[(1)Aτ0+τ1A+τ2Z+τ3X] (9)

where τ0=μ0AμUAμ0ZμUZσZ2μUA22μUZ2,τ1=σZ2μUA2μUZ2μUAμAZμUZ, τ2=μUA/μUZ, and T3=μXAμXZμUA/μUZ.

Formally, Equation (7) defines a Fredholm integral equation of the first kind, with treatment confounding bridge function q(A,Z,X) as its solution (Cui et al., 2020). Heuristically, the existence of a treatment confounding bridge function requires that variation in Z induced by U is sufficiently correlated with variation in A induced by U. For instance, in Example 1, existence of a treatment confounding bridge function requires that the matrix PZ,AU is nonsingular. In Example 2, the existence of a treatment confounding bridge function amounts to the condition μUZ0, which again requires ZUA,X. Cui et al. (2020) provided formal conditions sufficient for the existence of the treatment confounding bridge function satisfying Equation (7). These conditions are reproduced for completeness in Appendix B.

Thus, under Assumption 5, we propose to construct a new unbiased estimating function for β0 by replacing Q(A,U,X) with q(Z,A,X) in U0A,Y,U,X;β0.

Theorem 1.

(Moment restriction of β0) Under Assumptions 15, we have that

EV1A,Y,Z,X;β0S=1=0

where V1(A,Y,Z,X;β0)=(1)1Aq(A,Z,X)Yexpβ0,A.

The proof of Theorem 1 is in Appendix C. In practice, if one can consistently estimate the treatment confounding bridge function q(A,Z,X) with q^(A,Z,X), Theorem 1 suggests estimating β0 by solving the estimating equation

1ni=1n(1)1AiqˆAi,Zi,XiYiexpβ0Ai=0, (10)

which results in a closed form estimator

β^0=logqˆAi,Zi,XiAiYiq^Ai,Zi,Xi1AiYi.

Importantly, although (7) may not have a unique solution, any solution uniquely identifies the causal log RR β0. The result in Theorem 1 cannot directly be applied in practice because the treatment confounding bridge function is not identifiable even if random samples from the target population were available - solving (7) requires additional information about U which is unobserved. For instance, in Example 1 one is unable to directly estimate q(a,z) because pza.u in (8) cannot directly be estimated from the observed data.

2.3.2. Negative control outcome (NCO) for identification of treatment confounding bridge function

For identification and estimation of q, we leverage negative control outcomes (NCO) to construct feasible estimating equations for the treatment confounding bridge function as in Cui et al. (2020). Similar to NCEs, NCOs can be viewed as imperfect proxies of U. However, unlike NCEs, a valid NCO, denoted by W, is a measured covariate which is (i) known a priori not to be a causal effect of either the primary exposure A nor the NCE Z; and (ii) is associated with (A,Z) conditional on X only to the extent that it is associated with U.

Formally, we make the following assumption.

Assumption 6.

(NCO Independence Conditions).

  1. WAU,X;

  2. WZA,U,X,Y;

  3. SZA,U,X,W,Y.

Assumptions 6(a) and (b) formalize the requirement that neither the primary exposure nor NCE have direct effects on the NCO. Assumption 6(c) complements Assumption 4 and states that conditioning on W in addition to (A,U,X,Y) does not alter the conditional independence of Z with S. In flu VE studies, a candidate NCO can be an infection whose risk is not causally affected by either A or Z. For example, if the NCE is selected to be Tdap vaccination, then a potential NCO may be current-year respiratory syncytial virus infection, as its risk is unlikely to be affected by Influenza or Tdap vaccination. Recent outpatient visits for other acute illnesses can also serve as NCO, such as blepharitis, wrist/hand sprain, lipoma, ingrowing nail, etc. (Leung et al., 2011). Figure 1(e) illustrates an NCO W that satisfies Assumptions 6(a) and (b).

Similar to Cui et al. (2020), we leverage the availability of an NCO as an additional proxy to identify the treatment confounding bridge function. However, a complication arises due to lack of a random sample from the target population, a key requirement in the approach outlined in Cui et al. (2020). In general, it is not possible to obtain sufficient information about neither the distribution of W nor that of U in the target population from the TND data without an additional structural assumption (Bareinboim and Pearl, 2012). In the following, we avoid imposing such an additional structural assumption by leveraging an important feature of infectious diseases such as Influenza and COVID-19; mainly that contracting such an infection is a rare event in most target populations of interest, and therefore information from the target population that is relevant for estimating the treatment confounding bridge function can be recovered from the test-negative control group. Formally, we make the following rare disease assumption.

Assumption 7

(Rare infection). There exist a small positive number δ>0 such that

P(Y=1A=a,W=w,U=u,X=x)δ,foralmosteverya,w,u,x (11)

Assumption 7 states that infected subjects, whether vaccinated or not and regardless of their negative control outcomes, only constitute a small proportion of each (U,X) stratum in the general population; specifically, the assumption implies that 11δP(A,ZU=u,X=x,Y=0)P(A,ZU=u,X=x)1δ. Thus, under Assumptions 2, 4 and 7, P(A=a,Z=zU=u,X=x)P(A=a,Z=zU=u,X=x,Y=0,S=1) for all a,z,x,u. We now introduce a key property of the treatment confounding bridge function in Theorem 2, which is proved in Appendix F.

Theorem 2

(Identification of the treatment confounding bridge function). Under Assumptions 2, 4, 5, 6, and 7, for a=0,1 we have that

(1δ)3P(A=aW,X,Y=0,S=1)<E[q(a,Z,X)W,A=a,X,Y=0,S=1]<1(1δ)3P(A=aW,X,Y=0,S=1)

Thus, provided δ0, Theorem 2 suggests that an approximation to the treatment confounding bridge function can be obtained by solving the following integral equation involving only observed data

Eq*(A,Z,X)W,A=a,X,Y=0,S=1=1/P(A=aW,X,Y=0,S=1). (12)

as long as a solution exists. Accordingly, hereafter suppose that the following assumption holds.

Assumption 8

(Existence of a unique solution to (12)). There exists a unique square-integrable function q*(A,Z,X) that satisfies (12).

Heuristically, uniqueness of a solution to (12) requires that variation in W is sufficiently informative about variation in Z, in the sense that there is no source of variation in W that is not associated with a corresponding source of variation in Z. See Appendix G for further elaboration of completeness conditions and D’Haultfoeuille (2011) and Newey and Powell (2003) for related use of the assumption in the literature. Below we briefly illustrate Assumption 8 in the examples of Section 2.3.1.

Example 1′.

Suppose U and Z are both binary, and a binary NCO W is also observed. Let pza.w=P(Z=z,A=aW=w,Y=0,S=1) for z,a,w{0,1}, then solving (12) is equivalent to solving the system of linear equations

p0a.0q*(a,0)+p1a.0q*(a,1)=1;p0a.1q*(a,0)+p1a.1q*(a,1)=1,

giving q*(a,z)=p1a.1p1a.0+p0a.0p0a.1p1a.1+p1a.0z/p0a.0.0p1a.1p0a.1p1a.0. Note that the probabilities pza.w can all be estimated from the study sample.

We emphasize that the solution to Equation (12) is ultimately an approximation to the (non-identifiable) treatment confounding bridge function in the target population. The accuracy of this approximation relies on the extent to which the rare disease assumption holds in the target population of interest. We study the potential bias resulting from a departure of this key assumption in the Appendix I. We further observe that, under the null hypothesis of no vaccine effectiveness, or if W has no direct effects on Y or S, then the function q*(A,Z,X) equals the treatment confounding bridge exactly, even for a non-rare disease outcome, as stated in the corollary below. We prove Corollary 1 in Appendix F.

Corollary 1.

Under the Assumptions of Theorem 1 and Assumption 8, if there is no vaccine effect against infection, such that YAU,X.

From Theorem 2, we immediately have the following corollary which provides a basis for estimation of q*(A,Z,X) from the observed TND sample.

Corollary 2.

Under the conditions listed in Theorem 2, for any function m(W,A,X), the solution q*(A,Z,X) to Equation (12) also solves the population moment equation

Em(W,A,X)q*(A,Z,X)m(W,1,X)m(W,0,A)Y=0,S=1=0. (13)

We prove Corollary 2 in Appendix H In practical situations where a parametric model q*(A,Z,X;τ) for the treatment confounding bridge function might be appropriate, where τ is an unknown finite dimensional parameter indexing the model, Corollary 2 suggests one can estimate τ by solving the estimating equation

1ni=1n1YimW,A,XqA,Z,X;τmW,1,XmW,0,X=0, (14)

where m(W,A,X) is a user-specified function whose dimension is no smaller than τ ‘s.

Example 1″.

If Z and W are both binary, rather than solving the system of equations implied by (12), one can instead specify a saturated model for the treatment confounding bridge function:

q*(A,Z;τ)=τ0+τ1Z+τ2A+τ3ZA (15)

and estimate τ=τ0,τ1,τ2,τ3T by solving (14) with m(W,A)=(1,W,A,WA)T. Extension to Z and X with multiple cateyories is straightforward.

Example 2′.

In case of continuous (U,X,Z), result (9) suggests the model

q*A,Z,X;τ=1+exp[(1)Aτ0+τ1A+τ2Z+τ3X]. (16)

If a univariate NCOW is available, we may solve (14) with m(W,A,X)=(1,W,A,X)T.

2.4. Estimation and Inference

In the previous sections, we have defined the cansal parameter of interest β0 as stratum-specific log risk ratio, introduced the treatment confounding bridge function as a key ingredient to identification of β0, and presented a strategy to estimate the treatment confounding bridge function leveraging an available NCO. We summarize the steps of our estimation framework in Algorithm 1 and present the large-sample properties of the resulting estimator (β^,τ^) in Theorem 3.

Algorithm 1.

Negative control method to estimate vaccine effectiveness from a test-negative design

1: Identify the variables in the data acoording to Figure 1(e), in particular the NCEs and NCOs.
2: Estimate the treatment confounding bridge function by solving Equation (14) with a suitable parametric model q*(A,Z,X;τ) and a user-specified function m(W,A,X). Write τˆ as the resulting estimate of τ.
3: Estimate β0 by solving Equation (10) with q^(A,Z,X)=q*(A,Z,X;τ^). The resulting estimator of β0 is
β^=logq*Ai,Zi,Xi;τ^A1Yiq*Ai,Zi,Xi;τ^1AiYi; (17)
The estimated vaccine effectiveness is VE^=1exp(β^).

Theorem 3

(Inference based on (β^,τ^) ). Under Assumptions 1 8 and suitable regularity conditions provided in Appendix J, the estimator (β^,τˆ) in Algorithm 1, or equivalently, the solution to the estimating equation 1n11nGi(β,τ)=0 is regular and asymptotically linear with the i-th influence function

IFi(β,τ)=[Ω(β,τ)TΩ(β,τ)]1Ω(β,τ)TGiβ,τ,

where

Gi(β,τ)=(1)1Aiq*Ai,Zi,Xi;τYiexpβAi1YimWi,Ai,Xiq*Ai,Zi,Xi;τmWi,1,XimWi,0,Xi

and Ω(β,τ)=EGi(β,τ)/βT,EGi(β,τ)/τT.

The proof follows from standard estimating equation theory (See Van der Vaart (2000) Theorem 5.21). An immediate consequence of Theorem 3 is that we may estimate the variance-covariance matrix of (β^,τ^) with

Σ^n=Ω^(β^,τ^)TΩ^(β^,τ^)1Ω^(β^,τ^)TVar^(Gi(β^,τ^))Ω^(β^,τ^)TΩ^(β^,τ^)TΩ^(β^,τ^)1/n, (18)

where Ω^(β,τ)=E^Gi(β,τ)/βTβ=β^,τ=τ^,E^Gi(β,τ)/τTβ=β^,τ=τ^. Here Eˆ and Var^ denote the expectation and variance with respect to the empirical distribution, respectively. A two-sided α-level Wald-type confidence interval of VE can then be obtained as

1expβ^z1α/2Σ^n,1,1,1expβ^+z1α/2Σ^n,1,1

where Σ^n,1,1 is the (1,1)-th entry of Σ^n and z1α/2 is the (1α/2)-th quantile of a standard normal distribution.

The estimator βˆ and the above confidence interval are constructed under the assumption that the disease is rare in the target population; for non-rare diseases, βˆ is in general going to be biased and the confidence interval may not be well-calibrated. However, by Corollary 1, under the null hypothesis of no vaccine effects, the estimated q*(A,Z,X) converges to the true treatment confounding bridge function and βˆ is consistent for β0=0. This implies that while our methods are approximately asymptotically unbiased for rare infections, they provide a valid test of no vaccine effect even if the infection is not rare.

2.5. Accounting for effect modification by measured confounders

So far we have operated under Assumption 3 that VE is constant across levels of (U,X). As we now show, this assumption can be relaxed to allow for potential effect modification with respect to X without compromising identification. This extension is particularly important because empirical evidence has indeed suggested that flu vaccine effectiveness may vary across sex and age groups (Chambers et al., 2018 ); and similar effect heterogeneity is of key interest in case of COVID-19 (Fernández Villalobos et al., 2021).

Instead of Assumption 3, we consider a less stringent assumption:

Assumption 9

(No effect modification by unmessured confounders).

P(Y=1A=a,U,X)=expβ0(X)ag(U,X) (19)

where β0(x) are g(u,x) are unknown functions of x and u, x respectively.

Under condition 1, Assumption 9 further implies that β0(x)=E[Y(1)X=x]/E[Y(0)X=x], i.e. the conditional causal RR as a function of x. Similar to Theorem 1, we have:

Theorem 4.

Under Assumptions 1, 2, 4, 5 and 9, we have that EV3A,Y,Z,X;β0S=1=0, where V3A,Y,Z,X;β0=(1)1Aq(A,Z,X)expβ0(X)A.

The proof of Theorem 4 is identical to that of Theorem 1 with β0A replsced with β0(X)A. Identification and estimation of the treatment confounding bridge function are also essentially identical to that of Corollary 2. Therefore, it is straightforward to extend Algorithm 1 to allow effect modification by measured confounders. We describe the algorithm and the large sample properties of the resulting estimator in Appendix K.

2.6. Estimating VE under treatment-induced selection into TND sample

Thus far, unbiasedness of the estimating function V0 has crucially relied on Assumption 2 that A does not have a direct effect on S. In some settings, the assumption may be violated if an infected person who is vaccinated is on average more likely to present to the ER than an unvaccinated infected person with similar symptoms, so that treatment or vaccination-induced selection into the TND sample is said to be present. In such settings, both estimators βˆ and βˆ(X) produced by Algorithms 1 and 2 may be severely biased because Assumption 2 may no longer be valid. Crucially, we note that this form of selection bias can be present even in context of a randomized trial in which vaccination/treatment is assigned completely at random, if the outcome is ascertained using a TND, for example in the cluster-randomized test-negative design studies of community-level dengue intervention effectiveness Anders et al. (2018), Dufault and Jewell (2020), Jewell et al. (2019), and Wang et al. (2022). In this Section, we provide sufficient conditions for identification under treatment-induced selection. In this vein, consider the following assumptions:

Assumption 2′.

P(S=1A=a,Y=1,U,X)/P(S=1A=a,Y=0,U,X)=exp(h(U,X))

for a = 0, 1.

That is, the risk ratio association between infection status and selection into the TND sample is independent of vaccination status. Furthermore,

Assumption 3′.

(No effect modification by confounders on the OR sacle).

P(Y=1A=1,U,X)/P(Y=0A=1,U,X)P(Y=1A=0,U,X)/P(Y=0A=0,U,X)=exp(β0).

Recall that Assumption 3 posited a constant vaccination causal effect on the RR scale across levels of (U,X), while Assumption 3′ posits that the corresponding causal effect on the odds ratio scale is constant across levels of (U,X). In case of a rare infection in the target population, the OR and RR are approximately equal, in which case VE is well approximated by 1OR.

Furthermore, identification relies on the following modified definition of a treatment confounding bridge function:

Assumption 5′.

There exists a treatment confounding bridge function q˜ such that for a=0,1,

E[q˜(a,Z,X)A=a,U,X]=1/P(A=aU,X,Y=0,S=1)almostsurely. (20)

Note that if the infection is rare in the target population in the sense of Assumption 7, then the treatment confounding bridge function defined in Assumption 5 in Section 2.3.1 satisfies (20) approximately.

We now introduce the identification of the OR with the following theorem:

Theorem 1′.

Under Assumptions 1, 2′, 3′, 4 and 5′, we have

EV˜1A,Y,Z,X;β0S=1=0

where V˜1(A,Y,Z,X;β) is the same as V1 defined in Theorem 1 except with q˜ replacing q.

Importantly, the theorem establishes that the estimating function V1 previously developed in the paper can under certain conditions, remain unbissed for the odds ratio association of vaccination with testing positive for the infection, even in the presence of treatment-induced selection into the TND sample. We leave the proof of Theorem 1′ to Appendix L.

Estimation of the treatment confounding bridge function q˜(A,Z,X) requires a negative control outcome that satisfies:

Assumption 6′.

(NCO Independence Conditions) W(A,Z,S)U,X,Y.

In addition to Assumptions 6, this last assumption requires that neither Y nor S is a causal effect of W. Figure 1(f) illustrates a DAG that satisfies our assumptions regarding (Z,W). As can be verified in the graph, Assumption 6′ is needed to ensure that collider stratification bias induced by the path A[S=1]W upon conditioning on S=1 is no longer present. Identification of the function q˜ is given below:

Theorem 2′.

Under assumptions 4, 5′ and 6′, for a=0,1 we have that

Eq˜a,Z,XA=a,W,X,Y=0,S=1=1/PA=aW,X,Y=0,S=1

We prove Theorem 2′ in Appendix M. As a result of Theorem 2′, the parameters in the treatment confounding bridge function can be estimated by solving moment equation (14)).

In summary, the above discussion suggests that one can continue to use Algorithm 1 to estimate VE in presence of treatment induced selection bias, albeit on the OR scale and under a modified set of negative control conditions. Algorithm 2 can similarly be justified under treatment-induced selection with assumptions in this section, except that β in Assumption 3′ is replaced by the conditional log RR β0(X).

As a side note, Assumption 2′ automatically holds under Assumption 2, and hence the above results in this section also apply to the setting in previous sections that is illustrated in Figure 1(e). We present this statement in the following corollary.

Corollary 3.

Under Assumptions 1, 2, 3′, 4 and 5′, we have

EV˜1A,Y,Z,X;β0S=1=0.

With Assumption 2, the treatment confounding bridge function q˜ can be estimated by solving the moment equation (14) either under under Assumption 6′ and 8, or Assumptions 6, 7 and 8 as an approximation under the rare disease assumption. Corollary 3 leads to an interesting observation: under the treatment-independent sampling (Assumption 2), the estimator βˆ from Algorithm 1 can be viewod as either log RR or logOR, depending on the set of assumptions made.

3. Simulation Study

To assess the empirical performance of our proposed method, we consider two settings with different types of confounding and negative control variables, and perform corresponding simulation studies.

In the first setting, we consider no measured confounder, a binary unmeasured confounder U, a binary NCE Z and a binary NCO W. To trigger selection among subjects with Y=0, we let D be a binary indicator of the presence of other flu like illnesses. The treatment confounding bridge function is thus given by (8). We assume the distribution of Y is Bernoulli with a log-linear risk model: YA,U~Bernoulliexpη0Y+β0A+ηUYU. We consider values of β0 to be −1.609, −0.693, −0.357 or 0, corresponding to a risk ratio of 0.2, 0.5, 0.7 or 1. We assume the selection S only equals one with nonzero probability if at least one of Y,W and D equals one, and is independent of A and Z conditional on other variables. The resulting prevalence of Y in the target population is 0.75% among the unvaccinated individuals and 0.55%, 0.65%, 0.72% or 0.75% among the vaccinated individuals, corresponding to four values of β0. Next, we consider a setting where X,U,Z and W are all univariate continuous variables. We generate the infection outcome using a log-linear model

YA,U,XBernoulliexpμ0Y+β0A+μUYU+μXYX+μUXYUX.

We generate A and Z following Example 2 in Section 2.3. As such the treatment confounding bridge function is given by Equation (9). The probability of S=1 is 1 only if at least one of Y and D is nonzero. The resulting prevalence of Y in the target population is 0.34% among the unvacinated individuals and 0.24%, 0.28%, 0.31% or 0.34% among the vaccinated individuals, corresponding to four values of β0. Appendix N and O give more details on the data-generating mechanism for the two settings.

In each scenario, we simulate a target population of size N=7,000,000 and implement 1,000 simulation iterations. For both settings, we evaluate the performance of three estimators for β0:

  • NC estimator: our proposed estimator given by Algorithm 1. In the first setting, we use a saturated parametric model (15) for the treatment confounding bridge function in the first setting, with m(W,A)=(1,W,A,WA)T; in the second setting, we use model (16) and m(W,A,X)=(1,W,A,X)T.

  • NC-Oracle estimator: the estimated treatment confounding bridge function in Algorithm 1 is only an approximation under Assumption 7, whose bias may affect the estimation for β0, as derived in Appendix I. We therefore include NC-Oracle estimator that uses the true treatment confounding bridge function q(A,Z,X). Appendices E include derivation of the true treatment confounding bridge function under the continuous (X,U,Z,W) setting.

  • Logistic regression: we also consider a logistic regression model of Y on A (and X in the second setting), overlooking the ummeasured confounders U. This is a common choice for covariate adjusted analyses of test-negative designs but ignores biases caused by U (Bond, Sullivan, and Cowling, 2016). We comment in Appendix P that the estimator is appropriate in the absence of unmeasured confounders except for potential model misspecification.

We note that the target parameter β0 for NC estimator and NC-Oracle estimator is log cansal RR, while logistic regression gives log causal OR. However, the two parameters are approximately equal under Assumption 7 where the infection risk is low in the target population.

Figure 2 shows the bias of three estimators considered and the coverage of their 95% confidence intervals. In both settings, both NC and NC-Oracle are essentially unbiased whereas logistic regression gives a biased estimate in all scenarios. NC-Oracle exhibits slightly higher precision than NC, which implies that estimating the treatment confounding bridge function in the TND is only slightly more variable. The 95% confidence intervals for NC and NC-Oracle both achieve nominal coverage, whereas logistic regression-based confidence intervals under-cover severely. We repeated the simulation under a non-rare disease setting in Appendix Q. In such scenarios, while NC-Oracle estimator is still unbiased with calibrated 95% confidence intervals, the NC estimator is biased in general except when β0=0. We conclude that the proposed NC estimator is umbiased of the log causal PR either under a rare disease setting or under a non-rare disease setting with no vaccine effect.

Figure 2:

Figure 2:

Bias (left) and coverage rates of 95% confidence interval (right) for the oracle estimator (NC-Oracle), GMM estimator (NC-GMM) and logistic regression (Logit Reg.) with a (a) binary or (b) continuous unmeasured confounder.

4. Application

We applied our proposed method to a TND study of COVID-19 VE against COVID-19 infection nested in the University of Michigan Health System. The selected study sample includes patients who interacted with the University of Michigan Health System and experienced COVID-19 symptoms, had suspected exposure to COVID-19 virus, or sought to screen for COVID-19 infection, between April 5, 2021 and December 7, 2021. In addition, the selected test-positive subjects had at least one positive lab tests for COVID-19 infection after April 5. Vaccination history was obtained through electronic health records. A study subject was considered fully vaccinated if they received at least one dose of Johnson & Johnson’s Janssen vaccine or at least two doses of Moderna or Pfizer vaccine. If a subject tested positive before or within 14 days after their first dose of Janssen vaccine or within 14 days after their second dose of Moderna or Pfizer vaccine, they were considered unvaccinated (Moline et al., 2021).

We took immunization visits before December 2020 as NCE since COVID-19 vaccines were not available before December 2020 and immunization before was unlikely to affect the risk of COVID-19 infection; nor that of the selected NCOs we describe next. For NCO, we selected a binary indicator of having at least one of the following “negative control outcome” conditions after April 5, 2021: arm/leg cellulitis, eye/ear disorder, gastro-esophageal disease, atopic dermatitis, and injuries. Such candidate NCE and NCO are likely to satisfy the requisite conditional independence conditions for them to be valid negative control variables and to be related to a patient’s latent HSB. We adjusted for age groups (<18, between 18 and 60, or ≥ 60), gender, race (white or non-white), Charlson comorbidity score ≥ 3, and the calendar month of a test-positive subject’s first positive COVID test or a test-negative subject’s last COVID test. Table 3 in Appendix R summarizes the distribution of negative control variables, demographic variables and COVID-19 infection among vaccinated and unvaccinated subjects.

Because NCE is expected not to be associated with either the outcome or NCE in a fully adjusted analysis unless there is unmeasured confounding, we first fit regression models to detect presence of residual confounding bias. Conditioning on the baseline covariates, in both vaccinated and unvaccinated groups, NCE is significantly associated with COVID-19 infection (p < 0.001) and NCO (p < 0.001) in corresponding adjusted logistic regression models, suggesting the presence of hidden biases (See Appendix R Table 4, 5).

We implemented Algorithm 1 to estimate VE. We specified a linear model for the treatment confounding bridge function with an interaction term between COVID-19 vaccination and the NCE, and set the function m to include one (for an intercept term), COVID-19 vaccination, the NCO, and baseline covariates, as well as all two-way interactions. For comparison, we also implemented a logistic regression model, which gives an unbiased estimate of causal odds ratio under the no unmeasured confounding assumption, adjusting for gender and age groups. In this case, The VE can be approximated by one minus the odds ratio of COVID-19 infection against vaccination, as COVID-19 infection rate is known to be low across strata in the target population.

The double negative control Algorithm 1 estimated a causal log-RR associated with vaccination of −2.80 (95% CI: −3.08, −2.54) for COVID-19 infection in the target population, and the estimated VE was 94.0% (95% CI: 92.1%, 95.4%). The logistic regression estimates for the same log-RR was −3.18 (95% CI: −3.35, −3.01%), yielding VE estimate of 95.8% (95% CI: 95.1%, 96.5%). Table 6 and 7 in Appendix R give detailed output of Algorithm 1 and the logistic regression model respectively. Although there is significant evidence of hidden bias as summarized in Tables 4 and 5, the magnitude of detected bias is not appreciable, approximately 12% on the log-risk ratio scale, corresponding to a 1.8% difference on vaccine efficacy scale. A Hausman chi-squared test statistic (Hausman, 1978) comparing the two estimates on the log-RR scale is 8.84, giving a p-value of 0.003, indicating that the double negative control VE estimate is significantly smaller than that given by logistic regression.

The VE estimated with a standard logistic regression model is larger than VE estimates from RCTs which reported VE=94% for Moderna and VE=95% for Pfizer vaccines, respectively (Baden et al., 2021; Polack et al., 2020; Sadoff et al., 2021). We hypothesize that this may be due to some degree of confounding by HSB and related factors, which our proposed double NC approach appears to account for to some extent, recovering VE estimates more consistent with those of RCTs.

5. Discussion

In this article, we have introduced a statistical method for estimating vaccine effectiveness in a test-negative design. The approach leverages negative control variables to account for hidden bias due to residual confounding and/or selection mechanism into the TND sample. Negative control variables abound in practice, such as vaccination history which is routinely collected in insurance claims and electronic health records. Hence the proposed method may be particularly useful in such real world settings to obtain improved estimates of vaccine effectiveness.

The TND is a challenging setting in causal inference where selection bias and unmeasured confounding co-exist, selection is outcome-dependent, and unmeasured confounders also impact selection. As a result, the causal effect of interest is in general not identified from such studies (Cai and Kuroki, 2012). Nevertheless, we establish that progress can be made under a semiparametric multiplicative model, provided the outcome is rare in the target population, and double negative control variables are available. To this end, this article showcases the potential power of negative control methods and proximal causal inference in epidemiologic research (Shi, Miao, and Tchetgen Tchetgen, 2020; Tchetgen Tchetgen et al., 2020).

We focused on the outpatient TND, where recruitment is restricted to subjects who seek care voluntarily. TNDs have also been applied to inpatient settings for studying VE against, for example, flu hospitalization (Feng, Cowling, and Sullivan, 2016; Foppa et al., 2016). In inpatient TNDs, differential access to healthcare and underlying health characteristics between vaccinated and unvaccinated subjects are likely the main causes of confounding bias (Feng, Cowling, and Sullivan, 2016). Our methods are still applicable in such settings, but negative control variables should be selected to be relevant to the source of unmeasured confounding mechanism. For example, previous vaccination and hospitalization outside the flu season or hospitalization due to other flu-like illnesses are viable candidate NCE and NCO, respectively (Jackson et al., 2006).

Our approach is suitable for post-market TND studies where real-world vaccine effectiveness is of interest and vaccination history is obtained retrospectively, possibly through electronic health records. For vaccine efficacy in a controlled trial setting, Wang et al. (2022) recently developed estimation and inference of RR in cluster-randomized TND, aiming to correct for bias due to differential HSB induced by the intervention being unblinded. Because of randomization, they considered HSB as a post-treatment variable and proposed a log-contrast estimator which corrects for selection bias by leveraging a valid test-negative outcome, under an assumption that either (i) the vaccine does not have a causal effect in the population, and the causal impact of vaccination on selection is equal for test-positive and -negative subsamples; or (ii) among care seekers, the incidence of test-negative outcomes does not differ between vaccinated and unvaccinated, and the intervention effect among care seekers is generalizable to the whole population. We note that even under randomization, identification conditions given in Section 2.6 are neither stronger nor weaker than those of Wang et al. (2022) described above, as neither set of assumptions appear to imply the other. An important advantage of our proposed methods is that they can be used to account for selection bias in a TND study irrespective of randomization.

Our methods target RR as a measure of VE instead of the more common OR (Jackson et al., 2006; Sullivan, Tchetgen Tchetgen, and Cowling, 2016). These two measures are approximately equal for rare infections. Schnitzer (2022) recently considered estimation of a marginal causal RR in the TND sample and justified the use of an inverse probability of treatment weighted (IPTW) estimator in a setting in which an unmeasured common cause of infection and selection into the TND sample does not cause vaccination (and thus there is no unmeasured confounding). Instead, our methods allow for an unmeasured common cause of vaccination, infection and selection into the TND sample; however in order to estimate a causal RR, we invoke both, an assumptions of no effect modification by an unmeasured confounder, and a rare-disease condition. As we establish, the latter assumption is not needed if there is no vaccine effect against infection outcome. In Section 2.6, we establish that under a homogeneous OR vaccine effect measure condition, and an alternative definition of the treatment bridge function, our methods can identify a causal effect of the vaccine on the odds ratio scale without invoking the rare disease condition.

Throughout the article, we have assumed diagnostic tests are accurate and individuals who seek care are sparsely distributed, such that the vaccination of a given subject in the TND sample does not protect another study subject from infection, , i.e. there is no interference in the TND sample, a common assumption in TND literature. This assumption may be violated if members of the same households present in the ER in which case block interference must be accounted using results from interference literature (Hudgens and Halloran, 2008; Tchetgen Tchetgen and VanderWeele, 2012). Sensitivity analysis may be considered to evaluate how violation of these assumptions can potentially bias inferences about VE.

Acknowledgements

The authors thank Dr. Lili Zhao, Dr. Chen Shen and University of Michigan Precision Health Initiative for providing the University of Michigan Health System data.

The authors gratefully acknowledge NIH grants R01AI27271, R01CA222147, R01AG065276, R01GM139926.

A. Proof of Proposition 1

We need to prove

E[(1)1A1P(AU,X)Yexp(β0A)S=1]=0

Since

E(1)1A1P(AU,X)Yexpβ0AS=1=E(1)1A1P(AU,X)expβ0AY=1,S=1P(Y=1S=1)=EE(1)1A1P(AU,X)expβ0AU,X,Y=1,S=1Y=1,S=1P(Y=1S=1),

the result immediately follows if we can show that

E(1)1A1P(AU,X)expβ0AU,X,Y=1,S=1=0. (21)

By Assumption 2, the left-hand side of (21) equals

E(1)1A1P(AU,X)expβ0AU,X,Y=1.

We further have

E(1)1A1P(AU,X)expβ0AU,X,Y=1=a=01(1)1a1P(A=aU,X)expβ0aP(A=aU,X,Y=1)=a=01(1)1a1P(A=aU,X)expβ0aP(A=aU,X)P(Y=1A=a,U,X)P(Y=1U,X)=A.3a=01(1)1aexp(β,a)expβ0aP(Y=1A=0,U,X)P(Y=1U,X)=a=01(1)1aP(Y=1A=0,U,X)P(Y=1U,X)=0.

B. Existence of solutions to (7)

In this section, we provide the conditions of existence of solutions to (7). The conditions for (12) can be similarly derived. The results in this section directly adapted from Appendix B of Cui et al. (2020).

Let L2{F(t)} denote the Hilbert space of all square-integrable functions of t with respect to distribution function F(t), equiped with inner product g1,g2=g1(t)g2(t)dF(t). Let Ta,x denote the operator L2{F(za,x)}L2{F(ua,x)},Ta,zq=E[q(Z)A=a,U=u,X=x] and let λa,x,n,φa,x,n,ϕa,x,n denote a singular value decomposition of Ta,x. The solution to (7) exists if:

  1. f(za,u,x)f(ua,z,x)dzdu<;

  2. P2(A=aU=u,X=x)f(ua,x)du<;

  3. n1λa,x,n2P1(A=aU=u,X=x),ϕa,z,n2<.

C. Proof of Theorem 1

We need to prove

E[(1)1Aq(A,Z,X)Yexp(β0A)S=1]=0

Since

E[(1)1Aq(A,Z,X)Yexpβ0AS=1]=E[(1)1Aq(A,Z,X)expβ0AY=1,S=1]P(Y=1S=1)=E{E[(1)1Aq(A,Z,X)expβ0AU,X,Y=1,S=1]Y=1,S=1}P(Y=1S=1),

it suffices to show that

E[(1)1Aq(A,Z,X)expβ0AU,X,Y=1,S=1]=0.

By Assumption 2, the left-hand side is

E[eβ0A(1)1Aq(A,Z,X)U,X,Y=1]=E{eβ0A(1)1AE[q(A,Z,X)A,U,X,Y=1]U,X,Y=1}=A.4E{eβ0A(1)1AE[q(A,Z,X)A,U,X]U,X,Y=1}=A.5E{eβ0A(1)1A1P(AU,X)U,X,Y=1}=0

The last equality is proved in Appendix A.

D. Treatmeng bridge function and estimation with categorical NCE, NCO and unmeasured confounders.

We consider a categorical unmeasured confounder U with categories u1,,uJ and NCE Z with categories z1,,zI. We assume there are no other covariates X. We write pia.k=P(Z=zi,A=aU=uj) for =1,,I, a=0,1, and j=1,,J.

Similar to before, the treatment confounding bridge function q(a,z) should satisfy

i=1Ipiajqa,zi=1 (22)

for all i,j. Therefore, any solution of the following equation system, if exists, is a treatment confounding bridge function:

p1a.1qa,z1+p2a.1qa,z2++pIa.1qa,zI=1p1a.2qa,z1+p2a.2qa,z2++pIa.2qa,zI=1p1a.Jqa,z1+p2a.Jqa,z2++pIa.Jqa,zI=1 (23)

for a=0,1. We denote the probability matrix

PZ,AU=p1a.1p2a.1pIa.1p1a.2p2a.2pIa.2p1a.lp2a.JpIa.J. (24)

A treatment confounding bridge function exists if the matrix PZ,AU is invertible.

Suppose besides a categorical NCE Z with levels z1,,zI, we also have a categorical NCO W with levels w1,,wK. The integral equation (12) is equivalent to the linear equation system

p1a.1q*a,z1+p2a.1q*a,z2++pIa.1q*a,zI=1p1a.2q*a,z1+p2a.2q*a,z2++pIa.2q*a,zI=1p1a.Kq*a,z1+p2a.Kq*a,z2++pIa.Kq*a,zI=1 (25)

for a=0,1, where pia.k=PZ=zi,A=aW=wk,Y=0,S=1. We denote that matrix

Pa=p1a.1p2a.1pIa.1p1a.2p2a.2pIa.2p1a.Kp2a.KpIa.K. (26)

Then Assumption 8 is equivalent to the condition that P(a) is invertible with I=K for a=0,1, in which case (12) has a unique solution

q*a,z1,,q*(a,zI)T=P(a)11I.

The probabilities pia.k’s can all be estimated from the study data.

E. Derivation of the treatment confounding bridge function in Example 2

By Assumption 5, the treatment confounding bridge function q(A,Z,X) should satisfy

E[q(A,Z,X)U=u,A=a,X=x]=1P(A=aU=u,X=x)

for all a,u and X.

We write q(A,Z,X)=1+r(Z,A,X), then

E[r(Z,a,X)U,A=a,X]=1P(A=aU,X)P(A=aU,X).

Consider

rZ,A,X=exp((1)Aτ0+τ1A+τ2Z+τ2X),

then

P(A=1U=u,X=x)1P(A=1U=u,X=I)=τ(z,0,x)f(zU,X,A=0)dz=expτ0+τ2z+τ3x12πσz2expzμ0zμUZuμXZx22σz2dz=expτ0+τ3xexpτ2μ0Z+μUZu+μXZx+σz2τ222=exp(τ0+τ2μ0Z+σz2τ222+τ2μUZu+τ3+μXZT2x)

and

1P(A=1U=u,X=x)P(A=1U=u,X=x)=τ(z,1,x)f(zu,I,A=1)dz=expτ0τ1τ2zτ3x12πσz2expzμ0ZμAZμUZuμXX22σz2dz=expτ0τ1τ3xexpτ2μ0Z+μAZ+μUZu+μXZx+σz2τ222=exp(τ0τ1τ2μ0Zτ2μAZ+σz2τ222τ2μUZuτ3+τ2μXZx)

This requires that

τ1+τ2μAZ=σz2τ22

Because

P(A=1U,X)=expitμ0A+μUAU+μXAX,

we conclude the parameters in the bridge function are

τ2=μUA/μUZ,
τ3=μXAμXZτ2=μXAμXZμUA/μUZ
τ1=σz2τ22τ2μAZ=σZ2μUA2μUZ2μUAAμAZμUZ
τ0=μ0Aτ2μ0Zσz2α222=μ0AμUAOZμ0ZμUZσZ2μUA22μUZ2

F. Proof of Theorem 2 and Corollary 1

We first introduce a few properties due to the rare disease assumption:

Lemma 1.

Under Assumptions 2, 4, 6 and 7, for every , w,u and x, we have

  1. P(Y=1W=w,U=u,X=x)<δ,P(Y=1A=a,U=u,X=x)<δP(Y=1U=U,X=x)<δ

  2. 1δ<P(A=aU=U,X=x,Y=0,S=1)P(A=aU=u,X=x)<11δ

  3. (1δ)2<P(A=aW=w,U=u,X=x,Y=0,S=1)P(A=aU=u,X=x,Y=0,S=1)<1(1δ)2.

  4. f(zA=a,U=u,X=x)=f(zW=w,A=a,U=U,X=x,Y=0,S=1).

Proof.

  1. For every a, w,u and x, we have
    P(Y=1W=w,U=u,X=x)=aP(Y=1|A=a,W=w,U=u,X=|x)P(A=aW=w,U=u,X=x)<A.7δaP(A=a|W=w,U=u,|X=x)=δ.
    The rest follows similarly.
  2. For every a, u and x, we have
    P(A=aU=u,X=x,Y=0)=P(A=aU=u,X=x)×P(Y=0A=a,U=u,X=x)P(Y=0U=u,X=x).
    By Lemma 1(a), we have
    1δ<P(Y=0A=a,U=u,X=x)P(Y=0U=u,X=x)=P(A=aU=u,X=x,Y=0)P(A=aU=u,X=x)<11δ.
    The result follows by noticing that P(A=aU=U,X=x,Y=0)=P(A=aU=U,X=x,Y=0,S=1) due to Assumption 2.
  3. For every a, w, u and x, we have
    P(A=aW=w,U=u,X=x,Y=0)=P(A=aW=w,U=u,X=x)×P(Y=0A=a,W=w,U=u,X=x)P(Y=0W=w,U=u,X=x)=A.6(a)P(A=aU=u,X=x)×P(Y=0A=a,W=w,U=u,X=x)P(Y=0W=w,U=u,X=x)=P(A=aU=u,X=x,Y=0)×P(Y=0U=u,X=x)P(Y=0A=a,U=u,X=x)×P(Y=0A=a,W=w,U=u,X=x)P(Y=0W=w,U=u,X=x)
    By Lemma 1(a), we have
    1δ<P(Y=0U=u,X=x)P(Y=0A=a,U=u,X=x)<11δ
    and
    1δ<PY=0A=a,W=w,U=u,X=xPY=0W=w,U=u,X=x<11δ.
    We therefore have
    (1δ)2<PA=aW=w,U=u,X=x,Y=0PA=aU=u,X=x,Y=0<1(1δ)2.
    Finally, by Assumptions 2 and 6(c), we have
    PA=aU=u,X=x,Y=0=PA=aU=u,X=x,Y=0,S=1,PA=aW=w,U=u,X=x,Y=0=PA=aW=w,U=u,X=x,Y=0,S=1.
    We conclude that
    (1δ)2<P(A=aW=w,U=u,X=x,Y=0,S=1)P(A=aU=u,X=x,Y=0,S=1)<1(1δ)2.
  4. f(zW=w,A=a,U=u,X=x,Y=0,S=1)
    =A.6(c)f(zW=w,A=a,U=u,X=x,Y=0)
    =A.6(b)f(zA=a,U=u,X=x)
    Therefore, we have
    E{q(a,Z,X)A=a,W,X,Y=0,S=1}=q(a,z,X)f(zA=a,W,X,Y=0,S=1)dz=q(a,z,X)f(zA=a,W,U=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)dzdu=L.1(d){q(a,z,X)f(zA=a,U=u,X)du}f(uA=a,W,X,Y=0,S=1)dz=A.51P(A=aU=u,X)f(uA=a,W,X,Y=0,S=1)du<L.1(b)11δ1P(A=aU=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du<L.1(c)1(1δ)31P(A=aW,U=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du=1(1δ)3f(uW,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)P(A=aW,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du=1(1δ)31P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du=1(1δ)31P(A=aW,X,Y=0,S=1)
    and
    E{q(a,Z,X)A=a,W,X,Y=0,S=1}=1P(A=aU=u,X)f(uA=a,W,X,Y=0,S=1)du>L.1(b)(1δ)1P(A=aU=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du>L.1(c)1δ31P(A=aW,U=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du=1δ3f(uW,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)P(A=aW,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du=1δ31P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du=1δ31P(A=aW,X,Y=0,S=1).
    To prove Corollary 1, we have
    E{q(a,Z,X)A=a,W,X,Y=0,S=1}=1P(A=aU=u,X)f(uA=a,W,X,Y=0,S=1)du=P(A=aU=u,X,Y=0,W,S=1)P(A=aU=u,X)P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du=A.6P(A=aU=u,X,Y=0,W)P(A=aU=u,X)P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du
    If AYU,X, then together with Assumption 6 we have P(A=aU=u,X,Y=0,W)=P(A=aU=u,X), whereby the above equals
    1P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du=1P(A=aW,X,Y=0,S=1).

G. Discussion of Assumption 8

Similar to (7), Equation (12) defines a Fredholm integral equation of the first kind, yet only involves observed data. Although a treatment confounding bridge function q(A,Z,X) must satisfy (12), there is no guarantee that solving (12) gives a treatment confounding bridge function if multiple solutions exist. When a solution to (12) exists, we give the following assumptions that guarantee the uniqueness of solution.

Assumption 10 (Completeness).

  1. For any square-integrable function g, if E[g(Z)A,U,X]=0 almost surely, then g(Z)=0 almost surely;

  2. For any square-integrable function h, if E[h(U)A,W,X,Y=0,S=1] almost surely, then h(U)=0 almost surely.

Intuitively, Assumption 10 is a statement on the information contained in the unmeasured confounders vs. in the negative control variables - Assumption 10(a) requires the confounders U are informative enough about Z in the target population and Assumption 10(b) requires the NCO W is informative enough about U in the subgroup Y=0,S=1, so that no information is lost when taking the two conditional expectations. Completeness conditions similar to Assumption 10 were originally introduced by Lehmann and Scheffe to identify the so-called unbiase minimum risk estimator (Lehmann and Scheffé, 2012a,b.). In econometrics and causal inference literature, completeness conditions have been employed to achieve identifiability for a variety of nonparametric or semiparametric models, such as instrumental variable regression (D’Haultfoeuille, 2011; Newey and Powell, 2003), measurement error models (Hu and Schennach, 2008), synthetic control (Shi et al., 2021), and previous works in negative control methods (Cui et al., 2020; Miao, Shi, and Tchetgen Tchetgen, 2018: Ying et al., 2021]. The completeness condition holds for a wide range of distributions. Newey and Powell (2003) and D’Haultfoeuille (2011) provided justifications in exponential families and discrete distributions with finite support. Andrews (2011) constructed a broad class of bivariate distributions that satisfy the completeness condition.

Assumption 10 have several immediate consequences:

Proposition 2.

  1. Under Assumptions 4, 6 and 10, for any square integrable function g such that E|g(Z)|A,W,X,S=0,Y=1]=0 almost surely, then g(Z)=0 almost surely.

  2. Under Assumptions 10(a) and 15, the treatment confounding bridge function is unique. That is, if two square-integrable functions q(A,Z,X) and q1(A,Z,X) satisfy
    E[q(a,Z,x)A=a,U=u,X=x]=E[q1(a,Z,x)A=a,U=u,X=x]=1P(A=aU=u,X=x)
    for all a,u,x almost surely, then q(A,Z,X)=q1(A,Z,X) almost surely.
  3. Under Assumptions 4, 8 and 10, Equation (12) has a unique solution q*(A,Z,X).

We prove Proposition 2 below. Proposition 2 states that the completeness conditions in Assumption 10 and the definitions of the negative control variables lead to a third completeness condition that only involves the study data. Proposition 2 states the uniqueness of the treatment confounding bridge function, q(A,Z,X), and the solution to (12),q*(A,Z,X). The function q*(A,Z,X) can therefore be identified from the study data alone and is a good approximation of q(A,Z,X) by Theorem 2.

Proof.

  1. Suppose a square-integrable function g(Z) satisfies
    E[g(Z)A,W,X,Y=0,S=1]=0almostsurely.
    The left-hand side equals
    E{E[g(Z)U,A,W,X,Y=0,S=1]A,W,X,Y=0,S=1}=A.6E{E[g(Z)U,A,X,Y=0,S=1]A,W,X,Y=0,S=1}=A.4E{E[g(Z)U,A,X]A,W,X,Y=0,S=1}
    By Assumption 10(b), E[g(Z)U,A,X]=0 almost surely. Then by Assumption 10(a), g(Z)=0 almost surely.
  2. We have
    Eq(A,Z,X)q1(A,Z,X)A,U,X=0
    almost surely. By Assumtion 10(a), we have
    q(A,Z,X)q1(A,Z,X)=0
    almost surely, or q(A,Z,X)=q1(A,Z,X) almost surely.
  3. If two square-integrable functions q*(A,Z,X)=q1*(A,Z,X) satisfies
    Eq*(A,Z,X)A,W,X,Y=0,S=1=Eq1*(A,Z,X)A,W,X,Y=0,S=1=1P(AW,X,Y=0,S=1)
    almost surely, then
    Eq*(A,Z,X)q1*(A,Z,X)A,W,X,Y=0,S=1=0.
    Under Assumptions 4, 6 and 10. Proposition 2(a) holds and therefore
    q*(A,Z,X)=q1*(A,Z,X)
    almost surely. □

H. Proof of Corollary 2 and further discussion

We first prove equation (13)

Em(W,A,X)q*(A,Z,X)m(W,1,X)m(W,0,X)Y=0,S=1=Em(W,A,X)Eq*(A,Z,X)W,A,X,Y=0,S=1m(W,1,X)m(W,0,X)Y=0,S=1=(12)Em(W,A,X)1P(AW,X,Y=0,S=1)m(W,1,X)m(W,0,X)Y=0,S=1=EEm(W,A,X)1P(AW,X,Y=0,S=1)W,X,Y=0,S=1m(W,1,X)m(W,0,X)Y=0,S=1=E{m(W,1,X)+m(W,0,X)m(W,1,X)m(W,0,X)Y=0,S=1}=0

In fact, one can show that any regular and asymptotically normal estimator of τ that satisfies (12) has influence function of the form

IF(W,Z,A,X)=Eq(A,Z,X;τ)ττ=τ0m(W,A,X)(1Y)S=11(1Y)(m(W,A,X)q(A,Z,X)m(W,1,X)m(W,0,X))

for an arbitrary function m(W,A,X). Therefore, any regular and asymptotically normal estimator of τ corresponds to the solution of the estimating equation (14) for some function m(W,A,X).

To prove this result, we see that for any parametric submodel that satisfies (12) and is indexed by s such that the true distribution corresponds to s=0, we have

Es(1Y)qA,Z,X;τs1fs(AW,X,Y=0,S=1)m(W,A,X)S=1=0

and thus

Es(1Y)qA,Z,X;τs1fs(AW,X,Y=0,S=1)m(W,A,X)S=1/ss=0=0.

Note that

sEs(1Y)qA,Z,X;τsm(W,A,X)S=1=Eq(A,Z,X;τ)ττ=τ0m(W,A,X)(1Y)S=1τsss=0+E[(1Y)q(A,Z,X)m(W,A,X)S(OS=1)S=1]

and

sEs(1Y)m(W,A,X)fs(AW,X,Y=0,S=1)S=1=E(1Y)m(W,A,X)sfs(AW,X,Y=0,S=1)f2(AW,X,Y=0,S=1)S=1+Em(W,A,X)(1Y)f(AW,X,Y=0,S=1)S(W,A,X,Y=0S=1)S=1=Em(W,A,X)(1Y)f(AW,X,Y=0,S=1)S(AW,X,Y=0,S=1)S=1+=Em(W,A,X)(1Y)f(AW,X,Y=0,S=1)S(W,X,Y=0S=1)S=1=E[(1Y){m(W,1,X)m(W,0,X)}S(W,X,Y=0S=1)S=1]=E[(1Y){m(W,1,X)m(W,0,X)}S(OS=1)S=1]

Rearranging the terms, we have

τsss=0=EEq(A,Z,X;τ)ττ=τ0m(W,A,X)(1Y)S=11(1Y)(m(W,A,X)q(A,Z,X)m(W,1,X)m(W,0,X))S(OS=1)S=1]

I. Discussion on the rare disease assumption 7

We first describe a crucial identity that links the log risk ratio β0 and the treatment confounding bridge function q(A,Z,X).

Lemma 2.

Under Assumptions 1, 2, 3 and 5, we have

β0=logE[q(A,Z,X)I(A=1,Y=1)S=1]E[q(A,Z,X)I(A=0,Y=1)S=1] (27)

Proof.

The right hand side of (27) is

logE[q(A,Z,X)I(A=1,Y=1)S=1]E[q(A,Z,X)I(A=0,Y=1)S=1]=logE[q(A,Z,X)A=1,Y=1,S=1]E[q(A,Z,X)A=0,Y=1,S=1]+logP(Y=1,A=1S=1)P(Y=1,A=0S=1)

Note that for a=0,1,

E[q(A,Z,X)A=a,Y=1,S=1]=E{E[q(A,Z,X)U,X,A=a,Y=1,S=1]A=a,Y=1,S=1}=A.2E{E[q(A,Z,X)U,X,A=a]A=a,Y=1,S=1}=A.5E1P(A=aU,X)A=a,Y=1,S=1=1P(A=aU=u,X=x)f(U,XA=a,Y=1,S=1)dudx=1P(A=aU=u,X=x)×P(A=aU=u,X=x)P(Y=1A=a,U=u,X=x)P(S=1Y=1,A=a,U=u,X=x)P(A=a,Y=1,S=1)dudx=A.2,A.3expβ0aP(Y=1A=0,U=u,X=x)P(S=1Y=1,U=u,X=x)P(A=a,Y=1,S=1)dudx=expβ0aP(A=a,Y=1,S=1)P(Y=1A=0,U=u,X=x)P(S=1Y=1,U=u,X=x)dudx.

Therefore, the right-hand side of (27) equals

β0+logP(A=0,Y=1,S=1)P(A=1,Y=1,S=1)+logP(Y=1,A=1S=1)P(Y=1,A=0S=1)=β0+logP(A=0,Y=1S=1)P(S=1)P(A=1,Y=1S=1)P(S=1)+logP(Y=1,A=1S=1)P(Y=1,A=0S=1)=β0

Let q* be the function that satisfies (12):

Eq*(a,Z,X)A=a,W,X,Y=0,S=1=1P(A=aU=u,X,Y=0,S=1).

We introduce an additional regularity condition:

Assumption 11

(Uniform continuity). For any fixed positive square-integrable function g(U) and a small positive number 0<η<1, there exists some 0<γ=γ(g,η)<0 such that 1γ<Eg1(U)A,W,X,Y=0,S=1E[g(U)A,W,X,Y=0,S=1]<11γ a.e. implies 1η<g1(U)g(U)<11η a.e., where g1(U) is a positive square integrable function.

Assumption 11 requires that the inverse mapping of gE[g(U)A,W,X,Y=0,S=1] is sufficiently smooth. By Theorem 2 and Assumption 8, we have

(1δ)3<Eq*(A,Z,X)A,W,X,Y=0,S=1E[q(A,Z,X)A,W,X,Y=0,S=1]=EEq*(A,Z,X)A,W,U,X,Y=0,S=1A,W,X,Y=0,S=1E{E[q(A,Z,X)A,W,U,X,Y=0,S=1]A,W,X,Y=0,S=1}=A.6EEq*(A,Z,X)A,U,XA,W,X,Y=0,S=1E{E[q(A,Z,X)A,U,X]A,W,X,Y=0,S=1}=A.5EEq*(A,Z,X)A,U,XA,W,X,Y=0,S=1E{E[1P(AU,X)A,U,X]A,W,X,Y=0,S=1}<1(1δ)3

By Assumption 11, this implies

1η(δ)P(AU,X)<Eq*(A,Z,X)A,U,X<1(1η(δ))P(AU,X). (28)

The constant η(δ) is determined by the smoothness of the inverse mapping of gE|g(U)|A,W,X,Y= 0,S=1]

Eq*(A,Z,X)U=u,X,Y=0,S=1=1P(A=aU=u,X,Y=0,S=1) (29)

for almost all a and u. Let

β0*=logEq*(A,Z,X)AYS=1Eq*(A,Z,X)(1A)YS=1=logEq*(A,Z,X)A=1,Y=1,S=1Eq*(A,Z,X)A=0,Y=1,S=1+logP(Y=1,A=1S=1)P(Y=1,A=0S=1).

Under mild regularity, the estimator βˆ is regular and asymptotically linear for β0*, and therefore

β^=β0*+Op(1/n)=β0+(β0*β0)+Op(1/n).

It suffices to study

β0*β0=logEq*(A,Z,X)A=1,Y=1,S=1Eq*(A,Z,X)A=0,Y=1,S=1logE[q(A,Z,X)A=1,Y=1,S=1]E[q(A,Z,X)A=0,Y=1,S=1]

Notice that

Eq*(A,Z,X)A=a,Y=1,S=1=EEq*(A,Z,X)A=a,U,X,Y=1,S=1A=a,Y=1,S=1=A.4EEq*(A,Z,X)A=a,U,XA=a,Y=1,S=1=A.4EEq*(A,Z,X)A=a,U,X,Y=0A=a,Y=1,S=1<(28)11η(δ)E1P(A=aU,X)A=a,Y=1,S=1=A.511η(δ)E{E[q(A,Z,X)A,U,X]A=a,Y=1,S=1}=A.411η(δ)E{E[q(A,Z,X)A,U,X,Y=1,S=1]A=a,Y=1,S=1}=11η(δ)E[q(A,Z,X)A=a,Y=1,S=1]

and similarly,

Eq*(A,Z,X)A=a,Y=1,S=1>(1η(δ))E[q(A,Z,X)A=a,Y=1,S=1].

Therefore, we have

1ηδ<E[q*A,Z,XA=a,Y=1,S=1]EqA,Z,XA=a,Y=1,S=1<11ηδ.

We conclude that

2log(1η(δ))=log((1ϵ)2)<β0*β0<log1(1ϵ)2=2log(1ϵ)

and thus

|β^β0|<2log(1η(δ))+Op(1n).

J. Regularity conditions and proof of Theorem 3

We denote τ0* as the true value of τ such that qA,Z,X;τ0*=q*(A,Z,X). We will give the regularity conditions and proof that (β^,τ^) is a regular and asymptotically linear estimator of (β0*,τ0*). Here β0* and q* are the biased versions of β0 and q defined in Appendix I respectively, although the biases are negligible when the infection is rare. Following Appendix I, β^ is also a regular and asymptotically linear estimator of β0 if

supa,w,u,zP(Y=1A=a,W=w,U=u,X=x)<δn,

Assumption 11 holds and

log1τ(δn=op(1n).

A set of regularity conditions are

R.1 The function τqA,Z,X;τ is Lipschitz in a neighborhood of τ0*; that is, for every τ1 and τ2 in a neighborhood of τ0 and a measurable function q˙A,Z,X with Eq˙A,Z,X<, we have qA,Z,X;τ1qA,Z,X;τ2q˙A,Z,Xτ1τ2;

R.2 E[qA,Z,X;τ0*2]< and E[m(W,A,X)2]<;

R.3 The function τq(A,Z,X;τ) is differentiable at τ0*. The derivative matrix Ω(β0*,τ*) is nonsingular;

R.4 41n1=1nGi(β^,τ^)=opn1/2 and (β^,τ^)P(β0*,τ0*).

The condition R.1 and the fact that βexp(βA) is Lipschitz in a neighborhood of β0* imply the function (β,τ)Mi(β,τ) is Lipschitz in a neighborhood of (β0*,τ0*) for every i. The remaining proof follows Van der Vaart (2000) Theorem 5.21.

K. Estimating conditional causal RR in the presence of effect modification by measured confounders

Algorithm 2 below describes a straightforward extension of Algorithm 1 to estimation the conditional vaccine effectiveness VE(x)=1expβ0(x) under Assumption 9 and a parametric model β0(X;α) indexed by a finite-dimensional parameter α.

Algorithm 2.

Negative control method to estimate conditional vaccine effectiveness from a test-negative design

1: Identify the variables in the data acoording to Figure 1)(c), in particular the NCEs and NCOs.
2: Estimate the treatment confounding bridge function by solving the equation (14) with a suitable parametric model q*(A,Z,X;τ) and a user-specified function m(W,A,X). Write τˆ as the resulting estimate of τ.
3: Estimate α by solving
1ni=1n(1)1AiCXiq*Ai,Zi,Xi;τˆexpβ0Xi;αAi=0 (30)
Denote the resulting estimate of α as α^. The estimated conditional vaccine effectiveness is
VE^x=1expβ0x;α^.

We describe the large-sample properties of the estimator (α^,τˆ) in the theorem below.

Theorem 5

(Inference of (α^,τˆ)). Under Assumptions 12, 45, 6, 9 and suitable regularityconditions listed at the end of this section, the estimator (α^,τˆ) in Algorithm 2, or equivalently, the solution to the estimating equation 1ni1nG˜iα,τ=0 is regular and asymptotically linear with influence function

IF˜(α,τ)=Ω˜(α,τ)TΩ˜(α,τ)1Ω˜(α,τ)TG˜i(α,τ),

where

G˜i(α,τ)=(1)1AiCXiq*Ai,Zi,Xi;τYiexpβ0(X;α)Ai1YimWi,Ai,Xiq*Ai,Zi,Xi;τmWi,1,XimWi,0,Xi

and

Ω˜(α,τ)=EG˜i(α,τ)αT,EG˜i(α,τ)τT.

Here C(X) is a user-specified function of X with the sample dimension as α.

Suppose that in Algorithm 2, one specifies β0(X;α)=XTα, then a natural choice for C(X) is C(X)=X. A sandwich estimator of the asymptotic variance of (αˆ,τˆ) can be deduced from previous derivations. Under Assumption 9, we have shown that one can identify VE(X), however, one may be unable to identify the population marginal VE without an additional assumption. Interestingly, we note that the population marginal risk ratio would remain non-identified even if one had access to a random sample from the target population to inform the marginal distribution of X. Specifically, as shown in Huitfeldt, Stensrud, and Suzuki (2019), the marginal RR=E[Y(1)]/E[Y(0)] can be written as RR=E[RR(X)Y(0)=1], i.e. the average risk ratio among subjects who would contract say Influenza had they possibly contrary to fact, not been vaccinated against Influenza. However, the distribution of X within the group Y(a=0)=1 cannot be identified in presence of unmessured confounding, thus ruling out identification of the population marginal RR.

Define q* as before and

β0*(x)=logE[q*(A,Z,x)I(A=1,Y=1)S=1,X=x]Eq*(A,Z,x)I(A=0,Y=1)S=1,X=x.

Denote α0* as the value of α such that β0(x;α0*)=β0*(x). Below we give the set of regularity conditions such that (α^,τ^) is a regular and asymptotically linear estimator of α0*,τ0*:

R’.1 The function τq(A,Z,X;τ) is Lipschitz in a neighborhood of τ0* and αβ0(X;α) is Lipschitz in a neighborhood of α0*.

R’.2 E[qA,Z,X;τ0*2]<,E[m(W,A,X)2]<,E[C(X)2]< and exp2β0X;α0*< . :

R’.3 The function τq(A,Z,X;τ) is differentiable at τ0* and αβ0(X;α) is differentiable at α0*. The derivative matrix Ω˜α0*,τ* is nonsingular;

R’.4 1ni=1nG˜i(α^,τ^)=opn1/2 and (α^,τˆ)pα0*,τ0*.

L. Proof of Theorem 1′

Lemma 3.

Under Assumptions 2′,3′ and 6′, we have

P(A=a,Y=yU,W,X,S=1)=1cP(A=aY=0,U,W,X,S=1)P(Y=yA=0,U,W,X,S=1)exp(β0ay)

where c=a*,y*PA=a*Y=0,U,W,X,S=1PY=y*A=0,U,W,X,S=1exp(β0a*y*).

Proof.

Note that

P(Y=1A=1,U,W,X,S=1)P(Y=0A=0,U,W,X,S=1)P(Y=1A=0,U,W,X,S=1)P(Y=0A=1,U,W,X,S=1)=A.6P(Y=1A=1,U,X,S=1)P(Y=0A=0,U,X,S=1)P(Y=1A=0,U,X,S=1)P(Y=0A=1,U,X,S=1)=exp(β0)×P(S=1Y=1,A=1,U,X)P(S=1Y=0,A=1,U,X)×P(S=1Y=0,S=0,U,X)P(S=1Y=1,S=0,U,X)=A.2exp(β0)×exp(h(U,X))×exp(h(U,X))=exp(β0)

The result follows after Chen (2003). □

To prove Theorem 1′, we need to show

E[(1)1Aq(A,Z,X)YexpβaAS=1]=0

It suffices to prove that

E[(1)1Aq(A,Z,X)YexpβaAU,W,X,S=1]=0.

The left-hand side is

a,y(1)1ayexp(β0a)q(a,z,X)P(A=a,Y=yU,X,S=1)f(zU,X,A=a,Y=y,S=1)dz=L.3a,y(1)1ayexp(β0a)q(a,z,X)×1cP(A=aY=0,U,X,S=1)P(Y=yA=0,U,X,S=1)exp(β0ay)f(zU,X,A=a,Y=y,S=1)dz=a(1)1aexp(β0a)q(a,z,X)×1cP(A=aY=0,U,X,S=1)P(Y=1A=0,U,X,S=1)exp(β0a)f(zU,X,A=a,Y=1,S=1)dz=a(1)1acP(A=aY=0,U,X,S=1)P(Y=1A=0,U,X,S=1)×q(a,z,X)f(zU,X,A=a,Y=1,S=1)dz=A.5a(1)1acP(A=aY=0,U,X,S=1)P(Y=1A=0,U,X,S=1)×=a(1)1acP(Y=1A=0,U,X,S=1)=01

M. Proof of Theorem 2′

E[q˜(a,Z,X)|A=a,W,X,Y=0,|S=1]=E{E[q˜(a,Z,X)A=a,U,W,X,Y=0,S=1]A=a,W,X,Y=0,S=1}=A.6E{E[q˜(a,Z,X)A=a,U,X,Y=0,S=1]A=a,W,X,Y=0,S=1}=A.4E{E[q˜(a,Z,X)A=a,U,X]A=a,W,X,Y=0,S=1}=A.5E{1P(A=aU,X,Y=0,S=1)A=a,W,X,Y=0,S=1}=1P(A=aU=u,X,Y=0,S=1)f(uA=a,W,X,Y=0,S=1)du=1P(A=aU=u,X,Y=0,S=1)f(uW,X,Y=0,S=1)P(A=aU=u,W,X,Y=0,S=1)P(A=aW,X,Y=0,S=1)du=A.61P(A=aU=u,X,Y=0,S=1)f(uW,X,Y=0,S=1)P(A=aU=u,X,Y=0,S=1)P(A=aW,X,Y=0,S=1)du=1P(A=aW,X,Y=0,S=1)f(uW,X,Y=0,S=1)du=1P(A=aW,X,Y=0,S=1)

N. Simulation setting with binary unmeasured confounder

We generate the data of a size N=7,000,000 general population according to Figure 1, with the following distribution:

UBernoullipU;ZUBernoullip0Z+pUZU;AUBernoullip0A+pUAU;YA,YBernoulliexpηOY+β0A+ηUYU;WUBernoullip0W+pUWU;DUBernoullip0D+pUDU;SY,D,W,UBernoullimax(Y,D,W)pYS+pUYSU.

In the above data generating process, to mimic a test-negative design platform, we created a binary indicator D for flu-like disesses other than W. The study sample contains subjects with S=1. The distribution of S indicates that only subjects with at least one of Y,D and W equal to one will be recruited into the study sample. We chose the parameters as in Table 1, which resulted in an average study sample size of between around 48,000 and around 52,000.

Table 1:

Parameter values of the data generating distribution in the simulation with a binary unmeasured confounder.

Parameter Value Parameter Value

pU 0.5 P0Z 0.2
pUZ 0.4 p0A 0.2
pUA 0.4 η0Y log(0.01)
β0 log(0.2),log(0.5),log(0.7), or 0 ηUY log(0.5)
p0W 0.02 pUW 0.02
p0D 0.02 pUD −0.015
pYS 0.1 pUYS 0.4

To obtain the true treatment confounding bridge function, by (7), we have

zq(a,z)f(zu,a)=zq(a,z)f(au,z)f(zu)/f(au)=1/f(au)

and thus

zq(a,z)f(au,z)f(zu)=q(a,0)p0A+pUAua1p0A+pUAu1a[1(p0Z+pUZu)]+q(a,1)p0A+pUAua1p0A+pUAu1ap0Z+pUZu=1

for each u,a. We obtain that

q(0,0)q(0,1)=1p0A1p0Z1p0Ap0Z1p0A+pUA1p0Z+pUZ1p0A+pUAp0Z+pUZ111q(1,0)q(1,1)=p0A1p0Zp0AAp0Zp0A+pUA1p0Z+pUZp0A+pUAp0Z+pUZ111

O. Simulation setting with a continuous unmeasured confounder

In simulation studies with a continuous unmeasured confounder, we generate data for a general population of 7,000,000 individusls following the distribution below:

UUUiform0,1;XUniform(0,1);AU,XBinomialexpitμ0A+μUAU+μXAXZA,U,XNμ0Z+μAZA+μXZX+μUZU,σZ2;WU,XNμ0W+μXWX+μUWU,σW2;YA,U,XBinomialexpitμ0Y+βA+μUYU+μXYX+μUXYUXDU,XBinomialexpitμ0D+μXDX+μUDUSU,X,Y,Dmax(Y,D)×Binomialexpitμ0S+μXSX+μUSU+μUXSUX.

We chose the parameter values according to Table 2, which results in an average study sample size of between around 43,000 and around 47,000.

Table 2:

Parameter values of the data generating distribution in the simulation with a continuous unmeasured confounder.

Parameter Value Parameter Value

μ0A −1 μUA −1
μXA 0.25 μ0Z 0
μAZ 0.25 μXZ 0.25
μUZ 4 σZ 0.25
μ0Y log(0.01) β0 log(0.2),log(0.5),log(0.7), or 0
μUY −2 μXY −0.25
μ0W 0 μXW 0.25
μUW 2 σW 0.25
μ0D log(0.01) μXD 0.25
μUD −0.2 μ0S −1.4
μXS 0.5 μUS 2
μUXS 1

P. Logistic regression as a naïve approximate estimator of log risk ratio, ignoring unmeasured confounders

Figure 3 shows a causal diagram of a test-negative design with no ummeasured confounders. Again we assumed selection into the study S is independent of the subjects’ treatment status A, given the subjects’ infection status Y and other covariates X.

Figure 3:

Figure 3:

Directed acylic graph of a test-negative design with no unmessured confounders.

In this scenario, we note that the conditional odds ratio given X in the study population equals the conditional odds ratio in the general population, i.e.

P(Y=1A=1,X,S=1)P(Y=1A=0,X,S=1)/P(Y=0A=1,X,S=1)P(Y=0A=0,X,S=1)=P(Y=1A=1,X)P(S=1Y=1,A=1,X)P(Y=1A=0,X)P(S=1Y=1,A=0,X)/P(Y=0A=1,X)P(S=1Y=0,A=1,X)P(Y=0A=0,X)P(S=1Y=0,A=0,X)=A.2P(Y=1A=1,X)P(S=1Y=1,X)P(Y=1A=0,X)P(S=1Y=1,X)/P(Y=0A=1,X)P(S=1Y=0,X)P(Y=0A=0,X)P(S=1Y=0,X)=P(Y=1A=1,X)P(Y=1A=0,X)/P(Y=0A=1,X)P(Y=0A=0,X)

This implies that we may estimate the stratum-specific odds ratio in the general population by fitting a logistic regression model to the study data:

P(Y=1A,X,S=1)=expitγ0+γXX+γAA

where expit(x)=exp(x)/[1+exp(x)], and γA is the log odds ratio. When the outcome is rare in the general population, i.e. P(Y=0A=a,X=x)1 for any a,x, the log odds ratio γA also approximates the log risk ratio.

Q. Simulation for non-rare diseases

To investigate the performance of our method for non-rare diseases, we repeat the simulation with the same setup with binary or continuous confounders. For the simulation with binary confounders, we set to η0Y to be log(0.20); for the simulation with continuous confounders, we set μ0Y to be log(0.20) – both correspond to an infection risk of 20% for subjects with A=U=X=0

The results are in Figure 4 While the NC-Oracle estimator remains unbiased and maintains calibrated confidence intervals, the NC estimator is in general biased with under-covered confidence intervals. Notably, the NC estimator is umbiased with calibrated confidence intervals under the null hypothesis where β0=0. The logistic regression estimator is biased and has unpredictable behavior.

Figure 4:

Figure 4:

Bias (left) and coverage rates of 95% confidence interval (right) for the oracle estimator (NC-Oracle), GMM estimator (NC-GMM) and logistic regression (Logit Reg-) with a binary or unmessured confounder, where π0y or μ0y is log(0.20).

R. Detailed results of University of Michigan Health System Data analysis

Table 3:

Descriptive statistics of University of Michigan Health System COVID-19 Data. Variables were summarized as counts (percentage%).

Unvaccinated (N=777) Vaccinated (N=39,377)

Vaccine types
 Pfizer-BioNTech / 20,239 (51.4%)
 Moderna / 10,719 (27.2%)
 Johnson & Johnson’s Janssen / 1,405 (3.6%)
 Other / 7,014 (17.8%)
COVID-19 Infection 456 (58.7%) 2,752 (7.0%)
NCE: Immunization before Dec 2020 433 (55.7%) 18,049 (45.8%)
NCO conditions
 Arm/leg cellulitis 7 (0.9%) 158 (0.4%)
 Eye/ear disorder 6 (0.8%) 514 (1.3%)
 Gastro-esophageal disease 51 (6.6%) 3,164 (8.0%)
 Atopic dermatitis 1 (0.1%) 41 (0.1%)
 Injuries 62 (8.0%) 3,652 (9.3%)
 General adult examination 72 (9.3%) 4,657 (11.8%)
No. of NCO conditions ≥ 1 160 (20.6%) 10,280 (26.1%)
Age
 ≤ 18 93 (12.0%) 2,602 (6.6%)
 ≥ 18, < 60 543 (69.9%) 23,286 (59.1%)
 ≥ 60 141 (18.1%%) 13,489 (34.2%)
Male 332 (42.7%) 16,151 (41.0%)
White 557 (71.7%) 30,562 (77.6%)
Charlson score ≥ 3 50 (6.5%) 3,175 (8.1%)

Table 4:

Logistic regression of COVID-19 infection on COVID-19 vaccination, the NCE (previous immunization) and other baseline covariates.

Est. S.E. p-value

(Intercept) 0.26 0.12 <0.036
COVID-19 vaccination −3.17 0.08 <0.001
Previous Immunization 0.16 0.04 <0.001
Age ≥18, ≤60 0.58 0.09 <0.001
Age ≥60 0.33 0.09 <0.001
Male 0.06 0.04 0.149
White 0.23 0.05 <0.001
Charlson score ≥3 0.21 0.07 0.003
Calendar month
 April −0.48 0.08 <0.001
 May −1.40 0.09 <0.001
 June −2.23 0.14 <0.001
 July −1.23 0.10 <0.001
 August −0.51 0.07 <0.001
 September −0.30 0.06 <0.001
 October −0.06 0.05 0.240

Table 5:

Logistic regression of having at least one NCO conditions on COVID-19 vaccination, the NCE (previous immunization) and other covariates.

Est. S.E. p-value

(Intercept) −2.42 0.11 <0.001
COVID-19 vaccination 0.23 0.09 0.014
Previous Immunization 0.68 0.02 <0.001
Age ≥18, ≤60 0.47 0.06 <0.001
Age ≥60 1.00 0.06 <0.001
Male −0.05 0.02 0.049
White 0.13 0.03 <0.001
Charlson score ≥3 0.29 0.04 <0.001
Calendar month
 April −0.04 0.05 <0.472
 May 0.12 0.04 0.004
 June 0.13 0.04 0.002
 July 0.12 0.05 0.013
 August 0.10 0.04 0.022
 September −0.01 0.04 0.712
 October 0.040 0.04 0.312

Table 6:

Estimates, standard errors, and 95% confidence intervals of parameters for the NC analysis with the University of Michigan Health System data by Algorithm 1.

Est. S.E. 95% CI

β (log causal RR) −2.80 0.14 (−3.08, −2.54)
VE (1 − exp(β)) 94.0% / (92.1%, 95.4%)
τ: Intercept −29.14 53.86 (−134.70,76.42)
τ: COVID-19 vaccination 30.48 53.85 (−75.06, −136.02)
τ: Previous immunization 234.71 90.11 (58.09, 411.32)
τ: COVID-19 vaccination × Previous immunization −235.76 90.18 (−412.51, −59.01)
τ: Age ≥18, ≤60 −0.04 0.18 (−0.40, 0.31)
τ: Age ≥60 0.15 0.18 (−0.19, 0.50)
τ: Male −0.04 0.08 (−0.20,0.12)
τ: White −0.07 0.09 (−0.25, 0.11)
τ: Charlson score ≥3 0.31 0.15 (0.01, 0.60)
τ: Calendar month
 April −0.52 0.23 (−0.98, −0.06)
 May 0.24 0.15 (−0.06, 0.54)
 June 0.35 0.14 (0.08, 0.62)
 July 0.31 0.15 (0.02, 0.60)
 August 0.25 0.15 (−0.04, 0.55)
 September 0.38 0.13 (0.12, 0.64)
 October 0.22 0.13 (−0.03, 0.48)

Table 7:

Estimates, standard errors and 95% confidence intervals of parameters in a logistic regression of COVID-19 infection on COVID-19 vaccination and other baseline covariates.

Est. S.E. 95% CI

(Intercept) 0.36 0.12 (0.12, 0.60)
COVID-19 vaccination −3.18 0.08 (−3.35, −3.01)
Age ≥18, ≤60 0.56 0.09 (0.39, 0.73)
Age ≥60 0.32 0.09 (0.14, 0.51)
Male 0.05 0.04 (−0.03,0.13)
White 0.24 0.05 (0.14, 0.33)
Charlson score ≥3 0.25 0.07 (0.11, 0.39)
Calendar month
 April −0.48 0.08 (−0.63, −0.33)
 May −1.40 0.09 (−1.58, −1.21)
 June −2.23 0.14 (−2.49, −1.69)
 July −1.23 0.10 (−1.42, −1.03)
 August −0.51 0.07 (−0.65, −0.37)
 September −0.30 0.06 (−0.42, −0.19)
 October −0.06 0.05 (−0.17, 0.04)

Contributor Information

Kendrick Qijun Li, Department of Biostatistics, University of Michigan.

Xu Shi, Department of Biostatistics, University of Michigan.

Wang Miao, Department of Probability and Statistics, Peking University.

Eric Tchetgen Tchetgen, Department of Statistics and Data Science, The Wharton School, University of Pennsylvania.

References

  1. Anders Katherine L et al. (2018). “The AWED trial (Applying Wolbachia to Eliminate Dengue) to assess the efficacy of Wolbachia-infected mosquito deployments to reduce dengue incidence in Yogyakarta, Indonesia: study protocol for a cluster randomised controlled trial”. In: Trials 19.1, pp. 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andrews Donald WK (2011). “Examples of l2-complete and boundedly-complete distributions”. In.
  3. Baden Lindsey R et al. (2021). “Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine”. In: New England Journal of Medicine 384.5, pp. 403–416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bareinboim Elias and Pearl Judea (2012). “Controlling selection bias in causal inference”. In: Artificial Intelligence and Statistics . PMLR, pp. 100–108. [Google Scholar]
  5. Black Carla L et al. (2018). “Influenza vaccination coverage among health care personnel—United States, 2017–18 influenza season”. In: Morbidity and Mortality Weekly Report 67.38, p. 1050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bond HS, Sullivan SG, and Cowling BJ (2016). “Regression approaches in the test-negative study design for assessment of influenza vaccine effectiveness”. In: Epidemiology & Infection 144.8, pp. 1601–1611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Boom Julie A et al. (2010). “Effectiveness of pentavalent rotavirus vaccine in a large urban population in the United States”. In: Pediatrics 125.2, e199–e207. [DOI] [PubMed] [Google Scholar]
  8. Broome Claire V, Facklam Richard R, and Fraser David W (1980). “Pneumococcal disease after pneumococcal vaccination: an alternative method to estimate the efficacy of pneumococcal vaccine”. In: New England Journal of Medicine 303.10, pp. 549–552. [DOI] [PubMed] [Google Scholar]
  9. Cai Zhihong and Kuroki Manabu (2012). “On identifying total effects in the presence of latent variables and selection bias”. In: arXiv preprint arXiv:1206.3239. [Google Scholar]
  10. Chambers Catharine et al. (2018). “Should sex be considered an effect modifier in the evaluation of influenza vaccine effectiveness?” In: Open forum infectious diseases. Vol. 5. 9. Oxford University Press US, ofy211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chen Hua Yun (2003). “A note on the prospective analysis of outcome-dependent samples”. In: Journal of the Royal Statistical Society: Series B (Statistical Methodology) 65.2, pp. 575–584. [Google Scholar]
  12. Chung Jessie R et al. (2020). “Effects of influenza vaccination in the United States during the 2018–2019 influenza season”. In: Clinical Infectious Diseases 71.8, e368–e376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cole Stephen R and Frangakis Constantine E (2009). “The consistency statement in causal inference: a definition or an assumption?” In: Epidemiology 20.1, pp. 3–5. [DOI] [PubMed] [Google Scholar]
  14. Cui Yifan et al. (2020). “Semiparametric proximal causal inference”. In: arXiv preprint arXiv:2011.08411. [Google Scholar]
  15. Dagan Noa et al. (2021). “BNT162b2 mRNA Covid-19 vaccine in a nationwide mass vaccination setting”. In: New England Journal of Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dean Natalie E, Hogan Joseph W, and Schnitzer Mireille E (2021). Covid-19 vaccine effectiveness and the test-negative design. [DOI] [PMC free article] [PubMed]
  17. Deaner Ben (2018). “Proxy controls and panel data”. In: arXiv preprint arXiv:1810.00283. [Google Scholar]
  18. — (2021). “Many Proxy Controls”. In: arXiv preprint arXiv:2110.03973. [Google Scholar]
  19. Dufault Suzanne M and Jewell Nicholas P (2020). “Analysis of counts for cluster randomized trials: Negative controls and test-negative designs”. In: Statistics in medicine 39.10, pp. 1429–1439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Dukes Oliver, Shpitser Ilya, and Tchetgen Eric J Tchetgen (2021). “Proximal mediation analysis”. In: arXiv preprint arXiv:2109.11904. [Google Scholar]
  21. D’Haultfoeuille Xavier (2011). “On the completeness condition in nonparametric instrumental problems”. In: Econometric Theory 27.3, pp. 460–471. [Google Scholar]
  22. Egami Naoki and Tchetgen Eric J Tchetgen (2021). “Identification and Estimation of Causal Peer Effects Using Double Negative Controls for Unmeasured Network Confounding”. In: arXiv preprint arXiv:2109.01933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Feng Shuo, Cowling Benjamin J, and Sullivan Sheena G(2016). “Influenza vaccine effectiveness by test-negative design–Comparison of inpatient and outpatient settings”. In: Vaccine 34.14, pp. 1672–1679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Villalobos Fernández, Verónica Nathalie et al. (2021). “Effect modification of the association between comorbidities and severe course of COVID-19 disease by age of study participants: a systematic review and meta-analysis”. In: Systematic reviews 10.1, pp. 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Flannery Brendan et al. (2019). “Influenza vaccine effectiveness in the United States during the 2016–2017 season”. In: Clinical Infectious Diseases 68.11, pp. 1798–1806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Foppa Ivo M et al. (2016). “The case test-negative design for studies of the effectiveness of influenza vaccine in inpatient settings”. In: International journal of epidemiology 45.6, pp. 2052–2059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gabriel Erin E, Sachs Michael C, and Sjölander Arvid (2020). “Causal bounds for outcome-dependent sampling in observational studies”. In: Journal of the American Statistical Association, pp. 1–12. [Google Scholar]
  28. Ghassami AmirEmad, Shpitser Ilya, and Tchetgen Eric Tchetgen (2021). “Proximal Causal Inference with Hidden Mediators: Front-Door and Related Mediation Problems”. In: arXiv preprint arXiv:2111.02927. [Google Scholar]
  29. — (2022). “Combining Experimental and Observational Data for Identification of Long-Term Causal Effects”. In: arXiv preprint arXiv:2201.10743. [Google Scholar]
  30. Ghassami AmirEmad et al. (2021). Minimax kernel machine learning for a class of doubly robust functionals. Tech. rep. [Google Scholar]
  31. Hausman Jerry A (1978). “Specification tests in econometrics”. In: Econometrica: Journal of the econometric society, pp. 1251–1271. [Google Scholar]
  32. Hitchings Matt et al. (2021). “Effectiveness of the ChAdOx1 vaccine in the elderly during SARS-CoV-2 Gamma variant transmission in Brazil”. In: medRxiv. [Google Scholar]
  33. Hu Yingyao and Schennach Susanne M (2008). “Instrumental variable treatment of nonclassical measurement error models”. In: Econometrica 76.1, pp. 195–216. [Google Scholar]
  34. Hudgens Michael G and Halloran M Elizabeth (2006). “Causal vaccine effects on binary postinfection outcomes”. In: Journal of the American Statistical Association 101.473, pp. 51–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. — (2008). “Toward causal inference with interference”. In: Journal of the American Statistical Association 103.482, pp. 832–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Huitfeldt Anders, Stensrud Mats J, and Suzuki Etsuji (2019). “On the collapsibility of measures of effect in the counterfactual causal framework”. In: Emerging themes in epidemiology 16.1, pp. 1–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Imbens Guido, Kallus Nathan, and Mao Xiaojie (2021). “Controlling for Unmeasured Confounding in Panel Data Using Minimal Bridge Functions: From Two-Way Fixed Effects to Factor Models”. In: arXiv preprint arXiv:2108.03849. [Google Scholar]
  38. Jackson Lisa A et al. (2006). “Evidence of bias in estimates of influenza vaccine effectiveness in seniors”. In: International journal of epidemiology 35.2, pp. 337–344. [DOI] [PubMed] [Google Scholar]
  39. Jackson Michael L and Nelson Jennifer C (2013). “The test-negative design for estimating influenza vaccine effectiveness”. In: Vaccine 31.17, pp. 2165–2168. [DOI] [PubMed] [Google Scholar]
  40. Jackson Michael L et al. (2017). “Influenza vaccine effectiveness in the United States during the 2015–2016 season”. In: New England Journal of Medicine 377.6, pp. 534–543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Jewell Nicholas P et al. (2019). “Analysis of cluster-randomized test-negative designs: cluster-level methods”. In: Biostatistics 20.2, pp. 332–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kallus Nathan, Mao Xiaojie, and Uehara Masatoshi (2021). “Causal Inference Under Unmeasured Confounding With Negative Controls: A Minimax Learning Approach”. In: arXiv preprint arXiv:2103.14029. [Google Scholar]
  43. Krammer Florian (2019). “The human antibody response to influenza A virus infection and vaccination”. In: Nature Reviews Immunology 19.6, pp. 383–397. [DOI] [PubMed] [Google Scholar]
  44. Lehmann Erich Leo and Henry Scheffé (2012a). “Completeness, similar regions, and unbiased estimation-Part I”. In: Selected Works of EL Lehmann. Springer, pp. 233–268. [Google Scholar]
  45. — (2012b). “Completeness, similar regions, and unbiased estimation—part II”. In: Selected Works of EL Lehmann. Springer, pp. 269–286. [Google Scholar]
  46. Leung Jessica et al. (2011). “Herpes zoster incidence among insured persons in the United States, 1993–2006: evaluation of impact of varicella vaccination”. In: Clinical Infectious Diseases 52.3, pp. 332–340. [DOI] [PubMed] [Google Scholar]
  47. Lipsitch Marc, Jha Ayan, and Simonsen Lone (2016). “Observational studies and the difficult quest for causality: lessons from vaccine effectiveness and impact studies”. In: International journal of epidemiology 45.6, pp. 2060–2074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lipsitch Marc, Tchetgen Eric Tchetgen, and Cohen Ted (2010). “Negative controls: a tool for detecting confounding and bias in observational studies”. In: Epidemiology (Cambridge, Mass.) 21.3, p. 383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Liu Lan and Tchetgen Eric Tchetgen (2021). “Regression-based negative control of homophily in dyadic peer effect analysis”. In: Biometrics. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Mastouri Afsaneh et al. (2021). “Proximal causal learning with kernels: Two-stage estimation and moment restriction”. In: International Conference on Machine Learning. PMLR, pp. 7512–7523. [Google Scholar]
  51. Miao Wang, Geng Zhi, and Tchetgen Eric J Tchetgen (2018). “Identifying causal effects with proxy variables of an unmeasured confounder”. In: Biometrika 105.4, pp. 987–993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Miao Wang, Shi Xu, and Tchetgen Eric Tchetgen (2018). “A confounding bridge approach for double negative control inference on causal effects”. In: arXiv e-prints, arXiv–1808. [Google Scholar]
  53. Moline Heidi L et al. (2021). “Effectiveness of COVID-19 vaccines in preventing hospitalization among adults aged≥ 65 years—COVID-NET, 13 states, February–April 2021”. In: Morbidity and Mortality Weekly Report 70.32, p. 1088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Newey Whitney K and Powell James L (2003). “Instrumental variable estimation of nonparametric models”. In: Econometrica 71.5, pp. 1565–1578. [Google Scholar]
  55. Olson Samantha M et al. (2022). “Effectiveness of BNT162b2 vaccine against critical Covid-19 in adolescents”. In: New England Journal of Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Patel Manish M, Jackson Michael L, and Ferdinands Jill (2020). “Postlicensure evaluation of COVID-19 vaccines”. In: JAMA 324.19, pp. 1939–1940. [DOI] [PubMed] [Google Scholar]
  57. Polack Fernando P et al. (2020). “Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine”. In: New England Journal of Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Qi Zhengling, Miao Rui, and Zhang Xiaoke (2021). “Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding”. In: arXiv preprint arXiv:2105.01187. [Google Scholar]
  59. Rolfes Melissa A et al. (2019). “Effects of influenza vaccination in the United States during the 2017–2018 influenza season”. In: Clinical Infectious Diseases 69.11, pp. 1845–1853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Rosenbaum Paul R (1987). “Model-based direct adjustment”. In: Journal of the American Statistical Association 82.398, pp. 387–394. [Google Scholar]
  61. Sadoff Jerald et al. (2021). “Safety and efficacy of single-dose Ad26. COV2. S vaccine against Covid-19”. In: New England Journal of Medicine 384.23, pp. 2187–2201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Schnitzer Mireille E (2022). “Estimands and Estimation of COVID-19 Vaccine Effectiveness Under the Test-negative Design: Connections to Causal Inference.” In: Epidemiology (Cambridge, Mass.) [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Schwartz Lauren M et al. (2017). “Rotavirus vaccine effectiveness in low-income settings: An evaluation of the test-negative design”. In: Vaccine 35.1, pp. 184–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Shi Xu, Miao Wang, and Tchetgen Eric Tchetgen (2020). “A selective review of negative control methods in epidemiology”. In: Current Epidemiology Reports, pp. 1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Shi Xu et al. (2020). “Multiply robust causal inference with double-negative control adjustment for categorical unmeasured confounding”. In: Journal of the Royal Statistical Society: Series B (Statistical Methodology) 82.2, pp. 521–540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shi Xu et al. (2021). “Theory for identification and Inference with Synthetic Controls: A Proximal Causal Inference Framework”. In: arXiv preprint arXiv:2108.13935. [Google Scholar]
  67. Shrank William H, Patrick Amanda R, and Brookhart M Alan (2011). “Healthy user and related biases in observational studies of preventive interventions: a primer for physicians”. In: Journal of general internal medicine 26.5, pp. 546–550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Struchiner CJ and Halloran ME (2007). “Randomization and baseline transmission in vaccine field trials”. In: Epidemiology & Infection 135.2, pp. 181–194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Sullivan Sheena G, Tchetgen Eric J Tchetgen, and Cowling Benjamin J (2016). “Theoretical basis of the test-negative study design for assessment of influenza vaccine effectiveness”. In: American journal of epidemiology 184.5, pp. 345–353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Tchetgen Tchetgen, Eric J and VanderWeele Tyler J (2012). “On causal inference in the presence of interference”. In: Statistical methods in medical research 21.1, pp. 55–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Tchetgen Tchetgen, Eric J et al. (2020). “An Introduction to Proximal Causal Learning”. In: arXiv preprint arXiv:2009.10982. [Google Scholar]
  72. Tenforde Mark W et al. (2021). “Influenza vaccine effectiveness against hospitalization in the United States, 2019–2020”. In: The Journal of Infectious Diseases 224.5, pp. 813–820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Thompson Mark G et al. (2021). “Effectiveness of COVID-19 vaccines in ambulatory and inpatient care settings”. In: New England Journal of Medicine 385.15, pp. 1355–1371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Utarini Adi et al. (2021). “Efficacy of Wolbachia-infected mosquito deployments for the control of dengue”. In: New England Journal of Medicine 384.23, pp. 2177–2186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Van der Vaart Aad. (2000). Asymptotic statistics. Vol. 3. Cambridge university press. [Google Scholar]
  76. Wang Bingkai et al. (2022). “Randomization Inference for Cluster-Randomized Test-Negative Designs with Application to Dengue Studies: Unbiased estimation, Partial compliance, and Stepped-wedge design”. In: arXiv preprint arXiv:2202.03379. [Google Scholar]
  77. Ying Andrew et al. (2021). “Proximal Causal Inference for Complex Longitudinal Studies”. In: arXiv preprint arXiv:2109.07030. [Google Scholar]

Articles from ArXiv are provided here courtesy of arXiv

RESOURCES