Skip to main content
. Author manuscript; available in PMC: 2021 May 13.
Published in final edited form as: Curr Epidemiol Rep. 2020 Oct 15;7(4):190–202. doi: 10.1007/s40471-020-00243-4

Table 2.

Summary of published methodologies using negative controls for detection (D), reduction (R), and correction (C) of confounding bias

Reference and Setting Main Assumptions Besides Assumptions 2–5 Methods
D [32]: Time-series study. Z = future air pollution At+1. (1) At+1Yt | At, Ut, Xt.
(2) log[E(Yt)] = α + βAt + γXt + βfAt+1.
Bias detection by Wald-test on βf.
[60, 61]: invalid NCE Z. (1) Violation of exclusion restriction Y(a, z) ≠ Y(a).
(2) Z is U-comparable with A:ZU|A,X.
No evidence of Z-Y association adjusting for A implies no residual confounding of A-Y association.
R [33, 49]: Time-series study. Z = future air pollution At+1. (1) At+1Yt | At, Ut, Xt; At+1(At,Ut)|Xt.
(2) Yt(at, xt, ut) = β0 + β1αt + β2xt + β3ut + ϵt; E[ϵt | At = at, Ut = ut, Xt = xt] = 0.
(3) E[Ut|At=at,At+1=at+1,Xt=xt]=α0+α1at+α2xt+α3at+1; sign(α1) = sign(α3).
(4) E[At+1 | At = at, Xt = xt] = γ0 + γ1at + γ2xt; γ1 > 0.
Bias reduction by fitting E[Yt | At, Xt, At+1] instead of fitting E[Yt | At, Xt]. Further bias reduction considered in [49] by incorporating Xt+1 or At−1. Identification of β1 is possible with multiple future exposures under autoregressive model for exposure time series.
[62]: Standardized mortality ratio in occupational cohort study. (1) E[Y(1) | X = k]/E[Yref | X = k] = exp(αkδk) E[W | X = k]/E[Wref | X = k] = exp(−ϵk).
(2) sign(ϵk) = sign(δk) and 0 < |ϵk| < 2|δk|.
Adjust for bias δk via E[Y(1) | X = k]E[Wref | X = k]/E[Yref | X = k]E[W | X = k].
[38, 40]: Define negative controls as drug–outcome pairs where one believes no causal effect exists. (1) For a negative control drug-outcome pair, the effect estimate βiN(θi,τi2), i = 1,..., n, where θiN(μ, σ2) is the true bias.
(2) Under the null of no treatment effect, the effect estimate βn+1H0N(μ,σ2+τn+12).
Estimate μ, σ by MLE with L(μ,σ|θ,τ)=i=1np(βi|θi,τi)p(θi|μ,σ)dθi. Calibrated p-value computed via Wald-test of βn+1. Confidence interval calibrated similarly using distribution generated by positive controls.
C [63, 64]: W, Y = Time-to-event outcome. (1) There exist monotonic functions that describe U-Y and U-W associations: Y(0) = hy(U, X), W = hw(U, X).
(2) Cox models for Y and W w/ hazard ratio eβy and eβw.
The hazard ratio measuring the causal effect of treatment is eβyβw.
[13, 65]: Generalized difference-in-differences using NCO. (1) There exist monotonic functions that describe U-Y and U-W associations: Y(0) = hy(U, X), W = hw(U, X).
(2) Positivity: if 0 < fW|A=1,X(W*) then 0 < fW|A=0,X(W*) < 1, where W* = (W | A = 1, X) is distributed as W in the exposed group.
The average treatment effect on the treated is E[Y(1)Y(0)|A=1]=E[Y|A=1]E[FY|A=0,X1)FW|A=0,X(W*)]. Generalized the difference-in-differences approach to the broader context of NCO.
[66]: Calibration using NCO. (1) WA | X, Y(1), Y(0). (2) Rank preservation: Y = Y(0) + ΨA, and hence WA | X, Y(0) by (1). (3) E[W | A, Y(0) = Y − ΨA, X] = β1 + β2X + β3Y(Ψ) + β4A, where β4 = 0 by (1). The 95% CI for any Ψ0 consists of all Ψ for which β^4(Ψ)±1.96s.e.[β^4(Ψ)] contains 0; Under (1)(3), fit E[W | A, Y, X] = β1 + β2X + β3Y + βΨA, then the causal effect Ψ = −βΨ/β3.
[6769]: Removing unwanted variation in gene-expression analysis. (1) Yp = Xqβq×p + UrΓr×p + ϵp, pr + 1.
(2) W1×s=U1×rΓr×sW+ϵ1×sW, sr, Rank(Γr×sW)=r.
(3) (ϵ,ϵW)N(0,diag(σ12,,σp+s2)),(ϵ,ϵW)(X,U).
(4) U1×r=Xqαq×r+ϵ1×rU,ϵUN(0,Ir),ϵUX.
[67, 68]: Estimate U by factor analysis of (2), then estimate β from (1). [69]: Estimate ΓW and Γ by factor analysis of Y = X(β + αΓ) + (ϵUΓ + ϵ) (5) and W = ΓW + (ϵUΓW + ϵW) (6). Then estimate α from (6), and estimate β from (5).
[12, 17, 36]: Nonparametric identification. Assumption 7 Identify h in E[Y | A, Z, X] = E[h(W, A, X) | A, Z, X], then ATE = E[h(W, A = 1, X)] − E[h(W, A = 0, X)].