Table 2.
Summary of published methodologies using negative controls for detection (D), reduction (R), and correction (C) of confounding bias
Reference and Setting | Main Assumptions Besides Assumptions 2–5 | Methods | |
---|---|---|---|
D | [32]: Time-series study. Z = future air pollution At+1. | (1) At+1 ⟂ Yt | At, Ut, Xt. (2) log[E(Yt)] = α + βAt + γXt + βfAt+1. |
Bias detection by Wald-test on βf. |
[60, 61]: invalid NCE Z. | (1) Violation of exclusion restriction Y(a, z) ≠ Y(a). (2) Z is U-comparable with . |
No evidence of Z-Y association adjusting for A implies no residual confounding of A-Y association. | |
R | [33, 49]: Time-series study. Z = future air pollution At+1. | (1) At+1 ⟂ Yt | At, Ut, Xt; . (2) Yt(at, xt, ut) = β0 + β1αt + β2xt + β3ut + ϵt; E[ϵt | At = at, Ut = ut, Xt = xt] = 0. (3) ; sign(α1) = sign(α3). (4) E[At+1 | At = at, Xt = xt] = γ0 + γ1at + γ2xt; γ1 > 0. |
Bias reduction by fitting E[Yt | At, Xt, At+1] instead of fitting E[Yt | At, Xt]. Further bias reduction considered in [49] by incorporating Xt+1 or At−1. Identification of β1 is possible with multiple future exposures under autoregressive model for exposure time series. |
[62]: Standardized mortality ratio in occupational cohort study. | (1) E[Y(1) | X = k]/E[Yref | X = k] = exp(αk − δk) E[W | X = k]/E[Wref | X = k] = exp(−ϵk). (2) sign(ϵk) = sign(δk) and 0 < |ϵk| < 2|δk|. |
Adjust for bias δk via E[Y(1) | X = k]E[Wref | X = k]/E[Yref | X = k]E[W | X = k]. | |
[38, 40]: Define negative controls as drug–outcome pairs where one believes no causal effect exists. | (1) For a negative control drug-outcome pair, the effect estimate , i = 1,..., n, where θi ∼ N(μ, σ2) is the true bias. (2) Under the null of no treatment effect, the effect estimate . |
Estimate μ, σ by MLE with . Calibrated p-value computed via Wald-test of βn+1. Confidence interval calibrated similarly using distribution generated by positive controls. | |
C | [63, 64]: W, Y = Time-to-event outcome. | (1) There exist monotonic functions that describe U-Y and U-W associations: Y(0) = hy(U, X), W = hw(U, X). (2) Cox models for Y and W w/ hazard ratio and . |
The hazard ratio measuring the causal effect of treatment is . |
[13, 65]: Generalized difference-in-differences using NCO. | (1) There exist monotonic functions that describe U-Y and U-W associations: Y(0) = hy(U, X), W = hw(U, X). (2) Positivity: if 0 < fW|A=1,X(W*) then 0 < fW|A=0,X(W*) < 1, where W* = (W | A = 1, X) is distributed as W in the exposed group. |
The average treatment effect on the treated is . Generalized the difference-in-differences approach to the broader context of NCO. | |
[66]: Calibration using NCO. | (1) W ⟂ A | X, Y(1), Y(0). (2) Rank preservation: Y = Y(0) + ΨA, and hence W ⟂ A | X, Y(0) by (1). (3) E[W | A, Y(0) = Y − ΨA, X] = β1 + β2X + β3Y(Ψ) + β4A, where β4 = 0 by (1). | The 95% CI for any Ψ0 consists of all Ψ for which contains 0; Under (1)–(3), fit E[W | A, Y, X] = β1 + β2X + β3Y + βΨA, then the causal effect Ψ = −βΨ/β3. | |
[67–69]: Removing unwanted variation in gene-expression analysis. | (1) Y1×p = X1×qβq×p + U1×rΓr×p + ϵ1×p, p ≥ r + 1. (2) , s ≥ r, . (3) . (4) . |
[67, 68]: Estimate U by factor analysis of (2), then estimate β from (1). [69]: Estimate ΓW and Γ by factor analysis of Y = X(β + αΓ) + (ϵUΓ + ϵ) (5) and W = XαΓW + (ϵUΓW + ϵW) (6). Then estimate α from (6), and estimate β from (5). | |
[12, 17, 36]: Nonparametric identification. | Assumption 7 | Identify h in E[Y | A, Z, X] = E[h(W, A, X) | A, Z, X], then ATE = E[h(W, A = 1, X)] − E[h(W, A = 0, X)]. |