Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Jan 10.
Published in final edited form as: Epidemiology. 2022 Jan 1;33(1):114–120. doi: 10.1097/EDE.0000000000001431

Revisiting the g-null paradox

Sean McGrath 1, Jessica G Young 2,3, Miguel A Hernán 1,3,4
PMCID: PMC9831349  NIHMSID: NIHMS1860457  PMID: 34711734

Abstract

The (noniterative conditional expectation) parametric g-formula is an approach to estimating causal effects of sustained treatment strategies from observational data. An often-cited limitation of the parametric g-formula is the g-null paradox: a phenomenon in which model misspecification in the parametric g-formula is guaranteed in some settings consistent with the conditions that motivate its use (i.e., when identifiability conditions hold and measured time-varying confounders are affected by past treatment). Many users of the parametric g-formula acknowledge the g-null paradox as a limitation when reporting results but still require clarity on its meaning and implications. Here we revisit the g-null paradox to clarify its role in causal inference studies. In doing so, we present analytic examples and a simulation-based illustration of the bias of parametric g-formula estimates under the conditions associated with this paradox. Our results highlight the importance of avoiding overly parsimonious models for the components of the g-formula when using this method.

Keywords: g-null paradox, parametric g-formula, model misspecification, causal inference

Introduction

The g-formula identifies causal effects of sustained treatment strategies from observational data under no unmeasured confounding and other assumptions.1 Unlike conventional regression, the g-formula can identify causal effects even when time-varying confounders are affected by past treatment (i.e., in the presence of so-called “treatment–confounder feedback”).1,2 Various estimators of the g-formula have been applied in practice in numerous settings; for instance, to estimate the effect of different strategies for initiating antiretroviral therapy on the risk of death in a cohort of HIV-infected individuals.3 In this HIV example, CD4 cell count is a time-varying confounder that is affected by past treatment.

However, the g-formula is typically a high-dimensional function of measured variables when high-dimensional confounding adjustment sets are needed for identification. In this case, the g-formula may be estimated by assuming its components – the conditional outcome mean and the joint conditional density of confounders at each time given the past treatment and confounders – can be correctly characterized by parametric models. This estimator of the g-formula has been referred to as the noniterative conditional expectation parametric g-formula.1,47 For simplicity, we refer to it as the parametric g-formula in this paper.

Robins and Wasserman8 showed that the parametric g-formula may be guaranteed some degree of model misspecification when there is treatment–confounder feedback and the sharp causal null hypothesis (i.e., the treatment has no effect on any individual’s outcome at any time) is true, even if the identifying conditions hold. As a consequence, under these conditions2, a hypothesis test based on parametric g-formula estimates will falsely reject the null hypothesis of no treatment effect in large enough studies with probability approaching one.8 This phenomenon has been popularly referred to as the g-null paradox.

The existence of the g-null paradox is a potential threat to the validity of data analyses that rely on parametric g-formula estimates. There is, however, misunderstanding in the applied literature about the meaning and possible implications of the g-null paradox. Here we present analytic examples and a simulation-based illustration of the bias of the parametric g-formula under the conditions associated with this paradox.

The structure of the paper is as follows. We first review the observed data structure, causal estimands, and the g-formula. Then, we review the example of the g-null paradox introduced in Robins and Wasserman8. We clarify how model misspecification can also be guaranteed in settings other than the sharp causal null through an example. Last, we illustrate the impact of model misspecification under the conditions of the g-null paradox on bias, variance, and confidence interval coverage in a simulation study.

Background

The observed data

Consider an observational study with n individuals for which measurements are available at regularly spaced intervals (e.g., months) denoted by k = 0, …, K with k = 0 the baseline interval and K + 1 the interval in which an outcome Y is of interest. For each time k suppose the following are measured: Ak the value of a treatment of interest (e.g., dose of a given medication) and Lk a vector of covariates with L0 possibly additionally containing time-fixed and pre-baseline covariates. We adopt the convention that Lk precedes Ak in each k and use overbars to denote the history of a random variable; e.g. A¯k(A0,A1,,Ak).

The causal directed acyclic graph (DAG) in Figure 1a represents a possible data generating assumption for the observational study for the simple case of two times (K = 1) and with the population stratified on a single level of L0 (such that it can be left off of the graph). Here U is an assumed unmeasured common cause of the disease outcome and a measured covariate L1. For simplicity and without loss of generality, we assume that all covariates Lk are discrete, and that there is no missing data, no measurement error, and no death during the study period.

Figure 1:

Figure 1:

Causal graphs for the motivating example with two time intervals. Panel A illustrates a causal DAG representing an observed data generating assumption. Panel B illustrates SWIG transformation of the causal DAG under a treatment strategy a¯1.

The causal question

Researchers are interested in using the data from this study to estimate the causal effect of an intervention that ensures everyone takes 150 mg of treatment every month during the follow-up, versus 50 mg, on the mean of the outcome. The single world intervention graph (SWIG) in Figure 1b is a transformation of the causal DAG under an intervention that sets treatment dose in the first two intervals to particular values a0 and a1, respectively.9

For ak a possible level of treatment dose at time k and Ya¯K an individual’s outcome if, possibly contrary to fact, the individual had adhered to a strategy assigning treatment doses a¯K=(a0,a1,,aK) over the follow-up, then the average causal effect is

E(Ya¯K=150¯Ya¯K=50¯). (1)

Here, a¯K=150¯ indicates ak = 150 for k = 0,…, K.

The g-formula

Robins1 showed that E(Ya¯K) can be identified by the g-formula

h(a¯K)=l¯KE(Y|L¯K=l¯K,A¯K=a¯K)j=0Kf(lj|l¯j1,a¯j1), (2)

where E(Y|L¯K=l¯K,A¯K=a¯K) is the mean of Y among those with particular history (l¯K,a¯K) and f(lj|l¯j1,a¯j1)Pr[Lj=lj|L¯j1=l¯j1,A¯j1=a¯j1] is the proportion of individuals with Lj = lj among those with history (l¯j1,a¯j1) and the sum is over all possible levels l¯K of L¯K in this population.

The identification of E(Ya¯K) by the g-formula (2) requires the assumptions of sequential exchangeability, positivity, and consistency which have been discussed at length elsewhere.1,9,10 Sequential exchangeability, sometimes referred to as no unmeasured confounding, is encoded on the SWIG in Figure 1b by the absence of an arrow from the unmeasured U into the natural values of treatment at any time (A0,A1a0).9 Because, for simplicity, the causal estimands here refer to so-called static strategies – strategies under which treatment assignment at each time does not depend on past patient characteristics – there will, by definition, be no arrows from measured (or unmeasured) past covariates into the intervention nodes a0 or a1 on the SWIG in Figure 1b as a SWIG is a transformation of the causal DAG specific to the causal estimand.9

The g-null paradox

The causal effect can be estimated using the n individuals in our study by estimating h(a¯K=150¯)h(a¯K=50¯). This function may be difficult to estimate in practice when (Ak, Lk) can jointly take many levels at a given time k in the data. The parametric g-formula is a computationally straightforward approach to estimating this function under the assumption that the components of the g-formula in (2) can be correctly characterized by (unsaturated) parametric models.

However, Robins and Wasserman8 showed that when the following three conditions hold

  • Condition 1: The counterfactual mean E(Ya¯K) is identified by the g-formula h(a¯K) in (2).

  • Condition 2: Lk is a time-varying confounder (in that the g-formula h(a¯K) must depend on it) and is also affected by past treatment.

  • Condition 3: The treatment has no effect on any individual’s outcome at any time (i.e. the sharp null is true).

then parametric models cannot, in general, correctly characterize the g-formula (2). This contradiction, which has come to be known as the g-null paradox, implies that, if Conditions 1, 2, and 3 are true, an estimate of the effect of interest (1) using the parametric g-formula will be subject to some bias.

Robins and Wasserman8 illustrated this contradiction with an example, represented in Figure 2, depicting assumptions consistent with Conditions 1, 2, and 3. The example is a simplified version of our observational study with a null treatment effect, only two follow-up intervals (K = 1), constant L0 (thus it can be ignored) and with L1 containing only one binary covariate. In this simple case, the g-formula (2) reduces to

h(a0,a1)=l1=01E(Y|l1,a0,a1)f(l1|a0).

Figure 2:

Figure 2:

Causal graphs of the original example by Robins and Wasserman8. Panel A illustrates a causal DAG representing an observed data generating assumption. Panel B illustrates a SWIG transformation of the causal DAG under a treatment strategy a¯1.

Given the joint distribution of the observed data is faithful8 to the graphs in Figure 2, then Figure 2 generally implies the following:

  • Implication 1: E(Ya0,a1)=h(a0,a1) (by Condition 1) does not depend on a0 or a1 (by Condition 3) – consistent with the absence of any arrows into Ya0,a1 except from U in Figure 2b.

  • Implication 2: E(Y|l1, a0, a1) does depend on l1 (by the first part of Condition 2) – consistent with the path L1UY in Figure 2a.

  • Implication 3: f(l1|a0) does depend on a0 (by the second part of Condition 2) – consistent with the path A0L1 in Figure 2a.

Now suppose that parametric models correctly characterize the components of the g-formula:

E(Y|l1,a0,a1)=g(l1,a0,a1;θ)

with g a function of (l1, a0, a1) and a parameter vector θ and

f(l1|a0)=r(l1,a0;β)

with r a function of (l1, a0) that is constrained between zero and one and a parameter vector β. Given these parametric assumptions, we may replace h(a0, a1) with

h(a0,a1;θ,β)=l1=01g(l1,a0,a1;θ)r(l,a0;β). (3)

Robins and Wasserman8 considered the following standard choices of g and r:

g(l1,a0,a1;θ)=θ0+θ1a0+θ2a1+θ3l1 (4)
r(l1=1,a0;β)=exp(β0+β1a0)1+exp(β0+β1a0). (5)

Plugging in these specific choices of g and r into (3) we have

h(a0,a1;θ,β)=θ0+θ1a0+θ2a1+θ3exp(β0+β1a0)1+exp(β0+β1a0). (6)

For these choices of g and r, it is straightforward to see that h(a0, a1; θ, β) will not depend on (a0, a1) if and only if θ1 = θ2 = 0 and either θ3 = 0 or β1 = 0. However, θ3 = 0 contradicts the dependence of E(Y|l1, a0, a1) on l1 and β1 = 0 contradicts the dependence of f(l1|a0) on a0.

That is, if Conditions 1, 2, and 3 hold, standard parametric models cannot correctly characterize the g-formula (2). As noted by Robins and Wasserman8, adding more flexibility to models g and r will not remove the problem unless these parametric models are saturated. A saturated model for g would allow this function to take a different value for each distinct combination of (l1, a0, a1) and, for r, a different value for each distinct level of a0. If treatment in our example could only take two levels in the data (e.g., if only two doses are possible in this study, 50 mg or 150 mg) then model (5) would be a saturated model for r. Adding all two- and three-way interactions to (4) would further constitute a saturated model for g in this scenario. However, even in this trivialized example with a single binary confounder L1, saturated models for g or r are impossible/impractical to specify if treatment is continuous or otherwise can take many levels in the data. Previous authors have shown that counterexamples can be constructed for the binary treatment case where this contradiction is avoided for unsaturated models but only under scenarios where model coefficients happen to be perfect functions of others11,12 (also see eAppendix A).

Beyond the sharp causal null hypothesis

The possibility of the g-null paradox is often handled informally in practice. Investigators dismiss the g-null paradox when they find non-null effect estimates if substantive knowledge or prior studies suggested that the sharp causal null (Condition 3) does not hold (e.g., see1315) or when they find null effect estimates precisely because, despite the potential for the existence of the g-null paradox, they do find a null result (e.g., see16).

This informal reasoning privileges the sharp causal null and thus obscures the broader implications of Robins and Wasserman’s example: regardless of whether the sharp causal null holds, there may be a contradiction between the assumption that parametric models can correctly characterize the g-formula and subject matter assumptions that can be encoded in a causal DAG. In other words, some model misspecification may be inevitable in realistic settings.

To see this, consider the following modification to the example in the previous section. Suppose that treatment at time 1 may affect the outcome. This scenario can be graphically represented by adding an arrow from A1 into Y in Figure 2a and, in turn, adding an arrow from a1 into Y in Figure 2b, further relabeling this outcome Ya1 to acknowledge dependence on a1.9 Relying on the same general models for g and r given in (4) and (5), we again arrive at equation (6) for h(a0, a1; θ, β) but now consider the weaker hypothesis that this function does not depend on a0 but allowing that it may depend on a1. It is straightforward to see that we obtain the same contradiction: h(a0, a1; θ, β) will not depend on a0 if and only if θ1 = 0 and either θ3 = 0 or β1 = 0. As previously argued, the condition of θ3 = 0 or β1 = 0 contradicts our initial assumptions that E(Y|l1, a0, a1) depends on l1 and f(l1|a0) depends on a0.

We consider this alternative scenario to illustrate the inevitability of model misspecification under the simplest possible non-null scenario. It can be seen in a similar manner that model misspecification is inevitable under other non-null scenarios. In summary, the g-null paradox is a particular instance of model misspecification that may arise when using the parametric g-formula, irrespective of whether the sharp causal null holds.

Simulations

We conducted numerical simulations to illustrate the impact of model misspecification on parametric g-formula estimates. We consider the setting with a time-varying treatment, a time-varying confounder, and an outcome measured at the end of follow-up. Data generation is loosely based on an HIV cohort study in which the treatment is an indicator of antiretroviral therapy, the confounder is an indicator of immunosuppression, and the outcome is CD4 cell count. Within this framework, data generation parameters were set to create considerable confounding and treatment–confounder feedback but no unmeasured confounding or violations of positivity. We generated data under the sharp causal null for simplicity. Similar bias may arise under non-null settings, as illustrated in the previous section.

Simulation Design

We considered six scenarios by varying the number of follow-up time points (1, 5, or 10) and the type of treatment (binary or continuous).

For each scenario, we simulated 250 longitudinal data sets with 10,000 individuals and K + 1 time points. We evaluated the bias, standard error (SE), and confidence interval (CI) coverage of the estimator for the outcome mean under each intervention and the difference of means across interventions. Standard errors were computed by taking the standard deviation of the 250 point estimates. The true outcome mean under each intervention was 500, and the true difference of means was 0.

Each data set was generated based on the DAG in Figure 2a, with the exception of including additional time points in some simulation scenarios. We first drew an unmeasured variable (U) from a Uniform(0,1) distribution. We then simulated the time-varying covariate (Lk) at each time k (k = 0,1,…, K) and simulated the outcome at time K by

Lk~Ber(p=logit1(α0+α1Ak1+α2U+α3Ak1U)) (7)
Y~N[0,1000](350+300U,502) (8)

where the value of αi, i = 0,…,3, depended on whether the treatment was continuous or discrete and N[a, b](μ, σ2) denotes the N(μ, σ2) distribution truncated in the interval [a, b].

In the binary treatment scenarios, we set (α0 = 0, α1 = −2.5, α2 = 1, α3 = 2.5) in (7) for the simulation of Lk and simulated the time-varying treatment (Ak) at each time interval k by

Ak~Ber(p=logit1(1.25+Ak1+Lk+Ak1Lk)).

For the case where the treatment is continuous (e.g., treatment can be given at different doses), we set (α0 = 1, α1 = −0.015, α2 = 1, α3 = 0.015) in (7) and simulated Ak by

Ak~N[0,200](80+0.1Ak1+30Lk0.05Ak1Lk,252).

We define Lk for k = −1,…,−9 as components of L0 and generated them according to (7) with Ak = Ak−1 = 0.

Analysis of the simulated data

We considered the interventions a¯K=50¯ and a¯K=150¯ in the continuous treatment scenarios and considered the interventions a¯K=0¯ and a¯K=1¯ in the binary treatment scenarios. Of course, both would still coincide with the same interventions – but in settings where treatment is assigned differently – if the binary treatment were simply an indicator of receiving a dose of 150 (versus 50) mg. For example, the continuous treatment scenario could correspond to a setting where doctors vary widely in what dose they prescribe and the binary treatment scenario to a setting where only two doses are ever prescribed.

We applied the parametric g-formula to estimate the mean of the outcome of interest at time t = K and the difference of means under the above interventions. We computed 95% CIs around all estimates using the percentile method with 250 bootstrap replicates.

Even though Y and Lk depend on the unmeasured variable U, the variable U is not needed to obtain unbiased estimates when using the g-formula. However, Y and Lk+1 depend on the entire history of Lk through their dependence on U. Therefore, analysts need to include the entire history of Lk in their models for Y and Lk+1. The functional form of the history of Lk is unknown.

We therefore analyzed the simulated datasets with increasingly flexible models for the history of Lk.

  • Least Flexible: We fit models for Lk and Y that include a single term for the (lagged) cumulative average value of Lk. In particular, we fit the following logistic model for Lk and linear model for Y
    logit(Pr[Lk=1|L¯k1=l¯k1,A¯k1=a¯k1])=γ0+γ1ak1+γ2CMA(l¯k1)
    E(Y|L¯K=l¯K,A¯K=a¯K)=ω0+ω1aK+ω2aK1+ω3CMA(a¯K2)+ω4CMA(l¯K)
    where CMA(l¯k)1k+10i=9kli denotes the cumulative average of l¯k. This analysis uses all the data required to satisfy the sequential exchangeability assumption, but it is expected to result in biased estimates because the parametric models will be somewhat misspecified.
  • Moderately Flexible: We fit models for Lk and Y that include terms for the two most recent lagged values of Lk and a term for the lagged cumulative average value of Lk:
    logit(Pr[Lk=1|L¯k1=l¯k1,A¯k1=a¯k1])=γ0+γ1ak1+γ2lk1+γ3lk2+γ4CMA(l¯k3)
    E(Y|L¯K=l¯K,A¯K=a¯K)=ω0+ω1aK+ω2aK1+ω3CMA(a¯K2)+ω4lK+ω5lK1+ω6lK2+ω7CMA(l¯K3).
  • Most Flexible: We fit models for Lk and Y that include a term for each lagged value of Lk:
    logit(Pr[Lk=1|L¯k1=l¯k1,A¯k1=a¯k1])=γ0+γ1ak1+i=110γ1+ilki
    E(Y|L¯K=l¯K,A¯K=a¯K)=ω0+ω1aK+ω2aK1+ω3CMA(a¯K2)+i=010ω4+ilKi.
  • Benchmark: As a benchmark for the above three analyses, we consider an (impossible) analysis in which one has access to the unmeasured U and knowledge of the functional form of the generation for Y:
    E(Y|U=u,A¯K=a¯K)=ω0+ω1aK+ω2aK1+ω3CMA(a¯K2)+ω4u.

    Because U is a baseline covariate, the distribution of U can be estimated nonparametrically by the empirical distribution of U. This analysis will be unbiased.

One could similarly explore the impact of increasing the flexibility of the history of treatment in the models for Lk and Y. While we did not include such analyses to simplify the presentation, a reduction of bias would require an extremely flexible function of A¯k1 in the Lk model because the true relation between A¯k1 and Lk is highly complex, as the result of our choice to include an interaction term between Ak−1 and the unmeasured variable U in the data generating model for Lk (also see eAppendix B). We applied the parametric g-formula using the gfoRmula R package6. The code used for all analyses is available on GitHub at https://github.com/CausalInference/NullParadox.

Results

Figure 3 illustrates the simulation results for the mean difference. The bias, standard error (SE), and confidence interval (CI) coverage of the parametric g-formula are summarized in Table 1.

Figure 3:

Figure 3:

Violin plots illustrating the estimated average causal effect of the four applications of the parametric g-formula in the binary treatment scenarios (panel A) and continuous treatment scenarios (panel B). Darker shading indicates simulation settings with larger number of time points (light shading: K = 1; medium shading: K = 5; dark shading: K = 10). The dashed line indicates the true value of the average causal effect (i.e., 0). ACE = average causal effect.

Table 1:

Simulation results for the mean difference. The target parameter in the binary treatment scenarios is E(Ya¯K=1¯)E(Ya¯K=0¯) and the target parameter in the continuous treatment scenarios is E(Ya¯K=150¯)E(Ya¯K=50¯). The true value of the target parameters is 0. SE = standard error.

Binary Treatment Scenarios Continuous Treatment Scenarios
G-Formula Application K Bias SE Coverage Bias SE Coverage
Least Flexible 1 4.26 2.39 0.54 18.19 4.54 0.01
5 18.17 3.44 0.00 48.73 8.04 0.00
10 21.14 4.74 0.00 50.39 9.77 0.00
Moderately Flexible 1 1.42 2.46 0.86 0.57 4.76 0.98
5 10.50 3.52 0.20 19.63 8.19 0.33
10 15.07 4.88 0.12 34.43 10.01 0.07
Most Flexible 1 1.42 2.46 0.86 0.55 4.76 0.98
5 4.65 3.65 0.82 −5.49 8.52 0.91
10 −1.05 5.13 0.94 −13.53 10.38 0.73
Benchmark 1 0.11 1.22 0.96 0.10 2.77 0.91
5 0.05 2.10 0.93 −0.09 4.52 0.94
10 0.03 2.81 0.93 0.10 6.24 0.94

The performance of the parametric g-formula generally improved as the flexibility of the models for Lk and Y increased. That is, the impact of model misspecification was greatly mitigated, but not completely eliminated, by using more flexible models for the components of the parametric g-formula. For instance, at K = 10 in the continuous treatment scenario, the least flexible application of the parametric g-formula had a bias of 50.39, SE of 9.77, and coverage of 0.00 whereas the most flexible application had a bias of −13.53, SE of 10.38, and coverage of 0.73.

The simulation results for the counterfactual means are given in eAppendix B. We observed the same trends: the performance of the parametric g-formula improved when using more flexible models. Moreover, we found similar results when increasing the number of iterations in the simulations.

Discussion

Our presentation clarifies that the g-null paradox of the noniterative conditional expectation parametric g-formula is a particular case of a general form of model misspecification that may occur even if the treatment of interest has a non-null causal effect. Part of the confusion surrounding the g-null paradox arose because the paradox has been traditionally discussed in the context of testing the sharp causal null hypothesis of no treatment effect. For example, Campbell and Gustafson17 investigated the empirical type I error rate based on the parametric g-formula. They did not find higher empirical type I error rates under model misspecification compared with saturated models without model misspecification, although their sample sizes were smaller than those considered here.

However, because the primary goal of causal inference is estimating treatment effects rather than “testing hypotheses”, researchers are often concerned about the magnitude of bias in their estimates. Thus, one may view the g-null paradox as simply a phenomenon in which the parametric g-formula estimate of a treatment effect is guaranteed to be biased due to model misspecification, regardless of whether or not the null is true. Although not the focus of their simulation study, Murray et al.18 found the bias of the parametric g-formula to be negligible in scenarios of a null treatment effect. In contrast, our more extensive simulations illustrate that a nonnegligible amount of bias can arise in some simple scenarios even when using fairly flexible models for the components of the parametric g-formula.

Evaluating model misspecification in the parametric g-formula may be informally done by conducting sensitivity analyses under different modeling assumptions and under different orderings of the factorization of the joint density of the confounders (particularly in interval cohorts), comparing the parametric g-formula and the non-parametric estimate of the outcome mean/risk mean under the natural course.19 Additionally, data analysts may consider applying approaches that are based on different algebraic representations of the g-formula, such as inverse probability (IP) weighted estimators or the iterative conditional expectation (ICE) parametric g-formula, which rely on different modeling assumptions. State-of-the-art2024 methods derived from the so-called efficient influence function are increasingly available.2527

Because machine learning (ML) methods relax parametric modeling assumptions, they may naturally be considered a possible solution to model misspecification in the parametric g-formula. Yet, perhaps counterintuitively, the problem of model misspecification in the parametric g-formula is not reasonably solved by using ML algorithms to estimate its nuisance functions (i.e., the joint conditional density of the covariates and conditional outcome mean). Recent simulation studies clarified that, while so-called state-of-the-art methods can benefit from use of ML algorithms24,28, the ML-based singly-robust estimators (i.e., estimators, such as the parametric g-formula, that require all models for the nuisance functions to be correctly specified in order for the estimator to be consistent) do not enjoy this benefit and may perform worse than those based on parametric models.29 Therefore, we did not include such approaches in our simulations.

In summary, model misspecification may introduce bias in parametric g-formula estimates, regardless of whether treatment has a null causal effect. Thus, overly parsimonious models for the components of the g-formula are to be avoided.

Supplementary Material

eAppendix

Acknowledgments:

The simulations in this work were run on the O2 High Performance Compute Cluster at Harvard Medical School.

Source of funding:

This work was supported by NIH grant R37 AI102634, the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE1745303, National Library Of Medicine of the National Institutes of Health under Award Number T32LM012411, and Fonds de recherche du Québec-Nature et technologies B1X research scholarship.

Footnotes

Conflicts of interest: None declared

Posted history: This manuscript was previously posted on arXiv: https://arxiv.org/abs/2103.03857

Data / code availability:

The code used for all analyses is available on GitHub at https://github.com/CausalInference/NullParadox.

References

  • 1.Robins JM. A new approach to causal inference in mortality studies with a sustained exposure period: application to the healthy worker survivor effect. Mathematical Modelling 1986;7:1393–1512 [Errata (1987) in Computers and Mathematics with Applications 14, 917–921. Addendum (1987) in Computers and Mathematics with Applications 14, 923–945. Errata (1987) to addendum in Computers and Mathematics with Applications 18, 477.]. [Google Scholar]
  • 2.Robins JM, Hernan MA. Estimation of the causal effects of time-varying exposure. In: Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G, eds. Longitudinal Data Analysis Chapman and Hall/CRC, 2009;553–599. [Google Scholar]
  • 3.Lodi S, Phillips A, Logan R, et al. Comparative effectiveness of immediate antiretroviral therapy versus CD4-based initiation in HIV-positive individuals in high-income countries: observational cohort study. Lancet HIV 2015;2(8):e335–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Robins JM, Hernan MA, Siebert U. Effects of multiple interventions. In: Ezzati M, Lopez AD, Rodgers A, Murray CJL, eds. Comparative Quantification of Health Risks: Global and Regional Burden of Disease Attributable to Selected Major Risk Factors World Health Organization, 2004;2191–2231. [Google Scholar]
  • 5.Taubman SL, Robins JM, Mittleman MA, Hernan MA. Intervening on risk factors for coronary heart disease: an application of the parametric g-formula. Int J Epidemiol 2009;38(6):1599–611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.McGrath S, Lin V, Zhang Z, et al. gfoRmula: An R Package for Estimating the Effects of Sustained Treatment Strategies via the Parametric g-formula. Patterns (N Y) 2020;1(3). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wen L, Young JG, Robins JM, Hernan MA. Parametric g-formula implementations for causal survival analyses. Biometrics 2021;77(2):740–753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Robins JM, Wasserman L. Estimation of effects of sequential treatments by reparameterizing directed acyclic graphs. Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence. Providence, Rhode Island: Morgan Kaufmann Publishers Inc., 1997;409–420. [Google Scholar]
  • 9.Richardson TS, Robins JM. Single world intervention graphs (SWIGs): A unification of the counterfactual and graphical approaches to causality. Center for the Statistics and the Social Sciences, University of Washington Series. Working Paper 2013;128(30):2013. [Google Scholar]
  • 10.Hernan MA, Robins JM. Causal Inference: What If Chapman & Hall/CRC, 2020. [Google Scholar]
  • 11.Robins JM. General methodological considerations. Journal of Econometrics 2003;112(1):89–106. [Google Scholar]
  • 12.Young JG, Tchetgen Tchetgen EJ. Simulation from a known Cox MSM using standard parametric models for the g-formula. Stat Med 2014;33(6):1001–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zhang Y, Young JG, Thamer M, Hernan MA. Comparing the Effectiveness of Dynamic Treatment Strategies Using Electronic Health Records: An Application of the Parametric g-Formula to Anemia Management Strategies. Health Serv Res 2018;53(3):1900–1918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Neophytou AM, Picciotto S, Costello S, Eisen EA. Occupational Diesel Exposure, Duration of Employment, and Lung Cancer: An Application of the Parametric G-Formula. Epidemiology 2016;27(1):21–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Garcia E, Picciotto S, Neophytou AM, et al. Lung cancer mortality and exposure to synthetic metalworking fluid and biocides: controlling for the healthy worker survivor effect. Occup Environ Med 2018;75(10):730–735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Danaei G, Robins JM, Young JG, et al. Weight Loss and Coronary Heart Disease: Sensitivity Analysis for Unmeasured Confounding by Undiagnosed Disease. Epidemiology 2016;27(2):302–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Campbell H, Gustafson P. The Validity and Efficiency of Hypothesis Testing in Observational Studies with Time-Varying Exposures. Observational Studies 2018;4(1):260–291. [Google Scholar]
  • 18.Murray EJ, Robins JM, Seage GR, Freedberg KA, Hernan MA. A Comparison of Agent-Based Models and the Parametric G-Formula for Causal Inference. Am J Epidemiol 2017;186(2):131–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Young JG, Cain LE, Robins JM, O’Reilly EJ, Hernan MA. Comparative effectiveness of dynamic treatment regimes: an application of the parametric g-formula. Stat Biosci 2011;3(1):119–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Van der Laan MJ, Laan M, Robins JM. Unified methods for censored longitudinal data and causality Springer Science & Business Media, 2003. [Google Scholar]
  • 21.Bickel PJ, Klaassen CA, Bickel PJ, et al. Efficient and adaptive estimation for semiparametric models. Vol. 4 Johns Hopkins University Press; Baltimore, 1993. [Google Scholar]
  • 22.van der Vaart AW. Semiparametric statistics. Lecture Notes in Math. 2002;1781. [Google Scholar]
  • 23.Tsiatis AA. Semiparametric theory and missing data. 2006.
  • 24.Chernozhukov V, Chetverikov D, Demirer M, et al. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal 2018;21(1):C1–C68. [Google Scholar]
  • 25.Williams NT, Díaz I. lmtp: Non-parametric Causal Effects of Feasible Interventions Based on Modified Treatment Policies. R package version 0.0.5. 2020. [Google Scholar]
  • 26.Díaz I, Williams N, Hoffman KL, Schenck EJ. Nonparametric Causal Effects Based on Longitudinal Modified Treatment Policies. Journal of the American Statistical Association 2021:1–16.35757777 [Google Scholar]
  • 27.Lendle SD, Schwab J, Petersen ML, van der Laan MJ. ltmle: An R Package Implementing Targeted Minimum Loss-Based Estimation for Longitudinal Data. Journal of Statistical Software 2017;81(1):21. [Google Scholar]
  • 28.Zivich PN, Breskin A. Machine Learning for Causal Inference: On the Use of Cross-fit Estimators. Epidemiology 2021;32(3):393–401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Naimi AI, Mishler AE, Kennedy EH. Challenges in Obtaining Valid Causal Effect Estimates with Machine Learning Algorithms. Am J Epidemiol 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

eAppendix

Data Availability Statement

The code used for all analyses is available on GitHub at https://github.com/CausalInference/NullParadox.

RESOURCES