Quantifying the Bias due to Observed Individual Confounders in Causal Treatment Effect Estimates

Layla Parast; Beth Ann Griffin

doi:10.1002/sim.8549

. Author manuscript; available in PMC: 2021 Aug 15.

Published in final edited form as: Stat Med. 2020 May 10;39(18):2447–2476. doi: 10.1002/sim.8549

Quantifying the Bias due to Observed Individual Confounders in Causal Treatment Effect Estimates

Layla Parast ^1,^*, Beth Ann Griffin ¹

PMCID: PMC8162899 NIHMSID: NIHMS1702092 PMID: 32388870

Abstract

It is often of interest to use observational data to estimate the causal effect of a target exposure or treatment on an outcome. When estimating the treatment effect, it is essential to appropriately adjust for selection bias due to observed confounders using, for example, propensity score weighting. Selection bias due to confounders occurs when individuals who are treated are substantially different from those who are untreated with respect to covariates that are also associated with the outcome. A comparison of the unadjusted, naive treatment effect estimate with the propensity score adjusted treatment effect estimate provides an estimate of the selection bias due to these observed confounders. In this paper, we propose methods to identify the observed covariate that explains the largest proportion of the estimated selection bias. Identification of the most influential observed covariate or covariates is important in resource-sensitive settings where the number of covariates obtained from individuals needs to be minimized due to cost and/or patient burden and in settings where this covariate can provide actionable information to healthcare agencies, providers, and stakeholders. We propose straightforward parametric and nonparametric procedures to examine the role of observed covariates and quantify the proportion of the observed selection bias explained by each covariate. We demonstrate good finite sample performance of our proposed estimates using a simulation study and use our procedures to identify the most influential covariates that explain the observed selection bias in estimating the causal effect of alcohol use on progression of Huntington’s disease (HD), a rare neurological disease.

Keywords: selection bias, confounder, treatment effect, kernel estimation, robust, nonparametric

1 |. INTRODUCTION

It is often of interest to use observational data to estimate the causal effect of a target exposure or treatment on an outcome. However, given the nature of observational data (e.g, that researchers cannot control the assignment of individuals or units to levels of the target exposure or treatment), methods are needed to ensure groups being compared are well-balanced with respect to all potential pre-treatment (or pre-exposure) confounders. In the absence of balanced groups, it is likely that the estimated effect of the exposure or treatment will be biased.^1,2,3 This bias is often referred to as “selection bias” because it results from individuals or units “selecting” to receive certain exposures or treatments, although for many applications this is not necessarily a reflection of individual choice. Propensity scores are frequently used in practice to reduce or eliminate selection bias due to the observed confounders available in a data set by obtaining better comparability between the groups of interest, at least in terms of the observed covariates used in the propensity score model.^4,5,6 If all known pre-treatment confounders are included in the propensity score model, then a treatment effect estimate that adjusts for the propensity score has the potential ability to remove selection bias completely from the estimated treatment effect.^4,7,8

In most practical applications using propensity score adjustment, it is useful to examine how much the treatment effect estimate has shifted pre- versus post- propensity score adjustment to understand the influence of selection bias due to observed confounders in a study. The shift or difference between the naive treatment effect estimate and the propensity score adjusted treatment effect estimate provides an estimate of the selection bias that is due to the observed covariates. Quantifying the magnitude of this bias can be useful for understanding the impact that propensity score adjustment has on the estimated treatment effect. In some settings, researchers seek to understand how much of the selection bias due to observed confounders (e.g., referred to here as “observed selection bias”) is explained by a single confounder of interest or by a subset of confounders, particularly when identification of an important confounder can provide actionable information for agencies and stakeholders. For example, in health disparities studies comparing health outcomes between Hispanics and non-Hispanic Whites, estimation of the racial difference in health outcomes can be made more robust by using propensity score adjustment to balance Hispanics and non-Hispanics on observed potential confounders and, in the process, may identify specific variables (e.g., insurance status and receipt of preventative care early in life) as important observed confounders that explain the observed differences in health outcomes for these two populations.^9,10,11 Thus, adjustment for the observed selection bias due to these factors is critical when trying to estimate the size of racial disparities on health outcomes. Additionally, understanding what percentage of the observed selection bias is explained by a particular confounder or set of confounders may also provide actionable information for policymakers as it points to a need to further investigate the causes for the differences (e.g., in our example, the differences in insurance status and use of preventative care early in life may be a potential reflection of disparities within the healthcare system).^12,13,14 While race/ethnicity is not modifiable in the classic sense of an intervention or treatment (and thus causal effects cannot be estimated), propensity score methods and other causal inference methods can be utilized to ensure more robust estimation of health disparities that are an urgent and relevant public health concern.

Even if a particular confounder is not actionable, measurement cost or burden may also motivate researchers to understand how much influence a specific observed variable has on the shift in the estimated treatment effect. For example, in health care studies of chronic or terminal disease, there are often potential confounders that are costly, burdensome, or invasive to measure or obtain. In those settings, it is important to understand whether that covariate can be removed from the adjustment while still ensuring unbiased treatment effect estimation to efficiently utilize resources and not cause undue burden to patients. Additionally, in studies with smaller sample sizes like our application which focuses on a rare disease, it is often of interest to minimize the total number of variables being used in the propensity score model to avoid overfitting. However, in both of these cases, it is critical to robust inferences that research not discard true confounders. We aim to create methods that can quantify the proportion of the observed selection bias that is explained by each individual observed confounder in a study, such that this quantity can inform policy-related decisions and such that only necessary observed confounders can be used to estimate an unbiased treatment effect in a resource-sensitive setting.

While many studies of health outcomes and disparities would benefit from a better understanding of how much each observed confounder contributes to the shift in treatment effect estimates pre- and post- propensity score adjustment (i.e, the observed selection bias or selection bias due to observed confounders), there are no readily available methods to quantify the role of each observed covariate in explaining the observed selection bias. This paper aims to develop useful methods and tools for implementing such analyses. We propose two different ways to explore the problem. In the first, we develop quantities that can be used to understand the individual impact of each observed confounder by investigating how much the treatment effect would change if a study only controlled for one covariate. In the second, we develop quantities that can be used to understand the individual impact of each observed confounder when it is removed from the fully adjusted treatment effect estimate, thereby providing an understanding of its added value above and beyond all the other observed confounders used in the propensity score model.

In Section 2, we present notation, discuss assumptions, and describe our proposed methods. In Section 3, we present methods for estimating the relative impact of each observed confounder on explaining the observed selection bias in an analysis; we also discuss parametric and nonparametric methods for estimating the needed propensity score weights and the implications of balance assessment. In Section 4, we use a simulation study to demonstrate that our proposed methods perform well across a variety of settings. In Section 5, we apply our methods to observational data to examine the impact of alcohol use on Huntington’s disease (HD), a rare neurological disease.

2 |. NOTATION, DEFINITIONS, AND PROPOSED QUANTITIES

2.1 |. Notation and Definitions

Let $Y_{i}^{o b s}$ denote the observed outcome of interest for individual i, T_i denote the binary treatment or intervention, where T_i = 0 or 1, and X_i = {X_1i, X_2i, …, X_ki} denotes the vector of available baseline/pre-treatment covariates. For example, in our HD application, Y is the severity level of the disease ranging in values from −3 to 3 with a standard deviation of 1 and such that higher values indicate greater severity of the disease; T is whether or not someone is an alcohol drinker at study initiation; and X_i includes the key confounders of interest including the CAG repeats an individual has (measured via blood work and for which higher values typically mean earlier onset and worse symptoms of the disease), the baseline severity of the patient at intake, age, education level, drug use, and antidepressant use.^15,16,17 In the case of binary treatments or exposures, each individual has two potential outcomes: the Y_i that would be observed if the individual was assigned to treatment group 1 and the Y_i that would be observed if the individual was assigned to treatment group 0, only one of which is observable for each individual. Let Y_1i and Y_0i denote the potential outcomes when T_i = 1 and T_i = 0, respectively. Then, $Y_{i}^{o b s} = Y_{1 i}$ if T_i = 1 and Y_0i if T_i = 0.

Using this notation, we want to estimate the average treatment effect on the population (ATE), denoted as Δ:

Δ = E (Y_{1 i} - Y_{0 i}) = E (Y_{1 i}) - E (Y_{0 i}) .

Let ${\hat{Δ}}^{naive}$ be the naive estimator of Δ where we simply take the difference in means between the two treatment groups (here, alcohol drinkers minus non-drinkers),

{\hat{Δ}}^{naive} = \frac{1}{n_{1}} \sum_{i : T_{i} = 1} Y_{i}^{o b s} - \frac{1}{n_{0}} \sum_{i : T_{i} = 0} Y_{i}^{o b s} .

We know that if there is selection bias¹⁸ due to observed and unobserved confounders, this estimate will be a biased estimate of Δ. Since treatment is not randomized, we have to account for the pre-treatment differences between those with T_i = 0 and T_i = 1. If we assume that

T_{i} ⊥ Y_{1 i}, Y_{0 i} ∣ X_{i},

(A1)

(e.g., the treatment is strongly ignorable (or has unconfoundedness) with respect to X_i and thus, there are no unobserved confounders) then we can estimate Δ through propensity score weighting where p_i = P(T_i = 1|X_i) and

{\hat{Δ}}^{p s} = \frac{\sum_{i : T_{i} = 1} Y_{i}^{o b s} W_{i}}{\sum_{i : T_{i} = 1} W_{i}} - \frac{\sum_{i : T_{i} = 0} Y_{i}^{o b s} W_{i}}{\sum_{i : T_{i} = 0} W_{i}}

where

W_{i} = \{\begin{array}{l} 1 / p_{i} & if T_{i} = 1 \\ 1 / (1 - p_{i}) & if T_{i} = 0 \end{array}

Now let

\hat{λ} = {\hat{Δ}}^{naive} - {\hat{Δ}}^{p s}

such that $\hat{λ}$ is an estimate of the the selection bias attributable to this set of observed confounders (referred to here as the observed selection bias), X_i. Importantly, the estimated $\hat{λ}$ Will only be an estimate of the selection bias and may in truth reflect a combination of both selection bias and model misspecification, depending on the use of any models to estimate ${\hat{Δ}}^{p s}$ . If there is no selection bias due to the observed confounders used in our analysis (and no unobserved confounders), then λ = 0. However, in most practical applications selection bias due to observed confounders will exist and thus, λ ≠ 0. If both treatment effect estimates are in the same direction, then $\hat{λ} > 0$ implies that the naive treatment effect underestimates the true treatment effect and $\hat{λ} < 0$ implies that it overestimates the true treatment effect. Thus, $\hat{λ}$ provides us with an estimate of the selection bias attributable to the observed pre-treatment confounders included in the propensity score model. Our aim is to determine how much of the observed selection bias is explained by each observed confounder included in X_i. Thus, for our HD example, it is of interest to understand how much of the observed selection bias in the naive estimate of alcohol’s impact on HD severity is driven by each observed confounder included in X_i (i.e., CAG repeats, baseline HD severity, age, education, drug use, and antidepressant use). In Sections 2.3 and 2.4, we present two straightforward ways to quantify the amount of the observed selection bias accounted for by each observed confounder.

2.2 |. Assumptions

There are four common assumptions that are required for robust inference when utilizing propensity scores to estimate treatment effects: consistency, unconfoundedness, positivity, and no interference. Consistency requires that a subject’s potential outcome under the treatment received (here, their observed treatment) equals the subject’s observed outcome. Unconfoundesness (or strong ignorability)¹⁹ requires that there are no unobserved confounders that have not been included in the covariates used to estimate weights and is equivalent to (A1) defined earlier. Thus, assumption (A1) assumes that all confounders are captured in X and that therefore, there are no unmeasured confounders. Positivity assumes that each individual has a positive probability of receiving the treatment i.e., that no individual (in the analysis) has probability equal to 1 or 0 of receiving the treatment/being in the treatment group. This is also commonly referred to as having overlap or positivity between the treatment and control groups on all pre-treatment confounders and can be written as

0 < p_{i} < 1 \forall i .

(A2)

Finally, no interference assumes that outcomes for individuals are not dependent on one another. Under these assumptions, ${\hat{Δ}}^{p s}$ will be a consistent estimate of Δ.⁴

2.3 |. Single confounder removal

In our first approach, referred to here as our single confounder removal approach, the goal is to estimate the bias that would result in the adjusted treatment effect if one were to remove the confounder of interest (in essence assuming that the full adjusted estimate is unbiased). Thus, in the HD example, we wish to understand the impact on the propensity score weighted effect estimate for alcohol use on HD severity if one excludes key confounders such as CAG from their analysis. A priori, one would expect removal of CAG to have a big impact due to the fact that it is one of the strongest available predictors of HD disease onset and progression.²⁰ We define this adjusted treatment effect that excludes the confounder X_j as:

{\hat{Δ}}^{p s} (- X_{j}) = \frac{\sum_{i : T_{i} = 1} Y_{i}^{o b s} W_{i} (- X_{j})}{\sum_{i : T_{i} = 1} W_{i} (- X_{j})} - \frac{\sum_{i : T_{i} = 0} Y_{i}^{o b s} W_{i} (- X_{j})}{\sum_{i : T_{i} = 0} W_{i} (- X_{j})}

where p_i(−X_j) = P(T_i = 1|X_i\{X_ji}), X_i\{X_ji} denotes the vector X_i with the element X_ji removed, and

W_{i} (- X_{j}) = \{\begin{array}{l} 1 / p_{i} (- X_{j}) & if T_{i} = 1 \\ 1 / (1 - p_{i} (- X_{j})) & if T_{i} = 0. \end{array}

We then define

\hat{λ} (- X_{j}) = {\hat{Δ}}^{p s} - {\hat{Δ}}^{p s} (- X_{j})

such that $\hat{λ} (- X_{j})$ estimates the shift captured by the removed confounder X_j away from the fully adjusted treatment effect estimate. Therefore, the proportion of the observed selection bias explained by the removed confounder X_j can be represented by

R (- X_{j}) = \frac{\hat{λ} (- X_{j})}{\hat{λ}} .

Unfortunately, this approach would only be useful if the bias explained by each confounder is additive, independent, and in the same direction. An alternative way to summarize the proportion explained so that it can accommodate more complex situations would be to define the total bias due to removal of the included observed covariates, β^R as

β^{R} = \sum_{j = 1}^{k} |\hat{λ} (- X_{j})|

and the proportion of the observed selection bias explained by confounder X_j as

\hat{B} (- X_{j}) = \frac{|\hat{λ} (- X_{j})|}{β^{R}} .

Certainly, it will not necessarily be the case that β^R will be equal to $\hat{λ}$ , but is used as a tool to understand the contribution of each confounder to the estimated selection bias. This notation can easily be extended to handle the removal of more than one confounder, if desired. For example, one can define and estimate ${\hat{Δ}}^{p s} (- \{X_{j}, X_{m}, X_{n}\})$ to examine the bias captured by the removed confounders X_j, X_m, X_n.

2.4 |. Single confounder inclusion

In our second approach, referred to as single confounder inclusion, we aim to estimate the bias removed from the naive treatment effect if one were to use a propensity score weighted treatment effect that only adjusted for a single observed pre-treatment covariate (e.g, say CAG repeats for HD). In this case, we define the propensity score weighted treatment effect that only adjusts for one covariate at a time as follows:

{\hat{Δ}}^{p s} (X_{j}) = \frac{\sum_{i : T_{i} = 1} Y_{i}^{o b s} W_{i} (X_{j})}{\sum_{i : T_{i} = 1} W_{i} (X_{j})} - \frac{\sum_{i : T_{i} = 0} Y_{i}^{o b s} W_{i} (X_{j})}{\sum_{i : T_{i} = 0} W_{i} (X_{j})}

where p_i(X_j) = P(T_i = 1|X_ji)

W_{i} (X_{j}) = \{\begin{array}{l} 1 / p_{i} (X_{j}) & if T_{i} = 1 \\ 1 / (1 - p_{i} (X_{j})) & if T_{i} = 0. \end{array}

We then define

\hat{λ} (X_{j}) = {\hat{Δ}}^{naive} - {\hat{Δ}}^{p s} (X_{j})

such that $\hat{λ} (X_{j})$ estimates the shift in the naive estimate captured by the observed confounder X_j. Similar to the above, we define the total bias due to inclusion of a single observed covariate, β^I as

β^{I} = \sum_{j = 1}^{k} |\hat{λ} (X_{j})|

and the proportion of this total bias explained by confounder X_j as

\hat{B} (X_{j}) = \frac{|\hat{λ} (X_{j})|}{β^{I}} .

2.5 |. Choosing an approach

These two proposed approaches lead to different measures that quantify the influence of a particular variable, except in the case where all observed variables included in the propensity score model are independent and have an additive effect, which is not likely in practice. The selected approach will often depend on the substantive question of interest. If one is interested in a variable that is costly or invasive to measure, the confounder removal approach is likely preferred since the substantive question is whether a treatment effect can be robustly estimated without use of this variable so that future studies can avoid obtaining the measurement. If one is interested in identifying a variable that may be actionable i.e., to inform policy-relevant decisions, then the confounder inclusion approach is likely preferred. Here, the substantive question is - does this variable explain a substantial proportion of the observed selection bias? In our numerical studies we illustrate and discuss the different results obtained by these two approaches.

3 |. ESTIMATION

3.1 |. Estimation of Proposed Quantities

To estimate $\hat{λ} (- X_{j})$ , $\hat{λ} (X_{j})$ , $\hat{B} (- X_{j})$ and $\hat{B} (X_{j})$ , one may consider a variety of approaches to estimate the propensity score weights. Here, we describe a nonparamatric approach and a parametric approach to estimate the needed propensity score weights for each of our ${\hat{Δ}}^{p s} (X_{j})$ , ${\hat{Δ}}^{p s} (- X_{j})$ , and ${\hat{Δ}}^{p s}$ quantities.

Considering a nonparametric approach, we propose using a kernel density estimator when p_i is to be estimated with only a single confounder, and a generalized boosted model (GBM) when p_i is to be estimated using more than one confounder. Specifically, when p_i is to be estimated with only a single continuous confounder, X_j, we propose to use the Nadaraya-Watson estimator of the conditional mean,²¹

{\tilde{p}}_{i} (X_{j}) = \frac{\sum_{l = 1}^{n} K_{h} (X_{l j} - X_{i j}) T_{l}}{\sum_{l = 1}^{n} K_{h} (X_{l j} - X_{i j})}

where K(·) is a smooth symmetric density function with finite support, K_h(·) = K(·/h)/h and h is a specified bandwidth. In our numerical examples, we use a normal density for K(·) and use the rule-of-thumb suggested by Scott²² for the bandwidth h such that h = 1.06 min(σ, IQR/1.34) * n⁻² where σ and IQR are the standard deviation and interquartile range of X_j, respectively. This estimate, ${\tilde{p}}_{i} (X_{j})$ can then be used to obtain all subsequent quantities. When the confounder X_j is discrete, one can simply estimate p_i(X_j) as the average T_i within each category of X_j.

When p_i is to be estimated with more than one confounder, we propose to use GBM which is a nonparametric approach to model outcomes (binary, discrete, or continuous) that allows for interactions among covariates and flexible functional forms for the regression surface.²³ GBM approximates the regression surface through a piecewise constant model, in which the regression surface is constant over regions of the covariate space. The fitting algorithm involves partitioning the covariate space and assigning values to constant functions in the selected regions. Model building is automated through an iterative algorithm that adds terms to maximize the likelihood conditional on the model chosen through the previous iterations. GBM has been shown to have less bias than traditional regression approaches due to how it adaptively captures the functional form of the relationship between the covariates and treatment.^24,6,25 For more details on GBM for propensity score estimation, see McCaffrey et al.⁶ and Burgette et al..²⁶ We implement GBM here using the twang package in R to obtain ${\tilde{p}}_{i} (\{X_{j}, X_{m} \dots, X_{n}\})$ which estimates p_i({X_j, X_m…, X_n}) = P(T_i = 1|{X_ij, X_im…, X_in}) where {X_ij, X_im…, X_in} is any subset of X_i with size greater than 1.²⁷

Such a nonparametric approach can be an attractive choice because it allows us to avoid any reliance on correct model specification, though this comes at a cost if we are in a setting in which a parametric model is correctly specificied and it can be used to estimate the quantities of interest. A straightforward parametric estimation approach to estimate our quantities would be to use simple logistic regressions to estimate all propensity scores p_i, p_i(−X_j) and p_i(X_j). That is, to estimate p_i, we assume that the specified logistic model holds and use the resulting predicted probability that Y_i = 1 as an estimate, ${\hat{p}}_{i}$ , of p_i. A similar process can be used to obtain estimates of p_i(−X_j) and p_i(X_j), denoted as ${\hat{p}}_{i} (- X_{j})$ and ${\hat{p}}_{i} (X_{j})$ . It is also possible to expand the logistic model to include interaction and nonlinear terms, if desired. We recommend expanding the model on an as needed basis if balance after propensity score weighting is not optimal (see Section 3.3) in order to obtain better quality weights.

3.2 |. Asymptotic Properties and Variance Estimation

Under the assumptions stated in Section 2.2, and assuming the specified model used within the estimation approach (if using parametric estimation) is correct, ${\hat{Δ}}^{p s}$ is a consistent estimate of Δ.^28,29 However, the estimates, ${\hat{Δ}}^{p s} (- X_{j})$ and ${\hat{Δ}}^{p s} (X_{j})$ , are in general not converging in probability to quantities that are themselves of interest and are in fact, erred versions of Δ. Under correct model specification (unless using nonparametric estimation) these estimates are consistent with respect to these erred values, and thus allow us to obtain consistent estimates of functions of these quantities, namely $\hat{B} (- X_{j})$ and $\hat{B} (- X_{j})$ . Our use of the absolute value to create the denominator of $\hat{B} (- X_{j})$ and $\hat{B} (- X_{j})$ , function which is, of course, not continuous at 0, makes the asymptotic distribution of these estimates nonstandard. Thus, to obtain variance estimates for $\hat{B} (- X_{j})$ and $\hat{B} (- X_{j})$ we use boostrapping³⁰ and show in our simulation study that the resulting estimates approximate the empirical variance very well.

3.3 |. Balance Assessment

A critical piece to all propensity score weighted analyses is to check the quality of the propensity score weights produced. One does this by checking the comparability (or balance) of the groups after applying the propensity score weights. Extensive details on balance assessment can be found elsewhere.^31,27 In general, it is recommended to assess balance over multiple criteria. For simplicity, we focus here on computing and reporting for each covariate used in the propensity score calculation, the differences between the treatment and control group in terms of the effect size (ES) difference. Thus, for the fully adjusted treatment effect estimate $({\hat{Δ}}^{p s})$ , one would compute and examine these ES differences for all covariates included in X_i. For single confounder removal, with estimated treatment effects ${\hat{Δ}}^{p s} (- X_{j})$ , one would compute ES differences for all covariates in X_i except for the removed covariate X_j since balance should not be expected on this removed confounder. For single confounder inclusion, with estimated treatment effects ${\hat{Δ}}^{p s} (X_{j})$ , one would only compute the ES difference for the single covariate, X_j. In our analyses, we consider propensity score weights to be adequate when the ES differences between the two groups are all less than 0.1.

4 |. SIMULATION STUDY

4.1 |. Simulation Goals

The goals of our simulation study were to assess (1) the performance of our proposed estimation procedures in terms of estimating the proportion of bias explained by individual confounders in various settings, (2) whether our proposed approaches can correctly identify the most and least important confounder(s) with respect to explaining selection bias, (3) whether the removal and inclusion approaches are equivalent when the covariates are additive and independent, (4) the impact of correlated confounders on substantive conclusions, and lastly, (5) the impact of a situation in which one does not start with an unbiased treatment effect estimate i.e., if ${\hat{Δ}}^{p s}$ is a biased estimate of Δ.

4.2 |. Simulation Setup

Our simulation study setup mirrors the setup used by Setodji et al. and others.^32,33,24 All simulations include 5 covariates, use a sample size of 2000, use bootstrapping to obtain standard error estimates, and summarize results over 1000 replications. In simulation setting 1, covariates X₁ and X₃ were generated from a standard normal distribution and then dichotomized using the mean as the threshold; covariates X₂, X₄, and X₅ were generated from a standard normal distribution. Treatment assignment T was generated such that T ~ Bernoulli(p) with p = 1/[1+exp{−f(X)}] where in setting 1: f(X) = 0.8X₁ + 0.25X₂ + 0.6X₃ + 0.4X₄ + 0.1X₅. The outcome Y was generated as

Y = - 3.85 + 0.73 X_{1} + 0.36 X_{2} + 0.5 X_{3} + 0.2 X_{4} + 0.1 X_{5} + τ T

with τ = 0 i.e. there is no true treatment effect. Note that in this setting, the covariates are independent and additive in an effort to explore goal #3 described above.

In simulation setting 2, we purposefully aim to produce a setting in which there are two confounders moderately associated with treatment but very weakly associated with the outcome. In this setting, the five covariates and the treatment are generated as is described in setting 1, but the outcome is generated as

Y = - 3.85 + 0.73 X_{1} + 0.01 X_{2} + 0.5 X_{3} + 0.01 X_{4} + 0.1 X_{5} + τ T

with τ = 0, such that X₂ and X₄ are weakly associated with the outcome.

In setting 3, we introduce correlation between covariates and a more complex relationship with the treatment: covariates X₁ and X₂ were generated from a standard normal distribution and then dichotomized using the mean as the threshold; covariate X₃ was generated from a standard normal distribution; covariate X₄ and X₅ were generated such that they were correlated with X₁ and X₂ (correlation = 0.2), respectively. Treatment assignment was generated as in setting 1 but with $f (X) = 0.8 X_{1} + 0.25 X_{4} + 0.6 X_{2} + 0.4 X_{3}^{2} + 0.1 X_{5} + 0.3 X_{2} X_{4} + 0.2 X_{3} X_{4} + 0.4 X_{1} X_{2}$ . The outcome Y was generated as Y = −3.85 + 0.73X₁ + 0.36X₄ + 0.5X₂ + 0.2X₃ + 0.1X₅ + τT with τ = 0. Given the correlation between covariates, we expect our results to reflect the non-identifiability of B(X_j) and B(−X_j) in such a case, and we aim to examine the effect on resulting conclusions about the most important confounder.

In simulation setting 4, we purposefully aim to produce a setting in which there is an unmeasured confounder so that the fully adjusted treatment effect estimate is expected to be biased from the start. In this setting, the five covariates are generated as described in setting 1 but an additional covariate X₆ from a standard normal distribution is also generated. Treatment assignment was generated using f(X) = 0.8X₁ + 0.25X₂ + 0.6X₃ + 0.4X₄ + 0.1X₅ + 0.2X₆ and the outcome was generated as Y = −3.85 + 0.73X₁ + 0.36X₂ + 0.5X₃ + 0.2X₄ + 0.1X₅ + 0.2X₆ +τT. Covariate X₆ is assumed unmeasured and thus, not included in any propensity score models. The proportion of individuals in the treatment group in settings 1, 2, 3, and 4 is 0.65, 0.73, 0.65, and 0.65, respectively.

4.3 |. Simulation Results

Simulation results described here are from the parametric estimation approach; results from the nonparametric approach are provided in the Appendix. Simulation results using the single confounder removal approach are shown in Table 1 and the top panel of Figure 1. The “truth” for all quantities was calculated via a large sample estimate, using a sample size of 100,000. For all settings, the true treatment effect was 0, while the naive treatment effect ranged from 0.22–0.41 across settings (using the large sample truth). For all settings, variable 1 is correctly identified as the variable that explains the highest proportion of the selection bias, with estimates of B(−X_j) ranging from 0.36–0.61 across settings. In settings 1 and 2, the bias in our estimation approach is marginal, all less than 0.005, and confounders that explain a small proportion of the selection bias (e.g. 0.01–0.05) are correctly identified as the least important confounders. In settings 3 and 4, where we have purposefully created complexities in the data generation to examine the impact on estimation - setting 3 has correlation between variables and setting 4 has an unmeasured confounder which causes bias from the start - we see some bias as would be expected, but the bias is relatively small with 0.06 being the highest bias observed. In addition, for these two settings, even with this small amount of bias, the overall substantive conclusions would be correct i.e., that variable 1 explains the highest proportion of the selection bias. For all settings and all variables, the standard error estimates obtained using bootstrapping approximate the empirical standard errors well. Results for estimation of λ(X_j) and λ(−X_j) across all settings are provided in the Appendix.

TABLE 1.

Single Confounder Removal: simulation results for B(−X_j) using parametric estimation with n = 2000;

Variable	1	2	3	4	5
Setting 1
Truth	0.3610	0.2300	0.1880	0.1950	0.0260
Estimate	0.3640	0.2260	0.1870	0.1980	0.0250
Bias	−0.0030	0.0040	0.0000	−0.0030	0.0010
ESE	0.0360	0.0380	0.0290	0.0270	0.0130
ASE	0.0364	0.0381	0.0292	0.0265	0.0124
Setting 2
Truth	0.6090	0.0110	0.3170	0.0180	0.0460
Estimate	0.6100	0.0130	0.3140	0.0200	0.0420
Bias	−0.0020	−0.0030	0.0030	−0.0030	0.0040
ESE	0.0450	0.0100	0.0430	0.0140	0.0210
ASE	0.0463	0.0104	0.0430	0.0149	0.0194
Setting 3
Truth	0.3910	0.1920	0.0440	0.3010	0.0710
Estimate	0.4440	0.1810	0.0170	0.3300	0.0280
Bias	−0.0530	0.0110	0.0270	−0.0290	0.0440
ESE	0.0410	0.0320	0.0130	0.0430	0.0160
ASE	0.0420	0.0310	0.0141	0.0426	0.0152
Setting 4
Truth	0.3100	0.2150	0.1940	0.1970	0.0840
Estimate	0.3650	0.2270	0.1870	0.1960	0.0250
Bias	−0.0540	−0.0110	0.0060	0.0000	0.0590
ESE	0.0370	0.0390	0.0290	0.0270	0.0130
ASE	0.0370	0.0386	0.0297	0.0270	0.0127

Variable	1	2	3	4	5
Setting 1
Truth	0.3740	0.2360	0.1840	0.1840	0.0230
Estimate	0.3730	0.2300	0.1850	0.1870	0.0240
Bias	0.0000	0.0060	−0.0010	−0.0040	−0.0010
ESE	0.0400	0.0410	0.0310	0.0280	0.0140
ASE	0.0397	0.0408	0.0311	0.0281	0.0124
Setting 2
Truth	0.6240	0.0030	0.3120	0.0200	0.0410
Estimate	0.6200	0.0100	0.3090	0.0210	0.0400
Bias	0.0040	−0.0070	0.0040	−0.0010	0.0010
ESE	0.0490	0.0080	0.0470	0.0150	0.0220
ASE	0.0486	0.0101	0.0457	0.0165	0.0194
Setting 3
Truth	0.4510	0.1530	0.0100	0.3490	0.0360
Estimate	0.4410	0.1560	0.0130	0.3500	0.0400
Bias	0.0100	−0.0030	−0.0030	−0.0010	−0.0030
ESE	0.0340	0.0270	0.0100	0.0360	0.0150
ASE	0.0347	0.0259	0.0110	0.0356	0.0151
Setting 4
Truth	0.3780	0.2300	0.1870	0.1820	0.0240
Estimate	0.3740	0.2310	0.1850	0.1860	0.0250
Bias	0.0040	−0.0010	0.0020	−0.0040	−0.0010
ESE	0.0400	0.0410	0.0310	0.0280	0.0140
ASE	0.0403	0.0413	0.0315	0.0286	0.0126

	Alcohol Users	Non-Alcohol Users	ES Difference
Age, Years (Mean)	52.43	51.98	0.04
CAG repeats (Mean)	43.22	43.68	−0.14
Baseline HD Severity Level (Mean)	−0.42	−0.04	−0.44
Education (International Coding; %)
Primary	0.03	0.04	−0.08
Lower Secondary	0.19	0.23	−0.11
Upper Secondary/High School	0.26	0.33	−0.17
Post-secondary non-tertiary education	0.19	0.13	0.15
University Plus	0.33	0.26	0.15
Drug Use (%)	0.03	0.01	0.11
Antidepressant Use (%)	0.38	0.49	−0.22

Single Confounder Inclusion
	Δ^ps (X_j)	λ(X_j) [SE]	B(X_j) [SE]
Age	−0.47	0.01 [0.01]	0.01 [0.01]
CAG	−0.42	−0.04 [0.01]	0.08 [0.02]
Baseline HD Severity	−0.05	−0.41 [0.04]	0.74 [0.03]
Education	−0.40	−0.06 [0.01]	0.11 [0.02]
Drug Use	−0.46	0.00 [0.00]	0.00 [0.00]
Antidepressant use	−0.43	−0.03 [0.01]	0.05 [0.02]
Single Confounder Removal
	Δ^ps (X_j)	λ(X_j) [SE]	B(X_j) [SE]
Age	−0.07	−0.02 [0.01]	0.08 [0.03]
CAG	−0.08	−0.01 [0.01]	0.04 [0.02]
Baseline HD Severity	−0.31	0.22 [0.03]	0.84 [0.04]
Education	−0.08	−0.01 [0.01]	0.03 [0.02]
Drug Use	−0.09	0.00 [0.00]	0.00 [0.01]
Antidepressant use	−0.09	0.00 [0.00]	0.01 [0.01]

Variable	1	2	3	4	5
Setting 1
Truth	−0.1440	−0.0920	−0.0750	−0.0780	−0.0110
Estimate	−0.1430	−0.0890	−0.0740	−0.0780	−0.0100
Bias	0.0000	−0.0020	−0.0010	0.0000	−0.0010
ESE	0.0180	0.0190	0.0130	0.0120	0.0060
ASE	0.0183	0.0184	0.0132	0.0122	0.0055
Setting 2
Truth	−0.1460	−0.0030	−0.0760	−0.0040	−0.0110
Estimate	−0.1430	−0.0030	−0.0740	−0.0040	−0.0100
Bias	−0.0030	0.0000	−0.0020	0.0000	−0.0010
ESE	0.0170	0.0030	0.0120	0.0040	0.0050
ASE	0.0175	0.0031	0.0124	0.0046	0.0052
Setting 3
Truth	−0.1860	−0.0910	−0.0210	−0.1430	−0.0340
Estimate	−0.1540	−0.0630	0.0040	−0.1150	−0.0090
Bias	−0.0320	−0.0290	−0.0250	−0.0290	−0.0240
ESE	0.0190	0.0120	0.0060	0.0200	0.0060
ASE	0.0188	0.0122	0.0065	0.0193	0.0060
Setting 4
Truth	−0.1820	−0.1270	−0.1140	−0.1160	−0.0490
Estimate	−0.1410	−0.0880	−0.0720	−0.0760	−0.0100
Bias	−0.0410	−0.0390	−0.0410	−0.0400	−0.0400
ESE	0.0180	0.0180	0.0120	0.0120	0.0050
ASE	0.0183	0.0183	0.0131	0.0122	0.0055

Variable	1	2	3	4	5
Setting 1
Truth	0.1270	0.0800	0.0620	0.0620	0.0080
Estimate	0.1270	0.0790	0.0630	0.0640	0.0080
Bias	0.0000	0.0020	−0.0010	−0.0010	0.0000
ESE	0.0170	0.0170	0.0110	0.0100	0.0050
ASE	0.0166	0.0165	0.0115	0.0100	0.0047
Setting 2
Truth	0.1340	−0.0010	0.0670	−0.0040	0.0090
Estimate	0.1320	0.0000	0.0660	−0.0030	0.0090
Bias	0.0020	0.0000	0.0020	−0.0010	0.0000
ESE	0.0170	0.0030	0.0110	0.0050	0.0050
ASE	0.0166	0.0028	0.0115	0.0045	0.0047
Setting 3
Truth	0.1930	0.0650	−0.0040	0.1500	0.0160
Estimate	0.1860	0.0660	−0.0030	0.1480	0.0170
Bias	0.0070	0.0000	−0.0010	0.0010	−0.0010
ES2	0.0200	0.0120	0.0060	0.0200	0.0070
ASE	0.0200	0.0117	0.0061	0.0203	0.0068
Setting 4
Truth	0.1270	0.0770	0.0630	0.0610	0.0080
Estimate	0.1250	0.0780	0.0620	0.0620	0.0080
Bias	0.0020	0.0000	0.0010	−0.0010	0.0000
ESE	0.0160	0.0160	0.0110	0.0100	0.0050
ASE	0.0165	0.0164	0.0114	0.0101	0.0047

Variable	1	2	3	4	5
Setting 1
Truth	0.347	0.225	0.187	0.196	0.044
Estimate	0.401	0.215	0.189	0.174	0.022
Bias	−0.054	0.011	−0.001	0.022	0.022
ESE	0.044	0.045	0.035	0.032	0.016
Setting 2
Truth	0.580	0.026	0.305	0.030	0.059
Estimate	0.608	0.037	0.293	0.042	0.020
Bias	−0.029	−0.010	0.011	−0.012	0.040
ESE	0.052	0.018	0.048	0.021	0.016
Setting 3
Truth	0.389	0.219	0.031	0.296	0.065
Estimate	0.436	0.184	0.072	0.286	0.022
Bias	−0.046	0.035	−0.041	0.010	0.042
ESE	0.050	0.036	0.035	0.047	0.018
Setting 4
Truth	0.298	0.211	0.198	0.195	0.098
Estimate	0.400	0.216	0.189	0.173	0.022
Bias	−0.102	−0.005	0.009	0.021	0.076
ESE	0.046	0.047	0.037	0.032	0.017

Variable	1	2	3	4	5
Setting 1
Truth	0.374	0.233	0.182	0.184	0.026
Estimate	0.386	0.220	0.192	0.177	0.024
Bias	−0.012	0.013	−0.010	0.007	0.002
ESE	0.040	0.040	0.031	0.027	0.013
Setting 2
Truth	0.629	0.001	0.309	0.018	0.043
Estimate	0.615	0.016	0.309	0.022	0.037
Bias	0.013	−0.015	0.000	−0.005	0.006
ESE	0.047	0.012	0.044	0.016	0.023
Setting 3
Truth	0.444	0.154	0.029	0.334	0.039
Estimate	0.458	0.162	0.030	0.315	0.036
Bias	−0.013	−0.008	−0.001	0.020	0.002
ESE	0.037	0.026	0.020	0.035	0.019
Setting 4
Truth	0.377	0.224	0.192	0.185	0.022
Estimate	0.391	0.217	0.193	0.173	0.026
Bias	−0.013	0.007	−0.001	0.012	−0.004
ESE	0.042	0.041	0.033	0.029	0.016

Variable	1	2	3	4	5
Setting 1
Truth	−0.149	−0.097	−0.081	−0.084	−0.019
Estimate	−0.106	−0.057	−0.050	−0.046	0.004
Bias	−0.043	−0.040	−0.030	−0.038	−0.023
ESE	0.015	0.014	0.010	0.009	0.006
Setting 2
Truth	−0.145	−0.007	−0.076	−0.007	−0.015
Estimate	−0.109	0.007	−0.053	0.007	−0.001
Bias	−0.036	−0.013	−0.024	−0.015	−0.014
ESE	0.015	0.003	0.010	0.004	0.004
Setting 3
Truth	−0.187	−0.105	−0.015	−0.142	−0.031
Estimate	−0.122	−0.052	0.020	−0.080	0.003
Bias	−0.065	−0.053	−0.035	−0.062	−0.034
ESE	0.016	0.011	0.010	0.016	0.007
Setting 4
Truth	−0.188	−0.134	−0.125	−0.123	−0.062
Estimate	−0.105	−0.057	−0.049	−0.045	0.004
Bias	−0.084	−0.077	−0.076	−0.078	−0.066
ESE	0.015	0.014	0.010	0.009	0.006

Variable	1	2	3	4	5
Setting 1
Truth	0.128	0.080	0.062	0.063	0.009
Estimate	0.127	0.073	0.063	0.058	0.008
Bias	0.001	0.007	−0.001	0.005	0.001
ESE	0.016	0.016	0.011	0.009	0.005
Setting 2
Truth	0.133	0.000	0.065	−0.004	0.009
Estimate	0.131	0.000	0.066	−0.003	0.008
Bias	0.001	0.000	−0.001	−0.001	0.001
ESE	0.016	0.004	0.011	0.005	0.005
Setting 3
Truth	0.189	0.065	−0.012	0.142	0.016
Estimate	0.187	0.066	−0.010	0.129	0.015
Bias	0.001	−0.001	−0.002	0.013	0.002
ESE	0.019	0.011	0.011	0.018	0.008
Setting 4
Truth	0.124	0.073	0.063	0.061	0.007
Estimate	0.125	0.070	0.062	0.056	0.008
Bias	−0.002	0.004	0.001	0.005	0.000
ESE	0.016	0.016	0.011	0.010	0.006

Variable	1	2	3	4	5
Setting 5
Truth	0.0040	0.3000	0.0830	0.3040	0.3090
Estimate	0.0070	0.3060	0.0780	0.3050	0.3050
Bias	−0.0020	−0.0060	0.0050	0.0000	0.0040
ESE	0.0060	0.0300	0.0190	0.0310	0.0300
ASE	0.0067	0.0313	0.0193	0.0313	0.0312
Setting 6
Truth	0.1950	0.2810	0.2360	0.2580	0.0300
Estimate	0.1890	0.2850	0.2410	0.2520	0.0330
Bias	0.0050	−0.0040	−0.0040	0.0060	−0.0030
ESE	0.0360	0.0480	0.0390	0.0350	0.0170
ASE	0.0384	0.0487	0.0383	0.0356	0.0166
Setting 7
Truth	0.3960	0.2490	0.2180	0.1110	0.0270
Estimate	0.4050	0.2570	0.2150	0.1080	0.0140
Bias	−0.0090	−0.0080	0.0020	0.0020	0.0130
ESE	0.0410	0.0430	0.0330	0.0260	0.0100
ASE	0.0412	0.0426	0.0333	0.0253	0.0095
Setting 8
Truth	0.4300	0.1310	0.3430	0.0490	0.0480
Estimate	0.5130	0.0950	0.3500	0.0290	0.0130
Bias	−0.0830	0.0360	−0.0070	0.0190	0.0350
ESE	0.0910	0.0660	0.0840	0.0210	0.0110
ASE	0.0938	0.0764	0.0823	0.0269	0.0127

Variable	1	2	3	4	5
Propensity score model
Regression Coefficient	0.4018	0.2528	0.2977	0.3999	0.0992
Outcome model
Regression Coefficient	0.3648	0.0007	0.4002	0.1994	0.0002

Variable	1	2	3	4	5
Single confounder removal
Truth	0.420	0.003	0.349	0.227	0.001
Estimate	0.416	0.012	0.341	0.226	0.005
Bias	0.004	−0.009	0.008	0.001	−0.004
ESE	0.041	0.009	0.043	0.031	0.005
Single confounder inclusion
Truth	0.424	0.015	0.349	0.210	0.002
Estimate	0.421	0.016	0.344	0.214	0.004
Bias	0.003	−0.001	0.004	−0.004	−0.002
ESE	0.045	0.011	0.047	0.032	0.004

PERMALINK

Quantifying the Bias due to Observed Individual Confounders in Causal Treatment Effect Estimates

Layla Parast

Beth Ann Griffin

Abstract

1 |. INTRODUCTION

2 |. NOTATION, DEFINITIONS, AND PROPOSED QUANTITIES

2.1 |. Notation and Definitions

2.2 |. Assumptions

2.3 |. Single confounder removal

2.4 |. Single confounder inclusion

2.5 |. Choosing an approach

3 |. ESTIMATION

3.1 |. Estimation of Proposed Quantities

3.2 |. Asymptotic Properties and Variance Estimation

3.3 |. Balance Assessment

4 |. SIMULATION STUDY

4.1 |. Simulation Goals

4.2 |. Simulation Setup

4.3 |. Simulation Results

TABLE 1.

FIGURE 1.

TABLE 2.

FIGURE 2.

5 |. HUNTINGTON’S DISEASE APPLICATION

5.1 |. Data

5.2 |. Results

TABLE 3.

TABLE 4.

6 |. DISCUSSION

ACKNOWLEDGEMENTS

APPENDIX

ADDITIONAL SIMULATION RESULTS

TABLE A1.

TABLE A2.

TABLE A3.

TABLE A4.

TABLE A5.

TABLE A6.

TABLE A7.

TABLE A8.

TABLE A9.

TABLE A10.

TABLE A11.

TABLE A12.

MODELS WITH STANDARDIZED COVARIATES

TABLE A13.

TABLE A14.

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases