Estimating the causal effects of exposure mixtures: a generalized propensity score method

Qian Gao; Ting Li; Guiming Zhu; Juping Wang; Kexin Qiu; Liangpo Liu; Xiujuan Yang; Tong Wang

doi:10.1186/s12874-025-02673-4

. 2025 Sep 29;25:221. doi: 10.1186/s12874-025-02673-4

Estimating the causal effects of exposure mixtures: a generalized propensity score method

Qian Gao ¹, Ting Li ¹, Guiming Zhu ¹, Juping Wang ¹, Kexin Qiu ³, Liangpo Liu ², Xiujuan Yang ², Tong Wang ^1,^✉

PMCID: PMC12482879 PMID: 41023872

Abstract

Background

In environmental epidemiology and many other fields, estimating the causal effects of multiple concurrent exposures holds great promise for driving public health interventions and policy changes. Given the predominant reliance on observational data, confounding remains a key consideration, and generalized propensity score (GPS) methods are widely used as causal models to control measured confounders. However, current GPS methods for multiple continuous exposures remain scarce.

Methods

We proposed a novel causal model for exposure mixtures, called nonparametric multivariate covariate balancing generalized propensity score (npmvCBGPS). A simulation study examined whether npmvCBGPS, an existing multivariate GPS (mvGPS) method, and a linear regression model for the outcome can accurately and precisely estimate the effects of exposure mixtures in a variety of common scenarios. An application study illustrated the analysis of the causal role of per- and polyfluoroalkyl substances (PFASs) on BMI.

Results

The npmvCBGPS achieved acceptable covariate balance in all scenarios. The estimates were close to the true value as long as either the exposure or the outcome model was correctly specified, and the results were less impacted by correlations among mixture components. The accuracy and precision of mvGPS and the linear regression model relied on the correctly specified exposure model and outcome model, respectively. The npmvCBGPS outperformed mvGPS in all scenarios. The npmvCBGPS achieved better covariate balance than mvGPS and provided an overall inverse trend between the PFAS mixtures with BMI.

Conclusions

In this study, we proposed npmvCBGPS to accurately estimate the causal effects of multiple exposure mixtures on health outcomes. Our approach is applicable across various domains, with a particular emphasis on environmental epidemiology.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-025-02673-4.

Keywords: Multiple exposures, Mixtures, Exposure-response, Causal model, Generalized propensity score

Background

Diseases with a heavy global burden are largely predicted by behavioral, nutritional, occupational, and environmental exposures [1–4], many of which are modifiable and, consequently, may be potential intervention targets [5]. Among these risk factors, even environmental exposures are complex, and these include chemicals from the natural and built environment, such as air and water pollution [6]. In the real world, given that populations are simultaneously exposed to complex multi-pollutant mixtures, there has been a paradigm shift from the “one-exposure-one-disease” framework to evaluating health effects of exposure mixtures [1, 3]. Each of these exposure components may act independently, synergistically, or antagonistically [7]. Focusing on the relationships between health outcomes and multiple simultaneous exposures is advantageous, including the ability to estimate cumulative health effects and provide evidence for more realistic and potentially more effective public health interventions [1, 8].

The estimation of health effects of exposure mixtures comes with numerous challenges; for instance, the typically high correlations between exposures, and the difficulty in capturing interactions and estimating causal relationships [9–11]. Many epidemiological studies have traditionally focused on estimating the health effects of a single exposure [12], such as perfluoroalkyl and polyfluoroalkyl substances [13, 14]. However, these classical single-exposure analyses cannot determine whether the observed association is attributable to the exposure of interest or to another related exposure not considered in the analysis. They also fail to capture the interactive and cumulative effects of exposure mixtures [11]. Recently, several statistical methods have been developed to address some of the challenges in estimating the health effects of exposure mixtures. For example, both weighted quantile sum regression (WQSR) and Bayesian kernel machine regression (BKMR) are robust in addressing collinearity, and BKMR is also able to estimate nonlinear effects [15, 16]. These two methods have been increasingly used since their proposal [12]. However, these two methods are types of regression models and lack clarity regarding the underlying assumptions and definitions necessary for making causal claims about effects [17–19].

Given the predominant reliance on observational data, a major barrier to estimating the causal effects of exposure mixtures is confounding. Classical causal inference methods for confounding adjustment, such as g-computation and generalized propensity score (GPS) methods, have been extended to multiple continuous exposures [7, 20]. Keil et al. proposed quantile g-computation, which combined WQS regression and g-computation [7]. Thilakaratne et al. estimated the associations of nonessential and essential metals with child cognition by using quantile g-computation [kara21]. Williams et al. proposed the multivariate generalized propensity score (mvGPS) method [20]. Traini et al. evaluated the causal effects of air pollutant mixtures on overall mortality [17]. Cai et al. built mvGPS models to provide mixture effects of endocrine-disrupting chemicals on metabolic outcomes in adolescents [22]. However, both mvGPS and quantile g-computation are sensitive to model misspecification [17, 23]. In reality, it is usually unknown which model to use to decipher the mechanisms underlying the relationship between confounders and health outcomes or exposures. Oulhote et al. combined super learner (SL) and g-computation (SL-Gcomp) to reduce potential model misspecifications, but SL ignores the causal structure, and the choice of prediction algorithms, and the number of folds of cross-validation in SL may affect the bias and variance of the estimates [24–26]. In this study, we developed a novel GPS approach for handling potential model misspecification, which we call nonparametric multivariate covariate balancing generalized propensity score (npmvCBGPS). The imbalance of confounders across different levels of exposure is a source of confounding bias [23]. The npmvCBGPS method is a weighting method and avoids specifying the GPS/exposure model by directly obtaining weights that optimize “covariate balance”.

This article is organized as follows. First, we conduct a brief review of the inverse probability of weighting (IPW) method. Subsequently, we introduce the proposed npmvCBGPS approach. Following this, we undertake a simulation study to evaluate the validity of the npmvCBGPS. After the analysis, we provide a detailed illustration of the practical application of the npmvCBGPS. Finally, we discuss the results and present conclusions.

Methods

Overview of the inverse probability of weighting method

Inverse probability of weighting using propensity score

GPS methods are an extension of propensity score (PS) methods, which were originally designed for estimating the causal effect of a single binary exposure [27]. PS methods are widely used in environmental health studies [9]. The inputs of PS methods consist of an observed outcome (Y), an exposure (T), and a set of covariates (X) including confounders and prognostic covariates (those only predicting the outcome). PS methods start by constructing a PS model (or exposure model), and a commonly used model is a logistic model expressed by:

where Inline graphic is the model intercept, and is the coefficient of the jth covariate. The output of the PS model is the conditional probability that the ith individual is assigned to the exposure group (T = 1) given . This conditional probability is called the propensity score (PS). When the PS is correctly calculated, the PS will be a balancing score meaning that covariates will be balanced between the exposure group (T = 1) and the control group (T = 0) in subsamples with similar PS [28]. Four different methods using the PS have been widely used to control for measured confounders: propensity score matching, stratification on the propensity score, covariate adjustment using the propensity score, and inverse probability of weighting using the propensity score. Among these PS methods, IPW is increasingly preferred due to its flexibility for different study designs [29].

In IPW analyses, the contribution of the ith individual is weighted by the inverse probability weighting (or balance weight) Inline graphic . The weights ensure that the total contribution is the same between the exposed and control groups for a particular value of the PS [30]. For example, assume there are 10 individuals with a PS of 0.6, including 6 in the exposed group and 4 in the control group. The weight is for each individual in the exposed group and Inline graphic for each individual in the control group. The sum of weights of 6 individuals in the exposed group, =10 equals the sum of weights of 4 individuals in the control group, =10. Thus, IPW generates a pseudo-population in which covariates are balanced between exposed and control groups without loss of sample. Eventually, we can estimate the average exposure effect by building a marginal structural model expressed as:

where Inline graphic is the support of the exposure and is the potential outcome. For a binary exposure, and is the average exposure effect of interest. There are a pair of potential outcomes for each individual: , which is the outcome they would have had if they had been exposed, and , which is the outcome they would have had if they had been unexposed. However, in reality, only one outcome can be observed for an individual. Fortunately, under the standard causal identification assumptions (as described in the following paragraph), the observed outcome Y and potential outcome Inline graphic have the following relationship:

where Inline graphic is the inverse probability weighting. Therefore, when PS is correctly calculated, we can identify an estimate of by fitting a weighted univariate linear model of the observed outcome Y on the exposure T as follows:

Four assumptions are required to identify causal effects from observational data. Consistency means the observed Inline graphic equals the potential outcome corresponding to the actual exposure status, i.e., , for . The stable unit treatment value assumption means no interference among observations. The positivity assumption, for means all subjects have a non-zero probability of being assigned to any exposure level given Inline graphic . The no unmeasured confounding assumption for means all covariates affecting both exposure and outcome have been measured and controlled for. When T is multidimensional, the no unmeasured confounding assumption needs to hold for each element in T.

Inverse probability of weighting using generalized propensity score

When the exposure is continuous, GPS is required for IPW analyses. GPS is defined as the conditional probability density of an individual being exposed to a particular level given covariates [31]. GPS can be estimated by constructing a GPS model (or exposure model), such as a linear model expressed by:

where Inline graphic is a mean-zero random error and follows a normal distribution. In this case, GPS for the ith individual can be expressed using the probability density function of the normal distribution: . The contribution of the ith individual is weighted by , where the denominator is GPS and the numerator is the marginal probability density of the exposure variable, such as a normal distribution. Inline graphic is a stabilized weight to reduce extreme weights [32]. It is worth noting that valid causal estimates of GPS methods not only rely on standard causal identification assumptions, but also depend on adequate specification of GPS model, including the correct specification of both the function of covariates (e.g., Inline graphic in Eq. 5) and the distribution of the random error (e.g., a normal distribution for in Eq. 5). Any misspecification in either the function of covariates or the distribution of the random error is considered to be a model misspecification. Likewise, the causal parameter (e.g., in Eq. 4) and the dose-response function (e.g., Inline graphic for in Eq. 4) can be identified by fitting a weighted univariate linear model of the outcome Y on the exposure T based on observed data. In practical applications, flexible models, such as spline regression and local linear regression, can be used to estimate a nonlinear dose-response function. Here, a linear model is used for better clarity of the idea of IPW.

NpmvCBGPS

The classical IPW method requires that the exposure model is adequately specified. As model misspecification is difficult to diagnose and assess, several works have focused on directly estimating IPW weights based on the fact that the correlation between the continuous exposure and covariates is the source of confounding bias [33]. Most of these methods build optimization criteria with the goal of covariate balance. Covariate balance means independence between the exposure and covariates after weighting [33]. One of the metrics commonly used to quantify covariate balance is that the weighted cross-moments between the exposure and covariates are zero. The nonparametric covariate balancing generalized propensity score (npCBGPS) method is one of these methods and is attractive because of its good performance in estimation and simple implementation [23]. In this study, we extend the npCBGPS to multiple continuous exposures. The input of the npmvCBGPS includes m continuous exposures Inline graphic (e.g., m = 4 PFASs as stated in the real data analysis), p covariates (e.g., p = 8 covariates as stated in the real data analysis), and an outcome (e.g., BMI as stated in the real data analysis), i=1,2,…,n. We begin by centering and orthogonalizing the exposures and covariates as follows, which are only used to estimate IPW weights:

where Inline graphic and are the sample mean vector and sample covariance matrix of T, respectively; is the inverse square root of . Similarly, we transform the covariates:

where Inline graphic and are the sample mean vector and sample covariance matrix of X, respectively; is the inverse square root of . There are no correlations among the transformed covariates and among the transformed exposures. Then, the stabilizing weights can be defined as:

The npmvCBGPS uses an empirical likelihood approach to choose weights that meet balancing conditions. The joint density of the ith individual in relation to the weights can be expressed as:

Thus, the likelihood function for the whole sample is expressed as:

We now choose weights Inline graphic by maximizing Eq. (6), but also require to satisfy the following constraints [23]:

(A1) , which means that and are uncorrelated after weighting (the original and are also uncorrelated). This condition guarantees that covariates are balanced across different exposure levels after weighting.
(A2) and , which mean the marginal means of and are preserved after weighting.
(A3) and , which mean weights are positive and they sum to the sample size n.

After log transformation of Eq. (6), the above is equivalent to maximizing:

subject to the above constraints. Because there are no correlations among the transformed covariates Inline graphic and among the transformed exposures , the estimation of the npmvCBGPS is simplified to finding:

subject to the above constraints. The details about solving this optimization problem can be found in the study by Fong et al. [23].

Under the causal identification assumptions, we can identify causal parameters or a dose-response function by fitting a weighted model in which the original outcome Y is regressed on the original exposure T [20]. For example, a weighted multiple linear regression model is as follows:

Simulation studies

Simulation setup

To assess the finite sample properties of the npmvCBGPS and compare it with the mvGPS method and the typical linear regression model, we conducted simulation studies under two scenarios. These simulations were designed to evaluate: (1) the accuracy and precision of causal estimates under Scenario 1, where both the exposure and outcome models were linear in covariates; (2) robustness to model misspecification under Scenario 2, where either the exposure or the outcome model was nonlinear in covariates. Additionally, we included a simple interaction among exposures in the outcome model across both scenarios to investigate the impact of nonlinear exposure effects. We modified the simulation studies of Fong et al. and Williams et al. to align with real-world environmental health studies (where environmental mixtures typically comprise three or more exposures with effects in different directions), while maintaining parsimony to elucidate the strengths and limitations of estimators [7, 20, 23]. Therefore, we considered three exposures, a continuous outcome, and three confounders that reflect different degrees of overlap in confounding for the exposures as illustrated in Fig. 1 (common confounding: Inline graphic ; partially common confounding: and ) [20]. Covariates that only predict exposures (, , ) were also considered because they may affect the variance of an estimate [34].

Fig. 1 — Directed acyclic graphs depicting three confounders

The first step of the simulation was to generate 6 covariates. Five covariates were drawn from a multivariate normal distribution, Inline graphic , with means of 0, variances of 1, and covariances of 0.2. A binary covariate was generated from a Bernoulli distribution, . The data-generating processes have been summarized in supplementary Table S1.

Scenario 1

The data-generating processes in Scenario 1 were as follows:

(A) E1Y1 and E1Y2: exposures were generated by:

where Inline graphic is the random error simulated from a multivariate normal distribution, , with means of 0, variances of 1. We set the covariance element to 0.2, 0.5, and 0.7 to explore the impact of collinearity among mixture components, which is a core challenge in mixture analysis [12].

The outcome was generated by:

where Inline graphic is the random error simulated from a standard normal distribution, , with a mean of 0 and a variance of 1. We set the exposure effects to 0.2, −0.5, 0.6, and 0.8 because they are commonly found in epidemiology [35].

Scenario 2

In Scenario 2, we generated true data under the following settings:

(B) E2Y1 and E2Y2: the outcome model was correctly specified, whereas the exposure model was misspecified.

where Inline graphic with means of 0, variances of 1, and covariances of 0.2, 0.5, and 0.7. The outcome model remained the same as in Scenario 1.

(C) E1Y3 and E1Y4: the outcome model was misspecified, whereas the exposure model was correctly specified.

where Inline graphic and , . The exposure model remained the same as in Scenario 1.

(D) E2Y3 and E2Y4: both the exposure and outcome models were misspecified.

We simulated 500 datasets for each setting with sample sizes of 200, 500, and 1000 and reported statistics related to causal parameters (coefficients of exposures): bias that evaluates the accuracy of estimates (the mean of estimates minus the true value); the root mean squared error (RMSE) that evaluates the precision of estimates, the smaller the RMSE, the higher the precision; the 95% CI coverage (the proportion of which estimated CIs include the true value); power (the proportion that hypothesis test with P < 0.05). Additionally, the average absolute weighted Pearson correlations between exposures and confounders were used to assess covariate balance, and a value of 0.1 has been suggested to be acceptable in practice [20, 36]. Covariate balance is crucial, as imbalances may result in biased estimates [23]. We developed the R package mvCBGPS available at https://github.com/QianGao-SXMU/mvCBGPS and Supplementary data to implement the npmvCBGPS method.

We compared the proposed npmvCBGPS method with the following two methods: (1) the linear regression model for the outcome that controls for all confounders; (2) mvGPS, which fits the exposure model using a multiple multivariate linear model and subsequently assumes a multivariate normal distribution for the random error to calculate balance weights.

Per- and polyfluoroalkyl substances and BMI

To test npmvCBGPS and mvGPS on real data, we used publicly available NHANES data to associate Perfluoroalkyl and Polyfluoroalkyl Substances (PFASs) with body mass index (BMI). Several epidemiologic studies of associations between PFASs with BMI have reported mixed findings, including both positive and no effects in adults [37]. The National Health and Nutrition Examination Survey (NHANES) is a nationally representative survey that measures the health and nutritional status of adults and children in the US every 2 years. The US Centers for Disease Control and Prevention has described the study procedures of NHANES in detail [38]. This study pooled cross-sectional data from the NHANES 2003 — 2018 cycles. Survey participants who met the following criteria were included: (1) aged 20 years and older; (2) with complete data on concentrations of PFASs in serum and covariates. Participants who had a positive pregnancy test at the time of the survey were excluded [14]. A total of 12,714 participants were included in the final analysis (Figure S1). NHANES received ethical approval from the National Center for Health Statistics. Written consent was obtained from all participants before their engagement in NHANES.

PFASs with a detection rate below 80% were not included in this study [13], leaving perfluorooctanoic acid (PFOA), perfluorooctane sulfonic acid (PFOS), perfluorohexane sulfonic acid (PFHxS), and perfluorononanoic acid (PFNA) for the analysis. The concentrations of PFASs were naturally log-transformed [13, 14]. There were eight covariates, including age, gender, race or ethnicity, highest educational level, marital status, cigarette smoking (based on serum cotinine level), income-to-poverty ratio, and total energy intake. We explored the linear and nonlinear relationships between PFASs and BMI using the IPW method.

Results

Simulation results

Covariate balance and Estimation under scenario 1 and scenario 2

In Scenario 1, we compared the accuracy and precision of the proposed npmvCBGPS, mvGPS method, and the linear regression model for the outcome. Figure 2 illustrates the degree of covariate balance achieved by the two GPS methods along with the original unweighted correlations for comparison. In general, both the npmvCBGPS and mvGPS methods improved the covariate balance compared with the original sample in all settings. The mean of weighted correlations for the npmvCBGPS method was less than 0.1, whereas that for the mvGPS method was greater than 0.1. Table 1 shows the summary statistics of the causal parameter estimates under Scenario 1. When both the exposure and outcome models were linear in the covariates and exposures (E1Y1), npmvCBGPS yielded causal parameter estimates with biases that were close to 0 for all three exposures. As the sample size increased, both bias and RMSE tended to decrease for the npmvCBGPS. The mean biases and RMSEs of mvGPS were higher than those of npmvCBGPS. When there was an interaction among three-dimensional exposures (E1Y2), the results for the main effects of the three exposures were similar to those in E1Y1, that is, npmvCBGPS outperformed mvGPS in terms of mean biases and RMSEs. For the interaction effect, the mean biases and RMSEs were smaller than those of the main effects. As expected, in both settings, the linear regression model yielded biases that were close to those of npmvCBGPS, but with higher precision.

Fig. 2 — The performance of covariate balance under Scenario 1

Table 1.

Mean bias and RMSE of the causal parameter estimates under Scenario 1

Settings	Method	n	T1 (0.6)		T2 (−0.5)		T3 (0.8)		T1T3 (0.2)
Settings	Method	n	bias	RMSE	bias	RMSE	bias	RMSE	bias	RMSE
E1Y1	npmvCBGPS	200	0.006	0.126	0.022	0.121	0.045	0.187	-	-
		500	0.006	0.096	0.013	0.094	0.033	0.154
		1000	0.001	0.099	0.028	0.099	0.058	0.161
	mvGPS	200	0.033	0.172	0.126	0.206	0.154	0.278	-	-
		500	0.026	0.148	0.103	0.178	0.122	0.258
		1000	0.022	0.134	0.090	0.145	0.119	0.225
	linear	200	−0.002	0.076	0.006	0.079	−0.001	0.078	-	-
		500	−0.001	0.046	0.000	0.044	−0.002	0.048
		1000	0.002	0.031	0.000	0.034	−0.003	0.032
E1Y2	npmvCBGPS	200	0.008	0.127	0.025	0.121	0.048	0.195	−0.001	0.109
		500	0.013	0.092	0.015	0.094	0.040	0.158	−0.003	0.081
		1000	0.006	0.094	0.030	0.098	0.062	0.158	−0.003	0.073
	mvGPS	200	0.040	0.168	0.129	0.205	0.182	0.293	−0.030	0.098
		500	0.038	0.158	0.104	0.174	0.150	0.239	−0.021	0.070
		1000	0.026	0.129	0.095	0.144	0.147	0.232	−0.022	0.066
	linear	200	−0.002	0.076	0.006	0.079	0.000	0.079	0.001	0.021
		500	−0.002	0.048	−0.002	0.046	−0.001	0.048	0.000	0.012
		1000	0.002	0.031	0.000	0.034	−0.003	0.032	0.000	0.009

Open in a new tab

In Scenario 2, we assessed the robustness to model misspecification of the npmvCBGPS, mvGPS, and the linear regression model. Figure 3 illustrates the degree of covariate balance in Scenario 2. In general, the npmvCBGPS method improved the covariate balance, with mean weighted correlations of less than 0.1 in all settings. Compared with the original sample, the covariate balance was improved for the mvGPS method when only the exposure model was correctly specified (E1Y3 and E1Y4), whereas when the exposure model was misspecified, the mvGPS exhibited limited balance improvement. The summary statistics of the causal parameter estimates under Scenario 2 are listed in Table 2. When only the outcome model was incorrectly specified (E1Y3 and E1Y4), results for the npmvCBGPS and mvGPS were similar to those in Scenario 1, that is, npmvCBGPS outperformed mvGPS in terms of mean biases and RMSEs, whereas the accuracy and precision decreased for the linear regression model, especially when there was an interaction among three-dimensional exposures (E1Y4). When only the exposure model was incorrectly specified (E2Y1 and E2Y2), npmvCBGPS and the linear regression model still provided estimates with mean biases that were close to 0, whereas mvGPS provided estimates with mean biases that were more likely to deviate from 0, and the RMSEs significantly increased. Not surprisingly, npmvCBGPS, mvGPS, and the linear regression model failed when neither the exposure model nor the outcome model was correctly specified (E2Y3 and E2Y4).

Fig. 3 — The performance of covariate balance under Scenario 2

Table 2.

Mean bias and RMSE of the causal parameter estimates under Scenario 2

Settings	Method	n	T1 (0.6)		T2 (−0.5)		T3 (0.8)		T1T3 (0.2)
Settings	Method	n	bias	RMSE	bias	RMSE	bias	RMSE	bias	RMSE
E1Y3	npmvCBGPS	200	0.015	0.128	0.021	0.132	0.027	0.199	-	-
		500	0.005	0.100	0.016	0.109	0.020	0.162
		1000	0.002	0.115	0.030	0.115	0.055	0.184
	mvGPS	200	0.023	0.213	0.121	0.214	0.175	0.355	-	-
		500	0.025	0.159	0.110	0.181	0.117	0.264
		1000	0.021	0.168	0.075	0.169	0.125	0.255
	linear	200	0.002	0.123	−0.005	0.128	−0.007	0.140	-	-
		500	−0.004	0.081	0.004	0.082	−0.006	0.078
		1000	0.002	0.055	0.001	0.057	0.001	0.058
E1Y4	npmvCBGPS	200	0.008	0.140	0.022	0.130	0.006	0.227	0.064	0.135
		500	0.013	0.104	0.011	0.105	−0.006	0.166	0.084	0.121
		1000	0.011	0.100	0.022	0.111	0.011	0.165	0.088	0.117
	mvGPS	200	0.038	0.220	0.128	0.222	0.178	0.343	0.016	0.108
		500	0.030	0.186	0.109	0.210	0.142	0.278	0.001	0.087
		1000	0.037	0.153	0.090	0.170	0.126	0.264	−0.015	0.076
	linear	200	−0.002	0.123	0.001	0.115	−0.108	0.167	0.205	0.210
		500	−0.001	0.076	0.001	0.074	−0.105	0.133	0.208	0.210
		1000	0.000	0.051	0.004	0.052	−0.103	0.119	0.208	0.209
E2Y1	npmvCBGPS	200	0.001	0.086	−0.002	0.088	0.010	0.218	-	-
		500	−0.002	0.068	0.001	0.069	0.004	0.211
		1000	0.003	0.055	0.002	0.060	0.005	0.209
	mvGPS	200	−0.018	0.253	0.108	0.286	−0.504	0.606	-	-
		500	−0.040	0.401	0.105	0.325	−0.539	0.674
		1000	−0.080	0.561	0.110	0.453	−0.442	0.620
	linear	200	−0.004	0.067	−0.001	0.065	0.002	0.065	-	-
		500	0.002	0.042	0.001	0.045	−0.001	0.042
		1000	0.000	0.029	0.003	0.030	−0.002	0.030
E2Y2	npmvCBGPS	200	0.028	0.109	0.007	0.088	0.061	0.199	−0.016	0.052
		500	0.035	0.089	0.012	0.069	0.104	0.179	−0.026	0.045
		1000	0.054	0.084	0.016	0.064	0.140	0.196	−0.037	0.047
	mvGPS	200	−0.062	0.277	0.132	0.277	−0.481	0.619	0.017	0.102
		500	−0.087	0.337	0.110	0.279	−0.607	0.728	0.020	0.095
		1000	−0.192	0.482	0.112	0.362	−0.613	0.777	0.033	0.095
	linear	200	0.002	0.069	−0.002	0.065	−0.004	0.074	0.000	0.010
		500	0.001	0.045	0.004	0.045	−0.003	0.043	0.000	0.006
		1000	−0.003	0.032	0.001	0.030	−0.001	0.031	0.000	0.004
E2Y3	npmvCBGPS	200	0.079	0.139	0.077	0.134	0.523	0.542	-	-
		500	0.092	0.123	0.085	0.119	0.553	0.563
		1000	0.096	0.119	0.086	0.113	0.574	0.581
	mvGPS	200	0.096	0.282	0.162	0.270	0.529	0.602	-	-
		500	0.081	0.347	0.180	0.346	0.621	0.713
		1000	0.141	0.486	0.172	0.452	0.586	0.733
	linear	200	0.270	0.282	0.261	0.273	0.268	0.278	-	-
		500	0.267	0.271	0.270	0.274	0.265	0.270
		1000	0.270	0.272	0.272	0.274	0.269	0.271
E2Y4	npmvCBGPS	200	0.027	0.119	0.069	0.131	0.363	0.400	0.055	0.066
		500	0.038	0.090	0.068	0.100	0.411	0.428	0.046	0.052
		1000	0.047	0.084	0.072	0.100	0.428	0.439	0.042	0.047
	mvGPS	200	−0.011	0.204	0.164	0.240	0.319	0.468	0.069	0.106
		500	−0.022	0.286	0.175	0.308	0.401	0.587	0.052	0.098
		1000	0.012	0.458	0.211	0.512	0.425	0.709	0.038	0.118
	linear	200	0.199	0.215	0.232	0.246	0.182	0.203	0.031	0.034
		500	0.209	0.215	0.231	0.237	0.192	0.200	0.027	0.028
		1000	0.211	0.214	0.232	0.235	0.196	0.200	0.026	0.027

Open in a new tab

Statistical testing under scenario 1 and scenario 2

Table 3 shows the results of 95% CI coverage and power under Scenario 1. The npmvCBGPS provided power greater than 92.8% and 95% CI coverage close to the nominal value of 95% in all cases except when the sample size was 1000 (87.2% − 94.8%). Compared with npmvCBGPS, mvGPS had slightly reduced power (86.2% − 98.8%), but poor 95% CI coverage (58.2% − 87.8%). As expected, the linear regression model was more powerful with power of 1 and provided 95% CI coverage that was close to the nominal value in all cases.

Table 3.

Confidence intervals and power of the causal parameter estimates under Scenario 1

Settings	Method	n	T1(0.6)		T2(−0.5)	T3(0.8)
Settings	Method	n	coverage	power	coverage	power	coverage	power
E1Y1	npmvCBGPS	200	0.938	0.968	0.942	0.928	0.942	0.942
		500	0.968	0.994	0.950	0.992	0.952	0.980
		1000	0.948	0.986	0.902	0.994	0.880	0.982
	mvGPS	200	0.876	0.956	0.662	0.864	0.760	0.966
		500	0.878	0.988	0.582	0.946	0.718	0.966
		1000	0.864	0.974	0.614	0.976	0.636	0.982
	linear	200	0.944	1	0.948	1	0.944	1
		500	0.964	1	0.952	1	0.946	1
		1000	0.960	1	0.952	1	0.964	1
E1Y2	npmvCBGPS	200	0.930	0.970	0.930	0.936	0.936	0.944
		500	0.978	0.994	0.922	0.996	0.926	0.990
		1000	0.948	0.990	0.884	0.994	0.872	0.994
	mvGPS	200	0.866	0.966	0.638	0.862	0.746	0.970
		500	0.862	0.978	0.606	0.946	0.684	0.982
		1000	0.862	0.982	0.592	0.98	0.620	0.986
	linear	200	0.95	1	0.948	1	0.940	1
		500	0.944	1	0.968	1	0.952	1
		1000	0.966	1	0.952	1	0.964	1

Open in a new tab

Table 4. shows the results of statistical testing under Scenario 2. When only the outcome model was misspecified (E1Y3 and E1Y4), the results for npmvCBGPS and mvGPS were similar to those in Scenario 1. In contrast, the power was slightly reduced, and the 95% CI coverage was significantly decreased for the linear regression model, especially when there was an interaction among three-dimensional exposures (E1Y4). When only the exposure model was misspecified (E2Y1 and E2Y2), results for npmvCBGPS and the linear regression model were similar to those in Scenario 1, while mvGPS had significantly reduced coverage and power. When neither the exposure nor the outcome model was correctly specified (E2Y3 and E2Y4), both the linear regression model and npmvCBGPS were powerful but with significantly reduced 95%CI coverage.

Table 4.

Confidence intervals and power of the causal parameter estimates under Scenario 2

Settings	Method	n	T1(0.6)		T2(−0.5)		T3(0.8)
Settings	Method	n	coverage	power	coverage	power	coverage	power
E1Y3	npmvCBGPS	200	0.954	0.972	0.922	0.932	0.946	0.924
		500	0.970	0.986	0.942	0.970	0.952	0.970
		1000	0.932	0.976	0.878	0.986	0.860	0.956
	mvGPS	200	0.862	0.922	0.676	0.806	0.766	0.950
		500	0.884	0.960	0.650	0.912	0.770	0.962
		1000	0.896	0.952	0.680	0.942	0.766	0.968
	linear	200	0.958	0.998	0.960	0.972	0.918	1
		500	0.950	1	0.946	1	0.968	1
		1000	0.956	1	0.948	1	0.946	1
E1Y4	npmvCBGPS	200	0.940	0.956	0.952	0.934	0.924	0.890
		500	0.972	0.994	0.940	0.984	0.958	0.954
		1000	0.932	0.996	0.872	0.992	0.906	0.966
	mvGPS	200	0.862	0.938	0.694	0.794	0.772	0.936
		500	0.868	0.970	0.644	0.914	0.752	0.976
		1000	0.882	0.968	0.654	0.956	0.724	0.970
	linear	200	0.934	0.996	0.942	0.992	0.808	1
		500	0.938	1	0.944	1	0.690	1
		1000	0.952	1	0.954	1	0.494	1
E2Y1	npmvCBGPS	200	0.988	0.986	0.986	0.984	0.994	0.986
		500	0.982	0.996	0.980	1	1	1
		1000	0.994	1	0.980	1	0.998	0.998
	mvGPS	200	0.790	0.868	0.664	0.734	0.264	0.552
		500	0.674	0.812	0.628	0.748	0.200	0.618
		1000	0.556	0.748	0.550	0.694	0.260	0.662
	linear	200	0.950	1	0.964	1	0.958	1
		500	0.938	1	0.946	1	0.952	1
		1000	0.944	1	0.946	1	0.958	1
E2Y2	npmvCBGPS	200	0.960	0.992	0.988	0.986	0.910	0.982
		500	0.946	0.996	0.980	1	0.856	0.998
		1000	0.930	1	0.960	1	0.746	0.996
	mvGPS	200	0.842	0.806	0.640	0.736	0.468	0.466
		500	0.760	0.830	0.618	0.818	0.216	0.472
		1000	0.648	0.694	0.550	0.776	0.204	0.454
	linear	200	0.954	1	0.964	1	0.946	1
		500	0.952	1	0.920	1	0.960	1
		1000	0.946	1	0.944	1	0.950	1
E2Y3	npmvCBGPS	200	0.882	0.992	0.856	0.948	0.072	1
		500	0.750	1	0.768	0.992	0.002	1
		1000	0.650	1	0.722	1	0.002	1
	mvGPS	200	0.710	0.946	0.540	0.790	0.166	0.994
		500	0.592	0.902	0.518	0.698	0.096	0.990
		1000	0.554	0.888	0.454	0.668	0.166	0.978
	linear	200	0.070	1	0.078	0.836	0.072	1
		500	0.000	1	0.000	0.996	0.000	1
		1000	0.000	1	0.000	1	0.000	1
E2Y4	npmvCBGPS	200	0.916	0.992	0.840	0.966	0.344	1
		500	0.918	1	0.812	0.998	0.064	1
		1000	0.870	1	0.732	1	0.018	1
	mvGPS	200	0.824	0.938	0.508	0.816	0.416	0.966
		500	0.714	0.898	0.482	0.792	0.302	0.964
		1000	0.636	0.818	0.474	0.692	0.348	0.934
	linear	200	0.306	1	0.188	0.910	0.402	1
		500	0.010	1	0.004	1	0.038	1
		1000	0.000	1	0.000	1	0.002	1

Open in a new tab

Estimation under varying correlations among exposures

We repeated the simulations with varying correlations among exposures to explore the impact of collinearity. Figures 4 and 5 display the mean biases and RMSEs of causal parameter estimates at varying exposure correlations for the npmvCBGPS method, respectively, in cases where there was no interaction among exposures under two scenarios. The mean biases were less affected by the correlations among exposures (changes were less than 0.06). The RMSEs were slightly increased with increasing correlations among exposures (changes were less than 0.08). Results were similar when there was an interaction among exposures (Figures S2 and S3). The trend of mean biases and RMSEs with increasing correlations among exposures for the mvGPS method (Figures S4 – S7) and the linear regression model (Figures S8-S11) was similar to that of the npmvCBGPS method.

Fig. 4 — Mean biases of causal parameter estimates at varying exposure correlations for the npmvCBGPS method in cases where there was no interaction among exposures

Fig. 5 — RMSE of causal parameter estimates at varying exposure correlations for the npmvCBGPS method in cases where there was no interaction among exposures

Health effects of PFASs on BMI

We first evaluated the covariate balance performance in the unweighted original and weighted samples. Results showed that covariate balance was improved compared with the original sample for npmvCBGPS (Figure S12), whereas there was a deterioration in balance after weighting for mvGPS (Figure S13). Therefore, we only show the results of npmvCBGPS. Model results that explored the linear association between PFASs and BMI are shown in Fig. 6. Numerical values for the coefficients displayed in this figure are shown in Table S2. There was a positive association between PFNA and BMI, whereas there were negative associations between PFOS, PFHxS with BMI. We used restricted cubic spline analysis to explore nonlinear associations. Figure 7 shows the single-exposure effect, which we defined as the dose-response curve of a single exposure on the outcome when all of the other exposures were fixed at their median values and all of the covariates were held constant [10]. We found nonlinear associations between PFOA, PFOS, PFHxS with BMI. Figure 8 shows the joint-exposure effect, which we defined as the dose-response association of mixed exposures on the outcome when all of the exposures were fixed at a given exposure percentile and the covariates were held constant. We observed an overall inverse trend between the PFAS mixtures with BMI.

Fig. 6 — Forest plot for model coefficients evaluating the linear relationship between PFASs with BMI in the NHANES 2003-18

Fig. 7 — Single exposure-response functions of each PFAS component, where the remaining components are fixed at their median values

Fig. 8 — Joint exposure-response relationships and 95% confidence intervals (CIs) of PFASs on BMI

Discussion

Estimating the causal health effects of several concurrent environmental exposures yields promising results in inferring how interventions can improve public health [8]. To this end, we focused on a causal modeling approach for environmental exposure mixtures. We presented npmvCBGPS, an extension of GPS methods, which are a common statistical tool for estimating the causal effect of a single exposure in observational studies. Using simulations, we conducted an in-depth assessment of the statistical performance of npmvCBGPS and showed that npmvCBGPS provided estimates that were close to the true value as long as either the exposure model or the outcome model was correctly specified; that is, the results empirically demonstrated that npmvCBGPS had properties similar to doubly robust estimators. In contrast, the validity of the mvGPS and the linear regression model relied on the correct specification of the exposure model and the outcome model, respectively. The performance of the npmvCBGPS was less impacted by collinearity among mixture components. In addition, the npmvCBGPS outperformed the existing mvGPS method in terms of accuracy, precision, and statistical testing across all simulated settings.

The proposed npmvCBGPS advances causal modeling in mixture exposure analysis. The quantile g-computation and SL-Gcomp methods extended classical g-computation for multiple simultaneous continuous exposures [7, 24]. The quantile g-computation transformed each exposure into a categorical variable, which resulted in the loss of information, potentially compromising the substantive insights derived from data analysis [7, 23]. The SL-Gcomp method is robust to the outcome model misspecification but may be affected by the choice of tunning parameters and prediction algorithms [25, 26]. The existing mvGPS extended classical GPS methods for multiple simultaneous continuous exposures [20]. The mvGPS fitted a multiple multivariate linear model for exposures and subsequently assumed a multivariate normal distribution for exposures to calculate balancing weights. Consequently, the performance of mvGPS relied on the correct specification of the exposure model and distribution. This explains why the estimates of mvGPS start deviating from the true value when the exposure model was incorrectly specified (E2Y1 and E2Y2). Different from this work, npmvCBGPS estimated directly balancing weights without requiring the specification of the exposure model and distribution. As elaborated in simulation studies, the robustness of our estimator to exposure model misspecification was one of the main advantages over mvGPS. In addition, npmvCBGPS has a formal mechanism to guarantee that covariates will be balanced. This can explain why our estimator exhibits greater accuracy and precision compared to mvGPS. It is worth noting that we used the first-order moments of both the exposures and the covariates to assess covariate balance in this paper. Higher moments may be helpful when there are strong nonlinear correlations between exposures and covariates. However, the choices of moment orders should be made carefully, because using too many moments may lead to unstable weights [33, 39].

A valid inference is important for mixture exposure analysis. Here, we used a robust sandwich-type variance estimator as suggested by Joffe et al. [40]. The corresponding 95% CI coverage was close to the nominal value except when the sample size was 1000. We viewed this as a trade-off between bias and variance because npmvCBGPS focused on removing bias (the correlations between exposures and covariates are 0 after weighting) when estimating balancing weights [41]. How to account for bias and variance simultaneously, as done by Athey et al. when estimating balancing weights, is an area of future research [42].

The PFASs are endocrine-disrupting chemicals that can result in changes in metabolic outcomes such as BMI [22]. Several epidemiological studies have been instrumental in determining the potential health effects of exposure to PFASs on BMI [43–47]. However, the current evidence was inconclusive [37], possibly due to insufficient confounding control or the fact that the majority of these studies have focused on individual chemicals, ignoring coexposures. To this end, we applied npmvCBGPS to estimate the joint causal effects of exposure to PFASs on BMI and found an overall inverse trend of the PFAS mixtures with BMI. The results derived in this study were based on the publicly available datasets. Future large-scale prospective cohort studies are needed to consolidate the validity of these associations.

Conclusions

In summary, this study proposed a robust statistical method, called npmvCBGPS, to estimate causal effects of multiple exposure mixtures on health outcomes for driving public health interventions and policy changes. A simulation study showed that npmvCBGPS outperformed the existing multivariate GPS (mvGPS) method in terms of accuracy, precision, and statistical testing in all scenarios. An appealing advantage of npmvCBGPS over the existing mvGPS method is robustness to model misspecification. This robustness enhances its applicability across various domains, with a particular emphasis on environmental epidemiology.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1^{(7.2KB, r)}

Supplementary Material 2^{(90.8MB, docx)}

Author contributions

T.W. conceived the idea and contributed to the interpretation of the results. T.W. and Q.G. developed the model. Q.G. implemented the software; conducted analysis of simulation and real data with assistance from T.L., G.Z., J.W., and K.Q.; interpreted the results with assistance from G.Z., J.W., L.L., and X.Y.; and drafted and revised the manuscript with input from all other authors. All authors approved the final manuscript.

Funding

This study was supported by the National Natural Science Foundation of China (grant numbers: 82373692, 82204163, 82073674, and 32301421), the Fundamental Research Program of Shanxi Province (grant number: 202203021212382), and Shanxi Province Higher Education “Billion Project” Science and Technology Guidance Project.

Data availability

No datasets were generated or analysed during the current study.

Declarations

Ethics approval and consent to participate

The datasets analyzed in real data applications are available in the National Health and Nutrition Examination Survey database, and the original studies had been approved by the relevant review boards.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Clinical trial number

Not applicable.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Day DB, Sathyanarayana S, LeWinn KZ, Karr CJ, Mason WA, Szpiro AA. A permutation test-based approach to strengthening inference on the effects of environmental mixtures: comparison between single-index analytic methods. Environ Health Perspect. 2022. 10.1289/EHP10570. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Forouzanfar MH, Afshin A, Alexander LT, Anderson HR, Bhutta ZA, Biryukov S, et al. Global, regional, and national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990–2015: a systematic analysis for the global burden of disease study 2015. Lancet. 2016;388(10053):1659–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Vrijheid M. The exposome: a new paradigm to study the impact of environment on health. Thorax. 2014;69(9):876–8. [DOI] [PubMed] [Google Scholar]
4.Rattan SIS, Kaur G. Nutrition, food and diet in health and longevity: we eat what we are. Nutrients. 2022. 10.3390/nu14245376. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Santos S, Maitre L, Warembourg C, Agier L, Richiardi L, Basagana X, Vrijheid M. Applying the exposome concept in birth cohort research: a review of statistical approaches. Eur J Epidemiol. 2020;35(3):193–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Avery CL, Howard AG, Ballou AF, Buchanan VL, Collins JM, Downie CG, Engel SM, Graff M, Highland HM, Lee MP, et al. Strengthening causal inference in exposomics research: application of genetic data and methods. Environ Health Perspect. 2022;130(5):55001. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Keil AP, Buckley JP, O’Brien KM, Ferguson KK, Zhao S, White AJ. A quantile-based g-computation approach to addressing the effects of exposure mixtures. Environ Health Perspect. 2020. 10.1289/EHP5838. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Braun JM, Gennings C, Hauser R, Webster TF. What can epidemiological studies tell us about the impact of chemical mixtures on human health?? Environ Health Perspect. 2016;124(1):A6-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Bind MA. Causal modeling in environmental health. Annu Rev Public Health. 2019;40:23–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Bobb JF, Claus Henn B, Valeri L, Coull BA. Statistical software for analyzing the health effects of multiple concurrent exposures via bayesian kernel machine regression. Environ Health. 2018;17(1):67. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Maitre L, Guimbaud J-B, Warembourg C, Güil-Oumrait N, Petrone PM, Chadeau-Hyam M, et al. State-of-the-art methods for exposure-health studies: results from the exposome data challenge event. Environ Int. 2022. 10.1016/j.envint.2022.107422. [DOI] [PubMed] [Google Scholar]
12.Yu L, Liu W, Wang X, Ye Z, Tan Q, Qiu W, et al. A review of practical statistical methods used in epidemiological studies to estimate the health effects of multi-pollutant mixture. Environ Pollut. 2022. 10.1016/j.envpol.2022.119356. [DOI] [PubMed] [Google Scholar]
13.Zhang X, Zhao L, Ducatman A, Deng C, von Stackelberg KE, Danford CJ, et al. Association of per- and polyfluoroalkyl substance exposure with fatty liver disease risk in US adults. JHEP Rep. 2023. 10.1016/j.jhepr.2023.100694. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Zhang Y, Mustieles V, Wang YX, Sun Y, Agudelo J, Bibi Z, et al. Folate concentrations and serum perfluoroalkyl and polyfluoroalkyl substance concentrations in adolescents and adults in the USA (National health and nutrition examination study 2003-16): an observational study. Lancet Planet Health. 2023;7(6):e449-58. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Bobb JF, Valeri L, Claus Henn B, Christiani DC, Wright RO, Mazumdar M, Godleski JJ, Coull BA. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics. 2015;16(3):493–508. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Carrico C, Gennings C, Wheeler DC, Factor-Litvak P. Characterization of weighted quantile sum regression for highly correlated data in a risk analysis setting. J Agric Biol Environ Stat. 2015;20(1):100–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Traini E, Huss A, Portengen L, Rookus M, Verschuren WMM, Vermeulen RCH, et al. A multipollutant approach to estimating causal effects of air pollution mixtures on overall mortality in a large, prospective cohort. Epidemiology. 2022;33(4):514–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Qin X. An introduction to causal mediation analysis. Asia Pac Educ Rev. 2024;25(3):703–17. [Google Scholar]
19.Luque-Fernandez MA, Schomaker M, Rachet B, Schnitzer ME. Targeted maximum likelihood estimation for a binary treatment: a tutorial. Stat Med. 2018;37(16):2530–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Williams JR, Crespi CM . Causal inference for multiple continuous exposures via the multivariate generalized propensity score. 2020;arXiv:2008.13767. arXiv preprint
21.Thilakaratne R, Lin P-ID, Rifas-Shiman SL, Wright RO, Hubbard A, Hivert M-F, et al. Mixtures of metals and micronutrients in early pregnancy and cognition in early and mid-childhood: findings from the project Viva cohort. Environ Health Perspect. 2023. 10.1289/EHP12016. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Cai A, Remy S, Lenters V, Cox B, Schoeters G, Covaci A, et al. Exposure to a mixture of Endocrine-Disrupting chemicals and metabolic outcomes in Belgian adolescents. Environ Sci Technol. 2023;57(48):19871–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Fong C, Hazlett C, Imai K. Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements. Annals Appl Stat. 2018;12(1):156-77 .
24.Oulhote Y, Coull B, Bind MA, Debes F, Nielsen F, Tamayo I, et al. Joint and independent neurotoxic effects of early life exposures to a chemical mixture: a multi-pollutant approach combining ensemble learning and g-computation. Environ Epidemiol. 2019. 10.1097/EE9.0000000000000063. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Phillips RV, van der Laan MJ, Lee H, Gruber S. Practical considerations for specifying a super learner. Int J Epidemiol. 2023;52(4):1276–85. [DOI] [PubMed] [Google Scholar]
26.Naimi AI, Balzer LB. Stacked generalization: an introduction to super learning. Eur J Epidemiol. 2018;33(5):459–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. [Google Scholar]
28.Austin PC. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar Behav Res. 2011;46(3):399–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Gao Q, Zhang Y, Liang J, Sun H, Wang T. High-dimensional generalized propensity score with application to omics data. Brief Bioinform. 2021. 10.1093/bib/bbab331. [DOI] [PubMed] [Google Scholar]
30.Heinze G, Juni P. An overview of the objectives of and the approaches to propensity score analyses. Eur Heart J. 2011;32(14):1704–8. [DOI] [PubMed] [Google Scholar]
31.Imbens Guido W. The role of the propensity score in estimating dose-response functions. Biometrika. 2000. 10.1093/biomet/87.3.706. [Google Scholar]
32.Xu S, Ross C, Raebel MA, Shetterly S, Blanchette C, Smith D. Use of stabilized inverse propensity scores as weights to directly estimate relative risk and its confidence intervals. Value Health. 2010;13(2):273–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Huling JD, Greifer N, Chen G. Independence weights for causal inference with continuous treatments. J Am Stat Assoc. 2023;119(546):1657-70 .
34.Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163(12):1149–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Cepeda MS, Boston R, Farrar JT, Strom BL. Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. Am J Epidemiol. 2003;158(3):280–7. [DOI] [PubMed] [Google Scholar]
36.Zhu Y, Coffman DL, Ghosh D. A boosting algorithm for estimating generalized propensity scores with continuous treatments. J Causal Inference. 2015;3(1):25–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Qi W, Clark JM, Timme-Laragy AR, Park Y. Per- and polyfluoroalkyl substances and obesity, type 2 diabetes and non-alcoholic fatty liver disease: a review of epidemiologic findings. Toxicol Environ Chem. 2020;102(1–4):1–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.US Centers for Disease Control and Prevention. National Health and Nutrition Examination Survey. https://www.cdc.gov/nchs/nhanes/about/index.html. Accessed 23 Oct 2023.
39.Vegetabile BG, Griffin Ann B, Coffman DL, Cefalu M, Robbins MW, McCaffrey DF. Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Serv Outcomes Res Methodol. 2021;21(1):69–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Joffe MM, Ten Have TR, Feldman HI, Kimmel SE. Model selection, confounder control, and marginal structural models. Am Stat. 2004;58(4):272–9. [Google Scholar]
41.Gao Q, Zhang Y, Sun H, Wang T. Evaluation of propensity score methods for causal inference with high-dimensional covariates. Brief Bioinform. 2022. 10.1093/bib/bbac227. [DOI] [PubMed] [Google Scholar]
42.Athey S, Imbens GW, Wager S. Approximate residual balancing: debiased inference of average treatment effects in high dimensions. J R Stat Soc Series B Stat Methodol. 2018;80(4):597–623. [Google Scholar]
43.Liu G, Dhana K, Furtado JD, Rood J, Zong G, Liang L, et al. Perfluoroalkyl substances and changes in body weight and resting metabolic rate in response to weight-loss diets: a prospective study. PLoS Med. 2018;15(2):e1002502. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Yang Q, Guo X, Sun P, Chen Y, Zhang W, Gao A. Association of serum levels of perfluoroalkyl substances (PFASs) with the metabolic syndrome (MetS) in Chinese male adults: a cross-sectional study. Sci Total Environ. 2018;621:1542–9. [DOI] [PubMed] [Google Scholar]
45.Christensen KY, Raymond M, Meiman J. Perfluoroalkyl substances and metabolic syndrome. Int J Hyg Environ Health. 2019;222(1):147–53. [DOI] [PubMed] [Google Scholar]
46.Blake BE, Pinney SM, Hines EP, Fenton SE, Ferguson KK. Associations between longitudinal serum perfluoroalkyl substance (PFAS) levels and measures of thyroid hormone, kidney function, and body mass index in the Fernald Community Cohort. Environmental pollution (Barking, Essex: 1987). 2018;242(Pt A):894–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Nelson JW, Hatch EE, Webster TF. Exposure to polyfluoroalkyl chemicals and cholesterol, body weight, and insulin resistance in the general U.S. population. Environ Health Perspect. 2010;118(2):197–202. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1^{(7.2KB, r)}

Supplementary Material 2^{(90.8MB, docx)}

Data Availability Statement

No datasets were generated or analysed during the current study.

[CR1] 1.Day DB, Sathyanarayana S, LeWinn KZ, Karr CJ, Mason WA, Szpiro AA. A permutation test-based approach to strengthening inference on the effects of environmental mixtures: comparison between single-index analytic methods. Environ Health Perspect. 2022. 10.1289/EHP10570. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Forouzanfar MH, Afshin A, Alexander LT, Anderson HR, Bhutta ZA, Biryukov S, et al. Global, regional, and national comparative risk assessment of 79 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990–2015: a systematic analysis for the global burden of disease study 2015. Lancet. 2016;388(10053):1659–724. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Vrijheid M. The exposome: a new paradigm to study the impact of environment on health. Thorax. 2014;69(9):876–8. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Rattan SIS, Kaur G. Nutrition, food and diet in health and longevity: we eat what we are. Nutrients. 2022. 10.3390/nu14245376. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Santos S, Maitre L, Warembourg C, Agier L, Richiardi L, Basagana X, Vrijheid M. Applying the exposome concept in birth cohort research: a review of statistical approaches. Eur J Epidemiol. 2020;35(3):193–204. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Avery CL, Howard AG, Ballou AF, Buchanan VL, Collins JM, Downie CG, Engel SM, Graff M, Highland HM, Lee MP, et al. Strengthening causal inference in exposomics research: application of genetic data and methods. Environ Health Perspect. 2022;130(5):55001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Keil AP, Buckley JP, O’Brien KM, Ferguson KK, Zhao S, White AJ. A quantile-based g-computation approach to addressing the effects of exposure mixtures. Environ Health Perspect. 2020. 10.1289/EHP5838. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Braun JM, Gennings C, Hauser R, Webster TF. What can epidemiological studies tell us about the impact of chemical mixtures on human health?? Environ Health Perspect. 2016;124(1):A6-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Bind MA. Causal modeling in environmental health. Annu Rev Public Health. 2019;40:23–43. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Bobb JF, Claus Henn B, Valeri L, Coull BA. Statistical software for analyzing the health effects of multiple concurrent exposures via bayesian kernel machine regression. Environ Health. 2018;17(1):67. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Maitre L, Guimbaud J-B, Warembourg C, Güil-Oumrait N, Petrone PM, Chadeau-Hyam M, et al. State-of-the-art methods for exposure-health studies: results from the exposome data challenge event. Environ Int. 2022. 10.1016/j.envint.2022.107422. [DOI] [PubMed] [Google Scholar]

[CR12] 12.Yu L, Liu W, Wang X, Ye Z, Tan Q, Qiu W, et al. A review of practical statistical methods used in epidemiological studies to estimate the health effects of multi-pollutant mixture. Environ Pollut. 2022. 10.1016/j.envpol.2022.119356. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Zhang X, Zhao L, Ducatman A, Deng C, von Stackelberg KE, Danford CJ, et al. Association of per- and polyfluoroalkyl substance exposure with fatty liver disease risk in US adults. JHEP Rep. 2023. 10.1016/j.jhepr.2023.100694. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Zhang Y, Mustieles V, Wang YX, Sun Y, Agudelo J, Bibi Z, et al. Folate concentrations and serum perfluoroalkyl and polyfluoroalkyl substance concentrations in adolescents and adults in the USA (National health and nutrition examination study 2003-16): an observational study. Lancet Planet Health. 2023;7(6):e449-58. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Bobb JF, Valeri L, Claus Henn B, Christiani DC, Wright RO, Mazumdar M, Godleski JJ, Coull BA. Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics. 2015;16(3):493–508. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Carrico C, Gennings C, Wheeler DC, Factor-Litvak P. Characterization of weighted quantile sum regression for highly correlated data in a risk analysis setting. J Agric Biol Environ Stat. 2015;20(1):100–20. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Traini E, Huss A, Portengen L, Rookus M, Verschuren WMM, Vermeulen RCH, et al. A multipollutant approach to estimating causal effects of air pollution mixtures on overall mortality in a large, prospective cohort. Epidemiology. 2022;33(4):514–22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Qin X. An introduction to causal mediation analysis. Asia Pac Educ Rev. 2024;25(3):703–17. [Google Scholar]

[CR19] 19.Luque-Fernandez MA, Schomaker M, Rachet B, Schnitzer ME. Targeted maximum likelihood estimation for a binary treatment: a tutorial. Stat Med. 2018;37(16):2530–46. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Williams JR, Crespi CM . Causal inference for multiple continuous exposures via the multivariate generalized propensity score. 2020;arXiv:2008.13767. arXiv preprint

[CR21] 21.Thilakaratne R, Lin P-ID, Rifas-Shiman SL, Wright RO, Hubbard A, Hivert M-F, et al. Mixtures of metals and micronutrients in early pregnancy and cognition in early and mid-childhood: findings from the project Viva cohort. Environ Health Perspect. 2023. 10.1289/EHP12016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Cai A, Remy S, Lenters V, Cox B, Schoeters G, Covaci A, et al. Exposure to a mixture of Endocrine-Disrupting chemicals and metabolic outcomes in Belgian adolescents. Environ Sci Technol. 2023;57(48):19871–80. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Fong C, Hazlett C, Imai K. Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements. Annals Appl Stat. 2018;12(1):156-77 .

[CR24] 24.Oulhote Y, Coull B, Bind MA, Debes F, Nielsen F, Tamayo I, et al. Joint and independent neurotoxic effects of early life exposures to a chemical mixture: a multi-pollutant approach combining ensemble learning and g-computation. Environ Epidemiol. 2019. 10.1097/EE9.0000000000000063. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Phillips RV, van der Laan MJ, Lee H, Gruber S. Practical considerations for specifying a super learner. Int J Epidemiol. 2023;52(4):1276–85. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Naimi AI, Balzer LB. Stacked generalization: an introduction to super learning. Eur J Epidemiol. 2018;33(5):459–64. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. [Google Scholar]

[CR28] 28.Austin PC. An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar Behav Res. 2011;46(3):399–424. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Gao Q, Zhang Y, Liang J, Sun H, Wang T. High-dimensional generalized propensity score with application to omics data. Brief Bioinform. 2021. 10.1093/bib/bbab331. [DOI] [PubMed] [Google Scholar]

[CR30] 30.Heinze G, Juni P. An overview of the objectives of and the approaches to propensity score analyses. Eur Heart J. 2011;32(14):1704–8. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Imbens Guido W. The role of the propensity score in estimating dose-response functions. Biometrika. 2000. 10.1093/biomet/87.3.706. [Google Scholar]

[CR32] 32.Xu S, Ross C, Raebel MA, Shetterly S, Blanchette C, Smith D. Use of stabilized inverse propensity scores as weights to directly estimate relative risk and its confidence intervals. Value Health. 2010;13(2):273–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Huling JD, Greifer N, Chen G. Independence weights for causal inference with continuous treatments. J Am Stat Assoc. 2023;119(546):1657-70 .

[CR34] 34.Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163(12):1149–56. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Cepeda MS, Boston R, Farrar JT, Strom BL. Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. Am J Epidemiol. 2003;158(3):280–7. [DOI] [PubMed] [Google Scholar]

[CR36] 36.Zhu Y, Coffman DL, Ghosh D. A boosting algorithm for estimating generalized propensity scores with continuous treatments. J Causal Inference. 2015;3(1):25–40. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] 37.Qi W, Clark JM, Timme-Laragy AR, Park Y. Per- and polyfluoroalkyl substances and obesity, type 2 diabetes and non-alcoholic fatty liver disease: a review of epidemiologic findings. Toxicol Environ Chem. 2020;102(1–4):1–36. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.US Centers for Disease Control and Prevention. National Health and Nutrition Examination Survey. https://www.cdc.gov/nchs/nhanes/about/index.html. Accessed 23 Oct 2023.

[CR39] 39.Vegetabile BG, Griffin Ann B, Coffman DL, Cefalu M, Robbins MW, McCaffrey DF. Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Serv Outcomes Res Methodol. 2021;21(1):69–110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Joffe MM, Ten Have TR, Feldman HI, Kimmel SE. Model selection, confounder control, and marginal structural models. Am Stat. 2004;58(4):272–9. [Google Scholar]

[CR41] 41.Gao Q, Zhang Y, Sun H, Wang T. Evaluation of propensity score methods for causal inference with high-dimensional covariates. Brief Bioinform. 2022. 10.1093/bib/bbac227. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Athey S, Imbens GW, Wager S. Approximate residual balancing: debiased inference of average treatment effects in high dimensions. J R Stat Soc Series B Stat Methodol. 2018;80(4):597–623. [Google Scholar]

[CR43] 43.Liu G, Dhana K, Furtado JD, Rood J, Zong G, Liang L, et al. Perfluoroalkyl substances and changes in body weight and resting metabolic rate in response to weight-loss diets: a prospective study. PLoS Med. 2018;15(2):e1002502. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Yang Q, Guo X, Sun P, Chen Y, Zhang W, Gao A. Association of serum levels of perfluoroalkyl substances (PFASs) with the metabolic syndrome (MetS) in Chinese male adults: a cross-sectional study. Sci Total Environ. 2018;621:1542–9. [DOI] [PubMed] [Google Scholar]

[CR45] 45.Christensen KY, Raymond M, Meiman J. Perfluoroalkyl substances and metabolic syndrome. Int J Hyg Environ Health. 2019;222(1):147–53. [DOI] [PubMed] [Google Scholar]

[CR46] 46.Blake BE, Pinney SM, Hines EP, Fenton SE, Ferguson KK. Associations between longitudinal serum perfluoroalkyl substance (PFAS) levels and measures of thyroid hormone, kidney function, and body mass index in the Fernald Community Cohort. Environmental pollution (Barking, Essex: 1987). 2018;242(Pt A):894–904. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.Nelson JW, Hatch EE, Webster TF. Exposure to polyfluoroalkyl chemicals and cholesterol, body weight, and insulin resistance in the general U.S. population. Environ Health Perspect. 2010;118(2):197–202. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Estimating the causal effects of exposure mixtures: a generalized propensity score method

Qian Gao

Ting Li

Guiming Zhu

Juping Wang

Kexin Qiu

Liangpo Liu

Xiujuan Yang

Tong Wang

Abstract

Background

Methods

Results

Conclusions

Supplementary Information

Background

Methods

Overview of the inverse probability of weighting method

Inverse probability of weighting using propensity score

Inverse probability of weighting using generalized propensity score

NpmvCBGPS

Simulation studies

Simulation setup

Fig. 1.

Scenario 1

Scenario 2

Per- and polyfluoroalkyl substances and BMI

Results

Simulation results

Covariate balance and Estimation under scenario 1 and scenario 2

Fig. 2.

Table 1.

Fig. 3.

Table 2.

Statistical testing under scenario 1 and scenario 2

Table 3.

Table 4.

Estimation under varying correlations among exposures

Fig. 4.

Fig. 5.

Health effects of PFASs on BMI

Fig. 6.

Fig. 7.

Fig. 8.

Discussion

Conclusions

Supplementary Information

Author contributions

Funding

Data availability

Declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Clinical trial number

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases