Abstract
Background:
Studies of the effects of prenatal environmental exposures on postnatal outcomes are particularly vulnerable to live birth bias; i.e., the bias that arises from the necessary restriction of the analysis to live births when that is influenced by both the exposure under study and unmeasured factors that also affect the outcome.
Objectives:
In the context of a recent publication of nitrogen dioxide () and autism spectrum disorder (ASD) that found an odds ratio (OR) of 0.77 per during pregnancy, we aimed to examine what parameters would be needed to account for this protective association through live birth bias.
Methods:
We simulated the magnitude of bias under two selection mechanisms and when both mechanisms co-occur, assuming a true null effect. Simulation input parameters were based on characteristics of the original study and a range of plausible values for the prevalence of unmeasured factor and the ORs for the selection effects (i.e., the effects of and on loss and of on ASD). Each scenario was simulated 1,000 times.
Results:
We found that the magnitude of bias was small when and independently influenced pregnancy loss (collider-stratification without interaction), was stronger when loss preferentially occurred in (depletion of susceptibles), and was strongest when both mechanisms worked together. For example, ORs of 3.0 for , -loss, -ASD, and U yielded ORs per of 0.95, 0.89, and 0.75 for the three scenarios, respectively. The bias is amplified with multiple , yielding ORs as low as 0.51.
Discussion:
Our simulations illustrate that live birth bias may lead to exposure–outcome associations that are biased downward, where the extent of the bias depends on the fetal selection mechanism, the strength of that selection, and the prevalence of U. https://doi.org/10.1289/EHP7961
Introduction
Understanding the health effects of environmental exposures is critical for identifying and developing preventive interventions for high-risk populations. However, these effects may not be identifiable if exposure prevents selection into the study and thus, observation of the outcome of interest. This process is a form of ‘left truncation’ and can lead to estimates of exposure-outcome associations that are biased (Howards et al. 2007; Lisonkova and Joseph 2015). Epidemiological studies of environmental exposures are particularly vulnerable to left truncation as exposures are typically ubiquitous in time (e.g., participants are already exposed prior to study initiation), such that exposure-induced selection processes likely inform the formation of observational cohorts from which exposure-health effects are estimated.
In studies investigating the effects of prenatal exposures on outcomes in live-born children, left-truncation can induce a specific type of selection bias called live-birth bias (Liew et al. 2015a; Raz et al. 2018a). These studies are typically based on cohorts formed by only live births, where selective survival between conception and birth can skew the distribution of prenatal exposures in the subset available for analysis (i.e., those conceptions that resulted in a live birth) from the exposure distribution among all conceptions, such that the estimated parameter in the analyzed subset differs from the parameter in the total population (i.e., all conceptions).
An example of possible live-birth bias is a recent analysis of traffic-related nitrogen dioxide () and autism spectrum disorder (ASD), where the odds ratio (OR) was 0.77 per (ppb) increase in during pregnancy when mutually adjusted for postnatal exposure to (Raz et al. 2018b); that is, prenatal exposure to appeared to be protective against ASD. This paradoxical finding is unlikely to be causal because we are not aware of a possible biological mechanism for which may confer beneficial effects on the risk of ASD, or for human health in general for that matter. It is more likely that this strong protective association could be attributed to live-birth bias.
It has been suggested that there are two selection mechanisms that can lead to live-birth bias (Liew et al. 2015a; Raz et al. 2018a). Although parameterized differently, both mechanisms can be envisioned as forms of collider-stratification bias (Hernan and Robins 2020) and can be represented by the directed acyclic graph (DAG) in Figure 1, which is the same structure as the birth weight paradox (Hernández-Díaz et al. 2006). We will refer to these mechanisms here as “collider-stratification without interaction” and “depletion of susceptibles.” In “collider-stratification without interaction,” exposure and some unmeasured factor —for example, exposure to endocrine disrupting chemicals that have been associated with pregnancy loss and autism (Jensen et al. 2015; Kalkbrenner et al. 2014; Krieg et al. 2016; Pelch et al. 2019)—are independent, and each affect selection (). In “depletion of susceptibles,” which is separate but related to the first mechanism, exposure and unmeasured variable do not have independent causal effects on fetal loss, but rather loss is dependent on the joint effects of and . A potential example of this mechanism is a gene–environment interaction, whereby exposure-induced loss preferentially occurs in those who have the genetic factor (i.e., the subset of fetuses susceptible to ASD). Last, both mechanisms described above can also work in tandem because they operate through distinct mechanistic pathways, in that and not only causally interact to affect fetal loss but also have independent causal effects on fetal loss. It is important to note that although the three mechanisms described above are parameterized differently, they are indistinguishable on a DAG because DAGs are nonparametric and thus cannot encode biases that depend on the specific parameterization of the effect. That is, they all represent the same causal structure (but are parameterized differently) where restricting the analysis to live births (i.e., conditioning on collider ) induces a spurious association between and , which results in a biased association (Figure 1).
Figure 1.
DAG of the structure of live birth bias. Nitrogen dioxide () exposure affects live births is also affected by an independent unmeasured risk factor for ASD (outcome ). Arrows are direct causal effects, and the dashed line is a spurious association induced between and after selection on live births (i.e., conditioning on ). This DAG has the same structure as the birthweight paradox (Hernández-Díaz et al. 2006). Note: ASD, autism spectrum disorder; DAG, directed acyclic graph.
Unlike other examples of selection bias, such as the birth weight paradox (Hernández-Díaz et al. 2006), obesity paradox (Glymour and Vittinghoff 2014; Lajous et al. 2015; Sperrin et al. 2016), or loss to follow-up in cohort studies (Howe et al. 2016), live-birth bias is less amenable to addressing analytically because we cannot adjust for selection processes that we cannot observe (i.e., the necessary data to mitigate this bias are often not available). Thus, simulations are an invaluable tool for exploring the influence of live-birth bias on the estimation of the effects of exposure during pregnancy on outcomes in live-born children. Motivated by the findings of Raz et al. (2018b), we examine through simulations the magnitude of bias that would result from analyses under the two hypothetical selection mechanisms as well as when they operate simultaneously.
Methods
Data-Generating Process
To examine bias from selection on live births under a true null effect of on ASD, we simulated a pregnancy cohort of 100,000 conceptions, which we will refer to as the “total population,” with data on entire-pregnancy exposure , an unmeasured factor , the ASD outcome , and selection indicator (Figure 1). Entire-pregnancy was normally distributed with mean 16.7 and standard deviation of 4.3 to reflect the distribution of found in the original study (Raz et al. 2018b). For simplicity, we ignored the seasonal nature of the exposure, and though we treated the exposure as Gaussian, the same principles would apply for a binary exposure. Unmeasured variable and outcome were binary variables. The prevalence of () was set to be 0.25, 0.50, or 0.75. The baseline odds of were set to be 0.015 to reflect the low incidence of ASD in the original analysis (Raz et al. 2018b), and the baseline odds of fetal loss were set to be 0.05, such that the causal effects that lead to selection bias [i.e., , , (i.e., the effect of the interaction), and ], which we will henceforth refer to as “selection effects,” lead to an overall loss in line with observed estimates (Wilcox et al. 1988). All selection effects were modeled in terms of ORs, so that simulated probabilities were correctly bounded between 0 and 1; and for the , , , and associations (, , , , respectively) that were not 1.0 as determined by the selection mechanism (see “Selection Mechanisms” section below) were set to all be the same and equal to 1.5, 2.0, 2.5, or 3.0 (for , this is per increase in , the interquartile range in the original study). Here, we only considered effects of the same sign because exposures that are harmful for pregnancy loss are most likely also harmful for ASD (beneficial exposures would function in the same manner, in that what is beneficial for loss is also beneficial for ASD, whereas those of opposite signs that we considered less plausible would lead to upwardly biased and associations among live births). For simplicity of displaying, we will refer to the selection effects ORs as henceforth. The probability of loss and the ASD outcome for each fetus were estimated using the equations below. Equation 1 represents the probability that the pregnancy will result in a fetal loss given and . Equation 2 represents the probability of the outcome given , where is omitted because our simulations were conducted under the null; that is, there is no causal effect of on .
(1) |
(2) |
Selection Mechanisms
To examine bias from collider-stratification with no interaction [Mechanism 1 (M1)], where both and have independent causal effects on fetal loss, selection effects were set to the specified above, except that was set to 1. For depletion of susceptibles [Mechanism 2 (M2)], and were set to be 1, whereas was set equal to the prespecified ; that is, and do not have independent causal effects on fetal loss and loss due to could only occur in the subset of fetuses who were exposed to . Finally, to examine bias from both mechanisms operating simultaneously (Both Mechanisms []), where both and have independent causal effects on fetal loss and they causally interact on selection, , and were set to the specified . For all mechanisms, was set to the prespecified selection effect , such that the extent of the bias is driven by the differing parameterizations of the relations between , , and across the three selection mechanisms (and not the relationship, which is fixed to be constant for each scenario). To focus only on the bias induced by the selection effects, all simulations assumed that there was no confounding for the effect of , loss to follow-up among live-born children, outcome misclassification, or exposure misclassification, such that observed associations can only be explained by live-birth bias.
Analysis
Each scenario was simulated 1,000 times. For each simulated data set, we first restricted our analytic sample to live births (i.e., ) and then performed a logistic regression of ASD status in children with exposure to obtain the observed odds ratio (per ), which approximates the risk ratio because the outcome is rare. With the distribution of point estimates generated over the 1,000 iterations for each scenario, we computed the mean and percentile-based 95% simulation intervals (SIs), which are the 2.5th and 97.5th percentiles of the distribution. Because the simulated truth is that there is no causal effect, the value of demonstrates the bias ratio, where greater departures from 1 indicate larger magnitudes of bias. Furthermore, SIs demonstrate the range of estimates that are consistent with the data generating mechanism for the specified sample size; for example, if the 95% SIs generated by a given selection mechanism included the OR of 0.77 found in the original study (Raz et al. 2018b), it would suggest that this observed protective association would be consistent with live-birth bias induced by that mechanism.
To better understand the drivers of bias from the different selection mechanisms, we also estimated the OR for the association between ( exposure, per ) and in the selected population () using a logistic regression, the prevalence of in the selected population (), and their respective 95% SIs. Because both parameters determine the strength of live-birth bias and are driven by the simulation inputs and , we will henceforth refer to both parameters as “bias parameters.” The simulation input is expected to be 1 in the total population of all conceptions, but the parameter is expected to be below 1 in the selected population (i.e., fetuses that survived) because those exposed to both high air pollution and are strongly selected against, because both factors increase the likelihood of loss. Thus, those exposed to high air pollution in the selected population are less likely to be exposed to (and vice versa) setting up an inverse association between and . Furthermore, the difference between and indicates the extent to which the and determine the prevalence of in those selected; that is, the expected value of is in the absence of bias.
Finally, to examine the extent of the bias that would occur if there were multiple involved in the fetal selection process, we also estimated the value of and its 95% SI for each scenario in the presence of two and then three , where all were set to have the same prevalence and effect on selection. All simulations and analyses were performed in R (version 3.6.1; R Development Core Team). See simulation code and documentation at https://github.com/mleung-harvard/live-birth-bias-simulation and in the Supplemental Material, to explore the extent of potential biases with any selected parameters.
Results
The results of this simulation study on the bias in average are shown in Figure 2 and in Table S1. In the presence of collider-stratification with no interaction (M1), where both exposure and unmeasured variable have independent causal effects on fetal loss and therefore selection (i.e., and were set to the prespecified selection effect, but was set to 1), the bias was generally weak (Figure 2; Table S1). Selection effects of magnitudes 1.5 and 2.0 generated little to no bias on average across the three values of , with ranging from 0.99 to 1. Only when the selection effects reached 3.0 did we see larger departures from the null (e.g., of 0.94 for of 0.5), but these were still relatively weak, such that the 95% SI (i.e., the distribution of point estimates consistent with this mechanism) still included the null (Table S1). Examining the bias parameters and ,we observed that stronger selection effects in the total population yielded a lower (i.e., a stronger inverse association between and ), but a smaller in the selected population (Figure 3; Table S2); that is, with stronger selection effects, both parameters deviate further from the underlying population parameter, where would be 1, and would be equal to in the absence of bias.
Figure 2.
Live birth bias of under different selection effects. Average odds ratios for the association between nitrogen dioxide (; exposure ) and ASD (outcome ) among live births () with varying simulation inputs for the prevalence of the unmeasured risk factor () and the magnitude of selection effects () under two selection mechanisms (collider-stratification without interaction, and depletion of susceptibles) and when they both co-occur with a single , assuming a true null effect of on ASD. Collider-stratification without interaction (Mechanism 1) occurs when and have independent causal effects on fetal loss, but with no interaction on the multiplicative scale (, and ). Depletion of susceptibles (Mechanism 2) occurs when has a causal effect on fetal loss only in the subset of susceptible fetuses (), but neither or have independent causal effects on fetal loss (, and ). Both mechanisms occur when and have independent causal effects on fetal loss, and with interaction on the multiplicative scale (). Each scenario was simulated 1,000 times. Points represent the mean in each scenario. Dashed lines indicate the true null effect of on ASD () in the absence of live birth bias, where deviations from 1.0 quantify the magnitude of live birth bias. See Table S1 for corresponding numeric data, including 95% SI. Note: ASD, autism spectrum disorder; OR, odds ratio; SI, simulation intervals.
Figure 3.
Bias parameters that drive live birth bias of under different selection effects. Average bias parameters in the selected population with varying simulation inputs for the prevalence of the unmeasured risk factor () and the magnitude of selection effects () under two selection mechanisms and when they both co-occur with a single , assuming a true null effect of nitrogen dioxide (; exposure ) on ASD (outcome ). In the selected population (live births), is the association between and , and is the prevalence of . Collider-stratification without interaction (Mechanism 1) occurs when and have independent causal effects on fetal loss, but with no interaction on the multiplicative scale (, and ). Depletion of susceptibles (Mechanism 2) occurs when has a causal effect on fetal loss only in the subset of susceptible fetuses (), but neither or have independent causal effects on fetal loss (, and ). Both mechanisms occur when and have independent causal effects on fetal loss, and with interaction on the multiplicative scale (). Each scenario was simulated 1000 times. Points represent the mean value of the bias parameter in each scenario. Dashed lines indicate the expected values (in the absence of live birth bias) for (), and (), which are the parameters in the selected population that drive the strength of live birth bias of . See Table S2 for corresponding numeric data, including 95% SI. Note: ASD, autism spectrum disorder; OR, odds ratio; SI, simulation intervals.
For depletion of susceptibles (M2), where fetal loss is solely dependent on the interaction between exposure and unmeasured variable (i.e., and were set to 1, but was set to the prespecified selection effect), the magnitude of bias was slightly stronger compared with those generated by M1 (Figure 2; Table S1). Unlike with M1, with M2, was consistently low, such that several 95% SI did not include the null (Table S1); for example, if the selection effects were 3.0 and 25% of the total population were exposed to , then the observed OR for the association would be 0.91 (95% SI: 0.85, 0.97). When selection parameters were relatively weak (i.e., of 1.5 and 2), corresponding values of (for the same ) deviated further from the null under M2 in comparison with M1 (Figure 3; Table S2). For example, when the selection effect was set to 1.5 (and both and ) and , was 0.94 for depletion of susceptibles in compairson with 0.98 for M1.
When both mechanisms worked together (), where both and had independent causal effects and causally interacted on fetal loss (i.e.,, , and were set to the prespecified selection effects), the magnitude of bias was usually strongest (Figure 2; Table S1). For example, if the selection effects were 3.0, and 50% of the total population were exposed to (), then the average OR for ASD by among live births () would be 0.85 (95% SI: 0.74, 0.97) (Table S1). Even if we only changed the selection effects to 2.0, the observed OR was 0.90 (95% SI: 0.82, 0.99) (Table S1). Examining the bias parameters, both and deviated further from their corresponding population parameters under , in comparison with both M1 and M2 (Figure 3; Table S2). For example, when the prevalence of was 0.75, and the selection effect were set to 3 (, , ), ,and for in comparison with and for M1 (, , ), , and for M2 (, , ).
In the presence of multiple , the bias is amplified with increasing number of , but the extent of the amplification differs by selection mechanism as shown in Figure 4 and in Table S3. For M1, the increase in bias is small overall, yielding small to moderate associations even in the presence of three . For example, when both and for three s were set to 1.5, the resulting were 0.99 for all values of , compared with of 1 for all values of when only one was simulated. When both and for three were set to 3.0, the resulting ranged from 0.87 to 0.90 across values of , compared with of 0.94–0.95, when only one was simulated. The amplification of bias with additional parameters was stronger for M2 and strongest when both mechanisms co-occurred (). For example, when for three interactions were set to 1.5 under M2 ( and set to 1), ranged from 0.94 to 0.97 in comparison with 1.0 for all values of , when only one was simulated. When , , and were set to 1.5 under mechanism with three s, ranged from 0.88 to 0.95, compared with 0.98 for all values of when only one was simulated. Bias increased under both mechanisms because the selection effects increased in magnitude, and the prevalence of was high. The most extreme bias occurred when there were three s under , was set to 3.0, and was set to 0.75, resulting in an average of 0.51 (95% SI: 0.34, 0.73) for the association when the population was restricted to live births.
Figure 4.
Live birth bias of under different selection effects and different numbers of unmeasured risk factors for selection and the outcome. Average odds ratios for the association between nitrogen dioxide (; exposure ) and ASD (outcome ) among live births () with varying simulation inputs for the prevalence of the unmeasured risk factor () and the magnitude of selection effects () under two selection mechanisms and when they both co-occur with one, two or three s, assuming a true null effect of on ASD. is a vector that consists of unmeasured factors (, ), where input parameters were applied equally for each unmeasured factor; thus, all references to henceforth applies to each of the unmeasured factors , . Collider-stratification without interaction (Mechanism 1) occurs when and have independent causal effects on fetal loss, but with no interaction on the multiplicative scale (, and ); Depletion of susceptibles (Mechanism 2) occurs when has a causal effect on fetal loss only in the subset of susceptible fetuses (), but neither or have independent causal effects on fetal loss (, and ); Both mechanisms occur when and have independent causal effects on fetal loss, and with interaction on the multiplicative scale (). Each scenario was simulated 1000 times. Points represent the mean in each scenario. Dashed lines indicate the true null effect of on ASD () in the absence of live birth bias, where deviations from 1.0 quantify the magnitude of live birth bias. See Table S3 for corresponding numeric data, including 95% SI. Note: ASD, autism spectrum disorder; OR, odds ratio; SI, simulation intervals.
Discussion
In our simulations, we found that the magnitude of bias was generally weak for collider-stratification without interaction (M1), which is consistent not only with previous research on live-birth bias (Liew et al. 2015a) but also with existing literature that has focused on collider-stratification in other observational settings, such as the birth weight paradox (Hernández-Díaz et al. 2006), obesity paradox (Glymour and Vittinghoff 2014; Sperrin et al. 2016), and selection into genetic studies (Munafò et al. 2018), where much stronger and perhaps implausible effects are required to induce substantial bias (Greenland 2003; Smith and Vanderweele 2019). Thus, it is unlikely that collider-stratification without interaction alone can account for the observed association between pregnancy-wide exposure and ASD in the original study (Raz et al. 2018b), because few simulated scenarios yielded ORs that were close to 0.77, the OR for during pregnancy when mutually adjusted for postnatal exposure to (Raz et al. 2018b). Although other factors could have been at play as well, depletion of susceptibles (M2) or both mechanisms operating simultaneously () would be more likely to be able to account for the observed association in the original study all on their own, because the simulations under these scenarios consistently generated moderate to strong protective associations due to more extreme selection bias parameters among live births—particularly when multiple s were present.
To better understand the differences between the two selection mechanisms with just a single unmeasured variable , we also estimated the association () and prevalence of () in the selected populations, because the magnitude of bias () is constrained by these two parameters, and the effect of on ASD (). However, because was the same across mechanisms in our simulations, any discrepancies in bias are driven by both and . Here, M1 generally yielded weaker but similar in comaprison with M2, which explains why M1 produces a weaker bias compared with M2. When both mechanisms co-occur (), is lower (i.e., and are more strongly negatively associated) than either M1 or M2; this is unsurprising given that the effects of and on selection are “super-additive” under , in that the contributions of and together exceed the sum of their contributions when and are considered separately, as shown in Equation 1. Furthermore, is also lower under , however, this does not necessarily generate more bias because, similar to the magnitude of bias due to confounding (Walker 1991), bias is maximized when ( that is close to 0 or 1 actually reduces bias, because this is akin to conditioning on or stratifying by variable ). Thus, when , there exists a tension between the strength of and the distance of from 0.50, because stronger selection effects reduced (i.e., it moves further away from 0.50), such that it could offset the bias generated by the stronger . For example, when and ,the bias under was weaker than under M2 ( under and under M2) because despite the stronger ( under and under M2) was further away from 0.50 ( under and for M2). On the other hand, when , stronger selection effects strengthened , and lowered closer to 0.50 (except when under , where ), such that both bias parameters worked in concert to increase the bias in the overall association.
Evaluating the range of plausible parameters for simulation inputs is important for bias analyses. With a single unmeasured variable , neither M1 nor M2 produced associations near the observed OR of 0.77 from the original study (Raz et al. 2018b). On the other hand, when both mechanisms operate together () and there is just a single unmeasured variable , not only do the selection effects need to be quite strong, but also needs to be relatively common in the population to yield this observed OR. For example, an of 3.0 and of 0.75 yielded an observed association of 0.75 (95% SI: 0.64, 0.89) in our simulations. Thus, with a single , not only are there very few scenarios that could produce the observed estimate, but the magnitude of the input parameters that could potentially generate the bias strains credibility. That is, although it is perhaps plausible that one of the selection effects is that large, it seems unlikely that all three are. Therefore, strong bias is more likely under either M2 or when is a composite of uncontrolled variables (as well as those that have been controlled for, just imperfectly), because the would only need to be between 1.5 and 2.0 to generate a bias of similar magnitude. With selection effects of 2.0, and for each , the was 0.87 and 0.78 under M2 when there were two and three , respectively; whereas, with the same input parameters, the was 0.79 and 0.72 under when there were two and three , respectively. Even if these unmeasured factors were just as strongly associated with and , but were less prevalent in the population, the mean is only slightly attenuated. For example, under the same selection effects, but when for each , the was 0.90 and 0.82 under M2 when there were two and three , respectively, and the was 0.83 and 0.78 under when there were two and three , respectively.
Potential candidates for include prenatal stress (Bercum et al. 2015; Beversdorf et al. 2005; Bruckner et al. 2016; Brunton 2013; Class et al. 2014; Coughlan et al. 2014; Dean et al. 2015; Kinney et al. 2008a, 2008b; Li et al. 2012; Nepomnaschy et al. 2006; Plana-Ripoll et al. 2016; Roberts et al. 2013, 2014; Wainstock et al. 2013; Walder et al. 2014; Wisborg et al. 2008), maternal smoking (Caramaschi et al. 2018; Jung et al. 2017; Marufu et al. 2015; Pineles et al. 2014), genetic factors (Grove et al. 2019; Page and Silver 2016; Risch et al. 1999), and environmental stressors such as endocrine-disrupting chemicals (EDCs) (Jensen et al. 2015; Kalkbrenner et al. 2014; Krieg et al. 2016; Pelch et al. 2019). Many of these associations have been reported to be in line with or stronger than an of 1.5–2.0, and the collective exposure to these factors (or just a subset) in the population is likely not uncommon. For example, maternal smoking during pregnancy (any vs. none) has been associated with an OR of 1.47 for stillbirth (Marufu et al. 2015) and an OR of 1.56 for autism (Caramaschi et al. 2018). Furthermore, EDCs such as polychlorinated biphenyls (PCBs) have been associated with pregnancy loss with ORs ranging from 1.6 to 2.52, depending on the type of PCB, when comparing those accidentally exposed vs. those unexposed to accidental contamination of rice oil during the Yusho incident in Japan in 1968 (Krieg et al. 2016; Tsukimori et al. 2008). PCBs have also been associated with increased odds of autism, where a prior study reported ORs ranging from 1.20 to 1.97, depending on the type of PCB, when comparing the highest to the lowest quartile of exposure (Bernardo et al. 2019). Although we identified these s for our illustrative example of and ASD, they are also relevant for the estimation of the effects of any exposure during pregnancy that has the potential to cause loss and ASD. Furthermore, an of 1.5 to 2.0 is also plausible for the effects of on loss. Although past studies have reported ORs ranging from 1.04 to 1.27 for the association between (typically per ) and pregnancy loss (Grippo et al. 2018), these are likely biased downward because only a subset of pregnancy losses come to medical attention and can be studied. That is, early pregnancy loss, which has been estimated to be around 20%–30% (Wilcox et al. 1988), are typically not observed, such that loss early in pregnancy would go undetected, and the resultant association would underestimate the true harmful effect of on pregnancy loss. Because our simulation code is available online, we encourage other investigators to evaluate the potential bias arising from live-birth bias with input parameter values that are relevant to their own research.
Although our simulations could generate the magnitude of the protective effect reported from the original nested case–control study using a range of plausible input parameter values, they are simplified depictions of potential causal structures and, therefore, should not be directly compared with estimates from analyses using real data. For example, for simplicity, we assumed no confounding of the exposure effects, no loss to follow-up among live-born children, no measurement error, and no seasonal or time trends in the exposure or outcome. It is unlikely that all these assumptions would hold in a real analysis. The potential bias in the original nested case–control study may actually be a net downward bias (assuming that exposure to during pregnancy is not neuroprotective) arising from a combination of residual and/or unmeasured confounding, selection bias due to nonrandom attrition between birth and ASD assessment (although such selection would be subject to the same issues we describe here for live-birth bias), exposure measurement error (which typically biases the estimate toward the null), outcome misclassification, and model misspecification (e.g., imperfect control for seasonal trends) in addition to live-birth bias. However, the simplicity of our current simulation study is also its strength, in that in our simulations we can isolate and fully identify the bias due to the specific fetal selection mechanism. Furthermore, as prior knowledge of the magnitude and sign of the selection effects (, , , ) is limited because pregnancy loss is a challenging outcome to study (Finer and Zolna 2016; Wilcox et al. 1988), we set these ORs to be equal in our simulations for simplicity, but presumably similar associations could be seen with some OR lower and others higher. Along similar lines, we ran our simulations under the null for simplicity, which is sufficient to evaluate the magnitude of bias, because it does not depend on the effect of on ASD. For example, if we observe an of 0.75 for a given selection mechanism under the null, then a true effect of (i.e., the inverse of 0.75) would be rendered null by this selection mechanism. Finally, we also assumed that exposure groups are exchangeable in the total population of all conceptions (e.g., conceptions are not affected by selection processes induced by preconception exposures). However, this exchangeability may not be the case because there are likely selection processes that influence fertility (i.e., the likelihood of conception). Excluding women of reproductive age who are trying but are unable to conceive (because pregnancy is a requirement to study exposures during pregnancy) may lead to biased exposure–health effects in the set of actual conceptions, which differs from the total population of intended conceptions. This bias would act through mechanisms similar to those that we address in this paper’s simulations and therefore could amplify live-birth bias in an analysis with real data (Figure S1).
Here, we show that live-birth bias under plausible simulation parameters can lead to associations of and ASD that are biased downward, where the largest bias occurs when both M1 and M2 both operate simultaneously (i.e., ). This bias may explain the inconsistent body of literature (Flores-Pajot et al. 2016; Lam et al. 2016; Raz et al. 2018b; Weisskopf et al. 2015; Yang et al. 2017), where truly adverse effects may appear not as harmful, null, or even protective. Although we used and ASD for our illustrative example, this bias can extend to other studies relevant to fetal programming (Barker 2004), which can limit the identification of harmful prenatal exposure effects and prevent the development of interventions during pregnancy aimed at promoting better health. For example, it is possible that live-birth bias can also explain the unexpected negative associations between prenatal exposure to perfluoroalkyl substances with ASD (Hertz-Picciotto et al. 2008) and attention-deficit/hyperactivity disorder (Fei and Olsen 2011; Liew et al. 2015b; Ode et al. 2014; Stein et al. 2013).
To rule out live-birth bias as a threat to internal validity, we would need to show that the exposure in question does not affect selection (i.e., fetal loss); that is, if we find that exposure does not affect selection, either independently or in conjunction with another risk factor for fetal loss, then the association with the outcome cannot be biased through this mechanism. If, on the other hand, exposure is associated with selection, then to potentially mitigate or eliminate this bias, we would need to collect information on (something we would need to plan for in the study design phase) and adjust for it in the analysis. Last, if there is reason to believe that there are no common causes of selection and the outcome, then there would be no live-birth bias, even if the exposure affects selection (although this is impossible to verify in practice). It would still be worthwhile to quantify the effect of exposure on selection, because it gives us insight into the change in the potential number of losses and the child outcome (when considered jointly with its effect on the outcome) if we were to intervene to set exposure to another level. All of these analyses require estimating the effects on selection, which is no simple feat; however, new approaches for studying pregnancy loss without needing to enumerate the population at risk (i.e., all conceptions) (Kioumourtzoglou et al. 2019) makes such an undertaking less daunting. Thus, our study findings highlight the need for cautious interpretations of studies of the effects of prenatal exposures on postnatal outcomes and for more investment into research on the determinants of pregnancy loss.
Supplementary Material
Acknowledgments
This work was supported by grants P30ES000002 and R21ES026900 from the National Institutes of Health.
References
- Barker DJP. 2004. The developmental origins of adult disease. J Am Coll Nutr 23(suppl 6):588S–595S, PMID: 15640511, 10.1080/07315724.2004.10719428. [DOI] [PubMed] [Google Scholar]
- Bercum FM, Rodgers KM, Benison AM, Smith ZZ, Taylor J, Kornreich E, et al. 2015. Maternal stress combined with terbutaline leads to comorbid autistic-like behavior and epilepsy in a rat model. J Neurosci 35(48):15894–15902, PMID: 26631470, 10.1523/JNEUROSCI.2803-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernardo BA, Lanphear BP, Venners SA, Arbuckle TE, Braun JM, Muckle G, et al. 2019. Assessing the relation between plasma PCB concentrations and elevated autistic behaviours using Bayesian predictive odds ratios. Int J Environ Res Public Health 16(3):457, PMID: 30764478, 10.3390/ijerph16030457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beversdorf DQ, Manning SE, Hillier A, Anderson SL, Nordgren RE, Walters SE, et al. 2005. Timing of prenatal stressors and autism. J Autism Dev Disord 35(4):471–478, PMID: 16134032, 10.1007/s10803-005-5037-8. [DOI] [PubMed] [Google Scholar]
- Bruckner TA, Mortensen LH, Catalano RA. 2016. Spontaneous pregnancy loss in Denmark following economic downturns. Am J Epidemiol 183(8):701–708, PMID: 27009344, 10.1093/aje/kww003. [DOI] [PubMed] [Google Scholar]
- Brunton PJ. 2013. Effects of maternal exposure to social stress during pregnancy: consequences for mother and offspring. Reproduction 146(5):R175–89, PMID: 23901130, 10.1530/REP-13-0258. [DOI] [PubMed] [Google Scholar]
- Caramaschi D, Taylor AE, Richmond RC, Havdahl KA, Golding J, Relton CL, et al. 2018. Maternal smoking during pregnancy and autism: using causal inference methods in a birth cohort study. Transl Psychiatry 8(1):262, PMID: 30498225, 10.1038/s41398-018-0313-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Class QA, Abel KM, Khashan AS, Rickert ME, Dalman C, Larsson H, et al. 2014. Offspring psychopathology following preconception, prenatal and postnatal maternal bereavement stress. Psychol Med 44(1):71–84, PMID: 23591021, 10.1017/S0033291713000780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coughlan C, Walters S, Ledger W, Li TC. 2014. A comparison of psychological stress among women with and without reproductive failure. Int J Gynecol Obstet 124(2):143–147, PMID: 24290538, 10.1016/j.ijgo.2013.08.006. [DOI] [PubMed] [Google Scholar]
- Dean RG, Dean J, Heller GZ, Leader LR. 2015. A mass shooting at Port Arthur, Tasmania, Australia: a study of its impact on early pregnancy losses using a conception time-based methodology. Hum Reprod 30(11):2671–2676, PMID: 26355115, 10.1093/humrep/dev200. [DOI] [PubMed] [Google Scholar]
- Fei C, Olsen J. 2011. Prenatal exposure to perfluorinated chemicals and behavioral or coordination problems at age 7 years. Environ Health Perspect 119(4):573–578, PMID: 21062688, 10.1289/ehp.1002026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finer LB, Zolna MR. 2016. Declines in unintended pregnancy in the United States, 2008–2011. N Engl J Med 374(9):843–852, PMID: 26962904, 10.1056/NEJMsa1506575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flores-Pajot MC, Ofner M, Do MT, Lavigne E, Villeneuve PJ. 2016. Childhood autism spectrum disorders and exposure to nitrogen dioxide, and particulate matter air pollution: a review and meta-analysis. Environ Res 151:763–776, PMID: 27609410, 10.1016/j.envres.2016.07.030. [DOI] [PubMed] [Google Scholar]
- Glymour MM, Vittinghoff E. 2014. Selection bias as an explanation for the obesity paradox: just because it’s possible doesn’t mean it’s plausible. Epidemiology 25(1):4–6, PMID: 24296924, 10.1097/EDE.0000000000000013. [DOI] [PubMed] [Google Scholar]
- Greenland S. 2003. Quantifying biases in causal models: classical confounding vs collider-stratification bias. Epidemiology 14(3):300–306, PMID: 12859030. [PubMed] [Google Scholar]
- Grippo A, Zhang J, Chu L, Guo Y, Qiao L, Myneni AA, et al. 2018. Air pollution exposure during pregnancy and spontaneous abortion and stillbirth. Rev Environ Health 33(3):247–264, PMID: 29975668, 10.1515/reveh-2017-0033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. 2019. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet 51(3):431–444, PMID: 30804558, 10.1038/s41588-019-0344-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hernan M, Robins J. 2020. Causal Inference: What If. Boca Raton: Chapman & Hall/CRC. [Google Scholar]
- Hernández-Díaz S, Schisterman EF, Hernán MA. 2006. The birth weight “paradox” uncovered? Am J Epidemiol 164(11):1115–1120, PMID: 16931543, 10.1093/aje/kwj275. [DOI] [PubMed] [Google Scholar]
- Hertz-Picciotto I, Park HY, Dostal M, Kocan A, Trnovec T, Sram R. 2008. Prenatal exposures to persistent and non-persistent organic compounds and effects on immune system development. Basic Clin Pharmacol Toxicol 102(2):146–154, PMID: 18226068, 10.1111/j.1742-7843.2007.00190.x. [DOI] [PubMed] [Google Scholar]
- Howards P, Hertz-Picciotto I, Poole C. 2007. Conditions for bias from differential left truncation. Am J Epidemiol 165(4):444–452, PMID: 17150983, 10.1093/aje/kwk027. [DOI] [PubMed] [Google Scholar]
- Howe CJ, Cole SR, Lau B, Napravnik S, Eron JJ. 2016. Selection bias due to loss to follow up in cohort studies. Epidemiology 27(1):91–97, PMID: 26484424, 10.1097/EDE.0000000000000409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jensen TK, Andersen LB, Kyhl HB, Nielsen F, Christesen HT, Grandjean P. 2015. Association between perfluorinated compound exposure and miscarriage in Danish pregnant women. PLoS One 10(4):e0123496, PMID: 25848775, 10.1371/journal.pone.0123496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jung Y, Lee AM, McKee SA, Picciotto MR. 2017. Maternal smoking and autism spectrum disorder: meta-analysis with population smoking metrics as moderators. Sci Rep 7(1):, PMID: 28659613, 10.1038/s41598-017-04413-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalkbrenner AE, Schmidt RJ, Penlesky AC. 2014. Environmental chemical exposures and autism spectrum disorders: a review of the epidemiological evidence. Curr Probl Pediatr Adolesc Health Care 44(10):277–318, PMID: 25199954, 10.1016/j.cppeds.2014.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kinney DK, Miller AM, Crowley DJ, Huang E, Gerber E. 2008a. Autism prevalence following prenatal exposure to hurricanes and tropical storms in Louisiana. J Autism Dev Disord 38(3):481–488, PMID: 17619130, 10.1007/s10803-007-0414-0. [DOI] [PubMed] [Google Scholar]
- Kinney DK, Munir KM, Crowley DJ, Miller AM. 2008b. Prenatal stress and risk for autism. Neurosci Biobehav Rev 32(8):1519–1532, PMID: 18598714, 10.1016/j.neubiorev.2008.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kioumourtzoglou MA, Raz R, Wilson A, Fluss R, Nirel R, Broday DM, et al. 2019. Traffic-related air pollution and pregnancy loss. Epidemiology 30:4–10, PMID: 30199416, 10.1097/EDE.0000000000000918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krieg SA, Shahine LK, Lathi RB. 2016. Environmental exposure to endocrine-disrupting chemicals and miscarriage. Fertil Steril 106(4):941–947, PMID: 27473347, 10.1016/j.fertnstert.2016.06.043. [DOI] [PubMed] [Google Scholar]
- Lajous M, Banack HR, Kaufman JS, Hernán MA. 2015. Should patients with chronic disease be told to gain weight? The obesity paradox and selection bias. Am J Med 128(4):334–336, PMID: 25460531, 10.1016/j.amjmed.2014.10.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lam J, Sutton P, Kalkbrenner A, Windham G, Halladay A, Koustas E, et al. 2016. A systematic review and meta-analysis of multiple airborne pollutants and autism spectrum disorder. PLoS One 11(9):e0161851, PMID: 27653281, 10.1371/journal.pone.0161851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, Newell-Price J, Jones GL, Ledger WL, Li TC. 2012. Relationship between psychological stress and recurrent miscarriage. Reprod Biomed Online 25(2):180–189, PMID: 22687324, 10.1016/j.rbmo.2012.03.012. [DOI] [PubMed] [Google Scholar]
- Liew Z, Olsen J, Cui X, Ritz B, Arah OA. 2015a. Bias from conditioning on live birth in pregnancy cohorts: an illustration based on neurodevelopment in children after prenatal exposure to organic pollutants. Int J Epidemiol 44(1):345–354, PMID: 25604449, 10.1093/ije/dyu249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liew Z, Ritz B, von Ehrenstein OS, Bech BH, Nohr EA, Fei C, et al. 2015b. Attention deficit/hyperactivity disorder and childhood autism in association with prenatal exposure to perfluoroalkyl substances: a nested case–control study in the Danish National Birth Cohort. Environ Health Perspect 123(4):367–373, PMID: 25616253, 10.1289/ehp.1408412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lisonkova S, Joseph KS. 2015. Left truncation bias as a potential explanation for the protective effect of smoking on preeclampsia. Epidemiology 26(3):436–440, PMID: 25695352, 10.1097/EDE.0000000000000268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marufu TC, Ahankari A, Coleman T, Lewis S. 2015. Maternal smoking and the risk of still birth: systematic review and meta-analysis. BMC Public Health 15(1):239, PMID: 25885887, 10.1186/s12889-015-1552-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Munafò MR, Tilling K, Taylor AE, Evans DM, Davey Smith G. 2018. Collider scope: when selection bias can substantially influence observed associations. Int J Epidemiol 47(1):226–235, PMID: 29040562, 10.1093/ije/dyx206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nepomnaschy PA, Welch KB, McConnell DS, Low BS, Strassmann BI, England BG. 2006. Cortisol levels and very early pregnancy loss in humans. Proc Natl Acad Sci USA 103(10):3938–3942, PMID: 16495411, 10.1073/pnas.0511183103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ode A, Källén K, Gustafsson P, Rylander L, Jönsson BAG, Olofsson P, et al. 2014. Fetal exposure to perfluorinated compounds and attention deficit hyperactivity disorder in childhood. PLoS One 9(4):e95891, PMID: 24760015, 10.1371/journal.pone.0095891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Page JM, Silver RM. 2016. Genetic causes of recurrent pregnancy loss. Clin Obstet Gynecol 59(3):498–508, PMID: 27414972, 10.1097/GRF.0000000000000217. [DOI] [PubMed] [Google Scholar]
- Pelch KE, Bolden AL, Kwiatkowski CF. 2019. Environmental chemicals and autism: a scoping review of the human and animal research. Environ Health Perspect 127(4):46001, PMID: 30942615, 10.1289/EHP4386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pineles BL, Park E, Samet JM. 2014. Systematic review and meta-analysis of miscarriage and maternal exposure to tobacco smoke during pregnancy. Am J Epidemiol 179(7):807–823, PMID: 24518810, 10.1093/aje/kwt334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Plana-Ripoll O, Parner E, Olsen J, Li J. 2016. Severe stress following bereavement during pregnancy and risk of pregnancy loss: results from a population-based cohort study. J Epidemiol Community Health 70(5):424–429, PMID: 26584858, 10.1136/jech-2015-206241. [DOI] [PubMed] [Google Scholar]
- Raz R, Kioumourtzoglou MA, Weisskopf MG. 2018a. Live-Birth bias and observed associations between air pollution and autism. Am J Epidemiol 187(11):2292–2296, PMID: 30099488, 10.1093/aje/kwy172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raz R, Levine H, Pinto O, Broday DM, Yuval, Weisskopf MG. 2018b. Traffic-related air pollution and autism spectrum disorder: a population-based nested case-control study in Israel. Am J Epidemiol 187:717–725, PMID: 29020136, 10.1093/aje/kwx294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Risch N, Spiker D, Lotspeich L, Nouri N, Hinds D, Hallmayer J, et al. 1999. A genomic screen of autism: evidence for a multilocus etiology. Am J Hum Genet 65(2):493–507, PMID: 10417292, 10.1086/302497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts AL, Koenen KC, Lyall K, Ascherio A, Weisskopf MG. 2014. Women’s posttraumatic stress symptoms and autism spectrum disorder in their children. Res Autism Spectr Disord 8(6):608–616, PMID: 24855487, 10.1016/j.rasd.2014.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts AL, Lyall K, Rich-Edwards JW, Ascherio A, Weisskopf MG. 2013. Association of maternal exposure to childhood abuse with elevated risk for autism in offspring. JAMA Psychiatry 70(5):508–515, PMID: 23553149, 10.1001/jamapsychiatry.2013.447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith LH, Vanderweele TJ. 2019. Bounding bias due to selection. Epidemiology 30(4):509–516, PMID: 31033690, 10.1097/EDE.0000000000001032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sperrin M, Candlish J, Badrick E, Renehan A, Buchan I. 2016. Collider bias is only a partial explanation for the obesity paradox. Epidemiology 27:525–530, PMID: 27075676, 10.1097/EDE.0000000000000493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stein CR, Savitz DA, Bellinger DC. 2013. Perfluorooctanoate and neuropsychological outcomes in children. Epidemiology 24(4):590–599, PMID: 23680941, 10.1097/EDE.0b013e3182944432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tsukimori K, Tokunaga S, Shibata S, Uchi H, Nakayama D, Ishimaru T, et al. 2008. Long-term effects of polychlorinated biphenyls and dioxins on pregnancy outcomes in women affected by the Yusho incident. Environ Health Perspect 116(5):626–630, PMID: 18470296, 10.1289/ehp.10686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wainstock T, Lerner-Geva L, Glasser S, Shoham-Vardi I, Anteby EY. 2013. Prenatal stress and risk of spontaneous abortion. Psychosom Med 75(3):228–235, PMID: 23362503, 10.1097/PSY.0b013e318280f5f3. [DOI] [PubMed] [Google Scholar]
- Walder DJ, Laplante DP, Sousa-Pires A, Veru F, Brunet A, King S. 2014. Prenatal maternal stress predicts autism traits in 6 1/2 year-old children: Project Ice Storm. Psychiatry Res 219(2):353–360, PMID: 24907222, 10.1016/j.psychres.2014.04.034. [DOI] [PubMed] [Google Scholar]
- Walker A. 1991. Observation and Inference: An Introduction to the Methods of Epidemiology. Chestnut Hill, MA: Epidemiology Resources Incorporated. [Google Scholar]
- Weisskopf MG, Kioumourtzoglou MA, Roberts AL. 2015. Air pollution and autism spectrum disorders: causal or confounded? Curr Environ Health Rep 2(4):430–439, PMID: 26399256, 10.1007/s40572-015-0073-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilcox AJ, Weinberg CR, O’Connor JF, Baird DD, Schlatterer JP, Canfield RE, et al. 1988. Incidence of early loss of pregnancy. N Engl J Med 319(4):189–194, PMID: 3393170, 10.1056/NEJM198807283190401. [DOI] [PubMed] [Google Scholar]
- Wisborg K, Barklin A, Hedegaard M, Henriksen TB. 2008. Psychological stress during pregnancy and stillbirth: prospective study. BJOG 115(7):882–885, PMID: 18485167, 10.1111/j.1471-0528.2008.01734.x. [DOI] [PubMed] [Google Scholar]
- Yang C, Zhao W, Deng K, Zhou V, Zhou X, Hou Y. 2017. The association between air pollutants and autism spectrum disorders. Environ Sci Pollut Res Int 24(19):15949–15958, PMID: 28540549, 10.1007/s11356-017-8928-2. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.