Skip to main content
Environmental Health Perspectives logoLink to Environmental Health Perspectives
. 2010 Dec 8;119(3):409–415. doi: 10.1289/ehp.1002453

Statistical Methods to Study Timing of Vulnerability with Sparsely Sampled Data on Environmental Toxicants

Brisa Ney Sánchez 1,, Howard Hu 2, Heather J Litman 3, Martha Maria Téllez-Rojo 4
PMCID: PMC3060007  PMID: 21362588

Abstract

Background

Identifying windows of vulnerability to environmental toxicants is an important area in children’s health research.

Objective

We compared and contrasted statistical approaches that may help identify windows of vulnerability by formally testing differences in exposure effects across time of exposure, incorporating continuous time metrics for timing of exposure, and efficiently incorporating incomplete cases.

Methods

We considered four methods: 1) window-specific and simultaneously adjusted regression; 2) multiple informant models; 3) using features of individual exposure patterns to predict outcomes; and 4) models of population exposure patterns depending on the outcome. We illustrate them using a study of prenatal vulnerability to lead in relation to Bayley’s Mental Development Index at 24 months of age (MDI24).

Results

The estimated change in MDI24 score with a 1-loge-unit increase in blood lead during the first trimester was −2.74 [95% confidence interval (CI), −5.78 to 0.29] based on a window-specific regression. The corresponding change in MDI24 was −4.13 (95% CI, −7.54 to −0.72) based on a multiple informant model; estimated effects were similar across trimesters (p = 0.23). Results from method 3 suggested that blood lead levels in early pregnancy were significantly associated with reduced MDI24, but decreasing blood leads over the course of pregnancy were not. Method 4 results indicated that blood lead levels before 17 weeks of gestation were lower among children with MDI24 scores in the 90th versus the 10th percentile (p = 0.08).

Conclusions

Method 2 is preferred over method 1 because it enables formal testing of differences in effects across a priori–defined windows (e.g., trimesters of pregnancy). Methods 3 and 4 are preferred over method 2 when there is large variability in the timing of exposure assessments among participants. Methods 3 and 4 yielded smaller p-values for tests of the hypothesis that not only level but also timing of lead exposure are relevant predictors of MDI24; systematic power comparisons are warranted.

Keywords: exposure patterns, lead, multiple informants, neurodevelopment, sensitive periods of development


Identifying critical windows of vulnerability to environmental toxicants is an important area in children’s health research. Critical windows of vulnerability are defined as periods during life when an exposure causes a stronger deficit in health later in life compared with other periods when exposure (could have) occurred (Barr et al. 2000; Selevan et al. 2000; West 2002). The research question is often formulated as “Is the exposure more strongly associated with the health outcome if it occurred in time window 1, 2, …, or n?” Formulating the question in terms of discrete time windows is advantageous from a clinical and practical perspective. However, considering timing of exposure in a continuous fashion may be more advantageous from an analytical standpoint.

We discuss three statistical approaches that may be useful in studies of windows, or timing, of vulnerability; two of these use time of exposure as a continuous variable. We compare the proposed methods with commonly used statistical approaches such as fitting separate regression models for each potential window and fitting simultaneously adjusted multiple regressions that include all exposure measures (time windows) in one model.

As exposition, we use data from a study of lead exposure during pregnancy and mental development. We refer readers to published studies of lead exposure for information regarding the toxicologic effects of lead on development and the relevance of time windows of vulnerability in that context (Hu et al. 2006; Schnaas et al. 2006). We focus on statistical modeling issues and apply all methods to the same data so that the inferences and interpretations across methods can be compared.

Materials and Methods

Data source

We use data from an ELEMENT (Early Life Exposures in Mexico to Environmental Toxicants) study cohort of mother–child pairs recruited during pregnancy or before conception (Téllez-Rojo et al. 2004). We use maternal blood lead concentrations ascertained during study visits scheduled within each trimester. Low birth weight (< 2,500 g) and preterm (< 37 weeks) children were excluded. Children’s mental development was measured at 24 months using Bayley’s Mental Development Index (MDI24) (Bayley 1993). We also collected other participant characteristics (Table 1); data collection procedures are reported elsewhere (Hu et al. 2006; Téllez-Rojo et al. 2004). This analysis was restricted to n = 169 participants with complete covariates (mother’s age and IQ, breast-feeding duration, and child’s sex, height, weight, and blood lead level at 24 months) and at least one measure of prenatal lead exposure.

Table 1.

Participant characteristics, ELEMENT study data.

Variable n Mean ± SD Min, max
Maternal characteristics

 Ln(blood lead) (μg/dL), T1 139 1.90 ± 0.55 0.095, 3.57
 Ln(blood lead) (μg/dL), T2 159 1.78 ± 0.45 0.095, 3.01
 Ln(Blood lead) (μg/dL), T3 147 1.85 ± 0.53 0.18, 3.64
 Age (years) 169 26.7 ± 5.2 18, 42
 IQ 169 89 ± 12.8 55, 120

Child

 Sex (percent male) 169 50.9%
 Weight at 24 months (kg) 169 12 ± 1.5 9.40, 19.30
 Height z-score at 24 months 169 −0.1 ± 0.93 −3.78, 3.22
 Ln(blood lead) at 24 months 169 1.39 ± 0.63 −0.22, 3.60
 MDI24 169 91.8 ± 11.6 68, 122
 Breast-feeding duration (months) 169 6.4 ± 5.7 0, 24

Timing of sample collection (weeks)

 T1 139 13.7 ± 3.4 3.9, 20.4
 T2 159 24.5 ± 2.8 18.0, 33.7
 T3 147 35.2 ± 1.9 29.0, 39.0

Correlations among ln(blood lead) levels T1 T2 T3

 Ln(blood lead) T1 1
 Ln(blood lead) T2 0.66 1
 Ln(blood lead) T3 0.56 0.61 1
 Ln(blood lead) of child at 24 months 0.17 0.24 0.27

Abbreviations: max, maximum; min, minimum; T, trimester.

Participants gave written informed consent before data collection. The study was approved by the institutional review boards of the hospitals where participants were recruited, the National Institute of Public Health of Mexico, Brigham and Women’s Hospital, Harvard School of Public Health, and the University of Michigan.

Notation

For each subject i, assume X1i, X2i, … XKi are measures of exposure taken during Ki time points t1i, t2i, …, tKi; Yi is the health outcome, and Zi are mean-centered covariates. For the lead study, Yi is MDI24, X1i, X2i, … XKi are loge(maternal blood lead levels, micrograms per deciliter), and Zi are maternal age, maternal IQ, child’s sex, and child’s 24-month weight and height-for-age z-score (Kuczmarski et al. 2002). For methods 1 and 2, we assume the Ki measures of exposure fall within K time windows (i.e., K = three trimesters).

Method 1: separate and simultaneously adjusted multiple linear regression models

A commonly used statistical approach in studies of prenatal windows of vulnerability is to fit regression models for each potential window (Aguilera et al. 2009; Bell et al. 2007; Hu et al. 2006; Meyer et al. 2007; Mohorovic 2004; Parker et al. 2005). The magnitude and significance of the regression coefficients are compared to draw conclusions about which window of exposure may be more important.

This approach is intuitive, but it has two assumptions that are not always discussed. First, the exposure is assumed constant within the window. For example, a single maternal blood lead measure during the first trimester is assumed to be representative of average exposure during the entire trimester. Second, although it is recognized that “the closer we look, the more evident it is that often there is not a uniform response within a given window” (Barr et al. 2000), the exposure effect (e.g., the effect of prenatal lead exposure on mental development) is assumed to be constant within a time window. When these assumptions are violated, the estimated effects within a predefined window may be biased toward or away from the null, depending on the true nature of the exposure effect and the time at which exposures are measured. Specific examples are shown in the Supplemental Material, Figures 1 and 2 (doi:10.1289/ehp.1002453).

Figure 1.

Figure 1

(A) Maternal blood lead pattern across gestational period (weeks) for children in the 10th percentile (solid) and 90th percentile (dashed) of the covariate-adjusted MDI24 distribution with 95% pointwise CIs. (B) Relative exposure comparing those in the 10th percentile of the MDI distribution with those in the 90th percentile, with 95% pointwise CIs.

The approach is also subject to other statistical limitations. First, estimating separate regressions for each time window precludes formal testing of the differences in effects across time windows. Interpreting nonoverlapping confidence intervals (CIs) as proof that effects differ is not valid, because the estimated coefficients are not statistically independent. This issue is analogous to the incorrect choice of a two-sample t-test over a (correct) paired t-test for testing differences between repeated measures within an individual. Second, it is possible that not all regressions will be based on the same group of subjects because of missing data. For example, a popular approach to maximize the sample size for each regression is to include all observations with data for a given time window in the regression for that time window, even though some may be excluded from models of other time windows because of missing data. However, the available case approach is valid only if data are missing completely at random (MCAR) (Little and Rubin 2002), and it is not possible to verify this assumption based on available (nonmissing) data (for example, not finding significant differences in bivariate analyses comparing subjects with complete versus incomplete data is not sufficient proof of MCAR). Third, because the same outcome is used in all regressions, the hypotheses tested are not independent, and precision may be over- or underestimated because the correlations among the residual errors from the regressions are ignored.

A straightforward alternative to fitting separate regressions for each time window is to fit a single simultaneously adjusted regression model that includes all time windows. An advantage of this approach is that estimates of the independent effect of exposure during each window may be obtained (Ha et al. 2007; Lubin et al. 1997). However, the approach is not always feasible because of collinearity issues (e.g., inflated standard errors, unstable regression coefficients). Further, models are restricted to observations that have complete data for all variables.

We estimated separate and simultaneously adjusted multiple linear regression models to estimate trimester-specific associations between MDI24 and loge-transformed maternal blood lead levels.

Method 2: multiple informants

Multiple informant data refers to information gathered from different individuals or sources used to measure the same construct (Horton et al. 1999; Litman et al. 2007b). One classic example is where mother and child both respond to questions regarding the physical activity of the child, so that the mother and child are the informants. Methods for multiple informant data also can be applied when exposure information is obtained for the same individual at different time points (windows) by treating the exposure windows as informants. Multiple informant methods can be used to test whether the information relayed by different informants (in this case, whether exposure is measured during different time windows) relates in the same manner to an outcome of interest. Although this approach retains the interpretation (and assumptions) of a set of separate multiple regressions (by providing a single estimate of effect for exposure in each time window), it also provides a way to test differences in associations between the exposure and the outcome across time windows.

Model

The associations of primary interest are estimated by β1k from window-specific multiple regressions Yi = β0k + β1kXki + β2kZi + ɛki, for window k = 1,2, …, K. The multiple informants approach jointly estimates the regression models.

Joint estimation enables us to impose and test constraints on the regression coefficients across exposure windows. That is, test whether the exposure coefficients are equal across time windows, Ho: β11 = β12 = … = β1K versus Ha: at least one β1K differs. Parameter estimation can be conducted using generalized estimating equations (GEE) or maximum likelihood (ML) estimation (Horton et al. 1999; Litman et al. 2007a; Pepe et al. 1999). Details regarding data structure, model fitting, hypothesis testing, and macros to analyze the data using this method are provided in the Supplemental Material (doi:10.1289/ehp.1002453).

ML estimation proceeds by modeling the exposure at different windows and the outcome as having a K + 1 multivariate normal distribution (i.e., exposure is considered a random variable) and subsequently obtaining the distribution of the outcome conditional on the exposure. The test of equal association between exposure and outcome across time windows can also be performed on standardized regression coefficients (i.e., adjusted correlation coefficient denoted ρ1k = σXkβ1kY, k = 1, …, K, where σXk is the SD of the exposure at window k, and σY is the SD of the outcome. Testing adjusted correlation coefficients may be desirable when the variability of the exposure changes over time or if protocols for measuring exposure change across time windows. ML can account for exposure and outcome data missing at random (MAR) (Little and Rubin 2002) and is robust to distributional assumptions when missing data are MCAR (Litman et al. 2007b). MAR means that the fact the data are missing is independent of the actual missing value, after accounting for other observed participant characteristics. MCAR means that the fact the data are missing is independent of the missing value as well as other observed data and hence is more restrictive than MAR.

The GEE approach embeds separate linear regression models for each time window into a unified set of estimating equations. In contrast to the ML approach, the exposures are not considered random (dependent) variables. However, unless more sophisticated methods are used (Horton and Lipsitz 1999), the GEE method retains the MCAR assumption for the missing data.

We applied the multiple informant approach to the ELEMENT study data. For the GEE approach we used a score type test of the hypothesis Ho: β11 = β12 = β13 [degrees of freedeom (df) = 2]. Because the variance of the exposure during trimester 2 was smaller than during the other two trimesters, in the ML approach we tested for homogeneity of standardized exposure effects across time windows (Ho: ρ11 = ρ12 = ρ13) using a likelihood ratio test (df = 2). The p-values for these tests are denoted pint, alluding to an interaction between exposure level and timing of exposure.

Method 3: individual patterns of exposure in relation to outcome

When assumptions regarding the timing of exposure in methods 1 and 2 are violated (i.e., when timing of exposure measurement or the effect of exposure on the outcome varies within time windows), when a large number of time windows are considered, or when the number of measurement occasions for the exposure varies across participants, the multiple informant approach may no longer be feasible. An alternative is to model the pattern of exposure for each individual over time and then relate exposure features to the outcome.

Model

The underlying idea in this approach is to reduce the number of exposure measures for each individual from Ki, which may vary across participants, to a smaller number equal across participants. A simple example where the exposures are summarized into two exposure features is to model the exposures Xik using a random intercepts and random slopes model, Xik = θ0i + θ1i tik + ɛik, where k = 1, …, Ki represents the kth measurement occasion. The random effects θ0i, and θ1i are person-specific intercepts and slopes that jointly describe the pattern of exposure for individual i and have a population mean θ =0,θ1) and variance Φ. Next, the outcome is modeled in relation to the exposure features θ0i, and θ1i, for example, Yi = β0 + β11 θ0i + β12 θ1i + β2 Zi + ɛi. [Details regarding data structure, model fitting, hypothesis testing, and macros to analyze the data using this method are provided in the Supplemental Material (doi:10.1289/ehp.1002453).]

For the ELEMENT study, where a few measures of exposure were available, we employed a model where a random intercept and slope were the only subject-specific exposure features. The time variable was centered at 7 weeks, the middle of the first trimester, such that θ0i represents blood lead levels during gestational week 7. First, the exposure was modeled as Xik = θ0i + θ1i (tik7) + ɛik, with tik being gestational time in weeks. Because week 7 is the midpoint of the first trimester, θ0i can also be interpreted as the average exposure for individual i during the first trimester, so that study participants with a random intercept θ0i higher than the mean θ0 would have a higher than average lead exposure in early pregnancy. The slope θ1i represents the subject’s rate of change in exposure across the pregnancy. For example, if the population slope θ1 was negative (indicating declining lead levels across pregnancy), a study participant with a random slope θ1i larger than the mean θ1 would have a slower than average rate of decline in lead levels over the course of pregnancy. We modeled the outcome (MDI24) as Yi = β0 + β11 θ0i + β12 θ1i + β2 Zi+ ɛi, where β11 is interpreted as the association between exposure during the first trimester and MDI24, and β12 is the association between changes in exposure over the course of pregnancy and MDI24.

We fit the model for the exposures and the outcome model jointly using ML (Wang et al. 2000), which yields consistent and efficient estimates of model parameters. Heuristically, a two-step approach also could be used where empirical Bayes estimates of the exposure features are obtained, for example, θ̂0i, and θ̂1i, and then substituted in the outcome model. However, the two-step approach gives inconsistent (likely attenuated) estimates of the regression coefficients (Wang et al. 2000).

Method 3 does not require that participants be measured at the same time points, whereas methods 1 and 2 assume the timing of measurements is similar across participants. Furthermore, because the timing of exposure is not discretized into windows, this approach is more useful than the multiple informant approach when windows are not well defined or when little prior information about the etiologically relevant windows are is available. It requires that each participant has sufficient information such that the features can be reliably estimated (i.e., at least one more measurement than features in most participants). In the lead example, at least three measurements are required for most participants, because two features are estimated. Although the interpretation of model parameters may not be as straightforward because the reference to discrete windows is lost, conclusions about the relative importance of broad time periods can still be drawn. In the lead example, this method helps answer the question “After accounting for first-trimester exposure, does changing the exposure in subsequent trimesters matter?” In the lead example, this approach assumes a constant rate of decline in exposure, and the association of exposure with outcome over the course of pregnancy diminishes (or increases) at a linear rate.

Method 4: population pattern of exposure given the outcome

The fourth method consists of describing the population-average exposure pattern for levels of the outcome. For example, in ELEMENT, this method can be used to compare the pattern of exposure for children who achieve high MDI24 scores with the pattern in children with low scores. This approach differs from methods 1–3 because it models the exposure given the health outcome.

Model

The exposure is modeled as Xik = f0(tik) + f1(tik)Yci + δik, where Yci is the outcome for subject i centered at the sample mean (Yci = YiȲ) or centered at its predicted value (Yci = YiŶi, where Ŷi is the predicted outcome given Zi) given factors other than exposure (e.g., suspected confounders and possibly other independent predictors of the outcome). The residuals δik are assumed to have mean zero and covariance Δ within individuals but are independent across individuals. The term f0(tik) represents the exposure pattern over time for those with an average outcome, which can be modeled as a parametric function [e.g., f0(tik) = α00 + α01tik + α02t2ik] or a semiparametric curve (e.g., penalized splines). The term f0(tik)Yci quantifies the differences in exposure over time across levels of the outcome. Both f1(tik) and f0(tik) are modeled in the same fashion [e.g., f1(tik) = α10 + α11tik + α12t2ik if f0(tik) is modeled as a quadratic function]. The coefficients α10, α11, α12 jointly describe the pattern of exposure curve over time.

An overall test of association between exposure and outcome involves testing Ho: f1(t) = 0 vs. Ha: f1(t) ≠ 0. When f1(t) is given by a quadratic function, then the null hypothesis is Ho: α10 = α11 = α12 = 0 versus Ha: at least one α differs from zero. Testing whether the exposure pattern over time is associated with outcome amounts to testing Ho: α11 = α12 = 0 versus Ha: at least one α differs from zero. For timing of susceptibility (i.e., whether exposure effects vary depending upon the timing of exposure), the more relevant hypothesis is the latter.

The estimation of this model can be conducted in two steps. First, a model for the outcome is estimated where only covariates Zi are predictors. The residuals Yci = YiŶi are then constructed, and the model for Xik (i.e., a model of the exposure conditional on the outcome) is estimated. Details regarding data structure, model fitting, hypothesis testing, and macros to analyze the data using method 4 are provided in the Supplemental Material (doi:10.1289/ehp.1002453).

In the ELEMENT data, we obtained Yci = YiŶi, and interpreted it as the deviation of the individual from the expected MDI24 score given mother’s age, mother’s IQ, duration of breastfeeding, sex, and weight and height z-score at 24 months. For example, Yci will be positive when the MDI24 score of a child is higher than predicted given the characteristics listed above. We then estimated the exposure model using tik as gestational time in weeks and nonparametric and quadratic models for f0(t) and f1(t). Finally, we constructed the predicted exposure pattern (back-transformed to natural units of blood lead) for children in the 10th and 90th percentile of the covariate-adjusted MDI distribution and determined the relative difference in exposure between the 10th and 90th percentiles of the covariate-adjusted MDI distribution across time by exponentiating the difference in the predictions of loge-transformed blood lead.

Results

The study visits occurred, on average, toward the end of each trimester, with considerable variability in their timing (Table 1). Notably, although the earliest visit among all study participants was at 3.8 weeks of gestation, the average first-trimester visit occurred at 13.7 weeks. The variability in timing of measurements raises concerns about interpretation of regression coefficients as the association between exposure during each trimester and mental development and was the primary motivation for seeking alternative analytical methods for this type of data. Nevertheless, we keep the language of trimesters in the subsequent descriptions of the results for methods 1 and 2.

The estimated associations from the separate and simultaneously adjusted regression models were as follows: −5.42 (95% CI, −10.2 to −0.64) and −2.74 (95% CI, −5.78 to 0.29) MDI24 points per 1-loge-unit increase in blood lead in the first trimester, respectively (Table 2). Although not significant, the coefficients for trimesters 2 and 3 from the simultaneously adjusted regression were positive (suggesting higher MDI24 scores with higher lead exposure) and imprecise, which would be consistent with collinearity. The coefficients from separate regressions were all negative and decreased in strength across trimesters, with the strongest association observed at trimester 1. However, none of the estimated effects were statistically significant (p > 0.05).

Table 2.

Effect of maternal ln(blood lead) on MDI24, estimated from various approaches.

Multiple regression (method 1)
Multiple informants approach (method 2, n = 169)
Simultaneous adjustmenta
Separate regressionsb
GEE
MLE
Trimester β 95% CI β 95% CI β 95% CI β 95% CI
1 −5.42 −10.20 to −0.64 −2.74 −5.78 to 0.29 −2.74 −5.82 to 0.33 −4.13 −7.54 to −0.72
2 0.88 −5.34 to 7.09 −1.37 −4.81 to 2.07 −1.37 −4.79 to 2.05 −2.98 −6.86 to 0.91
3 1.22 −3.65 to 6.08 −1.15 −4.20 to 1.90 −1.15 −4.18 to 1.88 −2.04 −5.11 to 1.04
pintc NA NA 0.56d 0.23e

Abbreviations: MLE, maximum likelihood estimates; NA, not available.

a

n = 120.

b

For trimester 1, n = 139; trimester 2, n = 159; trimester 3, n = 146.

c

Test for hypothesis that estimates are equal across trimesters.

d

Score test of homogeneity of coefficients.

e

Likelihood ratio tests for homogeneity of standardized estimates.

The estimated associations between first-trimester exposure and MDI24 from the multiple informant approach were −2.74 (95% CI, −5.82 to 0.33) and −4.13 (95% CI, −7.54 to −0.72) MDI24 points per 1-loge-unit increase in blood lead from GEE and ML estimation, respectively (Table 2). As is always the case (Litman et al. 2007a), the GEE estimates were equal to the point estimates obtained from separate regressions, but the CIs varied slightly because the GEE method takes into account within-individual correlation across the time windows. Because there were missing data, the ML and GEE approaches do not give the same estimates. The ML approach estimated a stronger association for trimester 1 (e.g., βGEE = −2.74 vs. βML = −4.13). Although a trend of increasing effect of lead in earlier times during pregnancy is suggested, tests of a varying exposure effect were not significant (GEE pint = 0.56, score test of equal regression coefficients; MLE pint = 0.23, likelihood ratio test of equal standardized coefficients). The ML approach detected a significant association at trimester 1 and exhibited a smaller p-value than GEE for the test of differences in association across trimesters. This reflects greater efficiency of ML estimation relative to GEE when data are missing and when differences in standardized regression coefficients are tested, in addition to using score versus likelihood ratio tests.

The exposure model parameters from method 3 indicate that the average exposure at 7 weeks of gestation was 1.90 loge(blood lead) units (approximately = 6.69 μg/dL), SD 0.49 (Table 3). This is consistent with the average and SDs of the observed measures during trimester 1 (Table 1). There was a significant average decline across pregnancy (average linear decline is 0.04 ln(blood lead) units per 12 weeks) (Table 3).

Table 3.

Parameter estimates for method 3.

Model parameter or predictor Estimate SE p-Value
Exposure model parameters
 θ0 (average blood lead level, loge) 1.90 0.05 < 0.0001
 θ1 (average rate of change per 12 weeks)a −0.04 0.02 < 0.01
 Random intercept SD 0.49
 Random slope SD 0.17
 Correlation of random intercept (θ0i) and slope (θ1i) −0.56
 Residual SD 0.28

Predictors in outcome model β SE p-Value
Blood lead level at week 7 (θ0i, random intercept)b −2.11 1.08 0.05
Changes in blood lead level (θ1i, random slope)a,c 0.58 1.68 0.73
Maternal Age (per 5 years) 3.04 0.77 < 0.01
Maternal IQ (per 10 points) 0.76 0.64 0.24
Sex of child −4.98 1.71 < 0.01
Weight at 24 months −2.13 0.93 0.02
Height z-score at 24 months NA 2.82 1.16 0.01
Breast-feeding duration (per 6 months) −0.63 0.90 0.48
a

Average rate of change is negative; hence, increases in the rate represent slower rates of blood lead level decline.

b

Change in MDI24 with a 1-SD increase in blood lead at 7 weeks.

c

Change in MDI24 with a 1-SD increase in the rate of blood lead level over the course of pregnancy.

The regression coefficients for blood lead level at 7 weeks and the average change in blood lead level are expressed in MDI points associated with a 1-SD change in these predictors. A 1-SD increase in loge-transformed blood lead levels at 7 weeks was significantly associated with a 2.11-point decrease in MDI24 (95% CI, −0.01 to 4.23). Although the changes in exposure over the course of pregnancy were not significantly associated with MDI24 (p = 0.73), the positive association, indicating that a slower rate of decline in blood lead (i.e., sustained exposure) was associated with a 0.58 point increase in MDI24, was unexpected. However, this relation may have resulted from collinearity; although the random intercept and slope were less correlated than the raw lead concentrations (e.g., correlation between trimester 1 and 2 measurements is 0.66), they were still significantly and negatively correlated (correlation is −0.56).

Notably, the correlation between the predicted random intercept and the observed blood lead level at trimester 1 was 0.97, indicating that the random intercept was closely related to the measured exposure at the first visit. The advantage of the random intercept is that it is interpreted as exposure level at 7 weeks for all study participants and can be estimated for participants who did not have a measured blood lead level at trimester 1. This advantage translates to stronger standardized estimates for the association between exposure and MDI [−2.11/0.49 = −4.31 points (95% CI, −8.66 to 0.02) for a 1-log-unit increase in exposure] compared with corresponding estimates based on methods 1 and 2 (2.74 points via separate regressions or GEE, −4.13 points via MLE).

The exposure patterns from method 4 were estimated with quadratic functions (the shapes of f0(t) and f1(t) did not deviate significantly from quadratic); the coefficients for these functions are given in Table 4. Lead exposure and MDI24 were significantly associated (Ho: α10 = α11 = α12 = 0, p = 0.03) based on this model. The exposure patterns over time comparing high with low achievers marginally differed in their shape over time (Ho: α11 = α12 = 0, p = 0.086). Figure 1 portrays the prenatal exposure patterns for children in the 90th and 10th percentile of the covariate-adjusted MDI distribution (top) and their relative difference (bottom). The earliest time at which we estimated exposures was at 4 weeks, the first observed time point among all mothers. As can be observed, the exposure patterns (i.e., the shapes of the curves over time) were similar toward the end of pregnancy but had less overlap earlier in pregnancy. More specifically, children in the 10th percentile of the distribution had significantly higher exposures during the first 17 weeks of pregnancy compared with children in the lower 90th percentile of the distribution. At 7 weeks of pregnancy, those in the 10th percentile had exposure 1.63 (95% CI, 1.09 to 2.45) times higher compared with those in the 90th percentile. The significant relative exposure difference persisted until approximately 17 weeks (relative difference = 1.24; 95% CI, 1.00–1.53).

Table 4.

Parameter estimates for method 4.

Predictor or parameter Estimate SE p-Value
Outcome model predictorsa
 Maternal age (5 years) 2.91 0.79 < 0.001
 Breast-feeding duration (6 months) −0.57 0.91 0.53
 Maternal IQ (10 points) 0.85 0.66 0.20
 Sex of child −2.10 0.90 0.02
 Child’s weight at 24 months −1.73 0.68 0.01
 Child’s height z-score at 24 months 2.72 1.16 0.02
Exposure model parametersb
 Average exposure pattern, f0(t)
  α00 3.12 0.49 < 0.0001
  α01 −1.52 0.69 0.03
  α02 0.58 0.32 0.07
 Relationship with MDI24, f1(t)c
  α10 −0.13 0.05 0.02
  α11 0.13 0.08 0.08
  α12 −0.05 0.04 0.17
a

Estimates indicate changes in MDI24 per unit change in the predictor.

b

Estimates characterize pattern of exposure or relationship between MDI and exposure pattern.

c

H0: f1(t) = 0 vs. f1(t) ≠ 0, p = 0.011, H0: f1(t) = constant versus f1(t) ≠ constant, p = 0.086.

Conclusions

Identifying critical windows of vulnerability is an emerging field in children’s health research. Existing statistical approaches used in this area, primarily based on multiple regression, have important limitations: Formal tests for the difference across windows cannot be performed, missing data cannot be easily incorporated, and variation in the timing of exposure cannot be easily accommodated. We presented three alternative approaches that mitigate these limitations. Table 5 summarizes their assumptions, and compares them with multiple regression. To our knowledge, this is the first application of a multiple informants approach (method 2) and population exposure pattern approach (method 4) to studies of timing of vulnerability in environmental epidemiology.

Table 5.

Summary of model assumptions.

Assumption Method 1: simultaneously adjusted regression Method 1: separate regressions Method 2: multiple informants Method 3: individual exposure patterns Method 4: population exposure patterns
Assumes all participants have the same timing of exposure? Yes Yes Yes No No
Assumes predefined windows? Yes Yes Yes No No
Assumes homogeneous exposure effect within window? Yes Yes Yes No No
Can test difference of estimated exposure effects across time? No No Yes Yes Yes
Minimum number of exposure samples needed per participant One per window One per window At least onea At least twob At least one
Missing data assumptions MCAR MCAR MAR (ML) MCAR (GEE) MAR MAR
Assumed time spacing between one window and another? No restrictionsc No restrictionsc No restrictionsc Some restrictionsd Some restrictionsd
Robust to misclassification of exposure timing? No No No Somewhat Somewhat
Subject to collinearity problems? Yes No No Some No
a

Provided some participants have one in each window.

b

Provided most participants have more.

c

For example, childhood versus adulthood can be compared, even if only two measures (total) are available.

d

Very infrequent measurements or those taken very far apart would make the exposure pattern hard to estimate and interpret.

Modifications of multiple regression have been discussed previously in the literature (e.g., Bell et al. 2007; Hornung et al. 2009; Slama et al. 2007, 2008a). These methods involve a) including as predictors the residuals from models of the exposure during each window regressed on a reference window; b) using ratios of exposure during each window to a reference window; and c) including exposures outside the windows of interests (e.g., postnatal exposure) as a control variable (Slama et al. 2007). However, these methods are limited primarily by their lack of ability to incorporate missing data.

Methods beyond multiple regression that address windows of vulnerability have been discussed in the literature under the rubric of exposure–time response models. These models focus on estimating a weight function, w(t), that measures the relative effect of an exposure increment at time t compared with other times (Hauptmann et al. 2000). The time metric t can be time at exposure as in our case and other examples in cancer research (Richardson and Wing 1998) or time since exposure, as in studies quantifying latency (Berhane et al. 2008; Langholz et al. 1999; Richardson 2009). These methods, and the ones we propose, are geared toward settings of protracted exposures. Specifically, our method 3 can be seen as a special case of exposure–time response models, because the coefficients corresponding to features θ0i (level at week 7) and θ1i (rate of change through pregnancy) can be transformed to obtain an estimate of w(t) (James 2002). The estimated w(t) in our case would be a linear function [see Supplemental Material, Figure 4 (doi:10.1289/ehp.1002453)]. Furthermore, related approaches have also been discussed in life course epidemiology (Hallqvist et al. 2004), where one of the approaches is to develop a taxonomy of exposure patterns based on dichotomized exposures at each exposure window and relate the patterns to health outcomes. All of these methods are related, but a distinguishing factor is the time scale on which they focus and whether time of exposure is discretized or treated as continuous.

Although the proposed approaches circumvent some drawbacks of fitting multiple regression models, they are not without limitations. For example, the tests offered by method 2 are low-powered. In principle, method 3 allowed us to flexibly model individual exposure patterns over time; however, relating outcome to many exposure features may become cumbersome to interpret (e.g., individual-level quadratic trends). Method 4 could be improved by incorporating confidence bands instead of pointwise CIs when presenting the results graphically. However, the test of whether population exposure patterns vary over time between outcome groups is valid.

When the windows of vulnerability refer to prenatal windows, inaccurate ascertainment of gestational age may limit the applicability/interpretation of the above methods. Inaccurate measurement of gestational time may arise from incorrect reporting of last menstrual period, for example. This introduces measurement error and may attenuate effect estimates. When gestational time (e.g., weeks) is used to define windows of vulnerability (e.g., trimesters), then even a few days’ error in gestational age could lead to misclassifying the exposure as occurring in one trimester versus another. Furthermore, because of field issues (e.g., rescheduled appointments), some participants may be seen during a later trimester but their visit recorded as occurring within the previous time window. Misclassification of the timing of exposure would lead to biased exposure–outcome associations. Method 2 (like method 1) is not robust to misclassification of the timing of exposure. Methods 3 and 4 may be less affected by this issue, however, because the time is treated as continuous, such that the extent of misclassification is of a lesser magnitude. Other issues of how inaccurate ascertainment of gestational age may introduce bias in exposure–birth outcome studies, even when using ultrasound to ascertain gestational age, have been discussed in the literature (Slama et al. 2008b).

We described several approaches to study timing of vulnerability using an example with few exposure windows and few covariates. Extensions of the approach to incorporate more covariate information on exposure models for methods 3 and 4 or using categorical instead of continuous outcomes are discussed in the Supplemental Material (doi:10.1289/ehp.1002453). Sensitivity analyses are also included in the Supplemental Material, where blood lead measured at 24 months is evaluated both as an extra window using method 2 and as a confounder using all methods.

Uncovering windows of vulnerability to environmental pollutants is a complex question that requires sophisticated data analysis tools. Methods that test exposure effect differences across time of exposure should be employed before concluding which window is most important.

Of the methods presented, method 2 is preferred over method 1, because it enables formal testing of differences in effects across a priori–defined windows. Methods 3 and 4 are preferred over method 2 when there is large variability in the timing of exposure across participants, as was the case in the example presented.

Footnotes

Supplemental Material is available online (doi:10.1289/ehp.1002453 via http://dx.doi.org/).

This study was supported by U.S. National Institute of Environmental Health Sciences (NIEHS) grants R01 ES007821, R01 ES014930, R01 ES013744, R01 ES016932, R01 ES017022, and P20 ES018171; Consejo Nacional de Ciencia y Tecnología (CONACyT) grant 4150M9405; and CONSERVA, Department of Federal District, México.

The contents of this article are solely the responsibility of the authors and do not necessarily represent the official views of the NIEHS or the National Institutes of Health.

References

  1. Aguilera I, Guxens M, Garcia-Esteban R, Corbella T, Nieuwenhuijsen MJ, Foradada CM, et al. Association between GIS-based exposure to urban air pollution during pregnancy and birth weight in the INMA Sabadell Cohort. Environ Health Perspect. 2009;117:1322–1327. doi: 10.1289/ehp.0800256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barr M, DeSesso JM, Lau CS, Osmond C, Ozanne SE, Sadler TW, et al. Workshop to identify critical windows of exposure for children’s health: cardiovascular and endocrine work group summary. Environ Health Perspect. 2000;108:569–571. doi: 10.1289/ehp.00108s3569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bayley N. Bayley Scales of Infant Development. 2nd ed. San Antonio, TX: Psychological Corporation; 1993. [Google Scholar]
  4. Bell ML, Ebisu K, Belanger K. Ambient air pollution and low birth weight in Connecticut and Massachusetts. Environ Health Perspect. 2007;115:1118–1124. doi: 10.1289/ehp.9759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berhane K, Hauptmann M, Langholz B. Using tensor product splines in modeling exposure-time-response relationships: application to the Colorado Plateau Uranium Miners cohort. Stat Med. 2008;27(26):5484–5496. doi: 10.1002/sim.3354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ha M, Mabuchi K, Sigurdson AJ, Freedman DM, Linet MS, Doody MM, et al. Smoking cigarettes before first childbirth and risk of breast cancer. Am J Epidemiol. 2007;166(1):55–61. doi: 10.1093/aje/kwm045. [DOI] [PubMed] [Google Scholar]
  7. Hallqvist J, Lynch J, Bartley M, Lang T, Blane D. Can we disentangle life course processes of accumulation, critical period and social mobility? An analysis of disadvantaged socio-economic positions and myocardial infarction in the Stockholm Heart Epidemiology Program. Soc Sci Med. 2004;58(8):1555–1562. doi: 10.1016/S0277-9536(03)00344-7. [DOI] [PubMed] [Google Scholar]
  8. Hauptmann M, Wellmann J, Lubin JH, Rosenberg PS, Kreienbrock L. Analysis of exposure-time-response relationships using a spline weight function. Biometrics. 2000;56(4):1105–1108. doi: 10.1111/j.0006-341x.2000.01105.x. [DOI] [PubMed] [Google Scholar]
  9. Hornung RW, Lanphear BP, Dietrich KN. Age of greatest susceptibility to childhood lead exposure: a new statistical approach. Environ Health Perspect. 2009;117:1309–1312. doi: 10.1289/ehp.0800426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Horton NJ, Laird NM, Zahner GE. Use of multiple informant data as a predictor in psychiatric epidemiology. Int J Methods Psychiatr Res. 1999;8(1):6–18. [Google Scholar]
  11. Horton NJ, Lipsitz SR. Review of software to fit generalized estimating equation regression models. Am Stat. 1999;53(2):160–169. [Google Scholar]
  12. Hu H, Téllez-Rojo MM, Bellinger D, Smith D, Ettinger AS, Lamadrid-Figueroa H, et al. Fetal lead exposure at each stage of pregnancy as a predictor of infant mental development. Environ Health Perspect. 2006;114:1730–1735. doi: 10.1289/ehp.9067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. James GM. Generalized linear models with functional predictors. J Roy Stat Soc B. 2002;64:411–432. [Google Scholar]
  14. Kuczmarski R, Ogden C, Guo S, Grummer-Strawn LM, Flegal KM, Mei Z, et al. 2000 CDC Growth Charts for the United States: Methods and Development. Vol. 246. Hyattsville, MD: National Center for Health Statistics; 2002. [PubMed] [Google Scholar]
  15. Langholz B, Thomas D, Xiang A, Stram D. Latency analysis in epidemiologic studies of occupational exposures: application to the Colorado Plateau Uranium Miners Cohort. Am J Ind Med. 1999;35(3):246–256. doi: 10.1002/(sici)1097-0274(199903)35:3<246::aid-ajim4>3.0.co;2-6. [DOI] [PubMed] [Google Scholar]
  16. Litman HJ, Horton NJ, Hernandez B, Laird NM. Estimation of marginal regression models with multiple source predictors. In: Rao CR, Miller JP, Rao DC, editors. Handbook of Statistics. Vol. 27. San Diego, CA: Elsevier; 2007a. pp. 730–746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Litman HJ, Horton NJ, Hernandez B, Laird NM. Incorporating missingness for estimation of marginal regression models with multiple source predictors. Stat Med. 2007b;26(5):1055–1068. doi: 10.1002/sim.2593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Little RJA, Rubin DB. Statistical Analysis with Missing Data. Hoboken, NJ: John Wiley & Sons, Inc; 2002. [Google Scholar]
  19. Lubin JH, Tomasek L, Edling C, Hornung RW, Howe G, Kunz E, et al. Estimating lung cancer mortality from residential radon using data for low exposures of miners. Radiat Res. 1997;147(2):126–134. [PubMed] [Google Scholar]
  20. Meyer U, Yee BK, Feldon J. The neurodevelopmental impact of prenatal infections at different times of pregnancy: the earlier the worse? Neuroscientist. 2007;13(3):241–256. doi: 10.1177/1073858406296401. [DOI] [PubMed] [Google Scholar]
  21. Mohorovic L. First two months of pregnancy—critical time for preterm delivery and low birthweight caused by adverse effects of coal combustion toxics. Early Hum Dev. 2004;80(2):115–123. doi: 10.1016/j.earlhumdev.2004.06.001. [DOI] [PubMed] [Google Scholar]
  22. Parker JD, Woodruff TJ, Basu R, Schoendorf KC. Air pollution and birth weight among term infants in California. Pediatrics. 2005;115(1):121–128. doi: 10.1542/peds.2004-0889. [DOI] [PubMed] [Google Scholar]
  23. Pepe MS, Whitaker RC, Seidel K. Estimating and comparing univariate associations with application to the prediction of adult obesity. Stat Med. 1999;18(2):163–173. doi: 10.1002/(sici)1097-0258(19990130)18:2<163::aid-sim11>3.0.co;2-f. [DOI] [PubMed] [Google Scholar]
  24. Richardson DB. Latency models for analyses of protracted exposures. Epidemiology. 2009;20(3):395–399. doi: 10.1097/EDE.0b013e318194646d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Richardson DB, Wing S. Methods for investigating age differences in the effects of prolonged exposures. Am J Ind Med. 1998;33(2):123–130. doi: 10.1002/(sici)1097-0274(199802)33:2<123::aid-ajim4>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
  26. Schnaas L, Rothenberg SJ, Flores MF, Martinez S, Hernandez C, Osorio E, et al. Reduced intellectual development in children with prenatal lead exposure. Environ Health Perspect. 2006;114:791–797. doi: 10.1289/ehp.8552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Selevan S, Kimmel CA, Mendola P. Identifying critical windows of exposure for children’s health. Epidemiology. 2000;11(4):163. doi: 10.1289/ehp.00108s3451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Slama R, Darrow L, Parker J, Woodruff TJ, Strickland M, Nieuwenhuijsen M, et al. Meeting report: atmospheric pollution and human reproduction. Environ Health Perspect. 2008a;116:791–798. doi: 10.1289/ehp.11074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Slama R, Khoshnood B, Kaminski M. How to control for gestational age in studies involving environmental effects on fetal growth [Letter] Environ Health Perspect. 2008b;116:A284. doi: 10.1289/ehp.11105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Slama R, Morgenstern V, Cyrys J, Zutavern A, Herbarth O, Wichmann HE, et al. Traffic-related atmospheric pollutants levels during pregnancy and offspring’s term birth weight: a study relying on a land-use regression exposure model. Environ Health Perspect. 2007;115:1283–1292. doi: 10.1289/ehp.10047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Téllez-Rojo MM, Hernandez-Avila M, Lamadrid-Figueroa H, Smith D, Hernandez-Cadena L, Mercado A, et al. Impact of bone lead and bone resorption on plasma and whole blood lead levels during pregnancy. Am J Epidemiol. 2004;160(7):668–678. doi: 10.1093/aje/kwh271. [DOI] [PubMed] [Google Scholar]
  32. Wang CY, Wang NS, Wang SJ. Regression analysis when covariates are regression parameters of a random effects model for observed longitudinal measurements. Biometrics. 2000;56(2):487–495. doi: 10.1111/j.0006-341x.2000.00487.x. [DOI] [PubMed] [Google Scholar]
  33. West LJ. Defining critical windows in the development of the human immune system. Hum Exp Toxicol. 2002;21(9–10):499–505. doi: 10.1191/0960327102ht288oa. [DOI] [PubMed] [Google Scholar]

Articles from Environmental Health Perspectives are provided here courtesy of National Institute of Environmental Health Sciences

RESOURCES