Abstract
Observational studies reporting on adjusted associations between childhood body mass index (BMI; weight (kg)/height (m)2) rebound and subsequent cardiometabolic outcomes have often not paid explicit attention to causal inference, including definition of a target causal effect and assumptions for unbiased estimation of that effect. Using data from 649 children in a Boston, Massachusetts–area cohort recruited in 1999–2002, we considered effects of stochastic interventions on a chosen subset of modifiable yet unmeasured exposures expected to be associated with early (<age 4 years) BMI rebound (a proxy measure) on adolescent cardiometabolic outcomes. We considered assumptions under which these effects might be identified with available data. This leads to an analysis where the proxy, rather than the exposure, acts as the exposure in the algorithm. We applied targeted maximum likelihood estimation, a doubly robust approach that naturally incorporates machine learning for nuisance parameters (e.g., propensity score). We found a protective effect of an intervention that assigns modifiable exposures according to the distribution in the observational study of persons without (vs. with) early BMI rebound for fat mass index (fat mass (kg)/ height (m)2; −1.39 units, 95% confidence interval: −1.63, −0.72) but weaker or no effects for other cardiometabolic outcomes. Our results clarify distinctions between algorithms and causal questions, encouraging explicit thinking in causal inference with complex exposures.
Keywords: body mass index, body mass index rebound, cardiometabolic outcomes, causal inference, life course epidemiology, targeted maximum likelihood estimation
Abbreviations
- BMI
body mass index
- CI
confidence interval
- DBP
diastolic blood pressure
- FMI
fat mass index
- HOMA-IR
homeostasis model assessment of insulin resistance
- IPW
inverse probability weighting
- SBP
systolic blood pressure
- SD
standard deviation
- TMLE
targeted maximum likelihood estimation
Typically, body mass index (BMI; calculated as weight (kg)/height (m)2) increases during the first year of life and decreases to a nadir during childhood, before finally rising again (1). This second rise in BMI in childhood, known as the BMI rebound, typically occurs between 4 and 6 years of age (2, 3). Observational studies have shown strong associations between an early BMI rebound in childhood (i.e., before 4 years of age) and obesity, type 2 diabetes, and the metabolic syndrome in adolescence and adulthood (4–8). Despite adjustment for covariates, these studies have not explicitly considered causal inference, including causal-effect definitions and assumptions sufficient for unbiasedly estimating that effect with available data. An “effect of early BMI rebound,” with “early BMI rebound” being the ostensible “exposure” (“treatment”), is not sufficiently well-defined (9, 10) and may not be the effect of interest. Early BMI rebound may result from many underlying processes, including modifiable (e.g., diet, physical activity (11–13)) and nonmodifiable (e.g., genetics (14)) factors. Investigators reporting “adjusted associations” between early BMI rebound and health outcomes may in fact be interested in reporting effects of interventions on 1 or more of these processes that may not fully control early BMI rebound. In many cases, data on even modifiable processes are limited and difficult to measure in existing data sets.
Here we consider the problem of using existing observational data to estimate effects of particular stochastic interventions (which assign from a specified distribution) on 1 or more unmeasured yet modifiable exposures (e.g., diet and physical activity) on mean adiposity, blood pressure, insulin resistance, and global metabolic risk in early adolescence. These interventions do not guarantee control of early BMI rebound, a proxy for exposure. We consider a set of assumptions under which these effects may still be identified when the proxy is measured but the exposures are not. We rely on the “usual” consistency, exchangeability, and positivity assumptions for stochastic interventions defined with respect to the unmeasured exposures. Failure to measure exposure also comes with the price of requiring an additional strong assumption that the proxy is a particular coarsening of the exposures. Under this proxy separation assumption, the proxy acts as the exposure in the analytical procedure even though it is not the conceptual exposure. This provides an alternative interpretation of effect estimates from such an analysis to effects of interventions on the “versions of treatment” (15). Our alternative formulation is similar to this latter interpretation, including reliance on a coarsening assumption, with the distinction of emphasis that the proxy is not a treatment but a measured variable to be leveraged for identification.
We illustrate these ideas using data from Project Viva, a prebirth cohort study. To obtain effect estimates, we applied targeted maximum likelihood estimation (TMLE) (16), a doubly robust approach that naturally incorporates machine learning for nuisance parameters (e.g., the propensity score) in order to mitigate reliance on parametric models (17).
METHODS
The observational study: Project Viva
Project Viva is an ongoing study of pre- and perinatal influences on maternal, fetal, and child health (18). Briefly, we recruited eligible pregnant women from obstetrical practices at Atrius Harvard Vanguard Medical Associates in eastern Massachusetts during their first prenatal appointment between April 1999 and November 2002. Mothers provided written informed consent at enrollment and follow-up visits, and children provided verbal assent at midchildhood (ages 7–10 years) and early adolescent (ages 12–16 years) research visits. The Institutional Review Board of Harvard Pilgrim Health Care Institute (Boston, Massachusetts) approved the project in line with ethical standards established by the Declaration of Helsinki.
The analytical cohort and BMI rebound
Previous studies have defined early BMI rebound as a rise in BMI that happens before 4 years of age, after an initial decline from infancy (4–7). Given that few children had 3 or more measurements of BMI during this early period, we restricted the study population to children with BMI measurements indicating a decline of at least 0.1 BMI units between infancy (ages 6–12 months) and early childhood (ages 2–3.9 years). We defined BMI rebound as an increase of at least 0.1 units between early childhood and midchildhood (ages 4–8 years), following previous studies (19, 20). Below we refer to this as an indicator of “early” BMI rebound, and in the Discussion we explicitly consider the error associated with this interval measure of BMI rebound timing and tradeoffs with latent class modeling.
We also restricted the analytical cohort to children with complete data on baseline covariates (see Table 1 and below). We have previously described the methods of measurement of these covariates (21–23). Of 2,128 live singleton births, 1,183 children had BMI measurements in infancy (median age, 6.2 months; range, 5.0–11.2) and early childhood (median age, 37.5 months; range, 24.0–47.8) indicating a BMI decline after infancy. Of these children, 1,002 had complete data on baseline covariates and 649 had a subsequent BMI measurement at age 48 months (4 years) or later (median age, 49.0 months; range, 48.0–94.9). We obtained measures of weight and length/height at research visits during infancy, early childhood, and midchildhood and from medical records, where pediatric clinics recorded weight and length/height at each well-child visit (21–23). We used both research and clinical measures of weight and length/height to calculate BMI.
Table 1.
Covariate | Cardiometabolic Outcome | |||
---|---|---|---|---|
Fat Mass Index a | Blood Pressure z Score | HOMA-IR | Metabolic Risk Score | |
Continuous variables | ||||
Maternal prepregnancy BMIb | Y | Y | Y | Y |
Gestational weight gain | Y | Y | N | N |
Paternal BMI | Y | N | Y | Y |
Birth weight for gestational age | Y | Y | N | N |
Gestational age at delivery | N | Y | N | N |
BMI in early childhoodc | Y | Y | Y | Y |
Categorical variables | ||||
Maternal educational level (non–college-educated vs. college-educated) | Y | Y | Y | Y |
Smoking history (never smoked vs. smoked before pregnancy vs. smoked during pregnancy) | Y | Y | Y | Y |
Glucose tolerance (normal vs. abnormal) | N | N | Y | Y |
Pregnancy hypertension (no vs. yes) | N | Y | N | N |
Paternal educational level (non–college-educated vs. college-educated) | Y | N | Y | Y |
Breastfeeding initiation (no vs. yes) | Y | N | N | N |
Child’s sex (male vs. female) | Y | N | Y | N |
Child’s race/ethnicity (White vs. non-White) | Y | Y | Y | Y |
Abbreviations: BMI, body mass index; HOMA-IR, homeostasis model assessment of insulin resistance; N, no; Y, yes.
a Fat mass (kg)/height (m)2.
b Weight (kg)/height (m)2.
c Early childhood was the baseline measure.
Adolescent adiposity and cardiometabolic outcomes
In early adolescence (median age, 12.9 years; interquartile range, 12.5–13.6), trained research assistants measured waist circumference using a nonstretchable tape, measured fat mass using foot-to-foot bioimpedance, and measured systolic blood pressure (SBP) and diastolic blood pressure (DBP) using calibrated automated oscillometric monitors according to standardized protocols (21). We calculated fat mass index (FMI) as fat mass (in kilograms) divided by squared height (in meters). We derived age-, sex-, and height-specific SBP and DBP z scores according to the American Academy of Pediatrics reference values for children and adolescents (24). Trained technicians also collected fasting blood specimens, which were used to measure fasting glucose, insulin, high-density lipoprotein cholesterol, and triglyceride levels according to standard protocols. Following previous studies, we calculated insulin resistance using homeostasis model assessment of insulin resistance (HOMA-IR) (8, 21). We used external references to calculate age- and sex-specific z scores for waist circumference, SBP, HOMA-IR, triglycerides, and high-density lipoprotein cholesterol (24–27). Subsequently, we derived a metabolic risk score as the average of z scores for waist circumference, HOMA-IR, triglycerides, high-density lipoprotein cholesterol (scaled inversely), and SBP.
The causal question and target trial
Consider the causal effect on an adolescent outcome (Y) of implementing 2 different interventions in the study population of children with measured BMI decline between the ages of 6 and 11.9 months (<1 year). Beginning at the time of this measurement (baseline) and ending approximately 4 years later, define intervention 1 for a given individual i in this population as follows: On each day k = 0, …, K of this intervention period, assign individual i values of a selected set of modifiable characteristics (minutes of physical activity and screen time (television only) and servings of sugar-sweetened beverages) by randomly assigning these values from a particular distribution. No intervention is made beyond day K = 4 × 365.
For R, an indicator of not experiencing BMI rebound by the end of the intervention period (the “early” period), we will choose this intervention distribution to be the distribution of exposures on day k (denoted by a vector Ak containing values for physical activity, screen time, and sugar-sweetened beverage consumption on that day) among Project Viva participants meeting study-population criteria who did not experience BMI rebound during this period (R = 1) and who have the same values of the measured baseline covariates (L) and, for k > 0, exposure history () as subject i. We define intervention 2 in the same way, but with the assignment distribution instead defined by children with BMI rebound (R = 0). We will refer to these interventions as proxy representative interventions. The proxy serves only to define the intervention distributions; there is no guarantee or intent of intervention on the proxy. Similar interventions have been previously considered (15, 28–30). (Also see Web Appendix 1, available at https://doi.org/10.1093/aje/kwab029.)
Ideally, we would conduct a trial to estimate this causal effect. This target trial (31) would recruit a large random sample of children at the time at which they met the criteria for the study population and randomize them to either intervention 1 or intervention 2. In an ideal execution of this trial, with perfect adherence to the daily assigned exposure values and no loss to follow-up, an unbiased estimate of this effect could be made by calculating the difference between outcome means in each study arm. These interventions are complicated by the fact that the distribution of the exposure assignment is unknown. However, in principle, it could be estimated prior to the trial in an observational study that measured the covariates L and these daily exposures during this early period in a comparable population and treatment assigned in the trial using these estimates.
Our choice of these particular interventions is convenient in that, as discussed below, we may achieve or approximately achieve identification of these intervention effects even when the data available for estimating them do not include measures of the true exposure(s) of interest for any individual. This provides a way forward under explicit reasoning in such settings that may be common to many social and life-course epidemiologic studies. It will also help shed additional light on the interpretation of common approaches to data analysis in these settings. Also driven by convenience is our choice to define the intervention rules to be dependent only on a single proxy measure (an indicator of BMI rebound anywhere in this “early” period) and baseline covariates because time-updated measures in this period were not available in our study. In Web Appendix 1, we consider more general extensions.
Estimating the effect of proxy representative interventions in observational studies when exposure is measured
We now consider assumptions that allow unbiased estimation of this causal effect in an observational study when the variables L, , R, and Y are measured, with being the vector of time-varying exposures. Under the untestable assumptions discussed below, the following function of these variables,
(1) |
identifies the outcome mean value that would have been present had all individuals followed a proxy representative intervention from the target trial on the exposures , where denotes the mean of Y within a particular level of the exposures and covariates, denotes the probability of having exposure level Aj = aj among children with proxy status R = r and in a particular exposure and baseline covariate stratum, and denotes the probability of having covariate level L = l. For simplicity, equation 1 considers discrete variables. More generally, the summation symbols over all levels of exposure and covariates can be replaced with integrals. The difference between equation 1 indexed by r = 1 and equation 1 indexed by r = 0, in turn, identifies the causal effect of the proxy representative interventions defined above. The function shown in equation 1 is an example of Robins’ g-formula (32), characterized by a particular stochastic intervention (30, 33, 34). Assumptions that equate the g-formula with the outcome mean under this intervention include the particular versions of exchangeability that are conditional on L (the assumed measured “confounders”), positivity, and consistency, which are reviewed in Web Appendix 1 (30, 32).
The second term in equation 1 quantifies the exposure distribution conditional on the assumed measured confounders but under the intervention. For a proxy representative intervention, this coincides with being equal to the exposure distribution among children with R = r for a level of the measured confounders in the observational study. The first term in equation 1, , quantifies the outcome mean conditional on exposure and measured confounders. This term is not conditioned on R, because we assume that this postexposure variable is not needed to ensure exchangeability (also see Web Appendix 1).
This exchangeability assumption will hold under a counterfactual causal model represented by a causal directed acyclic graph in which there are no unblocked backdoor paths between each Ak and Y conditional on past exposure and baseline covariates only (and is therefore quite strong) (35, 36). The causal directed acyclic graph in Figure 1 represents a simplified assumption on the data-generating process where Ak contains 2 exposures and K = 2. U0 represents an unmeasured baseline characteristic (e.g., genes) and U1 an interim characteristic (e.g., interim weight change). Exchangeability would hold under the absence of any arrows from U0 or U1 to any exposure but fails with the presence of the dotted arrows. In Web Appendix 1, we formally review these identifying conditions and generalize ideas in the main text to the case where time-varying covariates such as U1 are measured. In our application, these covariates were not measured (e.g., the majority of children had, at most, 1 measure of weight during the exposure period of interest).
Positivity requires that for both intervention 1 and intervention 2, there are no levels of L in the observational study for which a level of possible under intervention is impossible in the observational study. Effects of stochastic interventions in general are less susceptible to positivity violations than deterministic interventions (30). Finally, the consistency assumption in this setting is reasonable as long as there are not multiple versions of exposure or these versions do not affect the outcome (9, 10). Because we have defined exposure only in terms of modifiable characteristics, this assumption may be reasonable.
WHEN EXPOSURE IS NOT MEASURED
Even if the exchangeability, consistency, and positivity conditions hold with respect to proxy representative interventions, the quantity in equation 1 clearly cannot be estimated when exposure is not measured without additional assumptions. Let denote any value of Ak possibly occurring in the study population had Ak been assigned according to a proxy representative intervention indexed by a level of R = r and consider the following assumption:
Proxy separation: Within each covariate level L = l, if = , then R = r.
To understand how this assumption could be violated, suppose that Ak includes only daily screen time and some children in the Project Viva study population who experienced early BMI rebound watched 6 hours of television each day in a particular stratum of baseline covariates L (Table 1). Proxy separation would be violated if there were also some children in this stratum who watched 6 hours of television each day and did not experience early BMI rebound. Under proxy separation, the quantity calculated in equation 1 equals
(2) |
(see Web Appendix 1 for proof).
If were measured, proxy separation could be evaluated in the data. However, paradoxically, in that case it would not be needed because the quantity in equation 1 could be estimated directly. Exchangeability, consistency, and positivity, coupled with proxy separation, are strong assumptions and, at best, hold approximately. When they do, the quantity in equation 2 equals the outcome mean under a proxy representative intervention from the target trial indexed by proxy level r (15). We would expect proxy separation to be more reasonable with increasing dimensions of exposure Ak (provided that additional components are causes, or even only proxies of causes, of BMI rebound; see Web Figures 1–3 in Web Appendix 1) but at the cost of stronger exchangeability and positivity assumptions to still interpret the quantity derived by equation 2 in terms of proxy representative interventions on this higher-dimensional set of exposures.
In Web Appendix 1 we consider extensions that rely on measured time-varying proxies and time-varying confounders. Note that the positivity condition needed for equation 1, coupled with proxy separation, implies the more familiar positivity condition that P(R = r|L) > 0 for any possible level of L.
In practice, even the quantity in equation 2 is not possible to estimate without additional assumptions given missing data on L, R, and Y. In the next section, we consider the computationally straightforward estimator of
(3) |
where all terms of equation 3 are implicitly restricted to individuals with complete data on L and R and δ is an indicator of censoring due to missing Y.
The function in equation 3 is equal to that in equation 2 when missing data are absent. When missing data are present, a contrast in equation 3 identifies the effect of interest from the target trial where all interventions include an additional intervention, “eliminate censoring,” under additional exchangeability, positivity, and consistency assumptions with respect to and also under the assumption that the functions 1 and 2 in the original target population equal that in the subpopulation with complete data on L and R. Less restrictive assumptions in this case may be considered with respect to missing data but admitting more complex identifying functions and estimation procedures that incorporate measured time-varying covariates when they are available.
Statistical analysis
Several methods may be used to estimate the quantity in equation 3 with this sample, including inverse probability weighting (IPW) (37), parametric g-computation (32, 38), and TMLE (16). Even though the proxy R is not the exposure, the above results allow that an implementation of any of these methods for estimating the effect of a time-fixed static deterministic intervention on a binary exposure can be used to estimate the quantity in equation 3 by algorithmically treating R as this exposure (15, 29, 39). For our primary analysis, we applied TMLE, an approach to estimation of the function in equation 3 that begins with an initial estimate of E(Y|R = r, L, δ = 0), followed by a “targeting” step that updates this initial estimate with a weight that depends on estimates of P(R = r|L) and P(δ = 0|R = r, L) (16, 40). The algorithm is implemented in the R package tmle (R Foundation for Statistical Computing, Vienna, Austria) (41). We review details of the estimation procedure in Web Appendix 2.
This approach is doubly robust in that it consistently estimates the quantity in equation 3 if either E(Y|R = r, L, δ = 0) or both P(R = r|L) and P(δ = 0|R = r, L) are consistently estimated, all implicitly restricted here to children with complete data on R and L. Note that P(R = r|L) can be operationally referred to as a “propensity score” even though R is not “treatment” (42). Estimates of both sets of nuisance parameters may be easily computed using standard parametric models (e.g., linear or logistic regression). However, to avoid reliance on a parametric model for either set, we used SuperLearner (R, version 3.6.1), which generates estimates on the basis of a weighted combination of algorithms selected by cross-validation (43). We selected algorithms used in prior studies that cover a broad range: additive generalized linear models, generalized linear models with interactions, stepwise models, penalized regression models (i.e., the least absolute shrinkage and selection operator (LASSO)), and Bayesian additive regression trees (44, 45).
To assess robustness to the choice of estimation method, we alternatively implemented TMLE with parametric models for nuisance parameters. We also estimated the quantity in equation 3 using parametric g-computation and IPW (see details in Web Appendix 2). For all methods, we used a nonparametric bootstrap with 1,000 bootstrap samples to obtain 95% confidence intervals. For all analyses, we used STATA 16 (StataCorp LLC, College Station, Texas) and R, version 3.6.1 (the tmle and SuperLearner packages). Software code is provided in Web Appendices 3–8.
Data availability
All data described in this article are available from the authors upon request.
RESULTS
Of the 649 children in the analysis sample, the 406 children without early BMI rebound (R = 1) were more likely to have mothers (BMI 24.4 vs. 25.6) or fathers (BMI 26.0 vs. 27.4) with lower BMI, were more likely to be male (56% vs. 46%), had higher birth weight-for-gestational-age z scores (0.3 standard deviation (SD) units vs. 0.2 SD units), and had higher BMI in early childhood (BMI 16.5 vs. 16.1) than the 243 children with early BMI rebound (R = 0) (Table 2). In Web Table 1, we present empirical comparisons of the characteristics of children excluded from the analytical cohort with those of the included children. The distribution of the stabilized weights for all approaches in estimating the weight denominator showed no evidence of positivity violations (Web Table 2).
Table 2.
Characteristic |
Early Rebound
(n = 243) |
No Early Rebound
(n = 406) |
||
---|---|---|---|---|
% | Mean (SD) | % | Mean (SD) | |
Maternal factors | ||||
Prepregnancy BMIa | 25.6 (5.6) | 24.4 (4.8) | ||
College education | 68 | 73 | ||
Smoking history | ||||
Never smoked | 71 | 73 | ||
Smoked before pregnancy | 19 | 21 | ||
Smoked during pregnancy | 9 | 6 | ||
Gestational weight gain, kg | 15.5 (5.9) | 15.4 (5.3) | ||
Abnormal glucose tolerance (IH, GIGT, GDM) | 18 | 17 | ||
Pregnancy hypertension (GH, PE, CH) | 10 | 9 | ||
Paternal BMI | 27.4 (4.4) | 26.0 (3.7) | ||
Paternal college education | 65 | 67 | ||
Child factors | ||||
Birth weight-for-gestational-age z score, SD units | 0.2 (0.9) | 0.3 (1.0) | ||
Gestational age at delivery, weeks | 39.5 (1.6) | 39.6 (1.6) | ||
No breastfeeding initiation | 10 | 11 | ||
Male sex | 46 | 56 | ||
White race/ethnicity | 67 | 68 | ||
Age in early childhood, months | 36.8 (5.5) | 37.5 (3.9) | ||
BMI in early childhood | 16.1 (1.3) | 16.5 (1.2) |
Abbreviations: BMI, body mass index; CH, chronic hypertension; GDM, gestational diabetes mellitus; GH, gestational hypertension; GIGT, gestational impaired glucose tolerance; IH, isolated hyperglycemia; PE, preeclampsia; SD, standard deviation.
a Weight (kg)/height (m)2.
Using TMLE with SuperLearner for nuisance parameter estimation, our estimates of the effects of a proxy representative intervention indexed by R = 1 (a stochastic intervention that assigns individual values for daily physical activity, screen time, and sugar-sweetened beverage consumption from the joint distribution in Project Viva among children who did not experience BMI rebound with that individual’s baseline confounder values) versus R = 0 (the same values but taken from the distribution among those who did experience BMI rebound) were −1.39 units (95% confidence interval (CI): −1.63, −0.72) for FMI, 0.07 SD units (95% CI: −0.09, 0.21) for SBP z score, 0.08 SD units (95% CI: −0.07, 0.17) for DBP z score, −0.09 units (95% CI: −0.20, 0.06) for HOMA-IR, and −0.06 SD units (95% CI: −0.15, 0.04) for metabolic risk score (Table 3).
Table 3.
Estimation Method |
Fat Mass Index
a
,
b
(n = 500) |
SBP z Score
c
,SD units
(n = 498) |
DBP z Score
c
,SD units
(n = 498) |
HOMA-IR
d
,units
(n = 306) |
Metabolic Risk Score
e
,SD units
(n = 302) |
|||||
---|---|---|---|---|---|---|---|---|---|---|
β | 95% CI | β | 95% CI | β | 95% CI | β | 95% CI | β | 95% CI | |
Unadjusted results | −1.72 | −2.26, −1.18 | 0.10 | −0.04, 0.24 | 0.08 | −0.03, 0.19 | −0.14 | −0.28, 0.00 | −0.06 | −0.15, 0.03 |
Least-squares regressionf | −1.29 | −1.75, −0.84 | 0.10 | −0.05, 0.24 | 0.09 | −0.02, 0.21 | −0.09 | −0.22, 0.05 | −0.06 | −0.16, 0.03 |
TMLE | ||||||||||
Parametric modelg | −1.41 | −1.86, −0.94 | 0.05 | −0.12, 0.23 | 0.07 | −0.04, 0.22 | −0.08 | −0.21, 0.06 | −0.07 | −0.16, 0.05 |
SuperLearner | ||||||||||
Version 1h | −1.44 | −1.71, −0.72 | 0.05 | −0.12, 0.21 | 0.08 | −0.08, 0.17 | −0.09 | −0.22, 0.07 | −0.06 | −0.16, 0.05 |
Version 2i | −1.39 | −1.63, −0.72 | 0.07 | −0.09, 0.21 | 0.08 | −0.07, 0.17 | −0.09 | −0.20, 0.06 | −0.06 | −0.15, 0.04 |
G-computation | ||||||||||
Parametric model | −1.31 | −1.75, −0.86 | 0.07 | −0.08, 0.22 | 0.11 | −0.01, 0.24 | −0.07 | −0.20, 0.06 | −0.06 | −0.15, 0.05 |
SuperLearner | ||||||||||
Version 1 | −1.25 | −1.59, −0.73 | 0.07 | −0.13, 0.22 | 0.10 | −0.06, 0.21 | −0.09 | −0.22, 0.04 | −0.06 | −0.16, 0.05 |
Version 2 | −1.21 | −1.56, −0.69 | 0.07 | −0.08, 0.21 | 0.07 | −0.03, 0.20 | −0.08 | −0.21, 0.05 | −0.05 | −0.14, 0.05 |
Stabilized IPW | ||||||||||
Parametric model | −1.34 | −1.88, −0.84 | 0.06 | −0.11, 0.22 | 0.09 | −0.04, 0.22 | −0.09 | −0.23, 0.05 | −0.06 | −0.17, 0.05 |
SuperLearner | ||||||||||
Version 1 | −1.43 | −1.75, −0.81 | 0.07 | −0.08, 0.23 | 0.08 | −0.05, 0.20 | −0.09 | −0.20, 0.06 | −0.06 | −0.14, 0.06 |
Version 2 | −1.45 | −1.79, −0.86 | 0.07 | −0.08, 0.24 | 0.09 | −0.04, 0.20 | −0.09 | −0.21, 0.05 | −0.06 | −0.15, 0.04 |
Abbreviations: BMI, body mass index; CI, confidence interval; DBP, diastolic blood pressure; HOMA-IR, homeostasis model assessment of insulin resistance; IPW, inverse probability weighting; SBP, systolic blood pressure; SD, standard deviation; TMLE, targeted maximum likelihood estimation.
a Fat mass (kg)/height (m)2.
b Adjusted for maternal smoking status, educational level, BMI (weight (kg)/height (m)2), paternal educational level, paternal BMI, total gestational weight gain, birth weight for gestational age, breastfeeding initiation, child’s sex, child’s race/ethnicity, and BMI in early childhood.
c Adjusted for maternal smoking status, educational level, BMI, pregnancy hypertension, total gestational weight gain, birth weight for gestational age, gestational age, child’s race/ethnicity, and BMI in early childhood.
d Adjusted for maternal smoking status, educational level, BMI, glucose tolerance, paternal educational level, paternal BMI, child’s sex, child’s race/ethnicity, and BMI in early childhood.
e Adjusted for maternal smoking status, educational level, BMI, glucose tolerance, paternal educational level, paternal BMI, child’s race/ethnicity, and BMI in early childhood.
f Parametric model with no interaction terms for nuisance parameters.
g SuperLearner with prediction algorithms for generalized linear models, interaction terms, and stepwise modeling for nuisance parameters.
h SuperLearner with prediction algorithms for generalized linear models, interaction terms, stepwise modeling, penalized regression models, Bayesian additive regression trees, and generalized additive models for nuisance parameters.
i Estimates of the regression coefficient on R from a linear regression model where the outcome (Y) is regressed on the proxy R and covariates L.
When using parametric models for nuisance parameter estimation using TMLE, the estimates were −1.41 (95% CI: −1.86, −0.94) for FMI, 0.05 SD units (95% CI: −0.12, 0.23) for SBP z score, 0.07 SD units (95% CI: −0.04, 0.22) for DBP z score, −0.08 units (95% CI: −0.21, 0.06) for HOMA-IR, and −0.07 SD units (95% CI: −0.16, 0.05) for metabolic risk score. Results were not substantively different when using parametric g-computation or IPW approaches, with either parametric estimation of nuisance parameters or SuperLearner (Table 3). We also conducted a simple IPW sensitivity analysis wherein individuals were censored upon missing R, with weights incorporating these censoring weights as a function of L. Results were not substantially different (Web Table 3).
DISCUSSION
We considered assumptions under which causal effects of interventions on a set of unmeasured exposures can be estimated in observational studies when only a proxy is measured. We illustrated these ideas in a study where early BMI rebound was used as a proxy for modifiable exposures (screen time, physical activity, and features of diet). Using TMLE with SuperLearner, we estimated a protective effect on FMI of interventions that assign these exposures to an individual according to their factual distribution in the study population among children without (vs. with) early BMI rebound and the same past exposure and covariates. Effect estimates were weaker for other outcomes. The use of non–doubly robust methods (i.e., parametric g-computation and IPW) and/or parametric estimates of nuisance parameters did not substantively change our conclusions.
Effect estimates were much stronger for FMI (−1.39 units) than for other cardiometabolic markers such as SBP z score (0.07 SD units) and metabolic risk score (−0.06 SD units). To put these estimates on the same scale, a change in FMI of 1.39 units is equivalent to a change of 0.40 SD units, which is thus reasonably stronger than estimates for other cardiometabolic markers. This finding is consistent with previous studies that have found a higher rate of fat gain and disproportionately high increases in fat mass among children with an early rebound than in those with a late rebound (46, 47). It is possible that interventions on modifiable behaviors associated with early BMI rebound might have a stronger and more direct influence on true adiposity-related outcomes (e.g., FMI) than on other cardiometabolic markers (e.g., SBP), which has been suggested by previous studies (13, 48). Further, their influence on other cardiometabolic markers, such as insulin resistance, may require more time to develop, since insulin resistance is primarily driven by visceral fat accumulation (49).
Researchers in several studies have estimated associations between early BMI rebound and later cardiometabolic risk using latent class regression methods with adjustment for covariates (4–8). These studies have relied heavily on parametric regression models and have also avoided explicit attention to causal inference. Recently some authors have interpreted the target effect of latent class models as an effect of an assumed latent class variable (50, 51). Drawbacks of this conceptual framework include the facts that a latent class variable may not exist or cannot be articulated, interventions on such a variable are not well-defined (particularly when the variable does not exist), and, in turn, effect estimates can never (even in principle) be confirmed in a real-world trial. However, the assumption of a latent class model allows the derivation of BMI trajectories such that rebound timing can be “known” even when only interval measures of BMI changes are available. This is at the expense of model misspecification bias. By avoiding these assumptions, we relied on a probably mismeasured indicator of BMI rebound such that children classified as having an “early” rebound may indeed have had this rebound sometime after age 4 years. A possible source of bias due to this mismeasurement is violation of the proxy separation assumption.
In contrast to interventions on unknown latent class variables, effects of proxy representative interventions can, in principle, be studied in a real-world trial. Further, formal sensitivity analyses of both the proxy separation assumption and the exchangeability assumption can be developed for particular choices of exposure. Finally, motivating estimators from an explicit causal question rather than a parametric regression model allowed the use of modern, robust estimation procedures.
Our application relied on several strong assumptions, with limitations discussed throughout. Beyond this, our findings may not be generalizable to other populations from different settings, since a majority of mothers in our study were White and college-educated. However, our approach allowed explicit definition of a target effect coinciding with effects of sustained interventions on modifiable exposures of key interest to obesity researchers. Our results highlight the crucial distinctions between questions, assumptions, and statistical algorithms in causal inference with observational data. Articulating clearly what we hope to estimate with an analysis allows explicit reasoning about assumptions, open debate about question value and assumption viability, and ultimately improved design of future studies.
Supplementary Material
ACKNOWLEDGMENTS
Author affiliations: Division of Chronic Disease Research Across the Lifecourse, Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, Massachusetts, United States (Izzuddin M. Aris, Marie-France Hivert, Emily Oken, Jessica G. Young); Department of Epidemiology, T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, United States (Aaron L. Sarvet, Jessica G. Young); Swiss Federal Institute of Technology Lausanne, Lausanne, Switzerland (Mats J. Stensrud); Division of Research, Kaiser Permanente Northern California, Oakland, California, United States (Romain Neugebauer); Department of Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Republic of Singapore (Ling-Jun Li); Diabetes Unit, Massachusetts General Hospital, Boston, Massachusetts, United States (Marie-France Hivert); and Department of Nutrition, T.H. Chan School of Public Health, Harvard University, Boston, Massachusetts, United States (Emily Oken).
All authors contributed equally to this work.
This work was supported by the National Institutes of Health (grants R01 HD034568 and UH3 OD023286).
We are very grateful to the participants and staff of Project Viva.
Conflict of interest: none declared.
REFERENCES
- 1. Wen X, Kleinman K, Gillman MW, et al. Childhood body mass index trajectories: modeling, characterizing, pairwise correlations and socio-demographic predictors of trajectory characteristics. BMC Med Res Methodol. 2012;12:Article 38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Rolland-Cachera MF, Deheeger M, Maillot M, et al. Early adiposity rebound: causes and consequences for obesity in children and adults. Int J Obes (Lond). 2006;30(suppl 4):S11–S17. [DOI] [PubMed] [Google Scholar]
- 3. Taylor RW, Grant AM, Goulding A, et al. Early adiposity rebound: review of papers linking this to subsequent obesity in children and adults. Curr Opin Clin Nutr Metab Care. 2005;8(6):607–612. [DOI] [PubMed] [Google Scholar]
- 4. Eriksson JG, Forsén T, Tuomilehto J, et al. Early adiposity rebound in childhood and risk of type 2 diabetes in adult life. Diabetologia. 2003;46(2):190–194. [DOI] [PubMed] [Google Scholar]
- 5. González L, Corvalán C, Pereira A, et al. Early adiposity rebound is associated with metabolic risk in 7-year-old children. Int J Obes (Lond). 2014;38(10):1299–1304. [DOI] [PubMed] [Google Scholar]
- 6. Hughes AR, Sherriff A, Ness AR, et al. Timing of adiposity rebound and adiposity in adolescence. Pediatrics. 2014;134(5):e1354–e1361. [DOI] [PubMed] [Google Scholar]
- 7. Koyama S, Ichikawa G, Kojima M, et al. Adiposity rebound and the development of metabolic syndrome. Pediatrics. 2014;133(1):e114–e119. [DOI] [PubMed] [Google Scholar]
- 8. Aris IM, Rifas-Shiman SL, Li L-J, et al. Patterns of body mass index milestones in early life and cardiometabolic risk in early adolescence. Int J Epidemiol. 2019;48(1):157–167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Hernán MA. Does water kill? A call for less casual causal inferences. Ann Epidemiol. 2016;26(10):674–680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Hernán MA, Taubman SL. Does obesity shorten life? The importance of well-defined interventions to answer causal questions. Int J Obes (Lond). 2008;32(suppl 3):S8–S14. [DOI] [PubMed] [Google Scholar]
- 11. Aris IM, Rifas-Shiman SL, Li LJ, et al. Pre-, perinatal, and parental predictors of body mass index trajectory milestones. J Pediatr. 2018;201:69–77.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. O’Connor EA, Evans CV, Burda BU, et al. Screening for obesity and intervention for weight management in children and adolescents: evidence report and systematic review for the US Preventive Services Task Force. JAMA. 2017;317(23):2427–2444. [DOI] [PubMed] [Google Scholar]
- 13. Padmapriya N, Aris IM, Tint MT, et al. Sex-specific longitudinal associations of screen viewing time in children at 2–3 years with adiposity at 3–5 years. Int J Obes (Lond). 2019;43(7):1334–1343. [DOI] [PubMed] [Google Scholar]
- 14. Couto Alves A, De Silva NMG, Karhunen V, et al. GWAS on longitudinal growth traits reveals different genetic factors influencing infant, child, and adult BMI. Sci Adv. 2019;5(9):eaaw3095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. VanderWeele TJ, Hernán MA. Causal inference under multiple versions of treatment. J Causal Inference. 2013;1(1):1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. van der Laan MJ, Rubin D. Targeted maximum likelihood learning. Int J Biostat. 2006;2(1):Article 11. [Google Scholar]
- 17. Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61(4):962–973. [DOI] [PubMed] [Google Scholar]
- 18. Oken E, Baccarelli AA, Gold DR, et al. Cohort profile: Project Viva. Int J Epidemiol. 2015;44(1):37–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Dorosty AR, Emmett PM, Cowin S, et al. Factors associated with early adiposity rebound. ALSPAC Study Team. Pediatrics. 2000;105(5):1115–1118. [DOI] [PubMed] [Google Scholar]
- 20. Kroke A, Hahn S, Buyken AE, et al. A comparative evaluation of two different approaches to estimating age at adiposity rebound. Int J Obes (Lond). 2006;30(2):261–266. [DOI] [PubMed] [Google Scholar]
- 21. Li LJ, Rifas-Shiman SL, Aris IM, et al. Leptin trajectories from birth to mid-childhood and cardio-metabolic health in early adolescence. Metabolism. 2019;91(18):30–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Perng W, Hajj H, Belfort MB, et al. Birth size, early life weight gain, and midchildhood cardiometabolic health. J Pediatr. 2016;173:122–130.e1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Perng W, Rifas-Shiman SL, Kramer MS, et al. Early weight gain, linear growth, and mid-childhood blood pressure: a prospective study in Project Viva. Hypertension. 2016;67(2):301–308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Flynn JT, Kaelber DC, Baker-Smith CM, et al. Clinical practice guideline for screening and management of high blood pressure in children and adolescents. Pediatrics. 2017;140(3):e20171904. [DOI] [PubMed] [Google Scholar]
- 25. Fryar CD, Gu Q, Ogden CL, et al. Anthropometric reference data for children and adults: United States, 2011–2014. Vital Health Stat 3. 2016;(39):1–46. [PubMed] [Google Scholar]
- 26. Lee JM, Okumura MJ, Davis MM, et al. Prevalence and determinants of insulin resistance among U.S. adolescents: a population-based study. Diabetes Care. 2006;29(11):2427–2432. [DOI] [PubMed] [Google Scholar]
- 27. McQuillan GM, McLean JE, Chiappa M, et al. National Health and Nutrition Examination Survey Biospecimen Program: NHANES III (1988–1994) and NHANES 1999–2014. Vital Health Stat 2. 2015;(170):1–14. [PubMed] [Google Scholar]
- 28. Jackson JW, VanderWeele TJ. Decomposition analysis to identify intervention targets for reducing disparities. Epidemiology. 2018;29(6):825–835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Stitelman OM, Hubbard AE, Jewell NP. The impact of coarsening the explanatory variable of interest in making causal inferences: implicit assumptions behind dichotomizing variables. (UC Berkeley Division of Biostatistics Working Paper Series, Working Paper 264). Berkeley, CA: University of California, Berkeley; 2010. [Google Scholar]
- 30. Young JG, Hernán MA, Robins JM. Identification, estimation and approximation of risk under interventions that depend on the natural value of treatment using observational data. Epidemiol Methods. 2014;3(1):1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Hernán MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016;183(8):758–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Robins J. A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Math Model. 1986;7(9):1393–1512. [Google Scholar]
- 33. Diaz I, van der Laan MJ. Assessing the causal effect of policies: an example using stochastic interventions. Int J Biostat. 2013;9(2):161–174. [DOI] [PubMed] [Google Scholar]
- 34. Kennedy EH. Nonparametric causal effects based on incremental propensity score interventions. J Am Stat Assoc. 2019;114(526):645–656. [Google Scholar]
- 35. Robins J, Richardson T. Alternative graphical causal models and the identification of direct effects. In: Shrout P, Keyes K, Ornstein K, eds. Causality and Psychopathology: Finding the Determinants of Disorders and Their Cures. New York, NY: Oxford University Press; 2010:103–158. [Google Scholar]
- 36. Pearl J. Causal diagrams for empirical research. Biometrika. 1995;82(4):669–688. [Google Scholar]
- 37. Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–560. [DOI] [PubMed] [Google Scholar]
- 38. Snowden JM, Rose S, Mortimer KM. Implementation of G-computation on a simulated data set: demonstration of a causal inference technique. Am J Epidemiol. 2011;173(7):731–738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Young JG, Logan RW, Robins JM, et al. Inverse probability weighted estimation of risk under representative interventions in observational studies. J Am Stat Assoc. 2019;114(526):938–947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Schuler MS, Rose S. Targeted maximum likelihood estimation for causal inference in observational studies. Am J Epidemiol. 2017;185(1):65–73. [DOI] [PubMed] [Google Scholar]
- 41. Gruber S, van der Laan M. tmle: an R package for targeted maximum likelihood estimation. J Stat Softw. 2012;51(13):Article 35. [Google Scholar]
- 42. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. [Google Scholar]
- 43. van der Laan MJ, Polley EC, Hubbard AE. Super learner. Stat Appl Genet Mol Biol. 2007;6:Article 25. [DOI] [PubMed] [Google Scholar]
- 44. Karim ME, Platt RW. Estimating inverse probability weights using super learner when weight-model specification is unknown in a marginal structural Cox model context. Stat Med. 2017;36(13):2032–2047. [DOI] [PubMed] [Google Scholar]
- 45. Polley EC, van der Laan MJ. Super learner in prediction. (UC Berkeley Division of Biostatistics Working Paper Series, Working Paper 266). Berkeley, CA: University of California, Berkeley; 2010. [Google Scholar]
- 46. Taylor RW, Williams SM, Carter PJ, et al. Changes in fat mass and fat-free mass during the adiposity rebound: FLAME Study. Int J Pediatr Obes. 2011;6(2-2):e243–e251. [DOI] [PubMed] [Google Scholar]
- 47. Williams SM, Goulding A. Patterns of growth associated with the timing of adiposity rebound. Obesity (Silver Spring). 2009;17(2):335–341. [DOI] [PubMed] [Google Scholar]
- 48. Bayer O, Neuhauser H, von Kries R. Sleep duration and blood pressure in children: a cross-sectional study. J Hypertens. 2009;27(9):1789–1793. [DOI] [PubMed] [Google Scholar]
- 49. Hocking S, Samocha-Bonet D, Milner KL, et al. Adiposity and insulin resistance in humans: the role of the different tissue and cellular lipid depots. Endocr Rev. 2013;34(4):463–500. [DOI] [PubMed] [Google Scholar]
- 50. Bray BC, Dziak JJ, Patrick ME, et al. Inverse propensity score weighting with a latent class exposure: estimating the causal effect of reported reasons for alcohol use on problem alcohol use 16 years later. Prev Sci. 2019;20(3):394–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Schuler MS, Leoutsakos JS, Stuart EA. Addressing confounding when estimating the effects of latent classes on a distal outcome. Health Serv Outcomes Res Methodol. 2014;14(4):232–254. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data described in this article are available from the authors upon request.