Abstract
Characterization of the “exposome”, the set of all environmental factors that one is exposed to from conception onwards, has been advocated to better understand the role of environmental factors on chronic diseases.
Here, we aimed to describe the early-life exposome. Specifically, we focused on the correlations between multiple environmental exposures, their patterns and their variability across European regions and across time (pregnancy and childhood periods). We relied on the Human Early-Life Exposome (HELIX) project, in which 87 environmental exposures during pregnancy and 122 during the childhood period (grouped in 19 exposure groups) were assessed in 1301 pregnant mothers and their children at 6–11 years in 6 European birth cohorts.
Some correlations between exposures in the same exposure group reached high values above 0.8. The median correlation within exposure groups was greater than 0.3 for many exposure groups, reaching 0.69 for water disinfection by products in pregnancy and 0.67 for the meteorological group in childhood. Median correlations between different exposure groups rarely reached 0.3. Some correlations were driven by cohort-level associations (e.g. air pollution and chemicals).Ten principal components explained 45% and 39% of the total variance in the pregnancy and childhood exposome, respectively, while 65 and 90 components were required to explain 95% of the exposome variability. Correlations between maternal (pregnancy) and childhood exposures were high (> 0.6) for most exposures modeled at the residential address (e.g. air pollution), but were much lower and even close to zero for some chemical exposures.
In conclusion, the early life exposome was high dimensional, meaning that it cannot easily be measured by or reduced to fewer components. Correlations between exposures from different exposure groups were much lower than within exposure groups, which have important implications for co-exposure confounding in multiple exposure studies. Also, we observed the early life exposome to be variable over time and to vary by cohort, so measurements at one time point or one place will not capture its complexities.
Keywords: exposome, environmental exposures, early life, pregnancy, children
Graphical Abstract
1. INTRODUCTION
A complex interplay between genetic and environmental factors is assumed to contribute to the development of chronic diseases in humans (Willett, 2002). The World Health Organization (WHO) has attributed nearly half of global mortality to a handful of environmental exposures, illustrating the relevance of environmental factors for health (Lim et al., 2012). To date, most research on environmental determinants of disease has focused on single exposures, with the exceptions of a few studies considering simultaneously more than a couple of families of exposures, focusing on health outcomes such as birth weight (Dadvand et al., 2014; Hystad et al., 2014; Lenters et al., 2015) or type II diabetes mellitus (Patel et al., 2010). A more comprehensive approach, studying all environmental exposures that can influence health from conception onward, defined as the “exposome”, has been advocated to better understand the role of environmental factors on multi-factorial and chronic pathologies (Wild, 2005). Early life is a key starting point for the development of a lifetime exposome, particularly due to heightened vulnerability and potentially lifelong impact of exposure during this period (Vrijheid et al., 2014). In utero and childhood exposures, including environmental contaminant exposures (Vrijheid et al., 2016), but also factors in the urban and built environment (Gascon et al., 2016), can permanently change the body’s structure, physiology, and metabolism, predisposing individuals to the development of chronic pathologies later in life, a hypothesis based on the Developmental Origins of Health and Disease (DOHaD) paradigm (Heindel et al., 2015).
The exposome potentially consists of hundreds of exposures, many of which may be inter-related. An important challenge in studies associating the exposome with health is the simultaneous consideration of these many correlated exposures (Slama and Vrijheid, 2015). Understanding what a typical exposome looks like, including the structure of correlations between and within groups of exposure, is important for multiple steps in exposome research, including the planning of which exposures to measure and the development of statistical analysis protocols. Moreover, transparent knowledge of the correlation structure of an exposome data set is required for interpretation of associations (Patel et al., 2015; Robinson and Vrijheid, 2015). Additionally, the description of multiple exposure patterns can aid the identification of population groups at highest environmental health risk.
A few studies have recently described correlation structures of a large set of exposures, including the NHANES dataset (Patel and Ioannidis, 2014; Patel and Manrai, 2015), the Spanish INMA birth cohort (Robinson et al., 2015), and the LIFE study (Chung et al., 2018). However, the available information is still scarce. Wider studies, including a broader range of measured exposures, multiple lifetime periods of exposure assessment, and larger populations covering multiple regions are needed to more fully understand the complexity of the human exposome.
Here, we aim to describe the early-life exposome using data from the Human Early-Life Exposome (HELIX) project, in which more than 200 environmental exposures of concern for child health were assessed in 1301 pregnant women and their children at 6–11 years in 6 European birth cohorts. Specifically, we focus on the description and analyses of correlations between multiple environmental exposures, their patterns and their variability across European regions and across time (pregnancy and childhood periods).
2. MATERIAL AND METHOS
2.1. The study population
This study is part of the HELIX project, which aims to characterize the exposome during early-life and evaluate its relationship to molecular signatures and child health outcomes (Vrijheid et al., 2014). HELIX is based on six European birth cohort studies: BiB (Born in Bradford), United Kingdom (Wright et al., 2013) ; EDEN (Étude des Déterminants pré et postnatals du développement et de la santé de l’Enfant), France (Heude et al., 2016); INMA (INfancia y Medio Ambiente), Spain (Guxens et al., 2012); KANC (Kaunus Cohort), Lithuania (Grazuleviciene et al., 2009); MoBa (Norwegian Mother and Child Cohort Study), Norway (Magnus et al., 2016); and Rhea, Greece (Chatzi et al., 2017). In this paper we used data from the HELIX subcohort of 1301 mother-child pairs (MoBa (272), KANC (204), BIB (205), EDEN (198), INMA (223) and RHEA (199)) with information on environmental exposures during pregnancy and between 6 and 11 years of age (median 8.1 years; hereafter referred to as “childhood period”).
Follow-up visits of the children in the subcohort took place in 2014 and 2015 in the 6 study centers, and questionnaire information and biological samples for biomarker determination were collected. More details about the subcohort, including recruitment and data collection methods, are available in Maitre et al. (2018). Approval was obtained from the national ethics committees for every cohort. All participating women provided informed, written consent.
2.2. Exposome variables
In the current analysis we included 87environmental exposure variables for the pregnancy period and 122 for the childhood period (Table 1). Exposure assessment methods are described in detail for each exposure group in Annex 1. More exposure variables were available in the project (see Annex 1), but were not included in the current analysis for the following reasons: they had less than 30 subjects in one exposure category without possibility to recode; they had a correlation of 0.9 or higher with another similar variable that was likely measuring the same thing (Annex 1, pg 3) (e.g., buffers for spatial variables, home and school estimates for meteorological variables, erythemal UV, Vitamin-D and DNA damaging UV variables, facility richness and density and dichotomous and location-specific ETS in childhood); or they were calculated for several exposure windows and only the longest exposure window was included (e.g. pregnancy average instead of trimester averages). Exposure levels and distributions for these variables are shown in Annex 2.
Table 1.
Exposure group | Individual exposure variables | Number of variables | |
---|---|---|---|
| |||
Pregnancy | Childhood | ||
Atmospheric pollutants | NO2, PM2.5, PM10, PMabs at home address. Averaged over pregnancy and year before visit in childhood | 4 | 4 |
UV | Ambient ultraviolet radiation (UVR) levels at home address. Averaged over month before visit during childhood. Not available during pregnancy. | 0 | 1 |
Surrounding natural spaces | Average Normalized Difference Vegetation Index (NDVI) within a 100 m buffer; presence of a major green space in a distance of 300 m; presence of a major blue space in a distance of 300 m. Home and school address during childhood. | 3 | 6 |
Meteorology | Temperature, humidity, pressure at home address. Pressure not available during childhood. Averaged over pregnancy and month before visit during childhood. | 3 | 2 |
Built environment | Population density, building density, street connectivity, access to bus public transport (lines and stops), facility richness (pregnancy only) and density, Land Use Evenness Index and walkability index in a 300 m buffer. Home and school address during childhood (walkability only at home address). | 9 | 15 |
Traffic | Total traffic load of roads in a 100 m buffer (pregnancy and home), total traffic load of major roads in a 100 m buffer (home and school), traffic density on nearest road (pregnancy and home) and inverse distance to nearest road (pregnancy and home). | 3 | 5 |
Road traffic noise | 24-hour road noise levels (pregnancy, and home and school during childhood). Night time noise levels for home during childhood. | 1 | 3 |
OCs | Blood concentrations of DDE,DDT,HCB,PCB(118,138,153,170,180) | 8 | 8 |
PBDE | Blood concentrations of PBDE47, PBDE153 | 2 | 2 |
PFASs | Blood concentrations of PFOA, PFNA, PFUNDA, PFHxS, PFOS | 5 | 5 |
Metals and elements | Blood concentrations of As, Cd, Co,Cs, Cu, Hg, Mn, Mo, Pb, Tl | 10 | 10 |
Phthalate metabolites | Urine concentrations of MEP, MiBP, MnBP, MBzP, MEHP, MEHHP, MEOHP, MECPP, OHMiNP, OXOMiNP | 10 | 10 |
Phenols | Urine concentrations of MEPA, ETPA, BPA, PRPA, BUPA, OXBE, TRCS | 7 | 7 |
OP pesticide metabolites | Urine concentrations of DMP,DMTP,DMDTP (childhood only),DEP,DETP | 4 | 5 |
Tobacco smoking | Urine concentration of cotinine (pregnancy and childhood), active/passive smoking during pregnancy, number of cigarettes during pregnancy, parental smoking and passive smoking during childhood. | 3 | 3 |
Water DBPs | THM, choloroform, brominated THMs | 3 | 0 |
Indoor air pollution | NO2, TEX, Benzene, PM2.5, PMabs | 0 | 5 |
Lifestyle | Diet, physical activity, breastfeeding duration, alcohol consumption, sleep duration, pets | 12 | 27 |
Socio-economic capital | Family affluence score, social contact, social participation, house crowding | 0 | 4 |
| |||
87 | 122 |
Abbreviations used: Home: childhood home address; School: childhood school address; NO2, nitrogen dioxide; PM2.5, particulate matter with an aerodynamic diameter of less than 2.5 μm; PM10, particulate matter with an aerodynamic diameter of lessthan10μm; PMabs, absorbance of PM2.5filters; UV, ultraviolet; NDVI: Normalized Difference Vegetation Index; DDE, 4,4’dichlorodiphenyl dichloroethylene; DDT, 4,4’dichlorodiphenyltrichloroethane; HCB, hexachlorobenzene; PCB, polychlorinated biphenyl; PBDE, polybrominateddiphenyl ether; PFOA, perfluorooctanoate; PFNA, perfluorononanoate; PFUNDA, perfluoroundecanoate; PHFxS, perfluorohexane sulfonate; PFOS, perfluorooctane sulfonate; As, arsenic; Cd, cadmium; Co, cobalt; Cs, caesium; Cu, copper; Hg, mercury; Mn, manganese; Mo, molybdenum; Pb, lead; Tl, thallium; MEP, monoethyl phthalate; MiBP, mono-iso-butyl phthalate; MnBP, mono-n-butyl phthalate; MBzP, mono benzyl phthalate; MEHP, mono-2-ethylhexyl phthalate; MEHHP, mono-2-ethyl-5-hydroxyhexyl phthalate; MEOHP, mono-2-ethyl-5-oxohexyl phthalate; MECPP, mono-2-ethyl 5-carboxypentyl phthalate; OHMiNP, mono-4-methyl-7-hydroxyoctyl phthalate; OXOMiNP, mono-4-methyl-7-oxooctyl phthalate; MEPA, methyl paraben; ETPA, ethyl paraben; BPA, bisphenol A; PRPA, propyl paraben; BUPA, N-butyl paraben; OXBE, oxybenzone; TRCS, triclosan; DMP, dimethyl phosphate; DMTP, dimethyl thiophosphate; DMDTP, dimethyl dithiophosphate; DEP, diethyl phosphate; DETP, diethyl thiophosphate; THMs, trihalomethanes; TEX, toluene-ethylbenzene-xylene.
In brief, exposure to outdoor factors (atmospheric pollutants, ultra violet radiation – UV-, surrounding natural spaces, meteorology, built environment, traffic, and road traffic noise) was estimated using geospatial models, monitoring stations, satellite data and land use databases, and was assigned to study participants according to their geocoded home and school addresses using GIS platforms (Annex 1 and Robinson et al., (2018)). Chemical exposures were measured in serum, plasma, blood or urine samples using maternal samples collected during pregnancy or at birth stored by the cohorts and samples newly collected from the children during childhood (Haug et al., (2018) and Annex 1). Chemical biomarkers measured included organochlorine compounds (OCs), polybrominated diphenyl eters (PBDEs), per- and polyfluoroalkyl substances (PFAS), metals and elements, phthalate metabolites, phenols, organophosphate (OP) pesticide metabolites and cotinine. OCs and PBDEs were adjusted for serum lipid concentrations, and phthalate metabolites, phenols, OP pesticide metabolites and cotinine were adjusted for urinary creatinine. Details on the laboratories, limits of quantification, limits of detection and quality control can be found in Haug et al. (2018). Table A1.5 in the Annex shows the percent of quantifiable samples, which were higher than 90% for 77% of the childhood exposures and for 82% of the pregnancy exposures. Information on active and passive tobacco smoking was collected through questionnaires. For the pregnancy period, we also assigned exposure to water disinfection by-products (DBPs) based on measurements and models for the water supply of the participant’s residency (Jeong et al., 2012). For the childhood period, we estimated exposure to indoor air pollutants (NO2, benzene, and TEX-toluene, ethylbenzene, xylene) by combining measurements in the homes of a subgroup of 150 children during the two time periods with questionnaire data (Annex 1). During childhood, questionnaire information was collected on socio and economic capital of the family based on the Family Affluence Score (FAS) and on social participation, social contact and house crowding data (Annex 1). Finally, information on other lifestyle factors, including maternal and child diet, breastfeeding, maternal and child physical activity, alcohol consumption, pets, and child sleep duration, was collected through questionnaires (Annex 1).
2.3. Statistical analysis
Missing values of exposures and adjustment variables were imputed using the method of chained equations (White et al., 2011), using the mice package in R (van Buuren and Groothuis-Oudshoorn, 2011). Prior to imputation, skewed exposure variables were transformed to achieve normality or categorized if no transformation worked (Annex 2). Missing values, ranged from no missing values for some child phthalate metabolites to 65% for fast-food intake during pregnancy (Annex 2). The mean percentage of missing values per exposure was 12% (first quartile 0.9% and third quartile 16.8%). None of the participants had complete data on all exposures. Yet, for 98% of individuals less than 30% of exposure variables had missing values. Annex 3 includes a full description of the imputation process. As the analyses conducted in this paper do not provide standard errors, confidence intervals or p-values, a single imputed dataset was used, as single imputation can already correct for potential biases (Donders et al., 2006).
2.3.1. Correlations between exposures
Given the multicentre design of our study and the potential influence of cohort in the correlations between exposures, we calculated correlations firstly using the entire imputed entire dataset combining the 6 cohorts (referred to as “overall” correlations) and secondly using exposure variables centered and standardized within each cohort (referred to as “within–cohort” correlations). The latter were computed by first centering each exposure by cohort, i.e. subtracting the cohort means and dividing by the cohort-specific standard deviations, and then computing the correlations. We calculated Pearson, polychoric or polyserial correlations, as appropriate, between all pairs of exposures using the polycor and rexposome R Packages (Hernandez-Ferrer and Gonzalez, 2018). Very similar correlation coefficients were obtained if Spearman correlations were computed instead of Pearson correlations (data not shown). Correlation matrices were calculated separately for the pregnancy and childhood exposures. In order to characterize the heterogeneity of correlations between exposures in the same exposure group by cohort, we i) calculated correlation coefficients separately for each cohort; ii) conducted a meta-analysis of the cohort-specific correlations using the metacor command of the meta R package; and iii) reported the I2 index of heterogeneity. Overall and within-cohort correlation matrices were displayed and compared using heat maps. In order to visualize the complex relations between exposures, we conducted a network analysis based on the within-cohort correlation matrix, separately for the pregnancy and childhood periods. Additionally, we calculated Pearson’s correlations between exposures of the mother during pregnancy and the same exposures measured in the child at 6–11 years when available.
2.3.2. Dimensionality of exposome
We performed two different analyses using Principal Component Analysis (PCA) to achieve two different goals. In the first analysis, the aim was to compare the levels of exposures in the different cohorts.
Given the large number of exposures included in the analysis, we first reduced the dimensionality by conducting a separate PCA within each of the 19 pre-defined exposure groups, and retained only the first principal component for all of them. This way, we created a composite index variable (principal component scores) for each exposure group, and then averaged the scores by cohort to compare the levels. We followed this strategy instead of conducting a PCA with all exposures from all exposure groups together in order to have a better interpretability of the resulting principal components, and to ensure a better comparability of the resulting components in the pregnancy and childhood datasets.
In the second analysis, the aim was to quantify the underlying dimensionality of the data. In this case, we did include all the exposures from the different exposure families in the same analysis (separately for pregnancy and postnatal period), but we only focused on the number of components needed to explain 70% or 95% of the variation. The second analysis was conducted separately on the overall data and on the centered within-cohort data. In both analyses, we conducted PCA with varimax rotation.
All statistical analyses were conducted in the R software environment (R3.3.0; http://www.r-project.org). The network analysis was conducted using the Gephi software (Bastian et al., 2009).
3. RESULTS
Table 2 shows the number of participants by cohort, as well as the age and sex distribution. Mothers had an average age of 31 years at child birth, with some variations by cohort. There were also variations by cohort in the age of the children, with EDEN having the oldest children (average age 11) and KANC the youngest (average age 6).
Table 2:
N | Pregnancy |
Childhood |
||||
---|---|---|---|---|---|---|
Cohort | Maternal age, years |
Age, years |
Sex |
|||
mean | (25th; 75th percentile) | mean | (25th; 75th percentile) | % female | ||
BiB | 205 | 28.6 | (25.0; 33.0) | 6.6 | (6.5; 6.8) | 44.9 |
EDEN | 198 | 30.5 | (27.5; 34.0) | 10.9 | (10.4; 11.2) | 42.9 |
INMA | 223 | 32.0 | (29.5; 34.8) | 8.8 | (8.4; 9.2) | 46.2 |
KANC | 204 | 29.2 | (25.7; 32.3) | 6.4 | (6.1; 6.8) | 45.6 |
MoBa | 272 | 32.7 | (30.0; 35.0) | 8.5 | (8.2; 8.8) | 47.8 |
RHEA | 199 | 30.8 | (27.3; 33.5) | 6.5 | (6.4; 6.6) | 44.2 |
| ||||||
TOTAL | 1301 | 30.8 | (27.2; 34.0) | 8.1 | (6.5; 8.9) | 45.4 |
3.1. Exposure levels
Distributions of all exposure variables are detailed in Annex 2 by individual cohorts (Figure A2.1 for pregnancy and Figure A2.2 for childhood). Further details on chemical exposure biomarkers are found in Haug et al. (2018).In order to summarize these data, we computed a single principal component separately for each exposure group. Details on these principal component analyses are available in Annex 4, Tables A4.1 and A4.2. Figure 1 plots the principal component scores by cohort to characterize the cohorts’ profiles. The highest values for outdoor exposures (i.e. high pollution, traffic and low greenness) were found in the INMA cohort. In terms of chemicals, the highest exposure to OCs and phthalates were found in the EDEN cohort during pregnancy and in KANC and RHEA, respectively, during childhood; PBDEs were highest in RHEA during pregnancy, while fewer differences by cohort were observed during childhood; phenols were highest in MOBA during pregnancy and in BIB during childhood; and metals were highest in MOBA during pregnancy and in EDEN during childhood. Smoking was highest in RHEA, and water DBPs were highest in INMA. In terms of lifestyle, characterized mainly by consumption of fruit, MOBA showed the highest levels.
3.2. Within- and between-exposure group correlations
Table 3 shows the correlations within exposure groups for the pregnancy and childhood periods. Both overall and within-cohort correlations are shown. Overall correlations tend to be higher than within-cohort correlations, except for the atmospheric pollutants group. This is because, in this case, cohort-level averages do not follow the same correlation pattern than within-cohort correlations. For example, despite the fact that PM10 and NO2 are positively correlated within each cohort, the RHEA cohort had the highest levels of PM10 but the lowest levels of NO2 (Robinson et al. 2018). The highest median overall correlations within exposure groups were observed among water DBPs (r=0.69) and among meteorological variables in childhood (r=0.67) and in pregnancy (r=0.54), followed by tobacco smoking exposure in pregnancy (r=0.54), PFASs in pregnancy (r=0.49) and in childhood (r=0.45), and air pollutants in childhood (r=0.44). All other exposure groups showed median overall correlations below 0.42, with metals, lifestyle and surrounding natural spaces showing the lowest within-group correlations (below 0.12). In general, correlations within exposure groups were quite heterogeneous by cohort, as shown by the high I2 (Table 3), except for the metals, lifestyle and socio-economic capital groups.
Table 3.
Exposure group | Pregnancy period | Childhood period | ||||||
---|---|---|---|---|---|---|---|---|
N | Overall Median (min-max) |
Within-cohort Median (min-max) |
I2 Median (min-max) |
N | Overall Median (min-max) |
Within-cohort Median (min-max) |
I2 Median (min-max) |
|
All exposure variables | 87 | 0.08 (0.00–0.99) | 0.03 (0.00–0.94) | 0.54 (0.00–0.99) | 122 | 0.07 (0.00–0.98) | 0.03 (0.00–0.98) | 0.00 (0.00–0.99) |
Atmospheric pollutants | 4 | 0.30 (0.11–0.74) | 0.53 (0.47–0.63) | 0.96 (0.94–0.97) | 4 | 0.44 (0.21–0.76) | 0.55 (0.47–0.62) | 0.97 (0.89–0.98) |
UV | 0 | - | - | - | 1 | - | - | - |
Surrounding natural spaces | 3 | 0.07 (0.02–0.22) | 0.05 (0.04–0.32) | 0.51 (0.00–0.79) | 6 | 0.11 (0.01–0.67) | 0.09 (0.01–0.34) | 0.26 (0.00–0.90) |
Meteorology | 3 | 0.54 (0.27–0.70) | 0.11 (0.10–0.59) | 0.95 (0.95–0.98) | 2 | 0.67 (0.67–0.67) | 0.47 (0.47–0.47) | 0.97 (0.97–0.97) |
Built environment | 9 | 0.27 (0.01–0.89) | 0.28 (0.01–0.89) | 0.86 (0.00–0.98) | 15 | 0.27(0.00–0.72) | 0.17(0.00–0.68) | 0.71 (0.00–0.97) |
Traffic | 3 | 0.26 (0.06–0.31) | 0.21 (0.10–0.35) | 0.84 (0.82–0.93) | 5 | 0.20 (0.11–0.57) | 0.13 (0.02–0.56) | 0.70 (0.00–0.99) |
Road traffic noise | 1 | - | - | 3 | 0.34 (0.23–0.59) | 0.11 (0.05–0.32) | 0.24 (0.00–0.99) | |
OCs | 8 | 0.38 (0.01–0.97) | 0.42 (0.11–0.95) | 0.92 (0.63–0.99) | 8 | 0.42 (0.05–0.96) | 0.56 (0.18–0.94) | 0.83 (0.00–0.97) |
PBDE | 2 | 0.21 (0.21–0.21) | 0.11 (0.11–0.11) | 0.87 (0.87–0.87) | 2 | 0.36 (0.36–0.36) | 0.34 (0.34–0.34) | 0.37 (0.37–0.37) |
PFASs | 5 | 0.49 (0.12–0.67) | 0.52 (0.29–0.73) | 0.91 (0.54–0.94) | 5 | 0.45 (0.30–0.60) | 0.50 (0.26–0.55) | 0.67 (0.00–0.92) |
Metals and elements | 10 | 0.07 (0.00–0.99) | 0.03 (0.00–0.30) | 0.00 (0.00–0.91) | 10 | 0.06 (0.01–0.51) | 0.09 (0.01–0.42) | 0.07 (0.00–0.89) |
Phthalate metabolites | 10 | 0.23 (0.00–0.91) | 0.21 (0.00–0.91) | 0.65 (0.00–0.99) | 10 | 0.25 (0.08–0.98) | 0.25 (0.06–0.97) | 0.23 (0.00–0.92) |
Phenols | 7 | 0.18 (0.01–0.57) | 0.14 (0.03–0.53) | 0.31 (0.00–0.99) | 7 | 0.21 (0.07–0.64) | 0.17 (0.07–0.58) | 0.40 (0.00–0.95) |
OP pesticide metabolites | 4 | 0.32 (0.24–0.63) | 0.33 (0.24–0.59) | 0.85 (0.74–0.97) | 5 | 0.26 (0.18–0.50) | 0.27 (0.17–0.51) | 0.61 (0.00–0.93) |
Tobacco smoking | 3 | 0.54 (0.27–0.62) | 0.41(0.33–0.45) | 0.91 (0.83–0.91) | 3 | 0.42 (0.38–0.51) | 0.43 (0.37–0.51) | 0.88 (0.85–0.90) |
Water DBPs | 3 | 0.69 (0.39–0.76) | 0.38 (0.19–0.39) | 0.97 (0.86–0.99) | 0 | - | - | - |
Indoor air pollution | 0 | - | - | - | 5 | 0.20 (0.05–0.43) | 0.14 (0.01–0.53) | 0.60 (0.00–0.83) |
Lifestyle | 12 | 0.11 (0–0.42) | 0.03 (0.00–0.14) | 0.00 (0.00–0.58) | 27 | 0.07 (0.00–0.59) | 0.04 (0.00–0.70) | 0.00 (0.00–0.91) |
Socio-economic capital | 0 | - | - | - | 4 | 0.06 (0.02–0.15) | 0.04 (0.00–0.13) | 0.00 (0.00–0.46) |
N: number of compounds assessed in the exposure family. min-max: minimum and maximum values observed across all pairs of exposures in the exposure group
During pregnancy, the median absolute correlations between exposure groups were much lower (0.08) than within exposure groups (0.20). Figure 2 shows the median within-group (diagonal) and between-group (off-diagonal) absolute correlations for the pregnancy period. For overall correlations (Figure 1a), the highest absolute median correlations between exposure groups were observed between noise and meteorological variables (r=0.45), between water DPBs and meteorological variables (0.39), between water DBPs and atmospheric pollutants (r=0.32), and noise and traffic (r=0.29). The rest of the median absolute correlations between groups of chemical exposures were all equal to or below 0.24. The maximum correlation of lifestyle variables was with atmospheric pollutants (r=0.19). Similar patterns were observed for the within-cohort correlations (Figure 1b, but the correlations between groups were considerably reduced (r=0.03 (0.01–0.03)).This was especially the case for water DBPs, and for the correlations between outdoor exposures and chemicals, which were very close to 0. All correlations between noise and the other exposure groups were strongly reduced, except for the correlation between atmospheric pollutants and noise (r=0.13).
The pattern of correlations between exposures measured during childhood was generally similar to the patterns observed during pregnancy (Annex 4, Figure A4.1). One notable difference was that correlations within meteorological variables were much stronger in the childhood period, where the last month averages were used, compared to the pregnancy period, where we used 9-month averages. Other interesting correlations were found between PFAS and OCs (within: 0.20, overall: 0.19) and between tobacco smoke and indoor air pollution (within: 0.18, overall: 0.24) where within-cohort correlations remained similar to overall correlations. Annex 4, Figure A4.3 shows histograms and cumulative density functions of all correlations.
Annex 4, Figures A4.4-A.4.7 display the correlations within and between exposure groups by sex. No notable differences in correlations were observed between the two sexes.
3.3. Network visualization
The exposome correlation structure, using individual exposures and within-cohort correlations, is visualized as a network in Figure 3. Exposures that are close together in the network are more correlated than more distant ones. The pregnancy network (Figure 3A) shows a cluster with the outdoor exposome variables, dominated by the built environment variables but also including air pollutants, traffic variables, natural spaces, meteorological variables, and water DBPs. A few lifestyle variables, such as meat consumption and physical activity, were close to the outdoor exposome cluster. On the top-left part of the network there is another cluster made of PFASs and OCs, with contributions from metals, PBDEs, tobacco smoke and diet variables. Phenols and phthalate metabolites form separate clusters linked together by BPA and MEP. A few variables, like folic acid and certain metals, are not connected to other exposures. The childhood network (Figure 3B) appears more compact than the pregnancy network, with more links between exposure families. The outdoor exposome still appears as a cluster. It is connected to OCs via indoor air pollution and tobacco smoke. OCs continue to be close to PFASs. Breastfeeding and some lifestyle variables such as physical activity and organic food consumption appear close to OCs. Diet variables form their own cluster, with some connections to OCs and PFASs. On the left hand side there is a cluster of phenols, phthalate metabolites, meteorological variables and OP pesticides. Meteorological variables in childhood represent last month values, while in pregnancy they represent pregnancy averages. Thus, in the childhood period meteorological variables represent recent exposure, and this may explain why they are closer to phenols than in the pregnancy period (phenols are present in cosmetics such as sunscreen, and their use may be influenced by recent meteorology). The main connections of the cluster formed by phenols, phthalate metabolites, meteorological variables and OP pesticides are phthalate metabolites and PBDEs with metals, and meteorology with indoor pollution.
3.4. Dimensionality of exposome
When conducting PCA analyses with all exposures together, 10 principal components explained 45% of the total variance in the pregnancy exposome, while 26 components explained 70% and 65 components explained 95%. These numbers of principal components represent 11%, 30% and 75% of the original dimensionality of the pregnancy exposome (87). In the childhood exposome, 10 components explained 39%, 42 explained 70% and 90 components explained 95% of the total variance (Figure A4.2). These numbers of principal components represent 8%, 34% and 74% of the original dimensionality of the childhood exposome (87), so the reduction in dimensionality is of similar magnitude in the two periods. The percent of variance explained was slightly lower when using the cohort-centered data (Figure A4.2).
3.5. Correlations between pregnancy and childhood exposures
We calculated the correlation between the same exposure measured during pregnancy (mother) and childhood. Figure 4 shows that correlation coefficients are high (> 0.6) for many of the exposures estimated at the residential address, such as atmospheric pollutants, green space, noise, traffic, and certain built environment variables (connectivity, building density). Other built environment factors such as facility density had lower correlations (r<0.40). The correlation between pregnancy and childhood estimates was also lower for meteorological variables (r=0.40 for temperature and r=0.10 for humidity) because the pregnancy period includes nine months while childhood period includes 1-month information and is therefore more subject to seasonality. Within the exposures measured by biomarkers, DDE showed the highest correlation (r=0.62) between the pregnancy and childhood period. Among OCs, DDE had the highest correlations while HCB showed almost no correlation between pregnancy and childhood samples (r=0.02). Among PFASs, PFHXS and PFOS showed correlations above 0.4, while the correlation was lower (r<0.25) for other PFASs. Cs and Hg showed correlations greater than 0.4, but the other metals showed lower correlations (r<0.25), even negative in the case of Cd (r=−0.05). Phthalate metabolites showed low correlation between maternal and child sample (between −0.1 and 0.1). The correlations were also small for phenols and OP pesticides (between 0 and 0.20). The temporal changes for chemicals in HELIX have been reported by Haug et al. (2018), with slight differences due to different imputation methods used. Diet variables showed low correlations (r<0.1) with the exception of fish and cereal intake with somewhat greater values (r=0.22 and 0.35 respectively).
Figure A.4.8 displays the correlations between pregnancy and childhood exposures by sex. No notable differences were observed between the two sexes.
4. DISCUSSION
The HELIX project has measured exposure to many environmental factors of concern for child health, allowing a detailed analysis of the structure of the early life exposome, including its correlations, patterns and variability. Some very high correlations (>0.8) between particular pairs of exposures were observed and some exposure groups included exposures with high correlations between all exposures in the group (e.g. reaching median correlations 0.69). Overall, however, the median correlation between exposures within the same exposure group was only 0.2. Correlations were much lower between exposures from different exposure groups, with a median correlation of 0.08. Our results show that the early life exposome is high dimensional and cannot be summarized by a handful of principal components, and that it varies spatially and temporally.
4.1. Correlation between exposures
Our results show that correlations within the same exposure group can be high, but that correlations between exposures from different groups were low. High correlations between exposures from the same exposure group have been described previously (Lenters et al., 2015; Patel and Ioannidis, 2014; Robinson et al., 2015). The correlations found in our study are very similar than those reported in the U.S. for similar exposure groups (e.g. OCs, phthalates, phenols, metals), even though we reported higher correlations for OP pesticides (Patel and Ioannidis, 2014). The finding of generally low correlations between exposures from different groups, especially after removing cohort effects, is important as it would support the notion that, if this finding is generalizable to all populations, epidemiological studies focusing on a single family of exposures may not be confounded by exposures from other groups. Thus, results from epidemiological studies that focus on a single exposure or on a subset of exposures from the same group are not expected to be greatly affected by not having included exposures from other groups. However, one should be careful; although many between-group correlations were low, they were not negligible. In addition, although the median correlations between exposure groups were low, for particular pairs of exposures, high correlations were observed (e.g. between meteorology and road traffic noise or meteorology and water DBPs).
Some of the high correlations may be due to exposures being measured using a common methodology (e.g. GIS variables) or a common biological medium (e.g. variables measured in urine) or because some variables were used to create others. We made efforts to use the same procedures in all the different cohorts to maximize comparability. Variables belonging to the outdoor exposome (i.e. atmospheric pollutants, traffic noise, natural environments or built environment) tended to be more correlated than others in our data. However, it should be taken into account that some of these exposures were modeled and estimated at a static point, i.e. at the residential and school addresses, and they were capturing annual means. If instead one considers personal measurements that account for time activity patterns or estimates exposure for shorter time periods, correlations can vary. For example, some of the correlations reported by a sub-study based on personal measurements within the HELIX project found higher correlations between urban exposures than the ones reported here (Donaire-Gonzalez et al., unpublished results). The correlation between exposures has important implications when one tries to identify a causal factor for disease among a set of correlated exposures (Agier et al., 2016; Barrera-Gómez et al., 2017). A high correlation can drastically reduce the sensitivity of the methods to identify the true exposure and increases the probability of obtaining false positives. Thus, according to our results, it will be more difficult to correctly identify the true exposure causing a health effect among the outdoor exposome variables, if such a true association exists.
It is important to note that despite some high correlations, redundancy in our exposome data was still low. To capture 95% of the total variance in the exposome set, 65 principal components were needed and 72 if the cohort effects were removed. Thus, the early-life exposome is complex and high-dimensional, and it is not easy to synthesize it in a few exposure scores. Measurement errors affecting all exposures can also contribute to the observed low correlations between exposures and therefore to the fact that exposures cannot be summarized using a handful of principal components. It is also noteworthy that, in general, the correlations between environmental pollutant exposures and other lifestyle variables were low after controlling for cohort. The fact that exposures show low correlations with lifestyle factors means a lower potential for confounding by lifestyle in studies on the effect of environmental exposures on health.
Overall, it is recommended that investigators conducting exposome research conduct a thorough exploration of the structure of the exposome before evaluating exposome-health associations. Then, even in a context of a high dimensional exposome, there are several statistical methods that can be useful to explore exposome-health relationships (Agier et al. 2016; Stafoggia et al. 2017).
4.2. Cohort effects
When we summarized exposure groups by their first PCA score, we observed strong differences in some of the exposures by cohort. For example, BIB (UK) had the lowest values of OCs while EDEN (France) had the highest, INMA (Spain) had the highest values on atmospheric pollutants and water DBPs, RHEA (Greece) had the highest temperature and lowest water DBPs, MOBA (Norway) had the highest metal values and KANC (Lithuania) the lowest for the same variable. The geographical pattern of environmental exposures can be explained by multiple factors, such as different meteorology, city configuration, country-specific environmental policy or predominant diet. Multicenter studies have the advantage of capturing more varied exposure patterns and different ranges of exposure levels, but between-city differences can also drive many of the correlations between variables. In our data, we observed how, after removing the cohort effect, correlations between exposures from different exposure groups were reduced. Thus, adjustment for cohort is very important in studies linking the exposome with health outcomes, especially if the number of locations is small. Analyses that do not adjust for cohort may be driven by ecological associations obtained from a small number of locations, and although they can reflect true associations, they are more prone to be driven by confounding by other cohort or city level variables (Basagaña et al., 2018). In addition, the distribution of exposures across SES (socio-economic status) strata can be strongly city-dependent (Robinson et al., 2015; Temam et al., 2017).
4.3. Mother-child correlations
One of the special characteristics of the exposome is that, unlike the genome, it changes over time. This makes the characterization of the exposome challenging. In our data, comparing two points that were 6 to 11 years apart, only one exposure (modeled PM10) had a correlation greater than 0.8, and a few others, also related to atmospheric pollutants or the built environment, had correlations greater than 0.6. Chemical exposure can have different half-life in the human body, which influence the temporal correlations. As expected, concentration of persistent chemicals (i.e., PBDEs, and PFASs) showed high temporal correlations whereas non-persistent chemicals (i.e. phthalate metabolites, phenols and OP pesticide metabolites), with a short biological half-life and large within-subject variability (Casas et al., 2018), showed low temporal correlations for phthalate metabolites, phenols and OPs were low. As a result, to reliably estimate associations of these non-persistent chemicals with disease we cannot rely on a measure taken at a single time period, as it may not represent exposure in other periods of life.
Part of the observed variability between periods in our study may be explained by how the exposures were measured. Variations in the outdoor exposome can for example be due to the fact that we used the nine months of pregnancy for the pregnancy period and monthly averages for the childhood period. Moving residences is also a factor introducing temporal variations in the outdoor exposome.
4.4. Missing data
Missing data is a common problem in most epidemiological studies. Often, studies rely on complete case analyses, i.e. they discard participants with missing exposure data even if this approach only provides valid results under the assumption that missing values occurred completely at random or else it may introduce selection bias (Donders et al., 2006). An exposome paradigm provides additional complications as the higher the number of variables that need to be examined jointly, the lower the number of complete cases (in our data, there were no complete cases). Thus, the use of techniques like multiple imputation, which provides valid results under less restrictive assumptions (Donders et al., 2006), is imperative, for example, to implement model selection techniques to build regression models.
Applying multiple imputation to large datasets involves extra difficulties (Stuart et al., 2009). In general, it is recommended that imputation models should not include more than 25 variables, as adding more predictors usually provides little gain and can bring problems of convergence (van Buuren and Groothuis-Oudshoorn, 2011). To this end, we built the imputation model for each variable by selecting a set of less than 25 predictors that avoided collinearity problems (Annex 3).
4.5. Limitations
Although we collected data on hundreds of variables there are many unmeasured exposures that also conform the early-life exposome. E.g. a 2006 review paper reported 100,000 chemicals registered for commercial use, 200 of them being neurotoxic for humans (Grandjean and Landrigan, 2006). We only had data from a limited number of locations and therefore our results are not generalizable, but they provide a good picture of some of the variations of the early-life exposome in Europe. Likewise, we only observed cross-sections of the exposome and only assessed differences across two time points separated by at least 6 years, leaving the early childhood period unexplored. Our childhood period covered a wide age range, from 6 to 11, in which children go through considerable growth. Previous studies have shown that body burdens of exposure to persistent organic chemicals decrease with age mainly because of an increase in dilution of these chemicals in the total blood volume (Gascon et al., 2015). For non-persistent chemicals measured in urine, such changes are less relevant, but behavioral changes over age may of course lead to changes in exposure. In the present study it was difficult to conduct analyses by age because the age ranges of the children were strongly determined by cohort, which has a strong relationship with levels of chemicals.
5. CONCLUSIONS
This multicenter study with over 200 single environmental exposures measured showed the early life exposome to be high dimensional in terms of having little redundancy. Correlations between exposures from different exposure groups were much lower than within exposure groups. This suggests that, in many cases, studies that focus on a single exposure family should not suffer from strong unmeasured confounding by omitted exposures, although this can be different in other geographical settings or age groups. In addition, the early-life exposome varies strongly by region and by life periods, so measurements at one time point or one place will not capture its complexities.
Supplementary Material
Highlights:
The early-life exposome is high dimension also not easily reducible to fewer components
Correlations between exposures within different exposure groups can be high
Correlations between exposures in different exposure groups are low
The exposome varies strongly by location and by life period
Acknowledgements
We are grateful to all the participating children, parents, practitioners and researchers in the six countries who took part in this study. We thank the following entities that made the collection of many exposure data possible: the ESCAPE project for the air pollution modeling, HiWate for providing part of the water DPB models. We also thank the Yorkshire Water, UK, for the provision of water supply zone boundaries and routine water quality monitoring data, including Trihalomethanes (chloroform, bromoform, dibromochloromethane, and bromodichloromethane), from which all disinfection by product exposures estimates were derived for the Born in Bradford cohort.
This work was supported by the European Community’s Seventh Framework Programme (FP7/2007–2013) [grant number: 308333–the HELIX project].
INMA data collections were supported by grants from the Instituto de Salud Carlos III, CIBERESP, and the Generalitat de Catalunya-CIRIT. KANC was funded by the grant of the Lithuanian Agency for Science Innovation and Technology (6-04-2014_31V-66). The Norwegian Mother and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research, NIH/NIEHS (contract no N01-ES-75558), NIH/NINDS (grant no.1 UO1 NS 047537-01 and grant no.2 UO1 NS 047537-06A1). The Rhea project was financially supported by European projects (EU FP6-2003-Food-3-NewGeneris, EU FP6. STREP Hiwate, EU FP7 ENV.2007.1.2.2.2. Project No 211250 Escape, EU FP7-2008-ENV-1.2.1.4 Envirogenomarkers, EU FP7-HEALTH- 2009- single stage CHICOS, EU FP7 ENV.2008.1.2.1.6. Proposal No 226285 ENRIECO, EUFP7- HEALTH-2012 Proposal No 308333 HELIX, FP7 European Union project, No. 264357 MeDALL), and the Greek Ministry of Health (Program of Prevention of obesity and neurodevelopmental disorders in preschool children, in Heraklion district, Crete, Greece: 2011– 2014; “Rhea Plus”: Primary Prevention Program of Environmental Risk Factors for Reproductive Health, and Child Health: 2012–15).
Dr Chatzi received additional funding by the Southern California Environmental Health Sciences Center (grant # P30ES007048) funded by the National Institute of Environmental Health Sciences. Dr Maribel Casas received funding from Instituto de Salud Carlos III (Ministry of Economy and Competitiveness) (MS16/00128). Dr Charline Warembourg received funding from the Fondation de France (grant # 00069251).
ISGlobal is a member of the Agency for the Research Centers of Catalonia (CERCA) Programme, Generalitat de Catalunya.
REFERENCES
- Agier L, Portengen L, Chadeau-Hyam M, Basagaña X, Giorgis-Allemand L, Siroux V, Robinson O, Vlaanderen J, González JR, Nieuwenhuijsen MJ, Vineis P, Vrijheid M, Slama R, Vermeulen R, 2016. A Systematic Comparison of Linear Regression–Based Statistical Methods to Assess Exposome-Health Associations. Environ. Health Perspect 124, 1848–1856. 10.1289/EHP172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrera-Gómez J, Agier L, Portengen L, Chadeau-Hyam M, Giorgis-Allemand L, Siroux V, Robinson O, Vlaanderen J, González JR, Nieuwenhuijsen M, Vineis P, Vrijheid M, Vermeulen R, Slama R, Basagaña X, 2017. A systematic comparison of statistical methods to detect interactions in exposome-health associations. Environ. Health Glob. Access Sci. Source 16, 74. 10.1186/s12940-017-0277-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Basagaña X, Pedersen M, Barrera-Gómez J, Gehring U, Giorgis-Allemand L, Hoek G, Stafoggia M, Nieuwenhuijsen MJ, Brunekreef B, Slama R, ESCAPE Birth Outcomes working group, 2018. Analysis of multicentre epidemiological studies: contrasting fixed or random effects modelling and meta-analysis. Int. J. Epidemiol 10.1093/ije/dyy117 [DOI] [PubMed]
- Bastian M, Heymann S, Jacomy M, 2009. Gephi: an open source software for exploring and manipulating networks. Int. AAAI Conf. Weblogs Soc. Media
- Buuren S. van, Groothuis-Oudshoorn K, 2011. mice : Multivariate Imputation by Chained Equations in R. J. Stat. Softw 45, 1–67. 10.18637/jss.v045.i03 [DOI] [Google Scholar]
- Casas M, Basagaña X, Sakhi AK, Haug LS, Philippat C, Granum B, Manzano-Salgado CB, Brochot C, Zeman F, de Bont J, Andrusaityte S, Chatzi L, Donaire-Gonzalez D, Giorgis-Allemand L, Gonzalez JR, Gracia-Lavedan E, Grazuleviciene R, Kampouri M, Lyon-Caen S, Pañella P, Petraviciene I, Robinson O, Urquiza J, Vafeiadi M, Vernet C, Waiblinger D, Wright J, Thomsen C, Slama R, Vrijheid M, 2018. Variability of urinary concentrations of non-persistent chemicals in pregnant women and school-aged children. Environ. Int 121, 561–573. 10.1016/j.envint.2018.09.046 [DOI] [PubMed] [Google Scholar]
- Chatzi L, Leventakou V, Vafeiadi M, Koutra K, Roumeliotaki T, Chalkiadaki G, Karachaliou M, Daraki V, Kyriklaki A, Kampouri M, Fthenou E, Sarri K, Vassilaki M, Fasoulaki M, Bitsios P, Koutis A, Stephanou EG, Kogevinas M, 2017. Cohort Profile: The Mother-Child Cohort in Crete, Greece (Rhea Study). Int. J. Epidemiol 46, 1392–1393k. 10.1093/ije/dyx084 [DOI] [PubMed] [Google Scholar]
- Chung MK, Kannan K, Louis GM, Patel CJ, 2018. Toward Capturing the Exposome: Exposure Biomarker Variability and Co-Exposure Patterns in the Shared Environment. Environ. Sci. Technol acs.est.8b01467. 10.1021/acs.est.8b01467 [DOI] [PMC free article] [PubMed]
- Dadvand P, Ostro B, Figueras F, Foraster M, Basagaña X, Valentín A, Martinez D, Beelen R, Cirach M, Hoek G, Jerrett M, Brunekreef B, Nieuwenhuijsen MJ, 2014. Residential Proximity to Major Roads and Term Low Birth Weight. Epidemiology 25, 518–525. 10.1097/EDE.0000000000000107 [DOI] [PubMed] [Google Scholar]
- Donaire-Gonzalez D, Valentin A, Curto A, Casas M, Basagaña X, Manzano-Salgado C, de Castro M, Tamayo-Uria I, Robinson O, Pañella P, de Bont J, Waiblinger D, McEachan R, Wright J, Granum B, Schwarze PE, Aasvang GM, Meinhard-Kjellstad CH, Andrusaityte S, Petraviciene I, Dedele A, Grazuleviciene R, Lyon-Caen S, Slama R, Vafeiadi M, Chatzi L, Vrijheid M, Nieuwenhuijsen M, Michalaki E, Kampouri M, Unpublished results. Personal assessment of the external exposome during pregnancy and childhood in Europe [DOI] [PubMed]
- Donders ART, van der Heijden GJMG, Stijnen T, Moons KGM, 2006. Review: a gentle introduction to imputation of missing values. J. Clin. Epidemiol 59, 1087–91. 10.1016/j.jclinepi.2006.01.014 [DOI] [PubMed] [Google Scholar]
- Gascon M, Vrijheid M, Garí M, Fort M, Grimalt JO, Martinez D, Torrent M, Guxens M, Sunyer J, 2015. Temporal trends in concentrations and total serum burdens of organochlorine compounds from birth until adolescence and the role of breastfeeding. Environ. Int 74, 144–151. 10.1016/j.envint.2014.10.010 [DOI] [PubMed] [Google Scholar]
- Gascon M, Vrijheid M, Nieuwenhuijsen MJ, 2016. The Built Environment and Child Health: An Overview of Current Evidence. Curr. Environ. Health Rep 3, 250–257. 10.1007/s40572-016-0094-z [DOI] [PubMed] [Google Scholar]
- Grandjean P, Landrigan P, 2006. Developmental neurotoxicity of industrial chemicals. The Lancet 368, 2167–2178. 10.1016/S0140-6736(06)69665-7 [DOI] [PubMed] [Google Scholar]
- Grazuleviciene R, Danileviciute A, Nadisauskiene R, Vencloviene J, 2009. Maternal smoking, GSTM1 and GSTT1 polymorphism and susceptibility to adverse pregnancy outcomes. Int. J. Environ. Res. Public. Health 6, 1282–97. 10.3390/ijerph6031282 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guxens M, Ballester F, Espada M, Fernández MF, Grimalt JO, Ibarluzea J, Olea N, Rebagliato M, Tardón A, Torrent M, Vioque J, Vrijheid M, Sunyer J, INMA Project, 2012. Cohort Profile: the INMA--INfancia y Medio Ambiente--(Environment and Childhood) Project. Int. J. Epidemiol 41, 930–40. 10.1093/ije/dyr054 [DOI] [PubMed] [Google Scholar]
- Haug LS, Sakhi AK, Cequier E, Casas M, Maitre L, Basagaña X, Andrusaityte S, Chalkiadaki G, Chatzi L, Coen M, de Bont J, Dedele A, Ferrand J, Grazuleviciene R, Gonzalez JR, Gutzkow KB, Keun HC, McEachan RRC, Meltzer HM, Petraviciene I, Robinson O, Saulnier P, Slama R, Sunyer J, Urquiza J, Vafeiadi M, Wright J, Vrijheid M, Thomsen C, 2018. In-utero and childhood chemical exposome in six European mother-child cohorts. Environ. Int 121(Pt 1), 751–763. 10.1016/j.envint.2018.09.056 [DOI] [PubMed] [Google Scholar]
- Heindel JJ, Balbus J, Birnbaum L, Brune-Drisse MN, Grandjean P, Gray K, Landrigan PJ, Sly PD, Suk W, Cory Slechta D, Thompson C, Hanson M, 2015. Developmental Origins of Health and Disease: Integrating Environmental Influences. Endocrinology 156, 3416–3421. 10.1210/EN.2015-1394 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hernandez-Ferrer C, Gonzalez JR, 2018. rexposome: Exposome exploration and outcome data analysis. R package version 1.2.0
- Heude B, Forhan A, Slama R, Douhaud L, Bedel S, Saurel-Cubizolles M-J, Hankard R, Thiebaugeorges O, De Agostini M, Annesi-Maesano I, Kaminski M, Charles M-A, EDEN mother-child cohort study group, 2016. Cohort Profile: The EDEN mother-child cohort on the prenatal and early postnatal determinants of child health and development. Int. J. Epidemiol 45, 353–363. 10.1093/ije/dyv151 [DOI] [PubMed] [Google Scholar]
- Hystad P, Davies HW, Frank L, Van Loon J, Gehring U, Tamburic L, Brauer M, 2014. Residential Greenness and Birth Outcomes: Evaluating the Influence of Spatially Correlated Built-Environment Factors. Environ. Health Perspect 10.1289/ehp.1308049 [DOI] [PMC free article] [PubMed]
- Jeong CH, Wagner ED, Siebert VR, Anduri S, Richardson SD, Daiber EJ, McKague AB, Kogevinas M, Villanueva CM, Goslan EH, Luo W, Isabelle LM, Pankow JF, Grazuleviciene R, Cordier S, Edwards SC, Righi E, Nieuwenhuijsen MJ, Plewa MJ, 2012. Occurrence and Toxicity of Disinfection Byproducts in European Drinking Waters in Relation with the HIWATE Epidemiology Study. Environ. Sci. Technol 46, 12120–12128. 10.1021/es3024226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lenters V, Portengen L, Smit LAM, Jönsson BAG, Giwercman A, Rylander L, Lindh CH, Spanò M, Pedersen HS, Ludwicki JK, Chumak L, Piersma AH, Toft G, Bonde JP, Heederik D, Vermeulen R, 2015. Phthalates, perfluoroalkyl acids, metals and organochlorines and reproductive function: a multipollutant assessment in Greenlandic, Polish and Ukrainian men. Occup. Environ. Med 72, 385–93. 10.1136/oemed-2014-102264 [DOI] [PubMed] [Google Scholar]
- Lim SS, Vos T, Flaxman AD, Danaei G, Shibuya K, Adair-Rohani H, Amann M, Anderson HR, Andrews KG, Aryee M, Atkinson C, Bacchus LJ, Bahalim AN, Balakrishnan K, Balmes J, Barker-Collo S, Baxter A, Bell ML, Blore JD, Blyth F, Bonner C, Borges G, Bourne R, Boussinesq M, Brauer M, Brooks P, Bruce NG, Brunekreef B, Bryan-Hancock C, Bucello C, Buchbinder R, Bull F, Burnett RT, Byers TE, Calabria B, Carapetis J, Carnahan E, Chafe Z, Charlson F, Chen H, Chen JS, Cheng AT-A, Child JC, Cohen A, Colson KE, Cowie BC, Darby S, Darling S, Davis A, Degenhardt L, Dentener F, Des Jarlais DC, Devries K, Dherani M, Ding EL, Dorsey ER, Driscoll T, Edmond K, Ali SE, Engell RE, Erwin PJ, Fahimi S, Falder G, Farzadfar F, Ferrari A, Finucane MM, Flaxman S, Fowkes FGR, Freedman G, Freeman MK, Gakidou E, Ghosh S, Giovannucci E, Gmel G, Graham K, Grainger R, Grant B, Gunnell D, Gutierrez HR, Hall W, Hoek HW, Hogan A, Hosgood HD, Hoy D, Hu H, Hubbell BJ, Hutchings SJ, Ibeanusi SE, Jacklyn GL, Jasrasaria R, Jonas JB, Kan H, Kanis JA, Kassebaum N, Kawakami N, Khang Y-H, Khatibzadeh S, Khoo J-P, Kok C, Laden F, Lalloo R, Lan Q, Lathlean T, Leasher JL, Leigh J, Li Y, Lin JK, Lipshultz SE, London S, Lozano R, Lu Y, Mak J, Malekzadeh R, Mallinger L, Marcenes W, March L, Marks R, Martin R, McGale P, McGrath J, Mehta S, Mensah GA, Merriman TR, Micha R, Michaud C, Mishra V, Mohd Hanafiah K, Mokdad AA, Morawska L, Mozaffarian D, Murphy T, Naghavi M, Neal B, Nelson PK, Nolla JM, Norman R, Olives C, Omer SB, Orchard J, Osborne R, Ostro B, Page A, Pandey KD, Parry CDH, Passmore E, Patra J, Pearce N, Pelizzari PM, Petzold M, Phillips MR, Pope D, Pope CA, Powles J, Rao M, Razavi H, Rehfuess EA, Rehm JT, Ritz B, Rivara FP, Roberts T, Robinson C, Rodriguez-Portales JA, Romieu I, Room R, Rosenfeld LC, Roy A, Rushton L, Salomon JA, Sampson U, Sanchez-Riera L, Sanman E, Sapkota A, Seedat S, Shi P, Shield K, Shivakoti R, Singh GM, Sleet DA, Smith E, Smith KR, Stapelberg NJC, Steenland K, Stöckl H, Stovner LJ, Straif K, Straney L, Thurston GD, Tran JH, Van Dingenen R, van Donkelaar A, Veerman JL, Vijayakumar L, Weintraub R, Weissman MM, White RA, Whiteford H, Wiersma ST, Wilkinson JD, Williams HC, Williams W, Wilson N, Woolf AD, Yip P, Zielinski JM, Lopez AD, Murray CJL, Ezzati M, AlMazroa MA, Memish ZA, 2012. A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990–2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet Lond. Engl 380, 2224–60. 10.1016/S0140-6736(12)61766-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magnus P, Birke C, Vejrup K, Haugan A, Alsaker E, Daltveit AK, Handal M, Haugen M, Høiseth G, Knudsen GP, Paltiel L, Schreuder P, Tambs K, Vold L, Stoltenberg C, 2016. Cohort Profile Update: The Norwegian Mother and Child Cohort Study (MoBa). Int. J. Epidemiol 45, 382–8. 10.1093/ije/dyw029 [DOI] [PubMed] [Google Scholar]
- Maitre L, de Bont J, Casas M, Robinson O, Aasvang GM, Agier L, Andrušaitytė S, Ballester F, Basagaña X, Borràs E, Brochot C, Bustamante M, Carracedo A, de Castro M, Dedele A, Donaire-Gonzalez D, Estivill X, Evandt J, Fossati S, Giorgis-Allemand L Gonzalez JR, Granum B, Grazuleviciene R, Bjerve Gützkow K, Småstuen Haug L, Hernandez-Ferrer C, Heude B, Ibarluzea J, Julvez J, Karachaliou M, Keun HC, Hjertager Krog N, Lau CE, Leventakou V, Lyon-Caen S, Manzano C, Mason D, McEachan R, Meltzer HM, Petraviciene I, Quentin J, Roumeliotaki T, Sabido E, Saulnier PJ, Siskos AP, Siroux V, Sunyer J, Tamayo I, Urquiza J, Vafeiadi M, van Gent D, Vives-Usano M, Waiblinger D, Warembourg C, Chatzi L, Coen M, van den Hazel P, Nieuwenhuijsen MJ, Slama R, Thomsen C, Wright J, Vrijheid M, 2018. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open 8(9), e021311. 10.1136/bmjopen-2017-021311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patel CJ, Bhattacharya J, Butte AJ, 2010. An Environment-Wide Association Study (EWAS) on Type 2 Diabetes Mellitus. PLoS ONE 5, e10746. 10.1371/journal.pone.0010746 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patel CJ, Ioannidis JPA, 2014. Placing epidemiological results in the context of multiplicity and typical correlations of exposures. J. Epidemiol. Community Health 68, 1096–1100. 10.1136/jech-2014-204195 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patel CJ, Ioannidis JPA, Cullen MR, Rehkopf DH, 2015. Systematic Assessment of the Correlations of Household Income With Infectious, Biochemical, Physiological, and Environmental Factors in the United States, 1999–2006. Am. J. Epidemiol 181, 171–179. 10.1093/aje/kwu277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patel CJ, Manrai AK, 2015. Development of exposome correlation globes to map out environment-wide associations. Pac. Symp. Biocomput 20, 231–242. 10.1142/9789814644730_0023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson O, Basagaña X, Agier L, de Castro M, Hernandez-Ferrer C, Gonzalez JR, Grimalt JO, Nieuwenhuijsen M, Sunyer J, Slama R, Vrijheid M, 2015. The Pregnancy Exposome: Multiple Environmental Exposures in the INMA-Sabadell Birth Cohort. Environ. Sci. Technol 49, 10632–10641. 10.1021/acs.est.5b01782 [DOI] [PubMed] [Google Scholar]
- Robinson O, Tamayo I, de Castro M, Valentin A, Giorgis-Allemand L, Hjertager Krog N, Marit Aasvang G, Ambros A, Ballester F, Bird P, Chatzi L, Cirach M, Dėdelė A, Donaire-Gonzalez D, Gražuleviciene R, Iakovidis M, Ibarluzea J, Kampouri M, Lepeule J, Maitre L, McEachan R, Oftedal B, Siroux V, Slama R, Stephanou EG, Sunyer J, Urquiza J, Vegard Weyde K, Wright J, Vrijheid M, Nieuwenhuijsen M, Basagaña X, 2018. The Urban Exposome during Pregnancy and Its Socioeconomic Determinants. Environ. Health Perspect 126, 77005. 10.1289/EHP2862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson O, Vrijheid M, 2015. The Pregnancy Exposome. Curr. Environ. Health Rep 2, 204–213. 10.1007/s40572-015-0043-2 [DOI] [PubMed] [Google Scholar]
- Slama R, Vrijheid M, 2015. Some challenges of studies aiming to relate the Exposome to human health. Occup. Environ. Med 72, 383–384. 10.1136/oemed-2014-102546 [DOI] [PubMed] [Google Scholar]
- Stafoggia M, Breitner S, Hampel R, Basagaña X, 2017. Statistical Approaches to Address Multi-Pollutant Mixtures and Multiple Exposures: the State of the Science. Curr Environ Health Rep 4(4), 481–490. 10.1007/s40572-017-0162-z. [DOI] [PubMed] [Google Scholar]
- Stuart EA, Azur M, Frangakis C, Leaf P, 2009. Multiple imputation with large data sets: a case study of the Children’s Mental Health Initiative. Am. J. Epidemiol 169, 1133–9. 10.1093/aje/kwp026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Temam S, Burte E, Adam M, Antó JM, Basagaña X, Bousquet J, Carsin A-E, Galobardes B, Keidel D, Künzli N, Le Moual N, Sanchez M, Sunyer J, Bono R, Brunekreef B, Heinrich J, de Hoogh K, Jarvis D, Marcon A, Modig L, Nadif R, Nieuwenhuijsen M, Pin I, Siroux V, Stempfelet M, Tsai M-Y, Probst-Hensch N, Jacquemin B, 2017. Socioeconomic position and outdoor nitrogen dioxide (NO2) exposure in Western Europe: A multi-city analysis. Environ. Int 101, 117–124. 10.1016/j.envint.2016.12.026 [DOI] [PubMed] [Google Scholar]
- van Buuren S, Groothuis-Oudshoorn K, 2011. mice : Multivariate Imputation by Chained Equations in R. J. Stat. Softw 45, 1–67. 10.18637/jss.v045.i03 [DOI] [Google Scholar]
- Vrijheid M, Casas M, Gascon M, Valvi D, Nieuwenhuijsen M, 2016. Environmental pollutants and child health—A review of recent concerns. Int. J. Hyg. Environ. Health 219, 331–342. 10.1016/j.ijheh.2016.05.001 [DOI] [PubMed] [Google Scholar]
- Vrijheid M, Slama R, Robinson O, Chatzi L, Coen M, van den Hazel P, Thomsen C, Wright J, Athersuch TJ, Avellana N, Basagaña X, Brochot C, Bucchini L, Bustamante M, Carracedo A, Casas M, Estivill X, Fairley L, van Gent D, Gonzalez JR, Granum B, Gražulevičienė R, Gutzkow KB, Julvez J, Keun HC, Kogevinas M, McEachan RRC, Meltzer HM, Sabidó E, Schwarze PE, Siroux V, Sunyer J, Want EJ, Zeman F, Nieuwenhuijsen MJ, 2014. The human early-life exposome (HELIX): project rationale and design. Environ. Health Perspect 122, 535–44. 10.1289/ehp.1307204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- White IR, Royston P, Wood AM, 2011. Multiple imputation using chained equations: Issues and guidance for practice. Stat. Med 30, 377–399. 10.1002/sim.4067 [DOI] [PubMed] [Google Scholar]
- Wild CP, 2005. Complementing the genome with an "exposome": the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol. Biomark. Prev. Publ. Am. Assoc. Cancer Res. Cosponsored Am. Soc. Prev. Oncol 14, 1847–50. 10.1158/1055-9965.EPI-05-0456 [DOI] [PubMed] [Google Scholar]
- Willett WC, 2002. Balancing Life-Style and Genomics Research for Disease Prevention. Science 296, 695–698. 10.1126/science.1071055 [DOI] [PubMed] [Google Scholar]
- Wright J, Small N, Raynor P, Tuffnell D, Bhopal R, Cameron N, Fairley L, Lawlor DA, Parslow R, Petherick ES, Pickett KE, Waiblinger D, West J, Born in Bradford Scientific Collaborators Group, 2013. Cohort Profile: the Born in Bradford multi-ethnic family cohort study. Int. J. Epidemiol 42, 978–91. 10.1093/ije/dys112 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.