Abstract
Background:
The Seychelles Child Development Study has been examining the relationship between prenatal methylmercury (MeHg) exposure from consuming fish during pregnancy and child development. This study re-analyzes seven outcomes in the 17 year Main Cohort data to determine if there are nonlinear or non-homogeneous (subgroup) associations that were not identified in the linear analysis.
Methods:
We adopted two statistical approaches. First, we carried out an additive nonlinear analysis assuming homogeneous prenatal MeHg-outcome relationships to explore overall associations. Second, we applied the regression tree to the Woodcock-Johnson Calculation subtest (it was significantly associated in earlier analyses) and identified 4 clusters based on covariates. Then we used additive models to assess the prenatal MeHg association in each of the four clusters for all seven outcomes. This approach assumes nonlinear associations in each cluster and non-homogeneous associations between clusters.
Results:
The additive nonlinear analysis yielded prenatal MeHg curves similar to the linear analysis. For the regression tree analysis, the curves relating prenatal MeHg to outcomes between the 4 clusters differed and some crossed at higher prenatal MeHg levels, suggesting non-homogeneity in the upper range of exposure. Additionally, some of the curves suggested a possible non-linear relationship within the range of exposure we studied.
Conclusion:
This non-linear analysis supports the findings from the linear analysis. It shows little evidence to support an adverse association of prenatal MeHg exposure through maternal consumption of fish contaminated with natural background levels. However, the tree analysis suggests that the prenatal exposure/outcome relationship may not be homogeneous across all individuals and that some subpopulations may have an adverse association in the upper range of the exposures studied. More robust data in the higher levels of exposure in this cohort are needed to confirm this finding.
Keywords: Child development, methylmercury, generalized additive models, prenatal exposure, regression tree
1. Introduction
Fish is a low fat, high protein food containing nutrients that are important for optimal fetal brain and nervous system development (Innis 2007; Ralston 2010). However, all fish also acquire small amounts of naturally occurring methylmercury (MeHg) from the aquatic food chain (National Research Council 2000). MeHg in adequate concentrations is a neurotoxicant that can affect the central nervous system (ATSDR 1999). The level of exposure associated with adverse effects is presently unknown. The fetus is known to be especially vulnerable to MeHg and the highest risk is considered to be during pregnancy (ATSDR 1999). Recent updates from U.S. federal agencies have encouraged fish consumption that is low in mercury by women of child bearing age for nutritional benefits (EPA/FDA 2014).
“The Seychelles Child Development Study (SCDS) is a longitudinal cohort study designed to test the hypothesis that prenatal MeHg exposure from a diet high in fish is associated with adverse neurodevelopmental outcomes. The Seychellois consume ocean fish daily, but do not consume sea mammals or fresh water fish” (Davidson et al. 2011, p:712). Comprehensive neurodevelopmental assessments on the SCDS Main Cohort were carried out at 10 different ages between 6 months and 24 years and have found no consistent pattern of evidence to support the hypothesis that prenatal MeHg exposure from consumption of fish with naturally acquired levels is associated with delays in neurodevelopment. In fact, we have observed improved performance on some endpoints associated with increasing prenatal MeHg in the range achieved by fish consumption. These benefits were found as early as 29 months (Davidson et al. 1995) and were still present in the evaluation at 17 years of age (Davidson et al. 2011) and most recently at 22 and 24 years of age (van Wijngaarden et al. 2017). These findings have been attributed to the influence of the nutritional benefits of fish, albeit the nutritional status indicators during pregnancy were not measured in the Main Cohort. Subsequently we measured maternal nutritional status in a new cohort and found evidence supporting the benefit of certain nutrients (Davidson et al. 2008; Strain et al. 2008; Lynch et al. 2011; Strain et al. 2012).
Studies in the Faroe Islands have reported some adverse associations between prenatal MeHg exposure and child development (Grandjean et al. 1997; Debes et al. 2006). However, the Faroes population consumes pilot whales which are contaminated by other pollutants such as PCBs (see review by Weihe and Joensen 2012), while the Seychelles population does not consume sea mammals and their PCB levels are very low (Davidson et al. 1998). Follow-up studies in the Faroe Islands (Debes et al. 2016) and studies in Canadian Inuit with concomitant PCB exposure (Jacobson et al. 2015) also reported associations that they attributed to MeHg exposure.
In 1989, Cox and colleagues reported that the relationship between prenatal MeHg exposure and children’s development was possibly nonlinear and that prenatal exposure to MeHg above 10 ppm (measured in maternal hair) may be associated with declines in neurodevelopment (Cox et al. 1989). Primary analyses of the SCDS Main Cohort data were based on conventional linear models. These models assume a linear relationship between prenatal MeHg levels and neurodevelopmental test scores. However, if the true exposure/outcome relationship is nonlinear, then forcing a straight line fit to the data may lead to a biased inference about the nature of the association. Nonlinear relationships have been reported with other exposures such as the inverted U–shaped association between maternal blood manganese levels at delivery and birth weight in full-term infants (Zota et al. 2009).
We have previously explored nonlinear analysis (Axtell et al. 2000; Huang et al. 2005) using semiparametric additive models (Hastie and Tibshirani, 1990) as secondary analyses of the SCDS Main Cohort data at the 66-month and 9-year evaluations. A nonlinear curve, which appeared to be nearly flat when the prenatal MeHg level was below approximately 12 ppm and linear above that level, led Huang et al. (2005) to suggest that the beneficial nutritional effects may predominate at low prenatal MeHg exposures whereas adverse MeHg associations may occur at higher prenatal exposure levels. Additionally Huang et al. (2007) carried out a regression tree analysis (Breiman et al. 1984) of the 9-year data, examining whether the prenatal MeHg-outcome relationship could be non-homogeneous between subgroups of the population as there might be vulnerable subgroups.
In this paper we reanalyze the SCDS Main Cohort data at age 17 years with two approaches, semiparametric additive models to explore overall nonlinear prenatal MeHg trends, and a combination of regression tree and semiparametric additive models to explore nonlinear and non-homogeneous prenatal MeHg associations in different subpopulations.
2. MATERIALS AND METHODS
2.1. Participants and MeHg Exposure
A total of 705 children of the 779 Seychellois infant-mother pairs originally enrolled in 1989–1990 (Marsh et al. 1995), were still eligible for evaluation at age 17 years. Seventy-four participants were excluded for a priori specified reasons including lack of prenatal exposure data, medical conditions that might affect development, or withdrawal from the study (Davidson et al. 1995). Prenatal MeHg exposure was measured in maternal hair samples as previously described (Cernichiari et al. 1995). “Recent postnatal exposure was measured in a 1-cm length of each child’s hair closest to the scalp taken at the time of testing. A total of 600 participants ranging in age from 15.7 to 18.4 years participated in the 17 year test battery” (Davidson et al. 2011, p:712) and 462 of them have at least one neurocognitive endpoint and a full set of covariates. We compared the participants with and without a full set of covariates, and they appear comparable (Davidson et al. 2011, p:713, Table 1a). The sizes of data sets range from 451 to 456 for the seven endpoints considered in this re-analysis.
This research was reviewed and approved by the Institutional Review Boards of both the University of Rochester and the Republic of Seychelles in accordance with national and institutional guidelines for the protection of human subjects.
2.2. Neurocognitive Testing Procedures
Since a linear line may be thought of as a rough approximation of a nonlinear curve, we adopted the results from linear analysis (Davidson et al. 2011) to select endpoints for nonlinear analysis. We chose to reanalyze only those 7 endpoints for which the coefficient for prenatal MeHg exposure in the primary linear regression analysis had a two-tailed p-value less than or equal to 0.2 (Davidson et al. 2011). The endpoints analyzed were as follows: the Woodcock-Johnson Test of Scholastic Achievement-II (WJ-II) Calculation and Applied Problems subtests, the California Verbal Learning Test (CVLT) Short Delay and Long Delay subtests, and three subtests from the Cambridge Neuropsychological Test Automated Battery (CANTAB): the Log Total Trials and the Log pre-Extra Dimensional Errors from the Intra-Extra Dimensional Shift Set (IED), and the Square Root Between Errors from Spatial Recognition Memory (SRM). For the CANTAB subtests, the Log and Square Root transformations were used to stabilize the variance in the linear analysis (Davidson et al. 2011) and we used the same transformed outcomes here. “Four subjects who tested as color blind were excluded from the CANTAB analyses since it requires color vision” (Davdison et al. 2011, p:713). An increase in the score is associated with improved performance on the WJ-II and CVLT subtests, but for the three CANTAB subtests decreased scores mean improved performance. Participants in the 17 year test battery range in age from 15.7 to 18.4 years.
2.3. Covariates
The covariates included in this analysis were the same as those in the linear analysis (Davidson et al. 2011) and included the following: child sex, socioeconomic (SES) status (measured at 9 years using the Hollingshead Four-Factor Socioeconomic Status modified for use in the Seychelles), maternal intelligence (measured using the Kaufman Brief Intelligence Test Matrices at the 10.5 year subject evaluation; K-BIT), child’s age at testing, and recent postnatal MeHg exposure.
2.4. Statistical Analysis
The analysis was carried out using the S-PLUS (Insightful 2007) and R (2015) software packages. Two statistical approaches were explored. First semiparametric additive models (Hastie and Tibshirani 1990) were applied to the data to explore overall nonlinear prenatal MeHg trends. This approach assumed nonlinear and homogeneous MeHg-outcome relationships while retaining the linear structure for covariates and postnatal MeHg effects as in primary linear analysis (Davidson et al. 2011). The essential ideas for fitting nonlinear curves are locally weighted averaging of the observations that fall in a window and the window moves continuously on the data range. Additive models have often been applied to deal with nonlinearity between the dependent and predictor variables in data analysis, e.g. Rahman et al. (2017). For fitting a nonlinear prenatal MeHg curve estimate, we adopted penalized Fourier regression (Huang and Chan 2014) based on a combination of un-penalized lines, as in linear models, and penalized trigonometric series. For the trigonometric series, 6 cosine and sine functions of x=MeHg, cos(2k ‧ π ‧ x/ mrange) and sin(2k ‧ π ‧ x/ mrange) for k=1, …,6 are taken, where mrange denotes the range of prenatal MeHg exposure. The penalty for the trigonometric functions is constrained asλk2, with λ chosen so that the approximate degree of freedom (df) is 3 for each curve, while linear regression uses only 1df. Huang and Chan (2014) have shown that the combined line-trigonometric approach has a mathematical equivalence to a mixed effects (ME) model representation and to local linear smoothing methods (Fan and Gijbels 1996). A plot showing the curve of the fitted smooth function describes the contribution of prenatal MeHg to the additive predictor for the developmental outcome.
The second approach considers nonlinear and non-homogeneous prenatal MeHg effects by a combination of regression tree (Breiman et al. 1984) and semiparametric additive models. When a complex relationship is expected between the response variable and independent variables, regression tree methods can be used for capturing non-additive and non-homogeneous effects by partitioning data to homogeneous subsets or clusters, e.g. Kim (2010). We identified the patterns of covariate effects at different levels of WJ-II Calculation scores using a regression tree without regard to prenatal MeHg exposure. We selected the WJ-II Calculation subtest because Davidson et al. (2011) had previously identified its association with prenatal MeHg exposure (slope=0.39, p=0.02). The regression tree was formed by successively recursively partitioning the data into two groups. At each partitioning, a cut point for one of the covariates was chosen such that the two groups’ WJ-II Calculation scores were statistically the most different from each other. The repeated splitting of the data results in the growth of the tree and the final clusters (groups) are termed “terminal nodes”. It is clear that the more splitting done, the smaller the bias and the larger the variance of the estimated means at each terminal node. The final tree size is determined by cross-validation (Breiman et al. 1984) that was similar to assessing the bias and variance trade-off for common regression problems (pruning the tree). The average of the responses falling in a terminal node (cluster) of the final tree is used to estimate the cluster mean for the covariate effects.
After partitioning the data into clusters, we used semiparametric additive models (Hastie and Tibshirani, 1990) to estimate the nonlinear curve for prenatal MeHg associations for each cluster on the 7 outcomes, while adjusting for covariate associations based on the cluster means from the regression tree. Let Yij denote the j-th observed dependent variable in cluster i, and Xij denote the corresponding prenatal MeHg exposure. The model under consideration is
For each of the outcomes, the cluster means αi’s were estimated by the averages of outcome values falling in that cluster. Model 1 assumes that the cluster means αi’s adequately account for the covariate effects and allows separate (heterogeneous) nonlinear effects of prenatal MeHg exposure among clusters by the si functions. The model makes no assumptions about the functional forms of the relationships between the means of outcomes (E(Yij)) and prenatal MeHg, and is thus able to highlight trends that differ from linear lines. However, this model assumes that the effects of αi and si(x) are additive. Observe that if si(x) = βx, then Model 1 reduces to a linear relationship of the outcome on prenatal MeHg. Thus the nonlinear assumption includes the linear association as a special case. The smooth functions in Model 1 were fit by penalized Fourier regression as in the additive models. A plot shows the curves of the fitted smooth functions si’s, and the cluster means αi’s are seen as the values of curves when prenatal MeHg is set to 0. The points on the plot are data (Xij, Yij), i.e. the values of prenatal MeHg exposure and the dependent variable. If the curves appear to be approximately parallel, then it means that the prenatal MeHg trends are homogeneous between clusters. If two curves cross at some level C of prenatal MeHg exposure, then one group has a larger response for Xij <C and the other group has a larger response at larger MeHg levels Xij >C, and hence their associations with MeHg may be heterogeneous at higher levels.
3. Results
3.1. Covariate Associations from Regression Tree
We report only the results of the regression tree on the covariate effects in the second approach, as the semiparametric additive models assume linear covariate effects and the results were similar to those in Davidson et al. (2011). The tree analysis using the WJ-II Calculation subtest and covariates resulted in four clusters. The cluster rules, number (proportion) of observations in each cluster, and the mean (SD) of their pre- and postnatal MeHg exposure are presented in Table 1. The means for prenatal MeHg exposure appear similar between the clusters. In addition, an F-test for one-way analysis of variance (ANOVA) with p=0.27 confirms that the differences between the prenatal MeHg means of the 4 clusters were not significant. An ANOVA F-test comparing the recent postnatal MeHg means between the 4 clusters was significant (p<.0001) with the highest average recent postnatal MeHg exposure in cluster 3 (17.2 ppm, SD 3.8 ppm). SES was selected as the first partition variable and the two groups had an average WJ-II Calculation score of 81.5 (range 11–121, n=269) and 90.4 (range 54–152, n=185). Maternal IQ measured by K-BIT and recent postnatal MeHg were next entered as partition criteria in the regression tree. These findings support the significance of SES and maternal IQ on the WJ-II Calculation scores as reported in Davidson et al. (2011). In the primary linear analysis the slopes for SES and maternal IQ were 0.25 and 0.15 respectively with significant p-values (<0.05) (Davidson et al. 2011). The slope for recent postnatal MeHg in that analysis was −0.04 but it was not significant (p=0.82).
Table 1.
Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | |
---|---|---|---|---|
Cluster rules | SES<26.25 | SES>=26.25 | ||
K-BIT<67.5 | K-BIT>=67.5 | |||
recent postnatal MeHg <=13.10ppm | recent postnatal MeHg >13.10ppm | |||
Number (proportion) of observations | 65 (14.32%) | 178 (39.21%) | 26 (5.73%) | 185 (40.75%) |
Prenatal MeHg (ppm) | 6.33 (4.21) | 7.27 (4.35) | 7.87 (5.10) | 6.71 (4.41) |
Recent postnatal MeHg (ppm) | 8.54 (4.79) | 7.16 (3.13) | 17.16 (3.79) | 7.43 (4.56) |
mean (estimated cluster mean αi in Model 1) and (SD) | ||||
WJ-II Calculation | 75.20 (19.31) | 84.97 (14.47) | 73.54 (14.70) | 90.42 (15.35) |
WJ-II Applied Problems | 77.69 (15.19) | 85.58 (13.16) | 77.23 (11.58) | 91.26 (13.29) |
CVLT Long Delay (scaled) | −1.23 (1.32) | −0.71 (1.26) | −1.44 (1.02) | −0.49 (1.20) |
CVLT Short Delay (scaled) | −1.26 (1.34) | −0.57 (1.20) | −1.37 (1.20) | −0.42 (1.18) |
CANTAB IED Log Total Trials | 4.84 (0.34) | 4.73 (0.33) | 4.82 (0.29) | 4.69 (0.35) |
CANTAB IED Log pre-ED Errors | 2.25 (0.50) | 2.20 (0.48) | 2.33 (0.64) | 2.06 (0.47) |
CANTAB SRM Sqrt Between Errors | 5.42 (1.73) | 5.05 (1.65) | 4.95 (2.29) | 4.53 (1.69) |
SES= Hollingshead socioeconomic status, K-BIT=Matenral IQ, IED = Intra-Extra Dimensional Shift Set, SRM = Spatial Recognition Memory.
The average WJ-II Calculation score for observations falling in each cluster, estimates of αi’s in Model 1 for WJ-II Calculation, are also given in Table 1. Cluster 4 had the highest average WJ-II Calculation score of 90.4. As shown in Table 1, it contains 185 (40.75%) of the 451 observations with an SES ≧26.25. Cluster 2 has the second highest average WJ-II Calculation score of 85.0 and contains 178 (39.21%) children. Comparing the covariates between clusters 2 and 4, cluster 2 has a lower SES, but with a maternal IQ ≧67.5 and a recent postnatal MeHg exposure ≤13.1 ppm; their average performance on the WJ-II Calculation score differs by 5.4 points. Cluster 1, with 65 (14.32%) observations, has a lower SES than cluster 4, a maternal IQ< 67.5, and the average WJ-II Calculation score is lower by about 15 points. Cluster 3 has the smallest number of observations 26 (5.73%) and a low average WJ-II Calculation score of 73.5. Comparing clusters 2 and 3 suggests that a postnatal MeHg exposure >13.1 ppm may have a nonlinear adverse association with an average difference of 11.5 points in WJ-II Calculation scores. An ANOVA F-test comparing the WJ-II Calculation score means between the 4 clusters is significant (p<.0001). For the other six outcomes, the estimated ith-cluster mean αi in Model 1 is the average of outcome values in each cluster and is given in Table 1. ANOVA F-tests comparing the means of six outcomes between the 4 clusters are all significant with p values ranging from .0034 to < .0001. The results of the ANOVA F-tests support the partition into clusters by the regression tree that resulted in different covariates characteristics between groups and significant differences in outcomes.
Covariate effects appear to be generally consistent with those reported in Davidson et al. (2011) and with previous knowledge. For example, for cluster 4 with SES ≥26.25, the average response is the largest for WJ-II and CVLT subtests and smallest for CANTAB subtests among the 4 clusters, implying participants with higher SES do better on these computerized neurocognitive tests.
3.2. Prenatal MeHg Exposure
The results of the semiparametric additive models which are not related to the regression tree, are shown in Figure 1 for WJ-II Calculation scores, CVLT-Long Delay, and log-transformed CANTAB IED Total Trials to Completion scores. The results for the other 4 outcomes (not shown) are similar. These analyses explore the overall nonlinear prenatal MeHg trends. They assume homogeneous MeHg-outcome relationships and the linear structure for covariates and recent postnatal MeHg associations as in Davidson et al. (2011). In the linear analysis (Davidson et al. 2011), increasing prenatal MeHg exposure was associated with improved WJ-II Calculation scores (slope=0.39, p=0.02) and beneficial log-transformed CANTAB IED Total Trials to Completion scores (slope=−0.01, p=0.02), while the p=0.12 for CVLT-Long Delay was not significant (slope=0.02). By visual examination, the curve for the WJ-II Calculation scores slowly increases at prenatal MeHg levels below approximately 15 ppm and is flat above 15 ppm, while linear trends are seen for CVLT-Long Delay and log-transformed CANTAB IED Total Trials for MeHg levels below approximately 8 and 12 ppm respectively. It is important to note sparsity of data at the higher MeHg levels and trends in the upper range have more variation making estimation more tentative. Overall, the plots in Figure 1 show small oscillations from the linear trends that are consistent with the linear analysis (Davidson et al. 2011) at the lower MeHg levels, but effects in the upper range studied, above approximately 8–15 ppm of prenatal MeHg exposure are uncertain due to the increase in variability.
Based on the second approach to explore nonlinear and non-homogeneous prenatal MeHg effects, the curves for the 4 WJ-II Calculation clusters are shown in Figure 2. The curves for clusters 2 and 4 are slowly increasing, consistent with the linear analysis. For cluster 1, the curve slowly increases to an exposure of approximately 15 ppm. For cluster 3 with only 26 observations, the curve increases to about 7 ppm and then decreases slowly to 15 ppm. The curves for clusters 1 and 3 cross at around 8 ppm and cluster 1 has a higher WJ-II Calculation score after the crossing point. This suggests non-homogeneity of prenatal MeHg associations at higher exposure levels in Clusters 1 and 3, but the two clusters account for only 20% of the data. The curves for the WJ-II Applied Problems (not shown) are similar to those for the WJ-II Calculation except that the changing point from increasing to decreasing for cluster 3 is approximately 8 ppm and the curves for clusters 1 and 3 cross at around 10 ppm. The linear analysis of the association between the WJ-II Applied Problems and prenatal MeHg exposure did not reach significance (slope=0.19, p=0.20) (Davidson et al. 2011). The R2 values for WJ-II Calculation and Applied Problems increase to 0.16 and 0.15 respectively from those of the linear analysis (0.11 for each) (Davidson et al. 2011). This phenomenon is somewhat expected since Model 1 uses a total of 16 df, while that for the linear analysis is 7 df. F-tests for checking whether the changes in R2 values are significant cannot be performed since the linear model in Davidson et al. (2001) is not nested in model (1) (note that the covariate effects in model (1) are taken account by αi’s).
The trends for CVLT-Long Delay and Short Delay were similar and hence we discuss only the Long Delay (Figure 3). The curves for clusters 1, 2 and 4 are slowly increasing at lower MeHg levels, and becoming nearly flat afterwards. The curve for cluster 3 changes from increasing to flat to decreasing at around 8 ppm, with an influential value (22.7 ppm) driving the curve slightly upward at the end. Here the curves of clusters 1 and 3 are close at levels of 4–6 ppm and then cluster 1 has a higher score than cluster 3 at MeHg levels > 9 ppm, again suggesting non-homogeneity of prenatal MeHg associations at higher exposure levels in Clusters 1 and 3. The R2 values increase from 0.069 and 0.066 in the linear analysis to 0.084 and 0.11 in Model 1 for the CVLT-Long Delay and Short Delay respectively.
For the log-transformed CANTAB IED Total Trials to Completion scores, the curve for cluster 2 (Figure 4) is slowly decreasing (a beneficial association); for cluster 4, the curve slowly decreases below approximately 10 ppm, then goes slightly upward and finally is nearly flat; for cluster 1, the curve is generally decreasing. The trend for cluster 3 is irregular, but there are only 26 observations in this cluster. The four curves move with different rates such that the curves for clusters 2 and 4 cross at around 11 ppm, and those for clusters 1 and 3 cross at around 10 ppm. Compared to the other clusters, cluster 4 starts with a lower number of IED total trials at the lowest prenatal MeHg exposure and after the crossing point, has a higher number of IED trials than cluster 2. This suggests non-homogeneity of prenatal MeHg associations at higher exposure levels. A similar pattern was present in the other two CANTAB endpoints (Log IED pre-Extra Dimensional Errors, and the Square Root Between Errors from SRM which are not shown). The R2 values increase to 0.064, 0.070, and 0.066 in Model 1 from 0.033, 0.033, and 0.051 respectively in the linear analysis for Log IED Total Trials, Log IED pre-Extra Dimensional Errors, and the Square Root Between Errors from SRM.
4. Discussion
In this analysis of 7 outcomes in the 17 year SCDS Main cohort data, using nonlinear statistical methods that explain more variances than linear analysis (Davidson et al. 2011), we found no evidence to support an adverse association with children’s neurodevelopment at prenatal MeHg exposure levels below 10 ppm measured in maternal hair growing during pregnancy. However, evaluating subgroups of this population with non-linear analysis we found a suggestion that there may be adverse associations of prenatal MeHg exposure above 10 ppm and children’s neurodevelopment in some individuals. The 10 ppm threshold used here follows that of Cox et al. (1989) since it is beyond the scope of this analysis to determine a threshold level, though Figure 1 shows a range of 8–15 ppm and earlier Huang et al. (2005) had conjectured 12 ppm.
Linear analysis has been the a priori primary analysis since the SCDS began, but we have periodically analyzed the data for nonlinear or non-homogeneous trends (Axtell et al. 2000; Huang et al. 2005; Huang et al. 2007). The improved scores for the WJ-II Calculation and IED total trials seen in the linear analysis (Davidson et al. 2011) as MeHg exposure increases were also present in the semiparametric additive models with the improving scores for WJ-II Calculation leveling off at approximately 15 ppm, and in the regression tree approach for clusters 2 and 4 that represent a majority (80%) of the data for both endpoints. Since there is no reason to believe that MeHg exposure at any concentration should improve performance, it seems likely that MeHg measurements are a surrogate marker for beneficial nutrients from fish consumption. Although maternal nutritional status was not measured in this cohort, we have measured nutrients in other SCDS cohorts (Davidson et al. 2008; Strain et al. 2008; Lynch, et al. 2011; Strain et al. 2012).
The regression tree analysis supported the significance of maternal IQ and SES that was identified in the linear analysis and suggested a possible adverse association with recent postnatal MeHg in cluster 3 above an exposure of approximately 13 ppm. We are cautious about interpreting the recent postnatal MeHg exposure nonlinear associations in a small group of only 26 observations because the SCDS was originally designed to address prenatal MeHg exposure, and our measure of postnatal exposure reflects only a one month exposure period. In addition, we are not aware of any a priori basis why this group (lower SES, higher maternal IQ, and higher recent MeHg exposure) should be particularly sensitive to prenatal MeHg compared with the children in the other three clusters.
When additive models were applied to the 4 identified subject clusters, the dose response curves suggested some nonlinear patterns. That finding was not apparent in the linear analysis. The dose response curves for clusters 1 and 3 crossed at higher exposure levels for the WJ-II Calculation and Applied Problems scores and the CVLT Short- and Long-Delay. This finding suggests that there may be a non-homogeneous prenatal MeHg association in the upper range of exposure for two of the clusters. This non-homogeneity is also apparent in the 3 CANTAB subtests where there are several intersections of the 4 curves. The intersections for most of the dose response curves occur at levels close to 10 ppm and there are fewer subjects above 10 ppm. Although the subgroups generated by this regression tree analysis did not provide clear evidence of adverse associations, studies of other groups might find such associations (Julvez et al. 2013).
Although the possibility of adverse associations at prenatal MeHg exposures above about 10 ppm is intriguing, we are cautious in making that interpretation because of the limited number of subjects in that exposure range. These interesting dose response curves for prenatal MeHg exposure present in different subpopulations do suggest a need for development of new statistical procedures to confirm the variability between such curves. Because of the descriptive nature of additive models, we consider the results presented here to be exploratory.
The strength of this analysis is that it examines nonlinear and nonhomogeneous trends in the data that might not be detected by linear analysis. However, there are also limitations. Some subgroups identified by the tree analysis may be so small that their associations may be difficult to interpret in a population study. In addition, the interpretations of the fitted nonlinear curves are not as simple as in linear analysis.
In conclusion, this non-linear additive model and regression tree analysis supports the linear analysis reported earlier (Davidson et al. 2011). We found no consistent evidence for associations of prenatal MeHg exposure from consuming fish with naturally occurring levels of MeHg and children’s neurodevelopment in the SCDS Main cohort at 17 years of age. The current results do raise some interesting points though. The dose response curves appear to be nonlinear at higher exposures and some individuals or subpopulations with prenatal exposures above certain levels (possibly 10 ppm based on Cox et al. (1989)) may have adverse associations with children’s neurodevelopment. However the effects in the upper range of prenatal MeHg exposure studied are uncertain due to the sparsity of data at these levels in SCDS. Further study is needed to continue examining nonlinear and non-homogeneous relationships of neurodevelopmental outcomes and prenatal MeHg exposure through fish consumption during pregnancy.
Acknowledgments
Source of Funding: This research was supported by grants RO1-ES008442 and P30-ES01247 from the US National Institute of Environmental Health Sciences, National Institutes of Health, by NSC 100-2118-M-007-001 and 101-2118-M-007-002-MY2 from Taiwan National Science Council, and by the Government of the Republic of Seychelles.
Footnotes
Protection of Human Subjects: This research was reviewed and approved by the Institutional Review Boards of both the University of Rochester and the Republic of Seychelles in accordance with national and institutional guidelines for the protection of human subjects.
Conflicts of Interest:
We wish to confirm that there are no known conflicts of interest associated with this paper and there has been no significant financial support for this work that could have influenced its outcome.
References
- ATSDR (1999) Toxicological Profile for Mercury. Atlanta, GA: Centers for Disease Control; Available: http://www.atsdr.cdc.gov/toxprofiles/tp.asp?id=115&tid=24 Accessed July 2017. [Google Scholar]
- Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and Regression Trees. Chapman and Hall, New York. [Google Scholar]
- Cernichiari E, Toribara TY, Liang L, Marsh DO, Berlin MW, Myers GJ, Cox C, Shamlaye CF, Choisy O, Davidson P, Clarkson TW (1995) The biological monitoring of mercury in the Seychelles study. NeuroToxicology 16: 613–628. [PubMed] [Google Scholar]
- Cox C, Clarkson TW, Marsh DO, Amin-Zaki L, Tikriti S, Myers G (1989) Dose-response analysis of infants prenatally exposed to methylmercury: an application of a single compartment model to single-strand hair analysis. Environ Res 49:318–332. [DOI] [PubMed] [Google Scholar]
- Davidson PW, Cory-Slechta DA, Thurston SW, Huang L-S, Shamlaye CF, Gunzler D, Watson G, van Wijngaarden E, Zareba G, Klein JD, Clarkson TW, Strain JJ, Myers GJ (2011) Fish consumption and prenatal methylmercury exposure: cognitive and behavioral outcomes in the Main Cohort at 17 years from the Seychelles Child Development Study. NeuroToxicology 32: 711–717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davidson PW, Myers GJ, Cox C, Axtell C, Shamlaye C, Sloane-Reeves J, Cernichiari E, Needham L, Choi A, Wang Y, Berlin M, Clarkson TW (1998) Effects of prenatal and postnatal methylmercury exposure from fish consumption on neurodevelopment: outcomes at 66 months of age in the Seychelles Child Development Study. JAMA 280: 701–707. [DOI] [PubMed] [Google Scholar]
- Davidson PW, Myers GJ, Cox C, Shamlaye CF, Marsh DO, Tanner MA, Berlin M, Sloane-Reeves J, Cernichiari E, Choisy O (1995) Longitudinal neurodevelopmental study of Seychellois children following in utero exposure to methylmercury from maternal fish ingestion: outcomes at 19 and 29 months. NeuroToxicology 16: 677–688. [PubMed] [Google Scholar]
- Davidson PW, Strain JJ, Myers GJ, Thurston SW, Bonham MP, Shamlaye CF, Stokes-Riner A, Wallace JM, Robson PJ, Duffy EM, Georger LA, Sloane-Reeves J, Cernichiari E, Canfield RL, Cox C, Huang LS, Janciuras J, Clarkson TW (2008) Neurodevelopmental effects of maternal nutritional status and exposure to methylmercury from eating fish during pregnancy. NeuroToxicology 29: 767–775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Debes F, Budtz-Jorgensen E, Weihe P, White RF, Grandjean P (2006) Impact of prenatal methylmercury exposure on neurobehavioral function at age 14 years. Neurotoxicol Teratol. 28:363–375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- EPA/FDA. Fish: What Pregnant Women and Parents Should Know. Draft Updated Advice by FDA and EPA 2014. http://www.fda.gov/Food/FoodborneIllnessContaminants/Metals/ucm393070.htm Accessed July 2017.
- Fan J, Gijbels I (1996) Local polynomial regression and its applications. Chapman and Hall, New York. [Google Scholar]
- Grandjean P, Weihe P, White RF, Debes F, Araki S, Yokoyama K, Murata K, Sorensen N, Dahl R, Jorgensen PJ (1997) Cognitive deficit in 7-year-old children with prenatal exposure to methylmercury. Neurotoxicol Teratol 19: 417–428. [DOI] [PubMed] [Google Scholar]
- Hastie TJ, Tibshirani RJ (1990) Generalized additive models. Chapman and Hall, New York. [Google Scholar]
- Huang L-S, Cha K-S (2014) Local polynomial regression and penalized trigonometric series. Statistica Sinica 24: 1215–1238. [Google Scholar]
- Huang L-S, Cox C, Myers GJ, Davidson PW, Cernichiari E, Shamlaye F, Sloane-Reeves J, Clarkson TW (2005) Exploring nonlinear association between prenatal methylmercury exposure from fish consumption and child development: evaluation of the Seychelles Child Development Study nine-year data using semiparametric additive models. Environ Res 97: 100–108. [DOI] [PubMed] [Google Scholar]
- Huang L-S, Myers GJ, Davidson PW, Cox C, Xiao F, Thurston SW, Cernuchiarti E., Shamlaye CF, Sloane-Reeves J, Georger L, and Clarkson TW (2007) Is susceptibility to prenatal methylmercury exposure from fish consumption non-homogeneous? Tree-structured analysis for the Seychelles child development study. NeuroToxicology 28: 1237–1244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Innis SM (2007) Dietary (n-3) fatty acids and brain development. J Nutr. 137: 855–859. [DOI] [PubMed] [Google Scholar]
- Insightful Corp. (2007) S-Plus 8.0 for Windows Enterprise Developer.
- Julvez J, Smith GD, Golding J, Ring S, Pourcain BS, Gonzalez JR, Grandjean P (2013) Prenatal methylmercury exposure and genetic predisposition to cognitive deficit at age 8 years. Epidemiology 24: 643–650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim SE. (2010) Tree-based threshold modeling for short-term forecast of daily maximum ozone level. Stoch. Environ. Res. Risk. Assess 24:19–28. [Google Scholar]
- Lynch ML, Huang L-S, Cox C, Strain JJ, Myers GJ, Bonham MP, Shamlaye CF, Stokes-Riner A, Wallace JMW, Duffy EM, Clarkson TW, Davidson PW (2011) Varying coefficient function models to explore interactions between maternal nutritional status and prenatal methylmercury toxicity in the Seychelles Child Development Nutrition Study. Environ Res 111: 75–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marsh DO, Clarkson TW, Myers GJ, Davidson PW, Cox C, Cernichiari E, Tanner MA, Lednar W, Shamlaye C, Choisy O (1995) The Seychelles study of fetal methylmercury exposure and child development: introduction. Neurotoxicology 16: 583–596. [PubMed] [Google Scholar]
- Matthews AD (1983) Mercury content of commercially important fish of the Seychelles, and hair mercury levels of a selected part of the population. Environ. Res 30: 305–312. [DOI] [PubMed] [Google Scholar]
- Myers GJ, Davidson PW, Cox C, Shamlaye CF, Palumbo D, Cernichiari E, Sloane-Reeves J, Wilding GE, Kost J, Huang LS, Clarkson TW (2003) Prenatal methylmercury exposure from ocean fish consumption in the Seychelles child development study. Lancet 361: 1686–1692. [DOI] [PubMed] [Google Scholar]
- National Research Council (2000) Toxicological effects of methylmercury. National Academy Press, Washington, D.C., pp 1–344. [Google Scholar]
- R Core Team (2015) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria: URL https://www.R-project.org/. [Google Scholar]
- Rahman A, Charron C, Ouarda TB, Chebana F. (2017) Development of regional flood frequency analysis techniques using generalized additive models for Australia. Stoch. Environ. Res. Risk. Assess 10.1007/s00477-017-1384-1. [DOI] [Google Scholar]
- Ralston NVC, Raymond LJ (2010) Dietary selenium’s protective effects against methylmercury toxicity. Toxicology 278: 112–123. [DOI] [PubMed] [Google Scholar]
- Strain JJ, Davidson PW, Bonham MP, Duffy EM, Stokes-Riner A, Thurston SW, Wallace JM, Robson PJ, Shamlaye CF, Georger LA, Sloane-Reeves J, Cernichiari E, Canfield RL, Cox C, Huang LS, Janciuras J, Myers GJ, Clarkson TW (2008) Associations of maternal long-chain polyunsaturated fatty acids, methyl mercury, and infant development in the Seychelles Child Development Nutrition Study. NeuroToxicology 29: 776–782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strain JJ, Davidson PW, Thurston SW, Harrington D, Mulhern MS, McAfee AJ, van Wijngaarden E, Shamlaye CF, Henderson J, Watson GE, Zareba G, Cory-Slechta DA, Lynch M, Wallace JMW, McSorley EM, Bonham MP, Stokes-Riner A, Sloane-Reeves J, Janciuras J, Wong R, Myers GJ, Clarkson TW (2012) Maternal PUFA Status but Not Prenatal Methylmercury Exposure Is Associated with Children’s Language Functions at Age Five Years in the Seychelles. Journal of Nutrition 142: 1943–1949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Wijngaarden E, Thurston SW, Myers GJ, Harrington D, Cory-Slechta DA, Strain JJ, Watson GE, Zareba G, Love T, Henderson J, Shamlaye CF. (2017) Methyl mercury exposure and neurodevelopmental outcomes in the Seychelles Child Development Study Main cohort at age 22 and 24 years. Neurotoxicol Teratol 59:35–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weihe P, Joensen HD (2012) Dietary recommendations regarding pilot whale meat and blubber in the Faroe Islands. International Journal of Circumpolar Health 71:10.3402/ijch.v71i0.18594. doi: 10.3402/ijch.v71i0.18594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zota AR, Ettinger AS, Bouchard M, Amarasiriwardena CJ, Schwartz J, Hu H, Wright RO (2009) Maternal blood manganese levels and infant birth weight. Epidemiology 20: 367–373. [DOI] [PMC free article] [PubMed] [Google Scholar]