Abstract
While fine particulate matter () has been associated with autism spectrum disorder (ASD), few studies focused on ultrafine particles (). Given that fine and ultrafine particles can be highly correlated due to shared emission sources, challenges remain to distinguish their health effects. In a retrospective cohort of 318,371 mother-child pairs (4,549 ASD cases before age 5) in Southern California, pregnancy average and were estimated using a California-based chemical transport model and assigned to residential addresses. The correlation between and was 0.87. We applied a two-step variance decomposition approach: first, decomposing and into the shared and unique variances using ordinary least squares linear regression (OLS) and Deming regression considering errors in both exposures; then assessing associations between decomposed and and ASD using Cox proportional hazard models adjusted for covariates. Prenatal and each was associated with increased ASD risk. OLS decomposition showed that associations were driven mainly by their shared variance, not by their unique variance. Results from Deming regression considering assumptions of measurement errors were consistent with those from OLS. This decomposition approach has potential to disentangle health effects of correlated exposures, such as and from common emissions sources.
Keywords: fine particles, ultrafine particles, variance decomposition, Deming regression, autism
Graphical Abstract

Introduction
Autism spectrum disorder (ASD) is a complex set of developmental disorders characterized by deficits in social interactions and communication, as well as the presence of restricted, repetitive, and stereotyped patterns of behaviors 1. A recent review reported a global autism prevalence of approximately 1 in 100 children 2. The estimated prevalence of ASD in the United States has shown a significant increase from 0.66% in 2002 to 2.78% in 2020 3,4. ASD imposes considerable lifelong emotional and social burden on affected children 5, their families 6, and economic burdens on society 7.
A growing number of studies has shown associations between prenatal air pollution exposures and ASD in offspring, with the most consistent findings for particulate matter (PM) 8–11. PM is emitted directly to the atmosphere from a variety of sources, including vehicles, industrial sources, and biomass combustion. PM can also form in the atmosphere as a product of chemical reactions 12. Due to the dynamic formation processes which involve nucleation, condensation, and coagulation, PM exhibits varying aerodynamic diameters broadly defined as coarse (with aerodynamic diameter < 10 μm; ), fine (with aerodynamic diameter < 2.5 μm; ) and ultrafine particles (with aerodynamic diameter < 0.1 μm; )12. These differences in size give rise to distinct aerosol behaviors and toxicity profiles13. The shared variance between PM of different aerodynamic sizes may indicate common emission sources such as traffic exhaust and biomass burning 14. The unique variance may indicate distinct atmospheric processes. can form in ambient air through nucleation, followed by coagulation and condensation into larger particles 15. Secondary aerosols like sulfate and nitrate, which are typically larger than 0.1 μm 16, may contribute to the unique variance of . Therefore, distinguishing the shared and unique variance of PM of different sizes can provide insights into the different mechanisms or components that may impact human health.
Recent toxicological and epidemiological studies reported that ambient may have exerted greater toxic effects on health than coarse particles17–19. The small size of enables it to penetrate deep into the lungs and enter the systemic circulation, which may induce inflammation and oxidative stress17. Moreover, may cross biological barriers20, such as placenta21 and blood-brain barrier22, and enter the fetal bloodstream and developing fetal brain21. Animal studies have also provided evidence that PM0.1 can be a source of neurotoxicity that can alter microglia stimulation and impact neuronal activity and synaptic plasticity of developing brain 23–25. Therefore, prenatal exposure may have direct adverse health effects on fetal neurodevelopment. Since is not regulated as a criteria pollutant and the modeling techniques for exposures have only recently been developed, few epidemiological studies have examined the health effects of during pregnancy. A recent study reported that prenatal exposure was associated with asthma development in children26. Specific to neurodevelopmental disorders such as ASD, a study from our group showed that aircraft emitted PM, primarily composed of , was associated with ASD in offspring27. A recent case-control study has reported the association between increased exposure in the second year of life and ASD development among children28.
However, previous studies often treated or as single exposures and examined their isolated health associations, rather than considering the real world scenario where children are exposed to both and , which can be highly correlated because of shared sources such as traffic29. Therefore, it remains a challenge to distinguish their associations with health outcomes. The collinearity issue and inflated standard errors may present in the co-adjusted model with two highly correlated exposures30. Therefore, we proposed a two-step variance decomposition approach using ordinary least squares (OLS) linear regression and Deming regression considering measurement errors in both exposures31,32 as the first step to decompose two correlated and exposures into unique and shared variance. In the second step, we differentiated the associations of the health outcome with the shared variance and the unique variance that remained in their residuals. We leveraged a validated chemical transport model (CTM) 14,33,34 for and exposures and a large retrospective birth cohort in Southern California to examine the associations between ASD in children with the shared and unique variance average and exposures during pregnancy.
Method
Study population
This study utilized a population-based retrospective birth cohort that included mothers with singleton deliveries (n=370,723) at Kaiser Permanente Southern California (KPSC) hospitals between January 1, 2001 and December 31, 2014. Information related to the mothers and children, including maternal residential addresses history and medical conditions, were extracted from high-quality integrated electronic medical records (EMR) maintained by KPSC. A total of 52,352 births were excluded due to (1) missing key covariates of gender, maternal race and ethnicity, and age at delivery (n=666); (2) maternal age <15 or >55 at delivery (n=159), an unusual age of pregnancy and high potential risk to children for this age of pregnancy; (3) incomplete maternal residential address during pregnancy or geocodes not suitable for air pollution exposure assignment (n=51,527) given that prenatal air pollution is our exposure of interest. The final data analysis included 318,371 mother-child pairs with complete data on residential estimates of and exposures (eFigure 1). Both KPSC and the University of Southern California Institutional Review Boards approved this study with waiver of individual subject consent.
ASD ascertainment
The outcome was ASD diagnosis before age 5. Children were followed from birth through EMR until clinical diagnosis of ASD, loss to follow-up, or age 5, whichever came first, as described previously 35,36. The presence or absence of ASD in children was ascertained based on at least two clinical diagnoses using International Classification of Diseases, Ninth Revision (ICD- 9) codes 299.0, 299.1, 299.8, 299.9 from the EMR records before October 1, 2015 (the date of KPSC implementation of ICD-10 codes) and subsequently ICD-10 codes F84.0, F84.5, F84.9 F84.0, F84.3, F84.5, F84.8, F84.9. This methodology for ascertaining ASD diagnoses has been previously validated in the KPSC member population, with a positive predictive value of 87%35, and has been employed in previous studies36,37.
Air pollution exposure assessment
This analysis utilized the and mass concentrations with 4-km grid resolution generated by the UCD/CIT (University of California Davis/California Institute of Technology) Source Oriented chemical transport model (CTM). The UCD/CIT SO-CTM uses meteorological fields generated by the Weather Research and Forecast (WRF) model and geographically resolved emissions estimates from the California Air Resources Board (CARB), the Global Fire Emissions Database (GFED), and the Model of Emissions of Gases and Aerosols from Nature (MEGAN) to predict airborne PM concentrations. The model calculations track the size-resolved mass concentrations of the PM constituents in particle diameters ranging from 0.01 to 10 μm14,33,34. The CTM calculations are based on fundamental equations of chemistry and physics that represent the production and destruction of pollutants in the atmosphere. The inputs to the CTM include detailed emissions inventories, meteorological fields, geographic fields, and chemical reaction mechanisms. Further details of the CTM used in the current study are provided elsewhere 29. Concentrations predicted by the UCD/CIT SO-CTM were evaluated against ambient measurements at all available locations and times in previous studies. Predicted concentrations of mass, chemical components, and mass exhibited excellent agreement with measurements (correlation coefficient R= 0.9 for mass and 0.92 for mass) 14,33,34. Average exposures to and were assigned to maternal addresses during pregnancy. Exposures were time-weighted to account for changes of residential addresses during pregnancy. Because recent research found that postnatal air pollution exposure confounded ASD association with prenatal exposure 28,38,39, we also assigned 1-year postnatal average and from residential addresses to assess potential confounding effects.
Covariates
We selected covariates a priori based on existing knowledge and past literature on air pollution exposures and ASD11,27. Covariates included were child sex, maternal race/ethnicity, maternal age at delivery, parity, maternal education, maternal history of comorbidity [>=1 diagnosis of heart, lung, kidney, or liver disease; cancer], neighborhood disadvantage index (which was associated with ASD in this cohort 40), birth year, and an indicator variable for season at conception (dry from April-October; wet from November-March). Birth year was included as a non-linear term with a penalized spline of 4 degrees of freedom based on Akaike Information Criterion (AIC).
Statistical Analysis
We applied Cox proportional hazards models adjusted for covariates to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) for associations of ASD with exposure to pregnancy average and . Separate models were fitted for and , the HRs were scaled to their one standard deviation (SD) increase. Linearity of associations of ASD with and was examined using quintiles and no significant deviations from linearity were observed.
To distinguish the associations of ASD with the unique and shared variance of and , we applied a two-step variance decomposition approach. Given that and have different measurement scales, we first scaled them by their SD for comparability. In Step 1 (variance decomposition), we fit two separate OLS linear regression models (equation 1–2):
| (1) |
| (2) |
to decompose into predicted , which represents the variance explained by , and residuals which contain the variance unexplained by . Similarly, was decomposed into predicted , ) which represents the variance explained by and residuals which contain the variance unexplained by .
In Step 2 (outcome model estimation), we chose Cox regression modeling, which handles censoring in the birth cohort study design 41, as the outcome modeling framework to evaluate the association between decomposed prenatal PM exposures and ASD. We fit two separate Cox regression models adjusted for covariates: in Model 1, we included both and ; in Model 2, we included both and . The coefficients for and , respectively in Models 1 and 2 represent the association of the shared variance with the health outcome, while the coefficients for and refer to the associations between the unique variances of and with the health outcome.
As an alternative approach in Step 1, the variance decomposition, we also employed Deming regression31,32, which accounts for the measurement errors in both the dependent and independent variables (shown in equation 3–4).
| (3) |
| (4) |
The Deming regression is generated based on the estimates of the expected (or predicted) values of and (equation 5)
| (5) |
while assuming that the ratio between the errors variance (; equation 6) remains constant.
| (6) |
Unlike OLS regression where we fitted two separate models, based on a single Deming regression model of , was decomposed into predicted and residuals of and was decomposed into predicted and residuals of . The Deming regression model of gives equivalent results. Details of Deming regression are provided in the eMethods under the Deming regression subsection.
In Deming regression, we refer to the (or ) as “shared variance”, and the residuals of and residuals of as “unique variance”. Given that Deming regression requires a hyperparameter specifying the ratio of measurement error variances , we assessed the sensitivity to this hyperparameter by a varying set of values from 0.5 to 2 with 0.1 increment. In Step 2, outcome model estimation, similar to the OLS approach, we fitted two separate Cox regression models: one with predicted and residuals of and the other one with predicted and residuals of
To assess the robustness of the results, we conducted three sensitivity analyses: (1) Since is a subset of , we conducted analysis by subtracting the mass from mass and examined the associations of shared and unique variance between and during pregnancy and ASD; (2) Heteroscedasticity in residuals of regression models, i.e, the variance of residuals varies with the level of the predicted value, can lead to biased estimates and confidence intervals. To address possible bias due to heteroscedasticity in residuals, we used weighted Deming regression 42, with weights the inverse of the variances of the residual for each subject; (3) To address the potential confounding by postnatal PM exposures, the Cox regression models for ASD with OLS and Deming regression decomposed pregnancy average and were further adjusted for 1-year postnatal average or , respectively.
Two-sided statistical tests were used and 95% confidence intervals (CIs) were reported. Deming regression was implemented using mcr library43. All analyses were performed using R Statistical Software (version 4.2)44. R codes are available at https://github.com/lindayu0408/Deming-regression.
Results
During follow-up, 4,549 (1.4%) children were diagnosed with ASD by age 5 (Table 1). Among children with ASD diagnoses, 3,695 (81.2%) were boys. Mothers of children with ASD were slightly older at delivery (31.28 years, SD=5.71) than mothers of children without ASD (30.18 years, SD=5.79). Compared to children without ASD, children with ASD had greater proportions of nulliparous mothers and mothers with a history of comorbidities.
Table 1.
Characteristics of children, with and without ASD diagnosis
| Characteristics | Children, No. (%) or mean [SD] | ||
|---|---|---|---|
|
| |||
| Overall (n = 318,371) | With ASD (n= 4,549) | Without ASD (n= 313,822) | |
|
| |||
| Sex = Male (%) | 162,985 (51.2) | 3,695 (81.2) | 159,290 (50.8) |
| Maternal age at delivery, mean [SD], years Parity; N (%) | 30.20 (5.79) | 31.28 (5.71) | 30.19 (5.79) |
| 0 | 111,854 (35.1) | 1,843 (40.5) | 110,011 (35.1) |
| 1 | 104,428 (32.8) | 1,490 (32.8) | 102,938 (32.8) |
| >=2 | 84,091 (26.4) | 901 (19.8) | 83,190 (26.5) |
| Unknown | 17,998 (5.7) | 315 (6.9) | 17,683 (5.6) |
| Maternal Education; N(%) | |||
| High school or lower | 112,045 (35.2) | 1,335 (29.3) | 110,710 (35.3) |
| Some college | 94,417 (29.7) | 1,472 (32.4) | 92,945 (29.6) |
| College graduate or higher | 108,871 (34.2) | 1,708 (37.5) | 107,163 (34.1) |
| Unknown | 3,038 (1.0) | 34 (0.7) | 3,004 (1.0) |
| Race/ethnicity; N (%) | |||
| Non-Hispanic white | 80,909 (25.4) | 953 (20.9) | 79,956 (25.5) |
| Non-Hispanic black | 29,752 (9.3) | 446 (9.8) | 29,306 (9.3) |
| Hispanic | 161,318 (50.7) | 2,299 (50.5) | 159,019 (50.7) |
| Asian/Pacific Islander | 39,854 (12.5) | 739 (16.2) | 39,115 (12.5) |
| Other | 6,538 (2.1) | 112 (2.5) | 6,426 (2.0) |
| Any history of maternal comorbiditiesa; N (%) | 46,647 (14.7) | 838 (18.4) | 45,809 (14.6) |
| Neighborhood disadvantage index mean [SD]b | 0.19 (1.85) | 0.16 (1.88) | 0.19 (1.85) |
| Year of birth; N (%) | |||
| 2001–2004 | 78,236 (24.6) | 817 (18.0) | 77,419 (24.7) |
| 2005–2009 | 111,064 (34.9) | 1,307 (28.7) | 109,757 (35.0) |
| 2010–2014 | 129,071 (40.5) | 2,425 (53.3) | 126,646 (40.4) |
| Pregnancy average (μg/m3) | 14.17 (4.15) | 13.43 (4.00) | 14.18 (4.15) |
| Pregnancy average (μg/m3) | 1.10 (0.47) | 1.03 (0.46) | 1.10 (0.47) |
Abbreviations: SD, standard deviation.
>=1 diagnosis of heart, lung, kidney, or liver disease; cancer.
Census tract level neighborhood disadvantage index. Higher values represent more disadvantaged neighborhoods
The levels of and decreased over time (eFigure 2). The average exposure of and during pregnancy were 14.17 μg/m3 (SD = 4.15) and 1.10 μg/m3 (SD = 0.47), respectively. Although pregnancy average accounts for only 7.5% mass of (ranging from 2.1% to 19.1% over participants in the study cohort), the two exposures were highly correlated (r=0.87) (eTable 1).
Our analysis of single pollutant pregnancy average and revealed that each was associated with ASD diagnosis before age 5 with HRs of 1.08 (95% CI 1.02, 1.14, per SD=4.15 μg/m3 increase for ) and 1.07 (95% CI 1.02, 1.12, per SD=0.47 μg/m3 increase for ), respectively (Table 2). The OLS linear regression models for variance decomposition showed that 76% of the variance in pregnancy average was explained by pregnancy average and vice versa (eTable 2). Outcome models with OLS decomposed variance (Table 2) showed that only the predicted (; the variance explained by ) or predicted (; the variance explained by ) were significantly associated with higher risk of ASD diagnosis with HRs of 1.09 (95% CI 1.03, 1.16, per SD=4.15 μg/m3 increase for predicted ) and 1.09 (95% CI 1.02, 1.16, per SD=0.47 μg/m3 increase for predicted ). The residuals of or were not associated with ASD [HRs of 1.04 (95% CI 0.96, 1.13, per SD=4.15 μg/m3 increase for residuals) and 1.04 (95% CI 0.98, 1.12, per SD=0.47 μg/m3 increase for residuals)].
Table 2.
The associations between pregnancy average , and ASD diagnosis
| Single Pollutant | HR (95% CI) |
| per SD=4.15 μg/m3 increase | 1.08 (1.02, 1.14) |
| per SD=0.47 μg/m3 increase | 1.07 (1.02, 1.12) |
| OLS Decomposed Variance | |
| per SD=4.15 μg/m3 increase | |
| Predicted a | 1.09 (1.03, 1.16) |
| Residuals of a | 1.04 (0.96, 1.13) |
| per SD=0.47 μg/m3 increase | |
| Predicted b | 1.09 (1.02, 1.16) |
| Residuals of b | 1.04 (0.98, 1.12) |
Abbreviations: CI, confidence interval; HR, hazard ratio; SD, standard deviation
was decomposed into predicted (the variance explained by ) and residuals (the variance not explained by ) using OLS linear regression where is the dependent variable and is the independent variable. HRs for predicted and residuals were all scaled to one SD increase in .
was decomposed into predicted (the variance explained by ) and residuals (the variance not explained by ) using OLS linear regression where is the dependent variable and is the independent variable. HRs for predicted and residuals were all scaled to one SD increase in .
The fitted slopes and intercepts of Deming regression models between pregnancy average and with varying ratios of error variances for these two exposures are shown eTable 3. Deming regression yielded a larger shared variance between and than OLS regression (88% vs. 75%) (eTables 2 & 4). Similar to the results using OLS, only the shared variance between and , as represented by predicted and predicted , was significantly associated with higher likelihood of ASD. No significant association was observed with the unique variance, as represented by the residuals of or (Figure 1). These associations remained largely unchanged across different values of the ratios of error variances.
Figure 1.
ASD risk associated with decomposed and using Deming regression with varying ratios of error variances
Abbreviations: HR, hazard ratio
Note: Pregnancy average and were scaled to their SD. Scaled prenatal (a) and (b) and (c) and (d) were decomposed into predicted values (shared variance) and residuals (unique variance) using Deming regression with scaled as the independent variable and scaled as the dependent variable based on a set of ratios of error variances values.
In sensitivity analyses, (1) the findings with and were almost identical to the main analysis showing that the shared variance, not the unique variance, was associated with ASD (eFigure 3). (2) The outcome models using the decomposed parameters of weighted Deming regression provided findings consistent with the main analysis (eFigure 4). (3) Associations between 1-year postnatal , and ASD were not statistically significant [HR=1.03 (95% CI 0.98, 1.09) per SD=3.78 μg/m3 increase for ; HR=1.04 (95% CI 0.99, 1.09) per SD=0.45 μg/m3 increase for ], adjusted for covariates. In models co-adjusted for 1-year postnatal or exposures, the positive associations between ASD and the shared variance of pregnancy average and remained statistically significant. The unique variance still showed non-significant associations with ASD (eTable 5 and eFigure 5).
Discussion
and can be highly correlated due to shared emission sources such as traffic29, food cooking, and biomass combustion. Therefore, it is challenging to distinguish their associations with health outcomes in large population studies. We illustrated the application of a variance decomposition approach utilizing OLS linear regression and Deming regression to examine the associations between ASD and highly correlated prenatal and exposures. We found that the shared variance between prenatal exposure to and were significantly associated with ASD, while the unique variance remaining in the residuals of and was not associated with ASD.
A substantial body of epidemiological evidence has shown associations between pregnancy air pollution exposure, especially , and children’s health outcomes such as birth weight, respiratory health, and neurodevelopment45,46. However, limited studies have focused on prenatal exposure to ultrafine particles, which can have more detrimental effects due to their ability to more readily translocate to the systemic circulation and cross biological barriers such as the placenta21 and blood-brain barrier22. A recent prospective study in the United States has reported the association between prenatal ultrafine particles and childhood asthma development and the association was found to be independent of 26. Another study in Canada found that prenatal exposure to ultrafine particles was associated with cancer incidence diagnosed before 6 years of age after adjusting for 47. In these two studies, mass and ultrafine particles measures (particle number concentrations) were weakly correlated, therefore, collinearity and overadjustment were not a concern in the co-adjusted models. Nonetheless, challenges remain in distinguishing the potential health effects of fine and ultrafine particles, particularly in the real-world scenarios where they tend to be highly correlated. A recent California based case-control study of 660 ASD cases applied Bayesian kernel machine regression (BKMR) to the joint effects of highly-correlated and , and other gaseous pollutants on ASD 28. That study reported null associations between and during pregnancy and ASD 28. BKMR is computationally intensive for a large cohort like ours. Instead, the variance decomposition approach is applicable to datasets of any large size and can provide a straightforward interpretation of the associations related to the shared and unique variance of highly correlated exposures.
The OLS residual decomposition approach was applied to examine the independent effects of constituents on health outcomes while adjusting for mass48. The key advantage of extracting residuals using linear regression is that it helps avoid collinearity issues which may occur in the co-adjusted outcome model. However, the interpretation of estimated associations between the health outcome and the decomposed exposure (predicted values and residuals) warrants clarification. In this study, we focused on highly correlated and and their potential associations with ASD. Since the predicted values of and from both OLS regression and Deming regression models were associated with ASD, this reflects the association with the shared variance between and . The residuals of the OLS model represent the variance in the dependent variable that was unexplained by the variance in the independent variable. As described in Mostofsky et al. 48, the association with the residuals of from the first OLS decomposition model can be interpreted as the association of total holding constant; similarly, the association with the residuals of from the second decomposition OLS model can be interpreted as the association of holding total constant. In this study, neither of these two residuals were associated with ASD, suggesting that variance represented by residuals were not associated with ASD. However, the effect size estimates for the residuals were positive in both analyses, albeit with wide confidence intervals. The interpretation of the association with residuals warrants caution because only about 24% of the exposure variance remained in the residuals (eTable 2), limiting the power to examine the associations between residuals and the health outcome.
Since a greater fraction of is composed of primary particulate matter compared to 49, the common associations observed for the shared variance of and suggests that primary particles emitted directly to the atmosphere may be more strongly associated with the risk of ASD than secondary particulate matter formed in the atmosphere through chemical reactions. Primary particles often consist of black carbon (BC), organic carbon (OC), and directly emitted metals. In our previous studies of this cohort, we found that components of BC, OC, and metals (Cu, Fe, and Mn) were associated with increased risk of ASD 50,51. A recent study also reported associations of traffic-related metals (Barium and Zinc) and BC with ASD 52, which were consistent with our previous study in this cohort showing associations between near-road air pollution and ASD 53. Among secondary particles, sulfates showed consistent associations with ASD while findings on nitrates-ASD associations were inconsistent using different exposure models of PM components 50.
Conventional OLS linear regression models assume that random errors only exist in the dependent variable, not in the independent variable. This assumption does not hold because both and were modeled estimates predicted with error. Deming regression is an errors-in-variables model assuming errors in both the dependent and the independent variables31. It has been widely used in clinical chemistry to assess the consistency between two measurement approaches54. We applied this approach in an environmental epidemiology study to decompose the variance between two highly correlated exposures into shared and unique variance. The OLS linear regression aims to minimize the sum of squares of vertical residuals (or horizontal residuals when swapping the dependent and independent variable), where the Deming regression minimizes the sum of squares of oblique residuals based on the ratio of error variances (see eMethods Deming regression section). The two OLS linear regression models can be regarded as the two extreme cases where the ratio of error variances is close to 0 and infinity. Therefore, findings based on Deming regression decomposition are complementary to OLS and provide additional evidence to help draw inference relating to shared and unique sources of correlated exposures on outcomes.
One critical assumption of Deming regression is that the ratio of error variances remains constant at all levels of the dependent and independent variables values31,32. In clinical chemistry, the measurement errors are often estimated by multiple repeated measurements within each subject55. In environmental epidemiology, this assumption provides an opportunity to incorporate exposure model uncertainty into the health effects analysis. There are a limited number of continuous monitors with large spatial distances between them, and few monitors for , therefore, the actual measurement errors can hardly be estimated. In addition, the exposures were assigned to residential addresses without considering time-activity patterns which also introduce complexity in estimating the measurement errors in the exposures to and . Consequently, we tested the sensitivity to this ratio of error variances by a varying set of possible values, and the findings remained consistent. In addition, it is possible that the ratio of error variances may vary depending on the levels of exposures. In other words, the ratio of error variances can be a function of exposure levels and this function can be incorporated into the optimization process to minimize the sum of squared residuals with varying ratios of error variances. However, this adaptation is not a routine feature of Deming regression in current statistical software. Moreover, Deming regression assumes homoscedasticity in the residuals. Therefore, we also conducted weighted Deming regression42 using the inverted squared values of the measurement data, and the findings were consistent with those of the standard Deming regression.
We acknowledge some limitations of this study. The 4 km spatial resolution may be coarse to resolve sharp spatial gradients from local sources, such as traffic exhaust, which decreases markedly with distance from roads. While is considered a regional pollutant, averaging to a 4 km resolution may miss some unique variance of , leaving more of the shared variance between and . However, the CTM models for capture regional trends in Southern California 29. With future advancements in modeling ultrafine particles at finer spatial scales, our variance decomposition approach may more effectively distinguish the effects between and . In addition, we focused on the associations with pregnancy PM exposures. Given that early signs of autism may appear before age 2 56,57, we co-adjusted 1-year postnatal average PM in sensitivity analyses. Although the significant positive associations between the shared variance of pregnancy average and remained after adjusting for 1-year postnatal average or , it remains a challenge to address both the temporal correlation between prenatal and early postnatal exposures and the high correlation among pollutants sharing similar sources simultaneously. Distributed lag models (DLM) have been applied to address the potential temporal autocorrelation to identify sensitive windows11. Further investigation is needed to see if the variance decomposition approaches (e.g. OLS and Deming regression) can be combined with DLM to identify sensitive exposure windows for highly correlated pollutants.
Conclusion
In this study, we illustrated the application of variance decomposition approach to examine the association of a health outcome with the shared or unique variance of two correlated environmental exposures. We also highlighted the potential use of Deming regression in environmental epidemiology to examine the independent effects of two highly correlated exposures on health outcomes while considering measurement errors in both exposures. Using a large birth cohort in Southern California as a case study, we found that the shared variance of pregnancy exposure to and measured by a California-based chemical transport model was significantly associated with ASD. No association was found with the unique variance in the residuals from these models. These approaches have potential to disentangle shared and unique health effects of correlated environmental exposures due to common emissions sources.
Supplementary Material
Highlights.
A large cohort study to evaluate associations between and and ASD
Decompose highly correlated and into shared and unique variance
An application of Deming regression assuming measurement errors in and
Shared variance between prenatal and was associated with ASD
Acknowledgements:
The authors thank the patients of Kaiser Permanente for helping us improve care through the use of information collected through our electronic health record systems, and the Kaiser Permanente and the Utility for Care Data Analysis (UCDA) team within Kaiser Permanente for creating the GEMS Datamart with consolidated addresses histories available to facilitate our research.
Funding:
This study was supported by National Institutes of Environmental Health Sciences (R01 ES029963 (Xiang, McConnell); R56ES028121 (Xiang); P30ES007048 and P2CES033433 (McConnell), and by Kaiser Permanente Southern California Direct Community Benefit Funds. Joel Schwartz was supported by EPA grant RD-835872.
Role of the Funder/Sponsor:
The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
Footnotes
Access to data and data analysis: Dr. Xiang had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Data sharing statement: KPSC Institutional Review Board approved this study, with waiver of informed consent with the condition that raw data remain confidential and would not be shared. Thus, due to the sensitive nature of these data, the data are not available to be shared.
Declaration of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Conflict of interests: We declare no actual or potential competing interests. Joel Schwartz has testified on behalf of the U.S. Department of Justice in a case involving a Clean Air Act violation.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.American Psychiatric Association. Diagnostic and statistical manual of mental disorders : DSM-IV-TR. 4th ed. Diagnostic and statistical manual of mental disorders : DSM-IV-TR American Psychiatric Association; 2000. [Google Scholar]
- 2.Zeidan J, Fombonne E, Scorah J, et al. Global prevalence of autism: A systematic review update. Autism Res. May 2022;15(5):778–790. doi: 10.1002/aur.2696 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Centers for Disease Control and Prevention. Prevalence of autism spectrum disorders--autism and developmental disabilities monitoring network, 14 sites, United States, 2002. MMWR Surveill Summ. Feb 9 2007;56(1):12–28. [PubMed] [Google Scholar]
- 4.Maenner MJ, Warren Z, Williams AR, et al. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2020. MMWR Surveillance Summaries. 2023;72(2):1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kuhlthau K, Orlich F, Hall TA, et al. Health-Related Quality of Life in children with autism spectrum disorders: results from the autism treatment network. J Autism Dev Disord. Jun 2010;40(6):721–9. doi: 10.1007/s10803-009-0921-2 [DOI] [PubMed] [Google Scholar]
- 6.Rao PA, Beidel DC. The impact of children with high-functioning autism on parental stress, sibling adjustment, and family functioning. Behav Modif. Jul 2009;33(4):437–51. doi: 10.1177/0145445509336427 [DOI] [PubMed] [Google Scholar]
- 7.Buescher AV, Cidav Z, Knapp M, Mandell DS. Costs of autism spectrum disorders in the United Kingdom and the United States. JAMA Pediatr. Aug 2014;168(8):721–8. doi: 10.1001/jamapediatrics.2014.210 [DOI] [PubMed] [Google Scholar]
- 8.Jo H, Eckel SP, Wang X, et al. Sex-specific associations of autism spectrum disorder with residential air pollution exposure in a large Southern California pregnancy cohort. Environ Pollut. Nov 2019;254(Pt A):113010. doi: 10.1016/j.envpol.2019.113010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chun H, Leung C, Wen SW, McDonald J, Shin HH. Maternal exposure to air pollution and risk of autism in children: A systematic review and meta-analysis. Environ Pollut. Jan 2020;256:113307. doi: 10.1016/j.envpol.2019.113307 [DOI] [PubMed] [Google Scholar]
- 10.Lam J, Sutton P, Kalkbrenner A, et al. A Systematic Review and Meta-Analysis of Multiple Airborne Pollutants and Autism Spectrum Disorder. PLoS One. 2016;11(9):e0161851. doi: 10.1371/journal.pone.0161851 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rahman MM, Shu YH, Chow T, et al. Prenatal Exposure to Air Pollution and Autism Spectrum Disorder: Sensitive Windows of Exposure and Sex Differences. Environ Health Perspect. Jan 2022;130(1):17008. doi: 10.1289/EHP9509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Moreno-Ríos AL, Tejeda-Benítez LP, Bustillo-Lecompte CF. Sources, characteristics, toxicity, and control of ultrafine particles: An overview. Geoscience Frontiers. 2022;13(1):101147. [Google Scholar]
- 13.Kelly FJ, Fussell JC. Size, source and chemical composition as determinants of toxicity attributable to ambient particulate matter. Atmospheric environment. 2012;60:504–526. [Google Scholar]
- 14.Hu J, Zhang H, Chen S, et al. Identifying PM2.5 and PM0.1 sources for epidemiological studies in California. Environ Sci Technol. May 6 2014;48(9):4980–90. doi: 10.1021/es404810z [DOI] [PubMed] [Google Scholar]
- 15.Seigneur C. Current understanding of ultrafine particulate matter emitted from mobile sources. Journal of the Air & Waste Management Association. 2009;59(1):3–17. [DOI] [PubMed] [Google Scholar]
- 16.Hazi Y, Heikkinen M, Cohen B. Size distribution of acidic sulfate ions in fine ambient particulate matter and assessment of source region effect. Atmospheric Environment. 2003;37(38):5403–5413. [Google Scholar]
- 17.Terzano C, Di Stefano F, Conti V, Graziani E, Petroianni A. Air pollution ultrafine particles: toxicity beyond the lung. Eur Rev Med Pharmacol Sci. 2010;14(10):809–821. [PubMed] [Google Scholar]
- 18.Oberdörster G, Oberdörster E, Oberdörster J. Nanotoxicology: an emerging discipline evolving from studies of ultrafine particles. Environmental health perspectives. 2005;113(7):823–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Oberdürster G. Toxicology of ultrafine particles: in vivo studies. Philosophical Transactions of the Royal Society of London Series A: Mathematical, Physical and Engineering Sciences. 2000;358(1775):2719–2740. [Google Scholar]
- 20.Cory-Slechta DA, Merrill A, Sobolewski M. Air Pollution-Related Neurotoxicity Across the Life Span. Annu Rev Pharmacol Toxicol. Jan 20 2023;63:143–163. doi: 10.1146/annurevpharmtox-051921-020812 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bongaerts E, Nawrot TS, Van Pee T, Ameloot M, Bove H. Translocation of (ultra)fine particles and nanoparticles across the placenta; a systematic review on the evidence of in vitro, ex vivo, and in vivo studies. Part Fibre Toxicol. Nov 2 2020;17(1):56. doi: 10.1186/s12989-020-00386-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Oberdorster G, Sharp Z, Atudorei V, et al. Translocation of inhaled ultrafine particles to the brain. Inhal Toxicol. Jun 2004;16(6–7):437–45. doi: 10.1080/08958370490439597 [DOI] [PubMed] [Google Scholar]
- 23.Klocke C, Allen JL, Sobolewski M, et al. Neuropathological Consequences of Gestational Exposure to Concentrated Ambient Fine and Ultrafine Particles in the Mouse. Toxicol Sci. Apr 1 2017;156(2):492–508. doi: 10.1093/toxsci/kfx010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Rodulfo-Cardenas R, Ruiz-Sobremazas D, Biosca-Brull J, et al. The influence of environmental particulate matter exposure during late gestation and early life on the risk of neurodevelopmental disorders: A systematic review of experimental evidences. Environ Res. Nov 1 2023;236(Pt 1):116792. doi: 10.1016/j.envres.2023.116792 [DOI] [PubMed] [Google Scholar]
- 25.Morris-Schaffer K, Merrill A, Jew K, et al. Effects of neonatal inhalation exposure to ultrafine carbon particles on pathology and behavioral outcomes in C57BL/6J mice. Part Fibre Toxicol. Feb 18 2019;16(1):10. doi: 10.1186/s12989-019-0293-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wright RJ, Hsu HL, Chiu YM, et al. Prenatal Ambient Ultrafine Particle Exposure and Childhood Asthma in the Northeastern United States. Am J Respir Crit Care Med. Oct 1 2021;204(7):788–796. doi: 10.1164/rccm.202010-3743OC [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Carter SA, Rahman MM, Lin JC, et al. Maternal exposure to aircraft emitted ultrafine particles during pregnancy and likelihood of ASD in children. Environ Int. Aug 2023;178:108061. doi: 10.1016/j.envint.2023.108061 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Goodrich AJ, Kleeman MJ, Tancredi DJ, et al. Ultrafine particulate matter exposure during second year of life, but not before, associated with increased risk of autism spectrum disorder in BKMR mixtures model of multiple air pollutants. Environmental Research. 2023:117624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Yu X, Venecek M, Kumar A, et al. Regional sources of airborne ultrafine particle number and mass concentrations in California. Atmospheric Chemistry and Physics. 2019;19(23):14677–14702. [Google Scholar]
- 30.Mason CH, Perreault WD Jr. Collinearity, power, and interpretation of multiple regression analysis. Journal of marketing research. 1991;28(3):268–280. [Google Scholar]
- 31.Linnet K. Evaluation of regression procedures for methods comparison studies. Clinical chemistry. 1993;39(3):424–432. [PubMed] [Google Scholar]
- 32.Deming WE. Statistical adjustment of data. 1943; [Google Scholar]
- 33.Hu J, Zhang H, Chen SH, et al. Predicting primary PM2.5 and PM0.1 trace composition for epidemiological studies in California. Environ Sci Technol. May 6 2014;48(9):4971–9. doi: 10.1021/es404809j [DOI] [PubMed] [Google Scholar]
- 34.Hu J ZH, Ying Q, Chen S-H, Vandenberghe F, Kleeman MJ. Long-term particulate matter modeling for health effect studies in California – Part 1: Model performance on temporal and spatial variations. Atmos Chem Phys. 2015;15:3445–3461. doi:doi: 10.5194/acp-15-3445-2015 [DOI] [Google Scholar]
- 35.Coleman KJ, Lutsky MA, Yau V, et al. Validation of Autism Spectrum Disorder Diagnoses in Large Healthcare Systems with Electronic Medical Records. J Autism Dev Disord. Jul 2015;45(7):1989–96. doi: 10.1007/s10803-015-2358-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Xiang AH, Wang X, Martinez MP, Page K, Buchanan TA, Feldman RK. Maternal Type 1 Diabetes and Risk of Autism in Offspring. JAMA. Jul 3 2018;320(1):89–91. doi: 10.1001/jama.2018.7614 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Xiang AH, Wang X, Martinez MP, et al. Association of maternal diabetes with autism in offspring. JAMA. Apr 14 2015;313(14):1425–34. doi: 10.1001/jama.2015.2707 [DOI] [PubMed] [Google Scholar]
- 38.Chen G, Jin Z, Li S, et al. Early life exposure to particulate matter air pollution (PM(1), PM(2.5) and PM(10)) and autism in Shanghai, China: A case-control study. Environ Int. Dec 2018;121(Pt 2):1121–1127. doi: 10.1016/j.envint.2018.10.026 [DOI] [PubMed] [Google Scholar]
- 39.Kaufman JA, Wright JM, Rice G, Connolly N, Bowers K, Anixt J. Ambient ozone and fine particulate matter exposures and autism spectrum disorder in metropolitan Cincinnati, Ohio. Environ Res Apr 2019;171:218–227. doi: 10.1016/j.envres.2019.01.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yu X, Rahman MM, Carter SA, et al. Neighborhood Disadvantage and Autism Spectrum Disorder in a Population With Health Insurance. JAMA Psychiatry. Feb 1 2024;81(2):209–213. doi: 10.1001/jamapsychiatry.2023.4347 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Abd ElHafeez S, D’Arrigo G, Leonardis D, Fusaro M, Tripepi G, Roumeliotis S. Methods to Analyze Time-to-Event Data: The Cox Regression Analysis. Oxid Med Cell Longev. 2021;2021:1302811. doi: 10.1155/2021/1302811 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Linnet K. Estimation of the linear relationship between the measurements of two methods with proportional errors. Statistics in medicine. 1990;9(12):1463–1473. [DOI] [PubMed] [Google Scholar]
- 43.Potapov S, Model F, Schuetzenmeister A, Manuilova E, Dufey F, Raymaekers J. Package ‘mcr’. 2023. [Google Scholar]
- 44.R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2023. https://www.R-project.org/
- 45.da Costa EOJR, Base LH, de Abreu LC, Filho CF, Ferreira C, Morawska L. Ultrafine particles and children’s health: Literature review. Paediatr Respir Rev. Nov 2019;32:73–81. doi: 10.1016/j.prrv.2019.06.003 [DOI] [PubMed] [Google Scholar]
- 46.Johnson NM, Hoffmann AR, Behlen JC, et al. Air pollution and children’s health-a review of adverse effects associated with prenatal exposure from fine to ultrafine particulate matter. Environ Health Prev Med. Jul 12 2021;26(1):72. doi: 10.1186/s12199-021-00995-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Lavigne E, Lima I, Hatzopoulou M, et al. Ambient ultrafine particle concentrations and incidence of childhood cancers. Environ Int. Dec 2020;145:106135. doi: 10.1016/j.envint.2020.106135 [DOI] [PubMed] [Google Scholar]
- 48.Mostofsky E, Schwartz J, Coull BA, et al. Modeling the association between particle constituents of air pollution and health outcomes. Am J Epidemiol. 2012;176(4):317–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kleeman MJ, Riddle SG, Robert MA, et al. Source apportionment of fine (PM1.8) and ultrafine (PM0.1) airborne particulate matter during a severe winter pollution episode. Environ Sci Technol. Jan 15 2009;43(2):272–9. doi: 10.1021/es800400m [DOI] [PubMed] [Google Scholar]
- 50.Rahman MM, Carter SA, Lin JC, et al. Associations of Autism Spectrum Disorder with PM(2.5) Components: A Comparative Study Using Two Different Exposure Models. Environ Sci Technol. Jan 10 2023;57(1):405–414. doi: 10.1021/acs.est.2c05197 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Rahman MM, Carter SA, Lin JC, et al. Prenatal exposure to tailpipe and non-tailpipe tracers of particulate matter pollution and autism spectrum disorders. Environ Int. Jan 2023;171:107736. doi: 10.1016/j.envint.2023.107736 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.O’Sharkey K, Meng Q, Mitra S, et al. Associations between brake and tire wear-related PM2.5 metal components, particulate oxidative stress potential, and autism spectrum disorder in Southern California. Environment International. 2024:108573. [DOI] [PubMed] [Google Scholar]
- 53.Carter SA, Rahman MM, Lin JC, et al. In utero exposure to near-roadway air pollution and autism spectrum disorder in children. Environ Int. Jan 2022;158:106898. doi: 10.1016/j.envint.2021.106898 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Martin RF. General Deming regression for estimating systematic bias and its confidence interval in method-comparison studies. Clinical chemistry. 2000;46(1):100–104. [PubMed] [Google Scholar]
- 55.Moffat RJ. Describing the uncertainties in experimental results. Experimental thermal and fluid science. 1988;1(1):3–17. [Google Scholar]
- 56.Bacon EC, Courchesne E, Barnes CC, et al. Rethinking the idea of late autism spectrum disorder onset. Development and psychopathology. Aug 14 2017:1–17. doi: 10.1017/s0954579417001067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Charman T, Baird G. Practitioner review: Diagnosis of autism spectrum disorder in 2- and 3-year-old children. J Child Psychol Psychiatry. Mar 2002;43(3):289–305. doi: 10.1111/1469-7610.00022 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

