Abstract
Background:
Despite the recent developments in physically and chemically based analysis of atmospheric particles, no models exist for resolving the spatial variability of pollen concentration at urban scale.
Objectives:
We developed a land use regression (LUR) approach for predicting spatial fine-scale allergenic pollen concentrations in the Helsinki metropolitan area, Finland, and evaluated the performance of the models against available empirical data.
Methods:
We used grass pollen data monitored at 16 sites in an urban area during the peak pollen season and geospatial environmental data. The main statistical method was generalized linear model (GLM).
Results:
GLM-based LURs explained 79% of the spatial variation in the grass pollen data based on all samples, and 47% of the variation when samples from two sites with very high concentrations were excluded. In model evaluation, prediction errors ranged from 6% to 26% of the observed range of grass pollen concentrations. Our findings support the use of geospatial data–based statistical models to predict the spatial variation of allergenic grass pollen concentrations at intra-urban scales. A remote sensing–based vegetation index was the strongest predictor of pollen concentrations for exposure assessments at local scales.
Conclusions:
The LUR approach provides new opportunities to estimate the relations between environmental determinants and allergenic pollen concentration in human-modified environments at fine spatial scales. This approach could potentially be applied to estimate retrospectively pollen concentrations to be used for long-term exposure assessments.
Citation:
Hjort J, Hugg TT, Antikainen H, Rusanen J, Sofiev M, Kukkonen J, Jaakkola MS, Jaakkola JJ. 2016. Fine-scale exposure to allergenic pollen in the urban environment: evaluation of land use regression approach. Environ Health Perspect 124:619–626; http://dx.doi.org/10.1289/ehp.1509761
Introduction
Asthma is globally the most common chronic disease in children, and it affects approximately 7.7% of the working-age population in the United States (Henneberger et al. 2011) and 3–9% of adults in Finland and in other parts of Europe (Boutin-Forzano et al. 2007; Jaakkola et al. 2002; Pallasaho et al. 2011). Allergies are even more common, with a prevalence of 20–40% among children (Aït-Khaled et al. 2009; Asher et al. 2006). Emerging climate change will influence temperature, precipitation, and the spatial distribution of pollen species with strong allergenic properties, and these changes may have profound effects on both the etiology of asthma and allergies and the occurrence of symptoms among subjects with these diseases (e.g., Beggs and Bambrick 2005; Cecchi et al. 2010; D’Amato et al. 2014; Gilmour et al. 2006).
Grass (Poaceae) pollen is the most widespread group of pollen allergens worldwide, and is the most frequent cause of pollen allergy in Europe and one of the most common causes in the United States (D’Amato et al. 2007; White and Bernstein 2003). In general, most individuals suffering from allergic rhinitis experience seasonal pollen-related symptoms (Blomme et al. 2013). Pollen allergy has been identified in 80–90% of children with asthma and 40–50% of adults with asthma (Taylor et al. 2007). The importance of pollen is highlighted in urban environments, where the prevalence of allergy has been estimated to be higher than in rural environments (e.g., Majkowska-Wojciechowska et al. 2007; Priftis et al. 2007).
Conditions that promote plant growth and reproduction, such as higher temperatures and carbon dioxide concentrations (Singer et al. 2005; Ziska et al. 2003), have been shown to increase pollen production and promote an earlier start of pollen season in urban areas compared with surrounding rural areas (Ziska et al. 2003). Importantly, there is evidence that the allergenic potential of polluted (urban) pollen is stronger than that of nonpolluted pollen grains (Aina et al. 2010; Majd et al. 2004). Thus, relatively more favorable vegetation growth conditions, higher pollen counts, earlier start of pollen season, and greater allergenic potency may have a pronounced effect on the occurrence and the severity of allergic symptoms and on the risk of asthma and allergies in urban environments (Beggs 2004; Gilmour et al. 2006; Ziska et al. 2003). The global trend of urbanization and formation of mega-cities will increase the human exposure to mixtures of chemical and biological pollutants [World Health Organization (WHO) 2013].
Previous epidemiologic studies have predominantly assessed exposure on the basis of pollen data from only one or few monitoring stations (Caillaud et al. 2014; Delfino et al. 2002). An additional limitation in several studies is that pollen data have been based on roof-level measurements (e.g., at the heights of 10–30 m) and thus do not reflect properly the most common exposure at breathing level (e.g., 1.5 m). Roof-level samplers collect particles from a larger geographic area, reflecting mainly the influences of regional sources of pollen grains (O’Rourke and Lebowitz 1984). Thus, roof-level data do not address intra-urban spatial heterogeneity of pollen concentrations, and they are less accurate for exposure assessment when studying potential human health effects (Peel at al. 2013). Consequently, there is an urgent need for studies that can quantify the relations between environmental determinants and allergenic pollen concentrations across urban gradient at local (≤ 100–300 m) scales (Haberle et al. 2014; WHO 2013).
In recent years, there has been significant progress in the modeling of transportation and concentration of atmospheric pollen at large spatial scales (Kukkonen et al. 2012; Skjøth et al. 2013; Sofiev and Bergmann 2012; Sofiev et al. 2013). However, even the most sophisticated current methods cannot estimate the concentrations of pollens at fine spatial scale. In this context, a land use regression (LUR) approach based on readily available geographic information system (GIS) data could provide a cost-efficient and reasonably accurate method for predicting the variability of pollen concentrations at fine spatial scales (Beelen et al. 2013; Richardson et al. 2013).
The overall objective of this study was to improve and evaluate the LUR methodology for the prediction of spatial variability of allergenic pollen concentrations in the urban area. We focused on grasses, as one of the most important allergenic plant groups. More precisely, we addressed the following specific objectives: a) to model intra-urban spatial variation of grass pollen concentrations and b) to determine the best environmental predictors of intra-urban variation in grass pollen concentrations.
Methods
Study Area
The study was conducted in the Helsinki metropolitan area (1.1 million inhabitants), southern Finland (60°10′15″N, 24°56′15″E) (Figure 1). The study area includes both constructed urban environments with a limited amount of vegetation and natural environments covered by diverse vegetation. The study area has the characteristics of both maritime and continental climates. The mean annual temperature is 5.9°C and the mean annual precipitation is 655 mm (1981–2010; Pirinen et al. 2012). The area belongs to the temperate coniferous–mixed forest zone. Grasses are typical pioneer plant groups (i.e., those that are the first to colonize newly exposed land surfaces) in the ground layer of the area.
Data Collection
Grass pollen data. Two sampling lines, each with a length of 3 km, were placed within the cities of Helsinki and Espoo (Figure 1). The selected sampling sites differed from each other with respect to land use and vegetation type. These two categories of urban environments (urban environment in central Helsinki, and residential suburban area in Espoo) were selected for evaluation because grass pollen exposures in these areas may be representative of the exposures experienced by many urban populations. Altogether, pollen grains were monitored at 16 different sites (1.5 m above ground surface) during the grass pollen season in 2013. The sampling was conducted daily (except on rainy days) from 27 June through 21 July 2013, during the peak grass pollen season. See Supplemental Material, Table S1, for a summary of weather conditions during the sampling period.
Rotorod-type samplers were used for pollen monitoring (Rantio-Lehtimäki et al. 1992). To minimize problems with oversampling (Sterling and Lewis 1998), samples were collected at each site for only 30 min each morning (between 0800 and 1130 hours) and each afternoon (between 1300 and 1630 hours). On each day the specific sampling time differed among the individual sites, and the specific sampling times at each site differed from day to day. In addition, sampling days alternated between the eight Helsinki sites and the eight Espoo sites (Figure 1). For additional details, see Supplemental Material, “Sampling of grass pollen data.” Pollen measurements were converted into volumetric equivalents expressed as the concentration of pollen grains per cubic meter of air sampled. The grass pollen data were subdivided into three subsets that each represented the average concentration of pollens at the breathing zone (1.5 m) at each one of the 16 sampling sites during a 2-week period. The first was used to calibrate the LUR model, and included data collected June 27–July 9 during the afternoon sampling period (1300–1630 hours). The second and third were used for independent evaluation of the model, with the second collected 27 June–9 July during the morning sampling period (0800–1130 hours), and the third collected 10–21 July during the morning and afternoon sampling period. In the calibration data set, average pollen concentrations for 2 of the 16 sites were very high (120.2 and 1758.0 grains/m3) compared with average concentrations for the other 14 sites (1.4–23.9 grains/m3). Therefore, in addition to analyzing the data from all 16 sites (hereafter referred to as the n = 16 data set), we performed a second set of analyses after excluding the data from the 2 sites with very high concentrations (hereafter referred to as the n = 14 data set) in order to assess the robustness of our model with regard to potential outliers.
Environmental determinants. Altogether, eight GIS-based geospatial environmental determinants (Table 1) were computed at 25-, 50-, 100-, 300-, 500-, and 1,000-m buffer sizes (Lovasi et al. 2013). The range of buffer sizes was selected based on the knowledge of dispersal of grass pollens (e.g., Skjøth et al. 2013) and the properties of geospatial data sets (e.g., accuracy and resolution). The applied determinants included two remote sensing–based indices [tasseled cap transformation (TC) brightness and TC greenness (Lillesand et al. 2004)], four land use determinants (wasteland, park, field, and urban land use) compiled from the 2010 SLICES land use classification [National Land Survey of Finland (http://www.maanmittauslaitos.fi/en/kartat)] and two land use/cover variables (deciduous forest and mixed forest) computed using the 2006 CORINE land cover database (Finnish Environmental Institute 2009). The values of the environmental determinants were computed using ArcGIS 10.2 (ESRI) with buffers at various sizes.
Table 1.
Environmental variable | Unit | Description | Source |
---|---|---|---|
TC brightness | Index | Land use/cover with high albedo (overall brightness of the image) | Landsat TM5 |
TC greenness | Index | Amount of photosynthetically active green vegetation | Landsat TM5 |
Wasteland | m2 | Unmanaged grasslands (e.g., meadows and power lines) | SLICES land use classification |
Park | m2 | Managed grasslands (e.g., urban parks and sports fields) | SLICES land use classification |
Field | m2 | Cultivated fields (e.g., wheat and barley fields) | SLICES land use classification |
Deciduous forest | m2 | Broadleaf trees (e.g., birch, alder, and maple) | CORINE land cover database |
Mixed forest | m2 | Broadleaf trees and conifers (pine and spruce) | CORINE land cover database |
Urban land use | m2 | Urban land use classes (e.g., high density residential, commercial, industrial, and traffic areas) | SLICES land use classification |
Abbreviations: TC, tasseled cap; TM5, Thematic Mapper 5. The Landsat data were from 2010 (http://earthexplorer.usgs.gov/), SLICES data were from 2010 [National Land Survey of Finland (http://www.maanmittauslaitos.fi/en/kartat)], and CORINE from 2006 (Finnish Environmental Institute 2009). |
The source of the remote sensing data was selected based on the following criteria: The images should be freely available, they should have global coverage with high spatial resolution (10–50 m), and the data should be available for several decades (to facilitate linking this data to long-term epidemiologic cohort data; e.g., Lovasi et al. 2013) (Figure 2). Consequently, the remote sensing–based indices—TC brightness and TC greenness (Lillesand et al. 2004)—were computed from orthorectified Landsat TM5 satellite image (considering the acquisition day and cloud cover the most suitable image was from 14 July 2010) using Erdas IMAGINE. We selected the TC greenness variable instead of the more commonly used Normalized Difference Vegetation Index (NDVI) (e.g., Skjøth et al. 2013) based on a preliminary analysis that indicated that the TC greenness variable was more highly correlated with grass pollen concentrations [the highest Spearman’s rank order correlation coefficients (r s) = 0.792, p < 0.001] than the NDVI (the highest r s = 0.788, p < 0.001).
Statistical Analysis
Before the multivariate statistical analyses, the distributions of the grass variables were normalized using logarithmic (log10) transformation (normality was statistically confirmed applying the Kolmogorov–Smirnov test) (Sokal and Rohlf 1995). The statistical analyses were conducted in three steps to explore the explanation and prediction ability of the selected environmental determinants at various scales. The main statistical methods applied were Spearman’s rank correlation analysis (Sokal and Rohlf 1995), hierarchical partitioning (HP; Chevan and Sutherland 1991), and generalized linear modeling (GLM; McCullagh and Nelder 1989). First, r s between grass pollen concentrations and environmental determinants were calculated to select the optimal buffer sizes. The selection of the optimal buffer size was based on the highest correlation coefficients, also requiring an expected sign (e.g., a positive sign for variables describing potential environments for grasses). The correlations were calculated with IBM SPSS Statistics 19 software. Second, to explore the potential effects of multicollinearity in statistical analyses (i.e., intercorrelation among environmental determinants), we applied HP (Chevan and Sutherland 1991) as described in detail in Supplemental Material, “Hierarchical partitioning.” Third, GLM was employed to model the observed grass pollen concentration differences in the study area. The calibration of the GLM was performed using the standard glm function in R (R Core Team 2015). The GLM optimization was based on the forward-selection approach and Akaike’s Information Criterion (AIC) (Burnham and Anderson 2004). We evaluated the calibrated model on the basis of the model fit to the data, normality of the residuals (applying the Kolmogorov–Smirnov test), homoscedasticity, spatial independency (applying the Moran’s I), and leverage (Beelen et al. 2013; Sokal and Rohlf 1995).
The predictive ability of the final GLMs was assessed using leave-one-out cross-validation (CV) of models based on the calibration data set. In addition, we performed an external validation by comparing calibration model predictions with the two evaluation data sets based on samples collected 27 June–9 July during the morning, and samples collected 10–21 July during the morning and afternoon, respectively. In addition, we visually compared spatial variation in the predicted concentrations with land use and land cover characteristics across the Helsinki metropolitan area.
Results
Optimization of the Buffer Sizes
Environmental variables with the highest Spearman’s rank order correlations with grass pollen concentrations based on the full (n = 16) data set were TC greenness (optimum buffer size = 50 m, r s = 0.79, p < 0.001), wasteland (300 m, r s = 0.73, p < 0.001), urban land use (300 m, r s = –0.72, p < 0.01), field (1,000 m, r s = 0.67, p < 0.01), and deciduous forest (300 m, r s = 0.63, p < 0.01) (see Supplemental Material, Table S2). For the reduced (n = 14) data set, grass pollen concentrations correlated highest with TC greenness (50 m, r s = 0.69, p < 0.01), urban land use (1,000 m, r s = –0.62, p < 0.05), wasteland (500, r s = 0.62, p < 0.05), and mixed forest (50 m, r s = 0.62, p < 0.05) (see Supplemental Material, Table S3). Thus, the correlation patterns were rather similar for the two data sets, although correlations were weaker for the n = 14 data set than for the full data set.
Hierarchical Partitioning
The results of the HP analysis—the independent contribution of the environmental determinants at the optimum buffer size—are presented in Supplemental Material, Figure S1. The contributions of the TC greenness and urban land use variables were > 15% in both data sets (range, 17–30%). Moreover, the wasteland variable contributed 22% in the n = 16 data set and the mixed forest variable 21% in the n = 14 data set. Because the sample sizes were small, only the TC greenness variable was statistically significant (p < 0.001) in the larger data set, and none of the determinants were significant (p > 0.05) in the smaller data set.
Calibration and Evaluation of Generalized Linear Models
The final GLMs for the n = 16 and n = 14 data sets both included the TC greenness variable, and the n = 16 model also included the wasteland variable. The two GLMs explained 79% and 47% of the variation in the grass pollen data, respectively (D2 values, Table 2). Based on the exploration of residuals, the assumptions of normal errors and independency were not statistically violated. The larger data set included one potential outlier (Cook’s distance > 1), but when an alternative GLM was calibrated without the outlier observation, the normality and homoscedasticity of the residuals was reduced, and a new potential outlier was identified (data not shown).
Table 2.
Model calibration | All samples (n = 16) | Extreme values off (n = 14) |
---|---|---|
Final GLM (α + βxi) | 1.287 + 0.032 (greenness) + 7.8–6 (wasteland) | 1.179 + 0.024 (greenness) |
AIC | 18.2 | 8.0 |
D2 | 0.785 | 0.466 |
Correlation (fitted – obs) | r = 0.89 (p < 0.001); rs = 0.79 (p < 0.001) | r = 0.68 (p = 0.007); rs = 0.69 (p = 0.006) |
Residuals | ||
Normality (K-S test) | Normally distributed (p = 0.829) | Normally distributed (p = 0.826) |
Homoscedasticity | Acceptable | Acceptable |
Spatial autocorrelation | No autocorrelation (Moran’s I p > 0.05) | No autocorrelation (Moran’s I p > 0.05) |
Cook’s distance | One observation > 1 | All observations < 0.5 |
Abbreviations: α, intercept; β, regression coefficient; xi, environmental variable; AIC, Akaike’s Information Criterion; D2, explained deviance (comparable to explained variance in least-square regression); fitted, fitted values; greenness, greenness of tasseled cap transformation; K-S test, Kolmogorov–Smirnov test; obs, observed values; r, Pearson’s correlation coefficient; rs, Spearman’s rank order correlation coefficient; wasteland, unmanaged grassland land use classes. |
Based on the CV results, the prediction errors were 412.9 grains/m3 and 5.9 grains/m3 for the n = 16 and n = 14 data sets, respectively. However, the mean prediction errors were 6.4% (n = 16) and 19.5% (n = 14) and root-mean-square errors were 23.5% (n = 16) and 26.4% (n = 14) of the observed range of grass pollen concentrations. The calibrated model for the full data set predicted the measured concentrations of samples collected during the morning (instead of the afternoon) on the same days better than it predicted measured concentrations during the second observation period (10–21 July) (Pearson’s r = 0.84, p < 0.001 and 0.55, p < 0.05, respectively) (Figure 3A,B). In contrast, predictions based on the n = 14 data set were slightly better for the second observation period than for the morning samples (r = 0.58, p < 0.05 and 0.38, p > 0.05, respectively) (Figure 3C,D).
The overall view of the predicted concentrations of grass pollen follows well the land use patterns in the Helsinki metropolitan area (Figure 4). For example, the major source areas of pollens (e.g., grasslands and open semi-natural land use types; Figure 4A,D) and areas of intensive land use (e.g., built-up and traffic areas; Figure 4B–D) can be identified. Moreover, that the most densely vegetated environments (i.e., forest; Figure 4A) are not the areas with the highest concentrations of grass pollens. This result is to be expected, because grasses do not flourish in forests with closed canopy. On the contrary, the models appear to predict too high grass pollen concentrations for some of the cultivated fields (Figure 4A,B) and managed grass areas (e.g., golf courses). However, the models predict realistically rather low concentrations for extensively managed grass areas—for example, around the airport runways in the middle part of the city of Vantaa (Figure 4).
Discussion
Atmospheric concentration of allergenic pollen is an important public health indicator, the potential of which has not been fully utilized (Ring et al. 2014; WHO 2013). Although the role of pollen in the onset of allergic disease is not well established, exposure to pollen generates symptoms among asthmatics and allergic individuals (Gilmour et al. 2006; Zeldin et al. 2006). Thus, improved prediction of the variability of allergenic pollen concentrations at local scales would be valuable when studying the role of pollen exposure in the development of allergic diseases and sensitization, as well as exacerbations of symptoms among subjects with asthma and allergies. This would also pave the way for improved predictions of the future health impact of climate change through changes in the generation and distribution of pollen.
The LUR models developed in this study are designed for predicting the variability of the breathing-zone level concentrations of allergenic grass pollens in urban environment with very high resolution using multi-source geospatial data. First, this is of importance because physically based dispersion models are not, at least yet, applicable in the exploration of fine-scale spatial variation of pollen concentrations (Sofiev and Bergmann 2012; Sofiev et al. 2013). The main application of the suggested methodology is to assess longer-term (from weekly to seasonal) exposures to pollens and not short-term, high-dose exposures (Figure 2). Second, the measured grass pollen concentrations presented significant intra-urban variation. Consequently, it is necessary to consider also local-scale sources in addition to regional-scale sources or long-distance atmospheric transport when assessing population exposure to grass pollens in urban environments.
Implications for the Assessment of Exposure and Health Effects
Associations between pollen concentrations measured by regional pollen monitoring stations and allergic and asthmatic symptoms and lung function variation have been reported (Caillaud et al. 2014; Delfino et al. 2002). However, the role of long-term exposure to allergenic pollen in the development of asthma and other allergic diseases has not been studied, partly because of the lack of appropriate methods for exposure assessment.
Our findings suggest that allergenic grass pollen concentrations can be estimated with reasonable accuracy using geospatial data variables. Because similar data have been collected for several decades, it may be possible to adapt our method to perform retrospective lifetime exposure assessment (Gulliver et al. 2013) for cohort members based on historical land use data, pollen data, and individual residential history data (Richardson et al. 2013) (Figure 2). In this example, the local variations in pollen concentration are predicted using past environmental conditions (e.g., Landsat satellite images that are available for more than 3 decades), and the regional level of concentration is determined by a permanent, long-term pollen collector located in the study area (e.g., daily pollen observations from the Helsinki metropolitan area have been available since the mid-1970). The presented modeling approach could also be applied in cross-sectional studies comparing the prevalence of asthma-related and allergic symptoms according to residential area or time-activity pattern.
One notable issue was the moderate prediction bias when the models were extrapolated to other periods (i.e., under-prediction of grass pollen concentrations when pollen production of the extrapolation period was lower than in the calibration period). However, this potential bias can be taken into account in retrospective predictions, if the overall concentration differences between different periods are known. The long-term pollen concentration measurements from regional monitoring stations can be used for this purpose.
Data and Methodological Issues
Our findings reinforce the need to identify optimal buffer sizes for each environmental determinant, because appropriate buffers may vary substantially. In addition, our results support the hypotheses that there are large intra-urban variations in grass pollen concentrations, and these heterogeneities are associated with local-scale variations in land use (Skjøth et al. 2013). The remote sensing–based TC greenness variable outperformed land use and land cover variables in this study. In general, remote sensing–based indices have several advantages over the traditional land use variables. Remote sensing data can be acquired over extensive areas and from numerous sources at various spatial, temporal, spectral and radiometric resolutions (Lillesand et al. 2004). More important, remote sensing–based variables can be computed to describe the environmental conditions continuously (i.e., each pixel has a continuous numerical value).
In contrast, the land use variables are dichotomized (i.e., the class is present or not) (Katz and Carey 2014). This property impedes the use of distinct land use/cover classes in predictive modeling settings, because the model will predict equal concentrations in areas without variation in the explanatory variable(s). For example, the wasteland variable describing unmanaged grasslands was, in theory, an ideal variable reflecting the sources of grass pollens. However, the amount of this land use type was low in the study area (Figure 1).
Some data-related and statistical issues should be considered. First, the number of observation sites (n = 16) was limited, which caused challenges in model calibration and evaluation. Robust statistical models usually require dozens of observations (e.g., Hjort et al. 2011; Beelen et al. 2013), and hundreds of sample sites may be optimal in analyzing ecological response variables (Stockwell and Peterson 2002). The use of passive aerosol samplers would make it feasible to acquire data from more sites and thus improve the assessment and prediction of spatial variation in pollen concentrations in intra-urban areas (e.g., Hofmann et al. 2014).
Second, variables describing meteorological conditions were not included into the set of environmental determinants. The primary goal of our model was to predict the spatial variability of pollen concentrations within an urban area: Additional data and a longer sampling period would be required to accurately predict the absolute values of the pollen concentrations. The weather conditions were partially taken into account in the sampling of pollen data (no sampling during rainfall, only daytime sampling) and during the preparation of calibration and evaluation data sets (see Supplemental Material, Table S1). Implicitly, the regular effects of meteorological factors in different land use classes are accounted for during the calibration step. We assumed that all other effects of meteorology would be consistent across the study area, which would make them irrelevant for predicting spatial variation. Moreover, we used data averaged over 2-week periods (27 June–9 July and 10–21 July, respectively) to reduce the impact of short-term temporal variability.
Third, in addition to the sample size–related problems, regression-based statistical methods include several data-based assumptions (Sokal and Rohlf 1995). For example, assumptions of normality, homoscedasticity, and independence of errors are often violated in analyzing complex responses. In this study, we aimed to minimize problems related to regression analysis in compilation and computation of study material [e.g., comprehensive explorative data analysis (Tukey 1977) before multivariate analyses; data not shown] and by using generalization of the least-square linear regression method (i.e., GLM). Moreover, we used widely accepted model calibration and evaluation procedures (e.g., Beelen et al. 2013) and the HP method for considering potential multicollinearity problems in multivariate analysis (Chevan and Sutherland 1991).
For future studies focusing on spatial prediction of pollen concentrations for exposure assessments, we recommend the following steps (Elith and Leathwick 2009; Beelen et al. 2013): a) establishment of a conceptual model that is based on literature and empirical findings, b) compilation of environmental data from different sources and at various scales (remote sensing–based at finer and land use at coarser scales), c) comprehensive statistical and graphical exploration of both response and environmental variable data, and d) substantial evaluation of the generated model. The evaluation should include the assessment of the realism of fitted explanatory variables (e.g., expected signs for regression coefficients), the model’s fit to data, characteristics of residuals, predictive performance on independent test data, and visual/graphical exploration of the predictions.
Conclusions
In this study, we have elaborated the relationship between environmental determinants and atmospheric allergenic pollen concentrations in an urban area. Previous studies have not explored the possibility of predicting the intra-urban variation of grass pollen concentrations, using geospatial data and statistical methods. A novelty of the present work is a comprehensive set of pollen measurements in urban environments, which enables the spatial modeling of urban pollen concentrations. Moreover, we developed the LUR approach by exploring the contributions of different data sources (remote sensing and land use) and scales of explanatory variables.
Based on the results, we draw three main conclusions. First, it is possible to spatially predict the fine-scale variation of grass pollen concentrations across an urban area using the LUR approach. An extensive evaluation of the modeling results is highly important. Moreover, based on the visual exploration of pollen predictions (Figure 4), models should be extrapolated beyond the calibration environment with care. Second, a remote sensing vegetation variable (TC greenness) outperformed land use variables in our study setting. Remote sensing–based indices have several strengths, which highlight their use in modeling and predicting pollen concentrations in human-modified environments. Third, statistically based predictive pollen models could probably be used in retrospective exposure assessment, if residential histories are available and pollen concentrations have been monitored or modeled for the corresponding period. The developed LUR approach demonstrated the possibility of predicting the spatial variability of mean pollen concentrations at breathing-zone level, contrary to urban background or regional scale usually pursued by the existing monitoring and modeling tools.
In the future, exposure assessment studies should not be based solely on (few) roof-level pollen monitoring sites due to the significant intra-urban variation of allergenic pollen concentrations. Instead, it would be highly valuable to combine both local- and regional-scale observations in studying spatially and temporally the relations between environmental determinants and concentrations of allergenic pollen. Moreover, hybrid modeling should be examined that combines physically (i.e., deterministic dispersion modeling) and statistically based models.
Supplemental Material
Acknowledgments
We acknowledge the constructive comments of three anonymous reviewers.
Footnotes
This research was funded by the Research Council for Health, the Academy of Finland [grants 266314, 267675, and 267995 (APTA Consortium)], and the University of Oulu Strategic Funding.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The authors declare they have no actual or potential competing financial interests.
References
- Aina R, Asero R, Ghiani A, Marconi G, Albertini E, Citterio S. Exposure to cadmium-contaminated soils increases allergenicity of Poa annua L. pollen. Allergy. 2010;65:1313–1321. doi: 10.1111/j.1398-9995.2010.02364.x. [DOI] [PubMed] [Google Scholar]
- Aït-Khaled N, Pearce N, Anderson HR, Ellwood P, Montefort S, Shah J, et al. Global map of the prevalence of symptoms of rhinoconjunctivitis in children: the International Study of Asthma and Allergies in Childhood (ISAAC) Phase Three. Allergy. 2009;64:123–148. doi: 10.1111/j.1398-9995.2008.01884.x. [DOI] [PubMed] [Google Scholar]
- Asher MI, Montefort S, Björkstén B, Lai CKW, Strachan DP, Weiland SK, et al. Worldwide time trends in the prevalence of symptoms of asthma, allergic rhinoconjunctivitis, and eczema in childhood: ISAAC Phases One and Three repeat multicountry cross-sectional surveys. Lancet. 2006;368:733–743. doi: 10.1016/S0140-6736(06)69283-0. [DOI] [PubMed] [Google Scholar]
- Beelen RM, Hoek G, Vienneau D, Eeftens M, Dimakopoulou K, Pedeli X, et al. Development of NO2 and NOx land use regression models for estimating air pollution exposure in 36 study areas in Europe – the ESCAPE project. Atmos Environ. 2013;72:10–23. [Google Scholar]
- Beggs PJ. Impacts of climate change on aeroallergens: past and future. Clin Exp Allergy. 2004;34:1507–1513. doi: 10.1111/j.1365-2222.2004.02061.x. [DOI] [PubMed] [Google Scholar]
- Beggs PJ, Bambrick HJ. 2005. Is the global rise of asthma an early impact of anthropogenic climate change? Environ Health Perspect 113 915 919, doi: 10.1289/ehp.7724 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blomme K, Tomassen P, Lapeere H, Huvenne W, Bonny M, Acke F, et al. Prevalence of allergic sensitization versus allergic rhinitis symptoms in an unselected population. Int Arch Allergy Immunol. 2013;160:200–207. doi: 10.1159/000339853. [DOI] [PubMed] [Google Scholar]
- Boutin-Forzano S, Moreau D, Kalaboka S, Gay E, Bonnefoy X, Carrozzi L, et al. Reported prevalence and co-morbidity of asthma, chronic bronchitis and emphysema: a pan-European estimation. Int J Tuberc Lung Dis. 2007;11:695–702. [PubMed] [Google Scholar]
- Burnham KP, Anderson DR. Multimodel inference understanding AIC and BIC in model selection. Sociol Methods Res. 2004;33:261–304. [Google Scholar]
- Caillaud D, Thibaudon M, Martin S, Ségala C, Besancenot JP, Clot B, et al. Short-term effects of airborne ragweed pollen on clinical symptoms of hay fever in a panel of 30 patients. J Investig Allergol Clin Immunol. 2014;24:249–256. [PubMed] [Google Scholar]
- Cecchi L, D’Amato G, Ayres JG, Galan C, Forastiere F, Forsberg B, et al. Projections of the effects of climate change on allergic asthma: the contribution of aerobiology. Allergy. 2010;65:1073–1081. doi: 10.1111/j.1398-9995.2010.02423.x. [DOI] [PubMed] [Google Scholar]
- Chevan A, Sutherland M. Hierarchical partitioning. Am Stat. 1991;45:90–96. [Google Scholar]
- D’Amato G, Cecchi L, Bonini S, Nunes C, Annesi-Maesano I, Behrendt H, et al. Allergenic pollen and pollen allergy in Europe. Allergy. 2007;62:976–990. doi: 10.1111/j.1398-9995.2007.01393.x. [DOI] [PubMed] [Google Scholar]
- D’Amato G, Cecchi L, D’Amato M, Annesi-Maesano I. Climate change and respiratory diseases. Eur Respir Rev. 2014;23:161–169. doi: 10.1183/09059180.00001714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delfino RJ, Zeiger RS, Seltzer JM, Street DH, McLaren CE. Association of asthma symptoms with peak particulate air pollution and effect modification by anti-inflammatory medication use. Environ Health Perspect. 2002;110:A607–A617. doi: 10.1289/ehp.021100607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elith J, Leathwick JR. Species distribution models: ecological explanation and prediction across space and time. Annu Rev Ecol Evol Syst. 2009;40:677–697. [Google Scholar]
- Finnish Environmental Institute. Helsinki, Finland: Finnish Environmental Institute; 2009. CLC2006-Finland. Final technical report. [Google Scholar]
- Gilmour MI, Jaakkola MS, London SJ, Nel AE, Rogers CA. 2006. How exposure to environmental tobacco smoke, outdoor air pollutants, and increased pollen burdens influences the incidence of asthma. Environ Health Perspect 114 627 633, doi: 10.1289/ehp.8380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gulliver J, de Hoogh K, Hansell A, Vienneau D. Development and back-extrapolation of NO2 land use regression models for historic exposure assessment in Great Britain. Environ Sci Technol. 2013;47:7804–7811. doi: 10.1021/es4008849. [DOI] [PubMed] [Google Scholar]
- Haberle SG, Bowman DM, Newnham RM, Johnston FH, Beggs PJ, Buters J, et al. 2014. The macroecology of airborne pollen in Australian and New Zealand urban areas. PLoS ONE 9 e97925, doi: 10.1371/journal.pone.0097925 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henneberger PK, Redlich CA, Callahan DB, Harber P, Lemière C, Martin J, et al. An official American Thoracic Society statement: work-exacerbated asthma. Am J Respir Crit Care Med. 2011;184:368–378. doi: 10.1164/rccm.812011ST. [DOI] [PubMed] [Google Scholar]
- Hjort J, Suomi J, Käyhkö J. Spatial prediction of urban–rural temperatures using statistical methods. Theor Appl Climatol. 2011;106:139–152. [Google Scholar]
- Hofmann F, Otto M, Wosniok W. 2014. Maize pollen deposition in relation to distance from the nearest pollen source under common cultivation - results of 10 years of monitoring (2001 to 2010). Environ Sci Eur 26 24, doi: 10.1186/s12302-014-0024-3 [DOI] [Google Scholar]
- Jaakkola MS, Nordman H, Piipari R, Uitti J, Laitinen J, Karjalainen A, et al. Indoor dampness and molds and development of adult-onset asthma: a population-based incident case–control study. Environ Health Perspect. 2002;110:543–547. doi: 10.1289/ehp.02110543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katz DS, Carey TS. Heterogeneity in ragweed pollen exposure is determined by plant composition at small spatial scales. Sci Total Environ. 2014;485:435–440. doi: 10.1016/j.scitotenv.2014.03.099. [DOI] [PubMed] [Google Scholar]
- Kukkonen J, Olsson T, Schultz DM, Baklanov A, Klein T, Miranda AI, et al. A review of operational, regional-scale, chemical weather forecasting models in Europe. Atmos Chem Phys. 2012;12:1–87. [Google Scholar]
- Lillesand TM, Kiefer RW, Chipman JW. New York: John Wiley & Sons; 2004. Remote Sensing and Image Interpretation. 5th ed. [Google Scholar]
- Lovasi GS, O’Neil-Dunne JP, Lu JW, Sheehan D, Perzanowski MS, MacFaden SW, et al. 2013. Urban tree canopy and asthma, wheeze, rhinitis, and allergic sensitization to tree pollen in a New York City birth cohort. Environ Health Perspect 121 494 500, doi: 10.1289/ehp.1205513 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Majd A, Chehregani A, Moin M, Gholami M, Kohno S, Nabe T, et al. The effects of air pollution on structures, proteins and alleregenicity of pollen grains. Aerobiologia. 2004;20:111–118. [Google Scholar]
- Majkowska-Wojciechowska B, Pełka J, Korzon L, Kozłowska A, Kaczała M, Jarzebska M, et al. Prevalence of allergy, patterns of allergic sensitization and allergy risk factors in rural and urban children. Allergy. 2007;62:1044–1150. doi: 10.1111/j.1398-9995.2007.01457.x. [DOI] [PubMed] [Google Scholar]
- McCullagh P, Nelder JA. New York: Chapman and Hall; 1989. Generalized Linear Models. 2nd ed. [Google Scholar]
- O’Rourke MK, Lebowitz MD. A comparison of regional atmospheric pollen with pollen collected at and near homes. Grana. 1984;23:55–64. [Google Scholar]
- Pallasaho P, Juusela M, Lindqvist A, Sovijärvi A, Lundbäck B, Rönmark E. Allergic rhinoconjunctivitis doubles the risk for incident asthma – results from a population study in Helsinki, Finland. Respir Med. 2011;105:1449–1456. doi: 10.1016/j.rmed.2011.04.013. [DOI] [PubMed] [Google Scholar]
- Peel RG, Hertel O, Smith M, Kennedy R. Personal exposure to grass pollen: relating inhaled dose to background concentration. Ann Allergy Asthma Immunol. 2013;111:548–554. doi: 10.1016/j.anai.2013.09.002. [DOI] [PubMed] [Google Scholar]
- Pirinen P, Simola H, Aalto J, Kaukoranta JP, Karlsson P, Ruuhela R. Reports 2012: 1. Helsinki, Finland:Finnish Meteorological Institute; 2012. Climatological Statistics of Finland 1981–2010. [Google Scholar]
- Priftis KN, Anthracopoulos MB, Nikolaou-Papanagiotou A, Matziou V, Paliatsos AG, Tzavelas G, et al. Increased sensitization in urban vs. rural environment – rural protection or an urban living effect? Pediatr Allergy Immunol. 2007;18:209–216. doi: 10.1111/j.1399-3038.2006.00514.x. [DOI] [PubMed] [Google Scholar]
- R Core Team. Vienna, Austria: R Foundation for Statistical Computing; 2015. R: A Language and Environment for Statistical Computing. Available: http;//www.R-project.org [accessed 20 January 2015] [Google Scholar]
- Rantio-Lehtimäki A, Helander ML, Karhu K. Does cutting of mugwort stands affect airborne pollen concentrations? Allergy. 1992;47:388–390. doi: 10.1111/j.1398-9995.1992.tb02077.x. [DOI] [PubMed] [Google Scholar]
- Richardson DB, Volkow ND, Kwan MP, Kaplan RM, Goodchild MF, Croyle RT. Medicine. Spatial turn in health research. Science. 2013;339:1390–1392. doi: 10.1126/science.1232257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ring J, Akdis C, Lauener R, Schäppi G, Traidl-Hoffmann C, Akdis M, et al. Global Allergy Forum and Second Davos Declaration 2013 Allergy: barriers to cure – challenges and actions to be taken. Allergy. 2014;69:978–982. doi: 10.1111/all.12406. [DOI] [PubMed] [Google Scholar]
- Singer BD, Ziska LH, Frenz DA, Gebhard DE, Straka JG. Increasing Amb a 1 content in common ragweed (Ambrosia artemisiifolia) pollen as a function of rising atmospheric CO2 concentration. Funct Plant Biol. 2005;32:667–670. doi: 10.1071/FP05039. [DOI] [PubMed] [Google Scholar]
- Skjøth C, Ørby PV, Becker T, Geels C, Schlünssen V, Sigsgaard T, et al. Identifying urban sources as cause of elevated grass pollen concentrations using GIS and remote sensing. Biogeosciences. 2013;10:541–554. [Google Scholar]
- Sofiev M, Bergmann KC, eds. Dordrecht, Netherlands: Springer; 2012. Allergenic Pollen. A Review of the Production, Release, Distribution and Health Impacts. pp. 127–161. [Google Scholar]
- Sofiev M, Siljamo P, Ranta H, Linkosalo T, Jaeger S, Rasmussen A, et al. A numerical model of birch pollen emission and dispersion in the atmosphere. Description of the emission module. Int J Biometeorol. 2013;57:45–58. doi: 10.1007/s00484-012-0532-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sokal RR, Rohlf FJ. New York: W.H. Freeman; 1995. Biometry: The Principles and Practice of Statistics in Biological Research. 3rd ed. [Google Scholar]
- Sterling DA, Lewis RD. Pollen and fungal spores indoor and outdoor of mobile homes. Ann Allergy Asthma Immunol. 1998;80:279–285. doi: 10.1016/S1081-1206(10)62971-7. [DOI] [PubMed] [Google Scholar]
- Stockwell DR, Peterson AT. Effects of sample size on accuracy of species distribution models. Ecol Modell. 2002;148:1–13. [Google Scholar]
- Taylor PE, Jacobson KW, House JM, Glovsky MM. Links between pollen, atopy and the asthma epidemic. Int Arch Allergy Immunol. 2007;144:162–170. doi: 10.1159/000103230. [DOI] [PubMed] [Google Scholar]
- Tukey JW. Reading, MS: Addison-Wesley; 1977. Exploratory Data Analysis. [Google Scholar]
- White JF, Bernstein DI. Key pollen allergens in North America. Ann Allergy Asthma Immunol. 2003;91:425–435. doi: 10.1016/S1081-1206(10)61509-8. [DOI] [PubMed] [Google Scholar]
- WHO (World Health Organization) Copenhagen, Denmark: WHO Regional Office for Europe; 2013. Review of Evidence on Health Aspects of Air Pollution—REVIHAAP. First Results. [PubMed] [Google Scholar]
- Zeldin DC, Eggleston P, Chapman M, Piedimonte G, Renz H, Peden D. 2006. How exposures to biologics influence the induction and incidence of asthma. Environ Health Perspect 114 620 626, doi: 10.1289/ehp.8379 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziska LH, Gebhard DE, Frenz DA, Faulkner S, Singer BD, Straka JG. Cities as harbingers of climate change: common ragweed, urbanization, and public health. J Allergy Clin Immunol. 2003;111:290–295. doi: 10.1067/mai.2003.53. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.